NAME
Parse::Path - Parser for paths
SYNOPSIS
use v5.10;
use Parse::Path;
my $path = Parse::Path->new(
path => 'gophers[0].food.count',
style => 'DZIL', # default
);
my $step = $path->shift; # { key => 'gophers', ... }
say $path->as_string;
$path->push($path, '[2]');
foreach my $p (@$path) {
say sprintf('%-6s %s --> %s', @$p{qw(type step key)});
}
DESCRIPTION
Parse::Path is, well, a parser for paths. File paths, object paths, URLs... A path is any string that can be translated into hash/array keys. Unlike modules like File::Spec or File::Basename, which are designed for interacting with file system paths in a portable manner, Parse::Path is designed for interacting with any path, filesystem or otherwise, at the lowest level possible.
Paths are split out into steps. Internally, these are stored as "step hashes". However, there is some exposure to these hashes as both input and output, so we'll describe them here:
{
type => 'HASH', # must be either HASH or ARRAY
key => 'foo bar', # as it would be represented as a key
step => '"foo bar"', # as it would be represented in a path
pos => 'X+1', # used to determine depth
}
For the purposes of this manual, a "step" refers to a step hash unless otherwise specified.
CONSTRUCTOR
my $path = Parse::Path->new(
path => $path, # required
style => 'DZIL', # default
);
Creates a new path object. Parse::Path is really just a dispatcher to other Parse::Path modules, but it serves as a common API for all of them.
Accepts the following arguments:
path
path => 'gophers[0].food.count'
String used to create path. Can also be another Parse::Path object, a step, an array of step hashes, an array of paths, or whatever makes sense.
This parameter is required.
style
style => 'File::Unix'
style => '=MyApp::Parse::Path::Foobar'
Class used to create the new path object. With a = prefix, it will use that as the full class. Otherwise, the class will be interpreted as Parse::Path::$class.
Default is DZIL.
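The prefix rule above can be sketched in plain Perl. This is only an illustration of the naming rule as described; resolve_style_class is a hypothetical helper, not part of the Parse::Path API, and the module's own dispatch code may differ.

```perl
use strict;
use warnings;

# Hypothetical helper (not part of Parse::Path): maps a "style" argument
# to a class name using the "=" prefix rule described above.
sub resolve_style_class {
    my ($style) = @_;

    # A leading "=" means "use the rest as the full class name".
    return substr($style, 1) if index($style, '=') == 0;

    # Otherwise the style is taken relative to the Parse::Path namespace.
    return "Parse::Path::$style";
}

print resolve_style_class('File::Unix'), "\n";
print resolve_style_class('=MyApp::Parse::Path::Foobar'), "\n";
```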
auto_normalize
auto_normalize => 1
my $will_normalize = $path->auto_normalize;
$path->auto_normalize(1);
If on, calls "normalize" after any new step has been added (i.e., by new, "unshift", "push", or "splice").
Default is off. This attribute is read-write.
auto_cleanup
auto_cleanup => 1
my $will_cleanup = $path->auto_cleanup;
$path->auto_cleanup(1);
If on, calls "cleanup" after any new step has been added (i.e., by new, "unshift", "push", or "splice").
Default is off. This attribute is read-write.
METHODS
step_count
my $count = $path->step_count;
Returns the number of steps in the path. Unlike "depth", negative-seeking steps (like .. for most file-based paths) will not lower the step count.
depth
my $depth = $path->depth;
Returns path depth. In most cases, this is the number of steps to the path, a la "step_count". However, relative paths might make this lower, or even negative. For example:
my $path = Parse::Path->new(
path => '../../../foo/bar.txt',
style => 'File::Unix',
);
say $path->step_count; # 5
say $path->depth; # -1
Despite the similarity to the pos value of a step hash, this method doesn't tell you whether it's relative or absolute. Use "is_absolute" for that.
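The difference between the two numbers can be reproduced with a small stand-alone sketch. It assumes a simplified Unix-like relative path where every named step counts as +1 and every .. step counts as -1; count_steps_and_depth is a hypothetical helper and does not use Parse::Path or its blueprint rules.

```perl
use strict;
use warnings;

# Hypothetical helper (not part of Parse::Path): returns the step count
# and the depth of a simple relative Unix-like path. Assumes only named
# steps and ".." steps; absolute paths and "." are not modelled.
sub count_steps_and_depth {
    my ($path_str) = @_;

    my @steps = grep { length } split m{/}, $path_str;

    my $depth = 0;
    $depth += ($_ eq '..' ? -1 : 1) for @steps;

    return (scalar @steps, $depth);
}

my ($count, $depth) = count_steps_and_depth('../../../foo/bar.txt');
print "step_count=$count depth=$depth\n";    # step_count=5 depth=-1
```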
is_absolute
my $is_absolute = $path->is_absolute;
Returns a true value if this path is absolute. Hint: most paths are relative. For example, if the following paths were File::Unix paths:
foo/bar.txt # relative
../bar.txt # relative
bar.txt # relative
/home/foo/bar.txt # absolute
/home/../bar.txt # absolute (even prior to cleanup)
as_string
my $path_str = $path->as_string;
Returns the string form of the path. This involves taking the individual step strings of the path and placing the delimiters in the right place.
as_array
my $step_hashes = $path->as_array;
Returns the full path as an arrayref of step hashes. The steps are cloned for integrity. If you want a simpler representation of the path, consider "as_string".
shift
my $step_hash = $path->shift;
Works just like the Perl version. Removes a step from the beginning of the path and returns it. The step is cloned for integrity.
pop
my $step_hash = $path->pop;
Works just like the Perl version. Removes a step from the end of the path and returns it. The step is cloned for integrity.
unshift
my $count = $path->unshift($step_or_path);
Works just like the Perl version. Adds a step (or other path-like thingy) to the beginning of the path and returns the number of new steps prepended. Will also call "cleanup" afterwards, if "auto_cleanup" is enabled.
push
my $count = $path->push($step_or_path);
Works just like the Perl version. Adds a step (or other path-like thingy) to the end of the path and returns the number of new steps appended. Will also call "cleanup" afterwards, if "auto_cleanup" is enabled.
splice
my @step_hashes = $path->splice($offset, $length, $step_or_path);
my @step_hashes = $path->splice($offset, $length);
my @step_hashes = $path->splice($offset);
my $last_step_hash = $path->splice($offset);
Works just like the Perl version. Removes elements designated by the offset and length, and replaces them with the new step/path. The steps are cloned for integrity. Returns the steps removed in list context, or the last step removed in scalar context. Will also call "cleanup" afterwards, if "auto_cleanup" is enabled.
clear
$path->clear;
Clears out the path.
Returns itself for chaining.
replace
$path->replace($step_or_path);
Replaces the path with a new one. Basically just sugar for "clear" + "push". Unlike the argument form of "clone", this retains the same object and just replaces the internal path.
Returns the number of new steps created.
clone
my $same_path = $path->clone;
my $similar_path = $path->clone($new_path);
Clones the path object and returns it.
Optionally takes another path (object or string or whatever) and puts that path into the clone. This is handy if you want to use the same options and class, but just want a different path.
normalize
$path->normalize;
Normalizes the steps in the path. This ensures that the keys of the step hash and the steps will be the same thing. Or to put it another way, this will make a "round trip" of string-to-path-to-string work commutatively. For example, if the following paths were DZIL paths:
'"Oh, but it can..." said the spider'.[0].value # Before normalize
"\"Oh, but it can...\" said the spider"[0].value # After normalize
a.b...c[0].""."".'' # Before normalize
a.b.""."".c[0]."".""."" # After normalize
Returns itself for chaining.
cleanup
$path->cleanup;
Cleans up the path. Think of this in terms of cleanup within Path::Class. This will remove unnecessary relative steps, and try as best as possible to present an absolute path, or at least one that progresses in a sequential manner. For example, if the following paths were File::Unix paths:
/foo/baz/../foo.txt # /foo/foo.txt
/foo//baz/./foo.txt # /foo/baz/foo.txt
../../foo/../bar.txt # ../../bar.txt
./command # command
Returns itself for chaining.
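The four File::Unix examples above follow a familiar pattern that can be sketched in plain Perl. This is a simplified string-based approximation, not the module's implementation: cleanup_unix_path is a hypothetical helper, and the choice to drop a .. that would climb above the root of an absolute path is an assumption.

```perl
use strict;
use warnings;

# Hypothetical helper (not part of Parse::Path): approximates the cleanup
# behaviour shown above for Unix-style path strings.
sub cleanup_unix_path {
    my ($path) = @_;
    my $is_absolute = $path =~ m{^/} ? 1 : 0;

    my @out;
    for my $step (split m{/+}, $path) {
        # Blank steps and "." never change the position.
        next if $step eq '' || $step eq '.';

        if ($step eq '..') {
            if (@out && $out[-1] ne '..') {
                pop @out;            # cancel out the previous named step
            }
            elsif (!$is_absolute) {
                push @out, '..';     # keep leading ".." on relative paths
            }
            # Assumption: ".." above the root of an absolute path is dropped.
        }
        else {
            push @out, $step;
        }
    }

    return ($is_absolute ? '/' : '') . join('/', @out);
}

for my $example (
    '/foo/baz/../foo.txt', '/foo//baz/./foo.txt',
    '../../foo/../bar.txt', './command',
) {
    print "$example -> ", cleanup_unix_path($example), "\n";
}
```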
UTILITY METHODS
These step conversion methods are available to use, but are somewhat internal, so they might be subject to change. In most cases, you can use the more public methods to achieve the same goals.
key2hash
my $step_hash = $path->key2hash($key, $type, $pos);
my $step_hash = $path->key2hash($key, $type);
Figures out the missing pieces of a key/type pair, and returns a complete four-key step hash. The "normalize" method works by throwing away the existing step and using this method.
Since pos translation works by using both step+delimiter, and key2hash doesn't have access to the delimiter, it's more accurate to pass the pos value than leave it out.
path_str2array
my $path_array = $path->path_str2array($path_str);
Converts a path string into a path array (of step hashes).
shift_path_str
my $step_hash = $self->shift_path_str(\$path_str);
Removes a step from the beginning of the path string, and returns a complete four-key step hash. This is the workhorse for most of Parse::Path's use cases.
blueprint
my $data = $self->blueprint->{$blueprint_key};
Provides access to the blueprint for parsing the path style. More information about what this hashref contains can be found in the role documentation (Parse::Path::Role::Path).
Cloned for sanity. Create your own Path class if you need to change the specs.
OVERLOADS
In addition to its standard methods, Parse::Path also has several overloads that are useful:
String Concatenation (.=)
$path .= 'q.r.s[1]';
$path .= [qw( q r s[1] )];
Modifies the path by calling "push" on the RHS thing.
Numeric Comparisons
$pathA < $pathB
$pathA <= $pathB
$pathB > $pathA
$pathB >= $pathA
$pathA == $pathA
$pathA != $pathB
$pathA <=> $pathB
Uses "depth" for the numeric comparison. Still works in cases of a non-path on one side.
String Comparisons
$pathA lt $pathB
$pathA le $pathB
$pathB gt $pathA
$pathB ge $pathA
$pathA eq $pathA
$pathA ne $pathB
$pathA cmp $pathB
If both sides are P:P objects, each key of the path is compared separately until a difference is found. This effectively bypasses delimiters as an obstacle for path comparisons. If a step is found to be an ARRAY type on both sides, a numeric comparison (<=>) is done. Mismatched step types are allowed (and checked with cmp), so sanity check your paths if this isn't desired.
If either side is a non-path, this will fall back to a simple path string comparison.
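The step-wise rule for two path objects can be sketched with plain step hashes. compare_steps is a hypothetical helper, not the module's overload code, and the tie-break on step count when one path is a prefix of the other is an assumption that the text above does not specify.

```perl
use strict;
use warnings;

# Hypothetical helper (not part of Parse::Path): compares two arrayrefs
# of step hashes key by key, as described above.
sub compare_steps {
    my ($left, $right) = @_;
    my $shared = @$left < @$right ? scalar @$left : scalar @$right;

    for my $i (0 .. $shared - 1) {
        my ($l, $r) = ($left->[$i], $right->[$i]);

        # ARRAY steps on both sides compare numerically; everything else
        # (including mismatched types) compares as strings.
        my $both_arrays = $l->{type} eq 'ARRAY' && $r->{type} eq 'ARRAY';
        my $result = $both_arrays
            ? $l->{key} <=> $r->{key}
            : $l->{key} cmp $r->{key};

        return $result if $result;
    }

    # Assumption: if all shared steps match, the shorter path sorts first.
    return scalar(@$left) <=> scalar(@$right);
}

# Step hashes for gophers[2].food and gophers[10].food (type and key only).
my @path_a = (
    { type => 'HASH',  key => 'gophers' },
    { type => 'ARRAY', key => 2 },
    { type => 'HASH',  key => 'food' },
);
my @path_b = (
    { type => 'HASH',  key => 'gophers' },
    { type => 'ARRAY', key => 10 },
    { type => 'HASH',  key => 'food' },
);

print compare_steps(\@path_a, \@path_b), "\n";    # -1
```

A plain string comparison of the two path strings would order these the other way round, since "1" sorts before "2" character by character.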
Other overloads
!$path # !$path->step_count (i.e., true if the path is empty)
"$path" # $path->as_string
0+$path # $path->depth
$$path # $path->as_string
@$path # @{ $path->as_array }
These all work pretty much as you'd expect them to.
CONVERSION
Different path styles can be used with ease. Convert Unix paths to Windows paths? No problem:
my $unix_path = Parse::Path->new(
path => '/root/tmp/file.txt',
style => 'File::Unix'
);
my $win_path = Parse::Path->new(
path => $unix_path,
style => 'File::Win32',
);
$win_path->as_string; # \root\tmp\file.txt
$win_path->volume('C');
$win_path->as_string; # C:\root\tmp\file.txt
$win_path->splice(-1, 1, '..\foobar.gif');
$win_path->cleanup->as_string; # C:\root\foobar.gif
$unix_path->replace($win_path);
$unix_path->as_string; # /root/foobar.gif
CAVEATS
Absolute paths and step removal
Steps can be removed from the path as needed, but keep in mind that "cleanup" doesn't get called by methods like "shift", even if "auto_cleanup" is set. This doesn't make a difference on absolute paths, as the depth their steps are given is permanent. Appending two absolute paths may result in steps cancelling each other out:
my $path = Parse::Path->new(
path => '/root/tmp/file.txt',
style => 'File::Unix',
auto_cleanup => 1,
);
$path->shift; # remove the blank root
$path->shift; # now a dangling 'tmp/file.txt', tied to position 2
$path->unshift('/home/bbyrd');
$path->as_string; # /home/tmp/file.txt
This problem can be sidestepped by using the string forms:
$path->shift;
$path->shift; # tmp/file.txt
$path->replace( [ '/home/bbyrd', $path->as_string ] );
$path->as_string; # /home/bbyrd/tmp/file.txt
This may be fixed in a later release.
Normalization of splits
While "auto_normalize" controls normalization of steps, delimiter normalization is still automatic. For example:
my $path = Parse::Path->new(
path => 'foo//////bar.txt',
style => 'File::Unix',
);
say $path->as_string; # foo/bar.txt
This is because delimiters are not actually stored anywhere after parsing. The "as_string" method takes the hash steps and re-adds the delimiters, per rules on the blueprint of the path class. (See "delimiter_placement" in Parse::Path::Role::Path.)
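The effect is easy to reproduce without the module. In this minimal sketch, roundtrip_unix is a hypothetical helper that keeps only the keys and re-adds a single / between them; the real placement rules live in the path class blueprint and are more involved.

```perl
use strict;
use warnings;

# Hypothetical helper (not part of Parse::Path): parses a relative
# Unix-like path into keys, discards the delimiters, then rebuilds the
# string with exactly one "/" between keys.
sub roundtrip_unix {
    my ($path_str) = @_;
    my @keys = grep { length } split m{/+}, $path_str;
    return join '/', @keys;
}

print roundtrip_unix('foo//////bar.txt'), "\n";    # foo/bar.txt
```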
Sparse arrays and memory usage
Since arrays within paths are based on indexes, there's a potential security issue with large indexes causing abnormal memory usage with certain modules that would use these paths. In Perl, these two arrays would have drastically different memory footprints:
my @small;
$small[0] = 1;
my @large;
$large[999999] = 1;
This can be mitigated by making sure the Path style you use will limit the total digits for array indexes. Parse::Path handles this on all of its paths, but it's something to be aware of if you create your own path classes.
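For a custom path class, the digit limit might look something like the following sketch. is_acceptable_index and its default of five digits are hypothetical; the limit Parse::Path itself applies is not stated here.

```perl
use strict;
use warnings;

# Hypothetical helper (not part of Parse::Path): accepts an array index
# only if it is a non-negative integer with a bounded number of digits.
sub is_acceptable_index {
    my ($index, $max_digits) = @_;
    $max_digits = 5 unless defined $max_digits;    # assumed default

    return 0 unless $index =~ /\A[0-9]+\z/;
    return length($index) <= $max_digits ? 1 : 0;
}

for my $index (0, 12345, 999999, '-1') {
    printf "%-7s %s\n", $index, is_acceptable_index($index) ? 'ok' : 'rejected';
}
```

With the assumed limit of five digits, the index 999999 from the example above would be rejected before any array is ever extended.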
SEE ALSO
Data::SplitSerializer - Uses this module for path parsing
AVAILABILITY
The project homepage is https://github.com/SineSwiper/Parse-Path/wiki.
The latest version of this module is available from the Comprehensive Perl Archive Network (CPAN). Visit http://www.perl.com/CPAN/ to find a CPAN site near you, or see https://metacpan.org/module/Parse::Path/.
SUPPORT
Internet Relay Chat
You can get live help by using IRC (Internet Relay Chat). If you don't know what IRC is, please read this excellent guide: http://en.wikipedia.org/wiki/Internet_Relay_Chat. Please be courteous and patient when talking to us, as we might be busy or sleeping! You can join these networks/channels and get help:
irc.perl.org
You can connect to the server at 'irc.perl.org' and talk to this person for help: SineSwiper.
Bugs / Feature Requests
Please report any bugs or feature requests via https://github.com/SineSwiper/Parse-Path/issues.
AUTHOR
Brendan Byrd <bbyrd@cpan.org>
COPYRIGHT AND LICENSE
This software is Copyright (c) 2013 by Brendan Byrd.
This is free software, licensed under:
The Artistic License 2.0 (GPL Compatible)