TITLE

DRAFT: Synopsis 16: IO / Name Services

AUTHORS

Largely, the authors of the related Perl 5 docs.
Larry Wall <larry@wall.org>
Mark Stosberg <mark@summersault.com>
Tim Nelson <wayland@wayland.id.au>
Daniel Ruoso <daniel@ruoso.com>

VERSION

Created: 12 Sep 2006

Last Modified: 19 Nov 2009
Version: 23

This is a draft document. Many of these functions will work as in Perl 5, except we're trying to rationalize everything into roles. For now you can assume most of the important functions will automatically be in the * namespace. However, with IO operations in particular, many of them are really methods on an IO handle, and if there is a corresponding global function, it's merely an exported version of the method.

IO

Overridable IO handles

In Perl 6, there are the standard IO handles, and any number of overriding inner filehandles for the same symbol.

The standard handles are our old familiar friends (with new names). Standard input changed from STDIN to $*IN, standard output changed from STDOUT to $*OUT, and standard error changed from STDERR to $*ERR. In Perl 6 these symbols represent more of a concept than a given filehandle, since the meaning is contextually determined. The process's version of these handles live in the PROCESS:: namespace, which is more global than the per-interpreter GLOBAL:: namespace.

When no explicit filehandle is used, the standard IO operators are defined in terms of the dynamic variables. So the print function prints to $*OUT, while warn warns to $*ERR. The lines() term inputs from $*ARGFILES which defaults to $*IN in the absence of any filenames. So any given dynamic scope (interpreter, thread, function or method call) may redefine the current meaning of any of those filehandles within the dynamic scope of itself and of its called routines.

So to put it another way, when you write something like

say "Howdy, world!"

the say function looks for the current meaning of $*OUT, and takes the closest definition it can find in its callers. If none of the callers have overridden the definition, it looks in the interpreter's GLOBAL namespace. If the interpreter hasn't overridden the meaning, it takes the meaning from PROCESS. In essence, any dynamic scope in Perl 6 is allowed to do IO redirection much like a Unix shell does with its subprocesses, albeit with a different syntax:

{
    temp $*OUT = open $newfile, :w;
    foo() # all stdout goes to $newfile
}
# stdout reverts to outer scope's definition

Roles and Classes

The roles and classes that define most of the functionality for IO are defined in S32-setting-library/IO.pod. The main functions used are listed in S29 with references to S32-setting-library/IO.pod.

Special Quoting Syntax

The use of filenames requires a special quoting syntax. It works as follows:

qp{/path/to/file}
q:p{/path/to/file}

Both of the above result in the same thing.

The quote characters can be any of the usual ones, although / is probably a bad choice for filenames.

The code shown above returns a Path object (or a descendant thereof).

Naturally you can also ask for interpolation in filenames:

qp:qq{$directory/$file}
qq:p{$directory/$file}

There are a number of special adverbs that can be applied to the file quoting operator. Most of these are filesystem-specific. They confine what can be included in a filename.

Any path that starts with a "/" is considered an absolute path, otherwise the path is considered relative.

When creating a path with qp{}, the Path.Encoding attribute is set to $?ENC, unless the :bin modifier (see below) is used.

Default constraints

The default constraints can be set with the "use path" pragma, for example:

use path :posix;
use path :modern;
use path :local;

The default when in strict mode is "use path :posix", whereas the default in lax mode is "use path :local".

:posix constraints

The :modern set of constraints paths are portable POSIX paths (see POSIX.1-2008 sections 4.7 and 3.276). If platform/filesystem specific behavior is needed, specific constraints should be applied as needed (see below).

The default constraints are to only allow "/" as separator and only allows portable POSIX filenames. That means A-Z, a-z, 0-9, <period>, <underscore>, and <hyphen>. Additionally, hyphen may not be the first character in the path.

Any path that starts with a "/" is considered an absolute path, otherwise the path is considered relative.

:portable

In addition to the POSIX constraints above, the path should fit into 8 characters, followed by a full stop, and then three more characters. Only the one full stop should appear in the filename. Additionally, no path may be longer than 64 characters.

:local

This is :win on a Windows platform, :unix on a Unix platform, etc. Note that this is specifically not portable between platforms with different constraint sets.

:modern constraints

The :modern set of constraints are the same as :posix, except that they may contain any UTF-8 character, rather than just those listed.

:win constraints

We allow Windows style paths so converting and maintaining code on this platform is not a pain.

my Path $path = p:win{C:\Program Files\MS Access\file.file};

Note that this specifically excludes the backslash quoting usually used with q{}.

:unix constraints

For Unix specific behavior we have a p:unix{} literal. Here the only limits are what is defined by the locale and the filesystem type. So we won't be able to write full Unicode if locale is set to Latin1.

my Path $path = p:unix{/usr/src/bla/myfile?:%.file};

:bin constraints (no constraints at all)

If the above are causing problems, p:bin{} can be used as no checking is done here. However, this leaves the Path.Encoding attribute undefined, which means that certain features of Path will remain unavailable unless this attribute is set.

Other constraints

It is expected that other sets of constraints, such as VMS, DOS, and old-style Mac will be provided by modules.

$*CWD

The global variable $*CWD is a Path object with certain special properties, mainly:

* It must check whether the path exists before changing its value
* It can only be assigned an absolute path; if you want to make relative 
  changes, use the Array interface

$*CWD is specific to the current thread, unlike %*ENV<PATH> or the native getcwd/chdir path, which are both usually process-wide.

The variable is used in at least these cases:

* When a Path object is created, if the string it is given is not an
  absolute path, then $*CWD is prepended to it.
* When a subprocess is executed, it is executed with the current
  thread's $*CWD as its working directory.

Name Services

User role

role User {
    has $username; # Username (some descendants(?) may want to implement a real $name)
    has $id;  # User ID
    has $dir; # Home directory for files
}
new
method User new($Username?, $UID?) {...}

Creates a new User object, fetching the information either by username or user ID.

write
method write() {...}

Tries to write the current User object to the user database. This may well fail.

Str

When converted to a Str, returns $username.

Num

When converted to a Num, returns $uid.

OS::Unix::User role

role OS::Unix::User does User {
    has $password;
    has $gid;
    has $gecos;
    has $shell;
}

All the information is naturally fetched from the system via getpwuid, getpwnam, or the like.

Group role

role Group {
    has $name;
    has $id;
    has @members;
}
new
method Group new(:$Name, :$ID);
write
method write();

Tries to write the group entry into the system group database.

OS::Unix::NameServices role

The NameServices role has a bunch of functions that between them will return the whole Name Services database between them, as lists of objects. The lists are specifically intended to be lazy.

role NameServices {
    method List of User     users()     {...} # getpwent, setpwent, endpwent
    method List of Group    groups()    {...} # getgrent, setgrent, endgrent
    method List of Service  services()  {...} # getservent, setservent, endservent
    method List of Protocol protocols() {...} # getprotoent, setprotoent, endprotoent
    method List of Network  networks()  {...} # getnetent, setnetent, endnetent
    method List of Host     hosts()     {...} # gethostent, sethostent, endhostent
}

Additions

Please post errors and feedback to perl6-language. If you are making a general laundry list, please separate messages by topic.