NAME
Shell::Base - A generic class to build line-oriented command interpreters.
SYNOPSIS
package My::Shell;
use Shell::Base;
use base qw(Shell::Base);
sub do_greeting {
return "Hello!"
}
DESCRIPTION
Shell::Base is a base class designed for building command line programs. It defines a number of useful defaults, simplifies adding commands and help, and integrates well with Term::ReadLine.
After writing several REP (Read-Eval-Print) loops in Perl, I found myself wishing for something a little more convenient than starting with:
while(1) {
my $line = <STDIN>;
last unless defined $line;
chomp $line;
if ($line =~ /^...
Features
Shell::Base provides simple access to many of the things I always write into my REP's, as well as support for many thing that I always intend to, but never find time for:
- readline support
-
Shell::Base provides simple access to the readline library via Term::ReadLine, including built-in tab-completion and easy integration with the history file features.
If a subclass does want or need Term::ReadLine support, then it can be replaced in subclasses by overriding a few methods. See "Using Shell::Base Without readline", below.
- Trivial to add commands
-
Adding commands to your shell is as simple as creating methods: the command
foo
is dispatched todo_foo
. In addition, there are hooks for unknown commands and for when the user just hits <Return>, both of which a subclass can override. - Integrated help system
-
Shell::Base makes it simple to integrate online help within alongside your command methods. Help for a command
foo
can be retrieved withhelp foo
, with the addition of one method. In addition, a generalhelp
command lists all possible help commands; this list is generated at run time, so there's no possibility of forgetting to add help methods to the list of available topics. - Pager integration
-
Output can be sent through the user's default pager (as defined by $ENV{'PAGER'}, with a reasonable default) or dumped directly to STDOUT.
- Customizable output stream(s)
-
Printing is handled through a print() method, which can be overridden in a subclass to send output anywhere.
- Pre- and post-processing methods
-
Input received from readline() can be processed before it is parsed, and output from command methods can be post-processed before it is sent to print().
- Automatic support for RC files
-
A simple RC-file parser is built in, which handles name = value type configuration files. This parser handles comments, whitespace, multiline definitions, boolean and (name, value) option types, and multiple files (e.g., /etc/foorc, $HOME/.foorc).
Shell::Base was originally based, conceptually, on Python's cmd.Cmd
class, though it has expanded far beyond what Cmd
offers.
METHODS
There are two basic types of methods: methods that control how a Shell::Base-derived object behaves, and methods that add command to the shell.
All aspects of a Shell::Base-derived object are available via accessors, from the Term::ReadLine instance to data members, to make life easier for subclass implementors.
NB: The following list isn't really in any order!
- new
-
The constructor is called
new
, and should be inherited from Shell::Base (and not overridden).new
should be called with a reference to a hash of name => value parameters:my %options = (HISTFILE => glob("~/.myshell_history"), OPTION_1 => $one, OPTION_2 => $two); my $shell = My::Shell->new(\%options);
new
calls a number of initializing methods, each of which will be called with a reference to the passed in hash of parameters as the only argument:- init_rl(\%args)
-
init_rl
initializes the Term::ReadLine instance. If a subclass does not intend to use Term::ReadLine, this method can be overridden. (There are other methods that need to be overridden to eliminate readline completely; see "Using Shell::Base Without readline" for more details.)The completion method,
complete
, is set here, though the list of possible completions is generated in theinit_completions
method.If a HISTFILE parameter is passed to
init_rl
, then the internal Term::ReadLine instance will attempt to use that file for history functions. See "History Functions" in Term::ReadLine::Gnu for more details. - init_rcfiles(\%args)
-
init_rcfiles
treats each element in the RCFILES array (passed into the contructor) as a configuration file, and attempts to read and parse it. See "RC Files", below. - init_help(\%args)
-
init_help
generates the list of available help topics, which is all methods that match the pattern^help_
, by default. Once this list is generated, it is stored using thehelps
method (see "helps"). - init_completions(\%args)
-
init_completions
creates the list of methods that are tab-completable, and sets them using thecompletions
method. By default, it finds all methods that begin with^do_
in the current class and superclass(es).The default completion method,
complete
, chooses completions from this list based on the line and word being completed. See "complete". - init(\%args)
-
A general purpose
init
method, designed to be overridden by subclasses. The defaultinit
method in Shell::Base does nothing.In general, subclass-specific initializations should go in this method.
A subclass's
init
method should be carful about deleting from the hash that they get as a parameter -- items removed from the hash are really gone. At the same time, items can be added to the hash, and will persist. The original parameters can be retrieved at run time using theargs
method.Similarly, configuration data parsed from RCFILES can be retrieved using the
config
method. - run
-
The main "loop" of the program is a method called
run
-- all other methods are called in preparation for the call torun
, or are called from withinrun
.run
takes no parameters, and does not return.$shell = My::Shell->new(); $shell->run();
At the top of the loop,
run
prints the value of $self->intro, if it is defined:my $intro = $self->intro(); $self->print("$intro\n") if defined $intro;
run
does several things for each iteration of the REP loop that are worth noting:Reads a line of input using $self->readline(), passing the value of $self->prompt():
$line = $self->readline($self->prompt);
Passes that line through $self->precmd(), for possible manipulation:
$line = $self->precmd($line);
Parses the line:
($cmd, $env, @args) = $self->parseline($line);
See "parseline" for details about
parseline
, and what $cmd, $env, and @args are.Update environment variables with entries from %$env, for the command $cmd only.
Checks the contents of $cmd; there are a few special cases:
If $cmd matches $Shell::Base::RE_QUIT, the method
quit
is invoked:$output = $self->quit();
$RE_QUIT is
^(?i)\s*(quit|exit|logout)
by defaultOtherwise, if $cmd matches $Shell::Base::RE_HELP, the method
help
is invoked, with @args as parameters:$output = $self->help(@args);
$RE_HELP is
^(?i)\s*(help|\?)
by default.Otherwise, if $cmd matches $Shell::Base::RE_SHEBANG, the method
do_shell
is invoked, with @args as parameters:$output = $self->do_shell(@args);
$RE_SHEBANG is
^\s*!\s*$
by default.Otherwise, the command
do_$cmd
is invoked, with @args as parameters:my $method = "do_$cmd"; $output = $self->$method(@args);
$output is passed to $self->postcmd() for postprocessing:
$output = $self->postcmd($output);
Finally, if $output is not
undef
, it is passed to $self->print(), with a newline appended:$self->print("$output\n") if defined $output;
When the main loop ends, usually through the
exit
orquit
commands, or when the user issues CTRL-D,run
calls thequit
method. - args([$what])
-
The original hash of arguments passed into the constructor is stored in the instance, and can be retrieved using the args method, which is an accessor only (though the hash returned by
args
is live, and changes will propogate).If
args
is passed a value, then the value associated with that key will be returned. An example:my $shell = My::Shell->new(FOO => "foo", BAR => "bar"); my $foo = $shell->args("FOO"); # $foo contains "foo" my $bar = $shell->args("BAR"); # $bar contains "bar" my $baz = $shell->args("BAZ"); # $baz is undefined my $args = $shell->args(); # $args is a ref to the whole hash
As a convenience, if a specified argument is not found, it is uppercased, and then tried again, so:
my $foo = $shell->args("FOO");
and
my $foo = $shell->args("foo");
are identical if there is a
FOO
arg and nofoo
arg. - config([$what])
-
Configuration data gleaned from RCFILES can be retrieved using the
config
method.config
behaves similarly to theargs
method. - helps
-
When called without arguments,
helps
returns a list of all the available help_foo methods, as a list.When called with arguments,
helps
uses these arguments to set the current list of help methods.This is the method called by
init_help
to fill in the list of available help methods, andhelp
when it needs to figure out the available help topics. - completions
-
Similar to
helps
, except that completions returns or sets the list of completions possible when the user hits <tab>. -
The
print
method, well, prints its data.print
is a method so that subclasses can override it; here is a small example class,Tied::Shell
, that wraps around a Tie::File instance, in which all data is printed to the Tie::File instance, as well as to the normal place. This makes it ideal for (e.g.) logging sessions:package Tied::Shell; use Shell::Base; use Tie::File; use strict; use base qw(Shell::Base); sub init { my ($self, $args) = @_; my @file; tie @file, 'Tie::File', $args->{ FILENAME }; $self->{ TIEFILE } = \@file; } # Append to self, then call SUPER::print sub print { my ($self, @lines) = @_; push @{ $self->{ TIEFILE } }, @lines; return $self->SUPER::print(@lines); } sub quit { my $self = shift; untie @{ $self->{ TIEFILE } }; $self->SUPER::quit(@_); }
(See Tie::File for the appropriate details.)
- readline
-
The
readline
method is a wrapper for $self->term->readline; it is called at the top of the REP loop withinrun
to get the next line of input.readline
is it's own method so that subclasses which do not use Term::ReadLine can override it specifically. A very basic, non-readlinereadline
could look like:sub readline { my ($self, $prompt) = @_; my $line; print $prompt; chomp($line = <STDIN>); return $line; }
As implied by the example,
readline
will be passed the prompt to be displayed, which should be a string (it will be treated like one).A good example of when this might be overridden would be on systems that prefer to use
editline
instead of GNU readline, using theTerm::EditLine
module (e.g., NetBSD):# Initialize Term::EditLine sub init_rl { my ($self, $args) = @_; require Term::EditLine; $self->{ TERM } = Term::EditLine->new(ref $self); return $self; } # Return the Term::EditLine instance sub term { my $self = shift; return $self->{ TERM }; } # Get a line of input sub readline { my ($self, $prompt) = @_; my $line; my $term = $self->term; $term->set_prompt($prompt); $line = $term->gets(); $term->history_enter($line); return $line; }
- default
-
When an unknown command is received, the
default
method is invoked, with ($cmd, @args) as the arguments. The defaultdefault
method simply returns an error string, but this can of course be overridden in a subclass:sub default { my ($self, @cmd) = @_; my $output = `@cmd`; chomp $output; # everything is printed with an extra "\n" return $output; }
- precmd
-
precmd
is called after a line of input is read, but before it is parsed.precmd
will be called with $line as the sole argument, and it is expected to return a string suitable for splitting withparseline
. Any amount of massaging can be done to $line, of course.The default
precmd
method does nothing:sub precmd { my ($self, $line) = @_; return $line; }
This would be a good place to handle things tilde-expansion:
sub precmd { my ($self, $line) = @_; $line =~ s{~([\w\d_-]*)} { $1 ? (getpwnam($1))[7] : $ENV{HOME} }e; return $line; }
- postcmd
-
postcmd
is called immediately before any output is printed.postcmd
will be passed a scalar containing the output of whatever commandrun
invoked.postcmd
is expected to return a string suitable for printing; if the return ofpostcmd
is undef, then nothing will be printed.The default
postcmd
method does nothing:sub postcmd { my ($self, $output) = @_; return $output; }
You can do fun output filtering here:
use Text::Bastardize; my $bastard = Text::Bastardize->new; sub postcmd { my ($self, $output) = @_; $bastard->charge($output); return $bastard->k3wlt0k() }
Or translation:
use Text::Iconv; my $converter; sub postcmd { my ($self, $output) = @_; unless (defined $converter) { # Read these values from the config files my $from_lang = $self->config("from_lang"); my $to_lang = $self->config("to_lang"); $converter = Text::Iconv->new($from_lang, $to_lang); # Return undef on error, don't croak $converter->raise_error(0); } # Fall back to unconverted output, not croak return $completer->convert($output) || $output; }
Or put the tildes back in:
sub postcmd { my ($self, $line) = @_; $line =~ s{(/home/([^/ ]+))} { -d $1 ? "~$2" : $1 }ge; return $line; }
- pager
-
The
pager
method attempts to determine what the user's preferred pager is, and return it. This can be used within an overriddenprint
method, for example, to send everything through a pager:sub print { my ($self, @stuff) = @_; my $pager = $self->pager; open my $P, "|$pager" or carp "Can't open $pager: $!"; CORE::print $P @stuff; close $P; }
Note the explicit use of CORE::print, to prevent infinite recursion.
- parseline
-
A line is divided into ($command, %env, @args) using $self->parseline(). A command
foo
is dispatched to a methoddo_foo
, with @args passed as an array, and with %ENV updated to include %env.If there is no
do_foo
method for a commandfoo
, then the methoddefault
will be called. Subclasses can override thedefault
method.%ENV is localized and updated with the contents of %env for the current command. %env is populated in a similar fashion to how /bin/sh does; the command:
FOO=bar baz
Invokes the
do_baz
method with $ENV{'FOO'} = "bar".Shell::Base doesn't (currently) do anything interesting with pipelines; the command:
foo | bar
will be parsed by parseline() as:
("foo", {}, "|", "bar")
rather than as two separate connected commands. Support for pipelines in on the TODO list.
- prompt
-
Gets or sets the current prompt. The default prompt is:
sprintf "(%s) \$ ", __PACKAGE__;
The prompt method can be overridden, of course, possibly using something like
String::Format
:use Cwd; use File::Basename qw(basename); use Net::Domain qw(hostfqdn); use String::Format qw(stringf); use Sys::Hostname qw(hostname); sub prompt { my $self = shift; my $fmt = $self->{ PROMPT_FMT }; return stringf $fmt => { '$' => $$, 'w' => cwd, 'W' => basename(cwd), '0' => $self->progname, '!' => $self->prompt_no, 'u' => scalar getpwuid($<), 'g' => scalar getgrgid($(), 'c' => ref($self), 'h' => hostname, 'H' => hostfqdn, }; }
Then $self->{ PROMPT_FMT } can be set to, for example,
%u@%h %w %%
, which might yield a prompt like:darren@tumbleweed /tmp/Shell-Base %
(See String::Format for the appropriate details.)
The value passed to
prompt
can be a code ref; if so, it is invoked with $self and any additional arguments passed toprompt
as the arguments:$self->prompt(\&func, @stuff);
Will call:
&$func($self, @stuff);
and use the return value as the prompt string.
- intro / outro
-
Text that is displayed when control enters
run
(intro
) andquit
(outro
). If the method returns a non-undef result, it will be passed to $self->print(). - quit
-
The
quit
method currently handles closing the history file; if it is overridden, $self->SUPER::quit() should be called, so that the history file will be written out.The results of $self->outro() will be passed to $self->print() as well.
Methods That Add Commands
Any command that run() doesn't recognize will be treated as a command; a method named do_$command
will be invoked, in an eval block. Remember that a line is parsed into ($command, %env, @args); do_$command
will be invoked with @args as @_, and %ENV updated to include the contents of %env. The effect is similar to:
my ($command, $env, @args) = $self->parseline($line);
my $method = "do_$command";
local %ENV = (%ENV, %$env);
my $output = $self->$method(@args);
$output will be passed to $self->print() if it is defined.
Here is method that implements the env
command:
sub do_env {
my ($self, @args) = @_;
my @output;
for (keys %ENV) {
push @output, "$_=$ENV{$_}";
}
return join "\n", @output;
}
And here is an rm
command:
sub do_rm {
my ($self, @files) = @_;
my ($file, @errors);
for $file (@files) {
unlink $file
or push @errors, $file;
}
if (@errors) {
return "Couldn't delete " . join ", ", @errors;
}
return;
}
MISCELLANEOUS
Quick Imports
If Shell::Base, or any Shell::Base subclass that does not does implement an import
method, is invoked as:
use My::Shell qw(shell);
a function named shell
is installed in the calling package. This shell
function is very simple, and turns this:
shell(%args);
into this:
My::Shell->new(%args)->run();
This is most useful for one-liners:
$ perl -MMy::Shell=shell -e shell
RC Files
The rcfile parser is simple, and parses (name, value) tuples from config files, according to these rules:
- Definitions
-
Most definitions are in name = value format:
foo = bar baz = quux
Boolean defitions in the form
wiffle
are allowed, and define
wiffle
as 1. Any definition without an = is considered a boolean definition. Boolean definitions in the formnowiffle
definewiffle
as 0:nowiffle
- Comments
-
Everything after a # is considered a comment, and is stripped from the line immediately
- Whitespace
-
Whitespace is (mostly) ignored. The following are equivalent:
foo=bar foo = bar
Whitespace after the beginning of the value is not ignored:
foo = bar baz quux
foo
containsbar baz quux
. - Line continuations
-
Lines ending with \ are continued on the next line:
form_letter = Dear %s,\ How are you today? \ Love, \ %s
Using Shell::Base Without readline
The appropriate methods to override in this case are:
- init_rl
-
The readline initialization method.
- term
-
Returns the Term::ReadLine instance; primarily used by the other methods listed in this section.
- readline
-
Returns the next line of input. Will be passed 1 argument, the prompt to display. See "readline" for an example of overriding
readline
. -
Called with the data to be printed. By default, this method prints to $self->term->OUT, but subclasses that aren't using Term::ReadLine will want to provide a useful alternative. One possibily might be:
sub print { my ($self, @print_me) = @_; CORE::print(@print_me); }
Another good example was given above, in "pager":
sub print { my ($self, @stuff) = @_; my $pager = $self->pager; open my $P, "|$pager" or carp "Can't open $pager: $!"; CORE::print $P @stuff; close $P; }
NOTES
Some parts of this API will likely change in the future. In an upcoming version, do_$foo
methods will mostly likely be expected to return a ($status, $output) pair rather than simply $output. Any API changes that are likely to break existing applications will be noted.
TODO
- abbreviations
-
Add abbreviation support, by default via Text::Abbrev, but overriddable, so that a shell can have (for example), \x type commands, or /x type commands. This can currently be done by overriding the precmd() method or parseline() methods; for example, this parseline() method strips a leading
/
, for IRC-like commands (/foo
,/bar
)sub parseline { my ($self, $line) = @_; my ($cmd, $env, @args) = $self->SUPER::parseline($line); $cmd =~ s:^/::; return ($cmd, $env, @args); }
Another way to implement abbreviations would be to override the
complete
method. - command pipelines
-
I have some ideas about how to implement pipelines, but, since I have yet to look at the code in any existing shells, I might be completely insane and totally on the wrong track. I therefore reserve the right to not implement this feature now, until I've looked at how some proper shells implement pipelines.
AUTHOR
darren chamberlain <darren@cpan.org>
REVISION
This documentation describes Shell::Base
, $Revision: 1.7 $.
COPYRIGHT
Copyright (C) 2003 darren chamberlain. All Rights Reserved.
This module is free software; you can redistribute it and/or modify it under the same terms as Perl itself.