NAME

OODoc::Parser::Markov - Parser for the MARKOV syntax

INHERITANCE

OODoc::Parser::Markov
  is an OODoc::Parser
  is an OODoc::Object

SYNOPSIS

DESCRIPTION

The "Markov parser" is named after the author, because the author likes to invite other people to write their own parser as well: every one has not only their own coding style, but also their own documentation wishes.

The task for the parser is to strip Perl package files into a code part and a documentation tree. The code is written to a directory where the module distribution is built, the documenation tree is later formatted into manual pages.

Extends "DESCRIPTION" in OODoc::Parser.

OVERLOADED

Extends "OVERLOADED" in OODoc::Parser.

overload: '!=' (numeric different): Inherited, see "OVERLOADED" in OODoc::Object
overload: '==' (numeric equivalent): Inherited, see "OVERLOADED" in OODoc::Object
overload: 'bool': Inherited, see "OVERLOADED" in OODoc::Object

METHODS

Extends "METHODS" in OODoc::Parser.

Constructors

Extends "Constructors" in OODoc::Parser.

$class->new(%options)

Inherited, see "Constructors" in OODoc::Object

-Option          --Defined in     --Default
 additional_rules                   +[]
 index                              <required>
 skip_links        OODoc::Parser    undef
 syntax            OODoc::Parser    'markov'

additional_rules => \@rules: An ARRAY which contains references to match-action pairs, as accepted by rule(). These rules get preference over the existing rules. Be warned: you pass an ARRAY of ARRAYs.
index => OODoc::Index object: [2.01] where the package and manual administration is.
skip_links => ARRAY|REGEXP|STRING
syntax => PACKAGE|$name

Attributes

Extends "Attributes" in OODoc::Parser.

$obj->currentManual( [$manual] ): Returns the manual object which is currently being filled with data. With a new $manual, a new one is set.
$obj->inDoc( [BOOLEAN] ): When a BOOLEAN is specified, the status changes. It returns the current status of the document reader.
$obj->index(): [2.01] Returns the documentation tree administration.
$obj->manual(): Inherited, see "Attributes" in OODoc::Object
$obj->rules(): Returns the ARRAY of active rules. You may modify it.
$obj->setBlock(REF-SCALAR): Set the scalar where the next documentation lines should be collected in.
$obj->unique(): Inherited, see "Attributes" in OODoc::Object

Collected

Extends "Collected" in OODoc::Parser.

$obj->publish(\%options): Inherited, see "Collected" in OODoc::Object

Parsing a file

Extends "Parsing a file" in OODoc::Parser.

$obj->findMatchingRule($line)

Check the list of rules whether this $line matches one of them. This is an ordered evaluation. Returned is the matched string and the required action. If the line fails to match anything, an empty list is returned.

» example:

if(my($match, $action) = $parser->findMatchingRule($line))
{  # do something with it
   $action->($parser, $match, $line);
}

$obj->parse(%options)

Process an input file into manual fragments. Improves base, see "Parsing a file" in OODoc::Parser

-Option      --Default
 distribution  <required>
 input         <required>
 notice        ''
 output        devnull
 version       <required>

distribution => $name
input => $file
notice => $text: Block of $text added in from of the output file.
output => $file
version => STRING

$obj->rule( <STRING|Regexp>, <$method|CODE> )

Register a rule which will be applied to a line in the input file. When a STRING is specified, it must start at the beginning of the line to be selected. You may also specify a regular expression which will match on the line.

The second argument is the action which will be taken when the line is selected. Either the named $method or the CODE reference will be called. Their arguments are:

$parser->METHOD($match, $line, $file, $linenumber);
CODE->($parser, $match, $line, $file, $linenumber);

Producing manuals

Extends "Producing manuals" in OODoc::Parser.

$obj->autoMarkup($manual, $chapter, \%options)

Used via finalizeManual() to apply automatic M, P, and C markup tags.

$obj->cleanupHtml($manual, $text, %options)

Inherited, see "Producing manuals" in OODoc::Parser

$obj->cleanupHtmlL($manual, $link, \%options)

$obj->cleanupHtmlM($manual, $link, \%options)

$obj->cleanupPod($manual, $text, %options)

Inherited, see "Producing manuals" in OODoc::Parser

$obj->cleanupPodL($manual, $link, $args)

The L markups for OODoc::Parser::Markov have the same syntax as standard POD has, however most standard pod-laters do no accept links in verbatim blocks. Therefore, the links have to be translated in their text in such a case. The translation itself is done in by this method.

$obj->cleanupPodM($manual, $link, $args)

$obj->decomposeL($manual, $link)

Decompose the L-tags. These tags are described in perlpod, but they will not refer to items: only headers.

$obj->decomposeM($manual, $link)

$any->filenameToPackage($file)

» example:

print $self->filenameToPackage('Mail/Box.pm'); # prints Mail::Box

$obj->finalizeManual($manual, %options)

In the last completion step of the manual, the parser will add some P markings: the "this is a parameter" marking. (But in the future it might do more) Improves base, see "Producing manuals" in OODoc::Parser

-Option     --Default
 skip_auto_m  false
 skip_auto_p  false
 wrap_c       [ qw/undef true false/ ]

skip_auto_m => BOOLEAN: Skip the automatic generation of M <> tags around packages names.
skip_auto_p => BOOLEAN: Do not add P <> tags around variables automatically.
wrap_c => ARRAY-of-WORDS

$obj->formatReferTo($manual, $object)

Inherited, see "Producing manuals" in OODoc::Parser

$obj->skipManualLink($package)

Inherited, see "Producing manuals" in OODoc::Parser

DETAILS

General Description

The Markov parser has some commonalities with the common POD syntax. You can use the same tags as are defined by POD, however these tags are "visual style", which means that OODoc can not treat it smart. The Markov parser adds many logical markups which will produce nicer pages.

Furthermore, the parser will remove the documentation from the source code, because otherwise the package installation would fail: Perl's default installation behavior will extract POD from packages, but the markup is not really POD, which will cause many complaints.

The version of the module is defined by the OODoc object which creates the manual page. Therefore, $VERSION will be added to each package automatically.

Disadvantages

The Markov parser removes all raw documentation from the package files, which means that people sending you patches will base them on the processed source: the line numbers will be wrong. Usually, it is not much of a problem to manually process the patch: you have to check the correctness anyway.

A second disadvantage is that you have to backup your sources separately: the sources differ from what is published on CPAN, so CPAN is not your backup anymore. The example scripts, contained in the distribution, show how to produce these "raw" packages.

Finally, a difference with the standard POD process: the manual-page must be preceeded with a package keyword.

Structural tags

Heading

=chapter       STRING
=section       STRING
=subsection    STRING
=subsubsection STRING

These text structures are used to group descriptive text and subroutines. You can use any name for a chapter, but the formatter expects certain names to be used: if you use a name which is not expected by the formatter, that documentation will be ignored.

auto-markup

Do not use the words undef, true, and false in other sense then undefined, and boolean, because they will get automaticly wrapped in markup.

Package names will automatically be wrapped in M <> markup.

Variables mentioned in subroutine parameter lists, diagnostics and as option parameter will automatically be wrapped in P <> markup.

Subroutines

Perl has many kinds of subroutines, which are distinguished in the logical markup. The output may be different per kind.

=i_method  NAME PARAMETERS   (instance method)
=c_method  NAME PARAMETERS   (class method)
=ci_method NAME PARAMETERS   (class and instance method)
=method    NAME PARAMETERS   (short for i_method)
=function  NAME PARAMETERS
=tie       NAME PARAMETERS
=overload  STRING

The NAME is the name of the subroutine, and the PARAMETERS an argument indicator.

Then the subroutine description follows. These tags have to follow the general description of the subroutines. You can use

=option    NAME PARAMETERS
=default   NAME VALUE
=requires  NAME PARAMETERS

If you have defined an =option, you have to provide a =default for this option anywhere. Use of =default for an option on a higher level will overrule the one in a subclass.

Subroutine parameters

The parser will turn this line

=c_method new %options

into an OODoc::Text::Subroutine, with attribute name set to new, and string %options as parameter string. The formatters and exporters will translate this subroutine call into

$class->new(%options)

A more complex list of parameters, by convension, is a LIST of

undef: undef is accepted on this spot;
$scalar: a single value;
@list: a LIST of values;
%options: a LIST of key-value PAIRS;
\@array: a reference to an ARRAY of values;
\%hash: a reference to a HASH;
[something]: the parameter is optional; and
($scalar|\@array|undef): alternative parameters on that position.

For instance:

=method scan $filename|$fh, $max_size, %options
=c_method new [$count], %options|\%options

Shown as:

$obj->scan($filename|$fh, $max_size, %options);
$class->new([$count], %options|\%options);

Include examples

Examples can be added to chapters, sections, subsections, subsubsections, and subroutines. They run until the next markup line, so can only come at the end of the documentation pieces.

=example
=examples

Include diagnostics

A subroutine description can also contain error or warning descriptions. These diagnostics are usually collected into a special chapter of the manual page.

=error this is very wrong
Of course this is not really wrong, but only as an example
how it works.

=warning wrong, but not sincerely
Warning message, which means that the program can create correct output
even though it found sometning wrong.

Compatibility

For comfort, all POD markups are supported as well

=head1 Heading Text   (same as =chapter)
=head2 Heading Text   (same as =section)
=head3 Heading Text   (same as =subsection)
=head4 Heading Text   (same as =subsubsection)
=over indentlevel
=item stuff
=back
=cut
=pod
=begin format
=end format
=for format text...

Text markup

Besides above structural markup, there is text markup. The parser supports the standard formatting codes, explained in the perlpod manual:

B<text>; bold text
C<code>; code text, fixed font
E<escape>; html entities
F<filename>; filenames
I<text>; italic text
L<link>; link to text block or url
S<text>; non-breaking spaces
X<topic name>; link destination
Z<>; null format code

This parser adds two more:

M<link>; link to subroutine and options
P<name>; marks the parameter of the sub

These two are defined in the next sections.

The parameter marker `P`

Within the description of a function or method, you often refer to parameter or options of that element. When you pick good names for these, then the description text can smoothly refer to them. Example:

=method changeTitle $manual, $title
Change the $title of this $manual without saving.

The element description must contain %options or \%options when it has (inherited) options.

In this example, the variables will get highlighted by the formatters automatically, unless they already have a style (like C<$title>). They will get marked-up with a P<$title>. You may also do this explicitly:

=method changeTitle $manual, $title
Change the C<$title> of this C<$manual> without saving.

The method link `M`

The M-link can not be nested inside other text markup items. It is used to refer to manuals, subroutines, and options. You can use an L-link to manuals as well, however then the POD output filter will modify the manual page while converting it to other manual formats.

Syntax of the M-link:

MZ<><OODoc::Object>
MZ<><OODoc::Object::new()>
MZ<><OODoc::Object::new(verbose)>
MZ<><new()>
MZ<><new(verbose)>

These links refer to a manual page, a subroutine within a manual page, and an option of a subroutine respectively. And then two abbreviations are shown: they refer to subroutines of the same manual page, in which case you may refer to inherited documentation as well.

The L-link

The standard POD defines a L markup tag. This can also be used with this Markov parser.

The following syntaxes are supported. Below, block can be any of a chapter, section, subsection, or subsubsection name. Here, manual refers to a (unix) manual page, might be a Perl module.

LZ<><manual>
LZ<><manual/block>
LZ<><manual/"block">
LZ<></block>
LZ<></"block">
LZ<><"block">
LZ<><url>

All above can also carry an alternative text to be displayed with the link. For instance:

LZ<><text|manual/"block">

The unix-manual MUST be formatted with its chapter number, for instance M<cat(1)>, otherwise a link will be created. See the following examples in the html version of these manual pages:

MZ<><perldoc>              illegal: not in distribution
LZ<><perldoc>              manual perldoc
LZ<><perldoc(1perl)>       manual perldoc(1perl)
MZ<><OODoc::Object>        OODoc::Object
LZ<><OODoc::Object>        OODoc::Object
LZ<><OODoc::Object(3pm)>   manual OODoc::Object(3pm)

Grouping subroutines

Subroutine descriptions can be grouped in a chapter, section, subsection, or subsubsection. It is very common to have a large number of subroutines, so some structure has to be imposed here.

If you document the same routine in more than one manual page with an inheritance relationship, the documentation location shall not conflict. You do not need to give the same level of detail about the exact location of a subroutine, as long as it is not conflicting. This relative freedom is created to be able to regroup existing documentation without too much effort.

For instance, in the code of OODoc itself (which is of course documented with OODoc), the following happens:

package OODoc::Object;
...
=chapter METHODS
=section Initiation
=c_method new OPTIONS

package OODoc;
use parent 'OODoc::Object';
=chapter METHODS
=c_method new OPTIONS

As you can see in the example, in the higher level of inheritance, the new method is not put in the Initiation section explicitly. However, it is located in the METHODS chapter, which is required to correspond to the base class. The generated documentation will show new in the Initiation section in both manual pages.

Caveats

The markov parser does not require blank lines before or after tags, like POD does. This means that the change to get into parsing problems have increased: lines within here documents which start with a = will cause confusion. However, I these case, you can usually simply add a backslash in front of the printed =, which will disappear once printed.

Examples

You may also take a look at the raw code archive for OODoc (the text as is before it was processed for distribution).

» Example: how subroutines are documented

=chapter FUNCTIONS

=function countCharacters FILE|STRING, OPTIONS
Returns the number of bytes in the FILE or STRING,
or undef if the string is undef or the character
set unknown.

=option  charset CHARSET
=default charset 'us-ascii'
Characters in, for instance, utf-8 or unicode encoding
require variable number of bytes per character.  The
correct CHARSET is needed for the correct result.

=examples

  my $count = countCharacters("monkey");
  my $count = countCharacters("monkey",
      charset => 'utf-8');

=error unknown character set $charset

The character set you can use is limited by the sets
defined by manual Encode.  The characters of the input can
not be seperated from each other without this definition.

=cut

# now the coding starts
sub countCharacters($@) {
   my ($self, $input, %options) = @_;
   ...
}

DIAGNOSTICS

Warning: Pod tag $tag does not terminate any doc in $file line $line: Cast by parse()
Warning: Pod tag $tag does not terminate any doc in $file line $number: There is no document to end here. Cast by parse()
Error: Unknown format key '$key' in manual $manual: Cast by cleanupPodL()
Warning: You may have accidentally captured code in doc $file line $linenr: Some keywords on the first position of a line are very common for code. However, code within doc should start with a blank to indicate pre-formatted lines. Rarely, this warning may be produced incorrectly. Cast by parse()
Fault: cannot read document from $file: $!: The document file can not be processed because it can not be read. Reading is required to be able to build a documentation tree. Cast by parse()
Fault: cannot write stripped code to $file: $!: Cast by parse()
Error: chapter $name before package statement in $file line $line: Cast by parse()
Error: chapter `$name' before package statement in $file line $number: A package file can contain more than one package: more than one name space. The docs are sorted after the name space. Therefore, each chapter must be preceeded by a package statement in the file to be sure that the correct name space is used. Cast by parse()
Warning: debugging remains in $file line $linenr: The author's way of debugging is by putting warn/die/carp etc on the first position of a line. Other lines in a method are always indented, which means that these debugging lines are clearly visible. You may simply ingnore this warning. Cast by parse()
Error: default for option $name outside subroutine in $file line $linenr: A default is set, however there is not subroutine in scope (yet). It is plausible that the option does not exist either, but that will be checked later. Cast by parse()
Warning: default line incorrect in $file line $linenr:\n $line: The shown $line is not in the right format: it should contain at least two words being the option name and the default value. Cast by parse()
Error: diagnostic $type outside subroutine in $file line $linenr: It is unclear to which subroutine this diagnostic message belongs. Cast by parse()
Warning: doc did not end in $file: When the whole $file was parsed, the documentation part was still open. Probably you forgot to terminate it with a =cut. Cast by parse()
Warning: empty L link in $manual: Cast by decomposeL()
Error: example outside chapter in $file line $linenr: An example can belong to a subroutine, chapter, section, and subsection. Apparently, this example was found before the first chapter started in the file. Cast by parse()
Warning: in $where, options but no call parameter %options: Cast by autoMarkup()
Warning: in $where, text uses unknown '$label': Cast by autoMarkup()
Warning: manual $manual links to unknown entry '$item' in $other_manual: Cast by decomposeL()
Warning: module $name is not on your system, but linked to in $manual: The module can not be found. This may be an error at your part (usually a typo) or you didn't install the module on purpose. This message will also be produced if some defined package is stored in one file together with an other module or when compilation errors are encountered. Cast by decomposeM()
Warning: no block for line $linenr in file $file\n $line: Cast by parse()
Warning: no diagnostic message supplied in $file line $linenr: The start of a diagnostics message was indicated, however not provided on the same line. Cast by parse()
Error: no input file to parse specified: The parser needs the name of a file to be read, otherwise it can not work. Cast by parse()
Warning: no manual for $package (correct casing?): The manual for $package cannot be found. If you have a module named this way, this may indicate that the NAME chapter of the manual page in that module differs from the package name. Often, this is a typo in the NAME... probably a difference in used cases. Cast by decomposeM()
Error: option $name outside subroutine in $file line $linenr: An option is set, however there is not subroutine in scope (yet). Cast by parse()
Warning: option '$name' unknown for $call() in $where, found in $manual: Cast by decomposeM()
Warning: option line incorrect in $file line $linenr:\n $line: The shown $line is not in the right format: it should contain at least two words being the option name and an abstract description of possible values. Cast by parse()
Warning: option {where} has no default: Cast by autoMarkup()
Warning: requires line incorrect in $file line $linenr:\n $line: Cast by parse()
Error: section '$name' outside chapter in $file line $line: Cast by parse()
Error: section '$name' outside chapter in $file line $number: Sections must be contained in chapters. Cast by parse()
Warning: subroutine $call() is not defined by $pkg, but linked to in $manual: Cast by decomposeM()
Error: subroutine $name outside chapter in $file line $line: Cast by parse()
Error: subroutine $name outside chapter in $file line $number: Subroutine descriptions (method, function, tie, ...) can only be used within a restricted set of chapters. You have not started any chapter yet. Cast by parse()
Error: subroutine type '$type' without name in $file line $line: Cast by parse()
Error: subsection '$name' outside section in $file line $line: Cast by parse()
Error: subsection '$name' outside section in $file line $number: Subsections are only allowed in a chapter when it is nested within a section. Cast by parse()
Error: subsubsection '$name' outside section in $file line $line: Cast by parse()
Error: subsubsection '$name' outside subsection in $file line $number: Subsubsections are only allowed in a chapter when it is nested within a subsection. Cast by parse()
Warning: unknown markup in $file line $linenr:\n $line: The standard pod and the extensions made by this parser define a long list of markup keys, but yours is not one of these predefined names. Cast by parse()
Warning: use problem for module $name in $manual;\n$err: In an attempt to check the correctness of your naming of a module, OODoc will try to compile ("require") the named module. Apparently, the module was found, but something else went wrong. The exact cause is not always easy to find. Cast by decomposeM()

LICENSE

For contributors see file ChangeLog.

This is free software; you can redistribute it and/or modify it under the same terms as the Perl 5 programming language system itself.

To install OODoc, copy and paste the appropriate command in to your terminal.

cpanm

cpanm OODoc

CPAN shell

perl -MCPAN -e shell
install OODoc

For more information on module installation, please visit the detailed CPAN module installation guide.

	Global
`s`	Focus search bar
`?`	Bring up this help dialog

	GitHub
`g` `p`	Go to pull requests
`g` `i`	go to github issues (only if github is preferred repository)

	POD
`g` `a`	Go to author
`g` `c`	Go to changes
`g` `i`	Go to issues
`g` `d`	Go to dist
`g` `r`	Go to repository/SCM
`g` `s`	Go to source
`g` `b`	Go to file browse

	Search terms
module: (e.g. module:Plugin)
distribution: (e.g. distribution:Dancer auth)
author: (e.g. author:SONGMU Redis)
version: (e.g. version:1.00)

NAME

INHERITANCE

SYNOPSIS

DESCRIPTION

OVERLOADED

METHODS

Constructors

Attributes

Collected

Parsing a file

Producing manuals

DETAILS

General Description

Disadvantages

Structural tags

Heading

auto-markup

Subroutines

Subroutine parameters

Include examples

Include diagnostics

Compatibility

Text markup

The parameter marker P

The method link M

The L-link

Grouping subroutines

Caveats

Examples

» Example: how subroutines are documented

DIAGNOSTICS

SEE ALSO

LICENSE

Module Install Instructions

The parameter marker `P`

The method link `M`