NAME

Apache::LogRegex - Parse a line from an Apache logfile into a hash

VERSION

This document refers to version 1.00 of Apache::LogRegex, released January 22nd, 2004

SYNOPSIS

use Apache::LogRegex;

my $lr;

eval { $lr = Apache::LogRegex->new($log_format) };
die "Unable to parse log line: $@" if ($@);

my %data;

while ( my $line_from_logfile = <> ) {
    eval { %data = $lr->parse($line_from_logfile); };
    if (%data) {
        # We have data to process
    } else {
        # We could not parse this line
    }
}

DESCRIPTION

Overview

Designed as a simple class to parse Apache log files. It will construct a regex that will parse the given log file format and can then parse lines from the log file line by line returning a hash of each line.

The field names of the hash are derived from the log file format. Thus if the format is '%a %t \"%r\" %s %b %T \"%{Referer}i\" ...' then the keys of the hash will be %a, %t, %r, %s, %b, %T and %{Referer}i.

Should these key names be unusable, as I guess they probably are, then subclass and provide an override rename_this_name() method that can rename the keys before they are added in the array of field names.

Constructors and initialization

Apache::LogRegex->new( FORMAT )

Returns a Apache::LogRegex object that can parse a line from an Apache logfile that was written to with the FORMAT string. The FORMAT string is the CustomLog string from the httpd.conf file.

Class and object methods

parse( LINE )

Given a LINE from an Apache logfile it will parse the line and return a hash of all the elements of the line indexed by their format. If the line cannot be parsed an empty hash will be returned.

names()

Returns a list of field names that were extracted from the data. Such as '%a', '%t' and '%r' from the above example.

regex()

Returns a copy of the regex that will be used to parse the log file.

rename_this_name( NAME )

Use this method to rename the keys that will be used in the returned hash. The initial NAME is passed in and the method should return the new name.

ENVIRONMENT

Perl 5

DIAGNOSTICS

The only problem I can foresee is the various custom time formats but providing that they are encased in '[' and ']' all should be fine.

Apache::LogRegex->new() takes 1 argument

When the constructor is called it requires one argument. This message is given if more or less arguments were supplied.

Apache::LogRegex->new() argument 1 (FORMAT) is undefined

The correct number of arguments were supplied with the constructor call, however the first argument, FORMAT, was undefined.

Apache::LogRegex->parse() takes 1 argument

When the method is called it requires one argument. This message is given if more or less arguments were supplied.

Apache::LogRegex->parse() argument 1 (LINE) is undefined

The correct number of arguments were supplied with the method call, however the first argument, LINE, was undefined.

Apache::LogRegex->names() takes no argument

When the method is called it requires no arguments. This message is given if some arguments were supplied.

Apache::LogRegex->regex() takes no argument

When the method is called it requires no arguments. This message is given if some arguments were supplied.

BUGS

None so far

FILES

None

SEE ALSO

mod_log_config for a description of the Apache format commands

AUTHOR

Peter Hickman (peterhi@ntlworld.com)

COPYRIGHT

Copyright (c) 2004, Peter Hickman. All rights reserved. This module is free software. It may be used, redistributed and/or modified under the same terms as Perl itself.