NAME
File::Rdiff -- generate remote signatures and patch files using librsync
SYNOPSIS
use File::Rdiff
DESCRIPTION
A more-or-less direct interface to librsync (http://librsync.sourceforge.net/).
For usage examples (better than this very sparse documentation), see the two example scripts below.
- LIBRSYNC_VERSION
-
A constant describing the version of the rsync library used in this module. My version claimed to be "0.9.5 librsync" when I wrote this document ;)
- $oldlevel = trace_level [$newlevel]
-
Return the current tracelevel and optionally set a new one.
- $oldcb = trace_to [$newcb]
-
Return the current trace callback and optionally set a new one. The callback will be called with the log level as the first argument and the log message as the second one.
Calling
trace_to
withundef
will restore the default handler (which currently prints the message to standard error). - supports_trace
-
Returns wether debugging traces are supported in this version of the library.
- $msg = strerror $rcode
-
Returns a string representation of the given error code. You usually just "exit(1)" or something when a function/method fails, as all (most?) librsync functions log the error properly, so there is rarely a need to call this function.
- $md4 = md4_file $fh
- sig_file $old_fh, $sig_fh[, $block_len[, $strong_len]]
- $sig = loadsig_file $fh
- delta_file $signature, $new_fh, $delta_fh
- patch_file $base_fh, $delta_fh, $new_fh
The File::Rdiff::Job class
The File::Rdiff::Buffers class
This class contains the input and output buffers for the non-blocking interface. It is slightly unusual in that it allows direct manipulation of (some) of it's internal variables.
- new File::Rdiff::Buffers [$outsize]
-
Creates and initializes a new buffers structure.
$outsize
specifies the maximum number of bytes to be read into the output scalar until it is considered full. The default is 64k. - $buffers->in($in)
-
Set the next block of input to consume. Data will be read from this scalar (no copy will be made!) until all bytes have been consumed or a new input scalar is set.
- $out = $buffers->out
-
Return the current output data and create a new buffer. Returns
undef
if no data has been accumulated. - $buffers->eof
-
Set the eof flag to true. This indicates that no data is following the current input scalar.
- $buffers->avail_in
-
Returns the numer of bytes still available for input. If there are no input bytes available but the eof flag is set, returns -1 (to make boolean tests easy to check wether to supply more data easier).
- $buffers->avail_out
-
Returns the number of bytes still available in the output buffer.
- $buffers->size
-
The number of bytes that have been accumulated in the current buffer so far.
The File::Rdiff::Job class
It is possible to have multiple jobs running at the same time. The idea is to create job objects and then drive them incrementally with input or output data until all date has been processed.
- new_sig File::Rdiff::Job [$new_block_len[, $strong_sum_len]]
-
Create a job that converts a base stream into a signature stream (i.e. creates signatures).
- new_loadsig File::Rdiff::Job
-
Create a job that converts the input stream into a in-memory File::Rdiff::Signature object. The signature object can be fetched anytime with the
signature
-method. - new_delta File::Rdiff::Job $signature
-
Creates a job that creates (outputs) a delta between the input stream (the newer file) and the file represented by the given signature.
- new_patch File::Rdiff::Job $callback_or_filehandle
-
Creates a job that patches a file according to the input stream (a delta stream). The single argument is used to read the base file contents. If it is a filehandle, it must be a seekable handle to the base file.
If it is a coderef, it will be called whenever base file data must be read. Two arguments will be passed: the file offset and the length. The callback should eithe return the data read (must be a string, not a number!) or an error code.
- $job->iter($buffers)
-
Do as much work as possible given the input and/or output data in the File::Rdiff::Buffers structure and return either
DONE
when the job is finished,BLOCKED
if there aren't enough bytes available in the input or output buffers (in which case you should deplete the output buffer and/or fill the input buffer and loop), or some error code indicating that the operation failed. - $job->signature
-
Only valid for
new_loadsig
, so look there.
EXAMPLE PROGRAM ONE
Very simple program that mimics librsync's rdiff, using the simple file utility functions. see example below for the same program, written using the nonblocking API.
#!/usr/bin/perl
use File::Rdiff qw(:trace :file);
trace_level(LOG_INFO);
if ($ARGV[0] eq "signature") {
open $base, "<$ARGV[1]" or die "$ARGV[1]: $!";
open $sig, ">$ARGV[2]" or die "$ARGV[2]: $!";
File::Rdiff::sig_file $base, $sig;
} elsif ($ARGV[0] eq "delta") {
open $sig, "<$ARGV[1]" or die "$ARGV[1]: $!";
open $new, "<$ARGV[2]" or die "$ARGV[2]: $!";
open $delta, ">$ARGV[3]" or die "$ARGV[3]: $!";
$sig = loadsig_file $sig;
ref $sig or exit 1;
$sig->build_hash_table;
File::Rdiff::delta_file $sig, $new, $delta;
} elsif ($ARGV[0] eq "patch") {
open $base, "<$ARGV[1]" or die "$ARGV[1]: $!";
open $delta, "<$ARGV[2]" or die "$ARGV[2]: $!";
open $new, ">$ARGV[3]" or die "$ARGV[3]: $!";
File::Rdiff::patch_file $base, $delta, $new;
} else {
print <<EOF;
$0 signature BASE SIGNATURE
$0 delta SIGNATURE NEW DELTA
$0 patch BASE DELTA NEW
EOF
exit (1);
}
EXAMPLE PROGRAM TWO
Same as above, but written using the callback-based, "nonblocking", API.
#!/usr/bin/perl
use File::Rdiff qw(:trace :nonblocking);
trace_level(LOG_INFO);
if ($ARGV[0] eq "signature") {
open $basis, "<", $ARGV[1]
or die "$ARGV[1]: $!";
open $sig, ">", $ARGV[2]
or die "$ARGV[2]: $!";
my $job = new_sig File::Rdiff::Job 128;
my $buf = new File::Rdiff::Buffers 4096;
while ($job->iter($buf) == BLOCKED) {
# fetch more input data
$buf->avail_in or do {
my $in;
65536 == sysread $basis, $in, 65536 or $buf->eof;
$buf->in($in);
};
print $sig $buf->out;
}
print $sig $buf->out;
} elsif ($ARGV[0] eq "delta") {
open $sig, "<$ARGV[1]" or die "$ARGV[1]: $!";
open $new, "<$ARGV[2]" or die "$ARGV[2]: $!";
open $delta, ">$ARGV[3]" or die "$ARGV[3]: $!";
# first load the signature into memory
my $job = new_loadsig File::Rdiff::Job;
my $buf = new File::Rdiff::Buffers 0;
do {
$buf->avail_in or do {
my $in;
65536 == sysread $sig, $in, 65536 or $buf->eof;
$buf->in($in);
};
} while $job->iter($buf) == BLOCKED;
$sig = $job->signature;
$sig->build_hash_table;
# now create the delta file
my $job = new_delta File::Rdiff::Job $sig;
my $buf = new File::Rdiff::Buffers 65536;
do {
$buf->avail_in or do {
my $in;
65536 == sysread $new, $in, 65536 or $buf->eof;
$buf->in($in);
};
print $delta $buf->out;
} while $job->iter($buf) == BLOCKED;
print $delta $buf->out;
} elsif ($ARGV[0] eq "patch") {
open $base, "<$ARGV[1]" or die "$ARGV[1]: $!";
open $delta, "<$ARGV[2]" or die "$ARGV[2]: $!";
open $new, ">$ARGV[3]" or die "$ARGV[3]: $!";
# NYI
File::Rdiff::patch_file $base, $delta, $new;
} else {
print <<EOF;
$0 signature BASIS SIGNATURE
$0 delta SIGNATURE NEW DELTA
$0 patch BASE DELTA NEW
EOF
exit (1);
}
SEE ALSO
File::Rsync, rdiff1 (usage example using simple file API), rdiff2 (example using nonblocking API).
BUGS
- not well-tested so far.
- low memory will result in segfaults rather than croaks.
- no access to statistics yet
- documentation leaves much to be deserved.
AUTHOR
Marc Lehmann <schmorp@schmorp.de>
http://home.schmorp.de/