Name
File::Replace - Perl extension for replacing files by renaming a temp file over the original
Synopsis
This module provides three interfaces:
use File::Replace 'replace2';
my ($infh,$outfh) = replace2($filename);
while (<$infh>) {
# write whatever you like to $outfh here
print $outfh "X: $_";
}
close $infh; # closing both handles will
close $outfh; # trigger the replace
Or the more magical single filehandle, in which print
, printf
, and syswrite
go to the output file; binmode
to both; fileno
only reports open/closed status; and the other I/O functions go to the input file:
use File::Replace 'replace';
my $fh = replace($filename);
while (<$fh>) {
# can read _and_ write from/to $fh
print $fh "Y: $_";
}
close $fh;
Or the object oriented:
use File::Replace;
my $repl = File::Replace->new($filename);
my $infh = $repl->in_fh;
while (<$infh>) {
print {$repl->out_fh} "Z: $_";
}
$repl->finish;
Description
This module implements and hides the following pattern for you:
Open a temporary file for output
While reading from the original file, write output to the temporary file
rename
the temporary file over the original file
In many cases, in particular on many UNIX filesystems, the rename
operation is atomic*. This means that in such cases, the original filename will always exist, and will always point to either the new or the old version of the file, so a user attempting to open and read the file will always be able to do so, and never see an unfinished version of the file while it is being written.
* Unfortunately, whether or not a rename will actually be atomic in your specific circumstances is not always an easy question to answer, as it depends on exact details of the operating system and file system. Consult your system's documentation and search the Internet for "atomic rename" for more details.
Version
This documentation describes version 0.02 of this module.
This is an alpha version. While the module works and has a full test suite, I may still decide to change the API as I gain experience with it. I will try to make such changes compatible, but can't guarantee that just yet.
Constructors and Overview
The functions File::Replace->new()
, replace()
, and replace2()
take exactly the same arguments, and differ only in their return values. Note that replace()
and replace2()
are normal functions and not methods, don't attempt to call them as such. If you don't want to import them you can always call them as, for example, File::Replace::replace()
.
File::Replace->new( $filename );
File::Replace->new( $filename, $layers );
File::Replace->new( $filename, option => 'value', ... );
File::Replace->new( $filename, $layers, option => 'value', ... );
# replace(...) and replace2(...) take the same arguments
The options are described in "Options". The constructors will die
in case of errors. It is strongly recommended that you use warnings;
, as then this module will issue warnings which may be of interest to you.
File::Replace->new
use File::Replace;
my $replace_object = File::Replace->new($filename, ...);
Returns a new File::Replace
object. The central methods provided are ->in_fh
and ->out_fh
, which return the input resp. output filehandle which you can read resp. write, and ->finish
, which causes the files to be closed and the replace operation to be performed. There is also ->cancel
, which just discards the temporary output file without touching the input file. Additional helper methods are mentioned below.
finish
will die
on errors, while cancel
will only return a false value on errors. This module will try to clean up after itself (remove temporary files) as best it can, even when things go wrong.
Please don't re-open
the in_fh
and out_fh
handles, as this may lead to confusion.
The method ->is_open
will return a false value if the replace operation has been finish
ed or cancel
ed, or a true value if it is still active. The method ->filename
returns the filename passed to the constructor. The method ->options
in list context returns the options this object has set (including defaults) as a list of key/value pairs, in scalar context it returns a hashref of these options.
replace
use File::Replace 'replace';
my $magic_handle = replace($filename, ...);
Returns a single, "magical" tied filehandle. The operations print
, printf
, and syswrite
are passed through to the output filehandle, binmode
operates on both the input and output handle, and fileno
only reports -1
if the File::Replace
object is still active or undef
if the replace operation has finish
ed or been cancel
ed. All other I/O functions, such as <$handle>
, readline
, sysread
, seek
, tell
, eof
, etc. are passed through to the input handle. You can still access these operations on the output handle via e.g. eof( tied(*$handle)->out_fh )
or tied(*$handle)->out_fh->tell()
. The replace operation (finish
) is performed when you close
the handle.
Re-open
ing the handle causes a new underlying File::Replace
object to be created. You should explicitly close
the filehandle first so that the previous replace operation is performed (or cancel
that operation). The "mode" argument (or filename in the case of a two-argument open
) may not contain a read/write indicator (<
, >
, etc.), only PerlIO layers.
You can access the underlying File::Replace
object via tied(*$handle)->replace
. You can also access the original, untied filehandles via tied(*$handle)->in_fh
and tied(*$handle)->out_fh
, but please don't close
or re-open
these handles as this may lead to confusion.
replace2
use File::Replace 'replace2';
my ($input_handle, $output_handle) = replace2($filename, ...);
my $output_handle = replace2($filename, ...);
In list context, returns a two-element list of two tied filehandles, the first being the input filehandle, and the second the output filehandle, and the replace operation (finish
) is performed when both handles are closed. In scalar context, it returns only the output filehandle, and the replace operation is performed when this handle is closed.
You cannot re-open
these tied filehandles.
You can access the underlying File::Replace
object via tied(*$handle)->replace
on both the input and output handle. You can also access the original, untied filehandles via tied(*$handle)->in_fh
and tied(*$handle)->out_fh
, but please don't close
or re-open
these handles as this may lead to confusion.
Options
Filename
A filename. The temporary output file will be created in the same directory as this file, its name will be based on the original filename, but prefixed with a dot (.
) and suffixed with a random string and an extension of .tmp
.
If the input file does not exist (ENOENT
), then the behavior will depend on the options "devnull" (enabled by default) and "create". If either of these options are set, the input file will be created (just at different times), if neither are enabled, then attempting to open a nonexistent file will fail.
layers
This option can either be specified as the second argument to the constructors, or as the layers => '...'
option in the options hash, but not both. It is a list of PerlIO layers such as ":utf8"
, ":raw:crlf"
, or ":encoding(UTF-16)"
. Note that the default layers differ based on operating system, see "open" in perlfunc.
devnull
This option, which is enabled by default, causes the case of nonexistent input files to be handled by opening /dev/null or its equivalent instead of the input file. This means that while the output file is being written, the input file name will not exist, and only come into existence when the rename operation is performed. If you disable this option, and attempt to open a nonexistent file, then the constructor will die
.
The option "create" being set overrides this option.
create
Enabling this option causes the case of nonexistent input files to be handled by opening the input file name with a mode of +>
, meaning that it is created and opened in read-write mode. However, it is strongly recommended that you don't take advantage of the read-write mode by writing to the input file, as that contradicts the purpose of this module - instead, the input file will exist and remain empty until the replace operation.
Setting this option overrides "devnull". If this option is disabled (the default), "devnull" takes precedence.
perms
perms => 0640 # ok
perms => oct("640") # ok
perms => "0640" # WRONG!
Normally, just before the rename
is performed, File::Replace
will chmod
the temporary file to those permissions that the original file had when it was opened, or, if the original file did not yet exist, default permissions based on the current umask
. Setting this option to an octal value (a number, not a string!) will override those permissions. See also "chmod", which can be used to disable the chmod
operation.
chmod
This option is enabled by default, unless you set $File::Replace::DISABLE_CHMOD
to a true value. When you disable this option, the chmod
operation that is normally performed just before the rename
will not be attempted. This is mostly intended for systems where you know the chmod
will fail. See also "perms", which allows you to define what permissions will be used.
autocancel
If the File::Replace
object is destroyed (e.g. when it goes out of scope), and the replace operation has not been performed yet, normally it will cancel
the replace operation and issue a warning. Enabling this option makes that implicit canceling explicit, silencing the warning.
This option cannot be used together with autofinish
.
autofinish
When set, causes the finish
operation to be attempted when the object is destroyed (e.g. when it goes out of scope).
However, using this option is actually not recommended unless you know what you are doing. This is because the replace operation will also be attempted when your script is die
ing, in which case the output file may be incomplete, and you may not want the original file to be replaced. A second reason is that the replace operation may be attempted during global destruction, and it is not a good idea to rely on this always going well. In general it is better to finish
the replace operation explicitly.
This option cannot be used together with autocancel
.
debug
Enables some debug output for new
, finish
, and cancel
.
Author, Copyright, and License
Copyright (c) 2017 Hauke Daempfling (haukex@zero-g.net) at the Leibniz Institute of Freshwater Ecology and Inland Fisheries (IGB), Berlin, Germany, http://www.igb-berlin.de/
This program is free software: you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation, either version 3 of the License, or (at your option) any later version.
This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.
You should have received a copy of the GNU General Public License along with this program. If not, see http://www.gnu.org/licenses/.