NAME
Data::Record::Serialize - Flexible serialization of a record
SYNOPSIS
use Data::Record::Serialize;
# simple output to json
$s = Data::Record::Serialize->new( encode => 'json', \%attr );
$s->send( \%record );
# cleanup record before sending
$s = Data::Record::Serialize->new( encode => 'json',
fields => [ qw( obsid chip_id phi theta ) ],
format => 1,
format_types => { N => '%0.4f' },
format_fields => { obsid => '%05d' },
rename_fields => { chip_id => 'CHIP' },
types => { obsid => 'I', chip_id => 'S',
phi => 'N', theta => 'N' },
);
$s->send( \%record );
# send to an SQLite database
$s = Data::Record::Serialize->new(
encode => 'dbi',
dsn => [ 'SQLite', [ dbname => $dbname ] ],
table => 'stuff',
format => 1,
fields => [ qw( obsid chip_id phi theta ) ],
format_types => { N => '%0.4f' },
format_fields => { obsid => '%05d' },
rename_fields => { chip_id => 'CHIP' },
types => { obsid => 'I', chip_id => 'S',
phi => 'N', theta => 'N' },
);
$s->send( \%record );
DESCRIPTION
Data::Record::Serialize encodes data records and sends them somewhere. This module is primarily useful for output of sets of uniformly structured data records. It provides a uniform, thin, interface to various serializers and output sinks. Its raison d'etre is its ability to manipulate the records prior to encoding and output.
A record is a collection of fields, i.e. keys and scalar values.
All records are assumed to have the same structure.
Fields may have simple types which may be determined automatically. Some encoders use this information during encoding.
Fields may be renamed upon output
A subset of the fields may be selected for output.
Fields may be formatted via
sprintf
prior to output
Encoders
The available encoders and their respective documentation are:
dbi
- Data::Record::Serialize::Encode::dbiWrite to a database via DBI. This is a combined encoder and sink.
ddump
- Data::Record::Serialize::Encode::ddumpencode via Data::Dumper
null
- send the data to the bit bucket. This is a combined encoder and sink.
Sinks
Sinks are where encoded data are sent.
The available sinks and their documentation are:
null
- send the encoded data to the bit bucket.
Types
Some output encoders care about the type of a field. Data::Record::Serialize recognizes three types:
N
- NumericI
- IntegralS
- String
Not all encoders support a separate integral type; in those cases integer fields are treated as general numeric fields.
Output field and type determination
The selection of output fields and determination of their types depends upon the fields
, types
, and default_type
attributes.
fields
specified,types
not specifiedThe fields in
fields
are output. Types are derived from the values in the first record.fields
not specified,types
specifiedThe fields by
types
are output and are given the specified types.fields
specified,types
specifiedThe fields specified by the
fields
array are output with the types specified bytypes
. For fields not specified intypes
, thedefault_type
attribute value is used.fields
not specified,types
not specifiedThe first record determines the fields and types (by examination).
INTERFACE
new
$s = Data::Record::Serialize->new( <attributes> );
Construct a new object. attributes may either be a hashref or a list of key-value pairs.
The available attributes are:
encode
-
Required. The encoding format. Specific encoders may provide additional, or require specific, attributes. See "Encoders" for more information.
sink
-
Where the encoded data will be sent. Specific sinks may provide additional, or require specific attributes. See "Sinks" for more information.
The default output sink is
stream
, unless the encoder is also a sink.It is an error to specify a sink if the encoder already acts as one.
default_type
=type-
If the
types
attribute was specified, this type is assigned to fields given in thefields
attributes which were not specified via thetypes
attribute. types
-
A hash or array mapping input field names to types (
N
,I
,S
). If an array, the fields will be output in the specified order, provided the encoder permits it (see below, however). For example,# use order if possible types => [ c => 'N', a => 'N', b => 'N' ] # order doesn't matter types => { c => 'N', a => 'N', b => 'N' }
If
fields
is specified, then its order will override that specified here. If no type is specified for elements infields
, they will default to having the type specified by thedefault_type
attribute. For example,types => [ c => 'N', a => 'N' ], fields => [ qw( a b c ) ], default_type => 'I',
will result in fields being output in the order
a b c
with types
a => 'N', b => 'I', c => 'N',
fields
-
An array containing the input names of the fields to be output. The fields will be output in the specified order, provided the encoder permits it.
If this attribute is not specified, the fields specified by the
types
attribute will be output. If that is not specified, the fields as found in the first data record will be output.If a field name is specifed in
fields
but no type is defined intypes
, it defaults to what is specified viadefault_type
. rename_fields
-
A hash mapping input to output field names. By default the input field names are used unaltered.
format_fields
-
A hash mapping the input field names to a
sprintf
style format. This will be applied prior to encoding the record, but only if theformat
attribute is also set. Formats specified here override those specified informat_types
. format_types
-
A hash mapping a field type (
N
,I
,S
) to asprintf
style format. This will be applied prior to encoding the record, but only if theformat
attribute is also set. Formats specified here may be overriden for specific fields using theformat_fields
attribute. format
-
If true, format the output fields using the formats specified in the
format_fields
and/orformat_types
options. The default is false.
send
$s->send( \%record );
Encode and send the record to the associated sink.
WARNING: the passed hash is modified. If you need the original contents, pass in a copy.
output_fields
$array_ref = $s->fields;
The names of the transformed output fields, in order of output (not obeyed by all encoders);
output_types
$hash_ref = $s->output_types;
The mapping between output field name and output field type. If the encoder has specified a type map, the output types are the result of that mapping.
numeric_fields
$array_ref = $s->numeric_fields;
The input field names for those fields deemed to be numeric.
EXAMPLES
Generate a JSON stream to the standard output stream
$s = Data::Record::Serialize->new( encode => 'json' );
Only output select fields
$s = Data::Record::Serialize->new(
encode => 'json',
fields => [ qw( obsid chip_id phi theta ) ],
);
Format numeric fields
$s = Data::Record::Serialize->new(
encode => 'json',
fields => [ qw( obsid chip_id phi theta ) ],
format => 1,
format_types => { N => '%0.4f' },
);
Override formats for specific fields
$s = Data::Record::Serialize->new(
encode => 'json',
fields => [ qw( obsid chip_id phi theta ) ],
format_types => { N => '%0.4f' },
format_fields => { obsid => '%05d' },
);
Rename fields
$s = Data::Record::Serialize->new(
encode => 'json',
fields => [ qw( obsid chip_id phi theta ) ],
format_types => { N => '%0.4f' },
format_fields => { obsid => '%05d' },
rename_fields => { chip_id => 'CHIP' },
);
Specify field types
$s = Data::Record::Serialize->new(
encode => 'json',
fields => [ qw( obsid chip_id phi theta ) ],
format_types => { N => '%0.4f' },
format_fields => { obsid => '%05d' },
rename_fields => { chip_id => 'CHIP' },
types => { obsid => 'N', chip_id => 'S', phi => 'N', theta => 'N' }'
);
Switch to an SQLite database in $dbname
$s = Data::Record::Serialize->new(
encode => 'dbi',
dsn => [ 'SQLite', [ dbname => $dbname ] ],
table => 'stuff',
fields => [ qw( obsid chip_id phi theta ) ],
format_types => { N => '%0.4f' },
format_fields => { obsid => '%05d' },
rename_fields => { chip_id => 'CHIP' },
types => { obsid => 'N', chip_id => 'S', phi => 'N', theta => 'N' }'
);
BUGS AND LIMITATIONS
No bugs have been reported.
Please report any bugs or feature requests to bug-data-record-serialize@rt.cpan.org
, or through the web interface at http://rt.cpan.org/Public/Dist/Display.html?Name=Data-Record-Serialize.
SEE ALSO
Other modules:
LICENSE AND COPYRIGHT
Copyright (c) 2014 The Smithsonian Astrophysical Observatory
Data::Record::Serialize is free software: you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation, either version 3 of the License, or (at your option) any later version.
This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.
You should have received a copy of the GNU General Public License along with this program. If not, see <http://www.gnu.org/licenses/>.
AUTHOR
Diab Jerius <djerius@cpan.org>