NAME
Org::ReadTables - Import Org Mode tables into arrays, or directly into
database tables
SYNOPSIS
use Org::ReadTables;
my $op = Org::ReadTables->new( dbh => $dbh,
table => 'example',
tables => ['a_table']
);
# or:
# When called without a 'dbh' argument, saves values
# which can be retrieved via the 'saved' method.
#
my $op = Org::ReadTables->new( cb => \&row_callback,
cb_table => \&table_callback,
);
# then:
$op->parse( $mojo_file->slurp );
DESCRIPTION
Org::ReadTables loads data from one or more Emacs Org Mode tables in an
org file into a DBI style database which supports the
SQL::Abstract/insert method. The underlying DBD must also support the
returning option for insertion.
For example, given the following .org file:
#+NAME: LCCN_Serial
| LCCN | Publication | City | Start_Date | End_Date |
|------------+-----------------------+---------+------------+------------|
| sn92024097 | Adahooniłigii | Phoenix | | |
| sn87062098 | Arizona Daily Citizen | Tucson | | |
| sn84020558 | Arizona Republican | Phoenix | | 1930-11-10 |
| sn83045137 | Arizona Republic | Phoenix | 1930-11-11 | |
and a database containing a table called lccn_serial, the parse method
would insert four rows into it, in the fields whose names are given in
the column headings.
The NAME: org attribute specifies the table name; a default value may
be passed in the table parameter to the new method. Additionally, a
tables array may be passed by reference against which the names of such
tables will be validated; table names not listed will have their org
tables skipped.
Table names may also be specified in a Name property (not
case-sensitive) in an Orgmode Drawer preceding the table. For example:
:PROPERTIES:
:Name: Locos
:END:
| Wheel Arrangement | Locomotive Type |
|-------------------+-----------------|
| oo-oo> | American |
| ooo-oo> | Mogul |
Additionally, with the Drawer format, fixed column values may
optionally be specified:
:PROPERTIES:
:Name: Locos
:Country: .us
:END:
| Wheel Arrangement | Locomotive Type |
|-------------------+-----------------|
| oo-oo> | American |
| ooo-oo> | Mogul |
which would have the effect of adding a 'country' column to the right
of each record, all having the value '.us'.
Pivot Tables
NOTE: This is a future feature, not yet fully implemented.
When it is desirable to enter data two-dimensionally, a construct like
this may be used:
:PROPERTIES:
:Name: sizes
:Data: size_desc
| class> | A | B | C |
| size_code | | | |
|------------+-------+-------+-----|
| 1 | 1-2 | 22-26 | |
| 2 | 3-4 | 26-30 | XS |
| 3 | 5-6 | 30-34 | S |
where the `Data` property determines which field (column) is assigned
the pivoted value. The above table would generate eight data records
for the `sizes` table:
size_code='1', class='A', size_desc='1-2'
size_code='2', class='A', size_desc='3-4'
size_code='3', class='A', size_desc='5-6'
size_code='1', class='B', size_desc='22-26'
size_code='2', class='B', size_desc='26-30'
size_code='3', class='B', size_desc='30-34'
size_code='2', class='C', size_desc='XS'
size_code='3', class='C', size_desc='S'
Note that no record is created for class 'C' with size_code '1' as that
entry in the pivot table is blank.
ATTRIBUTES
Org::ReadTables implements the following attributes.
inserted
Returns a hashref, each element's key being the name of a table into
which rows were inserted, and its value being the number of rows
inserted into that table.
errors
Returns a reference to an array, each entry in which itself be an array
whose values are:
a hash (the column names and values to be inserted), and
the resulting error report from that insertion
METHODS
new
Creates a new Org::ReadTables object. Parameters include:
dbh
should be an open database handle from, e.g., DBD::SQLite, Mojo::SQLite
or Mojo::Pg. Each row to be saved will invoke the 'insert' method of
this handle (or, more generally, class instance); no other methods will
be called, so any object that has provides 'insert' may be used. For
each found record, the insert() method of this object will be called.
Note that no protection is given here against invalid column names or
other database errors.
cb
Reference to a callback function to be called for each found record, in
tables which are processed. Parameters passed are the name of the
table, and a reference to a hash of the record's column-names and
values. The function should return the count of records successfully
saved (either 0 or 1, usually).
cb_table
Reference to a callback function to be called at the start of
processing of a new table, as they are found in the orgfile. The
callback will be passed one argument, a hash with keys:
* name: A string with the name of the table
* nameref: A reference to the table-name string. This may be changed
by the callback.
* columns: A reference to the array of the names of the columns in
the table. The contents of the referred array may be maniuplated to
match the actual database field names, for example.
* fixed: A reference to a hash of fixed column key/values. This may
also be changed by the callback.
* caption: The caption, if any, attached above the table itself
* data_column: The name of the data column in a pivot table
...and should return a true value if the table is to be processed or
saved, or a false or 'undef' value to skip the table (the 'cb' callback
will not be called for rows in such tables).
table
(optional) the default table name, which will be used for all unnamed
tables. Use an Orgmode property NAME before each table to name it, as:
#+NAME: PostalAbbrev
| Code | State |
|------+---------|
| AZ | Arizona |
| FL | Florida |
| KS | Kansas |
tables
(optional) a reference to a list of valid table names to process
(others will be ignored). If tables is not given, table should be
present; otherwise for input not containing an Orgmode NAME property,
no processing will occur.
parse
$op->parse($text, [$default_table]);
Parses the given text which should be in Org Mode format. It is the
caller's responsibility to slurp a file or other data source. The
optional second parameter will be used as the default name of any table
not having an Orgmode NAME property, overriding any table value
provided via the new method.
saved
$op->saved();
$op->saved->($selected_table);
Returns a hash of the tables read; each key is a table name, the value
being an array of hashes of the rows. If a table name is passed,
returns the array of hashes only for that particular table, or undef if
no such table existed in any input.
inserted
$op->inserted();
$op->inserted->($selected_table);
With no parameter, returns a hash of the tables processed and a count
of rows found (and presumably inserted) in each. With a parameter,
returns the count of rows for that table, or undef if no such table was
processed.
BUGS
Report any issues to the author.
AUTHOR
William Lindley, wlindley@cpan.org
COPYRIGHT AND LICENSE
Copyright 2025, William Lindley.
This library is free software; you may redistribute it and/or modify it
under the terms of the Artistic License version 2.0.
SEE ALSO
Org::Parser, https://orgmode.org/