Sponsoring The Perl Toolchain Summit 2025: Help make this important event another success Learn more

NAME
SQL::Schema::Versioned - Routine and convention to create/update your
application's DB schema
VERSION
This document describes version 0.239 of SQL::Schema::Versioned (from
Perl distribution SQL-Schema-Versioned), released on 2021-08-08.
DESCRIPTION
To use this module, you typically run the "create_or_update_db_schema"()
routine at the start of your program/script, e.g.:
use DBI;
use SQL::Schema::Versioned qw(create_or_update_db_schema);
my $spec = { # the schema specification
latest_v => 3,
install => [
"CREATE TABLE t1 (i INT)",
"CREATE TABLE t4 (i INT)",
],
upgrade_to_v1 => [
"CREATE TABLE t1 (i INT)",
"CREATE TABLE t2 (i INT)",
"CREATE TABLE t3 (i INT)",
],
upgrade_to_v2 => [
"CREATE TABLE t4 (i INT)",
"DROP TABLE t3",
],
upgrade_to_v3 => [
"DROP TABLE t2",
],
install_v2 => [
"CREATE TABLE t1 (i INT)",
"CREATE TABLE t2 (i INT)",
"CREATE TABLE t4 (i INT)",
],
};
my $dbh = DBI->connect(...);
my $res = create_or_update_db_schema(dbh=>$dbh, spec=>$spec);
die "Cannot run the application: cannot create/upgrade database schema: $res->[1]"
unless $res->[0] == 200;
This way, your program automatically creates/updates database schema
when run. Users need not know anything.
See more elaborate examples in some applications that use this module
like App::lcpan or SQLite::Counter::Simple.
BEST PRACTICES
It is recommended that after you create the second and subsequent
version ("upgrade_to_v2", "upgrade_to_v3", and so on) you create and
keep "install_v1" so you can test migration from v1->v2, v2->v3, and so
on.
FUNCTIONS
create_or_update_db_schema
Usage:
create_or_update_db_schema(%args) -> [$status_code, $reason, $payload, \%result_meta]
Routine and convention to create/update your application's DB schema.
With this routine (and some convention) you can easily create and update
database schema for your application in a simple way using pure SQL.
*Version*: version is an integer and starts from 1. Each software
release with schema change will bump the version number by 1. Version
information is stored in a special table called "meta" (SELECT value
FROM meta WHERE name='schema_version').
You supply the SQL statements in "spec". "spec" is a hash which at least
must contain the key "latest_v" (an integer) and "install" (a series of
SQL statements to create the schema from nothing to the latest version).
There should also be zero or more "upgrade_to_v$VERSION" keys, the value
of each is a series of SQL statements to upgrade from ($VERSION-1) to
$VERSION. So there could be "upgrade_to_v2", "upgrade_to_v3", and so on
up the latest version. This is used to upgrade an existing database from
earlier version to the latest.
For testing purposes, you can also add one or more "install_v<VERSION>"
key, where "XXX" is an integer, the lowest version number that you still
want to support. So, for example, if "latest_v" is 5 and you still want
to support from version 2, you can have an "install_v2" key containing a
series of SQL statements to create the schema at version 2, and
"upgrade_to_v3", "upgrade_to_v4", "upgrade_to_v5" keys. This way
migrations from v2 to v3, v3 to v4, and v4 to v5 can be tested.
You can name "install_v1" key as "upgrade_to_v1" (to upgrade from
'nothing' a.k.a. v0 to v1), which is basically the same thing.
This routine will check the existence of the "meta" table and the
current schema version. If "meta" table does not exist yet, the SQL
statements in "install" will be executed. The "meta" table will also be
created and a row "('schema_version', 1)" is added. The
("schema_summary", <SUMMARY>) row will also be added if your spec
specifies a "summary".
If "meta" table already exists, schema version will be read from it and
one or more series of SQL statements from "upgrade_to_v$VERSION" will be
executed to bring the schema to the latest version.
Aside from SQL statement, the series can also contain coderefs for more
complex upgrade process. Each coderef will be called with $dbh as
argument and must not die (so to signify failure, you can die from
inside the coderef).
Currently only tested on MySQL, Postgres, and SQLite. Postgres is
recommended because it can do transactional DDL (a failed upgrade in the
middle will not cause the database schema state to be inconsistent, e.g.
in-between two versions).
Modular schema (components)
This routine supports so-called modular schema, where you can separate
your database schema into several *components* (sets of tables) and then
declare dependencies among them.
For example, say you are writing a stock management application. You
divide your application into several components: "quote" (component that
deals with importing stock quotes and querying stock prices),
"portfolio" (component that deals with computing the market value of
your portfolio, calculating gains/losses), "trade" (component that
connects to your broker API and perform trading by submitting buy/sell
orders).
The "quote" application component manages these tables: "daily_price",
"spot_price". The "portfolio" application component manages these
tables: "account" (list of accounts in stock brokerages), "balance"
(list of balances), "tx" (list of transactions). The "trade" application
component manages these tables: "order" (list of buy/sell orders).
The "portfolio" application component requires price information to be
able to calculate unrealized gains/losses. The "trade" component also
needs information from the "daily_price" e.g. to calculate 52-week
momentum, and writes to the "spot_price" to record intraday prices, and
reads/writes from the "account" and "balance" tables. Here are the
"spec"s for each component:
# spec for the price application component
{
component_name => 'price',
summary => "Price application component",
latest_v => 1,
provides => ['daily_price', 'spot_price'],
install => [...],
...
}
# spec for the portfolio application component
{
component_name => 'portfolio',
summary => "Portfolio application component",
latest_v => 1,
provides => ['account', 'balance', 'tx'],
deps => {
'daily_price' => 1,
'spot_price' => 1,
},
install => [...],
...
}
# spec for the trade application component
{
component_name => 'trade',
summary => "Trade application component",
latest_v => 1,
provides => ['order'],
deps => {
'daily_price' => 1,
'spot_price' => 1,
'account' => 1,
'balance' => 1,
},
install => [...],
...
}
You'll notice that the three keys new here are the "component_name",
"provides", and "deps".
When "component_name" is set, then instead of the "schema_version" key
in the "meta" table, your component will use the
"schema_version.<COMPONENT_NAME>" key. When "component_name" is not set,
it is assumed to be "main" and the "schema_version" key is used in the
"meta" table. The component "summary", if specified, will also be
written to "schema_summary.<COMPONENT_NAME>" key.
"provides" is an array of tables to help this routine know which
table(s) your component create and maintain. If unset, this routine will
try to guess from looking at "CREATE TABLE" SQL statements.
This routine will create "table.<TABLE_NAME>" keys in the "meta" table
to record which components currently maintain which tables. The value of
the key is "<COMPONENT_NAME>:<VERSION>". When a component no longer
maintain a table in the newest version, the corresponding
"table.<TABLE_NAME>" row in the "meta" will also be removed.
"deps" is a hash. The keys are table names that your component requires.
The values are integers, meaning the minimum version of the required
table (= component version). In the future, more complex dependency
relationship and version requirement will be supported.
This function is not exported by default, but exportable.
Arguments ('*' denotes required arguments):
* create_from_version => *int*
Instead of the latest, create from this version.
This can be useful during testing. By default, if given an empty
database, this function will use the "install" key of the spec to
create the schema from nothing to the latest version. However, if
this option is given, function wil use the corresponding
"install_v<VERSION>" key in the spec (which must exist) and then
upgrade using the "upgrade_to_v<VERSION>" keys to upgrade to the
latest version.
* dbh* => *obj*
DBI database handle.
* spec* => *hash*
Schema specification, e.g. SQL statements to create and update the
schema.
Example:
{
latest_v => 3,
# will install version 3 (latest)
install => [
'CREATE TABLE IF NOT EXISTS t1 (...)',
'CREATE TABLE IF NOT EXISTS t2 (...)',
'CREATE TABLE t3 (...)',
],
upgrade_to_v2 => [
'ALTER TABLE t1 ADD COLUMN c5 INT NOT NULL',
sub {
# this subroutine sets the values of c5 for the whole table
my $dbh = shift;
my $sth_sel = $dbh->prepare("SELECT c1 FROM t1");
my $sth_upd = $dbh->prepare("UPDATE t1 SET c5=? WHERE c1=?");
$sth_sel->execute;
while (my ($c1) = $sth_sel->fetchrow_array) {
my $c5 = ...; # calculate c5 value for the row
$sth_upd->execute($c5, $c1);
}
},
'CREATE UNIQUE INDEX i1 ON t2(c1)',
],
upgrade_to_v3 => [
'ALTER TABLE t2 DROP COLUMN c2',
'CREATE TABLE t3 (...)',
],
# provided for testing, so we can test migration from v1->v2, v2->v3
install_v1 => [
'CREATE TABLE IF NOT EXISTS t1 (...)',
'CREATE TABLE IF NOT EXISTS t2 (...)',
],
}
Returns an enveloped result (an array).
First element ($status_code) is an integer containing HTTP-like status
code (200 means OK, 4xx caller error, 5xx function error). Second
element ($reason) is a string containing error message, or something
like "OK" if status is 200. Third element ($payload) is the actual
result, but usually not present when enveloped result is an error
response ($status_code is not 2xx). Fourth element (%result_meta) is
called result metadata and is optional, a hash that contains extra
information, much like how HTTP response headers provide additional
metadata.
Return value: (any)
FAQ
Why use this module instead of other similar solution?
Mainly simplicity. I write simple application which is often
self-contained in a single module/script. This module works with
embedded SQL statements instead of having to put SQL in separate
files/subdirectory.
How do I see each SQL statement as it is being executed?
Try using Log::ger::DBI::Query, e.g.:
% perl -MLog::ger::DBI::Query -MLog::ger::Output=Screen -MLog::ger::Level::trace yourapp.pl ...
HOMEPAGE
Please visit the project's homepage at
SOURCE
Source repository is at
BUGS
Please report any bugs or feature requests on the bugtracker website
When submitting a bug or request, please include a test-file or a patch
to an existing test-file that illustrates the bug or desired feature.
SEE ALSO
Some other database migration tools on CPAN:
* DBIx::Migration
Pretty much similar to this module, with support for downgrades. OO
style, SQL in separate files/subdirectory.
* Database::Migrator
Pretty much similar. OO style, SQL in separate files/subdirectory.
Perl scripts can also be executed for each version upgrade. Meta
table is configurable (default recommended is 'AppliedMigrations').
* sqitch
A more proper database change management tool with dependency
resolution and VCS awareness. No numbering. Command-line script and
Perl library provided. Looks pretty awesome and something which I
hope to use for more complex applications.
AUTHOR
perlancar <perlancar@cpan.org>
CONTRIBUTOR
Steven Haryanto <sharyanto@cpan.org>
COPYRIGHT AND LICENSE
This software is copyright (c) 2021, 2018, 2017, 2015, 2014, 2013 by
perlancar@cpan.org.
This is free software; you can redistribute it and/or modify it under
the same terms as the Perl 5 programming language system itself.