NAME
mysql-copy-rows-adjust-pk - Copy rows from one table to another, adjust PK column if necessary
VERSION
This document describes version 0.013 of mysql-copy-rows-adjust-pk (from Perl distribution App-MysqlUtils), released on 2018-12-05.
SYNOPSIS
Usage:
% mysql-copy-rows-adjust-pk [options] <database>
DESCRIPTION
This utility can be used when you have rows in one table that you want to insert to another table, but the PK might clash. When that happens, the value of the other columns are inspected. When all the values of the other columns match, the row is assumed to be a duplicate and skipped. If some values of the other column differ, then the row is assumed to be different and a new value of the PK column is chosen (there are several choices on how to select the new PK).
An example:
% mysql-copy-rows-adjust-pk db1 --from t1 --to t2 --pk-column id --adjust "add 1000"
Suppose these are the rows in table t1
:
id date description user
-- ---- ----------- ----
1 2018-12-03 12:01:01 Created user u1 admin1
2 2018-12-03 12:44:33 Removed user u1 admin1
And here are the rows in table t2
:
id date description user
-- ---- ----------- ----
1 2018-12-03 12:01:01 Created user u1 admin1
2 2018-12-03 13:00:45 Rebooted machine1 admin1
3 2018-12-03 13:05:00 Created user u2 admin2
You can see that row id=1 in both tables are identical. This will be skipped. On the other hand, row id=2 in t1
is different with row id=2 in t2
. This row will be adjusted: id
will be changed to 2+1000=1002. So the final rows in table t2
will be (sorted by date):
id date description user
-- ---- ----------- ----
1 2018-12-03 12:01:01 Created user u1 admin1
1002 2018-12-03 12:44:33 Removed user u1 admin1
2 2018-12-03 13:00:45 Rebooted machine1 admin1
3 2018-12-03 13:05:00 Created user u2 admin2
So basically this utility is similar to MySQL's INSERT ... ON DUPLICATE KEY UPDATE, but will avoid inserting identical rows.
If the adjusted PK column clashes with another row in the target table, the row is skipped.
OPTIONS
*
marks required options.
Main options
- --adjust=s*
-
How to adjust the value of the PK column.
Currently the choices are:
* "add N" add N to the original value. * "subtract N" subtract N from the original value.
- --database=s*
- --from=s*
-
Name of source table.
- --pk-column=s*
-
Name of PK column.
- --to=s*
-
Name of target table.
Configuration options
- --config-path=filename, -c
-
Set path to configuration file.
- --config-profile=s, -P
-
Set configuration profile to use.
- --no-config, -C
-
Do not use any configuration file.
Connection options
- --host=s
-
Default value:
"localhost"
- --password=s
-
Will try to get default from `~/.my.cnf`.
- --port=s
-
Default value:
3306
- --username=s
-
Will try to get default from `~/.my.cnf`.
Environment options
Logging options
- --debug
-
Shortcut for --log-level=debug.
- --log-level=s
-
Set log level.
- --quiet
-
Shortcut for --log-level=error.
- --trace
-
Shortcut for --log-level=trace.
- --verbose
-
Shortcut for --log-level=info.
Output options
- --format=s
-
Choose output format, e.g. json, text.
Default value:
undef
- --json
-
Set output format to json.
- --naked-res
-
When outputing as JSON, strip result envelope.
Default value:
0
By default, when outputing as JSON, the full enveloped result is returned, e.g.:
[200,"OK",[1,2,3],{"func.extra"=>4}]
The reason is so you can get the status (1st element), status message (2nd element) as well as result metadata/extra result (4th element) instead of just the result (3rd element). However, sometimes you want just the result, e.g. when you want to pipe the result for more post-processing. In this case you can use `--naked-res` so you just get:
[1,2,3]
Other options
- --dry-run
-
Run in simulation mode (also via DRY_RUN=1).
- --help, -h, -?
-
Display help message and exit.
- --version, -v
-
Display program's version and exit.
COMPLETION
This script has shell tab completion capability with support for several shells.
bash
To activate bash completion for this script, put:
complete -C mysql-copy-rows-adjust-pk mysql-copy-rows-adjust-pk
in your bash startup (e.g. ~/.bashrc). Your next shell session will then recognize tab completion for the command. Or, you can also directly execute the line above in your shell to activate immediately.
It is recommended, however, that you install modules using cpanm-shcompgen which can activate shell completion for scripts immediately.
tcsh
To activate tcsh completion for this script, put:
complete mysql-copy-rows-adjust-pk 'p/*/`mysql-copy-rows-adjust-pk`/'
in your tcsh startup (e.g. ~/.tcshrc). Your next shell session will then recognize tab completion for the command. Or, you can also directly execute the line above in your shell to activate immediately.
It is also recommended to install shcompgen (see above).
other shells
For fish and zsh, install shcompgen as described above.
CONFIGURATION FILE
This script can read configuration files. Configuration files are in the format of IOD, which is basically INI with some extra features.
By default, these names are searched for configuration filenames (can be changed using --config-path
): ~/.config/mysqlutils.conf, ~/mysqlutils.conf, or /etc/mysqlutils.conf.
All found files will be read and merged.
To disable searching for configuration files, pass --no-config
.
You can put multiple profiles in a single file by using section names like [profile=SOMENAME]
or [SOMESECTION profile=SOMENAME]
. Those sections will only be read if you specify the matching --config-profile SOMENAME
.
You can also put configuration for multiple programs inside a single file, and use filter program=NAME
in section names, e.g. [program=NAME ...]
or [SOMESECTION program=NAME]
. The section will then only be used when the reading program matches.
Finally, you can filter a section by environment variable using the filter env=CONDITION
in section names. For example if you only want a section to be read if a certain environment variable is true: [env=SOMEVAR ...]
or [SOMESECTION env=SOMEVAR ...]
. If you only want a section to be read when the value of an environment variable has value equals something: [env=HOSTNAME=blink ...]
or [SOMESECTION env=HOSTNAME=blink ...]
. If you only want a section to be read when the value of an environment variable does not equal something: [env=HOSTNAME!=blink ...]
or [SOMESECTION env=HOSTNAME!=blink ...]
. If you only want a section to be read when an environment variable contains something: [env=HOSTNAME*=server ...]
or [SOMESECTION env=HOSTNAME*=server ...]
. Note that currently due to simplistic parsing, there must not be any whitespace in the value being compared because it marks the beginning of a new section filter or section name.
List of available configuration parameters:
adjust (see --adjust)
database (see --database)
format (see --format)
from (see --from)
host (see --host)
log_level (see --log-level)
naked_res (see --naked-res)
password (see --password)
pk_column (see --pk-column)
port (see --port)
to (see --to)
username (see --username)
ENVIRONMENT
MYSQL_COPY_ROWS_ADJUST_PK_OPT => str
Specify additional command-line options.
FILES
~/.config/mysqlutils.conf
~/mysqlutils.conf
/etc/mysqlutils.conf
HOMEPAGE
Please visit the project's homepage at https://metacpan.org/release/App-MysqlUtils.
SOURCE
Source repository is at https://github.com/perlancar/perl-App-MysqlUtils.
BUGS
Please report any bugs or feature requests on the bugtracker website https://rt.cpan.org/Public/Dist/Display.html?Name=App-MysqlUtils
When submitting a bug or request, please include a test-file or a patch to an existing test-file that illustrates the bug or desired feature.
AUTHOR
perlancar <perlancar@cpan.org>
COPYRIGHT AND LICENSE
This software is copyright (c) 2018, 2017, 2016 by perlancar@cpan.org.
This is free software; you can redistribute it and/or modify it under the same terms as the Perl 5 programming language system itself.