NAME
Bencher::Scenario::DataCleansing::Object - Benchmark data cleansing (unblessing object)
VERSION
This document describes version 0.002 of Bencher::Scenario::DataCleansing::Object (from Perl distribution Bencher-Scenarios-DataCleansing), released on 2017-01-15.
SYNOPSIS
To run benchmark with default option:
% bencher -m DataCleansing::Object
To run module startup overhead benchmark:
% bencher --module-startup -m DataCleansing::Object
For more options (dump scenario, list/include/exclude/add participants, list/include/exclude/add datasets, etc), see bencher or run bencher --help
.
DESCRIPTION
Packaging a benchmark script as a Bencher scenario makes it convenient to include/exclude/add participants/datasets (either via CLI or Perl code), send the result to a central repository, among others . See Bencher and bencher (CLI) for more details.
BENCHMARKED MODULES
Version numbers shown below are the versions used when running the sample benchmark.
Acme::Damn 0.08
Data::Clean 0.48
Data::Rmap 0.64
JSON::PP 2.27300
Scalar::Util 1.46
BENCHMARK PARTICIPANTS
Data::Clean-inplace (perl_code) [inplace]
Code template:
state $cl = Data::Clean->new(-obj => ["unbless"]); $cl->clean_in_place(<data>)
Data::Clean-clone (perl_code)
Code template:
state $cl = Data::Clean->new(-obj => ["unbless"]); $cl->clone_and_clean(<data>)
JSON::PP (perl_code)
Code template:
state $json = JSON::PP->new->allow_blessed(1)->convert_blessed(1); $json->decode($json->encode(<data>))
Data::Rmap (perl_code) [inplace]
Code template:
my $data = <data>; Data::Rmap::rmap_ref(sub { Acme::Damn::damn($_) if Scalar::Util::blessed($_) }, $data); $data
BENCHMARK DATASETS
ary100-u1-obj
A 100-element array containing 1 "unclean" data: object
ary100-u100-obj
A 100-element array containing 100 "unclean" data: object
ary10k-u1-obj
A 10k-element array containing 1 "unclean" data: object
ary10k-u10k-obj
A 10k-element array containing 10k "unclean" data: object
SAMPLE BENCHMARK RESULTS
Run on: perl: v5.24.0, CPU: Intel(R) Core(TM) i7-4770 CPU @ 3.40GHz (4 cores), OS: GNU/Linux Debian version 8.5, OS kernel: Linux version 3.16.0-4-amd64.
Benchmark with default options (bencher -m DataCleansing::Object
):
#table1#
+---------------------+-----------------+-----------+-----------+------------+---------+---------+
| participant | dataset | rate (/s) | time (ms) | vs_slowest | errors | samples |
+---------------------+-----------------+-----------+-----------+------------+---------+---------+
| JSON::PP | ary10k-u1-obj | 295 | 3.39 | 1 | 2.7e-06 | 20 |
| Data::Rmap | ary10k-u10k-obj | 418 | 2.39 | 1.42 | 1.3e-06 | 20 |
| JSON::PP | ary10k-u10k-obj | 418 | 2.39 | 1.42 | 8.5e-07 | 20 |
| Data::Rmap | ary10k-u1-obj | 842 | 1.19 | 2.85 | 1.2e-06 | 20 |
| Data::Clean-clone | ary10k-u10k-obj | 1420 | 0.705 | 4.81 | 2.4e-07 | 25 |
| Data::Clean-inplace | ary10k-u10k-obj | 1726 | 0.5795 | 5.849 | 5.1e-08 | 22 |
| JSON::PP | ary100-u1-obj | 3150 | 0.318 | 10.7 | 2.2e-07 | 30 |
| JSON::PP | ary100-u100-obj | 4046.715 | 0.247114 | 13.71674 | 5.5e-12 | 21 |
| Data::Rmap | ary100-u100-obj | 4220 | 0.237 | 14.3 | 2.1e-07 | 21 |
| Data::Clean-clone | ary10k-u1-obj | 6350 | 0.158 | 21.5 | 4.9e-08 | 24 |
| Data::Clean-inplace | ary10k-u1-obj | 7510 | 0.133 | 25.5 | 5.3e-08 | 20 |
| Data::Rmap | ary100-u1-obj | 8200 | 0.12 | 28 | 2.1e-07 | 20 |
| Data::Clean-clone | ary100-u100-obj | 12600 | 0.0792 | 42.8 | 7.6e-08 | 22 |
| Data::Clean-inplace | ary100-u100-obj | 18000 | 0.056 | 60 | 1e-07 | 21 |
| Data::Clean-clone | ary100-u1-obj | 49500 | 0.0202 | 168 | 6.7e-09 | 20 |
| Data::Clean-inplace | ary100-u1-obj | 71000 | 0.014 | 240 | 2.6e-08 | 21 |
+---------------------+-----------------+-----------+-----------+------------+---------+---------+
Benchmark module startup overhead (bencher -m DataCleansing::Object --module-startup
):
#table2#
+---------------------+------------------------------+--------------------+----------------+-----------+------------------------+------------+---------+---------+
| participant | proc_private_dirty_size (MB) | proc_rss_size (MB) | proc_size (MB) | time (ms) | mod_overhead_time (ms) | vs_slowest | errors | samples |
+---------------------+------------------------------+--------------------+----------------+-----------+------------------------+------------+---------+---------+
| JSON::PP | 3.1 | 6.7 | 26 | 16 | 13.5 | 1 | 5.7e-05 | 20 |
| Data::Rmap | 1.4 | 4.9 | 23 | 7.7 | 5.2 | 2.1 | 2.9e-05 | 20 |
| Data::Clean | 1.1 | 4.4 | 20 | 5.2 | 2.7 | 3 | 2.7e-05 | 20 |
| perl -e1 (baseline) | 1.1 | 4.5 | 20 | 2.5 | 0 | 6.3 | 8.1e-06 | 20 |
+---------------------+------------------------------+--------------------+----------------+-----------+------------------------+------------+---------+---------+
To display as an interactive HTML table on a browser, you can add option --format html+datatables
.
HOMEPAGE
Please visit the project's homepage at https://metacpan.org/release/Bencher-Scenarios-DataCleansing.
SOURCE
Source repository is at https://github.com/perlancar/perl-Bencher-Scenarios-DataCleansing.
BUGS
Please report any bugs or feature requests on the bugtracker website https://rt.cpan.org/Public/Dist/Display.html?Name=Bencher-Scenarios-DataCleansing
When submitting a bug or request, please include a test-file or a patch to an existing test-file that illustrates the bug or desired feature.
SEE ALSO
AUTHOR
perlancar <perlancar@cpan.org>
COPYRIGHT AND LICENSE
This software is copyright (c) 2017 by perlancar@cpan.org.
This is free software; you can redistribute it and/or modify it under the same terms as the Perl 5 programming language system itself.