NAME

Bencher::Scenario::DataCleansing::Object - Benchmark data cleansing (unblessing object)

VERSION

This document describes version 0.004 of Bencher::Scenario::DataCleansing::Object (from Perl distribution Bencher-Scenarios-DataCleansing), released on 2017-08-25.

SYNOPSIS

To run benchmark with default option:

% bencher -m DataCleansing::Object

To run module startup overhead benchmark:

% bencher --module-startup -m DataCleansing::Object

For more options (dump scenario, list/include/exclude/add participants, list/include/exclude/add datasets, etc), see bencher or run bencher --help.

DESCRIPTION

Packaging a benchmark script as a Bencher scenario makes it convenient to include/exclude/add participants/datasets (either via CLI or Perl code), send the result to a central repository, among others . See Bencher and bencher (CLI) for more details.

BENCHMARKED MODULES

Version numbers shown below are the versions used when running the sample benchmark.

Acme::Damn 0.08

Data::Clean 0.49

Data::Rmap 0.64

JSON::PP 2.27300

Scalar::Util 1.45

BENCHMARK PARTICIPANTS

  • Data::Clean-inplace (perl_code) [inplace]

    Code template:

    state $cl = Data::Clean->new(-obj => ["unbless"]); $cl->clean_in_place(<data>)
  • Data::Clean-clone (perl_code)

    Code template:

    state $cl = Data::Clean->new(-obj => ["unbless"]); $cl->clone_and_clean(<data>)
  • JSON::PP (perl_code)

    Code template:

    state $json = JSON::PP->new->allow_blessed(1)->convert_blessed(1); $json->decode($json->encode(<data>))
  • Data::Rmap (perl_code) [inplace]

    Code template:

    my $data = <data>; Data::Rmap::rmap_ref(sub { Acme::Damn::damn($_) if Scalar::Util::blessed($_) }, $data); $data

BENCHMARK DATASETS

  • ary100-u1-obj

    A 100-element array containing 1 "unclean" data: object

  • ary100-u100-obj

    A 100-element array containing 100 "unclean" data: object

  • ary10k-u1-obj

    A 10k-element array containing 1 "unclean" data: object

  • ary10k-u10k-obj

    A 10k-element array containing 10k "unclean" data: object

SAMPLE BENCHMARK RESULTS

Run on: perl: v5.24.0, CPU: Intel(R) Core(TM) M-5Y71 CPU @ 1.20GHz (2 cores), OS: GNU/Linux LinuxMint version 17.3, OS kernel: Linux version 3.19.0-32-generic.

Benchmark with default options (bencher -m DataCleansing::Object):

#table1#
+---------------------+-----------------+---------+-----------+-----------+------------+---------+---------+
| participant         | dataset         | p_tags  | rate (/s) | time (ms) | vs_slowest |  errors | samples |
+---------------------+-----------------+---------+-----------+-----------+------------+---------+---------+
| JSON::PP            | ary10k-u1-obj   |         |     165   | 6.06      |      1     | 5.4e-06 |      21 |
| Data::Rmap          | ary10k-u10k-obj | inplace |     219   | 4.57      |      1.33  | 1.8e-06 |      20 |
| JSON::PP            | ary10k-u10k-obj |         |     240   | 4.2       |      1.4   |   6e-06 |      20 |
| Data::Rmap          | ary10k-u1-obj   | inplace |     441   | 2.27      |      2.67  | 1.5e-06 |      20 |
| Data::Clean-clone   | ary10k-u10k-obj |         |     752   | 1.33      |      4.56  | 4.8e-07 |      20 |
| Data::Clean-inplace | ary10k-u10k-obj | inplace |     990   | 1         |      6     | 1.3e-06 |      22 |
| JSON::PP            | ary100-u1-obj   |         |    1560   | 0.64      |      9.47  | 6.2e-07 |      21 |
| JSON::PP            | ary100-u100-obj |         |    2020   | 0.494     |     12.3   | 1.9e-07 |      24 |
| Data::Rmap          | ary100-u100-obj | inplace |    2200   | 0.46      |     13     | 8.5e-07 |      20 |
| Data::Clean-clone   | ary10k-u1-obj   |         |    3100   | 0.32      |     19     |   6e-07 |      23 |
| Data::Clean-inplace | ary10k-u1-obj   | inplace |    4000   | 0.3       |     20     | 4.3e-06 |      37 |
| Data::Rmap          | ary100-u1-obj   | inplace |    4300   | 0.23      |     26     | 1.4e-06 |      20 |
| Data::Clean-clone   | ary100-u100-obj |         |    7100   | 0.14      |     43     | 2.1e-07 |      20 |
| Data::Clean-inplace | ary100-u100-obj | inplace |   10600   | 0.0939    |     64.5   | 2.6e-08 |      21 |
| Data::Clean-clone   | ary100-u1-obj   |         |   27792.1 | 0.0359814 |    168.432 | 1.1e-11 |      20 |
| Data::Clean-inplace | ary100-u1-obj   | inplace |   38000   | 0.026     |    230     |   1e-07 |      20 |
+---------------------+-----------------+---------+-----------+-----------+------------+---------+---------+

Benchmark module startup overhead (bencher -m DataCleansing::Object --module-startup):

#table2#
+---------------------+------------------------------+--------------------+----------------+-----------+------------------------+------------+---------+---------+
| participant         | proc_private_dirty_size (MB) | proc_rss_size (MB) | proc_size (MB) | time (ms) | mod_overhead_time (ms) | vs_slowest |  errors | samples |
+---------------------+------------------------------+--------------------+----------------+-----------+------------------------+------------+---------+---------+
| JSON::PP            | 3.12                         | 6.68               | 22.3           |      26.7 |                   20.6 |        1   | 1.2e-05 |      20 |
| Data::Rmap          | 1.4                          | 4.9                | 19             |      15   |                    8.9 |        1.8 |   3e-05 |      20 |
| Data::Clean         | 1.1                          | 4.5                | 16             |      11   |                    4.9 |        2.4 |   3e-05 |      20 |
| perl -e1 (baseline) | 1.1                          | 4.6                | 16             |       6.1 |                    0   |        4.4 | 1.7e-05 |      21 |
+---------------------+------------------------------+--------------------+----------------+-----------+------------------------+------------+---------+---------+

To display as an interactive HTML table on a browser, you can add option --format html+datatables.

HOMEPAGE

Please visit the project's homepage at https://metacpan.org/release/Bencher-Scenarios-DataCleansing.

SOURCE

Source repository is at https://github.com/perlancar/perl-Bencher-Scenarios-DataCleansing.

BUGS

Please report any bugs or feature requests on the bugtracker website https://rt.cpan.org/Public/Dist/Display.html?Name=Bencher-Scenarios-DataCleansing

When submitting a bug or request, please include a test-file or a patch to an existing test-file that illustrates the bug or desired feature.

SEE ALSO

AUTHOR

perlancar <perlancar@cpan.org>

COPYRIGHT AND LICENSE

This software is copyright (c) 2017, 2016 by perlancar@cpan.org.

This is free software; you can redistribute it and/or modify it under the same terms as the Perl 5 programming language system itself.