NAME
Test::BinaryData - compare two things, give hex dumps if they differ
VERSION
version 0.004
$Id: /my/cs/projects/Test-BinaryData/trunk/lib/Test/BinaryData.pm 32188 2007-07-27T22:08:19.378295Z rjbs $
SYNOPSIS
use Test::BinaryData;
my $computed_data = do_something_complicated;
my $expected_data = read_file('correct.data');
is_binary(
$computed_data,
$expected_data,
"basic data computation",
);
DESCRIPTION
Sometimes using Test::More's is
test isn't good enough. Its diagnostics may make it easy to miss differences between strings.
For example, given two strings which differ only in their line endings, you can end up with diagnostic output like this:
not ok 1
# Failed test in demo.t at line 8.
# got: 'foo
# bar
# '
# expected: 'foo
# bar
# '
That's not very helpful, except to tell you that the alphanumeric characters seem to be in the right place. By using is_binary
instead of is
, this output would be generated instead:
not ok 2
# Failed test in demo.t at line 10.
# got (hex) got expect (hex) expect
# 666f6f0a6261720a---- foo.bar. ! 666f6f0d0a6261720d0a foo..bar..
The "!" tells us that the lines differ, and we can quickly scan the bytes that make up the line to see which differ.
When comparing very long strings, we can stop after we've seen a few differences. Here, we'll just look for two:
# got (hex) got expect (hex) expect
# 416c6c20435220616e64 All CR and = 416c6c20435220616e64 All CR and
# 206e6f204c46206d616b no LF mak = 206e6f204c46206d616b no LF mak
# 6573204d616320612064 es Mac a d = 6573204d616320612064 es Mac a d
# 756c6c20626f792e0d41 ull boy..A = 756c6c20626f792e0d41 ull boy..A
# 6c6c20435220616e6420 ll CR and = 6c6c20435220616e6420 ll CR and
# 6e6f204c46206d616b65 no LF make = 6e6f204c46206d616b65 no LF make
# 73204d61632061206475 s Mac a du = 73204d61632061206475 s Mac a du
# 6c6c20626f792e0d416c ll boy..Al ! 6c6c20626f792e0a416c ll boy..Al
# 6c20435220616e64206e l CR and n = 6c20435220616e64206e l CR and n
# 6f204c46206d616b6573 o LF makes = 6f204c46206d616b6573 o LF makes
# 204d616320612064756c Mac a dul = 204d616320612064756c Mac a dul
# 6c20626f792e0d416c6c l boy..All ! 6c20626f792e0a416c6c l boy..All
# 20435220616e64206e6f CR and no = 20435220616e64206e6f CR and no
# ...
is_binary
is_binary($got, $expected, $comment, \%arg);
This test behaves like Test::More's is
test, but if the given data are not string equal, the diagnostics emits four columns, describing the strings in parallel, showing a simplified ASCII representation and a hexadecimal dump.
Between the got and expected data for each line, a "=" or "!" indicates whether the chunks are identical or different.
The $comment
and %arg
arguments are optional. Valid arguments are:
columns - the number of screen columns available
if the COLUMNS environment variable is an positive integer, then
COLUMNS - is used; otherwise, the default is 79
max_diffs - if given, this is the maximum number of differing lines that will
be compared; if output would have been given beyond this line,
it will be replaced with an elipsis ("...")
TODO
optional position markers
got (hex) got expect (hex) expect 00 46726f6d206d6169 From mai = 46726f6d206d6169 From mai 08 3130353239406c6f 10529@lo = 3130353239406c6f 10529@lo 16 63616c686f737420 calhost = 63616c686f737420 calhost 24 5765642044656320 Wed Dec = 5765642044656320 Wed Dec 32 31382031323a3037 18 12:07 = 31382031323a3037 18 12:07 40 3a35352032303032 :55 2002 = 3a35352032303032 :55 2002 48 0a52656365697665 .Receive ! 0d0a526563656976 ..Receiv
investigate probably bugs with wide chars, multibyte strings
I wrote this primarily for detecting CRLF problems. It would probably be useful for wonky character encodings, but I know very little of them. Patches and tests welcome.
AUTHOR
Ricardo SIGNES, <rjbs at cpan.org>
COPYRIGHT & LICENSE
Copyright 2007, Ricardo SIGNES.
This program is free software; you can redistribute it and/or modify it under the same terms as Perl itself.