NAME

JSON::Parse - Convert JSON into a Perl variable

SYNOPSIS

use JSON::Parse 'parse_json';
my $json = '["golden", "fleece"]';
my $perl = parse_json ($json);
# Same effect as $perl = ['golden', 'fleece'];

Convert JSON into Perl.

DESCRIPTION

JSON::Parse converts JSON into equivalent Perl. The function "parse_json" takes one argument, a string containing JSON, and returns a Perl reference. The input to parse_json must be a complete JSON structure.

The module differs from the standard JSON module by simplifying the handling of Unicode. If its input is marked as Unicode characters, the strings in its output are also marked as Unicode characters.

JSON::Parse also provides a high speed validation function, "valid_json".

JSON means "JavaScript Object Notation" and it is specified in "RFC 4627".

FUNCTIONS

parse_json

use JSON::Parse 'parse_json';
my $perl = parse_json ('{"x":1, "y":2}');

This function converts JSON into a Perl structure, either an array reference or a hash reference.

If the first argument does not contain a valid JSON text, parse_json throws a fatal error ("dies").

If the argument contains valid JSON, the return value is either a hash or an array reference. If the input JSON text is a serialized object, a hash reference is returned:

my $perl = parse_json ('{"a":1, "b":2}');
print ref $perl, "\n";
# Prints "HASH".

If the input JSON text is a serialized array, an array reference is returned:

my $perl = parse_json ('["a", "b", "c"]');
print ref $perl, "\n";
# Prints "ARRAY".

json_file_to_perl

use JSON::Parse 'json_file_to_perl';
my $p = json_file_to_perl ('filename');

This is exactly the same as "parse_json" except that it works on a file name. The file must be in UTF-8.

valid_json

use JSON::Parse 'valid_json';
if (valid_json ($json)) {
    # do something
}

Valid_json returns 1 if its argument is valid JSON and 0 if not. It also returns 0 if the input is undefined or the empty string.

This is a high-speed validator which runs between three and ten times faster than "parse_json".

If you want the error messages where the JSON is invalid, you may find "validate_json" useful.

validate_json

use JSON::Parse 'validate_json';
eval {
    validate_json ('["xyz"]');
};
if ($@) {
    print "Your JSON was invalid: $@\n";
}

This is the underlying function for "valid_json". It runs at the same speed, but throws an error if the JSON is wrong, rather than returning 1 or 0. This may be useful if you want the exact error message where your input started to fail.

Mapping from JSON to Perl

JSON elements are mapped to Perl as follows:

JSON numbers

JSON numbers become Perl numbers, either integers or double-precision floating point numbers, or possibly strings containing the number if parsing of a number by the usual methods fails somehow.

JSON does not allow leading zeros, or leading plus signs, so numbers like +100 or 0123 will cause an error.

JSON strings

JSON strings become Perl strings. The JSON escape characters such as \t for the tab character (see section 2.5 of "RFC 4627") are mapped to the equivalent ASCII character.

If the input string is marked as Unicode, Unicode escape characters of the form \uXXXX (see page three of "RFC 4627") are mapped to UTF-8. It also handles surrogate pairs.

Handling of Unicode

If the input to "parse_json" is marked as Unicode characters, the output strings will be marked as Unicode characters. If the input is not marked as Unicode characters, the output strings will not be marked as Unicode characters. Thus,

# The scalar $sasori looks like Unicode to Perl
use utf8;
my $sasori = '["蠍"]';
my $p = parse_json ($sasori);
print utf8::is_utf8 ($p->[0]);
# Prints 1.

but

# The scalar $ebi does not look like Unicode to Perl
no utf8;
my $ebi = '["海老"]';
my $p = parse_json ($ebi);
print utf8::is_utf8 ($p->[0]);
# Prints nothing.

Regardless of whether Perl thinks the input string is Unicode, escapes like \u87f9 are converted into the equivalent UTF-8 bytes and the particular string in which they occur is marked as a character string:

no utf8;
# 蟹
my $kani = '["\u87f9"]';
my $p = parse_json ($kani);
print utf8::is_utf8 ($p->[0]);
# Prints 1, because it's upgraded regardless of the input string's
# flags.

JSON arrays

JSON arrays become Perl array references. The elements of the Perl array are in the same order as they appear in the JSON.

Thus

my $p = parse_json ('["monday", "tuesday", "wednesday"]');

has the same result as a Perl declaration of the form

my $p = [ 'monday', 'tuesday', 'wednesday' ];

JSON objects

JSON objects become Perl hashes. The members of the JSON object become key and value pairs in the Perl hash. The string part of each object member becomes the key of the Perl hash. The value part of each member is mapped to the value of the Perl hash.

Thus

my $j = <<EOF;
{"monday":["blue", "black"],
 "tuesday":["grey", "heart attack"],
 "friday":"Gotta get down on Friday"}
EOF

my $p = parse_json ($j);

has the same result as a Perl declaration of the form

my $p = {
    monday => ['blue', 'black'],
    tuesday => ['grey', 'heart attack'],
    friday => 'Gotta get down on Friday',
};

null

The JSON null literal is mapped to a scalar $JSON::Parse::null containing the undefined value.

true

The JSON true literal is mapped to a scalar $JSON::Parse::true containing the value 1.

false

The JSON false literal is mapped to a scalar $JSON::Parse::false containing the value 0.

RESTRICTIONS

This module imposes the following restrictions on its input.

JSON only

JSON::Parse is a strict parser. It only parses JSON which exactly meets the criteria of "RFC 4627". That means, for example, JSON::Parse does not accept single quotes (') instead of double quotes ("), or numbers with leading zeros, like 0123.

No incremental parsing

JSON::Parse does not do incremental parsing. JSON::Parse only parses fully-formed JSON strings which include opening and closing brackets.

UTF-8 only

Although JSON may come in various encodings of Unicode, JSON::Parse only parses the UTF-8 format. If input is in a different Unicode encoding than UTF-8, convert the input before handing it to this module. For example, for the UTF-16 format,

use Encode 'decode';
my $input_utf8 = decode ('UTF-16', $input);
my $perl = parse_json ($input_utf8);

or, for a file,

open my $input, "<:encoding(UTF-16)", 'some-json-file'; 

This module does not attempt to do the determination of the nature of the octet stream, as described in part 3 of "RFC 4627".

DIAGNOSTICS

"valid_json" does not produce error messages. "parse_json" and "validate_json" die on encountering invalid input.

Error messages have the line number and the byte number of the input which caused the problem. If you find the error message unclear, please report that as a bug.

Parsing errors are fatal, so to continue after an error occurs, put the parsing into an eval block:

my $p;                       
eval {                       
    $p = parse_json ($j);  
};                           
if ($@) {                    
    # handle error           
}

At the moment the exact content of the diagnostics is not documented so please review the source code if you need more details on potential outputs. The name of the exception throwing function is currently "failburger", so search through the C files in the top directory of the distribution for "failburger":

cd JSON-Parse
grep "failburger" Json3-*.c

SPEED

On the author's computer, the module's speed of parsing is approximately the same or slightly faster than JSON::XS, with small variations depending on the type of input. Some special types of input, such as floating point numbers containing an exponential part, like "1e09", are about two or three times faster to parse with this module than with JSON::XS. The parsing of exponentials is done by the system's strtod function in in JSON::Parse, but JSON::XS contains its own exponential parser.

For validation, "valid_json" is faster than any other module known to the author.

There is some benchmarking code in the github repository under the directory "benchmarks" for those wishing to test these claims. The script benchmarks/bench is an adaptation of the similar script in the JSON::XS distribution.

Here is an example with "benchmarks/long.json", originally downloaded from http://dist.schmorp.de/misc/json/long.json:

Repetitions: 50 x 100 = 5000
--------------+------------+------------+
module        |      1/min |        min |
--------------|------------|------------|
JP::valid     |   9691.538 |  0.0051591 |
JSON::Parse   |   3313.665 |  0.0150890 |
JSON::XS      |   3318.751 |  0.0150659 |
--------------+------------+------------+

Here JP::valid is the running time of JSON::Parse's "valid_json". A higher number in the second column, or a smaller number in the third column, is faster.

Here is an example with "benchmarks/words-array.json":

Repetitions: 50 x 100 = 5000		 
--------------+------------+------------+
module        |      1/min |        min |
--------------|------------|------------|
JP::valid     | 169535.327 |  0.0002949 |
JSON::Parse   |  22533.061 |  0.0022190 |
JSON::XS      |  21185.493 |  0.0023601 |
--------------+------------+------------+

Here is an example with "benchmarks/exp.json", containing floating point numbers:

Repetitions: 50 x 100 = 5000
--------------+------------+------------+
module        |      1/min |        min |
--------------|------------|------------|
JP::valid     |  83352.623 |  0.0005999 |
JSON::Parse   |  30826.870 |  0.0016220 |
JSON::XS      |  13545.743 |  0.0036912 |
--------------+------------+------------+

Here is an example with "benchmarks/literals.json", containing JSON literals:

Repetitions: 50 x 100 = 5000
--------------+------------+------------+
module        |      1/min |        min |
--------------|------------|------------|
JP::valid     | 182519.756 |  0.0002739 |
JSON::Parse   |  31564.600 |  0.0015841 |
JSON::XS      |  17939.709 |  0.0027871 |
--------------+------------+------------+

Here is an example with "benchmarks/cpantesters.json", a 250K file:

Repetitions: 5 x 10 = 50
--------------+------------+------------+
module        |      1/min |        min |
--------------|------------|------------|
JP::valid     |    876.736 |  0.0057030 |
JSON::Parse   |    140.631 |  0.0355539 |
JSON::XS      |    131.866 |  0.0379171 |
--------------+------------+------------+

SEE ALSO

RFC 4627

JSON is specified in http://www.ietf.org/rfc/rfc4627.txt.

JSON, JSON::XS

These modules allow both reading and writing of JSON.

TEST RESULTS

The ActiveState test results are at http://code.activestate.com/ppm/JSON-Parse/.

EXPORTS

The module exports nothing by default. Functions "parse_json" and "valid_json" can be exported on request.

AUTHOR

Ben Bullock, <bkb@cpan.org>

LICENSE

JSON::Parse can be used, copied, modified and redistributed under the same terms as Perl itself.