NAME
Validate::CodiceFiscale - Validate an Italian "Codice Fiscale"
VERSION
This document describes Validate::CodiceFiscale version 0.004.
SYNOPSIS
One-liner:
$ perl -MValidate::CodiceFiscale=r -er RSSMRA98S03B833G
Module usage:
use Validate::CodiceFiscale qw< assert_valid_cf is_valid_cf validate_cf >;
my $valid = 'RSSMRA98S03B833G';
my $invalid = 'RSICRL99C51C967X';
# the first does not warn, the second does warn
eval { assert_valid_cf($valid); 1 } or warn "died: $@";
eval { assert_valid_cf($invalid); 1 } or warn "died: $@";
# plain boolean test, any error short-circuits
if (is_valid_cf($cf)) { ... }
# get everything that's wrong
if (my $errors = validate_cf($invalid)) {
say for $errors->@*; # array with error report, one line per error
}
# it's possible to pass additional validation options, like specific
# data. All are optional, if present they're validate, otherwise
# ignored.
assert_valid_cf($cf,
{
data => {
name => 'Foo',
surname => 'Bar',
sex => 'f',
date => '1998-03-11',
place => 'B833',
}
}
);
# the assertion short-circuits by default, failing at the first
# error. It's possible to check everyting and get a longer error
# message, in case.
assert_valid_cf($cf, { all_errors => 1 });
# it's also possible to wrap the error generation, by returning the
# exception to throw
assert_valid_cf($cf,
{
all_errors => 1,
on_error => sub {
my @errors = @_;
return "number of errors: $n_errors\n";
}
}
);
# of course, it's possible to throw the exception directly
use Ouch;
assert_valid_cf($cf, { on_error => sub { ouch 400, $_[0] } });
DESCRIPTION
This module performs partial validation of Italian Codice Fiscale, mainly at the syntactic level. For proper and legally acknowledged validation please refer to the official service by Agenzia delle Entrate at https://telematici.agenziaentrate.gov.it/VerificaCF/Scegli.do?parameter=verificaCf.
At the basic level, it allows spotting common errors that might come from copying/typing the Codice Fiscale, e.g. typing a 0 (zero) instead of an uppercase o letter, or similar errors with other digits/letters.
The validation can optionally take additional data regarding the name, surname, birth date, birth place, and sex to perform a more thorough validation based on the official rules. This part is subject to false validations (negative or positive) for the following reasons:
Places are generally validated against available data collected from ANPR and ISTAT, but they might be lacking. Additionally, there is no attempt at matching similar names, or names that are valid but e.g. expressed in their original language.
The algorithm to generate the CodiceFiscale might lead to the same string for different people. This is accounted for by using a substitution of digits with letters, but only Agenzia delle Entrate can tell whether the specific person holds the specific variant of the code.
In the end, it provides a reaasonable way of spotting negatives, but it might validate codes that would otherwise be considered invalid by the reference authority because there is nobody tied to the otherwise synctactically valid code.
INTERFACE
There are four main functions for doing checks, each targeting a different style of use, plus an additional function that can be helpful for one-liners.
assert_valid_cf
assert_valid_cf($cf, %options);
Check validity of the provided $cf
and throw an exception if the check fails.
By default, the exception is thrown:
using Carp's
croak
with a message containing a string joining all errorsas soon as one of the validation checks fails.
Supported options:
all_errors
-
collect all errors to be fed into the exception, not only the first one.
data
-
pass additional data for validation, as a hash reference optionally containing keys
name
,surname
,date
,sex
, andplace
. on_error
-
wrap the exception generation with a sub reference used as a callback with the following signature:
sub (@error_strings) { ... }
The sub can throw the exception itself; otherwise, its return value will be used as the argument for
die
.
decode_cf
my $decoded = decode_cf($cf, %options);
Perform a full validation and provides back a data structure with the parsed data and the outcome of all available tests. This is the most complete form of validation that can be performed by this module.
Supported options:
data
-
pass additional data for validation, as a hash reference optionally containing keys
name
,surname
,date
,sex
, andplace
. years_baseline
-
the baseline year for disambiguating two-digits. As an example, people born in 1920 and people born in 2020 both have
20
as the year part in their respective codes, so this allows setting the right expectation. In lack of this parameter, whatever came last is selected (in the example above,20
would be interpreted as2020
).
The output data structure contains the following keys:
errors
-
pointing to an array reference with a list of validation errors (if any);
portions
-
pointing to a hash reference with the disassemble of the input data, if possible (it is always possible as long as the input length is 16 characters).
date
-
as
YYYY-MM-DD
) place
-
as string of characters
sex
-
as a single-letter character, either
F
orM
year
month
day
-
as integers.
The decoded data can be useful to perform further validations, e.g. for validating the place name in cases that are not covered by this module (e.g. when expressed in a language different from Italian, etc.).
is_valid_cf
my $boolean = is_valid_cf($cf, %options);
Check the validity; it short-circuits at the first error. Returns a boolean value.
It's possible to pass an additional key/value pair with key data
and a hash reference optionally containing keys name
, surname
, date
, sex
, and place
.
r
$ perl -MValidate::CodiceFiscale=r -er RSSMRA98S03B833G
The r
function can be useful in one-liners to check one or more codes from the command line. By default, the r
function will read input codes from @ARGV
.
validate_cf
my $errors = validate_cf($cf, %options);
Check the validity and return undef
if no errors were encountered, or a reference to an array containing the list of errors otherwise.
Supported options:
all_errors
-
collect all errors to be fed into the exception, not only the first one. This option defaults to a true value, to collect all errors; it's possible to pass a false value to short-circuit and exit at the first error.
data
-
pass additional data for validation, as a hash reference optionally containing keys
name
,surname
,date
,sex
, andplace
. years_baseline
-
the baseline year for disambiguating two-digits. As an example, people born in 1920 and people born in 2020 both have
20
as the year part in their respective codes, so this allows setting the right expectation. In lack of this parameter, whatever came last is selected (in the example above,20
would be interpreted as2020
).
SEE ALSO
String::CodiceFiscale is an alternative and precedent module to deal with Codice Fiscale. I'm a bit scared about using a class method error
to collect validation errors, hence this module.
BUGS AND LIMITATIONS
Minimum perl version 5.24.
Report bugs through GitHub (patches welcome) at https://github.com/polettix/Validate-CodiceFiscale.
AUTHOR
Flavio Poletti <flavio@polettix.it>
COPYRIGHT AND LICENSE
This distribution is mainly composed of code, but it also includes data (in the __DATA__
section of the module) that is derived from publicly available data sources.
Code
Copyright 2023 by Flavio Poletti <flavio@polettix.it>
Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with the License. You may obtain a copy of the License at
http://www.apache.org/licenses/LICENSE-2.0
Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.
Data
The data contained in the __DATA__
section of the main module is assembled from the following sources:
Archive of Italian codes for places, including periods of validity, from file ANPR_archivio_comuni.csv, available from Italian Ministero dell'Interno with license CC-BY 4.0 (according to page Note legali as of 2023-08-06). The file is accessible from page Archivio storico dei comuni.
Archive of codes for foreign places, from file Elenco codici e denominazioni delle unit� territoriali estere, available from ISTAT with license CC-BY 3.0 (according to page Note legali: responsabilit� e licenza as of 2023-08-06). The file is accessible from page Codici delle unit� territoriali estere.