NAME

libraries_overlap_stats.pl

SYNOPSIS

libraries_overlap_stats.pl [options/parameters]

Measure the number of records of a library A that overlap those of one or more reference libraries B1, B2, B3, ...

Input options for primary library.
    -p_type <Str>          input type (eg. DBIC, BED).
    -p_file <Str>          input file. Only works if p_type specifies a file type.
    -p_driver <Str>        driver for database connection (eg. mysql, SQLite). Only works if 
                           p_type is DBIC.
    -p_database <Str>      database name or path to database file for file based databases (eg. SQLite). Only works if p_type is DBIC.
    -p_table <Str>         database table. Only works if p_type is DBIC.
    -p_host <Str>          hostname for database connection. Only works if p_type is DBIC.
    -p_user <Str>          username for database connection. Only works if p_type is DBIC.
    -p_password <Str>      password for database connection. Only works if p_type is DBIC.
    -p_records_class <Str> type of records stored in database (Default:
                           GenOO::Data::DB::DBIC::Species::Schema::SampleResultBase::v3).

Input options for reference library.
    -r_type <Str>          input type (eg. DBIC, BED).
    -r_file <Str>          input file. Only works if r_type specifies a file type. If used more
                           than once, reference libraries are merged.
    -r_driver <Str>        driver for database connection (eg. mysql, SQLite). Only works if 
                           r_type is DBIC.
    -r_database <Str>      database name or path to database file for file based databases
                           (eg. SQLite). Only works if r_type is DBIC.
    -r_table <Str>         database table. Only works if r_type is DBIC. If used more
                           than once, reference libraries are merged.
    -r_host <Str>          hostname for database connection. Only works if r_type is DBIC.
    -r_user <Str>          username for database connection. Only works if r_type is DBIC.
    -r_password <Str>      password for database connection. Only works if r_type is DBIC.
    -r_records_class <Str> type of records stored in database (Default:
                           GenOO::Data::DB::DBIC::Species::Schema::SampleResultBase::v3).

Other input.
    -rname_sizes <Str>     file with sizes for reference alignment sequences (rnames). Must be tab
                           delimited (chromosome\tsize) with one line per rname.

Output.
    -o_file <Str>          filename for output file. If path does not exist it will be created.

Input Filters (only for DBIC input type).
    -p_filter <Filter>     filter primary collection. Option can be given multiple times. 
    -r_filter <Filter>     filter reference collection. Option can be given multiple times.
                           Syntax: column_name="pattern"
                             e.g. -p_filter deletion="def" -p_filter rmsk="undef" to keep only reads with deletions and not repeat masked.
                             e.g. -r_filter query_length=">31" -r_filter query_length="<=50" to keep reads longer than 31 and shorter or equal to 50.
                           Supported operators: ">", ">=", "<", "<=", "=", "!=","def", "undef"

Other options.
    -v                     verbosity. If used progress lines are printed.
    -h                     print help message
    -man                   show man page

DESCRIPTION

Measure the number of records of a library A that overlap those of one or more reference libraries B1, B2, B3, ... If more than one reference libraries are given then they are merged into a single one and the overlap is calculated afterwards.