parse
Usage : $ncbi_parser->parse ( $ncbi_nodes_path )
Returns : ref to a hash { child_id => parent_id }
Args : nodes.dmp path ( string )
Usage : $ncbi_parser->parse ( $ncbi_names_path, $name_type )
Returns : ref to a hash { ncbi_id => ncbi_name }
Args :
1. names.dmp path or nodes.dmp path, string
2. ncbi name type ( string, e.g. 'scientific name' ) if the first arg is names.dmp otherwise none
Function : parses the complete NCBI taxonomy
work
Usage : $NCBIParser->work ( $onto, $nodes, $names, $ncbi_ids )
Returns : map of added terms { NCBI ID => OBO::Core::Term object }
Args :
1. input ontology, OBO::Core::Ontology object
2. ref to a hash { child_id => parent_id }
3. ref to a hash { ncbi_id => scientific_name }
4. parental ontology term for the root of the taxonomy, OBO::Core::Term object
5. ref to a list of NCBI taxon ids ( \d+ )
Function : adds NCBI taxonomy to the input ontology for the specified taxa
NAME
OBO::Parser::NCBIParser - A NCBI taxonomy to OBO translator.
DESCRIPTION
This parser converts chosen parts of the NCBI taxonomy-tree into an OBO file. A taxon ID is given to the parser and the whole tree up to the root is reconstructed in the given OBO ontology, using scientific names.
The dump files ( nodes.dmp and names.dmp ) should be obtained from:
ftp://ftp.ncbi.nih.gov/pub/taxonomy/taxdump.tar.gz
TODO: include ranks and disjoints only in correlating ranks.
AUTHOR
Mikel Egana Aranguren and Vladimir Mironov
http://www.mikeleganaranguren.com, vladimir.n.mironov@gmail.com
COPYRIGHT AND LICENSE
Copyright (c) 2006 by Mikel Egana Aranguren
This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself, either Perl version 5.8.7 or, at your option, any later version of Perl 5 you may have available.