NAME

go2fmt.pl go2obo_xml go2owl go2rdf_xml go2obo_text

SYNOPSIS

go2fmt.pl -w obo_xml -e errlog.xml ontology/*.ontology
go2fmt.pl -w obo_xml -e errlog.xml ontology/gene_ontology.obo

DESCRIPTION

parses any GO/OBO style ontology file and writes out as a different format

ARGUMENTS

-e ERRFILE

writes parse errors in XML - defaults to STDERR (there should be no parse errors in well formed files)

-p FORMAT

determines which parser to use; if left unspecified, will make a guess based on file suffix. See below for formats

-w|writer FORMAT

format for output - see below for list

-use_cache

If this switch is specified, then caching mode is turned on.

With caching mode, the first time you parse a file, then an additional file will be exported in a special format that is fast to parse. This file will have the same filename as the original file, except it will have the ".cache" suffix.

The next time you parse the file, this program will automatically check for the existence of the ".cache" file. If it exists, and is more recent than the file you specified, this is parsed instead. If it does not exist, it is rebuilt.

This will bring a speed improvement for b<some> of the output formats below (such as pathlist). Most output formats work with event-based parsing, so caching the object brings no benefit and will in fact be slower than bypassing the cache

FORMATS

writable formats are

go_ont

Files with suffix ".ontology"

These store the ontology DAGs

go_def

Files with suffix ".defs"

go_xref

External database references for GO terms

Files with suffix "2go" (eg ec2go, metacyc2go)

go_assoc

Annotations of genes or gene products using GO

Files with prefix "gene-association."

obo_text

Files with suffix ".obo"

This is a new file format replacement for the existing GO flat file formats. It handles ontologies, definitions and xrefs (but not associations)

obo_xml

Files with suffix ".obo.xml" or ".obo-xml"

This is the XML version of the OBO flat file format above

prolog

prolog facts - you will need a prolog compiler/interpreter to use these. You can reason over these facts using Obol or the forthcoming Bio-LP project

tbl

simple (lossy) tabular representation

summary

can be used on both ontology files and association files

pathlist

shows all paths to the root

owl

OWL format (default: OWL-DL)

OWL is a W3C standard format for ontologies

You will need the XSL files from the full go-dev distribution to run this; see the XML section in http://www.godatabase.org/dev

obj_yaml

a YAML representation of a GO::Model::Graph object

obj_storable

A dump of the perl GO::Model::Graph object. You need Storable from CPAN for this to work. This is intended to cache objects on the filesystem, for fast access. The obj_storable representation may not be portable

text_html

A html-ified OBO output format

godb_prestore

XML that maps directly to the GODB relational schema (can then be loaded using stag-storenode.pl)

chadodb_prestore

XML that maps directly to the Chado relational schema (can then be loaded using stag-storenode.pl)

DOCUMENTATION

http://www.godatabase.org/dev