all_entities_ProteinSequence

Return all instances of the ProteinSequence entity.

We use the concept of ProteinSequence as an amino acid string with an associated MD5 value. It is easy to access the set of Features that relate to a ProteinSequence. While function is still associated with Features (and may be for some time), publications are associated with ProteinSequences (and the inferred impact on Features is through the relationship connecting ProteinSequences to Features).

Example:

all_entities_ProteinSequence -a 

would retrieve all entities of type ProteinSequence and include all fields in the entities in the output.

The ProteinSequence entity has the following relationship links:

HasAssertedFunctionFrom Source
IsATopicOf Publication
IsAlignedBy AlignmentTree
IsProjectedOnto ProteinSequence
IsProteinFor Feature
IsProteinMemberOf Family
ProjectsOnto ProteinSequence

Command-Line Options

-a

Return all fields.

-h

Display a list of the fields available for use.

-fields field-list

Choose a set of fields to return. Field-list is a comma-separated list of strings. The following fields are available:

sequence

Output Format

The standard output is a tab-delimited file. It consists of the input file with an extra column added for each requested field. Input lines that cannot be extended are written to stderr.