get_entity_Family
KBase will support the maintenance of protein families (as sets of Features with associated translations). Initially we support only the notion of a family as a set of isofunctional homologs. That is, the families we initially support should be thought of as containing protein-encoding genes whose associated sequences all implement the same function (we recognize that the notion of "function" is somewhat ambiguous, so we sidestep the issue by treating a functional role as a "primitive concept"). We currently support families whose members are translations of Features, and we think of Features as having an associated function. Identical protein sequences produced by translating distinct genes may or may not have identical functions, so we allow multiple members of the same Family to share identical protein sequences. This is justified because, in a very small number of cases, identical proteins do in fact have distinct functions. We would prefer to reach the point where our Families are sets of protein sequences rather than sets of protein-encoding Features.
Example:
get_entity_Family -a < ids > table.with.fields.added
would read in a file of ids and add a column for each field in the entity.
The standard input should be a tab-separated table (i.e., each line is a tab-separated set of fields). Normally, the last field in each line would contain the id. If some other column contains the id, use
-c N
where N is the column (from 1) that contains the id.
This is a pipe command. The input is taken from the standard input, and the output is to the standard output.
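As a minimal sketch of the -c option described above, the following builds a small tab-separated input whose id sits in the first column rather than the last. The ids (fam.0001, fam.0002) and the file names are hypothetical placeholders, and the call is guarded so that the snippet does nothing harmful on a machine where get_entity_Family is not installed.

```shell
# Build a two-column, tab-separated input file.
# The ids and notes are hypothetical placeholders, not real KBase data.
printf 'fam.0001\tnote A\nfam.0002\tnote B\n' > ids.tsv

# The id is in column 1 (not the last column), so -c 1 is needed.
# Run the command only if it is available on this machine.
if command -v get_entity_Family >/dev/null 2>&1; then
    get_entity_Family -c 1 -fields type,family_function < ids.tsv > families.tsv
fi
```

Without -c 1, the command would treat the last column (the notes) as the id.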
Related entities
The Family entity has the following relationship links:
- HasMember Feature
- HasProteinMember ProteinSequence
- IsCoupledTo Family
- IsCoupledWith Family
- IsFamilyFor Role
- IsRepresentedIn Genome
Command-Line Options
- -c Column
  Use the specified column to define the id of the entity to retrieve.
- -h
  Display a list of the fields available for use.
- -fields field-list
  Choose a set of fields to return. Field-list is a comma-separated list of strings. The following fields are available:
- type
- release
- family_function
- alignment
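The options above might be combined as in the following sketch, which first lists the available fields with -h and then requests three of them with -fields. The id fam.0001 and the file name family_ids.txt are hypothetical; only the field names come from the list above.

```shell
# Hypothetical family id; replace with ids from your own data.
printf 'fam.0001\n' > family_ids.txt

# Comma-separated field list, using names from the documented list.
fields="type,release,family_function"

# Run the commands only if get_entity_Family is installed.
if command -v get_entity_Family >/dev/null 2>&1; then
    # Show which fields can be requested.
    get_entity_Family -h
    # Append one column per requested field to each input line.
    get_entity_Family -fields "$fields" < family_ids.txt
fi
```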
Output Format
The standard output is a tab-delimited file. It consists of the input file with an extra column added for each requested field. Input lines that cannot be extended are written to stderr.
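Because extended lines go to standard output and lines that cannot be extended go to stderr, the two streams can be captured separately. The sketch below also computes how many columns each output line should have (input columns plus one per requested field) and checks the result with awk. The ids and file names are hypothetical, and the check assumes every field value is written as exactly one tab-separated column.

```shell
# Hypothetical single-column input; the ids are placeholders.
printf 'fam.0001\nfam.0002\n' > ids.tsv
fields="type,release"

# Expected width of each output line: input columns + requested fields.
in_cols=$(head -n 1 ids.tsv | awk -F'\t' '{print NF}')
n_fields=$(printf '%s\n' "$fields" | awk -F',' '{print NF}')
expected=$((in_cols + n_fields))

# Run the command only if get_entity_Family is installed.
if command -v get_entity_Family >/dev/null 2>&1; then
    # Extended lines go to extended.tsv; unextendable lines go to rejected.tsv.
    get_entity_Family -fields "$fields" < ids.tsv > extended.tsv 2> rejected.tsv
    # Report any output line whose column count differs from the expectation.
    awk -F'\t' -v n="$expected" 'NF != n { print "unexpected width on line " NR }' extended.tsv
fi
```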