NAME

graph_profile.pl

A script to graph Y values along a genomic coordinate X-axis.

SYNOPSIS

graph_profile.pl <filename>

Options:
--in <filename>
--index <index1,index2,...>
--all
--cen
--xindex <index>
--skip <integer>
--offset <integer>
--xformat <integer>
--min <number>
--max <number>
--yformat <integer>
--ytick <integer>
--color <name,name,...>
--dir <foldername>
--version
--help

OPTIONS

The command line flags and descriptions:

--in <filename>

Specify an input file containing either a list of database features or genomic coordinates for which to collect data. The file should be a tab-delimited text file, one row per feature, with columns representing feature identifiers, attributes, coordinates, and/or data values. The first row should be column headers. Text files generated by other BioToolBox scripts are acceptable. Files may be gzipped compressed.

--index <index>

Specify the column number(s) corresponding to the dataset(s) in the file to graph. Number is 0-based index. Each dataset should be demarcated by a comma. A range of indices may also be specified using a dash to demarcate the beginning and end of the inclusive range. Multiple datasets to be graphed together should be joined with an ampersand. For example, "2,4-6,5&6" will individually graph datasets 2, 4, 5, 6, and a combination 5 and 6 graph.

If no dataset indices are specified, then they may be chosen interactively from a list.

--all

Automatically graph all available datasets present in the file.

--cen

Datasets should (not) be median centered prior to graphing. Useful when graphing multiple datasets together when they have different medians. Default is false.

--xindex <index>

Specify the column index of the X-axis dataset. Unless specified, the program automatically uses the columns labeled 'Midpoint' or 'Window', if present.

--skip <integer>

Specify the ordinal number of X axis major ticks to label. This avoids overlapping labels. The default is 1 (every tick is labeled).

--offset <integer>

Specify the number of X axis ticks to skip at the beginning before starting to label them. This may help in adjusting the look of the graph. The default is 0.

--xformat <integer>

Specify the number of decimal places the X axis labels should be formatted. The default is undefined (no formatting).

--min <number>
--max <number>

Specify the minimum and/or maximum values for the Y-axis. The default values are automatically determined from the dataset.

--yformat <integer>

Specify the number of decimal places the Y axis labels should be formatted. The default is undefined (no formatting).

--ytick <integer>

Specify explicitly the number of major ticks for the Y axes. The default is 8.

--color <name,name,...>

Optionally specify the colors for the data lines. The default set is lred, lgreen, lblue, lyellow, lpurple, cyan, and lorange. See the documentation for GD::Graph::colour for a complete list.

--dir

Optionally specify the name of the target directory to place the graphs. The default value is the basename of the input file appended with "_graphs".

--version

Print the version number.

--help

Print this help documenation

DESCRIPTION

This program will generate PNG graphic files representing a profile of values plotted along a specific X-axis. For example, plotting values along genomic coordinates or relative positions along a feature. The X-axis values are static and each dataset is plotted against it. One or more datasets may be plotted on a single graph, each in a different color, with a legend printed beneath the graph. The graph is a simple Bezier-smoothed line graph.

The resulting files are written to a subdirectory named after the input file. The files are named after the dataset name (column header) with a prefix.

AUTHOR

Timothy J. Parnell, PhD
Howard Hughes Medical Institute
Dept of Oncological Sciences
Huntsman Cancer Institute
University of Utah
Salt Lake City, UT, 84112

This package is free software; you can redistribute it and/or modify it under the terms of the Artistic License 2.0.