NAME

Bio::Util::DNA - Basic DNA utilities

SYNOPSIS

use Bio::Util::DNA qw(:all);

my $seq_ref   = randomDNA(100);
my $clean_ref = cleanDNA($seq_ref);
my $rev_ref   = reverse_complement($seq_ref);

DESCRIPTION

Provides a set of functions and predefined variables which are handy when working with DNA.

VARIABLES

BASIC VARIABLES

Basic nucleotide variables. Each variable name combines one of the prefixes below with one of the suffixes below:

Prefixes

DNA [ACGT]
RNA [ACGU]
degenerate
all_nucleotide

Suffixes

${prefix}s: String of the different nucleotides
@{prefix}s: Array of the different nucleotides
${prefix}_match: Precompiled regular expression which matches nucleotide characters
${prefix}_fail: Precompiled regular expression which matches non-nucleotide characters
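
The suffix scheme above can be illustrated with a small standalone sketch. The variables below are local stand-ins defined in the script itself, not imports from Bio::Util::DNA; their exact names, contents, and case-insensitivity are assumptions made for illustration.

```perl
use strict;
use warnings;

# Hypothetical stand-ins for the DNA-prefixed variables. The names follow
# the prefix/suffix scheme, but the contents and the /i flag are assumptions.
my $DNAs      = 'ACGT';             # ${prefix}s: string of nucleotides
my @DNAs      = split //, $DNAs;    # @{prefix}s: array of nucleotides
my $DNA_match = qr/[$DNAs]/i;       # matches one nucleotide character
my $DNA_fail  = qr/[^$DNAs]/i;      # matches one non-nucleotide character

# Return 1 when the sequence contains only DNA characters, 0 otherwise.
sub is_plain_dna {
    my ($seq) = @_;
    return $seq !~ $DNA_fail ? 1 : 0;
}

# Count how many characters of a sequence are DNA nucleotides.
sub count_dna_chars {
    my ($seq) = @_;
    my $count = () = $seq =~ /$DNA_match/g;
    return $count;
}

print is_plain_dna('ACGTTGCA'), "\n";    # 1
print is_plain_dna('ACGUNN'),   "\n";    # 0
print count_dna_chars('ACGUNN'), "\n";   # 3
```

A "fail" pattern is convenient for validation because a single match is enough to reject the input.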

%degenerate2nucleotides

Hash of degenerate nucleotide definitions. Each key is a degenerate nucleotide, and each value is a reference to an array of the DNA nucleotides that it stands for.

%nucleotides2degenerate

Reverse of %degenerate2nucleotides. Keys are alphabetically-sorted DNA nucleotides and values are the degenerate nucleotide that can represent those nucleotides.
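
As a rough illustration of how the two hashes relate, the sketch below writes out the standard IUPAC degenerate codes and inverts the table. The hash names mirror the documentation, but the contents are typed in here rather than imported, and the joined-string key format (for example 'AG') is an assumption about how the module stores its keys.

```perl
use strict;
use warnings;

# IUPAC degenerate codes and the DNA bases each one stands for.
# Written out from the IUPAC convention, not copied from the module.
my %degenerate2nucleotides = (
    N => [qw(A C G T)],
    B => [qw(C G T)], D => [qw(A G T)], H => [qw(A C T)], V => [qw(A C G)],
    K => [qw(G T)],   M => [qw(A C)],   R => [qw(A G)],
    S => [qw(C G)],   W => [qw(A T)],   Y => [qw(C T)],
);

# Invert the table: the sorted bases are joined into a key such as 'AG'.
my %nucleotides2degenerate;
for my $code ( keys %degenerate2nucleotides ) {
    my $key = join '', sort @{ $degenerate2nucleotides{$code} };
    $nucleotides2degenerate{$key} = $code;
}

print "@{ $degenerate2nucleotides{D} }\n";    # A G T
print "$nucleotides2degenerate{AG}\n";        # R
```

Sorting the bases before joining gives every set of nucleotides exactly one key, so a lookup does not depend on the order in which the bases were collected.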

%degenerate_hierarchy

Contains the hierarchy of degenerate nucleotides: N contains all the other degenerates, and each of the four degenerates that stand for three different bases contains three of the two-base degenerates.
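
The containment relation described here can be derived from the base sets by a proper-subset test. The sketch below is one way to do that; the layout of the resulting hash (code to array of contained codes) is an assumption and may differ from the structure the module actually exports.

```perl
use strict;
use warnings;

# IUPAC degenerate codes, written out here rather than imported.
my %bases_for = (
    N => [qw(A C G T)],
    B => [qw(C G T)], D => [qw(A G T)], H => [qw(A C T)], V => [qw(A C G)],
    K => [qw(G T)],   M => [qw(A C)],   R => [qw(A G)],
    S => [qw(C G)],   W => [qw(A T)],   Y => [qw(C T)],
);

# True when $inner is smaller than $outer and all of its bases are in $outer.
sub is_proper_subset {
    my ( $inner, $outer ) = @_;
    return 0 if @$inner >= @$outer;
    my %in_outer = map { $_ => 1 } @$outer;
    my $missing  = grep { !$in_outer{$_} } @$inner;
    return $missing ? 0 : 1;
}

# For each degenerate code, list the degenerate codes it contains.
my %degenerate_hierarchy;
for my $outer ( sort keys %bases_for ) {
    $degenerate_hierarchy{$outer} = [
        grep { is_proper_subset( $bases_for{$_}, $bases_for{$outer} ) }
        sort keys %bases_for
    ];
}

print "B: @{ $degenerate_hierarchy{B} }\n";             # B: K S Y
print scalar( @{ $degenerate_hierarchy{N} } ), "\n";    # 10
```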

FUNCTIONS

cleanDNA

my $clean_ref = cleanDNA($seq_ref);

Cleans the sequence for use. Strips out comments (lines starting with '>') and whitespace, converts uracil to thymine, and capitalizes all characters.

Examples:

my $clean_ref = cleanDNA($seq_ref);

my $seq_ref1 = cleanDNA(\'actg');
my $seq_ref2 = cleanDNA(\'act tag cta');
my $seq_ref3 = cleanDNA(\'>some mRNA
                         acugauauagau
                         uauagacgaucc');
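
The cleaning steps described above can be sketched in a few lines of core Perl. The helper below is hypothetical and is not the module's cleanDNA; it only follows the documented behavior and the reference-in, reference-out calling convention. Allowing leading whitespace before '>' is an assumption.

```perl
use strict;
use warnings;

# Hypothetical helper, not the module's cleanDNA.
# Takes a reference to a string and returns a reference to a cleaned copy.
sub clean_dna_sketch {
    my ($seq_ref) = @_;
    my $clean = $$seq_ref;
    $clean =~ s/^\s*>.*$//mg;    # drop comment lines (e.g. FASTA headers)
    $clean =~ s/\s+//g;          # remove all whitespace, including newlines
    $clean =~ tr/uU/tT/;         # convert uracil to thymine
    $clean = uc $clean;          # capitalize all characters
    return \$clean;
}

my $ref = clean_dna_sketch( \">some mRNA\nacugauauagau\nuauagacgaucc" );
print "$$ref\n";    # ACTGATATAGATTATAGACGATCC
```

Comment lines are removed before whitespace because the line breaks are what mark where a comment ends.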

randomDNA

my $seq_ref = randomDNA($length);

Generate random DNA for testing this module or your own scripts. Default length is 100 nucleotides.

Example:

my $seq_ref      = randomDNA();       # 100 nucleotides (default)
my $long_seq_ref = randomDNA(600);    # 600 nucleotides
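
For readers who want to see the idea without the module, here is a minimal sketch of a random sequence generator. The function name is hypothetical, and drawing each base uniformly from A, C, G, and T is an assumption; the documentation does not say which distribution randomDNA uses.

```perl
use strict;
use warnings;

# Hypothetical helper, not the module's randomDNA.
# Returns a reference to a random string of A, C, G, and T.
sub random_dna_sketch {
    my ($length) = @_;
    $length = 100 unless defined $length;    # documented default length
    my @bases = qw(A C G T);
    my $seq = join '', map { $bases[ int rand @bases ] } 1 .. $length;
    return \$seq;
}

my $ref = random_dna_sketch(12);
print length($$ref), "\n";    # 12
```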

reverse_complement

rev_comp

my $reverse_ref = reverse_complement($seq_ref);

Finds the reverse complement of the sequence and handles degenerate nucleotides.

Example:

$reverse_ref = reverse_complement(\'act');
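
A reverse complement that also handles degenerate nucleotides can be sketched with reverse and tr///. The helper below is hypothetical and not the module's implementation; it uses the standard IUPAC complements (R and Y, K and M, B and V, D and H swap, while S, W, and N are their own complements). Preserving the case of the input is an assumption.

```perl
use strict;
use warnings;

# Hypothetical helper, not the module's reverse_complement.
# Takes a reference to a string and returns a reference to its
# reverse complement, including IUPAC degenerate codes.
sub reverse_complement_sketch {
    my ($seq_ref) = @_;
    my $rc = reverse $$seq_ref;    # scalar context: reverses the string
    $rc =~ tr/ACGTRYKMBVDHacgtrykmbvdh/TGCAYRMKVBHDtgcayrmkvbhd/;
    return \$rc;
}

print ${ reverse_complement_sketch( \'ACT' ) },    "\n";    # AGT
print ${ reverse_complement_sketch( \'ACSTAD' ) }, "\n";    # HTASGT
```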

unrollDNA

my $seq_arrayref = unrollDNA( $seq_ref );

Unroll a DNA string containing degenerate nucleotides. The first entry of the arrayref will be the actual sequence.

Example:

my $seq_arrayref = unrollDNA( \'ACSTAD' );
# $seq_arrayref now refers to:
#     [
#         'ACSTAD', 'ACCTAD', 'ACGTAD',
#         'ACSTAR', 'ACCTAR', 'ACGTAR',
#         'ACSTAW', 'ACCTAW', 'ACGTAW',
#         'ACSTAK', 'ACCTAK', 'ACGTAK',
#         'ACSTAA', 'ACCTAA', 'ACGTAA',
#         'ACSTAG', 'ACCTAG', 'ACGTAG',
#         'ACSTAT', 'ACCTAT', 'ACGTAT'
#     ];
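
The listing for 'ACSTAD' suggests that each degenerate position is replaced by itself, by every narrower degenerate it contains, and by each of its bases. The sketch below reproduces that expansion under this reading. The helper names are hypothetical, uppercase input is assumed, and the option ordering is chosen to match this one documented example; the module may order other inputs differently.

```perl
use strict;
use warnings;

# Base sets for plain and IUPAC degenerate codes (written out here).
my %bases_for = (
    A => 'A',   C => 'C',   G => 'G',   T => 'T',
    K => 'GT',  M => 'AC',  R => 'AG',  S => 'CG',  W => 'AT', Y => 'CT',
    B => 'CGT', D => 'AGT', H => 'ACT', V => 'ACG',
    N => 'ACGT',
);

# Symbols a code may be replaced by: itself first, then every symbol
# whose base set is a proper subset, larger sets first.
sub options_for {
    my ($code) = @_;
    my $set = $bases_for{$code};
    return ($code) unless defined $set;    # unknown characters pass through
    my @subs = grep {
        my $sub = $bases_for{$_};
        length($sub) < length($set) && $sub !~ /[^$set]/
    } keys %bases_for;
    @subs = sort {
        length( $bases_for{$b} ) <=> length( $bases_for{$a} )
            || $bases_for{$a} cmp $bases_for{$b}
    } @subs;
    return ( $code, @subs );
}

# Hypothetical helper, not the module's unrollDNA.
sub unroll_sketch {
    my ($seq_ref) = @_;
    my @results = ('');
    for my $char ( split //, $$seq_ref ) {
        my @next;
        # Earlier positions vary fastest, as in the documented listing.
        for my $option ( options_for($char) ) {
            push @next, map { $_ . $option } @results;
        }
        @results = @next;
    }
    return \@results;
}

my $unrolled = unroll_sketch( \'ACSTAD' );
print scalar(@$unrolled), "\n";              # 21
print "$unrolled->[0] $unrolled->[3]\n";     # ACSTAD ACSTAR
```

The result grows multiplicatively with each degenerate position (3 options for S times 7 for D here), so long sequences with many degenerate codes can produce very large lists.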

AUTHOR

Kevin Galinsky, <first initial last name plus cpan at gmail dot com>

COPYRIGHT AND LICENSE

This program is free software; you can redistribute it and/or modify it under the same terms as Perl itself.

