NAME
Bio::DOOP::Util::Run::Fuzznuc - Fuzznuc module
VERSION
Version 0.7
SYNOPSIS
#!/usr/bin/perl -w
use Bio::DOOP::DOOP;
$db = Bio::DOOP::DBSQL->connect("user","pass","doop-plant-1_5","localhost");
@list = ("81001020","81001110","81001200","81001225","81001230","81001290","81001470","81001580","81001610","81001620","81001680");
$fuzznuc = Bio::DOOP::Util::Run::Fuzznuc->new($db,'500','M',\@list,"/data/DOOP/dummy.txt");
print $fuzznuc->get_tmp_file_name,"\n";
$error = $fuzznuc->run('TTGGGC' , 1 , 0);
if ($error == -1){
die "No results or error!\n";
}
@res = @{$fuzznuc->get_results};
for $result (@res){
print $$result[0]->get_id,"| ",$$result[1]," ",$$result[2]," ",$$result[3]," ",$$result[4],"\n";
}
DESCRIPTION
This module is a wrapper for the EMBOSS (http://emboss.sourceforge.net) program fuzznuc. You can search for patterns in the promoter sequences.
AUTHORS
Tibor Nagy, Godollo, Hungary and Endre Sebestyen, Martonvasar, Hungary
METHODS
new
Create new Fuzznuc object.
Arguments:
1. Bio::DOOP::DBSQL object 2. promoter type (500, 1000, 3000) 3. subset type (depends on reference species) 4. arrayref of clusters 5. temporary file name (default: /tmp/fuzznuc_run.txt, will contain fasta sequences)
$fuzznuc = Bio::DOOP::Util::Run::Fuzznuc->new($db,500,'M',\@list,'/tmp/tmpfile');
new_by_file
Create new fuzznuc object from query file, containing cluster ids.
Arguments:
1. Bio::DOOP::DBSQL object 2. promoter type (500, 1000, 3000) 3. subset type (depends on reference species) 4. file containing cluster ids 5. temporary file name (default: /tmp/fuzznuc_run.txt, will contain fasta sequences)
$fuzznuc = Bio::DOOP::Util::Run::Fuzznuc->new($db,500,'M','/tmp/clusters.txt','/tmp/tmpfile');
new_by_tmp
Create new fuzznuc object from existing temporary file, containing query sequences in fasta format.
Arguments:
1. Bio::DOOP::DBSQL object 2. file containing fasta sequences
$fuzznuc = Bio::DOOP::Util::Run::Fuzznuc->new($db,'/tmp/sequences.fasta');
get_tmp_file_name
Get the temporary file name.
$tempname = $fuzznuc->get_tmp_file_name;
get_emboss_version
Get the installed emboss version.
$version = $fuzznuc->get_emboss_version;
run
Runs fuzznuc, returns 0 on success, otherwise -1.
Arguments :
1. query pattern 2. mismatch number 3. complement (0 or 1)
$fuzznuc_error = $fuzznuc->run('AACCAGGTT','1','1');
run_background
Runs fuzznuc in background, returns the process id.
Arguments :
1. query pattern 2. mismatch number 3. complement (0 or 1) 4. output filename
$fuzznuc_pid = $fuzznuc->run_background('AACCAGGTT','1','1','/tmp/fuzznuc_result.txt');
get_raw_results
Returns an arrayref of arrays with the raw fuzznuc results, without Bio::DOOP objects. This is much faster as it does not use the database.
The results contain the following:
1. sequence ID 2. hit start 3. hit end 4. mismatch number 5. hit sequence 6. hit strand
@result = @{$fuzznuc->get_raw_results};
get_results
Returns an arrayref of arrays with sequence objects and other information of the results.
The results contain the following:
1. Bio::DOOP::Sequence object 2. hit start 3. hit end 4. mismatch number 5. hit sequence 6. hit strand
@result = @{$fuzznuc->get_raw_results};
get_results_from_file
Returns an arrayref of arrays with sequence objects and other information of the results from a results file. With this method you can fetch the results of different fuzznuc objects.
The results contain the following:
1. Bio::DOOP::Sequence object 2. hit start 3. hit end 4. mismatch number 5. hit sequence 6. hit strand
@result = @{$fuzznuc->get_results_from_file};