The London Perl and Raku Workshop takes place on 26th Oct 2024. If your company depends on Perl, please consider sponsoring and/or attending.

NAME

fu-cat - Concatenate FASTA or FASTQ files

VERSION

version 1.5.8

SYNOPSIS

  fu-cat [options] [FILE1 FILE2 FILE3...]

DESCRIPTION

This program parses a list of FASTA/FASTQ and will concatenate them ensuring consistent output. Will rename duplicate sequence names. Will try to autodetect the format of all files before executing and decide accordingly the output format (FASTA if at least one of the files is FASTA, otherwise FASTQ). If reading from STDIN the first sequence is in FASTQ format, will skip all the sequences without a quality string.

If no files are provided the program will try reading from STDIN, otherwise add a '-' to the list of files to also read from STDIN.

NAME

fu-cat - concatenate FASTA or FASTQ files

PARAMETERS

-s, --separator

When a second sequence with a name that was already printed is found, the program will append a progressive number, separated by this string. Use `fu-rename` if you need more options. [default: "."]

-f, --fasta

Force FASTA output

-q, --fastq

Force FASTQ output. Will not print any sequence without quality (they will be skipped)

-d, --dereplicate

Print each sequence only only once

-5, --rename-md5

(use with -d) rename each sequence name with the MD5sum of the sequence.

-l, --minlen INT

Do not print sequences shorter (exclusive) than INT

-m, --maxlen INT

Do not print sequences longer (exclusive) than INT

-u, --uppercase

Will print the whole sequence in uppercase

-w, --width INT

Size of the FASTA lines. Specifing 0 will print the whole sequence in the same line (default: 0)

--verbose

Print more details

--help

Display this help page

--version

Print version and exit

MODERN ALTERNATIVE

This suite of tools has been superseded by SeqFu, a compiled program providing faster and safer tools for sequence analysis. This suite is maintained for the higher portability of Perl scripts under certain circumstances.

SeqFu is available at https://github.com/telatin/seqfu2, and can be installed with BioConda conda install -c bioconda seqfu

CITING

Telatin A, Fariselli P, Birolo G. SeqFu: A Suite of Utilities for the Robust and Reproducible Manipulation of Sequence Files. Bioengineering 2021, 8, 59. https://doi.org/10.3390/bioengineering8050059

AUTHOR

Andrea Telatin <andrea@telatin.com>

COPYRIGHT AND LICENSE

This software is Copyright (c) 2018-2023 by Andrea Telatin.

This is free software, licensed under:

  The MIT (X11) License