chimeraslayer - detects likely chimeras in PCR amplified DNA
ChimeraSlayer is a chimeric sequence detection utility, compatible
with near-full length Sanger sequences and shorter 454-FLX sequences
(~500bp).
Chimera Slayer involves the following series of steps that operate
to flag chimeric 16S rRNA sequences:
- 1.
- the ends of a query sequence are searched against an included database of
reference chimera-free 16S sequences to identify potential parents of a
chimera
- 2.
- candidate parents of a chimera are selected as those that form a branched
best scoring alignment to the NAST-formatted query sequence
- 3.
- the NAST alignment of the query sequence is improved in a
‘chimera-aware’ profile-based NAST realignment to the
selected reference parent sequences
- 4.
- an evolutionary framework is used to flag query sequences found to exhibit
greater sequence homology to an in silico chimera formed between any two
of the selected reference parent sequences.
To run Chimera Slayer, you need NAST-formatted sequences generated
by the nast-ier utility.
ChimeraSlayer is part of the microbiomeutil suite.
- --query_NAST
- multi-fasta file containing query sequences in alignment format
- --db_NAST
- db in NAST format (default:
/usr/share/microbiomeutil-data/RESOURCES/rRNA16S.gold.NAST_ALIGNED.fasta)
- --db_FASTA
- db in fasta format (megablast formatted) (default:
/usr/share/microbiomeutil-data/RESOURCES/rRNA16S.gold.fasta)
- -n
- number of top matching database sequences to compare to (default 15)
- -R
- min divergence ratio default: 1.007
- -P
- min percent identity among matching sequences (default: 90)
Scoring parameters:
- -M
- match score (default: +5)
- -N
- mismatch penalty (default: -4)
- -Q
- min query coverage by matching database sequence (default: 70)
- -T
- maximum traverses of the multiple alignment (default: 1)
http://microbiomeutil.sourceforge.net/#A_CS
This manual page was written by Andreas Tille
<tille@debian.org> but can be freely used for any other
distribution.