Given a fasta-formatted alignment file, removes chimeric sequences.
Removes chimeric sequences using the UCHIME or the (faster) VSEARCH method. Chimeras are sequences formed from two or more biological sequences joined together. Amplicons with chimeric sequences can form during PCR when closely related sequences are amplified. The majority of chimeras are believed to arise from incomplete extension: During subsequent cycles of PCR, a partially extended strand can bind to a template derived from a different but similar sequence. This then acts as a primer that is extended to form a chimeric sequence.
You can use the 16S rRNA Silva gold bacterial set as a reference, or you can detect chimeras de novo using the more abundant sequences in your samples as a reference. De novo detection uses the more abundant sequences from the same sample to check the query sequence. When a sequence is flagged as chimeric in one sample, it can be removed from only that sample by setting dereplicate = true, or from all samples by setting dereplicate = false. If you use the Silva reference, you should be aware of their dual-use license: Silva is free for academic users, but commercial users need to purchase a license.
This tool is based on the Chimera.vsearch, Chimera.uchime, and Remove.seqs commands of the Mothur package.
The analysis output consists of the following:
Schloss PD, Westcott SL, Ryabin T, Hall JR, Hartmann M, Hollister EB, Lesniewski RA, Oakley BB, Parks DH, Robinson CJ, Sahl JW, Stres B, Thallinger GG, Van Horn DJ, Weber CF. Introducing mothur: Open-source, platform-independent, community-supported software for describing and comparing microbial communities. Appl Environ Microbiol, 2009. 75(23):7537-41.