Trim reads for several criteria

Description

This tool is a general trimming tool containing all the trimming methods used in the PRINSEQ package, including the poly-A/T and quality methods. The input data can be in FASTQ or in FASTA format.

Details

Only those trimming methods are used for which you assigned a value. You can use several trimming methods at the same time. For example trimming with parameters Trim left: 3 and Trim right: 10 would produce a reads set where all the reads are trimmed by removing 3 bases form the 5'-end and 10 bases from the 3'-end. For detailed description of the different trimming options, please check the PRINSEQ manual. From the table below, you can check what PRINSEQ command line options the parameter definitions correspond to:

Parameter nameCommand lineDescription
Trim to length-trim_to_lenTrim all reads from the 3-prime-end to result in reads with this length.
Trim left-trim_leftTrim reads at the 5-prime-end by given number of positions.
Trim right-trim.rightTrim reads at the 3-prime-end by given number of positions.
Trim left tails-trim_tail_leftTrim poly-A/T tail with a minimum length of the given value at the 5-prime-end.
Trim right tails-trim_tail_rightTrim poly-A/T tail with a minimum length of the given value at the 3-prime-end.
Trim left poly-N tails-trim_ns_leftTrim poly-N tail with a minimum length of the given value at the 5-prime-end.
Trim right poly-N tails-trim_ns_rightTrim poly-N tail with a minimum length of the given value at the 3-prime-end.
Trim 5-prime-end by quality-trim_qual_leftTrim reads by quality score from the 5-prime-end with the given threshold score.
Trim 3-prime-end by quality-trim_qual_rightTrim reads by quality score from the 3-prime-end with the given threshold score.
The quality score calculation method-trim_qual_typeType of quality score calculation to use
Quality score comparison condition-trim_qual_rule Rule to use to compare quality score threshold to calculated value.
Window for quality calculation-trim_qual_windowThe sliding window size used to calculate quality score by type. To stop at the first base that fails the rule defined, use a window size of 1.
Step size to move the quality window-trim_qual_step Step size used to move the sliding window. To move the window over all quality scores without missing any, the step size should be less or equal to the window size.
Base quality encoding-phred64 Select "Sanger" for Illumina 1.8+, Sanger, Roche/454, Ion Torrent and PacBio data.
Minimum length-min_lenSelect only reads that are longer than the given value after trimming.

Output

By default the tool outputs the reads that pass the trimming conditions. For paired end reads you can also choose to output the singletons to separate files. You can also print out a log file that contains information about the filtering task and statistics about how many reads were accepted and rejected.

Reference

This tool is based on the PRINSEQ package.