Preprocessing / Trim reads with Trimmomatic

Description

This tool performs a variety of trimming tasks for Illumina paired end and single end data.

Parameters

Details

Only the trimming steps with user-specified parameters are performed; steps whose parameters are left empty are skipped. The steps are performed in the following order:

  1. Adapter clipping (ILLUMINACLIP)
  2. Trim leading bases by quality (LEADING)
  3. Trim trailing bases by quality (TRAILING)
  4. Number of bases to keep from the start (CROP)
  5. Number of bases to remove from the start (HEADCROP)
  6. Sliding window trimming (SLIDINGWINDOW)
  7. Adaptive quality trimming (MAXINFO)
  8. Minimum average quality of reads to keep (AVGQUAL)
  9. Minimum length of reads to keep (MINLEN)
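
As a rough illustration of the skipping and ordering rules above, the following Python sketch assembles Trimmomatic step arguments from a set of parameters. The function name build_steps and the dictionary layout are hypothetical and are not part of Chipster or Trimmomatic; only the step keywords and the colon-separated step syntax come from Trimmomatic.

```python
# Default step order listed above; the names are Trimmomatic's step keywords.
STEP_ORDER = [
    "ILLUMINACLIP", "LEADING", "TRAILING", "CROP", "HEADCROP",
    "SLIDINGWINDOW", "MAXINFO", "AVGQUAL", "MINLEN",
]


def build_steps(params):
    """Return step arguments in the default order (hypothetical helper).

    `params` maps a step keyword to its value or values. A missing key,
    None, an empty string, or an empty sequence means the parameter was
    left empty, so that step is skipped.
    """
    steps = []
    for name in STEP_ORDER:
        value = params.get(name)
        if value is None or value == "" or value == () or value == []:
            continue
        # Steps with several values (e.g. SLIDINGWINDOW) are given as tuples.
        values = value if isinstance(value, (list, tuple)) else [value]
        steps.append(":".join([name] + [str(v) for v in values]))
    return steps


# The input order does not matter; the empty CROP parameter is skipped.
example = build_steps({
    "MINLEN": 36,
    "SLIDINGWINDOW": (4, 15),
    "LEADING": 3,
    "CROP": "",
})
print(example)  # ['LEADING:3', 'SLIDINGWINDOW:4:15', 'MINLEN:36']
```

The parameter values used here (3, 4:15, 36) are only example settings, not recommendations.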

Adapter clipping is best performed first, because other clipping steps may remove parts of the adapter sequence and make adapters harder to find. You can use the adapter sets available in Chipster (the Trimmomatic basic set of adapters) or your own adapter file in FASTA (.fa) format. Minimum length filtering should be done last; otherwise later clipping steps may produce reads shorter than the specified minimum length. The other steps are less sensitive to the order in which they are performed. If you wish to run the steps in an order that differs from the default, you have to run each step separately.
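
The effect of running the minimum length filter too early can be seen with a toy model. The two functions below are simplified stand-ins for CROP and MINLEN written for this illustration; they are not Trimmomatic's implementation, and the read and thresholds are made up.

```python
def crop(read, length):
    """Toy CROP: keep at most `length` bases from the start of the read."""
    return read[:length]


def minlen(read, length):
    """Toy MINLEN: keep the read only if it has at least `length` bases."""
    return read if len(read) >= length else None


read = "ACGT" * 10  # a made-up 40-base read

# MINLEN first: the 40-base read passes the filter and is then cropped
# to 30 bases, so a read shorter than the minimum ends up in the output.
early = crop(minlen(read, 36), 30)

# MINLEN last: the read is cropped to 30 bases and then dropped by the filter.
late = minlen(crop(read, 30), 36)

print(len(early), late)  # 30 None
```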

For details on each trimming step and their parameters see the Trimmomatic manual.

The tool is based on the Trimmomatic package.

Output

The trimmed reads are in gzipped FASTQ format. The output files depend on whether the trimming was done on single end or paired end reads.

For single end reads:

For paired end reads:

Optionally:

References

This tool uses the Trimmomatic package. Please cite the article:

Bolger, A. M., Lohse, M., & Usadel, B. (2014). Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics, 30(15), 2114-2120. doi:10.1093/bioinformatics/btu170

Please see the Trimmomatic homepage for more details.