Bowtie2 for paired-end reads

Description

This tool uses Bowtie2 software to align paired-end reads to a reference genome or sequence set. You need to supply the reads in two fastq files containing the reads in the same order and a fasta formatted reference sequence.

After running Bowtie2, Chipster converts the alignment file to BAM format, and sorts and indexes it using the SAMtools package.

Mapping strategy

Bowtie2 can map the reads to the reference either by aligning the reads for they full length (end-to-end read alignment) or by using local alignments. Other possibility is to use local read alignment based mapping strategies. In this mode, Bowtie2 might "trim" or "clip" some read characters from one or both ends of the alignment if doing so maximizes the alignment score.

Bowtie2 uses heuristics for mapping the reads to the reference genome. This means that the reported mapping is not guaranteed to be the best mapping. Several Bowtie2 parameters affect to the mapping accuracy. However higher accuracy (sensitivity) also requires more computing time. In Chipster you can choose, for both end-to-end and local alignment strategy, the sensitivity level from a set of pre-defined parameter combinations that allow you to tune the balance between the computing time and mapping sensitivity:

StrategyCorresponding command line options
Very fast -D 5 -R 1 -N 0 -L 22 -i S,0,2.50
Fast -D 10 -R 2 -N 0 -L 22 -i S,0,2.50
Sensitive (default) -D 15 -R 2 -N 0 -L 22 -i S,1,1.15
Very sensitive -D 20 -R 3 -N 0 -L 20 -i S,1,0.50
Very fast local --local -D 5 -R 1 -N 0 -L 25 -i S,1,2.00
Fast local --local -D 10 -R 2 -N 0 -L 22 -i S,1,1.75
Sensitive local --local -D 15 -R 2 -N 0 -L 20 -i S,1,0.75
Very sensitive local --local -D 20 -R 3 -N 0 -L 20 -i S,1,0.50

By default Chipster does the alignment using the end-to-end alignment method with "Sensitive" parameters.

Other parameters

In addition to the mapping strategy, the Bowtie2 tool in Chipster allows users to modify many other mapping and output parameters too. Please check the Bowtie2 manual for more detailed description of these parameters.

ParameterCorresponding command line optionDescription of the parameter
Match bonus--maMatch bonus for a match in local alignment. Default value 2
Maximum penalty for mismatch --mpMaximum penalty for mismatch; lower quality = lower penalty. Default value 6
Penalty for non-ACGTs--npSets penalty for positions where the read, reference, or both, contain an ambiguous character such as N. Default: 1.
Gap opening penalty for the read--rdgGap opening penalty for the reads. Default value: 5.
Gap extension penalty for the reads--rdgGap extension penalty for the reads. Default value: 3.
Gap opening penalty for the reads--rfgGap opening penalty for the reference. Default value: 5.
Gap extension penalty for the reads--rfgGap extension penalty for the reference. Default value: 3.
Minimum insert length--mininsMinimum insert length between the mate pairs. Default value: 0
Maximum insert length--maxinsMaximum insert length between the mate pairs. Default value: 500
Order of mates to align--fr, --rf or --ffThe orientation of the mate pairs. Default: forward/revrse
Suppress unpaired alignments--no-mixedBy default, when bowtie2 cannot find a concordant or discordant alignment for a pair, it then tries to find alignments for the individual mates. This option disables that behavior.
Suppress discordant alignments--no-discordantBy default, bowtie2 looks for discordant alignments if it cannot find any concordant alignments. A discordant alignment is an alignment where both mates align uniquely, but that does not satisfy the paired-end constraints. This option disables that behavior
Not concordant when mates extend past each other--no-dovetailIf the mates "dovetail", that is if one mate alignment extends past the beginning of the other such that the wrong mate begins upstream, consider that to be concordant. Default: mates cannot dovetail in a concordant alignment.
Not concordant when one mate alignment contains other--no-containIf one mate alignment contains the other, consider that to be non-concordant. Default: a mate can contain the other in a concordant alignment.
Not concordant when mates overlap at all--no.overlapIf one mate alignment overlaps the other at all, consider that to be non-concordant. Default: mates can overlap in a concordant alignment.
Quality value format used--phred33, --phred64 or --ignore-qualsQuality scale used in the fastq-file.
How many valid alignments are reported per readnone, -k or --allBy default, Bowtie2 reports only the best alignment of the read (based on the mapping quality\). Optionally, if there are several, equally good alignments, you can choose how many of them should be reported?
Put reads that did not align concordantly to a separate file--un-concIf you choose not to have alignments for reads which map to multiple positions, with this option you can to store these reads to a separate fastq file?
Put unaligned reads to a separate file--unWould you like to store unaligned reads to a new fastq file? Note that also multireads will be added to this file, unless you asked them to be put to a separate file.

Output

This tool returns the alignment in BAM format and an index file for it (.bai). It also produces a log file, which allows you to see what percentage of the reads align with the selected parameter settings. Optionally also fastq files are produced for the unaligned reads, or reads that align discordantly.

Reference

Langmead B, Salzberg S. Fast gapped-read alignment with Bowtie 2. Nature Methods. 2012, 9:357-359.