Variants / Call SNPs and INDELs with SAMtools

Description

Calls SNPs and short INDELs for one or more diploid individuals.

Parameters

Details

You can provide your own reference sequence in FASTA format or choose one of the provided reference genomes. If you use the reference genomes provided by Chipster, please indicate the style of chromosome naming in your BAM file (e.g. chr1 or 1).

Alignment records are grouped by sample identifiers in @RG header lines. If sample identifiers are absent, each input file is regarded as one sample.

Maximum read depth should be adjusted to about twice the average read depth.

You can tailor the FORMAT filed in the output VCF by adding per sample number of high-quality bases (DP) and high-quality non-reference bases (DV). Also the INFO fields can be modified: By default only AC, DP, DP4, INDEL and IDV fields are included in the output. If you select yes, all the samtools' default INFO fields are included. These are AC, AN, BQB, DP, DP4, HDB, ICB, IDV, IMF, INDEL, MQ, MQB, MQOF, MQSB, RPB, SGB and VDB.

Output

Output is a VCF file. You can view VCF files in Chipster as text (which includes the header) or spreadsheet (which can be shorted). The location column in the spreadsheet can be used in Chipster genome browser to navigate quickly through a list of variants.

References

This tool is based on the SAMtools package. Please cite the article The Sequence alignment/map (SAM) format and SAMtools by Li H., Handsaker B., Wysoker A., Fennell T., Ruan J., Homer N., Marth G., Abecasis G., Durbin R. and 1000 Genome Project Data Processing Subgroup (2009) Bioinformatics, 25, 2078-9. [PMID: 19505943].