Utilities / Remove duplicates from FASTQ

Description

Given a FASTQ file, this tool removes duplicate reads, keeping one copy of each distinct sequence.

Parameters

None

Details

Reads with identical sequences are collapsed into a single record. Each collapsed record is renamed with two numbers: a running number, followed by the number of times that sequence occurred in the input.
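
The collapsing and renaming can be illustrated with a minimal Python sketch. It is not the tool's actual implementation. The function name collapse_fastq is hypothetical, and several details are assumptions this page does not specify: the two numbers are joined with a hyphen, records are numbered in order of first appearance, the quality string of the first occurrence is kept, and the input uses strict four-line FASTQ records.

```python
def collapse_fastq(lines):
    """Collapse identical sequences from an iterable of FASTQ lines.

    Assumptions (not confirmed by the tool documentation):
    - strict 4-line FASTQ records (header, sequence, '+', quality)
    - records are numbered in order of first appearance
    - the quality string of the first occurrence is kept
    - the new name is '<running number>-<count>'
    """
    records = [line.rstrip("\n") for line in lines]
    seen = {}  # sequence -> [count, quality of first occurrence]

    # Walk the input one 4-line record at a time.
    for i in range(0, len(records) - 3, 4):
        seq, qual = records[i + 1], records[i + 3]
        if seq in seen:
            seen[seq][0] += 1
        else:
            seen[seq] = [1, qual]

    # dicts preserve insertion order (Python 3.7+), so numbering
    # follows the order in which each sequence was first seen.
    out = []
    for number, (seq, (count, qual)) in enumerate(seen.items(), start=1):
        out.extend([f"@{number}-{count}", seq, "+", qual])
    return out


reads = [
    "@r1", "ACGT", "+", "IIII",
    "@r2", "TTTT", "+", "HHHH",
    "@r3", "ACGT", "+", "FFFF",
]
collapsed = collapse_fastq(reads)
print("\n".join(collapsed))
```

In this example the sequence ACGT occurs twice and TTTT once, so the output contains two records, named 1-2 and 2-1.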

Output

A FASTQ file containing unique reads.

Reference

This tool is based on the FASTA/Q Collapser tool from the FASTX-Toolkit.