SolexaQA: At-a-glance quality assessment of Illumina second-generation sequencing data
Top Cited Papers
Open Access
- 27 September 2010
- journal article
- software
- Published by Springer Nature in BMC Bioinformatics
- Vol. 11 (1), 1-6
- https://doi.org/10.1186/1471-2105-11-485
Abstract
Illumina's second-generation sequencing platform is playing an increasingly prominent role in modern DNA and RNA sequencing efforts. However, rapid, simple, standardized and independent measures of run quality are currently lacking, as are tools to process sequences for use in downstream applications based on read-level quality data. We present SolexaQA, a user-friendly software package designed to generate detailed statistics and at-a-glance graphics of sequence data quality both quickly and in an automated fashion. This package contains associated software to trim sequences dynamically using the quality scores of bases within individual reads. The SolexaQA package produces standardized outputs within minutes, thus facilitating ready comparison between flow cell lanes and machine runs, as well as providing immediate diagnostic information to guide the manipulation of sequence data for downstream analyses.Keywords
This publication has 8 references indexed in Scilit:
- The Sanger FASTQ file format for sequences with quality scores, and the Solexa/Illumina FASTQ variantsNucleic Acids Research, 2009
- Sequencing technologies — the next generationNature Reviews Genetics, 2009
- PIQA: pipeline for Illumina G1 genome analyzer data quality assessmentBioinformatics, 2009
- Probabilistic base calling of Solexa sequencing dataBMC Bioinformatics, 2008
- Substantial biases in ultra-short read data sets from high-throughput DNA sequencingNucleic Acids Research, 2008
- TileQC: A system for tile-based quality control of Solexa dataBMC Bioinformatics, 2008
- Velvet: Algorithms for de novo short read assembly using de Bruijn graphsGenome Research, 2008
- Matrix2png: a utility for visualizing matrix dataBioinformatics, 2003