Moderated statistical tests for assessing differences in tag abundance

Open Access

19 September 2007

journal article
Published by Oxford University Press (OUP) in Bioinformatics

Vol. 23 (21), 2881-2887
https://doi.org/10.1093/bioinformatics/btm453

Abstract

Digital gene expression (DGE) technologies measure gene expression by counting sequence tags. They are sensitive technologies for measuring gene expression on a genomic scale, without the need for prior knowledge of the genome sequence. As the cost of sequencing DNA decreases, the number of DGE datasets is expected to grow dramatically. Various tests of differential expression have been proposed for replicated DGE data using binomial, Poisson, negative binomial or pseudo-likelihood (PL) models for the counts, but none of these are usable when the number of replicates is very small. We develop tests using the negative binomial distribution to model overdispersion relative to the Poisson, and use conditional weighted likelihood to moderate the level of overdispersion across genes. Not only is our strategy applicable even with the smallest number of libraries, but it also proves to be more powerful than previous strategies when more libraries are available. The methodology is equally applicable to other counting technologies, such as proteomic spectral counts. An R package can be accessed from http://bioinf.wehi.edu.au/resources/
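
To make "overdispersion relative to the Poisson" concrete, the following is a minimal Python sketch, not the paper's method or its R package. It assumes the common mean-dispersion parameterization of the negative binomial, in which a count with mean mu and dispersion phi has variance mu + phi * mu^2, so phi = 0 recovers the Poisson. The function names (`nb_log_pmf`, `nb_variance`, `moments`) and the example values are hypothetical, chosen only for illustration, and the sketch does not implement the conditional weighted likelihood moderation described in the abstract.

```python
import math


def nb_log_pmf(y, mu, phi):
    """Log-probability of count y for mean mu and dispersion phi.

    Assumed parameterization: variance = mu + phi * mu**2.
    phi == 0 is treated as the Poisson limit.
    """
    if phi <= 0:
        # Poisson log-pmf: y*log(mu) - mu - log(y!)
        return y * math.log(mu) - mu - math.lgamma(y + 1)
    r = 1.0 / phi  # "size" parameter of the negative binomial
    return (
        math.lgamma(y + r) - math.lgamma(r) - math.lgamma(y + 1)
        + r * math.log(r / (r + mu))
        + y * math.log(mu / (r + mu))
    )


def nb_variance(mu, phi):
    """Theoretical variance under the assumed parameterization."""
    return mu + phi * mu ** 2


def moments(mu, phi, upper=2000):
    """Numerically sum the pmf over 0..upper-1 to get total mass, mean, variance."""
    probs = [math.exp(nb_log_pmf(y, mu, phi)) for y in range(upper)]
    total = sum(probs)
    mean = sum(y * p for y, p in enumerate(probs))
    var = sum((y - mean) ** 2 * p for y, p in enumerate(probs))
    return total, mean, var


if __name__ == "__main__":
    mu = 10.0  # hypothetical expected tag count
    for phi in (0.0, 0.1, 0.5):  # hypothetical dispersion values
        total, mean, var = moments(mu, phi)
        print(f"phi={phi}: mean={mean:.3f}, variance={var:.3f}, "
              f"theory={nb_variance(mu, phi):.3f}")
```

With the same mean, a larger phi gives a larger variance, which is why a Poisson model (phi = 0) tends to understate the variability of replicated tag counts. Estimating phi separately for each gene is unstable when there are very few libraries, which is the situation the moderation step in the abstract is meant to address.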

