Interpretation of multiple probe sets mapping to the same gene in Affymetrix GeneChips
Open Access
- 15 January 2007
- journal article
- research article
- Published by Springer Nature in BMC Bioinformatics
- Vol. 8 (1), 13
- https://doi.org/10.1186/1471-2105-8-13
Abstract
Affymetrix GeneChip technology enables the parallel observations of tens of thousands of genes. It is important that the probe set annotations are reliable so that biological inferences can be made about genes which undergo differential expression. Probe sets representing the same gene might be expected to show similar fold changes/z-scores, however this is in fact not the case. We have made a case study of the mouse Surf4, chosen because it is a gene that was reported to be represented by the same eight probe sets on the MOE430A array by both Affymetrix and Bioconductor in early 2004. Only five of the probe sets actually detect Surf4 transcripts. Two of the probe sets detect splice variants of Surf2. We have also studied the expression changes of the eight probe sets in a public-domain microarray experiment. The transcripts for Surf4 are correlated in time, and similarly the transcripts for Surf2 are also correlated in time. However, the transcripts for Surf4 and Surf2 are not correlated. This proof of principle shows that observations of expression can be used to confirm, or otherwise, annotation discrepancies. We have also investigated groups of probe sets on the RAE230A array that are assigned to the same LocusID, but which show large variances in differential expression in any one of three different experiments on rat. The probe set groups with high variances are found to represent cases of alternative splicing, use of alternative poly(A) signals, or incorrect annotations. Our results indicate that some probe sets should not be considered as unique measures of transcription, because the individual probes map to more than one transcript dependent upon the biological condition. Our results highlight the need for care when assessing whether groups of probe sets all measure the same transcript.Keywords
This publication has 26 references indexed in Scilit:
- Tandem chimerism as a means to increase protein complexity in the human genomeGenome Research, 2005
- A physiogenomic approach to study the regulation of blood pressurePhysiological Genomics, 2005
- Study of stem cell function using microarray experimentsFEBS Letters, 2005
- Detecting false expression signals in high-density oligonucleotide arrays by an in silico approachGenomics, 2004
- Genome sequence of the Brown Norway rat yields insights into mammalian evolutionNature, 2004
- The UCSC Genome Browser DatabaseNucleic Acids Research, 2003
- Initial sequencing and comparative analysis of the mouse genomeNature, 2002
- Microarray data normalization and transformationNature Genetics, 2002
- The Human Genome Browser at UCSCGenome Research, 2002
- BLAT—The BLAST-Like Alignment ToolGenome Research, 2002