Normalization of microarray data using a spatial mixed model analysis which includes splines

Abstract
Motivation: Microarray experiments, with thousands of genes on a slide and multiple slides in each experimental set, generate a large body of data with many sources of variation. Identifying these sources of variation is critical for correctly interpreting the gene expression differences of interest.
Results: We describe new methods for the normalization and analysis of two-colour spotted microarrays, using spatial mixed models that include splines to account for variation within a slide and across a series of slides. The model typically explains 45–85% of the variation on a slide using only ∼1% of the total degrees of freedom. The results compare favourably with those from intensity-dependent loess normalization: our methods accounted for twice as much of the uncontrolled and unwanted variation on the slides. We have also developed an index for each EST that combines the various measures of differential response into a single value, which researchers can use to rapidly assess the genes of interest.
Availability: GenStat code is available from the first author.
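To make the "variation explained per degree of freedom" idea concrete, the following is a minimal Python sketch and not the GenStat mixed model described above. It assumes a hypothetical 100 x 100 slide of simulated log-ratios with an invented smooth row and column trend, and it swaps the spline-based spatial terms for a much simpler additive row-plus-column fit. All names, grid sizes, and coefficients are illustrative assumptions.

```python
import random


def simulate_slide(n_rows, n_cols, seed=0):
    """Simulated log-ratios: an invented smooth spatial trend plus spot noise."""
    rng = random.Random(seed)
    return [
        [0.02 * r - 0.015 * c + rng.gauss(0.0, 0.5) for c in range(n_cols)]
        for r in range(n_rows)
    ]


def additive_fit(slide):
    """Least-squares fit of grand mean + row effect + column effect.

    On a complete grid this fit is row mean + column mean - grand mean.
    It is a simple stand-in for spatial terms, not a spline mixed model.
    """
    n_rows, n_cols = len(slide), len(slide[0])
    grand = sum(sum(row) for row in slide) / (n_rows * n_cols)
    row_mean = [sum(row) / n_cols for row in slide]
    col_mean = [
        sum(slide[r][c] for r in range(n_rows)) / n_rows for c in range(n_cols)
    ]
    fitted = [
        [row_mean[r] + col_mean[c] - grand for c in range(n_cols)]
        for r in range(n_rows)
    ]
    return grand, fitted


def variance_explained(slide, grand, fitted):
    """Proportion of the total sum of squares removed by the fitted values."""
    ss_total = sum((v - grand) ** 2 for row in slide for v in row)
    ss_resid = sum(
        (v - f) ** 2
        for row, fitted_row in zip(slide, fitted)
        for v, f in zip(row, fitted_row)
    )
    return 1.0 - ss_resid / ss_total


n_rows, n_cols = 100, 100  # hypothetical slide layout
slide = simulate_slide(n_rows, n_cols)
grand, fitted = additive_fit(slide)

r2 = variance_explained(slide, grand, fitted)
# Grand mean + (n_rows - 1) row effects + (n_cols - 1) column effects.
df_used = n_rows + n_cols - 1
df_fraction = df_used / (n_rows * n_cols)

print(f"variation explained: {r2:.1%}")
print(f"degrees of freedom used: {df_used} ({df_fraction:.2%} of spots)")

# Residuals are the spatially adjusted log-ratios in this toy setting.
normalized = [
    [v - f + grand for v, f in zip(row, fitted_row)]
    for row, fitted_row in zip(slide, fitted)
]
```

In this toy setting a few hundred spatial parameters absorb a large share of the slide-level variation across 10,000 spots, which is the kind of trade-off the Results statement summarizes. The figures printed by the sketch depend entirely on the simulated trend and noise and should not be compared with the reported 45–85%.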