The CIC library: a large insert YAC library for genome mapping in Arabidopsis thaliana

Abstract
A new Arabidopsis thaliana (ecotype Columbia) genomic library, the CIC library (for CEPH, INRA and CNRS), has been constructed in yeast artificial chromosomes (YACs). Optimization of plant culture conditions and protoplast preparation allowed the recovery of large amounts of viable protoplasts. Mechanical shearing of DNA was minimized by isolating DNA from protoplasts embedded in agarose. Cloning of large inserts was favored by including two successive size fractionation steps (after partial EcoRI digestion and after ligation with the vector arms), which selected DNA fragments larger than 350 kb. The library consists of 1152 clones with an average insert size of 420 kb. Clones carrying chloroplast DNA and various nuclear repeated sequences have been identified. Twenty-one per cent of the clones were found to contain chloroplast DNA; the remaining clones therefore represent around four nuclear genome equivalents. Clones containing 5S rDNA genes, 18S-25S rDNA sequences and the 180 bp paracentromeric repeated element account for 3.6%, 8.9% and 5.8% of the clones, respectively. Only one clone was found to carry the 160 bp paracentromeric repeated element. Given the smaller size of clones carrying Arabidopsis repeated DNA, the average size of the remaining clones is around 480 kb. The library was screened by PCR amplification using pairs of primers corresponding to sequences dispersed in the genome. Seventy of the 76 primer pairs each identified between one and seven YAC clones, so at least 92% of the genome is represented in the CIC library. A survey of the library for clones containing unlinked DNA sequences indicates that the proportion of chimeric clones is lower than 10%.
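The coverage figures in the abstract can be checked with a short calculation. The sketch below is illustrative only and is not the authors' method. It takes the clone count, average insert size, chloroplast fraction and PCR screening result from the abstract, and it makes two assumptions that are not stated there: a nuclear genome size of about 100 Mb (the constant `ASSUMED_GENOME_KB` is hypothetical) and the same 420 kb average insert for nuclear clones as for the whole library. The expected coverage uses the standard Poisson approximation for random clone libraries, P = 1 - exp(-N), where N is the number of genome equivalents.

```python
import math

# Figures taken from the abstract.
TOTAL_CLONES = 1152
MEAN_INSERT_KB = 420
CHLOROPLAST_FRACTION = 0.21
PRIMER_PAIRS_TESTED = 76
PRIMER_PAIRS_WITH_HITS = 70

# ASSUMPTION (not in the abstract): nuclear genome size of roughly 100 Mb.
ASSUMED_GENOME_KB = 100_000


def nuclear_genome_equivalents(total_clones, mean_insert_kb,
                               chloroplast_fraction, genome_kb):
    """Total nuclear insert DNA divided by the genome size.

    Simplification: nuclear clones are assumed to have the same mean
    insert size as the library as a whole.
    """
    nuclear_clones = total_clones * (1 - chloroplast_fraction)
    return nuclear_clones * mean_insert_kb / genome_kb


def expected_coverage(genome_equivalents):
    """Poisson approximation: chance that a given locus is in >= 1 clone."""
    return 1 - math.exp(-genome_equivalents)


def observed_coverage(hits, tested):
    """Fraction of screened markers that identified at least one clone."""
    return hits / tested


equivalents = nuclear_genome_equivalents(
    TOTAL_CLONES, MEAN_INSERT_KB, CHLOROPLAST_FRACTION, ASSUMED_GENOME_KB
)
expected = expected_coverage(equivalents)
observed = observed_coverage(PRIMER_PAIRS_WITH_HITS, PRIMER_PAIRS_TESTED)

print(f"Nuclear genome equivalents: {equivalents:.2f}")
print(f"Expected coverage (Poisson): {expected:.1%}")
print(f"Observed coverage (PCR screen): {observed:.1%}")
```

Under these assumptions the estimate comes out slightly below four genome equivalents, consistent with the "around four" stated in the abstract, and the observed PCR hit rate of 70 out of 76 corresponds to the reported 92%. A different assumed genome size would scale the number of equivalents proportionally.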