Large-Scale Identification of Single-Feature Polymorphisms in Complex Genomes

  1. Justin O. Borevitz1,
  2. David Liang2,
  3. David Plouffe2,
  4. Hur-Song Chang3,
  5. Tong Zhu3,
  6. Detlef Weigel4,
  7. Charles C. Berry5,
  8. Elizabeth Winzeler2,6,8, and
  9. Joanne Chory1,7,8
  1. 1Plant Biology Laboratory, The Salk Institute for Biological Studies, La Jolla, California 92037, USA; 2Genomics Institute of the Novartis Research Foundation, San Diego, California 92121, USA; 3Torrey Mesa Research Institute, San Diego, California 92121, USA; 4Department of Molecular Biology, Max Planck Institute for Developmental Biology, 72076 Tübingen, Germany; 5Department of Family/Preventive Medicine, University of California, San Diego, La Jolla, California 92093, USA; 6The Scripps Research Institute, San Diego, California 92121, USA; 7Howard Hughes Medical Institute, The Salk Institute for Biological Studies, La Jolla, California 92037, USA

Abstract

We have developed a high-throughput genotyping platform by hybridizing genomic DNA from Arabidopsis thaliana accessions to an RNA expression GeneChip (AtGenome1). Using newly developed analytical tools, a large number of single-feature polymorphisms (SFPs) were identified. A comparison of two accessions, the reference strain Columbia (Col) and the strain Landsberg erecta (Ler), identified nearly 4000 SFPs, which could be reliably scored at a 5% error rate. Ler sequence was used to confirm 117 of 121 SFPs and to determine the sensitivity of array hybridization. Features containing sequence repeats, as well as those from high copy genes, showed greater polymorphism rates. A linear clustering algorithm was developed to identify clusters of SFPs representing potential deletions in 111 genes at a 5% false discovery rate (FDR). Among the potential deletions were transposons, disease resistance genes, and genes involved in secondary metabolism. The applicability of this technique was demonstrated by genotyping a recombinant inbred line. Recombination break points could be clearly defined, and in one case delimited to an interval of 29 kb. We further demonstrate that array hybridization can be combined with bulk segregant analysis to quickly map mutations. The extension of these tools to organisms with complex genomes, such as Arabidopsis, will greatly increase our ability to map and clone quantitative trait loci (QTL).

[Supplemental material is available online atwww.genome.org.]

Footnotes

  • 8 Corresponding authors.

  • E-MAIL chory{at}salk.edu; FAX (858) 558-6379.

  • E-MAIL winzeler{at}scripps.edu; FAX (858) 784-9860.

  • Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.541303. Article published online before print in February 2003.

    • Received July 26, 2002.
    • Accepted December 13, 2002.
| Table of Contents

Preprint Server