Analysis of the genome sequence of the flowering plant Arabidopsis thaliana

Samir Kaul*, Hean L. Koo, Jennifer Jenkins, Michael Rizzo, Timothy Rooney, Luke J. Tallon, Tamara Feldblyum, William Nierman, Maria Ines Benito, Xiaoying Lin, Christopher D. Town, J. Craig Venter, Claire M. Fraser, Satoshi Tabata, Yasukazu Nakamura, Takakazu Kaneko, Shusei Sato, Erika Asamizu, Tomohiko Kato, Hirokazu Kotani & 136 others Shigemi Sasamoto, Joseph R. Ecker, Athanasios Theologis, Nancy A. Federspiel, Curtis J. Palm, Brian I. Osborne, Paul Shinn, Ken Dewar, Christopher J. Kim, Eugen Buehler, Patrick Dunn, Qimin Chao, Huaming Chen, Athanasios Theologis, Brian I. Osborne, Valentina S. Vysotskaia, Catherine A. Lenz, Christopher J. Kim, Nancy F. Hansen, Shirley X. Liu, Eugen Buehler, Hootan Alta, Hitomi Sakano, Patrick Dunn, Bao Lam, Paul K. Pham, Qimin Chao, Michelle Nguyen, Guixia Yu, Huaming Chen, Audrey Southwick, Jeong Mi Lee, Molly Miranda, Mitsue J. Toriumi, Ronald W. Davis, Nancy A. Federspiel, Curtis J. Palm, Aaron B. Conway, Lane Conn, Nancy F. Hansen, Altafi Hootan, Bao Lam, R. Wambutt, G. Murphy, A. Düsterhöft, W. Stiekema, T. Pohl, K. D. Entian, N. Terryn, G. Volckaert, M. Salanoubat, N. Choisne, F. Artiguenave, J. Weissenbach, F. Quetier, M. Rieger, W. Ansorge, M. Unseld, B. Fartmann, G. Valle, Richard K. Wilson, M. Sekhon, K. Pepin, J. Murray, D. Johnson, L. Hillier, Melissa de la Bastide, Emily Huang, Lori Spiegel, Lidia Gnoj, Kristina Habermann, Neilay Dedhia, Larry Parnell, Raymond Preston, M. Marra, W. Richard McCombie, Ellson Chen, Robert Martienssen, Klaus Mayer, Kai Lemcke, Brian Haas, Dirk Haase, Stephen Rudd, Heiko Schoof, Dimitrij Frishman, Burkhard Morgenstern, Paulo Zaccaria, Hans Werner Mewes, Owen White, Todd H. Creasy, Cord Bielke, Rama Maiti, Jeremy Peterson, Maria Ermolaeva, Mihaela Pertea, John Quackenbush, Natalia Volfovsky, Dongying Wu, Steven L. Salzberg, Michael Bevan, Todd M. Lowe, S. Rounsley, D. Bush, S. Subramaniam, I. Levin, S. Norris, R. Schmidt, A. Acarkan, I. Bancroft, A. Brennicke, J. A. Eisen, T. Bureau, B. A. Legault, Q. H. Le, N. Agrawal, Z. Yu, G. P. Copenhaver, S. Luo, D. Preuss, C. S. Pikaard, I. T. Paulsen, M. Sussman, A. B. Britt, D. A. Selinger, R. Pandey, V. L. Chandler, R. A. Jorgensen, D. W. Mount, C. Pikaard, G. Juergens, E. M. Meyerowitz, J. Dangl, J. D G Jones, M. Chen, J. Chory, C. Somerville

*Corresponding author for this work

Research output: Contribution to journalArticle

6314 Citations (Scopus)

Abstract

The flowering plant Arabidopsis thaliana is an important model system for identifying genes end determining their functions. Here we report the analysis of the genomic sequence of Arabidopsis. The sequenced regions cover 115.4 megabases of the 125-megabase genome and extend into centromeric regions. The evolution of Arabidopsis involved a whole-genome duplication, followed by subsequent gene loss and extensive local gene duplications, giving rise to a dynamic genome enriched by lateral gene transfer from a cyanobacterial-like ancestor of the plastid. The genome contains 25,498 genes encoding proteins from 11,000 families, similar to the functional diversity of Drosophila and Caenorhabditis elegans - the other sequenced multicellular eukaryotes. Arabidopsis has many families of new proteins but also lacks several common protein families, indicating that the sets of common proteins have undergone differential expansion and contraction in the three multicellular eukaryotes. This is the first complete genome sequence of a plant and provides the foundations for more comprehensive comparison of conserved processes in all eukaryotes, identifying a wide range of plant-specific gene functions and establishing rapid systematic ways to identify genes for crop improvement.

Original languageEnglish
Pages (from-to)796-815
Number of pages20
JournalNature
Volume408
Issue number6814
DOIs
Publication statusPublished - 14 Dec 2000
Externally publishedYes

Fingerprint Dive into the research topics of 'Analysis of the genome sequence of the flowering plant Arabidopsis thaliana'. Together they form a unique fingerprint.

Cite this