We thank D. Leja for providing graphical expertise and support. Funding support is acknowledged from the following sources: National Institutes of Health, the European Union BioSapiens NoE, Affymetrix, Swiss National Science Foundation, the Spanish Ministerio de Educación y Ciencia, Spanish Ministry of Education and Science, CIBERESP, Genome Spain and Generalitat de Catalunya, Ministry of Education, Culture, Sports, Science and Technology of Japan, the NCCR Frontiers in Genetics, the Jérôme Lejeune Foundation, the Childcare Foundation, the Novartis Foundations, the Danish Research Council, the Swedish Research Council, the Knut and Alice Wallenberg Foundation, the Wellcome Trust, the Howard Hughes Medical Institute, the Bio-X Institute, the RIKEN Institute, the US Army, National Science Foundation, the Deutsche Forschungsgemeinschaft, the Austrian Gen-AU program, the BBSRC and the European Molecular Biology Laboratory. We thank the Barcelona SuperComputing Center and the NIH Biowulf cluster for computer facilities. The Consortium thanks the ENCODE Scientific Advisory Panel for their advice on the project: G. Weinstock, M. Cherry, G. Churchill, M. Eisen, S. Elgin, J. Lis, J. Rine, M. Vidal and P. Zamore.
Analysis Coordination
Ewan Birney*,1, John A. Stamatoyannopoulos*,2, Anindya Dutta*,3, Roderic Guigó*,4, 5, Thomas R. Gingeras*,6, Elliott H. Margulies*,7, Zhiping Weng*,8, 9, Michael Snyder*,10, 11, Emmanouil T. Dermitzakis*,12;
Chromatin and Replication
John A. Stamatoyannopoulos*,2, Robert E. Thurman2, 13, Michael S. Kuehn2, 13, Christopher M. Taylor3, Shane Neph2, Christoph M. Koch12, Saurabh Asthana14, Ankit Malhotra3, Ivan Adzhubei14, Jason A. Greenbaum15, Robert M. Andrews12, Paul Flicek1, Patrick J. Boyle3, Hua Cao13, Nigel P. Carter12, Gayle K. Clelland12, Sean Davis16, Nathan Day2, Pawandeep Dhami12, Shane C. Dillon12, Michael O. Dorschner2, Heike Fiegler12, Paul G. Giresi17, Jeff Goldy2, Michael Hawrylycz18, Andrew Haydock2, Richard Humbert2, Keith D. James12, Brett E. Johnson13, Ericka M. Johnson13, Tristan T. Frum13, Elizabeth R. Rosenzweig13, Neerja Karnani3, Kirsten Lee2, Gregory C. Lefebvre12, Patrick A. Navas13, Fidencio Neri2, Stephen C. J. Parker15, Peter J. Sabo2, Richard Sandstrom2, Anthony Shafer2, David Vetrie12, Molly Weaver2, Sarah Wilcox12, Man Yu13, Francis S. Collins7, Job Dekker19, Jason D. Lieb17, Thomas D. Tullius15, Gregory E. Crawford20, Shamil Sunyaev14, William S. Noble2, Ian Dunham12, Anindya Dutta*,3;
Genes and Transcripts
Roderic Guigó*,4, 5, France Denoeud5, Alexandre Reymond21, 22, Philipp Kapranov6, Joel Rozowsky11, Deyou Zheng11, Robert Castelo5, Adam Frankish12, Jennifer Harrow12, Srinka Ghosh6, Albin Sandelin23, Ivo L. Hofacker24, Robert Baertsch25, 26, Damian Keefe1, Paul Flicek1, Sujit Dike6, Jill Cheng6, Heather A. Hirsch27, Edward A. Sekinger27, Julien Lagarde5, Josep F. Abril5, 28, Atif Shahab29, Christoph Flamm24, 30, Claudia Fried30, Jörg Hackermüller31, Jana Hertel30, Manja Lindemeyer30, Kristin Missal30, 32, Andrea Tanzer24, 30, Stefan Washietl24, Jan Korbel11, Olof Emanuelsson11, Jakob S. Pedersen26, Nancy Holroyd12, Ruth Taylor12, David Swarbreck12, Nicholas Matthews12, Mark C. Dickson33, Daryl J. Thomas25, 26, Matthew T. Weirauch25, James Gilbert12, Jorg Drenkow6, Ian Bell6, XiaoDong Zhao34, K.G. Srinivasan34, Wing-Kin Sung34, Hong Sain Ooi34, Kuo Ping Chiu34, Sylvain Foissac4, Tyler Alioto4, Michael Brent35, Lior Pachter36, Michael L. Tress37, Alfonso Valencia37, Siew Woh Choo34, Chiou Yu Choo34, Catherine Ucla22, Caroline Manzano22, Carine Wyss22, Evelyn Cheung6, Taane G. Clark38, James B. Brown39, Madhavan Ganesh6, Sandeep Patel6, Hari Tammana6, Jacqueline Chrast21, Charlotte N. Henrichsen21, Chikatoshi Kai23, Jun Kawai23, 40, Ugrappa Nagalakshmi10, Jiaqian Wu10, Zheng Lian41, Jin Lian41, Peter Newburger42, Xueqing Zhang42, Peter Bickel43, John S. Mattick44, Piero Carninci40, Yoshihide Hayashizaki23, 40, Sherman Weissman41, Emmanouil T. Dermitzakis*,12, Elliott H. Margulies*,7, Tim Hubbard12, Richard M. Myers33, Jane Rogers12, Peter F. Stadler24, 30, 45, Todd M. Lowe25, Chia-Lin Wei34, Yijun Ruan34, Michael Snyder*,10, 11, Ewan Birney*,1, Kevin Struhl27, Mark Gerstein11, 46, 47, Stylianos E. Antonarakis22, Thomas R. Gingeras*,6;
Integrated Analysis and Manuscript Preparation
James B. Brown39, Paul Flicek1, Yutao Fu8, Damian Keefe1, Ewan Birney*,1, France Denoeud5, Mark Gerstein11, 46, 47, Eric D. Green7, 48, Philipp Kapranov6, Ulaf Karaöz8, Richard M. Myers33, William S. Noble2, Alexandre Reymond21, 22, Joel Rozowsky11, Kevin Struhl27, Adam Siepel25, 26,$, John A. Stamatoyannopoulos*,2, Christopher M. Taylor3, James Taylor49, 50, Robert E. Thurman2, 13, Thomas D. Tullius15, Stefan Washietl24, Deyou Zheng11;
Management Group
Laura A. Liefer51, Kris A. Wetterstrand51, Peter J. Good51, Elise A. Feingold51, Mark S. Guyer51, Francis S. Collins52;
Multi-species Sequence Analysis
Elliott H. Margulies*,7, Gregory M. Cooper33,%, George Asimenos53, Daryl J. Thomas25, 26, Colin N. Dewey54, Adam Siepel25, 26,$, Ewan Birney*,1, Damian Keefe1, Minmei Hou49, 50, James Taylor49, 50, Sergey Nikolaev22, Juan I. Montoya-Burgos55, Ari Löytynoja1, Simon Whelan1,¶, Fabio Pardi1, Tim Massingham1, James B. Brown39, Haiyan Huang43, Nancy R. Zhang43, 56, Peter Bickel43, Ian Holmes57, James C. Mullikin7, 48, Abel Ureta-Vidal1, Benedict Paten1, Michael Seringhaus11, Deanna Church58, Kate Rosenbloom26, W. James Kent25, 26, NISC Comparative Sequencing Program‡, Baylor College of Medicine Human Genome Sequencing Center‡, Washington University Genome Sequencing Center‡, Broad Institute‡, Children’s Hospital Oakland Research Institute‡, Mark Gerstein11, 46, 47, Stylianos E. Antonarakis22, Serafim Batzoglou53, Nick Goldman1, Ross C. Hardison50, 59, David Haussler25, 26, 60, Webb Miller49, 50, 61, Lior Pachter36, Eric D. Green7, 48, Arend Sidow33, 62;
‡ A list of participants and affiliations appears below
NISC Comparative Sequencing Program
Gerard G. Bouffard7, 48, Xiaobin Guan48, Nancy F. Hansen48, Jacquelyn R. Idol7, Valerie V.B. Maduro7, Baishali Maskeri48, Jennifer C. McDowell48, Morgan Park48, Pamela J. Thomas48, Alice C. Young48, and Robert W. Blakesley7, 48;
Baylor College of Medicine Human Genome Sequencing Center
Donna M. Muzny63, Erica Sodergren63, David A. Wheeler63, Kim C. Worley63, Huaiyang Jiang63, George M. Weinstock63, and Richard A. Gibbs63;
Washington University Genome Sequencing Center
Tina Graves64, Robert Fulton64, Elaine R. Mardis64, and Richard K. Wilson64;
Broad Institute
Michele Clamp65, James Cuff65, Sante Gnerre65, David B. Jaffe65, Jean L. Chang65, Kerstin Lindblad-Toh65, and Eric S. Lander65, 66;
Children’s Hospital Oakland Research Institute
Maxim Koriabine67, Mikhail Nefedov67, Kazutoyo Osoegawa67, Yuko Yoshinaga67, Baoli Zhu67, and Pieter J. de Jong67;
Transcriptional Regulatory Elements
Zhiping Weng*,8, 9, Nathan D. Trinklein33,#, Yutao Fu8, Zhengdong D. Zhang11, Ulaf Karaöz8, Leah Barrera68, Rhona Stuart68, Deyou Zheng11, Srinka Ghosh6, Paul Flicek1, David C. King50, 59, James Taylor49, 50, Adam Ameur69, Stefan Enroth69, Mark C. Bieda70, Christoph M. Koch12, Heather A. Hirsch27, Chia-Lin Wei34, Jill Cheng6, Jonghwan Kim71, Akshay A. Bhinge71, Paul G. Giresi17, Nan Jiang72, Jun Liu34, Fei Yao34, Wing-Kin Sung34, Kuo Ping Chiu34, Vinsensius B. Vega34, Charlie W.H. Lee34, Patrick Ng34, Atif Shahab29, Edward A. Sekinger27, Annie Yang27, Zarmik Moqtaderi27, Zhou Zhu27, Xiaoqin Xu70, Sharon Squazzo70, Matthew J. Oberley73, David Inman73, Michael A. Singer72, Todd A. Richmond72, Kyle J. Munn72, 74, Alvaro Rada-Iglesias74, Ola Wallerman74, Jan Komorowski69, Gayle K. Clelland12, Sarah Wilcox12, Shane C. Dillon12, Robert M. Andrews12, Joanna C. Fowler12, Phillippe Couttet12, Keith D. James12, Gregory C. Lefebvre12, Alexander W. Bruce12, Oliver M. Dovey12, Peter D. Ellis12, Pawandeep Dhami12, Cordelia F. Langford12, Nigel P. Carter12, David Vetrie12, Philipp Kapranov6, David A. Nix6, Ian Bell6, Sandeep Patel6, Joel Rozowsky11, Ghia Euskirchen10, Stephen Hartman10, Jin Lian41, Jiaqian Wu10, Alexander E. Urban10, Peter Kraus10, Sara Van Calcar68, Nate Heintzman68, Tae Hoon Kim68, Kun Wang68, Chunxu Qu68, Gary Hon68, Rosa Luna75, Christopher K. Glass75, M. Geoff Rosenfeld75, Shelley Force Aldred33,#, Sara J. Cooper33, Anason Halees8, Jane M. Lin9, Hennady P. Shulha9, Xiaoling Zhang8, Mousheng Xu8, Jaafar N. S. Haidar9, Yong Yu9, Ewan Birney*,1, Sherman Weissman41, Yijun Ruan34, Jason D. Lieb17, Vishwanath R. Iyer71, Roland D. Green72, Thomas R. Gingeras*,6, Claes Wadelius74, Ian Dunham12, Kevin Struhl27, Ross C. Hardison50, 59, Mark Gerstein11, 46, 47, Peggy J. Farnham70, Richard M. Myers33, Bing Ren68, Michael Snyder*,10, 11;
UCSC Genome Browser
Daryl J. Thomas25, 26, Kate Rosenbloom26, Rachel A. Harte26, Angie S. Hinrichs26, Heather Trumbower26, Hiram Clawson26, Jennifer Hillman-Jackson26, Ann S. Zweig26, Kayla Smith26, Archana Thakkapallayil26, Galt Barber26, Robert M. Kuhn26, Donna Karolchik26, David Haussler25, 26, 60, W. James Kent25, 26;
Variation
Emmanouil T. Dermitzakis*,12, Lluis Armengol76, Christine P. Bird12, Taane G. Clark38, Gregory M. Cooper33,%, Paul I. W. de Bakker77, Andrew D. Kern26, Nuria Lopez-Bigas5, Joel D. Martin50, 59, Barbara E. Stranger12, Daryl J. Thomas25, 26, Abigail Woodroffe78, Serafim Batzoglou53, Eugene Davydov53, Antigone Dimas12, Eduardo Eyras5, Ingileif B. Hallgrímsdóttir79, Ross C. Hardison50, 59, Julian Huppert12, Arend Sidow33, 62, James Taylor49, 50, Heather Trumbower26, Michael C. Zody77, Roderic Guigó*,4, 5, James C. Mullikin7, Gonçalo R. Abecasis78, Xavier Estivill76, 80 and Ewan Birney*,1.
* Co-Chairs of the ENCODE analysis groups, and corresponding authors (E-mail: Ewan Birney, birney/at/ebi.ac.uk; John A. Stamatoyannopoulos, jstam/at/u.washington.edu; Anindya Dutta, ad8q/at/virginia.edu; Roderic Guigó, rguigo/at/imim.es; Thomas R. Gingeras, Tom_Gingeras/at/affymetrix.com; Elliott H. Margulies, elliott/at/nhgri.nih.gov; Zhiping Weng, zhiping/at/bu.edu; Michael Snyder, michael.snyder/at/yale.edu; Emmanouil T. Dermitzakis, md4/at/sanger.ac.uk)
% Current Address: Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington 98195, USA.
$ Current Address: Department of Biological Statistics & Computational Biology, Cornell University, Ithaca, New York 14853, USA.
¶ Current Address: Faculty of Life Sciences, University of Manchester, Michael Smith Building, Oxford Road, Manchester, M13 9PT, UK.
# Current Address: SwitchGear Genomics, 1455 Adams Drive, Suite 2015, Menlo Park, California 94025, USA.
- EMBL-European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK.
- Department of Genome Sciences, 1705 NE Pacific Street, Box 357730, University of Washington, Seattle, Washington 98195, USA.
- Department of Biochemistry and Molecular Genetics, Jordan 1240, Box 800733, 1300 Jefferson Park Ave, University of Virginia School of Medicine, Charlottesville, Virginia 22908, USA.
- Genomic Bioinformatics Program, Center for Genomic Regulation, C/Dr. Aiguader 88, Barcelona Biomedical Research Park Building, 08003 Barcelona, Catalonia, Spain.
- Research Group in Biomedical Informatics, Institut Municipal d’Investigació Mèdica/Universitat Pompeu Fabra, C/Dr. Aiguader 88, Barcelona Biomedical Research Park Building, 08003 Barcelona, Catalonia, Spain.
- Affymetrix, Inc., Santa Clara, California 95051, USA.
- Genome Technology Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland 20892, USA.
- Bioinformatics Program, Boston University, 24 Cummington St., Boston, Massachusetts 02215, USA.
- Biomedical Engineering Department, Boston University, 44 Cummington St., Boston, Massachusetts 02215, USA.
- Department of Molecular, Cellular, and Developmental Biology, Yale University, New Haven, Connecticut 06520, USA.
- Department of Molecular Biophysics and Biochemistry, Yale University, PO Box 208114, New Haven, Connecticut 06520, USA.
- The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK.
- Division of Medical Genetics, 1705 NE Pacific Street, Box 357720, University of Washington, Seattle, Washington 98195, USA.
- Division of Genetics, Brigham and Women’s Hospital and Harvard Medical School, 77 Avenue Louis Pasteur, Boston, Massachusetts 02115, USA.
- Department of Chemistry and Program in Bioinformatics, Boston University, 590 Commonwealth Ave, Boston, Massachusetts 02215, USA.
- Genetics Branch, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, Maryland 20892, USA.
- Department of Biology and Carolina Center for Genome Sciences, CB# 3280, 202 Fordham Hall, The University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, USA.
- Allen Institute for Brain Sciences, 551 N. 34th Street, Seattle, Washington 98103, USA.
- Program in Gene Function and Expression and Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical School, 364 Plantation Street, Worcester, Massachusetts 01605, USA.
- Institute for Genome Sciences & Policy and Department of Pediatrics, 101 Science Drive, Duke University, Durham, North Carolina 27708, USA.
- Center for Integrative Genomics, University of Lausanne, Genopode building, 1015 Lausanne, Switzerland.
- Department of Genetic Medicine and Development, University of Geneva Medical School, 1211 Geneva, Switzerland.
- Genome Exploration Research Group, RIKEN Genomic Sciences Center (GSC), RIKEN Yokohama Institute, 1-7-22, Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa, 230-0045, Japan.
- Institute for Theoretical Chemistry, University of Vienna, Währingerstraße 17, A-1090 Wien, Austria.
- Department of Biomolecular Engineering, University of California, Santa Cruz, 1156 High Street, Santa Cruz, California 95064, USA.
- Center for Biomolecular Science and Engineering, Engineering 2, Suite 501, Mail Stop CBSE/ITI, University of California, Santa Cruz, California 95064, USA.
- Department of Biological Chemistry & Molecular Pharmacology, Harvard Medical School, 240 Longwood Avenue, Boston, Massachusetts 02115, USA.
- Department of Genetics, Facultat de Biologia, Universitat de Barcelona, Av Diagonal, 645, 08028, Barcelona, Catalonia, Spain.
- Bioinformatics Institute, 30 Biopolis Street, #07-01 Matrix, Singapore, 138671, Singapore.
- Bioinformatics Group, Department of Computer Science, University of Leipzig, Härtelstraße 16-18, D-04107 Leipzig, Germany.
- Fraunhofer Institut für Zelltherapie und Immunologie - IZI, Deutscher Platz 5e, D-04103 Leipzig, Germany.
- Interdisciplinary Center of Bioinformatics, University of Leipzig, Härtelstraße 16-18, D-04107 Leipzig, Germany.
- Department of Genetics, Stanford University School of Medicine, Stanford, California 94305, USA.
- Genome Institute of Singapore, 60 Biopolis Street, Singapore 138672, Singapore.
- Laboratory for Computational Genomics, Washington University, Campus Box 1045, Saint Louis, Missouri 63130, USA.
- Department of Mathematics and Computer Science, University of California, Berkeley, California 94720, USA.
- Spanish National Cancer Research Centre, CNIO, Madrid, Spain, and BioSapiens NoE.
- Department of Epidemiology and Public Health, Imperial College, St Mary’s Campus, Norfolk Place, London W2 1PG, UK.
- Department of Applied Science & Technology, University of California, Berkeley, California 94720, USA.
- Genome Science Laboratory, Discovery and Research Institute, RIKEN Wako Institute, 2-1 Hirosawa, Wako, Saitama, 351-0198, Japan.
- Department of Genetics, Yale University School of Medicine, 333 Cedar Street, New Haven, Connecticut 06510, USA.
- Department of Pediatrics, University of Massachusetts Medical School, 55 Lake Avenue North, Worcester, Massachusetts 01605, USA.
- Department of Statistics, University of California, Berkeley, California 94720, USA.
- Institute for Molecular Bioscience, University of Queensland, St. Lucia, QLD 4072, Australia.
- The Santa Fe Institute, 1399 Hyde Park Road, Santa Fe, New Mexico 87501, USA.
- Department of Computer Science, Yale University, PO Box 208114, New Haven, Connecticut 06520-8114, USA.
- Program in Computational Biology & Bioinformatics, Yale University, PO Box 208114, New Haven, Connecticut 06520-8114, USA.
- NIH Intramural Sequencing Center, National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland 20892, USA.
- Department of Computer Science and Engineering, The Pennsylvania State University, University Park, Pennsylvania 16802, USA.
- Center for Comparative Genomics and Bioinformatics, Huck Institutes for Life Sciences, The Pennsylvania State University, University Park, Pennsylvania 16802, USA.
- Division of Extramural Research, National Human Genome Research Institute, National Institutes of Health, 5635 Fishers Lane, Suite 4076, Bethesda, Maryland 20892-9305, USA.
- Office of the Director, National Human Genome Research Institute, 31 Center Drive, Suite 4B09, Bethesda, Maryland 20892-2152, USA.
- Department of Computer Science, Stanford University, Stanford, California 94305, USA.
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, 6720 MSC, 1300 University Ave, Madison, Wisconsin 53706, USA.
- Department of Zoology and Animal Biology, Faculty of Sciences, University of Geneva, Switzerland.
- Department of Statistics, Stanford University, Stanford, California 94305, USA.
- Department of Bioengineering, University of California, Berkeley, California 94720-1762, USA.
- National Center for Biotechnology Information, National Institutes of Health, Bethesda, Maryland 20894, USA.
- Department of Biochemistry and Molecular Biology, Huck Institutes of Life Sciences, The Pennsylvania State University, University Park, Pennsylvania 16802, USA.
- Howard Hughes Medical Institute, University of California, Santa Cruz, California 95064, USA.
- Department of Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA.
- Department of Pathology, Stanford University School of Medicine, Stanford, California 94305, USA.
- Human Genome Sequencing Center and Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas 77030, USA.
- Genome Sequencing Center, Washington University School of Medicine, Campus Box 8501, 4444 Forest Park Avenue, Saint Louis, Missouri 63108, USA.
- Broad Institute of Harvard University and Massachusetts Institute of Technology, 320 Charles Street, Cambridge, Massachusetts 02141, USA.
- Whitehead Institute for Biomedical Research, 9 Cambridge Center, Cambridge, Massachusetts 02142, USA.
- Children’s Hospital Oakland Research Institute, BACPAC Resources, 747 52nd Street, Oakland, California 94609, USA.
- Ludwig Institute for Cancer Research, 9500 Gilman Drive, La Jolla, California 92093-0653, USA.
- The Linnaeus Centre for Bioinformatics, Uppsala University, BMC, Box 598, SE-75124 Uppsala, Sweden.
- Department of Pharmacology and the Genome Center, University of California, Davis, California 95616, USA.
- Institute for Cellular & Molecular Biology, The University of Texas at Austin, 1 University Station A4800, Austin, Texas 78712, USA.
- NimbleGen Systems, Inc., 1 Science Court, Madison, Wisconsin 53711, USA.
- University of Wisconsin Medical School, Madison, Wisconsin 53706, USA.
- Department of Genetics and Pathology, Rudbeck Laboratory, Uppsala University, SE-75185 Uppsala, Sweden.
- University of California, San Diego School of Medicine, 9500 Gilman Drive, La Jolla, California 92093, USA.
- Genes and Disease Program, Center for Genomic Regulation, C/Dr. Aiguader 88, Barcelona Biomedical Research Park Building, 08003 Barcelona, Catalonia, Spain.
- Broad Institute of MIT and Harvard, 7 Cambridge Center, Cambridge, Massachusetts 02142, USA.
- Center for Statistical Genetics, Department of Biostatistics, SPH II, 1420 Washington Heights, Ann Arbor, Michigan 48109-2029, USA.
- Department of Statistics, University of Oxford, Oxford, UK.
- Universitat Pompeu Fabra, C/Dr. Aiguader 88, Barcelona Biomedical Research Park Building, 08003 Barcelona, Catalonia, Spain.