Annotation contributors
The GO Consortium integrates resources from a variety of research groups, from model organism and protein databases to biological research communities actively involved in the development and implementation of the Gene Ontology. Curators, developers, and others from multiple groups work to maintain the GO Knowledgebase. Below are alphabetical lists of our current and past collaborators.
Current groups contributing GO annotations
| Name | Description | Contact | |
|---|---|---|---|
| The Arabidopsis Information Resource (TAIR) | Database of genetic and molecular biology data for the model higher plant Arabidopsis thaliana | TAIR | |
| Berkeley Bioinformatics Open-source Projects (BBOP) | Development, use, and integration of ontologies into biological data analysis; software development | BBOP | |
| Candida Genome Database (CGD) | CGD provides online access to genomic sequence data and manually curated functional information about genes and proteins of the human pathogen Candida albicans and related species | CGD | |
| Community Assessment of Community Annotation with Ontologies (CACAO), EcoWiki | The Community Assessment of Community Annotation with Ontologies (CACAO) is a competition for teams of undergrads around the world to improve the functional annotation of genes; CACAO was developed and is currently run at Texas A&M University | CACAO, EcoliWiki | |
| ComplexPortal | The Complex Portal is a manually curated, encyclopaedic resource of macromolecular complexes from a number of key model organisms. | ComplexPortal | |
| Critical Assessment of Functional Annotation | An ongoing, global, community-driven effort to evaluate and improve the computational annotation of protein function | CAFA | |
| dictyBase | Informatics resource for the social amoeba Dictyostelium discoideum and related species | dictyBase | |
| DisProt | The database of intrinsically disordered proteins | DisProt | |
| EcoCyc | EcoCyc is a scientific database for the bacterium Escherichia coli K-12 MG1655 | EcoCyc | |
| Ensembl | Predicted annotation for vertebrates based on orthology | Ensembl contact | |
| EnsemblFungi | Predicted annotation for fungi based on orthology | EnsemblFungi contact | |
| EnsemblPlants/Gramene | Predicted annotation for plants based on orthology | EnsemblPlants contact | |
| FlyBase | A Database of Drosophila Genes & Genomes | FlyBase contact | |
| GREEKC | COST Action dedicated to the construction of a high quality and interoperable Knowledge Commons that covers the area of Gene Regulation information. | ||
| HUGO Gene Nomenclature Committee | The HGNC is responsible for approving unique symbols and names for human loci, including protein coding genes, ncRNA genes and pseudogenes, to allow unambiguous scientific communication | HGNC | |
| The Human Protein Atlas | A Swedish-based program initiated in 2003 with the aim to map all the human proteins in cells, tissues, and organs using an integration of various omics technologies, including antibody-based imaging, mass spectrometry-based proteomics, transcriptomics, and systems biology | HPA | |
| IntAct molecular interaction database | High-quality, manually annotated binary protein interactions and curated complexes | IntAct | |
| InterPro | InterPro provides functional analysis of proteins by classifying them into families and predicting domains and important sites | InterPro | |
| JaponicusDB | JaponicusDB provides comprehensive structural and functional annotation, literature curation and access to large-scale data sets for Schizosaccharomyces japonicus. The S. japonicus community maintains JaponicusDB, led by the Francis Crick Institute and King’s College London, with contributions from PomBase (University of Cambridge) | JaponicusDB | |
| The Matrisome Project | Open access resource which aims to support and facilitate ECM research by sharing detailed protocols, tools, and datasets with the scientific community. | Matrisome contact information | |
| Mouse Genome Informatics (MGI) | Informatics resources for Laboratory Mouse and related Human Biology data | MGI | |
| PANTHER | The PANTHER (Protein ANalysis THrough Evolutionary Relationships) knowledgebase supports biomedical and other research by providing comprehensive information about the evolution of protein-coding gene families, particularly protein phylogeny, function and genetic variation impacting that function | PANTHER | |
| PomBase | PomBase is a comprehensive database for the fission yeast Schizosaccharomyces pombe, providing structural and functional annotation, literature curation and access to large-scale data sets | PomBase | |
| Rat Genome Database (RGD) | Database for the rat Rattus norvegicus | RGD | |
| Reactome | A knowledgebase of biological processes (formerly Genome Knowledgebase) | Reactome | |
| RHEA | Rhea is an expert-curated knowledgebase of chemical and transport reactions of biological interest - and the standard for enzyme and transporter annotation in UniProtKB | RHEA | |
| RNACentral | RNAcentral is a free, public resource that offers integrated access to a comprehensive and up-to-date set of non-coding RNA sequences provided by a collaborating group of Expert Databases representing a broad range of organisms and RNA types | RNACentral | |
| Saccharomyces Genome Database (SGD) | The Saccharomyces Genome Database (SGD) provides comprehensive integrated biological information for the budding yeast Saccharomyces cerevisiae | SGD | |
| SignatureScience | Collaboration with GO on annotation of genes with pathogenic potential | ||
| The Synapse Gene Ontology and Annotation Initiative | Interactive knowledgebase that accumulates available research about synapse biology using Gene Ontology | SynGO | |
| UniProt-Gene Ontology Annotation (UniProt-GOA) | Manual and electronic annotation of proteins in the UniProt Knowledgebase, carried out by a wide range of curation activities at the European Bioinformatics Institute, the SIB Swiss Institute of Bioinformatics and the Protein Information Resource | UniProt contact | |
| University College London Functional Gene Annotation | Manual annotation of human proteins and microRNAs involved in cardiovascular and dementia-relevant processes | UCL | |
| WormBase | WormBase is an international consortium of biologists and computer scientists dedicated to providing the research community with accurate, current, accessible information concerning the genetics, genomics and biology of Caenorhabditis elegans and related nematodes | WormBase | |
| Xenbase | Xenbase’s mission is to provide the international research community with a comprehensive, integrated and easy to use web based resource that gives access the diverse and rich genomic, expression and functional data available from Xenopus research | Xenbase | |
| Zebrafish Information Network (ZFIN) | The Zebrafish Model Organism Database is the community resource for genetic and genomic data involving zebrafish Danio rerio | ZFIN |
Funding of contributing groups
As described in the most recent GO publication, PMID:41413728.
- The core funding for the GOC is from the National Human Genome Research Institute (U41HG002273, U24HG012212).
- Curation activities supported by National Human Genome Research Institute grants U24HG002659 (ZFIN), U24HG002223 (WormBase), U41HG000739 (FlyBase), U24HG001315 (SGD), U24HG000330 (MGD), U24HG012198 (Reactome curation), U24HG011851 (Reactome - GO harmonization) and grant R01HL064541 from the National Heart, Lung and Blood Institute (RGD).
- Additional funding for GO curation at FlyBase is provided by UK Medical Research Council Award MR/W024233/1.
- PomBase is supported by Wellcome Trust 218236/Z/19/Z. Xenbase is supported by grant P41 HD064556 from the Eunice Kennedy Shriver National Institute of Child Health and Human Development.
- Functional Gene Annotation, University College London is supported by National Institute for Health Research University College London Hospitals Biomedical Research Centre.
- Planteome and Plant Reactome are supported by USDA-ARS, and DARPA awards.
- Some software development was funded by U24HG010859 (Alliance of Genome Resources Central).
- The TAIR project is funded by academic, institutional, corporate, and individual subscriptions; TAIR is administered by the 501(c)(3) non-profit Phoenix Bioinformatics.
- Chris Mungall, Seth Carbon, and Sierra Moxon were supported in part by Director, Office of Science, Office of Basic Energy Sciences of the U.S. Department of Energy Contract No. DE-AC02-05CH11231.
- Matrisome is supported in part by the National Human Genome Research Institute (NHGRI) of the National Institutes of Health and the National Institutes of Health Common Fund through the Office of Strategic Coordination/Office of the NIH Director (U01HG012680).
- UniProt is funded by National Human Genome Research Institute (NHGRI), Office of Director [OD/DPCPSI/ODSS]; National Institute of Allergy and Infectious Diseases (NIAID), National Institute on Aging (NIA), National Institute of General Medical Sciences (NIGMS), National Institute of Diabetes and Digestive and Kidney Diseases (NIDDK), National Eye Institute (NEI), National Cancer Institute (NCI), National Heart, Lung, and Blood Institute (NHLBI) of the National Institutes of Health [U24HG007822]; Biotechnology and Biological Sciences Research Council [BB/T010541/1, BB/S01781X/1]; Open Targets; Swiss Federal Government through the State Secretariat for Education, Research and Innovation SERI; European Molecular Biology Laboratory core funds.
Past GO Consortium contributing groups
These groups have contributed to the GO Consortium in the past. For questions on these annotations, please contact the GO Central helpdesk.
| Name | Description | Funding | Contact | |
|---|---|---|---|---|
| AgBase | A curated, open-source, web-accessible resource for functional analysis of agricultural plant and animal gene products | [AgBase] | ||
| Alzheimer’s Project at the University of Toronto | Curation of genes associated with Alzheimer’s disease that have been significant in previous genome wide association studies | |||
| ASAP | A Systematic Annotation Package for Community Analysis of Genomes | |||
| Aspergillus Genome Database (AspGD) | AspGD is an organized collection of genetic and molecular biological information about the filamentous fungi of the genus Aspergillus | |||
| AstraZeneca | AstraZeneca is a global, science-led biopharmaceutical business. | |||
| GeneDB | Curation for the whole range of organisms sequenced by the Sanger Institute’s Pathogen Genomics group; sunsetted in 2022 | |||
| Gramene | A comparative plant genomics resource for model and reference genomes of more than 30 plant genomes. It provides information for genomes, gene models, gene annotations, genome and gene tree alignments, synteny, genetic variation, gene expression, and a knowledgebase for plant pathways (Plant Reactome). | The National Science Foundation (NSF) supported this work through NSF Plant Genome Initiative grant award #0321685 during the years 2004-2007, NSF award #0851652 (REU Bioinformatics and Computational Biology Summer Undergraduate Program) in 2009-2012, and since 2007, through NSF Plant Genome Research Resource grant award #0703908. Current work is being supported by NSF Improving Plant Genome Annotation grant award #1127112 (Gramene - Exploring Function through Comparative Genomics and Network Analysis). More information | Gramene web submission form | |
| J Craig Venter Institute | Databases on several bacterial species; formerly known as The Institute for Genomic Research (TIGR). | |||
| Microbial ENergy processes Gene Ontology Project (MENGO) | The MENGO project is a multi-institutional collaborative effort that aims to develop new Gene Ontology terms to describe microbial bioenergy related processes | Office of Science (BER), U.S. Department of Energy. | ||
| Norwegian University of Science and Technology, Systems Biology team | Development of procedures and approaches to build high quality knowledge sources (the ‘Knowledge Commons’) for understanding gene regulation processes | NTNU_SB | ||
| Plant-Associated Microbe Gene Ontology (PAMGO) | Consortium A multi-institutional collaborative effort involving scientists working on plant pathogenic genomes: the bacteria Dickeya dadantii, Pseudomonas syringae pv tomato and Agrobacterium tumefaciens, the fungus Magnaporthe grisea, the oomycetes Phytophthora sojae and Phytophthora ramorum and the nematode Meloidogyne hapla | |||
| Pseudomonas Genome Database (PseudoCAP) | A resource for annotations for the Pseudomonas aeruginosa PAO1 reference strain’s genome and comparative analyses of several related Pseudomonas species | Cystic Fibrosis Foundation Therapeutics Inc. | PseudoCAP | |
| Renal Gene Ontology Annotation Initiative | European Bioinformatics Institute | Kidney Research UK | ||
| SGN | The Sol Genomics Network (SGN) is a clade-oriented database dedicated to the biology of the Solanaceae family. | SGN | ||
| SYSCILIA | The European project SYSCILIA is a systems biology approach to dissect cilia function and its disruption in human genetic disease | European Community’s Seventh Framework Programme (FP7/2007-2013) under the Health Cooperation Programme. | SYSCILIA | |
| Tetrahymena Genome Database (TGD) | Database of information about the Tetrahymena thermophila genome sequence determined at The Institute for Genomic Research (TIGR) | TGD |