Annotation contributors

The GO Consortium integrates resources from a variety of research groups, from model organism and protein databases to biological research communities actively involved in the development and implementation of the Gene Ontology. Curators, developers, and others from multiple groups work to maintain the GO Knowledgebase. Below are alphabetical lists of our current and past collaborators.

Current groups contributing GO annotations

Name Description Contact  
The Arabidopsis Information Resource (TAIR) Database of genetic and molecular biology data for the model higher plant Arabidopsis thaliana TAIR  
Berkeley Bioinformatics Open-source Projects (BBOP) Development, use, and integration of ontologies into biological data analysis; software development BBOP  
Candida Genome Database (CGD) CGD provides online access to genomic sequence data and manually curated functional information about genes and proteins of the human pathogen Candida albicans and related species CGD  
Community Assessment of Community Annotation with Ontologies (CACAO), EcoWiki The Community Assessment of Community Annotation with Ontologies (CACAO) is a competition for teams of undergrads around the world to improve the functional annotation of genes; CACAO was developed and is currently run at Texas A&M University CACAO, EcoliWiki  
ComplexPortal The Complex Portal is a manually curated, encyclopaedic resource of macromolecular complexes from a number of key model organisms. ComplexPortal  
Critical Assessment of Functional Annotation An ongoing, global, community-driven effort to evaluate and improve the computational annotation of protein function CAFA  
dictyBase Informatics resource for the social amoeba Dictyostelium discoideum and related species dictyBase  
DisProt The database of intrinsically disordered proteins DisProt  
EcoCyc EcoCyc is a scientific database for the bacterium Escherichia coli K-12 MG1655 EcoCyc  
Ensembl Predicted annotation for vertebrates based on orthology Ensembl contact  
EnsemblFungi Predicted annotation for fungi based on orthology EnsemblFungi contact  
EnsemblPlants/Gramene Predicted annotation for plants based on orthology EnsemblPlants contact  
FlyBase A Database of Drosophila Genes & Genomes FlyBase contact  
GREEKC COST Action dedicated to the construction of a high quality and interoperable Knowledge Commons that covers the area of Gene Regulation information.    
HUGO Gene Nomenclature Committee The HGNC is responsible for approving unique symbols and names for human loci, including protein coding genes, ncRNA genes and pseudogenes, to allow unambiguous scientific communication HGNC  
The Human Protein Atlas A Swedish-based program initiated in 2003 with the aim to map all the human proteins in cells, tissues, and organs using an integration of various omics technologies, including antibody-based imaging, mass spectrometry-based proteomics, transcriptomics, and systems biology HPA  
IntAct molecular interaction database High-quality, manually annotated binary protein interactions and curated complexes IntAct  
InterPro InterPro provides functional analysis of proteins by classifying them into families and predicting domains and important sites InterPro  
JaponicusDB JaponicusDB provides comprehensive structural and functional annotation, literature curation and access to large-scale data sets for Schizosaccharomyces japonicus. The S. japonicus community maintains JaponicusDB, led by the Francis Crick Institute and King’s College London, with contributions from PomBase (University of Cambridge) JaponicusDB  
The Matrisome Project Open access resource which aims to support and facilitate ECM research by sharing detailed protocols, tools, and datasets with the scientific community.  Matrisome contact information  
Mouse Genome Informatics (MGI) Informatics resources for Laboratory Mouse and related Human Biology data MGI  
PANTHER The PANTHER (Protein ANalysis THrough Evolutionary Relationships) knowledgebase supports biomedical and other research by providing comprehensive information about the evolution of protein-coding gene families, particularly protein phylogeny, function and genetic variation impacting that function PANTHER  
PomBase PomBase is a comprehensive database for the fission yeast Schizosaccharomyces pombe, providing structural and functional annotation, literature curation and access to large-scale data sets PomBase  
Rat Genome Database (RGD) Database for the rat Rattus norvegicus RGD  
Reactome A knowledgebase of biological processes (formerly Genome Knowledgebase) Reactome  
RHEA Rhea is an expert-curated knowledgebase of chemical and transport reactions of biological interest - and the standard for enzyme and transporter annotation in UniProtKB RHEA  
RNACentral RNAcentral is a free, public resource that offers integrated access to a comprehensive and up-to-date set of non-coding RNA sequences provided by a collaborating group of Expert Databases representing a broad range of organisms and RNA types RNACentral  
Saccharomyces Genome Database (SGD) The Saccharomyces Genome Database (SGD) provides comprehensive integrated biological information for the budding yeast Saccharomyces cerevisiae SGD  
SignatureScience Collaboration with GO on annotation of genes with pathogenic potential    
The Synapse Gene Ontology and Annotation Initiative Interactive knowledgebase that accumulates available research about synapse biology using Gene Ontology SynGO  
UniProt-Gene Ontology Annotation (UniProt-GOA) Manual and electronic annotation of proteins in the UniProt Knowledgebase, carried out by a wide range of curation activities at the European Bioinformatics Institute, the SIB Swiss Institute of Bioinformatics and the Protein Information Resource UniProt contact  
University College London Functional Gene Annotation Manual annotation of human proteins and microRNAs involved in cardiovascular and dementia-relevant processes UCL  
WormBase WormBase is an international consortium of biologists and computer scientists dedicated to providing the research community with accurate, current, accessible information concerning the genetics, genomics and biology of Caenorhabditis elegans and related nematodes WormBase  
Xenbase Xenbase’s mission is to provide the international research community with a comprehensive, integrated and easy to use web based resource that gives access the diverse and rich genomic, expression and functional data available from Xenopus research Xenbase  
Zebrafish Information Network (ZFIN) The Zebrafish Model Organism Database is the community resource for genetic and genomic data involving zebrafish Danio rerio ZFIN  

Funding of contributing groups

As described in the most recent GO publication, PMID:41413728.

  • The core funding for the GOC is from the National Human Genome Research Institute (U41HG002273, U24HG012212).
  • Curation activities supported by National Human Genome Research Institute grants U24HG002659 (ZFIN), U24HG002223 (WormBase), U41HG000739 (FlyBase), U24HG001315 (SGD), U24HG000330 (MGD), U24HG012198 (Reactome curation), U24HG011851 (Reactome - GO harmonization) and grant R01HL064541 from the National Heart, Lung and Blood Institute (RGD).
  • Additional funding for GO curation at FlyBase is provided by UK Medical Research Council Award MR/W024233/1.
  • PomBase is supported by Wellcome Trust 218236/Z/19/Z. Xenbase is supported by grant P41 HD064556 from the Eunice Kennedy Shriver National Institute of Child Health and Human Development.
  • Functional Gene Annotation, University College London is supported by National Institute for Health Research University College London Hospitals Biomedical Research Centre.
  • Planteome and Plant Reactome are supported by USDA-ARS, and DARPA awards.
  • Some software development was funded by U24HG010859 (Alliance of Genome Resources Central).
  • The TAIR project is funded by academic, institutional, corporate, and individual subscriptions; TAIR is administered by the 501(c)(3) non-profit Phoenix Bioinformatics.
  • Chris Mungall, Seth Carbon, and Sierra Moxon were supported in part by Director, Office of Science, Office of Basic Energy Sciences of the U.S. Department of Energy Contract No. DE-AC02-05CH11231.
  • Matrisome is supported in part by the National Human Genome Research Institute (NHGRI) of the National Institutes of Health and the National Institutes of Health Common Fund through the Office of Strategic Coordination/Office of the NIH Director (U01HG012680).
  • UniProt is funded by National Human Genome Research Institute (NHGRI), Office of Director [OD/DPCPSI/ODSS]; National Institute of Allergy and Infectious Diseases (NIAID), National Institute on Aging (NIA), National Institute of General Medical Sciences (NIGMS), National Institute of Diabetes and Digestive and Kidney Diseases (NIDDK), National Eye Institute (NEI), National Cancer Institute (NCI), National Heart, Lung, and Blood Institute (NHLBI) of the National Institutes of Health [U24HG007822]; Biotechnology and Biological Sciences Research Council [BB/T010541/1, BB/S01781X/1]; Open Targets; Swiss Federal Government through the State Secretariat for Education, Research and Innovation SERI; European Molecular Biology Laboratory core funds.

Past GO Consortium contributing groups

These groups have contributed to the GO Consortium in the past. For questions on these annotations, please contact the GO Central helpdesk.

Name Description Funding Contact  
AgBase A curated, open-source, web-accessible resource for functional analysis of agricultural plant and animal gene products   [AgBase]  
Alzheimer’s Project at the University of Toronto Curation of genes associated with Alzheimer’s disease that have been significant in previous genome wide association studies      
ASAP A Systematic Annotation Package for Community Analysis of Genomes      
Aspergillus Genome Database (AspGD) AspGD is an organized collection of genetic and molecular biological information about the filamentous fungi of the genus Aspergillus      
AstraZeneca AstraZeneca is a global, science-led biopharmaceutical business.      
GeneDB Curation for the whole range of organisms sequenced by the Sanger Institute’s Pathogen Genomics group; sunsetted in 2022      
Gramene A comparative plant genomics resource for model and reference genomes of more than 30 plant genomes. It provides information for genomes, gene models, gene annotations, genome and gene tree alignments, synteny, genetic variation, gene expression, and a knowledgebase for plant pathways (Plant Reactome). The National Science Foundation (NSF) supported this work through NSF Plant Genome Initiative grant award #0321685 during the years 2004-2007, NSF award #0851652 (REU Bioinformatics and Computational Biology Summer Undergraduate Program) in 2009-2012, and since 2007, through NSF Plant Genome Research Resource grant award #0703908. Current work is being supported by NSF Improving Plant Genome Annotation grant award #1127112 (Gramene - Exploring Function through Comparative Genomics and Network Analysis). More information Gramene web submission form  
J Craig Venter Institute Databases on several bacterial species; formerly known as The Institute for Genomic Research (TIGR).      
Microbial ENergy processes Gene Ontology Project (MENGO) The MENGO project is a multi-institutional collaborative effort that aims to develop new Gene Ontology terms to describe microbial bioenergy related processes Office of Science (BER), U.S. Department of Energy.    
Norwegian University of Science and Technology, Systems Biology team Development of procedures and approaches to build high quality knowledge sources (the ‘Knowledge Commons’) for understanding gene regulation processes   NTNU_SB  
Plant-Associated Microbe Gene Ontology (PAMGO) Consortium A multi-institutional collaborative effort involving scientists working on plant pathogenic genomes: the bacteria Dickeya dadantii, Pseudomonas syringae pv tomato and Agrobacterium tumefaciens, the fungus Magnaporthe grisea, the oomycetes Phytophthora sojae and Phytophthora ramorum and the nematode Meloidogyne hapla      
Pseudomonas Genome Database (PseudoCAP) A resource for annotations for the Pseudomonas aeruginosa PAO1 reference strain’s genome and comparative analyses of several related Pseudomonas species Cystic Fibrosis Foundation Therapeutics Inc. PseudoCAP  
Renal Gene Ontology Annotation Initiative  European Bioinformatics Institute Kidney Research UK    
SGN The Sol Genomics Network (SGN) is a clade-oriented database dedicated to the biology of the Solanaceae family.   SGN  
SYSCILIA The European project SYSCILIA is a systems biology approach to dissect cilia function and its disruption in human genetic disease European Community’s Seventh Framework Programme (FP7/2007-2013) under the Health Cooperation Programme. SYSCILIA  
Tetrahymena Genome Database (TGD) Database of information about the Tetrahymena thermophila genome sequence determined at The Institute for Genomic Research (TIGR)   TGD