Cross-references to external classification systems

Many Gene Ontology terms are cross-referenced to corresponding concepts from a number of external vocabularies, including Enzyme Commission numbers, KEGG, Reactome Pathways, and Wikipedia. Please report any errors or suggest alternatives to the GO helpdesk.

Using and citing cross-references and mappings

If you have used a mapping in a publication or presentation, please ensure that you cite both the GO knowledgebase and the source of the mapping (detailed below). See the GO citation guide for citing the GO knowledgebase.

Cross-references format

  • Header describing the date generated and the GO release used, for example:

    ! Generated on 2018-10-05T08:40Z from the ontology 'go' with data version: 'releases/YYYY-MM-DD'

  • Each cross-reference is on a different line, in the format:

    database:term identifier (id/name) > GO:GO term name ; GO:id

    For example:

    EC:1.1.1.1 > GO:alcohol dehydrogenase (NAD) activity ; GO:0004022

  • Note that cross-references to GO mappings can be many-to-many.

Mappings file directory

Direct access to the mappings file directory is available here: https://current.geneontology.org/ontology/external2go/.

Cross-references maintained by the GO Consortium

Those cross-references are maintained by GO editors. Please report issues in the GO GitHub tracker.

Mapping Entities mapped Direct access to mapping file
EC: Enzyme Commission EC numbers ec2go
EAWAG-BBD Enzymes um-bbd_enzymeid2go
EAWAG-BBD Pathways um-bbd_pathwayid2go
EAWAG-BBD Reactions um-bbd_reactionid2go
KEGG: Kyoto Encyclopaedia of Genes and Genomes Pathways and reactions   kegg_reaction2go
MetaCyc Pathways and reactions metacyc2go
Reactome Events and catalyst activities reactome2go
Rhea Biochemical reactions rhea2go

Cross-references maintained by external groups

Note: Those files are regenerated during the bi-monthly UniProtKB-GOA release. Please report issues in the GO GitHub tracker.

Mapping Direct access to mapping file
HAMAP: High-Quality Automated and Manual Annotation of Microbial Proteomes - Families
Maintainted by the UniProtKB team
PMID:18849571
hamap2go
InterPro - Protein families, domains and functional sites
Maintainted by the InterPro team
PMID:18940856
interpro2go
Pfam domains
Maintainted by the InterPro team
PMID:18940856
pfam2go
PIRSF - Protein superfamilies
Maintainted by the InterPro team
PMID:18940856
pirsf2go
PRINTS - Domains
Maintainted by the InterPro team
PMID:18940856
pirsf2go
ProDom - Domains
Maintainted by the InterPro team
PMID:18940856
prodom2go
ProSite - Domains
Maintainted by the InterPro team
PMID:18940856
prosite2go
Rfam - RNA families
rfam2go
SMART - Domains
Maintainted by the InterPro team
PMID:29040681
smart2go
UniProt Keywords
Mapping of GO terms to UniProt Knowledgebase keywords.
Maintainted by the UniProt and UniProtKB-GOA
PMID:25378336
uniprotkb_kw2go
UniProt Subcellular location
Mapping of GO terms to UniProt Subcellular location terms.
Maintainted by the UniProt and UniProtKB-GOA
PMID:25378336
uniprotkb_sl2go
UniRules
Mapping of GO terms to UniProt Subcellular location terms.
Maintainted by the UniProt team
unirule2go

Archived cross-references

Mapping Last update Direct access to mapping file
COG - Functional categories
Clusters of Orthologous Groups (COG)
Constructed by Michael Ashburner and Jane Lomax
June 2004 cog2go
EGAD: Expressed Gene Anatomy Database
Constructed by Michael Ashburner
Oct 2000 egad2go
GenProtEC: E. coli Genome and Proteome functional categories
Constructed by Heather Butler and Michael Ashburner
Dec 2000 genprotec2go
TIGRFAM: JCVI protein families
JCVI (TIGRFAM) protein families
Constructed by Michelle Gwinn and other TIGR staff
NA tigrfams2go
JCVI roles
JCVI roles
Constructed by Michael Ashburner
Jan 2004 tigr2go
MIPS FunCat
MIPS Functional Catalogue (FunCat)
Constructed by Michael Ashburner and Midori Harris
NA mips2go
MultiFun cell function assignment schema classifications
Constructed by Michael Ashburner, Jane Lomax and Margrethe Hauge Serres
Dec 2005 multifun2go