Current Annotations
Annotation Details and Downloads
The gene association files submitted by GO Consortium members are shown in the tables below. Files are in the GO annotation file format and are compressed using the UNIX gzip utility. Please see the appropriate README file for further details on the annotation set. Any errors or omissions in annotations should be reported by writing to the GO helpdesk.
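As a rough illustration of working with these gzip-compressed, tab-delimited files, the Python sketch below streams one file and yields a few fields per annotation. It assumes the usual GO annotation file layout of at least 15 tab-separated columns, with comment lines starting with `!`; the names `Annotation`, `parse_gaf_lines`, and `read_gaf`, and the file name in the usage note, are hypothetical and chosen only for this example.

```python
import gzip
from typing import Iterable, Iterator, NamedTuple


class Annotation(NamedTuple):
    db: str         # column 1: contributing database
    object_id: str  # column 2: database object ID
    go_id: str      # column 5: GO term ID
    evidence: str   # column 7: evidence code, e.g. IDA or IEA
    taxon: str      # column 13: taxon ID(s)


def parse_gaf_lines(lines: Iterable[str]) -> Iterator[Annotation]:
    """Yield annotations from annotation-file lines, skipping comments."""
    for line in lines:
        line = line.rstrip("\n")
        if not line or line.startswith("!"):
            continue  # blank line or header/comment line
        cols = line.split("\t")
        if len(cols) < 15:
            continue  # assumption: rows with fewer columns are malformed
        yield Annotation(cols[0], cols[1], cols[4], cols[6], cols[12])


def read_gaf(source) -> Iterator[Annotation]:
    """Read a gzip-compressed annotation file from a path or binary file object."""
    with gzip.open(source, "rt", encoding="utf-8") as handle:
        yield from parse_gaf_lines(handle)
```

For example, `for ann in read_gaf("gene_association.example.gz"): print(ann.go_id)` would print one GO ID per annotation row, where the file name is a placeholder for whichever file was downloaded.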
Ontology and annotation data are integrated in the MySQL and XML files. See the GO database guide for more information.
Filtered Files
These files are taxon-specific and reflect the work of specific projects, primarily the model organism database groups, to provide comprehensive, non-redundant annotation files for their organism. All the files in this table have been filtered using the annotation file QC checks script. A major component of the filtering is the requirement that particular taxon IDs can only be included in the association files provided by specific projects; please see the list of the authoritative groups for the major model organisms.
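The taxon rule described above can be sketched in Python as follows. This is not the QC script itself, only a minimal illustration under stated assumptions: rows are already split into at least 13 columns with the taxon in column 13, the `AUTHORITATIVE` table is a hypothetical two-entry stand-in for the real list of authoritative groups, and `filter_by_authority` is a name invented for this example.

```python
from typing import Dict, Iterable, Iterator, List

# Hypothetical authority table: taxon ID -> the only project allowed to
# supply annotations for it. The real list is maintained by the GO Consortium.
AUTHORITATIVE: Dict[str, str] = {
    "taxon:4932": "SGD",   # Saccharomyces cerevisiae
    "taxon:10090": "MGI",  # Mus musculus
}


def filter_by_authority(
    rows: Iterable[List[str]],
    submitting_project: str,
    authoritative: Dict[str, str] = AUTHORITATIVE,
) -> Iterator[List[str]]:
    """Drop rows whose taxon belongs to a different authoritative project."""
    for cols in rows:
        # Column 13 may hold "taxon:A|taxon:B"; the first ID is the
        # organism of the annotated gene product.
        taxon = cols[12].split("|")[0]
        owner = authoritative.get(taxon)
        if owner is None or owner == submitting_project:
            yield cols
```

Taxa that have no authoritative group pass through unchanged; only rows for a taxon claimed by another project are stripped.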
Numbers as of May 12, 2008
| Species, Database | Gene Products Annotated | Annotations | Submission date (MM/DD/YYYY) | Download filtered files |
|---|---|---|---|---|
| Anaplasma phagocytophilum HZ TIGR | 1292 | 3495 (3495 non-IEA) | 3/22/2008 | |
| Agrobacterium tumefaciens str. C58 PAMGO | 31 | 50 (50 non-IEA) | 2/2/2008 | |
| Arabidopsis thaliana TAIR/TIGR | 35596 | 108366 (85808 non-IEA) | 5/8/2008 | |
| Bacillus anthracis Ames TIGR | 5287 | 13160 (13160 non-IEA) | 3/22/2008 | |
| Bos taurus GO Annotations @ EBI | 22843 | 87981 (3274 non-IEA) | 5/7/2008 | |
| Carboxydothermus hydrogenoformans Z-2901 TIGR | 2615 | 6421 (6421 non-IEA) | 3/22/2008 | |
| Caenorhabditis elegans WormBase | 13772 | 81234 (36259 non-IEA) | 5/10/2008 | |
| Campylobacter jejuni RM1221 TIGR | 1833 | 4678 (4678 non-IEA) | 3/22/2008 | |
| Candida albicans CGD | 3728 | 16614 (5423 non-IEA) | 5/8/2008 | |
| Clostridium perfringens ATCC13124 TIGR | 2895 | 7496 (7496 non-IEA) | 3/22/2008 | |
| Colwellia psychrerythraea 34H TIGR | 4810 | 12391 (12194 non-IEA) | 3/22/2008 | |
| Coxiella burnetii RSA 493 TIGR | 2036 | 5191 (5191 non-IEA) | 3/22/2008 | |
| Danio rerio ZFIN | 14312 | 87853 (21292 non-IEA) | 5/12/2008 | |
| Dehalococcoides ethenogenes 195 TIGR | 1584 | 3973 (3973 non-IEA) | 3/22/2008 | |
| Dictyostelium discoideum dictyBase | 6964 | 29032 (17703 non-IEA) | 5/11/2008 | |
| Drosophila melanogaster FlyBase | 12408 | 69540 (53514 non-IEA) | 5/10/2008 | |
| Ehrlichia chaffeensis Arkansas TIGR | 1094 | 2881 (2881 non-IEA) | 3/22/2008 | |
| Gallus gallus GO Annotations @ EBI | 16581 | 61169 (1933 non-IEA) | 5/7/2008 | |
| Geobacter sulfurreducens PCA TIGR | 3417 | 8886 (8886 non-IEA) | 3/22/2008 | |
| Homo sapiens GO Annotations @ EBI | 35551 | 183795 (58069 non-IEA) | 5/10/2008 | |
| Hyphomonas neptunium ATCC 15444 TIGR | 3116 | 7913 (7864 non-IEA) | 3/22/2008 | |
| Leishmania major Sanger GeneDB | 3616 | 11255 (30 non-IEA) | 3/31/2008 | |
| Listeria monocytogenes 4b F2365 TIGR | 2823 | 7048 (7048 non-IEA) | 3/22/2008 | |
| Magnaporthe grisea PAMGO | 12876 | 51711 (29275 non-IEA) | 4/19/2008 | |
| Methylococcus capsulatus Bath TIGR | 2924 | 7065 (7065 non-IEA) | 3/22/2008 | |
| Mus musculus MGI | 18126 | 154630 (65538 non-IEA) | 5/9/2008 | |
| Neorickettsia sennetsu Miyayama TIGR | 930 | 2454 (2454 non-IEA) | 3/22/2008 | |
| Oomycetes PAMGO | 30 | 126 (126 non-IEA) | 2/13/2008 | |
| Oryza sativa Gramene | 52082 | 64119 (64119 non-IEA) | 5/3/2008 | |
| Protein Data Bank [multispecies] GO Annotations @ EBI | 29571 | 154252 (0 non-IEA) | 5/7/2008 | |
| Plasmodium falciparum Sanger GeneDB | 3243 | 11646 (4671 non-IEA) | 3/31/2008 | |
| Pseudomonas aeruginosa PAO1 PseudoCAP | 1519 | 7381 (7381 non-IEA) | 3/22/2008 | |
| Pseudomonas fluorescens Pf-5 TIGR | 4164 | 10744 (9730 non-IEA) | 3/22/2008 | |
| Pseudomonas syringae DC3000 TIGR | 3902 | 9650 (9650 non-IEA) | 3/22/2008 | |
| Pseudomonas syringae pv. phaseolicola 1448A TIGR | 3511 | 9065 (9065 non-IEA) | 3/22/2008 | |
| Rattus norvegicus RGD | 25991 | 197008 (74444 non-IEA) | 5/10/2008 | |
| Saccharomyces cerevisiae SGD | 6348 | 77649 (38241 non-IEA) | 5/10/2008 | |
| Schizosaccharomyces pombe Sanger GeneDB | 5235 | 34664 (28654 non-IEA) | 5/10/2008 | |
| Shewanella oneidensis MR-1 TIGR | 4850 | 13662 (13662 non-IEA) | 3/22/2008 | |
| Silicibacter pomeroyi DSS-3 TIGR | 4257 | 10899 (10899 non-IEA) | 3/22/2008 | |
| Solanaceae SGN | 38 | 68 (68 non-IEA) | 4/26/2008 | |
| Trypanosoma brucei Sanger GeneDB | 3898 | 18858 (10572 non-IEA) | 5/3/2008 | |
| Trypanosoma brucei chr 2 TIGR | 292 | 896 (896 non-IEA) | 2/16/2008 | |
| UniProt [multispecies] GO Annotations @ EBI | 3486636 | 25417941 (22199 non-IEA) | 4/12/2008 | |
| Vibrio cholerae TIGR | 3863 | 9448 (9448 non-IEA) | 5/10/2008 | |
Unfiltered Files
These files have not been filtered with the annotation file QC checks script. The most important difference between these files and the filtered files above is that gene products from certain taxa are not stripped out of the file; they may also contain annotations to obsolete terms or outdated IEA annotations. Please see the annotation file QC script documentation for full details of the checks performed.
Please note that if you use unfiltered files in conjunction with filtered files, there may be duplicated annotations.
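One way to guard against such duplicates when combining files is to key each row on the columns that identify an annotation and keep only the first occurrence. The Python sketch below assumes rows are already split into columns in the standard annotation file order, and that two rows count as duplicates when database, object ID, qualifier, GO ID, reference, evidence code, and with/from all match. That definition of "duplicate" is an assumption made for this example, and `merge_without_duplicates` is a hypothetical name.

```python
from typing import Iterable, Iterator, List, Set, Tuple

# Zero-based indexes of the columns assumed to identify an annotation:
# DB, object ID, qualifier, GO ID, reference, evidence code, with/from.
KEY_COLUMNS = (0, 1, 3, 4, 5, 6, 7)


def merge_without_duplicates(*row_sources: Iterable[List[str]]) -> Iterator[List[str]]:
    """Chain several row sources, yielding each distinct annotation once."""
    seen: Set[Tuple[str, ...]] = set()
    for rows in row_sources:
        for cols in rows:
            key = tuple(cols[i] for i in KEY_COLUMNS)
            if key not in seen:
                seen.add(key)
                yield cols
```

Passing the filtered rows first means the filtered copy of an annotation is the one that is kept. This only catches rows that use the same identifiers; the same gene product annotated under a model organism database ID in one file and a protein accession in another would not be recognised as a duplicate without an ID mapping step.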
Numbers as of May 12, 2008
| Species, Database | Gene Products Annotated | Annotations | Submission date (MM/DD/YYYY) | Download unfiltered files |
|---|---|---|---|---|
| Arabidopsis thaliana GO Annotations @ EBI | 21453 | 84754 (7277 non-IEA) | 5/7/2008 | |
| Mus musculus GO Annotations @ EBI | 33710 | 189108 (65858 non-IEA) | 5/7/2008 | |
| Rattus norvegicus GO Annotations @ EBI | 28241 | 119606 (13397 non-IEA) | 5/7/2008 | |
| Danio rerio GO Annotations @ EBI | 31274 | 112883 (4571 non-IEA) | 5/7/2008 | |
| Protein Data Bank [multispecies] GO Annotations @ EBI | 45700 | 239167 (0 non-IEA) | 5/7/2008 | |
| TIGR Gene Index [multispecies] TIGR | 281994 | 788343 (0 non-IEA) | 10/2/2005 | |
| UniProt [multispecies] GO Annotations @ EBI | 3825597 | 28216336 (467894 non-IEA) | 4/11/2008 | |
In the tables above, annotation counts are given for all evidence codes and, in parentheses, for all codes except IEA (Inferred from Electronic Annotation). The IEA code means there has been no human involvement in the assignment of the association; see the GO evidence code documentation for more details.
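Figures of this kind could be derived from an annotation file roughly as in the Python sketch below. How the published numbers were actually computed is not stated here, so the counting rules are assumptions: a gene product is taken to be a distinct pair of database and object ID (columns 1 and 2), every row counts as one annotation, and a row is non-IEA when its evidence code (column 7) is anything other than `IEA`. The names `Counts`, `summarize`, and `format_counts` are hypothetical.

```python
from typing import Iterable, List, NamedTuple, Set, Tuple


class Counts(NamedTuple):
    gene_products: int
    annotations: int
    non_iea: int


def summarize(rows: Iterable[List[str]]) -> Counts:
    """Count gene products, all annotations, and non-IEA annotations."""
    products: Set[Tuple[str, str]] = set()
    total = 0
    non_iea = 0
    for cols in rows:
        products.add((cols[0], cols[1]))  # DB + object ID
        total += 1
        if cols[6] != "IEA":  # evidence code column
            non_iea += 1
    return Counts(len(products), total, non_iea)


def format_counts(counts: Counts) -> str:
    """Render counts in the same style as the table cells above."""
    return f"{counts.gene_products} | {counts.annotations} ({counts.non_iea} non-IEA)"
```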
gp2protein files
The gp2protein directory contains files that map between model organism database object IDs and UniProt accessions.
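A mapping file of this kind might be loaded as in the Python sketch below. The layout is an assumption, not something this page specifies: each line is taken to hold a database object ID, a tab, and one or more protein accessions separated by semicolons, with `!` marking comment lines. `parse_gp2protein` is a hypothetical name, and the IDs in the usage note are made up.

```python
from typing import Dict, Iterable, List


def parse_gp2protein(lines: Iterable[str]) -> Dict[str, List[str]]:
    """Map each database object ID to its list of protein accessions."""
    mapping: Dict[str, List[str]] = {}
    for line in lines:
        line = line.rstrip("\n")
        if not line or line.startswith("!"):
            continue  # blank line or comment
        parts = line.split("\t")
        if len(parts) < 2 or not parts[1].strip():
            continue  # assumption: object has no mapped accession
        accessions = [acc.strip() for acc in parts[1].split(";") if acc.strip()]
        mapping.setdefault(parts[0], []).extend(accessions)
    return mapping
```

Given the line `"DB:0001\tUniProtKB:P00001;UniProtKB:P00002"`, the result maps `"DB:0001"` to both accessions.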