GENCODE

Statistics about the Mouse GENCODE Reference Release Set

* The statistics are derived from the GTF files that contain only the annotation of the main chromosomes.

For details about the calculation of these statistics please see the README_stats.txt file.
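As a rough illustration of how per-biotype gene counts of this kind could be tallied from a GTF file, the sketch below counts `gene` features by their `gene_type` attribute. The sample lines, the function name `count_gene_biotypes`, and the use of the `gene_type` attribute are assumptions made for illustration only; README_stats.txt remains the authoritative description of how the statistics are calculated.

```python
import re
from collections import Counter

# Hypothetical sample lines in GTF layout (tab-separated, 9 columns).
# These are illustrative and are not taken from a real annotation file.
SAMPLE_GTF = "\n".join([
    "##description: illustrative sample",
    'chr1\tHAVANA\tgene\t100\t900\t.\t+\t.\tgene_id "G1"; gene_type "protein_coding";',
    'chr1\tHAVANA\ttranscript\t100\t900\t.\t+\t.\tgene_id "G1"; gene_type "protein_coding";',
    'chr2\tHAVANA\tgene\t50\t400\t.\t-\t.\tgene_id "G2"; gene_type "lincRNA";',
    'chr3\tENSEMBL\tgene\t10\t700\t.\t+\t.\tgene_id "G3"; gene_type "protein_coding";',
])

# Assumption: the biotype is stored in an attribute named gene_type.
GENE_TYPE = re.compile(r'gene_type "([^"]+)"')


def count_gene_biotypes(lines):
    """Count 'gene' features per biotype, skipping comments and other features."""
    counts = Counter()
    for line in lines:
        if not line.strip() or line.startswith("#"):
            continue
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 9 or fields[2] != "gene":
            continue
        match = GENE_TYPE.search(fields[8])
        if match:
            counts[match.group(1)] += 1
    return counts


counts = count_gene_biotypes(SAMPLE_GTF.splitlines())
for biotype, n in sorted(counts.items()):
    print(biotype, n)
```

Counting transcripts instead would follow the same pattern with the feature column set to `transcript` and, presumably, a transcript-level biotype attribute.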



Version M12 (August 2016 freeze, GRCm38) - Ensembl 87

General stats

Total number of genes: 49585
Protein-coding genes: 21973
Long non-coding RNA genes: 10481
Small non-coding RNA genes: 6111
Pseudogenes: 10524
   - processed pseudogenes: 7486
   - unprocessed pseudogenes: 2625
   - unitary pseudogenes: 34
   - polymorphic pseudogenes: 77
   - pseudogenes: 99
Immunoglobulin/T-cell receptor gene segments:
   - protein coding segments: 494
   - pseudogenes: 203
 
 
Total number of transcripts: 122968
Protein-coding transcripts: 54250
   - full length protein-coding: 42226
   - partial length protein-coding: 12024
Nonsense mediated decay transcripts: 5843
Long non-coding RNA loci transcripts: 14610

Total number of distinct translations: 42187
Genes that have more than one distinct translation: 9633
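The M12 summary figures can be cross-checked against their own sub-categories. The sketch below is an arithmetic check only, using values copied from the general stats for this version. The dictionary keys are hypothetical names, and the observation that the pseudogene total matches only when the immunoglobulin/T-cell receptor pseudogenes are added is inferred from the numbers, not from README_stats.txt.

```python
# Values copied from the M12 general stats; key names are illustrative.
m12 = {
    "protein_coding_transcripts": 54250,
    "full_length": 42226,
    "partial_length": 12024,
    "pseudogenes": 10524,
    "processed": 7486,
    "unprocessed": 2625,
    "unitary": 34,
    "polymorphic": 77,
    "other_pseudogenes": 99,
    "ig_tr_pseudogenes": 203,
}

# Full-length plus partial-length should give the protein-coding transcript total.
full_plus_partial = m12["full_length"] + m12["partial_length"]

# Sum of the five listed pseudogene sub-categories.
pseudogene_parts = sum(
    m12[key]
    for key in ("processed", "unprocessed", "unitary", "polymorphic", "other_pseudogenes")
)

# Assumption inferred from the arithmetic: the total also includes IG/TR pseudogenes.
with_ig_tr = pseudogene_parts + m12["ig_tr_pseudogenes"]

print("transcripts consistent:", full_plus_partial == m12["protein_coding_transcripts"])
print("pseudogene sub-categories:", pseudogene_parts)
print("including IG/TR pseudogenes:", with_ig_tr, "reported:", m12["pseudogenes"])
```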


Further details on this version's gene and transcript types

biotype genes transcripts
3prime_overlapping_ncRNA 2 3
antisense 2419 3618
bidirectional_promoter_lncRNA 61 118
IG_C_gene 13 18
IG_C_pseudogene 1 1
IG_D_gene 19 19
IG_D_pseudogene 3 3
IG_J_gene 14 14
IG_LV_gene 4 4
IG_pseudogene 2 2
IG_V_gene 218 301
IG_V_pseudogene 155 155
lincRNA 4255 6489
macro_lncRNA 1 2
miRNA 2202 2202
misc_RNA 564 566
Mt_rRNA 2 2
Mt_tRNA 22 22
non_stop_decay 0 20
nonsense_mediated_decay 0 5843
polymorphic_pseudogene 77 89
processed_pseudogene 7289 7290
processed_transcript 753 14151
protein_coding 21973 54250
pseudogene 99 124
retained_intron 0 18001
ribozyme 22 22
rRNA 354 354
scaRNA 51 51
scRNA 1 1
sense_intronic 270 294
sense_overlapping 25 50
snoRNA 1508 1508
snRNA 1383 1383
sRNA 2 2
TEC 2695 2773
TR_C_gene 8 10
TR_D_gene 4 4
TR_J_gene 70 70
TR_J_pseudogene 10 10
TR_V_gene 144 194
TR_V_pseudogene 34 34
transcribed_processed_pseudogene 197 201
transcribed_unitary_pseudogene 8 8
transcribed_unprocessed_pseudogene 191 212
translated_processed_pseudogene 0 12
unitary_pseudogene 26 26
unprocessed_pseudogene 2434 2442
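The summary categories in the general stats appear to be sums over groups of the biotypes in this table. The sketch below shows one grouping that reproduces the reported M12 totals for long non-coding RNA genes (10481) and small non-coding RNA genes (6111). The grouping is an assumption inferred from the arithmetic, not a definition taken from README_stats.txt, and the variable names are illustrative.

```python
# Gene counts copied from the M12 biotype table (genes column only).
biotype_genes = {
    "3prime_overlapping_ncRNA": 2, "antisense": 2419,
    "bidirectional_promoter_lncRNA": 61, "lincRNA": 4255, "macro_lncRNA": 1,
    "processed_transcript": 753, "sense_intronic": 270,
    "sense_overlapping": 25, "TEC": 2695,
    "miRNA": 2202, "misc_RNA": 564, "Mt_rRNA": 2, "Mt_tRNA": 22,
    "ribozyme": 22, "rRNA": 354, "scaRNA": 51, "scRNA": 1,
    "snoRNA": 1508, "snRNA": 1383, "sRNA": 2,
}

# Assumed groupings, chosen because they match the reported summary totals.
LNCRNA_BIOTYPES = [
    "3prime_overlapping_ncRNA", "antisense", "bidirectional_promoter_lncRNA",
    "lincRNA", "macro_lncRNA", "processed_transcript", "sense_intronic",
    "sense_overlapping", "TEC",
]
SMALL_NCRNA_BIOTYPES = [
    "miRNA", "misc_RNA", "Mt_rRNA", "Mt_tRNA", "ribozyme", "rRNA",
    "scaRNA", "scRNA", "snoRNA", "snRNA", "sRNA",
]

lncrna_total = sum(biotype_genes[b] for b in LNCRNA_BIOTYPES)
small_ncrna_total = sum(biotype_genes[b] for b in SMALL_NCRNA_BIOTYPES)

print("long non-coding RNA genes:", lncrna_total)
print("small non-coding RNA genes:", small_ncrna_total)
```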

Version M1 (July 2011 freeze, NCBIM37) - Ensembl 65

General stats

Total number of genes: 37310
Protein-coding genes: 22380
Long non-coding RNA genes: 3845
Small non-coding RNA genes: 5395
Pseudogenes: 5209
   - processed pseudogenes: 3837
   - unprocessed pseudogenes: 902
   - unitary pseudogenes: 2
   - polymorphic pseudogenes: 8
   - pseudogenes: 460
Immunoglobulin/T-cell receptor gene segments:
   - protein coding segments: 481
   - pseudogenes:
 
 
Total number of transcripts: 95495
Protein-coding transcripts: 51733
   - full length protein-coding: 43329
   - partial length protein-coding: 8404
Nonsense mediated decay transcripts: 3784
Long non-coding RNA loci transcripts: 5669

Total number of distinct translations: 43313
Genes that have more than one distinct translation: 9953


Further details on this version's gene and transcript types

biotype genes transcripts
all IG_genes 481 482
all other pseudogenes 5209 5773
all RNA pseudogenes 0 0
all RNA_genes 5091 5890
ambiguous_orf 0 30
antisense 0 1876
disrupted_domain 0 1
IG_C_gene 13 13
IG_D_gene 25 25
IG_J_gene 88 88
IG_V_gene 355 356
lincRNA 1273 2072
miRNA 1577 1577
misc_RNA 487 487
Mt_rRNA 2 2
Mt_tRNA 22 22
ncrna_host 0 3
non_coding 0 75
nonsense_mediated_decay 0 3784
polymorphic_pseudogene 8 12
processed_pseudogene 0 147
processed_transcript 2572 12683
protein_coding 22380 51733
pseudogene 5201 541
retained_intron 0 11496
retrotransposed 0 259
rRNA 332 332
sense_intronic 0 87
snoRNA 1552 1552
snRNA 1423 1423
TEC 0 5
transcribed_processed_pseudogene 0 3761
transcribed_unprocessed_pseudogene 0 795
unitary_pseudogene 0 6
unprocessed_pseudogene 0 252
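To compare the two releases listed on this page, the headline figures can be subtracted directly. The sketch below uses values copied from the general stats of M1 and M12. The key names and the `compare` helper are hypothetical, and the differences are raw counts that do not account for the change of assembly (NCBIM37 to GRCm38) or of biotype definitions between the releases.

```python
# Headline figures copied from the general stats of each release.
m1 = {
    "genes": 37310,
    "protein_coding_genes": 22380,
    "lncrna_genes": 3845,
    "small_ncrna_genes": 5395,
    "pseudogenes": 5209,
    "transcripts": 95495,
}
m12 = {
    "genes": 49585,
    "protein_coding_genes": 21973,
    "lncrna_genes": 10481,
    "small_ncrna_genes": 6111,
    "pseudogenes": 10524,
    "transcripts": 122968,
}


def compare(old, new):
    """Return new minus old for every statistic present in both releases."""
    return {key: new[key] - old[key] for key in old if key in new}


delta = compare(m1, m12)
for key, diff in delta.items():
    print(f"{key}: {diff:+d}")
```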
 
This site is hosted by the Wellcome Trust Sanger Institute.