GENCODE

Statistics about the Mouse GENCODE Reference Release Set

* The statistics derive from the gtf files that contain only the annotation of the main chromosomes.

For details about the calculation of these statistics please see the README_stats.txt file.


Compare two reference releases »

Version M13 (October 2016 freeze, GRCm38) - Ensembl 88

General stats

Total No of Genes
50600
Protein-coding genes
21968
Long non-coding RNA genes
11017
Small non-coding RNA genes
6110
Pseudogenes
11009
   - processed pseudogenes:
7941
   - unprocessed pseudogenes:
2662
   - unitary pseudogenes:
36
   - polymorphic pseudogenes:
76
   - pseudogenes:
91
Immunoglobulin/T-cell receptor gene segments
   - protein coding segments:
494
   - pseudogenes:
203
 
 
Total No of Transcripts
125570
Protein-coding transcripts
54712
   - full length protein-coding:
42487
   - partial length protein-coding:
12225
Nonsense mediated decay transcripts
6000
Long non-coding RNA loci transcripts
15300




Total No of distinct translations
42529
Genes that have more than one distinct translations
9788


Further details on this version's gene and transcript types

biotype genes transcripts
3prime_overlapping_ncRNA 2 3
antisense 2480 3702
bidirectional_promoter_lncRNA 89 159
IG_C_gene 13 18
IG_C_pseudogene 1 1
IG_D_gene 19 19
IG_D_pseudogene 3 3
IG_J_gene 14 14
IG_LV_gene 4 4
IG_pseudogene 2 2
IG_V_gene 218 301
IG_V_pseudogene 155 155
lincRNA 4549 6904
macro_lncRNA 1 2
miRNA 2202 2202
misc_RNA 563 566
Mt_rRNA 2 2
Mt_tRNA 22 22
non_stop_decay 0 21
nonsense_mediated_decay 0 6000
polymorphic_pseudogene 76 89
processed_pseudogene 7733 7734
processed_transcript 757 14407
protein_coding 21968 54712
pseudogene 91 117
retained_intron 0 18550
ribozyme 22 22
rRNA 354 354
scaRNA 51 51
scRNA 1 1
sense_intronic 279 304
sense_overlapping 25 50
snoRNA 1508 1508
snRNA 1383 1383
sRNA 2 2
TEC 2835 2913
TR_C_gene 8 10
TR_D_gene 4 4
TR_J_gene 70 70
TR_J_pseudogene 10 10
TR_V_gene 144 194
TR_V_pseudogene 34 34
transcribed_processed_pseudogene 208 213
transcribed_unitary_pseudogene 8 8
transcribed_unprocessed_pseudogene 207 228
translated_processed_pseudogene 0 12
unitary_pseudogene 28 28
unprocessed_pseudogene 2455 2462

Version M1 (July 2011 freeze, NCBIM37) - Ensembl 65

General stats

Total No of Genes
37310
Protein-coding genes
22380
Long non-coding RNA genes
3845
Small non-coding RNA genes
5395
Pseudogenes
5209
   - processed pseudogenes:
3837
   - unprocessed pseudogenes:
902
   - unitary pseudogenes:
2
   - polymorphic pseudogenes:
8
   - pseudogenes:
460
Immunoglobulin/T-cell receptor gene segments
   - protein coding segments:
481
   - pseudogenes:
 
 
Total No of Transcripts
95495
Protein-coding transcripts
51733
   - full length protein-coding:
43329
   - partial length protein-coding:
8404
Nonsense mediated decay transcripts
3784
Long non-coding RNA loci transcripts
5669




Total No of distinct translations
43313
Genes that have more than one distinct translations
9953


Further details on this version's gene and transcript types

biotype genes transcripts
all IG_genes 481 482
all other pseudogenes 5209 5773
all RNA pseudogenes 0 0
all RNA_genes 5091 5890
ambiguous_orf 0 30
antisense 0 1876
disrupted_domain 0 1
IG_C_gene 13 13
IG_D_gene 25 25
IG_J_gene 88 88
IG_V_gene 355 356
lincRNA 1273 2072
miRNA 1577 1577
misc_RNA 487 487
Mt_rRNA 2 2
Mt_tRNA 22 22
ncrna_host 0 3
non_coding 0 75
nonsense_mediated_decay 0 3784
polymorphic_pseudogene 8 12
processed_pseudogene 0 147
processed_transcript 2572 12683
protein_coding 22380 51733
pseudogene 5201 541
retained_intron 0 11496
retrotransposed 0 259
rRNA 332 332
sense_intronic 0 87
snoRNA 1552 1552
snRNA 1423 1423
TEC 0 5
transcribed_processed_pseudogene 0 3761
transcribed_unprocessed_pseudogene 0 795
unitary_pseudogene 0 6
unprocessed_pseudogene 0 252
 
Cookies policy | Terms & Conditions. This site is hosted by the Wellcome Trust Sanger Institute.