GENCODE

Statistics about the Mouse GENCODE Reference Release Set

* The statistics derive from the gtf files that contain only the annotation of the main chromosomes.

For details about the calculation of these statistics please see the README_stats.txt file.


Compare two reference releases »

Version M15 (May 2017 freeze, GRCm38) - Ensembl 90

General stats

Total No of Genes
52550
Protein-coding genes
21950
Long non-coding RNA genes
11975
Small non-coding RNA genes
6109
Pseudogenes
12020
   - processed pseudogenes:
8861
   - unprocessed pseudogenes:
2776
   - unitary pseudogenes:
30
   - polymorphic pseudogenes:
77
   - pseudogenes:
73
Immunoglobulin/T-cell receptor gene segments
   - protein coding segments:
494
   - pseudogenes:
203
 
 
Total No of Transcripts
131100
Protein-coding transcripts
55819
   - full length protein-coding:
43077
   - partial length protein-coding:
12742
Nonsense mediated decay transcripts
6364
Long non-coding RNA loci transcripts
16679




Total No of distinct translations
43281
Genes that have more than one distinct translations
10116


Further details on this version's gene and transcript types

biotype genes transcripts
3prime_overlapping_ncRNA 2 3
antisense_RNA 2646 3970
bidirectional_promoter_lncRNA 124 213
IG_C_gene 13 21
IG_C_pseudogene 1 1
IG_D_gene 19 19
IG_D_pseudogene 3 3
IG_J_gene 14 14
IG_LV_gene 4 4
IG_pseudogene 2 2
IG_V_gene 218 301
IG_V_pseudogene 155 155
lincRNA 5082 7725
macro_lncRNA 1 2
miRNA 2202 2202
misc_RNA 563 566
Mt_rRNA 2 2
Mt_tRNA 22 22
non_stop_decay 0 23
nonsense_mediated_decay 0 6364
polymorphic_pseudogene 77 92
processed_pseudogene 8615 8616
processed_transcript 769 14979
protein_coding 21950 55819
pseudogene 73 100
retained_intron 0 19660
ribozyme 22 22
rRNA 354 354
scaRNA 51 51
scRNA 1 1
sense_intronic 307 336
sense_overlapping 27 52
snoRNA 1507 1507
snRNA 1383 1383
sRNA 2 2
TEC 3017 3096
TR_C_gene 8 10
TR_D_gene 4 4
TR_J_gene 70 70
TR_J_pseudogene 10 10
TR_V_gene 144 194
TR_V_pseudogene 34 34
transcribed_processed_pseudogene 246 252
transcribed_unitary_pseudogene 12 12
transcribed_unprocessed_pseudogene 228 247
translated_processed_pseudogene 0 12
unitary_pseudogene 18 18
unprocessed_pseudogene 2548 2555

Version M1 (July 2011 freeze, NCBIM37) - Ensembl 65

General stats

Total No of Genes
37310
Protein-coding genes
22380
Long non-coding RNA genes
3845
Small non-coding RNA genes
5395
Pseudogenes
5209
   - processed pseudogenes:
3837
   - unprocessed pseudogenes:
902
   - unitary pseudogenes:
2
   - polymorphic pseudogenes:
8
   - pseudogenes:
460
Immunoglobulin/T-cell receptor gene segments
   - protein coding segments:
481
   - pseudogenes:
 
 
Total No of Transcripts
95495
Protein-coding transcripts
51733
   - full length protein-coding:
43329
   - partial length protein-coding:
8404
Nonsense mediated decay transcripts
3784
Long non-coding RNA loci transcripts
5669




Total No of distinct translations
43313
Genes that have more than one distinct translations
9953


Further details on this version's gene and transcript types

biotype genes transcripts
all IG_genes 481 482
all other pseudogenes 5209 5773
all RNA pseudogenes 0 0
all RNA_genes 5091 5890
ambiguous_orf 0 30
antisense 0 1876
disrupted_domain 0 1
IG_C_gene 13 13
IG_D_gene 25 25
IG_J_gene 88 88
IG_V_gene 355 356
lincRNA 1273 2072
miRNA 1577 1577
misc_RNA 487 487
Mt_rRNA 2 2
Mt_tRNA 22 22
ncrna_host 0 3
non_coding 0 75
nonsense_mediated_decay 0 3784
polymorphic_pseudogene 8 12
processed_pseudogene 0 147
processed_transcript 2572 12683
protein_coding 22380 51733
pseudogene 5201 541
retained_intron 0 11496
retrotransposed 0 259
rRNA 332 332
sense_intronic 0 87
snoRNA 1552 1552
snRNA 1423 1423
TEC 0 5
transcribed_processed_pseudogene 0 3761
transcribed_unprocessed_pseudogene 0 795
unitary_pseudogene 0 6
unprocessed_pseudogene 0 252
 
Cookies policy | Terms & Conditions. This site is hosted by the Wellcome Trust Sanger Institute.