GENCODE

Statistics about all Mouse GENCODE releases

* The statistics are derived from the GTF files that contain only the annotation of the main chromosomes.

For details about how these statistics are calculated, see the README_stats.txt file.
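
As a rough illustration of how per-biotype counts like the ones below could be reproduced from a GTF file, here is a minimal Python sketch. It is not the method described in README_stats.txt. It assumes that each "gene" and "transcript" record carries a quoted `gene_type` or `transcript_type` attribute in column 9, and the function names `count_biotypes` and `count_biotypes_in_file` are hypothetical helpers introduced for this example.

```python
import gzip
import re
from collections import Counter

# Matches quoted GTF attributes such as: gene_type "protein_coding"
ATTR_RE = re.compile(r'(\S+) "([^"]*)"')


def count_biotypes(lines):
    """Count genes and transcripts per biotype from an iterable of GTF lines.

    Assumption: biotypes are stored in the gene_type / transcript_type
    attributes of the 9th column.
    """
    genes, transcripts = Counter(), Counter()
    for line in lines:
        if line.startswith("#"):
            continue  # skip header/comment lines
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 9:
            continue  # skip malformed lines
        feature = fields[2]
        attrs = dict(ATTR_RE.findall(fields[8]))
        if feature == "gene":
            genes[attrs.get("gene_type", "unknown")] += 1
        elif feature == "transcript":
            transcripts[attrs.get("transcript_type", "unknown")] += 1
    return genes, transcripts


def count_biotypes_in_file(path):
    """Open a plain or gzip-compressed GTF file and count its biotypes."""
    opener = gzip.open if path.endswith(".gz") else open
    with opener(path, "rt") as handle:
        return count_biotypes(handle)


# Toy input (made-up records, not real annotation) to show the call shape.
sample = [
    "##description: toy example\n",
    'chr1\tHAVANA\tgene\t1\t100\t.\t+\t.\tgene_id "G1"; gene_type "protein_coding";\n',
    'chr1\tHAVANA\ttranscript\t1\t100\t.\t+\t.\tgene_id "G1"; transcript_id "T1"; gene_type "protein_coding"; transcript_type "protein_coding";\n',
    'chr1\tHAVANA\ttranscript\t1\t90\t.\t+\t.\tgene_id "G1"; transcript_id "T2"; gene_type "protein_coding"; transcript_type "retained_intron";\n',
    'chr2\tHAVANA\tgene\t5\t50\t.\t-\t.\tgene_id "G2"; gene_type "lincRNA";\n',
]
gene_counts, transcript_counts = count_biotypes(sample)
for biotype in sorted(set(gene_counts) | set(transcript_counts)):
    print(biotype, gene_counts[biotype], transcript_counts[biotype])
```

Counting transcripts by their own type rather than by the type of their gene would explain why biotypes such as retained_intron show 0 genes but many transcripts in the tables below.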

Version M15 (May 2017 freeze, GRCm38) - Ensembl 90

General stats

Total No of Genes: 52550
Protein-coding genes: 21950
Long non-coding RNA genes: 11975
Small non-coding RNA genes: 6109
Pseudogenes: 12020
   - processed pseudogenes: 8861
   - unprocessed pseudogenes: 2776
   - unitary pseudogenes: 30
   - polymorphic pseudogenes: 77
   - pseudogenes: 73
Immunoglobulin/T-cell receptor gene segments
   - protein coding segments: 494
   - pseudogenes: 203

Total No of Transcripts: 131100
Protein-coding transcripts: 55819
   - full length protein-coding: 43077
   - partial length protein-coding: 12742
Nonsense mediated decay transcripts: 6364
Long non-coding RNA loci transcripts: 16679

Total No of distinct translations: 43281
Genes that have more than one distinct translation: 10116


Further details on this version's gene and transcript types

biotype genes transcripts
3prime_overlapping_ncRNA 2 3
antisense_RNA 2646 3970
bidirectional_promoter_lncRNA 124 213
IG_C_gene 13 21
IG_C_pseudogene 1 1
IG_D_gene 19 19
IG_D_pseudogene 3 3
IG_J_gene 14 14
IG_LV_gene 4 4
IG_pseudogene 2 2
IG_V_gene 218 301
IG_V_pseudogene 155 155
lincRNA 5082 7725
macro_lncRNA 1 2
miRNA 2202 2202
misc_RNA 563 566
Mt_rRNA 2 2
Mt_tRNA 22 22
non_stop_decay 0 23
nonsense_mediated_decay 0 6364
polymorphic_pseudogene 77 92
processed_pseudogene 8615 8616
processed_transcript 769 14979
protein_coding 21950 55819
pseudogene 73 100
retained_intron 0 19660
ribozyme 22 22
rRNA 354 354
scaRNA 51 51
scRNA 1 1
sense_intronic 307 336
sense_overlapping 27 52
snoRNA 1507 1507
snRNA 1383 1383
sRNA 2 2
TEC 3017 3096
TR_C_gene 8 10
TR_D_gene 4 4
TR_J_gene 70 70
TR_J_pseudogene 10 10
TR_V_gene 144 194
TR_V_pseudogene 34 34
transcribed_processed_pseudogene 246 252
transcribed_unitary_pseudogene 12 12
transcribed_unprocessed_pseudogene 228 247
translated_processed_pseudogene 0 12
unitary_pseudogene 18 18
unprocessed_pseudogene 2548 2555
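
The summary rows in the M15 "General stats" appear to be sums over groups of biotypes from the table above. The sketch below recomputes several of them in Python. The groupings are an assumption inferred from the numbers on this page (for example, that TEC and processed_transcript genes count towards long non-coding RNA genes); they are not taken from README_stats.txt, which remains the authoritative description. The helper name `check_groups` is hypothetical.

```python
# Gene counts copied from the M15 biotype table above.
M15_GENES = {
    "3prime_overlapping_ncRNA": 2, "antisense_RNA": 2646,
    "bidirectional_promoter_lncRNA": 124, "lincRNA": 5082, "macro_lncRNA": 1,
    "processed_transcript": 769, "sense_intronic": 307,
    "sense_overlapping": 27, "TEC": 3017,
    "miRNA": 2202, "misc_RNA": 563, "Mt_rRNA": 2, "Mt_tRNA": 22,
    "ribozyme": 22, "rRNA": 354, "scaRNA": 51, "scRNA": 1,
    "snoRNA": 1507, "snRNA": 1383, "sRNA": 2,
    "processed_pseudogene": 8615, "transcribed_processed_pseudogene": 246,
    "translated_processed_pseudogene": 0,
    "unprocessed_pseudogene": 2548, "transcribed_unprocessed_pseudogene": 228,
    "unitary_pseudogene": 18, "transcribed_unitary_pseudogene": 12,
    "IG_C_gene": 13, "IG_D_gene": 19, "IG_J_gene": 14, "IG_LV_gene": 4,
    "IG_V_gene": 218, "TR_C_gene": 8, "TR_D_gene": 4, "TR_J_gene": 70,
    "TR_V_gene": 144,
}

# ASSUMED groupings, inferred from the numbers (not from README_stats.txt).
GROUPS = {
    "Long non-coding RNA genes": [
        "3prime_overlapping_ncRNA", "antisense_RNA",
        "bidirectional_promoter_lncRNA", "lincRNA", "macro_lncRNA",
        "processed_transcript", "sense_intronic", "sense_overlapping", "TEC",
    ],
    "Small non-coding RNA genes": [
        "miRNA", "misc_RNA", "Mt_rRNA", "Mt_tRNA", "ribozyme", "rRNA",
        "scaRNA", "scRNA", "snoRNA", "snRNA", "sRNA",
    ],
    "processed pseudogenes": [
        "processed_pseudogene", "transcribed_processed_pseudogene",
        "translated_processed_pseudogene",
    ],
    "unprocessed pseudogenes": [
        "unprocessed_pseudogene", "transcribed_unprocessed_pseudogene",
    ],
    "unitary pseudogenes": [
        "unitary_pseudogene", "transcribed_unitary_pseudogene",
    ],
    "IG/TR protein coding segments": [
        "IG_C_gene", "IG_D_gene", "IG_J_gene", "IG_LV_gene", "IG_V_gene",
        "TR_C_gene", "TR_D_gene", "TR_J_gene", "TR_V_gene",
    ],
}

# Reported values copied from the M15 "General stats" section.
M15_SUMMARY = {
    "Long non-coding RNA genes": 11975,
    "Small non-coding RNA genes": 6109,
    "processed pseudogenes": 8861,
    "unprocessed pseudogenes": 2776,
    "unitary pseudogenes": 30,
    "IG/TR protein coding segments": 494,
}


def check_groups(genes, groups, summary):
    """Return {summary row: (reported value, value recomputed from biotypes)}."""
    return {
        name: (summary[name], sum(genes[biotype] for biotype in members))
        for name, members in groups.items()
    }


results = check_groups(M15_GENES, GROUPS, M15_SUMMARY)
for name, (reported, recomputed) in results.items():
    status = "ok" if reported == recomputed else "MISMATCH"
    print(f"{name}: reported={reported} recomputed={recomputed} {status}")

# Two further relationships that hold for the M15 numbers:
# full length + partial length = protein-coding transcripts
protein_coding_transcripts = 43077 + 12742
# pseudogene subcategories + IG/TR pseudogenes = Pseudogenes total
pseudogene_total = 8861 + 2776 + 30 + 77 + 73 + 203
print(protein_coding_transcripts, pseudogene_total)
```

With the M15 numbers, every row in this sketch agrees with the reported value. The same groupings are not guaranteed to hold for older releases, where the biotype set differs.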

Version M14 (January 2017 freeze, GRCm38) - Ensembl 89

General stats

Total No of Genes: 51826
Protein-coding genes: 21948
Long non-coding RNA genes: 11607
Small non-coding RNA genes: 6110
Pseudogenes: 11665
   - processed pseudogenes: 8511
   - unprocessed pseudogenes: 2751
   - unitary pseudogenes: 39
   - polymorphic pseudogenes: 78
   - pseudogenes: 83
Immunoglobulin/T-cell receptor gene segments
   - protein coding segments: 494
   - pseudogenes: 203

Total No of Transcripts: 128622
Protein-coding transcripts: 55252
   - full length protein-coding: 42762
   - partial length protein-coding: 12490
Nonsense mediated decay transcripts: 6198
Long non-coding RNA loci transcripts: 16113

Total No of distinct translations: 42895
Genes that have more than one distinct translation: 9953


Further details on this version's gene and transcript types

biotype genes transcripts
3prime_overlapping_ncRNA 2 3
antisense 2554 3828
bidirectional_promoter_lncRNA 111 193
IG_C_gene 13 21
IG_C_pseudogene 1 1
IG_D_gene 19 19
IG_D_pseudogene 3 3
IG_J_gene 14 14
IG_LV_gene 4 4
IG_pseudogene 2 2
IG_V_gene 218 301
IG_V_pseudogene 155 155
lincRNA 4873 7388
macro_lncRNA 1 2
miRNA 2202 2202
misc_RNA 563 566
Mt_rRNA 2 2
Mt_tRNA 22 22
non_stop_decay 0 22
nonsense_mediated_decay 0 6198
polymorphic_pseudogene 78 94
processed_pseudogene 8280 8281
processed_transcript 772 14713
protein_coding 21948 55252
pseudogene 83 109
retained_intron 0 19094
ribozyme 22 22
rRNA 354 354
scaRNA 51 51
scRNA 1 1
sense_intronic 297 324
sense_overlapping 27 52
snoRNA 1508 1508
snRNA 1383 1383
sRNA 2 2
TEC 2970 3049
TR_C_gene 8 10
TR_D_gene 4 4
TR_J_gene 70 70
TR_J_pseudogene 10 10
TR_V_gene 144 194
TR_V_pseudogene 34 34
transcribed_processed_pseudogene 231 236
transcribed_unitary_pseudogene 9 9
transcribed_unprocessed_pseudogene 226 247
translated_processed_pseudogene 0 12
unitary_pseudogene 30 30
unprocessed_pseudogene 2525 2531

Version M13 (October 2016 freeze, GRCm38) - Ensembl 88

General stats

Total No of Genes: 50600
Protein-coding genes: 21968
Long non-coding RNA genes: 11017
Small non-coding RNA genes: 6110
Pseudogenes: 11009
   - processed pseudogenes: 7941
   - unprocessed pseudogenes: 2662
   - unitary pseudogenes: 36
   - polymorphic pseudogenes: 76
   - pseudogenes: 91
Immunoglobulin/T-cell receptor gene segments
   - protein coding segments: 494
   - pseudogenes: 203

Total No of Transcripts: 125570
Protein-coding transcripts: 54712
   - full length protein-coding: 42487
   - partial length protein-coding: 12225
Nonsense mediated decay transcripts: 6000
Long non-coding RNA loci transcripts: 15300

Total No of distinct translations: 42529
Genes that have more than one distinct translation: 9788


Further details on this version's gene and transcript types

biotype genes transcripts
3prime_overlapping_ncRNA 2 3
antisense 2480 3702
bidirectional_promoter_lncRNA 89 159
IG_C_gene 13 18
IG_C_pseudogene 1 1
IG_D_gene 19 19
IG_D_pseudogene 3 3
IG_J_gene 14 14
IG_LV_gene 4 4
IG_pseudogene 2 2
IG_V_gene 218 301
IG_V_pseudogene 155 155
lincRNA 4549 6904
macro_lncRNA 1 2
miRNA 2202 2202
misc_RNA 563 566
Mt_rRNA 2 2
Mt_tRNA 22 22
non_stop_decay 0 21
nonsense_mediated_decay 0 6000
polymorphic_pseudogene 76 89
processed_pseudogene 7733 7734
processed_transcript 757 14407
protein_coding 21968 54712
pseudogene 91 117
retained_intron 0 18550
ribozyme 22 22
rRNA 354 354
scaRNA 51 51
scRNA 1 1
sense_intronic 279 304
sense_overlapping 25 50
snoRNA 1508 1508
snRNA 1383 1383
sRNA 2 2
TEC 2835 2913
TR_C_gene 8 10
TR_D_gene 4 4
TR_J_gene 70 70
TR_J_pseudogene 10 10
TR_V_gene 144 194
TR_V_pseudogene 34 34
transcribed_processed_pseudogene 208 213
transcribed_unitary_pseudogene 8 8
transcribed_unprocessed_pseudogene 207 228
translated_processed_pseudogene 0 12
unitary_pseudogene 28 28
unprocessed_pseudogene 2455 2462

Version M12 (August 2016 freeze, GRCm38) - Ensembl 87

General stats

Total No of Genes: 49585
Protein-coding genes: 21973
Long non-coding RNA genes: 10481
Small non-coding RNA genes: 6111
Pseudogenes: 10524
   - processed pseudogenes: 7486
   - unprocessed pseudogenes: 2625
   - unitary pseudogenes: 34
   - polymorphic pseudogenes: 77
   - pseudogenes: 99
Immunoglobulin/T-cell receptor gene segments
   - protein coding segments: 494
   - pseudogenes: 203

Total No of Transcripts: 122968
Protein-coding transcripts: 54250
   - full length protein-coding: 42226
   - partial length protein-coding: 12024
Nonsense mediated decay transcripts: 5843
Long non-coding RNA loci transcripts: 14610

Total No of distinct translations: 42187
Genes that have more than one distinct translation: 9633


Further details on this version's gene and transcript types

biotype genes transcripts
3prime_overlapping_ncRNA 2 3
antisense 2419 3618
bidirectional_promoter_lncRNA 61 118
IG_C_gene 13 18
IG_C_pseudogene 1 1
IG_D_gene 19 19
IG_D_pseudogene 3 3
IG_J_gene 14 14
IG_LV_gene 4 4
IG_pseudogene 2 2
IG_V_gene 218 301
IG_V_pseudogene 155 155
lincRNA 4255 6489
macro_lncRNA 1 2
miRNA 2202 2202
misc_RNA 564 566
Mt_rRNA 2 2
Mt_tRNA 22 22
non_stop_decay 0 20
nonsense_mediated_decay 0 5843
polymorphic_pseudogene 77 89
processed_pseudogene 7289 7290
processed_transcript 753 14151
protein_coding 21973 54250
pseudogene 99 124
retained_intron 0 18001
ribozyme 22 22
rRNA 354 354
scaRNA 51 51
scRNA 1 1
sense_intronic 270 294
sense_overlapping 25 50
snoRNA 1508 1508
snRNA 1383 1383
sRNA 2 2
TEC 2695 2773
TR_C_gene 8 10
TR_D_gene 4 4
TR_J_gene 70 70
TR_J_pseudogene 10 10
TR_V_gene 144 194
TR_V_pseudogene 34 34
transcribed_processed_pseudogene 197 201
transcribed_unitary_pseudogene 8 8
transcribed_unprocessed_pseudogene 191 212
translated_processed_pseudogene 0 12
unitary_pseudogene 26 26
unprocessed_pseudogene 2434 2442

Version M11 (March 2016 freeze, GRCm38) - Ensembl 86

General stats

Total No of Genes: 48709
Protein-coding genes: 22018
Long non-coding RNA genes: 9989
Small non-coding RNA genes: 6110
Pseudogenes: 10096
   - processed pseudogenes: 7154
   - unprocessed pseudogenes: 2554
   - unitary pseudogenes: 30
   - polymorphic pseudogenes: 54
   - pseudogenes: 101
Immunoglobulin/T-cell receptor gene segments
   - protein coding segments: 494
   - pseudogenes: 203

Total No of Transcripts: 118925
Protein-coding transcripts: 52382
   - full length protein-coding: 40617
   - partial length protein-coding: 11765
Nonsense mediated decay transcripts: 5680
Long non-coding RNA loci transcripts: 13904

Total No of distinct translations: 41774
Genes that have more than one distinct translation: 9408


Further details on this version's gene and transcript types

biotype genes transcripts
3prime_overlapping_ncRNA 2 3
antisense 2343 3493
bidirectional_promoter_lncRNA 47 97
IG_C_gene 13 18
IG_C_pseudogene 1 1
IG_D_gene 19 19
IG_D_pseudogene 3 3
IG_J_gene 14 14
IG_LV_gene 4 4
IG_pseudogene 2 2
IG_V_gene 218 301
IG_V_pseudogene 155 155
lincRNA 3960 6019
macro_lncRNA 1 2
miRNA 2202 2202
misc_RNA 563 566
Mt_rRNA 2 2
Mt_tRNA 22 22
non_stop_decay 0 19
nonsense_mediated_decay 0 5680
polymorphic_pseudogene 54 61
processed_pseudogene 6970 6971
processed_transcript 755 13773
protein_coding 22018 52382
pseudogene 101 126
retained_intron 0 17530
ribozyme 22 22
rRNA 354 354
scaRNA 51 51
scRNA 1 1
sense_intronic 263 287
sense_overlapping 25 49
snoRNA 1508 1508
snRNA 1383 1383
sRNA 2 2
TEC 2593 2668
TR_C_gene 8 10
TR_D_gene 4 4
TR_J_gene 70 70
TR_J_pseudogene 10 10
TR_V_gene 144 194
TR_V_pseudogene 34 34
transcribed_processed_pseudogene 184 188
transcribed_unitary_pseudogene 6 6
transcribed_unprocessed_pseudogene 179 199
translated_processed_pseudogene 0 12
unitary_pseudogene 24 24
unprocessed_pseudogene 2375 2384

Version M10 (January 2016 freeze, GRCm38) - Ensembl 85

General stats

Total No of Genes: 48440
Protein-coding genes: 22021
Long non-coding RNA genes: 9856
Small non-coding RNA genes: 6109
Pseudogenes: 9958
   - processed pseudogenes: 7057
   - unprocessed pseudogenes: 2516
   - unitary pseudogenes: 25
   - polymorphic pseudogenes: 50
   - pseudogenes: 107
Immunoglobulin/T-cell receptor gene segments
   - protein coding segments: 494
   - pseudogenes: 203

Total No of Transcripts: 117667
Protein-coding transcripts: 51959
   - full length protein-coding: 40362
   - partial length protein-coding: 11597
Nonsense mediated decay transcripts: 5574
Long non-coding RNA loci transcripts: 13722

Total No of distinct translations: 41556
Genes that have more than one distinct translation: 9302


Further details on this version's gene and transcript types

biotype genes transcripts
3prime_overlapping_ncRNA 2 3
all IG_genes 268 356
all other pseudogenes 9960 10031
all RNA pseudogenes 0 0
all RNA_genes 7830 9910
antisense 2304 3437
bidirectional_promoter_lncRNA 37 80
IG_C_gene 13 18
IG_C_pseudogene 1 1
IG_D_gene 19 19
IG_D_pseudogene 3 3
IG_J_gene 14 14
IG_LV_gene 4 4
IG_pseudogene 2 2
IG_V_gene 218 301
IG_V_pseudogene 155 155
lincRNA 3905 5937
macro_lncRNA 1 2
miRNA 2202 2202
misc_RNA 563 566
Mt_rRNA 2 2
Mt_tRNA 22 22
non_stop_decay 0 18
nonsense_mediated_decay 0 5574
polymorphic_pseudogene 50 56
processed_pseudogene 6875 6876
processed_transcript 753 13649
protein_coding 22021 51959
pseudogene 107 130
retained_intron 0 17254
ribozyme 22 22
rRNA 354 354
scaRNA 51 51
scRNA 1 1
sense_intronic 265 289
sense_overlapping 25 49
snoRNA 1508 1508
snRNA 1382 1382
sRNA 2 2
TEC 2564 2639
TR_C_gene 8 10
TR_D_gene 4 4
TR_J_gene 70 70
TR_J_pseudogene 10 10
TR_V_gene 144 194
TR_V_pseudogene 34 34
transcribed_processed_pseudogene 182 187
transcribed_unitary_pseudogene 5 5
transcribed_unprocessed_pseudogene 172 188
translated_processed_pseudogene 0 12
unitary_pseudogene 20 20
unprocessed_pseudogene 2344 2352

Version M9 (October 2015 freeze, GRCm38) - Ensembl 84

General stats

Total No of Genes: 47643
Protein-coding genes: 21971
Long non-coding RNA genes: 9436
Small non-coding RNA genes: 6109
Pseudogenes: 9631
   - processed pseudogenes: 6775
   - unprocessed pseudogenes: 2477
   - unitary pseudogenes: 21
   - polymorphic pseudogenes: 39
   - pseudogenes: 116
Immunoglobulin/T-cell receptor gene segments
   - protein coding segments: 494
   - pseudogenes: 203

Total No of Transcripts: 115125
Protein-coding transcripts: 51254
   - full length protein-coding: 39914
   - partial length protein-coding: 11340
Nonsense mediated decay transcripts: 5375
Long non-coding RNA loci transcripts: 13046

Total No of distinct translations: 41084
Genes that have more than one distinct translation: 9098


Further details on this version's gene and transcript types

biotype genes transcripts
3prime_overlapping_ncRNA 2 3
all IG_genes 268 356
all other pseudogenes 9633 9698
all RNA pseudogenes 0 0
all RNA_genes 7614 9506
antisense 2243 3308
bidirectional_promoter_lncRNA 19 43
IG_C_gene 13 18
IG_C_pseudogene 1 1
IG_D_gene 19 19
IG_D_pseudogene 3 3
IG_J_gene 14 14
IG_LV_gene 4 4
IG_pseudogene 2 2
IG_V_gene 218 301
IG_V_pseudogene 155 155
lincRNA 3707 5570
macro_lncRNA 1 2
miRNA 2202 2202
misc_RNA 563 566
Mt_rRNA 2 2
Mt_tRNA 22 22
non_stop_decay 0 17
nonsense_mediated_decay 0 5375
polymorphic_pseudogene 39 44
processed_pseudogene 6598 6599
processed_transcript 747 13468
protein_coding 21971 51254
pseudogene 116 135
retained_intron 0 16805
ribozyme 22 22
rRNA 354 354
scaRNA 51 51
scRNA 1 1
sense_intronic 258 282
sense_overlapping 24 48
snoRNA 1508 1508
snRNA 1382 1382
sRNA 2 2
TEC 2435 2506
TR_C_gene 8 10
TR_D_gene 4 4
TR_J_gene 70 70
TR_J_pseudogene 10 10
TR_V_gene 144 194
TR_V_pseudogene 34 34
transcribed_processed_pseudogene 177 182
transcribed_unitary_pseudogene 3 3
transcribed_unprocessed_pseudogene 158 173
translated_processed_pseudogene 0 12
translated_unprocessed_pseudogene 1 1
unitary_pseudogene 18 18
unprocessed_pseudogene 2318 2326

Version M8 (August 2015 freeze, GRCm38) - Ensembl 83

General stats

Total No of Genes: 46983
Protein-coding genes: 21930
Long non-coding RNA genes: 9072
Small non-coding RNA genes: 6108
Pseudogenes: 9379
   - processed pseudogenes: 6567
   - unprocessed pseudogenes: 2391
   - unitary pseudogenes: 16
   - polymorphic pseudogenes: 32
   - pseudogenes: 168
Immunoglobulin/T-cell receptor gene segments
   - protein coding segments: 494
   - pseudogenes: 205

Total No of Transcripts: 113231
Protein-coding transcripts: 50620
   - full length protein-coding: 39590
   - partial length protein-coding: 11030
Nonsense mediated decay transcripts: 5229
Long non-coding RNA loci transcripts: 12557

Total No of distinct translations: 40674
Genes that have more than one distinct translation: 8978


Further details on this version's gene and transcript types

biotype genes transcripts
3prime_overlapping_ncrna 2 3
all IG_genes 268 363
all other pseudogenes 9379 9442
all RNA pseudogenes 0 0
all RNA_genes 7464 9251
antisense 2189 3208
bidirectional_promoter_lncrna 12 22
IG_C_gene 13 20
IG_C_pseudogene 1 1
IG_D_gene 19 19
IG_D_pseudogene 4 4
IG_J_gene 14 14
IG_LV_gene 4 4
IG_V_gene 218 306
IG_V_pseudogene 156 156
lincRNA 3579 5362
macro_lncRNA 1 2
miRNA 2202 2202
misc_RNA 563 566
Mt_rRNA 2 2
Mt_tRNA 22 22
non_stop_decay 0 15
nonsense_mediated_decay 0 5229
polymorphic_pseudogene 32 37
processed_pseudogene 6393 6394
processed_transcript 749 13422
protein_coding 21930 50620
pseudogene 168 186
retained_intron 0 16496
ribozyme 22 22
rRNA 354 354
scaRNA 51 51
sense_intronic 253 277
sense_overlapping 23 47
snoRNA 1508 1508
snRNA 1382 1382
sRNA 2 2
TEC 2264 2333
TR_C_gene 8 11
TR_D_gene 4 4
TR_J_gene 70 70
TR_J_pseudogene 10 10
TR_V_gene 144 194
TR_V_pseudogene 34 34
transcribed_processed_pseudogene 174 179
transcribed_unitary_pseudogene 1 1
transcribed_unprocessed_pseudogene 148 162
translated_processed_pseudogene 0 12
translated_unprocessed_pseudogene 1 1
unitary_pseudogene 15 15
unprocessed_pseudogene 2242 2250

Version M7 (June 2015 freeze, GRCm38) - Ensembl 82

General stats

Total No of Genes: 46517
Protein-coding genes: 21936
Long non-coding RNA genes: 8793
Small non-coding RNA genes: 6109
Pseudogenes: 9185
   - processed pseudogenes: 6430
   - unprocessed pseudogenes: 2336
   - unitary pseudogenes: 16
   - polymorphic pseudogenes: 21
   - pseudogenes: 177
Immunoglobulin/T-cell receptor gene segments
   - protein coding segments: 494
   - pseudogenes: 205

Total No of Transcripts: 111706
Protein-coding transcripts: 50162
   - full length protein-coding: 39393
   - partial length protein-coding: 10769
Nonsense mediated decay transcripts: 5064
Long non-coding RNA loci transcripts: 12169

Total No of distinct translations: 40395
Genes that have more than one distinct translation: 8861


Further details on this version's gene and transcript types

biotype genes transcripts
3prime_overlapping_ncrna 2 3
all IG_genes 268 368
all other pseudogenes 9185 9227
all RNA pseudogenes 0 0
all RNA_genes 7381 9115
antisense 2137 3117
bidirectional_promoter_lncrna 8 17
IG_C_gene 13 20
IG_C_pseudogene 1 1
IG_D_gene 19 20
IG_D_pseudogene 4 4
IG_J_gene 14 18
IG_LV_gene 4 4
IG_V_gene 218 306
IG_V_pseudogene 156 156
lincRNA 3495 5226
macro_lncRNA 1 2
miRNA 2202 2202
misc_RNA 564 566
Mt_rRNA 2 2
Mt_tRNA 22 22
non_stop_decay 0 15
nonsense_mediated_decay 0 5064
polymorphic_pseudogene 21 26
processed_pseudogene 6257 6258
processed_transcript 744 13436
protein_coding 21936 50162
pseudogene 177 177
retained_intron 0 16152
ribozyme 22 22
rRNA 354 354
scaRNA 51 51
sense_intronic 250 270
sense_overlapping 22 44
snoRNA 1508 1508
snRNA 1382 1382
sRNA 2 2
TEC 2134 2206
TR_C_gene 8 11
TR_D_gene 4 5
TR_J_gene 70 76
TR_J_pseudogene 10 10
TR_V_gene 144 194
TR_V_pseudogene 34 34
transcribed_processed_pseudogene 173 178
transcribed_unitary_pseudogene 1 1
transcribed_unprocessed_pseudogene 142 153
translated_processed_pseudogene 0 12
translated_unprocessed_pseudogene 1 1
unitary_pseudogene 15 15
unprocessed_pseudogene 2193 2201

Version M6 (March 2015 freeze, GRCm38) - Ensembl 81

General stats

Total No of Genes: 45706
Protein-coding genes: 21958
Long non-coding RNA genes: 8359
Small non-coding RNA genes: 6109
Pseudogenes: 8787
   - processed pseudogenes: 6097
   - unprocessed pseudogenes: 2272
   - unitary pseudogenes: 15
   - polymorphic pseudogenes: 19
   - pseudogenes: 179
Immunoglobulin/T-cell receptor gene segments
   - protein coding segments: 493
   - pseudogenes: 205

Total No of Transcripts: 109617
Protein-coding transcripts: 49676
   - full length protein-coding: 39126
   - partial length protein-coding: 10550
Nonsense mediated decay transcripts: 4912
Long non-coding RNA loci transcripts: 11649

Total No of distinct translations: 40115
Genes that have more than one distinct translation: 8722


Further details on this version's gene and transcript types

biotype genes transcripts
3prime_overlapping_ncrna 2 3
all IG_genes 269 369
all other pseudogenes 8787 8826
all RNA pseudogenes 0 0
all RNA_genes 7283 8983
antisense 2060 2997
IG_C_gene 13 20
IG_C_pseudogene 1 1
IG_D_gene 19 20
IG_D_pseudogene 4 4
IG_J_gene 14 18
IG_LV_gene 5 5
IG_V_gene 218 306
IG_V_pseudogene 156 156
lincRNA 3397 5094
macro_lncRNA 1 2
miRNA 2202 2202
misc_RNA 564 566
Mt_rRNA 2 2
Mt_tRNA 22 22
non_stop_decay 0 13
nonsense_mediated_decay 0 4912
polymorphic_pseudogene 19 24
processed_pseudogene 5931 5932
processed_transcript 746 13444
protein_coding 21958 49676
pseudogene 179 179
retained_intron 0 15621
ribozyme 22 22
rRNA 354 354
scaRNA 51 51
sense_intronic 239 259
sense_overlapping 22 44
snoRNA 1508 1508
snRNA 1382 1382
sRNA 2 2
TEC 1892 1962
TR_C_gene 8 11
TR_D_gene 4 5
TR_J_gene 70 76
TR_J_pseudogene 10 10
TR_V_gene 142 192
TR_V_pseudogene 34 34
transcribed_processed_pseudogene 166 170
transcribed_unprocessed_pseudogene 132 142
translated_processed_pseudogene 0 12
translated_unprocessed_pseudogene 1 1
unitary_pseudogene 15 15
unprocessed_pseudogene 2139 2146

Version M5 (December 2014 freeze, GRCm38) - Ensembl 80

General stats

Total No of Genes: 45232
Protein-coding genes: 21953
Long non-coding RNA genes: 7989
Small non-coding RNA genes: 6109
Pseudogenes: 8526
   - processed pseudogenes: 6077
   - unprocessed pseudogenes: 2235
   - unitary pseudogenes: 15
   - polymorphic pseudogenes: 19
   - pseudogenes: 136
Immunoglobulin/T-cell receptor gene segments
   - protein coding segments: 494
   - pseudogenes: 44

Total No of Transcripts: 107842
Protein-coding transcripts: 49145
   - full length protein-coding: 38869
   - partial length protein-coding: 10276
Nonsense mediated decay transcripts: 4800
Long non-coding RNA loci transcripts: 11206

Total No of distinct translations: 39790
Genes that have more than one distinct translation: 8595


Further details on this version's gene and transcript types

biotype genes transcripts
3prime_overlapping_ncrna 2 3
all IG_genes 268 368
all other pseudogenes 8687 8726
all RNA pseudogenes 0 0
all RNA_genes 7183 8801
antisense 2000 2925
IG_C_gene 13 20
IG_C_pseudogene 1 1
IG_D_gene 19 20
IG_D_pseudogene 4 4
IG_J_gene 14 18
IG_LV_gene 4 4
IG_V_gene 218 306
IG_V_pseudogene 156 156
lincRNA 3297 4912
macro_lncRNA 1 2
miRNA 2202 2202
misc_RNA 564 566
Mt_rRNA 2 2
Mt_tRNA 22 22
non_stop_decay 0 10
nonsense_mediated_decay 0 4800
polymorphic_pseudogene 19 24
processed_pseudogene 5920 5921
processed_transcript 751 13406
protein_coding 21953 49145
pseudogene 136 136
retained_intron 0 15101
ribozyme 22 22
rRNA 354 354
scaRNA 51 51
sense_intronic 231 251
sense_overlapping 22 44
snoRNA 1508 1508
snRNA 1382 1382
sRNA 2 2
TEC 1685 1752
TR_C_gene 8 11
TR_D_gene 4 5
TR_J_gene 70 76
TR_J_pseudogene 10 10
TR_V_gene 144 194
TR_V_pseudogene 34 34
transcribed_processed_pseudogene 156 160
transcribed_unprocessed_pseudogene 129 139
translated_processed_pseudogene 1 13
translated_unprocessed_pseudogene 1 1
unitary_pseudogene 15 15
unprocessed_pseudogene 2105 2112

Version M4 (August 2014 freeze, GRCm38) - Ensembl 78, 79

General stats

Total No of Genes: 43346
Protein-coding genes: 22032
Long non-coding RNA genes: 6951
Small non-coding RNA genes: 5853
Pseudogenes: 7957
   - processed pseudogenes: 5560
   - unprocessed pseudogenes: 2171
   - unitary pseudogenes: 15
   - polymorphic pseudogenes: 18
   - pseudogenes: 178
Immunoglobulin/T-cell receptor gene segments
   - protein coding segments: 479
   - pseudogenes: 15

Total No of Transcripts: 103639
Protein-coding transcripts: 48482
   - full length protein-coding: 38578
   - partial length protein-coding: 9904
Nonsense mediated decay transcripts: 4558
Long non-coding RNA loci transcripts: 9962

Total No of distinct translations: 39340
Genes that have more than one distinct translation: 8406


Further details on this version's gene and transcript types

biotype genes transcripts
3prime_overlapping_ncrna 2 3
all IG_genes 395 446
all other pseudogenes 8031 8079
all RNA pseudogenes 0 0
all RNA_genes 6878 8343
antisense 1838 2666
IG_C_gene 13 20
IG_C_pseudogene 1 1
IG_D_gene 21 22
IG_D_pseudogene 4 4
IG_J_gene 74 75
IG_LV_gene 196 197
IG_V_gene 91 132
IG_V_pseudogene 69 69
lincRNA 2998 4463
miRNA 1973 1973
misc_RNA 590 590
Mt_rRNA 2 2
Mt_tRNA 22 22
non_stop_decay 0 9
nonsense_mediated_decay 0 4558
polymorphic_pseudogene 18 23
processed_pseudogene 5420 5421
processed_transcript 756 13314
protein_coding 22032 48482
pseudogene 178 189
retained_intron 0 14221
rRNA 353 353
sense_intronic 149 161
sense_overlapping 19 25
snoRNA 1530 1530
snRNA 1383 1383
TEC 1189 1249
TR_C_gene 2 3
TR_D_gene 2 2
TR_J_gene 13 15
TR_J_pseudogene 1 1
TR_V_gene 67 90
TR_V_pseudogene 14 14
transcribed_processed_pseudogene 139 142
transcribed_unprocessed_pseudogene 128 138
translated_processed_pseudogene 1 13
translated_unprocessed_pseudogene 1 1
unitary_pseudogene 15 15
unprocessed_pseudogene 2042 2048

Version M3 (April 2014 freeze, GRCm38) - Ensembl 76, 77

General stats

Total No of Genes: 41128
Protein-coding genes: 22026
Long non-coding RNA genes: 5385
Small non-coding RNA genes: 5853
Pseudogenes: 7388
   - processed pseudogenes: 5161
   - unprocessed pseudogenes: 1996
   - unitary pseudogenes: 16
   - polymorphic pseudogenes: 13
   - pseudogenes: 200
Immunoglobulin/T-cell receptor gene segments
   - protein coding segments: 476
   - pseudogenes: 2

Total No of Transcripts: 99839
Protein-coding transcripts: 47979
   - full length protein-coding: 38350
   - partial length protein-coding: 9629
Nonsense mediated decay transcripts: 4382
Long non-coding RNA loci transcripts: 8170

Total No of distinct translations: 39033
Genes that have more than one distinct translation: 8271


Further details on this version's gene and transcript types

biotype genes transcripts
3prime_overlapping_ncrna 2 3
all IG_genes 431 435
all other pseudogenes 7388 7420
all RNA pseudogenes 0 0
all RNA_genes 6649 7981
antisense 1731 2511
IG_C_gene 12 14
IG_D_gene 25 25
IG_J_gene 88 88
IG_LV_gene 304 305
IG_V_gene 2 3
IG_V_pseudogene 1 1
lincRNA 2769 4101
miRNA 1973 1973
misc_RNA 590 590
Mt_rRNA 2 2
Mt_tRNA 22 22
non_stop_decay 0 7
nonsense_mediated_decay 0 4382
polymorphic_pseudogene 13 14
processed_pseudogene 0 5031
processed_transcript 738 13343
protein_coding 22026 47979
pseudogene 7373 199
retained_intron 0 13568
rRNA 353 353
sense_intronic 129 139
sense_overlapping 16 36
snoRNA 1530 1530
snRNA 1383 1383
TR_V_gene 45 62
TR_V_pseudogene 1 1
transcribed_processed_pseudogene 0 134
transcribed_unprocessed_pseudogene 0 131
translated_processed_pseudogene 0 13
translated_unprocessed_pseudogene 0 1
unitary_pseudogene 0 16
unprocessed_pseudogene 0 1879

Version M2 (July 2013 freeze, GRCm38) - Ensembl 74, 75

General stats

Total No of Genes: 38924
Protein-coding genes: 22572
Long non-coding RNA genes: 4074
Small non-coding RNA genes: 5853
Pseudogenes: 5948
   - processed pseudogenes: 4556
   - unprocessed pseudogenes: 1157
   - unitary pseudogenes: 14
   - polymorphic pseudogenes: 15
   - pseudogenes: 204
Immunoglobulin/T-cell receptor gene segments
   - protein coding segments: 477
   - pseudogenes: 2

Total No of Transcripts: 94545
Protein-coding transcripts: 47394
   - full length protein-coding: 38260
   - partial length protein-coding: 9134
Nonsense mediated decay transcripts: 4134
Long non-coding RNA loci transcripts: 6053

Total No of distinct translations: 38862
Genes that have more than one distinct translation: 7946


Further details on this version's gene and transcript types

biotype genes transcripts
3prime_overlapping_ncrna 1 1
all IG_genes 432 434
all other pseudogenes 5948 6471
all RNA pseudogenes 0 0
all RNA_genes 5672 6398
antisense 1476 2066
IG_C_gene 13 15
IG_D_gene 25 25
IG_J_gene 88 88
IG_LV_gene 304 304
IG_V_gene 2 2
IG_V_pseudogene 1 2
lincRNA 1792 2518
miRNA 1973 1973
misc_RNA 590 590
Mt_rRNA 2 2
Mt_tRNA 22 22
non_stop_decay 0 5
nonsense_mediated_decay 0 4134
polymorphic_pseudogene 15 19
processed_pseudogene 0 4759
processed_transcript 705 12877
protein_coding 22572 47394
pseudogene 5931 261
retained_intron 0 12607
rRNA 353 353
sense_intronic 90 98
sense_overlapping 10 27
snoRNA 1530 1530
snRNA 1383 1383
TR_V_gene 45 60
TR_V_pseudogene 1 1
transcribed_processed_pseudogene 0 122
transcribed_unprocessed_pseudogene 0 94
translated_processed_pseudogene 0 12
translated_unprocessed_pseudogene 0 1
unitary_pseudogene 0 17
unprocessed_pseudogene 0 1183

Version M1 (July 2011 freeze, NCBIM37) - Ensembl 65

General stats

Total No of Genes: 37310
Protein-coding genes: 22380
Long non-coding RNA genes: 3845
Small non-coding RNA genes: 5395
Pseudogenes: 5209
   - processed pseudogenes: 3837
   - unprocessed pseudogenes: 902
   - unitary pseudogenes: 2
   - polymorphic pseudogenes: 8
   - pseudogenes: 460
Immunoglobulin/T-cell receptor gene segments
   - protein coding segments: 481
   - pseudogenes:

Total No of Transcripts: 95495
Protein-coding transcripts: 51733
   - full length protein-coding: 43329
   - partial length protein-coding: 8404
Nonsense mediated decay transcripts: 3784
Long non-coding RNA loci transcripts: 5669

Total No of distinct translations: 43313
Genes that have more than one distinct translation: 9953


Further details on this version's gene and transcript types

biotype genes transcripts
all IG_genes 481 482
all other pseudogenes 5209 5773
all RNA pseudogenes 0 0
all RNA_genes 5091 5890
ambiguous_orf 0 30
antisense 0 1876
disrupted_domain 0 1
IG_C_gene 13 13
IG_D_gene 25 25
IG_J_gene 88 88
IG_V_gene 355 356
lincRNA 1273 2072
miRNA 1577 1577
misc_RNA 487 487
Mt_rRNA 2 2
Mt_tRNA 22 22
ncrna_host 0 3
non_coding 0 75
nonsense_mediated_decay 0 3784
polymorphic_pseudogene 8 12
processed_pseudogene 0 147
processed_transcript 2572 12683
protein_coding 22380 51733
pseudogene 5201 541
retained_intron 0 11496
retrotransposed 0 259
rRNA 332 332
sense_intronic 0 87
snoRNA 1552 1552
snRNA 1423 1423
TEC 0 5
transcribed_processed_pseudogene 0 3761
transcribed_unprocessed_pseudogene 0 795
unitary_pseudogene 0 6
unprocessed_pseudogene 0 252
 
This site is hosted by the Wellcome Trust Sanger Institute.