GENCODE

Statistics about all Human GENCODE releases

* The statistics derive from the gtf files that contain only the annotation of the main chromosomes.

For details about the calculation of these statistics please see the README_stats.txt file.

Version 26 (October 2016 freeze, GRCh38) - Ensembl 88 Download release

General stats

Total No of Genes
58219
Protein-coding genes
19817
Long non-coding RNA genes
15787
Small non-coding RNA genes
7568
Pseudogenes
14636
   - processed pseudogenes:
10700
   - unprocessed pseudogenes:
3419
   - unitary pseudogenes:
211
   - polymorphic pseudogenes:
54
   - pseudogenes:
18
Immunoglobulin/T-cell receptor gene segments
   - protein coding segments:
410
   - pseudogenes:
234
 
 
Total No of Transcripts
199325
Protein-coding transcripts
80531
   - full length protein-coding:
55030
   - partial length protein-coding:
25501
Nonsense mediated decay transcripts
14113
Long non-coding RNA loci transcripts
27720




Total No of distinct translations
60172
Genes that have more than one distinct translations
13546


Further details on this version's gene and transcript types

biotype genes transcripts
3prime_overlapping_ncRNA 31 35
antisense 5529 11038
bidirectional_promoter_lncRNA 8 24
IG_C_gene 14 17
IG_C_pseudogene 9 9
IG_D_gene 37 37
IG_J_gene 18 18
IG_J_pseudogene 3 3
IG_pseudogene 1 1
IG_V_gene 144 144
IG_V_pseudogene 188 188
lincRNA 7520 13231
macro_lncRNA 1 1
miRNA 1881 1881
misc_RNA 2213 2227
Mt_rRNA 2 2
Mt_tRNA 22 22
non_coding 3 3
non_stop_decay 0 84
nonsense_mediated_decay 0 14113
polymorphic_pseudogene 54 72
processed_pseudogene 10248 10251
processed_transcript 533 27915
protein_coding 19817 80531
pseudogene 18 38
retained_intron 0 27186
ribozyme 8 8
rRNA 543 543
scaRNA 49 49
scRNA 1 1
sense_intronic 904 961
sense_overlapping 190 343
snoRNA 943 955
snRNA 1900 1900
sRNA 5 5
TEC 1068 1168
TR_C_gene 6 6
TR_D_gene 4 4
TR_J_gene 79 79
TR_J_pseudogene 4 4
TR_V_gene 108 108
TR_V_pseudogene 30 30
transcribed_processed_pseudogene 452 452
transcribed_unitary_pseudogene 95 97
transcribed_unprocessed_pseudogene 749 753
unitary_pseudogene 116 116
unprocessed_pseudogene 2670 2671
vaultRNA 1 1

Version 25 (March 2016 freeze, GRCh38) - Ensembl 85, 86, 87 Download release

General stats

Total No of Genes
58037
Protein-coding genes
19950
Long non-coding RNA genes
15767
Small non-coding RNA genes
7258
Pseudogenes
14650
   - processed pseudogenes:
10725
   - unprocessed pseudogenes:
3400
   - unitary pseudogenes:
214
   - polymorphic pseudogenes:
51
   - pseudogenes:
21
Immunoglobulin/T-cell receptor gene segments
   - protein coding segments:
411
   - pseudogenes:
239
 
 
Total No of Transcripts
198093
Protein-coding transcripts
80087
   - full length protein-coding:
54755
   - partial length protein-coding:
25332
Nonsense mediated decay transcripts
13769
Long non-coding RNA loci transcripts
27692




Total No of distinct translations
60033
Genes that have more than one distinct translations
13536


Further details on this version's gene and transcript types

biotype genes transcripts
3prime_overlapping_ncRNA 30 34
all IG_genes 214 217
all other pseudogenes 14651 14695
all RNA pseudogenes 0 0
all RNA_genes 13255 19008
antisense 5530 11036
bidirectional_promoter_lncRNA 4 11
IG_C_gene 14 17
IG_C_pseudogene 9 9
IG_D_gene 37 37
IG_J_gene 18 18
IG_J_pseudogene 3 3
IG_pseudogene 1 1
IG_V_gene 145 145
IG_V_pseudogene 193 193
lincRNA 7539 13255
macro_lncRNA 1 1
miRNA 1569 1871
misc_RNA 2213 2227
Mt_rRNA 2 2
Mt_tRNA 22 22
non_coding 3 3
non_stop_decay 0 83
nonsense_mediated_decay 0 13769
polymorphic_pseudogene 51 68
processed_pseudogene 10275 10277
processed_transcript 516 27754
protein_coding 19950 80087
pseudogene 21 42
retained_intron 0 26955
ribozyme 8 8
rRNA 544 544
scaRNA 49 49
scRNA 1 1
sense_intronic 909 966
sense_overlapping 187 309
snoRNA 944 956
snRNA 1900 1900
sRNA 5 5
TEC 1048 1135
TR_C_gene 6 6
TR_D_gene 4 4
TR_J_gene 79 79
TR_J_pseudogene 4 4
TR_V_gene 108 108
TR_V_pseudogene 30 30
transcribed_processed_pseudogene 450 450
transcribed_unitary_pseudogene 60 60
transcribed_unprocessed_pseudogene 737 739
unitary_pseudogene 154 155
unprocessed_pseudogene 2663 2664
vaultRNA 1 1

Version 24 (August 2015 freeze, GRCh38) - Ensembl 83, 84 Download release

General stats

Total No of Genes
60554
Protein-coding genes
19815
Long non-coding RNA genes
15941
Small non-coding RNA genes
9882
Pseudogenes
14505
   - processed pseudogenes:
10728
   - unprocessed pseudogenes:
3295
   - unitary pseudogenes:
174
   - polymorphic pseudogenes:
58
   - pseudogenes:
21
Immunoglobulin/T-cell receptor gene segments
   - protein coding segments:
411
   - pseudogenes:
229
 
 
Total No of Transcripts
199169
Protein-coding transcripts
79930
   - full length protein-coding:
54806
   - partial length protein-coding:
25124
Nonsense mediated decay transcripts
13409
Long non-coding RNA loci transcripts
28031




Total No of distinct translations
59891
Genes that have more than one distinct translations
13565


Further details on this version's gene and transcript types

biotype genes transcripts
3prime_overlapping_ncrna 29 33
all IG_genes 214 232
all other pseudogenes 14505 14548
all RNA pseudogenes 0 0
all RNA_genes 13456 19289
antisense 5564 11194
bidirectional_promoter_lncrna 3 5
IG_C_gene 14 19
IG_C_pseudogene 9 9
IG_D_gene 37 37
IG_J_gene 18 18
IG_J_pseudogene 3 3
IG_V_gene 145 158
IG_V_pseudogene 183 183
lincRNA 7674 13481
macro_lncRNA 1 1
miRNA 4093 4093
misc_RNA 2298 2312
Mt_rRNA 2 2
Mt_tRNA 22 22
non_coding 3 3
non_stop_decay 0 81
nonsense_mediated_decay 0 13409
polymorphic_pseudogene 58 75
processed_pseudogene 10283 10285
processed_transcript 502 26977
protein_coding 19815 79930
pseudogene 21 44
retained_intron 0 26704
ribozyme 8 8
rRNA 544 544
scaRNA 49 49
sense_intronic 919 978
sense_overlapping 193 343
snoRNA 949 961
snRNA 1896 1896
sRNA 20 20
TEC 1053 1140
TR_C_gene 6 9
TR_D_gene 4 4
TR_J_gene 79 79
TR_J_pseudogene 4 4
TR_V_gene 108 110
TR_V_pseudogene 30 30
transcribed_processed_pseudogene 445 445
transcribed_unitary_pseudogene 3 3
transcribed_unprocessed_pseudogene 682 682
translated_unprocessed_pseudogene 1 1
unitary_pseudogene 171 171
unprocessed_pseudogene 2612 2613
vaultRNA 1 1

Version 23 (March 2015 freeze, GRCh38) - Ensembl 81, 82 Download release

General stats

Total No of Genes
60498
Protein-coding genes
19797
Long non-coding RNA genes
15931
Small non-coding RNA genes
9882
Pseudogenes
14477
   - processed pseudogenes:
10727
   - unprocessed pseudogenes:
3271
   - unitary pseudogenes:
172
   - polymorphic pseudogenes:
59
   - pseudogenes:
21
Immunoglobulin/T-cell receptor gene segments
   - protein coding segments:
411
   - pseudogenes:
227
 
 
Total No of Transcripts
198619
Protein-coding transcripts
79795
   - full length protein-coding:
54775
   - partial length protein-coding:
25020
Nonsense mediated decay transcripts
13307
Long non-coding RNA loci transcripts
27817




Total No of distinct translations
59774
Genes that have more than one distinct translations
13556


Further details on this version's gene and transcript types

biotype genes transcripts
3prime_overlapping_ncrna 29 33
all IG_genes 216 246
all other pseudogenes 14477 14516
all RNA pseudogenes 0 0
all RNA_genes 13460 19109
antisense 5565 11203
IG_C_gene 14 31
IG_C_pseudogene 9 9
IG_D_gene 37 37
IG_J_gene 18 18
IG_J_pseudogene 3 3
IG_V_gene 147 160
IG_V_pseudogene 181 181
lincRNA 7678 13301
macro_lncRNA 1 1
miRNA 4093 4093
misc_RNA 2298 2312
Mt_rRNA 2 2
Mt_tRNA 22 22
non_stop_decay 0 77
nonsense_mediated_decay 0 13307
polymorphic_pseudogene 59 73
processed_pseudogene 10285 10287
processed_transcript 497 26945
protein_coding 19797 79795
pseudogene 21 44
retained_intron 0 26616
ribozyme 8 8
rRNA 544 544
scaRNA 49 49
sense_intronic 917 976
sense_overlapping 194 344
snoRNA 949 961
snRNA 1896 1896
sRNA 20 20
TEC 1050 1137
TR_C_gene 6 23
TR_D_gene 4 4
TR_J_gene 79 79
TR_J_pseudogene 4 4
TR_V_gene 106 108
TR_V_pseudogene 30 30
transcribed_processed_pseudogene 442 442
transcribed_unitary_pseudogene 2 2
transcribed_unprocessed_pseudogene 668 667
translated_unprocessed_pseudogene 1 1
unitary_pseudogene 170 170
unprocessed_pseudogene 2602 2603
vaultRNA 1 1

Version 22 (October 2014 freeze, GRCh38) - Ensembl 79, 80 Download release

General stats

Total No of Genes
60483
Protein-coding genes
19814
Long non-coding RNA genes
15900
Small non-coding RNA genes
9894
Pseudogenes
14285
   - processed pseudogenes:
10748
   - unprocessed pseudogenes:
3238
   - unitary pseudogenes:
170
   - polymorphic pseudogenes:
59
   - pseudogenes:
36
Immunoglobulin/T-cell receptor gene segments
   - protein coding segments:
398
   - pseudogenes:
226
 
 
Total No of Transcripts
198442
Protein-coding transcripts
79712
   - full length protein-coding:
54766
   - partial length protein-coding:
24946
Nonsense mediated decay transcripts
13274
Long non-coding RNA loci transcripts
27670




Total No of distinct translations
59714
Genes that have more than one distinct translations
13539


Further details on this version's gene and transcript types

biotype genes transcripts
3prime_overlapping_ncrna 29 33
all IG_genes 211 241
all other pseudogenes 14477 14517
all RNA pseudogenes 0 0
all RNA_genes 13450 19064
antisense 5565 11197
IG_C_gene 14 31
IG_C_pseudogene 9 9
IG_D_gene 37 37
IG_J_gene 18 18
IG_J_pseudogene 3 3
IG_V_gene 142 155
IG_V_pseudogene 180 180
lincRNA 7656 13256
macro_lncRNA 1 1
miRNA 4093 4093
misc_RNA 2298 2312
Mt_rRNA 2 2
Mt_tRNA 22 22
non_coding 3 3
non_stop_decay 0 76
nonsense_mediated_decay 0 13274
polymorphic_pseudogene 59 73
processed_pseudogene 10304 10307
processed_transcript 484 27019
protein_coding 19814 79712
pseudogene 36 57
retained_intron 0 26542
ribozyme 8 8
rRNA 544 544
scaRNA 49 49
sense_intronic 920 979
sense_overlapping 197 347
snoRNA 961 961
snRNA 1896 1896
sRNA 20 20
TEC 1045 1131
TR_C_gene 5 19
TR_D_gene 3 3
TR_J_gene 73 73
TR_J_pseudogene 4 4
TR_V_gene 106 111
TR_V_pseudogene 30 30
transcribed_processed_pseudogene 443 443
transcribed_unitary_pseudogene 1 1
transcribed_unprocessed_pseudogene 663 663
translated_processed_pseudogene 1 1
translated_unprocessed_pseudogene 1 1
unitary_pseudogene 169 169
unprocessed_pseudogene 2574 2576
vaultRNA 1 1

Version 21 (June 2014 freeze, GRCh38) - Ensembl 77, 78 Download release

General stats

Total No of Genes
60155
Protein-coding genes
19881
Long non-coding RNA genes
15877
Small non-coding RNA genes
9534
Pseudogenes
14467
   - processed pseudogenes:
10753
   - unprocessed pseudogenes:
3230
   - unitary pseudogenes:
170
   - polymorphic pseudogenes:
59
   - pseudogenes:
29
Immunoglobulin/T-cell receptor gene segments
   - protein coding segments:
395
   - pseudogenes:
226
 
 
Total No of Transcripts
196327
Protein-coding transcripts
79377
   - full length protein-coding:
54420
   - partial length protein-coding:
24957
Nonsense mediated decay transcripts
13222
Long non-coding RNA loci transcripts
26414




Total No of distinct translations
59512
Genes that have more than one distinct translations
13526


Further details on this version's gene and transcript types

biotype genes transcripts
3prime_overlapping_ncrna 27 31
all IG_genes 208 242
all other pseudogenes 14468 14507
all RNA pseudogenes 0 0
all RNA_genes 13363 18630
antisense 5542 10397
IG_C_gene 14 30
IG_C_pseudogene 9 9
IG_D_gene 37 37
IG_J_gene 18 18
IG_J_pseudogene 3 3
IG_V_gene 139 157
IG_V_pseudogene 180 180
known_ncrna 2 2
lincRNA 7666 12919
miRNA 3837 3837
misc_RNA 2234 2248
Mt_rRNA 2 2
Mt_tRNA 22 22
non_coding 1 1
non_stop_decay 0 74
nonsense_mediated_decay 0 13222
polymorphic_pseudogene 59 73
processed_pseudogene 10312 10315
processed_transcript 468 26942
protein_coding 19881 79377
pseudogene 29 48
retained_intron 0 26412
rRNA 549 549
sense_intronic 915 975
sense_overlapping 198 324
snoRNA 978 978
snRNA 1912 1912
TEC 1058 1148
TR_C_gene 5 19
TR_D_gene 3 3
TR_J_gene 73 73
TR_J_pseudogene 4 4
TR_V_gene 106 111
TR_V_pseudogene 30 30
transcribed_processed_pseudogene 441 441
transcribed_unitary_pseudogene 1 1
transcribed_unprocessed_pseudogene 658 659
translated_processed_pseudogene 1 1
translated_unprocessed_pseudogene 1 1
unitary_pseudogene 169 169
unprocessed_pseudogene 2571 2573

Version 20 (April 2014 freeze, GRCh38) - Ensembl 76 Download release

General stats

Total No of Genes
58688
Protein-coding genes
19942
Long non-coding RNA genes
14470
Small non-coding RNA genes
9519
Pseudogenes
14363
   - processed pseudogenes:
10736
   - unprocessed pseudogenes:
3202
   - unitary pseudogenes:
171
   - polymorphic pseudogenes:
26
   - pseudogenes:
2
Immunoglobulin/T-cell receptor gene segments
   - protein coding segments:
392
   - pseudogenes:
226
 
 
Total No of Transcripts
194334
Protein-coding transcripts
79460
   - full length protein-coding:
54447
   - partial length protein-coding:
25013
Nonsense mediated decay transcripts
13229
Long non-coding RNA loci transcripts
24489




Total No of distinct translations
59575
Genes that have more than one distinct translations
13579


Further details on this version's gene and transcript types

biotype genes transcripts
3prime_overlapping_ncrna 21 23
all IG_genes 205 239
all other pseudogenes 14365 14403
all RNA pseudogenes 0 0
all RNA_genes 13099 17891
antisense 5411 10033
IG_C_gene 14 30
IG_C_pseudogene 9 9
IG_D_gene 37 37
IG_J_gene 18 18
IG_J_pseudogene 3 3
IG_V_gene 136 154
IG_V_pseudogene 182 182
lincRNA 7408 12186
miRNA 3828 3828
misc_RNA 2232 2246
Mt_rRNA 2 2
Mt_tRNA 22 22
non_stop_decay 0 72
nonsense_mediated_decay 0 13229
polymorphic_pseudogene 26 40
processed_pseudogene 10303 10307
processed_transcript 531 27335
protein_coding 19942 79460
pseudogene 2 19
retained_intron 0 26334
rRNA 545 545
sense_intronic 910 964
sense_overlapping 189 317
snoRNA 978 978
snRNA 1912 1912
TR_C_gene 5 19
TR_D_gene 3 3
TR_J_gene 73 73
TR_J_pseudogene 4 4
TR_V_gene 106 111
TR_V_pseudogene 28 28
transcribed_processed_pseudogene 433 433
transcribed_unprocessed_pseudogene 643 644
translated_processed_pseudogene 2 2
unitary_pseudogene 171 171
unprocessed_pseudogene 2559 2561

Version 19 (July 2013 freeze, GRCh37) - Ensembl 74, 75 Download release

General stats

Total No of Genes
57820
Protein-coding genes
20345
Long non-coding RNA genes
13870
Small non-coding RNA genes
9013
Pseudogenes
14206
   - processed pseudogenes:
10532
   - unprocessed pseudogenes:
2942
   - unitary pseudogenes:
161
   - polymorphic pseudogenes:
45
   - pseudogenes:
296
Immunoglobulin/T-cell receptor gene segments
   - protein coding segments:
386
   - pseudogenes:
230
 
 
Total No of Transcripts
196520
Protein-coding transcripts
81814
   - full length protein-coding:
57005
   - partial length protein-coding:
24809
Nonsense mediated decay transcripts
13052
Long non-coding RNA loci transcripts
23898




Total No of distinct translations
61559
Genes that have more than one distinct translations
13600


Further details on this version's gene and transcript types

biotype genes transcripts
3prime_overlapping_ncrna 21 25
all IG_genes 207 217
all other pseudogenes 14206 15343
all RNA pseudogenes 0 0
all RNA_genes 13072 17837
antisense 5276 9710
IG_C_gene 14 18
IG_C_pseudogene 9 10
IG_D_gene 37 37
IG_J_gene 18 18
IG_J_pseudogene 3 3
IG_V_gene 138 144
IG_V_pseudogene 187 196
lincRNA 7114 11780
miRNA 3055 3116
misc_RNA 2034 2050
Mt_rRNA 2 2
Mt_tRNA 22 22
non_stop_decay 0 58
nonsense_mediated_decay 0 13052
polymorphic_pseudogene 45 59
processed_pseudogene 0 10623
processed_transcript 515 28082
protein_coding 20345 81814
pseudogene 13931 387
retained_intron 0 25955
rRNA 527 531
sense_intronic 742 802
sense_overlapping 202 330
snoRNA 1457 1529
snRNA 1916 1923
TR_C_gene 5 5
TR_D_gene 3 3
TR_J_gene 74 74
TR_J_pseudogene 4 4
TR_V_gene 97 97
TR_V_pseudogene 27 27
transcribed_processed_pseudogene 0 442
transcribed_unprocessed_pseudogene 0 860
translated_processed_pseudogene 0 1
unitary_pseudogene 0 182
unprocessed_pseudogene 0 2549

Version 18 (April 2013 freeze, GRCh37) - Ensembl 73 Download release

General stats

Total No of Genes
57445
Protein-coding genes
20318
Long non-coding RNA genes
13562
Small non-coding RNA genes
8998
Pseudogenes
14181
   - processed pseudogenes:
10585
   - unprocessed pseudogenes:
2873
   - unitary pseudogenes:
165
   - polymorphic pseudogenes:
36
   - pseudogenes:
292
Immunoglobulin/T-cell receptor gene segments
   - protein coding segments:
386
   - pseudogenes:
230
 
 
Total No of Transcripts
195584
Protein-coding transcripts
81673
   - full length protein-coding:
56953
   - partial length protein-coding:
24720
Nonsense mediated decay transcripts
12985
Long non-coding RNA loci transcripts
23105




Total No of distinct translations
61482
Genes that have more than one distinct translations
13602


Further details on this version's gene and transcript types

biotype genes transcripts
3prime_overlapping_ncrna 45 52
all IG_genes 207 217
all other pseudogenes 14181 15321
all RNA pseudogenes 0 0
all RNA_genes 12710 17164
antisense 5043 9082
IG_C_gene 14 18
IG_C_pseudogene 9 10
IG_D_gene 37 37
IG_J_gene 18 18
IG_J_pseudogene 3 3
IG_V_gene 138 144
IG_V_pseudogene 187 196
lincRNA 6763 11120
miRNA 3051 3109
misc_RNA 2032 2048
Mt_rRNA 2 2
Mt_tRNA 22 22
non_stop_decay 0 56
nonsense_mediated_decay 0 12985
polymorphic_pseudogene 36 50
processed_pseudogene 0 10717
processed_transcript 805 28888
protein_coding 20318 81673
pseudogene 13915 387
retained_intron 0 25782
rRNA 527 531
sense_intronic 716 776
sense_overlapping 190 300
snoRNA 1451 1522
snRNA 1913 1919
TR_C_gene 5 5
TR_D_gene 3 3
TR_J_gene 74 74
TR_J_pseudogene 4 4
TR_V_gene 97 97
TR_V_pseudogene 27 27
transcribed_processed_pseudogene 0 407
transcribed_unprocessed_pseudogene 0 616
translated_processed_pseudogene 0 1
unitary_pseudogene 0 187
unprocessed_pseudogene 0 2716

Version 17 (February 2013 freeze, GRCh37) - Ensembl 72 Download release

General stats

Total No of Genes
57281
Protein-coding genes
20330
Long non-coding RNA genes
13333
Small non-coding RNA genes
9078
Pseudogenes
14154
   - processed pseudogenes:
10599
   - unprocessed pseudogenes:
2846
   - unitary pseudogenes:
162
   - polymorphic pseudogenes:
29
   - pseudogenes:
290
Immunoglobulin/T-cell receptor gene segments
   - protein coding segments:
386
   - pseudogenes:
228
 
 
Total No of Transcripts
194871
Protein-coding transcripts
81565
   - full length protein-coding:
56950
   - partial length protein-coding:
24615
Nonsense mediated decay transcripts
12913
Long non-coding RNA loci transcripts
22631




Total No of distinct translations
61392
Genes that have more than one distinct translations
13589


Further details on this version's gene and transcript types

biotype genes transcripts
3prime_overlapping_ncrna 36 44
all IG_genes 207 217
all other pseudogenes 14154 15289
all RNA pseudogenes 0 0
all RNA_genes 12012 15836
antisense 4589 8113
IG_C_gene 14 18
IG_C_pseudogene 9 10
IG_D_gene 37 37
IG_J_gene 18 18
IG_J_pseudogene 3 3
IG_V_gene 138 144
IG_V_pseudogene 185 193
lincRNA 6020 9844
miRNA 3086 3086
misc_RNA 2031 2031
Mt_rRNA 2 2
Mt_tRNA 22 22
non_stop_decay 0 54
nonsense_mediated_decay 0 12913
polymorphic_pseudogene 29 44
processed_pseudogene 0 10770
processed_transcript 1873 30875
protein_coding 20330 81565
pseudogene 13897 387
retained_intron 0 25694
rRNA 527 527
sense_intronic 674 734
sense_overlapping 141 173
snoRNA 1506 1506
snRNA 1904 1904
TEC 0 99
TR_C_gene 5 5
TR_D_gene 3 3
TR_J_gene 74 74
TR_J_pseudogene 4 4
TR_V_gene 97 97
TR_V_pseudogene 27 27
transcribed_processed_pseudogene 0 367
transcribed_unprocessed_pseudogene 0 548
unitary_pseudogene 0 184
unprocessed_pseudogene 0 2752

Version 16 (November 2012 freeze, GRCh37) - Ensembl 71 Download release

General stats

Total No of Genes
56563
Protein-coding genes
20387
Long non-coding RNA genes
13220
Small non-coding RNA genes
9173
Pseudogenes
13419
   - processed pseudogenes:
9911
   - unprocessed pseudogenes:
2837
   - unitary pseudogenes:
158
   - polymorphic pseudogenes:
26
   - pseudogenes:
290
Immunoglobulin/T-cell receptor gene segments
   - protein coding segments:
364
   - pseudogenes:
197
 
 
Total No of Transcripts
194034
Protein-coding transcripts
81626
   - full length protein-coding:
57084
   - partial length protein-coding:
24542
Nonsense mediated decay transcripts
12808
Long non-coding RNA loci transcripts
22444




Total No of distinct translations
61494
Genes that have more than one distinct translations
13610


Further details on this version's gene and transcript types

biotype genes transcripts
3prime_overlapping_ncrna 38 45
all IG_genes 185 193
all other pseudogenes 13419 14560
all RNA pseudogenes 0 0
all RNA_genes 11892 15448
ambiguous_orf 0 52
antisense 4545 7895
IG_C_gene 14 18
IG_C_pseudogene 8 9
IG_D_gene 27 27
IG_J_gene 18 18
IG_J_pseudogene 3 3
IG_V_gene 126 130
IG_V_pseudogene 155 159
lincRNA 5835 9391
miRNA 3116 3116
misc_RNA 2050 2050
Mt_rRNA 2 2
Mt_tRNA 22 22
non_coding 13 25
non_stop_decay 0 52
nonsense_mediated_decay 0 12808
polymorphic_pseudogene 26 41
processed_pseudogene 0 10105
processed_transcript 1990 31583
protein_coding 20387 81626
pseudogene 13196 387
retained_intron 0 25466
rRNA 531 531
sense_intronic 657 715
sense_overlapping 142 173
snoRNA 1529 1529
snRNA 1923 1923
TEC 0 98
TR_C_gene 5 5
TR_D_gene 3 3
TR_J_gene 74 74
TR_J_pseudogene 4 4
TR_V_gene 97 97
TR_V_pseudogene 27 27
transcribed_processed_pseudogene 0 350
transcribed_unprocessed_pseudogene 0 533
unitary_pseudogene 0 178
unprocessed_pseudogene 0 2764

Version 15 (August 2012 freeze, GRCh37) - Ensembl 70 Download release

General stats

Total No of Genes
56680
Protein-coding genes
20447
Long non-coding RNA genes
13249
Small non-coding RNA genes
9173
Pseudogenes
13447
   - processed pseudogenes:
9881
   - unprocessed pseudogenes:
2891
   - unitary pseudogenes:
160
   - polymorphic pseudogenes:
27
   - pseudogenes:
292
Immunoglobulin/T-cell receptor gene segments
   - protein coding segments:
364
   - pseudogenes:
196
Total No of Transcripts
195433
Protein-coding transcripts
82336
   - full length protein-coding:
57664
   - partial length protein-coding:
24672
Nonsense mediated decay transcripts
12882
Long non-coding RNA loci transcripts
22531
 
 
 
 
Total No of distinct translations
61998
Genes that have more than one distinct translations
13636


Further details on this version's gene and transcript types

biotype genes transcripts
3prime_overlapping_ncrna 37 43
all IG_genes 185 193
all other pseudogenes 13447 14657
all RNA pseudogenes 0 0
all RNA_genes 12515 15304
ambiguous_orf 0 53
antisense 4580 7948
IG_C_gene 14 18
IG_C_pseudogene 8 9
IG_D_gene 27 27
IG_J_gene 18 18
IG_J_pseudogene 3 3
IG_V_gene 126 130
IG_V_pseudogene 154 157
lincRNA 6458 9247
miRNA 3116 3116
misc_RNA 2050 2050
Mt_rRNA 2 2
Mt_tRNA 22 22
non_coding 11 14
non_stop_decay 0 48
nonsense_mediated_decay 0 12882
polymorphic_pseudogene 27 42
processed_pseudogene 0 10105
processed_transcript 1371 32392
protein_coding 20447 82336
pseudogene 13224 388
retained_intron 0 25279
rRNA 531 531
sense_intronic 648 709
sense_overlapping 144 177
snoRNA 1529 1529
snRNA 1923 1923
TEC 0 103
TR_C_gene 5 5
TR_D_gene 3 3
TR_J_gene 74 74
TR_J_pseudogene 4 4
TR_V_gene 97 97
TR_V_pseudogene 27 27
transcribed_processed_pseudogene 0 351
transcribed_unprocessed_pseudogene 0 517
unitary_pseudogene 0 180
unprocessed_pseudogene 0 2874

Version 14 (June 2012 freeze, GRCh37) - Ensembl 69 Download release

General stats

Total No of Genes
55889
Protein-coding genes
20078
Long non-coding RNA genes
12933
Small non-coding RNA genes
9173
Pseudogenes
13341
   - processed pseudogenes:
10072
   - unprocessed pseudogenes:
2596
   - unitary pseudogenes:
155
   - polymorphic pseudogenes:
29
   - pseudogenes:
296
Immunoglobulin/T-cell receptor gene segments
   - protein coding segments:
364
   - pseudogenes:
193
Total No of Transcripts
190051
Protein-coding transcripts
80413
   - full length protein-coding:
56728
   - partial length protein-coding:
23685
Nonsense mediated decay transcripts
12421
Long non-coding RNA loci transcripts
21271
 
 
 
 
Total No of distinct translations
60685
Genes that have more than one distinct translations
13435


Further details on this version's gene and transcript types

biotype genes transcripts
3prime_overlapping_ncrna 36 42
all IG_genes 185 193
all other pseudogenes 13341 14461
all RNA pseudogenes 0 0
all RNA_genes 12379 14648
ambiguous_orf 0 55
antisense 4424 7226
IG_C_gene 14 18
IG_C_pseudogene 8 9
IG_D_gene 27 27
IG_J_gene 18 18
IG_J_pseudogene 3 3
IG_V_gene 126 130
IG_V_pseudogene 151 153
lincRNA 6322 8591
miRNA 3116 3116
misc_RNA 2050 2050
Mt_rRNA 2 2
Mt_tRNA 22 22
non_coding 11 14
non_stop_decay 0 34
nonsense_mediated_decay 0 12421
polymorphic_pseudogene 29 43
processed_pseudogene 0 10127
processed_transcript 1393 32071
protein_coding 20078 80413
pseudogene 13119 388
retained_intron 0 24240
retrotransposed 0 212
rRNA 531 531
sense_intronic 608 662
sense_overlapping 139 175
snoRNA 1529 1529
snRNA 1923 1923
TEC 0 101
TR_C_gene 5 5
TR_D_gene 3 3
TR_J_gene 74 74
TR_J_pseudogene 4 4
TR_V_gene 97 97
TR_V_pseudogene 27 27
transcribed_processed_pseudogene 0 261
transcribed_unprocessed_pseudogene 0 476
unitary_pseudogene 0 175
unprocessed_pseudogene 0 2583

Version 13 (March 2012 freeze, GRCh37) - Ensembl 68 Download release

General stats

Total No of Genes
55123
Protein-coding genes
20070
Long non-coding RNA genes
12393
Small non-coding RNA genes
9173
Pseudogenes
13123
   - processed pseudogenes:
9895
   - unprocessed pseudogenes:
2550
   - unitary pseudogenes:
156
   - polymorphic pseudogenes:
31
   - pseudogenes:
298
Immunoglobulin/T-cell receptor gene segments
   - protein coding segments:
364
   - pseudogenes:
193
Total No of Transcripts
182967
Protein-coding transcripts
77901
   - full length protein-coding:
55928
   - partial length protein-coding:
21973
Nonsense mediated decay transcripts
11549
Long non-coding RNA loci transcripts
19835
 
 
 
 
Total No of distinct translations
59175
Genes that have more than one distinct translations
13161


Further details on this version's gene and transcript types

biotype genes transcripts
3prime_overlapping_ncrna 31 40
all IG_genes 185 193
all other pseudogenes 13123 14220
all RNA_genes 12153 13995
ambiguous_orf 0 59
antisense 4220 6534
IG_C_gene 14 18
IG_C_pseudogene 8 9
IG_D_gene 27 27
IG_J_gene 18 18
IG_J_pseudogene 3 3
IG_V_gene 126 130
IG_V_pseudogene 151 153
lincRNA 6096 7938
miRNA 3116 3116
misc_RNA 2050 2050
Mt_rRNA 2 2
Mt_tRNA 22 22
non_coding 12 15
non_stop_decay 0 15
nonsense_mediated_decay 0 11549
polymorphic_pseudogene 31 46
processed_pseudogene 0 9953
processed_transcript 1335 31617
protein_coding 20070 77901
pseudogene 12899 388
retained_intron 0 22655
retrotransposed 0 210
rRNA 531 531
sense_intronic 557 606
sense_overlapping 142 175
snoRNA 1529 1529
snRNA 1923 1923
TEC 0 98
TR_C_gene 5 5
TR_D_gene 3 3
TR_J_gene 74 74
TR_J_pseudogene 4 4
TR_V_gene 97 97
TR_V_pseudogene 27 27
transcribed_processed_pseudogene 0 248
transcribed_unprocessed_pseudogene 0 453
unitary_pseudogene 0 175
unprocessed_pseudogene 0 2551

Version 12 (December 2011 freeze, GRCh37) - Ensembl 67 Download release

General stats

Total No of Genes
53934
Protein-coding genes
20110
Long non-coding RNA genes
11790
Small non-coding RNA genes
8801
Pseudogenes
12869
   - processed pseudogenes:
9688
   - unprocessed pseudogenes:
2497
   - unitary pseudogenes:
156
   - polymorphic pseudogenes:
29
   - pseudogenes:
307
Immunoglobulin/T-cell receptor gene segments
   - protein coding segments:
364
   - pseudogenes:
192
Total No of Transcripts
183086
Protein-coding transcripts
81885
   - full length protein-coding:
60850
   - partial length protein-coding:
21035
Nonsense mediated decay transcripts
10891
Long non-coding RNA loci transcripts
18855
 
 
 
 
Total No of distinct translations
63467
Genes that have more than one distinct translations
13856


Further details on this version's gene and transcript types

biotype genes transcripts
3prime_overlapping_ncrna 32 38
all IG_genes 185 193
all other pseudogenes 12869 13966
all RNA pseudogenes 1838 1838
all RNA_genes 10956 12493
ambiguous_orf 0 61
antisense 3981 5993
disrupted_domain 0 1
IG_C_gene 14 18
IG_C_pseudogene 6 6
IG_D_gene 27 27
IG_J_gene 18 18
IG_J_pseudogene 3 3
IG_V_gene 126 130
IG_V_pseudogene 152 155
lincRNA 5749 7286
miRNA 1756 1756
miRNA_pseudogene 15 15
misc_RNA 1187 1187
misc_RNA_pseudogene 3 3
Mt_rRNA 2 2
Mt_tRNA 22 22
Mt_tRNA_pseudogene 580 580
non_coding 105 217
non_stop_decay 0 8
nonsense_mediated_decay 0 10891
polymorphic_pseudogene 29 44
processed_pseudogene 0 9753
processed_transcript 1288 31352
protein_coding 20110 81885
pseudogene 12648 399
retained_intron 0 21423
retrotransposed 0 210
rRNA 531 531
rRNA_pseudogene 179 179
scRNA_pseudogene 787 787
sense_intronic 486 528
sense_overlapping 149 178
snoRNA 1521 1521
snoRNA_pseudogene 73 73
snRNA 1944 1944
snRNA_pseudogene 73 73
TEC 0 86
TR_C_gene 5 5
TR_D_gene 3 3
TR_J_gene 74 74
TR_J_pseudogene 4 4
TR_V_gene 97 97
TR_V_pseudogene 27 27
transcribed_processed_pseudogene 0 239
transcribed_unprocessed_pseudogene 0 573
tRNA_pseudogene 128 128
unitary_pseudogene 0 176
unprocessed_pseudogene 0 2377

Version 11 (October 2011 freeze, GRCh37) - Ensembl 66 Download release

General stats

Total No of Genes
53639
Protein-coding genes
20107
Long non-coding RNA genes
11600
Small non-coding RNA genes
8801
Pseudogenes
12761
   - processed pseudogenes:
9387
   - unprocessed pseudogenes:
2446
   - unitary pseudogenes:
156
   - polymorphic pseudogenes:
27
   - pseudogenes:
553
Immunoglobulin/T-cell receptor gene segments
   - protein coding segments:
370
   - pseudogenes:
192
Total No of Transcripts
180272
Protein-coding transcripts
81040
   - full length protein-coding:
60661
   - partial length protein-coding:
20379
Nonsense mediated decay transcripts
10525
Long non-coding RNA loci transcripts
18566
 
 
 
 
Total No of distinct translations
62978
Genes that have more than one distinct translations
13767


Further details on this version's gene and transcript types

biotype genes transcripts
3prime_overlapping_ncrna 26 32
all IG_genes 191 194
all other pseudogenes 12761 13847
all RNA pseudogenes 1838 1838
all RNA_genes 10095 12337
ambiguous_orf 0 59
antisense 3895 5885
disrupted_domain 0 1
IG_C_gene 14 16
IG_C_pseudogene 7 7
IG_D_gene 27 27
IG_J_gene 18 18
IG_J_pseudogene 3 3
IG_V_gene 132 133
IG_V_pseudogene 151 151
lincRNA 4888 7130
miRNA 1756 1756
miRNA_pseudogene 15 15
misc_RNA 1187 1187
misc_RNA_pseudogene 3 3
Mt_rRNA 2 2
Mt_tRNA 22 22
Mt_tRNA_pseudogene 580 580
non_coding 101 212
non_stop_decay 0 8
nonsense_mediated_decay 0 10525
polymorphic_pseudogene 27 42
processed_pseudogene 0 9464
processed_transcript 2097 30971
protein_coding 20107 81040
pseudogene 12542 649
retained_intron 0 20641
retrotransposed 0 211
rRNA 531 531
rRNA_pseudogene 179 179
scRNA_pseudogene 787 787
sense_intronic 457 498
sense_overlapping 136 162
snoRNA 1521 1521
snoRNA_pseudogene 73 73
snRNA 1944 1944
snRNA_pseudogene 73 73
TEC 0 87
TR_C_gene 5 5
TR_D_gene 3 3
TR_J_gene 74 74
TR_J_pseudogene 4 4
TR_V_gene 97 97
TR_V_pseudogene 27 27
transcribed_processed_pseudogene 0 220
transcribed_unprocessed_pseudogene 0 546
tRNA_pseudogene 128 128
unitary_pseudogene 0 176
unprocessed_pseudogene 0 2347

Version 10 (July 2011 freeze, GRCh37) - Ensembl 65 Download release

General stats

Total No of Genes
52376
Protein-coding genes
20007
Long non-coding RNA genes
10840
Small non-coding RNA genes
8801
Pseudogenes
12358
   - processed pseudogenes:
8908
   - unprocessed pseudogenes:
2266
   - unitary pseudogenes:
151
   - polymorphic pseudogenes:
27
   - pseudogenes:
814
Immunoglobulin/T-cell receptor gene segments
   - protein coding segments:
370
   - pseudogenes:
192
Total No of Transcripts
172975
Protein-coding transcripts
78832
   - full length protein-coding:
59895
   - partial length protein-coding:
18937
Nonsense mediated decay transcripts
9619
Long non-coding RNA loci transcripts
17547
 
 
 
 
Total No of distinct translations
61675
Genes that have more than one distinct translations
13569


Further details on this version's gene and transcript types

biotype genes transcripts
3prime_overlapping_ncrna 12 12
all IG_genes 191 194
all other pseudogenes 12358 13416
all RNA pseudogenes 1838 1838
all RNA_genes 10691 11949
ambiguous_orf 20 62
antisense 3526 5446
disrupted_domain 0 1
IG_C_gene 14 16
IG_C_pseudogene 7 7
IG_D_gene 27 27
IG_J_gene 18 18
IG_J_pseudogene 3 3
IG_V_gene 132 133
IG_V_pseudogene 151 151
lincRNA 5484 6742
miRNA 1756 1756
miRNA_pseudogene 15 15
misc_RNA 1187 1187
misc_RNA_pseudogene 3 3
Mt_rRNA 2 2
Mt_tRNA 22 22
Mt_tRNA_pseudogene 580 580
non_coding 104 217
non_stop_decay 0 8
nonsense_mediated_decay 0 9619
polymorphic_pseudogene 27 42
processed_pseudogene 0 8985
processed_transcript * 1271 29900
protein_coding 20007 78832
pseudogene 12139 912
retained_intron 10 19015
retrotransposed 0 211
rRNA 531 531
rRNA_pseudogene 179 179
scRNA_pseudogene 787 787
sense_intronic 395 433
sense_overlapping 18 47
snoRNA 1521 1521
snoRNA_pseudogene 73 73
snRNA 1944 1944
snRNA_pseudogene 73 73
TEC 0 51
TR_C_gene 5 5
TR_D_gene 3 3
TR_J_gene 74 74
TR_J_pseudogene 4 4
TR_V_gene 97 97
TR_V_pseudogene 27 27
transcribed_processed_pseudogene 0 209
transcribed_unprocessed_pseudogene 0 471
tRNA_pseudogene 128 128
unitary_pseudogene 0 167
unprocessed_pseudogene 0 2227

* stats are according to gencode.v10.annotation_updated_ncrna_host.gtf file

Version 9 (May 2011 freeze, GRCh37) - Ensembl 64 Download release

General stats

Total No of Genes
51838
Protein-coding genes
20012
Long non-coding RNA genes
11004
Small non-coding RNA genes
8801
Pseudogenes
11651
   - processed pseudogenes:
8619
   - unprocessed pseudogenes:
1891
   - unitary pseudogenes:
103
   - polymorphic pseudogenes:
27
   - pseudogenes:
819
Immunoglobulin/T-cell receptor gene segments
   - protein coding segments:
370
   - pseudogenes:
192
Total No of Transcripts
169419
Protein-coding transcripts
77808
   - full length protein-coding:
59522
   - partial length protein-coding:
18286
Nonsense mediated decay transcripts
9261
Long non-coding RNA loci transcripts
18878
 
 
 
 
Total No of distinct translations
61094
Genes that have more than one distinct translations
13497


Further details on this version's gene and transcript types

biotype genes transcripts
3prime_overlapping_ncrna 0 9
all IG_genes 191 194
all other pseudogenes 11651 13084
all RNA pseudogenes 1838 1838
all RNA_genes 10679 11766
ambiguous_orf 0 56
antisense 0 5179
disrupted_domain 0 1
IG_C_gene 14 16
IG_C_pseudogene 7 7
IG_D_gene 27 27
IG_J_gene 18 18
IG_J_pseudogene 3 3
IG_V_gene 132 133
IG_V_pseudogene 151 151
lincRNA 5472 6559
miRNA 1756 1756
miRNA_pseudogene 15 15
misc_RNA 1187 1187
misc_RNA_pseudogene 3 3
Mt_rRNA 2 2
Mt_tRNA 22 22
Mt_tRNA_pseudogene 580 580
ncrna_host 0 71
non_coding 0 199
nonsense_mediated_decay 0 9261
polymorphic_pseudogene 27 41
processed_pseudogene 0 8832
processed_transcript 5532 29244
protein_coding 20012 77808
pseudogene 11432 917
retained_intron 0 18305
retrotransposed 0 213
rRNA 531 531
rRNA_pseudogene 179 179
scRNA_pseudogene 787 787
sense_intronic 0 406
sense_overlapping 0 19
snoRNA 1521 1521
snoRNA_pseudogene 73 73
snRNA 1944 1944
snRNA_pseudogene 73 73
TEC 0 44
TR_C_gene 5 5
TR_D_gene 3 3
TR_J_gene 74 74
TR_J_pseudogene 4 4
TR_V_gene 97 97
TR_V_pseudogene 27 27
transcribed_processed_pseudogene 0 205
transcribed_unprocessed_pseudogene 0 342
tRNA_pseudogene 128 128
unitary_pseudogene 0 157
unprocessed_pseudogene 0 2185

Version 8 (March 2011 freeze, GRCh37) - Ensembl 63 Download release

General stats

Total No of Genes
51096
Protein-coding genes
20026
Long non-coding RNA genes
10520
Small non-coding RNA genes
8801
Pseudogenes
11375
   - processed pseudogenes:
8384
   - unprocessed pseudogenes:
1865
   - unitary pseudogenes:
95
   - polymorphic pseudogenes:
26
   - pseudogenes:
823
Immunoglobulin/T-cell receptor gene segments
   - protein coding segments:
374
   - pseudogenes:
182
Total No of Transcripts
165067
Protein-coding transcripts
76412
   - full length protein-coding:
59005
   - partial length protein-coding:
17407
Nonsense mediated decay transcripts
8896
Long non-coding RNA loci transcripts
18036
 
 
 
 
Total No of distinct translations
60282
Genes that have more than one distinct translations
13402


Further details on this version's gene and transcript types

biotype genes transcripts
3prime_overlapping_ncrna 0 8
all IG_genes 292 295
all other pseudogenes 11375 12783
all RNA pseudogenes 1838 1838
all RNA_genes 6738 6252
ambiguous_orf 0 54
antisense 0 443
disrupted_domain 0 1
IG_C_gene 16 18
IG_C_pseudogene 7 7
IG_D_gene 30 30
IG_J_gene 83 83
IG_J_pseudogene 3 3
IG_V_gene 163 164
IG_V_pseudogene 151 151
lincRNA 1531 1045
miRNA 1756 1756
miRNA_pseudogene 15 15
misc_RNA 1187 1187
misc_RNA_pseudogene 3 3
Mt_rRNA 2 2
Mt_tRNA 22 22
Mt_tRNA_pseudogene 580 580
ncrna_host 0 69
non_coding 0 341
nonsense_mediated_decay 0 8896
polymorphic_pseudogene 26 40
processed_pseudogene 0 8600
processed_transcript 8989 38384
protein_coding 20026 76412
pseudogene 11167 922
retained_intron 0 17414
retrotransposed 0 222
rRNA 531 531
rRNA_pseudogene 179 179
scRNA_pseudogene 787 787
snoRNA 1521 1521
snoRNA_pseudogene 73 73
snRNA 1944 1944
snRNA_pseudogene 73 73
TEC 0 39
TR_C_gene 3 3
TR_J_gene 13 13
TR_V_gene 66 66
TR_V_pseudogene 21 21
transcribed_processed_pseudogene 0 174
transcribed_unprocessed_pseudogene 0 331
tRNA_pseudogene 128 128
unitary_pseudogene 0 148
unprocessed_pseudogene 0 2164

Version 7 (December 2010 freeze, GRCh37) - Ensembl 62 Download release

General stats

Total No of Genes
51082
Protein-coding genes
20687
Long non-coding RNA genes
9640
Small non-coding RNA genes
8801
Pseudogenes
11580
   - processed pseudogenes:
8298
   - unprocessed pseudogenes:
2117
   - unitary pseudogenes:
138
   - polymorphic pseudogenes:
19
   - pseudogenes:
826
Immunoglobulin/T-cell receptor gene segments
   - protein coding segments:
374
   - pseudogenes:
182
Total No of Transcripts
161375
Protein-coding transcripts
76052
   - full length protein-coding:
59634
   - partial length protein-coding:
16418
Nonsense mediated decay transcripts
8356
Long non-coding RNA loci transcripts
15512
 
 
 
 
Total No of distinct translations
60495
Genes that have more than one distinct translations
13346


Further details on this version's gene and transcript types

biotype genes transcripts
all IG_genes 292 295
all other pseudogenes 11580 12460
all RNA pseudogenes 1838 1838
all RNA_genes 6446 5979
ambiguous_orf 0 54
antisense 0 133
IG_C_gene 16 18
IG_C_pseudogene 7 7
IG_D_gene 30 30
IG_J_gene 83 83
IG_J_pseudogene 3 3
IG_V_gene 163 164
IG_V_pseudogene 151 151
lincRNA 1239 772
miRNA 1756 1756
miRNA_pseudogene 15 15
misc_RNA 1187 1187
misc_RNA_pseudogene 3 3
Mt_rRNA 2 2
Mt_tRNA 22 22
Mt_tRNA_pseudogene 580 580
non_coding 0 326
nonsense_mediated_decay 0 8356
polymorphic_pseudogene 19 29
processed_pseudogene 0 8381
processed_transcript * 8401 37659
protein_coding 20687 76052
pseudogene 11379 920
retained_intron 0 16350
retrotransposed 0 215
rRNA 531 531
rRNA_pseudogene 179 179
scRNA_pseudogene 787 787
snoRNA 1521 1521
snoRNA_pseudogene 73 73
snRNA 1944 1944
snRNA_pseudogene 73 73
TEC 0 35
TR_C_gene 3 3
TR_J_gene 13 13
TR_V_gene 66 66
TR_V_pseudogene 21 21
transcribed_processed_pseudogene 0 160
transcribed_unprocessed_pseudogene 0 307
tRNA_pseudogene 128 128
unitary_pseudogene 0 144
unprocessed_pseudogene 0 2122

* stats are according to gencode.v7.annotation_updated_ncrna_host.gtf file

Version 6 (September 2010 freeze, GRCh37) - Ensembl 61 Download release

General stats

Total No of Genes
51564
Protein-coding genes
20540
Long non-coding RNA genes
10782
Small non-coding RNA genes
8801
Pseudogenes
11068
   - processed pseudogenes:
7970
   - unprocessed pseudogenes:
1806
   - unitary pseudogenes:
94
   - polymorphic pseudogenes:
18
   - pseudogenes:
1000
Immunoglobulin/T-cell receptor gene segments
 
   - protein coding segments:
373
   - pseudogenes:
180
Total No of Transcripts
158489
Protein-coding transcripts
74251
   - full length protein-coding:
58979
   - partial length protein-coding:
15272
Nonsense mediated decay transcripts
7856
Long non-coding RNA loci transcripts
17660
 
 
 
 
Total No of distinct translations
57241
Genes that have more than one distinct translations
12849


Further details on this version's gene and transcript types

biotype genes transcripts
all IG_genes 309 312
all other pseudogenes 11068 13308
all RNA pseudogenes 1838 1838
all RNA_genes 6558 6613
ambiguous_orf 0 55
antisense 0 36
IG_C_gene 16 18
IG_C_pseudogene 7 7
IG_D_gene 30 30
IG_J_gene 83 83
IG_J_pseudogene 3 3
IG_V_gene 180 181
IG_V_pseudogene 151 151
lincRNA 1351 1406
miRNA 1756 1756
miRNA_pseudogene 15 15
misc_RNA 1187 1187
misc_RNA_pseudogene 3 3
Mt_rRNA 2 2
Mt_tRNA 22 22
Mt_tRNA_pseudogene 580 580
ncrna_host 0 1
non_coding 0 293
nonsense_mediated_decay 0 7856
polymorphic_pseudogene 18 27
processed_pseudogene 0 7941
processed_transcript 9431 36860
protein_coding 20540 74251
pseudogene 10870 2381
retained_intron 0 15220
retrotransposed 0 283
rRNA 531 531
rRNA_pseudogene 179 179
scRNA_pseudogene 787 787
snoRNA 1521 1521
snoRNA_pseudogene 73 73
snRNA 1944 1944
snRNA_pseudogene 73 73
TEC 0 26
TR_C_gene 3 3
TR_J_gene 13 13
TR_V_gene 48 48
TR_V_pseudogene 19 19
transcribed_processed_pseudogene 0 144
transcribed_unprocessed_pseudogene 0 284
tRNA_pseudogene 128 128
unitary_pseudogene 0 139
unprocessed_pseudogene 0 1929

Version 3d (October 2009 freeze, GRCh37) - Ensembl 57 Download release

General stats

Total No of Genes
49787
Protein-coding genes
21304
Long non-coding RNA genes
10016
Small non-coding RNA genes
9203
Pseudogenes
8894
   - processed pseudogenes:
6232
   - unprocessed pseudogenes:
1147
   - unitary pseudogenes:
100
   - polymorphic pseudogenes:
0
   - pseudogenes:
1415
Immunoglobulin/T-cell receptor gene segments
   - protein coding segments:
370
   - pseudogenes:
0
Total No of Transcripts
134443
Protein-coding transcripts
67766
   - full length protein-coding:
67766
   - partial length protein-coding:
0
Nonsense mediated decay transcripts
4703
Long non-coding RNA loci transcripts
14132
 
 
 
 
Total No of distinct translations
55103
Genes that have more than one distinct translations
12496

Version 3c (July 2009 freeze, GRCh37) - Ensembl 56 Download release

General stats

Total No of Genes
47553
Protein-coding genes
22550
Long non-coding RNA genes
6496
Small non-coding RNA genes
9243
Pseudogenes
8894
   - processed pseudogenes:
6232
   - unprocessed pseudogenes:
1147
   - unitary pseudogenes:
100
   - polymorphic pseudogenes:
0
   - pseudogenes:
1415
Immunoglobulin/T-cell receptor gene segments
   - protein coding segments:
370
   - pseudogenes:
0
Total No of Transcripts
132067
Protein-coding transcripts
68880
   - full length protein-coding:
67766
   - partial length protein-coding:
1114
Nonsense mediated decay transcripts
4703
Long non-coding RNA loci transcripts
10475
 
 
 
 
Total No of distinct translations
56217
Genes that have more than one distinct translations
12491
 
Cookies policy | Terms & Conditions. This site is hosted by the Wellcome Trust Sanger Institute.