Statistics about the GENCODE Release M24

The statistics are derived from the GTF file that contains only the annotation of the main chromosomes.

For details about the calculation of these statistics please see the README_stats.txt file.

General stats

Total No of Genes 55385
Protein-coding genes 21856
Long non-coding RNA genes 13197
Small non-coding RNA genes 6108
Pseudogenes 13728
- processed pseudogenes 10302
- unprocessed pseudogenes 2989
- unitary pseudogenes 83
- polymorphic pseudogenes 88
- pseudogenes 60
Immunoglobulin/T-cell receptor gene segments
- protein coding segments 494
- pseudogenes 206
Total No of Transcripts 142552
Protein-coding transcripts 59252
- full length protein-coding 45442
- partial length protein-coding 13810
Nonsense mediated decay transcripts 7205
Long non-coding RNA loci transcripts 18864
 
Total No of distinct translations 45465
Genes that have more than one distinct translation 11000
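
The totals above can be cross-checked with simple arithmetic. The Python sketch below uses only figures copied from the list. The variable names are illustrative, and the grouping (in particular, counting the 206 immunoglobulin/T-cell receptor pseudogenes towards the pseudogene total) is an assumption inferred from the numbers, not something stated in README_stats.txt.

```python
# Figures copied from the "General stats" list; names are illustrative.
pseudogene_parts = {
    "processed": 10302,
    "unprocessed": 2989,
    "unitary": 83,
    "polymorphic": 88,
    "pseudogenes": 60,
    # Assumption: IG/TR segment pseudogenes count towards the total.
    "IG/TR pseudogenes": 206,
}
pseudogene_total = sum(pseudogene_parts.values())

# Full length plus partial length protein-coding transcripts.
protein_coding_transcripts = 45442 + 13810

gene_categories = {
    "protein-coding": 21856,
    "long non-coding RNA": 13197,
    "small non-coding RNA": 6108,
    "pseudogenes": 13728,
    "IG/TR protein coding segments": 494,
}
# Difference between the stated gene total and the category sum.
gene_gap = 55385 - sum(gene_categories.values())

print(pseudogene_total, protein_coding_transcripts, gene_gap)
```

Under this grouping the pseudogene total (13728) and the protein-coding transcript total (59252) are reproduced exactly, while the five gene categories sum to 2 fewer than the stated 55385. The sketch reports that gap without attempting to explain it.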

Further details on this version's gene and transcript types

biotype                              genes  transcripts
IG_C_gene                               13           21
IG_C_pseudogene                          1            1
IG_D_gene                               19           19
IG_D_pseudogene                          3            3
IG_J_gene                               14           14
IG_LV_gene                               4            4
IG_pseudogene                            2            2
IG_V_gene                              218          301
IG_V_pseudogene                        158          158
lncRNA                                9959        15129
miRNA                                 2202         2202
misc_RNA                               562          566
Mt_rRNA                                  2            2
Mt_tRNA                                 22           22
non_stop_decay                           0           25
nonsense_mediated_decay                  0         7205
polymorphic_pseudogene                  88          102
processed_pseudogene                 10002        10005
processed_transcript                     0        15135
protein_coding                       21856        59252
pseudogene                              60           86
retained_intron                          0        21915
ribozyme                                22           22
rRNA                                   354          354
scaRNA                                  51           51
scRNA                                    1            1
snoRNA                                1507         1507
snRNA                                 1383         1383
sRNA                                     2            2
TEC                                   3238         3324
TR_C_gene                                8           10
TR_D_gene                                4            4
TR_J_gene                               70           70
TR_J_pseudogene                         10           10
TR_V_gene                              144          194
TR_V_pseudogene                         34           34
transcribed_processed_pseudogene       300          304
transcribed_unitary_pseudogene          25           29
transcribed_unprocessed_pseudogene     272          286
translated_processed_pseudogene          0           12
translated_unprocessed_pseudogene        1            1
unitary_pseudogene                      58           58
unprocessed_pseudogene                2716         2727
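
A per-biotype tally like the table above could be produced by counting the gene_type and transcript_type attributes on the gene and transcript records of the GTF file. The Python sketch below is one way to do that, and is not necessarily the procedure described in README_stats.txt. The function name count_biotypes and the sample records (with made-up identifiers) are hypothetical.

```python
import re
from collections import Counter

# Matches quoted GTF attributes such as: gene_type "protein_coding";
ATTR_RE = re.compile(r'(\S+) "([^"]*)"')


def count_biotypes(lines):
    """Count gene_type on gene records and transcript_type on transcript records."""
    genes, transcripts = Counter(), Counter()
    for line in lines:
        if line.startswith("#"):
            continue  # skip header/comment lines
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 9:
            continue  # not a complete GTF record
        attrs = dict(ATTR_RE.findall(fields[8]))
        if fields[2] == "gene":
            genes[attrs.get("gene_type", "unknown")] += 1
        elif fields[2] == "transcript":
            transcripts[attrs.get("transcript_type", "unknown")] += 1
    return genes, transcripts


def make_record(feature, attributes):
    # Hypothetical helper: builds a GTF line with placeholder coordinates.
    return "\t".join(["chr1", "HAVANA", feature, "1", "100", ".", "+", ".", attributes])


# Made-up records for illustration only; the IDs are not real annotation.
sample = [
    "##description: example header",
    make_record("gene", 'gene_id "G1"; gene_type "protein_coding";'),
    make_record("transcript", 'gene_id "G1"; transcript_id "T1"; '
                'gene_type "protein_coding"; transcript_type "protein_coding";'),
    make_record("transcript", 'gene_id "G1"; transcript_id "T2"; '
                'gene_type "protein_coding"; transcript_type "retained_intron";'),
    make_record("exon", 'gene_id "G1"; transcript_id "T1";'),
]

gene_counts, transcript_counts = count_biotypes(sample)
print(dict(gene_counts))
print(dict(transcript_counts))
```

Counting genes and transcripts separately is consistent with rows such as retained_intron, which have 0 genes but many transcripts: in the sample, a transcript of a protein_coding gene carries its own transcript_type.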