Statistics about the current GENCODE Release (version M26)

The statistics derive from the gtf file that contains only the annotation of the main chromosomes.

For details about the calculation of these statistics please see the README_stats.txt file.

General stats

Total No of Genes 53647
Protein-coding genes 21848
Long non-coding RNA genes 13186
Small non-coding RNA genes 4394
Pseudogenes 13724
- processed pseudogenes 10288
- unprocessed pseudogenes 2992
- unitary pseudogenes 87
- polymorphic pseudogenes 88
- pseudogenes 62
Immunoglobulin/T-cell receptor gene segments
- protein coding segments 494
- pseudogenes 207
Total No of Transcripts 140670
Protein-coding transcripts 59152
- full length protein-coding 45389
- partial length protein-coding 13763
Nonsense mediated decay transcripts 7201
Long non-coding RNA loci transcripts 18833
 
Total No of distinct translations 45391
Genes that have more than one distinct translations 10956

Further details on this version's gene and transcript types

biotype genes transcripts
IG_C_gene 13 22
IG_C_pseudogene 1 1
IG_D_gene 19 20
IG_D_pseudogene 4 4
IG_J_gene 14 18
IG_LV_gene 4 4
IG_pseudogene 1 1
IG_V_gene 218 306
IG_V_pseudogene 158 158
lncRNA 9948 15105
miRNA 1256 1256
misc_RNA 36 36
Mt_rRNA 2 2
Mt_tRNA 22 22
non_stop_decay 0 25
nonsense_mediated_decay 0 7201
polymorphic_pseudogene 88 100
processed_pseudogene 9988 9990
processed_transcript 0 15089
protein_coding 21848 59152
pseudogene 62 86
retained_intron 0 21920
ribozyme 21 21
rRNA 84 84
scaRNA 36 36
scRNA 1 1
snoRNA 1610 1610
snRNA 1308 1308
TEC 3238 3324
TR_C_gene 8 10
TR_D_gene 4 5
TR_J_gene 70 76
TR_J_pseudogene 10 10
TR_V_gene 144 194
TR_V_pseudogene 34 34
transcribed_processed_pseudogene 300 304
transcribed_unitary_pseudogene 26 30
transcribed_unprocessed_pseudogene 271 285
translated_processed_pseudogene 0 12
translated_unprocessed_pseudogene 2 2
unitary_pseudogene 61 61
unprocessed_pseudogene 2719 2727
vault_RNA 2 2
Y_RNA 16 16