Statistics about the GENCODE Release M29)

The statistics derive from the gtf file that contains only the annotation of the main chromosomes.

For details about the calculation of these statistics please see the README_stats.txt file.

General stats

Total No of Genes 55357
Protein-coding genes 21833
Long non-coding RNA genes 13186
Small non-coding RNA genes 6105
Pseudogenes 13738
- processed pseudogenes 10298
- unprocessed pseudogenes 2992
- unitary pseudogenes 88
- polymorphic pseudogenes 91
- pseudogenes 62
Immunoglobulin/T-cell receptor gene segments
- protein coding segments 494
- pseudogenes 207
Total No of Transcripts 142379
Protein-coding transcripts 59138
- full length protein-coding 45388
- partial length protein-coding 13750
Nonsense mediated decay transcripts 7205
Long non-coding RNA loci transcripts 18831
 
Total No of distinct translations 45370
Genes that have more than one distinct translations 10951

Further details on this version's gene and transcript types

biotype genes transcripts
IG_C_gene 13 22
IG_C_pseudogene 1 1
IG_D_gene 19 19
IG_D_pseudogene 4 4
IG_J_gene 14 14
IG_LV_gene 4 4
IG_pseudogene 1 1
IG_V_gene 218 306
IG_V_pseudogene 158 158
lncRNA 9949 15104
miRNA 2201 2201
misc_RNA 562 566
Mt_rRNA 2 2
Mt_tRNA 22 22
non_stop_decay 0 26
nonsense_mediated_decay 0 7205
polymorphic_pseudogene 91 106
processed_pseudogene 9998 10000
processed_transcript 0 15098
protein_coding 21833 59138
pseudogene 62 83
retained_intron 0 21926
ribozyme 22 22
rRNA 354 354
scaRNA 51 51
scRNA 1 1
snoRNA 1507 1507
snRNA 1381 1381
sRNA 2 2
TEC 3237 3323
TR_C_gene 8 11
TR_D_gene 4 4
TR_J_gene 70 70
TR_J_pseudogene 10 10
TR_V_gene 144 194
TR_V_pseudogene 34 34
transcribed_processed_pseudogene 300 304
transcribed_unitary_pseudogene 27 31
transcribed_unprocessed_pseudogene 272 285
translated_unprocessed_pseudogene 2 2
unitary_pseudogene 61 61
unprocessed_pseudogene 2718 2726