Statistics about the GENCODE Release M22

These statistics are derived from the GTF file that contains the annotation of the main chromosomes only.

For details on how these statistics are calculated, please see the README_stats.txt file.
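As a rough illustration of how per-biotype counts can be obtained from a GTF file, the sketch below tallies "gene" and "transcript" records by their type attribute. It assumes the usual GENCODE GTF layout (nine tab-separated columns, feature type in column 3, and gene_type / transcript_type keys in the attribute column). The function name count_biotypes and the toy records are hypothetical and are not taken from README_stats.txt, which remains the authoritative description of the calculation.

```python
import re
from collections import Counter

# Matches key "value" pairs in the GTF attribute column,
# e.g. gene_type "lncRNA"; unquoted numeric attributes are ignored.
ATTR_RE = re.compile(r'(\S+) "([^"]*)"')


def count_biotypes(lines):
    """Count gene and transcript records per biotype from GTF lines."""
    genes, transcripts = Counter(), Counter()
    for line in lines:
        # Skip header/comment lines and blank lines.
        if line.startswith("#") or not line.strip():
            continue
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 9:
            continue  # not a well-formed GTF record
        feature = fields[2]
        attrs = dict(ATTR_RE.findall(fields[8]))
        if feature == "gene":
            genes[attrs.get("gene_type", "unknown")] += 1
        elif feature == "transcript":
            transcripts[attrs.get("transcript_type", "unknown")] += 1
    return genes, transcripts


# Toy records (made up for illustration, not real annotation).
SAMPLE = [
    "##description: toy example\n",
    'chr1\tHAVANA\tgene\t1\t100\t.\t+\t.\tgene_id "G1"; gene_type "protein_coding";\n',
    'chr1\tHAVANA\ttranscript\t1\t100\t.\t+\t.\tgene_id "G1"; transcript_id "T1"; '
    'gene_type "protein_coding"; transcript_type "protein_coding";\n',
    'chr1\tHAVANA\ttranscript\t1\t90\t.\t+\t.\tgene_id "G1"; transcript_id "T2"; '
    'gene_type "protein_coding"; transcript_type "retained_intron";\n',
    'chr1\tHAVANA\texon\t1\t50\t.\t+\t.\tgene_id "G1"; transcript_id "T1";\n',
]

sample_genes, sample_transcripts = count_biotypes(SAMPLE)
print(dict(sample_genes))
print(dict(sample_transcripts))
```

In the toy input, one protein-coding gene carries a transcript of a different type. Under the assumptions above, this would explain how a biotype such as retained_intron can have transcripts without having any genes of its own. To run the sketch on a real file, pass an open file handle instead of SAMPLE.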

General stats

Total No of Genes 55487
Protein-coding genes 21823
Long non-coding RNA genes 13450
Small non-coding RNA genes 6108
Pseudogenes 13610
- processed pseudogenes 10262
- unprocessed pseudogenes 2931
- unitary pseudogenes 65
- polymorphic pseudogenes 86
- pseudogenes 60
Immunoglobulin/T-cell receptor gene segments
- protein coding segments 494
- pseudogenes 206

Total No of Transcripts 142238
Protein-coding transcripts 58899
- full length protein-coding 45140
- partial length protein-coding 13759
Nonsense mediated decay transcripts 7185
Long non-coding RNA loci transcripts 19112
 
Total No of distinct translations 45129
Genes that have more than one distinct translation 10928

Further details on this version's gene and transcript types

biotype genes transcripts
IG_C_gene 13 21
IG_C_pseudogene 1 1
IG_D_gene 19 19
IG_D_pseudogene 3 3
IG_J_gene 14 14
IG_LV_gene 4 4
IG_pseudogene 2 2
IG_V_gene 218 301
IG_V_pseudogene 158 158
lncRNA 10209 30462
miRNA 2202 2202
misc_RNA 562 566
Mt_rRNA 2 2
Mt_tRNA 22 22
non_stop_decay 0 25
nonsense_mediated_decay 0 7185
polymorphic_pseudogene 86 100
processed_pseudogene 9963 9966
protein_coding 21823 58899
pseudogene 60 86
retained_intron 0 21896
ribozyme 22 22
rRNA 354 354
scaRNA 51 51
scRNA 1 1
snoRNA 1507 1507
snRNA 1383 1383
sRNA 2 2
TEC 3241 3326
TR_C_gene 8 10
TR_D_gene 4 4
TR_J_gene 70 70
TR_J_pseudogene 10 10
TR_V_gene 144 194
TR_V_pseudogene 34 34
transcribed_processed_pseudogene 299 303
transcribed_unitary_pseudogene 22 24
transcribed_unprocessed_pseudogene 265 278
translated_processed_pseudogene 0 12
translated_unprocessed_pseudogene 1 1
unitary_pseudogene 43 43
unprocessed_pseudogene 2665 2675
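
The per-biotype rows can be cross-checked against the general stats. The sketch below sums the gene column for several groups of biotypes and compares each sum with the published figure. The groupings (for example, lncRNA plus TEC for long non-coding RNA genes) are an assumption inferred from the arithmetic, not a definition taken from README_stats.txt. The helper names parse and gene_sum are hypothetical.

```python
import re

# Gene and transcript counts per biotype, copied from the table above.
TABLE = """\
IG_C_gene 13 21
IG_C_pseudogene 1 1
IG_D_gene 19 19
IG_D_pseudogene 3 3
IG_J_gene 14 14
IG_LV_gene 4 4
IG_pseudogene 2 2
IG_V_gene 218 301
IG_V_pseudogene 158 158
lncRNA 10209 30462
miRNA 2202 2202
misc_RNA 562 566
Mt_rRNA 2 2
Mt_tRNA 22 22
non_stop_decay 0 25
nonsense_mediated_decay 0 7185
polymorphic_pseudogene 86 100
processed_pseudogene 9963 9966
protein_coding 21823 58899
pseudogene 60 86
retained_intron 0 21896
ribozyme 22 22
rRNA 354 354
scaRNA 51 51
scRNA 1 1
snoRNA 1507 1507
snRNA 1383 1383
sRNA 2 2
TEC 3241 3326
TR_C_gene 8 10
TR_D_gene 4 4
TR_J_gene 70 70
TR_J_pseudogene 10 10
TR_V_gene 144 194
TR_V_pseudogene 34 34
transcribed_processed_pseudogene 299 303
transcribed_unitary_pseudogene 22 24
transcribed_unprocessed_pseudogene 265 278
translated_processed_pseudogene 0 12
translated_unprocessed_pseudogene 1 1
unitary_pseudogene 43 43
unprocessed_pseudogene 2665 2675
"""


def parse(table):
    """Return {biotype: (genes, transcripts)} from whitespace-separated rows."""
    rows = {}
    for line in table.strip().splitlines():
        biotype, genes, transcripts = line.split()
        rows[biotype] = (int(genes), int(transcripts))
    return rows


ROWS = parse(TABLE)


def gene_sum(*biotypes):
    """Sum the gene counts of the named biotypes."""
    return sum(ROWS[b][0] for b in biotypes)


# Assumed groupings (inferred, not documented) mapped to published values.
checks = {
    "total genes": (sum(g for g, _ in ROWS.values()), 55487),
    "total transcripts": (sum(t for _, t in ROWS.values()), 142238),
    "lncRNA genes": (gene_sum("lncRNA", "TEC"), 13450),
    "small ncRNA genes": (gene_sum(
        "miRNA", "misc_RNA", "Mt_rRNA", "Mt_tRNA", "ribozyme", "rRNA",
        "scaRNA", "scRNA", "snoRNA", "snRNA", "sRNA"), 6108),
    "processed pseudogenes": (gene_sum(
        "processed_pseudogene", "transcribed_processed_pseudogene",
        "translated_processed_pseudogene"), 10262),
    "unprocessed pseudogenes": (gene_sum(
        "unprocessed_pseudogene", "transcribed_unprocessed_pseudogene",
        "translated_unprocessed_pseudogene"), 2931),
    "unitary pseudogenes": (gene_sum(
        "unitary_pseudogene", "transcribed_unitary_pseudogene"), 65),
    "IG/TR protein coding segments": (sum(
        g for b, (g, _) in ROWS.items()
        if re.fullmatch(r"(IG|TR)_[A-Z]+_gene", b)), 494),
}

for name, (computed, published) in checks.items():
    status = "ok" if computed == published else "MISMATCH"
    print(f"{name}: computed {computed}, published {published} ({status})")
```

A mismatch would mean either that the assumed grouping is wrong or that a row was mistranscribed. It would not by itself indicate an error in the annotation.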