Human

Statistics about the GENCODE Release 40)

The statistics derive from the gtf file that contains only the annotation of the main chromosomes.

For details about the calculation of these statistics please see the README_stats.txt file.

General stats

Total No of Genes 61544
Protein-coding genes 19988
Long non-coding RNA genes 18805
Small non-coding RNA genes 7567
Pseudogenes 14774
- processed pseudogenes 10661
- unprocessed pseudogenes 3566
- unitary pseudogenes 246
- polymorphic pseudogenes 50
- pseudogenes 15
Immunoglobulin/T-cell receptor gene segments
- protein coding segments 409
- pseudogenes 236
Total No of Transcripts 246624
Protein-coding transcripts 87814
- full length protein-coding 62232
- partial length protein-coding 25582
Nonsense mediated decay transcripts 20254
Long non-coding RNA loci transcripts 53029
 
Total No of distinct translations 64382
Genes that have more than one distinct translations 13594

Further details on this version's gene and transcript types

biotype genes transcripts
IG_C_gene 14 23
IG_C_pseudogene 9 9
IG_D_gene 37 37
IG_J_gene 18 18
IG_J_pseudogene 3 3
IG_pseudogene 1 1
IG_V_gene 145 145
IG_V_pseudogene 187 187
lncRNA 17748 51324
miRNA 1879 1879
misc_RNA 2212 2212
Mt_rRNA 2 2
Mt_tRNA 22 22
non_stop_decay 0 99
nonsense_mediated_decay 0 20254
polymorphic_pseudogene 50 71
processed_pseudogene 10154 10156
processed_transcript 0 30375
protein_coding 19988 87814
pseudogene 15 15
retained_intron 0 32826
ribozyme 8 8
rRNA 47 47
rRNA_pseudogene 497 497
scaRNA 49 49
scRNA 1 1
snoRNA 943 943
snRNA 1901 1901
sRNA 5 5
TEC 1057 1147
TR_C_gene 6 6
TR_D_gene 4 4
TR_J_gene 79 79
TR_J_pseudogene 4 4
TR_V_gene 106 106
TR_V_pseudogene 33 33
transcribed_processed_pseudogene 505 505
transcribed_unitary_pseudogene 149 151
transcribed_unprocessed_pseudogene 954 954
translated_processed_pseudogene 2 2
translated_unprocessed_pseudogene 2 2
unitary_pseudogene 97 96
unprocessed_pseudogene 2610 2611
vault_RNA 1 1