Human

Statistics about the current GENCODE Release (version 50)

The statistics derive from the gtf file that contains only the annotation of the main chromosomes.

For details about the calculation of these statistics please see the README_stats.txt file.

General stats

Total No of Genes 78733
Protein-coding genes 19442
- readthrough genes (not included) 665
Long non-coding RNA genes 35885
Small non-coding RNA genes 7608
Pseudogenes 14702
- processed pseudogenes 10634
- unprocessed pseudogenes 3535
- unitary pseudogenes 296
Immunoglobulin/T-cell receptor gene segments
- protein coding segments 412
- pseudogenes 237
Total No of Transcripts 644292
Protein-coding transcripts 278455
- full length protein-coding 253680
- partial length protein-coding 24775
Nonsense mediated decay transcripts 91818
Long non-coding RNA loci transcripts 191063
 
Total No of distinct translations 172117
Genes that have more than one distinct translations 16058

Further details on this version's gene and transcript types

biotype genes transcripts
artifact 19 19
IG_C_gene 14 23
IG_C_pseudogene 9 9
IG_D_gene 37 37
IG_J_gene 18 18
IG_J_pseudogene 3 3
IG_pseudogene 1 1
IG_V_gene 146 146
IG_V_pseudogene 187 187
lncRNA 34866 189136
miRNA 1878 1878
misc_RNA 2207 2207
Mt_rRNA 2 2
Mt_tRNA 22 22
non_stop_decay 0 106
nonsense_mediated_decay 0 91818
processed_pseudogene 9484 9485
processed_transcript 0 12
protein_coding 20107 278455
protein_coding_CDS_not_defined 0 26568
protein_coding_LoF 0 86
retained_intron 0 34250
ribozyme 8 8
rRNA 47 47
rRNA_pseudogene 497 497
scaRNA 52 52
snoRNA 985 985
snRNA 1901 1901
sRNA 5 5
TEC 1019 1108
TR_C_gene 6 6
TR_D_gene 5 5
TR_J_gene 79 79
TR_J_pseudogene 4 4
TR_V_gene 107 107
TR_V_pseudogene 33 33
transcribed_processed_pseudogene 1148 1148
transcribed_unitary_pseudogene 206 206
transcribed_unprocessed_pseudogene 1584 1586
translated_processed_pseudogene 2 2
unitary_pseudogene 90 90
unprocessed_pseudogene 1951 1951
vault_RNA 4 4