Statistics about the current GENCODE Release (version M19)

These statistics are derived from the GTF file that contains the annotation of the main chromosomes only.

For details about how these statistics are calculated, please see the README_stats.txt file.
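As a rough illustration of how per-biotype counts like the ones below could be tallied from a GTF file, here is a minimal Python sketch. It assumes that gene and transcript records carry quoted gene_type and transcript_type attributes in the ninth column; the function name count_biotypes is hypothetical. It is not the procedure described in README_stats.txt, which remains the authoritative description.

```python
import re
from collections import Counter

# Matches key "value" pairs in the GTF attribute column (column 9).
ATTR_RE = re.compile(r'(\S+) "([^"]*)"')


def count_biotypes(lines):
    """Tally gene and transcript records per biotype from GTF lines.

    Assumes 'gene' rows have a gene_type attribute and 'transcript'
    rows have a transcript_type attribute (an assumption of this sketch).
    """
    genes, transcripts = Counter(), Counter()
    for line in lines:
        # Skip header comments and blank lines.
        if line.startswith("#") or not line.strip():
            continue
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 9:
            continue  # not a well-formed GTF record
        feature = fields[2]
        attrs = dict(ATTR_RE.findall(fields[8]))
        if feature == "gene":
            genes[attrs.get("gene_type", "unknown")] += 1
        elif feature == "transcript":
            transcripts[attrs.get("transcript_type", "unknown")] += 1
    return genes, transcripts
```

The function accepts any iterable of lines, so it can be given an open file handle (for example one returned by gzip.open(path, "rt") for a compressed file) and returns two Counter objects keyed by biotype.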

General stats

Total No of Genes 54446
Protein-coding genes 21969
Long non-coding RNA genes 12840
Small non-coding RNA genes 6108
Pseudogenes 13033
- processed pseudogenes 9772
- unprocessed pseudogenes 2873
- unitary pseudogenes 39
- polymorphic pseudogenes 79
- pseudogenes 67
Immunoglobulin/T-cell receptor gene segments
- protein coding segments 494
- pseudogenes 203
Total No of Transcripts 137767
Protein-coding transcripts 57776
Nonsense mediated decay transcripts 6816
Long non-coding RNA loci transcripts 18065
 
Total No of distinct translations 44448
Genes that have more than one distinct translation 10609

Further details on this version's gene and transcript types

biotype genes transcripts
3prime_overlapping_ncRNA 2 3
antisense 2826 4266
bidirectional_promoter_lncRNA 165 284
IG_C_gene 13 21
IG_C_pseudogene 1 1
IG_D_gene 19 19
IG_D_pseudogene 3 3
IG_J_gene 14 14
IG_LV_gene 4 4
IG_pseudogene 2 2
IG_V_gene 218 301
IG_V_pseudogene 155 155
lincRNA 5559 8598
macro_lncRNA 1 2
miRNA 2202 2202
misc_RNA 562 566
Mt_rRNA 2 2
Mt_tRNA 22 22
non_stop_decay 0 26
nonsense_mediated_decay 0 6816
polymorphic_pseudogene 79 94
processed_pseudogene 9496 9499
processed_transcript 780 15493
protein_coding 21969 57776
pseudogene 67 93
retained_intron 0 20988
ribozyme 22 22
rRNA 354 354
scaRNA 51 51
scRNA 1 1
sense_intronic 320 355
sense_overlapping 27 52
snoRNA 1507 1507
snRNA 1383 1383
sRNA 2 2
TEC 3160 3238
TR_C_gene 8 10
TR_D_gene 4 4
TR_J_gene 70 70
TR_J_pseudogene 10 10
TR_V_gene 144 194
TR_V_pseudogene 34 34
transcribed_processed_pseudogene 276 284
transcribed_unitary_pseudogene 16 16
transcribed_unprocessed_pseudogene 237 250
translated_processed_pseudogene 0 12
translated_unprocessed_pseudogene 1 1
unitary_pseudogene 23 23
unprocessed_pseudogene 2635 2644
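
Several of the summary figures under General stats can be reproduced by adding up gene counts from the biotype table above. The Python sketch below shows that arithmetic. The grouping of biotypes into each summary category is an assumption inferred from the numbers on this page, not taken from README_stats.txt, and the names GROUPS and totals are illustrative.

```python
# Gene counts copied from the biotype table above (genes column only).
genes = {
    "3prime_overlapping_ncRNA": 2, "antisense": 2826,
    "bidirectional_promoter_lncRNA": 165, "lincRNA": 5559,
    "macro_lncRNA": 1, "processed_transcript": 780,
    "sense_intronic": 320, "sense_overlapping": 27, "TEC": 3160,
    "miRNA": 2202, "misc_RNA": 562, "Mt_rRNA": 2, "Mt_tRNA": 22,
    "ribozyme": 22, "rRNA": 354, "scaRNA": 51, "scRNA": 1,
    "snoRNA": 1507, "snRNA": 1383, "sRNA": 2,
    "IG_C_gene": 13, "IG_D_gene": 19, "IG_J_gene": 14, "IG_LV_gene": 4,
    "IG_V_gene": 218, "TR_C_gene": 8, "TR_D_gene": 4, "TR_J_gene": 70,
    "TR_V_gene": 144,
    "processed_pseudogene": 9496, "transcribed_processed_pseudogene": 276,
    "translated_processed_pseudogene": 0,
    "unprocessed_pseudogene": 2635, "transcribed_unprocessed_pseudogene": 237,
    "translated_unprocessed_pseudogene": 1,
    "unitary_pseudogene": 23, "transcribed_unitary_pseudogene": 16,
}

# Assumed grouping of biotypes into summary categories (inferred, not official).
GROUPS = {
    "Long non-coding RNA genes": [
        "3prime_overlapping_ncRNA", "antisense",
        "bidirectional_promoter_lncRNA", "lincRNA", "macro_lncRNA",
        "processed_transcript", "sense_intronic", "sense_overlapping", "TEC",
    ],
    "Small non-coding RNA genes": [
        "miRNA", "misc_RNA", "Mt_rRNA", "Mt_tRNA", "ribozyme", "rRNA",
        "scaRNA", "scRNA", "snoRNA", "snRNA", "sRNA",
    ],
    "IG/TR protein coding segments": [
        "IG_C_gene", "IG_D_gene", "IG_J_gene", "IG_LV_gene", "IG_V_gene",
        "TR_C_gene", "TR_D_gene", "TR_J_gene", "TR_V_gene",
    ],
    "processed pseudogenes": [
        "processed_pseudogene", "transcribed_processed_pseudogene",
        "translated_processed_pseudogene",
    ],
    "unprocessed pseudogenes": [
        "unprocessed_pseudogene", "transcribed_unprocessed_pseudogene",
        "translated_unprocessed_pseudogene",
    ],
    "unitary pseudogenes": [
        "unitary_pseudogene", "transcribed_unitary_pseudogene",
    ],
}

# Sum the gene counts of the member biotypes for each summary category.
totals = {name: sum(genes[b] for b in members) for name, members in GROUPS.items()}

for name, total in totals.items():
    print(f"{name}: {total}")
```

Under this grouping the sums come to 12840, 6108, 494, 9772, 2873 and 39, matching the corresponding General stats entries.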