Statistics about GENCODE Release M9

These statistics are derived from the GTF file that contains only the annotation of the main chromosomes.

For details about the calculation of these statistics, please see the README_stats.txt file.
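
As a rough illustration of how per-biotype counts can be obtained from a GTF file, the sketch below tallies "gene" and "transcript" records by their gene_type and transcript_type attributes. It is not the procedure described in README_stats.txt, which remains the authority. The function name count_biotypes and the sample records (IDs G1, G2, T1 to T3) are hypothetical and only make the example self-contained.

```python
import re
from collections import Counter

# Matches key "value" pairs in the GTF attribute column (column 9).
ATTR_RE = re.compile(r'(\S+) "([^"]*)"')

def count_biotypes(lines):
    """Count gene and transcript records per biotype in GTF-formatted lines."""
    genes, transcripts = Counter(), Counter()
    for line in lines:
        # Skip blank lines and header/comment lines.
        if not line.strip() or line.startswith("#"):
            continue
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 9:
            continue
        feature = fields[2]
        attrs = dict(ATTR_RE.findall(fields[8]))
        if feature == "gene":
            genes[attrs.get("gene_type", "unknown")] += 1
        elif feature == "transcript":
            transcripts[attrs.get("transcript_type", "unknown")] += 1
    return genes, transcripts

# Hypothetical toy records, not taken from the real annotation.
SAMPLE = [
    "##description: toy example\n",
    'chr1\tHAVANA\tgene\t1\t900\t.\t+\t.\tgene_id "G1"; gene_type "protein_coding";\n',
    'chr1\tHAVANA\ttranscript\t1\t900\t.\t+\t.\tgene_id "G1"; transcript_id "T1"; gene_type "protein_coding"; transcript_type "protein_coding";\n',
    'chr1\tHAVANA\ttranscript\t1\t700\t.\t+\t.\tgene_id "G1"; transcript_id "T2"; gene_type "protein_coding"; transcript_type "retained_intron";\n',
    'chr1\tHAVANA\texon\t1\t200\t.\t+\t.\tgene_id "G1"; transcript_id "T1"; gene_type "protein_coding"; transcript_type "protein_coding";\n',
    'chr2\tHAVANA\tgene\t50\t400\t.\t-\t.\tgene_id "G2"; gene_type "lincRNA";\n',
    'chr2\tHAVANA\ttranscript\t50\t400\t.\t-\t.\tgene_id "G2"; transcript_id "T3"; gene_type "lincRNA"; transcript_type "lincRNA";\n',
]

genes, transcripts = count_biotypes(SAMPLE)
```

Because the function only iterates over lines, an open file handle for an uncompressed GTF file can be passed in place of the sample list. This also shows why a biotype such as retained_intron can have transcripts but no genes: it is assigned at the transcript level only.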

General stats

Total number of genes 47643
Protein-coding genes 21971
Long non-coding RNA genes 9436
Small non-coding RNA genes 6109
Pseudogenes 9631
- processed pseudogenes 6775
- unprocessed pseudogenes 2477
- unitary pseudogenes 21
- polymorphic pseudogenes 39
- pseudogenes 116
Immunoglobulin/T-cell receptor gene segments
- protein coding segments 494
- pseudogenes 203
Total number of transcripts 115125
Protein-coding transcripts 51254
- full length protein-coding 39914
- partial length protein-coding 11340
Nonsense mediated decay transcripts 5375
Long non-coding RNA loci transcripts 13046
 
Total number of distinct translations 41000
Genes that have more than one distinct translation 9084
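
The figures above can be cross-checked with simple arithmetic. The sketch below uses only numbers listed in this section. The variable names are illustrative, and the reading that the pseudogene total also includes the immunoglobulin/T-cell receptor pseudogenes is an inference from the sums, not a statement taken from README_stats.txt.

```python
# Pseudogene subcategories as listed under "Pseudogenes".
pseudogene_parts = {
    "processed": 6775,
    "unprocessed": 2477,
    "unitary": 21,
    "polymorphic": 39,
    "pseudogenes": 116,
}
ig_tr_pseudogenes = 203   # IG/TR gene segments, pseudogenes
pseudogenes_total = 9631  # "Pseudogenes" line

# The five listed subcategories alone fall short of the stated total.
listed_subtotal = sum(pseudogene_parts.values())

# Adding the IG/TR pseudogenes closes the gap (inferred, see lead-in).
with_ig_tr = listed_subtotal + ig_tr_pseudogenes

# Protein-coding transcripts split into full-length and partial-length.
full_length = 39914
partial_length = 11340
protein_coding_transcripts = full_length + partial_length

print(listed_subtotal, with_ig_tr, protein_coding_transcripts)
```

The five listed pseudogene subcategories sum to 9428, and adding the 203 immunoglobulin/T-cell receptor pseudogenes gives the stated 9631. The full-length and partial-length protein-coding transcripts sum to the stated 51254.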

Further details on this version's gene and transcript types

biotype genes transcripts
3prime_overlapping_ncRNA 2 3
antisense 2243 3308
bidirectional_promoter_lncRNA 19 43
IG_C_gene 13 18
IG_C_pseudogene 1 1
IG_D_gene 19 19
IG_D_pseudogene 3 3
IG_J_gene 14 14
IG_LV_gene 4 4
IG_pseudogene 2 2
IG_V_gene 218 301
IG_V_pseudogene 155 155
lincRNA 3707 5570
macro_lncRNA 1 2
miRNA 2202 2202
misc_RNA 563 566
Mt_rRNA 2 2
Mt_tRNA 22 22
non_stop_decay 0 17
nonsense_mediated_decay 0 5375
polymorphic_pseudogene 39 44
processed_pseudogene 6598 6599
processed_transcript 747 13468
protein_coding 21971 51254
pseudogene 116 135
retained_intron 0 16805
ribozyme 22 22
rRNA 354 354
scaRNA 51 51
scRNA 1 1
sense_intronic 258 282
sense_overlapping 24 48
snoRNA 1508 1508
snRNA 1382 1382
sRNA 2 2
TEC 2435 2506
TR_C_gene 8 10
TR_D_gene 4 4
TR_J_gene 70 70
TR_J_pseudogene 10 10
TR_V_gene 144 194
TR_V_pseudogene 34 34
transcribed_processed_pseudogene 177 182
transcribed_unitary_pseudogene 3 3
transcribed_unprocessed_pseudogene 158 173
translated_processed_pseudogene 0 12
translated_unprocessed_pseudogene 1 1
unitary_pseudogene 18 18
unprocessed_pseudogene 2318 2326
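
The gene counts in the table above can be added up into the categories of the general stats. The sketch below parses rows in the same "biotype genes transcripts" layout and sums the gene column per group. The grouping of biotypes in GROUPS is an assumption, chosen because it reproduces the summary figures. README_stats.txt defines the actual grouping, and the names parse_rows and GROUPS are illustrative.

```python
# Rows copied from the table above (only the biotypes used in the groups below).
TABLE = """\
3prime_overlapping_ncRNA 2 3
antisense 2243 3308
bidirectional_promoter_lncRNA 19 43
IG_C_gene 13 18
IG_D_gene 19 19
IG_J_gene 14 14
IG_LV_gene 4 4
IG_V_gene 218 301
lincRNA 3707 5570
macro_lncRNA 1 2
miRNA 2202 2202
misc_RNA 563 566
Mt_rRNA 2 2
Mt_tRNA 22 22
processed_transcript 747 13468
ribozyme 22 22
rRNA 354 354
scaRNA 51 51
scRNA 1 1
sense_intronic 258 282
sense_overlapping 24 48
snoRNA 1508 1508
snRNA 1382 1382
sRNA 2 2
TEC 2435 2506
TR_C_gene 8 10
TR_D_gene 4 4
TR_J_gene 70 70
TR_V_gene 144 194
"""

def parse_rows(text):
    """Parse 'biotype genes transcripts' rows into {biotype: (genes, transcripts)}."""
    rows = {}
    for line in text.splitlines():
        parts = line.split()
        # Skip blank lines and any header row whose counts are not numeric.
        if len(parts) != 3 or not (parts[1].isdigit() and parts[2].isdigit()):
            continue
        rows[parts[0]] = (int(parts[1]), int(parts[2]))
    return rows

# Assumed grouping of biotypes into summary categories (not taken from the source).
GROUPS = {
    "small_ncRNA": ["miRNA", "misc_RNA", "Mt_rRNA", "Mt_tRNA", "ribozyme", "rRNA",
                    "scaRNA", "scRNA", "snoRNA", "snRNA", "sRNA"],
    "lncRNA": ["3prime_overlapping_ncRNA", "antisense", "bidirectional_promoter_lncRNA",
               "lincRNA", "macro_lncRNA", "processed_transcript", "sense_intronic",
               "sense_overlapping", "TEC"],
    "IG_TR_segments": ["IG_C_gene", "IG_D_gene", "IG_J_gene", "IG_LV_gene", "IG_V_gene",
                       "TR_C_gene", "TR_D_gene", "TR_J_gene", "TR_V_gene"],
}

rows = parse_rows(TABLE)
gene_totals = {name: sum(rows[b][0] for b in members) for name, members in GROUPS.items()}
print(gene_totals)
```

Under this grouping the gene totals are 6109 small non-coding RNA genes, 9436 long non-coding RNA genes and 494 protein coding immunoglobulin/T-cell receptor gene segments, matching the general stats. Only the gene column is summed here. The transcript figures in the general stats are not reproduced by the same per-biotype sums.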