Human protein-coding genes and gene feature statistics in 2019.
Allison PiovesanFrancesca AntonarosLorenza VitalePierluigi StrippoliMaria Chiara PelleriMaria CaracausiPublished in: BMC research notes (2019)
Comparison with previous reports reveals substantial change in the number of known nuclear protein-coding genes (now 19,116), the protein-coding non-redundant transcriptome space [now 59,281,518 base pair (bp), 10.1% increase], the number of exons (now 562,164, 36.2% increase) due to a relevant increase of the RNA isoforms recorded. Other parameters such as gene, exon or intron mean and extreme length appear to have reached a stability that is unlikely to be substantially modified by human genome data updates, at least regarding protein-coding genes. Finally, we confirm that there are no human introns shorter than 30 bp.
Keyphrases
- genome wide
- endothelial cells
- genome wide identification
- protein protein
- induced pluripotent stem cells
- dna methylation
- pluripotent stem cells
- copy number
- amino acid
- machine learning
- gene expression
- genome wide analysis
- emergency department
- small molecule
- bioinformatics analysis
- transcription factor
- big data
- deep learning
- electronic health record
- nucleic acid