Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Ribosome Profiling

Ribosome Profiling

Ribosome profiling or ribo-sequencing is a deep sequencing technique that produces a snapshot of active translation in a cell. It selectively sequences the mRNAs protected by ribosomes to get an insight into a cell’s translation landscape at any given point in time.
Applications of ribosome profiling
Ribosome profiling has many applications, including in vivo monitoring of translation inside a particular organ or tissue type and quantifying new protein synthesis levels.
The technique helps...

lncRNA - Long Non-coding RNAs

lncRNA - Long Non-coding RNAs

In humans, more than 80% of the genome gets transcribed. However, only around 2% of the genome codes for proteins. The remaining part produces non-coding RNAs which includes ribosomal RNAs, transfer RNAs, telomerase RNAs, and regulatory RNAs, among other types. A large number of regulatory non-coding RNAs have been classified into two groups depending upon their length – small non-coding RNAs, such as microRNA, which are less than 200 nucleotides in length, and long non-coding RNA (lncRNA)...

lncRNA - Long Non-coding RNAs

lncRNA - Long Non-coding RNAs

In humans, more than 80% of the genome gets transcribed. However, only around 2% of the genome codes for proteins. The remaining part produces non-coding RNAs which includes ribosomal RNAs, transfer RNAs, telomerase RNAs, and regulatory RNAs, among other types. A large number of regulatory non-coding RNAs have been classified into two groups depending upon their length – small non-coding RNAs, such as microRNA, which are less than 200 nucleotides in length, and long non-coding RNA (lncRNA)...

Genome Annotation and Assembly

Genome Annotation and Assembly

The genome refers to all of the genetic material in an organism. It can range from a few million base pairs in microbial cells to several billion base pairs in many eukaryotic organisms. Genome assembly refers to the process of taking the DNA sequencing data and putting it all back together in a correct order to create a close representation of the original genome. This is followed by the identification of functional elements on the newly assembled genome, a process called genome annotation.

RNA-seq

RNA-seq

RNA sequencing, or RNA-Seq, is a high-throughput sequencing technology used to study the transcriptome of a cell. Transcriptomics helps to interpret the functional elements of a genome and identify the molecular constituents of an organism. Additionally, it also helps in understanding the development of an organism and the occurrence of diseases.
Before the discovery of RNA-seq, microarray-based methods and Sanger sequencing were used for transcriptome analysis. However, while microarray-based...

Multi-species Conserved Sequences

Multi-species Conserved Sequences

Next-generation sequencing technologies have created large genomic databases of a variety of animals and plants. Ever since the human genome project was completed, scientists studied the genome of primates, mammals, and other phylogenetically distant living beings. Such large-scale studies have provided new insights into the evolutionary relationship between organisms.
Although the genome of each species varies greatly from each other, a few sequences are highly conserved. Such conserved DNA...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

A visualization tool for individual gene expression profiles among males and females in GTEx tissues.

Biology of sex differences·2025

Same author

Housekeeping protein-coding genes interrogated with tissue and individual variations.

Scientific reports·2024

Same author

TEx-MST: tissue expression profiles of MANE select transcripts.

Database : the journal of biological databases and curation·2022

Same author

Dominant transcript expression profiles of human protein-coding genes interrogated with GTEx dataset.

Scientific reports·2022

Same author

Top-ranked expressed gene transcripts of human protein-coding genes investigated with GTEx dataset.

Scientific reports·2020

Same author

Overlapping protein-coding genes in human genome and their coincidental expression in tissues.

Scientific reports·2019

Same journal

Application of ephrin-B2 loaded glycol chitosan-silk fibroin hydrogel in the treatment of diabetic refractory wounds.

Scientific reports·2026

Same journal

International expert Delphi consensus on thromboprophylaxis in metabolic and bariatric surgery.

Scientific reports·2026

Same journal

Assessing the cross-region knowledge transfer capability of selected deep learning building vectorization methods in the context of available training datasets.

Scientific reports·2026

Same journal

Feasibility and preliminary effects of outdoor versus indoor cognitive-motor therapy in women with Alzheimer's disease: A randomized single-blind pilot study.

Scientific reports·2026

Same journal

Hallmarks of social action in the vocal turn-taking of wild common marmosets (Callithrix jacchus).

Scientific reports·2026

Same journal

Role and mechanism of AOPPs-induced NOX4-mediated ferroptosis in intervertebral disc degeneration.

Scientific reports·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jul 2, 2026

De novo Identification of Actively Translated Open Reading Frames with Ribosome Profiling Data

De novo Identification of Actively Translated Open Reading Frames with Ribosome Profiling Data

Published on: February 18, 2022

Reference protein-coding transcripts of human genes annotated using long-read transcriptome datasets.

Kuo-Feng Tung¹, Wen-Chang Lin²

¹Institute of Biomedical Sciences, Academia Sinica, 115, Taipei, Taiwan, R.O.C.

Scientific Reports

|July 1, 2026

Summary

This summary is machine-generated.

Long-read sequencing reveals dominant protein-coding transcripts (Ref-Tx) across 30 human tissues. This approach improves accuracy over short-read data for identifying key transcript isoforms and aids in understanding tissue-specific gene expression.

Keywords:

Dominant protein-coding transcripts Human protein-coding genes Long-read RNA-seq dataset MANE-select transcripts Wobble splicing transcripts

More Related Videos

Overexpressing Long Noncoding RNAs Using Gene-activating CRISPR

Overexpressing Long Noncoding RNAs Using Gene-activating CRISPR

Published on: March 1, 2019

A Rapid High-throughput Method for Mapping Ribonucleoproteins (RNPs) on Human pre-mRNA

A Rapid High-throughput Method for Mapping Ribonucleoproteins (RNPs) on Human pre-mRNA

Published on: December 2, 2009

Related Experiment Videos

Last Updated: Jul 2, 2026

De novo Identification of Actively Translated Open Reading Frames with Ribosome Profiling Data

De novo Identification of Actively Translated Open Reading Frames with Ribosome Profiling Data

Published on: February 18, 2022

Overexpressing Long Noncoding RNAs Using Gene-activating CRISPR

Overexpressing Long Noncoding RNAs Using Gene-activating CRISPR

Published on: March 1, 2019

A Rapid High-throughput Method for Mapping Ribonucleoproteins (RNPs) on Human pre-mRNA

A Rapid High-throughput Method for Mapping Ribonucleoproteins (RNPs) on Human pre-mRNA

Published on: December 2, 2009

Area of Science:

Genomics
Transcriptomics
Bioinformatics

Background:

Next-generation sequencing (NGS) suggests many alternatively spliced transcripts, but short-read data may overestimate this due to isoform ambiguity.
Accurate identification of tissue-specific expression profiles is essential for determining functional, translated peptide products.

Purpose of the Study:

To identify the most highly expressed protein-coding transcripts (Ref-Tx) in human genes using long-read sequencing data.
To assess the accuracy of dominant transcript isoform identification compared to existing annotation datasets.
To develop a bioinformatics tool for visualizing tissue-specific expression of dominant transcripts.

Main Methods:

Utilized the GSE192955 long-read nanopore sequencing dataset from 30 normal human tissues.
Identified 18,094 dominant representative protein-coding transcripts (Ref-Tx) from 18,557 human genes.
Compared identified Ref-Tx with MANE-select and APPRIS annotations.

Main Results:

Identified 18,094 dominant Ref-Tx from 18,557 genes using long-read data.
14,546 Ref-Tx matched MANE-select transcripts, indicating high concordance.
The long-read dataset showed better agreement with MANE genes and more topmost Ref-Tx compared to potential confounding factors.

Conclusions:

Long-read sequencing provides a more accurate assessment of dominant protein-coding transcripts than short-read data.
The identified Ref-Tx and the developed eCPG tool enhance the understanding of tissue-specific gene expression.
This work facilitates the accurate identification of bona fide translated peptide products.