Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Ribosome Profiling02:24

Ribosome Profiling

Ribosome profiling or ribo-sequencing is a deep sequencing technique that produces a snapshot of active translation in a cell. It selectively sequences the mRNAs protected by ribosomes to get an insight into a cell’s translation landscape at any given point in time.
Applications of ribosome profiling
Ribosome profiling has many applications, including in vivo monitoring of translation inside a particular organ or tissue type and quantifying new protein synthesis levels.
The technique helps...
lncRNA - Long Non-coding RNAs02:39

lncRNA - Long Non-coding RNAs

In humans, more than 80% of the genome gets transcribed. However, only around 2% of the genome codes for proteins. The remaining part produces non-coding RNAs which includes ribosomal RNAs, transfer RNAs, telomerase RNAs, and regulatory RNAs, among other types. A large number of regulatory non-coding RNAs have been classified into two groups depending upon their length – small non-coding RNAs, such as microRNA, which are less than 200 nucleotides in length, and long non-coding RNA (lncRNA)...
lncRNA - Long Non-coding RNAs02:39

lncRNA - Long Non-coding RNAs

In humans, more than 80% of the genome gets transcribed. However, only around 2% of the genome codes for proteins. The remaining part produces non-coding RNAs which includes ribosomal RNAs, transfer RNAs, telomerase RNAs, and regulatory RNAs, among other types. A large number of regulatory non-coding RNAs have been classified into two groups depending upon their length – small non-coding RNAs, such as microRNA, which are less than 200 nucleotides in length, and long non-coding RNA (lncRNA)...
Genome Annotation and Assembly03:36

Genome Annotation and Assembly

The genome refers to all of the genetic material in an organism. It can range from a few million base pairs in microbial cells to several billion base pairs in many eukaryotic organisms. Genome assembly refers to the process of taking the DNA sequencing data and putting it all back together in a correct order to create a close representation of the original genome. This is followed by the identification of functional elements on the newly assembled genome, a process called genome annotation.
RNA-seq03:21

RNA-seq

RNA sequencing, or RNA-Seq, is a high-throughput sequencing technology used to study the transcriptome of a cell. Transcriptomics helps to interpret the functional elements of a genome and identify the molecular constituents of an organism. Additionally, it also helps in understanding the development of an organism and the occurrence of diseases. 
Before the discovery of RNA-seq, microarray-based methods and Sanger sequencing were used for transcriptome analysis. However, while microarray-based...
Multi-species Conserved Sequences02:51

Multi-species Conserved Sequences

Next-generation sequencing technologies have created large genomic databases of a variety of animals and plants. Ever since the human genome project was completed, scientists studied the genome of primates, mammals, and other phylogenetically distant living beings. Such large-scale  studies have provided new insights into the evolutionary relationship between organisms.
Although the genome of each species varies greatly from each other, a few sequences are highly conserved. Such conserved DNA...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

A visualization tool for individual gene expression profiles among males and females in GTEx tissues.

Biology of sex differences·2025
Same author

Housekeeping protein-coding genes interrogated with tissue and individual variations.

Scientific reports·2024
Same author

TEx-MST: tissue expression profiles of MANE select transcripts.

Database : the journal of biological databases and curation·2022
Same author

Dominant transcript expression profiles of human protein-coding genes interrogated with GTEx dataset.

Scientific reports·2022
Same author

Top-ranked expressed gene transcripts of human protein-coding genes investigated with GTEx dataset.

Scientific reports·2020
Same author

Overlapping protein-coding genes in human genome and their coincidental expression in tissues.

Scientific reports·2019
Same journal

Application of ephrin-B2 loaded glycol chitosan-silk fibroin hydrogel in the treatment of diabetic refractory wounds.

Scientific reports·2026
Same journal

International expert Delphi consensus on thromboprophylaxis in metabolic and bariatric surgery.

Scientific reports·2026
Same journal

Assessing the cross-region knowledge transfer capability of selected deep learning building vectorization methods in the context of available training datasets.

Scientific reports·2026
Same journal

Feasibility and preliminary effects of outdoor versus indoor cognitive-motor therapy in women with Alzheimer's disease: A randomized single-blind pilot study.

Scientific reports·2026
Same journal

Hallmarks of social action in the vocal turn-taking of wild common marmosets (Callithrix jacchus).

Scientific reports·2026
Same journal

Role and mechanism of AOPPs-induced NOX4-mediated ferroptosis in intervertebral disc degeneration.

Scientific reports·2026
See all related articles

Related Experiment Video

Updated: Jul 2, 2026

De novo Identification of Actively Translated Open Reading Frames with Ribosome Profiling Data
08:23

De novo Identification of Actively Translated Open Reading Frames with Ribosome Profiling Data

Published on: February 18, 2022

Reference protein-coding transcripts of human genes annotated using long-read transcriptome datasets.

Kuo-Feng Tung1, Wen-Chang Lin2

  • 1Institute of Biomedical Sciences, Academia Sinica, 115, Taipei, Taiwan, R.O.C.

Scientific Reports
|July 1, 2026
PubMed
Summary
This summary is machine-generated.

Long-read sequencing reveals dominant protein-coding transcripts (Ref-Tx) across 30 human tissues. This approach improves accuracy over short-read data for identifying key transcript isoforms and aids in understanding tissue-specific gene expression.

Keywords:
Dominant protein-coding transcriptsHuman protein-coding genesLong-read RNA-seq datasetMANE-select transcriptsWobble splicing transcripts

More Related Videos

Overexpressing Long Noncoding RNAs Using Gene-activating CRISPR
13:04

Overexpressing Long Noncoding RNAs Using Gene-activating CRISPR

Published on: March 1, 2019

A Rapid High-throughput Method for Mapping Ribonucleoproteins (RNPs) on Human pre-mRNA
13:00

A Rapid High-throughput Method for Mapping Ribonucleoproteins (RNPs) on Human pre-mRNA

Published on: December 2, 2009

Related Experiment Videos

Last Updated: Jul 2, 2026

De novo Identification of Actively Translated Open Reading Frames with Ribosome Profiling Data
08:23

De novo Identification of Actively Translated Open Reading Frames with Ribosome Profiling Data

Published on: February 18, 2022

Overexpressing Long Noncoding RNAs Using Gene-activating CRISPR
13:04

Overexpressing Long Noncoding RNAs Using Gene-activating CRISPR

Published on: March 1, 2019

A Rapid High-throughput Method for Mapping Ribonucleoproteins (RNPs) on Human pre-mRNA
13:00

A Rapid High-throughput Method for Mapping Ribonucleoproteins (RNPs) on Human pre-mRNA

Published on: December 2, 2009

Area of Science:

  • Genomics
  • Transcriptomics
  • Bioinformatics

Background:

  • Next-generation sequencing (NGS) suggests many alternatively spliced transcripts, but short-read data may overestimate this due to isoform ambiguity.
  • Accurate identification of tissue-specific expression profiles is essential for determining functional, translated peptide products.

Purpose of the Study:

  • To identify the most highly expressed protein-coding transcripts (Ref-Tx) in human genes using long-read sequencing data.
  • To assess the accuracy of dominant transcript isoform identification compared to existing annotation datasets.
  • To develop a bioinformatics tool for visualizing tissue-specific expression of dominant transcripts.

Main Methods:

  • Utilized the GSE192955 long-read nanopore sequencing dataset from 30 normal human tissues.
  • Identified 18,094 dominant representative protein-coding transcripts (Ref-Tx) from 18,557 human genes.
  • Compared identified Ref-Tx with MANE-select and APPRIS annotations.

Main Results:

  • Identified 18,094 dominant Ref-Tx from 18,557 genes using long-read data.
  • 14,546 Ref-Tx matched MANE-select transcripts, indicating high concordance.
  • The long-read dataset showed better agreement with MANE genes and more topmost Ref-Tx compared to potential confounding factors.

Conclusions:

  • Long-read sequencing provides a more accurate assessment of dominant protein-coding transcripts than short-read data.
  • The identified Ref-Tx and the developed eCPG tool enhance the understanding of tissue-specific gene expression.
  • This work facilitates the accurate identification of bona fide translated peptide products.