Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Alternative RNA Splicing02:18

Alternative RNA Splicing

25.2K
Alternative RNA splicing is the regulated splicing of exons and introns to produce different mature mRNAs from a single pre-mRNA. Unlike in constitutive splicing where a single gene produces a single type of mRNA, alternative splicing allows an organism to produce multiple proteins from a single gene and plays an important role in protein diversity.
There are five types of alternative RNA splicing that vary in the ways the pre-mRNA segments are removed or retained in the mature mRNA. The first...
25.2K
Alternative RNA Splicing02:18

Alternative RNA Splicing

5.2K
5.2K
RNA Splicing01:32

RNA Splicing

60.7K
Splicing is the process by which eukaryotic RNA is edited before its translation into protein. The RNA strand transcribed from eukaryotic DNA is called the primary transcript. The primary transcripts that become mRNAs are called precursor messenger RNAs (pre-mRNAs). Eukaryotic pre-mRNA contains alternating sequences of exons and introns. Exons are nucleotide sequences that code for proteins, whereas introns are the non-coding regions. In RNA splicing, introns are removed and exons are bonded...
60.7K
How Data are Classified: Numerical Data00:59

How Data are Classified: Numerical Data

38.1K
Data that are countable or measurable in specific units are called numerical or quantitative data. Quantitative data are always numbers. Quantitative data are the result of counting or measuring the attributes of a population. Amount of money, pulse rate, weight, number of people living in a town, and number of students who opt for statistics are examples of quantitative data.
Quantitative data may be either discrete or continuous. All quantitative data that take on only specific numerical...
38.1K
How Data are Classified: Categorical Data01:11

How Data are Classified: Categorical Data

45.0K
A variable, usually notated by capital letters such as X and Y, is a characteristic or measurement that can be determined for each member of a population. Data are the actual values of variables. They may be numbers, or they may be words. Datum is a single value.
Data are classified based on whether they are measurable or not. Categorical data cannot be measured; instead, it can be divided into categories. For example, if Y denotes a person's party affiliation, some examples of Y include...
45.0K
RNA-seq03:21

RNA-seq

12.1K
RNA sequencing, or RNA-Seq, is a high-throughput sequencing technology used to study the transcriptome of a cell. Transcriptomics helps to interpret the functional elements of a genome and identify the molecular constituents of an organism. Additionally, it also helps in understanding the development of an organism and the occurrence of diseases. 
Before the discovery of RNA-seq, microarray-based methods and Sanger sequencing were used for transcriptome analysis. However, while...
12.1K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Beyond gene length: Exon-intron architecture and isoform potential in the evolution of eukaryotic complexity.

bioRxiv : the preprint server for biology·2026
Same author

Real-time spatiotemporal tracking of infectious outbreaks in confined environments with a host-pathogen agent-based system.

Proceedings of the National Academy of Sciences of the United States of America·2026
Same author

A spatial long-read approach at near-single-cell resolution reveals developmental regulation of splicing and polyadenylation sites in distinct cortical layers and cell types.

Nature communications·2025
Same author

Detection and Tracking of SARS-CoV-2 Lineages through National Wastewater Surveillance System Pathogen Genomics.

Emerging infectious diseases·2025
Same author

Discovery Phase Agrochemical Predictive Safety Assessment Using High Content <i>In Vitro</i> Data to Estimate an <i>In Vivo</i> Toxicity Point of Departure.

Journal of agricultural and food chemistry·2024
Same author

Correction: The genome of the soybean cyst nematode (Heterodera glycines) reveals complex patterns of duplications involved in the evolution of parasitism genes.

BMC genomics·2024
Same journal

Optimized tRNA structure-seq reveals robust tRNA secondary structures in <i>S. cerevisiae</i> under mild stress conditions.

RNA (New York, N.Y.)·2026
Same journal

SERIPH: A Two-Step Extraction Protocol for Selective Enrichment of Semi-Extractable RNAs.

RNA (New York, N.Y.)·2026
Same journal

Reduced Sensitivity to RNA Structural Differences Distinguishes Eukaryotic Pus4 from Bacterial TruB.

RNA (New York, N.Y.)·2026
Same journal

Puf3 contributes to changes in mRNA solubility, translation elongation dynamics at rare arginine codons and loss of protein homeostasis in cells lacking Not4.

RNA (New York, N.Y.)·2026
Same journal

RBM38 Regulates HORMAD1 Splicing to Enhances MEK Inhibitor Sensitivity in Breast Cancer.

RNA (New York, N.Y.)·2026
Same journal

EF-P Inhibits Ribosomal α-Hydroxy Acid Incorporation: Strategic tRNA Body Selection for Co-incorporating α-Hydroxy Acids and Nonproteinogenic Amino Acids into Depsipeptides.

RNA (New York, N.Y.)·2026
See all related articles

Related Experiment Video

Updated: Feb 8, 2026

Identification of Alternative Splicing and Polyadenylation in RNA-seq Data
08:35

Identification of Alternative Splicing and Polyadenylation in RNA-seq Data

Published on: June 24, 2021

6.5K

Biological classification with RNA-seq data: Can alternatively spliced transcript expression enhance machine learning

Nathan T Johnson1, Andi Dhroso1, Katelyn J Hughes1

  • 1Worcester Polytechnic Institute, Bioinformatics and Computational Biology Program, Worcester, Massachusetts 01609, USA.

RNA (New York, N.Y.)
|June 27, 2018
PubMed
Summary
This summary is machine-generated.

Transcript-level RNA sequencing (RNA-seq) data, when analyzed with supervised learning, significantly improves biological classification accuracy compared to gene-level data. This approach offers powerful insights for diverse biomedical questions.

Keywords:
RNA-seqalternative splicingclassificationgene expressionmachine learning

More Related Videos

A Virtual Machine Platform for Non-Computer Professionals for Using Deep Learning to Classify Biological Sequences of Metagenomic Data
09:34

A Virtual Machine Platform for Non-Computer Professionals for Using Deep Learning to Classify Biological Sequences of Metagenomic Data

Published on: September 25, 2021

4.5K
Detection of Alternative Splicing During Epithelial-Mesenchymal Transition
11:48

Detection of Alternative Splicing During Epithelial-Mesenchymal Transition

Published on: October 9, 2014

13.4K

Related Experiment Videos

Last Updated: Feb 8, 2026

Identification of Alternative Splicing and Polyadenylation in RNA-seq Data
08:35

Identification of Alternative Splicing and Polyadenylation in RNA-seq Data

Published on: June 24, 2021

6.5K
A Virtual Machine Platform for Non-Computer Professionals for Using Deep Learning to Classify Biological Sequences of Metagenomic Data
09:34

A Virtual Machine Platform for Non-Computer Professionals for Using Deep Learning to Classify Biological Sequences of Metagenomic Data

Published on: September 25, 2021

4.5K
Detection of Alternative Splicing During Epithelial-Mesenchymal Transition
11:48

Detection of Alternative Splicing During Epithelial-Mesenchymal Transition

Published on: October 9, 2014

13.4K

Area of Science:

  • Genomics
  • Bioinformatics
  • Computational Biology

Background:

  • RNA sequencing (RNA-seq) is a powerful tool for quantifying gene expression, surpassing DNA microarrays in providing gene or transcript-level insights.
  • Analyzing RNA-seq data necessitates advanced data mining and analytical methods, with supervised learning gaining traction in biological applications.
  • Supervised learning methods are increasingly applied to RNA-seq data for biological classification tasks.

Purpose of the Study:

  • To assess the effectiveness of supervised learning methods trained on RNA-seq data for various biological classification tasks.
  • To determine if transcript-level expression data is more informative than gene-level data for biological classification.
  • To evaluate the performance of different normalization techniques and machine learning classifiers on RNA-seq data.

Main Methods:

  • Conducted a large-scale assessment involving 61 biological classification problems across three independent RNA-seq datasets.
  • Utilized over 2000 samples from multiple organisms, lab groups, and RNA-seq analysis pipelines.
  • Explored the performance of three normalization techniques and six machine learning classifiers for tasks including tissue type, sex, age, and cancer phenotype prediction.

Main Results:

  • Transcript-based supervised learning classifiers consistently outperformed or matched gene-based methods across all 61 classification problems.
  • Top-performing methods achieved near-perfect classification accuracy, highlighting the utility of transcript-level data.
  • The study demonstrated the robustness of supervised learning for RNA-seq data analysis across diverse biological contexts.

Conclusions:

  • Transcript-level expression data provides superior or equivalent information for biological classification compared to gene-level data when using supervised learning.
  • Supervised learning methods are highly effective for analyzing RNA-seq data, enabling accurate biological classifications.
  • This research validates the application of transcript-level RNA-seq analysis with machine learning for advancing biological and biomedical research.