Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Improving Translational Accuracy02:07

Improving Translational Accuracy

11.7K
Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...
11.7K
Genome-wide Association Studies-GWAS01:11

Genome-wide Association Studies-GWAS

13.6K
Genome-wide association studies or GWAS are used to identify whether common SNPs are associated with certain diseases. Suppose specific SNPs are more frequently observed in individuals with a particular disease than those without the disease. In that case, those SNPs are said to be associated with the disease. Chi-square analysis is performed to check the probability of the allele likely to be associated with the disease.
GWAS does not require the identification of the target gene involved in...
13.6K
Types of Genetic Transfer Between Organisms02:18

Types of Genetic Transfer Between Organisms

5.4K
5.4K
Genome Size and the Evolution of New Genes03:21

Genome Size and the Evolution of New Genes

2.5K
2.5K
Gene Conversion02:08

Gene Conversion

2.4K
2.4K
Evolutionary Relationships through Genome Comparisons02:54

Evolutionary Relationships through Genome Comparisons

5.8K
Genome comparison is one of the excellent ways to interpret the evolutionary relationships between organisms. The basic principle of genome comparison is that if two species share a common feature, it is likely encoded by the DNA sequence conserved between both species. The advent of genome sequencing technologies in the late 20th century enabled scientists to understand the concept of conservation of domains between species and helped them to deduce evolutionary relationships across diverse...
5.8K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Learning inherent genetic patterns and trait associations with deep generative models for discrete genotype simulation.

GigaScience·2026
Same author

Self-supervised representation learning on gene expression data.

Bioinformatics (Oxford, England)·2025
Same author

CrossAttOmics: multiomics data integration with cross-attention.

Bioinformatics (Oxford, England)·2025
Same author

AttOmics: attention-based architecture for diagnosis and prognosis from omics data.

Bioinformatics (Oxford, England)·2023
Same author

A3SOM, abstained explainable semi-supervised neural network based on self-organizing map.

PloS one·2023
Same author

Assessment of deep learning and transfer learning for cancer prediction based on gene expression data.

BMC bioinformatics·2022
Same journal

conMItion: an R package adjusting confounding factors for associations in multi-omics.

Bioinformatics (Oxford, England)·2026
Same journal

SpaMFG: a Spatial Multi-omics Integration Method based on Feature Grouping.

Bioinformatics (Oxford, England)·2026
Same journal

CSCN: Inference of Cell-Specific Causal Networks Using Single-Cell RNA-Seq Data.

Bioinformatics (Oxford, England)·2026
Same journal

Sparse CCA-Based Mediation Analysis with High-Dimensional Exposures and Mediators.

Bioinformatics (Oxford, England)·2026
Same journal

Enhancing Cross-Context Generalization in Drug Perturbation Prediction with a Multimodal Conditional Diffusion Framework.

Bioinformatics (Oxford, England)·2026
Same journal

Primer Design through Submodular Function Estimation.

Bioinformatics (Oxford, England)·2026
See all related articles

Related Experiment Video

Updated: Jul 25, 2025

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

808

GAN-based data augmentation for transcriptomics: survey and comparative assessment.

Alice Lacan1, Michèle Sebag2, Blaise Hanczar1

  • 1IBISC, University Paris-Saclay (Univ. Evry), Evry 91000, France.

Bioinformatics (Oxford, England)
|June 30, 2023
PubMed
Summary
This summary is machine-generated.

Generative adversarial networks (GANs) enhance transcriptomics data augmentation for improved cancer phenotype classification. GAN-based augmentation significantly boosts accuracy, especially with limited RNA-sequencing samples.

More Related Videos

Leveraging CyVerse Resources for De Novo Comparative Transcriptomics of Underserved Non-model Organisms
10:41

Leveraging CyVerse Resources for De Novo Comparative Transcriptomics of Underserved Non-model Organisms

Published on: May 9, 2017

9.3K
A Protocol for Using Gene Set Enrichment Analysis to Identify the Appropriate Animal Model for Translational Research
09:35

A Protocol for Using Gene Set Enrichment Analysis to Identify the Appropriate Animal Model for Translational Research

Published on: August 16, 2017

17.9K

Related Experiment Videos

Last Updated: Jul 25, 2025

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

808
Leveraging CyVerse Resources for De Novo Comparative Transcriptomics of Underserved Non-model Organisms
10:41

Leveraging CyVerse Resources for De Novo Comparative Transcriptomics of Underserved Non-model Organisms

Published on: May 9, 2017

9.3K
A Protocol for Using Gene Set Enrichment Analysis to Identify the Appropriate Animal Model for Translational Research
09:35

A Protocol for Using Gene Set Enrichment Analysis to Identify the Appropriate Animal Model for Translational Research

Published on: August 16, 2017

17.9K

Area of Science:

  • Computational Biology
  • Bioinformatics
  • Machine Learning in Genomics

Background:

  • High-throughput sequencing generates vast transcriptomics data, but data scarcity limits deep learning for phenotype prediction.
  • Data augmentation artificially expands training sets, but transcriptomic transformations remain undefined.
  • Generative Adversarial Networks (GANs) offer a potential solution for generating synthetic transcriptomic samples.

Purpose of the Study:

  • To analyze Generative Adversarial Network (GAN)-based data augmentation strategies for transcriptomics.
  • To evaluate the impact of GANs on cancer phenotype classification performance.
  • To assess the quality and utility of GAN-generated transcriptomic data.

Main Methods:

  • Employed Generative Adversarial Networks (GANs) to generate augmented transcriptomic data.
  • Trained binary and multiclass classifiers on original and augmented RNA-sequencing datasets.
  • Evaluated classification performance using accuracy metrics and analyzed GAN-generated data quality.

Main Results:

  • Augmentation strategies significantly improved classification performance, increasing accuracy from 94% to 98% (binary) and 70% to 94% (multiclass).
  • Training classifiers with 1000 augmented samples showed substantial gains compared to using only 50 original samples.
  • Richer GAN architectures and more extensive training yielded better augmentation performance and data quality.

Conclusions:

  • GAN-based data augmentation is a powerful strategy to overcome data scarcity in transcriptomics.
  • This approach substantially enhances deep learning model performance for cancer phenotype classification.
  • Multiple performance indicators are necessary for a comprehensive assessment of generated data quality.