Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Multicompartment Models: Overview01:14

Multicompartment Models: Overview

435
Multicompartment models are mathematical constructs that depict how drugs are distributed and eliminated within the body. They segment the body into several compartments, symbolizing various physiological or anatomical areas connected through drug transfer processes such as absorption, metabolism, distribution, and elimination.
These models offer a more comprehensive representation of drug behavior in the body than one-compartment models. They accommodate the complexity of drug distribution,...
435
Gene Families01:57

Gene Families

9.7K
Gene families consist of groups of genes proposed to have originated from a common ancestor. Typically these arise through events in which a gene or genes are mistakenly duplicated during cell division. Unlike their parent genes (which are subject to selection pressure to maintain function), these gene copies do not need to preserve their sequences and may evolve at a relatively faster rate.
Occasionally these regions can be adapted to take on new roles within the organism, becoming novel genes...
9.7K
Improving Translational Accuracy02:07

Improving Translational Accuracy

13.9K
Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...
13.9K
Improving Translational Accuracy02:07

Improving Translational Accuracy

3.4K
3.4K
Regulation of Expression at Multiple Steps01:23

Regulation of Expression at Multiple Steps

1.3K
The gene expression in cells is regulated at different stages: (i) transcription, (ii) RNA processing, (iii) RNA localization, and (iv) translation. Transcriptional regulation is mediated by regulatory proteins such as transcription factors, activators, or repressors—these control gene expression by initiating or inhibiting the transcription of genes. Once a precursor or pre-mRNA is produced, it undergoes post-transcriptional modification, including 5' capping, splicing, and the...
1.3K
Combinatorial Gene Control02:33

Combinatorial Gene Control

9.4K
Combinatorial gene control is the synergistic action of several transcriptional factors to regulate the expression of a single gene. The absence of one or more of these factors may lead to a significant difference in the level of gene expression or repression.
The expression of more than 30,000 genes is controlled by approximately 2000-3000 transcription factors. This is possible because a single transcription factor can recognize more than one regulatory sequence. The specificity in gene...
9.4K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

On the state of protein function prediction: a report on the fourth CAFA challenge.

bioRxiv : the preprint server for biology·2026
Same author

Advances in Protein Function Prediction from the Fifth CAFA Challenge.

bioRxiv : the preprint server for biology·2026
Same author

Transcriptomic subtypes in high-grade serous ovarian cancer are driven by tumor cellular composition.

bioRxiv : the preprint server for biology·2026
Same author

Single-cell hit calling in high-content imaging screens with Buscar.

bioRxiv : the preprint server for biology·2026
Same author

The Common Fund Data Ecosystem (CFDE).

bioRxiv : the preprint server for biology·2026
Same author

WebGWAS: a web server for instant GWAS on arbitrary phenotypes.

BioData mining·2026
Same journal

Integrated lipidomic and transcriptomic profiling of the host response in human malaria.

Genome biology·2026
Same journal

Centromeric satellite expansion drives genome evolution in the snowy owl.

Genome biology·2026
Same journal

Mapping the landscape of allele-specific expression in porcine genomes.

Genome biology·2026
Same journal

Genomic sequence evolution underlying human neocortical interareal diversification.

Genome biology·2026
Same journal

Regulatory mechanisms driven by functional 3'-UTR variants in alcohol use disorder and related traits.

Genome biology·2026
Same journal

A longitudinal single-nucleus transcriptomic atlas of bovine placentation reveals dynamic cellular hierarchies and regulatory programs.

Genome biology·2026
See all related articles

Related Experiment Video

Updated: Dec 21, 2025

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches
09:47

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches

Published on: December 15, 2023

1.6K

Compressing gene expression data using multiple latent space dimensionalities learns complementary biological

Gregory P Way1,2,3, Michael Zietz2, Vincent Rubinetti2

  • 1Genomics and Computational Biology Graduate Group, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA.

Genome Biology
|May 13, 2020
PubMed
Summary
This summary is machine-generated.

Selecting a single latent dimensionality for gene expression data analysis limits biological discoveries. Combining features from multiple compression algorithms and dimensionalities enhances biological representations for deeper insights.

Keywords:
AutoencodersCompressionDimensionality reductionGene expressionLatent spaceMachine learningNeural network interpretation

More Related Videos

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

1.2K
Mining Spatial Transcriptomics Datasets using DeepSpaceDB
10:16

Mining Spatial Transcriptomics Datasets using DeepSpaceDB

Published on: September 5, 2025

544

Related Experiment Videos

Last Updated: Dec 21, 2025

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches
09:47

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches

Published on: December 15, 2023

1.6K
Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

1.2K
Mining Spatial Transcriptomics Datasets using DeepSpaceDB
10:16

Mining Spatial Transcriptomics Datasets using DeepSpaceDB

Published on: September 5, 2025

544

Area of Science:

  • Genomics
  • Bioinformatics
  • Computational Biology

Background:

  • Unsupervised compression algorithms analyze gene expression data to reveal hidden signals of variation.
  • Current methods require users to select a single latent space dimensionality, potentially limiting biological feature capture.
  • Most researchers use only one algorithm and latent dimensionality, risking incomplete biological insights.

Purpose of the Study:

  • To investigate how selecting a single latent dimensionality impacts the biological features captured from gene expression data.
  • To determine if using multiple algorithms and dimensionalities can improve biological discovery.
  • To assess the influence of latent space dimensionality on subsequent analyses of gene expression data.

Main Methods:

  • Gene expression data from adult normal, adult cancer, and pediatric cancer tissues were compressed.
  • Multiple unsupervised compression models were trained across a wide range of latent space dimensionalities.
  • Performance differences were observed, and associations with curated pathway gene sets were identified.

Main Results:

  • Intermediate latent dimensionalities revealed significant pathway gene set associations in denoising autoencoder and variational autoencoder models.
  • Combining compressed features from various algorithms and dimensionalities yielded the most comprehensive pathway-associated representations.
  • Models trained at different dimensionalities captured distinct biological signals, including sex, MYCN amplification, and cell types.
  • Tumor type was best captured at lower dimensionalities, while subtle pathway activities were identified at higher dimensionalities.

Conclusions:

  • No single latent dimensionality or compression algorithm is optimal for gene expression data analysis.
  • Utilizing features from diverse compression models and multiple latent space dimensionalities enhances biological representations.
  • A multi-faceted approach improves the depth and breadth of biological discoveries from gene expression data.