Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Videos

Effective dimensionality of large-scale expression data using principal component analysis.

Michael Hörnquist1, John Hertz, Mattias Wahde

  • 1Nordita, Blegdamsvej 17, DK-2100 Copenhagen, Denmark. micho@itn.liu.se

Bio Systems
|June 19, 2002
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Glassy dynamics near the interpolation transition in deep recurrent networks.

Physical review. E·2025
Same author

An interpretable method for automated classification of spoken transcripts and written text.

Evolutionary intelligence·2023
Same author

Sleepy drivers on a slippery road: A pilot study using a driving simulator.

Journal of sleep research·2021
Same author

Lidar-Camera Semi-Supervised Learning for Semantic Segmentation.

Sensors (Basel, Switzerland)·2021
Same author

Lowered Expression of Tumor Suppressor Candidate MYO1C Stimulates Cell Proliferation, Suppresses Cell Adhesion and Activates AKT.

PloS one·2016
Same author

Cumulants of Hawkes point processes.

Physical review. E, Statistical, nonlinear, and soft matter physics·2015
Same journal

Ruliological Resilience: Pattern Restoration and Robustness in Wolfram Patterns. A Basis for Regeneration, Not Just in Cone Shells?

Bio Systems·2026
Same journal

The Quantum-to-Classical Transducer: A Thermodynamic and Quantum Mechanical Framework for the Emergence of Bioenergetics.

Bio Systems·2026
Same journal

Forward-backward gene expression binarization for boolean state inference over a known regulatory network.

Bio Systems·2026
Same journal

Partial-label metric ceilings for evaluating gene regulatory networks inferred from single-cell foundation models.

Bio Systems·2026
Same journal

The impedance mismatch theory: A non-equilibrium thermodynamic framework for a shared energetic stress pathway in neurodegeneration.

Bio Systems·2026
Same journal

Immune signal-status misclassification: A theoretical framework for biological status assignment and failed status resolution.

Bio Systems·2026
See all related articles

Principal component analysis helps estimate the effective dimensionality of large gene expression datasets. This approach accounts for noise and time-series dependencies, constraining model complexity for better regulatory network inference.

Area of Science:

  • Genomics
  • Systems Biology
  • Bioinformatics

Background:

  • High-throughput gene expression measurements yield vast datasets.
  • Extracting regulatory networks from this data is challenging due to model complexity and parameterization.
  • Over-fitting models is a significant risk with large-scale biological data.

Purpose of the Study:

  • To estimate the effective dimensionality of large-scale gene expression datasets.
  • To provide a method for constraining the complexity of models built from gene expression data.
  • To inform the inference of gene regulatory networks.

Main Methods:

  • Application of principal component analysis (PCA).
  • Utilizing linear additive models for data analysis.

Related Experiment Videos

  • Accounting for temporal dependencies in time-series data.
  • Addressing the impact of measurement noise on data dimensionality.
  • Main Results:

    • PCA provides a method to estimate the effective dimensionality of gene expression data.
    • Both time-series correlations and measurement noise reduce effective dimensionality.
    • This reduction in dimensionality helps constrain model complexity.

    Conclusions:

    • Estimating effective dimensionality is crucial for building robust models from gene expression data.
    • The proposed PCA-based approach offers a way to manage model complexity.
    • This facilitates more accurate gene regulatory network inference.