Asymptotic conditional singular value decomposition for high-dimensional genomic data.

Jeffrey T Leek¹

¹Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland 21205-2179, USA. jleek@jhsph.edu

June 22, 2010

Summary

This summary is machine-generated.

This study identifies latent factors in high-dimensional genomic data using a conditional factor model. A new method consistently estimates the number of factors, improving analysis of gene expression and other complex biological datasets.

Area of Science:

Genomics
Statistical Modeling
Bioinformatics

Background:

Genomic data (e.g., gene expression, sequencing) are high-dimensional: many features measured on comparatively few samples.
Identifying factors associated with multiple features is crucial for genomic analysis.
Determining the number of factors is essential for unsupervised learning methods like clustering.

Purpose of the Study:

To develop methods for identifying and estimating latent factors in high-dimensional genomic data.
To propose a consistent estimator for the dimension of conditional factor models.
To provide a practical approach for selecting the number of factors in real-world datasets.

Main Methods:

Utilized a conditional factor model for genomic data analysis.
Established that the right singular vectors are asymptotically consistent estimators of the latent factors.
Developed a scaled eigen-decomposition method for dimension estimation.
Employed the dependence kernel approach for practical factor selection.
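
The first two steps above (condition on the known covariates, then read latent factors off the right singular vectors) can be sketched in a few lines of NumPy. This is a minimal illustration under stated assumptions, not the author's implementation: the function names `residualize` and `conditional_svd`, the additive model used to simulate the data, and every size and distribution in the toy example are hypothetical.

```python
import numpy as np


def residualize(X, S):
    """Remove the part of each row of X explained by the known covariates S.

    X is features x samples (m x n); S is covariates x samples (d x n).
    """
    hat = np.linalg.pinv(S) @ S           # n x n projection onto the row space of S
    return X - X @ hat


def conditional_svd(X, S, r):
    """Return the top-r right singular vectors of X after conditioning on S."""
    R = residualize(X, S)
    _, _, Vt = np.linalg.svd(R, full_matrices=False)
    return Vt[:r]                          # r x n, one row per estimated factor


# Toy data: all sizes and distributions below are illustrative assumptions.
rng = np.random.default_rng(0)
m, n = 5000, 20                            # many features, few samples
S = np.vstack([np.ones(n), np.repeat([0.0, 1.0], n // 2)])  # intercept + group
G = rng.standard_normal((1, n))            # one unobserved factor (e.g. batch)
B = rng.standard_normal((m, 2))            # effects of the known covariates
Gamma = rng.standard_normal((m, 1))        # effects of the latent factor
X = B @ S + Gamma @ G + rng.standard_normal((m, n))

G_hat = conditional_svd(X, S, r=1)

# Only the part of G orthogonal to S is identifiable, so compare against that.
G_perp = residualize(G, S)[0]
alignment = abs(G_hat[0] @ G_perp) / np.linalg.norm(G_perp)
print(f"|cosine| between estimated and true factor: {alignment:.3f}")
```

The comparison uses the absolute cosine because singular vectors are only determined up to sign.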

Main Results:

Right singular vectors consistently estimate the unobserved latent factors as the number of features grows with the sample size held fixed.
A novel, consistent estimator for the conditional factor model dimension was proposed.
Demonstrated utility in capturing batch effects in microarray data.
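
To make the idea of estimating the model dimension from a scaled eigen-decomposition concrete, the sketch below counts how many eigenvalues of the feature-scaled residual cross-product stand above a noise floor. It is a simple thresholding heuristic in that spirit, not the estimator proposed in the study: the function name `estimate_num_factors`, the median-based noise estimate, the cutoff `n * m**(-eta)`, and the simulated data are all assumptions made for illustration.

```python
import numpy as np


def estimate_num_factors(X, S, eta=1.0 / 3.0):
    """Count eigenvalues of the scaled residual cross-product above a noise floor.

    Assumptions of this sketch (not taken from the source): the noise variance
    is estimated by the median nonzero eigenvalue, and the cutoff is
    n * m**(-eta).
    """
    m, n = X.shape
    d = np.linalg.matrix_rank(S)
    R = X - X @ (np.linalg.pinv(S) @ S)    # residuals after removing S
    W = R.T @ R / m                        # n x n, scaled by the number of features
    lam = np.linalg.eigvalsh(W)[::-1][: n - d]  # drop the d (near-)zero eigenvalues
    sigma2 = np.median(lam)                # crude noise-variance estimate
    cutoff = n * m ** (-eta)               # shrinks as m grows
    return int(np.sum(lam - sigma2 >= cutoff))


# Toy data with two latent factors; sizes and distributions are assumptions.
rng = np.random.default_rng(1)
m, n, r_true = 5000, 20, 2
S = np.vstack([np.ones(n), np.repeat([0.0, 1.0], n // 2)])  # intercept + group
G = rng.standard_normal((r_true, n))       # unobserved factors
B = rng.standard_normal((m, 2))
Gamma = rng.standard_normal((m, r_true))
X = B @ S + Gamma @ G + rng.standard_normal((m, n))

r_hat = estimate_num_factors(X, S)
print(f"estimated number of latent factors: {r_hat} (simulated with {r_true})")
```

Dividing by the number of features keeps the noise eigenvalues near the noise variance as features are added, which is what lets a shrinking cutoff separate them from the factor eigenvalues in this toy setting.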

Conclusions:

The proposed methods provide consistent estimation of latent factors and model dimension in high-dimensional genomic data.
This work enhances the analysis of complex genomic datasets, including the identification of unmodeled effects.
The findings are applicable to various genomic data types, including gene expression, SNPs, and methylation data.