Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Residuals and Least-Squares Property01:11

Residuals and Least-Squares Property

The vertical distance between the actual value of y and the estimated value of y. In other words, it measures the vertical distance between the actual data point and the predicted point on the line
If the observed data point lies above the line, the residual is positive, and the line underestimates the actual data value for y. If the observed data point lies below the line, the residual is negative, and the line overestimates the actual data value for y.
The process of fitting the best-fit...
Gaussian Elimination: Problem Solving01:30

Gaussian Elimination: Problem Solving

Systems of linear equations in several variables are pivotal in modeling complex scenarios involving multiple unknowns and constraints. Such systems are widely used in various fields to represent relationships where several conditions must be simultaneously satisfied. Each variable in the system corresponds to an unknown quantity, while each equation imposes a linear constraint, leading to a structured approach for analyzing and solving real-world problems.A system of three equations with three...
Quadratic Models01:23

Quadratic Models

Quadratic models are mathematical representations used to describe relationships in which the rate of change changes at a constant rate. These models appear in a wide variety of natural and engineered systems, especially those involving motion, forces, and optimization. One common application is analyzing the vertical motion of objects influenced by gravity, such as a ball thrown into the air.In such scenarios, the object's height changes over time in a curved pattern, rising to a maximum point...
Multiple Regression01:25

Multiple Regression

Multiple regression assesses a linear relationship between one response or dependent variable and two or more independent variables. It has many practical applications.
Farmers can use multiple regression to determine the crop yield based on more than one factor, such as water availability, fertilizer, soil properties, etc. Here, the crop yield is the response or dependent variable as it depends on the other independent variables. The analysis requires the construction of a scatter plot...
Calibration Curves: Linear Least Squares01:20

Calibration Curves: Linear Least Squares

A calibration curve is a plot of the instrument's response against a series of known concentrations of a substance. This curve is used to set the instrument response levels, using the substance and its concentrations as standards. Alternatively, or additionally, an equation is fitted to the calibration curve plot and subsequently used to calculate the unknown concentrations of other samples reliably.
For data that follow a straight line, the standard method for fitting is the linear...
Linearization and Approximation01:26

Linearization and Approximation

Linearization is a mathematical technique used to approximate complex, nonlinear functions with simpler linear models in the vicinity of a chosen reference point. The method is based on the idea that, although a function may be difficult to evaluate exactly, its behavior near a specific input value can often be closely approximated by the tangent line at that point. This approach is particularly useful when small deviations from a known value are involved.Consider the square root function, for...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

MINTsC learns multi-way chromatin interactions from single cell high throughput chromatin conformation data.

Nature communications·2026
Same author

MINTsC learns multi-way chromatin interactions from single cell high throughput chromatin conformation data.

bioRxiv : the preprint server for biology·2026
Same author

Systematic background selection with BasCoD enhances contrastive dimension reduction in single cell genomics.

Nature communications·2026
Same author

Oncogenic DEAD-box ATPase DDX41 establishes transcript ensembles via CLK3-dependent and -independent mechanisms.

Nature communications·2025
Same author

Sex-Specific Differential DNA Methylation in Mild Cognitive Impairment and Alzheimer's Disease.

bioRxiv : the preprint server for biology·2025
Same author

Generation of Hoxa11-3XFLAG and Hoxd11-3XFLAG alleles to investigate Hox11 genome-wide binding.

Developmental biology·2025
Same journal

Simplifying debiased inference via automatic differentiation and probabilistic programming.

Journal of the Royal Statistical Society. Series B, Statistical methodology·2026
Same journal

Principal stratification with U-statistics under principal ignorability.

Journal of the Royal Statistical Society. Series B, Statistical methodology·2026
Same journal

Causal K-Means Clustering.

Journal of the Royal Statistical Society. Series B, Statistical methodology·2026
Same journal

Inference of dependency knowledge graph for Electronic Health Records.

Journal of the Royal Statistical Society. Series B, Statistical methodology·2026
Same journal

Correction to: Inference of dependency knowledge graph for Electronic Health Records.

Journal of the Royal Statistical Society. Series B, Statistical methodology·2026
Same journal

Harmonized Estimation of Subgroup-Specific Treatment Effects in Randomized Trials: The Use of External Control Data.

Journal of the Royal Statistical Society. Series B, Statistical methodology·2026
See all related articles

Related Experiment Video

Updated: Jun 16, 2026

O-cresol Concentration Online Measurement Based On Near Infrared Spectroscopy Via Partial Least Square Regression
06:50

O-cresol Concentration Online Measurement Based On Near Infrared Spectroscopy Via Partial Least Square Regression

Published on: November 8, 2019

Sparse partial least squares regression for simultaneous dimension reduction and variable selection.

Hyonho Chun1, Sündüz Keleş

  • 1University of Wisconsin Madison, USA.

Journal of the Royal Statistical Society. Series B, Statistical Methodology
|January 29, 2010
PubMed
Summary
This summary is machine-generated.

Partial least squares regression (PLSR) faces challenges with high-dimensional genomic data. A new sparse PLSR method improves predictive performance and variable selection for genomic analyses.

Related Experiment Videos

Last Updated: Jun 16, 2026

O-cresol Concentration Online Measurement Based On Near Infrared Spectroscopy Via Partial Least Square Regression
06:50

O-cresol Concentration Online Measurement Based On Near Infrared Spectroscopy Via Partial Least Square Regression

Published on: November 8, 2019

Area of Science:

  • Genomics
  • Statistical modeling
  • Bioinformatics

Background:

  • Partial least squares regression (PLSR) is a statistical method used for analyzing data with multicollinearity.
  • PLSR has gained traction in high-dimensional genomic data analysis.
  • Traditional PLSR's asymptotic consistency is questioned under the 'large p, small n' paradigm common in genomics.

Purpose of the Study:

  • To investigate the limitations of standard PLSR in high-dimensional genomic data.
  • To develop an improved PLSR method for enhanced predictive performance and variable selection.
  • To address the challenges of multicollinearity and the 'large p, small n' problem in genomic data analysis.

Main Methods:

  • Demonstrated that known asymptotic consistency of PLSR does not hold for univariate and multivariate responses under the 'large p, small n' conditions.
  • Proposed a sparse partial least squares (sPLS) formulation.
  • Developed an efficient implementation of sPLS regression.

Main Results:

  • Established that standard PLSR estimators are not consistently reliable for high-dimensional genomic data.
  • The proposed sPLS formulation achieves both good predictive performance and effective variable selection.
  • Simulation experiments showed sPLS outperforms existing variable selection and dimension reduction methods.

Conclusions:

  • Sparse partial least squares regression is a viable and effective method for analyzing high-dimensional genomic data.
  • sPLS offers a robust approach for joint analysis of gene expression and genomewide binding data.
  • The findings have significant implications for statistical genomics and bioinformatics research.