Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Residuals and Least-Squares Property

Residuals and Least-Squares Property

The vertical distance between the actual value of y and the estimated value of y. In other words, it measures the vertical distance between the actual data point and the predicted point on the line
If the observed data point lies above the line, the residual is positive, and the line underestimates the actual data value for y. If the observed data point lies below the line, the residual is negative, and the line overestimates the actual data value for y.
The process of fitting the best-fit...

Classification of Systems-II

Classification of Systems-II

Continuous-time systems have continuous input and output signals, with time measured continuously. These systems are generally defined by differential or algebraic equations. For instance, in an RC circuit, the relationship between input and output voltage is expressed through a differential equation derived from Ohm's law and the capacitor relation,

Classification of Systems-I

Classification of Systems-I

Linearity is a system property characterized by a direct input-output relationship, combining homogeneity and additivity.
Homogeneity dictates that if an input x(t) is multiplied by a constant c, the output y(t) is multiplied by the same constant. Mathematically, this is expressed as:

Classification of Signals

Classification of Signals

In signal processing, signals are classified based on various characteristics: continuous-time versus discrete-time, periodic versus aperiodic, analog versus digital, and causal versus noncausal. Each category highlights distinct properties crucial for understanding and manipulating signals.
A continuous-time signal holds a value at every instant in time, representing information seamlessly. In contrast, a discrete-time signal holds values only at specific moments, often denoted as x(n), where...

Aggregates Classification

Aggregates Classification

Aggregate classification is generally based on its size, petrographic characteristics, weight, and source. Size classification ranges from coarse to fine aggregates, defined by the size of the particles. Coarse aggregates are particles that do not pass through ASTM sieve No. 4, and aggregates that pass through the sieve are fine aggregates.
Petrographic classification groups aggregates based on common mineralogical characteristics. Some of the common mineral groups found in aggregates are...

Calibration Curves: Linear Least Squares

Calibration Curves: Linear Least Squares

A calibration curve is a plot of the instrument's response against a series of known concentrations of a substance. This curve is used to set the instrument response levels, using the substance and its concentrations as standards. Alternatively, or additionally, an equation is fitted to the calibration curve plot and subsequently used to calculate the unknown concentrations of other samples reliably.
For data that follow a straight line, the standard method for fitting is the linear...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Disease-dependent airway epithelial responses to acute electronic cigarette aerosol exposure: a pilot single-cell analysis.

Toxicological sciences : an official journal of the Society of Toxicology·2026

Same author

MINTsC learns multi-way chromatin interactions from single cell high throughput chromatin conformation data.

Nature communications·2026

Same author

CN-RNN: a Deep Learning Framework for Copy Number Variation Detection with Exome Sequencing Data.

bioRxiv : the preprint server for biology·2026

Same author

MINTsC learns multi-way chromatin interactions from single cell high throughput chromatin conformation data.

bioRxiv : the preprint server for biology·2026

Same author

Systematic background selection with BasCoD enhances contrastive dimension reduction in single cell genomics.

Nature communications·2026

Same author

AI for pathologists: a universal lymph node metastasis detection app that enhances efficiency while preserving diagnostic accuracy.

The journal of pathology. Clinical research·2026

Same journal

Balanced mediated pathway detection in genomic data.

Statistical applications in genetics and molecular biology·2026

Same journal

Annealed variational mixtures for disease subtyping and biomarker discovery.

Statistical applications in genetics and molecular biology·2026

Same journal

Performance of the permutation test approach with base calling errors for detecting changes in variant allele frequencies in ctDNA for a single patient.

Statistical applications in genetics and molecular biology·2026

Same journal

BLOG: Bayesian longitudinal omics with group constraints.

Statistical applications in genetics and molecular biology·2026

Same journal

AI-driven risk prediction and categorization in cystic fibrosis leveraging AttentiveLSTM and Fox Wolf Optimizer.

Statistical applications in genetics and molecular biology·2026

Same journal

Perfect collinearity not created equal: measuring and visualizing the severity of multi-collinearity of modern omics data.

Statistical applications in genetics and molecular biology·2026

See all related articles

Search research articles

Related Experiment Videos

Sparse partial least squares classification for high dimensional data.

Dongjun Chung¹, Sunduz Keles

¹University of Wisconsin, Madison, WI, USA. chungdon@stat.wisc.edu

Statistical Applications in Genetics and Molecular Biology

|April 6, 2010

Summary

This summary is machine-generated.

Sparse Partial Least Squares (SPLS) enhances genome biology classification by enabling simultaneous variable selection and dimension reduction. SPLS integrated with generalized linear models (GLM) improves tumor classification accuracy, especially with unbalanced data.

Related Experiment Videos

Area of Science:

Genomics
Bioinformatics
Statistical Learning

Background:

Partial Least Squares (PLS) is a dimension reduction technique applied to high-dimensional data.
Recent adaptations of PLS for genome biology classification face challenges with variable selection.
Sparse methods are needed to simultaneously reduce dimensions and select relevant variables.

Purpose of the Study:

To develop sparse versions of two PLS-based classification methods using Sparse Partial Least Squares (SPLS).
To evaluate the variable selection properties and classification performance of SPLS methods.
To compare SPLS approaches against existing methods in high-dimensional genomic datasets.

Main Methods:

Implementation of Sparse Partial Least Squares (SPLS) for binary and multicategory classification.
Integration of SPLS within a Generalized Linear Model (GLM) framework.
Analytical and simulation-based evaluations on publicly available tumor classification datasets.

Main Results:

SPLS integrated with GLM shows higher sensitivity for variable selection in multicategory classification with unbalanced samples.
The two-stage SPLS approach offers comparable sensitivity and better specificity as sample size increases.
For balanced datasets, the two-stage approach matches GLM performance in variable selection and prediction accuracy, with improved computational efficiency.

Conclusions:

SPLS provides a robust framework for dimension reduction and variable selection in high-dimensional genomic classification.
The choice between GLM and two-stage SPLS depends on data characteristics (e.g., sample balance) and desired outcomes (e.g., sensitivity vs. specificity).
SPLS methods offer computational advantages, particularly the two-stage approach, for genomic data analysis.