Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Residuals and Least-Squares Property01:11

Residuals and Least-Squares Property

The vertical distance between the actual value of y and the estimated value of y. In other words, it measures the vertical distance between the actual data point and the predicted point on the line
If the observed data point lies above the line, the residual is positive, and the line underestimates the actual data value for y. If the observed data point lies below the line, the residual is negative, and the line overestimates the actual data value for y.
The process of fitting the best-fit...
Classification of Systems-II01:31

Classification of Systems-II

Continuous-time systems have continuous input and output signals, with time measured continuously. These systems are generally defined by differential or algebraic equations. For instance, in an RC circuit, the relationship between input and output voltage is expressed through a differential equation derived from Ohm's law and the capacitor relation,
Classification of Systems-I01:26

Classification of Systems-I

Linearity is a system property characterized by a direct input-output relationship, combining homogeneity and additivity.
Homogeneity dictates that if an input x(t) is multiplied by a constant c, the output y(t) is multiplied by the same constant. Mathematically, this is expressed as:
Classification of Signals01:30

Classification of Signals

In signal processing, signals are classified based on various characteristics: continuous-time versus discrete-time, periodic versus aperiodic, analog versus digital, and causal versus noncausal. Each category highlights distinct properties crucial for understanding and manipulating signals.
A continuous-time signal holds a value at every instant in time, representing information seamlessly. In contrast, a discrete-time signal holds values only at specific moments, often denoted as x(n), where...
Aggregates Classification01:29

Aggregates Classification

Aggregate classification is generally based on its size, petrographic characteristics, weight, and source. Size classification ranges from coarse to fine aggregates, defined by the size of the particles. Coarse aggregates are particles that do not pass through ASTM sieve No. 4, and aggregates that pass through the sieve are fine aggregates.
Petrographic classification groups aggregates based on common mineralogical characteristics. Some of the common mineral groups found in aggregates are...
Calibration Curves: Linear Least Squares01:20

Calibration Curves: Linear Least Squares

A calibration curve is a plot of the instrument's response against a series of known concentrations of a substance. This curve is used to set the instrument response levels, using the substance and its concentrations as standards. Alternatively, or additionally, an equation is fitted to the calibration curve plot and subsequently used to calculate the unknown concentrations of other samples reliably.
For data that follow a straight line, the standard method for fitting is the linear...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Disease-dependent airway epithelial responses to acute electronic cigarette aerosol exposure: a pilot single-cell analysis.

Toxicological sciences : an official journal of the Society of Toxicology·2026
Same author

MINTsC learns multi-way chromatin interactions from single cell high throughput chromatin conformation data.

Nature communications·2026
Same author

CN-RNN: a Deep Learning Framework for Copy Number Variation Detection with Exome Sequencing Data.

bioRxiv : the preprint server for biology·2026
Same author

MINTsC learns multi-way chromatin interactions from single cell high throughput chromatin conformation data.

bioRxiv : the preprint server for biology·2026
Same author

Systematic background selection with BasCoD enhances contrastive dimension reduction in single cell genomics.

Nature communications·2026
Same author

AI for pathologists: a universal lymph node metastasis detection app that enhances efficiency while preserving diagnostic accuracy.

The journal of pathology. Clinical research·2026
Same journal

Balanced mediated pathway detection in genomic data.

Statistical applications in genetics and molecular biology·2026
Same journal

Annealed variational mixtures for disease subtyping and biomarker discovery.

Statistical applications in genetics and molecular biology·2026
Same journal

Performance of the permutation test approach with base calling errors for detecting changes in variant allele frequencies in ctDNA for a single patient.

Statistical applications in genetics and molecular biology·2026
Same journal

BLOG: Bayesian longitudinal omics with group constraints.

Statistical applications in genetics and molecular biology·2026
Same journal

AI-driven risk prediction and categorization in cystic fibrosis leveraging AttentiveLSTM and Fox Wolf Optimizer.

Statistical applications in genetics and molecular biology·2026
Same journal

Perfect collinearity not created equal: measuring and visualizing the severity of multi-collinearity of modern omics data.

Statistical applications in genetics and molecular biology·2026
See all related articles

Related Experiment Videos

Sparse partial least squares classification for high dimensional data.

Dongjun Chung1, Sunduz Keles

  • 1University of Wisconsin, Madison, WI, USA. chungdon@stat.wisc.edu

Statistical Applications in Genetics and Molecular Biology
|April 6, 2010
PubMed
Summary
This summary is machine-generated.

Sparse Partial Least Squares (SPLS) enhances genome biology classification by enabling simultaneous variable selection and dimension reduction. SPLS integrated with generalized linear models (GLM) improves tumor classification accuracy, especially with unbalanced data.

Related Experiment Videos

Area of Science:

  • Genomics
  • Bioinformatics
  • Statistical Learning

Background:

  • Partial Least Squares (PLS) is a dimension reduction technique applied to high-dimensional data.
  • Recent adaptations of PLS for genome biology classification face challenges with variable selection.
  • Sparse methods are needed to simultaneously reduce dimensions and select relevant variables.

Purpose of the Study:

  • To develop sparse versions of two PLS-based classification methods using Sparse Partial Least Squares (SPLS).
  • To evaluate the variable selection properties and classification performance of SPLS methods.
  • To compare SPLS approaches against existing methods in high-dimensional genomic datasets.

Main Methods:

  • Implementation of Sparse Partial Least Squares (SPLS) for binary and multicategory classification.
  • Integration of SPLS within a Generalized Linear Model (GLM) framework.
  • Analytical and simulation-based evaluations on publicly available tumor classification datasets.

Main Results:

  • SPLS integrated with GLM shows higher sensitivity for variable selection in multicategory classification with unbalanced samples.
  • The two-stage SPLS approach offers comparable sensitivity and better specificity as sample size increases.
  • For balanced datasets, the two-stage approach matches GLM performance in variable selection and prediction accuracy, with improved computational efficiency.

Conclusions:

  • SPLS provides a robust framework for dimension reduction and variable selection in high-dimensional genomic classification.
  • The choice between GLM and two-stage SPLS depends on data characteristics (e.g., sample balance) and desired outcomes (e.g., sensitivity vs. specificity).
  • SPLS methods offer computational advantages, particularly the two-stage approach, for genomic data analysis.