Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Quadratic Models

Quadratic Models

Quadratic models are mathematical representations used to describe relationships in which the rate of change changes at a constant rate. These models appear in a wide variety of natural and engineered systems, especially those involving motion, forces, and optimization. One common application is analyzing the vertical motion of objects influenced by gravity, such as a ball thrown into the air.In such scenarios, the object's height changes over time in a curved pattern, rising to a maximum point...

Model Approaches for Pharmacokinetic Data: Distributed Parameter Models

Model Approaches for Pharmacokinetic Data: Distributed Parameter Models

Pharmacokinetic models are mathematical constructs that represent and predict the time course of drug concentrations in the body, providing meaningful pharmacokinetic parameters. These models are categorized into compartment, physiological, and distributed parameter models.
The distributed parameter models are specifically designed to account for variations and differences in some drug classes. This model is particularly useful for assessing regional concentrations of anticancer or...

Linear Approximation in Frequency Domain

Linear Approximation in Frequency Domain

Linear systems are characterized by two main properties: superposition and homogeneity. Superposition allows the response to multiple inputs to be the sum of the responses to each individual input. Homogeneity ensures that scaling an input by a scalar results in the response being scaled by the same scalar.
In contrast, nonlinear systems do not inherently possess these properties. However, for small deviations around an operating point, a nonlinear system can often be approximated as linear....

Residuals and Least-Squares Property

Residuals and Least-Squares Property

The vertical distance between the actual value of y and the estimated value of y. In other words, it measures the vertical distance between the actual data point and the predicted point on the line
If the observed data point lies above the line, the residual is positive, and the line underestimates the actual data value for y. If the observed data point lies below the line, the residual is negative, and the line overestimates the actual data value for y.
The process of fitting the best-fit...

State Space Representation

State Space Representation

The frequency-domain technique, commonly used in analyzing and designing feedback control systems, is effective for linear, time-invariant systems. However, it falls short when dealing with nonlinear, time-varying, and multiple-input multiple-output systems. The time-domain or state-space approach addresses these limitations by utilizing state variables to construct simultaneous, first-order differential equations, known as state equations, for an nth-order system.
Consider an RLC circuit, a...

Application of Linearization and Approximation

Application of Linearization and Approximation

A drone flying through complex terrain often relies on more than one sensing method to estimate small changes in altitude. Along with direct measurements, air pressure provides a useful indirect indicator of vertical movement. Atmospheric pressure decreases as altitude increases, and this relationship is commonly described using an exponential model. Although accurate, converting pressure measurements into altitude values requires calculations that are too complex to perform repeatedly during...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

TikTok is a valuable data source for tracking the opioid crisis.

NPJ digital medicine·2026

Same author

Drug-Target Interaction Prediction with PIGLET.

bioRxiv : the preprint server for biology·2026

Same author

GATSBI: Improving context-aware protein embeddings through biologically motivated data splits.

bioRxiv : the preprint server for biology·2026

Same author

Biological data governance in an age of AI.

Science (New York, N.Y.)·2026

Same author

The Human Omnibus of Targetable Pockets.

Journal of cheminformatics·2025

Same author

Publisher Correction: CRISPR-GPT for agentic automation of gene-editing experiments.

Nature biomedical engineering·2025

Same journal

Correction to "AstraMEV (AI-Guided Structural Assembly of Multi-Epitope Vaccines) Against Infectious Bronchitis Virus".

Journal of chemical information and modeling·2026

Same journal

MolPy: A Large Language Model-Friendly Toolkit for Reactive Topology Editing in Polymer Simulations.

Journal of chemical information and modeling·2026

Same journal

Molecular Mechanisms of KIT Receptor Dimerization and Oncogenic Activation Revealed by Multiscale Simulations.

Journal of chemical information and modeling·2026

Same journal

Structural and Thermodynamic Discrimination between Agonists and Antagonists of Retinoic Acid Receptor γ and the Vitamin D Receptor.

Journal of chemical information and modeling·2026

Same journal

PACEff Builder: An Efficient Platform for Constructing PACE Hybrid-Resolution Models for Molecular Dynamics Simulations of Aqueous Protein, Peptide Assembly, and Membrane Protein Systems.

Journal of chemical information and modeling·2026

Same journal

TransKla: A Local-Global Cross-Attention Based Transformer Approach for Prediction of Lysine Lactylation Sites.

Journal of chemical information and modeling·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Feb 26, 2026

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches

Published on: December 15, 2023

Shallow Representation Learning via Kernel PCA Improves QSAR Modelability.

Stefano E Rensi¹, Russ B Altman¹

¹Department of Bioengineering, Stanford University , Shriram Center, Room 213, 443 Via Ortega MC 4245, Stanford, California 94305, United States.

Journal of Chemical Information and Modeling

|July 21, 2017

Summary

This summary is machine-generated.

Shallow representation learning enhances linear models like LASSO to match nonlinear QSAR performance. This approach using kernel principal component analysis (KPCA) offers faster computation than Support Vector Machines (SVMs).

More Related Videos

Identification of Disease-related Spatial Covariance Patterns using Neuroimaging Data

Identification of Disease-related Spatial Covariance Patterns using Neuroimaging Data

Published on: June 26, 2013

Related Experiment Videos

Last Updated: Feb 26, 2026

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches

Published on: December 15, 2023

Identification of Disease-related Spatial Covariance Patterns using Neuroimaging Data

Identification of Disease-related Spatial Covariance Patterns using Neuroimaging Data

Published on: June 26, 2013

Area of Science:

Computational chemistry
Cheminformatics
Machine learning in drug discovery

Background:

Linear models for quantitative structure-activity relationships (QSARs) are efficient but often outperformed by nonlinear methods.
Support vector machines (SVMs) and neural networks excel by learning data representations, improving model accuracy.
Existing QSAR methods face challenges in balancing performance, computational efficiency, and flexibility.

Purpose of the Study:

To improve the performance of L1 regularized logistic regression (LASSO) using shallow representation learning.
To achieve performance comparable to nonlinear methods like Tanimoto SVM for QSAR prediction.
To evaluate the computational efficiency of enhanced linear models compared to nonlinear alternatives.

Main Methods:

Embedding chemical fingerprints into Euclidean space via Tanimoto similarity kernel principal component analysis (KPCA).
Applying LASSO and SVM classifiers to predict binding activities of chemical compounds against 102 virtual screening targets.
Comparing model performance, training times, and cross-validation efficiency between LASSO and SVM.

Main Results:

LASSO, enhanced with KPCA, demonstrated performance comparable to Tanimoto SVM.
Similar performance improvements were observed for both LASSO and SVM when using KPCA.
KPCA combined with LASSO classification showed significantly faster computation than linear SVM across various training set sizes.

Conclusions:

Powerful linear QSAR methods can achieve performance levels rivaling nonlinear methods through representation learning.
A modular approach to nonlinear classification enhances QSAR model prototyping, flexibility, and transferability.
This study highlights a computationally efficient strategy for improving QSAR model accuracy and applicability.