Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Quadratic Models01:23

Quadratic Models

273
Quadratic models are mathematical representations used to describe relationships in which the rate of change changes at a constant rate. These models appear in a wide variety of natural and engineered systems, especially those involving motion, forces, and optimization. One common application is analyzing the vertical motion of objects influenced by gravity, such as a ball thrown into the air.In such scenarios, the object's height changes over time in a curved pattern, rising to a maximum point...
273
Model Approaches for Pharmacokinetic Data: Distributed Parameter Models01:06

Model Approaches for Pharmacokinetic Data: Distributed Parameter Models

298
Pharmacokinetic models are mathematical constructs that represent and predict the time course of drug concentrations in the body, providing meaningful pharmacokinetic parameters. These models are categorized into compartment, physiological, and distributed parameter models.
The distributed parameter models are specifically designed to account for variations and differences in some drug classes. This model is particularly useful for assessing regional concentrations of anticancer or...
298
Linear Approximation in Frequency Domain01:26

Linear Approximation in Frequency Domain

407
Linear systems are characterized by two main properties: superposition and homogeneity. Superposition allows the response to multiple inputs to be the sum of the responses to each individual input. Homogeneity ensures that scaling an input by a scalar results in the response being scaled by the same scalar.
In contrast, nonlinear systems do not inherently possess these properties. However, for small deviations around an operating point, a nonlinear system can often be approximated as linear....
407
Residuals and Least-Squares Property01:11

Residuals and Least-Squares Property

9.6K
The vertical distance between the actual value of y and the estimated value of y. In other words, it measures the vertical distance between the actual data point and the predicted point on the line
If the observed data point lies above the line, the residual is positive, and the line underestimates the actual data value for y. If the observed data point lies below the line, the residual is negative, and the line overestimates the actual data value for y.
The process of fitting the best-fit...
9.6K
State Space Representation01:27

State Space Representation

625
The frequency-domain technique, commonly used in analyzing and designing feedback control systems, is effective for linear, time-invariant systems. However, it falls short when dealing with nonlinear, time-varying, and multiple-input multiple-output systems. The time-domain or state-space approach addresses these limitations by utilizing state variables to construct simultaneous, first-order differential equations, known as state equations, for an nth-order system.
Consider an RLC circuit, a...
625
Application of Linearization and Approximation01:29

Application of Linearization and Approximation

118
A drone flying through complex terrain often relies on more than one sensing method to estimate small changes in altitude. Along with direct measurements, air pressure provides a useful indirect indicator of vertical movement. Atmospheric pressure decreases as altitude increases, and this relationship is commonly described using an exponential model. Although accurate, converting pressure measurements into altitude values requires calculations that are too complex to perform repeatedly during...
118

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

TikTok is a valuable data source for tracking the opioid crisis.

NPJ digital medicine·2026
Same author

Drug-Target Interaction Prediction with PIGLET.

bioRxiv : the preprint server for biology·2026
Same author

GATSBI: Improving context-aware protein embeddings through biologically motivated data splits.

bioRxiv : the preprint server for biology·2026
Same author

Biological data governance in an age of AI.

Science (New York, N.Y.)·2026
Same author

The Human Omnibus of Targetable Pockets.

Journal of cheminformatics·2025
Same author

Publisher Correction: CRISPR-GPT for agentic automation of gene-editing experiments.

Nature biomedical engineering·2025
Same journal

Correction to "AstraMEV (AI-Guided Structural Assembly of Multi-Epitope Vaccines) Against Infectious Bronchitis Virus".

Journal of chemical information and modeling·2026
Same journal

MolPy: A Large Language Model-Friendly Toolkit for Reactive Topology Editing in Polymer Simulations.

Journal of chemical information and modeling·2026
Same journal

Molecular Mechanisms of KIT Receptor Dimerization and Oncogenic Activation Revealed by Multiscale Simulations.

Journal of chemical information and modeling·2026
Same journal

Structural and Thermodynamic Discrimination between Agonists and Antagonists of Retinoic Acid Receptor γ and the Vitamin D Receptor.

Journal of chemical information and modeling·2026
Same journal

PACEff Builder: An Efficient Platform for Constructing PACE Hybrid-Resolution Models for Molecular Dynamics Simulations of Aqueous Protein, Peptide Assembly, and Membrane Protein Systems.

Journal of chemical information and modeling·2026
Same journal

TransKla: A Local-Global Cross-Attention Based Transformer Approach for Prediction of Lysine Lactylation Sites.

Journal of chemical information and modeling·2026
See all related articles

Related Experiment Video

Updated: Feb 26, 2026

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches
09:47

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches

Published on: December 15, 2023

1.9K

Shallow Representation Learning via Kernel PCA Improves QSAR Modelability.

Stefano E Rensi1, Russ B Altman1

  • 1Department of Bioengineering, Stanford University , Shriram Center, Room 213, 443 Via Ortega MC 4245, Stanford, California 94305, United States.

Journal of Chemical Information and Modeling
|July 21, 2017
PubMed
Summary
This summary is machine-generated.

Shallow representation learning enhances linear models like LASSO to match nonlinear QSAR performance. This approach using kernel principal component analysis (KPCA) offers faster computation than Support Vector Machines (SVMs).

More Related Videos

Identification of Disease-related Spatial Covariance Patterns using Neuroimaging Data
14:27

Identification of Disease-related Spatial Covariance Patterns using Neuroimaging Data

Published on: June 26, 2013

16.4K

Related Experiment Videos

Last Updated: Feb 26, 2026

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches
09:47

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches

Published on: December 15, 2023

1.9K
Identification of Disease-related Spatial Covariance Patterns using Neuroimaging Data
14:27

Identification of Disease-related Spatial Covariance Patterns using Neuroimaging Data

Published on: June 26, 2013

16.4K

Area of Science:

  • Computational chemistry
  • Cheminformatics
  • Machine learning in drug discovery

Background:

  • Linear models for quantitative structure-activity relationships (QSARs) are efficient but often outperformed by nonlinear methods.
  • Support vector machines (SVMs) and neural networks excel by learning data representations, improving model accuracy.
  • Existing QSAR methods face challenges in balancing performance, computational efficiency, and flexibility.

Purpose of the Study:

  • To improve the performance of L1 regularized logistic regression (LASSO) using shallow representation learning.
  • To achieve performance comparable to nonlinear methods like Tanimoto SVM for QSAR prediction.
  • To evaluate the computational efficiency of enhanced linear models compared to nonlinear alternatives.

Main Methods:

  • Embedding chemical fingerprints into Euclidean space via Tanimoto similarity kernel principal component analysis (KPCA).
  • Applying LASSO and SVM classifiers to predict binding activities of chemical compounds against 102 virtual screening targets.
  • Comparing model performance, training times, and cross-validation efficiency between LASSO and SVM.

Main Results:

  • LASSO, enhanced with KPCA, demonstrated performance comparable to Tanimoto SVM.
  • Similar performance improvements were observed for both LASSO and SVM when using KPCA.
  • KPCA combined with LASSO classification showed significantly faster computation than linear SVM across various training set sizes.

Conclusions:

  • Powerful linear QSAR methods can achieve performance levels rivaling nonlinear methods through representation learning.
  • A modular approach to nonlinear classification enhances QSAR model prototyping, flexibility, and transferability.
  • This study highlights a computationally efficient strategy for improving QSAR model accuracy and applicability.