Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Mechanistic Models: Compartment Models in Algorithms for Numerical Problem Solving01:29

Mechanistic Models: Compartment Models in Algorithms for Numerical Problem Solving

56
Mechanistic models play a crucial role in algorithms for numerical problem-solving, particularly in nonlinear mixed effects modeling (NMEM). These models aim to minimize specific objective functions by evaluating various parameter estimates, leading to the development of systematic algorithms. In some cases, linearization techniques approximate the model using linear equations.
In individual population analyses, different algorithms are employed, such as Cauchy's method, which uses a...
56

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Multiscale Neural Network Potential with Anisotropic Message Passing for the Fast and Accurate Simulation of Protein Dynamics and Enzymatic Reactions.

Journal of the American Chemical Society·2026
Same author

Balancing Data Quantity and Quality: Evaluating Curation Strategies for Bioactivity Prediction in Lead Optimization.

Journal of chemical information and modeling·2026
Same author

Leveraging the Potential of Machine-Learning Interatomic Potentials for QM/MM Simulations.

Chimia·2026
Same author

Automated algorithm in catheter-associated urinary tract infection identification using digital health records: validity and barriers in implementation into clinical practice.

BMC medical informatics and decision making·2026
Same author

Cardiovascular care in adult cancer survivorship post-therapy: a cross-sectional analysis of clinical practice guidelines.

Cardio-oncology (London, England)·2026
Same author

Torsion angular bin strings: algorithmic update and additional validation.

Journal of cheminformatics·2026
Same journal

OpenStats: how to combine statistics and research data management (RDM) to leverage efficient scientific data analysis by guided statistics.

Journal of cheminformatics·2026
Same journal

Unified heterogeneity-aware benchmark of drug synergy prediction: a cross-study analysis of traditional machine learning and graph deep learning models.

Journal of cheminformatics·2026
Same journal

Count your bits: fingerprint benchmarking to assess broad chemical space representation.

Journal of cheminformatics·2026
Same journal

Sampling out-of-distribution chemical spaces via Bayesian flow.

Journal of cheminformatics·2026
Same journal

Hold on tight: the kinetic profiling of opioid receptor ligands using the CORAL-MD.

Journal of cheminformatics·2026
Same journal

Transformer-accelerated discovery of inhibitors targeting the RpsA<sub>Δ438</sub> deletion in PZA-resistant tuberculosis.

Journal of cheminformatics·2026
See all related articles

Related Experiment Video

Updated: Jul 8, 2025

Design and Analysis for Fall Detection System Simplification
08:05

Design and Analysis for Fall Detection System Simplification

Published on: April 6, 2020

10.7K

SIMPD: an algorithm for generating simulated time splits for validating machine learning approaches.

Gregory A Landrum1, Maximilian Beckers2, Jessica Lanini2

  • 1Department of Chemistry and Applied Biosciences, ETH Zurich, Vladimir-Prelog-Weg 2, Zurich, 8093, Switzerland. glandrum@ethz.ch.

Journal of Cheminformatics
|December 12, 2023
PubMed
Summary
This summary is machine-generated.

A new algorithm, SIMPD (simulated medicinal chemistry project data), creates realistic training and test data for machine learning in drug discovery. This method improves model validation for medicinal chemistry projects using public datasets.

Keywords:
Cross-validationLead optimizationMachine learning

More Related Videos

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique
04:48

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique

Published on: July 5, 2024

406
A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment
12:18

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

Published on: January 11, 2020

7.6K

Related Experiment Videos

Last Updated: Jul 8, 2025

Design and Analysis for Fall Detection System Simplification
08:05

Design and Analysis for Fall Detection System Simplification

Published on: April 6, 2020

10.7K
Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique
04:48

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique

Published on: July 5, 2024

406
A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment
12:18

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

Published on: January 11, 2020

7.6K

Area of Science:

  • Medicinal Chemistry
  • Computational Chemistry
  • Machine Learning

Background:

  • Time-split cross-validation is the standard for validating predictive models in medicinal chemistry.
  • Such data is scarce, particularly outside large pharmaceutical companies.
  • Existing data splitting methods do not accurately reflect real-world project data.

Purpose of the Study:

  • To introduce the SIMPD (simulated medicinal chemistry project data) algorithm.
  • To generate training and test sets that mimic real-world medicinal chemistry project data.
  • To create publicly available datasets for machine learning model validation.

Main Methods:

  • SIMPD employs a multi-objective genetic algorithm.
  • Objectives are based on extensive analysis of compound differences in lead optimization projects.
  • The algorithm was applied to ChEMBL bioactivity data.

Main Results:

  • SIMPD generates training/test splits that better reflect property and performance differences compared to random or neighbor splits.
  • The method accurately mimics differences observed in temporal splits of real-world data.
  • 99 public datasets were created using the SIMPD algorithm.

Conclusions:

  • SIMPD provides a valuable tool for validating machine learning models in medicinal chemistry.
  • The algorithm enables the use of public data for more realistic model assessment.
  • The code and datasets are available open-source for broader research use.