Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Videos

Random forest: a classification and regression tool for compound classification and QSAR modeling.

Vladimir Svetnik1, Andy Liaw, Christopher Tong

  • 1Biometrics Research, Merck Research Laboratories, PO Box 2000, Rahway, New Jersey 07065, USA. vladimir_svetnik@merck.com

Journal of Chemical Information and Computer Sciences
|November 25, 2003
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

LC-MS measurement of triglyceride hydrolysis: Application in studies of Patatin-like phospholipase domain-containing protein 3 activity.

Analytical biochemistry·2026
Same author

Conformal Selection for Efficient and Accurate Compound Screening in Drug Discovery.

Journal of chemical information and modeling·2025
Same author

Enhancing Permeability Prediction of Heterobifunctional Degraders Using Machine Learning and Metadynamics-Informed 3D Molecular Descriptors.

Journal of chemical information and modeling·2025
Same author

Cross-dataset transcriptomic pathway ranking highlights MAPK signaling in diquat-induced hepatocellular toxicity.

Toxicology mechanisms and methods·2025
Same author

Targeting FANCM using the antisense oligonucleotides to treat the ALT-positive cancers.

Molecular therapy. Nucleic acids·2025
Same author

Bayesian hierarchical model predicts biopharmaceutical stability indicators and shelf life with application to multivalent human papillomavirus vaccine.

Scientific reports·2025
Same journal

Future Papers.

Journal of chemical information and computer sciences·2016
Same journal

Future Papers.

Journal of chemical information and computer sciences·2016
Same journal

Future Papers.

Journal of chemical information and computer sciences·2016
Same journal

Future Papers.

Journal of chemical information and computer sciences·2016
Same journal

Future Papers.

Journal of chemical information and computer sciences·2016
Same journal

Future Papers.

Journal of chemical information and computer sciences·2016
See all related articles

Random Forest, a new machine learning tool, accurately predicts compound biological activity using molecular structure. This powerful method offers built-in assessments and feature importance measures, making it ideal for cheminformatics modeling.

Area of Science:

  • Cheminformatics
  • Machine Learning
  • Computational Chemistry

Background:

  • Predicting compound biological activity is crucial in drug discovery.
  • Existing methods may lack accuracy or comprehensive features for complex modeling.

Purpose of the Study:

  • Introduce and evaluate the Random Forest algorithm for predicting compound biological activity.
  • Assess its performance against established cheminformatics methods.

Main Methods:

  • Random Forest, an ensemble of decision trees, was applied.
  • Models were built using quantitative molecular descriptors.
  • Predictions aggregated via majority vote or averaging.

Main Results:

  • Random Forest demonstrated high predictive accuracy across six diverse cheminformatics datasets.

Related Experiment Videos

  • Performance was competitive with, and often superior to, existing state-of-the-art methods.
  • The algorithm provided built-in performance assessment and descriptor importance measures.
  • Conclusions:

    • Random Forest is a powerful and accurate tool for quantitative and categorical activity prediction in cheminformatics.
    • Its unique features, including descriptor importance and similarity measures, enhance its utility.
    • Random Forest is well-suited for complex cheminformatics modeling tasks.