Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Survival Tree01:19

Survival Tree

339
Survival trees are a non-parametric method used in survival analysis to model the relationship between a set of covariates and the time until an event of interest occurs, often referred to as the "time-to-event" or "survival time." This method is particularly useful when dealing with censored data, where the event has not occurred for some individuals by the end of the study period, or when the exact time of the event is unknown.
 Building a Survival Tree
Constructing a...
339
Improving Translational Accuracy02:07

Improving Translational Accuracy

13.9K
Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...
13.9K
Improving Translational Accuracy02:07

Improving Translational Accuracy

3.5K
3.5K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Advancing Reproducibility and Open Data in Theoretical and Computational Chemistry.

Journal of chemical theory and computation·2026
Same author

Topological data analysis and topological deep learning beyond persistent homology: a review.

Artificial intelligence review·2026
Same author

Interpretability and Representability of Commutative Algebra, Algebraic Topology, and Topological Spectral Theory for Real-World Data.

Advanced intelligent discovery·2026
Same author

Commutative Algebra Modeling in Materials Science - A Case Study on Metal-Organic Frameworks (MOFs).

Journal of chemical information and modeling·2026
Same author

Computational Drug Repurposing for Alzheimer's Disease via Sheaf Theoretic Population-Scale Analysis of snRNA-Seq Data.

Journal of medicinal chemistry·2026
Same author

Mayer-Homology Learning Prediction of Protein-Ligand Binding Affinities.

Journal of computational biophysics and chemistry·2025
Same journal

QSAR in the Browser: An Interactive Cheminformatics Web Application.

Journal of chemical information and modeling·2026
Same journal

FoldDoF: Utilizing the Primary Degrees of Freedom of Protein Backbone for Geometric Modeling and Generation.

Journal of chemical information and modeling·2026
Same journal

Derisking Affinity Optimization for Macrocycles and Cyclic Peptides: High-Precision Free Energy Simulations across Five Diverse Targets.

Journal of chemical information and modeling·2026
Same journal

An End-User Audit of Reproducibility, Data Leakage, and Overfitting of the Top-Ranked ADMET Prediction Models in TDC Leaderboards.

Journal of chemical information and modeling·2026
Same journal

PFASGroups: An Open-Source Framework for Automated Identification, Structural Classification, and Prioritization of Per- and Polyfluoroalkyl Substances.

Journal of chemical information and modeling·2026
Same journal

DeepKbhb: Context-Aware Prediction of Human Lysine β-Hydroxybutyrylation Sites.

Journal of chemical information and modeling·2026
See all related articles

Related Experiment Video

Updated: Dec 30, 2025

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

1.2K

Boosting Tree-Assisted Multitask Deep Learning for Small Scientific Datasets.

Jian Jiang1,2, Rui Wang2, Menglun Wang2

  • 1Research Center of Nonlinear Science, College of Mathematics and Computer Science, Engineering Research Center of Hubei Province for Clothing Information, Wuhan Textile University, Wuhan, 430200, P R. China.

Journal of Chemical Information and Modeling
|January 25, 2020
PubMed
Summary
This summary is machine-generated.

This study introduces a new machine learning architecture, boosting tree-assisted multitask deep learning (BTAMDL), to improve predictions for small scientific datasets. BTAMDL effectively combines gradient boosting decision trees and deep learning, outperforming existing methods.

More Related Videos

Deep Neural Networks for Image-Based Dietary Assessment
13:19

Deep Neural Networks for Image-Based Dietary Assessment

Published on: March 13, 2021

9.9K
Collecting and Processing Drone-based Remotely Sensed Data for Use in Forest Recovery Monitoring
08:16

Collecting and Processing Drone-based Remotely Sensed Data for Use in Forest Recovery Monitoring

Published on: October 24, 2025

371

Related Experiment Videos

Last Updated: Dec 30, 2025

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

1.2K
Deep Neural Networks for Image-Based Dietary Assessment
13:19

Deep Neural Networks for Image-Based Dietary Assessment

Published on: March 13, 2021

9.9K
Collecting and Processing Drone-based Remotely Sensed Data for Use in Forest Recovery Monitoring
08:16

Collecting and Processing Drone-based Remotely Sensed Data for Use in Forest Recovery Monitoring

Published on: October 24, 2025

371

Area of Science:

  • Computational chemistry
  • Bioinformatics
  • Machine learning

Background:

  • Machine learning success relies heavily on large, high-quality datasets, which are scarce in scientific fields like bioinformatics and medical science.
  • Gradient boosting decision trees (GBDT) excel with small datasets, while deep learning performs better with large datasets.

Purpose of the Study:

  • To develop an advanced machine learning architecture that enhances prediction accuracy for small scientific datasets.
  • To leverage correlated large datasets to improve performance on limited scientific data.

Main Methods:

  • A novel boosting tree-assisted multitask deep learning (BTAMDL) architecture was developed, integrating GBDT and multitask deep learning (MDL).
  • Two BTAMDL models were created: one using only MDL output for GBDT input, and another incorporating additional features into the GBDT input.
  • Models were validated across diverse datasets, including toxicity, partition coefficient, solubility, and solvation.

Main Results:

  • The proposed BTAMDL models demonstrated superior performance compared to current state-of-the-art methods on small datasets.
  • The architecture effectively handles situations where a large, correlated dataset is available alongside a small target dataset.
  • Validation across multiple scientific data categories confirmed the robustness and efficacy of BTAMDL.

Conclusions:

  • BTAMDL offers a powerful solution for improving machine learning predictions on small, challenging scientific datasets.
  • The integration of GBDT and MDL provides a synergistic approach to overcome data limitations in scientific research.
  • This work advances the application of machine learning in data-scarce scientific domains.