Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Protein Complexes with Interchangeable Parts01:57

Protein Complexes with Interchangeable Parts

3.0K
Groups of proteins may form a complex where each protein in this complex has a different role in the overall execution of the complex’s function. Often some of the proteins in the complex can be replaced by a closely related variant to give a complex that contains many of the same components yet is functionally distinct.
The SCF ubiquitin ligase is a protein complex of five individual proteins. This complex attaches ubiquitin to other target proteins to mark them for degradation. In order...
3.0K
Protein Complexes with Interchangeable Parts01:57

Protein Complexes with Interchangeable Parts

2.2K
2.2K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Computational Identification of Active Drug Metabolites for Human Protein Targets.

Molecular pharmaceutics·2026
Same author

Linker-Driven Sampling of PROTAC-Induced Ternary Complexes.

Journal of medicinal chemistry·2026
Same author

Exploring Secondary Structure Predictions for RNA-Targeted Drug Discovery: Power and Challenges.

Journal of chemical information and modeling·2026
Same author

Prediction of Permeability and Efflux Using Multitask Learning.

ACS omega·2025
Same author

Correction: Enhanced Thompson sampling by roulette wheel selection for screening ultralarge combinatorial libraries.

Journal of cheminformatics·2025
Same author

Enhanced Thompson sampling by roulette wheel selection for screening ultralarge combinatorial libraries.

Journal of cheminformatics·2025
Same journal

Count your bits: fingerprint benchmarking to assess broad chemical space representation.

Journal of cheminformatics·2026
Same journal

Sampling out-of-distribution chemical spaces via Bayesian flow.

Journal of cheminformatics·2026
Same journal

Hold on tight: the kinetic profiling of opioid receptor ligands using the CORAL-MD.

Journal of cheminformatics·2026
Same journal

Transformer-accelerated discovery of inhibitors targeting the RpsA<sub>Δ438</sub> deletion in PZA-resistant tuberculosis.

Journal of cheminformatics·2026
Same journal

DICL: a manually curated database of ion channels and ligands as a useful platform for drug discovery targeting ion channels.

Journal of cheminformatics·2026
Same journal

DCPM-ADMET: fusion of dual-component pre-trained model and molecular fingerprints to enhance drug ADMET properties prediction.

Journal of cheminformatics·2026
See all related articles

Related Experiment Video

Updated: Feb 22, 2026

A Protocol for Computer-Based Protein Structure and Function Prediction
16:41

A Protocol for Computer-Based Protein Structure and Function Prediction

Published on: November 3, 2011

69.9K

PROTAC-Splitter: a machine learning framework for automated identification of PROTAC substructures.

Stefano Ribes1, Ranxuan Zhang1, Télio Cropsal1

  • 1Department of Computer Science and Engineering, Chalmers University of Technology and University of Gothenburg, Chalmersplatsen 1, 412 96, Gothenburg, Sweden.

Journal of Cheminformatics
|February 20, 2026
PubMed
Summary
This summary is machine-generated.

PROTAC-Splitter, a machine learning tool, automates the annotation of proteolysis-targeting chimera (PROTAC) components. It uses a hybrid approach to reliably analyze diverse PROTAC structures, overcoming data scarcity challenges.

Keywords:
CheminformaticsDrug discoveryMachine learningPROTACTargeted protein degradation

More Related Videos

Application of I TASSER, trRosetta, UCSF Chimera, HADDOCK server, and HEX loria for De Novo and In Silico Design of Proteins
05:08

Application of I TASSER, trRosetta, UCSF Chimera, HADDOCK server, and HEX loria for De Novo and In Silico Design of Proteins

Published on: July 8, 2025

1.2K
Fully Autonomous Characterization and Data Collection from Crystals of Biological Macromolecules
07:11

Fully Autonomous Characterization and Data Collection from Crystals of Biological Macromolecules

Published on: March 22, 2019

7.3K

Related Experiment Videos

Last Updated: Feb 22, 2026

A Protocol for Computer-Based Protein Structure and Function Prediction
16:41

A Protocol for Computer-Based Protein Structure and Function Prediction

Published on: November 3, 2011

69.9K
Application of I TASSER, trRosetta, UCSF Chimera, HADDOCK server, and HEX loria for De Novo and In Silico Design of Proteins
05:08

Application of I TASSER, trRosetta, UCSF Chimera, HADDOCK server, and HEX loria for De Novo and In Silico Design of Proteins

Published on: July 8, 2025

1.2K
Fully Autonomous Characterization and Data Collection from Crystals of Biological Macromolecules
07:11

Fully Autonomous Characterization and Data Collection from Crystals of Biological Macromolecules

Published on: March 22, 2019

7.3K

Area of Science:

  • Medicinal Chemistry
  • Computational Chemistry
  • Drug Discovery

Background:

  • Proteolysis-targeting chimeras (PROTACs) are heterobifunctional molecules with therapeutic potential.
  • Manual annotation of PROTAC components (E3 ligase ligand, linker, warhead) is challenging and time-consuming.
  • Automated methods are needed to accurately identify and annotate PROTAC substructures.

Purpose of the Study:

  • To develop and validate PROTAC-Splitter, a machine learning framework for automated PROTAC substructure annotation.
  • To address data scarcity by generating and releasing a large synthetic dataset of annotated PROTACs.
  • To compare different machine learning models for PROTAC annotation.

Main Methods:

  • Development of PROTAC-Splitter, a machine learning framework.
  • Generation of a synthetic dataset of ~1.3 million annotated PROTAC structures.
  • Implementation of Transformer-based and XGBoost-based models for substructure annotation.
  • Evaluation on public and proprietary PROTAC datasets, including structurally novel compounds.
  • Development of a Transformer wrapper (Transformer-Δ) to correct prediction errors.

Main Results:

  • The Transformer model achieved 86% exact-match accuracy on public data but struggled with novel structures.
  • The XGBoost model ensured chemical validity and perfect reassembly but had lower exact-match accuracy.
  • Transformer-Δ improved reassembly accuracy to 96% (public) and 70% (internal datasets).
  • A hybrid approach combining Transformer-Δ and XGBoost demonstrated robust annotation across diverse chemical spaces.

Conclusions:

  • PROTAC-Splitter offers a reliable and scalable solution for automated PROTAC analysis.
  • The hybrid approach effectively annotates PROTACs, overcoming limitations of individual models.
  • The open-source availability of PROTAC-Splitter facilitates broader adoption in drug discovery.