Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Improving Translational Accuracy02:07

Improving Translational Accuracy

Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...
Improving Translational Accuracy02:07

Improving Translational Accuracy

Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...
Conservative Site-specific Recombination and Phase Variation02:53

Conservative Site-specific Recombination and Phase Variation

Because the DNA segments are cut and reorganized in a direction-specific manner, site-specific recombination has emerged as an efficient genetic engineering technique. Flippase and Cyclization recombinases or Flp and Cre, respectively, are two members of the tyrosine recombinase family derived from bacteriophages, that are used to mediate site-specific DNA insertions, deletions, and targeted expression of proteins in mammalian cell lines.
The recognition sites for Cre recombinase called LoxP...
Combinatorial Gene Control02:33

Combinatorial Gene Control

Combinatorial gene control is the synergistic action of several transcriptional factors to regulate the expression of a single gene. The absence of one or more of these factors may lead to a significant difference in the level of gene expression or repression.
The expression of more than 30,000 genes is controlled by approximately 2000-3000 transcription factors. This is possible because a single transcription factor can recognize more than one regulatory sequence. The specificity in gene...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

VO: The Vaccine Ontology.

Scientific data·2026
Same author

Nested Named Entity Recognition using Multilayer BERT-based Model: Notebook for the BioASQ Lab at CLEF 2024.

CEUR workshop proceedings.·2026
Same author

STAR-GO: improving protein function prediction by learning to hierarchically integrate ontology-informed semantic embeddings.

Bioinformatics (Oxford, England)·2026
Same author

Guideline-based strategies to identify severe cytokine release syndrome in COVID-19 and cancer immunotherapy using large-scale electronic health records.

Frontiers in digital health·2026
Same author

Leveraging Large Language Models for Cancer Vaccine Adjuvant Name Extraction from Biomedical Literature.

AMIA ... Annual Symposium proceedings. AMIA Symposium·2026
Same author

Enhancing Severe Neutropenia Prediction: PKPD-Informed Labeling for Machine Learning Models Trained on Real-World Data.

Clinical pharmacology and therapeutics·2025
Same journal

SpaceExpander: An Automated System for Drafting Markush Claims to Expand Chemical Space.

Molecular informatics·2026
Same journal

A Structure-Informed Atlas of Venom-Derived Peptides Reveals the Organization of Chemical Space.

Molecular informatics·2026
Same journal

Novel Molecules Generation Using Graph Generative Adversarial Networks.

Molecular informatics·2026
Same journal

An Attention-Driven Graph Transformer With Nonlinear Modeling and Neuro-Fuzzy Fusion for High-Order Toxic Molecular Graph Learning.

Molecular informatics·2026
Same journal

Molecular Modeling and Chemoinformatics in Ukraine.

Molecular informatics·2026
Same journal

ADME-DTI: Augmented Deep Meta Ensemble for Drug-Target Interaction Prediction.

Molecular informatics·2026
See all related articles

Related Experiment Videos

ConGen: Targeted Molecule Generation Through Contrastive Learning and Latent Optimization.

Can Koban1, Gökçe Uludoğan1, Elif Ozkirimli2

  • 1Department of Computer Engineering, Boğaziçi University, Istanbul, Turkey.

Molecular Informatics
|May 20, 2026
PubMed
Summary
This summary is machine-generated.

We developed ConGen, a novel framework for generating targeted drug molecules directly from protein sequences. By incorporating non-interacting molecule data, ConGen enhances prediction accuracy for drug discovery.

Related Experiment Videos

Area of Science:

  • Computational chemistry
  • Drug discovery
  • Bioinformatics

Background:

  • Designing target-specific molecules from protein structures is established, but prediction from protein sequences alone is challenging.
  • Existing models often focus solely on interacting molecule pairs, potentially limiting specificity and generalizability.

Purpose of the Study:

  • To introduce ConGen, a sequence-conditioned framework for targeted molecule generation from protein sequence.
  • To improve molecule prediction models by including information on noninteracting pairs.

Main Methods:

  • ConGen utilizes a two-stage approach: contrastive learning (CL) and latent optimization (LO) in a shared pretrained space.
  • CL maps proteins and molecules into a joint space, attracting interacting pairs and repelling noninteracting ones.
  • LO refines random molecule latents towards the target protein's embedding for candidate generation.

Main Results:

  • ConGen demonstrates performance comparable to state-of-the-art sequence-based models like EncDecLM for targeted drug generation.
  • An ablation study confirmed the significant contributions of both the CL and LO stages to ConGen's effectiveness.

Conclusions:

  • ConGen represents a novel approach for targeted molecule generation using only protein sequences.
  • The inclusion of noninteracting molecule information and the combined CL/LO strategy enhance model specificity and generalizability in drug discovery.