Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Reinforcement Schedules01:24

Reinforcement Schedules

559
Positive reinforcement is a powerful method for teaching new behaviors to both animals and humans. B.F. Skinner demonstrated this with his experiments using rats in a Skinner box. When a rat pressed a lever, it received a food pellet. This immediate reward encouraged the rat to repeat the behavior. This method, where a reward follows every instance of the behavior, is known as continuous reinforcement. It is highly effective for establishing new behaviors quickly.
Once a behavior is learned,...
559
Observational Learning01:12

Observational Learning

1.1K
Albert Bandura's observational learning, also known as imitation or modeling, occurs when a person observes and imitates another's behavior. It is a quicker process than operant conditioning. A well-known example is the Bobo doll study, where children who saw an adult acting aggressively towards the doll were more likely to act aggressively when left alone, compared to those who observed a nonaggressive adult. Many psychologists view observational learning as a form of latent learning...
1.1K
Optimization Problems01:26

Optimization Problems

93
Optimization problems often involve identifying maximum or minimum values under specific constraints. A well-known example is determining the longest horizontal pipe that can be moved around a right-angled corner, where a 3-meter-wide hallway meets a 2-meter-wide hallway. This scenario, common in architectural design and industrial transport, can be understood conceptually through geometric and trigonometric reasoning.To visualize the problem, consider the pipe as a straight line that touches...
93
Reinforcement01:23

Reinforcement

992
Positive and negative reinforcement are key concepts in operant conditioning, a learning process where the consequences of a behavior affect the likelihood of that behavior being repeated.
Positive reinforcement occurs when a behavior is followed by the presentation of a rewarding stimulus, increasing the frequency of that behavior. For example:
992
Improving Translational Accuracy02:07

Improving Translational Accuracy

3.7K
3.7K
Improving Translational Accuracy02:07

Improving Translational Accuracy

15.2K
Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...
15.2K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Adaptive genomic evolution and WD40-regulated temporal dynamics of anthocyanins support leaf photoplasticity in Parrotia subaequalis.

BMC plant biology·2026
Same author

Pathogenic Mechanisms of V. vulnificus and Its Role in the Development of Sepsis.

MicrobiologyOpen·2026
Same author

CauFinder: Steering Cell-State and Phenotype Transitions by Causal Disentanglement Learning.

Advanced science (Weinheim, Baden-Wurttemberg, Germany)·2026
Same author

Mesenchymal Stem Cell Therapy for Type 2 Diabetes: Synergistic β-Cell Regeneration, Immune Modulation, and Exosome-Mediated Glucose Homeostasis.

Stem cells international·2026
Same author

Single-cell immunoprofiling reveals a dysfunctional-like immune microenvironment and malignant phenotype in aging penile squamous cell carcinoma.

Cell biology and toxicology·2026
Same author

The Adjunctive Role of Botulinum Toxin A in Wound Healing and Scar Management.

Aesthetic plastic surgery·2026
Same journal

Layered social competition coordinates reproductive hierarchy formation in ants.

bioRxiv : the preprint server for biology·2026
Same journal

Combination epigenetic-targeted therapy increases the immunogenicity of poorly immunogenic sarcomas.

bioRxiv : the preprint server for biology·2026
Same journal

Loss of LanC-like proteins delays post-injury regeneration of aging skeletal muscles.

bioRxiv : the preprint server for biology·2026
Same journal

Integrative Transfer Network: Deep Transfer Learning Across Populations and Prediction Targets.

bioRxiv : the preprint server for biology·2026
Same journal

Confidence-supported label-free metabolic imaging with FPhaS phase autofluorescence microscopy.

bioRxiv : the preprint server for biology·2026
Same journal

Sequence-encoded autoinhibition couples mRNA decapping activity to phase separation.

bioRxiv : the preprint server for biology·2026
See all related articles

Related Experiment Video

Updated: Feb 24, 2026

Protein WISDOM: A Workbench for In silico De novo Design of BioMolecules
10:58

Protein WISDOM: A Workbench for In silico De novo Design of BioMolecules

Published on: July 25, 2013

17.7K

CodonRL: Multi-Objective Codon Sequence Optimization Using Demonstration-Guided Reinforcement Learning.

Shiyi Du1, Gün Kaynar1, Jiayi Li1

  • 1Carnegie Mellon University.

Biorxiv : the Preprint Server for Biology
|February 23, 2026
PubMed
Summary
This summary is machine-generated.

CodonRL, a novel reinforcement learning framework, optimizes mRNA sequences for enhanced translation efficiency and stability. It outperforms existing methods by learning structural priors and enabling user-controlled multi-objective trade-offs.

More Related Videos

Spatial Multiobjective Optimization of Agricultural Conservation Practices using a SWAT Model and an Evolutionary Algorithm
11:53

Spatial Multiobjective Optimization of Agricultural Conservation Practices using a SWAT Model and an Evolutionary Algorithm

Published on: December 9, 2012

13.5K
Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

1.2K

Related Experiment Videos

Last Updated: Feb 24, 2026

Protein WISDOM: A Workbench for In silico De novo Design of BioMolecules
10:58

Protein WISDOM: A Workbench for In silico De novo Design of BioMolecules

Published on: July 25, 2013

17.7K
Spatial Multiobjective Optimization of Agricultural Conservation Practices using a SWAT Model and an Evolutionary Algorithm
11:53

Spatial Multiobjective Optimization of Agricultural Conservation Practices using a SWAT Model and an Evolutionary Algorithm

Published on: December 9, 2012

13.5K
Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

1.2K

Area of Science:

  • Computational biology
  • Synthetic biology
  • Bioinformatics

Background:

  • Optimizing mRNA sequences for translation efficiency, RNA stability, and compositional properties is complex due to vast search spaces and interacting objectives.
  • Existing methods like dynamic programming struggle with extensibility, while deep generative models require extensive training data, and reinforcement learning faces challenges with delayed rewards and large action spaces.

Purpose of the Study:

  • To develop a reinforcement learning framework, CodonRL, for efficient and customizable mRNA sequence optimization.
  • To address limitations of existing methods by incorporating structural priors and enabling multi-objective trade-offs during inference.

Main Methods:

  • CodonRL utilizes reinforcement learning with efficient folding feedback and demonstration-guided replay to learn structural priors for mRNA design.
  • It employs LinearFold for rapid intermediate reward computation during training and ViennaRNA for final evaluation.
  • Milestone-based intermediate rewards and expert sequence warm-up accelerate convergence and handle delayed feedback in long-range optimization.

Main Results:

  • CodonRL demonstrated superior performance compared to GEMORNA on 55 human proteins.
  • Achieved an average 9.5% higher codon adaptation index (CAI), 25.4 kcal/mol more favorable minimum free energy (MFE), and 3.4% lower uridine content.
  • Improved codon stabilization coefficient (CSC) in over 90% of benchmark proteins under matched constraints.

Conclusions:

  • CodonRL provides a powerful framework for designing mRNA sequences with improved translation efficiency, structural stability, and reduced immunogenicity.
  • The method allows for continuous objective reweighting at inference time, offering flexibility in mRNA design.
  • CodonRL represents a significant advancement in computational mRNA design, particularly in scenarios with scarce training data.