Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Observational Learning

Observational Learning

Albert Bandura's observational learning, also known as imitation or modeling, occurs when a person observes and imitates another's behavior. It is a quicker process than operant conditioning. A well-known example is the Bobo doll study, where children who saw an adult acting aggressively towards the doll were more likely to act aggressively when left alone, compared to those who observed a nonaggressive adult. Many psychologists view observational learning as a form of latent learning...

Reinforcement Schedules

Reinforcement Schedules

Positive reinforcement is a powerful method for teaching new behaviors to both animals and humans. B.F. Skinner demonstrated this with his experiments using rats in a Skinner box. When a rat pressed a lever, it received a food pellet. This immediate reward encouraged the rat to repeat the behavior. This method, where a reward follows every instance of the behavior, is known as continuous reinforcement. It is highly effective for establishing new behaviors quickly.
Once a behavior is learned,...

Reinforcement

Reinforcement

Positive and negative reinforcement are key concepts in operant conditioning, a learning process where the consequences of a behavior affect the likelihood of that behavior being repeated.
Positive reinforcement occurs when a behavior is followed by the presentation of a rewarding stimulus, increasing the frequency of that behavior. For example:

Collisions in Multiple Dimensions: Problem Solving

Collisions in Multiple Dimensions: Problem Solving

In multiple dimensions, the conservation of momentum applies in each direction independently. Hence, to solve collisions in multiple dimensions, we should write down the momentum conservation in each direction separately. To help understand collisions in multiple dimensions, consider an example.
A small car of mass 1,200 kg traveling east at 60 km/h collides at an intersection with a truck of mass 3,000 kg traveling due north at 40 km/h. The two vehicles are locked together. What is the...

Relative Motion Analysis using Rotating Axes-Problem Solving

Relative Motion Analysis using Rotating Axes-Problem Solving

Consider a crane whose telescopic boom rotates with an angular velocity of 0.04 rad/s and angular acceleration of 0.02 rad/s2. Along with the rotation, the boom also extends linearly with a uniform speed of 5 m/s. The extension of the boom is measured at point D, which is measured with respect to the fixed point C on the other end of the boom. For the given instant, the distance between points C and D is 60 meters.
Here, in order to determine the magnitude of velocity and acceleration for point...

Rolling Resistance: Problem Solving

Rolling Resistance: Problem Solving

Rolling resistance, also known as rolling friction, is the force that resists the motion of a rolling object, such as a wheel, tire, or ball, when it moves over a surface. It is caused by the deformation of the object and the surface in contact with each other, as well as other factors like internal friction, hysteresis, and energy losses within the materials. Rolling resistance opposes the object's motion, requiring additional energy to overcome it and maintain movement. In practical...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Generating Planar Trajectories for Neptunian System Exploration Using Motion Primitives.

The journal of the astronautical sciences·2026

Same author

Warming riverscapes annually challenge the role of thermal refuges for thermoregulating salmonids.

The Journal of applied ecology·2026

Same author

The production of the chemokine CCL2 by corneal sensory neurons initiates anti-viral immunity at the cornea and trigeminal ganglion.

Cell reports·2025

Same author

A database of life history parameters for Pacific coral reef fish.

Scientific data·2025

Same author

Measuring the Structure, Composition, and Change of Underwater Environments with Large-area Imaging.

Journal of visualized experiments : JoVE·2025

Same author

Motion Primitive Approach to Spacecraft Trajectory Design in a Multi-body System.

The journal of the astronautical sciences·2023

Same journal

Venus Flagship Mission Concept: A Decadal Survey Study.

IEEE Aerospace Conference. IEEE Aerospace Conference·2021

See all related articles

Search research articles

Related Experiment Video

Updated: Oct 6, 2025

Behavioral Training Procedures for Head-fixed Virtual Reality in Mice

Behavioral Training Procedures for Head-fixed Virtual Reality in Mice

Published on: September 6, 2024

Exploring Transfers between Earth-Moon Halo Orbits via Multi-Objective Reinforcement Learning.

Christopher J Sullivan¹, Natasha Bosanac¹, Rodney L Anderson²

¹Colorado Center for Astrodynamics, Smead Aerospace Engineering Sciences, University of Colorado Boulder, 429 UCB, Boulder, CO 80303.

IEEE Aerospace Conference. IEEE Aerospace Conference

|January 14, 2022

Summary

This summary is machine-generated.

Multi-Reward Proximal Policy Optimization trains spacecraft control schemes for efficient low-thrust trajectories between Earth-Moon orbits. This deep reinforcement learning method rapidly explores design options, balancing fuel use, flight time, and mission objectives.

More Related Videos

Simulating Imaging of Large Scale Radio Arrays on the Lunar Surface

Simulating Imaging of Large Scale Radio Arrays on the Lunar Surface

Published on: July 30, 2020

The Double-H Maze: A Robust Behavioral Test for Learning and Memory in Rodents

The Double-H Maze: A Robust Behavioral Test for Learning and Memory in Rodents

Published on: July 8, 2015

Related Experiment Videos

Last Updated: Oct 6, 2025

Behavioral Training Procedures for Head-fixed Virtual Reality in Mice

Behavioral Training Procedures for Head-fixed Virtual Reality in Mice

Published on: September 6, 2024

Simulating Imaging of Large Scale Radio Arrays on the Lunar Surface

Simulating Imaging of Large Scale Radio Arrays on the Lunar Surface

Published on: July 30, 2020

The Double-H Maze: A Robust Behavioral Test for Learning and Memory in Rodents

The Double-H Maze: A Robust Behavioral Test for Learning and Memory in Rodents

Published on: July 8, 2015

Area of Science:

Aerospace Engineering
Astrodynamics
Artificial Intelligence

Background:

Designing low-thrust trajectories for spacecraft transfers, especially for SmallSats between libration point orbits, presents complex multi-objective challenges.
Traditional methods can be computationally intensive and may not efficiently explore the full design space.
Balancing propellant usage, flight time, and trajectory accuracy is critical for mission success.

Purpose of the Study:

To investigate the design space of low-thrust trajectories for a SmallSat in the Earth-Moon system using a multi-objective deep reinforcement learning algorithm.
To efficiently train multiple policies simultaneously for distinct trajectory design scenarios.
To autonomously construct the solution space for rapid insights into trade-offs.

Main Methods:

Application of Multi-Reward Proximal Policy Optimization (MrPPO), a multi-objective deep reinforcement learning algorithm.
Training multiple policies on three distinct trajectory design scenarios, each with a unique reward function.
Evaluation of policies on perturbed initial conditions to generate performance metrics like propellant mass usage and flight time.

Main Results:

Successfully trained unique control schemes for each trajectory design scenario and reward function.
Generated data on propellant mass usage, flight time, and state discontinuities for various low-thrust trajectories.
Examined a subset of the multi-objective trade space, revealing insights into transfer geometry and performance.

Conclusions:

MrPPO enables efficient exploration of the multi-objective trade space for SmallSat trajectory design.
The algorithm autonomously constructs solutions, providing rapid insights into propellant mass, flight time, and transfer geometry.
This approach accelerates the understanding of complex orbital transfer dynamics and design parameters.