Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Observational Learning01:12

Observational Learning

361
Albert Bandura's observational learning, also known as imitation or modeling, occurs when a person observes and imitates another's behavior. It is a quicker process than operant conditioning. A well-known example is the Bobo doll study, where children who saw an adult acting aggressively towards the doll were more likely to act aggressively when left alone, compared to those who observed a nonaggressive adult. Many psychologists view observational learning as a form of latent learning...
361
Reinforcement Schedules01:24

Reinforcement Schedules

257
Positive reinforcement is a powerful method for teaching new behaviors to both animals and humans. B.F. Skinner demonstrated this with his experiments using rats in a Skinner box. When a rat pressed a lever, it received a food pellet. This immediate reward encouraged the rat to repeat the behavior. This method, where a reward follows every instance of the behavior, is known as continuous reinforcement. It is highly effective for establishing new behaviors quickly.
Once a behavior is learned,...
257
Reinforcement01:23

Reinforcement

416
Positive and negative reinforcement are key concepts in operant conditioning, a learning process where the consequences of a behavior affect the likelihood of that behavior being repeated.
Positive reinforcement occurs when a behavior is followed by the presentation of a rewarding stimulus, increasing the frequency of that behavior. For example:
416
Collisions in Multiple Dimensions: Problem Solving01:06

Collisions in Multiple Dimensions: Problem Solving

4.5K
In multiple dimensions, the conservation of momentum applies in each direction independently. Hence, to solve collisions in multiple dimensions, we should write down the momentum conservation in each direction separately. To help understand collisions in multiple dimensions, consider an example.
A small car of mass 1,200 kg traveling east at 60 km/h collides at an intersection with a truck of mass 3,000 kg traveling due north at 40 km/h. The two vehicles are locked together. What is the...
4.5K
Relative Motion Analysis using Rotating Axes-Problem Solving01:29

Relative Motion Analysis using Rotating Axes-Problem Solving

469
Consider a crane whose telescopic boom rotates with an angular velocity of 0.04 rad/s and angular acceleration of 0.02 rad/s2. Along with the rotation, the boom also extends linearly with a uniform speed of 5 m/s. The extension of the boom is measured at point D, which is measured with respect to the fixed point C on the other end of the boom. For the given instant, the distance between points C and D is 60 meters.
Here, in order to determine the magnitude of velocity and acceleration for point...
469
Rolling Resistance: Problem Solving01:17

Rolling Resistance: Problem Solving

490
Rolling resistance, also known as rolling friction, is the force that resists the motion of a rolling object, such as a wheel, tire, or ball, when it moves over a surface. It is caused by the deformation of the object and the surface in contact with each other, as well as other factors like internal friction, hysteresis, and energy losses within the materials. Rolling resistance opposes the object's motion, requiring additional energy to overcome it and maintain movement. In practical...
490

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Generating Planar Trajectories for Neptunian System Exploration Using Motion Primitives.

The journal of the astronautical sciences·2026
Same author

Warming riverscapes annually challenge the role of thermal refuges for thermoregulating salmonids.

The Journal of applied ecology·2026
Same author

The production of the chemokine CCL2 by corneal sensory neurons initiates anti-viral immunity at the cornea and trigeminal ganglion.

Cell reports·2025
Same author

A database of life history parameters for Pacific coral reef fish.

Scientific data·2025
Same author

Measuring the Structure, Composition, and Change of Underwater Environments with Large-area Imaging.

Journal of visualized experiments : JoVE·2025
Same author

Motion Primitive Approach to Spacecraft Trajectory Design in a Multi-body System.

The journal of the astronautical sciences·2023
Same journal

Venus Flagship Mission Concept: A Decadal Survey Study.

IEEE Aerospace Conference. IEEE Aerospace Conference·2021
See all related articles

Related Experiment Video

Updated: Oct 6, 2025

Behavioral Training Procedures for Head-fixed Virtual Reality in Mice
06:27

Behavioral Training Procedures for Head-fixed Virtual Reality in Mice

Published on: September 6, 2024

1.4K

Exploring Transfers between Earth-Moon Halo Orbits via Multi-Objective Reinforcement Learning.

Christopher J Sullivan1, Natasha Bosanac1, Rodney L Anderson2

  • 1Colorado Center for Astrodynamics, Smead Aerospace Engineering Sciences, University of Colorado Boulder, 429 UCB, Boulder, CO 80303.

IEEE Aerospace Conference. IEEE Aerospace Conference
|January 14, 2022
PubMed
Summary
This summary is machine-generated.

Multi-Reward Proximal Policy Optimization trains spacecraft control schemes for efficient low-thrust trajectories between Earth-Moon orbits. This deep reinforcement learning method rapidly explores design options, balancing fuel use, flight time, and mission objectives.

More Related Videos

Simulating Imaging of Large Scale Radio Arrays on the Lunar Surface
06:14

Simulating Imaging of Large Scale Radio Arrays on the Lunar Surface

Published on: July 30, 2020

5.0K
The Double-H Maze: A Robust Behavioral Test for Learning and Memory in Rodents
09:01

The Double-H Maze: A Robust Behavioral Test for Learning and Memory in Rodents

Published on: July 8, 2015

12.7K

Related Experiment Videos

Last Updated: Oct 6, 2025

Behavioral Training Procedures for Head-fixed Virtual Reality in Mice
06:27

Behavioral Training Procedures for Head-fixed Virtual Reality in Mice

Published on: September 6, 2024

1.4K
Simulating Imaging of Large Scale Radio Arrays on the Lunar Surface
06:14

Simulating Imaging of Large Scale Radio Arrays on the Lunar Surface

Published on: July 30, 2020

5.0K
The Double-H Maze: A Robust Behavioral Test for Learning and Memory in Rodents
09:01

The Double-H Maze: A Robust Behavioral Test for Learning and Memory in Rodents

Published on: July 8, 2015

12.7K

Area of Science:

  • Aerospace Engineering
  • Astrodynamics
  • Artificial Intelligence

Background:

  • Designing low-thrust trajectories for spacecraft transfers, especially for SmallSats between libration point orbits, presents complex multi-objective challenges.
  • Traditional methods can be computationally intensive and may not efficiently explore the full design space.
  • Balancing propellant usage, flight time, and trajectory accuracy is critical for mission success.

Purpose of the Study:

  • To investigate the design space of low-thrust trajectories for a SmallSat in the Earth-Moon system using a multi-objective deep reinforcement learning algorithm.
  • To efficiently train multiple policies simultaneously for distinct trajectory design scenarios.
  • To autonomously construct the solution space for rapid insights into trade-offs.

Main Methods:

  • Application of Multi-Reward Proximal Policy Optimization (MrPPO), a multi-objective deep reinforcement learning algorithm.
  • Training multiple policies on three distinct trajectory design scenarios, each with a unique reward function.
  • Evaluation of policies on perturbed initial conditions to generate performance metrics like propellant mass usage and flight time.

Main Results:

  • Successfully trained unique control schemes for each trajectory design scenario and reward function.
  • Generated data on propellant mass usage, flight time, and state discontinuities for various low-thrust trajectories.
  • Examined a subset of the multi-objective trade space, revealing insights into transfer geometry and performance.

Conclusions:

  • MrPPO enables efficient exploration of the multi-objective trade space for SmallSat trajectory design.
  • The algorithm autonomously constructs solutions, providing rapid insights into propellant mass, flight time, and transfer geometry.
  • This approach accelerates the understanding of complex orbital transfer dynamics and design parameters.