Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

First Pass Effect01:12

First Pass Effect

7.8K
Presystemic elimination, or the first-pass effect, is the metabolism of drugs that reduces their effective concentration at the site of action. Apart from the first-pass effect, the systemic bioavailability of the drug is also reduced by other factors, including incomplete absorption or chemical degradation of drugs.
Depending on the route of administration, drugs can be metabolized in the liver, intestine, lungs, and vasculature. Orally administered drugs are first absorbed through the...
7.8K
Adjusting a Traverse01:12

Adjusting a Traverse

233
In the site survey of a four-sided traverse, internal angles are essential to ensure geometric accuracy. The survey revealed that the sum of the measured internal angles was 359 degrees and 48 minutes, which is 12 minutes less than the expected 360 degrees. This discrepancy signals an error likely arising from measurement inaccuracies during the fieldwork.To rectify this error, the adjustment process involved distributing the 12-minute shortfall equally across the four internal angles. By...
233
Crossover Experiments01:16

Crossover Experiments

4.3K
Crossover experiments, also called the repeated-measurements design, is a study design in which all experimental units are exposed to all treatments in different periods. Crossover experiments are generally used in psychology, the pharmaceutical industry, agriculture, and medicine.
Crossover designs are performed even with smaller sample sizes since the samples can act as their controls. These are better than simple randomized trials since patients are exposed to all the treatments.
4.3K
Reflex Activity01:08

Reflex Activity

2.5K
A reflex activity is an automatic, involuntary response to specific stimuli. It is a part of our survival mechanism, designed to protect us from potential harm. For example, when a bright light suddenly shines into our eyes, we instinctively close them or look away. This is a simple reflex activity orchestrated by the nervous system without conscious thought or effort.
A reflex exam is a diagnostic procedure performed by a healthcare professional to evaluate the functionality of a patient's...
2.5K
Introspection01:29

Introspection

96
Introspection, long upheld as a reliable route to self-knowledge, involves examining one's thoughts, emotions, and mental processes. It underpins many psychological practices, from mindfulness meditation to psychotherapy and self-help strategies. However, empirical evidence challenges the accuracy of introspection as a means of understanding oneself.Limitations of Introspective InsightSeminal work by Nisbett and Wilson demonstrated that individuals are frequently unaware of the true causes...
96
Homologous Recombination02:31

Homologous Recombination

5.5K
5.5K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Towards end-to-end automation of AI research.

Nature·2026
Same author

AI discovers learning algorithm that outperforms those designed by humans.

Nature·2025
Same author

Automating the Search for Artificial Life With Foundation Models.

Artificial life·2025
Same author

Managing extreme AI risks amid rapid progress.

Science (New York, N.Y.)·2024
Same author

Environmental, individual and social traits of free-ranging raccoons influence performance in cognitive testing.

The Journal of experimental biology·2022
Same author

Evolving Multimodal Robot Behavior via Many Stepping Stones with the Combinatorial Multiobjective Evolutionary Algorithm.

Evolutionary computation·2021
Same journal

Retraction Note: NSD2 targeting reverses plasticity and drug resistance in prostate cancer.

Nature·2026
Same journal

Enhanced B cell priming induces broadly neutralizing HIV-1 apex antibodies.

Nature·2026
Same journal

Vaccination elicits HIV broadly neutralizing antibodies in primates.

Nature·2026
Same journal

Child online safety needs more than social-media bans.

Nature·2026
Same journal

Ebola preparedness must start with ecosystems and before humans show symptoms.

Nature·2026
Same journal

AI tools can speed up thinking, but evidence still comes from the lab bench.

Nature·2026
See all related articles

Related Experiment Video

Updated: Nov 16, 2025

Author Spotlight: Investigating the Effects of Mind-Body-Movement Practices on Brain Function
06:17

Author Spotlight: Investigating the Effects of Mind-Body-Movement Practices on Brain Function

Published on: January 26, 2024

2.4K

First return, then explore.

Adrien Ecoffet1,2, Joost Huizinga3,4, Joel Lehman5,6

  • 1Uber AI Labs, San Francisco, CA, USA. adrienecoffet@gmail.com.

Nature
|February 25, 2021
PubMed
Summary
This summary is machine-generated.

Go-Explore algorithms improve reinforcement learning by remembering states and returning to them before exploring. This approach solves previously unsolved games and advances AI exploration capabilities.

More Related Videos

Novel Object Exploration as a Potential Assay for Higher Order Repetitive Behaviors in Mice
08:28

Novel Object Exploration as a Potential Assay for Higher Order Repetitive Behaviors in Mice

Published on: August 20, 2016

8.7K
Examining Recall Memory in Infancy and Early Childhood Using the Elicited Imitation Paradigm
06:35

Examining Recall Memory in Infancy and Early Childhood Using the Elicited Imitation Paradigm

Published on: April 28, 2016

34.8K

Related Experiment Videos

Last Updated: Nov 16, 2025

Author Spotlight: Investigating the Effects of Mind-Body-Movement Practices on Brain Function
06:17

Author Spotlight: Investigating the Effects of Mind-Body-Movement Practices on Brain Function

Published on: January 26, 2024

2.4K
Novel Object Exploration as a Potential Assay for Higher Order Repetitive Behaviors in Mice
08:28

Novel Object Exploration as a Potential Assay for Higher Order Repetitive Behaviors in Mice

Published on: August 20, 2016

8.7K
Examining Recall Memory in Infancy and Early Childhood Using the Elicited Imitation Paradigm
06:35

Examining Recall Memory in Infancy and Early Childhood Using the Elicited Imitation Paradigm

Published on: April 28, 2016

34.8K

Area of Science:

  • Artificial Intelligence
  • Machine Learning
  • Robotics

Background:

  • Reinforcement learning (RL) aims for autonomous decision-making but struggles with sparse or deceptive reward signals.
  • Effective environment exploration is crucial for RL but remains a significant challenge.
  • Existing RL algorithms often forget previously visited states or fail to revisit them before exploring new ones.

Purpose of the Study:

  • To address the challenges of detachment and derailment in reinforcement learning exploration.
  • To introduce a novel algorithm, Go-Explore, designed for more effective environment exploration.
  • To demonstrate the efficacy of Go-Explore on complex games and robotics tasks.

Main Methods:

  • Introduced Go-Explore, an algorithm family emphasizing remembering promising states and returning to them before exploration.
  • Applied Go-Explore to previously unsolved Atari games and hard-exploration benchmarks.
  • Tested Go-Explore on a sparse-reward pick-and-place robotics task.
  • Integrated goal-conditioned policies to further enhance exploration efficiency and handle stochasticity.

Main Results:

  • Go-Explore successfully solved all previously unsolved Atari games.
  • Achieved state-of-the-art performance on hard-exploration games, with significant improvements on Montezuma's Revenge and Pitfall.
  • Demonstrated practical applicability on a challenging sparse-reward robotics task.
  • Goal-conditioned policies enhanced Go-Explore's exploration efficiency and robustness to stochasticity.

Conclusions:

  • The principles of remembering, returning to, and exploring from states are a powerful and general approach to RL exploration.
  • Go-Explore offers substantial performance gains, suggesting a critical pathway toward more intelligent learning agents.
  • The findings highlight the importance of structured exploration strategies in overcoming RL limitations.