Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Significance of the Gradient Vector01:27

Significance of the Gradient Vector

A surface defined by a function of two variables can be understood by examining how it changes along specific directions. When one variable is held constant, the surface reduces to a curve that reflects variation in the other variable. For example, fixing one variable and moving parallel to a coordinate axis produces a cross-sectional curve. The slope of this curve at a given point represents how the function changes in that particular direction, providing a measure of local steepness.By...
Lagrange Multipliers: Two Constraints01:28

Lagrange Multipliers: Two Constraints

The method of Lagrange multipliers with two constraints is used to optimize a function subject to two independent constraints. In many applications, the objective function represents a quantity to be maximized or minimized, such as cost, area, distance, or energy. The two constraints represent requirements that the solution must satisfy, such as fixed volume, limited resources, or prescribed dimensions.For a function of three variables, each constraint forms a surface in three-dimensional space.
Gradient and Del Operator01:14

Gradient and Del Operator

In mathematics and physics, the gradient and del operator are fundamental concepts used to describe the behavior of functions and fields in space. The gradient is a mathematical operator that gives both the magnitude and direction of the maximum spatial rate of change. Consider a person standing on a mountain. The slope of the mountain at any given point is not defined unless it is quantified in a particular direction. For this reason, a "directional derivative" is defined, which is a vector...
Limits to Natural Selection01:38

Limits to Natural Selection

Organisms that are well-adapted to their environment are more likely to survive and reproduce. However, natural selection does not lead to perfectly adapted organisms. Several factors constrain natural selection.For one, natural selection can only act upon existing genetic variation. Hypothetically, redtusks may enhance elephant survival by deterring ivory-seeking poachers. However, if there are no gene variants—or alleles—for redtusks, natural selection cannot increase the prevalence of...
Randomized Experiments01:13

Randomized Experiments

The randomization process involves assigning study participants randomly to experimental or control groups based on their probability of being equally assigned. Randomization is meant to eliminate selection bias and balance known and unknown confounding factors so that the control group is similar to the treatment group as much as possible. A computer program and a random number generator can be used to assign participants to groups in a way that minimizes bias.
Simple randomization
Simple...
Methods of Medium Optimization01:28

Methods of Medium Optimization

Optimizing growth media enhances microbial proliferation and maximizes product yield. Statistical experimental design methodologies provide structured and reproducible approaches, offering progressively higher levels of robustness and efficiency.The One-Factor-at-a-Time (OFAT) MethodThe One-Factor-at-a-Time (OFAT) method involves adjusting a single variable while keeping all others constant. However, it cannot detect interactions between variables, often leading to suboptimal outcomes when...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Catecholamine precursor modulation of human exploration: Evidence from a large gender-balanced sample.

PLoS computational biology·2026
Same author

The earlier you know, the smoother you act: anticipatory control in solo and dyadic juggling.

Experimental brain research·2026
Same author

Exploration Strategies and Feature Prioritisation in Contour-based Haptic Perception of 2D Shape.

IEEE transactions on haptics·2026
Same author

Open science practices in behavioral addictions: An exploratory survey.

Journal of behavioral addictions·2026
Same author

[Use of continuous passive motion in inpatient rehabilitation after shoulder replacement-a retrospective study].

Orthopadie (Heidelberg, Germany)·2026
Same author

Hoffa-Kastert Syndrome: A Rare Cause of Acute Knee Blockade.

Indian journal of orthopaedics·2025
Same journal

Exploiting audio-visual modalities in videos: Object detection via multi-stage bilateral coupling network.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Reliability-aware modality completion with cross-modal distillation for federated learning with missing modalities.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

IGFD-Net: Illumination-guided frequency decoupling for polarization image fusion.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Multiple-Strategies dung beetle optimizer and its applications in engineering optimization and bankruptcy prediction.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Aggregating global-scale pixel-wise forgery cues within a graph.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Finite-Time intermittent control for secure synchronization of Neutral-Type stochastic delayed neural networks under aperiodic DoS attacks.

Neural networks : the official journal of the International Neural Network Society·2026
See all related articles

Related Experiment Video

Updated: Jun 17, 2026

Probing the Limits of Egg Recognition Using Egg Rejection Experiments Along Phenotypic Gradients
07:34

Probing the Limits of Egg Recognition Using Egg Rejection Experiments Along Phenotypic Gradients

Published on: August 22, 2018

Parameter-exploring policy gradients.

Frank Sehnke1, Christian Osendorfer, Thomas Rückstiess

  • 1Faculty of Computer Science, Technische Universität München, Boltzmannstr.3, 85748 Garching, Germany. sehnke@in.tum.de

Neural Networks : the Official Journal of the International Neural Network Society
|January 12, 2010
PubMed
Summary
This summary is machine-generated.

We developed a novel model-free reinforcement learning approach for partially observable problems. This method achieves lower variance gradient estimates, outperforming existing algorithms in complex robotic control tasks.

Related Experiment Videos

Last Updated: Jun 17, 2026

Probing the Limits of Egg Recognition Using Egg Rejection Experiments Along Phenotypic Gradients
07:34

Probing the Limits of Egg Recognition Using Egg Rejection Experiments Along Phenotypic Gradients

Published on: August 22, 2018

Area of Science:

  • Robotics
  • Artificial Intelligence
  • Machine Learning

Background:

  • Partially observable Markov decision problems (POMDPs) present significant challenges for reinforcement learning.
  • Traditional policy gradient methods often suffer from high variance gradient estimates.
  • Model-free approaches are desirable for real-world applications where system dynamics are unknown.

Purpose of the Study:

  • To introduce a novel model-free reinforcement learning method for POMDPs.
  • To reduce gradient variance compared to standard policy gradient techniques.
  • To demonstrate superior performance on complex control tasks.

Main Methods:

  • Parameter space sampling for likelihood gradient estimation.
  • A novel model-free reinforcement learning algorithm designed for POMDPs.
  • Comparative analysis against standard policy gradients, finite difference, and population-based methods.

Main Results:

  • The proposed method yields lower variance gradient estimates.
  • Outperformance observed in complex control tasks, including humanoid robot locomotion.
  • Performance gains are maximized with symmetric parameter sampling.

Conclusions:

  • The developed method offers a significant improvement for model-free reinforcement learning in POMDPs.
  • Symmetric sampling is a key factor in enhancing performance.
  • Component analysis validates the effectiveness of individual method elements.