Parameter-exploring policy gradients.

Frank Sehnke¹, Christian Osendorfer, Thomas Rückstiess

¹Faculty of Computer Science, Technische Universität München, Boltzmannstr. 3, 85748 Garching, Germany. sehnke@in.tum.de

Neural Networks: The Official Journal of the International Neural Network Society

January 12, 2010

Summary

This summary is machine-generated.

We developed a novel model-free reinforcement learning approach for partially observable problems. The method achieves lower-variance gradient estimates and outperforms existing algorithms in complex robotic control tasks.


Area of Science:

Robotics
Artificial Intelligence
Machine Learning

Background:

Partially observable Markov decision problems (POMDPs) present significant challenges for reinforcement learning.
Traditional policy gradient methods often suffer from high-variance gradient estimates.
Model-free approaches are desirable for real-world applications where system dynamics are unknown.

Purpose of the Study:

To introduce a novel model-free reinforcement learning method for POMDPs.
To reduce gradient variance compared to standard policy gradient techniques.
To demonstrate superior performance on complex control tasks.

Main Methods:

The likelihood gradient is estimated by sampling directly in parameter space.
A novel model-free reinforcement learning algorithm designed for POMDPs.
Comparative analysis against standard policy gradients, finite difference, and population-based methods.
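
The parameter-space sampling listed above can be illustrated with a minimal sketch. It assumes the symmetric-sampling update commonly given for this method: a perturbation eps is drawn from N(0, sigma^2), the policy is evaluated at mu + eps and mu - eps, and the mean and spread of the search distribution are adjusted using the two returns. The function names, the step sizes, the sigma_min floor, the moving-average baseline, and the quadratic toy_return (a stand-in for an episode return) are illustrative assumptions, not values taken from the paper. Refinements such as reward normalization are left out.

```python
import random


def symmetric_pgpe_step(mu, sigma, eps, r_plus, r_minus, baseline,
                        lr_mu=0.05, lr_sigma=0.02, sigma_min=1e-3):
    """Update the search distribution N(mu, sigma^2) from one symmetric sample pair.

    r_plus and r_minus are the returns obtained with parameters
    mu + eps and mu - eps. sigma_min is an assumed safety floor.
    """
    r_diff = (r_plus - r_minus) / 2.0   # drives the update of the mean
    r_mean = (r_plus + r_minus) / 2.0   # drives the update of the spread
    new_mu = [m + lr_mu * e * r_diff for m, e in zip(mu, eps)]
    new_sigma = [
        max(sigma_min, s + lr_sigma * (r_mean - baseline) * (e * e - s * s) / s)
        for s, e in zip(sigma, eps)
    ]
    return new_mu, new_sigma


def toy_return(theta, target=(1.0, -2.0)):
    # Hypothetical stand-in for an episode return; largest when theta == target.
    return -sum((t - g) ** 2 for t, g in zip(theta, target))


def optimize(steps=500, seed=0):
    rng = random.Random(seed)
    mu, sigma = [0.0, 0.0], [1.0, 1.0]
    baseline = toy_return(mu)  # assumed initialization of the baseline
    for _ in range(steps):
        # Sample one perturbation per parameter, then evaluate both mirror images.
        eps = [rng.gauss(0.0, s) for s in sigma]
        r_plus = toy_return([m + e for m, e in zip(mu, eps)])
        r_minus = toy_return([m - e for m, e in zip(mu, eps)])
        mu, sigma = symmetric_pgpe_step(mu, sigma, eps, r_plus, r_minus, baseline)
        # Moving-average baseline over the mean return of each pair.
        baseline = 0.9 * baseline + 0.1 * (r_plus + r_minus) / 2.0
    return mu, sigma


if __name__ == "__main__":
    print(optimize())
```

In this toy setting the mean should drift toward the target and the spread should shrink as the search narrows. The exact trajectory depends on the seed and the assumed step sizes.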

Main Results:

The proposed method yields lower-variance gradient estimates than standard policy gradient methods.
The method outperforms the compared algorithms on several complex control tasks, including robust standing with a humanoid robot.
Performance gains are maximized with symmetric parameter sampling.
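
The effect of symmetric sampling on estimator variance can be checked on a toy problem. The sketch below is not the paper's experiment. It assumes a one-parameter quadratic toy_return, a Gaussian search distribution with mu = 0 and sigma = 1, and a one-sided estimator without a baseline. Both estimators get the same budget of two return evaluations per estimate: the symmetric one spends them on mu + eps and mu - eps, the one-sided one on two independent samples.

```python
import random
import statistics


def toy_return(theta, target=2.0):
    # Hypothetical one-parameter return, maximal at theta == target.
    return -(theta - target) ** 2


def gradient_estimates(n=20000, mu=0.0, sigma=1.0, seed=0):
    """Return n one-sided and n symmetric estimates of d E[return] / d mu."""
    rng = random.Random(seed)
    one_sided, symmetric = [], []
    for _ in range(n):
        # Symmetric pair: evaluate mu + eps and mu - eps, use the return difference.
        eps = rng.gauss(0.0, sigma)
        diff = toy_return(mu + eps) - toy_return(mu - eps)
        symmetric.append(0.5 * diff * eps / sigma ** 2)
        # Same budget, spent on two independent one-sided likelihood-ratio samples.
        e1, e2 = rng.gauss(0.0, sigma), rng.gauss(0.0, sigma)
        g1 = toy_return(mu + e1) * e1 / sigma ** 2
        g2 = toy_return(mu + e2) * e2 / sigma ** 2
        one_sided.append(0.5 * (g1 + g2))
    return one_sided, symmetric


one_sided, symmetric = gradient_estimates()
# For this toy return the exact gradient at mu = 0 is -2 * (mu - target) = 4.
print("one-sided mean, variance:",
      statistics.mean(one_sided), statistics.variance(one_sided))
print("symmetric mean, variance:",
      statistics.mean(symmetric), statistics.variance(symmetric))
```

Both estimators average to about the same gradient, and the symmetric one shows the smaller variance on this toy problem. A well-chosen baseline also reduces the variance of the one-sided estimator, so the size of the gap here says nothing about the control tasks in the study.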

Conclusions:

The developed method offers a significant improvement for model-free reinforcement learning in POMDPs.
Symmetric sampling is a key factor in enhancing performance.
Component analysis validates the effectiveness of individual method elements.