Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Masking and Demasking Agents

Masking and Demasking Agents

EDTA titrations may necessitate masking and demasking agents to temporarily protect a particular metal ion in a mixture from the EDTA reaction. These agents facilitate the sequential analysis of the metal ions by forming stable complexes with some—but not all—metal ions during certain steps.
There are many masking agents, such as cyanide, fluoride, triethanolamine, thiourea, and 2,3-bis(sulfanyl)propan-1-ol (formerly 2,3-dimercapto-1-propanol), with the masking agent chosen based on...

Force Classification

Force Classification

Forces play a crucial role in the study of physics and engineering. They are essential in describing the motion, behavior, and equilibrium of objects in the physical world. Forces can be classified based on their origin, type, and direction of action.
Contact and non-contact forces are two of the most widely used categories of forces. As the name suggests, contact forces require physical contact between two objects to act upon each other. Examples of contact forces include frictional,...

Automatic Processing and Automatic Social Behavior

Automatic Processing and Automatic Social Behavior

Automatic processing refers to the cognitive operations that occur without conscious intent or awareness, playing a fundamental role in shaping social cognition and behavior. These processes enable individuals to navigate complex social environments efficiently by relying on mental shortcuts and pre-existing knowledge structures known as schemas. One of the most influential mechanisms underlying automatic processing is priming, which subtly activates mental representations through exposure to...

Multi-input and Multi-variable systems

Multi-input and Multi-variable systems

Cruise control systems in cars are designed as multi-input systems to maintain a driver's desired speed while compensating for external disturbances such as changes in terrain. The block diagram for a cruise control system typically includes two main inputs: the desired speed set by the driver and any external disturbances, such as the incline of the road. By adjusting the engine throttle, the system maintains the vehicle's speed as close to the desired value as possible.
In the absence of...

Facial Feedback Hypothesis

Facial Feedback Hypothesis

Charles Darwin proposed that facial expressions are an evolutionary adaptation for communication. He argued that these expressions are not influenced by culture but are universal across species. For example, a snarling expression with exposed teeth signals a threat in many animals, including humans. Darwin also suggested that displaying an emotion can intensify the feeling. Smiling, for example, could enhance one's sense of happiness. This idea laid the foundation for understanding the role...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Phase-shifting profilometry resistant to varying ambient light.

Optics express·2025

Same author

Quad-color dark-bright pulse trapping in a fiber laser.

Optics express·2025

Same author

Correction to "Engineered Platelet Microparticle-Membrane Camouflaged Nanoparticles for Targeting the Golgi Apparatus of Synovial Fibroblasts to Attenuate Rheumatoid Arthritis".

ACS nano·2025

Same author

Plasma lipoprotein subclasses and risk of incident knee osteoarthritis: A population-based cohort study.

Osteoarthritis and cartilage·2025

Same author

Investigation into the Efficient Cooperative Planning Approach for Dual-Arm Picking Sequences of Dwarf, High-Density Safflowers.

Sensors (Basel, Switzerland)·2025

Same author

Association of a Healthy Lifestyle With All-Cause and Cause-Specific Mortality Among Individuals With Probable Sarcopenia: Population-Based Cohort Study.

JMIR aging·2025

Same journal

Multiphysics Investigation on Thermal Characteristics of Internal Bio-Inspired V-Ribbed Cooling Channels for Outer Rotor PMSM.

Biomimetics (Basel, Switzerland)·2026

Same journal

Smart Logistics Model for Supply Chain Management via Brain-Inspired Geometric Deep Networks.

Biomimetics (Basel, Switzerland)·2026

Same journal

A Systematic Taxonomy of the Sunflower Optimization Algorithm: Variants, Hybridization Strategies, Applications, and Research Directions.

Biomimetics (Basel, Switzerland)·2026

Same journal

Toward a Compositional Theory of Trust in Embodied Intelligence: A QNLP Framework for Modeling Context, Interaction, and Trustworthiness.

Biomimetics (Basel, Switzerland)·2026

Same journal

Empirical Logic for Bio-Inspired Soft Computing: Illustrative Applications in Control Engineering and Cluster Analysis.

Biomimetics (Basel, Switzerland)·2026

Same journal

A Modified Multi-Strategy Dhole Optimization Algorithm and Its Engineering Applications.

Biomimetics (Basel, Switzerland)·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jan 16, 2026

Asthma Detection Research Based on Voice Signal Processing and Machine Learning

Asthma Detection Research Based on Voice Signal Processing and Machine Learning

Published on: July 22, 2025

Self-Supervised Voice Denoising Network for Multi-Scenario Human-Robot Interaction.

Mu Li¹, Wenjin Xu¹, Chao Zeng²

¹Key Laboratory of Autonomous Systems and Networked Control, Ministry of Education, College of Automation Science and Engineering, South China University of Technology, Guangzhou 510640, China.

Biomimetics (Basel, Switzerland)

|September 26, 2025

Summary

This summary is machine-generated.

This study introduces a novel method to improve voice command recognition for robots in noisy environments. By using synthetic data and a self-supervised denoising network, the system achieves higher accuracy in real-world applications.

Keywords:

data synthesis human–robot interaction self-supervised learning voice denoising

More Related Videos

Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception

Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception

Published on: August 9, 2024

SSVEP-based Experimental Procedure for Brain-Robot Interaction with Humanoid Robots

SSVEP-based Experimental Procedure for Brain-Robot Interaction with Humanoid Robots

Published on: November 24, 2015

Related Experiment Videos

Last Updated: Jan 16, 2026

Asthma Detection Research Based on Voice Signal Processing and Machine Learning

Asthma Detection Research Based on Voice Signal Processing and Machine Learning

Published on: July 22, 2025

Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception

Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception

Published on: August 9, 2024

SSVEP-based Experimental Procedure for Brain-Robot Interaction with Humanoid Robots

SSVEP-based Experimental Procedure for Brain-Robot Interaction with Humanoid Robots

Published on: November 24, 2015

Area of Science:

Robotics
Artificial Intelligence
Speech Processing

Background:

Human-robot interaction (HRI) using voice commands has advanced with Vision-Language-Action (VLA) models.
Current VLA systems struggle with environmental noise and overlapping speech in multi-speaker scenarios.
A specialized denoising network is needed for robust voice command isolation.

Purpose of the Study:

To enhance voice command-based HRI in noisy environments.
To improve the performance of self-supervised denoising networks for mixed-noise audio.
To increase the real-world applicability of voice-guided robot control.

Main Methods:

Leveraging synthetic data to train a self-supervised denoising network.
Scaling training data to improve network performance in denoising mixed-noise audio.
Developing a method to enhance voice command recognition in challenging acoustic conditions.

Main Results:

The proposed method outperforms existing approaches in simulated environments.
Achieved 7.5% higher accuracy compared to the state-of-the-art in noisy real-world environments.
Demonstrated enhanced voice-guided robot control in practical settings.

Conclusions:

The developed approach effectively enhances voice command recognition for HRI in noisy conditions.
Synthetic data and self-supervised learning are crucial for improving denoising network performance.
The method offers a significant advancement for robust voice-guided robot control.