Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Masking and Demasking Agents01:19

Masking and Demasking Agents

3.4K
EDTA titrations may necessitate masking and demasking agents to temporarily protect a particular metal ion in a mixture from the EDTA reaction. These agents facilitate the sequential analysis of the metal ions by forming stable complexes with some—but not all—metal ions during certain steps.
There are many masking agents, such as cyanide, fluoride, triethanolamine, thiourea, and 2,3-bis(sulfanyl)propan-1-ol (formerly 2,3-dimercapto-1-propanol), with the masking agent chosen based on...
3.4K
Force Classification01:22

Force Classification

2.3K
Forces play a crucial role in the study of physics and engineering. They are essential in describing the motion, behavior, and equilibrium of objects in the physical world. Forces can be classified based on their origin, type, and direction of action.
Contact and non-contact forces are two of the most widely used categories of forces. As the name suggests, contact forces require physical contact between two objects to act upon each other. Examples of contact forces include frictional,...
2.3K
Automatic Processing and Automatic Social Behavior01:28

Automatic Processing and Automatic Social Behavior

215
Automatic processing refers to the cognitive operations that occur without conscious intent or awareness, playing a fundamental role in shaping social cognition and behavior. These processes enable individuals to navigate complex social environments efficiently by relying on mental shortcuts and pre-existing knowledge structures known as schemas. One of the most influential mechanisms underlying automatic processing is priming, which subtly activates mental representations through exposure to...
215
Multi-input and Multi-variable systems01:22

Multi-input and Multi-variable systems

392
Cruise control systems in cars are designed as multi-input systems to maintain a driver's desired speed while compensating for external disturbances such as changes in terrain. The block diagram for a cruise control system typically includes two main inputs: the desired speed set by the driver and any external disturbances, such as the incline of the road. By adjusting the engine throttle, the system maintains the vehicle's speed as close to the desired value as possible.
In the absence of...
392
Facial Feedback Hypothesis01:24

Facial Feedback Hypothesis

588
Charles Darwin proposed that facial expressions are an evolutionary adaptation for communication. He argued that these expressions are not influenced by culture but are universal across species. For example, a snarling expression with exposed teeth signals a threat in many animals, including humans. Darwin also suggested that displaying an emotion can intensify the feeling. Smiling, for example, could enhance one's sense of happiness. This idea laid the foundation for understanding the role...
588

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Phase-shifting profilometry resistant to varying ambient light.

Optics express·2025
Same author

Quad-color dark-bright pulse trapping in a fiber laser.

Optics express·2025
Same author

Correction to "Engineered Platelet Microparticle-Membrane Camouflaged Nanoparticles for Targeting the Golgi Apparatus of Synovial Fibroblasts to Attenuate Rheumatoid Arthritis".

ACS nano·2025
Same author

Plasma lipoprotein subclasses and risk of incident knee osteoarthritis: A population-based cohort study.

Osteoarthritis and cartilage·2025
Same author

Investigation into the Efficient Cooperative Planning Approach for Dual-Arm Picking Sequences of Dwarf, High-Density Safflowers.

Sensors (Basel, Switzerland)·2025
Same author

Association of a Healthy Lifestyle With All-Cause and Cause-Specific Mortality Among Individuals With Probable Sarcopenia: Population-Based Cohort Study.

JMIR aging·2025
Same journal

Multiphysics Investigation on Thermal Characteristics of Internal Bio-Inspired V-Ribbed Cooling Channels for Outer Rotor PMSM.

Biomimetics (Basel, Switzerland)·2026
Same journal

Smart Logistics Model for Supply Chain Management via Brain-Inspired Geometric Deep Networks.

Biomimetics (Basel, Switzerland)·2026
Same journal

A Systematic Taxonomy of the Sunflower Optimization Algorithm: Variants, Hybridization Strategies, Applications, and Research Directions.

Biomimetics (Basel, Switzerland)·2026
Same journal

Toward a Compositional Theory of Trust in Embodied Intelligence: A QNLP Framework for Modeling Context, Interaction, and Trustworthiness.

Biomimetics (Basel, Switzerland)·2026
Same journal

Empirical Logic for Bio-Inspired Soft Computing: Illustrative Applications in Control Engineering and Cluster Analysis.

Biomimetics (Basel, Switzerland)·2026
Same journal

A Modified Multi-Strategy Dhole Optimization Algorithm and Its Engineering Applications.

Biomimetics (Basel, Switzerland)·2026
See all related articles

Related Experiment Video

Updated: Jan 16, 2026

Asthma Detection Research Based on Voice Signal Processing and Machine Learning
04:04

Asthma Detection Research Based on Voice Signal Processing and Machine Learning

Published on: July 22, 2025

930

Self-Supervised Voice Denoising Network for Multi-Scenario Human-Robot Interaction.

Mu Li1, Wenjin Xu1, Chao Zeng2

  • 1Key Laboratory of Autonomous Systems and Networked Control, Ministry of Education, College of Automation Science and Engineering, South China University of Technology, Guangzhou 510640, China.

Biomimetics (Basel, Switzerland)
|September 26, 2025
PubMed
Summary
This summary is machine-generated.

This study introduces a novel method to improve voice command recognition for robots in noisy environments. By using synthetic data and a self-supervised denoising network, the system achieves higher accuracy in real-world applications.

Keywords:
data synthesishuman–robot interactionself-supervised learningvoice denoising

More Related Videos

Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception
05:48

Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception

Published on: August 9, 2024

2.0K
SSVEP-based Experimental Procedure for Brain-Robot Interaction with Humanoid Robots
11:01

SSVEP-based Experimental Procedure for Brain-Robot Interaction with Humanoid Robots

Published on: November 24, 2015

13.7K

Related Experiment Videos

Last Updated: Jan 16, 2026

Asthma Detection Research Based on Voice Signal Processing and Machine Learning
04:04

Asthma Detection Research Based on Voice Signal Processing and Machine Learning

Published on: July 22, 2025

930
Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception
05:48

Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception

Published on: August 9, 2024

2.0K
SSVEP-based Experimental Procedure for Brain-Robot Interaction with Humanoid Robots
11:01

SSVEP-based Experimental Procedure for Brain-Robot Interaction with Humanoid Robots

Published on: November 24, 2015

13.7K

Area of Science:

  • Robotics
  • Artificial Intelligence
  • Speech Processing

Background:

  • Human-robot interaction (HRI) using voice commands has advanced with Vision-Language-Action (VLA) models.
  • Current VLA systems struggle with environmental noise and overlapping speech in multi-speaker scenarios.
  • A specialized denoising network is needed for robust voice command isolation.

Purpose of the Study:

  • To enhance voice command-based HRI in noisy environments.
  • To improve the performance of self-supervised denoising networks for mixed-noise audio.
  • To increase the real-world applicability of voice-guided robot control.

Main Methods:

  • Leveraging synthetic data to train a self-supervised denoising network.
  • Scaling training data to improve network performance in denoising mixed-noise audio.
  • Developing a method to enhance voice command recognition in challenging acoustic conditions.

Main Results:

  • The proposed method outperforms existing approaches in simulated environments.
  • Achieved 7.5% higher accuracy compared to the state-of-the-art in noisy real-world environments.
  • Demonstrated enhanced voice-guided robot control in practical settings.

Conclusions:

  • The developed approach effectively enhances voice command recognition for HRI in noisy conditions.
  • Synthetic data and self-supervised learning are crucial for improving denoising network performance.
  • The method offers a significant advancement for robust voice-guided robot control.