Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Improving Translational Accuracy02:07

Improving Translational Accuracy

2.5K
2.5K
Perceiving Loudness, Pitch, and Location01:21

Perceiving Loudness, Pitch, and Location

177
The human brain perceives pitch through two primary mechanisms reflected in place theory and frequency theory. Each mechanism describes how sound waves are interpreted as specific pitches by the brain, offering insights into the intricate processes of auditory perception.
Place theory, or place coding, suggests that different pitches are heard because various sound waves activate specific locations along the cochlea's basilar membrane. The brain determines the pitch of a sound by...
177
Force Classification01:22

Force Classification

1.1K
Forces play a crucial role in the study of physics and engineering. They are essential in describing the motion, behavior, and equilibrium of objects in the physical world. Forces can be classified based on their origin, type, and direction of action.
Contact and non-contact forces are two of the most widely used categories of forces. As the name suggests, contact forces require physical contact between two objects to act upon each other. Examples of contact forces include frictional,...
1.1K
Masking and Demasking Agents01:19

Masking and Demasking Agents

2.3K
EDTA titrations may necessitate masking and demasking agents to temporarily protect a particular metal ion in a mixture from the EDTA reaction. These agents facilitate the sequential analysis of the metal ions by forming stable complexes with some—but not all—metal ions during certain steps.
There are many masking agents, such as cyanide, fluoride, triethanolamine, thiourea, and 2,3-bis(sulfanyl)propan-1-ol (formerly 2,3-dimercapto-1-propanol), with the masking agent chosen based on...
2.3K
Amplifying Signals via Enzymatic Cascade01:22

Amplifying Signals via Enzymatic Cascade

8.2K
When a ligand binds to a cell-surface receptor, the receptor's intracellular domain changes shape, which may either activate its enzyme function or allow its binding to other molecules. The initial signal is amplified by most signal transduction pathways. This means that a single ligand molecule can activate multiple molecules of a downstream target. Proteins that relay a signal are most commonly phosphorylated at one or more sites, activating or inactivating the protein. Kinases catalyze...
8.2K
Generalization, Discrimination, and Extinction01:24

Generalization, Discrimination, and Extinction

392
Generalization, discrimination, and extinction are key concepts in operant conditioning that influence how behaviors are learned and maintained.
Generalization occurs when a behavior reinforced in one context is performed in similar situations. For instance, a student who studies diligently for calculus and receives excellent grades might apply the same study habits to psychology and history, expecting similar results. Generalization shows how learning in one setting can influence behavior in...
392

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Object Detection and Recognition Based on Deep Learning.

Sensors (Basel, Switzerland)·2026
Same author

Personalizing Seizure Detection for Individual Patients by Optimal Selection of EEG Signals.

Sensors (Basel, Switzerland)·2025
Same author

An Experimental Evaluation of Smart Sensors for Pedestrian Attribute Recognition Using Multi-Task Learning and Vision Language Models.

Sensors (Basel, Switzerland)·2025
Same author

Automating parasite egg detection: insights from the first AI-KFM challenge.

Frontiers in artificial intelligence·2024
Same author

Assessment of Noninferiority Margins in Cardiovascular Medicine Trials.

JACC. Advances·2024
Same author

Endoscopic ear surgery in the treatment of chronic otitis media with atelectasis.

European archives of oto-rhino-laryngology : official journal of the European Federation of Oto-Rhino-Laryngological Societies (EUFOS) : affiliated with the German Society for Oto-Rhino-Laryngology - Head and Neck Surgery·2024
Same journal

Clinical crown height changes in mandibular anterior teeth retained with two types of fixed retainers over two years: findings from a randomized clinical trial.

Scientific reports·2026
Same journal

Rethinking water governance through indigenous systems: A comparative assessment of qanat and well irrigation productivity in Sabzevar County, Iran.

Scientific reports·2026
Same journal

Distributed Nash equilibrium seeking for second-order systems with finite/fixed-time convergence in the absence of velocity measurement.

Scientific reports·2026
Same journal

Determinants of pregnancy termination among ever-married women of reproductive age in Bangladesh.

Scientific reports·2026
Same journal

Occurrence and human health risk assessment of organochlorine pesticides in irrigated and non-irrigated agricultural soils of Wondogenet District, Ethiopia.

Scientific reports·2026
Same journal

High angular resolution diffusion imaging of neurodevelopment in children through data creation with deep learning.

Scientific reports·2026
See all related articles

Related Experiment Video

Updated: May 23, 2025

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

474

Context-aware data augmentation for enhanced speech command recognition in industrial environments.

Giuseppe De Simone1, Antonio Greco2, Francesco Rosa1

  • 1University of Salerno, Fisciano, 84084, Italy.

Scientific Reports
|May 20, 2025
PubMed
Summary
This summary is machine-generated.

This study introduces a robust speech command recognition system for Industry 4.0. The framework enhances accuracy and noise rejection, improving human-robot interaction on production lines.

More Related Videos

Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception
05:48

Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception

Published on: August 9, 2024

1.4K
Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody
09:09

Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody

Published on: September 27, 2024

385

Related Experiment Videos

Last Updated: May 23, 2025

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

474
Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception
05:48

Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception

Published on: August 9, 2024

1.4K
Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody
09:09

Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody

Published on: September 27, 2024

385

Area of Science:

  • Human-Robot Interaction
  • Industrial Automation
  • Speech Technology

Background:

  • Speech is a key communication channel in Human-Robot Interaction (HRI).
  • Industry 4.0 environments require efficient and accurate speech command recognition.
  • Challenges include balancing command accuracy with noise and irrelevant speech rejection.

Purpose of the Study:

  • To propose a modular framework for optimizing speech command recognition accuracy and robustness in industrial settings.
  • To minimize the need for extensive industrial datasets.
  • To enhance reliability in noisy production line environments.

Main Methods:

  • Developed an efficient Command Recognition module using laboratory data augmented with synthetic samples.
  • Implemented context-aware data augmentation and dynamic noise injection for robustness.
  • Integrated a Keyword Spotting module to activate recognition upon detecting a predefined keyword.

Main Results:

  • The system demonstrated high recall rates for both command recognition and noise rejection.
  • Evaluated using real-world samples from a noisy industrial setting.
  • Confirmed effectiveness in demanding industrial applications.

Conclusions:

  • The proposed modular framework effectively addresses the challenges of speech command recognition in Industry 4.0.
  • The system achieves high performance in accuracy and noise rejection.
  • This approach enhances productivity and efficiency on industrial production lines.