Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Control Subarea Division for Coordinated Signal Control: A Colored Random Walk and Path Entropy Approach to Traffic-State Propagation.

Entropy (Basel, Switzerland)·2026
Same author

Neuroinflammatory Mechanisms in Depression: From Biomarkers to Anti-Inflammatory Therapy.

Brain sciences·2026
Same author

An Overview of the Research Status and Advances in Precision Feeding Technology and Equipment in Aquaculture.

Animals : an open access journal from MDPI·2026
Same author

Study on the characteristics analysis and recognition method of vowels in patients with type â…¡ diabetes.

Frontiers in digital health·2026
Same author

Lens power and associated factors affect ocular response to highly aspherical lenslets (HAL) in myopic children.

Graefe's archive for clinical and experimental ophthalmology = Albrecht von Graefes Archiv fur klinische und experimentelle Ophthalmologie·2026
Same author

Development and validation of a machine learning-based diagnostic system for 22 pediatric respiratory pathogens: a large-scale multicenter study.

NPJ digital medicine·2026
Same journal

Research on a Regional Availability Evaluation Model for Road-Area High-Entropy Energy Based on Synergy Factors.

Entropy (Basel, Switzerland)·2026
Same journal

Atmospheric Turbulence Channel Modeling and Performance Analysis of a CO-ZP-OFDM Coherent Optical Communication System for UAV Air-to-Ground Scenarios.

Entropy (Basel, Switzerland)·2026
Same journal

Information Geometry and Asymptotic Theory for SMML Estimators.

Entropy (Basel, Switzerland)·2026
Same journal

Correlation Entropy and Power-Law Kinetics.

Entropy (Basel, Switzerland)·2026
Same journal

Research on the Contagion of Systemic Financial Risk Under the Impact of Climate Risks-From the Perspective of Complex Networks and Machine Learning.

Entropy (Basel, Switzerland)·2026
Same journal

The Statistical-Mechanical Meaning of the Wave Function of Quantum Mechanics.

Entropy (Basel, Switzerland)·2026
See all related articles

Related Experiment Video

Updated: Sep 18, 2025

Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception
05:48

Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception

Published on: August 9, 2024

1.6K

Analysis and Research on Spectrogram-Based Emotional Speech Signal Augmentation Algorithm.

Huawei Tao1,2,3, Sixian Li1,2, Xuemei Wang1,2

  • 1Key Laboratory of Grain Information Processing and Control, Henan University of Technology, Ministry of Education, Zhengzhou 450001, China.

Entropy (Basel, Switzerland)
|June 26, 2025
PubMed
Summary
This summary is machine-generated.

Data augmentation in speech emotion recognition can harm performance if not chosen carefully. Reverberation and resampling are effective, boosting accuracy by up to 7.1% without distorting emotional labels.

Keywords:
cross-entropy lossdata augmentationspectrogramspeech emotion recognition

More Related Videos

Computer-based Multitaper Spectrogram Program for Electroencephalographic Data
04:13

Computer-based Multitaper Spectrogram Program for Electroencephalographic Data

Published on: November 13, 2019

12.3K
Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody
09:09

Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody

Published on: September 27, 2024

541

Related Experiment Videos

Last Updated: Sep 18, 2025

Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception
05:48

Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception

Published on: August 9, 2024

1.6K
Computer-based Multitaper Spectrogram Program for Electroencephalographic Data
04:13

Computer-based Multitaper Spectrogram Program for Electroencephalographic Data

Published on: November 13, 2019

12.3K
Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody
09:09

Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody

Published on: September 27, 2024

541

Area of Science:

  • Speech processing
  • Machine learning
  • Affective computing

Background:

  • Data augmentation is crucial for improving speech emotion recognition (SER) models.
  • The impact of augmentation on emotional speech data remains underexplored, risking label distortion and performance degradation.

Purpose of the Study:

  • To systematically evaluate common data augmentation techniques' influence on SER.
  • To identify augmentation methods that enhance data diversity without compromising emotional integrity.

Main Methods:

  • Subjective auditory experiments to assess emotional expression changes.
  • Multi-dimensional feature extraction from spectrograms and heatmap visualization.
  • Objective evaluation using cross-entropy loss and statistical significance testing.

Main Results:

  • Time stretching significantly distorts speech features, negatively impacting emotion recognition accuracy.
  • "Reverberation" (RIR) and "resampling" showed minimal impact on emotional labels.
  • Combining reverberation and resampling improved model accuracy by up to 7.1%.

Conclusions:

  • Careful selection of data augmentation is vital for effective SER.
  • Reverberation and resampling are promising techniques for enhancing SER datasets.
  • Findings provide a foundation for optimizing augmentation strategies in SER.