Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Labeling Emotion01:20

Labeling Emotion

1.0K
Emotional labeling is a cognitive process that involves identifying and naming one's emotions, such as anger, fear, happiness, or sadness. It allows individuals to recognize and express their internal emotional states, a critical aspect of emotional regulation and communication. Labeling emotions requires more than mere recognition; it also involves drawing upon memory and contextual cues to understand the current situation and apply a corresponding emotional label. For instance, feeling...
1.0K
Non-Verbal Cues01:29

Non-Verbal Cues

774
Non-verbal communication extends beyond gestures and facial expressions to include vocal elements known as paralanguage. Paralanguage consists of non-verbal vocal cues such as pitch, loudness, speech rate, pauses, and non-verbal vocalizations like laughter, sighs, and moans. These elements not only accompany speech but also provide critical emotional and contextual information.The Role of Paralanguage in CommunicationParalanguage adds depth to spoken language by conveying emotions and...
774

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Listening to MS: AI-assisted speech analysis for diagnosis and fatigue prediction (COMMITMENT).

Frontiers in digital health·2026
Same author

Prior-aligned frequency-domain explanations for heart sound classification: a scale-consistent attribution approach.

Frontiers in artificial intelligence·2026
Same author

Explainable detection of machine generated music and early systematic evaluation.

Scientific reports·2026
Same author

Ear-Worn Inertial Sensors Can Predict Gait Metrics and Reconstruct Vertical Ground Reaction Force Curves During Running.

Journal of applied biomechanics·2026
Same author

A frequency analysis of filterbank initialisation and noise augmentation for LEAF.

Scientific reports·2026
Same author

Randomised controlled trial of a very brief nurse-delivered intervention followed by a digital intervention to support medication adherence and reduce blood pressure in people prescribed treatment for hypertension in primary care: protocol for the Programme on Adherence to Medication (PAM) trial.

NIHR open research·2026
Same journal

Analysis of End-Tidal CO2 Variability During Plateau Waves Episodes: An Information Theoretic Approach<sup></sup>.

Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference·2025
Same journal

AI and Tomosynthesis for Breast Cancer Molecular Subtyping: A step toward precision medicine<sup></sup>.

Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference·2025
Same journal

Towards Sustainable Protein Recovery from Biological Waste: Assessing Polyethersulfone-based Microfiltration.

Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference·2025
Same journal

Analysis of the cardiovascular response to standardized polymicrobial peritonitis experimental model.

Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference·2025
Same journal

Automated Wrist Ultrasound Image Bone Enhancement and Segmentation Using Deep Learning.

Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference·2025
Same journal

A Deep Learning approach for Depressive Symptoms assessment in Parkinson's disease patients using facial videos.

Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference·2025
See all related articles

Related Experiment Video

Updated: May 2, 2026

Exploring the Use of Isolated Expressions and Film Clips to Evaluate Emotion Recognition by People with Traumatic Brain Injury
05:51

Exploring the Use of Isolated Expressions and Film Clips to Evaluate Emotion Recognition by People with Traumatic Brain Injury

Published on: May 15, 2016

8.9K

Emotion Recognition from Speech Signals by Mel-Spectrogram and a CNN-RNN.

Roneel V Sharan, Cecilia Mascolo, Bjorn W Schuller

    Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference
    |March 5, 2025
    PubMed
    Summary
    This summary is machine-generated.

    This study introduces a novel speech emotion recognition (SER) method using Mel-spectrograms and neural networks. The approach effectively identifies emotions like anger, happiness, and sadness from speech signals, showing promising results for health applications.

    More Related Videos

    Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention
    06:37

    Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention

    Published on: December 15, 2023

    2.5K
    Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception
    05:48

    Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception

    Published on: August 9, 2024

    1.4K

    Related Experiment Videos

    Last Updated: May 2, 2026

    Exploring the Use of Isolated Expressions and Film Clips to Evaluate Emotion Recognition by People with Traumatic Brain Injury
    05:51

    Exploring the Use of Isolated Expressions and Film Clips to Evaluate Emotion Recognition by People with Traumatic Brain Injury

    Published on: May 15, 2016

    8.9K
    Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention
    06:37

    Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention

    Published on: December 15, 2023

    2.5K
    Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception
    05:48

    Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception

    Published on: August 9, 2024

    1.4K

    Area of Science:

    • Computational linguistics
    • Affective computing
    • Machine learning for audio analysis

    Background:

    • Speech emotion recognition (SER) is crucial for understanding emotional well-being in health applications.
    • Existing methods require robust feature extraction and temporal modeling for accurate emotion detection.

    Purpose of the Study:

    • To propose and evaluate a novel SER method utilizing time-frequency representations and deep neural networks.
    • To enhance the accuracy of emotion detection in speech for potential health monitoring.

    Main Methods:

    • Speech signals are segmented and transformed into Mel-spectrograms.
    • A pretrained convolutional neural network (YAMNet) extracts spectral features.
    • A recurrent neural network (LSTM) models temporal dependencies between spectrograms.

    Main Results:

    • The proposed method achieved average accuracies of 0.711 and 0.780 on two SER datasets.
    • Demonstrated relative improvement over baseline methods in emotion classification.
    • Successfully identified angry, happy, sad, and neutral emotional states.

    Conclusions:

    • The combination of Mel-spectrograms, YAMNet, and LSTM networks offers a powerful approach for SER.
    • This method shows significant potential for real-world applications in mental health and well-being monitoring.
    • Further research can explore broader emotion ranges and diverse datasets.