Bangla Speech Emotion Recognition Using Deep Learning-Based Ensemble Learning and Feature Fusion.

Md Shahid Ahammed Shakil¹, Fahmid Al Farid², Nitun Kumar Podder¹

¹Department of Computer Science and Engineering, Pabna University of Science and Technology, Pabna 6600, Bangladesh.

Journal of Imaging, August 27, 2025

Summary

This summary is machine-generated.

This study introduces a novel deep learning approach for Bangla speech emotion recognition, significantly improving accuracy and generalization by fusing handcrafted and deep learning features. The method enhances human-computer interaction systems with more robust emotion identification capabilities.

Keywords:

CNN; LSTM; MFCC; chromagram features; data augmentation; deep learning; ensemble learning; feature extraction; feature fusion; handcrafted feature; speech-based emotion recognition (SER); time–frequency domain feature; visualizable audio representations

Area of Science:

Speech processing
Artificial intelligence
Human-computer interaction

Background:

Bangla speech emotion recognition faces challenges in accuracy, speaker dependency, and generalization.
Existing methods using traditional or basic deep learning models lack robustness in varied conditions.

Purpose of the Study:

To propose a novel multi-stream deep learning feature fusion approach for Bangla speech emotion recognition.
To address limitations of existing methods by enhancing accuracy, robustness, and generalization.

Main Methods:

Data augmentation techniques applied to training datasets.
Extraction of handcrafted features (ZCR, MFCCs, etc.) and deep learning features.
Multi-stream deep learning architecture with 1D CNN, CNN-LSTM, and CNN-Bi-LSTM streams.
Ensemble learning with soft voting for final prediction.
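
Two of the steps above can be illustrated with a minimal sketch: one handcrafted feature (the zero-crossing rate, ZCR) and the final soft-voting step. This is not the authors' implementation. The function names, the three-class probability vectors, and the unweighted average are assumptions made for illustration; the summary does not say whether the streams are weighted or how many emotion classes each dataset uses.

```python
from typing import List, Sequence


def zero_crossing_rate(frame: Sequence[float]) -> float:
    """Fraction of adjacent sample pairs whose signs differ.

    Zero-valued samples are treated as non-negative (an assumption).
    """
    if len(frame) < 2:
        return 0.0
    crossings = sum(
        1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0)
    )
    return crossings / (len(frame) - 1)


def soft_vote(stream_probs: Sequence[Sequence[float]]) -> List[float]:
    """Unweighted soft voting: average per-class probabilities across streams."""
    n_streams = len(stream_probs)
    n_classes = len(stream_probs[0])
    if any(len(p) != n_classes for p in stream_probs):
        raise ValueError("all streams must output the same number of classes")
    return [
        sum(p[c] for p in stream_probs) / n_streams for c in range(n_classes)
    ]


# Hypothetical class probabilities from the three streams for one utterance.
stream_probs = [
    [0.6, 0.3, 0.1],  # hypothetical 1D CNN output
    [0.2, 0.5, 0.3],  # hypothetical CNN-LSTM output
    [0.3, 0.5, 0.2],  # hypothetical CNN-Bi-LSTM output
]
averaged = soft_vote(stream_probs)
# The predicted emotion is the class with the highest averaged probability.
predicted_class = max(range(len(averaged)), key=averaged.__getitem__)
```

Soft voting averages the probabilities rather than counting each stream's top choice, so a stream that is confident about a class contributes more to the final decision than one that is nearly undecided.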

Main Results:

Achieved accuracies of 92.90% (SUBESCO), 85.20% (BanglaSER), 90.63% (merged), 67.71% (RAVDESS), and 69.25% (EMODB).
Demonstrated improved robustness and generalization compared to existing methods.
Effectively combined handcrafted and deep learning features through ensemble learning.

Conclusions:

The proposed multi-stream deep learning feature fusion approach significantly enhances Bangla speech emotion recognition.
Combining diverse features and ensemble learning provides a more comprehensive and robust solution.
The method offers a promising advancement for emotion recognition in human-computer interaction systems.