Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Labeling Emotion01:20

Labeling Emotion

237
Emotional labeling is a cognitive process that involves identifying and naming one's emotions, such as anger, fear, happiness, or sadness. It allows individuals to recognize and express their internal emotional states, a critical aspect of emotional regulation and communication. Labeling emotions requires more than mere recognition; it also involves drawing upon memory and contextual cues to understand the current situation and apply a corresponding emotional label. For instance, feeling...
237
Force Classification01:22

Force Classification

1.6K
Forces play a crucial role in the study of physics and engineering. They are essential in describing the motion, behavior, and equilibrium of objects in the physical world. Forces can be classified based on their origin, type, and direction of action.
Contact and non-contact forces are two of the most widely used categories of forces. As the name suggests, contact forces require physical contact between two objects to act upon each other. Examples of contact forces include frictional,...
1.6K
Classification of Signals01:30

Classification of Signals

878
In signal processing, signals are classified based on various characteristics: continuous-time versus discrete-time, periodic versus aperiodic, analog versus digital, and causal versus noncausal. Each category highlights distinct properties crucial for understanding and manipulating signals.
A continuous-time signal holds a value at every instant in time, representing information seamlessly. In contrast, a discrete-time signal holds values only at specific moments, often denoted as x(n), where...
878
Physiology of Emotion01:20

Physiology of Emotion

1.4K
The physiology of emotions is a multifaceted process involving the autonomic nervous system, brain structures, hormones, and neurotransmitters. This intricate interplay dictates how emotions manifest in the body and influence behavior.
Autonomic Nervous System
The autonomic nervous system (ANS) plays a critical role in emotional responses by regulating involuntary physiological functions. It consists of two main components: the sympathetic and parasympathetic systems. The sympathetic system...
1.4K
Emotional Expression01:26

Emotional Expression

367
Emotional expression encompasses how individuals convey their emotions through verbal communication and non-verbal cues. These non-verbal actions include facial expressions, body language, and physical gestures, such as frowning or smiling. Among these, facial expressions play a crucial role in emotional expression and are understood universally, indicating a biological basis for how humans communicate emotions.
Universal Facial Expressions
Psychologist Paul Ekman identified seven basic...
367

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Comparative analysis of explainable machine learning integrated with hyperspectral imaging for early prediction of wheat yield.

Talanta·2026
Same author

KDH-Net: Explainable Medical AI for Multiclass Kidney Disease Characterization from CT Images.

Journal of clinical medicine·2026
Same author

A hybrid system for detecting semiconductor wafer defects using modified MobileNet with multi-head attention.

PloS one·2026
Same author

Gene expression and metadata based identification of key genes for lung cancer, COPD, and IPF using machine learning and statistical models.

PloS one·2026
Same author

Deep Learning-Based Eye-Writing Recognition with Improved Preprocessing and Data Augmentation Techniques.

Sensors (Basel, Switzerland)·2025
Same author

A Lightweight CNN for Multiclass Retinal Disease Screening with Explainable AI.

Journal of imaging·2025
Same journal

Human-AI Interaction in Interventional Radiology: A Narrative Review of Current Applications, Challenges, and Future Directions.

Journal of imaging·2026
Same journal

Coronary Artery Anomalies and Anatomical Variants: Cross-Sectional Diagnostic Imaging and Clinical Background.

Journal of imaging·2026
Same journal

YoLeTooth: A Unified Framework for Joint Tooth Segmentation and Periapical Lesion Detection in Panoramic Radiographs.

Journal of imaging·2026
Same journal

Radiomics-Guided Multi-Sequence Learning for Pathological Complete Response Prediction from Breast MRI with Missing Auxiliary Sequences.

Journal of imaging·2026
Same journal

Cutaneous Thermography in Arthropathies: Quantitative Imaging, Machine Learning, and Clinical Translation.

Journal of imaging·2026
Same journal

Two-Stage Dynamic Synergistic Segmentation Method for Myocardial Pathology.

Journal of imaging·2026
See all related articles

Related Experiment Video

Updated: Sep 10, 2025

Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention
06:37

Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention

Published on: December 15, 2023

4.1K

Bangla Speech Emotion Recognition Using Deep Learning-Based Ensemble Learning and Feature Fusion.

Md Shahid Ahammed Shakil1, Fahmid Al Farid2, Nitun Kumar Podder1

  • 1Department of Computer Science and Engineering, Pabna University of Science and Technology, Pabna 6600, Bangladesh.

Journal of Imaging
|August 27, 2025
PubMed
Summary
This summary is machine-generated.

This study introduces a novel deep learning approach for Bangla speech emotion recognition, significantly improving accuracy and generalization by fusing handcrafted and deep learning features. The method enhances human-computer interaction systems with more robust emotion identification capabilities.

Keywords:
CNNLSTMMFCCchromagram featuresdata augmentationdeep learningensemble learningfeature extractionfeature fusionhandcrafted featurespeech-based emotion recognition (SER)time–frequency domain featurevisualizable audio representations

More Related Videos

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
03:31

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

635

Related Experiment Videos

Last Updated: Sep 10, 2025

Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention
06:37

Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention

Published on: December 15, 2023

4.1K
Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
03:31

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

635

Area of Science:

  • Speech processing
  • Artificial Intelligence
  • Human-Computer Interaction

Background:

  • Bangla speech emotion recognition faces challenges in accuracy, speaker dependency, and generalization.
  • Existing methods using traditional or basic deep learning models lack robustness in varied conditions.

Purpose of the Study:

  • To propose a novel multi-stream deep learning feature fusion approach for Bangla speech emotion recognition.
  • To address limitations of existing methods by enhancing accuracy, robustness, and generalization.

Main Methods:

  • Data augmentation techniques applied to training datasets.
  • Extraction of handcrafted features (ZCR, MFCCs, etc.) and deep learning features.
  • Multi-stream deep learning architecture with 1D CNN, CNN-LSTM, and CNN-Bi-LSTM streams.
  • Ensemble learning with soft voting for final prediction.

Main Results:

  • Achieved high accuracies: 92.90% (SUBESCO), 85.20% (BanglaSER), 90.63% (merged), 67.71% (RAVDESS), 69.25% (EMODB).
  • Demonstrated improved robustness and generalization compared to existing methods.
  • Effectively combined handcrafted and deep learning features through ensemble learning.

Conclusions:

  • The proposed multi-stream deep learning feature fusion approach significantly enhances Bangla speech emotion recognition.
  • Combining diverse features and ensemble learning provides a more comprehensive and robust solution.
  • The method offers a promising advancement for emotion recognition in human-computer interaction systems.