Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Non-Verbal Cues01:29

Non-Verbal Cues

332
Non-verbal communication extends beyond gestures and facial expressions to include vocal elements known as paralanguage. Paralanguage consists of non-verbal vocal cues such as pitch, loudness, speech rate, pauses, and non-verbal vocalizations like laughter, sighs, and moans. These elements not only accompany speech but also provide critical emotional and contextual information.The Role of Paralanguage in CommunicationParalanguage adds depth to spoken language by conveying emotions and...
332
Fixed Action Patterns01:06

Fixed Action Patterns

17.6K
A fixed action pattern (FAP) is a specific, hard-wired sequence of behaviors that occurs in response to an external stimulus, called a sign stimulus. The behavior is “fixed” because it is essentially unchangeable—proceeding similarly across individuals of a species every time it occurs.
17.6K
Measurement: Derived Units03:02

Measurement: Derived Units

55.4K
The International System of Units or SI system, by international agreement, has fixed measurement units for seven fundamental properties: length, mass, time, temperature, electric current, amount of substance, and luminosity. These are called the SI base units.
55.4K
Measurement: Standard Units03:38

Measurement: Standard Units

79.8K
Every measurement provides three kinds of information: the size or magnitude of the measurement (a number), a standard of comparison for the measurement (a unit), and an indication of the uncertainty of the measurement. While the number and unit are explicitly represented when a quantity is written, the uncertainty is an aspect of the errors in the measurement results.
79.8K
Muscles for Facial Expressions01:14

Muscles for Facial Expressions

4.9K
The craniofacial muscles are a collection of approximately 20 thin skeletal muscles situated beneath the skin of the face and scalp. These muscles, primarily responsible for the vast array of human facial expressions, originate from the bones or fibrous structures of the skull and extend outwards to connect with the skin. While most skeletal muscles in the body are enveloped in thick fascia, facial muscles generally have a more delicate fascial covering, with the buccinator muscle being a...
4.9K
Facial Feedback Hypothesis01:24

Facial Feedback Hypothesis

663
Charles Darwin proposed that facial expressions are an evolutionary adaptation for communication. He argued that these expressions are not influenced by culture but are universal across species. For example, a snarling expression with exposed teeth signals a threat in many animals, including humans. Darwin also suggested that displaying an emotion can intensify the feeling. Smiling, for example, could enhance one's sense of happiness. This idea laid the foundation for understanding the role...
663

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Honey Bees Reduce Pollen Viability While Foraging.

Insects·2026
Same author

Representation Learning for Interpersonal and Multimodal Behavior Dynamics: A Multiview Extension of Latent Change Score Models.

Proceedings of the ... ACM International Conference on Multimodal Interaction. ICMI (Conference)·2025
Same author

Beyond Additive Fusion: Learning Non-Additive Multimodal Interactions.

Findings of ACL. EMNLP. Conference on Empirical Methods in Natural Language Processing·2025
Same author

Computational Analysis of Expressive Behavior in Clinical Assessment.

Annual review of clinical psychology·2025
Same author

Dynamic and dyadic relationships between facial behavior, working alliance, and treatment outcomes during depression therapy.

Journal of consulting and clinical psychology·2025
Same author

A public mid-density genotyping platform for cultivated cranberry (Vaccinium macrocarpon Aiton).

The plant genome·2025
Same journal

Lessons from Collecting a Million Biometric Samples.

Image and vision computing·2024
Same journal

A survey on computer vision based human analysis in the COVID-19 era.

Image and vision computing·2022
Same journal

Progressive ShallowNet for large scale dynamic and spontaneous facial behaviour analysis in children.

Image and vision computing·2022
Same journal

FMD-Yolo: An efficient face mask detection method for COVID-19 prevention and control in public.

Image and vision computing·2021
Same journal

Postnatal gestational age estimation of newborns using Small Sample Deep Learning.

Image and vision computing·2019
Same journal

Dense 3D Face Alignment from 2D Video for Real-Time Use.

Image and vision computing·2018
See all related articles

Related Experiment Video

Updated: Feb 1, 2026

Quantifying Learning in Young Infants: Tracking Leg Actions During a Discovery-learning Task
11:18

Quantifying Learning in Young Infants: Tracking Leg Actions During a Discovery-learning Task

Published on: June 1, 2015

11.1K

Learning Facial Action Units with Spatiotemporal Cues and Multi-label Sampling.

Wen-Sheng Chu1, Fernando De la Torre1, Jeffrey F Cohn2

  • 1Robotics Institute, Carnegie Mellon University, Pittsburgh, USA.

Image and Vision Computing
|December 8, 2018
PubMed
Summary
This summary is machine-generated.

This study introduces a hybrid network for facial action unit (AU) detection, integrating spatial and temporal data. The novel approach improves AU detection accuracy by considering AU correlations and addressing data imbalance.

Keywords:
00-0199-00Multi-label learningdeep learningfacial action unit detectionmulti-label samplingspatio-temporal learningvideo analysis

More Related Videos

Simultaneous Label-Free Autofluorescence Multi-Harmonic Microscopy
09:19

Simultaneous Label-Free Autofluorescence Multi-Harmonic Microscopy

Published on: August 29, 2025

611
Recording Single Neurons' Action Potentials from Freely Moving Pigeons Across Three Stages of Learning
11:20

Recording Single Neurons' Action Potentials from Freely Moving Pigeons Across Three Stages of Learning

Published on: June 2, 2014

12.4K

Related Experiment Videos

Last Updated: Feb 1, 2026

Quantifying Learning in Young Infants: Tracking Leg Actions During a Discovery-learning Task
11:18

Quantifying Learning in Young Infants: Tracking Leg Actions During a Discovery-learning Task

Published on: June 1, 2015

11.1K
Simultaneous Label-Free Autofluorescence Multi-Harmonic Microscopy
09:19

Simultaneous Label-Free Autofluorescence Multi-Harmonic Microscopy

Published on: August 29, 2025

611
Recording Single Neurons' Action Potentials from Freely Moving Pigeons Across Three Stages of Learning
11:20

Recording Single Neurons' Action Potentials from Freely Moving Pigeons Across Three Stages of Learning

Published on: June 2, 2014

12.4K

Area of Science:

  • Computer Vision
  • Machine Learning
  • Human-Computer Interaction

Background:

  • Facial action units (AUs) are crucial for understanding facial expressions.
  • Previous research often analyzes AUs spatially or temporally, but not jointly.
  • Existing methods may exhibit person-specific biases and struggle with sparse AU data.

Purpose of the Study:

  • To develop a hybrid network architecture for joint spatial and temporal AU representation modeling.
  • To improve the accuracy and reduce biases in AU detection.
  • To address class imbalance issues in AU datasets.

Main Methods:

  • A hybrid network combining Convolutional Neural Networks (CNNs) for spatial features and Long Short-Term Memory (LSTM) networks for temporal dependencies.
  • A fusion network aggregates CNN and LSTM outputs for per-frame AU prediction.
  • Introduction of multi-labeling sampling strategies to handle class imbalance.

Main Results:

  • The hybrid system demonstrated reduced person-specific biases compared to state-of-the-art methods.
  • Increased accuracy in AU detection was achieved on the GFT and BP4D datasets.
  • Multi-labeling sampling strategies further enhanced accuracy, particularly for sparse AUs.

Conclusions:

  • Jointly modeling spatial, temporal, and correlational aspects of AUs leads to superior detection performance.
  • The proposed hybrid network offers a more robust and accurate approach to facial action unit recognition.
  • Visualizations provide novel insights into machine perception of facial actions.