Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Non-Verbal Cues

Non-Verbal Cues

Non-verbal communication extends beyond gestures and facial expressions to include vocal elements known as paralanguage. Paralanguage consists of non-verbal vocal cues such as pitch, loudness, speech rate, pauses, and non-verbal vocalizations like laughter, sighs, and moans. These elements not only accompany speech but also provide critical emotional and contextual information.The Role of Paralanguage in CommunicationParalanguage adds depth to spoken language by conveying emotions and...

Fixed Action Patterns

Fixed Action Patterns

A fixed action pattern (FAP) is a specific, hard-wired sequence of behaviors that occurs in response to an external stimulus, called a sign stimulus. The behavior is “fixed” because it is essentially unchangeable—proceeding similarly across individuals of a species every time it occurs.

Measurement: Derived Units

Measurement: Derived Units

The International System of Units or SI system, by international agreement, has fixed measurement units for seven fundamental properties: length, mass, time, temperature, electric current, amount of substance, and luminosity. These are called the SI base units.

Measurement: Standard Units

Measurement: Standard Units

Every measurement provides three kinds of information: the size or magnitude of the measurement (a number), a standard of comparison for the measurement (a unit), and an indication of the uncertainty of the measurement. While the number and unit are explicitly represented when a quantity is written, the uncertainty is an aspect of the errors in the measurement results.

Muscles for Facial Expressions

Muscles for Facial Expressions

The craniofacial muscles are a collection of approximately 20 thin skeletal muscles situated beneath the skin of the face and scalp. These muscles, primarily responsible for the vast array of human facial expressions, originate from the bones or fibrous structures of the skull and extend outwards to connect with the skin. While most skeletal muscles in the body are enveloped in thick fascia, facial muscles generally have a more delicate fascial covering, with the buccinator muscle being a...

Facial Feedback Hypothesis

Facial Feedback Hypothesis

Charles Darwin proposed that facial expressions are an evolutionary adaptation for communication. He argued that these expressions are not influenced by culture but are universal across species. For example, a snarling expression with exposed teeth signals a threat in many animals, including humans. Darwin also suggested that displaying an emotion can intensify the feeling. Smiling, for example, could enhance one's sense of happiness. This idea laid the foundation for understanding the role...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Honey Bees Reduce Pollen Viability While Foraging.

Insects·2026

Same author

Representation Learning for Interpersonal and Multimodal Behavior Dynamics: A Multiview Extension of Latent Change Score Models.

Proceedings of the ... ACM International Conference on Multimodal Interaction. ICMI (Conference)·2025

Same author

Beyond Additive Fusion: Learning Non-Additive Multimodal Interactions.

Findings of ACL. EMNLP. Conference on Empirical Methods in Natural Language Processing·2025

Same author

Computational Analysis of Expressive Behavior in Clinical Assessment.

Annual review of clinical psychology·2025

Same author

Dynamic and dyadic relationships between facial behavior, working alliance, and treatment outcomes during depression therapy.

Journal of consulting and clinical psychology·2025

Same author

A public mid-density genotyping platform for cultivated cranberry (Vaccinium macrocarpon Aiton).

The plant genome·2025

Same journal

Lessons from Collecting a Million Biometric Samples.

Image and vision computing·2024

Same journal

A survey on computer vision based human analysis in the COVID-19 era.

Image and vision computing·2022

Same journal

Progressive ShallowNet for large scale dynamic and spontaneous facial behaviour analysis in children.

Image and vision computing·2022

Same journal

FMD-Yolo: An efficient face mask detection method for COVID-19 prevention and control in public.

Image and vision computing·2021

Same journal

Postnatal gestational age estimation of newborns using Small Sample Deep Learning.

Image and vision computing·2019

Same journal

Dense 3D Face Alignment from 2D Video for Real-Time Use.

Image and vision computing·2018

See all related articles

Search research articles

Related Experiment Video

Updated: Feb 1, 2026

Quantifying Learning in Young Infants: Tracking Leg Actions During a Discovery-learning Task

Quantifying Learning in Young Infants: Tracking Leg Actions During a Discovery-learning Task

Published on: June 1, 2015

Learning Facial Action Units with Spatiotemporal Cues and Multi-label Sampling.

Wen-Sheng Chu¹, Fernando De la Torre¹, Jeffrey F Cohn²

¹Robotics Institute, Carnegie Mellon University, Pittsburgh, USA.

Image and Vision Computing

|December 8, 2018

Summary

This summary is machine-generated.

This study introduces a hybrid network for facial action unit (AU) detection, integrating spatial and temporal data. The novel approach improves AU detection accuracy by considering AU correlations and addressing data imbalance.

Keywords:

00-01 99-00 Multi-label learning deep learning facial action unit detection multi-label sampling spatio-temporal learning video analysis

More Related Videos

Simultaneous Label-Free Autofluorescence Multi-Harmonic Microscopy

Simultaneous Label-Free Autofluorescence Multi-Harmonic Microscopy

Published on: August 29, 2025

Recording Single Neurons' Action Potentials from Freely Moving Pigeons Across Three Stages of Learning

Recording Single Neurons' Action Potentials from Freely Moving Pigeons Across Three Stages of Learning

Published on: June 2, 2014

Related Experiment Videos

Last Updated: Feb 1, 2026

Quantifying Learning in Young Infants: Tracking Leg Actions During a Discovery-learning Task

Quantifying Learning in Young Infants: Tracking Leg Actions During a Discovery-learning Task

Published on: June 1, 2015

Simultaneous Label-Free Autofluorescence Multi-Harmonic Microscopy

Simultaneous Label-Free Autofluorescence Multi-Harmonic Microscopy

Published on: August 29, 2025

Recording Single Neurons' Action Potentials from Freely Moving Pigeons Across Three Stages of Learning

Recording Single Neurons' Action Potentials from Freely Moving Pigeons Across Three Stages of Learning

Published on: June 2, 2014

Area of Science:

Computer Vision
Machine Learning
Human-Computer Interaction

Background:

Facial action units (AUs) are crucial for understanding facial expressions.
Previous research often analyzes AUs spatially or temporally, but not jointly.
Existing methods may exhibit person-specific biases and struggle with sparse AU data.

Purpose of the Study:

To develop a hybrid network architecture for joint spatial and temporal AU representation modeling.
To improve the accuracy and reduce biases in AU detection.
To address class imbalance issues in AU datasets.

Main Methods:

A hybrid network combining Convolutional Neural Networks (CNNs) for spatial features and Long Short-Term Memory (LSTM) networks for temporal dependencies.
A fusion network aggregates CNN and LSTM outputs for per-frame AU prediction.
Introduction of multi-labeling sampling strategies to handle class imbalance.

Main Results:

The hybrid system demonstrated reduced person-specific biases compared to state-of-the-art methods.
Increased accuracy in AU detection was achieved on the GFT and BP4D datasets.
Multi-labeling sampling strategies further enhanced accuracy, particularly for sparse AUs.

Conclusions:

Jointly modeling spatial, temporal, and correlational aspects of AUs leads to superior detection performance.
The proposed hybrid network offers a more robust and accurate approach to facial action unit recognition.
Visualizations provide novel insights into machine perception of facial actions.