Facial Expression Recognition Based on Squeeze Vision Transformer.

Sangwon Kim¹, Jaeyeal Nam¹, Byoung Chul Ko¹

¹Department of Computer Engineering, Keimyung University, Daegu 42601, Korea.

Sensors (Basel, Switzerland)

May 28, 2022

Summary

This summary is machine-generated.

Squeeze ViT improves facial expression recognition (FER) by combining global and local image features. The approach enhances FER performance while reducing computational complexity, and it outperforms existing methods on both lab-controlled and in-the-wild datasets.

Keywords:

facial expression recognition, landmark token, squeeze module, vision transformer, visual token

Area of Science:

Computer Science
Artificial Intelligence
Machine Learning

Background:

Vision Transformers (ViTs) excel in image classification by preserving global features.
ViTs struggle with Facial Expression Recognition (FER) due to loss of critical local features.
FER demands sensitivity to subtle, localized changes in facial imagery.

Purpose of the Study:

To introduce Squeeze ViT, a novel method for enhancing FER performance.
To address ViT's limitations in capturing local features crucial for FER.
To reduce computational complexity in FER models.

Main Methods:

Squeeze ViT combines global and local image features for FER.
Feature dimension reduction is employed to decrease computational load.
The method was evaluated on both lab-controlled and in-the-wild FER datasets.
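
The idea in the methods above can be illustrated with a minimal sketch. It is not the authors' implementation: this summary does not describe the internals of the squeeze module, so the linear projection used here, the token counts, the feature sizes, and all names (`linear_squeeze`, `attention_mults`, `visual_tokens`, `landmark_tokens`) are assumptions chosen only for illustration. The sketch concatenates global (visual) tokens with local (landmark) tokens, reduces the feature dimension, and compares a rough multiplication count for one self-attention layer before and after the reduction.

```python
import random


def linear_squeeze(tokens, weight):
    """Project each token from D_in to D_out features (hypothetical squeeze step)."""
    d_out = len(weight[0])
    return [
        [sum(tok[i] * weight[i][j] for i in range(len(tok))) for j in range(d_out)]
        for tok in tokens
    ]


def attention_mults(num_tokens, dim):
    """Rough multiply count for one self-attention layer: QK^T plus attention-weighted V."""
    return 2 * num_tokens * num_tokens * dim


random.seed(0)
D_IN, D_OUT = 8, 4  # assumed feature sizes, not taken from the paper

# Assumed inputs: 16 global patch tokens and 5 local landmark tokens.
visual_tokens = [[random.random() for _ in range(D_IN)] for _ in range(16)]
landmark_tokens = [[random.random() for _ in range(D_IN)] for _ in range(5)]

# Combine global and local features into one token sequence.
tokens = visual_tokens + landmark_tokens

# Random projection matrix standing in for learned weights.
weight = [[random.gauss(0, 0.1) for _ in range(D_OUT)] for _ in range(D_IN)]
squeezed = linear_squeeze(tokens, weight)

print(len(squeezed), len(squeezed[0]))  # 21 4
print(attention_mults(len(tokens), D_IN), attention_mults(len(tokens), D_OUT))  # 7056 3528
```

Under these assumptions, halving the feature dimension halves the attention multiplication count while the number of tokens, and therefore the mix of global and local information, stays the same.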

Main Results:

Squeeze ViT demonstrated superior FER performance compared to state-of-the-art methods.
The proposed method achieved excellent results on both controlled and wild datasets.
Reduced feature dimensions did not impede, but rather enhanced, FER accuracy.

Conclusions:

Squeeze ViT effectively overcomes ViT's limitations in FER.
The method offers a computationally efficient and high-performing solution for FER.
Squeeze ViT represents a significant advancement in automated facial expression recognition technology.