Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Facial Feedback Hypothesis01:24

Facial Feedback Hypothesis

Charles Darwin proposed that facial expressions are an evolutionary adaptation for communication. He argued that these expressions are not influenced by culture but are universal across species. For example, a snarling expression with exposed teeth signals a threat in many animals, including humans. Darwin also suggested that displaying an emotion can intensify the feeling. Smiling, for example, could enhance one's sense of happiness. This idea laid the foundation for understanding the role of...
Transformation01:26

Transformation

Microbial communities are dynamic environments where cell lysis releases free DNA into the surroundings. Other cells can take up this extracellular DNA through a process known as transformation.When a cell incorporates this foreign DNA into its genome, resulting in genetic modification, the process is known as transformation. Cells capable of this process are termed competent. Competence can be natural, as observed in certain bacteria and archaea, or artificially induced in the...
Muscles for Facial Expressions01:14

Muscles for Facial Expressions

The craniofacial muscles are a collection of approximately 20 thin skeletal muscles situated beneath the skin of the face and scalp. These muscles, primarily responsible for the vast array of human facial expressions, originate from the bones or fibrous structures of the skull and extend outwards to connect with the skin. While most skeletal muscles in the body are enveloped in thick fascia, facial muscles generally have a more delicate fascial covering, with the buccinator muscle being a...
Impression Management Techniques IV: Altercasting01:14

Impression Management Techniques IV: Altercasting

Altercasting is a strategic communication technique in which an individual imposes a specific identity or social role onto another person to influence their behavior and shape the interaction. By presuming a role—such as “responsible leader” or “patient person”—altercasting encourages the target to conform to that identity, often aligning their behavior with the expectations associated with the role. The power of this tactic lies in its subtlety; once a role is assigned, it becomes socially...
Source Transformation01:15

Source Transformation

Source transformation is a fundamental technique employed in circuit analysis, offering a valuable tool for simplifying complex electrical circuits. This technique involves the replacement of either a voltage source in series with a resistor by a current source in parallel with a resistor, or vice versa. The key concept here is that when the original sources are deactivated (turned off), the equivalent resistance at the circuit's end terminals remains the same.
It is essential to note that when...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Deep learning-enabled self-powered bimodal flexible sensor for intelligent access control.

Nanotechnology·2026
Same author

Polygenic risk score analysis of noise-induced hearing loss: An integrated cross-sectional and longitudinal study.

Hearing research·2026
Same author

A three-pronged strategy with minimalist nattokinase nanocomposite eye drops breaks the vicious cycle in ultraviolet-B-induced cataract.

Journal of nanobiotechnology·2026
Same author

Ultrathin proton exchange membranes with enhanced dimensional stability towards acidic CO<sub>2</sub> electroreduction.

Chemical communications (Cambridge, England)·2026
Same author

A Dual-Gene Signature of PMAIP1 and GADD45A for Early Detection of Intrahepatic Cholangiocarcinoma in the Context of Primary Sclerosing Cholangitis.

International journal of molecular sciences·2026
Same author

The association between upper limb function, physical exercise, and cognitive ability among empty-nest elderly in China: A cross-sectional study based on CLHLS.

PloS one·2026
Same journal

RETRACTED: Zhang et al. A Novel Framework for Reconstruction and Imaging of Target Scattering Centers via Wide-Angle Incidence in Radar Networks. <i>Sensors</i> 2025, <i>25</i>, 6802.

Sensors (Basel, Switzerland)·2026
Same journal

Enhancing Unsupervised Multi-Source Domain Adaptation for Person Re-Identification via Mixture of Experts and Graph-Based Relation.

Sensors (Basel, Switzerland)·2026
Same journal

Development of an Instrumented Glove for Palmar Pressure Assessment in Kayakers.

Sensors (Basel, Switzerland)·2026
Same journal

Development and Experimental Validation of an Autonomous IoT-Based Monitoring System for Real-Time Water Quality Assessment in the Amazon River.

Sensors (Basel, Switzerland)·2026
Same journal

Semi-Supervised Adversarial Learning Framework for Controller Area Network Bus Intrusion Detection.

Sensors (Basel, Switzerland)·2026
Same journal

Smart Optimization Method for Safety Signs in Innovative Manufacturing Environments Integrating Industrial Field IoT Sensors and Knowledge Graphs.

Sensors (Basel, Switzerland)·2026
See all related articles

Related Experiment Video

Updated: May 28, 2026

Holistic Facial Composite Creation and Subsequent Video Line-up Eyewitness Identification Paradigm
09:49

Holistic Facial Composite Creation and Subsequent Video Line-up Eyewitness Identification Paradigm

Published on: December 24, 2015

Cross-Identity Interaction Transformer for Facial Age Estimation.

Yiming Ma1, Chunlong Hu1, Changbin Shao1

  • 1School of Computer Science, Jiangsu University of Science and Technology, Zhenjiang 212003, China.

Sensors (Basel, Switzerland)
|May 27, 2026
PubMed
Summary
This summary is machine-generated.

Researchers developed a new artificial intelligence model to improve how computers estimate a person's age from facial images. By comparing a target face against a sequence of other individuals, the system better identifies universal aging signs like wrinkles while ignoring personal differences. This approach proves more accurate than previous methods across several standard testing datasets.

Keywords:
cross-identity interactionfacial age estimationprior-guided axial cross-image attentionbiometric analysisdeep learning visionage prediction modelstransformer architecture

Frequently Asked Questions

More Related Videos

Perceptual and Category Processing of the Uncanny Valley Hypothesis' Dimension of Human Likeness: Some Methodological Issues
07:34

Perceptual and Category Processing of the Uncanny Valley Hypothesis' Dimension of Human Likeness: Some Methodological Issues

Published on: June 3, 2013

Related Experiment Videos

Last Updated: May 28, 2026

Holistic Facial Composite Creation and Subsequent Video Line-up Eyewitness Identification Paradigm
09:49

Holistic Facial Composite Creation and Subsequent Video Line-up Eyewitness Identification Paradigm

Published on: December 24, 2015

Perceptual and Category Processing of the Uncanny Valley Hypothesis' Dimension of Human Likeness: Some Methodological Issues
07:34

Perceptual and Category Processing of the Uncanny Valley Hypothesis' Dimension of Human Likeness: Some Methodological Issues

Published on: June 3, 2013

Area of Science:

  • Computer vision and pattern recognition within Cross-Identity Interaction Transformer research
  • Artificial intelligence and machine learning applications in biometrics

Background:

No prior work has fully resolved the difficulty of estimating age due to significant appearance differences between people. It was already known that individual traits often mask universal aging signals. This gap motivated researchers to rethink how machines interpret facial data. Prior research has shown that standard models struggle when personal variation exceeds age-related changes. That uncertainty drove the development of new architectures. Previous approaches frequently failed to distinguish between unique features and common aging patterns. This study addresses these limitations by shifting the focus toward shared characteristics. The field required a more robust way to handle diverse facial appearances during analysis.

Purpose Of The Study:

The primary aim is to improve the accuracy of facial age estimation by addressing the challenge of intra-age appearance variations. Researchers sought to resolve the problem where individual differences often exceed actual age-related changes. This study investigates whether learning common cues across multiple identities can enhance performance. The authors propose a new architecture to reformulate age prediction as a multi-image learning task. They intend to demonstrate that comparing a query image to a sequence of references provides better context. The team also aims to preserve age evidence from fine textures to coarse structural changes. They seek to prove that guiding attention toward age-sensitive regions improves model robustness. Finally, the study evaluates if this approach works effectively across diverse, large-scale benchmark datasets.

Main Methods:

The review approach involves constructing a sequence of images for every query to facilitate comparative learning. This design treats age prediction as a multi-image task rather than a standard single-input problem. The researchers utilize a Transformer-based architecture to process these sequences through alternating attention blocks. One module refines local representations while another manages cross-image interactions guided by edge priors. A specialized regression network then aggregates these refined features to produce a final age estimate. The team evaluated their system using four distinct, publicly available benchmark datasets. They compared their results against established metrics to confirm the model's reliability. This methodology ensures that the system learns common aging patterns while ignoring individual-specific noise.

Main Results:

Key findings from the literature indicate that this model achieves superior performance across all four tested benchmark datasets. The system successfully captures shared facial characteristics that remain consistent across different individuals. By using multi-scale edge priors, the attention mechanism effectively highlights age-sensitive regions like wrinkles. The integration of local feature refinement and cross-image interaction leads to more accurate age predictions. The anchored regression network provides stability when processing diverse facial aging patterns. These results confirm that multi-image learning helps overcome the challenge of large intra-age appearance variations. The model consistently outperforms existing methods on standard evaluation metrics. This evidence supports the claim that the proposed architecture is highly effective for age estimation.

Conclusions:

The authors propose that their model effectively captures universal aging cues across different people. This synthesis suggests that multi-image learning improves accuracy compared to single-image approaches. The findings imply that focusing on shared facial traits reduces errors caused by individual appearance variations. Researchers claim that the integration of edge priors guides the system toward age-sensitive regions. The study demonstrates that their regression network provides stable predictions across diverse datasets. These results indicate that combining local and cross-image attention enhances feature representation. The authors conclude that their architecture outperforms existing methods on standard benchmarks. This work provides a framework for future developments in robust biometric age estimation.

The researchers propose a multi-image learning task where a query image is compared against a sequence of other identities. This mechanism allows the model to isolate universal aging cues from individual-specific facial features, which often obscure age-related information in traditional single-image analysis.

The Cross-Scale Embedding module preserves age evidence by extracting features at multiple levels of detail. It captures everything from fine skin textures to coarse structural changes, ensuring that the model retains comprehensive information necessary for precise age determination.

The Prior-Guided Axial Cross-Image Attention mechanism is necessary to focus the model on age-sensitive regions. By utilizing multi-scale edge priors, it directs the interaction toward specific areas like wrinkles, which are highly indicative of aging, rather than irrelevant facial zones.

The Anchored Regression Network acts as the final decision-making component. It calculates age by applying a soft-weighted combination of multiple linear regressors, which ensures robust performance even when faced with the diverse aging patterns found in large datasets.

The model was tested on four benchmark datasets: MORPH Album II, MegaAge-Asian, FG-NET, and Adience. These datasets provide a diverse range of facial images, allowing for a thorough evaluation of the system's performance across different demographic groups and aging conditions.

The authors claim that their approach achieves superior performance across multiple evaluation metrics. They suggest this validates the effectiveness of their transformer-based design in capturing shared facial characteristics that are consistent across different individuals.