Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Sleepwalking and Sleep Talking

Sleepwalking and Sleep Talking

Somnambulism, commonly known as sleepwalking, involves individuals engaging in activities ranging from simple walking to more complex behaviors such as driving. Sleepwalking typically occurs during the slow-wave sleep stages 3 and 4 early in the night when the person is not dreaming, contradicting the myth that sleepwalkers are acting out their dreams.
Factors that increase the likelihood of sleepwalking include sleep deprivation and alcohol consumption. Contrary to common beliefs, it is safe...

Bacterial Transformation

Bacterial Transformation

In 1928, bacteriologist Frederick Griffith worked on a vaccine for pneumonia, which is caused by Streptococcus pneumoniae bacteria. Griffith studied two pneumonia strains in mice: one pathogenic and one non-pathogenic. Only the pathogenic strain killed host mice.
Griffith made an unexpected discovery when he killed the pathogenic strain and mixed its remains with the live, non-pathogenic strain. Not only did the mixture kill host mice, but it also contained living pathogenic bacteria that...

Transformation

Transformation

Microbial communities are dynamic environments where cell lysis releases free DNA into the surroundings. Other cells can take up this extracellular DNA through a process known as transformation.When a cell incorporates this foreign DNA into its genome, resulting in genetic modification, the process is known as transformation. Cells capable of this process are termed competent. Competence can be natural, as observed in certain bacteria and archaea, or artificially induced in the...

Transformers

Transformers

A device that transforms voltages from one value to another using induction is called a transformer. A transformer consists of two separate coils, or windings, wrapped around the same soft iron core. However, they are electrically insulated from each other.
The iron core has a substantial relative permeability. Therefore, the magnetic field lines generated due to the current in one winding are almost entirely confined within the core, such that the same magnetic flux permeates each turn of both...

ATP Driven Pumps I: An Overview

ATP Driven Pumps I: An Overview

ATP-driven pumps, also known as transport ATPases, are integral membrane proteins. They have binding sites for ATP located on the membrane's cytosolic side and the ion-conducting domain in the transmembrane region. These pumps use the free energy released from ATP hydrolysis to move the solutes across cell membranes against an electrochemical gradient.
There are four main types of ATP-driven pumps - P-type, V-type, F-type, and ABC transporter. All these pumps are of varying complexities and...

The Ideal Transformer

The Ideal Transformer

In single-phase two-winding transformers, two windings are coiled around a magnetic core characterized by cross-sectional area A and magnetic permeability μ. A phasor current i1 enters the left winding while i2 exits the right winding, establishing the fundamental working of the transformer through electromagnetic principles.
Ampere's Law forms the basis of understanding the magnetic field within the transformer. It states that the integral of the magnetic field intensity's tangential...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

A toehold-triggered switchable three-way junction protective nanoprobe for RNase H-assisted HBV rcDNA detection.

Journal of nanobiotechnology·2026

Same author

Hepatocyte TrkB Acts as a Gatekeeper Against MASH-Related Liver Fibrosis by Suppressing the TGFβ/CCL2 Axis and Macrophage Infiltration.

Cell proliferation·2026

Same author

Enhancing food system protection: synergistic action of engineered endolysin LYSMP and berberine hydrochloride against Streptococcus suis on pork.

Food research international (Ottawa, Ont.)·2026

Same author

Unveiling the prognostic role of FABP4 in early-onset colorectal cancer through big data analysis and preliminary clinical validation.

Frontiers in oncology·2026

Same author

The SIRT1-HMGB1 pathway in hippocampal microglia is involved in chronic heroin-induced cognitive impairment.

Pharmacology, biochemistry, and behavior·2026

Same author

Bimetallic plasmonic thermo-cycling driven by strong interfacial coupling in dual-hollow structure for enhanced photothermal hydrogen production.

Journal of colloid and interface science·2026

Same journal

Integrated multi-assessment and structural performance index framework for stacking-sequence optimisation of natural fibre reinforced laminates.

Scientific reports·2026

Same journal

SuperiorGAT: graph attention networks for sparse LiDAR point cloud reconstruction in autonomous systems.

Scientific reports·2026

Same journal

The effect of stretching the pectoralis major, sternocleidomastoid, and iliopsoas muscles on 800 m swimming performance in master swimmers.

Scientific reports·2026

Same journal

ISNR-PQC: isometry noise resilience post quantum cryptography primitive.

Scientific reports·2026

Same journal

Identification of high-yielding and stable genotypes of barley in the cold climate of Iran using AMMI and GGE biplot models.

Scientific reports·2026

Same journal

Bayesian negative binomial modelling of spatial and temporal patterns of road traffic deaths in Ghana.

Scientific reports·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jan 21, 2026

Light-driven Molecular Motors on Surfaces for Single Molecular Imaging

Light-driven Molecular Motors on Surfaces for Single Molecular Imaging

Published on: March 13, 2019

Audio-driven single image talking face animation with transformers.

Yixin Li¹, Xizhong Shen²

¹Department of Intelligent Technology, Shanghai Institute of Technology, 100 Haiquan Road, Shanghai, 201418, Fengxian District, China.

Scientific Reports

|January 19, 2026

Summary

This summary is machine-generated.

ExpNet generates realistic talking-head videos by decoupling head motion from facial expressions. This Transformer-based approach improves lip synchronization and video quality, even with emotional variations.

Keywords:

Audio-driven video generation Expression regression Talking-head synthesis Transformer-based model

More Related Videos

Investigating the Effect of Visual Imagery and Learning Shape-Audio Regularities on Bouba and Kiki

Investigating the Effect of Visual Imagery and Learning Shape-Audio Regularities on Bouba and Kiki

Published on: September 13, 2019

Computer-Generated Animal Model Stimuli

Computer-Generated Animal Model Stimuli

Published on: July 29, 2007

Related Experiment Videos

Last Updated: Jan 21, 2026

Light-driven Molecular Motors on Surfaces for Single Molecular Imaging

Light-driven Molecular Motors on Surfaces for Single Molecular Imaging

Published on: March 13, 2019

Investigating the Effect of Visual Imagery and Learning Shape-Audio Regularities on Bouba and Kiki

Investigating the Effect of Visual Imagery and Learning Shape-Audio Regularities on Bouba and Kiki

Published on: September 13, 2019

Computer-Generated Animal Model Stimuli

Computer-Generated Animal Model Stimuli

Published on: July 29, 2007

Area of Science:

Computer Vision
Artificial Intelligence
Human-Computer Interaction

Background:

Audio-driven talking-head video generation is vital for virtual humans and digital content.
Current methods struggle with unnatural lip movements and facial distortions, especially during emotional expressions.
These issues stem from audio signals entangling linguistic content, emotion, and speaker attributes.

Purpose of the Study:

To propose ExpNet, a novel framework for realistic audio-driven talking-head video generation.
To address limitations in existing methods regarding expression realism and lip synchronization.
To enhance the quality of generated videos, particularly under exaggerated expressions and emotional variations.

Main Methods:

Utilized a Transformer-based expression regression framework (ExpNet) employing 3DMM coefficients.
Implemented a conditional Variational Autoencoder (VAE) for head pose coefficient generation.
Employed a CNN-Transformer architecture with ALiBi-based relative positional bias for expression coefficient regression, conditioned on the first frame.

Main Results:

ExpNet demonstrated superior performance over existing methods in expression realism, lip synchronization, and video quality across multiple datasets (HDTF, MEAD, LRS3).
Ablation studies confirmed the importance of ALiBi, landmark supervision, and the Transformer module for temporal stability and reduced lip jitter.
The framework successfully preserved identity and emotion consistency throughout the generated videos.

Conclusions:

ExpNet effectively generates high-quality talking-head videos by decoupling head motion and facial expressions.
The proposed method offers significant improvements in realism and synchronization compared to prior approaches.
Key components like ALiBi and Transformer modules are critical for robust and consistent facial animation.