Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Sleepwalking and Sleep Talking01:17

Sleepwalking and Sleep Talking

919
Somnambulism, commonly known as sleepwalking, involves individuals engaging in activities ranging from simple walking to more complex behaviors such as driving. Sleepwalking typically occurs during the slow-wave sleep stages 3 and 4 early in the night when the person is not dreaming, contradicting the myth that sleepwalkers are acting out their dreams.
Factors that increase the likelihood of sleepwalking include sleep deprivation and alcohol consumption. Contrary to common beliefs, it is safe...
919
Bacterial Transformation01:33

Bacterial Transformation

59.5K
In 1928, bacteriologist Frederick Griffith worked on a vaccine for pneumonia, which is caused by Streptococcus pneumoniae bacteria. Griffith studied two pneumonia strains in mice: one pathogenic and one non-pathogenic. Only the pathogenic strain killed host mice.
Griffith made an unexpected discovery when he killed the pathogenic strain and mixed its remains with the live, non-pathogenic strain. Not only did the mixture kill host mice, but it also contained living pathogenic bacteria that...
59.5K
Transformation01:26

Transformation

720
Microbial communities are dynamic environments where cell lysis releases free DNA into the surroundings. Other cells can take up this extracellular DNA through a process known as transformation.When a cell incorporates this foreign DNA into its genome, resulting in genetic modification, the process is known as transformation. Cells capable of this process are termed competent. Competence can be natural, as observed in certain bacteria and archaea, or artificially induced in the...
720
Transformers01:26

Transformers

1.7K
A device that transforms voltages from one value to another using induction is called a transformer. A transformer consists of two separate coils, or windings, wrapped around the same soft iron core. However, they are electrically insulated from each other.
The iron core has a substantial relative permeability. Therefore, the magnetic field lines generated due to the current in one winding are almost entirely confined within the core, such that the same magnetic flux permeates each turn of both...
1.7K
ATP Driven Pumps I: An Overview01:27

ATP Driven Pumps I: An Overview

9.7K
ATP-driven pumps, also known as transport ATPases, are integral membrane proteins. They have binding sites for ATP located on the membrane's cytosolic side and the ion-conducting domain in the transmembrane region. These pumps use the free energy released from ATP hydrolysis to move the solutes across cell membranes against an electrochemical gradient.
There are four main types of ATP-driven pumps - P-type, V-type, F-type, and ABC transporter. All these pumps are of varying complexities and...
9.7K
The Ideal Transformer01:26

The Ideal Transformer

1.4K
In single-phase two-winding transformers, two windings are coiled around a magnetic core characterized by cross-sectional area A and magnetic permeability μ. A phasor current i1 enters the left winding while i2 exits the right winding, establishing the fundamental working of the transformer through electromagnetic principles.
Ampere's Law forms the basis of understanding the magnetic field within the transformer. It states that the integral of the magnetic field intensity's tangential...
1.4K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

A toehold-triggered switchable three-way junction protective nanoprobe for RNase H-assisted HBV rcDNA detection.

Journal of nanobiotechnology·2026
Same author

Hepatocyte TrkB Acts as a Gatekeeper Against MASH-Related Liver Fibrosis by Suppressing the TGFβ/CCL2 Axis and Macrophage Infiltration.

Cell proliferation·2026
Same author

Enhancing food system protection: synergistic action of engineered endolysin LYSMP and berberine hydrochloride against Streptococcus suis on pork.

Food research international (Ottawa, Ont.)·2026
Same author

Unveiling the prognostic role of FABP4 in early-onset colorectal cancer through big data analysis and preliminary clinical validation.

Frontiers in oncology·2026
Same author

The SIRT1-HMGB1 pathway in hippocampal microglia is involved in chronic heroin-induced cognitive impairment.

Pharmacology, biochemistry, and behavior·2026
Same author

Bimetallic plasmonic thermo-cycling driven by strong interfacial coupling in dual-hollow structure for enhanced photothermal hydrogen production.

Journal of colloid and interface science·2026
Same journal

Integrated multi-assessment and structural performance index framework for stacking-sequence optimisation of natural fibre reinforced laminates.

Scientific reports·2026
Same journal

SuperiorGAT: graph attention networks for sparse LiDAR point cloud reconstruction in autonomous systems.

Scientific reports·2026
Same journal

The effect of stretching the pectoralis major, sternocleidomastoid, and iliopsoas muscles on 800 m swimming performance in master swimmers.

Scientific reports·2026
Same journal

ISNR-PQC: isometry noise resilience post quantum cryptography primitive.

Scientific reports·2026
Same journal

Identification of high-yielding and stable genotypes of barley in the cold climate of Iran using AMMI and GGE biplot models.

Scientific reports·2026
Same journal

Bayesian negative binomial modelling of spatial and temporal patterns of road traffic deaths in Ghana.

Scientific reports·2026
See all related articles

Related Experiment Video

Updated: Jan 21, 2026

Light-driven Molecular Motors on Surfaces for Single Molecular Imaging
08:40

Light-driven Molecular Motors on Surfaces for Single Molecular Imaging

Published on: March 13, 2019

12.0K

Audio-driven single image talking face animation with transformers.

Yixin Li1, Xizhong Shen2

  • 1Department of Intelligent Technology, Shanghai Institute of Technology, 100 Haiquan Road, Shanghai, 201418, Fengxian District, China.

Scientific Reports
|January 19, 2026
PubMed
Summary
This summary is machine-generated.

ExpNet generates realistic talking-head videos by decoupling head motion from facial expressions. This Transformer-based approach improves lip synchronization and video quality, even with emotional variations.

Keywords:
Audio-driven video generationExpression regressionTalking-head synthesisTransformer-based model

More Related Videos

Investigating the Effect of Visual Imagery and Learning Shape-Audio Regularities on Bouba and Kiki
07:31

Investigating the Effect of Visual Imagery and Learning Shape-Audio Regularities on Bouba and Kiki

Published on: September 13, 2019

10.5K
Computer-Generated Animal Model Stimuli
26:43

Computer-Generated Animal Model Stimuli

Published on: July 29, 2007

11.3K

Related Experiment Videos

Last Updated: Jan 21, 2026

Light-driven Molecular Motors on Surfaces for Single Molecular Imaging
08:40

Light-driven Molecular Motors on Surfaces for Single Molecular Imaging

Published on: March 13, 2019

12.0K
Investigating the Effect of Visual Imagery and Learning Shape-Audio Regularities on Bouba and Kiki
07:31

Investigating the Effect of Visual Imagery and Learning Shape-Audio Regularities on Bouba and Kiki

Published on: September 13, 2019

10.5K
Computer-Generated Animal Model Stimuli
26:43

Computer-Generated Animal Model Stimuli

Published on: July 29, 2007

11.3K

Area of Science:

  • Computer Vision
  • Artificial Intelligence
  • Human-Computer Interaction

Background:

  • Audio-driven talking-head video generation is vital for virtual humans and digital content.
  • Current methods struggle with unnatural lip movements and facial distortions, especially during emotional expressions.
  • These issues stem from audio signals entangling linguistic content, emotion, and speaker attributes.

Purpose of the Study:

  • To propose ExpNet, a novel framework for realistic audio-driven talking-head video generation.
  • To address limitations in existing methods regarding expression realism and lip synchronization.
  • To enhance the quality of generated videos, particularly under exaggerated expressions and emotional variations.

Main Methods:

  • Utilized a Transformer-based expression regression framework (ExpNet) employing 3DMM coefficients.
  • Implemented a conditional Variational Autoencoder (VAE) for head pose coefficient generation.
  • Employed a CNN-Transformer architecture with ALiBi-based relative positional bias for expression coefficient regression, conditioned on the first frame.

Main Results:

  • ExpNet demonstrated superior performance over existing methods in expression realism, lip synchronization, and video quality across multiple datasets (HDTF, MEAD, LRS3).
  • Ablation studies confirmed the importance of ALiBi, landmark supervision, and the Transformer module for temporal stability and reduced lip jitter.
  • The framework successfully preserved identity and emotion consistency throughout the generated videos.

Conclusions:

  • ExpNet effectively generates high-quality talking-head videos by decoupling head motion and facial expressions.
  • The proposed method offers significant improvements in realism and synchronization compared to prior approaches.
  • Key components like ALiBi and Transformer modules are critical for robust and consistent facial animation.