Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Stream Function01:20

Stream Function

2.1K
In two-dimensional incompressible fluid flow, the continuity equation is essential for ensuring mass conservation, meaning that any change in fluid entering or exiting a region is balanced by a corresponding change elsewhere. For incompressible flow, where density remains constant, this requirement simplifies to the condition that the divergence of the velocity field must be zero. Mathematically, this is expressed as,
2.1K
Steady Flow of a Fluid Stream01:27

Steady Flow of a Fluid Stream

690
Consider a control volume, such as a pipe with solid boundaries, through which fluid flows and changes direction due to the impulse exerted by the resulting force from the pipe walls. In steady flow, the mass of fluid entering the control volume at a given time, t, with velocity v1, is equal to the mass leaving after infinitesimal time dt, with velocity v2.
During this process, the momentum of the fluid within the control volume remains constant over the time interval dt. By applying the...
690
Polymer Classification: Architecture01:14

Polymer Classification: Architecture

3.7K
Polymers are classified as linear or branched on the basis of their chain architecture. The polymer chains in linear polymers have a long chain-like structure with minimal to no branching at all. Even if a polymer features large substituent groups on the monomer, which appear as branches to the skeleton, it is not considered a branched polymer. A branched polymer contains secondary polymer chains that arise from the main polymer chain. The branching occurs when the polymer growth shifts from...
3.7K
Neural Regulation01:37

Neural Regulation

43.3K
Digestion begins with a cephalic phase that prepares the digestive system to receive food. When our brain processes visual or olfactory information about food, it triggers impulses in the cranial nerves innervating the salivary glands and stomach to prepare for food.
43.3K
ATP Driven Pumps I: An Overview01:27

ATP Driven Pumps I: An Overview

9.8K
ATP-driven pumps, also known as transport ATPases, are integral membrane proteins. They have binding sites for ATP located on the membrane's cytosolic side and the ion-conducting domain in the transmembrane region. These pumps use the free energy released from ATP hydrolysis to move the solutes across cell membranes against an electrochemical gradient.
There are four main types of ATP-driven pumps - P-type, V-type, F-type, and ABC transporter. All these pumps are of varying complexities and...
9.8K
Predator-Prey Interactions02:39

Predator-Prey Interactions

21.2K
Predators consume prey for energy. Predators that acquire prey and prey that avoid predation both increase their chances of survival and reproduction (i.e., fitness). Routine predator-prey interactions elicit mutual adaptations that improve predator offenses, such as claws, teeth, and speed, as well as prey defenses, including crypsis, aposematism, and mimicry. Thus, predator-prey interactions resemble an evolutionary arms race.
21.2K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Enhancing Bone Conduction Sensor Signals via Self-Supervised Acoustic Priors and Key-Value Memory.

Sensors (Basel, Switzerland)·2026
Same author

Enhancing Emotion-Brain Representations With Orthogonal Fuzzy Power-Coherence Alignment.

Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference·2025
Same author

Self-Supervised Contrastive Pre-Training for EEG-Based Recognition via Cross Device Representation Consistency.

IEEE transactions on bio-medical engineering·2025
Same author

Recent Advances in Portable Dry Electrode EEG: Architecture and Applications in Brain-Computer Interfaces.

Sensors (Basel, Switzerland)·2025
Same author

MsDUNE: A multi-scale masked temporal fusion framework for speaker-independent lipreading via Dirichlet uncertainty estimation.

Neural networks : the official journal of the International Neural Network Society·2025
Same author

PanoGen++: Domain-adapted text-guided panoramic environment generation for vision-and-language navigation.

Neural networks : the official journal of the International Neural Network Society·2025
Same journal

Aggregating global-scale pixel-wise forgery cues within a graph.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Finite-Time intermittent control for secure synchronization of Neutral-Type stochastic delayed neural networks under aperiodic DoS attacks.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

FedCAD: Cross-modal semantic alignment and distillation for cross-domain heterogeneous federated learning.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Partial-encryption-decryption-based secure state estimation of singularly perturbed complex networks: A Paillier encryption approach.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

ResVaRe: Parameter-efficient fine-tuning for large language models via cross-layer residual vector adaptation and representation editing.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Brain network construction and analysis for epilepsy: A methodology review.

Neural networks : the official journal of the International Neural Network Society·2026
See all related articles

Related Experiment Video

Updated: Jan 25, 2026

Interaction between Phonological and Semantic Processes in Visual Word Recognition using Electrophysiology
05:38

Interaction between Phonological and Semantic Processes in Visual Word Recognition using Electrophysiology

Published on: June 29, 2021

2.8K

Sequential viseme-driven visual speech recognition through dual-stream interactive neural architecture.

Hao Yuan1, Yakun Zhang2, Xingyu Zhang2

  • 1School of Advanced Manufacturing and Robotics, Peking University, 100871, Beijing, China; Defense Innovation Institute, Academy of Military Sciences, 100071, Beijing, China; Intelligent Game and Decision Laboratory, 100071, Beijing, China.

Neural Networks : the Official Journal of the International Neural Network Society
|January 23, 2026
PubMed
Summary
This summary is machine-generated.

This study introduces a novel dual-stream architecture for lipreading that enhances fine-grained visual feature extraction using sequential viseme knowledge. The approach achieves state-of-the-art results on multiple datasets, improving accuracy and robustness in visual speech recognition.

Keywords:
Coarse and fine-grained interactionComputer visionDual-stream interactive neural networkLipreadingSequential visemeVisual speech recognition

More Related Videos

Dual-color Correlative Light and Electron Microscopy for the Visualization of Interactions between Mitochondria and Lysosomes
10:25

Dual-color Correlative Light and Electron Microscopy for the Visualization of Interactions between Mitochondria and Lysosomes

Published on: September 27, 2024

1.1K
Ultrasound Images of the Tongue: A Tutorial for Assessment and Remediation of Speech Sound Errors
08:32

Ultrasound Images of the Tongue: A Tutorial for Assessment and Remediation of Speech Sound Errors

Published on: January 3, 2017

23.1K

Related Experiment Videos

Last Updated: Jan 25, 2026

Interaction between Phonological and Semantic Processes in Visual Word Recognition using Electrophysiology
05:38

Interaction between Phonological and Semantic Processes in Visual Word Recognition using Electrophysiology

Published on: June 29, 2021

2.8K
Dual-color Correlative Light and Electron Microscopy for the Visualization of Interactions between Mitochondria and Lysosomes
10:25

Dual-color Correlative Light and Electron Microscopy for the Visualization of Interactions between Mitochondria and Lysosomes

Published on: September 27, 2024

1.1K
Ultrasound Images of the Tongue: A Tutorial for Assessment and Remediation of Speech Sound Errors
08:32

Ultrasound Images of the Tongue: A Tutorial for Assessment and Remediation of Speech Sound Errors

Published on: January 3, 2017

23.1K

Area of Science:

  • Computer Science
  • Artificial Intelligence
  • Machine Learning

Background:

  • Current deep neural network lipreading methods struggle with fine-grained feature extraction, impacting articulatory detail capture.
  • Preserving overall semantics often compromises the extraction of crucial local visual features in lipreading models.

Purpose of the Study:

  • To introduce a novel paradigm for sentence-level lipreading using sequential viseme knowledge and a dual-stream architecture.
  • To address the limitation of fine-grained feature extraction in existing lipreading models.
  • To enhance the accuracy and robustness of visual speech recognition systems.

Main Methods:

  • Conceptualization of sequential viseme knowledge and development of a dual-stream architecture.
  • Integration of sequential viseme dynamics to enhance frame and segment attention.
  • Facilitation of interaction between character and viseme granularity information through multiple pathways.

Main Results:

  • Achieved state-of-the-art performance on multiple sentence-level lipreading datasets (GRID, CMLR, LRS2, LRS3).
  • Demonstrated superior performance with word error rates (WER) as low as 0.6% on GRID and character error rates (CER) of 9.9% on CMLR.
  • Showcased robustness to large-scale pretraining and data corruption, highlighting the generalizability of viseme knowledge.

Conclusions:

  • The proposed dual-stream architecture effectively addresses fine-grained feature preservation in lipreading.
  • Sequential viseme knowledge is crucial for bridging cross-lingual gaps and enhancing visual speech recognition.
  • The method demonstrates significant advantages over current top-performing models, with potential for further research and application.