Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Stream Function

Stream Function

In two-dimensional incompressible fluid flow, the continuity equation is essential for ensuring mass conservation, meaning that any change in fluid entering or exiting a region is balanced by a corresponding change elsewhere. For incompressible flow, where density remains constant, this requirement simplifies to the condition that the divergence of the velocity field must be zero. Mathematically, this is expressed as,

Steady Flow of a Fluid Stream

Steady Flow of a Fluid Stream

Consider a control volume, such as a pipe with solid boundaries, through which fluid flows and changes direction due to the impulse exerted by the resulting force from the pipe walls. In steady flow, the mass of fluid entering the control volume at a given time, t, with velocity v1, is equal to the mass leaving after infinitesimal time dt, with velocity v2.
During this process, the momentum of the fluid within the control volume remains constant over the time interval dt. By applying the...

Polymer Classification: Architecture

Polymer Classification: Architecture

Polymers are classified as linear or branched on the basis of their chain architecture. The polymer chains in linear polymers have a long chain-like structure with minimal to no branching at all. Even if a polymer features large substituent groups on the monomer, which appear as branches to the skeleton, it is not considered a branched polymer. A branched polymer contains secondary polymer chains that arise from the main polymer chain. The branching occurs when the polymer growth shifts from...

Neural Regulation

Neural Regulation

Digestion begins with a cephalic phase that prepares the digestive system to receive food. When our brain processes visual or olfactory information about food, it triggers impulses in the cranial nerves innervating the salivary glands and stomach to prepare for food.

ATP Driven Pumps I: An Overview

ATP Driven Pumps I: An Overview

ATP-driven pumps, also known as transport ATPases, are integral membrane proteins. They have binding sites for ATP located on the membrane's cytosolic side and the ion-conducting domain in the transmembrane region. These pumps use the free energy released from ATP hydrolysis to move the solutes across cell membranes against an electrochemical gradient.
There are four main types of ATP-driven pumps - P-type, V-type, F-type, and ABC transporter. All these pumps are of varying complexities and...

Predator-Prey Interactions

Predator-Prey Interactions

Predators consume prey for energy. Predators that acquire prey and prey that avoid predation both increase their chances of survival and reproduction (i.e., fitness). Routine predator-prey interactions elicit mutual adaptations that improve predator offenses, such as claws, teeth, and speed, as well as prey defenses, including crypsis, aposematism, and mimicry. Thus, predator-prey interactions resemble an evolutionary arms race.

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Enhancing Bone Conduction Sensor Signals via Self-Supervised Acoustic Priors and Key-Value Memory.

Sensors (Basel, Switzerland)·2026

Same author

Enhancing Emotion-Brain Representations With Orthogonal Fuzzy Power-Coherence Alignment.

Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference·2025

Same author

Self-Supervised Contrastive Pre-Training for EEG-Based Recognition via Cross Device Representation Consistency.

IEEE transactions on bio-medical engineering·2025

Same author

Recent Advances in Portable Dry Electrode EEG: Architecture and Applications in Brain-Computer Interfaces.

Sensors (Basel, Switzerland)·2025

Same author

MsDUNE: A multi-scale masked temporal fusion framework for speaker-independent lipreading via Dirichlet uncertainty estimation.

Neural networks : the official journal of the International Neural Network Society·2025

Same author

PanoGen++: Domain-adapted text-guided panoramic environment generation for vision-and-language navigation.

Neural networks : the official journal of the International Neural Network Society·2025

Same journal

Aggregating global-scale pixel-wise forgery cues within a graph.

Neural networks : the official journal of the International Neural Network Society·2026

Same journal

Finite-Time intermittent control for secure synchronization of Neutral-Type stochastic delayed neural networks under aperiodic DoS attacks.

Neural networks : the official journal of the International Neural Network Society·2026

Same journal

FedCAD: Cross-modal semantic alignment and distillation for cross-domain heterogeneous federated learning.

Neural networks : the official journal of the International Neural Network Society·2026

Same journal

Partial-encryption-decryption-based secure state estimation of singularly perturbed complex networks: A Paillier encryption approach.

Neural networks : the official journal of the International Neural Network Society·2026

Same journal

ResVaRe: Parameter-efficient fine-tuning for large language models via cross-layer residual vector adaptation and representation editing.

Neural networks : the official journal of the International Neural Network Society·2026

Same journal

Brain network construction and analysis for epilepsy: A methodology review.

Neural networks : the official journal of the International Neural Network Society·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jan 25, 2026

Interaction between Phonological and Semantic Processes in Visual Word Recognition using Electrophysiology

Interaction between Phonological and Semantic Processes in Visual Word Recognition using Electrophysiology

Published on: June 29, 2021

Sequential viseme-driven visual speech recognition through dual-stream interactive neural architecture.

Hao Yuan¹, Yakun Zhang², Xingyu Zhang²

¹School of Advanced Manufacturing and Robotics, Peking University, 100871, Beijing, China; Defense Innovation Institute, Academy of Military Sciences, 100071, Beijing, China; Intelligent Game and Decision Laboratory, 100071, Beijing, China.

Neural Networks : the Official Journal of the International Neural Network Society

|January 23, 2026

Summary

This summary is machine-generated.

This study introduces a novel dual-stream architecture for lipreading that enhances fine-grained visual feature extraction using sequential viseme knowledge. The approach achieves state-of-the-art results on multiple datasets, improving accuracy and robustness in visual speech recognition.

Keywords:

Coarse and fine-grained interaction Computer vision Dual-stream interactive neural network Lipreading Sequential viseme Visual speech recognition

More Related Videos

Dual-color Correlative Light and Electron Microscopy for the Visualization of Interactions between Mitochondria and Lysosomes

Dual-color Correlative Light and Electron Microscopy for the Visualization of Interactions between Mitochondria and Lysosomes

Published on: September 27, 2024

Ultrasound Images of the Tongue: A Tutorial for Assessment and Remediation of Speech Sound Errors

Ultrasound Images of the Tongue: A Tutorial for Assessment and Remediation of Speech Sound Errors

Published on: January 3, 2017

Related Experiment Videos

Last Updated: Jan 25, 2026

Interaction between Phonological and Semantic Processes in Visual Word Recognition using Electrophysiology

Interaction between Phonological and Semantic Processes in Visual Word Recognition using Electrophysiology

Published on: June 29, 2021

Dual-color Correlative Light and Electron Microscopy for the Visualization of Interactions between Mitochondria and Lysosomes

Dual-color Correlative Light and Electron Microscopy for the Visualization of Interactions between Mitochondria and Lysosomes

Published on: September 27, 2024

Ultrasound Images of the Tongue: A Tutorial for Assessment and Remediation of Speech Sound Errors

Ultrasound Images of the Tongue: A Tutorial for Assessment and Remediation of Speech Sound Errors

Published on: January 3, 2017

Area of Science:

Computer Science
Artificial Intelligence
Machine Learning

Background:

Current deep neural network lipreading methods struggle with fine-grained feature extraction, impacting articulatory detail capture.
Preserving overall semantics often compromises the extraction of crucial local visual features in lipreading models.

Purpose of the Study:

To introduce a novel paradigm for sentence-level lipreading using sequential viseme knowledge and a dual-stream architecture.
To address the limitation of fine-grained feature extraction in existing lipreading models.
To enhance the accuracy and robustness of visual speech recognition systems.

Main Methods:

Conceptualization of sequential viseme knowledge and development of a dual-stream architecture.
Integration of sequential viseme dynamics to enhance frame and segment attention.
Facilitation of interaction between character and viseme granularity information through multiple pathways.

Main Results:

Achieved state-of-the-art performance on multiple sentence-level lipreading datasets (GRID, CMLR, LRS2, LRS3).
Demonstrated superior performance with word error rates (WER) as low as 0.6% on GRID and character error rates (CER) of 9.9% on CMLR.
Showcased robustness to large-scale pretraining and data corruption, highlighting the generalizability of viseme knowledge.

Conclusions:

The proposed dual-stream architecture effectively addresses fine-grained feature preservation in lipreading.
Sequential viseme knowledge is crucial for bridging cross-lingual gaps and enhancing visual speech recognition.
The method demonstrates significant advantages over current top-performing models, with potential for further research and application.