Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Multi-input and Multi-variable systems01:22

Multi-input and Multi-variable systems

132
Cruise control systems in cars are designed as multi-input systems to maintain a driver's desired speed while compensating for external disturbances such as changes in terrain. The block diagram for a cruise control system typically includes two main inputs: the desired speed set by the driver and any external disturbances, such as the incline of the road. By adjusting the engine throttle, the system maintains the vehicle's speed as close to the desired value as possible.
In the absence...
132
High-Level and Low-Level Awareness01:19

High-Level and Low-Level Awareness

293
Controlled processes in human consciousness represent high-alert mental states where individuals deliberately focus their attention on achieving specific goals. Controlled processes can be seen in situations like mastering new technology, where a person might become so absorbed that they ignore surrounding distractions. Such processes involve selective attention, requiring one to concentrate on particular elements of experience while disregarding others. These are governed by executive...
293
Levels of Organization01:09

Levels of Organization

124.4K
Biological organization is the classification of biological structures, ranging from atoms at the bottom of the hierarchy to the Earth's biosphere. Each level of the hierarchy represents an increase in complexity that builds upon the previous level.
Molecules Are Composed of Atoms, and Biomolecules Are Assembled from Molecules:
The most basic levels include atoms, molecules, and biomolecules. Atoms, the smallest unit of ordinary matter, are composed of a nucleus and electrons. Molecules...
124.4K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Multi-center study of the recombinant cascade reagent (kinetic chromogenic assay) as an alternative method for bacterial endotoxin testing: Method validation, product suitability, and consistency evaluation with limulus amebocyte lysate.

Journal of pharmaceutical and biomedical analysis·2026
Same author

<i>In situ</i> synthesis of a binder-free MoS<sub>2</sub>/FeOOH@CC heterostructure as a photoelectrode for high-performance photo-rechargeable zinc-ion batteries.

Nanoscale·2026
Same author

Mechanistic insight into low-temperature alkylation of thiophene with styrene over zeolites MCM-22 and β.

Communications chemistry·2026
Same author

Boosting Nitrogen Selectivity in Photocatalytic Nitrate Reduction via Radical-Mediated Pathway over Cu-Ag Alloy Nanoclusters.

The journal of physical chemistry letters·2026
Same author

Design of PEDOT Derivative Films with Reduced Hygroscopicity by Dual-Sulfonated Polymer for Stable Si Hybrid Solar Cells.

The journal of physical chemistry letters·2026
Same author

Toward clean and efficient 3D Gaussian representations.

Applied optics·2026
Same journal

DSPE-ViT: a lightweight vision transformer with dynamic sparse positional encoding for dense small object detection in UAV imagery.

Frontiers in neurorobotics·2026
Same journal

ST-HONet: Spatio-Temporal Hierarchical Network for long-horizon bimanual visuomotor imitation.

Frontiers in neurorobotics·2026
Same journal

ST-HADP: Spatio-Temporal hierarchical attention diffusion policy for long-horizon generalizable bimanual visuomotor imitation.

Frontiers in neurorobotics·2026
Same journal

EQISP: efficient quantized image signal processing with multi-scale pyramid fusion for resource constrained embodied perception.

Frontiers in neurorobotics·2026
Same journal

Research on embodied agent multimodal perception and real-time path planning algorithms for complex unstructured environments.

Frontiers in neurorobotics·2026
Same journal

NL-YOLOv5: a model with a larger receptive field and the ability to globally acquire features.

Frontiers in neurorobotics·2026
See all related articles

Related Experiment Video

Updated: Jul 24, 2025

Author Spotlight: Advancing Large-Scale Neural Dynamics Through HD-MEA Technology
09:44

Author Spotlight: Advancing Large-Scale Neural Dynamics Through HD-MEA Technology

Published on: March 8, 2024

4.9K

MLNet: a multi-level multimodal named entity recognition architecture.

Hanming Zhai1, Xiaojun Lv2, Zhiwen Hou1

  • 1School of Information Network Security, People's Public Security University of China, Beijing, China.

Frontiers in Neurorobotics
|July 6, 2023
PubMed
Summary
This summary is machine-generated.

This study introduces a novel multimodal named entity recognition (MNER) architecture to improve object identification accuracy. The model enhances semantic understanding by filtering visual information and reducing text noise for better robot interaction.

Keywords:
cross taskmulti-head attentionmultimodal named entity recognitionpre-trainingshort text

More Related Videos

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
03:31

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

583
Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention
06:37

Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention

Published on: December 15, 2023

3.9K

Related Experiment Videos

Last Updated: Jul 24, 2025

Author Spotlight: Advancing Large-Scale Neural Dynamics Through HD-MEA Technology
09:44

Author Spotlight: Advancing Large-Scale Neural Dynamics Through HD-MEA Technology

Published on: March 8, 2024

4.9K
Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
03:31

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

583
Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention
06:37

Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention

Published on: December 15, 2023

3.9K

Area of Science:

  • Human-Computer Interaction
  • Artificial Intelligence
  • Natural Language Processing
  • Computer Vision

Background:

  • Accurate identification of talking objects is crucial for robot decision-making and recommendations.
  • Named Entity Recognition (NER) and Object Detection (OD) are fundamental for object recognition in NLP and CV.
  • Existing multimodal approaches show promise but require optimization for noisy, short text-image data in Multimodal Named Entity Recognition (MNER).

Purpose of the Study:

  • To propose a new multi-level multimodal named entity recognition architecture.
  • To enhance semantic understanding and entity identification efficacy in MNER tasks.
  • To address limitations in current image-text-based MNER architectures, particularly with noisy data.

Main Methods:

  • Separate image and text encoding.
  • A symmetric Transformer-based neural network for multimodal feature fusion.
  • A gating mechanism to filter relevant visual information and enhance text understanding.
  • Character-level vector encoding to mitigate text noise.
  • Conditional Random Fields for label classification.

Main Results:

  • The proposed model demonstrated increased accuracy in the MNER task on the Twitter dataset.
  • Effective filtering of visual information improved semantic disambiguation.
  • Character-level encoding reduced the impact of text noise on recognition accuracy.

Conclusions:

  • The novel multi-level multimodal architecture significantly improves MNER task performance.
  • The gating mechanism and character-level encoding are effective strategies for handling noisy multimodal data.
  • This work advances the capabilities of robots in understanding and interacting with their environment through improved object recognition.