Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Protein Networks02:26

Protein Networks

An organism can have thousands of different proteins, and these proteins must cooperate to ensure the health of an organism. Proteins bind to other proteins and form complexes to carry out their functions. Many proteins interact with multiple other proteins creating a complex network of protein interactions.
These interactions can be represented through maps depicting protein-protein interaction networks, represented as nodes and edges. Nodes are circles that are representative of a protein,...
Improving Translational Accuracy02:07

Improving Translational Accuracy

Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...
Protein Networks02:26

Protein Networks

An organism can have thousands of different proteins, and these proteins must cooperate to ensure the health of an organism. Proteins bind to other proteins and form complexes to carry out their functions. Many proteins interact with multiple other proteins creating a complex network of protein interactions.
These interactions can be represented through maps depicting protein-protein interaction networks, represented as nodes and edges. Nodes are circles that are representative of a protein,...
Improving Translational Accuracy02:07

Improving Translational Accuracy

Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Author Correction: Increasing the efficiency and targeting range of cytidine base editors through fusion of a single-stranded DNA-binding protein domain.

Nature cell biology·2026
Same author

CAR-NK cell therapy for hematologic malignancies: advances, challenges and optimization strategies.

Molecular cancer·2026
Same author

Letter to the Editor regarding the article 'Development and validation of an explainable machine learning model for predicting occult lymph node metastasis in early-stage oral tongue squamous cell carcinoma: A multi-center study'.

International journal of surgery (London, England)·2026
Same author

Immunotherapy-related cardiovascular toxicity: from mechanisms to management.

Frontiers in pharmacology·2026
Same author

Engineered Un1Cas12f1 for multiplex genome editing with enhanced activity and targeting scope.

Nature communications·2026
Same author

Methodological Considerations on the Association Between HPV Vaccination and Risk of Immune-Mediated Diseases.

QJM : monthly journal of the Association of Physicians·2025
Same journal

DARUMA: a gateway to fast and easy prediction of intrinsically disordered regions.

PeerJ. Computer science·2026
Same journal

Alzheimer's disease detection using a quantum deep neural network with Haralick feature extraction and simulated annealing optimization.

PeerJ. Computer science·2026
Same journal

Network anomaly detection using Deep Autoencoder and parallel Artificial Bee Colony algorithm-trained neural network.

PeerJ. Computer science·2026
Same journal

An anomaly detection model for multivariate time series with anomaly perception.

PeerJ. Computer science·2026
Same journal

Retraction: A wormhole attack detection method for tactical wireless sensor networks.

PeerJ. Computer science·2026
Same journal

Evaluation of mental disorder with prioritization of its type by utilizing the bipolar complex fuzzy decision-making approach based on Schweizer-Sklar prioritized aggregation operators.

PeerJ. Computer science·2026
See all related articles

Related Experiment Video

Updated: May 10, 2026

Deep Neural Networks for Image-Based Dietary Assessment
13:19

Deep Neural Networks for Image-Based Dietary Assessment

Published on: March 13, 2021

8.9K

Novel cross-dimensional coarse-fine-grained complementary network for image-text matching.

Meizhen Liu1,2, Anis Salwa Mohd Khairuddin1, Khairunnisa Hasikin3

  • 1Department of Electrical Engineering, Faculty of Engineering, Universiti Malaya, Kuala Lumpur, Malaysia.

Peerj. Computer Science
|March 10, 2025
PubMed
Summary
This summary is machine-generated.

This study introduces a new Cross-Dimensional Coarse-Fine-Grained Complementary Network (CDGCN) to improve image-text matching by addressing the semantic gap. The CDGCN enhances multimodal understanding by integrating fine-grained and coarse-grained feature alignment.

Keywords:
ComplementaryCross-dimensionalImage-text matchingSemantic aggregationSemantic consistency

More Related Videos

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
03:31

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

450
Author Spotlight: Insights into Visual Cortex Research Through Wide-View fMRI Mapping
07:11

Author Spotlight: Insights into Visual Cortex Research Through Wide-View fMRI Mapping

Published on: December 8, 2023

1.4K

Related Experiment Videos

Last Updated: May 10, 2026

Deep Neural Networks for Image-Based Dietary Assessment
13:19

Deep Neural Networks for Image-Based Dietary Assessment

Published on: March 13, 2021

8.9K
Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
03:31

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

450
Author Spotlight: Insights into Visual Cortex Research Through Wide-View fMRI Mapping
07:11

Author Spotlight: Insights into Visual Cortex Research Through Wide-View fMRI Mapping

Published on: December 8, 2023

1.4K

Area of Science:

  • Computer Science
  • Artificial Intelligence
  • Multimodal Machine Learning

Background:

  • Multimodal applications like image-text matching face challenges due to the heterogeneity gap between visual and textual data.
  • Existing methods often fail to integrate fine-grained word-region matching with coarse-grained image-text matching and ignore feature dimension differences.

Purpose of the Study:

  • To propose a novel network, the Cross-Dimensional Coarse-Fine-Grained Complementary Network (CDGCN), to overcome limitations in current image-text matching techniques.
  • To enhance semantic consistency and holistic understanding in multimodal applications by bridging the gap between local and global feature representations.

Main Methods:

  • The CDGCN employs fine-grained semantic alignment of image regions and text words using cross-dimensional dependencies.
  • A Coarse-Grained Cross-Dimensional Semantic Aggregation (CGDSA) module complements local alignment with global image-text matching, aggregating features across and within dimensions.

Main Results:

  • The CDGCN demonstrated substantial performance improvements on the Flickr30K and MS-COCO datasets.
  • The proposed method achieved performance increments ranging from 7.7% to 16% compared to state-of-the-art methods.

Conclusions:

  • The CDGCN effectively addresses the semantic gap and heterogeneity in multimodal data by integrating complementary matching strategies.
  • The network's ability to preserve semantic integrity through cross-dimensional and coarse-fine-grained feature aggregation leads to significant advancements in image-text matching accuracy.