Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Video

Updated: May 28, 2026

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography
04:48

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

Published on: November 30, 2022

3.6K

D2MNet: Difference-Aware Decoupling and Multi-Prompt Learning for Medical Difference Visual Question Answering.

Lingge Lai1, Weihua Ou1, Jianping Gou2

  • 1School of Big Data and Computer Science, Guizhou Normal University, Guiyang 550025, China.

Journal of Imaging
|April 27, 2026
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

Learning Disabilities01:25

Learning Disabilities

Learning disabilities are cognitive disorders caused by neurological impairments that affect cognitive functions like language and reading, without indicating overall intellectual or developmental challenges. These disabilities differ from global intellectual or developmental disabilities as they are limited to distinct cognitive functions. Common learning disabilities include dysgraphia, dyslexia, and dyscalculia, each of which impacts unique aspects of learning.
Dyslexia
Dyslexia is a...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Evolving classifiers with background suppression transformer for open-set long-tailed class-incremental remote sensing scene classification.

Neural networks : the official journal of the International Neural Network Society·2026
Same author

System log anomaly detection based on contrastive learning and retrieval augmented.

Scientific reports·2025
Same author

Dual-focus memory contrastive learning for active domain adaptation.

Neural networks : the official journal of the International Neural Network Society·2025
Same author

Improved YOLO-Based Pulmonary Nodule Detection with Spatial-SE Attention and an Aspect Ratio Penalty.

Sensors (Basel, Switzerland)·2025
Same author

MHS-VIT: Mamba hybrid self-attention vision transformers for traffic image detection.

PloS one·2025
Same author

Intra-class progressive and adaptive self-distillation.

Neural networks : the official journal of the International Neural Network Society·2025
Same journal

Human-AI Interaction in Interventional Radiology: A Narrative Review of Current Applications, Challenges, and Future Directions.

Journal of imaging·2026
Same journal

Coronary Artery Anomalies and Anatomical Variants: Cross-Sectional Diagnostic Imaging and Clinical Background.

Journal of imaging·2026
Same journal

YoLeTooth: A Unified Framework for Joint Tooth Segmentation and Periapical Lesion Detection in Panoramic Radiographs.

Journal of imaging·2026
Same journal

Radiomics-Guided Multi-Sequence Learning for Pathological Complete Response Prediction from Breast MRI with Missing Auxiliary Sequences.

Journal of imaging·2026
Same journal

Cutaneous Thermography in Arthropathies: Quantitative Imaging, Machine Learning, and Clinical Translation.

Journal of imaging·2026
Same journal

Two-Stage Dynamic Synergistic Segmentation Method for Myocardial Pathology.

Journal of imaging·2026
See all related articles

The D2MNet framework improves medical difference visual question answering (Diff-VQA) by analyzing image changes and using multi-prompt learning for better answers. This approach enhances accuracy in identifying and explaining differences in medical images.

Area of Science:

  • Medical imaging analysis
  • Artificial intelligence in healthcare
  • Computer vision

Background:

  • Difference visual question answering (Diff-VQA) is crucial for identifying and reasoning about discrepancies in medical images.
  • Current Diff-VQA methods struggle with asymmetric descriptions of image changes and lack task-specific guidance for language models.
  • Pretrained language decoders require better integration for effective medical image difference analysis.

Purpose of the Study:

  • To introduce D2MNet, a novel framework for medical Diff-VQA that integrates change-aware reasoning with prompt-guided answer generation.
  • To address limitations in existing methods by improving the modeling of image differences and enhancing language decoder guidance.
  • To enhance the accuracy and contextual relevance of answers generated for medical Diff-VQA tasks.
Keywords:
chest X-raymedical vision question answeringmulti-prompt learning

Related Experiment Videos

Last Updated: May 28, 2026

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography
04:48

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

Published on: November 30, 2022

3.6K

Main Methods:

  • Developed D2MNet, incorporating a Change Analysis Module (CAM) for change detection and prompt generation.
  • Implemented a Difference-Aware Module (DAM) utilizing dual attention for fine-grained feature extraction.
  • Employed a multi-prompt learning mechanism (MLM) to inject diverse prompts into the language decoder for improved generation.

Main Results:

  • D2MNet achieved a CIDEr score of 2.907 ± 0.040 on the MIMIC-DiffVQA benchmark.
  • The proposed framework significantly outperformed the strongest baseline, ReAl (2.409).
  • Experimental results validate the effectiveness of D2MNet's design for medical Diff-VQA.

Conclusions:

  • D2MNet demonstrates superior performance in medical Diff-VQA tasks.
  • The framework's approach to change-aware reasoning and prompt-guided generation is effective.
  • D2MNet shows promise for advancing difference-aware medical answer generation.