Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Vision

Vision

Vision is the result of light being detected and transduced into neural signals by the retina of the eye. This information is then further analyzed and interpreted by the brain. First, light enters the front of the eye and is focused by the cornea and lens onto the retina—a thin sheet of neural tissue lining the back of the eye. Because of refraction through the convex lens of the eye, images are projected onto the retina upside-down and reversed.

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Exploring the Real-World Effectiveness of Empagliflozin and Linagliptin Fixed-Dose Combination in Type 2 Diabetes Patients of Bangladesh: A Prospective Multi-Center Observational Study.

Diabetes, metabolic syndrome and obesity : targets and therapy·2025

Same author

NeuroNet-AD: A Multimodal Deep Learning Framework for Multiclass Alzheimer's Disease Diagnosis.

Bioengineering (Basel, Switzerland)·2025

Same author

MicroAIbiome: Decoding Cancer Types from Microbial Profiles Using Explainable Machine Learning.

Microorganisms·2025

Same author

Adventitious Pulmonary Sound Detection: Leveraging SHAP Explanations and Gradient Boosting Insights.

Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference·2025

Same author

Photobiomodulation Therapy: Survey and Principal Study Leading to Design Rules for Implants.

IEEE transactions on bio-medical engineering·2025

Same author

Integrated Gene Expression Data-Driven Identification of Molecular Signatures, Prognostic Biomarkers, and Drug Targets for Glioblastoma.

BioMed research international·2024

Same journal

AdaWGAN: Data Augmentation for Few-Shot HD-sEMG Gesture Recognition Using Single-Trial Data.

IEEE journal of biomedical and health informatics·2026

Same journal

NeuroBooster: a domain-informed self-supervised learning paradigm tailored for brain MRI analysis.

IEEE journal of biomedical and health informatics·2026

Same journal

Graph Convolutional Neural Network based Depression Detection using Brain Functional Connectivity Measures.

IEEE journal of biomedical and health informatics·2026

Same journal

Improving Multi-Sensor Non-Invasive Glucose Detection through AI: A Domain Generalization Approach.

IEEE journal of biomedical and health informatics·2026

Same journal

Unmixing the Neck: Accurate Jugular Venous Pulse Detection From Wearable PPG.

IEEE journal of biomedical and health informatics·2026

Same journal

AD-DAE: Alzheimer's Disease Progression Modeling with Unpaired Longitudinal MRI using Diffusion Auto-Encoders.

IEEE journal of biomedical and health informatics·2026

See all related articles

Search research articles

Related Experiment Video

Updated: May 16, 2025

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

Published on: November 30, 2022

Text-Assisted Vision Model for Medical Image Segmentation.

Md Motiur Rahman, Saeka Rahman, Smriti Bhatt

IEEE Journal of Biomedical and Health Informatics

|May 14, 2025

Summary

This summary is machine-generated.

This study introduces a text-assisted vision (TAV) model for enhanced medical image segmentation. The novel triguided attention module (TGAM) improves segmentation accuracy by effectively integrating image and text data.

More Related Videos

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique

Published on: July 5, 2024

Objectification of Tongue Diagnosis in Traditional Medicine, Data Analysis, and Study Application

Objectification of Tongue Diagnosis in Traditional Medicine, Data Analysis, and Study Application

Published on: April 14, 2023

Related Experiment Videos

Last Updated: May 16, 2025

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

Published on: November 30, 2022

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique

Published on: July 5, 2024

Objectification of Tongue Diagnosis in Traditional Medicine, Data Analysis, and Study Application

Objectification of Tongue Diagnosis in Traditional Medicine, Data Analysis, and Study Application

Published on: April 14, 2023

Area of Science:

Medical Imaging
Artificial Intelligence
Computer Vision

Background:

Accurate medical image segmentation is crucial for automated diagnosis and treatment planning.
Deep learning models primarily rely on image data, often overlooking valuable information in text reports.
Existing attention mechanisms struggle with cross-modal alignment, limiting performance in multi-modal scenarios.

Purpose of the Study:

To develop a novel text-assisted vision (TAV) model for improved medical image segmentation.
To introduce a triguided attention module (TGAM) for effective cross-modal feature learning.
To enhance segmentation precision by leveraging both visual and textual data.

Main Methods:

Proposed a text-assisted vision (TAV) model incorporating a novel triguided attention module (TGAM).
TGAM computes visual-visual, language-language, and language-visual attention for feature correlation.
An attention gate (AG) was used to modulate TGAM's influence, preventing information overflow.

Main Results:

The TAV model achieved state-of-the-art performance on two medical image segmentation datasets.
TAV demonstrated performance improvements of 2-7% compared to existing models.
Extensive experiments validated the effectiveness of individual components within the TAV model.

Conclusions:

The TAV model represents a significant advancement in multi-modal medical image segmentation.
Integrating text reports via TGAM substantially enhances segmentation accuracy.
The proposed approach offers a promising direction for leveraging multi-modal data in medical AI.