Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Vision

Vision

Vision is the result of light being detected and transduced into neural signals by the retina of the eye. This information is then further analyzed and interpreted by the brain. First, light enters the front of the eye and is focused by the cornea and lens onto the retina—a thin sheet of neural tissue lining the back of the eye. Because of refraction through the convex lens of the eye, images are projected onto the retina upside-down and reversed.

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Retraction Note: Artificial intelligence in disease diagnosis: a systematic literature review, synthesizing framework and future research agenda.

Journal of ambient intelligence and humanized computing·2026

Same author

Omics in mini-livestock: a genomic perspective on the future of sustainable food systems.

Frontiers in genetics·2026

Same author

Schizophrenia detection from electroencephalogram signals using image encoding and wrapper-based deep feature selection approach.

Scientific reports·2025

Same author

A Comparative Study of Machine Learning and Deep Learning Models for Automatic Parkinson's Disease Detection from Electroencephalogram Signals.

Diagnostics (Basel, Switzerland)·2025

Same author

A fuzzy rank-based deep ensemble methodology for multi-class skin cancer classification.

Scientific reports·2025

Same author

A Case of Post-Partum Afebrile Perforated Appendicitis: A Diagnostic Dilemma.

Clinical case reports·2025

Same journal

Therapeutic potential of crude protein extracts from two Egyptian freshwater snails Lanistes carinatus and Bellamya unicolor.

Scientific reports·2026

Same journal

Microbial contamination of donor corneas and post-keratoplasty endophthalmitis: a comparison between Japanese and U.S. eye banks using cold storage.

Scientific reports·2026

Same journal

Prevalence and contributing factors of virological non-suppression among adult patients on first-line antiretroviral therapy in tertiary hospitals in Ethiopia.

Scientific reports·2026

Same journal

An in vitro comparison of color stability between alkasite and different restorative materials in various staining solutions.

Scientific reports·2026

Same journal

Toward accessible mRNA LNP formulation: systematic evaluation of mixing strategies and key parameters.

Scientific reports·2026

Same journal

A network analysis of personality traits, mentalizing, and psychological health in Chinese college students.

Scientific reports·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jun 24, 2025

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images

Published on: April 21, 2023

Implementing vision transformer for classifying 2D biomedical images.

Arindam Halder¹, Sanghita Gharami¹, Priyangshu Sadhu¹

¹Department of Information Technology, Jadavpur University, Jadavpur University Salt Lake Campus, Plot No. 8, Salt Lake Bypass, LB Block, Sector III, Kolkata, West Bengal, 700106, India.

Scientific Reports

|May 31, 2024

Summary

This summary is machine-generated.

Vision Transformer (ViT) models show strong performance in medical image classification tasks. This study achieved new benchmarks on BloodMNIST, BreastMNIST, PathMNIST, and RetinaMNIST datasets, demonstrating ViT

Keywords:

Biomedical image classification BloodMNIST BreastMNIST Deep learning MedMNISTv2 PathMNIST RetinaMNIST Vision transformer

More Related Videos

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique

Published on: July 5, 2024

Author Spotlight: Insights into Visual Cortex Research Through Wide-View fMRI Mapping

Author Spotlight: Insights into Visual Cortex Research Through Wide-View fMRI Mapping

Published on: December 8, 2023

Related Experiment Videos

Last Updated: Jun 24, 2025

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images

Published on: April 21, 2023

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique

Published on: July 5, 2024

Author Spotlight: Insights into Visual Cortex Research Through Wide-View fMRI Mapping

Author Spotlight: Insights into Visual Cortex Research Through Wide-View fMRI Mapping

Published on: December 8, 2023

Area of Science:

Medical Imaging
Artificial Intelligence
Computer Vision

Background:

The rapid increase in medical imaging data necessitates advanced machine learning algorithms for healthcare applications.
Accurate classification of biomedical images is critical for disease diagnosis and treatment planning.
The MedMNISTv2 dataset provides a diverse benchmark for evaluating 2D medical image classification models.

Purpose of the Study:

To analyze the efficiency of the Vision Transformer (ViT) model on diverse medical imaging modalities within the MedMNISTv2 dataset.
To assess ViT's capability in capturing intricate patterns for medical image classification.
To establish new benchmark accuracies for BloodMNIST, BreastMNIST, PathMNIST, and RetinaMNIST datasets using ViT.

Main Methods:

Selected four subsets from MedMNISTv2: BloodMNIST, BreastMNIST, PathMNIST, and RetinaMNIST, chosen for their diverse modalities and sample sizes.
Pre-processed input images for model training.
Trained the ViT-base-patch16-224 model on the selected datasets and evaluated performance using key metrics.

Main Results:

Achieved new benchmark accuracies: 97.90% for BloodMNIST, 90.38% for BreastMNIST, 94.62% for PathMNIST, and 57% for RetinaMNIST.
Demonstrated the Vision Transformer model's effectiveness in classifying diverse medical imaging data.
The model substantially transcended existing benchmark metrics.

Conclusions:

Vision Transformer models show significant promise for medical image analysis and classification.
These findings support the adoption of ViT models in healthcare to enhance diagnostic accuracy.
Further exploration of ViT models can aid medical professionals in clinical decision-making.