Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Vision01:24

Vision

53.1K
Vision is the result of light being detected and transduced into neural signals by the retina of the eye. This information is then further analyzed and interpreted by the brain. First, light enters the front of the eye and is focused by the cornea and lens onto the retina—a thin sheet of neural tissue lining the back of the eye. Because of refraction through the convex lens of the eye, images are projected onto the retina upside-down and reversed.
53.1K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Retraction Note: Artificial intelligence in disease diagnosis: a systematic literature review, synthesizing framework and future research agenda.

Journal of ambient intelligence and humanized computing·2026
Same author

Omics in mini-livestock: a genomic perspective on the future of sustainable food systems.

Frontiers in genetics·2026
Same author

Schizophrenia detection from electroencephalogram signals using image encoding and wrapper-based deep feature selection approach.

Scientific reports·2025
Same author

A Comparative Study of Machine Learning and Deep Learning Models for Automatic Parkinson's Disease Detection from Electroencephalogram Signals.

Diagnostics (Basel, Switzerland)·2025
Same author

A fuzzy rank-based deep ensemble methodology for multi-class skin cancer classification.

Scientific reports·2025
Same author

A Case of Post-Partum Afebrile Perforated Appendicitis: A Diagnostic Dilemma.

Clinical case reports·2025
Same journal

Therapeutic potential of crude protein extracts from two Egyptian freshwater snails Lanistes carinatus and Bellamya unicolor.

Scientific reports·2026
Same journal

Microbial contamination of donor corneas and post-keratoplasty endophthalmitis: a comparison between Japanese and U.S. eye banks using cold storage.

Scientific reports·2026
Same journal

Prevalence and contributing factors of virological non-suppression among adult patients on first-line antiretroviral therapy in tertiary hospitals in Ethiopia.

Scientific reports·2026
Same journal

An in vitro comparison of color stability between alkasite and different restorative materials in various staining solutions.

Scientific reports·2026
Same journal

Toward accessible mRNA LNP formulation: systematic evaluation of mixing strategies and key parameters.

Scientific reports·2026
Same journal

A network analysis of personality traits, mentalizing, and psychological health in Chinese college students.

Scientific reports·2026
See all related articles

Related Experiment Video

Updated: Jun 24, 2025

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images
04:23

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images

Published on: April 21, 2023

1.8K

Implementing vision transformer for classifying 2D biomedical images.

Arindam Halder1, Sanghita Gharami1, Priyangshu Sadhu1

  • 1Department of Information Technology, Jadavpur University, Jadavpur University Salt Lake Campus, Plot No. 8, Salt Lake Bypass, LB Block, Sector III, Kolkata, West Bengal, 700106, India.

Scientific Reports
|May 31, 2024
PubMed
Summary
This summary is machine-generated.

Vision Transformer (ViT) models show strong performance in medical image classification tasks. This study achieved new benchmarks on BloodMNIST, BreastMNIST, PathMNIST, and RetinaMNIST datasets, demonstrating ViT

Keywords:
Biomedical image classificationBloodMNISTBreastMNISTDeep learningMedMNISTv2PathMNISTRetinaMNISTVision transformer

More Related Videos

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique
04:48

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique

Published on: July 5, 2024

388
Author Spotlight: Insights into Visual Cortex Research Through Wide-View fMRI Mapping
07:11

Author Spotlight: Insights into Visual Cortex Research Through Wide-View fMRI Mapping

Published on: December 8, 2023

1.5K

Related Experiment Videos

Last Updated: Jun 24, 2025

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images
04:23

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images

Published on: April 21, 2023

1.8K
Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique
04:48

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique

Published on: July 5, 2024

388
Author Spotlight: Insights into Visual Cortex Research Through Wide-View fMRI Mapping
07:11

Author Spotlight: Insights into Visual Cortex Research Through Wide-View fMRI Mapping

Published on: December 8, 2023

1.5K

Area of Science:

  • Medical Imaging
  • Artificial Intelligence
  • Computer Vision

Background:

  • The rapid increase in medical imaging data necessitates advanced machine learning algorithms for healthcare applications.
  • Accurate classification of biomedical images is critical for disease diagnosis and treatment planning.
  • The MedMNISTv2 dataset provides a diverse benchmark for evaluating 2D medical image classification models.

Purpose of the Study:

  • To analyze the efficiency of the Vision Transformer (ViT) model on diverse medical imaging modalities within the MedMNISTv2 dataset.
  • To assess ViT's capability in capturing intricate patterns for medical image classification.
  • To establish new benchmark accuracies for BloodMNIST, BreastMNIST, PathMNIST, and RetinaMNIST datasets using ViT.

Main Methods:

  • Selected four subsets from MedMNISTv2: BloodMNIST, BreastMNIST, PathMNIST, and RetinaMNIST, chosen for their diverse modalities and sample sizes.
  • Pre-processed input images for model training.
  • Trained the ViT-base-patch16-224 model on the selected datasets and evaluated performance using key metrics.

Main Results:

  • Achieved new benchmark accuracies: 97.90% for BloodMNIST, 90.38% for BreastMNIST, 94.62% for PathMNIST, and 57% for RetinaMNIST.
  • Demonstrated the Vision Transformer model's effectiveness in classifying diverse medical imaging data.
  • The model substantially transcended existing benchmark metrics.

Conclusions:

  • Vision Transformer models show significant promise for medical image analysis and classification.
  • These findings support the adoption of ViT models in healthcare to enhance diagnostic accuracy.
  • Further exploration of ViT models can aid medical professionals in clinical decision-making.