Scene Graph and Natural Language-Based Semantic Image Retrieval Using Vision Sensor Data.

Jaehoon Kim¹, Byoung Chul Ko¹

¹Department of Computer Engineering, Keimyung University, Daegu 42601, Republic of Korea.

Sensors (Basel, Switzerland)

September 19, 2025

Summary

This summary is machine-generated.

This study introduces a graph neural network (GNN) approach for text-based image retrieval. It improves retrieval accuracy by comparing semantic graphs built from sentences with scene graphs built from images.

Keywords:

graph neural network; graph similarity learning; scene graph generation; semantic image retrieval; subgraph extraction; vision sensor

Area of Science:

Computer Vision
Artificial Intelligence
Machine Learning

Background:

Text-based image retrieval often uses keyword matching, which struggles with semantic nuances and limited query information.
Existing methods lack accuracy when dealing with complex scenes or novel sentences, failing to capture full contextual meaning.

Purpose of the Study:

To develop a novel approach for text-based image retrieval that overcomes the limitations of keyword matching.
To enhance retrieval accuracy by enabling quantitative comparison between textual descriptions and visual scene content.

Main Methods:

Transforming sentences into semantic graphs and images into scene graphs.
Utilizing a graph neural network (GNN) to learn node and edge features, generating graph embeddings for comparison.
Implementing a contrastive GNN framework with hard negative mining to match semantic and scene graphs.
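
The three steps above can be illustrated with a minimal sketch. It represents a graph as (subject, predicate, object) triples, runs one round of mean-neighbor message passing over node feature vectors, pools the result into a graph embedding, and scores a triplet loss against the hardest negative in a batch. The function names, the toy features, the mean aggregation, the cosine similarity, and the margin value are all assumptions made for illustration; this summary does not describe the paper's actual GNN architecture, loss, or mining strategy.

```python
import math

def embed_graph(triples, features):
    """One round of mean-neighbor message passing, then mean pooling.

    triples:  list of (subject, predicate, object) strings; edge labels are
              ignored in this toy version (an assumption, for brevity).
    features: dict mapping node name -> list of floats (hypothetical inputs).
              Every node named in a triple must have an entry.
    """
    neighbors = {node: [] for node in features}
    for subj, _pred, obj in triples:
        neighbors[subj].append(obj)
        neighbors[obj].append(subj)

    dim = len(next(iter(features.values())))
    updated = {}
    for node, vec in features.items():
        # Isolated nodes fall back to their own feature as the message.
        msgs = [features[n] for n in neighbors[node]] or [vec]
        mean_msg = [sum(m[i] for m in msgs) / len(msgs) for i in range(dim)]
        # Average the node's own feature with the aggregated message.
        updated[node] = [(vec[i] + mean_msg[i]) / 2 for i in range(dim)]

    # Mean-pool node states into a single graph embedding.
    return [sum(v[i] for v in updated.values()) / len(updated) for i in range(dim)]


def cosine(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def hardest_negative_triplet_loss(query, positive, negatives, margin=0.2):
    """Triplet loss using the negative most similar to the query."""
    hardest = max(cosine(query, neg) for neg in negatives)
    return max(0.0, margin + hardest - cosine(query, positive))


# Toy usage: a semantic graph for "man riding horse" with made-up features.
features = {"man": [1.0, 0.0], "horse": [0.0, 1.0]}
query_emb = embed_graph([("man", "riding", "horse")], features)
```

Selecting the single most similar negative is only one common form of hard negative mining; a real system would typically learn the node and edge features rather than fix them by hand.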

Main Results:

The proposed GNN-based method achieved a top nDCG@50 score of 0.745 on the Visual Genome dataset.
Demonstrated an improvement of approximately 7.7 percentage points compared to random sampling with full graphs.
Successfully retrieved semantically relevant images by structurally interpreting complex scenes.
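
For readers unfamiliar with the metric reported above, the following sketch shows one common way to compute nDCG@k from a ranked list of graded relevance scores. It assumes linear gain and a log2 rank discount, and it derives the ideal ordering from the retrieved list alone; how the paper defines relevance and the ideal ranking is not stated in this summary, so treat the helper names and these choices as assumptions.

```python
import math

def dcg_at_k(relevances, k):
    """Discounted cumulative gain over the top-k items (linear gain)."""
    return sum(rel / math.log2(rank + 2)
               for rank, rel in enumerate(relevances[:k]))

def ndcg_at_k(relevances, k):
    """DCG@k divided by the DCG@k of the same scores sorted best-first."""
    ideal = dcg_at_k(sorted(relevances, reverse=True), k)
    return dcg_at_k(relevances, k) / ideal if ideal > 0 else 0.0
```

A ranking that already lists items from most to least relevant scores 1.0; placing relevant items lower in the list reduces the score.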

Conclusions:

The novel GNN-based approach effectively addresses limitations in text-based image retrieval.
Quantitative comparison of semantic and scene graphs significantly enhances retrieval accuracy.
Structural interpretation of scenes via graph embeddings enables robust image retrieval from natural language queries.