CSANet: Context-Spatial Awareness Network for RGB-T Urban Scene Understanding.

Ruixiang Li¹, Zhen Wang¹,², Jianxin Guo¹

¹School of Electronic Information, Xijing University, Xijing Road, Chang'an District, Xi'an 710123, China.

Journal of Imaging

June 25, 2025

Summary

This summary is machine-generated.

CSANet (Context-Spatial Awareness Network) improves semantic segmentation for autonomous driving by combining RGB and thermal infrared data. The combination enhances performance in challenging conditions such as low light and bad weather.

Keywords:

RGB-T semantic segmentation; attention mechanism; encoder–decoder structure; multi-modal fusion; urban scene understanding

Area of Science:

Computer Vision
Artificial Intelligence
Robotics

Background:

Semantic segmentation is vital for urban scene understanding in autonomous driving.
Existing methods struggle with low-light and adverse weather, limiting real-world application.
Integrating RGB and thermal infrared (TIR) data offers a promising solution.

Purpose of the Study:

To develop a novel framework, CSANet, for robust RGB-T semantic segmentation.
To enhance feature extraction and fusion for improved accuracy in challenging conditions.
To advance the capabilities of autonomous driving systems.

Main Methods:

CSANet utilizes an efficient encoder for local and global feature extraction.
A hierarchical fusion strategy selectively integrates visual and semantic information.
Key modules include the Channel-Spatial Cross-Fusion Module (CSCFM), the Multi-Head Fusion Module (MHFM), and the Spatial Coordinate Attention Module (SCAM).
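
The general idea of fusing RGB and thermal features with channel and spatial attention can be illustrated with a minimal sketch. This is not the CSCFM, MHFM, or SCAM from the paper, whose internals are not described here. It is a parameter-free stand-in in which each modality is reweighted by gates computed from the other. The function names (`channel_gate`, `spatial_gate`, `cross_fuse`) and the feature shape `(C, H, W)` are assumptions made for illustration.

```python
import numpy as np


def sigmoid(x):
    """Logistic function, used to squash gate values into (0, 1)."""
    return 1.0 / (1.0 + np.exp(-x))


def channel_gate(feat):
    """Per-channel weight from global average pooling; shape (C, 1, 1)."""
    return sigmoid(feat.mean(axis=(1, 2), keepdims=True))


def spatial_gate(feat):
    """Per-pixel weight from the mean over channels; shape (1, H, W)."""
    return sigmoid(feat.mean(axis=0, keepdims=True))


def cross_fuse(rgb, tir):
    """Hypothetical cross-fusion of two (C, H, W) feature maps.

    Each modality is scaled by the channel and spatial gates of the
    other modality, and the two refined maps are summed. A real module
    would use learned convolutions instead of plain averages.
    """
    if rgb.shape != tir.shape:
        raise ValueError("rgb and tir feature maps must have the same shape")
    rgb_refined = rgb * channel_gate(tir) * spatial_gate(tir)
    tir_refined = tir * channel_gate(rgb) * spatial_gate(rgb)
    return rgb_refined + tir_refined


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    rgb_feat = rng.normal(size=(8, 16, 16))  # stand-in RGB encoder features
    tir_feat = rng.normal(size=(8, 16, 16))  # stand-in thermal encoder features
    fused = cross_fuse(rgb_feat, tir_feat)
    print(fused.shape)
```

In a hierarchical design, a fusion step of this kind would be applied at each encoder stage and the fused maps passed on to the decoder. The sketch covers only a single stage.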

Main Results:

CSANet demonstrates state-of-the-art performance on benchmark datasets (MFNet, PST900).
The framework effectively fuses RGB and TIR modalities for superior semantic segmentation.
Significant improvements in object localization accuracy were observed.
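
The results above are stated without naming a metric. Semantic segmentation benchmarks are commonly scored with mean intersection over union (mIoU), so the sketch below assumes that metric for illustration; it is not confirmed as the one behind these results. The function name `mean_iou` and the toy labels are hypothetical.

```python
import numpy as np


def mean_iou(pred, target, num_classes):
    """Mean intersection over union across classes that appear at least once.

    pred and target are integer label arrays of the same shape, with
    values in [0, num_classes).
    """
    pred = np.asarray(pred).ravel()
    target = np.asarray(target).ravel()
    if pred.shape != target.shape:
        raise ValueError("pred and target must have the same number of pixels")
    # Confusion matrix: rows are ground-truth classes, columns are predictions.
    conf = np.bincount(
        target * num_classes + pred, minlength=num_classes ** 2
    ).reshape(num_classes, num_classes)
    intersection = np.diag(conf)
    union = conf.sum(axis=0) + conf.sum(axis=1) - intersection
    valid = union > 0  # skip classes absent from both pred and target
    return float((intersection[valid] / union[valid]).mean())


if __name__ == "__main__":
    prediction = [0, 0, 1, 1]
    ground_truth = [0, 1, 1, 1]
    print(mean_iou(prediction, ground_truth, num_classes=2))
```

For the toy labels, class 0 has an IoU of 1/2 and class 1 has an IoU of 2/3, so the mean is 7/12.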

Conclusions:

CSANet offers a robust solution for RGB-T semantic segmentation, especially in adverse conditions.
The proposed fusion strategies and attention mechanisms enhance understanding of complex urban environments.
This work contributes to safer and more reliable autonomous driving systems.