Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Depth Perception and Spatial Vision01:15

Depth Perception and Spatial Vision

972
Depth perception is the ability to perceive objects three-dimensionally. It relies on two types of cues: binocular and monocular. Binocular cues depend on the combination of images from both eyes and how the eyes work together. Since the eyes are in slightly different positions, each eye captures a slightly different image. This disparity between images, known as binocular disparity, helps the brain interpret depth. When the brain compares these images, it determines the distance to an object.
972
Selected Data About Geographic Locations01:25

Selected Data About Geographic Locations

72
Geographic Information Systems (GIS) rely on two core types of data: spatial data and attribute data.Spatial DataSpatial data defines the physical location of features within a coordinate system, typically expressed in terms of latitude and longitude. It provides precise positioning for elements like roads, rivers, or buildings.Attribute DataAttribute data complements spatial data by adding descriptive information about these features. For example, a road's spatial data includes its start and...
72
Levels of Use of a GIS01:29

Levels of Use of a GIS

109
Geographic Information Systems (GIS) operate across three levels of application, each representing an increasing degree of complexity: data management, analysis, and prediction. These levels reflect the expanding functionality and versatility of GIS technology in handling spatial data for diverse purposes.Data ManagementAt its foundational level, GIS serves as a tool for data management, enabling the input, storage, retrieval, and organization of spatial data. This level is often employed in...
109
Manipulation and Analysis01:21

Manipulation and Analysis

63
GIS manipulation and analysis functions are vital for decision-making and planning. These activities range from data retrieval tasks, such as selecting information based on specific criteria, to advanced analytical techniques that address complex spatial problems.One critical GIS analysis method is overlaying, which combines multiple data layers to examine impacts. For example, overlaying a river-dammed lake boundary with road networks can identify affected infrastructure. Another common...
63
Introduction to GIS01:28

Introduction to GIS

198
Geographic Information Systems (GIS) are tools for storing, analyzing, and displaying spatial data alongside related attributes. Unlike traditional information systems that address general queries, GIS incorporates spatial components, enabling users to answer "where" and "how far." For example, GIS can process housing data linked to geographic locations like zip codes, allowing insights into population density or housing distribution through thematic maps.GIS integrates technologies such as...
198
Vision01:24

Vision

55.4K
Vision is the result of light being detected and transduced into neural signals by the retina of the eye. This information is then further analyzed and interpreted by the brain. First, light enters the front of the eye and is focused by the cornea and lens onto the retina—a thin sheet of neural tissue lining the back of the eye. Because of refraction through the convex lens of the eye, images are projected onto the retina upside-down and reversed.
55.4K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Comparison of three inoculum sources for acetate production and microbial succession in H<sub>2</sub>/CO<sub>2</sub>-fed anaerobic system.

Bioprocess and biosystems engineering·2026
Same author

Enhanced stability of RPA-CRISPR-Cas12a system for respiratory pathogen detection using Trehalose-Carboxymethyl Chitosan Lyoprotectant.

Diagnostic microbiology and infectious disease·2026
Same author

Skeletal and Peripheral Editing of Pyridinium Salts into Fully Substituted Pyrroles.

Organic letters·2026
Same author

Tangeretin prevents cardiac failure induced by reperfusion/ischaemia by inhibiting apoptosis, endoplasmic reticulum stress, and JNK/ERK pathway.

Archives of medical science : AMS·2026
Same author

Real-World Evidence on the Efficacy of Icaritin for Unresectable Advanced Hepatocellular Carcinoma: A Multicenter Retrospective Study.

International journal of cancer·2026
Same author

Enhanced PD-L1 targeting boosts the cytotoxic activity of FOLR1- CAR NK92 cells against ovarian cancer.

Cancer immunology, immunotherapy : CII·2026
Same journal

Human-AI Interaction in Interventional Radiology: A Narrative Review of Current Applications, Challenges, and Future Directions.

Journal of imaging·2026
Same journal

Coronary Artery Anomalies and Anatomical Variants: Cross-Sectional Diagnostic Imaging and Clinical Background.

Journal of imaging·2026
Same journal

YoLeTooth: A Unified Framework for Joint Tooth Segmentation and Periapical Lesion Detection in Panoramic Radiographs.

Journal of imaging·2026
Same journal

Radiomics-Guided Multi-Sequence Learning for Pathological Complete Response Prediction from Breast MRI with Missing Auxiliary Sequences.

Journal of imaging·2026
Same journal

Cutaneous Thermography in Arthropathies: Quantitative Imaging, Machine Learning, and Clinical Translation.

Journal of imaging·2026
Same journal

Two-Stage Dynamic Synergistic Segmentation Method for Myocardial Pathology.

Journal of imaging·2026
See all related articles

Related Experiment Video

Updated: Sep 18, 2025

Photorealistic Learned Landscapes for Augmented Reality
06:54

Photorealistic Learned Landscapes for Augmented Reality

Published on: June 27, 2025

196

CSANet: Context-Spatial Awareness Network for RGB-T Urban Scene Understanding.

Ruixiang Li1, Zhen Wang1,2, Jianxin Guo1

  • 1School of Electronic Information, Xijing University, Xijing Road, Chang'an District, Xi'an 710123, China.

Journal of Imaging
|June 25, 2025
PubMed
Summary
This summary is machine-generated.

CSANet improves semantic segmentation for autonomous driving using RGB and thermal infrared data. This Context Spatial Awareness Network (CSANet) enhances performance in challenging conditions like low light and bad weather.

Keywords:
RGB-T semantic segmentationattention mechanismencoder–decoder structuremulti-modal fusionurban scene understanding

More Related Videos

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
03:31

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

650
Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment
08:25

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

Published on: May 7, 2019

9.1K

Related Experiment Videos

Last Updated: Sep 18, 2025

Photorealistic Learned Landscapes for Augmented Reality
06:54

Photorealistic Learned Landscapes for Augmented Reality

Published on: June 27, 2025

196
Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
03:31

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

650
Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment
08:25

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

Published on: May 7, 2019

9.1K

Area of Science:

  • Computer Vision
  • Artificial Intelligence
  • Robotics

Background:

  • Semantic segmentation is vital for urban scene understanding in autonomous driving.
  • Existing methods struggle with low-light and adverse weather, limiting real-world application.
  • Integrating RGB and thermal infrared (TIR) data offers a promising solution.

Purpose of the Study:

  • To develop a novel framework, CSANet, for robust RGB-T semantic segmentation.
  • To enhance feature extraction and fusion for improved accuracy in challenging conditions.
  • To advance the capabilities of autonomous driving systems.

Main Methods:

  • CSANet utilizes an efficient encoder for local and global feature extraction.
  • A hierarchical fusion strategy selectively integrates visual and semantic information.
  • Key modules include Channel-Spatial Cross-Fusion (CSCFM), Multi-Head Fusion (MHFM), and Spatial Coordinate Attention (SCAM).

Main Results:

  • CSANet demonstrates state-of-the-art performance on benchmark datasets (MFNet, PST900).
  • The framework effectively fuses RGB and TIR modalities for superior semantic segmentation.
  • Significant improvements in object localization accuracy were observed.

Conclusions:

  • CSANet offers a robust solution for RGB-T semantic segmentation, especially in adverse conditions.
  • The proposed fusion strategies and attention mechanisms enhance understanding of complex urban environments.
  • This work contributes to safer and more reliable autonomous driving systems.