Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Perception01:28

Perception

Perception is a fundamental psychological process that enables individuals to organize, interpret, and consciously experience sensory information. This process is crucial for understanding and interacting with the world around us. It includes both bottom-up and top-down processing, each playing a distinct role in how we perceive our environment.
Bottom-up processing begins at the sensory level, where receptors detect external environmental stimuli. These could include the tactile sensation of...
Depth Perception and Spatial Vision01:15

Depth Perception and Spatial Vision

Depth perception is the ability to perceive objects three-dimensionally. It relies on two types of cues: binocular and monocular. Binocular cues depend on the combination of images from both eyes and how the eyes work together. Since the eyes are in slightly different positions, each eye captures a slightly different image. This disparity between images, known as binocular disparity, helps the brain interpret depth. When the brain compares these images, it determines the distance to an object.
Perceiving Loudness, Pitch, and Location01:21

Perceiving Loudness, Pitch, and Location

The human brain perceives pitch through two primary mechanisms reflected in place theory and frequency theory. Each mechanism describes how sound waves are interpreted as specific pitches by the brain, offering insights into the intricate processes of auditory perception.
Place theory, or place coding, suggests that different pitches are heard because various sound waves activate specific locations along the cochlea's basilar membrane. The brain determines the pitch of a sound by identifying...
Gestalt Principles of Perception01:21

Gestalt Principles of Perception

Gestalt principles provide a framework for understanding how humans perceive objects as unified wholes within their context. These principles are essential in explaining the cognitive processes that make sense of complex visual stimuli by organizing them into coherent groups. One fundamental principle is proximity, which posits that objects located close to each other are perceived as a collective group. For instance, when dots are positioned near one another, the visual system interprets them...
Factors Affecting Perception01:25

Factors Affecting Perception

Perception is influenced by perceptual set, context, motivation, and emotion. Perceptual set, or perceptual expectancy, refers to the tendency to perceive things in a particular way, influenced by previous experiences and expectations. This phenomenon affects the interpretation of stimuli, creating a set of mental tendencies and assumptions that impact sensory perceptions of sound, taste, touch, and sight.
An illustrative example of a perceptual set is the scenario where an airline pilot told...
Perceptual Constancy01:12

Perceptual Constancy

Perceptual constancy is the ability to recognize that objects remain consistent and unchanged even when their appearance varies due to changes in sensory input. There are four main types of perceptual constancy: size constancy, shape constancy, color constancy, and brightness constancy.
Size constancy is the recognition that an object remains the same size, even when its image on the retina changes. For instance, a bus is perceived to be large enough to carry people, even if it looks tiny from...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

A bibliometric analysis of artificial intelligence in ovarian cancer research from 2006 to 2025.

Discover oncology·2026
Same author

A Lightweight Net with Dual-Path Feature Enhancer and Bidirectional Gated Fusion for Cloud Detection.

Sensors (Basel, Switzerland)·2026
Same author

Global, regional, and national burden of schizophrenia: epidemiological trends, decomposition, joinpoint analysis, and projections to 2036 based on GBD 2021.

Frontiers in psychiatry·2026
Same author

Development and validation of a sensitive fluorescence RT-qPCR assay with TaqMan-MGB probe for detection of H6 subtype avian influenza A virus.

Journal of veterinary diagnostic investigation : official publication of the American Association of Veterinary Laboratory Diagnosticians, Inc·2026
Same author

Global burden of ovarian cancer attributable to high BMI, 1990-2021: spatiotemporal trends, risk factors, frontier analysis, and projections to 2036 based on GBD 2021 study.

Journal of ovarian research·2025
Same author

Single-cell transcriptome analysis defines novel molecular subtypes and reveals therapeutic implications of T/myeloid mixed-phenotype acute leukemia.

Genome medicine·2025
Same journal

RETRACTED: Zhang et al. A Novel Framework for Reconstruction and Imaging of Target Scattering Centers via Wide-Angle Incidence in Radar Networks. <i>Sensors</i> 2025, <i>25</i>, 6802.

Sensors (Basel, Switzerland)·2026
Same journal

Enhancing Unsupervised Multi-Source Domain Adaptation for Person Re-Identification via Mixture of Experts and Graph-Based Relation.

Sensors (Basel, Switzerland)·2026
Same journal

Development of an Instrumented Glove for Palmar Pressure Assessment in Kayakers.

Sensors (Basel, Switzerland)·2026
Same journal

Development and Experimental Validation of an Autonomous IoT-Based Monitoring System for Real-Time Water Quality Assessment in the Amazon River.

Sensors (Basel, Switzerland)·2026
Same journal

Semi-Supervised Adversarial Learning Framework for Controller Area Network Bus Intrusion Detection.

Sensors (Basel, Switzerland)·2026
Same journal

Smart Optimization Method for Safety Signs in Innovative Manufacturing Environments Integrating Industrial Field IoT Sensors and Knowledge Graphs.

Sensors (Basel, Switzerland)·2026
See all related articles

Related Experiment Video

Updated: May 11, 2026

Cross-Modal Multivariate Pattern Analysis
13:51

Cross-Modal Multivariate Pattern Analysis

Published on: November 9, 2011

19.9K

MixImages: An Urban Perception AI Method Based on Polarization Multimodalities.

Yan Mo1,2, Wanting Zhou1, Wei Chen3

  • 1School of Information Engineering, Nanchang Hangkong University, Nanchang 330063, China.

Sensors (Basel, Switzerland)
|August 10, 2024
PubMed
Summary
This summary is machine-generated.

This study introduces MixImages, a novel semantic segmentation model that enhances urban perception by integrating polarization data with RGB images. The model significantly improves accuracy, especially in challenging shadow regions, outperforming traditional RGB-only methods.

Keywords:
deep learningpolarizationsemantic segmentationurban perception

More Related Videos

Author Spotlight: Unveiling the Potential of VSFG Microscopy in Studying Mesoscopically Heterogeneous Self-Assembled Structures
08:49

Author Spotlight: Unveiling the Potential of VSFG Microscopy in Studying Mesoscopically Heterogeneous Self-Assembled Structures

Published on: December 1, 2023

1.3K
Multimodal Optical Imaging Platform for Studying Cellular Metabolism
04:47

Multimodal Optical Imaging Platform for Studying Cellular Metabolism

Published on: June 6, 2025

141

Related Experiment Videos

Last Updated: May 11, 2026

Cross-Modal Multivariate Pattern Analysis
13:51

Cross-Modal Multivariate Pattern Analysis

Published on: November 9, 2011

19.9K
Author Spotlight: Unveiling the Potential of VSFG Microscopy in Studying Mesoscopically Heterogeneous Self-Assembled Structures
08:49

Author Spotlight: Unveiling the Potential of VSFG Microscopy in Studying Mesoscopically Heterogeneous Self-Assembled Structures

Published on: December 1, 2023

1.3K
Multimodal Optical Imaging Platform for Studying Cellular Metabolism
04:47

Multimodal Optical Imaging Platform for Studying Cellular Metabolism

Published on: June 6, 2025

141

Area of Science:

  • Computer Vision
  • Artificial Intelligence
  • Remote Sensing

Background:

  • Urban perception models often rely solely on RGB images, limiting performance in scenes with complex lighting and shadows.
  • Existing methods struggle with feature confusion caused by light and shadow interplay, diminishing perception accuracy.
  • Polarization data offers complementary information beyond RGB, crucial for enhancing shadow region representation.

Purpose of the Study:

  • To develop a novel semantic segmentation model, MixImages, for improved urban scene perception.
  • To leverage multimodal polarization data alongside RGB images to overcome limitations of unimodal approaches.
  • To enhance the representation of shadow regions and improve pixel-level perception in urban environments.

Main Methods:

  • Proposed a novel semantic segmentation model named MixImages.
  • Integrated multimodal polarization data with traditional RGB image inputs.
  • Utilized transformer architecture for its effective receptive field to capture discriminative cues.
  • Conducted experiments on a dedicated polarization dataset of urban scenes.

Main Results:

  • MixImages achieved a 3.43% accuracy advantage over RGB-only models in the unimodal benchmark.
  • The model demonstrated a 4.29% performance improvement in the multimodal benchmark.
  • Analysis of different polarization combinations provided insights for downstream task optimization.

Conclusions:

  • The proposed MixImages model offers a significant advancement in urban scene perception by effectively combining RGB and polarization data.
  • The integration of polarization data enhances model robustness, particularly in challenging lighting conditions like shadows.
  • MixImages presents a promising new approach for pixel-level perception tasks in complex urban environments.