Vision Permutator: A Permutable MLP-Like Architecture for Visual Recognition.

Qibin Hou, Zihang Jiang, Li Yuan

    IEEE Transactions on Pattern Analysis and Machine Intelligence | January 25, 2022
    Summary
    This summary is machine-generated.

    Vision Permutator, a new MLP-like architecture, efficiently encodes spatial information for visual recognition. It achieves strong ImageNet performance without convolutions or attention, outperforming models of similar size.

    Area of Science:

    • Computer Science
    • Artificial Intelligence
    • Machine Learning

    Background:

    • Existing MLP-like models often flatten the spatial dimensions of an image, which discards crucial positional information.
    • Vision Transformers (ViTs) use attention, which can be computationally intensive.
    • Convolutional Neural Networks (CNNs) rely on spatial convolutions for feature extraction.
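
As a rough illustration of the flattening point above (not taken from the paper), the sketch below shows what row-major flattening does to grid coordinates. The helper `flat_index` and the grid width are hypothetical, chosen only for this example.

```python
def flat_index(row, col, width):
    """Row-major position of grid cell (row, col) after flattening."""
    return row * width + col

WIDTH = 4  # hypothetical grid width, for illustration only

# Horizontal neighbours stay adjacent in the flattened sequence...
horizontal_gap = flat_index(1, 2, WIDTH) - flat_index(1, 1, WIDTH)
# ...but vertical neighbours end up WIDTH positions apart.
vertical_gap = flat_index(2, 1, WIDTH) - flat_index(1, 1, WIDTH)

print(horizontal_gap, vertical_gap)
```

Once the grid is a single sequence, nothing in the index itself says which entries were vertical neighbours; a model that mixes over the flattened axis has to recover that layout from data.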

    Purpose of the Study:

    • To introduce Vision Permutator, a novel MLP-like architecture for visual recognition.
    • To develop a data-efficient model that effectively captures spatial information.
    • To offer a competitive alternative to CNNs and ViTs without relying on attention or convolutions.

    Main Methods:

    • Vision Permutator encodes feature representations along the height and width dimensions separately, using linear projections.
    • The outputs of these projections are then aggregated so that they complement each other.
    • The architecture does not flatten the spatial dimensions and uses no attention mechanism.
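
The idea in the list above can be sketched in a few lines of NumPy. This is a simplified illustration under stated assumptions, not the authors' layer: the function name `separate_axis_mixing`, the plain square weight matrices, the added channel branch, and the summation used for aggregation are all assumptions made for this example, and the published architecture may differ in its details.

```python
import numpy as np

def separate_axis_mixing(x, w_h, w_w, w_c):
    """Mix an (H, W, C) feature map along each axis with its own linear map.

    Simplified sketch: the branch outputs are aggregated by summation
    (an assumption for this example).
    """
    # Height branch: each output row is a weighted sum of the input rows.
    h_branch = np.einsum("ij,jwc->iwc", w_h, x)
    # Width branch: each output column is a weighted sum of the input columns.
    w_branch = np.einsum("ij,hjc->hic", w_w, x)
    # Channel branch: ordinary per-position linear projection.
    c_branch = x @ w_c
    return h_branch + w_branch + c_branch

# Hypothetical sizes, chosen only for illustration.
H, W, C = 4, 5, 8
rng = np.random.default_rng(0)
x = rng.standard_normal((H, W, C))
out = separate_axis_mixing(
    x,
    rng.standard_normal((H, H)),
    rng.standard_normal((W, W)),
    rng.standard_normal((C, C)),
)
print(out.shape)  # the (H, W, C) layout is preserved
```

Because the input keeps its (H, W, C) layout throughout, each projection can reach across a whole row or column at once, which is one way to read the claim about long-range dependencies without attention.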

    Main Results:

    • Vision Permutator achieves 81.5% top-1 accuracy on ImageNet with 25M parameters, outperforming comparable CNNs and ViTs.
    • Scaling to 88M parameters yields 83.2% top-1 accuracy.
    • The model reaches these results without extra large-scale pre-training data.

    Conclusions:

    • Vision Permutator offers a conceptually simple and data-efficient approach to visual recognition.
    • The model effectively captures long-range dependencies and positional information.
    • This work encourages new methods for encoding spatial information in MLP-like models.