VisionHub: Learning Task-Plugins for Efficient Universal Vision Model.

Haolin Wang, Yixuan Zhu, Wenliang Zhao

IEEE Transactions on Image Processing: A Publication of the IEEE Signal Processing Society

September 25, 2025

Summary

This summary is machine-generated.

VisionHub is a novel universal vision model that efficiently handles multiple visual tasks using a U-Net backbone and lightweight plugins. It offers streamlined transferability for downstream applications with minimal overhead.

Area of Science:

Computer Vision
Machine Learning
Artificial Intelligence

Background:

Universal models in natural language processing (NLP) have proven successful, prompting research into unified frameworks for diverse visual tasks.
Existing universal vision models struggle with adaptability, computational costs, workflow complexity, and performance limitations in diverse applications.
Incomplete visual generation and perception capabilities hinder the generalizability of current models.

Purpose of the Study:

To introduce VisionHub, a novel universal vision model designed for concurrent visual restoration and perception tasks.
To enable streamlined transferability to downstream tasks with enhanced flexibility and efficiency.
To address the limitations of existing models in terms of computational expense, workflow complexity, and performance versatility.

Main Methods:

Leverages the frozen denoising U-Net architecture from Stable Diffusion as the core backbone.
Incorporates lightweight task-plugins and a task router, both integrated into the U-Net backbone, for enhanced flexibility.
Enables handling of various vision tasks via natural language instructions with minimal storage and operational overhead.
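
The frozen-backbone, task-plugin, and task-router pattern listed above can be illustrated with a toy sketch. This is not the VisionHub implementation: the summary does not say how plugins attach to the U-Net or how the router selects them, so everything here is an assumption made for illustration. The names `frozen_backbone`, `TaskPlugin`, `TaskRouter`, and `run` are hypothetical, the "backbone" is a fixed arithmetic function rather than a diffusion U-Net, plugins are applied after the backbone rather than inside it, and routing is a simple keyword match on the instruction.

```python
from dataclasses import dataclass, field
from typing import Dict, List

Vector = List[float]


def frozen_backbone(x: Vector) -> Vector:
    """Toy stand-in for a frozen shared backbone: fixed weights, never updated."""
    return [2.0 * v + 1.0 for v in x]


@dataclass
class TaskPlugin:
    """Hypothetical lightweight plugin: a per-task scale and shift on features."""
    scale: float
    shift: float

    def __call__(self, features: Vector) -> Vector:
        return [self.scale * f + self.shift for f in features]


@dataclass
class TaskRouter:
    """Hypothetical router: selects a plugin by keyword match on the instruction."""
    plugins: Dict[str, TaskPlugin] = field(default_factory=dict)

    def route(self, instruction: str) -> TaskPlugin:
        text = instruction.lower()
        for keyword, plugin in self.plugins.items():
            if keyword in text:
                return plugin
        raise KeyError(f"no plugin registered for instruction: {instruction!r}")


def run(router: TaskRouter, instruction: str, x: Vector) -> Vector:
    """Shared backbone computes features; only the routed plugin is task-specific."""
    plugin = router.route(instruction)
    return plugin(frozen_backbone(x))


router = TaskRouter(
    plugins={
        "depth": TaskPlugin(scale=0.5, shift=0.0),
        "segment": TaskPlugin(scale=1.0, shift=-1.0),
    }
)
```

With this setup, `run(router, "Estimate the depth of this image", [1.0, 2.0])` returns `[1.5, 2.5]`. Adding a task means registering one more small plugin, while the backbone stays untouched, which is the storage argument the summary makes in general terms.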

Main Results:

VisionHub demonstrates efficiency and effectiveness across 11 different vision tasks.
Achieves competitive performance on benchmarks, including 53.3% mIoU on ADE20K semantic segmentation.
Shows strong results in depth estimation (0.253 RMSE on NYUv2) and pose estimation (74.2 AP on MS-COCO).
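
For readers unfamiliar with the metrics quoted above, the following sketch shows the textbook definitions of mIoU and RMSE on flat lists of values. It is a simplified illustration, not the evaluation code used in the paper: benchmark protocols for ADE20K and NYUv2 may differ in details such as how classes absent from an image are handled or how scores are aggregated across a dataset. The function names `mean_iou` and `rmse` are chosen here for illustration. The AP metric used for pose estimation is omitted because it depends on keypoint-similarity matching that does not fit a short example.

```python
import math
from typing import Sequence


def rmse(pred: Sequence[float], target: Sequence[float]) -> float:
    """Root mean squared error between predictions and targets (lower is better)."""
    if len(pred) != len(target) or len(pred) == 0:
        raise ValueError("pred and target must be non-empty and the same length")
    squared = [(p - t) ** 2 for p, t in zip(pred, target)]
    return math.sqrt(sum(squared) / len(squared))


def mean_iou(pred: Sequence[int], target: Sequence[int], num_classes: int) -> float:
    """Mean intersection-over-union over classes present in pred or target."""
    if len(pred) != len(target):
        raise ValueError("pred and target must be the same length")
    ious = []
    for c in range(num_classes):
        intersection = sum(1 for p, t in zip(pred, target) if p == c and t == c)
        union = sum(1 for p, t in zip(pred, target) if p == c or t == c)
        if union > 0:  # skip classes that appear in neither labeling
            ious.append(intersection / union)
    if not ious:
        raise ValueError("no class from range(num_classes) appears in the inputs")
    return sum(ious) / len(ious)
```

As a small worked case, predicted labels `[0, 0, 1, 1]` against ground truth `[0, 1, 1, 1]` give an IoU of 1/2 for class 0 and 2/3 for class 1, so `mean_iou` returns their average, 7/12.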

Conclusions:

VisionHub offers a novel and efficient approach to universal vision modeling.
The proposed architecture effectively manages multiple visual tasks and facilitates transfer learning.
The model presents a promising solution for versatile and high-performance computer vision applications.