Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Vision01:24

Vision

59.4K
Vision is the result of light being detected and transduced into neural signals by the retina of the eye. This information is then further analyzed and interpreted by the brain. First, light enters the front of the eye and is focused by the cornea and lens onto the retina—a thin sheet of neural tissue lining the back of the eye. Because of refraction through the convex lens of the eye, images are projected onto the retina upside-down and reversed.
59.4K
Observational Learning01:12

Observational Learning

838
Albert Bandura's observational learning, also known as imitation or modeling, occurs when a person observes and imitates another's behavior. It is a quicker process than operant conditioning. A well-known example is the Bobo doll study, where children who saw an adult acting aggressively towards the doll were more likely to act aggressively when left alone, compared to those who observed a nonaggressive adult. Many psychologists view observational learning as a form of latent learning...
838
Depth Perception and Spatial Vision01:15

Depth Perception and Spatial Vision

1.8K
Depth perception is the ability to perceive objects three-dimensionally. It relies on two types of cues: binocular and monocular. Binocular cues depend on the combination of images from both eyes and how the eyes work together. Since the eyes are in slightly different positions, each eye captures a slightly different image. This disparity between images, known as binocular disparity, helps the brain interpret depth. When the brain compares these images, it determines the distance to an object.
1.8K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Early prediction of joint space narrowing in rheumatoid arthritis using AI-quantified bilateral joint space asymmetry on hand radiography.

Japanese journal of radiology·2026
Same author

Effects of Hydrogen-Rich Water on Juvenile Largemouth Bass (<i>Micropterus salmoides</i>) Under Acute Low-Temperature Stress.

Antioxidants (Basel, Switzerland)·2026
Same author

Cell-type and spatiotemporal transcriptional signatures of white matter morphometric similarity network alterations in major depressive disorder.

Psychological medicine·2026
Same author

Tracking of volatile organic compound emissions from unsaturated polyester resin-based artificial stone using automated static headspace-gas chromatography-mass spectrometry.

Journal of chromatography. A·2026
Same author

Synergistic antioxidant and gene supplementation for high-efficacy retinitis pigmentosa therapy.

Science advances·2026
Same author

Cell-type-specific gene regulation in migraine and its subtypes links genetic risk to brain structure.

Progress in neuro-psychopharmacology & biological psychiatry·2026
Same journal

Change-Prior-Guided Unsupervised Change Detection of Heterogeneous Remote Sensing Images.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
Same journal

AgonicDreamer: Enhancing Multi-View Consistency in Text-to-3D Generation via Rectified Score Distillation.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
Same journal

BiCM-Prompt: Bidirectional Cross-Modal Prompt Tuning for Class-Incremental Learning on Multisource Remote Sensing Images.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
Same journal

GoP-based Quality Enhancement on Video Compression.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
Same journal

Align then Tensorize: Multi-Level Consistent Anchor Graph Learning for Scalable Multi-View Clustering.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
Same journal

Beyond Fidelity: Diverse Image Synthesis via Retrieval-Augmented Diffusion.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
See all related articles

Related Experiment Video

Updated: Jan 17, 2026

Development of a Gaze-Contingent Display Framework Designed for Perceptual and Oculomotor Research with Simulated Central Vision Loss
07:12

Development of a Gaze-Contingent Display Framework Designed for Perceptual and Oculomotor Research with Simulated Central Vision Loss

Published on: April 11, 2025

878

VisionHub: Learning Task-Plugins for Efficient Universal Vision Model.

Haolin Wang, Yixuan Zhu, Wenliang Zhao

    IEEE Transactions on Image Processing : a Publication of the IEEE Signal Processing Society
    |September 25, 2025
    PubMed
    Summary
    This summary is machine-generated.

    VisionHub is a novel universal vision model that efficiently handles multiple visual tasks using a U-Net backbone and lightweight plugins. It offers streamlined transferability for downstream applications with minimal overhead.

    More Related Videos

    Author Spotlight: Insights into Visual Cortex Research Through Wide-View fMRI Mapping
    07:11

    Author Spotlight: Insights into Visual Cortex Research Through Wide-View fMRI Mapping

    Published on: December 8, 2023

    2.3K
    A Standardized Obstacle Course for Assessment of Visual Function in Ultra Low Vision and Artificial Vision
    09:29

    A Standardized Obstacle Course for Assessment of Visual Function in Ultra Low Vision and Artificial Vision

    Published on: February 11, 2014

    13.5K

    Related Experiment Videos

    Last Updated: Jan 17, 2026

    Development of a Gaze-Contingent Display Framework Designed for Perceptual and Oculomotor Research with Simulated Central Vision Loss
    07:12

    Development of a Gaze-Contingent Display Framework Designed for Perceptual and Oculomotor Research with Simulated Central Vision Loss

    Published on: April 11, 2025

    878
    Author Spotlight: Insights into Visual Cortex Research Through Wide-View fMRI Mapping
    07:11

    Author Spotlight: Insights into Visual Cortex Research Through Wide-View fMRI Mapping

    Published on: December 8, 2023

    2.3K
    A Standardized Obstacle Course for Assessment of Visual Function in Ultra Low Vision and Artificial Vision
    09:29

    A Standardized Obstacle Course for Assessment of Visual Function in Ultra Low Vision and Artificial Vision

    Published on: February 11, 2014

    13.5K

    Area of Science:

    • Computer Vision
    • Machine Learning
    • Artificial Intelligence

    Background:

    • Universal language models (NLP) have shown success, prompting research into unified frameworks for diverse visual tasks.
    • Existing universal vision models struggle with adaptability, computational costs, workflow complexity, and performance limitations in diverse applications.
    • Incomplete visual generation and perception capabilities hinder the generalizability of current models.

    Purpose of the Study:

    • To introduce VisionHub, a novel universal vision model designed for concurrent visual restoration and perception tasks.
    • To enable streamlined transferability to downstream tasks with enhanced flexibility and efficiency.
    • To address the limitations of existing models in terms of computational expense, workflow complexity, and performance versatility.

    Main Methods:

    • Leverages the frozen denoising U-Net architecture from Stable Diffusion as the core backbone.
    • Incorporates lightweight task-plugins and a task router integrated onto the U-Net backbone for enhanced flexibility.
    • Enables handling of various vision tasks via natural language instructions with minimal storage and operational overhead.

    Main Results:

    • VisionHub demonstrates efficiency and effectiveness across 11 different vision tasks.
    • Achieves competitive performance on benchmarks, including 53.3% mIoU on ADE20K semantic segmentation.
    • Shows strong results in depth estimation (0.253 RMSE on NYUv2) and pose estimation (74.2 AP on MS-COCO).

    Conclusions:

    • VisionHub offers a novel and efficient approach to universal vision modeling.
    • The proposed architecture effectively manages multiple visual tasks and facilitates transfer learning.
    • The model presents a promising solution for versatile and high-performance computer vision applications.