Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Parallel Processing

Parallel Processing

The brain processes sensory information rapidly due to parallel processing, which involves sending data across multiple neural pathways at the same time. This method allows the brain to manage various sensory qualities, such as shapes, colors, movements, and locations, all concurrently. For instance, when observing a forest landscape, the brain simultaneously processes the movement of leaves, the shapes of trees, the depth between them, and the various shades of green. This enables a quick and...

Nonconscious Mimicry

Nonconscious Mimicry

Nonconscious mimicry occurs when individuals alter their mannerisms to match the behaviors and expressions of those nearby, without intention.

Information Processing Approach

Information Processing Approach

The information-processing theory of cognitive development centers on fundamental mental processes, including attention, memory, and problem-solving skills. Researchers in this field examine how cognitive abilities, such as working memory, evolve and influence children's overall development. Studies indicate that children with stronger working memory tend to excel in reading comprehension, math, and problem-solving compared to peers with less efficient memory skills. Low working memory is...

Facial Feedback Hypothesis

Facial Feedback Hypothesis

Charles Darwin proposed that facial expressions are an evolutionary adaptation for communication. He argued that these expressions are not influenced by culture but are universal across species. For example, a snarling expression with exposed teeth signals a threat in many animals, including humans. Darwin also suggested that displaying an emotion can intensify the feeling. Smiling, for example, could enhance one's sense of happiness. This idea laid the foundation for understanding the role...

Masking and Demasking Agents

Masking and Demasking Agents

EDTA titrations may necessitate masking and demasking agents to temporarily protect a particular metal ion in a mixture from the EDTA reaction. These agents facilitate the sequential analysis of the metal ions by forming stable complexes with some—but not all—metal ions during certain steps.
There are many masking agents, such as cyanide, fluoride, triethanolamine, thiourea, and 2,3-bis(sulfanyl)propan-1-ol (formerly 2,3-dimercapto-1-propanol), with the masking agent chosen based on...

Uniform Depth Channel Flow: Problem Solving

Uniform Depth Channel Flow: Problem Solving

To calculate the flow rate for a trapezoidal channel, first, identify the bottom width, side slope, and flow depth of the channel. The cross-sectional area (A) corresponding to the depth of flow (y), channel bottom width (B), and side slope (θ) is determined by:Next, calculate the wetted perimeter, which includes the bottom width and the sloped side lengths in contact with the water. Using the values of the cross-sectional area and the wetted perimeter, determine the hydraulic radius by...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Gaze following requires early visual experience.

Proceedings of the National Academy of Sciences of the United States of America·2022

Same author

Oculo-retinal dynamics can explain the perception of minimal recognizable configurations.

Proceedings of the National Academy of Sciences of the United States of America·2021

Same author

Full interpretation of minimal images.

Cognition·2017

Same author

Atoms of recognition in human and computer vision.

Proceedings of the National Academy of Sciences of the United States of America·2016

Same journal

Tau protein as a regulator of mitochondrial function and dynamics.

Proceedings of the National Academy of Sciences of the United States of America·2026

Same journal

A scalable, dividing cell model for the robust propagation and quantification of human sporadic Creutzfeldt-Jakob disease prions.

Proceedings of the National Academy of Sciences of the United States of America·2026

Same journal

Epigenetic regulation of mesenchymal BMP signaling directs postnatal organ innervation.

Proceedings of the National Academy of Sciences of the United States of America·2026

Same journal

Single-shot wide-field biochemical imaging at 1 kHz frame rate.

Proceedings of the National Academy of Sciences of the United States of America·2026

Same journal

Morphogenesis and topological evolution of a frustrated nematic liquid crystal under confinement.

Proceedings of the National Academy of Sciences of the United States of America·2026

Same journal

B cell-intrinsic CXCR3 drives efficient generation of ectopic pulmonary germinal center responses to influenza A virus infection.

Proceedings of the National Academy of Sciences of the United States of America·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jul 15, 2025

Real-Time Proxy-Control of Re-Parameterized Peripheral Signals using a Close-Loop Interface

Real-Time Proxy-Control of Re-Parameterized Peripheral Signals using a Close-Loop Interface

Published on: May 8, 2021

Human-like scene interpretation by a guided counterstream processing.

Shimon Ullman¹, Liav Assif¹, Alona Strugatski¹

¹Department of Computer Science, the Weizmann Institute of Science, Rehovot 76100, Israel.

Proceedings of the National Academy of Sciences of the United States of America

|September 28, 2023

Summary

This summary is machine-generated.

This study introduces a novel AI model for guided scene interpretation, mimicking human perception by using sequential top-down instructions for efficient analysis and improved generalization in AI vision systems.

Keywords:

combinatorial generalization guided vision scene perception scene understanding top–down processing

More Related Videos

Profiling Maternal Behavior Responses During Whole-Brain Imaging

Profiling Maternal Behavior Responses During Whole-Brain Imaging

Published on: January 24, 2025

Perceptual and Category Processing of the Uncanny Valley Hypothesis' Dimension of Human Likeness: Some Methodological Issues

Perceptual and Category Processing of the Uncanny Valley Hypothesis' Dimension of Human Likeness: Some Methodological Issues

Published on: June 3, 2013

Related Experiment Videos

Last Updated: Jul 15, 2025

Real-Time Proxy-Control of Re-Parameterized Peripheral Signals using a Close-Loop Interface

Real-Time Proxy-Control of Re-Parameterized Peripheral Signals using a Close-Loop Interface

Published on: May 8, 2021

Profiling Maternal Behavior Responses During Whole-Brain Imaging

Profiling Maternal Behavior Responses During Whole-Brain Imaging

Published on: January 24, 2025

Perceptual and Category Processing of the Uncanny Valley Hypothesis' Dimension of Human Likeness: Some Methodological Issues

Perceptual and Category Processing of the Uncanny Valley Hypothesis' Dimension of Human Likeness: Some Methodological Issues

Published on: June 3, 2013

Area of Science:

Computer Vision
Cognitive Science
Artificial Intelligence

Background:

Current AI models excel at recognizing scene components but struggle with holistic scene analysis.
Human scene perception involves selective, goal-directed interpretation rather than exhaustive scene graphing.
Guidance is essential for effective scene interpretation, as full scene representation is often computationally infeasible.

Purpose of the Study:

To develop a model that performs human-like guided scene interpretation.
To enable AI systems to interpret complex visual scenes efficiently and adaptively.
To address limitations in current AI models regarding combinatorial generalization and multimodal information integration.

Main Methods:

An iterative bottom-up, top-down processing model inspired by cortical circuitry.
Sequential application of top-down instructions to guide scene interpretation.
Integration of visual and non-visual information within each interpretation cycle.

Main Results:

The model successfully extracts viewer-relevant scene structures using automatically selected top-down instructions.
Demonstrated enhanced combinatorial generalization capabilities for unseen scene configurations.
Showcased the ability to integrate multi-modal information during scene interpretation.

Conclusions:

The proposed model offers a more human-like approach to scene interpretation in AI.
It overcomes key limitations of current AI vision models in generalization and multimodal integration.
This work advances AI vision systems by enabling more nuanced and goal-directed scene understanding.