Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Parallel Processing01:20

Parallel Processing

173
The brain processes sensory information rapidly due to parallel processing, which involves sending data across multiple neural pathways at the same time. This method allows the brain to manage various sensory qualities, such as shapes, colors, movements, and locations, all concurrently. For instance, when observing a forest landscape, the brain simultaneously processes the movement of leaves, the shapes of trees, the depth between them, and the various shades of green. This enables a quick and...
173
Nonconscious Mimicry01:13

Nonconscious Mimicry

4.6K
Nonconscious mimicry occurs when individuals alter their mannerisms to match the behaviors and expressions of those nearby, without intention.
4.6K
Information Processing Approach01:30

Information Processing Approach

60
The information-processing theory of cognitive development centers on fundamental mental processes, including attention, memory, and problem-solving skills. Researchers in this field examine how cognitive abilities, such as working memory, evolve and influence children's overall development. Studies indicate that children with stronger working memory tend to excel in reading comprehension, math, and problem-solving compared to peers with less efficient memory skills. Low working memory is...
60
Facial Feedback Hypothesis01:24

Facial Feedback Hypothesis

179
Charles Darwin proposed that facial expressions are an evolutionary adaptation for communication. He argued that these expressions are not influenced by culture but are universal across species. For example, a snarling expression with exposed teeth signals a threat in many animals, including humans. Darwin also suggested that displaying an emotion can intensify the feeling. Smiling, for example, could enhance one's sense of happiness. This idea laid the foundation for understanding the role...
179
Masking and Demasking Agents01:19

Masking and Demasking Agents

2.5K
EDTA titrations may necessitate masking and demasking agents to temporarily protect a particular metal ion in a mixture from the EDTA reaction. These agents facilitate the sequential analysis of the metal ions by forming stable complexes with some—but not all—metal ions during certain steps.
There are many masking agents, such as cyanide, fluoride, triethanolamine, thiourea, and 2,3-bis(sulfanyl)propan-1-ol (formerly 2,3-dimercapto-1-propanol), with the masking agent chosen based on...
2.5K
Uniform Depth Channel Flow: Problem Solving01:18

Uniform Depth Channel Flow: Problem Solving

82
To calculate the flow rate for a trapezoidal channel, first, identify the bottom width, side slope, and flow depth of the channel. The cross-sectional area (A) corresponding to the depth of flow (y), channel bottom width (B), and side slope (θ) is determined by:Next, calculate the wetted perimeter, which includes the bottom width and the sloped side lengths in contact with the water. Using the values of the cross-sectional area and the wetted perimeter, determine the hydraulic radius by...
82

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Gaze following requires early visual experience.

Proceedings of the National Academy of Sciences of the United States of America·2022
Same author

Oculo-retinal dynamics can explain the perception of minimal recognizable configurations.

Proceedings of the National Academy of Sciences of the United States of America·2021
Same author

Full interpretation of minimal images.

Cognition·2017
Same author

Atoms of recognition in human and computer vision.

Proceedings of the National Academy of Sciences of the United States of America·2016
Same journal

Tau protein as a regulator of mitochondrial function and dynamics.

Proceedings of the National Academy of Sciences of the United States of America·2026
Same journal

A scalable, dividing cell model for the robust propagation and quantification of human sporadic Creutzfeldt-Jakob disease prions.

Proceedings of the National Academy of Sciences of the United States of America·2026
Same journal

Epigenetic regulation of mesenchymal BMP signaling directs postnatal organ innervation.

Proceedings of the National Academy of Sciences of the United States of America·2026
Same journal

Single-shot wide-field biochemical imaging at 1 kHz frame rate.

Proceedings of the National Academy of Sciences of the United States of America·2026
Same journal

Morphogenesis and topological evolution of a frustrated nematic liquid crystal under confinement.

Proceedings of the National Academy of Sciences of the United States of America·2026
Same journal

B cell-intrinsic CXCR3 drives efficient generation of ectopic pulmonary germinal center responses to influenza A virus infection.

Proceedings of the National Academy of Sciences of the United States of America·2026
See all related articles

Related Experiment Video

Updated: Jul 15, 2025

Real-Time Proxy-Control of Re-Parameterized Peripheral Signals using a Close-Loop Interface
11:54

Real-Time Proxy-Control of Re-Parameterized Peripheral Signals using a Close-Loop Interface

Published on: May 8, 2021

4.4K

Human-like scene interpretation by a guided counterstream processing.

Shimon Ullman1, Liav Assif1, Alona Strugatski1

  • 1Department of Computer Science, the Weizmann Institute of Science, Rehovot 76100, Israel.

Proceedings of the National Academy of Sciences of the United States of America
|September 28, 2023
PubMed
Summary
This summary is machine-generated.

This study introduces a novel AI model for guided scene interpretation, mimicking human perception by using sequential top-down instructions for efficient analysis and improved generalization in AI vision systems.

Keywords:
combinatorial generalizationguided visionscene perceptionscene understandingtop–down processing

More Related Videos

Profiling Maternal Behavior Responses During Whole-Brain Imaging
07:12

Profiling Maternal Behavior Responses During Whole-Brain Imaging

Published on: January 24, 2025

790
Perceptual and Category Processing of the Uncanny Valley Hypothesis' Dimension of Human Likeness: Some Methodological Issues
07:34

Perceptual and Category Processing of the Uncanny Valley Hypothesis' Dimension of Human Likeness: Some Methodological Issues

Published on: June 3, 2013

17.4K

Related Experiment Videos

Last Updated: Jul 15, 2025

Real-Time Proxy-Control of Re-Parameterized Peripheral Signals using a Close-Loop Interface
11:54

Real-Time Proxy-Control of Re-Parameterized Peripheral Signals using a Close-Loop Interface

Published on: May 8, 2021

4.4K
Profiling Maternal Behavior Responses During Whole-Brain Imaging
07:12

Profiling Maternal Behavior Responses During Whole-Brain Imaging

Published on: January 24, 2025

790
Perceptual and Category Processing of the Uncanny Valley Hypothesis' Dimension of Human Likeness: Some Methodological Issues
07:34

Perceptual and Category Processing of the Uncanny Valley Hypothesis' Dimension of Human Likeness: Some Methodological Issues

Published on: June 3, 2013

17.4K

Area of Science:

  • Computer Vision
  • Cognitive Science
  • Artificial Intelligence

Background:

  • Current AI models excel at recognizing scene components but struggle with holistic scene analysis.
  • Human scene perception involves selective, goal-directed interpretation rather than exhaustive scene graphing.
  • Guidance is essential for effective scene interpretation, as full scene representation is often computationally infeasible.

Purpose of the Study:

  • To develop a model that performs human-like guided scene interpretation.
  • To enable AI systems to interpret complex visual scenes efficiently and adaptively.
  • To address limitations in current AI models regarding combinatorial generalization and multimodal information integration.

Main Methods:

  • An iterative bottom-up, top-down processing model inspired by cortical circuitry.
  • Sequential application of top-down instructions to guide scene interpretation.
  • Integration of visual and non-visual information within each interpretation cycle.

Main Results:

  • The model successfully extracts viewer-relevant scene structures using automatically selected top-down instructions.
  • Demonstrated enhanced combinatorial generalization capabilities for unseen scene configurations.
  • Showcased the ability to integrate multi-modal information during scene interpretation.

Conclusions:

  • The proposed model offers a more human-like approach to scene interpretation in AI.
  • It overcomes key limitations of current AI vision models in generalization and multimodal integration.
  • This work advances AI vision systems by enabling more nuanced and goal-directed scene understanding.