Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Inductive Reasoning00:59

Inductive Reasoning

63.8K
Inductive reasoning is a form of logical thinking that uses related observations to arrive at a general conclusion. It is uncertain and operates in degrees to which the conclusions are credible. As such, inductive arguments can be weak or strong, rather than valid or invalid, and conclusions can be used to formulate testable, falsifiable hypotheses.
Inductive reasoning is common in descriptive science. A life scientist makes observations and records them. This data can be qualitative or...
63.8K
Deductive Reasoning01:16

Deductive Reasoning

63.0K
Deductive reasoning, or deduction, is the type of logic used in hypothesis-based science. In deductive reasoning, the pattern of thinking moves in the opposite direction as compared to inductive reasoning, which means that it uses a general principle or law to predict specific results. From those general principles, a scientist can deduce and predict the specific results that would be valid as long as the general principles are valid.
For example, a researcher can deduce specific predictions...
63.0K
Reasoning01:30

Reasoning

213
Reasoning is the action of thinking about something in a logical, sensible way. It is integral to problem-solving, decision-making, and critical thinking. Reasoning can be inductive or deductive. Reasoning involves transforming information into conclusions, which is essential for problem-solving, decision-making, and critical thinking.
Inductive reasoning involves deriving generalizations from specific observations. This type of reasoning helps form beliefs about the world. For example,...
213
Language and Cognition01:27

Language and Cognition

534
Language serves as a bridge between ideas and communication, influencing how individuals perceive and interact with the world. Psychologists have long debated whether language shapes thought or vice versa. This discussion gained grip with Edward Sapir and Benjamin Lee Whorf in the 1940s, who proposed that language determines thought, a concept known as linguistic determinism. They suggested that the vocabulary and structure of a language influence how its speakers think and perceive reality.
534
Depth Perception and Spatial Vision01:15

Depth Perception and Spatial Vision

1.3K
Depth perception is the ability to perceive objects three-dimensionally. It relies on two types of cues: binocular and monocular. Binocular cues depend on the combination of images from both eyes and how the eyes work together. Since the eyes are in slightly different positions, each eye captures a slightly different image. This disparity between images, known as binocular disparity, helps the brain interpret depth. When the brain compares these images, it determines the distance to an object.
1.3K
Vision01:24

Vision

57.8K
Vision is the result of light being detected and transduced into neural signals by the retina of the eye. This information is then further analyzed and interpreted by the brain. First, light enters the front of the eye and is focused by the cornea and lens onto the retina—a thin sheet of neural tissue lining the back of the eye. Because of refraction through the convex lens of the eye, images are projected onto the retina upside-down and reversed.
57.8K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Tracklet Pair Proposal and Context Reasoning for Video Scene Graph Generation.

Sensors (Basel, Switzerland)·2021
Same author

Hybrid Imitation Learning Framework for Robotic Manipulation Tasks.

Sensors (Basel, Switzerland)·2021
Same author

NMN-VD: A Neural Module Network for Visual Dialog.

Sensors (Basel, Switzerland)·2021
Same author

Joint Multimodal Embedding and Backtracking Search in Vision-and-Language Navigation.

Sensors (Basel, Switzerland)·2021
Same author

Spatio-Temporal Action Detection in Untrimmed Videos by Using Multimodal Features and Region Proposals.

Sensors (Basel, Switzerland)·2019
Same author

A Robotic Context Query-Processing Framework Based on Spatio-Temporal Context Ontology.

Sensors (Basel, Switzerland)·2018
Same journal

RETRACTED: Zhang et al. A Novel Framework for Reconstruction and Imaging of Target Scattering Centers via Wide-Angle Incidence in Radar Networks. <i>Sensors</i> 2025, <i>25</i>, 6802.

Sensors (Basel, Switzerland)·2026
Same journal

Enhancing Unsupervised Multi-Source Domain Adaptation for Person Re-Identification via Mixture of Experts and Graph-Based Relation.

Sensors (Basel, Switzerland)·2026
Same journal

Development of an Instrumented Glove for Palmar Pressure Assessment in Kayakers.

Sensors (Basel, Switzerland)·2026
Same journal

Development and Experimental Validation of an Autonomous IoT-Based Monitoring System for Real-Time Water Quality Assessment in the Amazon River.

Sensors (Basel, Switzerland)·2026
Same journal

Semi-Supervised Adversarial Learning Framework for Controller Area Network Bus Intrusion Detection.

Sensors (Basel, Switzerland)·2026
Same journal

Smart Optimization Method for Safety Signs in Innovative Manufacturing Environments Integrating Industrial Field IoT Sensors and Knowledge Graphs.

Sensors (Basel, Switzerland)·2026
See all related articles

Related Experiment Video

Updated: Nov 7, 2025

Eye Tracking During Visually Situated Language Comprehension: Flexibility and Limitations in Uncovering Visual Context Effects
07:36

Eye Tracking During Visually Situated Language Comprehension: Flexibility and Limitations in Uncovering Visual Context Effects

Published on: November 30, 2018

16.0K

Vision-Language-Knowledge Co-Embedding for Visual Commonsense Reasoning.

JaeYun Lee1, Incheol Kim1

  • 1Department of Computer Science, Kyonggi University, Suwon-si 16227, Korea.

Sensors (Basel, Switzerland)
|April 30, 2021
PubMed
Summary
This summary is machine-generated.

This study introduces a new Vision-Language-Knowledge Co-embedding (ViLaKC) model for visual commonsense reasoning. The ViLaKC model effectively integrates external knowledge graphs to improve image and text understanding for answering questions.

Keywords:
graph convolutional networkknowledge graphmultimodal co-embeddingpretrained multi-head self-attention networkvisual commonsense reasoning

More Related Videos

Development of a Gaze-Contingent Display Framework Designed for Perceptual and Oculomotor Research with Simulated Central Vision Loss
07:12

Development of a Gaze-Contingent Display Framework Designed for Perceptual and Oculomotor Research with Simulated Central Vision Loss

Published on: April 11, 2025

644
Using Eye Movements Recorded in the Visual World Paradigm to Explore the Online Processing of Spoken Language
09:27

Using Eye Movements Recorded in the Visual World Paradigm to Explore the Online Processing of Spoken Language

Published on: October 13, 2018

10.3K

Related Experiment Videos

Last Updated: Nov 7, 2025

Eye Tracking During Visually Situated Language Comprehension: Flexibility and Limitations in Uncovering Visual Context Effects
07:36

Eye Tracking During Visually Situated Language Comprehension: Flexibility and Limitations in Uncovering Visual Context Effects

Published on: November 30, 2018

16.0K
Development of a Gaze-Contingent Display Framework Designed for Perceptual and Oculomotor Research with Simulated Central Vision Loss
07:12

Development of a Gaze-Contingent Display Framework Designed for Perceptual and Oculomotor Research with Simulated Central Vision Loss

Published on: April 11, 2025

644
Using Eye Movements Recorded in the Visual World Paradigm to Explore the Online Processing of Spoken Language
09:27

Using Eye Movements Recorded in the Visual World Paradigm to Explore the Online Processing of Spoken Language

Published on: October 13, 2018

10.3K

Area of Science:

  • Artificial Intelligence
  • Computer Vision
  • Natural Language Processing

Background:

  • Visual commonsense reasoning requires solving knowledge acquisition and multimodal alignment.
  • Existing models face challenges in integrating external knowledge effectively.

Purpose of the Study:

  • To propose a novel Vision-Language-Knowledge Co-embedding (ViLaKC) model.
  • To enhance visual commonsense reasoning by incorporating external knowledge graphs.

Main Methods:

  • The ViLaKC model extracts relevant knowledge graphs from ConceptNet.
  • It employs a pretrained vision-language-knowledge embedding module.
  • Graph convolutional neural networks and multi-head self-attention layers are used for co-embedding multimodal data.

Main Results:

  • The ViLaKC model demonstrates effectiveness and strong performance.
  • Experimental validation was conducted using the VCR v1.0 benchmark dataset.

Conclusions:

  • The proposed ViLaKC model offers a robust approach to visual commonsense reasoning.
  • Integrating external knowledge graphs significantly improves reasoning capabilities.