Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Language and Cognition01:27

Language and Cognition

524
Language serves as a bridge between ideas and communication, influencing how individuals perceive and interact with the world. Psychologists have long debated whether language shapes thought or vice versa. This discussion gained grip with Edward Sapir and Benjamin Lee Whorf in the 1940s, who proposed that language determines thought, a concept known as linguistic determinism. They suggested that the vocabulary and structure of a language influence how its speakers think and perceive reality.
524
Components of Language01:24

Components of Language

531
Language, whether spoken, signed, or written, consists of specific components: lexicon and grammar. The lexicon is the vocabulary of a language, comprising its words. Grammar is the set of rules used to convey meaning through the lexicon. For example, English grammar adds “-ed” to most verbs to indicate past tense. Words are formed by combining phonemes, which are the basic sound units of a language. Different languages have different sets of phonemes (e.g., “ah” vs.
531
Higher Mental Functions of the Brain: Language01:10

Higher Mental Functions of the Brain: Language

2.1K
Language is a system of communication that allows the expression of thoughts, ideas, and feelings. The brain processes language in both hemispheres.
Language formation and comprehension take place in the dominant hemisphere. The dominant hemisphere is responsible for understanding the meaning of spoken, written, or sign language, as well as the ability to communicate. For most people, the left hemisphere is the dominant one. The right hemisphere, then, gives tone and emotional context to the...
2.1K
Tip-of-the-Tongue Phenomenon01:10

Tip-of-the-Tongue Phenomenon

287
The tip-of-the-tongue (TOT) phenomenon is a cognitive experience characterized by a temporary inability to retrieve specific information from memory despite having a strong feeling of knowing the information. Although individuals cannot access the target word or detail, they frequently recall related elements, such as its initial letter, syllable count, or context. This partial retrieval often causes frustration, as one might recognize a familiar face or know that a name starts with a specific...
287
Detection of Gross Error: The Q Test01:00

Detection of Gross Error: The Q Test

6.6K
When one or more data points appear far from the rest of the data, there is a need to determine whether they are outliers and whether they should be eliminated from the data set to ensure an accurate representation of the measured value. In many cases, outliers arise from gross errors (or human errors) and do not accurately reflect the underlying phenomenon. In some cases, however, these apparent outliers reflect true phenomenological differences. In these cases, we can use statistical methods...
6.6K
Language Development01:22

Language Development

588
Children master language quickly and with relative ease, supported by both biological predisposition and reinforcement. B. F. Skinner (1957) proposed that language is learned through reinforcement, while Noam Chomsky (1965) argued that language acquisition mechanisms are biologically determined.
The critical period for language acquisition suggests that the ability to acquire language is at its peak early in life. As people age, this proficiency decreases. Language development begins very...
588

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same journal

The Neural Network for Sign Language Comprehension.

Language and linguistics compass·2025
Same journal

Taking language science to zoom school: Virtual outreach to elementary school students.

Language and linguistics compass·2022
Same journal

Five sources of bias in natural language processing.

Language and linguistics compass·2022
Same journal

Teaching & Learning Guide for: The neurocognitive basis of skilled reading in prelingually and profoundly deaf adults.

Language and linguistics compass·2021
Same journal

The neurocognitive basis of skilled reading in prelingually and profoundly deaf adults.

Language and linguistics compass·2021
Same journal

THE <i>POWER</i> OF NEURAL OSCILLATIONS TO INFORM SENTENCE COMPREHENSION: A LINGUISTIC PERSPECTIVE.

Language and linguistics compass·2020
See all related articles

Related Experiment Video

Updated: Oct 30, 2025

Using Eye Movements Recorded in the Visual World Paradigm to Explore the Online Processing of Spoken Language
09:27

Using Eye Movements Recorded in the Visual World Paradigm to Explore the Online Processing of Spoken Language

Published on: October 13, 2018

10.3K

Linguistic issues behind visual question answering.

Raffaella Bernardi1, Sandro Pezzelle2

  • 1CIMeC and DISI University of Trento Trento Italy.

Language and Linguistics Compass
|July 5, 2021
PubMed
Summary
This summary is machine-generated.

Answering questions grounded in images requires understanding language and visuals. While progress is made, a unified computational approach is still needed for visual question answering.

More Related Videos

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

791
Eye Tracking During Visually Situated Language Comprehension: Flexibility and Limitations in Uncovering Visual Context Effects
07:36

Eye Tracking During Visually Situated Language Comprehension: Flexibility and Limitations in Uncovering Visual Context Effects

Published on: November 30, 2018

16.0K

Related Experiment Videos

Last Updated: Oct 30, 2025

Using Eye Movements Recorded in the Visual World Paradigm to Explore the Online Processing of Spoken Language
09:27

Using Eye Movements Recorded in the Visual World Paradigm to Explore the Online Processing of Spoken Language

Published on: October 13, 2018

10.3K
Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

791
Eye Tracking During Visually Situated Language Comprehension: Flexibility and Limitations in Uncovering Visual Context Effects
07:36

Eye Tracking During Visually Situated Language Comprehension: Flexibility and Limitations in Uncovering Visual Context Effects

Published on: November 30, 2018

16.0K

Area of Science:

  • Computational Linguistics
  • Computer Vision
  • Cognitive Science

Background:

  • Visually-grounded question answering (VQA) is a complex task involving linguistic and visual understanding.
  • Historically, VQA has been a challenge for computational natural language understanding (NLU) systems.
  • Recent machine learning advancements have revitalized VQA research at the intersection of AI fields.

Purpose of the Study:

  • To review current approaches to VQA, including datasets, models, and frameworks.
  • To analyze VQA progress from a theoretical linguistics perspective using established desiderata.
  • To identify gaps and propose future research directions for a unified VQA approach.

Main Methods:

  • Literature review of VQA research.
  • Analysis of computational achievements against theoretical linguistic desiderata.
  • Synthesis of current trends and identification of future research needs.

Main Results:

  • Significant progress has been achieved in VQA, reconciling engineering and theoretical perspectives.
  • Current VQA systems demonstrate impressive capabilities in integrating visual and linguistic information.
  • However, a comprehensive approach addressing all linguistic challenges in VQA remains an open problem.

Conclusions:

  • Further research is essential to develop a unified computational framework for VQA.
  • Future work should focus on integrating semantic, syntactic, and pragmatic understanding with visual context.
  • The field needs to bridge the gap between theoretical linguistic requirements and current AI capabilities for VQA.