



Visual word ambiguity.

Jan C van Gemert¹, Cor J Veenman, Arnold W M Smeulders

  • ¹ Département d'Informatique, Ecole Normale Supérieure, Paris, France. j.c.vangemert@gmail.com

IEEE Transactions on Pattern Analysis and Machine Intelligence, May 22, 2010
Summary
This summary is machine-generated.

This study enhances automatic image classification by introducing soft assignment in codebook models. This approach improves performance over traditional hard assignment, especially with large vocabularies and more categories.


Area of Science:

  • Computer Vision
  • Machine Learning
  • Pattern Recognition

Background:

  • The codebook model represents images as collections of discrete visual words for classification.
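  • Traditional codebook models use hard assignment: each continuous image feature is mapped to exactly one discrete visual word, even when it lies close to several words.
  • This mismatch between discrete words and continuous features can limit classification accuracy.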
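
A minimal sketch of hard assignment as described above, assuming Euclidean distance and a normalized histogram. The function names and the toy 2-D data are hypothetical and chosen only for illustration; they are not taken from the paper.

```python
def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def hard_assignment_histogram(features, codebook):
    """Normalized histogram in which each feature votes for its single nearest visual word."""
    counts = [0.0] * len(codebook)
    for f in features:
        nearest = min(range(len(codebook)),
                      key=lambda j: squared_distance(f, codebook[j]))
        counts[nearest] += 1.0
    n = len(features)
    return [c / n for c in counts] if n else counts


# Toy data (hypothetical): two visual words and three image features.
codebook = [(0.0, 0.0), (1.0, 0.0)]
features = [(0.1, 0.0), (0.9, 0.0), (0.49, 0.0)]
hist = hard_assignment_histogram(features, codebook)
```

The third feature lies almost exactly between the two words, yet its entire vote goes to the first word. That lost information is the assignment ambiguity the study sets out to model.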

Purpose of the Study:

  • To investigate soft assignment methods for visual words in image classification.
  • To improve the performance of the codebook model by addressing assignment ambiguity.
  • To compare the proposed soft assignment model against the traditional hard assignment approach.

Main Methods:

  • Developed and investigated four types of soft assignment of visual words to image features.
  • Compared the proposed soft assignment model with the traditional hard assignment codebook model.
  • Evaluated performance on five benchmark datasets: 15 natural scenes, Caltech-101, Caltech-256, Pascal VOC 2007, and Pascal VOC 2008.
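
One possible form of soft assignment can be sketched as follows. This summary does not specify the four variants that were compared, so the Gaussian kernel, the per-feature normalization, and the `sigma` parameter below are assumptions made for illustration, not the authors' exact formulation.

```python
import math


def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def soft_assignment_histogram(features, codebook, sigma=0.25):
    """Normalized histogram in which each feature spreads one vote over all visual words.

    Weights follow a Gaussian kernel on distance (assumed here) and are
    normalized per feature so that every feature contributes a total of 1.
    """
    hist = [0.0] * len(codebook)
    for f in features:
        dists = [squared_distance(f, w) for w in codebook]
        d_min = min(dists)
        # Subtracting d_min leaves the normalized weights unchanged
        # but prevents underflow to an all-zero weight vector.
        weights = [math.exp(-(d - d_min) / (2.0 * sigma ** 2)) for d in dists]
        total = sum(weights)
        for j, wt in enumerate(weights):
            hist[j] += wt / total
    n = len(features)
    return [h / n for h in hist] if n else hist


# Toy data (hypothetical): two visual words and three image features.
codebook = [(0.0, 0.0), (1.0, 0.0)]
features = [(0.1, 0.0), (0.9, 0.0), (0.49, 0.0)]
hist = soft_assignment_histogram(features, codebook)
midpoint = soft_assignment_histogram([(0.5, 0.0)], codebook)
```

A feature exactly halfway between two words splits its vote evenly, and the near-midpoint feature at 0.49 contributes to both words rather than to one. A smaller `sigma` moves the result toward hard assignment; a larger one spreads votes more evenly.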

Main Results:

  • Explicitly modeling assignment ambiguity through soft assignment significantly improves classification performance.
  • The proposed model performs consistently at large codebook vocabulary sizes, where the performance of the traditional model deteriorates.
  • The soft assignment method shows greater benefits in high-dimensional feature spaces and with an increased number of image categories.

Conclusions:

  • Soft assignment in codebook models offers a robust improvement over traditional hard assignment for automatic image classification.
  • The proposed method is more resilient to challenges like large vocabularies and benefits from complex feature spaces.
  • This work provides a more effective approach to image classification using codebook models.