Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Visual System

Visual System

Light enters the eye through the cornea, a transparent, dome-shaped surface covering the surface of the eyeball that helps to direct and focus incoming light. This light is then channeled toward the pupil, an adjustable opening whose size is controlled by the iris. The iris, a pigmented muscle, regulates the amount of light entering the eye by contracting or dilating the pupil, thereby ensuring optimal light levels for clear vision.
Once through the pupil, the light passes through the lens, a...

Vision

Vision

Vision is the result of light being detected and transduced into neural signals by the retina of the eye. This information is then further analyzed and interpreted by the brain. First, light enters the front of the eye and is focused by the cornea and lens onto the retina—a thin sheet of neural tissue lining the back of the eye. Because of refraction through the convex lens of the eye, images are projected onto the retina upside-down and reversed.

Depth Perception and Spatial Vision

Depth Perception and Spatial Vision

Depth perception is the ability to perceive objects three-dimensionally. It relies on two types of cues: binocular and monocular. Binocular cues depend on the combination of images from both eyes and how the eyes work together. Since the eyes are in slightly different positions, each eye captures a slightly different image. This disparity between images, known as binocular disparity, helps the brain interpret depth. When the brain compares these images, it determines the distance to an object.

Encoding

Encoding

Information enters the brain through encoding, which is the input of information into the memory system. Once sensory information is received from the environment, the brain labels or codes it. The information is then organized with similar information and connected to existing concepts. Encoding occurs through automatic processing and effortful processing.
Automatic processing involves the encoding of details like time, space, frequency, and the meaning of words, usually done without conscious...

Neural Circuits

Neural Circuits

Neural circuits and neuronal pools are two of the main structures found in the nervous system. Neural circuits are networks of neurons that work together to carry out a specific task or process. They consist of interconnected neurons and glial cells, which provide structural and metabolic support.
Neuronal pools are collections of nerve cells with similar functions and interact through chemical and electrical signals. These pools include both interneurons (the central neural circuit nodes that...

Associative Learning

Associative Learning

Associative learning is a fundamental concept in behavioral psychology, wherein a connection is established between two stimuli or events, leading to a learned response. This process is critical in understanding how behaviors are acquired and modified. Conditioning, the mechanism through which associations are formed, can be divided into two main types: classical conditioning and operant conditioning, each elucidating different aspects of associative learning.
Classical conditioning, also known...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Toward Accurate Procedure Planning in Instructional Videos: Visual State Generation Helps Task-Selective Diffusion.

IEEE transactions on pattern analysis and machine intelligence·2025

Same author

Fusing Echocardiography Images and Medical Records for Continuous Patient Stratification.

IEEE transactions on ultrasonics, ferroelectrics, and frequency control·2025

Same author

Hierarchical clustering in mean-field coupled Stuart-Landau oscillators.

Chaos (Woodbury, N.Y.)·2025

Same author

TRUSTED: The Paired 3D Transabdominal Ultrasound and CT Human Data for Kidney Segmentation and Registration Research.

Scientific data·2025

Same author

Unveiling the Tapestry: The Interplay of Generalization and Forgetting in Continual Learning.

IEEE transactions on neural networks and learning systems·2025

Same author

Optimization of Rank Losses for Image Retrieval.

IEEE transactions on pattern analysis and machine intelligence·2025

Same journal

Hidden Data Recovery and Forecasting via Next-Generation Reservoir Computing With Multiscale Delay Selection.

IEEE transactions on neural networks and learning systems·2026

Same journal

CAFF-CIL: Causality-Aware Freedom Forgetting Approach for Class-Incremental Learning.

IEEE transactions on neural networks and learning systems·2026

Same journal

Harmonic Autoencoding Framework for Multiple Tasks in Magnetic Particle Imaging Reconstruction.

IEEE transactions on neural networks and learning systems·2026

Same journal

A Survey on Human-Centric Voice-Face Multimodal Learning.

IEEE transactions on neural networks and learning systems·2026

Same journal

Vision-Assisted Foundation Model for Solving Multitask Vehicle Routing Problems.

IEEE transactions on neural networks and learning systems·2026

Same journal

FP3O: Enabling Proximal Policy Optimization in Multiagent Cooperation With Parameter-Sharing Versatility.

IEEE transactions on neural networks and learning systems·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Apr 20, 2026

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

Learning deep hierarchical visual feature coding.

Hanlin Goh, Nicolas Thome, Matthieu Cord

IEEE Transactions on Neural Networks and Learning Systems

|November 25, 2014

Summary

This summary is machine-generated.

This study introduces a hybrid deep learning architecture for image categorization. The novel approach uses hierarchical coding with restricted Boltzmann machines (RBMs) to achieve competitive accuracy and efficient feature representation.

More Related Videos

Creating Objects and Object Categories for Studying Perception and Perceptual Learning

Creating Objects and Object Categories for Studying Perception and Perceptual Learning

Published on: November 2, 2012

Related Experiment Videos

Last Updated: Apr 20, 2026

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

Creating Objects and Object Categories for Studying Perception and Perceptual Learning

Creating Objects and Object Categories for Studying Perception and Perceptual Learning

Published on: November 2, 2012

Area of Science:

Computer Vision
Machine Learning
Artificial Intelligence

Background:

Traditional bag-of-words models struggle with complex image representations.
Deep learning architectures offer powerful adaptability but can be computationally intensive.
Integrating these approaches is key to advancing image categorization.

Purpose of the Study:

To propose a hybrid architecture combining bag-of-words and deep learning for enhanced image categorization.
To develop a hierarchical coding scheme using spatial aggregating restricted Boltzmann machines (RBMs).
To evaluate the model's performance on standard image categorization datasets.

Main Methods:

Utilizing local gradient-based descriptors (e.g., SIFT) encoded via a hierarchical RBM scheme.
Regularizing RBMs with sparse and selective distribution fitting for each coding layer.
Employing supervised fine-tuning to optimize visual representations for categorization tasks.

Main Results:

Achieved competitive image categorization accuracies of 79.7% (Caltech-101) and 86.4% (15-Scenes).
Learned compact visual representations with fast inference compared to sparse coding methods.
Demonstrated transferability of learned low-level features across different image datasets.

Conclusions:

The proposed hybrid architecture effectively combines the strengths of traditional and deep learning methods.
Hierarchical coding with RBMs provides efficient and transferable visual representations.
Supervised fine-tuning significantly enhances categorization performance, especially in deeper architectures.