Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Visual System01:26

Visual System

2.4K
Light enters the eye through the cornea, a transparent, dome-shaped surface covering the surface of the eyeball that helps to direct and focus incoming light. This light is then channeled toward the pupil, an adjustable opening whose size is controlled by the iris. The iris, a pigmented muscle, regulates the amount of light entering the eye by contracting or dilating the pupil, thereby ensuring optimal light levels for clear vision.
Once through the pupil, the light passes through the lens, a...
2.4K
Vision01:24

Vision

61.9K
Vision is the result of light being detected and transduced into neural signals by the retina of the eye. This information is then further analyzed and interpreted by the brain. First, light enters the front of the eye and is focused by the cornea and lens onto the retina—a thin sheet of neural tissue lining the back of the eye. Because of refraction through the convex lens of the eye, images are projected onto the retina upside-down and reversed.
61.9K
Depth Perception and Spatial Vision01:15

Depth Perception and Spatial Vision

2.7K
Depth perception is the ability to perceive objects three-dimensionally. It relies on two types of cues: binocular and monocular. Binocular cues depend on the combination of images from both eyes and how the eyes work together. Since the eyes are in slightly different positions, each eye captures a slightly different image. This disparity between images, known as binocular disparity, helps the brain interpret depth. When the brain compares these images, it determines the distance to an object.
2.7K
Encoding01:19

Encoding

1.0K
Information enters the brain through encoding, which is the input of information into the memory system. Once sensory information is received from the environment, the brain labels or codes it. The information is then organized with similar information and connected to existing concepts. Encoding occurs through automatic processing and effortful processing.
Automatic processing involves the encoding of details like time, space, frequency, and the meaning of words, usually done without conscious...
1.0K
Neural Circuits01:25

Neural Circuits

3.4K
Neural circuits and neuronal pools are two of the main structures found in the nervous system. Neural circuits are networks of neurons that work together to carry out a specific task or process. They consist of interconnected neurons and glial cells, which provide structural and metabolic support.
Neuronal pools are collections of nerve cells with similar functions and interact through chemical and electrical signals. These pools include both interneurons (the central neural circuit nodes that...
3.4K
Associative Learning01:27

Associative Learning

2.1K
Associative learning is a fundamental concept in behavioral psychology, wherein a connection is established between two stimuli or events, leading to a learned response. This process is critical in understanding how behaviors are acquired and modified. Conditioning, the mechanism through which associations are formed, can be divided into two main types: classical conditioning and operant conditioning, each elucidating different aspects of associative learning.
Classical conditioning, also known...
2.1K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Toward Accurate Procedure Planning in Instructional Videos: Visual State Generation Helps Task-Selective Diffusion.

IEEE transactions on pattern analysis and machine intelligence·2025
Same author

Fusing Echocardiography Images and Medical Records for Continuous Patient Stratification.

IEEE transactions on ultrasonics, ferroelectrics, and frequency control·2025
Same author

Hierarchical clustering in mean-field coupled Stuart-Landau oscillators.

Chaos (Woodbury, N.Y.)·2025
Same author

TRUSTED: The Paired 3D Transabdominal Ultrasound and CT Human Data for Kidney Segmentation and Registration Research.

Scientific data·2025
Same author

Unveiling the Tapestry: The Interplay of Generalization and Forgetting in Continual Learning.

IEEE transactions on neural networks and learning systems·2025
Same author

Optimization of Rank Losses for Image Retrieval.

IEEE transactions on pattern analysis and machine intelligence·2025
Same journal

Hidden Data Recovery and Forecasting via Next-Generation Reservoir Computing With Multiscale Delay Selection.

IEEE transactions on neural networks and learning systems·2026
Same journal

CAFF-CIL: Causality-Aware Freedom Forgetting Approach for Class-Incremental Learning.

IEEE transactions on neural networks and learning systems·2026
Same journal

Harmonic Autoencoding Framework for Multiple Tasks in Magnetic Particle Imaging Reconstruction.

IEEE transactions on neural networks and learning systems·2026
Same journal

A Survey on Human-Centric Voice-Face Multimodal Learning.

IEEE transactions on neural networks and learning systems·2026
Same journal

Vision-Assisted Foundation Model for Solving Multitask Vehicle Routing Problems.

IEEE transactions on neural networks and learning systems·2026
Same journal

FP3O: Enabling Proximal Policy Optimization in Multiagent Cooperation With Parameter-Sharing Versatility.

IEEE transactions on neural networks and learning systems·2026
See all related articles

Related Experiment Video

Updated: Apr 20, 2026

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
03:31

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

1.2K

Learning deep hierarchical visual feature coding.

Hanlin Goh, Nicolas Thome, Matthieu Cord

    IEEE Transactions on Neural Networks and Learning Systems
    |November 25, 2014
    PubMed
    Summary
    This summary is machine-generated.

    This study introduces a hybrid deep learning architecture for image categorization. The novel approach uses hierarchical coding with restricted Boltzmann machines (RBMs) to achieve competitive accuracy and efficient feature representation.

    More Related Videos

    Creating Objects and Object Categories for Studying Perception and Perceptual Learning
    14:38

    Creating Objects and Object Categories for Studying Perception and Perceptual Learning

    Published on: November 2, 2012

    12.4K

    Related Experiment Videos

    Last Updated: Apr 20, 2026

    Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
    03:31

    Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

    Published on: December 15, 2023

    1.2K
    Creating Objects and Object Categories for Studying Perception and Perceptual Learning
    14:38

    Creating Objects and Object Categories for Studying Perception and Perceptual Learning

    Published on: November 2, 2012

    12.4K

    Area of Science:

    • Computer Vision
    • Machine Learning
    • Artificial Intelligence

    Background:

    • Traditional bag-of-words models struggle with complex image representations.
    • Deep learning architectures offer powerful adaptability but can be computationally intensive.
    • Integrating these approaches is key to advancing image categorization.

    Purpose of the Study:

    • To propose a hybrid architecture combining bag-of-words and deep learning for enhanced image categorization.
    • To develop a hierarchical coding scheme using spatial aggregating restricted Boltzmann machines (RBMs).
    • To evaluate the model's performance on standard image categorization datasets.

    Main Methods:

    • Utilizing local gradient-based descriptors (e.g., SIFT) encoded via a hierarchical RBM scheme.
    • Regularizing RBMs with sparse and selective distribution fitting for each coding layer.
    • Employing supervised fine-tuning to optimize visual representations for categorization tasks.

    Main Results:

    • Achieved competitive image categorization accuracies of 79.7% (Caltech-101) and 86.4% (15-Scenes).
    • Learned compact visual representations with fast inference compared to sparse coding methods.
    • Demonstrated transferability of learned low-level features across different image datasets.

    Conclusions:

    • The proposed hybrid architecture effectively combines the strengths of traditional and deep learning methods.
    • Hierarchical coding with RBMs provides efficient and transferable visual representations.
    • Supervised fine-tuning significantly enhances categorization performance, especially in deeper architectures.