Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Depth Perception and Spatial Vision01:15

Depth Perception and Spatial Vision

2.6K
Depth perception is the ability to perceive objects three-dimensionally. It relies on two types of cues: binocular and monocular. Binocular cues depend on the combination of images from both eyes and how the eyes work together. Since the eyes are in slightly different positions, each eye captures a slightly different image. This disparity between images, known as binocular disparity, helps the brain interpret depth. When the brain compares these images, it determines the distance to an object.
2.6K
Parallel Processing01:20

Parallel Processing

889
The brain processes sensory information rapidly due to parallel processing, which involves sending data across multiple neural pathways at the same time. This method allows the brain to manage various sensory qualities, such as shapes, colors, movements, and locations, all concurrently. For instance, when observing a forest landscape, the brain simultaneously processes the movement of leaves, the shapes of trees, the depth between them, and the various shades of green. This enables a quick and...
889
Neural Circuits01:25

Neural Circuits

3.3K
Neural circuits and neuronal pools are two of the main structures found in the nervous system. Neural circuits are networks of neurons that work together to carry out a specific task or process. They consist of interconnected neurons and glial cells, which provide structural and metabolic support.
Neuronal pools are collections of nerve cells with similar functions and interact through chemical and electrical signals. These pools include both interneurons (the central neural circuit nodes that...
3.3K
Deconvolution01:20

Deconvolution

695
Deconvolution, also known as inverse filtering, is the process of extracting the impulse response from known input and output signals. This technique is vital in scenarios where the system's characteristics are unknown, and they must be inferred from the observable signals.
Deconvolution involves several mathematical techniques to derive the impulse response. One common approach is polynomial division. In this method, the input and output sequences are treated as coefficients of...
695
Vision01:24

Vision

61.6K
Vision is the result of light being detected and transduced into neural signals by the retina of the eye. This information is then further analyzed and interpreted by the brain. First, light enters the front of the eye and is focused by the cornea and lens onto the retina—a thin sheet of neural tissue lining the back of the eye. Because of refraction through the convex lens of the eye, images are projected onto the retina upside-down and reversed.
61.6K
Visual System01:26

Visual System

2.3K
Light enters the eye through the cornea, a transparent, dome-shaped surface covering the surface of the eyeball that helps to direct and focus incoming light. This light is then channeled toward the pupil, an adjustable opening whose size is controlled by the iris. The iris, a pigmented muscle, regulates the amount of light entering the eye by contracting or dilating the pupil, thereby ensuring optimal light levels for clear vision.
Once through the pupil, the light passes through the lens, a...
2.3K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

One-pot formation of chiral polysubstituted 3,4-dihydropyrans via a novel organocatalytic domino sequence involving alkynal self-condensation.

Organic letters·2012
Same author

Non-invasive microelectrode cadmium flux measurements reveal the spatial characteristics and real-time kinetics of cadmium transport in hyperaccumulator and nonhyperaccumulator ecotypes of Sedum alfredii.

Journal of plant physiology·2012
Same author

NO inhibitory guaianolide-derived terpenoids from Artemisia argyi.

Fitoterapia·2012
Same author

Rac1+ cells distributed in accordance with CD 133+ cells in glioblastomas and the elevated invasiveness of CD 133+ glioma cells with higher Rac1 activity.

Chinese medical journal·2012
Same author

Selective adsorption of Hg(II) by γ-radiation synthesized silica-graft-vinyl imidazole adsorbent.

Journal of hazardous materials·2012
Same author

Reconciliation of sequence data and updated annotation of the genome of Agrobacterium tumefaciens C58, and distribution of a linear chromosome in the genus Agrobacterium.

Applied and environmental microbiology·2012
Same journal

HardFlow: Hard-Constrained Sampling for Flow-Matching Models Via Trajectory Optimization.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

Industrial Brain: Self-Evolving Neuro-Symbolic Autonomy with Causal Resilience for Cyber-Physical Systems.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

Adaptive Hardness-Driven Dictionary Distillation for Incomplete Streaming View Clustering.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

Mixture of Global and Local Experts with Diffusion Transformer for Controllable Face Generation.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

Task-KV: Task-aware KV Cache Optimization via Semantic Differentiation of Attention Heads.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

Achieving Text-based Person Retrieval with Any Granularity.

IEEE transactions on pattern analysis and machine intelligence·2026
See all related articles

Related Experiment Video

Updated: Apr 4, 2026

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
03:31

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

1.2K

Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition.

Kaiming He, Xiangyu Zhang, Shaoqing Ren

    IEEE Transactions on Pattern Analysis and Machine Intelligence
    |September 10, 2015
    PubMed
    Summary
    This summary is machine-generated.

    Spatial pyramid pooling (SPP) enables deep convolutional neural networks (CNNs) to process arbitrary-sized images, improving accuracy and efficiency in image classification and object detection tasks.

    More Related Videos

    Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique
    04:48

    Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique

    Published on: July 5, 2024

    874

    Related Experiment Videos

    Last Updated: Apr 4, 2026

    Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
    03:31

    Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

    Published on: December 15, 2023

    1.2K
    Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique
    04:48

    Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique

    Published on: July 5, 2024

    874

    Area of Science:

    • Computer Vision
    • Deep Learning
    • Machine Learning

    Background:

    • Existing deep convolutional neural networks (CNNs) are limited by fixed-size input requirements, potentially hindering recognition accuracy for variable-scale images.
    • This limitation necessitates artificial image resizing, which can negatively impact performance.

    Purpose of the Study:

    • To introduce a novel pooling strategy, spatial pyramid pooling (SPP), to overcome the fixed-size input limitation in CNNs.
    • To enhance the flexibility and accuracy of CNNs for image classification and object detection tasks.

    Main Methods:

    • Developed SPP-net, a new network architecture incorporating spatial pyramid pooling.
    • SPP generates a fixed-length representation from images of any size, robust to object deformations.
    • Applied SPP-net to image classification and object detection tasks, including training detectors with pooled features from a single full-image computation.

    Main Results:

    • SPP-net demonstrated improved accuracy across various CNN architectures on the ImageNet 2012 dataset.
    • Achieved state-of-the-art classification results on Pascal VOC 2007 and Caltech101 datasets without fine-tuning.
    • In object detection, SPP-net significantly accelerated processing (24-102x faster than R-CNN) while maintaining or improving accuracy on Pascal VOC 2007.
    • Achieved top rankings (#2 in object detection, #3 in image classification) in the ILSVRC 2014 competition.

    Conclusions:

    • Spatial pyramid pooling effectively addresses the fixed-size input limitation in CNNs, enhancing performance.
    • SPP-net offers significant improvements in both image classification and object detection, demonstrating superior speed and accuracy.
    • The method provides a robust and efficient approach for deep learning-based computer vision applications.