Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition.

Kaiming He, Xiangyu Zhang, Shaoqing Ren

IEEE Transactions on Pattern Analysis and Machine Intelligence

September 10, 2015

Summary

This summary is machine-generated.

Spatial pyramid pooling (SPP) enables deep convolutional neural networks (CNNs) to process arbitrary-sized images, improving accuracy and efficiency in image classification and object detection tasks.

Area of Science:

Computer Vision
Deep Learning
Machine Learning

Background:

Existing deep convolutional neural networks (CNNs) require a fixed-size input image, which can reduce recognition accuracy for images or sub-images of arbitrary size or scale.
Meeting this requirement means artificially cropping or warping each image to the fixed size, which can degrade performance.

Purpose of the Study:

To introduce a novel pooling strategy, spatial pyramid pooling (SPP), to overcome the fixed-size input limitation in CNNs.
To enhance the flexibility and accuracy of CNNs for image classification and object detection tasks.

Main Methods:

Developed SPP-net, a new network architecture incorporating spatial pyramid pooling.
SPP generates a fixed-length representation regardless of image size or scale, and pyramid pooling is also robust to object deformations.
Applied SPP-net to image classification and object detection; for detection, feature maps are computed from the full image only once, and features pooled from arbitrary regions are used to train the detectors.
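
The pooling step described above can be sketched in a few lines. This is a minimal illustration, not the authors' implementation: the function name `spp_max_pool`, the pyramid levels `(1, 2, 4)`, and the floor/ceil bin boundaries are assumptions chosen for clarity, and the feature map is a plain nested list (channels x rows x columns) so the example needs only the standard library.

```python
import math


def spp_max_pool(feature_map, levels=(1, 2, 4)):
    """Max-pool a C x H x W feature map (nested lists) into a fixed-length list.

    For each pyramid level n, every channel is split into an n x n grid of bins
    and the maximum of each bin is kept. The bin boundaries (floor for the start,
    ceil for the end) are an assumed convention, not taken from the source.
    """
    pooled = []
    for n in levels:
        for channel in feature_map:
            h, w = len(channel), len(channel[0])
            for i in range(n):
                r0, r1 = (i * h) // n, math.ceil((i + 1) * h / n)
                for j in range(n):
                    c0, c1 = (j * w) // n, math.ceil((j + 1) * w / n)
                    pooled.append(
                        max(channel[r][c] for r in range(r0, r1) for c in range(c0, c1))
                    )
    return pooled


def make_map(channels, height, width):
    """Hypothetical helper: build a feature map filled with increasing integers."""
    return [
        [[(ch * height + r) * width + c for c in range(width)] for r in range(height)]
        for ch in range(channels)
    ]


# Two inputs with different spatial sizes give vectors of the same length.
small = spp_max_pool(make_map(2, 5, 7))
large = spp_max_pool(make_map(2, 13, 9))
print(len(small), len(large))  # 42 42
```

The output length depends only on the channel count and the pyramid levels, here 2 x (1 + 4 + 16) = 42, which is why a fixed-size layer can follow the pooling regardless of the input image size.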

Main Results:

SPP-net demonstrated improved accuracy across various CNN architectures on the ImageNet 2012 dataset.
Achieved state-of-the-art classification results on Pascal VOC 2007 and Caltech101 datasets without fine-tuning.
In object detection, SPP-net ran 24 to 102 times faster than R-CNN while maintaining or improving accuracy on Pascal VOC 2007.
Achieved top rankings (#2 in object detection, #3 in image classification) in the ILSVRC 2014 competition.
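
The detection speed-up reported above follows from running the convolutional layers once per image and then reading each candidate region out of the shared feature map, rather than re-running the network on every region. The sketch below illustrates only that lookup step under stated assumptions: the names `project_box` and `crop_region`, the stride of 16 pixels per feature-map cell, and the simple floor/ceil rounding are all hypothetical choices for illustration, not the authors' exact mapping.

```python
import math


def project_box(box, stride=16):
    """Map an image-space box (x0, y0, x1, y1) to feature-map cell indices.

    `stride` is the assumed number of image pixels per feature-map cell.
    The returned window is half-open: [fx0, fx1) by [fy0, fy1).
    """
    x0, y0, x1, y1 = box
    fx0, fy0 = x0 // stride, y0 // stride
    # Always keep at least one cell, even for very small boxes.
    fx1 = max(fx0 + 1, math.ceil(x1 / stride))
    fy1 = max(fy0 + 1, math.ceil(y1 / stride))
    return fx0, fy0, fx1, fy1


def crop_region(feature_map, box, stride=16):
    """Slice a C x H x W feature map (nested lists) to the projected window."""
    fx0, fy0, fx1, fy1 = project_box(box, stride)
    return [[row[fx0:fx1] for row in channel[fy0:fy1]] for channel in feature_map]


# One shared 1 x 10 x 10 feature map, computed once for the whole image.
shared = [[[r * 10 + c for c in range(10)] for r in range(10)]]

# Each candidate region is a cheap slice of that map, not a new network pass.
window = crop_region(shared, (32, 48, 100, 130))
print(len(window[0]), len(window[0][0]))  # 6 5
```

Because the cropped windows vary in size, each one would then be passed through spatial pyramid pooling to obtain the fixed-length vector the detector is trained on.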

Conclusions:

Spatial pyramid pooling removes the fixed-size input limitation in CNNs.
SPP-net improves accuracy in image classification and speeds up object detection while matching or exceeding the accuracy of R-CNN.
The method provides a robust and efficient approach for deep learning-based computer vision applications.