Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Rapid Artificial Intelligence Autoplanning Rivals Manual Expert Planning for Cervical Brachytherapy.

Practical radiation oncology·2026
Same author

WTAP -mediated N6‑methylation of TRAF6 Facilitates Polycystic Ovary Syndrome by Inducing Ferroptosis.

Reproductive sciences (Thousand Oaks, Calif.)·2026
Same author

Macula Spatial Patterns and Their Association With Central Visual Field Progression in Glaucoma Using Artificial Intelligence.

Journal of glaucoma·2026
Same author

BMAL1 downregulation exacerbates age-related nonalcoholic steatohepatitis by promoting NLRP3 inflammasome activation via HIF-1É‘-mediated glycolysis.

Free radical biology & medicine·2026
Same author

Immunological Dynamics in PCOS T Cell Exhaustion TIGIT Upregulation Regulated by METTL3-Mediated N6 RNA Methylation.

American journal of reproductive immunology (New York, N.Y. : 1989)·2025
Same author

Integration of single-click, AI-based brachytherapy auto-planning for cervical cancer within a treatment planning system.

Brachytherapy·2025
Same journal

Relation DETR+: Exploring Explicit Position Relation Prior for Dense Prediction.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

RBF++: Quantifying and Optimizing Reasoning Boundaries across Measurable and Unmeasurable Capabilities for Chain-of-Thought Reasoning.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

CAFE: Cross-View Adaptive Fusion and Cluster Center Enhancement for Robust Multi-View Clustering.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

DIVER: Reinforced Diffusion Breaks Imitation Bottlenecks in End-to-End Autonomous Driving.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

Ethics-Aware Safe Reinforcement Learning for Rare-Event Risk Control in Interactive Urban Driving.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

Learning Shape Anchors for Holistic Indoor Scene Understanding.

IEEE transactions on pattern analysis and machine intelligence·2026
See all related articles

Related Experiment Video

Updated: Jan 2, 2026

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
03:31

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

969

Cascade R-CNN: High Quality Object Detection and Instance Segmentation.

Zhaowei Cai, Nuno Vasconcelos

    IEEE Transactions on Pattern Analysis and Machine Intelligence
    |December 4, 2019
    PubMed
    Summary
    This summary is machine-generated.

    The Cascade R-CNN improves object detection by using sequential detectors trained at increasing Intersection over Union (IoU) thresholds. This method overcomes challenges like overfitting and quality mismatches, achieving state-of-the-art results.

    More Related Videos

    Automated Analysis of C. elegans Fluorescence Images using SegElegans
    06:27

    Automated Analysis of C. elegans Fluorescence Images using SegElegans

    Published on: October 10, 2025

    523
    Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography
    04:48

    Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

    Published on: November 30, 2022

    3.3K

    Related Experiment Videos

    Last Updated: Jan 2, 2026

    Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
    03:31

    Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

    Published on: December 15, 2023

    969
    Automated Analysis of C. elegans Fluorescence Images using SegElegans
    06:27

    Automated Analysis of C. elegans Fluorescence Images using SegElegans

    Published on: October 10, 2025

    523
    Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography
    04:48

    Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

    Published on: November 30, 2022

    3.3K

    Area of Science:

    • Computer Vision
    • Machine Learning
    • Deep Learning

    Background:

    • Intersection over Union (IoU) threshold is crucial for defining positive/negative samples in object detection.
    • Commonly used IoU thresholds (e.g., 0.5) can lead to noisy detections, while higher thresholds often degrade performance.
    • Existing methods face a paradox where increasing IoU thresholds for higher quality detections leads to overfitting and inference-time mismatches.

    Purpose of the Study:

    • To propose a novel multi-stage object detection architecture, Cascade R-CNN, to address the paradox of high-quality detection.
    • To improve detection performance by mitigating overfitting and resolving quality mismatches between detectors and hypotheses.
    • To generalize the Cascade R-CNN architecture for instance segmentation tasks.

    Main Methods:

    • A sequential cascade of detectors is trained with progressively increasing IoU thresholds.
    • The output of each detector is used as the training set for the subsequent detector in the cascade.
    • The same cascaded structure is applied during inference to ensure quality consistency.

    Main Results:

    • The Cascade R-CNN architecture achieves state-of-the-art performance on the COCO dataset.
    • Significant improvements in high-quality detection are observed across various datasets (VOC, KITTI, CityPerson, WiderFace).
    • The architecture, when generalized to instance segmentation, shows notable improvements over Mask R-CNN.

    Conclusions:

    • Cascade R-CNN effectively resolves the paradox of high-quality object detection by addressing overfitting and inference-time mismatches.
    • The proposed sequential training and inference strategy enhances detection quality and robustness.
    • The architecture offers a promising direction for both object detection and instance segmentation.