Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Videos

Performance evaluation and benchmarking of six-page segmentation algorithms.

Faisal Shafait1, Daniel Keysers, Thomas Breuel

  • 1Image understanding and Pattern Recongnition Research Group, German Research Center for Artificial Intelligence, Kaiserslautern, Germany. faisal.shafait@dfki.de

IEEE Transactions on Pattern Analysis and Machine Intelligence
|April 19, 2008
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

The NWRD Dataset: An Open-Source Annotated Segmentation Dataset of Diseased Wheat Crop.

Sensors (Basel, Switzerland)·2023
Same author

Corrigendum: The NMT Scalp EEG Dataset: An Open-Source Annotated Dataset of Healthy and Pathological EEG Recordings for Predictive Modeling.

Frontiers in neuroscience·2022
Same author

Bacterial prediction using internet of things (IoT) and machine learning.

Environmental monitoring and assessment·2022
Same author

The NMT Scalp EEG Dataset: An Open-Source Annotated Dataset of Healthy and Pathological EEG Recordings for Predictive Modeling.

Frontiers in neuroscience·2022
Same author

Correction to: Two-stage framework for optic disc localization and glaucoma classification in retinal fundus images using deep learning.

BMC medical informatics and decision making·2019
Same author

Converting a Common Low-Cost Document Scanner into a Multispectral Scanner.

Sensors (Basel, Switzerland)·2019
Same journal

HardFlow: Hard-Constrained Sampling for Flow-Matching Models Via Trajectory Optimization.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

Industrial Brain: Self-Evolving Neuro-Symbolic Autonomy with Causal Resilience for Cyber-Physical Systems.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

Adaptive Hardness-Driven Dictionary Distillation for Incomplete Streaming View Clustering.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

Mixture of Global and Local Experts with Diffusion Transformer for Controllable Face Generation.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

Task-KV: Task-aware KV Cache Optimization via Semantic Differentiation of Attention Heads.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

Achieving Text-based Person Retrieval with Any Granularity.

IEEE transactions on pattern analysis and machine intelligence·2026
See all related articles

Current page segmentation evaluation scores are insufficient. A new vectorial score accurately identifies critical errors in optical character recognition (OCR) systems, improving performance analysis.

Area of Science:

  • Computer Science
  • Image Processing
  • Pattern Recognition

Background:

  • Page segmentation is a critical step in Optical Character Recognition (OCR) systems, often limiting overall performance.
  • Existing evaluation metrics for page segmentation are inadequate for diagnosing specific error types and can miss significant segmentation flaws.

Purpose of the Study:

  • To introduce a novel vectorial evaluation score for page segmentation in OCR.
  • To address the limitations of current scores in identifying over-segmentation, under-segmentation, and mis-segmentation errors.
  • To provide a method for pixel-accurate evaluation on arbitrary region shapes with canonical ground truth.

Main Methods:

  • Development of a vectorial evaluation score sensitive to key segmentation errors.

Related Experiment Videos

  • Establishment of a canonical representation for ground truth data.
  • Pixel-accurate evaluation of segmentation algorithms on diverse region shapes.
  • Main Results:

    • The proposed vectorial score effectively identifies over-, under-, and mis-segmentation errors.
    • The evaluation scheme revealed specific flaws in widely used segmentation algorithms like x-y cut, smearing, and whitespace analysis.
    • Demonstrated the superiority of the new scheme in diagnosing segmentation method weaknesses on the UW-III database.

    Conclusions:

    • The new vectorial score offers a more informative and accurate method for evaluating OCR page segmentation.
    • This enhanced evaluation facilitates targeted improvements in segmentation algorithms.
    • The method provides a robust framework for benchmarking and understanding segmentation performance.