Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Videos

Context-based multiscale classification of document images using wavelet coefficient distributions.

J Li1, R M Gray

  • 1Xerox Palo Alto Research Center, Palo Alto, CA 94304, USA. jiali@isl.stanford.edu

IEEE Transactions on Image Processing : a Publication of the IEEE Signal Processing Society
|February 12, 2008
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

The ribosomal DNA loci in Plasmodium falciparum accumulate mutations independently.

Journal of molecular biology·1995
Same author

Binding of phenylarsenoxide to Arg-tRNA protein transferase is independent of vicinal thiols.

Biochemistry·1995
Same author

The oncogene qin codes for a transcriptional repressor.

Cancer research·1995
Same author

Non-receptor cytosolic protein tyrosine kinases from various rat tissues.

Biochimica et biophysica acta·1995
Same author

Mutagenesis in the C-terminal region of human interleukin 5 reveals a central patch for receptor alpha chain recognition.

Proceedings of the National Academy of Sciences of the United States of America·1995
Same author

Identification and characterization of a novel related adhesion focal tyrosine kinase (RAFTK) from megakaryocytes and brain.

The Journal of biological chemistry·1995
Same journal

Style-Aware Contrastive Test-Time Adaptation: A Dual-Cache Model for Robust Vision-Language Alignment.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
Same journal

Semantic Frame Interpolation.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
Same journal

Physics-Guided Cross-Modal Decoupling with Test-Time Adaptation for Hyperspectral Image Restoration.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
Same journal

Change-Prior-Guided Unsupervised Change Detection of Heterogeneous Remote Sensing Images.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
Same journal

AgonicDreamer: Enhancing Multi-View Consistency in Text-to-3D Generation via Rectified Score Distillation.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
Same journal

BiCM-Prompt: Bidirectional Cross-Modal Prompt Tuning for Class-Incremental Learning on Multisource Remote Sensing Images.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
See all related articles

This study introduces a new algorithm for document image segmentation, classifying images into background, photograph, text, and graph categories using wavelet coefficients. The method offers adaptive multiscale classification and context accumulation for enhanced accuracy and speed.

Area of Science:

  • Computer Vision
  • Image Processing
  • Pattern Recognition

Background:

  • Accurate segmentation of document images is crucial for information retrieval and analysis.
  • Existing methods often struggle with class boundaries and overall efficiency.

Purpose of the Study:

  • To develop an adaptive algorithm for segmenting document images into four distinct classes: background, photograph, text, and graph.
  • To improve classification accuracy and processing speed through multiscale analysis and context accumulation.

Main Methods:

  • Feature extraction based on distribution patterns of wavelet coefficients in high-frequency bands.
  • Adaptive multiscale classification, processing images at various resolutions.
  • Incorporation of accumulated context information to refine classification.

Related Experiment Videos

Main Results:

  • The algorithm successfully segments document images into the four specified classes.
  • The multiscale nature enables accurate classification at boundaries and efficient overall processing.
  • Accumulated context information demonstrably improves classification accuracy.

Conclusions:

  • The developed algorithm provides an effective and efficient solution for document image segmentation.
  • Its adaptive multiscale and context-aware approach offers significant advantages over traditional methods.