Abstract: | In this paper, we present a binarization technique
specifically designed for historical document images.
Existing methods for this problem focus either on
finding a good global threshold or on adapting the
threshold to each area so as to remove smear,
stains, uneven illumination, etc. We propose a hybrid
approach that first applies a global thresholding
method and then identifies the image areas that are
more likely to still contain noise. Each of these areas is
re-processed separately to achieve better binarization
quality. We evaluate the proposed approach on
different kinds of degradation. The results
show that our method can handle hard cases, while
documents already in good condition are not
drastically affected. |
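
The two-stage idea described in the abstract (a global threshold first, then separate re-processing of areas that still look noisy) can be illustrated with a minimal sketch. The abstract does not specify which global method is used, how noisy areas are detected, or how they are re-processed, so everything below is an assumption made for illustration: Otsu's method stands in for the global threshold, the image is split into fixed square blocks, a block is flagged as noisy when its ink ratio exceeds a hypothetical `max_ink_ratio`, and flagged blocks are re-thresholded with Otsu's method computed on the block alone. The names `otsu_threshold`, `hybrid_binarize`, `block`, and `max_ink_ratio` are hypothetical.

```python
def otsu_threshold(pixels):
    """Return the Otsu threshold t for 8-bit gray values; p <= t counts as ink."""
    hist = [0] * 256
    for p in pixels:
        hist[p] += 1
    total = len(pixels)
    sum_all = sum(i * h for i, h in enumerate(hist))
    best_t, best_var = 0, -1.0
    w0 = 0      # number of pixels with value <= t
    sum0 = 0    # sum of gray values of those pixels
    for t in range(256):
        w0 += hist[t]
        sum0 += t * hist[t]
        if w0 == 0:
            continue
        w1 = total - w0
        if w1 == 0:
            break
        m0 = sum0 / w0
        m1 = (sum_all - sum0) / w1
        var = w0 * w1 * (m0 - m1) ** 2   # between-class variance (unnormalized)
        if var > best_var:
            best_t, best_var = t, var
    return best_t


def hybrid_binarize(img, block=8, max_ink_ratio=0.5):
    """Global Otsu pass, then local Otsu on blocks that look noisy.

    img is a list of rows of gray values in 0..255; the result uses 1 for ink
    and 0 for background. The noise test (ink ratio above max_ink_ratio) is an
    assumed heuristic, not the criterion of the paper.
    """
    h, w = len(img), len(img[0])

    # Stage 1: one global threshold for the whole page.
    t_global = otsu_threshold([p for row in img for p in row])
    out = [[1 if p <= t_global else 0 for p in row] for row in img]

    # Stage 2: revisit blocks whose global result is suspiciously dark.
    for y0 in range(0, h, block):
        for x0 in range(0, w, block):
            ys = range(y0, min(y0 + block, h))
            xs = range(x0, min(x0 + block, w))
            ink = sum(out[y][x] for y in ys for x in xs)
            if ink / (len(ys) * len(xs)) <= max_ink_ratio:
                continue  # block looks clean; keep the global result
            local = [img[y][x] for y in ys for x in xs]
            if min(local) == max(local):
                continue  # uniform block; a local threshold cannot split it
            t_local = otsu_threshold(local)
            for y in ys:
                for x in xs:
                    out[y][x] = 1 if img[y][x] <= t_local else 0
    return out
```

Under these assumptions, blocks that the global pass already handles well are left untouched, which mirrors the claim that documents in good condition are not drastically affected; only blocks flagged by the heuristic receive a second, local threshold.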