Abstract
This paper presents an algorithm based on Flood Fill, Component Labelling, and Region Adjacency Graphs for removing noisy borders in monochromatic images of documents introduced by the digitalization process using automatically fed scanners. The new algorithm was tested on 20,000 images and provided better quality images and time-space performance than its predecessors including the widespread used commercial tools.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Ávila, B.T., Lins, R.D.: A New Algorithm for Removing Noisy Borders from Monochromatic Documents. In: Proc. of ACM-SAC 2004, pp. 1219–1225. ACM Press, Cyprus (2004)
Ávila, B.T., Lins, R.D.: Removing Noise Borders from Monochromatic Scanned Documents (in preparation)
Baird, H.S.: Document image defect models and their uses. In: Proc. Snd Int. Conf. on Document Analysis and Recognition, Japan, pp. 62–67. IEEE Comp. Soc. Los Alamitos (1993)
Berger, M.: Computer Graphics with Pascal. Addison-Wesley, Reading (1986)
Cormen, T.H., Leiserson, C.E., Rivest, R.L., Stein, C.: Introduction to algorithms, 2nd edn. MIT Press, Cambridge (2001)
Fan, K.C., Wang, Y.K., Lay, T.R.: Marginal noise removal of document images. Patt. Recog. 35, 2593–2611 (2002)
O’Gorman, L., Kasturi, R.: Document Image Analysis, IEEE Computer Society Executive Briefing (1997)
Kanungo, T., Haralick, R.M., Phillips, I.: Global and local document degradation models. In: Proc. Snd Int. Conf. Doc. Analysis and Recognition, pp. 730–734 (1993)
Le, D.X.: Automated borders detection and adaptive segmentation for binary document images. National Library of Medicine, http://archive.nlm.nih.gov/pubs/le/twocols/twocols.php
Lins, R.D., Guimarães Neto, M.S., França Neto, L.R., Rosa, L.G.: An Environment for Processing Images of Historical Documents. Microprocessing & Microprogramming, pp. 111–121. North-Holland, Amsterdam (1995)
Lins, R.D., Machado, D.S.A.: A Comparative Study of File Formats for Image Storage and Transmission. Journal of Electronic Imaging 13(1), 175–183 (2004)
Mello, C.A.B., Lins, R.D.: Image Segmentation of Historical Documents. In: Visual 2000, Mexico (August 2000)
Shapiro, L.G., Stockman, G.C.: Computer Vision (March 2000), http://www.cse.msu.edu/~stockman/Book/book.html
BlackIce Document Imaging SDK 10. BlackIce Software Inc., http://www.blackice.com/
ClearImage 5. Inlite Research Inc., http://www.inliteresearch.com
Kodak Digital Science Scanner 1500, http://www.kodak.com/global/en/business/docimaging/1500002/
Leadtools 13. Leadtools Inc., http://www.leadtools.com
ScanFix Bitonal Image Optimizer 4.21. TMS Sequoia, http://www.tmsinc.com
Skyline Tools Corporate Suite 7. Skyline Tools Imaging, http://www.skylinetools.com
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ávila, B.T., Lins, R.D. (2004). Efficient Removal of Noisy Borders from Monochromatic Documents. In: Campilho, A., Kamel, M. (eds) Image Analysis and Recognition. ICIAR 2004. Lecture Notes in Computer Science, vol 3212. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30126-4_31
Download citation
DOI: https://doi.org/10.1007/978-3-540-30126-4_31
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-23240-7
Online ISBN: 978-3-540-30126-4
eBook Packages: Springer Book Archive