Abstract
The amounts of ancient documents transcribed by means of Handwritten Text Recognition (HTR) technology have been rising dramatically over the last years. Consequently, the development and enhancement of HTR methods and algorithms have become an important issue in the field, with significant contributions in performance for documents with segmented layout. However, Layout Analysis remains a bottleneck in the development and generalization of HTR technology. In this work a new Interactive-Probabilistic method to obtain document layout is presented. This new method incorporates the user feedback in the Layout Analysis process, in order to provide not just a very accurate layout, but an interactive framework in which user feedback is used to help the system to fix any error.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
To avoid numerical problems, calculations are implemented in the logarithm form.
- 2.
Notice that, under deterministic feedback, the signal can be used directly in the system without any decoding [13].
References
Bosch, V., Bordes-Cabrera, I., Muñoz, P.C., Hernández-Tornero, C., Leiva, L.A., Pastor, M., Romero, V., Toselli, A.H., Vidal, E.: Computer-assisted transcription of a historical botanical specimen book: organization and process overview categories and subject descriptors. In: Proceedings of the First International Conference on Digital Access to Textual Cultural Heritage, Madrid, Spain, pp. 125–130 (2014)
Bukhari, S.S., Breuel, T.M., Asi, A., El-Sana, J.: Layout analysis for Arabic historical document images using machine learning. In: Proceedings of the 2012 ICFHR, pp. 639–644. (2012). http://dx.doi.org/10.1109/ICFHR.2012.227
Cattoni, R., Coianiz, T., Messelodi, S., Modena, C.M.: Geometric layout analysis techniques for document image understanding: a review. Technical report, ITC-irst (1998)
Duda, R.O., Hart, P.E., Stork, D.G.: Pattern Classification, 2nd edn. Wiley-Interscience, Hoboken (2000)
Gonzalez, R.C., Woods, R.E.: Digital Image Processing, 3rd edn. Prentice Hall, Englewood Cliffs (2008)
Lazzara, G., Geraud, T., Levillain, R.: Planting, growing, and pruning trees: connected filters applied to document image analysis. In: 2014 11th IAPR International Workshop on Document Analysis Systems, pp. 36–40 (2014)
Okazaki, N.: CRFsuite: a fast implementation of conditional random fields (CRFs) (2007). http://www.chokkan.org/software/crfsuite/
Parker, C., Altun, Y., Tadepalli, P.: Guest editorial: special issue on structured prediction. Mach. Learn. 77(2–3), 161–164 (2009)
Pletschacher, S., Antonacopoulos, A.: The PAGE (Page Analysis and Ground-truth Elements) format framework. In: Proceedings - International Conference on Pattern Recognition, pp. 257–260 (2010)
Romero, V., Serrano, N., Toselli, A.H., Sanchez, J.A., Vidal, E.: Handwritten text recognition for historical documents. In: Proceedings of the Workshop on Language Technologies for Digital Humanities and Cultural Heritage, Hissar, Bulgaria, pp. 90–96, September 2011
Stamatopoulos, N., Louloudis, G., Gatos, B.: Goal-oriented performance evaluation methodology for page segmentation techniques. In: 13th International Confrence on Document Analysis and Recognition - ICDAR 2015, pp. 281–285 (2015)
Sutton, C., McCallum, A.: An introduction to conditional random fields. Found. Trends Mach. Learn. 4(4), 267–373 (2012)
Toselli, A.H., Vidal, E., Casacuberta, F.: Multimodal Interactive Pattern Recognition and Applications. Springer, Heidelberg (2011)
Vil’kin, A.M., Safonov, I.V., Egorova, M.A.: Algorithm for segmentation of documents based on texture features. Pattern Recogn. Image Anal. 23(1), 153–159 (2013)
Acknowledgements
First author has been partially supported by MICITT of Costa Rica through the PINN program (PEM-002-15-2). Moreover this work has been also partially supported by the Generalitat Valenciana under the Prometeo/2009/014 project grant ALMAMATER, by MINECO/ FEDER under project TIN2015-70924-C2-1-R (CoMUN-HaT), and through the EU projects: HIMANIS (JPICH programme, Spanish grant Ref. PCIN-2015-068) and READ (Horizon-2020 programme, grant Ref. 674943).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Quirós, L., Martínez-Hinarejos, CD., Toselli, A.H., Vidal, E. (2017). Interactive Layout Detection. In: Alexandre, L., Salvador Sánchez, J., Rodrigues, J. (eds) Pattern Recognition and Image Analysis. IbPRIA 2017. Lecture Notes in Computer Science(), vol 10255. Springer, Cham. https://doi.org/10.1007/978-3-319-58838-4_18
Download citation
DOI: https://doi.org/10.1007/978-3-319-58838-4_18
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-58837-7
Online ISBN: 978-3-319-58838-4
eBook Packages: Computer ScienceComputer Science (R0)