Skip to main content

Interactive Layout Detection

  • Conference paper
  • First Online:
Pattern Recognition and Image Analysis (IbPRIA 2017)

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 10255))

Included in the following conference series:

Abstract

The amounts of ancient documents transcribed by means of Handwritten Text Recognition (HTR) technology have been rising dramatically over the last years. Consequently, the development and enhancement of HTR methods and algorithms have become an important issue in the field, with significant contributions in performance for documents with segmented layout. However, Layout Analysis remains a bottleneck in the development and generalization of HTR technology. In this work a new Interactive-Probabilistic method to obtain document layout is presented. This new method incorporates the user feedback in the Layout Analysis process, in order to provide not just a very accurate layout, but an interactive framework in which user feedback is used to help the system to fix any error.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    To avoid numerical problems, calculations are implemented in the logarithm form.

  2. 2.

    Notice that, under deterministic feedback, the signal can be used directly in the system without any decoding [13].

References

  1. Bosch, V., Bordes-Cabrera, I., Muñoz, P.C., Hernández-Tornero, C., Leiva, L.A., Pastor, M., Romero, V., Toselli, A.H., Vidal, E.: Computer-assisted transcription of a historical botanical specimen book: organization and process overview categories and subject descriptors. In: Proceedings of the First International Conference on Digital Access to Textual Cultural Heritage, Madrid, Spain, pp. 125–130 (2014)

    Google Scholar 

  2. Bukhari, S.S., Breuel, T.M., Asi, A., El-Sana, J.: Layout analysis for Arabic historical document images using machine learning. In: Proceedings of the 2012 ICFHR, pp. 639–644. (2012). http://dx.doi.org/10.1109/ICFHR.2012.227

  3. Cattoni, R., Coianiz, T., Messelodi, S., Modena, C.M.: Geometric layout analysis techniques for document image understanding: a review. Technical report, ITC-irst (1998)

    Google Scholar 

  4. Duda, R.O., Hart, P.E., Stork, D.G.: Pattern Classification, 2nd edn. Wiley-Interscience, Hoboken (2000)

    MATH  Google Scholar 

  5. Gonzalez, R.C., Woods, R.E.: Digital Image Processing, 3rd edn. Prentice Hall, Englewood Cliffs (2008)

    Google Scholar 

  6. Lazzara, G., Geraud, T., Levillain, R.: Planting, growing, and pruning trees: connected filters applied to document image analysis. In: 2014 11th IAPR International Workshop on Document Analysis Systems, pp. 36–40 (2014)

    Google Scholar 

  7. Okazaki, N.: CRFsuite: a fast implementation of conditional random fields (CRFs) (2007). http://www.chokkan.org/software/crfsuite/

  8. Parker, C., Altun, Y., Tadepalli, P.: Guest editorial: special issue on structured prediction. Mach. Learn. 77(2–3), 161–164 (2009)

    Article  Google Scholar 

  9. Pletschacher, S., Antonacopoulos, A.: The PAGE (Page Analysis and Ground-truth Elements) format framework. In: Proceedings - International Conference on Pattern Recognition, pp. 257–260 (2010)

    Google Scholar 

  10. Romero, V., Serrano, N., Toselli, A.H., Sanchez, J.A., Vidal, E.: Handwritten text recognition for historical documents. In: Proceedings of the Workshop on Language Technologies for Digital Humanities and Cultural Heritage, Hissar, Bulgaria, pp. 90–96, September 2011

    Google Scholar 

  11. Stamatopoulos, N., Louloudis, G., Gatos, B.: Goal-oriented performance evaluation methodology for page segmentation techniques. In: 13th International Confrence on Document Analysis and Recognition - ICDAR 2015, pp. 281–285 (2015)

    Google Scholar 

  12. Sutton, C., McCallum, A.: An introduction to conditional random fields. Found. Trends Mach. Learn. 4(4), 267–373 (2012)

    Article  MATH  Google Scholar 

  13. Toselli, A.H., Vidal, E., Casacuberta, F.: Multimodal Interactive Pattern Recognition and Applications. Springer, Heidelberg (2011)

    Book  MATH  Google Scholar 

  14. Vil’kin, A.M., Safonov, I.V., Egorova, M.A.: Algorithm for segmentation of documents based on texture features. Pattern Recogn. Image Anal. 23(1), 153–159 (2013)

    Article  Google Scholar 

Download references

Acknowledgements

First author has been partially supported by MICITT of Costa Rica through the PINN program (PEM-002-15-2). Moreover this work has been also partially supported by the Generalitat Valenciana under the Prometeo/2009/014 project grant ALMAMATER, by MINECO/ FEDER under project TIN2015-70924-C2-1-R (CoMUN-HaT), and through the EU projects: HIMANIS (JPICH programme, Spanish grant Ref. PCIN-2015-068) and READ (Horizon-2020 programme, grant Ref. 674943).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Lorenzo Quirós .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Quirós, L., Martínez-Hinarejos, CD., Toselli, A.H., Vidal, E. (2017). Interactive Layout Detection. In: Alexandre, L., Salvador Sánchez, J., Rodrigues, J. (eds) Pattern Recognition and Image Analysis. IbPRIA 2017. Lecture Notes in Computer Science(), vol 10255. Springer, Cham. https://doi.org/10.1007/978-3-319-58838-4_18

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-58838-4_18

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-58837-7

  • Online ISBN: 978-3-319-58838-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics