Recognising Tabular Mathematical Expressions Using Graph Rewriting

  • Mohamed Alkalai
Part of the Lecture Notes in Computer Science book series (LNCS, volume 8259)

Abstract

While a number of techniques have been developed for table recognition in ordinary text documents, very little work has been done on tables that contain mathematical expressions. The latter problem is complicated by the fact that mathematical formulae often have a tabular layout themselves, thus not only blurring the distinction between table and content structure, but often leading to a number of possible, equally valid interpretations. However, a reliable understanding of the layout of a formula is often a necessary prerequisite to further semantic interpretation. In this paper, a graph representation for complex mathematical table structures is presented. A set of rewriting rules is applied to the graph allows for reliable re-composition of cells in order to identify several valid table interpretations. The effectiveness of the technique is demonstrated by applying it to a set of mathematical tables from standard text books that has been manually ground-truthed.

Keywords

Production Rule Virtual Node Mathematical Table Table Structure Initial Graph 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

  1. 1.
    Alkalai, M., Sorge, V.: Issues in mathematical table recognition. In: Conferences on Intelligent Computer Mathematics (CICM 2012), MIR Workshop (2012)Google Scholar
  2. 2.
    Amano, A., Asada, N.: Graph grammar based analysis system of complex table form document. In: ICDAR, pp. 916–920. IEEE Computer Society (2003)Google Scholar
  3. 3.
    Cooperman, R., Armon Rahgozar, M.: A graph-based table recognition system. In: SPIE Proc., pp. 192–203 (1996)Google Scholar
  4. 4.
    Hu, J., Kashi, R.S., Lopresti, D.P., Wilfong, G.T., Nagy, G.: Why table ground-truthing is hard. In: ICDAR, pp. 129–133 (2001)Google Scholar
  5. 5.
    Jeffrey, A., Zwillinger, D.: Table of Integrals, Series, and Products. Elsevier Inc. (2007)Google Scholar
  6. 6.
    Ramel, J., Crucianu, M., Vincent, N., Faure, C.: Detection, extraction and representation of tables. In: Proceedings of the Seventh International Conference on Document Analysis and Recognition, vol. 1, pp. 374–378. IEEE Computer Society, Washington, DC (2003)CrossRefGoogle Scholar
  7. 7.
    Costa Silva, A., Jorge, A.M., Torgo, L.: Design of an end-to-end method to extract information from tables. International Journal Document Analysis Research 8(2), 144–171 (2006)CrossRefGoogle Scholar
  8. 8.
    Zanibbi, R., Blostein, D., Cordy, J.R.: A survey of table recognition: Models, observations, transformations, and inferences. Int. J. Doc. Anal. Recognit. 7(1), 1–16 (2004)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2013

Authors and Affiliations

  • Mohamed Alkalai
    • 1
  1. 1.School of Computer ScienceUniversity of BirminghamUK

Personalised recommendations