Abstract
In this paper, we propose an approach for understanding mathematical expressions in printed documents. The system consists of three main components: (i) detection of mathematical expressions in a document, (ii) recognition of the symbols present in an expression, and (iii) meaningful arrangement of the recognized symbols. Detection is not independent of recognition: expressions are detected through recognition of their symbols, supplemented by some structural features of the expressions. Symbols are recognized with a hybrid of feature-based and template-based techniques. The bounding-box coordinates and size information of the symbols help to determine the spatial relationships among them. A set of predefined grammar rules is used to form meaningful symbol groups so that the symbols are arranged properly. Experiments conducted using these approaches on a large number of documents show high accuracy.
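To make the role of bounding boxes and symbol size concrete, the following is a minimal sketch of how a spatial relationship between two adjacent symbols might be inferred. It is not the authors' method: the `Box` type, the `spatial_relation` function, the `size_ratio` threshold, and the quarter-height bands are all assumptions chosen for illustration, since the abstract does not give the actual rules.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Box:
    """Axis-aligned bounding box in image coordinates (y grows downward)."""
    left: float
    top: float
    right: float
    bottom: float

    @property
    def height(self) -> float:
        return self.bottom - self.top

    @property
    def center_y(self) -> float:
        return (self.top + self.bottom) / 2.0


def spatial_relation(base: Box, other: Box, size_ratio: float = 0.8) -> str:
    """Classify `other` (assumed to follow `base` on the right) relative to `base`.

    Hypothetical rule, for illustration only: a symbol counts as a script
    if it is noticeably smaller than the base (height below `size_ratio`
    times the base height) and its vertical centre falls in the top or
    bottom quarter band of the base, or beyond it.
    """
    is_smaller = other.height < size_ratio * base.height
    upper_band = base.top + 0.25 * base.height     # above this: superscript zone
    lower_band = base.bottom - 0.25 * base.height  # below this: subscript zone

    if is_smaller and other.center_y < upper_band:
        return "superscript"
    if is_smaller and other.center_y > lower_band:
        return "subscript"
    return "inline"


# Usage: a base symbol followed by a small raised symbol, as in "x^2".
base = Box(left=0, top=10, right=10, bottom=30)
raised = Box(left=11, top=4, right=17, bottom=14)
relation = spatial_relation(base, raised)
```

In a full system, relations of this kind would feed the grammar rules that group symbols into meaningful units. The thresholds here would need tuning per font and scan resolution.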
Copyright information
© 1999 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Chaudhuri, B.B., Garain, U. (1999). An Approach for Processing Mathematical Expressions in Printed Document. In: Lee, SW., Nakano, Y. (eds) Document Analysis Systems: Theory and Practice. DAS 1998. Lecture Notes in Computer Science, vol 1655. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-48172-9_25
DOI: https://doi.org/10.1007/3-540-48172-9_25
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-66507-6
Online ISBN: 978-3-540-48172-0
eBook Packages: Springer Book Archive