Skip to main content

VMEXT: A Visualization Tool for Mathematical Expression Trees

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 10383))

Abstract

Mathematical expressions can be represented as a tree consisting of terminal symbols, such as identifiers or numbers (leaf nodes), and functions or operators (non-leaf nodes). Expression trees are an important mechanism for storing and processing mathematical expressions as well as the most frequently used visualization of the structure of mathematical expressions. Typically, researchers and practitioners manually visualize expression trees using general-purpose tools. This approach is laborious, redundant, and error-prone. Manual visualizations represents a user’s notion of what the markup of an expression should be, but not necessarily what the actual markup is. This paper presents VMEXT – a free and open source tool to directly visualize expression trees from parallel . VMEXT simultaneously visualizes the presentation elements and the semantic structure of mathematical expressions to enable users to quickly spot deficiencies in the Content markup that does not affect the presentation of the expression. Identifying such discrepancies previously required reading the verbose and complex markup. VMEXT also allows one to visualize similar and identical elements of two expressions. Visualizing expression similarity can support developers in designing retrieval approaches and enable improved interaction concepts for users of mathematical information retrieval systems. We demonstrate VMEXT’s visualizations in two web-based applications. The first application presents the visualizations alone. The second application shows a possible integration of the visualizations in systems for mathematical knowledge management and mathematical information retrieval. The application converts input to parallel , computes basic similarity measures for mathematical expressions, and visualizes the results using VMEXT.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    www.w3.org/Math/.

  2. 2.

    http://www.citeplag.org.

  3. 3.

    https://www.mediawiki.org/wiki/Mathoid.

  4. 4.

    https://github.com/zorkow/speech-rule-engine.

  5. 5.

    http://vmext.formulasearchengine.com.

  6. 6.

    http://vmext-demo.formulasearchengine.com.

References

  1. Altamimi, M.E., Youssef, A.S.: A math query language with an expanded set of wildcards. Math. Comput. Sci. 305–331 (2008). doi:10.1007/s11786-008-0056-4

  2. Cervone, D., Krautzberger, P., Sorge, V.: Towards meaningful visual abstraction of mathematical notation. In: Proceedings CICM (2015)

    Google Scholar 

  3. Cohl, H.S., McClain, M.A., Saunders, B.V., Schubotz, M., Williams, J.C.: Digital repository of mathematical formulae. In: Watt, S.M., Davenport, J.H., Sexton, A.P., Sojka, P., Urban, J. (eds.) CICM 2014. LNCS (LNAI), vol. 8543, pp. 419–422. Springer, Cham (2014). doi:10.1007/978-3-319-08434-3_30

    Chapter  Google Scholar 

  4. Cohl, H.S., Schubotz, M., McClain, M.A., Saunders, B.V., Zou, C.Y., Mohammed, A.S., Danoff, A.A.: Growing the digital repository of mathematical formulae with generic LATEX sources. In: Kerber, M., Carette, J., Kaliszyk, C., Rabe, F., Sorge, V. (eds.) CICM 2015. LNCS, vol. 9150, pp. 280–287. Springer, Cham (2015). doi:10.1007/978-3-319-20615-8_18

    Chapter  Google Scholar 

  5. Corneli, J., Schubotz, M.: math.wikipedia.org: A vision for a collaborative semi-formal, language independent math(s) encyclopedia. In: Proceedings CAITP (2017)

    Google Scholar 

  6. Gipp, B., et al.: Demonstration of the first citation-based plagiarism detection prototype. In: Proceedings SIGIR, pp. 1119–1120 (2013). doi:10.1145/2484028.2484214

  7. Gipp, B., et al.: Web-based demonstration of semantic similarity detection using citation pattern visualization for a cross language plagiarism case. In: Proceedings of International Conference on Enterprise Information Systems, pp. 677–683 (2014). doi:10.5220/0004985406770683

  8. Gipp, B.: Citation-Based Plagiarism Detection - Detecting Disguised and Cross-Language Plagiarism Using Citation Pattern Analysis. Springer, Wiesbaden (2014)

    Google Scholar 

  9. Gipp, B., Meuschke, N., Breitinger, C.: Detection, citation-based plagiarism: practicability on a large-scale scientific corpus. JASIST 65, 1527–1540 (2014). doi:10.1002/asi.23228

    Google Scholar 

  10. Guidi, F., Sacerdoti Coen, C.: A survey on retrieval of mathematical knowledge. Math. Comput. Sci. 409–427 (2016). doi:10.1007/s11786-016-0274-0

  11. Hashimoto, H., Hijikata, Y., Nishida, S.: Incorporating breadth first search for indexing MathML objects. In: Proceedings SMC, pp. 3519–3523. IEEE (2008). doi:10.1109/ICSMC.2008.4811843

  12. Hagino, H., Saito, H.: Partial-match retrieval with structure-reflected indices at the NTCIR-10 MathTask. In: Proceedings NTCIR-10. National Institute of Informatics (2013)

    Google Scholar 

  13. Jansen, A.R., Marriott, K., Yelland, G.W.: Constituent structure in mathematical expressions. In: CogSci, vol. 22 (2000)

    Google Scholar 

  14. Kamali, S., Tompa, F.W.: Improving mathematics retrieval. In: Proceedings DML, pp. 37–48 (2009)

    Google Scholar 

  15. Kamali, S., Tompa, F.W.: A new mathematics retrieval system. In: Proceedings CIKM, pp. 1413–1416. ACM (2010). doi:10.1145/1871437.1871635

  16. Kamali, S., Tompa, F.W.: Structural similarity search for mathematics retrieval. In: Carette, J., Aspinall, D., Lange, C., Sojka, P., Windsteiger, W. (eds.) CICM 2013. LNCS (LNAI), vol. 7961, pp. 246–262. Springer, Heidelberg (2013). doi:10.1007/978-3-642-39320-4_16

    Chapter  Google Scholar 

  17. Meuschke, N., Gipp, B.: State of the art in detecting academic plagiarism. Int. J. Educ. Integrity 9, 50–71 (2013)

    Google Scholar 

  18. Meuschke, N., Gipp, B.: Reducing computational effort for plagiarism detection by using citation characteristics to limit retrieval space. In: Proceedings JCDL, pp. 197–200 (2014). doi:10.1109/JCDL.2014.6970168

  19. Meuschke, N., Gipp, B., Breitinger, C.: CitePlag: a citation-based plagiarism detection system prototype. In: Proceedings International Plagiarism Conference (2012)

    Google Scholar 

  20. Miller, B.R.: Strategies for parallel markup. In: Kerber, M., Carette, J., Kaliszyk, C., Rabe, F., Sorge, V. (eds.) CICM 2015. LNCS, vol. 9150, pp. 203–210. Springer, Cham (2015). doi:10.1007/978-3-319-20615-8_13

    Chapter  Google Scholar 

  21. Miner, R., Munavalli, R.: An approach to mathematical search through query formulation and data normalization. In: Kauers, M., Kerber, M., Miner, R., Windsteiger, W. (eds.) Calculemus/MKM -2007. LNCS, vol. 4573, pp. 342–355. Springer, Heidelberg (2007). doi:10.1007/978-3-540-73086-6_27

    Chapter  Google Scholar 

  22. Olver, F.W.J., Olde Daalhuis, A.B., Lozier, D.W., Schneider, B.I., Boisvert, R.F., Clark, C.W., Miller, B.R., Saunders, B.V. (eds.) NIST Digital Library of Mathematical Functions. http://dlmf.nist.gov/. Release 1.0.15 of 2017-06-01

  23. Pattaniyil, N., Zanibbi, R.: Combining TF-IDF text retrieval with an inverted index over symbol pairs in math expressions: the tangent math search engine. In: Proceedings NTCIR-11. National Institute of Informatics (2014)

    Google Scholar 

  24. Schubotz, M., et al.: Evaluation of similarity-measure factors for formulae based on the NTCIR-11 math task. In: Proceedings NTCIR-11. National Institute of Informatics (2014)

    Google Scholar 

  25. Schubotz, M., et al.: Semantification of identifiers in mathematics for better math information retrieval. In: Proceedings SIGIR, pp. 135–144. ACM (2016). doi:10.1145/2911451.2911503

  26. Schubotz, M.: Augmenting Mathematical Formulae for More Effective Querying & Effcient Presentation. Epubli Verlag, Berlin (2017, to appear)

    Google Scholar 

  27. Shneiderman, B.: The eyes have it: a task by data type taxonomy for information visualizations. In: Proceedings Visual Languages, pp. 336–343 (1996). doi:10.1109/VL.1996.545307

  28. Sojka, P., Líška, M.: The art of mathematics retrieval. In: Proceedings DocEng, pp. 57–60. ACM (2011). doi:10.1145/2034691.2034703

  29. Schubotz, M., Wicke, G.: Mathoid: robust, scalable, fast and accessible math rendering for wikipedia. In: Watt, S.M., Davenport, J.H., Sexton, A.P., Sojka, P., Urban, J. (eds.) CICM 2014. LNCS, vol. 8543, pp. 224–235. Springer, Cham (2014). doi:10.1007/978-3-319-08434-3_17

    Chapter  Google Scholar 

  30. Shatnawi, M., Youssef, A.: Equivalence detection using parse-tree normalization for math search. In: Proceedings ICDIM, pp. 643–648. IEEE (2007). doi:10.1109/ICDIM.2007.4444297

  31. Yokoi, K., Aizawa, A.: An approach to similarity search for mathematical expressions using MathML. In: Proceedings DML, pp. 27–35. Masaryk University Press, Brno (2009)

    Google Scholar 

  32. Youssef, A., Shatnawi, M.: Math search with equivalence detection using parse-tree normalization. In: Proceedings CoSIT (2006)

    Google Scholar 

  33. Zanibbi, R., Blostein, D.: Recognition and retrieval of mathematical expressions. Proc. IJDAR 331–357 (2012). doi:10.1007/s10032-011-0174-4

  34. Zhang, Q., Youssef, A.: An approach to math-similarity search. In: Watt, S.M., Davenport, J.H., Sexton, A.P., Sojka, P., Urban, J. (eds.) CICM 2014. LNCS (LNAI), vol. 8543, pp. 404–418. Springer, Cham (2014). doi:10.1007/978-3-319-08434-3_29

    Chapter  Google Scholar 

Download references

Acknowledgements

We thank Ludwig Goohsen and Stefan Kaufhold for their support in developing VMEXT. Furthermore, we thank the Wikimedia Foundation for providing a server to run the VMEXT demo.

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Moritz Schubotz , Norman Meuschke , Thomas Hepp , Howard S. Cohl or Bela Gipp .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Schubotz, M., Meuschke, N., Hepp, T., Cohl, H.S., Gipp, B. (2017). VMEXT: A Visualization Tool for Mathematical Expression Trees. In: Geuvers, H., England, M., Hasan, O., Rabe, F., Teschke, O. (eds) Intelligent Computer Mathematics. CICM 2017. Lecture Notes in Computer Science(), vol 10383. Springer, Cham. https://doi.org/10.1007/978-3-319-62075-6_24

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-62075-6_24

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-62074-9

  • Online ISBN: 978-3-319-62075-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics