Explorations into the Use of Word Embedding in Math Search and Math Semantics

  • Conference paper
Intelligent Computer Mathematics (CICM 2019)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 11617)

Abstract

Word embedding, which represents individual words with semantically rich numerical vectors, has made it possible to apply deep learning successfully to NLP tasks such as semantic role modeling, question answering, and machine translation. Since math text consists of natural text as well as math expressions that similarly exhibit linear correlation and contextual characteristics, word embedding can be applied to math documents as well. On the other hand, math terms also exhibit characteristics (e.g., abstractions) that differ from those of textual words. Accordingly, it is worthwhile to explore the use and effectiveness of word embedding in math language processing and mathematical knowledge management (MKM).
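To make the notion of word embedding concrete in a math-text setting, the following is a minimal sketch, not the authors' setup: it trains a gensim skip-gram word2vec model on tokenized math text and probes it for term similarity and analogy. The tiny corpus and the hyperparameters are toy placeholders standing in for a real tokenized corpus such as the DLMF.

```python
# Minimal, illustrative sketch: train word embeddings on (toy) tokenized math text
# and query them for (1) term similarity and (2) analogy. Not the authors' code.
from gensim.models import Word2Vec

corpus = [  # toy placeholder for a tokenized math corpus (e.g., DLMF text)
    ["bessel", "function", "satisfies", "a", "second", "order", "differential", "equation"],
    ["the", "gamma", "function", "generalizes", "the", "factorial"],
    ["airy", "function", "satisfies", "a", "second", "order", "differential", "equation"],
    ["asymptotic", "expansion", "of", "the", "gamma", "function"],
]

# Skip-gram model; hyperparameters are illustrative, not those used in the paper.
model = Word2Vec(sentences=corpus, vector_size=50, window=5, min_count=1, sg=1, epochs=200)

# (1) math-term similarity: nearest neighbors of a term in embedding space.
print(model.wv.most_similar("bessel", topn=3))

# (2) analogy: "bessel is to airy as gamma is to ?" via vector arithmetic (b - a + c).
print(model.wv.most_similar(positive=["airy", "gamma"], negative=["bessel"], topn=1))
```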

In this paper, we present exploratory investigations of math embedding by testing it on some basic tasks: (1) math-term similarity, (2) analogy, (3) basic numerical concept modeling using a novel approach based on computing the (weighted) centroid of the keywords that characterize a concept, and (4) math search, especially query expansion by computing the weighted centroid of the query keywords and then expanding the query with new keywords that are most similar to that centroid. Due to the lack of benchmarks, our investigations were carried out using carefully selected illustrations from the DLMF (the NIST Digital Library of Mathematical Functions). We draw from these investigations some general observations and lessons that chart a trajectory for future, statistically significant testing on large benchmarks. Our preliminary results show that math embedding holds much promise, but they also point to the need for more robust embeddings.
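The weighted-centroid idea behind tasks (3) and (4) can be illustrated with a short sketch. This is not the authors' implementation; the embedding table, weights, and vocabulary below are hypothetical placeholders standing in for embeddings trained on the DLMF.

```python
# Sketch of weighted-centroid concept modeling and centroid-based query expansion.
import numpy as np

def weighted_centroid(keywords, weights, embeddings):
    """Weighted mean of the keyword vectors (terms missing from the vocabulary are skipped)."""
    vecs, ws = [], []
    for kw, w in zip(keywords, weights):
        if kw in embeddings:
            vecs.append(embeddings[kw])
            ws.append(w)
    return np.average(np.array(vecs), axis=0, weights=ws)

def expand_query(keywords, weights, embeddings, topn=5):
    """Expand the query with the vocabulary terms most cosine-similar to the centroid."""
    c = weighted_centroid(keywords, weights, embeddings)
    c = c / np.linalg.norm(c)
    scored = []
    for term, vec in embeddings.items():
        if term in keywords:
            continue
        sim = float(np.dot(c, vec / np.linalg.norm(vec)))
        scored.append((sim, term))
    expansion = [t for _, t in sorted(scored, reverse=True)[:topn]]
    return list(keywords) + expansion

# Toy usage with random vectors standing in for trained math-text embeddings.
rng = np.random.default_rng(0)
vocab = ["bessel", "gamma", "asymptotic", "expansion", "zeros", "integral"]
emb = {w: rng.normal(size=50) for w in vocab}
print(expand_query(["bessel", "zeros"], [1.0, 0.5], emb, topn=2))
```

With real embeddings, the expansion terms would be math terms that co-occur with, or are semantically close to, the original query keywords.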

Author information

Correspondence to Abdou Youssef.

Copyright information

© 2019 This is a U.S. government work and not under copyright protection in the United States; foreign copyright protection may apply

About this paper

Cite this paper

Youssef, A., Miller, B.R. (2019). Explorations into the Use of Word Embedding in Math Search and Math Semantics. In: Kaliszyk, C., Brady, E., Kohlhase, A., Sacerdoti Coen, C. (eds) Intelligent Computer Mathematics. CICM 2019. Lecture Notes in Computer Science, vol 11617. Springer, Cham. https://doi.org/10.1007/978-3-030-23250-4_20

  • DOI: https://doi.org/10.1007/978-3-030-23250-4_20

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-23249-8

  • Online ISBN: 978-3-030-23250-4

  • eBook Packages: Computer Science (R0)
