Mining the Temporal Structure of Thought from Text

Mei, Mei; Ren, Zhaowei; Minai, Ali A.

doi:10.1007/978-3-319-96661-8_31

Mei Mei⁶,
Zhaowei Ren⁶ &
Ali A. Minai⁶

Part of the book series: Springer Proceedings in Complexity ((SPCOM))

Included in the following conference series:

International Conference on Complex Systems

2804 Accesses
2 Citations
2 Altmetric

Abstract

Thinking is a self-organized dynamical process and, as such, interesting to characterize. However, direct, real-time access to thought at the semantic level is still very limited. The best that can be done is to look at spoken or written expression. The question we address in this research is the following: Is there a characteristic pitch of thought? To begin answering this complex question, we look at text documents from several large corpora at the sentence level – i.e., using sentences as the units of meaning – and considering each document to be the result of a random process in semantic space. Given a large corpus of multi-sentence documents, we build a lexical association network representing associations between words in the corpus. This network is used to induce a semantic similarity metric between sentences, and each document is segmented into multi-sentence semantically coherent blocks (SCBs) with occasional connecting text between the blocks. Based on this segmentation, the process of document generation is modeled as a sticky Markov chain at the sentence level. We show that most documents across all the corpora are sequences of blocks with a very consistent mean length of 6.4 sentences across the corpora. This consistency suggests that a value of 6-7 sentences may be the typical mean length for single coherent thoughts in texts. We have also described several ways of visualizing the semantic structure of documents in space and time.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Hardcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Aggarwal, C.C., Zhao, P.: Towards graphical models for text processing. Knowl. Inform. Syst. 36(1), 1–21 (2013). https://doi.org/10.1007/s10115-012-0552-3
Article Google Scholar
Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent Dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003). https://doi.org/10.1162/jmlr.2003.3.4-5.993
Article MATH Google Scholar
Canolty, R.T., Soltani, M., Dalal, S.S., Edwards, E., Dronkers, N.F., Nagarajan, S.S., Kirsch, H.E., Barbaro, N.M., Knight, R.T.: Spatiotemporal dynamics of word processing in the human brain. Front. Neurosci. 1(1), 185–196 (2007). https://doi.org/10.3389/neuro.01.1.1.014.2007
Article Google Scholar
Friedenberg, J., Silverman, G.: Introduction: exploring inner space. In: Brace-Thompson, J., Crouppen, M.B., Robinson, S. (eds.) Cognitive Science An Introduction to the Study of Mind, chapter 1, pp. 2–3. Sage Publications, Inc., Thousand Oaks (2006)
Google Scholar
Hinton, G.E., Roweis, S.T.: Stochastic neighbor embedding. In: Advances in neural information processing systems, pp. 833–840 (2002)
Google Scholar
Hogan, J.P.: Mind Matters: Exploring the World of Artificial Intelligence, 1st edn. Ballantine Publication Group, New York (1998)
Google Scholar
Lamprier, S., Amghar, T., Levrat, B., Saubion, F.: SegGen: A genetic algorithm for linear text segmentation. In: Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI), pp. 1647–1652 (2007)
Google Scholar
Mei, M., Vanarase, A., Minai, A.A.: Chunks of thought: finding salient semantic structures in texts. In: Proceedings of IJCNN 2014 (2014)
Google Scholar
Misra, H., Yvon, F., Cappé, O., Jose, J.: Text segmentation: a topic modeling perspective. Inform. Process. Manag. 47(4), 528–544 (2011). https://doi.org/10.1016/j.ipm.2010.11.008
Article Google Scholar
Morewedge, C.K., Giblin, C.E., Norton, M.I.: The (perceived) meaning of spontaneous thoughts. J. Exp. Psychol. Gen. 143(4), 1742–1754 (2014). https://doi.org/10.1037/a0036775
Article Google Scholar
Riedl, M., Biemann, C.: Text segmentation with topic models. J. Lang. Technol. Comput. Linguist. 27(1), 47–69 (2012)
Google Scholar
Rousseeuw, P.J.: Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. Comput. Appl. Math. 20, 53–65 (1987)
Article Google Scholar
Shen, G., Horikawa, T., Majima, K., Kamitani, Y.: Deep image reconstruction from human brain activity. bioRxiv (2017). 10.1101/240317
Google Scholar
Turian, J., Ratinov, L., Bengio, Y.: Word representations: a simple and general method for semi-supervised learning. In: Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, July, pp. 384–394 (2010)
Google Scholar
Wang, J., Cherkassky, V.L., Just, M.A.: Predicting the brain activation pattern associated with the propositional content of a sentence: Modeling neural representations of events and states. Hum. Brain Mapp. 38, 4865–4881 (2017). https://doi.org/10.1002/hbm.23692
Article Google Scholar

Download references

Acknowledgement

This work was supported in part by National Science Foundation INSPIRE grant BCS-1247971 to Ali Minai.

Author information

Authors and Affiliations

Department of Electrical Engineering and Computer Science, University of Cincinnati, Cincinnati, OH, 45221-0030, USA
Mei Mei, Zhaowei Ren & Ali A. Minai

Authors

Mei Mei
View author publications
You can also search for this author in PubMed Google Scholar
Zhaowei Ren
View author publications
You can also search for this author in PubMed Google Scholar
Ali A. Minai
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Mei Mei .

Editor information

Editors and Affiliations

New England Complex Systems Institute, Cambridge, MA, USA
Alfredo J. Morales
Instituto de Investigaciones en Matemáticas Aplicadas y en Sistemas, Universidad Nacional Autónoma de México, Mexico, Distrito Federal, Mexico
Carlos Gershenson
New England Complex Systems Institute and University of Massachusetts, Cambridge, MA, USA
Dan Braha
Department of Electrical Engineering and Computer Science, University of Cincinnati, Cincinnati, OH, USA
Ali A. Minai
New England Complex Systems Institute, Cambridge, MA, USA
Yaneer Bar-Yam

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Mei, M., Ren, Z., Minai, A.A. (2018). Mining the Temporal Structure of Thought from Text. In: Morales, A., Gershenson, C., Braha, D., Minai, A., Bar-Yam, Y. (eds) Unifying Themes in Complex Systems IX. ICCS 2018. Springer Proceedings in Complexity. Springer, Cham. https://doi.org/10.1007/978-3-319-96661-8_31

Download citation

DOI: https://doi.org/10.1007/978-3-319-96661-8_31
Published: 24 July 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-96660-1
Online ISBN: 978-3-319-96661-8
eBook Packages: Physics and AstronomyPhysics and Astronomy (R0)

Publish with us

Policies and ethics