Author Attribution Using Network Motifs

Al Rozz, Younis; Menezes, Ronaldo

doi:10.1007/978-3-319-73198-8_17

Younis Al Rozz⁶ &
Ronaldo Menezes⁶

Part of the book series: Springer Proceedings in Complexity ((SPCOM))

Included in the following conference series:

International Workshop on Complex Networks

864 Accesses
5 Citations

Abstract

The problem of recognizing the author of unknown text has concerned linguistics and scientists for a long period of time. The authorship of the famous Federalist Papers remained unknown until Mosteller and Wallace solved the mystery in 1964 using the frequency of functional words. After that, many statistical and computational studies were published in the fields of authorship attribution and stylistic analysis. Complex networks, gaining much popularity in recent years, may have a role to play in this field. Furthermore, several studies show that network motifs, defined as statistically significant subgraphs within a network, have the ability to distinguish networks from distinctive disciplines. In this paper, we succeed in the utilization of network motifs to distinguish the writing style of 10 famous authors. Using statistical learning algorithms, we achieved an accuracy of 77% in classifying 100 books written by ten different authors, which outperformed the results from other works. We believe that our method proved the importance of network motifs in author attribution.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Hardcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

References

Akimushkin, C., Amancio, D.R., Oliveira Jr., O.N.: Text authorship identified using the dynamics of word co-occurrence networks. PloS one 12(1), e0170527 (2017)
Article Google Scholar
Al Rozz, Y., Hamoodat, H., Menezes, R.: Characterization of written languages using structural features from common corpora. In: Workshop on Complex Networks CompleNet, pp. 161–173. Springer, Berlin (2017)
Google Scholar
Amancio, D.R.: A complex network approach to stylometry. PloS one 10(8), e0136076 (2015)
Article Google Scholar
Arefin, A.S., Vimieiro, R., Riveros, C., Craig, H., Moscato, P.: An information theoretic clustering approach for unveiling authorship affinities in Shakespearean era plays and poems. PloS one 9(10), e111445 (2014)
Article ADS Google Scholar
Biber, D.: Variation Across Speech and Writing. Cambridge University Press, Cambridge (1991)
Google Scholar
Biemann, C., Krumov, L., Roos, S., Weihe, K.: Network motifs are a powerful tool for semantic distinction. Towards a Theoretical Framework for Analyzing Complex Linguistic Networks, pp. 83–105. Springer, Berlin (2016)
Google Scholar
Cabatbat, J.J.T., Monsanto, J.P., Tapang, G.A.: Preserved network metrics across translated texts. Int. J. Mod. Phys. C 25(02), 1350092 (2014)
Article ADS Google Scholar
Chen, X., Hao, P., Chandramouli, R., Subbalakshmi, K.P.: Authorship similarity detection from email messages. In: International Workshop on Machine Learning and Data Mining in Pattern Recognition, pp. 375–386. Springer, Berlin (2011)
Google Scholar
Li, J., Xiao, F., Zhou, J., Yang, Z.: Motifs and motif generalization in Chinese word networks. Procedia Comput. Sci. 9, 550–556 (2012)
Article Google Scholar
Marinho, V.Q., Hirst, G., Amancio, D.R.: Authorship attribution via network motifs identification. In: 2016 5th Brazilian Conference on Intelligent Systems (BRACIS), pp. 355–360. IEEE (2016)
Google Scholar
Milo, R., Shen-Orr, S., Itzkovitz, S., Kashtan, N., Chklovskii, D., Alon, U.: Network motifs: simple building blocks of complex networks. Science 298(5594), 824–827 (2002)
Article ADS Google Scholar
Milo, R., Itzkovitz, S., Kashtan, N., Levitt, R., Shen-Orr, S., Ayzenshtat, I., Sheffer, M., Alon, U.: Superfamilies of evolved and designed networks. Science 303(5663), 1538–1542 (2004)
Article ADS Google Scholar
Mosteller, F., Wallace, D.L.: Inference in an authorship problem: a comparative study of discrimination methods applied to the authorship of the disputed federalist papers. J. Am. Stat. Assoc. 58(302), 275–309 (1963)
MATH Google Scholar
Nunberg, G.: The Linguistics of Punctuation. CSLI Lecture Notes. Cambridge University Press, Cambridge (1990)
Google Scholar
Rizvić, H., Martinčić-Ipšić, S., Meštrović, A.: Network motifs analysis of croatian literature. arXiv:1411.4960 (2014)
Rocha, A., Scheirer, W.J., Forstall, C.W., Cavalcante, T., Theophilo, A., Shen, B., Carvalho, A.R.B., Stamatatos, E.: Authorship attribution for social media forensics. IEEE Trans. Inf. Forensic Secur. 12(1), 5–33 (2017)
Article Google Scholar
Segarra, S., Eisen, M., Ribeiro, A.: Authorship attribution through function word adjacency networks. IEEE Trans. Sig. Process. 63(20), 5464–5478 (2015)
Article ADS MathSciNet Google Scholar
Stamatatos, E.: A survey of modern authorship attribution methods. J. Assoc. Inf. Sci. Technol. 60(3), 538–556 (2009)
Article Google Scholar
Tran, N.T.L., DeLuccia, L., McDonald, A.F., Huang, C.-H.: Cross-disciplinary detection and analysis of network motifs. Bioinform. Biol. Insights 9, 49 (2015)
Article Google Scholar

Download references

Author information

Authors and Affiliations

BioComplex Laboratory, Computer Science, Florida Institute of Technology, Melbourne, FL, USA
Younis Al Rozz & Ronaldo Menezes

Authors

Younis Al Rozz
View author publications
You can also search for this author in PubMed Google Scholar
Ronaldo Menezes
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Younis Al Rozz .

Editor information

Editors and Affiliations

Center for Complex Network Research, Northeastern University, Boston, Massachusetts, USA
Sean Cornelius
Network Science Institute, Northeastern University, Boston, Massachusetts, USA
Kate Coronges
Center for Data Science, New York University, New York, New York, USA
Bruno Gonçalves
Center for Network Science, and Department of Mathematics and its Applications, Central European University, Budapest, Hungary
Roberta Sinatra
Bouvé College of Health Sciences, Northeastern University, Boston, Massachusetts, USA
Alessandro Vespignani

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Al Rozz, Y., Menezes, R. (2018). Author Attribution Using Network Motifs. In: Cornelius, S., Coronges, K., Gonçalves, B., Sinatra, R., Vespignani, A. (eds) Complex Networks IX. CompleNet 2018. Springer Proceedings in Complexity. Springer, Cham. https://doi.org/10.1007/978-3-319-73198-8_17

Download citation

DOI: https://doi.org/10.1007/978-3-319-73198-8_17
Published: 15 February 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-73197-1
Online ISBN: 978-3-319-73198-8
eBook Packages: Physics and AstronomyPhysics and Astronomy (R0)

Publish with us

Policies and ethics