Abstract
Computational stylistics focuses on such description and quantifiable expression of linguistic styles of written documents and their authors that enable their characterisation, comparison, and attribution. Characterisation of a text and its author can yield information about educational experiences, social background, but also about the author gender which can be exploited within the automatic categorisation of texts. This is an example of a classification task with knowledge uncertain and incomplete. Therefore, techniques from the artificial intelligence area are particularly well suited to handle the problem. The paper presents research on application of ANN-based classifier in recognition of the author gender for literary texts, with some considerations on the performance of the classifier when the reduction of characteristic features based on elements of frequency analysis is attempted.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Baayen, H., van Haltern, H., Twedeedie, F.: Outside the cave of shadows: using syntactic annotation to enhance authorship attribution. Literary and Linguistic Computing 11(3), 121â132 (1996)
Berber Sardinha, T.: Using key words in text analysis: practical aspects (1999), ftp://ftp.liv.ac.uk/pub/linguistics
Burrows, J.: Textual analysis. In: Schreibman, S., Siemens, R., Unsworth, J. (eds.) A Companion to Digital Humanities, ch. 23. Blackwell Publishing, Oxford (2004)
Koppel, M., Argamon, S., Shimoni, A.: Automatically categorizing written texts by author gender. Literary and Linguistic Computing 17(4), 401â412 (2002)
Lynam, T., Clarke, C., Cormack, G.: Information extraction with term frequencies. In: Proceedings of the Human Language Technology Conference, pp. 1â4. Association for Computational Linguistics, Stroudsburg (2001)
Munro, R.: A queing-theory model of word frequency distributions. In: Proceedings of the 1st Australasian Language Technology Workshop, pp. 1â8 (2003)
Peng, R., Hengartner, H.: Quantitative analysis of literary styles. The American Statistician 56(3), 15â38 (2002)
Shen, Q.: Rough feature selection for intelligent classifiers. In: Peters, J.F., Skowron, A., Marek, V.W., OrĆowska, E., SĆowiĆski, R., Ziarko, W.P. (eds.) Transactions on Rough Sets VII. LNCS, vol. 4400, pp. 244â255. Springer, Heidelberg (2007)
StaĆczyk, U.: Dominance-based rough set approach employed in search of authorial invariants. In: Kurzynski, M., WoĆșniak, M. (eds.) Computer Recognition Systems 3. Advances in Intelligent and Soft Computing, vol. 57, pp. 293â301. Springer, Heidelberg (2009)
StaĆczyk, U.: Relative reduct-based selection of features for ANN classifier. In: Cyran, K.A., Kozielski, S., Peters, J.F., StaĆczyk, U., Wakulicz-Deja, A. (eds.) Man-Machine Interactions. Advances in Intelligent and Soft Computing, vol. 59, pp. 335â344. Springer, Heidelberg (2009)
Waugh, S., Adams, A., Twedeedie, F.: Computational stylistics using artificial neural networks. Literary and Linguistic Computing 15(2), 187â198 (2000)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
StaĆczyk, U. (2011). Recognition of Author Gender for Literary Texts. In: CzachĂłrski, T., Kozielski, S., StaĆczyk, U. (eds) Man-Machine Interactions 2. Advances in Intelligent and Soft Computing, vol 103. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23169-8_25
Download citation
DOI: https://doi.org/10.1007/978-3-642-23169-8_25
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-23168-1
Online ISBN: 978-3-642-23169-8
eBook Packages: EngineeringEngineering (R0)