Abstract
In the context of forensic and criminalistics studies the problem of identifying the author of a manuscript is generally expressed as a supervised-classification problem. In this paper a new approach for modeling a manuscript at the word and text line levels is presented. This new approach introduces an eclectic paradigm between texture-related and structure-related modeling approaches. Compared to previously published works, the proposed method significantly reduces the number and complexity of the text-features to be extracted from the text. Extensive experimentation with the proposed model shows it to be faster and easier to implement than other models, making it ideal for extensive use in forensic and criminalistics studies.
Chapter PDF
Similar content being viewed by others
References
Srihari, S.N., Cha, S.-H., Arora, H., Lee, S.: Individuality of Handwriting. Journal of Forensic Sciences 47(4), Paper ID JFS2001227-474 (2001)
Pecharromán-Balbás, S.: Reconocimiento de escritor independiente de texto basado en características de textura. Tesis doctoral. Escuela Politécnica Superior, Universidad Autónoma de Madrid (2007)
Bensefia, A., Paquet, T., Heutte, L.: A writer identification and verification system. Pattern Recognition Letters 26, 2080–2092 (2005)
Srihari, S.N.: Recognition of handwritten and machine-printed text for postal address interpretation. Pattern Recognition Letters 14(4), 291–302 (1993)
Said, H., Tan, T., Baker, K.: Personal Identification Based on Handwriting. Pattern Recognition 33(1), 149–160 (2000)
Cover, T.M., Hart, P.E.: Nearest neighbour pattern classification. IEEE Trans. Inform. Theory, IT-13(1), 21–27 (1967)
Zois, E., Anastassopoulos, V.: Morphological waveform coding for writer identification. Pattern Recognition 33(3), 385–398 (2000)
Pervouchine, V., Leedham, G.: Extraction and analysis of forensic document examiner features used for writer identification. Pattern Recognition 40, 1004–1013 (2007)
Hertel, C., Bunke, H.: A Set of Novel Features for Writer Identification. In: Proc. Fourth Int’l Conf. Audio and Video-Based Biometric Person Authentication, pp. 679–687 (2003)
Plamondon, R., Srihari, S.N.: On-line and off-line handwriting recognition: A comprehensive survey. IEEE Transactions on Pattern Analysis and Machine Intelligence 22, 63–84 (2000)
Zimmermann, M., Bunke, H.: Automatic segmentation of the IAM off-line handwritten {English} text database. In: 16th International Conf. on Pattern Recognition, Canada, vol. 4, pp. 35–39 (2002)
Srihari, S.N.: Handwriting identification: research to study validity of individuality of handwriting and develop computer-assisted procedures for comparing handwriting. University of Buffalo, U.S.A. Center of Excellence for Document Analysis and Recognition. Tech. Rep. CEDAR-TR-01-1 (2001)
Otsu, N.: A threshold selection method from gray-level histograms. IEEE Trans. Sys., Man., Cyber. 9, 62–66 (1979)
Khashman, A., Sekeroglu, B.: A Novel Thresholding Method for Text Separation and Document Enhancement. In: Proceedings of the 11th Pan-Hellenic Conference in Informatics, Greece, pp. 324–330 (2007)
Herrera-Luna, E., Felipe-Riverón, E., Godoy-Calderón, S.: A supervised algorithm with a new differentiated-weighting scheme for identifying the author of a handwritten text. Pattern Recognition Letters 32(2), 1139–1144 (2011)
Marti, U.V., Messerli, R., Bunke, H.: Writer identification using text line based features. In: Proc. ICDAR 2001, pp. 101–105 (2001)
Louloudis, G., Gatos, B., Pratikakis, I., Halatsis, C.: Text line and word segmentation of handwritten documents. Pattern Recognition 42, 3169–3183 (2009)
Bertolami, R., Bunke, H.: Hidden Markov model-based ensemble methods for offline handwritten text line recognition. Pattern Recognition 41, 3452–3460 (2008)
Vamvakas, G., Gatos, B., Perantonis, S.J.: Handwritten character recognition through two-stage foreground sub-sampling. Pattern Recognition 43, 2807–2816 (2010)
Jou, C., Lee, H.C.: Handwritten numeral recognition based on simplified structural classification and fuzzy memberships. Expert Systems with Applications 36, 11858–11863 (2009)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Godoy-Calderón, S., Felipe-Riverón, E.M., Herrera-Luna, E.C. (2012). Multi-level Modeling of Manuscripts for Authorship Identification with Collective Decision Systems. In: Alvarez, L., Mejail, M., Gomez, L., Jacobo, J. (eds) Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications. CIARP 2012. Lecture Notes in Computer Science, vol 7441. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-33275-3_93
Download citation
DOI: https://doi.org/10.1007/978-3-642-33275-3_93
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-33274-6
Online ISBN: 978-3-642-33275-3
eBook Packages: Computer ScienceComputer Science (R0)