Abstract
This article describes the study results in the development of the method of natural language processing (NLP) of semi-structured Russian court decisions to improve the quality of knowledge extraction describing legal process. Improving the accuracy of information retrieval from electronic records of court decisions was achieved with using combination of TF-IDF and latent semantic analysis. As a result, the word combinations of facts of offenses and procedural facts that may affect the decision-making of the court are identified. The applicability of the results is shown on the example of development a decision tree ML model of the appointment of arrest or fine punishment if disobeying a police officer. Automated mapping of court decisions texts on Russian language is also possible use for the development of artificial intelligence systems and new generation decision support systems in law domain.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Friedman, C., Elhadad, N.: Natural language processing in health care and biomedicine. In: Shortliffe, E.H., Cimino, J.J. (eds.) Biomedical Informatics, pp. 255–284. Springer, London (2014). https://doi.org/10.1007/978-1-4471-4474-8_8
Doan, S., Conway, M., Phuong, T.M., Ohno-Machado, L.: Natural language processing in biomedicine: a unified system architecture overview. In: Trent, R. (ed.) Clinical Bioinformatics. Methods in Molecular Biology (Methods and Protocols), vol. 1168. Humana Press, New York (2014). https://doi.org/10.1007/978-1-4939-0847-9_16
Friedman, C., Rindflesch, T.C., Corn, M.: Natural language processing: state of the art and prospects for significant progress, a workshop sponsored by the National Library of Medicine. J. Biomed. Inform. 46(5), 765–773 (2013)
Metsker, O., Trofimov, E., Sikorsky, S., Kovalchuk, S.: Text and data mining techniques in judgment open data analysis for administrative practice control. In: Chugunov, A., Misnikov, Y., Roshchin, E., Trutnev, D. (eds.) EGOSE 2018. CCIS, vol. 947, pp. 169–180. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-13283-5_13
Kreimeyer, K., et al.: Natural language processing systems for capturing and standardizing unstructured clinical information: a systematic review. J. Biomed. Inform. 73, 14–29 (2017)
Backus, J.W.: The syntax and semantics of the proposed international algebraic language of the Zurich ACM-GAMM conference. In: Proceedings of the International Conference on Information Processing, New York, pp. 125–131 (1959)
Aletras, N., et al.: Predicting judicial decisions of the European court of human rights: a natural language processing perspective. PeerJ Comput. Sci. 2, e93 (2016). peerj.com
Stephan, W.: Linguistic description and automatic extraction of definitions from German court decisions. In: LREC (2008)
Aizawa, A.: An information-theoretic perspective of tf–idf measures. Inf. Process. Manage. 39(1), 45–65 (2003)
Dey, A., Jenamani, M., Thakkar, J.J.: Lexical TF-IDF: an n-gram feature space for cross-domain classification of sentiment reviews. In: Shankar, B.U., Ghosh, K., Mandal, D.P., Ray, S.S., Zhang, D., Pal, S.K. (eds.) PReMI 2017. LNCS, vol. 10597, pp. 380–386. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-69900-4_48
Sidorov, G., et al.: Syntactic N-grams as machine learning features for natural language processing. Expert Syst. Appl. 41(3), 853–860 (2014)
Golub, G.H., Reinsch, C.: Singular value decomposition and least squares solutions. In: Bauer, F.L. (ed.) Linear Algebra. Handbook for Automatic Computation, vol. 2. Springer, Heidelberg (1971). https://doi.org/10.1007/978-3-662-39778-7_10
Shlens J.: A tutorial on principal component analysis: derivation, discussion and singular value decomposition. In: Derivation, Discussion and Singular Value Decomposition (2003)
Yalcinkaya, M., Singh, V.: Patterns and trends in building information modeling (BIM) research: a latent semantic analysis. Autom. Constr. 59, 68–80 (2015)
Acknowledgements
This research is financially supported by The Russian Science Foundation, Agreement #19-11-00326.
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Metsker, O., Trofimov, E., Grechishcheva, S. (2020). Natural Language Processing of Russian Court Decisions for Digital Indicators Mapping for Oversight Process Control Efficiency: Disobeying a Police Officer Case. In: Chugunov, A., Khodachek, I., Misnikov, Y., Trutnev, D. (eds) Electronic Governance and Open Society: Challenges in Eurasia. EGOSE 2019. Communications in Computer and Information Science, vol 1135. Springer, Cham. https://doi.org/10.1007/978-3-030-39296-3_22
Download citation
DOI: https://doi.org/10.1007/978-3-030-39296-3_22
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-39295-6
Online ISBN: 978-3-030-39296-3
eBook Packages: Computer ScienceComputer Science (R0)