Skip to main content

Natural Language Processing of Russian Court Decisions for Digital Indicators Mapping for Oversight Process Control Efficiency: Disobeying a Police Officer Case

  • Conference paper
  • First Online:
Electronic Governance and Open Society: Challenges in Eurasia (EGOSE 2019)

Abstract

This article describes the study results in the development of the method of natural language processing (NLP) of semi-structured Russian court decisions to improve the quality of knowledge extraction describing legal process. Improving the accuracy of information retrieval from electronic records of court decisions was achieved with using combination of TF-IDF and latent semantic analysis. As a result, the word combinations of facts of offenses and procedural facts that may affect the decision-making of the court are identified. The applicability of the results is shown on the example of development a decision tree ML model of the appointment of arrest or fine punishment if disobeying a police officer. Automated mapping of court decisions texts on Russian language is also possible use for the development of artificial intelligence systems and new generation decision support systems in law domain.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Friedman, C., Elhadad, N.: Natural language processing in health care and biomedicine. In: Shortliffe, E.H., Cimino, J.J. (eds.) Biomedical Informatics, pp. 255–284. Springer, London (2014). https://doi.org/10.1007/978-1-4471-4474-8_8

    Chapter  Google Scholar 

  2. Doan, S., Conway, M., Phuong, T.M., Ohno-Machado, L.: Natural language processing in biomedicine: a unified system architecture overview. In: Trent, R. (ed.) Clinical Bioinformatics. Methods in Molecular Biology (Methods and Protocols), vol. 1168. Humana Press, New York (2014). https://doi.org/10.1007/978-1-4939-0847-9_16

    Chapter  Google Scholar 

  3. Friedman, C., Rindflesch, T.C., Corn, M.: Natural language processing: state of the art and prospects for significant progress, a workshop sponsored by the National Library of Medicine. J. Biomed. Inform. 46(5), 765–773 (2013)

    Article  Google Scholar 

  4. Metsker, O., Trofimov, E., Sikorsky, S., Kovalchuk, S.: Text and data mining techniques in judgment open data analysis for administrative practice control. In: Chugunov, A., Misnikov, Y., Roshchin, E., Trutnev, D. (eds.) EGOSE 2018. CCIS, vol. 947, pp. 169–180. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-13283-5_13

    Chapter  Google Scholar 

  5. Kreimeyer, K., et al.: Natural language processing systems for capturing and standardizing unstructured clinical information: a systematic review. J. Biomed. Inform. 73, 14–29 (2017)

    Article  Google Scholar 

  6. Backus, J.W.: The syntax and semantics of the proposed international algebraic language of the Zurich ACM-GAMM conference. In: Proceedings of the International Conference on Information Processing, New York, pp. 125–131 (1959)

    Google Scholar 

  7. Aletras, N., et al.: Predicting judicial decisions of the European court of human rights: a natural language processing perspective. PeerJ Comput. Sci. 2, e93 (2016). peerj.com

    Article  Google Scholar 

  8. Stephan, W.: Linguistic description and automatic extraction of definitions from German court decisions. In: LREC (2008)

    Google Scholar 

  9. Aizawa, A.: An information-theoretic perspective of tf–idf measures. Inf. Process. Manage. 39(1), 45–65 (2003)

    Article  Google Scholar 

  10. Dey, A., Jenamani, M., Thakkar, J.J.: Lexical TF-IDF: an n-gram feature space for cross-domain classification of sentiment reviews. In: Shankar, B.U., Ghosh, K., Mandal, D.P., Ray, S.S., Zhang, D., Pal, S.K. (eds.) PReMI 2017. LNCS, vol. 10597, pp. 380–386. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-69900-4_48

    Chapter  Google Scholar 

  11. Sidorov, G., et al.: Syntactic N-grams as machine learning features for natural language processing. Expert Syst. Appl. 41(3), 853–860 (2014)

    Article  MathSciNet  Google Scholar 

  12. Golub, G.H., Reinsch, C.: Singular value decomposition and least squares solutions. In: Bauer, F.L. (ed.) Linear Algebra. Handbook for Automatic Computation, vol. 2. Springer, Heidelberg (1971). https://doi.org/10.1007/978-3-662-39778-7_10

    Chapter  Google Scholar 

  13. Shlens J.: A tutorial on principal component analysis: derivation, discussion and singular value decomposition. In: Derivation, Discussion and Singular Value Decomposition (2003)

    Google Scholar 

  14. Yalcinkaya, M., Singh, V.: Patterns and trends in building information modeling (BIM) research: a latent semantic analysis. Autom. Constr. 59, 68–80 (2015)

    Article  Google Scholar 

Download references

Acknowledgements

This research is financially supported by The Russian Science Foundation, Agreement #19-11-00326.

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Egor Trofimov or Sofia Grechishcheva .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Metsker, O., Trofimov, E., Grechishcheva, S. (2020). Natural Language Processing of Russian Court Decisions for Digital Indicators Mapping for Oversight Process Control Efficiency: Disobeying a Police Officer Case. In: Chugunov, A., Khodachek, I., Misnikov, Y., Trutnev, D. (eds) Electronic Governance and Open Society: Challenges in Eurasia. EGOSE 2019. Communications in Computer and Information Science, vol 1135. Springer, Cham. https://doi.org/10.1007/978-3-030-39296-3_22

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-39296-3_22

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-39295-6

  • Online ISBN: 978-3-030-39296-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics