Abstract
A method is proposed for rapid estimation of the degree of similarity between two pages of text using recognition matrices, intended for an autonomous robot with limited computing resources. The method is based on a geometric approach that computes hashes of words and takes into account their location on the page. An algorithm is proposed for generating recognition matrices of word hashes and their coordinates on the page. A method and algorithm for comparing the matrices in a single pass, based on the sweeping curve method, are considered. The contribution of quantitative and qualitative factors to the resulting degree of similarity is assessed.
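The abstract gives no implementation details, so the following is only a minimal sketch of the general idea under stated assumptions. It assumes the sdbm string hash for words, uses (line index, word index) as page coordinates, and substitutes a plain sorted-merge pass for the sweeping curve comparison. The names `sdbm_hash`, `recognition_matrix`, `similarity`, and `max_shift` are hypothetical, as is the scoring rule in which a matching hash counts fully when the positions are close (read here as the qualitative factor) and half otherwise (the quantitative factor alone).

```python
from typing import List, Tuple

# One matrix row: (word hash, line index, word index within the line).
Entry = Tuple[int, int, int]


def sdbm_hash(word: str) -> int:
    """sdbm string hash over UTF-8 bytes, truncated to 32 bits."""
    h = 0
    for byte in word.encode("utf-8"):
        h = (byte + (h << 6) + (h << 16) - h) & 0xFFFFFFFF
    return h


def recognition_matrix(page: str) -> List[Entry]:
    """Assumed layout: one row per word, sorted by hash, then position."""
    entries: List[Entry] = []
    for row, line in enumerate(page.splitlines()):
        for col, word in enumerate(line.lower().split()):
            entries.append((sdbm_hash(word), row, col))
    entries.sort()
    return entries


def similarity(a: List[Entry], b: List[Entry], max_shift: int = 2) -> float:
    """Single merge pass over two sorted matrices; returns a score in [0, 1].

    A matching hash adds 1.0 if the two positions differ by at most
    max_shift (Manhattan distance), and 0.5 otherwise. This weighting is
    an illustrative assumption, not the authors' formula.
    """
    i = j = 0
    score = 0.0
    while i < len(a) and j < len(b):
        ha, ra, ca = a[i]
        hb, rb, cb = b[j]
        if ha < hb:
            i += 1
        elif ha > hb:
            j += 1
        else:
            near = abs(ra - rb) + abs(ca - cb) <= max_shift
            score += 1.0 if near else 0.5
            i += 1
            j += 1
    longest = max(len(a), len(b))
    return score / longest if longest else 1.0


if __name__ == "__main__":
    page_a = recognition_matrix("the quick brown fox\njumps over the lazy dog")
    page_b = recognition_matrix("the quick brown fox\nleaps over the lazy dog")
    print(similarity(page_a, page_b))
```

Sorting each matrix once lets the comparison run in a single linear pass with constant extra memory, which matches the resource constraint the abstract emphasizes, although the actual sweeping curve procedure may differ.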
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Novikov, G.G., Yadykin, I.M. (2020). Recognition Matrix for Comparing Pages of Text by a Robot. In: Misyurin, S., Arakelian, V., Avetisyan, A. (eds) Advanced Technologies in Robotics and Intelligent Systems. Mechanisms and Machine Science, vol 80. Springer, Cham. https://doi.org/10.1007/978-3-030-33491-8_42
DOI: https://doi.org/10.1007/978-3-030-33491-8_42
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-33490-1
Online ISBN: 978-3-030-33491-8
eBook Packages: Intelligent Technologies and Robotics (R0)