Abstract
In recent years, people can easily get various data and information through Internet. People can copy the entire downloaded data, digitized information into own paper work, and form some plagiarism problems. Previous studies use of statistics, vectors matrices to compare string among documents. When someone change the location of words, and add some superfluous words or sentences between strings, it will be greatly reduced the accurate rate of matching system; Moreover, it may cause students keep plagiarism if matching system cannot find the alignments correctly. This study uses Chinese Word Segmentation and Database Set Operation as a base to construct a string matching system to solve the excessive superfluous words and order problems. Database Set Operation may be more efficient than the program with lots of words inside its memory. This study creates a prototype system, and the result of the prototype shows that the accuracy performance is performed well.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Lee, J.T.: Development of XML-Based Geo-Spatial Information Distri-bution System. Journal of Cartography 16, 191–204 (2006)
Maarouf, M.Y., Chung, S.M.: XML Integrated Environment for Service-Oriented Data Management. In: IEEE International Conference on Tools with Artificial Intelligence, vol. 2, pp. 361–368 (2008)
Zhang, J., Lang, B., Duan, Y.: An XML Data Placement Strategy for Dis-tributed XML Storage and Parallel Query. In: 2011 12th International Conference on Parallel and Distributed Computing, Applications and Technologies, pp. 433–439 (2011)
Lin, C.X., Chen, Z.J., Ling, C.C.: Combined with a long term priority sequence labeled with Chinese word segmentation research. Information Security Communications 15(3-4), 161–179 (2010)
Wu, D., Zhou, X., Zhang, H.: The Pattern Matching Algorithms Formalized Analyze in Chinese Strings. Intelligent Information Technology Application 1, 403–407 (2008)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ying, MH., Lin, CY. (2014). A Study of String Matching System Based on Database Set Operation. In: Park, J., Chen, SC., Gil, JM., Yen, N. (eds) Multimedia and Ubiquitous Engineering. Lecture Notes in Electrical Engineering, vol 308. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-54900-7_43
Download citation
DOI: https://doi.org/10.1007/978-3-642-54900-7_43
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-54899-4
Online ISBN: 978-3-642-54900-7
eBook Packages: EngineeringEngineering (R0)