Korean spelling error correction using a Hangul similarity algorithm
Increasingly people use computers for word processing. This helps reduce word processing time and fatigue of hands, but may increase the possibility of occurrence of spelling errors. Although spelling errors are generally easy to find and correct, it is hard to make a document totally free of spelling errors partly due to lack of knowledge of users or presence of spelling errors which are difficult to notice. Since there is no set of online word processing rules and manners in place and problems of spelling errors are not often raised, spelling errors in important documents may lead to decrease in reliability. Even experts cannot correct spelling errors perfectly, so there is a need for research to come up with spelling correction methods for the general public. This study aims to correct spelling errors using Korean alphabet similarity algorithm. To this end, words most similar to misspelled words found in a corpus containing spelling errors collected by previous research were identified to correct spelling errors by measuring frequency of simultaneous appearance with adjacent words.
KeywordsWord Processing Edit Distance Cosine Similarity Similarity Algorithm Corrected Word
Unable to display preview. Download preview PDF.
- 1.최철, 박세진, 김철중, 권규식, “Analysis of Uncorrected Typing Rate of Keyboard Design Ergonomic Keyboard Based on Qwerty Keyboard” EEromonomics Society of Korea, vol. 2000-1 no.-, pp.142-145Google Scholar
- 2.Hyunsoo Choi, Hyukchul Kwon, Aesun Yoon. “Improving Recall for Context-Sensitive Spelling Correction Rules using Conditional Probability Model with Dynamic Window Sizes” Journal of KIISE, vol.42 no.5, 2015, pp.629-636Google Scholar
- 3.Jingzhi Jin, Sungki Chio, Hyuk-chul Kwon. “Adaptive Context-Sensitive Spelling Error Correction Techniques for The Extremely Unpredictable Error Generating Language Environments”, Korea InformatioGoogle Scholar
- 4.Hyunsoo Choi, Aesun Yoon, Hyuk-Chul Kwom, “Improving Recall for Context-Sensitive Spelling Correction Rules Using Integrated Method”, Korea Infomation Science Society, vol.2014 no.6, 215, pp.577-579Google Scholar
- 5.Minho Kim, Hyuk-Chul Kwon, Sungki Choi. “Context-sensitive Spelling Error Correction using Eojeol N-gram”, Journal of KIISE vol.41 no.12, 2014. pp.1081-1089Google Scholar
- 6.Minho Kim, Jingzhi Jin, Hyuk-Chul Kwon, “Statistical Context-sensitive Spelling Correction using Confusion Set”, Korea Infomaton Science Society, vol.2013 no.6, 2013, pp.607-609Google Scholar
- 7.SeungHyeon Bak, JunSeok Cha, TaekEun Hong, JuHyun Shin, PanKoo Kim, “Korean Spelling Error Detection Method for Research”, Spring Conference of KISM 2016, vol.5 no.2, 2016Google Scholar
- 8.Kangho Roh, Jin Wook Kim, Eunsang Kim, Kunsoo Park, Hwan-Gue Cho. “Edit Distance Problem for the Korean Alphabet”, Journal of KIISE: Computer Systems and Theory, vol.31 no.2, 2010. pp.103-109Google Scholar
- 9.Hankyu Lim, Ungmo Kim. “A Spelling Correction System Based on Statistcal Data of Spelling Errors”, The KIPS Transactionsty, vol.2 no.6, 1995. pp.839-846Google Scholar