Skip to main content

Classification of the Structure of Square Hmong Characters and Analysis of Its Statistical Properties

  • Conference paper
  • First Online:
Natural Language Processing and Chinese Computing (NLPCC 2018)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 11109))

  • 1970 Accesses

Abstract

Analysis of the character structure characteristics can lay an information foundation for the intelligent processing of square Hmong characters. Combined with the analysis of character structure characteristics, this paper presents a definition of the linearization of square Hmong characters, a definition of equivalence class division of the structure of square Hmong characters, and proposes a decision algorithm of structure equivalence class. According to the above algorithm, the structure of square Hmong characters is divided into eight equivalent classes. Analysis of the statistical properties, including the cumulative probability distribution, complexity, and information entropy of square Hmong characters appearing in practical documents, shows that, first, more than 90% of square Hmong characters appearing in practical documents are composed of two components, and more than 80% of these characters possess a left-right, top-bottom, or lower-left-enclosed structure, second, the number of mean components in a square Hmong character is slightly greater than 2, third, the information entropy of the structure of Hmong characters is within the interval (1.19, 2.16). Results reveal that square Hmong characters appearing frequently in practical documents follow the principle of simple structure orientation.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Instituse, R.: A fast smoothing & thinning method based on character structure. J. Chin. Inf. Process. 4(2), 49–55 (1990)

    Google Scholar 

  2. Shin, J., Suzuki, K., Hasegawa, A.: Handwritten Chinese character font generation based on stroke correspondence. Int. J. Comput. Process. Orient. Lang. 18(3), 211–226 (2005)

    Article  Google Scholar 

  3. Liu, M.Y., Duan, C.S., Pi, Y.G.: Basic elements knowledge acquisition study in the Chinese character intelligent formation system. J. Softw. Eng. Appl. 2(5), 316–322 (2009)

    Article  Google Scholar 

  4. Tan, J., Xie, X.H., Zheng, W.H., et al.: Radical extraction using affine sparse matrix factorization for printed Chinese characters recognition. Int. J. Pattern Recognit. Artif Intell. 26(3), 211–226 (2012)

    Article  MathSciNet  Google Scholar 

  5. Dobres, J., Chahine, N., Reimer, B., et al.: The effects of Chinese typeface design, stroke weight, and contrast polarity on glance based legibility. Displays 41, 42–49 (2016)

    Article  Google Scholar 

  6. Ai, J.Y., Yu, H.Z., Li, Y.H.: Statistical analysis on Tibetan shaped structure. J. Comput. Appl. 29(7), 2029–2031 (2009)

    Google Scholar 

  7. Cai, Z.J., CaiRang, Z.M.: Research on the distribution of Tibetan character forms. J. Chin. Inf. Process. 30(4), 98–105 (2016)

    Google Scholar 

  8. Kwon, Y.B.: Hangul tree classifier for type clustering using horizontal and vertical strokes. In: Proceedings of the 16th International Conference on Pattern Recognition, pp. 228–231. IEEE, Quebec City (2002)

    Google Scholar 

  9. Xu, R.J., Liu, C.P.: Grapheme segmentation and recognition in machine printed Hangul characters. J. Chin. Inf. Process. 20(2), 66–71 (2006)

    Google Scholar 

  10. Cui, R.Y., Kim, S.J.: Research on information structure of Korean characters. J. Chin. Inf. Process. 25(5), 114–119 (2011)

    Google Scholar 

  11. Mo, L.P., Zhou, K.Q.: Formal description of dynamic construction method for square Hmong language characters. J. Comput. Appl. 34(3), 861–864, 868 (2014)

    Google Scholar 

  12. Mo, L.P., Zhou, K.Q., Jiang, X.H.: Research on square Hmong language characters fonts based on OpenType technology. J. Chin. Inf. Process. 129(2), 150–156 (2015)

    Google Scholar 

  13. Mo, L.P., Zhou, K.Q.: A dynamical glyph generation method of Xiangxi Folk Hmong characters and its implementation approach. Acta Scicentiarum Naturalum Universitis Pekinesis 52(1), 141–147 (2016)

    MathSciNet  Google Scholar 

  14. Zhao, L.M., Liu, Z.Q.: Xiangxi square Hmong characters. Minor. Lang. China 12(1), 44–49 (1990)

    Google Scholar 

  15. Yang, Z.B., Luo, H.Y.: On the folk coinage of characters of the Miao People in Xiangxi area. J. Jishou Univ. (Soc. Sci. Edn.) 29(6), 130–134 (2008)

    Google Scholar 

  16. Long, Z.H.: Re-study of the coinage method of square characters of Miao language in the Youshui river basin of Yu, Xiang and E. J. Chongqing Educ. Coll. 25(5), 56–59 (2012)

    Google Scholar 

Download references

Acknowledgement

This work is supported by the National Natural Science Foundation of China (Nos. 61462029 and 61741205).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Li-Ping Mo .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Mo, LP., Zhou, KQ., Cao, LB., Jiang, W. (2018). Classification of the Structure of Square Hmong Characters and Analysis of Its Statistical Properties. In: Zhang, M., Ng, V., Zhao, D., Li, S., Zan, H. (eds) Natural Language Processing and Chinese Computing. NLPCC 2018. Lecture Notes in Computer Science(), vol 11109. Springer, Cham. https://doi.org/10.1007/978-3-319-99501-4_13

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-99501-4_13

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-99500-7

  • Online ISBN: 978-3-319-99501-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics