The subjective text is an important processed object of opinion mining, but sometimes there have been many informal expressions in a subjective text. The authors of the subjective texts have the personal expression habits which are not restricted to a formal grammar, so Network Informal Language (NIL) emerges. For example, the formal word “ (like)” is usually replaced by the NIL word “ tlx (conjee)” in some chatting tools like OICQ (Open ICQ) and MSN (Microsoft Ser vice Network). Currently, opinion mining tools are less effective to dispose these problems. For instance, they regard the NIL word “8 fa (non-word)” as a text noise, which expresses one's viewpoint, that is, the formal word “ (ok)” in fact. In this paper, we propose an approach to try to resolve the problems from NIL expres sions. Because NIL expressions are manually identified in general, we use minimal supervision technology to identify them. According to the two different types of NIL, we adopt different strategies for those. The experiment results have shown that the performance of the proposed approach is satisfied. Therefore, this approach is reasonable and effective. In the future, we will improve this approach, so that it can process more complicated NIL expressions.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Sproat R, Black AW, Chen S, Kumar S, Ostendorf M, Richards C (2001) Normalization of nonstandard words. J Computer Speech and Language, 15: 287-333
Xia YQ, Wong KF, Gao W (2005) NIL is not nothing: recognition of Chinese network informal language expressions. In: Proceedings of the 4th SIGHAN Workshop on Chinese Language Processing at IJCNLP’05, Jeju Island, Korea, pp 95-102
Yao TF, Peng SW (2007) A study of the classification approach for Chinese subjective and objective texts. In: Proceedings of the third national conference for information retrieval and content security, Suzhou, China, pp 117-123
Isozaki H, Kazawa H (2002) Efficient support vector classifiers for named entity recognition. In: Proceedings of the 19th international conference on computational linguistics (COLING 2002), Taipei, Taiwan, pp 1-8
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2008 Springer Science + Business Media B.V
About this paper
Cite this paper
Zhang, X., Yao, T. (2008). A Study of Network Informal Language Using Minimal Supervision Approach. In: Mahr, B., Huanye, S. (eds) Autonomous Systems – Self-Organization, Management, and Control. Springer, Dordrecht. https://doi.org/10.1007/978-1-4020-8889-6_18
Download citation
DOI: https://doi.org/10.1007/978-1-4020-8889-6_18
Publisher Name: Springer, Dordrecht
Print ISBN: 978-1-4020-8888-9
Online ISBN: 978-1-4020-8889-6
eBook Packages: EngineeringEngineering (R0)