Automatically Extracting Chinese Aliases of Prohibited Items Based on Web Searching

He, Tao; Liu, Juan; Li, Kai; Yang, Meini

doi:10.1007/978-3-642-25658-5_39

Tao He⁴,
Juan Liu⁴,
Kai Li⁴ &
…
Meini Yang⁵

Part of the book series: Advances in Intelligent and Soft Computing ((AINSC,volume 124))

2556 Accesses
1 Citations

Abstract

With the development of e-commerce technologies, more and more sellers choose to open online shops in the e-commerce platforms due to the advantages of saving costs and spaces, and more and more buyers choose to do online shopping with the advantages of saving time and money, as well as the convenience. On one hand, this kind of web based trading greatly facilitates both sellers and buyers; on the other hand, it makes more technical demands on the e-commerce platforms providers, such as the automatic detection of the prohibited goods, especially with Chinese names or descriptions due to the ambiguity nature of Chinese language. In this paper, we propose a novel idea for addressing this problem by web based aliases extracting. The experiment results illustrate the effectiveness and feasibility of our method.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 389.00; Price excludes VAT (USA)

Softcover Book: USD 499.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Wang, R.C., Cohen, W.W.: Language-independent set expansion of named entities using the web. In: Proceedings of the 2007 Seventh IEEE International Conference on Data Mining, pp. 342–350. IEEE Computer Society Press, Washington (2007)
Chapter Google Scholar
Etzioni, O., Cafarella, M., Downey, D., Popescu, A.M., Shaked, T., Soderland, S., Weld, D.S., Yates, A.: Unsupervised named-entity extraction from the web: an experimental study. Artif. Intell. 165, 91–134 (2005)
Article Google Scholar
Benjamin, V.D., Pasca, M.: Finding cars, goddesses and enzymes: parametrizable acquisition of labeled instances for open-domain information extraction. In: Proceedings of the 23rd National Conference on Artificial Intelligence, vol. 2, pp. 1243–1248. AAAI Press (2008)
Google Scholar
Talukdar, P.P., Reisinger, J., Paśca, M., Ravichandran, D., Bhagat, R., Pereira, F.: Weakly-supervised acquisition of labeled class instances using graph random walks. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing, EMNLP 2008, pp. 582–590. Association for Computational Linguistics, Stroudsburg (2008)
Chapter Google Scholar
Pantel, P., Crestan, E., Borkovsky, A., Popescu, A.M., Vyas, V.: Web-scale distributional similarity and entity set expansion. In: Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, EMNLP 2009, vol. 2. Association for Computational Linguistics, Stroudsburg (2009)
Google Scholar
Wang, R.C., Cohen, W.W.: Automatic set instance extraction using the web. In: Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP, ACL 2009, vol. 1, pp. 441–449. Association for Computational Linguistics, Stroudsburg (2009)
Google Scholar
Zheng, Y., Liu, Z., Xie, L.: Growing related words from seed via user behaviors: a reranking based approach. In: Proceedings of the ACL 2010 Student Research Workshop, ACLstudent 2010, pp. 49–54. Association for Computational Linguistics, Stroudsburg (2010)
Google Scholar
Cilibrasi, R.L., Vitanyi, P.M.B.: The google similarity distance. IEEE Transactions on Knowledge And Data Engineering 19, 370–383 (2007)
Article Google Scholar

Download references

Author information

Authors and Affiliations

School of Computer Science, Wuhan University, Wuhan, 430072, China
Tao He, Juan Liu & Kai Li
College of Science, Naval Univ. of Engineering, Wuhan, 430033, China
Meini Yang

Authors

Tao He
View author publications
You can also search for this author in PubMed Google Scholar
Juan Liu
View author publications
You can also search for this author in PubMed Google Scholar
Kai Li
View author publications
You can also search for this author in PubMed Google Scholar
Meini Yang
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science and Engineering, Shanghai Jiao Tong University, 800 Dongchuan Road, 200240, Shanghai, China
Yinglin Wang
School of Information Science and Technology, Southwest Jiaotong University, 610031, Chengdu, Sichuan Province, China
Tianrui Li

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

He, T., Liu, J., Li, K., Yang, M. (2011). Automatically Extracting Chinese Aliases of Prohibited Items Based on Web Searching. In: Wang, Y., Li, T. (eds) Practical Applications of Intelligent Systems. Advances in Intelligent and Soft Computing, vol 124. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-25658-5_39

Download citation

DOI: https://doi.org/10.1007/978-3-642-25658-5_39
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-25657-8
Online ISBN: 978-3-642-25658-5
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics