Abstract
In the current internet environment, the issue of information credibility is crucial given that anyone can put up information on the Web freely. Hence, it is urge and necessary to develop computational approaches to distinguish reliable information from the unreliable, inaccurate one. In the year 2010, CoNLL proposed a new shared task of hedge detection, aiming to push un-certainty detection for natural language processing applications. Among the pro-posed approaches, sequence labeling models exhibit promising performance. However, only shallow features (e.g. word, lemma, POS tags, etc.) were explored in the existing research. In this paper, we aim to exploit the advantage of topical features in sequence labeling based hedge detection. The experimental results illustrate the effectiveness of the high-level features.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Blei, D., Ng, A., Jordan, M.: Latent dirichlet allocation. Journal of Machine Learning Research, 993–1022 (2003)
Farkas, R., et al.: The CoNLL 2010 Shared Task: Learning to Detect Hedges and their Scope in Natural Language Text. In: Proceedings of the Fourteenth Conference on Computational Natural Language Learning (CoNLL 2010): Shared Task, Uppsala, Sweden, pp. 1–12 (2010)
Ganter, V., Strube, M.: Finding Hedges by Chasing Weasels: Hedge Detection Using Wikipedia Tags and Shallow Linguistic Features. In: Proceedings of the ACL-IJCNLP 2009 Conference Short Papers, pp. 173–176 (2009)
Hyland, K.: Hedging in scientific research articles. John Benjamins B.V., Amsterdam (1998)
Lafferty, J., et al.: Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In: Proceedings of ICML 2001, pp. 282–289 (2001)
Lakoff, G.: Linguistics and natural logic. In: The Semantics of Natural Language, pp. 545–665. Reidel, Dordrecht (1972)
Light, M., Qiu, X., Srinivasan, P.: The Language of Bioscience: Facts, Speculations, and Statements in Between. In: Proceedings of the HLT-NAACL 2004 Workshop: Biolink 2004, Linking Biological Literature, Ontologies and Databases (2004)
Medlock, B., Briscoe, T.: Weakly Supervised Learning for Hedge Classification in Scientific Literature. In: Proceedings of the ACL (2007)
Nallapati, R., Surdeanu, M., Manning, C.: Blind Domain Transfer for Named Entity Recognition using Generative Latent Topic Models. In: Proceedings of the NIPS 2010 Workshop on Transfer Learning Via Rich Generative Models (2010)
Peng, J., Bo, L., Xu, J.: Conditional Neural Fields. In: Advances in Neural Information Processing Systems, NIPS (2009)
Szarvas, G.: Hedge Classification in Biomedical Texts with a Weakly Supervised Selection of Keywords. In: Proceedings of ACL 2008: HLT, pp. 281–289 (2008)
Wei, X., Croft, W.: LDA-Based Document Models for Ad-hoc Retrieval. In: Proceedings of SIGIR 2006 (2006)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Su, Q., Lou, H., Liu, P. (2013). Hedge Detection with Latent Features. In: Liu, P., Su, Q. (eds) Chinese Lexical Semantics. CLSW 2013. Lecture Notes in Computer Science(), vol 8229. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-45185-0_46
Download citation
DOI: https://doi.org/10.1007/978-3-642-45185-0_46
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-45184-3
Online ISBN: 978-3-642-45185-0
eBook Packages: Computer ScienceComputer Science (R0)