Abstract
We present a multi-sentence question segmentation strategy for community question answering services to alleviate the complexity of long sentences. We develop a complete scheme and make a solution to complex-question segmentation, including a question detector to extract question sentences, a question compression process to remove duplicate information, and a graph model to segment multi-sentence questions. In the graph model, we train a SVM classifier to compute the initial weight and we calculate the authority of a vertex to guide the propagating. The experimental results show that our method gets a good balance between completeness and redundancy of information, and significantly outperforms state-of-the-art methods.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Ang, J., Liu, Y., Shriberg, E.: Automatic dialog act segmentation and classification in multiparty meetings. In: Proc. Int. Conf. Acoust Speech, Signal Process (2005)
Takechi, M., Tokunaga, T., Matsumoto, Y.: Chunking-based question type identification for multi-sentence queries. In: SIGIR (2007)
Duan, H., Cao, Y., Lin, C.-Y., Yu, Y.: Searching questions by identifying question topic and question focus. In: HLT-ACL (2008)
Jeon, J., Croft, W.B., Lee, J.H.: Finding similar questions in large question and answer archives. In: CIKM (2005)
Riezler, S., Vasserman, A., Tsochantaridis, I., Mittal, V., Liu, Y.: Statistical machine translation for query expansion in answer retrieval. In: ACL (2007)
Wang, K., Ming, Z., Chua, T.-S.: A syntactic tree matching approach to finding similar questions in community-based qa services. In: SIGIR (2009)
Yinh, W., He, X., Meek, C.: Semantic parsing for single-relation question answering. In: ACL (2014)
Wang, K., Ming, Zh., Hu, X., Chua, T.: Segmentation of multi-sentence questions: towards effective question retrieval in cQA services. In: SIGIR (2010)
Cong, G., Wang, L., Lin, C., Song, Y., Sun, Y.: Finding question-answer pairs from online forums. In: SIGIR (2008)
Mihalcea, R.: Unsupervised large-vocabulary word sense disambiguation with graph-based algorithms for sequence data labeling. In: EMNLP (2005)
Navigli, R., Lapata, M.: Graph connectivity measures for unsupervised word sense disambiguation. In: IJCAI (2007)
Sinha, R., Mihalcea, R.: Unsupervised graph-based word sense disambiguation using measures of word semantic similarity. In: ICSC (2007)
Hessami, E., Mahmoud, F., Jadidinejad, A.: Unsupervised graph-based word sense disambiguation using lexical relation of WordNet. In: IJCSI (2011)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Wang, Y., Wu, Y., Lv, X. (2015). Multi-sentence Question Segmentation and Compression for Question Answering. In: Li, J., Ji, H., Zhao, D., Feng, Y. (eds) Natural Language Processing and Chinese Computing. NLPCC 2015. Lecture Notes in Computer Science(), vol 9362. Springer, Cham. https://doi.org/10.1007/978-3-319-25207-0_44
Download citation
DOI: https://doi.org/10.1007/978-3-319-25207-0_44
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-25206-3
Online ISBN: 978-3-319-25207-0
eBook Packages: Computer ScienceComputer Science (R0)