Research on Topic Link Detection Method Based on Semantic Domain

  • Pei-Yu Liu
  • Yu-Zhen Yang
  • Shao-Dong Fei
  • Zhen Zhang
Part of the Lecture Notes in Computer Science book series (LNCS, volume 8351)


Topic Link Detection aims to detect whether a pair of random stories discuss the same topic, which is an important subtask of Topic Detection and Tracking. In previous works, statistical method and machine-learning approach are used more often than not, however, the semantic distribution of a story and the structure relationship of contents are ignored. A new method based on the semantic domain is proposed for the purpose of improved the precision. In this method, every story is divided some semantic domain through analyzing internal semantic distribution and structure relationships of contexts. The results of experiment proved that the proposed method can improve performance of system.


topic link detection semantic domain topic model 


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    The 2004 Topic Detection and Tracking (TDT 2004) Task Definition and Evaluation Plan (EB/OL) (2004),
  2. 2.
    Kumaran, G., Allan, J.: Text classification and named entities for new event detection. In: Proceeding of the 27th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 297–304. ACM, New York (2004)Google Scholar
  3. 3.
    Allan, J., Carbonell, J., Doddington, G., Yamron, J., Yiming, Y.: Topic detection and tracking pilot study final report. In: Proceedings of the Broadcast News Transcription and Understanding Workshop, vol. 2, pp. 1–25 (1998)Google Scholar
  4. 4.
    Naptali, W., Tsuchiya, M., Nakagawa, S.: Topic-dependent language model with voting on noun history. ACM Transactions on Asian Language Information Processing(TALIP) 9(7) (2010)Google Scholar
  5. 5.
    Ponte, J.M., Bruce Croft, W.: A language modeling approach to information retrieval. In: Proc. SIGIR, pp. 275–281 (1998)Google Scholar
  6. 6.
    Ha-Thuc, V., Srinivasan, P.: Topic models and a revisit of text-related applications. In: Proceedings of the 2nd PHD Workshop on Information and Knowledge Management, pp. 25–32 (2008)Google Scholar
  7. 7.
    Nallapati, R., Feng, A., Peng, F., Allan, J.: Event threading within news topics. In: Proceedings of the Thirteenth ACM Conference on Information and Knowledge Management (CIKM), pp. 446–453 (2004)Google Scholar
  8. 8.
    Chaitanya, C., Smyth, P., Steyvers, M.: Combining Concept Hierarchies and Statistical Topic Models. In: Proceedings of the 17th ACM Conference on Information and Knowledge Management, pp. 1469–1470 (2008)Google Scholar
  9. 9.
    Hong, Y., Zhang, Y., Fan, J.L., Liu, T., Sheng, L.: Chinese topic link detection based on semantic domain language model. Journal of Software 19(9), 2265–2275 (2008)CrossRefGoogle Scholar
  10. 10.
    Wang, L., Li, F.: Story Link Detection Based on Event Words. In: Gelbukh, A. (ed.) CICLing 2011, Part II. LNCS, vol. 6609, pp. 202–211. Springer, Heidelberg (2011)CrossRefGoogle Scholar
  11. 11.
    Shah, C., Eguchi, K.: Improving document representation for story link detection by modeling term topicality. Information and Media Technologies 4(2), 433–441 (2009)Google Scholar
  12. 12.
    Hong, Y., Zhang, Y., Fan, J.L., Liu, T., Li, S.: New Event Detection Based on Division Comparison of Subtopic. Chinese Journal of Computers 31(4), 687–695 (2008)CrossRefGoogle Scholar
  13. 13.
    Lakshmi, K., Mukherjee, S.: Using Cohesion-model for story link detection system. Intenational Journal of Computer Science and Network Security 7(3), 59–66 (2007)Google Scholar
  14. 14.
    Zhang, K., Zi, J., Wu, L.G.: New event detection based on indexing-tree and named entity. In: Proceedings of the 30th Annual Int’1 ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 215–222 (2007)Google Scholar
  15. 15.
    Nomoto, T.: Two-tier Similarity Model for Story Link Detection. In: Proceedings of the 19th ACM International Conference Information and Knowledge Management, pp. 789–798 (2010)Google Scholar
  16. 16.
    Zhang, K., Li, J.Z., Wu, G., Wang, K.H.: Term-Committee-Based Event Identification Within Topics. Journal of Computer Research and Development 46(2), 245–252 (2009)Google Scholar

Copyright information

© Springer International Publishing Switzerland 2014

Authors and Affiliations

  • Pei-Yu Liu
    • 1
    • 2
  • Yu-Zhen Yang
    • 1
    • 2
  • Shao-Dong Fei
    • 1
    • 3
  • Zhen Zhang
    • 1
    • 2
  1. 1.School of Information Science and EngineeringShandong Normal UniversityShandongChina
  2. 2.Shandong Provincial Key Laboratory for Distributed Computer Software Novel TechnologyShandong Province, JinanChina
  3. 3.Shandong University of Finance and EconomicsShandong, JinanChina

Personalised recommendations