Abstract
Due to increasing amount of text data available in WWW, it becomes time consuming for information system users to explore every text source in detail. Automatic text summarization (ATS) is the process of generating summary by condensing text document automatically by a computer machine that can save users precious time. Major issue with most of the feature-based ATS methods is to find optimal feature weights for sentence scoring to optimize quality of text summary. This paper presents a novel voting-based approach that use modified reciprocal ranking approach which alleviates the issue of feature weighting and. Proposed approach use a specific prominent set of features for initial ranking that further boosts the performance. Experimental results on DUC 2002 dataset using ROUGE evaluation matrices show that our proposed voting approach performs better when compared to other statistical- and voting-based methods.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Luhn, H. P. (1958). The automatic creation of literature abstracts. IBM Journal of Research and Development, 2(2), 159–165.
Baxendale, P. B. (1958). Machine-made index for technical literature: An experiment. IBM Journal of Research and Development, 2(4), 354–361.
Edmundson, H. P. (1969). New methods in automatic extracting. Journal of the ACM, 16(2), 264–285.
Rush, J. E., Salvador, R., & Zamora, A. (1971). Automatic abstracting and indexing. ii. production of indicative abstracts by application of contextual inference and syntactic coherence criteria. Journal of the American Society for Information Science, 22(4), 260–274.
Pollock, J. J., & Zamora, A. (1975). Automatic abstracting research at chemical abstracts service. Chemical Information and Computer Sciences, 15(4), 226–232.
Brandow, R., Mitze, K., & Lisa, F. R. (1995). Automatic condensation of electronic publications by sentence selection. Information Processing and Management, 31, 675–685.
Salton, G., Fox, E. A., & Wu, H. (1983). Extended Boolean information retrieval. Communications of the ACM, 26(11), 1022–1036.
Church, K., & Gale, W. A. (1995). Inverse document frequency (idf): A measure of deviations from poisson. In Proceedings of the Third Workshop on Very Large Corpora (pp. 121–130).
Salton, G., Singhal, A., Mitra, M., & Buckley, C. (1997). Automatic text structuring and summarization. Information Processing and Management, 33(2), 193–207.
Mori, T. (2002). Information gain ratio as term weight: The case of summarization of ir results. In Proceedings of the 19th International Conference on Computational Linguistics (pp. 688–694). Association for Computational Linguistics Publisher.
Nobata, C., Sekine, S., Isahara, H., & Grishman, R. (2002). Summarization system integrated with named entity tagging and IE pattern discovery. In Proceedings of Third International Conference on Language Resources and Evaluation (LREC). Las Palmas, Canary Islands, Spain.
Mihalcea, R., & Tarau, P. (2004). Textrank: Bringing order into texts. In D. Lin, & D. Wu, (Eds.), Proceedings of EMNLP, Association for Computational Linguistics (pp. 404–411). Barcelona, Spain.
Fattah, M. A., & Ren, F. (2009). Ga, mr, ann, pnn and gmm based models for automatic text summarization. Computer Speech & Language, 23(1), 126–144.
Liu, X., Jonathan, J. W., & Chunyu, K.(2009). An extractive text summarizer based on significant words. In Proceedings of the 22nd International Conference on Computer Processing of Oriental Languages. Language Technology for the Knowledge-based Economy (ICCPOL ‘09) (pp. 168–178). Heidelberg: Springer-Verlag, Berlin.
Zhang, M., Song, R., Lin, C., Ma, S., Jang, Z., Lin, Y., Liu, Y., & Zhao, L. (2002). Expansion-based technologies in finding relevant and new information: THU TREC2002: Novelty Track experiments. In Proceedings of TREC. Gaithersburg, MD.
Aslam, J. A., & Montague, M. (2001). Models for metasearch. In Proceedings of ACM SIGIR 2001 (pp. 276–284). New Orleans LA.
Ogilvie, P., & Callan, J. (2003). Combining document representations for known item search. In SIGIR (pp. 143–150). New York, NY, USA.
Macdonald, C., & Ounis, I. (2008). Voting techniques for expert search. Knowledge and Information Systems, 16(3), 259–280.
Macdonald, C., & Ounis, I. (2006). Voting for candidates: adapting data fusion techniques for an expert search task. In CIKM Proceedings of the 15th ACM International Conference on Information and Knowledge Management (pp. 387–396).
Kumar, Y. J., Salim, N., Abuobieda, A., & Tawfik, A. (2013). Multi document summarization based on cross-document relation using voting technique. In International Conference on Computing, Electrical and Electronics Engineering (ICCEEE) (pp. 609–614).
Kumar, Y. J., Goh, O. S., Ghani, M. K., Salim, N., & Albaham, A. T. (2014). Voting models for summary extraction from text documents. In International Conference on IT Convergence and Security (ICITCS) (pp. 1–4).
Wang, Y., & Maches, J. (2013). A comprehensive method for text summarization based on latent semantic analysis. NLPCC (pp. 394–401). Heidelberg: Springer-Verlag Berlin.
Lin, C. Y. (2004). ROUGE: A package for automatic evaluation of summaries. In Proceedings of the Workshop on Text Summarization Branches Out (WAS 2004). Barcelona, Spain.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer Science+Business Media Singapore
About this paper
Cite this paper
Meena, Y.K., Gopalani, D. (2016). Improvement in Quality of Extractive Text Summaries Using Modified Reciprocal Ranking. In: Satapathy, S., Joshi, A., Modi, N., Pathak, N. (eds) Proceedings of International Conference on ICT for Sustainable Development. Advances in Intelligent Systems and Computing, vol 409. Springer, Singapore. https://doi.org/10.1007/978-981-10-0135-2_30
Download citation
DOI: https://doi.org/10.1007/978-981-10-0135-2_30
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-0133-8
Online ISBN: 978-981-10-0135-2
eBook Packages: EngineeringEngineering (R0)