Advertisement

Lexical Text Simplification Using WordNet

  • Debabrata SwainEmail author
  • Mrunmayee Tambe
  • Preeti Ballal
  • Vishal Dolase
  • Kajol Agrawal
  • Yogesh Rajmane
Conference paper
Part of the Communications in Computer and Information Science book series (CCIS, volume 1046)

Abstract

Internet is distributed environment and hence, huge amount of information is available on it. People use internet to access the information on the web. While referring to any information people face difficulty to understand the complex sentences and words used related to technology and science. Technical and scientific words are mostly found in research papers, medical reports, newspapers and other reading material. Text simplification is a technique used to automatically transform complicated text into simpler form. In the proposed system an efficient text simplification technique has been developed using word net model available in the Natural Language toolkit (NLTK). The dataset used for experimentation is collected through a random survey from web sources. Here, the proposed system is divided into 3 phases. In the first phase data collection and pre-processing has been performed. In second phase complex words are identified and in the 3rd phase replacement of complex words with their simple synonyms is being done. The performance of the system has been analyzed by user review to accuracy of 87%.

Keywords

Text simplification Lexical simplification and syntactic simplification Tokenization 

References

  1. 1.
    Panzer, A.M., Kindig, D.A., Nielsen-Bohlman, L.: Health Literacy: A Prescription to End Confusion. National Academy Press, Washington, D.C. (2004)Google Scholar
  2. 2.
    Skeppstedt, M., Kvist, M., Abrahamsson, E., Forni, T.: Medical text simplification using synonym replacement: adapting assessment of word difficulty to a compounding language (2014)Google Scholar
  3. 3.
    Tyagi, S., Chopra, D., Mathur, I., Joshi, N.: Classifier based text simplification for improved machine translation (2015)Google Scholar
  4. 4.
    Wang, T., Qiang, J., Rochford, J., Chen, P.: Text simplification using neural machine translation (2016)Google Scholar
  5. 5.
    McCray, A.T., Bodenreider, O.: Exploring semantic groups through visual approaches. J. Biomed. Inform. 36(6), 414–432 (2003)CrossRefGoogle Scholar
  6. 6.
    Baldridge, J., Bierner, G., Morton, T., Kottmann, J.: OpenNLP. A Java-based NLP Toolkit (2005)Google Scholar
  7. 7.
    Siddharthan, A.: Syntactic simplification and text cohesion. Res. Lang. Comput. 4, 77–109 (2006)CrossRefGoogle Scholar
  8. 8.
    Curtis, D., Leroy, G., Zeng-Treitler, Q., Smith, C.A., Keselman, A., Logan, R.: Developing informatics tools and strategies for consumer-centered health communication. J. Am. Med. Inform. Assoc. 15(4), 473–483 (2008)CrossRefGoogle Scholar
  9. 9.
    Sleator, D., Grinberg, D., Lafferty, J.: A robust parsing algorithm for link grammars (1995)Google Scholar
  10. 10.
    Bingel, J., Paetzold, G.H., Søgaard, A.: Lexi: a tool for adaptive, personalized text simplification (2018)Google Scholar
  11. 11.
    Yimam, S.M., Biemann, C.: Par4Sim – adaptive paraphrasing for text simplificationGoogle Scholar
  12. 12.
    Biemann, C., Riedl, M.: Text: now in 2D! A framework for lexical expansion with contextual similarity. J. Lang. Model. 1(1), 55–95 (2013)CrossRefGoogle Scholar
  13. 13.
    Siddharthan, A.: An architecture for a text simplification system. In: Proceedings of the IEEE Language Engineering Conference (2002)Google Scholar
  14. 14.
    Dymetman, M., Cancedda, N., Specia, L., Turchi, M., Cristianini, N.: Estimating the sentence-level quality of machine translation systems. In: 13th Conference of the European Association for Machine Translation (2009)Google Scholar
  15. 15.
    Aluisio, S., Scarton, C., Specia, L., Gasperin, C.: Readability assessment for text simplification. In: Proceedings of the NAACL HLT 2010 Fifth Workshop on Innovative Use of NLP for Building Educational Applications. Association for Computational Linguistics (2010)Google Scholar
  16. 16.
    Etayo, E., Martínez, E.G., Saggion, H., Bourg, L., Anula, A.: Text simplification in simplext: making text more accessible. Procesamiento del lenguaje natural 47, 341–342 (2011)Google Scholar
  17. 17.
    Kauchak, D., Pentoney, C., Leroy, G., Mouradi, O.: Text simplification tools: using machine learning to discover features that identify difficult text (2014)Google Scholar
  18. 18.
    Petersen, E., Ostendorf, M.: Text simplification for language learners: a corpus analysis. In: Speech and Language Technology for Education (2007)Google Scholar
  19. 19.
    Chandrasekar, R., Doran, C., Srinivas, B.: Motivations and methods for text simplification. In: Proceedings of the International Conference on Computational Linguistics, pp. 1041–1044 (1996)Google Scholar
  20. 20.
    Mandya, A.A., Siddharthan, A.: Text simplification using synchronous dependency grammars: generalising automatically harvested rules. In: 8th International Natural Language Generation Conference, pp. 16–25 (2014). Research Output: Chapter in Book/Report/Conference Proceeding, Conference ContributionGoogle Scholar
  21. 21.
    Margarido, P.R.A., Pardo, T.A.S., Antonio, G.M., Fuentes, V.B., Aires, R., Aluísio, S.M., Fortes, R.P.M.: Automatic summarization for text simplification: evaluating text understanding by poor readers. In: Proceedings of the XIV Brazilian Symposium on Multimedia and the Web (2008).  https://doi.org/10.1145/1809980.1810057

Copyright information

© Springer Nature Singapore Pte Ltd. 2019

Authors and Affiliations

  • Debabrata Swain
    • 1
    Email author
  • Mrunmayee Tambe
    • 1
  • Preeti Ballal
    • 1
  • Vishal Dolase
    • 1
  • Kajol Agrawal
    • 1
  • Yogesh Rajmane
    • 1
  1. 1.Department of IT-MCAVishwakarma Institute of TechnologyPuneIndia

Personalised recommendations