Abstract
Since automatic language generation is a task able to enrich applications rooted in most of the language-related areas, from machine translation to interactive dialogue, it seems worthwhile to undertake a strategy focused on enhancing generation system’s adaptability and flexibility. It is our first objective to understand the relation between the factors that contribute to discourse articulation in order to devise the techniques that will generate it. From that point, we want to determine the appropriate methods to automatically learn those factors. The role of genre on this approach remains essential as provider of the stable forms that are required in the discourse to meet certain communicative goals. The arising of new web-based genres and the accessibility of the data due to its digital nature, has prompted us to use reviews in our first attempt to learn the characteristics of their singular non-rigid structure. The process and the preliminary results are explained in the present paper.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
Actually, coherence has been accepted as a quality indicator [29].
- 2.
References
Bachand, F.-H., Davoodi, E., Kosseim, L.: An investigation on the influence of genres and textual organisation on the use of discourse relations. In: Gelbukh, A. (ed.) CICLing 2014. LNCS, vol. 8403, pp. 454–468. Springer, Heidelberg (2014). doi:10.1007/978-3-642-54906-9_37
Bakhtin, M.M.: Speech Genres and Other Late Essays. University of Texas Press, Austin (2010)
Barzilay, R.: Probabilistic approaches for modeling text structure and their application to text-to-text generation. In: Krahmer, E., Theune, M. (eds.) EACL/ENLG -2009. LNCS, vol. 5790, pp. 1–12. Springer, Heidelberg (2010). doi:10.1007/978-3-642-15573-4_1
Barzilay, R., Lapata, M.: Collective content selection for concept-to-text generation. In: Proceedings of the Conference on Human Language Technology and Empirical Methods in Natural Language Processing, pp. 331–338. Association for Computational Linguistics (2005)
Bhatia, V.: Worlds of Written Discourse: A Genre-Based View. A&C Black, London (2004)
Buitelaar, P., Arcan, M., Iglesias Fernandez, C.A., Sánchez Rada, J.F., Strapparava, C.: Linguistic linked data for sentiment analysis (2013)
Cambria, E., Schuller, B., Xia, Y., Havasi, C.: New avenues in opinion mining and sentiment analysis. IEEE Intell. Syst. 28(2), 15–21 (2013)
Duboue, P.A., McKeown, K.R.: Statistical acquisition of content selection rules for natural language generation. In: Proceedings of the 2003 Conference on Empirical Methods in Natural Language Processing, pp. 121–128. Association for Computational Linguistics (2003)
García-Miguel, J.M., Vaamonde, G., Domínguez, F.G.: Adesse, a database with syntactic and semantic annotation of a corpus of Spanish. In: LREC (2010). http://dblp.uni-trier.de/db/conf/lrec/lrec2010.html#Garcia-MiguelVD10
Ge, T., Pei, W., Ji, H., Li, S., Chang, B., Sui, Z.: Bring you to the past: automatic generation of topically relevant event chronicles. In: Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (vol. 1: Long Papers), pp. 575–585. Association for Computational Linguistics, Beijing (2015). http://www.aclweb.org/anthology/pp.15-1056
Gruber, H., Redeker, G.: The Pragmatics of Discourse Coherence: Theories and Applications, vol. 254. John Benjamins Publishing Company, Amsterdam (2014)
Halliday, M., Matthiessen, C.M., Matthiessen, C.: An Introduction to Functional Grammar. Routledge, London (2014)
Hearst, M.A.: Texttiling: segmenting text into multi-paragraph subtopic passages. Comput. Linguist. 23(1), 33–64 (1997)
Hu, Y., Wan, X.: Automatic generation of related work sections in scientific papers: an optimization approach. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 1624–1633. Association for Computational Linguistics, Doha (2014). http://www.aclweb.org/anthology/D14-1170
Jewell, M.O., Lawrence, K.F., Tuffield, M.M., Prugel-Bennett, A., Millard, D.E., Nixon, M.S., Shadbolt, N.R., et al.: Ontomedia: an ontology for the representation of heterogeneous media. In: Proceeding of SIGIR Workshop on Mutlimedia Information Retrieval. ACM SIGIR (2005)
Jha, R., Finegan-Dollak, C., King, B., Coke, R., Radev, D.: Content models for survey generation: a factoid-based evaluation. In: Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (vol. 1: Long Papers), pp. 441–450. Association for Computational Linguistics, Beijing (2015). http://www.aclweb.org/anthology/pp.15-1043
Kondadadi, R., Howald, B., Schilder, F.: A statistical NLG framework for aggregated planning and realization. In: Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (vol. 1: Long Papers), pp. 1406–1415. Association for Computational Linguistics, Sofia (2013). http://www.aclweb.org/anthology/pp.13-1138
Konstas, I., Lapata, M.: A global model for concept-to-text generation. J. Artif. Intell. Res. 48, 305–346 (2013)
Li, B., Thakkar, M., Wang, Y., Riedl, M.O.: Storytelling with adjustable narrator styles and sentiments. In: Mitchell, A., Fernández-Vara, C., Thue, D. (eds.) ICIDS 2014. LNCS, vol. 8832, pp. 1–12. Springer, Cham (2014). doi:10.1007/978-3-319-12337-0_1
Lombardo, V., Damiano, R.: Semantic annotation of narrative media objects. Multimed. Tools Appl. 59(2), 407–439 (2012)
Matthiessen, C.M.: Registerial cartography: context-based mapping of text types and their rhetorical-relational organization (2014)
Padró, L., Stanilovsky, E.: FreeLing 3.0: towards wider multilinguality. In: Proceedings of the Eight International Conference on Language Resources and Evaluation (LREC 2012). European Language Resources Association (ELRA) (2012)
Reiter, E., Dale, R., Feng, Z.: Building Natural Language Generation Systems, vol. 33. MIT Press, Cambridge (2000)
dos Santos, C.N., Gatti, M.: Deep convolutional neural networks for sentiment analysis of short texts. In: COLING, pp. 69–78 (2014)
Santosh, D.T., Vardhan, B.V.: Feature and sentiment based linked instance RDF data towards ontology based review categorization. In: Proceedings of the World Congress on Engineering, vol. 1 (2015)
Sauper, C., Barzilay, R.: Automatically generating wikipedia articles: a structure-aware approach. In: Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP, vol. 1, pp. 208–216. Association for Computational Linguistics (2009)
Swavels, J.: Genre Analysis: English in Academic and Research Settings. Cambridge University Press, Cambridge (1990)
Taboada, M.: Stages in an online review genre: text & talk. Interdisc. J. Lang. Discourse Commun. Stud. 31(2), 247–269 (2011)
Webber, B., Joshi, A.: Discourse structure and computation: past, present and future. In: Proceedings of the ACL-2012 Special Workshop on Rediscovering 50 Years of Discoveries, pp. 42–54. Association for Computational Linguistics (2012)
Witten, I.H., Frank, E., Hall, M.A.: Data Mining: Practical Machine Learning Tools and Techniques, 3rd edn. Morgan Kaufmann Publishers Inc., San Francisco (2011)
Acknowledgments
This work has been supported by the grant ACIF/2016/501 from the Generalitat Valenciana. Funds have been also received from the University of Alicante, Spanish Government and the European Commission through the projects “Explotación y tratamiento de la información disponible en Internet para la anotación y generación de textos adaptados al usuario” (GRE13-15) and “DIIM2.0: Desarrollo de técnicas Inteligentes e Interactivas de Minería y generación de información sobre la web 2.0” (PROMETEOII/2014/001), TIN2015-65100-R, TIN2015-65136-C2-2-R, and SAM (FP7-611312), respectively.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Vicente, M., Lloret, E. (2017). Exploring Flexibility in Natural Language Generation Through Discursive Analysis of New Textual Genres. In: Quesada, J., Martín Mateos , FJ., López Soto, T. (eds) Future and Emerging Trends in Language Technology. Machine Learning and Big Data. FETLT 2016. Lecture Notes in Computer Science(), vol 10341. Springer, Cham. https://doi.org/10.1007/978-3-319-69365-1_8
Download citation
DOI: https://doi.org/10.1007/978-3-319-69365-1_8
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-69364-4
Online ISBN: 978-3-319-69365-1
eBook Packages: Computer ScienceComputer Science (R0)