Abstract
Although XML is becoming the standard document format, a legacy problem remains: large amounts of text, written in the past as well as today, are not available in this format. To exploit the benefits of XML, these legacy texts must be converted into XML. In this chapter, we discuss the issues involved in automatic XML markup of documents. We survey existing approaches and describe one specific system in some detail.
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Abolhassani, M., Fuhr, N., Gövert, N. (2003). Information Extraction and Automatic Markup for XML Documents. In: Blanken, H., Grabs, T., Schek, HJ., Schenkel, R., Weikum, G. (eds) Intelligent Search on XML Data. Lecture Notes in Computer Science, vol 2818. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-45194-5_11
DOI: https://doi.org/10.1007/978-3-540-45194-5_11
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40768-3
Online ISBN: 978-3-540-45194-5
eBook Packages: Springer Book Archive