Abstract
The Internet haslong been used to transfer documents from one machine to another. These documents maybe anything from plain text to HTML, Microsoft Word, or PDF files. It’s difficult for a machine to extract data from documents written in a human language. Sometimes it’s useful to be able to extract information from documents and use it in new ways. Other times, it may simply be useful to keep the content the same, but adjust the layout—perhaps to permit the incorporation of a document into a larger compilation. For example, consider a large family-history book. You may wish to generate a list of all known addresses of people mentioned in the book, but they’re not listed in any standard way. Your program may have trouble retrieving two addresses from text such as the following:
Jeff Washington grew up in Chicago, IL at 132 S. Lake Shore Dr. After graduating from high school, he met Mary Davis of 300 E. Main St, Dallas, TX.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Rights and permissions
Copyright information
© 2004 John Goerzen
About this chapter
Cite this chapter
Goerzen, J. (2004). XML and XML-RPC. In: Foundations of Python Network Programming. Apress, Berkeley, CA. https://doi.org/10.1007/978-1-4302-0752-8_8
Download citation
DOI: https://doi.org/10.1007/978-1-4302-0752-8_8
Publisher Name: Apress, Berkeley, CA
Print ISBN: 978-1-59059-371-4
Online ISBN: 978-1-4302-0752-8
eBook Packages: Springer Book Archive