Definition
A document field is a part of a document or of the document metadata in which the text has a particular function. A document field can contain free or preformatted text. Each field, according to its function, has different characteristics, length, and term distributions.
Key Points
Textual documents have implicit structure, which aids the understanding of the text. Long textual documents are usually organized in chapters, sections, paragraphs, and each of those can have a concise description in the form of a title. In the case of hypertext documents, explicit links between documents in the form of hyperlinks are often associated with anchor text. News wire documents also have metadata such as date, or the name of the author. Efforts to standardize metadata about documents have resulted in projects such as the Dublin Core Metadata Initiative [1].
Fields are also being used to represent the annotations of text with semantic and syntactic information. For example, the semantic...
Recommended Reading
Dublin Core Metadata Initiative. Retrieved 15 Apr 2008. http://dublincore.org/
Jin R, Hauptmann A, Zhai C. Title language model for information retrieval. In: Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval; 2002. p. 42–8.
Zaragoza H, Rode H, Mika P, Atserias J, Ciaramita M, Attardi G. Ranking Very Many Typed Entities on Wikipedia. In: Proceedings of the International Conference on Information and Knowledge Management; 2007. p. 1015–8.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Section Editor information
Rights and permissions
Copyright information
© 2016 Springer Science+Business Media LLC
About this entry
Cite this entry
Plachouras, V. (2016). Document Field. In: Liu, L., Özsu, M. (eds) Encyclopedia of Database Systems. Springer, New York, NY. https://doi.org/10.1007/978-1-4899-7993-3_938-2
Download citation
DOI: https://doi.org/10.1007/978-1-4899-7993-3_938-2
Received:
Accepted:
Published:
Publisher Name: Springer, New York, NY
Online ISBN: 978-1-4899-7993-3
eBook Packages: Springer Reference Computer SciencesReference Module Computer Science and Engineering