Overview of Text Mining

Weiss, Sholom M.; Indurkhya, Nitin; Zhang, Tong

doi:10.1007/978-1-4471-6750-1_1

Part of the book series: Texts in Computer Science (TCS)

Abstract

Text mining and data mining are contrasted relative to automated prediction. Models are constructed by training on samples of unstructured documents, and results are projected to new text. A standard data format for input to prediction methods is described. The key objective of data preparation is to transform text into a numerical format, eventually sharing a common representation with numerical data mining. Different text-mining tasks are introduced that fit within a predictive framework for machine learning. These include document classification, information retrieval, clustering of documents, information extraction, and performance evaluation.
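
The data-preparation step mentioned in the abstract, transforming text into a numerical format, can be illustrated with a minimal sketch. This is not the chapter's own procedure: the whitespace tokenizer, the function names (tokenize, build_vocabulary, to_count_vectors), and the three sample documents are all assumptions made for illustration. Each document becomes a row of word counts over a shared vocabulary, which is one common way to give text the same tabular shape as numerical data.

```python
from collections import Counter


def tokenize(text):
    # Assumption: lowercase and split on whitespace; real pipelines
    # usually handle punctuation, stemming, and stop words as well.
    return text.lower().split()


def build_vocabulary(documents):
    # Map every distinct token to a column index, in sorted order.
    tokens = sorted({tok for doc in documents for tok in tokenize(doc)})
    return {tok: index for index, tok in enumerate(tokens)}


def to_count_vectors(documents, vocabulary):
    # One row per document, one column per vocabulary entry.
    rows = []
    for doc in documents:
        counts = Counter(tokenize(doc))
        rows.append([counts.get(tok, 0) for tok in vocabulary])
    return rows


# Hypothetical sample documents, not taken from the chapter.
docs = ["stock prices rise", "stock prices fall", "team wins match"]
vocab = build_vocabulary(docs)
matrix = to_count_vectors(docs, vocab)
```

Once documents are rows of numbers, a label column (for example, a topic) can be attached to each row, and the result can be passed to any prediction method that accepts tabular input.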

Author information

Authors and Affiliations

Department of Computer Science, Rutgers University, Piscataway, NJ, USA
Sholom M. Weiss
School of Computer Science and Engineering, University of New South Wales, Sydney, NSW, Australia
Nitin Indurkhya
Department of Statistics, Hill Center, Rutgers University, Piscataway, NJ, USA
Tong Zhang

Corresponding author

Correspondence to Sholom M. Weiss.

About this chapter

Cite this chapter

Weiss, S.M., Indurkhya, N., Zhang, T. (2015). Overview of Text Mining. In: Fundamentals of Predictive Text Mining. Texts in Computer Science. Springer, London. https://doi.org/10.1007/978-1-4471-6750-1_1

DOI: https://doi.org/10.1007/978-1-4471-6750-1_1
Published: 08 September 2015
Publisher Name: Springer, London
Print ISBN: 978-1-4471-6749-5
Online ISBN: 978-1-4471-6750-1
eBook Packages: Computer Science, Computer Science (R0)
