Advertisement

Text Mining

Predictive Methods for Analyzing Unstructured Information

  • Sholom M. Weiss
  • Nitin Indurkhya
  • Tong Zhang
  • Fred J. Damerau

Table of contents

  1. Front Matter
    Pages i-xii
  2. Sholom M. Weiss, Nitin Indurkhya, Tong Zhang, Fred J. Damerau
    Pages 1-13
  3. Sholom M. Weiss, Nitin Indurkhya, Tong Zhang, Fred J. Damerau
    Pages 15-46
  4. Sholom M. Weiss, Nitin Indurkhya, Tong Zhang, Fred J. Damerau
    Pages 47-84
  5. Sholom M. Weiss, Nitin Indurkhya, Tong Zhang, Fred J. Damerau
    Pages 85-102
  6. Sholom M. Weiss, Nitin Indurkhya, Tong Zhang, Fred J. Damerau
    Pages 103-128
  7. Sholom M. Weiss, Nitin Indurkhya, Tong Zhang, Fred J. Damerau
    Pages 129-156
  8. Sholom M. Weiss, Nitin Indurkhya, Tong Zhang, Fred J. Damerau
    Pages 157-195
  9. Sholom M. Weiss, Nitin Indurkhya, Tong Zhang, Fred J. Damerau
    Pages 197-211
  10. Back Matter
    Pages 213-237

About this book

Introduction

One consequence of the pervasive use of computers is that most documents originate in digital form. Text mining—the process of searching, retrieving, and analyzing unstructured, natural-language text—is concerned with how to exploit the textual data embedded in these documents.

Text Mining presents a comprehensive introduction and overview of the field, integrating related topics (such as artificial intelligence and knowledge discovery and data mining) and providing practical advice on how readers can use text-mining methods to analyze their own data. Emphasizing predictive methods, the book unifies all key areas in text mining: preprocessing, text categorization, information search and retrieval, clustering of documents, and information extraction. In addition, it identifies emerging directions for those looking to do research in the area. Some background in data mining is beneficial, but not essential.

Topics and features:

* Presents a comprehensive and easy-to-read introduction to text mining

* Explores the application and utility of the methods, as well as the optimal techniques for specific scenarios

* Provides several descriptive case studies that take readers from problem description to system deployment in the real world

* Uses methods that rely on basic statistical techniques, thus allowing for relevance to all languages (not just English)

* Includes access to downloadable software (runs on any computer), as well as useful chapter-ending historical and bibliographical remarks, a detailed bibliography, and subject and author indexes

This authoritative and highly accessible text, written by a team of authorities on text mining, develops the foundation concepts, principles, and methods needed to expand beyond structured, numeric data to automated mining of text samples. Researchers, computer scientists, and advanced undergraduates and graduates with work and interests in data mining, machine learning, databases, and computational linguistics will find the work an essential resource.

Keywords

Active learning Clustering and matching Document classification and correction Extraction Retrieval Summarization classification clustering data mining information retrieval text mining

Authors and affiliations

  • Sholom M. Weiss
    • 1
  • Nitin Indurkhya
    • 2
  • Tong Zhang
    • 1
  • Fred J. Damerau
    • 1
  1. 1.TJ Watson LabsIBM ResearchYorktown HeightsUSA
  2. 2.School of Computer Science and EngineeringUniversity of New South WalesSydneyAustralia

Bibliographic information

  • DOI https://doi.org/10.1007/978-0-387-34555-0
  • Copyright Information Springer-Verlag New York 2005
  • Publisher Name Springer, New York, NY
  • eBook Packages Computer Science
  • Print ISBN 978-0-387-95433-2
  • Online ISBN 978-0-387-34555-0
  • Buy this book on publisher's site
Industry Sectors
Pharma
Automotive
Biotechnology
Finance, Business & Banking
Electronics
IT & Software
Telecommunications
Law
Consumer Packaged Goods
Energy, Utilities & Environment
Aerospace
Engineering