  • Book
  • © 2011

Data Mining with Rattle and R

The Art of Excavating Data for Knowledge Discovery

Authors: Graham Williams

  • Encourages the concept of programming with data: not just pushing data through tools, but learning to live and breathe the data

  • Accessible to many readers and not necessarily just those with strong backgrounds in computer science or statistics

  • Details some of the more popular algorithms for data mining, as well as covering model evaluation and model deployment

  • Includes supplementary material: sn.pub/extras

Part of the book series: Use R! (USE R)

Buying options

eBook USD 79.99
Price excludes VAT (USA)
  • ISBN: 978-1-4419-9890-3
  • Instant PDF download
  • Readable on all devices
  • Own it forever
  • Exclusive offer for individuals only
  • Tax calculation will be finalised during checkout
Softcover Book USD 99.99
Price excludes VAT (USA)


Table of contents (18 chapters)

  1. Front Matter

    Pages i-xx
  2. Explorations

    1. Front Matter

      Pages 1-1
    2. Introduction

      • Graham Williams
      Pages 3-19
    3. Getting Started

      • Graham Williams
      Pages 21-55
    4. Working with Data

      • Graham Williams
      Pages 57-74
    5. Loading Data

      • Graham Williams
      Pages 75-98
    6. Exploring Data

      • Graham Williams
      Pages 99-136
    7. Interactive Graphics

      • Graham Williams
      Pages 137-148
    8. Transforming Data

      • Graham Williams
      Pages 149-168
  3. Building Models

    1. Front Matter

      Pages 169-169
    2. Descriptive and Predictive Analytics

      • Graham Williams
      Pages 171-177
    3. Cluster Analysis

      • Graham Williams
      Pages 179-192
    4. Association Analysis

      • Graham Williams
      Pages 193-203
    5. Decision Trees

      • Graham Williams
      Pages 205-244
    6. Random Forests

      • Graham Williams
      Pages 245-268
    7. Boosting

      • Graham Williams
      Pages 269-291
    8. Support Vector Machines

      • Graham Williams
      Pages 293-304
  4. Delivering Performance

    1. Front Matter

      Pages 305-305
    2. Model Performance Evaluation

      • Graham Williams
      Pages 307-321
    3. Deployment

      • Graham Williams
      Pages 323-327

About this book

Data mining is the art and science of intelligent data analysis. By building knowledge from information, data mining adds considerable value to the ever-increasing stores of electronic data that abound today. In performing data mining, many decisions need to be made regarding the choice of methodology, the choice of data, the choice of tools, and the choice of algorithms.

Throughout this book, the reader is introduced to the basic concepts and some of the more popular algorithms of data mining. With a focus on the hands-on, end-to-end process for data mining, Williams guides the reader through various capabilities of the easy-to-use, free, and open-source Rattle Data Mining Software built on the sophisticated R Statistical Software. The focus on doing data mining rather than just reading about data mining is refreshing.

The book covers data understanding, data preparation, data refinement, model building, model evaluation, and practical deployment. The reader will learn to rapidly deliver a data mining project using software easily installed for free from the Internet. Coupling Rattle with R delivers a very sophisticated data mining environment with all the power, and more, of the many commercial offerings.

Keywords

  • Data mining
  • R applications
  • Rattle software
  • analytics
  • data exploration
  • graphical user interfaces
  • machine learning
  • model building

Reviews

From the book reviews:

“The text does a great job of showing how to do each step using the data mining tool Rattle and related R concepts as appropriate. This makes it a great tool for someone who does not know much about R and wants to learn more about the powerful options available in R for data mining.” (Roger M. Sauter, Technometrics, Vol. 54 (3), August, 2012)

“This text is a manual for the impressive Rattle graphical user interface (GUI) for R, describing both the use of the GUI and the R code that is invoked to carry out the computations. … Data analysts … are likely to find Rattle a helpful tool that will allow them to quickly become productive with R. … There is extensive useful practical advice on data preparation and data manipulation. … is well suited for use in intermediate level courses on regression or classification.” (John H. Maindonald, International Statistical Review, Vol. 80 (1), 2012)

Authors and Affiliations

  • Togaware Pty Ltd, Jamison Centre, Australia

    Graham Williams

About the author

Dr Graham Williams is Senior Director of Analytics with the Australian Taxation Office, and previously Principal Computer Scientist for Data Mining with CSIRO. He is also Visiting Professor and Senior International Scientist with the Shenzhen Institutes of Advanced Technology of the Chinese Academy of Sciences, Adjunct Professor, Data Mining, Fraud Prevention, Security, University of Canberra, and Adjunct Professor, Australian National University. Graham regularly teaches data mining courses and is author of the freely available, open source data mining system, Rattle. He has been involved in many data mining projects for clients from government and industry over his long career. His research developments included ensemble learning (1980s) and hot spots discovery (1990s). He is actively involved in the international artificial intelligence and data mining research communities, particularly as chair of the Pacific Asia Knowledge Discovery and Data Mining conference series and founder and co-chair of the Australasian Data Mining conference series. Graham has edited a number of books and authored many academic and industry papers and reports. His current focus is on making data mining technology readily accessible, ensuring research, innovation and discovery are repeatable and available, and encouraging the free and open sharing of knowledge.
