Skip to main content

Predicting Stock Trends with Time Series Data Mining and Web Content Mining

  • Chapter

Part of the book series: Studies in Computational Intelligence ((SCI,volume 23))

Summary

This paper presents a new methodology for predicting stock trends and making trading decisions based on the combination of Data Mining and Web Content Mining techniques. While research in both areas is quite extensive, inference from time series stock data and time-stamped news stories collected from the World Wide Web require further exploration. Our prediction models are based on the content of time-stamped web documents in addition to traditional Numerical Time Series Data. The stock trading system based on the proposed methodology (ADMIRAL) will be simulated and evaluated on real-world series of news stories and stocks data using several known classification algorithms. The main performance measures will be the prediction accuracy of the induced models and, more importantly, the profitability of the investments made by using system recommendations based on these predictions.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD   169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Bodie Z, Kane A, Marcus AJ (2001) Investments, 4th Edition. McGraw Hill.

    Google Scholar 

  2. Cooley R, Mobasher B, Srivastava J (1997) Web Mining: Information and Pat tern Discovery on the World Wide Web. Proceedings of the IEEE International Conference on Tools with Artificial Intelligence (ICTAI’97), Newport Beach, CA.

    Google Scholar 

  3. Extractor DBI technologies (2003) [http://www.dbi-tech.com].

    Google Scholar 

  4. Fama EF (1970) Efficient Capital Markets: A Review of Theory and Empirical Work, Journal of Finance 25: 383–417.

    Article  Google Scholar 

  5. Fama EF (1991) Efficient Capital Markets: II, Journal of Finance 46: 1575–1617.

    Article  Google Scholar 

  6. Gidófalvi G (2001) Using News Articles to Predict Stock Price Movement, Online at: [http://citeseer.nj.nec.com/517027.html] or [http://www.cs.aau.dk/~gyg/docs/fmancial-prediction.pdf].

    Google Scholar 

  7. Huagen RA (1995) The New Finance: The Case Against Efficient Markets. Prentice-Hall.

    Google Scholar 

  8. Jain AK, Murty MN, Flynn PJ (1999) Data Clustering: A Review, ACM Computing Surveys, Vol 31 No. 3.

    Google Scholar 

  9. Kaboudan MA (2000) Genetic Programming Prediction of Stock Prices, Computational Economics 16: 207–236.

    Article  MATH  Google Scholar 

  10. Kosala R, Blockeel H, Web Mining Research: A Survey, SIGKDD Explorations, Vol 2, Issue 1.

    Google Scholar 

  11. Landry R Jr., Debreceny R, Grey GL (2004) Grab Your Picks and Shovels! There’s Gold in Your Data, Strategic Finance, January (85, 7).

    Google Scholar 

  12. Last M, Klein Y, Kandel A (2001) Knowledge Discovery in Time Series Databases, IEEE Transactions on Systems, Man and Cybernetics — Part B: Cybernetics, Vol 31 No. 1.

    Google Scholar 

  13. Lavrenko V, Schmill M, Lawrie D, Ogilvie P, Jensen D, Allan J (2000) Language Models for Financial News Recommendation, CIKM 2000, McLean, VA USA, ACM 2000.

    Google Scholar 

  14. Maimon O, Kandel A, Last M (2000) Knowledge Discovery and Data Mining, The Info-Fuzzy Network (IFN) Methodology, Norwell, MA: Kluwer.

    Google Scholar 

  15. Peramunetilleke D, Wong RK (2002) Currency Exchange Rate Forecasting from News Headlines, Thirteenth Australasian Database Conference (ADC2002), Melbourne, Australia, Conferences in Information Technology, Vol 5.

    Google Scholar 

  16. Quinlan JR (1993) C4.5: Programs for Machine Learning, Morgan Kaufman Publishers Inc., San Francisco, CA.

    Google Scholar 

  17. Wuthrich B, Cho V, Leung S, Permunetilleke D, Sankaran K, Zhang J, Lam W (1998) Daily Stock Market Forecast from Textual Web Data, IEEE International Conference on Systems, Man. and Cybernetics, Vol 3: 2720–2725.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Rachlin, G., Last, M. (2006). Predicting Stock Trends with Time Series Data Mining and Web Content Mining. In: Last, M., Szczepaniak, P.S., Volkovich, Z., Kandel, A. (eds) Advances in Web Intelligence and Data Mining. Studies in Computational Intelligence, vol 23. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-33880-2_19

Download citation

  • DOI: https://doi.org/10.1007/3-540-33880-2_19

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-33879-6

  • Online ISBN: 978-3-540-33880-2

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics