Abstract
Epidemics of infectious diseases are usually recognized by an observation of an abnormal cluster of cases. Usually, the recognition is not automated, and relies on the alertness of human health care workers. This can lead to significant delays in detection. Since real-time data from the physicians’ offices is not available. However, in Finland a Web-based collection of guidelines for primary care exists, and increases in queries concerning certain disease have been shown to correlate to epidemics. We introduce a simple method for automated online mining of probable epidemics from the log of this database. The method is based on deriving a smoothed time series from the data, on using a flexible selection of data for comparison, and on applying randomization statistics to estimate the significance of findings. Experimental results on simulated and real data show that the method can provide accurate and early detection of epidemics.
Chapter PDF
References
Physician’s Desk Reference and Database (in Finnish). Kustannus Oy Duodecim. Yearly revised edition (14.3. 2002), http://www.duodecim.fi/kustannus/cdrom/index.html
Jousimaa, J., Kunnamo, I.: Physicians’ patterns of using a computerized collection of guidelines of primary health care. Int. J. Technol. Assess Health Care 14, 484–493 (1998)
Jormanainen, V., Jousimaa, J., Kunnamo, I., Ruutu, P.: Physicians’ Database Searches as a Tool for Early Detection of Epidemics. Emerging Infectious Diseases 7(3) (2001)
Diekman, O., Heesterbek, J.A.P.: Mathematical Epidemiology of Infectious Diseases. Wiley Series in Mathematical and Computational Biology. John Wiley & Sons Ltd, Chichester (2000)
Farmer, R.D.T., Miller, D.L., Lawrenson, R.: Lecture Notes on Epidemiology and Public Health Medicine. Blackwell Scientific, Oxford (1996)
Manly, B.F.J.: Randomization, Bootstrap and Monte Carlo Methods in Biology, 2nd edn. Texts in Statistical Science Series, vol. 41. CRC Press, Boca Raton (1997)
Piatetsky-Shapiro, G., Matheus, C.J.: The Interestingness of Deviations. In: AAAI-94 Knowledge Discovery in Databases Workshop (1994)
Stern, L., Lightfoot, D.: Automated Outbreak Detection: a quantitative retrospective analysis. Epidemiol. Infect. 122, 103–110 (1999)
Salmenkivi. S., Computational Methods for Intensity Models. Department of Computer Science. University of Helsinki. Series of Publications A-2001-2 (2001)
Ogden, S., Sugiura, N.: Testing Change-points with Linear Trend. Communications in Statistics B: Simulation and Computation 23, 287–322 (1994)
Keogh, E., et al.: An Online Algorithm for Segmenting Time Series. In: IEEE International Conference on Data Mining (2001)
Guralnik, V., Srivastava, J.: Event Detection from Time Series Data. In: Proc. Fifth ACM SIGKDD (1999)
Last, M., Klein, Y., Kandel, A.: Knowledge discovery in Time Series Databases. Correspondence. IEEE Transactions on Systems, Man and Cybernetics - Part B: Cybernetics (2001)
Das, G., Lin, K., Mannila, H., Renganathan, G., Smyth, P.: Rule discovery from time series. In: Proc. KDD 1998, pp. 16–22 (1998)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Heino, J., Toivonen, H. (2003). Automated Detection of Epidemics from the Usage Logs of a Physicians’ Reference Database. In: Lavrač, N., Gamberger, D., Todorovski, L., Blockeel, H. (eds) Knowledge Discovery in Databases: PKDD 2003. PKDD 2003. Lecture Notes in Computer Science(), vol 2838. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-39804-2_18
Download citation
DOI: https://doi.org/10.1007/978-3-540-39804-2_18
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-20085-7
Online ISBN: 978-3-540-39804-2
eBook Packages: Springer Book Archive