Abstract
A finite population of points is generated on the real line by a Poisson process. From each point an arrow is drawn to its nearest neighbor (NN). In this way clusters of points are generated which are connected by arrows. (1999) studied the distribution of the number (M) of NN clusters. Because too small or too large values of M may indicate deviations from the null hypothesis of no clustering, tests for the detection of clusters can be based on the distribution of M. Such tests are of interest in epidemiology if one wants to know whether the times of occurrence of a certain disease cluster. We observe that the problem formulated (1999) is equivalent to the problem of deriving the distribution of the number of peaks in randomization problems which was studied by (1962) and others. Because the distribution of M has to be calculated by means of a recurrence equation, which makes the performance of the tests mentioned above difficult, we propose some bounds for the P -values and compare these with the exact distribution and a normal approximation.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
ABRAMOWITZ, M. and STEGUN, I. A. (1972): Handbook of Mathematical Functions with Formulas, Graphs, and Mathematical Tables. Dover Publications, New York
DAVID, F. N. and BARTON, D. E. (1962): Combinatorial Chance. Charles Griffin, London
DAVID, F. N., KENDALL, M. G. and BARTON, D. E. (1966): Symmetric Function and Allied Tables. University Press, Cambridge
ENNS, E. G., EHLERS, P. F. and MISI, T. (1999): A Cluster Problem as Defined by Nearest Neighbours. Canadian Journal of Statistics, 27, 843–851.
GOULD, H. W. (1972): Combinatorial Identities. A Standardized Set of Tables Listing 500 Binomial Coefficient Summations. Morgantown
KNUTH, D. E. and BUCKHOLTZ, T.J. (1967): Computation of Tangent, Euler, and Bernoulli Numbers. Mathematics of Computation, 21, 663–688.
KRAUTH, J. (1998): Upper Bounds for the P-Values of a Scan Statistic with a Variable Window, in Balderjahn, Mathar, and Schader (Eds.): Classification, Data Anal ysis, and Data Highways. Springer, Berlin, Heidelberg, New York, 155–163
NAGARWALLA, N. (1996): A Scan Statistic with Variable Window. Statistics in Medicine, 15, 799–810.
WARREN, D. and SENETA, E. (1996): Peaks and Eul erian Numbers in a Random Sequence. Journal of Applied Probability, 33, 101–114.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Krauth, J. (2003). Tests for One-dimensional Nearest Neighbors Clusters. In: Schwaiger, M., Opitz, O. (eds) Exploratory Data Analysis in Empirical Research. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-55721-7_13
Download citation
DOI: https://doi.org/10.1007/978-3-642-55721-7_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44183-0
Online ISBN: 978-3-642-55721-7
eBook Packages: Springer Book Archive