Smoothing by Local Regression: Principles and Methods

Cleveland, William S.; Loader, Clive

doi:10.1007/978-3-642-48425-4_2

William S. Cleveland³ &
Clive Loader³

Part of the book series: Contributions to Statistics ((CONTRIB.STAT.))

768 Accesses
150 Citations

Summary

Local regression is an old method for smoothing data, having origins in the graduation of mortality data and the smoothing of time series in the late 19th century and the early 20th century. Still, new work in local regression continues at a rapid pace. We review the history of local regression. We discuss four of its basic components that must be chosen in using local regression in practice — the weight function, the parametric family that is fitted locally, the bandwidth, and the assumptions about the distribution of the response. A major theme of the paper is that these choices represent a modeling of the data; different data sets deserve different choices. We describe polynomial mixing, a method for enlarging polynomial parametric families. We introduce an approach to adaptive fitting,assessment of parametric localization. We describe the use of this approach to design two adaptive procedures: one automatically chooses the mixing degree of mixing polynomials at each x using cross-validation, and the other chooses the bandwidth at each x using C _p. Finally, we comment on the efficacy of using asymptotics to provide guidance for methods of local regression.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Brillinger, D. (1977). Discussion of a paper of Stone. Ann. Statist5, 622–623.
Google Scholar
Cleveland, W. S. (1979). Robust locally weighted regression and smoothing scatterplots. J. Amer. Statist. Assn. 74, 829–836.
Google Scholar
Cleveland, W. S. (1993). Visualizing Data. Hobart Press, Summit, NJ, http://bookshobart.com
Google Scholar
Cleveland, W. S. and Devlin, S. J. (1988). Locally weighted regression: an approach to regression analysis by local fitting. J. Amer. Statist. Assn. 83, 596–610.
Article Google Scholar
Cleveland, W. S., Hastie, T., and Loader, C. (1995). Adaptive local regression by automatic selection of the polynomial mixing degree. In preparation.
Google Scholar
Cleveland, W. S. and Grosse, E. H. (1991). Computational methods for local regression.Statist, and Computing1, 47–62.
Article Google Scholar
Cleveland, W. S. and Grosse, E. H. and Shyu, M. J. (1992). Local regression models. Statistical Models in S, J. M. Chambers and T. Hastie, editors, pages 309–376. Chapman and Hall, New York.
Google Scholar
Cleveland, W. S. and Loader, C. (1995). Computational methods for local regression from the 19th century to the present. In preparation.
Google Scholar
Cox, D. R. (1958). Some problems connected with statistical inference. Ann. Math. Statist. 29, 357–372.
Article Google Scholar
Daniel, C. and Wood, F. (1971). Fitting Equations to Data. Wiley, New York.
Google Scholar
DeForest, E. L. (1873). On some methods of interpolation applicable to the graduation of irregular series. Annual Report of the Board of Regents of the Smithsonian Institution for 1871, 275–339.
Google Scholar
DeForest, E. L. (1874). Additions to a memoir on methods of interpolation applicable to the graduation of irregular series. Annual Report of the Board of Regents of the Smithsonian Institution for 1873, 319–353.
Google Scholar
Donoho, D. and Johnstone, I. (1994). Ideal spatial adaptation by wavelet shrinkage. Biometrika81, 425–455.
Article Google Scholar
Fan, J. (1993). Local linear regression smoothers and their minimax efficiencies. Ann. Statist. 21, 196–216.
Article Google Scholar
Fan, J. and Gijbels, I. (1992). Variable bandwidth and local linear regression smoothers. Ann. Statist. 20, 2008–2036.
Article Google Scholar
Fan, J. and Gijbels, I. (1994a). Censored regression: local linear approximations and their applications. J. Amer. Statist. Assn. 89, 560–570.
Article Google Scholar
Fan, J. and Gijbels, I. (1994b). Adaptive order polynomial fitting: bandwidth robustification and bias reduction. Unpublished manuscript.
Google Scholar
Fan, J. and Gijbels, I. (1995). Data-driven bandwidth selection in local polynomial fitting: variable bandwidth and spatial adaptation. J. Royal Statist. Soc, Ser. B. 57, 371–394.
Google Scholar
Friedman, J. H. Multivariate adaptive regression splines (1991). Ann. Statist. 19, 1–141.
Article Google Scholar
Friedman, J. H. and Stuetzle, W. (1981). Projection pursuit regression. J. Amer. Statist. Assn. 76, 817–823.
Article Google Scholar
Friedman, J. H. and Stuetzle, W. (1982). Smoothing of scatterplots. Technical Report Orion 3, Dept. Statistics, Stanford University.
Google Scholar
Gasser, Th. and Jennen-Steinmetz, C. (1988). A unifying approach to nonparametric regression estimation. J. Amer. Statist. Assn. 83, 1084–1089.
Article Google Scholar
Gram, J. P. (1883). Uber Entwickelung reeller Functionen in Reihen mittelst der Methode der kleinsten Quadrate. J. Math. 94, 41–73.
Google Scholar
Hall, P., Sheather, S. J., Jones, M. C. and Marron, J. S. (1991). On optimal data-based bandwidth selection in kernel density estimation. Biometrika78, 263–269.
Article Google Scholar
Hardle, W. (1990). Applied Nonparametric Regression. Oxford University Press, Oxford.
Google Scholar
Hastie, T. and Loader, C. (1993). Local regression: automatic kernel carpentry (with discussion). Statist. Science8, 120–143.
Article Google Scholar
Hastie, T. J. and Tibshirani, R. J. (1990). Generalized Additive Models. Chapman and Hall, London.
Google Scholar
Henderson, R. (1916). Note on graduation by adjusted average.Actuarial Soc. Amer. 17, 43–48.
Google Scholar
Henderson, R. (1924). A new method of graduation. Actuarial Soc. Amer. 25, 29–39.
Google Scholar
Hoem, J. M. (1983). The reticent trio: some little-known early discoveries in life insurance mathematics by L. H. F. Oppermann, T. N. Thiele and J. P. Gram. Inter. Stat. Rev. 51, 213–221.
Article Google Scholar
Hjort, N. L. (1994). Local Bayesian regression. Unpublished manuscript.
Google Scholar
Hjort, N. L. and Jones, M. C. (1994). Locally parametric nonparametric density estimation. Unpublished manuscript.
Google Scholar
Katkovnik, V. Ya. (1979). Linear and nonlinear methods of nonparametric regression analysis. Soviet Automatic Control 5, 25–34.
Google Scholar
Kendall, M. G. (1973). Time Series. Oxford University Press, Oxford.
Google Scholar
Kendall, M. G. and Stuart A. (1976). The Advanced Theory of Statistics, Vol. 3. Hafner, New York.
Google Scholar
Lancaster, P. and Salkauskas, K. (1981). Surfaces generated by moving least squares methods. Mathematics of Computation37, 141–158.
Article Google Scholar
Lancaster, P. and Salkauskas, K. (1986). Curve and Surface Fitting: An Introduction. Academic Press: London.
Google Scholar
Loader, C. (1993). Change point estimation using local regression. Unpublished manuscript, available by ftp from http://netlib.att.com.
Google Scholar
Loader, C.(1994). Computing nonparametric function estimates. Computing Science and Statistics: Proceedings of the 26th Symposium on the interface, 356–361.
Google Scholar
Loader, C. (1995). Local likelihood density estimation. Ann. Statist., to appear.
Google Scholar
Macaulay, F. R. (1931). The Smoothing of Time Series. National Bureau of Economic Research, New York.
Google Scholar
Mallows, C. (1973). Some comments onC_p. Technometrics15, 661–675.
Google Scholar
Mallows, C. (1974). Discussion of a paper of Beaton and Tukey.Technometrics16, 187–188.
Google Scholar
Mcdonald, J. A. and Owen, A. B. (1986). Smoothing with split linear fits. Technometrics28, 195–208.
Article Google Scholar
Müller, H.-G. (1984). Smooth optimum kernel estimators of densities, regression curves and modes. Ann. Statist. 12, 766–774.
Article Google Scholar
Müller, H.-G. (1987). Weighted local regression and kernel methods for nonparametric curve fitting. J. Amer. Statist. Assn. 82, 231–238.
Article Google Scholar
Nadaraya, E. A. (1964). On estimating regression. Theor. Probab. Appl9, 141–142.
Article Google Scholar
Parzen, E. (1962). On estimation of a probability density function and mode. Ann. Math. Statist. 33, 1065–1076.
Article Google Scholar
Rosenblatt, M. (1956). Remarks on some nonparametric estimates of a density function. Ann. Math. Statist. 27, 832–837.
Article Google Scholar
Rosenblatt, M. (1971). Curve estimates. Ann. Math. Statist. 42, 1815–1842.
Article Google Scholar
Ruppert, D. and Wand, M. P. (1992). Multivariate locally weighted least squares regression. Ann. Statist. 22, No. 3.
Google Scholar
Shishkin, J, Young, A. H., and Musgrave, J. C. (1967). The X-11 variant of the Census Method II seasonal adjustment program. Technical Paper 15, U.S. Bureau of the Census.
Google Scholar
Speckman, P. (1995). Discussion of a paper of Donoho et al. J. Royal Statist. Soc, Ser. B. 57, 337–338.
Google Scholar
Spencer, J. (1904a). On the graduation of the rates of sickness and mortality. J. Inst. Act38, 334–347.
Google Scholar
Spencer, J. (1904b). Graduation of a sickness table by Makeham’s hypothesis. Biometrika3, 52–57.
Article Google Scholar
Staniswalis, J. (1988). The kernel estimate of a regression function in likelihood-based models. J. Amer. Statist. Assn. 84, 276–283.
Article Google Scholar
Stigler, S. M. (1978). Mathematical statistics in the early States.Ann. Statist6, 239–265.
Article Google Scholar
Stoker, T. M. (1993). Smoothing bias in density derivative estimation. J. Amer. Statist. Assn. 88, 855–863.
Article Google Scholar
Stone, C. J. (1977). Consistent nonparametric regression (with discussion). Ann. Statist. 5, 595–620.
Article Google Scholar
Stone, C. J. (1980). Optimal rates of convergence for nonparametric estimators. Ann. Statist. 8, 1348–1360.
Article Google Scholar
Stone, M. (1974). Cross-validatory choice of assessment of statistical predictions (with discussion). J. R. Statist. Soc. B36, 111–47.
Google Scholar
Tibshirani, R. and Hastie, T. (1987). Local likelihood estimation. J. Amer. Statist. Assn. 82, 559–567.
Article Google Scholar
Tsybakov, A. B. (1986). Robust reconstruction of functions by the local-approximation method. Prob. Inform. Trans. 22, 69–84.
Google Scholar
Watson, G. S. (1964). Smooth regression analysis. Sankhya Ser. A 26, 359–372.
Google Scholar
Wahba, G. (1990). Spline Functions for Observational Data. SIAM, Philadelphia.
Google Scholar
Whittaker, E. T. (1923). On a new method of graduation. Proc. Edinburgh Math. Soc. 41, 63–75.
Google Scholar
Woolhouse, W. S. B. (1870). Explanation of a new method of adjusting mortality tables, with some observations upon Mr. Makeham’s modification of Gompertz’s theory. J. Inst. Act. 15, 389–410.
Google Scholar

Download references

Author information

Authors and Affiliations

AT&T Bell Laboratories, 600 Mountain Avenue, 07974, Murray Hill, NJ, USA
William S. Cleveland & Clive Loader

Authors

William S. Cleveland
View author publications
You can also search for this author in PubMed Google Scholar
Clive Loader
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Center for Computational Statistics Institut für Statistik und Ökonometrie Wirtschaftswissenschaftliche Fakultät, Humboldt-Universität zu Berlin, Spandauer straße 1, D-10178, Berlin, Germany
Wolfgang Härdle
Medical Biometrics Group, University of Graz Medical Schools, Auenbruggerplatz 30/IV, A-8036, Graz, Austria
Michael G. Schimek

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Cleveland, W.S., Loader, C. (1996). Smoothing by Local Regression: Principles and Methods. In: Härdle, W., Schimek, M.G. (eds) Statistical Theory and Computational Aspects of Smoothing. Contributions to Statistics. Physica-Verlag HD. https://doi.org/10.1007/978-3-642-48425-4_2

Download citation

DOI: https://doi.org/10.1007/978-3-642-48425-4_2
Publisher Name: Physica-Verlag HD
Print ISBN: 978-3-7908-0930-5
Online ISBN: 978-3-642-48425-4
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics