Transformation Invariance in Pattern Recognition — Tangent Distance and Tangent Propagation

Simard, Patrice Y.; LeCun, Yann A.; Denker, John S.; Victorri, Bernard

doi:10.1007/3-540-49430-8_13

Patrice Y. Simard⁶,
Yann A. LeCun⁶,
John S. Denker⁶ &
…
Bernard Victorri⁷

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1524))

5638 Accesses
113 Citations
1 Altmetric

Abstract

In pattern recognition, statistical modeling, or regression, the amount of data is a critical factor a.ecting the performance. If the amount of data and computational resources are unlimited, even trivial algorithms will converge to the optimal solution. However, in the practical case, given limited data and other resources, satisfactory performance requires sophisticated methods to regularize the problem by introducing a priori knowledge. Invariance of the output with respect to certain transformations of the input is a typical example of such a priori knowledge. In this chapter, we introduce the concept of tangent vectors, which compactly represent the essence of these transformation invariances, and two classes of algorithms, “tangent distance” and “tangent propagation”, which make use of these invariances to improve performance.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 74.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

P. A. Devijver and J. Kittler. Pattern Recognition, A Statistical Approache. Prentice Hall, Englewood Cliffs, 1982.
Google Scholar
A. V. Aho, J. E. Hopcroft, and J. D. Ullman. Data Structure and Algorithms. Addison-Wesley, 1983.
Google Scholar
L. Bottou and V. N. Vapnik. Local learning algorithms. Neural Computation, 4(6):888–900, 1992.
Article Google Scholar
A. J. Broder. Strategies for e.cient incremental nearest neighbor search. Pattern Recognition, 23:171–178, 1990.
Article Google Scholar
D. S. Broomhead and D. Lowe. Multivariable functional interpolation and adaptive networks. Complex Systems, 2:321–355, 1988.
MATH MathSciNet Google Scholar
Y. Choquet-Bruhat, C. DeWitt-Morette, and M. Dillard-Bleick. Analysis, Manifolds and Physics. North-Holland, Amsterdam, Oxford, New York, Tokyo, 1982.
MATH Google Scholar
C. Cortes and V. Vapnik. Support vector networks. Machine Learning, 20:273–297, 1995.
MATH Google Scholar
B. V. Dasarathy. Nearest Neighbor (NN) Norms: NN Pattern classi.cation Techniques. IEEE Computer Society Press, Los Alamitos, California, 1991.
Google Scholar
H. Drucker, R. Schapire, and P. Y. Simard. Boosting performance in neural networks. International Journal of Pattern Recognition and Artificial Intelligence, 7, No. 4:705–719, 1993.
Article Google Scholar
K. Fukunaga and T. E. Flick. An optimal global nearest neighbor metric. IEEE transactions on Pattern analysis and Machine Intelligence, 6, No. 3:314–318, 1984.
Article MATH Google Scholar
R. Gilmore. Lie Groups, Lie Algebras and some of their Applications. Wiley, New York, 1974.
MATH Google Scholar
T. Hastie, E. Kishon, M. Clark, and J. Fan. A model for signature verification. Technical Report 11214-910715-07TM, AT&T Bell Laboratories, July 1991.
Google Scholar
T. Hastie and P. Y. Simard. Metrics and models for handwritten character recognition. Statistical Science, 13, 1998.
Google Scholar
T. J. Hastie and R. J. Tibshirani. Generalized Linear Models. Chapman and Hall, London, 1990.
Google Scholar
G. E. Hinton, C. K. I. Williams, and M. D. Revow. Adaptive elastic models for hand-printed character recognition. In Advances in Neural Information Processing Systems, pages 512–519. Morgan Kaufmann Publishers, 1992.
Google Scholar
A. E. Hoerl and R. W. Kennard. Ridge regression: Biased estimation for nonorthogonal problems. Technometrics, 12:55–67, 1970.
Article MATH Google Scholar
T. Kohonen. Self-organization and associative memory. In Springer Series in Information Sciences, volume 8. Springer-Verlag, 1984.
Google Scholar
Y. Le Cun, B. Boser, J. S. Denker, D. Henderson, R. E. Howard, W. Hubbard, and L. D. Jackel. Handwritten digit recognition with a back-propagation network. In David Touretzky, editor, Advances in Neural Information Processing Systems, volume 2, (Denver, 1989), 1990. Morgan Kaufman.
Google Scholar
Y. LeCun. Generalization and networkdesign strategies. In R. Pfeifer, Z. Schreter, F. Fogelman, and L. Steels, editors, Connectionism in Perspective, Zurich, Switzerland, 1989. Elsevier. an extended version was published as a technical report of the University of Toronto.
Google Scholar
Y. LeCun, L. D. Jackel, L. Bottou, C. Cortes, J. S. Denker, H. Drucker, I. Guyon, U. A. Muller, E. Sackinger, P. Simard, and V. Vapnik. Learning algorithms for classification: A comparison on handwritten digit recognition. In J. H. Oh, C. Kwon, and S. Cho, editors, Neural Networks: The Statistical Mechanics Perspective, pages 261–276. World Scientific, 1995.
Google Scholar
E. Parzen. On estimation of a probability density function and mode. Ann. Math. Stat., 33:1065–1076, 1962.
Article MATH MathSciNet Google Scholar
W. H. Press, B. P. Flannery, Teukolsky S. A., and Vetterling W. T. Numerical Recipes. Cambridge University Press, Cambridge, 1988.
MATH Google Scholar
H. Schwenk. The diabolo classifier. Neural Computation, in press, 1998.
Google Scholar
R. Sibson. Studies in the robustness of multidimensional scaling: Procrustes statistices. J. R. Statist. Soc., 40:234–238, 1978.
MATH Google Scholar
P. Y. Simard. E.cient computation of complex distance metrics using hierarchical filtering. In Advances in Neural Information Processing Systems. Morgan Kaufmann Publishers, 1994.
Google Scholar
F. Sinden and G. Wilfong. On-line recognition of handwritten symbols. Technical Report 11228-910930-02IM, AT&T Bell Laboratories, June 1992.
Google Scholar
V. N. Vapnik. Estimation of dependences based on empirical data. Springer Verlag, 1982.
Google Scholar
V. N. Vapnikand A. Ya. Chervonenkis. On the uniform convergence of relative frequencies of events to their probabilities. Th. Prob. and its Applications, 17(2):264–280, 1971.
Article Google Scholar
N. Vasconcelos and A. Lippman. Multiresolution tangent distance for a.neinvariant classification. In Advances in Neural Information Processing Systems, volume 10, pages 843–849. Morgan Kaufmann Publishers, 1998.
Google Scholar
J. Voisin and P. Devijver. An application of the multiedit-condensing technique to the reference selection problem in a print recognition system. Pattern Recogntion, 20 No 5:465–474, 1987.
Article Google Scholar

Download references

Author information

Authors and Affiliations

Image Processing Services Research Lab, AT& T Labs - Research, 100 SchulzDrive, Red Bank, NJ, 07701-7033, USA
Patrice Y. Simard, Yann A. LeCun & John S. Denker
CNRS, ELSAP, ENS, 1 rue Maurice Arnoux, F-92120, Montrouge, France
Bernard Victorri

Authors

Patrice Y. Simard
View author publications
You can also search for this author in PubMed Google Scholar
Yann A. LeCun
View author publications
You can also search for this author in PubMed Google Scholar
John S. Denker
View author publications
You can also search for this author in PubMed Google Scholar
Bernard Victorri
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science, Willamette University, Salem, OR, 97301, USA
Genevieve B. Orr
GMD First (Forschungszentrum Informationstechnik), Rudower Chaussee 5, D-12489, Berlin, Germany
Klaus-Robert Müller

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Simard, P.Y., LeCun, Y.A., Denker, J.S., Victorri, B. (1998). Transformation Invariance in Pattern Recognition — Tangent Distance and Tangent Propagation. In: Orr, G.B., Müller, KR. (eds) Neural Networks: Tricks of the Trade. Lecture Notes in Computer Science, vol 1524. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-49430-8_13

Download citation

DOI: https://doi.org/10.1007/3-540-49430-8_13
Published: 28 March 2002
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-65311-0
Online ISBN: 978-3-540-49430-0
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics