Learning Stochastic Tree Edit Distance

Bernard, Marc; Habrard, Amaury; Sebban, Marc

doi:10.1007/11871842_9

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 4212)

Included in the following conference series:

European Conference on Machine Learning

Abstract

Trees provide a suitable structural representation for complex tasks such as web information extraction, RNA secondary structure prediction, or the conversion of tree-structured documents. In this context, many applications require computing similarities between pairs of trees. The most studied measure is probably the tree edit distance (ED), whose computational complexity has been improved over the last decade. However, the classic ED usually relies on edit costs fixed a priori, which are often difficult to tune and leave little room for tackling complex problems. In this paper, we focus on learning a stochastic tree ED. We use an adaptation of the Expectation-Maximization algorithm to learn the primitive edit costs. We carried out a series of experiments that confirm the benefit of learning a tree ED rather than imposing edit costs a priori.
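To make the role of the primitive edit costs concrete, here is a minimal sketch of a tree ED with costs supplied by the caller. It assumes a simple top-down variant in which whole subtrees are inserted or deleted; the abstract does not say which ED variant the authors learn, and the names `Node`, `subtree_cost`, and `tree_edit_distance` are illustrative only. The sketch uses hand-fixed costs and does not implement the EM-based learning described above.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class Node:
    """An ordered, labeled tree node (illustrative type, not from the paper)."""
    label: str
    children: List["Node"] = field(default_factory=list)


def subtree_cost(t: Node, unit: Callable[[str], float]) -> float:
    """Cost of inserting or deleting a whole subtree, summed node by node."""
    return unit(t.label) + sum(subtree_cost(c, unit) for c in t.children)


def tree_edit_distance(
    a: Node,
    b: Node,
    sub: Callable[[str, str], float],
    dele: Callable[[str], float],
    ins: Callable[[str], float],
) -> float:
    """Top-down tree ED: relabel the roots, then align the child sequences."""
    m, n = len(a.children), len(b.children)
    # d[i][j] = cost of turning the first i children of a into the first j of b
    d = [[0.0] * (n + 1) for _ in range(m + 1)]
    for i in range(1, m + 1):
        d[i][0] = d[i - 1][0] + subtree_cost(a.children[i - 1], dele)
    for j in range(1, n + 1):
        d[0][j] = d[0][j - 1] + subtree_cost(b.children[j - 1], ins)
    for i in range(1, m + 1):
        for j in range(1, n + 1):
            ca, cb = a.children[i - 1], b.children[j - 1]
            d[i][j] = min(
                d[i - 1][j] + subtree_cost(ca, dele),   # delete subtree ca
                d[i][j - 1] + subtree_cost(cb, ins),    # insert subtree cb
                d[i - 1][j - 1] + tree_edit_distance(ca, cb, sub, dele, ins),
            )
    return sub(a.label, b.label) + d[m][n]


# Two small trees that differ in one leaf label: a(b, c) versus a(b, d).
t1 = Node("a", [Node("b"), Node("c")])
t2 = Node("a", [Node("b"), Node("d")])


def unit(_label: str) -> float:
    return 1.0


def unit_sub(x: str, y: str) -> float:
    return 0.0 if x == y else 1.0


def costly_sub(x: str, y: str) -> float:
    return 0.0 if x == y else 3.0


print(tree_edit_distance(t1, t2, unit_sub, unit, unit))    # 1.0 (relabel c as d)
print(tree_edit_distance(t1, t2, costly_sub, unit, unit))  # 2.0 (delete c, insert d)
```

The same pair of trees gets a different distance, and a different optimal edit script, once the substitution cost changes. This sensitivity is what makes hand-tuned costs hard to set and motivates estimating them from data.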

This work is part of the ongoing ARA Marmota research project.

Author information

Authors and Affiliations

EURISE, Université Jean Monnet de Saint-Etienne, 23, rue Paul Michelon, 42023 cedex 2, Saint-Etienne, France
Marc Bernard & Marc Sebban
LIF, Université de Provence, 39, rue Frédéric Joliot Curie, 13453 cedex 13, Marseille, France
Amaury Habrard

Editor information

Editors and Affiliations

Knowledge Engineering Group, Technische Universität Darmstadt,
Johannes Fürnkranz
Max Planck Institute for Computer Science, Saarbrücken, Germany
Tobias Scheffer
Faculty of Computer Science, Otto-von-Guericke-University Magdeburg, Germany
Myra Spiliopoulou

About this paper

Cite this paper

Bernard, M., Habrard, A., Sebban, M. (2006). Learning Stochastic Tree Edit Distance. In: Fürnkranz, J., Scheffer, T., Spiliopoulou, M. (eds) Machine Learning: ECML 2006. ECML 2006. Lecture Notes in Computer Science (LNAI), vol 4212. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11871842_9

DOI: https://doi.org/10.1007/11871842_9
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-45375-8
Online ISBN: 978-3-540-46056-5
eBook Packages: Computer Science, Computer Science (R0)
