Incremental Embedding Within a Dissimilarity-Based Framework
Structural pattern recognition methods based on strings or graphs provide a natural encoding of objects’ relationships but can usually be combined only with a few set of machine learning methods. This last decade has seen majors advancements aiming to link these two fields. The two majors research fields in this direction concern the design of new graph and string kernels and different explicit embedding schemes of structural data. Explicit embedding of structural data can be combined with any machine learning methods. Dissimilarity representation methods are important because they allow an explicit embedding and the connection with the kernel framework. However these methods require the whole universe to be known during the learning phase and to obtain a Euclidean embedding, the matrix of dissimilarity encoding dissimilarities between any pair of objects should be regularized. This last point somehow violates the usual separation between training and test sets since both sets should be jointly processed and is an important limitation in many practical applications where the test set is unbounded and unknown during the learning phase. Moreover, requiring the whole universe represents a bottleneck for the processing of massive dataset. In this paper, we propose to overcome these limitations following an incremental embedding based on dissimilarity representations. We study in this paper, the pros and cons of two methods, which allow computing implicitly, and separately the embedding of points in the test set and in the learning set. Conclusions are set following experiments performed on different datasets.
Unable to display preview. Download preview PDF.
- 4.Pekalska, E., Duin, R.: Classifiers for dissimilarity-based pattern recognition. In: Proceedings of the 15th International Conference on Pattern Recognition, vol. 2, pp. 12–16 (2000)Google Scholar
- 9.Cox, T.F., Cox, M.: Multidimensional Scaling, 2nd edn. Chapman and Hall/CRC (2000)Google Scholar
- 10.Dattorro, J.: Convex Optimization & Euclidean Distance Geometry. Meboo Publishing USA (2011)Google Scholar
- 14.Riesen, K., Bunke, H.: IAM graph database repository for graph based pattern recognition and machine learning. In: da Vitoria Lobo, N., Kasparis, T., Roli, F., Kwok, J.T., Georgiopoulos, M., Anagnostopoulos, G.C., Loog, M. (eds.) SSPR&SPR 2008. LNCS, vol. 5342, pp. 287–297. Springer, Heidelberg (2008)CrossRefGoogle Scholar
- 16.Chang, C.C., Lin, C.J.: LIBSVM: A library for support vector machines. ACM Transactions on Intelligent Systems and Technology 2, 27:1–27:27 (2011)Google Scholar