Learning Similarity with Fuzzy Functions of Adaptable Complexity
A common approach in database queries involves the multi-dimensional representation of objects by a set of features. These features are compared to the query representation and then combined together to produce a total similarity metric. In this paper we introduce a novel technique for similarity learning within features (attributes) by manipulating fuzzy membership functions (FMFs) of different complexity. Our approach is based on a gradual complexity increase adaptable to problem requirements. The underlying idea is that less adaptable functions will act as approximations for more complex ones. We begin by interpolating a set of planes in the training dataset and due to linearity we get a fast first impression of the underlying complexity. We proceed to interpolate two asymmetrical sigmoidal functions whose initial approximations are calculated from the plane properties. If satisfactory accuracy is not achieved we provide advanced modeling capabilities by investigating FMFs parameters and convolving their output with additional functions.
KeywordsSimilarity Function Training Dataset Input Space Sigmoidal Function Fuzzy Membership Function
Unable to display preview. Download preview PDF.
- 1.Agrawal, R., Faloutsos, C., Swami, A.: Efficient similarity search in sequence databases. In: Proc. of the Fourth Intl. Conference on Foundations of Data Organization and Algorithms, pp. 69–84 (1993)Google Scholar
- 2.Aha, D.W., Kibler, D.F., Albert, M.K.: Instance-Based Learning Algorithms. Machine Learning. 6, 37–66 (1991)Google Scholar
- 4.Berchtold, S., Kriegel, H.-P.: S3: Similarity Search in CAD Database Systems. In: Proc. ACM SIGMOD Conf., pp. 564–567 (1997)Google Scholar
- 5.Carkacioglu, A., Fatos, Y.-V.: Learning Similarity Space. In: Proc. of Intl. Conference in Image Processing, pp. 405–408 (2002)Google Scholar
- 7.Gionis, A., Indyk, P., Motwani, R.: Similarity search in high dimensions via hashing. In: Proc. of the 25th Intl. Conf. on Very Large Data Bases (VLDB), pp. 518–529 (1999)Google Scholar
- 8.Ishii, N., Wang, Y.: Learning Feature Weights for Similarity Measures using Genetic Algorithms. In: Proc. of IEEE Intl. Joint Symp. on Intelligence and Systems, pp. 27–33 (1998)Google Scholar
- 11.Mitaim, S., Kosko, B.: Neural Fuzzy Agents that Learn a User’s Preference Map. In: Proc. of 4th International Forum on Research and Technology Advances in Digital Libraries, pp. 25–35 (1997)Google Scholar
- 15.Rafiei, D., Mendelzon, O.: Similarity-Based Queries for Time Series Data. In: Proc. ACM SIGMOD Conf., pp. 13–25 (1997)Google Scholar
- 17.Vlachos, M., Gunopulos, D., Kollios, G.: Robust Similarity Measures for Mobile Object Trajectories. In: Proc. of DEXA Workshops, pp. 721–728 (2002)Google Scholar
- 19.Yi, B.-K., Faloutsos, C.: Fast Time Sequence Indexing for Arbitrary Lp Norms. In: Proc. of the 26th Intl. Conf. on Very Large Data Bases (VLDB), pp. 385–394 (2000)Google Scholar