Graph of Words Embedding for Molecular Structure-Activity Relationship Analysis
Structure-Activity relationship analysis aims at discovering chemical activity of molecular compounds based on their structure. In this article we make use of a particular graph representation of molecules and propose a new graph embedding procedure to solve the problem of structure-activity relationship analysis. The embedding is essentially an arrangement of a molecule in the form of a vector by considering frequencies of appearing atoms and frequencies of covalent bonds between them. Results on two benchmark databases show the effectiveness of the proposed technique in terms of recognition accuracy while avoiding high operational costs in the transformation.
KeywordsFeature Vector Adjacency Matrix Graph Match Pattern Recognition Letter Edge Attribute
- 8.Helma, C., Kramer, T., Kramer, S., De Raedt, L.: Data Mining and Machine Learning Techniques for the Identification of Mutagenicity Inducing Substructures and Structure-Activity Relationship of Noncongeneric Compounds. Journal of Chemical Information and Computer Sciences 44(4), 1402–1411 (2004)CrossRefGoogle Scholar
- 9.Kashima, H., Tsuda, K., Inokuchi, A.: Marginalized Kernels Between Labeled Graphs. In: Proceedings of the 20th International Conference on Machine Learning, pp. 321–328. AAAI Press, Menlo Park (2003)Google Scholar
- 10.Kramer, S., De Raedt, L.: Feature construction with version spaces for biochemical application. In: Proceeding of the 18th International Conference on Machine Learning, pp. 258–265 (2001)Google Scholar
- 11.Lewis, D.: Naive (Bayes) at Forty: The Independence Assumption in Information Retrieval. In: Proceedings of the 10th European Conference on Machine Learning, vol. (1398), pp. 4–15 (1998)Google Scholar
- 12.Schölkopf, B., Smola, A.J.: Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond. MIT Press, Cambridge (2001)Google Scholar