Feature Discovery with Type Extension Trees

Frasconi, Paolo; Jaeger, Manfred; Passerini, Andrea

doi:10.1007/978-3-540-85928-4_13

Conference paper

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 5194)

Abstract

We are interested in learning complex combinatorial features from relational data. We rely on an expressive and general representation language whose semantics allows us to express many features that have been used in different statistical relational learning settings. To avoid expensive exhaustive search over the space of relational features, we introduce a heuristic search algorithm guided by a generalized relational notion of information gain and a discriminant function. The algorithm successfully finds interesting and interpretable features on artificial and real-world relational learning problems.
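
For orientation, the classical (non-relational) information gain that the abstract's "generalized relational notion" builds on can be sketched as follows. This is only an illustration under stated assumptions: the abstract does not give the paper's generalized definition or its discriminant function, so the code below scores an ordinary discrete feature, and the function names and toy data are hypothetical.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a sequence of class labels."""
    total = len(labels)
    if total == 0:
        return 0.0
    counts = Counter(labels)
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def information_gain(feature_values, labels):
    """Classical information gain of a discrete feature with respect to labels.

    Assumption: this is the textbook definition, not the generalized
    relational variant used in the paper.
    """
    total = len(labels)
    groups = {}
    for value, label in zip(feature_values, labels):
        groups.setdefault(value, []).append(label)
    # Expected entropy of the labels after splitting on the feature.
    remainder = sum(len(g) / total * entropy(g) for g in groups.values())
    return entropy(labels) - remainder


# Hypothetical toy data: a Boolean feature per entity and its class label.
labels = ["pos", "pos", "neg", "neg"]
informative = [True, True, False, False]    # separates the classes perfectly
uninformative = [True, False, True, False]  # independent of the class

gain_informative = information_gain(informative, labels)
gain_uninformative = information_gain(uninformative, labels)
```

A heuristic search of the kind the abstract describes would use a score like this to rank candidate features and expand only the most promising ones instead of enumerating the whole feature space.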

Author information

Authors and Affiliations

Department for Computer Science, Aalborg University, Denmark
Manfred Jaeger
Dipartimento di Sistemi e Informatica, Università degli Studi di Firenze, Italy
Paolo Frasconi & Andrea Passerini

Editor information

Filip Železný, Nada Lavrač

About this paper

Cite this paper

Frasconi, P., Jaeger, M., Passerini, A. (2008). Feature Discovery with Type Extension Trees. In: Železný, F., Lavrač, N. (eds) Inductive Logic Programming. ILP 2008. Lecture Notes in Computer Science, vol 5194. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-85928-4_13

DOI: https://doi.org/10.1007/978-3-540-85928-4_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-85927-7
Online ISBN: 978-3-540-85928-4
eBook Packages: Computer Science, Computer Science (R0)
