Skip to main content

Feature Discovery with Type Extension Trees

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 5194))

Abstract

We are interested in learning complex combinatorial features from relational data. We rely on an expressive and general representation language whose semantics allows us to express many features that have been used in different statistical relational learning settings. To avoid expensive exhaustive search over the space of relational features, we introduce a heuristic search algorithm guided by a generalized relational notion of information gain and a discriminant function. The algorithm succesfully finds interesting and interpretable features on artificial and real-world relational learning problems.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Van Assche, A., Vens, C., Blockeel, H., Dzeroski, S.: First order random forests: Learning relational classifiers with complex aggregates. Machine Learning 64, 149–182 (2006)

    Article  MATH  Google Scholar 

  2. Blockeel, H., De Raedt, L.: Lookahead and discretization in ilp. In: Proc. of the 7th Int. Workshop on ILP, pp. 77–84 (1997)

    Google Scholar 

  3. Blockeel, H., De Raedt, L.: Top-down induction of first-order logical decision trees. Artificial Intelligence (101), 285–297 (1998)

    Google Scholar 

  4. Castillo, L.P., Wrobel, S.: A comparative study on methods for reducing myopia of hill-climbing search in multirelational learning. In: Proc. of ICML 2004 (2004)

    Google Scholar 

  5. Friedman, N., Getoor, L., Koller, D., Pfeffer, A.: Learning probabilistic relational models. In: Proc. of IJCAI 1999 (1999)

    Google Scholar 

  6. Hulo, N., Sigrist, C.J.A., Le Saux, V., Langendijk-Genevaux, P.S., Bordoli, L., Gattiker, A., De Castro, E., Bucher, P., Bairoch, A.: Recent improvements to the prosite database. Nucleic Acids Research 32(Database-Issue), 134–137 (2004)

    Article  Google Scholar 

  7. Jaeger, M.: Type extension trees: a unified framework for relational feature construction. In: Proceedings of Mining and Learning with Graphs (MLG 2006) (2006)

    Google Scholar 

  8. Jensen, D., Neville, J., Hay, M.: Avoiding bias when aggregating relational data with degree disparity. In: Proc. of ICML 2003 (2003)

    Google Scholar 

  9. Knobbe, A.J., Siebes, A., van der Wallen, D.: Multi-relational decision tree induction. In: Żytkow, J.M., Rauch, J. (eds.) PKDD 1999. LNCS (LNAI), vol. 1704, pp. 378–383. Springer, Heidelberg (1999)

    Google Scholar 

  10. Neville, J., Jensen, D.: Collective classification with relational dependency networks. In: Proc. of 2nd Int. Workshop on Multi-Relational Data Mining, pp. 77–91 (2003)

    Google Scholar 

  11. Neville, J., Jensen, D., Friedland, L., Hay, M.: Learning relational probability trees. In: Proceedings of SIGKDDD 2003 (2003)

    Google Scholar 

  12. Passerini, A., Punta, M., Ceroni, A., Rost, B., Frasconi, P.: Identifying cysteines and histidines in transition-metal-binding sites using support vector machines and neural networks. Proteins 65(2), 305–316 (2006)

    Article  Google Scholar 

  13. Perlich, C., Provost, F.: Aggregation-based featrue invention and relational concept classes. In: Proc. of SIGKDD 2003 (2003)

    Google Scholar 

  14. Popescul, A., Ungar, L.H.: Feature generation and selection in multi-relational statistical learning. In: Getoor, L., Taskar, B. (eds.) Statistical Relational Learning. MIT Press, Cambridge (2007)

    Google Scholar 

  15. Singla, P., Domingos, P.: Entity resolution with markov logic. In: Perner, P. (ed.) ICDM 2006. LNCS (LNAI), vol. 4065. Springer, Heidelberg (2006)

    Google Scholar 

  16. Hanley, J.A., McNeil, B.J.: A method of comparing the areas under receiver operating characteristic curves derived from the same cases. Radiology 148(3), 839–843 (1983)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Filip Železný Nada Lavrač

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Frasconi, P., Jaeger, M., Passerini, A. (2008). Feature Discovery with Type Extension Trees. In: Železný, F., Lavrač, N. (eds) Inductive Logic Programming. ILP 2008. Lecture Notes in Computer Science(), vol 5194. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-85928-4_13

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-85928-4_13

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-85927-7

  • Online ISBN: 978-3-540-85928-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics