Skip to main content

Part of the book series: Philosophical Studies Series ((PSSP,volume 134))

Abstract

In this paper, I reinterpret Kant’s Transcendental Analytic as a description of a cognitive architecture. I describe a computer implementation of this architecture, and show how it has been applied to two unsupervised learning tasks. The resulting program is very data efficient, able to learn from a tiny handful of examples. I show how the program achieves data-efficiency: the constraints described in the Analytic of Principles are reinterpreted as strong prior knowledge, constraining the set of possible solutions.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 139.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 179.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    In making this claim, I am assuming a suitably red-blooded notion of “experience”. Of course, for some sufficiently thin notion of “experience”, the thermostat must “experience” the world in order to act at all. But there is a difference between merely responding to a stimulus and making sense of that stimulus: reinterpreting the stimulus as a representation of a coherent external world. The latter is “experience” in the strong sense I am using it.

  2. 2.

    Note that I am not defining intentionality in terms of the activity of counting-as (which would be uninformative). Rather, I am using counting-as to distinguish between original and derivative intentionality. Later, counting-as will itself be explicated in terms of the construction and application of rules.

  3. 3.

    All such references [A, B] are to the A and B editions of the Critique of Pure Reason, (Kant 1781).

  4. 4.

    By “willy-nilly”, I mean without justification from the application of a rule. Kant’s view is that the only mental actions that are justified are actions that result from applying a rule. What leaves room in this stern vision for spontaneity and autonomy is that the rules are not imposed from outside; rather, they are self-legislated.

  5. 5.

    Please note that these Kantian rules do not have to be linguistically articulated or consciously accessible. Rather, the rules that determine the activities of mental combination are implicit and consciously inaccessible, in the same way that the rules of a compiled Prolog program are inaccessible to the executing process.

  6. 6.

    See Kant (1781)(B201n): “the synthesis of a manifold of what does not necessarily belong to each other”.

  7. 7.

    See Kant (1781)(B201n): “the second combination is the synthesis of that which is manifold insofar as they necessarily belong to one another”.

  8. 8.

    See also [A105], [A177, B220].

  9. 9.

    In computational terms, think of a meta-interpreter that is able to construct pieces of code as data, and then execute these new pieces of code.

  10. 10.

    Kant makes the same point in the Metaphysical Deduction: “The same function that gives unity to the different representations in a judgement also gives unity to the mere synthesis of different representations in an intuition, which, expressed generally, is called the pure concept of the understanding. The same understanding, therefore, and indeed by means of the very same actions through which it brings the logical form of a judgement into concepts by means of the analytical unity, also brings a transcendental content into its representations by means of the synthetic unity of the manifold” (Kant 1781)(A79, B104-5). In other words, there is only one process (a process of constructing and applying rules) which explains both how we form judgements and how we form intuitions.

  11. 11.

    Some of the connection rules involved in characterising a concept do more than simply state that one concept is a sub-concept of another, or that one concept excludes another. Some of them relate the concept to another concept only conditionally – dependent on the existence of external factors. For example: “If the weather gets cold, trees lose their leaves”, “If a tree gets no water, it perishes” (Longuenesse 1998). Some of the conceptual inference rules, in Kantian terms, are hypothetical rather than categorical.

  12. 12.

    These activities are described in Kant (1781)(B185, A146) .

  13. 13.

    I use the Kantian term apprehension to denote a time-slice of an enduring object at a particular moment in time. Throughout, I use “apprehension” and “object-slice” interchangeably.

  14. 14.

    Here I assume the stable model semantics (Gelfond and Lifschitz 1988) for negation-as-failure.

  15. 15.

    See Reiter (1980).

  16. 16.

    For influential examples, see Kowalski and Sergot (1989) and McCarthy (1963).

  17. 17.

    I omit, for reasons of space, discussion of the Second Principle, the Anticipations of Perception. The Fourth Principle does not need its own relation.

  18. 18.

    This is claimed explicitly in a marginal note to the first edition.

  19. 19.

    This is called a “language bias” in the program induction literature.

  20. 20.

    Contrast with Shanahan (2005), who sees perception as a form of abduction.

  21. 21.

    The problem description for finding non-monotonic logic programs from positive and negative examples is actually somewhat more complicated, as there may be multiple models, each with their own positive and negative instances. See Law et al (2014) for details.

  22. 22.

    Hence Kant’s emphasis on spontaneity: the Kantian agent is both less free (because he can only perform actions by applying rules) and more free (because he can construct any set of rules he likes) than the empiricist can possibly imagine.

  23. 23.

    There are two major simplifications in the current implementation. The first is that the spatial framework needed to satisfy the Axioms of Intuition is given in advance, pre-specified, hand-coded. The agent is told that he is operating in a 2-dimensional grid world. The second major simplification is that the constraints involved in the Anticipations of Perception are ignored altogether: in the initial implementation, time is modelled as a series of discrete points, rather than being dense. In future work, I plan to overcome these limitations.

  24. 24.

    One important difference is that your tactile sensations are much more fine-grained: you receive a number of intermediate sensations as the object moves between your four knuckles. The robot just has four discrete boolean sensors (one for each knuckle).

  25. 25.

    We assume, for simplicity, that the alphabet is cyclic, so that the successor of z is a.

  26. 26.

    We need to be careful with notions of “correctness” in sequence induction tasks. There are always infinitely many ways of continuing a finite series, even if some appear more “natural” to us than others. In the case of the “Blackburn Dozen” and the “Hofstadter Fifteen”, the authors specified the intended continuation. I did not use these intended continuations when evaluating correctness. Instead, I gave the questions to 100 people, as an online form, and took the mode as the “correct” continuation. The Kantian constraints provide a way of formally specifying what is “natural” about the “natural” continuations.

References

  • Chalmers, D.J., R.M. French, and D.R. Hofstadter. 1992. High-level perception, representation, and analogy: A critique of artificial intelligence methodology. Journal of Experimental & Theoretical Artificial Intelligence 4(3): 185–211.

    Article  Google Scholar 

  • Corapi, D., A. Russo, and E. Lupu. 2010. Inductive logic programming as abductive search. In: ICLP (Technical Communications), 54–63.

    Google Scholar 

  • Corapi, D., A. Russo, and E. Lupu. 2012. Inductive logic programming in answer set programming. In: Inductive Logic Programming, 91–97. Heidelberg/New York: Springer.

    MATH  Google Scholar 

  • Frege, G., P. Geach, and M. Black. 1980. ‘Über Sinn und Bedeutung’, in Zeitschrift für Philosophie und philosophische Kritik, Translated as ‘On Sense and Reference’ by M. Black in Translations from the Philosophical Writings, 100: 25–50. Oxford: Blackwell, third edition.

    Google Scholar 

  • Gelfond, M., and V. Lifschitz. 1988. International Conference on Logic Programming. The stable model semantics for logic programming. In: ICLP/SLP, vol. 88, 1070–1080.

    Google Scholar 

  • Goodman, N.D., J.B. Tenenbaum, J. Feldman, and T.L. Griffiths. 2008. A rational analysis of rule-based concept learning. Cognitive Science 32(1): 108–154.

    Article  Google Scholar 

  • Graves, A., et al. 2012. Supervised sequence labelling with recurrent neural networks, vol. 385. University of Toronto, Springer.

    MATH  Google Scholar 

  • Haugeland, J. 1990. The intentionality all-stars. Philosophical Perspectives 4: 383–427.

    Article  Google Scholar 

  • Hernandez-Orallo, J., and N. Minaya-Collado. 1998. Engineering of Intelligent Systems, A formal definition of intelligence based on an intensional variant of algorithmic complexity. In: Proceedings of International Symposium of Engineering of Intelligent Systems (EIS98), February 11–13, 146–163.

    Google Scholar 

  • Hofstadter, D.R. 2008. Fluid Concepts and Creative Analogies: Computer Models of the Fundamental Mechanisms of Thought. New York: Basic Books.

    Google Scholar 

  • Hofstadter, D.R., M. Mitchell, et al. 1994. The copycat project: A model of mental fluidity and analogy-making. Advances in Connectionist and Neural Computation Theory 2(31–112): 29–30.

    Google Scholar 

  • Hutter, M. 2007. On universal prediction and Bayesian confirmation. Theoretical Computer Science 384(1): 33–48.

    Article  MathSciNet  Google Scholar 

  • Jordan, C., and L. Kaiser. 2013. Learning programs as logical queries. In: The ICALP 2013 Satellite Workshop on Learning Theory and Complexity, (ICALP is the “International Colloquium on Automata, Languages and Programming”).

    Google Scholar 

  • Kant, I. 1781. Critique of Pure Reason, Trans. P Guyer. Cambridge University Press.

    Google Scholar 

  • Kowalski, R., and M. Sergot. 1989. A logic-based calculus of events. In: Foundations of Knowledge Base Management, 23–55. Berlin: Springer.

    Google Scholar 

  • Lake, B.M., R. Salakhutdinov, and J.B. Tenenbaum. 2015. Human-level concept learning through probabilistic program induction. Science 350(6266): 1332–1338.

    Article  MathSciNet  Google Scholar 

  • Law, M., A. Russo, and K. Broda. 2014. Inductive learning of answer set programs. In: Logics in Artificial Intelligence, 311–325. Cham: Springer.

    Google Scholar 

  • Longuenesse, B. 1998. Kant and the Capacity to Judge. Princeton: Princeton University Press.

    Google Scholar 

  • McCarthy, J. 1963. Situations, actions, and causal laws. Technical Report, DTIC Document.

    Book  Google Scholar 

  • Meredith, M.J.E. 1986. Seek-whence: A model of pattern perception. Technical Report, Indiana University, Bloomington (USA).

    Google Scholar 

  • Mitchell, M. 1993. Analogy-making as perception: A computer model. Cambridge: MIT Press.

    Google Scholar 

  • Muggleton, S.H., D. Lin, and A. Tamaddoni-Nezhad. 2015. Meta-interpretive learning of higher-order dyadic datalog: Predicate invention revisited. Machine Learning 100(1): 49–73.

    Article  MathSciNet  Google Scholar 

  • Reiter, R. 1980. A logic for default reasoning. Artificial Intelligence 13(1): 81–132.

    Article  MathSciNet  Google Scholar 

  • Sellars, W. 1968. Science and metaphysics: Variations on Kantian themes. Ridgeview Publishing Company, Springer.

    Google Scholar 

  • Shanahan, M. 2005. Perception as abduction: Turning sensor data into meaningful representation. Cognitive Science 29(1): 103–134.

    Article  Google Scholar 

  • Sloman, A. 2008. Kantian philosophy of mathematics and young robots. In: International Conference on Intelligent Computer Mathematics, 558–573. Berlin/Heidelberg: Springer.

    Google Scholar 

  • Tenenbaum, J.B. 2000. Rules and similarity in concept learning. Advances in Neural Information Processing Systems 12: 59–65.

    Google Scholar 

  • Waxman, W. 2013. Kant’s Anatomy of the Intelligent Mind. Oxford: Oxford University Press.

    Book  Google Scholar 

  • Wittgenstein, L. 1958. The Blue and Brown Books. Oxford: Blackwell.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Richard Evans .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Evans, R. (2019). A Kantian Cognitive Architecture. In: Berkich, D., d'Alfonso, M. (eds) On the Cognitive, Ethical, and Scientific Dimensions of Artificial Intelligence. Philosophical Studies Series, vol 134. Springer, Cham. https://doi.org/10.1007/978-3-030-01800-9_13

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-01800-9_13

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-01799-6

  • Online ISBN: 978-3-030-01800-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics