A Kantian Cognitive Architecture

Evans, Richard

doi:10.1007/978-3-030-01800-9_13

Part of the book series: Philosophical Studies Series (PSSP, volume 134)

Abstract

In this paper, I reinterpret Kant’s Transcendental Analytic as a description of a cognitive architecture. I describe a computer implementation of this architecture, and show how it has been applied to two unsupervised learning tasks. The resulting program is highly data-efficient: it is able to learn from a handful of examples. I show how the program achieves this data efficiency: the constraints described in the Analytic of Principles are reinterpreted as strong prior knowledge, constraining the set of possible solutions.

Notes

1.
In making this claim, I am assuming a suitably red-blooded notion of “experience”. Of course, for some sufficiently thin notion of “experience”, the thermostat must “experience” the world in order to act at all. But there is a difference between merely responding to a stimulus and making sense of that stimulus: reinterpreting the stimulus as a representation of a coherent external world. The latter is “experience” in the strong sense I am using it.
2.
Note that I am not defining intentionality in terms of the activity of counting-as (which would be uninformative). Rather, I am using counting-as to distinguish between original and derivative intentionality. Later, counting-as will itself be explicated in terms of the construction and application of rules.
3.
All such references [A, B] are to the A and B editions of the Critique of Pure Reason (Kant 1781).
4.
By “willy-nilly”, I mean without justification from the application of a rule. Kant’s view is that the only mental actions that are justified are actions that result from applying a rule. What leaves room in this stern vision for spontaneity and autonomy is that the rules are not imposed from outside; rather, they are self-legislated.
5.
Please note that these Kantian rules do not have to be linguistically articulated or consciously accessible. Rather, the rules that determine the activities of mental combination are implicit and consciously inaccessible, in the same way that the rules of a compiled Prolog program are inaccessible to the executing process.
6.
See Kant (1781)(B201n): “the synthesis of a manifold of what does not necessarily belong to each other”.
7.
See Kant (1781)(B201n): “the second combination is the synthesis of that which is manifold insofar as they necessarily belong to one another”.
8.
See also [A105], [A177, B220].
9.
In computational terms, think of a meta-interpreter that is able to construct pieces of code as data, and then execute these new pieces of code.
10.
Kant makes the same point in the Metaphysical Deduction: “The same function that gives unity to the different representations in a judgement also gives unity to the mere synthesis of different representations in an intuition, which, expressed generally, is called the pure concept of the understanding. The same understanding, therefore, and indeed by means of the very same actions through which it brings the logical form of a judgement into concepts by means of the analytical unity, also brings a transcendental content into its representations by means of the synthetic unity of the manifold” (Kant 1781)(A79, B104-5). In other words, there is only one process (a process of constructing and applying rules) which explains both how we form judgements and how we form intuitions.
11.
Some of the connection rules involved in characterising a concept do more than simply state that one concept is a sub-concept of another, or that one concept excludes another. Some of them relate the concept to another concept only conditionally – dependent on the existence of external factors. For example: “If the weather gets cold, trees lose their leaves”, “If a tree gets no water, it perishes” (Longuenesse 1998). Some of the conceptual inference rules, in Kantian terms, are hypothetical rather than categorical.
12.
These activities are described in Kant (1781)(B185, A146).
13.
I use the Kantian term apprehension to denote a time-slice of an enduring object at a particular moment in time. Throughout, I use “apprehension” and “object-slice” interchangeably.
14.
Here I assume the stable model semantics (Gelfond and Lifschitz 1988) for negation-as-failure.
15.
See Reiter (1980).
16.
For influential examples, see Kowalski and Sergot (1989) and McCarthy (1963).
17.
I omit, for reasons of space, discussion of the Second Principle, the Anticipations of Perception. The Fourth Principle does not need its own relation.
18.
This is claimed explicitly in a marginal note to the first edition.
19.
This is called a “language bias” in the program induction literature.
20.
Contrast with Shanahan (2005), who sees perception as a form of abduction.
21.
The problem description for finding non-monotonic logic programs from positive and negative examples is actually somewhat more complicated, as there may be multiple models, each with its own positive and negative instances. See Law et al. (2014) for details.
22.
Hence Kant’s emphasis on spontaneity: the Kantian agent is both less free (because he can only perform actions by applying rules) and more free (because he can construct any set of rules he likes) than the empiricist can possibly imagine.
23.
There are two major simplifications in the current implementation. The first is that the spatial framework needed to satisfy the Axioms of Intuition is given in advance, pre-specified, hand-coded. The agent is told that he is operating in a 2-dimensional grid world. The second major simplification is that the constraints involved in the Anticipations of Perception are ignored altogether: in the initial implementation, time is modelled as a series of discrete points, rather than being dense. In future work, I plan to overcome these limitations.
24.
One important difference is that your tactile sensations are much more fine-grained: you receive a number of intermediate sensations as the object moves between your four knuckles. The robot just has four discrete boolean sensors (one for each knuckle).
25.
We assume, for simplicity, that the alphabet is cyclic, so that the successor of z is a.
26.
We need to be careful with notions of “correctness” in sequence induction tasks. There are always infinitely many ways of continuing a finite series, even if some appear more “natural” to us than others. In the case of the “Blackburn Dozen” and the “Hofstadter Fifteen”, the authors specified the intended continuation. I did not use these intended continuations when evaluating correctness. Instead, I gave the questions to 100 people, as an online form, and took the mode as the “correct” continuation. The Kantian constraints provide a way of formally specifying what is “natural” about the “natural” continuations.
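As an illustration of the stable model semantics assumed in note 14, the following is a minimal Python sketch of the Gelfond-Lifschitz construction for small propositional programs. It is not taken from the implementation described in this chapter; the rule encoding (a triple of head, positive body, negative body) and the function names `least_model`, `reduct`, `is_stable` and `stable_models` are hypothetical choices made for this example, and the brute-force enumeration is only practical for a handful of atoms.

```python
from itertools import chain, combinations

# Hypothetical encoding: a rule is (head, positive_body, negative_body),
# e.g. ("p", (), ("q",)) stands for "p :- not q."


def least_model(definite_rules):
    """Least model of a negation-free program, by fixpoint iteration."""
    model = set()
    changed = True
    while changed:
        changed = False
        for head, pos in definite_rules:
            if head not in model and all(atom in model for atom in pos):
                model.add(head)
                changed = True
    return model


def reduct(rules, candidate):
    """Gelfond-Lifschitz reduct: drop every rule whose negative body
    intersects the candidate, then strip the remaining negative bodies."""
    return [(head, pos) for head, pos, neg in rules
            if not (set(neg) & candidate)]


def is_stable(rules, candidate):
    """A candidate is stable iff it equals the least model of its reduct."""
    return least_model(reduct(rules, candidate)) == candidate


def stable_models(rules):
    """Enumerate all stable models by testing every subset of the atoms."""
    atoms = sorted({a for head, pos, neg in rules for a in (head, *pos, *neg)})
    subsets = chain.from_iterable(
        combinations(atoms, size) for size in range(len(atoms) + 1))
    return [set(s) for s in subsets if is_stable(rules, set(s))]


# "p :- not q."  and  "q :- not p."  have two stable models: {p} and {q}.
program = [("p", (), ("q",)), ("q", (), ("p",))]
print(stable_models(program))
```

The two-rule program at the end shows why the problem description mentioned in note 21 has to allow for multiple models: negation-as-failure can leave more than one self-consistent interpretation of the same program.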

References

Chalmers, D.J., R.M. French, and D.R. Hofstadter. 1992. High-level perception, representation, and analogy: A critique of artificial intelligence methodology. Journal of Experimental & Theoretical Artificial Intelligence 4(3): 185–211.
Corapi, D., A. Russo, and E. Lupu. 2010. Inductive logic programming as abductive search. In: ICLP (Technical Communications), 54–63.
Corapi, D., A. Russo, and E. Lupu. 2012. Inductive logic programming in answer set programming. In: Inductive Logic Programming, 91–97. Heidelberg/New York: Springer.
Frege, G., P. Geach, and M. Black. 1980. ‘Über Sinn und Bedeutung’, in Zeitschrift für Philosophie und philosophische Kritik, Translated as ‘On Sense and Reference’ by M. Black in Translations from the Philosophical Writings, 100: 25–50. Oxford: Blackwell, third edition.
Gelfond, M., and V. Lifschitz. 1988. The stable model semantics for logic programming. In: ICLP/SLP, vol. 88, 1070–1080.
Goodman, N.D., J.B. Tenenbaum, J. Feldman, and T.L. Griffiths. 2008. A rational analysis of rule-based concept learning. Cognitive Science 32(1): 108–154.
Graves, A., et al. 2012. Supervised sequence labelling with recurrent neural networks, vol. 385. University of Toronto, Springer.
Haugeland, J. 1990. The intentionality all-stars. Philosophical Perspectives 4: 383–427.
Hernandez-Orallo, J., and N. Minaya-Collado. 1998. A formal definition of intelligence based on an intensional variant of algorithmic complexity. In: Proceedings of International Symposium of Engineering of Intelligent Systems (EIS98), February 11–13, 146–163.
Hofstadter, D.R. 2008. Fluid Concepts and Creative Analogies: Computer Models of the Fundamental Mechanisms of Thought. New York: Basic Books.
Hofstadter, D.R., M. Mitchell, et al. 1994. The copycat project: A model of mental fluidity and analogy-making. Advances in Connectionist and Neural Computation Theory 2(31–112): 29–30.
Hutter, M. 2007. On universal prediction and Bayesian confirmation. Theoretical Computer Science 384(1): 33–48.
Jordan, C., and L. Kaiser. 2013. Learning programs as logical queries. In: The ICALP 2013 Satellite Workshop on Learning Theory and Complexity, (ICALP is the “International Colloquium on Automata, Languages and Programming”).
Kant, I. 1781. Critique of Pure Reason, Trans. P. Guyer. Cambridge University Press.
Kowalski, R., and M. Sergot. 1989. A logic-based calculus of events. In: Foundations of Knowledge Base Management, 23–55. Berlin: Springer.
Lake, B.M., R. Salakhutdinov, and J.B. Tenenbaum. 2015. Human-level concept learning through probabilistic program induction. Science 350(6266): 1332–1338.
Law, M., A. Russo, and K. Broda. 2014. Inductive learning of answer set programs. In: Logics in Artificial Intelligence, 311–325. Cham: Springer.
Longuenesse, B. 1998. Kant and the Capacity to Judge. Princeton: Princeton University Press.
McCarthy, J. 1963. Situations, actions, and causal laws. Technical Report, DTIC Document.
Meredith, M.J.E. 1986. Seek-whence: A model of pattern perception. Technical Report, Indiana University, Bloomington (USA).
Mitchell, M. 1993. Analogy-making as perception: A computer model. Cambridge: MIT Press.
Muggleton, S.H., D. Lin, and A. Tamaddoni-Nezhad. 2015. Meta-interpretive learning of higher-order dyadic datalog: Predicate invention revisited. Machine Learning 100(1): 49–73.
Reiter, R. 1980. A logic for default reasoning. Artificial Intelligence 13(1): 81–132.
Sellars, W. 1968. Science and metaphysics: Variations on Kantian themes. Ridgeview Publishing Company, Springer.
Shanahan, M. 2005. Perception as abduction: Turning sensor data into meaningful representation. Cognitive Science 29(1): 103–134.
Sloman, A. 2008. Kantian philosophy of mathematics and young robots. In: International Conference on Intelligent Computer Mathematics, 558–573. Berlin/Heidelberg: Springer.
Tenenbaum, J.B. 2000. Rules and similarity in concept learning. Advances in Neural Information Processing Systems 12: 59–65.
Waxman, W. 2013. Kant’s Anatomy of the Intelligent Mind. Oxford: Oxford University Press.
Wittgenstein, L. 1958. The Blue and Brown Books. Oxford: Blackwell.

Author information

Authors and Affiliations

Imperial College London, London, UK
Richard Evans
DeepMind, London, UK
Richard Evans

Corresponding author

Correspondence to Richard Evans.

Editor information

Editors and Affiliations

Department of Humanities, Texas A&M University Corpus Christi, Corpus Christi, TX, USA
Don Berkich
Dipartimento di Studi Umanistici, Università di Ferrara, Ferrara, Italy
Matteo Vincenzo d'Alfonso

About this chapter

Cite this chapter

Evans, R. (2019). A Kantian Cognitive Architecture. In: Berkich, D., d'Alfonso, M. (eds) On the Cognitive, Ethical, and Scientific Dimensions of Artificial Intelligence. Philosophical Studies Series, vol 134. Springer, Cham. https://doi.org/10.1007/978-3-030-01800-9_13

DOI: https://doi.org/10.1007/978-3-030-01800-9_13
Published: 29 January 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-01799-6
Online ISBN: 978-3-030-01800-9
eBook Packages: Computer Science, Computer Science (R0)
