Variations on U-Shaped Learning

  • Conference paper
Learning Theory (COLT 2005)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 3559)

Abstract

The paper deals with the following problem: is returning to wrong conjectures necessary to achieve full power of learning? Returning to wrong conjectures complements the paradigm of U-shaped learning [2,6,8,20,24], in which a learner returns to old correct conjectures. We explore this problem for the classical models of learning in the limit from text: TxtEx-learning, where a learner stabilizes on a single correct conjecture, and TxtBc-learning, where a learner stabilizes on a sequence of grammars representing the target concept. In all cases we show that, surprisingly, returning to wrong conjectures is sometimes necessary to achieve full learning power. On the other hand, it is not necessary to return to old “overgeneralizing” conjectures, that is, conjectures containing elements not belonging to the target language. We also consider the problem in the context of so-called vacillatory learning, where a learner stabilizes on a finite set of correct grammars. In this case we show that both returning to old wrong conjectures and returning to old “overgeneralizing” conjectures are necessary for full learning power. Finally, we show that, surprisingly, learners that are consistent with the input seen so far can be made decisive [2,21]: they never have to return to any old conjectures, wrong or right.
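
For orientation, the criteria named above are the standard Gold-style ones from inductive inference; the sketch below uses the usual notation (T a text for the target language L, T[n] its initial segment of length n, W_e the language generated by grammar e, M(T[n]) the learner's conjecture after seeing T[n]) and is our summary of the standard definitions, not a quotation from the paper:

  TxtEx:   \exists e \,\exists n_0 \,[\; W_e = L \;\wedge\; \forall n \ge n_0 :\ M(T[n]) = e \;]

  TxtBc:   \forall^{\infty} n :\ W_{M(T[n])} = L   (all but finitely many conjectures are grammars for L)

  Vacillatory (cf. [9]):   from some point on, M outputs only finitely many distinct conjectures, each a grammar for L

In this notation, a U-shaped learner abandons some conjecture e with W_e = L and later returns to a correct conjecture; the question raised in the abstract is the complementary one, namely whether a learner can always avoid re-issuing a previously abandoned conjecture e with W_e \ne L without losing learning power.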

References

  1. Angluin, D.: Inductive inference of formal languages from positive data. Information and Control 45, 117–135 (1980)

  2. Baliga, G., Case, J., Merkle, W., Stephan, F., Wiehagen, R.: When unlearning helps (Manuscript 2005); Preliminary version of the paper appeared in ICALP (2000), http://www.cis.udel.edu/~case/papers/decisive.ps

  3. Bārzdiņš, J.: Inductive inference of automata, functions and programs. In: Int. Math. Congress, Vancouver, pp. 771–776 (1974)

  4. Blum, L., Blum, M.: Toward a mathematical theory of inductive inference. Information and Control 28, 125–155 (1975)

  5. Blum, M.: A machine-independent theory of the complexity of recursive functions. Journal of the ACM 14, 322–336 (1967)

  6. Bowerman, M.: Starting to talk worse: Clues to language acquisition from children’s late speech errors. In: Strauss, S., Stavy, R. (eds.) U-Shaped Behavioral Growth. Developmental Psychology Series, Academic Press, New York (1982)

  7. Carey, S.: An analysis of a learning paradigm. In: Strauss, S., Stavy, R. (eds.) U-Shaped Behavioral Growth. Developmental Psychology Series, Academic Press, New York (1982)

  8. Carlucci, L., Case, J., Jain, S., Stephan, F.: U-shaped learning may be necessary. Technical Report TRA11/04, School of Computing, National University of Singapore (November 2004)

  9. Case, J.: The power of vacillation in language learning. SIAM Journal on Computing 28(6), 1941–1969 (1999)

  10. Case, J., Lynes, C.: Machine inductive inference and language identification. In: Nielsen, M., Schmidt, E.M. (eds.) ICALP 1982. LNCS, vol. 140, pp. 107–115. Springer, Heidelberg (1982)

  11. Case, J., Smith, C.: Comparison of identification criteria for machine inductive inference. Theoretical Computer Science 25, 193–220 (1983)

  12. Fulk, M.: Prudence and other conditions on formal language learning. Information and Computation 85, 1–11 (1990)

  13. Fulk, M., Jain, S., Osherson, D.: Open problems in systems that learn. Journal of Computer and System Sciences 49(3), 589–604 (1994)

  14. Gold, E.M.: Language identification in the limit. Information and Control 10, 447–474 (1967)

  15. Hopcroft, J., Ullman, J.: Introduction to Automata Theory, Languages, and Computation. Addison-Wesley, Reading (1979)

  16. Jantke, K., Beick, H.: Combining postulates of naturalness in inductive inference. Journal of Information Processing and Cybernetics (EIK) 17, 465–484 (1981)

  17. Kurtz, S., Royer, J.: Prudence in language learning. In: Haussler, D., Pitt, L. (eds.) Proceedings of the Workshop on Computational Learning Theory, pp. 143–156. Morgan Kaufmann, San Francisco (1988)

  18. Lange, S., Wiehagen, R.: Polynomial time inference of arbitrary pattern languages. New Generation Computing 8, 361–370 (1991)

  19. Machtey, M., Young, P.: An Introduction to the General Theory of Algorithms. North-Holland, New York (1978)

  20. Marcus, G., Pinker, S., Ullman, M., Hollander, M., Rosen, T., Xu, F.: Overregularization in Language Acquisition. In: Monographs of the Society for Research in Child Development, vol. 57(4), University of Chicago Press, Chicago (1992); Includes commentary by Harold Clahsen

  21. Osherson, D., Stob, M., Weinstein, S.: Systems that Learn: An Introduction to Learning Theory for Cognitive and Computer Scientists. MIT Press, Cambridge (1986)

  22. Plunkett, K., Marchman, V.: U-shaped learning and frequency effects in a multi-layered perceptron: implications for child language acquisition. Cognition 38(1), 43–102 (1991)

  23. Rogers, H.: Theory of Recursive Functions and Effective Computability. MIT Press, Cambridge (1987)

  24. Strauss, S., Stavy, R. (eds.): U-Shaped Behavioral Growth. Developmental Psychology Series, Academic Press, New York (1982)

  25. Strauss, S., Stavy, R., Orpaz, N.: The child’s development of the concept of temperature. Manuscript, Tel-Aviv University (1977)

  26. Taatgen, N.A., Anderson, J.R.: Why do children learn to say broke? A model of learning the past tense without feedback. Cognition 86(2), 123–155 (2002)

  27. Wiehagen, R., Liepe, W.: Charakteristische Eigenschaften von erkennbaren Klassen rekursiver Funktionen. Journal of Information Processing and Cybernetics (EIK) 12, 421–438 (1976)

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Carlucci, L., Jain, S., Kinber, E., Stephan, F. (2005). Variations on U-Shaped Learning. In: Auer, P., Meir, R. (eds) Learning Theory. COLT 2005. Lecture Notes in Computer Science, vol. 3559. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11503415_26

  • DOI: https://doi.org/10.1007/11503415_26

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-26556-6

  • Online ISBN: 978-3-540-31892-7

  • eBook Packages: Computer Science, Computer Science (R0)
