Fuzzy World: A Tool Training Agent from Concept Cognitive to Logic Inference

Luo, Minzhong

doi:10.1007/978-3-030-75762-5_1

Minzhong Luo^15,16

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 12712))

Included in the following conference series:

Pacific-Asia Conference on Knowledge Discovery and Data Mining

3712 Accesses

Abstract

Not like many visual systems or NLP frameworks, human generally use both visual and semantic information for reasoning tasks. In this paper, we present a 3D virtual simulation learning environment Fuzzy World based on gradual learning paradigm to train visual-semantic reasoning agent for complex logic reasoning tasks. Furthermore our baseline approach employed semantic graphs and deep reinforcement learning architecture shows the significant performance over the tasks.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
Implemented at https://github.com/Luomin1993/fuzzy-world-tool.

References

Kaelbling, L.P., Littman, M.L., Cassandra, A.R.: Planning and acting in partially observable stochastic domains. Artif. Intell. 101(1–2), 99–134 (1998)
Article MathSciNet Google Scholar
Angluin, D.: Queries and concept learning. Mach. Learn. 2(4), 319–342 (1988)
Article MathSciNet Google Scholar
Apt, K.R., Bol, R.N.: Logic programming and negation: a survey. J. Logic Program. 19(94), 9–71 (1994)
Article MathSciNet Google Scholar
Apt, K.R., Emden, M.H.V.: Contributions to the theory of logic programming. J. ACM 29(3), 841–862 (1982)
Article MathSciNet Google Scholar
Besold, T., et al.: Neural-symbolic learning and reasoning: a survey and interpretation
Google Scholar
Chein, M., Mugnier, M.-L.: Graph-based knowledge representation: computational foundations of conceptual graphs. Univ. Aberdeen 13(3), 329–347 (2009)
MATH Google Scholar
Chen, D.L., Mooney, R.J.: Learning to interpret natural language navigation instructions from observations. In: AAAI Conference on Artificial Intelligence, AAAI 2011, San Francisco, California, USA (2011)
Google Scholar
Chen, H., Suhr, A., Misra, D., Snavely, N., Artzi, Y.: Touchdown: natural language navigation and spatial reasoning in visual street environments
Google Scholar
Chollet, F., et al.: Keras (2015). https://keras.io
Dai, W.Z., Xu, Q.L., Yu, Y., Zhou, Z.H.: Tunneling neural perception and logic reasoning through abductive learning
Google Scholar
Das, A., Datta, S., Gkioxari, G., Lee, S., Parikh, D., Batra, D.: Embodied question answering. In: 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition (2018)
Google Scholar
Duda, R., Gaschnig, J., Hart, P.: Model design in the prospector consultant system for mineral exploration. Read. Artif. Intell. 334–348 (1981)
Google Scholar
Feigenbaum, E.A., Buchanan, B.G. Lederberg, J.: Generality and problem solving: a case study using the DENDRAL program. Stanford University (1970)
Google Scholar
Frome, A., Corrado, G.S., Shlens, J., Bengio, S., Dean, J., Ranzato, M., Mikolov, T.: DeViSE: a deep visual-semantic embedding model. In: International Conference on Neural Information Processing Systems, pp. 2121–2129 (2013)
Google Scholar
Gordon, D., Kembhavi, A., Rastegari, M., Redmon, J., Fox, D., Farhadi, A.: IQA: visual question answering in interactive environments
Google Scholar
Hermann, K.M., Felix Hill, S.G., Fumin Wang, P.B.: Grounded language learning in a simulated 3D world. In: NIPS Workshop (2017)
Google Scholar
Higgins, I., et al.: SCAN: learning abstract hierarchical compositional visual concepts
Google Scholar
Mamdani, A.S.: An experiment in linguistic synthesis with a fuzzy logic controller. Int. J. Man-Mach. Stud. 7, 1–13 (1975)
Article Google Scholar
Mccarthy, J.: Programs with common sense. Semant. Inf. Proces. 130(5), 403–418 (1959)
Google Scholar
Mordatch, I.: Concept learning with energy-based models. In: ICLR Workshop (2018)
Google Scholar
Ohlbach, H.J.: The semantic clause graph procedure - a first overview. In: Gwai-86 Und 2 Österreichische Artificial-intelligence-tagung (1986)
Google Scholar
Regneri, M., Rohrbach, M., Wetzel, D., Thater, S., Pinkal, M.: Grounding action descriptions in videos. Trans. Assoc. Comput. Lingus 1(3), 25–36 (2013)
Google Scholar
Shortliffe, E.H.: A rule-based computer program for advising physicians regarding antimicrobial therapy selection. Stanford University (1974)
Google Scholar
Shridhar, M., et al.: ALFRED: a benchmark for interpreting grounded instructions for everyday tasks
Google Scholar
Tang, J., Qu, M., Wang, M., Zhang, M., Yan, J., Mei, Q.: Line: large-scale information network embedding (2015)
Google Scholar
Tellex, S., et al.: Understanding natural language commands for robotic navigation and mobile manipulation. In: AAAI Conference on Artificial Intelligence, pp. 1507–1514 (2011)
Google Scholar
Tenenbaum, J.B.: Bayesian modeling of human concept learning. In: Conference on Advances in Neural Information Processing Systems II, pp. 59–65 (1998)
Google Scholar
Torralba, X., et al.: VirtualHome: simulating household activities via programs. In: 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition (2018)
Google Scholar
Watkins, C.J., Dayan, P.: Technical note: Q-learning. Mach. Learn. 8(3–4), 279–292 (1992)
Article MATH Google Scholar
Winograd, T.: Procedures as a representation for data in a computer program for understanding natural language. Technical report, Massachusetts Institute of Technology (1971)
Google Scholar
Yu, H., Lian, X., Zhang, H., Xu, W.: Guided feature transformation (GFT): a neural language grounding module for embodied agents
Google Scholar

Download references

Author information

Authors and Affiliations

Institute of Information Engineering, Chinese Academy of Sciences, Beijing, China
Minzhong Luo
School of Cyber Security, University of Chinese Academy of Sciences, Beijing, China
Minzhong Luo

Authors

Minzhong Luo
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Minzhong Luo .

Editor information

Editors and Affiliations

IIIT, Hyderabad, Hyderabad, India
Kamal Karlapalem
Chinese University of Hong Kong, Shatin, Hong Kong
Hong Cheng
Virginia Tech, Arlington, VA, USA
Naren Ramakrishnan
Jawaharlal Nehru University, New Delhi, India
R. K. Agrawal
IIIT Hyderabad, Hyderabad, India
P. Krishna Reddy
University of Minnesota, Minneapolis, MN, USA
Jaideep Srivastava
IIIT Delhi, New Delhi, India
Tanmoy Chakraborty

Appendix

1.1 6.1 Using Second Order Derivative Gradient for Cross Training Parameter

Notice that the prediction of the model $\mathcal {P}(L_A|V, L_Q)$ is in one-hot form of space concept like:[up and down, left and right, top left and bottom right...], then the loss of last layer employed softmax cross entropy loss is $\mathcal {L}(A_S) = -\hat{y} \odot log(f_{softmax}(A_S \odot C^{*T}))$. The next is the provement of an upper bound of $\hat{\mathcal {L}}(A+ \alpha \varDelta A)$.

Note that the updating of parameters A takes the simple SGD: $ A^{t+1} \leftarrow A^t+\alpha \nabla _A \hat{\mathcal {L}} $.

Theorem 1

When $\nabla ^2_A \hat{\mathcal {L}} \le MI$, we have$\hat{\mathcal {L}}(A+ \alpha \varDelta A) \le \hat{\mathcal {L}}(A) + \gamma ||\nabla _A \hat{\mathcal {L}}||^2$.

Proof

Easy to know $-\nabla _A \hat{\mathcal {L}}(A) = \varDelta A$, do Taylor expansion to $\hat{\mathcal {L}}(A+ \alpha \varDelta A)$:

$$\hat{\mathcal {L}}(A+ \alpha \varDelta A) = \hat{\mathcal {L}}(A)+ \alpha \nabla _A \hat{\mathcal {L}}(A) \odot \varDelta A +\nabla ^2_A \hat{\mathcal {L}} ||\varDelta A||^2 \alpha ^2 /2 $$

$$\le \hat{\mathcal {L}}(A)+ \alpha \nabla _A \hat{\mathcal {L}} \odot (-\nabla _A \hat{\mathcal {L}}) +M||\varDelta A||^2 \alpha ^2 /2 $$

$$= \hat{\mathcal {L}}(A)+(\alpha ^2M /2 - \alpha )||\nabla _A \hat{\mathcal {L}}||^2 $$

Now let $\gamma = \alpha ^2M /2 - \alpha \le 0$ then the below is satisfied:

$$\hat{\mathcal {L}}(A+ \alpha \varDelta A) \le \hat{\mathcal {L}}(A+ \alpha \varDelta A) +(\alpha -\alpha ^2M /2 )||\nabla _A \hat{\mathcal {L}}||^2 \le \hat{\mathcal {L}}(A)$$

.

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Luo, M. (2021). Fuzzy World: A Tool Training Agent from Concept Cognitive to Logic Inference. In: Karlapalem, K., et al. Advances in Knowledge Discovery and Data Mining. PAKDD 2021. Lecture Notes in Computer Science(), vol 12712. Springer, Cham. https://doi.org/10.1007/978-3-030-75762-5_1

Download citation

DOI: https://doi.org/10.1007/978-3-030-75762-5_1
Published: 09 May 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-75761-8
Online ISBN: 978-3-030-75762-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Fuzzy World: A Tool Training Agent from Concept Cognitive to Logic Inference

Abstract

Access this chapter

Notes

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Appendix

Appendix

1.1 6.1 Using Second Order Derivative Gradient for Cross Training Parameter

Theorem 1

Proof

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation