Learning Guided Enumerative Synthesis for Superoptimization

Singh, Shikhar; Zhang, Mengshi; Khurshid, Sarfraz

doi:10.1007/978-3-030-30923-7_10

Shikhar Singh¹¹,
Mengshi Zhang¹¹ &
Sarfraz Khurshid¹¹

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 11636))

Included in the following conference series:

International Symposium on Model Checking Software

625 Accesses
2 Citations

Abstract

The field of program synthesis has seen substantial recent progress in new ideas, e.g., program sketching and synthesis modulo pruning, and applications, e.g., in program repair and superoptimization, which is our focus in this paper. The goal of superoptimization is to generate a program which is functionally equivalent to the given program but is optimal with respect to some desired criteria. We develop a learning-based approach to guide the exploration of the space of candidate programs to parts of the space where an optimal solution likely exists. We introduce the techniques of bulk and sequence orderings which enable this directed search. We integrate these machine learning techniques with an enumerative superoptimizer and experimentally evaluate our framework using a suite of subjects. Our findings demonstrate that machine learning techniques can play a useful role in reducing the amount of candidate program space that the enumerative search must to explore in order to find an optimal solution; for the subject programs, the reduction is up to 80% on average.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Abadi, M., et al.: TensorFlow: a system for large-scale machine learning. In: Proceedings of the 12th USENIX Conference on Operating Systems Design and Implementation, OSDI 2016, pp. 265–283. USENIX Association, Berkeley (2016). http://dl.acm.org/citation.cfm?id=3026877.3026899
Alur, R., et al.: Syntax-guided synthesis. In: 2013 Formal Methods in Computer-Aided Design, pp. 1–8, October 2013. https://doi.org/10.1109/FMCAD.2013.6679385
Alur, R., Radhakrishna, A., Udupa, A.: Scaling enumerative program synthesis via divide and conquer. In: Legay, A., Margaria, T. (eds.) TACAS 2017. LNCS, vol. 10205, pp. 319–336. Springer, Heidelberg (2017). https://doi.org/10.1007/978-3-662-54577-5_18
Chapter Google Scholar
Balog, M., Gaunt, A.L., Brockschmidt, M., Nowozin, S., Tarlow, D.: DeepCoder: learning to write programs. arXiv preprint arXiv:1611.01989 (2016)
Bansal, S., Aiken, A.: Automatic generation of peephole superoptimizers. In: Proceedings of the 12th International Conference on Architectural Support for Programming Languages and Operating Systems, ASPLOS XII, pp. 394–403. ACM, New York (2006). https://doi.org/10.1145/1168857.1168906
Barrett, C., Sebastiani, R., Seshia, S., Tinelli, C.: Satisfiability modulo theories. In: Frontiers in Artificial Intelligence and Applications, vol. 185, pp. 825–885, 1st edn (2009). https://doi.org/10.3233/978-1-58603-929-5-825
Bornholt, J., Torlak, E., Grossman, D., Ceze, L.: Optimizing synthesis with metasketches. In: Proceedings of the 43rd Annual ACM SIGPLAN-SIGACT Symposium on Principles of Programming Languages, POPL 2016, pp. 775–788. ACM, New York (2016). https://doi.org/10.1145/2837614.2837666
Bunel, R., Desmaison, A., Kumar, M.P., Torr, P.H.S., Kohli, P.: Learning to superoptimize programs. CoRR abs/1611.01787 (2016)
Google Scholar
Cho, K., van Merrienboer, B., Bahdanau, D., Bengio, Y.: On the properties of neural machine translation: encoder-decoder approaches. CoRR abs/1409.1259 (2014). http://arxiv.org/abs/1409.1259
Chollet, F.: Keras (2015). https://github.com/fchollet/keras
Chung, J., Gülçehre, Ç., Cho, K., Bengio, Y.: Empirical evaluation of gated recurrent neural networks on sequence modeling. CoRR abs/1412.3555 (2014). http://arxiv.org/abs/1412.3555
D’Antoni, L., Samanta, R., Singh, R.: Qlose: program repair with quantitative objectives. In: Chaudhuri, S., Farzan, A. (eds.) CAV 2016. LNCS, vol. 9780, pp. 383–401. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-41540-6_21
Chapter Google Scholar
Desai, A., et al.: Program synthesis using natural language. CoRR abs/1509.00413 (2015). http://arxiv.org/abs/1509.00413
Devlin, J., Uesato, J., Bhupatiraju, S., Singh, R., Mohamed, A.R., Kohli, P.: RobustFill: neural program learning under noisy I/O. arXiv preprint arXiv:1703.07469 (2017)
Elman, J.L.: Finding structure in time. Cogn. Sc. 14(2), 179–211 (1990). https://doi.org/10.1016/0364-0213(90)90002-E. http://www.sciencedirect.com/science/article/pii/036402139090002E
Article Google Scholar
Feser, J.K., Chaudhuri, S., Dillig, I.: Synthesizing data structure transformations from input-output examples. In: Proceedings of the 36th ACM SIGPLAN Conference on Programming Language Design and Implementation, PLDI 2015, pp. 229–239. ACM, New York (2015). https://doi.org/10.1145/2737924.2737977
Gao, K.: IBM pytorch-seq2seq (2017). https://github.com/IBM/pytorch-seq2seq/
Granlund, T., Kenner, R.: Eliminating branches using a superoptimizer and the GNU C compiler. In: Proceedings of the ACM SIGPLAN 1992 Conference on Programming Language Design and Implementation, PLDI 1992, pp. 341–352. ACM, New York (1992). https://doi.org/10.1145/143095.143146
Graves, A.: Generating sequences with recurrent neural networks. CoRR abs/1308.0850 (2013). http://arxiv.org/abs/1308.0850
Gulwani, S.: Dimensions in program synthesis. In: Proceedings of the 12th International ACM SIGPLAN Symposium on Principles and Practice of Declarative Programming, PPDP 2010, pp. 13–24. ACM, New York (2010). https://doi.org/10.1145/1836089.1836091
Gulwani, S.: Automating string processing in spreadsheets using input-output examples. In: Proceedings of the 38th Annual ACM SIGPLAN-SIGACT Symposium on Principles of Programming Languages, POPL 2011, pp. 317–330. ACM, New York (2011). https://doi.org/10.1145/1926385.1926423
Gulwani, S.: Applications of program synthesis to end-user programming and intelligent tutoring systems. In: Proceedings of the Companion Publication of the 2014 Annual Conference on Genetic and Evolutionary Computation, GECCO Comp 2014, pp. 5–6. ACM, New York (2014). https://doi.org/10.1145/2598394.2598397
Gulwani, S., Jha, S., Tiwari, A., Venkatesan, R.: Synthesis of loop-free programs. In: Proceedings of the 32nd ACM SIGPLAN Conference on Programming Language Design and Implementation, PLDI 2011, pp. 62–73. ACM, New York (2011). https://doi.org/10.1145/1993498.1993506
Gulwani, S., Jojic, N.: Program verification as probabilistic inference. In: Proceedings of the 34th Annual ACM SIGPLAN-SIGACT Symposium on Principles of Programming Languages, POPL 2007, pp. 277–289. ACM, New York (2007). https://doi.org/10.1145/1190216.1190258
Gulwani, S., Polozov, O., Singh, R.: Program synthesis. Found. Trends® Program. Lang. 4(1–2), 1–119 (2017). https://doi.org/10.1561/2500000010
Article Google Scholar
Jojic, V., Gulwani, S., Jojic, N.: Probabilistic inference of programs from input/output examples, April 2018
Google Scholar
Joshi, R., Nelson, G., Randall, K.: Denali: A goal-directed superoptimizer. In: Proceedings of the ACM SIGPLAN 2002 Conference on Programming Language Design and Implementation, PLDI 2002, pp. 304–314. ACM, New York (2002). https://doi.org/10.1145/512529.512566
Katz, G., Peled, D.: Genetic programming and model checking: synthesizing new mutual exclusion algorithms. In: Cha, S.S., Choi, J.-Y., Kim, M., Lee, I., Viswanathan, M. (eds.) ATVA 2008. LNCS, vol. 5311, pp. 33–47. Springer, Heidelberg (2008). https://doi.org/10.1007/978-3-540-88387-6_5
Chapter MATH Google Scholar
Kuniyoshi, Y., Inaba, M., Inoue, H.: Learning by watching: extracting reusable task knowledge from visual observation of human performance. IEEE Trans. Robot. Autom. 10(6), 799–822 (1994). https://doi.org/10.1109/70.338535
Article Google Scholar
Lamb, A., Goyal, A., Zhang, Y., Zhang, S., Courville, A., Bengio, Y.: Professor forcing: a new algorithm for training recurrent networks. ArXiv e-prints, October 2016
Google Scholar
Massalin, H.: Superoptimizer: a look at the smallest program. In: Proceedings of the Second International Conference on Architectural Support for Programming Languages and Operating Systems, ASPLOS II, pp. 122–126. IEEE Computer Society Press, Los Alamitos (1987). https://doi.org/10.1145/36206.36194
Paszke, A., et al.: Automatic differentiation in PyTorch. In: NIPS-W (2017)
Google Scholar
Phothilimthana, P.M., Thakur, A., Bodik, R., Dhurjati, D.: Greenthumb: superoptimizer construction framework. In: Proceedings of the 25th International Conference on Compiler Construction, CC 2016, pp. 261–262. ACM, New York (2016). https://doi.org/10.1145/2892208.2892233
Phothilimthana, P.M., Thakur, A., Bodik, R., Dhurjati, D.: Scaling up superoptimization. SIGPLAN Not. 51(4), 297–310 (2016). https://doi.org/10.1145/2954679.2872387
Article Google Scholar
Polozov, O., Gulwani, S.: FlashMeta: a framework for inductive program synthesis. In: Proceedings of the 2015 ACM SIGPLAN International Conference on Object-Oriented Programming, Systems, Languages, and Applications, OOPSLA 2015, pp. 107–126. ACM, New York (2015). https://doi.org/10.1145/2814270.2814310
Ruck, D.W., Rogers, S.K., Kabrisky, M., Oxley, M.E., Suter, B.W.: The multilayer perceptron as an approximation to a Bayes optimal discriminant function. IEEE Trans. Neural Netw. 1(4), 296–298 (1990). https://doi.org/10.1109/72.80266
Article Google Scholar
Rumelhart, D.E., Hinton, G.E., Williams, R.J.: Learning representations by back-propagating errors. In: Neurocomputing: Foundations of Research, pp. 696–699. MIT Press, Cambridge (1988). http://dl.acm.org/citation.cfm?id=65669.104451
Schkufza, E., Sharma, R., Aiken, A.: Stochastic superoptimization. In: Proceedings of the Eighteenth International Conference on Architectural Support for Programming Languages and Operating Systems, ASPLOS 2013, pp. 305–316. ACM, New York (2013). https://doi.org/10.1145/2451116.2451150
Schkufza, E., Sharma, R., Aiken, A.: Stochastic optimization of floating-point programs with tunable precision. In: Proceedings of the 35th ACM SIGPLAN Conference on Programming Language Design and Implementation, PLDI 2014, pp. 53–64. ACM, New York (2014). https://doi.org/10.1145/2594291.2594302
Solar-Lezama, A., Rabbah, R., Bodík, R., Ebcioğlu, K.: Programming by sketching for bit-streaming programs. In: Proceedings of the 2005 ACM SIGPLAN Conference on Programming Language Design and Implementation, PLDI 2005, pp. 281–294. ACM, New York (2005). https://doi.org/10.1145/1065010.1065045
Solar-Lezama, A., Tancau, L., Bodik, R., Seshia, S., Saraswat, V.: Combinatorial sketching for finite programs. In: Proceedings of the 12th International Conference on Architectural Support for Programming Languages and Operating Systems, ASPLOS XII, pp. 404–415. ACM, New York (2006). https://doi.org/10.1145/1168857.1168907
Sutskever, I., Vinyals, O., Le, Q.V.: Sequence to sequence learning with neural networks. CoRR abs/1409.3215 (2014). http://arxiv.org/abs/1409.3215
Udupa, A., Raghavan, A., Deshmukh, J.V., Mador-Haim, S., Martin, M.M., Alur, R.: TRANSIT: specifying protocols with concolic snippets. In: Proceedings of the 34th ACM SIGPLAN Conference on Programming Language Design and Implementation, PLDI 2013, pp. 287–296. ACM, New York (2013). https://doi.org/10.1145/2491956.2462174
Ruck, D.W., Rogers, S., Kabrisky, M.: Feature selection using a multilayer perceptron 2, July 1993
Google Scholar
Weimer, W., Nguyen, T., Le Goues, C., Forrest, S.: Automatically finding patches using genetic programming. In: Proceedings of the 31st International Conference on Software Engineering, ICSE 2009, pp. 364–374. IEEE Computer Society, Washington, DC (2009). https://doi.org/10.1109/ICSE.2009.5070536
Williams, R.J.: Simple statistical gradient-following algorithms for connectionist reinforcement learning. Mach. Learn. 8(3), 229–256 (1992). https://doi.org/10.1007/BF00992696
Article MathSciNet MATH Google Scholar
Zheng, S., et al.: Conditional random fields as recurrent neural networks. CoRR abs/1502.03240 (2015). http://arxiv.org/abs/1502.03240
Zohar, A., Wolf, L.: Automatic program synthesis of long programs with a learned garbage collector. CoRR abs/1809.04682 (2018). http://arxiv.org/abs/1809.04682

Download references

Acknowledgments

This work started as a class project for Program Synthesis taught by Dr. Keshav Pingali at UT Austin. The authors are grateful for his valuable input.

This research was partially supported by the US National Science Foundation under Grant Nos. CCF-1704790 and CCF-1718903.

Author information

Authors and Affiliations

University of Texas, Austin, USA
Shikhar Singh, Mengshi Zhang & Sarfraz Khurshid

Authors

Shikhar Singh
View author publications
You can also search for this author in PubMed Google Scholar
Mengshi Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Sarfraz Khurshid
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Shikhar Singh .

Editor information

Editors and Affiliations

Avast Software, Prague, Czech Republic
Fabrizio Biondi
Université Catholique de Louvain, Louvain-la-Neuve, Belgium
Thomas Given-Wilson
Université Catholique de Louvain, Louvain-la-Neuve, Belgium
Axel Legay

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Singh, S., Zhang, M., Khurshid, S. (2019). Learning Guided Enumerative Synthesis for Superoptimization. In: Biondi, F., Given-Wilson, T., Legay, A. (eds) Model Checking Software. SPIN 2019. Lecture Notes in Computer Science(), vol 11636. Springer, Cham. https://doi.org/10.1007/978-3-030-30923-7_10

Download citation

DOI: https://doi.org/10.1007/978-3-030-30923-7_10
Published: 02 October 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-30922-0
Online ISBN: 978-3-030-30923-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics