Abstract
In this roundtable article for the special section on software systems, the section's six guest editors, all research leaders in the field, discuss important issues that will shape its future research directions. Their essays cover research opportunities and challenges for large-scale software systems: querying organization-wide software behaviors (Xusheng Xiao), logging and log analysis (Jian-Guang Lou), engineering reliable cloud distributed systems (Shan Lu), usage data (David C. Shepherd), clone detection and management (Xin Peng), and code search and beyond (Qian-Xiang Wang).
Tao Xie, Leading Editor of Software Systems
Cite this article
Xiao, X., Lou, JG., Lu, S. et al. Roundtable: Research Opportunities and Challenges for Large-Scale Software Systems. J. Comput. Sci. Technol. 31, 851–860 (2016). https://doi.org/10.1007/s11390-016-1668-9