Data integration for materials research
- 1.2k Downloads
A new data science initiative in materials research has been launched at The Johns Hopkins University within the Materials in Extreme Dynamic Environments (MEDE) Collaborative Research Alliance (CRA). Our first goal is to build a solution that facilitates seamless data sharing among MEDE scientists. We expect to shorten the design and development cycle of new materials by providing integrated storage, database, and analysis services, building on proven components of the SciServer project developed at the Institute for Data Intensive Engineering and Science (IDIES).
Here we present our system design and demonstrate the power of our approach through a use-case that enables easy comparison of simulations and measurements. This prototype effort, focusing on boron carbide (BC), brings together multiple materials research elements in the Ceramics group within the MEDE CRA.
Discussion and evaluation
The SciServer platform offers single-sign on access to various general purpose data analysis tools familiar to materials scientists in MEDE. During the case study deployment, users appreciated the simple data file upload process, automated database ingestion, and platform applicability to both students of the art and power users.
From our case study experience in aggregating data from both simulations and physical experiments, we developed a template workflow from which a user may run a common data comparison task outright or customize to another purpose. Next, we turn to acquiring data from more MEDE groups and expanding the user base to the Metals group.
KeywordsMaterials research Data science Infrastructure
Research was sponsored by the Army Research Laboratory and was accomplished under Cooperative Agreement Number W911NF-12-2-0022. The views and conclusions contained in this document are those of the authors and should not be interpreted as representing the official policies, either expressed or implied, of the Army Research Laboratory or the US Government. The US Government is authorized to reproduce and distribute reprints for Government purposes notwithstanding any copyright notation herein.
- 1.SciServer: Collaborative Data Driven Science. https://doi.org/www.sciserver.org/. Accessed 5 Jan 2016.
- 2.SciServer: Big Data Infrastructure for Science. https://doi.org/nsf.gov/discoveries/disc_summ.jsp?cntn_id=133526&org=NSF. Accessed 5 Jan 2016.
- 3.Mishin D, Medvedev D, Plante R, Graham M, Szalay S (2013) Data sharing and publication using the scidrive service. In: Manset N Forshay P (eds)Astronomical Data Analysis Software and Systems XXIII, Waikoloa Beach Marriott, Hawaii, USA September 29–October 3, 2013, vol. 485.. Taylor & Francis Group, Abingdon, England. https://doi.org/www.aspbooks.org/a/volumes/table_of_contents/?book_id=553, http://www.aspbooks.org/publishing_with_asp/.Google Scholar
- 4.Naldi M, Mastroeni L (2013) Cloud storage pricing: a comparison of current practices In: Proceedings of the 2013 International Workshop on Hot Topics in Cloud Services. HotTopiCS ’13, 27.. ACM, New York, NY, USA, doi:https://doi.org/dx.doi.org/10.1145/2462307.2462315. http://doi.acm.org/10.1145/2462307.2462315.CrossRefGoogle Scholar
- 6.Berzins M, Luitjens J, Meng Q, Harman T, Wight CA, Peterson JR (2010) Uintah: A scalable framework for hazard analysis In: Proceedings of the 2010 TeraGrid Conference. TG ’10, 3–138.. ACM, New York, NY, USA, doi:https://doi.org/dx.doi.org/10.1145/1838574.1838577. http://doi.acm.org/10.1145/1838574.1838577.Google Scholar
- 7.Childs H, Brugger E, Whitlock B, Meredith J, Ahern S, Pugmire D, Biagas K, Miller M, Harrison C, Weber GH, Krishnan H, Fogal T, Sanderson A, Garth C, Bethel EW, Camp D, Rübel O, Durant M, Favre JM, Navrátil P (2012) VisIt: an end-user tool for visualizing and analyzing very large data In: High Performance Visualization–Enabling Extreme-Scale Scientific Insight, 357–372.. Taylor & Francis Group, Abingdon, England.Google Scholar
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(https://doi.org/creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.