Query-Driven Knowledge-Sharing for Data Integration and Collaborative Data Science
Writing effective analytical queries requires data scientists to have in-depth knowledge of the existence, semantics, and usage context of data sources. Once gathered, such knowledge is informally shared within a specific team of data scientists, but usually is neither formalized nor shared with other teams. Potential synergies remain unused. We introduce our novel approach of Query-driven Knowledge-Sharing Systems (QKSS). A QKSS extends a data management system with knowledge-sharing capabilities to facilitate user collaboration without altering data analysis workflows. Collective knowledge from the query log is extracted to support data source discovery and data integration. Knowledge is formalized to enable its sharing across data scientist teams.
- 2.Eberius, J., Thiele, M., Braunschweig, K., Lehner, W.: DrillBeyond: processing multi-result open world SQL queries. In: SSDBM 2015 (2015)Google Scholar
- 3.Eirinaki, M., Abraham, S., Polyzotis, N., Shaikh, N.: QueRIE: collaborative database exploration. KDE 26(7), 1778–1790 (2014)Google Scholar
- 5.Khoussainova, N., Kwon, Y., Balazinska, M., Suciu, D.: SnipSuggest: context-aware autocompletion for SQL. PVLDB 4(1), 22–33 (2010)Google Scholar
- 6.Li, F., Pan, T., Jagadish, H.V.: Schema-free SQL. In: SIGMOD 2014 (2014)Google Scholar
- 7.Wahl, A.M.: A minimally-intrusive approach for query-driven data integration systems. In: ICDEW 2016 (2016)Google Scholar