Definition
Non-conventional data are any kind of data that are useful for business intelligence (BI) but that cannot be directly managed with traditional data warehousing (DW) technology. Non-conventional data cover a great variety of user-generated contents such as domain knowledge, corporate documents like contracts and e-mails, news feeds, messages posted on social media, and so on. Non-conventional data sources have in common a semi-structured, dynamic, and text-rich nature, which make difficult their integration within traditional corporate information systems (including data warehouses). Nowadays, non-conventional data mainly reside in the Web, adopting the standard formats proposed by the World Wide Web Consortium (W3C), like HTML, XML, RSS, RDF, etc. This entry is focused on those approaches aimed to either integrate non-conventional data with traditional DW/OLAP or to perform ad hoc DW of these data sources. This entry does not account for approaches that adopt Web languages...
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsRecommended Reading
Abelló A, Romero O, Pedersen TB, Berlanga R, Nebot V, Aramburu MJ, Simitsis A. Using semantic web technologies for exploratory OLAP: a survey. IEEE Trans Knowl Eng. 2015;27(2):571–588.
Bhide MA, Gupta A, Gupta R, Roy P, Mohania MK, Ichhaporia Z. LIPTUS: associating structured and unstructured information in a banking environment. In: Proceedings of the 2007 ACM SIGMOD International Conference on Management of Data; 2007. p. 915–24.
Castellanos M, Wang S, Dayal U, Gupta C. SIE-OBI: a streaming information extraction platform for operational business intelligence. In: Proceedings of the 2010 ACM SIGMOD International Conference on Management of Data; 2010. p. 1105–10.
Chakaravarth, VT, Gupta H, Roy P, Mohania M. Efficiently linking text documents with relevant structured information. In: Proceedings of the 32nd International Conference on Very Large Data Bases; 2006. p. 667–78.
Francia M, Golfarelli M, Rizzi S. A methodology for social BI. In: Proceedings of the 18th International Database Engineering & Applications Symposium; 2014. p. 207–16.
García-Moya L, Kudama S, Aramburu MJ, Berlanga R. Storing and analysing voice of the market data in the corporate data warehouse. Inf Syst Front. 2013;15(3):331–49.
Löser A, Nagel C, Pieper S, Boden C. Beyond search: retrieving complete tuples from a text-database. Inf Syst Front. 2013;15(3):311–29.
Mothe J, Chrisment C, Dousset B, Alaux J. DocCube: multi-dimensional visualisation and exploration of large document sets. J Am Soc Inf Sci Technol. 2003;54(7):650–9.
Nebot V, Berlanga R. Building data warehouses with semantic web data. Decis Support Syst. 2012;52(4):853–68.
Park B, Song I. Incorporating text OLAP in business intelligence. In: Zorrilla M, Mazón J, Ferrández Ó, Garrigós I, Daniel F, Trujillo J, editors. Business intelligence applications and the web: models, systems and technologies. Hershey: IGI Global; 2012.p. 77–101.
Pérez JM, Berlanga R, Aramburu MJ, Pedersen TB. Contextualizing data warehouses with documents. Decis Support Syst. 2008;45(1):77–94.
Pérez JM, Berlanga R, Aramburu MJ, Pedersen TB. Integrating data warehouses with web data: a survey. IEEE Trans Knowl Data Eng. 2008;20(7):940–55.
Simitsis A, Baid A, Sismanis Y, Reinwald B. Multidimensional content exploration. Proc VLDB Endow. 2008;1(1):660–71.
Tseng FSC, Chou AYH. The concept of document warehousing for multi-dimensional modeling of textual-based business intelligence. Decis Support Syst. 2006;42(2):727–44.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Section Editor information
Rights and permissions
Copyright information
© 2018 Springer Science+Business Media, LLC, part of Springer Nature
About this entry
Cite this entry
Berlanga, R. (2018). Data Warehousing on Nonconventional Data. In: Liu, L., Özsu, M.T. (eds) Encyclopedia of Database Systems. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-8265-9_80670
Download citation
DOI: https://doi.org/10.1007/978-1-4614-8265-9_80670
Published:
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4614-8266-6
Online ISBN: 978-1-4614-8265-9
eBook Packages: Computer ScienceReference Module Computer Science and Engineering