Abstract
Data Science offers a set of powerful approaches for making new discoveries from large and complex data sets. It combines aspects of mathematics, statistics, machine learning, etc. to turn vast amounts of data into new insights and knowledge. However, the sole use of automatic data science techniques for large amounts of complex data limits the human user’s possibilities in the discovery process, since the user is estranged from the process of data exploration. This chapter describes the importance of Information Visualization (InfoVis) and visual analytics (VA) within data science and how interactive visualization can be used to support analysis and decision-making, empowering and complementing data science methods. Moreover, we review perceptual and cognitive aspects, together with design and evaluation methodologies for InfoVis and VA.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsNotes
- 1.
Soup analogy:
“When the cook tastes other cook’s soups, that’s exploratory.
When the cook tastes his own soup while making it, that’s formative.
When the guests (or food critics) taste the soup, that’s summative.”
References
Abadi, M., Agarwal, A., Barham, P., Brevdo, E., Chen, Z., Citro, C., Corrado, G. S., et al. (2015). TensorFlow: Large-scale machine learning on heterogeneous systems. https://www.tensorflow.org/. Software available from tensorflow.org.
Andrews, K. (2010). Usability evaluation and information visualisation. IV’10 tutorial. In 14th International Conference Information Visualisation, London, UK.
Bae, J., Helldin, T., & Riveiro, M. (2017). Understanding indirect causal relationships in node-link graphs. In Computer graphics forum. https://doi.org/10.1111/cgf.13198.
Bae, J., & Watson, B. (2011). Developing and evaluating quilts for the depiction of large layered graphs. IEEE Transactions on Visualization & Computer Graphics, 17, 2268–2275. https://doi.org/10.1109/TVCG.2011.187.
Bastian, M., Heymann, S., & Jacomy, M. (2009). Gephi: An open source software for exploring and manipulating networks. http://www.aaai.org/ocs/index.php/ICWSM/09/paper/view/154.
Bederson, B. B., & Shneiderman, B. (2003). In: The craft of information visualization, interactive technologies. San Francisco: Morgan kaufmann. https://doi.org/10.1016/B978-1-55860-915-0.50056-1. https://www.sciencedirect.com/science/article/pii/B9781558609150500561.
Belmonte, N.G. (2011). JavaScript InfoVis Toolkit. Retrieved October 7, 2017, from http://philogb.github.io/jit/index.html.
Bertin, J. (1983). Semiology of graphics. University of Wisconsin Press.
Bertin, J. (2011). Semiology of graphics: Diagrams, networks, maps. Esri Press.
Bertini, E., & Santucci, G. (2006). Visual quality metrics. In Proceedings of the 2006 AVI Workshop on Beyond Time and Errors: Novel Evaluation Methods for Information Visualization (pp. 1–5). ACM.
Bostock, M., Ogievetsky, V., & Heer, J. (2011). D3: Data-driven documents. IEEE Transactions on Visualization and Computer Graphics (Proc. InfoVis). http://vis.stanford.edu/papers/d3.
Brehmer, M., & Munzner, T. (2013). A multi-level typology of abstract visualization tasks. IEEE Transactions on Visualization and Computer Graphics, 19(12), 2376–2385.
Card, S., Mackinlay, J. D., & Shneiderman, B. (1999). Readings in information visualization: Using vision to think. London: Academic Press.
Castellanos-Garzón, J. A., García, C. A., Novais, P., & Díaz, F. (2013). A visual analytics framework for cluster analysis of dna microarray data. Expert Systems with Applications, 40(2), 758–774.
Chang, R., Ziemkiewicz, C., Green, T. M., & Ribarsky, W. (2009). Defining insight for visual analytics. IEEE Computer Graphics and Applications, 29(2), 14–17.
Chen, C. (2005). Top 10 unsolved information visualization problems. IEEE Computer Graphics and Applications, 25(4), 12–16. https://doi.org/10.1109/MCG.2005.91. http://dx.doi.org/10.1109/MCG.2005.91.
Chen, C., & Czerwinski, M. (2000). Empirical evaluation of information visualizations: An introduction. International Journal of Human-Computer Studies, 53(5), 631–635.
Endert, A., Ribarsky, W., Turkay, C., Wong, B., Nabney, I., Daz Blanco, I., et al. (2017). The state of the art in integrating machine learning into visual analytics. https://doi.org/10.1111/cgf.13092.
Few, S. (2009). Now you see it: Simple visualization techniques for quantitative analysis. Oakland: Analytics Press.
Goodall, J. R., & Tesone, D. R. (2009). Visual analytics for network flow analysis. In Conference For Homeland Security, 2009. CATCH 2009. Cybersecurity Applications & Technology (pp. 199–204). IEEE.
Purchase, H. C., Andrienko, T, N., Jankun-Kelly, J., & Ward, M. (2008). Theoretical foundations of information visualization (Vol. 4950). Lecture notes in computer science. Berlin: Springer.
Heer, J., Bostock, M., & Ogievetsky, V. (2010). A tour through the visualization zoo. Communications of the ACM, 53(6), 59–67. https://doi.org/10.1145/1743546.1743567. http://doi.acm.org/10.1145/1743546.1743567.
Heer, J., Card, S. K., & Landay, J. (2005) Prefuse: A toolkit for interactive information visualization. In ACM human factors in computing systems (CHI) (pp. 421–430). http://vis.stanford.edu/papers/prefuse.
Inc., P.T. (2015). Collaborative data science. https://plot.ly.
Kay, M., & Heer, J. (2016). Beyond weber’s law: A second look at ranking visualizations of correlation. IEEE Transactions on Visualization and Computer Graphics, 22(1), 469–478.
Keim, D., Kohlhammer, J., Ellis, G., & Mansmann, F. (2010). Mastering the information age: Solving problems with visual analytics. In Eurographics (Vol. 2, p. 5).
Keim, D., Mansmann, F., Oelke, D., & Ziegler, H. (2008) Visual analytics: Combining automated discovery with interactive visualizations. In Discovery science (pp. 2–14). Springer.
Keim, D. A. (2002). Information visualization and visual data mining. IEEE Transactions on Visualization and Computer Graphics, 8(1), 1–8.
Keim, D. A., Mansmann, F., Schneidewind, J., & Ziegler, H. (2006). Challenges in visual data analysis. In Proceedings of the 10th International Conference on Information Visualization (pp. 9–16). IEEE.
Keim, D. A., Munzner, T., Rossi, F., & Verleysen, M. (2015). Bridging information visualization with machine learning (Dagstuhl Seminar 15101). Dagstuhl Reports, 5(3), 1–27. https://doi.org/10.4230/DagRep.5.3.1. http://drops.dagstuhl.de/opus/volltexte/2015/5266.
Kerracher, N., & Kennedy, J. (2017). Constructing and evaluating visualisation task classifications: Process and considerations. In Computer graphics forum (Vol. 36, pp. 47–59). Wiley Online Library.
Kerren, A., & Schreiber, F. (2012). Toward the role of interaction in visual analytics. In Proceedings of the 2012 Winter Simulation Conference (WSC) (pp. 1–13). IEEE.
Kiln.it (2014). In flight. Retrieved October 7, 2017, from https://www.theguardian.com/world/ng-interactive/2014/aviation-100-years.
Kosara, R. (2016). An empire built on sand: Reexamining what we think we know about visualization. In Proceedings of the Sixth Workshop on Beyond Time and Errors on Novel Evaluation Methods for Visualization, BELIV 2016 (pp. 162–168). New York, USA: ACM. https://doi.org/10.1145/2993901.2993909. http://doi.acm.org/10.1145/2993901.2993909.
Liu, S., Cui, W., Wu, Y., & Liu, M. (2014). A survey on information visualization: Recent advances and challenges. The Visual Computer, 30(12), 1373–1393. https://doi.org/10.1007/s00371-013-0892-3. http://dx.doi.org/10.1007/s00371-013-0892-3.
Mansmann, F. (2008). Visual analysis of network traffic: Interactive monitoring, detection, and interpretation of security threats. Ph.D. thesis.
Mark, G., & Kobsa, A. (2005). The effects of collaboration and system transparency on cive usage: An empirical study and model. Presence: Teleoperators and Virtual Environments, 14(1), 60–80. https://doi.org/10.1162/1054746053890279.
Mauri, M., Elli, T., Caviglia, G., Uboldi, G., & Azzi, M. (2017). Rawgraphs: A visualisation platform to create open outputs. In Proceedings of the 12th Biannual Conference on Italian SIGCHI Chapter, CHItaly 2017 (pp. 28:1–28:5). ACM. https://doi.org/10.1145/3125571.3125585. http://doi.acm.org/10.1145/3125571.3125585.
McGuirl, J. M., & Sarter, N. B. (2006). Supporting trust calibration and the effective use of decision aids by presenting dynamic system confidence information. Human Factors, 48(4), 656–665. https://doi.org/10.1518/001872006779166334. PMID: 17240714.
Meyer, M., Munzner, T., & Pfister, H. (2009). Mizbee: A multiscale synteny browser. IEEE Transactions on Visualization and Computer Graphics, 15(6), 897–904.
Meyer, M., Sedlmair, M., & Munzner, T. (2012). The four-level nested model revisited: Blocks and guidelines. In Proceedings of the 2012 BELIV Workshop: Beyond Time and Errors-Novel Evaluation Methods for Visualization (p. 11). ACM.
Meyer, M., Sedlmair, M., Quinan, P. S., & Munzner, T. (2015). The nested blocks and guidelines model. Information Visualization, 14(3), 234–249.
Munzner, T. (2009). A nested model for visualization design and validation. IEEE Transactions on Visualization and Computer Graphics, 15(6).
Munzner, T. (2014). Visualization analysis and design. CRC Press.
Nielsen, C. B., Jackman, S. D., Birol, I., & Jones, S. J. (2009). Abyss-explorer: Visualizing genome sequence assemblies. IEEE Transactions on Visualization and Computer Graphics, 15(6), 881–888.
North, C. (2006). Toward measuring visualization insight. In Position paper for the IEEE VAST Metrics for the Evaluation of Visual Analytics Workshop.
Offermann, P., Levina, O., Schönherr, M., & Bub, U. (2009). Outline of a design science research process. In Proceedings of the 4th International Conference on Design Science Research in Information Systems and Technology (pp. 1–11). New York, USA: ACM. http://doi.acm.org/10.1145/1555619.1555629.
Riveiro, M. (2014). Evaluation of normal model visualization for anomaly detection in maritime traffic. ACM Transactions on Interactive Intelligent Systems (TiiS), 4(1), 5.
Riveiro, M., Helldin, T., Falkman, G., & Lebram, M. (2014). Effects of visualizing uncertainty on decision-making in a target identification scenario. Computers & Graphics, 41, 84–98.
Riveiro, M., Helldin, T., Lebram, M., & Falkman, G. (2013). Towards future threat evaluation systems: User study, proposal and precepts for design. In 2013 16th International Conference on Information Fusion (FUSION) (pp. 1863–1870). IEEE.
Riveiro, M., Lebram, M., & Warston, H. (2014). On visualizing threat evaluation configuration processes: A design proposal. In 2014 17th International Conference on Information Fusion (FUSION) (pp. 1–8). IEEE.
Saraiya, P., North, C., Lam, V., & Duca, K. A. (2006). An insight-based longitudinal study of visual analytics. IEEE Transactions on Visualization and Computer Graphics, 12(6), 1511–1522.
Satyanarayan, A., Moritz, D., Wongsuphasawat, K., & Heer, J. (2017). Vega-lite: A grammar of interactive graphics. IEEE Transactions on Visualization and Computer Graphics (Proc. InfoVis). http://idl.cs.washington.edu/papers/vega-lite.
Schulz, H. J., Nocke, T., Heitzler, M., & Schumann, H. (2013). A design space of visualization tasks. IEEE Transactions on Visualization and Computer Graphics, 19(12), 2366–2375.
Sedlmair, M., Meyer, M., & Munzner, T. (2012). Design study methodology: Reflections from the trenches and the stacks. IEEE Transactions on Visualization and Computer graphics, 18(12), 2431–2440.
Shneiderman, B. (1992). Tree visualization with tree-maps: 2-d space-filling approach. ACM Transactions on Graphics, 11(1), 92–99. http://doi.acm.org/10.1145/102377.115768.
Shneiderman, B. (1996). The eyes have it: A task by data type taxonomy for information visualizations. In Proceedings of the IEEE Symposium on Visual Languages (pp. 336–343). IEEE.
Slingsby, A., Dykes, J., & Wood, J. (2011). Exploring uncertainty in geodemographics with interactive graphics. IEEE Transactions on Visualization and Computer Graphics, 17(12), 2545–2554.
Sun, G. D., Wu, Y. C., Liang, R. H., & Liu, S. X. (2013). A survey of visual analytics techniques and applications: State-of-the-art research and future challenges. Journal of Computer Science and Technology, 28(5), 852–867.
Teets, J. M., Tegarden, D. P., & Russell, R. S. (2010). Using cognitive fit theory to evaluate the effectiveness of information visualizations: An example using quality assurance data. IEEE Transactions on Visualization and Computer Graphics, 16(5), 841–853.
Thomas, J., & Cook, K. (2005). Illuminating the path: The research and development agenda for visual analytics. IEEE Computer Society Press.
Thomas, J., & Cook, K. (2006). A visual analytics agenda. Computer Graphics and Applications, 26(1), 10–13.
Thomas, J., & Kielman, J. (2009). Challenges for visual analytics. Information Visualization, 8(4), 309–314.
Treisman, A., & Gormican, S. (1988). Feature analysis in early vision: Evidence from search asymmetries. Psychological Review, 95(1), 15–48.
Tufte, E. (2001). The visual display of quantitative information. Cheshire, Conn: Graphics Press.
van der Maaten, L., & Hinton, G. (2008). Visualizing high-dimensional data using t-SNE. Journal of Machine Learning Research, 9, 2579–2605.
Vredenburg, K., Mao, J. Y., Smith, P. W., & Carey, T. (2002). A survey of user-centered design practice. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, CHI 2002 (pp. 471–478). New York, USA: ACM.
Ward, M., Grinstein, G., & Keim, D. (2010). Interactive data visualization: Foundations, techniques, and applications. 360 degree business. CRC Press.
Ware, C. (2008). Visual thinking for design (1st ed.). Oxford: Elsevier LTD.
Ware, C. (2012). Information visualization: Perception for design (3rd ed.). Oxford: Elsevier LTD.
Yi, J. S., Kang, Y. a., Stasko, J. T., & Jacko, J. A. (2008). Understanding and characterizing insights: How do people gain insights using information visualization? In Proceedings of the 2008 Workshop on Beyond Time and Errors: Novel Evaluation Methods for Information Visualization (p. 4). ACM.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer International Publishing AG, part of Springer Nature
About this chapter
Cite this chapter
Bae, J., Falkman, G., Helldin, T., Riveiro, M. (2019). Visual Data Analysis. In: Said, A., Torra, V. (eds) Data Science in Practice. Studies in Big Data, vol 46. Springer, Cham. https://doi.org/10.1007/978-3-319-97556-6_8
Download citation
DOI: https://doi.org/10.1007/978-3-319-97556-6_8
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-97555-9
Online ISBN: 978-3-319-97556-6
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)