Abstract
Scientific solutions presented in this book rely on various technologies that emerged in computer science. Some of them emerged recently and are quite new in the bioinformatics field. Some of them are widely used in developing efficient and reliable IT systems supporting various forms of business for many years, but are not frequently used in bioinformatics. This chapter provides a technological road map for solutions presented in this book. It covers a brief introduction to the concept of cloud computing, cloud service, and deployment models. It also defines the Big Data challenge and presents benefits of using multi-threading in scientific computations. It then explains graphics processing units (GPU) and CUDA architecture. Finally, it focuses on relational databases and the SQL language used for declarative querying.
Technology is a useful servant but a dangerous master
Christian Lous Lange
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Bai, C., Dhavale, D., Sarkis, J.: Complex investment decisions using rough set and Fuzzy C-means: an example of investment in green supply chains. Eur. J. Oper. Res. 248(2), 507–521 (2016)
Bondi, A.: Characteristics of scalability and their impact on performance. In: 2nd International Workshop on Software and Performance, WOSP 2000, pp. 195–203 (2000)
Chang, H., Mishra, N., Lin, C.: IoT Big-Data centred knowledge granule analytic and cluster framework for BI applications: a case base analysis. PLoS ONE 10, 1–23 (2015)
Codd, E.F.: A relational model of data for large shared data banks. Commun. ACM 13(6), 377–387 (1970). https://doi.org/10.1145/362384.362685
Date, C.: An Introduction to Database Systems, 8th edn. Addison-Wesley (2003)
Davis, G.B., Carley, K.M.: Clearing the fog: fuzzy, overlapping groups for social networks. Soc. Netw. 30(3), 201–212 (2008)
De Maio, C., Fenza, G., Loia, V., Senatore, S.: Hierarchical web resources retrieval by exploiting fuzzy formal concept analysis. Inf. Process. Manag. 48(3), 399 – 418 (2012). http://www.sciencedirect.com/science/article/pii/S0306457311000458
De Maio, C., Fenza, G., Loia, V., Parente, M.: Time aware knowledge extraction for microblog summarization on Twitter. Inf. Fusion 28, 60–74 (2016)
Dean, J., Ghemawat, S.: MapReduce: simplified data processing on large clusters. Commun. ACM 51(1), 107–113 (2008). https://doi.org/10.1145/1327452.1327492
Ghemawat, S., Gobioff, H., Leung, S.T.: The google file system. SIGOPS Oper. Syst. Rev. 37(5), 29–43 (2003). https://doi.org/10.1145/1165389.945450
Ghosh, G., Banerjee, S., Yen, N.Y.: State transition in communication under social network: an analysis using fuzzy logic and density based clustering towards Big Data paradigm. Future Gener. Comput. Syst. 65, 207–220 (2016). http://www.sciencedirect.com/science/article/pii/S0167739X16300309
Guo, K., Zhang, R., Kuang, L.: TMR: towards an efficient semantic-based heterogeneous transportation media Big Data retrieval. Neurocomputing 181, 122–131 (2016)
Kundu, S., Pal, S.K.: FGSN: Fuzzy granular social networks model and applications. Inf. Sci. 314, 100–117 (2015). http://www.sciencedirect.com/science/article/pii/S0020025515002388
Kundu, S., Pal, S.: Fuzzy-rough community in social networks. Pattern Recognit. Lett. 67, Part 2, 145–152 (2015). http://www.sciencedirect.com/science/article/pii/S0167865515000537, granular Mining and Knowledge Discovery
Lu, H., Sun, Z., Qu, W., Wang, L.: Real-time corrected traffic correlation model for traffic flow forecasting. Math. Probl. Eng. 2015, 1–7 (2015)
Lu, H., Sun, Z., Qu, W.: Big Data-driven based real-time traffic flow state identification and prediction. Discret. Dyn. Nat. Soc. 2015, 1–11 (2015)
McKendrick, J.: Cloud computing market hot, but how hot? estimates are all over the map (2012) Accessed 24 Aug 2015. http://www.forbes.com/sites/joemckendrick/2012/02/13/cloud-computing-market-hot-but-how-hot-estimates-are-all-over-the-map/
Mell, P., Grance, T.: The NIST definition of Cloud Computing. Special Publication 800-145 (2011). Accessed 10 Oct 2017. http://nvlpubs.nist.gov/nistpubs/Legacy/SP/nistspecialpublication800-145.pdf
Meng, L., Tan, A., Wunsch, D.: Adaptive scaling of cluster boundaries for large-scale social media data clustering. IEEE Trans. Neur. Net. Lear. 27(12), 2656–2669 (2015)
Mrozek, D., Wieczorek, D., Malysiak-Mrozek, B., Kozielski, S.: PSS-SQL: protein secondary structure - structured query language. In: 2010 Annual International Conference of the IEEE Engineering in Medicine and Biology, pp. 1073–1076 (2010)
Mrozek, D., Małysiak-Mrozek, B., Kłapciński, A.: Cloud4Psi: cloud computing for 3D protein structure similarity searching. Bioinformatics 30(19), 2822–2825 (2014)
Mrozek, D., Gosk, P., Małysiak-Mrozek, B.: Scaling Ab Initio predictions of 3D protein structures in Microsoft Azure cloud. J. Grid Comput. 13, 561–585 (2015)
Mrozek, D., Daniłowicz, P., Małysiak-Mrozek, B.: HDInsight4PSi: Boosting performance of 3D protein structure similarity searching with HDInsight clusters in Microsoft Azure cloud. Inf. Sci. 349–350, 77–101 (2016)
Mrozek, D., Socha, B., Kozielski, S., Małysiak-Mrozek, B.: An efficient and flexible scanning of databases of protein secondary structures. J. Intell. Inf. Syst. 46(1), 213–233 (2016). https://doi.org/10.1007/s10844-014-0353-0
Mrozek, D., Kasprowski, P., Małysiak-Mrozek, B., Kozielski, S.: Life sciences data analysis. Inf. Sci. 384, 86–89 (2017)
National Research Council: Frontiers in Massive Data Analysis. National Academy Press, Washington, D.C. (2013)
NVIDIA CUDA C Programming Guide (2018). http://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html
Sanders, J., Kandrot, E.: CUDA by Example: An Introduction to General-Purpose GPU Programming, 1st edn. Addison-Wesley Professional, Pearson Education Inc, Boston (2010)
Tripathy, B.K., Mittal, D.: Hadoop based uncertain possibilistic kernelized C-means algorithms for image segmentation and a comparative analysis. Appl. Soft Comput. 46, 886–923 (2016)
Wang, Z., Tu, L., Guo, Z., Yang, L.T., Huang, B.: Analysis of user behaviors by mining large network data sets. Future Gener. Comput. Syst. 37, 429–437 (2014)
Wang, C., Li, X., Zhou, X., Wang, A., Nedjah, N.: Soft computing in Big Data intelligent transportation systems. Appl. Soft Comput. 38, 1099–1108 (2016)
Wei, X., Luo, X., Li, Q., Zhang, J., Xu, Z.: Online comment-based hotel quality automatic assessment using improved fuzzy comprehensive evaluation and Fuzzy Cognitive Map. IEEE Trans. Fuzzy Syst. 23(1), 72–84 (2015)
Zhong, Y., Zhang, L., Xing, S., Li, F., Wan, B.: The Big Data processing algorithm for water environment monitoring of the three gorges reservoir area. In: Abstract and Applied Analysis 2014 (2014)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
Copyright information
© 2018 Springer Nature Switzerland AG
About this chapter
Cite this chapter
Mrozek, D. (2018). Technological Roadmap. In: Scalable Big Data Analytics for Protein Bioinformatics. Computational Biology, vol 28. Springer, Cham. https://doi.org/10.1007/978-3-319-98839-9_2
Download citation
DOI: https://doi.org/10.1007/978-3-319-98839-9_2
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-98838-2
Online ISBN: 978-3-319-98839-9
eBook Packages: Computer ScienceComputer Science (R0)