Skip to main content

Technological Roadmap

  • Chapter
  • First Online:
Scalable Big Data Analytics for Protein Bioinformatics

Part of the book series: Computational Biology ((COBO,volume 28))

  • 831 Accesses

Abstract

Scientific solutions presented in this book rely on various technologies that emerged in computer science. Some of them emerged recently and are quite new in the bioinformatics field. Some of them are widely used in developing efficient and reliable IT systems supporting various forms of business for many years, but are not frequently used in bioinformatics. This chapter provides a technological road map for solutions presented in this book. It covers a brief introduction to the concept of cloud computing, cloud service, and deployment models. It also defines the Big Data challenge and presents benefits of using multi-threading in scientific computations. It then explains graphics processing units (GPU) and CUDA architecture. Finally, it focuses on relational databases and the SQL language used for declarative querying.

Technology is a useful servant but a dangerous master

Christian Lous Lange

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Bai, C., Dhavale, D., Sarkis, J.: Complex investment decisions using rough set and Fuzzy C-means: an example of investment in green supply chains. Eur. J. Oper. Res. 248(2), 507–521 (2016)

    Article  MathSciNet  Google Scholar 

  2. Bondi, A.: Characteristics of scalability and their impact on performance. In: 2nd International Workshop on Software and Performance, WOSP 2000, pp. 195–203 (2000)

    Google Scholar 

  3. Chang, H., Mishra, N., Lin, C.: IoT Big-Data centred knowledge granule analytic and cluster framework for BI applications: a case base analysis. PLoS ONE 10, 1–23 (2015)

    Google Scholar 

  4. Codd, E.F.: A relational model of data for large shared data banks. Commun. ACM 13(6), 377–387 (1970). https://doi.org/10.1145/362384.362685

    Article  Google Scholar 

  5. Date, C.: An Introduction to Database Systems, 8th edn. Addison-Wesley (2003)

    Google Scholar 

  6. Davis, G.B., Carley, K.M.: Clearing the fog: fuzzy, overlapping groups for social networks. Soc. Netw. 30(3), 201–212 (2008)

    Article  Google Scholar 

  7. De Maio, C., Fenza, G., Loia, V., Senatore, S.: Hierarchical web resources retrieval by exploiting fuzzy formal concept analysis. Inf. Process. Manag. 48(3), 399 – 418 (2012). http://www.sciencedirect.com/science/article/pii/S0306457311000458

  8. De Maio, C., Fenza, G., Loia, V., Parente, M.: Time aware knowledge extraction for microblog summarization on Twitter. Inf. Fusion 28, 60–74 (2016)

    Article  Google Scholar 

  9. Dean, J., Ghemawat, S.: MapReduce: simplified data processing on large clusters. Commun. ACM 51(1), 107–113 (2008). https://doi.org/10.1145/1327452.1327492

    Article  Google Scholar 

  10. Ghemawat, S., Gobioff, H., Leung, S.T.: The google file system. SIGOPS Oper. Syst. Rev. 37(5), 29–43 (2003). https://doi.org/10.1145/1165389.945450

    Article  Google Scholar 

  11. Ghosh, G., Banerjee, S., Yen, N.Y.: State transition in communication under social network: an analysis using fuzzy logic and density based clustering towards Big Data paradigm. Future Gener. Comput. Syst. 65, 207–220 (2016). http://www.sciencedirect.com/science/article/pii/S0167739X16300309

    Article  Google Scholar 

  12. Guo, K., Zhang, R., Kuang, L.: TMR: towards an efficient semantic-based heterogeneous transportation media Big Data retrieval. Neurocomputing 181, 122–131 (2016)

    Article  Google Scholar 

  13. Kundu, S., Pal, S.K.: FGSN: Fuzzy granular social networks model and applications. Inf. Sci. 314, 100–117 (2015). http://www.sciencedirect.com/science/article/pii/S0020025515002388

    Article  Google Scholar 

  14. Kundu, S., Pal, S.: Fuzzy-rough community in social networks. Pattern Recognit. Lett. 67, Part 2, 145–152 (2015). http://www.sciencedirect.com/science/article/pii/S0167865515000537, granular Mining and Knowledge Discovery

  15. Lu, H., Sun, Z., Qu, W., Wang, L.: Real-time corrected traffic correlation model for traffic flow forecasting. Math. Probl. Eng. 2015, 1–7 (2015)

    Google Scholar 

  16. Lu, H., Sun, Z., Qu, W.: Big Data-driven based real-time traffic flow state identification and prediction. Discret. Dyn. Nat. Soc. 2015, 1–11 (2015)

    MathSciNet  Google Scholar 

  17. McKendrick, J.: Cloud computing market hot, but how hot? estimates are all over the map (2012) Accessed 24 Aug 2015. http://www.forbes.com/sites/joemckendrick/2012/02/13/cloud-computing-market-hot-but-how-hot-estimates-are-all-over-the-map/

  18. Mell, P., Grance, T.: The NIST definition of Cloud Computing. Special Publication 800-145 (2011). Accessed 10 Oct 2017. http://nvlpubs.nist.gov/nistpubs/Legacy/SP/nistspecialpublication800-145.pdf

  19. Meng, L., Tan, A., Wunsch, D.: Adaptive scaling of cluster boundaries for large-scale social media data clustering. IEEE Trans. Neur. Net. Lear. 27(12), 2656–2669 (2015)

    Article  Google Scholar 

  20. Mrozek, D., Wieczorek, D., Malysiak-Mrozek, B., Kozielski, S.: PSS-SQL: protein secondary structure - structured query language. In: 2010 Annual International Conference of the IEEE Engineering in Medicine and Biology, pp. 1073–1076 (2010)

    Google Scholar 

  21. Mrozek, D., Małysiak-Mrozek, B., Kłapciński, A.: Cloud4Psi: cloud computing for 3D protein structure similarity searching. Bioinformatics 30(19), 2822–2825 (2014)

    Article  Google Scholar 

  22. Mrozek, D., Gosk, P., Małysiak-Mrozek, B.: Scaling Ab Initio predictions of 3D protein structures in Microsoft Azure cloud. J. Grid Comput. 13, 561–585 (2015)

    Article  Google Scholar 

  23. Mrozek, D., Daniłowicz, P., Małysiak-Mrozek, B.: HDInsight4PSi: Boosting performance of 3D protein structure similarity searching with HDInsight clusters in Microsoft Azure cloud. Inf. Sci. 349–350, 77–101 (2016)

    Article  Google Scholar 

  24. Mrozek, D., Socha, B., Kozielski, S., Małysiak-Mrozek, B.: An efficient and flexible scanning of databases of protein secondary structures. J. Intell. Inf. Syst. 46(1), 213–233 (2016). https://doi.org/10.1007/s10844-014-0353-0

    Article  Google Scholar 

  25. Mrozek, D., Kasprowski, P., Małysiak-Mrozek, B., Kozielski, S.: Life sciences data analysis. Inf. Sci. 384, 86–89 (2017)

    Article  Google Scholar 

  26. National Research Council: Frontiers in Massive Data Analysis. National Academy Press, Washington, D.C. (2013)

    Google Scholar 

  27. NVIDIA CUDA C Programming Guide (2018). http://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html

  28. Sanders, J., Kandrot, E.: CUDA by Example: An Introduction to General-Purpose GPU Programming, 1st edn. Addison-Wesley Professional, Pearson Education Inc, Boston (2010)

    Google Scholar 

  29. Tripathy, B.K., Mittal, D.: Hadoop based uncertain possibilistic kernelized C-means algorithms for image segmentation and a comparative analysis. Appl. Soft Comput. 46, 886–923 (2016)

    Article  Google Scholar 

  30. Wang, Z., Tu, L., Guo, Z., Yang, L.T., Huang, B.: Analysis of user behaviors by mining large network data sets. Future Gener. Comput. Syst. 37, 429–437 (2014)

    Article  Google Scholar 

  31. Wang, C., Li, X., Zhou, X., Wang, A., Nedjah, N.: Soft computing in Big Data intelligent transportation systems. Appl. Soft Comput. 38, 1099–1108 (2016)

    Article  Google Scholar 

  32. Wei, X., Luo, X., Li, Q., Zhang, J., Xu, Z.: Online comment-based hotel quality automatic assessment using improved fuzzy comprehensive evaluation and Fuzzy Cognitive Map. IEEE Trans. Fuzzy Syst. 23(1), 72–84 (2015)

    Article  Google Scholar 

  33. Zhong, Y., Zhang, L., Xing, S., Li, F., Wan, B.: The Big Data processing algorithm for water environment monitoring of the three gorges reservoir area. In: Abstract and Applied Analysis 2014 (2014)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Dariusz Mrozek .

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Nature Switzerland AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Mrozek, D. (2018). Technological Roadmap. In: Scalable Big Data Analytics for Protein Bioinformatics. Computational Biology, vol 28. Springer, Cham. https://doi.org/10.1007/978-3-319-98839-9_2

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-98839-9_2

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-98838-2

  • Online ISBN: 978-3-319-98839-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics