Advertisement

A Combined Data Analytics and Network Science Approach for Smart Real Estate Investment: Towards Affordable Housing

  • E. Sandeep Kumar
  • Viswanath Talasila
Chapter
Part of the EAI/Springer Innovations in Communication and Computing book series (EAISICC)

Abstract

Sophisticated tools for smart management and public services are crucial aspects of smart cities and especially affordable housing. In this context, a novel algorithm is introduced, which assists a user to identify locations for real estate investment. The methodology involves an application of data analytics for selection of top attributes of real estate for a user, and based on these attributes stacks of machine learning algorithms like decision trees, principal component analysis (PCA), and K-means clustering identify the location for investment. While data analytics comprising statistical modeling and machine learning techniques can compute the important attributes and thereby identify locations, it is nontrivial to get good insight at the scale of a large complex network consisting of hundreds of attributes and locations. This is mainly due to the underlying assumptions of i.i.d (independent and identically distributed) on random variables of many learning algorithms. Network science provides the necessary tools to analyze interactions and relations among entities in large networks considering the interdependencies of variables. In this chapter, a network created from the locations outputted by machine learning layers is described that utilizes network measures like eigen centrality that helps a user to determine the best location for investment, while providing deeper insight into the location identification problem. In addition, simulation of network dynamics provides the most influential and stable attribute of the designed real estate complex network, in the presence of the random link weight perturbations.

Real estate investment comprises many attributes that can be categorized into social, economic, governmental, and environmental. Of all these, only real estate factors are considered in this work. However, the same work can be extended to other factors as well.

Keywords

Data analytics Network science Machine learning Real estate investment 

Notes

Acknowledgements

Authors would like to thank Dr. Naphtali Rishe and Dr. S.S. Iyengar of School of Computing and Information Sciences, Florida International University, Miami, Florida, for providing the database and valuable suggestions throughout this work.

References

  1. 1.
    Internet document—“Affordable Housing in India”, An Inclusive Approach to Sheltering the Bottom of the PyramidGoogle Scholar
  2. 2.
    J.F. Schram, Real Estate Appraisal (Rockwell, Bellevue, 2006)Google Scholar
  3. 3.
    D.H. Carr, J. Lawson, J. Schultz, Dearborn Real Estate Education, Mastering Estate Appraisal (Dearborn Financial Publications, Chicago, 2003)Google Scholar
  4. 4.
    Y. Zhang, S. Liu, S. He, Z. Fang, Forecasting research on real-estate prices in Shanghai, in: 2009 International Conference on Grey Systems and Intelligent Services (GSIS 2009), Nanjing, 2009, pp. 625-629Google Scholar
  5. 5.
    W. Wei, T. Guang-ji, Z. Hong-rui, Empirical analysis on the housing price in Harbin City based on hedonic model, in: 2010 International conference on Management Science and Engineering 17th Annual Conference Proceeding, Melbourne, VIC, 2010, pp. 1659–1664Google Scholar
  6. 6.
    B. park, J.K. Bae, Using machine learning algorithms for housing price prediction: “The case of Fairfax County”, Virginia housing data. Expert Syst. Appl. 42(6), 2928–2934 (2015). ISSN: 0957-4174CrossRefGoogle Scholar
  7. 7.
    H. Xue, The prediction on residential real estate price based on BPNN, in: 2015 8th International Conference on Intelligent Computation Technology and Automation (ICICTA), Nanchang, 2015, pp. 1008–1013Google Scholar
  8. 8.
    B. Liu, B. Mavrin, D. Niu, L. Kong, House price modeling over heterogeneous regions with hierarchical spatial functional analysis, in: 2016 IEEE 16th International Conference on Data Mining (ICDM), Barcelona, 2016, pp. 1047–1052Google Scholar
  9. 9.
    C. Cheng, X. Cheng, M. Yuan, K. Chao, S. Zhou, J. Gao, L. Xu, T. Zhang, A novel architecture and machine learning algorithm for real estate. Signal Inf. Process. Netw. Comput. 473, 491–499 (2017). Springer, Singapore. Lecture Notes in Electrical EngineeringGoogle Scholar
  10. 10.
    Kecheng Zhao, Wei Shen, Spatial characteristic with individual house properties and multilevel approach to hedonic models, in: 2011 International Conference on Computer Science and Service System (CSSS), Nanjing, 2011, pp. 2579–2582Google Scholar
  11. 11.
    T. Oladunni, S. Sharma, Hedonic housing theory—a machine learning investigation, in: 2016 15th IEEE International Conference on Machine Learning and Applications (ICMLA), Anaheim, CA, 2016, pp. 522–527. doi:  https://doi.org/10.1109/ICMLA.2016.0092
  12. 12.
    B. Park, J.K. Bae, Using machine learning algorithms for housing price prediction. Expert Syst. Appl. 42, 2928–2934 (2015)CrossRefGoogle Scholar
  13. 13.
    I.D. Wilson, S.D. Paris, J.A. Ware, D.H. Jenkins, Residential property price time series forecasting with neural networks, in: The Twenty-First SGES International Conference on Knowledge Based Systems and Applied Artificial Intelligence, Cambridge, December 2001, pp. 17–28, Springer PublicationsGoogle Scholar
  14. 14.
    H. Xu, A. Gade, Smart real estate assessments using structured deep neural networks, in: 2017 IEEE SmartWorld, Ubiquitous Intelligence & Computing, Advanced & Trusted Computed, Scalable Computing & Communications, Cloud & Big Data Computing, Internet of People and Smart City Innovation (SmartWorld/SCALCOM/UIC/ATC/CBDCom/IOP/SCI), San Francisco, CA, 2017, pp. 1-7Google Scholar
  15. 15.
    S. Lu, Z. Li, Z. Qin, X. Yang, R.S.M. Goh, A hybrid regression technique for house prices prediction, in: 2017 IEEE International Conference on Industrial Engineering and Engineering Management (IEEM), Singapore, 2017, pp. 319–323Google Scholar
  16. 16.
    D. Sangani, K. Erickson, M. A. Hasan, Predicting zillow estimation error using linear regression and gradient boosting, in: 2017 IEEE 14th International Conference on Mobile Ad Hoc and Sensor Systems (MASS), Orlando, FL, 2017, pp. 530–534Google Scholar
  17. 17.
    W.T. Lim, L. Wang, Y. Wang, Q. Chang, Housing price prediction using neural networks, in: 2016 12th International Conference on Natural Computation, Fuzzy Systems and Knowledge Discovery (ICNC-FSKD), Changsha, 2016, pp. 518–522Google Scholar
  18. 18.
    J. Demongeot, H. Pempelfort, J.M. Martinez, R. Vallejos, M. Barria, C. Taramasco, Information design of biological networks: application to genetic, immunologic, metabolic and social networks, in: 2013 27th International Conference on Advanced Information Networking and Applications Workshops, Barcelona, 2013, pp. 1533–1540Google Scholar
  19. 19.
    D.P. Cheung, M.H. Gunes, A complex network analysis of the United States air transportation, in: 2012 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, Istanbul, 2012, pp. 699–701Google Scholar
  20. 20.
    Q. Xuan, Z.Y. Zhang, C. Fu, H.X. Hu, V. Filkov, Social synchrony on complex networks. IEEE Trans. Cybernetics 48(5), 1420–1431 (2018)CrossRefGoogle Scholar
  21. 21.
  22. 22.
  23. 23.
    J. Wang, Pearson correlation coefficient, in Encyclopedia of Systems Biology, ed. by W. Dubitzky, O. Wolkenhauer, K. H. Cho, H. Yokota, (Springer, New York, NY, 2013)Google Scholar
  24. 24.
    The data for our work was taken from: www.terrafly.com/
  25. 25.
    G. Skourletopoulos et al., Big data and cloud computing: a survey of the state-of-the-art and research challenges, in Advances in Mobile Cloud Computing and Big Data in the 5G Era. Studies in Big Data, ed. by C. Mavromoustakis, G. Mastorakis, C. Dobre, vol. 22, (Springer, Cham, 2017)Google Scholar
  26. 26.
    Alan Said, Data Science in Practice (Springer Publications, 2019)Google Scholar
  27. 27.
    E. Hart, Biological networks, in Encyclopedia of Astrobiology, ed. by R. Amils et al., (Springer, Berlin, Heidelberg, 2014)Google Scholar
  28. 28.
    V. Latora, V. Nicosia, G. Russo, Complex Networks: Principles, Methods and Applications (Cambridge University Press, Cambridge, UK, 2017)CrossRefGoogle Scholar
  29. 29.
    S. Han, Y. Ko, S. Kim, D.H. Shin, Home sales index prediction model based on cluster and principal component statistical approaches in a big data analytic concept. KSCE J. Civil Eng. 21(1), 67–75 (2017). Springer publicationsCrossRefGoogle Scholar
  30. 30.
    T. Oladunni, S. Sharma, Spatial dependency and hedonic housing regression model, in: 2016 15th IEEE International Conference on Machine Learning and Applications (ICMLA), Anaheim, CA, 2016, pp. 553–558Google Scholar
  31. 31.
    V. Del Giudice, P. De Paola, G.B. Cantisani, Valuation of real estate investments through fuzzy logic. Buildings 7, 26 (2017)CrossRefGoogle Scholar
  32. 32.
    C. Bagnoli, H.C. Smith, The theory of fuzzy logic and its application to real estate valuation. J. Real Estate Res. 16(2), 169–200 (1998). American Real Estate SocietyGoogle Scholar
  33. 33.
    W. Didimo, G. Liotta, F. Montecchiani, Network visualization for financial crime detection. J. Vis. Lang. Comput. 25(4), 433–451 (2014).  https://doi.org/10.1016/j.jvlc.2014.01.002CrossRefGoogle Scholar
  34. 34.
    S. Perera, M.G.H. Bell, M.C.J. Bliemer, Network science approach to modelling the topology and robustness of supply chain networks: a review and perspective. Appl. Netw. Sci. 2, 2–35 (2017).  https://doi.org/10.1007/s41109-017-0053-0CrossRefGoogle Scholar
  35. 35.
    Z. Wang, J. Han, Visualization of the UK stock market based on complex networks for company’s revenue forecast, in Information and Knowledge Management in Complex Systems. ICISO 2015. IFIP Advances in Information and Communication Technology, ed. by K. Liu, K. Nakata, W. Li, D. Galarreta, vol. 449, (Springer, Cham, 2015)Google Scholar
  36. 36.
  37. 37.
  38. 38.
  39. 39.
    Y. Dong, Value ranges of Spearman’s Rho and Kendall’s Tau of a class of copulas, in: 2010 International Conference on Computational and Information Sciences, Chengdu, 2010, pp. 182-185. doi:  https://doi.org/10.1109/ICCIS.2010.335
  40. 40.
    S.E. Kumar, V. Talasila, N. Rishe, T.V.S. Kumar, S.S. Iyengar, Location identification for real estate investment using data analytics. Int. J. Data Sci. Analytics, 1–25 (2019)Google Scholar
  41. 41.
    M.J. Moshkov, Time complexity of decision trees, in Transactions on Rough Sets III, ed. by J. F. Peters, A. Skowron, (Springer, Berlin, 2005), pp. 244–459CrossRefGoogle Scholar
  42. 42.
    D. Hu, Q. Liu, Q. Yan, Decision tree merging branches algorithm based on equal predictability, in: 2009 International Conference on Artificial Intelligence and Computational Intelligence, Shanghai, 2009, pp. 214–218Google Scholar
  43. 43.
    O.Z. Maimon, R. Lior, Data Mining with Decision Trees: Theory and Applications, 2nd edn. (World Scientific, Singapore, 2015)zbMATHGoogle Scholar
  44. 44.
    S. Sehgal, H. Singh, M. Agarwal, V. Bhasker and Shantanu, Data analysis using principal component analysis, in: 2014 International Conference on Medical Imaging, m-Health and Emerging Communication Systems (MedCom), Greater Noida, 2014, pp. 45–48Google Scholar
  45. 45.
    G.A. Wilkin, X. Huang, K-means clustering algorithms: implementation and comparison, in: Second International Multi-Symposiums on Computer and Computational Sciences (IMSCCS 2007), Iowa City, IA, 2007, pp. 133–136Google Scholar
  46. 46.
    I. Scholtes, Understanding complex systems: when big data meets network science. IT—Information Technology 57(4), 252–256 (2015).  https://doi.org/10.1515/itit-2015-0012CrossRefGoogle Scholar
  47. 47.
    A. Bihari, M. K. Pandia, Eigenvector centrality and its application in research professionals’ relationship network, in: 2015 International Conference on Futuristic Trends on Computational Analysis and Knowledge Management (ABLAZE), Noida, 2015, pp. 510–514Google Scholar
  48. 48.
    F. Grando, D. Noble, L. C. Lamb, An analysis of centrality measures for complex and social networks, in: 2016 IEEE Global Communications Conference (GLOBECOM), Washington, DC, 2016, pp. 1–6Google Scholar

Copyright information

© Springer Nature Switzerland AG 2020

Authors and Affiliations

  • E. Sandeep Kumar
    • 1
  • Viswanath Talasila
    • 1
  1. 1.Department of Telecommunication EngineeringM.S Ramaiah Institute of TechnologyBengaluruIndia

Personalised recommendations