Modeling and elucidation of housing price
- 11 Downloads
Abstract
It is widely acknowledged that the value of a house is the mixture of a large number of characteristics. House price prediction thus presents a unique set of challenges in practice. While a large body of works are dedicated to this task, their performance and applications have been limited by the shortage of long time span of transaction data, the absence of real-world settings and the insufficiency of housing features. To this end, a time-aware latent hierarchical model is developed to capture underlying spatiotemporal interactions behind the evolution of house prices. The hierarchical perspective obviates the need for historical transaction data of exactly same houses when temporal effects are considered. The proposed framework is examined on a large-scale dataset of the property transaction in Beijing. The whole experimental procedure strictly conforms to the real-world scenario. The empirical evaluation results demonstrate the outperformance of our approach over alternative competitive methods. We also group housing features into both external and internal clusters. The further experiment unveils that external component shapes house prices much more heavily than the internal one does. More interestingly, the inference of latent neighborhood value in our model is empirically shown to be able to lessen the dependence on the critical external cluster of features in house price prediction.
Keywords
House prices Spatiotemporal effects Internal component External component Neighborhood valueNotes
References
- Ahearne AG, Ammer J, Doyle BM, Kole LS, Martin RF (2005) House prices and monetary policy: a cross-country study. In: International finance discussion papers 841Google Scholar
- Bailey MJ, Muth RF, Nourse HO (1963) A regression method for real estate price index construction. J Am Stat Assoc 58(304):933–942CrossRefGoogle Scholar
- Baral R, Li T (2017) Exploiting the roles of aspects in personalized poi recommender systems. Data Min Knowl Discov 32:320–343MathSciNetCrossRefGoogle Scholar
- Besag J (1986) On the statistical analysis of dirty pictures. J R Stat Soc Ser B 48(3):259–302MathSciNetzbMATHGoogle Scholar
- Boyd S, Vandenberghe L (2004) Convex optimization. Cambridge University Press, CambridgeCrossRefzbMATHGoogle Scholar
- Can A (1990) The measurement of neighborhood dynamics in urban house prices. Econ Geogr 66(3):254–272CrossRefGoogle Scholar
- Case B, Pollakowski HO, Wachter SM (1991) On choosing among house price index methodologies. Real Estate Econ 19(3):286–307CrossRefGoogle Scholar
- Case B, Clapp J, Dubin R, Rodriguez M (2004) Modeling spatial and temporal house price patterns: a comparison of four models. J Real Estate Finance Econ 29(2):167–191CrossRefGoogle Scholar
- Case KE, Shiller RJ (1989) The efficiency of the market for single-family homes. Am Econ Rev 79(1):125–137Google Scholar
- Case KE, Shiller RJ et al (1987) Prices of single-family homes since 1970: new indexes for four cities. N Engl Econ Rev (Sept/Oct):45–56Google Scholar
- Chopra S, Thampy T, Leahy J, Caplin A, LeCun Y (2007) Discovering the hidden structure of house prices with a non-parametric latent manifold model. In: Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, pp 173–182Google Scholar
- De Bruyne K, Van Hove J (2013) Explaining the spatial variation in housing prices: an economic geography approach. Appl Econ 45(13):1673–1689CrossRefGoogle Scholar
- Deng D, Shahabi C, Demiryurek U, Zhu L, Yu R, Liu Y (2016) Latent space model for road networks to predict time-varying traffic. In: Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining. ACM, pp 1525–1534Google Scholar
- Fu Y, Ge Y, Zheng Y, Yao Z, Liu Y, Xiong H, Yuan J (2014a) Sparse real estate ranking with online user reviews and offline moving behaviors. In: Data mining (ICDM), 2014 IEEE international conference on. IEEE, pp 120–129Google Scholar
- Fu Y, Xiong H, Ge Y, Yao Z, Zheng Y, Zhou ZH (2014b) Exploiting geographic dependencies for real estate appraisal: a mutual perspective of ranking and clustering. In: Proceedings of the 20th ACM SIGKDD international conference on knowledge discovery and data mining. ACM, pp 1047–1056Google Scholar
- Fu Y, Liu G, Papadimitriou S, Xiong H, Ge Y, Zhu H, Zhu C (2015) Real estate ranking via mixed land-use latent models. In: Proceedings of the 21th ACM SIGKDD international conference on knowledge discovery and data mining. ACM, pp 299–308Google Scholar
- Fu Y, Xiong H, Ge Y, Zheng Y, Yao Z, Zhou ZH (2016) Modeling of geographic dependencies for real estate ranking. ACM Trans Knowl Discov Data 11(1):11CrossRefGoogle Scholar
- Gelfand AE, Ecker MD, Knight JR, Sirmans C (2004) The dynamics of location in home price. J Real Estate Finance Econ 29(2):149–166CrossRefGoogle Scholar
- Goetzmann WN, Peng L (2002) The bias of the RSR estimator and the accuracy of some alternatives. Real Estate Econ 30(1):13–39CrossRefGoogle Scholar
- Goodman AC (1978) Hedonic prices, price indices and housing markets. J Urban Econ 5(4):471–484CrossRefGoogle Scholar
- Gu Z, Gu L, Eils R, Schlesner M, Brors B (2014) Circlize implements and enhances circular visualization in R. Bioinformatics 30(19):2811–2812CrossRefGoogle Scholar
- Hyndman RJ, Koehler AB (2006) Another look at measures of forecast accuracy. Int J Forecast 22(4):679–688CrossRefGoogle Scholar
- Jiang S, Ferreira J, González MC (2012) Clustering daily patterns of human activities in the city. Data Min Knowl Discov 25:478–510MathSciNetCrossRefzbMATHGoogle Scholar
- Liu B, Mavrin B, Niu D, Kong L (2016) House price modeling over heterogeneous regions with hierarchical spatial functional analysis. In: Data mining (ICDM), 2016 IEEE 16th international conference on. IEEE, pp 1047–1052Google Scholar
- Lü L, Zhou T (2011) Link prediction in complex networks: a survey. Physica A 390(6):1150–1170CrossRefGoogle Scholar
- Meese R, Wallace N (1991) Nonparametric estimation of dynamic hedonic price models and the construction of residential housing price indices. Real Estate Econ 19(3):308–332CrossRefGoogle Scholar
- Nagaraja CH, Brown LD, Zhao LH (2011) An autoregressive approach to house price modeling. Ann Appl Stat 5(1):124–149MathSciNetCrossRefzbMATHGoogle Scholar
- Pace RK, Barry R, Clapp JM, Rodriquez M (1998) Spatiotemporal autoregressive models of neighborhood effects. J Real Estate Finance Econ 17(1):15–33CrossRefGoogle Scholar
- Pace RK, Barry R, Gilley OW, Sirmans C (2000) A method for spatial-temporal forecasting with an application to real estate prices. Int J Forecast 16(2):229–246CrossRefGoogle Scholar
- Peterson S, Flanagan A (2009) Neural network hedonic pricing models in mass real estate appraisal. J Real Estate Res 31(2):147–164Google Scholar
- Sangalli LM, Ramsay JO, Ramsay TO (2013) Spatial spline regression models. J R Stat Soc Ser B (Stat Methodol) 75(4):681–703MathSciNetCrossRefGoogle Scholar
- Shiller RJ (1991) Arithmetic repeat sales price estimators. J Hous Econ 1(1):110–126CrossRefGoogle Scholar
- Smith TE, Wu P (2009) A spatio-temporal model of housing prices based on individual sales transactions over time. J Geogr Syst 11(4):333CrossRefGoogle Scholar
- Tan F, Xia Y, Zhu B (2014) Link prediction in complex networks: a mutual information perspective. PLOS ONE 9(9):e107,056CrossRefGoogle Scholar
- Tan F, Cheng C, Wei Z (2016) Modeling real estate for school district identification. In: Data mining (ICDM), 2016 IEEE 16th international conference on. IEEE, pp 1227–1232Google Scholar
- Tan F, Cheng C, Wei Z (2017) Time-aware latent hierarchical model for predicting house prices. In: Data mining (ICDM), 2017 IEEE 16th international conference on. IEEE, pp 1111–1116Google Scholar
- Tan F, Du K, Wei Z, Liu H, Qin C, Zhu R (2018) Modeling item-specific effects for video click. In: Proceedings of the 2018 SIAM international conference on data mining. SIAM, pp 639–647Google Scholar
- Taylor LO (2003) The hedonic method. In: A primer on nonmarket valuation, pp 331–393Google Scholar
- Yao Z, Fu Y, Liu B, Xiong H (2016) The impact of community safety on house ranking. In: Proceedings of the 2016 SIAM international conference on data mining. SIAM, pp 459–467Google Scholar
- Zhou J, Wang F, Hu J, Ye J (2014) From micro to macro: data driven phenotyping by densification of longitudinal electronic medical records. In: Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, pp 135–144Google Scholar
- Zhu H, Xiong H, Tang F, Liu Q, Ge Y, Chen E, Fu Y (2016) Days on market: Measuring liquidity in real estate markets. In: Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining. ACM, pp 393–402Google Scholar