Multiple Random Forests Modelling for Urban Water Consumption Forecasting
The precise forecasting of water consumption is the basis in water resources planning and management. However, predicting water consumption fluctuations is complicated, given their non-stationary and non-linear characteristics. In this paper, a multiple random forests model, integrated wavelet transform and random forests regression (W-RFR), is proposed for the prediction of daily urban water consumption in southwest of China. Raw time series were first decomposed into low- and high-frequency parts with discrete wavelet transformation (DWT). The random forests regression (RFR) method was then used for prediction using each subseries. In the process, the input and output constructions of the RFR model were proposed for each subseries on the basis of the delay times and the embedding dimension of the attractor reconstruction computed by the C-C method, respectively. The forecasting values of each subseries were summarized as the final results. Four performance criteria, i.e., correlation coefficient (R), mean absolute percentage error (MAPE), normalized root mean square error (NRMSE) and threshold static (TS), were used to evaluate the forecasting capacity of the W-RFR. The results indicated that the W-RFR can capture the basic dynamics of the daily urban water consumption. The forecasted performance of the proposed approach was also compared with those of models, i.e., the RFR and forward feed neural network (FFNN) models. The results indicated that among the models, the precision of the predictions of the proposed model was greater, which is attributed to good feature extractions from the multi-scale perspective and favorable feature learning performance using the decision trees.
KeywordsWavelet transform Random forests regression Water consumption Attractor reconstruction Forecasting
This work is supported by the Project in the National Science and Technology Pillar Programme during the Twelfth Five-year Plan Period (2012BAJ25B06-003) and the Key Project of University Natural Science Research of Anhui, China (KJ2016A168).
Compliance with Ethical Standards
Conflict of Interest
No interest conflict.
- Grossmann A, Morlet J (1984) Decomposition of Hardy function into square integrable wavelets of constant shape. J Math Anal Appl 5:723–736Google Scholar
- Ho TK (1995) Random decision forest. IEEE Comput Soc 278–282Google Scholar
- Li C, Sanchez R, Zurita G, Cerrada M, Cabrera D, Vásquez R (2016) Gearbox fault diagnosis based on deep random forest fusion of acoustic and vibratory signals. Mech Syst Signal Process 76–77:283–293Google Scholar
- Maroco J, Silva D, Rodrigues A, Guerreiro M, Santana I, de Mendonca A (2011) Data mining methods in the prediction of dementia: a real-data comparison of the accuracy, sensitivity and specificity of linear discriminant analysis, logistic regression, neural networks, support vector machines, classification trees and random forests. BMC Res Notes 4:299CrossRefGoogle Scholar