Abstract
Long-term travel time predictions are crucial for tactical and operational public transport planning in schedule design and resource allocation tasks. Similarly to any regression task, its success considerably depend on an adequate feature selection framework. In this paper, we approach the myopia of the State-of-the-Art method RReliefF on mining relevant inter-relationships of the feature space relevant for reducing the entropy around the target variable on regression tasks. A comparative study was conducted using baseline regression methods and LASSO as a valid alternative to RReliefF. Experimental results obtained on a real-world case study uncovered the bias/variance reduction obtained by each approach, pointing out promising ideas on this research line.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Moreira-Matias, L., Mendes-Moreira, J., de Sousa, J.F., Gama, J.: Improving mass transit operations by using AVL-based systems: a survey. IEEE Trans. Intell. Transp. Syst. 16(4), 1636–1653 (2015)
Guyon, I., Elisseeff, A.: An introduction to variable and feature selection. J. Mach. Learn. Res. 3, 1157–1182 (2003)
Mishalani, R., McCord, M., Forman, S.: Schedule-based and autoregressive bus running time modeling in the presence of driver-bus heterogeneity. In: Hickman, M., Mirchandani, P., Voß, S. (eds.) Computer-Aided Systems in Public Transport, pp. 301–317. Springer, Heidelberg (2008)
Berkow, M., El-Geneidy, A., Bertini, R., Crout, D.: Beyond generating transit performance measures. Transp. Res. Rec. J. Transp. Res. Board 2111(1), 158–168 (2009)
El-Geneidy, A., Horning, J., Krizek, K.: Analyzing transit service reliability using detailed data from automatic vehicular locator systems. J. Adv. Transp. 45(1), 66–79 (2011)
Mazloumi, E., Rose, G., Currie, G., Sarvi, M.: An integrated framework to predict bus travel time and its variability using traffic flow data. J. Intell. Transp. Syst. 15(2), 75–90 (2011)
Mendes-Moreira, J., Jorge, A., de Sousa, J., Soares, C.: Comparing state-of-the-art regression methods for long term travel time prediction. Intell. Data Anal. 16(3), 427–449 (2012)
Robnik-Šikonja, M., Kononenko, I.: An adaptation of relief for attribute estimation in regression. In: Proceedings of the Fourteenth International Conference on Machine Learning, ICML 1997, pp. 296–304 (1997)
Mendes-Moreira, J., Moreira-Matias, L., Gama, J., de Sousa, J.: Validating the coverage of bus schedules: a machine learning approach. Inf. Sci. 293, 299–313 (2015)
Kira, K., Rendell, L.A.: A practical approach to feature selection. In: Proceedings of the Ninth International Workshop on Machine Learning, pp. 249–256 (1992)
Kononenko, I.: Estimating attributes: analysis and extensions of RELIEF. In: Bergadano, F., Raedt, L. (eds.) ECML 1994. LNCS, vol. 784, pp. 171–182. Springer, Heidelberg (1994). doi:10.1007/3-540-57868-4_57
Tibshirani, R.: Regression shrinkage and selection via the LASSO. J. Roy. Stat. Soc. Ser. B (Methodol.) 58(1), 267–288 (1996)
Breiman, L., Friedman, J., Stone, C.J., Olshen, R.A.: Classification and Regression Trees. CRC Press, New York (1984)
Breiman, L.: Random forests. Mach. Learn. 45(1), 5–32 (2001)
Cortes, C., Vapnik, V.: Support-vector networks. Mach. Learn. 20(3), 273–297 (1995)
Friedman, J., Stuetzle, W.: Projection pursuit regression. J. Am. Stat. Assoc. 76(376), 817–823 (1981)
R Core Team: R: A Language and Environment for Statistical Computing. R Foundation, Vienna (2012)
Romanski, P.: Fselector: selecting attributes. R package version 0.19 (2009)
Friedman, J., Hastie, T., Tibshirani, R.: Regularization paths for generalized linear models via coordinate descent. J. Stat. Softw. 33(1), 1 (2010)
Kuhn, M.: Caret package. J. Stat. Softw. 28(5), 1–26 (2008)
Bergstra, J., Bengio, Y.: Random search for hyper-parameter optimization. J. Mach. Learn. Res. 13(1), 281–305 (2012)
Zeileis, A., Hornik, K., Smola, A., Karatzoglou, A.: kernlab-an S4 package for kernel methods in R. J. Stat. Softw. 11(9), 1–20 (2004)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing AG
About this paper
Cite this paper
Hassan, S.M., Moreira-Matias, L., Khiari, J., Cats, O. (2016). Feature Selection Issues in Long-Term Travel Time Prediction. In: Boström, H., Knobbe, A., Soares, C., Papapetrou, P. (eds) Advances in Intelligent Data Analysis XV. IDA 2016. Lecture Notes in Computer Science(), vol 9897. Springer, Cham. https://doi.org/10.1007/978-3-319-46349-0_9
Download citation
DOI: https://doi.org/10.1007/978-3-319-46349-0_9
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-46348-3
Online ISBN: 978-3-319-46349-0
eBook Packages: Computer ScienceComputer Science (R0)