Feature Selection Issues in Long-Term Travel Time Prediction

Hassan, Syed Murtaza; Moreira-Matias, Luis; Khiari, Jihed; Cats, Oded

doi:10.1007/978-3-319-46349-0_9

Syed Murtaza Hassan¹⁷,
Luis Moreira-Matias¹⁷,
Jihed Khiari¹⁷ &
…
Oded Cats¹⁸

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 9897))

Included in the following conference series:

International Symposium on Intelligent Data Analysis

1838 Accesses
8 Citations

Abstract

Long-term travel time predictions are crucial for tactical and operational public transport planning in schedule design and resource allocation tasks. Similarly to any regression task, its success considerably depend on an adequate feature selection framework. In this paper, we approach the myopia of the State-of-the-Art method RReliefF on mining relevant inter-relationships of the feature space relevant for reducing the entropy around the target variable on regression tasks. A comparative study was conducted using baseline regression methods and LASSO as a valid alternative to RReliefF. Experimental results obtained on a real-world case study uncovered the bias/variance reduction obtained by each approach, pointing out promising ideas on this research line.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Moreira-Matias, L., Mendes-Moreira, J., de Sousa, J.F., Gama, J.: Improving mass transit operations by using AVL-based systems: a survey. IEEE Trans. Intell. Transp. Syst. 16(4), 1636–1653 (2015)
Article Google Scholar
Guyon, I., Elisseeff, A.: An introduction to variable and feature selection. J. Mach. Learn. Res. 3, 1157–1182 (2003)
MATH Google Scholar
Mishalani, R., McCord, M., Forman, S.: Schedule-based and autoregressive bus running time modeling in the presence of driver-bus heterogeneity. In: Hickman, M., Mirchandani, P., Voß, S. (eds.) Computer-Aided Systems in Public Transport, pp. 301–317. Springer, Heidelberg (2008)
Chapter Google Scholar
Berkow, M., El-Geneidy, A., Bertini, R., Crout, D.: Beyond generating transit performance measures. Transp. Res. Rec. J. Transp. Res. Board 2111(1), 158–168 (2009)
Article Google Scholar
El-Geneidy, A., Horning, J., Krizek, K.: Analyzing transit service reliability using detailed data from automatic vehicular locator systems. J. Adv. Transp. 45(1), 66–79 (2011)
Article Google Scholar
Mazloumi, E., Rose, G., Currie, G., Sarvi, M.: An integrated framework to predict bus travel time and its variability using traffic flow data. J. Intell. Transp. Syst. 15(2), 75–90 (2011)
Article Google Scholar
Mendes-Moreira, J., Jorge, A., de Sousa, J., Soares, C.: Comparing state-of-the-art regression methods for long term travel time prediction. Intell. Data Anal. 16(3), 427–449 (2012)
Google Scholar
Robnik-Šikonja, M., Kononenko, I.: An adaptation of relief for attribute estimation in regression. In: Proceedings of the Fourteenth International Conference on Machine Learning, ICML 1997, pp. 296–304 (1997)
Google Scholar
Mendes-Moreira, J., Moreira-Matias, L., Gama, J., de Sousa, J.: Validating the coverage of bus schedules: a machine learning approach. Inf. Sci. 293, 299–313 (2015)
Article Google Scholar
Kira, K., Rendell, L.A.: A practical approach to feature selection. In: Proceedings of the Ninth International Workshop on Machine Learning, pp. 249–256 (1992)
Google Scholar
Kononenko, I.: Estimating attributes: analysis and extensions of RELIEF. In: Bergadano, F., Raedt, L. (eds.) ECML 1994. LNCS, vol. 784, pp. 171–182. Springer, Heidelberg (1994). doi:10.1007/3-540-57868-4_57
Chapter Google Scholar
Tibshirani, R.: Regression shrinkage and selection via the LASSO. J. Roy. Stat. Soc. Ser. B (Methodol.) 58(1), 267–288 (1996)
MathSciNet MATH Google Scholar
Breiman, L., Friedman, J., Stone, C.J., Olshen, R.A.: Classification and Regression Trees. CRC Press, New York (1984)
MATH Google Scholar
Breiman, L.: Random forests. Mach. Learn. 45(1), 5–32 (2001)
Article MathSciNet MATH Google Scholar
Cortes, C., Vapnik, V.: Support-vector networks. Mach. Learn. 20(3), 273–297 (1995)
MATH Google Scholar
Friedman, J., Stuetzle, W.: Projection pursuit regression. J. Am. Stat. Assoc. 76(376), 817–823 (1981)
Article MathSciNet Google Scholar
R Core Team: R: A Language and Environment for Statistical Computing. R Foundation, Vienna (2012)
Google Scholar
Romanski, P.: Fselector: selecting attributes. R package version 0.19 (2009)
Google Scholar
Friedman, J., Hastie, T., Tibshirani, R.: Regularization paths for generalized linear models via coordinate descent. J. Stat. Softw. 33(1), 1 (2010)
Article Google Scholar
Kuhn, M.: Caret package. J. Stat. Softw. 28(5), 1–26 (2008)
Article Google Scholar
Bergstra, J., Bengio, Y.: Random search for hyper-parameter optimization. J. Mach. Learn. Res. 13(1), 281–305 (2012)
MathSciNet MATH Google Scholar
Zeileis, A., Hornik, K., Smola, A., Karatzoglou, A.: kernlab-an S4 package for kernel methods in R. J. Stat. Softw. 11(9), 1–20 (2004)
Google Scholar

Download references

Author information

Authors and Affiliations

NEC Laboratories Europe, 69115, Heidelberg, Germany
Syed Murtaza Hassan, Luis Moreira-Matias & Jihed Khiari
Department of Transport and Planning, TU Delft, 2600, Delft, Netherlands
Oded Cats

Authors

Syed Murtaza Hassan
View author publications
You can also search for this author in PubMed Google Scholar
Luis Moreira-Matias
View author publications
You can also search for this author in PubMed Google Scholar
Jihed Khiari
View author publications
You can also search for this author in PubMed Google Scholar
Oded Cats
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Jihed Khiari .

Editor information

Editors and Affiliations

Stockholm University , Stockholm, Sweden
Henrik Boström
Leiden University , Leiden, The Netherlands
Arno Knobbe
University of Porto , Porto, Portugal
Carlos Soares
Stockholm University , Stockholm, Sweden
Panagiotis Papapetrou

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Hassan, S.M., Moreira-Matias, L., Khiari, J., Cats, O. (2016). Feature Selection Issues in Long-Term Travel Time Prediction. In: Boström, H., Knobbe, A., Soares, C., Papapetrou, P. (eds) Advances in Intelligent Data Analysis XV. IDA 2016. Lecture Notes in Computer Science(), vol 9897. Springer, Cham. https://doi.org/10.1007/978-3-319-46349-0_9

Download citation

DOI: https://doi.org/10.1007/978-3-319-46349-0_9
Published: 21 September 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-46348-3
Online ISBN: 978-3-319-46349-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics