Prediction of Methane Outbreaks in Coal Mines from Multivariate Time Series Using Random Forest
In recent years we have experienced unprecedented increase of use of sensors in many industrial applications. Examples of such are Health and Usage Monitoring Systems (HUMS) for vehicles, so-called intelligent buildings, or instrumentation on machinery in order to monitor performance, detect faults and gain insights in operational aspects. Modern sensors are capable of not only generating large volumes of data but as well transmitting that data through network and storing it for further analysis. Unfortunately, that collected data requires further analysis in order to provide useful information to the decision makers who want to reduce costs, improve safety, etc. Such analysis proved to be a challenge, as there are no generic methodologies that allow for automating data analysis and in practice costs required to analyze data are prohibitively high for many practical applications. This paper is a step in a direction of developing generic methods for sensor data analysis – it describes an application of a generic method that can be applied to arbitrary set of multivariate time series data in order to perform classification or regression tasks. The presented application relates to prediction of methane concentrations in coal mines based on time series data from various sensors. The method was tested within the framework of IJCRS’15 data mining competition and resulted in the winning model outperforming other solutions.
- 1.Zagorecki, A.: A Versatile Approach to Classification of Multivariate Time Series Data. In: The Proceedings of the 2015 Federated Conference on Computer Science and Information Systems (2015, to appear)Google Scholar
- 2.Meina, M., Janusz, A., Rykaczewski, K., Ślęzak, D., Celmer, B., Krasuski, A.: Tagging firefighter activities at the emergency scene: summary of AAIAâĂŹ15 data mining competition at knowledge pit. In: Proceedings of the 2015 Federated Conference on Computer Science and Information Systems (2015)Google Scholar
- 3.Janusz, A., Krasuski, A., Stawicki, S., Rosiak, M., Slezak, D., Nguyen, H.S.: Key risk factors for Polish State Fire Service: a data mining competition at knowledge pit. Federated Conference on Computer Science and Information Systems (FedCSIS) 2014, pp. 345–354 (2014) doi: 10.15439/2014F507
- 5.Hall, M.A.: Correlation-based Feature Subset Selection for Machine Learning. Hamilton, New Zealand (1998)Google Scholar
Open Access This chapter is licensed under the terms of the Creative Commons Attribution-NonCommercial 2.5 International License (http://creativecommons.org/licenses/by-nc/2.5/), which permits any noncommercial use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.
The images or other third party material in this chapter are included in the chapter's Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the chapter's Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.