Abstract
Road accidents are becoming a global crisis which does not only cost human life but also affect the economic condition of a nation. Therefore it is gaining the interest of several researchers to analyze the influential factors behind road accidents and suggest the preventive measures. A large country like India is also suffering from a substantial amount of road accidents and try to incorporate improve road safety measures. In this paper, we evaluate the performances of different Indian states in road safety measures based on the recorded road accident data. The current analysis is based on a novel line encoding method which categorizes Indian states into three categories, i.e., best performer, average performer, and stragglers. Results have efficiently identified the states that reduce road accidents and the potential states which need to improve on road safety to a great extent. The outcome after ranking the states based on our approach shows there is a high linear correlation between accident rate with population and number of vehicles whereas accident and literacy rate shares an inverse correlation. The satisfactory results based on our approach definitely enlight the interdisciplinary research domains.
Similar content being viewed by others
Keywords
1 Introduction
The vehicle density has been swiftly growing over few decades globally [1]. India is not an exception and tries to build its overall performance in different aspects worldwide by expanding road network and transportation system. The highly connected road and highway network in India is the backbone of the continuous development of Indian social economics. Besides the overall growth progresses, road accidents in India are also accelerating. The frequent severe road accidents not only result in life threats but also damage the infrastructure and aggregates massive economic loss. According to the Ministry of Road Transport and Highways [2], the number of accidental deaths have increased by 31% from 2007 to 2017 and take a lot of attention. Indian Government and different state authorities already implemented different safety measures and strict traffic rules.
Road safety is one of the primary parameters for the growth of a country combining other factors such as population, literacy, etc. To address the problem of insufficient road safety, it requires the dedicated action of several ministries, most notably law, planning, transport, education, public information, and health. Road safety assessment in periodical manner has played a major role in the theory and practice of transport management systems. The states are maintaining lots of statistical data which can be used to extract the progress of individual states in road safety awareness. It can be noted that every state has different population density, road infrastructure, and geographical location and number of road accidents in different states are not comparable. But the alarming situation demands an alternative data mining tool to measure the performance of the states in road safety aspects and relate them to influential parameters like population, number of vehicles, and literacy rate. The correlation among these attributes with road accidents may reveal the actual performance of the different Indian states in reducing traffic mishaps. In this paper, we develop a novel line encoding method to demonstrate the progress of any entity from time-varying data. The method is very easy to implement and disclose the proper pattern of positive or negative growth rate as the time progresses. In the current analysis, we categorize Indian states into three categories, i.e., best performer, average performer, and stragglers. Our method is unique and the results are satisfactory which can open up a new framework for performance analysis.
The remaining paper is structured as follows. We elaborate some previous works in Sect. 2. In Sect. 3, our approaches for performance analysis are proposed. We illustrate our method with an example in Sect. 4. Experimental analysis and results are shown in Sect. 5. Then in Sect. 6, we conclude.
2 Literature Review
Different computational work are already employed to analyze road network [3,4,5] and efficient transportation system [6,7,8]. Assuming the road network as a graph with road junction as vertices and highways between them as edges, different topological properties are analyzed in [9]. Topological properties are studied and found road network possesses the properties of small world assortative network. Based on betweenness centrality, most important junction points are identified. Most important junction points are concluded as potential congestion points. Wenxue et al. [10] proposed an improved method to analyze hazardous material road transportation accident rate. To analyze and divide regions of a state based on the pattern of accidents, an agglomerative hierarchical clustering algorithm is introduced [11]. To evaluate best distance measure, the Cophenetic correlation coefficient is used for clustering. The approach is applied to analyze hourly road accident data of twenty-six districts of Gujarat. Tracking vehicles and monitoring driving habits is proposed in [12] to reduce the number of road accidents. Telematics system is used to track each vehicles position and report the errors for further analysis. A modified method [13] of principal component analysis is proposed to analyze the relationship between traffic accident evaluating index and the different causes of traffic accident which includes number of people and vehicles, type of road, and corresponding environment. Data mining tools are used to predict the probability of accidents on specific roads like State highways (SHs) or Ordinary district roads (ODRs) by estimating the severity of accidents [14]. There are no such works which have been proposed for performance analysis of the states from the recorded road accident statistics. Our proposed approach is an attempt to examine past accidental data to ensure the progress of different Indian states based on the road safety measures.
3 Our Approach for Performance Analysis
In this section, we propose a novel line encoding method to compare progress of the entities \((e_1,e_2, \dots , e_n \in E)\) from time-varying data. Progress means the performance variation of different entities with respect to time. An index like progressive score will serve as the comparison base among the entities and signifies how much progress each entity has made throughout a given time range. For a entity \((e_i\in E)\), a set of random variables \(V\{v_1,v_2,v_3,\dots ,v_n\}\) at different time instant \(T\{t_1,t_2,t_3,\dots ,t_n\}\) is given. X-axis denotes time and Y-axis represents the real-valued attributes of different entities. From previous time instant \((t_n)\) to successive time instant \((t_{n+1})\), value of a entity \(e_i\) can increase, decrease, or remain moreover same. The change is denoted as \((\varDelta v)\). \(\varDelta v=v_{n+1}-v_n\). This change can be interpreted as a ray \(I_\alpha \) where the slope of the ray is \(\alpha \). The line can be oriented at any direction out of n positions. Here n is in the range from positive Y-axis to negative Y-axis in clock-wise direction. The aggregate value of all the \(I_\alpha \), i.e., \((I_{\alpha 1}+I_{\alpha 2}+I_{\alpha 3}+\dots + I_{\alpha {n-1}})\) generate the summative score for a single entity \(e_i\).
Encoded values for each \(e_i \in E\) is calculated using the following formula:
where \(|v_{max}|=max\{{v_1,v_2,v_3,\dots ,v_n}\}\).
4 Illustration with Example
Figure 1, illustrates the performance of four entities denoted as \(L_1, L_2, L_3\) and \(L_4\). Let, up to the time instant \(t_1\), all of there progressive score is 0. At time instant \(t_2\), progress of each entity is calculated. \(I_{L_1}=\frac{30-0}{30}=1\), \(I_{L_2}=\frac{10-0}{30}=0.33\), \(I_{L_3}=\frac{-20-0}{30}=-0.67\) and \(I_{L_4}=\frac{-30-0}{30}=-1\). In our experiment, the statistical performance data in lap of a year is assumed as a time-varying set of values. Each state are defined as different entities. Progressive score is translated differently for individual cases. For road accident rate and literacy, the states with low \((L_4)\) and high \((L_1)\) progressive score respectively are denoted as the best performer.
Let us take a set of 5 variables as illustrated in Table 1. Each variable is having a set of values at different time instant. The difference of values between two consecutive time instant is shown Table 2. Now using the Eq. 1 we get the results in Table 3. Here, we get V1 and V2 as the lowest and highest valued entities respectively. To prevent the accident, V1 performs better than other entities.
5 Result and Analysis
In this section, we analyze the actual data based on the proposed method. We also validate the result over the benchmark dataset.
5.1 Data Collection
We have collected different statistical data maintained by the Government of India to compare the performance of different states and union territories (UT). The data includes road accident data from 2001 to 2014, literacy rate from 1951 to 2011, total population data from 1951 to 2011, total registered motor vehicles from 2001 to 2012 which are gathered from NCRB (National Crime Records Bureau) [15, 16], Ministry of Home Affairs Govt. of India [17], Census of India [18], Ministry of Highways and Road Transports, Govt. of India [2] respectively to assess the performance of our proposed approach.
5.2 Result over Different Attributes
Figure 2 depicts the results obtained from road accident data based on our proposed method. It shows West Bengal is the most progressive state in preventing road accident throughout 2001 to 2014 whereas Kerala performed the worst. Here, steep slope signifies the higher increase in accident which is costly for progress whereas gradual angle signifies lesser increase in number of accidents. If the angle is negative then it signifies the reduction in accident which is economic whereas a horizontal straight line signifies no change between two successive points and thus, non-increasing. Tamil Nadu, Rajasthan, Uttar Pradesh have more gradual angles than steeper angles turning out to be average progress.
Classification of the states and Union Territories is shown in Table 4 based on their performance in alphabetical order. Most of the north-east states are classified as straggler which is a real concern. The geographical condition and environmental hazards might be the reason for their unsatisfactory performance. Some newly born states like Chhattisgarh, Uttarakhand are grouped as the best performer which is very much encouraging.
Next, we examine the literacy progress of different states. Figure 3 depicts Delhi, Kerala and Andaman & Nicobar Islands have done the best progress in literacy. Here, steeper angle signifies an increase in literacy which turns out to be good progress, whereas gradual angle turns out to be bad progress and horizontal straight line signifies non-increasing literacy rate. Mizoram, Pondicherry, and Tamil Nadu having only a few numbers of steeper angle but no negative angle denote they performed well in literacy. Himachal Pradesh has the most number of horizontal straight lines thus making it average performer.
Population control is another important attribute of progress for the states as a whole. Figure 4 depicts Karnataka and Gujarat have done the best progress in preventing increasing population. Here, steeper angle signifies an increase in population which turns out to be bad progress, whereas gradual angles turn out to be good progress and horizontal straight lines signify retaining the same population as of previous year. Most of the states except Karnataka and Gujarat have gradual angles and thus making them less progressive in controlling population.
We analyze the number of motor vehicles registered on the roads in different states. Figure 5 depicts Gujarat, Madhya Pradesh, Rajasthan, and Punjab have done the best progress in controlling the number of motor vehicles. Only best performer states have all the angles as gradual thus making it more progressive than others whereas Delhi and Karnataka have made bad progress which signifies them as the worse performer.
5.3 Correlation Analysis
We have emphasized on the prevention of road traffic accident which is our primary research concern and consider it as the dominant attribute behind progress of each state. Other attributes like literacy rate, population, and number of transport are also correlated with the accident. Correlation analysis shows how strongly pairs of variables are related. Two set of values are \(X=(x_1,x_2, \dots )\) and \(Y=(y_1,y_2, \dots )\) are correlated using Pearson correlation coefficient analysis (PCCA). The PCCA constant [19] r is defined as
\(S_x\) and \(S_y\) are the standard deviation of X and Y respectively. cov(x, y) is the covariation within x and y.
here n is the number of paired data. \(\overline{x}, \overline{y}\) is the mean and \(x_i\) and \(y_i\) is the ith instance value of set X and Y respectively. Pearson correlation coefficient between two variables always lie in between \(+\)1 and −1 where they denote observations have identical rank and dissimilar rank between themselves respectively. We analyze different attributes with road traffic accident to examine their relationship. Result has shown road accident is directly proportional with population and number transport increment with \(r = 0.92\) and 0.84 respectively and inverse proportional with literacy rate with \(r = -0.96\).
5.4 Validation of the Proposed Approach
We have validated our result on Adenoma attributes \((A1, A2, \dots , A8)\) as shown in sample dataset [20], to find its strength of association. Table 5 shows PCCA similarity measure of Adenoma attributes based on our approach. Strength of association among different attributes are shown in Table 6. According to our experiment, primary attributes are A3 and A4 which is moreover similar to the biological conclusion. Therefore, we infer our method works well in finding primary attributes and their strength of association.
6 Conclusions
Road accident does not happen only due to the improper infrastructure of the roads, rather it is influenced by other factors as well like literacy rate, number of vehicles, the population of individual states, etc. Proper road safety management results in overall development of the nation. Road safety has been made one of the most important dimensions out of several development goals set for a country like India. In this paper, we develop a line encoding method to analyze the progress of different Indian states in traffic awareness. Our result concludes that traffic control measures are not the only solution. The states have to control population and number of vehicles and make their resident more literate to control road traffic accidents. For last few years, the focus of the Indian government is to develop a holistic traffic system in a hope that it will accelerate the growth of the country. In future, we may consider the geographical position of the states, weather condition and influence of different highway network to reveal other factors for road traffic accidents.
References
Li, J., Cheng, H., Guo, H., Qiu, S.: Survey on artificial intelligence for vehicles. Automot. Innov. 1(1), 2–14 (2018)
Ministry of Road Transport and Highways, Government of India. http://morth.nic.in/
Chen, Y., Wu, C., Yao, M.: Dynamic topology construction for road network. In: 2008 Fourth International Conference on Networked Computing and Advanced Information Management, vol. 1, pp. 359–364 (2008). https://doi.org/10.1109/NCM.2008.45
Mambou, E.N., Nlend, S., Liu, H.: Study of the us road network based on social network analysis. In: 2017 IEEE AFRICON, pp. 974–978 (2017). https://doi.org/10.1109/AFRCON.2017.8095614
Mo, Y., Liu, J., Lv, S.: GIS-based analysis of fractal features of the urban road network. In: 2015 6th IEEE International Conference on Software Engineering and Service Science (ICSESS), pp. 845–848 (2015). https://doi.org/10.1109/ICSESS.2015.7339187
Chen, P., Li, K., Sun, J.: A method of traffic flow forecast and management. In: 2008 International Conference on Information Management, Innovation Management and Industrial Engineering, vol. 2, pp. 24–28 (2008). https://doi.org/10.1109/ICIII.2008.131
Muhuri, S., Das, D., Chakraborty, S.: An automated game theoretic approach for cooperative road traffic management in disaster. In: 2017 IEEE International Symposium on Nanoelectronic and Information Systems (iNIS), pp. 145–150 (2017). https://doi.org/10.1109/iNIS.2017.38
Spirou, S., Stiller, B.: Economic traffic management. In: 2009 IFIP/IEEE International Symposium on Integrated Network Management-Workshops, pp. 144–146 (2009). https://doi.org/10.1109/INMW.2009.5195951
Mukherjee, S.: Statistical analysis of the road network of India. Pramana 79(3), 483–491 (2012)
Wenxue, C., Hengpeng, W., Shangjiang, S., Lilu, A.: An improved hazardous material road transportation accident rate analysis model. In: 2010 International Conference on Logistics Systems and Intelligent Management (ICLSIM), vol. 2, pp. 1081–1085 (2010). https://doi.org/10.1109/ICLSIM.2010.5461123
Kumar, S., Toshniwal, D.: Analysis of hourly road accident counts using hierarchical clustering and cophenetic correlation coefficient (CPCC). J. Big Data 3(1), 13 (2016)
Khalil, O.K.: A study on road accidents in Abu Dhabi implementing a vehicle telematics system to reduce cost, risk and improve safety. In: 2017 10th International Conference on Developments in eSystems Engineering (DeSE), pp. 195–200. IEEE (2017)
Ming, H., Xiucheng, G.: The application of BP neural network principal component analysis in the forecasting the road traffic accident. In: 2009 Second International Conference on Intelligent Computation Technology and Automation, vol. 1, pp. 107–111 (2009). https://doi.org/10.1109/ICICTA.2009.35
Kaur, G., Kaur, E.H.: Prediction of the cause of accident and accident prone location on roads using data mining techniques. In: 2017 8th International Conference on Computing, Communication and Networking Technologies (ICCCNT), pp. 1–7 (2017). https://doi.org/10.1109/ICCCNT.2017.8204001
National Crime Records Bureau. http://ncrb.gov.in/
Persons Injured in Road Accidents. https://data.gov.in/catalog/persons-injured-road-accidents
Ministry of Home Affairs, Government of India. https://mha.gov.in/
Census of India website. http://censusindia.gov.in/
Wu, W.J., Xu, Y.: Correlation analysis of visual verbs’ subcategorization based on pearson’s correlation coefficient. In: 2010 International Conference on Machine Learning and Cybernetics (ICMLC), vol. 4, pp. 2042–2046. IEEE (2010)
Mujahid, A.K., Thirumalai, C.: Pearson correlation coefficient analysis (PCCA) on adenoma carcinoma cancer. In: 2017 International Conference on Trends in Electronics and Informatics (ICEI), pp. 492–495. IEEE (2017)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Muhuri, S., Chakraborty, S., Das, D. (2020). Line Encoding Based Method to Analyze Performance of Indian States in Road Safety. In: Das, A., Nayak, J., Naik, B., Pati, S., Pelusi, D. (eds) Computational Intelligence in Pattern Recognition. Advances in Intelligent Systems and Computing, vol 999. Springer, Singapore. https://doi.org/10.1007/978-981-13-9042-5_46
Download citation
DOI: https://doi.org/10.1007/978-981-13-9042-5_46
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-9041-8
Online ISBN: 978-981-13-9042-5
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)