A data mining approach for classification of traffic violations types

Nor Aqilah Othman; Cik Feresa Mohd Foozy; Aida Mustapha; Salama A Mostafa; Shamala Palaniappan; Shafiza Ariffin Kashinath

doi:10.26555/ijain.v7i3.708


A data mining approach for classification of traffic violations types

⁽¹⁾ Nor Aqilah Othman

(Faculty of Computer Science & Information Technology, Universiti Tun Hussein Onn Malaysia, Malaysia)
⁽²⁾ Cik Feresa Mohd Foozy

(Faculty of Computer Science & Information Technology, Universiti Tun Hussein Onn Malaysia, Malaysia)
⁽³⁾ Aida Mustapha

(Faculty of Computer Science & Information Technology, Universiti Tun Hussein Onn Malaysia, Malaysia)
^{(4) *} Salama A Mostafa

(Faculty of Computer Science & Information Technology, Universiti Tun Hussein Onn Malaysia, Malaysia)
⁽⁵⁾ Shamala Palaniappan

(Faculty Science Computer and Mathematics, Universiti Teknologi MARA (UiTM), Malaysia)
⁽⁶⁾ Shafiza Ariffin Kashinath

(Sena Traffic Systems Sdn. Bhd, Kuala Lumpur, Malaysia)
^*corresponding author

Abstract

Traffic summons, also known as traffic tickets, is a notice issued by a law enforcement official to a motorist, who is a person who drives a car, lorry, or bus, and a person who rides a motorcycle. This study is set to perform a comparative experiment to compare the performance of three classification algorithms (Naive Bayes, Gradient Boosted Trees, and Deep Learning algorithm) in classifying the traffic violation types. The performance of all the three classification models developed in this work is measured and compared. The results show that the Gradient Boosted Trees and Deep Learning algorithm have the best value in accuracy and recall but low precision. NaÃ¯ve Bayes, on the other hand, has high recall since it is a picky classifier that only performs well in a dataset that is high in precision. This paperâ€™s results could serve as baseline results for investigations related to the classification of traffic violation types. It is also helpful for authorities to strategize and plan ways to reduce traffic violations among road users by studying the most common traffic violation types in an area, whether a citation, a warning, or an ESERO (Electronic Safety Equipment Repair Order).

DOI

https://doi.org/10.26555/ijain.v7i3.708

Article metrics

Abstract views : 2409 | PDF views : 302

Cite

How to cite item

Full Text

Download

References

[1] F. Kamanga, V. Smercina, B. G. Brents, D. Okamura, and V. Fuentes, â€œCosts and Consequences of Traffic Fines and Fees: A Case Study of Open Warrants in Las Vegas, Nevada,â€ Soc. Sci., vol. 10, no. 11, p. 440, Nov. 2021, doi: 10.3390/socsci10110440.

[2] A. J. Khattak, N. Ahmad, B. Wali, and E. Dumbaugh, â€œA taxonomy of driving errors and violations: Evidence from the naturalistic driving study,â€ Accid. Anal. Prev., vol. 151, p. 105873, Mar. 2021, doi: 10.1016/j.aap.2020.105873.

[3] N. A. S. Zaidi, A. Mustapha, S. A. Mostafa, and M. N. Razali, â€œA Classification Approach for Crime Prediction,â€ Khalaf M., Al-Jumeily D., Lisitsa A. Appl. Comput. to Support Ind. Innov. Technol. ACRIT 2019. Commun. Comput. Inf. Sci. vol 1174. Springer, Cham., pp. 68â€“78, 2020, doi: 10.1007/978-3-030-38752-5_6.

[4] R. Factor, â€œAn empirical analysis of the characteristics of drivers who are ticketed for traffic offences,â€ Transp. Res. Part F Traffic Psychol. Behav., vol. 53, pp. 1â€“13, Feb. 2018, doi: 10.1016/j.trf.2017.12.001.

[5] B. Jiang et al., â€œTransport and public health in China: the road to a healthy future,â€ Lancet, vol. 390, no. 10104, pp. 1781â€“1791, Oct. 2017, doi: 10.1016/S0140-6736(17)31958-X.

[6] A. M. PÃ©rez-MarÃn and M. Guillen, â€œSemi-autonomous vehicles: Usage-based data evidences of what could be expected from eliminating speed limit violations,â€ Accid. Anal. Prev., vol. 123, pp. 99â€“106, Feb. 2019, doi: 10.1016/j.aap.2018.11.005.

[7] S. Thapa and J. Lee, â€œData Mining Techniques on Traffic Violations,â€ Dep. Electr. Comput. Eng. Univ. Bridg. CT, 2016. Available: Google Scholar.

[8] X. Guo, â€œTraffic Flow Forecasting Model Based on Data Mining,â€ Proc. 2016 Int. Conf. Educ. Manag. Comput. Soc., pp. 1043â€“1046, 2016, doi: 10.2991/emcs-16.2016.257.

[9] R. Factor, â€œReducing traffic violations in minority localities: Designing a traffic enforcement program through a public participation process,â€ Accid. Anal. Prev., vol. 121, pp. 71â€“81, Dec. 2018, doi: 10.1016/j.aap.2018.09.005.

[10] N. Boyko, P. Mykhailyshyn, and Y. Kryvenchuk, â€œUse a cluster approach to organize and analyze data inside the cloud,â€ ECONTECHMOD An Int. Q. J. Econ. Technol. Model. Process., vol. 7, 2018. Available: Google Scholar.

[11] J. R. Ingram, â€œThe Effect of Neighborhood Characteristics on Traffic Citation Practices of the Police,â€ Police Q., vol. 10, no. 4, pp. 371â€“393, Dec. 2007, doi: 10.1177/1098611107306995.

[12] K. S. Hlaing and Y. M. K. K. Thaw, â€œApplications, Techniques and Trends of Data Mining and Knowledge Discovery Database,â€ Int. J. Trend Sci. Res. Dev., vol. 3, no. 5, pp. 1604â€“1606, 2019, [Online]. Available: https://www.ijtsrd.com/papers/ijtsrd26733.pdf.

[13] A. Azevedo, â€œData Mining and Knowledge Discovery in Databases,â€ Adv. Methodol. Technol. Netw. Archit. Mob. Comput. Data Anal., pp. 502â€“514, 2019, doi: 10.4018/978-1-5225-7598-6.ch037.

[14] M. A. Oâ€™Reilly, W. Johnston, C. Buckley, D. Whelan, and B. Caulfield, â€œThe influence of feature selection methods on exercise classification with inertial measurement units,â€ 2017 IEEE 14th Int. Conf. Wearable Implant. Body Sens. Networks, pp. 193â€“196, May 2017, doi: 10.1109/BSN.2017.7936039.

[15] J. Li et al., â€œFeature Selection,â€ ACM Comput. Surv., vol. 50, no. 6, pp. 1â€“45, Jan. 2018, doi: 10.1145/3136625.

[16] X. Chu, I. F. Ilyas, S. Krishnan, and J. Wang, â€œData Cleaning,â€ Proc. 2016 Int. Conf. Manag. Data, pp. 2201â€“2206, Jun. 2016, doi: 10.1145/2882903.2912574.

[17] V. Kunwar, K. Chandel, A. S. Sabitha, and A. Bansal, â€œChronic Kidney Disease analysis using data mining classification techniques,â€ 2016 6th Int. Conf. - Cloud Syst. Big Data Eng., pp. 300â€“305, Jan. 2016, doi: 10.1109/CONFLUENCE.2016.7508132.

[18] D. Leslie, â€œUnderstanding Artificial Intelligence Ethics and Safety: A Guide for the Responsible Design and Implementation of AI Systems in the Public Sector,â€ SSRN Electron. J., 2019, doi: 10.2139/ssrn.3403301.

[19] A. Tiron-Tudor and D. Deliu, â€œBig Dataâ€™s Disruptive Effect on Job Profiles: Management Accountantsâ€™ Case Study,â€ J. Risk Financ. Manag., vol. 14, no. 8, p. 376, Aug. 2021, doi: 10.3390/jrfm14080376.

[20] A. Fatima, N. Nazir, and M. G. Khan, â€œData Cleaning In Data Warehouse: A Survey of Data Pre-processing Techniques and Tools,â€ Int. J. Inf. Technol. Comput. Sci., vol. 9, no. 3, pp. 50â€“61, Mar. 2017, doi: 10.5815/ijitcs.2017.03.06.

[21] O. Adeniji, â€œBusiness to consumers (B2C): the effect of machine learning application in telecom customer churn management,â€ Dublin Business School, 2020. Available: Google Scholar.

[22] A. S. Gran, â€œAutomatic machine learning applied to time series forecasting for novice users in small to medium-sized businesses: a review of how companies accumulate and use data along with an interface for data preparation as well as easy and powerful prediction analysis capable of providing valuable insight,â€ 2019. Available: Google Scholar.

[23] T. Hastie, J. Friedman, and R. Tibshirani, â€œModel Assessment and Selection,â€ Elem. Stat. Learn. Springer Ser. Stat. Springer, New York, NY., pp. 193â€“224, 2001, doi: 10.1007/978-0-387-21606-5_7.

[24] K. Lan, D. Wang, S. Fong, L. Liu, K. K. L. Wong, and N. Dey, â€œA Survey of Data Mining and Deep Learning in Bioinformatics,â€ J. Med. Syst., vol. 42, no. 8, p. 139, Aug. 2018, doi: 10.1007/s10916-018-1003-9.

[25] P. Gaur, â€œNeural networks in data mining,â€ Int. J. Electron. Comput. Sci. Eng., vol. 1, no. 3, pp. 1449-1453, 2012. Available: Google Scholar.

[26] P. S. Patel and S. Desai, â€œA comparative study on data mining tools,â€ Int. J. Adv. Trends Comput. Sci. Eng., vol. 4, no. 2, 2015. Available: Google Scholar.

[27] J. Santos-Pereira, L. Gruenwald, and J. Bernardino, â€œTop data mining tools for the healthcare industry,â€ 2021, doi: 10.1016/j.jksuci.2021.06.002.

[28] A. Benussi et al., â€œClassification accuracy of TMS for the diagnosis of mild cognitive impairment,â€ Brain Stimul., 2021. doi: 10.1016/j.brs.2021.01.004.

[29] S. N. M. M. Nafi, A. Mustapha, S. A. Mostafa, S. H. Khaleefah, and M. N. Razali, â€œExperimenting Two Machine Learning Methods in Classifying River Water Quality,â€ Khalaf M., Al-Jumeily D., Lisitsa A. Appl. Comput. to Support Ind. Innov. Technol. ACRIT 2019. Commun. Comput. Inf. Sci. vol 1174. Springer, Cham., pp. 213â€“222, 2020, doi: 10.1007/978-3-030-38752-5_17.

[30] S. Saifullah, Y. Fauziyah, and A. S. Aribowo, â€œComparison of machine learning for sentiment analysis in detecting anxiety based on social media data,â€ J. Inform., vol. 15, no. 1, p. 45, Feb. 2021, doi: 10.26555/jifo.v15i1.a20111.

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

___________________________________________________________
International Journal of Advances in Intelligent Informatics
ISSN 2442-6571 (print) | 2548-3161 (online)
Organized by UAD and ASCEE Computer Society
Published by Universitas Ahmad Dahlan
W: http://ijain.org
E: info@ijain.org (paper handling issues)
andri.pranolo.id@ieee.org (publication issues)

View IJAIN Stats

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0

Username
Password
Remember me