Open Access Open Access  Restricted Access Subscription Access

Application of Big Data Analysis with Decision Tree for Road Accident


Affiliations
1 Laboratory of Intelligent Energy Management and Information Systems, Faculty of Sciences Semlalia, Cadi Ayyad University, Marrakech, Morocco
 

Objectives: In transportation field, a huge amount of data collected by IoT systems, remote sensing and other data collection tools brings new challenges, the size of this data becomes extremely big and more complex for traditional techniques of data mining. To deal with this challenge, Apache Spark stand as a powerful large scale distributed computing platform that can be used successfully for machine learning against very large databases. This work employed large-scale machine learning techniques especially Decision Tree with Apache Spark framework for big data analysis to build a model that can predict the factors lead to road accidents based on several input variables related to traffic accidents. Based on this, the predicting model first preprocesses the big accident data and analyze it to create data for a learning system. Empirical results show that the proposed model could provide new information that can assist the decision makers to analyze and improve road safety

Keywords

Data mining, Big Data, Road accident, Decision Tree, Apache Spark, Mllib
User

Abstract Views: 171

PDF Views: 0




  • Application of Big Data Analysis with Decision Tree for Road Accident

Abstract Views: 171  |  PDF Views: 0

Authors

Addi Ait-Mlouk
Laboratory of Intelligent Energy Management and Information Systems, Faculty of Sciences Semlalia, Cadi Ayyad University, Marrakech, Morocco
Fatima Gharnati
Laboratory of Intelligent Energy Management and Information Systems, Faculty of Sciences Semlalia, Cadi Ayyad University, Marrakech, Morocco
Tarik Agouti
Laboratory of Intelligent Energy Management and Information Systems, Faculty of Sciences Semlalia, Cadi Ayyad University, Marrakech, Morocco

Abstract


Objectives: In transportation field, a huge amount of data collected by IoT systems, remote sensing and other data collection tools brings new challenges, the size of this data becomes extremely big and more complex for traditional techniques of data mining. To deal with this challenge, Apache Spark stand as a powerful large scale distributed computing platform that can be used successfully for machine learning against very large databases. This work employed large-scale machine learning techniques especially Decision Tree with Apache Spark framework for big data analysis to build a model that can predict the factors lead to road accidents based on several input variables related to traffic accidents. Based on this, the predicting model first preprocesses the big accident data and analyze it to create data for a learning system. Empirical results show that the proposed model could provide new information that can assist the decision makers to analyze and improve road safety

Keywords


Data mining, Big Data, Road accident, Decision Tree, Apache Spark, Mllib



DOI: https://doi.org/10.17485/ijst%2F2017%2Fv10i29%2F158572