The PDF file you selected should load here if your Web browser has a PDF reader plug-in installed (for example, a recent version of Adobe Acrobat Reader).

If you would like more information about how to print, save, and work with PDFs, Highwire Press provides a helpful Frequently Asked Questions about PDFs.

Alternatively, you can download the PDF file directly to your computer, from where it can be opened using a PDF reader. To download the PDF, click the Download link above.

Fullscreen Fullscreen Off


Objectives: In transportation field, a huge amount of data collected by IoT systems, remote sensing and other data collection tools brings new challenges, the size of this data becomes extremely big and more complex for traditional techniques of data mining. To deal with this challenge, Apache Spark stand as a powerful large scale distributed computing platform that can be used successfully for machine learning against very large databases. This work employed large-scale machine learning techniques especially Decision Tree with Apache Spark framework for big data analysis to build a model that can predict the factors lead to road accidents based on several input variables related to traffic accidents. Based on this, the predicting model first preprocesses the big accident data and analyze it to create data for a learning system. Empirical results show that the proposed model could provide new information that can assist the decision makers to analyze and improve road safety

Keywords

Data mining, Big Data, Road accident, Decision Tree, Apache Spark, Mllib
User