Market price prediction of used automobiles using voting ensemble method in ML

De Silva, Laththuwa Handi Sanjaya

Home
→
Dissertations & Thesis
→
MSc Bigdata Analytics
→
2021
→
View Item

Market price prediction of used automobiles using voting ensemble method in ML

De Silva, Laththuwa Handi Sanjaya

URI: http://dlib.iit.ac.lk/xmlui/handle/123456789/778

Date: 2021

Abstract:

The automobile industry is rapidly changing, and car prices are only going up every day (“Used Car Market Size & Trends Report, 2020-2027,” n.d.). With the current global economic situation, the majority of the people tend to buy used cars more than ever and also consider parameters like market value, life span, and reliability when buying them (Puneet, n.d.). Since insurance is also low when insuring used cars, people consider buying them more. Prescient & Strategic Intelligence states that global car market size for used cars is 115.2 million units in 2019, and will reach up to 275.3 million units by the year of 2030. This is a compound annual growth rate (CAGR) of 8.7% (Puneet, n.d.). But when deciding the market value of a used car can be more delicate. Since it depends on various factors like mileage, model, manufactured year, country, and how well it is being used, two cars might be having different market values even if they belong to the same car model. This can be problematic when deciding price, since one car model can be comparatively higher while another one being lower. This affects the buyers and they might end up paying more than what it is worth. To overcome this problem this research has focused on developing a Machine Learning (ML) based model to predict the market price of a used automobile. System has three voting ensemble models and it will pick the best model depending on the dataset and predict the price with minimum error. Random forest, Gradient boosting, and MLP are the models and 0.215, 0.219, 0.222 are current root mean squared errors respectively for each model. Random forest regressor was selected by the voting ensemble method, since it has 0.951 test accuracy, and 0.215 root mean squared error for the BMW data that was used for testing. Having a voting ensemble method will eliminate the algorithm dependent misclassifications

Show full item record