Estimating Time of Driver Arrival with Gradient Boosting Algorithms and Deep Neural Networks

Authors

  • Henrik T. Sergoyan American University of Armenia

DOI:

https://doi.org/10.51408/1963-0050

Keywords:

GG, Estimated Time of Driver Arriva, OSRM, XGBoost, CatBoost, Neural Networks

Abstract

Customer experience and resource management determine the degree to which transportation service providers can compete in today’s heavily saturated markets. The paper investigates and suggests a new methodology to optimize calculations for Estimated Time of Arrival (from now on ETA, meaning the time it will take for the driver to reach the designated location) based on the data provided by GG collected from rides made in 2018. GG is a transportation service providing company, and it currently uses The Open Source Routing Machine (OSRM) which exhibits significant errors in the prediction phase. This paper shows that implementing algorithms such as XGBoost, CatBoost, and Neural Networks for the said task will improve the accuracy of estimation. Paper discusses the benefits and drawbacks of each model and then considers the performance of the stacking algorithm that combines several models into one. Thus, using those techniques, final results showed that Mean Squared Error (MSE) was decreased by 54% compared to the current GG model.

References

M. Dodge and R. Kitchin, “Crowdsourced cartography: Mapping experience and knowledge”, Environment and Planning A: Economy and Space, vol. 45, no. 1, pp. 19–36, 2013.

S. Huber and Ch. Rust, “Calculate travel time and distance with openStreetMap data using the OpenSource Routing Machine (OSRM)”, The State Journal, vol. 16, no. 2, pp. 416–423, 2016.

Y. Li and K. Fu and Zh. Wang and C. Shahabi and J. Ye and Y. Liu, “Multi-task representation learning for travel time estimation”, Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 1695–1704, 2018.

Zh. Wang and K. Fu and J. Ye, “Learning to estimate the travel time”, Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 858–866, 2018.

I. Jindal and T. Qin and X. Chen and M. Nokleby and J. Ye, “A unified neural network approach for estimating travel time and distance for a Taxi Trip”, arXiv:1710.04350v1, [stat.ML], 2017.

T. Chen and C. Guestrin, “XGBoost: A Scalable Tree Boosting System”, Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 785–794, 2016.

L. Prokhorenkova and G. Gusev and A. Vorobev and A. V. Dorogush and A. Gulin, “CatBoost: unbiased boosting with categorical features”, Proceedings of the 32nd International Conference on Neural Information Processing Systems, pp. 6639–6649, 2018.

D. Wang and J. Zhang and W. Cao and J. Li and Y. Zheng, “When will you arrive? Estimating travel time based on deep neural networks”, AAAI, pp 2500–2507, 2018.

Downloads

Published

2021-12-10

How to Cite

Sergoyan, H. T. . (2021). Estimating Time of Driver Arrival with Gradient Boosting Algorithms and Deep Neural Networks. Mathematical Problems of Computer Science, 53, 29–38. https://doi.org/10.51408/1963-0050