Article type: Research Article
Authors: Carranza-García, Manuel* | Galán-Sales, F. Javier | Luna-Romera, José María | Riquelme, José C.
Affiliations: Division of Computer Science, University of Sevilla, Sevilla, Spain
Correspondence: [*] Corresponding author: Manuel Carranza-García, Division of Computer Science, University of Sevilla, Sevilla, Spain. E-mail: mcarranzag@us.es.
Abstract: Autonomous vehicles are equipped with complementary sensors to perceive the environment accurately. Deep learning models have proven to be the most effective approach for computer vision problems. Therefore, in autonomous driving, it is essential to design reliable networks to fuse data from different sensors. In this work, we develop a novel data fusion architecture using camera and LiDAR data for object detection in autonomous driving. Given the sparsity of LiDAR data, developing multi-modal fusion models is a challenging task. Our proposal integrates an efficient LiDAR sparse-to-dense completion network into the pipeline of object detection models, achieving more robust performance at different times of the day. The experimental study uses the Waymo Open Dataset, the most diverse detection benchmark in terms of weather and lighting conditions. The depth completion network is trained with the KITTI depth dataset, and transfer learning is used to obtain dense maps on Waymo. With the enhanced LiDAR data and the camera images, we explore early and middle fusion approaches using popular object detection models. The proposed data fusion network provides a significant improvement compared to single-modal detection at all times of the day, and outperforms previous approaches that upsample depth maps with classical image processing algorithms. Our multi-modal and multi-source approach achieves a 1.5, 7.5, and 2.1 mean AP increase at day, night, and dawn/dusk, respectively, using four different object detection meta-architectures.
Keywords: Autonomous driving, data fusion, deep learning, object detection, transfer learning
DOI: 10.3233/ICA-220681
Journal: Integrated Computer-Aided Engineering, vol. 29, no. 3, pp. 241-258, 2022
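The abstract describes fusing camera images with LiDAR depth maps that have been densified by a sparse-to-dense completion network. As an illustration only (not the authors' implementation), the sketch below shows the early-fusion idea in PyTorch: the densified depth map is concatenated channel-wise with the RGB image before entering a detector backbone whose first convolution accepts four channels. The module name `EarlyFusionBackbone`, the layer sizes, and the input resolution are assumptions for demonstration purposes.

```python
import torch
import torch.nn as nn


class EarlyFusionBackbone(nn.Module):
    """Toy backbone stem whose first convolution accepts a 4-channel
    RGB-D input (3 camera channels + 1 densified LiDAR depth channel).
    Illustrative only; the paper's detectors are full meta-architectures."""

    def __init__(self, num_features: int = 64):
        super().__init__()
        self.stem = nn.Sequential(
            nn.Conv2d(4, num_features, kernel_size=7, stride=2, padding=3, bias=False),
            nn.BatchNorm2d(num_features),
            nn.ReLU(inplace=True),
        )

    def forward(self, rgb: torch.Tensor, dense_depth: torch.Tensor) -> torch.Tensor:
        # Early fusion: concatenate modalities along the channel dimension
        # before any feature extraction takes place.
        x = torch.cat([rgb, dense_depth], dim=1)  # (N, 4, H, W)
        return self.stem(x)


if __name__ == "__main__":
    rgb = torch.rand(1, 3, 384, 1248)          # camera image (hypothetical size)
    dense_depth = torch.rand(1, 1, 384, 1248)  # output of a depth-completion network
    features = EarlyFusionBackbone()(rgb, dense_depth)
    print(features.shape)  # torch.Size([1, 64, 192, 624])
```

A middle-fusion variant would instead run separate stems on the RGB and depth inputs and merge their feature maps deeper in the network; the paper compares both strategies across four detection meta-architectures.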