TY - JOUR
T1 - Out-of-distribution- and location-aware PointNets for real-time 3D road user detection without a GPU
AU - Seppänen, Alvari
AU - Alamikkotervo, Eerik
AU - Ojala, Risto
AU - Dario, Giacomo
AU - Tammi, Kari
N1 - Publisher Copyright:
© 2023, The Author(s).
PY - 2024/12
Y1 - 2024/12
N2 - 3D road user detection is an essential task for autonomous vehicles and mobile robots, and it plays a key role, for instance, in obstacle avoidance and route planning tasks. Existing solutions for detection require expensive GPU units to run in real-time. This paper presents a light algorithm that runs in real-time without a GPU. The algorithm combines a classical point cloud proposal generator approach with a modern deep learning technique to achieve a small computational requirement and comparable accuracy to the state-of-the-art. Typical downsides of this approach, such as many out-of-distribution proposals and loss of location information, are examined, and solutions are proposed. We have evaluated the performance of the method with the KITTI dataset and with our own annotated dataset collected with a compact mobile robot platform equipped with a low-resolution LiDAR (16-channel). Our approach reaches a real-time inference on a standard CPU, unlike other solutions in the literature. Furthermore, we achieve superior speed on a GPU, which indicates that our method has a high degree of parallelism. Our method enables low-cost mobile robots to detect road users in real-time.
AB - 3D road user detection is an essential task for autonomous vehicles and mobile robots, and it plays a key role, for instance, in obstacle avoidance and route planning tasks. Existing solutions for detection require expensive GPU units to run in real-time. This paper presents a light algorithm that runs in real-time without a GPU. The algorithm combines a classical point cloud proposal generator approach with a modern deep learning technique to achieve a small computational requirement and comparable accuracy to the state-of-the-art. Typical downsides of this approach, such as many out-of-distribution proposals and loss of location information, are examined, and solutions are proposed. We have evaluated the performance of the method with the KITTI dataset and with our own annotated dataset collected with a compact mobile robot platform equipped with a low-resolution LiDAR (16-channel). Our approach reaches a real-time inference on a standard CPU, unlike other solutions in the literature. Furthermore, we achieve superior speed on a GPU, which indicates that our method has a high degree of parallelism. Our method enables low-cost mobile robots to detect road users in real-time.
KW - Deep learning
KW - Limited computational resources
KW - Object detection
KW - Perception
UR - http://www.scopus.com/inward/record.url?scp=85181256585&partnerID=8YFLogxK
U2 - 10.1186/s40537-023-00859-5
DO - 10.1186/s40537-023-00859-5
M3 - Article
AN - SCOPUS:85181256585
SN - 2196-1115
VL - 11
JO - Journal of Big Data
JF - Journal of Big Data
IS - 1
M1 - 2
ER -