Abstract
Image-based camera relocalization is an important problem in computer vision and robotics. Recent works utilize convolutional neural networks (CNNs) to regress, for each pixel in a query image, its corresponding 3D world coordinates in the scene. The final pose is then solved via a RANSAC-based optimization scheme using the predicted coordinates. Usually, the CNN is trained with ground truth scene coordinates, but it has also been shown that the network can discover 3D scene geometry automatically by minimizing a single-view reprojection loss. However, due to the deficiencies of the reprojection loss, the network needs to be carefully initialized. In this paper, we present a new angle-based reprojection loss, which resolves the issues of the original reprojection loss. With this new loss function, the network can be trained without careful initialization, and the system achieves more accurate results. The new loss also enables us to utilize available multi-view constraints, which further improve performance.
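As a rough illustration of the idea named in the abstract, the sketch below computes an angle between two viewing rays: the ray through an observed pixel and the ray towards the predicted scene coordinate expressed in the camera frame. This is only one plausible reading of an "angle-based" loss, not the paper's exact formulation. The function name `angle_based_loss`, the argument layout, and the world-to-camera convention `x_cam = R @ x_world + t` are all assumptions made for this example.

```python
import numpy as np

def angle_based_loss(scene_xyz, pixels, K, R, t):
    """Mean angle (radians) between predicted and observed viewing rays.

    Hypothetical helper, not taken from the paper.
    scene_xyz: (N, 3) predicted world coordinates
    pixels:    (N, 2) pixel positions (u, v)
    K:         (3, 3) camera intrinsics
    R, t:      assumed world-to-camera rotation (3, 3) and translation (3,)
    """
    # Move the predicted world points into the camera frame.
    cam_pts = scene_xyz @ R.T + t
    # Back-project each pixel to a ray direction: K^{-1} [u, v, 1]^T.
    homog = np.hstack([pixels, np.ones((pixels.shape[0], 1))])
    rays = homog @ np.linalg.inv(K).T
    # Compare unit directions only, so depth does not enter the loss.
    cam_dirs = cam_pts / np.linalg.norm(cam_pts, axis=1, keepdims=True)
    ray_dirs = rays / np.linalg.norm(rays, axis=1, keepdims=True)
    cos = np.clip(np.sum(cam_dirs * ray_dirs, axis=1), -1.0, 1.0)
    return float(np.mean(np.arccos(cos)))

# Toy camera: identity pose, principal point at (320, 240).
K = np.array([[500.0, 0.0, 320.0],
              [0.0, 500.0, 240.0],
              [0.0, 0.0, 1.0]])
R, t = np.eye(3), np.zeros(3)
pixel = np.array([[320.0, 240.0]])

in_front = angle_based_loss(np.array([[0.0, 0.0, 2.0]]), pixel, K, R, t)
behind = angle_based_loss(np.array([[0.0, 0.0, -2.0]]), pixel, K, R, t)
```

In this toy setup the point in front of the camera gives an angle near zero, while the mirrored point behind the camera gives an angle near pi, even though both project to the same pixel under a plain pinhole projection. That is one way an angular formulation can penalize predictions that a pixel-distance reprojection error would accept.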
Original language | English |
---|---|
Title | Computer Vision – ECCV 2018 Workshops |
Subtitle | Munich, Germany, September 8-14, 2018, Proceedings, Part III |
Publisher | Springer |
Pages | 229-245 |
Volume | 3 |
ISBN (electronic) | 978-3-030-11015-4 |
ISBN (print) | 978-3-030-11014-7 |
DOI - permanent links | |
Status | Published - 2019 |
OKM publication type | A4 Article in conference proceedings |
Event | EUROPEAN CONFERENCE ON COMPUTER VISION - Munich, Germany Duration: 8 Sept 2018 → 14 Sept 2018 Conference number: 15 |
Publication series
Name | Lecture Notes in Computer Science |
---|---|
Volume | 11131 |
Conference
Conference | EUROPEAN CONFERENCE ON COMPUTER VISION |
---|---|
Abbreviated title | ECCV |
Country/Territory | Germany |
City | Munich |
Period | 08/09/2018 → 14/09/2018 |