Image-based Localization using Hourglass Networks

Iaroslav Melekhov, Juha Ylioinas, Juho Kannala, Esa Rahtu

Research output: Chapter in Book/Report/Conference proceedingConference article in proceedingsScientificpeer-review

Abstract

In this paper, we propose an encoder-decoder convolutional neural network (CNN) architecture for estimating camera pose (orientation and location) from a single RGB image. The architecture has a hourglass shape consisting of a chain of convolution and up-convolution layers followed by a regression part. The up-convolution layers are introduced to preserve the fine-grained information of the input image. Following the common practice, we train our model in end-to-end manner utilizing transfer learning from large scale classification data. The experiments demonstrate the performance of the approach on data exhibiting different lighting conditions, reflections, and motion blur. The results indicate a clear improvement over the previous state-of-theart even when compared to methods that utilize sequence of test frames instead of a single frame.
Original languageEnglish
Title of host publication2017 IEEE International Conference on Computer Vision Workshops (ICCVW)
PublisherIEEE
Pages870-877
DOIs
Publication statusPublished - 2017
MoE publication typeA4 Conference publication
EventIEEE International Conference on Computer Vision - Venice, Italy
Duration: 22 Oct 201729 Oct 2017

Publication series

NameIEEE International Conference on Computer Vision workshops
PublisherIEEE
ISSN (Print)2473-9936
ISSN (Electronic)2473-9944

Conference

ConferenceIEEE International Conference on Computer Vision
Abbreviated titleICCV
Country/TerritoryItaly
CityVenice
Period22/10/201729/10/2017

Keywords

  • Cameras
  • Decoding
  • Three-dimensional displays
  • Feature extraction
  • Solid modeling
  • Convolutional codes
  • Computer architecture

Fingerprint

Dive into the research topics of 'Image-based Localization using Hourglass Networks'. Together they form a unique fingerprint.

Cite this