Abstract
In this paper we present a new RGB-D dataset captured with the Kinect sensor. The dataset is composed of typical children’s toys and contains a total of 449 RGB-D images alongside with their annotated ground truth images. Compared to existing RBG-D object segmentation datasets, the objects in our proposed dataset have more complex shapes and less texture. The images are also crowded and thus highly occluded. Three state-of-the-art segmentation methods are benchmarked using the dataset. These methods attack the problem of object segmentation from different starting points, providing a comprehensive view on the properties of the proposed dataset as well as the state-of-the-art performance. The results are mostly satisfactory but there remains plenty of room for improvement. This novel dataset thus poses the next challenge in the area of RGB-D object segmentation.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the 11th Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications |
| Publisher | SciTePress |
| Pages | 107-116 |
| ISBN (Electronic) | 978-989-758-175-5 |
| DOIs | |
| Publication status | Published - 2016 |
| MoE publication type | A4 Conference publication |
| Event | Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Rome, Italy Duration: 27 Feb 2016 → 29 Feb 2016 Conference number: 11 |
Conference
| Conference | Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications |
|---|---|
| Abbreviated title | VISIGRAPP |
| Country/Territory | Italy |
| City | Rome |
| Period | 27/02/2016 → 29/02/2016 |
Keywords
- benchmarking
- complex objects
- dataset
- object segmentation
- real world objects
- RGB-D segmentation