Object Category Recognition through RGB-D Data

20 May 2014

Pedro F. Proença ADETTI-IUL, ISCTE-IUL

Depth sensing technology of existing RGB-D sensors (e.g. Kinect), is now capable of capturing reliable 3D information of our world in real-time. So far, this availability of Depth along with RGB Information has led several researchers to prove the usefulness of this type of multimodality on several computer vision tasks: Object recognition, categorization, detection and pose estimation.

This talk will focus on the problem of object categorization, where the goal is to predict the category of a never-before-seen object instance. Our recent work has shown how an efficient non-parametric classifier: Naive Bayes Nearest Neighbor can compete with sophisticated learning-based approaches. In that work, local image descriptors and local 3D surface descriptors were used to exploit respectively the RGB and Depth channel. Experimental results on a large-scale object dataset (51 classes) will be discussed, regarding the performance of each descriptor-type. This talk will also address issues such as imbalanced training-sets, feature combination, scalability, segmentation and how the proposed approach can be easily extended to real live-data.



Pedro F. Proença completed both his MSc (2013) and BSc (2011) degree in Telecommunications and Computer Science at ISCTE – IUL, Lisbon University Institute, Portugal. Since then he is been working at ADETTI-IUL (Advanced IS/IT Research Center) as a researcher in Object Recognition and Emotion Recognition. His research interests are in Machine Learning, Computer Vision, 3D and signal processing.