Human Pose Estimation using Depth Video
-Research assistant, Computer vision lab, USC, 2011. 4 ~ current
Estimate and track human pose in depth video without initialization
3D Object Classification / Recognition
-Research assistant, Computer vision lab, USC, 2009. 1 ~ current
3D Object Recognition in Range Images Using Visibility Context
Abstract: Recognizing and localizing queried objects in range images play an important role for robotic manipulation and navigation. Even though it has been steadily studied, it is still a challenging task for scenes with occlusion and clutter.
We present a novel approach to object recognition that boosts dissimilarity between queried objects and similar-shaped background objects in the scene by maximizing use of the visibility context. We design a new point pair feature containing discriminative description inferred from the visibility context.
Also, we propose a pose estimation method that accurately localizes objects using these point pair matches. Finally, two measures of validity are suggested to discard false detections.
With 10 query objects, our approach is evaluated on depth images of cluttered office scenes captured from a real-time range sensor. The experimental results demonstrate that our method remarkably outperforms two state-of-the-art methods in terms of recognition (recall & precision) and runtime
performance.
Scalable Object Classification in Range Images
Abstract: We present a novel scalable framework for free-form object classification in range images. The framework includes an automatic 3D object recognition system in range images and a scalable database structure to learn new instances and new categories efficiently.We adopt the TAX model, previously proposed for unsupervised object modeling in 2D images, to construct our hierarchical model of object classes from unlabelled range images. The hierarchical model embodies unorganized shape patterns of 3D objects in various classes in a tree structure with probabilistic distributions. A new visual vocabulary is introduced to represent a range image as a set of visual words for the process of hierarchical model inference, classification and online learning. We also propose an online learning algorithm that updates the hierarchical model efficiently thanks to the tree structure, when a new object should be learned into the model. Extensive experiments demonstrate average classification rates of 94% on a large synthetic dataset (1,350 training images and 450 test images for 9 object classes) and 88.4% on 1,433 depth images captured from real-time range sensors. We also show that our approach outperforms the original TAX method in terms of recall rate and stability. We validated the module on a large-scale LIDAR dataset as well.
(Example of the HSD: Each node represents the discrete probabilistic distribution at each node. Due to the limited space, we only show the paths the test objects belong to, and the paths which share the node with new paths and have the most training images among the branches. For every existing path, the example training object under the path is displayed in the purple box)
(Example result on range images captured from Swiss Ranger SR 3000)
(Example result on range images captured from Prime sensor)
Related publication
Eunyoung Kim and Gerard Medioni, Scalable Object Classification Using Range images, 3DIMPVT 2011.


Sorry, the comment form is closed at this time.