Marras, I., Tzimiropoulos, Georgios, Zafeiriou, S. and Pantic, M. (2014) Online learning and fusion of orientation appearance models for robust rigid object tracking. Image and Vision Computing, 32 (10). pp. 707-727. ISSN 0262-8856
Item Type: | Article |
---|---|
Item Status: | Live Archive |
Abstract
We introduce a robust framework for learning and fusing orientation appearance models based on both texture and depth information for rigid object tracking. Our framework fuses data obtained from a standard visual camera with dense depth maps obtained from low-cost consumer depth cameras such as the Kinect. To combine these two completely different modalities, we propose to use features that do not depend on the data representation: angles. More specifically, our framework combines image gradient orientations extracted from intensity images with the directions of surface normals computed from dense depth fields. We propose to capture the correlations between the obtained orientation appearance models using a fusion approach motivated by the original Active Appearance Models (AAMs). To incorporate these features in a learning framework, we use a robust kernel based on the Euler representation of angles, which does not require off-line training and can be efficiently implemented online. The robustness of learning from orientation appearance models is shown both theoretically and experimentally in this work. This kernel enables us to cope with gross measurement errors, missing data, and other typical problems such as illumination changes and occlusions. Combined with a particle filter, the proposed framework was used to perform 2D plus 3D rigid object tracking, achieving robust performance in very difficult tracking scenarios including extreme pose variations. © 2014 Elsevier B.V. All rights reserved.
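The angle-based kernel mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes the Euler representation maps each angle θ to the complex number exp(iθ), so that the kernel between two angle vectors reduces to the mean cosine of their angle differences. The function names `gradient_orientations`, `euler_map` and `angle_kernel`, and the 1/sqrt(N) normalization, are hypothetical choices made for illustration.

```python
import numpy as np


def gradient_orientations(image):
    """Return per-pixel gradient orientation angles (radians) of a 2D image."""
    image = np.asarray(image, dtype=float)
    # np.gradient returns derivatives along axis 0 (rows, y) then axis 1 (columns, x).
    gy, gx = np.gradient(image)
    return np.arctan2(gy, gx)


def euler_map(angles):
    """Map angles to unit complex numbers (assumed Euler representation).

    The 1/sqrt(N) scaling is an illustrative assumption so that the
    mapped vector has unit norm.
    """
    angles = np.asarray(angles, dtype=float).ravel()
    return np.exp(1j * angles) / np.sqrt(angles.size)


def angle_kernel(a, b):
    """Similarity of two angle fields: the mean of cos(a_i - b_i)."""
    za, zb = euler_map(a), euler_map(b)
    # np.vdot conjugates its first argument, giving (1/N) * sum(exp(i * (b - a))).
    return float(np.real(np.vdot(za, zb)))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    clean = rng.uniform(-np.pi, np.pi, size=10_000)

    # Corrupt half of the entries with unrelated angles (simulated occlusion).
    corrupted = clean.copy()
    corrupted[:5_000] = rng.uniform(-np.pi, np.pi, size=5_000)

    print("identical fields:", angle_kernel(clean, clean))
    print("half corrupted:  ", angle_kernel(clean, corrupted))
```

Under these assumptions, entries whose angle differences are roughly uniformly distributed contribute cosines that average out near zero, so corrupted regions tend to have little influence on the similarity. This is one intuition for the robustness to occlusions and gross errors that the abstract describes.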
Additional Information: | Best of Automatic Face and Gesture Recognition 2013 |
---|---|
Keywords: | Cameras, Tracking (position), Appearance models, Face analysis, Online learning, RGB-D, Rigid objects, Subspace learning, E-learning, JCNotOpen |
Subjects: | G Mathematical and Computer Sciences > G740 Computer Vision |
Divisions: | College of Science > School of Computer Science |
ID Code: | 14858 |
Deposited On: | 29 Sep 2014 09:48 |