TY - GEN
T1 - Bio-inspired dynamic 3D discriminative skeletal features for human action recognition
AU - Chaudhry, Rizwan
AU - Ofli, Ferda
AU - Kurillo, Gregorij
AU - Bajcsy, Ruzena
AU - Vidal, Rene
PY - 2013
Y1 - 2013
N2 - Over the last few years, with the immense popularity of the Kinect, there has been renewed interest in developing methods for human gesture and action recognition from 3D data. A number of approaches have been proposed that extract representative features from 3D depth data, a reconstructed 3D surface mesh or more commonly from the recovered estimate of the human skeleton. Recent advances in neuroscience have discovered a neural encoding of static 3D shapes in primate infero-temporal cortex that can be represented as a hierarchy of medial axis and surface features. We hypothesize a similar neural encoding might also exist for 3D shapes in motion and propose a hierarchy of dynamic medial axis structures at several spatio-temporal scales that can be modeled using a set of Linear Dynamical Systems (LDSs). We then propose novel discriminative metrics for comparing these sets of LDSs for the task of human activity recognition. Combined with simple classification frameworks, our proposed features and corresponding hierarchical dynamical models provide the highest human activity recognition rates as compared to state-of-the-art methods on several skeletal datasets.
AB - Over the last few years, with the immense popularity of the Kinect, there has been renewed interest in developing methods for human gesture and action recognition from 3D data. A number of approaches have been proposed that extract representative features from 3D depth data, a reconstructed 3D surface mesh or more commonly from the recovered estimate of the human skeleton. Recent advances in neuroscience have discovered a neural encoding of static 3D shapes in primate infero-temporal cortex that can be represented as a hierarchy of medial axis and surface features. We hypothesize a similar neural encoding might also exist for 3D shapes in motion and propose a hierarchy of dynamic medial axis structures at several spatio-temporal scales that can be modeled using a set of Linear Dynamical Systems (LDSs). We then propose novel discriminative metrics for comparing these sets of LDSs for the task of human activity recognition. Combined with simple classification frameworks, our proposed features and corresponding hierarchical dynamical models provide the highest human activity recognition rates as compared to state-of-the-art methods on several skeletal datasets.
UR - https://www.scopus.com/pages/publications/84884947071
U2 - 10.1109/CVPRW.2013.153
DO - 10.1109/CVPRW.2013.153
M3 - Conference contribution
AN - SCOPUS:84884947071
SN - 9780769549903
T3 - IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops
SP - 471
EP - 478
BT - Proceedings - 2013 IEEE Conference on Computer Vision and Pattern Recognition Workshops, CVPRW 2013
T2 - 2013 IEEE Conference on Computer Vision and Pattern Recognition Workshops, CVPRW 2013
Y2 - 23 June 2013 through 28 June 2013
ER -