![]() |
|
![]() |
|
Evans & Sutherland Distinguished Lecture Series Recognizing Objects and Actions in Images and Video Jitendra Malik University of California Berkeley
Host: Bill Thompson Abstract The object recognition problem is that of finding instances of object classes in an image or video sequence: faces, giraffes, the digit 5, chairs etc. This has to be accomplished while allowing for intra-class variation, as well as changes in illumination and viewpoint. We have developed a theory of object recognition by measuring shape similarity, using dense point correspondences based on robust relational descriptors: ``shape contexts'' and "geometric blur templates". I will show results on a variety of 2D and 3D recognition problems. The action recognition problem is that of finding instances of actions in video sequences: run, jump, kick etc. This has to be accomplished while allowing for variation in the person performing the action, clothing, illumination and viewpoint. We have developed two approaches to recognition of actions. In low resolution data, (``far field'') the approach is based on collecting low resolution optical flow measurements over a spatiotemporal volume for each moving figure, constructing a robust descriptor from this volume, and then matching these to stored sequences. We show generalization over person, clothing and illumination while pose variations are dealt in a multiple-view framework. In high resolution data (``near field'') the approach is based on extracting stick figures in each frame, and relying on joint level human body tracking to provide a complete intermediate representation which is robust to lighting, clothing as well as pose. This talk is based on joint work; please visit Vision Groupfor pointers to publications.
|
School of Computing 50 S. Central Campus Dr. Rm. 3190 Salt Lake City, UT 84112
801-581-8224 Send comments to webmaster@cs.utah.edu
Disclaimer