3D Pose and Shape Estimation
This research area focuses on localizing human body and hand joints in images and video. We aim to estimate 2D and 3D keypoints and to reconstruct the 3D surfaces of the body and hands, especially as a person interacts with objects. Research outcomes will support downstream applications in human-computer interaction and in augmented and virtual reality.
Video Understanding
This research area focuses on the semantic interpretation of video sequences. We specifically target videos of people to answer the fundamental question “What is the person doing?” To that end, we tackle tasks such as action recognition, temporal action segmentation, and action anticipation. Research outcomes will advance high-impact applications such as autonomous driving, home assistance robots, content-based video indexing and retrieval, and other intelligent visual systems.