Understanding videos, constructing plots learning a visually grounded storyline model from annotated videos

Top Cited Papers

1 June 2009

conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

p. 2012-2019
https://doi.org/10.1109/cvpr.2009.5206492

Abstract

Analyzing videos of human activities involves not only recognizing actions (typically based on their appearances), but also determining the story/plot of the video. The storyline of a video describes causal relationships between actions. Beyond recognition of individual actions, discovering causal relationships helps to better understand the semantic meaning of the activities. We present an approach to learn a visually grounded storyline model of videos directly from weakly labeled data. The storyline model is represented as an AND-OR graph, a structure that can compactly encode storyline variation across videos. The edges in the AND-OR graph correspond to causal relationships which are represented in terms of spatio-temporal constraints. We formulate an Integer Programming framework for action recognition and storyline extraction using the storyline model and visual groundings learned from training data.

Keywords

This publication has 15 references indexed in Scilit:

Objects in Action: An Approach for Combining Action Understanding and Object Perception
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2007
Hello! My name is... Buffy'' -- Automatic Naming of Characters in TV Video
Published by British Machine Vision Association and Society for Pattern Recognition ,2006
Time as a guide to cause.
Journal of Experimental Psychology: Learning, Memory, and Cognition, 2006
Protocols from perceptual observations
Artificial Intelligence, 2005
On Space-Time Interest Points
International Journal of Computer Vision, 2005
Extracting actors, actions and events from sports video -a fundamental approach to story tracking
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2005
Histograms of Oriented Gradients for Human Detection
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2005
Action recognition using probabilistic parsing
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2002
10.1162/153244303322533214
Applied Physics Letters, 2000
Parametric hidden Markov models for gesture recognition
IEEE Transactions on Pattern Analysis and Machine Intelligence, 1999

Cited by 148 articles