Understanding videos, constructing plots learning a visually grounded storyline model from annotated videos
Top Cited Papers
- 1 June 2009
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- p. 2012-2019
- https://doi.org/10.1109/cvpr.2009.5206492
Abstract
Analyzing videos of human activities involves not only recognizing actions (typically based on their appearances), but also determining the story/plot of the video. The storyline of a video describes causal relationships between actions. Beyond recognition of individual actions, discovering causal relationships helps to better understand the semantic meaning of the activities. We present an approach to learn a visually grounded storyline model of videos directly from weakly labeled data. The storyline model is represented as an AND-OR graph, a structure that can compactly encode storyline variation across videos. The edges in the AND-OR graph correspond to causal relationships which are represented in terms of spatio-temporal constraints. We formulate an Integer Programming framework for action recognition and storyline extraction using the storyline model and visual groundings learned from training data.Keywords
This publication has 15 references indexed in Scilit:
- Objects in Action: An Approach for Combining Action Understanding and Object PerceptionPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2007
- Hello! My name is... Buffy'' -- Automatic Naming of Characters in TV VideoPublished by British Machine Vision Association and Society for Pattern Recognition ,2006
- Time as a guide to cause.Journal of Experimental Psychology: Learning, Memory, and Cognition, 2006
- Protocols from perceptual observationsArtificial Intelligence, 2005
- On Space-Time Interest PointsInternational Journal of Computer Vision, 2005
- Extracting actors, actions and events from sports video -a fundamental approach to story trackingPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2005
- Histograms of Oriented Gradients for Human DetectionPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2005
- Action recognition using probabilistic parsingPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- 10.1162/153244303322533214Applied Physics Letters, 2000
- Parametric hidden Markov models for gesture recognitionIEEE Transactions on Pattern Analysis and Machine Intelligence, 1999