Learning to recognize human action sequences
- 25 June 2003
- proceedings article
- Published by Institute of Electrical and Electronics Engineers (IEEE)
Abstract
One of the major sources of cues in developmental learning is that of watching another person. An observer can gain a comprehensive description of the purposes of actions by watching the other person's detailed body movements. Action recognition has traditionally focused on processing fixed-camera observations while ignoring nonvisual information. This paper explores the dynamic properties of eye movements in natural tasks: eye and head movements are quite tightly coupled with actions. We present a method that utilizes eye gaze and head position information to detect the performer's focus of attention. Attention, as represented by eye fixation, is used for spotting the target object related to the action. Attention switches are calculated and used to segment the action sequence into action units, which are recognized by hidden Markov models. An experimental system is built for recognizing actions in the natural task of "stapling a letter", which demonstrates the effectiveness of the approach.
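The two-stage idea in the abstract (segment at attention switches, then score each unit with hidden Markov models) can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the paper: fixations are assumed to arrive as one object label per frame, very short runs are treated as gaze noise, the observations inside a unit are assumed to be discrete symbols, and the function names `segment_by_attention`, `log_likelihood`, and `classify` as well as the toy models are hypothetical.

```python
import math
from itertools import groupby


def segment_by_attention(fixations, min_len=2):
    """Split per-frame fixation labels into (label, start, end) units.

    `end` is exclusive. A run shorter than `min_len` frames is treated as
    noise and merged into the preceding unit (assumption, not from the paper).
    A short run at the very start is kept, since there is nothing to merge into.
    """
    units = []
    idx = 0
    for label, run in groupby(fixations):
        n = len(list(run))
        if units and (n < min_len or units[-1][0] == label):
            # Extend the previous unit instead of opening a new one.
            prev_label, start, _ = units[-1]
            units[-1] = (prev_label, start, idx + n)
        else:
            units.append((label, idx, idx + n))
        idx += n
    return units


def log_likelihood(obs, start, trans, emit):
    """Scaled forward algorithm for a discrete-observation HMM.

    start[s]    : initial probability of state s
    trans[r][s] : probability of moving from state r to state s
    emit[s][o]  : probability of emitting symbol o in state s
    """
    n = len(start)
    alpha = [start[s] * emit[s][obs[0]] for s in range(n)]
    loglik = 0.0
    for t in range(len(obs)):
        if t > 0:
            alpha = [
                sum(alpha[r] * trans[r][s] for r in range(n)) * emit[s][obs[t]]
                for s in range(n)
            ]
        scale = sum(alpha)
        if scale == 0.0:
            return float("-inf")  # sequence is impossible under this model
        loglik += math.log(scale)
        alpha = [a / scale for a in alpha]  # rescale to avoid underflow
    return loglik


def classify(obs, models):
    """Return the name of the model with the highest log-likelihood."""
    return max(models, key=lambda name: log_likelihood(obs, *models[name]))


# Toy data: gaze labels per frame, with a one-frame glance at the stapler.
fixations = ["paper"] * 4 + ["stapler"] + ["paper"] * 2 + ["stapler"] * 5 + ["paper"] * 3
units = segment_by_attention(fixations)

# Hypothetical single-state models over two discrete symbols (0 and 1).
models = {
    "pick_up": ([1.0], [[1.0]], [[0.9, 0.1]]),
    "press": ([1.0], [[1.0]], [[0.1, 0.9]]),
}
label = classify([0, 0, 1], models)
```

In this sketch each unit returned by `segment_by_attention` would supply its own observation sequence to `classify`; how the paper derives those observations from eye, head, and hand data is not stated in the abstract.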