Learning to recognize human action sequences

25 June 2003

proceedings article
Published by Institute of Electrical and Electronics Engineers (IEEE)

p. 28-33
https://doi.org/10.1109/devlrn.2002.1011726

Abstract

One of the major sources of cues in developmental learning is that of watching another person. An observer can gain a comprehensive description of the purposes of actions by watching the other person's detailed bode, movements. Action recognition has traditionally studied processing fixed camera observations while ignoring nonvisual information. This paper explores the dynamic properties of eye movements in natural tasks: eye and head movements are quite tightly coupled with actions. We present a method that utilizes eye gaze and head position information to detect the performer's focus of attention. Attention, as represented by eye fixation, is used for spotting the target object related to the action. Attention switches are calculated and used to segment the action sequence into action units which are recognized by hidden Markov models. An experimental system is built for recognizing actions in the natural task of "stapling a letter", which demonstrates the effectiveness of the approach.

Keywords

This publication has 15 references indexed in Scilit:

Invariant features for 3-D gesture recognition
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2002
Coupled hidden Markov models for complex action recognition
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2002
Integration of speech and vision using mutual information
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2002
Segmenting visual actions based on spatio-temporal motion patterns
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2002
Learning audio-visual associations using mutual information
Published by Institute of Electrical and Electronics Engineers (IEEE) ,1999
Movement, activity and action: the role of knowledge in the perception of motion
Philosophical Transactions Of The Royal Society B-Biological Sciences, 1997
Grounding Language in Perception
Published by Springer Nature ,1995
Seeded region growing
IEEE Transactions on Pattern Analysis and Machine Intelligence, 1994
Learning by watching: extracting reusable task knowledge from visual observation of human performance
IEEE Transactions on Robotics and Automation, 1994
The objective basis of behavior units.
Journal of Personality and Social Psychology, 1977

Cited by 5 articles