Trajectory-based human action segmentation

Luís Santos, Kamrad Khoshhal, Jorge Dias

Research output: Contribution to journalArticlepeer-review

34 Scopus citations

Abstract

This paper proposes a sliding window approach, whose length and time shift are dynamically adaptable in order to improve model confidence, speed and segmentation accuracy in human action sequences. Activity recognition is the process of inferring an action class from a set of observations acquired by sensors. We address the temporal segmentation problem of body part trajectories in Cartesian Space in which features are generated using Discrete Fast Fourier Transform (DFFT) and Power Spectrum (PS). We pose this as an entropy minimization problem. Using entropy from the classifier output as a feedback parameter, we continuously adjust the two key parameters in a sliding window approach, to maximize the model confidence at every step. The proposed classifier is a Dynamic Bayesian Network (DBN) model where classes are estimated using Bayesian inference. We compare our approach with our previously developed fixed window method. Experiments show that our method accurately recognizes and segments activities, with improved model confidence and faster convergence times, exhibiting anticipatory capabilities. Our work demonstrates that entropy feedback mitigates variability problems, and our method is applicable in research areas where action segmentation and classification is used. A working demo source code is provided online for academical dissemination purposes, by requesting the authors.

Original languageBritish English
Pages (from-to)568-579
Number of pages12
JournalPattern Recognition
Volume48
Issue number2
DOIs
StatePublished - 1 Feb 2015

Keywords

  • Adaptive sliding window
  • Classification framework
  • Motion segmentation
  • Motion variability
  • Signal processing

Fingerprint

Dive into the research topics of 'Trajectory-based human action segmentation'. Together they form a unique fingerprint.

Cite this