
Script Data for Attribute-Based Recognition of Composite Activities

Marcus Rohrbach1, Michaela Regneri2, Mykhaylo Andriluka1, Sikandar Amin1, 3, Manfred Pinkal2, and Bernt Schiele1

1Max Planck Institute for Informatics, Saarbrücken, Germany

2Department of Computational Linguistics, Saarland University, Germany

3Department of Computer Science, Technische Universität München, Germany

Abstract. State-of-the-art human activity recognition methods build on discriminative learning, which requires a representative training set for good performance. This leads to scalability issues for the recognition of large sets of highly diverse activities. In this paper we leverage the fact that many human activities are compositional and that the essential components of the activities can be obtained from textual descriptions or scripts. To share and transfer knowledge between composite activities, we model them by a common set of attributes corresponding to basic actions and object participants. This attribute representation allows us to incorporate script data that delivers new variations of a composite activity or even extends to unseen composite activities. In our experiments on 41 composite cooking tasks, we found that script data successfully captures the high variability of composite activities. We show improvements in a supervised case where training data for all composite cooking tasks is available, but we are also able to recognize unseen composites using only script data, without any manual video annotation.
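
The idea of describing composites by shared attributes and filling in unseen composites from scripts can be illustrated with a minimal sketch. This is not the model used in the paper, which the abstract does not specify. It assumes that attribute classifiers already produce a confidence per attribute for a video, and that the association between a composite and an attribute can be approximated by how often the attribute word occurs in the scripts for that composite. The vocabulary, the scripts, the scores, and all function names below are hypothetical.

```python
from collections import Counter

# Hypothetical attribute vocabulary: basic actions and object participants.
VOCABULARY = ["cut", "peel", "wash", "carrot", "cucumber", "knife", "peeler"]


def script_attribute_weights(scripts, vocabulary):
    """Fraction of a composite's scripts that mention each attribute word."""
    counts = Counter()
    for script in scripts:
        words = set(script.lower().split())
        counts.update(a for a in vocabulary if a in words)
    return {a: counts[a] / len(scripts) for a in vocabulary}


def composite_score(attribute_scores, weights):
    """Weighted sum of per-video attribute confidences (assumed scoring rule)."""
    return sum(attribute_scores.get(a, 0.0) * w for a, w in weights.items())


def recognize(attribute_scores, composite_weights):
    """Return the composite whose script-derived weights best match the video."""
    return max(
        composite_weights,
        key=lambda c: composite_score(attribute_scores, composite_weights[c]),
    )


# Made-up scripts for two composites; no video annotation of composites is used.
scripts = {
    "preparing carrot": [
        "wash the carrot peel it with a peeler cut it with a knife",
        "peel the carrot cut the carrot",
    ],
    "preparing cucumber": [
        "wash the cucumber cut the cucumber with a knife",
        "cut the cucumber",
    ],
}
composite_weights = {
    name: script_attribute_weights(texts, VOCABULARY)
    for name, texts in scripts.items()
}

# Made-up attribute classifier confidences for one video.
video_attributes = {"peel": 0.9, "carrot": 0.8, "peeler": 0.7, "cut": 0.2}
predicted = recognize(video_attributes, composite_weights)
```

Because the composite-level weights come only from text, a new composite can be added by supplying scripts for it, as long as its attributes are already covered by the shared vocabulary.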

LNCS 7572, p. 144 ff.


© Springer-Verlag Berlin Heidelberg 2012