In October 2003, a head pose video database was built at the Idiap Research Institute. The objective was to construct a video database that enables quantitative evaluation of algorithms that extract information related to people's head pose, such as head tracking and pose estimation algorithms, or focus-of-attention analysis. No such database existed before (at least not publicly). Making our database publicly available to researchers allows rigorous comparisons between algorithms.


The database comprises two sets of videos involving people engaged in natural activities. In the first set, people participate in meetings and debate statements displayed on a screen. In the second set, people perform tasks in their office. In all cases, the head pose of each person is continuously annotated using a magnetic 3D location and orientation tracker called Flock of Birds. In both sets, the head pose of 16 different people was recorded.

The database was built in the context of two projects.

  • The first one is the Multi-Object, Multi-Camera Tracking and Activity Recognition (MUCATAR) project, which aims to develop probabilistic algorithms for joint people tracking and activity recognition. The MUCATAR project is funded by the Swiss National Center of Competence in Research (NCCR) on Interactive Multimodal Information Management (IM)2, which is devoted to the advancement of research, and the development of prototypes, in the field of man-machine interaction.
  • The second project is AMI (Augmented Multi-party Interaction), an EC IST-funded project that targets the advancement of computer-enhanced multi-modal interaction in the context of meetings.
