
AV16.3: an Audio-Visual Corpus for Speaker Localization and Tracking

The AV16.3 corpus is an audio-visual corpus of real indoor multispeaker data, designed to test algorithms for audio-only, video-only and audio-visual speaker localization and tracking.

Real human speakers were used. The recordings were chosen to push algorithms to their limits and to cover a wide range of application scenarios (meetings, surveillance). The emphasis is on overlapped speech and multiple moving speakers. Most recordings are dynamic scenarios with single or multiple moving speakers. A few meeting scenarios, with mostly seated speakers, are also included.

The full database description and the download are available on the AV16.3 corpus page: http://www.idiap.ch/av16_3corpus

Resource Information
Resource type: database
URL: http://www.idiap.ch/av16_3corpus
Date: Nov 17, 2008
Size: ~11 GB
Distribution: 3 DVDs + web browsing (instead of 6 DVDs)
License: Idiap license

Contact: Contact us