Idiap on LinkedIn Idiap youtube channel Idiap on Twitter Idiap on Facebook
Personal tools
You are here: Home Research Resources Speaker Diarization Toolkit

Speaker Diarization Toolkit

— filed under:

The toolkit is intended to facilitate research in multistream speaker diarization providing a platform for research in novel audio, video or location features. It is based on the Information Bottleneck principle and is explicitely designed to use of several hetergenous feature streams.

Scientific papers [1,2,3] refer to results obtained on meeting recordings. The original formulation was proposed in [1] based on acoustic (MFCC) information only. Later the approach was exthended to include also MFCC and DOA features [2] and in [3] the IB was applied to the combination of four different feature streams.

References:

[1] An Information Theoretic Approach to Speaker Diarization of Meeting Data, Deepu Vijayasenan, Fabio Valente and Hervé Bourlard, in: IEEE Transactions on Audio Speech and Language Processing, 17(7), 2009

[2] An Information Theoretic Combination of MFCC and TDOA Features for Speaker Diarization, Deepu Vijayasenan, Fabio Valente and Hervé Bourlard, in: IEEE Transactions on Audio Speech and Language Processing, 19(2), 2011

[3] Multistream speaker diarization of meetings recordings beyond MFCC and TDOA features, Deepu Vijayasenan, Fabio Valente and Hervé Bourlard, in: Speech Communication, 54(1), 2012

[4] Improving Real Time Factor of Information Bottleneck-based Speaker Diarization System, Madikeri Srikanth, Imseng David and Bourlard Hervé, Idiap-RR-18-2015, 2015

 

 

Download

IBDiarization

Document Actions
Resource Information
Resource type: software
URL: https://github.com/idiap/IBDiarization
Date: Sep 18, 2015
Nature: Source code distribution
Size: 27 264 KB
Audience: Speech research
Access: Web
Ownership: Idiap Research Institute
Distribution: Web
License:
Contact: Hervé BOURLARD
+41 277 217 720