- AudioTag
last modified Apr 21, 2026 11:36 AM
The goal of the project is:
1) to run a keyword spotter on audio files located on a server or on a mobile device and attach user-defined tags. The files are ...
Located in Research / Projects
- Accessing Dynamic Networked Multimedia Events
last modified Apr 21, 2026 11:36 AM
The main goal of inEvent is to develop new means to structure, retrieve, and share large archives of networked, and dynamically changing, multimedia ...
Located in Research / Projects
- Semantically Self-Organized Distributed Web Search
last modified Apr 21, 2026 11:36 AM
In this project we wish to develop a new search engine distributed over available web servers, in contrast to existing search engines centralized at a single ...
Located in Research / Projects
- Role based speaker diarization
last modified Apr 24, 2026 12:29 AM
Speaker diarization is the task of inferring "who spoke when" in an audio stream and is an essential step for facilitating the search and the indexing of ...
Located in Research / Projects
- Universal Spoken Term Detection with Deep Learning
last modified Apr 21, 2026 10:28 PM
The overwhelming majority of state-of-the-art ASR systems have followed the same path for about thirty years. The speech signal is first transformed into carefully ...
Located in Research / Projects
- Probabilistic Motifs for Video Action Recognition
last modified Apr 23, 2026 06:38 AM
Action recognition is key for many tasks, such as automatic annotation of videos, improved human-computer interaction, and guidance in monitoring public spaces. ...
Located in Research / Projects
- Domain Adaptation Using Sub-Space Models
last modified Apr 22, 2026 02:32 PM
This is a proposal in the area of acoustic modelling for automatic speech recognition (ASR). Current approaches to ASR are based on hidden Markov models ...
Located in Research / Projects
- ASR-Ktunes
last modified Apr 21, 2026 11:36 AM
The goal of this project is to build a system for managing recorded video files, storing them, and transmitting them to a module ...
Located in Research / Projects
- YouBlog
last modified Apr 21, 2026 11:36 AM
The goal of the YouBlog project is to deliver a proof of concept for automatic transcription of audio/video blogs as part of the IM2-Idiap start-up ‘Koemei’. The ...
Located in Research / Projects
- Cross-Lingual Adaptation for Text to Speech Synthesis (CLAS3)
last modified Apr 21, 2026 11:36 AM
Recent advances in statistical text to speech synthesis (TTS) have enabled voice personalization via the adaptation techniques normally associated with ...
Located in Research / Projects