News | Jobs | Team | Grants

Speech & Audio Processing

Speech processing has been one of the mainstays of Idiap’s research portfolio for many years. Today it is still the largest group within the institute, and Idiap continues to be recognised as a leading proponent in the field.

The expertise of the group encompasses statistical automatic speech recognition—based on hidden Markov models, or hybrid systems exploiting connectionist approaches—, text-to-speech, and generic audio processing, covering sound source localization, microphone arrays, speaker diarization, audio indexing, very low bit-rate speech coding, and perceptual background noise analysis for telecommunication systems.

Group News

Postdoc Ajinkya Kulkarni Receives Jury Distinction at SNSF Scientific Image Competition 2026

research — Apr 28, 2026

Ajinkya Kulkarni, postdoctoral researcher, has received a jury distinction at the 2026 SNSF Scientific Image Competition, recognizing both the scientific relevance and visual impact of his work.

SDialog: Idiap's Open-Source Toolkit for Reproducible Conversational AI

research — Apr 24, 2026

Building reliable conversational AI systems—such as chatbots or virtual assistants like Siri or Alexa—is more challenging than it may initially appear. Although modern LLMs have improved dramatically in capability, researchers and developers still contend with a fragmented ecosystem. Datasets are often stored in incompatible formats, evaluation methodologies lack consistency, and reproducibility remains limited across studies and implementations.

New SNSF, EU and regional funding secured

research — Apr 13, 2026

The Institute has secured significant new competitive funding at national and European levels, reaffirming the excellence of its research in a challenging funding environment.

Idiap’s postdoc receives BRIDGE Proof of Concept grant for audio deepfake detector

research — Mar 25, 2026

Ajinkya Kulkarni, a postdoctoral researcher at Idiap, has been awarded an SNSF Bridge Proof of Concept grant. His project addresses one of the fastest-growing threats in digital security: voice fraud. Audio deepfakes, AI-generated recordings that convincingly mimic real voices, are already being used for identity fraud, investment scams, and political disinformation.

Idiap researchers are Lemanic Life Science Hackathon winners

institute — Jun 05, 2024

Tilak Purohit and Barbara Ruvolo, contributing to Idiap’s AI for Life research program, won the 1st prize at the Lemanic Life Science Hackathon organized at EPFL at the end of April 2024. The team also included EPFL Life Science bachelor's students Alexandra Psaltis, Jia Xian Jennifer Shan, and Elise Boyer Their outcome is an AI-enabled user interface prototype to support the detection of depression via speech.

More News

Group Job Openings

Our group is regularly posting job openings ranging from internships to researcher positions. To check the opportunities currently available or to submit a speculative applications use the link below.

Other Jobs