As artificial intelligence (AI) becomes more deeply embedded in everyday tech, from smartphones to medical devices, the demand for models that are not just powerful but also efficient and lightweight is rising fast. This is especially important in settings where computing resources are limited. To meet this challenge, Idiap researchers Mutian He and Philip Garner have developed a new agile method.
Audio Inference
Group News
Every year, the Institute nominates two students for its internal awards. In 2022, the Paper Award goes to Alexandre Bittar, and the Student Award goes to Teguh Lembono. Congratulations!
Idiap researchers published a paper describing an approach to speech processing based on the properties of the human brain. Their method proved as efficient as the current standard, whilst conserving the advantage of energy efficiency. Moreover, their work is replicable thanks to open access software paving the way for future applications.
Group Job Openings
Our group is regularly posting job openings ranging from internships to researcher positions. To check the opportunities currently available or to submit a speculative applications use the link below.Our group is regularly posting job openings ranging from internships to researcher positions. To check the opportunities currently available or to submit a speculative applications use the link below.
Current Group Members

GARNER, Philip
(Senior Research Scientist)
- website

AKSTINAITE, Vita
(Postdoctoral Researcher)
- website

HE, Mutian
(PhD Student / Research Assistant)
- website

CHEN, Haolin
(PhD Student / Research Assistant)
- website

RIMOLDI, Emanuele
(Research Intern)

KOVALENKO, Sophia
(Research Intern)
Alumni
Please note that this list is not exhaustive.
Current Projects
Recent Projects
- ADEL - Automatic Detection of Leadership from Voice and Body
- DAHL - DAHL: Domain Adaptation via Hierarchical Lexicons
- DEEPCHARISMA - Deep Learning Charisma
- EVOLANG - Evolving Language Phase 1
- L-PASS - Linguistic-Paralinguistic Speech Synthesis
- MASS - Multilingual Affective Speech Synthesis
- NAST - Neural Architectures for Speech Technology
- NATAI - The Nature of Artificial Intelligence
- NMTBENCHMARK - Training and Benchmarking Neural MT and ASR Systems for Swiss Languages
- SIWIS - Spoken Interaction with Interpretation in Switzerland
- SP2 - SCOPES Project on Speech Prosody
- V-FAST - Vocal-tract based Fast Adaptation for Speech Technology