Speech & Audio Processing
Current Group Members
MAGIMAI DOSS, Mathew
(Senior Research Scientist)
- website
MOTLICEK, Petr
(Senior Research Scientist)
- website
GARNER, Philip
(Senior Research Scientist)
- website
MADIKERI, Srikanth
(Research Associate)
- website
VLASENKO, Bogdan
(Research Associate)
- website
VILLATORO TELLO, Esaú
(Research Associate)
- website
HOVSEPYAN, Sevada
(Research Associate)
- website
MURALIDHAR, Skanda
(Research Associate)
- website
TORNAY, Sandrine
(Postdoctoral Researcher)
- website
HERMANN, Enno
(Postdoctoral Researcher)
- website
RANGAPPA, Pradeep
(Postdoctoral Researcher)
- website
BHATTACHARJEE, Mrinmoy
(Postdoctoral Researcher)
- website
SANCHEZ-CORTES, Dairazalia
(Postdoctoral Researcher)
- website
PRASAD, Amrutha
(PhD Student / Research Assistant)
- website
ZULUAGA GOMEZ, Juan Pablo
(PhD Student / Research Assistant)
- website
BITTAR, Alexandre
(PhD Student / Research Assistant)
- website
SARKAR, Eklavya
(PhD Student / Research Assistant)
- website
THORBECKE (NIGMATULINA), Iuliia
(PhD Student / Research Assistant)
- website
COPPIETERS DE GIBSON, Louise
(PhD Student / Research Assistant)
- website
TARIGOPULA, Neha
(PhD Student / Research Assistant)
- website
EL HAJAL, Karl
(PhD Student / Research Assistant)
- website
PUROHIT, Tilak
(PhD Student / Research Assistant)
- website
CHEN, Haolin
(PhD Student / Research Assistant)
- website
BURDISSO, Sergio (Gastón)
(R&D / Research Assistant)
HE, Mutian
(PhD Student / Research Assistant)
- website
KUMAR, Shashi
(PhD Student / Research Assistant)
- website
VIDAL, Maxime
(Research Assistant)
- website
MOSTAANI, Zohreh
(PhD Student / Research Assistant)
- website
KHALIL, Driss
(Junior R&D / Research Assistant)
- website
NADERI, Maryam
(AI Master Student)
- website
MUSCAT, Amanda
(Research Intern)
- website
RUVOLO, Barbara
(Research Intern)
- website
SANCHEZ LARA, Alejandra
(Research Intern)
- website
Alumni
Nothing to list
Current Projects
- TIPS - Towards Integrated processing of Physiological and Speech signals
- SMILE-II - SMILE-II Scalable Multimodal sign language technology for sIgn language Learning and assessmEnt Phase-II
- EVOLANG - Evolving Language
- CRITERIA - Comprehensive data-driven Risk and Threat Assessment Methods for the Early and Reliable Identification, Validation and Analysis of migration-related risks
- EMIL - Emotion in the loop – a step towards a comprehensive closed-loop deep brain stimulation in Parkinson’s disease
- IICT - Inclusive Information and Communication Technologies
- TRACY - A big-data analyTics from base-stations Registrations And Cdrs e-evidence sYstem
- EPOC - A personalized speech recognition framework for audio messaging on the edge
- PASS - PaSS - Pathological Speech Synthesis
- EUROCONTROL - Integrate the Automatic Speech Recognition system with eDEP, ESCAPE and audiolan
- ELOQUENCE - ELOQUENCE: Multilingual and Cross-cultural interactions for context-aware, and bias-controlled dialogue systems for safety-critical applications
Recent Projects
- TA2 - Together Anywhere, Together Anytime
- SCALE - Speech Communication with Adaptive Learning
- IM2-3 - Interactive Multimodal Information Management Phase 3
- EMIME - Effective Multilingual Interaction in Mobile Environments
- MULTI08 - Multimodal Interaction and Multimedia Data Mining
- VEOVOX - VeoVox: Voice-Controlled Order-Taking System for Restaurants
- AMIDA - Augmented Multi-party Interaction with Distance Access
- ICS-2010 - Interactive Cognitive Systems
- SESAME - SEarching Swiss Audio MEmories
- DM3 - Distributed MultiModal Media server, a low cost large capacity high throughput data storage system
- TA2-EEU - Together Anywhere, Together Anytime - Enlarged European Union
- MULTI08EXT - Multimodal Interaction and Multimedia Data Mining
- AMSP - Auditory-motivated signal processing and applications to robust speech enhancement and recognition
- TAO-CSR - Task Adaptation and Optimisation for Conversational Speech Recognition
- INEVENT - Accessing Dynamic Networked Multimedia Events
- RODI - Role based speaker diarization
- DAUM - Domain Adaptation Using Sub-Space Models
- CLAS3 - Cross-Lingual Adaptation for Text to Speech Synthesis (CLAS3)
- DBOX - D-Box: A generic dialog box for multilingual conversational applications
- ADDG2SU - Flexible Acoustic Data-Driven Grapheme to Subword Unit Conversion
- DIMHA - Diarizing Massive Amounts of Heterogeneous Audio
- PANDA - Perceptual Background Noise Analysis for the Newest Generation of Telecommunication Systems
- FLEXASR - Flexible Grapheme-Based Automatic Speech Recognition
- SIIP - Speaker Identification Integrated Project
- ROCKIT - Roadmap for Conversational Interaction Technologies
- DAUM2012 - Domain Adaptation Using Sub-Space Models
- MULTIVEO - High Accuracy Speaker-Independent Multilingual Automatic Speech Recognition System
- PHASER - PHASER: Parsimonious Hierarchical Automatic Speech Recognition
- GENEEMO - Geneemo: An Expressive Audio Content Generation Tool
- SCOREL2 - Automatic scoring and adaptive pedagogy for oral language learning
- AAMASSE - Acoustic Model Adaptation toward Spontaneous Speech and Environment
- UNITS - Unified Speech Processing Framework for Trustworthy Speaker Recognition
- SHISSM - Sparse and hierarchical Structures for Speech Modeling
- RECAPP - Making speech technology accessible to Swiss people
- DEEPSTD-EXT - Universal Spoken Term Detection with Deep Learning (extension)
- SMILE - Scalable Multimodal sign language Technology for sIgn language Learning and assessmEnt
- BIOWATCH - Biowatch
- MUMMER - MultiModal Mall Entertainment Robot
- SUMMA - Scalable Understanding of Multilingual Media
- MALORCA - Machine Learning of Speech Recognition Models for Controller Assistance
- ESGEM - Enhanced Swiss German mEdia Monitoring
- ADDG2SU_EXT - Flexible Acoustic data-driven Grapheme to Subword Unit Conversion
- ELEARNING-VALAIS_3.0 - eLearning-Valais 3.0
- FLOSS - Flexible Linguistically-guided Objective Speech aSessment
- PHASER-QUAD - Parsimonious Hierarchical Automatic Speech Recognition and Query Detection
- MEGANEPRO - Myo-Electricity, Gaze and Artificial Intelligence for Neurocognitive Examination and Prosthetics
- COBALT - Content Based Call Filtering
- MOSPEEDI - MoSpeeDi. Motor Speech Disorders: characterizing phonetic speech planning and motor speech programming/execution and their impairments
- TAPAS - Training Network on Automatic Processing of PAthological Speech
- SARAL - Summarization and domain-Adaptive Retrieval of Information Across Languages
- SHAPED - SHAPED: Speech Hybrid Analytics Platform for consumer and Enterprise Devices
- MPM - Multimodal People Monitoring
- DEVEL-IA - Formation « Développeurs spécialisés en Intelligence Artificielle » selon le modèle de formation continue duale postgrade
- REAPPS - Reinforced audio processing via physiological signals
- AI4EU - A European AI On Demand Platform and Ecosystem
- ROXANNE - Real time network, text, and speaker analytics for combating organized crime
- ATCO2 - Automatic collection and processing of voice data from air-traffic communications
- HAAWAII - Highly Automated Air Traffic Controller Workstations with Artificial Intelligence Integration
- CMM - Conversation Member Match
- STARFISH - STARFISH: Safety and Speech Recognition with Artificial Intelligence in the Use of Air Traffic Control
- WAVE2-96 - H2020-SESAR-PJ.10-W2-Solution 96
- JOGGL - jöggl (töggl for juristic applications)