Education in AI

Knowledge transfer is an integral part of Idiap's activities. Alongside research and technology transfer, education is one of the institute's three core missions. We fulfil this commitment through courses taught at various higher-education institutions, continuing education for companies, and science outreach activities for young people.




Unless otherwise indicated, the projects listed below are suitable for master's or diploma projects.

Social media and crowdsourcing for social good

Project Summary: The student will contribute to a multidisciplinary initiative for the use of social media and mobile crowdsourcing for social good. Several projects are available. Specific topics include:

* Social media analytics
* Visualization of social and crowdsourced data
* Smartphone apps for mobile crowdsourcing

Students will be working with social computing researchers studying European and developing cities.

Contact: Prof. Daniel Gatica-Perez


A human-centered approach to understand local news consumption

Project Summary: The goal of the project is to design and implement a framework to study the consumption of local news in the Swiss multicultural context. The project will include a combination of research methods for experimental design and data analysis, and will be done in the context of the AI4Media European project (A European Excellence Center for Media, Society, and Democracy). The main tasks of the project include: literature review; identification of local news sources; mixed-method experimental design; experiments and data analysis; and writing.

Contact: Prof. Daniel Gatica-Perez


Swiss Alpine Lakes & Citizen Science

Project Summary: In the context of the CLIMACT UNIL-EPFL initiative, we aim at cataloging all Swiss Alpine Lakes located above 2000 meters, including collection of water samples and in-depth analysis of their microbial diversity, with a citizen science approach to engage citizens and increase awareness regarding environmental conservation. The project will build upon existing data sources (including Wikipedia and government data), and computational tools including natural language processing, computer vision, and visualization to build online interactive functionalities that can be used as part of citizen science activities.

Contact: Prof. Daniel Gatica-Perez

Privacy-preserving machine learning methods for diversity-aware mobile computing

Project Summary: The goal of the project is to study privacy-preserving machine learning methods in the context of mobile, diversity-aware computing systems that support the local needs of communities. The project will include work in machine learning and mobile data analysis, and will be done in the context of the multidisciplinary WeNet European project. The main tasks of the project include: literature review; algorithm design and implementation; experiments and data analysis; and writing.

Contact: Prof. Daniel Gatica-Perez


A FATE framework for diversity-aware mobile computing

Project Summary: This project will study and propose a methodology to characterize and validate machine learning methods in the context of diversity-aware mobile computing from the FATE perspective (fairness, accountability, transparency, ethics). Recent approaches include Google’s model cards for model reporting and Microsoft’s Guidelines for Human-AI Interaction. The project will provide a set of best practices for this domain. The main tasks of the project include: literature review; method design and implementation; experiments and data analysis; and writing.

Contact: Prof. Daniel Gatica-Perez


The European AI Act and its impact on European cities

Project Summary: The April 2021 proposal by the European Commission on AI regulation (the AI Act) will impact many sectors of the economy and have important societal implications. This project will study this proposal, analyze its possible effects on how European cities use AI as part of their mission, and make recommendations for the future. The project will be done in the context of the multidisciplinary ICARUS European project, which involves a number of actors in cities and non-governmental organizations. The main tasks of the project include: literature review; conceptual analysis; data collection; data analysis; and writing.

Contact: Prof. Daniel Gatica-Perez


Robot Learning and Interaction

Project Summary: The Robot Learning and Interaction Group proposes various Master and Semester projects with topics related to robotics, machine learning, adaptive control and human-robot interaction.

The list of projects is available here.

Supervisor: Dr. Sylvain Calinon

Keywords: robotics, machine learning, adaptive control, human-robot interaction


Human-Robot Interfaces for Interactive Robot Programming

Project Summary: For robots to be widely adopted across industries and beyond structured manufacturing environments, it is critical for them to be programmable by a wide range of users. Growing research on End User Programming (EUP) for robotics aims to address this problem with novel user interfaces, programming languages, and techniques to aid or fully automate robot programming.

In this project, you will design a Human-Robot Interface for Robot Programming, integrating an existing programming framework based on iterative Linear Quadratic Regulator (iLQR) [1] on a robotic manipulator, the FRANKA EMIKA Panda. The interface will allow users to compose robot programs defined as sequences of components like goal poses or constraints (e.g., maintaining a gripper orientation) which, in turn, inform the generation of executable robot trajectories.

For more details, see the following document
Supervisors: Dr. Jean-Marc Odobez, Dr. Sylvain Calinon
Advisors and point of contact: Dr. Mattia Racca


6D Pose Estimation for Robotic Manipulation Tasks

Project Summary: For robots to interact with their environment in a safe and efficient manner, they need robust ways of estimating the pose (location, orientation) of the objects to be manipulated. This is usually done with RGB-D cameras as input sensors, i.e. cameras that provide not only the color of each pixel (the RGB part) but also how far it is from the camera (the D for “depth”).

Most recent methods rely on deep learning architectures for this estimation, combining convolutional neural networks and fully connected layers.

In this semester project, you will use referenced methods (Wang et al 2019, He et al 2020 & 2021), testing their performance and usability on the available datasets. In particular, you will take advantage of the YCB Object Dataset, a popular robotics dataset used to evaluate 6D pose estimation techniques (Calli et al, 2015). This dataset, available in the lab, consists of real-life items and their corresponding 3D models (in the form of point clouds or textured meshes). You will then integrate the methods in a lab setup, where the input images come, in real time, from an RGB-D camera (Intel RealSense D415, Azure Kinect). Finally, you will showcase the robustness of the methods by having the estimated 6D pose of YCB items used as input for a robot grasping task. For more details about the tasks and goals, see the following link:

Supervisor: Dr. Jean-Marc Odobez
Advisors and point of contact: Anshul Gupta (research assistant), Dr. Mattia Racca


Interaction Manager for Human-Robot Interactions

Project Summary: This project aims to provide an easy way to create dialogues for human-robot interactions. For example, a robot could introduce itself when someone looks at it, tell a joke if the person asks for one, and then use the person’s laughter to learn whether the joke was a good one. In this project (semester or master), you will use RASA, a commercial dialogue management system used in many chatbots on the web to handle turn-taking discussions. You will interface this system with a simulated robot as well as a real Pepper robot to create scenarios that allow people to interact with Pepper in different ways and ensure that Pepper’s responses are appropriate both verbally and non-verbally. For this, you will need to use Pepper’s sensors to make sense of the world and understand people’s speech, relay this information to the dialogue manager in RASA, and process the outputs of the dialogue manager to create real robot behaviors (speech and gestures).

For more details, see the following link to pdf
Supervisors: Dr. Jean-Marc Odobez, Dr. Emmanuel Senft
Point of contact: Dr. Emmanuel Senft



Bachelor's courses

Idiap researchers are involved in several bachelor's courses, listed below.


Introduction à l'apprentissage machine (EE-311) Lecturer(s): Michael Liebling

This course gives an overview of machine learning techniques, covering the algorithms, the theoretical formalism, and the experimental protocols.

Where: EPFL Language: French


Urban Thermodynamics (CIVIL-309) Lecturer(s): Jérôme Kämpf (Khovalyg Dolaana)

This course introduces the analysis of urban areas from a thermodynamics perspective, considering the heat exchange between different urban elements (buildings, vegetation, water surfaces, ground, and environment). Urban heat island effect and outdoor comfort topics are also discussed.

Where: EPFL Language: English



Master's courses

Idiap researchers are involved in several master's courses, listed below by institution.



Automatic Speech Processing (EE-554) Lecturer(s): Mathew Magimai Doss

The goal of this course is to provide the students with the main formalisms, models and algorithms required for the implementation of advanced speech processing applications (involving, among others, speech coding, speech analysis/synthesis, and speech recognition).

Language: English


Computational Social Media (DH-500) Lecturer(s): Gatica-Perez Daniel

The course integrates concepts from media studies, machine learning, multimedia and network science to characterize social practices and analyze content in sites like Facebook, Twitter and YouTube. Students will learn computational methods to infer individual and networked phenomena in social media.

Language: English


Image processing II (MICRO-512) Lecturer(s): Liebling Michael, Sage Daniel, Unser Michaël, Van De Ville Dimitri Nestor Alice

Study of advanced image processing; mathematical imaging. Development of image-processing software and prototyping in Java; application to real-world examples in industrial vision and biomedical imaging.

Language: English


Genomics and bioinformatics (BIO-463) Lecturer(s): Luisier Raphaëlle & Jacques Rougemont

This course covers various data analysis approaches associated with applications of DNA sequencing technologies, from genome sequencing to quantifying gene expression, transcription factor binding and chromosome conformation.

Language: English



Idiap & UniDistance

Practical Course in Linear Algebra and Probability (M01) Lecturer(s): Théophile Gentilhomme, Ina Kodrasi
Language: English


Data structure and algorithms for AI (M02) Lecturer(s): Olivier Bornet
Language: English


Signal Processing (M03) Lecturer(s): Michael Liebling
Language: English


Foundations in statistics for AI (M04) Lecturer(s): Phil Garner, Ina Kodrasi
Language: English


Open Science and Ethics (M05) Lecturer(s): Sébastien Marcel, André Anjos
Language: English


Fundamentals in Machine Learning 1 (M06) Lecturer(s): Sébastien Marcel, André Anjos, Andre Freitas, Jean-Marc Odobez
Language: English


Introduction to Image Processing and Computer Vision (M07) Lecturer(s): Michael Liebling, Jean-Marc Odobez
Language: English


Fundamentals in Machine Learning 2 (M08) Lecturer(s): Sébastien Marcel, André Anjos, Andre Freitas, Jean-Marc Odobez
Language: English


Introduction to Speech Processing (M09) Lecturer(s): Mathew Magimai Doss
Language: English


Deep Learning (M10) Lecturer(s): Olivier Canévet
Language: English


Biometrics (A01) Lecturer(s): Sébastien Marcel
Language: English


Multimodal Computational Sensing of People (A02) Lecturer(s): Jean-Marc Odobez
Language: English


Natural Language Processing (A03) Lecturer(s): James Henderson
Language: English


Robotics (A04) Lecturer(s): Sylvain Calinon
Language: English


AI Company Strategy and Project Definition (P01) Lecturer(s): Olivier Bornet
Language: English


AI Project Development (P02) Lecturer(s): Olivier Bornet
Language: English



Université de Lausanne

Biometrics (School of Criminal Justice (ESC)) Lecturer(s): Sébastien Marcel

This course introduces the analysis, modelling and interpretation of biometric data for biometric person recognition, forensic biometrics, cybersecurity and behavioural biometrics in man-machine communication.

Language: French




Human-Robot Interaction and Collaborative Robotics (M4C1) Lecturer(s): Sylvain Calinon

This course presents the use of artificial intelligence and machine learning techniques in human-robot interaction applications. In particular, it will focus on techniques to transfer skills by demonstration, inspired by imitation mechanisms to teach new skills to robots with an intuitive interface for the end-user.

Language: English


Statistical, geometrical and dynamical representations of movement (M2C7) Lecturer(s): Sylvain Calinon

This course will present various ways of representing movement data and gestures in a mathematical manner, with the goal of analyzing, compressing or generating movements. Several examples of applications will be covered, from generation of manipulation skills in robotics to the analysis of motion capture data.

Language: English



Doctoral courses

Electrical Engineering Doctoral program EPFL

Perception and learning from multimodal sensors (EE-623) Lecturer(s): Odobez Jean-Marc

The course will cover different aspects of multimodal processing (complementarity vs redundancy; alignment and synchrony; fusion), with an emphasis on the analysis of people, behaviors and interactions from multimodal sensors, using statistical models and deep learning as main modeling tools.

Where: EPFL Language: English


Digital Speech and Audio Coding (EE-719) Lecturer(s): Magimai Doss Mathew, Motlicek Petr

The goal of this course is to introduce engineering students to state-of-the-art speech and audio coding techniques, with an emphasis on the integration of knowledge about sound production and auditory perception through signal processing techniques.

Where: EPFL Language: English


Fundamentals in statistical pattern recognition (EE-612) Lecturer(s): Anjos André, Marcel Sébastien, Canévet Olivier

This course provides PhD students with an in-depth understanding of the most fundamental algorithms in statistical pattern recognition, as well as concrete tools (as source code) for their work. It will cover regression, classification (MLP, SVM) and probability distribution modeling (k-Means, GMM, HMM).

Where: EPFL Language: English


Machine Learning for Engineers (EE-613) Lecturer(s): Calinon Sylvain, Odobez Jean-Marc

The objective of this course is to give an overview of machine learning techniques used for real-world applications, and to teach how to implement and use them in practice. Laboratories will be done in Python using Jupyter notebooks.

Where: EPFL Language: English


Deep Learning For Natural Language Processing (EE-608) Lecturer(s): James Henderson

The Deep Learning for NLP course provides an overview of neural network based methods applied to text. The focus is on models particularly suited to the properties of human language, such as categorical, unbounded, and structured representations, and very large input and output vocabularies.

Where: EPFL Language: English


Thanks to an agreement with EPFL's EDEE and EDIC doctoral programs, we fund and supervise a large number of doctoral students. The institute also hosts international master's students and interns. Positions currently open at Idiap are listed on our careers page.



Advanced courses

Idiap researchers regularly give tutorials at conferences and summer schools.


Vision-Language Pretraining: Current Trends and the Future

by Damien Teney and colleagues

The goal of this ACL tutorial is to give an overview of the ingredients needed for working on multimodal problems, particularly vision and language. We will also discuss some of the open problems and promising future directions in this area.



Continuing education

The institute is convinced that professionals must be able to take courses to improve their skills in artificial intelligence technologies. Idiap offers cutting-edge training through its own Master's courses in artificial intelligence, listed above. On request and subject to availability, Idiap also offers courses and talks to teachers' associations and similar groups to keep them informed of the latest technological advances.