Idiap on LinkedIn Idiap youtube channel Idiap on Twitter Idiap on Facebook
Personal tools
You are here: Home Research Resources

Resources

Software

Title Description
IdiapResource ACT ACT for Accuracy of Connective Translation is a reference-based metric to measure the accuracy of discourse connective translation, mainly for statistical machi...
IdiapResource BOB Bob is a free signal-processing and machine learning toolbox developed by the Biometrics group at Idiap Research Institute, Switzerland. The toolbox is written ...
IdiapResource DiscoConn-Classifier Classifier models and feature extractors for discourse relations
IdiapResource Exact Acceleration of Linear Object Detectors We describe a general and exact method to considerably speed up linear object detection systems operating in a sliding, multi-scale window fashion, such as the ...
IdiapResource Face Color Model This page contains the source code and data needed to train and use a model for skin, hair, clothing and background color modelling and segmentation.
IdiapResource facereclib - The Face Recognition Library This library is designed to perform a fair comparison of face recognition algorithms. It contains scripts to execute various kinds of face recognition experimen...
IdiapResource HEAT Image Retrieval System HEAT is an image retrieval web-application that is intended for large unstructured collections of images without semantic annotations. The system implements a n...
IdiapResource HTS-VTLN This software is a patch to HMM based statistical parametric speech synthesis toolkit (HTS 2.2).
IdiapResource ISS The Idiap Speech Scripts (ISS) is a collection of speech databases and dictionaries, and for training and testing of models for ASR. The scripts in turn are ...
IdiapResource Juicer Juicer is a Weighted Finite State Transducer (WFST) based decoder for Automatic Speech Recognition (ASR).
IdiapResource mash-simulator mash-simulator is a 3D simulator for Linux and MacOS where a robot must complete a certain number of tasks in different randomized environments.
IdiapResource ML3 ML3 is an open source implementation of the Multiclass Latent Locally Linear Support Vector Machine algorithm, a multi-class local classifier based on a latent ...
IdiapResource MSER Linear time Maximally Stable Extremal Regions (MSER) implementation as described in D. Nistér and H. Stewénius, "Linear Time Maximally Stable Extremal Regions...
IdiapResource Probabilistic Models: temporal topic models and more Topic models such as Latent Dirichlet Allocation (LDA) have been used successfully in many domains for data mining. Originally designed for text documents, the...
IdiapResource SLOG - Similarity Learning on Graph SLOG contains implementation of similarity learning methods over relational data, where the relation between data points are given explicitly
IdiapResource Speaker Diarization Toolkit The toolkit is intended to facilitate research in multistream speaker diarization providing a platform for research in novel audio, video or location features. ...
IdiapResource SSP SSP stands for Speech Signal Processing. It is a fairly small package written in python. Its functionality is similar to tracter, with some overlap and some a...
IdiapResource Tasting Families of Features for Image Classification Please find below the code necessary to reproduce the experiments of the paper "Tasting Families of Features for Image Classification" under the GPL v2 license....
IdiapResource The Multi-Tracked Paths This is an implementation of the variant of KSP for tracking presented in (Berclaz et al. 2011). You can get more information and the reference implementation f...
IdiapResource Torch Statistical machine learning library containing most of the state-of-the-art algorithms. Written in Lua and C, the library is distributed under a BSD license.
IdiapResource Torch3vision Common software library for computer vision with machine learning algorithms. Written in simple C++, this library is based on Torch and distributed under a BSD...
IdiapResource Tracter Tracter is a data flow framework.
IdiapResource Webvalidation This software is a multi users, multi projects web annotation tool that help to organize the process of validating automatically generated transcriptions.
IdiapResource xbob.spkrec Speaker recognition library including feature extraction, background training, client enrolment, and score computation
IdiapResource xbob.thesis.elshafey2014 This package contains scripts to reproduce the experiments of Laurent El Shafey's Ph.D. thesis at Ecole Polytechnique Fédérale de Lausanne (EPFL).

Database

Title Description
IdiapResource 3DMAD The 3D Mask Attack Database (3DMAD) is a biometric (face) spoofing database. It currently contains 76500 frames of 17 persons, recorded using Kinect for both re...
IdiapResource AV16.3: an Audio-Visual Corpus for Speaker Localization and Tracking The AV16.3 corpus is an audio-visual corpus of real indoor multispeaker data, designed to test algorithms for audio-only, video-only and audio-visual speaker lo...
IdiapResource Biometric resources Find useful protocols, annotations, etc. that are provided to help encourage reproducible research.
IdiapResource Canal 9 Political Debates The Canal 9 political debate corpus is a collection of 72 political debates recorded by the Canal 9 local TV station and broadcast in Valais, Switzerland.
IdiapResource CCC - Cursive Character Challenge This is the home page of Cursive Character Challenge (C-Cube), the new benchmark for machine learning and pattern recognition algorithms. The database contains ...
IdiapResource Disco-Annotation Disco-Annotation is a collection of training and test sets with manually annoted discourse relations for 8 English discourse connectives in europarl texts.
IdiapResource DOME - DOminance in MEetings Dataset A Multimodal Corpus for studying dominance in small group conversations
IdiapResource EYEDIAP The EYEDIAP dataset was designed to train and evaluate gaze estimation algorithms from RGB and RGB-D data. It contains a diversity of participants, head poses, ...
IdiapResource Hand Posture and Gesture Datasets This webpage provides several benchmark databases for hand posture and hand gesture recognition.
IdiapResource Head Pose Database The objective was to construct a video database allowing to perform quantitative evaluation of algorithms extracting information related to the head pose of peo...
IdiapResource InteractPlay Dataset InteractPlay Dataset is a hand gesture database made of a 3D hand trajectories. It contains 16 hand gestures from 22 persons and provides 5 sessions and 10 reco...
IdiapResource Mediaparl Mediaparl is a Swiss accented bilingual database containing recordings in both French and German as they are spoken in Switzerland
IdiapResource Mobio The MOBIO database currently consists of 152 people (audio and video samples) with 12 sessions each.
IdiapResource Multichannel Overlapping Numbers Corpus (MONC) This corpus was collected by playing the original Numbers corpus (a speech corpus collected and distributed by CSLU at OGI) through loudspeakers in a meeting ro...
IdiapResource PRINT-ATTACK The Print-Attack Database consists of video samples of spoofing attacks using printed photos to 50 identities under different lighting conditions.
IdiapResource Speechdat - FIXED1SF This database comprises telephone recordings from 1000 speakers recorded directly over the fixed PSTN using an ISDN interface.
IdiapResource Speechdat - FIXED1SZ 2000 swiss-german speakers recorded over the SwissNet. They follow a protocole made up of 41 items (digits, words, numbers,sentences,..). An orthographical and ...
IdiapResource Speechdat - VERIF1SF 20 swiss-french people recorded 50 times overs the swiss telephone network follow a protocole made up of 54 items. (digits, words, numbers,sentences,..). An ort...
IdiapResource swiss-french-polyphone 4500 swiss-french speakers recorded over the SwissNet. They follow a protocole made up of 38 items. (digits, id number, natural numbers, money amount, names, wo...
IdiapResource swiss-french-polyvar Telephone recordings from about 143 swiss-french speakers. Each speaker recorded between 1 and 225 sessions. Each recording is made up of 55 items. (words, sent...
IdiapResource swiss-german-polyphone 4000 swiss-german speakers recorded directly over the telephone line, which various type of phones. They follow a protocole made up of 46 items. (digits,id numb...
IdiapResource TA2 The TA2 database consists of high-definition, simultaneous A/V recordings and annotations from two separate rooms, where the participants play games and communi...
IdiapResource The Replay-Attack Database The Replay-Attack Database for face spoofing consists of 1300 video clips of photo and video attack attempts to 50 clients, under different lighting conditions....
IdiapResource Two-Handed Datasets This database consists of different two-handed gestures (rotations in all the 6 directions and a "push" gesture).
IdiapResource wolf corpus The wolf corpus is an audio-visual data set containing around 81 hours of conversational data among groups of 8-12 people playing a role playing game.
Document Actions