a dataset for mobile face and speaker recognition

Get Data

Video 1 (1586 Ko)
Video 2 (486 Ko)


The MOBIO database consists of bi-modal (audio and video) data taken from 150 people. The database has a female-male ratio or nearly 1:2 (99 males and 51 females) and was collected from August 2008 until July 2010 in six different sites from five different countries. This led to a diverse bi-modal database with both native and non-native English speakers.

In total 12 sessions were captured for each client: 6 sessions for Phase I and 6 sessions for Phase II. The Phase I data consists of 21 questions with the question types ranging from: Short Response Questions, Short Response Free Speech, Set Speech, and Free Speech. The Phase II data consists of 11 questions with the question types ranging from: Short Response Questions, Set Speech, and Free Speech. A more detailed description of the questions asked of the clients is provided below.

The database was recorded using two mobile devices: a mobile phone and a laptop computer. The mobile phone used to capture the database was a NOKIA N93i mobile while the laptop computer was a standard 2008 MacBook. The laptop was only used to capture part of the first session, this first session consists of data captured on both the laptop and the mobile phone.

Detailed Description of Questions

Please note that the answers to the Short Response Free Speech and Free Speech questions DO NOT necessarily relate to the question as the sole purpose is to have the subject speaking free speech, therefore, the answers to ALL of these questions are assumed to be false.

1. Short Response Questions

The short response questions consisted of five pre-defined questions, which were:

  • What is your name? – the user supplies their fake name
  • What is your address? – the user supplies their fake address
  • What is your birthdate? – the user supplies their fake birthdate
  • What is your license number? – the user supplied their fake ID card number (the same for each person)
  • What is your credit card number? – the user supplies their fake Card number

2. Short Response Free Speech

There were five random questions taken form a list of 30-40 questions. The user had to answer these questions by speaking for approximately 5 seconds of recording (sometimes more and sometimes less).

3. Set Speech

The users were asked to read pre-defined text out aloud. This text was designed to take longer than 10 seconds to utter and the participants were allowed to correct themselves while reading these paragraphs.
The text that was read was:

  • I have signed the MOBIO consent form and I understand that my biometric data is being captured for a database that might be made publicly available for research purposes.
  • I understand that I am solely responsible for the content of my statements and my behaviour.
  • I will ensure that when answering a question I do not provide any personal information in response to any question.

4. Free Speech

The free speech session consisted of 10 random questions from a list of approximately 30 questions. The answers to each of these questions took approximately 10 seconds (sometimes less and sometimes more).


All publications that report on research that use the Corpus will acknowledge the MOBIO database by referring to the following publication:

Elie Khoury, Laurent El-Shafey, Christopher McCool, Manuel Günther, Sébastien Marcel, “Bi-modal biometric authentication on mobile phones in challenging conditions”, Image and Vision Computing Volume 32, Issue 12, 2014.