Multimodal Processing and Recognition

Electronic recording devices (cameras, microphones, EEGs, etc.) are proliferating, while processing speed, storage, and bandwidth keep increasing and getting cheaper. Together with advances in automatically extracting and managing the information these devices record (such as speech recognition and face tracking), this makes it increasingly feasible to capture the same sequence of events (for example, a meeting) with several devices simultaneously, generating richer and more robust sets of feature streams. The goals of IM2.MPR are to efficiently model such multi-channel data, which yields multiple observation streams, and to use the resulting models in real applications.

The main objectives of this IP are thus three-fold:

  • investigate fundamental aspects of multi-channel/multi-stream processing
  • continue more applied research on several tasks for multi-stream/multi-channel processing, including tracking, audio-visual speech recognition, person identification, segmentation, 3D scene reconstruction, and activity recognition
  • identify possible additional modalities (such as infra-red, laser, and various other sensors)
Partners

    Idiap, ITS/EPFL, LIDIAP/EPFL, CVML/UniGE.

    Project Information
    Themes: Machine Learning
    Funding: Swiss National Science Foundation Projects
    Shortname: IM2.MPR
    Web: www.im2.ch/research/im2-projects/im2mpr
    Start date: Dec 31, 2005
    End date: Dec 30, 2007