|
|
- Info
Talks that were given at Idiap
Title of the talk: Extended Pen+ Tools for Multimodal Analysis and Interaction
Given By: Nadir Weibel
Abstract: Access to information is one of the most crucial aspects of everyday life. As computation becomes ubiquitous and our environment is enriched with new possibilities for communication and interaction, the existing infrastructure of science, business, and social interaction is confronted with the difficult challenges of supporting complex tasks, mediating networked interactions, and managing the increasing availability of digital information and technology. Despite the tremendous development in terms of both new digital devices and novel interaction techniques that we all witnessed during the last years, it is almost unbelievable how paper documents and pen-based interaction still represent a very important way of interacting with both physical and digital information spaces. In an effort of re-thinking what pen and paper user interfaces (PPUI) mean in a modern world, we are studying multi-modal interactions of pen+ a range of tangible devices at the intersection of the physical and the digital worlds.
In this talk I will present my latest research around pen- and paper-computing, looking at how multimodal interaction with this very old technology enables a range of novel affordances and supports communication and interaction.
In the first part of the talk, I will speak about the development of new systems and prototypes that encompasses pen and other modalities, such as speech and gestures, different devices, such as smart phones, tablets, high-resolution wall displays, as well as different domains such as healthcare, accessibility, data visualization and interaction, social networks, augmented office environments, and communication for early education, older adults and other specific populations. I will present some examples of the prototypes we developed and some brief extracts of the data we collected about their usage in the wild.
The second part of the talk will focus on pen- and paper-based techniques and tools to get richer access to multimodal data in various contexts. While a new generation of inexpensive digital recording devices and storage facilities is revolutionizing data collection in behavioral science, one of the main obstacles to fully capitalizing on this opportunity is the huge time investment required for analysis using current methods. To address this analysis bottleneck we developed ChronoViz, a system providing synchronized interactive visual representations of multiple data streams. By using two multimodal datasets (a recent study of pilot/co-pilot interaction in a Boeing 787 simulator, and an ongoing learning analytics research project), I will present how the analysis tool works and how the integration of paper-based annotations, analysis, and interactions as part of the tool itself enable the exploration of new exciting methods for observational research.
Brief Bio Dr. Nadir Weibel is a Post-doctoral fellow at the University of California San Diego, member of both the Distributed Cognition and Human-Computer Interaction Laboratory and the Ubiquitous Computing and Social Dynamics research group. He holds a Bachelor and Master in Computer Science from ETH Zurich (Dipl. Informatik-Ing. ETH), and a Ph.D. in Computer Science also from ETH Zurich. During his Ph.D, he explored new ways of enhancing a seemingly mundane, but ubiquitous, resource such as paper to support everyday work, interaction and collaboration as a member of the Global Information Systems research group at ETH.
His current research is situated at the intersection of computer science, communication, and social sciences, studying the cognitive consequences of the introduction and the deployment of interactive multimodal and tangible devices. His main interests ranges from software engineering to human computer interaction, including computer supported collaborative work, mobile and ubiquitous computing. In his work he is developing theory and methods, designing representations, implementing prototypes, and evaluating the effectiveness of interactive physical-digital systems in order to understand the broader design space in which they are situated. He is currently collaborating with researchers at UCSD, Stanford, Berkeley, Drexel University, Childrens Hospital in Washington DC, TU Darmstadt, INRIA Paris / Université Paris Sud and Telecom Paristech.
Day of the talk: Tuesday, 31 Jan 2012 11:00:00
The talk will be given at Idiap: 106, Conference room
|
|
Title of the talk: Combining Transcription-based and acoustic-based speaker identifications for Broadcast news
Given By: Sylvain Meignier, Le Maine University, F
Abstract: In this presentation, we consider the issue of speaker identification within audio records of broadcast news. The speaker identity information is extracted from both transcript-based and acoustic-based speaker identification systems. This information is combined in the belief functions framework, which makes coherent the knowledge representation of the problem. The Kuhn-Munkres algorithm is used to optimize the assignment problem of speaker identities and speaker clusters. Experiments carried out on French broadcast news from the French evaluation campaign ESTER show the efficiency of the proposed combination method.
keywords: speaker identification, speaker diarization, belief functions.
Day of the talk: Thursday, 22 Dec 2011 14:00:00
The talk will be given at Idiap: 106, Conference room
|
|
Title of the talk: Speaker Verification Using the Spectral and Time Parameters of Voice Signal
Given By: Prof. Victor Sorokin, R&D Director OOO V
Abstract: Abstract - The speaker verification system developed in the VOXSEAL project is based on variations in formantfrequencies at stationary fragments and transient processes of vowels, the spectral features of fricative sounds, and theduration of speech segments. The best features are chosen for each word from the fixed list of Russian numerals rangingfrom zero to nine. The password phrase is randomly generated by the system at each verification. The compensation fordynamic noise and the counteraction with respect to interference using the reproduction of the intercepted and recorded speech are provided by the repeated reproduction of several words. The total error probabilities for male andfemale voices are 0.006 and 0.025%, respectively, for 30 million tests, 429 speakers, and a maximum length of the passwordphrase of 10 words. Note that the probabilities of false identification and false rejection are almost equal
Author - Prof. Victor Sorokin, R&D Director OOO Voxseal, Skolkovo-Moscow Russian national, MSc. from Moscow Aviation Institute, PhD (Engineering), Doctor of Sc. Physics and Mathematics (1987). Leading Researcher of the Institute for Information Transmission Problems of Russian Academy of Sciences, member of the Acoustical Society of America, board member of the Russian Acoustical Society, author of the monographs "Theory of Speech Production" and "Speech Synthesis", and about 150 publications, owner of 8 patents in speech technology.
Day of the talk: Tuesday, 20 Dec 2011 14:00:00
The talk will be given at Idiap: 106, Conference room
|
|
Title of the talk: Building-up child-robot relationship for therapeutic purposes
Given By: Joan Pons
Abstract: Summary:
Socially assistive robots (SAR) have shown to be very promising in therapeutic programs with children.
Health-related goals such as in-clinic rehabilitation or quality of life improvement have been achieved
through social interaction. In this context, robots effectiveness depends strongly in its ability to
elicit long-term engagement in children. To explore the dynamics of social bonds emergence with robots
a field study with 49 sixth grade scholars (aged 11-12 years) and 4 different robots was carried out at
an elementary school. Childrens preferences, expectations on functionality and communication, and
interaction behavior were studied. The results showed that different robots appearance and performance
elicit in children distinctive perceptions and interactive behavior, and affect social processes as role
attribution and attachment. In a similar way, to explore the requirements of an effective human-robot
interaction, a quiz game was developed. A NAO robot was used to play the popular game of the 20 questions
to evaluate different interaction capabilities (i.e. face following, speech recognition, visual and audio
queues, and personalization).
ShortBio:
Joan Saez Pons did his PhD at the Mobile Machines and Vision Lab (MMVL), Sheffield Hallam University,
UK with the topic of multi-robot systems to collaborate with humans. He was as well a Marie-Curie
researcher at the Cognitive Neuroscience Department (KN) at University of Tuebingen, Germany. He has been
working at the Technical Research Centre for Dependency Care and Autonomous Living (CETpD), UPC, BarcelonaTech,
in the field of social robotics and human-robot interaction. His research interests include mobile robotics
navigation, multi-robot systems, cognitive robotics and human-robot interaction.
Day of the talk: Wednesday, 02 Nov 2011 16:00:00
The talk will be given at Idiap: 106, Conference room
|
|
Title of the talk: Convex Relaxation Methods for Image Processing
Given By: Xavier Bresson
Abstract: This talk will introduce recent methods to compute optimal solutions to fundamental problems in image processing.
Several meaningful problems in imaging are usually defined as non-convex energy minimization problems, which are
sensitive to initial condition and slow to minimize. The ultimate objective of our work is to overcome the bottleneck
problem of non-convexity. In other words, our goal is to convexify the original problems to produce more robust and
faster algorithms for real-world applications. Our approach consists in finding a convex relaxation of the original
non-convex optimization problems and thresholding the relaxed solution to reach the solution of the original problem.
We will show that this approach is able to convexify important and difficult image processing problems such as image
segmentation based on the level set method and image registration. Our algorithms are not only guaranteed to find a
global solution to the original problem, they are also at least as fast as graph-cuts combinatorial techniques while
being more accurate. Finally, I will introduce recent promising extensions of this approach in machine learning.
Bio: Prof. Xavier Bresson received his B.A. of Physics from University of Marseille and his Master of Electrical Engineering
from Ecole Superieure d'Electricite in Paris, France. He got his Ph.D. at the Swiss Federal Institute of Technology (EPFL)
in 2005. From 2006 to 2010, he was a Postdoctoral Scholar in the Department of Mathematics at University of California,
Los Angeles (UCLA). In 2010, he joined the Department of Computer Science at City University of Hong Kong as
Tenure-Track Assistant Professor. His current research works are focused on convex relaxation methods and unified geometric
methods in image processing and machine learning. He has published 38 papers in international journals and conferences.
Day of the talk: Thursday, 08 Sep 2011 11:00:00
The talk will be given at Idiap: 106, Conference room
|
|
Title of the talk: Scalable multi-class/multi-view object detection
Given By: Mr. Nima Razavi
Abstract: Scalability of object detectors with respect to the number of classes/views is a very important issue for applications where
many object classes need to be detected. While combining single-class detectors yields a linear complexity for testing,
multi-class detectors that localize all objects at once come often at the cost of a reduced detection accuracy. In this work,
we present a scalable multi-class detection algorithm which scales sublinearly with the number of classes without compromising accuracy.
To this end, a shared discriminative codebook of feature appearances is jointly trained for all classes and detection is also performed
for all classes jointly. Based on the learned sharing distributions of features among classes, we build a taxonomy of object classes.
The taxonomy is then exploited to further reduce the cost of multi-class object detection. Our method has linear training and sublinear
detection complexity in the number of classes. We have evaluated our method on the challenging PASCAL VOC06 and PASCAL VOC07
datasets and show that scaling the system does not lead to a loss in accuracy.
Day of the talk: Friday, 13 May 2011 14:30:00
The talk will be given at Idiap: 106, Conference room
|
|
Title of the talk: Latent Feature Models for the Structure and Meaning of Text
Given By: James Henderson and Paola Merlo
Abstract: Much of the meaning of text is
reflected in individual words or
phrases, but its full information
content requires structured analyses of
the syntax and semantics of natural
language. Our work on methods for
extracting such structured meaning
representations from natural language
has focused on the joint modelling of
syntactic and semantic dependency
structures. We have addressed this
problem by using latent variables to
model correlations between these two
structures without strong prior
assumptions about the nature of these
correlations. These models have
achieved state-of-the-art results in
both syntactic parsing and semantic
role labelling across several
languages. We have also used them to
exploit syntactic information in
correcting semantic roles automatically
transferred from translations.
Our use of latent variable models is in
part motivated by the recognition that
the supervised learning paradigm is
becoming increasingly impractical as
research in natural language processing
moves to more complex, deeper levels of
semantic analysis. By developing
robust efficient methods for learning
latent representations, we hope to be
able to induce semantic representations
from large quantities of data for
weakly correlated tasks, such as machine
translation. Our latent variable
models use vectors of latent features
for robust learning and exploit neural
networks for efficient approximate
inference, while still exploiting
methods from dependency parsing for
efficient decoding with sufficiently
powerful models.
(Work with Ivan Titov, Lonneke van der
Plas, Nikhil Garg, and Andrea Gesmundo.)
Day of the talk: Friday, 11 Mar 2011 11:00:00
The talk will be given at Idiap: 106, Conference room
|
|
Title of the talk: Face Recognition and Intelligent Video Surveillance
Given By: Prof Stan Z. Li
Abstract: Face recognition and intelligent video surveillance are important areas for the next generation ID management and public security.
In this talk, challenges and recent advances and applications of face biometric and intelligent video surveillance technologies will be described.
Shot Bio:
Stan Z. Li received his B.Eng from Hunan University, China, M.Eng from National University of Defense Technology, China, and PhD degree from Surrey University, UK.
He is currently a professor and the director of Center for Biometrics and Security Research (CBSR), Institute of Automation, Chinese Academy of Sciences (CASIA).
He worked at Microsoft Research Asia as a researcher from 2000 to 2004. Prior to that, he was an associate Professor at Nanyang Technological University, Singapore.
He was elevated to IEEE Fellow for his contributions to the fields of face recognition, pattern recognition and computer vision.
Day of the talk: Wednesday, 03 Nov 2010 14:00:00
The talk will be given at Idiap: 106, Conference room
|
|
Title of the talk: Social Sensing for Epidemiological Behavior Change
Given By: Anmol Madan
Abstract: Abstract: An important question in behavioral epidemiology and public health is to understand how individual behavior is affected by illness and stress. Although changes in individual behavior are intertwined with contagion, epidemiologists today do not have sensing or modeling tools to quantitatively measure its effects in real-world conditions. We propose a novel application of ubiquitous computing. We use mobile phone based co-location and communication sensing to measure characteristic behavior changes in symptomatic individuals, reflected in their total communication, interactions with respect to time of day (e.g., late night, early morning), diversity and entropy of face-to-face interactions and movement. Using these extracted mobile features, it is possible to predict the health status of an individual, without having actual health measurements from the subject. Finally, we estimate the temporal information flux and implied causality between physical symptoms, behavior and mental health.
Bio: Anmol Madan recently completed his PhD at the MIT Media Lab, with Prof. Alex Pentland. Currently, he is working as a post doctoral researcher at Northeastern University and Harvard University with Prof. David Lazer. He has received honors from the MIT 100k Competition and the MIT Enterprise Forum for various startup-related ideas. His research interests are in modeling human behavior using large-scale mobile phone sensor datasets, using applied machine learning and data mining methods. You might have also read about his research in popular media like CNN, BBC, New York Times, Wired, BusinessWeek and Slashdot.
Day of the talk: Friday, 01 Oct 2010 16:00:00
The talk will be given at Idiap: 106, Conference room
|
|
Title of the talk: Tell Me Where You have Lived, and I will Tell You What You Like: Adapting Interfaces to Cultural Preferences
Given By: Abraham Bernstein
Abstract: Adapting user interfaces to cultural preferences has been shown to improve a user's performance, but is oftentimes foregone because of its time-consuming and costly procedure. Moreover, it is usually limited to producing one uniform user interface (UI) for each nation disregarding the intangible nature of cultural backgrounds. To overcome these problems, we exemplify a new approach with our culturally adaptive web application MOCCA, which is able to map information in a cultural user model onto adaptation rules in order to create personalized UIs. Apart from introducing the adaptation flexibility of MOCCA, the talk describes a study with 30 participants in which we compared UI preferences to MOCCA's automatically generated UIs. Another experiment with over 40 participants from 3 coutnries showed a performance improvement for culturally adapted UIs over Results confirm that automatically predicting cultural UI preferences is possible, paving the way for low-cost cultural UI adaptations.
Bio
Abraham Bernstein is a full professor of informatics at the University of Zurich, Switzerland. His current research focuses on various aspects of the semantic web, knowledge discovery, service discovery/matchmaking, and mobile/pervasive computing. His work is based on both social science (organizational psychology/sociology/economics) and technical (computer science, artificial intelligence) foundations. Mr. Bernstein is a Ph.D. from MIT and has a Diploma in Computer Science (comparable to a M.S.) from the Swiss Federal Institute in Zurich (ETH). He is the program chair of this year's ISWC and on the editorial board of the International Journal on Semantic Web and Information Systems, the Informatik Spektrum by Springer, Journal of the Association for Information Systems, and the newly approved ACM Transactions on Intelligent Interactive Systems.
Day of the talk: Monday, 06 Sep 2010 11:00:00
The talk will be given at Idiap: 106, Conference room
|
|
Title of the talk: Conjugate Mixture Models for Clustering and Tracking Multimodal Data.
Given By: Vassil Khalidov
Abstract: The problem of multimodal tracking arises whenever the same objects are
observed through time by different sensors. We address the general case
when the observations from different modalities are not necessarily
aligned, in the sense that there is no obvious way to associate or to
compare them in some common space. Our objective is to construct a model that is able to estimate the number of objects and to cluster the data so that the clusters stay consistent across modalities through time. We use Bayesian treatment and present an approach, based on stochastic optimization and information criteria. The results are illustrated on a multiple audio-visual object tracking task with a ``robot head'' device, comprising a pair of stereoscopic cameras and a pair of microphones.
Day of the talk: Monday, 28 Jun 2010 11:00:00
The talk will be given at Idiap: 106, Conference room
|
|
Title of the talk: Statistical and knowledge-centric techniques in Natural Language Understanding: a valuable handshake?
Given By: Silvia Quateroni
Abstract: In this talk, I will draw from my experience in Information Retrieval and Spoken Dialogue Systems to discuss a number of situations where statistical (e.g. machine learning) techniques shake hands with knowledge-centric approaches to meet user needs and account for domain knowledge. I will present examples particularly from the areas of Question Answering and Spoken Language Understanding, two research fields that exhibit a number of common points.
Short biography: Silvia Quarteroni is a Senior Marie Curie Research Fellow involved in the ADAMACH project at the University of Trento. She received her MSc and BSc in Computer Engineering at the Swiss Federal Institute of Technology in Lausanne (EPFL) and her PhD in Computer Science at the University of York (UK). She has been working in several fields of Natural Language Processing, focusing on human-computer dialogue, information retrieval and personalization. She has published about 30 articles in international conferences and journals and is part of the programme committee of several of these.
Day of the talk: Thursday, 11 Mar 2010 11:00:00
The talk will be given at Idiap: 106, Conference room
|
|
Title of the talk: Subband temporal envelopes of speech signal and their central role in speech recognition by humans and machines
Given By: Cong-Thanh Do
Abstract: The subband temporal envelopes of speech signal have a central role in this presentation which can be split in three parts.
The first part of the presentation deals with the automatic recognition of cochlear implant-like spectrally reduced speech (SRS) [1]. The automatic speech recognition (ASR) system, which was trained on TI-digits database, is HMM-based and the speech feature vectors are the MFCCs along with the delta and acceleration coefficients. We show that from certain SRS spectral resolution, it is possible to achieve word accuracy as good as that attained with the original clean speech even though the SRS is synthesized only from subband temporal envelopes of the original clean speech [2]. This work motivated some perspectives on noise robust ASR and speech feature vector enhancement dedicated to ASR [3].
The human recognition of speech is addressed in the second part of the presentation. We present quantitative analyses on the speech fundamental frequency (F0) in the cochlear implant-like SRS which support the report of Zeng et al. 2005 [4], based on subjective tests, about the difficulty of cochlear implant users in identifying speakers. That is, the F0 distortion in state-of-the-art cochlear implant is great when the SRS, which is acoustic simulation of cochlear implant, is synthesized only from subband temporal envelopes [5]. The analyses revealed also a significant reduction of F0 distortion when the frequency modulation is integrated in cochlear implant, as proposed by Nie et al. 2005 [6]. On the other hand, the results of such quantitative analysis could be exploited to conduct subjective studies in cochlear implant research.
The third part of the presentation concerns the audio-visual speech processing in which a linear relationship between the subband temporal envelopes and the area of mouth opening was mathematically proposed [7]. This proposition is based on the pioneering research of Grant and Seitz [8] in which the author reported different degrees of correlation between acoustic envelopes and visible movements. Our mathematical model helps in estimating the area of mouth opening only from speech acoustics using blind deconvolution techniques [9]. The estimated area of mouth opening is sufficiently correlated with the manually measured ones with an average of correlation coefficients equals 0.73.
Biography: Cong-Thanh Do was born in Hanoi, Vietnam, in 1983. He received the Electrical Engineering degree from Hanoi University of Technology, Hanoi and Grenoble Institute of Technology, Grenoble, France, in 2006, through the Programme de Formation d'Ingénieurs d'Excellence au Vietnam (PFIEV). In 2007, he received the M.S degree in signal, image, speech, and telecommunication from the Grenoble Institute of Technology, Grenoble, France and performed a research internship in the Speech and Cognition Department of GIPSA-Lab, Grenoble, France.
He is currently working toward the Ph.D. degree in the Signal and Communications Department, Insitut Télécom, Télécom Bretagne, UMR CNRS 3192 Lab-STICC, Technopôle Brest-Iroise, Brest, France. His current research interests include automatic speech recognition, audio-visual speech processing and statistical signal processing.
Day of the talk: Friday, 05 Mar 2010 11:00:00
The talk will be given at Idiap: 106, Conference room
|
|
Title of the talk: IDIAP Newcomers
Given By: Hervé Bourlard
Abstract: If you are an IDIAP newcomer and we haven't had a chance to meet yet (e.g., at the previous
similar meeting), I would like to invite you for a meeting all together for informal introduction, discussions, and Q&As.
Day of the talk: Tuesday, 30 Jan 2007 17:00:00
The talk will be given at Idiap: Meeting Room (Villa Tissieres)
|
|
Title of the talk: Dry-run of my PhD defense
Given By: G. Lathoud
Abstract: Spatio-Temporal Analysis of Spontaneous Speech with Microphone Arrays
Knowing the location of human speakers permits a wide spectrum of
applications, including hearing aids, hands-free speech processing in
cars, surveillance, intelligent homes and offices, autonomous
robots. This thesis focuses on the use of microphone arrays to analyze
spontaneous multi-party speech. This is a challenging task, because
such speech contains many very short utterances, and people interrupt
a lot each other (overlapped speech). Moreover, in order to build
applications with the least possible constraints on the users, we use
distant microphones only, for example on a meeting room table.
Finally, the developed approaches are as unsupervised as possible,
having in mind the dominant proportion of non-technical users. We
targeted the development of an automatic system that can handle both
moving and static speakers, in order to answer the question "Who spoke
where and when?". Several issues were investigated, from the signal
processing level (where? when?) to the speaker clustering level
(who?). The techniques developed in the course of this research were
successfully tested on a large variety of real indoor recordings,
including cases with multiple moving speakers as well as seated
speakers in meetings. The versatility of the proposed techniques is
illustrated by a direct application to two related cases: hands-free
speech acquisition in cars, and noise-robust speech recognition
through telephones. Finally, a close analysis of the speaker
clustering results leads to question the linearity of the transmission
channel in a real indoor environment, when a speaker is a few meters
away from a microphone.
Day of the talk: Friday, 24 Nov 2006 16:00:00
The talk will be given at Idiap: Main Conference Room (UBS)
|
|
Title of the talk: A Music Discovery Engine based on Audio Similarities
Given By: Nicolas SCARINGELLA
Abstract: A Music Discovery Engine based on Audio Similarities
In the context of Electronic Music Distribution, huge databases
coming from both restoration of existing analog archives and new
content have been created and are continuously growing. The biggest
online services are now proposing around 2 millions tracks urging for
efficient ways to browse collections. Providing the kind of robust
access to the worlds vast store of music that we currently provide for
textual material has been the goal of the Music Information Retrieval
(MIR) community over the past 10 years; however, it still remains a
very challenging problem in the case of audio data.
Music information is indeed a multifaceted and sometimes complex
data set that includes pitch, temporal (i.e. rhythm), harmonic, timbral
(e.g. orchestration), textual (i.e. lyrics), symbolic, editorial, and
metadata elements (without considering related visual elements). Music
information is also extremely dynamic. That is, any given work can have
its specific pitches altered, its rhythm modified, its harmony reset,
its orchestration changed, its performance reinterpreted, and its
performers arbitrarily chosen; yet, somehow, it remains the same
piece of music as the original. Within this extraordinarily fluid
environment, the concept of similarity becomes particularly
problematic while being crucial to design audio and music information
retrieval systems.
In this talk, we will discuss the concept of similarity between music
excerpts and propose possible research directions to build a music
discovery engine based on audio analysis.
Day of the talk: Monday, 10 Jul 2006 16:00:00
The talk will be given at Idiap: Main Conference Room (UBS)
|
|
Title of the talk: Prior Knowledge in Kernel Methods (PhD defense rehearsal)
Given By: Alexei Pozdnoukhov
Abstract: Kernel Methods are one of the most successful branches of Machine Learning.
They allow applying linear algorithms with well-founded properties
such as generalization ability, to non-linear real-life problems.
Support Vector Machine is a well-known example of a kernel method,
which has found a wide range of applications in data analysis nowadays.
In many practical applications, some additional prior knowledge
is often available. This can be the knowledge about the data domain,
invariant transformations, inner geometrical structures in data,
some properties of the underlying process, etc.
If used smartly, this information can provide significant
improvement to any data processing algorithm.
Thus, it is important to develop methods for incorporating
prior knowledge into data-dependent models.
The main objective of this thesis is to investigate approaches
towards learning with kernel methods using prior knowledge.
Invariant learning with kernel methods is considered in more details.
Day of the talk: Thursday, 29 Jun 2006 15:00:00
The talk will be given at Idiap: Main Conference Room (UBS)
|
|
Title of the talk: PhD defense Dry run:
Given By: Norman Poh
Abstract: This thesis presentation is about combining multiple systems applied to biometric authentication. Its two-fold contribution is to provide a better understanding of the problem of fusion (w.r.t to correlation, performance strength of individual systems and noise) and to exploit the knowledge of claimed identity to improve the combined system performance. Conditioning on the claimed identity is difficult because one has to deals with a small learning sample size.
Day of the talk: Wednesday, 24 May 2006 16:00:00
The talk will be given at Idiap: Main Conference Room (UBS)
|
|
Title of the talk: Testing the talk system
Given By: Roswell
Abstract: Taratata
Day of the talk: Wednesday, 17 May 2006 18:10:00
The talk will be given at Idiap: Main Conference Room (UBS)
|
|
Title of the talk: Using Auxiliary Sources of Knowledge for Automatic Speech Recognition
Given By: Mathew Magimai Doss
Abstract: This is the second rehearsal of my PhD defense presentation. Your comments and suggestions would be of great help. Thank You!
Day of the talk: Friday, 27 May 2005 16:00:00
The talk will be given at Idiap: Meeting Room (Villa Tissieres)
|
|
Title of the talk: ACM MultiMedia conference report
Given By: Florent Monay
Abstract: I will describe some papers and demos from ACM MultiMedia 2003 and MIR2003 workshop (content-based multimedia information retrieval, home videos browsing/editing, home photos browsing, surveillance, sports video indexing, ...).
A discussion about the corresponding research directions will follow.
Day of the talk: Monday, 24 Nov 2003 11:00:00
The talk will be given at Idiap: Smart Meeting Room (Pavillon)
|
|
|
|