Personal tools
You are here: Home Scientific Research Projects Very Large Sets of Heuristics for Scene Interpretation

Very Large Sets of Heuristics for Scene Interpretation

The goal of the research in machine learning has been so far to avoid the necessity for a detailed hand-designed prior knowledge. This proposal takes the opposite stance and aims precisely at combining a very large number of heuristics in a statistical framework. It is an attempt at bootstrapping an academic interest for the development of structurally complex priors for applied machine learning.

 

We define an heuristic to be any a feature extractor, an algorithm processing the available raw signal to produce values relevant to the problem at hand. This purposely general definition encompasses techniques spanning from simple rules to symbolic modeling or unsupervised locally trained predictors. We assume that high performance can only be achieved by combining thousands of such hand-designed modules, and propose to develop these heuristics in an open and collaborative framework similar to the successful development process of open-source software and collaborative encyclopedia. As many contributors will be involved, the resulting system will benefit from the completeness and redundancy of as many idiosyncratic viewpoints.

 

Although of potential interest to many applications, we study this approach for the problem of scene interpretation. Given an image with every day objects and furniture or an outdoor landscape, the goal is to detect and identify as many objects as possible. Solving this task requires to design a classifier able to recognize the identity of an isolated object (see Figure 1). We propose to combine a multitude of feature extractors capturing different aspects of the image to create this multi-class predictor.

 

The research to be tackle in this project can be structured in two main axis:

  1. The development of a machine learning technique to aggregate the heuristic responses for object classification. We will have to handle the classical difficulties arising in such situation of very large dimension, with the additional difficulty of dealing with an highly heterogeneous feature space. Our initial choice will be forests of decision trees(Breiman 2001), which have many properties, statistical and algorithmic, desirable for that project.

 

  1. The study of feedback tools to help and motivate the design of heuristics. Such tools will have to produce a ranking of the heuristics to provide the contributors with both a good estimate of their own progress, a good perception of their overall ranking in the community of contributors, and an assistance in figuring what are the main weakness of the current set of heuristics. Performance will be measured on real images such as the Caltech 256 or the VOC dataset to allow for comparison with state-of-the-art techniques, and on computer-generated images (see Figure 2), which can be produced along an exhaustive labeling with very limited resources.
Document Actions
Project Information
Themes: Machine Learning
Funding: Swiss National Science Foundation
Shortname: VELASH
Dates: Start date: Jun 01, 2009
End date: May 31, 2012
Contact: François FLEURET
+41 277 217 739