Learning verb tense translation from parallel corpora for statistical machine translation.

Applications are invited for a 6-month internship (preferably at the Master level) in the field of statistical machine translation (SMT). At Idiap, we are studying methods for using text-level information to improve SMT, in the context of the Swiss COMTIS project. In particular, we have successfully combined classifiers for discourse connectives with state-of-the-art SMT systems, which improved the translation of these connectives. In collaboration with linguists, we are currently analyzing the features that govern the translation of verb tenses, mainly from English to French. The main challenge for MT is that there is no one-to-one mapping between English and French tenses, so the correct choice depends on the context.

