Esaú VILLATORO

Google Scholar

Short Biography

Currently, I’m working as a Research Associate at Idiap research institute. I’m actively collaborating with the Speech and Audio Processing Group. Until December 2022, I held a tenured position at the Universidad Autónoma Metropolitana campus Cuajimalpa (UAM-C) in Mexico City. I’m an active member of the Language and Reasoning research group (LyR) in the Information Technologies Department. In addition, I’m an external member of the Laboratory of Language Technologies (LabTL) of the Computational Sciences Department at the National Institute of Astrophysics, Optics and Electronics (INAOE), located in Puebla, Mexico. I’m also a member of the Mexican Association for Natural Language Processing (AMPLN), the Hispano-American Network for Automatic Human Language Processing (RedHisTAL), and member of the Mexican Academy of Computer Science (AMEXCOMP).

I have over 10 years of experience in the Computer Science field. My primary research interests include the areas of Computational Linguistics (CL), Information Retrieval (IR), and Natural Language Processing (NLP). During the early stages of my career, I focused on statistical approaches and computational models to solve very specific NLP tasks, e.g., single and multiple document summarization, information retrieval, thematic and non-thematic text classification, text mining techniques, etc. More recently, I’ve been reorienting my research towards the intersection of NLP and Social Sciences, with special emphasis on applied NLP in Psycholinguistics, Authorship Analysis (AA), and Semantic Analysis (SA) from automatic transcripts.  

If you are interested, you can review my personal web page here.

You can check my latest publications since I started working at Idiap here.

Education

I hold a Ph. D. in Computer Science. Since year 2004 I’ve been working in the Natural Language Processing (NLP) area. Some of the particular sub-problems of NLP in which I’m interested and I’ve done some work are: 

  • Author Profiling (Sexual Predators Identification/Personality identification/ Mental health disorders identification)
  • Plagiarism Detection (Source Code Plagiarism Identification)
  • Sentiment Analysis (Opinion mining)
  • Single and Multiple Document Summarization
  • Information Retrieval (Geographic)
  • Automatic Speaker Identification
  • Question Answering Systems
  • Information Extraction from unstructured Texts
  • Automatic Text Classification

Current Projects

  • Comprehensive data-driven Risk and Threat Assessment Methods for the Early and Reliable Identification, Validation and Analysis of migration-related risks (CRITERIA). See here for a brief description.

Past Projects

  • Real time network, text, and speaker analytics for combating organized crime (ROXANNE): See here for a brief description.
  • Summarization and domain-Adaptive Retrieval of Information Across Languages (SARAL): See here for project description.


Tel: +41277217CP307-2
Office:
Contact