AFSANEH ASAEI

Education

Honors

Academic Service

Publications

Professional Experiences

Contact

Hi Dear, welcome to my homepage! I am a postdoctoral scientist at Idiap Research Institute, under the supervision of Prof. Herve BourlardThe subject of my research is on structured sparsity models for microphone array speech processing and recognition. The past 12 years of my professional work have been devoted to various disciplines in signal processing and machine learning research. Currently, my passion is identifying a domain where machine listening paradigms can be realized for multiparty environments.

 

 

Education

Thesis: “Model-based Sparse Component Analysis for Multi-party Distant Speech Recognition

Thesis: “Sound Localization by Beamforming Techniques for Robust Speech Recognition

Thesis: “Design and Implementation of a Band-Pass FIR Filter using TMS320C57 DSP processor

 

Professional Experience

  • Idiap Research Institute, Martigny, Switzerland, 2008-Current

     

    Postdoctoral Researcher at Speech Group

    Areas of Research:

      o   Sparse Component Analysis

      o   Microphone Array Signal Processing

      o  Speech Recognition

     

    Research Assistant at Speech Group

    Areas of Research:

      o   Sparse Component Analysis

      o   Microphone Array Signal Processing

      o  Distant Speech Recognition

     

  • Iran Telecommunication Research Centre (ITRC), Tehran, Iran, 2002-2008

     

    Multimedia Systems Research Group

     

            Member of the “Biometrics” research team, 2007-2008

            Initiated our team project on multimodal biometrics for web-based applications

            Areas of Research:

            o   Speaker Verification

            o   Classifier fusion

     

                     Member of the “Electronic Health Research Management” team, 2005-2007

            Assessment of the outsourced projects and proposals on:

            o   Blind Source Separation

            o   Source Localization and Tracking

            o   Automatic Speech Recognition

     

    Voice Over IP Project

     

           Supervisor of the “DSP” team in the following projects, 2003-2004

                 o   Design and Simulation of a Fixed-Point G.168 Multichannel Echo Canceller for Voice over IP Media Gateway on TMS320C6416 EVM platform

            o   Design and Simulation of a Fixed-Point Fax Tone Detector and Generator on TMS320C6416 code composer simulator

     

          Supervisor of the “PCI Device Driver Writing” team, 2003

            o  Writing a PCI Device Driver on Linux for TMS320C6416 EVM platform.

            o  PCI Device Driver on Windows Platforms

     

       o   Implementation of a TMS320C542 laboratory board communicating with PC through the DSP HPI port, 2002

       o   Implementation of a voice loop back  using codec & TMS320C542 DSP processor, 2001

           

Ç  TOP

Honors

  • IEEE Spoken Language Processing Grant, Blind selection out of 700 papers, 2011

  • Google Anita Borg Memorial Scholarship, Finalist, 2010

  • PhD Fellowship, SCALE (Speech Communication with Adaptive LEarning) Marie-Curie international training network, 2009-2012

  • Ranked 1st among 65 MSc. Students of  the major of Computer Architecture at Sharif University of Technology, 2006

  • Ranked 59th out of ~20,000 participant in the graduate entrance exam of Computer Architecture, Ranked 10th (interviewed) accepted at Sharif University, 2004

  • Ranked 170th in the National University Entrance Exam among ~250,000 students, 1997

  • Ranked 1st in Mathematics and Physics group, National organization for development of exceptional talents, 1994-1997

  • Provincial nominee for national physics olympiad competition, 1996-1997

 

       

  Ç  TOP

Publications 

Patent

      Method, apparatus and computer program product for determining the location of a plurality of speech sources, 2012US-13/654055, October 2012

Journal Papers

      “Structured Sparsity Models for Multiparty Speech Recovery from Reverberant Recordings”, with M. Golbabaee, H. Bourlard, V. Cevher, 2012. [bibtex]

      “Verified Speaker Localization Utilizing Voicing Level in Split-bands”, with M. J. Taghizadeh, M. Bahrololum, M. Ghanbari,  Signal Processing,  vol. 89, Issue 6, pp 1038-1049, June 2009. [bibtex]

      “Room Acoustic Modeling and Speech Dereverberation Exploiting Sparsity and Low-rank Structures”, with M. Golbabaee, H. Bourlard, V. Cevher, 2013

      “Optimal Structured Sparse Coding for Spatio-Spectral Information Recovery”, with B. Raj, H. Bourlard, V. Cevher, 2013

Conference Papers:

           Room Acoustic Modeling Exploiting Joint Sparsity and Low-rank Structures”, with M. Golbabaee, H. Bourlard, V. Cevher, Signal Processing with Adaptive Sparse Structured Representations (SPARS), Lausanne, Switzerland, July, 2013. [bibtex]

      A Multipath Sparse Beamfroming Method”, with B. Raj, H. Bourlard, V. Cevher, Signal Processing with Adaptive Sparse Structured Representations (SPARS), Lausanne, Switzerland, July, 2013; nominated for best paper award. [bibtex]

      “Structured Sparse Coding for Microphone Array Position Calibration”, with B. Raj, H. Bourlard, V. Cevher, SAPA-SCALE Conference, Intl. Speech Communication Association, Portland, OR, September, 2012. [bibtex]

      Computational Methods for Structured Sparse Component Analysis of Convolutive Speech Mixtures”, with M. E. Davies, H. Bourlard, V. Cevher,  Intl. Conference on Acoustic Speech and Signal Processing (ICASSP), Kyoto, Japan, March, 2012. [bibtex]

      Multi-party Speech Recovery Exploiting Structured Sparsity Models”, with M. J. Taghizadeh, H. Bourlard, V. Cevher, Intl. Speech Communication Association, INTERSPEECH’2011. [bibtex] - Demo

      An Integrated Framework for Multi-Channel Multi-Source Localization and Voice Activity Detection”, with M. J. Taghizadeh, P. N. Garner, H. Bourlard, H. R. Abutalebi, Third Joint Workshop on Hands-free Speech Communication and Microphone Arrays (HSCMA), Edinburgh, Scotland, 2011. [bibtex]

      “Model-Based Compressive Sensing for Distant Multi-Party Speech Recognition”, with H. Bourlard, V. Cevher, Intl. Conference on Acoustic Speech and Signal Processing (ICASSP), Prague, Czech Republic, May 2011; winner of IEEE spoken language processing award. [bibtex] - Demo

            Sparse Component Analysis for Speech Recognition in Multi-speaker Environment”, with H. Bourlard and P. N. Garner, Intl. Speech Communication Association, INTERSPEECH’2010. [bibtex]

      Analysis of Phone Posterior Feature Space Exploiting Class-Specific Sparsity and MLP-Based Similarity”, with B. Picart, H. Bourlard, , Intl. Conference on Acoustic Speech and Signal Processing (ICASSP), Dallas, Texas, March 2010. [bibtex]

      Far-field Continuous Speech Recognition System based on Speaker Localization and Sub-band Beamforming”, with M. J. Taghizadeh, H. Sameti, , Proceedings of The 6th International ACS/IEEE Conference on Computer Systems and Applications ,Doha, Qatar, pp 495-500, April 2008. [bibtex]

      “Speaker Direction Finding for Practical Systems: A Comparison of Different Approaches”, with S. Ghanbari, M. J. Taghizadeh, H. Sameti, , Proceeding of the third Annual IEEE BENELUX/DSP valley signal processing symposium, Metropolis, Antwerp, Belgium, pp 129-133,  March 2007. [bibtex]

      Robust Speaker Localization Utilizing a Novel Beamforming Algorithm Based on Harmonic Structures”, with H. Sameti, M. Sh. Moin, , Proceeding of the 12th International Computer Society of Iran Computer Conference (CSICC'07), Tehran, Iran, February 2007. [bibtex]

Workshop

      “Speaker Verification and Localization by Microphone Array”, Presented in a one-day workshop on Biometrics at the 15th Iranian Conference on Electrical Engineering (ICEE’07), Tehran, May 12th, 2007

Technical Report

      * Structured Sparse Component Analysis of Compressive Acoustic Measurements”, with H. Bourlard and V. Cevher, IDIAP-RR-Internal, 2011, September 2011

     * Investigation of kNN Classifiers on Posterior Features Towards Application in Automatic Speech Recognition, with H. Bourlard and B. Picart, IDIAP-RR-76, May 2009

      * Kalman and Extended Kalman Filters for Source Tracking, ITRC Technical report, Multimedia Systems Research Group, August 2006

      * Speech Recognition Systems: A Survey Study, ITRC Technical report, Multimedia Systems Research Group,  May 2006

      * Voice Activity Detection: The Practical Solution for a Continuous Speech Recognition Engine, ITRC Technical report, Multimedia Systems Research Group,  September 2005

      * Test Procedure for G.168 and G.729 ITU-T Recommendations, ITRC Technical report, VOIP Project, December 2004

      * Design and Simulation of a Fixed-Point G.168 Multichannel Echo Canceller for TMS320C6416 EVM platform, ITRC Technical report, VOIP Project,  October 2004

       * Fax Tone Detection and Generation, Design and Simulation on the TMS320C6416 code composer, ITRC Technical report, VOIP Project, December 2003

      * How to Write a Linux PCI Device Driver for TMS320C6416 EVM platform, ITRC Technical report, VOIP Project, May, 2003

      * 2040 PCI Controller for DSP Boards, ITRC Technical report, VOIP Project, January 2003

 

Ç  TOP

Academic Service

Reviewing Manuscripts

      ACM transactions of speech and language processing

      Neural Computing and Applications Journal

      IEEE Transactions on Signal Processing

      EURASIP Journal on Advances in Signal Processing

      IEEE Signal Processing Letters

      IEEE Transactions on Audio, Speech and Language Processing

      IEEE Workshop on Applications of Signal Processing to Audio and Acoustics

      International Conference on Multimedia Computing and Information Technology

      2012 IEEE International Symposium on Circuits and Systems

Meetings and presentations

      Summer School on Machine Learning in Cambridge, UK, 29 August to 10 September 2009

      SCALE winter school on distant speech recognition at Saarland university, Germany, 11-22 January, 2010

      ICASSP conference in Dallas, Texas, USA, 14-19 March, 2010

      Google Scholars Retreat at Google office in Zurich, Switzerland, 28-30 June, 2010

      INTERSPEECH Conference in Makuhari, Japan, 26-30 September, 2010

      HSCMA workshop, Edinburgh, Scotland, 30 May to 1 June, 2011

      SPARS'11 workshop, Edinburgh, Scotland, 27-30 June, 2011

      INTERSPEECH Conference, Florence, Italy, 28-31 August, 2011, photos

      Interactive Multimodal Information Management Summer Institute, Switzerland, 2010-2011, photos

      SCALE winter school on "Beyond HMM" at Radboud university, Nijmegen, 24-27 January, 2012

          -  Session Chair: Keynote on "Exemplar-based methods for automatic speech recognition"

 

      IBM Watson Research Center, Yorktown Heights, NY, USA, 13 June, 2012 - Invited Visit

          -  Presentation: Model-based Sparse Component Analysis for Recovering Multi-Party Speech from Multi-channel Recordings

      Machine Learning Workshop, EPFL, Switzerland, 19 November, 2012  - Invited Talk

        -  Presentation: Structured Sparse Coding for Machine Listening

 

Ç  TOP

Contact

 

Ç  TOP

This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All person copying this information are expected to adhere to the terms and constraints invoked by each document's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder.

Last update on April 1st, 2013