Philip N. Garner: Publications

[1] Sibo Tong, Philip N. Garner, and Hervé Bourlard. An investigation of deep neural networks for multilingual speech recognition training and adaptation. In Proceedings of Interspeech, pages 714--718, Stockholm, Sweden, August 2017. [ bib | DOI ]
[2] Renars Liepins, Ulrich Germann, Guntis Barzdins, Alexandra Birch, Steve Renals, Susanne Weber, Peggy van der Kreeft, Hervé Bourlard, João Prieto, Ondrej Klejch, Peter Bell, Alexandros Lazaridis, Alfonso Mendes, Sebastian Riedel, Mariana S. C. Almeida, Pedro Balage, Shay B. Cohen, Tomasz Dwojak, Philip N. Garner, Andreas Giefer, Marcin Junczys-Dowmunt, Hina Imran, David Nogueira, Ahmed Ali, Sebastião Miranda, Andrei Popescu-Belis, Lesly Miculicich Werlen, Nikos Papasarantopoulos, Abiola Obamuyide, Clive Jones, Fahim Dalvi, Andreas Vlachos, Yang Wang, Sibo Tong, Rico Sennrich, Nikolaos Pappas, Shashi Narayan, Marco Damonte, Nadir Durrani, Sameer Khurana, Ahmed Abdelali, Hassan Sajjad, Stephan Vogel, David Sheppey, Chris Hernon, and Jeff Mitchell. The SUMMA platform prototype. In Proceedings of the Software Demonstrations of the 15th Conference of the European Chapter of the Association for Computational Linguistics, pages 116--119, Valencia, Spain, April 2017. [ bib | http ]
[3] Alexandros Lazaridis, Ivan Himawan, Petr Motlicek, Iosif Mporas, and Philip N. Garner. Investigating cross-lingual multi-level adaptive networks: The importance of the correlation of source and target languages. In Proceedings of the International Workshop on Spoken Language Translation, Seattle, WA, USA, December 2016. [ bib | http ]
[4] Milos Cernak, Alexandros Lazaridis, Afsaneh Asaei, and Philip N. Garner. Composition of deep and spiking neural networks for very low bit rate speech coding. IEEE/ACM Transactions on Audio, Speech and Language Processing, 24(12):2301--2312, December 2016. [ bib | DOI | .pdf ]
[5] Branislav Gerazov, Aleksandar Gjoreski, Aleksandar Melov, Pierre-Edouard Honnet, Zoran Ivanovski, and Philip N. Garner. Unified prosody model based on atom decomposition for emphasis detection. In Proceedings of ETAI, Struga, Macedonia, September 2016. [ bib | .pdf ]
[6] Pierre-Edouard Honnet and Philip N. Garner. Emphasis recreation for TTS using intonation atoms. In Proceedings of the 9th ISCA Speech Synthesis Workshop, Sunnyvale, CA, USA, September 2016. [ bib | DOI | .pdf ]
[7] Alexandros Lazaridis, Milos Cernak, Pierre-Edouard Honnet, and Philip N. Garner. Investigating spectral amplitude modulation phase hierarchy features in speech synthesis. In Proceedings of the 9th ISCA Speech Synthesis Workshop, Sunnyvale, CA, USA, September 2016. [ bib | DOI | .pdf ]
[8] Jean-Philippe Goldman, Pierre-Edouard Honnet, Robert Clark, Philip N. Garner, Maria Ivanova, Alexandros Lazaridis, Hui Liang, Tiago Macedo, Beat Pfister, Manuel Sam Ribeiro, Eric Wehrli, and Junichi Yamagishi. The SIWIS database: a multilingual speech database with acted emphasis. In Proceedings of Interspeech, San Francisco, California, USA, September 2016. [ bib | DOI | .pdf ]
[9] Milos Cernak, Afsaneh Asaei, Pierre-Edouard Honnet, Philip N. Garner, and Hervé Bourlard. Sound pattern matching for automatic prosodic event detection. In Proceedings of Interspeech, San Francisco, California, USA, September 2016. [ bib | DOI | .pdf ]
[10] Alexandros Lazaridis, Milos Cernak, and Philip N. Garner. Probabilistic amplitude demodulation features in speech synthesis for improving prosody. In Proceedings of Interspeech, San Francisco, California, USA, September 2016. [ bib | DOI | .pdf ]
[11] Milos Cernak and Philip N. Garner. PhonVoc: A phonetic and phonological vocoding toolkit. In Proceedings of Interspeech, San Francisco, California, USA, September 2016. [ bib | DOI | .pdf ]
[12] Tamás Gábor Csapó, Géza Németh, Milos Cernak, and Philip N. Garner. Modeling unvoiced sounds in statistical parametric speech synthesis with a continuous vocoder. In The European Signal Processing Conference, Budapest, Hungary, August 2016. [ bib | .pdf ]
[13] Milan Sečujski, Branislav Gerazov, Tamás Gábor Csapó, Vlado Delić, Philip N. Garner, Aleksandar Gjoreski, David Guennec, Zoran Ivanovski, Aleksandar Melov, Géza Németh, Ana Stojković, and György Szaszák. Design of a speech corpus for research on cross-lingual prosody transfer. In Andrey Ronzhin, Rodmonga Potapova, and Géza Németh, editors, Speech and Computer, volume 9811 of Lecture Notes in Artificial Intelligence, pages 199--206. Springer International Publishing, Budapest, Hungary, August 2016. 18th International Conference, SPECOM 2016. [ bib | DOI | .pdf ]
[14] Branislav Gerazov and Philip N. Garner. An agonist-antagonist pitch production model. In Andrey Ronzhin, Rodmonga Potapova, and Géza Németh, editors, Speech and Computer, volume 9811 of Lecture Notes in Artificial Intelligence, pages 84--91. Springer International Publishing, Budapest, Hungary, August 2016. 18th International Conference, SPECOM 2016. [ bib | DOI | .pdf ]
[15] Branislav Gerazov and Philip N. Garner. An investigation of muscle models for physiologically based intonation modelling. In Proceedings of the 23rd Telecommunications Forum, pages 468--471, Belgrade, Serbia, November 2015. [ bib | DOI | .pdf ]
[16] Alexandros Lazaridis, Blaise Potard, and Philip N. Garner. DNN-based speech synthesis: Importance of input features and training data. In Andrey Ronzhin, Rodmonga Potapova, and Nikos Fakotakis, editors, Speech and Computer, volume 9319 of Lecture Notes in Computer Science, pages 193--200. Springer International Publishing, Athens, Greece, September 2015. 17th International Conference, SPECOM 2015. [ bib | DOI | .pdf ]
[17] Branislav Gerazov, Pierre-Edouard Honnet, Aleksandar Gjoreski, and Philip N. Garner. Weighted correlation based atom decomposition intonation modelling. In Proceedings of Interspeech, pages 1601--1605, Dresden, Germany, September 2015. [ bib | .pdf ]
[18] Mohammad J. Taghizadeh, Afsaneh Asaei, Saeid Haghighatshoar, Philip N. Garner, and Hervé Bourlard. Spatial sound localization via multipath Euclidean distance matrix recovery. IEEE Journal of Selected Topics in Signal Processing, 9(5):802--814, August 2015. Issue on Spatial Audio. [ bib | DOI | .pdf ]
[19] Milos Cernak, Philip N. Garner, Alexandros Lazaridis, Petr Motlicek, and Xingyu Na. Incremental syllable-context phonetic vocoding. IEEE Transactions on Audio, Speech and Language Processing, 23(6):1019--1030, June 2015. [ bib | DOI | .pdf ]
[20] Mohammad J. Taghizadeh, Saeid Haghighatshoar, Afsaneh Asaei, Philip N. Garner, and Hervé Bourlard. Robust microphone placement for source localization from noisy distance measurements. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, Brisbane, Australia, April 2015. [ bib | DOI | .pdf ]
[21] Milos Cernak, Blaise Potard, and Philip N. Garner. Phonological vocoding using artificial neural networks. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, pages 4844--4848, Brisbane, Australia, April 2015. [ bib | DOI | .pdf ]
[22] Pierre-Edouard Honnet, Branislav Gerazov, and Philip N. Garner. Atom decomposition-based intonation modelling. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, pages 4744--4748, Brisbane, Australia, April 2015. [ bib | DOI | .pdf ]
[23] Mohammad J. Taghizadeh, Reza Parhizkar, Philip N. Garner, Hervé Bourlard, and Afsaneh Asaei. Ad hoc microphone array calibration: Euclidean distance matrix completion algorithm and theoretical guarantees. Signal Processing, 107:123--140, February 2015. Special Issue on ad hoc microphone arrays and wireless acoustic sensor networks. [ bib | DOI | .pdf ]
[24] Petr Motlicek, David Imseng, Blaise Potard, Philip N. Garner, and Ivan Himawan. Exploiting foreign resources for DNN-based ASR. EURASIP Journal on Audio, Speech, and Music Processing, (2015:17), 2015. [ bib | DOI | .pdf ]
[25] György Szaszák, Tamás Gábor Csapó, Philip N. Garner, Branislav Gerazov, Zoran Ivanovski, Géza Németh, Bálint Tóth, Milan Sečujski, and Vlado Delić. The SP2 SCOPES project on speech prosody. In DOGS2014 - Digital speech and image processing, Novi Sad, Serbia, October 2014. [ bib | .pdf ]
[26] Milos Cernak, Alexandros Lazaridis, Philip N. Garner, and Petr Motlicek. Stress and accent transmission in HMM-based syllable-context very low bit rate speech coding. In Proceedings of Interspeech, Singapore, September 2014. [ bib | .pdf ]
[27] Philip N. Garner, David Imseng, and Thomas Meyer. Automatic speech recognition and translation of a Swiss German dialect: Walliserdeutsch. In Proceedings of Interspeech, Singapore, September 2014. [ bib | .pdf ]
[28] Mohammad J. Taghizadeh, Philip N. Garner, and Hervé Bourlard. Enhanced diffuse field model for ad-hoc microphone array calibration. Signal Processing, 101:242--255, August 2014. [ bib | DOI | .pdf ]
[29] Alexandros Lazaridis, Elie Khoury, Jean-Philippe Goldman, Mathieu Avanzi, Sébastien Marcel, and Philip N. Garner. Swiss French regional accent identification. In Proceedings of Odyssey 2014: The Speaker and Language Recognition Workshop, Joensuu, Finland, June 2014. [ bib | .pdf ]
[30] Mohammad J. Taghizadeh, Afsaneh Asaei, Philip N. Garner, and Hervé Bourlard. Ad-hoc microphone array calibration from partial distance measurements. In Proceedings of the 4th Joint Workshop on Hands-free Speech Communication and Microphone Arrays (HSCMA 2014), Nancy, France, May 2014. Nominated for best paper award. [ bib | DOI | .pdf ]
[31] Alexandros Lazaridis, Pierre-Edouard Honnet, and Philip N. Garner. SVR vs MLP for phone duration modelling in HMM-based speech synthesis. In Proceedings of the 7th Speech Prosody Conference, Dublin, Ireland, May 2014. [ bib | .pdf ]
[32] Pierre-Edouard Honnet, Alexandros Lazaridis, Jean-Philippe Goldman, and Philip N. Garner. Prosody in Swiss French accents: Investigation using analysis by synthesis. In Proceedings of the 7th Speech Prosody Conference, Dublin, Ireland, May 2014. [ bib | .pdf ]
[33] Lakshmi Saheer, Junichi Yamagishi, Philip N. Garner, and John Dines. Combining vocal tract length normalization with hierarchical linear transformations. IEEE Journal of Selected Topics in Signal Processing, 8(2):262--272, April 2014. Special Issue on Statistical Parametric Speech Synthesis. [ bib | DOI | .pdf ]
[34] Steve Renals, Jean Carletta, Keith Edwards, Hervé Bourlard, Phil Garner, Andrei Popescu-Belis, Dietrich Klakow, Andrey Girenko, Volha Petukova, Philippe Wacker, Andrew Joscelyne, Costis Kompis, Simon Aliwell, William Stevens, and Youssef Sabbah. ROCKIT: Roadmap for conversational interaction technologies. In Proceedings of the 2014 Workshop on Roadmapping the Future of Multimodal Interaction Research Including Business Opportunities and Challenges, pages 39--42, Istanbul, Turkey, 2014. ACM. [ bib ]
[35] Pierre-Edouard Honnet and Philip N. Garner. Importance of prosody in Swiss French accent for speech synthesis. Nouveaux cahiers de linguistique française, 31, 2014. 3rd Swiss Workshop on Prosody, Geneva, September 2014. [ bib | .pdf ]
[36] Alexandros Lazaridis and Philip N. Garner. Syllable-based regional Swiss French accent identification using prosodic features. Nouveaux cahiers de linguistique française, 31, 2014. 3rd Swiss Workshop on Prosody, Geneva, September 2014. [ bib | .pdf ]
[37] Philip N. Garner, Rob Clark, Jean-Philippe Goldman, Pierre-Edouard Honnet, Maria Ivanova, Alexandros Lazaridis, Hui Liang, Beat Pfister, Manuel Sam Ribeiro, Eric Wehrli, and Junichi Yamagishi. Translation and prosody in Swiss languages. Nouveaux cahiers de linguistique française, 31, 2014. 3rd Swiss Workshop on Prosody, Geneva, September 2014. [ bib | .pdf ]
[38] David Imseng, Petr Motlicek, Hervé Bourlard, and Philip N. Garner. Using out-of-language data to improve an under-resourced speech recognizer. Speech Communication, 56:142--151, January 2014. [ bib | DOI | .pdf ]
[39] György Szaszák and Philip N. Garner. Evaluating intra- and crosslingual adaptation for non-native speech recognition in a bilingual environment. In Proceedings of the IEEE International Conference on Cognitive Infocommunications, pages 357--362, Budapest, Hungary, December 2013. [ bib | DOI | .pdf ]
[40] David Imseng, Petr Motlicek, Philip N. Garner, and Hervé Bourlard. Impact of deep MLP architecture on different acoustic modeling techniques for under-resourced speech recognition. In Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding, pages 332--337, Olomouc, Czech Republic, December 2013. [ bib | DOI | .pdf ]
[41] Petr Motlicek, David Imseng, and Philip N. Garner. Crosslingual tandem-SGMM: Exploiting out-of-language data for acoustic model and feature level adaptation. In Proceedings of Interspeech, Lyon, France, August 2013. [ bib | .pdf ]
[42] Milos Cernak, Xingyu Na, and Philip N. Garner. Syllable-based pitch encoding for low bit rate speech coding with recognition/synthesis architecture. In Proceedings of Interspeech, Lyon, France, August 2013. [ bib | .pdf ]
[43] David Imseng, Hervé Bourlard, John Dines, Philip N. Garner, and Mathew Magimai.-Doss. Applying multi- and cross-lingual stochastic phone space transformations to non-native speech recognition. IEEE Transactions on Audio, Speech and Language Processing, 21(8):1713--1726, August 2013. [ bib | DOI | http ]
[44] Mohammad J. Taghizadeh, Reza Parhizkar, Philip N. Garner, and Hervé Bourlard. Euclidean distance matrix completion for ad-hoc microphone array calibration. In Proceedings IEEE International Conference On Digital Signal Processing, pages 1--7, Santorini, Greece, July 2013. [ bib | DOI | .pdf ]
[45] Petr Motlicek, Philip N. Garner, Namhoon Kim, and Jeongmi Cho. Accent adaptation using subspace Gaussian mixture models. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, pages 7170--7174, Vancouver, Canada, May 2013. [ bib | DOI | .pdf ]
[46] Milos Cernak, Petr Motlicek, and Philip N. Garner. On the (un)importance of the contextual factors in HMM-based speech synthesis and coding. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, pages 8140--8143, Vancouver, Canada, May 2013. [ bib | DOI | .pdf ]
[47] Philip N. Garner, Milos Cernak, and Petr Motlicek. A simple continuous pitch estimation algorithm. IEEE Signal Processing Letters, 20(1):102--105, January 2013. [ bib | DOI | .pdf ]
[48] Cong-Thanh Do, Mohammad J. Taghizadeh, and Philip N. Garner. Combining cepstral normalization and cochlear implant-like speech processing for microphone array-based speech recognition. In Proceedings of the IEEE Workshop on Spoken Language Technology, pages 137--142, Miami, Florida, USA, December 2012. [ bib | DOI | .pdf ]
[49] David Imseng, Hervé Bourlard, Holger Caesar, Philip N. Garner, Gwénolé Lecorvé, and Alexandre Nanchen. MediaParl: Bilingual mixed language accented speech database. In Proceedings of the IEEE Workshop on Spoken Language Technology, pages 263--268, Miami, Florida, USA, December 2012. [ bib | DOI | .pdf ]
[50] David Imseng, John Dines, Petr Motlicek, Philip N. Garner, and Hervé Bourlard. Comparing different acoustic modeling techniques for multilingual boosting. In Proceedings of Interspeech, Portland, Oregon, September 2012. [ bib | .pdf ]
[51] Lakshmi Saheer, John Dines, and Philip N. Garner. Vocal tract length normalization for statistical parametric speech synthesis. IEEE Transactions on Audio, Speech and Language Processing, 20(7):2134--2148, September 2012. [ bib | DOI | .pdf ]
[52] Mohammad J. Taghizadeh, Philip N. Garner, and Hervé Bourlard. Microphone array beampattern characterization for hands-free speech applications. In Proceedings of the Seventh IEEE Sensor Array and Multichannel Signal Processing Workshop, pages 465--468, Hoboken, NJ, USA, June 2012. [ bib | DOI | .pdf ]
[53] David Imseng, Hervé Bourlard, and Philip N. Garner. Boosting under-resourced speech recognizers by exploiting out of language data - case study on Afrikaans. In Proceedings of the 3rd International Workshop on Spoken Languages Technologies for Under-resourced Languages, pages 60--67, Cape Town, South Africa, May 2012. [ bib | .pdf ]
[54] David Imseng, Hervé Bourlard, and Philip N. Garner. Using KL-divergence and multilingual information to improve ASR for under-resourced languages. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, pages 4869--4872, Kyoto, Japan, March 2012. [ bib | DOI | .pdf ]
[55] Lakshmi Saheer, Junichi Yamagishi, Philip N. Garner, and John Dines. Combining vocal tract length normalization with hierarchial linear transformations. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, pages 4493--4496, Kyoto, Japan, March 2012. [ bib | DOI | .pdf ]
[56] Thomas Hain, Lukáš Burget, John Dines, Philip N. Garner, František Grézl, Asmaa El Hannani, Marijn Huijbregts, Martin Karafiát, Mike Lincoln, and Vincent Wan. Transcribing meetings with the AMIDA systems. IEEE Transactions on Audio, Speech and Language Processing, 20(2):486--498, February 2012. [ bib | DOI | .pdf ]
[57] Hervé Bourlard, John Dines, Mathew Magimai-Doss, Philip N. Garner, David Imseng, Petr Motlicek, Hui Liang, Lakshmi Saheer, and Fabio Valente. Current trends in multilingual speech processing. Sadhana, 36(5):885--915, October 2011. Invited paper for special issue on the topic of Speech Communication and Signal Processing. [ bib | DOI | .pdf ]
[58] Philip N. Garner. Cepstral normalisation and the signal to noise ratio spectrum in automatic speech recognition. Speech Communication, 53(8):991--1001, October 2011. [ bib | DOI | .pdf ]
[59] Philip N. Garner. Bayesian Approaches to Uncertainty in Speech Processing. PhD by publication, School of Computing Sciences, University of East Anglia, September 2011. Awarded July 2012. [ bib | .pdf ]
[60] David Imseng, Hervé Bourlard, John Dines, Philip N. Garner, and Mathew Magimai.-Doss. Improving non-native ASR through stochastic multilingual phoneme space transformations. In Proceedings of Interspeech, Florence, Italy, August 2011. [ bib | .pdf ]
[61] Andrei Popescu-Belis, Majid Yazdani, Alexandre Nanchen, and Philip N. Garner. A just-in-time retrieval system for dialogues or monologues. In Proceedings of the 12th Annual SIGDial Meeting on Discourse and Dialogue, pages 350--352, Portland, OR, USA, June 2011. [ bib | .pdf ]
[62] Andrei Popescu-Belis, Majid Yazdani, Alexandre Nanchen, and Philip N. Garner. A speech-based just-in-time retrieval system using semantic search. In Proceedings of the ACL 2011 System Demonstrations, pages 80--86, Portland, OR, USA, June 2011. [ bib | http ]
[63] Mohammad J. Taghizadeh, Philip N. Garner, Hervé Bourlard, Hamid R. Abutalebi, and Afsaneh Asaei. An integrated framework for multi-channel multi-source localization and voice activity detection. In Proceedings of The Third Joint Workshop on Hands-free Speech Communication and Microphone Arrays, pages 92--97, Edinburgh, UK, May 2011. [ bib | DOI | .pdf ]
[64] Thomas Hain and Philip N. Garner. Speech recognition. In Steve Renals, Hervé Bourlard, Jean Carletta, and Andrei Popescu-Belis, editors, Multimodal Signal Processing: Human Interactions in Meetings, chapter 5. Cambridge University Press, The Edinburgh Building, Cambridge CB2 2RU, UK, 2011. [ bib ]
[65] Mirjam Wester, John Dines, Matthew Gibson, Hui Liang, Yi-Jian Wu, Lakshmi Saheer, Simon King, Keiichiro Oura, Philip N. Garner, William Byrne, Yong Guan, Teemu Hirsimäki, Reima Karhila, Mikko Kurimo, Matt Shannon, Sayaka Shiota, Jilei Tian, Keiichi Tokuda, and Junichi Yamagishi. Speaker adaptation and the evaluation of speaker similarity in the EMIME speech-to-speech translation project. In Proceedings of the 7th ISCA Speech Synthesis Workshop, Kyoto, Japan, September 2010. [ bib | .pdf ]
[66] Lakshmi Saheer, John Dines, Philip N. Garner, and Hui Liang. Implementation of VTLN for statistical speech synthesis. In Proceedings of the 7th ISCA Speech Synthesis Workshop, Kyoto, Japan, September 2010. [ bib | .pdf ]
[67] Philip N. Garner and John Dines. Tracter: A lightweight dataflow framework. In Proceedings of Interspeech, Makuhari, Japan, September 2010. [ bib | .pdf ]
[68] Thomas Hain, Lukáš Burget, John Dines, Philip N. Garner, Asmaa El Hannani, Marijn Huijbregts, Martin Karafiát, Mike Lincoln, and Vincent Wan. The AMIDA 2009 meeting transcription system. In Proceedings of Interspeech, Makuhari, Japan, September 2010. [ bib | .pdf ]
[69] Danil Korchagin, Philip N. Garner, and Petr Motlicek. Hands free audio analysis from home entertainment. In Proceedings of Interspeech, Makuhari, Japan, September 2010. [ bib | .pdf ]
[70] Petr Motlicek, Fabio Valente, and Philip N. Garner. English spoken term detection in multilingual recordings. In Proceedings of Interspeech, Makuhari, Japan, September 2010. [ bib | .pdf ]
[71] Afsaneh Asaei, Philip N. Garner, and Hervé Bourlard. Sparse component analysis for speech recognition in multi-speaker environment. In Proceedings of Interspeech, Makuhari, Japan, September 2010. [ bib | .pdf ]
[72] Mikko Kurimo, William Byrne, John Dines, Philip N. Garner, Matthew Gibson, Yong Guan, Teemu Hirsimäki, Reima Karhila, Simon King, Hui Liang, Keiichiro Oura, Lakshmi Saheer, Matt Shannon, Sayaka Shiota, Jilei Tian, Keiichi Tokuda, Mirjam Wester, Yi-Jian Wu, and Junichi Yamagishi. Personalising speech-to-speech translation in the EMIME project. In Proceedings of the ACL 2010 System Demonstrations, pages 48--53, Uppsala, Sweden, July 2010. [ bib | .pdf ]
[73] Danil Korchagin, Philip N. Garner, and John Dines. Automatic temporal alignment of AV data with confidence estimation. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, pages 269--272, Dallas, USA, March 2010. [ bib | DOI | .pdf ]
[74] Lakshmi Saheer, Philip N. Garner, John Dines, and Hui Liang. VTLN adaptation for statistical speech synthesis. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, pages 4838--4841, Dallas, USA, March 2010. [ bib | DOI | .pdf ]
[75] Philip N. Garner. SNR features for automatic speech recognition. In Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding, pages 182--187, Merano, Italy, December 2009. [ bib | DOI | .pdf ]
[76] Philip N. Garner, John Dines, Thomas Hain, Asmaa El Hannani, Martin Karafiát, Danil Korchagin, Mike Lincoln, Vincent Wan, and Le Zhang. Real-time ASR from meetings. In Proceedings of Interspeech, Brighton, UK, September 2009. [ bib | .pdf ]
[77] Kenichi Kumatani, John McDonough, Barbara Rauch, Dietrich Klakow, Philip N. Garner, and Weifeng Li. Beamforming with a maximum negentropy criterion. IEEE Transactions on Audio, Speech and Language Processing, 17(5):994--1008, July 2009. [ bib | DOI | .pdf ]
[78] Philip N. Garner. Silence models in weighted finite-state transducers. In Proceedings of Interspeech, Brisbane, Australia, September 2008. [ bib | .pdf ]
[79] Kenichi Kumatani, John McDonough, Barbara Rauch, Philip Garner, John Dines, and Weifeng Li. Maximum kurtosis beamforming with the generalized sidelobe canceller. In Proceedings of Interspeech, Brisbane, Australia, September 2008. [ bib | .pdf ]
[80] Kenichi Kumatani, John McDonough, Dietrich Klakow, Philip N. Garner, and Weifeng Li. Adaptive beamforming with a maximum negentropy criterion. In Proceedings of the Joint Workshop on Hands-free Speech Communication and Microphone Arrays (HSCMA), pages 180--183, Italy, May 2008. [ bib | DOI | .pdf ]
[81] Kenichi Kumatani, John McDonough, Stefan Schacht, Dietrich Klakow, Philip Garner, and Weifeng Li. Filter bank design based on minimization of individual aliasing terms for minimum mutual information subband adaptive beamforming. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, pages 1608--1612, Las Vegas, April 2008. [ bib | DOI | .pdf ]
[82] Philip N. Garner, Toshiaki Fukada, and Yasuhiro Komori. A differential spectral voice activity detector. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, pages 597--600, Montreal, May 2004. [ bib | DOI | .pdf ]
[83] Jason P. A. Charlesworth and Philip N. Garner. Spoken content. In B. S. Manjunath, Philippe Salembier, and Thomas Sikora, editors, Introduction to MPEG-7: Multimedia Content Description Interface, chapter 18, pages 299--316. John Wiley & Sons Ltd., July 2002. [ bib | .html ]
[84] Philip N. Garner and Adam T. Lindsay, editors. Information Technology - Multimedia Content Description Interface - Part 4: Audio. Number 15938-4:2002. ISO/IEC, 2002. International Standard. [ bib ]
[85] Jason P. A. Charlesworth and Philip N. Garner. SpokenContent representation in MPEG-7. IEEE Transactions on Circuits and Systems for Video Technology, 11(6):730--736, June 2001. Special Issue on MPEG-7. [ bib | DOI ]
[86] J. P. A. Charlesworth and P. N. Garner. Spoken content metadata and MPEG-7. In Proceedings ACM Multimedia 2000 Workshops, pages 81--84, Marina Del Rey, California, November 2000. ACM, PO Box 11405, New York, NY 10286 1405. [ bib | .pdf ]
[87] Adam T. Lindsay, Savitha Srinivasan, Jason P. A. Charlesworth, Philip N. Garner, and Werner Kriechbaum. Representation and linking mechanisms for audio in MPEG-7. Signal Processing: Image Communication, 16(1--2):193--209, September 2000. [ bib | DOI | .pdf ]
[88] Andrew R. Webb and Philip N. Garner. A basis function approach to position estimation using microwave arrays. Applied Statistics, 48 part 2:197--209, 1999. [ bib | DOI | .pdf ]
[89] Philip N. Garner and Wendy J. Holmes. On the robust incorporation of formant features into hidden Markov models for automatic speech recognition. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, volume 1, pages 1--4, 1998. [ bib | DOI | .pdf ]
[90] Philip N. Garner. On topic identification and dialogue move recognition. Computer Speech and Language, 11(4):275--306, October 1997. [ bib | DOI | .pdf ]
[91] John N. Holmes, Wendy J. Holmes, and Philip N. Garner. Using formant frequencies in speech recognition. In Proceedings of EUROSPEECH, volume 4, pages 2083--2086, September 1997. [ bib | .pdf ]
[92] Philip N. Garner and Aidan Hemsworth. A keyword selection strategy for dialogue move recognition and multi-class topic identification. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, volume 3, pages 1823--1826, April 1997. [ bib | DOI | .pdf ]
[93] Philip N. Garner, Sue R. Browning, Roger K. Moore, and Martin J. Russell. A theory of word frequencies and its application to dialogue move recognition. In Proceedings of the International Conference on Spoken Language Processing, pages 1880--1883, October 1996. [ bib | .pdf ]
[94] Andrew R. Webb and Philip N. Garner. Source position estimation using radial basis functions. In Proceedings 13th International Conference on Pattern Recognition, volume IV, pages 3--7, Vienna, 1996. [ bib ]
[95] B. Steer, J. Kloske, P. Garner, L. LeBlanc, and S. Schock. Towards sonar based perception and modelling for unmanned untethered underwater vehicles. In Proceedings IEEE International Conference on Robotics and Automation, volume 2, pages 112--116, May 1993. [ bib | DOI | .pdf ]
