| 2011 | 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, ASRU 2011, Waikoloa, HI, USA, December 11-15, 2011 David Nahamoo, Michael Picheny |
| 2011 | A Trajectory-based Parallel Model Combination with a unified static and dynamic parameter compensation for noisy speech recognition. Khe Chai Sim, Minh-Thang Luong |
| 2011 | A convergence analysis of log-linear training and its application to speech recognition. Simon Wiesler, Ralf Schlüter, Hermann Ney |
| 2011 | A convex hull approach to sparse representations for exemplar-based speech recognition. Tara N. Sainath, David Nahamoo, Dimitri Kanevsky, Bhuvana Ramabhadran, Parikshit M. Shah |
| 2011 | A dialogue system for accessing drug reviews. Jingjing Liu, Stephanie Seneff |
| 2011 | A factored conditional random field model for articulatory feature forced transcription. Rohit Prabhavalkar, Eric Fosler-Lussier, Karen Livescu |
| 2011 | A novel bottleneck-BLSTM front-end for feature-level context modeling in conversational speech recognition. Martin Wöllmer, Björn W. Schuller, Gerhard Rigoll |
| 2011 | A novel neural-based pronunciation modeling method for robust speech recognition. Guangpu Huang, Meng Joo Er |
| 2011 | A variational perspective on noise-robust speech recognition. Rogier C. van Dalen, Mark J. F. Gales |
| 2011 | Accent level adjustment in bilingual Thai-English text-to-speech synthesis. Chai Wutiwiwatchai, Ausdang Thangthai, Ananlada Chotimongkol, Chatchawarn Hansakunbuntheung, Nattanun Thatphithakkul |
| 2011 | Adapting n-gram maximum entropy language models with conditional entropy regularization. Ariya Rastrow, Mark Dredze, Sanjeev Khudanpur |
| 2011 | Alignment of spoken narratives for automated neuropsychological assessment. Emily Tucker Prud'hommeaux, Brian Roark |
| 2011 | An hierarchical exemplar-based sparse model of speech, with an application to ASR. Jort F. Gemmeke, Hugo Van hamme |
| 2011 | An investigation of heuristic, manual and statistical pronunciation derivation for Pashto. Upendra V. Chaudhari, Xiaodong Cui, Bowen Zhou, Rong Zhang |
| 2011 | Analyzing conversations using rich phrase patterns. Bin Zhang, Alex Marin, Brian Hutchinson, Mari Ostendorf |
| 2011 | Applying Multiclass Bandit algorithms to call-type classification. Liva Ralaivola, Benoît Favre, Pierre Gotab, Frédéric Béchet, Géraldine Damnati |
| 2011 | Applying feature bagging for more accurate and robust automated speaking assessment. Lei Chen |
| 2011 | Automatic detection of "g-dropping" in American English using forced alignment. Jiahong Yuan, Mark Y. Liberman |
| 2011 | Automatic detection of unnatural word-level segments in unit-selection speech synthesis. William Yang Wang, Kallirroi Georgila |
| 2011 | Bag of n-gram driven decoding for LVCSR system harnessing. Fethi Bougares, Yannick Estève, Paul Deléglise, Georges Linarès |
| 2011 | Bidirectional OM-LSA speech estimator for noise robust speech recognition. Yasunari Obuchi, Ryu Takeda, Masahito Togami |
| 2011 | Blind noise suppression for Non-Audible Murmur recognition with stereo signal processing. Shunta Ishii, Tomoki Toda, Hiroshi Saruwatari, Sakriani Sakti, Satoshi Nakamura |
| 2011 | Bootstrapping a spoken language identification system using unsupervised integrated sensing and processing decision trees. Shuai Huang, Damianos G. Karakos, Glen A. Coppersmith, Kenneth Ward Church, Sabato Marco Siniscalchi |
| 2011 | Building a conversational model from two-tweets. Ryuichiro Higashinaka, Noriaki Kawamae, Kugatsu Sadamitsu, Yasuhiro Minami, Toyomi Meguro, Kohji Dohsaka, Hirohito Inagaki |
| 2011 | Convolutive Bottleneck Network features for LVCSR. Karel Veselý, Martin Karafiát, Frantisek Grézl |
| 2011 | Cross-lingual portability of Chinese and english neural network features for French and German LVCSR. Christian Plahl, Ralf Schlüter, Hermann Ney |
| 2011 | Crowd-sourcing for difficult transcription of speech. Jason D. Williams, I. Dan Melamed, Tirso Alonso, Barbara Hollister, Jay G. Wilpon |
| 2011 | Decision of response timing for incremental speech recognition with reinforcement learning. Di Lu, Takuya Nishimoto, Nobuaki Minematsu |
| 2011 | Derivative kernels for noise robust ASR. Anton Ragni, Mark J. F. Gales |
| 2011 | Designing text corpus using phone-error distribution for acoustic modeling. Hiroko Murakami, Koichi Shinoda, Sadaoki Furui |
| 2011 | Detection of persons with Parkinson's disease by acoustic, vocal, and prosodic analysis. Tobias Bocklet, Elmar Nöth, Georg Stemmer, Hana Ruzickova, Jan Rusz |
| 2011 | Detection of precisely transcribed parts from inexact transcribed corpus. Kengo Ohta, Masatoshi Tsuchiya, Seiichi Nakagawa |
| 2011 | Detection-based accented speech recognition using articulatory features. Chao Zhang, Yi Liu, Chin-Hui Lee |
| 2011 | Discriminative reranking of ASR hypotheses with morpholexical and N-best-list features. Hasim Sak, Murat Saraclar, Tunga Gungor |
| 2011 | Discriminative splitting of Gaussian/log-linear mixture HMMs for speech recognition. Muhammad Ali Tahir, Ralf Schlüter, Hermann Ney |
| 2011 | Don't multiply lightly: Quantifying problems with the acoustic model assumptions in speech recognition. Dan Gillick, Larry Gillick, Steven Wegmann |
| 2011 | Efficient determinization of tagged word lattices using categorial and lexicographic semirings. Izhak Shafran, Richard Sproat, Mahsa Yarmohammadi, Brian Roark |
| 2011 | Efficient discriminative training of long-span language models. Ariya Rastrow, Mark Dredze, Sanjeev Khudanpur |
| 2011 | Efficient representation and fast look-up of Maximum Entropy language models. Jia Cui, Stanley F. Chen, Bowen Zhou |
| 2011 | Efficient spoken term discovery using randomized algorithms. Aren Jansen, Benjamin Van Durme |
| 2011 | Employing web search query click logs for multi-domain spoken language understanding. Dilek Hakkani-Tür, Gökhan Tür, Larry P. Heck, Asli Celikyilmaz, Ashley Fidler, Dustin Hillard, Rukmini Iyer, Sarangarajan Parthasarathy |
| 2011 | Estimating document frequencies in a speech corpus. Damianos G. Karakos, Mark Dredze, Ken Ward Church, Aren Jansen, Sanjeev Khudanpur |
| 2011 | Evaluating prosodic features for automated scoring of non-native read speech. Klaus Zechner, Xiaoming Xi, Lei Chen |
| 2011 | Evolutionary discriminative speaker adaptation. Sid-Ahmed Selouani |
| 2011 | Exploiting distance based similarity in topic models for user intent detection. Asli Celikyilmaz, Dilek Hakkani-Tür, Gökhan Tür, Ashley Fidler, Dustin Hillard |
| 2011 | Extending noise robust structured support vector machines to larger vocabulary tasks. Shi-Xiong Zhang, Mark J. F. Gales |
| 2011 | Factor analysis based session variability compensation for Automatic Speech Recognition. Mickael Rouvier, Mohamed Bouallegue, Driss Matrouf, Georges Linarès |
| 2011 | Factored adaptation for separable compensation of speaker and environmental variability. Michael L. Seltzer, Alex Acero |
| 2011 | Fast and flexible Kullback-Leibler divergence based acoustic modeling for non-native speech recognition. David Imseng, Ramya Rasipuram, Mathew Magimai-Doss |
| 2011 | Fast speaker diarization using a high-level scripting language. Ekaterina Gonina, Gerald Friedland, Henry Cook, Kurt Keutzer |
| 2011 | Feature engineering in Context-Dependent Deep Neural Networks for conversational speech transcription. Frank Seide, Gang Li, Xie Chen, Dong Yu |
| 2011 | Frame-level AnyBoost for LVCSR with the MMI Criterion. Ryuki Tachibana, Takashi Fukuda, Upendra V. Chaudhari, Bhuvana Ramabhadran, Puming Zhan |
| 2011 | From Modern Standard Arabic to Levantine ASR: Leveraging GALE for dialects. Hagen Soltau, Lidia Mangu, Fadi Biadsy |
| 2011 | Gain estimation approaches in catalog-based single-channel speech-music separation. Cemil Demir, Ali Taylan Cemgil, Murat Saraclar |
| 2011 | Improved spoken term detection using support vector machines with acoustic and context features from pseudo-relevance feedback. Tsung-wei Tu, Hung-yi Lee, Lin-Shan Lee |
| 2011 | Improving reverberant VTS for hands-free robust speech recognition. Yongqiang Wang, Mark J. F. Gales |
| 2011 | Investigating the role of machine translated text in ASR domain adaptation: Unsupervised and semi-supervised methods. Horia Cucu, Laurent Besacier, Corneliu Burileanu, Andi Buzo |
| 2011 | Latent semantic analysis for question classification with neural networks. Babak Loni, Seyedeh Halleh Khoshnevis, Pascal Wiggers |
| 2011 | Leveraging large amounts of loosely transcribed corporate videos for acoustic model training. Matthias Paulik, Panchi Panchapagesan |
| 2011 | Linear versus mel frequency cepstral coefficients for speaker recognition. Xinhui Zhou, Daniel Garcia-Romero, Ramani Duraiswami, Carol Y. Espy-Wilson, Shihab A. Shamma |
| 2011 | Making Deep Belief Networks effective for large vocabulary continuous speech recognition. Tara N. Sainath, Brian Kingsbury, Bhuvana Ramabhadran, Petr Fousek, Petr Novák, Abdel-rahman Mohamed |
| 2011 | Matched-condition robust Dynamic Noise Adaptation. Steven J. Rennie, Pierre L. Dognin, Petr Fousek |
| 2011 | Maximum kurtosis beamforming with a subspace filter for distant speech recognition. Ken'ichi Kumatani, John W. McDonough, Bhiksha Raj |
| 2011 | Minimum Bayes risk discriminative language models for Arabic speech recognition. Hong-Kwang Jeff Kuo, Ebru Arisoy, Lidia Mangu, George Saon |
| 2011 | Minimum detection error training of subword detectors. Alfonso M. Canterla, Magne Hallstein Johnsen |
| 2011 | Model-based parametric features for emotion recognition from speech. Sankaranarayanan Ananthakrishnan, Aravind Namandi Vembu, Rohit Prasad |
| 2011 | Multi-level context-dependent acoustic modeling for automatic speech recognition. Hung-An Chang, James R. Glass |
| 2011 | Multi-site heterogeneous system fusions for the Albayzin 2010 Language Recognition Evaluation. Luis Javier Rodríguez, Mikel Peñagarikano, Amparo Varona, Mireia Díez, Germán Bordel, David Martínez González, Jesús Antonio Villalba López, Antonio Miguel, Alfonso Ortega, Eduardo Lleida, Alberto Abad, Oscar Koller, Isabel Trancoso, Paula Lopez-Otero, Laura Docío Fernández, Carmen García-Mateo, Rahim Saeidi, Mehdi Soufifar, Tomi Kinnunen, Torbjørn Svendsen, Pasi Fränti |
| 2011 | Multi-taper MFCC features for speaker verification using I-vectors. Md. Jahangir Alam, Tomi Kinnunen, Patrick Kenny, Pierre Ouellet, Douglas D. O'Shaughnessy |
| 2011 | N-Best rescoring by adaboost phoneme classifiers for isolated word recognition. Hiroshi Fujimura, Masanobu Nakamura, Yusuke Shinohara, Takashi Masuko |
| 2011 | On-line policy optimisation of spoken dialogue systems via live interaction with human subjects. Milica Gasic, Filip Jurcícek, Blaise Thomson, Kai Yu, Steve J. Young |
| 2011 | Pruning exponential language models. Stanley F. Chen, Abhinav Sethy, Bhuvana Ramabhadran |
| 2011 | Query modeling for spoken document retrieval. Berlin Chen, Pei-Ning Chen, Kuan-Yu Chen |
| 2011 | Randomized maximum entropy language models. Puyang Xu, Sanjeev Khudanpur, Asela Gunawardana |
| 2011 | Regularized subspace Gaussian mixture models for cross-lingual speech recognition. Liang Lu, Arnab Ghoshal, Steve Renals |
| 2011 | Robust seed model training for speaker adaptation using pseudo-speaker features generated by inverse CMLLR transformation. Arata Itoh, Sunao Hara, Norihide Kitaoka, Kazuya Takeda |
| 2011 | Robust speech recognition using articulatory gestures in a Dynamic Bayesian Network framework. Vikramjit Mitra, Hosung Nam, Carol Y. Espy-Wilson |
| 2011 | Robust understanding of spoken Chinese through character-based tagging and prior knowledge exploitation. Weiqun Xu, Changchun Bao, Yali Li, Jielin Pan, Yonghong Yan |
| 2011 | Sentiment analysis of text-to-speech input using latent affective mapping. Jerome R. Bellegarda |
| 2011 | Socio-situational setting classification based on language use. Yangyang Shi, Pascal Wiggers, Catholijn M. Jonker |
| 2011 | Some properties of Bayesian sensing hidden Markov models. George Saon, Jen-Tzung Chien |
| 2011 | Sparse Maximum A Posteriori adaptation. Peder A. Olsen, Jing Huang, Vaibhava Goel, Steven J. Rennie |
| 2011 | Speaker adaptation based on speaker-dependent eigenphone estimation. Wen-Lin Zhang, Wei-Qiang Zhang, Bi-Cheng Li |
| 2011 | Speaker adaptation with an Exponential Transform. Daniel Povey, Geoffrey Zweig, Alex Acero |
| 2011 | Strategies for training large scale neural network language models. Tomás Mikolov, Anoop Deoras, Daniel Povey, Lukás Burget, Jan Cernocký |
| 2011 | Strategies for using MLP based features with limited target-language training data. Yanmin Qian, Ji Xu, Daniel Povey, Jia Liu |
| 2011 | Study of probabilistic and Bottle-Neck features in multilingual environment. Frantisek Grézl, Martin Karafiát, Milos Janda |
| 2011 | Subspace Gaussian Mixture Models for vectorial HMM-states representation. Mohamed Bouallegue, Driss Matrouf, Mickael Rouvier, Georges Linarès |
| 2011 | Subword-based automatic lexicon learning for Speech Recognition. Timo Mertens, Stephanie Seneff |
| 2011 | Subword-based multi-span pronunciation adaptation for recognizing accented speech. Timo Mertens, Kit Thambiratnam, Frank Seide |
| 2011 | Supervised and unsupervised feature selection for inferring social nature of telephone conversations from their content. Anthony P. Stark, Izhak Shafran, Jeffrey A. Kaye |
| 2011 | The IBM 2011 GALE Arabic speech transcription system. Lidia Mangu, Hong-Kwang Kuo, Stephen M. Chu, Brian Kingsbury, George Saon, Hagen Soltau, Fadi Biadsy |
| 2011 | Topic modeling for spoken documents using only phonetic information. Timothy J. Hazen, Man-Hung Siu, Herbert Gish, Steve Lowe, Arthur Chan |
| 2011 | Towards choosing better primes for spoken dialog systems. José Lopes, Maxine Eskénazi, Isabel Trancoso |
| 2011 | Unsupervised learning in cross-corpus acoustic emotion recognition. Zixing Zhang, Felix Weninger, Martin Wöllmer, Björn W. Schuller |
| 2011 | Utterance verification using garbage words for a hospital appointment system with speech interface. Mitsuru Takaoka, Hiromitsu Nishizaki, Yoshihiro Sekiguchi |
| 2011 | Wizard of Oz evaluation of listening-oriented dialogue control using POMDP. Toyomi Meguro, Yasuhiro Minami, Ryuichiro Higashinaka, Kohji Dohsaka |
| 2011 | iVector-based discriminative adaptation for automatic speech recognition. Martin Karafiát, Lukás Burget, Pavel Matejka, Ondrej Glembek, Jan Cernocký |