ASRU C

98 papers

YearTitle / Authors
20112011 IEEE Workshop on Automatic Speech Recognition & Understanding, ASRU 2011, Waikoloa, HI, USA, December 11-15, 2011
David Nahamoo, Michael Picheny
2011A Trajectory-based Parallel Model Combination with a unified static and dynamic parameter compensation for noisy speech recognition.
Khe Chai Sim, Minh-Thang Luong
2011A convergence analysis of log-linear training and its application to speech recognition.
Simon Wiesler, Ralf Schlüter, Hermann Ney
2011A convex hull approach to sparse representations for exemplar-based speech recognition.
Tara N. Sainath, David Nahamoo, Dimitri Kanevsky, Bhuvana Ramabhadran, Parikshit M. Shah
2011A dialogue system for accessing drug reviews.
Jingjing Liu, Stephanie Seneff
2011A factored conditional random field model for articulatory feature forced transcription.
Rohit Prabhavalkar, Eric Fosler-Lussier, Karen Livescu
2011A novel bottleneck-BLSTM front-end for feature-level context modeling in conversational speech recognition.
Martin Wöllmer, Björn W. Schuller, Gerhard Rigoll
2011A novel neural-based pronunciation modeling method for robust speech recognition.
Guangpu Huang, Meng Joo Er
2011A variational perspective on noise-robust speech recognition.
Rogier C. van Dalen, Mark J. F. Gales
2011Accent level adjustment in bilingual Thai-English text-to-speech synthesis.
Chai Wutiwiwatchai, Ausdang Thangthai, Ananlada Chotimongkol, Chatchawarn Hansakunbuntheung, Nattanun Thatphithakkul
2011Adapting n-gram maximum entropy language models with conditional entropy regularization.
Ariya Rastrow, Mark Dredze, Sanjeev Khudanpur
2011Alignment of spoken narratives for automated neuropsychological assessment.
Emily Tucker Prud'hommeaux, Brian Roark
2011An hierarchical exemplar-based sparse model of speech, with an application to ASR.
Jort F. Gemmeke, Hugo Van hamme
2011An investigation of heuristic, manual and statistical pronunciation derivation for Pashto.
Upendra V. Chaudhari, Xiaodong Cui, Bowen Zhou, Rong Zhang
2011Analyzing conversations using rich phrase patterns.
Bin Zhang, Alex Marin, Brian Hutchinson, Mari Ostendorf
2011Applying Multiclass Bandit algorithms to call-type classification.
Liva Ralaivola, Benoît Favre, Pierre Gotab, Frédéric Béchet, Géraldine Damnati
2011Applying feature bagging for more accurate and robust automated speaking assessment.
Lei Chen
2011Automatic detection of "g-dropping" in American English using forced alignment.
Jiahong Yuan, Mark Y. Liberman
2011Automatic detection of unnatural word-level segments in unit-selection speech synthesis.
William Yang Wang, Kallirroi Georgila
2011Bag of n-gram driven decoding for LVCSR system harnessing.
Fethi Bougares, Yannick Estève, Paul Deléglise, Georges Linarès
2011Bidirectional OM-LSA speech estimator for noise robust speech recognition.
Yasunari Obuchi, Ryu Takeda, Masahito Togami
2011Blind noise suppression for Non-Audible Murmur recognition with stereo signal processing.
Shunta Ishii, Tomoki Toda, Hiroshi Saruwatari, Sakriani Sakti, Satoshi Nakamura
2011Bootstrapping a spoken language identification system using unsupervised integrated sensing and processing decision trees.
Shuai Huang, Damianos G. Karakos, Glen A. Coppersmith, Kenneth Ward Church, Sabato Marco Siniscalchi
2011Building a conversational model from two-tweets.
Ryuichiro Higashinaka, Noriaki Kawamae, Kugatsu Sadamitsu, Yasuhiro Minami, Toyomi Meguro, Kohji Dohsaka, Hirohito Inagaki
2011Convolutive Bottleneck Network features for LVCSR.
Karel Veselý, Martin Karafiát, Frantisek Grézl
2011Cross-lingual portability of Chinese and english neural network features for French and German LVCSR.
Christian Plahl, Ralf Schlüter, Hermann Ney
2011Crowd-sourcing for difficult transcription of speech.
Jason D. Williams, I. Dan Melamed, Tirso Alonso, Barbara Hollister, Jay G. Wilpon
2011Decision of response timing for incremental speech recognition with reinforcement learning.
Di Lu, Takuya Nishimoto, Nobuaki Minematsu
2011Derivative kernels for noise robust ASR.
Anton Ragni, Mark J. F. Gales
2011Designing text corpus using phone-error distribution for acoustic modeling.
Hiroko Murakami, Koichi Shinoda, Sadaoki Furui
2011Detection of persons with Parkinson's disease by acoustic, vocal, and prosodic analysis.
Tobias Bocklet, Elmar Nöth, Georg Stemmer, Hana Ruzickova, Jan Rusz
2011Detection of precisely transcribed parts from inexact transcribed corpus.
Kengo Ohta, Masatoshi Tsuchiya, Seiichi Nakagawa
2011Detection-based accented speech recognition using articulatory features.
Chao Zhang, Yi Liu, Chin-Hui Lee
2011Discriminative reranking of ASR hypotheses with morpholexical and N-best-list features.
Hasim Sak, Murat Saraclar, Tunga Gungor
2011Discriminative splitting of Gaussian/log-linear mixture HMMs for speech recognition.
Muhammad Ali Tahir, Ralf Schlüter, Hermann Ney
2011Don't multiply lightly: Quantifying problems with the acoustic model assumptions in speech recognition.
Dan Gillick, Larry Gillick, Steven Wegmann
2011Efficient determinization of tagged word lattices using categorial and lexicographic semirings.
Izhak Shafran, Richard Sproat, Mahsa Yarmohammadi, Brian Roark
2011Efficient discriminative training of long-span language models.
Ariya Rastrow, Mark Dredze, Sanjeev Khudanpur
2011Efficient representation and fast look-up of Maximum Entropy language models.
Jia Cui, Stanley F. Chen, Bowen Zhou
2011Efficient spoken term discovery using randomized algorithms.
Aren Jansen, Benjamin Van Durme
2011Employing web search query click logs for multi-domain spoken language understanding.
Dilek Hakkani-Tür, Gökhan Tür, Larry P. Heck, Asli Celikyilmaz, Ashley Fidler, Dustin Hillard, Rukmini Iyer, Sarangarajan Parthasarathy
2011Estimating document frequencies in a speech corpus.
Damianos G. Karakos, Mark Dredze, Ken Ward Church, Aren Jansen, Sanjeev Khudanpur
2011Evaluating prosodic features for automated scoring of non-native read speech.
Klaus Zechner, Xiaoming Xi, Lei Chen
2011Evolutionary discriminative speaker adaptation.
Sid-Ahmed Selouani
2011Exploiting distance based similarity in topic models for user intent detection.
Asli Celikyilmaz, Dilek Hakkani-Tür, Gökhan Tür, Ashley Fidler, Dustin Hillard
2011Extending noise robust structured support vector machines to larger vocabulary tasks.
Shi-Xiong Zhang, Mark J. F. Gales
2011Factor analysis based session variability compensation for Automatic Speech Recognition.
Mickael Rouvier, Mohamed Bouallegue, Driss Matrouf, Georges Linarès
2011Factored adaptation for separable compensation of speaker and environmental variability.
Michael L. Seltzer, Alex Acero
2011Fast and flexible Kullback-Leibler divergence based acoustic modeling for non-native speech recognition.
David Imseng, Ramya Rasipuram, Mathew Magimai-Doss
2011Fast speaker diarization using a high-level scripting language.
Ekaterina Gonina, Gerald Friedland, Henry Cook, Kurt Keutzer
2011Feature engineering in Context-Dependent Deep Neural Networks for conversational speech transcription.
Frank Seide, Gang Li, Xie Chen, Dong Yu
2011Frame-level AnyBoost for LVCSR with the MMI Criterion.
Ryuki Tachibana, Takashi Fukuda, Upendra V. Chaudhari, Bhuvana Ramabhadran, Puming Zhan
2011From Modern Standard Arabic to Levantine ASR: Leveraging GALE for dialects.
Hagen Soltau, Lidia Mangu, Fadi Biadsy
2011Gain estimation approaches in catalog-based single-channel speech-music separation.
Cemil Demir, Ali Taylan Cemgil, Murat Saraclar
2011Improved spoken term detection using support vector machines with acoustic and context features from pseudo-relevance feedback.
Tsung-wei Tu, Hung-yi Lee, Lin-Shan Lee
2011Improving reverberant VTS for hands-free robust speech recognition.
Yongqiang Wang, Mark J. F. Gales
2011Investigating the role of machine translated text in ASR domain adaptation: Unsupervised and semi-supervised methods.
Horia Cucu, Laurent Besacier, Corneliu Burileanu, Andi Buzo
2011Latent semantic analysis for question classification with neural networks.
Babak Loni, Seyedeh Halleh Khoshnevis, Pascal Wiggers
2011Leveraging large amounts of loosely transcribed corporate videos for acoustic model training.
Matthias Paulik, Panchi Panchapagesan
2011Linear versus mel frequency cepstral coefficients for speaker recognition.
Xinhui Zhou, Daniel Garcia-Romero, Ramani Duraiswami, Carol Y. Espy-Wilson, Shihab A. Shamma
2011Making Deep Belief Networks effective for large vocabulary continuous speech recognition.
Tara N. Sainath, Brian Kingsbury, Bhuvana Ramabhadran, Petr Fousek, Petr Novák, Abdel-rahman Mohamed
2011Matched-condition robust Dynamic Noise Adaptation.
Steven J. Rennie, Pierre L. Dognin, Petr Fousek
2011Maximum kurtosis beamforming with a subspace filter for distant speech recognition.
Ken'ichi Kumatani, John W. McDonough, Bhiksha Raj
2011Minimum Bayes risk discriminative language models for Arabic speech recognition.
Hong-Kwang Jeff Kuo, Ebru Arisoy, Lidia Mangu, George Saon
2011Minimum detection error training of subword detectors.
Alfonso M. Canterla, Magne Hallstein Johnsen
2011Model-based parametric features for emotion recognition from speech.
Sankaranarayanan Ananthakrishnan, Aravind Namandi Vembu, Rohit Prasad
2011Multi-level context-dependent acoustic modeling for automatic speech recognition.
Hung-An Chang, James R. Glass
2011Multi-site heterogeneous system fusions for the Albayzin 2010 Language Recognition Evaluation.
Luis Javier Rodríguez, Mikel Peñagarikano, Amparo Varona, Mireia Díez, Germán Bordel, David Martínez González, Jesús Antonio Villalba López, Antonio Miguel, Alfonso Ortega, Eduardo Lleida, Alberto Abad, Oscar Koller, Isabel Trancoso, Paula Lopez-Otero, Laura Docío Fernández, Carmen García-Mateo, Rahim Saeidi, Mehdi Soufifar, Tomi Kinnunen, Torbjørn Svendsen, Pasi Fränti
2011Multi-taper MFCC features for speaker verification using I-vectors.
Md. Jahangir Alam, Tomi Kinnunen, Patrick Kenny, Pierre Ouellet, Douglas D. O'Shaughnessy
2011N-Best rescoring by adaboost phoneme classifiers for isolated word recognition.
Hiroshi Fujimura, Masanobu Nakamura, Yusuke Shinohara, Takashi Masuko
2011On-line policy optimisation of spoken dialogue systems via live interaction with human subjects.
Milica Gasic, Filip Jurcícek, Blaise Thomson, Kai Yu, Steve J. Young
2011Pruning exponential language models.
Stanley F. Chen, Abhinav Sethy, Bhuvana Ramabhadran
2011Query modeling for spoken document retrieval.
Berlin Chen, Pei-Ning Chen, Kuan-Yu Chen
2011Randomized maximum entropy language models.
Puyang Xu, Sanjeev Khudanpur, Asela Gunawardana
2011Regularized subspace Gaussian mixture models for cross-lingual speech recognition.
Liang Lu, Arnab Ghoshal, Steve Renals
2011Robust seed model training for speaker adaptation using pseudo-speaker features generated by inverse CMLLR transformation.
Arata Itoh, Sunao Hara, Norihide Kitaoka, Kazuya Takeda
2011Robust speech recognition using articulatory gestures in a Dynamic Bayesian Network framework.
Vikramjit Mitra, Hosung Nam, Carol Y. Espy-Wilson
2011Robust understanding of spoken Chinese through character-based tagging and prior knowledge exploitation.
Weiqun Xu, Changchun Bao, Yali Li, Jielin Pan, Yonghong Yan
2011Sentiment analysis of text-to-speech input using latent affective mapping.
Jerome R. Bellegarda
2011Socio-situational setting classification based on language use.
Yangyang Shi, Pascal Wiggers, Catholijn M. Jonker
2011Some properties of Bayesian sensing hidden Markov models.
George Saon, Jen-Tzung Chien
2011Sparse Maximum A Posteriori adaptation.
Peder A. Olsen, Jing Huang, Vaibhava Goel, Steven J. Rennie
2011Speaker adaptation based on speaker-dependent eigenphone estimation.
Wen-Lin Zhang, Wei-Qiang Zhang, Bi-Cheng Li
2011Speaker adaptation with an Exponential Transform.
Daniel Povey, Geoffrey Zweig, Alex Acero
2011Strategies for training large scale neural network language models.
Tomás Mikolov, Anoop Deoras, Daniel Povey, Lukás Burget, Jan Cernocký
2011Strategies for using MLP based features with limited target-language training data.
Yanmin Qian, Ji Xu, Daniel Povey, Jia Liu
2011Study of probabilistic and Bottle-Neck features in multilingual environment.
Frantisek Grézl, Martin Karafiát, Milos Janda
2011Subspace Gaussian Mixture Models for vectorial HMM-states representation.
Mohamed Bouallegue, Driss Matrouf, Mickael Rouvier, Georges Linarès
2011Subword-based automatic lexicon learning for Speech Recognition.
Timo Mertens, Stephanie Seneff
2011Subword-based multi-span pronunciation adaptation for recognizing accented speech.
Timo Mertens, Kit Thambiratnam, Frank Seide
2011Supervised and unsupervised feature selection for inferring social nature of telephone conversations from their content.
Anthony P. Stark, Izhak Shafran, Jeffrey A. Kaye
2011The IBM 2011 GALE Arabic speech transcription system.
Lidia Mangu, Hong-Kwang Kuo, Stephen M. Chu, Brian Kingsbury, George Saon, Hagen Soltau, Fadi Biadsy
2011Topic modeling for spoken documents using only phonetic information.
Timothy J. Hazen, Man-Hung Siu, Herbert Gish, Steve Lowe, Arthur Chan
2011Towards choosing better primes for spoken dialog systems.
José Lopes, Maxine Eskénazi, Isabel Trancoso
2011Unsupervised learning in cross-corpus acoustic emotion recognition.
Zixing Zhang, Felix Weninger, Martin Wöllmer, Björn W. Schuller
2011Utterance verification using garbage words for a hospital appointment system with speech interface.
Mitsuru Takaoka, Hiromitsu Nishizaki, Yoshihiro Sekiguchi
2011Wizard of Oz evaluation of listening-oriented dialogue control using POMDP.
Toyomi Meguro, Yasuhiro Minami, Ryuichiro Higashinaka, Kohji Dohsaka
2011iVector-based discriminative adaptation for automatic speech recognition.
Martin Karafiát, Lukás Burget, Pavel Matejka, Ondrej Glembek, Jan Cernocký