| 2013 | 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, Olomouc, Czech Republic, December 8-12, 2013 |
| 2013 | A generalized discriminative training framework for system combination. Yuuki Tachioka, Shinji Watanabe, Jonathan Le Roux, John R. Hershey |
| 2013 | A hierarchical system for word discovery exploiting DTW-based initialization. Oliver Walter, Timo Korthals, Reinhold Haeb-Umbach, Bhiksha Raj |
| 2013 | A propagation approach to modelling the joint distributions of clean and corrupted speech in the Mel-Cepstral domain. Ramón Fernandez Astudillo |
| 2013 | A study of supervised intrinsic spectral analysis for TIMIT phone classification. Reza Sahraeian, Dirk Van Compernolle |
| 2013 | ASR for electro-laryngeal speech. Anna Katharina Fuchs, Juan Andres Morales-Cordovilla, Martin Hagmüller |
| 2013 | Accelerating Hessian-free optimization for Deep Neural Networks by implicit preconditioning and sampling. Tara N. Sainath, Lior Horesh, Brian Kingsbury, Aleksandr Y. Aravkin, Bhuvana Ramabhadran |
| 2013 | Accelerating recurrent neural network training via two stage classes and parallelization. Zhiheng Huang, Geoffrey Zweig, Michael Levit, Benoît Dumoulin, Barlas Oguz, Shawn Chang |
| 2013 | Acoustic characteristics related to the perceptual pitch in whispered vowels. Hideaki Konno, Hideo Kanemitsu, Nobuyuki Takahashi, Mineichi Kudo |
| 2013 | Acoustic data-driven pronunciation lexicon for large vocabulary speech recognition. Liang Lu, Arnab Ghoshal, Steve Renals |
| 2013 | Acoustic modeling using transform-based phone-cluster adaptive training. Vimal Manohar, Srinivas C. Bhargav, Srinivasan Umesh |
| 2013 | Acoustic unit discovery and pronunciation generation from a grapheme-based lexicon. William Hartmann, Anindya Roy, Lori Lamel, Jean-Luc Gauvain |
| 2013 | An SVD-based scheme for MFCC compression in distributed speech recognition system. Azzedine Touazi, Mohamed Debyeche |
| 2013 | An empirical study of confusion modeling in keyword search for low resource languages. Murat Saraclar, Abhinav Sethy, Bhuvana Ramabhadran, Lidia Mangu, Jia Cui, Xiaodong Cui, Brian Kingsbury, Jonathan Mamou |
| 2013 | Automatic model complexity control for generalized variable parameter HMMs. Rongfeng Su, Xunying Liu, Lan Wang |
| 2013 | Automatic pronunciation clustering using a World English archive and pronunciation structure analysis. Han-Ping Shen, Nobuaki Minematsu, Takehiko Makino, Steven H. Weinberger, Teeraphon Pongkittiphan, Chung-Hsien Wu |
| 2013 | Automatic sentiment extraction from YouTube videos. Lakshmish Kaushik, Abhijeet Sangwan, John H. L. Hansen |
| 2013 | Barge-in effects in Bayesian dialogue act recognition and simulation. Heriberto Cuayáhuitl, Nina Dethlefs, Helen Wright Hastie, Oliver Lemon |
| 2013 | Combination of data borrowing strategies for low-resource LVCSR. Yanmin Qian, Kai Yu, Jia Liu |
| 2013 | Combining stochastic average gradient and Hessian-free optimization for sequence training of deep neural networks. Pierre L. Dognin, Vaibhava Goel |
| 2013 | Compact acoustic modeling based on acoustic manifold using a mixture of factor analyzers. Wen-Lin Zhang, Bi-Cheng Li, Wei-Qiang Zhang |
| 2013 | Context-dependent modelling of deep neural network using logistic regression. Guangsen Wang, Khe Chai Sim |
| 2013 | Convolutional neural network based triangular CRF for joint intent detection and slot filling. Puyang Xu, Ruhi Sarikaya |
| 2013 | Cross-lingual context sharing and parameter-tying for multi-lingual speech recognition. Aanchan Mohan, Richard C. Rose |
| 2013 | DNN acoustic modeling with modular multi-lingual feature extraction networks. Jonas Gehring, Quoc Bao Nguyen, Florian Metze, Alex Waibel |
| 2013 | Deep maxout networks for low-resource speech recognition. Yajie Miao, Florian Metze, Shourabh Rawat |
| 2013 | Deep maxout neural networks for speech recognition. Meng Cai, Yongzhe Shi, Jia Liu |
| 2013 | Dialogue management for leading the conversation in persuasive dialogue systems. Takuya Hiraoka, Yuki Yamauchi, Graham Neubig, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura |
| 2013 | Discriminative piecewise linear transformation based on deep learning for noise robust automatic speech recognition. Yosuke Kashiwagi, Daisuke Saito, Nobuaki Minematsu, Keikichi Hirose |
| 2013 | Discriminative semi-supervised training for keyword search in low resource languages. Roger Hsiao, Tim Ng, Frantisek Grézl, Damianos G. Karakos, Stavros Tsakalidis, Long Nguyen, Richard M. Schwartz |
| 2013 | Dysfluent speech detection by image forensics techniques. Juraj Pálfy, Sakhia Darjaa, Jiri Pospichal |
| 2013 | Effective pseudo-relevance feedback for language modeling in speech recognition. Berlin Chen, Yi-Wen Chen, Kuan-Yu Chen, Ea-Ee Jan |
| 2013 | Efficient nearly error-less LVCSR decoding based on incremental forward and backward passes. David Nolden, Ralf Schlüter, Hermann Ney |
| 2013 | Elastic spectral distortion for low resource speech recognition with deep neural networks. Naoyuki Kanda, Ryu Takeda, Yasunari Obuchi |
| 2013 | Emotion recognition from spontaneous speech using Hidden Markov models with deep belief networks. Duc Le, Emily Mower Provost |
| 2013 | Expert-based reward shaping and exploration scheme for boosting policy learning of dialogue management. Emmanuel Ferreira, Fabrice Lefèvre |
| 2013 | Fixed-dimensional acoustic embeddings of variable-length segments in low-resource settings. Keith D. Levin, Katharine Henry, Aren Jansen, Karen Livescu |
| 2013 | Hierarchical neural networks and enhanced class posteriors for social signal classification. Raymond Brueckner, Björn W. Schuller |
| 2013 | Hybrid acoustic models for distant and multichannel large vocabulary speech recognition. Pawel Swietojanski, Arnab Ghoshal, Steve Renals |
| 2013 | Hybrid speech recognition with Deep Bidirectional LSTM. Alex Graves, Navdeep Jaitly, Abdel-rahman Mohamed |
| 2013 | Impact of deep MLP architecture on different acoustic modeling techniques for under-resourced speech recognition. David Imseng, Petr Motlícek, Philip N. Garner, Hervé Bourlard |
| 2013 | Improved cepstral mean and variance normalization using Bayesian framework. N. Vishnu Prasad, Srinivasan Umesh |
| 2013 | Improved punctuation recovery through combination of multiple speech streams. João Miranda, João Paulo da Silva Neto, Alan W. Black |
| 2013 | Improvements to Deep Convolutional Neural Networks for LVCSR. Tara N. Sainath, Brian Kingsbury, Abdel-rahman Mohamed, George E. Dahl, George Saon, Hagen Soltau, Tomás Beran, Aleksandr Y. Aravkin, Bhuvana Ramabhadran |
| 2013 | Improving robustness of deep neural networks via spectral masking for automatic speech recognition. Bo Li, Khe Chai Sim |
| 2013 | Investigation of multilingual deep neural networks for spoken term detection. Kate M. Knill, Mark J. F. Gales, Shakti P. Rath, Philip C. Woodland, Chao Zhang, Shi-Xiong Zhang |
| 2013 | Joint training of interpolated exponential n-gram models. Abhinav Sethy, Stanley F. Chen, Ebru Arisoy, Bhuvana Ramabhadran, Kartik Audhkhasi, Shrikanth S. Narayanan, Paul Vozila |
| 2013 | K-component recurrent neural network language models using curriculum learning. Yangyang Shi, Martha A. Larson, Catholijn M. Jonker |
| 2013 | Language style and domain adaptation for cross-language SLU porting. Evgeny A. Stepanov, Ilya Kashkarev, Ali Orkan Bayer, Giuseppe Riccardi, Arindam Ghosh |
| 2013 | Large scale deep neural network acoustic modeling with semi-supervised training data for YouTube video transcription. Hank Liao, Erik McDermott, Andrew W. Senior |
| 2013 | Learning a subword vocabulary based on unigram likelihood. Matti Varjokallio, Mikko Kurimo, Sami Virpioja |
| 2013 | Learning better lexical properties for recurrent OOV words. Long Qin, Alexander I. Rudnicky |
| 2013 | Learning filter banks within a deep neural network framework. Tara N. Sainath, Brian Kingsbury, Abdel-rahman Mohamed, Bhuvana Ramabhadran |
| 2013 | Learning state labels for sparse classification of speech with matrix deconvolution. Antti Hurmalainen, Tuomas Virtanen |
| 2013 | Lightly supervised automatic subtitling of weather forecasts. Joris Driesen, Steve Renals |
| 2013 | Mixture of mixture n-gram language models. Hasim Sak, Cyril Allauzen, Kaisuke Nakajima, Françoise Beaufays |
| 2013 | Models of tone for tonal and non-tonal languages. Florian Metze, Zaid Sheikh, Alex Waibel, Jonas Gehring, Kevin Kilgour, Quoc Bao Nguyen, Van Huy Nguyen |
| 2013 | Modified splice and its extension to non-stereo data for noise robust speech recognition. D. S. Pavan Kumar, N. Vishnu Prasad, Vikas Joshi, Srinivasan Umesh |
| 2013 | Multi-stream temporally varying weight regression for cross-lingual speech recognition. Shilin Liu, Khe Chai Sim |
| 2013 | NMF-based keyword learning from scarce data. Bart Ons, Jort F. Gemmeke, Hugo Van hamme |
| 2013 | Neighbour selection and adaptation for rapid speaker-dependent ASR. Udhyakumar Nallasamy, Mark C. Fuhs, Monika Woszczyna, Florian Metze, Tanja Schultz |
| 2013 | On-line adaptation of semantic models for spoken language understanding. Ali Orkan Bayer, Giuseppe Riccardi |
| 2013 | Phonetic and anthropometric conditioning of MSA-KST cognitive impairment characterization system. Alexei V. Ivanov, Shahab Jalalvand, Roberto Gretter, Daniele Falavigna |
| 2013 | Porting concepts from DNNs back to GMMs. Kris Demuynck, Fabian Triefenbach |
| 2013 | Probabilistic lexical modeling and unsupervised training for zero-resourced ASR. Ramya Rasipuram, Marzieh Razavi, Mathew Magimai-Doss |
| 2013 | Query understanding enhanced by hierarchical parsing structures. Jingjing Liu, Panupong Pasupat, Yining Wang, Scott Cyphers, James R. Glass |
| 2013 | Score normalization and system combination for improved keyword spotting. Damianos G. Karakos, Richard M. Schwartz, Stavros Tsakalidis, Le Zhang, Shivesh Ranjan, Tim Ng, Roger Hsiao, Guruprasad Saikumar, Ivan Bulyko, Long Nguyen, John Makhoul, Frantisek Grézl, Mirko Hannemann, Martin Karafiát, Igor Szöke, Karel Veselý, Lori Lamel, Viet Bac Le |
| 2013 | Search results based N-best hypothesis rescoring with maximum entropy classification. Fuchun Peng, Scott Roy, Ben Shahshahani, Françoise Beaufays |
| 2013 | Semantic entity detection from multiple ASR hypotheses within the WFST framework. Jan Svec, Pavel Ircing, Lubos Smídl |
| 2013 | Semi-supervised bootstrapping approach for neural network feature extractor training. Frantisek Grézl, Martin Karafiát |
| 2013 | Semi-supervised training of Deep Neural Networks. Karel Veselý, Mirko Hannemann, Lukás Burget |
| 2013 | Speaker adaptation of neural network acoustic models using i-vectors. George Saon, Hagen Soltau, David Nahamoo, Michael Picheny |
| 2013 | The IBM keyword search system for the DARPA RATS program. Lidia Mangu, Hagen Soltau, Hong-Kwang Kuo, George Saon |
| 2013 | The TAO of ATWV: Probing the mysteries of keyword search performance. Steven Wegmann, Arlo Faria, Adam Janin, Korbinian Riedhammer, Nelson Morgan |
| 2013 | The second 'CHiME' speech separation and recognition challenge: An overview of challenge systems and outcomes. Emmanuel Vincent, Jon Barker, Shinji Watanabe, Jonathan Le Roux, Francesco Nesta, Marco Matassoni |
| 2013 | Towards unsupervised semantic retrieval of spoken content with query expansion based on automatically discovered acoustic patterns. Yun-Chiao Li, Hung-yi Lee, Cheng-Tao Chung, Chun-an Chan, Lin-Shan Lee |
| 2013 | Unsupervised induction and filling of semantic slots for spoken dialogue systems using frame-semantic parsing. Yun-Nung Chen, William Yang Wang, Alexander I. Rudnicky |
| 2013 | Unsupervised word segmentation from noisy input. Jahn Heymann, Oliver Walter, Reinhold Haeb-Umbach, Bhiksha Raj |
| 2013 | Using proxies for OOV keywords in the keyword search task. Guoguo Chen, Oguz Yilmaz, Jan Trmal, Daniel Povey, Sanjeev Khudanpur |
| 2013 | Using web text to improve keyword spotting in speech. Ankur Gandhe, Long Qin, Florian Metze, Alexander I. Rudnicky, Ian R. Lane, Matthias Eck |
| 2013 | Vector Taylor series based HMM adaptation for generalized cepstrum in noisy environment. Soonho Baek, Hong-Goo Kang |