ASRU C

81 papers

YearTitle / Authors
20132013 IEEE Workshop on Automatic Speech Recognition and Understanding, Olomouc, Czech Republic, December 8-12, 2013
2013A generalized discriminative training framework for system combination.
Yuuki Tachioka, Shinji Watanabe, Jonathan Le Roux, John R. Hershey
2013A hierarchical system for word discovery exploiting DTW-based initialization.
Oliver Walter, Timo Korthals, Reinhold Haeb-Umbach, Bhiksha Raj
2013A propagation approach to modelling the joint distributions of clean and corrupted speech in the Mel-Cepstral domain.
Ramón Fernandez Astudillo
2013A study of supervised intrinsic spectral analysis for TIMIT phone classification.
Reza Sahraeian, Dirk Van Compernolle
2013ASR for electro-laryngeal speech.
Anna Katharina Fuchs, Juan Andres Morales-Cordovilla, Martin Hagmüller
2013Accelerating Hessian-free optimization for Deep Neural Networks by implicit preconditioning and sampling.
Tara N. Sainath, Lior Horesh, Brian Kingsbury, Aleksandr Y. Aravkin, Bhuvana Ramabhadran
2013Accelerating recurrent neural network training via two stage classes and parallelization.
Zhiheng Huang, Geoffrey Zweig, Michael Levit, Benoît Dumoulin, Barlas Oguz, Shawn Chang
2013Acoustic characteristics related to the perceptual pitch in whispered vowels.
Hideaki Konno, Hideo Kanemitsu, Nobuyuki Takahashi, Mineichi Kudo
2013Acoustic data-driven pronunciation lexicon for large vocabulary speech recognition.
Liang Lu, Arnab Ghoshal, Steve Renals
2013Acoustic modeling using transform-based phone-cluster adaptive training.
Vimal Manohar, Srinivas C. Bhargav, Srinivasan Umesh
2013Acoustic unit discovery and pronunciation generation from a grapheme-based lexicon.
William Hartmann, Anindya Roy, Lori Lamel, Jean-Luc Gauvain
2013An SVD-based scheme for MFCC compression in distributed speech recognition system.
Azzedine Touazi, Mohamed Debyeche
2013An empirical study of confusion modeling in keyword search for low resource languages.
Murat Saraclar, Abhinav Sethy, Bhuvana Ramabhadran, Lidia Mangu, Jia Cui, Xiaodong Cui, Brian Kingsbury, Jonathan Mamou
2013Automatic model complexity control for generalized variable parameter HMMs.
Rongfeng Su, Xunying Liu, Lan Wang
2013Automatic pronunciation clustering using a World English archive and pronunciation structure analysis.
Han-Ping Shen, Nobuaki Minematsu, Takehiko Makino, Steven H. Weinberger, Teeraphon Pongkittiphan, Chung-Hsien Wu
2013Automatic sentiment extraction from YouTube videos.
Lakshmish Kaushik, Abhijeet Sangwan, John H. L. Hansen
2013Barge-in effects in Bayesian dialogue act recognition and simulation.
Heriberto Cuayáhuitl, Nina Dethlefs, Helen Wright Hastie, Oliver Lemon
2013Combination of data borrowing strategies for low-resource LVCSR.
Yanmin Qian, Kai Yu, Jia Liu
2013Combining stochastic average gradient and Hessian-free optimization for sequence training of deep neural networks.
Pierre L. Dognin, Vaibhava Goel
2013Compact acoustic modeling based on acoustic manifold using a mixture of factor analyzers.
Wen-Lin Zhang, Bi-Cheng Li, Wei-Qiang Zhang
2013Context-dependent modelling of deep neural network using logistic regression.
Guangsen Wang, Khe Chai Sim
2013Convolutional neural network based triangular CRF for joint intent detection and slot filling.
Puyang Xu, Ruhi Sarikaya
2013Cross-lingual context sharing and parameter-tying for multi-lingual speech recognition.
Aanchan Mohan, Richard C. Rose
2013DNN acoustic modeling with modular multi-lingual feature extraction networks.
Jonas Gehring, Quoc Bao Nguyen, Florian Metze, Alex Waibel
2013Deep maxout networks for low-resource speech recognition.
Yajie Miao, Florian Metze, Shourabh Rawat
2013Deep maxout neural networks for speech recognition.
Meng Cai, Yongzhe Shi, Jia Liu
2013Dialogue management for leading the conversation in persuasive dialogue systems.
Takuya Hiraoka, Yuki Yamauchi, Graham Neubig, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura
2013Discriminative piecewise linear transformation based on deep learning for noise robust automatic speech recognition.
Yosuke Kashiwagi, Daisuke Saito, Nobuaki Minematsu, Keikichi Hirose
2013Discriminative semi-supervised training for keyword search in low resource languages.
Roger Hsiao, Tim Ng, Frantisek Grézl, Damianos G. Karakos, Stavros Tsakalidis, Long Nguyen, Richard M. Schwartz
2013Dysfluent speech detection by image forensics techniques.
Juraj Pálfy, Sakhia Darjaa, Jiri Pospichal
2013Effective pseudo-relevance feedback for language modeling in speech recognition.
Berlin Chen, Yi-Wen Chen, Kuan-Yu Chen, Ea-Ee Jan
2013Efficient nearly error-less LVCSR decoding based on incremental forward and backward passes.
David Nolden, Ralf Schlüter, Hermann Ney
2013Elastic spectral distortion for low resource speech recognition with deep neural networks.
Naoyuki Kanda, Ryu Takeda, Yasunari Obuchi
2013Emotion recognition from spontaneous speech using Hidden Markov models with deep belief networks.
Duc Le, Emily Mower Provost
2013Expert-based reward shaping and exploration scheme for boosting policy learning of dialogue management.
Emmanuel Ferreira, Fabrice Lefèvre
2013Fixed-dimensional acoustic embeddings of variable-length segments in low-resource settings.
Keith D. Levin, Katharine Henry, Aren Jansen, Karen Livescu
2013Hierarchical neural networks and enhanced class posteriors for social signal classification.
Raymond Brueckner, Björn W. Schuller
2013Hybrid acoustic models for distant and multichannel large vocabulary speech recognition.
Pawel Swietojanski, Arnab Ghoshal, Steve Renals
2013Hybrid speech recognition with Deep Bidirectional LSTM.
Alex Graves, Navdeep Jaitly, Abdel-rahman Mohamed
2013Impact of deep MLP architecture on different acoustic modeling techniques for under-resourced speech recognition.
David Imseng, Petr Motlícek, Philip N. Garner, Hervé Bourlard
2013Improved cepstral mean and variance normalization using Bayesian framework.
N. Vishnu Prasad, Srinivasan Umesh
2013Improved punctuation recovery through combination of multiple speech streams.
João Miranda, João Paulo da Silva Neto, Alan W. Black
2013Improvements to Deep Convolutional Neural Networks for LVCSR.
Tara N. Sainath, Brian Kingsbury, Abdel-rahman Mohamed, George E. Dahl, George Saon, Hagen Soltau, Tomás Beran, Aleksandr Y. Aravkin, Bhuvana Ramabhadran
2013Improving robustness of deep neural networks via spectral masking for automatic speech recognition.
Bo Li, Khe Chai Sim
2013Investigation of multilingual deep neural networks for spoken term detection.
Kate M. Knill, Mark J. F. Gales, Shakti P. Rath, Philip C. Woodland, Chao Zhang, Shi-Xiong Zhang
2013Joint training of interpolated exponential n-gram models.
Abhinav Sethy, Stanley F. Chen, Ebru Arisoy, Bhuvana Ramabhadran, Kartik Audhkhasi, Shrikanth S. Narayanan, Paul Vozila
2013K-component recurrent neural network language models using curriculum learning.
Yangyang Shi, Martha A. Larson, Catholijn M. Jonker
2013Language style and domain adaptation for cross-language SLU porting.
Evgeny A. Stepanov, Ilya Kashkarev, Ali Orkan Bayer, Giuseppe Riccardi, Arindam Ghosh
2013Large scale deep neural network acoustic modeling with semi-supervised training data for YouTube video transcription.
Hank Liao, Erik McDermott, Andrew W. Senior
2013Learning a subword vocabulary based on unigram likelihood.
Matti Varjokallio, Mikko Kurimo, Sami Virpioja
2013Learning better lexical properties for recurrent OOV words.
Long Qin, Alexander I. Rudnicky
2013Learning filter banks within a deep neural network framework.
Tara N. Sainath, Brian Kingsbury, Abdel-rahman Mohamed, Bhuvana Ramabhadran
2013Learning state labels for sparse classification of speech with matrix deconvolution.
Antti Hurmalainen, Tuomas Virtanen
2013Lightly supervised automatic subtitling of weather forecasts.
Joris Driesen, Steve Renals
2013Mixture of mixture n-gram language models.
Hasim Sak, Cyril Allauzen, Kaisuke Nakajima, Françoise Beaufays
2013Models of tone for tonal and non-tonal languages.
Florian Metze, Zaid Sheikh, Alex Waibel, Jonas Gehring, Kevin Kilgour, Quoc Bao Nguyen, Van Huy Nguyen
2013Modified splice and its extension to non-stereo data for noise robust speech recognition.
D. S. Pavan Kumar, N. Vishnu Prasad, Vikas Joshi, Srinivasan Umesh
2013Multi-stream temporally varying weight regression for cross-lingual speech recognition.
Shilin Liu, Khe Chai Sim
2013NMF-based keyword learning from scarce data.
Bart Ons, Jort F. Gemmeke, Hugo Van hamme
2013Neighbour selection and adaptation for rapid speaker-dependent ASR.
Udhyakumar Nallasamy, Mark C. Fuhs, Monika Woszczyna, Florian Metze, Tanja Schultz
2013On-line adaptation of semantic models for spoken language understanding.
Ali Orkan Bayer, Giuseppe Riccardi
2013Phonetic and anthropometric conditioning of MSA-KST cognitive impairment characterization system.
Alexei V. Ivanov, Shahab Jalalvand, Roberto Gretter, Daniele Falavigna
2013Porting concepts from DNNs back to GMMs.
Kris Demuynck, Fabian Triefenbach
2013Probabilistic lexical modeling and unsupervised training for zero-resourced ASR.
Ramya Rasipuram, Marzieh Razavi, Mathew Magimai-Doss
2013Query understanding enhanced by hierarchical parsing structures.
Jingjing Liu, Panupong Pasupat, Yining Wang, Scott Cyphers, James R. Glass
2013Score normalization and system combination for improved keyword spotting.
Damianos G. Karakos, Richard M. Schwartz, Stavros Tsakalidis, Le Zhang, Shivesh Ranjan, Tim Ng, Roger Hsiao, Guruprasad Saikumar, Ivan Bulyko, Long Nguyen, John Makhoul, Frantisek Grézl, Mirko Hannemann, Martin Karafiát, Igor Szöke, Karel Veselý, Lori Lamel, Viet Bac Le
2013Search results based N-best hypothesis rescoring with maximum entropy classification.
Fuchun Peng, Scott Roy, Ben Shahshahani, Françoise Beaufays
2013Semantic entity detection from multiple ASR hypotheses within the WFST framework.
Jan Svec, Pavel Ircing, Lubos Smídl
2013Semi-supervised bootstrapping approach for neural network feature extractor training.
Frantisek Grézl, Martin Karafiát
2013Semi-supervised training of Deep Neural Networks.
Karel Veselý, Mirko Hannemann, Lukás Burget
2013Speaker adaptation of neural network acoustic models using i-vectors.
George Saon, Hagen Soltau, David Nahamoo, Michael Picheny
2013The IBM keyword search system for the DARPA RATS program.
Lidia Mangu, Hagen Soltau, Hong-Kwang Kuo, George Saon
2013The TAO of ATWV: Probing the mysteries of keyword search performance.
Steven Wegmann, Arlo Faria, Adam Janin, Korbinian Riedhammer, Nelson Morgan
2013The second 'CHiME' speech separation and recognition challenge: An overview of challenge systems and outcomes.
Emmanuel Vincent, Jon Barker, Shinji Watanabe, Jonathan Le Roux, Francesco Nesta, Marco Matassoni
2013Towards unsupervised semantic retrieval of spoken content with query expansion based on automatically discovered acoustic patterns.
Yun-Chiao Li, Hung-yi Lee, Cheng-Tao Chung, Chun-an Chan, Lin-Shan Lee
2013Unsupervised induction and filling of semantic slots for spoken dialogue systems using frame-semantic parsing.
Yun-Nung Chen, William Yang Wang, Alexander I. Rudnicky
2013Unsupervised word segmentation from noisy input.
Jahn Heymann, Oliver Walter, Reinhold Haeb-Umbach, Bhiksha Raj
2013Using proxies for OOV keywords in the keyword search task.
Guoguo Chen, Oguz Yilmaz, Jan Trmal, Daniel Povey, Sanjeev Khudanpur
2013Using web text to improve keyword spotting in speech.
Ankur Gandhe, Long Qin, Florian Metze, Alexander I. Rudnicky, Ian R. Lane, Matthias Eck
2013Vector Taylor series based HMM adaptation for generalized cepstrum in noisy environment.
Soonho Baek, Hong-Goo Kang