ASRU - RankMe – RankMe

81 papers

Year	Title / Authors
2013	2013 IEEE Workshop on Automatic Speech Recognition and Understanding, Olomouc, Czech Republic, December 8-12, 2013
2013	A generalized discriminative training framework for system combination. Yuuki Tachioka, Shinji Watanabe, Jonathan Le Roux, John R. Hershey
2013	A hierarchical system for word discovery exploiting DTW-based initialization. Oliver Walter, Timo Korthals, Reinhold Haeb-Umbach, Bhiksha Raj
2013	A propagation approach to modelling the joint distributions of clean and corrupted speech in the Mel-Cepstral domain. Ramón Fernandez Astudillo
2013	A study of supervised intrinsic spectral analysis for TIMIT phone classification. Reza Sahraeian, Dirk Van Compernolle
2013	ASR for electro-laryngeal speech. Anna Katharina Fuchs, Juan Andres Morales-Cordovilla, Martin Hagmüller
2013	Accelerating Hessian-free optimization for Deep Neural Networks by implicit preconditioning and sampling. Tara N. Sainath, Lior Horesh, Brian Kingsbury, Aleksandr Y. Aravkin, Bhuvana Ramabhadran
2013	Accelerating recurrent neural network training via two stage classes and parallelization. Zhiheng Huang, Geoffrey Zweig, Michael Levit, Benoît Dumoulin, Barlas Oguz, Shawn Chang
2013	Acoustic characteristics related to the perceptual pitch in whispered vowels. Hideaki Konno, Hideo Kanemitsu, Nobuyuki Takahashi, Mineichi Kudo
2013	Acoustic data-driven pronunciation lexicon for large vocabulary speech recognition. Liang Lu, Arnab Ghoshal, Steve Renals
2013	Acoustic modeling using transform-based phone-cluster adaptive training. Vimal Manohar, Srinivas C. Bhargav, Srinivasan Umesh
2013	Acoustic unit discovery and pronunciation generation from a grapheme-based lexicon. William Hartmann, Anindya Roy, Lori Lamel, Jean-Luc Gauvain
2013	An SVD-based scheme for MFCC compression in distributed speech recognition system. Azzedine Touazi, Mohamed Debyeche
2013	An empirical study of confusion modeling in keyword search for low resource languages. Murat Saraclar, Abhinav Sethy, Bhuvana Ramabhadran, Lidia Mangu, Jia Cui, Xiaodong Cui, Brian Kingsbury, Jonathan Mamou
2013	Automatic model complexity control for generalized variable parameter HMMs. Rongfeng Su, Xunying Liu, Lan Wang
2013	Automatic pronunciation clustering using a World English archive and pronunciation structure analysis. Han-Ping Shen, Nobuaki Minematsu, Takehiko Makino, Steven H. Weinberger, Teeraphon Pongkittiphan, Chung-Hsien Wu
2013	Automatic sentiment extraction from YouTube videos. Lakshmish Kaushik, Abhijeet Sangwan, John H. L. Hansen
2013	Barge-in effects in Bayesian dialogue act recognition and simulation. Heriberto Cuayáhuitl, Nina Dethlefs, Helen Wright Hastie, Oliver Lemon
2013	Combination of data borrowing strategies for low-resource LVCSR. Yanmin Qian, Kai Yu, Jia Liu
2013	Combining stochastic average gradient and Hessian-free optimization for sequence training of deep neural networks. Pierre L. Dognin, Vaibhava Goel
2013	Compact acoustic modeling based on acoustic manifold using a mixture of factor analyzers. Wen-Lin Zhang, Bi-Cheng Li, Wei-Qiang Zhang
2013	Context-dependent modelling of deep neural network using logistic regression. Guangsen Wang, Khe Chai Sim
2013	Convolutional neural network based triangular CRF for joint intent detection and slot filling. Puyang Xu, Ruhi Sarikaya
2013	Cross-lingual context sharing and parameter-tying for multi-lingual speech recognition. Aanchan Mohan, Richard C. Rose
2013	DNN acoustic modeling with modular multi-lingual feature extraction networks. Jonas Gehring, Quoc Bao Nguyen, Florian Metze, Alex Waibel
2013	Deep maxout networks for low-resource speech recognition. Yajie Miao, Florian Metze, Shourabh Rawat
2013	Deep maxout neural networks for speech recognition. Meng Cai, Yongzhe Shi, Jia Liu
2013	Dialogue management for leading the conversation in persuasive dialogue systems. Takuya Hiraoka, Yuki Yamauchi, Graham Neubig, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura
2013	Discriminative piecewise linear transformation based on deep learning for noise robust automatic speech recognition. Yosuke Kashiwagi, Daisuke Saito, Nobuaki Minematsu, Keikichi Hirose
2013	Discriminative semi-supervised training for keyword search in low resource languages. Roger Hsiao, Tim Ng, Frantisek Grézl, Damianos G. Karakos, Stavros Tsakalidis, Long Nguyen, Richard M. Schwartz
2013	Dysfluent speech detection by image forensics techniques. Juraj Pálfy, Sakhia Darjaa, Jiri Pospichal
2013	Effective pseudo-relevance feedback for language modeling in speech recognition. Berlin Chen, Yi-Wen Chen, Kuan-Yu Chen, Ea-Ee Jan
2013	Efficient nearly error-less LVCSR decoding based on incremental forward and backward passes. David Nolden, Ralf Schlüter, Hermann Ney
2013	Elastic spectral distortion for low resource speech recognition with deep neural networks. Naoyuki Kanda, Ryu Takeda, Yasunari Obuchi
2013	Emotion recognition from spontaneous speech using Hidden Markov models with deep belief networks. Duc Le, Emily Mower Provost
2013	Expert-based reward shaping and exploration scheme for boosting policy learning of dialogue management. Emmanuel Ferreira, Fabrice Lefèvre
2013	Fixed-dimensional acoustic embeddings of variable-length segments in low-resource settings. Keith D. Levin, Katharine Henry, Aren Jansen, Karen Livescu
2013	Hierarchical neural networks and enhanced class posteriors for social signal classification. Raymond Brueckner, Björn W. Schuller
2013	Hybrid acoustic models for distant and multichannel large vocabulary speech recognition. Pawel Swietojanski, Arnab Ghoshal, Steve Renals
2013	Hybrid speech recognition with Deep Bidirectional LSTM. Alex Graves, Navdeep Jaitly, Abdel-rahman Mohamed
2013	Impact of deep MLP architecture on different acoustic modeling techniques for under-resourced speech recognition. David Imseng, Petr Motlícek, Philip N. Garner, Hervé Bourlard
2013	Improved cepstral mean and variance normalization using Bayesian framework. N. Vishnu Prasad, Srinivasan Umesh
2013	Improved punctuation recovery through combination of multiple speech streams. João Miranda, João Paulo da Silva Neto, Alan W. Black
2013	Improvements to Deep Convolutional Neural Networks for LVCSR. Tara N. Sainath, Brian Kingsbury, Abdel-rahman Mohamed, George E. Dahl, George Saon, Hagen Soltau, Tomás Beran, Aleksandr Y. Aravkin, Bhuvana Ramabhadran
2013	Improving robustness of deep neural networks via spectral masking for automatic speech recognition. Bo Li, Khe Chai Sim
2013	Investigation of multilingual deep neural networks for spoken term detection. Kate M. Knill, Mark J. F. Gales, Shakti P. Rath, Philip C. Woodland, Chao Zhang, Shi-Xiong Zhang
2013	Joint training of interpolated exponential n-gram models. Abhinav Sethy, Stanley F. Chen, Ebru Arisoy, Bhuvana Ramabhadran, Kartik Audhkhasi, Shrikanth S. Narayanan, Paul Vozila
2013	K-component recurrent neural network language models using curriculum learning. Yangyang Shi, Martha A. Larson, Catholijn M. Jonker
2013	Language style and domain adaptation for cross-language SLU porting. Evgeny A. Stepanov, Ilya Kashkarev, Ali Orkan Bayer, Giuseppe Riccardi, Arindam Ghosh
2013	Large scale deep neural network acoustic modeling with semi-supervised training data for YouTube video transcription. Hank Liao, Erik McDermott, Andrew W. Senior
2013	Learning a subword vocabulary based on unigram likelihood. Matti Varjokallio, Mikko Kurimo, Sami Virpioja
2013	Learning better lexical properties for recurrent OOV words. Long Qin, Alexander I. Rudnicky
2013	Learning filter banks within a deep neural network framework. Tara N. Sainath, Brian Kingsbury, Abdel-rahman Mohamed, Bhuvana Ramabhadran
2013	Learning state labels for sparse classification of speech with matrix deconvolution. Antti Hurmalainen, Tuomas Virtanen
2013	Lightly supervised automatic subtitling of weather forecasts. Joris Driesen, Steve Renals
2013	Mixture of mixture n-gram language models. Hasim Sak, Cyril Allauzen, Kaisuke Nakajima, Françoise Beaufays
2013	Models of tone for tonal and non-tonal languages. Florian Metze, Zaid Sheikh, Alex Waibel, Jonas Gehring, Kevin Kilgour, Quoc Bao Nguyen, Van Huy Nguyen
2013	Modified splice and its extension to non-stereo data for noise robust speech recognition. D. S. Pavan Kumar, N. Vishnu Prasad, Vikas Joshi, Srinivasan Umesh
2013	Multi-stream temporally varying weight regression for cross-lingual speech recognition. Shilin Liu, Khe Chai Sim
2013	NMF-based keyword learning from scarce data. Bart Ons, Jort F. Gemmeke, Hugo Van hamme
2013	Neighbour selection and adaptation for rapid speaker-dependent ASR. Udhyakumar Nallasamy, Mark C. Fuhs, Monika Woszczyna, Florian Metze, Tanja Schultz
2013	On-line adaptation of semantic models for spoken language understanding. Ali Orkan Bayer, Giuseppe Riccardi
2013	Phonetic and anthropometric conditioning of MSA-KST cognitive impairment characterization system. Alexei V. Ivanov, Shahab Jalalvand, Roberto Gretter, Daniele Falavigna
2013	Porting concepts from DNNs back to GMMs. Kris Demuynck, Fabian Triefenbach
2013	Probabilistic lexical modeling and unsupervised training for zero-resourced ASR. Ramya Rasipuram, Marzieh Razavi, Mathew Magimai-Doss
2013	Query understanding enhanced by hierarchical parsing structures. Jingjing Liu, Panupong Pasupat, Yining Wang, Scott Cyphers, James R. Glass
2013	Score normalization and system combination for improved keyword spotting. Damianos G. Karakos, Richard M. Schwartz, Stavros Tsakalidis, Le Zhang, Shivesh Ranjan, Tim Ng, Roger Hsiao, Guruprasad Saikumar, Ivan Bulyko, Long Nguyen, John Makhoul, Frantisek Grézl, Mirko Hannemann, Martin Karafiát, Igor Szöke, Karel Veselý, Lori Lamel, Viet Bac Le
2013	Search results based N-best hypothesis rescoring with maximum entropy classification. Fuchun Peng, Scott Roy, Ben Shahshahani, Françoise Beaufays
2013	Semantic entity detection from multiple ASR hypotheses within the WFST framework. Jan Svec, Pavel Ircing, Lubos Smídl
2013	Semi-supervised bootstrapping approach for neural network feature extractor training. Frantisek Grézl, Martin Karafiát
2013	Semi-supervised training of Deep Neural Networks. Karel Veselý, Mirko Hannemann, Lukás Burget
2013	Speaker adaptation of neural network acoustic models using i-vectors. George Saon, Hagen Soltau, David Nahamoo, Michael Picheny
2013	The IBM keyword search system for the DARPA RATS program. Lidia Mangu, Hagen Soltau, Hong-Kwang Kuo, George Saon
2013	The TAO of ATWV: Probing the mysteries of keyword search performance. Steven Wegmann, Arlo Faria, Adam Janin, Korbinian Riedhammer, Nelson Morgan
2013	The second 'CHiME' speech separation and recognition challenge: An overview of challenge systems and outcomes. Emmanuel Vincent, Jon Barker, Shinji Watanabe, Jonathan Le Roux, Francesco Nesta, Marco Matassoni
2013	Towards unsupervised semantic retrieval of spoken content with query expansion based on automatically discovered acoustic patterns. Yun-Chiao Li, Hung-yi Lee, Cheng-Tao Chung, Chun-an Chan, Lin-Shan Lee
2013	Unsupervised induction and filling of semantic slots for spoken dialogue systems using frame-semantic parsing. Yun-Nung Chen, William Yang Wang, Alexander I. Rudnicky
2013	Unsupervised word segmentation from noisy input. Jahn Heymann, Oliver Walter, Reinhold Haeb-Umbach, Bhiksha Raj
2013	Using proxies for OOV keywords in the keyword search task. Guoguo Chen, Oguz Yilmaz, Jan Trmal, Daniel Povey, Sanjeev Khudanpur
2013	Using web text to improve keyword spotting in speech. Ankur Gandhe, Long Qin, Florian Metze, Alexander I. Rudnicky, Ian R. Lane, Matthias Eck
2013	Vector Taylor series based HMM adaptation for generalized cepstrum in noisy environment. Soonho Baek, Hong-Goo Kang