| 2015 | 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2015, Scottsdale, AZ, USA, December 13-17, 2015 |
| 2015 | A CHiME-3 challenge system: Long-term acoustic features for noise robust automatic speech recognition. Niko Moritz, Stephan Gerlach, Kamil Adiloglu, Jörn Anemüller, Birger Kollmeier, Stefan Goetze |
| 2015 | A comparative study of neural network models for lexical intent classification. Suman V. Ravuri, Andreas Stolcke |
| 2015 | A study of social-affective communication: Automatic prediction of emotion triggers and responses in television talk shows. Nurul Lubis, Sakriani Sakti, Graham Neubig, Koichiro Yoshino, Tomoki Toda, Satoshi Nakamura |
| 2015 | A system for automatic alignment of broadcast media captions using weighted finite-state transducers. Peter Bell, Steve Renals |
| 2015 | A universal model for flexible item selection in conversational dialogs. Asli Celikyilmaz, Zhaleh Feizollahi, Dilek Hakkani-Tür, Ruhi Sarikaya |
| 2015 | Acoustic model training based on node-wise weight boundary model increasing speed of discrete neural networks. Ryu Takeda, Kazunori Komatani, Kazuhiro Nakadai |
| 2015 | Acoustic modeling with neural graph embeddings. Yuzong Liu, Katrin Kirchhoff |
| 2015 | Acoustic modelling with CD-CTC-SMBR LSTM RNNS. Andrew W. Senior, Hasim Sak, Felix de Chaumont Quitry, Tara N. Sainath, Kanishka Rao |
| 2015 | Adaptive beamforming and adaptive training of DNN acoustic models for enhanced multichannel noisy speech recognition. Alexey Prudnikov, Maxim Korenevsky, Sergei Aleinik |
| 2015 | Adaptive selection from multiple response candidates in example-based dialogue. Masahiro Mizukami, Hideaki Kizuki, Toshio Nomura, Graham Neubig, Koichiro Yoshino, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura |
| 2015 | An i-Vector PLDA based gender identification approach for severely distorted and multilingual DARPA RATS data. Shivesh Ranjan, Gang Liu, John H. L. Hansen |
| 2015 | An information fusion approach to recognizing microphone array speech in the CHiME-3 challenge based on a deep learning framework. Jun Du, Qing Wang, Yanhui Tu, Xiao Bao, Li-Rong Dai, Chin-Hui Lee |
| 2015 | An iterative deep learning framework for unsupervised discovery of speech features and linguistic units with applications on spoken term detection. Cheng-Tao Chung, Cheng-Yu Tsai, Hsiang-Hung Lu, Chia-Hsiang Liu, Hung-yi Lee, Lin-Shan Lee |
| 2015 | Analysis of factors affecting system performance in the ASpIRE challenge. Jennifer Melot, Nicolas Malyska, Jessica Ray, Wade Shen |
| 2015 | Applying deep learning to answer selection: A study and an open task. Minwei Feng, Bing Xiang, Michael R. Glass, Lidan Wang, Bowen Zhou |
| 2015 | Automatic prosody prediction for Chinese speech synthesis using BLSTM-RNN and embedding features. Chuang Ding, Lei Xie, Jie Yan, Weini Zhang, Yang Liu |
| 2015 | Automation of system building for state-of-the-art large vocabulary speech recognition using evolution strategy. Takafumi Moriya, Tomohiro Tanaka, Takahiro Shinozaki, Shinji Watanabe, Kevin Duh |
| 2015 | BLSTM supported GEV beamformer front-end for the 3RD CHiME challenge. Jahn Heymann, Lukas Drude, Aleksej Chinaev, Reinhold Haeb-Umbach |
| 2015 | Boosted acoustic model learning and hypotheses rescoring on the CHiME-3 task. Shahab Jalalvand, Daniele Falavigna, Marco Matassoni, Piergiorgio Svaizer, Maurizio Omologo |
| 2015 | CRIM and LIUM approaches for multi-genre broadcast media transcription. Vishwa Gupta, Paul Deléglise, Gilles Boulianne, Yannick Estève, Sylvain Meignier, Anthony Rousseau |
| 2015 | Cambridge university transcription systems for the multi-genre broadcast challenge. Philip C. Woodland, Xunying Liu, Yanmin Qian, Chao Zhang, Mark J. F. Gales, Penny Karanasou, Pierre Lanchantin, Linlin Wang |
| 2015 | Combination of syllable based N-gram search and word search for spoken term detection through spoken queries and IV/OOV classification. Nagisa Sakamoto, Kazumasa Yamamoto, Seiichi Nakagawa |
| 2015 | Combining spectral feature mapping and multi-channel model-based source separation for noise-robust automatic speech recognition. Deblin Bagchi, Michael I. Mandel, Zhongqiu Wang, Yanzhang He, Andrew R. Plummer, Eric Fosler-Lussier |
| 2015 | Deep bi-directional recurrent networks over spectral windows. Abdel-rahman Mohamed, Frank Seide, Dong Yu, Jasha Droppo, Andreas Stolcke, Geoffrey Zweig, Gerald Penn |
| 2015 | Deep bottleneck features for i-vector based text-independent speaker verification. Sina Hamidi Ghalehjegh, Richard C. Rose |
| 2015 | Deep multimodal semantic embeddings for speech and images. David F. Harwath, James R. Glass |
| 2015 | Detecting actionable items in meetings by convolutional deep structured semantic models. Yun-Nung Chen, Dilek Hakkani-Tür, Xiaodong He |
| 2015 | Different word representations and their combination for proper name retrieval from diachronic documents. Irina Illina, Dominique Fohr |
| 2015 | Discriminative segmental cascades for feature-rich phone recognition. Hao Tang, Weiran Wang, Kevin Gimpel, Karen Livescu |
| 2015 | Discriminative training of context-dependent language model scaling factors and interpolation weights. Shuangyu Chang, Abhik Lahiri, Issac Alphonso, Barlas Oguz, Michael Levit, Benoît Dumoulin |
| 2015 | EESEN: End-to-end speech recognition using deep RNN models and WFST-based decoding. Yajie Miao, Mohammad Gowayyed, Florian Metze |
| 2015 | Exploiting synchrony spectra and deep neural networks for noise-robust automatic speech recognition. Ning Ma, Ricard Marxer, Jon Barker, Guy J. Brown |
| 2015 | High-performance Swahili keyword search with very limited language pack: The THUEE system for the OpenKWS15 evaluation. Meng Cai, Zhiqiang Lv, Cheng Lu, Jian Kang, Like Hui, Zhuo Zhang, Jia Liu |
| 2015 | Hilbert spectral analysis of vowels using intrinsic mode functions. Steven Sandoval, Phillip L. De Leon, Julie M. Liss |
| 2015 | Hybrid DNN-Latent structured SVM acoustic models for continuous speech recognition. Suman V. Ravuri |
| 2015 | Implementation of generic positive-negative tracker in extensible dialog system. Sangjun Koo, Seonghan Ryu, Gary Geunbae Lee |
| 2015 | Improved system fusion for keyword search. Zhiqiang Lv, Meng Cai, Cheng Lu, Jian Kang, Like Hui, Wei-Qiang Zhang, Jia Liu |
| 2015 | Improving data selection for low-resource STT and KWS. Thiago Fraga-Silva, Antoine Laurent, Jean-Luc Gauvain, Lori Lamel, Viet Bac Le, Abdelkhalek Messaoudi |
| 2015 | Improving robustness against reverberation for automatic speech recognition. Vikramjit Mitra, Julien van Hout, Wen Wang, Martin Graciarena, Mitchell McLaren, Horacio Franco, Dimitra Vergyri |
| 2015 | Improving the interpretability of deep neural networks with stimulated learning. Shawn Tan, Khe Chai Sim, Mark J. F. Gales |
| 2015 | Incorporating paragraph embeddings and density peaks clustering for spoken document summarization. Kuan-Yu Chen, Kai-Wun Shih, Shih-Hung Liu, Berlin Chen, Hsin-Min Wang |
| 2015 | Incorporating user feedback to re-rank keyword search results. Scott Novotney, Kevin Jett, Owen Kimball |
| 2015 | Incremental LSTM-based dialog state tracker. Lukás Zilka, Filip Jurcícek |
| 2015 | Incremental sentence compression using LSTM recurrent networks. Sakriani Sakti, Faiz Ilham, Graham Neubig, Tomoki Toda, Ayu Purwarianti, Satoshi Nakamura |
| 2015 | Investigating sparse deep neural networks for speech recognition. Gueorgui Pironkov, Stéphane Dupont, Thierry Dutoit |
| 2015 | Investigation of back-off based interpolation between recurrent neural network and n-gram language models. Xie Chen, Xunying Liu, Mark J. F. Gales, Philip C. Woodland |
| 2015 | JHU ASpIRE system: Robust LVCSR with TDNNS, iVector adaptation and RNN-LMS. Vijayaditya Peddinti, Guoguo Chen, Vimal Manohar, Tom Ko, Daniel Povey, Sanjeev Khudanpur |
| 2015 | LSTM time and frequency recurrence for automatic speech recognition. Jinyu Li, Abdelrahman Mohamed, Geoffrey Zweig, Yifan Gong |
| 2015 | Latent Dirichlet Allocation based organisation of broadcast media archives for deep neural network adaptation. Mortaza Doulaty, Oscar Saz, Raymond W. M. Ng, Thomas Hain |
| 2015 | Learning continuous representation of text for phone duration modeling in statistical parametric speech synthesis. Sai Krishna Rallabandi, Sai Sirisha Rallabandi, Padmini Bandi, Suryakanth V. Gangashetty |
| 2015 | Learning factorized feature transforms for speaker normalization. Lahiru Samarakoon, Khe Chai Sim |
| 2015 | Multi-channel speech processing architectures for noise robust speech recognition: 3rd CHiME challenge results. Lukas Pfeifenberger, Tobias Schrank, Matthias Zöhrer, Martin Hagmüller, Franz Pernkopf |
| 2015 | Multi-domain dialogue success classifiers for policy training. David Vandyke, Pei-Hao Su, Milica Gasic, Nikola Mrksic, Tsung-Hsien Wen, Steve J. Young |
| 2015 | Multi-reference WER for evaluating ASR for languages with no orthographic rules. Ahmed M. Ali, Walid Magdy, Peter Bell, Steve Renals |
| 2015 | Multi-task joint-learning of deep neural networks for robust speech recognition. Yanmin Qian, Maofan Yin, Yongbin You, Kai Yu |
| 2015 | Multilingual representations for low resource speech recognition and keyword search. Jia Cui, Brian Kingsbury, Bhuvana Ramabhadran, Abhinav Sethy, Kartik Audhkhasi, Xiaodong Cui, Ellen Kislal, Lidia Mangu, Markus Nußbaum-Thom, Michael Picheny, Zoltán Tüske, Pavel Golik, Ralf Schlüter, Hermann Ney, Mark J. F. Gales, Kate M. Knill, Anton Ragni, Haipeng Wang, Philip C. Woodland |
| 2015 | Multimodal embedding fusion for robust speaker role recognition in video broadcast. Mickael Rouvier, Sebastien Delecraz, Benoît Favre, Meriem Bendris, Frédéric Béchet |
| 2015 | Multitask learning and system combination for automatic speech recognition. Olivier Siohan, David Rybach |
| 2015 | Name-aware language model adaptation and sparse features for statistical machine translation. Wen Wang, Haibo Li, Heng Ji |
| 2015 | Natural language understanding for partial queries. Xiaohu Liu, Asli Celikyilmaz, Ruhi Sarikaya |
| 2015 | Naturalness and rapport in a pitch adaptive learning companion. Nichola Lubold, Heather Pon-Barry, Erin Walker |
| 2015 | On constructing and analysing an interpretable brain model for the DNN based on hidden activity patterns. Khe Chai Sim |
| 2015 | Open-domain personalized dialog system using user-interested topics in system responses. Jeesoo Bang, Sangdo Han, Kyusong Lee, Gary Geunbae Lee |
| 2015 | Optimizing human-interpretable dialog management policy using genetic algorithm. Hang Ren, Weiqun Xu, Yonghong Yan |
| 2015 | Personalizing universal recurrent neural network language model with user characteristic features by social network crowdsourcing. Bo-Hsiang Tseng, Hung-yi Lee, Lin-Shan Lee |
| 2015 | Phonetic unit selection for cross-lingual query-by-example spoken term detection. Paula Lopez-Otero, Laura Docío Fernández, Carmen García-Mateo |
| 2015 | Phonetically-oriented word error alignment for speech recognition error analysis in speech translation. Nicholas Ruiz, Marcello Federico |
| 2015 | Policy committee for adaptation in multi-domain spoken dialogue systems. Milica Gasic, Nikola Mrksic, Pei-Hao Su, David Vandyke, Tsung-Hsien Wen, Steve J. Young |
| 2015 | RNNDROP: A novel dropout for RNNS in ASR. Taesup Moon, Heeyoul Choi, Hoshik Lee, Inchul Song |
| 2015 | Recent improvements to NeuroCRFs for named entity recognition. Marc-Antoine Rondeau, Yi Su |
| 2015 | Robust ASR using neural network based speech enhancement and feature simulation. Sunit Sivasankaran, Aditya Arie Nugraha, Emmanuel Vincent, Juan Andres Morales-Cordovilla, Siddharth Dalmia, Irina Illina, Antoine Liutkus |
| 2015 | Robust speech recognition in unknown reverberant and noisy conditions. Roger Hsiao, Jeff Z. Ma, William Hartmann, Martin Karafiát, Frantisek Grézl, Lukás Burget, Igor Szöke, Jan Cernocký, Shinji Watanabe, Zhuo Chen, Sri Harish Reddy Mallidi, Hynek Hermansky, Stavros Tsakalidis, Richard M. Schwartz |
| 2015 | Robust speech recognition using beamforming with adaptive microphone gains and multichannel noise reduction. Shengkui Zhao, Xiong Xiao, Zhaofeng Zhang, Thi Ngoc Tho Nguyen, Xionghu Zhong, Bo Ren, Longbiao Wang, Douglas L. Jones, Engsiong Chng, Haizhou Li |
| 2015 | Semi-supervised slot tagging in spoken language understanding using recurrent transductive support vector machines. Yangyang Shi, Kaisheng Yao, Hu Chen, Yi-Cheng Pan, Mei-Yuh Hwang |
| 2015 | Single and multi-channel approaches for distant speech recognition under noisy reverberant conditions: I2R'S system description for the ASpIRE challenge. Jonathan William Dennis, Tran Huy Dat |
| 2015 | Sparse non-negative matrix language modeling for geo-annotated query session data. Ciprian Chelba, Noam Shazeer |
| 2015 | Speaker adaptive joint training of Gaussian mixture models and bottleneck features. Zoltán Tüske, Pavel Golik, Ralf Schlüter, Hermann Ney |
| 2015 | Speaker diarisation and longitudinal linking in multi-genre broadcast data. Penny Karanasou, Mark J. F. Gales, Pierre Lanchantin, Xunying Liu, Yanmin Qian, Linlin Wang, Philip C. Woodland, Chao Zhang |
| 2015 | Speaker intonation adaptation for transforming text-to-speech synthesis speaker identity. Mahsa Sadat Elyasi Langarani, Jan P. H. van Santen |
| 2015 | Speaker location and microphone spacing invariant acoustic modeling from raw multichannel waveforms. Tara N. Sainath, Ron J. Weiss, Kevin W. Wilson, Arun Narayanan, Michiel Bacchiani, Andrew W. Senior |
| 2015 | Spectral learning with non negative probabilities for finite state automaton. Hadrien Glaude, Cyrille Enderli, Olivier Pietquin |
| 2015 | Speech enhancement using beamforming and non negative matrix factorization for robust speech recognition in the CHiME-3 challenge. Thanh T. Vu, Benjamin Bigot, Engsiong Chng |
| 2015 | Spoken language translation graphs re-decoding using automatic quality assessment. Laurent Besacier, Benjamin Lecouteux, Ngoc-Quang Luong, Ngoc-Tien Le |
| 2015 | Stochastic Gradient Variational Bayes for deep learning-based ASR. Andros Tjandra, Sakriani Sakti, Satoshi Nakamura, Mirna Adriani |
| 2015 | Structured discriminative models using deep neural-network features. Rogier C. van Dalen, Jingzhou Yang, Haipeng Wang, Anton Ragni, Chao Zhang, Mark J. F. Gales |
| 2015 | The 2015 sheffield system for longitudinal diarisation of broadcast media. Rosanna Milner, Oscar Saz, Salil Deena, Mortaza Doulaty, Raymond W. M. Ng, Thomas Hain |
| 2015 | The 2015 sheffield system for transcription of Multi-Genre Broadcast media. Oscar Saz, Mortaza Doulaty, Salil Deena, Rosanna Milner, Raymond W. M. Ng, Madina Hasan, Yulan Liu, Thomas Hain |
| 2015 | The Automatic Speech recogition In Reverberant Environments (ASpIRE) challenge. Mary Harper |
| 2015 | The DIRHA-ENGLISH corpus and related tasks for distant-speech recognition in domestic environments. Mirco Ravanelli, Luca Cristoforetti, Roberto Gretter, Marco Pellin, Alessandro Sosi, Maurizio Omologo |
| 2015 | The MERL/SRI system for the 3RD CHiME challenge using beamforming, robust feature extraction, and advanced speech recognition. Takaaki Hori, Zhuo Chen, Hakan Erdogan, John R. Hershey, Jonathan Le Roux, Vikramjit Mitra, Shinji Watanabe |
| 2015 | The MGB challenge: Evaluating multi-genre broadcast media recognition. Peter Bell, Mark J. F. Gales, Thomas Hain, Jonathan Kilgour, Pierre Lanchantin, Xunying Liu, Andrew McParland, Steve Renals, Oscar Saz, Mirjam Wester, Philip C. Woodland |
| 2015 | The NAIST ASR system for the 2015 Multi-Genre Broadcast challenge: On combination of deep learning systems using a rank-score function. Quoc Truong Do, Michael Heck, Sakriani Sakti, Graham Neubig, Tomoki Toda, Satoshi Nakamura |
| 2015 | The NTT CHiME-3 system: Advances in speech enhancement and recognition for mobile multi-microphone devices. Takuya Yoshioka, Nobutaka Ito, Marc Delcroix, Atsunori Ogawa, Keisuke Kinoshita, Masakiyo Fujimoto, Chengzhu Yu, Wojciech J. Fabian, Miquel Espi, Takuya Higuchi, Shoko Araki, Tomohiro Nakatani |
| 2015 | The development of the cambridge university alignment systems for the multi-genre broadcast challenge. Pierre Lanchantin, Mark J. F. Gales, Penny Karanasou, Xunying Liu, Yanmin Qian, Linlin Wang, Philip C. Woodland, Chao Zhang |
| 2015 | The third 'CHiME' speech separation and recognition challenge: Dataset, task and baselines. Jon Barker, Ricard Marxer, Emmanuel Vincent, Shinji Watanabe |
| 2015 | Time delay deep neural network-based universal background models for speaker recognition. David Snyder, Daniel Garcia-Romero, Daniel Povey |
| 2015 | Time-frequency convolutional networks for robust speech recognition. Vikramjit Mitra, Horacio Franco |
| 2015 | Topic-space based setup of a neural network for theme identification of highly imperfect transcriptions. Mohamed Morchid, Richard Dufour, Georges Linarès |
| 2015 | Towards structured deep neural network for automatic speech recognition. Yi-Hsiu Liao, Hung-yi Lee, Lin-Shan Lee |
| 2015 | Towards utterance-based neural network adaptation in acoustic modeling. Ivan Himawan, Petr Motlícek, Marc Ferras Font, Srikanth R. Madikeri |
| 2015 | Training data pseudo-shuffling and direct decoding framework for recurrent neural network based acoustic modeling. Naoyuki Kanda, Mitsuyoshi Tachimori, Xugang Lu, Hisashi Kawai |
| 2015 | Two-stage ASGD framework for parallel training of DNN acoustic models using Ethernet. Zhichao Wang, Xingyu Na, Xin Li, Jielin Pan, Yonghong Yan |
| 2015 | Uncertainty estimation of DNN classifiers. Sri Harish Reddy Mallidi, Tetsuji Ogawa, Hynek Hermansky |
| 2015 | Unified ASR system using LGM-based source separation, noise-robust feature extraction, and word hypothesis selection. Yusuke Fujita, Ryoichi Takashima, Takeshi Homma, Rintaro Ikeshita, Yohei Kawaguchi, Takashi Sumiyoshi, Takashi Endo, Masahito Togami |
| 2015 | Using bidirectional lstm recurrent neural networks to learn high-level abstractions of sequential features for automated scoring of non-native spontaneous speech. Zhou Yu, Vikram Ramanarayanan, David Suendermann-Oeft, Xinhao Wang, Klaus Zechner, Lei Chen, Jidong Tao, Aliaksei Ivanou, Yao Qian |
| 2015 | Utterance classification in speech-to-speech translation for zero-resource languages in the hospital administration domain. Lara J. Martin, Andrew Wilkinson, Sai Sumanth Miryala, Vivian Robison, Alan W. Black |
| 2015 | Variational Bayesian PLDA for speaker diarization in the MGB challenge. Jesús Antonio Villalba López, Alfonso Ortega, Antonio Miguel, Eduardo Lleida |