ASRU C

108 papers

Year Title / Authors
2015 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2015, Scottsdale, AZ, USA, December 13-17, 2015
2015 A CHiME-3 challenge system: Long-term acoustic features for noise robust automatic speech recognition.
Niko Moritz, Stephan Gerlach, Kamil Adiloglu, Jörn Anemüller, Birger Kollmeier, Stefan Goetze
2015 A comparative study of neural network models for lexical intent classification.
Suman V. Ravuri, Andreas Stolcke
2015 A study of social-affective communication: Automatic prediction of emotion triggers and responses in television talk shows.
Nurul Lubis, Sakriani Sakti, Graham Neubig, Koichiro Yoshino, Tomoki Toda, Satoshi Nakamura
2015 A system for automatic alignment of broadcast media captions using weighted finite-state transducers.
Peter Bell, Steve Renals
2015 A universal model for flexible item selection in conversational dialogs.
Asli Celikyilmaz, Zhaleh Feizollahi, Dilek Hakkani-Tür, Ruhi Sarikaya
2015 Acoustic model training based on node-wise weight boundary model increasing speed of discrete neural networks.
Ryu Takeda, Kazunori Komatani, Kazuhiro Nakadai
2015 Acoustic modeling with neural graph embeddings.
Yuzong Liu, Katrin Kirchhoff
2015 Acoustic modelling with CD-CTC-SMBR LSTM RNNs.
Andrew W. Senior, Hasim Sak, Felix de Chaumont Quitry, Tara N. Sainath, Kanishka Rao
2015 Adaptive beamforming and adaptive training of DNN acoustic models for enhanced multichannel noisy speech recognition.
Alexey Prudnikov, Maxim Korenevsky, Sergei Aleinik
2015 Adaptive selection from multiple response candidates in example-based dialogue.
Masahiro Mizukami, Hideaki Kizuki, Toshio Nomura, Graham Neubig, Koichiro Yoshino, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura
2015 An i-Vector PLDA based gender identification approach for severely distorted and multilingual DARPA RATS data.
Shivesh Ranjan, Gang Liu, John H. L. Hansen
2015 An information fusion approach to recognizing microphone array speech in the CHiME-3 challenge based on a deep learning framework.
Jun Du, Qing Wang, Yanhui Tu, Xiao Bao, Li-Rong Dai, Chin-Hui Lee
2015 An iterative deep learning framework for unsupervised discovery of speech features and linguistic units with applications on spoken term detection.
Cheng-Tao Chung, Cheng-Yu Tsai, Hsiang-Hung Lu, Chia-Hsiang Liu, Hung-yi Lee, Lin-Shan Lee
2015 Analysis of factors affecting system performance in the ASpIRE challenge.
Jennifer Melot, Nicolas Malyska, Jessica Ray, Wade Shen
2015 Applying deep learning to answer selection: A study and an open task.
Minwei Feng, Bing Xiang, Michael R. Glass, Lidan Wang, Bowen Zhou
2015 Automatic prosody prediction for Chinese speech synthesis using BLSTM-RNN and embedding features.
Chuang Ding, Lei Xie, Jie Yan, Weini Zhang, Yang Liu
2015 Automation of system building for state-of-the-art large vocabulary speech recognition using evolution strategy.
Takafumi Moriya, Tomohiro Tanaka, Takahiro Shinozaki, Shinji Watanabe, Kevin Duh
2015 BLSTM supported GEV beamformer front-end for the 3rd CHiME challenge.
Jahn Heymann, Lukas Drude, Aleksej Chinaev, Reinhold Haeb-Umbach
2015 Boosted acoustic model learning and hypotheses rescoring on the CHiME-3 task.
Shahab Jalalvand, Daniele Falavigna, Marco Matassoni, Piergiorgio Svaizer, Maurizio Omologo
2015 CRIM and LIUM approaches for multi-genre broadcast media transcription.
Vishwa Gupta, Paul Deléglise, Gilles Boulianne, Yannick Estève, Sylvain Meignier, Anthony Rousseau
2015 Cambridge University transcription systems for the multi-genre broadcast challenge.
Philip C. Woodland, Xunying Liu, Yanmin Qian, Chao Zhang, Mark J. F. Gales, Penny Karanasou, Pierre Lanchantin, Linlin Wang
2015 Combination of syllable based N-gram search and word search for spoken term detection through spoken queries and IV/OOV classification.
Nagisa Sakamoto, Kazumasa Yamamoto, Seiichi Nakagawa
2015 Combining spectral feature mapping and multi-channel model-based source separation for noise-robust automatic speech recognition.
Deblin Bagchi, Michael I. Mandel, Zhongqiu Wang, Yanzhang He, Andrew R. Plummer, Eric Fosler-Lussier
2015 Deep bi-directional recurrent networks over spectral windows.
Abdel-rahman Mohamed, Frank Seide, Dong Yu, Jasha Droppo, Andreas Stolcke, Geoffrey Zweig, Gerald Penn
2015 Deep bottleneck features for i-vector based text-independent speaker verification.
Sina Hamidi Ghalehjegh, Richard C. Rose
2015 Deep multimodal semantic embeddings for speech and images.
David F. Harwath, James R. Glass
2015 Detecting actionable items in meetings by convolutional deep structured semantic models.
Yun-Nung Chen, Dilek Hakkani-Tür, Xiaodong He
2015 Different word representations and their combination for proper name retrieval from diachronic documents.
Irina Illina, Dominique Fohr
2015 Discriminative segmental cascades for feature-rich phone recognition.
Hao Tang, Weiran Wang, Kevin Gimpel, Karen Livescu
2015 Discriminative training of context-dependent language model scaling factors and interpolation weights.
Shuangyu Chang, Abhik Lahiri, Issac Alphonso, Barlas Oguz, Michael Levit, Benoît Dumoulin
2015 EESEN: End-to-end speech recognition using deep RNN models and WFST-based decoding.
Yajie Miao, Mohammad Gowayyed, Florian Metze
2015 Exploiting synchrony spectra and deep neural networks for noise-robust automatic speech recognition.
Ning Ma, Ricard Marxer, Jon Barker, Guy J. Brown
2015 High-performance Swahili keyword search with very limited language pack: The THUEE system for the OpenKWS15 evaluation.
Meng Cai, Zhiqiang Lv, Cheng Lu, Jian Kang, Like Hui, Zhuo Zhang, Jia Liu
2015 Hilbert spectral analysis of vowels using intrinsic mode functions.
Steven Sandoval, Phillip L. De Leon, Julie M. Liss
2015 Hybrid DNN-Latent structured SVM acoustic models for continuous speech recognition.
Suman V. Ravuri
2015 Implementation of generic positive-negative tracker in extensible dialog system.
Sangjun Koo, Seonghan Ryu, Gary Geunbae Lee
2015 Improved system fusion for keyword search.
Zhiqiang Lv, Meng Cai, Cheng Lu, Jian Kang, Like Hui, Wei-Qiang Zhang, Jia Liu
2015 Improving data selection for low-resource STT and KWS.
Thiago Fraga-Silva, Antoine Laurent, Jean-Luc Gauvain, Lori Lamel, Viet Bac Le, Abdelkhalek Messaoudi
2015 Improving robustness against reverberation for automatic speech recognition.
Vikramjit Mitra, Julien van Hout, Wen Wang, Martin Graciarena, Mitchell McLaren, Horacio Franco, Dimitra Vergyri
2015 Improving the interpretability of deep neural networks with stimulated learning.
Shawn Tan, Khe Chai Sim, Mark J. F. Gales
2015 Incorporating paragraph embeddings and density peaks clustering for spoken document summarization.
Kuan-Yu Chen, Kai-Wun Shih, Shih-Hung Liu, Berlin Chen, Hsin-Min Wang
2015 Incorporating user feedback to re-rank keyword search results.
Scott Novotney, Kevin Jett, Owen Kimball
2015 Incremental LSTM-based dialog state tracker.
Lukás Zilka, Filip Jurcícek
2015 Incremental sentence compression using LSTM recurrent networks.
Sakriani Sakti, Faiz Ilham, Graham Neubig, Tomoki Toda, Ayu Purwarianti, Satoshi Nakamura
2015 Investigating sparse deep neural networks for speech recognition.
Gueorgui Pironkov, Stéphane Dupont, Thierry Dutoit
2015 Investigation of back-off based interpolation between recurrent neural network and n-gram language models.
Xie Chen, Xunying Liu, Mark J. F. Gales, Philip C. Woodland
2015 JHU ASpIRE system: Robust LVCSR with TDNNs, iVector adaptation and RNN-LMs.
Vijayaditya Peddinti, Guoguo Chen, Vimal Manohar, Tom Ko, Daniel Povey, Sanjeev Khudanpur
2015 LSTM time and frequency recurrence for automatic speech recognition.
Jinyu Li, Abdelrahman Mohamed, Geoffrey Zweig, Yifan Gong
2015 Latent Dirichlet Allocation based organisation of broadcast media archives for deep neural network adaptation.
Mortaza Doulaty, Oscar Saz, Raymond W. M. Ng, Thomas Hain
2015 Learning continuous representation of text for phone duration modeling in statistical parametric speech synthesis.
Sai Krishna Rallabandi, Sai Sirisha Rallabandi, Padmini Bandi, Suryakanth V. Gangashetty
2015 Learning factorized feature transforms for speaker normalization.
Lahiru Samarakoon, Khe Chai Sim
2015 Multi-channel speech processing architectures for noise robust speech recognition: 3rd CHiME challenge results.
Lukas Pfeifenberger, Tobias Schrank, Matthias Zöhrer, Martin Hagmüller, Franz Pernkopf
2015 Multi-domain dialogue success classifiers for policy training.
David Vandyke, Pei-Hao Su, Milica Gasic, Nikola Mrksic, Tsung-Hsien Wen, Steve J. Young
2015 Multi-reference WER for evaluating ASR for languages with no orthographic rules.
Ahmed M. Ali, Walid Magdy, Peter Bell, Steve Renals
2015 Multi-task joint-learning of deep neural networks for robust speech recognition.
Yanmin Qian, Maofan Yin, Yongbin You, Kai Yu
2015 Multilingual representations for low resource speech recognition and keyword search.
Jia Cui, Brian Kingsbury, Bhuvana Ramabhadran, Abhinav Sethy, Kartik Audhkhasi, Xiaodong Cui, Ellen Kislal, Lidia Mangu, Markus Nußbaum-Thom, Michael Picheny, Zoltán Tüske, Pavel Golik, Ralf Schlüter, Hermann Ney, Mark J. F. Gales, Kate M. Knill, Anton Ragni, Haipeng Wang, Philip C. Woodland
2015 Multimodal embedding fusion for robust speaker role recognition in video broadcast.
Mickael Rouvier, Sebastien Delecraz, Benoît Favre, Meriem Bendris, Frédéric Béchet
2015 Multitask learning and system combination for automatic speech recognition.
Olivier Siohan, David Rybach
2015 Name-aware language model adaptation and sparse features for statistical machine translation.
Wen Wang, Haibo Li, Heng Ji
2015 Natural language understanding for partial queries.
Xiaohu Liu, Asli Celikyilmaz, Ruhi Sarikaya
2015 Naturalness and rapport in a pitch adaptive learning companion.
Nichola Lubold, Heather Pon-Barry, Erin Walker
2015 On constructing and analysing an interpretable brain model for the DNN based on hidden activity patterns.
Khe Chai Sim
2015 Open-domain personalized dialog system using user-interested topics in system responses.
Jeesoo Bang, Sangdo Han, Kyusong Lee, Gary Geunbae Lee
2015 Optimizing human-interpretable dialog management policy using genetic algorithm.
Hang Ren, Weiqun Xu, Yonghong Yan
2015 Personalizing universal recurrent neural network language model with user characteristic features by social network crowdsourcing.
Bo-Hsiang Tseng, Hung-yi Lee, Lin-Shan Lee
2015 Phonetic unit selection for cross-lingual query-by-example spoken term detection.
Paula Lopez-Otero, Laura Docío Fernández, Carmen García-Mateo
2015 Phonetically-oriented word error alignment for speech recognition error analysis in speech translation.
Nicholas Ruiz, Marcello Federico
2015 Policy committee for adaptation in multi-domain spoken dialogue systems.
Milica Gasic, Nikola Mrksic, Pei-Hao Su, David Vandyke, Tsung-Hsien Wen, Steve J. Young
2015 RNNDROP: A novel dropout for RNNs in ASR.
Taesup Moon, Heeyoul Choi, Hoshik Lee, Inchul Song
2015 Recent improvements to NeuroCRFs for named entity recognition.
Marc-Antoine Rondeau, Yi Su
2015 Robust ASR using neural network based speech enhancement and feature simulation.
Sunit Sivasankaran, Aditya Arie Nugraha, Emmanuel Vincent, Juan Andres Morales-Cordovilla, Siddharth Dalmia, Irina Illina, Antoine Liutkus
2015 Robust speech recognition in unknown reverberant and noisy conditions.
Roger Hsiao, Jeff Z. Ma, William Hartmann, Martin Karafiát, Frantisek Grézl, Lukás Burget, Igor Szöke, Jan Cernocký, Shinji Watanabe, Zhuo Chen, Sri Harish Reddy Mallidi, Hynek Hermansky, Stavros Tsakalidis, Richard M. Schwartz
2015 Robust speech recognition using beamforming with adaptive microphone gains and multichannel noise reduction.
Shengkui Zhao, Xiong Xiao, Zhaofeng Zhang, Thi Ngoc Tho Nguyen, Xionghu Zhong, Bo Ren, Longbiao Wang, Douglas L. Jones, Engsiong Chng, Haizhou Li
2015 Semi-supervised slot tagging in spoken language understanding using recurrent transductive support vector machines.
Yangyang Shi, Kaisheng Yao, Hu Chen, Yi-Cheng Pan, Mei-Yuh Hwang
2015 Single and multi-channel approaches for distant speech recognition under noisy reverberant conditions: I2R's system description for the ASpIRE challenge.
Jonathan William Dennis, Tran Huy Dat
2015 Sparse non-negative matrix language modeling for geo-annotated query session data.
Ciprian Chelba, Noam Shazeer
2015 Speaker adaptive joint training of Gaussian mixture models and bottleneck features.
Zoltán Tüske, Pavel Golik, Ralf Schlüter, Hermann Ney
2015 Speaker diarisation and longitudinal linking in multi-genre broadcast data.
Penny Karanasou, Mark J. F. Gales, Pierre Lanchantin, Xunying Liu, Yanmin Qian, Linlin Wang, Philip C. Woodland, Chao Zhang
2015 Speaker intonation adaptation for transforming text-to-speech synthesis speaker identity.
Mahsa Sadat Elyasi Langarani, Jan P. H. van Santen
2015 Speaker location and microphone spacing invariant acoustic modeling from raw multichannel waveforms.
Tara N. Sainath, Ron J. Weiss, Kevin W. Wilson, Arun Narayanan, Michiel Bacchiani, Andrew W. Senior
2015 Spectral learning with non negative probabilities for finite state automaton.
Hadrien Glaude, Cyrille Enderli, Olivier Pietquin
2015 Speech enhancement using beamforming and non negative matrix factorization for robust speech recognition in the CHiME-3 challenge.
Thanh T. Vu, Benjamin Bigot, Engsiong Chng
2015 Spoken language translation graphs re-decoding using automatic quality assessment.
Laurent Besacier, Benjamin Lecouteux, Ngoc-Quang Luong, Ngoc-Tien Le
2015 Stochastic Gradient Variational Bayes for deep learning-based ASR.
Andros Tjandra, Sakriani Sakti, Satoshi Nakamura, Mirna Adriani
2015 Structured discriminative models using deep neural-network features.
Rogier C. van Dalen, Jingzhou Yang, Haipeng Wang, Anton Ragni, Chao Zhang, Mark J. F. Gales
2015 The 2015 Sheffield system for longitudinal diarisation of broadcast media.
Rosanna Milner, Oscar Saz, Salil Deena, Mortaza Doulaty, Raymond W. M. Ng, Thomas Hain
2015 The 2015 Sheffield system for transcription of Multi-Genre Broadcast media.
Oscar Saz, Mortaza Doulaty, Salil Deena, Rosanna Milner, Raymond W. M. Ng, Madina Hasan, Yulan Liu, Thomas Hain
2015 The Automatic Speech recognition In Reverberant Environments (ASpIRE) challenge.
Mary Harper
2015 The DIRHA-ENGLISH corpus and related tasks for distant-speech recognition in domestic environments.
Mirco Ravanelli, Luca Cristoforetti, Roberto Gretter, Marco Pellin, Alessandro Sosi, Maurizio Omologo
2015 The MERL/SRI system for the 3rd CHiME challenge using beamforming, robust feature extraction, and advanced speech recognition.
Takaaki Hori, Zhuo Chen, Hakan Erdogan, John R. Hershey, Jonathan Le Roux, Vikramjit Mitra, Shinji Watanabe
2015 The MGB challenge: Evaluating multi-genre broadcast media recognition.
Peter Bell, Mark J. F. Gales, Thomas Hain, Jonathan Kilgour, Pierre Lanchantin, Xunying Liu, Andrew McParland, Steve Renals, Oscar Saz, Mirjam Wester, Philip C. Woodland
2015 The NAIST ASR system for the 2015 Multi-Genre Broadcast challenge: On combination of deep learning systems using a rank-score function.
Quoc Truong Do, Michael Heck, Sakriani Sakti, Graham Neubig, Tomoki Toda, Satoshi Nakamura
2015 The NTT CHiME-3 system: Advances in speech enhancement and recognition for mobile multi-microphone devices.
Takuya Yoshioka, Nobutaka Ito, Marc Delcroix, Atsunori Ogawa, Keisuke Kinoshita, Masakiyo Fujimoto, Chengzhu Yu, Wojciech J. Fabian, Miquel Espi, Takuya Higuchi, Shoko Araki, Tomohiro Nakatani
2015 The development of the Cambridge University alignment systems for the multi-genre broadcast challenge.
Pierre Lanchantin, Mark J. F. Gales, Penny Karanasou, Xunying Liu, Yanmin Qian, Linlin Wang, Philip C. Woodland, Chao Zhang
2015 The third 'CHiME' speech separation and recognition challenge: Dataset, task and baselines.
Jon Barker, Ricard Marxer, Emmanuel Vincent, Shinji Watanabe
2015 Time delay deep neural network-based universal background models for speaker recognition.
David Snyder, Daniel Garcia-Romero, Daniel Povey
2015 Time-frequency convolutional networks for robust speech recognition.
Vikramjit Mitra, Horacio Franco
2015 Topic-space based setup of a neural network for theme identification of highly imperfect transcriptions.
Mohamed Morchid, Richard Dufour, Georges Linarès
2015 Towards structured deep neural network for automatic speech recognition.
Yi-Hsiu Liao, Hung-yi Lee, Lin-Shan Lee
2015 Towards utterance-based neural network adaptation in acoustic modeling.
Ivan Himawan, Petr Motlícek, Marc Ferras Font, Srikanth R. Madikeri
2015 Training data pseudo-shuffling and direct decoding framework for recurrent neural network based acoustic modeling.
Naoyuki Kanda, Mitsuyoshi Tachimori, Xugang Lu, Hisashi Kawai
2015 Two-stage ASGD framework for parallel training of DNN acoustic models using Ethernet.
Zhichao Wang, Xingyu Na, Xin Li, Jielin Pan, Yonghong Yan
2015 Uncertainty estimation of DNN classifiers.
Sri Harish Reddy Mallidi, Tetsuji Ogawa, Hynek Hermansky
2015 Unified ASR system using LGM-based source separation, noise-robust feature extraction, and word hypothesis selection.
Yusuke Fujita, Ryoichi Takashima, Takeshi Homma, Rintaro Ikeshita, Yohei Kawaguchi, Takashi Sumiyoshi, Takashi Endo, Masahito Togami
2015 Using bidirectional LSTM recurrent neural networks to learn high-level abstractions of sequential features for automated scoring of non-native spontaneous speech.
Zhou Yu, Vikram Ramanarayanan, David Suendermann-Oeft, Xinhao Wang, Klaus Zechner, Lei Chen, Jidong Tao, Aliaksei Ivanou, Yao Qian
2015 Utterance classification in speech-to-speech translation for zero-resource languages in the hospital administration domain.
Lara J. Martin, Andrew Wilkinson, Sai Sumanth Miryala, Vivian Robison, Alan W. Black
2015 Variational Bayesian PLDA for speaker diarization in the MGB challenge.
Jesús Antonio Villalba López, Alfonso Ortega, Antonio Miguel, Eduardo Lleida