ASRU C

144 papers

YearTitle / Authors
2019A Comparative Study on End-to-End Speech to Text Translation.
Parnia Bahar, Tobias Bieschke, Hermann Ney
2019A Comparative Study on Transformer vs RNN in Speech Applications.
Shigeki Karita, Xiaofei Wang, Shinji Watanabe, Takenori Yoshimura, Wangyou Zhang, Nanxin Chen, Tomoki Hayashi, Takaaki Hori, Hirofumi Inaguma, Ziyan Jiang, Masao Someki, Nelson Enrique Yalta Soplin, Ryuichi Yamamoto
2019A Comparison of End-to-End Models for Long-Form Speech Recognition.
Chung-Cheng Chiu, Anjuli Kannan, Rohit Prabhavalkar, Zhifeng Chen, Tara N. Sainath, Yonghui Wu, Wei Han, Yu Zhang, Ruoming Pang, Sergey Kishchenko, Patrick Nguyen, Arun Narayanan, Hank Liao, Shuyuan Zhang
2019A Comparison of Transformer and LSTM Encoder Decoder Models for ASR.
Albert Zeyer, Parnia Bahar, Kazuki Irie, Ralf Schlüter, Hermann Ney
2019A Cross-Corpus Study on Speech Emotion Recognition.
Rosanna Milner, Md Asif Jalal, Raymond W. M. Ng, Thomas Hain
2019A Density Ratio Approach to Language Model Fusion in End-to-End Automatic Speech Recognition.
Erik McDermott, Hasim Sak, Ehsan Variani
2019A Dropout-Based Single Model Committee Approach for Active Learning in ASR.
Jiayi Fu, Kuang Ru
2019A Modularized Neural Network with Language-Specific Output Layers for Cross-Lingual Voice Conversion.
Yi Zhou, Xiaohai Tian, Emre Yilmaz, Rohan Kumar Das, Haizhou Li
2019A Multi Purpose and Large Scale Speech Corpus in Persian and English for Speaker and Speech Recognition: The Deepmine Database.
Hossein Zeinali, Lukás Burget, Jan Honza Cernocký
2019A Unified Endpointer Using Multitask and Multidomain Training.
Shuo-Yiin Chang, Bo Li, Gabor Simko
2019Acoustic Model Adaptation from Raw Waveforms with Sincnet.
Joachim Fainberg, Ondrej Klejch, Erfan Loweimi, Peter Bell, Steve Renals
2019Adapting Pretrained Transformer to Lattices for Spoken Language Understanding.
Chao-Wei Huang, Yun-Nung Chen
2019Additional Shared Decoder on Siamese Multi-View Encoders for Learning Acoustic Word Embeddings.
Myunghun Jung, Hyungjun Lim, Jahyun Goo, Youngmoon Jung, Hoirin Kim
2019Advances in Online Audio-Visual Meeting Transcription.
Takuya Yoshioka, Yan Huang, Aviv Hurvitz, Li Jiang, Sharon Koubi, Eyal Krupka, Ido Leichter, Changliang Liu, Partha Parthasarathy, Alon Vinnikov, Lingfeng Wu, Igor Abramovski, Xiong Xiao, Wayne Xiong, Huaming Wang, Zhenghao Wang, Jun Zhang, Yong Zhao, Tianyan Zhou, Cem Aksoylar, Zhuo Chen, Moshe David, Dimitrios Dimitriadis, Yifan Gong, Ilya Gurvich, Xuedong Huang
2019Adversarial Attacks on Spoofing Countermeasures of Automatic Speaker Verification.
Songxiang Liu, Haibin Wu, Hung-yi Lee, Helen Meng
2019An Investigation into the Effectiveness of Enhancement in ASR Training and Test for Chime-5 Dinner Party Transcription.
Catalin Zorila, Christoph Böddeker, Rama Doddipatla, Reinhold Haeb-Umbach
2019An Investigation of LSTM-CTC based Joint Acoustic Model for Indian Language Identification.
Tirusha Mandava, Ravi Kumar Vuddagiri, Hari Krishna Vydana, Anil Kumar Vuppala
2019Analyzing Large Receptive Field Convolutional Networks for Distant Speech Recognition.
Salar Jafarlou, Soheil Khorram, Vinay Kothapally, John H. L. Hansen
2019Attention Based On-Device Streaming Speech Recognition with Large Speech Corpus.
Kwangyoun Kim, Seokyeong Jung, Jungin Lee, Myoungji Han, Chanwoo Kim, Kyungmin Lee, Dhananjaya Gowda, Junmo Park, Sungsoo Kim, Sichen Jin, Young-Yoon Lee, Jinsu Yeo, Daehyun Kim
2019Attention-Based Speech Recognition Using Gaze Information.
Osamu Segawa, Tomoki Hayashi, Kazuya Takeda
2019Bayesian Adversarial Learning for Speaker Recognition.
Jen-Tzung Chien, Chun Lin Kuo
2019Bootstrapping Non-Parallel Voice Conversion from Speaker-Adaptive Text-to-Speech.
Hieu-Thi Luong, Junichi Yamagishi
2019CNN with Phonetic Attention for Text-Independent Speaker Verification.
Tianyan Zhou, Yong Zhao, Jinyu Li, Yifan Gong, Jian Wu
2019Character-Aware Attention-Based End-to-End Speech Recognition.
Zhong Meng, Yashesh Gaur, Jinyu Li, Yifan Gong
2019Controlling Emotion Strength with Relative Attribute for End-to-End Speech Synthesis.
Xiaolian Zhu, Shan Yang, Geng Yang, Lei Xie
2019Data Augmentation Based on Vowel Stretch for Improving Children's Speech Recognition.
Tohru Nagano, Takashi Fukuda, Masayuki Suzuki, Gakuto Kurata
2019Detecting Deception in Political Debates Using Acoustic and Textual Features.
Daniel Kopev, Ahmed Ali, Ivan Koychev, Preslav Nakov
2019Development of Voice Spoofing Detection Systems for 2019 Edition of Automatic Speaker Verification and Countermeasures Challenge.
João Monteiro, Jahangir Alam
2019Dialogue Environments are Different from Games: Investigating Variants of Deep Q-Networks for Dialogue Policy.
Yu-An Wang, Yun-Nung Chen
2019Domain Adaptation via Teacher-Student Learning for End-to-End Speech Recognition.
Zhong Meng, Jinyu Li, Yashesh Gaur, Yifan Gong
2019Domain Expansion in DNN-Based Acoustic Models for Robust Speech Recognition.
Shahram Ghorbani, Soheil Khorram, John H. L. Hansen
2019Dover: A Method for Combining Diarization Outputs.
Andreas Stolcke, Takuya Yoshioka
2019Efficient Free Keyword Detection Based on CNN and End-to-End Continuous DP-Matching.
Tomohiro Tanaka, Takahiro Shinozaki
2019Efficient Semi-Supervised Learning for Natural Language Understanding by Optimizing Diversity.
Eunah Cho, He Xie, John P. Lalor, Varun Kumar, William M. Campbell
2019Embeddings for DNN Speaker Adaptive Training.
Joanna Rownicka, Peter Bell, Steve Renals
2019Emoception: An Inception Inspired Efficient Speech Emotion Recognition Network.
Chirag Singh, Abhay Kumar, Ajay Nagar, Suraj Tripathi, Promod Yenigalla
2019End-to-End Code-Switching ASR for Low-Resourced Language Pairs.
Xianghu Yue, Grandee Lee, Emre Yilmaz, Fang Deng, Haizhou Li
2019End-to-End Neural Speaker Diarization with Self-Attention.
Yusuke Fujita, Naoyuki Kanda, Shota Horiguchi, Yawen Xue, Kenji Nagamatsu, Shinji Watanabe
2019End-to-End Overlapped Speech Detection and Speaker Counting with Raw Waveform.
Wangyou Zhang, Man Sun, Lan Wang, Yanmin Qian
2019End-to-End Training of a Large Vocabulary End-to-End Speech Recognition System.
Chanwoo Kim, Minkyoo Shin, Shatrughan Singh, Larry Heck, Dhananjaya Gowda, Sungsoo Kim, Kwangyoun Kim, Mehul Kumar, Jiyeon Kim, Kyungmin Lee, Changwoo Han, Abhinav Garg, Eunhyang Kim
2019Enhanced Bert-Based Ranking Models for Spoken Document Retrieval.
Hsiao-Yun Lin, Tien-Hong Lo, Berlin Chen
2019Espresso: A Fast End-to-End Neural Speech Recognition Toolkit.
Yiming Wang, Sanjeev Khudanpur, Tongfei Chen, Hainan Xu, Shuoyang Ding, Hang Lv, Yiwen Shao, Nanyun Peng, Lei Xie, Shinji Watanabe
2019Explicit Alignment of Text and Speech Encodings for Attention-Based End-to-End Speech Recognition.
Jennifer Drexler, James R. Glass
2019Exploring Effective Data Augmentation with TDNN-LSTM Neural Network Embedding for Speaker Recognition.
Chien-Lin Huang
2019Exploring Model Units and Training Strategies for End-to-End Speech Recognition.
Mingkun Huang, Yizhou Lu, Lan Wang, Yanmin Qian, Kai Yu
2019FaSNet: Low-Latency Adaptive Beamforming for Multi-Microphone Audio Processing.
Yi Luo, Cong Han, Nima Mesgarani, Enea Ceolini, Shih-Chii Liu
2019From Senones to Chenones: Tied Context-Dependent Graphemes for Hybrid Speech Recognition.
Duc Le, Xiaohui Zhang, Weiyi Zheng, Christian Fügen, Geoffrey Zweig, Michael L. Seltzer
2019GANs for Children: A Generative Data Augmentation Strategy for Children Speech Recognition.
Peiyao Sheng, Zhuolin Yang, Yanmin Qian
2019Generalized Large-Context Language Models Based on Forward-Backward Hierarchical Recurrent Encoder-Decoder Models.
Ryo Masumura, Mana Ihori, Tomohiro Tanaka, Itsumi Saito, Kyosuke Nishida, Takanobu Oba
2019Hierarchical Transformers for Long Document Classification.
Raghavendra Pappagari, Piotr Zelasko, Jesús Villalba, Yishay Carmiel, Najim Dehak
2019Highly Efficient Neural Network Language Model Compression Using Soft Binarization Training.
Rao Ma, Qi Liu, Kai Yu
2019IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2019, Singapore, December 14-18, 2019
2019Improved Multi-Stage Training of Online Attention-Based Encoder-Decoder Models.
Abhinav Garg, Dhananjaya Gowda, Ankur Kumar, Kwangyoun Kim, Mehul Kumar, Chanwoo Kim
2019Improving Fundamental Frequency Generation in EMG-to-Speech Conversion Using a Quantization Approach.
Lorenz Diener, Tejas Umesh, Tanja Schultz
2019Improving Grapheme-to-Phoneme Conversion by Investigating Copying Mechanism in Recurrent Architectures.
Abhishek Niranjan, M. Ali Basha Shaik
2019Improving Mandarin End-to-End Speech Synthesis by Self-Attention and Learnable Gaussian Bias.
Fengyu Yang, Shan Yang, Pengcheng Zhu, Pengju Yan, Lei Xie
2019Improving RNN Transducer Modeling for End-to-End Speech Recognition.
Jinyu Li, Rui Zhao, Hu Hu, Yifan Gong
2019Improving Speech Enhancement with Phonetic Embedding Features.
Bo Wu, Meng Yu, Lianwu Chen, Mingjie Jin, Dan Su, Dong Yu
2019Improving Speech-Based End-of-Turn Detection Via Cross-Modal Representation Learning with Punctuated Text Data.
Ryo Masumura, Mana Ihori, Tomohiro Tanaka, Atsushi Ando, Ryo Ishii, Takanobu Oba, Ryuichiro Higashinaka
2019In-the-Wild End-to-End Detection of Speech Affecting Diseases.
M. Joana Correia, Isabel Trancoso, Bhiksha Raj
2019Incorporating Prior Knowledge into Speaker Diarization and Linking for Identifying Common Speaker.
Tsun-Yat Leung, Lahiru Samarakoon, Albert Y. S. Lam
2019Incremental Lattice Determinization for WFST Decoders.
Zhehuai Chen, Mahsa Yarmohammadi, Hainan Xu, Hang Lv, Lei Xie, Daniel Povey, Sanjeev Khudanpur
2019Integrating Source-Channel and Attention-Based Sequence-to-Sequence Models for Speech Recognition.
Qiujia Li, Chao Zhang, Philip C. Woodland
2019Investigation of Shallow Wavenet Vocoder with Laplacian Distribution Output.
Patrick Lumban Tobing, Tomoki Hayashi, Tomoki Toda
2019Joint Distribution Learning in the Framework of Variational Autoencoders for Far-Field Speech Enhancement.
Mahesh K. Chelimilla, Shashi Kumar, Shakti P. Rath
2019Joint Learning of Word and Label Embeddings for Sequence Labelling in Spoken Language Understanding.
Jiewen Wu, Luis Fernando D'Haro, Nancy F. Chen, Pavitra Krishnaswamy, Rafael E. Banchs
2019Joint Optimization of Classification and Clustering for Deep Speaker Embedding.
Zhiming Wang, Kaisheng Yao, Shuo Fang, Xiaolong Li
2019Knowledge Distillation from Bert in Pre-Training and Fine-Tuning for Polyphone Disambiguation.
Hao Sun, Xu Tan, Jun-Wei Gan, Sheng Zhao, Dongxu Han, Hongzhi Liu, Tao Qin, Tie-Yan Liu
2019Language Model Bootstrapping Using Neural Machine Translation for Conversational Speech Recognition.
Surabhi Punjabi, Harish Arsikere, Sri Garimella
2019Latent Space Representation for Multi-Target Speaker Detection and Identification with a Sparse Dataset Using Triplet Neural Networks.
Kin Wai Cheuk, Balamurali B. T., Gemma Roig, Dorien Herremans
2019Lead2Gold: Towards Exploiting the Full Potential of Noisy Transcriptions for Speech Recognition.
Adrien Dufraux, Emmanuel Vincent, Awni Y. Hannun, Armelle Brun, Matthijs Douze
2019Learning Between Different Teacher and Student Models in ASR.
Jeremy Heng Meng Wong, Mark J. F. Gales, Yu Wang
2019Learning Hierarchical Representations for Expressive Speaking Style in End-to-End Speech Synthesis.
Xiaochun An, Yuxuan Wang, Shan Yang, Zejun Ma, Lei Xie
2019Leveraging Language ID in Multilingual End-to-End Speech Recognition.
Austin Waters, Neeraj Gaur, Parisa Haghani, Pedro J. Moreno, Zhongdi Qu
2019Listening While Speaking and Visualizing: Improving ASR Through Multimodal Chain.
Johanes Effendi, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura
2019Logistic Similarity Metric Learning via Affinity Matrix for Text-Independent Speaker Verification.
Junyi Peng, Rongzhi Gu, Yuexian Zou
2019Long Range Acoustic and Deep Features Perspective on ASVspoof 2019.
Rohan Kumar Das, Jichen Yang, Haizhou Li
2019Low-Resource Domain Adaptation for Speaker Recognition Using Cycle-Gans.
Phani Sankar Nidadavolu, Saurabh Kataria, Jesús Villalba, Najim Dehak
2019MIMO-Speech: End-to-End Multi-Channel Multi-Speaker Speech Recognition.
Xuankai Chang, Wangyou Zhang, Yanmin Qian, Jonathan Le Roux, Shinji Watanabe
2019Markov Recurrent Neural Network Language Model.
Jen-Tzung Chien, Che-Yu Kuo
2019Mixed Bandwidth Acoustic Modeling Leveraging Knowledge Distillation.
Takashi Fukuda, Samuel Thomas
2019Monotonic Recurrent Neural Network Transducer and Decoding Strategies.
Anshuman Tripathi, Han Lu, Hasim Sak, Hagen Soltau
2019Multilingual Bottleneck Features for Query by Example Spoken Term Detection.
Dhananjay Ram, Lesly Miculicich, Hervé Bourlard
2019Multilingual End-to-End Speech Translation.
Hirofumi Inaguma, Kevin Duh, Tatsuya Kawahara, Shinji Watanabe
2019Native Language Identification from Raw Waveforms Using Deep Convolutional Neural Networks with Attentive Pooling.
Rutuja Ubale, Vikram Ramanarayanan, Yao Qian, Keelan Evanini, Chee Wee Leong, Chong Min Lee
2019Neural Machine Translation with Acoustic Embedding.
Takatomo Kano, Sakriani Sakti, Satoshi Nakamura
2019Novel Enhanced Teager Energy Based Cepstral Coefficients for Replay Spoof Detection.
Rajul Acharya, Hemant A. Patil, Harsh Kotta
2019On Temporal Context Information for Hybrid BLSTM-Based Phoneme Recognition.
Timo Lohrenz, Maximilian Strake, Tim Fingscheidt
2019On the Study of Generative Adversarial Networks for Cross-Lingual Voice Conversion.
Berrak Sisman, Mingyang Zhang, Minghui Dong, Haizhou Li
2019One-to-Many Multilingual End-to-End Speech Translation.
Mattia Antonino Di Gangi, Matteo Negri, Marco Turchi
2019Online Batch Normalization Adaptation for Automatic Speech Recognition.
Franco Mana, Felix Weninger, Roberto Gemello, Puming Zhan
2019Optimizing Neural Network Embeddings Using a Pair-Wise Loss for Text-Independent Speaker Verification.
Hira Dhamyal, Tianyan Zhou, Bhiksha Raj, Rita Singh
2019Orthogonality Constrained Multi-Head Attention for Keyword Spotting.
Mingu Lee, Jinkyu Lee, Hye Jin Jang, Byeonggeun Kim, Wonil Chang, Kyuwoong Hwang
2019Paraphrase Generation Based on VAE and Pointer-Generator Networks.
Lohith Ravuru, Hyungtak Choi, Siddarth K. M., Hojung Lee, Inchul Hwang
2019Personalization of End-to-End Speech Recognition on Mobile Devices for Named Entities.
Khe Chai Sim, Leif Johnson, Giovanni Motta, Lillian Zhou, Françoise Beaufays, Arnaud Benard, Dhruv Guliani, Andreas Kabel, Nikhil Khare, Tamar Lucassen, Petr Zadrazil, Harry Zhang
2019Power-Law Nonlinearity with Maximally Uniform Distribution Criterion for Improved Neural Network Training in Automatic Speech Recognition.
Chanwoo Kim, Mehul Kumar, Kwangyoun Kim, Dhananjaya Gowda
2019Probing the Information Encoded in X-Vectors.
Desh Raj, David Snyder, Daniel Povey, Sanjeev Khudanpur
2019Query-by-Example On-Device Keyword Spotting.
Byeonggeun Kim, Mingu Lee, Jinkyu Lee, Yeonseok Kim, Kyuwoong Hwang
2019Recognizing Long-Form Speech Using Streaming End-to-End Models.
Arun Narayanan, Rohit Prabhavalkar, Chung-Cheng Chiu, David Rybach, Tara N. Sainath, Trevor Strohman
2019Recurrent Neural Network Transducer for Audio-Visual Speech Recognition.
Takaki Makino, Hank Liao, Yannis M. Assael, Brendan Shillingford, Basilio Garcia, Otavio Braga, Olivier Siohan
2019Robust Belief State Space Representation for Statistical Dialogue Managers Using Deep Autoencoders.
Fotios Lygerakis, Vassilios Diakoloukas, Michail Lagoudakis, Margarita Kotti
2019SLU for Voice Command in Smart Home: Comparison of Pipeline and End-to-End Approaches.
Thierry Desot, François Portet, Michel Vacher
2019Scalable Neural Dialogue State Tracking.
Vevake Balaraman, Bernardo Magnini
2019Second Language Transfer Learning in Humans and Machines Using Image Supervision.
Kiran Praveen, Anshul Gupta, Akshara Soman, Sriram Ganapathy
2019Self-Adaptive Soft Voice Activity Detection Using Deep Neural Networks for Robust Speaker Verification.
Youngmoon Jung, Yeunju Choi, Hoirin Kim
2019Semi-Supervised Training and Data Augmentation for Adaptation of Automatic Broadcast News Captioning Systems.
Yinghui Huang, Samuel Thomas, Masayuki Suzuki, Zoltán Tüske, Larry Sansone, Michael Picheny
2019Short Utterance Compensation in Speaker Verification via Cosine-Based Teacher-Student Learning of Speaker Embeddings.
Jee-weon Jung, Hee-Soo Heo, Hye-jin Shim, Ha-Jin Yu
2019Simple Gated Convnet for Small Footprint Acoustic Modeling.
Lukas Lee, Jinhwan Park, Wonyong Sung
2019Simplified LSTMS for Speech Recognition.
George Saon, Zoltán Tüske, Kartik Audhkhasi, Brian Kingsbury, Michael Picheny, Samuel Thomas
2019Simultaneous Speech Recognition and Speaker Diarization for Monaural Dialogue Recordings with Target-Speaker Acoustic Models.
Naoyuki Kanda, Shota Horiguchi, Yusuke Fujita, Yawen Xue, Kenji Nagamatsu, Shinji Watanabe
2019Small-Footprint Keyword Spotting with Graph Convolutional Network.
Xi Chen, Shouyi Yin, Dandan Song, Peng Ouyang, Leibo Liu, Shaojun Wei
2019Spatio-Temporal Context Modelling for Speech Emotion Classification.
Md Asif Jalal, Roger K. Moore, Thomas Hain
2019Speaker Adaptive Training Using Model Agnostic Meta-Learning.
Ondrej Klejch, Joachim Fainberg, Peter Bell, Steve Renals
2019Speaker Verification with Application-Aware Beamforming.
Ladislav Mosner, Oldrich Plchot, Johan Rohdin, Lukás Burget, Jan Cernocký
2019Speaker and Language Aware Training for End-to-End ASR.
Shubham Bansal, Karan Malhotra, Sriram Ganapathy
2019Speaker-Aware Speech-Transformer.
Zhiyun Fan, Jie Li, Shiyu Zhou, Bo Xu
2019Speech Recognition with Augmented Synthesized Speech.
Andrew Rosenberg, Yu Zhang, Bhuvana Ramabhadran, Ye Jia, Pedro J. Moreno, Yonghui Wu, Zelin Wu
2019Speech Reveals Future Risk of Developing Dementia: Predictive Dementia Screening from Biographic Interviews.
Jochen Weiner, Claudia Frankenberg, Johannes Schröder, Tanja Schultz
2019Speech Separation Using Speaker Inventory.
Peidong Wang, Zhuo Chen, Xiong Xiao, Zhong Meng, Takuya Yoshioka, Tianyan Zhou, Liang Lu, Jinyu Li
2019Speech-to-Speech Translation Between Untranscribed Unknown Languages.
Andros Tjandra, Sakriani Sakti, Satoshi Nakamura
2019Spherediar: An Effective Speaker Diarization System for Meeting Data.
Tuomas Kaseva, Aku Rouhe, Mikko Kurimo
2019Spoken Language Identification Using Bidirectional LSTM Based LID Sequential Senones.
H. Muralikrishna, Pulkit Sapra, Anuksha Jain, Dileep Aroor Dinesh
2019Spoken Multiple-Choice Question Answering Using Multimodal Convolutional Neural Networks.
Shang-Bao Luo, Hung-Shin Lee, Kuan-Yu Chen, Hsin-Min Wang
2019Spoof Detection Using Time-Delay Shallow Neural Network and Feature Switching.
Mari Ganesh Kumar, Suvidha Rupesh Kumar, M. S. Saranya, B. Bharathi, Hema A. Murthy
2019State-of-the-Art Speech Recognition Using Multi-Stream Self-Attention with Dilated 1D Convolutions.
Kyu Jeong Han, Ramon Prieto, Tao Ma
2019Streaming End-to-End Speech Recognition with Joint CTC-Attention Based Models.
Niko Moritz, Takaaki Hori, Jonathan Le Roux
2019Syllable-Dependent Discriminative Learning for Small Footprint Text-Dependent Speaker Verification.
Junyi Peng, Yuexian Zou, Na Li, Deyi Tuo, Dan Su, Meng Yu, Chunlei Zhang, Dong Yu
2019Tacotron-Based Acoustic Model Using Phoneme Alignment for Practical Neural Text-to-Speech Systems.
Takuma Okamoto, Tomoki Toda, Yoshinori Shiga, Hisashi Kawai
2019The MGB-5 Challenge: Recognition and Dialect Identification of Dialectal Arabic Speech.
Ahmed Ali, Suwon Shon, Younes Samih, Hamdy Mubarak, Ahmed Abdelali, James R. Glass, Steve Renals, Khalid Choukri
2019Time Domain Audio Visual Speech Separation.
Jian Wu, Yong Xu, Shi-Xiong Zhang, Lianwu Chen, Meng Yu, Lei Xie, Dong Yu
2019Time-Domain Speaker Extraction Network.
Chenglin Xu, Wei Rao, Eng Siong Chng, Haizhou Li
2019Topic-Aware Pointer-Generator Networks for Summarizing Spoken Conversations.
Zhengyuan Liu, Angela Ng, Sheldon Lee Shao Guang, Ai Ti Aw, Nancy F. Chen
2019Towards Controlling False Alarm - Miss Trade-Off in Perceptual Speaker Comparison via Non-Neutral Listening Task Framing.
Rosa González Hautamäki, Tomi H. Kinnunen
2019Towards Real-Time Mispronunciation Detection in Kids' Speech.
Peter Plantinga, Eric Fosler-Lussier
2019Training Language Models for Long-Span Cross-Sentence Evaluation.
Kazuki Irie, Albert Zeyer, Ralf Schlüter, Hermann Ney
2019Transfer Learning for Context-Aware Spoken Language Understanding.
Qian Chen, Zhu Zhuo, Wen Wang, Qiuyun Xu
2019Transformer ASR with Contextual Block Processing.
Emiru Tsunoo, Yosuke Kashiwagi, Toshiyuki Kumakura, Shinji Watanabe
2019Unsupervised Adaptation of Acoustic Models for ASR Using Utterance-Level Embeddings from Squeeze and Excitation Networks.
Hardik B. Sailor, Salil Deena, Md Asif Jalal, Rasa Lileikyte, Thomas Hain
2019Using Very Deep Convolutional Neural Networks to Automatically Detect Plagiarized Spoken Responses.
Xinhao Wang, Keelan Evanini, Yao Qian, Klaus Zechner
2019Verifying Deep Keyword Spotting Detection with Acoustic Word Embeddings.
Yougen Yuan, Zhiqiang Lv, Shen Huang, Lei Xie
2019Virtual Adversarial Training for DS-CNN Based Small-Footprint Keyword Spotting.
Xiong Wang, Sining Sun, Lei Xie
2019WaveNet Factorization with Singular Value Decomposition for Voice Conversion.
Hongqiang Du, Xiaohai Tian, Lei Xie, Haizhou Li
2019Zero-Shot Code-Switching ASR and TTS with Multilingual Machine Speech Chain.
Sahoko Nakayama, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura
2019Zero-Shot Pronunciation Lexicons for Cross-Language Acoustic Model Transfer.
Matthew Wiesner, Oliver Adams, David Yarowsky, Jan Trmal, Sanjeev Khudanpur