| 2014 | "was that your mother on the phone?": classifying interpersonal relationships between dialog participants with lexical and acoustic properties. Denys Katerenchuk, David Guy Brizan, Andrew Rosenberg |
| 2014 | 'Houston, we have a solution': a case study of the analysis of astronaut speech during NASA Apollo 11 for long-term speaker modeling. Chengzhu Yu, John H. L. Hansen, Douglas W. Oard |
| 2014 | 1-bit stochastic gradient descent and its application to data-parallel distributed training of speech DNNs. Frank Seide, Hao Fu, Jasha Droppo, Gang Li, Dong Yu |
| 2014 | 15th Annual Conference of the International Speech Communication Association, INTERSPEECH 2014, Singapore, September 14-18, 2014. Haizhou Li, Helen M. Meng, Bin Ma, Engsiong Chng, Lei Xie |
| 2014 | 3D tongue motion visualization based on ultrasound image sequences. Kele Xu, Yin Yang, A. Jaumard-Hakoun, Martine Adda-Decker, Angélique Amelot, Samer Al Kork, Lise Crevier-Buchman, Patrick Chawah, Gérard Dreyfus, Thibaut Fux, Claire Pillot-Loiseau, Pierre Roussel, Maureen Stone, Bruce Denby |
| 2014 | A CRF-based approach to automatic disfluency detection in a French call-centre corpus. Camille Dutrey, Chloé Clavel, Sophie Rosset, Ioana Vasilescu, Martine Adda-Decker |
| 2014 | A big data approach to acoustic model training corpus selection. Olga Kapralova, John Alex, Eugene Weinstein, Pedro J. Moreno, Olivier Siohan |
| 2014 | A client mobile application for Chinese-Spanish statistical machine translation. Jordi Centelles, Marta R. Costa-jussà, Rafael E. Banchs |
| 2014 | A comparative analytic study on the Gaussian mixture and context dependent deep neural network hidden Markov models. Yan Huang, Dong Yu, Chaojun Liu, Yifan Gong |
| 2014 | A comparative study of spectral transformation techniques for singing voice synthesis. Siu Wa Lee, Zhizheng Wu, Minghui Dong, Xiaohai Tian, Haizhou Li |
| 2014 | A comparison of GMM-HMM and DNN-HMM based pronunciation verification techniques for use in the assessment of childhood apraxia of speech. Mostafa Ali Shahin, Beena Ahmed, Jacqueline McKechnie, Kirrie J. Ballard, Ricardo Gutierrez-Osuna |
| 2014 | A comparison of multiple methods for rescoring keyword search lists for low resource languages. Victor Soto, Lidia Mangu, Andrew Rosenberg, Julia Hirschberg |
| 2014 | A comparison of open-source segmentation architectures for dealing with imperfect data from the media in speech synthesis. Ascensión Gallardo-Antolín, Juan Manuel Montero, Simon King |
| 2014 | A comparison of training approaches for discriminative segmental models. Hao Tang, Kevin Gimpel, Karen Livescu |
| 2014 | A cross-vocoder study of speaker independent synthetic speech detection using phase information. Jon Sánchez, Ibon Saratxaga, Inma Hernáez, Eva Navas, Daniel Erro |
| 2014 | A crosslinguistic and acquisitional perspective on intonational rises in French. Giuseppina Turco, Elisabeth Delais-Roussarie |
| 2014 | A data-driven approach to speech enhancement using Gaussian process. Sukanya Sonowal, Kisoo Kwon, Nam Soo Kim, Jong Won Shin |
| 2014 | A deep neural network approach for sentence boundary detection in broadcast news. Chenglin Xu, Lei Xie, Guangpu Huang, Xiong Xiao, Engsiong Chng, Haizhou Li |
| 2014 | A deep neural network speaker verification system targeting microphone speech. Yun Lei, Luciana Ferrer, Mitchell McLaren, Nicolas Scheffer |
| 2014 | A flexible front-end for HTS. Matthew P. Aylett, Rasmus Dall, Arnab Ghoshal, Gustav Eje Henter, Thomas Merritt |
| 2014 | A graph-based Gaussian component clustering approach to unsupervised acoustic modeling. Haipeng Wang, Tan Lee, Cheung-Chi Leung, Bin Ma, Haizhou Li |
| 2014 | A hearing impairment simulation method using audiogram-based approximation of auditory characteristics. Nozomi Jinbo, Shinnosuke Takamichi, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura |
| 2014 | A hierarchical Viterbi algorithm for Mandarin hybrid speech synthesis system. Ran Zhang, Zhengqi Wen, Jianhua Tao, Ya Li, Bing Liu, Xiaoyan Lou |
| 2014 | A hybrid approach to 3D tongue modeling from vocal tract MRI using unsupervised image segmentation and mesh deformation. Alexander Hewer, Ingmar Steiner, Stefanie Wuhrer |
| 2014 | A hybrid approach to segmentation of speech using group delay processing and HMM based embedded reestimation. S. Aswin Shanmugam, Hema A. Murthy |
| 2014 | A keyword-boosted sMBR criterion to enhance keyword search performance in deep neural network based acoustic modeling. I-Fan Chen, Nancy F. Chen, Chin-Hui Lee |
| 2014 | A long, deep and wide artificial neural net for robust speech recognition in unknown noise. Feipeng Li, Phani S. Nidadavolu, Hynek Hermansky |
| 2014 | A low complexity model adaptation approach involving sparse coding over multiple dictionaries. Syed Shahnawazuddin, Rohit Sinha |
| 2014 | A measure of phase randomness for the harmonic model in speech synthesis. Gilles Degottex, Daniel Erro |
| 2014 | A mel-cepstral analysis technique restoring high frequency components from low-sampling-rate speech. Kazuhiro Nakamura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda |
| 2014 | A minimal-resource transliteration framework for Vietnamese. Hoang Gia Ngo, Nancy F. Chen, Sunil Sivadas, Bin Ma, Haizhou Li |
| 2014 | A new auxiliary-vector algorithm with conjugate orthogonality for speech enhancement. Shengkui Zhao, Douglas L. Jones |
| 2014 | A next step towards measuring perceived quality of speech through physiology. Sebastian Arndt, Markus Wenzel, Jan-Niklas Antons, Friedemann Köster, Sebastian Möller, Gabriel Curio |
| 2014 | A novel boosting algorithm for improved i-vector based speaker verification in noisy environments. Sourjya Sarkar, K. Sreenivasa Rao |
| 2014 | A novel dynamic parameters calculation approach for model compensation. Suliang Bu, Yanmin Qian, Kai Yu |
| 2014 | A preliminary study on ASR-based detection of Chinese mispronunciation by Japanese learners. Richeng Duan, Jinsong Zhang, Wen Cao, Yanlu Xie |
| 2014 | A preliminary study on acoustic correlates of tone2+tone2 disyllabic word stress in Mandarin. Min Liu, Shuju Shi, Jinsong Zhang |
| 2014 | A real-time MRI study of articulatory setting in second language speech. Andrés Benítez, Vikram Ramanarayanan, Louis Goldstein, Shrikanth S. Narayanan |
| 2014 | A robust TDOA estimation method for in-car-noise environments. Weiwei Cui, Jaeyeon Cho, Seungyeol Lee |
| 2014 | A robust step-size control algorithm for frequency domain acoustic echo cancellation. Chao Wu, Kaiyu Jiang, Yanmeng Guo, Qiang Fu, Yonghong Yan |
| 2014 | A semi-Markov model for speech segmentation with an utterance-break prior. Mark Sinclair, Peter Bell, Alexandra Birch, Fergus McInnes |
| 2014 | A sparse reconstruction method for speech source localization using partial dictionaries over a spherical microphone array. Kushagra Singhal, Rajesh M. Hegde |
| 2014 | A speech system for estimating daily word counts. Ali Ziaei, Abhijeet Sangwan, John H. L. Hansen |
| 2014 | A study of invariant properties and variation patterns in the converter/distributor model for emotional speech. Jangwon Kim, Donna Erickson, Sungbok Lee, Shrikanth S. Narayanan |
| 2014 | A study on the improvement of measurement accuracy of the three-dimensional electromagnetic articulography. Hidetsugu Uchida, Kohei Wakamiya, Tokihiko Kaburagi |
| 2014 | A target approximation intonation model for Yorùbá TTS. Daniel R. van Niekerk, Etienne Barnard |
| 2014 | A unified account of prominence effects in an optimization-based model of speech timing. Andreas Windmann, Juraj Simko, Petra Wagner |
| 2014 | A unified approach for underdetermined blind signal separation and source activity detection by multichannel factorial hidden Markov models. Takuya Higuchi, Hirofumi Takeda, Tomohiko Nakamura, Hirokazu Kameoka |
| 2014 | A whispered Mandarin corpus for speech technology applications. Pei Xuan Lee, Darren Wee, Hilary Si Yin Toh, Boon Pang Lim, Nancy F. Chen, Bin Ma |
| 2014 | ASR feature extraction with morphologically-filtered power-normalized cochleograms. Fernando de-la-Calle-Silos, Francisco José Valverde Albacete, Ascensión Gallardo-Antolín, Carmen Peláez-Moreno |
| 2014 | ATHENA: a Greek multi-sensory database for home automation control. Antigoni Tsiami, Isidoros Rodomagoulakis, Panagiotis Giannoulis, Athanasios Katsamanis, Gerasimos Potamianos, Petros Maragos |
| 2014 | About combining forward and backward-based decoders for selecting data for unsupervised training of acoustic models. Denis Jouvet, Dominique Fohr |
| 2014 | Accent type and phrase boundary estimation using acoustic and language models for automatic prosodic labeling. Tomoki Koriyama, Hiroshi Suzuki, Takashi Nose, Takahiro Shinozaki, Takao Kobayashi |
| 2014 | Accuracy evaluation of esophageal voice analysis based on automatic topology generated-voicing source HMM. Akira Sasou |
| 2014 | Achievements and challenges of deep learning - from speech analysis and recognition to language and multimodal processing. Li Deng |
| 2014 | Acoustic and kinematic characteristics of vowel production through a virtual vocal tract in dysarthria. Jeffrey Berry, Andrew Kolb, Cassandra North, Michael T. Johnson |
| 2014 | Acoustic characteristics of critical message utterances in noise applied to speech intelligibility enhancement. Neehar Jathar, Preeti Rao |
| 2014 | Acoustic correlates of phonological status. Maarten Versteegh, Amanda Seidl, Alejandrina Cristià |
| 2014 | Acoustic event detection and localization with regression forests. Huy Phan, Marco Maaß, Radoslaw Mazur, Alfred Mertins |
| 2014 | Acoustic feature transformation using UBM-based LDA for speaker recognition. Chengzhu Yu, Gang Liu, John H. L. Hansen |
| 2014 | Acoustic features for robust classification of Mandarin tones. Hongbing Hu, Stephen A. Zahorian, Peter Guzewich, Jiang Wu |
| 2014 | Acoustic investigation of /t Shufang Xu |
| 2014 | Acoustic modeling with deep neural networks using raw time signal for LVCSR. Zoltán Tüske, Pavel Golik, Ralf Schlüter, Hermann Ney |
| 2014 | Acoustic properties of shared vowels in bilingual Mandarin-English children. Jing Yang, Robert Allen Fox |
| 2014 | Across-speaker articulatory normalization for speaker-independent silent speech recognition. Jun Wang, Ashok Samal, Jordan R. Green |
| 2014 | Adaptation of deep neural network acoustic models using factorised i-vectors. Penny Karanasou, Yongqiang Wang, Mark J. F. Gales, Philip C. Woodland |
| 2014 | Adapting dependency parsing to spontaneous speech for open domain spoken language understanding. Frédéric Béchet, Alexis Nasr, Benoît Favre |
| 2014 | Adapting prosodic chunking algorithm and synthesis system to specific style: the case of dictation. Elisabeth Delais-Roussarie, Damien Lolive, Hiyon Yoo, Nelly Barbot, Olivier Rosec |
| 2014 | Adaptive speech recognition and dialogue management for users with speech disorders. I. Casanueva, Heidi Christensen, Thomas Hain, Phil D. Green |
| 2014 | Advantages of wideband over narrowband channels for speaker verification employing MFCCs and LFCCs. Laura Fernández Gallardo, Michael Wagner, Sebastian Möller |
| 2014 | Aero-tactile integration in fricatives: converting audio to air flow information for speech perception enhancement. Donald Derrick, Greg A. O'Beirne, Tom De Rybel, Jennifer Hay |
| 2014 | Age and rhythmic variations: a study on Italian. Massimo Pettorino, Elisa Pellegrino |
| 2014 | Age, hearing loss and the perception of affective utterances in conversational speech. Juliane Schmidt, Esther Janse, Odette Scharenborg |
| 2014 | Alignment of spoken utterances with slide content for easier learning with recorded lectures using structured support vector machine (SVM). Han Lu, Sheng-syun Shen, Sz-Rung Shiang, Hung-yi Lee, Lin-Shan Lee |
| 2014 | An adaptive envelope compression strategy for speech processing in cochlear implants. Ying-Hui Lai, Fei Chen, Yu Tsao |
| 2014 | An annotation scheme for sighs in spontaneous dialogue. Khiet P. Truong, Gerben J. Westerhof, Franciska de Jong, Dirk Heylen |
| 2014 | An educational platform to capture, visualize and analyze rare singing. Patrick Chawah, Samer Al Kork, Thibaut Fux, Martine Adda-Decker, Angélique Amelot, Nicolas Audibert, Bruce Denby, Gérard Dreyfus, A. Jaumard-Hakoun, Claire Pillot-Loiseau, Pierre Roussel, Maureen Stone, Kele Xu, Lise Crevier-Buchman |
| 2014 | An empirical study of multilingual and low-resource spoken term detection using deep neural networks. Jie Li, Xiaorui Wang, Bo Xu |
| 2014 | An evaluation of machine learning methods for prominence detection in French. George Christodoulides, Mathieu Avanzi |
| 2014 | An evaluation of unsupervised acoustic model training for a dysarthric speech interface. Oliver Walter, Vladimir Despotovic, Reinhold Haeb-Umbach, Jort F. Gemmeke, Bart Ons, Hugo Van hamme |
| 2014 | An in-depth comparison of keyword specific thresholding and sum-to-one score normalization. Yun Wang, Florian Metze |
| 2014 | An initial investigation of long-term adaptation for meeting transcription. Xie Chen, Mark J. F. Gales, Kate M. Knill, Catherine Breslin, Langzhou Chen, K. K. Chin, Vincent Wan |
| 2014 | An introduction to computational networks and the computational network toolkit (invited talk). Dong Yu, Adam Eversole, Michael L. Seltzer, Kaisheng Yao, Brian Guenter, Oleksii Kuchaiev, Frank Seide, Huaming Wang, Jasha Droppo, Zhiheng Huang, Geoffrey Zweig, Christopher J. Rossbach, Jon Currey |
| 2014 | An investigation of likelihood normalization for robust ASR. Emmanuel Vincent, Aggelos Gkiokas, Dominik Schnitzer, Arthur Flexer |
| 2014 | An investigation of the application of dynamic sinusoidal models to statistical parametric speech synthesis. Qiong Hu, Yannis Stylianou, Ranniery Maia, Korin Richmond, Junichi Yamagishi, Javier Latorre |
| 2014 | An investigation of vocal arousal dynamics in child-psychologist interactions using synchrony measures and a conversation-based model. Daniel Bone, Chi-Chun Lee, Alexandros Potamianos, Shrikanth S. Narayanan |
| 2014 | An iterative approach to decision tree training for context dependent speech synthesis. Xiayu Chen, Yang Zhang, Mark Hasegawa-Johnson |
| 2014 | An iterative speaker re-diarization scheme for improving speaker-based entity extraction in multimedia archives. Houman Ghaemmaghami, David Dean, Sridha Sridharan |
| 2014 | Analysing the prosodic characteristics of speech-chunks preceding silences in task-based interactions. John Kane, Irena Yanushevskaya, Céline De Looze, Brian Vaughan, Ailbhe Ní Chasaide |
| 2014 | Analysis and identification of human scream: implications for speaker recognition. Mahesh Kumar Nandwana, John H. L. Hansen |
| 2014 | Analysis of emotional effect on speech-body gesture interplay. Zhaojun Yang, Shrikanth S. Narayanan |
| 2014 | Analysis of i-vector framework for speaker identification in TV-shows. Corinne Fredouille, Delphine Charlet |
| 2014 | Analysis of laughter events in real science classes by using multiple environment sensor data. Carlos Toshinori Ishi, Hiroaki Hatano, Norihiro Hagita |
| 2014 | Analysis of phonetic similarity in a silent speech interface based on permanent magnetic articulography. José A. González, Lam Aun Cheah, Jie Bai, Stephen R. Ell, James M. Gilbert, Roger K. Moore, Phil D. Green |
| 2014 | Analysis of spectral enhancement using global variance in HMM-based speech synthesis. Takashi Nose, Akinori Ito |
| 2014 | Analysis of spectrogram image methods for sound event classification. Jonathan William Dennis, Tran Huy Dat, Chng Eng Siong |
| 2014 | Analyzing perceptual dimensions of conversational speech quality. Friedemann Köster, Sebastian Möller |
| 2014 | Application of convolutional neural networks to speaker recognition in noisy conditions. Mitchell McLaren, Yun Lei, Nicolas Scheffer, Luciana Ferrer |
| 2014 | Application of image processing methods to filled pauses detection from spontaneous speech. Dmytro Prylipko, Olga Egorow, Ingo Siegert, Andreas Wendemuth |
| 2014 | Application of matrix variate Gaussian mixture model to statistical voice conversion. Daisuke Saito, Hidenobu Doi, Nobuaki Minematsu, Keikichi Hirose |
| 2014 | Applications of maximum entropy rankers to problems in spoken language processing. Richard Sproat, Keith B. Hall |
| 2014 | Articulation and neutralization: a preliminary study of lenition in Scottish Gaelic. Diana Archangeli, Samuel Johnston, Jae-Hyun Sung, Muriel Fisher, Michael Hammond, Andrew Carnie |
| 2014 | Articulatory controllable speech modification based on statistical feature mapping with Gaussian mixture models. Patrick Lumban Tobing, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura, Ayu Purwarianti |
| 2014 | Articulatory dynamics and coordination in classifying cognitive change with preclinical mTBI. Brian S. Helfer, Thomas F. Quatieri, James R. Williamson, Laurel Keyes, Benjamin Evans, W. Nicholas Greene, Trina Vian, Joseph Lacirignola, Trey E. Shenk, Thomas M. Talavage, Jeff Palmer, Kristin Heaton |
| 2014 | Assessing objective characterizations of phonetic convergence. Gérard Bailly, Amélie Martin |
| 2014 | Asynchronous stochastic optimization for sequence training of deep neural networks: towards big data. Erik McDermott, Georg Heigold, Pedro J. Moreno, Andrew W. Senior, Michiel Bacchiani |
| 2014 | Asynchronous, online, GMM-free training of a context dependent acoustic model for speech recognition. Michiel Bacchiani, Andrew W. Senior, Georg Heigold |
| 2014 | Audio thumbnails for spoken content without transcription based on a maximum motif coverage criterion. Guillaume Gravier, Nathan Souviraà-Labastie, Sébastien Campion, Frédéric Bimbot |
| 2014 | Audio watermarking based on multiple echoes hiding for FM radio. Xuejun Zhang, Xiang Xie |
| 2014 | Audio-to-text alignment for speech recognition with very limited resources. Xavier Anguera, Jordi Luque, Ciro Gracia |
| 2014 | Audio-visual signal processing in a multimodal assisted living environment. Alexey Karpov, Lale Akarun, Hülya Yalçin, Alexander L. Ronzhin, Baris Evrim Demiröz, Aysun Çoban, Milos Zelezný |
| 2014 | Audiovisual temporal sensitivity in typical and dyslexic adult readers. Ana A. Francisco, Alexandra Jesse, Margriet A. Groen, James M. McQueen |
| 2014 | Automated closed captioning for Russian live broadcasting. Kirill Levin, Irina Ponomareva, Anna Bulusheva, German A. Chernykh, Ivan Medennikov, Nickolay Merkin, Alexey Prudnikov, Natalia A. Tomashenko |
| 2014 | Automated production of true-cased punctuated subtitles for weather and news broadcasts. Joris Driesen, Alexandra Birch, Simon Grimsey, Saeid Safarfashandi, Juliet Gauthier, Matt Simpson, Steve Renals |
| 2014 | Automatic animation of an articulatory tongue model from ultrasound images using Gaussian mixture regression. Diandra Fabre, Thomas Hueber, Pierre Badin |
| 2014 | Automatic assessment of children's reading with the FLaVoR decoding using a phone confusion model. Emre Yilmaz, Joris Pelemans, Hugo Van hamme |
| 2014 | Automatic detection of Parkinson's disease from words uttered in three different languages. Juan Rafael Orozco-Arroyave, Florian Hönig, Julián D. Arias-Londoño, Jesús Francisco Vargas-Bonilla, Sabine Skodda, Jan Rusz, Elmar Nöth |
| 2014 | Automatic estimation of the lip radiation effect in glottal inverse filtering. Manu Airaksinen, Tom Bäckström, Paavo Alku |
| 2014 | Automatic language identification using long short-term memory recurrent neural networks. Javier Gonzalez-Dominguez, Ignacio López-Moreno, Hasim Sak, Joaquin Gonzalez-Rodriguez, Pedro J. Moreno |
| 2014 | Automatic modelling of depressed speech: relevant features and relevance of gender. Florian Hönig, Anton Batliner, Elmar Nöth, Sebastian Schnieder, Jarek Krajewski |
| 2014 | Automatic recognition of attitudes in video blogs - prosodic and visual feature analysis. Noor Alhusna Madzlan, Jing Guang Han, Francesca Bonin, Nick Campbell |
| 2014 | Automatic recognition of speaker physical load using posterior probability based features from acoustic and phonetic tokens. Ming Li |
| 2014 | Automatic speech feature classification for children with cochlear implants. Jason Lilley, James J. Mahshie, H. Timothy Bunnell |
| 2014 | Automatic speech recognition and translation of a Swiss German dialect: Walliserdeutsch. Philip N. Garner, David Imseng, Thomas Meyer |
| 2014 | Automatic speech recognition with primarily temporal envelope information. Payton Lin, Fei Chen, Syu-Siang Wang, Ying-Hui Lai, Yu Tsao |
| 2014 | Automating an objective measure of pediatric speech intelligibility. Jason Lilley, Susan Nittrouer, H. Timothy Bunnell |
| 2014 | Autoregressive product of multi-frame predictions can improve the accuracy of hybrid models. Navdeep Jaitly, Vincent Vanhoucke, Geoffrey E. Hinton |
| 2014 | BUT 2014 Babel system: analysis of adaptation in NN based systems. Martin Karafiát, Frantisek Grézl, Karel Veselý, Mirko Hannemann, Igor Szöke, Jan Cernocký |
| 2014 | Backoff inspired features for maximum entropy language models. Fadi Biadsy, Keith B. Hall, Pedro J. Moreno, Brian Roark |
| 2014 | Bayesian calibration for forensic evidence reporting. Niko Brümmer, Albert Swart |
| 2014 | Bayesian factorization and selection for speech and music separation. Po-Kai Yang, Chung-Chien Hsu, Jen-Tzung Chien |
| 2014 | Beyond cross-entropy: towards better frame-level objective functions for deep neural network training in automatic speech recognition. Zhen Huang, Jinyu Li, Chao Weng, Chin-Hui Lee |
| 2014 | Binary mask estimation based on frequency modulations. Chung-Chien Hsu, Jen-Tzung Chien, Tai-Shih Chi |
| 2014 | Binaural deep neural network classification for reverberant speech segregation. Yi Jiang, DeLiang Wang, Runsheng Liu |
| 2014 | BioKIT - real-time decoder for biosignal processing. Dominic Telaar, Michael Wand, Dirk Gehrig, Felix Putze, Christoph Amma, Dominic Heger, Ngoc Thang Vu, Mark Erhardt, Tim Schlippe, Matthias Janke, Christian Herff, Tanja Schultz |
| 2014 | Blind source extraction based on a direction-dependent a-priori SNR. Lukas Pfeifenberger, Franz Pernkopf |
| 2014 | Blind speech source localization, counting and separation for 2-channel convolutive mixtures in a reverberant environment. Sayeh Mirzaei, Hugo Van hamme, Yaser Norouzi |
| 2014 | Boosted deep neural networks and multi-resolution cochleagram features for voice activity detection. Xiao-Lei Zhang, DeLiang Wang |
| 2014 | Boosting bonsai trees for efficient features combination: application to speaker role identification. Antoine Laurent, Nathalie Camelin, Christian Raymond |
| 2014 | Boundary contraction training for acoustic models based on discrete deep neural networks. Ryu Takeda, Naoyuki Kanda, Nobuo Nukaga |
| 2014 | Building a naturalistic emotional speech corpus by retrieving expressive behaviors from existing speech corpora. Soroosh Mariooryad, Reza Lotfian, Carlos Busso |
| 2014 | Building a vocabulary self-learning speech recognition system. Long Qin, Alexander I. Rudnicky |
| 2014 | Building resources for Algerian Arabic dialects. Salima Harrat, Karima Meftouh, Mourad Abbas, Kamel Smaïli |
| 2014 | Can adolescents with autism perceive emotional prosody? Cristiane Hsu, Yi Xu |
| 2014 | Canonical correlation analysis and local Fisher discriminant analysis based multi-view acoustic feature reduction for physical load prediction. Heysem Kaya, Tugçe Özkaptan, Albert Ali Salah, Sadik Fikret Gürgen |
| 2014 | Chaotic mixed excitation source for speech synthesis. Hemant A. Patil, Tanvina B. Patel |
| 2014 | Choosing useful word alternates for automatic speech recognition correction interfaces. David Harwath, Alexander Gruenstein, Ian McGraw |
| 2014 | Classification of cognitive load from speech using an i-vector framework. Maarten Van Segbroeck, Ruchir Travadi, Colin Vaz, Jangwon Kim, Matthew P. Black, Alexandros Potamianos, Shrikanth S. Narayanan |
| 2014 | Cluster based Chinese abbreviation modeling. Yangyang Shi, Yi-Cheng Pan, Mei-Yuh Hwang |
| 2014 | Clustering-based i-vector formulation for speaker recognition. Hung-Shin Lee, Yu Tsao, Hsin-Min Wang, Shyh-Kang Jeng |
| 2014 | Co-channel speech detection via spectral analysis of frequency modulated sub-bands. Navid Shokouhi, Seyed Omid Sadjadi, John H. L. Hansen |
| 2014 | Collecting a corpus of Dutch noise-induced 'slips of the ear'. Odette Scharenborg, Eric Sanders, Bert Cranen |
| 2014 | Combination of FST and CN search in spoken term detection. Justin T. Chiu, Yun Wang, Jan Trmal, Daniel Povey, Guoguo Chen, Alexander I. Rudnicky |
| 2014 | Combination of multilingual and semi-supervised training for under-resourced languages. Frantisek Grézl, Martin Karafiát |
| 2014 | Combining recurrent neural networks and factored language models during decoding of code-switching speech. Heike Adel, Dominic Telaar, Ngoc Thang Vu, Katrin Kirchhoff, Tanja Schultz |
| 2014 | Combining source and system information for limited data speaker verification. Rohan Kumar Das, S. Abhiram, S. R. M. Prasanna, A. G. Ramakrishnan |
| 2014 | Combining tandem and hybrid systems for improved speech recognition and keyword spotting on low resource languages. Shakti P. Rath, Kate M. Knill, Anton Ragni, Mark J. F. Gales |
| 2014 | Comparing approaches to convert recurrent neural networks into backoff language models for efficient decoding. Heike Adel, Katrin Kirchhoff, Ngoc Thang Vu, Dominic Telaar, Tanja Schultz |
| 2014 | Comparing decoding strategies for subword-based keyword spotting in low-resourced languages. William Hartmann, Viet Bac Le, Abdelkhalek Messaoudi, Lori Lamel, Jean-Luc Gauvain |
| 2014 | Comparing parameterizations of pitch register and its discontinuities at prosodic boundaries for Hungarian. Uwe D. Reichel, Katalin Mády |
| 2014 | Comparing reaction time sequences from human participants and computational models. Louis ten Bosch, Mirjam Ernestus, Lou Boves |
| 2014 | Comparing time-frequency representations for directional derivative features. James Gibson, Maarten Van Segbroeck, Shrikanth S. Narayanan |
| 2014 | Comparison of speech quality with and without sensors in electromagnetic articulograph AG 501 recording. Nisha Meenakshi, Chiranjeevi Yarra, B. K. Yamini, Prasanta Kumar Ghosh |
| 2014 | Comparison of vocal tract transfer functions calculated using one-dimensional and three-dimensional acoustic simulation methods. Hironori Takemoto, Parham Mokhtari, Tatsuya Kitamura |
| 2014 | Component structuring and trajectory modeling for speech recognition. Arseniy Gorin, Denis Jouvet |
| 2014 | Concept-to-speech generation by integrating syntagmatic features into HMM-based speech synthesis. Xin Wang, Zhen-Hua Ling, Li-Rong Dai |
| 2014 | Consonant context effects on vowel sensorimotor adaptation. Jeffrey Berry, John Jaeger IV, Melissa Wiedenhoeft, Brittany Bernal, Michael T. Johnson |
| 2014 | Constrained speaker linking. David A. van Leeuwen, Niko Brümmer |
| 2014 | Content matching for short duration speaker recognition. Nicolas Scheffer, Yun Lei |
| 2014 | Context-dependent pronunciation error pattern discovery with limited annotations. Ann Lee, James R. Glass |
| 2014 | Contribution of tongue lateral to consonant production. Jun Wang, William F. Katz, Thomas F. Campbell |
| 2014 | Conversational structures affecting auditory likeability. Benjamin Weiss, Katrin Schoenenberg |
| 2014 | Conversion from facial myoelectric signals to speech: a unit selection approach. Marlene Zahner, Matthias Janke, Michael Wand, Tanja Schultz |
| 2014 | Convolutional deep maxout networks for phone recognition. László Tóth |
| 2014 | Corpus-based L2 phonological data and semi-automatic perceptual analysis: the case of nasal vowels produced by beginner Japanese learners of French. Sylvain Detey, Isabelle Racine, Julien Eychenne, Yuji Kawaguchi |
| 2014 | Corpus-testing a fricative discriminator; or, just how invariant is this invariant? Philip J. Roberts, Henning Reetz, Aditi Lahiri |
| 2014 | Cost-level integration of statistical and rule-based dialog managers. Shinji Watanabe, John R. Hershey, Tim K. Marks, Youichi Fujii, Yusuke Koji |
| 2014 | Cross-language perception of Japanese singleton and geminate consonants: preliminary data from non-native learners of Japanese and native speakers of Italian and Australian English. Kimiko Tsukada, Felicity Cox, John Hajek |
| 2014 | Cross-language transfer of semantic annotation via targeted crowdsourcing. Shammur Absar Chowdhury, Arindam Ghosh, Evgeny A. Stepanov, Ali Orkan Bayer, Giuseppe Riccardi, Ioannis Klasinas |
| 2014 | Cross-lingual adaptation with multi-task adaptive networks. Peter Bell, Joris Driesen, Steve Renals |
| 2014 | Cross-lingual voice conversion-based polyglot speech synthesizer for Indian languages. B. Ramani, M. P. Actlin Jeeva, P. Vijayalakshmi, T. Nagarajan |
| 2014 | Cross-linguistic investigations of oral and silent reading. Christophe Coupé, Yoon Mi Oh, François Pellegrino, Egidio Marsico |
| 2014 | Crowdee: mobile crowdsourcing micro-task platform for celebrating the diversity of languages. Babak Naderi, Tim Polzehl, André Beyer, Tibor Pilz, Sebastian Möller |
| 2014 | Crowdsourcing for situated dialog systems in a moving car. Teruhisa Misu |
| 2014 | DIAPIX-FL: a symmetric corpus of problem-solving dialogues in first and second languages. Mirjam Wester, María Luisa García Lecumberri, Martin Cooke |
| 2014 | DNN-based stochastic postfilter for HMM-based speech synthesis. Ling-Hui Chen, Tuomo Raitio, Cassia Valentini-Botinhao, Junichi Yamagishi, Zhen-Hua Ling |
| 2014 | Data augmentation for low resource languages. Anton Ragni, Kate M. Knill, Shakti P. Rath, Mark J. F. Gales |
| 2014 | Data augmentation, feature combination, and multilingual neural networks to improve ASR and KWS performance for low-resource languages. Zoltán Tüske, Pavel Golik, David Nolden, Ralf Schlüter, Hermann Ney |
| 2014 | Data-driven generation of text balloons based on linguistic and acoustic features of a comics-anime corpus. Sho Matsumiya, Sakriani Sakti, Graham Neubig, Tomoki Toda, Satoshi Nakamura |
| 2014 | Decision learning in data science: where John Nash meets social media. K. J. Ray Liu |
| 2014 | Decorrelated innovative codebooks for ACELP using factorization of autocorrelation matrix. Tom Bäckström, Christian R. Helmrich |
| 2014 | Deep neural network based trainable voice source model for synthesis of speech with varying vocal effort. Tuomo Raitio, Antti Suni, Lauri Juvela, Martti Vainio, Paavo Alku |
| 2014 | Deep neural network bottleneck features for generalized variable parameter HMMs. Xurong Xie, Rongfeng Su, Xunying Liu, Lan Wang |
| 2014 | Deep scattering spectra with deep neural networks for LVCSR tasks. Tara N. Sainath, Vijayaditya Peddinti, Brian Kingsbury, Petr Fousek, Bhuvana Ramabhadran, David Nahamoo |
| 2014 | Detecting and labeling speakers on overlapping speech using vector Taylor series. Pranay Dighe, Marc Ferras, Hervé Bourlard |
| 2014 | Detecting articulatory compensation in acoustic data through linear regression modeling. Alina Khasanova, Jennifer Cole, Mark Hasegawa-Johnson |
| 2014 | Detecting incorrectly-segmented utterances for posteriori restoration of turn-taking and ASR results. Naoki Hotta, Kazunori Komatani, Satoshi Sato, Mikio Nakano |
| 2014 | Detecting out-of-domain utterances addressed to a virtual personal assistant. Gökhan Tür, Anoop Deoras, Dilek Hakkani-Tür |
| 2014 | Detecting proximity from personal audio recordings. Daniel P. W. Ellis, Hiroyuki Satoh, Zhuo Chen |
| 2014 | Detecting speaker roles and topic changes in multiparty conversations using latent topic models. Ashtosh Sapru, Hervé Bourlard |
| 2014 | Detecting the intensity of cognitive and physical load using AdaBoost and deep rectifier neural networks. Gábor Gosztolya, Tamás Grósz, Róbert Busa-Fekete, László Tóth |
| 2014 | Detecting the number of competing speakers - human selective hearing versus spectrogram distance based estimator. Valentin Andrei, Horia Cucu, Andi Buzo, Corneliu Burileanu |
| 2014 | Detection of children's paralinguistic events in interaction with caregivers. Hrishikesh Rao, Jonathan C. Kim, Mark A. Clements, Agata Rozga, Daniel S. Messinger |
| 2014 | Detection of vowel onset points in voiced aspirated sounds of Indian languages. Biswajit Dev Sarma, S. R. M. Prasanna |
| 2014 | Developing STT and KWS systems using limited language resources. Viet Bac Le, Lori Lamel, Abdelkhalek Messaoudi, William Hartmann, Jean-Luc Gauvain, Cécile Woehrling, Julien Despres, Anindya Roy |
| 2014 | Development of bilingual ASR system for MediaParl corpus. Petr Motlícek, David Imseng, Milos Cernak, Namhoon Kim |
| 2014 | Diagnostic techniques for spoken keyword discovery. Peter F. Schulam, Murat Akbacak |
| 2014 | Dialect levelling in Finnish: a universal speech attribute approach. Hamid Behravan, Ville Hautamäki, Sabato Marco Siniscalchi, Elie Khoury, Tommi Kurki, Tomi Kinnunen, Chin-Hui Lee |
| 2014 | Dialogue context sensitive speech synthesis using factorized decision trees. Pirros Tsiakoulis, Catherine Breslin, Milica Gasic, Matthew Henderson, Dongho Kim, Steve J. Young |
| 2014 | Diarizing large corpora using multi-modal speaker linking. Marc Ferras, Stefano Masneri, Oliver Schreer, Hervé Bourlard |
| 2014 | Dictionary-based pitch tracking with dynamic programming. Ewout van den Berg, Bhuvana Ramabhadran |
| 2014 | Differences of pitch profiles in Germanic and Slavic languages. Bistra Andreeva, Grazyna Demenko, Bernd Möbius, Frank Zimmerer, Jeanin Jügler, Magdalena Oleskowicz-Popiel |
| 2014 | Difficulty in discriminating non-native vowels: are Dutch vowels easier for Australian English than Spanish listeners? Samra Alispahic, Paola Escudero, Karen E. Mulak |
| 2014 | Diphthongized vowels in the Yi County Hui Chinese dialect. Fang Hu, Minghui Zhang |
| 2014 | Direct F Kou Tanaka, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura |
| 2014 | Direct word graph rescoring using A* search and RNNLM. Shahab Jalalvand, Daniele Falavigna |
| 2014 | Direction-of-arrival estimation of multiple speakers using a planar array. Dongwen Ying, Ruohua Zhou, Junfeng Li, Jielin Pan, Yonghong Yan |
| 2014 | Discriminative NMF and its application to single-channel source separation. Felix Weninger, Jonathan Le Roux, John R. Hershey, Shinji Watanabe |
| 2014 | Discriminative pronunciation modeling for dialectal speech recognition. Maider Lehr, Kyle Gorman, Izhak Shafran |
| 2014 | Distributed asynchronous optimization of convolutional neural networks. William Chan, Ian R. Lane |
| 2014 | Distributed learning of multilingual DNN feature extractors using GPUs. Yajie Miao, Hao Zhang, Florian Metze |
| 2014 | Does elderly speech recognition in noise benefit from spectral and visual cues? Yatin Mahajan, Jeesun Kim, Chris Davis |
| 2014 | Domain adaptation for text dependent speaker verification. Hagai Aronowitz, Asaf Rendel |
| 2014 | Dutch vowel production by Spanish learners: duration and spectral features. Pepi Burgos, Mátyás Jani, Catia Cucchiarini, Roeland van Hout, Helmer Strik |
| 2014 | Dynamic noise aware training for speech enhancement based on deep neural networks. Yong Xu, Jun Du, Li-Rong Dai, Chin-Hui Lee |
| 2014 | Dynamic stream weight estimation in coupled-HMM-based audio-visual speech recognition using multilayer perceptrons. Ahmed Hussen Abdelaziz, Dorothea Kolossa |
| 2014 | Effect of frequency weighting on MLP-based speaker canonicalization. Yuichi Kubota, Motoi Omachi, Tetsuji Ogawa, Tetsunori Kobayashi, Tsuneo Nitta |
| 2014 | Effect of long-term ageing on i-vector speaker verification. Finnian Kelly, Rahim Saeidi, Naomi Harte, David A. van Leeuwen |
| 2014 | Effect of spectral degradation to the intelligibility of vowel sentences. Fei Chen, Sharon W. K. Wong, Lena L. N. Wong |
| 2014 | Effective modulation spectrum factorization for robust speech recognition. Yu-Chen Kao, Yi-Ting Wang, Berlin Chen |
| 2014 | Efficient GPU-based training of recurrent neural network language models using spliced sentence bunch. Xie Chen, Yongqiang Wang, Xunying Liu, Mark J. F. Gales, Philip C. Woodland |
| 2014 | Emotional speech classification using adaptive sinusoidal modelling. Theodora Yakoumaki, George P. Kafentzis, Yannis Stylianou |
| 2014 | Enabling controllability for continuous expression space. Langzhou Chen, Norbert Braunschweiler |
| 2014 | Encoding linear models as weighted finite-state transducers. Ke Wu, Cyril Allauzen, Keith B. Hall, Michael Riley, Brian Roark |
| 2014 | English consonant confusions by Greek listeners in quiet and noise and the role of phonological short-term memory. Angelos Lengeris, Katerina Nicolaidis |
| 2014 | Enhanced language modeling for extractive speech summarization with sentence relatedness information. Shih-Hung Liu, Kuan-Yu Chen, Yu-Lun Hsieh, Berlin Chen, Hsin-Min Wang, Hsu-Chun Yen, Wen-Lian Hsu |
| 2014 | Enhanced muting method in packet loss concealment of ITU-T G.722 using sigmoid function with on-line optimized parameters. Bong-Ki Lee, Inyoung Hwang, Jihwan Park, Joon-Hyuk Chang |
| 2014 | Enhancement of speech intelligibility in near-end noise conditions with phase modification. Emma Jokinen, Marko Takanen, Hannu Pulakka, Paavo Alku |
| 2014 | Enhancing audio source separability using spectro-temporal regularization with NMF. Colin Vaz, Dimitrios Dimitriadis, Shrikanth S. Narayanan |
| 2014 | Enhancing multimodal silent speech interfaces with feature selection. João Freitas, Artur J. Ferreira, Mário A. T. Figueiredo, António J. S. Teixeira, Miguel Sales Dias |
| 2014 | Ensemble deep learning for speech recognition. Li Deng, John C. Platt |
| 2014 | Ensemble modeling of denoising autoencoder for speech spectrum restoration. Xugang Lu, Yu Tsao, Shigeki Matsuda, Chiori Hori |
| 2014 | Ensemble of machine learning algorithms for cognitive and physical speaker load detection. How Jing, Ting-Yao Hu, Hung-Shin Lee, Wei-Chen Chen, Chi-Chun Lee, Yu Tsao, Hsin-Min Wang |
| 2014 | Error correction of automatic speech recognition based on normalized web distance. E. Byambakhishig, Katsuyuki Tanaka, Ryo Aihara, Toru Nakashika, Tetsuya Takiguchi, Yasuo Ariki |
| 2014 | Error patterns of Mandarin disyllabic tones by Japanese learners. Jung-Yueh Tu, Yuwen Hsiung, Min-Da Wu, Yao-Ting Sung |
| 2014 | Estimation of the movement trajectories of non-crucial articulators based on the detection of crucial moments and physiological constraints. Jangwon Kim, Sungbok Lee, Shrikanth S. Narayanan |
| 2014 | Estimation of vocal-tract shape from speech spectrum and speech resynthesis based on a generative model. Tokihiko Kaburagi |
| 2014 | Euronews: a multilingual benchmark for ASR and LID. Roberto Gretter |
| 2014 | Evaluating coherence in open domain conversational systems. Ryuichiro Higashinaka, Toyomi Meguro, Kenji Imamura, Hiroaki Sugiyama, Toshiro Makino, Yoshihiro Matsuo |
| 2014 | Evaluating robust features on deep neural networks for speech recognition in noisy and channel mismatched conditions. Vikramjit Mitra, Wen Wang, Horacio Franco, Yun Lei, Chris Bartels, Martin Graciarena |
| 2014 | Evaluating speech features with the minimal-pair ABX task (II): resistance to noise. Thomas Schatz, Vijayaditya Peddinti, Xuan-Nga Cao, Francis R. Bach, Hynek Hermansky, Emmanuel Dupoux |
| 2014 | Evaluation of dictionary for sparse coding in speech processing. Yongjun He, Guanglu Sun, Guibin Zheng, Jiqing Han |
| 2014 | Excitation source analysis for high-quality speech manipulation systems based on an interference-free representation of group delay with minimum phase response compensation. Hideki Kawahara, Masanori Morise, Tomoki Toda, Hideki Banno, Ryuichi Nisimura, Toshio Irino |
| 2014 | Excitation source features for discrimination of anger and happy emotions. P. Gangamohan, Sudarsana Reddy Kadiri, Suryakanth V. Gangashetty, B. Yegnanarayana |
| 2014 | Experiments on deep learning for speech denoising. Ding Liu, Paris Smaragdis, Minje Kim |
| 2014 | Exploiting vocal-source features to improve ASR accuracy for low-resource languages. Raul Fernandez, Jia Cui, Andrew Rosenberg, Bhuvana Ramabhadran, Xiaodong Cui |
| 2014 | Exploring modulation spectrum features for speech-based depression level classification. Elif Bozkurt, Orith Toledo-Ronen, Alexander Sorin, Ron Hoory |
| 2014 | Extended RSR2015 for text-dependent speaker verification over VHF channel. Anthony Larcher, Kong-Aik Lee, Pablo Luis Sordo Martinez, Trung Hieu Nguyen, Bin Ma, Haizhou Li |
| 2014 | Extending Limabeam with discrimination and coarse gradients. Charles Fox, Thomas Hain |
| 2014 | F0 estimation in noisy speech based on long-term harmonic feature analysis combined with neural network classification. Dongmei Wang, Philipos C. Loizou, John H. L. Hansen |
| 2014 | Factor analysis based semantic variability compensation for automatic conversation representation. Mohamed Bouallegue, Mohamed Morchid, Richard Dufour, Driss Matrouf, Georges Linarès, Renato De Mori |
| 2014 | Factor analysis with sampling methods for text dependent speaker recognition. Antonio Miguel, Jesús Antonio Villalba López, Alfonso Ortega, Eduardo Lleida, Carlos Vaquero |
| 2014 | Feature Switching in the i-vector framework for speaker verification. T. Asha, M. S. Saranya, D. S. Karthik Pandia, Srikanth R. Madikeri, Hema A. Murthy |
| 2014 | Feature extraction from analytic phase of speech signals for speaker verification. Karthika Vijayan, Vinay Kumar, K. Sri Rama Murty |
| 2014 | Feature space maximum a posteriori linear regression for adaptation of deep neural networks. Zhen Huang, Jinyu Li, Sabato Marco Siniscalchi, I-Fan Chen, Chao Weng, Chin-Hui Lee |
| 2014 | Feed forward pre-training for recurrent neural network language models. Siva Reddy Gangireddy, Fergus McInnes, Steve Renals |
| 2014 | Filtering and subspace selection for spectral features in detecting speech under physical stress. Jouni Pohjalainen, Paavo Alku |
| 2014 | Foreign accent recognition based on temporal information contained in lowpass-filtered speech. Marie-José Kolly, Adrian Leemann, Volker Dellwo |
| 2014 | Formant enhancement based speech watermarking for tampering detection. Shengbei Wang, Masashi Unoki, Nam Soo Kim |
| 2014 | Formant-controlled speech synthesis using hidden trajectory model. Ming-Qi Cai, Zhen-Hua Ling, Li-Rong Dai |
| 2014 | Fusion of knowledge-based and data-driven approaches to grammar induction. Spiros Georgiladakis, Christina Unger, Elias Iosif, Sebastian Walter, Philipp Cimiano, Euripides G. M. Petrakis, Alexandros Potamianos |
| 2014 | GMM-based bandwidth extension using sub-band basis spectrum model. Yamato Ohtani, Masatsune Tamura, Masahiro Morita, Masami Akamine |
| 2014 | Generalizing time-frequency importance functions across noises, talkers, and phonemes. Michael I. Mandel, Sarah E. Yoho, Eric W. Healy |
| 2014 | Generating multiple-accent pronunciations for TTS using joint sequence model interpolation. BalaKrishna Kolluru, Vincent Wan, Javier Latorre, Kayoko Yanagisawa, Mark J. F. Gales |
| 2014 | Generating segmental foreign accent. María Luisa García Lecumberri, Roberto Barra-Chicote, Rubén Pérez Ramón, Junichi Yamagishi, Martin Cooke |
| 2014 | Generation of F0 contour using deep Boltzmann machine and twin Gaussian process hybrid model for Bengali language. Sankar Mukherjee, Shyamal Kumar Das Mandal |
| 2014 | Graph-based re-ranking using acoustic feature similarity between search results for spoken term detection on low-resource languages. Hung-yi Lee, Yu Zhang, Ekapol Chuangsuwanich, James R. Glass |
| 2014 | Grounding language models in spatiotemporal context. Brandon C. Roy, Soroush Vosoughi, Deb Roy |
| 2014 | Hierarchical modeling of F0 contours for voice conversion. Gerard Sanchez, Hanna Silén, Jani Nurminen, Moncef Gabbouj |
| 2014 | High-level speech event analysis for cognitive load classification. Claude Montacié, Marie-José Caraty |
| 2014 | High-order sequence modeling using speaker-dependent recurrent temporal restricted Boltzmann machines for voice conversion. Toru Nakashika, Tetsuya Takiguchi, Yasuo Ariki |
| 2014 | Hybrid MLP/structured-SVM tandem systems for large vocabulary and robust ASR. Suman V. Ravuri |
| 2014 | Hybrid language models for speech transcription. Luiza Orosanu, Denis Jouvet |
| 2014 | Hypotheses ranking for robust domain classification and tracking in dialogue systems. Jean-Philippe Robichaud, Paul A. Crook, Puyang Xu, Omar Zia Khan, Ruhi Sarikaya |
| 2014 | I Minghui Dong, Siu Wa Lee, Haizhou Li, Paul Y. Chan, Xuejian Peng, Jochen Walter Ehnes, Dong-Yan Huang |
| 2014 | I-vector based representation of highly imperfect automatic transcriptions. Mohamed Morchid, Mohamed Bouallegue, Richard Dufour, Georges Linarès, Driss Matrouf, Renato De Mori |
| 2014 | I-vector speaker verification based on phonetic information under transmission channel effects. Laura Fernández Gallardo, Michael Wagner, Sebastian Möller |
| 2014 | Identification of age-group from children's speech by computers and humans. Saeid Safavi, Martin J. Russell, Peter Jancovic |
| 2014 | Identifying contributors in the BBC world service archive. Yves Raimond, Thomas Nixon |
| 2014 | Identifying the human-machine differences in complex binaural scenes: what can be learned from our auditory system. Constantin Spille, Bernd T. Meyer |
| 2014 | Impact of age in the production of European Portuguese vowels. Luciana Albuquerque, Catarina Oliveira, António J. S. Teixeira, Pedro Sá-Couto, João Freitas, Miguel Sales Dias |
| 2014 | Improving ASR performance on non-native speech using multilingual and crosslingual information. Ngoc Thang Vu, Yuanfan Wang, Marten Klose, Zlatka Mihaylova, Tanja Schultz |
| 2014 | Improving Mandarin prosodic boundary prediction with rich syntactic features. Hao Che, Jianhua Tao, Ya Li |
| 2014 | Improving deep neural network acoustic modeling for audio corpus indexing under the IARPA babel program. Xiaodong Cui, Brian Kingsbury, Jia Cui, Bhuvana Ramabhadran, Andrew Rosenberg, Mohammad Sadegh Rasooli, Owen Rambow, Nizar Habash, Vaibhava Goel |
| 2014 | Improving language-universal feature extraction with deep maxout and convolutional neural networks. Yajie Miao, Florian Metze |
| 2014 | Improving named entity recognition with prosodic features. Denys Katerenchuk, Andrew Rosenberg |
| 2014 | Improving native accent identification using deep neural networks. Mingming Chen, Zhanlei Yang, Hao Zheng, Wenju Liu |
| 2014 | Improving semi-supervised deep neural network for keyword search in low resource languages. Roger Hsiao, Tim Ng, Le Zhang, Shivesh Ranjan, Stavros Tsakalidis, Long Nguyen, Richard M. Schwartz |
| 2014 | Improving spoken document retrieval by unsupervised language model adaptation using utterance-based web search. Robert Herms, Marc Ritter, Thomas Wilhelm-Stein, Maximilian Eibl |
| 2014 | Improving the performance of far-field speaker verification using multi-condition training: the case of GMM-UBM and i-vector systems. Anderson R. Avila, Milton Orlando Sarria-Paja, Francisco J. Fraga, Douglas D. O'Shaughnessy, Tiago H. Falk |
| 2014 | Improving the speech activity detection for the DARPA RATS phase-3 evaluation. Jeff Ma |
| 2014 | Improving wideband acoustic models using mixed-bandwidth training data via DNN adaptation. Zhao You, Bo Xu |
| 2014 | In-domain versus out-of-domain training for text-dependent JFA. Patrick Kenny, Themos Stafylakis, Md. Jahangir Alam, Pierre Ouellet, Marcel Kockmann |
| 2014 | Incorporating lexical and prosodic information at different levels for meeting summarization. Catherine Lai, Steve Renals |
| 2014 | Incremental dialog processing in a task-oriented dialog. Fabrizio Ghigi, Maxine Eskénazi, M. Inés Torres, Sungjin Lee |
| 2014 | Incremental on-line adaptation of POMDP-based dialogue managers to extended domains. Milica Gasic, Dongho Kim, Pirros Tsiakoulis, Catherine Breslin, Matthew Henderson, Martin Szummer, Blaise Thomson, Steve J. Young |
| 2014 | Infant-directed speech enhances temporal rhythmic structure in the envelope. Victoria Leong, Marina Kalashnikova, Denis Burnham, Usha Goswami |
| 2014 | Influences of tone sandhi on word recognition in preschool children. Dilu Wewalaarachchi, Leher Singh |
| 2014 | Integrating sequence information in the audio-visual detection of word prominence in a human-machine interaction scenario. Andrea Schnall, Martin Heckmann |
| 2014 | Intelligibility analysis of fast synthesized speech. Cassia Valentini-Botinhao, Markus Toman, Michael Pucher, Dietmar Schabus, Junichi Yamagishi |
| 2014 | Intelligibility of high-pitched vowel sounds in the singing and speaking of a female Cantonese opera singer. Dieter Maurer, Peggy Mok, Daniel Friedrichs, Volker Dellwo |
| 2014 | Interlingual map task corpus collection. Hayakawa Akira, Nick Campbell, Saturnino Luz |
| 2014 | Interplay of informational content and energetic masking in speech perception in noise. Vincent Aubanel, Chris Davis, Jeesun Kim |
| 2014 | Intonational phonology and prosodic hierarchy in Malay. Diyana Hamzah, James Sneed German |
| 2014 | Intrinsic spectral analysis based on temporal context features for query-by-example spoken term detection. Peng Yang, Cheung-Chi Leung, Lei Xie, Bin Ma, Haizhou Li |
| 2014 | Introducing i-vectors for joint anti-spoofing and speaker verification. Elie Khoury, Tomi Kinnunen, Aleksandr Sizov, Zhizheng Wu, Sébastien Marcel |
| 2014 | Inverse reinforcement learning for micro-turn management. Dongho Kim, Catherine Breslin, Pirros Tsiakoulis, Milica Gasic, Matthew Henderson, Steve J. Young |
| 2014 | Investigating NMF speech enhancement for neural network based acoustic models. Jürgen T. Geiger, Jort F. Gemmeke, Björn W. Schuller, Gerhard Rigoll |
| 2014 | Investigating automatic & human filled pause insertion for speech synthesis. Rasmus Dall, Marcus Tomalin, Mirjam Wester, William J. Byrne, Simon King |
| 2014 | Investigating prosodic relations between initiating and responding laughs. Khiet P. Truong, Jürgen Trouvain |
| 2014 | Investigating source and filter contributions, and their interaction, to statistical parametric speech synthesis. Thomas Merritt, Tuomo Raitio, Simon King |
| 2014 | Investigating the effect of F0 and vocal intensity on harmonic magnitudes: data from high-speed laryngeal videoendoscopy. Gang Chen, Soo Jin Park, Jody Kreiman, Abeer Alwan |
| 2014 | Investigating the learning effect of multilingual bottle-neck features for ASR. Ngoc Thang Vu, Jochen Weiner, Tanja Schultz |
| 2014 | Investigation of cross-lingual bottleneck features in hybrid ASR systems. Jie Li, Rong Zheng, Bo Xu |
| 2014 | Investigation of deep neural networks for robust recognition of nonlinearly distorted speech. Ladislav Seps, Jirí Málek, Petr Cerva, Jan Nouza |
| 2014 | Investigation of the relative perceptual importance of temporal envelope and temporal fine structure between tonal and non-tonal languages. Dongmei Wang, James M. Kates, John H. L. Hansen |
| 2014 | Is incremental cross-show speaker diarization efficient for processing large volumes of data? Grégor Dupuy, Sylvain Meignier, Yannick Estève |
| 2014 | Is speech rhythm an intrinsic property of language? Jason Brown, Eden Matene |
| 2014 | Iterative refinement of amplitude and phase in single-channel speech enhancement. Pejman Mowlaee, Mario Kaoru Watanabe, Rahim Saeidi |
| 2014 | Joint adaptation and adaptive training of TVWR for robust automatic speech recognition. Shilin Liu, Khe Chai Sim |
| 2014 | Joint filtering and factorization for recovering latent structure from noisy speech data. Colin Vaz, Vikram Ramanarayanan, Shrikanth S. Narayanan |
| 2014 | Joint nonnegative matrix factorization for exemplar-based voice conversion. Zhizheng Wu, Chng Eng Siong, Haizhou Li |
| 2014 | Joint sequence training of phone and grapheme acoustic model based on multi-task learning deep neural networks. Dongpeng Chen, Brian Mak, Sunil Sivadas |
| 2014 | Kernel density-based acoustic model with cross-lingual bottleneck features for resource limited LVCSR. Van Hai Do, Xiong Xiao, Chng Eng Siong, Haizhou Li |
| 2014 | Language ID-based training of multilingual stacked bottleneck features. Yu Zhang, Ekapol Chuangsuwanich, James R. Glass |
| 2014 | Language diversity: speech processing in a multi-lingual context. Lori Lamel |
| 2014 | Language identification of code switching sentences and multilingual sentences of under-resourced languages by using multi structural word information. Yin-Lai Yeong, Tien-Ping Tan |
| 2014 | Language identification of individual words with joint sequence models. Oluwapelumi Giwa, Marelie H. Davel |
| 2014 | Language independent and unsupervised acoustic models for speech recognition and keyword spotting. Kate M. Knill, Mark J. F. Gales, Anton Ragni, Shakti P. Rath |
| 2014 | Language modeling with sum-product networks. Wei-Chen Cheng, Stanley Kok, Hoai Vu Pham, Hai Leong Chieu, Kian Ming Adam Chai |
| 2014 | Language recognition using phonotactic-based shifted delta coefficients and multiple phone recognizers. Luis Fernando D'Haro, Ricardo de Córdoba, Christian Salamea Palacios, Javier Ferreiros |
| 2014 | Large-margin conditional random fields for single-microphone speech separation. Yu Ting Yeung, Tan Lee, Cheung-Chi Leung |
| 2014 | Lateral formants in three Central Australian languages. Marija Tabain, Andrew Butcher, Gavan Breen, Richard Beare |
| 2014 | Lattice decoding and rescoring with long-span neural network language models. Martin Sundermeyer, Zoltán Tüske, Ralf Schlüter, Hermann Ney |
| 2014 | Learning L2 prosody is more difficult than you realize - F0 characteristics and chunking size of L1 English, TW L2 English and TW L1 Mandarin. Chiu-yu Tseng, Chao-yu Su |
| 2014 | Learning about speech. Anne Cutler |
| 2014 | Learning conditional random field with hierarchical representations for dialogue act recognition. Yucan Zhou, Qinghua Hu, Jie Liu, Yuan Jia |
| 2014 | Learning continuous-valued word representations for phrase break prediction. Anandaswarup Vadapalli, Kishore Prahallad |
| 2014 | Learning phrase patterns for text classification using a knowledge graph and unlabeled data. Alex Marin, Roman Holenstein, Ruhi Sarikaya, Mari Ostendorf |
| 2014 | Learning situated knowledge bases through dialog. Aasish Pappu, Alexander I. Rudnicky |
| 2014 | Learning small-size DNN with output-distribution-based criteria. Jinyu Li, Rui Zhao, Jui-Ting Huang, Yifan Gong |
| 2014 | Least squares phase estimation of mixed signals. Carlos Eduardo Cancino-Chacón, Pejman Mowlaee |
| 2014 | Least squares signal declipping for robust speech recognition. Mark J. Harvilla, Richard M. Stern |
| 2014 | Lexical modeling for Arabic ASR: a systematic approach. Tuka Al Hanai, James R. Glass |
| 2014 | Lexical representation of consonant, vowels and tones in early childhood. Hwee Hwee Goh, Charlene Hu, Kheng Hui Yeo, Leher Singh |
| 2014 | Limited labels for unlimited data: active learning for speaker recognition. Stephen H. Shum, Najim Dehak, James R. Glass |
| 2014 | Lipreading approach for isolated digits recognition under whisper and neutral speech. Fei Tao, Carlos Busso |
| 2014 | Lipreading using convolutional neural network. Kuniaki Noda, Yuki Yamaguchi, Kazuhiro Nakadai, Hiroshi G. Okuno, Tetsuya Ogata |
| 2014 | Listen with your skin: aerotak speech perception enhancement system. Donald Derrick, Tom De Rybel, Greg A. O'Beirne, Jennifer Hay |
| 2014 | Listener estimation of speaker age based on whispered speech. Angelika Braun, Daniela Decker |
| 2014 | Long short-term memory recurrent neural network architectures for large scale acoustic modeling. Hasim Sak, Andrew W. Senior, Françoise Beaufays |
| 2014 | Low-resource open vocabulary keyword search using point process models. Chunxi Liu, Aren Jansen, Guoguo Chen, Keith Kintzley, Jan Trmal, Sanjeev Khudanpur |
| 2014 | LuciawebGL: a new WebGL-Based talking head. Alberto Benin, Piero Cosi, Giuseppe Riccardo Leone, Giulio Paci |
| 2014 | Manifold regularized deep neural networks. Vikrant Singh Tomar, Richard C. Rose |
| 2014 | Manipulating stance and involvement using collaborative tasks: an exploratory comparison. Valerie Freeman, Julian Chan, Gina-Anne Levow, Richard A. Wright, Mari Ostendorf, Victoria Zayats |
| 2014 | Mapping emotions into acoustic space: the role of voice quality. Ting Wang, Hongwei Ding, Jianjing Kuang, Qiuwu Ma |
| 2014 | Mappings between vocal tract area functions, vocal tract resonances and speech formants for multiple speakers. Catherine Inez Watson |
| 2014 | Measuring the perceptual effects of modelling assumptions in speech synthesis using stimuli constructed from repeated natural speech. Gustav Eje Henter, Thomas Merritt, Matt Shannon, Catherine Mayo, Simon King |
| 2014 | Methods for efficient semi-automatic pronunciation dictionary bootstrapping. Tim Schlippe, Matthias Merz, Tanja Schultz |
| 2014 | Microphone array post-filtering using supervised machine learning for speech enhancement. Pasi Pertilä, Joonas Nikunen |
| 2014 | Missing samples estimation in electromagnetic articulography data using equality constrained Kalman smoother. P. Sujith, Prasanta Kumar Ghosh |
| 2014 | Mixture of latent words language models for domain adaptation. Ryo Masumura, Taichi Asami, Takanobu Oba, Hirokazu Masataki, Sumitaka Sakauchi |
| 2014 | Model and feature based compensation for whispered speech recognition. Shabnam Ghaffarzadegan, Hynek Boril, John H. L. Hansen |
| 2014 | Modeling DCT parameterized F0 trajectory at intonation phrase level with DNN or decision tree. Xiang Yin, Ming Lei, Yao Qian, Frank K. Soong, Lei He, Zhen-Hua Ling, Li-Rong Dai |
| 2014 | Modeling coarticulation in continuous speech. Brian O. Bush, Alexander Kain |
| 2014 | Modeling long temporal contexts for robust DNN-based speech recognition. Bo Li, Khe Chai Sim |
| 2014 | Modeling pronunciation, rhythm, and intonation for automatic assessment of speech quality in aphasia rehabilitation. Duc Le, Emily Mower Provost |
| 2014 | Modeling therapist empathy through prosody in drug addiction counseling. Bo Xiao, Daniel Bone, Maarten Van Segbroeck, Zac E. Imel, David C. Atkins, Panayiotis G. Georgiou, Shrikanth S. Narayanan |
| 2014 | Modelling primitive streaming of simple tone sequences through factorisation of modulation pattern tensors. Tom Barker, Hugo Van hamme, Tuomas Virtanen |
| 2014 | Modified-prior i-vector estimation for language identification of short duration utterances. Ruchir Travadi, Maarten Van Segbroeck, Shrikanth S. Narayanan |
| 2014 | Motor control primitives arising from a learned dynamical systems model of speech articulation. Vikram Ramanarayanan, Louis Goldstein, Shrikanth S. Narayanan |
| 2014 | Multi-accent deep neural network acoustic model with accent-specific top layer using the KLD-regularized model adaptation. Yan Huang, Dong Yu, Chaojun Liu, Yifan Gong |
| 2014 | Multi-channel speech enhancement using sparse coding on local time-frequency structures. Zhiyuan Zhou, Zhaogui Ding, Weifeng Li, Zhiyong Wu, Longbiao Wang, Qingmin Liao |
| 2014 | Multi-domain disfluency and repair detection. Victoria Zayats, Mari Ostendorf, Hannaneh Hajishirzi |
| 2014 | Multi-pass sentence-end detection of lecture speech. Madina Hasan, Rama Doddipatla, Thomas Hain |
| 2014 | Multi-source posteriors for speech activity detection on public talks. Marc Ferras, Hervé Bourlard |
| 2014 | Multi-sources separation for sound source localization. Mariem Bouafif, Zied Lachiri |
| 2014 | Multichannel automatic recognition of voice command in a multi-room smart home: an experiment involving seniors and users with visual impairment. Michel Vacher, Benjamin Lecouteux, François Portet |
| 2014 | Multichannel speech dereverberation based on convolutive nonnegative tensor factorization for ASR applications. Seyedmahdad Mirsamadi, John H. L. Hansen |
| 2014 | Multimodal exemplar-based voice conversion using lip features in noisy environments. Kenta Masaka, Ryo Aihara, Tetsuya Takiguchi, Yasuo Ariki |
| 2014 | Multimodal understanding for person recognition in video broadcasts. Frédéric Béchet, Meriem Bendris, Delphine Charlet, Géraldine Damnati, Benoît Favre, Mickael Rouvier, Rémi Auguste, Benjamin Bigot, Richard Dufour, Corinne Fredouille, Georges Linarès, Jean Martinet, Grégory Senay, Pierre Tirilly |
| 2014 | Multiple-order non-negative matrix factorization for speech enhancement. Xabier Jaureguiberry, Emmanuel Vincent, Gaël Richard |
| 2014 | NMF-based speech enhancement incorporating deep neural network. Tae Gyoon Kang, Kisoo Kwon, Jong Won Shin, Nam Soo Kim |
| 2014 | Nasality in speech and its contribution to speaker individuality. Kanae Amino, Hisanori Makinae, Tatsuya Kitamura |
| 2014 | Nearest neighbor discriminant analysis for robust speaker recognition. Seyed Omid Sadjadi, Jason W. Pelecanos, Weizhong Zhu |
| 2014 | Neural network language models for low resource languages. Ankur Gandhe, Florian Metze, Ian R. Lane |
| 2014 | Neural network models for lexical addressee detection. Suman V. Ravuri, Andreas Stolcke |
| 2014 | Neural network phone duration model for speech recognition. Tanel Alumäe |
| 2014 | New insight into the use of phone log-likelihood ratios as features for language recognition. Mireia Díez, Amparo Varona, Mikel Peñagarikano, Luis Javier Rodríguez-Fuentes, Germán Bordel |
| 2014 | Noise robust speech recognition based on noise-adapted HMMs using speech feature compensation. Yong-Joo Chung |
| 2014 | Noise spectrum estimation using Gaussian mixture model-based speech presence probability for robust speech recognition. Md. Jahangir Alam, Patrick Kenny, Pierre Dumouchel, Douglas D. O'Shaughnessy |
| 2014 | Noise-robust TTS speaker adaptation with statistics smoothing. Kayoko Yanagisawa, Langzhou Chen, Mark J. F. Gales |
| 2014 | Noisy speech enhancement based on long term harmonic model to improve speech intelligibility for hearing impaired listeners. Dongmei Wang, Philipos C. Loizou, John H. L. Hansen |
| 2014 | Non-native perception of regionally accented speech in a multitalker context. Robert Allen Fox, Ewa Jacewicz, Florence Hardjono |
| 2014 | Non-native word recognition in noise: the role of word-initial and word-final information. Juul Coumans, Roeland van Hout, Odette Scharenborg |
| 2014 | Nonword repetition of Taiwanese disyllabic tonal sequences in adults with language attrition. Chia-Hsin Yeh, Chiung-Yao Wang, Jung-Yueh Tu |
| 2014 | Normalization of ASR confidence classifier scores via confidence mapping. Kshitiz Kumar, Chaojun Liu, Yifan Gong |
| 2014 | Novel speech duration modifier for packet based communication system. Senthil Kumar Mani, Jitendra Kumar Dhiman, K. Sri Rama Murty |
| 2014 | Objective evaluation of HMM-based speech synthesis system using Kullback-Leibler divergence. Cong-Thanh Do, Marc Evrard, A. Leman, Christophe d'Alessandro, Albert Rilliard, J.-L. Crebouw |
| 2014 | Objective quality evaluation of noise-suppressed speech: effects of temporal envelope and fine-structure cues. Fei Chen, Yi Hu |
| 2014 | On classification between normal and pathological voices using the MEEI-kayPENTAX database: issues and consequences. Khalid Daoudi, Blaise Bertrac |
| 2014 | On closed form calculation of line spectral frequencies (LSF). Paul Dalsgaard, Ove Andersen |
| 2014 | On predicting the unpleasantness level of a sound event. Stavros Ntalampiras, Ilyas Potamitis |
| 2014 | On recognition of non-native speech using probabilistic lexical model. Marzieh Razavi, Mathew Magimai-Doss |
| 2014 | On spectral and time domain energy reallocation for speech-in-noise intelligibility enhancement. Tudor-Catalin Zorila, Yannis Stylianou |
| 2014 | On the acoustic environment of a neonatal intensive care unit: initial description, and detection of equipment alarms. Ganna Raboshchuk, Climent Nadeu, Omid Ghahabi, Sergi Solvez, Blanca Muñoz Mahamud, Ana Riverola de Veciana, Santiago Navarro Hervas |
| 2014 | On the complementarity of short-time Fourier analysis windows of different lengths for improved language recognition. Mireia Díez, Mikel Peñagarikano, Germán Bordel, Amparo Varona, Luis Javier Rodríguez-Fuentes |
| 2014 | On the conversant-specificity of stochastic turn-taking models. Kornel Laskowski |
| 2014 | On the role of missing data imputation and NMF feature enhancement in building synthetic voices using reverberant speech. Dhananjaya N. Gowda, Heikki Kallasjoki, Reima Karhila, Cristian Contan, Kalle J. Palomäki, Mircea Giurgiu, Mikko Kurimo |
| 2014 | On the selection of the impulse responses for distant-speech recognition based on contaminated speech training. Mirco Ravanelli, Maurizio Omologo |
| 2014 | On the use of Bhattacharyya based GMM distance and neural net features for identification of cognitive load levels. Tin Lay Nwe, Trung Hieu Nguyen, Bin Ma |
| 2014 | On the use of the 'pure data' programming language for teaching and public outreach in speech processing. Roger K. Moore |
| 2014 | On the use of the Watson mixture model for clustering-based under-determined blind source separation. Ingrid Jafari, Roberto Togneri, Sven Nordholm |
| 2014 | One billion word benchmark for measuring progress in statistical language modeling. Ciprian Chelba, Tomás Mikolov, Mike Schuster, Qi Ge, Thorsten Brants, Phillipp Koehn, Tony Robinson |
| 2014 | Opti-speech: a real-time, 3d visual feedback system for speech training. William F. Katz, Thomas F. Campbell, Jun Wang, Eric Farrar, Jessie Colette Eubanks, Arvind Balasubramanian, Balakrishnan Prabhakaran, Rob Rennaker |
| 2014 | PLDA modeling in the fishervoice subspace for speaker verification. Jinghua Zhong, Weiwu Jiang, Wei Rao, Man-Wai Mak, Helen M. Meng |
| 2014 | PLLR features in language recognition system for RATS. Oldrich Plchot, Mireia Díez, Mehdi Soufifar, Lukás Burget |
| 2014 | Palate-referenced articulatory features for acoustic-to-articulator inversion. An Ji, Michael T. Johnson, Jeffrey Berry |
| 2014 | Parallel deep neural network training for LVCSR tasks using Blue Gene/Q. Tara N. Sainath, I-Hsin Chung, Bhuvana Ramabhadran, Michael Picheny, John A. Gunnels, Brian Kingsbury, George Saon, Vernon Austel, Upendra V. Chaudhari |
| 2014 | Parameterization of articulatory pattern in speakers with ALS. Panying Rong, Yana Yunusova, James D. Berry, Lorne Zinman, Jordan R. Green |
| 2014 | Parameterization of the glottal source with the phase plane plot. Manu Airaksinen, Paavo Alku |
| 2014 | Parsing named entity as syntactic structure. Xiantao Zhang, Dongchen Li, Xihong Wu |
| 2014 | Partial representations improve the prosody of incremental speech synthesis. Timo Baumann |
| 2014 | Perception of pitch tails at potential turn boundaries in Swedish. Margaret Zellers |
| 2014 | Perception of prosodic prominence and boundaries by L1 and L2 speakers of English. Gábor Pintér, Shinobu Mizuguchi, Koichi Tateishi |
| 2014 | Perception of sentence stress in English infant directed speech. Sofoklis Kakouros, Okko Räsänen |
| 2014 | Performance factor analysis for the 2012 NIST speaker recognition evaluation. Alvin F. Martin, Craig S. Greenberg, Vincent M. Stanford, John M. Howard, George R. Doddington, John J. Godfrey |
| 2014 | Phase distortion statistics as a representation of the glottal source: application to the classification of voice qualities. Gilles Degottex, Nicolas Obin |
| 2014 | Phase importance in speech processing applications. Pejman Mowlaee, Rahim Saeidi, Yannis Stylianou |
| 2014 | Phase-based harmonic/percussive separation. Estefanía Cano, Mark D. Plumbley, Christian Dittmar |
| 2014 | Phone classification by a hierarchy of invariant representation layers. Chiyuan Zhang, Stephen Voinea, Georgios Evangelopoulos, Lorenzo Rosasco, Tomaso A. Poggio |
| 2014 | Phoneme background model for information bottleneck based speaker diarization. Sree Harsha Yella, Petr Motlícek, Hervé Bourlard |
| 2014 | Phoneme category retuning in a non-native language. Polina Drozdova, Roeland van Hout, Odette Scharenborg |
| 2014 | Phonotactic language recognition based on time-gap-weighted lattice kernels. Wei-Wei Liu, Wei-Qiang Zhang, Jia Liu |
| 2014 | Post-masking: a hybrid approach to array processing for speech recognition. Amir R. Moghimi, Bhiksha Raj, Richard M. Stern |
| 2014 | Posterior-based sparse representation for automatic speech recognition. Sara Bahaadini, Afsaneh Asaei, David Imseng, Hervé Bourlard |
| 2014 | Predicting client's inclination towards target behavior change in motivational interviewing and investigating the role of laughter. Rahul Gupta, Panayiotis G. Georgiou, David C. Atkins, Shrikanth S. Narayanan |
| 2014 | Predicting when to laugh with structured classification. Bilal Piot, Olivier Pietquin, Matthieu Geist |
| 2014 | Prediction of cognitive load from speech with the VOQAL voice quality toolbox for the INTERSPEECH 2014 computational paralinguistics challenge. Mark A. Huckvale |
| 2014 | Prediction of cognitive performance in an animal fluency task based on rate and articulatory markers. Bea Yu, Thomas F. Quatieri, James R. Williamson, James C. Mundt |
| 2014 | Preservation of lexical tones in singing in a tone language. Anastasia Karlsson, Håkan Lundström, Jan-Olof Svantesson |
| 2014 | Principal components of auditory spectro-temporal receptive fields. Nagaraj Mahajan, Nima Mesgarani, Hynek Hermansky |
| 2014 | Probabilistic acoustic volume analysis for speech affected by depression. Nicholas Cummins, Vidhyasaharan Sethu, Julien Epps, Jarek Krajewski |
| 2014 | Probabilistic enrichment of knowledge graph entities for relation detection in conversational understanding. Dilek Hakkani-Tür, Asli Celikyilmaz, Larry P. Heck, Gökhan Tür, Geoffrey Zweig |
| 2014 | Probabilistic linear discriminant analysis with bottleneck features for speech recognition. Liang Lu, Steve Renals |
| 2014 | Progress in the BBN keyword search system for the DARPA RATS program. Tim Ng, Roger Hsiao, Le Zhang, Damianos G. Karakos, Sri Harish Reddy Mallidi, Martin Karafiát, Karel Veselý, Igor Szöke, Bing Zhang, Long Nguyen, Richard M. Schwartz |
| 2014 | Pronunciation learning for named-entities through crowd-sourcing. Attapol T. Rutherford, Fuchun Peng, Françoise Beaufays |
| 2014 | Pronunciation modeling of foreign words for Mandarin ASR by considering the effect of language transfer. Lei Wang, Rong Tong |
| 2014 | Pronunciation practice support system for children who have difficulty correctly pronouncing words. Ikuyo Masuda-Katsuse |
| 2014 | Pronunciation variation in read and conversational Austrian German. Barbara Schuppler, Martine Adda-Decker, Juan Andres Morales-Cordovilla |
| 2014 | Prosodic phrasing modeling for Vietnamese TTS using syntactic information. Thi Thu Trang Nguyen, Albert Rilliard, Do Dat Tran, Christophe d'Alessandro |
| 2014 | Prosody contour prediction with long short-term memory, bi-directional, deep recurrent neural networks. Raul Fernandez, Asaf Rendel, Bhuvana Ramabhadran, Ron Hoory |
| 2014 | Prosody perception, reading accuracy, nonliteral language comprehension, and music and tonal pitch discrimination in school aged children. Rose Thomas Kalathottukaren, Suzanne C. Purdy, Elaine Ballard |
| 2014 | Pruning deep neural networks by optimal brain damage. Chao Liu, Zhiyong Zhang, Dong Wang |
| 2014 | Query-by-example spoken term detection on multilingual unconstrained speech. Xavier Anguera, Luis Javier Rodríguez-Fuentes, Igor Szöke, Andi Buzo, Florian Metze, Mikel Peñagarikano |
| 2014 | RBM-PLDA subsystem for the NIST i-vector challenge. Sergey Novoselov, Timur Pekhovsky, Konstantin Simonchik, Andrey Shulipa |
| 2014 | RWTH LVCSR systems for Quaero and EU-Bridge: German, Polish, Spanish and Portuguese. M. Ali Basha Shaik, Zoltán Tüske, Muhammad Ali Tahir, Markus Nußbaum-Thom, Ralf Schlüter, Hermann Ney |
| 2014 | Random projections for large-scale speaker search. Ryan Leary, Walter Andrews |
| 2014 | Ranking severity of speech errors by their phonological impact in context. Sofia Strömbergsson, Christina Tånnander, Jens Edlund |
| 2014 | Rapidly building domain-specific entity-centric language models using semantic web knowledge sources. Murat Akbacak, Dilek Hakkani-Tür, Gökhan Tür |
| 2014 | Read and spontaneous speech classification based on variance of GMM supervectors. Taichi Asami, Ryo Masumura, Hirokazu Masataki, Sumitaka Sakauchi |
| 2014 | Recent advances in ASR applied to an Arabic transcription system for Al-Jazeera. Patrick Cardinal, Ahmed Ali, Najim Dehak, Yu Zhang, Tuka Al Hanai, Yifan Zhang, James R. Glass, Stephan Vogel |
| 2014 | Recent improvements in SRI's keyword detection system for noisy audio. Julien van Hout, Vikramjit Mitra, Yun Lei, Dimitra Vergyri, Martin Graciarena, Arindam Mandal, Horacio Franco |
| 2014 | Recent improvements in neural network acoustic modeling for LVCSR in low resource languages. Jia Cui, Bhuvana Ramabhadran, Xiaodong Cui, Andrew Rosenberg, Brian Kingsbury, Abhinav Sethy |
| 2014 | Reconstruction of mistracked articulatory trajectories. Qiang Fang, Jianguo Wei, Fang Hu |
| 2014 | Refined inter-segment joining in multi-form speech synthesis. Alexander Sorin, Slava Shechtman, Vincent Pollet |
| 2014 | Regularized feature-space discriminative adaptation for robust ASR. Takashi Fukuda, Osamu Ichikawa, Masafumi Nishimura, Steven J. Rennie, Vaibhava Goel |
| 2014 | Relating automatic vowel space estimates to talker intelligibility. Yi Luan, Richard A. Wright, Mari Ostendorf, Gina-Anne Levow |
| 2014 | Relative importance of AM and FM cues for speech comprehension: effects of speaking rate and their implications for neurophysiological processing of speech. Guangting Mai |
| 2014 | Removing redundancy from lattices. David Nolden, Hagen Soltau, Daniel Povey, Pegah Ghahremani, Lidia Mangu, Hermann Ney |
| 2014 | Replicate mismatch between test/background and development databases: the impact on the performance of likelihood ratio-based forensic voice comparison. Shunichi Ishihara |
| 2014 | Restructuring output layers of deep neural networks using minimum risk parameter clustering. Yotaro Kubo, Jun Suzuki, Takaaki Hori, Atsushi Nakamura |
| 2014 | Retroflex and bunched English /r/ with physical models of the human vocal tract. Takayuki Arai |
| 2014 | Revisiting the right-ear advantage for speech: implications for speech displays. Nandini Iyer, Eric R. Thompson, Brian D. Simpson, Griffin D. Romigh |
| 2014 | Rhythmic variability between some Asian languages: results from an automatic analysis of temporal characteristics. Volker Dellwo, Peggy Mok, Mathias Jenny |
| 2014 | Robust CNN-based speech recognition with Gabor filter kernels. Shuo-Yiin Chang, Nelson Morgan |
| 2014 | Robust articulatory speech synthesis using deep neural networks for BCI applications. Florent Bocquelet, Thomas Hueber, Laurent Girin, Pierre Badin, Blaise Yvert |
| 2014 | Robust features for content-based audio copy detection. Chahid Ouali, Pierre Dumouchel, Vishwa Gupta |
| 2014 | Robust language identification using convolutional neural network features. Sriram Ganapathy, Kyu Jeong Han, Samuel Thomas, Mohamed Kamal Omar, Maarten Van Segbroeck, Shrikanth S. Narayanan |
| 2014 | Robust language recognition via adaptive language factor extraction. Brecht Desplanques, Kris Demuynck, Jean-Pierre Martens |
| 2014 | Robust low-resource sound localization in correlated noise. Lorin Netsch, Jacek Stachurski |
| 2014 | Robust retrieval models for false positive errors in spoken documents. Sho Kawasaki, Tomoyosi Akiba |
| 2014 | Robust speech recognition in reverberant environments using subband-based steady-state monaural and binaural suppression. Hyung-Min Park, Matthew Maciejewski, Chanwoo Kim, Richard M. Stern |
| 2014 | Robust speech recognition using long short-term memory recurrent neural networks for hybrid acoustic modelling. Jürgen T. Geiger, Zixing Zhang, Felix Weninger, Björn W. Schuller, Gerhard Rigoll |
| 2014 | Robust speech recognition using temporal masking and thresholding algorithm. Chanwoo Kim, Kean K. Chin, Michiel Bacchiani, Richard M. Stern |
| 2014 | Robust speech recognition with speech enhanced deep neural networks. Jun Du, Qing Wang, Tian Gao, Yong Xu, Li-Rong Dai, Chin-Hui Lee |
| 2014 | Room localization for distant speech recognition. Juan Andres Morales-Cordovilla, Hannes Pessentheiner, Martin Hagmüller, Gernot Kubin |
| 2014 | SARA - Singapore's automated responsive assistant for the touristic domain. Andreea I. Niculescu, Rafael E. Banchs, Ridong Jiang, Seokhwan Kim, Kheng Hui Yeo, Arthur Niswar |
| 2014 | SNR-dependent mixture of PLDA for noise robust speaker verification. Man-Wai Mak |
| 2014 | SVM based speaker recognition: harnessing trials with multiple enrollment sessions. Jason W. Pelecanos, Weizhong Zhu, Sibel Yaman |
| 2014 | Segmentation and disfluency removal for conversational speech translation. Hany Hassan, Lee Schwartz, Dilek Hakkani-Tür, Gökhan Tür |
| 2014 | Segmentation in singer turns with the Bayesian information criterion. Marwa Thlithi, Thomas Pellegrini, Julien Pinquier, Régine André-Obrecht |
| 2014 | Selection of optimal vocal tract regions using real-time magnetic resonance imaging for robust voice activity detection. Abhay Prasad, Prasanta Kumar Ghosh, Shrikanth S. Narayanan |
| 2014 | Self-adaption in single-channel source separation. Michael Wohlmayr, Ludwig Mohr, Franz Pernkopf |
| 2014 | Semantic retrieval of personal photos using matrix factorization and two-layer random walk fusing sparse speech annotations with visual features. Yuan-ming Liou, Yi-Sheng Fu, Hung-yi Lee, Lin-Shan Lee |
| 2014 | Semantically based search in a social speech task. Fernando García, Emilio Sanchis, Ferran Pla |
| 2014 | Semi-supervised training for bottle-neck feature based DNN-HMM hybrid systems. Haihua Xu, Hang Su, Chng Eng Siong, Haizhou Li |
| 2014 | Sequence discriminative distributed training of long short-term memory recurrent neural networks. Hasim Sak, Oriol Vinyals, Georg Heigold, Andrew W. Senior, Erik McDermott, Rajat Monga, Mark Z. Mao |
| 2014 | Sequence error (SE) minimization training of neural network for voice conversion. Feng-Long Xie, Yao Qian, Yuchen Fan, Frank K. Soong, Haifeng Li |
| 2014 | Sequential maximum mutual information linear discriminant analysis for speech recognition. Yuuki Tachioka, Shinji Watanabe, Jonathan Le Roux, John R. Hershey |
| 2014 | Should deep neural nets have ears? the role of auditory features in deep learning approaches. Angel Mario Castro Martinez, Niko Moritz, Bernd T. Meyer |
| 2014 | Shrinkage based features for slot tagging with conditional random fields. Ruhi Sarikaya, Asli Celikyilmaz, Anoop Deoras, Minwoo Jeong |
| 2014 | Significance of aperiodicity in the pitch perception of expressive voices. Vinay Kumar Mittal, B. Yegnanarayana |
| 2014 | Simple4All. Robert A. J. Clark |
| 2014 | Simple gesture-based error correction interface for smartphone speech recognition. Yuan Liang, Koji Iwano, Koichi Shinoda |
| 2014 | Simulation of 3d larynges with asymmetric distribution of viscoelastic properties in their vocal folds. Marcelo de Oliveira Rosa |
| 2014 | Simultaneous gender classification and voice activity detection using deep neural networks. Hiroshi Fujimura |
| 2014 | Single channel source separation with general stochastic networks. Matthias Zöhrer, Franz Pernkopf |
| 2014 | Single-channel dynamic exemplar-based speech enhancement. Nasser Mohammadiha, Simon Doclo |
| 2014 | Single-channel speech enhancement based on non-negative matrix factorization and online noise adaptation. Kwang Myung Jeon, Chan Jun Chun, Woo Kyeong Seong, Hong Kook Kim, Myung Kyu Choi |
| 2014 | Single-ended estimation of speech intelligibility using the ITU p.563 feature set. Toshihiro Sakano, Yosuke Kobayashi, Kazuhiro Kondo |
| 2014 | Sound patterns in language. William S.-Y. Wang |
| 2014 | Sparse smoothing of articulatory features from Gaussian mixture model based acoustic-to-articulatory inversion: benefit to speech recognition. Prasad Sudhakar, Prasanta Kumar Ghosh |
| 2014 | Sparse time-frequency representation of speech by the Vandermonde transform. Christian Fischer Pedersen, Tom Bäckström |
| 2014 | Speaker adaptation based on sparse and low-rank eigenphone matrix estimation. Wen-Lin Zhang, Dan Qu, Wei-Qiang Zhang, Bi-Cheng Li |
| 2014 | Speaker adaptation of DNN-based ASR with i-vectors: does it actually adapt models to speakers? Mickael Rouvier, Benoît Favre |
| 2014 | Speaker adaptation of context dependent deep neural networks based on MAP-adaptation and GMM-derived feature processing. Natalia A. Tomashenko, Yuri Y. Khokhlov |
| 2014 | Speaker age estimation for elderly speech recognition in European Portuguese. Thomas Pellegrini, Vahid Hedayati, Isabel Trancoso, Annika Hämäläinen, Miguel Sales Dias |
| 2014 | Speaker dependent bottleneck layer training for speaker adaptation in automatic speech recognition. Rama Doddipatla, Madina Hasan, Thomas Hain |
| 2014 | Speaker diarization using eye-gaze information in multi-party conversations. Koji Inoue, Yukoh Wakabayashi, Hiromasa Yoshimoto, Tatsuya Kawahara |
| 2014 | Speaker diarization using gesture and speech. Binyam Gebrekidan Gebre, Peter Wittenburg, Sebastian Drude, Marijn Huijbregts, Tom Heskes |
| 2014 | Speaker idiosyncratic variability of intensity across syllables. Lei He, Volker Dellwo |
| 2014 | Speaker recognition via fusion of subglottal features and MFCCs. Harish Arsikere, Hitesh Anand Gupta, Abeer Alwan |
| 2014 | Speaker verification and spoken language identification using a generalized i-vector framework with phonetic tokenizations and tandem features. Ming Li, Wenbo Liu |
| 2014 | Spectral tilt modelling with GMMs for intelligibility enhancement of narrowband telephone speech. Emma Jokinen, Ulpu Remes, Marko Takanen, Kalle J. Palomäki, Mikko Kurimo, Paavo Alku |
| 2014 | Speech activity detection for NASA Apollo space missions: challenges and solutions. Ali Ziaei, Lakshmish Kaushik, Abhijeet Sangwan, John H. L. Hansen, Douglas W. Oard |
| 2014 | Speech assistant system. László Czap |
| 2014 | Speech cohesion for topic segmentation of spoken contents. Abdessalam Bouchekif, Géraldine Damnati, Delphine Charlet |
| 2014 | Speech detection in transient noises. G. Aneeja, B. Yegnanarayana |
| 2014 | Speech emotion recognition using deep neural network and extreme learning machine. Kun Han, Dong Yu, Ivan Tashev |
| 2014 | Speech emotion recognition with cross-lingual databases. Bo-Chang Chiou, Chia-Ping Chen |
| 2014 | Speech enhancement by low-rank and convolutive dictionary spectrogram decomposition. Zhuo Chen, Brian McFee, Daniel P. W. Ellis |
| 2014 | Speech enhancement from additive noise and channel distortion - a corpus-based approach. Ji Ming, Danny Crookes |
| 2014 | Speech intonation for TTS: study on evaluation methodology. Javier Latorre, Kayoko Yanagisawa, Vincent Wan, BalaKrishna Kolluru, Mark J. F. Gales |
| 2014 | Speech pre-enhancement using a discriminative microscopic intelligibility model. Maryam Al Dabel, Jon Barker |
| 2014 | Speech prosody generation for text-to-speech synthesis based on generative model of F0 contours. Kento Kadowaki, Tatsuma Ishihara, Nobukatsu Hojo, Hirokazu Kameoka |
| 2014 | Speech recognition based on Itakura-Saito divergence and dynamics/sparseness constraints from mixed sound of speech and music by non-negative matrix factorization. Naoaki Hashimoto, Shoichi Nakano, Kazumasa Yamamoto, Seiichi Nakagawa |
| 2014 | Speech recognition without a lexicon - bridging the gap between graphemic and phonetic systems. David F. Harwath, James R. Glass |
| 2014 | Speech synthesis in various communicative situations: impact of pronunciation variations. Sandrine Brognaux, Benjamin Picart, Thomas Drugman |
| 2014 | Speech synthesis reactive to dynamic noise environmental conditions. Susana Palmaz López-Peláez, Robert A. J. Clark |
| 2014 | Speech-based automatic and robust detection of very early dementia. Aharon Satt, Ron Hoory, Alexandra König, Pauline Aalten, Philippe H. Robert |
| 2014 | Speech-driven head motion synthesis using neural networks. Chuang Ding, Pengcheng Zhu, Lei Xie, Dongmei Jiang, Zhong-Hua Fu |
| 2014 | Speech-to-text technology to transcribe and disclose 100,000+ hours of bilingual documents from historical Czech and Czechoslovak radio archive. Jan Nouza, Petr Cerva, Jindrich Zdánský, Karel Blavka, Marek Bohac, Jan Silovský, Josef Chaloupka, Michaela Kucharová, Ladislav Seps, Jirí Málek, Michal Rott |
| 2014 | Spoken dialogue system for restaurant recommendation and reservation. Rafael E. Banchs, Seokhwan Kim |
| 2014 | Spoken language recognition based on senone posteriors. Luciana Ferrer, Yun Lei, Mitchell McLaren, Nicolas Scheffer |
| 2014 | Spoken question answering using tree-structured conditional random fields and two-layer random walk. Sz-Rung Shiang, Hung-yi Lee, Lin-Shan Lee |
| 2014 | Statistical parametric speech synthesis using weighted multi-distribution deep belief network. Shiyin Kang, Helen M. Meng |
| 2014 | Statistical singing voice conversion with direct waveform modification based on the spectrum differential. Kazuhiro Kobayashi, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura |
| 2014 | Stereo acoustic echo suppression using widely linear filtering in the frequency domain. Zhong-Hua Fu, Lei Xie |
| 2014 | Strategies for rescoring keyword search results using word-burst and acoustic features. Min Ma, Justin Richards, Victor Soto, Julia Hirschberg, Andrew Rosenberg |
| 2014 | Stress and accent transmission in HMM-based syllable-context very low bit rate speech coding. Milos Cernak, Alexandros Lazaridis, Philip N. Garner, Petr Motlícek |
| 2014 | Structured soft margin confidence weighted learning for grapheme-to-phoneme conversion. Keigo Kubo, Sakriani Sakti, Graham Neubig, Tomoki Toda, Satoshi Nakamura |
| 2014 | Study of changes in glottal vibration characteristics during laughter. Vinay Kumar Mittal, B. Yegnanarayana |
| 2014 | Subjective voice quality evaluation of artificial bandwidth extension: comparing different audio bandwidths and speech codecs. Hannu Pulakka, Anssi Rämö, Ville Myllylä, Henri Toukomaa, Paavo Alku |
| 2014 | Subspace Gaussian mixture models for dialogues classification. Mohamed Bouallegue, Mohamed Morchid, Richard Dufour, Driss Matrouf, Georges Linarès, Renato De Mori |
| 2014 | Subword and phonetic search for detecting out-of-vocabulary keywords. Damianos G. Karakos, Richard M. Schwartz |
| 2014 | Summary and initial results of the 2013-2014 speaker recognition i-vector machine learning challenge. Désiré Bansé, George R. Doddington, Daniel Garcia-Romero, John J. Godfrey, Craig S. Greenberg, Alvin F. Martin, Alan McCree, Mark A. Przybocki, Douglas A. Reynolds |
| 2014 | Synchronic variation in the articulation and the acoustics of the Polish three-way place distinction in sibilants and its implications for diachronic change. Véronique Bukmaier, Jonathan Harrington, Ulrich Reubold, Felicitas Kleber |
| 2014 | Syncwords: a platform for semi-automated closed captioning and subtitles. Aleksandr Dubinsky |
| 2014 | System for automated speech and language analysis (SALSA). Kyle Marek-Spartz, Benjamin Knoll, Robert Bill, S. Thomas Christie, Serguei V. S. Pakhomov |
| 2014 | TTS synthesis with bidirectional LSTM based recurrent neural networks. Yuchen Fan, Yao Qian, Feng-Long Xie, Frank K. Soong |
| 2014 | Tandem deep features for text-dependent speaker verification. Tianfan Fu, Yanmin Qian, Yuan Liu, Kai Yu |
| 2014 | Targeted feature dropout for robust slot filling in natural language understanding. Puyang Xu, Ruhi Sarikaya |
| 2014 | Task-aware deep bottleneck features for spoken language identification. Bing Jiang, Yan Song, Si Wei, Ian Vince McLoughlin, Li-Rong Dai |
| 2014 | Text-independent voice conversion using speaker model alignment method from non-parallel speech. Peng Song, Yun Jin, Wenming Zheng, Li Zhao |
| 2014 | Text-to-speech with cross-lingual neural network-based grapheme-to-phoneme models. Xavi Gonzalvo, Monika Podsiadlo |
| 2014 | The DIRHA-GRID corpus: baseline and tools for multi-room distant speech recognition using distributed microphones. Marco Matassoni, Ramón Fernandez Astudillo, Athanasios Katsamanis, Mirco Ravanelli |
| 2014 | The EMG-UKA corpus for electromyographic speech processing. Michael Wand, Matthias Janke, Tanja Schultz |
| 2014 | The INTERSPEECH 2014 computational paralinguistics challenge: cognitive & physical load. Björn W. Schuller, Stefan Steidl, Anton Batliner, Julien Epps, Florian Eyben, Fabien Ringeval, Erik Marchi, Yue Zhang |
| 2014 | The Lombard effect with Thai lexical tones: an acoustic analysis of articulatory modifications in noise. Benjawan Kasisopa, Virginie Attina, Denis Burnham |
| 2014 | The NIST SRE summed channel speaker recognition system. Hanwu Sun, Bin Ma |
| 2014 | The UNSW submission to INTERSPEECH 2014 compare cognitive load challenge. Jia Min Karen Kua, Vidhyasaharan Sethu, Phu Ngoc Le, Eliathamby Ambikairajah |
| 2014 | The articulation of lexical and post-lexical palatalization in Korean. Jae-Hyun Sung |
| 2014 | The effect of filled pauses and speaking rate on speech comprehension in natural, vocoded and synthetic speech. Rasmus Dall, Mirjam Wester, Martin Corley |
| 2014 | The effect of regional and non-native accents on word recognition processes: a comparison of EEG responses in quiet to speech recognition in noise. Louise Stringer, Paul Iverson |
| 2014 | The effects of high and low variability phonetic training on the perception and production of English vowels /e/-/æ/ by Cantonese ESL learners with high and low L2 proficiency levels. Janice Wing Sze Wong |
| 2014 | The goodness of pronunciation algorithm applied to disordered speech. Thomas Pellegrini, Lionel Fontan, Julie Mauclair, Jérôme Farinas, Marina Robert |
| 2014 | The importance of phase on voice quality assessment. Maria Koutsogiannaki, Olympia Simantiraki, Gilles Degottex, Yannis Stylianou |
| 2014 | The influence of pitch and noise on the discriminability of filterbank features. Malcolm Slaney, Michael L. Seltzer |
| 2014 | The influence of sensory memory and attention on the context effect in talker normalization. Guo Li, Gang Peng |
| 2014 | The nested Indian buffet process for flexible topic modeling. Jen-Tzung Chien, Ying-Lan Chang |
| 2014 | The obligatory contour principle in African and European varieties of French. Mathieu Avanzi, Guri Bordal, Gélase Nimbona |
| 2014 | The relationship between the second subglottal resonance and vowel class, standing height, trunk length, and F0 variation for Mandarin speakers. Jinxi Guo, Angli Liu, Harish Arsikere, Abeer Alwan, Steven M. Lulich |
| 2014 | The speech recognition virtual kitchen: launch party. Andrew R. Plummer, Eric Riebling, Anuj Kumar, Florian Metze, Eric Fosler-Lussier, Rebecca Bates |
| 2014 | The use of low-frequency ultrasound for voice activity detection. Ian Vince McLoughlin |
| 2014 | Theme identification in human-human conversations with features from specific speaker type hidden spaces. Mohamed Morchid, Richard Dufour, Mohamed Bouallegue, Georges Linarès, Renato De Mori |
| 2014 | Towards a complete binary key system for the speaker diarization task. Héctor Delgado, Corinne Fredouille, Javier Serrano |
| 2014 | Towards a neural measure of perceptual distance - classification of electroencephalographic responses to synthetic vowels. Manson Cheuk-Man Fong, James W. Minett, Thierry Blu, William S.-Y. Wang |
| 2014 | Towards a perceptual model of speech rhythm: integrating the influence of f0 on perceived duration. Robert Fuchs |
| 2014 | Towards a practical silent speech recognition system. Yunbin Deng, James T. Heaton, Geoffrey S. Meltzner |
| 2014 | Towards better performance with heterogeneous training data in acoustic modeling using deep neural networks. Yan Huang, Malcolm Slaney, Michael L. Seltzer, Yifan Gong |
| 2014 | Towards improving statistical model based voice activity detection. Ming Tu, Xiang Xie, Yishan Jiao |
| 2014 | Towards real-life application of EMG-based speech recognition by using unsupervised adaptation. Michael Wand, Tanja Schultz |
| 2014 | Towards speaker adaptive training of deep neural network acoustic models. Yajie Miao, Hao Zhang, Florian Metze |
| 2014 | Towards the adaptation of prosodic models for expressive text-to-speech synthesis. Mathieu Avanzi, George Christodoulides, Damien Lolive, Elisabeth Delais-Roussarie, Nelly Barbot |
| 2014 | Transcribing tone - a likelihood-based quantitative evaluation of Chao's tone letters. Phil Rose |
| 2014 | Transform mapping using shared decision tree context clustering for HMM-based cross-lingual speech synthesis. Daiki Nagahama, Takashi Nose, Tomoki Koriyama, Takao Kobayashi |
| 2014 | UBM fused total variability modeling for language identification. Maarten Van Segbroeck, Ruchir Travadi, Shrikanth S. Narayanan |
| 2014 | Unfolded recurrent neural networks for speech recognition. George Saon, Hagen Soltau, Ahmad Emami, Michael Picheny |
| 2014 | Unsupervised language filtering using the latent Dirichlet allocation. Wei Zhang, Robert A. J. Clark, Yongyuan Wang |
| 2014 | Unsupervised model selection for recognition of regional accented speech. Maryam Najafian, Andrea DeMarco, Stephen J. Cox, Martin J. Russell |
| 2014 | Unsupervised query-by-example spoken term detection using bag of acoustic words and non-segmental dynamic time warping. Basil George, Abhijeet Saxena, Gautam Varma Mantena, Kishore Prahallad, B. Yegnanarayana |
| 2014 | Unsupervised speaker diarization using Riemannian manifold clustering. Che-Wei Huang, Bo Xiao, Panayiotis G. Georgiou, Shrikanth S. Narayanan |
| 2014 | Unsupervised spoken word retrieval using Gaussian-Bernoulli restricted Boltzmann machines. Raghavendra Reddy Pappagari, Shekhar Nayak, K. Sri Rama Murty |
| 2014 | Unsupervised training methods for discriminative language modeling. Erinç Dikici, Murat Saraçlar |
| 2014 | Using a hybrid approach to build a pronunciation dictionary for Brazilian Portuguese. Gustavo Mendonça, Sandra M. Aluísio |
| 2014 | Using conditional random fields to predict focus word pair in spontaneous spoken English. Xiao Zang, Zhiyong Wu, Helen M. Meng, Jia Jia, Lianhong Cai |
| 2014 | Using deep belief networks for vector-based speaker recognition. William M. Campbell |
| 2014 | Using deep neural networks to improve proficiency assessment for children English language learners. Angeliki Metallinou, Jian Cheng |
| 2014 | Using hidden Markov models for speech enhancement. Akihiro Kato, Ben Milner |
| 2014 | Using linguistic predictability and the Lombard effect to increase the intelligibility of synthetic speech in noise. Cassia Valentini-Botinhao, Mirjam Wester |
| 2014 | Utilizing state-level distance vector representation for improved spoken term detection by text and spoken queries. Mitsuaki Makino, Naoki Yamamoto, Atsuhiko Kai |
| 2014 | Variable Span disfluency detection in ASR transcripts. Rahul Gupta, Sankaranarayanan Ananthakrishnan, Zhaojun Yang, Shrikanth S. Narayanan |
| 2014 | Variable-component deep neural network for robust speech recognition. Rui Zhao, Jinyu Li, Yifan Gong |
| 2014 | Verbal description of LEGO blocks. Diogo Henriques, Isabel Trancoso, Daniel Mendes, Alfredo Ferreira |
| 2014 | Virtual example for phonotactic language recognition. Rong Tong, Bin Ma, Haizhou Li |
| 2014 | Vocal tract length estimation based on vowels using a database consisting of 385 speakers and a database with MRI-based vocal tract shape information. Hideki Kawahara, Tatsuya Kitamura, Hironori Takemoto, Ryuichi Nisimura, Toshio Irino |
| 2014 | Voice conversion using generative trained deep neural networks with multiple frame spectral envelopes. Ling-Hui Chen, Zhen-Hua Ling, Li-Rong Dai |
| 2014 | Voice expression conversion with factorised HMM-TTS models. Javier Latorre, Vincent Wan, Kayoko Yanagisawa |
| 2014 | Vowel length impact on locus equation parameters: an investigation on Jordanian Arabic. Mohammad Abuoudeh, Olivier Crouzet |
| 2014 | Vowel spectral contributions to English and Mandarin sentence intelligibility. Daniel Fogerty, Fei Chen |
| 2014 | Weighted spatial bispectrum correlation matrix for DOA estimation in the presence of interferences. Wei Xue, Shan Liang, Wenju Liu |
| 2014 | When voices get emotional: a study of emotion-enhanced memory and impairment during emotional prosody exposure. Cyrielle Chappuis, Didier Grandjean |
| 2014 | Where /ar/ the /r/s in standard Austrian German? Anke Jackschina, Barbara Schuppler, Rudolf Muhr |
| 2014 | Word embeddings for speech recognition. Samy Bengio, Georg Heigold |
| 2014 | Word pair approximation for more efficient decoding with high-order language models. David Nolden, Ralf Schlüter, Hermann Ney |
| 2014 | Word-based probabilistic phonetic retrieval for low-resource spoken term detection. Di Xu, Florian Metze |
| 2014 | Word-level invariant representations from acoustic waveforms. Stephen Voinea, Chiyuan Zhang, Georgios Evangelopoulos, Lorenzo Rosasco, Tomaso A. Poggio |
| 2014 | Word-phrase-entity language models: getting more mileage out of n-grams. Michael Levit, Sarangarajan Parthasarathy, Shuangyu Chang, Andreas Stolcke, Benoît Dumoulin |
| 2014 | elite-HTS: a NLP tool for French HMM-based speech synthesis. Sophie Roekhaut, Sandrine Brognaux, Richard Beaufort, Thierry Dutoit |
| 2014 | rwthlm - the RWTH aachen university neural network language modeling toolkit. Martin Sundermeyer, Ralf Schlüter, Hermann Ney |