| 2011 | "What is... Dengue Fever?" - Modeling and Predicting Pronunciation Errors in a Text-to-Speech System. Andrew Rosenberg, Raul Fernandez, Bhuvana Ramabhadran |
| 2011 | "Would You Buy a Car from Me?" - On the Likability of Telephone Voices. Felix Burkhardt, Björn W. Schuller, Benjamin Weiss, Felix Weninger |
| 2011 | "You made me do it": Classification of Blame in Married Couples' Interactions by Fusing Automatically Derived Speech and Language Information. Matthew Black, Panayiotis G. Georgiou, Athanasios Katsamanis, Brian R. Baucom, Shrikanth S. Narayanan |
| 2011 | 'Are You Sure You're Paying Attention?' - 'Uh-Huh' Communicating Understanding as a Marker of Attentiveness. Hendrik Buschmeier, Zofia Malisz, Marcin Wlodarczak, Stefan Kopp, Petra Wagner |
| 2011 | 12th Annual Conference of the International Speech Communication Association, INTERSPEECH 2011, Florence, Italy, August 27-31, 2011 |
| 2011 | A Bayesian Approach to Voice Conversion Based on GMMs Using Multiple Model Structures. Lei Li, Yoshihiko Nankaku, Keiichi Tokuda |
| 2011 | A Bottom-Up Stepwise Knowledge-Integration Approach to Large Vocabulary Continuous Speech Recognition Using Weighted Finite State Machines. Sabato Marco Siniscalchi, Torbjørn Svendsen, Chin-Hui Lee |
| 2011 | A Comparative Acoustic Study on Speech of Glossectomy Patients and Normal Subjects. Xinhui Zhou, Maureen L. Stone, Carol Y. Espy-Wilson |
| 2011 | A Corpus-Based Study of English Pronunciation Variations. Sunhee Kim, Kyuwhan Lee, Minhwa Chung |
| 2011 | A Cross-Lingual Approach to the Development of an HMM-Based Speech Synthesis System for Malay. Mumtaz B. Mustafa, Raja Noor Ainon, Roziati Zainuddin, Zuraidah M. Don, Gerry Knowles |
| 2011 | A Cross-Lingual Spoken Content Search System. Jitendra Ajmera, Ashish Verma |
| 2011 | A Divide et impera Algorithm for Optimal Pitch Stylization. Antonio Origlia, Giovanni Abete, Francesco Cutugno, Iolanda Alfano, Renata Savy, Bogdan Ludusan |
| 2011 | A Dual Channel Coupled Decoder for Fillers and Feedback. Daniel Neiberg, Joakim Gustafson |
| 2011 | A Frequency Domain Approach to ARX-LF Voiced Speech Parameterization and Synthesis. Alan Ó Cinnéide, David Dorran, Mikel Gainza, Eugene Coyle |
| 2011 | A Fully Automated Derivation of State-Based Eigentriphones for Triphone Modeling with No Tied States Using Regularization. Tom Ko, Brian Mak |
| 2011 | A Grammar Based Approach to Style Specific Phrase Prediction. Alok Parlikar, Alan W. Black |
| 2011 | A High Resolution Multiple Source Localization Based on Generalized Cumulant Structure (GCS) Matrix. Jinho Choi, Chang D. Yoo |
| 2011 | A Hybrid Quasi-Harmonic/CELP Wideband Speech Coding Scheme for Unit Selection TTS Synthesis. Chang-Heon Lee, Olivier Rosec, Yannis Stylianou |
| 2011 | A Hybrid TTS Approach for Prosody and Acoustic Modules. Iñaki Sainz, Daniel Erro, Eva Navas, Inma Hernáez |
| 2011 | A Language Independent Approach to Audio Search. Vikram Gupta, Jitendra Ajmera, Arun Kumar, Ashish Verma |
| 2011 | A Level-Dependent Auditory Filter-Bank for Speech Recognition in Reverberant Environments. Hari Krishna Maganti, Marco Matassoni |
| 2011 | A Long-Term Harmonic Plus Noise Model for Speech Signals. Faten Ben Ali, Laurent Girin, Sonia Djaziri Larbi |
| 2011 | A Longest Matching Segment Approach with Baysian Adaptation - Application to Noise-Robust Speaker Recognition. Ayeh Jafari, Ramji Srinivasan, Danny Crookes, Ji Ming |
| 2011 | A Model-Based Spectral Envelope Wiener Filter for Perceptually Motivated Speech Enhancement. Najib Hadir, Friedrich Faubel, Dietrich Klakow |
| 2011 | A Multichannel Feature-Based Processing for Robust Speech Recognition. Mehrez Souden, Keisuke Kinoshita, Marc Delcroix, Tomohiro Nakatani |
| 2011 | A Multimodal Analysis of Vocal and Visual Backchannels in Spontaneous Dialogs. Khiet P. Truong, Ronald Poppe, Iwan de Kok, Dirk Heylen |
| 2011 | A Multimodal Approach to Dictation of Handwritten Historical Documents. Vicent Alabau, Verónica Romero, Antonio L. Lagarda, Carlos D. Martínez-Hinarejos |
| 2011 | A Multimodal Real-Time MRI Articulatory Corpus for Speech Research. Shrikanth S. Narayanan, Erik Bresch, Prasanta Kumar Ghosh, Louis Goldstein, Athanasios Katsamanis, Yoon Kim, Adam C. Lammert, Michael I. Proctor, Vikram Ramanarayanan, Yinghua Zhu |
| 2011 | A Multithreaded Implementation of Viterbi Decoding on Recursive Transition Networks. Fabio Brugnara |
| 2011 | A New Epsilon Filter for Efficient Composition of Weighted Finite-State Transducers. Frank Duckhorn, Matthias Wolff, Rüdiger Hoffmann |
| 2011 | A New Model-Based Mandarin-Speech Coding System. Chen-Yu Chiang, Jyh-Her Yang, Ming-Chieh Liu, Yih-Ru Wang, Yuan-Fu Liao, Sin-Horng Chen |
| 2011 | A New Perspective on GMM Subspace Compensation Based on PPCA and Wiener Filtering. Alan McCree, Douglas E. Sturim, Douglas A. Reynolds |
| 2011 | A New Phonetic Candidate Generator for Improving Search Query Efficiency. Bo Peng, Yao Qian, Frank K. Soong, Bo Zhang |
| 2011 | A Noise Estimation Method Based on Speech Presence Probability and Spectral Sparseness. Chao Li, Wenju Liu |
| 2011 | A Paradigm for Limited Vocabulary Speech Recognition Based on Redundant Spectro-Temporal Feature Sets. Sourish Chaudhuri, Bhiksha Raj, Tony Ezzat |
| 2011 | A Parametric Approach to Intonation Acquisition Research: Validation on Child-Directed Speech Data. Britta Lintfert, Antje Schweitzer, Bernd Möbius |
| 2011 | A Perceptual Expressivity Modeling Technique for Speech Synthesis Based on Multiple-Regression HSMM. Takashi Nose, Takao Kobayashi |
| 2011 | A Performance Monitoring Approach to Fusing Enhanced Spectrogram Channels in Robust Speech Recognition. Shirin Badiezadegan, Richard C. Rose |
| 2011 | A Piecewise Aggregate Approximation Lower-Bound Estimate for Posteriorgram-Based Dynamic Time Warping. Yaodong Zhang, James R. Glass |
| 2011 | A Pitch Tracking Corpus with Evaluation on Multipitch Tracking Scenario. Gregor Pirker, Michael Wohlmayr, Stefan Petrik, Franz Pernkopf |
| 2011 | A Pointwise Approach to Pronunciation Estimation for a TTS Front-End. Shinsuke Mori, Graham Neubig |
| 2011 | A Preliminary Model of Emotional Prosody Using Multidimensional Scaling. Sona Patel, Rahul Shrivastav |
| 2011 | A Preliminary Study on the Production of Signs in Brazilian Sign Language when One of the Manual Articulators is Unavailable. André N. Xavier, Plínio A. Barbosa |
| 2011 | A Qualitative Evaluation of Phoneme-to-Phoneme Technology. Marijn Schraagen, Gerrit Bloothooft |
| 2011 | A Quantitative Investigation of the Prosody of Verum Focus in Italian. Giuseppina Turco, Michele Gubian, Jessamyn Schertz |
| 2011 | A Rapid Adaptation Algorithm for Tracking Highly Non-Stationary Noises based on Bayesian Inference for On-Line Spectral Change Point Detection. Md Foezur Rahman Chowdhury, Sid-Ahmed Selouani, Douglas D. O'Shaughnessy |
| 2011 | A Risk-Estimation-Based Comparison of Mean Square Error and Itakura-Saito Distortion Measures for Speech Enhancement. Nagarjuna Reddy Muraka, Chandra Sekhar Seelamantula |
| 2011 | A Robust Approach to Mining Repeated Sequence in Audio Stream. Jiansong Chen, Lei Zhu, Bailan Feng, Peng Ding, Bo Xu |
| 2011 | A Robust Estimation Method of Noise Mixture Model for Noise Suppression. Masakiyo Fujimoto, Shinji Watanabe, Tomohiro Nakatani |
| 2011 | A Scalable Approach to Building a Parallel Corpus from the Web. Vivek Kumar Rangarajan Sridhar, Luciano Barbosa, Srinivas Bangalore |
| 2011 | A Soft Decision-Based Speech Enhancement Using Acoustic Noise Classification. Jae-Hun Choi, Sang-Kyun Kim, Joon-Hyuk Chang |
| 2011 | A Speaker Line-Up for the Likelihood Ratio. David A. van Leeuwen, Niko Brümmer |
| 2011 | A Statistical Phrase/Accent Model for Intonation Modeling. Gopala Krishna Anumanchipalli, Luís C. Oliveira, Alan W. Black |
| 2011 | A Statistical Room Impulse Response Model with Frequency Dependent Reverberation Time for Single-Microphone Late Reverberation Suppression. Jan S. Erkelens, Richard Heusdens |
| 2011 | A Study of the Effectiveness of Articulatory Strokes for Phonemic Recognition. Carlos Molina, Sungbok Lee, Shrikanth S. Narayanan, Néstor Becerra Yoma |
| 2011 | A Study on Auditory Feature Spaces for Speech-Driven Lip Animation. Guylaine Le Jan, Yannick Benezeth, Guillaume Gravier, Frédéric Bimbot |
| 2011 | A Study on Bag of Gaussian Model with Application to Voice Conversion. Yu Qiao, Tong Tong, Nobuaki Minematsu |
| 2011 | A Study on Combining VTLN and SAT to Improve the Performance of Automatic Speech Recognition. Doddipatla Rama Sanand, Mikko Kurimo |
| 2011 | A Study on Speaker Normalized MLP Features in LVCSR. Zoltán Tüske, Christian Plahl, Ralf Schlüter |
| 2011 | A Study on the Effect of Pitch on LPCC and PLPC Features for Children's ASR in Comparison to MFCC. Shweta Ghai, Rohit Sinha |
| 2011 | A Study on the Perception of Tone and Intonation in Sesotho. Hansjörg Mixdorff, Lehlohonolo Mohasi, Malillo Machobane, Thomas Niesler |
| 2011 | A Tale of Two Tasks: Detecting Children's Off-Task Speech in a Reading Tutor. Wei Chen, Jack Mostow |
| 2011 | A Template Based Voice Trigger System Using Bhattacharyya Edit Distance. Evelyn Kurniawati, Samsudin Ng, Karthik Muralidhar, Sapna George |
| 2011 | A Transcription Task for Crowdsourcing with Automatic Quality Control. Chia-ying Lee, James R. Glass |
| 2011 | A Two-Stage Sample-Based Phone Boundary Detector Using Segmental Similarity Features. Yih-Ru Wang |
| 2011 | A Versatile Gaussian Splitting Approach to Non-Linear State Estimation and its Application to Noise-Robust ASR. Volker Leutnant, Alexander Krueger, Reinhold Haeb-Umbach |
| 2011 | A Web Based Speech Transcription Workplace. Markus Klehr, Andreas Ratzka, Thomas Roß |
| 2011 | A Web-Based Tool for Developing Multilingual Pronunciation Lexicons. Samantha Ainsley, Linne Ha, Martin Jansche, Ara Kim, Masayuki Nanzawa |
| 2011 | ASR for Human-Symbiotic Robot "EMIEW2" with Mechanical Noise and Floor-Level Noise Reduction. Takashi Sumiyoshi, Masahito Togami, Yasunari Obuchi |
| 2011 | AT&T VoiceBuilder: A Cloud-Based Text-to-Speech Voice Builder Tool. Yeon-Jun Kim, Thomas Okken, Alistair Conkie, Giuseppe Di Fabbrizio |
| 2011 | AUC Optimization Based Confidence Measure for Keyword Spotting. Haiyang Li, Jiqing Han, Tieran Zheng |
| 2011 | About Handling Boundary Uncertainty in a Speaking Rate Dependent Modeling Approach. Denis Jouvet, Dominique Fohr, Irina Illina |
| 2011 | Accelerated Parallelizable Neural Network Learning Algorithm for Speech Recognition. Dong Yu, Li Deng |
| 2011 | Acceleration Sensor Based Estimates of Subglottal Resonances: Short vs. Long Vowels. Wolfgang Wokurek, Andreas Madsack |
| 2011 | Accounting for Prosodic Information to Improve ASR-Based Topic Tracking for TV Broadcast News. Camille Guinaudeau, Julia Hirschberg |
| 2011 | Acoustic Analysis of Whispered Speech for Phoneme and Speaker Dependency. Xing Fan, Keith W. Godin, John H. L. Hansen |
| 2011 | Acoustic Correlates of Glottal Gaps. Gang Chen, Jody Kreiman, Yen-Liang Shue, Abeer Alwan |
| 2011 | Acoustic Forest for SMAP-Based Speaker Verification. Sangeeta Biswas, Marc Ferras, Koichi Shinoda, Sadaoki Furui |
| 2011 | Acoustic Look-Ahead for More Efficient Decoding in LVCSR. David Nolden, Ralf Schlüter, Hermann Ney |
| 2011 | Acoustic Model Training with Detecting Transcription Errors in the Training Data. Gakuto Kurata, Nobuyasu Itoh, Masafumi Nishimura |
| 2011 | Acoustic Modeling with Bootstrap and Restructuring Based on Full Covariance. Xiaodong Cui, Xin Chen, Jian Xue, Peder A. Olsen, John R. Hershey, Bowen Zhou |
| 2011 | Acoustic and Prosodic Correlates of Social Behavior. Agustín Gravano, Rivka Levitan, Laura Willson, Stefan Benus, Julia Hirschberg, Ani Nenkova |
| 2011 | Acoustic and Visual Cues of Turn-Taking Dynamics in Dyadic Interactions. Bo Xiao, Viktor Rozgic, Athanasios Katsamanis, Brian R. Baucom, Panayiotis G. Georgiou, Shrikanth S. Narayanan |
| 2011 | Acoustic-Linguistic Recognition of Interest in Speech with Bottleneck-BLSTM Nets. Martin Wöllmer, Felix Weninger, Florian Eyben, Björn W. Schuller |
| 2011 | Acoustic-Similarity Based Technique to Improve Concept Recognition. Om Deshmukh, Shajith Ikbal, Ashish Verma, Etienne Marcheret |
| 2011 | Acquisition of Timing Patterns in Second Language. Mikhail Ordin, Leona Polyanskaya, Christiane Ulbrich |
| 2011 | Active Learning for Dialogue Act Classification. Björn Gambäck, Fredrik Olsson, Oscar Täckström |
| 2011 | Ad-Hoc Meeting Transcription on Clusters of Mobile Devices. Michele Cossalter, Priya Sundararajan, Ian R. Lane |
| 2011 | Adaptation of Prosody in Speech Synthesis by Changing Command Values of the Generation Process Model of Fundamental Frequency. Keikichi Hirose, Keiko Ochi, Ryusuke Mihara, Hiroya Hashimoto, Daisuke Saito, Nobuaki Minematsu |
| 2011 | Adaptation of Speaker-Specific Bases in Non-Negative Matrix Factorization for Single Channel Speech-Music Separation. Emad M. Grais, Hakan Erdogan |
| 2011 | Adaptive Blocking Beamformer for Speech Separation. Ngoc Thuy Tran, William G. Cowley, André Pollok |
| 2011 | Adaptive Estimation of Zeros of Time-Varying Z-Transforms. Christian Fischer Pedersen, Ove Andersen, Paul Dalsgaard |
| 2011 | Adaptive Regularization Framework for Robust Voice Activity Detection. Xugang Lu, Masashi Unoki, Ryosuke Isotani, Hisashi Kawai, Satoshi Nakamura |
| 2011 | Adaptive Stream Fusion in Multistream Recognition of Speech. Nima Mesgarani, Samuel Thomas, Hynek Hermansky |
| 2011 | Adding Glottal Source Information to Intra-Lingual Voice Conversion. Javier Pérez, Antonio Bonafonte |
| 2011 | Adding a Speech Cursor to a Multimodal Dialogue System. Staffan Larsson, Alexander Berman, Jessica Villing |
| 2011 | Addressing the Data-Imbalance Problem in Kernel-Based Speaker Verification via Utterance Partitioning and Speaker Comparison. Wei Rao, Man-Wai Mak |
| 2011 | Age-Dependent Differences in the Neutralization of the Intervocalic Voicing Contrast: Evidence from an Apparent-Time Study on East Franconian. Viola Müller, Jonathan Harrington, Felicitas Kleber, Ulrich Reubold |
| 2011 | Agglomerative Hierarchical Clustering of Emotions in Speech Based on Subjective Relative Similarity. Ryoichi Takashima, Tohru Nagano, Ryuki Tachibana, Masafumi Nishimura |
| 2011 | Albayzín 2010: A Spanish Text to Speech Evaluation. Francisco Campillo, Francisco Méndez Pazó, Montserrat Arza, Laura Docío Fernández, Antonio Bonafonte, Eva Navas, Iñaki Sainz |
| 2011 | Alternative Frequency Scale Cepstral Coefficient for Robust Sound Event Recognition. Yi Ren Leng, Tran Huy Dat, Norihide Kitaoka, Haizhou Li |
| 2011 | An Accurate and Robust Gender Identification Algorithm. Andrea DeMarco, Stephen J. Cox |
| 2011 | An Active Learning Approach to Task Adaptation. Ji Wu, Zhiyang He, Ping Lv |
| 2011 | An Affective Spoken Storyteller. Felix Burkhardt |
| 2011 | An Analysis Framework Based on Random Subspace Sampling for Speaker Verification. Weiwu Jiang, Zhifeng Li, Helen M. Meng |
| 2011 | An Analysis of Automatic Speech Recognition with Multiple Microphones. Davide Marino, Thomas Hain |
| 2011 | An Analysis of PCA-Based Vocal Entrainment Measures in Married Couples' Affective Spoken Interactions. Chi-Chun Lee, Athanasios Katsamanis, Matthew P. Black, Brian R. Baucom, Panayiotis G. Georgiou, Shrikanth S. Narayanan |
| 2011 | An Analysis of Word Duration in Native Speakers and Japanese Speakers of English. Tomoko Nariai, Kazuyo Tanaka, Yoshiaki Itoh |
| 2011 | An Application to Test the Emotion Conveyed by Vocal and Musical Signals. Simone Carcone, Carlo Giovannella |
| 2011 | An Assessment of the Improvement Potential of Time-Frequency Masking for Speech Dereverberation. Chenxi Zheng, Tiago H. Falk, Wai-Yip Chan |
| 2011 | An Automatic Voice Pleasantness Classification System Based on Prosodic and Acoustic Patterns of Voice Preference. Luís Pinto Coelho, Daniela Braga, Miguel Sales Dias, Carmen García-Mateo |
| 2011 | An Efferent-Inspired Auditory Model Front-End for Speech Recognition. Chia-ying Lee, James R. Glass, Oded Ghitza |
| 2011 | An Efficient Pre-Processing Scheme to Improve the Sound Source Localization System in Noisy Environment. Sheng-Chieh Lee, K. Bharanitharan, Bo-Wei Chen, Jhing-Fa Wang, Chung-Hsien Wu, Min-Jian Liao |
| 2011 | An Efficient Unified Extraction Algorithm for Bilingual Data. Christoph Tillmann, Sanjika Hewavitharana |
| 2011 | An Electropalatographic and Acoustic Study on Anticipatory Coarticulation in V1#C2V2 Sequences in Standard Chinese. Yinghao Li, Jiangping Kong |
| 2011 | An Empirical Study of Multilingual Spoken Term Detection. Zejun Ma, Xiaorui Wang, Bo Xu |
| 2011 | An Empirical Study on Improving Hierarchical Phrase-Based Translation Using Alignment Features. Songfang Huang, Bowen Zhou |
| 2011 | An Engine-Independent Text-to-Speech Workplace. Margot Mieskes |
| 2011 | An Experimental Analysis of Pitch Patterns in Japanese Speakers of English with Verification by Speech Re-Synthesis. Tomoko Nariai, Kazuyo Tanaka |
| 2011 | An Exploratory Study of the Relations Between Perceived Emotion Strength and Articulatory Kinematics. Jangwon Kim, Sungbok Lee, Shrikanth S. Narayanan |
| 2011 | An HMM-Based Approach to the INTERSPEECH 2011 Speaker State Challenge. Albino Nogueiras Rodríguez |
| 2011 | An Informed Source Separation System for Speech Signals. Shuhua Zhang, Laurent Girin |
| 2011 | An International English Speech Corpus for Longitudinal Study of Accent Development. Rosemary Orr, Hugo Quené, Roeland van Beek, Thari Diefenbach, David A. van Leeuwen, Marijn Huijbregts |
| 2011 | An Investigation in Speech Recognition for Colloquial Arabic. Sarah Al-Shareef, Thomas Hain |
| 2011 | An Investigation of Depressed Speech Detection: Features and Normalization. Nicholas Cummins, Julien Epps, Michael Breakspear, Roland Goecke |
| 2011 | An i-vector Based Approach to Acoustic Sniffing for Irrelevant Variability Normalization Based Acoustic Model Training and Speech Recognition. Jian Xu, Yu Zhang, Zhi-Jie Yan, Qiang Huo |
| 2011 | An i-vector Based Approach to Training Data Clustering for Improved Speech Recognition. Yu Zhang, Jian Xu, Zhi-Jie Yan, Qiang Huo |
| 2011 | Analysing the Correspondence Between Automatic Prosodic Segmentation and Syntactic Structure. György Szaszák, Katalin Nagy, András Beke |
| 2011 | Analysis and Automatic Estimation of Children's Subglottal Resonances. Steven M. Lulich, Harish Arsikere, John R. Morton, Gary K. F. Leung, Abeer Alwan, Mitchell Sommers |
| 2011 | Analysis and Comparison of Recent MLP Features for LVCSR Systems. Fabio Valente, Mathew Magimai-Doss, Wen Wang |
| 2011 | Analysis of Acoustic-Prosodic Features Related to Paralinguistic Information Carried by Interjections in Dialogue Speech. Carlos Toshinori Ishi, Hiroshi Ishiguro, Norihiro Hagita |
| 2011 | Analysis of Dialectal Influence in Pan-Arabic ASR. Udhyakumar Nallasamy, Michael Garbus, Florian Metze, Qin Jin, Thomas Schaaf, Tanja Schultz |
| 2011 | Analysis of HMM-Based Lombard Speech Synthesis. Tuomo Raitio, Antti Suni, Martti Vainio, Paavo Alku |
| 2011 | Analysis of Inter-Articulator Correlation in Acoustic-to-Articulatory Inversion Using Generalized Smoothness Criterion. Prasanta Kumar Ghosh, Shrikanth S. Narayanan |
| 2011 | Analysis of i-vector Length Normalization in Speaker Recognition Systems. Daniel Garcia-Romero, Carol Y. Espy-Wilson |
| 2011 | Analyzing Training Dependencies and Posterior Fusion in Discriminant Classification of Apnea Patients Based on Sustained and Connected Speech. José Luis Blanco Murillo, Rubén Fernández Pozo, Doroteo Torre Toledano, Javier Caminero, Eduardo López |
| 2011 | Analyzing the Nature of ECA Interactions in Children with Autism. Emily Mower, Chi-Chun Lee, James Gibson, Theodora Chaspari, Marian E. Williams, Shrikanth S. Narayanan |
| 2011 | Anger Recognition in Spoken Dialog Using Linguistic and Para-Linguistic Information. Narichika Nomoto, Masafumi Tamoto, Hirokazu Masataki, Osamu Yoshioka, Satoshi Takahashi |
| 2011 | Announcing the Electromagnetic Articulography (Day 1) Subset of the mngu0 Articulatory Corpus. Korin Richmond, Phil Hoole, Simon King |
| 2011 | Aperiodicity Analysis for Quality Estimation of Text-to-Speech Signals. Christoph Norrenbrock, Ulrich Heute, Florian Hinterleitner, Sebastian Möller |
| 2011 | Applying Rhythm Features to Automatically Assess Non-Native Speech. Lei Chen, Klaus Zechner |
| 2011 | Applying the Quantitative Target Approximation Model (qTA) to German and Brazilian Portuguese. Plínio Almeida Barbosa, Hansjörg Mixdorff, Sandra Madureira |
| 2011 | Approximate Inference for Domain Detection in Spoken Language Understanding. Asli Celikyilmaz, Dilek Hakkani-Tür, Gökhan Tür |
| 2011 | Articulatory Feature Classification Using Nearest Neighbors. Arild Brandrud Næss, Karen Livescu, Rohit Prabhavalkar |
| 2011 | Articulatory Reduction in Mandarin Chinese Words. Jeffrey Berry, Sunjing Ji, Ian R. Fasel, Diana Archangeli |
| 2011 | Assessing Acoustic Reduction: Exploiting Local Structure in Speech. Louis ten Bosch, Annika Hämäläinen, Mirjam Ernestus |
| 2011 | Asynchronous Multimodal Text Entry Using Speech and Gesture Keyboards. Per Ola Kristensson, Keith Vertanen |
| 2011 | Attention, Sobriety Checkpoint! Can Humans Determine by Means of Voice, if Someone is Drunk... and Can Automatic Classifiers Compete? Stefan Ultes, Alexander Schmitt, Wolfgang Minker |
| 2011 | Auditory Filterbank Improves Voice Morphing. Erika Okamoto, Toshio Irino, Ryuichi Nisimura, Hideki Kawahara |
| 2011 | Auditory Speech Processing is Affected by Visual Speech in the Periphery. Jeesun Kim, Chris Davis |
| 2011 | Automatic Analysis of Singleton and Geminate Consonant Articulation Using Real-Time Magnetic Resonance Imaging. Christina Hagedorn, Michael I. Proctor, Louis Goldstein |
| 2011 | Automatic Assessment of Prosody in High-Stakes English Tests. Jian Cheng |
| 2011 | Automatic Call Quality Monitoring Using Cost-Sensitive Classification. Youngja Park |
| 2011 | Automatic Comma Insertion of Lecture Transcripts Based on Multiple Annotations. Yuya Akita, Tatsuya Kawahara |
| 2011 | Automatic Data-Driven Learning of Articulatory Primitives from Real-Time MRI Data Using Convolutive NMF with Sparseness Constraints. Vikram Ramanarayanan, Athanasios Katsamanis, Shrikanth S. Narayanan |
| 2011 | Automatic Detection of Anger in Human-Human Call Center Dialogs. Mustafa Erden, Levent M. Arslan |
| 2011 | Automatic Detection of Depression in Speech Using Gaussian Mixture Modeling with Factor Analysis. Douglas E. Sturim, Pedro A. Torres-Carrasquillo, Thomas F. Quatieri, Nicolas Malyska, Alan McCree |
| 2011 | Automatic Detection of Speaker Attributes Based on Utterance Text. Wen Wang, Andreas Kathol, Harry Bratt |
| 2011 | Automatic Determination of the Standard Chinese Prosodic Phrase Boundaries by F0 Generation Model. Shehui Bu, Zhenjie Zhuo, Lingling Yang, Shuichi Itahashi |
| 2011 | Automatic Generation of Listening Comprehension Learning Material in European Portuguese. Thomas Pellegrini, Rui Correia, Isabel Trancoso, Jorge Baptista, Nuno J. Mamede |
| 2011 | Automatic Identification of Salient Acoustic Instances in Couples' Behavioral Interactions Using Diverse Density Support Vector Machines. James Gibson, Athanasios Katsamanis, Matthew P. Black, Shrikanth S. Narayanan |
| 2011 | Automatic Learning in Content Indexing Service Using Phonetic Alignment. Yeon-Jun Kim, David C. Gibbon |
| 2011 | Automatic Prosodic Events Detection by Using Syllable-Based Acoustic, Lexical and Syntactic Features. Chong-Jia Ni, Wenju Liu, Bo Xu |
| 2011 | Automatic Prosody Generation for Serbo-Croatian Speech Synthesis Based on Regression Trees. Milan Secujski, Darko Pekar, Niksa Jakovljevic |
| 2011 | Automatic Selection of Acoustic and Non-Linear Dynamic Features in Voice Signals for Hypernasality Detection. Juan Rafael Orozco-Arroyave, S. Murillo Rendón, Andrés Marino Álvarez-Meza, Julián D. Arias-Londoño, Edilson Delgado-Trejos, Jesús Francisco Vargas-Bonilla, César Germán Castellanos-Domínguez |
| 2011 | Automatic Sentence Selection from Speech Corpora Including Diverse Speech for Improved HMM-TTS Synthesis Quality. Norbert Braunschweiler, Sabine Buchholz |
| 2011 | Automatic Speech Codec Identification with Applications to Tampering Detection of Speech Recordings. Jingting Zhou, Daniel Garcia-Romero, Carol Y. Espy-Wilson |
| 2011 | Automatic Speech Recognition System Dedicated for Polish. Mariusz Ziólko, Jakub Galka, Bartosz Ziólko, Tomasz Jadczyk, Dawid Skurzok, Mariusz Masior |
| 2011 | Automatic Subtitling of the Basque Parliament Plenary Sessions Videos. Germán Bordel, Silvia Nieto, Mikel Peñagarikano, Luis Javier Rodríguez, Amparo Varona |
| 2011 | Automatic Viseme Clustering for Audiovisual Speech Synthesis. Wesley Mattheyses, Lukas Latacz, Werner Verhelst |
| 2011 | Automatically Creating a Diphone Set from a Speech Database. Thomas Ewender, Beat Pfister |
| 2011 | Automatically Optimizing Utterance Classification Performance without Human in the Loop. Yun-Cheng Ju, Jasha Droppo |
| 2011 | Bayesian Extension of MUSIC for Sound Source Localization and Tracking. Takuma Otsuka, Kazuhiro Nakadai, Tetsuya Ogata, Hiroshi G. Okuno |
| 2011 | Bayesian Language Model Interpolation for Mobile Speech Input. Cyril Allauzen, Michael Riley |
| 2011 | Bilingual Acoustic Model Adaptation by Unit Merging on Different Levels and Cross-Level Integration. Ching-Feng Yeh, Chao-Yu Huang, Lin-Shan Lee |
| 2011 | Binaural Cues for Fragment-Based Speech Recognition in Reverberant Multisource Environments. Ning Ma, Jon Barker, Heidi Christensen, Phil D. Green |
| 2011 | Binaural Noise-Reduction Method Based on Blind Source Separation and Perceptual Post Processing. Jorge I. Marin-Hurtado, Devangi N. Parikh, David V. Anderson |
| 2011 | Biomechanical Tongue Models: An Approach to Studying Inter-Speaker Variability. Ralf Winkler, Susanne Fuchs, Pascal Perrier, Mark Tiede |
| 2011 | Blind Source Separation for Robot Audition Using Fixed Beamforming with HRTFs. Mounira Maazaoui, Yves Grenier, Karim Abed-Meraim |
| 2011 | Blind Speech Prior Estimation for Generalized Minimum Mean-Square Error Short-Time Spectral Amplitude Estimator. Ryo Wakisaka, Hiroshi Saruwatari, Kiyohiro Shikano, Tomoya Takatani |
| 2011 | Blind Speech Separation in Multiple Environments Using a Frequency Oriented PCA Method for Convolutive Mixtures. Yasmina Benabderrahmane, Sid-Ahmed Selouani, Douglas D. O'Shaughnessy |
| 2011 | Blind Speech Separation in Time-Domain Using Block-Toeplitz Structure of Reconstructed Signal Matrices. Zbynek Koldovský, Jirí Málek, Petr Tichavský |
| 2011 | Boosting Speaker Recognition Performance with Compact Representations. Sibel Yaman, Jason W. Pelecanos, Mohamed Kamal Omar |
| 2011 | Bootstrapping Domain Detection Using Query Click Logs for New Domains. Dilek Hakkani-Tür, Gökhan Tür, Larry P. Heck, Elizabeth Shriberg |
| 2011 | Breath-Detection-Based Telephony Speech Phrasing. Takashi Fukuda, Osamu Ichikawa, Masafumi Nishimura |
| 2011 | Building an Audio-Visual Corpus of Australian English: Large Corpus Collection with an Economical Portable and Replicable Black Box. Denis Burnham, Dominique Estival, Steven Fazio, Jette Viethen, Felicity Cox, Robert Dale, Steve Cassidy, Julien Epps, Roberto Togneri, Michael Wagner, Yuko Kinoshita, Roland Göcke, Joanne Arciuli, Mark Onslow, Trent W. Lewis, Andrew Butcher, John Hajek |
| 2011 | Can Audio-Visual Speech Recognition Outperform Acoustically Enhanced Speech Recognition in Automotive Environment? Rajitha Navarathna, Tristan Kleinschmidt, David Dean, Sridha Sridharan, Patrick Lucey |
| 2011 | Can Objective Measures Predict the Intelligibility of Modified HMM-Based Synthetic Speech in Noise? Cassia Valentini-Botinhao, Junichi Yamagishi, Simon King |
| 2011 | Candidate Generation for ASR Output Error Correction Using a Context-Dependent Syllable Cluster-Based Confusion Matrix. Chao-Hong Liu, Chung-Hsien Wu, David Sarwono, Jhing-Fa Wang |
| 2011 | Characterizing Deletion Transformations Across Dialects Using a Sophisticated Tying Mechanism. Nancy F. Chen, Wade Shen, Joseph P. Campbell |
| 2011 | Cheap Bootstrap of Multi-Lingual Hidden Markov Models. Daniele Falavigna, Roberto Gretter |
| 2011 | Children's Recognition of their own Voice: Influence of Phonological Impairment. Sofia Strömbergsson |
| 2011 | Chinese and Italian Speech Rhythm: Normalization and the CCI Algorithm. Chiara Bertini, Pier Marco Bertinetto, Na Zhi |
| 2011 | Chorus Digitalis: Experiments in Chironomic Choir Singing. Sylvain Le Beux, Lionel Feugère, Christophe d'Alessandro |
| 2011 | Classification of Fricatives Using Feature Extrapolation of Acoustic-Phonetic Features in Telephone Speech. Jung-won Lee, Jeung-Yoon Choi, Hong-Goo Kang |
| 2011 | Clustering Expressive Speech Styles in Audiobooks Using Glottal Source Parameters. Éva Székely, João P. Cabral, Peter Cahill, Julie Carson-Berndsen |
| 2011 | Clustering with Modified Cosine Distance Learned from Constraints. Leonid Rachevsky, Dimitri Kanevsky, Ruhi Sarikaya, Bhuvana Ramabhadran |
| 2011 | Coarticulation Across Prosodic Domains in Italian: An Ultrasound Investigation. Barbara Gili Fivela, Antonio Stella, Sonia D'Apolito, Francesco Sigona |
| 2011 | Collecting Life Logs for Experience-Based Corpora. Fabiano Francesconi, Arindam Ghosh, Giuseppe Riccardi, Marco Ronchetti, Alex Vagin |
| 2011 | Combined Optical Distance Sensing and Electropalatography to Measure Articulation. Peter Birkholz, Christiane Neuschaefer-Rube |
| 2011 | Combining Active and Semi-Supervised Learning for Homograph Disambiguation in Mandarin Text-to-Speech Synthesis. Binbin Shen, Zhiyong Wu, Yongxin Wang, Lianhong Cai |
| 2011 | Combining Evidence from Spectral and Source-Like Features for Person Recognition from Humming. Hemant A. Patil, Maulik C. Madhavi, Keshab K. Parhi |
| 2011 | Combining Feature Space Discriminative Training with Long-Term Spectro-Temporal Features for Noise-Robust Speech Recognition. Takashi Fukuda, Osamu Ichikawa, Masafumi Nishimura |
| 2011 | Combining Frame and Segment Level Processing via Temporal Pooling for Phonetic Classification. Sumit Chopra, Patrick Haffner, Dimitrios Dimitriadis |
| 2011 | Combining Information Sources for Confidence Estimation with CRF Models. Matthew Stephen Seigel, Philip C. Woodland |
| 2011 | Combining Lattice-Based Language Dependent and Independent Approaches for Out-of-Language Detection in LVCSR. Yuxiang Shan, Yan Deng, Jia Liu |
| 2011 | Combining Multiple Phoneme-Based Classifiers with Audio Feature-Based Classifier for the Detection of Alcohol Intoxication. Claude Montacié, Marie-José Caraty |
| 2011 | Combining Phonological and Acoustic ASR-Free Features for Pathological Speech Intelligibility Assessment. Catherine Middag, Tobias Bocklet, Jean-Pierre Martens, Elmar Nöth |
| 2011 | Commas Recovery with Syntactic Features in French and in Czech. Christophe Cerisara, Pavel Král, Claire Gardent |
| 2011 | Comparing Different Flavors of Spectro-Temporal Features for ASR. Bernd T. Meyer, Suman V. Ravuri, Marc René Schädler, Nelson Morgan |
| 2011 | Comparing Multi-Stage Approaches for Cross-Show Speaker Diarization. Viet-Anh Tran, Viet Bac Le, Claude Barras, Lori Lamel |
| 2011 | Comparing Syllable Frequencies in Corpora of Written and Spoken Language. Barbara Samlowski, Bernd Möbius, Petra Wagner |
| 2011 | Comparing System-Driven and Free Dialogue in In-Vehicle Interaction. Fredrik Kronlid, Jessica Villing, Alexander Berman, Staffan Larsson |
| 2011 | Comparing Word and Syllable Prominence Rated by Naïve Listeners. Denis Arnold, Bernd Möbius, Petra Wagner |
| 2011 | Comparing the Impact of Raised Vocal Effort on Various Spectral Parameters. Corinna Harwardt |
| 2011 | Comparison of Nasalance Measurements from Accelerometers and Microphones and Preliminary Development of Novel Features. Nicolas Audibert, Angélique Amelot |
| 2011 | Comparison of Smoothing Techniques for Robust Context Dependent Acoustic Modelling in Hybrid NN/HMM Systems. Guangsen Wang, Khe Chai Sim |
| 2011 | Comparison of Speaker Recognition Approaches for Real Applications. Sandro Cumani, Pier Domenico Batzu, Daniele Colibro, Claudio Vair, Pietro Laface, Vasileios Vasilakakis |
| 2011 | Comparison of Voice Activity Detectors for Interview Speech in NIST Speaker Recognition Evaluation. Hon-Bill Yu, Man-Wai Mak |
| 2011 | Compound Word Recombination for German LVCSR. Markus Nußbaum-Thom, Amr El-Desoky Mousa, Ralf Schlüter, Hermann Ney |
| 2011 | Computer and Human Recognition of Regional Accents of British English. Abualsoud Hanani, Martin J. Russell, Michael J. Carey |
| 2011 | Computer-Assisted Disfluency Counts for Stuttered Speech. Peter A. Heeman, Andy McMillin, J. Scott Yaruss |
| 2011 | Conditioned Hidden Markov Model Fusion for Multimodal Classification. Michael Glodek, Stefan Scherer, Friedhelm Schwenker |
| 2011 | Confidence Measures for Turkish Call Center Conversations. Ali Haznedaroglu, Levent M. Arslan |
| 2011 | Connected Digit Recognition by Means of Reservoir Computing. Azarakhsh Jalalvand, Fabian Triefenbach, David Verstraeten, Jean-Pierre Martens |
| 2011 | Constrained Cepstral Speaker Recognition Using Matched UBM and JFA Training. Michelle Hewlett Sanchez, Luciana Ferrer, Elizabeth Shriberg, Andreas Stolcke |
| 2011 | Context and Priming Effects in the Recognition of Emotion of Old and Young Listeners. Martijn Goudbeek, Marie Nilsenová |
| 2011 | Context and Speaker Dependency in the Relation of Vowel Formants and Subglottal Resonances - Evidence from Hungarian. Tekla Etelka Gráczi, Steven M. Lulich, Tamás Gábor Csapó, András Beke |
| 2011 | Context-Dependent Duration Modeling with Backoff Strategy and Look-Up Tables for Pronunciation Assessment and Mispronunciation Detection. Hongyan Li, Shen Huang, Shijin Wang, Bo Xu |
| 2011 | Continuous Control of the Degree of Articulation in HMM-Based Speech Synthesis. Benjamin Picart, Thomas Drugman, Thierry Dutoit |
| 2011 | Continuous Digits Recognition Leveraging Invariant Structure. Masayuki Suzuki, Gakuto Kurata, Masafumi Nishimura, Nobuaki Minematsu |
| 2011 | Continuous Episodic Memory Based Speech Recognition Using Articulatory Dynamics. Sébastien Demange, Slim Ouni |
| 2011 | Contributions of F1 and F2 (F2') to the Perception of Plosive Consonants. René Carré, Pierre L. Divenyi, Willy Serniclaes, Emmanuel Ferragne, Egidio Marsico, Viet Son Nguyen |
| 2011 | Convergence of Line Search A-Function Methods. Dimitri Kanevsky, David Nahamoo, Tara N. Sainath, Bhuvana Ramabhadran |
| 2011 | Conversational Speech Transcription Using Context-Dependent Deep Neural Networks. Frank Seide, Gang Li, Dong Yu |
| 2011 | Conversational-Side-Specific Inter-Session Variability Compensation. Mohamed Kamal Omar, Jason W. Pelecanos |
| 2011 | Conversing in the Presence of a Competing Conversation: Effects on Speech Production. Vincent Aubanel, Martin Cooke, Julián Villegas, María Luisa García Lecumberri |
| 2011 | Correlating Text with Prosody. Mohamed Abou-Zleikha, Julie Carson-Berndsen |
| 2011 | Correlation Analysis of Acoustic Features with Perceptual Voice Quality Similarity for Similar Speaker Selection. Yusuke Ijima, Mitsuaki Isogai, Hideyuki Mizuno |
| 2011 | Cross Likelihood Ratio Based Speaker Clustering Using Eigenvoice Models. David Wang, Robbie Vogt, Sridha Sridharan, David Dean |
| 2011 | Cross-Language Phone Recognition when the Target Language Phoneme Inventory is not Known. Timothy Kempton, Roger K. Moore, Thomas Hain |
| 2011 | Cross-Lingual Speaker Discrimination Using Natural and Synthetic Speech. Mirjam Wester, Hui Liang |
| 2011 | Cross-Lingual Study of ASR Errors: On the Role of the Context in Human Perception of Near-Homophones. Ioana Vasilescu, Dahbia Yahia, Natalie D. Snoeren, Martine Adda-Decker, Lori Lamel |
| 2011 | Cross-Rate Variation in the Intelligibility of Dual-Rate Gated Speech in Older Listeners. Valeriy Shafiro, Stanley Sheft, Robert Risley |
| 2011 | Crossmodal Prosodic and Gestural Contribution to the Perception of Contrastive Focus. Pilar Prieto, Cecilia Pugliesi, Joan Borràs-Comes, Ernesto Arroyo, Josep Blat |
| 2011 | Crowdsourcing Preference Tests, and How to Detect Cheating. Sabine Buchholz, Javier Latorre |
| 2011 | Crowdsourcing for Word Recognition in Noise. Martin Cooke, Jon Barker, María Luisa García Lecumberri, Krzysztof Wasilewski |
| 2011 | Data Sampling and Dimensionality Reduction Approaches for Reranking ASR Outputs Using Discriminative Language Models. Erinç Dikici, Murat Semerci, Murat Saraclar, Ethem Alpaydin |
| 2011 | Data Selection with Kurtosis and Nasality Features for Speaker Recognition. Howard Lei, Nikki Mirghafori |
| 2011 | Data-Driven Gaussian Component Selection for Fast GMM-Based Speaker Verification. Ce Zhang, Rong Zheng, Bo Xu |
| 2011 | Data-Driven UBM Generation via Tied Gaussians for GMM-Supervector Based Accent Identification. Rong Zheng, Ce Zhang, Bo Xu |
| 2011 | Decision Tree-Based Clustering with Outlier Detection for HMM-Based Speech Synthesis. Kyung Hwan Oh, June Sig Sung, Doo Hwa Hong, Nam Soo Kim |
| 2011 | Deep Belief Networks for Automatic Music Genre Classification. Xiaohong Yang, Qingcai Chen, Shusen Zhou, Xiaolong Wang |
| 2011 | Deep Convex Net: A Scalable Architecture for Speech Pattern Classification. Dong Yu, Li Deng |
| 2011 | Deep Learning of Speech Features for Improved Phonetic Recognition. Jaehyung Lee, Soo-Young Lee |
| 2011 | Denoising Using Optimized Wavelet Filtering for Automatic Speech Recognition. Randy Gomez, Tatsuya Kawahara |
| 2011 | Deploying Google Search by Voice in Cantonese. Yun-Hsuan Sung, Martin Jansche, Pedro J. Moreno |
| 2011 | Detecting Sleepiness by Fusing Classifiers Trained with Novel Acoustic Features. Tauhidur Rahman, Soroosh Mariooryad, Shalini Keshavamurthy, Gang Liu, John H. L. Hansen, Carlos Busso |
| 2011 | Detecting the Status of a Predictive Incremental Speech Understanding Model for Real-Time Decision-Making in a Spoken Dialogue System. David DeVault, Kenji Sagae, David R. Traum |
| 2011 | Detection of Shouted Speech in the Presence of Ambient Noise. Jouni Pohjalainen, Tuomo Raitio, Paavo Alku |
| 2011 | Detection of Task-Incomplete Dialogs Based on Utterance-and-Behavior Tag N-Gram for Spoken Dialog Systems. Sunao Hara, Norihide Kitaoka, Kazuya Takeda |
| 2011 | Determining what Questions to Ask, with the Help of Spectral Graph Theory. Abe Kazemzadeh, Sungbok Lee, Panayiotis G. Georgiou, Shrikanth S. Narayanan |
| 2011 | Developing a Broadband Automatic Speech Recognition System for Afrikaans. Febe de Wet, Alta de Waal, Gerhard B. Van Huyssteen |
| 2011 | Dialect and Accent Recognition Using Phonetic-Segmentation Supervectors. Fadi Biadsy, Julia Hirschberg, Daniel P. W. Ellis |
| 2011 | Dialog Methods for Improved Alphanumeric String Capture. Doug Peters, Peter Stubley |
| 2011 | Diarization-Based Speaker Retrieval for Broadcast Television Archives. Marijn Huijbregts, David A. van Leeuwen |
| 2011 | Dimensionality Reduction for Using High-Order n-Grams in SVM-Based Phonotactic Language Recognition. Mikel Peñagarikano, Amparo Varona, Luis Javier Rodríguez, Germán Bordel |
| 2011 | Direct Error Rate Minimization of Hidden Markov Models. Joseph Keshet, Chih-Chieh Cheng, Mark Stoehr, David A. McAllester |
| 2011 | Direct Estimation of Articulatory Kinematics from Real-Time Magnetic Resonance Image Sequences. Michael I. Proctor, Adam C. Lammert, Athanasios Katsamanis, Louis M. Goldstein, Christina Hagedorn, Shrikanth S. Narayanan |
| 2011 | Discrete Choice Models for Non-Intrusive Quality Assessment. Petko Nikolov Petkov, W. Bastiaan Kleijn, Bert de Vries |
| 2011 | Discrete/Continuous Modelling of Speaking Style in HMM-Based Speech Synthesis: Design and Evaluation. Nicolas Obin, Pierre Lanchantin, Anne Lacheret, Xavier Rodet |
| 2011 | Discriminant Sub-Space Projection of Spectro-Temporal Speech Features Based on Maximizing Mutual Information. Martin Heckmann, Claudius Gläser |
| 2011 | Discriminative Features for Language Identification. Christopher Alberti, Michiel Bacchiani |
| 2011 | Discriminatively Trained i-vector Extractor for Speaker Verification. Ondrej Glembek, Lukás Burget, Niko Brümmer, Oldrich Plchot, Pavel Matejka |
| 2011 | Distant Speech Recognition in a Smart Home: Comparison of Several Multisource ASRs in Realistic Conditions. Benjamin Lecouteux, Michel Vacher, François Portet |
| 2011 | Does it Groove or does it Stumble - Automatic Classification of Alcoholic Intoxication using Prosodic Features. Florian Hönig, Anton Batliner, Elmar Nöth |
| 2011 | Drink and Speak: On the Automatic Classification of Alcohol Intoxication by Acoustic, Prosodic and Text-Based Features. Tobias Bocklet, Korbinian Riedhammer, Elmar Nöth |
| 2011 | Dual-Mode AVQ Coding Based on Spectral Masking and Sparseness Detection for ITU-T G.711.1/G.722 Super-Wideband Extensions. Masahiro Fukui, Shigeaki Sasaki, Yusuke Hiwasaki, Sachiko Kurihara, Yoichi Haneda |
| 2011 | Dysperiodicity Analysis of Perceptually Assessed Synthetic Speech Stimuli. Ali Alpan, Francis Grenez, Jean Schoentgen |
| 2011 | ELAN - Aspects of Interoperability and Functionality. Han Sloetjes, Peter Wittenburg, Aarthy Somasundaram |
| 2011 | EM-Based Gain Adaptation for Probabilistic Multipitch Tracking. Michael Wohlmayr, Franz Pernkopf |
| 2011 | EasyAlign: An Automatic Phonetic Alignment Tool Under Praat. Jean-Philippe Goldman |
| 2011 | Effect of Language Experience on the Categorical Perception of Cantonese Vowel Duration. Caicai Zhang, Gang Peng, William S.-Y. Wang |
| 2011 | Effective Arabic Dialect Classification Using Diverse Phonotactic Models. Murat Akbacak, Dimitra Vergyri, Andreas Stolcke, Nicolas Scheffer, Arindam Mandal |
| 2011 | Effective Triphone Mapping for Acoustic Modeling in Speech Recognition. Sakhia Darjaa, Milos Cernak, Marián Trnka, Milan Rusko, Róbert Sabo |
| 2011 | Effects of Focus on f0 and Duration in Irish (Gaelic) Declaratives. Amelie Dorn, Ailbhe Ní Chasaide |
| 2011 | Effects of Query Expansion for Spoken Document Passage Retrieval. Tomoyosi Akiba, Koichiro Honda |
| 2011 | Effects of Shortening Speech Prompts of In-Car Voice User Interfaces on Users Mental Models. Julia Niemann, Kati Schulz, Ina Wechsung |
| 2011 | Efficient Harvesting of Internet Audio for Resource-Scarce ASR. Marelie H. Davel, Charl Johannes van Heerden, Neil Kleynhans, Etienne Barnard |
| 2011 | Efficient Probabilistic Tracking of User Goal and Dialog History for Spoken Dialog Systems. Antoine Raux, Yi Ma |
| 2011 | Efficient Speaker and Noise Normalization for Robust Speech Recognition. Vikas Joshi, Raghavendra Bilgi, Srinivasan Umesh, M. Carmen Benítez, Luz García |
| 2011 | Eigen-Voice Based Anchor Modeling System for Speaker Identification Using MLLR Super-Vector. Achintya Kumar Sarkar, Srinivasan Umesh |
| 2011 | Electroglottograph and Acoustic Cues for Phonation Contrasts in Taiwan Min Falling Tones. Ho-Hsien Pan, Mao-Hsu Chen, Shao-Ren Lyu |
| 2011 | Emotion Classification Using Inter- and Intra-Subband Energy Variation. Senaka Amarakeerthi, Tin Lay Nwe, Liyanage C. De Silva, Michael Cohen |
| 2011 | Emotion Classification of Infants' Cries Using Duration Ratios of Acoustic Segments. Kazuki Kitahara, Shinzi Michiwiki, Miku Sato, Shoichi Matsunaga, Masaru Yamashita, Kazuyuki Shinohara |
| 2011 | Emotion Detection Based on Concept Inference and Spoken Sentence Analysis for Customer Service. Ren-Ying Fang, Bo-Wei Chen, Jhing-Fa Wang, Chung-Hsien Wu |
| 2011 | Empirical Evaluation and Combination of Advanced Language Modeling Techniques. Tomás Mikolov, Anoop Deoras, Stefan Kombrink, Lukás Burget, Jan Cernocký |
| 2011 | Enhancements to the Training Process of Classifier-Based Speech Translator via Topic Modeling. Emil Ettelaie, Panayiotis G. Georgiou, Shrikanth S. Narayanan |
| 2011 | Enriching Text-to-Speech Synthesis Using Automatic Dialog Act Tags. Vivek Kumar Rangarajan Sridhar, Ann K. Syrdal, Alistair Conkie, Srinivas Bangalore |
| 2011 | Entropy-Rate Driven Inference of Stochastic Grammars. Unto K. Laine |
| 2011 | Epoch Extraction in High Pass Filtered Speech Using Hilbert Envelope. D. Govind, S. R. Mahadeva Prasanna, Debadatta Pati |
| 2011 | Error Selection for ASR-Based English Pronunciation Training in 'My Pronunciation Coach'. Catia Cucchiarini, Henk van den Heuvel, Eric Sanders, Helmer Strik |
| 2011 | Estimating Speaking Rate by Means of Rhythmicity Parameters. Christian Heinrich, Florian Schiel |
| 2011 | Estimation of Perceptual Spaces for Speaker Identities Based on the Cross-Lingual Discrimination Task. Minoru Tsuzaki, Keiichi Tokuda, Hisashi Kawai, Jinfu Ni |
| 2011 | Estimation of Window Coefficients for Dynamic Feature Extraction for HMM-Based Speech Synthesis. Ling-Hui Chen, Yoshihiko Nankaku, Heiga Zen, Keiichi Tokuda, Zhen-Hua Ling, Li-Rong Dai |
| 2011 | Evaluating Artificial Bandwidth Extension by Conversational Tests in Car Using Mobile Devices with Integrated Hands-Free Functionality. Laura Laaksonen, Ville Myllylä, Riitta Niemistö |
| 2011 | Evaluating the Meaning of Synthesized Listener Vocalizations. Sathish Pammi, Marc Schröder |
| 2011 | Evaluation of Abnormal Sound Detection using Multi-Stage GMM in Various Environments. Akinori Ito, Akihito Aiba, Masashi Ito, Shozo Makino |
| 2011 | Evaluation of Bone-Conducted Ultrasonic Hearing-Aid Regarding Transmission of Speaker Discrimination Information. Takayuki Kagomiya, Seiji Nakagawa |
| 2011 | Evaluation of Fast Spoken Term Detection Using a Suffix Array. Kouichi Katsurada, Shinta Sawada, Shigeki Teshima, Yurie Iribe, Tsuneo Nitta |
| 2011 | Evaluation of Glottal Epoch Detection Algorithms on Different Voice Types. João P. Cabral, John Kane, Christer Gobl, Julie Carson-Berndsen |
| 2011 | Evaluation of Listening-Oriented Dialogue Control Rules Based on the Analysis of HMMs. Toyomi Meguro, Yasuhiro Minami, Ryuichiro Higashinaka, Kohji Dohsaka |
| 2011 | Evaluation of Tree-Trellis Based Decoding in Over-Million LVCSR. Naoaki Ito, Yoshihiko Nankaku, Akinobu Lee |
| 2011 | Evaluation of an Integrated Authoring Tool for Building Advanced Question-Answering Characters. Sudeep Gandhe, Michael Rushforth, Priti Aggarwal, David R. Traum |
| 2011 | Evaluation of i-vector Speaker Recognition Systems for Forensic Application. Miranti Indar Mandasari, Mitchell McLaren, David A. van Leeuwen |
| 2011 | Event Selection from Phone Posteriorgrams Using Matched Filters. Keith Kintzley, Aren Jansen, Hynek Hermansky |
| 2011 | Exploiting Intra-Conversation Variability for Speaker Diarization. Stephen Shum, Najim Dehak, Ekapol Chuangsuwanich, Douglas A. Reynolds, James R. Glass |
| 2011 | Exploiting Phone-Class Specific Landmarks for Refinement of Segment Boundaries in TTS Databases. Vijayaditya Peddinti, Kishore Prahallad |
| 2011 | Exploring Bessel Features for Detection of Glottal Closure Instants. Chetana Prakash, N. Dhananjaya, Suryakanth V. Gangashetty |
| 2011 | Extending Audio Notetaker to Browse WebASR Transcriptions. Roger C. F. Tucker, Dan Fry, Vincent Wan, Stuart N. Wrigley, Thomas Hain |
| 2011 | Extending the Task of Diarization to Speaker Attribution. Houman Ghaemmaghami, David Dean, Robbie Vogt, Sridha Sridharan |
| 2011 | Extraction of Narrative Recall Patterns for Neuropsychological Assessment. Emily Tucker Prud'hommeaux, Brian Roark |
| 2011 | Factor Analysis Back Ends for MLLR Transforms in Speaker Recognition. Nicolas Scheffer, Yun Lei, Luciana Ferrer |
| 2011 | Factored MLLR Adaptation for Singing Voice Generation. June Sig Sung, Doo Hwa Hong, Shin Jae Kang, Nam Soo Kim |
| 2011 | Factored Translation Models for Improving a Speech into Sign Language Translation System. Verónica López-Ludeña, Rubén San Segundo, Ricardo de Córdoba, Javier Ferreiros, Juan Manuel Montero, José Manuel Pardo |
| 2011 | Fast and Simple Iterative Algorithm of Lp-Norm Minimization for Under-Determined Speech Separation. Yasuharu Hirasawa, Naoki Yasuraoka, Toru Takahashi, Tetsuya Ogata, Hiroshi G. Okuno |
| 2011 | Feature Combination Approaches for Discriminative Language Models. Ebru Arisoy, Bhuvana Ramabhadran, Hong-Kwang Jeff Kuo |
| 2011 | Feature Compensation for Speech Recognition in Severely Adverse Environments Due to Background Noise and Channel Distortion. Wooil Kim, John H. L. Hansen |
| 2011 | Feature Extraction Assessment for an Acoustic-Event Classification Task Using the Entropy Triangle. David Mejía-Navarrete, Ascensión Gallardo-Antolín, Carmen Peláez-Moreno, Francisco J. Valverde-Albacete |
| 2011 | Feature Frame Stacking in RNN-Based Tandem ASR Systems - Learned vs. Predefined Context. Martin Wöllmer, Björn W. Schuller, Gerhard Rigoll |
| 2011 | Feature Normalization Using Structured Full Transforms for Robust Speech Recognition. Xiong Xiao, Jinyu Li, Chng Eng Siong, Haizhou Li |
| 2011 | Feature-Space Transform Tying in Unified Acoustic-Articulatory Modelling for Articulatory Control of HMM-Based Speech Synthesis. Zhen-Hua Ling, Korin Richmond, Junichi Yamagishi |
| 2011 | Final /t/ Reduction in Dutch Past-Participles: The Role of Word Predictability and Morphological Decomposability. Iris Hanique, Mirjam Ernestus |
| 2011 | Fluency Changes with General Progress in L2 Proficiency. Jared Bernstein, Jian Cheng, Masanori Suzuki |
| 2011 | Formant Maps in Hungarian Vowels - Online Data Inventory for Research, and Education. Kálmán Abari, Zsuzsanna Zsófia Rácz, Gábor Olaszy |
| 2011 | Formant-Controlled HMM-Based Speech Synthesis. Ming Lei, Junichi Yamagishi, Korin Richmond, Zhen-Hua Ling, Simon King, Li-Rong Dai |
| 2011 | Frame-Level Vocal Effort Likelihood Space Modeling for Improved Whisper-Island Detection. Chi Zhang, John H. L. Hansen |
| 2011 | Frequency-Domain Representation of Source-Filter Coupling and its Effect in the Production of Voice. Tokihiko Kaburagi |
| 2011 | Frequency-Warped and Stabilized Time-Varying Cepstral Coefficients. Trond Skogstad, Torbjørn Svendsen |
| 2011 | From Interview to News Text: A Study of Taiwan TV Political Interviews in Newspaper Reports. Chin-Chih Chiang |
| 2011 | From Single-Call to Multi-Call Quality: A Study on Long-Term Quality Integration in Audio-Visual Speech Communication. Sebastian Möller, Chihuy Bang, Teele Tamme, Markus Vaalgamaa, Benjamin Weiss |
| 2011 | From Teleoperated Androids to Cellphones as Surrogates. Hiroshi Ishiguro |
| 2011 | Front-End Compensation Methods for LVCSR Under Lombard Effect. Hynek Boril, Frantisek Grézl, John H. L. Hansen |
| 2011 | Fundamental Frequency Estimation Using Modified Higher Order Moments and Multiple Windows. Alipah Pawi, Saeed Vaseghi, Ben Milner, Seyed Ghorshi |
| 2011 | Fusing Multiple Confidence Measures for Chinese Spoken Term Detection. Zejun Ma, Xiaorui Wang, Bo Xu |
| 2011 | GMM-Based Missing-Feature Reconstruction on Multi-Frame Windows. Ulpu Remes, Yoshihiko Nankaku, Keiichi Tokuda |
| 2011 | Gaussian Process Experts for Voice Conversion. Nicholas Pilkington, Heiga Zen, Mark J. F. Gales |
| 2011 | Generalized Baum-Welch Algorithm and its Implication to a New Extended Baum-Welch Algorithm. Roger Hsiao, Tanja Schultz |
| 2011 | Generalized Method for Solving the Permutation Problem in Frequency-Domain Blind Source Separation of Convolved Speech Signals. Auxiliadora Sarmiento, Iván Durán-Díaz, Sergio Cruces, Pablo Aguilera |
| 2011 | Generalized Variable Parameter HMMs for Noise Robust Speech Recognition. Ning Cheng, Xunying Liu, Lan Wang |
| 2011 | Generalized-Log Spectral Mean Normalization for Speech Recognition. Hilman Ferdinandus Pardede, Koichi Shinoda |
| 2011 | Generating Animated Pronunciation from Speech Through Articulatory Feature Extraction. Yurie Iribe, Silasak Manosavanh, Kouichi Katsurada, Ryoko Hayashi, Chunyue Zhu, Tsuneo Nitta |
| 2011 | Genre Categorization and Modeling for Broadcast Speech Transcription. Qingqing Zhang, Lori Lamel, Jean-Luc Gauvain |
| 2011 | Gesture Design of Hand-to-Speech Converter Derived from Speech-to-Hand Converter Based on Probabilistic Integration Model. Aki Kunikoshi, Yu Qiao, Daisuke Saito, Nobuaki Minematsu, Keikichi Hirose |
| 2011 | Globality-Locality Consistent Discriminant Analysis for Phone Classification. Heyun Huang, Yang Liu, Jort F. Gemmeke, Louis ten Bosch, Bert Cranen, Lou Boves |
| 2011 | GorUp: An Ontology-Driven Audio Information Retrieval System that Suits the Requirements of Under-Resourced Languages. Nora Barroso, Karmele López de Ipiña, Aitzol Ezeiza, Carmen Hernández, Nerea Ezeiza, Odei Barroso, Unai Susperregi, Simeon Barroso |
| 2011 | Grapheme-Based Automatic Speech Recognition Using KL-HMM. Mathew Magimai-Doss, Ramya Rasipuram, Guillermo Aradilla, Hervé Bourlard |
| 2011 | Grapheme-to-Phoneme Conversion Using Conditional Random Fields. Irina Illina, Dominique Fohr, Denis Jouvet |
| 2011 | Graphone Model Interpolation and Arabic Pronunciation Generation. T. Li, Philip C. Woodland, Frank Diehl, Mark J. F. Gales |
| 2011 | Growing a Spoken Language Interface on Amazon Mechanical Turk. Ian McGraw, James R. Glass, Stephanie Seneff |
| 2011 | HMM-Based Emphatic Speech Synthesis Using Unsupervised Context Labeling. Yu Maeno, Takashi Nose, Takao Kobayashi, Yusuke Ijima, Hideharu Nakajima, Hideyuki Mizuno, Osamu Yoshioka |
| 2011 | Harmonic Structure Transform for Speaker Recognition. Kornel Laskowski, Qin Jin |
| 2011 | Hidden Boosted MMI and Hierarchical State Posterior Feature for Automatic Speech Recognition Based on Hidden Conditional Neural Fields. Yasuhisa Fujii, Kazumasa Yamamoto, Seiichi Nakagawa |
| 2011 | Hierarchical Audio Segmentation with HMM and Factor Analysis in Broadcast News Domain. Diego Castán, Carlos Vaquero, Alfonso Ortega, David Martínez González, Jesús Antonio Villalba López, Eduardo Lleida |
| 2011 | Hierarchical Stress Modeling in Mandarin Text-to-Speech. Ya Li, Jianhua Tao, Xiaoying Xu |
| 2011 | Hierarchical Tandem Features for ASR in Mandarin. Joel Pinto, Mathew Magimai-Doss, Hervé Bourlard |
| 2011 | How Realistic is Artificially Added Noise? Thomas Winkler |
| 2011 | Hybrid Language Models Using Mixed Types of Sub-Lexical Units for Open Vocabulary German LVCSR. M. Ali Basha Shaik, Amr El-Desoky Mousa, Ralf Schlüter, Hermann Ney |
| 2011 | Hybrid Speech Recognition for Voice Search: A Comparative Study. Evandro B. Gouvêa |
| 2011 | I3A Language Recognition System for Albayzin 2010 LRE. David Martínez González, Jesús Antonio Villalba López, Antonio Miguel, Alfonso Ortega, Eduardo Lleida |
| 2011 | Identifying Agreement/Disagreement in Conversational Speech: A Cross-Lingual Study. Wen Wang, Kristin Precoda, Colleen Richey, Geoffrey Raymond |
| 2011 | Identifying Regions of Non-Modal Phonation Using Features of the Wavelet Transform. John Kane, Christer Gobl |
| 2011 | Image Processing Filters for Line Detection-Based Spoken Term Detection. Kazuyuki Noritake, Hiroaki Nanjo, Takehiko Yoshimi |
| 2011 | Image Representation of the Subband Power Distribution for Robust Sound Classification. Jonathan William Dennis, Tran Huy Dat, Haizhou Li |
| 2011 | Impact of Different Feedback Mechanisms in EMG-Based Speech Recognition. Christian Herff, Matthias Janke, Michael Wand, Tanja Schultz |
| 2011 | Impact of Speaker Variability on Speech Perception in Non-Native Listeners. Wim A. van Dommelen, Valérie Hazan |
| 2011 | Implicit Segmentation in Two-Wire Speaker Recognition. Yosef A. Solewicz, Hagai Aronowitz |
| 2011 | Improved Acoustic Characterization of Breathy and Whispery Voices. Carlos Toshinori Ishi, Hiroshi Ishiguro, Norihiro Hagita |
| 2011 | Improved Acoustic Feature Combination for LVCSR by Neural Networks. Christian Plahl, Ralf Schlüter, Hermann Ney |
| 2011 | Improved Bottleneck Features Using Pretrained Deep Neural Networks. Dong Yu, Michael L. Seltzer |
| 2011 | Improved Classification of Speaking Styles for Mental Health Monitoring Using Phoneme Dynamics. Keng-hao Chang, Howard Lei, John F. Canny |
| 2011 | Improved HNM-Based Vocoder for Statistical Synthesizers. Daniel Erro, Iñaki Sainz, Eva Navas, Inma Hernáez |
| 2011 | Improved Overlapped Speech Handling for Speaker Diarization. Kofi Boakye, Oriol Vinyals, Gerald Friedland |
| 2011 | Improved Quality for Conversational VoIP Using Path Diversity. Qipeng Gong, Peter Kabal |
| 2011 | Improved Spoken Query Transcription Using Co-Occurrence Information. Jonathan Mamou, Abhinav Sethy, Bhuvana Ramabhadran, Ron Hoory, Paul Vozila |
| 2011 | Improved Tonal Language Speech Recognition by Integrating Spectro-Temporal Evidence and Pitch Information with Properly Chosen Tonal Acoustic Units. Shang-wen Li, Yow-Bang Wang, Liang-Che Sun, Lin-Shan Lee |
| 2011 | Improved a posteriori Speech Presence Probability Estimation Based on Cepstro-Temporal Smoothing and Time-Frequency Correlation. Chao Li, Wenju Liu |
| 2011 | Improvement of Segmental Mispronunciation Detection with Prior Knowledge Extracted from Large L2 Speech Corpus. Dean Luo, Xuesong Yang, Lan Wang |
| 2011 | Improvements in Speaker Characterization Using Spectral Subband Energy Based on Harmonic plus Noise Model. Yanhua Long, Zhi-Jie Yan, Frank K. Soong, Li-Rong Dai, Wu Guo |
| 2011 | Improvements of a Dual-Input DBN for Noise Robust ASR. Yang Sun, Jort F. Gemmeke, Bert Cranen, Louis ten Bosch, Lou Boves |
| 2011 | Improving LVCSR System Combination Using Neural Network Language Model Cross Adaptation. Xunying Liu, Mark J. F. Gales, Philip C. Woodland |
| 2011 | Improving Multiband Position-Pitch Algorithm for Localization and Tracking of Multiple Concurrent Speakers by Using a Frequency Selective Criterion. Tania Habib, Harald Romsdorfer |
| 2011 | Improving Non-Native ASR Through Stochastic Multilingual Phoneme Space Transformations. David Imseng, Hervé Bourlard, John Dines, Philip N. Garner, Mathew Magimai-Doss |
| 2011 | In Search of Cues Discriminating West-African Accents in French. Philippe Boula de Mareüil, Jean-Luc Rouas, Manuela Yapomo |
| 2011 | Incorporating Regional Information to Enhance MAP-Based Stochastic Feature Compensation for Robust Speech Recognition. Yu Tsao, Paul R. Dixon, Chiori Hori, Hisashi Kawai |
| 2011 | Incorporating Speech Recognition Engine into an Intelligent Assistive Reading System for Dyslexic Students. Theologos Athanaselis, Stelios Bakamidis, Ioannis Dologlou, Evmorfia N. Argyriou, Antonios Symvonis |
| 2011 | Incremental Learning and Forgetting in Stochastic Turn-Taking Models. Kornel Laskowski, Jens Edlund, Mattias Heldner |
| 2011 | Individual Error Minimization Learning Framework and its Applications to Speech Recognition and Utterance Verification. Sunghwan Shin, Ho-Young Jung, Biing-Hwang Juang |
| 2011 | Information Bottleneck Features for HMM/GMM Speaker Diarization of Meetings Recordings. Sree Harsha Yella, Fabio Valente |
| 2011 | Instantaneous Speaker Adaptation Through Selection and Combination of fMLLR Transformation Matrices. Diego Giuliani, Fabio Brugnara |
| 2011 | Integrated Online Speaker Clustering and Adaptation. Catherine Breslin, K. K. Chin, Mark J. F. Gales, Kate M. Knill |
| 2011 | Integrating Recent MLP Feature Extraction Techniques into TRAP Architecture. Frantisek Grézl, Martin Karafiát |
| 2011 | Interactional Style Detection for Versatile Dialogue Response Using Prosodic and Semantic Features. Wei-Bin Liang, Chung-Hsien Wu, Chih-Hung Wang, Jhing-Fa Wang |
| 2011 | Intermediate-State HMMs to Capture Continuously-Changing Signal Features. Gustav Eje Henter, W. Bastiaan Kleijn |
| 2011 | Intersession Compensation and Scoring Methods in the i-vectors Space for Speaker Recognition. Pierre-Michel Bousquet, Driss Matrouf, Jean-François Bonastre |
| 2011 | Intonation Conversion from Neutral to Expressive Speech. Christophe Veaux, Xavier Rodet |
| 2011 | Intonation of Left Dislocated Topics in Modern Greek. David Le Gac, Hiyon Yoo |
| 2011 | Intoxicated Speech Detection by Fusion of Speaker Normalized Hierarchical Features and GMM Supervectors. Daniel Bone, Matthew Black, Ming Li, Angeliki Metallinou, Sungbok Lee, Shrikanth S. Narayanan |
| 2011 | Intoxication Detection Using Phonetic, Phonotactic and Prosodic Cues. Fadi Biadsy, William Yang Wang, Andrew Rosenberg, Julia Hirschberg |
| 2011 | Intra-, Inter-, and Cross-Cultural Classification of Vocal Affect. Daniel Neiberg, Petri Laukka, Hillary Anger Elfenbein |
| 2011 | Inverse Filtering Based Harmonic Plus Noise Excitation Model for HMM-Based Speech Synthesis. Zhengqi Wen, Jianhua Tao |
| 2011 | Investigating Robustness of Spectral Moments on Normal- and High-Effort Speech. Frederike Gottsmann, Corinna Harwardt |
| 2011 | Investigating the Effect of Number of Interlocutors on the Quality of Experience for Multi-Party Audio Conferencing. Janto Skowronek, Alexander Raake |
| 2011 | Investigating the Stability of Intergestural Timing Relations. Juraj Simko, Fred Cummins, Stefan Benus |
| 2011 | Investigation of Cross-Show Speaker Diarization. Qian Yang, Qin Jin, Tanja Schultz |
| 2011 | Investigation of Spontaneous Speech Characterization Applied to Speaker Role Recognition. Richard Dufour, Yannick Estève, Paul Deléglise |
| 2011 | Investigations on Speaking Mode Discrepancies in EMG-Based Speech Recognition. Michael Wand, Matthias Janke, Tanja Schultz |
| 2011 | Is the Perception of Voice Quality Language-Dependant? A Comparison of French and Italian Listeners and Dysphonic Speakers. Alain Ghio, Frédérique Weisz, Giovanna Baracca, Giovanna Cantarella, Danièle Robert, Virginie Woisard, Franco Fussi, Antoine Giovanni |
| 2011 | Italian in the No-Man's Land Between Stress-Timing and Syllable-Timing? Speakers are More Stress-Timed than Listeners. Bettina Braun, Sabine Geiselmann |
| 2011 | Iterative Improvement of Speaker Segmentation in a Noisy Environment Using High-Level Knowledge. Qiang Huang, Stephen J. Cox |
| 2011 | Java Visual Speech Components for Rapid Application Development of GUI Based Speech Processing Applications. Stefan Steidl, Korbinian Riedhammer, Tobias Bocklet, Florian Hönig, Elmar Nöth |
| 2011 | Jaw Movement in Vowels and Liquids Forming the Syllable Nucleus. Stefan Benus, Marianne Pouplier |
| 2011 | Joint Application of Speech and Speaker Recognition for Automation and Security in Smart Home. Kong-Aik Lee, Anthony Larcher, Helen Thai, Bin Ma, Haizhou Li |
| 2011 | Joint Bilinear Transformation Space Based Maximum a posteriori Linear Regression Adaptation Using Prior with Variance Function. Hwa Jeon Song, Yunkeun Lee, Hyung Soon Kim |
| 2011 | Joint Robust Voicing Detection and Pitch Estimation Based on Residual Harmonics. Thomas Drugman, Abeer Alwan |
| 2011 | Joint Target and Join Cost Weight Training for Unit Selection Synthesis. Lukas Latacz, Wesley Mattheyses, Werner Verhelst |
| 2011 | Kernel Alignment Maximization for Speaker Recognition Based on High-Level Features. Szymon Drgas, Adam Dabrowski |
| 2011 | Kernel Models for Affective Lexicon Creation. Nikos Malandrakis, Alexandros Potamianos, Elias Iosif, Shrikanth S. Narayanan |
| 2011 | Kernel PCA for Speech Enhancement. Christina Leitner, Franz Pernkopf, Gernot Kubin |
| 2011 | Kernel Partial Least Squares for Speaker Recognition. Balaji Vasan Srinivasan, Daniel Garcia-Romero, Dmitry N. Zotkin, Ramani Duraiswami |
| 2011 | Keyphrase Cloud Generation of Broadcast News. Luís Marujo, Márcio Viveiros, João Paulo Neto |
| 2011 | Kullback-Leibler Divergence-Based ASR Training Data Selection. Evandro Gouvêa, Marelie H. Davel |
| 2011 | L1/L2 Perception of Lexical Stress with F0 Peak-Delay: Effect of an Extra Syllable Added. Shinichi Tokuma, Yi Xu |
| 2011 | LP Residual Features for Robust, Privacy-Sensitive Speaker Diarization. Sree Hari Krishnan Parthasarathi, Hervé Bourlard, Daniel Gatica-Perez |
| 2011 | Language Disorders: Viewpoints on a Complex Object. Gabriele Miceli |
| 2011 | Language Identification for Text Chats. Vesa Siivola, Bryan L. Pellom, Meagan Sills |
| 2011 | Language Model Expansion Using Webdata for Spoken Document Retrieval. Ryo Masumura, Seongjun Hahm, Akinori Ito |
| 2011 | Language Recognition in iVectors Space. David Martínez González, Oldrich Plchot, Lukás Burget, Ondrej Glembek, Pavel Matejka |
| 2011 | Language Recognition via i-vectors and Dimensionality Reduction. Najim Dehak, Pedro A. Torres-Carrasquillo, Douglas A. Reynolds, Réda Dehak |
| 2011 | Language-Independent Socio-Emotional Role Recognition in the AMI Meetings Corpus. Fabio Valente, Alessandro Vinciarelli |
| 2011 | Large Margin - Minimum Classification Error Using Sum of Shifted Sigmoids as the Loss Function. Madhavi Vedula Ratnagiri, Biing-Hwang Juang, Lawrence R. Rabiner |
| 2011 | Large Vocabulary SOUL Neural Network Language Models. Hai Son Le, Ilya Oparin, Abdelkhalek Messaoudi, Alexandre Allauzen, Jean-Luc Gauvain, François Yvon |
| 2011 | Large-Scale Experiments on Data-Driven Design of Commercial Spoken Dialog Systems. David Suendermann, Jackson Liscombe, Jonathan Bloom, Grace Li, Roberto Pieraccini |
| 2011 | Large-Scale Subjective Evaluations of Speech Rate Control Methods for HMM-Based Speech Synthesizers. Tsuneo Kato, Makoto Yamada, Nobuyuki Nishizawa, Keiichiro Oura, Keiichi Tokuda |
| 2011 | Laryngealization and Breathiness in Persian. Vahid Sadeghi |
| 2011 | Latent Topic Modeling for Audio Corpus Summarization. Timothy J. Hazen |
| 2011 | Lattice Based Discriminative Model Combination Using Automatically Induced Phonetic Contexts. Hao Huang, Bing Hu Li |
| 2011 | Lattice-Based Risk Minimization Training for Unsupervised Language Model Adaptation. Akio Kobayashi, Takahiro Oku, Shinichi Homma, Toru Imai, Seiichi Nakagawa |
| 2011 | Learning Influences from Word Use in Polylogue. Tomoharu Iwata, Shinji Watanabe |
| 2011 | Learning New Acoustic Events in an HMM-Based System Using MAP Adaptation. Jürgen T. Geiger, Mohamed Anouar Lakhal, Björn W. Schuller, Gerhard Rigoll |
| 2011 | Learning Place-Names from Spoken Utterances and Localization Results by Mobile Robot. Ryo Taguchi, Yuji Yamada, Koosuke Hattori, Taizo Umezaki, Masahiro Hoguro, Naoto Iwahashi, Kotaro Funakoshi, Mikio Nakano |
| 2011 | Learning Score Structure from Spoken Language for a Tennis Game. Qiang Huang, Stephen J. Cox |
| 2011 | Learning Weighted Entity Lists from Web Click Logs for Spoken Language Understanding. Dustin Hillard, Asli Celikyilmaz, Dilek Hakkani-Tür, Gökhan Tür |
| 2011 | Learning from Mistakes: Expanding Pronunciation Lexicons Using Word Recognition Errors. Sravana Reddy, Evandro B. Gouvêa |
| 2011 | Leja Ordering LSFs for Accurate Estimation of Predictor Coefficients. Christian Fischer Pedersen |
| 2011 | Let's All Speak Together! Exploring the Impact of Various Languages on the Comprehension of Speech in Multi-Linguistic Babble. Aurore Gautreau, Michel Hoen, Fanny Meunier |
| 2011 | Letter-to-Phoneme Conversion Based on Two-Stage Neural Network Focusing on Letter and Phoneme Contexts. Kheang Seng, Yurie Iribe, Tsuneo Nitta |
| 2011 | Leveraging Relevance Cues for Improved Spoken Document Retrieval. Pei-Ning Chen, Kuan-Yu Chen, Berlin Chen |
| 2011 | Linear Dynamic Models for Voice Activity Detection. Kannu Mehta, Chau Khoa Pham, Chng Eng Siong |
| 2011 | Log-Linear Optimization of Second-Order Polynomial Features with Subsequent Dimension Reduction for Speech Recognition. Muhammad Ali Tahir, Ralf Schlüter, Hermann Ney |
| 2011 | Long Term Average Speech Spectra in Yolngu Matha and Pitjantjatjara Speaking Females and Males. Hywel Stoakes, Andrew Butcher, Janet Fletcher, Marija Tabain |
| 2011 | Long-Distance Rhythmic Dependencies and their Application to Automatic Language Identification. Joseph Tepperman, Emily Nava |
| 2011 | Lossless Value Directed Compression of Complex User Goal States for Statistical Spoken Dialogue Systems. Paul A. Crook, Oliver Lemon |
| 2011 | Low and High, Short and Long by Crook or by Hook? Oliver Niebuhr, Astrid Wolf |
| 2011 | Low-Frequency Bandwidth Extension of Telephone Speech Using Sinusoidal Synthesis and Gaussian Mixture Model. Hannu Pulakka, Ulpu Remes, Santeri Yrttiaho, Kalle J. Palomäki, Mikko Kurimo, Paavo Alku |
| 2011 | Making an Automatic Speech Recognition Service Freely Available on the Web. Stuart N. Wrigley, Thomas Hain |
| 2011 | Mandarin Word-Character Hybrid-Input Neural Network Language Model. Moonyoung Kang, Tim Ng, Long Nguyen |
| 2011 | Mapping Sparse Representation to State Likelihoods in Noise-Robust Automatic Speech Recognition. Katariina Mahkonen, Antti Hurmalainen, Tuomas Virtanen, Jort F. Gemmeke |
| 2011 | Matrix-Variate Distribution of Training Models for Robust Speaker Adaptation. Yongwon Jeong, Young Kuk Kim |
| 2011 | Maximum Confidence Measure Based Interaural Phase Difference Estimation for Noise Masking in Dual-Microphone Robust Speech Recognition. Hsien-Cheng Liao, Yuan-Fu Liao, Chin-Hui Lee |
| 2011 | Maximum Entropy Based Data Selection for Speaker Recognition. Chien-Lin Huang, Bin Ma |
| 2011 | Maximum Likelihood i-vector Space Using PCA for Speaker Verification. Zhenchun Lei, Yingchun Yang |
| 2011 | Maximum a posteriori Estimation of Noise from Non-Acoustic Reference Signals in Very Low Signal-to-Noise Ratio Environments. Ben Milner |
| 2011 | Measurement of Objective Intelligibility of Japanese Accented English Using ERJ (English Read by Japanese) Database. Nobuaki Minematsu, Koji Okabe, Keisuke Ogaki, Keikichi Hirose |
| 2011 | Measuring Acoustic-Prosodic Entrainment with Respect to Multiple Levels and Dimensions. Rivka Levitan, Julia Hirschberg |
| 2011 | Measuring Final Lengthening for Speaker-Change Prediction. Anna Hjalmarsson, Kornel Laskowski |
| 2011 | Measuring Speakers' Similarity in Speech by Means of Prosodic Cues: Methods and Potential. Céline De Looze, Stéphane Rauzy |
| 2011 | Memory-Based Approximation of the Gaussian Mixture Model Framework for Bandwidth Extension of Narrowband Speech. Amr H. Nour-Eldin, Peter Kabal |
| 2011 | Method for Speech Inversion with Large Scale Statistical Evaluation. Heikki Rasilo, Unto K. Laine, Okko Johannes Räsänen, Toomas Altosaar |
| 2011 | Minimum Classification Error Based Spectro-Temporal Feature Extraction for Robust Audio Classification. Yuan-Fu Liao, Chia-Hsing Lin, We-Der Fang |
| 2011 | Mixture of Auto-Associative Neural Networks for Speaker Verification. Garimella S. V. S. Sivaram, Samuel Thomas, Hynek Hermansky |
| 2011 | Mixture of PLDA Models in i-vector Space for Gender-Independent Speaker Recognition. Mohammed Senoussaoui, Patrick Kenny, Niko Brümmer, Edward de Villiers, Pierre Dumouchel |
| 2011 | Modality Selection and Perceived Mental Effort in a Mobile Application. Stefan Schaffer, Benjamin Jöckel, Ina Wechsung, Robert Schleicher, Sebastian Möller |
| 2011 | Model Adaptation for Automatic Speech Recognition Based on Multiple Time Scale Evolution. Shinji Watanabe, Atsushi Nakamura, Biing-Hwang Juang |
| 2011 | Modeling Broad Context for Tone Recognition with Conditional Random Fields. Siwei Wang, Gina-Anne Levow |
| 2011 | Modeling Speaker Personality Using Voice. Tim Polzehl, Sebastian Möller, Florian Metze |
| 2011 | Modelling Novelty Preference in Word Learning. Maarten Versteegh, Louis ten Bosch, Lou Boves |
| 2011 | Modulation Spectrum Analysis for Recognition of Reverberant Speech. Sri Harish Reddy Mallidi, Sriram Ganapathy, Hynek Hermansky |
| 2011 | Monaural Azimuth Localization Using Spectral Dynamics of Speech. Roi Kliper, Hendrik Kayser, Daphna Weinshall, Israel Nelken, Jörn Anemüller |
| 2011 | Monaural Sound Localization. Anna Katharina Fuchs, Christian Feldbauer, Michael Stark |
| 2011 | Monaural Speech Separation Based on a 2D Processing and Harmonic Analysis. Azam Rabiee, Saeed Setayeshi, Soo-Young Lee |
| 2011 | Monaural Voiced Speech Segregation Based on Pitch and Comb Filter. Xueliang Zhang, Wenju Liu |
| 2011 | Morpheme Based Factored Language Models for German LVCSR. Amr El-Desoky Mousa, M. Ali Basha Shaik, Ralf Schlüter, Hermann Ney |
| 2011 | Morpheme Conversion for Connecting Speech Recognizer and Language Analyzers in Unsegmented Languages. Kenji Imamura, Tomoko Izumi, Kugatsu Sadamitsu, Kuniko Saito, Satoshi Kobashikawa, Hirokazu Masataki |
| 2011 | Morphological Variation in the Adult Vocal Tract: A Modeling Study of its Potential Acoustic Impact. Adam C. Lammert, Michael I. Proctor, Athanasios Katsamanis, Shrikanth S. Narayanan |
| 2011 | Mtrans: A Multi-Channel, Multi-Tier Speech Annotation Tool. Julián Villegas, Martin Cooke, Vincent Aubanel, Marco Aldo Piccolino Boniforti |
| 2011 | Multi-Accent Speech Recognition of Afrikaans, Black and White Varieties of South African English. Herman Kamper, Thomas Niesler |
| 2011 | Multi-Channel Voice Activity Detection Based on Conic Constraints. Gibak Kim |
| 2011 | Multi-Party Speech Recovery Exploiting Structured Sparsity Models. Afsaneh Asaei, Mohammad Javad Taghizadeh, Hervé Bourlard, Volkan Cevher |
| 2011 | Multi-Sensor Voice Activity Detection Based on Multiple Observation Hypothesis Testing. Theodore Petsatodis, Fotios Talantzis, Christos Boukis, Zheng-Hua Tan, Ramjee Prasad |
| 2011 | Multi-Speaker Modeling with Shared Prior Distributions and Model Structures for Bayesian Speech Synthesis. Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda |
| 2011 | Multi-Task Learning for Spoken Language Understanding with Shared Slots. Xiao Li, Ye-Yi Wang, Gökhan Tür |
| 2011 | Multi-View Approach for Speaker Turn Role Labeling in TV Broadcast News Shows. Géraldine Damnati, Delphine Charlet |
| 2011 | Multipulse Sequences for Residual Signal Modeling. Ranniery Maia, Heiga Zen, Kate M. Knill, Mark J. F. Gales, Sabine Buchholz |
| 2011 | Multistream Bandpass Modulation Features for Robust Speech Recognition. Sridhar Krishna Nemala, Kailash Patil, Mounya Elhilali |
| 2011 | N-Grams for Conditional Random Fields or a Failure-Transition(f) Posterior for Acyclic FSTs. Patrick Lehnen, Stefan Hahn, Hermann Ney |
| 2011 | NeMo: A Platform for Multilingual News Monitoring. Christian Girardi, Roberto Gretter, Daniele Falavigna, Fabio Brugnara, Diego Giuliani, Marcello Federico |
| 2011 | Nearest Neighbors with Learned Distances for Phonetic Frame Classification. John Labiak, Karen Livescu |
| 2011 | Neural Representations of Word Meanings. Tom M. Mitchell |
| 2011 | Neutral to Target Emotion Conversion Using Source and Suprasegmental Information. D. Govind, S. R. Mahadeva Prasanna, Bayya Yegnanarayana |
| 2011 | New Developments in Joint Factor Analysis for Speaker Verification. Hagai Aronowitz, Oren Barkan |
| 2011 | New Developments in Voice Biometrics for User Authentication. Hagai Aronowitz, Ron Hoory, Jason W. Pelecanos, David Nahamoo |
| 2011 | New Feature Parameters for Pronunciation Evaluation in English Presentations at International Conferences. Hiroshi Kibishi, Seiichi Nakagawa |
| 2011 | New Methods for Template Selection and Compression in Continuous Speech Recognition. Xie Sun, Yunxin Zhao |
| 2011 | Noise Robust Feature Extraction Based on Extended Weighted Linear Prediction in LVCSR. Sami Keronen, Jouni Pohjalainen, Paavo Alku, Mikko Kurimo |
| 2011 | Noise Robust Speaker-Independent Speech Recognition with Invariant-Integration Features Using Power-Bias Subtraction. Florian Müller, Alfred Mertins |
| 2011 | Novel VTEO Based Mel Cepstral Features for Classification of Normal and Pathological Voices. Hemant A. Patil, Pallavi N. Baljekar |
| 2011 | OOV Detection and Recovery Using Hybrid Models with Different Fragments. Long Qin, Ming Sun, Alexander I. Rudnicky |
| 2011 | OOV Sensitive Named-Entity Recognition in Speech. Carolina Parada, Mark Dredze, Frederick Jelinek |
| 2011 | Objective Intelligibility Prediction of Speech by Combining Correlation and Distortion Based Techniques. Angel M. Gomez, Belinda Schwerin, Kuldip K. Paliwal |
| 2011 | Off-Topic Detection in Automated Speech Assessment Applications. Jian Cheng, Jianqiang Shen |
| 2011 | On Building and Evaluating a Broadcast-News Audio Segmentation System. Taras Butko |
| 2011 | On Development of Consistently Punctuated Speech Corpora. Jáchym Kolár, Lori Lamel |
| 2011 | On Initial Seed Selection for Frequency Domain Blind Speech Separation. Dang Hai Tran Vu, Reinhold Haeb-Umbach |
| 2011 | On Mispronunciation Lexicon Generation Using Joint-Sequence Multigrams in Computer-Aided Pronunciation Training (CAPT). Xiaojun Qian, Helen M. Meng, Frank K. Soong |
| 2011 | On Noise Robust Voice Activity Detection. Tomas Dekens, Werner Verhelst |
| 2011 | On Noise Tracking for Noise Floor Estimation. Mahdi Triki |
| 2011 | On the Effectiveness of Statistical Modeling Based Template Matching Approach for Continuous Speech Recognition. Xie Sun, Xin Chen, Yunxin Zhao |
| 2011 | On the Estimation of Discount Parameters for Language Model Smoothing. Martin Sundermeyer, Ralf Schlüter, Hermann Ney |
| 2011 | On the Relationship Between Perceived Accentedness, Acoustic Similarity, and Processing Difficulty in Foreign-Accented Speech. Marijt J. Witteman, Andrea Weber, James M. McQueen |
| 2011 | On the Use of Extended Context for HMM-Based Spontaneous Conversational Speech Synthesis. Tomoki Koriyama, Takashi Nose, Takao Kobayashi |
| 2011 | On the Use of Lattices of Time-Synchronous Cross-Decoder Phone Co-Occurrences in a SVM-Phonotactic Language Recognition System. Amparo Varona, Mikel Peñagarikano, Luis Javier Rodríguez, Germán Bordel |
| 2011 | On the Use of Linguistic Features in an Automatic System for Speech Analytics of Telephone Conversations. Benjamin Maza, Marc El-Bèze, Georges Linarès, Renato De Mori |
| 2011 | On the Use of Multimodal Cues for the Prediction of Degrees of Involvement in Spontaneous Conversation. Catharine Oertel, Stefan Scherer, Nick Campbell |
| 2011 | On the Use of the Rhythmogram for Automatic Syllabic Prominence Detection. Bogdan Ludusan, Antonio Origlia, Francesco Cutugno |
| 2011 | On-Line Language Model Biasing for Multi-Pass Automatic Speech Recognition. Sankaranarayanan Ananthakrishnan, Stavros Tsakalidis, Rohit Prasad, Premkumar Natarajan |
| 2011 | One-to-Many Voice Conversion Based on Tensor Representation of Speaker Space. Daisuke Saito, Keisuke Yamamoto, Nobuaki Minematsu, Keikichi Hirose |
| 2011 | Online Pattern Learning for Non-Negative Convolutive Sparse Coding. Dong Wang, Ravichander Vipperla, Nicholas W. D. Evans |
| 2011 | Online Speaker Adaptation with Pre-Computed FMLLR Transformations. Volker Fischer, Siegfried Kunzmann |
| 2011 | Online Speech Activity Detection in Broadcast News. Chao Gao, Guruprasad Saikumar, Saurabh Khanwalkar, Avi Herscovici, Anoop Kumar, Amit Srivastava, Premkumar Natarajan |
| 2011 | Open Source Multi-Language Audio Database for Spoken Language Processing Applications. Stephen A. Zahorian, Jiang Wu, Montri Karnjanadecha, Chandra Sekhar Vootkuri, Brian Wong, Andrew Hwang, Eldar Tokhtamyshev |
| 2011 | Open Source Voice Creation Toolkit for the MARY TTS Platform. Marc Schröder, Marcela Charfuelan, Sathish Pammi, Ingmar Steiner |
| 2011 | Optimal Models of Prosodic Prominence Using the Bayesian Information Criterion. Tim Mahrt, Jui-Ting Huang, Yoonsook Mo, Margaret M. Fleck, Mark Hasegawa-Johnson, Jennifer Cole |
| 2011 | Optimal Selection of Limited Vocabulary Speech Corpora. Hui Lin, Jeff A. Bilmes |
| 2011 | Optimal Syllabic Rates and Processing Units in Perceiving Mandarin Spoken Sentences. Guangting Mai, Gang Peng |
| 2011 | Optimization of the Gaussian Mixture Model Evaluation on GPU. Jan Vanek, Jan Trmal, Josef V. Psutka, Josef Psutka |
| 2011 | Optimized Feature Extraction and HMMs in Subword Detectors. Alfonso M. Canterla, Magne Hallstein Johnsen |
| 2011 | Optimizing Situated Dialogue Management in Unknown Environments. Heriberto Cuayáhuitl, Nina Dethlefs |
| 2011 | PLDA-Based Clustering for Speaker Diarization of Broadcast Streams. Jan Silovský, Jan Prazak, Petr Cerva, Jindrich Zdánský, Jan Nouza |
| 2011 | Painless WFST Cascade Construction for LVCSR - Transducersaurus. Josef R. Novak, Nobuaki Minematsu, Keikichi Hirose |
| 2011 | Parallel and Hierarchical Decision Making for Sparse Coding in Speech Recognition. Dong Wang, Ravichander Vipperla, Nicholas W. D. Evans |
| 2011 | Parallels in Infants' Attention to Speech Articulation and to Physical Changes in Speech-Unrelated Objects. Eeva Klintfors, Ellen Marklund, Francisco Lacerda |
| 2011 | Parametrising Degree of Articulator Movement from Dynamic MRI Data. Zeynab Raeesy, Ladan Baghai-Ravary, John S. Coleman |
| 2011 | Partitioning of Two-Speaker Conversation Datasets. Carlos Vaquero, Alfonso Ortega, Eduardo Lleida |
| 2011 | Perception of Alcoholic Intoxication in Speech. Florian Schiel |
| 2011 | Perceptual Improvement of a Two-Stage Algorithm for Speech Dereverberation. Thiago de M. Prego, Amaro A. de Lima, Sergio L. Netto |
| 2011 | Perceptual Learning of Liquids. Odette Scharenborg, Holger Mitterer, James M. McQueen |
| 2011 | Perceptual Quality Dimensions of Text-to-Speech Systems. Florian Hinterleitner, Sebastian Möller, Christoph Norrenbrock, Ulrich Heute |
| 2011 | Perceptual Representation of Consonant Sounds in Thai. Charturong Tantibundhit, Chutamanee Onsuwan, Tanawan Saimai, Nantaporn Saimai, Sumonmas Thatphithakkul, Patcharika Chootrakool, Krit Kosawat, Nattanun Thatphithakkul |
| 2011 | Perceptual Sensitivity to Dialectal and Generational Variations in Vowels. Robert Allen Fox, Ewa Jacewicz |
| 2011 | Perceptual Sensitivity to Prenuclear and Nuclear Intonational Patterns. Tomás Dubeda |
| 2011 | Perceptual Training of Vowel Length Contrast of Japanese by L2 Listeners: Effects of an Isolated Word versus a Word Embedded in Sentences. Mee Sonu, Keiichi Tajima, Hiroaki Kato, Yoshinori Sagisaka |
| 2011 | Perceptually-Inspired Processing for Multichannel Wiener Filter. Jorge I. Marin-Hurtado, David V. Anderson |
| 2011 | Percy - An HTML5 Framework for Media Rich Web Experiments on Mobile Devices. Christoph Draxler |
| 2011 | Performance Prediction of Speech Recognition Using Average-Voice-Based Speech Synthesis. Tatsuhiko Saito, Takashi Nose, Takao Kobayashi, Yohei Okato, Akio Horii |
| 2011 | Personalizing Model M for Voice-Search. Geoffrey Zweig, Shuangyu Chang |
| 2011 | Phase-Only Speech Reconstruction Using Very Short Frames. Erfan Loweimi, Seyed Mohammad Ahadi, Hamid Sheikhzadeh |
| 2011 | Phone Impact Based Speech Transmission Technique for Reliable Speech Recognition in Poor Wireless Network Conditions. Azar Taufique, Kumaran Vijayasankar, Wooil Kim, John H. L. Hansen, Marco Tacca, Andrea Fumagalli |
| 2011 | Phoneme Level Non-Native Pronunciation Analysis by an Auditory Model-Based Native Assessment Scheme. Christos Koniaris, Olov Engwall |
| 2011 | Phoneme-Dependent NMF for Speech Enhancement in Monaural Mixtures. Bhiksha Raj, Rita Singh, Tuomas Virtanen |
| 2011 | Phoneme-Level Text to Audio Synchronization on Speech Signals with Background Music. Agnès Pedone, Juan José Burred, Simon Maller, Pierre Leveau |
| 2011 | Phonemic Similarity Metrics to Compare Pronunciation Methods. Ben Hixon, Eric Schneider, Susan L. Epstein |
| 2011 | Phonetic Classification Using Controlled Random Walks. Katrin Kirchhoff, Andrei Alexandrescu |
| 2011 | Phonological Knowledge Guided HMM State Mapping for Cross-Lingual Speaker Adaptation. Hui Liang, John Dines |
| 2011 | Phonotactic Constraints and the Segmentation of Cantonese Speech. Michael C. W. Yip |
| 2011 | Phrasal Prominences do not need Pitch Movements: Postfocal Phrasal Heads in Italian. Giuliano Bocci, Cinzia Avesani |
| 2011 | Phrases, Pitch and Perceived Prominence in Maori. Laura Thompson, Catherine Inez Watson, Ray Harlow, Jeanette King, Margaret Maclagan, Helen Charters, Peter Keegan |
| 2011 | Physical Models Producing Vowels with Pitch Variation. Takayuki Arai |
| 2011 | Places and Manner of Articulation of Bangla Consonants: A EPG Based Study. Shyamal Kr. Das Mandal, Somnath Chandra Vijay Kumar, Swaran Lata, Asoke Kumar Datta |
| 2011 | PodCastle: Recent Advances of a Spoken Document Retrieval Service Improved by Anonymous User Contributions. Masataka Goto, Jun Ogata |
| 2011 | Pointing Gestures do not Influence the Perception of Lexical Stress. Alexandra Jesse, Holger Mitterer |
| 2011 | Powered Wheelchair Control Using Acoustic-Based Recognition of Head Gesture Accompanying Speech. Akira Sasou |
| 2011 | Predicting Human Perceived Accuracy of ASR Systems. Taniya Mishra, Andrej Ljolje, Mazin Gilbert |
| 2011 | Predicting Speaker Changes and Listener Responses with and without Eye-Contact. Daniel Neiberg, Joakim Gustafson |
| 2011 | Predicting Taiwan Mandarin Tone Shapes from their Duration. Chierh Cheng, Michele Gubian |
| 2011 | Predicting Tongue Positions from Acoustics and Facial Features. Asterios Toutios, Slim Ouni |
| 2011 | Prediction of Binaural Intelligibility Level Differences in Reverberation. Jan Rennies, Thomas Brand, Birger Kollmeier |
| 2011 | Prediction of Voice Aperiodicity Based on Spectral Representations in HMM Speech Synthesis. Hanna Silén, Elina Helander, Moncef Gabbouj |
| 2011 | Privacy Preserving Speaker Verification Using Adapted GMMs. Manas A. Pathak, Bhiksha Raj |
| 2011 | Probabilistic Latent Semantic Analysis for Broadcast News Story Segmentation. Mimi Lu, Cheung-Chi Leung, Lei Xie, Bin Ma, Haizhou Li |
| 2011 | Probabilistic Spectrum Envelope: Categorized Audio-Features Representation for NMF-Based Sound Decomposition. Toru Nakashika, Tetsuya Takiguchi, Yasuo Ariki |
| 2011 | Problems Encountered by Japanese EL2 with English Short Vowels as Illustrated on a 3D Vowel Chart. Toshiko Isei-Jaakkola, Takatoshi Naka, Keikichi Hirose |
| 2011 | Processing of Stress Related Acoustic Cues as Indexed by ERPs. Ferenc Honbolygo, Valéria Csépe |
| 2011 | Production and Perception of Estonian Vowels by Native and Non-Native Speakers. Lya Meister, Einar Meister |
| 2011 | Progress and Prospects for Speech Technology: Results from Three Sexennial Surveys. Roger K. Moore |
| 2011 | Projectability of Transition-Relevance Places Using Prosodic Features in Japanese Spontaneous Conversation. Yuichi Ishimoto, Mika Enomoto, Hitoshi Iida |
| 2011 | Prominence Model for Prosodic Features in Automatic Lexical Stress and Pitch Accent Detection. Kun Li, Shuang Zhang, Mingxing Li, Wai Kit Lo, Helen M. Meng |
| 2011 | Prominence-Based Prosody Prediction for Unit Selection Speech Synthesis. Andreas Windmann, Igor Jauk, Fabio Tamburini, Petra Wagner |
| 2011 | Pronunciation Learning from Continuous Speech. Ibrahim Badr, Ian McGraw, James R. Glass |
| 2011 | Propagation of Uncertainty Through Multilayer Perceptrons for Robust Automatic Speech Recognition. Ramón Fernandez Astudillo, João Paulo da Silva Neto |
| 2011 | Prosodic Analysis and Perception of Mandarin Utterances Conveying Attitudes. Wentao Gu, Ting Zhang, Hiroya Fujisaki |
| 2011 | Prosodic Analysis of a Corpus of Tales. David Doukhan, Albert Rilliard, Sophie Rosset, Martine Adda-Decker, Christophe d'Alessandro |
| 2011 | Prosodic Correlates of Individual Physiological Response to Stress. Serguei V. S. Pakhomov, Michael E. Kotlyar |
| 2011 | Prosodic Highlights in Mandarin Continuous Speech - Cross-Genre Attributes and Implications. Chiu-yu Tseng, Zhao-yu Su, Chi-Feng Huang |
| 2011 | Prosodic Synchrony in Co-Operative Task-Based Dialogues: A Measure of Agreement and Disagreement. Brian Vaughan |
| 2011 | Prosodic and Phonetic Features for Speaker Clustering in Speaker Diarization Systems. Janez Zibert, France Mihelic |
| 2011 | Prosody Conversion for Emotional Mandarin Speech Synthesis Using the Tone Nucleus Model. Miaomiao Wen, Miaomiao Wang, Keikichi Hirose, Nobuaki Minematsu |
| 2011 | Prosody Toolkit: Integrating HTK, Praat and WEKA. S. Thomas Christie, Serguei V. S. Pakhomov |
| 2011 | Quality Aspects of Multimodal Dialog Systems: Identity, Stimulation and Success. Christine Kühnel, Benjamin Weiss, Matthias Schulz, Sebastian Möller |
| 2011 | Quality Assessment of Crowdsourcing Transcriptions for African Languages. Hadrien Gelas, Solomon Teferra Abate, Laurent Besacier, François Pellegrino |
| 2011 | Quality Improvement of Voice Conversion Systems Based on Trellis Structured Vector Quantization. Mahdi Eslami, Hamid Sheikhzadeh, Abolghasem Sayadiyan |
| 2011 | Quantifying Articulatory Distinctiveness of Vowels. Jun Wang, Jordan R. Green, Ashok Samal, David Marx |
| 2011 | Quantitative Analysis of Tone Coarticulation in Mandarin. Hussein Hussein, Hansjörg Mixdorff, Hue San Do, Rüdiger Hoffmann |
| 2011 | RANSAC-Based Training Data Selection for Speaker State Recognition. Elif Bozkurt, Engin Erzin, Çigdem Eroglu Erdem, A. Tanju Erdem |
| 2011 | ROVER Enhancement with Automatic Error Detection. Kacem Abida, Fakhri Karray |
| 2011 | Range Based Multi Microphone Array Fusion for Speaker Activity Detection in Small Meetings. Jani Even, Panikos Heracleous, Carlos Toshinori Ishi, Norihiro Hagita |
| 2011 | Rapid Adaptation of Foreign-Accented HMM-Based Speech Synthesis. Reima Karhila, Mirjam Wester |
| 2011 | Rapid Building of an ASR System for Under-Resourced Languages Based on Multilingual Unsupervised Training. Ngoc Thang Vu, Franziska Kraus, Tanja Schultz |
| 2011 | Rapid Evaluation of Speech Representations for Spoken Term Discovery. Michael A. Carlin, Samuel Thomas, Aren Jansen, Hynek Hermansky |
| 2011 | Rapid Training of Acoustic Models Using Graphics Processing Unit. Senaka Buthpitiya, Ian R. Lane, Jike Chong |
| 2011 | Reaction Time and Decision Difficulty in the Perception of Intonation. Katrin Schneider, Grzegorz Dogil, Bernd Möbius |
| 2011 | Real User Evaluation of Spoken Dialogue Systems Using Amazon Mechanical Turk. Filip Jurcícek, Simon Keizer, Milica Gasic, François Mairesse, Blaise Thomson, Kai Yu, Steve J. Young |
| 2011 | Real-Life Emotion Detection from Speech in Human-Robot Interaction: Experiments Across Diverse Corpora with Child and Adult Voices. Marie Tahon, Agnès Delaborde, Laurence Devillers |
| 2011 | Real-Time Prototype for Integration of Blind Source Extraction and Robust Automatic Speech Recognition. Francesco Nesta, Marco Matassoni, Hari Krishna Maganti |
| 2011 | Real-World Speech/Non-Speech Audio Classification Based on Sparse Representation Features and GPCs. Ziqiang Shi, Jiqing Han, Tieran Zheng |
| 2011 | Recognition and Real Time Performances of a Lightweight Ultrasound Based Silent Speech Interface Employing a Language Model. Jun Cai, Bruce Denby, Pierre Roussel-Ragot, Gérard Dreyfus, Lise Crevier-Buchman |
| 2011 | Recognition of Personality Traits from Human Spoken Conversations. Alexei V. Ivanov, Giuseppe Riccardi, Adam J. Sporka, Jakub Franc |
| 2011 | Recording Caregiver Interactions for Machine Acquisition of Spoken Language Using the KLAIR Virtual Infant. Mark A. Huckvale |
| 2011 | Recurrent Neural Network Based Language Modeling in Meeting Recognition. Stefan Kombrink, Tomás Mikolov, Martin Karafiát, Lukás Burget |
| 2011 | Reducing Computational Complexities of Exemplar-Based Sparse Representations with Applications to Large Vocabulary Speech Recognition. Tara N. Sainath, Bhuvana Ramabhadran, David Nahamoo, Dimitri Kanevsky |
| 2011 | Reduction of Highly Nonstationary Ambient Noise by Integrating Spectral and Locational Characteristics of Speech and Noise for Robust ASR. Tomohiro Nakatani, Shoko Araki, Marc Delcroix, Takuya Yoshioka, Masakiyo Fujimoto |
| 2011 | Redundancy Reduction in ASR of Spontaneous Speech Through Statistical Machine Translation. Daniele Falavigna |
| 2011 | Reformulating Prosodic Break Model into Segmental HMMs and Information Fusion. Nicolas Obin, Pierre Lanchantin, Anne Lacheret, Xavier Rodet |
| 2011 | Region Dependent Transform on MLP Features for Speech Recognition. Tim Ng, Bing Zhang, Spyridon Matsoukas, Long Nguyen |
| 2011 | Regularized Logistic Regression Fusion for Speaker Verification. Ville Hautamäki, Kong-Aik Lee, Tomi Kinnunen, Bin Ma, Haizhou Li |
| 2011 | Reinforcement Learning of Argumentation Dialogue Policies in Negotiation. Kallirroi Georgila, David R. Traum |
| 2011 | Relationships Between Phonetic Features and Speech Perception - A Statistical Investigation from a Large Anechoic British English Corpus. Ian R. Cushing, Francis F. Li, Ken Worrall, Tim D. Jackson |
| 2011 | Reliability-Weighted Acoustic Model Adaptation Using Crowd Sourced Transcriptions. Kartik Audhkhasi, Panayiotis G. Georgiou, Shrikanth S. Narayanan |
| 2011 | Report on Performance Results in the NIST 2010 Speaker Recognition Evaluation. Craig S. Greenberg, Alvin F. Martin, Bradford Barr, George R. Doddington |
| 2011 | Representing Phonological Features Through a Two-Level Finite State Model. Javier Mikel Olaso, M. Inés Torres, Raquel Justo |
| 2011 | Response Probability Based Decoding Algorithm for Large Vocabulary Continuous Speech Recognition. Zhanlei Yang, Hao Chao, Wenju Liu |
| 2011 | Restoring the Residual Speaker Information in Total Variability Modeling for Speaker Verification. Ce Zhang, Rong Zheng, Bo Xu |
| 2011 | Rhythm Metrics on Syllables and Feet do not Work as Expected. Paolo Mairano, Antonio Romano |
| 2011 | Robust Audio Fingerprinting Based on Local Spectral Luminance Maxima Scheme. Yongzhe Shi, Weiqiang Zhang, Jia Liu |
| 2011 | Robust Bimodal Person Identification Using Face and Speech with Limited Training Data and Corruption of Both Modalities. Niall McLaughlin, Ji Ming, Danny Crookes |
| 2011 | Robust HNR-Based Closed-Loop Pitch and Harmonic Parameters Estimation. Alexander Pavlovets, Alexander A. Petrovsky |
| 2011 | Robust Intonation Pattern Classification in Human Robot Interaction. Martin Heckmann, Kazuhiro Nakadai, Hirofumi Nakajima |
| 2011 | Robust Speaker Recognition in Non-Stationary Room Environments Based on Empirical Mode Decomposition. Taufiq Hasan, John H. L. Hansen |
| 2011 | Robust Speech Translation by Domain Adaptation. Xiaodong He, Li Deng |
| 2011 | Robust Voice Activity Detector for Real World Applications Using Harmonicity and Modulation Frequency. Ekapol Chuangsuwanich, James R. Glass |
| 2011 | Segregation of Whispered Speech Interleaved with Noise or Speech Maskers. Nandini Iyer, Douglas Brungart, Brian D. Simpson |
| 2011 | Semantic Graph Clustering for POMDP-Based Spoken Dialog Systems. Florian Pinault, Fabrice Lefèvre |
| 2011 | Semi-Automated Classifier Adaptation for Natural Language Call Routing. Silke M. Witt |
| 2011 | Semi-Automatic Acoustic Model Generation from Large Unsynchronized Audio and Text Chunks. Michele Alessandrini, Giorgio Biagetti, Alessandro Curzi, Claudio Turchetti |
| 2011 | Semi-Supervised Single-Channel Speech-Music Separation for Automatic Speech Recognition. Cemil Demir, A. Taylan Cemgil, Murat Saraclar |
| 2011 | Semi-Supervised Tree Support Vector Machine for Online Cough Recognition. Huynh Thai Hoa, An Vu Tran, Tran Huy Dat |
| 2011 | Sentence Selection by Direct Likelihood Maximization for Language Model Adaptation. Takahiro Shinozaki, Yu Kubota, Sadaoki Furui, Eiji Utsunomiya, Yasutaka Shindoh |
| 2011 | Separating Speaker and Environmental Variability Using Factored Transforms. Michael L. Seltzer, Alex Acero |
| 2011 | Sequential Classification Criteria for NNs in Automatic Speech Recognition. Guangsen Wang, Khe Chai Sim |
| 2011 | Shrinkage-Based Features for Natural Language Call Routing. Ruhi Sarikaya, Stanley F. Chen, Bhuvana Ramabhadran |
| 2011 | Signals and Speech. Alex Pentland |
| 2011 | Similar Vowels in L1/L2 Production: Confused or Discerned in Early L2 English Learners with Different Amount of Exposure. E.-Chin Wu |
| 2011 | Similarity Language Model. Christian Gillot, Christophe Cerisara |
| 2011 | Simulating Post-L F0 Bouncing by Modeling Articulatory Dynamics. Santitham Prom-on, Yi Xu, Fang Liu |
| 2011 | Sinewave Representations of Nonmodality. Nicolas Malyska, Thomas F. Quatieri, Robert B. Dunn |
| 2011 | Singing Voice Analysis Using Relative Harmonic Delays. Ricardo Teixeira Sousa, Aníbal J. S. Ferreira |
| 2011 | Singing Voice Synthesis: Singer-Dependent Vibrato Modeling and Coherent Processing of Spectral Envelope. Siu Wa Lee, Minghui Dong |
| 2011 | Single Channel Dereverberation Using Example-Based Speech Enhancement with Uncertainty Decoding Technique. Keisuke Kinoshita, Mehrez Souden, Marc Delcroix, Tomohiro Nakatani |
| 2011 | Single Channel Speech Enhancement Using MMSE Estimation of Short-Time Modulation Magnitude Spectrum. Kuldip K. Paliwal, Belinda Schwerin, Kamil K. Wójcicki |
| 2011 | Single Channel Speech Music Separation Using Nonnegative Matrix Factorization with Sliding Windows and Spectral Masks. Emad M. Grais, Hakan Erdogan |
| 2011 | Single-Channel Head Orientation Estimation Based on Discrimination of Acoustic Transfer Function. Ryoichi Takashima, Tetsuya Takiguchi, Yasuo Ariki |
| 2011 | Sinusoidal Approach for the Single-Channel Speech Separation and Recognition Challenge. Pejman Mowlaee, Rahim Saeidi, Zheng-Hua Tan, Mads Græsbøll Christensen, Tomi Kinnunen, Pasi Fränti, Søren Holdt Jensen |
| 2011 | Skew Gaussian Mixture Models for Speaker Recognition. Avi Matza |
| 2011 | Spatial Filter Calibration Based on Minimization of Modified LSD. Nobuaki Tanaka, Tetsuji Ogawa, Tetsunori Kobayashi |
| 2011 | Speak4it and the Multimodal Semantic Interpretation System. Michael Johnston, Patrick Ehlen |
| 2011 | Speaker Clustering Based on Non-Negative Matrix Factorization. Masafumi Nishida, Seiichi Yamamoto |
| 2011 | Speaker Clustering Based on Utterance-Oriented Dirichlet Process Mixture Model. Naohiro Tawara, Shinji Watanabe, Tetsuji Ogawa, Tetsunori Kobayashi |
| 2011 | Speaker Diarization Using a priori Acoustic Information. Hagai Aronowitz |
| 2011 | Speaker Identification for Whispered Speech Using a Training Feature Transformation from Neutral to Whisper. Xing Fan, John H. L. Hansen |
| 2011 | Speaker Modeling Using Local Binary Decisions. Jean-François Bonastre, Xavier Anguera Miró, Gabriel Hernández Sierra, Pierre-Michel Bousquet |
| 2011 | Speaker Recognition Using Temporal Contours in Linguistic Units: The Case of Formant and Formant-Bandwidth Trajectories. Joaquin Gonzalez-Rodriguez |
| 2011 | Speaker Role Recognition Using Question Detection and Characterization. Thierry Bazillon, Benjamin Maza, Mickael Rouvier, Frédéric Béchet, Alexis Nasr |
| 2011 | Speaker State Classification Based on Fusion of Asymmetric SIMPLS and Support Vector Machines. Dong-Yan Huang, Shuzhi Sam Ge, Zhengchen Zhang |
| 2011 | Speaker Verification Robust to Talking Style Variation Using Multiple Kernel Learning Based on Conditional Entropy Minimization. Tetsuji Ogawa, Hideitsu Hino, Noboru Murata, Tetsunori Kobayashi |
| 2011 | Speaker Verification Using Sparse Representations on Total Variability i-vectors. Ming Li, Xiang Zhang, Yonghong Yan, Shrikanth S. Narayanan |
| 2011 | Speaker-Adaptive Speech Synthesis Based on Eigenvoice Conversion and Language-Dependent Prosodic Conversion in Speech-to-Speech Translation. Nobuhiko Hattori, Tomoki Toda, Hisashi Kawai, Hiroshi Saruwatari, Kiyohiro Shikano |
| 2011 | Speaking More Like You: Entrainment in Conversational Speech. Julia Hirschberg |
| 2011 | Speaking to the Crowd: Looking at Past Achievements in Using Crowdsourcing for Speech and Predicting Future Challenges. Gabriel Parent, Maxine Eskénazi |
| 2011 | Spectral Envelope Transformation Using DFW and Amplitude Scaling for Voice Conversion with Parallel or Nonparallel Corpora. Elizabeth Godoy, Olivier Rosec, Thierry Chonavel |
| 2011 | Spectral Features for Automatic Blind Intelligibility Estimation of Spastic Dysarthric Speech. Richard Hummel, Wai-Yip Chan, Tiago H. Falk |
| 2011 | Speech Enhancement Using Masking Properties in Adverse Environments. Atanu Saha, Tetsuya Shimamura |
| 2011 | Speech Enhancement by Reconstruction from Cleaned Acoustic Features. Philip Harding, Ben Milner |
| 2011 | Speech Events are Recoverable from Unlabeled Articulatory Data: Using an Unsupervised Clustering Approach on Data Obtained from Electromagnetic Midsaggital Articulography (EMA). Daniel Duran, Jagoda Bruni, Grzegorz Dogil, Hinrich Schütze |
| 2011 | Speech Indexing Using Semantic Context Inference. Chien-Lin Huang, Bin Ma, Haizhou Li, Chung-Hsien Wu |
| 2011 | Speech Modulation Features for Robust Nonnative Speech Accent Detection. Sethserey Sam, Xiong Xiao, Laurent Besacier, Eric Castelli, Haizhou Li, Chng Eng Siong |
| 2011 | Speech Processing Tools - An Introduction to Interoperability. Christoph Draxler, Toomas Altosaar, Sadaoki Furui, Mark Y. Liberman, Peter Wittenburg |
| 2011 | Speech Recognition in Mixed Sound of Speech and Music Based on Vector Quantization and Non-Negative Matrix Factorization. Shoichi Nakano, Kazumasa Yamamoto, Seiichi Nakagawa |
| 2011 | Speech Synthesis Based on Articulatory-Movement HMMs with Voice-Source Codebooks. Tsuneo Nitta, Takayuki Onoda, Masashi Kimura, Yurie Iribe, Kouichi Katsurada |
| 2011 | Speech Synthesis Parameter Generation for the Assistive Silent Speech Interface MVOCA. Robin Hofe, Stephen R. Ell, Michael J. Fagan, James M. Gilbert, Phil D. Green, Roger K. Moore, Sergey I. Rybchenko |
| 2011 | Speech Technology in (Re)Habilitation of Persons with Communication Disabilities. Björn Granström |
| 2011 | Speech Timing Organization for the Phonological Length Contrast in Italian Consonants. Claudio Zmarich, Barbara Gili Fivela, Pascal Perrier, Christophe Savariaux, Graziano Tisato |
| 2011 | Speech Transcript Evaluation for Information Retrieval. Laurens van der Werff, Wessel Kraaij, Franciska de Jong |
| 2011 | Speech Translation with Grammar Driven Probabilistic Phrasal Bilexica Extraction. Markus Saers, Dekai Wu, Chi-kiu Lo, Karteek Addanki |
| 2011 | Speech-Based Non-Prototypical Affect Recognition for Child-Robot Interaction in Reverberated Environments. Martin Wöllmer, Felix Weninger, Stefan Steidl, Anton Batliner, Björn W. Schuller |
| 2011 | SpeechForms: From Web to Speech and Back. Luciano Barbosa, Diamantino Caseiro, Giuseppe Di Fabbrizio |
| 2011 | Spoken Document Confidence Estimation Using Contextual Coherence. Taichi Asami, Narichika Nomoto, Satoshi Kobashikawa, Yoshikazu Yamaguchi, Hirokazu Masataki, Satoshi Takahashi |
| 2011 | Spoken Language Recognition in the Latent Topic Simplex. Kong-Aik Lee, Chang Huai You, Ville Hautamäki, Anthony Larcher, Haizhou Li |
| 2011 | Spoken Lecture Summarization by Random Walk over a Graph Constructed with Automatically Extracted Key Terms. Yun-Nung Chen, Yu Huang, Ching-Feng Yeh, Lin-Shan Lee |
| 2011 | Spoken Term Detection Results Using Plural Subword Models by Estimating Detection Performance for Each Query. Yoshiaki Itoh, Kohei Iwata, Masaaki Ishigame, Kazuyo Tanaka, Shi-wook Lee |
| 2011 | State-Level Data Borrowing for Low-Resource Speech Recognition Based on Subspace GMMs. Yanmin Qian, Daniel Povey, Jia Liu |
| 2011 | Statistical Mapping Between Articulatory and Acoustic Data for an Ultrasound-Based Silent Speech Interface. Thomas Hueber, Elie-Laurent Benaroya, Bruce Denby, Gérard Chollet |
| 2011 | Stop Consonant Recognition by Temporal Fine Structure of Burst. Seppo Fagerlund, Unto K. Laine |
| 2011 | Structural Joint Factor Analysis for Speaker Recognition. Marc Ferras, Koichi Shinoda, Sadaoki Furui |
| 2011 | Structured Support Vector Machines for Noise Robust Continuous Speech Recognition. Shi-Xiong Zhang, Mark J. F. Gales |
| 2011 | Study of Overlapped Speech Detection for NIST SRE Summed Channel Speaker Recognition. Hanwu Sun, Bin Ma |
| 2011 | Study on the Relevance Factor of Maximum a Posteriori with GMM for Language Recognition. Chang Huai You, Haizhou Li, Kong-Aik Lee |
| 2011 | Stylization and Trajectory Modelling of Short and Long Term Speech Prosody Variations. Nicolas Obin, Anne Lacheret, Xavier Rodet |
| 2011 | Sub-Band Level Histogram Equalization for Robust Speech Recognition. Vikas Joshi, Raghavendra Bilgi, Srinivasan Umesh, Luz García, M. Carmen Benítez |
| 2011 | Subjective and Objective Evaluation of Speech Intelligibility Enhancement Under Constant Energy and Duration Constraints. Yan Tang, Martin Cooke |
| 2011 | Super-Dirichlet Mixture Models Using Differential Line Spectral Frequencies for Text-Independent Speaker Identification. Zhanyu Ma, Arne Leijon |
| 2011 | Supervised Sparse Coding Strategy in Cochlear Implants. Jinqiu Sang, Guoping Li, Hongmei Hu, Mark E. Lutman, Stefan Bleeck |
| 2011 | Syllable Segmentation of Continuous Speech Using Auditory Attention Cues. Ozlem Kalinli |
| 2011 | Sylli: Automatic Phonological Syllabification for Italian. Luca Iacoponi, Renata Savy |
| 2011 | Symbolic and Direct Sequential Modeling of Prosody for Classification of Speaking-Style and Nativeness. Andrew Rosenberg |
| 2011 | Synchronous Reading: Learning French Orthography by Audiovisual Training. Gérard Bailly, Will Barbour |
| 2011 | Synthesis of Breathy, Normal, and Pressed Phonation Using a Two-Mass Model with a Triangular Glottis. Peter Birkholz, Bernd J. Kröger, Christiane Neuschaefer-Rube |
| 2011 | TSAB - Web Interface for Transcribed Speech Collections. Tanel Alumäe, Ahti Kitsik |
| 2011 | Tackling a Shilly-Shally Classifier for Predicting Task Success in Spoken Dialogue Interaction. Alexander Schmitt, Alexander Zgorzelski, Wolfgang Minker |
| 2011 | Target-Aware Lattice Rescoring for Dialect Recognition. Rong Tong, Bin Ma, Haizhou Li, Chng Eng Siong |
| 2011 | Template-Based Automatic Speech Recognition Meets Prosody. Dino Seppi, Kris Demuynck, Dirk Van Compernolle |
| 2011 | Temporal Performance of Dysarthric Patients in Speech and Tapping Tasks. Eiji Shimura, Kazuhiko Kakehi |
| 2011 | Temporal Relationship Between Auditory and Visual Prosodic Cues. Erin Cvejic, Jeesun Kim, Chris Davis |
| 2011 | Text Driven 3D Photo-Realistic Talking Head. Lijuan Wang, Wei Han, Frank K. Soong, Qiang Huo |
| 2011 | The "Fortis-Lenis" Distinction in Bulgarian and German. Bistra Andreeva, Magdalena Wolska |
| 2011 | The Albayzin 2010 Language Recognition Evaluation. Luis Javier Rodríguez, Mikel Peñagarikano, Amparo Varona, Mireia Díez, Germán Bordel |
| 2011 | The Detection of Overlapping Speech with Prosodic Features for Speaker Diarization. Martin Zelenák, Javier Hernando |
| 2011 | The Effect of Seeing the Interlocutor on Speech Production in Different Noise Types. Michael Fitzpatrick, Jeesun Kim, Chris Davis |
| 2011 | The Effect of Using Normalized Models in Statistical Speech Synthesis. Matt Shannon, Heiga Zen, William J. Byrne |
| 2011 | The Effects of Phoneme Errors in Speaker Adaptation for HMM Speech Synthesis. Bálint Tóth, Tibor Fegyó, Géza Németh |
| 2011 | The Efficiency of Cross-Dialectal Word Recognition. Annelie Tuinman, Holger Mitterer, Anne Cutler |
| 2011 | The INTERSPEECH 2011 Speaker State Challenge. Björn W. Schuller, Stefan Steidl, Anton Batliner, Florian Schiel, Jarek Krajewski |
| 2011 | The JSafran Platform for Semi-Automatic Speech Processing. Christophe Cerisara, Claire Gardent |
| 2011 | The KLAIR Toolkit for Recording Interactive Dialogues with a Virtual Infant. Mark A. Huckvale |
| 2011 | The Lombard Effect in Spontaneous Dialog Speech. Laura Folk, Florian Schiel |
| 2011 | The Multi Timescale Phoneme Acquisition Model of the Self-Organizing Based on the Dynamic Features. Kouki Miyazawa, Hideaki Miura, Hideaki Kikuchi, Reiko Mazuka |
| 2011 | The Open Front Vowel /æ/ in the Production and Perception of Czech Students of English. Pavel Sturm, Radek Skarnitzl |
| 2011 | The Perception Boundary Between Single and Geminate Stops in 3- and 4-Mora Japanese Words. Shigeaki Amano, Yukari Hirata |
| 2011 | The Phonology and Phonetics of Perceived Prosody: What do Listeners Imitate? Jennifer Cole, Stefanie Shattuck-Hufnagel |
| 2011 | The Relation Between Perception and Production in L2 Phonological Processing. Sharon Peperkamp, Camillia Bouchon |
| 2011 | The Representation of Speech in a Nonlinear Auditory Model: Time-Domain Analysis of Simulated Auditory-Nerve Firing Patterns. Guy J. Brown, Tim Jürgens, Ray Meddis, Matthew Robertson, Nicholas R. Clark |
| 2011 | The Role of Variability in Non-Native Perceptual Learning of a Japanese Geminate-Singleton Fricative Contrast. Makiko Sadakata, James M. McQueen |
| 2011 | The Role of Word-Initial Glottal Stops in Recognizing English Words. Maria Paola Bissiri, María Luisa García Lecumberri, Martin Cooke, Jan Volín |
| 2011 | The Social Signal Interpretation Framework (SSI) for Real Time Signal Processing and Recognition. Johannes Wagner, Florian Lingenfelser, Elisabeth André |
| 2011 | The Time-Course of Talker-Specificity Effects for Newly-Learned Pseudowords: Evidence for a Hybrid Model of Lexical Representation. Helen Brown, M. Gareth Gaskell |
| 2011 | The USC CARE Corpus: Child-Psychologist Interactions of Children with Autism Spectrum Disorders. Matthew Black, Daniel Bone, Marian E. Williams, Phillip Gorrindo, Pat Levitt, Shrikanth S. Narayanan |
| 2011 | The Vocal Effort of Dominance in Scenario Meetings. Marcela Charfuelan, Marc Schröder |
| 2011 | Theoretical Analysis of Musical Noise and Speech Distortion in Structure-Generalized Parametric Blind Spatial Subtraction Array. Ryoichi Miyazaki, Hiroshi Saruwatari, Kiyohiro Shikano |
| 2011 | Thresholding Word Activations for Response Scoring - Modelling Psycholinguistic Data. Christina Bergmann, Louis ten Bosch, Lou Boves |
| 2011 | Time- and Acoustic-Mediated Alignment Algorithms for Speech Recognition Evaluation. Simon Dobrisek, France Mihelic |
| 2011 | Time-Varying Signal Adaptive Transform and IHT Recovery of Compressive Sensed Speech. Ch. Srikanth Raj, Thippur V. Sreenivas |
| 2011 | Timing in Italian VNC Sequences at Different Speech Rates. Chiara Celata, Silvia Calamai |
| 2011 | To Weight or Not to Weight: Source-Normalised LDA for Speaker Recognition Using i-vectors. Mitchell McLaren, David A. van Leeuwen |
| 2011 | Tonal Alignment Defined: The Case of Southern Irish English. Raya Kalaldeh |
| 2011 | Tonal Variations in Mandarin: New Evidence from Spontaneous and Read Speech. Li-chiung Yang |
| 2011 | Tongue Gestures Awareness and Pronunciation Training. Slim Ouni |
| 2011 | Topic Identification from Audio Recordings Using Rich Recognition Results and Neural Network Based Classifiers. Roberto Gemello, Franco Mana, Pier Domenico Batzu |
| 2011 | Topic Segmentation of TV-Streams by Mathematical Morphology and Vectorization. Vincent Claveau, Sébastien Lefèvre |
| 2011 | Topic Switching Strategies for Spoken Dialogue Systems. Tobias Heinroth, Savina Koleva, Wolfgang Minker |
| 2011 | Toward a Continuous Modeling of French Prosodic Structure: Using Acoustic Features to Predict Prominence Location and Prominence Degree. Mathieu Avanzi, Nicolas Obin, Anne Lacheret-Dujour, Bernard Victorri |
| 2011 | Toward a Multi-Speaker Visual Articulatory Feedback System. Atef Ben Youssef, Thomas Hueber, Pierre Badin, Gérard Bailly |
| 2011 | Towards Context-Dependent Phonetic Spelling Error Correction in Children's Freely Composed Text for Diagnostic and Pedagogical Purposes. Sebastian Stüker, Johanna Fay, Kay Berkling |
| 2011 | Towards Fully Bayesian Speaker Recognition: Integrating Out the Between-Speaker Covariance. Jesús Antonio Villalba López, Niko Brümmer |
| 2011 | Towards Goat Detection in Text-Dependent Speaker Verification. Orith Toledo-Ronen, Hagai Aronowitz, Ron Hoory, Jason W. Pelecanos, David Nahamoo |
| 2011 | Towards High Performance LVCSR in Speech-to-Speech Translation System on Smart Phones. Jian Xue, Xiaodong Cui, Gregg Daggett, Etienne Marcheret, Bowen Zhou |
| 2011 | Towards Unsupervised Spoken Language Understanding: Exploiting Query Click Logs for Slot Filling. Gökhan Tür, Dilek Hakkani-Tür, Dustin Hillard, Asli Celikyilmaz |
| 2011 | Towards Unsupervised Training of Speaker Independent Acoustic Models. Aren Jansen, Kenneth Church |
| 2011 | Towards Voice-Input Symbolic Pattern Retrieval Using Parameter-Based Search. Yukiko Suzuki, Kiyoaki Aikawa |
| 2011 | Towards a Versatile Multi-Layered Description of Speech Corpora Using Algebraic Relations. Nelly Barbot, Vincent Barreaud, Olivier Boëffard, Laure Charonnat, Arnaud Delhay, Sébastien Le Maguer, Damien Lolive |
| 2011 | Tracking Pitch Contours Using Minimum Jerk Trajectories. Daniel Neiberg, Gopal Ananthakrishnan, Joakim Gustafson |
| 2011 | Training a Language Model Using Webdata for Large Vocabulary Japanese Spontaneous Speech Recognition. Ryo Masumura, Seongjun Hahm, Akinori Ito |
| 2011 | Tree Encoding for the ITU-T G.711.1 Speech Coder. Abdul Hannan Khan, Peter Kabal |
| 2011 | Tue-SeA Real-Time Speech Command Detector for a Smart Control Room. Daniel Reich, Felix Putze, Dominic Heger, Joris IJsselmuiden, Rainer Stiefelhagen, Tanja Schultz |
| 2011 | Unary Data Structures for Language Models. Jeffrey Sorensen, Cyril Allauzen |
| 2011 | Uncertainty Management for On-Line Optimisation of a POMDP-Based Large-Scale Spoken Dialogue System. Lucie Daubigney, Milica Gasic, Senthilkumar Chandramohan, Matthieu Geist, Olivier Pietquin, Steve J. Young |
| 2011 | Uncertainty Measures for Improving Exemplar-Based Source Separation. Heikki Kallasjoki, Ulpu Remes, Jort F. Gemmeke, Tuomas Virtanen, Kalle J. Palomäki |
| 2011 | Uncovering the Effect of Imitation on Tonal Patterns of French Accentual Phrases. Amandine Michelas, Noël Nguyen |
| 2011 | Underdetermined Blind Source Separation with Fuzzy Clustering for Arbitrarily Arranged Sensors. Ingrid Jafari, Serajul Haque, Roberto Togneri, Sven Nordholm |
| 2011 | Uniform Speech Parameterization for Multi-Form Segment Synthesis. Alexander Sorin, Slava Shechtman, Vincent Pollet |
| 2011 | University of Ljubljana System for Interspeech 2011 Speaker State Challenge. Rok Gajsek, Simon Dobrisek, France Mihelic |
| 2011 | Unsupervised Arabic Dialect Adaptation with Self-Training. Scott Novotney, Richard M. Schwartz, Sanjeev Khudanpur |
| 2011 | Unsupervised Audio Analysis for Categorizing Heterogeneous Consumer Domain Videos. Pradeep Natarajan, Stavros Tsakalidis, Vasant Manohar, Rohit Prasad, Premkumar Natarajan |
| 2011 | Unsupervised Audio Patterns Discovery Using HMM-Based Self-Organized Units. Man-Hung Siu, Herbert Gish, Steve Lowe, Arthur Chan |
| 2011 | Unsupervised Clustering of Utterances Using Non-Parametric Bayesian Methods. Ryuichiro Higashinaka, Noriaki Kawamae, Kugatsu Sadamitsu, Yasuhiro Minami, Toyomi Meguro, Kohji Dohsaka, Hirohito Inagaki |
| 2011 | Unsupervised Continuous-Valued Word Features for Phrase-Break Prediction without a Part-of-Speech Tagger. Oliver Watts, Junichi Yamagishi, Simon King |
| 2011 | Unsupervised Features from Text for Speech Synthesis in a Speech-to-Speech Translation System. Oliver Watts, Bowen Zhou |
| 2011 | Unsupervised Geometry Calibration of Acoustic Sensor Networks Using Source Correspondences. Joerg Schmalenstroeer, Florian Jacob, Reinhold Haeb-Umbach, Marius H. Hennecke, Gernot A. Fink |
| 2011 | Unsupervised Hidden Markov Modeling of Spoken Queries for Spoken Term Detection without Speech Recognition. Chun-an Chan, Lin-Shan Lee |
| 2011 | Unsupervised Latent Speaker Language Modeling. Yik-Cheung Tam, Paul Vozila |
| 2011 | Unsupervised Learning of Acoustic Events Using Dynamic Time Warping and Hierarchical K-Means++ Clustering. Joerg Schmalenstroeer, Markus Bartek, Reinhold Haeb-Umbach |
| 2011 | Unsupervised Learning of Acoustic Unit Descriptors for Audio Content Representation and Classification. Sourish Chaudhuri, Mark Harvilla, Bhiksha Raj |
| 2011 | Unsupervised Testing Strategies for ASR. Brian Strope, Doug Beeferman, Alexander Gruenstein, Xin Lei |
| 2011 | Use of the Harmonic Phase in Speaker Recognition. Inma Hernáez, Ibon Saratxaga, Jon Sánchez, Eva Navas, Iker Luengo |
| 2011 | User Simulation in Dialogue Systems Using Inverse Reinforcement Learning. Senthilkumar Chandramohan, Matthieu Geist, Fabrice Lefèvre, Olivier Pietquin |
| 2011 | User Study of Spoken Decision Support System. Teruhisa Misu, Kiyonori Ohtake, Chiori Hori, Hisashi Kawai, Satoshi Nakamura |
| 2011 | Using Crowdsourcing to Provide Prosodic Annotations for Non-Native Speech. Keelan Evanini, Klaus Zechner |
| 2011 | Using Dynamic Time Warping to Compute Prosodic Similarity Measures. Albert Rilliard, Alexandre Allauzen, Philippe Boula de Mareüil |
| 2011 | Using Features from Topic Models to Alleviate Over-Generation in Hierarchical Phrase-Based Translation. Songfang Huang, Bowen Zhou |
| 2011 | Using Human Perception for Automatic Accent Assessment. Freddy William, Abhijeet Sangwan, John H. L. Hansen |
| 2011 | Using Imitation to Learn Infant-Adult Acoustic Mappings. Gopal Ananthakrishnan, Giampiero Salvi |
| 2011 | Using Latent Topic Features for Named Entity Extraction in Search Queries. Joe Polifroni, François Mairesse |
| 2011 | Using Multiple Databases for Training in Emotion Recognition: To Unite or to Vote? Björn W. Schuller, Zixing Zhang, Felix Weninger, Gerhard Rigoll |
| 2011 | Using Mutual Information to Identify Regions of Analysis for Prosodic Analysis. Andrew Rosenberg |
| 2011 | Using Prominence Detection to Generate Acoustic Feedback in Tutoring Scenarios. Lars Schillingmann, Petra Wagner, Christian Munier, Britta Wrede, Katharina J. Rohlfing |
| 2011 | Using Prosodic and Spectral Features in Detecting Depression in Elderly Males. Michelle Hewlett Sanchez, Dimitra Vergyri, Luciana Ferrer, Colleen Richey, Pablo Garcia, Bruce Knoth, William Jarrold |
| 2011 | Using Speaker ID to Discover Repeat Callers of a Spoken Dialog System. Andrew Fandrianto, Brian Langner, Alan W. Black |
| 2011 | Using Spectral Fluctuation of Speech in Multi-Feature HMM-Based Voice Activity Detection. Miquel Espi, Shigeki Miyabe, Takuya Nishimoto, Nobutaka Ono, Shigeki Sagayama |
| 2011 | Using Unsupervised Feature-Based Speaker Adaptation for Improved Transcription of Spoken Archives. Petr Cerva, Karel Palecek, Jan Silovský, Jan Nouza |
| 2011 | Using a Genetic Algorithm to Estimate Parameters of a Coarticulation Model. Brian O. Bush, John-Paul Hosom, Alexander Kain, Akiko Amano-Kusumoto |
| 2011 | Utterance Verification for Automating the Hearing in Noise Test (HINT). H. Timothy Bunnell, Jason Lilley, Sigfrid D. Soli, Ivan Pal |
| 2011 | VTLN in the MFCC Domain: Band-Limited versus Local Interpolation. Ehsan Variani, Thomas Schaaf |
| 2011 | Validating a Second Language Perception Model for Classroom Context - A Longitudinal Study within the Perceptual Assimilation Model. Bianca Sisinni, Mirko Grimaldi |
| 2011 | Validating rt-MRI Based Articulatory Representations via Articulatory Recognition. Athanasios Katsamanis, Erik Bresch, Vikram Ramanarayanan, Shrikanth S. Narayanan |
| 2011 | Variation of Accent Type and of Context - Influences on Pragmatic Focus Interpretation. Charlotte Wollermann, Ulrich Schade, Bernhard Schröder |
| 2011 | Variational Bayesian Model Selection for GMM-Speaker Verification Using Universal Background Model. Timur Pekhovsky, Alexandra Lokhanova |
| 2011 | Verifying Human Users in Speech-Based Interactions. Sajad Shirali-Shahreza, Yashar Ganjali, Ravin Balakrishnan |
| 2011 | Very Large Vocabulary ASR for Spoken Russian with Syntactic and Morphemic Analysis. Alexey Karpov, Irina S. Kipyatkova, Andrey Ronzhin |
| 2011 | Very Short Utterances and Timing in Turn-Taking. Mattias Heldner, Jens Edlund, Anna Hjalmarsson, Kornel Laskowski |
| 2011 | Visual Speech Speeds Up Auditory Identification Responses. Tim Paris, Jeesun Kim, Chris Davis |
| 2011 | Visual Voice Mail to Text on the iPhone/iPad. Andrej Ljolje, Vincent Goffin, Diamantino Caseiro, Taniya Mishra, Mazin Gilbert |
| 2011 | Visualization of Vocal Tract Shape Using Interleaved Real-Time MRI of Multiple Scan Planes. Yoon-Chul Kim, Michael I. Proctor, Shrikanth S. Narayanan, Krishna S. Nayak |
| 2011 | Voice Activity Detection in MTF-Based Power Envelope Restoration. Masashi Unoki, Xugang Lu, Rico Petrick, Shota Morita, Masato Akagi, Rüdiger Hoffmann |
| 2011 | Voice Conversion Using GMM with Enhanced Global Variance. Hadas Benisty, David Malah |
| 2011 | Voice Processing by Dynamic Glottal Models with Applications to Speech Enhancement. Carlo Drioli, Andrea Calanca |
| 2011 | Voice Quality Characterization of IETF Opus Codec. Anssi Rämö, Henri Toukomaa |
| 2011 | Vowel Context and Speaker Interactions Influencing Glottal Open Quotient and Formant Frequency Shifts in Physical Task Stress. Keith W. Godin, John H. L. Hansen |
| 2011 | Vowels Formants Analysis Allows Straightforward Detection of High Arousal Acted and Spontaneous Emotions. Bogdan Vlasenko, Dmytro Prylipko, David Philippou-Hübner, Andreas Wendemuth |
| 2011 | Web-Based Automatic Speech Recognition Service - webASR. Stuart N. Wrigley, Thomas Hain |
| 2011 | Web-Enhanced Content Retrieval for Information Access Dialogue System. Donghyeon Lee, Cheongjae Lee, Minwoo Jeong, Kyungduk Kim, Seokhwan Kim, Junhwi Choi, Gary Geunbae Lee |
| 2011 | Weight Optimization for Bimodal Unit-Selection Talking Head Synthesis. Asterios Toutios, Utpala Musti, Slim Ouni, Vincent Colotte |
| 2011 | Weighted Ordered Classes - Nearest Neighbors: A New Framework for Automatic Emotion Recognition from Speech. Yazid Attabi, Pierre Dumouchel |
| 2011 | When Two Newly-Acquired Words are One: New Words Differing in Stress Alone are not Automatically Represented Differently. Simone Sulpizio, James M. McQueen |
| 2011 | Where Should Pitch Accents and Phrase Breaks Go? A Syntax Tree Transducer Solution. Joseph Tepperman, Emily Nava |
| 2011 | WinPitch: A Multimodal Tool for Speech Analysis of Endangered Languages. Philippe Martin |
| 2011 | Woefzela - An Open-Source Platform for ASR Data Collection in the Developing World. Nic J. de Vries, Jaco Badenhorst, Marelie H. Davel, Etienne Barnard, Alta de Waal |
| 2011 | Word Boundary Modelling and Full Covariance Gaussians for Arabic Speech-to-Text Systems. Frank Diehl, Mark John Francis Gales, Xunying Liu, Marcus Tomalin, Philip C. Woodland |
| 2011 | Your Mobile Virtual Assistant Just Got Smarter! Mazin Gilbert, Iker Arizmendi, Enrico Bocchieri, Diamantino Caseiro, Vincent Goffin, Andrej Ljolje, Mike Phillips, Chao Wang, Jay G. Wilpon |
| 2011 | Zero-Crossing-Based Channel Attentive Weighting of Cepstral Features for Robust Speech Recognition: The ETRI 2011 CHiME Challenge System. Young-Ik Kim, Hoon-Young Cho, Sang-Hun Kim |
| 2011 | Zero-Resource Audio-Only Spoken Term Detection Based on a Combination of Template Matching Techniques. Armando Muscariello, Guillaume Gravier, Frédéric Bimbot |
| 2011 | i-vector Based Speaker Recognition on Short Utterances. Ahilan Kanagasundaram, Robbie Vogt, David Dean, Sridha Sridharan, Michael Mason |
| 2011 | iVector Approach to Phonotactic Language Recognition. Mehdi Soufifar, Marcel Kockmann, Lukás Burget, Oldrich Plchot, Ondrej Glembek, Torbjørn Svendsen |
| 2011 | iVector Fusion of Prosodic and Cepstral Features for Speaker Verification. Marcel Kockmann, Luciana Ferrer, Lukás Burget, Jan Cernocký |
| 2011 | mTalk - A Multimodal Browser for Mobile Services. Michael Johnston, Giuseppe Di Fabbrizio, Simon Urbanek |