INTERSPEECH A

852 papers

YearTitle / Authors
2011"What is... Dengue Fever?" - Modeling and Predicting Pronunciation Errors in a Text-to-Speech System.
Andrew Rosenberg, Raul Fernandez, Bhuvana Ramabhadran
2011"Would You Buy a Car from Me?" - On the Likability of Telephone Voices.
Felix Burkhardt, Björn W. Schuller, Benjamin Weiss, Felix Weninger
2011"You made me do it": Classification of Blame in Married Couples' Interactions by Fusing Automatically Derived Speech and Language Information.
Matthew Black, Panayiotis G. Georgiou, Athanasios Katsamanis, Brian R. Baucom, Shrikanth S. Narayanan
2011'Are You Sure You're Paying Attention?' - 'Uh-Huh' Communicating Understanding as a Marker of Attentiveness.
Hendrik Buschmeier, Zofia Malisz, Marcin Wlodarczak, Stefan Kopp, Petra Wagner
201112th Annual Conference of the International Speech Communication Association, INTERSPEECH 2011, Florence, Italy, August 27-31, 2011
2011A Bayesian Approach to Voice Conversion Based on GMMs Using Multiple Model Structures.
Lei Li, Yoshihiko Nankaku, Keiichi Tokuda
2011A Bottom-Up Stepwise Knowledge-Integration Approach to Large Vocabulary Continuous Speech Recognition Using Weighted Finite State Machines.
Sabato Marco Siniscalchi, Torbjørn Svendsen, Chin-Hui Lee
2011A Comparative Acoustic Study on Speech of Glossectomy Patients and Normal Subjects.
Xinhui Zhou, Maureen L. Stone, Carol Y. Espy-Wilson
2011A Corpus-Based Study of English Pronunciation Variations.
Sunhee Kim, Kyuwhan Lee, Minhwa Chung
2011A Cross-Lingual Approach to the Development of an HMM-Based Speech Synthesis System for Malay.
Mumtaz B. Mustafa, Raja Noor Ainon, Roziati Zainuddin, Zuraidah M. Don, Gerry Knowles
2011A Cross-Lingual Spoken Content Search System.
Jitendra Ajmera, Ashish Verma
2011A Divide et impera Algorithm for Optimal Pitch Stylization.
Antonio Origlia, Giovanni Abete, Francesco Cutugno, Iolanda Alfano, Renata Savy, Bogdan Ludusan
2011A Dual Channel Coupled Decoder for Fillers and Feedback.
Daniel Neiberg, Joakim Gustafson
2011A Frequency Domain Approach to ARX-LF Voiced Speech Parameterization and Synthesis.
Alan Ó Cinnéide, David Dorran, Mikel Gainza, Eugene Coyle
2011A Fully Automated Derivation of State-Based Eigentriphones for Triphone Modeling with No Tied States Using Regularization.
Tom Ko, Brian Mak
2011A Grammar Based Approach to Style Specific Phrase Prediction.
Alok Parlikar, Alan W. Black
2011A High Resolution Multiple Source Localization Based on Generalized Cumulant Structure (GCS) Matrix.
Jinho Choi, Chang D. Yoo
2011A Hybrid Quasi-Harmonic/CELP Wideband Speech Coding Scheme for Unit Selection TTS Synthesis.
Chang-Heon Lee, Olivier Rosec, Yannis Stylianou
2011A Hybrid TTS Approach for Prosody and Acoustic Modules.
Iñaki Sainz, Daniel Erro, Eva Navas, Inma Hernáez
2011A Language Independent Approach to Audio Search.
Vikram Gupta, Jitendra Ajmera, Arun Kumar, Ashish Verma
2011A Level-Dependent Auditory Filter-Bank for Speech Recognition in Reverberant Environments.
Hari Krishna Maganti, Marco Matassoni
2011A Long-Term Harmonic Plus Noise Model for Speech Signals.
Faten Ben Ali, Laurent Girin, Sonia Djaziri Larbi
2011A Longest Matching Segment Approach with Baysian Adaptation - Application to Noise-Robust Speaker Recognition.
Ayeh Jafari, Ramji Srinivasan, Danny Crookes, Ji Ming
2011A Model-Based Spectral Envelope Wiener Filter for Perceptually Motivated Speech Enhancement.
Najib Hadir, Friedrich Faubel, Dietrich Klakow
2011A Multichannel Feature-Based Processing for Robust Speech Recognition.
Mehrez Souden, Keisuke Kinoshita, Marc Delcroix, Tomohiro Nakatani
2011A Multimodal Analysis of Vocal and Visual Backchannels in Spontaneous Dialogs.
Khiet P. Truong, Ronald Poppe, Iwan de Kok, Dirk Heylen
2011A Multimodal Approach to Dictation of Handwritten Historical Documents.
Vicent Alabau, Verónica Romero, Antonio L. Lagarda, Carlos D. Martínez-Hinarejos
2011A Multimodal Real-Time MRI Articulatory Corpus for Speech Research.
Shrikanth S. Narayanan, Erik Bresch, Prasanta Kumar Ghosh, Louis Goldstein, Athanasios Katsamanis, Yoon Kim, Adam C. Lammert, Michael I. Proctor, Vikram Ramanarayanan, Yinghua Zhu
2011A Multithreaded Implementation of Viterbi Decoding on Recursive Transition Networks.
Fabio Brugnara
2011A New Epsilon Filter for Efficient Composition of Weighted Finite-State Transducers.
Frank Duckhorn, Matthias Wolff, Rüdiger Hoffmann
2011A New Model-Based Mandarin-Speech Coding System.
Chen-Yu Chiang, Jyh-Her Yang, Ming-Chieh Liu, Yih-Ru Wang, Yuan-Fu Liao, Sin-Horng Chen
2011A New Perspective on GMM Subspace Compensation Based on PPCA and Wiener Filtering.
Alan McCree, Douglas E. Sturim, Douglas A. Reynolds
2011A New Phonetic Candidate Generator for Improving Search Query Efficiency.
Bo Peng, Yao Qian, Frank K. Soong, Bo Zhang
2011A Noise Estimation Method Based on Speech Presence Probability and Spectral Sparseness.
Chao Li, Wenju Liu
2011A Paradigm for Limited Vocabulary Speech Recognition Based on Redundant Spectro-Temporal Feature Sets.
Sourish Chaudhuri, Bhiksha Raj, Tony Ezzat
2011A Parametric Approach to Intonation Acquisition Research: Validation on Child-Directed Speech Data.
Britta Lintfert, Antje Schweitzer, Bernd Möbius
2011A Perceptual Expressivity Modeling Technique for Speech Synthesis Based on Multiple-Regression HSMM.
Takashi Nose, Takao Kobayashi
2011A Performance Monitoring Approach to Fusing Enhanced Spectrogram Channels in Robust Speech Recognition.
Shirin Badiezadegan, Richard C. Rose
2011A Piecewise Aggregate Approximation Lower-Bound Estimate for Posteriorgram-Based Dynamic Time Warping.
Yaodong Zhang, James R. Glass
2011A Pitch Tracking Corpus with Evaluation on Multipitch Tracking Scenario.
Gregor Pirker, Michael Wohlmayr, Stefan Petrik, Franz Pernkopf
2011A Pointwise Approach to Pronunciation Estimation for a TTS Front-End.
Shinsuke Mori, Graham Neubig
2011A Preliminary Model of Emotional Prosody Using Multidimensional Scaling.
Sona Patel, Rahul Shrivastav
2011A Preliminary Study on the Production of Signs in Brazilian Sign Language when One of the Manual Articulators is Unavailable.
André N. Xavier, Plínio A. Barbosa
2011A Qualitative Evaluation of Phoneme-to-Phoneme Technology.
Marijn Schraagen, Gerrit Bloothooft
2011A Quantitative Investigation of the Prosody of Verum Focus in Italian.
Giuseppina Turco, Michele Gubian, Jessamyn Schertz
2011A Rapid Adaptation Algorithm for Tracking Highly Non-Stationary Noises based on Bayesian Inference for On-Line Spectral Change Point Detection.
Md Foezur Rahman Chowdhury, Sid-Ahmed Selouani, Douglas D. O'Shaughnessy
2011A Risk-Estimation-Based Comparison of Mean Square Error and Itakura-Saito Distortion Measures for Speech Enhancement.
Nagarjuna Reddy Muraka, Chandra Sekhar Seelamantula
2011A Robust Approach to Mining Repeated Sequence in Audio Stream.
Jiansong Chen, Lei Zhu, Bailan Feng, Peng Ding, Bo Xu
2011A Robust Estimation Method of Noise Mixture Model for Noise Suppression.
Masakiyo Fujimoto, Shinji Watanabe, Tomohiro Nakatani
2011A Scalable Approach to Building a Parallel Corpus from the Web.
Vivek Kumar Rangarajan Sridhar, Luciano Barbosa, Srinivas Bangalore
2011A Soft Decision-Based Speech Enhancement Using Acoustic Noise Classification.
Jae-Hun Choi, Sang-Kyun Kim, Joon-Hyuk Chang
2011A Speaker Line-Up for the Likelihood Ratio.
David A. van Leeuwen, Niko Brümmer
2011A Statistical Phrase/Accent Model for Intonation Modeling.
Gopala Krishna Anumanchipalli, Luís C. Oliveira, Alan W. Black
2011A Statistical Room Impulse Response Model with Frequency Dependent Reverberation Time for Single-Microphone Late Reverberation Suppression.
Jan S. Erkelens, Richard Heusdens
2011A Study of the Effectiveness of Articulatory Strokes for Phonemic Recognition.
Carlos Molina, Sungbok Lee, Shrikanth S. Narayanan, Néstor Becerra Yoma
2011A Study on Auditory Feature Spaces for Speech-Driven Lip Animation.
Guylaine Le Jan, Yannick Benezeth, Guillaume Gravier, Frédéric Bimbot
2011A Study on Bag of Gaussian Model with Application to Voice Conversion.
Yu Qiao, Tong Tong, Nobuaki Minematsu
2011A Study on Combining VTLN and SAT to Improve the Performance of Automatic Speech Recognition.
Doddipatla Rama Sanand, Mikko Kurimo
2011A Study on Speaker Normalized MLP Features in LVCSR.
Zoltán Tüske, Christian Plahl, Ralf Schlüter
2011A Study on the Effect of Pitch on LPCC and PLPC Features for Children's ASR in Comparison to MFCC.
Shweta Ghai, Rohit Sinha
2011A Study on the Perception of Tone and Intonation in Sesotho.
Hansjörg Mixdorff, Lehlohonolo Mohasi, Malillo Machobane, Thomas Niesler
2011A Tale of Two Tasks: Detecting Children's Off-Task Speech in a Reading Tutor.
Wei Chen, Jack Mostow
2011A Template Based Voice Trigger System Using Bhattacharyya Edit Distance.
Evelyn Kurniawati, Samsudin Ng, Karthik Muralidhar, Sapna George
2011A Transcription Task for Crowdsourcing with Automatic Quality Control.
Chia-ying Lee, James R. Glass
2011A Two-Stage Sample-Based Phone Boundary Detector Using Segmental Similarity Features.
Yih-Ru Wang
2011A Versatile Gaussian Splitting Approach to Non-Linear State Estimation and its Application to Noise-Robust ASR.
Volker Leutnant, Alexander Krueger, Reinhold Haeb-Umbach
2011A Web Based Speech Transcription Workplace.
Markus Klehr, Andreas Ratzka, Thomas Roß
2011A Web-Based Tool for Developing Multilingual Pronunciation Lexicons.
Samantha Ainsley, Linne Ha, Martin Jansche, Ara Kim, Masayuki Nanzawa
2011ASR for Human-Symbiotic Robot "EMIEW2" with Mechanical Noise and Floor-Level Noise Reduction.
Takashi Sumiyoshi, Masahito Togami, Yasunari Obuchi
2011AT&T VoiceBuilder: A Cloud-Based Text-to-Speech Voice Builder Tool.
Yeon-Jun Kim, Thomas Okken, Alistair Conkie, Giuseppe Di Fabbrizio
2011AUC Optimization Based Confidence Measure for Keyword Spotting.
Haiyang Li, Jiqing Han, Tieran Zheng
2011About Handling Boundary Uncertainty in a Speaking Rate Dependent Modeling Approach.
Denis Jouvet, Dominique Fohr, Irina Illina
2011Accelerated Parallelizable Neural Network Learning Algorithm for Speech Recognition.
Dong Yu, Li Deng
2011Acceleration Sensor Based Estimates of Subglottal Resonances: Short vs. Long Vowels.
Wolfgang Wokurek, Andreas Madsack
2011Accounting for Prosodic Information to Improve ASR-Based Topic Tracking for TV Broadcast News.
Camille Guinaudeau, Julia Hirschberg
2011Acoustic Analysis of Whispered Speech for Phoneme and Speaker Dependency.
Xing Fan, Keith W. Godin, John H. L. Hansen
2011Acoustic Correlates of Glottal Gaps.
Gang Chen, Jody Kreiman, Yen-Liang Shue, Abeer Alwan
2011Acoustic Forest for SMAP-Based Speaker Verification.
Sangeeta Biswas, Marc Ferras, Koichi Shinoda, Sadaoki Furui
2011Acoustic Look-Ahead for More Efficient Decoding in LVCSR.
David Nolden, Ralf Schlüter, Hermann Ney
2011Acoustic Model Training with Detecting Transcription Errors in the Training Data.
Gakuto Kurata, Nobuyasu Itoh, Masafumi Nishimura
2011Acoustic Modeling with Bootstrap and Restructuring Based on Full Covariance.
Xiaodong Cui, Xin Chen, Jian Xue, Peder A. Olsen, John R. Hershey, Bowen Zhou
2011Acoustic and Prosodic Correlates of Social Behavior.
Agustín Gravano, Rivka Levitan, Laura Willson, Stefan Benus, Julia Hirschberg, Ani Nenkova
2011Acoustic and Visual Cues of Turn-Taking Dynamics in Dyadic Interactions.
Bo Xiao, Viktor Rozgic, Athanasios Katsamanis, Brian R. Baucom, Panayiotis G. Georgiou, Shrikanth S. Narayanan
2011Acoustic-Linguistic Recognition of Interest in Speech with Bottleneck-BLSTM Nets.
Martin Wöllmer, Felix Weninger, Florian Eyben, Björn W. Schuller
2011Acoustic-Similarity Based Technique to Improve Concept Recognition.
Om Deshmukh, Shajith Ikbal, Ashish Verma, Etienne Marcheret
2011Acquisition of Timing Patterns in Second Language.
Mikhail Ordin, Leona Polyanskaya, Christiane Ulbrich
2011Active Learning for Dialogue Act Classification.
Björn Gambäck, Fredrik Olsson, Oscar Täckström
2011Ad-Hoc Meeting Transcription on Clusters of Mobile Devices.
Michele Cossalter, Priya Sundararajan, Ian R. Lane
2011Adaptation of Prosody in Speech Synthesis by Changing Command Values of the Generation Process Model of Fundamental Frequency.
Keikichi Hirose, Keiko Ochi, Ryusuke Mihara, Hiroya Hashimoto, Daisuke Saito, Nobuaki Minematsu
2011Adaptation of Speaker-Specific Bases in Non-Negative Matrix Factorization for Single Channel Speech-Music Separation.
Emad M. Grais, Hakan Erdogan
2011Adaptive Blocking Beamformer for Speech Separation.
Ngoc Thuy Tran, William G. Cowley, André Pollok
2011Adaptive Estimation of Zeros of Time-Varying Z-Transforms.
Christian Fischer Pedersen, Ove Andersen, Paul Dalsgaard
2011Adaptive Regularization Framework for Robust Voice Activity Detection.
Xugang Lu, Masashi Unoki, Ryosuke Isotani, Hisashi Kawai, Satoshi Nakamura
2011Adaptive Stream Fusion in Multistream Recognition of Speech.
Nima Mesgarani, Samuel Thomas, Hynek Hermansky
2011Adding Glottal Source Information to Intra-Lingual Voice Conversion.
Javier Pérez, Antonio Bonafonte
2011Adding a Speech Cursor to a Multimodal Dialogue System.
Staffan Larsson, Alexander Berman, Jessica Villing
2011Addressing the Data-Imbalance Problem in Kernel-Based Speaker Verification via Utterance Partitioning and Speaker Comparison.
Wei Rao, Man-Wai Mak
2011Age-Dependent Differences in the Neutralization of the Intervocalic Voicing Contrast: Evidence from an Apparent-Time Study on East Franconian.
Viola Müller, Jonathan Harrington, Felicitas Kleber, Ulrich Reubold
2011Agglomerative Hierarchical Clustering of Emotions in Speech Based on Subjective Relative Similarity.
Ryoichi Takashima, Tohru Nagano, Ryuki Tachibana, Masafumi Nishimura
2011Albayzín 2010: A Spanish Text to Speech Evaluation.
Francisco Campillo, Francisco Méndez Pazó, Montserrat Arza, Laura Docío Fernández, Antonio Bonafonte, Eva Navas, Iñaki Sainz
2011Alternative Frequency Scale Cepstral Coefficient for Robust Sound Event Recognition.
Yi Ren Leng, Tran Huy Dat, Norihide Kitaoka, Haizhou Li
2011An Accurate and Robust Gender Identification Algorithm.
Andrea DeMarco, Stephen J. Cox
2011An Active Learning Approach to Task Adaptation.
Ji Wu, Zhiyang He, Ping Lv
2011An Affective Spoken Storyteller.
Felix Burkhardt
2011An Analysis Framework Based on Random Subspace Sampling for Speaker Verification.
Weiwu Jiang, Zhifeng Li, Helen M. Meng
2011An Analysis of Automatic Speech Recognition with Multiple Microphones.
Davide Marino, Thomas Hain
2011An Analysis of PCA-Based Vocal Entrainment Measures in Married Couples' Affective Spoken Interactions.
Chi-Chun Lee, Athanasios Katsamanis, Matthew P. Black, Brian R. Baucom, Panayiotis G. Georgiou, Shrikanth S. Narayanan
2011An Analysis of Word Duration in Native Speakers and Japanese Speakers of English.
Tomoko Nariai, Kazuyo Tanaka, Yoshiaki Itoh
2011An Application to Test the Emotion Conveyed by Vocal and Musical Signals.
Simone Carcone, Carlo Giovannella
2011An Assessment of the Improvement Potential of Time-Frequency Masking for Speech Dereverberation.
Chenxi Zheng, Tiago H. Falk, Wai-Yip Chan
2011An Automatic Voice Pleasantness Classification System Based on Prosodic and Acoustic Patterns of Voice Preference.
Luís Pinto Coelho, Daniela Braga, Miguel Sales Dias, Carmen García-Mateo
2011An Efferent-Inspired Auditory Model Front-End for Speech Recognition.
Chia-ying Lee, James R. Glass, Oded Ghitza
2011An Efficient Pre-Processing Scheme to Improve the Sound Source Localization System in Noisy Environment.
Sheng-Chieh Lee, K. Bharanitharan, Bo-Wei Chen, Jhing-Fa Wang, Chung-Hsien Wu, Min-Jian Liao
2011An Efficient Unified Extraction Algorithm for Bilingual Data.
Christoph Tillmann, Sanjika Hewavitharana
2011An Electropalatographic and Acoustic Study on Anticipatory Coarticulation in V1#C2V2 Sequences in Standard Chinese.
Yinghao Li, Jiangping Kong
2011An Empirical Study of Multilingual Spoken Term Detection.
Zejun Ma, Xiaorui Wang, Bo Xu
2011An Empirical Study on Improving Hierarchical Phrase-Based Translation Using Alignment Features.
Songfang Huang, Bowen Zhou
2011An Engine-Independent Text-to-Speech Workplace.
Margot Mieskes
2011An Experimental Analysis of Pitch Patterns in Japanese Speakers of English with Verification by Speech Re-Synthesis.
Tomoko Nariai, Kazuyo Tanaka
2011An Exploratory Study of the Relations Between Perceived Emotion Strength and Articulatory Kinematics.
Jangwon Kim, Sungbok Lee, Shrikanth S. Narayanan
2011An HMM-Based Approach to the INTERSPEECH 2011 Speaker State Challenge.
Albino Nogueiras Rodríguez
2011An Informed Source Separation System for Speech Signals.
Shuhua Zhang, Laurent Girin
2011An International English Speech Corpus for Longitudinal Study of Accent Development.
Rosemary Orr, Hugo Quené, Roeland van Beek, Thari Diefenbach, David A. van Leeuwen, Marijn Huijbregts
2011An Investigation in Speech Recognition for Colloquial Arabic.
Sarah Al-Shareef, Thomas Hain
2011An Investigation of Depressed Speech Detection: Features and Normalization.
Nicholas Cummins, Julien Epps, Michael Breakspear, Roland Goecke
2011An i-vector Based Approach to Acoustic Sniffing for Irrelevant Variability Normalization Based Acoustic Model Training and Speech Recognition.
Jian Xu, Yu Zhang, Zhi-Jie Yan, Qiang Huo
2011An i-vector Based Approach to Training Data Clustering for Improved Speech Recognition.
Yu Zhang, Jian Xu, Zhi-Jie Yan, Qiang Huo
2011Analysing the Correspondence Between Automatic Prosodic Segmentation and Syntactic Structure.
György Szaszák, Katalin Nagy, András Beke
2011Analysis and Automatic Estimation of Children's Subglottal Resonances.
Steven M. Lulich, Harish Arsikere, John R. Morton, Gary K. F. Leung, Abeer Alwan, Mitchell Sommers
2011Analysis and Comparison of Recent MLP Features for LVCSR Systems.
Fabio Valente, Mathew Magimai-Doss, Wen Wang
2011Analysis of Acoustic-Prosodic Features Related to Paralinguistic Information Carried by Interjections in Dialogue Speech.
Carlos Toshinori Ishi, Hiroshi Ishiguro, Norihiro Hagita
2011Analysis of Dialectal Influence in Pan-Arabic ASR.
Udhyakumar Nallasamy, Michael Garbus, Florian Metze, Qin Jin, Thomas Schaaf, Tanja Schultz
2011Analysis of HMM-Based Lombard Speech Synthesis.
Tuomo Raitio, Antti Suni, Martti Vainio, Paavo Alku
2011Analysis of Inter-Articulator Correlation in Acoustic-to-Articulatory Inversion Using Generalized Smoothness Criterion.
Prasanta Kumar Ghosh, Shrikanth S. Narayanan
2011Analysis of i-vector Length Normalization in Speaker Recognition Systems.
Daniel Garcia-Romero, Carol Y. Espy-Wilson
2011Analyzing Training Dependencies and Posterior Fusion in Discriminant Classification of Apnea Patients Based on Sustained and Connected Speech.
José Luis Blanco Murillo, Rubén Fernández Pozo, Doroteo Torre Toledano, Javier Caminero, Eduardo López
2011Analyzing the Nature of ECA Interactions in Children with Autism.
Emily Mower, Chi-Chun Lee, James Gibson, Theodora Chaspari, Marian E. Williams, Shrikanth S. Narayanan
2011Anger Recognition in Spoken Dialog Using Linguistic and Para-Linguistic Information.
Narichika Nomoto, Masafumi Tamoto, Hirokazu Masataki, Osamu Yoshioka, Satoshi Takahashi
2011Announcing the Electromagnetic Articulography (Day 1) Subset of the mngu0 Articulatory Corpus.
Korin Richmond, Phil Hoole, Simon King
2011Aperiodicity Analysis for Quality Estimation of Text-to-Speech Signals.
Christoph Norrenbrock, Ulrich Heute, Florian Hinterleitner, Sebastian Möller
2011Applying Rhythm Features to Automatically Assess Non-Native Speech.
Lei Chen, Klaus Zechner
2011Applying the Quantitative Target Approximation Model (qTA) to German and Brazilian Portuguese.
Plínio Almeida Barbosa, Hansjörg Mixdorff, Sandra Madureira
2011Approximate Inference for Domain Detection in Spoken Language Understanding.
Asli Celikyilmaz, Dilek Hakkani-Tür, Gökhan Tür
2011Articulatory Feature Classification Using Nearest Neighbors.
Arild Brandrud Næss, Karen Livescu, Rohit Prabhavalkar
2011Articulatory Reduction in Mandarin Chinese Words.
Jeffrey Berry, Sunjing Ji, Ian R. Fasel, Diana Archangeli
2011Assessing Acoustic Reduction: Exploiting Local Structure in Speech.
Louis ten Bosch, Annika Hämäläinen, Mirjam Ernestus
2011Asynchronous Multimodal Text Entry Using Speech and Gesture Keyboards.
Per Ola Kristensson, Keith Vertanen
2011Attention, Sobriety Checkpoint! Can Humans Determine by Means of Voice, if Someone is Drunk... and Can Automatic Classifiers Compete?
Stefan Ultes, Alexander Schmitt, Wolfgang Minker
2011Auditory Filterbank Improves Voice Morphing.
Erika Okamoto, Toshio Irino, Ryuichi Nisimura, Hideki Kawahara
2011Auditory Speech Processing is Affected by Visual Speech in the Periphery.
Jeesun Kim, Chris Davis
2011Automatic Analysis of Singleton and Geminate Consonant Articulation Using Real-Time Magnetic Resonance Imaging.
Christina Hagedorn, Michael I. Proctor, Louis Goldstein
2011Automatic Assessment of Prosody in High-Stakes English Tests.
Jian Cheng
2011Automatic Call Quality Monitoring Using Cost-Sensitive Classification.
Youngja Park
2011Automatic Comma Insertion of Lecture Transcripts Based on Multiple Annotations.
Yuya Akita, Tatsuya Kawahara
2011Automatic Data-Driven Learning of Articulatory Primitives from Real-Time MRI Data Using Convolutive NMF with Sparseness Constraints.
Vikram Ramanarayanan, Athanasios Katsamanis, Shrikanth S. Narayanan
2011Automatic Detection of Anger in Human-Human Call Center Dialogs.
Mustafa Erden, Levent M. Arslan
2011Automatic Detection of Depression in Speech Using Gaussian Mixture Modeling with Factor Analysis.
Douglas E. Sturim, Pedro A. Torres-Carrasquillo, Thomas F. Quatieri, Nicolas Malyska, Alan McCree
2011Automatic Detection of Speaker Attributes Based on Utterance Text.
Wen Wang, Andreas Kathol, Harry Bratt
2011Automatic Determination of the Standard Chinese Prosodic Phrase Boundaries by F0 Generation Model.
Shehui Bu, Zhenjie Zhuo, Lingling Yang, Shuichi Itahashi
2011Automatic Generation of Listening Comprehension Learning Material in European Portuguese.
Thomas Pellegrini, Rui Correia, Isabel Trancoso, Jorge Baptista, Nuno J. Mamede
2011Automatic Identification of Salient Acoustic Instances in Couples' Behavioral Interactions Using Diverse Density Support Vector Machines.
James Gibson, Athanasios Katsamanis, Matthew P. Black, Shrikanth S. Narayanan
2011Automatic Learning in Content Indexing Service Using Phonetic Alignment.
Yeon-Jun Kim, David C. Gibbon
2011Automatic Prosodic Events Detection by Using Syllable-Based Acoustic, Lexical and Syntactic Features.
Chong-Jia Ni, Wenju Liu, Bo Xu
2011Automatic Prosody Generation for Serbo-Croatian Speech Synthesis Based on Regression Trees.
Milan Secujski, Darko Pekar, Niksa Jakovljevic
2011Automatic Selection of Acoustic and Non-Linear Dynamic Features in Voice Signals for Hypernasality Detection.
Juan Rafael Orozco-Arroyave, S. Murillo Rendón, Andrés Marino Álvarez-Meza, Julián D. Arias-Londoño, Edilson Delgado-Trejos, Jesús Francisco Vargas-Bonilla, César Germán Castellanos-Domínguez
2011Automatic Sentence Selection from Speech Corpora Including Diverse Speech for Improved HMM-TTS Synthesis Quality.
Norbert Braunschweiler, Sabine Buchholz
2011Automatic Speech Codec Identification with Applications to Tampering Detection of Speech Recordings.
Jingting Zhou, Daniel Garcia-Romero, Carol Y. Espy-Wilson
2011Automatic Speech Recognition System Dedicated for Polish.
Mariusz Ziólko, Jakub Galka, Bartosz Ziólko, Tomasz Jadczyk, Dawid Skurzok, Mariusz Masior
2011Automatic Subtitling of the Basque Parliament Plenary Sessions Videos.
Germán Bordel, Silvia Nieto, Mikel Peñagarikano, Luis Javier Rodríguez, Amparo Varona
2011Automatic Viseme Clustering for Audiovisual Speech Synthesis.
Wesley Mattheyses, Lukas Latacz, Werner Verhelst
2011Automatically Creating a Diphone Set from a Speech Database.
Thomas Ewender, Beat Pfister
2011Automatically Optimizing Utterance Classification Performance without Human in the Loop.
Yun-Cheng Ju, Jasha Droppo
2011Bayesian Extension of MUSIC for Sound Source Localization and Tracking.
Takuma Otsuka, Kazuhiro Nakadai, Tetsuya Ogata, Hiroshi G. Okuno
2011Bayesian Language Model Interpolation for Mobile Speech Input.
Cyril Allauzen, Michael Riley
2011Bilingual Acoustic Model Adaptation by Unit Merging on Different Levels and Cross-Level Integration.
Ching-Feng Yeh, Chao-Yu Huang, Lin-Shan Lee
2011Binaural Cues for Fragment-Based Speech Recognition in Reverberant Multisource Environments.
Ning Ma, Jon Barker, Heidi Christensen, Phil D. Green
2011Binaural Noise-Reduction Method Based on Blind Source Separation and Perceptual Post Processing.
Jorge I. Marin-Hurtado, Devangi N. Parikh, David V. Anderson
2011Biomechanical Tongue Models: An Approach to Studying Inter-Speaker Variability.
Ralf Winkler, Susanne Fuchs, Pascal Perrier, Mark Tiede
2011Blind Source Separation for Robot Audition Using Fixed Beamforming with HRTFs.
Mounira Maazaoui, Yves Grenier, Karim Abed-Meraim
2011Blind Speech Prior Estimation for Generalized Minimum Mean-Square Error Short-Time Spectral Amplitude Estimator.
Ryo Wakisaka, Hiroshi Saruwatari, Kiyohiro Shikano, Tomoya Takatani
2011Blind Speech Separation in Multiple Environments Using a Frequency Oriented PCA Method for Convolutive Mixtures.
Yasmina Benabderrahmane, Sid-Ahmed Selouani, Douglas D. O'Shaughnessy
2011Blind Speech Separation in Time-Domain Using Block-Toeplitz Structure of Reconstructed Signal Matrices.
Zbynek Koldovský, Jirí Málek, Petr Tichavský
2011Boosting Speaker Recognition Performance with Compact Representations.
Sibel Yaman, Jason W. Pelecanos, Mohamed Kamal Omar
2011Bootstrapping Domain Detection Using Query Click Logs for New Domains.
Dilek Hakkani-Tür, Gökhan Tür, Larry P. Heck, Elizabeth Shriberg
2011Breath-Detection-Based Telephony Speech Phrasing.
Takashi Fukuda, Osamu Ichikawa, Masafumi Nishimura
2011Building an Audio-Visual Corpus of Australian English: Large Corpus Collection with an Economical Portable and Replicable Black Box.
Denis Burnham, Dominique Estival, Steven Fazio, Jette Viethen, Felicity Cox, Robert Dale, Steve Cassidy, Julien Epps, Roberto Togneri, Michael Wagner, Yuko Kinoshita, Roland Göcke, Joanne Arciuli, Mark Onslow, Trent W. Lewis, Andrew Butcher, John Hajek
2011Can Audio-Visual Speech Recognition Outperform Acoustically Enhanced Speech Recognition in Automotive Environment?
Rajitha Navarathna, Tristan Kleinschmidt, David Dean, Sridha Sridharan, Patrick Lucey
2011Can Objective Measures Predict the Intelligibility of Modified HMM-Based Synthetic Speech in Noise?
Cassia Valentini-Botinhao, Junichi Yamagishi, Simon King
2011Candidate Generation for ASR Output Error Correction Using a Context-Dependent Syllable Cluster-Based Confusion Matrix.
Chao-Hong Liu, Chung-Hsien Wu, David Sarwono, Jhing-Fa Wang
2011Characterizing Deletion Transformations Across Dialects Using a Sophisticated Tying Mechanism.
Nancy F. Chen, Wade Shen, Joseph P. Campbell
2011Cheap Bootstrap of Multi-Lingual Hidden Markov Models.
Daniele Falavigna, Roberto Gretter
2011Children's Recognition of their own Voice: Influence of Phonological Impairment.
Sofia Strömbergsson
2011Chinese and Italian Speech Rhythm: Normalization and the CCI Algorithm.
Chiara Bertini, Pier Marco Bertinetto, Na Zhi
2011Chorus Digitalis: Experiments in Chironomic Choir Singing.
Sylvain Le Beux, Lionel Feugère, Christophe d'Alessandro
2011Classification of Fricatives Using Feature Extrapolation of Acoustic-Phonetic Features in Telephone Speech.
Jung-won Lee, Jeung-Yoon Choi, Hong-Goo Kang
2011Clustering Expressive Speech Styles in Audiobooks Using Glottal Source Parameters.
Éva Székely, João P. Cabral, Peter Cahill, Julie Carson-Berndsen
2011Clustering with Modified Cosine Distance Learned from Constraints.
Leonid Rachevsky, Dimitri Kanevsky, Ruhi Sarikaya, Bhuvana Ramabhadran
2011Coarticulation Across Prosodic Domains in Italian: An Ultrasound Investigation.
Barbara Gili Fivela, Antonio Stella, Sonia D'Apolito, Francesco Sigona
2011Collecting Life Logs for Experience-Based Corpora.
Fabiano Francesconi, Arindam Ghosh, Giuseppe Riccardi, Marco Ronchetti, Alex Vagin
2011Combined Optical Distance Sensing and Electropalatography to Measure Articulation.
Peter Birkholz, Christiane Neuschaefer-Rube
2011Combining Active and Semi-Supervised Learning for Homograph Disambiguation in Mandarin Text-to-Speech Synthesis.
Binbin Shen, Zhiyong Wu, Yongxin Wang, Lianhong Cai
2011Combining Evidence from Spectral and Source-Like Features for Person Recognition from Humming.
Hemant A. Patil, Maulik C. Madhavi, Keshab K. Parhi
2011Combining Feature Space Discriminative Training with Long-Term Spectro-Temporal Features for Noise-Robust Speech Recognition.
Takashi Fukuda, Osamu Ichikawa, Masafumi Nishimura
2011Combining Frame and Segment Level Processing via Temporal Pooling for Phonetic Classification.
Sumit Chopra, Patrick Haffner, Dimitrios Dimitriadis
2011Combining Information Sources for Confidence Estimation with CRF Models.
Matthew Stephen Seigel, Philip C. Woodland
2011Combining Lattice-Based Language Dependent and Independent Approaches for Out-of-Language Detection in LVCSR.
Yuxiang Shan, Yan Deng, Jia Liu
2011Combining Multiple Phoneme-Based Classifiers with Audio Feature-Based Classifier for the Detection of Alcohol Intoxication.
Claude Montacié, Marie-José Caraty
2011Combining Phonological and Acoustic ASR-Free Features for Pathological Speech Intelligibility Assessment.
Catherine Middag, Tobias Bocklet, Jean-Pierre Martens, Elmar Nöth
2011Commas Recovery with Syntactic Features in French and in Czech.
Christophe Cerisara, Pavel Král, Claire Gardent
2011Comparing Different Flavors of Spectro-Temporal Features for ASR.
Bernd T. Meyer, Suman V. Ravuri, Marc René Schädler, Nelson Morgan
2011Comparing Multi-Stage Approaches for Cross-Show Speaker Diarization.
Viet-Anh Tran, Viet Bac Le, Claude Barras, Lori Lamel
2011Comparing Syllable Frequencies in Corpora of Written and Spoken Language.
Barbara Samlowski, Bernd Möbius, Petra Wagner
2011Comparing System-Driven and Free Dialogue in In-Vehicle Interaction.
Fredrik Kronlid, Jessica Villing, Alexander Berman, Staffan Larsson
2011Comparing Word and Syllable Prominence Rated by Naïve Listeners.
Denis Arnold, Bernd Möbius, Petra Wagner
2011Comparing the Impact of Raised Vocal Effort on Various Spectral Parameters.
Corinna Harwardt
2011Comparison of Nasalance Measurements from Accelerometers and Microphones and Preliminary Development of Novel Features.
Nicolas Audibert, Angélique Amelot
2011Comparison of Smoothing Techniques for Robust Context Dependent Acoustic Modelling in Hybrid NN/HMM Systems.
Guangsen Wang, Khe Chai Sim
2011Comparison of Speaker Recognition Approaches for Real Applications.
Sandro Cumani, Pier Domenico Batzu, Daniele Colibro, Claudio Vair, Pietro Laface, Vasileios Vasilakakis
2011Comparison of Voice Activity Detectors for Interview Speech in NIST Speaker Recognition Evaluation.
Hon-Bill Yu, Man-Wai Mak
2011Compound Word Recombination for German LVCSR.
Markus Nußbaum-Thom, Amr El-Desoky Mousa, Ralf Schlüter, Hermann Ney
2011Computer and Human Recognition of Regional Accents of British English.
Abualsoud Hanani, Martin J. Russell, Michael J. Carey
2011Computer-Assisted Disfluency Counts for Stuttered Speech.
Peter A. Heeman, Andy McMillin, J. Scott Yaruss
2011Conditioned Hidden Markov Model Fusion for Multimodal Classification.
Michael Glodek, Stefan Scherer, Friedhelm Schwenker
2011Confidence Measures for Turkish Call Center Conversations.
Ali Haznedaroglu, Levent M. Arslan
2011Connected Digit Recognition by Means of Reservoir Computing.
Azarakhsh Jalalvand, Fabian Triefenbach, David Verstraeten, Jean-Pierre Martens
2011Constrained Cepstral Speaker Recognition Using Matched UBM and JFA Training.
Michelle Hewlett Sanchez, Luciana Ferrer, Elizabeth Shriberg, Andreas Stolcke
2011Context and Priming Effects in the Recognition of Emotion of Old and Young Listeners.
Martijn Goudbeek, Marie Nilsenová
2011Context and Speaker Dependency in the Relation of Vowel Formants and Subglottal Resonances - Evidence from Hungarian.
Tekla Etelka Gráczi, Steven M. Lulich, Tamás Gábor Csapó, András Beke
2011Context-Dependent Duration Modeling with Backoff Strategy and Look-Up Tables for Pronunciation Assessment and Mispronunciation Detection.
Hongyan Li, Shen Huang, Shijin Wang, Bo Xu
2011Continuous Control of the Degree of Articulation in HMM-Based Speech Synthesis.
Benjamin Picart, Thomas Drugman, Thierry Dutoit
2011Continuous Digits Recognition Leveraging Invariant Structure.
Masayuki Suzuki, Gakuto Kurata, Masafumi Nishimura, Nobuaki Minematsu
2011Continuous Episodic Memory Based Speech Recognition Using Articulatory Dynamics.
Sébastien Demange, Slim Ouni
2011Contributions of F1 and F2 (F2') to the Perception of Plosive Consonants.
René Carré, Pierre L. Divenyi, Willy Serniclaes, Emmanuel Ferragne, Egidio Marsico, Viet Son Nguyen
2011Convergence of Line Search A-Function Methods.
Dimitri Kanevsky, David Nahamoo, Tara N. Sainath, Bhuvana Ramabhadran
2011Conversational Speech Transcription Using Context-Dependent Deep Neural Networks.
Frank Seide, Gang Li, Dong Yu
2011Conversational-Side-Specific Inter-Session Variability Compensation.
Mohamed Kamal Omar, Jason W. Pelecanos
2011Conversing in the Presence of a Competing Conversation: Effects on Speech Production.
Vincent Aubanel, Martin Cooke, Julián Villegas, María Luisa García Lecumberri
2011Correlating Text with Prosody.
Mohamed Abou-Zleikha, Julie Carson-Berndsen
2011Correlation Analysis of Acoustic Features with Perceptual Voice Quality Similarity for Similar Speaker Selection.
Yusuke Ijima, Mitsuaki Isogai, Hideyuki Mizuno
2011Cross Likelihood Ratio Based Speaker Clustering Using Eigenvoice Models.
David Wang, Robbie Vogt, Sridha Sridharan, David Dean
2011Cross-Language Phone Recognition when the Target Language Phoneme Inventory is not Known.
Timothy Kempton, Roger K. Moore, Thomas Hain
2011Cross-Lingual Speaker Discrimination Using Natural and Synthetic Speech.
Mirjam Wester, Hui Liang
2011Cross-Lingual Study of ASR Errors: On the Role of the Context in Human Perception of Near-Homophones.
Ioana Vasilescu, Dahbia Yahia, Natalie D. Snoeren, Martine Adda-Decker, Lori Lamel
2011Cross-Rate Variation in the Intelligibility of Dual-Rate Gated Speech in Older Listeners.
Valeriy Shafiro, Stanley Sheft, Robert Risley
2011Crossmodal Prosodic and Gestural Contribution to the Perception of Contrastive Focus.
Pilar Prieto, Cecilia Pugliesi, Joan Borràs-Comes, Ernesto Arroyo, Josep Blat
2011Crowdsourcing Preference Tests, and How to Detect Cheating.
Sabine Buchholz, Javier Latorre
2011Crowdsourcing for Word Recognition in Noise.
Martin Cooke, Jon Barker, María Luisa García Lecumberri, Krzysztof Wasilewski
2011Data Sampling and Dimensionality Reduction Approaches for Reranking ASR Outputs Using Discriminative Language Models.
Erinç Dikici, Murat Semerci, Murat Saraclar, Ethem Alpaydin
2011Data Selection with Kurtosis and Nasality Features for Speaker Recognition.
Howard Lei, Nikki Mirghafori
2011Data-Driven Gaussian Component Selection for Fast GMM-Based Speaker Verification.
Ce Zhang, Rong Zheng, Bo Xu
2011Data-Driven UBM Generation via Tied Gaussians for GMM-Supervector Based Accent Identification.
Rong Zheng, Ce Zhang, Bo Xu
2011Decision Tree-Based Clustering with Outlier Detection for HMM-Based Speech Synthesis.
Kyung Hwan Oh, June Sig Sung, Doo Hwa Hong, Nam Soo Kim
2011Deep Belief Networks for Automatic Music Genre Classification.
Xiaohong Yang, Qingcai Chen, Shusen Zhou, Xiaolong Wang
2011Deep Convex Net: A Scalable Architecture for Speech Pattern Classification.
Dong Yu, Li Deng
2011Deep Learning of Speech Features for Improved Phonetic Recognition.
Jaehyung Lee, Soo-Young Lee
2011Denoising Using Optimized Wavelet Filtering for Automatic Speech Recognition.
Randy Gomez, Tatsuya Kawahara
2011Deploying Google Search by Voice in Cantonese.
Yun-Hsuan Sung, Martin Jansche, Pedro J. Moreno
2011Detecting Sleepiness by Fusing Classifiers Trained with Novel Acoustic Features.
Tauhidur Rahman, Soroosh Mariooryad, Shalini Keshavamurthy, Gang Liu, John H. L. Hansen, Carlos Busso
2011Detecting the Status of a Predictive Incremental Speech Understanding Model for Real-Time Decision-Making in a Spoken Dialogue System.
David DeVault, Kenji Sagae, David R. Traum
2011Detection of Shouted Speech in the Presence of Ambient Noise.
Jouni Pohjalainen, Tuomo Raitio, Paavo Alku
2011Detection of Task-Incomplete Dialogs Based on Utterance-and-Behavior Tag N-Gram for Spoken Dialog Systems.
Sunao Hara, Norihide Kitaoka, Kazuya Takeda
2011Determining what Questions to Ask, with the Help of Spectral Graph Theory.
Abe Kazemzadeh, Sungbok Lee, Panayiotis G. Georgiou, Shrikanth S. Narayanan
2011Developing a Broadband Automatic Speech Recognition System for Afrikaans.
Febe de Wet, Alta de Waal, Gerhard B. Van Huyssteen
2011Dialect and Accent Recognition Using Phonetic-Segmentation Supervectors.
Fadi Biadsy, Julia Hirschberg, Daniel P. W. Ellis
2011Dialog Methods for Improved Alphanumeric String Capture.
Doug Peters, Peter Stubley
2011Diarization-Based Speaker Retrieval for Broadcast Television Archives.
Marijn Huijbregts, David A. van Leeuwen
2011Dimensionality Reduction for Using High-Order n-Grams in SVM-Based Phonotactic Language Recognition.
Mikel Peñagarikano, Amparo Varona, Luis Javier Rodríguez, Germán Bordel
2011Direct Error Rate Minimization of Hidden Markov Models.
Joseph Keshet, Chih-Chieh Cheng, Mark Stoehr, David A. McAllester
2011Direct Estimation of Articulatory Kinematics from Real-Time Magnetic Resonance Image Sequences.
Michael I. Proctor, Adam C. Lammert, Athanasios Katsamanis, Louis M. Goldstein, Christina Hagedorn, Shrikanth S. Narayanan
2011Discrete Choice Models for Non-Intrusive Quality Assessment.
Petko Nikolov Petkov, W. Bastiaan Kleijn, Bert de Vries
2011Discrete/Continuous Modelling of Speaking Style in HMM-Based Speech Synthesis: Design and Evaluation.
Nicolas Obin, Pierre Lanchantin, Anne Lacheret, Xavier Rodet
2011Discriminant Sub-Space Projection of Spectro-Temporal Speech Features Based on Maximizing Mutual Information.
Martin Heckmann, Claudius Gläser
2011Discriminative Features for Language Identification.
Christopher Alberti, Michiel Bacchiani
2011Discriminatively Trained i-vector Extractor for Speaker Verification.
Ondrej Glembek, Lukás Burget, Niko Brümmer, Oldrich Plchot, Pavel Matejka
2011Distant Speech Recognition in a Smart Home: Comparison of Several Multisource ASRs in Realistic Conditions.
Benjamin Lecouteux, Michel Vacher, François Portet
2011Does it Groove or does it Stumble - Automatic Classification of Alcoholic Intoxication using Prosodic Features.
Florian Hönig, Anton Batliner, Elmar Nöth
2011Drink and Speak: On the Automatic Classification of Alcohol Intoxication by Acoustic, Prosodic and Text-Based Features.
Tobias Bocklet, Korbinian Riedhammer, Elmar Nöth
2011Dual-Mode AVQ Coding Based on Spectral Masking and Sparseness Detection for ITU-T G.711.1/G.722 Super-Wideband Extensions.
Masahiro Fukui, Shigeaki Sasaki, Yusuke Hiwasaki, Sachiko Kurihara, Yoichi Haneda
2011Dysperiodicity Analysis of Perceptually Assessed Synthetic Speech Stimuli.
Ali Alpan, Francis Grenez, Jean Schoentgen
2011ELAN - Aspects of Interoperability and Functionality.
Han Sloetjes, Peter Wittenburg, Aarthy Somasundaram
2011EM-Based Gain Adaptation for Probabilistic Multipitch Tracking.
Michael Wohlmayr, Franz Pernkopf
2011EasyAlign: An Automatic Phonetic Alignment Tool Under Praat.
Jean-Philippe Goldman
2011Effect of Language Experience on the Categorical Perception of Cantonese Vowel Duration.
Caicai Zhang, Gang Peng, William S.-Y. Wang
2011Effective Arabic Dialect Classification Using Diverse Phonotactic Models.
Murat Akbacak, Dimitra Vergyri, Andreas Stolcke, Nicolas Scheffer, Arindam Mandal
2011Effective Triphone Mapping for Acoustic Modeling in Speech Recognition.
Sakhia Darjaa, Milos Cernak, Marián Trnka, Milan Rusko, Róbert Sabo
2011Effects of Focus on f0 and Duration in Irish (Gaelic) Declaratives.
Amelie Dorn, Ailbhe Ní Chasaide
2011Effects of Query Expansion for Spoken Document Passage Retrieval.
Tomoyosi Akiba, Koichiro Honda
2011Effects of Shortening Speech Prompts of In-Car Voice User Interfaces on Users Mental Models.
Julia Niemann, Kati Schulz, Ina Wechsung
2011Efficient Harvesting of Internet Audio for Resource-Scarce ASR.
Marelie H. Davel, Charl Johannes van Heerden, Neil Kleynhans, Etienne Barnard
2011Efficient Probabilistic Tracking of User Goal and Dialog History for Spoken Dialog Systems.
Antoine Raux, Yi Ma
2011Efficient Speaker and Noise Normalization for Robust Speech Recognition.
Vikas Joshi, Raghavendra Bilgi, Srinivasan Umesh, M. Carmen Benítez, Luz García
2011Eigen-Voice Based Anchor Modeling System for Speaker Identification Using MLLR Super-Vector.
Achintya Kumar Sarkar, Srinivasan Umesh
2011Electroglottograph and Acoustic Cues for Phonation Contrasts in Taiwan Min Falling Tones.
Ho-Hsien Pan, Mao-Hsu Chen, Shao-Ren Lyu
2011Emotion Classification Using Inter- and Intra-Subband Energy Variation.
Senaka Amarakeerthi, Tin Lay Nwe, Liyanage C. De Silva, Michael Cohen
2011Emotion Classification of Infants' Cries Using Duration Ratios of Acoustic Segments.
Kazuki Kitahara, Shinzi Michiwiki, Miku Sato, Shoichi Matsunaga, Masaru Yamashita, Kazuyuki Shinohara
2011Emotion Detection Based on Concept Inference and Spoken Sentence Analysis for Customer Service.
Ren-Ying Fang, Bo-Wei Chen, Jhing-Fa Wang, Chung-Hsien Wu
2011Empirical Evaluation and Combination of Advanced Language Modeling Techniques.
Tomás Mikolov, Anoop Deoras, Stefan Kombrink, Lukás Burget, Jan Cernocký
2011Enhancements to the Training Process of Classifier-Based Speech Translator via Topic Modeling.
Emil Ettelaie, Panayiotis G. Georgiou, Shrikanth S. Narayanan
2011Enriching Text-to-Speech Synthesis Using Automatic Dialog Act Tags.
Vivek Kumar Rangarajan Sridhar, Ann K. Syrdal, Alistair Conkie, Srinivas Bangalore
2011Entropy-Rate Driven Inference of Stochastic Grammars.
Unto K. Laine
2011Epoch Extraction in High Pass Filtered Speech Using Hilbert Envelope.
D. Govind, S. R. Mahadeva Prasanna, Debadatta Pati
2011Error Selection for ASR-Based English Pronunciation Training in 'My Pronunciation Coach'.
Catia Cucchiarini, Henk van den Heuvel, Eric Sanders, Helmer Strik
2011Estimating Speaking Rate by Means of Rhythmicity Parameters.
Christian Heinrich, Florian Schiel
2011Estimation of Perceptual Spaces for Speaker Identities Based on the Cross-Lingual Discrimination Task.
Minoru Tsuzaki, Keiichi Tokuda, Hisashi Kawai, Jinfu Ni
2011Estimation of Window Coefficients for Dynamic Feature Extraction for HMM-Based Speech Synthesis.
Ling-Hui Chen, Yoshihiko Nankaku, Heiga Zen, Keiichi Tokuda, Zhen-Hua Ling, Li-Rong Dai
2011Evaluating Artificial Bandwidth Extension by Conversational Tests in Car Using Mobile Devices with Integrated Hands-Free Functionality.
Laura Laaksonen, Ville Myllylä, Riitta Niemistö
2011Evaluating the Meaning of Synthesized Listener Vocalizations.
Sathish Pammi, Marc Schröder
2011Evaluation of Abnormal Sound Detection using Multi-Stage GMM in Various Environments.
Akinori Ito, Akihito Aiba, Masashi Ito, Shozo Makino
2011Evaluation of Bone-Conducted Ultrasonic Hearing-Aid Regarding Transmission of Speaker Discrimination Information.
Takayuki Kagomiya, Seiji Nakagawa
2011Evaluation of Fast Spoken Term Detection Using a Suffix Array.
Kouichi Katsurada, Shinta Sawada, Shigeki Teshima, Yurie Iribe, Tsuneo Nitta
2011Evaluation of Glottal Epoch Detection Algorithms on Different Voice Types.
João P. Cabral, John Kane, Christer Gobl, Julie Carson-Berndsen
2011Evaluation of Listening-Oriented Dialogue Control Rules Based on the Analysis of HMMs.
Toyomi Meguro, Yasuhiro Minami, Ryuichiro Higashinaka, Kohji Dohsaka
2011Evaluation of Tree-Trellis Based Decoding in Over-Million LVCSR.
Naoaki Ito, Yoshihiko Nankaku, Akinobu Lee
2011Evaluation of an Integrated Authoring Tool for Building Advanced Question-Answering Characters.
Sudeep Gandhe, Michael Rushforth, Priti Aggarwal, David R. Traum
2011Evaluation of i-vector Speaker Recognition Systems for Forensic Application.
Miranti Indar Mandasari, Mitchell McLaren, David A. van Leeuwen
2011Event Selection from Phone Posteriorgrams Using Matched Filters.
Keith Kintzley, Aren Jansen, Hynek Hermansky
2011Exploiting Intra-Conversation Variability for Speaker Diarization.
Stephen Shum, Najim Dehak, Ekapol Chuangsuwanich, Douglas A. Reynolds, James R. Glass
2011Exploiting Phone-Class Specific Landmarks for Refinement of Segment Boundaries in TTS Databases.
Vijayaditya Peddinti, Kishore Prahallad
2011Exploring Bessel Features for Detection of Glottal Closure Instants.
Chetana Prakash, N. Dhananjaya, Suryakanth V. Gangashetty
2011Extending Audio Notetaker to Browse WebASR Transcriptions.
Roger C. F. Tucker, Dan Fry, Vincent Wan, Stuart N. Wrigley, Thomas Hain
2011Extending the Task of Diarization to Speaker Attribution.
Houman Ghaemmaghami, David Dean, Robbie Vogt, Sridha Sridharan
2011Extraction of Narrative Recall Patterns for Neuropsychological Assessment.
Emily Tucker Prud'hommeaux, Brian Roark
2011Factor Analysis Back Ends for MLLR Transforms in Speaker Recognition.
Nicolas Scheffer, Yun Lei, Luciana Ferrer
2011Factored MLLR Adaptation for Singing Voice Generation.
June Sig Sung, Doo Hwa Hong, Shin Jae Kang, Nam Soo Kim
2011Factored Translation Models for Improving a Speech into Sign Language Translation System.
Verónica López-Ludeña, Rubén San Segundo, Ricardo de Córdoba, Javier Ferreiros, Juan Manuel Montero, José Manuel Pardo
2011Fast and Simple Iterative Algorithm of Lp-Norm Minimization for Under-Determined Speech Separation.
Yasuharu Hirasawa, Naoki Yasuraoka, Toru Takahashi, Tetsuya Ogata, Hiroshi G. Okuno
2011Feature Combination Approaches for Discriminative Language Models.
Ebru Arisoy, Bhuvana Ramabhadran, Hong-Kwang Jeff Kuo
2011Feature Compensation for Speech Recognition in Severely Adverse Environments Due to Background Noise and Channel Distortion.
Wooil Kim, John H. L. Hansen
2011Feature Extraction Assessment for an Acoustic-Event Classification Task Using the Entropy Triangle.
David Mejía-Navarrete, Ascensión Gallardo-Antolín, Carmen Peláez-Moreno, Francisco J. Valverde-Albacete
2011Feature Frame Stacking in RNN-Based Tandem ASR Systems - Learned vs. Predefined Context.
Martin Wöllmer, Björn W. Schuller, Gerhard Rigoll
2011Feature Normalization Using Structured Full Transforms for Robust Speech Recognition.
Xiong Xiao, Jinyu Li, Chng Eng Siong, Haizhou Li
2011Feature-Space Transform Tying in Unified Acoustic-Articulatory Modelling for Articulatory Control of HMM-Based Speech Synthesis.
Zhen-Hua Ling, Korin Richmond, Junichi Yamagishi
2011Final /t/ Reduction in Dutch Past-Participles: The Role of Word Predictability and Morphological Decomposability.
Iris Hanique, Mirjam Ernestus
2011Fluency Changes with General Progress in L2 Proficiency.
Jared Bernstein, Jian Cheng, Masanori Suzuki
2011Formant Maps in Hungarian Vowels - Online Data Inventory for Research, and Education.
Kálmán Abari, Zsuzsanna Zsófia Rácz, Gábor Olaszy
2011Formant-Controlled HMM-Based Speech Synthesis.
Ming Lei, Junichi Yamagishi, Korin Richmond, Zhen-Hua Ling, Simon King, Li-Rong Dai
2011Frame-Level Vocal Effort Likelihood Space Modeling for Improved Whisper-Island Detection.
Chi Zhang, John H. L. Hansen
2011Frequency-Domain Representation of Source-Filter Coupling and its Effect in the Production of Voice.
Tokihiko Kaburagi
2011Frequency-Warped and Stabilized Time-Varying Cepstral Coefficients.
Trond Skogstad, Torbjørn Svendsen
2011From Interview to News Text: A Study of Taiwan TV Political Interviews in Newspaper Reports.
Chin-Chih Chiang
2011From Single-Call to Multi-Call Quality: A Study on Long-Term Quality Integration in Audio-Visual Speech Communication.
Sebastian Möller, Chihuy Bang, Teele Tamme, Markus Vaalgamaa, Benjamin Weiss
2011From Teleoperated Androids to Cellphones as Surrogates.
Hiroshi Ishiguro
2011Front-End Compensation Methods for LVCSR Under Lombard Effect.
Hynek Boril, Frantisek Grézl, John H. L. Hansen
2011Fundamental Frequency Estimation Using Modified Higher Order Moments and Multiple Windows.
Alipah Pawi, Saeed Vaseghi, Ben Milner, Seyed Ghorshi
2011Fusing Multiple Confidence Measures for Chinese Spoken Term Detection.
Zejun Ma, Xiaorui Wang, Bo Xu
2011GMM-Based Missing-Feature Reconstruction on Multi-Frame Windows.
Ulpu Remes, Yoshihiko Nankaku, Keiichi Tokuda
2011Gaussian Process Experts for Voice Conversion.
Nicholas Pilkington, Heiga Zen, Mark J. F. Gales
2011Generalized Baum-Welch Algorithm and its Implication to a New Extended Baum-Welch Algorithm.
Roger Hsiao, Tanja Schultz
2011Generalized Method for Solving the Permutation Problem in Frequency-Domain Blind Source Separation of Convolved Speech Signals.
Auxiliadora Sarmiento, Iván Durán-Díaz, Sergio Cruces, Pablo Aguilera
2011Generalized Variable Parameter HMMs for Noise Robust Speech Recognition.
Ning Cheng, Xunying Liu, Lan Wang
2011Generalized-Log Spectral Mean Normalization for Speech Recognition.
Hilman Ferdinandus Pardede, Koichi Shinoda
2011Generating Animated Pronunciation from Speech Through Articulatory Feature Extraction.
Yurie Iribe, Silasak Manosavanh, Kouichi Katsurada, Ryoko Hayashi, Chunyue Zhu, Tsuneo Nitta
2011Genre Categorization and Modeling for Broadcast Speech Transcription.
Qingqing Zhang, Lori Lamel, Jean-Luc Gauvain
2011Gesture Design of Hand-to-Speech Converter Derived from Speech-to-Hand Converter Based on Probabilistic Integration Model.
Aki Kunikoshi, Yu Qiao, Daisuke Saito, Nobuaki Minematsu, Keikichi Hirose
2011Globality-Locality Consistent Discriminant Analysis for Phone Classification.
Heyun Huang, Yang Liu, Jort F. Gemmeke, Louis ten Bosch, Bert Cranen, Lou Boves
2011GorUp: An Ontology-Driven Audio Information Retrieval System that Suits the Requirements of Under-Resourced Languages.
Nora Barroso, Karmele López de Ipiña, Aitzol Ezeiza, Carmen Hernández, Nerea Ezeiza, Odei Barroso, Unai Susperregi, Simeon Barroso
2011Grapheme-Based Automatic Speech Recognition Using KL-HMM.
Mathew Magimai-Doss, Ramya Rasipuram, Guillermo Aradilla, Hervé Bourlard
2011Grapheme-to-Phoneme Conversion Using Conditional Random Fields.
Irina Illina, Dominique Fohr, Denis Jouvet
2011Graphone Model Interpolation and Arabic Pronunciation Generation.
T. Li, Philip C. Woodland, Frank Diehl, Mark J. F. Gales
2011Growing a Spoken Language Interface on Amazon Mechanical Turk.
Ian McGraw, James R. Glass, Stephanie Seneff
2011HMM-Based Emphatic Speech Synthesis Using Unsupervised Context Labeling.
Yu Maeno, Takashi Nose, Takao Kobayashi, Yusuke Ijima, Hideharu Nakajima, Hideyuki Mizuno, Osamu Yoshioka
2011Harmonic Structure Transform for Speaker Recognition.
Kornel Laskowski, Qin Jin
2011Hidden Boosted MMI and Hierarchical State Posterior Feature for Automatic Speech Recognition Based on Hidden Conditional Neural Fields.
Yasuhisa Fujii, Kazumasa Yamamoto, Seiichi Nakagawa
2011Hierarchical Audio Segmentation with HMM and Factor Analysis in Broadcast News Domain.
Diego Castán, Carlos Vaquero, Alfonso Ortega, David Martínez González, Jesús Antonio Villalba López, Eduardo Lleida
2011Hierarchical Stress Modeling in Mandarin Text-to-Speech.
Ya Li, Jianhua Tao, Xiaoying Xu
2011Hierarchical Tandem Features for ASR in Mandarin.
Joel Pinto, Mathew Magimai-Doss, Hervé Bourlard
2011How Realistic is Artificially Added Noise?
Thomas Winkler
2011Hybrid Language Models Using Mixed Types of Sub-Lexical Units for Open Vocabulary German LVCSR.
M. Ali Basha Shaik, Amr El-Desoky Mousa, Ralf Schlüter, Hermann Ney
2011Hybrid Speech Recognition for Voice Search: A Comparative Study.
Evandro B. Gouvêa
2011I3A Language Recognition System for Albayzin 2010 LRE.
David Martínez González, Jesús Antonio Villalba López, Antonio Miguel, Alfonso Ortega, Eduardo Lleida
2011Identifying Agreement/Disagreement in Conversational Speech: A Cross-Lingual Study.
Wen Wang, Kristin Precoda, Colleen Richey, Geoffrey Raymond
2011Identifying Regions of Non-Modal Phonation Using Features of the Wavelet Transform.
John Kane, Christer Gobl
2011Image Processing Filters for Line Detection-Based Spoken Term Detection.
Kazuyuki Noritake, Hiroaki Nanjo, Takehiko Yoshimi
2011Image Representation of the Subband Power Distribution for Robust Sound Classification.
Jonathan William Dennis, Tran Huy Dat, Haizhou Li
2011Impact of Different Feedback Mechanisms in EMG-Based Speech Recognition.
Christian Herff, Matthias Janke, Michael Wand, Tanja Schultz
2011Impact of Speaker Variability on Speech Perception in Non-Native Listeners.
Wim A. van Dommelen, Valérie Hazan
2011Implicit Segmentation in Two-Wire Speaker Recognition.
Yosef A. Solewicz, Hagai Aronowitz
2011Improved Acoustic Characterization of Breathy and Whispery Voices.
Carlos Toshinori Ishi, Hiroshi Ishiguro, Norihiro Hagita
2011Improved Acoustic Feature Combination for LVCSR by Neural Networks.
Christian Plahl, Ralf Schlüter, Hermann Ney
2011Improved Bottleneck Features Using Pretrained Deep Neural Networks.
Dong Yu, Michael L. Seltzer
2011Improved Classification of Speaking Styles for Mental Health Monitoring Using Phoneme Dynamics.
Keng-hao Chang, Howard Lei, John F. Canny
2011Improved HNM-Based Vocoder for Statistical Synthesizers.
Daniel Erro, Iñaki Sainz, Eva Navas, Inma Hernáez
2011Improved Overlapped Speech Handling for Speaker Diarization.
Kofi Boakye, Oriol Vinyals, Gerald Friedland
2011Improved Quality for Conversational VoIP Using Path Diversity.
Qipeng Gong, Peter Kabal
2011Improved Spoken Query Transcription Using Co-Occurrence Information.
Jonathan Mamou, Abhinav Sethy, Bhuvana Ramabhadran, Ron Hoory, Paul Vozila
2011Improved Tonal Language Speech Recognition by Integrating Spectro-Temporal Evidence and Pitch Information with Properly Chosen Tonal Acoustic Units.
Shang-wen Li, Yow-Bang Wang, Liang-Che Sun, Lin-Shan Lee
2011Improved a posteriori Speech Presence Probability Estimation Based on Cepstro-Temporal Smoothing and Time-Frequency Correlation.
Chao Li, Wenju Liu
2011Improvement of Segmental Mispronunciation Detection with Prior Knowledge Extracted from Large L2 Speech Corpus.
Dean Luo, Xuesong Yang, Lan Wang
2011Improvements in Speaker Characterization Using Spectral Subband Energy Based on Harmonic plus Noise Model.
Yanhua Long, Zhi-Jie Yan, Frank K. Soong, Li-Rong Dai, Wu Guo
2011Improvements of a Dual-Input DBN for Noise Robust ASR.
Yang Sun, Jort F. Gemmeke, Bert Cranen, Louis ten Bosch, Lou Boves
2011Improving LVCSR System Combination Using Neural Network Language Model Cross Adaptation.
Xunying Liu, Mark J. F. Gales, Philip C. Woodland
2011Improving Multiband Position-Pitch Algorithm for Localization and Tracking of Multiple Concurrent Speakers by Using a Frequency Selective Criterion.
Tania Habib, Harald Romsdorfer
2011Improving Non-Native ASR Through Stochastic Multilingual Phoneme Space Transformations.
David Imseng, Hervé Bourlard, John Dines, Philip N. Garner, Mathew Magimai-Doss
2011In Search of Cues Discriminating West-African Accents in French.
Philippe Boula de Mareüil, Jean-Luc Rouas, Manuela Yapomo
2011Incorporating Regional Information to Enhance MAP-Based Stochastic Feature Compensation for Robust Speech Recognition.
Yu Tsao, Paul R. Dixon, Chiori Hori, Hisashi Kawai
2011Incorporating Speech Recognition Engine into an Intelligent Assistive Reading System for Dyslexic Students.
Theologos Athanaselis, Stelios Bakamidis, Ioannis Dologlou, Evmorfia N. Argyriou, Antonios Symvonis
2011Incremental Learning and Forgetting in Stochastic Turn-Taking Models.
Kornel Laskowski, Jens Edlund, Mattias Heldner
2011Individual Error Minimization Learning Framework and its Applications to Speech Recognition and Utterance Verification.
Sunghwan Shin, Ho-Young Jung, Biing-Hwang Juang
2011Information Bottleneck Features for HMM/GMM Speaker Diarization of Meetings Recordings.
Sree Harsha Yella, Fabio Valente
2011Instantaneous Speaker Adaptation Through Selection and Combination of fMLLR Transformation Matrices.
Diego Giuliani, Fabio Brugnara
2011Integrated Online Speaker Clustering and Adaptation.
Catherine Breslin, K. K. Chin, Mark J. F. Gales, Kate M. Knill
2011Integrating Recent MLP Feature Extraction Techniques into TRAP Architecture.
Frantisek Grézl, Martin Karafiát
2011Interactional Style Detection for Versatile Dialogue Response Using Prosodic and Semantic Features.
Wei-Bin Liang, Chung-Hsien Wu, Chih-Hung Wang, Jhing-Fa Wang
2011Intermediate-State HMMs to Capture Continuously-Changing Signal Features.
Gustav Eje Henter, W. Bastiaan Kleijn
2011Intersession Compensation and Scoring Methods in the i-vectors Space for Speaker Recognition.
Pierre-Michel Bousquet, Driss Matrouf, Jean-François Bonastre
2011Intonation Conversion from Neutral to Expressive Speech.
Christophe Veaux, Xavier Rodet
2011Intonation of Left Dislocated Topics in Modern Greek.
David Le Gac, Hiyon Yoo
2011Intoxicated Speech Detection by Fusion of Speaker Normalized Hierarchical Features and GMM Supervectors.
Daniel Bone, Matthew Black, Ming Li, Angeliki Metallinou, Sungbok Lee, Shrikanth S. Narayanan
2011Intoxication Detection Using Phonetic, Phonotactic and Prosodic Cues.
Fadi Biadsy, William Yang Wang, Andrew Rosenberg, Julia Hirschberg
2011Intra-, Inter-, and Cross-Cultural Classification of Vocal Affect.
Daniel Neiberg, Petri Laukka, Hillary Anger Elfenbein
2011Inverse Filtering Based Harmonic Plus Noise Excitation Model for HMM-Based Speech Synthesis.
Zhengqi Wen, Jianhua Tao
2011Investigating Robustness of Spectral Moments on Normal- and High-Effort Speech.
Frederike Gottsmann, Corinna Harwardt
2011Investigating the Effect of Number of Interlocutors on the Quality of Experience for Multi-Party Audio Conferencing.
Janto Skowronek, Alexander Raake
2011Investigating the Stability of Intergestural Timing Relations.
Juraj Simko, Fred Cummins, Stefan Benus
2011Investigation of Cross-Show Speaker Diarization.
Qian Yang, Qin Jin, Tanja Schultz
2011Investigation of Spontaneous Speech Characterization Applied to Speaker Role Recognition.
Richard Dufour, Yannick Estève, Paul Deléglise
2011Investigations on Speaking Mode Discrepancies in EMG-Based Speech Recognition.
Michael Wand, Matthias Janke, Tanja Schultz
2011Is the Perception of Voice Quality Language-Dependant? A Comparison of French and Italian Listeners and Dysphonic Speakers.
Alain Ghio, Frédérique Weisz, Giovanna Baracca, Giovanna Cantarella, Danièle Robert, Virginie Woisard, Franco Fussi, Antoine Giovanni
2011Italian in the No-Man's Land Between Stress-Timing and Syllable-Timing? Speakers are More Stress-Timed than Listeners.
Bettina Braun, Sabine Geiselmann
2011Iterative Improvement of Speaker Segmentation in a Noisy Environment Using High-Level Knowledge.
Qiang Huang, Stephen J. Cox
2011Java Visual Speech Components for Rapid Application Development of GUI Based Speech Processing Applications.
Stefan Steidl, Korbinian Riedhammer, Tobias Bocklet, Florian Hönig, Elmar Nöth
2011Jaw Movement in Vowels and Liquids Forming the Syllable Nucleus.
Stefan Benus, Marianne Pouplier
2011Joint Application of Speech and Speaker Recognition for Automation and Security in Smart Home.
Kong-Aik Lee, Anthony Larcher, Helen Thai, Bin Ma, Haizhou Li
2011Joint Bilinear Transformation Space Based Maximum a posteriori Linear Regression Adaptation Using Prior with Variance Function.
Hwa Jeon Song, Yunkeun Lee, Hyung Soon Kim
2011Joint Robust Voicing Detection and Pitch Estimation Based on Residual Harmonics.
Thomas Drugman, Abeer Alwan
2011Joint Target and Join Cost Weight Training for Unit Selection Synthesis.
Lukas Latacz, Wesley Mattheyses, Werner Verhelst
2011Kernel Alignment Maximization for Speaker Recognition Based on High-Level Features.
Szymon Drgas, Adam Dabrowski
2011Kernel Models for Affective Lexicon Creation.
Nikos Malandrakis, Alexandros Potamianos, Elias Iosif, Shrikanth S. Narayanan
2011Kernel PCA for Speech Enhancement.
Christina Leitner, Franz Pernkopf, Gernot Kubin
2011Kernel Partial Least Squares for Speaker Recognition.
Balaji Vasan Srinivasan, Daniel Garcia-Romero, Dmitry N. Zotkin, Ramani Duraiswami
2011Keyphrase Cloud Generation of Broadcast News.
Luís Marujo, Márcio Viveiros, João Paulo Neto
2011Kullback-Leibler Divergence-Based ASR Training Data Selection.
Evandro Gouvêa, Marelie H. Davel
2011L1/L2 Perception of Lexical Stress with F0 Peak-Delay: Effect of an Extra Syllable Added.
Shinichi Tokuma, Yi Xu
2011LP Residual Features for Robust, Privacy-Sensitive Speaker Diarization.
Sree Hari Krishnan Parthasarathi, Hervé Bourlard, Daniel Gatica-Perez
2011Language Disorders: Viewpoints on a Complex Object.
Gabriele Miceli
2011Language Identification for Text Chats.
Vesa Siivola, Bryan L. Pellom, Meagan Sills
2011Language Model Expansion Using Webdata for Spoken Document Retrieval.
Ryo Masumura, Seongjun Hahm, Akinori Ito
2011Language Recognition in iVectors Space.
David Martínez González, Oldrich Plchot, Lukás Burget, Ondrej Glembek, Pavel Matejka
2011Language Recognition via i-vectors and Dimensionality Reduction.
Najim Dehak, Pedro A. Torres-Carrasquillo, Douglas A. Reynolds, Réda Dehak
2011Language-Independent Socio-Emotional Role Recognition in the AMI Meetings Corpus.
Fabio Valente, Alessandro Vinciarelli
2011Large Margin - Minimum Classification Error Using Sum of Shifted Sigmoids as the Loss Function.
Madhavi Vedula Ratnagiri, Biing-Hwang Juang, Lawrence R. Rabiner
2011Large Vocabulary SOUL Neural Network Language Models.
Hai Son Le, Ilya Oparin, Abdelkhalek Messaoudi, Alexandre Allauzen, Jean-Luc Gauvain, François Yvon
2011Large-Scale Experiments on Data-Driven Design of Commercial Spoken Dialog Systems.
David Suendermann, Jackson Liscombe, Jonathan Bloom, Grace Li, Roberto Pieraccini
2011Large-Scale Subjective Evaluations of Speech Rate Control Methods for HMM-Based Speech Synthesizers.
Tsuneo Kato, Makoto Yamada, Nobuyuki Nishizawa, Keiichiro Oura, Keiichi Tokuda
2011Laryngealization and Breathiness in Persian.
Vahid Sadeghi
2011Latent Topic Modeling for Audio Corpus Summarization.
Timothy J. Hazen
2011Lattice Based Discriminative Model Combination Using Automatically Induced Phonetic Contexts.
Hao Huang, Bing Hu Li
2011Lattice-Based Risk Minimization Training for Unsupervised Language Model Adaptation.
Akio Kobayashi, Takahiro Oku, Shinichi Homma, Toru Imai, Seiichi Nakagawa
2011Learning Influences from Word Use in Polylogue.
Tomoharu Iwata, Shinji Watanabe
2011Learning New Acoustic Events in an HMM-Based System Using MAP Adaptation.
Jürgen T. Geiger, Mohamed Anouar Lakhal, Björn W. Schuller, Gerhard Rigoll
2011Learning Place-Names from Spoken Utterances and Localization Results by Mobile Robot.
Ryo Taguchi, Yuji Yamada, Koosuke Hattori, Taizo Umezaki, Masahiro Hoguro, Naoto Iwahashi, Kotaro Funakoshi, Mikio Nakano
2011Learning Score Structure from Spoken Language for a Tennis Game.
Qiang Huang, Stephen J. Cox
2011Learning Weighted Entity Lists from Web Click Logs for Spoken Language Understanding.
Dustin Hillard, Asli Celikyilmaz, Dilek Hakkani-Tür, Gökhan Tür
2011Learning from Mistakes: Expanding Pronunciation Lexicons Using Word Recognition Errors.
Sravana Reddy, Evandro B. Gouvêa
2011Leja Ordering LSFs for Accurate Estimation of Predictor Coefficients.
Christian Fischer Pedersen
2011Let's All Speak Together! Exploring the Impact of Various Languages on the Comprehension of Speech in Multi-Linguistic Babble.
Aurore Gautreau, Michel Hoen, Fanny Meunier
2011Letter-to-Phoneme Conversion Based on Two-Stage Neural Network Focusing on Letter and Phoneme Contexts.
Kheang Seng, Yurie Iribe, Tsuneo Nitta
2011Leveraging Relevance Cues for Improved Spoken Document Retrieval.
Pei-Ning Chen, Kuan-Yu Chen, Berlin Chen
2011Linear Dynamic Models for Voice Activity Detection.
Kannu Mehta, Chau Khoa Pham, Chng Eng Siong
2011Log-Linear Optimization of Second-Order Polynomial Features with Subsequent Dimension Reduction for Speech Recognition.
Muhammad Ali Tahir, Ralf Schlüter, Hermann Ney
2011Long Term Average Speech Spectra in Yolngu Matha and Pitjantjatjara Speaking Females and Males.
Hywel Stoakes, Andrew Butcher, Janet Fletcher, Marija Tabain
2011Long-Distance Rhythmic Dependencies and their Application to Automatic Language Identification.
Joseph Tepperman, Emily Nava
2011Lossless Value Directed Compression of Complex User Goal States for Statistical Spoken Dialogue Systems.
Paul A. Crook, Oliver Lemon
2011Low and High, Short and Long by Crook or by Hook?
Oliver Niebuhr, Astrid Wolf
2011Low-Frequency Bandwidth Extension of Telephone Speech Using Sinusoidal Synthesis and Gaussian Mixture Model.
Hannu Pulakka, Ulpu Remes, Santeri Yrttiaho, Kalle J. Palomäki, Mikko Kurimo, Paavo Alku
2011Making an Automatic Speech Recognition Service Freely Available on the Web.
Stuart N. Wrigley, Thomas Hain
2011Mandarin Word-Character Hybrid-Input Neural Network Language Model.
Moonyoung Kang, Tim Ng, Long Nguyen
2011Mapping Sparse Representation to State Likelihoods in Noise-Robust Automatic Speech Recognition.
Katariina Mahkonen, Antti Hurmalainen, Tuomas Virtanen, Jort F. Gemmeke
2011Matrix-Variate Distribution of Training Models for Robust Speaker Adaptation.
Yongwon Jeong, Young Kuk Kim
2011Maximum Confidence Measure Based Interaural Phase Difference Estimation for Noise Masking in Dual-Microphone Robust Speech Recognition.
Hsien-Cheng Liao, Yuan-Fu Liao, Chin-Hui Lee
2011Maximum Entropy Based Data Selection for Speaker Recognition.
Chien-Lin Huang, Bin Ma
2011Maximum Likelihood i-vector Space Using PCA for Speaker Verification.
Zhenchun Lei, Yingchun Yang
2011Maximum a posteriori Estimation of Noise from Non-Acoustic Reference Signals in Very Low Signal-to-Noise Ratio Environments.
Ben Milner
2011Measurement of Objective Intelligibility of Japanese Accented English Using ERJ (English Read by Japanese) Database.
Nobuaki Minematsu, Koji Okabe, Keisuke Ogaki, Keikichi Hirose
2011Measuring Acoustic-Prosodic Entrainment with Respect to Multiple Levels and Dimensions.
Rivka Levitan, Julia Hirschberg
2011Measuring Final Lengthening for Speaker-Change Prediction.
Anna Hjalmarsson, Kornel Laskowski
2011Measuring Speakers' Similarity in Speech by Means of Prosodic Cues: Methods and Potential.
Céline De Looze, Stéphane Rauzy
2011Memory-Based Approximation of the Gaussian Mixture Model Framework for Bandwidth Extension of Narrowband Speech.
Amr H. Nour-Eldin, Peter Kabal
2011Method for Speech Inversion with Large Scale Statistical Evaluation.
Heikki Rasilo, Unto K. Laine, Okko Johannes Räsänen, Toomas Altosaar
2011Minimum Classification Error Based Spectro-Temporal Feature Extraction for Robust Audio Classification.
Yuan-Fu Liao, Chia-Hsing Lin, We-Der Fang
2011Mixture of Auto-Associative Neural Networks for Speaker Verification.
Garimella S. V. S. Sivaram, Samuel Thomas, Hynek Hermansky
2011Mixture of PLDA Models in i-vector Space for Gender-Independent Speaker Recognition.
Mohammed Senoussaoui, Patrick Kenny, Niko Brümmer, Edward de Villiers, Pierre Dumouchel
2011Modality Selection and Perceived Mental Effort in a Mobile Application.
Stefan Schaffer, Benjamin Jöckel, Ina Wechsung, Robert Schleicher, Sebastian Möller
2011Model Adaptation for Automatic Speech Recognition Based on Multiple Time Scale Evolution.
Shinji Watanabe, Atsushi Nakamura, Biing-Hwang Juang
2011Modeling Broad Context for Tone Recognition with Conditional Random Fields.
Siwei Wang, Gina-Anne Levow
2011Modeling Speaker Personality Using Voice.
Tim Polzehl, Sebastian Möller, Florian Metze
2011Modelling Novelty Preference in Word Learning.
Maarten Versteegh, Louis ten Bosch, Lou Boves
2011Modulation Spectrum Analysis for Recognition of Reverberant Speech.
Sri Harish Reddy Mallidi, Sriram Ganapathy, Hynek Hermansky
2011Monaural Azimuth Localization Using Spectral Dynamics of Speech.
Roi Kliper, Hendrik Kayser, Daphna Weinshall, Israel Nelken, Jörn Anemüller
2011Monaural Sound Localization.
Anna Katharina Fuchs, Christian Feldbauer, Michael Stark
2011Monaural Speech Separation Based on a 2D Processing and Harmonic Analysis.
Azam Rabiee, Saeed Setayeshi, Soo-Young Lee
2011Monaural Voiced Speech Segregation Based on Pitch and Comb Filter.
Xueliang Zhang, Wenju Liu
2011Morpheme Based Factored Language Models for German LVCSR.
Amr El-Desoky Mousa, M. Ali Basha Shaik, Ralf Schlüter, Hermann Ney
2011Morpheme Conversion for Connecting Speech Recognizer and Language Analyzers in Unsegmented Languages.
Kenji Imamura, Tomoko Izumi, Kugatsu Sadamitsu, Kuniko Saito, Satoshi Kobashikawa, Hirokazu Masataki
2011Morphological Variation in the Adult Vocal Tract: A Modeling Study of its Potential Acoustic Impact.
Adam C. Lammert, Michael I. Proctor, Athanasios Katsamanis, Shrikanth S. Narayanan
2011Mtrans: A Multi-Channel, Multi-Tier Speech Annotation Tool.
Julián Villegas, Martin Cooke, Vincent Aubanel, Marco Aldo Piccolino Boniforti
2011Multi-Accent Speech Recognition of Afrikaans, Black and White Varieties of South African English.
Herman Kamper, Thomas Niesler
2011Multi-Channel Voice Activity Detection Based on Conic Constraints.
Gibak Kim
2011Multi-Party Speech Recovery Exploiting Structured Sparsity Models.
Afsaneh Asaei, Mohammad Javad Taghizadeh, Hervé Bourlard, Volkan Cevher
2011Multi-Sensor Voice Activity Detection Based on Multiple Observation Hypothesis Testing.
Theodore Petsatodis, Fotios Talantzis, Christos Boukis, Zheng-Hua Tan, Ramjee Prasad
2011Multi-Speaker Modeling with Shared Prior Distributions and Model Structures for Bayesian Speech Synthesis.
Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda
2011Multi-Task Learning for Spoken Language Understanding with Shared Slots.
Xiao Li, Ye-Yi Wang, Gökhan Tür
2011Multi-View Approach for Speaker Turn Role Labeling in TV Broadcast News Shows.
Géraldine Damnati, Delphine Charlet
2011Multipulse Sequences for Residual Signal Modeling.
Ranniery Maia, Heiga Zen, Kate M. Knill, Mark J. F. Gales, Sabine Buchholz
2011Multistream Bandpass Modulation Features for Robust Speech Recognition.
Sridhar Krishna Nemala, Kailash Patil, Mounya Elhilali
2011N-Grams for Conditional Random Fields or a Failure-Transition(f) Posterior for Acyclic FSTs.
Patrick Lehnen, Stefan Hahn, Hermann Ney
2011NeMo: A Platform for Multilingual News Monitoring.
Christian Girardi, Roberto Gretter, Daniele Falavigna, Fabio Brugnara, Diego Giuliani, Marcello Federico
2011Nearest Neighbors with Learned Distances for Phonetic Frame Classification.
John Labiak, Karen Livescu
2011Neural Representations of Word Meanings.
Tom M. Mitchell
2011Neutral to Target Emotion Conversion Using Source and Suprasegmental Information.
D. Govind, S. R. Mahadeva Prasanna, Bayya Yegnanarayana
2011New Developments in Joint Factor Analysis for Speaker Verification.
Hagai Aronowitz, Oren Barkan
2011New Developments in Voice Biometrics for User Authentication.
Hagai Aronowitz, Ron Hoory, Jason W. Pelecanos, David Nahamoo
2011New Feature Parameters for Pronunciation Evaluation in English Presentations at International Conferences.
Hiroshi Kibishi, Seiichi Nakagawa
2011New Methods for Template Selection and Compression in Continuous Speech Recognition.
Xie Sun, Yunxin Zhao
2011Noise Robust Feature Extraction Based on Extended Weighted Linear Prediction in LVCSR.
Sami Keronen, Jouni Pohjalainen, Paavo Alku, Mikko Kurimo
2011Noise Robust Speaker-Independent Speech Recognition with Invariant-Integration Features Using Power-Bias Subtraction.
Florian Müller, Alfred Mertins
2011Novel VTEO Based Mel Cepstral Features for Classification of Normal and Pathological Voices.
Hemant A. Patil, Pallavi N. Baljekar
2011OOV Detection and Recovery Using Hybrid Models with Different Fragments.
Long Qin, Ming Sun, Alexander I. Rudnicky
2011OOV Sensitive Named-Entity Recognition in Speech.
Carolina Parada, Mark Dredze, Frederick Jelinek
2011Objective Intelligibility Prediction of Speech by Combining Correlation and Distortion Based Techniques.
Angel M. Gomez, Belinda Schwerin, Kuldip K. Paliwal
2011Off-Topic Detection in Automated Speech Assessment Applications.
Jian Cheng, Jianqiang Shen
2011On Building and Evaluating a Broadcast-News Audio Segmentation System.
Taras Butko
2011On Development of Consistently Punctuated Speech Corpora.
Jáchym Kolár, Lori Lamel
2011On Initial Seed Selection for Frequency Domain Blind Speech Separation.
Dang Hai Tran Vu, Reinhold Haeb-Umbach
2011On Mispronunciation Lexicon Generation Using Joint-Sequence Multigrams in Computer-Aided Pronunciation Training (CAPT).
Xiaojun Qian, Helen M. Meng, Frank K. Soong
2011On Noise Robust Voice Activity Detection.
Tomas Dekens, Werner Verhelst
2011On Noise Tracking for Noise Floor Estimation.
Mahdi Triki
2011On the Effectiveness of Statistical Modeling Based Template Matching Approach for Continuous Speech Recognition.
Xie Sun, Xin Chen, Yunxin Zhao
2011On the Estimation of Discount Parameters for Language Model Smoothing.
Martin Sundermeyer, Ralf Schlüter, Hermann Ney
2011On the Relationship Between Perceived Accentedness, Acoustic Similarity, and Processing Difficulty in Foreign-Accented Speech.
Marijt J. Witteman, Andrea Weber, James M. McQueen
2011On the Use of Extended Context for HMM-Based Spontaneous Conversational Speech Synthesis.
Tomoki Koriyama, Takashi Nose, Takao Kobayashi
2011On the Use of Lattices of Time-Synchronous Cross-Decoder Phone Co-Occurrences in a SVM-Phonotactic Language Recognition System.
Amparo Varona, Mikel Peñagarikano, Luis Javier Rodríguez, Germán Bordel
2011On the Use of Linguistic Features in an Automatic System for Speech Analytics of Telephone Conversations.
Benjamin Maza, Marc El-Bèze, Georges Linarès, Renato De Mori
2011On the Use of Multimodal Cues for the Prediction of Degrees of Involvement in Spontaneous Conversation.
Catharine Oertel, Stefan Scherer, Nick Campbell
2011On the Use of the Rhythmogram for Automatic Syllabic Prominence Detection.
Bogdan Ludusan, Antonio Origlia, Francesco Cutugno
2011On-Line Language Model Biasing for Multi-Pass Automatic Speech Recognition.
Sankaranarayanan Ananthakrishnan, Stavros Tsakalidis, Rohit Prasad, Premkumar Natarajan
2011One-to-Many Voice Conversion Based on Tensor Representation of Speaker Space.
Daisuke Saito, Keisuke Yamamoto, Nobuaki Minematsu, Keikichi Hirose
2011Online Pattern Learning for Non-Negative Convolutive Sparse Coding.
Dong Wang, Ravichander Vipperla, Nicholas W. D. Evans
2011Online Speaker Adaptation with Pre-Computed FMLLR Transformations.
Volker Fischer, Siegfried Kunzmann
2011Online Speech Activity Detection in Broadcast News.
Chao Gao, Guruprasad Saikumar, Saurabh Khanwalkar, Avi Herscovici, Anoop Kumar, Amit Srivastava, Premkumar Natarajan
2011Open Source Multi-Language Audio Database for Spoken Language Processing Applications.
Stephen A. Zahorian, Jiang Wu, Montri Karnjanadecha, Chandra Sekhar Vootkuri, Brian Wong, Andrew Hwang, Eldar Tokhtamyshev
2011Open Source Voice Creation Toolkit for the MARY TTS Platform.
Marc Schröder, Marcela Charfuelan, Sathish Pammi, Ingmar Steiner
2011Optimal Models of Prosodic Prominence Using the Bayesian Information Criterion.
Tim Mahrt, Jui-Ting Huang, Yoonsook Mo, Margaret M. Fleck, Mark Hasegawa-Johnson, Jennifer Cole
2011Optimal Selection of Limited Vocabulary Speech Corpora.
Hui Lin, Jeff A. Bilmes
2011Optimal Syllabic Rates and Processing Units in Perceiving Mandarin Spoken Sentences.
Guangting Mai, Gang Peng
2011Optimization of the Gaussian Mixture Model Evaluation on GPU.
Jan Vanek, Jan Trmal, Josef V. Psutka, Josef Psutka
2011Optimized Feature Extraction and HMMs in Subword Detectors.
Alfonso M. Canterla, Magne Hallstein Johnsen
2011Optimizing Situated Dialogue Management in Unknown Environments.
Heriberto Cuayáhuitl, Nina Dethlefs
2011PLDA-Based Clustering for Speaker Diarization of Broadcast Streams.
Jan Silovský, Jan Prazak, Petr Cerva, Jindrich Zdánský, Jan Nouza
2011Painless WFST Cascade Construction for LVCSR - Transducersaurus.
Josef R. Novak, Nobuaki Minematsu, Keikichi Hirose
2011Parallel and Hierarchical Decision Making for Sparse Coding in Speech Recognition.
Dong Wang, Ravichander Vipperla, Nicholas W. D. Evans
2011Parallels in Infants' Attention to Speech Articulation and to Physical Changes in Speech-Unrelated Objects.
Eeva Klintfors, Ellen Marklund, Francisco Lacerda
2011Parametrising Degree of Articulator Movement from Dynamic MRI Data.
Zeynab Raeesy, Ladan Baghai-Ravary, John S. Coleman
2011Partitioning of Two-Speaker Conversation Datasets.
Carlos Vaquero, Alfonso Ortega, Eduardo Lleida
2011Perception of Alcoholic Intoxication in Speech.
Florian Schiel
2011Perceptual Improvement of a Two-Stage Algorithm for Speech Dereverberation.
Thiago de M. Prego, Amaro A. de Lima, Sergio L. Netto
2011Perceptual Learning of Liquids.
Odette Scharenborg, Holger Mitterer, James M. McQueen
2011Perceptual Quality Dimensions of Text-to-Speech Systems.
Florian Hinterleitner, Sebastian Möller, Christoph Norrenbrock, Ulrich Heute
2011Perceptual Representation of Consonant Sounds in Thai.
Charturong Tantibundhit, Chutamanee Onsuwan, Tanawan Saimai, Nantaporn Saimai, Sumonmas Thatphithakkul, Patcharika Chootrakool, Krit Kosawat, Nattanun Thatphithakkul
2011Perceptual Sensitivity to Dialectal and Generational Variations in Vowels.
Robert Allen Fox, Ewa Jacewicz
2011Perceptual Sensitivity to Prenuclear and Nuclear Intonational Patterns.
Tomás Dubeda
2011Perceptual Training of Vowel Length Contrast of Japanese by L2 Listeners: Effects of an Isolated Word versus a Word Embedded in Sentences.
Mee Sonu, Keiichi Tajima, Hiroaki Kato, Yoshinori Sagisaka
2011Perceptually-Inspired Processing for Multichannel Wiener Filter.
Jorge I. Marin-Hurtado, David V. Anderson
2011Percy - An HTML5 Framework for Media Rich Web Experiments on Mobile Devices.
Christoph Draxler
2011Performance Prediction of Speech Recognition Using Average-Voice-Based Speech Synthesis.
Tatsuhiko Saito, Takashi Nose, Takao Kobayashi, Yohei Okato, Akio Horii
2011Personalizing Model M for Voice-Search.
Geoffrey Zweig, Shuangyu Chang
2011Phase-Only Speech Reconstruction Using Very Short Frames.
Erfan Loweimi, Seyed Mohammad Ahadi, Hamid Sheikhzadeh
2011Phone Impact Based Speech Transmission Technique for Reliable Speech Recognition in Poor Wireless Network Conditions.
Azar Taufique, Kumaran Vijayasankar, Wooil Kim, John H. L. Hansen, Marco Tacca, Andrea Fumagalli
2011Phoneme Level Non-Native Pronunciation Analysis by an Auditory Model-Based Native Assessment Scheme.
Christos Koniaris, Olov Engwall
2011Phoneme-Dependent NMF for Speech Enhancement in Monaural Mixtures.
Bhiksha Raj, Rita Singh, Tuomas Virtanen
2011Phoneme-Level Text to Audio Synchronization on Speech Signals with Background Music.
Agnès Pedone, Juan José Burred, Simon Maller, Pierre Leveau
2011Phonemic Similarity Metrics to Compare Pronunciation Methods.
Ben Hixon, Eric Schneider, Susan L. Epstein
2011Phonetic Classification Using Controlled Random Walks.
Katrin Kirchhoff, Andrei Alexandrescu
2011Phonological Knowledge Guided HMM State Mapping for Cross-Lingual Speaker Adaptation.
Hui Liang, John Dines
2011Phonotactic Constraints and the Segmentation of Cantonese Speech.
Michael C. W. Yip
2011Phrasal Prominences do not need Pitch Movements: Postfocal Phrasal Heads in Italian.
Giuliano Bocci, Cinzia Avesani
2011Phrases, Pitch and Perceived Prominence in Maori.
Laura Thompson, Catherine Inez Watson, Ray Harlow, Jeanette King, Margaret Maclagan, Helen Charters, Peter Keegan
2011Physical Models Producing Vowels with Pitch Variation.
Takayuki Arai
2011Places and Manner of Articulation of Bangla Consonants: A EPG Based Study.
Shyamal Kr. Das Mandal, Somnath Chandra Vijay Kumar, Swaran Lata, Asoke Kumar Datta
2011PodCastle: Recent Advances of a Spoken Document Retrieval Service Improved by Anonymous User Contributions.
Masataka Goto, Jun Ogata
2011Pointing Gestures do not Influence the Perception of Lexical Stress.
Alexandra Jesse, Holger Mitterer
2011Powered Wheelchair Control Using Acoustic-Based Recognition of Head Gesture Accompanying Speech.
Akira Sasou
2011Predicting Human Perceived Accuracy of ASR Systems.
Taniya Mishra, Andrej Ljolje, Mazin Gilbert
2011Predicting Speaker Changes and Listener Responses with and without Eye-Contact.
Daniel Neiberg, Joakim Gustafson
2011Predicting Taiwan Mandarin Tone Shapes from their Duration.
Chierh Cheng, Michele Gubian
2011Predicting Tongue Positions from Acoustics and Facial Features.
Asterios Toutios, Slim Ouni
2011Prediction of Binaural Intelligibility Level Differences in Reverberation.
Jan Rennies, Thomas Brand, Birger Kollmeier
2011Prediction of Voice Aperiodicity Based on Spectral Representations in HMM Speech Synthesis.
Hanna Silén, Elina Helander, Moncef Gabbouj
2011Privacy Preserving Speaker Verification Using Adapted GMMs.
Manas A. Pathak, Bhiksha Raj
2011Probabilistic Latent Semantic Analysis for Broadcast News Story Segmentation.
Mimi Lu, Cheung-Chi Leung, Lei Xie, Bin Ma, Haizhou Li
2011Probabilistic Spectrum Envelope: Categorized Audio-Features Representation for NMF-Based Sound Decomposition.
Toru Nakashika, Tetsuya Takiguchi, Yasuo Ariki
2011Problems Encountered by Japanese EL2 with English Short Vowels as Illustrated on a 3D Vowel Chart.
Toshiko Isei-Jaakkola, Takatoshi Naka, Keikichi Hirose
2011Processing of Stress Related Acoustic Cues as Indexed by ERPs.
Ferenc Honbolygo, Valéria Csépe
2011Production and Perception of Estonian Vowels by Native and Non-Native Speakers.
Lya Meister, Einar Meister
2011Progress and Prospects for Speech Technology: Results from Three Sexennial Surveys.
Roger K. Moore
2011Projectability of Transition-Relevance Places Using Prosodic Features in Japanese Spontaneous Conversation.
Yuichi Ishimoto, Mika Enomoto, Hitoshi Iida
2011Prominence Model for Prosodic Features in Automatic Lexical Stress and Pitch Accent Detection.
Kun Li, Shuang Zhang, Mingxing Li, Wai Kit Lo, Helen M. Meng
2011Prominence-Based Prosody Prediction for Unit Selection Speech Synthesis.
Andreas Windmann, Igor Jauk, Fabio Tamburini, Petra Wagner
2011Pronunciation Learning from Continuous Speech.
Ibrahim Badr, Ian McGraw, James R. Glass
2011Propagation of Uncertainty Through Multilayer Perceptrons for Robust Automatic Speech Recognition.
Ramón Fernandez Astudillo, João Paulo da Silva Neto
2011Prosodic Analysis and Perception of Mandarin Utterances Conveying Attitudes.
Wentao Gu, Ting Zhang, Hiroya Fujisaki
2011Prosodic Analysis of a Corpus of Tales.
David Doukhan, Albert Rilliard, Sophie Rosset, Martine Adda-Decker, Christophe d'Alessandro
2011Prosodic Correlates of Individual Physiological Response to Stress.
Serguei V. S. Pakhomov, Michael E. Kotlyar
2011Prosodic Highlights in Mandarin Continuous Speech - Cross-Genre Attributes and Implications.
Chiu-yu Tseng, Zhao-yu Su, Chi-Feng Huang
2011Prosodic Synchrony in Co-Operative Task-Based Dialogues: A Measure of Agreement and Disagreement.
Brian Vaughan
2011Prosodic and Phonetic Features for Speaker Clustering in Speaker Diarization Systems.
Janez Zibert, France Mihelic
2011Prosody Conversion for Emotional Mandarin Speech Synthesis Using the Tone Nucleus Model.
Miaomiao Wen, Miaomiao Wang, Keikichi Hirose, Nobuaki Minematsu
2011Prosody Toolkit: Integrating HTK, Praat and WEKA.
S. Thomas Christie, Serguei V. S. Pakhomov
2011Quality Aspects of Multimodal Dialog Systems: Identity, Stimulation and Success.
Christine Kühnel, Benjamin Weiss, Matthias Schulz, Sebastian Möller
2011Quality Assessment of Crowdsourcing Transcriptions for African Languages.
Hadrien Gelas, Solomon Teferra Abate, Laurent Besacier, François Pellegrino
2011Quality Improvement of Voice Conversion Systems Based on Trellis Structured Vector Quantization.
Mahdi Eslami, Hamid Sheikhzadeh, Abolghasem Sayadiyan
2011Quantifying Articulatory Distinctiveness of Vowels.
Jun Wang, Jordan R. Green, Ashok Samal, David Marx
2011Quantitative Analysis of Tone Coarticulation in Mandarin.
Hussein Hussein, Hansjörg Mixdorff, Hue San Do, Rüdiger Hoffmann
2011RANSAC-Based Training Data Selection for Speaker State Recognition.
Elif Bozkurt, Engin Erzin, Çigdem Eroglu Erdem, A. Tanju Erdem
2011ROVER Enhancement with Automatic Error Detection.
Kacem Abida, Fakhri Karray
2011Range Based Multi Microphone Array Fusion for Speaker Activity Detection in Small Meetings.
Jani Even, Panikos Heracleous, Carlos Toshinori Ishi, Norihiro Hagita
2011Rapid Adaptation of Foreign-Accented HMM-Based Speech Synthesis.
Reima Karhila, Mirjam Wester
2011Rapid Building of an ASR System for Under-Resourced Languages Based on Multilingual Unsupervised Training.
Ngoc Thang Vu, Franziska Kraus, Tanja Schultz
2011Rapid Evaluation of Speech Representations for Spoken Term Discovery.
Michael A. Carlin, Samuel Thomas, Aren Jansen, Hynek Hermansky
2011Rapid Training of Acoustic Models Using Graphics Processing Unit.
Senaka Buthpitiya, Ian R. Lane, Jike Chong
2011Reaction Time and Decision Difficulty in the Perception of Intonation.
Katrin Schneider, Grzegorz Dogil, Bernd Möbius
2011Real User Evaluation of Spoken Dialogue Systems Using Amazon Mechanical Turk.
Filip Jurcícek, Simon Keizer, Milica Gasic, François Mairesse, Blaise Thomson, Kai Yu, Steve J. Young
2011Real-Life Emotion Detection from Speech in Human-Robot Interaction: Experiments Across Diverse Corpora with Child and Adult Voices.
Marie Tahon, Agnès Delaborde, Laurence Devillers
2011Real-Time Prototype for Integration of Blind Source Extraction and Robust Automatic Speech Recognition.
Francesco Nesta, Marco Matassoni, Hari Krishna Maganti
2011Real-World Speech/Non-Speech Audio Classification Based on Sparse Representation Features and GPCs.
Ziqiang Shi, Jiqing Han, Tieran Zheng
2011Recognition and Real Time Performances of a Lightweight Ultrasound Based Silent Speech Interface Employing a Language Model.
Jun Cai, Bruce Denby, Pierre Roussel-Ragot, Gérard Dreyfus, Lise Crevier-Buchman
2011Recognition of Personality Traits from Human Spoken Conversations.
Alexei V. Ivanov, Giuseppe Riccardi, Adam J. Sporka, Jakub Franc
2011Recording Caregiver Interactions for Machine Acquisition of Spoken Language Using the KLAIR Virtual Infant.
Mark A. Huckvale
2011Recurrent Neural Network Based Language Modeling in Meeting Recognition.
Stefan Kombrink, Tomás Mikolov, Martin Karafiát, Lukás Burget
2011Reducing Computational Complexities of Exemplar-Based Sparse Representations with Applications to Large Vocabulary Speech Recognition.
Tara N. Sainath, Bhuvana Ramabhadran, David Nahamoo, Dimitri Kanevsky
2011Reduction of Highly Nonstationary Ambient Noise by Integrating Spectral and Locational Characteristics of Speech and Noise for Robust ASR.
Tomohiro Nakatani, Shoko Araki, Marc Delcroix, Takuya Yoshioka, Masakiyo Fujimoto
2011Redundancy Reduction in ASR of Spontaneous Speech Through Statistical Machine Translation.
Daniele Falavigna
2011Reformulating Prosodic Break Model into Segmental HMMs and Information Fusion.
Nicolas Obin, Pierre Lanchantin, Anne Lacheret, Xavier Rodet
2011Region Dependent Transform on MLP Features for Speech Recognition.
Tim Ng, Bing Zhang, Spyridon Matsoukas, Long Nguyen
2011Regularized Logistic Regression Fusion for Speaker Verification.
Ville Hautamäki, Kong-Aik Lee, Tomi Kinnunen, Bin Ma, Haizhou Li
2011Reinforcement Learning of Argumentation Dialogue Policies in Negotiation.
Kallirroi Georgila, David R. Traum
2011Relationships Between Phonetic Features and Speech Perception - A Statistical Investigation from a Large Anechoic British English Corpus.
Ian R. Cushing, Francis F. Li, Ken Worrall, Tim D. Jackson
2011Reliability-Weighted Acoustic Model Adaptation Using Crowd Sourced Transcriptions.
Kartik Audhkhasi, Panayiotis G. Georgiou, Shrikanth S. Narayanan
2011Report on Performance Results in the NIST 2010 Speaker Recognition Evaluation.
Craig S. Greenberg, Alvin F. Martin, Bradford Barr, George R. Doddington
2011Representing Phonological Features Through a Two-Level Finite State Model.
Javier Mikel Olaso, M. Inés Torres, Raquel Justo
2011Response Probability Based Decoding Algorithm for Large Vocabulary Continuous Speech Recognition.
Zhanlei Yang, Hao Chao, Wenju Liu
2011Restoring the Residual Speaker Information in Total Variability Modeling for Speaker Verification.
Ce Zhang, Rong Zheng, Bo Xu
2011Rhythm Metrics on Syllables and Feet do not Work as Expected.
Paolo Mairano, Antonio Romano
2011Robust Audio Fingerprinting Based on Local Spectral Luminance Maxima Scheme.
Yongzhe Shi, Weiqiang Zhang, Jia Liu
2011Robust Bimodal Person Identification Using Face and Speech with Limited Training Data and Corruption of Both Modalities.
Niall McLaughlin, Ji Ming, Danny Crookes
2011Robust HNR-Based Closed-Loop Pitch and Harmonic Parameters Estimation.
Alexander Pavlovets, Alexander A. Petrovsky
2011Robust Intonation Pattern Classification in Human Robot Interaction.
Martin Heckmann, Kazuhiro Nakadai, Hirofumi Nakajima
2011Robust Speaker Recognition in Non-Stationary Room Environments Based on Empirical Mode Decomposition.
Taufiq Hasan, John H. L. Hansen
2011Robust Speech Translation by Domain Adaptation.
Xiaodong He, Li Deng
2011Robust Voice Activity Detector for Real World Applications Using Harmonicity and Modulation Frequency.
Ekapol Chuangsuwanich, James R. Glass
2011Segregation of Whispered Speech Interleaved with Noise or Speech Maskers.
Nandini Iyer, Douglas Brungart, Brian D. Simpson
2011Semantic Graph Clustering for POMDP-Based Spoken Dialog Systems.
Florian Pinault, Fabrice Lefèvre
2011Semi-Automated Classifier Adaptation for Natural Language Call Routing.
Silke M. Witt
2011Semi-Automatic Acoustic Model Generation from Large Unsynchronized Audio and Text Chunks.
Michele Alessandrini, Giorgio Biagetti, Alessandro Curzi, Claudio Turchetti
2011Semi-Supervised Single-Channel Speech-Music Separation for Automatic Speech Recognition.
Cemil Demir, A. Taylan Cemgil, Murat Saraclar
2011Semi-Supervised Tree Support Vector Machine for Online Cough Recognition.
Huynh Thai Hoa, An Vu Tran, Tran Huy Dat
2011Sentence Selection by Direct Likelihood Maximization for Language Model Adaptation.
Takahiro Shinozaki, Yu Kubota, Sadaoki Furui, Eiji Utsunomiya, Yasutaka Shindoh
2011Separating Speaker and Environmental Variability Using Factored Transforms.
Michael L. Seltzer, Alex Acero
2011Sequential Classification Criteria for NNs in Automatic Speech Recognition.
Guangsen Wang, Khe Chai Sim
2011Shrinkage-Based Features for Natural Language Call Routing.
Ruhi Sarikaya, Stanley F. Chen, Bhuvana Ramabhadran
2011Signals and Speech.
Alex Pentland
2011Similar Vowels in L1/L2 Production: Confused or Discerned in Early L2 English Learners with Different Amount of Exposure.
E.-Chin Wu
2011Similarity Language Model.
Christian Gillot, Christophe Cerisara
2011Simulating Post-L F0 Bouncing by Modeling Articulatory Dynamics.
Santitham Prom-on, Yi Xu, Fang Liu
2011Sinewave Representations of Nonmodality.
Nicolas Malyska, Thomas F. Quatieri, Robert B. Dunn
2011Singing Voice Analysis Using Relative Harmonic Delays.
Ricardo Teixeira Sousa, Aníbal J. S. Ferreira
2011Singing Voice Synthesis: Singer-Dependent Vibrato Modeling and Coherent Processing of Spectral Envelope.
Siu Wa Lee, Minghui Dong
2011Single Channel Dereverberation Using Example-Based Speech Enhancement with Uncertainty Decoding Technique.
Keisuke Kinoshita, Mehrez Souden, Marc Delcroix, Tomohiro Nakatani
2011Single Channel Speech Enhancement Using MMSE Estimation of Short-Time Modulation Magnitude Spectrum.
Kuldip K. Paliwal, Belinda Schwerin, Kamil K. Wójcicki
2011Single Channel Speech Music Separation Using Nonnegative Matrix Factorization with Sliding Windows and Spectral Masks.
Emad M. Grais, Hakan Erdogan
2011Single-Channel Head Orientation Estimation Based on Discrimination of Acoustic Transfer Function.
Ryoichi Takashima, Tetsuya Takiguchi, Yasuo Ariki
2011Sinusoidal Approach for the Single-Channel Speech Separation and Recognition Challenge.
Pejman Mowlaee, Rahim Saeidi, Zheng-Hua Tan, Mads Græsbøll Christensen, Tomi Kinnunen, Pasi Fränti, Søren Holdt Jensen
2011Skew Gaussian Mixture Models for Speaker Recognition.
Avi Matza
2011Spatial Filter Calibration Based on Minimization of Modified LSD.
Nobuaki Tanaka, Tetsuji Ogawa, Tetsunori Kobayashi
2011Speak4it and the Multimodal Semantic Interpretation System.
Michael Johnston, Patrick Ehlen
2011Speaker Clustering Based on Non-Negative Matrix Factorization.
Masafumi Nishida, Seiichi Yamamoto
2011Speaker Clustering Based on Utterance-Oriented Dirichlet Process Mixture Model.
Naohiro Tawara, Shinji Watanabe, Tetsuji Ogawa, Tetsunori Kobayashi
2011Speaker Diarization Using a priori Acoustic Information.
Hagai Aronowitz
2011Speaker Identification for Whispered Speech Using a Training Feature Transformation from Neutral to Whisper.
Xing Fan, John H. L. Hansen
2011Speaker Modeling Using Local Binary Decisions.
Jean-François Bonastre, Xavier Anguera Miró, Gabriel Hernández Sierra, Pierre-Michel Bousquet
2011Speaker Recognition Using Temporal Contours in Linguistic Units: The Case of Formant and Formant-Bandwidth Trajectories.
Joaquin Gonzalez-Rodriguez
2011Speaker Role Recognition Using Question Detection and Characterization.
Thierry Bazillon, Benjamin Maza, Mickael Rouvier, Frédéric Béchet, Alexis Nasr
2011Speaker State Classification Based on Fusion of Asymmetric SIMPLS and Support Vector Machines.
Dong-Yan Huang, Shuzhi Sam Ge, Zhengchen Zhang
2011Speaker Verification Robust to Talking Style Variation Using Multiple Kernel Learning Based on Conditional Entropy Minimization.
Tetsuji Ogawa, Hideitsu Hino, Noboru Murata, Tetsunori Kobayashi
2011Speaker Verification Using Sparse Representations on Total Variability i-vectors.
Ming Li, Xiang Zhang, Yonghong Yan, Shrikanth S. Narayanan
2011Speaker-Adaptive Speech Synthesis Based on Eigenvoice Conversion and Language-Dependent Prosodic Conversion in Speech-to-Speech Translation.
Nobuhiko Hattori, Tomoki Toda, Hisashi Kawai, Hiroshi Saruwatari, Kiyohiro Shikano
2011Speaking More Like You: Entrainment in Conversational Speech.
Julia Hirschberg
2011Speaking to the Crowd: Looking at Past Achievements in Using Crowdsourcing for Speech and Predicting Future Challenges.
Gabriel Parent, Maxine Eskénazi
2011Spectral Envelope Transformation Using DFW and Amplitude Scaling for Voice Conversion with Parallel or Nonparallel Corpora.
Elizabeth Godoy, Olivier Rosec, Thierry Chonavel
2011Spectral Features for Automatic Blind Intelligibility Estimation of Spastic Dysarthric Speech.
Richard Hummel, Wai-Yip Chan, Tiago H. Falk
2011Speech Enhancement Using Masking Properties in Adverse Environments.
Atanu Saha, Tetsuya Shimamura
2011Speech Enhancement by Reconstruction from Cleaned Acoustic Features.
Philip Harding, Ben Milner
2011Speech Events are Recoverable from Unlabeled Articulatory Data: Using an Unsupervised Clustering Approach on Data Obtained from Electromagnetic Midsaggital Articulography (EMA).
Daniel Duran, Jagoda Bruni, Grzegorz Dogil, Hinrich Schütze
2011Speech Indexing Using Semantic Context Inference.
Chien-Lin Huang, Bin Ma, Haizhou Li, Chung-Hsien Wu
2011Speech Modulation Features for Robust Nonnative Speech Accent Detection.
Sethserey Sam, Xiong Xiao, Laurent Besacier, Eric Castelli, Haizhou Li, Chng Eng Siong
2011Speech Processing Tools - An Introduction to Interoperability.
Christoph Draxler, Toomas Altosaar, Sadaoki Furui, Mark Y. Liberman, Peter Wittenburg
2011Speech Recognition in Mixed Sound of Speech and Music Based on Vector Quantization and Non-Negative Matrix Factorization.
Shoichi Nakano, Kazumasa Yamamoto, Seiichi Nakagawa
2011Speech Synthesis Based on Articulatory-Movement HMMs with Voice-Source Codebooks.
Tsuneo Nitta, Takayuki Onoda, Masashi Kimura, Yurie Iribe, Kouichi Katsurada
2011Speech Synthesis Parameter Generation for the Assistive Silent Speech Interface MVOCA.
Robin Hofe, Stephen R. Ell, Michael J. Fagan, James M. Gilbert, Phil D. Green, Roger K. Moore, Sergey I. Rybchenko
2011Speech Technology in (Re)Habilitation of Persons with Communication Disabilities.
Björn Granström
2011Speech Timing Organization for the Phonological Length Contrast in Italian Consonants.
Claudio Zmarich, Barbara Gili Fivela, Pascal Perrier, Christophe Savariaux, Graziano Tisato
2011Speech Transcript Evaluation for Information Retrieval.
Laurens van der Werff, Wessel Kraaij, Franciska de Jong
2011Speech Translation with Grammar Driven Probabilistic Phrasal Bilexica Extraction.
Markus Saers, Dekai Wu, Chi-kiu Lo, Karteek Addanki
2011Speech-Based Non-Prototypical Affect Recognition for Child-Robot Interaction in Reverberated Environments.
Martin Wöllmer, Felix Weninger, Stefan Steidl, Anton Batliner, Björn W. Schuller
2011SpeechForms: From Web to Speech and Back.
Luciano Barbosa, Diamantino Caseiro, Giuseppe Di Fabbrizio
2011Spoken Document Confidence Estimation Using Contextual Coherence.
Taichi Asami, Narichika Nomoto, Satoshi Kobashikawa, Yoshikazu Yamaguchi, Hirokazu Masataki, Satoshi Takahashi
2011Spoken Language Recognition in the Latent Topic Simplex.
Kong-Aik Lee, Chang Huai You, Ville Hautamäki, Anthony Larcher, Haizhou Li
2011Spoken Lecture Summarization by Random Walk over a Graph Constructed with Automatically Extracted Key Terms.
Yun-Nung Chen, Yu Huang, Ching-Feng Yeh, Lin-Shan Lee
2011Spoken Term Detection Results Using Plural Subword Models by Estimating Detection Performance for Each Query.
Yoshiaki Itoh, Kohei Iwata, Masaaki Ishigame, Kazuyo Tanaka, Shi-wook Lee
2011State-Level Data Borrowing for Low-Resource Speech Recognition Based on Subspace GMMs.
Yanmin Qian, Daniel Povey, Jia Liu
2011Statistical Mapping Between Articulatory and Acoustic Data for an Ultrasound-Based Silent Speech Interface.
Thomas Hueber, Elie-Laurent Benaroya, Bruce Denby, Gérard Chollet
2011Stop Consonant Recognition by Temporal Fine Structure of Burst.
Seppo Fagerlund, Unto K. Laine
2011Structural Joint Factor Analysis for Speaker Recognition.
Marc Ferras, Koichi Shinoda, Sadaoki Furui
2011Structured Support Vector Machines for Noise Robust Continuous Speech Recognition.
Shi-Xiong Zhang, Mark J. F. Gales
2011Study of Overlapped Speech Detection for NIST SRE Summed Channel Speaker Recognition.
Hanwu Sun, Bin Ma
2011Study on the Relevance Factor of Maximum a Posteriori with GMM for Language Recognition.
Chang Huai You, Haizhou Li, Kong-Aik Lee
2011Stylization and Trajectory Modelling of Short and Long Term Speech Prosody Variations.
Nicolas Obin, Anne Lacheret, Xavier Rodet
2011Sub-Band Level Histogram Equalization for Robust Speech Recognition.
Vikas Joshi, Raghavendra Bilgi, Srinivasan Umesh, Luz García, M. Carmen Benítez
2011Subjective and Objective Evaluation of Speech Intelligibility Enhancement Under Constant Energy and Duration Constraints.
Yan Tang, Martin Cooke
2011Super-Dirichlet Mixture Models Using Differential Line Spectral Frequencies for Text-Independent Speaker Identification.
Zhanyu Ma, Arne Leijon
2011Supervised Sparse Coding Strategy in Cochlear Implants.
Jinqiu Sang, Guoping Li, Hongmei Hu, Mark E. Lutman, Stefan Bleeck
2011Syllable Segmentation of Continuous Speech Using Auditory Attention Cues.
Ozlem Kalinli
2011Sylli: Automatic Phonological Syllabification for Italian.
Luca Iacoponi, Renata Savy
2011Symbolic and Direct Sequential Modeling of Prosody for Classification of Speaking-Style and Nativeness.
Andrew Rosenberg
2011Synchronous Reading: Learning French Orthography by Audiovisual Training.
Gérard Bailly, Will Barbour
2011Synthesis of Breathy, Normal, and Pressed Phonation Using a Two-Mass Model with a Triangular Glottis.
Peter Birkholz, Bernd J. Kröger, Christiane Neuschaefer-Rube
2011TSAB - Web Interface for Transcribed Speech Collections.
Tanel Alumäe, Ahti Kitsik
2011Tackling a Shilly-Shally Classifier for Predicting Task Success in Spoken Dialogue Interaction.
Alexander Schmitt, Alexander Zgorzelski, Wolfgang Minker
2011Target-Aware Lattice Rescoring for Dialect Recognition.
Rong Tong, Bin Ma, Haizhou Li, Chng Eng Siong
2011Template-Based Automatic Speech Recognition Meets Prosody.
Dino Seppi, Kris Demuynck, Dirk Van Compernolle
2011Temporal Performance of Dysarthric Patients in Speech and Tapping Tasks.
Eiji Shimura, Kazuhiko Kakehi
2011Temporal Relationship Between Auditory and Visual Prosodic Cues.
Erin Cvejic, Jeesun Kim, Chris Davis
2011Text Driven 3D Photo-Realistic Talking Head.
Lijuan Wang, Wei Han, Frank K. Soong, Qiang Huo
2011The "Fortis-Lenis" Distinction in Bulgarian and German.
Bistra Andreeva, Magdalena Wolska
2011The Albayzin 2010 Language Recognition Evaluation.
Luis Javier Rodríguez, Mikel Peñagarikano, Amparo Varona, Mireia Díez, Germán Bordel
2011The Detection of Overlapping Speech with Prosodic Features for Speaker Diarization.
Martin Zelenák, Javier Hernando
2011The Effect of Seeing the Interlocutor on Speech Production in Different Noise Types.
Michael Fitzpatrick, Jeesun Kim, Chris Davis
2011The Effect of Using Normalized Models in Statistical Speech Synthesis.
Matt Shannon, Heiga Zen, William J. Byrne
2011The Effects of Phoneme Errors in Speaker Adaptation for HMM Speech Synthesis.
Bálint Tóth, Tibor Fegyó, Géza Németh
2011The Efficiency of Cross-Dialectal Word Recognition.
Annelie Tuinman, Holger Mitterer, Anne Cutler
2011The INTERSPEECH 2011 Speaker State Challenge.
Björn W. Schuller, Stefan Steidl, Anton Batliner, Florian Schiel, Jarek Krajewski
2011The JSafran Platform for Semi-Automatic Speech Processing.
Christophe Cerisara, Claire Gardent
2011The KLAIR Toolkit for Recording Interactive Dialogues with a Virtual Infant.
Mark A. Huckvale
2011The Lombard Effect in Spontaneous Dialog Speech.
Laura Folk, Florian Schiel
2011The Multi Timescale Phoneme Acquisition Model of the Self-Organizing Based on the Dynamic Features.
Kouki Miyazawa, Hideaki Miura, Hideaki Kikuchi, Reiko Mazuka
2011The Open Front Vowel /æ/ in the Production and Perception of Czech Students of English.
Pavel Sturm, Radek Skarnitzl
2011The Perception Boundary Between Single and Geminate Stops in 3- and 4-Mora Japanese Words.
Shigeaki Amano, Yukari Hirata
2011The Phonology and Phonetics of Perceived Prosody: What do Listeners Imitate?
Jennifer Cole, Stefanie Shattuck-Hufnagel
2011The Relation Between Perception and Production in L2 Phonological Processing.
Sharon Peperkamp, Camillia Bouchon
2011The Representation of Speech in a Nonlinear Auditory Model: Time-Domain Analysis of Simulated Auditory-Nerve Firing Patterns.
Guy J. Brown, Tim Jürgens, Ray Meddis, Matthew Robertson, Nicholas R. Clark
2011The Role of Variability in Non-Native Perceptual Learning of a Japanese Geminate-Singleton Fricative Contrast.
Makiko Sadakata, James M. McQueen
2011The Role of Word-Initial Glottal Stops in Recognizing English Words.
Maria Paola Bissiri, María Luisa García Lecumberri, Martin Cooke, Jan Volín
2011The Social Signal Interpretation Framework (SSI) for Real Time Signal Processing and Recognition.
Johannes Wagner, Florian Lingenfelser, Elisabeth André
2011The Time-Course of Talker-Specificity Effects for Newly-Learned Pseudowords: Evidence for a Hybrid Model of Lexical Representation.
Helen Brown, M. Gareth Gaskell
2011The USC CARE Corpus: Child-Psychologist Interactions of Children with Autism Spectrum Disorders.
Matthew Black, Daniel Bone, Marian E. Williams, Phillip Gorrindo, Pat Levitt, Shrikanth S. Narayanan
2011The Vocal Effort of Dominance in Scenario Meetings.
Marcela Charfuelan, Marc Schröder
2011Theoretical Analysis of Musical Noise and Speech Distortion in Structure-Generalized Parametric Blind Spatial Subtraction Array.
Ryoichi Miyazaki, Hiroshi Saruwatari, Kiyohiro Shikano
2011Thresholding Word Activations for Response Scoring - Modelling Psycholinguistic Data.
Christina Bergmann, Louis ten Bosch, Lou Boves
2011Time- and Acoustic-Mediated Alignment Algorithms for Speech Recognition Evaluation.
Simon Dobrisek, France Mihelic
2011Time-Varying Signal Adaptive Transform and IHT Recovery of Compressive Sensed Speech.
Ch. Srikanth Raj, Thippur V. Sreenivas
2011Timing in Italian VNC Sequences at Different Speech Rates.
Chiara Celata, Silvia Calamai
2011To Weight or Not to Weight: Source-Normalised LDA for Speaker Recognition Using i-vectors.
Mitchell McLaren, David A. van Leeuwen
2011Tonal Alignment Defined: The Case of Southern Irish English.
Raya Kalaldeh
2011Tonal Variations in Mandarin: New Evidence from Spontaneous and Read Speech.
Li-chiung Yang
2011Tongue Gestures Awareness and Pronunciation Training.
Slim Ouni
2011Topic Identification from Audio Recordings Using Rich Recognition Results and Neural Network Based Classifiers.
Roberto Gemello, Franco Mana, Pier Domenico Batzu
2011Topic Segmentation of TV-Streams by Mathematical Morphology and Vectorization.
Vincent Claveau, Sébastien Lefèvre
2011Topic Switching Strategies for Spoken Dialogue Systems.
Tobias Heinroth, Savina Koleva, Wolfgang Minker
2011Toward a Continuous Modeling of French Prosodic Structure: Using Acoustic Features to Predict Prominence Location and Prominence Degree.
Mathieu Avanzi, Nicolas Obin, Anne Lacheret-Dujour, Bernard Victorri
2011Toward a Multi-Speaker Visual Articulatory Feedback System.
Atef Ben Youssef, Thomas Hueber, Pierre Badin, Gérard Bailly
2011Towards Context-Dependent Phonetic Spelling Error Correction in Children's Freely Composed Text for Diagnostic and Pedagogical Purposes.
Sebastian Stüker, Johanna Fay, Kay Berkling
2011Towards Fully Bayesian Speaker Recognition: Integrating Out the Between-Speaker Covariance.
Jesús Antonio Villalba López, Niko Brümmer
2011Towards Goat Detection in Text-Dependent Speaker Verification.
Orith Toledo-Ronen, Hagai Aronowitz, Ron Hoory, Jason W. Pelecanos, David Nahamoo
2011Towards High Performance LVCSR in Speech-to-Speech Translation System on Smart Phones.
Jian Xue, Xiaodong Cui, Gregg Daggett, Etienne Marcheret, Bowen Zhou
2011Towards Unsupervised Spoken Language Understanding: Exploiting Query Click Logs for Slot Filling.
Gökhan Tür, Dilek Hakkani-Tür, Dustin Hillard, Asli Celikyilmaz
2011Towards Unsupervised Training of Speaker Independent Acoustic Models.
Aren Jansen, Kenneth Church
2011Towards Voice-Input Symbolic Pattern Retrieval Using Parameter-Based Search.
Yukiko Suzuki, Kiyoaki Aikawa
2011Towards a Versatile Multi-Layered Description of Speech Corpora Using Algebraic Relations.
Nelly Barbot, Vincent Barreaud, Olivier Boëffard, Laure Charonnat, Arnaud Delhay, Sébastien Le Maguer, Damien Lolive
2011Tracking Pitch Contours Using Minimum Jerk Trajectories.
Daniel Neiberg, Gopal Ananthakrishnan, Joakim Gustafson
2011Training a Language Model Using Webdata for Large Vocabulary Japanese Spontaneous Speech Recognition.
Ryo Masumura, Seongjun Hahm, Akinori Ito
2011Tree Encoding for the ITU-T G.711.1 Speech Coder.
Abdul Hannan Khan, Peter Kabal
2011Tue-SeA Real-Time Speech Command Detector for a Smart Control Room.
Daniel Reich, Felix Putze, Dominic Heger, Joris IJsselmuiden, Rainer Stiefelhagen, Tanja Schultz
2011Unary Data Structures for Language Models.
Jeffrey Sorensen, Cyril Allauzen
2011Uncertainty Management for On-Line Optimisation of a POMDP-Based Large-Scale Spoken Dialogue System.
Lucie Daubigney, Milica Gasic, Senthilkumar Chandramohan, Matthieu Geist, Olivier Pietquin, Steve J. Young
2011Uncertainty Measures for Improving Exemplar-Based Source Separation.
Heikki Kallasjoki, Ulpu Remes, Jort F. Gemmeke, Tuomas Virtanen, Kalle J. Palomäki
2011Uncovering the Effect of Imitation on Tonal Patterns of French Accentual Phrases.
Amandine Michelas, Noël Nguyen
2011Underdetermined Blind Source Separation with Fuzzy Clustering for Arbitrarily Arranged Sensors.
Ingrid Jafari, Serajul Haque, Roberto Togneri, Sven Nordholm
2011Uniform Speech Parameterization for Multi-Form Segment Synthesis.
Alexander Sorin, Slava Shechtman, Vincent Pollet
2011University of Ljubljana System for Interspeech 2011 Speaker State Challenge.
Rok Gajsek, Simon Dobrisek, France Mihelic
2011Unsupervised Arabic Dialect Adaptation with Self-Training.
Scott Novotney, Richard M. Schwartz, Sanjeev Khudanpur
2011Unsupervised Audio Analysis for Categorizing Heterogeneous Consumer Domain Videos.
Pradeep Natarajan, Stavros Tsakalidis, Vasant Manohar, Rohit Prasad, Premkumar Natarajan
2011Unsupervised Audio Patterns Discovery Using HMM-Based Self-Organized Units.
Man-Hung Siu, Herbert Gish, Steve Lowe, Arthur Chan
2011Unsupervised Clustering of Utterances Using Non-Parametric Bayesian Methods.
Ryuichiro Higashinaka, Noriaki Kawamae, Kugatsu Sadamitsu, Yasuhiro Minami, Toyomi Meguro, Kohji Dohsaka, Hirohito Inagaki
2011Unsupervised Continuous-Valued Word Features for Phrase-Break Prediction without a Part-of-Speech Tagger.
Oliver Watts, Junichi Yamagishi, Simon King
2011Unsupervised Features from Text for Speech Synthesis in a Speech-to-Speech Translation System.
Oliver Watts, Bowen Zhou
2011Unsupervised Geometry Calibration of Acoustic Sensor Networks Using Source Correspondences.
Joerg Schmalenstroeer, Florian Jacob, Reinhold Haeb-Umbach, Marius H. Hennecke, Gernot A. Fink
2011Unsupervised Hidden Markov Modeling of Spoken Queries for Spoken Term Detection without Speech Recognition.
Chun-an Chan, Lin-Shan Lee
2011Unsupervised Latent Speaker Language Modeling.
Yik-Cheung Tam, Paul Vozila
2011Unsupervised Learning of Acoustic Events Using Dynamic Time Warping and Hierarchical K-Means++ Clustering.
Joerg Schmalenstroeer, Markus Bartek, Reinhold Haeb-Umbach
2011Unsupervised Learning of Acoustic Unit Descriptors for Audio Content Representation and Classification.
Sourish Chaudhuri, Mark Harvilla, Bhiksha Raj
2011Unsupervised Testing Strategies for ASR.
Brian Strope, Doug Beeferman, Alexander Gruenstein, Xin Lei
2011Use of the Harmonic Phase in Speaker Recognition.
Inma Hernáez, Ibon Saratxaga, Jon Sánchez, Eva Navas, Iker Luengo
2011User Simulation in Dialogue Systems Using Inverse Reinforcement Learning.
Senthilkumar Chandramohan, Matthieu Geist, Fabrice Lefèvre, Olivier Pietquin
2011User Study of Spoken Decision Support System.
Teruhisa Misu, Kiyonori Ohtake, Chiori Hori, Hisashi Kawai, Satoshi Nakamura
2011Using Crowdsourcing to Provide Prosodic Annotations for Non-Native Speech.
Keelan Evanini, Klaus Zechner
2011Using Dynamic Time Warping to Compute Prosodic Similarity Measures.
Albert Rilliard, Alexandre Allauzen, Philippe Boula de Mareüil
2011Using Features from Topic Models to Alleviate Over-Generation in Hierarchical Phrase-Based Translation.
Songfang Huang, Bowen Zhou
2011Using Human Perception for Automatic Accent Assessment.
Freddy William, Abhijeet Sangwan, John H. L. Hansen
2011Using Imitation to Learn Infant-Adult Acoustic Mappings.
Gopal Ananthakrishnan, Giampiero Salvi
2011Using Latent Topic Features for Named Entity Extraction in Search Queries.
Joe Polifroni, François Mairesse
2011Using Multiple Databases for Training in Emotion Recognition: To Unite or to Vote?
Björn W. Schuller, Zixing Zhang, Felix Weninger, Gerhard Rigoll
2011Using Mutual Information to Identify Regions of Analysis for Prosodic Analysis.
Andrew Rosenberg
2011Using Prominence Detection to Generate Acoustic Feedback in Tutoring Scenarios.
Lars Schillingmann, Petra Wagner, Christian Munier, Britta Wrede, Katharina J. Rohlfing
2011Using Prosodic and Spectral Features in Detecting Depression in Elderly Males.
Michelle Hewlett Sanchez, Dimitra Vergyri, Luciana Ferrer, Colleen Richey, Pablo Garcia, Bruce Knoth, William Jarrold
2011Using Speaker ID to Discover Repeat Callers of a Spoken Dialog System.
Andrew Fandrianto, Brian Langner, Alan W. Black
2011Using Spectral Fluctuation of Speech in Multi-Feature HMM-Based Voice Activity Detection.
Miquel Espi, Shigeki Miyabe, Takuya Nishimoto, Nobutaka Ono, Shigeki Sagayama
2011Using Unsupervised Feature-Based Speaker Adaptation for Improved Transcription of Spoken Archives.
Petr Cerva, Karel Palecek, Jan Silovský, Jan Nouza
2011Using a Genetic Algorithm to Estimate Parameters of a Coarticulation Model.
Brian O. Bush, John-Paul Hosom, Alexander Kain, Akiko Amano-Kusumoto
2011Utterance Verification for Automating the Hearing in Noise Test (HINT).
H. Timothy Bunnell, Jason Lilley, Sigfrid D. Soli, Ivan Pal
2011VTLN in the MFCC Domain: Band-Limited versus Local Interpolation.
Ehsan Variani, Thomas Schaaf
2011Validating a Second Language Perception Model for Classroom Context - A Longitudinal Study within the Perceptual Assimilation Model.
Bianca Sisinni, Mirko Grimaldi
2011Validating rt-MRI Based Articulatory Representations via Articulatory Recognition.
Athanasios Katsamanis, Erik Bresch, Vikram Ramanarayanan, Shrikanth S. Narayanan
2011Variation of Accent Type and of Context - Influences on Pragmatic Focus Interpretation.
Charlotte Wollermann, Ulrich Schade, Bernhard Schröder
2011Variational Bayesian Model Selection for GMM-Speaker Verification Using Universal Background Model.
Timur Pekhovsky, Alexandra Lokhanova
2011Verifying Human Users in Speech-Based Interactions.
Sajad Shirali-Shahreza, Yashar Ganjali, Ravin Balakrishnan
2011Very Large Vocabulary ASR for Spoken Russian with Syntactic and Morphemic Analysis.
Alexey Karpov, Irina S. Kipyatkova, Andrey Ronzhin
2011Very Short Utterances and Timing in Turn-Taking.
Mattias Heldner, Jens Edlund, Anna Hjalmarsson, Kornel Laskowski
2011Visual Speech Speeds Up Auditory Identification Responses.
Tim Paris, Jeesun Kim, Chris Davis
2011Visual Voice Mail to Text on the iPhone/iPad.
Andrej Ljolje, Vincent Goffin, Diamantino Caseiro, Taniya Mishra, Mazin Gilbert
2011Visualization of Vocal Tract Shape Using Interleaved Real-Time MRI of Multiple Scan Planes.
Yoon-Chul Kim, Michael I. Proctor, Shrikanth S. Narayanan, Krishna S. Nayak
2011Voice Activity Detection in MTF-Based Power Envelope Restoration.
Masashi Unoki, Xugang Lu, Rico Petrick, Shota Morita, Masato Akagi, Rüdiger Hoffmann
2011Voice Conversion Using GMM with Enhanced Global Variance.
Hadas Benisty, David Malah
2011Voice Processing by Dynamic Glottal Models with Applications to Speech Enhancement.
Carlo Drioli, Andrea Calanca
2011Voice Quality Characterization of IETF Opus Codec.
Anssi Rämö, Henri Toukomaa
2011Vowel Context and Speaker Interactions Influencing Glottal Open Quotient and Formant Frequency Shifts in Physical Task Stress.
Keith W. Godin, John H. L. Hansen
2011Vowels Formants Analysis Allows Straightforward Detection of High Arousal Acted and Spontaneous Emotions.
Bogdan Vlasenko, Dmytro Prylipko, David Philippou-Hübner, Andreas Wendemuth
2011Web-Based Automatic Speech Recognition Service - webASR.
Stuart N. Wrigley, Thomas Hain
2011Web-Enhanced Content Retrieval for Information Access Dialogue System.
Donghyeon Lee, Cheongjae Lee, Minwoo Jeong, Kyungduk Kim, Seokhwan Kim, Junhwi Choi, Gary Geunbae Lee
2011Weight Optimization for Bimodal Unit-Selection Talking Head Synthesis.
Asterios Toutios, Utpala Musti, Slim Ouni, Vincent Colotte
2011Weighted Ordered Classes - Nearest Neighbors: A New Framework for Automatic Emotion Recognition from Speech.
Yazid Attabi, Pierre Dumouchel
2011When Two Newly-Acquired Words are One: New Words Differing in Stress Alone are not Automatically Represented Differently.
Simone Sulpizio, James M. McQueen
2011Where Should Pitch Accents and Phrase Breaks Go? A Syntax Tree Transducer Solution.
Joseph Tepperman, Emily Nava
2011WinPitch: A Multimodal Tool for Speech Analysis of Endangered Languages.
Philippe Martin
2011Woefzela - An Open-Source Platform for ASR Data Collection in the Developing World.
Nic J. de Vries, Jaco Badenhorst, Marelie H. Davel, Etienne Barnard, Alta de Waal
2011Word Boundary Modelling and Full Covariance Gaussians for Arabic Speech-to-Text Systems.
Frank Diehl, Mark John Francis Gales, Xunying Liu, Marcus Tomalin, Philip C. Woodland
2011Your Mobile Virtual Assistant Just Got Smarter!
Mazin Gilbert, Iker Arizmendi, Enrico Bocchieri, Diamantino Caseiro, Vincent Goffin, Andrej Ljolje, Mike Phillips, Chao Wang, Jay G. Wilpon
2011Zero-Crossing-Based Channel Attentive Weighting of Cepstral Features for Robust Speech Recognition: The ETRI 2011 CHiME Challenge System.
Young-Ik Kim, Hoon-Young Cho, Sang-Hun Kim
2011Zero-Resource Audio-Only Spoken Term Detection Based on a Combination of Template Matching Techniques.
Armando Muscariello, Guillaume Gravier, Frédéric Bimbot
2011i-vector Based Speaker Recognition on Short Utterances.
Ahilan Kanagasundaram, Robbie Vogt, David Dean, Sridha Sridharan, Michael Mason
2011iVector Approach to Phonotactic Language Recognition.
Mehdi Soufifar, Marcel Kockmann, Lukás Burget, Oldrich Plchot, Ondrej Glembek, Torbjørn Svendsen
2011iVector Fusion of Prosodic and Cepstral Features for Speaker Verification.
Marcel Kockmann, Luciana Ferrer, Lukás Burget, Jan Cernocký
2011mTalk - A Multimodal Browser for Mobile Services.
Michael Johnston, Giuseppe Di Fabbrizio, Simon Urbanek