| 2005 | "hello - is anybody at home?" - about the minimum word accuracy of a smart home spoken dialogue system. Jan Felix Krebber |
| 2005 | 9th European Conference on Speech Communication and Technology, INTERSPEECH-Eurospeech 2005, Lisbon, Portugal, September 4-8, 2005 |
| 2005 | A Bayesian network approach combining pitch and spectral envelope features to reduce channel mismatch in speaker verification and forensic speaker recognition. Mijail Arcienega, Anil Alexander, Philipp Zimmermann, Andrzej Drygajlo |
| 2005 | A German viseme-set for automatic transcription of input text used for audio-visual speech synthesis. Christian Weiss, Bianca Aschenberner |
| 2005 | A MFCC-based CELP speech coder for server-based speech recognition in network environments. Gil Ho Lee, Jae Sam Yoon, Hong Kook Kim |
| 2005 | A Portuguese spoken and multi-modal dialog corpora. Gloria Branco, Luís Almeida, Rui Gomes, Nuno Beires |
| 2005 | A bi-lingual Mandarin-to-taiwanese text-to-speech system. Min-Siong Liang, Ke-Chun Chuang, Rhuei-Cheng Yang, Yuang-Chin Chiang, Ren-yuan Lyu |
| 2005 | A category-dependent feature selection method for speech signals. Woojay Jeon, Biing-Hwang Juang |
| 2005 | A comparative study of two kernel eigenspace-based speaker adaptation methods on large vocabulary continuous speech recognition. Roger Wend-Huu Hsiao, Brian Kan-Wing Mak |
| 2005 | A comparison of human and computer recognition accuracy for children's speech. Shona D'Arcy, Martin J. Russell |
| 2005 | A comparison of methods for speaker-dependent pronunciation tuning for text-to-speech synthesis. Gabriel Webster, Tina Burrows, Katherine M. Knill |
| 2005 | A comparison of particle filtering variants for speech feature enhancement. Reinhold Haeb-Umbach, Joerg Schmalenstroeer |
| 2005 | A computational model of the speech reception threshold for laterally separated speech and noise. Guy J. Brown, Kalle J. Palomäki |
| 2005 | A confidence measure invariant to language and grammar. Daniele Colibro, Luciano Fissore, Claudio Vair, Emanuele Dalmasso, Pietro Laface |
| 2005 | A confidence-guided dynamic pruning approach - utilization of confidence measurement in speech recognition. Tibor Fábián, Robert Lieb, Günther Ruske, Matthias Thomae |
| 2005 | A cross-linguistic study of vowel quantity in different word structures: Japanese, Finnish and Czech. Toshiko Isei-Jaakkola, Satoshi Asakawa |
| 2005 | A data-driven approach for the model parameter compensation in noisy speech recognition. Yong-Joo Chung |
| 2005 | A database of German emotional speech. Felix Burkhardt, Astrid Paeschke, M. Rolfes, Walter F. Sendlmeier, Benjamin Weiss |
| 2005 | A discriminative approach to phrase break modelling. Stephen Cox |
| 2005 | A distance measure between GMMs based on the unscented transform and its application to speaker recognition. Jacob Goldberger, Hagai Aronowitz |
| 2005 | A flexible and integrated interface between speech recognition, speech interpretation and dialog management. Robert Lieb, Matthias Thomae, Günther Ruske, Daniel Bobbert, Frank Althoff |
| 2005 | A frame based spoken dialog system for home care. Daniele Falavigna, Toni Giorgino, Roberto Gretter |
| 2005 | A framework for estimation of clean speech by fusion of outputs from multiple speech enhancement systems. Venkatesh Krishnan, Phil Spencer Whitehead, David V. Anderson, Mark A. Clements |
| 2005 | A generalized framework for compensation of mel-filterbank outputs in feature extraction for robust ASR. Eric H. C. Choi |
| 2005 | A glimpse of the time-course of intonation processing in European Portuguese. Isabel Falé, Isabel Hub Faria |
| 2005 | A graphical model for multi-sensory speech processing in air-and-bone conductive microphones. Amarnag Subramanya, Zhengyou Zhang, Zicheng Liu, Jasha Droppo, Alex Acero |
| 2005 | A human-human train timetable dialogue corpus. Filip Jurcícek, Jirí Zahradil, Libor Jelínek |
| 2005 | A hybrid ANN/DBN approach to articulatory feature recognition. Joe Frankel, Simon King |
| 2005 | A hybrid Maxent/HMM based ASR system. Yasser Hifny, Steve Renals, Neil D. Lawrence |
| 2005 | A hybrid approach to automatic segmentation and labeling for Mandarin Chinese speech corpus. Cheng-Yuan Lin, Kuan-Ting Chen, Jyh-Shing Roger Jang |
| 2005 | A hybrid microphone array post-filter in a diffuse noise field. Junfeng Li, Masato Akagi |
| 2005 | A longitudinal analysis of the spectral peaks of vowels for a Japanese infant. Kentaro Ishizuka, Ryoko Mugitani, Hiroko Kato Solvang, Shigeaki Amano |
| 2005 | A method of multi-layered speech segmentation tailored for speech synthesis. Takashi Saito |
| 2005 | A methodology for comparing grammar-based and robust approaches to speech understanding. Manny Rayner, Pierrette Bouillon, Nikos Chatzichrisafis, Beth Ann Hockey, Marianne Santaholma, Marianne Starlander, Hitoshi Isahara, Kyoko Kanzaki, Yukie Nakao |
| 2005 | A model for selective segregation of a target instrument sound from the mixed sound of various instruments. Masashi Unoki, Masaaki Kubo, Atsushi Haniu, Masato Akagi |
| 2005 | A model space framework for efficient speaker detection. Mathieu Ben, Guillaume Gravier, Frédéric Bimbot |
| 2005 | A multi-layer fuzzy logical model for emotional speech perception. Chun-Fang Huang, Masato Akagi |
| 2005 | A multi-pass, dynamic-vocabulary approach to real-time, large-vocabulary speech recognition. I. Lee Hetherington |
| 2005 | A multiple classifier-based concept-spotting approach for robust spoken language understanding. Jihyun Eun, Minwoo Jeong, Gary Geunbae Lee |
| 2005 | A neural network approach for the design of the target cost function in unit-selection speech synthesis. Francisco Campillo Díaz, José Luis Alba, Eduardo Rodríguez Banga |
| 2005 | A new evaluation criteria for keyword spotting techniques and a new algorithm. Marius-Calin Silaghi, Rachna Vargiya |
| 2005 | A new posterior based audio-visual integration method for robust speech recognition. Rowan Seymour, Ji Ming, Darryl Stewart |
| 2005 | A new structural preprocessor for low-bit rate speech coding. Joon-Hyuk Chang, Jong Won Shin, Seung Yeol Lee, Nam Soo Kim |
| 2005 | A noise-robust pitch synchronous feature extraction algorithm for speaker recognition systems. Samuel Kim, Sung-Wan Yoon, Thomas Eriksson, Hong-Goo Kang, Dae Hee Youn |
| 2005 | A novel voicing cut-off determination for low bit-rate harmonic speech coding. Changchun Bao, Jason Lukasiak, Christian H. Ritz |
| 2005 | A partial decorrelation scheme for improved predictive open loop quantization with noise shaping. Hauke Krüger, Peter Vary |
| 2005 | A performance investigation of noisy voice recognition over IP telephony networks. Gang Chen, Douglas D. O'Shaughnessy, Hesham Tolba |
| 2005 | A phonetic study of the "er-hua" rimes in Beijing Mandarin. Wai-Sum Lee |
| 2005 | A pitch-based model for separation of reverberant speech. Nicoleta Roman, DeLiang Wang |
| 2005 | A pitch-synchronous pitch-cycle modification method for designing a hybrid i-MELP/waveform-matching speech coder. Ali Erdem Ertan, Thomas P. Barnwell III |
| 2005 | A posteriori multiple word-domain language model. Elvira I. Sicilia-Garcia, Ji Ming, Francis Jack Smith |
| 2005 | A preprocessing technique for improving speech intelligibility in reverberant environments: the effect of steady-state suppression on elderly people. Yusuke Miyauchi, Nao Hodoshima, Keiichi Yasu, Nahoko Hayashi, Takayuki Arai, Mitsuko Shindo |
| 2005 | A principled approach for rejection threshold optimization in spoken dialog systems. Dan Bohus, Alexander I. Rudnicky |
| 2005 | A probabilistic approach to prosodic word prediction for Mandarin Chinese TTS. Minghui Dong, Kim-Teng Lua, Haizhou Li |
| 2005 | A probabilistic approach to unit selection for corpus-based speech synthesis. Shinsuke Sakai, Han Shu |
| 2005 | A rapid prototyping tool for constructing web-based MMI applications. Kouichi Katsurada, Kunitoshi Sato, Hiroaki Adachi, Hirobumi Yamada, Tsuneo Nitta |
| 2005 | A rhythmic-prosodic model of poetic speech. Jörg Bröggelwirth |
| 2005 | A speaker biased SI recognizer for embedded mobile applications. Yaxin Zhang, Bian Wu, Xiaolin Ren, Xin He |
| 2005 | A speaker independent "liveness" test for audio-visual biometrics. Nicolas Eveno, Laurent Besacier |
| 2005 | A speaker independent continuous speech recognizer for Amharic. Hussien Seid, Björn Gambäck |
| 2005 | A spectral conversion approach to feature denoising and speech enhancement. Athanasios Mouchtaris, Jan Van der Spiegel, Paul Mueller, Panagiotis Tsakalides |
| 2005 | A spectrogram model for enhanced source localization and noise-robust ASR. Guillaume Lathoud, Mathew Magimai-Doss, Bertrand Mesot |
| 2005 | A speech centric mobile multimodal service useful for dyslectics and aphasics. Knut Kvale, Narada D. Warakagoda |
| 2005 | A speech similarity distance weighting for robust recognition. Michael J. Carey, Tuan P. Quang |
| 2005 | A statistical method of evaluating pronunciation proficiency for Japanese words. Kei Ohta, Seiichi Nakagawa |
| 2005 | A stereo input-output superdirective beamformer for dual channel noise reduction. Thomas Lotter, Bastian Sauert, Peter Vary |
| 2005 | A stochastic approach to phoneme and accent estimation. Tohru Nagano, Shinsuke Mori, Masafumi Nishimura |
| 2005 | A stream-based audio segmentation, classification and clustering pre-processing system for broadcast news using ANN models. Hugo Meinedo, João Paulo Neto |
| 2005 | A study of implicit and explicit modeling of coarticulation and pronunciation variation. Stéphane Dupont, Christophe Ris, Laurent Couvreur, Jean-Marc Boite |
| 2005 | A study of variable pulse allocation for MPE and CELP coders based on PESQ analysis. Shi-Han Chen, Kuo-Guan Wu, Chih-Chung Kuo |
| 2005 | A study of weighted CSP analysis with average speech spectrum for noise robust talker localization. Yuki Denda, Takanobu Nishiura, Yoichi Yamashita |
| 2005 | A study on separation between acoustic models and its applications. Yu Tsao, Jinyu Li, Chin-Hui Lee |
| 2005 | A study on the automatic detection and characterization of emotion in a voice service context. Christophe Blouin, Valérie Maffiolo |
| 2005 | A support vector approach to the acoustic-to-articulatory mapping. Asterios Toutios, Konstantinos G. Margaritis |
| 2005 | A system for audio-visual speech recognition. Islam Shdaifat, Rolf-Rainer Grigat |
| 2005 | A tagged-cine MRI investigation of German vowels. Marianne Pouplier, Maureen Stone |
| 2005 | A text categorization approach to automatic language identification. Sheng Gao, Bin Ma, Haizhou Li, Chin-Hui Lee |
| 2005 | A three-dimensional linear articulatory model of velum based on MRI data. Antoine Serrurier, Pierre Badin |
| 2005 | A timbre space for speech. Hiroko Terasawa, Malcolm Slaney, Jonathan Berger |
| 2005 | A toolkit for voice inverse filtering and parametrisation. Matti Airas, Hannu Pulakka, Tom Bäckström, Paavo Alku |
| 2005 | A transformation-based learning approach to language identification for mixed-lingual text-to-speech synthesis. J. C. Marcadet, Volker Fischer, Claire Waast-Richard |
| 2005 | A two-microphone diversity system and its application for hands-free car kits. Jürgen Freudenberger, Klaus Linhard |
| 2005 | A user study on the influence of mobile device class, synthesis method, data rate and lexicon on speech synthesis quality. Michael Pucher, Peter Fröhlich |
| 2005 | A wavelet based noise reduction algorithm for speech signal corrupted by coloured noise. Vladimir Braquet, Takao Kobayashi |
| 2005 | A web-based articulatory speech synthesis system for distance education. Kohichi Ogata |
| 2005 | ASR decoding in a computational model of human word recognition. Louis ten Bosch, Odette Scharenborg |
| 2005 | Abstractness in speech-metronome synchronisation: P-centres as cyclic attractors. Plínio A. Barbosa, Pablo Arantes, Alexsandro R. Meireles, Jussara M. Vieira |
| 2005 | Accent detection and speech recognition for Shanghai-accented Mandarin. Yanli Zheng, Richard Sproat, Liang Gu, Izhak Shafran, Haolang Zhou, Yi Su, Daniel Jurafsky, Rebecca Starr, Su-Youn Yoon |
| 2005 | Access for all - a talking internet service. Ove Andersen, Christian Hjulmand |
| 2005 | Acoustic analysis of Czech stress: intonation, duration and intensity revisited. Tomás Dubeda, Jan Votrubec |
| 2005 | Acoustic and phonetic confusions in accented speech recognition. Yi Liu, Pascale Fung |
| 2005 | Acoustic correlates of contrastive stress in German children. Britta Lintfert, Katrin Schneider |
| 2005 | Acoustic feedback cancellation in speech reinforcement systems for vehicles. Alfonso Ortega, Eduardo Lleida, Enrique Masgrau, Luis Buera, Antonio Miguel |
| 2005 | Acoustic properties of foreign accent: VOT variations in Moroccan-accented Italian. Laura Mori, Melissa Barkat-Defradas |
| 2005 | Acoustic/prosodic and lexical correlates of charismatic speech. Andrew Rosenberg, Julia Hirschberg |
| 2005 | Active learning with minimum expected error for spoken language understanding. Hong-Kwang Jeff Kuo, Vaibhava Goel |
| 2005 | Adapt Mandarin TTS system to Chinese dialect TTS systems. Hai Ping Li, Wei Zhang |
| 2005 | Adaptation and normalization experiments in speech recognition for 4 to 8 year old children. Daniel Elenius, Mats Blomberg |
| 2005 | Adapting dialog call-flows for pervasive devices. Nitendra Rajput, Amit Anil Nanavati, Abhishek Kumar, Neeraj Chaudhary |
| 2005 | Adaptive speech analytics: system, infrastructure, and behavior. Upendra V. Chaudhari, Ganesh N. Ramaswamy, Edward A. Epstein, Sasha Caskey, Mohamed Kamal Omar |
| 2005 | Advances in regional accent clustering in Swedish. Giampiero Salvi |
| 2005 | Advances in statistical estimation and tracking of AM-FM speech components. Athanassios Katsamanis, Petros Maragos |
| 2005 | Advances in word based dialect/accent classification. Rongqing Huang, John H. L. Hansen |
| 2005 | Aligning and recognizing spoken books in different varieties of Portuguese. Isabel Trancoso, António Joaquim Serralheiro, Céu Viana, Diamantino Caseiro |
| 2005 | Amplitude modulation of frication noise by voicing saturates. Jonathan Pincas, Philip J. B. Jackson |
| 2005 | An Amharic speech corpus for large vocabulary continuous speech recognition. Solomon Teferra Abate, Wolfgang Menzel, Bairu Tafila |
| 2005 | An acoustic segment modeling approach to automatic language identification. Bin Ma, Haizhou Li, Chin-Hui Lee |
| 2005 | An agent-based framework for speech investigation. Michael Walsh, Gregory M. P. O'Hare, Julie Carson-Berndsen |
| 2005 | An analysis of the intonational structure of stuttered speech. Timothy Arbisi-Kelm |
| 2005 | An approach to multi-strategy dialogue management. Shiu-Wah Chu, Ian M. O'Neill, Philip Hanna, Michael F. McTear |
| 2005 | An architecture for pluggable disambiguation mechanism for RDC based voice applications. Tanveer A. Faruquie, Pankaj Kankar, Nitendra Rajput, Abhishek Verma |
| 2005 | An architecture for seamless access to distributed multimodal services. David Pearce, Jonathan Engelsma, James C. Ferrans, John Johnson |
| 2005 | An articulatory study of emotional speech production. Sungbok Lee, Serdar Yildirim, Abe Kazemzadeh, Shrikanth S. Narayanan |
| 2005 | An automated linguistic knowledge-based cross-language transfer method for building acoustic models for a language without native training data. Chen Liu, Lynette Melnar |
| 2005 | An automatic intonation recognizer for the Polish language based on machine learning and expert knowledge. Mikolaj Wypych |
| 2005 | An automaton-based machine learning technique for automatic phonetic transcription. Paolo Massimino, Alberto Pacchiotti |
| 2005 | An elitist approach for extracting automatically well-realized speech sounds with high confidence. Jean-Baptiste Maj, Anne Bonneau, Dominique Fohr, Yves Laprie |
| 2005 | An embedded and concatenative approach to TTS of multiple languages. Gui-Lin Chen, Ke-Song Han, Zhen-Li Yu, Dong-Jian Yue, Yi-Qing Zu |
| 2005 | An energy search approach to variable frame rate front-end processing for robust ASR. Julien Epps, Eric H. C. Choi |
| 2005 | An error-corrective language-model adaptation for automatic speech recognition. Minwoo Jeong, Jihyun Eun, Sangkeun Jung, Gary Geunbae Lee |
| 2005 | An improved GMM-based voice quality predictor. Tiago H. Falk, Wai-Yip Chan, Peter Kabal |
| 2005 | An integration framework for a mobile multimodal dialogue system accessing the semantic web. Norbert Reithinger, Daniel Sonntag |
| 2005 | An investigation into a simulation of episodic memory for automatic speech recognition. Viktoria Maier, Roger K. Moore |
| 2005 | An n-gram-based statistical machine translation decoder. Josep Maria Crego, José B. Mariño, Adrià de Gispert |
| 2005 | An overview of nitech HMM-based speech synthesis system for blizzard challenge 2005. Heiga Zen, Tomoki Toda |
| 2005 | Analysis and modeling of fundamental frequency contours of hindi utterances. Hiroya Fujisaki, Sumio Ohno |
| 2005 | Analysis by synthesis of speech prosody: the Prozed environment. Daniel Hirst, Cyril Auran |
| 2005 | Analysis of major factors of naturalness degradation in concatenative synthesis. Toshio Hirai, Hisashi Kawai, Minoru Tsuzaki, Nobuyuki Nishizawa |
| 2005 | Analysis of spectral space reduction in spontaneous speech and its effects on speech recognition performances. Masanobu Nakamura, Koji Iwano, Sadaoki Furui |
| 2005 | Analysis of the effects of word emphasis and echo question on F0 contours of Cantonese utterances. Wentao Gu, Keikichi Hirose, Hiroya Fujisaki |
| 2005 | Analysis on command sequences of a F0 generation model for Mandarin speech and its application to their automatic extraction. Ke Li, Yoshinori Sagisaka |
| 2005 | Anatomy of an extremely fast LVCSR decoder. George Saon, Daniel Povey, Geoffrey Zweig |
| 2005 | Annotation-mining for rhythm model comparison in Brazilian portuguese. Dafydd Gibbon, Flaviane Romani Fernandes |
| 2005 | Application of a first-order differential microphone for efficient voice activity detection in a car platform. Agustín Álvarez-Marquina, Pedro Gómez, Victor Nieto Lluis, Rafael Martínez, Victoria Rodellar |
| 2005 | Application of auditory image model for speech event detection. Minoru Tsuzaki, Satomi Tanaka, Hiroaki Kato, Yoshinori Sagisaka |
| 2005 | Application of automatic speaker recognition techniques to pathological voice assessment (dysphonia). Corinne Fredouille, Gilles Pouchoulin, Jean-François Bonastre, M. Azzarello, Antoine Giovanni, Alain Ghio |
| 2005 | Application of confidence measures for dialogue systems through the use of parallel speech recognizers. David Pérez-Piñar López, Carmen García-Mateo |
| 2005 | Applications of NAM microphones in speech recognition for privacy in human-machine communication. Panikos Heracleous, Tomomi Kaino, Hiroshi Saruwatari, Kiyohiro Shikano |
| 2005 | Applying multiple regression models for predicting word duration in a corpus of spontaneous speech. Na'im R. Tyson |
| 2005 | Applying vocal tract length normalization to meeting recordings. Giulia Garau, Steve Renals, Thomas Hain |
| 2005 | Are there facial correlates of Thai syllabic tones? Hansjörg Mixdorff, Denis Burnham, Guillaume Vignali, Patavee Charnvivit |
| 2005 | Articulatory constraints and coronal stops: an EPG study. Mitsuhiro Nakamura |
| 2005 | Articulatory motivated acoustic features for speech recognition. Daniil Kocharov, András Zolnay, Ralf Schlüter, Hermann Ney |
| 2005 | Articulatory synthesis using corpus-based estimation of line spectrum pairs. Olov Engwall |
| 2005 | Artificial bandwidth extension of speech supported by watermark-transmitted side information. Bernd Geiser, Peter Jax, Peter Vary |
| 2005 | Assimilation and deletion phenomena involving word-final /n/ and word-initial /p, t, k/ in modern Greek: a codification of the observed variation intended for use in TTS synthesis. Constandinos Kalimeris, George K. Mikros, Stelios Bakamidis |
| 2005 | Asymptotically exact AM-FM decomposition based on iterated hilbert transform. Francesco Gianfelici, Giorgio Biagetti, Paolo Crippa, Claudio Turchetti |
| 2005 | Audio-video summarization of TV news using speech recognition and shot change detection. Chien-Lin Huang, Chia-Hsin Hsieh, Chung-Hsien Wu |
| 2005 | Audiovisual integration in dichotic listening. Kei Omata, Ken Mogi |
| 2005 | Audiovisual interaction on the perception of frequency glide of linear sweep tones. Kiyoaki Aikawa, Hayato Hashimoto |
| 2005 | Audiovisual production and perception of contrastive focus in French: a multispeaker study. Marion Dohen, Hélène Loevenbruck |
| 2005 | Auditory Teager energy cepstrum coefficients for robust speech recognition. Dimitrios Dimitriadis, Petros Maragos, Alexandros Potamianos |
| 2005 | Auditory image model features for automatic speech recognition. Mario E. Munich, Qiguang Lin |
| 2005 | Augmented state space acoustic decoding for modeling local variability in speech. Antonio Miguel, Eduardo Lleida, Richard C. Rose, Luis Buera, Alfonso Ortega |
| 2005 | Automated wizard-of-oz for spoken dialogue systems. Giuseppe Di Fabbrizio, Gökhan Tür, Dilek Hakkani-Tür |
| 2005 | Automatic data selection for MLP-based feature extraction for ASR. Carmen Peláez-Moreno, Qifeng Zhu, Barry Y. Chen, Nelson Morgan |
| 2005 | Automatic detection of frequent pronunciation errors made by L2-learners. Khiet P. Truong, Ambra Neri, Febe de Wet, Catia Cucchiarini, Helmer Strik |
| 2005 | Automatic detection of laughter. Khiet P. Truong, David A. van Leeuwen |
| 2005 | Automatic emotion recognition using prosodic parameters. Iker Luengo, Eva Navas, Inmaculada Hernáez, Jon Sánchez |
| 2005 | Automatic generation of domain-dependent pronunciation lexicon with data-driven rules and rule adaptation. Je Hun Jeon, Minhwa Chung |
| 2005 | Automatic music genre classification using second-order statistical measures for the prescriptive approach. Hassan Ezzaidi, Jean Rouat |
| 2005 | Automatic personal synthetic voice construction. H. Timothy Bunnell, Christopher A. Pennington, Debra Yarrington, John Gray |
| 2005 | Automatic prominence identification and prosodic typology. Fabio Tamburini |
| 2005 | Automatic speech recognition based on adaptation and clustering using temporal-difference learning. Masafumi Nishida, Yasuo Horiuchi, Akira Ichikawa |
| 2005 | Automatic speech recognition with neural spike trains. Marcus Holmberg, David Gelbart, Ulrich Ramacher, Werner Hemmert |
| 2005 | Automatic text dictation in computer-assisted translation. Shahram Khadivi, András Zolnay, Hermann Ney |
| 2005 | Automatic transcription of Czech, Russian, and Slovak spontaneous speech in the MALACH project. Josef Psutka, Pavel Ircing, Josef V. Psutka, Jan Hajic, William J. Byrne, Jirí Mírovský |
| 2005 | Automatic voice-source parameterization of natural speech. Javier Pérez, Antonio Bonafonte |
| 2005 | BNSI Slovenian broadcast news database - speech and text corpus. Andrej Zgank, Darinka Verdonik, Aleksandra Zögling Markus, Zdravko Kacic |
| 2005 | Back-channel feedback generation using linguistic and nonlinguistic information and its application to spoken dialogue system. Shinya Fujie, Kenta Fukushima, Tetsunori Kobayashi |
| 2005 | Background model based posterior probability for measuring confidence. Peng Liu, Ye Tian, Jian-Lai Zhou, Frank K. Soong |
| 2005 | Bandwidth expansion of narrowband speech using non-negative matrix factorization. Dhananjay Bansal, Bhiksha Raj, Paris Smaragdis |
| 2005 | Bayes risk minimization using metric loss functions. Ralf Schlüter, T. Scharrenbach, Volker Steinbiss, Hermann Ney |
| 2005 | Bayesian learning for latent semantic analysis. Jen-Tzung Chien, Meng-Sung Wu, Chia-Sheng Wu |
| 2005 | Bilingual aligned corpora for speech to speech translation for Spanish, English and Catalan. David Conejero, Alan Lounds, Carmen García-Mateo, Leandro Rodríguez Liñares, Raquel Mochales, Asunción Moreno |
| 2005 | Binaural feature selection for missing data speech recognition. Sue Harding, Jon P. Barker, Guy J. Brown |
| 2005 | Bootstrapping pronunciation dictionaries: practical issues. Marelie H. Davel, Etienne Barnard |
| 2005 | Broadcast news speaker tracking for ESTER 2005 campaign. Dan Istrate, Nicolas Scheffer, Corinne Fredouille, Jean-François Bonastre |
| 2005 | Building continuous space language models for transcribing european languages. Holger Schwenk, Jean-Luc Gauvain |
| 2005 | Building topic specific language models from webdata using competitive models. Abhinav Sethy, Panayiotis G. Georgiou, Shrikanth S. Narayanan |
| 2005 | Can we retrieve vocal tract dynamics that produced speech? toward a speaker articulatory strategy model. Slim Ouni |
| 2005 | Channel robust speaker verification via Bayesian blind stochastic feature transformation. Kwok-Kwong Yiu, Man-Wai Mak, Sun-Yuan Kung |
| 2005 | Characterising dialogue call-flows for pervasive environments. Amit Anil Nanavati, Nitendra Rajput |
| 2005 | Chinese prosodic phrasing with a constraint-based approach. Honghui Dong, Jianhua Tao, Bo Xu |
| 2005 | Choosing a scale for measuring perceived prominence. Christian Jensen, John Tøndering |
| 2005 | Clarification questions to improve dialogue flow and speech recognition in spoken dialogue systems. Ulf Krum, Hartwig Holzapfel, Alex Waibel |
| 2005 | Class-based variable memory length Markov model. Shinsuke Mori, Gakuto Kurata |
| 2005 | Class-dependent score combination for speaker recognition. Luciana Ferrer, M. Kemal Sönmez, Sachin S. Kajarekar |
| 2005 | Classical and novel discriminant features for affect recognition from speech. Raul Fernandez, Rosalind W. Picard |
| 2005 | Cluster-based modeling for ubiquitous speech recognition. Sadaoki Furui, Tomohisa Ichiba, Takahiro Shinozaki, Edward W. D. Whittaker, Koji Iwano |
| 2005 | Codec integrated voice conversion for embedded speech synthesis. Guntram Strecha, Oliver Jokisch, Matthias Eichner, Rüdiger Hoffmann |
| 2005 | Collaborative voice activity detection for hearing aids. Louisa Busca Grisoni, John H. L. Hansen |
| 2005 | Comb filter decomposition for robust ASR. Lech Szymanski, Martin Bouchard |
| 2005 | Combination of classifiers for automatic recognition of dialog acts. Pavel Král, Christophe Cerisara, Jana Klecková |
| 2005 | Combining models of prosodic phrasing and pausing. Tina Burrows, Peter Jackson, Katherine M. Knill, Dmitry Sityaev |
| 2005 | Combining multi-source far distance speech recognition strategies: beamforming, blind channel and confusion network combination. Matthias Wölfel, John W. McDonough |
| 2005 | Combining packet loss compensation methods for robust distributed speech recognition. Alastair Bruce James, Ben Milner |
| 2005 | Combining speaker identification and BIC for speaker diarization. Xuan Zhu, Claude Barras, Sylvain Meignier, Jean-Luc Gauvain |
| 2005 | Combining the flexibility of speech synthesis with the naturalness of pre-recorded audio: a comparison of two approaches to phrase-splicing TTS. Wael Hamza, John F. Pitrelli |
| 2005 | Combining voiceprint and face biometrics for speaker identification using SDWS. Dongdong Li, Yingchun Yang, Zhaohui Wu |
| 2005 | Communicative speech synthesis using constituent word attributes. Yoko Greenberg, Minoru Tsuzaki, Hiroaki Kato, Yoshinori Sagisaka |
| 2005 | Comparative objective and subjective evaluation of three data-driven techniques for proper name pronunciation. Tasanawan Soonklang, Robert I. Damper, Yannick Marchand |
| 2005 | Comparing ASR modeling methods for spoken dialogue simulation and optimal strategy learning. Olivier Pietquin, Richard Beaufort |
| 2005 | Comparing HMM, maximum entropy, and conditional random fields for disfluency detection. Yang Liu, Elizabeth Shriberg, Andreas Stolcke, Mary P. Harper |
| 2005 | Comparing lexical, acoustic/prosodic, structural and discourse features for speech summarization. Sameer Maskey, Julia Hirschberg |
| 2005 | Comparing several models for perceptual long-term modeling of amplitude and phase trajectories of sinusoidal speech. Mohammad Firouzmand, Laurent Girin, Sylvain Marchand |
| 2005 | Comparing spectral distance measures for join cost optimization in concatenative speech synthesis. Ingmund Bjrkan, Torbjørn Svendsen, Snorre Farner |
| 2005 | Comparing tongue positions of vowels in oral and nasal contexts. Takayuki Arai |
| 2005 | Comparison of different phone-based spoken document retrieval methods with text and spoken queries. Nicolas Moreau, Shan Jin, Thomas Sikora |
| 2005 | Comparison of keyword spotting approaches for informal continuous speech. Igor Szöke, Petr Schwarz, Pavel Matejka, Lukás Burget, Martin Karafiát, Michal Fapso, Jan Cernocký |
| 2005 | Comparison of low footprint acoustic modeling techniques for embedded ASR systems. Jussi Leppänen, Imre Kiss |
| 2005 | Compound rises and "uptalk" in spoken English. Janet Fletcher |
| 2005 | Comprehensive modulation representation for automatic speech recognition. Yadong Wang, Steven Greenberg, Jayaganesh Swaminathan, Ramdas Kumaresan, David Poeppel |
| 2005 | Conceiving a new sequence kernel and applying it to SVM speaker verification. Jérôme Louradour, Khalid Daoudi |
| 2005 | Conceptual language model design for spoken language understanding. Catherine Kobus, Géraldine Damnati, Lionel Delphin-Poulat, Renato De Mori |
| 2005 | Confidence measures in speech recognition based on probability distribution of likelihoods. Joel Pinto, R. N. V. Sitaram |
| 2005 | Confidence scoring and rejection using multi-pass speech recognition. Vincent Vanhoucke |
| 2005 | Confronting HMM-based phone labelling with human evaluation of speech production. Jan Volín, Radek Skarnitzl, Petr Pollák |
| 2005 | Considering speech quality in speaker verification fusion. Yosef A. Solewicz, Moshe Koppel |
| 2005 | Constraints on the acquisition of simplex and complex words in German. Angela Grimm, Jochen Trommer |
| 2005 | Constructing family trees of multilingual speech using Gaussian mixture models. Shuichi Itahashi, Shiwei Zhu, Mikio Yamamoto |
| 2005 | Construction and utilization of bilingual speech corpus for simultaneous machine interpretation research. Hitomi Tohyama, Shigeki Matsubara, Nobuo Kawaguchi, Yasuyoshi Inagaki |
| 2005 | Construction method of acoustic models dealing with various background noises based on combination of HMMs. Motoyuki Suzuki, Yusuke Kato, Akinori Ito, Shozo Makino |
| 2005 | Context in multi-lingual tone and pitch accent recognition. Gina-Anne Levow |
| 2005 | Context-dependent word duration modelling for robust speech recognition. Ning Ma, Phil D. Green |
| 2005 | Context-sensitive statistical language modeling. Alexander Gruenstein, Chao Wang, Stephanie Seneff |
| 2005 | Contextual constraints based on dialogue models in database search task for spoken dialogue systems. Kazunori Komatani, Naoyuki Kanda, Tetsuya Ogata, Hiroshi G. Okuno |
| 2005 | Contextual effect on perception of lexical tones in Cantonese. Joan K.-Y. Ma, Valter Ciocca, Tara L. Whitehill |
| 2005 | Continuous local codebook features for multi- and cross-lingual acoustic phonetic modelling. Frank Diehl, Asunción Moreno, Enric Monte |
| 2005 | Corpus-based extraction of F0 contour generation process model parameters. Keikichi Hirose, Yusuke Furuyama, Nobuaki Minematsu |
| 2005 | Correlating student acoustic-prosodic profiles with student learning in spoken tutoring dialogues. Katherine Forbes-Riley, Diane J. Litman |
| 2005 | Covariation of subglottal pressure, F0 and intensity. Gunnar Fant, Anita Kruckenberg |
| 2005 | Creating an ongoing research capability in speech technology for two minority languages: experiences from the WISPR project. Briony Williams, Delyth Prys, Ailbhe Ní Chasaide |
| 2005 | Cross-language perception of word stress. Hansjörg Mixdorff, Yu Hu |
| 2005 | Cross-language synthesis with a polyglot synthesizer. Javier Latorre, Koji Iwano, Sadaoki Furui |
| 2005 | Cross-linguistic comparison of two-year-old children's acoustic vowel spaces: contrasting Hungarian with dutch. Krisztina Zajdó, Jeannette M. van der Stelt, Ton G. Wempe, Louis C. W. Pols |
| 2005 | Cross-speaker articulatory position data for phonetic feature prediction. Arthur R. Toth, Alan W. Black |
| 2005 | Crosslingual and bilingual speech recognition with Slovak and Czech speechdat-e databases. Slavomír Lihan, Jozef Juhár, Anton Cizmar |
| 2005 | Customizing base unit set with speech database in TTS systems. Yining Chen, Yong Zhao, Min Chu |
| 2005 | Czech spontaneous speech corpus with structural metadata. Jáchym Kolár, Jan Svec, Stephanie M. Strassel, Christopher Walker, Dagmar Kozlíková, Josef Psutka |
| 2005 | Czech voiced labiodental continuant discrimination from basic acoustic data. Radek Skarnitzl, Jan Volín |
| 2005 | Data collection and evaluation of speech recognition for motorbike riders. H. Tanaka, Hiroshi Fujimura, Chiyomi Miyajima, Takanori Nishino, Katunobu Itou, Kazuya Takeda |
| 2005 | Data driven subword unit modeling for speech recognition and its application to interactive reading tutors. Andreas Hagen, Bryan L. Pellom |
| 2005 | Data sampling for improved speech recognizer training. Takahiro Shinozaki, Mari Ostendorf, Les E. Atlas |
| 2005 | Data-driven clustering for blind feature mapping in speaker verification. Michael Mason, Robbie Vogt, Brendan Baker, Sridha Sridharan |
| 2005 | Data-driven synthesis of expressive visual speech using an MPEG-4 talking head. Jonas Beskow, Mikael Nordenberg |
| 2005 | Decision trees with improved efficiency for fast speaker verification. Gilles Gonon, Rémi Gribonval, Frédéric Bimbot |
| 2005 | Denoising through source separation and minimum tracking. Sriram Srinivasan, Mattias Nilsson, W. Bastiaan Kleijn |
| 2005 | Deriving a bi-lingual dictionary from raw transcription data. Peter Juel Henrichsen |
| 2005 | Design and collection of Czech Lombard speech database. Hynek Boril, Petr Pollák |
| 2005 | Design of a voice-enabled interface for real-time access to stock exchange from a PDA through GPRS. Darío Martín-Iglesias, Yago Pereiro-Estevan, Ana I. García-Moral, Ascensión Gallardo-Antolín, Fernando Díaz-de-María |
| 2005 | Design of bandwidth scalable LSF quantization using interframe and intraframe prediction. Hiroyuki Ehara, Toshiyuki Morii, Masahiro Oshikiri, Koji Yoshida, Kouichi Honma |
| 2005 | Designing multiple distinctive phonetic feature extractors for canonicalization by using clustering technique. Takashi Fukuda, Muhammad Ghulam, Tsuneo Nitta |
| 2005 | Detecting Politeness and frustration state of a child in a conversational computer game. Serdar Yildirim, Chul Min Lee, Sungbok Lee, Alexandros Potamianos, Shrikanth S. Narayanan |
| 2005 | Detecting certainness in spoken tutorial dialogues. Jackson Liscombe, Julia Hirschberg, Jennifer J. Venditti |
| 2005 | Detection of acoustic change-points in audio records via global BIC maximization and dynamic programming. Jindrich Zdánský, Jan Nouza |
| 2005 | Detection of coughs from user utterances using imitated phoneme model. Shinya Takahashi, Tsuyoshi Morimoto, Sakashi Maeda, Naoyuki Tsuruta |
| 2005 | Detection of hypernasality using statistical pattern classifiers. P. Vijayalakshmi, M. Ramasubba Reddy |
| 2005 | Detection of real-life emotions in call centers. Laurence Vidrascu, Laurence Devillers |
| 2005 | Detection of recognition errors based on classifiers trained on artificially created data. Tomás Bartos, Ludek Müller |
| 2005 | Detection of vowel onset point events using excitation information. S. R. Mahadeva Prasanna, B. Yegnanarayana |
| 2005 | Developing and enhancing posterior based speech recognition systems. Hamed Ketabdar, Jithendra Vepa, Samy Bengio, Hervé Bourlard |
| 2005 | Developing extensible and reusable spoken dialogue components: an examination of the Queen's communicator. Philip Hanna, Ian M. O'Neill, Xingkun Liu, Michael F. McTear |
| 2005 | Development and evaluation of a spoken dialog system to access a newspaper web site. César González Ferreras, Valentín Cardeñoso-Payo |
| 2005 | Development of a Cantonese-English code-mixing speech corpus. Joyce Y. C. Chan, P. C. Ching, Tan Lee |
| 2005 | Development of a Kiswahili text to speech system. Mucemi Gakuru, Frederick K. Iraki, Roger C. F. Tucker, Ksenia Shalonova, Kamanda Ngugi |
| 2005 | Development of a conversational telephone speech recognizer for Levantine Arabic. Dimitra Vergyri, Katrin Kirchhoff, Venkata Ramana Rao Gadde, Andreas Stolcke, Jing Zheng |
| 2005 | Developmental change of phoneme duration in a Japanese infant and mother. Shigeaki Amano |
| 2005 | Diachronic vocabulary adaptation for broadcast news transcription. Alexandre Allauzen, Jean-Luc Gauvain |
| 2005 | Dialogue strategy to clarify user's queries for document retrieval system with speech interface. Teruhisa Misu, Tatsuya Kawahara |
| 2005 | Different size multilingual phone inventories and context-dependent acoustic models for language identification. Dong Zhu, Martine Adda-Decker, Fabien Antoine |
| 2005 | Directionally constrained minimization of power algorithm for speech signals. Takahiro Murakami, Kiyoshi Kurihara, Yoshihisa Ishida |
| 2005 | Discontinuity detection in concatenated speech synthesis based on nonlinear speech analysis. Yannis Pantazis, Yannis Stylianou, Esther Klabbers |
| 2005 | Discrimination between singing and speaking voices. Yasunori Ohishi, Masataka Goto, Katunobu Itou, Kazuya Takeda |
| 2005 | Discrimination of speech, musical instruments and singing voices using the temporal patterns of sinusoidal segments in audio signals. Toru Taniguchi, Akishige Adachi, Shigeki Okawa, Masaaki Honda, Katsuhiko Shirai |
| 2005 | Discriminative maximum entropy language model for speech recognition. Chuang-Hua Chueh, To-Chang Chien, Jen-Tzung Chien |
| 2005 | Discriminative speaker adaptation with eigenvoices. Jun Luo, Zhijian Ou, Zuoying Wang |
| 2005 | Discriminative training and support vector machine for natural language call routing. Imed Zitouni, Hui Jiang, Qiru Zhou |
| 2005 | Discriminative training of finite state decoding graphs. Shiuan-Sung Lin, François Yvon |
| 2005 | Discriminatively trained features using fMPE for multi-stream audio-visual speech recognition. Jing Huang, Daniel Povey |
| 2005 | Distinctive feature based SVM discriminant features for improvements to phone recognition on telephone band speech. Sarah Borys, Mark Hasegawa-Johnson |
| 2005 | Distinguishing deceptive from non-deceptive speech. Julia Hirschberg, Stefan Benus, Jason M. Brenier, Frank Enos, Sarah Friedman, Sarah Gilman, Cynthia Girand, Martin Graciarena, Andreas Kathol, Laura A. Michaelis, Bryan L. Pellom, Elizabeth Shriberg, Andreas Stolcke |
| 2005 | Distortion measures for vector quantization of noisy spectrum. Volodya Grancharov, Jonas Samuelsson, W. Bastiaan Kleijn |
| 2005 | Distributed ASR using speech coder data for efficient feature vector representation. Trond Skogstad, Torbjørn Svendsen |
| 2005 | Distributed dialogue management for smart terminal devices. Esa-Pekka Salonen, Markku Turunen, Jaakko Hakulinen, Leena Helin, Perttu Prusi, Anssi Kainulainen |
| 2005 | Distributed speaker recognition using speaker-dependent VQ codebook and earth mover's distance. Shingo Kuroiwa, Yoshiyuki Umeda, Satoru Tsuge, Fuji Ren |
| 2005 | Do speech recognizers prefer female speakers? Martine Adda-Decker, Lori Lamel |
| 2005 | Document driven machine translation enhanced ASR. Matthias Paulik, Christian Fügen, Sebastian Stüker, Tanja Schultz, Thomas Schaaf, Alex Waibel |
| 2005 | Does active learning help automatic dialog act tagging in meeting data? Anand Venkataraman, Yang Liu, Elizabeth Shriberg, Andreas Stolcke |
| 2005 | Does narrow focus activate alternative referents? Bettina Braun, Andrea Weber, Matthew W. Crocker |
| 2005 | Does vowel space size depend on language vowel inventories? evidence from two Arabic dialects and French. Jalal-Eddin Al-Tamimi, Emmanuel Ferragne |
| 2005 | Downstep effect on disyllabic words of citation forms in standard Chinese. Ziyu Xiong |
| 2005 | Duration and the temporal structure of Mandarin discourse. Li-chiung Yang |
| 2005 | Duration modeling and memory optimization in a Mandarin TTS system. Jilei Tian, Jani Nurminen, Imre Kiss |
| 2005 | Duration, intensity and pause predictions in relation to prosody organization. Chiu-yu Tseng, Bau-Ling Fu |
| 2005 | Duration-embedded bi-HMM for expressive voice conversion. Chi-Chun Hsia, Chung-Hsien Wu, Te-Hsien Liu |
| 2005 | Durational characteristics of Korean Lombard speech. Sunhee Kim |
| 2005 | Dynamic language model adaptation using variational Bayes inference. Yik-Cheung Tam, Tanja Schultz |
| 2005 | Dynamic programming based segmentation approach to LSF matrix reconstruction. Anindya Sarkar, T. V. Sreenivas |
| 2005 | Ecological language acquisition via incremental model-based clustering. Giampiero Salvi |
| 2005 | Effect of head orientation on the speaker localization performance in smart-room environment. Alberto Abad, Dusan Macho, Carlos Segura, Javier Hernando, Climent Nadeu |
| 2005 | Effective topic-tree based language model adaptation. Javier Dieguez-Tirado, Carmen García-Mateo, Antonio Cardenal López |
| 2005 | Effects of Bayesian predictive classification using variational Bayesian posteriors for sparse training data in speech recognition. Shinji Watanabe, Atsushi Nakamura |
| 2005 | Effects of F0 feedback on the learning of Chinese tones by native speakers of English. Felicia Zhang, Michael Wagner |
| 2005 | Effects of cortical and subcortical brain damage on the processing of emotional prosody. Marc D. Pell |
| 2005 | Effects of pitch accent type on interpreting information status in synthetic speech. Aoju Chen, Els den Os |
| 2005 | Effects of raddoppiamento sintattico on tonal alignment in Italian. Caterina Petrone |
| 2005 | Efficient blind dereverberation framework for automatic speech recognition. Keisuke Kinoshita, Tomohiro Nakatani, Masato Miyoshi |
| 2005 | Efficient pitch-based estimation of VTLN warp factors. Arlo Faria, David Gelbart |
| 2005 | Efficient speaker identification and retrieval. Hagai Aronowitz, David Burshtein |
| 2005 | Eigen-environment based noise compensation method for robust speech recognition. Hwa Jeon Song, Hyung Soon Kim |
| 2005 | Embedded Cantonese TTS for multi-device access to web content. Tien Ying Fung, Yuk-Chi Li, Eddie Sio, Icarus Lee, Helen M. Meng, P. C. Ching |
| 2005 | Embedding grammars into statistical language models. Harald Hning, Manuel Kirschner, Fritz Class, André Berton, Udo Haiber |
| 2005 | Emofilt: the simulation of emotional speech by prosody-transformation. Felix Burkhardt |
| 2005 | Emotional FESTIVAL-MBROLA TTS synthesis. Fabio Tesser, Piero Cosi, Carlo Drioli, Graziano Tisato |
| 2005 | Emotions in dubbed speech: an intercultural approach with respect to F0. Angelika Braun, Matthias Katerbow |
| 2005 | Energy-based frame selection for reliable feature normalization and transformation in robust speech recognition. Yi Chen, Lin-Shan Lee |
| 2005 | Enhanced speech coding based on phonetic class segmentation. Adriane Swalm Durey, Venkatesh Krishnan, Thomas P. Barnwell III |
| 2005 | Enhancement of mel log-power spectrum of speech using particle filtering. Ilyas Potamitis, Nikolaos D. Fakotakis |
| 2005 | Environment-independent mask estimation for missing-feature reconstruction. Wooil Kim, Richard M. Stern, Hanseok Ko |
| 2005 | Environmental compensation using ASR model adaptation by a Bayesian parametric representation method. Xuechuan Wang, Douglas D. O'Shaughnessy |
| 2005 | Estimation of LF glottal source parameters based on an ARX model. Damien Vincent, Olivier Rosec, Thierry Chonavel |
| 2005 | Estimation of intonation variation with constrained tone transformations. Jinfu Ni, Hisashi Kawai, Keikichi Hirose |
| 2005 | Estimation of speaker's height and vocal tract length from speech signal. Sorin Dusan |
| 2005 | Estimation of the acoustic properties of the nasal tract during the production of nasalized vowels. Xiaochuan Niu, Alexander Kain, Jan P. H. van Santen |
| 2005 | Evaluating communication effectiveness in team collaboration. Julie A. Parisi, Douglas Brungart |
| 2005 | Evaluating the DI@l-log system on a cohort of elderly, diabetic patients: results from a preliminary study. Lesley-Ann Black, Michael F. McTear, Norman D. Black, Roy Harper, Michelle Lemon |
| 2005 | Evaluating the pronunciation of proper names by four French grapheme-to-phoneme converters. Philippe Boula de Mareüil, Christophe d'Alessandro, Gérard Bailly, Frédéric Béchet, Marie-Neige Garcia, Michel Morel, Romain Prudon, Jean Véronis |
| 2005 | Evaluation and optimization of noise robust front-end technologies for the automatic recognition of Hungarian telephone speech. Péter Mihajlik, Zoltán Tobler, Zoltán Tüske, Géza Gordos |
| 2005 | Evaluation of VTLN-based voice conversion for embedded speech synthesis. David Sündermann, Guntram Strecha, Antonio Bonafonte, Harald Höge, Hermann Ney |
| 2005 | Evaluation of a long-contextual-Span hidden trajectory model and phonetic recognizer using a* lattice search. Dong Yu, Li Deng, Alex Acero |
| 2005 | Evaluation of a system for F0 contour prediction for european Portuguese. João Paulo Ramos Teixeira, Diamantino Freitas, Hiroya Fujisaki |
| 2005 | Experiments on speaker profile portability. Vincent Barreaud, Douglas D. O'Shaughnessy, Jean-Guy Dahan |
| 2005 | Experiments on speaker tracking and segmentation in radio broadcast news. Daniel Moraru, Mathieu Ben, Guillaume Gravier |
| 2005 | Experiments with probabilistic principal component analysis in LVCSR. Mike Schuster, Takaaki Hori, Atsushi Nakamura |
| 2005 | Explicit segmentation of speech based on frequency-domain AR modeling. T. Nagarajan, Douglas D. O'Shaughnessy |
| 2005 | Exploiting large quantities of spontaneous speech for unsupervised training of acoustic models. Bhuvana Ramabhadran |
| 2005 | Exploiting passage retrieval for n-best rescoring of spoken questions. Tomoyosi Akiba, Hiroyuki Abe |
| 2005 | Exploiting unlabeled data using multiple classifiers for improved natural language call-routing. Ruhi Sarikaya, Hong-Kwang Jeff Kuo, Vaibhava Goel, Yuqing Gao |
| 2005 | Exploration of different types of intonational deviations in foreign-accented and synthesized speech. Matthias Jilka |
| 2005 | Exploratory analysis of linguistic data based on genetic algorithm for robust modeling of the segmental duration of speech. Edmilson Morais, Fábio Violaro |
| 2005 | Extended baum-welch reestimation of Gaussian mixture models based on reverse Jensen inequality. Mohamed Afify |
| 2005 | Extraction of relevant speech features using the information bottleneck method. Ron M. Hecht, Naftali Tishby |
| 2005 | Extractive summarization of meeting recordings. Gabriel Murray, Steve Renals, Jean Carletta |
| 2005 | F0 estimation for adult and children's speech. John-Paul Hosom |
| 2005 | F0 stylisation with a free-knot b-spline model and simulated-annealing optimization. Nelly Barbot, Olivier Boëffard, Damien Lolive |
| 2005 | FSM and k-nearest-neighbor for corpus based video-realistic audio-visual synthesis. Christian Weiss |
| 2005 | Factors in classification of stop consonant place of articulation. Atiwong Suchato, Proadpran Punyabukkana |
| 2005 | Fast confidence measure algorithm for continuous speech recognition. Bin Dong, Qingwei Zhao, Yonghong Yan |
| 2005 | Fast unsupervised speaker adaptation through a discriminative eigen-MLLR algorithm. Bart Bakker, Carsten Meyer, Xavier L. Aubert |
| 2005 | Fast vocabulary-independent audio search using path-based graph indexing. Olivier Siohan, Michiel Bacchiani |
| 2005 | Feature adaptation using projection of Gaussian posteriors. Karthik Visweswariah, Peder A. Olsen |
| 2005 | Feature compensation based on switching linear dynamic model and soft decision. Woohyung Lim, Bong Kyoung Kim, Nam Soo Kim |
| 2005 | Filled pauses as cues to the complexity of following phrases. Michiko Watanabe, Keikichi Hirose, Yasuharu Den, Nobuaki Minematsu |
| 2005 | Fine-tuning speech registers: a comparison of the prosodic features of child-directed and foreigner-directed speech. Sonja Biersack, Vera Kempe, Lorna Knapton |
| 2005 | Finite-state transducer inference for a speech-input Portuguese-to-English machine translation system. David Picó, Jorge González, Francisco Casacuberta, Diamantino Caseiro, Isabel Trancoso |
| 2005 | Fixed distortion segmentation in efficient sound segment searching. Masahide Sugiyama |
| 2005 | Flavors of Gaussian warping. Pierre Ouellet, Gilles Boulianne, Patrick Kenny |
| 2005 | Flavoured acoustic model and combined spelling to sound for asymmetrical bilingual environment. R. Lejeune, J. Baude, C. Tchong, Hubert Crepy, Claire Waast-Richard |
| 2005 | Focal speakers: a speaker selection method able to deal with heterogeneous similarity criteria. Sacha Krstulovic, Frédéric Bimbot, Delphine Charlet, Olivier Boëffard |
| 2005 | Focused word segmentation for ASR. Amarnag Subramanya, Jeff A. Bilmes, Chia-Ping Chen |
| 2005 | Foreign accents in synthetic speech: development and evaluation. Laura Mayfield Tomokiyo, Alan W. Black, Kevin A. Lenzo |
| 2005 | Formant frequency prediction from MFCC vectors in noisy environments. Jonathan Darch, Ben P. Milner, Saeed Vaseghi |
| 2005 | Formant-tracking linear prediction models for speech processing in noisy environments. Qin Yan, Saeed Vaseghi, Esfandiar Zavarehei, Ben P. Milner |
| 2005 | Frame based model order selection of spectral envelopes. Matthias Wölfel |
| 2005 | Frequency-domain auditory suppression modelling (FASM) - a WDFT-based anthropomorphic noise-robust feature extraction algorithm for speech recognition. Alexei V. Ivanov, Marek Parfieniuk, Alexander A. Petrovsky |
| 2005 | From question answering to spoken dialogue: towards an information search assistant for interactive multimodal information extraction. Rieks op den Akker, Harry Bunt, Simon Keizer, Boris W. van Schooten |
| 2005 | From robust spoken language understanding to knowledge acquisition and management. Luís Seabra Lopes, António J. S. Teixeira, Marcelo Quinderé, Mário Rodrigues |
| 2005 | Fully automated non-native speech recognition using confusion-based acoustic model integration. Ghazi Bouselmi, Dominique Fohr, Irina Illina, Jean Paul Haton |
| 2005 | Fully automated system for Czech spoken broadcast transcription with very large (300k+) lexicon. Jan Nouza, Jindrich Zdánský, Petr David, Petr Cerva, Jan Kolorenc, Dana Nejedlová |
| 2005 | Fundamental frequency and tone in isizulu: initial experiments. Natasha Govender, Etienne Barnard, Marelie H. Davel |
| 2005 | Fundamental frequency and voicing prediction from MFCCs for speech reconstruction from unconstrained speech. Ben Milner, Xu Shao, Jonathan Darch |
| 2005 | Fundamental frequency estimation by least-squares harmonic model fitting. András Bánhalmi, Kornél Kovács, András Kocsor, László Tóth |
| 2005 | Gaussian elimination algorithm for HMM complexity reduction in continuous speech recognition systems. Glauco F. G. Yared, Fábio Violaro, Lívio C. Sousa |
| 2005 | Gaussian mixture modelling of broad phonetic and syllabic events for text-independent speaker verification. Brendan Baker, Robbie Vogt, Sridha Sridharan |
| 2005 | Gender in everyday speech and language: a corpus-based study. Diana Binnenpoorte, Christophe Van Bael, Els den Os, Lou Boves |
| 2005 | Generalized envelope matching technique for time-scale modification of speech (GEM-TSM). Atsuhiro Sakurai |
| 2005 | Generalized fast on-the-fly composition algorithm for WFST-based speech recognition. Takaaki Hori, Atsushi Nakamura |
| 2005 | Generalized filter-bank equalizer for noise reduction with reduced signal delay. Heinrich W. Löllmann, Peter Vary |
| 2005 | Generalized hebbian algorithm for incremental latent semantic analysis. Genevieve Gorrell, Brandyn Webb |
| 2005 | Generation of fundamental frequency contours for Mandarin speech synthesis based on tone nucleus model. Qinghua Sun, Keikichi Hirose, Wentao Gu, Nobuaki Minematsu |
| 2005 | Generation of word alternative pronunciations using weighted finite state transducers. Sérgio Paulo, Luís C. Oliveira |
| 2005 | Genetic triangulation of graphical models for speech and language processing. Chris D. Bartels, Kevin Duh, Jeff A. Bilmes, Katrin Kirchhoff, Simon King |
| 2005 | Gradually changing expression of singing voice based on morphing. Tomoko Yonezawa, Noriko Suzuki, Kenji Mase, Kiyoshi Kogure |
| 2005 | Grapheme-to-phoneme conversion based on TBL algorithm in Mandarin TTS system. Min Zheng, Qin Shi, Wei Zhang, Lianhong Cai |
| 2005 | Great expectations - introspective vs. perceptual prominence ratings and their acoustic correlates. Petra Wagner |
| 2005 | Group delay function as a means to assess quality of glottal inverse filtering. Paavo Alku, Matti Airas, Tom Bäckström, Hannu Pulakka |
| 2005 | Growing an n-gram language model. Vesa Siivola, Bryan L. Pellom |
| 2005 | HMM-based european Portuguese TTS system. Maria João Barros, Ranniery Maia, Keiichi Tokuda, Fernando Gil Resende, Diamantino Freitas |
| 2005 | Harmonic filtering for joint estimation of pitch and voiced source with single-microphone input. Siu Wa Lee, Frank K. Soong, Pak-Chung Ching |
| 2005 | Hidden Markov models for grapheme to phoneme conversion. Paul Taylor |
| 2005 | Hidden conditional random fields for phone classification. Asela Gunawardana, Milind Mahajan, Alex Acero, John C. Platt |
| 2005 | Hierarchical clustering of mixture tying using a partially observable Markov decision process. Michael Jonas, James G. Schmolze |
| 2005 | Hierarchical language models for one-stage speech interpretation. Matthias Thomae, Tibor Fábián, Robert Lieb, Günther Ruske |
| 2005 | Hierarchical topic organization and visual presentation of spoken documents using probabilistic latent semantic analysis (PLSA) for efficient retrieval/browsing applications. Te-Hsuan Li, Ming-Han Lee, Berlin Chen, Lin-Shan Lee |
| 2005 | High quality Spanish restricted-domain TTS oriented to a weather forecast application. Francesc Alías, Ignasi Iriondo Sanz, Lluís Formiga, Xavier Gonzalvo, Carlos Monzo, Xavier Sevillano |
| 2005 | High-density discrete HMM with the use of scalar quantization indexing. Brian Mak, Jeff Siu-Kei Au-Yeung, Yiu-Pong Lai, Man-Hung Siu |
| 2005 | High-quality memoryless subband coding of impulse responses at 22 bits per frame. Jan S. Erkelens |
| 2005 | High-resolution noise-robust spectral-based pitch estimation. Marián Képesi, Luis Weruaga |
| 2005 | Histogram-based quantization (HQ) for robust and scalable distributed speech recognition. Chia-Yu Wan, Lin-Shan Lee |
| 2005 | Hybrid syllable/triphone speech synthesis. Jindrich Matousek, Zdenek Hanzlícek, Daniel Tihelka |
| 2005 | INTERFACE: a new tool for building emotive/expressive talking heads. Graziano Tisato, Piero Cosi, Carlo Drioli, Fabio Tesser |
| 2005 | IR-based classification of customer-agent phone calls. Arjan van Hessen, Jaap Hinke |
| 2005 | Identifying singers of popular songs. Tin Lay Nwe, Haizhou Li |
| 2005 | Impact of duration on F1/F2 formant values of oral vowels: an automatic analysis of large broadcast news corpora in French and German. Cédric Gendrot, Martine Adda-Decker |
| 2005 | Implementing frequency-warping and VTLN through linear transformation of conventional MFCC. Srinivasan Umesh, András Zolnay, Hermann Ney |
| 2005 | Implicit control of noise canceller for speech enhancement. Julien Bourgeois, Jürgen Freudenberger, Guillaume Lathoud |
| 2005 | Improved "TEO" feature-based automatic stress detection using physiological and acoustic speech sensors. Evan Ruzanski, John H. L. Hansen, Don Finan, James Meyerhoff, William Norris, Terry Wollert |
| 2005 | Improved MLP structures for data-driven feature extraction for ASR. Qifeng Zhu, Barry Y. Chen, Frantisek Grézl, Nelson Morgan |
| 2005 | Improved blind dereverberation performance by using spatial information. Marc Delcroix, Takafumi Hikichi, Masato Miyoshi |
| 2005 | Improved covariance modeling for GMM in speaker identification. Xi Zhou, Zhiqiang Yao, Beiqian Dai |
| 2005 | Improved decision directed approach for speech enhancement using an adaptive time segmentation. Richard C. Hendriks, Richard Heusdens, Jesper Jensen |
| 2005 | Improved discriminative training using phone lattices. Jing Zheng, Andreas Stolcke |
| 2005 | Improved noise-robustness in distributed speech recognition via perceptually-weighted vector quantisation of filterbank energies. Stephen So, Kuldip K. Paliwal |
| 2005 | Improved semi-dynamic network decoding using WFSTs. Dong-Hoon Ahn, Su-Byeong Oh, Minhwa Chung |
| 2005 | Improved speech recognition word lattice translation by confidence measure. Abdulvohid Bozarov, Yoshinori Sagisaka, Ruiqiang Zhang, Gen-ichiro Kikui |
| 2005 | Improved spontaneous Mandarin speech recognition by disfluency interruption point (IP) detection using prosodic features. Che-Kuang Lin, Lin-Shan Lee |
| 2005 | Improvement of rejection performance of keyword spotting using anti-keywords derived from large vocabulary considering acoustical similarity to keywords. Makoto Yamada, Tsuneo Kato, Masaki Naito, Hisashi Kawai |
| 2005 | Improvements to fMPE for discriminative training of features. Daniel Povey |
| 2005 | Improvements to the BBN RT04 Mandarin conversational telephone speech recognition system. Jeff Z. Ma, Spyros Matsoukas |
| 2005 | Improving end-to-end performance of call classification through data confusion reduction and model tolerance enhancement. Cheng Wu, Xiang Li, Hong-Kwang Jeff Kuo, E. E. Jan, Vaibhava Goel, David M. Lubensky |
| 2005 | Improving lip-reading with feature space transforms for multi-stream audio-visual speech recognition. Jing Huang, Karthik Visweswariah |
| 2005 | Improving out-of-coverage language modelling in a multimodal dialogue system using small training sets. Louis ten Bosch |
| 2005 | Improving robustness of speech recognition performance to aggregate of noises by two-dimensional visualization. Makoto Shozakai, Goshu Nagino |
| 2005 | Improving speech recognition using a data-driven approach. Guillermo Aradilla, Jithendra Vepa, Hervé Bourlard |
| 2005 | Improving statistical machine translation by classifying and generalizing inflected verb forms. Adrià de Gispert, José B. Mariño, Josep Maria Crego |
| 2005 | Improving the discrimination between native accents when recorded over different channels. Tingyao Wu, Dirk Van Compernolle, Jacques Duchateau, Qian Yang, Jean-Pierre Martens |
| 2005 | Improving the speech recognition performance of beginners in spoken conversational interaction for language learning. Hui Ye, Steve J. Young |
| 2005 | In-set/out-of-set speaker identification based on discriminative speech frame selection. Xianxian Zhang, John H. L. Hansen |
| 2005 | Incorporating a Bayesian wide phonetic context model for acoustic rescoring. Sakriani Sakti, Satoshi Nakamura, Konstantin Markov |
| 2005 | Incorporating tone-related MLP posteriors in the feature representation for Mandarin ASR. Xin Lei, Mei-Yuh Hwang, Mari Ostendorf |
| 2005 | Incremental dependency parsing of Japanese spoken monologue based on clause boundaries. Tomohiro Ohno, Shigeki Matsubara, Hideki Kashioka, Naoto Kato, Yasuyoshi Inagaki |
| 2005 | Incremental largest margin linear regression and MAP adaptation for speech separation in telemedicine applications. Rusheng Hu, Jian Xue, Yunxin Zhao |
| 2005 | Indexing uncertainty for spoken document search. Ciprian Chelba, Alex Acero |
| 2005 | Inducing decision tree pronunciation variation models from annotated speech data. Per-Anders Jande |
| 2005 | Influence of F0 on Vietnamese syllable perception. Do Dat Tran, Eric Castelli, Jean-François Serignat, Van Loan Trinh, Le Xuan Hung |
| 2005 | Influence of syntax on prosodic boundary prediction. Tommy Ingulfsen, Tina Burrows, Sabine Buchholz |
| 2005 | Informed blending of databases for emotional speech synthesis. Gregor Hofer, Korin Richmond, Robert A. J. Clark |
| 2005 | Integrated development and on-the-fly simulation of multimodal dialogs. Silke Goronzy, Nicole Beringer |
| 2005 | Integrated n-best re-ranking for spoken language translation. V. H. Quan, Marcello Federico, Mauro Cettolo |
| 2005 | Integrating denotational meaning into a DBN language model. William Schuler, Tim Miller |
| 2005 | Integrating information from speech and physiological signals to achieve emotional sensitivity. Jonghwa Kim, Elisabeth André, Matthias Rehm, Thurid Vogt, Johannes Wagner |
| 2005 | Interactions between speech recognition problems and user emotions. Mihai Rotaru, Diane J. Litman, Katherine Forbes-Riley |
| 2005 | Interactive visualization of human-machine dialogs. Jeremy H. Wright, David A. Kapilow, Alicia Abella |
| 2005 | Internal noise suppression for speech recognition by small robots. Akinori Ito, Takashi Kanayama, Motoyuki Suzuki, Shozo Makino |
| 2005 | Intonational contrasts in EP: a categorical perception approach. Isabel Falé, Isabel Hub Faria |
| 2005 | Intonational sequences in tuscan Italian. Judith Bishop, Marc Peake, Dmitry Sityaev |
| 2005 | Introducing visual cues in acoustic-to-articulatory inversion. Olov Engwall |
| 2005 | Investigating the role of phoneme-level modifications in emotional speech resynthesis. Murtaza Bulut, Carlos Busso, Serdar Yildirim, Abe Kazemzadeh, Chul Min Lee, Sungbok Lee, Shrikanth S. Narayanan |
| 2005 | Investigating the role of the Lombard reflex in non-audible murmur (NAM) recognition. Panikos Heracleous, Tomomi Kaino, Hiroshi Saruwatari, Kiyohiro Shikano |
| 2005 | Investigation and modeling of coarticulation during speech. Jianwu Dang, Jianguo Wei, Takeharu Suzuki, Pascal Perrier |
| 2005 | Investigation of the relationship between turn-taking and prosodic features in spontaneous dialogue. Tomoko Ohsuga, Masafumi Nishida, Yasuo Horiuchi, Akira Ichikawa |
| 2005 | Investigations on ensemble based semi-supervised acoustic model training. Rong Zhang, Ziad Al Bawab, Arthur Chan, Ananlada Chotimongkol, David Huggins-Daines, Alexander I. Rudnicky |
| 2005 | Investigations on error minimizing training criteria for discriminative training in automatic speech recognition. Wolfgang Macherey, Lars Haferkamp, Ralf Schlüter, Hermann Ney |
| 2005 | Is color information really useful for lip-reading ? (or what is lost when color is not used). Philippe Daubias |
| 2005 | Italian children's speech recognition for advanced interactive literacy tutors. Piero Cosi, Bryan L. Pellom |
| 2005 | Italian geminates under speech rate and focalization changes: kinematic, acoustic, and perception data. Barbara Gili Fivela, Claudio Zmarich |
| 2005 | Japanese vowel recognition based on structural representation of speech. Takao Murakami, Kazutaka Maruyama, Nobuaki Minematsu, Keikichi Hirose |
| 2005 | Joint Bayesian predictive classification and parallel model combination for robust speech recognition. Svein Gunnar Pettersen, Magne Hallstein Johnsen, Tor André Myrvoll |
| 2005 | Joint source-channel coding of LSP parameters for bursty channels. José L. Pérez-Córdoba, Antonio M. Peinado, Angel M. Gomez, Antonio J. Rubio |
| 2005 | Joint uncertainty decoding for noise robust speech recognition. Hank Liao, Mark J. F. Gales |
| 2005 | Kalman and unscented kalman filter feature enhancement for noise robust ASR. Veronique Stouten, Hugo Van hamme, Patrick Wambacq |
| 2005 | Kalman filters for time delay of arrival-based source localization. Ulrich Klee, Tobias Gehrig, John W. McDonough |
| 2005 | L2 development of quantity perception: dutch listeners learning Finnish /t-t: /. Willemijn Heeren |
| 2005 | Language model adaptation for resource deficient languages using translated data. Arnar Thor Jensson, Edward W. D. Whittaker, Koji Iwano, Sadaoki Furui |
| 2005 | Language model data filtering via user simulation and dialogue resynthesis. Chao Wang, Stephanie Seneff, Grace Chung |
| 2005 | Large scale evaluation of corpus-based synthesizers: results and lessons from the blizzard challenge 2005. Christina L. Bennett |
| 2005 | Learning methods and features for corpus-based phrase break prediction on Thai. Chatchawarn Hansakunbuntheung, Ausdang Thangthai, Chai Wutiwiwatchai, Rungkarn Siricharoenchai |
| 2005 | Learning of stochastic dialog models through a dialog simulation technique. Francisco Torres, Emilio Sanchis, Encarna Segarra |
| 2005 | Learning statistically characterized resonance targets in a hidden trajectory model of speech coarticulation and reduction. Li Deng, Dong Yu, Alex Acero |
| 2005 | Learning to personalize spoken generation for dialogue systems. François Mairesse, Marilyn A. Walker |
| 2005 | Learning user simulations for information state update dialogue systems. Kallirroi Georgila, James Henderson, Oliver Lemon |
| 2005 | Let's go public! taking a spoken dialog system to the real world. Antoine Raux, Brian Langner, Dan Bohus, Alan W. Black, Maxine Eskénazi |
| 2005 | Leveraging speaker-dependent variation of adaptation. Arindam Mandal, Mari Ostendorf, Andreas Stolcke |
| 2005 | Lexical inhibition effects in time-compressed speech. Esther Janse |
| 2005 | Lexical out-of-vocabulary models for one-stage speech interpretation. Matthias Thomae, Tibor Fábián, Robert Lieb, Günther Ruske |
| 2005 | Lexical tone and pitch perception in tone and non-tone language speakers. Barbara Schwanhäußer, Denis Burnham |
| 2005 | Lexical tone perception in musicians and non-musicians. Jennifer A. Alexander, Patrick C. M. Wong, Ann R. Bradlow |
| 2005 | Linear models for structure prediction. Fernando C. N. Pereira |
| 2005 | Linguistic and acoustic features depending on different situations - the experiments considering speech recognition rate. Shinya Yamada, Toshihiko Itoh, Kenji Araki |
| 2005 | Linguistic features weighting for a text-to-speech system without prosody model. Vincent Colotte, Richard Beaufort |
| 2005 | Liveness detection using cross-modal correlations in face-voice person authentication. Girija Chetty, Michael Wagner |
| 2005 | Local word confidence measure using word graph and n-best list. Joseph Razik, Odile Mella, Dominique Fohr, Jean Paul Haton |
| 2005 | Low-dimensional feature space derivation for emotion recognition. Jaroslaw Cichosz, Krzysztof Slot |
| 2005 | MLLR transforms as features in speaker recognition. Andreas Stolcke, Luciana Ferrer, Sachin S. Kajarekar, Elizabeth Shriberg, Anand Venkataraman |
| 2005 | MLLR-like speaker adaptation based on linearization of VTLN with MFCC features. Xiaodong Cui, Abeer Alwan |
| 2005 | Mandarin/English mixed-lingual name recognition for mobile phone. Xiaolin Ren, Xin He, Yaxin Zhang |
| 2005 | Maximum conditional mutual information modeling for speaker verification. Mohamed Kamal Omar, Jirí Navrátil, Ganesh N. Ramaswamy |
| 2005 | Maximum margin learning and adaptation of MLP classifiers. Xiao Li, Jeff A. Bilmes, Jonathan Malkin |
| 2005 | Maximum mutual information SPLICE transform for seen and unseen conditions. Jasha Droppo, Alex Acero |
| 2005 | Measuring liveliness in presentation speech. Rebecca Hincks |
| 2005 | Measuring unsupervised acoustic clustering through phoneme pair merge-and-split tests. John Kominek, Alan W. Black |
| 2005 | Meeting acts: a labeling system for group interaction in meetings. Rebecca A. Bates, Patrick Menning, Elizabeth Willingham, Chad Kuyper |
| 2005 | Memory efficient approximative lattice generation for grammar based decoding. Miroslav Novak |
| 2005 | Memory-enhanced MMSE-based channel error mitigation for distributed speech recognition. Cheng-Lung Lee, Wen-Whei Chang |
| 2005 | Methods for combining language models in speech recognition. Simo Broman, Mikko Kurimo |
| 2005 | Minimum Bayes-risk decoding considering word significance for information retrieval system. Hiroaki Nanjo, Teruhisa Misu, Tatsuya Kawahara |
| 2005 | Minimum word error based discriminative training of language models. Jen-Wei Kuo, Berlin Chen |
| 2005 | Mining broadcast news data: robust information extraction from word lattices. Benoît Favre, Frédéric Béchet, Pascal Nocera |
| 2005 | Mixture of support vector machines for text-independent speaker recognition. Zhenchun Lei, Yingchun Yang, Zhaohui Wu |
| 2005 | Modality integration and dialog management for a robotic assistant. Ioannis Toptsis, Axel Haasch, Sonja Hwel, Jannik Fritsch, Gernot A. Fink |
| 2005 | Model adaptation and adaptive training using ESAT algorithm for HMM-based speech synthesis. Juri Isogai, Junichi Yamagishi, Takao Kobayashi |
| 2005 | Model adaptation by state splitting of HMM for long reverberation. Chandra Kant Raut, Takuya Nishimoto, Shigeki Sagayama |
| 2005 | Model based analysis of a diphone database for improved unit concatenation. Karl Schnell, Arild Lacroix |
| 2005 | Modeling and automating detection of errors in Arabic language learner speech. Abhinav Sethy, Shrikanth S. Narayanan, Nicolaus Mote, W. Lewis Johnson |
| 2005 | Modeling high-level information by using Gaussian mixture correlation for GMM-UBM based speaker recognition. Jing Deng, Thomas Fang Zheng, Zhanjiang Song, Jian Liu |
| 2005 | Modeling intra-speaker variability for speaker recognition. Hagai Aronowitz, Dror Irony, David Burshtein |
| 2005 | Modeling long and short-term prosody for language identification. Jean-Luc Rouas |
| 2005 | Modeling of between-speaker and within-speaker variation in spontaneous speech tempo. Hugo Quené |
| 2005 | Modeling the perception of multitalker speech. Soundararajan Srinivasan, DeLiang Wang |
| 2005 | Modeling the production of VCV sequences via the inversion of a biomechanical model of the tongue. Pascal Perrier, Liang Ma, Yohan Payan |
| 2005 | Modeling vowels for Arabic BN transcription. Abdelkhalek Messaoudi, Lori Lamel, Jean-Luc Gauvain |
| 2005 | Modelling pitch accent types for Polish speech synthesis. Dominika Oliver, Robert A. J. Clark |
| 2005 | Modelling session variability in text-independent speaker verification. Robbie Vogt, Brendan Baker, Sridha Sridharan |
| 2005 | Modified DISTBIC algorithm for speaker change detection. Petra Zochová, Vlasta Radová |
| 2005 | Mora timing organization in producing contrastive geminate/single consonants and long/short vowels by native and non-native speakers of Japanese: effects of speaking rate. Haiping Jia, Hiroki Mori, Hideki Kasuya |
| 2005 | Morphing spectral envelopes using audio flow. Tony Ezzat, Ethan Meyers, James R. Glass, Tomaso A. Poggio |
| 2005 | Multi-band approach of audio source discrimination with empirical mode decomposition. Md. Khademul Islam Molla, Keikichi Hirose, Nobuaki Minematsu |
| 2005 | Multi-level information and automatic dialog acts detection in human-human spoken dialogs. Sophie Rosset, Delphine Tribout |
| 2005 | Multi-resolution RASTA filtering for TANDEM-based ASR. Hynek Hermansky, Petr Fousek |
| 2005 | Multi-stage compaction approach to broadcast news summarisation. BalaKrishna Kolluru, Heidi Christensen, Yoshihiko Gotoh |
| 2005 | Multi-task learning strategies for a recurrent neural net in a hybrid tied-posteriors acoustic model. Jan Stadermann, Wolfram Koska, Gerhard Rigoll |
| 2005 | Multidimensional scaling of listener responses to synthetic speech. Catherine Mayo, Robert A. J. Clark, Simon King |
| 2005 | Multilingual models in the IBM bilingual text-to-speech systems. Jaime Botella Ordinas, Volker Fischer, Claire Waast-Richard |
| 2005 | Multilingual speech recognition: a unified approach. C. Santhosh Kumar, V. P. Mohandas, Haizhou Li |
| 2005 | Multimodal databases of everyday emotion: facing up to complexity. Ellen Douglas-Cowie, Laurence Devillers, Jean-Claude Martin, Roddy Cowie, Suzie Savvidou, Sarkis Abrilian, Cate Cox |
| 2005 | Multimodal interface for organization name input based on combination of isolated word recognition and continuous base-word recognition. Norihide Kitaoka, Hironori Oshikawa, Seiichi Nakagawa |
| 2005 | Multiple moving speaker tracking by microphone array on mobile robot. Masamitsu Murase, Shun'ichi Yamamoto, Jean-Marc Valin, Kazuhiro Nakadai, Kentaro Yamada, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno |
| 2005 | Multisyn voices from ARCTIC data for the blizzard challenge. Robert A. J. Clark, Korin Richmond, Simon King |
| 2005 | Multiword expressions in spontaneous speech: do we really speak like that? Helmer Strik, Diana Binnenpoorte, Catia Cucchiarini |
| 2005 | Mutual intelligibility of american, Chinese and dutch-accented speakers of English. Hongyan Wang, Vincent J. van Heuven |
| 2005 | Myoelectric signals for multimodal speech recognition. Raghunandan S. Kumaran, Karthik Narayanan, John N. Gowdy |
| 2005 | NAM-to-speech conversion with Gaussian mixture models. Tomoki Toda, Kiyohiro Shikano |
| 2005 | Named entity recognition from spontaneous open-domain speech. Mihai Surdeanu, Jordi Turmo, Eli Comelles |
| 2005 | Nearly defect-free F0 trajectory extraction for expressive speech modifications based on STRAIGHT. Hideki Kawahara, Alain de Cheveigné, Hideki Banno, Toru Takahashi, Toshio Irino |
| 2005 | Neologos: an optimized database for the development of new speech processing algorithms. Delphine Charlet, Sacha Krstulovic, Frédéric Bimbot, Olivier Boëffard, Dominique Fohr, Odile Mella, Filip Korkmazsky, Djamel Mostefa, Khalid Choukri, Arnaud Vallée |
| 2005 | Neural bases of listening to speech in noise. Patrick C. M. Wong, Kiara M. Lee, Todd B. Parrish |
| 2005 | New pruning criteria for efficient decoding. Janne Pylkkönen |
| 2005 | New signal features for robust identification of isolated vowels. Aníbal J. S. Ferreira |
| 2005 | New word-level and sentence-level confidence scoring using graph theory calculus and its evaluation on speech understanding. Javier Ferreiros, Rubén San Segundo, Fernando Fernández Martínez, Luis Fernando D'Haro, Valentín Sama, Roberto Barra-Chicote, Pedro Mellén |
| 2005 | No laughing matter. Nick Campbell, Hideki Kashioka, Ryo Ohara |
| 2005 | Noise compensation using interacting multiple kalman filters. Jianping Deng, Martin Bouchard, Tet Hin Yeap |
| 2005 | Non-linear estimation of voice activity to improve automatic recognition of noisy speech. Roberto Gemello, Franco Mana, Renato De Mori |
| 2005 | Non-parametric speaker turn segmentation of meeting data. Petr Motlícek, Lukás Burget, Jan Cernocký |
| 2005 | Non-verbal speech processing for a communicative agent. Nick Campbell |
| 2005 | Nonlinear and linear transformations of speech features to compensate for channel and noise effects. Saurabh Prasad, Stephen A. Zahorian |
| 2005 | Numerical glottal sound source model as coupled problem between vocal cord vibration and glottal flow. Hideyuki Nomura, Tetsuo Funada |
| 2005 | Objective quality assessment of wideband speech by an extension of ITU-t recommendation p.862. Akira Takahashi, Atsuko Kurashima, Chiharu Morioka, Hideaki Yoshino |
| 2005 | Oldenburg logatome speech corpus (OLLO) for speech recognition experiments with humans and machines. Thorsten Wesker, Bernd T. Meyer, Kirsten Wagener, Jörn Anemüller, Alfred Mertins, Birger Kollmeier |
| 2005 | On building a concatenative speech synthesis system from the blizzard challenge speech databases. Wael Hamza, Raimo Bakis, Zhiwei Shuang, Heiga Zen |
| 2005 | On designing and evaluating speech event detectors. Jinyu Li, Chin-Hui Lee |
| 2005 | On european Portuguese automatic syllabification. Catarina Oliveira, Lurdes Castro Moutinho, António J. S. Teixeira |
| 2005 | On improvements to CI-based GMM selection. Arthur Chan, Mosur Ravishankar, Alexander I. Rudnicky |
| 2005 | On integrating insights from human speech perception into automatic speech recognition. Sorin Dusan, Lawrence R. Rabiner |
| 2005 | On noise gain estimation for HMM-based speech enhancement. David Yuheng Zhao, W. Bastiaan Kleijn |
| 2005 | On the acoustic characterization of ejective stops in Waima'a. John Hajek, Mary Stevens |
| 2005 | On the integration of speech recognition and statistical machine translation. Evgeny Matusov, Stephan Kanthak, Hermann Ney |
| 2005 | On the inter-syllable coarticulation effect of pitch modeling for Mandarin speech. Chen-Yu Chiang, Yih-Ru Wang, Sin-Horng Chen |
| 2005 | On the nature of acoustic information in identification of coarticulated vowels. Sorin Dusan |
| 2005 | On the relationship between intra-oral pressure and speech sonority. Anne Cros, Didier Demolin, Ana Georgina Flesia, Antonio Galves |
| 2005 | On the relationship between phonetic modeling precision and phonetic speaker recognition accuracy. Doroteo Torre Toledano, Carlos Fombella, Joaquin Gonzalez-Rodriguez, Luis A. Hernández Gómez |
| 2005 | On the use of a decimative spectral estimation method based on eigenanalysis and SVD for formant and bandwidth tracking of speech signals. Sotiris Karabetsos, Pirros Tsiakoulis, Stavroula-Evita Fotinea, Ioannis Dologlou |
| 2005 | On the use of morphological constraints in n-gram statistical language model. A. Ghaoui, François Yvon, Chafic Mokbel, Gérard Chollet |
| 2005 | On the use of speech recognition in computer assisted translation. Luis Rodríguez, Jorge Civera, Enrique Vidal, Francisco Casacuberta, César Ernesto Martínez |
| 2005 | On variable-scale piecewise stationary spectral analysis of speech signals for ASR. Vivek Tyagi, Christian Wellekens, Hervé Bourlard |
| 2005 | Online speaker adaptation and tracking for real-time speech recognition. Daben Liu, Daniel Kiecza, Amit Srivastava, Francis Kubala |
| 2005 | Open vocabulary speech recognition with flat hybrid models. Maximilian Bisani, Hermann Ney |
| 2005 | Open-set speaker identification using adapted Gaussian mixture models. J. Fortuna, P. Sivakumaran, Aladdin M. Ariyaeeinia, Amit S. Malegaonkar |
| 2005 | Operating a public spoken guidance system in real environment. Ryuichi Nisimura, Akinobu Lee, Masashi Yamada, Kiyohiro Shikano |
| 2005 | Optimal model order selection based on regression tree in speaker identification. Shilei Zhang, Junmei Bai, Shuwu Zhang, Bo Xu |
| 2005 | Optimization methods for discriminative training. Jonathan Le Roux, Erik McDermott |
| 2005 | Optimization of text-to-speech phonetic transcriptions using a-posteriori signal comparison. S. Revelin, Didier Cadic, Claire Waast-Richard |
| 2005 | Optimized selection of intonation dictionaries in corpus based intonation modelling. David Escudero Mancebo, Valentín Cardeñoso-Payo |
| 2005 | Optimizing the structure of partly-hidden Markov models using weighted likelihood-ratio maximization criterion. Tetsuji Ogawa, Tetsunori Kobayashi |
| 2005 | Optimizing user experience through design of the spoken language understanding (SLU) module. Maria Gabriela Alvarez-Ryan, Narendra K. Gupta, Barbara Hollister, Tirso Alonso |
| 2005 | Oriented global coherence field for the estimation of the head orientation in smart rooms equipped with distributed microphone arrays. Alessio Brutti, Maurizio Omologo, Piergiorgio Svaizer |
| 2005 | Outlier detection for acoustic model training using robust statistics. Shigeki Matsuda, Wolfgang Herbordt, Satoshi Nakamura |
| 2005 | Overlapping wavelet packet features for speaker verification. Mihalis Siafarikas, Todor Ganchev, Nikolaos D. Fakotakis, George K. Kokkinakis |
| 2005 | PCA of perturbation parameters in voice pathology detection. Pedro Gómez, Francisco Díaz Pérez, Agustín Álvarez-Marquina, Rafael Martínez, Victoria Rodellar, Roberto Fernández-Baíllo, Alberto Nieto, Francisco J. Fernandez |
| 2005 | POS-based language models for large vocabulary speech recognition on embedded systems. Petra Witschel, Sergey Astrov, Gabriele Bakenecker, Josef G. Bauer, Harald Höge |
| 2005 | PROSPECT features and their application to missing data techniques for vocal tract length normalization. Wim Jansen, Hugo Van hamme |
| 2005 | Parallels between HSR and ASR: how ASR can contribute to HSR. Odette Scharenborg |
| 2005 | Peak timing in two dialects of connaught irish. Martha Dalton, Ailbhe Ní Chasaide |
| 2005 | Perception experiment combining a parametric loudspeaker and a synthetic talking head. Gunilla Svanfeldt, Dirk Olszewski |
| 2005 | Perception of time-compressed rapid acoustic cues in French CV syllables. Caroline Jacquier, Fanny Meunier |
| 2005 | Perceptions of emotions in expressive storytelling. Cecilia Ovesdotter Alm, Richard Sproat |
| 2005 | Perceptual and linguistic category formation in infants. Tamami Sudo, Ken Mogi |
| 2005 | Perceptual development of the duration cue in dutch /a-a: /. Willemijn Heeren |
| 2005 | Perceptual magnet effect in German boundary tones. Katrin Schneider, Bernd Möbius |
| 2005 | Perceptual postfilter estimation for low bit rate speech coders using Gaussian mixture models. Wei Chen, Peter Kabal, Turaj Zakizadeh Shabestary |
| 2005 | Perceptual salience of language-specific acoustic differences in autonomous fillers across eight languages. Ioana Vasilescu, Maria Candea, Martine Adda-Decker |
| 2005 | Perceptual space of English fricatives for Japanese learners. Won Tokuma, Shinichi Tokuma |
| 2005 | Perceptually-based data-driven join costs: comparing join types. Ann K. Syrdal, Alistair Conkie |
| 2005 | Performance evaluation of style adaptation for hidden semi-Markov model based speech synthesis. Makoto Tachibana, Junichi Yamagishi, Takashi Masuko, Takao Kobayashi |
| 2005 | Phattsessionz: recording 1000 adolescent speakers in schools in Germany. Christoph Draxler, Alexander Steffen |
| 2005 | Phoneme alignment based on discriminative learning. Joseph Keshet, Shai Shalev-Shwartz, Yoram Singer, Dan Chazan |
| 2005 | Phonetic ignorance is bliss: investigating the effects of phonetic information reduction on ASR performance. Eric Fosler-Lussier, C. Anton Rytting, Soundararajan Srinivasan |
| 2005 | Phonetic inventories in Italian children aged 18-27 months: a longitudinal study. Claudio Zmarich, Serena Bonifacio |
| 2005 | Phonetic labeling and segmentation of mixed-lingual prosody databases. Harald Romsdorfer, Beat Pfister |
| 2005 | Phonetic transcription verification with generalized posterior probability. Lijuan Wang, Yong Zhao, Min Chu, Frank K. Soong, Zhigang Cao |
| 2005 | Phonological analysis of schwa and liaison within the PFC project (phonologie du fran ais contemporain): how determinant are the prosodic factors? Anne Lacheret, Ch. Lyche, Michel Morel |
| 2005 | Phonotactic language identification using high quality phoneme recognition. Pavel Matejka, Petr Schwarz, Jan Cernocký, Pavel Chytil |
| 2005 | Physiological study of whispered speech in Moroccan Arabic. Chakir Zeroual, John H. Esling, Lise Crevier-Buchman |
| 2005 | Physiologically motivated audio-visual localisation and tracking. Stuart N. Wrigley, Guy J. Brown |
| 2005 | Piecewise linear stylization of pitch via wavelet analysis. Dagen Wang, Shrikanth S. Narayanan |
| 2005 | Pitch accent prediction: effects of genre and speaker. Jiahong Yuan, Jason M. Brenier, Daniel Jurafsky |
| 2005 | Pitch patterns of intonational phrases and intonational phrase groups in native and non-native speech. Hiroko Hirano, Goh Kawai |
| 2005 | Pitch-effects in diphone recording: are logatomes inappropriate? Ulrich Reubold, Alexander Steffen |
| 2005 | Pitch-synchronous time-scaling for high-frequency excitation regeneration. João P. Cabral, Luís C. Oliveira |
| 2005 | Pitch-synchronous time-scaling for prosodic and voice quality transformations. João P. Cabral, Luís C. Oliveira |
| 2005 | Polder dutch: aspects of the /ei/-lowering in standard dutch. Irene Jacobi, Louis C. W. Pols, Jan Stroop |
| 2005 | Polynomial dynamic time warping kernel support vector machines for dysarthric speech recognition with sparse training data. Vincent Wan, James Carmichael |
| 2005 | Predicting consonant duration with Bayesian belief networks. Olga Goubanova, Simon King |
| 2005 | Predicting end of utterance in multimodal and unimodal conditions. Pashiera Barkhuysen, Emiel Krahmer, Marc Swerts |
| 2005 | Probabilistic anchor models approach for speaker verification. Mikaël Collet, Yassine Mami, Delphine Charlet, Frédéric Bimbot |
| 2005 | Production and perception of Vietnamese vowels. Eric Castelli, René Carré |
| 2005 | Production of prominence in Japanese sign language. Saori Tanaka, Masafumi Nishida, Yasuo Horiuchi, Akira Ichikawa |
| 2005 | Pronunciation error detection method based on error rule clustering using a decision tree. Akinori Ito, Yen-Ling Lim, Motoyuki Suzuki, Shozo Makino |
| 2005 | Pronunciation variation modelling using accent features. Michael Tjalve, Mark A. Huckvale |
| 2005 | Pronunciation variations of Spanish-accented English spoken by young children. Hong You, Abeer Alwan, Abe Kazemzadeh, Shrikanth S. Narayanan |
| 2005 | Proposal of acoustic measures for automatic detection of vocal fry. Carlos Toshinori Ishi, Hiroshi Ishiguro, Norihiro Hagita |
| 2005 | Prosodic cues for syntactically-motivated junctures. Ivan Chow |
| 2005 | Prosodic features based on wavelet analysis for speaker verification. Jixu Chen, Beiqian Dai, Jun Sun |
| 2005 | Prosodic realization of split noun phrases in Mandarin Chinese compared in topic and focus contexts. Bei Wang |
| 2005 | Prosody in public speech: analyses of a news announcement and a Political interview. Eva Strangert |
| 2005 | Quality control for UMTS-AMR speech channels. Marc Werner, Peter Vary |
| 2005 | Quantitative evaluation of effects of speech recognition errors on speech translation quality. Kenko Ohta, Keiji Yasuda, Gen-ichiro Kikui, Masuzo Yanagida |
| 2005 | Quasi-automatic extraction of tongue movement from a large existing speech cineradiographic database. Julie Fontecave, Frédéric Berthommier |
| 2005 | Rapid porting of ASR-systems to mobile devices. Thilo Köhler, Christian Fügen, Sebastian Stüker, Alex Waibel |
| 2005 | Rapid response and robust speech recognition by preliminary model adaptation for additive and convolutional noise. Satoshi Kobashikawa, Satoshi Takahashi, Yoshikazu Yamaguchi, Atsunori Ogawa |
| 2005 | Rapid speaker adaptation for continuous speech recognition using merging eigenvoices. Dong-jin Choi, Yung-Hwan Oh |
| 2005 | Rapid transition to new spoken dialogue domains: language model training using knowledge from previous domain applications and web text resources. Murat Akbacak, Yuqing Gao, Liang Gu, Hong-Kwang Jeff Kuo |
| 2005 | Rapid unsupervised speaker adaptation based on multi-template HMM sufficient statistics in noisy environments. Randy Gomez, Akinobu Lee, Hiroshi Saruwatari, Kiyohiro Shikano |
| 2005 | Rapidly developing spoken Chinese dialogue systems with the d-ear SDS SDK. Xiaojun Wu, Thomas Fang Zheng, Michael Brasser, Zhanjiang Song |
| 2005 | Real-time outer lip contour tracking for HCI applications. Sabri Gurbuz |
| 2005 | Real-time pitch tracking based on combined SMDSF. Jian Liu, Thomas Fang Zheng, Jing Deng, Wenhu Wu |
| 2005 | Recent progress in Arabic broadcast news transcription at BBN. Mohamed Afify, Long Nguyen, Bing Xiang, Sherif M. Abdou, John Makhoul |
| 2005 | Recognition of (3) party conversation using prosody and gaze. Yosuke Matsusaka |
| 2005 | Recognition of German obstruents. Julia Hoelterhoff |
| 2005 | Recognizing speech from simultaneous speakers. Bhiksha Raj, Rita Singh, Paris Smaragdis |
| 2005 | Reconstruction of Polish diacritics in a text-to-speech system. Artur Janicki, Piotr Herman |
| 2005 | Reducing the corpus-based TTS signal degradation due to speaker's word pronunciations. Sérgio Paulo, Luís C. Oliveira |
| 2005 | Reducing the description amount in authoring MMI applications. Kouichi Katsurada, Kazumine Aoki, Hirobumi Yamada, Tsuneo Nitta |
| 2005 | Refining phoneme segmentations using speaker-adaptive context dependent boundary models. Yong Zhao, Lijuan Wang, Min Chu, Frank K. Soong, Zhigang Cao |
| 2005 | Regularizing linear discriminant analysis for speech recognition. Hakan Erdogan |
| 2005 | Relevant information extraction for discriminative training applied to speaker identification. Mohamed Mihoubi, Douglas D. O'Shaughnessy, Pierre Dumouchel |
| 2005 | Remodeling of the sensor for non-audible murmur (NAM). Yoshitaka Nakajima, Hideki Kashioka, Kiyohiro Shikano, Nick Campbell |
| 2005 | Results from a survey of attendees at ASRU 1997 and 2003. Roger K. Moore |
| 2005 | Revealing phonological similarities between German and dutch. Karin Müller |
| 2005 | Reversed speech comprehension depends on the auditory efferent system functionality. Claire-Léonie Grataloup, Michel Hoen, François Pellegrino, E. Veuillet, Lionel Collet, Fanny Meunier |
| 2005 | Review of statistical modeling of highly inflected lithuanian using very large vocabulary. Airenas Vaiciunas, Gailius Raskinis |
| 2005 | Revising Perceptual Linear Prediction (PLP). Florian Hönig, Georg Stemmer, Christian Hacker, Fabio Brugnara |
| 2005 | Ritel: an open-domain, human-computer dialog system. Olivier Galibert, Gabriel Illouz, Sophie Rosset |
| 2005 | Robust access to large structured data using voice form-filling. S. Parthasarathy, Cyril Allauzen, R. Munkong |
| 2005 | Robust algorithms and interaction strategies for voice spelling. Daniela Oria, Akos Vetek |
| 2005 | Robust and efficient semantic parsing of free word order languages in spoken dialogue systems. Ralf Engel |
| 2005 | Robust automatic speech recognition using a perceptually-based optimal spectral amplitude estimator speech enhancement algorithm in various low-SNR environments. Hesham Tolba, Zili Li, Douglas D. O'Shaughnessy |
| 2005 | Robust bandwidth extension of noise-corrupted narrowband speech. Michael L. Seltzer, Alex Acero, Jasha Droppo |
| 2005 | Robust detection of sonorant landmarks. Ken Schutte, James R. Glass |
| 2005 | Robust distant speaker recognition based on position dependent cepstral mean normalization. Longbiao Wang, Norihide Kitaoka, Seiichi Nakagawa |
| 2005 | Robust distant speech recognition based on position dependent CMN using a novel multiple microphone processing technique. Longbiao Wang, Norihide Kitaoka, Seiichi Nakagawa |
| 2005 | Robust feature compensation in nonstationary and multiple noise environments. Martin Graciarena, Horacio Franco, Gregory K. Myers, Victor Abrash |
| 2005 | Robust speaker localization through adaptive weighted pair TDOA (AWEPAT) estimation. Nilesh Madhu, Rainer Martin |
| 2005 | Robust speech recognition based on noise and SNR classification - a multiple-model framework. Haitian Xu, Zheng-Hua Tan, Paul Dalsgaard, Børge Lindberg |
| 2005 | Robust speech recognition for mobile devices in car noise. Panji Setiawan, Suhadi Suhadi, Tim Fingscheidt, Sorel Stan |
| 2005 | Robust speech recognition in cars using phoneme dependent multi-environment linear normalization. Luis Buera, Eduardo Lleida, Antonio Miguel, Alfonso Ortega |
| 2005 | Robust speech recognition in ubiquitous networking and context-aware computing. Zheng-Hua Tan, Paul Dalsgaard, Børge Lindberg, Haitian Xu |
| 2005 | Robust voice activity detection based on the entropy of noise-suppressed spectrum. Zoltán Tüske, Péter Mihajlik, Zoltán Tobler, Tibor Fegyó |
| 2005 | Root causes of lost time and user stress in a simple dialog system. Nigel G. Ward, Anais G. Rivera, Karen Ward, David G. Novick |
| 2005 | Rule-based grapheme-to-phoneme method for the Greek. Aimilios Chalamandaris, Spyros Raptis, Pirros Tsiakoulis |
| 2005 | SGStudio: rapid semantic grammar development for spoken language understanding. Ye-Yi Wang, Alex Acero |
| 2005 | SNR-dependent background noise compensation of PESQ values for cellular phone speech. Kengo Fujita, Tsuneo Kato, Hideaki Yamada, Hisashi Kawai |
| 2005 | SVitchboard 1: small vocabulary tasks from Switchboard. Simon King, Chris D. Bartels, Jeff A. Bilmes |
| 2005 | Scalable language model look-ahead for LVCSR. Dominique Massonié, Pascal Nocera, Georges Linarès |
| 2005 | Segment-based phonetic class detection using minimum verification error (MVE) training. Qiang Fu, Biing-Hwang Juang |
| 2005 | Segmental "anchorage" and the French late rise. Pauline Welby, Hélène Loevenbruck |
| 2005 | Segmentation of recordings based on partial transcriptions. Patrick Cardinal, Gilles Boulianne, Michel Comeau |
| 2005 | Selection of features and combination of classifiers using a fuzzy approach for acoustic event classification. Andrey Temko, Dusan Macho, Climent Nadeu |
| 2005 | Self-organizing chirp-sensitive artificial auditory cortical model. Luis Weruaga, Marián Képesi |
| 2005 | Semantic annotation of the French media dialog corpus. Hélène Bonneau-Maynard, Sophie Rosset, Christelle Ayache, Anne Kuhn, Djamel Mostefa |
| 2005 | Simultaneous adaptation of echo cancellation and spectral subtraction for in-car speech recognition. Osamu Ichikawa, Masafumi Nishimura |
| 2005 | Situation based speech recognition for structuring baseball live games. Atsushi Sako, Tetsuya Takiguchi, Yasuo Ariki |
| 2005 | Small footprint concatenative text-to-speech synthesis system using complex spectral envelope modeling. Dan Chazan, Ron Hoory, Zvi Kons, Ariel Sagi, Slava Shechtman, Alexander Sorin |
| 2005 | Soft decision strategy and adaptive compensation for robust speech recognition against impulsive noise. Pei Ding |
| 2005 | Soft harmonic masks for recognising speech in the presence of a competing speaker. André Coy, Jon Barker |
| 2005 | Some experiments on iterative reconstruction of speech from STFT phase and magnitude spectra. Leigh D. Alsteris, Kuldip K. Paliwal |
| 2005 | Sound segregation based on binaural zero-crossings. Young-Ik Kim, Sung Jun An, Rhee Man Kil, Hyung-Min Park |
| 2005 | Speaker adaptation in the NIST speaker recognition evaluation 2004. David A. van Leeuwen |
| 2005 | Speaker adaptive acoustic modeling with mixture of adult and children's speech. Matteo Gerosa, Diego Giuliani, Fabio Brugnara |
| 2005 | Speaker clustering of unknown utterances based on maximum purity estimation. Wei-Ho Tsai, Hsin-Min Wang |
| 2005 | Speaker detection using acoustic event sequences. Nicolas Scheffer, Jean-François Bonastre |
| 2005 | Speaker independent emotion recognition by early fusion of acoustic and linguistic features within ensembles. Björn W. Schuller, Ronald Müller, Manfred K. Lang, Gerhard Rigoll |
| 2005 | Speaker verification improvement using blind inversion of distortions. Marcos Faúndez-Zanuy, Jordi Solé-Casals |
| 2005 | Speaker verification in noisy conditions using correlated subband features. James McAuley, Ji Ming, Pat Corr |
| 2005 | Speaker verification using Gaussian mixture models within changing real car environments. Xianxian Zhang, John H. L. Hansen, Pongtep Angkititrakul, Kazuya Takeda |
| 2005 | Speaker verification via articulatory feature-based conditional pronunciation modeling with vowel and consonant mixture models. Ka-Yee Leung, Man-Wai Mak, Man-Hung Siu, Sun-Yuan Kung |
| 2005 | Spectral cross-correlation features for audio indexing of broadcast news and meetings. Masahide Yamaguchi, Masaru Yamashita, Shoichi Matsunaga |
| 2005 | Spectral entropy feature in full-combination multi-stream for robust ASR. Hemant Misra, Hervé Bourlard |
| 2005 | Spectral subtraction using elliptic integral for multiplication factor. Takeshi S. Kobayakawa |
| 2005 | Speech activity detection fusing acoustic phonetic and energy features. Etienne Marcheret, Karthik Visweswariah, Gerasimos Potamianos |
| 2005 | Speech bandwidth extension by improved codebook mapping towards increased phonetic classification. Rongqiang Hu, Venkatesh Krishnan, David V. Anderson |
| 2005 | Speech enhancement in temporal DFT trajectories using Kalman filters. Esfandiar Zavarehei, Saeed Vaseghi |
| 2005 | Speech enhancement using Markov model of speech segments. T. M. Sunil Kumar, T. V. Sreenivas |
| 2005 | Speech enhancement using auditory phase opponency model. Om Deshmukh, Carol Y. Espy-Wilson |
| 2005 | Speech enhancement using non-acoustic sensors. Rongqiang Hu, Sunil D. Kamath, David V. Anderson |
| 2005 | Speech event detection using multiband modulation energy. Georgios Evangelopoulos, Petros Maragos |
| 2005 | Speech extraction in a car interior using frequency-domain ICA with rapid filter adaptations. Daisuke Saitoh, Atsunobu Kaminuma, Hiroshi Saruwatari, Tsuyoki Nishikawa, Akinobu Lee |
| 2005 | Speech intelligibility derived from time-frequency and source smearing. Toshio Irino, Satoru Satou, Shunsuke Nomura, Hideki Banno, Hideki Kawahara |
| 2005 | Speech interface for controlling an hi-fi audio system based on a Bayesian belief networks approach for dialog modeling. Fernando Fernández Martínez, Javier Ferreiros, Valentín Sama, Juan Manuel Montero, Rubén San Segundo, Javier Macías Guarasa, Rafael García |
| 2005 | Speech inversion and re-synthesis. Victor N. Sorokin, Alexander S. Leonov, I. S. Makarov, A. I. Tsyplikhin |
| 2005 | Speech operated smart-home control system for users with special needs. Anestis Vovos, Basilis Kladis, Nikolaos D. Fakotakis |
| 2005 | Speech parameter generation algorithm considering global variance for HMM-based speech synthesis. Tomoki Toda, Keiichi Tokuda |
| 2005 | Speech processing in the networked home environment - a view on the amigo project. Reinhold Haeb-Umbach, Basilis Kladis, Joerg Schmalenstroeer |
| 2005 | Speech recognition performance and learning in spoken dialogue tutoring. Diane J. Litman, Katherine Forbes-Riley |
| 2005 | Speech recognition with support vector machines in a hybrid system. Sven E. Krüger, Martin Schafföner, Marcel Katz, Edin Andelic, Andreas Wendemuth |
| 2005 | Speech repair: quick error correction just by using selection operation for speech input interfaces. Jun Ogata, Masataka Goto |
| 2005 | Speech retrieval of Mandarin broadcast news via mobile devices. Berlin Chen, Yi-Ting Chen, Chih-Hao Chang, Hung-Bin Chen |
| 2005 | Speech technology for e-inclusion of people with physical disabilities and disordered speech. Mark S. Hawley, Phil D. Green, Pam Enderby, Stuart P. Cunningham, Roger K. Moore |
| 2005 | Speech technology for language training and e-inclusion. Björn Granström |
| 2005 | Speech trajectory clustering for improved speech recognition. Yan Han, Johan de Veth, Lou Boves |
| 2005 | Speech translation for low-resource languages: the case of Pashto. Andreas Kathol, Kristin Precoda, Dimitra Vergyri, Wen Wang, Susanne Z. Riehemann |
| 2005 | Spirantization of /p t k/ in Sienese Italian and so-called semi-fricatives. Mary Stevens, John Hajek |
| 2005 | Spoken dialog system and its evaluation of geographic information system for elderly persons' mobility support. Takatoshi Jitsuhiro, Shigeki Matsuda, Yutaka Ashikari, Satoshi Nakamura, Ikuko Eguchi Yairi, Seiji Igi |
| 2005 | Spoken dialog system for real-time data capture. Esther Levin, Alex Levin |
| 2005 | Spoken language understanding using layered n-gram modeling. Nick J.-C. Wang |
| 2005 | Spontaneous speech consolidation for spoken language applications. Chiori Hori, Alex Waibel |
| 2005 | Spontaneous speech: how people really talk and why engineers should care. Elizabeth Shriberg |
| 2005 | State estimation of meetings by information fusion using Bayesian network. Michiaki Katoh, Kiyoshi Yamamoto, Jun Ogata, Takashi Yoshimura, Futoshi Asano, Hideki Asoh, Nobuhiko Kitawaki |
| 2005 | Statistical class-based MFCC enhancement of filtered and band-limited speech for robust ASR. Nicolás Morales, Doroteo T. Toledano, John H. L. Hansen, José Colás, Javier Garrido Salas |
| 2005 | Statistical language models for large vocabulary spontaneous speech recognition in dutch. Jacques Duchateau, Dong Hoon Van Uytsel, Hugo Van hamme, Patrick Wambacq |
| 2005 | Statistical noise compensation for cochlear implant processing. Hui Jiang, Qian-Jie Fu |
| 2005 | Statistical properties of the warped discrete cosine transform cepstrum compared with MFCC. R. Muralishankar, Abhijeet Sangwan, Douglas D. O'Shaughnessy |
| 2005 | Steady-state pre-processing for improving speech intelligibility in reverberant environments: evaluation in a hall with an electrical reverberator. Nahoko Hayashi, Takayuki Arai, Nao Hodoshima, Yusuke Miyauchi, Kiyohiro Kurisu |
| 2005 | Steerable highly directional audio beam loudspeaker. Dirk Olszewski, Fransiskus Prasetyo, Klaus Linhard |
| 2005 | Stimulus duration and type in perception of female and male speaker age. Susanne Schötz |
| 2005 | Stochastic and syntactic techniques for predicting phrase breaks. Ian Read, Stephen Cox |
| 2005 | Stochastic pronunciation modeling by ergodic-HMM of acoustic sub-word units. V. Ramasubramanian, P. Srinivas, T. V. Sreenivas |
| 2005 | Strategies of labial coarticulation. Vincent Robert, Brigitte Wrobel-Dautcourt, Yves Laprie, Anne Bonneau |
| 2005 | Stream-weight optimization by LDA and adaboost for multi-stream speaker verification. Taichi Asami, Koji Iwano, Sadaoki Furui |
| 2005 | Structural metadata annotation: moving beyond English. Stephanie M. Strassel, Jáchym Kolár, Zhiyi Song, Leila Barclay, Meghan Lammie Glenn |
| 2005 | Structural representation of the non-native pronunciations. Satoshi Asakawa, Nobuaki Minematsu, Toshiko Isei-Jaakkola, Keikichi Hirose |
| 2005 | Stylization of glottal-flow spectra produced by a mechanical vocal-fold model. Denisse Sciamarella, Christophe d'Alessandro |
| 2005 | Sub-band weighted projection measure for robust sub-band speech recognition. Babak Nasersharif, Ahmad Akbari |
| 2005 | Subglottal pressure and NAQ variation in voice production of classically trained baritone singers. Eva Björkner, Johan Sundberg, Paavo Alku |
| 2005 | Subjective and objective quality assessment of regression-enhanced speech in real car environments. Weifeng Li, Katunobu Itou, Kazuya Takeda, Fumitada Itakura |
| 2005 | Supergaussian GARCH models for speech signals. Israel Cohen |
| 2005 | Supporting the creation of TTS for local language voice information systems. Roger C. F. Tucker, Ksenia Shalonova |
| 2005 | Switched split vector quantisation of line spectral frequencies for wideband speech coding. Stephen So, Kuldip K. Paliwal |
| 2005 | Syllable structure in spoken Arabic: a comparative investigation. Rym Hamdi, Salem Ghazali, Melissa Barkat-Defradas |
| 2005 | Symbolic prosody driven unit selection for highly natural synthetic speech. Daniel Tihelka |
| 2005 | Synchronizing dialogue contributions of human users and virtual characters in a virtual reality environment. Norbert Pfleger, Markus Löckelt |
| 2005 | Synthesis of disordered speech. Julien Hanquinet, Francis Grenez, Jean Schoentgen |
| 2005 | Synthesising hyperarticulation in unit selection TTS. Matthew P. Aylett |
| 2005 | TBALL data collection: the making of a young children's speech corpus. Abe Kazemzadeh, Hong You, Markus Iseli, Barbara Jones, Xiaodong Cui, Margaret Heritage, Patti Price, Elaine Andersen, Shrikanth S. Narayanan, Abeer Alwan |
| 2005 | Tales of tuning - prototyping for automatic classification of emotional user states. Anton Batliner, Stefan Steidl, Christian Hacker, Elmar Nöth, Heinrich Niemann |
| 2005 | Teaching a vocal tract simulation to imitate stop consonants. Mark A. Huckvale, Ian S. Howard |
| 2005 | Temporal ICA for classification of acoustic events i a kitchen environment. Florian Kraft, Robert G. Malkin, Thomas Schaaf, Alex Waibel |
| 2005 | Temporally varying model parameters for large vocabulary continuous speech recognition. Khe Chai Sim, Mark J. F. Gales |
| 2005 | The 2004 BBN 1xRT recognition systems for English broadcast news and conversational telephone speech. Spyros Matsoukas, Rohit Prasad, Srinivas Laxminarayan, Bing Xiang, Long Nguyen, Richard M. Schwartz |
| 2005 | The 2004 BBN/LIMSI 20xRT English conversational telephone speech recognition system. Rohit Prasad, Spyros Matsoukas, Chia-Lin Kao, Jeff Z. Ma, Dongxin Xu, Thomas Colthurst, Owen Kimball, Richard M. Schwartz, Jean-Luc Gauvain, Lori Lamel, Holger Schwenk, Gilles Adda, Fabrice Lefèvre |
| 2005 | The BBN Mandarin broadcast news transcription system. Bing Xiang, Long Nguyen, Xuefeng Guo, Dongxin Xu |
| 2005 | The BBN RT04 English broadcast news transcription system. Long Nguyen, Bing Xiang, Mohamed Afify, Sherif M. Abdou, Spyros Matsoukas, Richard M. Schwartz, John Makhoul |
| 2005 | The COST278 broadcast news segmentation and speaker clustering evaluation - overview, methodology, systems, results. Janez Zibert, France Mihelic, Jean-Pierre Martens, Hugo Meinedo, João Paulo Neto, Laura Docío Fernández, Carmen García-Mateo, Petr David, Jindrich Zdánský, Matús Pleva, Anton Cizmar, Andrej Zgank, Zdravko Kacic, Csaba Teleki, Klára Vicsi |
| 2005 | The Cambridge University March 2005 speaker diarisation system. Rohit Sinha, S. E. Tranter, Mark J. F. Gales, Philip C. Woodland |
| 2005 | The ESTER phase II evaluation campaign for the rich transcription of French broadcast news. Sylvain Galliano, Edouard Geoffrois, Djamel Mostefa, Khalid Choukri, Jean-François Bonastre, Guillaume Gravier |
| 2005 | The FASil speech and multimodal corpora. Hans Dolfing, David Reitter, Luís Almeida, Nuno Beires, Michael Cody, Rui Gomes, Kerry Robinson, Roman Zielinski |
| 2005 | The LIUM speech transcription system: a CMU Sphinx III-based system for French broadcast news. Paul Deléglise, Yannick Estève, Sylvain Meignier, Téva Merlin |
| 2005 | The PF_STAR children's speech corpus. Anton Batliner, Mats Blomberg, Shona D'Arcy, Daniel Elenius, Diego Giuliani, Matteo Gerosa, Christian Hacker, Martin J. Russell, Stefan Steidl, Michael Wong |
| 2005 | The Swedish NICE corpus - spoken dialogues between children and embodied characters in a computer game scenario. Linda Bell, Johan Boye, Joakim Gustafson, Mattias Heldner, Anders Lindström, Mats Wirén |
| 2005 | The analysis on band-limited hypernasal speech using group delay based formant extraction technique. P. Vijayalakshmi, M. Ramasubba Reddy |
| 2005 | The blizzard challenge - 2005: evaluating corpus-based speech synthesis on common datasets. Alan W. Black, Keiichi Tokuda |
| 2005 | The blizzard challenge 2005 CMU entry - a method for improving speech synthesis systems. John Kominek, Christina L. Bennett, Brian Langner, Arthur R. Toth |
| 2005 | The correspondences between the perception of the speaker individualities contained in speech sounds and their acoustic properties. Kanae Amino, Tsutomu Sugawara, Takayuki Arai |
| 2005 | The detection of emphatic words using acoustic and lexical features. Jason M. Brenier, Daniel M. Cer, Daniel Jurafsky |
| 2005 | The dialog application metalanguage GDialogXML. Volker Schubert, Stefan W. Hamerich |
| 2005 | The effect of stress and boundaries on segmental duration in a corpus of authentic speech (british English). Daniel Hirst, Caroline Bouzon |
| 2005 | The effects of prosodic features on the interpretation of clarification ellipses. Jens Edlund, David House, Gabriel Skantze |
| 2005 | The effects of speech recognition and punctuation on information extraction performance. John Makhoul, Alex Baron, Ivan Bulyko, Long Nguyen, Lance A. Ramshaw, David Stallard, Richard M. Schwartz, Bing Xiang |
| 2005 | The feature [sonorant] in lexical access. Danny R. Moates, Zinny S. Bond, Russell Fox, Verna Stockmal |
| 2005 | The focus prosody: more than a simple binary function. Véronique Aubergé, Albert Rilliard |
| 2005 | The hidden vector state language model. Vidura Seneviratne, Steve J. Young |
| 2005 | The intelligibility of tracheoesophageal speech: first results. P. Jongmans, Frans J. M. Hilgers, Louis C. W. Pols, Corina J. van As-Brooks |
| 2005 | The interrelation between the perception and production of English vowels by native speakers of Brazilian Portuguese. Andréia S. Rauber, Paola Escudero, Ricardo Augusto Hoffmann Bion, Barbara O. Baptista |
| 2005 | The labial-coronal effect and CVCV stability during reiterant speech production: an acoustic analysis. Amélie Rochet-Capellan, Jean-Luc Schwartz |
| 2005 | The labial-coronal effect and CVCV stability during reiterant speech production: an articulatory analysis. Amélie Rochet-Capellan, Jean-Luc Schwartz |
| 2005 | The lexical statistics of word recognition problems caused by L2 phonetic confusion. Anne Cutler |
| 2005 | The multiple pronunciations in Taiwanese and the automatic transcription of Buddhist sutra with augmented read speech. Yuang-Chin Chiang, Min-Siong Liang, Hong-Yi Lin, Ren-yuan Lyu |
| 2005 | The multiple-channel cochlear implant: interfacing electronic technology to human consciousness. Graeme M. Clark |
| 2005 | The predictive differential amplitude spectrum for robust speaker recognition in stationary noises. Jing Deng, Thomas Fang Zheng, Jian Liu, Wenhu Wu |
| 2005 | The prosodic dimensions of emotion in speech: the relative weights of parameters. Nicolas Audibert, Véronique Aubergé, Albert Rilliard |
| 2005 | The simulation of realistic acoustic input scenarios for speech recognition systems. Hans-Günter Hirsch, Harald Finster |
| 2005 | The stress foot as a unit of planned timing: evidence from shortening in the prosodic phrase. Heejin Kim, Jennifer Cole |
| 2005 | The working memory token test (WMTT): preliminary findings in young adults with and without dyslexia. Shimon Sapir, Ravit Cohen Mimran |
| 2005 | Tightly integrated spoken language understanding using word-to-concept translation. Katsuhito Sudoh, Hajime Tsukada |
| 2005 | Timing of experimentally elicited minimal responses as quantitative evidence for the use of intonation in projecting TRPs. Wieneke Wesseling, R. J. J. H. van Son |
| 2005 | To recover from speech recognition errors in spoken document retrieval. Mikko Kurimo, Ville T. Turunen |
| 2005 | Tone recognition in Mandarin using focus. Dinoj Surendran, Gina-Anne Levow, Yi Xu |
| 2005 | Tongue kinematics in diphthong production in Ningbo Chinese. Fang Hu |
| 2005 | Toward multiple-language TTS: experiments in English and Mandarin. Raul Fernandez, Wei Zhang, Ellen Eide, Raimo Bakis, Wael Hamza, Yi Liu, Michael Picheny, John F. Pitrelli, Yong Qing, Zhiwei Shuang, Li Qin Shen |
| 2005 | Towards generic quality prediction models for spoken dialogue systems - a case study. Sebastian Möller |
| 2005 | Towards generic spatial object model and route guidance grammar for speech-based systems. Perttu Prusi, Anssi Kainulainen, Jaakko Hakulinen, Markku Turunen, Esa-Pekka Salonen, Leena Helin |
| 2005 | Towards user modelling in conversational dialogue systems: a qualitative study of the dynamics of dialogue parameters. Anna Hjalmarsson |
| 2005 | Towards voiceXML compilation for portable embedded applications in ubiquitous environments. Dirk Bühler, Stefan W. Hamerich |
| 2005 | Training a maximum entropy model for surface realization. Hua Cheng, Fuliang Weng, Niti Hantaweepant, Lawrence Cavedon, Stanley Peters |
| 2005 | Training the tilt intonation model using the JEMA methodology. Matej Rojc, Pablo Daniel Agüero, Antonio Bonafonte, Zdravko Kacic |
| 2005 | Transcribing lectures and seminars. Lori Lamel, Gilles Adda, Éric Bilinski, Jean-Luc Gauvain |
| 2005 | Transcription of conference room meetings: an investigation. Thomas Hain, John Dines, Giulia Garau, Martin Karafiát, Darren Moore, Vincent Wan, Roeland Ordelman, Steve Renals |
| 2005 | Tree-based prediction of prosodic phrase breaks on top of shallow textual features. Gerasimos Xydas, Panagiotis Zervas, Georgios Kouroupetroglou, Nikolaos D. Fakotakis, George K. Kokkinakis |
| 2005 | Trigger-based language model adaptation for automatic meeting transcription. Carlos Troncoso, Tatsuya Kawahara |
| 2005 | Two experiments comparing reading with listening for human processing of conversational telephone speech. Douglas A. Jones, Wade Shen, Elizabeth Shriberg, Andreas Stolcke, Teresa M. Kamm, Douglas A. Reynolds |
| 2005 | Two-pass strategy for handling OOVs in a large vocabulary recognition task. Odette Scharenborg, Stephanie Seneff |
| 2005 | Understanding phonology by phonetic implementation. Chilin Shih |
| 2005 | Unified probabilistic approach to error concealment for distributed speech recognition. Valentin Ion, Reinhold Haeb-Umbach |
| 2005 | Unit selection for speech synthesis based on a new acoustic target cost. Soufiane Rouibia, Olivier Rosec |
| 2005 | Unit selection synthesis database development using utterance verification. Ingunn Amdal, Torbjørn Svendsen |
| 2005 | Unsupervised clustering of spontaneous speech documents. Edgar Gonzàlez, Jordi Turmo |
| 2005 | Unsupervised identification of speech segments using kernel methods for clustering. José Anibal Arias |
| 2005 | Unsupervised segmentation and verification of multi-speaker conversational speech. Emanuele Dalmasso, Pietro Laface, Daniele Colibro, Claudio Vair |
| 2005 | Unsupervised segmentation of continuous speech using vector autoregressive time-frequency modeling errors. Petri Korhonen, Unto K. Laine |
| 2005 | Use of maximum entropy in natural word generation for statistical concept-based speech-to-speech translation. Liang Gu, Yuqing Gao |
| 2005 | User evaluation of conversational agent h. c. Andersen. Niels Ole Bernsen, Laila Dybkjær |
| 2005 | User's experience of a commercial speech dialogue system. Fang Chen, Yael Katzenellenbogen |
| 2005 | Using Hadamard ECOC in multi-class problems based on SVM. An-rong Yin, Xiang Xie, Jingming Kuang |
| 2005 | Using MLP features in SRI's conversational speech recognition system. Qifeng Zhu, Andreas Stolcke, Barry Y. Chen, Nelson Morgan |
| 2005 | Using context to improve emotion detection in spoken dialog systems. Jackson Liscombe, Giuseppe Riccardi, Dilek Hakkani-Tür |
| 2005 | Using dynamic codebook re-ordering to exploit inter-frame correlation in MELP coders. Venkatesh Krishnan, Thomas P. Barnwell III, David V. Anderson |
| 2005 | Using inter-frequency decorrelation to reduce the permutation inconsistency problem in blind source separation. Enrique Robledo-Arnuncio, Biing-Hwang Juang |
| 2005 | Using morphology and phoneme history to improve grapheme-to-phoneme conversion. Uwe D. Reichel, Florian Schiel |
| 2005 | Using open quotient for the characterisation of vietnamese glottalised tones. Vu Ngoc Tuan, Christophe d'Alessandro, Alexis Michaud |
| 2005 | Using output probability distribution for improving speech recognition in adverse environment. Shilei Huang, Xiang Xie, Jingming Kuang |
| 2005 | Using phonetic constraints in acoustic-to-articulatory inversion. Blaise Potard, Yves Laprie |
| 2005 | Using prosodic information for disambiguation purposes. Roberto Gretter, Dino Seppi |
| 2005 | Using random forest language models in the IBM RT-04 CTS system. Peng Xu, Lidia Mangu |
| 2005 | Using symbolic prominence to help design feature subsets for topic classification and clustering of natural human-human conversations. Constantinos Boulis, Mari Ostendorf |
| 2005 | Using the focus of visual attention to improve spontaneous speech recognition. Neil Cooke, Martin J. Russell |
| 2005 | Using word-level pitch features to better predict student emotions during spoken tutoring dialogues. Mihai Rotaru, Diane J. Litman |
| 2005 | Utterance verification incorporating in-domain confidence and discourse coherence measures. Ian R. Lane, Tatsuya Kawahara |
| 2005 | Variability of F0 peak alignment in moroccan Arabic accentual focus. Mohamed Yeou |
| 2005 | Variability of automatic speech recognition systems using different features. Loïc Barrault, Renato De Mori, Roberto Gemello, Franco Mana, Driss Matrouf |
| 2005 | Variable step size adaptive decorrelation filtering for competing speech separation. Rong Hu, Yunxin Zhao |
| 2005 | Variance reduction by using separate genuine- impostor statistics in multimodal biometrics. Pascual Ejarque, Javier Hernando |
| 2005 | Variational Bayesian speaker change detection. Fabio Valente, Christian Wellekens |
| 2005 | Vietnamese large vocabulary continuous speech recognition. Thang Tat Vu, Dung Tien Nguyen, Chi Mai Luong, John-Paul Hosom |
| 2005 | Visual cues in Mandarin tone perception. Hansjörg Mixdorff, Yu Hu, Denis Burnham |
| 2005 | Visual perception of anticipatory rounding gestures in French. Johanna-Pascale Roy |
| 2005 | Visualization of spoken dialogue systems for demonstration, debugging and tutoring. Jaakko Hakulinen, Markku Turunen, Esa-Pekka Salonen |
| 2005 | Vocal tract area function inversion by linear regression of cepstrum. Parham Mokhtari, Tatsuya Kitamura, Hironori Takemoto, Kiyoshi Honda |
| 2005 | Voice activity detection based on optimally weighted combination of multiple features. Yusuke Kida, Tatsuya Kawahara |
| 2005 | Voice and aspiration in German and east bengali stops: a cross-language study. Simone Mikuteit |
| 2005 | Voice and emotional expression transformation based on statistics of vowel parameters in an emotional speech database. Toru Takahashi, Takeshi Fujii, Masashi Nishi, Hideki Banno, Toshio Irino, Hideki Kawahara |
| 2005 | Voice quality and f0 cues for affect expression: implications for synthesis. Irena Yanushevskaya, Christer Gobl, Ailbhe Ní Chasaide |
| 2005 | Voice quality assessment by means of comparative judgments of speech tokens. Abdellah Kacha, Francis Grenez, Jean Schoentgen |
| 2005 | Voice quality dimensions of pitch accents. Britta Lintfert, Wolfgang Wokurek |
| 2005 | Voice quality in down syndrome children treated with rapid maxillary expansion. Carla Pinto Moura, D. Andrade, Luis M. Cunha, Maria J. Cunha, Helena Vilarinho, Henrique Barros, Diamantino Freitas, M. Pais-Clemente |
| 2005 | Voice quality interpolation for emotional text-to-speech synthesis. Oytun Türk, Marc Schröder, Baris Bozkurt, Levent M. Arslan |
| 2005 | Voice quality of falling tones in taiwan min. Ho-Hsien Pan |
| 2005 | Voice transformation using principle component analysis based LSF quantization and dynamic programming approach. Özgül Salor, Mübeccel Demirekler |
| 2005 | Voice user interface design for automated directory assistance. Esther Levin, Amir M. Mané |
| 2005 | Voice-controlled internet browsing for motor-handicapped users. design and implementation issues. Tom Brøndsted, Erik Aaskoven |
| 2005 | Voiced excitation as entrained primary response of a reconstructed glottal master oscillator. Friedhelm R. Drepper |
| 2005 | Voicing features for robust speech detection. Trausti T. Kristjansson, Sabine Deligne, Peder A. Olsen |
| 2005 | Vowel devoicing vs. mora-timed rhythm in spontaneous Japanese - inspection of phonetic labels of OGI_TS. Masahiko Komatsu, Makiko Aoyagi |
| 2005 | WPD-based noise suppression using nonlinearly weighted threshold quantile estimation and optimal wavelet shrinking. Tuan Van Pham, Gernot Kubin |
| 2005 | Webtalk: mining websites for interactively answering questions. Junlan Feng, Srihari Reddy, Murat Saraclar |
| 2005 | Where are we in transcribing French broadcast news? Jean-Luc Gauvain, Gilles Adda, Martine Adda-Decker, Alexandre Allauzen, Véronique Gendner, Lori Lamel, Holger Schwenk |
| 2005 | Which Italian do current systems speak? a first step towards pronunciation modelling of Italian varieties. Michelina Savino, Mario Refice, Massimo Mitaritonna |
| 2005 | Whistled speech: a natural phonetic description of languages adapted to human perception and to the acoustical environment. Julien Meyer |
| 2005 | Word error rate minimization using an integrated confidence measure. Akio Kobayashi, Kazuo Onoe, Shoei Sato, Toru Imai |
| 2005 | Ya-ya language box - a portable device for English pronunciation training with speech recognition technologies. Fu-Chiang Chou |
| 2005 | dPLRM-based speaker identification with log power spectrum. Tomoko Matsui, Kunio Tanabe |