| 2001 | "CU-move" : analysis & corpus development for interactive in-vehicle speech systems. John H. L. Hansen, Pongtep Angkititrakul, Jay P. Plucienkowski, Stephen Gallant, Umit H. Yapanel, Bryan L. Pellom, Wayne H. Ward, Ronald A. Cole |
| 2001 | A baseline method for compiling typed unification grammars into context free language models. Manny Rayner, John Dowding, Beth Ann Hockey |
| 2001 | A boosting approach for confidence scoring. Pedro J. Moreno, Beth Logan, Bhiksha Raj |
| 2001 | A case for multi-resolution auditory scene analysis. Sue Harding, Georg F. Meyer |
| 2001 | A comparative study of MLP-based artificial neural networks in text-independent speaker verification against GMM-based systems. Carlos E. Vivaracho, Javier Ortega-Garcia, Luis Alonso, Q. Isaac Moro |
| 2001 | A comparative study of pauses in dialogues and read speech. Sofia Gustafson-Capková, Beáta Megyesi |
| 2001 | A comparison between human vowel normalization strategies and acoustic vowel transformation techniques. Patti Adank, Roeland van Hout, Roel Smits |
| 2001 | A comparison of LPC and FFT-based acoustic features for noise robust ASR. Febe de Wet, Bert Cranen, Johan de Veth, Lou Boves |
| 2001 | A comparison of some different techniques for vector based call-routing. Stephen Cox, Ben Shahshahani |
| 2001 | A component by component listening test analysis of the IBM trainable speech synthesis system. Robert E. Donovan |
| 2001 | A computational efficient real time noise robust speech recognition based on improved spectral subtraction method. Bojan Kotnik, Zdravko Kacic, Bogomir Horvat |
| 2001 | A context adaptation approach for building context dependent models in LVCSR. Xiaoxing Liu, Baosheng Yuan, Yonghong Yan |
| 2001 | A data selection strategy for utterance verification in continuous speech recognition. Hui Jiang, Frank K. Soong, Chin-Hui Lee |
| 2001 | A dutch treatment of an elitist approach to articulatory-acoustic feature classification. Mirjam Wester, Steven Greenberg, Shuangyu Chang |
| 2001 | A face-to-muscle inversion of a biomechanical face model for audiovisual and motor control research. Michel Pitermann, Kevin G. Munhall |
| 2001 | A fast calculation method in LVCSRS by time-skipping and clustering of probability density distributions. Seiichi Nakagawa, Yukihisa Horibe |
| 2001 | A flexible multilingual TTS development and speech research tool. Géza Kiss, Géza Németh, Gábor Olaszy, Géza Gordos |
| 2001 | A functional approach to speech recognition evaluation. Ben Hutchinson |
| 2001 | A fundamental frequency estimation method for noisy speech based on instantaneous amplitude and frequency. Yuichi Ishimoto, Masashi Unoki, Masato Akagi |
| 2001 | A generalized multistage VQ approach for spectral magnitude quantization. Çagri Özgenc Etemoglu, Vladimir Cuperman |
| 2001 | A hybrid approach to enhance task portability of acoustic models in Chinese speech recognition. Jinsong Zhang, Shuwu Zhang, Yoshinori Sagisaka, Satoshi Nakamura |
| 2001 | A hybrid sub-band sinusoidal coding scheme. Meau Shin Ho, Derek J. Molyneux, Barry M. G. Cheetham |
| 2001 | A mixture of Gaussians front end for speech recognition. Matthew N. Stuttle, Mark J. F. Gales |
| 2001 | A model of F0 contour for arabic affirmative and interrogative sentences. Omar A. G. Ibrahim, Salwa H. El-Ramly, Nemat S. Abdel Kader |
| 2001 | A model of vowel production under positive pressure breathing. Allan J. South |
| 2001 | A multi-SNR subband model for speaker identification under noisy environments. Kenichi Yoshida, Kazuyuki Takagi, Kazuhiko Ozeki |
| 2001 | A multi-band approach based on the probabilistic union model and frequency-filtering features for robust speech recognition. Peter Jancovic, Ji Ming |
| 2001 | A multiconditional robust front-end feature extraction with a noise reduction procedure based on improved spectral subtraction algorithm. Bojan Kotnik, Zdravko Kacic, Bogomir Horvat |
| 2001 | A multidimensional scaling study of fricatives; a comparison of perceptual and physical dimensions. Wan Tokuma |
| 2001 | A multilingual, multimodal, speech training system, SPECO. Klára Vicsi, Peter Roach, Anne-Marie Öster, Zdravko Kacic, Ferenc Csatári, Anna Sfakianaki, R. Veronik, Géza Gordos |
| 2001 | A multilingual-supporting dialog system using a common dialog controller. YunBiao Xu, Masahiro Araki, Yasuhisa Niimi |
| 2001 | A new DP-like speaker clustering algorithm. Zhijian Ou, Zuoying Wang |
| 2001 | A new approach for wavelet speech enhancement. Mohammed Bahoura, Jean Rouat |
| 2001 | A new auditory based microphone array and objective evaluation using e-RASTI. José-Luis Sánchez-Bote, Joaquin Gonzalez-Rodriguez, Danilo Simon-Zorita |
| 2001 | A new dynamic HMM model for speech recognition. Feili Chen, Eric Chang |
| 2001 | A new feature driven cochlear implant speech processing strategy. Dashtseren Erdenebat, Shigeyoshi Kitazawa, Tatsuya Kitamura |
| 2001 | A new method for speech denoising and robust speech recognition using probabilistic models for clean speech and for noise. Hagai Attias, Li Deng, Alex Acero, John C. Platt |
| 2001 | A new method for speech recognition in the presence of non-stationary, unpredictable and high-level noise. Ikuyo Masuda-Katsuse |
| 2001 | A new method for testing communication efficiency and user acceptability of speech communication channels. Sander J. van Wijngaarden, Paula M. T. Smeele, Herman J. M. Steeneken |
| 2001 | A new multi-speaker formant synthesizer that applies voice conversion techniques. Juana M. Gutiérrez-Arriola, Juan Manuel Montero, José A. Vallejo, Ricardo de Córdoba, Rubén San Segundo, José Manuel Pardo |
| 2001 | A new technique based on augmented language models to improve the performance of spoken dialogue systems. Ramón López-Cózar, Diego H. Milone |
| 2001 | A new verification-based fast match approach to large vocabulary speech recognition. Feng Liu, Mohamed Afify, Hui Jiang, Olivier Siohan |
| 2001 | A novel algorithm for rapid speaker adaptation based on structural maximum likelihood eigenspace mapping. Bowen Zhou, John H. L. Hansen |
| 2001 | A novel target-driven MLLR adaptation algorithm with multi-layer structure. Lei Jia, Bo Xu |
| 2001 | A one pass semi-dynamic network decoder based on language model network. Dong-Hoon Ahn, Minhwa Chung |
| 2001 | A perspective on industry/university relationships in the US. Gary W. Strong |
| 2001 | A physiological analysis of nasals and nasalization in Chinese. Wing-Nga Fung, Sze-Lok Lau |
| 2001 | A portability study on natural language call steering. Hong-Kwang Jeff Kuo, Chin-Hui Lee |
| 2001 | A posteriori and a priori transformations for speaker adaptation in large vocabulary speech recognition systems. Driss Matrouf, Olivier Bellot, Pascal Nocera, Georges Linarès, Jean-François Bonastre |
| 2001 | A proposed method for measuring language dependency of narrow band voice coders. Sander J. van Wijngaarden, Herman J. M. Steeneken |
| 2001 | A quasi-one-dimensional model of aerodynamic and acoustic flow in the time-varying vocal tract: source and excitation mechanisms. Gordon Ramsay |
| 2001 | A real-time Japanese broadcast news closed-captioning system. Olivier Siohan, Akio Ando, Mohamed Afify, Hui Jiang, Chin-Hui Lee, Qi Li, Feng Liu, Kazuo Onoe, Frank K. Soong, Qiru Zhou |
| 2001 | A robust front-end algorithm for distributed speech recognition. Yan Ming Cheng, Dusan Macho, Yuanjun Wei, Douglas Ealey, Holly Kelleher, David Pearce, William Kushner, Tenkasi Ramabadran |
| 2001 | A robust front-end for ASR over IP snd GSM networks: an integrated scenario. Ascensión Gallardo-Antolín, Carmen Peláez-Moreno, Fernando Díaz-de-María |
| 2001 | A robust speaker verification system against imposture using an HMM-based speech synthesis system. Takayuki Satoh, Takashi Masuko, Takao Kobayashi, Keiichi Tokuda |
| 2001 | A rule based approach to extraction of topics and dialog acts in a spoken dialog system. Yasuhisa Niimi, Tomoki Oku, Takuya Nishimoto, Masahiro Araki |
| 2001 | A segmental mixture model for speaker recognition. Robert P. Stapert, John S. D. Mason |
| 2001 | A structured statistical language model conditioned by arbitrarily abstracted grammatical categories based on GLR parsing. Tomoyosi Akiba, Katunobu Itou |
| 2001 | A study of speech coding parameters in speech recognition. Jari Juhani Turunen, Damjan Vlaj |
| 2001 | A study on speech over the telephone and aging. Maxine Eskénazi, Alan W. Black |
| 2001 | A study on the production-perception link of English vowels produced by native and non-native speakers. Byunggon Yang |
| 2001 | A switched DPCM/subband coder for pre-echo reduction. S. Satheesh, T. V. Sreenivas |
| 2001 | A system for text dependent speaker verification - field trial evaluation and simulation results. Holger Schalk, Herbert Reininger, Stephan Euler |
| 2001 | A testbed for developing multilingual phonotactic descriptions. Simone Ashby, Julie Carson-Berndsen, Gina Joue |
| 2001 | A text-independent speaker verification system using support vector machines classifier. Yong Gu, Trevor Thomas |
| 2001 | A theme structure method for the ellipsis resolution. Yinfei Huang, Fang Zheng, Yi Su, Fang Li, Wenhu Wu |
| 2001 | A time-varying complex AR speech analysis based on GLS and ELS method. Keiichi Funaki |
| 2001 | A tool for automatic feedback on phonemic transcription. Martin Cooke, María Luisa García Lecumberri, John A. Maidment |
| 2001 | A transducer approach to word graph generation. Gilles Boulianne, Pierre Ouellet, Pierre Dumouchel |
| 2001 | A two-layer lexical tree based beam search in continuous Chinese speech recognition. Guoliang Zhang, Fang Zheng, Wenhu Wu |
| 2001 | A variable rate hybrid coder based on a synchronized harmonic excitation. Nilantha Katugampala, Ahmet M. Kondoz |
| 2001 | A weight pushing algorithm for large vocabulary speech recognition. Mehryar Mohri, Michael Riley |
| 2001 | A word graph interface for a flexible concept based speech understanding framework. Kadri Hacioglu, Wayne H. Ward |
| 2001 | A word- and turn-oriented approach to exploring the structure of Mandarin dialogues. Shu-Chuan Tseng |
| 2001 | ALGONQUIN: iterating laplace's method to remove multiple types of acoustic distortion for robust speech recognition. Brendan J. Frey, Li Deng, Alex Acero, Trausti T. Kristjansson |
| 2001 | AMR wideband codec - leap in mobile communication voice quality. J. Rotola-Pukkila, Janne Vainio, Hannu Mikkola, Kari Järvinen, Bruno Bessette, Roch Lefebvre, Redwan Salami, Milan Jelinek |
| 2001 | AMSTIVOC (AMsterdam system for transcription of infant VOCalizations) applied to utterances of deaf and normally hearing infants. Florien J. Koopmans-van Beinum, Chris J. Clement, Ineke Van den Dikkenberg-Pot |
| 2001 | ANVIL - a generic annotation tool for multimodal dialogue. Michael Kipp |
| 2001 | ASR - articulatory speech recognition. Joe Frankel, Simon King |
| 2001 | Accent label prediction by time delay neural networks using gating clusters. Achim F. Müller, Rüdiger Hoffmann |
| 2001 | Accent-independent universal HMM-based speech recognizer for american, australian and british English. Rathinavelu Chengalvarayan |
| 2001 | Acoustic correlates of emotion dimensions in view of speech synthesis. Marc Schröder, Roddy Cowie, Ellen Douglas-Cowie, Machiel Westerdijk, Stan C. A. M. Gielen |
| 2001 | Acoustic echo control and noise reduction for cabin car communication. Eduardo Lleida, Enrique Masgrau, Alfonso Ortega |
| 2001 | Acoustic feature compensation based on decomposition of speech and noise for ASR in noisy environments. Hong Kook Kim, Richard C. Rose, Hong-Goo Kang |
| 2001 | Acoustic modeling of foreign words in a German speech recognition system. Georg Stemmer, Elmar Nöth, Heinrich Niemann |
| 2001 | Acoustical and topological experiments for an HMM-based speech segmentation system. Samir Nefti, Olivier Boëffard |
| 2001 | Acquiring and implementing phonetic knowledge. Louis C. W. Pols |
| 2001 | Additive and convolutional noise canceling in speaker verification using a stochastic weighted viterbi algorithm. Néstor Becerra Yoma, Miguel Villar Fernandez |
| 2001 | Advances in automatic speech summarization. Chiori Hori, Sadaoki Furui |
| 2001 | African speech technology (AST) telephone speech databases: corpus design and contents. Philippa H. Louw, Justus C. Roux, Elizabeth C. Botha |
| 2001 | Agent-based error handling in spoken dialogue systems. Markku Turunen, Jaakko Hakulinen |
| 2001 | Aligning prosody and syntax in property grammars. Philippe Blache, Daniel Hirst |
| 2001 | Ambiguity representation and resolution in spoken dialogue systems. Egbert Ammicht, Alexandros Potamianos, Eric Fosler-Lussier |
| 2001 | An HMM/n-gram-based linguistic processing approach for Mandarin spoken document retrieval. Berlin Chen, Hsin-Min Wang, Lin-Shan Lee |
| 2001 | An MCE based classification tree using hierarchical feature-weighting in speech recognition. Fan Wang, Fang Zheng, Wenhu Wu |
| 2001 | An acoustical analysis of the vowels in beijing Mandarin. Eric Zee, Wai-Sum Lee |
| 2001 | An algorithm for finding line spectrum frequencies of added speech signals and its application to robust speech recognition. An-Tze Yu, Hsiao-Chuan Wang |
| 2001 | An approach to an Italian talking head. Catherine Pelachaud, Emanuela Magno Caldognetto, Claudio Zmarich, Piero Cosi |
| 2001 | An approach to automatic phonetic baseform generation based on Bayesian networks. Changxue Ma, Mark A. Randolph |
| 2001 | An auditory system-based feature for robust speech recognition. Qi Li, Frank K. Soong, Olivier Siohan |
| 2001 | An automatic dialogue system generator from the internet information contents. Masahiro Araki, Tasuku Ono, Kiyoshi Ueda, Takuya Nishimoto, Yasuhisa Niimi |
| 2001 | An efficient implementation of phonological rules using finite-state transducers. I. Lee Hetherington |
| 2001 | An efficient lipreading method using the symmetry of lip. Joohun Lee, Jin Young Kim |
| 2001 | An efficient transcoding algorithm for g.723.1 and g.729a speech coders. Sung-Wan Yoon, Sung-Kyo Jung, Young-Cheol Park, Dae Hee Youn |
| 2001 | An elitist approach to articulatory-acoustic feature classification. Shuangyu Chang, Steven Greenberg, Mirjam Wester |
| 2001 | An embodiment paradigm for speech recognition systems. Gina Joue, Julie Carson-Berndsen |
| 2001 | An improved wavelet-based speech enhancement system. Hamid Sheikhzadeh, Hamid Reza Abutalebi |
| 2001 | An interactive directory assistance service for Spanish with large-vocabulary recognition. Ricardo de Córdoba, Rubén San Segundo, Juan Manuel Montero, José Colás, Javier Ferreiros, Javier Macías Guarasa, José Manuel Pardo |
| 2001 | An investigation of HMM classifier combination strategies for improved audio-visual speech recognition. Simon Lucey, Sridha Sridharan, Vinod Chandran |
| 2001 | An investigation of modelling aspects for ratedependent speech recognition. Britta Wrede, Gernot A. Fink, Gerhard Sagerer |
| 2001 | An objective measure for assessment of the concatenative TTS segment inventories. Robert Batusek |
| 2001 | An objective measure for estimating MOS of synthesized speech. Min Chu, Hu Peng |
| 2001 | An online incremental language model adaptation method. Genqing Wu, Fang Zheng, Ling Jin, Wenhu Wu |
| 2001 | Analysis of n-best output hypotheses for fast speech in large vocabulary continuous speech recognition. Tibor Fábián, Thilo Pfau, Günther Ruske |
| 2001 | Analysis of speaker variability. Chao Huang, Tao Chen, Stan Z. Li, Eric Chang, Jian-Lai Zhou |
| 2001 | Analysis of the root-cepstrum for acoustic modeling and fast decoding in speech recognition. Ruhi Sarikaya, John H. L. Hansen |
| 2001 | Analysis of the voiced speech using the generalized fourier transform with quadratic phase. Davor Petrinovic, Vladimir Cuperman |
| 2001 | Aperiodicity control in ARX-based speech analysis-synthesis method. Takahiro Ohtsuka, Hideki Kasuya |
| 2001 | Application of the trended hidden Markov model to speech synthesis. John Dines, Sridha Sridharan, Miles Moody |
| 2001 | Applying parallel model compensation with mel-frequency discrete wavelet coefficients for noise-robust speech recognition. Zekeriya Tufekci, John N. Gowdy, Sabri Gurbuz, Eric K. Patterson |
| 2001 | Architecture for adaptive multimodal dialog systems based on voiceXML. Georg Niklfeld, Robert Finan, Michael Pucher |
| 2001 | Aspects of modern multi-modal/multi-media corpora exploitation environments. Daan Broeder, Hennie Brugman, Peter Wittenburg |
| 2001 | Auditory filter bank design using masking curves. Lee Lin, Eliathamby Ambikairajah, W. Harvey Holmes |
| 2001 | Auditory model based speech recognition in noisy environment. Xiaoqing Yu, Wanggen Wan, Daniel Pak-Kong Lun |
| 2001 | Auditory visual speech processing. Dominic W. Massaro |
| 2001 | Auditory-visual perception of lexical tone. Denis Burnham, Valter Ciocca, Stephanie Stokes |
| 2001 | Automated modeling of Chinese intonation in continuous speech. Greg Kochanski, Chilin Shih |
| 2001 | Automatic analysis of real dialogues and generating of training corpora. Jana Schwarz, Václav Matousek |
| 2001 | Automatic construction of CALL system from TV news program with captions. Takashi Tanaka, Kazumasa Mori, Satoshi Kobayashi, Seiichi Nakagawa |
| 2001 | Automatic labeling and digesting for lecture speech utilizing repeated speech by shift CDP. Yoshiaki Itoh, Kazuyo Tanaka |
| 2001 | Automatic learning of finite state automata for pronunciation modeling. Moisés Pastor-i-Gadea, Francisco Casacuberta |
| 2001 | Automatic n-gram language model creation from web resources. Ryuichi Nisimura, Kumiko Komatsu, Yuka Kuroda, Kentaro Nagatomo, Akinobu Lee, Hiroshi Saruwatari, Kiyohiro Shikano |
| 2001 | Automatic prosody generation - a model for hungarian. Gábor Olaszy, Géza Németh, Péter Olaszi |
| 2001 | Automatic rhythm modeling for language identification. Jérôme Farinas, François Pellegrino |
| 2001 | Automatic segmentation of recorded speech into syllables for speech synthesis. Eric Lewis, Mark Tatham |
| 2001 | Automatic word acquisition from continuous speech. Helmut Lucke, Masanori Omote |
| 2001 | Autoregressive time-frequency interpolation in the context of missing data theory for impulsive noise compensation. Ilyas Potamitis, Nikos Fakotakis |
| 2001 | Back-off smoothing evaluation over syntactic language models. Amparo Varona, Inés Torres |
| 2001 | Background learning of speaker voices for textindependent speaker identification. Wei-Ho Tsai, Y. C. Chu, Chao-Shih Huang, Wen-Whei Chang |
| 2001 | Bayesian methods for HMM speech recognition with limited training data. Darryl W. Purnell, Elizabeth C. Botha |
| 2001 | Blind source separation for speech based on fast-convergence algorithm with ICA and beamforming. Hiroshi Saruwatari, Toshiya Kawamura, Kiyohiro Shikano |
| 2001 | Blind speech separation of moving speakers using hybrid neural networks. Athanasios Koutras, Evangelos Dermatas, George K. Kokkinakis |
| 2001 | Boiling down prosody for the classification of boundaries and accents in German and English. Anton Batliner, Jan Buckow, Richard Huber, Volker Warnke, Elmar Nöth, Heinrich Niemann |
| 2001 | Breadth-first search for finding the optimal phonetic transcription from multiple utterances. Maximilian Bisani, Hermann Ney |
| 2001 | Broadcast news LM adaptation using contemporary texts. Marcello Federico, Nicola Bertoldi |
| 2001 | Building a corpus of natural speech - and tools for the processing of expressive speech. Nick Campbell |
| 2001 | Building an integrated prosodic model of German. Hansjörg Mixdorff, Oliver Jokisch |
| 2001 | Burst segmentation and evaluation of acoustic cues. Yves Laprie, Anne Bonneau |
| 2001 | Business listings in automatic directory assistance. Odette Scharenborg, Janienke Sturm, Lou Boves |
| 2001 | Calibration of microphone arrays for improved speech recognition. Michael L. Seltzer, Bhiksha Raj |
| 2001 | Caller identification for the SCANMail voicemail browser. Aaron E. Rosenberg, Julia Hirschberg, Michiel Bacchiani, Sarangarajan Parthasarathy, Philip L. Isenhour, Larry Stead |
| 2001 | Cantonese text-to-speech synthesis using sub-syllable units. Ka Man Law, Tan Lee, Wai H. Lau |
| 2001 | Class definition in discriminant feature analysis. Jacques Duchateau, Kris Demuynck, Dirk Van Compernolle, Patrick Wambacq |
| 2001 | Classification of transition sounds with application to automatic speech recognition. Zeev Litichever, Dan Chazan |
| 2001 | Classification of video genre using audio. Matthew Roach, John S. D. Mason |
| 2001 | Classifying emotions in speech: a comparison of methods. Noam Amir, Ori Kerret, Dimitry Karlinski |
| 2001 | Coarticulatory effects at prosodic boundaries: some acoustic results. Marija Tabain, Guillaume Rolland, Christophe Savariaux |
| 2001 | Coarticulatory effects in perception. Santiago Fernández, Sergio Feijóo |
| 2001 | Coding method for successive pitch periods. Ari Heikkinen, Vesa T. Ruoppila, Samuli Pietilä |
| 2001 | Cohorts based custom models for rapid speaker and dialect adaptation. Jian Wu, Eric Chang |
| 2001 | Combined front-end signal processing for in-vehicle speech systems. Jay P. Plucienkowski, John H. L. Hansen, Pongtep Angkititrakul |
| 2001 | Combined linear regression adaptation and Bayesian predictive classification for robust speech recognition. Jen-Tzung Chien |
| 2001 | Combined speech and audio coding with bit rate and bandwidth scalability. Maria Farrugia, Ahmet M. Kondoz |
| 2001 | Combining GMM's with suport vector machines for text-independent speaker verification. Jamal Kharroubi, Dijana Petrovska-Delacrétaz, Gérard Chollet |
| 2001 | Combining multi-party speech and text exchanges over the internet. Niels Ole Bernsen, Laila Dybkjær |
| 2001 | Combining word- and class-based language models: a comparative study in several languages using automatic and manual word-clustering techniques. Giulio Maltese, Paolo Bravetti, H. Crépy, B. J. Grainger, M. Herzog, Francisco Palou |
| 2001 | Communication aid for non-vocal people using corpusbased concatenative speech synthesis. Akemi Iida, Yosuke Sakurada, Nick Campbell, Michiaki Yasumura |
| 2001 | Compact word graph in spoken dialogue system. Shih-Chieh Chien, Sen-Chia Chang |
| 2001 | Comparative analysis for data-driven temporal filters obtained via principal component analysis (PCA) and linear discriminant analysis (LDA) in speech recognition. Jeih-weih Hung, Hsin-Min Wang, Lin-Shan Lee |
| 2001 | Comparative evaluation of F0 estimation algorithms. Alain de Cheveigné, Hideki Kawahara |
| 2001 | Comparing audio- and a-posteriori-probability-based stream confidence measures for audio-visual speech recognition. Martin Heckmann, Thorsten Wild, Frédéric Berthommier, Kristian Kroschel |
| 2001 | Comparing grammar-based and robust approaches to speech understanding: a case study. Sylvia Knight, Genevieve Gorrell, Manny Rayner, David Milward, Rob Koeling, Ian Lewin |
| 2001 | Comparing parameter tying methods for multilingual acoustic modelling. Mikko Harju, Petri Salmela, Jussi Leppänen, Olli Viikki, Jukka Saarinen |
| 2001 | Comparing the performance of two CSRs: how to determine the significance level of the differences. Helmer Strik, Catia Cucchiarini, Judith M. Kessens |
| 2001 | Comparing word-level intelligibility after linear vs. non-linear time-compression. Esther Janse |
| 2001 | Comparison of MFCC and PLP parameterizations in the speaker independent continuous speech recognition task. Josef Psutka, Ludek Müller, Josef V. Psutka |
| 2001 | Comparison of spectral derivative parameters for robust speech recognition. Dusan Macho, Climent Nadeu |
| 2001 | Comparison of width-wise and length-wise language model compression. Edward W. D. Whittaker, Bhiksha Raj |
| 2001 | Computationally efficient frequency-domain combination of acoustic echo cancellation and robust adaptive beamforming. Wolfgang Herbordt, Herbert Buchner, Walter Kellermann |
| 2001 | Concordancing for parallel spoken language corpora. Dafydd Gibbon, Thorsten Trippel, Serge Sharoff |
| 2001 | Confidence based lattice segmentation and minimum Bayes-risk decoding. Vaibhava Goel, Shankar Kumar, William Byrne |
| 2001 | Confidence measure (CM) estimation for large vocabulary speaker-independent continuous speech recognition system. Yaxin Zhang, Raymond Lee, Anton Madievski |
| 2001 | Considerations on what industry expects from universities. Yrjö Neuvo |
| 2001 | Constructing a segment database for greek time domain speech synthesis. Stavroula-Evita Fotinea, George Tambouratzis, George Carayannis |
| 2001 | Context-dependent probabilistic hierarchical sublexical modelling using finite state transducers. Xiaolong Mou, Stephanie Seneff, Victor Zue |
| 2001 | Corpus-based database of residual excitations used for speech reconstruction from MFCCs. Zbynek Tychtl, Josef Psutka |
| 2001 | Corpus-based synthesis of fundamental frequency contours based on a generation process model. Keikichi Hirose, Masaya Eto, Nobuaki Minematsu, Atsuhiro Sakurai |
| 2001 | Correction of the voice timbre distortions on telephone network. Gaël Mahé, André Gilloire |
| 2001 | Creating a european English broadcast news transcription corpus and system. Gerhard Backfried, Robert Hecht, Sabine Loots, Norbert Pfannerer, Jürgen Riedler, Christian Schiefer |
| 2001 | Credibility proof for speech content and speaker verification by fragile watermarking with consecutive frame-based processing. Yiou-Wen Cheng, Lin-Shan Lee |
| 2001 | Crosslingual speech recognition with multilingual acoustic models based on agglomerative and tree-based triphone clustering. Andrej Zgank, Bojan Imperl, Finn Tore Johansen, Zdravko Kacic, Bogomir Horvat |
| 2001 | Cues for perceived pitch register. Toni C. M. Rietveld, Patricia Vermillion |
| 2001 | DARPA communicator dialog travel planning systems: the june 2000 data collection. Marilyn A. Walker, John S. Aberdeen, Julie E. Boland, Elizabeth Owen Bratt, John S. Garofolo, Lynette Hirschman, Audrey N. Le, Sungbok Lee, Shrikanth S. Narayanan, Kishore Papineni, Bryan L. Pellom, Joseph Polifroni, Alexandros Potamianos, P. Prabhu, Alexander I. Rudnicky, Gregory A. Sanders, Stephanie Seneff, David Stallard, Steve Whittaker |
| 2001 | DIARCA: a component approach to voice recognition. Juan Carlos Díaz Martín, Juan-Luis García Zapata, José Manuel Rodríguez García, José F. Álvarez Salgado, Pablo Espada Bueno, Pedro Gómez-Vilda |
| 2001 | Data-driven semantic inference for unconstrained desktop command and control. Jerome R. Bellegarda, Kim E. A. Silverman |
| 2001 | Defining constraints for multilinear speech processing. Julie Carson-Berndsen, Michael Walsh |
| 2001 | Deriving document structure from prosodic cues. Martin Haase, Werner Kriechbaum, Gregor Möhler, Gerhard Stenzel |
| 2001 | Design of a semantic parser with support to ellipsis resolution in a Chinese spoken language dialogue system. Yi Su, Fang Zheng, Yinfei Huang |
| 2001 | Design of an optimal continuous speech database for text-to-speech synthesis considered as a set covering problem. Hélène François, Olivier Boëffard |
| 2001 | Design of speech corpus for text-to-speech synthesis. Jindrich Matousek, Josef Psutka, Jiri Kruta |
| 2001 | Designing very compact decision trees for grapheme-to-phoneme transcription. Anne K. Kienappel, Reinhard Kneser |
| 2001 | Detecting Japanese local speech rate deceleration in spontaneous conversational speech using a variable threshold. Keiichi Takamaru, Makoto Hiroshige, Kenji Araki, Koji Tochinai |
| 2001 | Detection of OOV words using generalized word models and a semantic class language model. Thomas Schaaf |
| 2001 | Detection of digital transmission systems for voice quality measurements. Thorsten Ludwig, Ulrich Heute |
| 2001 | Detection of recognition errors and out of the spelling dictionary names in a spelled name recognizer for Spanish. Rubén San Segundo, Javier Macías Guarasa, Javier Ferreiros, P. Martín, José Manuel Pardo |
| 2001 | Development of Russian lexical databases, corpora and supporting tools for speech products. Serge A. Yablonsky |
| 2001 | Development of an asynchronous multi-band system for continuous speech recognition. Yik-Cheung Tam, Brian Kan-Wing Mak |
| 2001 | Development of vowel quantity perception in late childhood. Dawn M. Behne, Peter E. Czigler, Kirk P. H. Sullivan |
| 2001 | Dialogue session: management using voiceXML. Augustine Tsai, Andrew N. Pargellis, Chin-Hui Lee, Joseph P. Olive |
| 2001 | Discriminant analysis of nasal vs. oral vowels in French: comparison between different parametric representations. Véronique Delvaux, Alain Soquet |
| 2001 | Discrimination between speech and music based on a low frequency modulation feature. Stefan Karnebäck |
| 2001 | Discriminative disfluency modeling for spontaneous speech recognition. Chung-Hsien Wu, Gwo-Lang Yan |
| 2001 | Discriminative speaker adaptation with conditional maximum likelihood linear regression. Asela Gunawardana, William Byrne |
| 2001 | Distinctive features for use in an automatic speech recognition system. Ellen Eide |
| 2001 | Distributed speech recognition using traditional and hybrid modeling techniques. Jan Stadermann, Ralf Meermeier, Gerhard Rigoll |
| 2001 | Do speakers realize the prosodic structure they say they do? Olga van Herwijnen, Jacques M. B. Terken |
| 2001 | Domain-independent spoken dialogue platform using key-phrase spotting based on combined language model. Kazunori Komatani, Katsuaki Tanaka, Hiroaki Kashima, Tatsuya Kawahara |
| 2001 | Dual channel speech enhancement using coherence function and MDL-based subspace approach in bark domain. Rolf Vetter, Philippe Renevey, Jens Krauss |
| 2001 | Dynamic lexicon using phonetic features. Kyung-Tak Lee, Christian Wellekens |
| 2001 | ELRA contribution to bridge the gap between industry and academia. Khalid Choukri |
| 2001 | EUROSPEECH 2001 Scandinavia, 7th European Conference on Speech Communication and Technology, 2nd INTERSPEECH Event, Aalborg, Denmark, September 3-7, 2001 Paul Dalsgaard, Børge Lindberg, Henrik Benner, Zheng-Hua Tan |
| 2001 | Effects of OOV rates on keyphrase rejection schemes. Gies Bouwman, Janienke Sturm, Lou Boves |
| 2001 | Effects of noise adaptation on the perception of voiced plosives in isolated syllables. William A. Ainsworth, T. Cervera |
| 2001 | Efficient decoding strategy for conversational speech recognition using state-space models for vocal-tract-resonance dynamics. Jeff Z. Ma, Li Deng |
| 2001 | Efficient implementation of ITU-t g.723.1 speech coder for multichannel voice transmission and storage. Sung-Kyo Jung, Young-Cheol Park, Sung-Wan Yoon, Kyung-Tae Kim, Dae Hee Youn |
| 2001 | Efficient periodicity extraction based on sine-wave representation and its application to pitch determination of speech signals. Dan Chazan, Meir Tzur, Ron Hoory, Gilad Cohen |
| 2001 | Efficient scalable speech compression for scalable speech recognition. Naveen Srinivasamurthy, Antonio Ortega, Shrikanth S. Narayanan |
| 2001 | Efficient speech enhancement by diffusive gain factors (DGF). Hyoung-Gook Kim, Klaus Obermayer, Mathias Bode, Dietmar Ruwisch |
| 2001 | Efficient stochastic finite-state networks for language modelling in spoken dialogue systems. Kallirroi Georgila, Nikos Fakotakis, George K. Kokkinakis |
| 2001 | Eigen-MLLR coefficients as new feature parameters for speaker identification. Nick J.-C. Wang, Wei-Ho Tsai, Lin-Shan Lee |
| 2001 | Ejective reduction in chaha is conditioned by more than prosodic position. Rachel Coulston |
| 2001 | Elderly acoustic model for large vocabulary continuous speech recognition. Akira Baba, Shinichi Yoshizawa, Miichi Yamada, Akinobu Lee, Kiyohiro Shikano |
| 2001 | Electromagnetic articulograph (EMA) based on a nonparametric representation of tthe magnetic field. Tokihiko Kaburagi, Masaaki Honda |
| 2001 | Emerging requirements for multi-modal annotation and analysis tools. Tony Bigbee, Dan Loehr, Lisa Harper |
| 2001 | Emotional speech synthesis: a review. Marc Schröder |
| 2001 | Enhancement of noisy speech by using improved global soft decision. Vladimir I. Shin, Doh-Suk Kim, Moo Young Kim, Jeongsu Kim |
| 2001 | Enhancement of speech using bark-scaled wavelet packet decomposition. Israel Cohen |
| 2001 | Enhancing GMM scores using SVM "hints". Shai Fine, Jirí Navrátil, Ramesh A. Gopinath |
| 2001 | Enhancing distributed speech recognition with back- end speech reconstruction. Tenkasi Ramabadran, Jeff Meunier, Mark A. Jasiuk, Bill Kushner |
| 2001 | Entropy based voice activity detection in very noisy conditions. Philippe Renevey, Andrzej Drygajlo |
| 2001 | Envelope information in speech processing: acoustic-phonetic analysis vs. auditory figure-ground segregation. Olivier Crouzet, William A. Ainsworth |
| 2001 | Equivalence between frequency domain blind source separation and frequency domain adaptive null beamformers. Shoko Araki, Shoji Makino, Ryo Mukai, Hiroshi Saruwatari |
| 2001 | Error correcting posterior combination for robust multi-band speech recognition. Astrid Hagen, Hervé Bourlard |
| 2001 | Estimating pronunciation variations from acoustic likelihood score for HMM reconstruction. Yi Liu, Pascale Fung |
| 2001 | Estimation of the modulation frequency and modulation depth of the fundamental frequency owing to vocal micro-tremor of the voice source signal. Jean Schoentgen |
| 2001 | European portuguese nasal vowels: an EMMA study. António J. S. Teixeira, Francisco A. C. Vaz |
| 2001 | Eutrans: a speech-to-speech translator prototype. Moisés Pastor-i-Gadea, Alberto Sanchís, Francisco Casacuberta, Enrique Vidal |
| 2001 | Evaluating the Aurora connected digit recognition task - a bell labs approach. Mohamed Afify, Hui Jiang, Filipp Korkmazskiy, Chin-Hui Lee, Qi Li, Olivier Siohan, Frank K. Soong, Arun C. Surendran |
| 2001 | Evaluation of PROS-3 for the assignment of prosodic structure, compared to assignment by human experts. Olga van Herwijnen, Jacques M. B. Terken |
| 2001 | Evaluation of a generalized dynamic cepstrum in distant speech recognition. Hiroshi Matsumoto, Akihiko Shimizu, Kazumasa Yamamoto |
| 2001 | Evaluation of an automatically obtained shape and appearance model for automatic audio visual speech recognition. Philippe Daubias, Paul Deléglise |
| 2001 | Evaluation of cross-language voice conversion based on GMM and straight. Mikiko Mashimo, Tomoki Toda, Kiyohiro Shikano, Nick Campbell |
| 2001 | Evaluation of front-end features and noise compensation methods for robust Mandarin speech recognition. Rathi Chengalvarayan |
| 2001 | Evaluation of recent speech grammar standardization efforts. Tom Brøndsted |
| 2001 | Evaluation of sublexical and lexical models of acoustic disfluencies for spontaneous speech recognition in Spanish. Luis Javier Rodríguez, Inés Torres, Amparo Varona |
| 2001 | Evaluation of the SPLICE algorithm on the Aurora2 database. Jasha Droppo, Li Deng, Alex Acero |
| 2001 | Evaluation on unsupervised speaker adaptation based on sufficient HMM statictics of selected speakers. Shinichi Yoshizawa, Akira Baba, Kanako Matsunami, Yuichiro Mera, Miichi Yamada, Akinobu Lee, Kiyohiro Shikano |
| 2001 | Everyday life sounds and speech analysis for a medical telemonitoring system. Eric Castelli, Dan Istrate |
| 2001 | Experimental evaluation on confidence of agreement among multiple Japanese LVCSR models. Yasuhiro Kodama, Takehito Utsuro, Hiromitsu Nishizaki, Seiichi Nakagawa |
| 2001 | Experiments on cross-language acoustic modeling. Tanja Schultz, Alex Waibel |
| 2001 | Experiments with the philips continuous ASR system on the AURORA noisy digits database. Markus Lieb, Alexander Fischer |
| 2001 | Explicit exploitation of stochastic characteristics of test utterance for text-independent speaker identification. Wei-Ho Tsai, Wen-Whei Chang, Chao-Shih Huang |
| 2001 | Exploring the null space of the acoustic-to- articulatory inversion using a hypercube codebook. Slim Ouni, Yves Laprie |
| 2001 | Extracting caller information from voicemail. Geoffrey Zweig, Jing Huang, Mukund Padmanabhan |
| 2001 | Extractive summarization of voicemail using lexical and prosodic feature subset selection. Konstantinos Koumpis, Steve Renals, Mahesan Niranjan |
| 2001 | F0 feature extraction by polynomial regression function for monosyllabic Thai tone recognition. Patavee Charnvivit, Somchai Jitapunkul, Visarut Ahkuputra, Ekkarit Maneenoi, Umavasee Thathong, Boonchai Thampanitchawong |
| 2001 | FST-based recognition techniques for multi-lingual and multi-domain spontaneous speech. Timothy J. Hazen, I. Lee Hetherington, Alex Park |
| 2001 | Factors affecting schwa-insertion in final consonant clusters in standard dutch. Marc Swerts, Hanne Kloots, Steven Gillis, Georges De Schutter |
| 2001 | Fast adaptation using constrained affine transformations with hierarchical priors. Tor André Myrvoll, Kuldip K. Paliwal, Torbjørn Svendsen |
| 2001 | Fast harmonic estimation using a low resolution pitch for low bit rate harmonic coding. Yong-Soo Choi, Dae Hee Youn |
| 2001 | Feature extraction and model-based noise compensation for noisy speech recognition evaluated on AURORA 2 task. Kaisheng Yao, Jingdong Chen, Kuldip K. Paliwal, Satoshi Nakamura |
| 2001 | Feature extraction by auditory modeling for unit selection in concatenative speech synthesis. Minoru Tsuzaki |
| 2001 | Feature extraction from time-frequency matrices for robust speech recognition. José C. Segura, M. Carmen Benítez, Ángel de la Torre, Antonio J. Rubio |
| 2001 | Feature vector selection to improve ASR robustness in noisy conditions. Johan de Veth, Laurent Mauuary, Bernhard Noé, Febe de Wet, Jürgen Sienel, Lou Boves, Denis Jouvet |
| 2001 | Festival speaks Italian! Piero Cosi, Fabio Tesser, Roberto Gretter, Cinzia Avesani, Mike Macon |
| 2001 | Finite state prosodic analysis of african corpus resources. Dafydd Gibbon |
| 2001 | First steps toward an adaptive spoken dialogue system in medical domain. Ivano Azzini, Daniele Falavigna, Roberto Gretter, Giordano Lanzola, Marco Orlandi |
| 2001 | Formant estimation using gammachirp filterbank. Kaïs Ouni, Zied Lachiri, Noureddine Ellouze |
| 2001 | Formant-broadened CMS using peak-picking in LOG spectrum. Yu-Jin Kim, Hea-Kyoung Jung, Jae-Ho Chung |
| 2001 | Forward masking for increased robustness in automatic speech recognition. Sascha Wendt, Gernot A. Fink, Franz Kummert |
| 2001 | From here to utility - melding phonetic insight with speech technology. Steven Greenberg |
| 2001 | From perceptual designs to linguistic typology and automatic language identification : overview and perspectives. Melissa Barkat, Ioana Vasilescu |
| 2001 | Fun or boring? a web-based evaluation of expressive synthesis for children. Kjell Gustafson, David House |
| 2001 | Gaussian subtraction (GS) algorithms for word spotting in continuous speech. Avi Faizakov, Arnon Cohen, Tzur Vaich |
| 2001 | Generalized source-filter structures for speech synthesis. Matti Karjalainen, Tuomas Paatero |
| 2001 | Generating F0 contours by statistical manipulation of natural F0 shapes. Takashi Saito, Masaharu Sakamoto |
| 2001 | Generating duration from a cognitively plausible model of rhythm production. Plínio A. Barbosa |
| 2001 | Good timing: place-dependent voice onset time in ejective stops. Ian Maddieson |
| 2001 | Graceful degradation of speech recognition performance over lossy packet networks. Eve A. Riskin, Constantinos Boulis, Scott Otterson, Mari Ostendorf |
| 2001 | Graphic platform for designing and developing practical voice interaction systems. Tomás Nouza, Jan Nouza |
| 2001 | HMM2- extraction of formant structures and their use for robust ASR. Katrin Weber, Samy Bengio, Hervé Bourlard |
| 2001 | Hansori 2001 - corpus-based implementation of the Korean hansori text-to-speech synthesizer. Attila Ferencz, Sung-Woo Choi, Ho-Eun Song, Myoung-Wan Koo |
| 2001 | Harmonic tunnelling: tracking non-stationary noises during speech. Douglas Ealey, Holly Kelleher, David Pearce |
| 2001 | Helium speech normalisation by codebook mapping. Adam Podhorski, Marek Czepulonis |
| 2001 | High quality voice conversion based on Gaussian mixture model with dynamic frequency warping. Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano |
| 2001 | How visual co-presence and joint attention shape speaking. Susan E. Brennan |
| 2001 | Human language identification with reduced segmental information: comparison between monolinguals and bilinguals. Masahiko Komatsu, Kazuya Mori, Takayuki Arai, Yuji Murahara |
| 2001 | Hybrid natural language generation for spoken dialogue systems. Michel Galley, Eric Fosler-Lussier, Alexandros Potamianos |
| 2001 | Hypothesis-driven accent discrimination. Laura Mayfield Tomokiyo |
| 2001 | ISCA SALTMIL SIG: speech and language technology for minority languages. Climent Nadeu, Donncha Cróinín, Bojan Petek, Kepa Sarasola, Briony Williams |
| 2001 | ISIS: a learning system with combined interaction and delegation dialogs. Helen M. Meng, Shuk Fong Chan, Yee Fong Wong, Cheong Chat Chan, Yiu Wing Wong, Tien Ying Fung, Wai Ching Tsui, Ke Chen, Lan Wang, Ting-Yao Wu, Xiaolong Li, Tan Lee, Wing Nin Choi, P. C. Ching, Huisheng Chi |
| 2001 | Identification of accent and intonation in sentences for CALL systems. Carlos Toshinori Ishi, Nobuaki Minematsu, Ryuji Nishide, Keikichi Hirose |
| 2001 | Implementation effective one-channel noise reduction system. Jiri Tihelka, Pavel Sovka |
| 2001 | Improved context-dependent acoustic modeling for continuous Chinese speech recognition. Jiyong Zhang, Fang Zheng, Jing Li, Chunhua Luo, Guoliang Zhang |
| 2001 | Improved data-driven generation of pronunciation dictionaries using an adapted word list. Matthias Wolff, Matthias Eichner, Rüdiger Hoffmann |
| 2001 | Improved entropic gain for speech signals analysis/synthesis based on an adaptive time-frequency segmentation scheme. Gilles Gonon, Silvio Montrésor, Marc Baudry |
| 2001 | Improved maximum mutual information estimation training of continuous density HMMs. Jing Zheng, John Butzberger, Horacio Franco, Andreas Stolcke |
| 2001 | Improved phoneme-history-dependent search for large-vocabulary continuous-speech recognition. Takaaki Hori, Yoshiaki Noda, Shoichi Matsunaga |
| 2001 | Improved speech recognition using iterative decoding based on confidence measures. Jun Ogata, Yasuo Ariki |
| 2001 | Improved spoken document retrieval by exploring extra acoustic and linguistic cues. Berlin Chen, Hsin-Min Wang, Lin-Shan Lee |
| 2001 | Improved word confidence estimation using long range features. David D. Palmer, Mari Ostendorf |
| 2001 | Improvement of a structured language model: arbori-context tree. Shinsuke Mori, Masafumi Nishimura, Nobuyasu Itoh |
| 2001 | Improvement of speaker verification for Thai language. Chai Wutiwiwatchai, Varin Achariyakulporn, Sawit Kasuriya |
| 2001 | Improvements in audio processing and language modeling in the CU communicator. Jianping Zhang, Wayne H. Ward, Bryan L. Pellom, Xiuyang Yu, Kadri Hacioglu |
| 2001 | Improvements in the speaker identification rate using feature-sets. Daniel J. Mashao, N. Tinyiko Baloyi |
| 2001 | Improving automatic speech recognition using tangent distance. Wolfgang Macherey, Daniel Keysers, Jörg Dahmen, Hermann Ney |
| 2001 | Improving genericity for task-independent speech recognition. Fabrice Lefèvre, Jean-Luc Gauvain, Lori Lamel |
| 2001 | Improving performance of a keyword spotting system by using a new confidence measure. Luciana Ferrer, Claudio Estienne |
| 2001 | Improving simultaneous speech recognition in real room environments using overdetermined blind source separation. Athanasios Koutras, Evangelos Dermatas, George K. Kokkinakis |
| 2001 | Improving speaker recognition using phonetically structured Gaussian mixture models. Robert Faltlhauser, Günther Ruske |
| 2001 | Information extraction via heuristics for a movie showtime query system. Martin Jansche |
| 2001 | Information fusion for robust speaker verification. Conrad Sanderson, Kuldip K. Paliwal |
| 2001 | Instantaneous estimation of accentuation habits for Japanese students to learn English pronunciation. Naoki Nakamura, Nobuaki Minematsu, Seiichi Nakagawa |
| 2001 | Instrumental derivation of equipment impairment factors for describing telephone speech codec degradations. Sebastian Möller, Jens Berger |
| 2001 | Integrating contextual phonological rules in a large vocabulary decoder. Guillaume Gravier, François Yvon, Bruno Jacob, Frédéric Bimbot |
| 2001 | Integrating multiple knowledge sources for improved speech understanding. Sherif M. Abdou, Michael S. Scordilis |
| 2001 | Integrating speech technology in language learning: an overview of the activities of inSTIL. Philippe Delcloque |
| 2001 | Intonation modelling with a lexicon of natural F0 contours. Per Olav Heggtveit, Jon Emil Natvig |
| 2001 | Intonational phrase break prediction using decision tree and n-gram model. Xuejing Sun, Ted H. Applebaum |
| 2001 | Introducing phonetically motivated information into ASR. Heidi Christensen, Børge Lindberg, Ove Andersen |
| 2001 | Invariance of relative F0 change field of Chinese disyllabic words. Dawei Xu, Hiroki Mori, Hideki Kasuya |
| 2001 | Inverse filtering of tube models with frequency dependent tube terminations. Karl Schnell, Arild Lacroix |
| 2001 | Investigations into tandem acoustic modeling for the Aurora task. Daniel P. W. Ellis, Manuel J. Reyes Gomez |
| 2001 | Investigations on conversational speech recognition. Peter Beyerlein, Xavier L. Aubert, Matthew Harris, Carsten Meyer, Hauke Schramm |
| 2001 | Is non-native pronunciation modelling necessary ? Silke Goronzy, Marina Sahakyan, Wolfgang Wokurek |
| 2001 | Is speech data clustered? - statistical analysis of cepstral features. Tomi Kinnunen, Ismo Kärkkäinen, Pasi Fränti |
| 2001 | Is this conversation on track? Paul Carpenter, Chun Jin, Daniel Wilson, Rong Zhang, Dan Bohus, Alexander I. Rudnicky |
| 2001 | Iterative implementation of dialogue system modules. Lars Degerstedt, Arne Jönsson |
| 2001 | Japanese can be aware of syllables and morae: evidence from Japanese-English bilingual children. Takashi Otake, Yuka Yamaguchi |
| 2001 | Javaspeakerrecognition - interactive workbench for visualizing speaker recognition concepts on the WWW. Andrzej Drygajlo, Gary Garcia Molina |
| 2001 | Joint channel decoding - Viterbi recognition for wireless applications. Alexis Bernard, Abeer Alwan |
| 2001 | Joint source-channel coding for low bit-rate coding of LSP parameters. José L. Pérez-Córdoba, Antonio J. Rubio, Antonio M. Peinado, Ángel de la Torre |
| 2001 | Joint speech and audio coding combining sinusoidal modeling and wavelet packets. Márk Fék, Annamária R. Várkonyi-Kóczy, Jean-Marc Boucher |
| 2001 | Julius - an open source real-time large vocabulary recognition engine. Akinobu Lee, Tatsuya Kawahara, Kiyohiro Shikano |
| 2001 | Knowledge of language origin improves pronunciation accuracy of proper names. Ariadna Font Llitjós, Alan W. Black |
| 2001 | Language models conditioned on dialog state. Karthik Visweswariah, Harry Printz |
| 2001 | Language-specific effects of pitch range on the perception of universal intonational meaning. Aoju Chen, Toni C. M. Rietveld, Carlos Gussenhoven |
| 2001 | Large broadcast news and read speech corpora of spoken czech. Josef Psutka, Vlasta Radová, Ludek Müller, Jindrich Matousek, Pavel Ircing, David Graff |
| 2001 | Large vocabulary statistical language modeling for continuous speech recognition in finnish. Vesa Siivola, Mikko Kurimo, Krista Lagus |
| 2001 | Large-vocabulary audio-visual speech recognition by machines and humans. Gerasimos Potamianos, Chalapathy Neti, Giridharan Iyengar, Eric Helmuth |
| 2001 | Learning of user formulations for business listings in automatic directory assistance. Cosmin Popovici, Marco Andorno, Pietro Laface, Luciano Fissore, Mario Nigra, Claudio Vair |
| 2001 | Learning prosodic features using a tree representation. Julia Hirschberg, Owen Rambow |
| 2001 | Learning units for domain-independent out-of- vocabulary word modelling. Issam Bazzi, James R. Glass |
| 2001 | Lessons from the development of a conversational interface. Marianne Hickey, Paul St John Brittan |
| 2001 | Lexical stress modeling for improved speech recognition of spontaneous telephone speech in the jupiter domain. Chao Wang, Stephanie Seneff |
| 2001 | Lexicon optimization for dutch speech recognition in spoken document retrieval. Roeland Ordelman, Arjan van Hessen, Franciska de Jong |
| 2001 | Liaison and schwa deletion in French: an effect of lexical frequency and competition? Cécile Fougeron, Jean-Philippe Goldman, Ulrich H. Frauenfelder |
| 2001 | Limited enquiry negotiation dialogues. Ian Lewin |
| 2001 | Linear interpolation of cepstral variance for noisy speech recognition. Tai-Hwei Hwang, Kuo-Hwei Yuo, Hsiao-Chuan Wang |
| 2001 | Linguistic factors affecting timing in Korean with application to speech synthesis. Hyunsong Chung, Mark A. Huckvale |
| 2001 | Lip-reading from parametric lip contours for audio- visual speech recognition. Sabri Gurbuz, Eric K. Patterson, Zekeriya Tufekci, John N. Gowdy |
| 2001 | Local refinement of phonetic boundaries: a general framework and its application using different transition models. Doroteo Torre Toledano, Luis A. Hernández Gómez |
| 2001 | Low rate speech coding incorporating simultaneously masked spectrally weighted linear prediction. Jason Lukasiak, Ian S. Burnett, Christian H. Ritz |
| 2001 | Low-resource hidden Markov model speech recognition. Sabine Deligne, Ellen Eide, Ramesh A. Gopinath, Dimitri Kanevsky, Benoît Maison, Peder A. Olsen, Harry Printz, Jan Sedivý |
| 2001 | Lower WERs do not guarantee better transcriptions. Judith M. Kessens, Helmer Strik |
| 2001 | MAP combination of multi-stream HMM or HMM/ANN experts. Andrew C. Morris, Astrid Hagen, Hervé Bourlard |
| 2001 | MINOS-II: a prototype car navigation system with mixed initiative turn taking dialogue. Munehiko Sasajima, Takehide Yano, Taishi Shimomori, Tatsuya Uehara |
| 2001 | MMSE-based channel error mitigation for distributed speech recognition. Antonio M. Peinado, Victoria E. Sánchez, José C. Segura, José L. Pérez-Córdoba |
| 2001 | Making the tongue model talk: merging MRI & EMA measurements. Olov Engwall |
| 2001 | Map estimation for on-line noise compensation of time trajectories of spectral coefficients. Ilyas Potamitis, Nikos Fakotakis, George K. Kokkinakis |
| 2001 | Mathematical modeling of spoken human - machine dialogues including erroneous confirmations. D. Louloudis, Anastasios Tsopanoglou, Nikos Fakotakis, George K. Kokkinakis |
| 2001 | Maximum likelihood adaptation for distant speech recognition of stationary and moving speakers in reverberant environments. George Nokas, Evangelos Dermatas, George K. Kokkinakis |
| 2001 | Maximum likelihood non-linear transformation for environment adaptation in speech recognition. Mukund Padmanabhan, Satya Dharanipragada |
| 2001 | Maximum-likelihood affine cepstral filtering (MLACF) technique for speaker normalization. Yoon Kim |
| 2001 | Maximum-likelihood training of a bipartite acoustic model for speech recognition. Florent Perronnin, Roland Kuhn, Patrick Nguyen, Jean-Claude Junqua |
| 2001 | Measuring pitch range. Hanny den Ouden, Jacques M. B. Terken |
| 2001 | Measuring rhythmic deviation in second language speech. Felix Schaeffler |
| 2001 | Measuring speech rhythm. Dafydd Gibbon, Ulrike Gut |
| 2001 | Mechanical versus perceptual constraints as determinants of articulatory strategy. Ahmed M. Elgendy, Louis C. W. Pols |
| 2001 | Methodology for dialogue design in telephone-based spoken dialogue systems: a Spanish train information system. Rubén San Segundo, Juan Manuel Montero, José Colás, Juana M. Gutiérrez, J. M. Ramos, José Manuel Pardo |
| 2001 | Metrics for measuring domain independence of semantic classes. Andrew N. Pargellis, Eric Fosler-Lussier, Alexandros Potamianos, Chin-Hui Lee |
| 2001 | Minimax classification with parametric neighborhoods for noisy speech recognition. Mohamed Afify, Olivier Siohan, Chin-Hui Lee |
| 2001 | Minimum classification error training for speaker identification using Gaussian mixture models based on multi-space probability distribution. Chiyomi Miyajima, Keiichi Tokuda, Tadashi Kitamura |
| 2001 | Mixed excitation for HMM-based speech synthesis. Takayoshi Yoshimura, Keiichi Tokuda, Takashi Masuko, Takao Kobayashi, Tadashi Kitamura |
| 2001 | Mobile future. Yrjö Neuvo |
| 2001 | Model agglomeration for context-dependent acoustic modeling. Fabio Brugnara |
| 2001 | Model based stress decision method. Wooil Kim, Taeyun Kim, Sungjoo Ahn, Hanseok Ko |
| 2001 | Model complexity optimization for nonnative English speakers. Xiaodong He, Yunxin Zhao |
| 2001 | Model-based blind estimation of reverberation time: application to robust ASR in reverberant environments. Laurent Couvreur, Christophe Ris, Christophe Couvreur |
| 2001 | Model-based compensation of the additive noise for continuous speech recognition. experiments using the Aurora II database and tasks. José C. Segura, Ángel de la Torre, M. Carmen Benítez, Antonio M. Peinado |
| 2001 | Modeling auxiliary information in Bayesian network based ASR. Todd A. Stephenson, Mathew Magimai-Doss, Hervé Bourlard |
| 2001 | Modeling of conversational strategy for the robot participating in the group conversation. Yosuke Matsusaka, Shinya Fujie, Tetsunori Kobayashi |
| 2001 | Modeling pronunciation variation using context-dependent weighting and b/s refined acoustic modeling. Fang Zheng, Zhanjiang Song, Pascale Fung, William Byrne |
| 2001 | Modeling the mixtures of known noise and unknown unexpected noise for robust speech recognition. Ji Ming, Peter Jancovic, Philip Hanna, Darryl Stewart |
| 2001 | Modelling care of articulation with HMMs is dangerous. Matthew P. Aylett |
| 2001 | Modelling fundamental frequency in first post-tonic syllables in danish sentences. Niels Reinholt Petersen |
| 2001 | Modelling the perceptual identification of Japanese consonants from LPC cepstral distances. Masahiko Komatsu, Shinichi Tokuma, Won Tokuma, Takayuki Arai |
| 2001 | Mokusei: a telephone-based Japanese conversational system in the weather domain. Mikio Nakano, Yasuhiro Minami, Stephanie Seneff, Timothy J. Hazen, D. Scott Cyphers, James R. Glass, Joseph Polifroni, Victor Zue |
| 2001 | Morphological approaches for an English pronunciation lexicon. Susan Fitt |
| 2001 | Multi-class composite n-gram language model using multiple word clusters and word successions. Shuntaro Isogai, Katsuhiko Shirai, Hirofumi Yamamoto, Yoshinori Sagisaka |
| 2001 | Multi-keyword spotting of telephone speech using orthogonal transform-based SBR and RNN prosodic model. Wern-Jun Wang, Chun-Jen Lee, Eng-Fong Huang, Sin-Horng Chen |
| 2001 | Multi-parser architecture for query processing. Kui Xu, Fuliang Weng, Helen M. Meng, Po-Chui Luk |
| 2001 | Multi-scale retrieval in MEI: an English-Chinese translingual speech retrieval system. Wai Kit Lo, Patrick Schone, Helen M. Meng |
| 2001 | Multi-stream statistical n-gram modeling with application to automatic language identification. Katrin Kirchhoff, Sonia Parandekar |
| 2001 | Multilingual TTS for computer telephony: the aculab approach. Alex I. C. Monaghan, Mahmoud Kassaei, Mark Luckin, Mariscela Amador-Hernandez, Andrew Lowry, Daniel Faulkner, Fred Sannier |
| 2001 | Multilingual text-to-phoneme mapping. Søren Kamaric Riis, Morten With Pedersen, Kåre Jean Jensen |
| 2001 | Multimedia data collection of in-car speech communication. Nobuo Kawaguchi, Shigeki Matsubara, Kazuya Takeda, Fumitada Itakura |
| 2001 | Multipass algorithm for acquisition of salient acoustic morphemes. Michael Levit, Allen L. Gorin, Jeremy H. Wright |
| 2001 | Multiple source separation in the frequency domain using negative beamforming. Pedro Gómez-Vilda, Agustín Álvarez-Marquina, Victor Nieto Lluis, María Victoria Rodellar Biarge, Rafael Martínez-Olalla |
| 2001 | Must diphone synthesis be so unnatural? William J. Barry, Claus Nielsen, Ove Andersen |
| 2001 | N-best list generation using word and phoneme recognition fusion. Ernest Pusateri, Jean-Manuel Van Thong |
| 2001 | N-best speech hypotheses reordering using linear regression. Ananlada Chotimongkol, Alexander I. Rudnicky |
| 2001 | Narrowband perceptual audio coding: enhancements for speech. Hossein Najaf-Zadeh, Peter Kabal |
| 2001 | Native vs non-native production of English vowels in spontaneous speech: an acoustic phonetic study. Kimiko Tsukada |
| 2001 | Natural language understanding using statistical machine translation. Klaus Macherey, Franz Josef Och, Hermann Ney |
| 2001 | Neural processes underlying perceptual learning of a difficult second language phonetic contrast. Daniel E. Callan, Keiichi Tajima, Akiko E. Callan, Reiko Akahane-Yamada, Shinobu Masaki |
| 2001 | New language models using phrase structures extracted from parse trees. Takatoshi Jitsuhiro, Hirofumi Yamamoto, Setsuo Yamada, Yoshinori Sagisaka |
| 2001 | Noise estimation without explicit speech, non-speech detection: a comparison of mean, modal and median based approaches. Nicholas W. D. Evans, John S. D. Mason |
| 2001 | Noise reduction for noise robust feature extraction for distributed speech recognition. Bernhard Noé, Jürgen Sienel, Denis Jouvet, Laurent Mauuary, Johan de Veth, Lou Boves, Febe de Wet |
| 2001 | Noise reduction using paired-microphones for both far-field and near-field sound sources. Mitsunori Mizumachi, Satoshi Nakamura |
| 2001 | Noise robust feature extraction for ASR using the Aurora 2 database. Qifeng Zhu, Markus Iseli, Xiaodong Cui, Abeer Alwan |
| 2001 | Non-finality and pre-finality in bari Italian intonation: a preliminary account. Michelina Savino |
| 2001 | Non-linear predictive vector quantization of speech. Marcos Faúndez-Zanuy |
| 2001 | OASIS natural language call steering trial. Peter J. Durston, Mark Farrell, David Attwater, James Allen, Hong-Kwang Jeff Kuo, Mohamed Afify, Eric Fosler-Lussier, Chin-Hui Lee |
| 2001 | Objective evaluation of methods for quantization of variable-dimension spectral vectors in WI speech coding. Jani Nurminen, Ari Heikkinen, Jukka Saarinen |
| 2001 | Observations on overlap: findings and implications for automatic processing of multi-party conversation. Elizabeth Shriberg, Andreas Stolcke, Don Baron |
| 2001 | Off-talk - a problem for human-machine-interaction? Daniela Oppermann, Florian Schiel, Silke Steininger, Nicole Beringer |
| 2001 | On combining confidence measures for improved rejection of incorrect data. Delphine Charlet, Guy Mercier, Denis Jouvet |
| 2001 | On differential limen of word-based local speechrate variation in Japanese expressed by duration ratio. Makoto Hiroshige, Kenji Araki, Koji Tochinai |
| 2001 | On integrating the lexicon with the language model. Diamantino Caseiro, Isabel Trancoso |
| 2001 | On large vocabulary continuous speech recognition of highly inflectional language - czech. Pavel Ircing, Pavel Krbec, Jan Hajic, Josef Psutka, Sanjeev Khudanpur, Frederick Jelinek, William Byrne |
| 2001 | On the choice of classes in MCE based discriminative HMM-training for speech recognizers used in the telephone environment. Josef G. Bauer |
| 2001 | On the perception of voicing for plosives in noise. Marcia Chen, Abeer Alwan |
| 2001 | On the pronunciation of acronyms in French and in Italian. Philippe Boula de Mareüil, Franck Floricic |
| 2001 | On the prosody of German telephone numbers. Stefan Baumann, Jürgen Trouvain |
| 2001 | On the use of the Bayesian information criterion in multiple speaker detection. P. Sivakumaran, J. Fortuna, Aladdin M. Ariyaeeinia |
| 2001 | One-delayed-mass model for efficient synthesis of glottal flow. Federico Avanzini, Paavo Alku, Matti Karjalainen |
| 2001 | Pause information for dependency analysis of read Japanese sentences. Kazuyuki Takagi, Kazuhiko Ozeki |
| 2001 | Perceived prominence in terms of a linguistically motivated quantitative intonation model. Hansjörg Mixdorff, Christina Widera |
| 2001 | Perception of coda voicing from properties of the onset and nucleus of 'led' and 'let'. Sarah Hawkins, Noël Nguyen |
| 2001 | Perceptual categorization of maximal vowel spaces from birth to adulthood simulated by an articulatory model. Lucie Ménard, Louis-Jean Boë |
| 2001 | Perceptual cost functions for unit searching in large corpus-based text-to-speech. Minkyu Lee |
| 2001 | Perceptual experiments on enhanced and slowed down speech sentences for second language acquisition. Vincent Colotte, Yves Laprie, Anne Bonneau |
| 2001 | Perceptual identification and normalization of synthesized French vowels from birth to adulthood. Lucie Ménard, Jean-Luc Schwartz, Louis-Jean Boë, Sonia Kandel, Nathalie Vallée |
| 2001 | Phoneme-based topic spotting on the switchboard corpus. M. W. Theunissen, Konrad Scheffler, Johan A. du Preez |
| 2001 | Phonetic effects on listener detection of vowel concatenation. Ann K. Syrdal |
| 2001 | Phonetic events from the labeling the european portuguese database for speech synthesis, FEUP/IPBDB. João Paulo Ramos Teixeira, Diamantino Freitas, Daniela Braga, Maria João Barros, Vagner Latsch |
| 2001 | Phonetic speaker recognition. Walter D. Andrews, Mary A. Kohler, Joseph P. Campbell |
| 2001 | Phonetic transcriptions in the spoken dutch corpus: how to combine efficiency and good transcription quality. Catia Cucchiarini, Diana Binnenpoorte, Simo M. A. Goddijn |
| 2001 | Pitch-dependent GMMs for text-independent speaker recognition systems. Mijail Arcienega, Andrzej Drygajlo |
| 2001 | Planar superdirective microphone arrays for speech acquisition in the car. Rainer Martin, Alexey Petrovsky, Thomas Lotter |
| 2001 | Plosive spotting with margin classifiers. Joseph Keshet, Dan Chazan, Ben-Zion Bobrovsky |
| 2001 | Politeness and frustration language in child-machine interactions. Sudha Arunachalam, Dylan Gould, Elaine Andersen, Dani Byrd, Shrikanth S. Narayanan |
| 2001 | Pragmatic temporal voice range profile as a tool in the research of speech styles. Antti Iivonen |
| 2001 | Pre-liquid excrescent schwa: what happens when vocalic targets conflict. Bryan Gick, Ian Wilson |
| 2001 | Predicting visual consonant perception from physical measures. Jintao Jiang, Abeer Alwan, Edward T. Auer, Lynne E. Bernstein |
| 2001 | Prediction of intonation patterns of accented words in a corpus of read Swedish news through pitch contour stylization. Johan Frid |
| 2001 | Prediction of low recognition rate words for isolated word recognition system. Ryuta Terashima, Hiroyuki Hoshino, Toshihiro Wakita |
| 2001 | Preliminary experiments on language identification using broadcast news recordings. Laurent Benarousse, Edouard Geoffrois |
| 2001 | Probabilistic concept verification for language understanding in spoken dialogue systems. Yi-Chung Lin, Huei-Ming Wang |
| 2001 | Prominence correlates. a study of Swedish. Gunnar Fant, Anita Kruckenberg, Johan Liljencrants, Antonis Botinis |
| 2001 | Pronunciation modeling and lexical adaptation in midsize vocabulary ASR. Louis ten Bosch, Nick Cremelie |
| 2001 | Pronunciation modeling in hungarian number recognition. Tibor Fegyó, Péter Mihajlik, Péter Tatai, Géza Gordos |
| 2001 | Pronunciation variant analysis using speaking style parallel corpus. Hideharu Nakajima, Izumi Hirano, Yoshinori Sagisaka, Katsuhiko Shirai |
| 2001 | Pronunciation variation analysis with respect to various linguistic levels and contextual conditions for Mandarin Chinese. Ming-Yi Tsai, Fu-Chiang Chou, Lin-Shan Lee |
| 2001 | Prosodic interactions on segmental durations ingreek. Antonis Botinis, Marios Fourakis, Robert Bannert |
| 2001 | Prosodic models, automatic speech understanding, and speech synthesis: towards the common ground. Anton Batliner, Bernd Möbius, Gregor Möhler, Antje Schweitzer, Elmar Nöth |
| 2001 | Prosody control for speaking and singing styles. Chilin Shih, Greg Kochanski |
| 2001 | Prosody in finger braille and teletext receiver for finger braille. Yasuo Horiuchi, Akira Ichikawa |
| 2001 | Prototype of a vocal-tract model for vowel production designed for education in speech science. Takayuki Arai, Nobuyuki Usuki, Yuji Murahara |
| 2001 | Pruning of redundant synthesis instances based on weighted vector quantization. Sanghun Kim, Youngjik Lee, Keikichi Hirose |
| 2001 | Pseudo-articulatory representations and the recognition of syllable patterns in speech. William H. Edmondson, Li Zhang |
| 2001 | Quantile based histogram equalization for noise robust speech recognition. Florian Hilger, Hermann Ney |
| 2001 | Quantitative analysis of the effects of emphasis upon prosodic features of speech. Sumio Ohno, Hiroya Fujisaki |
| 2001 | Quantization-based language model compression. Edward W. D. Whittaker, Bhiksha Raj |
| 2001 | Rapid CODEC adaptation for cellular phone speech recognition. Masaki Naito, Shingo Kuroiwa, Tsuneo Kato, Tohru Shimizu, Norio Higuchi |
| 2001 | Rapid speaker adaptation using MLLR and subspace regression classes. Kwok-Man Wong, Brian Kan-Wing Mak |
| 2001 | Rapid vocal tract length normalization using maximum likelihood estimation. Tadashi Emori, Koichi Shinoda |
| 2001 | Real-time multilingual communication by means of prestored conversational units. Norman Alm, Mamoru Iwabuchi, Peter N. Andreasen, Kenryu Nakamura, Iain R. Murray |
| 2001 | Real-time multiple speaker tracking by multi-modal integration for mobile robots. Kazuhiro Nakadai, Ken-ichi Hidai, Hiroshi G. Okuno, Hiroaki Kitano |
| 2001 | Real-time sound source localization and separation system and its application to automatic speech recognition. Futoshi Asano, Masataka Goto, Katunobu Itou, Hideki Asoh |
| 2001 | Recent advances in speech recognition system for IBM DARPA communicator. Yuqing Gao, Hakan Erdogan, Yongxin Li, Vaibhava Goel, Michael Picheny |
| 2001 | Recognition of (almost) spoken words: evidence from word play in Japanese. Takashi Otake, Anne Cutler |
| 2001 | Recognition of slovenian speech: within and cross-language experiments on monophones using the speechdat(II). Andrej Iskra, Bojan Petek, Tom Brøndsted |
| 2001 | Recognition of spelled city names in automotive environments. Andreas Korthauer |
| 2001 | Recognition performance of the siemens front-end with and without frame dropping on the Aurora 2 database. Bernt Andrassy, Damjan Vlaj, Christophe Beaugeant |
| 2001 | Reconstructing dialogue history. Marc Swerts, Emiel Krahmer |
| 2001 | Reducing spectral mismatches in concatenative speech synthesis via systematic database enrichment. Maria Founda, George Tambouratzis, Aimilios Chalamandaris, George Carayannis |
| 2001 | Reduction of alternative pronunciations in the norwegian computational lexicon norkompleks. Torbjørn Nordgård, Arne Kjell Foldvik |
| 2001 | Relating frame accuracy with word error in hybrid ANN-HMM ASR. Michael L. Shire |
| 2001 | Relating phonepass scores overall scores to the council of europe framework level descriptors. John H. A. L. de Jong, Jared Bernstein |
| 2001 | Relations between vocal registers in voice breaks. Gerrit Bloothooft, Mieke van Wijck, Peter Pabon |
| 2001 | Representation of large lexica using finite-state transducers for the multilingual text-to-speech synthesis systems. Matej Rojc, Zdravko Kacic |
| 2001 | Resource-limited sentence boundary detection. David Carter, Ian Gransden |
| 2001 | Robust ASR based on clean speech models: an evaluation of missing data techniques for connected digit recognition in noise. Jon Barker, Martin Cooke, Phil D. Green |
| 2001 | Robust ASR front-end using spectral-based and discriminant features: experiments on the Aurora tasks. M. Carmen Benítez, Lukás Burget, Barry Y. Chen, Stéphane Dupont, Harinath Garudadri, Hynek Hermansky, Pratibha Jain, Sachin S. Kajarekar, Nelson Morgan, Sunil Sivadas |
| 2001 | Robust LP analysis using glottal source HMM with application to high-pitched and noise corrupted speech. Akira Sasou, Kazuyo Tanaka |
| 2001 | Robust automatic speech recognition in low-SNR car environments by the application of a connectionist subspace-based approach to the melbased cepstral coefficients. Sid-Ahmed Selouani, Hesham Tolba, Douglas D. O'Shaughnessy |
| 2001 | Robust digit recognition in noise: an evaluation using the AURORA corpus. Umit H. Yapanel, John H. L. Hansen, Ruhi Sarikaya, Bryan L. Pellom |
| 2001 | Robust digit recognition in noisy environments: the IBM Aurora 2 system. George Saon, Juan M. Huerta, Ea-Ee Jan |
| 2001 | Robust language understanding in mipad. Ye-Yi Wang |
| 2001 | Robust parameters for speech recognition based on subband spectral centroid histograms. Bojana Gajic, Kuldip K. Paliwal |
| 2001 | Robust parsing in spoken dialogue systems. Pengju Yan, Fang Zheng, Mingxing Xu |
| 2001 | Robust speech recognition against packet loss. Man-Hung Siu, Yu-Chung Chan |
| 2001 | Robust speech recognition based on selective use of missing frequency band HMMs. Takayoshi Kawamura, Kazuya Takeda, Fumitada Itakura |
| 2001 | Robust speech recognition in noise: an evaluation using the SPINE corpus. John H. L. Hansen, Ruhi Sarikaya, Umit H. Yapanel, Bryan L. Pellom |
| 2001 | Robust speech recognition techniques applied to a speech in noise task. Richard C. Rose, Hong Kook Kim, Donald Hindle |
| 2001 | Robust speech recognition using missing feature theory and vector quantization. Philippe Renevey, Rolf Vetter, Jens Krauss |
| 2001 | Robust speech/non-speech detection using LDA applied to MFCC for continuous speech recognition. Arnaud Martin, Géraldine Damnati, Laurent Mauuary |
| 2001 | SCANMail: browsing and searching speech data by content. Julia Hirschberg, Michiel Bacchiani, Donald Hindle, Philip L. Isenhour, Aaron E. Rosenberg, Litza A. Stark, Larry Stead, Steve Whittaker, Gary Zamchick |
| 2001 | SIGdial - special interest group on discourse and dialogue. Laila Dybkjær |
| 2001 | SPeaker and language characterization (spLC): a special interest group (SIG) of ISCA. Jean-François Bonastre, Ivan Magrin-Chagnolleau, Stephan Euler, François Pellegrino, Régine André-Obrecht, John S. D. Mason, Frédéric Bimbot |
| 2001 | Scaled likelihood linear regression for hidden Markov model adaptation. Frank Wallhoff, Daniel Willett, Gerhard Rigoll |
| 2001 | Schwa-assimilation in danish synthetic speech. Christian Jensen |
| 2001 | Second order statistics spectrum estimation method for robust speech recognition. Bojan Jarc, Rudolf Babic |
| 2001 | Segment-based recognition on the phonebook task: initial results and observations on duration modeling. Karen Livescu, James R. Glass |
| 2001 | Segmental eigenvoice for rapid speaker adaptation. Yu Tsao, Shang-Ming Lee, Fu-Chiang Chou, Lin-Shan Lee |
| 2001 | Selective MCE training strategy in Mandarin speech recognition. Jian-Lai Zhou, Eric Chang, Chao Huang |
| 2001 | Semantic abnormality and its realization in spoken language. Shimei Pan, Kathleen R. McKeown, Julia Hirschberg |
| 2001 | Semi-automatic grammar induction for bi-directional English-Chinese machine translation. Kai-Chung Siu, Helen M. Meng |
| 2001 | Separating speaker and environment variabilities for improved recognition in non-stationary conditions. Luca Rigazio, Patrick Nguyen, David Kryze, Jean-Claude Junqua |
| 2001 | Separating three simultaneous speeches with two microphones by integrating auditory and visual processing. Hiroshi G. Okuno, Kazuhiro Nakadai, Tino Lourens, Hiroaki Kitano |
| 2001 | Separation and dereverberation performance of frequency domain blind source separation for speech in a reverberant environment. Ryo Mukai, Shoko Araki, Shoji Makino |
| 2001 | Sequential decisions for faster and more flexible verification. Arun C. Surendran |
| 2001 | Sequential noise compensation by a sequential kullback proximal algorithm. Kaisheng Yao, Kuldip K. Paliwal, Satoshi Nakamura |
| 2001 | Smartkom: multimodal communication with a life- like character. Wolfgang Wahlster, Norbert Reithinger, Anselm Blocher |
| 2001 | Smooth contour estimation in data-driven pitch modelling. Kim E. A. Silverman, Jerome R. Bellegarda, Kevin A. Lenzo |
| 2001 | Smoothing issues in the structured language model. Woosung Kim, Sanjeev Khudanpur, Jun Wu |
| 2001 | Social effects on vocal rate with echoic mimicry using prosody-only voice. Noriko Suzuki, Kazuhiko Kakehi, Yugo Takeuchi, Michio Okada |
| 2001 | Some practical considerations in the deployment of a wireless-communication interactive voice response system. Carmen García-Mateo, Laura Docío Fernández, Antonio Cardenal López |
| 2001 | Speaker adaptation in an ASR system based on nonlinear dynamical systems. Narada D. Warakagoda, Magne Hallstein Johnsen |
| 2001 | Speaker adaptation of output probabilities and state duration distributions for speech recognition. Néstor Becerra Yoma, Jorge F. Silva |
| 2001 | Speaker adaptation of quantized parameter HMMs. Marcel Vasilache, Olli Viikki |
| 2001 | Speaker identification for car infotainment applications. Javier Rodríguez Saeta, Christian Koechling, Javier Hernando |
| 2001 | Speaker normalization based on test to reference speaker mapping. Marcel Ogner, Zdravko Kacic |
| 2001 | Speaker recognition based on feature space trace. Yadong Wu, Zhizhu Li |
| 2001 | Speaker recognition based on idiolectal differences between speakers. George R. Doddington |
| 2001 | Speaker recognition by separating phonetic space and speaker space. Masafumi Nishida, Yasuo Ariki |
| 2001 | Speaker recognition in a multi-speaker environment. Alvin F. Martin, Mark A. Przybocki |
| 2001 | Speaker verification using target and background dependent linear transforms and multi-system fusion. Jirí Navrátil, Upendra V. Chaudhari, Ganesh N. Ramaswamy |
| 2001 | Speaking rate dependent acoustic modeling for spontaneous lecture speech recognition. Hiroaki Nanjo, Kazuomi Kato, Tatsuya Kawahara |
| 2001 | Speaking while driving - preliminary results on spellings in the German speechdat-car database. Christoph Draxler, Klaus Bengler, Cristina Olaverri-Monreal |
| 2001 | Spectral correlates of voice open quotient and glottal flow asymmetry : theory, limits and experimental data. Nathalie Henrich, Christophe d'Alessandro, Boris Doval |
| 2001 | Spectral tilt as a perturbation-free measurement of noise levels in voice signals. Peter J. Murphy |
| 2001 | Speech emotion recognition using hidden Markov models. Albino Nogueiras, Asunción Moreno, Antonio Bonafonte, José B. Mariño |
| 2001 | Speech enhanced remote control for media terminal. Aseel Ibrahim, Jonas Lundberg, Jenny Johansson |
| 2001 | Speech enhancement and source separation based on binaural negative beamforming. Agustín Álvarez-Marquina, Pedro Gómez-Vilda, Rafael Martínez-Olalla, Victor Nieto Lluis, María Victoria Rodellar Biarge |
| 2001 | Speech enhancement based on IMM with NPHMM. Yunjung Lee, Joohun Lee, Ki Yong Lee, Katsuhiko Shirai |
| 2001 | Speech lab in a box: a Mandarin speech toolbox to jumpstart speech related research. Eric Chang, Yu Shi, Jian-Lai Zhou, Chao Huang |
| 2001 | Speech quality measure for voIP using wavelet based bark coherence function. Sang-Wook Park, Young-Cheol Park, Dae Hee Youn |
| 2001 | Speech recognition at multiple sampling rates. Hans-Günter Hirsch, K. Hellwig, Stefan Dobler |
| 2001 | Speech recognition for huge vocabularies by using optimized sub-word units. Jan Kneissler, Dietrich Klakow |
| 2001 | Speech recognition of Japanese news commentary. Shinichi Homma, Akio Kobayashi, Shoei Sato, Toru Imai, Akio Ando |
| 2001 | Speech recognition of broadcast sports news. Atsushi Matsui, Hiroyuki Segi, Akio Kobayashi, Toru Imai, Akio Ando |
| 2001 | Speech recognition over netmeeting connections. Florian Metze, John W. McDonough, Hagen Soltau |
| 2001 | Speech recognition under musical environments using kalman filter and iterative MLLR adaptation. Masakiyo Fujimoto, Yasuo Ariki |
| 2001 | Speech synthesis development made easy: the bonn open synthesis system. Esther Klabbers, Karlheinz Stöber, Raymond N. J. Veldhuis, Petra Wagner, Stefan Breuer |
| 2001 | Speech translation for French in the NESPOLE! European project. Laurent Besacier, Hervé Blanchon, Yannick Fouquet, Jean-Philippe Guilbaud, Stéphane Helme, Sylviane Mazenot, Daniel Moraru, Dominique Vaufreydaz |
| 2001 | Speech/noise-dominant decision for speech enhancement. Sukhyun Yoon, Chang D. Yoo |
| 2001 | Speechbuilder: facilitating spoken dialogue system development. James R. Glass, Eugene Weinstein |
| 2001 | Speechdat-e: five eastern european speech databases for voice-operated teleservices completed. Henk van den Heuvel, Jérôme Boudy, Zsolt Bakcsi, Jan Cernocký, Valery Galunov, Julia Kochanina, Wojciech Majewski, Petr Pollák, Milan Rusko, Jerzy Sadowski, Piotr Staroniewicz, Herbert S. Tropf |
| 2001 | Split-band perceptual harmonic cepstral coefficients as acoustic features for speech recognition. Liang Gu, Kenneth Rose |
| 2001 | Spoken dialogue management as planning and acting under uncertainty. Bo Zhang, Qingsheng Cai, Jianfeng Mao, Eric Chang, Baining Guo |
| 2001 | Squared error as a measure of phase distortion. Harald Pobloth, W. Bastiaan Kleijn |
| 2001 | Statistical language model based on a hierarchical approach: MCnv. Imed Zitouni, Kamel Smaïli, Jean Paul Haton |
| 2001 | Statistical sound source identification in a real acoustic environment for robust speech recognition using a microphone array. Takanobu Nishiura, Satoshi Nakamura, Kiyohiro Shikano |
| 2001 | Stochastic F0 contour model based on the clustering of F0 shapes of a syntactic unit. Yoichi Yamashita, Tomoyoshi Ishida |
| 2001 | Stochastic finite state automata language model triggered by dialogue states. Yannick Estève, Frédéric Béchet, Alexis Nasr, Renato De Mori |
| 2001 | Structural learning of dynamic Bayesian networks in speech recognition. Murat Deviren, Khalid Daoudi |
| 2001 | Structured language model for class identification of out-of-vocabulary words arising from multiple wordclasses. Shigehiko Onishi, Hirofumi Yamamoto, Yoshinori Sagisaka |
| 2001 | Study and auto-detection of stress based on tonal pitch range in Mandarin. Xipeng Shen, Bo Xu |
| 2001 | Study on factors influencing durations of syllables in Mandarin. Min Chu, Yongqiang Feng |
| 2001 | Sub-band based additive noise removal for robust speech recognition. Jingdong Chen, Kuldip K. Paliwal, Satoshi Nakamura |
| 2001 | Subjective assessment of speech-system interface usability. Kate S. Hone, Robert Graham |
| 2001 | Support vector machine with dynamic time-alignment kernel for speech recognition. Hiroshi Shimodaira, Ken-ichi Noma, Mitsuru Nakai, Shigeki Sagayama |
| 2001 | Supporting the construction of a user model in speech-only interfaces by adding multi-modality. Jacques M. B. Terken, Saskia te Riele |
| 2001 | Syllable prominence: a matter of vocal effort, phonetic distinct-ness and top-down processing. Anders Eriksson, Gunilla C. Thunberg, Hartmut Traunmller |
| 2001 | Synthesizing intonation of standard arabic language. A. Zaki, A. Rajouani, Mohamed Najim |
| 2001 | Systematic F0 glitches around nasal-vowel transitions. Hideki Kawahara, Parham Zolfaghari |
| 2001 | TALKING FOREIGN - concatenative speech synthesis and the language barrier. Nick Campbell |
| 2001 | TclBLASR: an automatic speech recognition extension for tcl. Qiru Zhou, Jinsong Zheng, Chin-Hui Lee |
| 2001 | Techniques for high-quality ACELP coding of wideband speech. Bruno Bessette, Roch Lefebvre, Redwan Salami, Milan Jelinek, Janne Vainio, J. Rotola-Pukkila, Hannu Mikkola, Kari Järvinen |
| 2001 | Temporal decomposition: a promising approach to low rate wideband speech compression. Christian H. Ritz, Ian S. Burnett |
| 2001 | Testing the perceptual relevance of syntactic completion and melodic configuration for turn-taking in dutch. Johanneke Caspers |
| 2001 | Text-to-speech scripting interface for appropriate vocalisation of e-texts. Gerasimos Xydas, Georgios Kouroupetroglou |
| 2001 | Text-to-speech synthesis with arbitrary speaker's voice from average voice. Masatsune Tamura, Takashi Masuko, Keiichi Tokuda, Takao Kobayashi |
| 2001 | Thai grapheme-to-phoneme using probabilistic GLR parser. Pongthai Tarsaku, Virach Sornlertlamvanich, Rachod Thongprasirt |
| 2001 | The IFA corpus: a phonemically segmented dutch "open source" speech database. R. J. J. H. van Son, Diana Binnenpoorte, Henk van den Heuvel, Louis C. W. Pols |
| 2001 | The ISCA special interest group on speech synthesis. Nick Campbell, Wolfgang Hess, Bernd Möbius, Jan P. H. van Santen |
| 2001 | The WITAS multi-modal dialogue system I. Oliver Lemon, Anne Bracy, Alexander Gruenstein, Stanley Peters |
| 2001 | The development of a portuguese version of a media watch system. Rui Amaral, Thibault Langlois, Hugo Meinedo, João Paulo Neto, Nuno Souto, Isabel Trancoso |
| 2001 | The effect of pitch and lexical tone on different Mandarin speech recognition tasks. Yiu Wing Wong, Eric Chang |
| 2001 | The effect of time stress on automatic speech recognition accuracy when using second language. Fang Chen, Jonas Sääv |
| 2001 | The fundamental frequency of cough by autocorrelation analysis. Annemie Van Hirtum, Daniel Berckmans |
| 2001 | The generation of speech for a search guide. Nicholas J. Cook, Ian D. Benest |
| 2001 | The influence of vocal effort on human speaker identification. Douglas Brungart, Kimberly R. Scott, Brian D. Simpson |
| 2001 | The mvprotek : m-commerce voice verification system. Y. J. Kyung, J. O. Jung, S. M. Sohn, H. J. Chun, S. Y. Moon, M. H. Kim, W. H. Sull |
| 2001 | The nespole! voIP dialogue database. Susanne Burger, Laurent Besacier, Paolo Coletti, Florian Metze, Céline Morel |
| 2001 | The perceptual relevance of glottal-pulse parameter variations. Ralph van Dinther, Raymond N. J. Veldhuis, Armin Kohlrausch |
| 2001 | The relation between speech intelligibility and the complex modulation spectrum. Steven Greenberg, Takayuki Arai |
| 2001 | The relationship between intraoral air pressure and tongue/palate contact during the articulation of norwegian /t/ and /d/. Inger Moen, Hanne Gram Simonsen, Morten Huseby, John Grue |
| 2001 | The role of duration as a correlate of accent in lekeitio basque. Gorka Elordieta, José Ignacio Hualde |
| 2001 | The role of the palate in tongue kinematics: an experimental assessment in v sequences from EPG and EMMA data. Susanne Fuchs, Pascal Perrier, Christine Mooshammer |
| 2001 | The schwa in albanian. Theodor Granser, Sylvia Moosmller |
| 2001 | The speech synthesis environment and parametric modeling of coarticulation. Mikolaj Wypych |
| 2001 | The study of the effect of training set on statistical language modeling. Xipeng Shen, Bo Xu |
| 2001 | The technical processing in smartkom data collection: a case study. Ulrich Trk |
| 2001 | The u.s. speechdat-car data collection. Peter A. Heeman, David Cole, Andrew Cronk |
| 2001 | The use of fundamental frequency raising as a strategy for increasing vocal intensity in soft, normal, and loud phonation. Paavo Alku, Juha Vintturi, Erkki Vilkman |
| 2001 | The use of noisy frame elimination and frequency spectrum magnitude reduction in noise robust speech recognition. Damjan Vlaj, Zdravko Kacic, Bogomir Horvat |
| 2001 | The use of prosody in a combined system for punctuation generation and speech recognition. Ji-Hwan Kim, Philip C. Woodland |
| 2001 | Three-dimensional modelling of speech corpora: added value through visualisation. Toomas Altosaar, Matti Karjalainen, Martti Vainio |
| 2001 | Time and memory efficient viterbi decoding for LVCSR using a precompiled search network. Daniel Willett, Erik McDermott, Yasuhiro Minami, Shigeru Katagiri |
| 2001 | Timing and interaction of visual cues for prominence in audiovisual speech perception. David House, Jonas Beskow, Björn Granström |
| 2001 | Tonal alignment, scaling and slope in Italian question and statement tunes. Mariapaola D'Imperio |
| 2001 | Topic detection for language model adaptation of highly-inflected languages by using a fuzzy comparison function. Mirjam Sepesy Maucec, Zdravko Kacic |
| 2001 | Topic styles in IR and TDT: effect on system behavior. Martin Franz, J. Scott McCarley, Todd Ward, Wei-Jing Zhu |
| 2001 | Toward noise-tolerant acoustic models. Edmondo Trentin, Marco Gori |
| 2001 | Towards SMIL as a foundation for multimodal, multimedia applications. Jennifer L. Beckham, Giuseppe Di Fabbrizio, Nils Klarlund |
| 2001 | Towards a model of target oriented production of prosody. Grzegorz Dogil, Bernd Möbius |
| 2001 | Towards automatic transcription of spontaneous presentations. Takahiro Shinozaki, Chiori Hori, Sadaoki Furui |
| 2001 | Towards combining pitch and MFCC for speaker recognition systems. Hassan Ezzaidi, Jean Rouat, Douglas D. O'Shaughnessy |
| 2001 | Towards discriminative lexicon optimization. Hauke Schramm, Peter Beyerlein |
| 2001 | Towards the creation of acoustic models for stressed Japanese speech. Kozo Okuda, Tomoko Matsui, Satoshi Nakamura |
| 2001 | Training a sentence planner for spoken dialog: the impact of syntactic and planning features. Monica Rogati, Marilyn A. Walker, Owen Rambow |
| 2001 | Training prosodic phrasing rules for Chinese TTS systems. Weijun Chen, Fuzong Lin, Jianmin Li, Bo Zhang |
| 2001 | Transducer optimizations for tight-coupled decoding. Alexander Seward |
| 2001 | Transformation-based learning of danish stress assignment. Peter Juel Henrichsen |
| 2001 | Tree based score computation for speaker verification. Raphaël Blouet, Frédéric Bimbot |
| 2001 | Triggering individual word domains in n-gram language models. Elvira I. Sicilia-Garcia, Ji Ming, Francis Jack Smith |
| 2001 | Triphone tying techniques combining a-priori rules and data driven methods. Ute Ziegenhain, Josef G. Bauer |
| 2001 | Turkish word segmentation using morphological analyzer. M. Oguzhan Külekci, Mehmed Özkan |
| 2001 | Two features to check phonetic transcriptions in text to speech systems. Stefano Sandri, Enrico Zovato |
| 2001 | Two-stage probabilistic approach to text segmentation. Yi-Chia Chen, Yi-Chung Lin |
| 2001 | Unit selection for speech synthesis using splicing costs with weighted finite state transducers. Ivan Bulyko, Mari Ostendorf |
| 2001 | Universalizing speech: notes from the USI project. Stefanie Shriver, Roni Rosenfeld, Xiaojin Zhu, Arthur R. Toth, Alexander I. Rudnicky, Markus D. Flückiger |
| 2001 | Universities and industry: marriage or co-operation between independent partners? Ilkka Niiniluoto |
| 2001 | Unsupervised noisy environment adaptation algorithm using MLLR and speaker selection. Miichi Yamada, Akira Baba, Shinichi Yoshizawa, Yuichiro Mera, Akinobu Lee, Hiroshi Saruwatari, Kiyohiro Shikano |
| 2001 | Up to what level can acoustical and textual features predict prominence. Barbertje M. Streefkerk, Louis C. W. Pols, Louis ten Bosch |
| 2001 | Use of acoustic prior information for confidence measure in ASR applications. Erhan Mengusoglu, Christophe Ris |
| 2001 | Use of clustering information for coarticulation compensation in speech synthesis by word concatenation. Christos Vosnidis, Vassilios Digalakis |
| 2001 | Use of real and contaminated speech for training of a hands-free in-car speech recognizer. Marco Matassoni, Maurizio Omologo, Piergiorgio Svaizer |
| 2001 | Use of topic knowledge in spoken dialogue information retrieval system for academic documents. Shinya Kiriyama, Keikichi Hirose, Nobuaki Minematsu |
| 2001 | Using aerial and geometric features in automatic lip-reading. Jacek C. Wojdel, Léon J. M. Rothkrantz |
| 2001 | Using boosting and POS word graph tagging to improve speech recognition. Christer Samuelsson, James Hieronymus |
| 2001 | Using information retrieval methods for language model adaptation. Langzhou Chen, Jean-Luc Gauvain, Lori Lamel, Gilles Adda, Martine Adda-Decker |
| 2001 | Using linguopalatal contact patterns to tune a 3d tongue model. Olov Engwall |
| 2001 | Using machine learning techniques for grapheme to phoneme transcription. Franco Mana, Paolo Massimino, Alberto Pacchiotti |
| 2001 | Using real words for recording diphones. Susan Fitt |
| 2001 | Using spatial correlation information in speech recognition. Peng Yu, Zuoying Wang |
| 2001 | Using the modulation complex wavelet transform for feature extraction in automatic speech recognition. Yasunori Momomura, Kenji Okada, Takayuki Arai, Noboru Kanedera, Yuji Murahara |
| 2001 | Variable-length acoustic units inference for text-to-speech synthesis. Olivier Boëffard |
| 2001 | Variation in final lengthening as a function of topic structure. Caroline L. Smith, Lisa A. Hogan |
| 2001 | Viseme recognition using multiple feature matching. Islam Shdaifat, Rolf-Rainer Grigat, Stefan Lütgert |
| 2001 | Vocal tract normalization equals linear transformation in cepstral space. Michael Pitz, Sirko Molau, Ralf Schlüter, Hermann Ney |
| 2001 | Voice activity detection in noisy environments. Jan Stadermann, V. Stahl, G. Rose |
| 2001 | Voice transformations: from speech synthesis to mammalian vocalizations. Min Tang, Chao Wang, Stephanie Seneff |
| 2001 | Voice-IF: a mixed-initiative spoken dialogue system for AT&t conference services. Mazin G. Rahim, Giuseppe Di Fabbrizio, Candace A. Kamm, Marilyn A. Walker, A. Pokrovsky, P. Ruscitti, Esther Levin, Sungbok Lee, Ann K. Syrdal, K. Schlosser |
| 2001 | Vowel height is intimately associated with stress accent in spontaneous american English discourse. Leah Hitchcock, Steven Greenberg |
| 2001 | What is the best type of prior distribution for EMAP speaker adaptation? Patrick Kenny, Gilles Boulianne, Pierre Dumouchel |
| 2001 | Whispery voiced nasal stops in rwanda. Didier Demolin, Véronique Delvaux |
| 2001 | Why is automatic recognition of children's speech difficult? Qun Li, Martin J. Russell |
| 2001 | Wideband ACELP at 16 kb/s with multi-band excitation. Sílvia Pujalte, Asunción Moreno |
| 2001 | Wideband LSF quantization by generalized voronoi codes. Stéphane Ragot, Hassan Lahdili, Roch Lefebvre |
| 2001 | Wideband speech coding algorithm with application of discrete wavelet transform to upper band. Seung Won Lee, Keun-Sung Bae |
| 2001 | Word final aspiration as a phrase boundary cue: data from spontaneous Swedish discourse. Victoria Johansson, Merle Horne, Sven Strömqvist |
| 2001 | Word level confidence annotation using combinations of features. Rong Zhang, Alexander I. Rudnicky |
| 2001 | Word level confidence measures using n-best sub-hypotheses likelihood ratio. Beng Tiong Tan, Yong Gu, Trevor Thomas |
| 2001 | Word unit based multilingual comparative analysis of text corpora. Géza Németh, Csaba Zainkó |
| 2001 | Writing script-based dialogues for AAC. Iain R. Murray, John L. Arnott, Norman Alm, Richard Dye, Gillian Harper |
| 2001 | XISL: an attempt to separate multimodal interactions from XML contents. Tsuneo Nitta, Kouichi Katsurada, Hirobumi Yamada, Yusaku Nakamura, Satoshi Kobayashi |