INTERSPEECH - RankMe

671 papers

Year	Title / Authors
2001	"CU-move" : analysis & corpus development for interactive in-vehicle speech systems. John H. L. Hansen, Pongtep Angkititrakul, Jay P. Plucienkowski, Stephen Gallant, Umit H. Yapanel, Bryan L. Pellom, Wayne H. Ward, Ronald A. Cole
2001	A baseline method for compiling typed unification grammars into context free language models. Manny Rayner, John Dowding, Beth Ann Hockey
2001	A boosting approach for confidence scoring. Pedro J. Moreno, Beth Logan, Bhiksha Raj
2001	A case for multi-resolution auditory scene analysis. Sue Harding, Georg F. Meyer
2001	A comparative study of MLP-based artificial neural networks in text-independent speaker verification against GMM-based systems. Carlos E. Vivaracho, Javier Ortega-Garcia, Luis Alonso, Q. Isaac Moro
2001	A comparative study of pauses in dialogues and read speech. Sofia Gustafson-Capková, Beáta Megyesi
2001	A comparison between human vowel normalization strategies and acoustic vowel transformation techniques. Patti Adank, Roeland van Hout, Roel Smits
2001	A comparison of LPC and FFT-based acoustic features for noise robust ASR. Febe de Wet, Bert Cranen, Johan de Veth, Lou Boves
2001	A comparison of some different techniques for vector based call-routing. Stephen Cox, Ben Shahshahani
2001	A component by component listening test analysis of the IBM trainable speech synthesis system. Robert E. Donovan
2001	A computational efficient real time noise robust speech recognition based on improved spectral subtraction method. Bojan Kotnik, Zdravko Kacic, Bogomir Horvat
2001	A context adaptation approach for building context dependent models in LVCSR. Xiaoxing Liu, Baosheng Yuan, Yonghong Yan
2001	A data selection strategy for utterance verification in continuous speech recognition. Hui Jiang, Frank K. Soong, Chin-Hui Lee
2001	A dutch treatment of an elitist approach to articulatory-acoustic feature classification. Mirjam Wester, Steven Greenberg, Shuangyu Chang
2001	A face-to-muscle inversion of a biomechanical face model for audiovisual and motor control research. Michel Pitermann, Kevin G. Munhall
2001	A fast calculation method in LVCSRS by time-skipping and clustering of probability density distributions. Seiichi Nakagawa, Yukihisa Horibe
2001	A flexible multilingual TTS development and speech research tool. Géza Kiss, Géza Németh, Gábor Olaszy, Géza Gordos
2001	A functional approach to speech recognition evaluation. Ben Hutchinson
2001	A fundamental frequency estimation method for noisy speech based on instantaneous amplitude and frequency. Yuichi Ishimoto, Masashi Unoki, Masato Akagi
2001	A generalized multistage VQ approach for spectral magnitude quantization. Çagri Özgenc Etemoglu, Vladimir Cuperman
2001	A hybrid approach to enhance task portability of acoustic models in Chinese speech recognition. Jinsong Zhang, Shuwu Zhang, Yoshinori Sagisaka, Satoshi Nakamura
2001	A hybrid sub-band sinusoidal coding scheme. Meau Shin Ho, Derek J. Molyneux, Barry M. G. Cheetham
2001	A mixture of Gaussians front end for speech recognition. Matthew N. Stuttle, Mark J. F. Gales
2001	A model of F0 contour for arabic affirmative and interrogative sentences. Omar A. G. Ibrahim, Salwa H. El-Ramly, Nemat S. Abdel Kader
2001	A model of vowel production under positive pressure breathing. Allan J. South
2001	A multi-SNR subband model for speaker identification under noisy environments. Kenichi Yoshida, Kazuyuki Takagi, Kazuhiko Ozeki
2001	A multi-band approach based on the probabilistic union model and frequency-filtering features for robust speech recognition. Peter Jancovic, Ji Ming
2001	A multiconditional robust front-end feature extraction with a noise reduction procedure based on improved spectral subtraction algorithm. Bojan Kotnik, Zdravko Kacic, Bogomir Horvat
2001	A multidimensional scaling study of fricatives; a comparison of perceptual and physical dimensions. Wan Tokuma
2001	A multilingual, multimodal, speech training system, SPECO. Klára Vicsi, Peter Roach, Anne-Marie Öster, Zdravko Kacic, Ferenc Csatári, Anna Sfakianaki, R. Veronik, Géza Gordos
2001	A multilingual-supporting dialog system using a common dialog controller. YunBiao Xu, Masahiro Araki, Yasuhisa Niimi
2001	A new DP-like speaker clustering algorithm. Zhijian Ou, Zuoying Wang
2001	A new approach for wavelet speech enhancement. Mohammed Bahoura, Jean Rouat
2001	A new auditory based microphone array and objective evaluation using e-RASTI. José-Luis Sánchez-Bote, Joaquin Gonzalez-Rodriguez, Danilo Simon-Zorita
2001	A new dynamic HMM model for speech recognition. Feili Chen, Eric Chang
2001	A new feature driven cochlear implant speech processing strategy. Dashtseren Erdenebat, Shigeyoshi Kitazawa, Tatsuya Kitamura
2001	A new method for speech denoising and robust speech recognition using probabilistic models for clean speech and for noise. Hagai Attias, Li Deng, Alex Acero, John C. Platt
2001	A new method for speech recognition in the presence of non-stationary, unpredictable and high-level noise. Ikuyo Masuda-Katsuse
2001	A new method for testing communication efficiency and user acceptability of speech communication channels. Sander J. van Wijngaarden, Paula M. T. Smeele, Herman J. M. Steeneken
2001	A new multi-speaker formant synthesizer that applies voice conversion techniques. Juana M. Gutiérrez-Arriola, Juan Manuel Montero, José A. Vallejo, Ricardo de Córdoba, Rubén San Segundo, José Manuel Pardo
2001	A new technique based on augmented language models to improve the performance of spoken dialogue systems. Ramón López-Cózar, Diego H. Milone
2001	A new verification-based fast match approach to large vocabulary speech recognition. Feng Liu, Mohamed Afify, Hui Jiang, Olivier Siohan
2001	A novel algorithm for rapid speaker adaptation based on structural maximum likelihood eigenspace mapping. Bowen Zhou, John H. L. Hansen
2001	A novel target-driven MLLR adaptation algorithm with multi-layer structure. Lei Jia, Bo Xu
2001	A one pass semi-dynamic network decoder based on language model network. Dong-Hoon Ahn, Minhwa Chung
2001	A perspective on industry/university relationships in the US. Gary W. Strong
2001	A physiological analysis of nasals and nasalization in Chinese. Wing-Nga Fung, Sze-Lok Lau
2001	A portability study on natural language call steering. Hong-Kwang Jeff Kuo, Chin-Hui Lee
2001	A posteriori and a priori transformations for speaker adaptation in large vocabulary speech recognition systems. Driss Matrouf, Olivier Bellot, Pascal Nocera, Georges Linarès, Jean-François Bonastre
2001	A proposed method for measuring language dependency of narrow band voice coders. Sander J. van Wijngaarden, Herman J. M. Steeneken
2001	A quasi-one-dimensional model of aerodynamic and acoustic flow in the time-varying vocal tract: source and excitation mechanisms. Gordon Ramsay
2001	A real-time Japanese broadcast news closed-captioning system. Olivier Siohan, Akio Ando, Mohamed Afify, Hui Jiang, Chin-Hui Lee, Qi Li, Feng Liu, Kazuo Onoe, Frank K. Soong, Qiru Zhou
2001	A robust front-end algorithm for distributed speech recognition. Yan Ming Cheng, Dusan Macho, Yuanjun Wei, Douglas Ealey, Holly Kelleher, David Pearce, William Kushner, Tenkasi Ramabadran
2001	A robust front-end for ASR over IP snd GSM networks: an integrated scenario. Ascensión Gallardo-Antolín, Carmen Peláez-Moreno, Fernando Díaz-de-María
2001	A robust speaker verification system against imposture using an HMM-based speech synthesis system. Takayuki Satoh, Takashi Masuko, Takao Kobayashi, Keiichi Tokuda
2001	A rule based approach to extraction of topics and dialog acts in a spoken dialog system. Yasuhisa Niimi, Tomoki Oku, Takuya Nishimoto, Masahiro Araki
2001	A segmental mixture model for speaker recognition. Robert P. Stapert, John S. D. Mason
2001	A structured statistical language model conditioned by arbitrarily abstracted grammatical categories based on GLR parsing. Tomoyosi Akiba, Katunobu Itou
2001	A study of speech coding parameters in speech recognition. Jari Juhani Turunen, Damjan Vlaj
2001	A study on speech over the telephone and aging. Maxine Eskénazi, Alan W. Black
2001	A study on the production-perception link of English vowels produced by native and non-native speakers. Byunggon Yang
2001	A switched DPCM/subband coder for pre-echo reduction. S. Satheesh, T. V. Sreenivas
2001	A system for text dependent speaker verification - field trial evaluation and simulation results. Holger Schalk, Herbert Reininger, Stephan Euler
2001	A testbed for developing multilingual phonotactic descriptions. Simone Ashby, Julie Carson-Berndsen, Gina Joue
2001	A text-independent speaker verification system using support vector machines classifier. Yong Gu, Trevor Thomas
2001	A theme structure method for the ellipsis resolution. Yinfei Huang, Fang Zheng, Yi Su, Fang Li, Wenhu Wu
2001	A time-varying complex AR speech analysis based on GLS and ELS method. Keiichi Funaki
2001	A tool for automatic feedback on phonemic transcription. Martin Cooke, María Luisa García Lecumberri, John A. Maidment
2001	A transducer approach to word graph generation. Gilles Boulianne, Pierre Ouellet, Pierre Dumouchel
2001	A two-layer lexical tree based beam search in continuous Chinese speech recognition. Guoliang Zhang, Fang Zheng, Wenhu Wu
2001	A variable rate hybrid coder based on a synchronized harmonic excitation. Nilantha Katugampala, Ahmet M. Kondoz
2001	A weight pushing algorithm for large vocabulary speech recognition. Mehryar Mohri, Michael Riley
2001	A word graph interface for a flexible concept based speech understanding framework. Kadri Hacioglu, Wayne H. Ward
2001	A word- and turn-oriented approach to exploring the structure of Mandarin dialogues. Shu-Chuan Tseng
2001	ALGONQUIN: iterating laplace's method to remove multiple types of acoustic distortion for robust speech recognition. Brendan J. Frey, Li Deng, Alex Acero, Trausti T. Kristjansson
2001	AMR wideband codec - leap in mobile communication voice quality. J. Rotola-Pukkila, Janne Vainio, Hannu Mikkola, Kari Järvinen, Bruno Bessette, Roch Lefebvre, Redwan Salami, Milan Jelinek
2001	AMSTIVOC (AMsterdam system for transcription of infant VOCalizations) applied to utterances of deaf and normally hearing infants. Florien J. Koopmans-van Beinum, Chris J. Clement, Ineke Van den Dikkenberg-Pot
2001	ANVIL - a generic annotation tool for multimodal dialogue. Michael Kipp
2001	ASR - articulatory speech recognition. Joe Frankel, Simon King
2001	Accent label prediction by time delay neural networks using gating clusters. Achim F. Müller, Rüdiger Hoffmann
2001	Accent-independent universal HMM-based speech recognizer for american, australian and british English. Rathinavelu Chengalvarayan
2001	Acoustic correlates of emotion dimensions in view of speech synthesis. Marc Schröder, Roddy Cowie, Ellen Douglas-Cowie, Machiel Westerdijk, Stan C. A. M. Gielen
2001	Acoustic echo control and noise reduction for cabin car communication. Eduardo Lleida, Enrique Masgrau, Alfonso Ortega
2001	Acoustic feature compensation based on decomposition of speech and noise for ASR in noisy environments. Hong Kook Kim, Richard C. Rose, Hong-Goo Kang
2001	Acoustic modeling of foreign words in a German speech recognition system. Georg Stemmer, Elmar Nöth, Heinrich Niemann
2001	Acoustical and topological experiments for an HMM-based speech segmentation system. Samir Nefti, Olivier Boëffard
2001	Acquiring and implementing phonetic knowledge. Louis C. W. Pols
2001	Additive and convolutional noise canceling in speaker verification using a stochastic weighted viterbi algorithm. Néstor Becerra Yoma, Miguel Villar Fernandez
2001	Advances in automatic speech summarization. Chiori Hori, Sadaoki Furui
2001	African speech technology (AST) telephone speech databases: corpus design and contents. Philippa H. Louw, Justus C. Roux, Elizabeth C. Botha
2001	Agent-based error handling in spoken dialogue systems. Markku Turunen, Jaakko Hakulinen
2001	Aligning prosody and syntax in property grammars. Philippe Blache, Daniel Hirst
2001	Ambiguity representation and resolution in spoken dialogue systems. Egbert Ammicht, Alexandros Potamianos, Eric Fosler-Lussier
2001	An HMM/n-gram-based linguistic processing approach for Mandarin spoken document retrieval. Berlin Chen, Hsin-Min Wang, Lin-Shan Lee
2001	An MCE based classification tree using hierarchical feature-weighting in speech recognition. Fan Wang, Fang Zheng, Wenhu Wu
2001	An acoustical analysis of the vowels in beijing Mandarin. Eric Zee, Wai-Sum Lee
2001	An algorithm for finding line spectrum frequencies of added speech signals and its application to robust speech recognition. An-Tze Yu, Hsiao-Chuan Wang
2001	An approach to an Italian talking head. Catherine Pelachaud, Emanuela Magno Caldognetto, Claudio Zmarich, Piero Cosi
2001	An approach to automatic phonetic baseform generation based on Bayesian networks. Changxue Ma, Mark A. Randolph
2001	An auditory system-based feature for robust speech recognition. Qi Li, Frank K. Soong, Olivier Siohan
2001	An automatic dialogue system generator from the internet information contents. Masahiro Araki, Tasuku Ono, Kiyoshi Ueda, Takuya Nishimoto, Yasuhisa Niimi
2001	An efficient implementation of phonological rules using finite-state transducers. I. Lee Hetherington
2001	An efficient lipreading method using the symmetry of lip. Joohun Lee, Jin Young Kim
2001	An efficient transcoding algorithm for g.723.1 and g.729a speech coders. Sung-Wan Yoon, Sung-Kyo Jung, Young-Cheol Park, Dae Hee Youn
2001	An elitist approach to articulatory-acoustic feature classification. Shuangyu Chang, Steven Greenberg, Mirjam Wester
2001	An embodiment paradigm for speech recognition systems. Gina Joue, Julie Carson-Berndsen
2001	An improved wavelet-based speech enhancement system. Hamid Sheikhzadeh, Hamid Reza Abutalebi
2001	An interactive directory assistance service for Spanish with large-vocabulary recognition. Ricardo de Córdoba, Rubén San Segundo, Juan Manuel Montero, José Colás, Javier Ferreiros, Javier Macías Guarasa, José Manuel Pardo
2001	An investigation of HMM classifier combination strategies for improved audio-visual speech recognition. Simon Lucey, Sridha Sridharan, Vinod Chandran
2001	An investigation of modelling aspects for ratedependent speech recognition. Britta Wrede, Gernot A. Fink, Gerhard Sagerer
2001	An objective measure for assessment of the concatenative TTS segment inventories. Robert Batusek
2001	An objective measure for estimating MOS of synthesized speech. Min Chu, Hu Peng
2001	An online incremental language model adaptation method. Genqing Wu, Fang Zheng, Ling Jin, Wenhu Wu
2001	Analysis of n-best output hypotheses for fast speech in large vocabulary continuous speech recognition. Tibor Fábián, Thilo Pfau, Günther Ruske
2001	Analysis of speaker variability. Chao Huang, Tao Chen, Stan Z. Li, Eric Chang, Jian-Lai Zhou
2001	Analysis of the root-cepstrum for acoustic modeling and fast decoding in speech recognition. Ruhi Sarikaya, John H. L. Hansen
2001	Analysis of the voiced speech using the generalized fourier transform with quadratic phase. Davor Petrinovic, Vladimir Cuperman
2001	Aperiodicity control in ARX-based speech analysis-synthesis method. Takahiro Ohtsuka, Hideki Kasuya
2001	Application of the trended hidden Markov model to speech synthesis. John Dines, Sridha Sridharan, Miles Moody
2001	Applying parallel model compensation with mel-frequency discrete wavelet coefficients for noise-robust speech recognition. Zekeriya Tufekci, John N. Gowdy, Sabri Gurbuz, Eric K. Patterson
2001	Architecture for adaptive multimodal dialog systems based on voiceXML. Georg Niklfeld, Robert Finan, Michael Pucher
2001	Aspects of modern multi-modal/multi-media corpora exploitation environments. Daan Broeder, Hennie Brugman, Peter Wittenburg
2001	Auditory filter bank design using masking curves. Lee Lin, Eliathamby Ambikairajah, W. Harvey Holmes
2001	Auditory model based speech recognition in noisy environment. Xiaoqing Yu, Wanggen Wan, Daniel Pak-Kong Lun
2001	Auditory visual speech processing. Dominic W. Massaro
2001	Auditory-visual perception of lexical tone. Denis Burnham, Valter Ciocca, Stephanie Stokes
2001	Automated modeling of Chinese intonation in continuous speech. Greg Kochanski, Chilin Shih
2001	Automatic analysis of real dialogues and generating of training corpora. Jana Schwarz, Václav Matousek
2001	Automatic construction of CALL system from TV news program with captions. Takashi Tanaka, Kazumasa Mori, Satoshi Kobayashi, Seiichi Nakagawa
2001	Automatic labeling and digesting for lecture speech utilizing repeated speech by shift CDP. Yoshiaki Itoh, Kazuyo Tanaka
2001	Automatic learning of finite state automata for pronunciation modeling. Moisés Pastor-i-Gadea, Francisco Casacuberta
2001	Automatic n-gram language model creation from web resources. Ryuichi Nisimura, Kumiko Komatsu, Yuka Kuroda, Kentaro Nagatomo, Akinobu Lee, Hiroshi Saruwatari, Kiyohiro Shikano
2001	Automatic prosody generation - a model for hungarian. Gábor Olaszy, Géza Németh, Péter Olaszi
2001	Automatic rhythm modeling for language identification. Jérôme Farinas, François Pellegrino
2001	Automatic segmentation of recorded speech into syllables for speech synthesis. Eric Lewis, Mark Tatham
2001	Automatic word acquisition from continuous speech. Helmut Lucke, Masanori Omote
2001	Autoregressive time-frequency interpolation in the context of missing data theory for impulsive noise compensation. Ilyas Potamitis, Nikos Fakotakis
2001	Back-off smoothing evaluation over syntactic language models. Amparo Varona, Inés Torres
2001	Background learning of speaker voices for textindependent speaker identification. Wei-Ho Tsai, Y. C. Chu, Chao-Shih Huang, Wen-Whei Chang
2001	Bayesian methods for HMM speech recognition with limited training data. Darryl W. Purnell, Elizabeth C. Botha
2001	Blind source separation for speech based on fast-convergence algorithm with ICA and beamforming. Hiroshi Saruwatari, Toshiya Kawamura, Kiyohiro Shikano
2001	Blind speech separation of moving speakers using hybrid neural networks. Athanasios Koutras, Evangelos Dermatas, George K. Kokkinakis
2001	Boiling down prosody for the classification of boundaries and accents in German and English. Anton Batliner, Jan Buckow, Richard Huber, Volker Warnke, Elmar Nöth, Heinrich Niemann
2001	Breadth-first search for finding the optimal phonetic transcription from multiple utterances. Maximilian Bisani, Hermann Ney
2001	Broadcast news LM adaptation using contemporary texts. Marcello Federico, Nicola Bertoldi
2001	Building a corpus of natural speech - and tools for the processing of expressive speech. Nick Campbell
2001	Building an integrated prosodic model of German. Hansjörg Mixdorff, Oliver Jokisch
2001	Burst segmentation and evaluation of acoustic cues. Yves Laprie, Anne Bonneau
2001	Business listings in automatic directory assistance. Odette Scharenborg, Janienke Sturm, Lou Boves
2001	Calibration of microphone arrays for improved speech recognition. Michael L. Seltzer, Bhiksha Raj
2001	Caller identification for the SCANMail voicemail browser. Aaron E. Rosenberg, Julia Hirschberg, Michiel Bacchiani, Sarangarajan Parthasarathy, Philip L. Isenhour, Larry Stead
2001	Cantonese text-to-speech synthesis using sub-syllable units. Ka Man Law, Tan Lee, Wai H. Lau
2001	Class definition in discriminant feature analysis. Jacques Duchateau, Kris Demuynck, Dirk Van Compernolle, Patrick Wambacq
2001	Classification of transition sounds with application to automatic speech recognition. Zeev Litichever, Dan Chazan
2001	Classification of video genre using audio. Matthew Roach, John S. D. Mason
2001	Classifying emotions in speech: a comparison of methods. Noam Amir, Ori Kerret, Dimitry Karlinski
2001	Coarticulatory effects at prosodic boundaries: some acoustic results. Marija Tabain, Guillaume Rolland, Christophe Savariaux
2001	Coarticulatory effects in perception. Santiago Fernández, Sergio Feijóo
2001	Coding method for successive pitch periods. Ari Heikkinen, Vesa T. Ruoppila, Samuli Pietilä
2001	Cohorts based custom models for rapid speaker and dialect adaptation. Jian Wu, Eric Chang
2001	Combined front-end signal processing for in-vehicle speech systems. Jay P. Plucienkowski, John H. L. Hansen, Pongtep Angkititrakul
2001	Combined linear regression adaptation and Bayesian predictive classification for robust speech recognition. Jen-Tzung Chien
2001	Combined speech and audio coding with bit rate and bandwidth scalability. Maria Farrugia, Ahmet M. Kondoz
2001	Combining GMM's with suport vector machines for text-independent speaker verification. Jamal Kharroubi, Dijana Petrovska-Delacrétaz, Gérard Chollet
2001	Combining multi-party speech and text exchanges over the internet. Niels Ole Bernsen, Laila Dybkjær
2001	Combining word- and class-based language models: a comparative study in several languages using automatic and manual word-clustering techniques. Giulio Maltese, Paolo Bravetti, H. Crépy, B. J. Grainger, M. Herzog, Francisco Palou
2001	Communication aid for non-vocal people using corpusbased concatenative speech synthesis. Akemi Iida, Yosuke Sakurada, Nick Campbell, Michiaki Yasumura
2001	Compact word graph in spoken dialogue system. Shih-Chieh Chien, Sen-Chia Chang
2001	Comparative analysis for data-driven temporal filters obtained via principal component analysis (PCA) and linear discriminant analysis (LDA) in speech recognition. Jeih-weih Hung, Hsin-Min Wang, Lin-Shan Lee
2001	Comparative evaluation of F0 estimation algorithms. Alain de Cheveigné, Hideki Kawahara
2001	Comparing audio- and a-posteriori-probability-based stream confidence measures for audio-visual speech recognition. Martin Heckmann, Thorsten Wild, Frédéric Berthommier, Kristian Kroschel
2001	Comparing grammar-based and robust approaches to speech understanding: a case study. Sylvia Knight, Genevieve Gorrell, Manny Rayner, David Milward, Rob Koeling, Ian Lewin
2001	Comparing parameter tying methods for multilingual acoustic modelling. Mikko Harju, Petri Salmela, Jussi Leppänen, Olli Viikki, Jukka Saarinen
2001	Comparing the performance of two CSRs: how to determine the significance level of the differences. Helmer Strik, Catia Cucchiarini, Judith M. Kessens
2001	Comparing word-level intelligibility after linear vs. non-linear time-compression. Esther Janse
2001	Comparison of MFCC and PLP parameterizations in the speaker independent continuous speech recognition task. Josef Psutka, Ludek Müller, Josef V. Psutka
2001	Comparison of spectral derivative parameters for robust speech recognition. Dusan Macho, Climent Nadeu
2001	Comparison of width-wise and length-wise language model compression. Edward W. D. Whittaker, Bhiksha Raj
2001	Computationally efficient frequency-domain combination of acoustic echo cancellation and robust adaptive beamforming. Wolfgang Herbordt, Herbert Buchner, Walter Kellermann
2001	Concordancing for parallel spoken language corpora. Dafydd Gibbon, Thorsten Trippel, Serge Sharoff
2001	Confidence based lattice segmentation and minimum Bayes-risk decoding. Vaibhava Goel, Shankar Kumar, William Byrne
2001	Confidence measure (CM) estimation for large vocabulary speaker-independent continuous speech recognition system. Yaxin Zhang, Raymond Lee, Anton Madievski
2001	Considerations on what industry expects from universities. Yrjö Neuvo
2001	Constructing a segment database for greek time domain speech synthesis. Stavroula-Evita Fotinea, George Tambouratzis, George Carayannis
2001	Context-dependent probabilistic hierarchical sublexical modelling using finite state transducers. Xiaolong Mou, Stephanie Seneff, Victor Zue
2001	Corpus-based database of residual excitations used for speech reconstruction from MFCCs. Zbynek Tychtl, Josef Psutka
2001	Corpus-based synthesis of fundamental frequency contours based on a generation process model. Keikichi Hirose, Masaya Eto, Nobuaki Minematsu, Atsuhiro Sakurai
2001	Correction of the voice timbre distortions on telephone network. Gaël Mahé, André Gilloire
2001	Creating a european English broadcast news transcription corpus and system. Gerhard Backfried, Robert Hecht, Sabine Loots, Norbert Pfannerer, Jürgen Riedler, Christian Schiefer
2001	Credibility proof for speech content and speaker verification by fragile watermarking with consecutive frame-based processing. Yiou-Wen Cheng, Lin-Shan Lee
2001	Crosslingual speech recognition with multilingual acoustic models based on agglomerative and tree-based triphone clustering. Andrej Zgank, Bojan Imperl, Finn Tore Johansen, Zdravko Kacic, Bogomir Horvat
2001	Cues for perceived pitch register. Toni C. M. Rietveld, Patricia Vermillion
2001	DARPA communicator dialog travel planning systems: the june 2000 data collection. Marilyn A. Walker, John S. Aberdeen, Julie E. Boland, Elizabeth Owen Bratt, John S. Garofolo, Lynette Hirschman, Audrey N. Le, Sungbok Lee, Shrikanth S. Narayanan, Kishore Papineni, Bryan L. Pellom, Joseph Polifroni, Alexandros Potamianos, P. Prabhu, Alexander I. Rudnicky, Gregory A. Sanders, Stephanie Seneff, David Stallard, Steve Whittaker
2001	DIARCA: a component approach to voice recognition. Juan Carlos Díaz Martín, Juan-Luis García Zapata, José Manuel Rodríguez García, José F. Álvarez Salgado, Pablo Espada Bueno, Pedro Gómez-Vilda
2001	Data-driven semantic inference for unconstrained desktop command and control. Jerome R. Bellegarda, Kim E. A. Silverman
2001	Defining constraints for multilinear speech processing. Julie Carson-Berndsen, Michael Walsh
2001	Deriving document structure from prosodic cues. Martin Haase, Werner Kriechbaum, Gregor Möhler, Gerhard Stenzel
2001	Design of a semantic parser with support to ellipsis resolution in a Chinese spoken language dialogue system. Yi Su, Fang Zheng, Yinfei Huang
2001	Design of an optimal continuous speech database for text-to-speech synthesis considered as a set covering problem. Hélène François, Olivier Boëffard
2001	Design of speech corpus for text-to-speech synthesis. Jindrich Matousek, Josef Psutka, Jiri Kruta
2001	Designing very compact decision trees for grapheme-to-phoneme transcription. Anne K. Kienappel, Reinhard Kneser
2001	Detecting Japanese local speech rate deceleration in spontaneous conversational speech using a variable threshold. Keiichi Takamaru, Makoto Hiroshige, Kenji Araki, Koji Tochinai
2001	Detection of OOV words using generalized word models and a semantic class language model. Thomas Schaaf
2001	Detection of digital transmission systems for voice quality measurements. Thorsten Ludwig, Ulrich Heute
2001	Detection of recognition errors and out of the spelling dictionary names in a spelled name recognizer for Spanish. Rubén San Segundo, Javier Macías Guarasa, Javier Ferreiros, P. Martín, José Manuel Pardo
2001	Development of Russian lexical databases, corpora and supporting tools for speech products. Serge A. Yablonsky
2001	Development of an asynchronous multi-band system for continuous speech recognition. Yik-Cheung Tam, Brian Kan-Wing Mak
2001	Development of vowel quantity perception in late childhood. Dawn M. Behne, Peter E. Czigler, Kirk P. H. Sullivan
2001	Dialogue session: management using voiceXML. Augustine Tsai, Andrew N. Pargellis, Chin-Hui Lee, Joseph P. Olive
2001	Discriminant analysis of nasal vs. oral vowels in French: comparison between different parametric representations. Véronique Delvaux, Alain Soquet
2001	Discrimination between speech and music based on a low frequency modulation feature. Stefan Karnebäck
2001	Discriminative disfluency modeling for spontaneous speech recognition. Chung-Hsien Wu, Gwo-Lang Yan
2001	Discriminative speaker adaptation with conditional maximum likelihood linear regression. Asela Gunawardana, William Byrne
2001	Distinctive features for use in an automatic speech recognition system. Ellen Eide
2001	Distributed speech recognition using traditional and hybrid modeling techniques. Jan Stadermann, Ralf Meermeier, Gerhard Rigoll
2001	Do speakers realize the prosodic structure they say they do? Olga van Herwijnen, Jacques M. B. Terken
2001	Domain-independent spoken dialogue platform using key-phrase spotting based on combined language model. Kazunori Komatani, Katsuaki Tanaka, Hiroaki Kashima, Tatsuya Kawahara
2001	Dual channel speech enhancement using coherence function and MDL-based subspace approach in bark domain. Rolf Vetter, Philippe Renevey, Jens Krauss
2001	Dynamic lexicon using phonetic features. Kyung-Tak Lee, Christian Wellekens
2001	ELRA contribution to bridge the gap between industry and academia. Khalid Choukri
2001	EUROSPEECH 2001 Scandinavia, 7th European Conference on Speech Communication and Technology, 2nd INTERSPEECH Event, Aalborg, Denmark, September 3-7, 2001 Paul Dalsgaard, Børge Lindberg, Henrik Benner, Zheng-Hua Tan
2001	Effects of OOV rates on keyphrase rejection schemes. Gies Bouwman, Janienke Sturm, Lou Boves
2001	Effects of noise adaptation on the perception of voiced plosives in isolated syllables. William A. Ainsworth, T. Cervera
2001	Efficient decoding strategy for conversational speech recognition using state-space models for vocal-tract-resonance dynamics. Jeff Z. Ma, Li Deng
2001	Efficient implementation of ITU-t g.723.1 speech coder for multichannel voice transmission and storage. Sung-Kyo Jung, Young-Cheol Park, Sung-Wan Yoon, Kyung-Tae Kim, Dae Hee Youn
2001	Efficient periodicity extraction based on sine-wave representation and its application to pitch determination of speech signals. Dan Chazan, Meir Tzur, Ron Hoory, Gilad Cohen
2001	Efficient scalable speech compression for scalable speech recognition. Naveen Srinivasamurthy, Antonio Ortega, Shrikanth S. Narayanan
2001	Efficient speech enhancement by diffusive gain factors (DGF). Hyoung-Gook Kim, Klaus Obermayer, Mathias Bode, Dietmar Ruwisch
2001	Efficient stochastic finite-state networks for language modelling in spoken dialogue systems. Kallirroi Georgila, Nikos Fakotakis, George K. Kokkinakis
2001	Eigen-MLLR coefficients as new feature parameters for speaker identification. Nick J.-C. Wang, Wei-Ho Tsai, Lin-Shan Lee
2001	Ejective reduction in chaha is conditioned by more than prosodic position. Rachel Coulston
2001	Elderly acoustic model for large vocabulary continuous speech recognition. Akira Baba, Shinichi Yoshizawa, Miichi Yamada, Akinobu Lee, Kiyohiro Shikano
2001	Electromagnetic articulograph (EMA) based on a nonparametric representation of tthe magnetic field. Tokihiko Kaburagi, Masaaki Honda
2001	Emerging requirements for multi-modal annotation and analysis tools. Tony Bigbee, Dan Loehr, Lisa Harper
2001	Emotional speech synthesis: a review. Marc Schröder
2001	Enhancement of noisy speech by using improved global soft decision. Vladimir I. Shin, Doh-Suk Kim, Moo Young Kim, Jeongsu Kim
2001	Enhancement of speech using bark-scaled wavelet packet decomposition. Israel Cohen
2001	Enhancing GMM scores using SVM "hints". Shai Fine, Jirí Navrátil, Ramesh A. Gopinath
2001	Enhancing distributed speech recognition with back- end speech reconstruction. Tenkasi Ramabadran, Jeff Meunier, Mark A. Jasiuk, Bill Kushner
2001	Entropy based voice activity detection in very noisy conditions. Philippe Renevey, Andrzej Drygajlo
2001	Envelope information in speech processing: acoustic-phonetic analysis vs. auditory figure-ground segregation. Olivier Crouzet, William A. Ainsworth
2001	Equivalence between frequency domain blind source separation and frequency domain adaptive null beamformers. Shoko Araki, Shoji Makino, Ryo Mukai, Hiroshi Saruwatari
2001	Error correcting posterior combination for robust multi-band speech recognition. Astrid Hagen, Hervé Bourlard
2001	Estimating pronunciation variations from acoustic likelihood score for HMM reconstruction. Yi Liu, Pascale Fung
2001	Estimation of the modulation frequency and modulation depth of the fundamental frequency owing to vocal micro-tremor of the voice source signal. Jean Schoentgen
2001	European portuguese nasal vowels: an EMMA study. António J. S. Teixeira, Francisco A. C. Vaz
2001	Eutrans: a speech-to-speech translator prototype. Moisés Pastor-i-Gadea, Alberto Sanchís, Francisco Casacuberta, Enrique Vidal
2001	Evaluating the Aurora connected digit recognition task - a bell labs approach. Mohamed Afify, Hui Jiang, Filipp Korkmazskiy, Chin-Hui Lee, Qi Li, Olivier Siohan, Frank K. Soong, Arun C. Surendran
2001	Evaluation of PROS-3 for the assignment of prosodic structure, compared to assignment by human experts. Olga van Herwijnen, Jacques M. B. Terken
2001	Evaluation of a generalized dynamic cepstrum in distant speech recognition. Hiroshi Matsumoto, Akihiko Shimizu, Kazumasa Yamamoto
2001	Evaluation of an automatically obtained shape and appearance model for automatic audio visual speech recognition. Philippe Daubias, Paul Deléglise
2001	Evaluation of cross-language voice conversion based on GMM and straight. Mikiko Mashimo, Tomoki Toda, Kiyohiro Shikano, Nick Campbell
2001	Evaluation of front-end features and noise compensation methods for robust Mandarin speech recognition. Rathi Chengalvarayan
2001	Evaluation of recent speech grammar standardization efforts. Tom Brøndsted
2001	Evaluation of sublexical and lexical models of acoustic disfluencies for spontaneous speech recognition in Spanish. Luis Javier Rodríguez, Inés Torres, Amparo Varona
2001	Evaluation of the SPLICE algorithm on the Aurora2 database. Jasha Droppo, Li Deng, Alex Acero
2001	Evaluation on unsupervised speaker adaptation based on sufficient HMM statictics of selected speakers. Shinichi Yoshizawa, Akira Baba, Kanako Matsunami, Yuichiro Mera, Miichi Yamada, Akinobu Lee, Kiyohiro Shikano
2001	Everyday life sounds and speech analysis for a medical telemonitoring system. Eric Castelli, Dan Istrate
2001	Experimental evaluation on confidence of agreement among multiple Japanese LVCSR models. Yasuhiro Kodama, Takehito Utsuro, Hiromitsu Nishizaki, Seiichi Nakagawa
2001	Experiments on cross-language acoustic modeling. Tanja Schultz, Alex Waibel
2001	Experiments with the philips continuous ASR system on the AURORA noisy digits database. Markus Lieb, Alexander Fischer
2001	Explicit exploitation of stochastic characteristics of test utterance for text-independent speaker identification. Wei-Ho Tsai, Wen-Whei Chang, Chao-Shih Huang
2001	Exploring the null space of the acoustic-to- articulatory inversion using a hypercube codebook. Slim Ouni, Yves Laprie
2001	Extracting caller information from voicemail. Geoffrey Zweig, Jing Huang, Mukund Padmanabhan
2001	Extractive summarization of voicemail using lexical and prosodic feature subset selection. Konstantinos Koumpis, Steve Renals, Mahesan Niranjan
2001	F0 feature extraction by polynomial regression function for monosyllabic Thai tone recognition. Patavee Charnvivit, Somchai Jitapunkul, Visarut Ahkuputra, Ekkarit Maneenoi, Umavasee Thathong, Boonchai Thampanitchawong
2001	FST-based recognition techniques for multi-lingual and multi-domain spontaneous speech. Timothy J. Hazen, I. Lee Hetherington, Alex Park
2001	Factors affecting schwa-insertion in final consonant clusters in standard dutch. Marc Swerts, Hanne Kloots, Steven Gillis, Georges De Schutter
2001	Fast adaptation using constrained affine transformations with hierarchical priors. Tor André Myrvoll, Kuldip K. Paliwal, Torbjørn Svendsen
2001	Fast harmonic estimation using a low resolution pitch for low bit rate harmonic coding. Yong-Soo Choi, Dae Hee Youn
2001	Feature extraction and model-based noise compensation for noisy speech recognition evaluated on AURORA 2 task. Kaisheng Yao, Jingdong Chen, Kuldip K. Paliwal, Satoshi Nakamura
2001	Feature extraction by auditory modeling for unit selection in concatenative speech synthesis. Minoru Tsuzaki
2001	Feature extraction from time-frequency matrices for robust speech recognition. José C. Segura, M. Carmen Benítez, Ángel de la Torre, Antonio J. Rubio
2001	Feature vector selection to improve ASR robustness in noisy conditions. Johan de Veth, Laurent Mauuary, Bernhard Noé, Febe de Wet, Jürgen Sienel, Lou Boves, Denis Jouvet
2001	Festival speaks Italian! Piero Cosi, Fabio Tesser, Roberto Gretter, Cinzia Avesani, Mike Macon
2001	Finite state prosodic analysis of african corpus resources. Dafydd Gibbon
2001	First steps toward an adaptive spoken dialogue system in medical domain. Ivano Azzini, Daniele Falavigna, Roberto Gretter, Giordano Lanzola, Marco Orlandi
2001	Formant estimation using gammachirp filterbank. Kaïs Ouni, Zied Lachiri, Noureddine Ellouze
2001	Formant-broadened CMS using peak-picking in LOG spectrum. Yu-Jin Kim, Hea-Kyoung Jung, Jae-Ho Chung
2001	Forward masking for increased robustness in automatic speech recognition. Sascha Wendt, Gernot A. Fink, Franz Kummert
2001	From here to utility - melding phonetic insight with speech technology. Steven Greenberg
2001	From perceptual designs to linguistic typology and automatic language identification : overview and perspectives. Melissa Barkat, Ioana Vasilescu
2001	Fun or boring? a web-based evaluation of expressive synthesis for children. Kjell Gustafson, David House
2001	Gaussian subtraction (GS) algorithms for word spotting in continuous speech. Avi Faizakov, Arnon Cohen, Tzur Vaich
2001	Generalized source-filter structures for speech synthesis. Matti Karjalainen, Tuomas Paatero
2001	Generating F0 contours by statistical manipulation of natural F0 shapes. Takashi Saito, Masaharu Sakamoto
2001	Generating duration from a cognitively plausible model of rhythm production. Plínio A. Barbosa
2001	Good timing: place-dependent voice onset time in ejective stops. Ian Maddieson
2001	Graceful degradation of speech recognition performance over lossy packet networks. Eve A. Riskin, Constantinos Boulis, Scott Otterson, Mari Ostendorf
2001	Graphic platform for designing and developing practical voice interaction systems. Tomás Nouza, Jan Nouza
2001	HMM2- extraction of formant structures and their use for robust ASR. Katrin Weber, Samy Bengio, Hervé Bourlard
2001	Hansori 2001 - corpus-based implementation of the Korean hansori text-to-speech synthesizer. Attila Ferencz, Sung-Woo Choi, Ho-Eun Song, Myoung-Wan Koo
2001	Harmonic tunnelling: tracking non-stationary noises during speech. Douglas Ealey, Holly Kelleher, David Pearce
2001	Helium speech normalisation by codebook mapping. Adam Podhorski, Marek Czepulonis
2001	High quality voice conversion based on Gaussian mixture model with dynamic frequency warping. Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano
2001	How visual co-presence and joint attention shape speaking. Susan E. Brennan
2001	Human language identification with reduced segmental information: comparison between monolinguals and bilinguals. Masahiko Komatsu, Kazuya Mori, Takayuki Arai, Yuji Murahara
2001	Hybrid natural language generation for spoken dialogue systems. Michel Galley, Eric Fosler-Lussier, Alexandros Potamianos
2001	Hypothesis-driven accent discrimination. Laura Mayfield Tomokiyo
2001	ISCA SALTMIL SIG: speech and language technology for minority languages. Climent Nadeu, Donncha Cróinín, Bojan Petek, Kepa Sarasola, Briony Williams
2001	ISIS: a learning system with combined interaction and delegation dialogs. Helen M. Meng, Shuk Fong Chan, Yee Fong Wong, Cheong Chat Chan, Yiu Wing Wong, Tien Ying Fung, Wai Ching Tsui, Ke Chen, Lan Wang, Ting-Yao Wu, Xiaolong Li, Tan Lee, Wing Nin Choi, P. C. Ching, Huisheng Chi
2001	Identification of accent and intonation in sentences for CALL systems. Carlos Toshinori Ishi, Nobuaki Minematsu, Ryuji Nishide, Keikichi Hirose
2001	Implementation effective one-channel noise reduction system. Jiri Tihelka, Pavel Sovka
2001	Improved context-dependent acoustic modeling for continuous Chinese speech recognition. Jiyong Zhang, Fang Zheng, Jing Li, Chunhua Luo, Guoliang Zhang
2001	Improved data-driven generation of pronunciation dictionaries using an adapted word list. Matthias Wolff, Matthias Eichner, Rüdiger Hoffmann
2001	Improved entropic gain for speech signals analysis/synthesis based on an adaptive time-frequency segmentation scheme. Gilles Gonon, Silvio Montrésor, Marc Baudry
2001	Improved maximum mutual information estimation training of continuous density HMMs. Jing Zheng, John Butzberger, Horacio Franco, Andreas Stolcke
2001	Improved phoneme-history-dependent search for large-vocabulary continuous-speech recognition. Takaaki Hori, Yoshiaki Noda, Shoichi Matsunaga
2001	Improved speech recognition using iterative decoding based on confidence measures. Jun Ogata, Yasuo Ariki
2001	Improved spoken document retrieval by exploring extra acoustic and linguistic cues. Berlin Chen, Hsin-Min Wang, Lin-Shan Lee
2001	Improved word confidence estimation using long range features. David D. Palmer, Mari Ostendorf
2001	Improvement of a structured language model: arbori-context tree. Shinsuke Mori, Masafumi Nishimura, Nobuyasu Itoh
2001	Improvement of speaker verification for Thai language. Chai Wutiwiwatchai, Varin Achariyakulporn, Sawit Kasuriya
2001	Improvements in audio processing and language modeling in the CU communicator. Jianping Zhang, Wayne H. Ward, Bryan L. Pellom, Xiuyang Yu, Kadri Hacioglu
2001	Improvements in the speaker identification rate using feature-sets. Daniel J. Mashao, N. Tinyiko Baloyi
2001	Improving automatic speech recognition using tangent distance. Wolfgang Macherey, Daniel Keysers, Jörg Dahmen, Hermann Ney
2001	Improving genericity for task-independent speech recognition. Fabrice Lefèvre, Jean-Luc Gauvain, Lori Lamel
2001	Improving performance of a keyword spotting system by using a new confidence measure. Luciana Ferrer, Claudio Estienne
2001	Improving simultaneous speech recognition in real room environments using overdetermined blind source separation. Athanasios Koutras, Evangelos Dermatas, George K. Kokkinakis
2001	Improving speaker recognition using phonetically structured Gaussian mixture models. Robert Faltlhauser, Günther Ruske
2001	Information extraction via heuristics for a movie showtime query system. Martin Jansche
2001	Information fusion for robust speaker verification. Conrad Sanderson, Kuldip K. Paliwal
2001	Instantaneous estimation of accentuation habits for Japanese students to learn English pronunciation. Naoki Nakamura, Nobuaki Minematsu, Seiichi Nakagawa
2001	Instrumental derivation of equipment impairment factors for describing telephone speech codec degradations. Sebastian Möller, Jens Berger
2001	Integrating contextual phonological rules in a large vocabulary decoder. Guillaume Gravier, François Yvon, Bruno Jacob, Frédéric Bimbot
2001	Integrating multiple knowledge sources for improved speech understanding. Sherif M. Abdou, Michael S. Scordilis
2001	Integrating speech technology in language learning: an overview of the activities of inSTIL. Philippe Delcloque
2001	Intonation modelling with a lexicon of natural F0 contours. Per Olav Heggtveit, Jon Emil Natvig
2001	Intonational phrase break prediction using decision tree and n-gram model. Xuejing Sun, Ted H. Applebaum
2001	Introducing phonetically motivated information into ASR. Heidi Christensen, Børge Lindberg, Ove Andersen
2001	Invariance of relative F0 change field of Chinese disyllabic words. Dawei Xu, Hiroki Mori, Hideki Kasuya
2001	Inverse filtering of tube models with frequency dependent tube terminations. Karl Schnell, Arild Lacroix
2001	Investigations into tandem acoustic modeling for the Aurora task. Daniel P. W. Ellis, Manuel J. Reyes Gomez
2001	Investigations on conversational speech recognition. Peter Beyerlein, Xavier L. Aubert, Matthew Harris, Carsten Meyer, Hauke Schramm
2001	Is non-native pronunciation modelling necessary ? Silke Goronzy, Marina Sahakyan, Wolfgang Wokurek
2001	Is speech data clustered? - statistical analysis of cepstral features. Tomi Kinnunen, Ismo Kärkkäinen, Pasi Fränti
2001	Is this conversation on track? Paul Carpenter, Chun Jin, Daniel Wilson, Rong Zhang, Dan Bohus, Alexander I. Rudnicky
2001	Iterative implementation of dialogue system modules. Lars Degerstedt, Arne Jönsson
2001	Japanese can be aware of syllables and morae: evidence from Japanese-English bilingual children. Takashi Otake, Yuka Yamaguchi
2001	Javaspeakerrecognition - interactive workbench for visualizing speaker recognition concepts on the WWW. Andrzej Drygajlo, Gary Garcia Molina
2001	Joint channel decoding - Viterbi recognition for wireless applications. Alexis Bernard, Abeer Alwan
2001	Joint source-channel coding for low bit-rate coding of LSP parameters. José L. Pérez-Córdoba, Antonio J. Rubio, Antonio M. Peinado, Ángel de la Torre
2001	Joint speech and audio coding combining sinusoidal modeling and wavelet packets. Márk Fék, Annamária R. Várkonyi-Kóczy, Jean-Marc Boucher
2001	Julius - an open source real-time large vocabulary recognition engine. Akinobu Lee, Tatsuya Kawahara, Kiyohiro Shikano
2001	Knowledge of language origin improves pronunciation accuracy of proper names. Ariadna Font Llitjós, Alan W. Black
2001	Language models conditioned on dialog state. Karthik Visweswariah, Harry Printz
2001	Language-specific effects of pitch range on the perception of universal intonational meaning. Aoju Chen, Toni C. M. Rietveld, Carlos Gussenhoven
2001	Large broadcast news and read speech corpora of spoken czech. Josef Psutka, Vlasta Radová, Ludek Müller, Jindrich Matousek, Pavel Ircing, David Graff
2001	Large vocabulary statistical language modeling for continuous speech recognition in finnish. Vesa Siivola, Mikko Kurimo, Krista Lagus
2001	Large-vocabulary audio-visual speech recognition by machines and humans. Gerasimos Potamianos, Chalapathy Neti, Giridharan Iyengar, Eric Helmuth
2001	Learning of user formulations for business listings in automatic directory assistance. Cosmin Popovici, Marco Andorno, Pietro Laface, Luciano Fissore, Mario Nigra, Claudio Vair
2001	Learning prosodic features using a tree representation. Julia Hirschberg, Owen Rambow
2001	Learning units for domain-independent out-of- vocabulary word modelling. Issam Bazzi, James R. Glass
2001	Lessons from the development of a conversational interface. Marianne Hickey, Paul St John Brittan
2001	Lexical stress modeling for improved speech recognition of spontaneous telephone speech in the jupiter domain. Chao Wang, Stephanie Seneff
2001	Lexicon optimization for dutch speech recognition in spoken document retrieval. Roeland Ordelman, Arjan van Hessen, Franciska de Jong
2001	Liaison and schwa deletion in French: an effect of lexical frequency and competition? Cécile Fougeron, Jean-Philippe Goldman, Ulrich H. Frauenfelder
2001	Limited enquiry negotiation dialogues. Ian Lewin
2001	Linear interpolation of cepstral variance for noisy speech recognition. Tai-Hwei Hwang, Kuo-Hwei Yuo, Hsiao-Chuan Wang
2001	Linguistic factors affecting timing in Korean with application to speech synthesis. Hyunsong Chung, Mark A. Huckvale
2001	Lip-reading from parametric lip contours for audio- visual speech recognition. Sabri Gurbuz, Eric K. Patterson, Zekeriya Tufekci, John N. Gowdy
2001	Local refinement of phonetic boundaries: a general framework and its application using different transition models. Doroteo Torre Toledano, Luis A. Hernández Gómez
2001	Low rate speech coding incorporating simultaneously masked spectrally weighted linear prediction. Jason Lukasiak, Ian S. Burnett, Christian H. Ritz
2001	Low-resource hidden Markov model speech recognition. Sabine Deligne, Ellen Eide, Ramesh A. Gopinath, Dimitri Kanevsky, Benoît Maison, Peder A. Olsen, Harry Printz, Jan Sedivý
2001	Lower WERs do not guarantee better transcriptions. Judith M. Kessens, Helmer Strik
2001	MAP combination of multi-stream HMM or HMM/ANN experts. Andrew C. Morris, Astrid Hagen, Hervé Bourlard
2001	MINOS-II: a prototype car navigation system with mixed initiative turn taking dialogue. Munehiko Sasajima, Takehide Yano, Taishi Shimomori, Tatsuya Uehara
2001	MMSE-based channel error mitigation for distributed speech recognition. Antonio M. Peinado, Victoria E. Sánchez, José C. Segura, José L. Pérez-Córdoba
2001	Making the tongue model talk: merging MRI & EMA measurements. Olov Engwall
2001	Map estimation for on-line noise compensation of time trajectories of spectral coefficients. Ilyas Potamitis, Nikos Fakotakis, George K. Kokkinakis
2001	Mathematical modeling of spoken human - machine dialogues including erroneous confirmations. D. Louloudis, Anastasios Tsopanoglou, Nikos Fakotakis, George K. Kokkinakis
2001	Maximum likelihood adaptation for distant speech recognition of stationary and moving speakers in reverberant environments. George Nokas, Evangelos Dermatas, George K. Kokkinakis
2001	Maximum likelihood non-linear transformation for environment adaptation in speech recognition. Mukund Padmanabhan, Satya Dharanipragada
2001	Maximum-likelihood affine cepstral filtering (MLACF) technique for speaker normalization. Yoon Kim
2001	Maximum-likelihood training of a bipartite acoustic model for speech recognition. Florent Perronnin, Roland Kuhn, Patrick Nguyen, Jean-Claude Junqua
2001	Measuring pitch range. Hanny den Ouden, Jacques M. B. Terken
2001	Measuring rhythmic deviation in second language speech. Felix Schaeffler
2001	Measuring speech rhythm. Dafydd Gibbon, Ulrike Gut
2001	Mechanical versus perceptual constraints as determinants of articulatory strategy. Ahmed M. Elgendy, Louis C. W. Pols
2001	Methodology for dialogue design in telephone-based spoken dialogue systems: a Spanish train information system. Rubén San Segundo, Juan Manuel Montero, José Colás, Juana M. Gutiérrez, J. M. Ramos, José Manuel Pardo
2001	Metrics for measuring domain independence of semantic classes. Andrew N. Pargellis, Eric Fosler-Lussier, Alexandros Potamianos, Chin-Hui Lee
2001	Minimax classification with parametric neighborhoods for noisy speech recognition. Mohamed Afify, Olivier Siohan, Chin-Hui Lee
2001	Minimum classification error training for speaker identification using Gaussian mixture models based on multi-space probability distribution. Chiyomi Miyajima, Keiichi Tokuda, Tadashi Kitamura
2001	Mixed excitation for HMM-based speech synthesis. Takayoshi Yoshimura, Keiichi Tokuda, Takashi Masuko, Takao Kobayashi, Tadashi Kitamura
2001	Mobile future. Yrjö Neuvo
2001	Model agglomeration for context-dependent acoustic modeling. Fabio Brugnara
2001	Model based stress decision method. Wooil Kim, Taeyun Kim, Sungjoo Ahn, Hanseok Ko
2001	Model complexity optimization for nonnative English speakers. Xiaodong He, Yunxin Zhao
2001	Model-based blind estimation of reverberation time: application to robust ASR in reverberant environments. Laurent Couvreur, Christophe Ris, Christophe Couvreur
2001	Model-based compensation of the additive noise for continuous speech recognition. experiments using the Aurora II database and tasks. José C. Segura, Ángel de la Torre, M. Carmen Benítez, Antonio M. Peinado
2001	Modeling auxiliary information in Bayesian network based ASR. Todd A. Stephenson, Mathew Magimai-Doss, Hervé Bourlard
2001	Modeling of conversational strategy for the robot participating in the group conversation. Yosuke Matsusaka, Shinya Fujie, Tetsunori Kobayashi
2001	Modeling pronunciation variation using context-dependent weighting and b/s refined acoustic modeling. Fang Zheng, Zhanjiang Song, Pascale Fung, William Byrne
2001	Modeling the mixtures of known noise and unknown unexpected noise for robust speech recognition. Ji Ming, Peter Jancovic, Philip Hanna, Darryl Stewart
2001	Modelling care of articulation with HMMs is dangerous. Matthew P. Aylett
2001	Modelling fundamental frequency in first post-tonic syllables in danish sentences. Niels Reinholt Petersen
2001	Modelling the perceptual identification of Japanese consonants from LPC cepstral distances. Masahiko Komatsu, Shinichi Tokuma, Won Tokuma, Takayuki Arai
2001	Mokusei: a telephone-based Japanese conversational system in the weather domain. Mikio Nakano, Yasuhiro Minami, Stephanie Seneff, Timothy J. Hazen, D. Scott Cyphers, James R. Glass, Joseph Polifroni, Victor Zue
2001	Morphological approaches for an English pronunciation lexicon. Susan Fitt
2001	Multi-class composite n-gram language model using multiple word clusters and word successions. Shuntaro Isogai, Katsuhiko Shirai, Hirofumi Yamamoto, Yoshinori Sagisaka
2001	Multi-keyword spotting of telephone speech using orthogonal transform-based SBR and RNN prosodic model. Wern-Jun Wang, Chun-Jen Lee, Eng-Fong Huang, Sin-Horng Chen
2001	Multi-parser architecture for query processing. Kui Xu, Fuliang Weng, Helen M. Meng, Po-Chui Luk
2001	Multi-scale retrieval in MEI: an English-Chinese translingual speech retrieval system. Wai Kit Lo, Patrick Schone, Helen M. Meng
2001	Multi-stream statistical n-gram modeling with application to automatic language identification. Katrin Kirchhoff, Sonia Parandekar
2001	Multilingual TTS for computer telephony: the aculab approach. Alex I. C. Monaghan, Mahmoud Kassaei, Mark Luckin, Mariscela Amador-Hernandez, Andrew Lowry, Daniel Faulkner, Fred Sannier
2001	Multilingual text-to-phoneme mapping. Søren Kamaric Riis, Morten With Pedersen, Kåre Jean Jensen
2001	Multimedia data collection of in-car speech communication. Nobuo Kawaguchi, Shigeki Matsubara, Kazuya Takeda, Fumitada Itakura
2001	Multipass algorithm for acquisition of salient acoustic morphemes. Michael Levit, Allen L. Gorin, Jeremy H. Wright
2001	Multiple source separation in the frequency domain using negative beamforming. Pedro Gómez-Vilda, Agustín Álvarez-Marquina, Victor Nieto Lluis, María Victoria Rodellar Biarge, Rafael Martínez-Olalla
2001	Must diphone synthesis be so unnatural? William J. Barry, Claus Nielsen, Ove Andersen
2001	N-best list generation using word and phoneme recognition fusion. Ernest Pusateri, Jean-Manuel Van Thong
2001	N-best speech hypotheses reordering using linear regression. Ananlada Chotimongkol, Alexander I. Rudnicky
2001	Narrowband perceptual audio coding: enhancements for speech. Hossein Najaf-Zadeh, Peter Kabal
2001	Native vs non-native production of English vowels in spontaneous speech: an acoustic phonetic study. Kimiko Tsukada
2001	Natural language understanding using statistical machine translation. Klaus Macherey, Franz Josef Och, Hermann Ney
2001	Neural processes underlying perceptual learning of a difficult second language phonetic contrast. Daniel E. Callan, Keiichi Tajima, Akiko E. Callan, Reiko Akahane-Yamada, Shinobu Masaki
2001	New language models using phrase structures extracted from parse trees. Takatoshi Jitsuhiro, Hirofumi Yamamoto, Setsuo Yamada, Yoshinori Sagisaka
2001	Noise estimation without explicit speech, non-speech detection: a comparison of mean, modal and median based approaches. Nicholas W. D. Evans, John S. D. Mason
2001	Noise reduction for noise robust feature extraction for distributed speech recognition. Bernhard Noé, Jürgen Sienel, Denis Jouvet, Laurent Mauuary, Johan de Veth, Lou Boves, Febe de Wet
2001	Noise reduction using paired-microphones for both far-field and near-field sound sources. Mitsunori Mizumachi, Satoshi Nakamura
2001	Noise robust feature extraction for ASR using the Aurora 2 database. Qifeng Zhu, Markus Iseli, Xiaodong Cui, Abeer Alwan
2001	Non-finality and pre-finality in bari Italian intonation: a preliminary account. Michelina Savino
2001	Non-linear predictive vector quantization of speech. Marcos Faúndez-Zanuy
2001	OASIS natural language call steering trial. Peter J. Durston, Mark Farrell, David Attwater, James Allen, Hong-Kwang Jeff Kuo, Mohamed Afify, Eric Fosler-Lussier, Chin-Hui Lee
2001	Objective evaluation of methods for quantization of variable-dimension spectral vectors in WI speech coding. Jani Nurminen, Ari Heikkinen, Jukka Saarinen
2001	Observations on overlap: findings and implications for automatic processing of multi-party conversation. Elizabeth Shriberg, Andreas Stolcke, Don Baron
2001	Off-talk - a problem for human-machine-interaction? Daniela Oppermann, Florian Schiel, Silke Steininger, Nicole Beringer
2001	On combining confidence measures for improved rejection of incorrect data. Delphine Charlet, Guy Mercier, Denis Jouvet
2001	On differential limen of word-based local speechrate variation in Japanese expressed by duration ratio. Makoto Hiroshige, Kenji Araki, Koji Tochinai
2001	On integrating the lexicon with the language model. Diamantino Caseiro, Isabel Trancoso
2001	On large vocabulary continuous speech recognition of highly inflectional language - czech. Pavel Ircing, Pavel Krbec, Jan Hajic, Josef Psutka, Sanjeev Khudanpur, Frederick Jelinek, William Byrne
2001	On the choice of classes in MCE based discriminative HMM-training for speech recognizers used in the telephone environment. Josef G. Bauer
2001	On the perception of voicing for plosives in noise. Marcia Chen, Abeer Alwan
2001	On the pronunciation of acronyms in French and in Italian. Philippe Boula de Mareüil, Franck Floricic
2001	On the prosody of German telephone numbers. Stefan Baumann, Jürgen Trouvain
2001	On the use of the Bayesian information criterion in multiple speaker detection. P. Sivakumaran, J. Fortuna, Aladdin M. Ariyaeeinia
2001	One-delayed-mass model for efficient synthesis of glottal flow. Federico Avanzini, Paavo Alku, Matti Karjalainen
2001	Pause information for dependency analysis of read Japanese sentences. Kazuyuki Takagi, Kazuhiko Ozeki
2001	Perceived prominence in terms of a linguistically motivated quantitative intonation model. Hansjörg Mixdorff, Christina Widera
2001	Perception of coda voicing from properties of the onset and nucleus of 'led' and 'let'. Sarah Hawkins, Noël Nguyen
2001	Perceptual categorization of maximal vowel spaces from birth to adulthood simulated by an articulatory model. Lucie Ménard, Louis-Jean Boë
2001	Perceptual cost functions for unit searching in large corpus-based text-to-speech. Minkyu Lee
2001	Perceptual experiments on enhanced and slowed down speech sentences for second language acquisition. Vincent Colotte, Yves Laprie, Anne Bonneau
2001	Perceptual identification and normalization of synthesized French vowels from birth to adulthood. Lucie Ménard, Jean-Luc Schwartz, Louis-Jean Boë, Sonia Kandel, Nathalie Vallée
2001	Phoneme-based topic spotting on the switchboard corpus. M. W. Theunissen, Konrad Scheffler, Johan A. du Preez
2001	Phonetic effects on listener detection of vowel concatenation. Ann K. Syrdal
2001	Phonetic events from the labeling the european portuguese database for speech synthesis, FEUP/IPBDB. João Paulo Ramos Teixeira, Diamantino Freitas, Daniela Braga, Maria João Barros, Vagner Latsch
2001	Phonetic speaker recognition. Walter D. Andrews, Mary A. Kohler, Joseph P. Campbell
2001	Phonetic transcriptions in the spoken dutch corpus: how to combine efficiency and good transcription quality. Catia Cucchiarini, Diana Binnenpoorte, Simo M. A. Goddijn
2001	Pitch-dependent GMMs for text-independent speaker recognition systems. Mijail Arcienega, Andrzej Drygajlo
2001	Planar superdirective microphone arrays for speech acquisition in the car. Rainer Martin, Alexey Petrovsky, Thomas Lotter
2001	Plosive spotting with margin classifiers. Joseph Keshet, Dan Chazan, Ben-Zion Bobrovsky
2001	Politeness and frustration language in child-machine interactions. Sudha Arunachalam, Dylan Gould, Elaine Andersen, Dani Byrd, Shrikanth S. Narayanan
2001	Pragmatic temporal voice range profile as a tool in the research of speech styles. Antti Iivonen
2001	Pre-liquid excrescent schwa: what happens when vocalic targets conflict. Bryan Gick, Ian Wilson
2001	Predicting visual consonant perception from physical measures. Jintao Jiang, Abeer Alwan, Edward T. Auer, Lynne E. Bernstein
2001	Prediction of intonation patterns of accented words in a corpus of read Swedish news through pitch contour stylization. Johan Frid
2001	Prediction of low recognition rate words for isolated word recognition system. Ryuta Terashima, Hiroyuki Hoshino, Toshihiro Wakita
2001	Preliminary experiments on language identification using broadcast news recordings. Laurent Benarousse, Edouard Geoffrois
2001	Probabilistic concept verification for language understanding in spoken dialogue systems. Yi-Chung Lin, Huei-Ming Wang
2001	Prominence correlates. a study of Swedish. Gunnar Fant, Anita Kruckenberg, Johan Liljencrants, Antonis Botinis
2001	Pronunciation modeling and lexical adaptation in midsize vocabulary ASR. Louis ten Bosch, Nick Cremelie
2001	Pronunciation modeling in hungarian number recognition. Tibor Fegyó, Péter Mihajlik, Péter Tatai, Géza Gordos
2001	Pronunciation variant analysis using speaking style parallel corpus. Hideharu Nakajima, Izumi Hirano, Yoshinori Sagisaka, Katsuhiko Shirai
2001	Pronunciation variation analysis with respect to various linguistic levels and contextual conditions for Mandarin Chinese. Ming-Yi Tsai, Fu-Chiang Chou, Lin-Shan Lee
2001	Prosodic interactions on segmental durations ingreek. Antonis Botinis, Marios Fourakis, Robert Bannert
2001	Prosodic models, automatic speech understanding, and speech synthesis: towards the common ground. Anton Batliner, Bernd Möbius, Gregor Möhler, Antje Schweitzer, Elmar Nöth
2001	Prosody control for speaking and singing styles. Chilin Shih, Greg Kochanski
2001	Prosody in finger braille and teletext receiver for finger braille. Yasuo Horiuchi, Akira Ichikawa
2001	Prototype of a vocal-tract model for vowel production designed for education in speech science. Takayuki Arai, Nobuyuki Usuki, Yuji Murahara
2001	Pruning of redundant synthesis instances based on weighted vector quantization. Sanghun Kim, Youngjik Lee, Keikichi Hirose
2001	Pseudo-articulatory representations and the recognition of syllable patterns in speech. William H. Edmondson, Li Zhang
2001	Quantile based histogram equalization for noise robust speech recognition. Florian Hilger, Hermann Ney
2001	Quantitative analysis of the effects of emphasis upon prosodic features of speech. Sumio Ohno, Hiroya Fujisaki
2001	Quantization-based language model compression. Edward W. D. Whittaker, Bhiksha Raj
2001	Rapid CODEC adaptation for cellular phone speech recognition. Masaki Naito, Shingo Kuroiwa, Tsuneo Kato, Tohru Shimizu, Norio Higuchi
2001	Rapid speaker adaptation using MLLR and subspace regression classes. Kwok-Man Wong, Brian Kan-Wing Mak
2001	Rapid vocal tract length normalization using maximum likelihood estimation. Tadashi Emori, Koichi Shinoda
2001	Real-time multilingual communication by means of prestored conversational units. Norman Alm, Mamoru Iwabuchi, Peter N. Andreasen, Kenryu Nakamura, Iain R. Murray
2001	Real-time multiple speaker tracking by multi-modal integration for mobile robots. Kazuhiro Nakadai, Ken-ichi Hidai, Hiroshi G. Okuno, Hiroaki Kitano
2001	Real-time sound source localization and separation system and its application to automatic speech recognition. Futoshi Asano, Masataka Goto, Katunobu Itou, Hideki Asoh
2001	Recent advances in speech recognition system for IBM DARPA communicator. Yuqing Gao, Hakan Erdogan, Yongxin Li, Vaibhava Goel, Michael Picheny
2001	Recognition of (almost) spoken words: evidence from word play in Japanese. Takashi Otake, Anne Cutler
2001	Recognition of slovenian speech: within and cross-language experiments on monophones using the speechdat(II). Andrej Iskra, Bojan Petek, Tom Brøndsted
2001	Recognition of spelled city names in automotive environments. Andreas Korthauer
2001	Recognition performance of the siemens front-end with and without frame dropping on the Aurora 2 database. Bernt Andrassy, Damjan Vlaj, Christophe Beaugeant
2001	Reconstructing dialogue history. Marc Swerts, Emiel Krahmer
2001	Reducing spectral mismatches in concatenative speech synthesis via systematic database enrichment. Maria Founda, George Tambouratzis, Aimilios Chalamandaris, George Carayannis
2001	Reduction of alternative pronunciations in the norwegian computational lexicon norkompleks. Torbjørn Nordgård, Arne Kjell Foldvik
2001	Relating frame accuracy with word error in hybrid ANN-HMM ASR. Michael L. Shire
2001	Relating phonepass scores overall scores to the council of europe framework level descriptors. John H. A. L. de Jong, Jared Bernstein
2001	Relations between vocal registers in voice breaks. Gerrit Bloothooft, Mieke van Wijck, Peter Pabon
2001	Representation of large lexica using finite-state transducers for the multilingual text-to-speech synthesis systems. Matej Rojc, Zdravko Kacic
2001	Resource-limited sentence boundary detection. David Carter, Ian Gransden
2001	Robust ASR based on clean speech models: an evaluation of missing data techniques for connected digit recognition in noise. Jon Barker, Martin Cooke, Phil D. Green
2001	Robust ASR front-end using spectral-based and discriminant features: experiments on the Aurora tasks. M. Carmen Benítez, Lukás Burget, Barry Y. Chen, Stéphane Dupont, Harinath Garudadri, Hynek Hermansky, Pratibha Jain, Sachin S. Kajarekar, Nelson Morgan, Sunil Sivadas
2001	Robust LP analysis using glottal source HMM with application to high-pitched and noise corrupted speech. Akira Sasou, Kazuyo Tanaka
2001	Robust automatic speech recognition in low-SNR car environments by the application of a connectionist subspace-based approach to the melbased cepstral coefficients. Sid-Ahmed Selouani, Hesham Tolba, Douglas D. O'Shaughnessy
2001	Robust digit recognition in noise: an evaluation using the AURORA corpus. Umit H. Yapanel, John H. L. Hansen, Ruhi Sarikaya, Bryan L. Pellom
2001	Robust digit recognition in noisy environments: the IBM Aurora 2 system. George Saon, Juan M. Huerta, Ea-Ee Jan
2001	Robust language understanding in mipad. Ye-Yi Wang
2001	Robust parameters for speech recognition based on subband spectral centroid histograms. Bojana Gajic, Kuldip K. Paliwal
2001	Robust parsing in spoken dialogue systems. Pengju Yan, Fang Zheng, Mingxing Xu
2001	Robust speech recognition against packet loss. Man-Hung Siu, Yu-Chung Chan
2001	Robust speech recognition based on selective use of missing frequency band HMMs. Takayoshi Kawamura, Kazuya Takeda, Fumitada Itakura
2001	Robust speech recognition in noise: an evaluation using the SPINE corpus. John H. L. Hansen, Ruhi Sarikaya, Umit H. Yapanel, Bryan L. Pellom
2001	Robust speech recognition techniques applied to a speech in noise task. Richard C. Rose, Hong Kook Kim, Donald Hindle
2001	Robust speech recognition using missing feature theory and vector quantization. Philippe Renevey, Rolf Vetter, Jens Krauss
2001	Robust speech/non-speech detection using LDA applied to MFCC for continuous speech recognition. Arnaud Martin, Géraldine Damnati, Laurent Mauuary
2001	SCANMail: browsing and searching speech data by content. Julia Hirschberg, Michiel Bacchiani, Donald Hindle, Philip L. Isenhour, Aaron E. Rosenberg, Litza A. Stark, Larry Stead, Steve Whittaker, Gary Zamchick
2001	SIGdial - special interest group on discourse and dialogue. Laila Dybkjær
2001	SPeaker and language characterization (spLC): a special interest group (SIG) of ISCA. Jean-François Bonastre, Ivan Magrin-Chagnolleau, Stephan Euler, François Pellegrino, Régine André-Obrecht, John S. D. Mason, Frédéric Bimbot
2001	Scaled likelihood linear regression for hidden Markov model adaptation. Frank Wallhoff, Daniel Willett, Gerhard Rigoll
2001	Schwa-assimilation in danish synthetic speech. Christian Jensen
2001	Second order statistics spectrum estimation method for robust speech recognition. Bojan Jarc, Rudolf Babic
2001	Segment-based recognition on the phonebook task: initial results and observations on duration modeling. Karen Livescu, James R. Glass
2001	Segmental eigenvoice for rapid speaker adaptation. Yu Tsao, Shang-Ming Lee, Fu-Chiang Chou, Lin-Shan Lee
2001	Selective MCE training strategy in Mandarin speech recognition. Jian-Lai Zhou, Eric Chang, Chao Huang
2001	Semantic abnormality and its realization in spoken language. Shimei Pan, Kathleen R. McKeown, Julia Hirschberg
2001	Semi-automatic grammar induction for bi-directional English-Chinese machine translation. Kai-Chung Siu, Helen M. Meng
2001	Separating speaker and environment variabilities for improved recognition in non-stationary conditions. Luca Rigazio, Patrick Nguyen, David Kryze, Jean-Claude Junqua
2001	Separating three simultaneous speeches with two microphones by integrating auditory and visual processing. Hiroshi G. Okuno, Kazuhiro Nakadai, Tino Lourens, Hiroaki Kitano
2001	Separation and dereverberation performance of frequency domain blind source separation for speech in a reverberant environment. Ryo Mukai, Shoko Araki, Shoji Makino
2001	Sequential decisions for faster and more flexible verification. Arun C. Surendran
2001	Sequential noise compensation by a sequential kullback proximal algorithm. Kaisheng Yao, Kuldip K. Paliwal, Satoshi Nakamura
2001	Smartkom: multimodal communication with a life- like character. Wolfgang Wahlster, Norbert Reithinger, Anselm Blocher
2001	Smooth contour estimation in data-driven pitch modelling. Kim E. A. Silverman, Jerome R. Bellegarda, Kevin A. Lenzo
2001	Smoothing issues in the structured language model. Woosung Kim, Sanjeev Khudanpur, Jun Wu
2001	Social effects on vocal rate with echoic mimicry using prosody-only voice. Noriko Suzuki, Kazuhiko Kakehi, Yugo Takeuchi, Michio Okada
2001	Some practical considerations in the deployment of a wireless-communication interactive voice response system. Carmen García-Mateo, Laura Docío Fernández, Antonio Cardenal López
2001	Speaker adaptation in an ASR system based on nonlinear dynamical systems. Narada D. Warakagoda, Magne Hallstein Johnsen
2001	Speaker adaptation of output probabilities and state duration distributions for speech recognition. Néstor Becerra Yoma, Jorge F. Silva
2001	Speaker adaptation of quantized parameter HMMs. Marcel Vasilache, Olli Viikki
2001	Speaker identification for car infotainment applications. Javier Rodríguez Saeta, Christian Koechling, Javier Hernando
2001	Speaker normalization based on test to reference speaker mapping. Marcel Ogner, Zdravko Kacic
2001	Speaker recognition based on feature space trace. Yadong Wu, Zhizhu Li
2001	Speaker recognition based on idiolectal differences between speakers. George R. Doddington
2001	Speaker recognition by separating phonetic space and speaker space. Masafumi Nishida, Yasuo Ariki
2001	Speaker recognition in a multi-speaker environment. Alvin F. Martin, Mark A. Przybocki
2001	Speaker verification using target and background dependent linear transforms and multi-system fusion. Jirí Navrátil, Upendra V. Chaudhari, Ganesh N. Ramaswamy
2001	Speaking rate dependent acoustic modeling for spontaneous lecture speech recognition. Hiroaki Nanjo, Kazuomi Kato, Tatsuya Kawahara
2001	Speaking while driving - preliminary results on spellings in the German speechdat-car database. Christoph Draxler, Klaus Bengler, Cristina Olaverri-Monreal
2001	Spectral correlates of voice open quotient and glottal flow asymmetry : theory, limits and experimental data. Nathalie Henrich, Christophe d'Alessandro, Boris Doval
2001	Spectral tilt as a perturbation-free measurement of noise levels in voice signals. Peter J. Murphy
2001	Speech emotion recognition using hidden Markov models. Albino Nogueiras, Asunción Moreno, Antonio Bonafonte, José B. Mariño
2001	Speech enhanced remote control for media terminal. Aseel Ibrahim, Jonas Lundberg, Jenny Johansson
2001	Speech enhancement and source separation based on binaural negative beamforming. Agustín Álvarez-Marquina, Pedro Gómez-Vilda, Rafael Martínez-Olalla, Victor Nieto Lluis, María Victoria Rodellar Biarge
2001	Speech enhancement based on IMM with NPHMM. Yunjung Lee, Joohun Lee, Ki Yong Lee, Katsuhiko Shirai
2001	Speech lab in a box: a Mandarin speech toolbox to jumpstart speech related research. Eric Chang, Yu Shi, Jian-Lai Zhou, Chao Huang
2001	Speech quality measure for voIP using wavelet based bark coherence function. Sang-Wook Park, Young-Cheol Park, Dae Hee Youn
2001	Speech recognition at multiple sampling rates. Hans-Günter Hirsch, K. Hellwig, Stefan Dobler
2001	Speech recognition for huge vocabularies by using optimized sub-word units. Jan Kneissler, Dietrich Klakow
2001	Speech recognition of Japanese news commentary. Shinichi Homma, Akio Kobayashi, Shoei Sato, Toru Imai, Akio Ando
2001	Speech recognition of broadcast sports news. Atsushi Matsui, Hiroyuki Segi, Akio Kobayashi, Toru Imai, Akio Ando
2001	Speech recognition over netmeeting connections. Florian Metze, John W. McDonough, Hagen Soltau
2001	Speech recognition under musical environments using kalman filter and iterative MLLR adaptation. Masakiyo Fujimoto, Yasuo Ariki
2001	Speech synthesis development made easy: the bonn open synthesis system. Esther Klabbers, Karlheinz Stöber, Raymond N. J. Veldhuis, Petra Wagner, Stefan Breuer
2001	Speech translation for French in the NESPOLE! European project. Laurent Besacier, Hervé Blanchon, Yannick Fouquet, Jean-Philippe Guilbaud, Stéphane Helme, Sylviane Mazenot, Daniel Moraru, Dominique Vaufreydaz
2001	Speech/noise-dominant decision for speech enhancement. Sukhyun Yoon, Chang D. Yoo
2001	Speechbuilder: facilitating spoken dialogue system development. James R. Glass, Eugene Weinstein
2001	Speechdat-e: five eastern european speech databases for voice-operated teleservices completed. Henk van den Heuvel, Jérôme Boudy, Zsolt Bakcsi, Jan Cernocký, Valery Galunov, Julia Kochanina, Wojciech Majewski, Petr Pollák, Milan Rusko, Jerzy Sadowski, Piotr Staroniewicz, Herbert S. Tropf
2001	Split-band perceptual harmonic cepstral coefficients as acoustic features for speech recognition. Liang Gu, Kenneth Rose
2001	Spoken dialogue management as planning and acting under uncertainty. Bo Zhang, Qingsheng Cai, Jianfeng Mao, Eric Chang, Baining Guo
2001	Squared error as a measure of phase distortion. Harald Pobloth, W. Bastiaan Kleijn
2001	Statistical language model based on a hierarchical approach: MCnv. Imed Zitouni, Kamel Smaïli, Jean Paul Haton
2001	Statistical sound source identification in a real acoustic environment for robust speech recognition using a microphone array. Takanobu Nishiura, Satoshi Nakamura, Kiyohiro Shikano
2001	Stochastic F0 contour model based on the clustering of F0 shapes of a syntactic unit. Yoichi Yamashita, Tomoyoshi Ishida
2001	Stochastic finite state automata language model triggered by dialogue states. Yannick Estève, Frédéric Béchet, Alexis Nasr, Renato De Mori
2001	Structural learning of dynamic Bayesian networks in speech recognition. Murat Deviren, Khalid Daoudi
2001	Structured language model for class identification of out-of-vocabulary words arising from multiple wordclasses. Shigehiko Onishi, Hirofumi Yamamoto, Yoshinori Sagisaka
2001	Study and auto-detection of stress based on tonal pitch range in Mandarin. Xipeng Shen, Bo Xu
2001	Study on factors influencing durations of syllables in Mandarin. Min Chu, Yongqiang Feng
2001	Sub-band based additive noise removal for robust speech recognition. Jingdong Chen, Kuldip K. Paliwal, Satoshi Nakamura
2001	Subjective assessment of speech-system interface usability. Kate S. Hone, Robert Graham
2001	Support vector machine with dynamic time-alignment kernel for speech recognition. Hiroshi Shimodaira, Ken-ichi Noma, Mitsuru Nakai, Shigeki Sagayama
2001	Supporting the construction of a user model in speech-only interfaces by adding multi-modality. Jacques M. B. Terken, Saskia te Riele
2001	Syllable prominence: a matter of vocal effort, phonetic distinct-ness and top-down processing. Anders Eriksson, Gunilla C. Thunberg, Hartmut Traunmller
2001	Synthesizing intonation of standard arabic language. A. Zaki, A. Rajouani, Mohamed Najim
2001	Systematic F0 glitches around nasal-vowel transitions. Hideki Kawahara, Parham Zolfaghari
2001	TALKING FOREIGN - concatenative speech synthesis and the language barrier. Nick Campbell
2001	TclBLASR: an automatic speech recognition extension for tcl. Qiru Zhou, Jinsong Zheng, Chin-Hui Lee
2001	Techniques for high-quality ACELP coding of wideband speech. Bruno Bessette, Roch Lefebvre, Redwan Salami, Milan Jelinek, Janne Vainio, J. Rotola-Pukkila, Hannu Mikkola, Kari Järvinen
2001	Temporal decomposition: a promising approach to low rate wideband speech compression. Christian H. Ritz, Ian S. Burnett
2001	Testing the perceptual relevance of syntactic completion and melodic configuration for turn-taking in dutch. Johanneke Caspers
2001	Text-to-speech scripting interface for appropriate vocalisation of e-texts. Gerasimos Xydas, Georgios Kouroupetroglou
2001	Text-to-speech synthesis with arbitrary speaker's voice from average voice. Masatsune Tamura, Takashi Masuko, Keiichi Tokuda, Takao Kobayashi
2001	Thai grapheme-to-phoneme using probabilistic GLR parser. Pongthai Tarsaku, Virach Sornlertlamvanich, Rachod Thongprasirt
2001	The IFA corpus: a phonemically segmented dutch "open source" speech database. R. J. J. H. van Son, Diana Binnenpoorte, Henk van den Heuvel, Louis C. W. Pols
2001	The ISCA special interest group on speech synthesis. Nick Campbell, Wolfgang Hess, Bernd Möbius, Jan P. H. van Santen
2001	The WITAS multi-modal dialogue system I. Oliver Lemon, Anne Bracy, Alexander Gruenstein, Stanley Peters
2001	The development of a portuguese version of a media watch system. Rui Amaral, Thibault Langlois, Hugo Meinedo, João Paulo Neto, Nuno Souto, Isabel Trancoso
2001	The effect of pitch and lexical tone on different Mandarin speech recognition tasks. Yiu Wing Wong, Eric Chang
2001	The effect of time stress on automatic speech recognition accuracy when using second language. Fang Chen, Jonas Sääv
2001	The fundamental frequency of cough by autocorrelation analysis. Annemie Van Hirtum, Daniel Berckmans
2001	The generation of speech for a search guide. Nicholas J. Cook, Ian D. Benest
2001	The influence of vocal effort on human speaker identification. Douglas Brungart, Kimberly R. Scott, Brian D. Simpson
2001	The mvprotek : m-commerce voice verification system. Y. J. Kyung, J. O. Jung, S. M. Sohn, H. J. Chun, S. Y. Moon, M. H. Kim, W. H. Sull
2001	The nespole! voIP dialogue database. Susanne Burger, Laurent Besacier, Paolo Coletti, Florian Metze, Céline Morel
2001	The perceptual relevance of glottal-pulse parameter variations. Ralph van Dinther, Raymond N. J. Veldhuis, Armin Kohlrausch
2001	The relation between speech intelligibility and the complex modulation spectrum. Steven Greenberg, Takayuki Arai
2001	The relationship between intraoral air pressure and tongue/palate contact during the articulation of norwegian /t/ and /d/. Inger Moen, Hanne Gram Simonsen, Morten Huseby, John Grue
2001	The role of duration as a correlate of accent in lekeitio basque. Gorka Elordieta, José Ignacio Hualde
2001	The role of the palate in tongue kinematics: an experimental assessment in v sequences from EPG and EMMA data. Susanne Fuchs, Pascal Perrier, Christine Mooshammer
2001	The schwa in albanian. Theodor Granser, Sylvia Moosmller
2001	The speech synthesis environment and parametric modeling of coarticulation. Mikolaj Wypych
2001	The study of the effect of training set on statistical language modeling. Xipeng Shen, Bo Xu
2001	The technical processing in smartkom data collection: a case study. Ulrich Trk
2001	The u.s. speechdat-car data collection. Peter A. Heeman, David Cole, Andrew Cronk
2001	The use of fundamental frequency raising as a strategy for increasing vocal intensity in soft, normal, and loud phonation. Paavo Alku, Juha Vintturi, Erkki Vilkman
2001	The use of noisy frame elimination and frequency spectrum magnitude reduction in noise robust speech recognition. Damjan Vlaj, Zdravko Kacic, Bogomir Horvat
2001	The use of prosody in a combined system for punctuation generation and speech recognition. Ji-Hwan Kim, Philip C. Woodland
2001	Three-dimensional modelling of speech corpora: added value through visualisation. Toomas Altosaar, Matti Karjalainen, Martti Vainio
2001	Time and memory efficient viterbi decoding for LVCSR using a precompiled search network. Daniel Willett, Erik McDermott, Yasuhiro Minami, Shigeru Katagiri
2001	Timing and interaction of visual cues for prominence in audiovisual speech perception. David House, Jonas Beskow, Björn Granström
2001	Tonal alignment, scaling and slope in Italian question and statement tunes. Mariapaola D'Imperio
2001	Topic detection for language model adaptation of highly-inflected languages by using a fuzzy comparison function. Mirjam Sepesy Maucec, Zdravko Kacic
2001	Topic styles in IR and TDT: effect on system behavior. Martin Franz, J. Scott McCarley, Todd Ward, Wei-Jing Zhu
2001	Toward noise-tolerant acoustic models. Edmondo Trentin, Marco Gori
2001	Towards SMIL as a foundation for multimodal, multimedia applications. Jennifer L. Beckham, Giuseppe Di Fabbrizio, Nils Klarlund
2001	Towards a model of target oriented production of prosody. Grzegorz Dogil, Bernd Möbius
2001	Towards automatic transcription of spontaneous presentations. Takahiro Shinozaki, Chiori Hori, Sadaoki Furui
2001	Towards combining pitch and MFCC for speaker recognition systems. Hassan Ezzaidi, Jean Rouat, Douglas D. O'Shaughnessy
2001	Towards discriminative lexicon optimization. Hauke Schramm, Peter Beyerlein
2001	Towards the creation of acoustic models for stressed Japanese speech. Kozo Okuda, Tomoko Matsui, Satoshi Nakamura
2001	Training a sentence planner for spoken dialog: the impact of syntactic and planning features. Monica Rogati, Marilyn A. Walker, Owen Rambow
2001	Training prosodic phrasing rules for Chinese TTS systems. Weijun Chen, Fuzong Lin, Jianmin Li, Bo Zhang
2001	Transducer optimizations for tight-coupled decoding. Alexander Seward
2001	Transformation-based learning of danish stress assignment. Peter Juel Henrichsen
2001	Tree based score computation for speaker verification. Raphaël Blouet, Frédéric Bimbot
2001	Triggering individual word domains in n-gram language models. Elvira I. Sicilia-Garcia, Ji Ming, Francis Jack Smith
2001	Triphone tying techniques combining a-priori rules and data driven methods. Ute Ziegenhain, Josef G. Bauer
2001	Turkish word segmentation using morphological analyzer. M. Oguzhan Külekci, Mehmed Özkan
2001	Two features to check phonetic transcriptions in text to speech systems. Stefano Sandri, Enrico Zovato
2001	Two-stage probabilistic approach to text segmentation. Yi-Chia Chen, Yi-Chung Lin
2001	Unit selection for speech synthesis using splicing costs with weighted finite state transducers. Ivan Bulyko, Mari Ostendorf
2001	Universalizing speech: notes from the USI project. Stefanie Shriver, Roni Rosenfeld, Xiaojin Zhu, Arthur R. Toth, Alexander I. Rudnicky, Markus D. Flückiger
2001	Universities and industry: marriage or co-operation between independent partners? Ilkka Niiniluoto
2001	Unsupervised noisy environment adaptation algorithm using MLLR and speaker selection. Miichi Yamada, Akira Baba, Shinichi Yoshizawa, Yuichiro Mera, Akinobu Lee, Hiroshi Saruwatari, Kiyohiro Shikano
2001	Up to what level can acoustical and textual features predict prominence. Barbertje M. Streefkerk, Louis C. W. Pols, Louis ten Bosch
2001	Use of acoustic prior information for confidence measure in ASR applications. Erhan Mengusoglu, Christophe Ris
2001	Use of clustering information for coarticulation compensation in speech synthesis by word concatenation. Christos Vosnidis, Vassilios Digalakis
2001	Use of real and contaminated speech for training of a hands-free in-car speech recognizer. Marco Matassoni, Maurizio Omologo, Piergiorgio Svaizer
2001	Use of topic knowledge in spoken dialogue information retrieval system for academic documents. Shinya Kiriyama, Keikichi Hirose, Nobuaki Minematsu
2001	Using aerial and geometric features in automatic lip-reading. Jacek C. Wojdel, Léon J. M. Rothkrantz
2001	Using boosting and POS word graph tagging to improve speech recognition. Christer Samuelsson, James Hieronymus
2001	Using information retrieval methods for language model adaptation. Langzhou Chen, Jean-Luc Gauvain, Lori Lamel, Gilles Adda, Martine Adda-Decker
2001	Using linguopalatal contact patterns to tune a 3d tongue model. Olov Engwall
2001	Using machine learning techniques for grapheme to phoneme transcription. Franco Mana, Paolo Massimino, Alberto Pacchiotti
2001	Using real words for recording diphones. Susan Fitt
2001	Using spatial correlation information in speech recognition. Peng Yu, Zuoying Wang
2001	Using the modulation complex wavelet transform for feature extraction in automatic speech recognition. Yasunori Momomura, Kenji Okada, Takayuki Arai, Noboru Kanedera, Yuji Murahara
2001	Variable-length acoustic units inference for text-to-speech synthesis. Olivier Boëffard
2001	Variation in final lengthening as a function of topic structure. Caroline L. Smith, Lisa A. Hogan
2001	Viseme recognition using multiple feature matching. Islam Shdaifat, Rolf-Rainer Grigat, Stefan Lütgert
2001	Vocal tract normalization equals linear transformation in cepstral space. Michael Pitz, Sirko Molau, Ralf Schlüter, Hermann Ney
2001	Voice activity detection in noisy environments. Jan Stadermann, V. Stahl, G. Rose
2001	Voice transformations: from speech synthesis to mammalian vocalizations. Min Tang, Chao Wang, Stephanie Seneff
2001	Voice-IF: a mixed-initiative spoken dialogue system for AT&t conference services. Mazin G. Rahim, Giuseppe Di Fabbrizio, Candace A. Kamm, Marilyn A. Walker, A. Pokrovsky, P. Ruscitti, Esther Levin, Sungbok Lee, Ann K. Syrdal, K. Schlosser
2001	Vowel height is intimately associated with stress accent in spontaneous american English discourse. Leah Hitchcock, Steven Greenberg
2001	What is the best type of prior distribution for EMAP speaker adaptation? Patrick Kenny, Gilles Boulianne, Pierre Dumouchel
2001	Whispery voiced nasal stops in rwanda. Didier Demolin, Véronique Delvaux
2001	Why is automatic recognition of children's speech difficult? Qun Li, Martin J. Russell
2001	Wideband ACELP at 16 kb/s with multi-band excitation. Sílvia Pujalte, Asunción Moreno
2001	Wideband LSF quantization by generalized voronoi codes. Stéphane Ragot, Hassan Lahdili, Roch Lefebvre
2001	Wideband speech coding algorithm with application of discrete wavelet transform to upper band. Seung Won Lee, Keun-Sung Bae
2001	Word final aspiration as a phrase boundary cue: data from spontaneous Swedish discourse. Victoria Johansson, Merle Horne, Sven Strömqvist
2001	Word level confidence annotation using combinations of features. Rong Zhang, Alexander I. Rudnicky
2001	Word level confidence measures using n-best sub-hypotheses likelihood ratio. Beng Tiong Tan, Yong Gu, Trevor Thomas
2001	Word unit based multilingual comparative analysis of text corpora. Géza Németh, Csaba Zainkó
2001	Writing script-based dialogues for AAC. Iain R. Murray, John L. Arnott, Norman Alm, Richard Dye, Gillian Harper
2001	XISL: an attempt to separate multimodal interactions from XML contents. Tsuneo Nitta, Kouichi Katsurada, Hirobumi Yamada, Yusaku Nakamura, Satoshi Kobayashi