INTERSPEECH - RankMe

679 papers

Year	Title / Authors
2002	10 years of phondat-II: a reassessment. Hartmut R. Pfitzinger
2002	2-d processing of speech with application to pitch estimation. Thomas F. Quatieri
2002	7th International Conference on Spoken Language Processing, ICSLP2002 - INTERSPEECH 2002, Denver, Colorado, USA, September 16-20, 2002 John H. L. Hansen, Bryan L. Pellom
2002	A Gaussian selection method for multi-mixture HMM based continuous speech recognition. Raymond H. Lee, Eric H. C. Choi
2002	A case study of portuguese and English bilinguality. Luis M. T. Jesus, Christine H. Shadle
2002	A combined model of statics-dynamics of speech optimized using maximum mutual information. Zhijian Ou, Zuoying Wang
2002	A comparative study of adaptation methods for speaker verification. Johnny Mariéthoz, Samy Bengio
2002	A comparative study of approximations for parallel model combination of static and dynamic parameters. Yifan Gong
2002	A comparison between feedback strategies in human-to-human and human-machine communication. Loredana Cerrato
2002	A comparison of HTK, ISIP and julius in slovenian large vocabulary continuous speech recognition. Tomaz Rotovnik, Mirjam Sepesy Maucec, Bogomir Horvat, Zdravko Kacic
2002	A comparison of L1 and african-mother-tongue acoustic models for south african English speech recognition. Janus D. Brink, Elizabeth C. Botha
2002	A comparison of four language models for large vocabulary turkish speech recognition. Helin Dutagaci, Levent M. Arslan
2002	A comparison of front-end analyses for Thai speech recognition. Montri Karnjanadecha, Patimakorn Kimsawad
2002	A comparison of two LVR search optimization techniques. Stephan Kanthak, Hermann Ney, Michael Riley, Mehryar Mohri
2002	A confidence measure based on agreement among multiple LVCSR models - correlation between pair of acoustic models and confidence. Takehito Utsuro, Tetsuji Harada, Hiromitsu Nishizaki, Seiichi Nakagawa
2002	A context clustering technique for average voice model in HMM-based speech synthesis. Junichi Yamagishi, Masatsune Tamura, Takashi Masuko, Keiichi Tokuda, Takao Kobayashi
2002	A copy synthesis method to pilot the klatt synthesiser. Yves Laprie, Anne Bonneau
2002	A corpus-based study of danish laryngealization. Kathleen Murray, Betina Simonsen
2002	A data-driven approach to source-formant type text-to-speech system. Hiroki Mori, Takahiro Ohtsuka, Hideki Kasuya
2002	A distributed multimodal dialogue system based on dialogue system and web convergence. Feng Liu, Antoine Saad, Li Li, Wu Chou
2002	A figure of merit for the analysis of spoken dialog systems. Kadri Hacioglu, Wayne H. Ward
2002	A flexible stream architecture for ASR using articulatory features. Florian Metze, Alex Waibel
2002	A handset identifier using support vector machines. Purdy Ho
2002	A hybrid HMM/traps model for robust voice activity detection. Brian Kingsbury, Pratibha Jain, André Gustavo Adami
2002	A hybrid approach to compounds in LVCSR. Tom Laureys, Vincent Vandeghinste, Jacques Duchateau
2002	A link between cepstral shrinking and the weighted product rule in audio-visual speech recognition. Simon Lucey, Sridha Sridharan, Vinod Chandran
2002	A low-resource, miniature implementation of the ETSI distributed speech recognition front-end. Etienne Cornu, Hamid Sheikhzadeh, Robert L. Brennan
2002	A maximum entropy semantic parser using word classes. Norbert Pfannerer
2002	A method for evaluating incremental utterance understanding in spoken dialogue systems. Ryuichiro Higashinaka, Noboru Miyazaki, Mikio Nakano, Kiyoaki Aikawa
2002	A miniature Chinese TTS system based on tailored corpus. Zhiwei Shuang, Yu Hu, Zhen-Hua Ling, Ren-Hua Wang
2002	A modality-independent MMI system architecture. Kouichi Katsurada, Yoshihiko Ootani, Yusaku Nakamura, Satoshi Kobayashi, Hirobumi Yamada, Tsuneo Nitta
2002	A multi-class approach for modelling out-of-vocabulary words. Issam Bazzi, James R. Glass
2002	A new approach to speech enhancement by a microphone array using EM and mixture models. Hagai Attias, Li Deng
2002	A new computer-based analytical speech perception test for prelingually deaf children and children with speech disorders. Anne-Marie Öster
2002	A new lexicon optimization method for LVCSR based on linguistic and acoustic characteristics of words. Takahiro Shinozaki, Sadaoki Furui
2002	A new method for testing dialogue systems based on simulations of real-world conditions. Ramón López-Cózar, Ángel de la Torre, José C. Segura, Antonio J. Rubio, Juan M. López-Soler
2002	A new method of building decision tree based on target information. Yi-Jian Wu, Yu Hu, Xiaoru Wu, Ren-Hua Wang
2002	A perceptually motivated subspace approach for speech enhancement. Yi Hu, Philipos C. Loizou
2002	A phoneme recognizer for the hearing impaired. Mathias Johansson, Mats Blomberg, Kjell Elenius, Lars-Erik Hoffsten, Anders Torberger
2002	A phonetic study of vietnamese tones: acoustic and electroglottographic measurements. Vu Ngoc Tuan, Christophe d'Alessandro, Sophie Rosset
2002	A portable, server-side dialog framework for voiceXML. Bob Carpenter, Sasha Caskey, Krishna Dayanidhi, Caroline Drouin, Roberto Pieraccini
2002	A pragmatic confirmation mechanism for an object-based spoken dialogue manager. Ian M. O'Neill, Michael F. McTear
2002	A psychoacoustic basis for spectral sharpening. Peggy B. Nelson, Jeffrey J. DiGiovanni, Robert S. Schlauch
2002	A real-time acoustic human-machine front-end for multimedia applications integrating robust adaptive beamforming and stereophonic acoustic echo cancellation. Wolfgang Herbordt, J. Ying, Herbert Buchner, Walter Kellermann
2002	A reverse turing test using speech. Greg Kochanski, Daniel P. Lopresti, Chilin Shih
2002	A sound source classification system based on subband processing. Oytun Türk, Ömer Sayli, Helin Dutagaci, Levent M. Arslan
2002	A sparse modeling approach to speech recognition based on relevance vector machines. Jonathan E. Hamaker, Joseph Picone, Aravind Ganapathiraju
2002	A spatio-temporal speech enhancement scheme for robust speech recognition. Erik M. Visser, Manabu Otsuka, Te-Won Lee
2002	A state-tying approach to building syllable HMMs. Darryl Stewart, Ming Ji, Philip Hanna, Francis Jack Smith
2002	A statistically motivated database pruning technique for unit selection synthesis. Peter Rutten, Matthew P. Aylett, Justin Fackrell, Paul Taylor
2002	A study of multi-speaker dialogue system for mobile information retrieval. Hsien-Chang Wang, Chieh-Yi Huang, Chung-Hsien Yang, Jhing-Fa Wang
2002	A study of the two-mass model in terms of acoustic parameters. Denisse Sciamarella, Christophe d'Alessandro
2002	A study on the classification of whispered and normally phonated speech. Stanley J. Wenndt, Edward J. Cupples, Richard M. Floyd
2002	A system that learns to describe objects in visual scenes. Deb Roy
2002	A text-to-speech synthesis system for telugu. Jithendra Vepa, Jahnavi Ayachitam, K. V. K. Kalpana Reddy
2002	A trainable spoken language understanding system for visual object selection. Deb Roy, Peter Gorniak, Niloy Mukherjee, Joshua Juster
2002	A training prompts generation algorithm for connected spoken word recognition. Ha-Jin Yu, Jin Suk Kim
2002	ACIMET: access to meteorological information by telephone. Jaume Padrell, Javier Hernando
2002	ACT: a graphical dialogue annotation comparison tool. Fan Yang, Susan E. Strayer, Peter A. Heeman
2002	ASR dependent techniques for speaker identification. Alex Park, Timothy J. Hazen
2002	ASR in a human word recognition model: generating phonemic input for shortlist. Odette Scharenborg, Lou Boves, Johan de Veth
2002	AT&t help desk. Giuseppe Di Fabbrizio, Dawn Dutton, Narendra K. Gupta, Barbara Hollister, Mazin G. Rahim, Giuseppe Riccardi, Robert E. Schapire, Juergen Schroeter
2002	Absolute pitch and lexical tones: tone perception by non-musician, musician, and absolute pitch non-tonal language speakers. Denis K. Burnham, Ron Brooker
2002	Access to homophonic meanings during spoken language comprehension: effects of context and neighborhood density. Michael C. W. Yip
2002	Accounting for perceptual identification of consonants and vowels through acoustic dissimilarity. Jianxia Xue, Sumiko Takayanagi, Lynne E. Bernstein
2002	Accumulated kullback divergence for analysis of ASR performance in the presence of noise. Febe de Wet, Johan de Veth, Bert Cranen, Lou Boves
2002	Acoustic and word lattice based algorithms for confidence scores. Daniele Falavigna, Roberto Gretter, Giuseppe Riccardi
2002	Acoustic correlates of task load and stress. Klaus R. Scherer, Didier Grandjean, Tom Johnstone, Gudrun Klasmeyer, Thomas Bänziger
2002	Acoustic echo cancellation based on m-channel IIR cosine-modulated filter bank. Sang-Gyun Kim, Chang D. Yoo
2002	Acoustic measures vs. phonetic features as predictors of audible discontinuity in concatenative speech synthesis. Hisashi Kawai, Minoru Tsuzaki
2002	Acoustic modeling of sentence stress using differential features between syllables for English rhythm learning system development. Nobuaki Minematsu, Satoshi Kobashikawa, Keikichi Hirose, Donna Erickson
2002	Acoustic-to-articulatory inverse mapping using an HMM-based speech production model. Sadao Hiroya, Masaaki Honda
2002	Acoustical correlates to SD ratings of speaker characteristics in two speaking styles. Yasuki Yamashita, Hiroshi Matsumoto
2002	Active speech cancellation for cellular speech. Kazuhiro Kondo, Kiyoshi Nakagawa
2002	Adaptation of users² spoken dialogue patterns in a conversational interface. Courtney Darves, Sharon L. Oviatt
2002	Adaptive estimation of time-varying features from high-pitched speech based on an excitation source HMM. Akira Sasou, Kazuyo Tanaka
2002	Adaptive model combination for dynamic speaker selection training. Chao Huang, Tao Chen, Eric Chang
2002	Adding intelligent help to mixed-initiative spoken dialogue systems. Genevieve Gorrell, Ian Lewin, Manny Rayner
2002	Algorithms for distributed speech recognition in a noisy automobile environment. Hong Kook Kim, Richard C. Rose
2002	All-pole modeling of wide-band speech using weighted sum of the LSP polynomials. Paavo Alku, Tom Bäckström
2002	Amplitude convergence in children²s conversational speech with animated personas. Rachel Coulston, Sharon L. Oviatt, Courtney Darves
2002	An EPG therapy protocol for remediation and assessment of articulation disorders. Alan Wrench, Fiona Gibbon, Alison M. McNeill, Sara Wood
2002	An IPA vowel diagram approach to analysing L1 effects on vowel production and perception. Olga I. Dioubina, Hartmut R. Pfitzinger
2002	An acoustic comparison between american English and australian English vowels. Kimiko Tsukada
2002	An adaptive speaker verification system with speaker dependent a priori decision thresholds. Nikki Mirghafori, Larry P. Heck
2002	An analysis of the causes of increased error rates in children²s speech recognition. Qun Li, Martin J. Russell
2002	An analysis of transcription consistency in spontaneous speech from the buckeye corpus. William D. Raymond, Mark A. Pitt, Keith Johnson, Elizabeth Hume, Matthew J. Makashay, Robin Dautricourt, Craig Hilts
2002	An architecture for a multi-modal web browser. Cristiana Armaroli, Ivano Azzini, Lorenza Ferrario, Toni Giorgino, Luca Nardelli, Marco Orlandi, Carla Rognoni
2002	An audio-visual corpus for multimodal speech recognition in dutch language. Jacek C. Wojdel, Pascal Wiggers, Léon J. M. Rothkrantz
2002	An automatic sentence boundary detector based on a structured language model. Shinsuke Mori
2002	An education software in teaching automatic speech recognition (ASR). Hong Kai Sze, Sh-Hussain Salleh
2002	An effect of amplitude modulation on perceptual segregation of tone sequences. Mamoru Iwaki, Hiromi Seki
2002	An effective unsupervised scheme for multiple-speaker-change detection. P. Sivakumaran, Aladdin M. Ariyaeeinia, J. Fortuna
2002	An efficient algorithm for the n-best-strings problem. Mehryar Mohri, Michael Riley
2002	An efficient dialogue control method using decision tree-based estimation of out-of-vocabulary word attributes. Yasuhiro Takahashi, Kohji Dohsaka, Kiyoaki Aikawa
2002	An environment compensated minimum classification error training approach and its evaluation on Aurora2 database. Jian Wu, Qiang Huo
2002	An evaluation of using mutual information for selection of acoustic-features representation of phonemes for speech recognition. Mohamed Kamal Omar, Ken Chen, Mark Hasegawa-Johnson, Yigal Brandman
2002	Analysis and synthesis of the phonatory excitation signal by means of a pair of polynomial shaping functions. Jean Schoentgen
2002	Analysis of user behavior under error conditions in spoken dialogs. Jongho Shin, Shrikanth S. Narayanan, Laurie Gerber, Abe Kazemzadeh, Dani Byrd
2002	Application of microprosody models in text to speech synthesis. Phuay Hui Low, Saeed Vaseghi
2002	Application of over-complete blind source separation for robust automatic speech recognition. Shubha Kadambe
2002	Application of real-time AMDF pitch-detection in a voice gender normalisation system. E. Jung, A. Th. Schwarzbacher, K. Humphreys, Robert Lawlor
2002	Application of the lee silverman voice treatment (LSVT) to individuals with multiple sclerosis, ataxic dysarthria, and stroke. Leslie Will, Lorraine O. Ramig, Jennifer L. Spielman
2002	Applying a hybrid intonation model to a seamless speech synthesizer. Takashi Saito, Masaharu Sakamoto
2002	Applying fallback to prosodic unit selection from a small imitation database. Joram Meron
2002	Approaches to language identification using Gaussian mixture models and shifted delta cepstral features. Pedro A. Torres-Carrasquillo, Elliot Singer, Mary A. Kohler, Richard J. Greene, Douglas A. Reynolds, John R. Deller Jr.
2002	Arc minimization in finite state decoding graphs with cross-word acoustic context. Geoffrey Zweig, George Saon, François Yvon
2002	Assessment of consonant articulation in glossectomee speech by dynamic MRI. Katalin Mády, Robert Sader, Alexander Zimmermann, Philip Hoole, Ambros Beer, Hans-Florian Zeilhofer, Ch. Hannig
2002	Audio-visual continuous speech recognition using a coupled hidden Markov model. Xiaoxing Liu, Yibao Zhao, Xiaobo Pi, Luhong Liang, Ara V. Nefian
2002	Audio-visual scene analysis: evidence for a "very-early" integration process in audio-visual speech perception. Jean-Luc Schwartz, Frédéric Berthommier, Christophe Savariaux
2002	Audio-visual speech enhancement with AVCDCN (audio-visual codebook dependent cepstral normalization). Sabine Deligne, Gerasimos Potamianos, Chalapathy Neti
2002	Audio-visual speech sources separation: a new approach exploiting the audio-visual coherence of speech stimuli. David Sodoyer, Laurent Girin, Christian Jutten, Jean-Luc Schwartz
2002	Audiovisual integration of speech by children and adults with cochlear implants. Karen Iler Kirk, David B. Pisoni, Lorin Lachs
2002	Audiovisual perception in L2 learners. Valérie Hazan, Anke Sennema, Andrew Faulkner
2002	Audiovisual speech synthesis. from ground truth to models. Gérard Bailly
2002	Auditory fovea based speech enhancement and its application to human-robot dialog system. Kazuhiro Nakadai, Hiroshi G. Okuno, Hiroaki Kitano
2002	Auditory-visual speech perception examined by brain imaging and reaction time. Kaoru Sekiyama, Yoichi Sugita
2002	Automatic concept identification in goal-oriented conversations. Ananlada Chotimongkol, Alexander I. Rudnicky
2002	Automatic enrollment for speaker authentication. Qi Li, Hui Jiang, Qiru Zhou, Jinsong Zheng
2002	Automatic extraction of model parameters from fundamental frequency contours of English utterances. Shuichi Narusawa, Nobuaki Minematsu, Keikichi Hirose, Hiroya Fujisaki
2002	Automatic generation of phonetic transcriptions for large speech corpora. Kris Demuynck, Tom Laureys, Steven Gillis
2002	Automatic intelligibility assessment and diagnosis of critical pronunciation errors for computer-assisted pronunciation learning. Antoine Raux, Tatsuya Kawahara
2002	Automatic language identification using acoustic sub-word units. A. K. V. Sai Jayram, V. Ramasubramanian, T. V. Sreenivas
2002	Automatic phoneme alignment based on acoustic-phonetic modeling. John-Paul Hosom
2002	Automatic prosodic break labeling for Mandarin Chinese speech data. Minghui Dong, Kim-Teng Lua
2002	Automatic punctuation and disfluency detection in multi-party meetings using prosodic and lexical cues. Don Baron, Elizabeth Shriberg, Andreas Stolcke
2002	Automatic recognition of dutch dysarthric speech: a pilot study. Eric Sanders, Marina B. Ruiter, Lilian Beijer, Helmer Strik
2002	Automatic segmentation combining an HMM-based approach and spectral boundary correction. Yeon-Jun Kim, Alistair Conkie
2002	Automatic sign translation. Ying Zhang, Bing Zhao, Jie Yang, Alex Waibel
2002	Automatic transcription of courtroom speech. Rohit Prasad, Long Nguyen, Richard M. Schwartz, John Makhoul
2002	Automatic user-adaptive speaking rate selection for information delivery. Nigel Ward, Satoshi Nakagawa
2002	Auxiliary variables in conditional Gaussian mixtures for automatic speech recognition. Todd A. Stephenson, Mathew Magimai-Doss, Hervé Bourlard
2002	Backoff hierarchical class n-gram language modelling for automatic speech recognition systems. Imed Zitouni, Olivier Siohan, Hong-Kwang Jeff Kuo, Chin-Hui Lee
2002	Baldini: baldi speaks italian! Piero Cosi, Michael M. Cohen, Dominic W. Massaro
2002	Bark resolution from speech data. Naren Malayath, Hynek Hermansky
2002	Basque intonation modelling for text to speech conversion. Eva Navas, Inmaculada Hernáez, Juan María Sánchez
2002	Basurde[lite], a machine-driven dialogue system for accessing railway timetable information. Roger Trias-Sanz, José B. Mariño
2002	Belief network based disambiguation of object reference in spoken dialogue system for robot. Yoko Yamakata, Tatsuya Kawahara, Hiroshi G. Okuno
2002	Bell labs approach to Aurora evaluation on connected digit recognition. Jingdong Chen, Dimitris Dimitriadis, Hui Jiang, Qi Li, Tor André Myrvoll, Olivier Siohan, Frank K. Soong
2002	Benefit and cost analysis of using the improved vector quantizer design algorithm for glottal source waveform compression. Peter Veprek, Alan B. Bradley
2002	Bilingual corpus cleaning focusing on translation literality. Kenji Imamura, Eiichiro Sumita
2002	Blind normalization of speech from different channels and speakers. David N. Levin
2002	Bridges: regions between discourse segments. Nanette Veilleux
2002	Building an ASR system for noisy environments: SRI's 2001 SPINE evaluation system. Venkata Ramana Rao Gadde, Andreas Stolcke, Dimitra Vergyri, Jing Zheng, M. Kemal Sönmez, Anand Venkataraman
2002	Building voiceXML-based applications. Christina L. Bennett, Ariadna Font Llitjós, Stefanie Shriver, Alexander I. Rudnicky, Alan W. Black
2002	CU VOCAL: corpus-based syllable concatenation for Chinese speech synthesis across domains and dialects. Helen M. Meng, Chi-Kin Keung, Kai-Chung Siu, Tien Ying Fung, P. C. Ching
2002	CU animate tools for enabling conversations with animated characters. Jiyong Ma, Jie Yan, Ronald A. Cole
2002	Can confidence scores help users post-editing speech recognizer output? Taku Endo, Nigel Ward, Minoru Terada
2002	Channel error protection scheme for distributed speech recognition. Zheng-Hua Tan, Paul Dalsgaard
2002	Channel noise robustness for low-bitrate remote speech recognition. Alexis Bernard, Abeer Alwan
2002	Characteristics of a low reject mode speaker verification system. Daniel Elenius, Mats Blomberg
2002	Chinese spoken language analyzing based on combination of statistical and rule methods. Guodong Xie, Chengqing Zong, Bo Xu
2002	Choosing speech or touchtone modality for navigation within a telephony natural language system. Jennifer C. Lai, Kwan Min Lee
2002	Classification error from the theoretical Bayes classification risk. Erik McDermott, Shigeru Katagiri
2002	Cluster identification for speaker-environment tracking. J. T. Wickramaratna, Philip C. Woodland
2002	Clustering and feature learning based F0 prediction for Chinese speech synthesis. Jianhua Tao, Lianhong Cai
2002	Codebook dependent dynamic channel estimation for Mandarin speech recognition over telephone. Huayun Zhang, Zhaobing Han, Bo Xu
2002	Coding speech at very low rates using straight and temporal decomposition. Phu Chien Nguyen, Takao Ochi, Masato Akagi
2002	Collecting mobile multimodal data for match. Patrick Ehlen, Michael Johnston, Gunaranjan Vasireddy
2002	Combination of pause and F0 information in dependency analysis of Japanese sentences. Kazuyuki Takagi, Hajime Kubota, Kazuhiko Ozeki
2002	Combination of statistical and rule-based approaches for spoken language understanding. Ye-Yi Wang, Alex Acero, Ciprian Chelba, Brendan J. Frey, Leon Wong
2002	Combined binary classifiers with applications to speech recognition. Aldebaro Klautau, Nikola Jevtic, Alon Orlitsky
2002	Combined prosody and candidate unit selections for corpus-based text-to-speech systems. Francisco Campillo Díaz, Eduardo Rodríguez Banga
2002	Combining a Gaussian mixture model front end with MFCC parameters. Matthew N. Stuttle, Mark J. F. Gales
2002	Combining acoustic and language information for emotion recognition. Chul Min Lee, Shrikanth S. Narayanan, Roberto Pieraccini
2002	Combining information sources for memory-based pitch accent placement. Erwin Marsi, Bertjan Busser, Walter Daelemans, Véronique Hoste, Martin Reynaert, Antal van den Bosch
2002	Combining lexical and morphological knowledge in language model for inflectional (czech) language. Jan Nouza, Jindra Drabkova
2002	Combining maximum likelihood and maximum a posteriori estimation for detailed acoustic modeling of context dependency. Michiel Bacchiani
2002	Combining search spaces of heterogeneous recognizers for improved speech recogniton. Xiang Li, Rita Singh, Richard M. Stern
2002	Combining speaker and speech recognition systems. Larry P. Heck, Dominique Genoud
2002	Comfort noise detection and GSM-FR-codec detection for speech-quality evaluations in telephone networks. Thorsten Ludwig
2002	Compact subnetwork-based large vocabulary continuous speech recognition. Dong-Hoon Ahn, Minhwa Chung
2002	Comparative evaluation of CASA and BSS models for subband cocktail-party speech separation. Frédéric Berthommier, Seungjin Choi
2002	Comparative experiments to evaluate the use of auditory-based acoustic distinctive features and formant cues for automatic speech recognition using a multi-stream paradigm. Hesham Tolba, Sid-Ahmed Selouani, Douglas D. O'Shaughnessy
2002	Comparing intelligibility of several non-native accent classes in noise. Shawn A. Weil
2002	Comparing isolately spoken keywords with spontaneously spoken queries for Japanese spoken document retrieval. Hiromitsu Nishizaki, Seiichi Nakagawa
2002	Comparison and combination of RASTA-PLP and FF features in a hybrid HMM/MLP speech recognition system. Pere Pujol Marsal, Susagna Pol, Astrid Hagen, Hervé Bourlard, Climent Nadeu
2002	Comparison of acoustic distance measures for automatic cross-language phoneme mapping. Jayren J. Sooful, Elizabeth C. Botha
2002	Compensating for hyperarticulation by modeling articulatory properties. Hagen Soltau, Florian Metze, Alex Waibel
2002	Compensation of channel effect on line spectrum frequencies. An-Tze Yu, Hsiao-Chuan Wang
2002	Comprehension of non-native speech: inaccurate phoneme processing and activation of lexical competitors. Mirjam Broersma
2002	Computationally efficient method of speech enhancement based on block representation of signal in state space and vector quantization. Vasyl Semenov, Alexander Kovtonyuk, Alexander Kalyuzhny
2002	Computationally efficient noise compensation for robust automatic speech recognition assessed under the Aurora 2/3 framework. Nicholas W. D. Evans, John S. D. Mason
2002	Computationally efficient time-scale modification of speech using 3 level clipping. Sung-Joo Lee, Hyung Soon Kim
2002	Computer-assisted second-language speech learning: generalization of prosody-focused training. Debra M. Hardison
2002	Confidence metrics for speaker identification. Mark C. Huggins, John J. Grieco
2002	Confusion-based query expansion for OOV words in spoken document retrieval. Beth Logan, Jean-Manuel Van Thong
2002	Constructing shared-state hidden Markov models based on a Bayesian approach. Shinji Watanabe, Yasuhiro Minami, Atsushi Nakamura, Naonori Ueda
2002	Constructing small language models from grammars. Francis Picard, Dominique Boucher, Guy Lapalme
2002	Construction of decision tree from data driven clustering. Junho Park, Hanseok Ko
2002	Contextual effects in the perception of fricative place of articulation: a rotational hypothesis. Willy Serniclaes, René Carré
2002	Contextual effects on voicing judgment of stop consonants in Japanese. Makiko Aoyagi
2002	Continuous environmental adaptation of a speech recogniser in telephone line conditions. Carlos M. G. S. Lima, Luís B. Almeida, João L. Monteiro
2002	Contribution to topic identification by using word similarity. Armelle Brun, Kamel Smaïli, Jean Paul Haton
2002	Control system for talking robot to replicate articulatory movement of natural speech. Takemi Mochida, Masaaki Honda, Kouki Hayashi, Toshiharu Kuwae, Kunihiro Tanahashi, Kazufumi Nishikawa, Atsuo Takanishi
2002	Controling anticipatory behavior for rounding in French cued speech. Virginie Attina, Marie-Agnès Cathiard, Denis Beautemps
2002	Controlling perceived degradation in spectrum envelope modeling via predistortion. Pushkar Patwardhan, Preeti Rao
2002	Coordination of hand and orofacial movements for CV sequences in French cued speech. Virginie Attina, Denis Beautemps, Marie-Agnès Cathiard
2002	Coordination of referring expressions in multimodal human-computer dialogue. Gabriel Skantze
2002	Corpus-based analysis of English spoken by Japanese students in view of the entire phonemic system of English. Nobuaki Minematsu, Gakuto Kurata, Keikichi Hirose
2002	DARPA communicator evaluation: progress from 2000 to 2001. Marilyn A. Walker, Alexander I. Rudnicky, John S. Aberdeen, Elizabeth Owen Bratt, John S. Garofolo, Helen Wright Hastie, Audrey N. Le, Bryan L. Pellom, Alexandros Potamianos, Rebecca J. Passonneau, Rashmi Prasad, Salim Roukos, Gregory A. Sanders, Stephanie Seneff, David Stallard
2002	DARPA communicator: cross-system results for the 2001 evaluation. Marilyn A. Walker, Alexander I. Rudnicky, Rashmi Prasad, John S. Aberdeen, Elizabeth Owen Bratt, John S. Garofolo, Helen Wright Hastie, Audrey N. Le, Bryan L. Pellom, Alexandros Potamianos, Rebecca J. Passonneau, Salim Roukos, Gregory A. Sanders, Stephanie Seneff, David Stallard
2002	DCT-based video features for audio-visual speech recognition. Martin Heckmann, Kristian Kroschel, Christophe Savariaux, Frédéric Berthommier
2002	DETAC: a discriminative criterion for speaker verification. Jirí Navrátil, Ganesh N. Ramaswamy
2002	Data, annotation schemes and coding tools for natural interactivity. Laila Dybkjær, Niels Ole Bernsen
2002	Data-driven segment preselection in the IBM trainable speech synthesis system. Wael Hamza, Robert E. Donovan
2002	Data-driven temporal filters obtained via different optimization criteria evaluated on Aurora2 database. Jeih-weih Hung, Lin-Shan Lee
2002	Data-driven vector clustering for low-memory footprint ASR. Karim Filali, Xiao Li, Jeff A. Bilmes
2002	Decision tree distribution tying based on a dimensional split technique. Heiga Zen, Keiichi Tokuda, Tadashi Kitamura
2002	Design for a speech-to-speech translator for field use. David Stallard, Premkumar Natarajan, Mohammed Noamany, Richard M. Schwartz, John Makhoul
2002	Design of a Mandarin sentence set for corpus-based speech synthesis by use of a multi-tier algorithm taking account of the varied prosodic and spectral characteristics. Jinfu Ni, Hisashi Kawai
2002	Design of an audio-visual speech corpus for the czech audio-visual speech synthesis. Milos Zelezný, Petr Císar, Zdenek Krnoul, Jan Novák
2002	Design of system-initiated digressive proposals for automated banking dialogues. Jenny Wilkie, Mervyn A. Jack, Peter J. Littlewood
2002	Designing Japanese speech database covering wide range in prosody for hybrid speech synthesizer. Hiromichi Kawanami, Tsuyoshi Masuda, Tomoki Toda, Kiyohiro Shikano
2002	Designing a speaker-discriminative adaptive filter bank for speaker recognition. Tomi Kinnunen
2002	Detection and recognition of repaired speech on misrecognized utterances for speech input of car navigation system. Naoko Kakutani, Norihide Kitaoka, Seiichi Nakagawa
2002	Development of Japanese infant speech database and speaking rate analysis. Shigeaki Amano, Kazumi Kato, Tadahisa Kondo
2002	Development of a GUI-based articulatory speech synthesis system. Kohichi Ogata, Yorinobu Sonoda
2002	Discrimination of English vowels in consonantal contexts by native speakers of Japanese and its relations to dynamic information of formants. Akiyo Joto, Motohisa Imaishi, Yoshiki Nagase, Seiya Funatsu
2002	Discriminative linear transforms for feature normalization and speaker adaptation in HMM estimation. Stavros Tsakalidis, Vlasios Doumpiotis, William Byrne
2002	Discriminative training for call classification and routing. Hong-Kwang Jeff Kuo, Chin-Hui Lee, Imed Zitouni, Eric Fosler-Lussier, Egbert Ammicht
2002	Distributed Chinese keyword spotting and verification for spoken dialogues under wireless environment. Yun-Tien Lee, Cheng-Huang Wu, Yumin Lee, Lin-Shan Lee
2002	Distributed audio-visual speech synchronization. Peter Poller, Jochen Müller
2002	Distributed speech recognition over IP networks on the Aurora 3 database. Laura Docío Fernández, Carmen García-Mateo
2002	Distributed speech recognition using noise-robust MFCC and traps-estimated manner features. Pratibha Jain, Hynek Hermansky, Brian Kingsbury
2002	Divergence-based out-of-class rejection for telephone handset identification. Chi-Leung Tsang, Man-Wai Mak, Sun-Yuan Kung
2002	Double the trouble: handling noise and reverberation in far-field automatic speech recognition. David Gelbart, Nelson Morgan
2002	Duration and F0 as perceptual cues to Japanese vowel quantity. Keisuke Kinoshita, Dawn M. Behne, Takayuki Arai
2002	Duration modeling for arabic text to speech synthesis. Yasser Hifny, Mohsen A. Rashwan
2002	Duration related phase realignment of Thai tones. John J. Ohala, Rungpat Roengpitya
2002	Dutch HLT resources: from BLARK to priority lists. Helmer Strik, Walter Daelemans, Diana Binnenpoorte, Janienke Sturm, Folkert de Vriend, Catia Cucchiarini
2002	Dynamic search-space pruning for time-constrained speech recognition. Sascha Wendt, Gernot A. Fink, Franz Kummert
2002	Dynamic tuning of language model score in speech recognition using a confidence measure. Sherif M. Abdou, Michael S. Scordilis
2002	E-mail goes mobile: the design and implementation of a spoken language interface to e-mail. Daniela Oria, Esa Koskinen
2002	EM training of finite-state transducers and its application to pronunciation modeling. Han Shu, I. Lee Hetherington
2002	Effect of F0 fluctuation and amplitude modulation of natural vowels on vowel identification in noisy environments. Kentaro Ishizuka, Kiyoaki Aikawa
2002	Effects of intra-phrase position on acceptability of changes in segmental duration in sentence speech. Makiko Muto, Hiroaki Kato, Minoru Tsuzaki, Yoshinori Sagisaka
2002	Effects of production training with visual feedback on the acquisition of Japanese pitch and durational contrasts. Yukari Hirata
2002	Effects of word error rate in the DARPA communicator data during 2000 and 2001. Gregory A. Sanders, Audrey N. Le, John S. Garofolo
2002	Efficient additive and convolutional noise reduction procedures. Bojan Kotnik, Damjan Vlaj, Zdravko Kacic, Bogomir Horvat
2002	Efficient and scalable methods for text script generation in corpus-based TTS design. Chih-Chung Kuo, Jing-Yi Huang
2002	Efficient combination of type-in and wizard-of-oz tests in speech interface development process. Saija-Maaria Lemmelä, Péter Pál Boda
2002	Efficient construction of long-range language models using log-linear interpolation. Edward W. D. Whittaker, Dietrich Klakow
2002	Efficient precalculation of LM contexts for large vocabulary continuous speech recognition. Javier Dieguez-Tirado, Antonio Cardenal López
2002	Eigenvoices for HMM-based speech synthesis. Kengo Shichiri, Atsushi Sawabe, Takayoshi Yoshimura, Keiichi Tokuda, Takashi Masuko, Takao Kobayashi, Tadashi Kitamura
2002	Emotion recognition from textual input using an emotional semantic network. Ze-Jing Chuang, Chung-Hsien Wu
2002	Emotional space improves emotion recognition. Raquel Tato, Rocío Santos, Ralf Kompe, José M. Pardo
2002	English call system with functions of speech segmentation and pronunciation evaluation using speech recognition technology. Yasuo Ariki, Jun Ogata
2002	Enhanced histogram normalization in the acoustic feature space. Sirko Molau, Florian Hilger, Daniel Keysers, Hermann Ney
2002	Enhancement of single channel speech using perception-based wavelet transform. Ching-Ta Lu, Hsiao-Chuan Wang
2002	Entropy of energy operator as feature for large vocabulary Mandarin speaker independent speech recognition. Fadhil H. T. Al-Dulaimy, Zuoying Wang
2002	Error-tolerant spoken language understanding with confidence measuring. Huei-Ming Wang, Yi-Chung Lin
2002	Estimating syntactic structure from F0 contour and pause duration in Japanese speech. Yasuo Horiuchi, Tomoko Ohsuga, Akira Ichikawa
2002	Evaluation of SPLICE on the Aurora 2 and 3 tasks. Jasha Droppo, Li Deng, Alex Acero
2002	Evaluation of a noise adaptive speech recognition system on the Aurora 3 database. Kaisheng Yao, Donglai Zhu, Satoshi Nakamura
2002	Evaluation of a noise-robust DSR front-end on Aurora databases. Duncan Macho, Laurent Mauuary, Bernhard Noé, Yan Ming Cheng, Douglas Ealey, Denis Jouvet, Holly Kelleher, David Pearce, Fabien Saadoun
2002	Evaluation of a speech recognition / generation method based on HMM and straight. Toshio Irino, Yasuhiro Minami, Tomohiro Nakatani, Minoru Tsuzaki, H. Tagawa
2002	Evaluation of a system for concatenative articulatory visual speech synthesis. Olov Engwall
2002	Evaluation of cross-language voice conversion using bilingual and non-bilingual databases. Mikiko Mashimo, Tomoki Toda, Hiromichi Kawanami, Hideki Kashioka, Kiyohiro Shikano, Nick Campbell
2002	Evaluation of formant-like features for ASR. Katrin Weber, Febe de Wet, Bert Cranen, Lou Boves, Samy Bengio, Hervé Bourlard
2002	Evaluation of noise robust features on the Aurora databases. Xiaodong Cui, Markus Iseli, Qifeng Zhu, Abeer Alwan
2002	Evaluation of noisy speech recognition based on noise reduction and acoustic model adaptation on the Aurora2 tasks. Masakiyo Fujimoto, Yasuo Ariki
2002	Evaluation of spectral subtraction with smoothing of time direction on the Aurora 2 task. Norihide Kitaoka, Seiichi Nakagawa
2002	Evaluation of the method to detect Japanese local speech rate deceleration applying the variable threshold with a constant term. Keiichi Takamaru, Makoto Hiroshige, Kenji Araki, Koji Tochinai
2002	Evidence for efficiency in vowel production. R. J. J. H. van Son, Louis C. W. Pols
2002	Expanded examinations of a low frequency modulation feature for speech/music discrimination. Stefan Karnebäck
2002	Experiments in confidence scoring for word and sentence verification. Marco Andorno, Pietro Laface, Roberto Gemello
2002	Experiments on recognition of lavalier microphone speech and whispered speech in real world environments. Kiyoshi Tatara, Taisuke Ito, Parham Zolfaghari, Kazuya Takeda, Fumitada Itakura
2002	Experiments on speaker-independent voice command recognition using in-vehicle hands free speech. Yifan Gong, Lorin Netsch
2002	Exploiting support vector machines in hidden Markov models for speaker verification. Dong Xin, Zhaohui Wu, Yingchun Yang
2002	Exploiting variances in robust feature extraction based on a parametric model of speech distortion. Li Deng, Jasha Droppo, Alex Acero
2002	Exploring sub-word features and linear support vector machines for German spoken document classification. Martha A. Larson, Stefan Eickeler, Gerhard Paaß, Edda Leopold, Jörg Kindermann
2002	Expressive speech synthesis using a concatenative synthesizer. Murtaza Bulut, Shrikanth S. Narayanan, Ann K. Syrdal
2002	Extracting clauses for spoken language understanding in conversational systems. Narendra K. Gupta, Srinivas Bangalore, Mazin G. Rahim
2002	Extraction of important sentences using F0 information for speech summarization. Yoichi Yamashita, Akira Inoue
2002	Eye-fixation as a measure of real-time processing of synthesized words. Mary D. Swift, Ellen Campana, James F. Allen, Michael K. Tanenhaus
2002	Eyebrow movements and voice variations in dialogue situations: an experimental investigation. Christian Cavé, Isabelle Guaïtella, Serge Santi
2002	F0 generation for speech synthesis using a multi-tier approach. Xuejing Sun
2002	FORM: an extensible, kinematically-based gesture annotation scheme. Craig Martell
2002	FPGA hardware for speech recognition using hidden Markov models. José Luis Gómez-Cipriano, Roger Pizzatto Nunes, Dante A. C. Barone
2002	Factor analyzed Gaussian mixture models for speaker identification. Peng Ding, Yang Liu, Bo Xu
2002	Factors in human language identification. Ian Maddieson, Ioana Vasilescu
2002	Fast hierarchical grammar optimization algorithm toward time and space efficiency. Jing Zheng, Horacio Franco
2002	Feature extraction combining spectral noise reduction and cepstral histogram equalization for robust ASR. José C. Segura, M. Carmen Benítez, Ángel de la Torre, Antonio J. Rubio
2002	Feature extraction for unit selection in concatenative speech synthesis: comparison between AIM, LPC, and MFCC. Minoru Tsuzaki, Hisashi Kawai
2002	Feed the tiger: a method for evoking reliable jaw stretch reflexes in children. Donald S. Finan, Anne Smith, Michael Ho
2002	Feedback in computer assisted pronunciation training: technology push or demand pull? Ambra Neri, Catia Cucchiarini, Helmer Strik
2002	Filter bank subtraction for robust speech recognition. Kazuo Onoe, Hiroyuki Segi, Takeshi Kobayakawa, Shoei Sato, Toru Imai, Akio Ando
2002	Finite-state transducer based hungarian LVCSR with explicit modeling of phonological changes. Máté Szarvas, Sadaoki Furui
2002	Fixed-length segment coding of LSF parameters. Evgeni Yakhnich, Yuval Bistritz
2002	Flexible dialogue management in the talk'n'travel system. David Stallard
2002	Flexible multimodal human-machine interaction in mobile environments. Dirk Bühler, Wolfgang Minker, Jochen Häußler, Sven Krger
2002	Floating-point adaptive multi-rate wideband speech codec. Toni P. Nieminen
2002	Formant model estimation and transformation for voice morphing. Ching-Hsiang Ho, Dimitrios Rentzos, Saeed Vaseghi
2002	Forms of introduction in map task dialogues: case of L2 Russian speakers. Olga Goubanova
2002	Framewise phone classification using support vector machines. Jesper Salomon, Simon King, Miles Osborne
2002	French nasal vowels: acoustic and articulatory properties. Véronique Delvaux, Thierry Metens, Alain Soquet
2002	Frequency band analysis for stress detection using a teager energy operator based feature. Mandar A. Rahurkar, John H. L. Hansen, James Meyerhoff, George Saviolakis, Michael Koenig
2002	Frequency dependence of vocal-tract length. Takuya Niikawa, Takanori Ando, Masafumi Matsumura
2002	From text to prosody without toBI. Volker Strom
2002	Frontend post-processing and backend model enhancement on the Aurora 2.0/3.0 databases. Chia-Ping Chen, Karim Filali, Jeff A. Bilmes
2002	Full-text story alignment models for Chinese-English bilingual news corpora. Bing Zhao, Stephan Vogel
2002	Functional modeling of face movements during speech. Shinji Maeda, Martine Toda, Andreas J. Carlen, Lyes Meftahi
2002	Generalization of state-observation-dependency in partly hidden Markov models. Tetsuji Ogawa, Tetsunori Kobayashi
2002	Generating script using statistical information of the context variation unit vector. Haiping Li, Fangxin Chen, Liqin Shen
2002	German broadcast news transcription. Robert Hecht, Jürgen Riedler, Gerhard Backfried
2002	Gestural spatialization in natural discourse segmentation. Francis K. H. Quek, David McNeill, Robert K. Bryll, Mary P. Harper
2002	Gestural trajectory symmetries and discourse segmentation. Francis K. H. Quek, Yingen Xiong, David McNeill
2002	Globalphone: a multilingual speech and text database developed at karlsruhe university. Tanja Schultz
2002	Goal-directed ASR in a multimedia indexing and searching environment (MUMIS). Mirjam Wester, Judith M. Kessens, Helmer Strik
2002	Grammar specialisation meets language modelling. Manny Rayner, Beth Ann Hockey, John Dowding
2002	Grapheme-to-phoneme conversion using pseudo-morphological units. Ulla Uebler
2002	HMM COmposition-based rapid model adaptation using a priori noise GMM adaptation evaluation on Aurora2 corpus. Masaki Ida, Satoshi Nakamura
2002	HMM-based methods for channel error mitigation in distributed speech recognition. Antonio M. Peinado, Victoria E. Sánchez, José L. Pérez-Córdoba, José C. Segura, Antonio J. Rubio
2002	Hearing-aid benefits and limitations: predictions from a cochlear model. James M. Kates
2002	Hierarchical Gaussian mixture model for speaker verification. Ming Liu, Eric Chang, Bei-qian Dai
2002	High performance digit recognition in real car environments. Umit H. Yapanel, Xianxian Zhang, John H. L. Hansen
2002	Highly oversampled subband adaptive filters for noise cancellation on a low-resource DSP system. King Tam, Hamid Sheikhzadeh, Todd Schneider
2002	Holds as gestural correlates to empty and filled speech pauses. Anna Esposito, Susan Duncan, Francis K. H. Quek
2002	How speakers with and without speech impairment mark the question statement contrast. Rupal Patel
2002	Hypophonia in parkinson disease: neural correlates of voice treatment with LSVT revealed by PET. Mario Liotti, Lorraine O. Ramig, Deanie Vogel, Pamela New, Chris Cook, Peter Fox
2002	ISIS: a multi-modal, trilingual, distributed spoken dialog system developed with CORBA, java, XML and KQML. Helen M. Meng, P. C. Ching, Yee Fong Wong, Cheong Chat Chan
2002	Implementation of an intonational quality assessment system. Chanwoo Kim, Wonyong Sung
2002	Implementation testing of a hybrid symbolic/statistical multimodal architecture. Edward C. Kaiser, Philip R. Cohen
2002	Implementing vocal tract length normalization in the MLLR framework. Guo-Hong Ding, Yi-Fei Zhu, Chengrong Li, Bo Xu
2002	Improve latent semantic analysis based language model by integrating multiple level knowledge. Rong Zhang, Alexander I. Rudnicky
2002	Improved Chinese spoken document retrieval with hybrid modeling and data-driven indexing features. Chun-Jen Wang, Berlin Chen, Lin-Shan Lee
2002	Improved corpus-based synthesis of fundamental frequency contours using generation process model. Keikichi Hirose, Masaya Eto, Nobuaki Minematsu
2002	Improved katz smoothing for language modeling in speech recogniton. Genqing Wu, Fang Zheng, Wenhu Wu, Mingxing Xu, Ling Jin
2002	Improved performance speech codec for mobile communications. K. Humphreys, Robert Lawlor
2002	Improved phone recognition on TIMIT using formant frequency data and confidence measures. N. J. Wilkinson, Martin J. Russell
2002	Improved structural maximum likelihood eigenspace mapping for rapid speaker adaptation. Bowen Zhou, John H. L. Hansen
2002	Improvement of the ELS-based time-varying complex speech analysis. Keiichi Funaki
2002	Improvements to the IBM Aurora 2 multi-condition system. George Saon, Juan M. Huerta
2002	Improving latent semantic indexing based classifier with information gain. Li Li, Wu Chou
2002	Improving parametric trajectory modeling by integration of pitch and tone information. Yiyan Zhang, Wenju Liu, Bo Xu, Huayun Zhang
2002	Improving performance of an HMM-based ASR system by using monophone-level normalized confidence measure. Muhammad Ghulam, Takashi Fukuda, Takaharu Sato, Tsuneo Nitta
2002	Improving phone-level discrimination in LDA with subphone-level classes. Hwa Jeon Song, Hyung Soon Kim
2002	Improving speech recognition performance of small microphone arrays using missing data techniques. Iain McCowan, Andrew C. Morris, Hervé Bourlard
2002	Improving spoken language understanding using word confusion networks. Gökhan Tür, Jerry H. Wright, Allen L. Gorin, Giuseppe Riccardi, Dilek Hakkani-Tür
2002	Improving statistical machine translation for a speech-to-speech translation task. Stephan Vogel, Alicia Tribble
2002	Improving the role of unvoiced speech segments by spectral normalisation in robust speech recognition. Carlos M. G. S. Lima, Luís B. Almeida, João L. Monteiro
2002	Improving word accuracy with Gabor feature extraction. Michael Kleinschmidt, David Gelbart
2002	Incremental on-line feature space MLLR adaptation for telephony speech recognition. Yongxin Li, Hakan Erdogan, Yuqing Gao, Etienne Marcheret
2002	Individual word language models and the frequency approach. Elvira I. Sicilia-Garcia, Ji Ming, Francis Jack Smith
2002	Influence of different dialogue situations on user²s behavior in spoken corrections. Atsuhiko Kai, Yukari Nonomura, Toshihiko Itoh, Tatsuhiro Konishi, Yukihiro Itoh
2002	Influence of prosody, context, and word order in the identification of focus in Japanese dialogue. Tatsuya Kitamura, Kayo Itoh, Toshihiko Itoh, Shigeyoshi Kitazawa
2002	Influence of transmission errors on ASR systems. Carmen Peláez-Moreno, Ascensión Gallardo-Antolín, Jesús Vicente-Peña, Fernando Díaz-de-María
2002	Information retrieval based on speech recognition results. Masatoshi Watanabe, Masahide Sugiyama
2002	Information-theoretic criteria for unit selection synthesis. Jon R. W. Yi, James R. Glass
2002	Ingressive speech as an indication that humans are talking to humans (and not to machines). Robert Eklund
2002	Integrating multiple pronunciations during MCE-based acoustic model training for large vocabulary speech recognition. Rathi Chengalvarayan
2002	Integrating speech with keypad input for automatic entry of spelling and pronunciation of new words. Grace Chung, Stephanie Seneff
2002	Integration of MLLR adaptation with pronunciation proficiency adaptation for non-native speech recognition. Nobuaki Minematsu, Gakuto Kurata, Keikichi Hirose
2002	Integration of phonetic length properties in the acoustic models of false starts and out-of-vocabulary words. H. Hamimed, Géraldine Damnati
2002	Integration of supra-lexical linguistic models with speech recognition using shallow parsing and finite state transducers. Xiaolong Mou, Stephanie Seneff, Victor Zue
2002	Integration of two stochastic context-free grammars. Anna Corazza
2002	Intelligibility of reverse speech in French: a perceptual study. Ivan Magrin-Chagnolleau, Melissa Barkat, Fanny Meunier
2002	Interaction of voice over internet protocol speech coders and disordered speech samples. Vijay Parsa, Donald G. Jamieson
2002	Interlingua based statistical machine translation. Manuel Kauers, Stephan Vogel, Christian Fügen, Alex Waibel
2002	Interpreting meaning from context: modeling the prosody of discourse markers in speech. Li-chiung Yang
2002	Intonation modelling for the synthesis of structured documents. Jeska Buhmann, Jean-Pierre Martens, Lieve Macken, Bert Van Coile
2002	Intonational and visual cues in the perception of interrogative mode in Swedish. David House
2002	Intrasyllabic articulatory control constraints in verbal working memory. Marc Sato, Jean-Luc Schwartz, Marie-Agnès Cathiard, Christian Abry, Hélène Loevenbruck
2002	Intrinsic phone durations are speaker-specific. Hartmut R. Pfitzinger
2002	Introduction of constraints in an acoustic-to-articulatory inversion method based on a hypercubic articulatory table. Yves Laprie, Slim Ouni
2002	Investigation of coarticulation based on electromagnetic articulographic data. Jianwu Dang, Masaaki Honda, Kiyoshi Honda
2002	Investigations on joint-multigram models for grapheme-to-phoneme conversion. Maximilian Bisani, Hermann Ney
2002	Is the speaker done yet? faster and more accurate end-of-utterance detection using prosody. Luciana Ferrer, Elizabeth Shriberg, Andreas Stolcke
2002	Issues in automatic transcription of historical audio data. Fabio Brugnara, Mauro Cettolo, Marcello Federico, Diego Giuliani
2002	Issues in the development of a stochastic speech understanding system. Fabrice Lefèvre, Hélène Bonneau-Maynard
2002	Japanese broadcast news transcription. Long Nguyen, Xuefeng Guo, Richard M. Schwartz, John Makhoul
2002	Juncture segmentation of Japanese prosodic unit based on the spectrographic features. Shigeyoshi Kitazawa, Toshihiko Itoh, Tatsuya Kitamura
2002	Kymographic imaging of the vocal fold oscillations. Jan G. Svec, Frantisek Sram
2002	LU factorization for feature transformation. Patrick Nguyen, Luca Rigazio, Christian Wellekens, Jean-Claude Junqua
2002	Large vocabulary conversational speech recognition with the extended maximum likelihood linear transformation (EMLLT) model. Jing Huang, Vaibhava Goel, Ramesh Gopinath, Brian Kingsbury, Peder A. Olsen, Karthik Visweswariah
2002	Laryngoscopic analysis of tibetan chanting modes and their relationship to register in sino-tibetan. John H. Esling
2002	Learning decision trees to determine turn-taking by spoken dialogue systems. Ryo Sato, Ryuichiro Higashinaka, Masafumi Tamoto, Mikio Nakano, Kiyoaki Aikawa
2002	Learning syllable duration and intonation of Mandarin Chinese. Oliver Jokisch, Hongwei Ding, Hans Kruschke, Guntram Strecha
2002	Likelihood combination and recognition output voting for the decoding of non-native speech with multilingual HMMs. Volker Fischer, Eric Janke, Siegfried Kunzmann
2002	Linguistic and acoustic changes of user²s utterances caused by different dialogue situations. Toshihiko Itoh, Atsuhiko Kai, Tatsuhiro Konishi, Yukihiro Itoh
2002	Lip gestures in English sibilants: articulatory - acoustic relationship. Martine Toda, Shinji Maeda, Andreas J. Carlen, Lyes Meftahi
2002	Lip-reading based on a fully automatic statistical model. Philippe Daubias, Paul Deléglise
2002	Low complexity Mandarin speaker-independent isolated word recognition. Xia Wang, Juha Iso-Sipilä
2002	Low complexity techniques for embedded ASR systems. Imre Kiss, Marcel Vasilache
2002	Low cost duration modelling for noise robust speech recognition. Andrew C. Morris, Simon Payne, Hervé Bourlard
2002	Low-resource noise-robust feature post-processing on Aurora 2.0. Chia-Ping Chen, Jeff A. Bilmes, Katrin Kirchhoff
2002	Markov models based on speaker space model evolution. Dong Kook Kim, Nam Soo Kim
2002	Maximum entropy model for punctuation annotation from speech. Jing Huang, Geoffrey Zweig
2002	Maximum expected likelihood based model selection and adaptation for nonnative English speakers. Xiaodong He, Yunxin Zhao
2002	Maximum likelihood estimation of eigenvoices and residual variances for large vocabulary speech recognition tasks. Patrick Kenny, Gilles Boulianne, Pierre Dumouchel
2002	Maximum mutual information training of hidden Markov models with vector linear predictors. K. K. Chin, Philip C. Woodland
2002	Medium vocabulary continuous audio-visual speech recognition. Pascal Wiggers, Jacek C. Wojdel, Léon J. M. Rothkrantz
2002	Mel-scaled wavelet filter based features for noisy unvoiced phoneme recognition. Omar Farooq, Sekharjit Datta
2002	Memory space reduction for hidden Markov models in low-resource speech recognition systems. Sergey Astrov
2002	Methods to improve Gaussian mixture model based language identification system. Eddie Wong, Sridha Sridharan
2002	Minimum perfect hashing for fast n-gram language model lookup. Xiao Zhang, Yunxin Zhao
2002	Model partial pronunciation variations for spontaneous Mandarin speech recognition. Yi Liu, Pascale Fung
2002	Model-based independent component analysis for robust multi-microphone automatic speech recognition. Laurent Couvreur, Christophe Ris
2002	Model-based predictions of intensity discrimination for normal- and impaired-hearing listeners. Lisa G. Huettel, Leslie M. Collins
2002	Modeling HMM state distributions with Bayesian networks. Konstantin Markov, Satoshi Nakamura
2002	Modeling and automatic detection of English sentence stress for computer-assisted English prosody learning system. Kazunori Imoto, Yasushi Tsubota, Antoine Raux, Tatsuya Kawahara, Masatake Dantsuji
2002	Modeling articulatory dynamics in autoregressive linear system. Kiyoshi Hashimoto
2002	Modeling durational variability in reading aloud a connected text. Caroline L. Smith
2002	Modeling frequent allophones in Japanese speech recognition. Long Nguyen, Xuefeng Guo, John Makhoul
2002	Modeling recognition of speech sounds with minerva2. Travis Wade, Deborah K. Eakin, Russell Webb, Arvin Agah, Frank Brown, Allard Jongman, John Gauch, Thomas A. Schreiber, Joan A. Sereno
2002	Modeling the perception of frequency-shifted vowels. Peter F. Assmann, Terrance M. Nearey, Jack M. Scott
2002	Modeling tones in continuous Cantonese speech. Tan Lee, Greg Kochanski, Chilin Shih, Yujia Li
2002	Modeling varying pauses to develop robust acoustic models for recognizing noisy conversational speech. Jinsong Zhang, Satoshi Nakamura
2002	Modeling with a subspace constraint on inverse covariance matrices. Scott Axelrod, Ramesh Gopinath, Peder A. Olsen
2002	Models of speech dynamics in a segmental-HMM recognizer using intermediate linear representations. Philip J. B. Jackson, Martin J. Russell
2002	Motor specifications of a baby robot via the analysis of infants² vocalizations. Jihène Serkhane, Jean-Luc Schwartz, Louis-Jean Boë, Barbara L. Davis, Christine L. Matyear
2002	Multi-dimensional analysis of sonority: perception, acoustics, and phonology. Masahiko Komatsu, Shinichi Tokuma, Won Tokuma, Takayuki Arai
2002	Multi-scale and multi-model integration for improved performance in Chinese spoken document retrieval. Wai Kit Lo, Helen M. Meng, P. C. Ching
2002	Multilingual pronunciation modeling for improving multilingual speech recognition. Jilei Tian, Juha Häkkinen, Olli Viikki
2002	Multilingual speech recognition with language identification. Bin Ma, Cuntai Guan, Haizhou Li, Chin-Hui Lee
2002	Multimodal integration patterns in children. Benfang Xiao, Cynthia Girand, Sharon L. Oviatt
2002	Multimodal language processing for mobile information access. Michael Johnston, Srinivas Bangalore, Amanda Stent, Gunaranjan Vasireddy, Patrick Ehlen
2002	Multiparty multimodal interaction: a preliminary analysis. Philip R. Cohen, Rachel Coulston, Kelly Krout
2002	Multiple regression of log-spectra for in-car speech recognition. Tetsuya Shinde, Kazuya Takeda, Fumitada Itakura
2002	Mutual information phone clustering for decision tree induction. Ciprian Chelba, Rachel Morton
2002	N-word-sequence frequency noise mitigation for SLM based on binomial distribution. Yibao Zhao, Guojun Zhou
2002	Named entity extraction from spontaneous speech in how may i help you? Frédéric Béchet, Allen L. Gorin, Jerry H. Wright, Dilek Hakkani-Tür
2002	Native and vietnamese production of compound and phrasal stress patterns. Thu Nguyen, John Ingram
2002	Network-based vs. distributed speech recognition in adaptive multi-rate wireless systems. Tim Fingscheidt, Stefanie Aalburg, Sorel Stan, Christophe Beaugeant
2002	Neurocognitive basis for audiovisual speech perception: evidence from event-related potentials. Curtis W. Ponton, Edward T. Auer, Lynne E. Bernstein
2002	New model for speech residual signal shaping with static nonlinearity. Jari Juhani Turunen, Juha T. Tanttu, Pekka Loula
2002	Noise adaptive speech recognition with acoustic models trained from noisy speech evaluated on Aurora-2 database. Kaisheng Yao, Kuldip K. Paliwal, Satoshi Nakamura
2002	Noise estimation for efficient speech enhancement and robust speech recognition. Petr Motlícek, Lukás Burget
2002	Noise from corrupted speech log mel-spectral energies. Jasha Droppo, Alex Acero, Li Deng
2002	Noise robust speech recognition using F0 contour extracted by hough transform. Koji Iwano, Takahiro Seki, Sadaoki Furui
2002	Noise-robust speech recognition in car environments using genetic algorithms and a mel-cepstral subspace approach. Sid-Ahmed Selouani, Douglas D. O'Shaughnessy
2002	Non-linear techniques for dysphonic voice analysis and correction. Claudia Manfredi, Lorenzo Matassini
2002	Objective distance measures for spectral discontinuities in concatenative speech synthesis. Jithendra Vepa, Simon King, Paul Taylor
2002	On F0 trajectory optimization for very high-quality speech manipulation. Hideki Kawahara, Parham Zolfaghari, Alain de Cheveigné
2002	On developing new text and audio corpora and speech recognition tools for the turkish language. Özgül Salor, Bryan L. Pellom, Tolga Çiloglu, Kadri Hacioglu, Mübeccel Demirekler
2002	On effective speaker verification based on subword model. Sungjoo Ahn, Sunmee Kang, Hanseok Ko
2002	On improving the performance of analysis-by-synthesis coding using a multi-magnitude algebraic code-book excitation signal. Omar Halmi, Hesham Tolba, Driss Guerchi, Douglas D. O'Shaughnessy
2002	On text-based language identification for multilingual speech recognition systems. Jilei Tian, Juha Häkkinen, Søren Riis, Kåre Jean Jensen
2002	On the estimation of signal-to-noise ratio in continuous speech for abnormal voices. Vijay Parsa, Donald G. Jamieson, Karen Stenning, Herbert A. Leeper
2002	On the function of the late rise and the early fall in dutch dialogue: a perception experiment. Johanneke Caspers
2002	On the relevance of bandwidth extension for speaker verification. Marcos Faúndez-Zanuy, Mattias Nilsson, W. Bastiaan Kleijn
2002	On the role of the "schwa" in the perception of plosive consonants. René Carré, Jean-Sylvain Liénard, Egidio Marsico, Willy Serniclaes
2002	On the use of Gaussian mixture model for speaker variability analysis. Tao Chen, Chao Huang, Eric Chang, Jingchun Wang
2002	On the use of structures in language models for dialogue. Renato De Mori, Yannick Estève, Christian Raymond
2002	On use of duration modeling for continuous digits speech recognition. Rong Dong, Jie Zhu
2002	Operations for context-based multimodal interpretation in conversational systems. Joyce Yue Chai
2002	Optimal selection of speech data for automatic speech recognition systems. Arkadiusz Nagórski, Lou Boves, Herman J. M. Steeneken
2002	Optimal speech signal partition into one-quasiperiodical segments. Taras K. Vintsiuk
2002	Optimization of hidden Markov models for embedded systems. Klaus Reinhard, Jochen Junkawitsch, Andreas Kießling, Stefan Dobler
2002	Oral-laryngeal control patterns for fricatives in 5-year-olds and adults. Laura L. Koenig, Jorge C. Lucero
2002	Orientel: speech-based interactive communication applications for the mediterranean and the middle east. Imed Zitouni, Joseph P. Olive, Dorota J. Iskra, Khalid Choukri, Ossama Emam, Oren Gedge, Emmanuel Maragoudakis, Herbert S. Tropf, Asunción Moreno, Albino Nogueiras Rodríguez, Barbara Heuft, Rainer Siemund
2002	Oro-facial changes in parkinson²s disease following intensive voice therapy (LSVT). Jennifer L. Spielman, Lorraine O. Ramig, Joan C. Borod
2002	Overview on recent activities in speech understanding and dialogue systems evaluation. Wolfgang Minker
2002	Overview on recent activities in speech understanding and dialogue systems evaluation. Wolfgang Minker
2002	Parametric trajectory segment model for LVCSR. Lei Jia, Bo Xu
2002	Part-of-speech tagging in French text-to-speech synthesis: experiments in tagset selection. Hongyan Jing, Evelyne Tzoukermann
2002	Pause duration and variability in read texts. Elena Zvonik, Fred Cummins
2002	Perceived boundary strength. Petra Hansson
2002	Perception and integration of audiovisual speech in human infants. David J. Lewkowicz
2002	Perception of prosodic phrasing by hearing-impaired listeners. Dragana Barac-Cikoja, Sally Revoile
2002	Perception of tone and vowel quantity in Thai. Hansjörg Mixdorff, Sudaporn Luksaneeyanawin, Hiroya Fujisaki, Patavee Charnvivit
2002	Perceptual adjustment to foreign-accented English with short term exposure. Constance M. Clarke
2002	Perceptual effects of assimilation-induced violation of final devoicing in dutch. Cecile T. L. Kuijpers, Wilma van Donselaar, Anne Cutler
2002	Perceptual evaluation of audiovisual cues for prominence. Emiel Krahmer, Zsófia Ruttkay, Marc Swerts, Wieger Wesselink
2002	Perceptual evaluation of naturalness due to substitution of Chinese syllable for concatenative speech synthesis. Jinlin Lu, Hisashi Kawai
2002	Perceptual learning of second-language syllable rhythm by elderly listeners. Keiichi Tajima, Reiko Akahane-Yamada, Tsuneo Yamada
2002	Performance of discriminatively trained auditory features on Aurora2 and Aurora3. Brian Kan-Wing Mak, Yik-Cheung Tam
2002	Perpetually optimizing the cost function for unit selection in a TTS system with one single run of MOS evaluation. Hu Peng, Yong Zhao, Min Chu
2002	Phonetic normalization using z-score in segmental prosody estimation for corpus-based TTS system. Hoeun Song, Jaein Kim, Kyongrok Lee, Jinyoung Kim
2002	Phonetic speaker identification. Qin Jin, Tanja Schultz, Alex Waibel
2002	Phonological norms in faroese speech synthesis. Pétur Helgason, Sjrðhur Gullbein
2002	Pitch accent prediction using ensemble machine learning. Xuejing Sun
2002	Pitch contour model for Chinese text-to-speech using CART and statistical model. Minghui Dong, Kim-Teng Lua
2002	Pitch extraction of speech signals using an eigen-based subspace method. Takahiro Murakami, Munehiro Namba, Tetsuya Hoya, Yoshihisa Ishida
2002	Porting channel robustness across languages. Françoise Beaufays, Daniel Boies, Mitch Weintraub
2002	Power spectral density based channel equalization of large speech database for concatenative TTS system. Yu Shi, Eric Chang, Hu Peng, Min Chu
2002	Preaspirated stops in southern Swedish. Mechtild Tronnier
2002	Predicting oral reading miscues. Jack Mostow, Joseph Beck, S. Vanessa Winter, Shaojun Wang, Brian Tobin
2002	Preliminary data on effects of behavioral and levodopa therapies on speech-accompanying gesture in parkinson²s disease. Susan Duncan
2002	Probabilistic ranking of constraints. Louis ten Bosch
2002	Probabilistic retrieval based on document representations. Wolfgang Macherey, Hans Jörg Viechtbauer, Hermann Ney
2002	Processing of temporal cues marking phrasal boundaries in individuals with brain damage. Wendi A. Aasland, Shari R. Baum
2002	Production and perception of pauses and their linguistic context in read and spontaneous speech in Swedish. Beáta Megyesi, Sofia Gustafson-Capková
2002	Production based pitch modification of voiced speech. Yinglong Jiang, Peter Murphy
2002	Progress with the philips continuous ASR system on the Aurora 2 noisy digits database. Markus Lieb, Alexander Fischer
2002	Pronunciation of proper names with a joint n-gram model for bi-directional grapheme-to-phoneme conversion. Lucian Galescu, James F. Allen
2002	Prosodic parameter for speaker identification. Katarina Bartkova, David Le Gac, Delphine Charlet, Denis Jouvet
2002	Prosodic phrasing with inductive learning. Sheng Zhao, Jianhua Tao, Lianhong Cai
2002	Prosody-based automatic detection of annoyance and frustration in human-computer dialog. Jeremy Ang, Rajdip Dhillon, Ashley Krupski, Elizabeth Shriberg, Andreas Stolcke
2002	Qualcomm-ICSI-OGI features for ASR. André Gustavo Adami, Lukás Burget, Stéphane Dupont, Harinath Garudadri, Frantisek Grézl, Hynek Hermansky, Pratibha Jain, Sachin S. Kajarekar, Nelson Morgan, Sunil Sivadas
2002	Quantile based histogram equalization for online applications. Florian Hilger, Sirko Molau, Hermann Ney
2002	Quantitative evaluation of relevant prosodic factors for text-to-speech synthesis in Spanish. David Escudero Mancebo, César González Ferreras, Valentín Cardeñoso-Payo
2002	RUSLANA: a database of Russian emotional utterances. Veronika Makarova, Valery A. Petrushin
2002	Radiodoc: a voice-accessible document system. Takuya Nishimoto, Masahiro Araki, Yasuhisa Niimi
2002	Rapid development of speech-to-speech translation systems. Alan W. Black, Ralf D. Brown, Robert E. Frederking, Kevin A. Lenzo, John Moody, Alexander I. Rudnicky, Rita Singh, Eric Steinbrecher
2002	Rapid speaker adaptation using speaker clustering. Ernest Pusateri, Timothy J. Hazen
2002	Real-time rich-content transcription of Chinese broadcast news. Daben Liu, Jeffrey Ma, Dongxin Xu, Amit Srivastava, Francis Kubala
2002	Real-time sound source localization and separation for robot audition. Kazuhiro Nakadai, Hiroshi G. Okuno, Hiroaki Kitano
2002	Recognition and verification of English by Japanese students for computer-assisted language learning system. Yasushi Tsubota, Tatsuya Kawahara, Masatake Dantsuji
2002	Recognition error processing for speech understanding. Caroline Bousquet-Vernhettes, Nadine Vigouroux
2002	Recognition of continuous speech segments of monophone units using support vector machines. Weifeng Lee, C. Chandra Sekhar, Kazuya Takeda, Fumitada Itakura
2002	Recognition of noisy speech using normalized moments. Jingdong Chen, Yiteng Huang, Qi Li, Frank K. Soong
2002	Recurrent neural network-enhanced HMM speech recognition systems. Jan W. F. Thirion, Elizabeth C. Botha
2002	Reducing pronunciation lexicon confusion and using more data without phonetic transcription for pronunciation modeling. Fang Zheng, Zhanjiang Song, Pascale Fung, William Byrne
2002	Reducing the footprint of the IBM trainable speech synthesis system. Dan Chazan, Ron Hoory, Zvi Kons, Dorel Silberstein, Alexander Sorin
2002	Reference resolution by human partners in a natural interactive problem-solving task. Ellen Campana, Sarah Brown-Schmidt, Michael K. Tanenhaus
2002	Refined speech segmentation for concatenative speech synthesis. Abhinav Sethy, Shrikanth S. Narayanan
2002	Refocussing on the text normalisation process in text-to-speech systems. Andrew P. Breen, Barry Eggleton, Peter Dion, Steve Minnis
2002	Reliability measures for translation quality. Eiichiro Sumita, Yasuhiro Akiba, Kenji Imamura
2002	Rethinking derived acoustic features in speech recognition. Kevin S. Van Horn
2002	Retrieving phrases by selecting the history: application to automatic speech recognition. David Langlois, Kamel Smaïli, Jean Paul Haton
2002	Risk based lattice cutting for segmental minimum Bayes-risk decoding. Shankar Kumar, William Byrne
2002	Robust HMM training for unified dutch and German speech recognition. Rathi Chengalvarayan
2002	Robust MMSE-FW-LAASR scheme at low SNRs. Tao Xu, Zhigang Cao
2002	Robust feature extraction in a variety of input devices on the basis of ETSI standard DSR front-end. Satoru Tsuge, Shingo Kuroiwa, Masami Shishibori, Fuji Ren, Kenji Kita
2002	Robust fundamental frequency estimation against background noise and spectral distortion. Tomohiro Nakatani, Toshio Irino
2002	Robust multiple resolution analysis for automatic speech recognition. Roberto Gemello, Franco Mana, Paolo Pegoraro, Renato De Mori
2002	Robust semantic confidence scoring. Didier Guillevic, Simona Gandrabur, Yves Normandin
2002	Robust speech / music classification in audio documents. Julien Pinquier, Jean-Luc Rouas, Régine André-Obrecht
2002	Robust speech recognition against short-time noise. Man-Hung Siu, Yu-Chung Chan
2002	Robust speech recognition using a voiced-unvoiced feature. András Zolnay, Ralf Schlüter, Hermann Ney
2002	Robust speech recognition using inter-speaker and intra-speaker adaptation. Baojie Li, Keikichi Hirose, Nobuaki Minematsu
2002	Robust time-synchronous environmental adaptation for continuous speech recognition systems. Thomas Plötz, Gernot A. Fink
2002	Robust voiced-unvoiced decision associated to continuous pitch tracking in noisy telephone speech. Mijail Arcienega, Andrzej Drygajlo
2002	Run time information fusion in speech recognition. Chengyi Zheng, Yonghong Yan
2002	SALT: a spoken language interface for web-based multimodal dialog systems. Kuansan Wang
2002	SPIN: language understanding for spoken dialogue systems using a production system approach. Ralf Engel
2002	SRILM - an extensible language modeling toolkit. Andreas Stolcke
2002	Same talker, different language: a replication. Verna Stockmal, Zinny S. Bond
2002	Seeing tongue movements from outside. Gérard Bailly, Pierre Badin
2002	Segment duration in spoken korean. Hyunsong Chung
2002	Segmentation of glides with tonal alignment as reference. Yi Xu, Fang Liu
2002	Selective back-off smoothing for incorporating grammatical constraints into the n-gram language model. Tomoyosi Akiba, Katunobu Itou, Atsushi Fujii, Tetsuya Ishikawa
2002	Selective multi-path acoustic model based on database likelihoods. Akinobu Lee, Yuichiro Mera, Hiroshi Saruwatari, Kiyohiro Shikano
2002	Semantic inference: a data-driven solution for NL interaction. Jerome R. Bellegarda
2002	Semantic structured language models. Hakan Erdogan, Ruhi Sarikaya, Yuqing Gao, Michael Picheny
2002	Separation of voiced source characteristics and vocal tract transfer function characteristics for speech sounds by iterative analysis based on AR-HMM model. Nobuyuki Nishizawa, Keikichi Hirose, Nobuaki Minematsu
2002	Sequential MAP noise estimation and a phase-sensitive model of the acoustic environment. Li Deng, Jasha Droppo, Alex Acero
2002	Serving complex user wishes with an enhanced spoken dialogue system. Sunna Torge, Stefan Rapp, Ralf Kompe
2002	Sharing relative stress of cross-word syllables and lexical stress to spontaneous speech recognition. Farshad Almasganj, Farhad D. Dehnavi, Mahmood Bijankhan
2002	Sharing trend information of trajectory in segmental-feature HMM. Young-Sun Yun
2002	Sign language translation using an error tolerant retrieval algorithm. Chung-Hsien Wu, Yu-Hsien Chiu, Kung-Wei Cheng
2002	Similarities of words in noise in Japanese. Kiyoko Yoneyama
2002	Sources of variability in the perceptual training of /r/ and /l/: interaction of adjacent vowel, word position, talkers² visual and acoustic cues. Debra M. Hardison
2002	Sparse and independent representations of speech signals based on parametric models. Hugo Leonardo Rufiner, Luís F. Rocha, John Goddard Close
2002	Speaker change detection using a new weighted distance measure. Soonil Kwon, Shrikanth S. Narayanan
2002	Speaker identification by location in an optimal space of anchor models. Yassine Mami, Delphine Charlet
2002	Speaker independent speech recognition using features based on glottal sound source. Norihide Kitaoka, Daisuke Yamada, Seiichi Nakagawa
2002	Speaker intelligibility of adults and children. D. Markham, Valérie Hazan
2002	Speaker recognition using discriminative features selection. Bogdan Sabac
2002	Speaker recognizability evaluation of a voicefont-based text-to-speech system. Masaharu Sakamoto, Takashi Saito
2002	Speaker utterances tying among speaker segmented audio documents using hierarchical classification: towards speaker indexing of audio databases. Sylvain Meignier, Jean-François Bonastre, Ivan Magrin-Chagnolleau
2002	Speaker verification using Gaussian component strings in dynamic trajectory space. Bing Xiang
2002	Speaker verification with data fusion and model adaptation. Kevin R. Farrell
2002	Speaking rate compensation based on likelihood criterion in acoustic model training and decoding. Kozo Okuda, Tatsuya Kawahara, Satoshi Nakamura
2002	Special session: issues in audiovisual spoken language processing (when, where, and how?). Lynne E. Bernstein, Denis Burnham, Jean-Luc Schwartz
2002	Specification and realisation of multimodal output in dialogue systems. Jonas Beskow, Jens Edlund, Magnus Nordstrand
2002	Spectral enhancement preprocessing for the HNM coding of noisy speech. Gautam Moharir, Pushkar Patwardhan, Preeti Rao
2002	Spectral subtraction in noisy environments applied to speaker adaptation based on HMM sufficient statistics. Shingo Yamade, Kanako Matsunami, Akira Baba, Akinobu Lee, Hiroshi Saruwatari, Kiyohiro Shikano
2002	Speech and language processing for a constrained speech translation system. Stephen Cox
2002	Speech coding and transmission for improved automatic recognition. Xin Zhong, Jon A. Arrowood, Mark A. Clements
2002	Speech completion: on-demand completion assistance using filled pauses for speech input interfaces. Masataka Goto, Katunobu Itou, Satoru Hayamizu
2002	Speech enhancement based on a perceptual modification of wiener filtering. Lee Lin, W. Harvey Holmes, Eliathamby Ambikairajah
2002	Speech enhancement based on combining perceptual enhancement and short-time spectral attenuation. Ilyas Potamitis, Nikos Fakotakis, George Kokkinakis
2002	Speech enhancement based on generalized singular value decomposition approach. Gwo-hwa Ju, Lin-Shan Lee
2002	Speech enhancement in car environment using blind source separation. Hiroshi Saruwatari, Katsuyuki Sawai, Akinobu Lee, Kiyohiro Shikano, Atsunobu Kaminuma, Masao Sakata
2002	Speech enhancement in non-stationary noise environments. Hyoung-Gook Kim, Dietmar Ruwisch
2002	Speech enhancement using wavelet packet transform. Sungwook Chang, Sung-il Jung, Younghun Kwon, Sung-Il Yang
2002	Speech modeling using variational Bayesian mixture of Gaussians. Panu Somervuo
2002	Speech pauses and gestural holds in parkinson²s disease. Francis K. H. Quek, Mary P. Harper, Yonca Haciahmetoglu, Lei Chen, Lorraine O. Ramig
2002	Speech recognition for language teaching and evaluating: a study of existing commercial products. Rebecca Hincks
2002	Speech recognition performance comparison between DSR and AMR transcoded speech. Holly Kelleher, David Pearce, Douglas Ealey, Laurent Mauuary
2002	Speech recognition using combined acoustic and articulatory information with retraining of acoustic model parameters. Ka-Yee Leung, Man-Hung Siu
2002	Speech recognition using fundamental frequency and voicing in acoustic modeling. Andrej Ljolje
2002	Speech recognition using syllable patterns. Li Zhang, William H. Edmondson
2002	Speech recognition with a re-speak method for subtitling live broadcasts. Toru Imai, Atsushi Matsui, Shinichi Homma, Takeshi Kobayakawa, Kazuo Onoe, Shoei Sato, Akio Ando
2002	Speech reconstruction from mel-frequency cepstral coefficients using a source-filter model. Ben Milner, Xu Shao
2002	Speech synthesis, speech simulation and speech science. Mark A. Huckvale
2002	Speech to speech translation system for monologues-data driven approach. Hideki Tanaka, Stephen Nightingale, Hideki Kashioka, Kenji Matsumoto, Masamchi Nishiwaki, Tadashi Kumano, Takehiko Maruyama
2002	Speech watermarking through parametric modeling. Aparna Gurijala, John R. Deller Jr., Michael S. Seadle, John H. L. Hansen
2002	Speech, music and songs discrimination in the context of handsets variability. Hassan Ezzaidi, Jean Rouat
2002	Speech-enabled natural language call routing: BBN call director. Premkumar Natarajan, Rohit Prasad, Bernhard Suhm, Daniel McCarthy
2002	Speech-to-speech translation system evaluation: results for French for the NESPOLE! project first showcase. Solange Rossato, Hervé Blanchon, Laurent Besacier
2002	Speechfind: an experimental on-line spoken document retrieval system for historical audio archives. Bowen Zhou, John H. L. Hansen
2002	Spoken dialogue system for home health care. Shinya Takahashi, Tsuyoshi Morimoto, Sakashi Maeda, Naoyuki Tsuruta
2002	State clustering improvements for continuous HMMs in a Spanish large vocabulary recognition system. Ricardo de Córdoba, Javier Macías Guarasa, Javier Ferreiros, Juan Manuel Montero, José Manuel Pardo
2002	Statistical adaptation of acoustic models to noise conditions for robust speech recognition. Ángel de la Torre, Dominique Fohr, Jean Paul Haton
2002	Statistical language modeling with prosodic boundaries and its use for continuous speech recognition. Keikichi Hirose, Nobuaki Minematsu, Makoto Terao
2002	Statistical machine translation decoder based on phrase. Taro Watanabe, Eiichiro Sumita
2002	Statistical natural language generation for speech-to-speech machine translation systems. Bowen Zhou, Yuqing Gao, Jeffrey S. Sorensen, Zijian Diao, Michael Picheny
2002	Statistically based approach to rejection of incorrectly recognized words. Ludek Müller, Tomás Bartos
2002	Stochastic suprasegmentals: relationship between the spectral characteristics of vowels, redundancy and prosodic structure. Matthew P. Aylett
2002	Stochastic trajectory model analysis for accent classification. Pongtep Angkititrakul, John H. L. Hansen
2002	Stop epenthesis at syllable boundaries. Natasha Warner, Andrea Weber
2002	Structural Gaussian mixture models for efficient text-independent speaker verification. Bing Xiang, Toby Berger
2002	Studying pronunciation variants in French by using alignment techniques. Philippe Boula de Mareüil, Martine Adda-Decker
2002	Subband based voice conversion. Oytun Türk, Levent M. Arslan
2002	Subjective assessment of frequency bands for perception of speaker identity. Eda Ormanci, U. Hakan Nikbay, Oytun Türk, Levent M. Arslan
2002	Submoraic awareness by Japanese school children: evidence from a novel game. Takashi Otake, Akemi Iijima
2002	Subset languages for conversing with collaborative interface agents. Candace L. Sidner, Clifton Forlines
2002	Subspace speech enhancement using subband whitening filter. Jong Uk Kim, Chang D. Yoo
2002	Suitable design of adaptive beamformer based on average speech spectrum for noisy speech recognition. Takanobu Nishiura, Satoshi Nakamura, Yuka Okada, Takeshi Yamada, Kiyohiro Shikano
2002	Swallowing and voice effects of lee silverman voice treatment (LSVT). Jeri Logemann, Ralph Sundin, Jean Sundin
2002	Syllable processing in English. Ruth Kearns, Dennis Norris, Anne Cutler
2002	Syllable recognition using syllable-segment statistics and syllable-based HMM. Nobutoshi Takahashi, Seiichi Nakagawa
2002	Syntax over focus. Sun-Ah Jun
2002	Talking to machines (statistically speaking). Steve J. Young
2002	Tempo modulations in English: selected pilot study results. Sandra P. Kirkham
2002	Text-dependent speaker verification using lyapunov exponents. Adriano Petry, Dante Augusto Couto Barone
2002	The 2001 GMTK-based SPINE ASR system. Özgür Çetin, Harriet J. Nock, Katrin Kirchhoff, Jeff A. Bilmes, Mari Ostendorf
2002	The 2ch hybrid subtractive beamformer applied to line sound sources. Mitsunori Mizumachi, Satoshi Nakamura
2002	The AT&t German text-to-speech system: realistic linguistic description. Matthias Jilka, Ann K. Syrdal
2002	The ISL meeting corpus: the impact of meeting type on speech style. Susanne Burger, Victoria MacLaren, Hua Yu
2002	The acoustic realization of anger, fear, joy and sadness in Chinese. Jiahong Yuan, Liqin Shen, Fangxin Chen
2002	The carnegie mellon communicator corpus. Christina L. Bennett, Alexander I. Rudnicky
2002	The effect of auditory-visual information and orthographic background in L2 acquisition. V. Dogu Erdener, Denis Burnham
2002	The effects of F0 manipulation on the perceived distance of speech. Douglas Brungart, Alexander J. Kordik, Koel Das, Arnab K. Shaw
2002	The effects of speech compression on speech recognition and text-to-speech synthesis. Yeshwant K. Muthusamy, Yifan Gong, Roshan Gupta
2002	The evolution of spoken language: a comparative approach. W. Tecumseh Fitch
2002	The influence of identification training on identification and production of the american English mid and low vowels by native speakers of Japanese. Stephen G. Lambacher, William L. Martens, Kazuhiko Kakehi
2002	The influence of speech coding on recognition performance in telecommunication networks. Hans-Günter Hirsch
2002	The perception of stop consonant sequences in dyslexic and normal children. Noël Nguyen, Ludovic Jankowski, Michel Habib
2002	The perceptual basis for audiovisual speech integration. Lawrence D. Rosenblum
2002	The relationship between pure-tone sequential stream segregation and perceptual separation of male and female talkers by listeners with hearing loss. Carol L. Mackersie
2002	The reliability of the ITU-t p.85 standard for the evaluation of text-to-speech systems. Yolanda Vazquez-Alvarez, Mark A. Huckvale
2002	The stimulus as basis for audiovisual integration. Eric Vatikiotis-Bateson, Harold Hill, Miyuki Kamachi, Karen Lander, Kevin G. Munhall
2002	The structure and its implementation of hidden dynamic HMM for Mandarin speech recognition. Feili Chen, Jie Zhu, Wentao Song
2002	Think big, from voice to limb movement therapy. Becky G. Farley
2002	Three-dimensional electromagnetic articulograph based on a nonparametric representation of the magnetic field. Tokihiko Kaburagi, Kohei Wakamiya, Masaaki Honda
2002	Time-compressing natural and synthetic speech. Esther Janse
2002	Time-frequency transforms and beamforming for speaker recognition. Antonio Satué-Villar, Juan Fernández-Rubio
2002	Tone recognition in Thai continuous speech based on coarticulaion, intonation and stress effects. Nuttakorn Thubthong, Boonserm Kijsirikul, Sudaporn Luksaneeyanawin
2002	Topic detection of an utterance for speech dialogue processing. Katsushi Asami, Toshiyuki Takezawa, Gen-ichiro Kikui
2002	Topic tracking using subject templates. Yoshimi Suzuki, Fumiyo Fukumoto, Yoshihiro Sekiguchi
2002	Towards a grammar of spoken language: incorporating paralinguistic information. Nick Campbell
2002	Towards an intonation module for a portuguese TTS system. Diamantino Freitas, Daniela Braga
2002	Towards automatic closed captioning : low latency real time broadcast news transcription. Murat Saraclar, Michael Riley, Enrico Bocchieri, Vincent Goffin
2002	Towards every-citizen²s speech interface: an application generator for speech interfaces to databases. Arthur R. Toth, Thomas K. Harris, James Sanders, Stefanie Shriver, Roni Rosenfeld
2002	Towards the question: why has speaking rate such an impact on speech recognition performance? Robert Faltlhauser, Günther Ruske, Matthias Thomae
2002	Training topic classifiers for conversational speech with limited data. Rukmini Iyer, Jeffrey Z. Ma, Herbert Gish, Owen Kimball
2002	Transducer search space modelings for large-vocabulary speech recognition. Hans J. G. A. Dolfing
2002	Transform-based feature vector compression for distributed speech recognition. Ben Milner, Xu Shao
2002	Transformation of spectral envelope for voice conversion based on radial basis function networks. Tomomi Watanabe, Takahiro Murakami, Munehiro Namba, Tetsuya Hoya, Yoshihisa Ishida
2002	Transmission characteristics of outer ear canal. Karel Pellant, Jan Mejzlík, Karel Prikryl, Zdenek Skvor
2002	Tree-structured maximum a posteriori adaptation for a segment-based speech recognition system. Irina Illina
2002	Unconstrained versus constrained acoustic normalisation in confidence scoring. Jacques Duchateau, Patrick Wambacq
2002	Unified task knowledge for spoken language understanding and dialog management. Jerry H. Wright, Alicia Abella, Allen L. Gorin
2002	Unknown-multiple speaker clustering using HMM. Jitendra Ajmera, Hervé Bourlard, I. Lapidot, Iain McCowan
2002	Unsupervised acoustic model adaptation based on phoneme error minimization. Jun Ogata, Yasuo Ariki
2002	Unsupervised language model adaptation for lecture speech transcription. Thomas Niesler, Daniel Willett
2002	Unsupervised n-best based model adaptation using model-level confidence measures. Ka-Yan Kwan, Tan Lee, Chen Yang
2002	Unsupervised speaker segmentation of telephone conversations. Aaron E. Rosenberg, Allen L. Gorin, Zhu Liu, Sarangarajan Parthasarathy
2002	User-customized password speaker verification based on HMM/ANN and GMM models. Mohamed Faouzi BenZeghiba, Hervé Bourlard
2002	User-tailored generation for spoken dialogue: an experiment. Amanda Stent, Marilyn A. Walker, Steve Whittaker, Preetam Maloor
2002	Using EM-trained string-edit distances for approximate matching of acoustic morphemes. Michael Levit, Elmar Nöth, Allen L. Gorin
2002	Using adaptive signal limiter together with weighting techniques for noisy speech recognition. Wei-Wen Hung
2002	Using cross-language cues for story-specific language modeling. Sanjeev Khudanpur, Woosung Kim
2002	Using dynamic WFST composition for recognizing broadcast news. Diamantino Caseiro, Isabel Trancoso
2002	Using observation uncertainty in HMM decoding. Jon A. Arrowood, Mark A. Clements
2002	Using part-of-speech tags, context thresholding, and trigram contexts to improve the auto-induction of semantic classes. Andrew N. Pargellis, Eric Fosler-Lussier, Augustine Tsai
2002	Using start/end timings of spectral transitions between phonemes in concatenative speech synthesis. Toshio Hirai, Seiichi Tenpaku, Kiyohiro Shikano
2002	Using time-stretched pulses for accurate splitting of speech utterances played back in noisy reverberant environments. Dorothea Kolossa, Qiang Huo
2002	Using x-grams for speech-to-speech translation. Adrià de Gispert, José B. Mariño
2002	Utterance verification based on neighborhood information and Bayes factors. Hui Jiang, Chin-Hui Lee
2002	Validation and improvement of automatic phonetic transcriptions. Catia Cucchiarini, Diana Binnenpoorte
2002	Variability in direction of dorsal movement during production of /l/. Natasha Warner, Allard Jongman, Doris Mcke
2002	Variability in the production of glottalized sonorants: data from yapese. Ian Maddieson, Julie Larson
2002	VisSTA: a tool for analyzing multimodal discourse data. Francis K. H. Quek, Yang Shi, Cemil Kirbas, Shunguang Wu
2002	Vocabulary independent OOV detection using support vector machines. Tommi Lahti, Janne Suontausta
2002	Vocalization age as a clinical tool. Harriet J. Fell, Joel MacAuslan, Linda J. Ferrier, Susan G. Worst, Karen Chenausky
2002	Voice transformations for improving children²s speech recognition in a publicly available dialogue system. Joakim Gustafson, Kåre Sjölander
2002	Vowel classification for computer-based visual feedback for speech training for the hearing impaired. Stephen A. Zahorian, A. Matthew Zimmer, Fansheng Meng
2002	Warped-LP residual resampling using DCT for pitch modification. R. Muralishankar, A. G. Ramakrishnan, P. Prathibha
2002	Weighted graph based decision tree optimization for high accuracy acoustic modeling. Sheng Gao, Jinsong Zhang, Satoshi Nakamura, Chin-Hui Lee, Tat-Seng Chua
2002	What relationship between protrusion anticipation and auditory perception? Rudolph Sock, Béatrice Vaxelaire, Véronique Hecker, Fabrice Hirsch
2002	Wizard of oz evaluation of a dialogue with communicator system in Chile. Néstor Becerra Yoma, Angela Cortés, Mauricio Hormazábal, Enrique López
2002	Word endpoints detection in the presence of non-stationary noise. Mario Toma, Andrea Lodi, Roberto Guerrieri
2002	X-JToBI: an extended j-toBI for spontaneous speech. Kikuo Maekawa, Hideaki Kikuchi, Yosuke Igarashi, Jennifer J. Venditti