| 2002 | 10 years of phondat-II: a reassessment. Hartmut R. Pfitzinger |
| 2002 | 2-d processing of speech with application to pitch estimation. Thomas F. Quatieri |
| 2002 | 7th International Conference on Spoken Language Processing, ICSLP2002 - INTERSPEECH 2002, Denver, Colorado, USA, September 16-20, 2002. John H. L. Hansen, Bryan L. Pellom |
| 2002 | A Gaussian selection method for multi-mixture HMM based continuous speech recognition. Raymond H. Lee, Eric H. C. Choi |
| 2002 | A case study of portuguese and English bilinguality. Luis M. T. Jesus, Christine H. Shadle |
| 2002 | A combined model of statics-dynamics of speech optimized using maximum mutual information. Zhijian Ou, Zuoying Wang |
| 2002 | A comparative study of adaptation methods for speaker verification. Johnny Mariéthoz, Samy Bengio |
| 2002 | A comparative study of approximations for parallel model combination of static and dynamic parameters. Yifan Gong |
| 2002 | A comparison between feedback strategies in human-to-human and human-machine communication. Loredana Cerrato |
| 2002 | A comparison of HTK, ISIP and julius in slovenian large vocabulary continuous speech recognition. Tomaz Rotovnik, Mirjam Sepesy Maucec, Bogomir Horvat, Zdravko Kacic |
| 2002 | A comparison of L1 and african-mother-tongue acoustic models for south african English speech recognition. Janus D. Brink, Elizabeth C. Botha |
| 2002 | A comparison of four language models for large vocabulary turkish speech recognition. Helin Dutagaci, Levent M. Arslan |
| 2002 | A comparison of front-end analyses for Thai speech recognition. Montri Karnjanadecha, Patimakorn Kimsawad |
| 2002 | A comparison of two LVR search optimization techniques. Stephan Kanthak, Hermann Ney, Michael Riley, Mehryar Mohri |
| 2002 | A confidence measure based on agreement among multiple LVCSR models - correlation between pair of acoustic models and confidence. Takehito Utsuro, Tetsuji Harada, Hiromitsu Nishizaki, Seiichi Nakagawa |
| 2002 | A context clustering technique for average voice model in HMM-based speech synthesis. Junichi Yamagishi, Masatsune Tamura, Takashi Masuko, Keiichi Tokuda, Takao Kobayashi |
| 2002 | A copy synthesis method to pilot the klatt synthesiser. Yves Laprie, Anne Bonneau |
| 2002 | A corpus-based study of danish laryngealization. Kathleen Murray, Betina Simonsen |
| 2002 | A data-driven approach to source-formant type text-to-speech system. Hiroki Mori, Takahiro Ohtsuka, Hideki Kasuya |
| 2002 | A distributed multimodal dialogue system based on dialogue system and web convergence. Feng Liu, Antoine Saad, Li Li, Wu Chou |
| 2002 | A figure of merit for the analysis of spoken dialog systems. Kadri Hacioglu, Wayne H. Ward |
| 2002 | A flexible stream architecture for ASR using articulatory features. Florian Metze, Alex Waibel |
| 2002 | A handset identifier using support vector machines. Purdy Ho |
| 2002 | A hybrid HMM/traps model for robust voice activity detection. Brian Kingsbury, Pratibha Jain, André Gustavo Adami |
| 2002 | A hybrid approach to compounds in LVCSR. Tom Laureys, Vincent Vandeghinste, Jacques Duchateau |
| 2002 | A link between cepstral shrinking and the weighted product rule in audio-visual speech recognition. Simon Lucey, Sridha Sridharan, Vinod Chandran |
| 2002 | A low-resource, miniature implementation of the ETSI distributed speech recognition front-end. Etienne Cornu, Hamid Sheikhzadeh, Robert L. Brennan |
| 2002 | A maximum entropy semantic parser using word classes. Norbert Pfannerer |
| 2002 | A method for evaluating incremental utterance understanding in spoken dialogue systems. Ryuichiro Higashinaka, Noboru Miyazaki, Mikio Nakano, Kiyoaki Aikawa |
| 2002 | A miniature Chinese TTS system based on tailored corpus. Zhiwei Shuang, Yu Hu, Zhen-Hua Ling, Ren-Hua Wang |
| 2002 | A modality-independent MMI system architecture. Kouichi Katsurada, Yoshihiko Ootani, Yusaku Nakamura, Satoshi Kobayashi, Hirobumi Yamada, Tsuneo Nitta |
| 2002 | A multi-class approach for modelling out-of-vocabulary words. Issam Bazzi, James R. Glass |
| 2002 | A new approach to speech enhancement by a microphone array using EM and mixture models. Hagai Attias, Li Deng |
| 2002 | A new computer-based analytical speech perception test for prelingually deaf children and children with speech disorders. Anne-Marie Öster |
| 2002 | A new lexicon optimization method for LVCSR based on linguistic and acoustic characteristics of words. Takahiro Shinozaki, Sadaoki Furui |
| 2002 | A new method for testing dialogue systems based on simulations of real-world conditions. Ramón López-Cózar, Ángel de la Torre, José C. Segura, Antonio J. Rubio, Juan M. López-Soler |
| 2002 | A new method of building decision tree based on target information. Yi-Jian Wu, Yu Hu, Xiaoru Wu, Ren-Hua Wang |
| 2002 | A perceptually motivated subspace approach for speech enhancement. Yi Hu, Philipos C. Loizou |
| 2002 | A phoneme recognizer for the hearing impaired. Mathias Johansson, Mats Blomberg, Kjell Elenius, Lars-Erik Hoffsten, Anders Torberger |
| 2002 | A phonetic study of vietnamese tones: acoustic and electroglottographic measurements. Vu Ngoc Tuan, Christophe d'Alessandro, Sophie Rosset |
| 2002 | A portable, server-side dialog framework for voiceXML. Bob Carpenter, Sasha Caskey, Krishna Dayanidhi, Caroline Drouin, Roberto Pieraccini |
| 2002 | A pragmatic confirmation mechanism for an object-based spoken dialogue manager. Ian M. O'Neill, Michael F. McTear |
| 2002 | A psychoacoustic basis for spectral sharpening. Peggy B. Nelson, Jeffrey J. DiGiovanni, Robert S. Schlauch |
| 2002 | A real-time acoustic human-machine front-end for multimedia applications integrating robust adaptive beamforming and stereophonic acoustic echo cancellation. Wolfgang Herbordt, J. Ying, Herbert Buchner, Walter Kellermann |
| 2002 | A reverse turing test using speech. Greg Kochanski, Daniel P. Lopresti, Chilin Shih |
| 2002 | A sound source classification system based on subband processing. Oytun Türk, Ömer Sayli, Helin Dutagaci, Levent M. Arslan |
| 2002 | A sparse modeling approach to speech recognition based on relevance vector machines. Jonathan E. Hamaker, Joseph Picone, Aravind Ganapathiraju |
| 2002 | A spatio-temporal speech enhancement scheme for robust speech recognition. Erik M. Visser, Manabu Otsuka, Te-Won Lee |
| 2002 | A state-tying approach to building syllable HMMs. Darryl Stewart, Ming Ji, Philip Hanna, Francis Jack Smith |
| 2002 | A statistically motivated database pruning technique for unit selection synthesis. Peter Rutten, Matthew P. Aylett, Justin Fackrell, Paul Taylor |
| 2002 | A study of multi-speaker dialogue system for mobile information retrieval. Hsien-Chang Wang, Chieh-Yi Huang, Chung-Hsien Yang, Jhing-Fa Wang |
| 2002 | A study of the two-mass model in terms of acoustic parameters. Denisse Sciamarella, Christophe d'Alessandro |
| 2002 | A study on the classification of whispered and normally phonated speech. Stanley J. Wenndt, Edward J. Cupples, Richard M. Floyd |
| 2002 | A system that learns to describe objects in visual scenes. Deb Roy |
| 2002 | A text-to-speech synthesis system for telugu. Jithendra Vepa, Jahnavi Ayachitam, K. V. K. Kalpana Reddy |
| 2002 | A trainable spoken language understanding system for visual object selection. Deb Roy, Peter Gorniak, Niloy Mukherjee, Joshua Juster |
| 2002 | A training prompts generation algorithm for connected spoken word recognition. Ha-Jin Yu, Jin Suk Kim |
| 2002 | ACIMET: access to meteorological information by telephone. Jaume Padrell, Javier Hernando |
| 2002 | ACT: a graphical dialogue annotation comparison tool. Fan Yang, Susan E. Strayer, Peter A. Heeman |
| 2002 | ASR dependent techniques for speaker identification. Alex Park, Timothy J. Hazen |
| 2002 | ASR in a human word recognition model: generating phonemic input for shortlist. Odette Scharenborg, Lou Boves, Johan de Veth |
| 2002 | AT&T help desk. Giuseppe Di Fabbrizio, Dawn Dutton, Narendra K. Gupta, Barbara Hollister, Mazin G. Rahim, Giuseppe Riccardi, Robert E. Schapire, Juergen Schroeter |
| 2002 | Absolute pitch and lexical tones: tone perception by non-musician, musician, and absolute pitch non-tonal language speakers. Denis K. Burnham, Ron Brooker |
| 2002 | Access to homophonic meanings during spoken language comprehension: effects of context and neighborhood density. Michael C. W. Yip |
| 2002 | Accounting for perceptual identification of consonants and vowels through acoustic dissimilarity. Jianxia Xue, Sumiko Takayanagi, Lynne E. Bernstein |
| 2002 | Accumulated kullback divergence for analysis of ASR performance in the presence of noise. Febe de Wet, Johan de Veth, Bert Cranen, Lou Boves |
| 2002 | Acoustic and word lattice based algorithms for confidence scores. Daniele Falavigna, Roberto Gretter, Giuseppe Riccardi |
| 2002 | Acoustic correlates of task load and stress. Klaus R. Scherer, Didier Grandjean, Tom Johnstone, Gudrun Klasmeyer, Thomas Bänziger |
| 2002 | Acoustic echo cancellation based on m-channel IIR cosine-modulated filter bank. Sang-Gyun Kim, Chang D. Yoo |
| 2002 | Acoustic measures vs. phonetic features as predictors of audible discontinuity in concatenative speech synthesis. Hisashi Kawai, Minoru Tsuzaki |
| 2002 | Acoustic modeling of sentence stress using differential features between syllables for English rhythm learning system development. Nobuaki Minematsu, Satoshi Kobashikawa, Keikichi Hirose, Donna Erickson |
| 2002 | Acoustic-to-articulatory inverse mapping using an HMM-based speech production model. Sadao Hiroya, Masaaki Honda |
| 2002 | Acoustical correlates to SD ratings of speaker characteristics in two speaking styles. Yasuki Yamashita, Hiroshi Matsumoto |
| 2002 | Active speech cancellation for cellular speech. Kazuhiro Kondo, Kiyoshi Nakagawa |
| 2002 | Adaptation of users' spoken dialogue patterns in a conversational interface. Courtney Darves, Sharon L. Oviatt |
| 2002 | Adaptive estimation of time-varying features from high-pitched speech based on an excitation source HMM. Akira Sasou, Kazuyo Tanaka |
| 2002 | Adaptive model combination for dynamic speaker selection training. Chao Huang, Tao Chen, Eric Chang |
| 2002 | Adding intelligent help to mixed-initiative spoken dialogue systems. Genevieve Gorrell, Ian Lewin, Manny Rayner |
| 2002 | Algorithms for distributed speech recognition in a noisy automobile environment. Hong Kook Kim, Richard C. Rose |
| 2002 | All-pole modeling of wide-band speech using weighted sum of the LSP polynomials. Paavo Alku, Tom Bäckström |
| 2002 | Amplitude convergence in children's conversational speech with animated personas. Rachel Coulston, Sharon L. Oviatt, Courtney Darves |
| 2002 | An EPG therapy protocol for remediation and assessment of articulation disorders. Alan Wrench, Fiona Gibbon, Alison M. McNeill, Sara Wood |
| 2002 | An IPA vowel diagram approach to analysing L1 effects on vowel production and perception. Olga I. Dioubina, Hartmut R. Pfitzinger |
| 2002 | An acoustic comparison between american English and australian English vowels. Kimiko Tsukada |
| 2002 | An adaptive speaker verification system with speaker dependent a priori decision thresholds. Nikki Mirghafori, Larry P. Heck |
| 2002 | An analysis of the causes of increased error rates in children's speech recognition. Qun Li, Martin J. Russell |
| 2002 | An analysis of transcription consistency in spontaneous speech from the buckeye corpus. William D. Raymond, Mark A. Pitt, Keith Johnson, Elizabeth Hume, Matthew J. Makashay, Robin Dautricourt, Craig Hilts |
| 2002 | An architecture for a multi-modal web browser. Cristiana Armaroli, Ivano Azzini, Lorenza Ferrario, Toni Giorgino, Luca Nardelli, Marco Orlandi, Carla Rognoni |
| 2002 | An audio-visual corpus for multimodal speech recognition in dutch language. Jacek C. Wojdel, Pascal Wiggers, Léon J. M. Rothkrantz |
| 2002 | An automatic sentence boundary detector based on a structured language model. Shinsuke Mori |
| 2002 | An education software in teaching automatic speech recognition (ASR). Hong Kai Sze, Sh-Hussain Salleh |
| 2002 | An effect of amplitude modulation on perceptual segregation of tone sequences. Mamoru Iwaki, Hiromi Seki |
| 2002 | An effective unsupervised scheme for multiple-speaker-change detection. P. Sivakumaran, Aladdin M. Ariyaeeinia, J. Fortuna |
| 2002 | An efficient algorithm for the n-best-strings problem. Mehryar Mohri, Michael Riley |
| 2002 | An efficient dialogue control method using decision tree-based estimation of out-of-vocabulary word attributes. Yasuhiro Takahashi, Kohji Dohsaka, Kiyoaki Aikawa |
| 2002 | An environment compensated minimum classification error training approach and its evaluation on Aurora2 database. Jian Wu, Qiang Huo |
| 2002 | An evaluation of using mutual information for selection of acoustic-features representation of phonemes for speech recognition. Mohamed Kamal Omar, Ken Chen, Mark Hasegawa-Johnson, Yigal Brandman |
| 2002 | Analysis and synthesis of the phonatory excitation signal by means of a pair of polynomial shaping functions. Jean Schoentgen |
| 2002 | Analysis of user behavior under error conditions in spoken dialogs. Jongho Shin, Shrikanth S. Narayanan, Laurie Gerber, Abe Kazemzadeh, Dani Byrd |
| 2002 | Application of microprosody models in text to speech synthesis. Phuay Hui Low, Saeed Vaseghi |
| 2002 | Application of over-complete blind source separation for robust automatic speech recognition. Shubha Kadambe |
| 2002 | Application of real-time AMDF pitch-detection in a voice gender normalisation system. E. Jung, A. Th. Schwarzbacher, K. Humphreys, Robert Lawlor |
| 2002 | Application of the lee silverman voice treatment (LSVT) to individuals with multiple sclerosis, ataxic dysarthria, and stroke. Leslie Will, Lorraine O. Ramig, Jennifer L. Spielman |
| 2002 | Applying a hybrid intonation model to a seamless speech synthesizer. Takashi Saito, Masaharu Sakamoto |
| 2002 | Applying fallback to prosodic unit selection from a small imitation database. Joram Meron |
| 2002 | Approaches to language identification using Gaussian mixture models and shifted delta cepstral features. Pedro A. Torres-Carrasquillo, Elliot Singer, Mary A. Kohler, Richard J. Greene, Douglas A. Reynolds, John R. Deller Jr. |
| 2002 | Arc minimization in finite state decoding graphs with cross-word acoustic context. Geoffrey Zweig, George Saon, François Yvon |
| 2002 | Assessment of consonant articulation in glossectomee speech by dynamic MRI. Katalin Mády, Robert Sader, Alexander Zimmermann, Philip Hoole, Ambros Beer, Hans-Florian Zeilhofer, Ch. Hannig |
| 2002 | Audio-visual continuous speech recognition using a coupled hidden Markov model. Xiaoxing Liu, Yibao Zhao, Xiaobo Pi, Luhong Liang, Ara V. Nefian |
| 2002 | Audio-visual scene analysis: evidence for a "very-early" integration process in audio-visual speech perception. Jean-Luc Schwartz, Frédéric Berthommier, Christophe Savariaux |
| 2002 | Audio-visual speech enhancement with AVCDCN (audio-visual codebook dependent cepstral normalization). Sabine Deligne, Gerasimos Potamianos, Chalapathy Neti |
| 2002 | Audio-visual speech sources separation: a new approach exploiting the audio-visual coherence of speech stimuli. David Sodoyer, Laurent Girin, Christian Jutten, Jean-Luc Schwartz |
| 2002 | Audiovisual integration of speech by children and adults with cochlear implants. Karen Iler Kirk, David B. Pisoni, Lorin Lachs |
| 2002 | Audiovisual perception in L2 learners. Valérie Hazan, Anke Sennema, Andrew Faulkner |
| 2002 | Audiovisual speech synthesis. from ground truth to models. Gérard Bailly |
| 2002 | Auditory fovea based speech enhancement and its application to human-robot dialog system. Kazuhiro Nakadai, Hiroshi G. Okuno, Hiroaki Kitano |
| 2002 | Auditory-visual speech perception examined by brain imaging and reaction time. Kaoru Sekiyama, Yoichi Sugita |
| 2002 | Automatic concept identification in goal-oriented conversations. Ananlada Chotimongkol, Alexander I. Rudnicky |
| 2002 | Automatic enrollment for speaker authentication. Qi Li, Hui Jiang, Qiru Zhou, Jinsong Zheng |
| 2002 | Automatic extraction of model parameters from fundamental frequency contours of English utterances. Shuichi Narusawa, Nobuaki Minematsu, Keikichi Hirose, Hiroya Fujisaki |
| 2002 | Automatic generation of phonetic transcriptions for large speech corpora. Kris Demuynck, Tom Laureys, Steven Gillis |
| 2002 | Automatic intelligibility assessment and diagnosis of critical pronunciation errors for computer-assisted pronunciation learning. Antoine Raux, Tatsuya Kawahara |
| 2002 | Automatic language identification using acoustic sub-word units. A. K. V. Sai Jayram, V. Ramasubramanian, T. V. Sreenivas |
| 2002 | Automatic phoneme alignment based on acoustic-phonetic modeling. John-Paul Hosom |
| 2002 | Automatic prosodic break labeling for Mandarin Chinese speech data. Minghui Dong, Kim-Teng Lua |
| 2002 | Automatic punctuation and disfluency detection in multi-party meetings using prosodic and lexical cues. Don Baron, Elizabeth Shriberg, Andreas Stolcke |
| 2002 | Automatic recognition of dutch dysarthric speech: a pilot study. Eric Sanders, Marina B. Ruiter, Lilian Beijer, Helmer Strik |
| 2002 | Automatic segmentation combining an HMM-based approach and spectral boundary correction. Yeon-Jun Kim, Alistair Conkie |
| 2002 | Automatic sign translation. Ying Zhang, Bing Zhao, Jie Yang, Alex Waibel |
| 2002 | Automatic transcription of courtroom speech. Rohit Prasad, Long Nguyen, Richard M. Schwartz, John Makhoul |
| 2002 | Automatic user-adaptive speaking rate selection for information delivery. Nigel Ward, Satoshi Nakagawa |
| 2002 | Auxiliary variables in conditional Gaussian mixtures for automatic speech recognition. Todd A. Stephenson, Mathew Magimai-Doss, Hervé Bourlard |
| 2002 | Backoff hierarchical class n-gram language modelling for automatic speech recognition systems. Imed Zitouni, Olivier Siohan, Hong-Kwang Jeff Kuo, Chin-Hui Lee |
| 2002 | Baldini: baldi speaks italian! Piero Cosi, Michael M. Cohen, Dominic W. Massaro |
| 2002 | Bark resolution from speech data. Naren Malayath, Hynek Hermansky |
| 2002 | Basque intonation modelling for text to speech conversion. Eva Navas, Inmaculada Hernáez, Juan María Sánchez |
| 2002 | Basurde[lite], a machine-driven dialogue system for accessing railway timetable information. Roger Trias-Sanz, José B. Mariño |
| 2002 | Belief network based disambiguation of object reference in spoken dialogue system for robot. Yoko Yamakata, Tatsuya Kawahara, Hiroshi G. Okuno |
| 2002 | Bell labs approach to Aurora evaluation on connected digit recognition. Jingdong Chen, Dimitris Dimitriadis, Hui Jiang, Qi Li, Tor André Myrvoll, Olivier Siohan, Frank K. Soong |
| 2002 | Benefit and cost analysis of using the improved vector quantizer design algorithm for glottal source waveform compression. Peter Veprek, Alan B. Bradley |
| 2002 | Bilingual corpus cleaning focusing on translation literality. Kenji Imamura, Eiichiro Sumita |
| 2002 | Blind normalization of speech from different channels and speakers. David N. Levin |
| 2002 | Bridges: regions between discourse segments. Nanette Veilleux |
| 2002 | Building an ASR system for noisy environments: SRI's 2001 SPINE evaluation system. Venkata Ramana Rao Gadde, Andreas Stolcke, Dimitra Vergyri, Jing Zheng, M. Kemal Sönmez, Anand Venkataraman |
| 2002 | Building voiceXML-based applications. Christina L. Bennett, Ariadna Font Llitjós, Stefanie Shriver, Alexander I. Rudnicky, Alan W. Black |
| 2002 | CU VOCAL: corpus-based syllable concatenation for Chinese speech synthesis across domains and dialects. Helen M. Meng, Chi-Kin Keung, Kai-Chung Siu, Tien Ying Fung, P. C. Ching |
| 2002 | CU animate tools for enabling conversations with animated characters. Jiyong Ma, Jie Yan, Ronald A. Cole |
| 2002 | Can confidence scores help users post-editing speech recognizer output? Taku Endo, Nigel Ward, Minoru Terada |
| 2002 | Channel error protection scheme for distributed speech recognition. Zheng-Hua Tan, Paul Dalsgaard |
| 2002 | Channel noise robustness for low-bitrate remote speech recognition. Alexis Bernard, Abeer Alwan |
| 2002 | Characteristics of a low reject mode speaker verification system. Daniel Elenius, Mats Blomberg |
| 2002 | Chinese spoken language analyzing based on combination of statistical and rule methods. Guodong Xie, Chengqing Zong, Bo Xu |
| 2002 | Choosing speech or touchtone modality for navigation within a telephony natural language system. Jennifer C. Lai, Kwan Min Lee |
| 2002 | Classification error from the theoretical Bayes classification risk. Erik McDermott, Shigeru Katagiri |
| 2002 | Cluster identification for speaker-environment tracking. J. T. Wickramaratna, Philip C. Woodland |
| 2002 | Clustering and feature learning based F0 prediction for Chinese speech synthesis. Jianhua Tao, Lianhong Cai |
| 2002 | Codebook dependent dynamic channel estimation for Mandarin speech recognition over telephone. Huayun Zhang, Zhaobing Han, Bo Xu |
| 2002 | Coding speech at very low rates using straight and temporal decomposition. Phu Chien Nguyen, Takao Ochi, Masato Akagi |
| 2002 | Collecting mobile multimodal data for match. Patrick Ehlen, Michael Johnston, Gunaranjan Vasireddy |
| 2002 | Combination of pause and F0 information in dependency analysis of Japanese sentences. Kazuyuki Takagi, Hajime Kubota, Kazuhiko Ozeki |
| 2002 | Combination of statistical and rule-based approaches for spoken language understanding. Ye-Yi Wang, Alex Acero, Ciprian Chelba, Brendan J. Frey, Leon Wong |
| 2002 | Combined binary classifiers with applications to speech recognition. Aldebaro Klautau, Nikola Jevtic, Alon Orlitsky |
| 2002 | Combined prosody and candidate unit selections for corpus-based text-to-speech systems. Francisco Campillo Díaz, Eduardo Rodríguez Banga |
| 2002 | Combining a Gaussian mixture model front end with MFCC parameters. Matthew N. Stuttle, Mark J. F. Gales |
| 2002 | Combining acoustic and language information for emotion recognition. Chul Min Lee, Shrikanth S. Narayanan, Roberto Pieraccini |
| 2002 | Combining information sources for memory-based pitch accent placement. Erwin Marsi, Bertjan Busser, Walter Daelemans, Véronique Hoste, Martin Reynaert, Antal van den Bosch |
| 2002 | Combining lexical and morphological knowledge in language model for inflectional (czech) language. Jan Nouza, Jindra Drabkova |
| 2002 | Combining maximum likelihood and maximum a posteriori estimation for detailed acoustic modeling of context dependency. Michiel Bacchiani |
| 2002 | Combining search spaces of heterogeneous recognizers for improved speech recognition. Xiang Li, Rita Singh, Richard M. Stern |
| 2002 | Combining speaker and speech recognition systems. Larry P. Heck, Dominique Genoud |
| 2002 | Comfort noise detection and GSM-FR-codec detection for speech-quality evaluations in telephone networks. Thorsten Ludwig |
| 2002 | Compact subnetwork-based large vocabulary continuous speech recognition. Dong-Hoon Ahn, Minhwa Chung |
| 2002 | Comparative evaluation of CASA and BSS models for subband cocktail-party speech separation. Frédéric Berthommier, Seungjin Choi |
| 2002 | Comparative experiments to evaluate the use of auditory-based acoustic distinctive features and formant cues for automatic speech recognition using a multi-stream paradigm. Hesham Tolba, Sid-Ahmed Selouani, Douglas D. O'Shaughnessy |
| 2002 | Comparing intelligibility of several non-native accent classes in noise. Shawn A. Weil |
| 2002 | Comparing isolately spoken keywords with spontaneously spoken queries for Japanese spoken document retrieval. Hiromitsu Nishizaki, Seiichi Nakagawa |
| 2002 | Comparison and combination of RASTA-PLP and FF features in a hybrid HMM/MLP speech recognition system. Pere Pujol Marsal, Susagna Pol, Astrid Hagen, Hervé Bourlard, Climent Nadeu |
| 2002 | Comparison of acoustic distance measures for automatic cross-language phoneme mapping. Jayren J. Sooful, Elizabeth C. Botha |
| 2002 | Compensating for hyperarticulation by modeling articulatory properties. Hagen Soltau, Florian Metze, Alex Waibel |
| 2002 | Compensation of channel effect on line spectrum frequencies. An-Tze Yu, Hsiao-Chuan Wang |
| 2002 | Comprehension of non-native speech: inaccurate phoneme processing and activation of lexical competitors. Mirjam Broersma |
| 2002 | Computationally efficient method of speech enhancement based on block representation of signal in state space and vector quantization. Vasyl Semenov, Alexander Kovtonyuk, Alexander Kalyuzhny |
| 2002 | Computationally efficient noise compensation for robust automatic speech recognition assessed under the Aurora 2/3 framework. Nicholas W. D. Evans, John S. D. Mason |
| 2002 | Computationally efficient time-scale modification of speech using 3 level clipping. Sung-Joo Lee, Hyung Soon Kim |
| 2002 | Computer-assisted second-language speech learning: generalization of prosody-focused training. Debra M. Hardison |
| 2002 | Confidence metrics for speaker identification. Mark C. Huggins, John J. Grieco |
| 2002 | Confusion-based query expansion for OOV words in spoken document retrieval. Beth Logan, Jean-Manuel Van Thong |
| 2002 | Constructing shared-state hidden Markov models based on a Bayesian approach. Shinji Watanabe, Yasuhiro Minami, Atsushi Nakamura, Naonori Ueda |
| 2002 | Constructing small language models from grammars. Francis Picard, Dominique Boucher, Guy Lapalme |
| 2002 | Construction of decision tree from data driven clustering. Junho Park, Hanseok Ko |
| 2002 | Contextual effects in the perception of fricative place of articulation: a rotational hypothesis. Willy Serniclaes, René Carré |
| 2002 | Contextual effects on voicing judgment of stop consonants in Japanese. Makiko Aoyagi |
| 2002 | Continuous environmental adaptation of a speech recogniser in telephone line conditions. Carlos M. G. S. Lima, Luís B. Almeida, João L. Monteiro |
| 2002 | Contribution to topic identification by using word similarity. Armelle Brun, Kamel Smaïli, Jean Paul Haton |
| 2002 | Control system for talking robot to replicate articulatory movement of natural speech. Takemi Mochida, Masaaki Honda, Kouki Hayashi, Toshiharu Kuwae, Kunihiro Tanahashi, Kazufumi Nishikawa, Atsuo Takanishi |
| 2002 | Controlling anticipatory behavior for rounding in French cued speech. Virginie Attina, Marie-Agnès Cathiard, Denis Beautemps |
| 2002 | Controlling perceived degradation in spectrum envelope modeling via predistortion. Pushkar Patwardhan, Preeti Rao |
| 2002 | Coordination of hand and orofacial movements for CV sequences in French cued speech. Virginie Attina, Denis Beautemps, Marie-Agnès Cathiard |
| 2002 | Coordination of referring expressions in multimodal human-computer dialogue. Gabriel Skantze |
| 2002 | Corpus-based analysis of English spoken by Japanese students in view of the entire phonemic system of English. Nobuaki Minematsu, Gakuto Kurata, Keikichi Hirose |
| 2002 | DARPA communicator evaluation: progress from 2000 to 2001. Marilyn A. Walker, Alexander I. Rudnicky, John S. Aberdeen, Elizabeth Owen Bratt, John S. Garofolo, Helen Wright Hastie, Audrey N. Le, Bryan L. Pellom, Alexandros Potamianos, Rebecca J. Passonneau, Rashmi Prasad, Salim Roukos, Gregory A. Sanders, Stephanie Seneff, David Stallard |
| 2002 | DARPA communicator: cross-system results for the 2001 evaluation. Marilyn A. Walker, Alexander I. Rudnicky, Rashmi Prasad, John S. Aberdeen, Elizabeth Owen Bratt, John S. Garofolo, Helen Wright Hastie, Audrey N. Le, Bryan L. Pellom, Alexandros Potamianos, Rebecca J. Passonneau, Salim Roukos, Gregory A. Sanders, Stephanie Seneff, David Stallard |
| 2002 | DCT-based video features for audio-visual speech recognition. Martin Heckmann, Kristian Kroschel, Christophe Savariaux, Frédéric Berthommier |
| 2002 | DETAC: a discriminative criterion for speaker verification. Jirí Navrátil, Ganesh N. Ramaswamy |
| 2002 | Data, annotation schemes and coding tools for natural interactivity. Laila Dybkjær, Niels Ole Bernsen |
| 2002 | Data-driven segment preselection in the IBM trainable speech synthesis system. Wael Hamza, Robert E. Donovan |
| 2002 | Data-driven temporal filters obtained via different optimization criteria evaluated on Aurora2 database. Jeih-weih Hung, Lin-Shan Lee |
| 2002 | Data-driven vector clustering for low-memory footprint ASR. Karim Filali, Xiao Li, Jeff A. Bilmes |
| 2002 | Decision tree distribution tying based on a dimensional split technique. Heiga Zen, Keiichi Tokuda, Tadashi Kitamura |
| 2002 | Design for a speech-to-speech translator for field use. David Stallard, Premkumar Natarajan, Mohammed Noamany, Richard M. Schwartz, John Makhoul |
| 2002 | Design of a Mandarin sentence set for corpus-based speech synthesis by use of a multi-tier algorithm taking account of the varied prosodic and spectral characteristics. Jinfu Ni, Hisashi Kawai |
| 2002 | Design of an audio-visual speech corpus for the czech audio-visual speech synthesis. Milos Zelezný, Petr Císar, Zdenek Krnoul, Jan Novák |
| 2002 | Design of system-initiated digressive proposals for automated banking dialogues. Jenny Wilkie, Mervyn A. Jack, Peter J. Littlewood |
| 2002 | Designing Japanese speech database covering wide range in prosody for hybrid speech synthesizer. Hiromichi Kawanami, Tsuyoshi Masuda, Tomoki Toda, Kiyohiro Shikano |
| 2002 | Designing a speaker-discriminative adaptive filter bank for speaker recognition. Tomi Kinnunen |
| 2002 | Detection and recognition of repaired speech on misrecognized utterances for speech input of car navigation system. Naoko Kakutani, Norihide Kitaoka, Seiichi Nakagawa |
| 2002 | Development of Japanese infant speech database and speaking rate analysis. Shigeaki Amano, Kazumi Kato, Tadahisa Kondo |
| 2002 | Development of a GUI-based articulatory speech synthesis system. Kohichi Ogata, Yorinobu Sonoda |
| 2002 | Discrimination of English vowels in consonantal contexts by native speakers of Japanese and its relations to dynamic information of formants. Akiyo Joto, Motohisa Imaishi, Yoshiki Nagase, Seiya Funatsu |
| 2002 | Discriminative linear transforms for feature normalization and speaker adaptation in HMM estimation. Stavros Tsakalidis, Vlasios Doumpiotis, William Byrne |
| 2002 | Discriminative training for call classification and routing. Hong-Kwang Jeff Kuo, Chin-Hui Lee, Imed Zitouni, Eric Fosler-Lussier, Egbert Ammicht |
| 2002 | Distributed Chinese keyword spotting and verification for spoken dialogues under wireless environment. Yun-Tien Lee, Cheng-Huang Wu, Yumin Lee, Lin-Shan Lee |
| 2002 | Distributed audio-visual speech synchronization. Peter Poller, Jochen Müller |
| 2002 | Distributed speech recognition over IP networks on the Aurora 3 database. Laura Docío Fernández, Carmen García-Mateo |
| 2002 | Distributed speech recognition using noise-robust MFCC and traps-estimated manner features. Pratibha Jain, Hynek Hermansky, Brian Kingsbury |
| 2002 | Divergence-based out-of-class rejection for telephone handset identification. Chi-Leung Tsang, Man-Wai Mak, Sun-Yuan Kung |
| 2002 | Double the trouble: handling noise and reverberation in far-field automatic speech recognition. David Gelbart, Nelson Morgan |
| 2002 | Duration and F0 as perceptual cues to Japanese vowel quantity. Keisuke Kinoshita, Dawn M. Behne, Takayuki Arai |
| 2002 | Duration modeling for arabic text to speech synthesis. Yasser Hifny, Mohsen A. Rashwan |
| 2002 | Duration related phase realignment of Thai tones. John J. Ohala, Rungpat Roengpitya |
| 2002 | Dutch HLT resources: from BLARK to priority lists. Helmer Strik, Walter Daelemans, Diana Binnenpoorte, Janienke Sturm, Folkert de Vriend, Catia Cucchiarini |
| 2002 | Dynamic search-space pruning for time-constrained speech recognition. Sascha Wendt, Gernot A. Fink, Franz Kummert |
| 2002 | Dynamic tuning of language model score in speech recognition using a confidence measure. Sherif M. Abdou, Michael S. Scordilis |
| 2002 | E-mail goes mobile: the design and implementation of a spoken language interface to e-mail. Daniela Oria, Esa Koskinen |
| 2002 | EM training of finite-state transducers and its application to pronunciation modeling. Han Shu, I. Lee Hetherington |
| 2002 | Effect of F0 fluctuation and amplitude modulation of natural vowels on vowel identification in noisy environments. Kentaro Ishizuka, Kiyoaki Aikawa |
| 2002 | Effects of intra-phrase position on acceptability of changes in segmental duration in sentence speech. Makiko Muto, Hiroaki Kato, Minoru Tsuzaki, Yoshinori Sagisaka |
| 2002 | Effects of production training with visual feedback on the acquisition of Japanese pitch and durational contrasts. Yukari Hirata |
| 2002 | Effects of word error rate in the DARPA communicator data during 2000 and 2001. Gregory A. Sanders, Audrey N. Le, John S. Garofolo |
| 2002 | Efficient additive and convolutional noise reduction procedures. Bojan Kotnik, Damjan Vlaj, Zdravko Kacic, Bogomir Horvat |
| 2002 | Efficient and scalable methods for text script generation in corpus-based TTS design. Chih-Chung Kuo, Jing-Yi Huang |
| 2002 | Efficient combination of type-in and wizard-of-oz tests in speech interface development process. Saija-Maaria Lemmelä, Péter Pál Boda |
| 2002 | Efficient construction of long-range language models using log-linear interpolation. Edward W. D. Whittaker, Dietrich Klakow |
| 2002 | Efficient precalculation of LM contexts for large vocabulary continuous speech recognition. Javier Dieguez-Tirado, Antonio Cardenal López |
| 2002 | Eigenvoices for HMM-based speech synthesis. Kengo Shichiri, Atsushi Sawabe, Takayoshi Yoshimura, Keiichi Tokuda, Takashi Masuko, Takao Kobayashi, Tadashi Kitamura |
| 2002 | Emotion recognition from textual input using an emotional semantic network. Ze-Jing Chuang, Chung-Hsien Wu |
| 2002 | Emotional space improves emotion recognition. Raquel Tato, Rocío Santos, Ralf Kompe, José M. Pardo |
| 2002 | English call system with functions of speech segmentation and pronunciation evaluation using speech recognition technology. Yasuo Ariki, Jun Ogata |
| 2002 | Enhanced histogram normalization in the acoustic feature space. Sirko Molau, Florian Hilger, Daniel Keysers, Hermann Ney |
| 2002 | Enhancement of single channel speech using perception-based wavelet transform. Ching-Ta Lu, Hsiao-Chuan Wang |
| 2002 | Entropy of energy operator as feature for large vocabulary Mandarin speaker independent speech recognition. Fadhil H. T. Al-Dulaimy, Zuoying Wang |
| 2002 | Error-tolerant spoken language understanding with confidence measuring. Huei-Ming Wang, Yi-Chung Lin |
| 2002 | Estimating syntactic structure from F0 contour and pause duration in Japanese speech. Yasuo Horiuchi, Tomoko Ohsuga, Akira Ichikawa |
| 2002 | Evaluation of SPLICE on the Aurora 2 and 3 tasks. Jasha Droppo, Li Deng, Alex Acero |
| 2002 | Evaluation of a noise adaptive speech recognition system on the Aurora 3 database. Kaisheng Yao, Donglai Zhu, Satoshi Nakamura |
| 2002 | Evaluation of a noise-robust DSR front-end on Aurora databases. Duncan Macho, Laurent Mauuary, Bernhard Noé, Yan Ming Cheng, Douglas Ealey, Denis Jouvet, Holly Kelleher, David Pearce, Fabien Saadoun |
| 2002 | Evaluation of a speech recognition / generation method based on HMM and straight. Toshio Irino, Yasuhiro Minami, Tomohiro Nakatani, Minoru Tsuzaki, H. Tagawa |
| 2002 | Evaluation of a system for concatenative articulatory visual speech synthesis. Olov Engwall |
| 2002 | Evaluation of cross-language voice conversion using bilingual and non-bilingual databases. Mikiko Mashimo, Tomoki Toda, Hiromichi Kawanami, Hideki Kashioka, Kiyohiro Shikano, Nick Campbell |
| 2002 | Evaluation of formant-like features for ASR. Katrin Weber, Febe de Wet, Bert Cranen, Lou Boves, Samy Bengio, Hervé Bourlard |
| 2002 | Evaluation of noise robust features on the Aurora databases. Xiaodong Cui, Markus Iseli, Qifeng Zhu, Abeer Alwan |
| 2002 | Evaluation of noisy speech recognition based on noise reduction and acoustic model adaptation on the Aurora2 tasks. Masakiyo Fujimoto, Yasuo Ariki |
| 2002 | Evaluation of spectral subtraction with smoothing of time direction on the Aurora 2 task. Norihide Kitaoka, Seiichi Nakagawa |
| 2002 | Evaluation of the method to detect Japanese local speech rate deceleration applying the variable threshold with a constant term. Keiichi Takamaru, Makoto Hiroshige, Kenji Araki, Koji Tochinai |
| 2002 | Evidence for efficiency in vowel production. R. J. J. H. van Son, Louis C. W. Pols |
| 2002 | Expanded examinations of a low frequency modulation feature for speech/music discrimination. Stefan Karnebäck |
| 2002 | Experiments in confidence scoring for word and sentence verification. Marco Andorno, Pietro Laface, Roberto Gemello |
| 2002 | Experiments on recognition of lavalier microphone speech and whispered speech in real world environments. Kiyoshi Tatara, Taisuke Ito, Parham Zolfaghari, Kazuya Takeda, Fumitada Itakura |
| 2002 | Experiments on speaker-independent voice command recognition using in-vehicle hands free speech. Yifan Gong, Lorin Netsch |
| 2002 | Exploiting support vector machines in hidden Markov models for speaker verification. Dong Xin, Zhaohui Wu, Yingchun Yang |
| 2002 | Exploiting variances in robust feature extraction based on a parametric model of speech distortion. Li Deng, Jasha Droppo, Alex Acero |
| 2002 | Exploring sub-word features and linear support vector machines for German spoken document classification. Martha A. Larson, Stefan Eickeler, Gerhard Paaß, Edda Leopold, Jörg Kindermann |
| 2002 | Expressive speech synthesis using a concatenative synthesizer. Murtaza Bulut, Shrikanth S. Narayanan, Ann K. Syrdal |
| 2002 | Extracting clauses for spoken language understanding in conversational systems. Narendra K. Gupta, Srinivas Bangalore, Mazin G. Rahim |
| 2002 | Extraction of important sentences using F0 information for speech summarization. Yoichi Yamashita, Akira Inoue |
| 2002 | Eye-fixation as a measure of real-time processing of synthesized words. Mary D. Swift, Ellen Campana, James F. Allen, Michael K. Tanenhaus |
| 2002 | Eyebrow movements and voice variations in dialogue situations: an experimental investigation. Christian Cavé, Isabelle Guaïtella, Serge Santi |
| 2002 | F0 generation for speech synthesis using a multi-tier approach. Xuejing Sun |
| 2002 | FORM: an extensible, kinematically-based gesture annotation scheme. Craig Martell |
| 2002 | FPGA hardware for speech recognition using hidden Markov models. José Luis Gómez-Cipriano, Roger Pizzatto Nunes, Dante A. C. Barone |
| 2002 | Factor analyzed Gaussian mixture models for speaker identification. Peng Ding, Yang Liu, Bo Xu |
| 2002 | Factors in human language identification. Ian Maddieson, Ioana Vasilescu |
| 2002 | Fast hierarchical grammar optimization algorithm toward time and space efficiency. Jing Zheng, Horacio Franco |
| 2002 | Feature extraction combining spectral noise reduction and cepstral histogram equalization for robust ASR. José C. Segura, M. Carmen Benítez, Ángel de la Torre, Antonio J. Rubio |
| 2002 | Feature extraction for unit selection in concatenative speech synthesis: comparison between AIM, LPC, and MFCC. Minoru Tsuzaki, Hisashi Kawai |
| 2002 | Feed the tiger: a method for evoking reliable jaw stretch reflexes in children. Donald S. Finan, Anne Smith, Michael Ho |
| 2002 | Feedback in computer assisted pronunciation training: technology push or demand pull? Ambra Neri, Catia Cucchiarini, Helmer Strik |
| 2002 | Filter bank subtraction for robust speech recognition. Kazuo Onoe, Hiroyuki Segi, Takeshi Kobayakawa, Shoei Sato, Toru Imai, Akio Ando |
| 2002 | Finite-state transducer based hungarian LVCSR with explicit modeling of phonological changes. Máté Szarvas, Sadaoki Furui |
| 2002 | Fixed-length segment coding of LSF parameters. Evgeni Yakhnich, Yuval Bistritz |
| 2002 | Flexible dialogue management in the talk'n'travel system. David Stallard |
| 2002 | Flexible multimodal human-machine interaction in mobile environments. Dirk Bühler, Wolfgang Minker, Jochen Häußler, Sven Krüger |
| 2002 | Floating-point adaptive multi-rate wideband speech codec. Toni P. Nieminen |
| 2002 | Formant model estimation and transformation for voice morphing. Ching-Hsiang Ho, Dimitrios Rentzos, Saeed Vaseghi |
| 2002 | Forms of introduction in map task dialogues: case of L2 Russian speakers. Olga Goubanova |
| 2002 | Framewise phone classification using support vector machines. Jesper Salomon, Simon King, Miles Osborne |
| 2002 | French nasal vowels: acoustic and articulatory properties. Véronique Delvaux, Thierry Metens, Alain Soquet |
| 2002 | Frequency band analysis for stress detection using a teager energy operator based feature. Mandar A. Rahurkar, John H. L. Hansen, James Meyerhoff, George Saviolakis, Michael Koenig |
| 2002 | Frequency dependence of vocal-tract length. Takuya Niikawa, Takanori Ando, Masafumi Matsumura |
| 2002 | From text to prosody without toBI. Volker Strom |
| 2002 | Frontend post-processing and backend model enhancement on the Aurora 2.0/3.0 databases. Chia-Ping Chen, Karim Filali, Jeff A. Bilmes |
| 2002 | Full-text story alignment models for Chinese-English bilingual news corpora. Bing Zhao, Stephan Vogel |
| 2002 | Functional modeling of face movements during speech. Shinji Maeda, Martine Toda, Andreas J. Carlen, Lyes Meftahi |
| 2002 | Generalization of state-observation-dependency in partly hidden Markov models. Tetsuji Ogawa, Tetsunori Kobayashi |
| 2002 | Generating script using statistical information of the context variation unit vector. Haiping Li, Fangxin Chen, Liqin Shen |
| 2002 | German broadcast news transcription. Robert Hecht, Jürgen Riedler, Gerhard Backfried |
| 2002 | Gestural spatialization in natural discourse segmentation. Francis K. H. Quek, David McNeill, Robert K. Bryll, Mary P. Harper |
| 2002 | Gestural trajectory symmetries and discourse segmentation. Francis K. H. Quek, Yingen Xiong, David McNeill |
| 2002 | Globalphone: a multilingual speech and text database developed at karlsruhe university. Tanja Schultz |
| 2002 | Goal-directed ASR in a multimedia indexing and searching environment (MUMIS). Mirjam Wester, Judith M. Kessens, Helmer Strik |
| 2002 | Grammar specialisation meets language modelling. Manny Rayner, Beth Ann Hockey, John Dowding |
| 2002 | Grapheme-to-phoneme conversion using pseudo-morphological units. Ulla Uebler |
| 2002 | HMM composition-based rapid model adaptation using a priori noise GMM adaptation evaluation on Aurora2 corpus. Masaki Ida, Satoshi Nakamura |
| 2002 | HMM-based methods for channel error mitigation in distributed speech recognition. Antonio M. Peinado, Victoria E. Sánchez, José L. Pérez-Córdoba, José C. Segura, Antonio J. Rubio |
| 2002 | Hearing-aid benefits and limitations: predictions from a cochlear model. James M. Kates |
| 2002 | Hierarchical Gaussian mixture model for speaker verification. Ming Liu, Eric Chang, Bei-qian Dai |
| 2002 | High performance digit recognition in real car environments. Umit H. Yapanel, Xianxian Zhang, John H. L. Hansen |
| 2002 | Highly oversampled subband adaptive filters for noise cancellation on a low-resource DSP system. King Tam, Hamid Sheikhzadeh, Todd Schneider |
| 2002 | Holds as gestural correlates to empty and filled speech pauses. Anna Esposito, Susan Duncan, Francis K. H. Quek |
| 2002 | How speakers with and without speech impairment mark the question statement contrast. Rupal Patel |
| 2002 | Hypophonia in parkinson disease: neural correlates of voice treatment with LSVT revealed by PET. Mario Liotti, Lorraine O. Ramig, Deanie Vogel, Pamela New, Chris Cook, Peter Fox |
| 2002 | ISIS: a multi-modal, trilingual, distributed spoken dialog system developed with CORBA, java, XML and KQML. Helen M. Meng, P. C. Ching, Yee Fong Wong, Cheong Chat Chan |
| 2002 | Implementation of an intonational quality assessment system. Chanwoo Kim, Wonyong Sung |
| 2002 | Implementation testing of a hybrid symbolic/statistical multimodal architecture. Edward C. Kaiser, Philip R. Cohen |
| 2002 | Implementing vocal tract length normalization in the MLLR framework. Guo-Hong Ding, Yi-Fei Zhu, Chengrong Li, Bo Xu |
| 2002 | Improve latent semantic analysis based language model by integrating multiple level knowledge. Rong Zhang, Alexander I. Rudnicky |
| 2002 | Improved Chinese spoken document retrieval with hybrid modeling and data-driven indexing features. Chun-Jen Wang, Berlin Chen, Lin-Shan Lee |
| 2002 | Improved corpus-based synthesis of fundamental frequency contours using generation process model. Keikichi Hirose, Masaya Eto, Nobuaki Minematsu |
| 2002 | Improved katz smoothing for language modeling in speech recognition. Genqing Wu, Fang Zheng, Wenhu Wu, Mingxing Xu, Ling Jin |
| 2002 | Improved performance speech codec for mobile communications. K. Humphreys, Robert Lawlor |
| 2002 | Improved phone recognition on TIMIT using formant frequency data and confidence measures. N. J. Wilkinson, Martin J. Russell |
| 2002 | Improved structural maximum likelihood eigenspace mapping for rapid speaker adaptation. Bowen Zhou, John H. L. Hansen |
| 2002 | Improvement of the ELS-based time-varying complex speech analysis. Keiichi Funaki |
| 2002 | Improvements to the IBM Aurora 2 multi-condition system. George Saon, Juan M. Huerta |
| 2002 | Improving latent semantic indexing based classifier with information gain. Li Li, Wu Chou |
| 2002 | Improving parametric trajectory modeling by integration of pitch and tone information. Yiyan Zhang, Wenju Liu, Bo Xu, Huayun Zhang |
| 2002 | Improving performance of an HMM-based ASR system by using monophone-level normalized confidence measure. Muhammad Ghulam, Takashi Fukuda, Takaharu Sato, Tsuneo Nitta |
| 2002 | Improving phone-level discrimination in LDA with subphone-level classes. Hwa Jeon Song, Hyung Soon Kim |
| 2002 | Improving speech recognition performance of small microphone arrays using missing data techniques. Iain McCowan, Andrew C. Morris, Hervé Bourlard |
| 2002 | Improving spoken language understanding using word confusion networks. Gökhan Tür, Jerry H. Wright, Allen L. Gorin, Giuseppe Riccardi, Dilek Hakkani-Tür |
| 2002 | Improving statistical machine translation for a speech-to-speech translation task. Stephan Vogel, Alicia Tribble |
| 2002 | Improving the role of unvoiced speech segments by spectral normalisation in robust speech recognition. Carlos M. G. S. Lima, Luís B. Almeida, João L. Monteiro |
| 2002 | Improving word accuracy with Gabor feature extraction. Michael Kleinschmidt, David Gelbart |
| 2002 | Incremental on-line feature space MLLR adaptation for telephony speech recognition. Yongxin Li, Hakan Erdogan, Yuqing Gao, Etienne Marcheret |
| 2002 | Individual word language models and the frequency approach. Elvira I. Sicilia-Garcia, Ji Ming, Francis Jack Smith |
| 2002 | Influence of different dialogue situations on user's behavior in spoken corrections. Atsuhiko Kai, Yukari Nonomura, Toshihiko Itoh, Tatsuhiro Konishi, Yukihiro Itoh |
| 2002 | Influence of prosody, context, and word order in the identification of focus in Japanese dialogue. Tatsuya Kitamura, Kayo Itoh, Toshihiko Itoh, Shigeyoshi Kitazawa |
| 2002 | Influence of transmission errors on ASR systems. Carmen Peláez-Moreno, Ascensión Gallardo-Antolín, Jesús Vicente-Peña, Fernando Díaz-de-María |
| 2002 | Information retrieval based on speech recognition results. Masatoshi Watanabe, Masahide Sugiyama |
| 2002 | Information-theoretic criteria for unit selection synthesis. Jon R. W. Yi, James R. Glass |
| 2002 | Ingressive speech as an indication that humans are talking to humans (and not to machines). Robert Eklund |
| 2002 | Integrating multiple pronunciations during MCE-based acoustic model training for large vocabulary speech recognition. Rathi Chengalvarayan |
| 2002 | Integrating speech with keypad input for automatic entry of spelling and pronunciation of new words. Grace Chung, Stephanie Seneff |
| 2002 | Integration of MLLR adaptation with pronunciation proficiency adaptation for non-native speech recognition. Nobuaki Minematsu, Gakuto Kurata, Keikichi Hirose |
| 2002 | Integration of phonetic length properties in the acoustic models of false starts and out-of-vocabulary words. H. Hamimed, Géraldine Damnati |
| 2002 | Integration of supra-lexical linguistic models with speech recognition using shallow parsing and finite state transducers. Xiaolong Mou, Stephanie Seneff, Victor Zue |
| 2002 | Integration of two stochastic context-free grammars. Anna Corazza |
| 2002 | Intelligibility of reverse speech in French: a perceptual study. Ivan Magrin-Chagnolleau, Melissa Barkat, Fanny Meunier |
| 2002 | Interaction of voice over internet protocol speech coders and disordered speech samples. Vijay Parsa, Donald G. Jamieson |
| 2002 | Interlingua based statistical machine translation. Manuel Kauers, Stephan Vogel, Christian Fügen, Alex Waibel |
| 2002 | Interpreting meaning from context: modeling the prosody of discourse markers in speech. Li-chiung Yang |
| 2002 | Intonation modelling for the synthesis of structured documents. Jeska Buhmann, Jean-Pierre Martens, Lieve Macken, Bert Van Coile |
| 2002 | Intonational and visual cues in the perception of interrogative mode in Swedish. David House |
| 2002 | Intrasyllabic articulatory control constraints in verbal working memory. Marc Sato, Jean-Luc Schwartz, Marie-Agnès Cathiard, Christian Abry, Hélène Loevenbruck |
| 2002 | Intrinsic phone durations are speaker-specific. Hartmut R. Pfitzinger |
| 2002 | Introduction of constraints in an acoustic-to-articulatory inversion method based on a hypercubic articulatory table. Yves Laprie, Slim Ouni |
| 2002 | Investigation of coarticulation based on electromagnetic articulographic data. Jianwu Dang, Masaaki Honda, Kiyoshi Honda |
| 2002 | Investigations on joint-multigram models for grapheme-to-phoneme conversion. Maximilian Bisani, Hermann Ney |
| 2002 | Is the speaker done yet? faster and more accurate end-of-utterance detection using prosody. Luciana Ferrer, Elizabeth Shriberg, Andreas Stolcke |
| 2002 | Issues in automatic transcription of historical audio data. Fabio Brugnara, Mauro Cettolo, Marcello Federico, Diego Giuliani |
| 2002 | Issues in the development of a stochastic speech understanding system. Fabrice Lefèvre, Hélène Bonneau-Maynard |
| 2002 | Japanese broadcast news transcription. Long Nguyen, Xuefeng Guo, Richard M. Schwartz, John Makhoul |
| 2002 | Juncture segmentation of Japanese prosodic unit based on the spectrographic features. Shigeyoshi Kitazawa, Toshihiko Itoh, Tatsuya Kitamura |
| 2002 | Kymographic imaging of the vocal fold oscillations. Jan G. Svec, Frantisek Sram |
| 2002 | LU factorization for feature transformation. Patrick Nguyen, Luca Rigazio, Christian Wellekens, Jean-Claude Junqua |
| 2002 | Large vocabulary conversational speech recognition with the extended maximum likelihood linear transformation (EMLLT) model. Jing Huang, Vaibhava Goel, Ramesh Gopinath, Brian Kingsbury, Peder A. Olsen, Karthik Visweswariah |
| 2002 | Laryngoscopic analysis of tibetan chanting modes and their relationship to register in sino-tibetan. John H. Esling |
| 2002 | Learning decision trees to determine turn-taking by spoken dialogue systems. Ryo Sato, Ryuichiro Higashinaka, Masafumi Tamoto, Mikio Nakano, Kiyoaki Aikawa |
| 2002 | Learning syllable duration and intonation of Mandarin Chinese. Oliver Jokisch, Hongwei Ding, Hans Kruschke, Guntram Strecha |
| 2002 | Likelihood combination and recognition output voting for the decoding of non-native speech with multilingual HMMs. Volker Fischer, Eric Janke, Siegfried Kunzmann |
| 2002 | Linguistic and acoustic changes of user's utterances caused by different dialogue situations. Toshihiko Itoh, Atsuhiko Kai, Tatsuhiro Konishi, Yukihiro Itoh |
| 2002 | Lip gestures in English sibilants: articulatory - acoustic relationship. Martine Toda, Shinji Maeda, Andreas J. Carlen, Lyes Meftahi |
| 2002 | Lip-reading based on a fully automatic statistical model. Philippe Daubias, Paul Deléglise |
| 2002 | Low complexity Mandarin speaker-independent isolated word recognition. Xia Wang, Juha Iso-Sipilä |
| 2002 | Low complexity techniques for embedded ASR systems. Imre Kiss, Marcel Vasilache |
| 2002 | Low cost duration modelling for noise robust speech recognition. Andrew C. Morris, Simon Payne, Hervé Bourlard |
| 2002 | Low-resource noise-robust feature post-processing on Aurora 2.0. Chia-Ping Chen, Jeff A. Bilmes, Katrin Kirchhoff |
| 2002 | Markov models based on speaker space model evolution. Dong Kook Kim, Nam Soo Kim |
| 2002 | Maximum entropy model for punctuation annotation from speech. Jing Huang, Geoffrey Zweig |
| 2002 | Maximum expected likelihood based model selection and adaptation for nonnative English speakers. Xiaodong He, Yunxin Zhao |
| 2002 | Maximum likelihood estimation of eigenvoices and residual variances for large vocabulary speech recognition tasks. Patrick Kenny, Gilles Boulianne, Pierre Dumouchel |
| 2002 | Maximum mutual information training of hidden Markov models with vector linear predictors. K. K. Chin, Philip C. Woodland |
| 2002 | Medium vocabulary continuous audio-visual speech recognition. Pascal Wiggers, Jacek C. Wojdel, Léon J. M. Rothkrantz |
| 2002 | Mel-scaled wavelet filter based features for noisy unvoiced phoneme recognition. Omar Farooq, Sekharjit Datta |
| 2002 | Memory space reduction for hidden Markov models in low-resource speech recognition systems. Sergey Astrov |
| 2002 | Methods to improve Gaussian mixture model based language identification system. Eddie Wong, Sridha Sridharan |
| 2002 | Minimum perfect hashing for fast n-gram language model lookup. Xiao Zhang, Yunxin Zhao |
| 2002 | Model partial pronunciation variations for spontaneous Mandarin speech recognition. Yi Liu, Pascale Fung |
| 2002 | Model-based independent component analysis for robust multi-microphone automatic speech recognition. Laurent Couvreur, Christophe Ris |
| 2002 | Model-based predictions of intensity discrimination for normal- and impaired-hearing listeners. Lisa G. Huettel, Leslie M. Collins |
| 2002 | Modeling HMM state distributions with Bayesian networks. Konstantin Markov, Satoshi Nakamura |
| 2002 | Modeling and automatic detection of English sentence stress for computer-assisted English prosody learning system. Kazunori Imoto, Yasushi Tsubota, Antoine Raux, Tatsuya Kawahara, Masatake Dantsuji |
| 2002 | Modeling articulatory dynamics in autoregressive linear system. Kiyoshi Hashimoto |
| 2002 | Modeling durational variability in reading aloud a connected text. Caroline L. Smith |
| 2002 | Modeling frequent allophones in Japanese speech recognition. Long Nguyen, Xuefeng Guo, John Makhoul |
| 2002 | Modeling recognition of speech sounds with minerva2. Travis Wade, Deborah K. Eakin, Russell Webb, Arvin Agah, Frank Brown, Allard Jongman, John Gauch, Thomas A. Schreiber, Joan A. Sereno |
| 2002 | Modeling the perception of frequency-shifted vowels. Peter F. Assmann, Terrance M. Nearey, Jack M. Scott |
| 2002 | Modeling tones in continuous Cantonese speech. Tan Lee, Greg Kochanski, Chilin Shih, Yujia Li |
| 2002 | Modeling varying pauses to develop robust acoustic models for recognizing noisy conversational speech. Jinsong Zhang, Satoshi Nakamura |
| 2002 | Modeling with a subspace constraint on inverse covariance matrices. Scott Axelrod, Ramesh Gopinath, Peder A. Olsen |
| 2002 | Models of speech dynamics in a segmental-HMM recognizer using intermediate linear representations. Philip J. B. Jackson, Martin J. Russell |
| 2002 | Motor specifications of a baby robot via the analysis of infants' vocalizations. Jihène Serkhane, Jean-Luc Schwartz, Louis-Jean Boë, Barbara L. Davis, Christine L. Matyear |
| 2002 | Multi-dimensional analysis of sonority: perception, acoustics, and phonology. Masahiko Komatsu, Shinichi Tokuma, Won Tokuma, Takayuki Arai |
| 2002 | Multi-scale and multi-model integration for improved performance in Chinese spoken document retrieval. Wai Kit Lo, Helen M. Meng, P. C. Ching |
| 2002 | Multilingual pronunciation modeling for improving multilingual speech recognition. Jilei Tian, Juha Häkkinen, Olli Viikki |
| 2002 | Multilingual speech recognition with language identification. Bin Ma, Cuntai Guan, Haizhou Li, Chin-Hui Lee |
| 2002 | Multimodal integration patterns in children. Benfang Xiao, Cynthia Girand, Sharon L. Oviatt |
| 2002 | Multimodal language processing for mobile information access. Michael Johnston, Srinivas Bangalore, Amanda Stent, Gunaranjan Vasireddy, Patrick Ehlen |
| 2002 | Multiparty multimodal interaction: a preliminary analysis. Philip R. Cohen, Rachel Coulston, Kelly Krout |
| 2002 | Multiple regression of log-spectra for in-car speech recognition. Tetsuya Shinde, Kazuya Takeda, Fumitada Itakura |
| 2002 | Mutual information phone clustering for decision tree induction. Ciprian Chelba, Rachel Morton |
| 2002 | N-word-sequence frequency noise mitigation for SLM based on binomial distribution. Yibao Zhao, Guojun Zhou |
| 2002 | Named entity extraction from spontaneous speech in how may i help you? Frédéric Béchet, Allen L. Gorin, Jerry H. Wright, Dilek Hakkani-Tür |
| 2002 | Native and vietnamese production of compound and phrasal stress patterns. Thu Nguyen, John Ingram |
| 2002 | Network-based vs. distributed speech recognition in adaptive multi-rate wireless systems. Tim Fingscheidt, Stefanie Aalburg, Sorel Stan, Christophe Beaugeant |
| 2002 | Neurocognitive basis for audiovisual speech perception: evidence from event-related potentials. Curtis W. Ponton, Edward T. Auer, Lynne E. Bernstein |
| 2002 | New model for speech residual signal shaping with static nonlinearity. Jari Juhani Turunen, Juha T. Tanttu, Pekka Loula |
| 2002 | Noise adaptive speech recognition with acoustic models trained from noisy speech evaluated on Aurora-2 database. Kaisheng Yao, Kuldip K. Paliwal, Satoshi Nakamura |
| 2002 | Noise estimation for efficient speech enhancement and robust speech recognition. Petr Motlícek, Lukás Burget |
| 2002 | Noise from corrupted speech log mel-spectral energies. Jasha Droppo, Alex Acero, Li Deng |
| 2002 | Noise robust speech recognition using F0 contour extracted by hough transform. Koji Iwano, Takahiro Seki, Sadaoki Furui |
| 2002 | Noise-robust speech recognition in car environments using genetic algorithms and a mel-cepstral subspace approach. Sid-Ahmed Selouani, Douglas D. O'Shaughnessy |
| 2002 | Non-linear techniques for dysphonic voice analysis and correction. Claudia Manfredi, Lorenzo Matassini |
| 2002 | Objective distance measures for spectral discontinuities in concatenative speech synthesis. Jithendra Vepa, Simon King, Paul Taylor |
| 2002 | On F0 trajectory optimization for very high-quality speech manipulation. Hideki Kawahara, Parham Zolfaghari, Alain de Cheveigné |
| 2002 | On developing new text and audio corpora and speech recognition tools for the turkish language. Özgül Salor, Bryan L. Pellom, Tolga Çiloglu, Kadri Hacioglu, Mübeccel Demirekler |
| 2002 | On effective speaker verification based on subword model. Sungjoo Ahn, Sunmee Kang, Hanseok Ko |
| 2002 | On improving the performance of analysis-by-synthesis coding using a multi-magnitude algebraic code-book excitation signal. Omar Halmi, Hesham Tolba, Driss Guerchi, Douglas D. O'Shaughnessy |
| 2002 | On text-based language identification for multilingual speech recognition systems. Jilei Tian, Juha Häkkinen, Søren Riis, Kåre Jean Jensen |
| 2002 | On the estimation of signal-to-noise ratio in continuous speech for abnormal voices. Vijay Parsa, Donald G. Jamieson, Karen Stenning, Herbert A. Leeper |
| 2002 | On the function of the late rise and the early fall in dutch dialogue: a perception experiment. Johanneke Caspers |
| 2002 | On the relevance of bandwidth extension for speaker verification. Marcos Faúndez-Zanuy, Mattias Nilsson, W. Bastiaan Kleijn |
| 2002 | On the role of the "schwa" in the perception of plosive consonants. René Carré, Jean-Sylvain Liénard, Egidio Marsico, Willy Serniclaes |
| 2002 | On the use of Gaussian mixture model for speaker variability analysis. Tao Chen, Chao Huang, Eric Chang, Jingchun Wang |
| 2002 | On the use of structures in language models for dialogue. Renato De Mori, Yannick Estève, Christian Raymond |
| 2002 | On use of duration modeling for continuous digits speech recognition. Rong Dong, Jie Zhu |
| 2002 | Operations for context-based multimodal interpretation in conversational systems. Joyce Yue Chai |
| 2002 | Optimal selection of speech data for automatic speech recognition systems. Arkadiusz Nagórski, Lou Boves, Herman J. M. Steeneken |
| 2002 | Optimal speech signal partition into one-quasiperiodical segments. Taras K. Vintsiuk |
| 2002 | Optimization of hidden Markov models for embedded systems. Klaus Reinhard, Jochen Junkawitsch, Andreas Kießling, Stefan Dobler |
| 2002 | Oral-laryngeal control patterns for fricatives in 5-year-olds and adults. Laura L. Koenig, Jorge C. Lucero |
| 2002 | Orientel: speech-based interactive communication applications for the mediterranean and the middle east. Imed Zitouni, Joseph P. Olive, Dorota J. Iskra, Khalid Choukri, Ossama Emam, Oren Gedge, Emmanuel Maragoudakis, Herbert S. Tropf, Asunción Moreno, Albino Nogueiras Rodríguez, Barbara Heuft, Rainer Siemund |
| 2002 | Oro-facial changes in parkinson's disease following intensive voice therapy (LSVT). Jennifer L. Spielman, Lorraine O. Ramig, Joan C. Borod |
| 2002 | Overview on recent activities in speech understanding and dialogue systems evaluation. Wolfgang Minker |
| 2002 | Parametric trajectory segment model for LVCSR. Lei Jia, Bo Xu |
| 2002 | Part-of-speech tagging in French text-to-speech synthesis: experiments in tagset selection. Hongyan Jing, Evelyne Tzoukermann |
| 2002 | Pause duration and variability in read texts. Elena Zvonik, Fred Cummins |
| 2002 | Perceived boundary strength. Petra Hansson |
| 2002 | Perception and integration of audiovisual speech in human infants. David J. Lewkowicz |
| 2002 | Perception of prosodic phrasing by hearing-impaired listeners. Dragana Barac-Cikoja, Sally Revoile |
| 2002 | Perception of tone and vowel quantity in Thai. Hansjörg Mixdorff, Sudaporn Luksaneeyanawin, Hiroya Fujisaki, Patavee Charnvivit |
| 2002 | Perceptual adjustment to foreign-accented English with short term exposure. Constance M. Clarke |
| 2002 | Perceptual effects of assimilation-induced violation of final devoicing in dutch. Cecile T. L. Kuijpers, Wilma van Donselaar, Anne Cutler |
| 2002 | Perceptual evaluation of audiovisual cues for prominence. Emiel Krahmer, Zsófia Ruttkay, Marc Swerts, Wieger Wesselink |
| 2002 | Perceptual evaluation of naturalness due to substitution of Chinese syllable for concatenative speech synthesis. Jinlin Lu, Hisashi Kawai |
| 2002 | Perceptual learning of second-language syllable rhythm by elderly listeners. Keiichi Tajima, Reiko Akahane-Yamada, Tsuneo Yamada |
| 2002 | Performance of discriminatively trained auditory features on Aurora2 and Aurora3. Brian Kan-Wing Mak, Yik-Cheung Tam |
| 2002 | Perceptually optimizing the cost function for unit selection in a TTS system with one single run of MOS evaluation. Hu Peng, Yong Zhao, Min Chu |
| 2002 | Phonetic normalization using z-score in segmental prosody estimation for corpus-based TTS system. Hoeun Song, Jaein Kim, Kyongrok Lee, Jinyoung Kim |
| 2002 | Phonetic speaker identification. Qin Jin, Tanja Schultz, Alex Waibel |
| 2002 | Phonological norms in faroese speech synthesis. Pétur Helgason, Sjúrður Gullbein |
| 2002 | Pitch accent prediction using ensemble machine learning. Xuejing Sun |
| 2002 | Pitch contour model for Chinese text-to-speech using CART and statistical model. Minghui Dong, Kim-Teng Lua |
| 2002 | Pitch extraction of speech signals using an eigen-based subspace method. Takahiro Murakami, Munehiro Namba, Tetsuya Hoya, Yoshihisa Ishida |
| 2002 | Porting channel robustness across languages. Françoise Beaufays, Daniel Boies, Mitch Weintraub |
| 2002 | Power spectral density based channel equalization of large speech database for concatenative TTS system. Yu Shi, Eric Chang, Hu Peng, Min Chu |
| 2002 | Preaspirated stops in southern Swedish. Mechtild Tronnier |
| 2002 | Predicting oral reading miscues. Jack Mostow, Joseph Beck, S. Vanessa Winter, Shaojun Wang, Brian Tobin |
| 2002 | Preliminary data on effects of behavioral and levodopa therapies on speech-accompanying gesture in parkinson's disease. Susan Duncan |
| 2002 | Probabilistic ranking of constraints. Louis ten Bosch |
| 2002 | Probabilistic retrieval based on document representations. Wolfgang Macherey, Hans Jörg Viechtbauer, Hermann Ney |
| 2002 | Processing of temporal cues marking phrasal boundaries in individuals with brain damage. Wendi A. Aasland, Shari R. Baum |
| 2002 | Production and perception of pauses and their linguistic context in read and spontaneous speech in Swedish. Beáta Megyesi, Sofia Gustafson-Capková |
| 2002 | Production based pitch modification of voiced speech. Yinglong Jiang, Peter Murphy |
| 2002 | Progress with the philips continuous ASR system on the Aurora 2 noisy digits database. Markus Lieb, Alexander Fischer |
| 2002 | Pronunciation of proper names with a joint n-gram model for bi-directional grapheme-to-phoneme conversion. Lucian Galescu, James F. Allen |
| 2002 | Prosodic parameter for speaker identification. Katarina Bartkova, David Le Gac, Delphine Charlet, Denis Jouvet |
| 2002 | Prosodic phrasing with inductive learning. Sheng Zhao, Jianhua Tao, Lianhong Cai |
| 2002 | Prosody-based automatic detection of annoyance and frustration in human-computer dialog. Jeremy Ang, Rajdip Dhillon, Ashley Krupski, Elizabeth Shriberg, Andreas Stolcke |
| 2002 | Qualcomm-ICSI-OGI features for ASR. André Gustavo Adami, Lukás Burget, Stéphane Dupont, Harinath Garudadri, Frantisek Grézl, Hynek Hermansky, Pratibha Jain, Sachin S. Kajarekar, Nelson Morgan, Sunil Sivadas |
| 2002 | Quantile based histogram equalization for online applications. Florian Hilger, Sirko Molau, Hermann Ney |
| 2002 | Quantitative evaluation of relevant prosodic factors for text-to-speech synthesis in Spanish. David Escudero Mancebo, César González Ferreras, Valentín Cardeñoso-Payo |
| 2002 | RUSLANA: a database of Russian emotional utterances. Veronika Makarova, Valery A. Petrushin |
| 2002 | Radiodoc: a voice-accessible document system. Takuya Nishimoto, Masahiro Araki, Yasuhisa Niimi |
| 2002 | Rapid development of speech-to-speech translation systems. Alan W. Black, Ralf D. Brown, Robert E. Frederking, Kevin A. Lenzo, John Moody, Alexander I. Rudnicky, Rita Singh, Eric Steinbrecher |
| 2002 | Rapid speaker adaptation using speaker clustering. Ernest Pusateri, Timothy J. Hazen |
| 2002 | Real-time rich-content transcription of Chinese broadcast news. Daben Liu, Jeffrey Ma, Dongxin Xu, Amit Srivastava, Francis Kubala |
| 2002 | Real-time sound source localization and separation for robot audition. Kazuhiro Nakadai, Hiroshi G. Okuno, Hiroaki Kitano |
| 2002 | Recognition and verification of English by Japanese students for computer-assisted language learning system. Yasushi Tsubota, Tatsuya Kawahara, Masatake Dantsuji |
| 2002 | Recognition error processing for speech understanding. Caroline Bousquet-Vernhettes, Nadine Vigouroux |
| 2002 | Recognition of continuous speech segments of monophone units using support vector machines. Weifeng Lee, C. Chandra Sekhar, Kazuya Takeda, Fumitada Itakura |
| 2002 | Recognition of noisy speech using normalized moments. Jingdong Chen, Yiteng Huang, Qi Li, Frank K. Soong |
| 2002 | Recurrent neural network-enhanced HMM speech recognition systems. Jan W. F. Thirion, Elizabeth C. Botha |
| 2002 | Reducing pronunciation lexicon confusion and using more data without phonetic transcription for pronunciation modeling. Fang Zheng, Zhanjiang Song, Pascale Fung, William Byrne |
| 2002 | Reducing the footprint of the IBM trainable speech synthesis system. Dan Chazan, Ron Hoory, Zvi Kons, Dorel Silberstein, Alexander Sorin |
| 2002 | Reference resolution by human partners in a natural interactive problem-solving task. Ellen Campana, Sarah Brown-Schmidt, Michael K. Tanenhaus |
| 2002 | Refined speech segmentation for concatenative speech synthesis. Abhinav Sethy, Shrikanth S. Narayanan |
| 2002 | Refocussing on the text normalisation process in text-to-speech systems. Andrew P. Breen, Barry Eggleton, Peter Dion, Steve Minnis |
| 2002 | Reliability measures for translation quality. Eiichiro Sumita, Yasuhiro Akiba, Kenji Imamura |
| 2002 | Rethinking derived acoustic features in speech recognition. Kevin S. Van Horn |
| 2002 | Retrieving phrases by selecting the history: application to automatic speech recognition. David Langlois, Kamel Smaïli, Jean Paul Haton |
| 2002 | Risk based lattice cutting for segmental minimum Bayes-risk decoding. Shankar Kumar, William Byrne |
| 2002 | Robust HMM training for unified dutch and German speech recognition. Rathi Chengalvarayan |
| 2002 | Robust MMSE-FW-LAASR scheme at low SNRs. Tao Xu, Zhigang Cao |
| 2002 | Robust feature extraction in a variety of input devices on the basis of ETSI standard DSR front-end. Satoru Tsuge, Shingo Kuroiwa, Masami Shishibori, Fuji Ren, Kenji Kita |
| 2002 | Robust fundamental frequency estimation against background noise and spectral distortion. Tomohiro Nakatani, Toshio Irino |
| 2002 | Robust multiple resolution analysis for automatic speech recognition. Roberto Gemello, Franco Mana, Paolo Pegoraro, Renato De Mori |
| 2002 | Robust semantic confidence scoring. Didier Guillevic, Simona Gandrabur, Yves Normandin |
| 2002 | Robust speech / music classification in audio documents. Julien Pinquier, Jean-Luc Rouas, Régine André-Obrecht |
| 2002 | Robust speech recognition against short-time noise. Man-Hung Siu, Yu-Chung Chan |
| 2002 | Robust speech recognition using a voiced-unvoiced feature. András Zolnay, Ralf Schlüter, Hermann Ney |
| 2002 | Robust speech recognition using inter-speaker and intra-speaker adaptation. Baojie Li, Keikichi Hirose, Nobuaki Minematsu |
| 2002 | Robust time-synchronous environmental adaptation for continuous speech recognition systems. Thomas Plötz, Gernot A. Fink |
| 2002 | Robust voiced-unvoiced decision associated to continuous pitch tracking in noisy telephone speech. Mijail Arcienega, Andrzej Drygajlo |
| 2002 | Run time information fusion in speech recognition. Chengyi Zheng, Yonghong Yan |
| 2002 | SALT: a spoken language interface for web-based multimodal dialog systems. Kuansan Wang |
| 2002 | SPIN: language understanding for spoken dialogue systems using a production system approach. Ralf Engel |
| 2002 | SRILM - an extensible language modeling toolkit. Andreas Stolcke |
| 2002 | Same talker, different language: a replication. Verna Stockmal, Zinny S. Bond |
| 2002 | Seeing tongue movements from outside. Gérard Bailly, Pierre Badin |
| 2002 | Segment duration in spoken korean. Hyunsong Chung |
| 2002 | Segmentation of glides with tonal alignment as reference. Yi Xu, Fang Liu |
| 2002 | Selective back-off smoothing for incorporating grammatical constraints into the n-gram language model. Tomoyosi Akiba, Katunobu Itou, Atsushi Fujii, Tetsuya Ishikawa |
| 2002 | Selective multi-path acoustic model based on database likelihoods. Akinobu Lee, Yuichiro Mera, Hiroshi Saruwatari, Kiyohiro Shikano |
| 2002 | Semantic inference: a data-driven solution for NL interaction. Jerome R. Bellegarda |
| 2002 | Semantic structured language models. Hakan Erdogan, Ruhi Sarikaya, Yuqing Gao, Michael Picheny |
| 2002 | Separation of voiced source characteristics and vocal tract transfer function characteristics for speech sounds by iterative analysis based on AR-HMM model. Nobuyuki Nishizawa, Keikichi Hirose, Nobuaki Minematsu |
| 2002 | Sequential MAP noise estimation and a phase-sensitive model of the acoustic environment. Li Deng, Jasha Droppo, Alex Acero |
| 2002 | Serving complex user wishes with an enhanced spoken dialogue system. Sunna Torge, Stefan Rapp, Ralf Kompe |
| 2002 | Sharing relative stress of cross-word syllables and lexical stress to spontaneous speech recognition. Farshad Almasganj, Farhad D. Dehnavi, Mahmood Bijankhan |
| 2002 | Sharing trend information of trajectory in segmental-feature HMM. Young-Sun Yun |
| 2002 | Sign language translation using an error tolerant retrieval algorithm. Chung-Hsien Wu, Yu-Hsien Chiu, Kung-Wei Cheng |
| 2002 | Similarities of words in noise in Japanese. Kiyoko Yoneyama |
| 2002 | Sources of variability in the perceptual training of /r/ and /l/: interaction of adjacent vowel, word position, talkers' visual and acoustic cues. Debra M. Hardison |
| 2002 | Sparse and independent representations of speech signals based on parametric models. Hugo Leonardo Rufiner, Luís F. Rocha, John Goddard Close |
| 2002 | Speaker change detection using a new weighted distance measure. Soonil Kwon, Shrikanth S. Narayanan |
| 2002 | Speaker identification by location in an optimal space of anchor models. Yassine Mami, Delphine Charlet |
| 2002 | Speaker independent speech recognition using features based on glottal sound source. Norihide Kitaoka, Daisuke Yamada, Seiichi Nakagawa |
| 2002 | Speaker intelligibility of adults and children. D. Markham, Valérie Hazan |
| 2002 | Speaker recognition using discriminative features selection. Bogdan Sabac |
| 2002 | Speaker recognizability evaluation of a voicefont-based text-to-speech system. Masaharu Sakamoto, Takashi Saito |
| 2002 | Speaker utterances tying among speaker segmented audio documents using hierarchical classification: towards speaker indexing of audio databases. Sylvain Meignier, Jean-François Bonastre, Ivan Magrin-Chagnolleau |
| 2002 | Speaker verification using Gaussian component strings in dynamic trajectory space. Bing Xiang |
| 2002 | Speaker verification with data fusion and model adaptation. Kevin R. Farrell |
| 2002 | Speaking rate compensation based on likelihood criterion in acoustic model training and decoding. Kozo Okuda, Tatsuya Kawahara, Satoshi Nakamura |
| 2002 | Special session: issues in audiovisual spoken language processing (when, where, and how?). Lynne E. Bernstein, Denis Burnham, Jean-Luc Schwartz |
| 2002 | Specification and realisation of multimodal output in dialogue systems. Jonas Beskow, Jens Edlund, Magnus Nordstrand |
| 2002 | Spectral enhancement preprocessing for the HNM coding of noisy speech. Gautam Moharir, Pushkar Patwardhan, Preeti Rao |
| 2002 | Spectral subtraction in noisy environments applied to speaker adaptation based on HMM sufficient statistics. Shingo Yamade, Kanako Matsunami, Akira Baba, Akinobu Lee, Hiroshi Saruwatari, Kiyohiro Shikano |
| 2002 | Speech and language processing for a constrained speech translation system. Stephen Cox |
| 2002 | Speech coding and transmission for improved automatic recognition. Xin Zhong, Jon A. Arrowood, Mark A. Clements |
| 2002 | Speech completion: on-demand completion assistance using filled pauses for speech input interfaces. Masataka Goto, Katunobu Itou, Satoru Hayamizu |
| 2002 | Speech enhancement based on a perceptual modification of wiener filtering. Lee Lin, W. Harvey Holmes, Eliathamby Ambikairajah |
| 2002 | Speech enhancement based on combining perceptual enhancement and short-time spectral attenuation. Ilyas Potamitis, Nikos Fakotakis, George Kokkinakis |
| 2002 | Speech enhancement based on generalized singular value decomposition approach. Gwo-hwa Ju, Lin-Shan Lee |
| 2002 | Speech enhancement in car environment using blind source separation. Hiroshi Saruwatari, Katsuyuki Sawai, Akinobu Lee, Kiyohiro Shikano, Atsunobu Kaminuma, Masao Sakata |
| 2002 | Speech enhancement in non-stationary noise environments. Hyoung-Gook Kim, Dietmar Ruwisch |
| 2002 | Speech enhancement using wavelet packet transform. Sungwook Chang, Sung-il Jung, Younghun Kwon, Sung-Il Yang |
| 2002 | Speech modeling using variational Bayesian mixture of Gaussians. Panu Somervuo |
| 2002 | Speech pauses and gestural holds in parkinson's disease. Francis K. H. Quek, Mary P. Harper, Yonca Haciahmetoglu, Lei Chen, Lorraine O. Ramig |
| 2002 | Speech recognition for language teaching and evaluating: a study of existing commercial products. Rebecca Hincks |
| 2002 | Speech recognition performance comparison between DSR and AMR transcoded speech. Holly Kelleher, David Pearce, Douglas Ealey, Laurent Mauuary |
| 2002 | Speech recognition using combined acoustic and articulatory information with retraining of acoustic model parameters. Ka-Yee Leung, Man-Hung Siu |
| 2002 | Speech recognition using fundamental frequency and voicing in acoustic modeling. Andrej Ljolje |
| 2002 | Speech recognition using syllable patterns. Li Zhang, William H. Edmondson |
| 2002 | Speech recognition with a re-speak method for subtitling live broadcasts. Toru Imai, Atsushi Matsui, Shinichi Homma, Takeshi Kobayakawa, Kazuo Onoe, Shoei Sato, Akio Ando |
| 2002 | Speech reconstruction from mel-frequency cepstral coefficients using a source-filter model. Ben Milner, Xu Shao |
| 2002 | Speech synthesis, speech simulation and speech science. Mark A. Huckvale |
| 2002 | Speech to speech translation system for monologues-data driven approach. Hideki Tanaka, Stephen Nightingale, Hideki Kashioka, Kenji Matsumoto, Masamchi Nishiwaki, Tadashi Kumano, Takehiko Maruyama |
| 2002 | Speech watermarking through parametric modeling. Aparna Gurijala, John R. Deller Jr., Michael S. Seadle, John H. L. Hansen |
| 2002 | Speech, music and songs discrimination in the context of handsets variability. Hassan Ezzaidi, Jean Rouat |
| 2002 | Speech-enabled natural language call routing: BBN call director. Premkumar Natarajan, Rohit Prasad, Bernhard Suhm, Daniel McCarthy |
| 2002 | Speech-to-speech translation system evaluation: results for French for the NESPOLE! project first showcase. Solange Rossato, Hervé Blanchon, Laurent Besacier |
| 2002 | Speechfind: an experimental on-line spoken document retrieval system for historical audio archives. Bowen Zhou, John H. L. Hansen |
| 2002 | Spoken dialogue system for home health care. Shinya Takahashi, Tsuyoshi Morimoto, Sakashi Maeda, Naoyuki Tsuruta |
| 2002 | State clustering improvements for continuous HMMs in a Spanish large vocabulary recognition system. Ricardo de Córdoba, Javier Macías Guarasa, Javier Ferreiros, Juan Manuel Montero, José Manuel Pardo |
| 2002 | Statistical adaptation of acoustic models to noise conditions for robust speech recognition. Ángel de la Torre, Dominique Fohr, Jean Paul Haton |
| 2002 | Statistical language modeling with prosodic boundaries and its use for continuous speech recognition. Keikichi Hirose, Nobuaki Minematsu, Makoto Terao |
| 2002 | Statistical machine translation decoder based on phrase. Taro Watanabe, Eiichiro Sumita |
| 2002 | Statistical natural language generation for speech-to-speech machine translation systems. Bowen Zhou, Yuqing Gao, Jeffrey S. Sorensen, Zijian Diao, Michael Picheny |
| 2002 | Statistically based approach to rejection of incorrectly recognized words. Ludek Müller, Tomás Bartos |
| 2002 | Stochastic suprasegmentals: relationship between the spectral characteristics of vowels, redundancy and prosodic structure. Matthew P. Aylett |
| 2002 | Stochastic trajectory model analysis for accent classification. Pongtep Angkititrakul, John H. L. Hansen |
| 2002 | Stop epenthesis at syllable boundaries. Natasha Warner, Andrea Weber |
| 2002 | Structural Gaussian mixture models for efficient text-independent speaker verification. Bing Xiang, Toby Berger |
| 2002 | Studying pronunciation variants in French by using alignment techniques. Philippe Boula de Mareüil, Martine Adda-Decker |
| 2002 | Subband based voice conversion. Oytun Türk, Levent M. Arslan |
| 2002 | Subjective assessment of frequency bands for perception of speaker identity. Eda Ormanci, U. Hakan Nikbay, Oytun Türk, Levent M. Arslan |
| 2002 | Submoraic awareness by Japanese school children: evidence from a novel game. Takashi Otake, Akemi Iijima |
| 2002 | Subset languages for conversing with collaborative interface agents. Candace L. Sidner, Clifton Forlines |
| 2002 | Subspace speech enhancement using subband whitening filter. Jong Uk Kim, Chang D. Yoo |
| 2002 | Suitable design of adaptive beamformer based on average speech spectrum for noisy speech recognition. Takanobu Nishiura, Satoshi Nakamura, Yuka Okada, Takeshi Yamada, Kiyohiro Shikano |
| 2002 | Swallowing and voice effects of lee silverman voice treatment (LSVT). Jeri Logemann, Ralph Sundin, Jean Sundin |
| 2002 | Syllable processing in English. Ruth Kearns, Dennis Norris, Anne Cutler |
| 2002 | Syllable recognition using syllable-segment statistics and syllable-based HMM. Nobutoshi Takahashi, Seiichi Nakagawa |
| 2002 | Syntax over focus. Sun-Ah Jun |
| 2002 | Talking to machines (statistically speaking). Steve J. Young |
| 2002 | Tempo modulations in English: selected pilot study results. Sandra P. Kirkham |
| 2002 | Text-dependent speaker verification using lyapunov exponents. Adriano Petry, Dante Augusto Couto Barone |
| 2002 | The 2001 GMTK-based SPINE ASR system. Özgür Çetin, Harriet J. Nock, Katrin Kirchhoff, Jeff A. Bilmes, Mari Ostendorf |
| 2002 | The 2ch hybrid subtractive beamformer applied to line sound sources. Mitsunori Mizumachi, Satoshi Nakamura |
| 2002 | The AT&T German text-to-speech system: realistic linguistic description. Matthias Jilka, Ann K. Syrdal |
| 2002 | The ISL meeting corpus: the impact of meeting type on speech style. Susanne Burger, Victoria MacLaren, Hua Yu |
| 2002 | The acoustic realization of anger, fear, joy and sadness in Chinese. Jiahong Yuan, Liqin Shen, Fangxin Chen |
| 2002 | The carnegie mellon communicator corpus. Christina L. Bennett, Alexander I. Rudnicky |
| 2002 | The effect of auditory-visual information and orthographic background in L2 acquisition. V. Dogu Erdener, Denis Burnham |
| 2002 | The effects of F0 manipulation on the perceived distance of speech. Douglas Brungart, Alexander J. Kordik, Koel Das, Arnab K. Shaw |
| 2002 | The effects of speech compression on speech recognition and text-to-speech synthesis. Yeshwant K. Muthusamy, Yifan Gong, Roshan Gupta |
| 2002 | The evolution of spoken language: a comparative approach. W. Tecumseh Fitch |
| 2002 | The influence of identification training on identification and production of the american English mid and low vowels by native speakers of Japanese. Stephen G. Lambacher, William L. Martens, Kazuhiko Kakehi |
| 2002 | The influence of speech coding on recognition performance in telecommunication networks. Hans-Günter Hirsch |
| 2002 | The perception of stop consonant sequences in dyslexic and normal children. Noël Nguyen, Ludovic Jankowski, Michel Habib |
| 2002 | The perceptual basis for audiovisual speech integration. Lawrence D. Rosenblum |
| 2002 | The relationship between pure-tone sequential stream segregation and perceptual separation of male and female talkers by listeners with hearing loss. Carol L. Mackersie |
| 2002 | The reliability of the ITU-T P.85 standard for the evaluation of text-to-speech systems. Yolanda Vazquez-Alvarez, Mark A. Huckvale |
| 2002 | The stimulus as basis for audiovisual integration. Eric Vatikiotis-Bateson, Harold Hill, Miyuki Kamachi, Karen Lander, Kevin G. Munhall |
| 2002 | The structure and its implementation of hidden dynamic HMM for Mandarin speech recognition. Feili Chen, Jie Zhu, Wentao Song |
| 2002 | Think big, from voice to limb movement therapy. Becky G. Farley |
| 2002 | Three-dimensional electromagnetic articulograph based on a nonparametric representation of the magnetic field. Tokihiko Kaburagi, Kohei Wakamiya, Masaaki Honda |
| 2002 | Time-compressing natural and synthetic speech. Esther Janse |
| 2002 | Time-frequency transforms and beamforming for speaker recognition. Antonio Satué-Villar, Juan Fernández-Rubio |
| 2002 | Tone recognition in Thai continuous speech based on coarticulation, intonation and stress effects. Nuttakorn Thubthong, Boonserm Kijsirikul, Sudaporn Luksaneeyanawin |
| 2002 | Topic detection of an utterance for speech dialogue processing. Katsushi Asami, Toshiyuki Takezawa, Gen-ichiro Kikui |
| 2002 | Topic tracking using subject templates. Yoshimi Suzuki, Fumiyo Fukumoto, Yoshihiro Sekiguchi |
| 2002 | Towards a grammar of spoken language: incorporating paralinguistic information. Nick Campbell |
| 2002 | Towards an intonation module for a portuguese TTS system. Diamantino Freitas, Daniela Braga |
| 2002 | Towards automatic closed captioning : low latency real time broadcast news transcription. Murat Saraclar, Michael Riley, Enrico Bocchieri, Vincent Goffin |
| 2002 | Towards every-citizen's speech interface: an application generator for speech interfaces to databases. Arthur R. Toth, Thomas K. Harris, James Sanders, Stefanie Shriver, Roni Rosenfeld |
| 2002 | Towards the question: why has speaking rate such an impact on speech recognition performance? Robert Faltlhauser, Günther Ruske, Matthias Thomae |
| 2002 | Training topic classifiers for conversational speech with limited data. Rukmini Iyer, Jeffrey Z. Ma, Herbert Gish, Owen Kimball |
| 2002 | Transducer search space modelings for large-vocabulary speech recognition. Hans J. G. A. Dolfing |
| 2002 | Transform-based feature vector compression for distributed speech recognition. Ben Milner, Xu Shao |
| 2002 | Transformation of spectral envelope for voice conversion based on radial basis function networks. Tomomi Watanabe, Takahiro Murakami, Munehiro Namba, Tetsuya Hoya, Yoshihisa Ishida |
| 2002 | Transmission characteristics of outer ear canal. Karel Pellant, Jan Mejzlík, Karel Prikryl, Zdenek Skvor |
| 2002 | Tree-structured maximum a posteriori adaptation for a segment-based speech recognition system. Irina Illina |
| 2002 | Unconstrained versus constrained acoustic normalisation in confidence scoring. Jacques Duchateau, Patrick Wambacq |
| 2002 | Unified task knowledge for spoken language understanding and dialog management. Jerry H. Wright, Alicia Abella, Allen L. Gorin |
| 2002 | Unknown-multiple speaker clustering using HMM. Jitendra Ajmera, Hervé Bourlard, I. Lapidot, Iain McCowan |
| 2002 | Unsupervised acoustic model adaptation based on phoneme error minimization. Jun Ogata, Yasuo Ariki |
| 2002 | Unsupervised language model adaptation for lecture speech transcription. Thomas Niesler, Daniel Willett |
| 2002 | Unsupervised n-best based model adaptation using model-level confidence measures. Ka-Yan Kwan, Tan Lee, Chen Yang |
| 2002 | Unsupervised speaker segmentation of telephone conversations. Aaron E. Rosenberg, Allen L. Gorin, Zhu Liu, Sarangarajan Parthasarathy |
| 2002 | User-customized password speaker verification based on HMM/ANN and GMM models. Mohamed Faouzi BenZeghiba, Hervé Bourlard |
| 2002 | User-tailored generation for spoken dialogue: an experiment. Amanda Stent, Marilyn A. Walker, Steve Whittaker, Preetam Maloor |
| 2002 | Using EM-trained string-edit distances for approximate matching of acoustic morphemes. Michael Levit, Elmar Nöth, Allen L. Gorin |
| 2002 | Using adaptive signal limiter together with weighting techniques for noisy speech recognition. Wei-Wen Hung |
| 2002 | Using cross-language cues for story-specific language modeling. Sanjeev Khudanpur, Woosung Kim |
| 2002 | Using dynamic WFST composition for recognizing broadcast news. Diamantino Caseiro, Isabel Trancoso |
| 2002 | Using observation uncertainty in HMM decoding. Jon A. Arrowood, Mark A. Clements |
| 2002 | Using part-of-speech tags, context thresholding, and trigram contexts to improve the auto-induction of semantic classes. Andrew N. Pargellis, Eric Fosler-Lussier, Augustine Tsai |
| 2002 | Using start/end timings of spectral transitions between phonemes in concatenative speech synthesis. Toshio Hirai, Seiichi Tenpaku, Kiyohiro Shikano |
| 2002 | Using time-stretched pulses for accurate splitting of speech utterances played back in noisy reverberant environments. Dorothea Kolossa, Qiang Huo |
| 2002 | Using x-grams for speech-to-speech translation. Adrià de Gispert, José B. Mariño |
| 2002 | Utterance verification based on neighborhood information and Bayes factors. Hui Jiang, Chin-Hui Lee |
| 2002 | Validation and improvement of automatic phonetic transcriptions. Catia Cucchiarini, Diana Binnenpoorte |
| 2002 | Variability in direction of dorsal movement during production of /l/. Natasha Warner, Allard Jongman, Doris Mücke |
| 2002 | Variability in the production of glottalized sonorants: data from yapese. Ian Maddieson, Julie Larson |
| 2002 | VisSTA: a tool for analyzing multimodal discourse data. Francis K. H. Quek, Yang Shi, Cemil Kirbas, Shunguang Wu |
| 2002 | Vocabulary independent OOV detection using support vector machines. Tommi Lahti, Janne Suontausta |
| 2002 | Vocalization age as a clinical tool. Harriet J. Fell, Joel MacAuslan, Linda J. Ferrier, Susan G. Worst, Karen Chenausky |
| 2002 | Voice transformations for improving children's speech recognition in a publicly available dialogue system. Joakim Gustafson, Kåre Sjölander |
| 2002 | Vowel classification for computer-based visual feedback for speech training for the hearing impaired. Stephen A. Zahorian, A. Matthew Zimmer, Fansheng Meng |
| 2002 | Warped-LP residual resampling using DCT for pitch modification. R. Muralishankar, A. G. Ramakrishnan, P. Prathibha |
| 2002 | Weighted graph based decision tree optimization for high accuracy acoustic modeling. Sheng Gao, Jinsong Zhang, Satoshi Nakamura, Chin-Hui Lee, Tat-Seng Chua |
| 2002 | What relationship between protrusion anticipation and auditory perception? Rudolph Sock, Béatrice Vaxelaire, Véronique Hecker, Fabrice Hirsch |
| 2002 | Wizard of oz evaluation of a dialogue with communicator system in Chile. Néstor Becerra Yoma, Angela Cortés, Mauricio Hormazábal, Enrique López |
| 2002 | Word endpoints detection in the presence of non-stationary noise. Mario Toma, Andrea Lodi, Roberto Guerrieri |
| 2002 | X-JToBI: an extended J-ToBI for spontaneous speech. Kikuo Maekawa, Hideaki Kikuchi, Yosuke Igarashi, Jennifer J. Venditti |