INTERSPEECH A

679 papers

Year Title / Authors
2002 10 years of PhonDat-II: a reassessment.
Hartmut R. Pfitzinger
2002 2-d processing of speech with application to pitch estimation.
Thomas F. Quatieri
2002 7th International Conference on Spoken Language Processing, ICSLP2002 - INTERSPEECH 2002, Denver, Colorado, USA, September 16-20, 2002
John H. L. Hansen, Bryan L. Pellom
2002A Gaussian selection method for multi-mixture HMM based continuous speech recognition.
Raymond H. Lee, Eric H. C. Choi
2002A case study of Portuguese and English bilinguality.
Luis M. T. Jesus, Christine H. Shadle
2002A combined model of statics-dynamics of speech optimized using maximum mutual information.
Zhijian Ou, Zuoying Wang
2002A comparative study of adaptation methods for speaker verification.
Johnny Mariéthoz, Samy Bengio
2002A comparative study of approximations for parallel model combination of static and dynamic parameters.
Yifan Gong
2002A comparison between feedback strategies in human-to-human and human-machine communication.
Loredana Cerrato
2002A comparison of HTK, ISIP and Julius in Slovenian large vocabulary continuous speech recognition.
Tomaz Rotovnik, Mirjam Sepesy Maucec, Bogomir Horvat, Zdravko Kacic
2002A comparison of L1 and African-mother-tongue acoustic models for South African English speech recognition.
Janus D. Brink, Elizabeth C. Botha
2002A comparison of four language models for large vocabulary Turkish speech recognition.
Helin Dutagaci, Levent M. Arslan
2002A comparison of front-end analyses for Thai speech recognition.
Montri Karnjanadecha, Patimakorn Kimsawad
2002A comparison of two LVR search optimization techniques.
Stephan Kanthak, Hermann Ney, Michael Riley, Mehryar Mohri
2002A confidence measure based on agreement among multiple LVCSR models - correlation between pair of acoustic models and confidence.
Takehito Utsuro, Tetsuji Harada, Hiromitsu Nishizaki, Seiichi Nakagawa
2002A context clustering technique for average voice model in HMM-based speech synthesis.
Junichi Yamagishi, Masatsune Tamura, Takashi Masuko, Keiichi Tokuda, Takao Kobayashi
2002A copy synthesis method to pilot the Klatt synthesiser.
Yves Laprie, Anne Bonneau
2002A corpus-based study of Danish laryngealization.
Kathleen Murray, Betina Simonsen
2002A data-driven approach to source-formant type text-to-speech system.
Hiroki Mori, Takahiro Ohtsuka, Hideki Kasuya
2002A distributed multimodal dialogue system based on dialogue system and web convergence.
Feng Liu, Antoine Saad, Li Li, Wu Chou
2002A figure of merit for the analysis of spoken dialog systems.
Kadri Hacioglu, Wayne H. Ward
2002A flexible stream architecture for ASR using articulatory features.
Florian Metze, Alex Waibel
2002A handset identifier using support vector machines.
Purdy Ho
2002A hybrid HMM/TRAPS model for robust voice activity detection.
Brian Kingsbury, Pratibha Jain, André Gustavo Adami
2002A hybrid approach to compounds in LVCSR.
Tom Laureys, Vincent Vandeghinste, Jacques Duchateau
2002A link between cepstral shrinking and the weighted product rule in audio-visual speech recognition.
Simon Lucey, Sridha Sridharan, Vinod Chandran
2002A low-resource, miniature implementation of the ETSI distributed speech recognition front-end.
Etienne Cornu, Hamid Sheikhzadeh, Robert L. Brennan
2002A maximum entropy semantic parser using word classes.
Norbert Pfannerer
2002A method for evaluating incremental utterance understanding in spoken dialogue systems.
Ryuichiro Higashinaka, Noboru Miyazaki, Mikio Nakano, Kiyoaki Aikawa
2002A miniature Chinese TTS system based on tailored corpus.
Zhiwei Shuang, Yu Hu, Zhen-Hua Ling, Ren-Hua Wang
2002A modality-independent MMI system architecture.
Kouichi Katsurada, Yoshihiko Ootani, Yusaku Nakamura, Satoshi Kobayashi, Hirobumi Yamada, Tsuneo Nitta
2002A multi-class approach for modelling out-of-vocabulary words.
Issam Bazzi, James R. Glass
2002A new approach to speech enhancement by a microphone array using EM and mixture models.
Hagai Attias, Li Deng
2002A new computer-based analytical speech perception test for prelingually deaf children and children with speech disorders.
Anne-Marie Öster
2002A new lexicon optimization method for LVCSR based on linguistic and acoustic characteristics of words.
Takahiro Shinozaki, Sadaoki Furui
2002A new method for testing dialogue systems based on simulations of real-world conditions.
Ramón López-Cózar, Ángel de la Torre, José C. Segura, Antonio J. Rubio, Juan M. López-Soler
2002A new method of building decision tree based on target information.
Yi-Jian Wu, Yu Hu, Xiaoru Wu, Ren-Hua Wang
2002A perceptually motivated subspace approach for speech enhancement.
Yi Hu, Philipos C. Loizou
2002A phoneme recognizer for the hearing impaired.
Mathias Johansson, Mats Blomberg, Kjell Elenius, Lars-Erik Hoffsten, Anders Torberger
2002A phonetic study of Vietnamese tones: acoustic and electroglottographic measurements.
Vu Ngoc Tuan, Christophe d'Alessandro, Sophie Rosset
2002A portable, server-side dialog framework for VoiceXML.
Bob Carpenter, Sasha Caskey, Krishna Dayanidhi, Caroline Drouin, Roberto Pieraccini
2002A pragmatic confirmation mechanism for an object-based spoken dialogue manager.
Ian M. O'Neill, Michael F. McTear
2002A psychoacoustic basis for spectral sharpening.
Peggy B. Nelson, Jeffrey J. DiGiovanni, Robert S. Schlauch
2002A real-time acoustic human-machine front-end for multimedia applications integrating robust adaptive beamforming and stereophonic acoustic echo cancellation.
Wolfgang Herbordt, J. Ying, Herbert Buchner, Walter Kellermann
2002A reverse Turing test using speech.
Greg Kochanski, Daniel P. Lopresti, Chilin Shih
2002A sound source classification system based on subband processing.
Oytun Türk, Ömer Sayli, Helin Dutagaci, Levent M. Arslan
2002A sparse modeling approach to speech recognition based on relevance vector machines.
Jonathan E. Hamaker, Joseph Picone, Aravind Ganapathiraju
2002A spatio-temporal speech enhancement scheme for robust speech recognition.
Erik M. Visser, Manabu Otsuka, Te-Won Lee
2002A state-tying approach to building syllable HMMs.
Darryl Stewart, Ming Ji, Philip Hanna, Francis Jack Smith
2002A statistically motivated database pruning technique for unit selection synthesis.
Peter Rutten, Matthew P. Aylett, Justin Fackrell, Paul Taylor
2002A study of multi-speaker dialogue system for mobile information retrieval.
Hsien-Chang Wang, Chieh-Yi Huang, Chung-Hsien Yang, Jhing-Fa Wang
2002A study of the two-mass model in terms of acoustic parameters.
Denisse Sciamarella, Christophe d'Alessandro
2002A study on the classification of whispered and normally phonated speech.
Stanley J. Wenndt, Edward J. Cupples, Richard M. Floyd
2002A system that learns to describe objects in visual scenes.
Deb Roy
2002A text-to-speech synthesis system for Telugu.
Jithendra Vepa, Jahnavi Ayachitam, K. V. K. Kalpana Reddy
2002A trainable spoken language understanding system for visual object selection.
Deb Roy, Peter Gorniak, Niloy Mukherjee, Joshua Juster
2002A training prompts generation algorithm for connected spoken word recognition.
Ha-Jin Yu, Jin Suk Kim
2002ACIMET: access to meteorological information by telephone.
Jaume Padrell, Javier Hernando
2002ACT: a graphical dialogue annotation comparison tool.
Fan Yang, Susan E. Strayer, Peter A. Heeman
2002ASR dependent techniques for speaker identification.
Alex Park, Timothy J. Hazen
2002ASR in a human word recognition model: generating phonemic input for shortlist.
Odette Scharenborg, Lou Boves, Johan de Veth
2002AT&T help desk.
Giuseppe Di Fabbrizio, Dawn Dutton, Narendra K. Gupta, Barbara Hollister, Mazin G. Rahim, Giuseppe Riccardi, Robert E. Schapire, Juergen Schroeter
2002Absolute pitch and lexical tones: tone perception by non-musician, musician, and absolute pitch non-tonal language speakers.
Denis K. Burnham, Ron Brooker
2002Access to homophonic meanings during spoken language comprehension: effects of context and neighborhood density.
Michael C. W. Yip
2002Accounting for perceptual identification of consonants and vowels through acoustic dissimilarity.
Jianxia Xue, Sumiko Takayanagi, Lynne E. Bernstein
2002Accumulated Kullback divergence for analysis of ASR performance in the presence of noise.
Febe de Wet, Johan de Veth, Bert Cranen, Lou Boves
2002Acoustic and word lattice based algorithms for confidence scores.
Daniele Falavigna, Roberto Gretter, Giuseppe Riccardi
2002Acoustic correlates of task load and stress.
Klaus R. Scherer, Didier Grandjean, Tom Johnstone, Gudrun Klasmeyer, Thomas Bänziger
2002Acoustic echo cancellation based on m-channel IIR cosine-modulated filter bank.
Sang-Gyun Kim, Chang D. Yoo
2002Acoustic measures vs. phonetic features as predictors of audible discontinuity in concatenative speech synthesis.
Hisashi Kawai, Minoru Tsuzaki
2002Acoustic modeling of sentence stress using differential features between syllables for English rhythm learning system development.
Nobuaki Minematsu, Satoshi Kobashikawa, Keikichi Hirose, Donna Erickson
2002Acoustic-to-articulatory inverse mapping using an HMM-based speech production model.
Sadao Hiroya, Masaaki Honda
2002Acoustical correlates to SD ratings of speaker characteristics in two speaking styles.
Yasuki Yamashita, Hiroshi Matsumoto
2002Active speech cancellation for cellular speech.
Kazuhiro Kondo, Kiyoshi Nakagawa
2002Adaptation of users' spoken dialogue patterns in a conversational interface.
Courtney Darves, Sharon L. Oviatt
2002Adaptive estimation of time-varying features from high-pitched speech based on an excitation source HMM.
Akira Sasou, Kazuyo Tanaka
2002Adaptive model combination for dynamic speaker selection training.
Chao Huang, Tao Chen, Eric Chang
2002Adding intelligent help to mixed-initiative spoken dialogue systems.
Genevieve Gorrell, Ian Lewin, Manny Rayner
2002Algorithms for distributed speech recognition in a noisy automobile environment.
Hong Kook Kim, Richard C. Rose
2002All-pole modeling of wide-band speech using weighted sum of the LSP polynomials.
Paavo Alku, Tom Bäckström
2002Amplitude convergence in children's conversational speech with animated personas.
Rachel Coulston, Sharon L. Oviatt, Courtney Darves
2002An EPG therapy protocol for remediation and assessment of articulation disorders.
Alan Wrench, Fiona Gibbon, Alison M. McNeill, Sara Wood
2002An IPA vowel diagram approach to analysing L1 effects on vowel production and perception.
Olga I. Dioubina, Hartmut R. Pfitzinger
2002An acoustic comparison between American English and Australian English vowels.
Kimiko Tsukada
2002An adaptive speaker verification system with speaker dependent a priori decision thresholds.
Nikki Mirghafori, Larry P. Heck
2002An analysis of the causes of increased error rates in children's speech recognition.
Qun Li, Martin J. Russell
2002An analysis of transcription consistency in spontaneous speech from the Buckeye corpus.
William D. Raymond, Mark A. Pitt, Keith Johnson, Elizabeth Hume, Matthew J. Makashay, Robin Dautricourt, Craig Hilts
2002An architecture for a multi-modal web browser.
Cristiana Armaroli, Ivano Azzini, Lorenza Ferrario, Toni Giorgino, Luca Nardelli, Marco Orlandi, Carla Rognoni
2002An audio-visual corpus for multimodal speech recognition in Dutch language.
Jacek C. Wojdel, Pascal Wiggers, Léon J. M. Rothkrantz
2002An automatic sentence boundary detector based on a structured language model.
Shinsuke Mori
2002An education software in teaching automatic speech recognition (ASR).
Hong Kai Sze, Sh-Hussain Salleh
2002An effect of amplitude modulation on perceptual segregation of tone sequences.
Mamoru Iwaki, Hiromi Seki
2002An effective unsupervised scheme for multiple-speaker-change detection.
P. Sivakumaran, Aladdin M. Ariyaeeinia, J. Fortuna
2002An efficient algorithm for the n-best-strings problem.
Mehryar Mohri, Michael Riley
2002An efficient dialogue control method using decision tree-based estimation of out-of-vocabulary word attributes.
Yasuhiro Takahashi, Kohji Dohsaka, Kiyoaki Aikawa
2002An environment compensated minimum classification error training approach and its evaluation on Aurora2 database.
Jian Wu, Qiang Huo
2002An evaluation of using mutual information for selection of acoustic-features representation of phonemes for speech recognition.
Mohamed Kamal Omar, Ken Chen, Mark Hasegawa-Johnson, Yigal Brandman
2002Analysis and synthesis of the phonatory excitation signal by means of a pair of polynomial shaping functions.
Jean Schoentgen
2002Analysis of user behavior under error conditions in spoken dialogs.
Jongho Shin, Shrikanth S. Narayanan, Laurie Gerber, Abe Kazemzadeh, Dani Byrd
2002Application of microprosody models in text to speech synthesis.
Phuay Hui Low, Saeed Vaseghi
2002Application of over-complete blind source separation for robust automatic speech recognition.
Shubha Kadambe
2002Application of real-time AMDF pitch-detection in a voice gender normalisation system.
E. Jung, A. Th. Schwarzbacher, K. Humphreys, Robert Lawlor
2002Application of the Lee Silverman Voice Treatment (LSVT) to individuals with multiple sclerosis, ataxic dysarthria, and stroke.
Leslie Will, Lorraine O. Ramig, Jennifer L. Spielman
2002Applying a hybrid intonation model to a seamless speech synthesizer.
Takashi Saito, Masaharu Sakamoto
2002Applying fallback to prosodic unit selection from a small imitation database.
Joram Meron
2002Approaches to language identification using Gaussian mixture models and shifted delta cepstral features.
Pedro A. Torres-Carrasquillo, Elliot Singer, Mary A. Kohler, Richard J. Greene, Douglas A. Reynolds, John R. Deller Jr.
2002Arc minimization in finite state decoding graphs with cross-word acoustic context.
Geoffrey Zweig, George Saon, François Yvon
2002Assessment of consonant articulation in glossectomee speech by dynamic MRI.
Katalin Mády, Robert Sader, Alexander Zimmermann, Philip Hoole, Ambros Beer, Hans-Florian Zeilhofer, Ch. Hannig
2002Audio-visual continuous speech recognition using a coupled hidden Markov model.
Xiaoxing Liu, Yibao Zhao, Xiaobo Pi, Luhong Liang, Ara V. Nefian
2002Audio-visual scene analysis: evidence for a "very-early" integration process in audio-visual speech perception.
Jean-Luc Schwartz, Frédéric Berthommier, Christophe Savariaux
2002Audio-visual speech enhancement with AVCDCN (audio-visual codebook dependent cepstral normalization).
Sabine Deligne, Gerasimos Potamianos, Chalapathy Neti
2002Audio-visual speech sources separation: a new approach exploiting the audio-visual coherence of speech stimuli.
David Sodoyer, Laurent Girin, Christian Jutten, Jean-Luc Schwartz
2002Audiovisual integration of speech by children and adults with cochlear implants.
Karen Iler Kirk, David B. Pisoni, Lorin Lachs
2002Audiovisual perception in L2 learners.
Valérie Hazan, Anke Sennema, Andrew Faulkner
2002Audiovisual speech synthesis: from ground truth to models.
Gérard Bailly
2002Auditory fovea based speech enhancement and its application to human-robot dialog system.
Kazuhiro Nakadai, Hiroshi G. Okuno, Hiroaki Kitano
2002Auditory-visual speech perception examined by brain imaging and reaction time.
Kaoru Sekiyama, Yoichi Sugita
2002Automatic concept identification in goal-oriented conversations.
Ananlada Chotimongkol, Alexander I. Rudnicky
2002Automatic enrollment for speaker authentication.
Qi Li, Hui Jiang, Qiru Zhou, Jinsong Zheng
2002Automatic extraction of model parameters from fundamental frequency contours of English utterances.
Shuichi Narusawa, Nobuaki Minematsu, Keikichi Hirose, Hiroya Fujisaki
2002Automatic generation of phonetic transcriptions for large speech corpora.
Kris Demuynck, Tom Laureys, Steven Gillis
2002Automatic intelligibility assessment and diagnosis of critical pronunciation errors for computer-assisted pronunciation learning.
Antoine Raux, Tatsuya Kawahara
2002Automatic language identification using acoustic sub-word units.
A. K. V. Sai Jayram, V. Ramasubramanian, T. V. Sreenivas
2002Automatic phoneme alignment based on acoustic-phonetic modeling.
John-Paul Hosom
2002Automatic prosodic break labeling for Mandarin Chinese speech data.
Minghui Dong, Kim-Teng Lua
2002Automatic punctuation and disfluency detection in multi-party meetings using prosodic and lexical cues.
Don Baron, Elizabeth Shriberg, Andreas Stolcke
2002Automatic recognition of Dutch dysarthric speech: a pilot study.
Eric Sanders, Marina B. Ruiter, Lilian Beijer, Helmer Strik
2002Automatic segmentation combining an HMM-based approach and spectral boundary correction.
Yeon-Jun Kim, Alistair Conkie
2002Automatic sign translation.
Ying Zhang, Bing Zhao, Jie Yang, Alex Waibel
2002Automatic transcription of courtroom speech.
Rohit Prasad, Long Nguyen, Richard M. Schwartz, John Makhoul
2002Automatic user-adaptive speaking rate selection for information delivery.
Nigel Ward, Satoshi Nakagawa
2002Auxiliary variables in conditional Gaussian mixtures for automatic speech recognition.
Todd A. Stephenson, Mathew Magimai-Doss, Hervé Bourlard
2002Backoff hierarchical class n-gram language modelling for automatic speech recognition systems.
Imed Zitouni, Olivier Siohan, Hong-Kwang Jeff Kuo, Chin-Hui Lee
2002Baldini: Baldi speaks Italian!
Piero Cosi, Michael M. Cohen, Dominic W. Massaro
2002Bark resolution from speech data.
Naren Malayath, Hynek Hermansky
2002Basque intonation modelling for text to speech conversion.
Eva Navas, Inmaculada Hernáez, Juan María Sánchez
2002Basurde[lite], a machine-driven dialogue system for accessing railway timetable information.
Roger Trias-Sanz, José B. Mariño
2002Belief network based disambiguation of object reference in spoken dialogue system for robot.
Yoko Yamakata, Tatsuya Kawahara, Hiroshi G. Okuno
2002Bell Labs approach to Aurora evaluation on connected digit recognition.
Jingdong Chen, Dimitris Dimitriadis, Hui Jiang, Qi Li, Tor André Myrvoll, Olivier Siohan, Frank K. Soong
2002Benefit and cost analysis of using the improved vector quantizer design algorithm for glottal source waveform compression.
Peter Veprek, Alan B. Bradley
2002Bilingual corpus cleaning focusing on translation literality.
Kenji Imamura, Eiichiro Sumita
2002Blind normalization of speech from different channels and speakers.
David N. Levin
2002Bridges: regions between discourse segments.
Nanette Veilleux
2002Building an ASR system for noisy environments: SRI's 2001 SPINE evaluation system.
Venkata Ramana Rao Gadde, Andreas Stolcke, Dimitra Vergyri, Jing Zheng, M. Kemal Sönmez, Anand Venkataraman
2002Building VoiceXML-based applications.
Christina L. Bennett, Ariadna Font Llitjós, Stefanie Shriver, Alexander I. Rudnicky, Alan W. Black
2002CU VOCAL: corpus-based syllable concatenation for Chinese speech synthesis across domains and dialects.
Helen M. Meng, Chi-Kin Keung, Kai-Chung Siu, Tien Ying Fung, P. C. Ching
2002CU animate tools for enabling conversations with animated characters.
Jiyong Ma, Jie Yan, Ronald A. Cole
2002Can confidence scores help users post-editing speech recognizer output?
Taku Endo, Nigel Ward, Minoru Terada
2002Channel error protection scheme for distributed speech recognition.
Zheng-Hua Tan, Paul Dalsgaard
2002Channel noise robustness for low-bitrate remote speech recognition.
Alexis Bernard, Abeer Alwan
2002Characteristics of a low reject mode speaker verification system.
Daniel Elenius, Mats Blomberg
2002Chinese spoken language analyzing based on combination of statistical and rule methods.
Guodong Xie, Chengqing Zong, Bo Xu
2002Choosing speech or touchtone modality for navigation within a telephony natural language system.
Jennifer C. Lai, Kwan Min Lee
2002Classification error from the theoretical Bayes classification risk.
Erik McDermott, Shigeru Katagiri
2002Cluster identification for speaker-environment tracking.
J. T. Wickramaratna, Philip C. Woodland
2002Clustering and feature learning based F0 prediction for Chinese speech synthesis.
Jianhua Tao, Lianhong Cai
2002Codebook dependent dynamic channel estimation for Mandarin speech recognition over telephone.
Huayun Zhang, Zhaobing Han, Bo Xu
2002Coding speech at very low rates using STRAIGHT and temporal decomposition.
Phu Chien Nguyen, Takao Ochi, Masato Akagi
2002Collecting mobile multimodal data for MATCH.
Patrick Ehlen, Michael Johnston, Gunaranjan Vasireddy
2002Combination of pause and F0 information in dependency analysis of Japanese sentences.
Kazuyuki Takagi, Hajime Kubota, Kazuhiko Ozeki
2002Combination of statistical and rule-based approaches for spoken language understanding.
Ye-Yi Wang, Alex Acero, Ciprian Chelba, Brendan J. Frey, Leon Wong
2002Combined binary classifiers with applications to speech recognition.
Aldebaro Klautau, Nikola Jevtic, Alon Orlitsky
2002Combined prosody and candidate unit selections for corpus-based text-to-speech systems.
Francisco Campillo Díaz, Eduardo Rodríguez Banga
2002Combining a Gaussian mixture model front end with MFCC parameters.
Matthew N. Stuttle, Mark J. F. Gales
2002Combining acoustic and language information for emotion recognition.
Chul Min Lee, Shrikanth S. Narayanan, Roberto Pieraccini
2002Combining information sources for memory-based pitch accent placement.
Erwin Marsi, Bertjan Busser, Walter Daelemans, Véronique Hoste, Martin Reynaert, Antal van den Bosch
2002Combining lexical and morphological knowledge in language model for inflectional (Czech) language.
Jan Nouza, Jindra Drabkova
2002Combining maximum likelihood and maximum a posteriori estimation for detailed acoustic modeling of context dependency.
Michiel Bacchiani
2002Combining search spaces of heterogeneous recognizers for improved speech recognition.
Xiang Li, Rita Singh, Richard M. Stern
2002Combining speaker and speech recognition systems.
Larry P. Heck, Dominique Genoud
2002Comfort noise detection and GSM-FR-codec detection for speech-quality evaluations in telephone networks.
Thorsten Ludwig
2002Compact subnetwork-based large vocabulary continuous speech recognition.
Dong-Hoon Ahn, Minhwa Chung
2002Comparative evaluation of CASA and BSS models for subband cocktail-party speech separation.
Frédéric Berthommier, Seungjin Choi
2002Comparative experiments to evaluate the use of auditory-based acoustic distinctive features and formant cues for automatic speech recognition using a multi-stream paradigm.
Hesham Tolba, Sid-Ahmed Selouani, Douglas D. O'Shaughnessy
2002Comparing intelligibility of several non-native accent classes in noise.
Shawn A. Weil
2002Comparing isolately spoken keywords with spontaneously spoken queries for Japanese spoken document retrieval.
Hiromitsu Nishizaki, Seiichi Nakagawa
2002Comparison and combination of RASTA-PLP and FF features in a hybrid HMM/MLP speech recognition system.
Pere Pujol Marsal, Susagna Pol, Astrid Hagen, Hervé Bourlard, Climent Nadeu
2002Comparison of acoustic distance measures for automatic cross-language phoneme mapping.
Jayren J. Sooful, Elizabeth C. Botha
2002Compensating for hyperarticulation by modeling articulatory properties.
Hagen Soltau, Florian Metze, Alex Waibel
2002Compensation of channel effect on line spectrum frequencies.
An-Tze Yu, Hsiao-Chuan Wang
2002Comprehension of non-native speech: inaccurate phoneme processing and activation of lexical competitors.
Mirjam Broersma
2002Computationally efficient method of speech enhancement based on block representation of signal in state space and vector quantization.
Vasyl Semenov, Alexander Kovtonyuk, Alexander Kalyuzhny
2002Computationally efficient noise compensation for robust automatic speech recognition assessed under the Aurora 2/3 framework.
Nicholas W. D. Evans, John S. D. Mason
2002Computationally efficient time-scale modification of speech using 3 level clipping.
Sung-Joo Lee, Hyung Soon Kim
2002Computer-assisted second-language speech learning: generalization of prosody-focused training.
Debra M. Hardison
2002Confidence metrics for speaker identification.
Mark C. Huggins, John J. Grieco
2002Confusion-based query expansion for OOV words in spoken document retrieval.
Beth Logan, Jean-Manuel Van Thong
2002Constructing shared-state hidden Markov models based on a Bayesian approach.
Shinji Watanabe, Yasuhiro Minami, Atsushi Nakamura, Naonori Ueda
2002Constructing small language models from grammars.
Francis Picard, Dominique Boucher, Guy Lapalme
2002Construction of decision tree from data driven clustering.
Junho Park, Hanseok Ko
2002Contextual effects in the perception of fricative place of articulation: a rotational hypothesis.
Willy Serniclaes, René Carré
2002Contextual effects on voicing judgment of stop consonants in Japanese.
Makiko Aoyagi
2002Continuous environmental adaptation of a speech recogniser in telephone line conditions.
Carlos M. G. S. Lima, Luís B. Almeida, João L. Monteiro
2002Contribution to topic identification by using word similarity.
Armelle Brun, Kamel Smaïli, Jean Paul Haton
2002Control system for talking robot to replicate articulatory movement of natural speech.
Takemi Mochida, Masaaki Honda, Kouki Hayashi, Toshiharu Kuwae, Kunihiro Tanahashi, Kazufumi Nishikawa, Atsuo Takanishi
2002Controling anticipatory behavior for rounding in French cued speech.
Virginie Attina, Marie-Agnès Cathiard, Denis Beautemps
2002Controlling perceived degradation in spectrum envelope modeling via predistortion.
Pushkar Patwardhan, Preeti Rao
2002Coordination of hand and orofacial movements for CV sequences in French cued speech.
Virginie Attina, Denis Beautemps, Marie-Agnès Cathiard
2002Coordination of referring expressions in multimodal human-computer dialogue.
Gabriel Skantze
2002Corpus-based analysis of English spoken by Japanese students in view of the entire phonemic system of English.
Nobuaki Minematsu, Gakuto Kurata, Keikichi Hirose
2002DARPA Communicator evaluation: progress from 2000 to 2001.
Marilyn A. Walker, Alexander I. Rudnicky, John S. Aberdeen, Elizabeth Owen Bratt, John S. Garofolo, Helen Wright Hastie, Audrey N. Le, Bryan L. Pellom, Alexandros Potamianos, Rebecca J. Passonneau, Rashmi Prasad, Salim Roukos, Gregory A. Sanders, Stephanie Seneff, David Stallard
2002DARPA Communicator: cross-system results for the 2001 evaluation.
Marilyn A. Walker, Alexander I. Rudnicky, Rashmi Prasad, John S. Aberdeen, Elizabeth Owen Bratt, John S. Garofolo, Helen Wright Hastie, Audrey N. Le, Bryan L. Pellom, Alexandros Potamianos, Rebecca J. Passonneau, Salim Roukos, Gregory A. Sanders, Stephanie Seneff, David Stallard
2002DCT-based video features for audio-visual speech recognition.
Martin Heckmann, Kristian Kroschel, Christophe Savariaux, Frédéric Berthommier
2002DETAC: a discriminative criterion for speaker verification.
Jirí Navrátil, Ganesh N. Ramaswamy
2002Data, annotation schemes and coding tools for natural interactivity.
Laila Dybkjær, Niels Ole Bernsen
2002Data-driven segment preselection in the IBM trainable speech synthesis system.
Wael Hamza, Robert E. Donovan
2002Data-driven temporal filters obtained via different optimization criteria evaluated on Aurora2 database.
Jeih-weih Hung, Lin-Shan Lee
2002Data-driven vector clustering for low-memory footprint ASR.
Karim Filali, Xiao Li, Jeff A. Bilmes
2002Decision tree distribution tying based on a dimensional split technique.
Heiga Zen, Keiichi Tokuda, Tadashi Kitamura
2002Design for a speech-to-speech translator for field use.
David Stallard, Premkumar Natarajan, Mohammed Noamany, Richard M. Schwartz, John Makhoul
2002Design of a Mandarin sentence set for corpus-based speech synthesis by use of a multi-tier algorithm taking account of the varied prosodic and spectral characteristics.
Jinfu Ni, Hisashi Kawai
2002Design of an audio-visual speech corpus for the Czech audio-visual speech synthesis.
Milos Zelezný, Petr Císar, Zdenek Krnoul, Jan Novák
2002Design of system-initiated digressive proposals for automated banking dialogues.
Jenny Wilkie, Mervyn A. Jack, Peter J. Littlewood
2002Designing Japanese speech database covering wide range in prosody for hybrid speech synthesizer.
Hiromichi Kawanami, Tsuyoshi Masuda, Tomoki Toda, Kiyohiro Shikano
2002Designing a speaker-discriminative adaptive filter bank for speaker recognition.
Tomi Kinnunen
2002Detection and recognition of repaired speech on misrecognized utterances for speech input of car navigation system.
Naoko Kakutani, Norihide Kitaoka, Seiichi Nakagawa
2002Development of Japanese infant speech database and speaking rate analysis.
Shigeaki Amano, Kazumi Kato, Tadahisa Kondo
2002Development of a GUI-based articulatory speech synthesis system.
Kohichi Ogata, Yorinobu Sonoda
2002Discrimination of English vowels in consonantal contexts by native speakers of Japanese and its relations to dynamic information of formants.
Akiyo Joto, Motohisa Imaishi, Yoshiki Nagase, Seiya Funatsu
2002Discriminative linear transforms for feature normalization and speaker adaptation in HMM estimation.
Stavros Tsakalidis, Vlasios Doumpiotis, William Byrne
2002Discriminative training for call classification and routing.
Hong-Kwang Jeff Kuo, Chin-Hui Lee, Imed Zitouni, Eric Fosler-Lussier, Egbert Ammicht
2002Distributed Chinese keyword spotting and verification for spoken dialogues under wireless environment.
Yun-Tien Lee, Cheng-Huang Wu, Yumin Lee, Lin-Shan Lee
2002Distributed audio-visual speech synchronization.
Peter Poller, Jochen Müller
2002Distributed speech recognition over IP networks on the Aurora 3 database.
Laura Docío Fernández, Carmen García-Mateo
2002Distributed speech recognition using noise-robust MFCC and TRAPS-estimated manner features.
Pratibha Jain, Hynek Hermansky, Brian Kingsbury
2002Divergence-based out-of-class rejection for telephone handset identification.
Chi-Leung Tsang, Man-Wai Mak, Sun-Yuan Kung
2002Double the trouble: handling noise and reverberation in far-field automatic speech recognition.
David Gelbart, Nelson Morgan
2002Duration and F0 as perceptual cues to Japanese vowel quantity.
Keisuke Kinoshita, Dawn M. Behne, Takayuki Arai
2002Duration modeling for Arabic text to speech synthesis.
Yasser Hifny, Mohsen A. Rashwan
2002Duration related phase realignment of Thai tones.
John J. Ohala, Rungpat Roengpitya
2002Dutch HLT resources: from BLARK to priority lists.
Helmer Strik, Walter Daelemans, Diana Binnenpoorte, Janienke Sturm, Folkert de Vriend, Catia Cucchiarini
2002Dynamic search-space pruning for time-constrained speech recognition.
Sascha Wendt, Gernot A. Fink, Franz Kummert
2002Dynamic tuning of language model score in speech recognition using a confidence measure.
Sherif M. Abdou, Michael S. Scordilis
2002E-mail goes mobile: the design and implementation of a spoken language interface to e-mail.
Daniela Oria, Esa Koskinen
2002EM training of finite-state transducers and its application to pronunciation modeling.
Han Shu, I. Lee Hetherington
2002Effect of F0 fluctuation and amplitude modulation of natural vowels on vowel identification in noisy environments.
Kentaro Ishizuka, Kiyoaki Aikawa
2002Effects of intra-phrase position on acceptability of changes in segmental duration in sentence speech.
Makiko Muto, Hiroaki Kato, Minoru Tsuzaki, Yoshinori Sagisaka
2002Effects of production training with visual feedback on the acquisition of Japanese pitch and durational contrasts.
Yukari Hirata
2002Effects of word error rate in the DARPA Communicator data during 2000 and 2001.
Gregory A. Sanders, Audrey N. Le, John S. Garofolo
2002Efficient additive and convolutional noise reduction procedures.
Bojan Kotnik, Damjan Vlaj, Zdravko Kacic, Bogomir Horvat
2002Efficient and scalable methods for text script generation in corpus-based TTS design.
Chih-Chung Kuo, Jing-Yi Huang
2002Efficient combination of type-in and wizard-of-oz tests in speech interface development process.
Saija-Maaria Lemmelä, Péter Pál Boda
2002Efficient construction of long-range language models using log-linear interpolation.
Edward W. D. Whittaker, Dietrich Klakow
2002Efficient precalculation of LM contexts for large vocabulary continuous speech recognition.
Javier Dieguez-Tirado, Antonio Cardenal López
2002Eigenvoices for HMM-based speech synthesis.
Kengo Shichiri, Atsushi Sawabe, Takayoshi Yoshimura, Keiichi Tokuda, Takashi Masuko, Takao Kobayashi, Tadashi Kitamura
2002Emotion recognition from textual input using an emotional semantic network.
Ze-Jing Chuang, Chung-Hsien Wu
2002Emotional space improves emotion recognition.
Raquel Tato, Rocío Santos, Ralf Kompe, José M. Pardo
2002English call system with functions of speech segmentation and pronunciation evaluation using speech recognition technology.
Yasuo Ariki, Jun Ogata
2002Enhanced histogram normalization in the acoustic feature space.
Sirko Molau, Florian Hilger, Daniel Keysers, Hermann Ney
2002Enhancement of single channel speech using perception-based wavelet transform.
Ching-Ta Lu, Hsiao-Chuan Wang
2002Entropy of energy operator as feature for large vocabulary Mandarin speaker independent speech recognition.
Fadhil H. T. Al-Dulaimy, Zuoying Wang
2002Error-tolerant spoken language understanding with confidence measuring.
Huei-Ming Wang, Yi-Chung Lin
2002Estimating syntactic structure from F0 contour and pause duration in Japanese speech.
Yasuo Horiuchi, Tomoko Ohsuga, Akira Ichikawa
2002Evaluation of SPLICE on the Aurora 2 and 3 tasks.
Jasha Droppo, Li Deng, Alex Acero
2002Evaluation of a noise adaptive speech recognition system on the Aurora 3 database.
Kaisheng Yao, Donglai Zhu, Satoshi Nakamura
2002Evaluation of a noise-robust DSR front-end on Aurora databases.
Duncan Macho, Laurent Mauuary, Bernhard Noé, Yan Ming Cheng, Douglas Ealey, Denis Jouvet, Holly Kelleher, David Pearce, Fabien Saadoun
2002Evaluation of a speech recognition / generation method based on HMM and straight.
Toshio Irino, Yasuhiro Minami, Tomohiro Nakatani, Minoru Tsuzaki, H. Tagawa
2002Evaluation of a system for concatenative articulatory visual speech synthesis.
Olov Engwall
2002Evaluation of cross-language voice conversion using bilingual and non-bilingual databases.
Mikiko Mashimo, Tomoki Toda, Hiromichi Kawanami, Hideki Kashioka, Kiyohiro Shikano, Nick Campbell
2002Evaluation of formant-like features for ASR.
Katrin Weber, Febe de Wet, Bert Cranen, Lou Boves, Samy Bengio, Hervé Bourlard
2002Evaluation of noise robust features on the Aurora databases.
Xiaodong Cui, Markus Iseli, Qifeng Zhu, Abeer Alwan
2002Evaluation of noisy speech recognition based on noise reduction and acoustic model adaptation on the Aurora2 tasks.
Masakiyo Fujimoto, Yasuo Ariki
2002Evaluation of spectral subtraction with smoothing of time direction on the Aurora 2 task.
Norihide Kitaoka, Seiichi Nakagawa
2002Evaluation of the method to detect Japanese local speech rate deceleration applying the variable threshold with a constant term.
Keiichi Takamaru, Makoto Hiroshige, Kenji Araki, Koji Tochinai
2002Evidence for efficiency in vowel production.
R. J. J. H. van Son, Louis C. W. Pols
2002Expanded examinations of a low frequency modulation feature for speech/music discrimination.
Stefan Karnebäck
2002Experiments in confidence scoring for word and sentence verification.
Marco Andorno, Pietro Laface, Roberto Gemello
2002Experiments on recognition of lavalier microphone speech and whispered speech in real world environments.
Kiyoshi Tatara, Taisuke Ito, Parham Zolfaghari, Kazuya Takeda, Fumitada Itakura
2002Experiments on speaker-independent voice command recognition using in-vehicle hands free speech.
Yifan Gong, Lorin Netsch
2002Exploiting support vector machines in hidden Markov models for speaker verification.
Dong Xin, Zhaohui Wu, Yingchun Yang
2002Exploiting variances in robust feature extraction based on a parametric model of speech distortion.
Li Deng, Jasha Droppo, Alex Acero
2002Exploring sub-word features and linear support vector machines for German spoken document classification.
Martha A. Larson, Stefan Eickeler, Gerhard Paaß, Edda Leopold, Jörg Kindermann
2002Expressive speech synthesis using a concatenative synthesizer.
Murtaza Bulut, Shrikanth S. Narayanan, Ann K. Syrdal
2002Extracting clauses for spoken language understanding in conversational systems.
Narendra K. Gupta, Srinivas Bangalore, Mazin G. Rahim
2002Extraction of important sentences using F0 information for speech summarization.
Yoichi Yamashita, Akira Inoue
2002Eye-fixation as a measure of real-time processing of synthesized words.
Mary D. Swift, Ellen Campana, James F. Allen, Michael K. Tanenhaus
2002Eyebrow movements and voice variations in dialogue situations: an experimental investigation.
Christian Cavé, Isabelle Guaïtella, Serge Santi
2002F0 generation for speech synthesis using a multi-tier approach.
Xuejing Sun
2002FORM: an extensible, kinematically-based gesture annotation scheme.
Craig Martell
2002FPGA hardware for speech recognition using hidden Markov models.
José Luis Gómez-Cipriano, Roger Pizzatto Nunes, Dante A. C. Barone
2002Factor analyzed Gaussian mixture models for speaker identification.
Peng Ding, Yang Liu, Bo Xu
2002Factors in human language identification.
Ian Maddieson, Ioana Vasilescu
2002Fast hierarchical grammar optimization algorithm toward time and space efficiency.
Jing Zheng, Horacio Franco
2002Feature extraction combining spectral noise reduction and cepstral histogram equalization for robust ASR.
José C. Segura, M. Carmen Benítez, Ángel de la Torre, Antonio J. Rubio
2002Feature extraction for unit selection in concatenative speech synthesis: comparison between AIM, LPC, and MFCC.
Minoru Tsuzaki, Hisashi Kawai
2002Feed the tiger: a method for evoking reliable jaw stretch reflexes in children.
Donald S. Finan, Anne Smith, Michael Ho
2002Feedback in computer assisted pronunciation training: technology push or demand pull?
Ambra Neri, Catia Cucchiarini, Helmer Strik
2002Filter bank subtraction for robust speech recognition.
Kazuo Onoe, Hiroyuki Segi, Takeshi Kobayakawa, Shoei Sato, Toru Imai, Akio Ando
2002Finite-state transducer based hungarian LVCSR with explicit modeling of phonological changes.
Máté Szarvas, Sadaoki Furui
2002Fixed-length segment coding of LSF parameters.
Evgeni Yakhnich, Yuval Bistritz
2002Flexible dialogue management in the talk'n'travel system.
David Stallard
2002Flexible multimodal human-machine interaction in mobile environments.
Dirk Bühler, Wolfgang Minker, Jochen Häußler, Sven Krüger
2002Floating-point adaptive multi-rate wideband speech codec.
Toni P. Nieminen
2002Formant model estimation and transformation for voice morphing.
Ching-Hsiang Ho, Dimitrios Rentzos, Saeed Vaseghi
2002Forms of introduction in map task dialogues: case of L2 Russian speakers.
Olga Goubanova
2002Framewise phone classification using support vector machines.
Jesper Salomon, Simon King, Miles Osborne
2002French nasal vowels: acoustic and articulatory properties.
Véronique Delvaux, Thierry Metens, Alain Soquet
2002Frequency band analysis for stress detection using a teager energy operator based feature.
Mandar A. Rahurkar, John H. L. Hansen, James Meyerhoff, George Saviolakis, Michael Koenig
2002Frequency dependence of vocal-tract length.
Takuya Niikawa, Takanori Ando, Masafumi Matsumura
2002From text to prosody without toBI.
Volker Strom
2002Frontend post-processing and backend model enhancement on the Aurora 2.0/3.0 databases.
Chia-Ping Chen, Karim Filali, Jeff A. Bilmes
2002Full-text story alignment models for Chinese-English bilingual news corpora.
Bing Zhao, Stephan Vogel
2002Functional modeling of face movements during speech.
Shinji Maeda, Martine Toda, Andreas J. Carlen, Lyes Meftahi
2002Generalization of state-observation-dependency in partly hidden Markov models.
Tetsuji Ogawa, Tetsunori Kobayashi
2002Generating script using statistical information of the context variation unit vector.
Haiping Li, Fangxin Chen, Liqin Shen
2002German broadcast news transcription.
Robert Hecht, Jürgen Riedler, Gerhard Backfried
2002Gestural spatialization in natural discourse segmentation.
Francis K. H. Quek, David McNeill, Robert K. Bryll, Mary P. Harper
2002Gestural trajectory symmetries and discourse segmentation.
Francis K. H. Quek, Yingen Xiong, David McNeill
2002Globalphone: a multilingual speech and text database developed at karlsruhe university.
Tanja Schultz
2002Goal-directed ASR in a multimedia indexing and searching environment (MUMIS).
Mirjam Wester, Judith M. Kessens, Helmer Strik
2002Grammar specialisation meets language modelling.
Manny Rayner, Beth Ann Hockey, John Dowding
2002Grapheme-to-phoneme conversion using pseudo-morphological units.
Ulla Uebler
2002HMM composition-based rapid model adaptation using a priori noise GMM adaptation evaluation on Aurora2 corpus.
Masaki Ida, Satoshi Nakamura
2002HMM-based methods for channel error mitigation in distributed speech recognition.
Antonio M. Peinado, Victoria E. Sánchez, José L. Pérez-Córdoba, José C. Segura, Antonio J. Rubio
2002Hearing-aid benefits and limitations: predictions from a cochlear model.
James M. Kates
2002Hierarchical Gaussian mixture model for speaker verification.
Ming Liu, Eric Chang, Bei-qian Dai
2002High performance digit recognition in real car environments.
Umit H. Yapanel, Xianxian Zhang, John H. L. Hansen
2002Highly oversampled subband adaptive filters for noise cancellation on a low-resource DSP system.
King Tam, Hamid Sheikhzadeh, Todd Schneider
2002Holds as gestural correlates to empty and filled speech pauses.
Anna Esposito, Susan Duncan, Francis K. H. Quek
2002How speakers with and without speech impairment mark the question statement contrast.
Rupal Patel
2002Hypophonia in parkinson disease: neural correlates of voice treatment with LSVT revealed by PET.
Mario Liotti, Lorraine O. Ramig, Deanie Vogel, Pamela New, Chris Cook, Peter Fox
2002ISIS: a multi-modal, trilingual, distributed spoken dialog system developed with CORBA, java, XML and KQML.
Helen M. Meng, P. C. Ching, Yee Fong Wong, Cheong Chat Chan
2002Implementation of an intonational quality assessment system.
Chanwoo Kim, Wonyong Sung
2002Implementation testing of a hybrid symbolic/statistical multimodal architecture.
Edward C. Kaiser, Philip R. Cohen
2002Implementing vocal tract length normalization in the MLLR framework.
Guo-Hong Ding, Yi-Fei Zhu, Chengrong Li, Bo Xu
2002Improve latent semantic analysis based language model by integrating multiple level knowledge.
Rong Zhang, Alexander I. Rudnicky
2002Improved Chinese spoken document retrieval with hybrid modeling and data-driven indexing features.
Chun-Jen Wang, Berlin Chen, Lin-Shan Lee
2002Improved corpus-based synthesis of fundamental frequency contours using generation process model.
Keikichi Hirose, Masaya Eto, Nobuaki Minematsu
2002Improved katz smoothing for language modeling in speech recognition.
Genqing Wu, Fang Zheng, Wenhu Wu, Mingxing Xu, Ling Jin
2002Improved performance speech codec for mobile communications.
K. Humphreys, Robert Lawlor
2002Improved phone recognition on TIMIT using formant frequency data and confidence measures.
N. J. Wilkinson, Martin J. Russell
2002Improved structural maximum likelihood eigenspace mapping for rapid speaker adaptation.
Bowen Zhou, John H. L. Hansen
2002Improvement of the ELS-based time-varying complex speech analysis.
Keiichi Funaki
2002Improvements to the IBM Aurora 2 multi-condition system.
George Saon, Juan M. Huerta
2002Improving latent semantic indexing based classifier with information gain.
Li Li, Wu Chou
2002Improving parametric trajectory modeling by integration of pitch and tone information.
Yiyan Zhang, Wenju Liu, Bo Xu, Huayun Zhang
2002Improving performance of an HMM-based ASR system by using monophone-level normalized confidence measure.
Muhammad Ghulam, Takashi Fukuda, Takaharu Sato, Tsuneo Nitta
2002Improving phone-level discrimination in LDA with subphone-level classes.
Hwa Jeon Song, Hyung Soon Kim
2002Improving speech recognition performance of small microphone arrays using missing data techniques.
Iain McCowan, Andrew C. Morris, Hervé Bourlard
2002Improving spoken language understanding using word confusion networks.
Gökhan Tür, Jerry H. Wright, Allen L. Gorin, Giuseppe Riccardi, Dilek Hakkani-Tür
2002Improving statistical machine translation for a speech-to-speech translation task.
Stephan Vogel, Alicia Tribble
2002Improving the role of unvoiced speech segments by spectral normalisation in robust speech recognition.
Carlos M. G. S. Lima, Luís B. Almeida, João L. Monteiro
2002Improving word accuracy with Gabor feature extraction.
Michael Kleinschmidt, David Gelbart
2002Incremental on-line feature space MLLR adaptation for telephony speech recognition.
Yongxin Li, Hakan Erdogan, Yuqing Gao, Etienne Marcheret
2002Individual word language models and the frequency approach.
Elvira I. Sicilia-Garcia, Ji Ming, Francis Jack Smith
2002Influence of different dialogue situations on user's behavior in spoken corrections.
Atsuhiko Kai, Yukari Nonomura, Toshihiko Itoh, Tatsuhiro Konishi, Yukihiro Itoh
2002Influence of prosody, context, and word order in the identification of focus in Japanese dialogue.
Tatsuya Kitamura, Kayo Itoh, Toshihiko Itoh, Shigeyoshi Kitazawa
2002Influence of transmission errors on ASR systems.
Carmen Peláez-Moreno, Ascensión Gallardo-Antolín, Jesús Vicente-Peña, Fernando Díaz-de-María
2002Information retrieval based on speech recognition results.
Masatoshi Watanabe, Masahide Sugiyama
2002Information-theoretic criteria for unit selection synthesis.
Jon R. W. Yi, James R. Glass
2002Ingressive speech as an indication that humans are talking to humans (and not to machines).
Robert Eklund
2002Integrating multiple pronunciations during MCE-based acoustic model training for large vocabulary speech recognition.
Rathi Chengalvarayan
2002Integrating speech with keypad input for automatic entry of spelling and pronunciation of new words.
Grace Chung, Stephanie Seneff
2002Integration of MLLR adaptation with pronunciation proficiency adaptation for non-native speech recognition.
Nobuaki Minematsu, Gakuto Kurata, Keikichi Hirose
2002Integration of phonetic length properties in the acoustic models of false starts and out-of-vocabulary words.
H. Hamimed, Géraldine Damnati
2002Integration of supra-lexical linguistic models with speech recognition using shallow parsing and finite state transducers.
Xiaolong Mou, Stephanie Seneff, Victor Zue
2002Integration of two stochastic context-free grammars.
Anna Corazza
2002Intelligibility of reverse speech in French: a perceptual study.
Ivan Magrin-Chagnolleau, Melissa Barkat, Fanny Meunier
2002Interaction of voice over internet protocol speech coders and disordered speech samples.
Vijay Parsa, Donald G. Jamieson
2002Interlingua based statistical machine translation.
Manuel Kauers, Stephan Vogel, Christian Fügen, Alex Waibel
2002Interpreting meaning from context: modeling the prosody of discourse markers in speech.
Li-chiung Yang
2002Intonation modelling for the synthesis of structured documents.
Jeska Buhmann, Jean-Pierre Martens, Lieve Macken, Bert Van Coile
2002Intonational and visual cues in the perception of interrogative mode in Swedish.
David House
2002Intrasyllabic articulatory control constraints in verbal working memory.
Marc Sato, Jean-Luc Schwartz, Marie-Agnès Cathiard, Christian Abry, Hélène Loevenbruck
2002Intrinsic phone durations are speaker-specific.
Hartmut R. Pfitzinger
2002Introduction of constraints in an acoustic-to-articulatory inversion method based on a hypercubic articulatory table.
Yves Laprie, Slim Ouni
2002Investigation of coarticulation based on electromagnetic articulographic data.
Jianwu Dang, Masaaki Honda, Kiyoshi Honda
2002Investigations on joint-multigram models for grapheme-to-phoneme conversion.
Maximilian Bisani, Hermann Ney
2002Is the speaker done yet? faster and more accurate end-of-utterance detection using prosody.
Luciana Ferrer, Elizabeth Shriberg, Andreas Stolcke
2002Issues in automatic transcription of historical audio data.
Fabio Brugnara, Mauro Cettolo, Marcello Federico, Diego Giuliani
2002Issues in the development of a stochastic speech understanding system.
Fabrice Lefèvre, Hélène Bonneau-Maynard
2002Japanese broadcast news transcription.
Long Nguyen, Xuefeng Guo, Richard M. Schwartz, John Makhoul
2002Juncture segmentation of Japanese prosodic unit based on the spectrographic features.
Shigeyoshi Kitazawa, Toshihiko Itoh, Tatsuya Kitamura
2002Kymographic imaging of the vocal fold oscillations.
Jan G. Svec, Frantisek Sram
2002LU factorization for feature transformation.
Patrick Nguyen, Luca Rigazio, Christian Wellekens, Jean-Claude Junqua
2002Large vocabulary conversational speech recognition with the extended maximum likelihood linear transformation (EMLLT) model.
Jing Huang, Vaibhava Goel, Ramesh Gopinath, Brian Kingsbury, Peder A. Olsen, Karthik Visweswariah
2002Laryngoscopic analysis of tibetan chanting modes and their relationship to register in sino-tibetan.
John H. Esling
2002Learning decision trees to determine turn-taking by spoken dialogue systems.
Ryo Sato, Ryuichiro Higashinaka, Masafumi Tamoto, Mikio Nakano, Kiyoaki Aikawa
2002Learning syllable duration and intonation of Mandarin Chinese.
Oliver Jokisch, Hongwei Ding, Hans Kruschke, Guntram Strecha
2002Likelihood combination and recognition output voting for the decoding of non-native speech with multilingual HMMs.
Volker Fischer, Eric Janke, Siegfried Kunzmann
2002Linguistic and acoustic changes of user's utterances caused by different dialogue situations.
Toshihiko Itoh, Atsuhiko Kai, Tatsuhiro Konishi, Yukihiro Itoh
2002Lip gestures in English sibilants: articulatory - acoustic relationship.
Martine Toda, Shinji Maeda, Andreas J. Carlen, Lyes Meftahi
2002Lip-reading based on a fully automatic statistical model.
Philippe Daubias, Paul Deléglise
2002Low complexity Mandarin speaker-independent isolated word recognition.
Xia Wang, Juha Iso-Sipilä
2002Low complexity techniques for embedded ASR systems.
Imre Kiss, Marcel Vasilache
2002Low cost duration modelling for noise robust speech recognition.
Andrew C. Morris, Simon Payne, Hervé Bourlard
2002Low-resource noise-robust feature post-processing on Aurora 2.0.
Chia-Ping Chen, Jeff A. Bilmes, Katrin Kirchhoff
2002Markov models based on speaker space model evolution.
Dong Kook Kim, Nam Soo Kim
2002Maximum entropy model for punctuation annotation from speech.
Jing Huang, Geoffrey Zweig
2002Maximum expected likelihood based model selection and adaptation for nonnative English speakers.
Xiaodong He, Yunxin Zhao
2002Maximum likelihood estimation of eigenvoices and residual variances for large vocabulary speech recognition tasks.
Patrick Kenny, Gilles Boulianne, Pierre Dumouchel
2002Maximum mutual information training of hidden Markov models with vector linear predictors.
K. K. Chin, Philip C. Woodland
2002Medium vocabulary continuous audio-visual speech recognition.
Pascal Wiggers, Jacek C. Wojdel, Léon J. M. Rothkrantz
2002Mel-scaled wavelet filter based features for noisy unvoiced phoneme recognition.
Omar Farooq, Sekharjit Datta
2002Memory space reduction for hidden Markov models in low-resource speech recognition systems.
Sergey Astrov
2002Methods to improve Gaussian mixture model based language identification system.
Eddie Wong, Sridha Sridharan
2002Minimum perfect hashing for fast n-gram language model lookup.
Xiao Zhang, Yunxin Zhao
2002Model partial pronunciation variations for spontaneous Mandarin speech recognition.
Yi Liu, Pascale Fung
2002Model-based independent component analysis for robust multi-microphone automatic speech recognition.
Laurent Couvreur, Christophe Ris
2002Model-based predictions of intensity discrimination for normal- and impaired-hearing listeners.
Lisa G. Huettel, Leslie M. Collins
2002Modeling HMM state distributions with Bayesian networks.
Konstantin Markov, Satoshi Nakamura
2002Modeling and automatic detection of English sentence stress for computer-assisted English prosody learning system.
Kazunori Imoto, Yasushi Tsubota, Antoine Raux, Tatsuya Kawahara, Masatake Dantsuji
2002Modeling articulatory dynamics in autoregressive linear system.
Kiyoshi Hashimoto
2002Modeling durational variability in reading aloud a connected text.
Caroline L. Smith
2002Modeling frequent allophones in Japanese speech recognition.
Long Nguyen, Xuefeng Guo, John Makhoul
2002Modeling recognition of speech sounds with minerva2.
Travis Wade, Deborah K. Eakin, Russell Webb, Arvin Agah, Frank Brown, Allard Jongman, John Gauch, Thomas A. Schreiber, Joan A. Sereno
2002Modeling the perception of frequency-shifted vowels.
Peter F. Assmann, Terrance M. Nearey, Jack M. Scott
2002Modeling tones in continuous Cantonese speech.
Tan Lee, Greg Kochanski, Chilin Shih, Yujia Li
2002Modeling varying pauses to develop robust acoustic models for recognizing noisy conversational speech.
Jinsong Zhang, Satoshi Nakamura
2002Modeling with a subspace constraint on inverse covariance matrices.
Scott Axelrod, Ramesh Gopinath, Peder A. Olsen
2002Models of speech dynamics in a segmental-HMM recognizer using intermediate linear representations.
Philip J. B. Jackson, Martin J. Russell
2002Motor specifications of a baby robot via the analysis of infants' vocalizations.
Jihène Serkhane, Jean-Luc Schwartz, Louis-Jean Boë, Barbara L. Davis, Christine L. Matyear
2002Multi-dimensional analysis of sonority: perception, acoustics, and phonology.
Masahiko Komatsu, Shinichi Tokuma, Won Tokuma, Takayuki Arai
2002Multi-scale and multi-model integration for improved performance in Chinese spoken document retrieval.
Wai Kit Lo, Helen M. Meng, P. C. Ching
2002Multilingual pronunciation modeling for improving multilingual speech recognition.
Jilei Tian, Juha Häkkinen, Olli Viikki
2002Multilingual speech recognition with language identification.
Bin Ma, Cuntai Guan, Haizhou Li, Chin-Hui Lee
2002Multimodal integration patterns in children.
Benfang Xiao, Cynthia Girand, Sharon L. Oviatt
2002Multimodal language processing for mobile information access.
Michael Johnston, Srinivas Bangalore, Amanda Stent, Gunaranjan Vasireddy, Patrick Ehlen
2002Multiparty multimodal interaction: a preliminary analysis.
Philip R. Cohen, Rachel Coulston, Kelly Krout
2002Multiple regression of log-spectra for in-car speech recognition.
Tetsuya Shinde, Kazuya Takeda, Fumitada Itakura
2002Mutual information phone clustering for decision tree induction.
Ciprian Chelba, Rachel Morton
2002N-word-sequence frequency noise mitigation for SLM based on binomial distribution.
Yibao Zhao, Guojun Zhou
2002Named entity extraction from spontaneous speech in how may i help you?
Frédéric Béchet, Allen L. Gorin, Jerry H. Wright, Dilek Hakkani-Tür
2002Native and vietnamese production of compound and phrasal stress patterns.
Thu Nguyen, John Ingram
2002Network-based vs. distributed speech recognition in adaptive multi-rate wireless systems.
Tim Fingscheidt, Stefanie Aalburg, Sorel Stan, Christophe Beaugeant
2002Neurocognitive basis for audiovisual speech perception: evidence from event-related potentials.
Curtis W. Ponton, Edward T. Auer, Lynne E. Bernstein
2002New model for speech residual signal shaping with static nonlinearity.
Jari Juhani Turunen, Juha T. Tanttu, Pekka Loula
2002Noise adaptive speech recognition with acoustic models trained from noisy speech evaluated on Aurora-2 database.
Kaisheng Yao, Kuldip K. Paliwal, Satoshi Nakamura
2002Noise estimation for efficient speech enhancement and robust speech recognition.
Petr Motlícek, Lukás Burget
2002Noise from corrupted speech log mel-spectral energies.
Jasha Droppo, Alex Acero, Li Deng
2002Noise robust speech recognition using F0 contour extracted by hough transform.
Koji Iwano, Takahiro Seki, Sadaoki Furui
2002Noise-robust speech recognition in car environments using genetic algorithms and a mel-cepstral subspace approach.
Sid-Ahmed Selouani, Douglas D. O'Shaughnessy
2002Non-linear techniques for dysphonic voice analysis and correction.
Claudia Manfredi, Lorenzo Matassini
2002Objective distance measures for spectral discontinuities in concatenative speech synthesis.
Jithendra Vepa, Simon King, Paul Taylor
2002On F0 trajectory optimization for very high-quality speech manipulation.
Hideki Kawahara, Parham Zolfaghari, Alain de Cheveigné
2002On developing new text and audio corpora and speech recognition tools for the turkish language.
Özgül Salor, Bryan L. Pellom, Tolga Çiloglu, Kadri Hacioglu, Mübeccel Demirekler
2002On effective speaker verification based on subword model.
Sungjoo Ahn, Sunmee Kang, Hanseok Ko
2002On improving the performance of analysis-by-synthesis coding using a multi-magnitude algebraic code-book excitation signal.
Omar Halmi, Hesham Tolba, Driss Guerchi, Douglas D. O'Shaughnessy
2002On text-based language identification for multilingual speech recognition systems.
Jilei Tian, Juha Häkkinen, Søren Riis, Kåre Jean Jensen
2002On the estimation of signal-to-noise ratio in continuous speech for abnormal voices.
Vijay Parsa, Donald G. Jamieson, Karen Stenning, Herbert A. Leeper
2002On the function of the late rise and the early fall in dutch dialogue: a perception experiment.
Johanneke Caspers
2002On the relevance of bandwidth extension for speaker verification.
Marcos Faúndez-Zanuy, Mattias Nilsson, W. Bastiaan Kleijn
2002On the role of the "schwa" in the perception of plosive consonants.
René Carré, Jean-Sylvain Liénard, Egidio Marsico, Willy Serniclaes
2002On the use of Gaussian mixture model for speaker variability analysis.
Tao Chen, Chao Huang, Eric Chang, Jingchun Wang
2002On the use of structures in language models for dialogue.
Renato De Mori, Yannick Estève, Christian Raymond
2002On use of duration modeling for continuous digits speech recognition.
Rong Dong, Jie Zhu
2002Operations for context-based multimodal interpretation in conversational systems.
Joyce Yue Chai
2002Optimal selection of speech data for automatic speech recognition systems.
Arkadiusz Nagórski, Lou Boves, Herman J. M. Steeneken
2002Optimal speech signal partition into one-quasiperiodical segments.
Taras K. Vintsiuk
2002Optimization of hidden Markov models for embedded systems.
Klaus Reinhard, Jochen Junkawitsch, Andreas Kießling, Stefan Dobler
2002Oral-laryngeal control patterns for fricatives in 5-year-olds and adults.
Laura L. Koenig, Jorge C. Lucero
2002Orientel: speech-based interactive communication applications for the mediterranean and the middle east.
Imed Zitouni, Joseph P. Olive, Dorota J. Iskra, Khalid Choukri, Ossama Emam, Oren Gedge, Emmanuel Maragoudakis, Herbert S. Tropf, Asunción Moreno, Albino Nogueiras Rodríguez, Barbara Heuft, Rainer Siemund
2002Oro-facial changes in parkinson's disease following intensive voice therapy (LSVT).
Jennifer L. Spielman, Lorraine O. Ramig, Joan C. Borod
2002Overview on recent activities in speech understanding and dialogue systems evaluation.
Wolfgang Minker
2002Parametric trajectory segment model for LVCSR.
Lei Jia, Bo Xu
2002Part-of-speech tagging in French text-to-speech synthesis: experiments in tagset selection.
Hongyan Jing, Evelyne Tzoukermann
2002Pause duration and variability in read texts.
Elena Zvonik, Fred Cummins
2002Perceived boundary strength.
Petra Hansson
2002Perception and integration of audiovisual speech in human infants.
David J. Lewkowicz
2002Perception of prosodic phrasing by hearing-impaired listeners.
Dragana Barac-Cikoja, Sally Revoile
2002Perception of tone and vowel quantity in Thai.
Hansjörg Mixdorff, Sudaporn Luksaneeyanawin, Hiroya Fujisaki, Patavee Charnvivit
2002Perceptual adjustment to foreign-accented English with short term exposure.
Constance M. Clarke
2002Perceptual effects of assimilation-induced violation of final devoicing in dutch.
Cecile T. L. Kuijpers, Wilma van Donselaar, Anne Cutler
2002Perceptual evaluation of audiovisual cues for prominence.
Emiel Krahmer, Zsófia Ruttkay, Marc Swerts, Wieger Wesselink
2002Perceptual evaluation of naturalness due to substitution of Chinese syllable for concatenative speech synthesis.
Jinlin Lu, Hisashi Kawai
2002Perceptual learning of second-language syllable rhythm by elderly listeners.
Keiichi Tajima, Reiko Akahane-Yamada, Tsuneo Yamada
2002Performance of discriminatively trained auditory features on Aurora2 and Aurora3.
Brian Kan-Wing Mak, Yik-Cheung Tam
2002Perpetually optimizing the cost function for unit selection in a TTS system with one single run of MOS evaluation.
Hu Peng, Yong Zhao, Min Chu
2002Phonetic normalization using z-score in segmental prosody estimation for corpus-based TTS system.
Hoeun Song, Jaein Kim, Kyongrok Lee, Jinyoung Kim
2002Phonetic speaker identification.
Qin Jin, Tanja Schultz, Alex Waibel
2002Phonological norms in faroese speech synthesis.
Pétur Helgason, Sjúrður Gullbein
2002Pitch accent prediction using ensemble machine learning.
Xuejing Sun
2002Pitch contour model for Chinese text-to-speech using CART and statistical model.
Minghui Dong, Kim-Teng Lua
2002Pitch extraction of speech signals using an eigen-based subspace method.
Takahiro Murakami, Munehiro Namba, Tetsuya Hoya, Yoshihisa Ishida
2002Porting channel robustness across languages.
Françoise Beaufays, Daniel Boies, Mitch Weintraub
2002Power spectral density based channel equalization of large speech database for concatenative TTS system.
Yu Shi, Eric Chang, Hu Peng, Min Chu
2002Preaspirated stops in southern Swedish.
Mechtild Tronnier
2002Predicting oral reading miscues.
Jack Mostow, Joseph Beck, S. Vanessa Winter, Shaojun Wang, Brian Tobin
2002Preliminary data on effects of behavioral and levodopa therapies on speech-accompanying gesture in parkinson's disease.
Susan Duncan
2002Probabilistic ranking of constraints.
Louis ten Bosch
2002Probabilistic retrieval based on document representations.
Wolfgang Macherey, Hans Jörg Viechtbauer, Hermann Ney
2002Processing of temporal cues marking phrasal boundaries in individuals with brain damage.
Wendi A. Aasland, Shari R. Baum
2002Production and perception of pauses and their linguistic context in read and spontaneous speech in Swedish.
Beáta Megyesi, Sofia Gustafson-Capková
2002Production based pitch modification of voiced speech.
Yinglong Jiang, Peter Murphy
2002Progress with the philips continuous ASR system on the Aurora 2 noisy digits database.
Markus Lieb, Alexander Fischer
2002Pronunciation of proper names with a joint n-gram model for bi-directional grapheme-to-phoneme conversion.
Lucian Galescu, James F. Allen
2002Prosodic parameter for speaker identification.
Katarina Bartkova, David Le Gac, Delphine Charlet, Denis Jouvet
2002Prosodic phrasing with inductive learning.
Sheng Zhao, Jianhua Tao, Lianhong Cai
2002Prosody-based automatic detection of annoyance and frustration in human-computer dialog.
Jeremy Ang, Rajdip Dhillon, Ashley Krupski, Elizabeth Shriberg, Andreas Stolcke
2002Qualcomm-ICSI-OGI features for ASR.
André Gustavo Adami, Lukás Burget, Stéphane Dupont, Harinath Garudadri, Frantisek Grézl, Hynek Hermansky, Pratibha Jain, Sachin S. Kajarekar, Nelson Morgan, Sunil Sivadas
2002Quantile based histogram equalization for online applications.
Florian Hilger, Sirko Molau, Hermann Ney
2002Quantitative evaluation of relevant prosodic factors for text-to-speech synthesis in Spanish.
David Escudero Mancebo, César González Ferreras, Valentín Cardeñoso-Payo
2002RUSLANA: a database of Russian emotional utterances.
Veronika Makarova, Valery A. Petrushin
2002Radiodoc: a voice-accessible document system.
Takuya Nishimoto, Masahiro Araki, Yasuhisa Niimi
2002Rapid development of speech-to-speech translation systems.
Alan W. Black, Ralf D. Brown, Robert E. Frederking, Kevin A. Lenzo, John Moody, Alexander I. Rudnicky, Rita Singh, Eric Steinbrecher
2002Rapid speaker adaptation using speaker clustering.
Ernest Pusateri, Timothy J. Hazen
2002Real-time rich-content transcription of Chinese broadcast news.
Daben Liu, Jeffrey Ma, Dongxin Xu, Amit Srivastava, Francis Kubala
2002Real-time sound source localization and separation for robot audition.
Kazuhiro Nakadai, Hiroshi G. Okuno, Hiroaki Kitano
2002Recognition and verification of English by Japanese students for computer-assisted language learning system.
Yasushi Tsubota, Tatsuya Kawahara, Masatake Dantsuji
2002Recognition error processing for speech understanding.
Caroline Bousquet-Vernhettes, Nadine Vigouroux
2002Recognition of continuous speech segments of monophone units using support vector machines.
Weifeng Lee, C. Chandra Sekhar, Kazuya Takeda, Fumitada Itakura
2002Recognition of noisy speech using normalized moments.
Jingdong Chen, Yiteng Huang, Qi Li, Frank K. Soong
2002Recurrent neural network-enhanced HMM speech recognition systems.
Jan W. F. Thirion, Elizabeth C. Botha
2002Reducing pronunciation lexicon confusion and using more data without phonetic transcription for pronunciation modeling.
Fang Zheng, Zhanjiang Song, Pascale Fung, William Byrne
2002Reducing the footprint of the IBM trainable speech synthesis system.
Dan Chazan, Ron Hoory, Zvi Kons, Dorel Silberstein, Alexander Sorin
2002Reference resolution by human partners in a natural interactive problem-solving task.
Ellen Campana, Sarah Brown-Schmidt, Michael K. Tanenhaus
2002Refined speech segmentation for concatenative speech synthesis.
Abhinav Sethy, Shrikanth S. Narayanan
2002Refocussing on the text normalisation process in text-to-speech systems.
Andrew P. Breen, Barry Eggleton, Peter Dion, Steve Minnis
2002Reliability measures for translation quality.
Eiichiro Sumita, Yasuhiro Akiba, Kenji Imamura
2002Rethinking derived acoustic features in speech recognition.
Kevin S. Van Horn
2002Retrieving phrases by selecting the history: application to automatic speech recognition.
David Langlois, Kamel Smaïli, Jean Paul Haton
2002Risk based lattice cutting for segmental minimum Bayes-risk decoding.
Shankar Kumar, William Byrne
2002Robust HMM training for unified Dutch and German speech recognition.
Rathi Chengalvarayan
2002Robust MMSE-FW-LAASR scheme at low SNRs.
Tao Xu, Zhigang Cao
2002Robust feature extraction in a variety of input devices on the basis of ETSI standard DSR front-end.
Satoru Tsuge, Shingo Kuroiwa, Masami Shishibori, Fuji Ren, Kenji Kita
2002Robust fundamental frequency estimation against background noise and spectral distortion.
Tomohiro Nakatani, Toshio Irino
2002Robust multiple resolution analysis for automatic speech recognition.
Roberto Gemello, Franco Mana, Paolo Pegoraro, Renato De Mori
2002Robust semantic confidence scoring.
Didier Guillevic, Simona Gandrabur, Yves Normandin
2002Robust speech / music classification in audio documents.
Julien Pinquier, Jean-Luc Rouas, Régine André-Obrecht
2002Robust speech recognition against short-time noise.
Man-Hung Siu, Yu-Chung Chan
2002Robust speech recognition using a voiced-unvoiced feature.
András Zolnay, Ralf Schlüter, Hermann Ney
2002Robust speech recognition using inter-speaker and intra-speaker adaptation.
Baojie Li, Keikichi Hirose, Nobuaki Minematsu
2002Robust time-synchronous environmental adaptation for continuous speech recognition systems.
Thomas Plötz, Gernot A. Fink
2002Robust voiced-unvoiced decision associated to continuous pitch tracking in noisy telephone speech.
Mijail Arcienega, Andrzej Drygajlo
2002Run time information fusion in speech recognition.
Chengyi Zheng, Yonghong Yan
2002SALT: a spoken language interface for web-based multimodal dialog systems.
Kuansan Wang
2002SPIN: language understanding for spoken dialogue systems using a production system approach.
Ralf Engel
2002SRILM - an extensible language modeling toolkit.
Andreas Stolcke
2002Same talker, different language: a replication.
Verna Stockmal, Zinny S. Bond
2002Seeing tongue movements from outside.
Gérard Bailly, Pierre Badin
2002Segment duration in spoken Korean.
Hyunsong Chung
2002Segmentation of glides with tonal alignment as reference.
Yi Xu, Fang Liu
2002Selective back-off smoothing for incorporating grammatical constraints into the n-gram language model.
Tomoyosi Akiba, Katunobu Itou, Atsushi Fujii, Tetsuya Ishikawa
2002Selective multi-path acoustic model based on database likelihoods.
Akinobu Lee, Yuichiro Mera, Hiroshi Saruwatari, Kiyohiro Shikano
2002Semantic inference: a data-driven solution for NL interaction.
Jerome R. Bellegarda
2002Semantic structured language models.
Hakan Erdogan, Ruhi Sarikaya, Yuqing Gao, Michael Picheny
2002Separation of voiced source characteristics and vocal tract transfer function characteristics for speech sounds by iterative analysis based on AR-HMM model.
Nobuyuki Nishizawa, Keikichi Hirose, Nobuaki Minematsu
2002Sequential MAP noise estimation and a phase-sensitive model of the acoustic environment.
Li Deng, Jasha Droppo, Alex Acero
2002Serving complex user wishes with an enhanced spoken dialogue system.
Sunna Torge, Stefan Rapp, Ralf Kompe
2002Sharing relative stress of cross-word syllables and lexical stress to spontaneous speech recognition.
Farshad Almasganj, Farhad D. Dehnavi, Mahmood Bijankhan
2002Sharing trend information of trajectory in segmental-feature HMM.
Young-Sun Yun
2002Sign language translation using an error tolerant retrieval algorithm.
Chung-Hsien Wu, Yu-Hsien Chiu, Kung-Wei Cheng
2002Similarities of words in noise in Japanese.
Kiyoko Yoneyama
2002Sources of variability in the perceptual training of /r/ and /l/: interaction of adjacent vowel, word position, talkers' visual and acoustic cues.
Debra M. Hardison
2002Sparse and independent representations of speech signals based on parametric models.
Hugo Leonardo Rufiner, Luís F. Rocha, John Goddard Close
2002Speaker change detection using a new weighted distance measure.
Soonil Kwon, Shrikanth S. Narayanan
2002Speaker identification by location in an optimal space of anchor models.
Yassine Mami, Delphine Charlet
2002Speaker independent speech recognition using features based on glottal sound source.
Norihide Kitaoka, Daisuke Yamada, Seiichi Nakagawa
2002Speaker intelligibility of adults and children.
D. Markham, Valérie Hazan
2002Speaker recognition using discriminative features selection.
Bogdan Sabac
2002Speaker recognizability evaluation of a voicefont-based text-to-speech system.
Masaharu Sakamoto, Takashi Saito
2002Speaker utterances tying among speaker segmented audio documents using hierarchical classification: towards speaker indexing of audio databases.
Sylvain Meignier, Jean-François Bonastre, Ivan Magrin-Chagnolleau
2002Speaker verification using Gaussian component strings in dynamic trajectory space.
Bing Xiang
2002Speaker verification with data fusion and model adaptation.
Kevin R. Farrell
2002Speaking rate compensation based on likelihood criterion in acoustic model training and decoding.
Kozo Okuda, Tatsuya Kawahara, Satoshi Nakamura
2002Special session: issues in audiovisual spoken language processing (when, where, and how?).
Lynne E. Bernstein, Denis Burnham, Jean-Luc Schwartz
2002Specification and realisation of multimodal output in dialogue systems.
Jonas Beskow, Jens Edlund, Magnus Nordstrand
2002Spectral enhancement preprocessing for the HNM coding of noisy speech.
Gautam Moharir, Pushkar Patwardhan, Preeti Rao
2002Spectral subtraction in noisy environments applied to speaker adaptation based on HMM sufficient statistics.
Shingo Yamade, Kanako Matsunami, Akira Baba, Akinobu Lee, Hiroshi Saruwatari, Kiyohiro Shikano
2002Speech and language processing for a constrained speech translation system.
Stephen Cox
2002Speech coding and transmission for improved automatic recognition.
Xin Zhong, Jon A. Arrowood, Mark A. Clements
2002Speech completion: on-demand completion assistance using filled pauses for speech input interfaces.
Masataka Goto, Katunobu Itou, Satoru Hayamizu
2002Speech enhancement based on a perceptual modification of Wiener filtering.
Lee Lin, W. Harvey Holmes, Eliathamby Ambikairajah
2002Speech enhancement based on combining perceptual enhancement and short-time spectral attenuation.
Ilyas Potamitis, Nikos Fakotakis, George Kokkinakis
2002Speech enhancement based on generalized singular value decomposition approach.
Gwo-hwa Ju, Lin-Shan Lee
2002Speech enhancement in car environment using blind source separation.
Hiroshi Saruwatari, Katsuyuki Sawai, Akinobu Lee, Kiyohiro Shikano, Atsunobu Kaminuma, Masao Sakata
2002Speech enhancement in non-stationary noise environments.
Hyoung-Gook Kim, Dietmar Ruwisch
2002Speech enhancement using wavelet packet transform.
Sungwook Chang, Sung-il Jung, Younghun Kwon, Sung-Il Yang
2002Speech modeling using variational Bayesian mixture of Gaussians.
Panu Somervuo
2002Speech pauses and gestural holds in Parkinson's disease.
Francis K. H. Quek, Mary P. Harper, Yonca Haciahmetoglu, Lei Chen, Lorraine O. Ramig
2002Speech recognition for language teaching and evaluating: a study of existing commercial products.
Rebecca Hincks
2002Speech recognition performance comparison between DSR and AMR transcoded speech.
Holly Kelleher, David Pearce, Douglas Ealey, Laurent Mauuary
2002Speech recognition using combined acoustic and articulatory information with retraining of acoustic model parameters.
Ka-Yee Leung, Man-Hung Siu
2002Speech recognition using fundamental frequency and voicing in acoustic modeling.
Andrej Ljolje
2002Speech recognition using syllable patterns.
Li Zhang, William H. Edmondson
2002Speech recognition with a re-speak method for subtitling live broadcasts.
Toru Imai, Atsushi Matsui, Shinichi Homma, Takeshi Kobayakawa, Kazuo Onoe, Shoei Sato, Akio Ando
2002Speech reconstruction from mel-frequency cepstral coefficients using a source-filter model.
Ben Milner, Xu Shao
2002Speech synthesis, speech simulation and speech science.
Mark A. Huckvale
2002Speech to speech translation system for monologues-data driven approach.
Hideki Tanaka, Stephen Nightingale, Hideki Kashioka, Kenji Matsumoto, Masamchi Nishiwaki, Tadashi Kumano, Takehiko Maruyama
2002Speech watermarking through parametric modeling.
Aparna Gurijala, John R. Deller Jr., Michael S. Seadle, John H. L. Hansen
2002Speech, music and songs discrimination in the context of handsets variability.
Hassan Ezzaidi, Jean Rouat
2002Speech-enabled natural language call routing: BBN call director.
Premkumar Natarajan, Rohit Prasad, Bernhard Suhm, Daniel McCarthy
2002Speech-to-speech translation system evaluation: results for French for the NESPOLE! project first showcase.
Solange Rossato, Hervé Blanchon, Laurent Besacier
2002SpeechFind: an experimental on-line spoken document retrieval system for historical audio archives.
Bowen Zhou, John H. L. Hansen
2002Spoken dialogue system for home health care.
Shinya Takahashi, Tsuyoshi Morimoto, Sakashi Maeda, Naoyuki Tsuruta
2002State clustering improvements for continuous HMMs in a Spanish large vocabulary recognition system.
Ricardo de Córdoba, Javier Macías Guarasa, Javier Ferreiros, Juan Manuel Montero, José Manuel Pardo
2002Statistical adaptation of acoustic models to noise conditions for robust speech recognition.
Ángel de la Torre, Dominique Fohr, Jean Paul Haton
2002Statistical language modeling with prosodic boundaries and its use for continuous speech recognition.
Keikichi Hirose, Nobuaki Minematsu, Makoto Terao
2002Statistical machine translation decoder based on phrase.
Taro Watanabe, Eiichiro Sumita
2002Statistical natural language generation for speech-to-speech machine translation systems.
Bowen Zhou, Yuqing Gao, Jeffrey S. Sorensen, Zijian Diao, Michael Picheny
2002Statistically based approach to rejection of incorrectly recognized words.
Ludek Müller, Tomás Bartos
2002Stochastic suprasegmentals: relationship between the spectral characteristics of vowels, redundancy and prosodic structure.
Matthew P. Aylett
2002Stochastic trajectory model analysis for accent classification.
Pongtep Angkititrakul, John H. L. Hansen
2002Stop epenthesis at syllable boundaries.
Natasha Warner, Andrea Weber
2002Structural Gaussian mixture models for efficient text-independent speaker verification.
Bing Xiang, Toby Berger
2002Studying pronunciation variants in French by using alignment techniques.
Philippe Boula de Mareüil, Martine Adda-Decker
2002Subband based voice conversion.
Oytun Türk, Levent M. Arslan
2002Subjective assessment of frequency bands for perception of speaker identity.
Eda Ormanci, U. Hakan Nikbay, Oytun Türk, Levent M. Arslan
2002Submoraic awareness by Japanese school children: evidence from a novel game.
Takashi Otake, Akemi Iijima
2002Subset languages for conversing with collaborative interface agents.
Candace L. Sidner, Clifton Forlines
2002Subspace speech enhancement using subband whitening filter.
Jong Uk Kim, Chang D. Yoo
2002Suitable design of adaptive beamformer based on average speech spectrum for noisy speech recognition.
Takanobu Nishiura, Satoshi Nakamura, Yuka Okada, Takeshi Yamada, Kiyohiro Shikano
2002Swallowing and voice effects of Lee Silverman Voice Treatment (LSVT).
Jeri Logemann, Ralph Sundin, Jean Sundin
2002Syllable processing in English.
Ruth Kearns, Dennis Norris, Anne Cutler
2002Syllable recognition using syllable-segment statistics and syllable-based HMM.
Nobutoshi Takahashi, Seiichi Nakagawa
2002Syntax over focus.
Sun-Ah Jun
2002Talking to machines (statistically speaking).
Steve J. Young
2002Tempo modulations in English: selected pilot study results.
Sandra P. Kirkham
2002Text-dependent speaker verification using Lyapunov exponents.
Adriano Petry, Dante Augusto Couto Barone
2002The 2001 GMTK-based SPINE ASR system.
Özgür Çetin, Harriet J. Nock, Katrin Kirchhoff, Jeff A. Bilmes, Mari Ostendorf
2002The 2ch hybrid subtractive beamformer applied to line sound sources.
Mitsunori Mizumachi, Satoshi Nakamura
2002The AT&T German text-to-speech system: realistic linguistic description.
Matthias Jilka, Ann K. Syrdal
2002The ISL meeting corpus: the impact of meeting type on speech style.
Susanne Burger, Victoria MacLaren, Hua Yu
2002The acoustic realization of anger, fear, joy and sadness in Chinese.
Jiahong Yuan, Liqin Shen, Fangxin Chen
2002The Carnegie Mellon Communicator corpus.
Christina L. Bennett, Alexander I. Rudnicky
2002The effect of auditory-visual information and orthographic background in L2 acquisition.
V. Dogu Erdener, Denis Burnham
2002The effects of F0 manipulation on the perceived distance of speech.
Douglas Brungart, Alexander J. Kordik, Koel Das, Arnab K. Shaw
2002The effects of speech compression on speech recognition and text-to-speech synthesis.
Yeshwant K. Muthusamy, Yifan Gong, Roshan Gupta
2002The evolution of spoken language: a comparative approach.
W. Tecumseh Fitch
2002The influence of identification training on identification and production of the American English mid and low vowels by native speakers of Japanese.
Stephen G. Lambacher, William L. Martens, Kazuhiko Kakehi
2002The influence of speech coding on recognition performance in telecommunication networks.
Hans-Günter Hirsch
2002The perception of stop consonant sequences in dyslexic and normal children.
Noël Nguyen, Ludovic Jankowski, Michel Habib
2002The perceptual basis for audiovisual speech integration.
Lawrence D. Rosenblum
2002The relationship between pure-tone sequential stream segregation and perceptual separation of male and female talkers by listeners with hearing loss.
Carol L. Mackersie
2002The reliability of the ITU-T P.85 standard for the evaluation of text-to-speech systems.
Yolanda Vazquez-Alvarez, Mark A. Huckvale
2002The stimulus as basis for audiovisual integration.
Eric Vatikiotis-Bateson, Harold Hill, Miyuki Kamachi, Karen Lander, Kevin G. Munhall
2002The structure and its implementation of hidden dynamic HMM for Mandarin speech recognition.
Feili Chen, Jie Zhu, Wentao Song
2002Think big, from voice to limb movement therapy.
Becky G. Farley
2002Three-dimensional electromagnetic articulograph based on a nonparametric representation of the magnetic field.
Tokihiko Kaburagi, Kohei Wakamiya, Masaaki Honda
2002Time-compressing natural and synthetic speech.
Esther Janse
2002Time-frequency transforms and beamforming for speaker recognition.
Antonio Satué-Villar, Juan Fernández-Rubio
2002Tone recognition in Thai continuous speech based on coarticulation, intonation and stress effects.
Nuttakorn Thubthong, Boonserm Kijsirikul, Sudaporn Luksaneeyanawin
2002Topic detection of an utterance for speech dialogue processing.
Katsushi Asami, Toshiyuki Takezawa, Gen-ichiro Kikui
2002Topic tracking using subject templates.
Yoshimi Suzuki, Fumiyo Fukumoto, Yoshihiro Sekiguchi
2002Towards a grammar of spoken language: incorporating paralinguistic information.
Nick Campbell
2002Towards an intonation module for a Portuguese TTS system.
Diamantino Freitas, Daniela Braga
2002Towards automatic closed captioning: low latency real time broadcast news transcription.
Murat Saraclar, Michael Riley, Enrico Bocchieri, Vincent Goffin
2002Towards every-citizen's speech interface: an application generator for speech interfaces to databases.
Arthur R. Toth, Thomas K. Harris, James Sanders, Stefanie Shriver, Roni Rosenfeld
2002Towards the question: why has speaking rate such an impact on speech recognition performance?
Robert Faltlhauser, Günther Ruske, Matthias Thomae
2002Training topic classifiers for conversational speech with limited data.
Rukmini Iyer, Jeffrey Z. Ma, Herbert Gish, Owen Kimball
2002Transducer search space modelings for large-vocabulary speech recognition.
Hans J. G. A. Dolfing
2002Transform-based feature vector compression for distributed speech recognition.
Ben Milner, Xu Shao
2002Transformation of spectral envelope for voice conversion based on radial basis function networks.
Tomomi Watanabe, Takahiro Murakami, Munehiro Namba, Tetsuya Hoya, Yoshihisa Ishida
2002Transmission characteristics of outer ear canal.
Karel Pellant, Jan Mejzlík, Karel Prikryl, Zdenek Skvor
2002Tree-structured maximum a posteriori adaptation for a segment-based speech recognition system.
Irina Illina
2002Unconstrained versus constrained acoustic normalisation in confidence scoring.
Jacques Duchateau, Patrick Wambacq
2002Unified task knowledge for spoken language understanding and dialog management.
Jerry H. Wright, Alicia Abella, Allen L. Gorin
2002Unknown-multiple speaker clustering using HMM.
Jitendra Ajmera, Hervé Bourlard, I. Lapidot, Iain McCowan
2002Unsupervised acoustic model adaptation based on phoneme error minimization.
Jun Ogata, Yasuo Ariki
2002Unsupervised language model adaptation for lecture speech transcription.
Thomas Niesler, Daniel Willett
2002Unsupervised n-best based model adaptation using model-level confidence measures.
Ka-Yan Kwan, Tan Lee, Chen Yang
2002Unsupervised speaker segmentation of telephone conversations.
Aaron E. Rosenberg, Allen L. Gorin, Zhu Liu, Sarangarajan Parthasarathy
2002User-customized password speaker verification based on HMM/ANN and GMM models.
Mohamed Faouzi BenZeghiba, Hervé Bourlard
2002User-tailored generation for spoken dialogue: an experiment.
Amanda Stent, Marilyn A. Walker, Steve Whittaker, Preetam Maloor
2002Using EM-trained string-edit distances for approximate matching of acoustic morphemes.
Michael Levit, Elmar Nöth, Allen L. Gorin
2002Using adaptive signal limiter together with weighting techniques for noisy speech recognition.
Wei-Wen Hung
2002Using cross-language cues for story-specific language modeling.
Sanjeev Khudanpur, Woosung Kim
2002Using dynamic WFST composition for recognizing broadcast news.
Diamantino Caseiro, Isabel Trancoso
2002Using observation uncertainty in HMM decoding.
Jon A. Arrowood, Mark A. Clements
2002Using part-of-speech tags, context thresholding, and trigram contexts to improve the auto-induction of semantic classes.
Andrew N. Pargellis, Eric Fosler-Lussier, Augustine Tsai
2002Using start/end timings of spectral transitions between phonemes in concatenative speech synthesis.
Toshio Hirai, Seiichi Tenpaku, Kiyohiro Shikano
2002Using time-stretched pulses for accurate splitting of speech utterances played back in noisy reverberant environments.
Dorothea Kolossa, Qiang Huo
2002Using x-grams for speech-to-speech translation.
Adrià de Gispert, José B. Mariño
2002Utterance verification based on neighborhood information and Bayes factors.
Hui Jiang, Chin-Hui Lee
2002Validation and improvement of automatic phonetic transcriptions.
Catia Cucchiarini, Diana Binnenpoorte
2002Variability in direction of dorsal movement during production of /l/.
Natasha Warner, Allard Jongman, Doris Mücke
2002Variability in the production of glottalized sonorants: data from Yapese.
Ian Maddieson, Julie Larson
2002VisSTA: a tool for analyzing multimodal discourse data.
Francis K. H. Quek, Yang Shi, Cemil Kirbas, Shunguang Wu
2002Vocabulary independent OOV detection using support vector machines.
Tommi Lahti, Janne Suontausta
2002Vocalization age as a clinical tool.
Harriet J. Fell, Joel MacAuslan, Linda J. Ferrier, Susan G. Worst, Karen Chenausky
2002Voice transformations for improving children's speech recognition in a publicly available dialogue system.
Joakim Gustafson, Kåre Sjölander
2002Vowel classification for computer-based visual feedback for speech training for the hearing impaired.
Stephen A. Zahorian, A. Matthew Zimmer, Fansheng Meng
2002Warped-LP residual resampling using DCT for pitch modification.
R. Muralishankar, A. G. Ramakrishnan, P. Prathibha
2002Weighted graph based decision tree optimization for high accuracy acoustic modeling.
Sheng Gao, Jinsong Zhang, Satoshi Nakamura, Chin-Hui Lee, Tat-Seng Chua
2002What relationship between protrusion anticipation and auditory perception?
Rudolph Sock, Béatrice Vaxelaire, Véronique Hecker, Fabrice Hirsch
2002Wizard of Oz evaluation of a dialogue with communicator system in Chile.
Néstor Becerra Yoma, Angela Cortés, Mauricio Hormazábal, Enrique López
2002Word endpoints detection in the presence of non-stationary noise.
Mario Toma, Andrea Lodi, Roberto Guerrieri
2002X-JToBI: an extended J-ToBI for spontaneous speech.
Kikuo Maekawa, Hideaki Kikuchi, Yosuke Igarashi, Jennifer J. Venditti