| 2006 | "yeah right": sarcasm recognition for spoken dialogue systems. Joseph Tepperman, David R. Traum, Shrikanth S. Narayanan |
| 2006 | /nailon/ - software for online analysis of prosody. Jens Edlund, Mattias Heldner |
| 2006 | 50 years late: repeating miller-nicely 1955. Andrew Lovitt, Jont B. Allen |
| 2006 | A DTW-based dissimilarity measure for left-to-right hidden Markov models and its application to word confusability analysis. Qiang Huo, Wei Li |
| 2006 | A Spanish speech to sign language translation system for assisting deaf-mute people. Rubén San Segundo, Roberto Barra-Chicote, Luis Fernando D'Haro, Juan Manuel Montero, Ricardo de Córdoba, Javier Ferreiros |
| 2006 | A bootstrapping approach for developing language model of new spoken dialogue systems by selecting web texts. Teruhisa Misu, Tatsuya Kawahara |
| 2006 | A case study in the identification of prosodic cues to turn-taking: back-channeling in Arabic. Nigel G. Ward, Yaffa Al Bayyari |
| 2006 | A clustering approach to semantic decoding. Hui Ye, Steve J. Young |
| 2006 | A cohort - UBM approach to mitigate data sparseness for in-set/out-of-set speaker recognition. Vinod Prakash, John H. L. Hansen |
| 2006 | A comparative study of Gaussian selection methods in large vocabulary continuous speech recognition. Dirk Gehrig, Thomas Schaaf |
| 2006 | A comparison of inter-transcriber reliability for two systems of prosodic annotation: rap (rhythm and pitch) and toBI (tones and break indices). Laura Dilley, Mara Breen, Marti Bolivar, John Kraemer, Edward Gibson |
| 2006 | A comparison of singing evaluation algorithms. Partha Lal |
| 2006 | A computational auditory scene analysis system for robust speech recognition. Soundararajan Srinivasan, Yang Shao, Zhaozhang Jin, DeLiang Wang |
| 2006 | A constrained baum-welch algorithm for improved phoneme segmentation and efficient training. David Huggins-Daines, Alexander I. Rudnicky |
| 2006 | A discriminative method for speaker verification using the difference information. Zhenchun Lei, Yingchun Yang, Zhaohui Wu |
| 2006 | A framework for robust MFCC feature extraction using SNR-dependent compression of enhanced mel filter bank energies. Babak Nasersharif, Ahmad Akbari |
| 2006 | A hybrid phrase-based/statistical speech translation system. David Stallard, Fred Choi, Kriste Krstovski, Prem Natarajan, Rohit Prasad, Shirin Saleem |
| 2006 | A joint intention-based dialogue engine. Rajah Annamalai Subramanian, Philip R. Cohen |
| 2006 | A maximum likelihood training approach to irrelevant variability compensation based on piecewise linear transformations. Qiang Huo, Donglai Zhu |
| 2006 | A model for the f0 reset in corpus-based intonation approaches. Francisco Campillo Díaz, Jan P. H. van Santen, Eduardo Rodríguez Banga |
| 2006 | A model of the regularities underlying speaker variation: evidence from hybrid synthesis. Susan R. Hertz |
| 2006 | A multi-pass error detection and correction framework for Mandarin LVCSR. Zhengyu Zhou, Helen M. Meng, Wai Kit Lo |
| 2006 | A multi-space distribution (MSD) approach to speech recognition of tonal languages. Huanliang Wang, Yao Qian, Frank K. Soong, Jian-Lai Zhou, Jiqing Han |
| 2006 | A multiclass framework for speaker verification within an acoustic event sequence system. Nicolas Scheffer, Jean-François Bonastre |
| 2006 | A multilingual embodied conversational agent for tutoring speech and language learning. Dominic W. Massaro, Ying Liu, Trevor H. Chen, Charles Perfetti |
| 2006 | A multilingual expectations model for contextual utterances in mixed-initiative spoken dialogue. Hartwig Holzapfel, Alex Waibel |
| 2006 | A multipitch tracker for monaural speech segmentation. André Coy, Jon Barker |
| 2006 | A new HMM adaptation approach for the case of a hands-free speech input in reverberant rooms. Hans-Günter Hirsch, Harald Finster |
| 2006 | A new dual-microphone speech enhancement method for oriented noises. Hamid Reza Abutalebi, Majid Pourahmadi, Masoud Reza Aghabozorgi |
| 2006 | A new framework for system combination based on integrated hypothesis space. I-Fan Chen, Lin-Shan Lee |
| 2006 | A new set of features for text-independent speaker identification. Carol Y. Espy-Wilson, Sandeep Manocha, Srikanth Vishnubhotla |
| 2006 | A new single-ended measure for assessment of speech quality. Timothy Murphy, Dorel Picovici, Abdulhussain E. Mahdi |
| 2006 | A new state-dependent phonetic tied-mixture model with head-body-tail structured HMM for real-time continuous phoneme recognition system. Junho Park, Hanseok Ko |
| 2006 | A noninvasive, low-cost device to study the velopharyngeal port during speech and some preliminary results. Xiaochuan Niu, Alexander Kain, Jan P. H. van Santen |
| 2006 | A novel environment-dependent speech enhancement method with optimized memory footprint. Suhadi Suhadi, Sorel Stan, Tim Fingscheidt |
| 2006 | A novel framework of text-independent speaker verification based on utterance transform and iterative cohort modeling. Ming Liu, Huazhong Ning, Thomas S. Huang, Zhengyou Zhang |
| 2006 | A phrase-level machine translation approach for disfluency detection using weighted finite state transducers. Sameer Maskey, Bowen Zhou, Yuqing Gao |
| 2006 | A pitch marks filtering algorithm based on restricted dynamic programming. Francesc Alías, Carlos Monzo, Joan Claudi Socoró |
| 2006 | A probabilistic graphical model for microphone array source separation using rich pre-trained source models. Hagai Thomas Attias |
| 2006 | A quality measure method using Gaussian mixture models and divergence measure for speaker identification. Rong Zheng, Shuwu Zhang, Bo Xu |
| 2006 | A robust feature extraction based on the MTF concept for speech recognition in reverberant environment. Xugang Lu, Masashi Unoki, Masato Akagi |
| 2006 | A robust fusion method for multilingual spoken document retrieval systems employing tiered resources. Murat Akbacak, John H. L. Hansen |
| 2006 | A simulated-data adaptation technique for robust speech recognition. Nattanun Thatphithakkul, Boontee Kruatrachue, Chai Wutiwiwatchai, Sanparith Marukatat, Vataya Boonpiam |
| 2006 | A simulation based parameter optimization for a coarticulation model. Jianguo Wei, Xugang Lu, Jianwu Dang |
| 2006 | A speaker adaptation algorithm using principal curves in noisy environments. Jingying Wang, Zuoying Wang |
| 2006 | A spectral clustering approach to speaker diarization. Huazhong Ning, Ming Liu, Hao Tang, Thomas S. Huang |
| 2006 | A spectral-temporal method for pitch tracking. Stephen A. Zahorian, Princy Dikshit, Hongbing Hu |
| 2006 | A spoken language understanding approach using successive learners. Wei-Lin Wu, Ruzhan Lu, Hui Liu, Feng Gao |
| 2006 | A stochastic approach for dialog management based on neural networks. Lluís F. Hurtado, David Griol, Encarna Segarra, Emilio Emilio, Sanchis Sanchis |
| 2006 | A study of emotional speech articulation using a fast magnetic resonance imaging technique. Sungbok Lee, Erik Bresch, Jason Adams, Abe Kazemzadeh, Shrikanth S. Narayanan |
| 2006 | A study on detection based automatic speech recognition. Chengyuan Ma, Yu Tsao, Chin-Hui Lee |
| 2006 | A study on lattice rescoring with knowledge scores for automatic speech recognition. Sabato Marco Siniscalchi, Jinyu Li, Chin-Hui Lee |
| 2006 | A style control technique for speech synthesis using multiple regression HSMM. Takashi Nose, Junichi Yamagishi, Takao Kobayashi |
| 2006 | A successive state and mixture splitting for optimizing the size of models in speech recognition. Soo-Young Suk, Seong-Jun Hahm, Ho-Youl Jung, Hyun-Yeol Chung |
| 2006 | A syllable based continuous speech recognizer for Tamil. A. Lakshmi, Hema A. Murthy |
| 2006 | A technique for controlling voice quality of synthetic speech using multiple regression HSMM. Makoto Tachibana, Takashi Nose, Junichi Yamagishi, Takao Kobayashi |
| 2006 | A text-prompted distributed speaker verification system implemented on a cellular phone and a mobile terminal. Tsuneo Kato, Hisashi Kawai |
| 2006 | A texttiling based approach to topic boundary detection in meetings. Satanjeev Banerjee, Alexander I. Rudnicky |
| 2006 | A time-synchronous phonetic decoder for a long-contextual-Span hidden trajectory model. Xiaolong Li, Li Deng, Dong Yu, Alex Acero |
| 2006 | A tone recognition framework for continuous Mandarin speech. Lei He, Jie Hao |
| 2006 | A trajectory mixture density network for the acoustic-articulatory inversion mapping. Korin Richmond |
| 2006 | A user simulator based on voiceXML for evaluation of spoken dialog systems. Akinori Ito, Keisuke Shimada, Motoyuki Suzuki, Shozo Makino |
| 2006 | A vector space approach to environment modeling for robust speech recognition. Yu Tsao, Chin-Hui Lee |
| 2006 | A wavelet-based parameterization for speech/music segmentation. E. Didiot, Irina Illina, Odile Mella, Dominique Fohr, Jean Paul Haton |
| 2006 | A weight estimation method using LDA for multi-band speech recognition. Koji Iwano, Kaname Kojima, Sadaoki Furui |
| 2006 | ASR-based corrective feedback on pronunciation: does it really work? Ambra Neri, Catia Cucchiarini, Helmer Strik |
| 2006 | Accident - execute: increased activation in nonnative listening. Mirjam Broersma |
| 2006 | Acoustic analysis and automatic recognition of spontaneous children²s speech. Matteo Gerosa, Diego Giuliani, Shrikanth S. Narayanan |
| 2006 | Acoustic characterization of children with speech delay. H. Timothy Bunnell, James B. Polikoff |
| 2006 | Acoustic cues for the classification of regular and irregular phonation. Kushan Surana, Janet Slifka |
| 2006 | Acoustic model training based on linear transformation and MAP modification for HSMM-based speech synthesis. Katsumi Ogata, Makoto Tachibana, Junichi Yamagishi, Takao Kobayashi |
| 2006 | Acoustic modeling for spoken dialogue systems based on unsupervised utterance-based selective training. Tobias Cincarek, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano |
| 2006 | Adaptive filtering for attenuating musical noise caused by spectral subtraction. Takahiro Murakami, Yoshihisa Ishida |
| 2006 | Adaptive multimodal fusion by uncertainty compensation. Vassilis Pitsikalis, Athanassios Katsamanis, George Papandreou, Petros Maragos |
| 2006 | Adaptive speech enhancement for speech separation in diffuse noise. Rong Hu, Yunxin Zhao |
| 2006 | Advances in lecture recognition: the ISL RT-06s evaluation system. Christian Fügen, Matthias Wölfel, John W. McDonough, Shajith Ikbal, Florian Kraft, Kornel Laskowski, Mari Ostendorf, Sebastian Stüker, Ken'ichi Kumatani |
| 2006 | All-pole model estimation of vocal tract on the frequency domain. Luis Weruaga, Amar Al-Khayat |
| 2006 | Amharic speech synthesis using cepstral method with stress generation rule. Tadesse Anberbir, Tomio Takara |
| 2006 | An ERB loudness pattern based objective speech quality measure. Guo Chen, Vijay Parsa, Susan Scollie |
| 2006 | An HMM-based singing voice synthesis system. Keijiro Saino, Heiga Zen, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda |
| 2006 | An MRI based study of the acoustic effects of sinus cavities and its application to speaker recognition. Tarun Pruthi, Carol Y. Espy-Wilson |
| 2006 | An acoustic and articulatory study of Lombard speech: global effects on the utterance. Maeva Garnier, Lucie Bailly, Marion Dohen, Pauline Welby, Hélène Loevenbruck |
| 2006 | An adaptive sampling procedure for speech perception experiments. Geoffrey Stewart Morrison |
| 2006 | An annotation scheme for agreement analysis. Siew Leng Toh, Fan Yang, Peter A. Heeman |
| 2006 | An annotation scheme for complex disfluencies. Peter A. Heeman, Andy McMillin, J. Scott Yaruss |
| 2006 | An assessment of automatic speech recognition as speech intelligibility estimation in the context of additive noise. Wei Ming Liu, John S. D. Mason, Nicholas W. D. Evans, Keith A. Jellyman |
| 2006 | An automatic singing skill evaluation method for unknown melodies using pitch interval accuracy and vibrato features. Tomoyasu Nakano, Masataka Goto, Yuzuru Hiraga |
| 2006 | An effective and efficient utterance verification technology using word n-gram filler models. Dong Yu, Yun-Cheng Ju, Alex Acero |
| 2006 | An efficient bispectrum phase entropy-based algorithm for VAD. J. M. Górriz, Javier Ramírez, Carlos García Puntonet, José C. Segura |
| 2006 | An efficient segment-based speech compression technique for hand-held TTS systems. Chang-Heon Lee, Sung-Kyo Jung, Thomas Eriksson, Won-Suk Jun, Hong-Goo Kang |
| 2006 | An improved affine projection algorithm based crosstalk resistant adaptive noise canceller. Guo Chen, Vijay Parsa |
| 2006 | An improved mel-wiener filter for mel-LPC based speech recognition. Md. Babul Islam, Hiroshi Matsumoto, Kazumasa Yamamoto |
| 2006 | An incremental algorithm for signal reconstruction from short-time fourier transform magnitude. Jake V. Bouvrie, Tony Ezzat |
| 2006 | An information theoretic tool for investigating speech perception. Bryce E. Lobdell, Jont B. Allen |
| 2006 | An integrated approach to improve speech recognition rate for non-native speakers. Yunbin Deng, Xiaokun Li, Chiman Kwan, Roger Xu, Bhiksha Raj, Richard M. Stern, David Williamson |
| 2006 | An integrated solution for error concealment in DSR systems over wireless channels. Antonio M. Peinado, Angel M. Gomez, Victoria E. Sánchez, José L. Pérez-Córdoba, Antonio J. Rubio |
| 2006 | An investigation of manifold learning for speech analysis. Andrew Errity, John McKenna |
| 2006 | An online adaptive filtering algorithm for the vocal joystick. Xiao Li, Jonathan Malkin, Susumu Harada, Jeff A. Bilmes, Richard Wright, James A. Landay |
| 2006 | An optimum microphone array post-filter for speech applications. Stamatios Lefkimmiatis, Dimitrios Dimitriadis, Petros Maragos |
| 2006 | An unified unit-selection framework for ultra low bit-rate speech coding. V. Ramasubramanian, D. Harish |
| 2006 | An user-centered development of an intuitive dialog control for speech-controlled music selection in cars. Stefan Schulz, Hilko Donker |
| 2006 | Analysis and detection of speech under sleep deprivation. Tin Lay Nwe, Haizhou Li, Minghui Dong |
| 2006 | Analysis of HMM temporal evolution for automatic speech recognition and utterance verification. Marta Casar, José A. R. Fonollosa |
| 2006 | Analysis of correlation between audio and visual speech features for clean audio feature prediction in noise. Ibrahim Almajai, Ben Milner, Jonathan Darch |
| 2006 | Analysis of lombard effect under different types and levels of noise with application to in-set speaker ID systems. Vaishnevi S. Varadarajan, John H. L. Hansen |
| 2006 | Analysis of nonmodal phonation using minimum entropy deconvolution. Nicolas Malyska, Thomas F. Quatieri |
| 2006 | Analysis of overlaps in meetings by dialog factors, hot spots, speakers, and collection site: insights for automatic speech recognition. Özgür Çetin, Elizabeth Shriberg |
| 2006 | Analysis of prosodic and linguistic cues of phrase finals for turn-taking and dialog acts. Carlos Toshinori Ishi, Hiroshi Ishiguro, Norihiro Hagita |
| 2006 | Analyzing dialogue data for real-world emotional speech classification. Ryuichi Nisimura, Souji Omae, Hideki Kawahara, Toshio Irino |
| 2006 | Analyzing reusability of speech corpus based on statistical multidimensional scaling method. Goshu Nagino, Makoto Shozakai |
| 2006 | Articulatory features for "meeting" speech recognition. Florian Metze |
| 2006 | Assessing the reading level of web pages. Sarah E. Petersen, Mari Ostendorf |
| 2006 | Assessment of articulatory sub-systems of dysarthric speech using an isolated-style phoneme recognition system. P. Vijayalakshmi, M. Ramasubba Reddy, Douglas D. O'Shaughnessy |
| 2006 | Audio person tracking in a smart-room environment. Alberto Abad, Carlos Segura, Dusan Macho, Javier Hernando, Climent Nadeu |
| 2006 | Audio-visual speech recognition in the presence of a competing speaker. Xu Shao, Jon Barker |
| 2006 | Auto-segmentation based VAD for robust ASR. Yu Shi, Frank K. Soong, Jian-Lai Zhou |
| 2006 | Automatic English stop consonants classification using wavelet analysis and hidden Markov models. Marco Kühne, Roberto Togneri |
| 2006 | Automatic Mandarin pronunciation scoring for native learners with dialect accent. Si Wei, Qing-Sheng Liu, Yu Hu, Ren-Hua Wang |
| 2006 | Automatic acoustic identification of insects inspired by the speaker recognition paradigm. Ilyas Potamitis, Todor Ganchev, Nikos Fakotakis |
| 2006 | Automatic alignment and error correction of human generated transcripts for long speech recordings. Timothy J. Hazen |
| 2006 | Automatic assignment of anchoring points on vowel templates for defining correspondence between time-frequency representations of speech samples. Toru Takahashi, Masashi Nishi, Toshio Irino, Hideki Kawahara |
| 2006 | Automatic detection of irregular phonation in continuous speech. Srikanth Vishnubhotla, Carol Y. Espy-Wilson |
| 2006 | Automatic detection of voice onset time contrasts for use in pronunciation assessment. Abe Kazemzadeh, Joseph Tepperman, Jorge F. Silva, Hong You, Sungbok Lee, Abeer Alwan, Shrikanth S. Narayanan |
| 2006 | Automatic emotion recognition of speech signal in Mandarin. Sheng Zhang, P. C. Ching, Fanrang Kong |
| 2006 | Automatic generation of statistical language models for interactive voice response applications. Mithun Balakrishna, Cyril Cerovic, Dan I. Moldovan, Ellis Cave |
| 2006 | Automatic grammar correction for second-language learners. John Lee, Stephanie Seneff |
| 2006 | Automatic initial/final generation for dialectal Chinese speech recognition. Linquan Liu, Thomas Fang Zheng, Wenhu Wu |
| 2006 | Automatic language identification using wavelets. Ana Lilia Reyes-Herrera, Luis Villaseñor-Pineda, Manuel Montes-y-Gómez |
| 2006 | Automatic metadata generation and video editing based on speech and image recognition for medical education contents. Satoshi Tamura, Koji Hashimoto, Jiong Zhu, Satoru Hayamizu, Hirotsugu Asai, Hideki Tanahashi, Makoto Kanagawa |
| 2006 | Automatic phonetic segmentation by using a SPM-based approach for a Mandarin singing voice corpus. Cheng-Yuan Lin, Jyh-Shing Roger Jang |
| 2006 | Automatic phonetic transcription of large speech corpora: a comparative study. Christophe Van Bael, Lou Boves, Henk van den Heuvel, Helmer Strik |
| 2006 | Automatic recognition of speakers' age and gender on the basis of empirical studies. Christian A. Müller |
| 2006 | Automatic removal of typed keystrokes from speech signals. Amarnag Subramanya, Michael L. Seltzer, Alex Acero |
| 2006 | Automatic speech recognition experiments with articulatory data. Esmeralda Uraga, Thomas Hain |
| 2006 | Automatic speech recognition of Cantonese-English code-mixing utterances. Joyce Y. C. Chan, P. C. Ching, Tan Lee, Houwei Cao |
| 2006 | Automatic speech segmentation with multiple statistical models. Seung Seop Park, Jong Won Shin, Nam Soo Kim |
| 2006 | Automatic syllable-pattern induction in statistical Thai text-to-phone transcription. Ausdang Thangthai, Chatchawarn Hansakunbuntheung, Rungkarn Siricharoenchai, Chai Wutiwiwatchai |
| 2006 | Automatic transcription of Somali language. Abdillahi Nimaan, Pascal Nocera, Jean-François Bonastre |
| 2006 | BINSEG: an efficient speaker-based segmentation technique. Jindrich Zdánský |
| 2006 | Basque-Spanish language identification using phone-based methods. Víctor G. Guijarrubia, M. Inés Torres |
| 2006 | Bayesian decision tree state tying for conversational speech recognition. Rusheng Hu, Yunxin Zhao |
| 2006 | Bayesian networks for phonetic classification using time-scale features. Franz Pernkopf, Tuan Van Pham |
| 2006 | Boosting HMM performance with a memory upgrade. Mathias De Wachter, Kris Demuynck, Dirk Van Compernolle |
| 2006 | Bootstrapping language models for dialogue systems. Karl Weilhammer, Matthew N. Stuttle, Steve J. Young |
| 2006 | Building an English speech synthesis system from a Japanese ALS patient²s voice. Akemi Iida, Jun Ito, Shimpei Kajima, Tsutomu Sugawara |
| 2006 | Building an English-iraqi Arabic machine translation system for spoken utterances with limited resources. Jason Riesa, Behrang Mohit, Kevin Knight, Daniel Marcu |
| 2006 | CASA based speech separation for robust speech recognition. Runqiang Han, Pei Zhao, Qin Gao, Zhiping Zhang, Hao Wu, Xihong Wu |
| 2006 | CENSREC2: corpus and evaluation environments for in car continuous digit speech recognition. Satoshi Nakamura, Masakiyo Fujimoto, Kazuya Takeda |
| 2006 | CHAT: a conversational helper for automotive tasks. Fuliang Weng, Sebastian Varges, Badri Raghunathan, Florin Ratiu, Heather Pon-Barry, Brian Lathrop, Qi Zhang, Harry Bratt, Tobias Scheideck, Kui Xu, Matthew Purver, Rohit Mishra, Annie Lien, Madhuri Raya, Stanley Peters, Yao Meng, J. Russell, Lawrence Cavedon, Elizabeth Shriberg, Hauke Schmidt, R. Prieto |
| 2006 | CLUSTERGEN: a statistical parametric synthesizer using trajectory modeling. Alan W. Black |
| 2006 | Call analysis with classification using speech and non-speech features. Yun-Cheng Ju, Ye-Yi Wang, Alex Acero |
| 2006 | Category formation and the role of spectral quality in the perception and production of English front vowels. Ricardo Augusto Hoffmann Bion, Paola Escudero, Andréia S. Rauber, Barbara O. Baptista |
| 2006 | Characterization of cued speech vowels from the inner lip contour. Noureddine Aboutabit, Denis Beautemps, Laurent Besacier |
| 2006 | Chinese input method based on reduced Mandarin phonetic alphabet. Chun-Han Tseng, Chia-Ping Chen |
| 2006 | Classified comfort noise generation for efficient voice transmission. Yasheng Qian, Wei-Shou Hsu, Peter Kabal |
| 2006 | Classroom success of an intelligent tutoring system for lexical practice and reading comprehension. Michael Heilman, Kevyn Collins-Thompson, Jamie Callan, Maxine Eskénazi |
| 2006 | Clean speech feature estimation based on soft spectral masking. Young Joon Kim, Woohyung Lim, Nam Soo Kim |
| 2006 | Cluster-based user simulations for learning dialogue strategies. Verena Rieser, Oliver Lemon |
| 2006 | Colloquial Iraqi ASR for speech translation. Shirin Saleem, Rohit Prasad, Prem Natarajan |
| 2006 | Combining acoustic, lexical, and syntactic evidence for automatic unsupervised prosody labeling. Sankaranarayanan Ananthakrishnan, Shrikanth S. Narayanan |
| 2006 | Combining missing-feature theory, speech enhancement and speaker-dependent/-independent modeling for speech separation. Ji Ming, Timothy J. Hazen, James R. Glass |
| 2006 | Combining multiple-sized sub-word units in a speech recognition system using baseform selection. T. Nagarajan, P. Vijayalakshmi, Douglas D. O'Shaughnessy |
| 2006 | Combining phonetic attributes using conditional random fields. Jeremy Morris, Eric Fosler-Lussier |
| 2006 | Compact n-gram models by incremental growing and clustering of histories. Sami Virpioja, Mikko Kurimo |
| 2006 | Comparative analysis of formants of British, american and australian accents. Seyed Ghorshi, Saeed Vaseghi, Qin Yan |
| 2006 | Comparative study on contributions of pitch-synchronization and peak-amplitude towards robustness issue of ASR. Muhammad Ghulam, Junsei Horikawa, Tsuneo Nitta |
| 2006 | Comparison of Slovak and Czech speech recognition based on grapheme and phoneme acoustic models. Slavomír Lihan, Jozef Juhár, Anton Cizmar |
| 2006 | Comparison of acoustic modeling techniques for Vietnamese and Khmer ASR. Viet Bac Le, Laurent Besacier |
| 2006 | Comparison of keyword spotting methods for searching in speech. Lubos Smídl, Josef V. Psutka |
| 2006 | Comparison of prediction based LSF quantization methods using split VQ. Saikat Chatterjee, T. V. Sreenivas |
| 2006 | Comparison of the ITU-t p.85 standard to other methods for the evaluation of text-to-speech systems. Dmitry Sityaev, Katherine M. Knill, Tina Burrows |
| 2006 | Computer aided pronunciation learning system using speech recognition techniques. Sherif Mahdy Abdou, Salah Eldeen Hamid, Mohsen A. Rashwan, Abdurrahman Samir, Ossama Abdel-Hamid, Mostafa Shahin, Waleed Nazih |
| 2006 | Computer-assisted closed-captioning of live TV broadcasts in French. Gilles Boulianne, Jean-Francois Beaumont, Maryse Boisvert, Julie Brousseau, Patrick Cardinal, Claude Chapdelaine, Michel Comeau, Pierre Ouellet, Frédéric Osterrath |
| 2006 | Conceptual decoding from word lattices: application to the spoken dialogue corpus MEDIA. Christophe Servan, Christian Raymond, Frédéric Béchet, Pascal Nocera |
| 2006 | Conditional random fields for hierarchical segment selection in text-to-speech synthesis. Christian Weiss, Wolfgang Hess |
| 2006 | Consonant and vowel confusions in speech-weighted noise. Sandeep Phatak, Jont B. Allen |
| 2006 | Constrained structural maximum a posteriori linear regression for average-voice-based speech synthesis. Yuji Nakano, Makoto Tachibana, Junichi Yamagishi, Takao Kobayashi |
| 2006 | Constructing stylistic synthesis databases from audio books. Yong Zhao, Di Peng, Lijuan Wang, Min Chu, Yining Chen, Peng Yu, Jun Guo |
| 2006 | Continual on-line monitoring of Czech spoken broadcast programs. Jan Nouza, Jindrich Zdánský, Petr Cerva, Jan Kolorenc |
| 2006 | Continuous time-frequency masking method for blind speech separation with adaptive choice of threshold parameter using ICA. Zbynek Koldovský, Jan Nouza, Jan Kolorenc |
| 2006 | Conversational help desk: vague callers and context switch. Osamuyimen Stewart, Juan M. Huerta, Ea-Ee Jan, Cheng Wu, Xiang Li, David M. Lubensky |
| 2006 | Conversational quality estimation model for wideband IP-telephony services. Hitoshi Aoki, Atsuko Kurashima, Akira Takahashi |
| 2006 | Conversion from phoneme based to grapheme based acoustic models for speech recognition. Andrej Zgank, Zdravko Kacic |
| 2006 | Cooperation between global and local methods for the automatic segmentation of speech synthesis corpora. Safaa Jarifi, Dominique Pastor, Olivier Rosec |
| 2006 | Corpus design based on the kullback-leibler divergence for text-to-speech synthesis application. Aleksandra Krul, Géraldine Damnati, François Yvon, Thierry Moudenc |
| 2006 | Corpus-based generation of fundamental frequency contours using generation process model and considering emotional focuses. Keikichi Hirose, Yasufumi Asano, Nobuaki Minematsu |
| 2006 | Coupling particle filters with automatic speech recognition for speech feature enhancement. Friedrich Faubel, Matthias Wölfel |
| 2006 | Cross-language evaluation of voice-to-phoneme conversions for voice-tag application in embedded platforms. Yan Ming Cheng, Changxue Ma, Lynette Melnar |
| 2006 | Cross-lingual dialog model for speech to speech translation. Emil Ettelaie, Panayiotis G. Georgiou, Shrikanth S. Narayanan |
| 2006 | Cross-system adaptation and combination for continuous speech recognition: the influence of phoneme set and acoustic front-end. Sebastian Stüker, Christian Fügen, Susanne Burger, Matthias Wölfel |
| 2006 | Cues for hesitation in speech synthesis. Rolf Carlson, Kjell Gustafson, Eva Strangert |
| 2006 | Data-driven design of front-end filter bank for Lombard speech recognition. Hynek Boril, Petr Fousek, Petr Pollák |
| 2006 | Decision directed constrained iterative speech enhancement. Amit Das, John H. L. Hansen |
| 2006 | Decision tree-based training of probabilistic concatenation models for corpus-based speech synthesis. Shinsuke Sakai, Tatsuya Kawahara |
| 2006 | Design and performance analysis of a factoid question answering system for spontaneous speech transcriptions. Mihai Surdeanu, David Dominguez-Sal, Pere Comas |
| 2006 | Detecting anger in automated voice portal dialogs. Felix Burkhardt, Jitendra Ajmera, Roman Englert, Joachim Stegmann, Winslow Burleson |
| 2006 | Detecting question-bearing turns in spoken tutorial dialogues. Jackson Liscombe, Jennifer J. Venditti, Julia Hirschberg |
| 2006 | Detection and separation of speech events in meeting recordings. Futoshi Asano, Jun Ogata |
| 2006 | Detection of a third speaker in telephone conversations. Uchechukwu O. Ofoegbu, Ananth N. Iyer, Robert E. Yantorno, Stanley J. Wenndt |
| 2006 | Detection of quotations and inserted clauses and its application to dependency structure analysis in spontaneous Japanese. Ryoji Hamabe, Kiyotaka Uchimoto, Tatsuya Kawahara, Hitoshi Isahara |
| 2006 | Detection of word fragments in Mandarin telephone conversation. Cheng-Tao Chu, Yun-Hsuan Sung, Yuan Zhao, Daniel Jurafsky |
| 2006 | Developing an automatic assessment tool for children²s oral reading. Leen Cleuren, Jacques Duchateau, Alain Sips, Pol Ghesquière, Hugo Van hamme |
| 2006 | Developing consistent pronunciation models for phonemic variants. Marelie H. Davel, Etienne Barnard |
| 2006 | Developing speech dialogs for multimodal HMIs using finite state machines. Silke Goronzy, Raquel Mochales, Nicole Beringer |
| 2006 | Development and evaluation of speech database in automotive environments for practical speech recognition systems. Yasunari Obuchi, Nobuo Hataoka |
| 2006 | Development of a program for self assessment of Japanese pronunciation by English learners. Chiharu Tsurutani, Yutaka Yamauchi, Nobuaki Minematsu, Dean Luo, Kazutaka Maruyama, Keikichi Hirose |
| 2006 | Development of advanced dialog systems with PATE. Norbert Pfleger, Jan Schehl |
| 2006 | Development of prototype text-to-speech systems for northern sotho. H. J. Oosthuizen, S. T. Phihlela, Madimetja Jonas D. Manamela |
| 2006 | Development of slovak GALAXY/voiceXML based spoken language dialogue system to retrieve information from the internet. Jozef Juhár, Stanislav Ondás, Anton Cizmar, Milan Rusko, Gregor Rozinaj, Roman Jarina |
| 2006 | Dialog act tagging with support vector machines and hidden Markov models. Dinoj Surendran, Gina-Anne Levow |
| 2006 | Dialogue act compression via pitch contour preservation. Gabriel Murray, Steve Renals |
| 2006 | Discourse structure and speech recognition problems. Mihai Rotaru, Diane J. Litman |
| 2006 | Discriminant linear processing of time-frequency plane. Fabio Valente, Hynek Hermansky |
| 2006 | Discriminating speech and non-speech with regularized least squares. Ryan Rifkin, Nima Mesgarani |
| 2006 | Discriminative MLE training using a product of Gaussian likelihoods. T. Nagarajan, Douglas D. O'Shaughnessy |
| 2006 | Discriminative adaptation for speaker verification. Chris Longworth, Mark J. F. Gales |
| 2006 | Discriminative kernel-based phoneme sequence recognition. Joseph Keshet, Shai Shalev-Shwartz, Samy Bengio, Yoram Singer, Dan Chazan |
| 2006 | Discriminative models for spoken language understanding. Ye-Yi Wang, Alex Acero |
| 2006 | Discriminative named entity recognition of speech data using speech recognition confidence. Katsuhito Sudoh, Hajime Tsukada, Hideki Isozaki |
| 2006 | Disentangling gestural and auditory contrast accounts of compensation for coarticulation. Navin Viswanathan, James S. Magnuson, Carol A. Fowler |
| 2006 | Distance measure between Gaussian distributions for discriminating speaking styles. Goshu Nagino, Makoto Shozakai |
| 2006 | Distant-talking continuous speech recognition based on a novel reverberation model in the feature domain. Armin Sehr, Marcus Zeller, Walter Kellermann |
| 2006 | Doing research on a deployed spoken dialogue system: one year of let's go! experience. Antoine Raux, Dan Bohus, Brian Langner, Alan W. Black, Maxine Eskénazi |
| 2006 | Dynamic evidence models in a DBN phone recognizer. William Schuler, Tim Miller, Stephen T. Wu, Andrew Exley |
| 2006 | Dynamic extension of a grammar-based dialogue system: constructing an all-recipes knowing robot. Petra Gieselmann, Alex Waibel |
| 2006 | Dynamic help generation by estimating user²s mental model in spoken dialogue systems. Yuichiro Fukubayashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno |
| 2006 | Edge-splitting in a cumulative multimodal system, for a no-wait temporal threshold on information fusion, combined with an under-specified display. Edward C. Kaiser, Paulo Barthelmess |
| 2006 | Effect of dynamic information of formants on discrimination of English vowels in consonantal contexts by Japanese listeners. Akiyo Joto |
| 2006 | Effect of genre, speaker, and word class on the realization of given and new information. Agustín Gravano, Julia Hirschberg |
| 2006 | Effects of familiarity with faces and voices on second-language speech processing: components of memory traces. Debra M. Hardison |
| 2006 | Effects of featural similarity and overlap position on lexical confusions and overt similarity judgments. Sarah C. Creel, Delphine Dahan, Daniel Swingley |
| 2006 | Effects of frequency shifts on perceived naturalness and gender information in speech. Peter F. Assmann, Sophia Dembling, Terrance M. Nearey |
| 2006 | Effects of midline tongue piercing on spectral centroid frequencies of sibilants. Tom Kovacs, Donald S. Finan |
| 2006 | Effects of word frequency on the acoustic durations of affixes. Mark Pluymaekers, Mirjam Ernestus, R. Harald Baayen |
| 2006 | Efficient Gaussian mixture model evaluation in voice conversion. Jilei Tian, Jani Nurminen, Victor Popa |
| 2006 | Efficient VQ techniques and general noise shaping in noise feedback coding. Jes Thyssen, Juin-Hwey Chen |
| 2006 | Efficient interactive retrieval of spoken documents with key terms ranked by reinforcement learning. Yi-Cheng Pan, Jia-Yu Chen, Yen-shin Lee, Yi-Sheng Fu, Lin-Shan Lee |
| 2006 | Eigenvoice conversion based on Gaussian mixture model. Tomoki Toda, Yamato Ohtani, Kiyohiro Shikano |
| 2006 | Emotion detection in infants² cries based on a maximum likelihood approach. Shoichi Matsunaga, S. Sakaguchi, Masaru Yamashita, Sueharu Miyahara, S. Nishitani, Kazuyuki Shinohara |
| 2006 | Emotion recognition in spontaneous speech using GMMs. Daniel Neiberg, Kjell Elenius, Kornel Laskowski |
| 2006 | Emovoice: a system to generate emotions in speech. João P. Cabral, Luís C. Oliveira |
| 2006 | Enhanced dynamic codebook reordering for advanced quantizer structures. Jani Nurminen |
| 2006 | Enhancement of harmonic content of speech based on a dynamic programming pitch tracking algorithm. Mark R. Every, Philip J. B. Jackson |
| 2006 | Enhancing the performance of a GMM-based speaker identification system in a multi-microphone setup. Andreas Stergiou, Aristodemos Pnevmatikakis, Lazaros C. Polymenakos |
| 2006 | Estimation of the quality dimension "directness/frequency content" for the instrumental assessment of speech quality. Kirstin Scholz, Marcel Wältermann, Lu Huo, Alexander Raake, Sebastian Möller, Ulrich Heute |
| 2006 | Evaluating a virtual speech cuer. Guillaume Gibert, Gérard Bailly, Frédéric Elisei |
| 2006 | Evaluating prosody of Mandarin speech for language learning. Minghui Dong, Haizhou Li, Tin Lay Nwe |
| 2006 | Evaluation of a spoken dialogue system with usability tests and long-term pilot studies: similarities and differences. Markku Turunen, Jaakko Hakulinen, Anssi Kainulainen |
| 2006 | Evaluation of content presentation strategies for an in-car spoken dialogue system. Heather Pon-Barry, Fuliang Weng, Sebastian Varges |
| 2006 | Evaluation of objective measures for speech enhancement. Yi Hu, Philipos C. Loizou |
| 2006 | Evaluation of perceptual quality of control point reduction in rule-based synthesis. Kimmo Pärssinen, Marko Moberg |
| 2006 | Evaluation of voice activity detection by combining multiple features with weight adaptation. Yusuke Kida, Tatsuya Kawahara |
| 2006 | Evolving emotional prosody. Cecilia Ovesdotter Alm, Xavier Llorà |
| 2006 | Examining knowledge sources for human error correction. Yongmei Shi, Lina Zhou |
| 2006 | Example-based grapheme-to-phoneme conversion for Thai. Paisarn Charoenpornsawat, Tanja Schultz |
| 2006 | Expanding phonetic coverage in unit selection synthesis through unit substitution from a donor voice. Alistair Conkie, Ann K. Syrdal |
| 2006 | Experiments on Chinese speech recognition with tonal models and pitch estimation using the Mandarin speecon data. Ying Sun, Daniel Willett, Raymond Brueckner, Rainer Gruhn, Dirk Bühler |
| 2006 | Exploiting dendritic autocorrelogram structure to identify spectro-temporal regions dominated by a single sound source. Ning Ma, Phil D. Green, André Coy |
| 2006 | Exploiting polynomial-fit histogram equalization and temporal average for robust speech recognition. Shih-Hsiang Lin, Yao-Ming Yeh, Berlin Chen |
| 2006 | Exploiting semantic relations for a spoken language understanding application. Catherine Kobus, Géraldine Damnati, Lionel Delphin-Poulat, Renato De Mori |
| 2006 | Exploring the unknown - collecting 1000 speakers over the internet for the ph@ttsessionz database of adolescent speakers. Christoph Draxler |
| 2006 | Expressive prosody for unit-selection speech synthesis. Volker Strom, Robert A. J. Clark, Simon King |
| 2006 | Extension and further analysis of higher order cepstral moment normalization (HOCMN) for robust features in speech recognition. Chang-Wen Hsu, Lin-Shan Lee |
| 2006 | Extracting formants from short segments of speech using group delay functions. Joseph M. Anand, Sunitha Guruprasad, B. Yegnanarayana |
| 2006 | Factors affecting speakers² choice of fillers in Japanese presentations. Michiko Watanabe, Yasuharu Den, Keikichi Hirose, Shusaku Miwa, Nobuaki Minematsu |
| 2006 | Farsbayan: a unit selection based Farsi speech synthesizer. Mohammad Mehdi Homayounpour, Majid Namnabat |
| 2006 | Fast SVM training based on the choice of effective samples for audio classification. Shilei Zhang, Hongchen Jiang, Shuwu Zhang, Bo Xu |
| 2006 | Fast and effective retraining on contrastive vocal characteristics with bidirectional long short-term memory nets. Nicole Beringer |
| 2006 | Feature analysis for emotion recognition from Mandarin speech considering the special characteristics of Chinese language. Yi-Hao Kao, Lin-Shan Lee |
| 2006 | Feature and model space speaker adaptation with full covariance Gaussians. Daniel Povey, George Saon |
| 2006 | Feature combination using linear discriminant analysis and its pitfalls. Ralf Schlüter, András Zolnay, Hermann Ney |
| 2006 | Feature extraction for spectral continuity measures in concatenative speech synthesis. Barry Kirkpatrick, Darragh O'Brien, Ronan Scaife |
| 2006 | Feature normalization using smoothed mixture transformations. Patrick Kenny, Vishwa Gupta, Gilles Boulianne, Pierre Ouellet, Pierre Dumouchel |
| 2006 | Finding the gaps: applying a connectionist model of word segmentation to noisy phone-recognized speech data. C. Anton Rytting |
| 2006 | Formant-based English vowel assessment for Chinese in Taiwan. Jiang-Chun Chen, Wei-Tang Hsu, Jyh-Shing Roger Jang, Ren-yuan Lyu, Yuang-Chin Chiang |
| 2006 | Forward-backwards training of hybrid HMM/BN acoustic models. Konstantin Markov, Satoshi Nakamura |
| 2006 | Frame based system combination and a comparison with weighted ROVER and CNC. Björn Hoffmeister, Tobias Klein, Ralf Schlüter, Hermann Ney |
| 2006 | Frequency warping based on mapping formant parameters. Zhiwei Shuang, Raimo Bakis, Slava Shechtman, Dan Chazan, Yong Qin |
| 2006 | Frequency warping by linear transformation of standard MFCC. Sankaran Panchapagesan |
| 2006 | Friends and enemies: a novel initialization for speaker diarization. Xavier Anguera, Chuck Wooters, Javier Hernando |
| 2006 | From pre-recorded prompts to corporate voices: on the migration of interactive voice response applications. Volker Fischer, Siegfried Kunzmann |
| 2006 | From reaction to prediction: experiments with computational models of turn-taking. David Schlangen |
| 2006 | Further developments in LSM-based boundary training for unit selection TTS. Jerome R. Bellegarda |
| 2006 | Further investigations on the relationship between objective measures of speech quality and speech recognition rates in noisy environments. Francisco José Fraga, Carlos Alberto Ynoguti, André Godoi Chiovato |
| 2006 | Fusion of phonotactic and prosodic knowledge for language identification. Chi-Yueh Lin, Hsiao-Chuan Wang |
| 2006 | GMM-based acoustic modeling for embedded speech recognition. Christophe Lévy, Georges Linarès, Jean-François Bonastre |
| 2006 | Gammatone auditory filterbank and independent component analysis for speaker identification. Yushi Zhang, Waleed H. Abdulla |
| 2006 | Generalization of the minimum classification error (MCE) training based on maximizing generalized posterior probability (GPP). Qiang Fu, Antonio Moreno-Daniel, Biing-Hwang Juang, Jian-Lai Zhou, Frank K. Soong |
| 2006 | Generating German intonation with a trainable prosodic model. Gérard Bailly, Jan Gorisch |
| 2006 | Generating complementary systems for speech recognition. Catherine Breslin, Mark J. F. Gales |
| 2006 | Generating time-constrained audio presentations of structured information. Brian Langner, Rohit Kumar, Arthur Chan, Lingyun Gu, Alan W. Black |
| 2006 | Geometrically constrained permutation-free source separation in an undercomplete speech unmixing scenario. Erik Visser |
| 2006 | Glottal closure and opening detection for flexible parametric voice coding. Pamornpol Jinachitra |
| 2006 | Grapheme-to-phoneme conversion using automatically extracted associative rules for Korean TTS system. Jinsik Lee, Seungwon Kim, Gary Geunbae Lee |
| 2006 | HMM-based MAP prediction of voiced and unvoiced formant frequencies from noisy MFCC vectors. Jonathan Darch, Ben Milner |
| 2006 | HMM-based continuous sign language recognition using a fast optical flow parameterization of visual information. Guillermo Cortés, Luz García, M. Carmen Benítez, José C. Segura |
| 2006 | HMM-based unit selection using frame sized speech segments. Zhen-Hua Ling, Ren-Hua Wang |
| 2006 | Handling convolutional noise in missing data automatic speech recognition. Maarten Van Segbroeck, Hugo Van hamme |
| 2006 | Have we met? MDP based speaker ID for robot dialogue. Filip Krsmanovic, Curtis Spencer, Daniel Jurafsky, Andrew Y. Ng |
| 2006 | High-quality speech translation in the flight domain. Chao Wang, Stephanie Seneff |
| 2006 | High-rate data embedding in unvoiced speech. Konrad Hofbauer, Gernot Kubin |
| 2006 | Highly directional multi-beam audio loudspeaker. Dirk Olszewski, Klaus Linhard |
| 2006 | Highly noise robust text-dependent speaker recognition based on hypothesized wiener filtering. V. Ramasubramanian, Deepak Vijaywargiay, Kumar V. Praveen |
| 2006 | How auditory and visual prosody is used in end-of-utterance detection. Pashiera Barkhuysen, Emiel Krahmer, Marc Swerts |
| 2006 | How to handle gender and number agreement in statistical language models? Caroline Lavecchia, Kamel Smaïli, Jean Paul Haton |
| 2006 | Hypothesis spaces for minimum Bayes risk training in large vocabulary speech recognition. Matthew Gibson, Thomas Hain |
| 2006 | Hypothesis-based feature combination of multiple speech inputs for robust speech recognition in automotive environments. Yasunari Obuchi, Nobuo Hataoka |
| 2006 | Identification of confusion and surprise in spoken dialog using prosodic features. Rohit Kumar, Carolyn P. Rosé, Diane J. Litman |
| 2006 | Identification of regional accents in French: perception and categorization. Cécile Woehrling, Philippe Boula de Mareüil |
| 2006 | Identify language origin of personal names with normalized appearance number of web pages. Jia-Li You, Yining Chen, Min Chu, Yong Zhao, Jin-Lin Wang |
| 2006 | Imperfect transcript driven speech recognition. Benjamin Lecouteux, Georges Linarès, Pascal Nocera, Jean-François Bonastre |
| 2006 | Improved hybrid microphone array post-filter by integrating a robust speech absence probability estimator for speech enhancement. Junfeng Li, Masato Akagi, Yôiti Suzuki |
| 2006 | Improved language identification using support vector machines for language modeling. Xi Yang, Lu-Feng Zhai, Man-Hung Siu, Herbert Gish |
| 2006 | Improved performance evaluation of speech event detectors. Carla Lopes, Fernando Perdigão |
| 2006 | Improved source modeling and predictive classification for channel robust speech recognition. Valentin Ion, Reinhold Haeb-Umbach |
| 2006 | Improved speech activity detection using cross-channel features for recognition of multiparty meetings. Kofi Boakye, Andreas Stolcke |
| 2006 | Improved tone modeling for Mandarin broadcast news speech recognition. Xin Lei, Man-Hung Siu, Mei-Yuh Hwang, Mari Ostendorf, Tan Lee |
| 2006 | Improved topic classification over maximum entropy model using k-norm based new objectives. Xiang Li, Ea-Ee Jan, Cheng Wu, David M. Lubensky |
| 2006 | Improved warping-invariant features for automatic speech recognition. Jan Rademacher, Matthias Wächter, Alfred Mertins |
| 2006 | Improvement speaker clustering using global similarity features. Konstantin Biatov, Joachim Köhler |
| 2006 | Improvements to bucket box intersection algorithm for fast GMM computation in embedded speech recognition systems. Min Tang, Aravind Ganapathiraju |
| 2006 | Improving Arabic HMM based speech synthesis quality. Ossama Abdel-Hamid, Sherif Mahdy Abdou, Mohsen A. Rashwan |
| 2006 | Improving body transmitted unvoiced speech with statistical voice conversion. Mikihiro Nakagiri, Tomoki Toda, Hideki Kashioka, Kiyohiro Shikano |
| 2006 | Improving glottal waveform estimation through rank-based glottal quality assessment. Elliot Moore II, Juan F. Torres |
| 2006 | Improving perplexity measures to incorporate acoustic confusability. Amit Anil Nanavati, Nitendra Rajput |
| 2006 | Improving phrase-based Korean-English statistical machine translation. Jonghoon Lee, Donghyeon Lee, Gary Geunbae Lee |
| 2006 | Improving speech recognition accuracy with multi-confidence thresholding. Shuangyu Chang |
| 2006 | Improving speech recognition of two simultaneous speech signals by integrating ICA BSS and automatic missing feature mask generation. Ryu Takeda, Shun'ichi Yamamoto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno |
| 2006 | Improving the characterization of the alternative hypothesis via kernel discriminant analysis for likelihood ratio-based speaker verification. Yi-Hsiang Chao, Wei-Ho Tsai, Hsin-Min Wang, Ruei-Chuan Chang |
| 2006 | Improving the performance of HMM-based voice conversion using context clustering decision tree and appropriate regression matrix format. Long Qin, Yi-Jian Wu, Zhen-Hua Ling, Ren-Hua Wang |
| 2006 | Improving the performance of out-of-vocabulary word rejection by using support vector machines. Shilei Huang, Xiang Xie, Jingming Kuang |
| 2006 | Improving tone recognition with combined frequency and amplitude modelling. Siwei Wang, Gina-Anne Levow |
| 2006 | Incorporating second-order information into two-step major phrase break prediction for Korean. Seungwon Kim, Jinsik Lee, Byeongchang Kim, Gary Geunbae Lee |
| 2006 | Incremental learning of MAP context-dependent edit operations for spoken phone number recognition in an embedded platform. Hahn Koo, Yan Ming Cheng |
| 2006 | Independent components for acoustic modeling. Jan Trmal, Jan Vanek, Ludek Müller, Jan Zelinka |
| 2006 | Individual on-line variance adaptation of frequency filtered parameters for robust ASR. Jesús Vicente-Peña, Fernando Díaz-de-María, W. Bastiaan Kleijn |
| 2006 | Infants² ability to extract verbs from continuous speech. Ellen Marklund, Francisco Lacerda |
| 2006 | Infinite models for speaker clustering. Fabio Valente |
| 2006 | Influence of pause length on listeners² impressions in simultaneous interpretation. Hitomi Tohyama, Shigeki Matsubara |
| 2006 | Integrating Festival and Windows. Rhys James Jones, Ambrose Choy, Briony Williams |
| 2006 | Integrating phonetic boundary discrimination explicitly into HMM systems. Yu Wang, Eric Fosler-Lussier |
| 2006 | Integrating spoken dialog and question answering: the ritel project. Sophie Rosset, Olivier Galibert, Gabriel Illouz, Aurélien Max |
| 2006 | Integration of a CELP coder in the ARDOR universal sound codec. Balázs Kövesi, Dominique Massaloux, David Virette, Julien Bensa |
| 2006 | Intelligibility of machine translation output in speech synthesis. Laura Mayfield Tomokiyo, Kay Peterson, Alan W. Black, Kevin A. Lenzo |
| 2006 | Interleaving and MMSE estimation with VQ replicas for distributed speech recognition over lossy packet networks. Angel M. Gomez, Antonio M. Peinado, Victoria E. Sánchez, José L. Carmona, Antonio J. Rubio |
| 2006 | Intonational cues to student questions in tutoring dialogs. Jennifer J. Venditti, Julia Hirschberg, Jackson Liscombe |
| 2006 | Intra-speaker variability compensation in speaker verification with limited enrolling data. Claudio Garretón, Néstor Becerra Yoma, Carlos Molina, Fernando Huenupán |
| 2006 | Investigating automatic decomposition for ASR in less represented languages. Thomas Pellegrini, Lori Lamel |
| 2006 | Investigation on Mandarin broadcast news speech recognition. Mei-Yuh Hwang, Xin Lei, Wen Wang, Takahiro Shinozaki |
| 2006 | Investigation on rescoring using minimum verification error (MVE) detectors. Qiang Fu, Biing-Hwang Juang |
| 2006 | Investigations of issues for using multiple acoustic models to improve continuous speech recognition. Rong Zhang, Alexander I. Rudnicky |
| 2006 | Is ASR accurate enough for automated reading tutors, and how can we tell? Jack Mostow |
| 2006 | Is voice quality enough? - study on how the situation and user²s awareness influence the utterance features. Shinya Yamada, Toshihiko Itoh, Kenji Araki |
| 2006 | Issues with uncertainty decoding for noise robust speech recognition. Hank Liao, Mark J. F. Gales |
| 2006 | Joint interpretation of input speech and pen gestures for multimodal human-computer interaction. Pui-Yu Hui, Helen M. Meng |
| 2006 | Joint prosodic and segmental unit selection speech synthesis. Robert A. J. Clark, Simon King |
| 2006 | LDA based feature estimation methods for LVCSR. Janne Pylkkönen |
| 2006 | LINTest: a development tool for testing dialogue systems. Lars Degerstedt, Arne Jönsson |
| 2006 | Language model adaptation for tiny adaptation corpora. Dietrich Klakow |
| 2006 | Language model adaptation with a word list and a raw corpus. Shinsuke Mori |
| 2006 | Language modeling of Chinese personal names based on character units for continuous Chinese speech recognition. Xinhui Hu, Hirofumi Yamamoto, Gen-ichiro Kikui, Yoshinori Sagisaka |
| 2006 | Language, gender, speaking style and language proficiency as factors influencing the autonomous vocalic filler production in spontaneous speech. Ioana Vasilescu, Martine Adda-Decker |
| 2006 | Latent prosodic modeling (LPM) for speech with applications in recognizing spontaneous Mandarin speech with disfluencies. Che-Kuang Lin, Lin-Shan Lee |
| 2006 | Lattice LP filtering for noise reduction in speech signals. Erhard Rank, Gernot Kubin |
| 2006 | Lattice extension and rescoring based approaches for LVCSR of Turkish. Ebru Arisoy, Murat Saraclar |
| 2006 | Learning from errors in grapheme-to-phoneme conversion. Tatyana Polyakova, Antonio Bonafonte |
| 2006 | Learning multi-goal dialogue strategies using reinforcement learning with reduced state-action spaces. Heriberto Cuayáhuitl, Steve Renals, Oliver Lemon, Hiroshi Shimodaira |
| 2006 | Lexical stress in continuous speech recognition. Rogier C. van Dalen, Pascal Wiggers, Léon J. M. Rothkrantz |
| 2006 | Limitations of MLLR adaptation with Spanish-accented English: an error analysis. Constance Clarke, Daniel Jurafsky |
| 2006 | Lingua machinae - an unorthodox proposal. Florian Schiel, Christoph Draxler, Marion Libossek |
| 2006 | Linguistic tuple segmentation in n-gram-based statistical machine translation. Adrià de Gispert, José B. Mariño |
| 2006 | Local transformation models for speech recognition. Antonio Miguel, Eduardo Lleida, Alfons Juan, Luis Buera, Alfonso Ortega, Oscar Saz |
| 2006 | Locating phone boundaries from acoustic discontinuities using a two-staged approach. Pairote Leelaphattarakij, Proadpran Punyabukkana, Atiwong Suchato |
| 2006 | Lost speech reconstruction method using speech recognition based on missing feature theory and HMM-based speech synthesis. Shingo Kuroiwa, Satoru Tsuge, Fuji Ren |
| 2006 | Low complexity LID using pruned pattern tables of LZW. S. V. Basavaraja, T. V. Sreenivas |
| 2006 | Low-complexity and efficient classification of voiced/unvoiced/silence for noisy environments. Tuan Van Pham, Gernot Kubin |
| 2006 | Low-resource autodiacritization of abjads for speech keyword search. Patrick Schone |
| 2006 | MMSE estimation of complex-valued discrete Fourier coefficients with generalized gamma priors. Jesper Jensen, Richard C. Hendriks, Jan S. Erkelens, Richard Heusdens |
| 2006 | Manifold HLDA and its application to robust speech recognition. Toshiaki Kubo, Tetsuji Ogawa, Tetsunori Kobayashi |
| 2006 | Map-based adaptation for speech conversion using adaptation data selection and non-parallel training. Chung-Han Lee, Chung-Hsien Wu |
| 2006 | Mapping neural networks for bandwidth extension of narrowband speech. A. Shahina, B. Yegnanarayana |
| 2006 | Max-Gabor analysis and synthesis of spectrograms. Tony Ezzat, Jake V. Bouvrie, Tomaso A. Poggio |
| 2006 | Maximum entropy modeling for diacritization of Arabic text. Ruhi Sarikaya, Ossama Emam, Imed Zitouni, Yuqing Gao |
| 2006 | Maximum likelihood voice conversion based on GMM with STRAIGHT mixed excitation. Yamato Ohtani, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano |
| 2006 | Measuring and comparing vowel qualities in a Dutch spontaneous speech corpus. Irene Jacobi, Louis C. W. Pols, Jan Stroop |
| 2006 | Measuring the acceptable word error rate of machine-generated webcast transcripts. Cosmin Munteanu, Gerald Penn, Ronald Baecker, Elaine G. Toms, David James |
| 2006 | Memo: towards automatic usability evaluation of spoken dialogue services by user error simulations. Sebastian Möller, Roman Englert, Klaus-Peter Engelbrecht, Verena Vanessa Hafner, Anthony Jameson, Antti Oulasvirta, Alexander Raake, Norbert Reithinger |
| 2006 | Minimum boundary error training for automatic phonetic segmentation. Jen-Wei Kuo, Hsin-Min Wang |
| 2006 | Minimum classification error training of hidden Markov models for acoustic language identification. Josef G. Bauer, Ekaterina Timoshenko |
| 2006 | Minimum divergence based discriminative training. Jun Du, Peng Liu, Frank K. Soong, Jian-Lai Zhou, Ren-Hua Wang |
| 2006 | Minimum generation error criterion for tree-based clustering of context dependent HMMs. Yi-Jian Wu, Wu Guo, Ren-Hua Wang |
| 2006 | Missing data mask models with global frequency and temporal constraints. Sébastien Demange, Christophe Cerisara, Jean Paul Haton |
| 2006 | Missing feature theory with soft spectral subtraction for speaker verification. Michael T. Padilla, Thomas F. Quatieri, Douglas A. Reynolds |
| 2006 | Missing-feature reconstruction for band-limited speech recognition in spoken document retrieval. Wooil Kim, John H. L. Hansen |
| 2006 | Modeling of speech signals based on Bessel-like orthogonal transform. Giorgio Biagetti, Paolo Crippa, Claudio Turchetti |
| 2006 | Modeling sensory-to-motor mappings using neural nets and a 3d articulatory speech synthesizer. Bernd J. Kröger, Peter Birkholz, Jim Kannampuzha, Christiane Neuschaefer-Rube |
| 2006 | Modeling the acoustic correlates of expressive elements in text genres for expressive text-to-speech synthesis. Hongwu Yang, Helen M. Meng, Lianhong Cai |
| 2006 | Modeling the precedence effect for binaural sound source localization in noisy and echoic environments. Martin Heckmann, Tobias Rodemann, Björn Schölling, Frank Joublin, Christian Goerick |
| 2006 | Modelling aspiration noise during phonation using the LF voice source model. Christer Gobl |
| 2006 | Modified phase opponency based solution to the speech separation challenge. Om Deshmukh, Carol Y. Espy-Wilson |
| 2006 | Monitoring of the natural voice variations in open and closed phases with frequency warped ARMA modeling. Pedro J. Quintana-Morales, Juan L. Navarro-Mesa, Antonio G. Ravelo-García, Fernando D. Lorenzo-García |
| 2006 | Moving speech recognition from software to silicon: the in silico vox project. Edward C. Lin, Kai Yu, Rob A. Rutenbar, Tsuhan Chen |
| 2006 | Multi-accent Chinese speech recognition. Yi Liu, Pascale Fung |
| 2006 | Multi-domain text-to-speech synthesis by automatic text classification. Francesc Alías, Joan Claudi Socoró, Xavier Sevillano, Ignasi Iriondo Sanz, Xavier Gonzalvo |
| 2006 | Multi-flow block interleaving applied to distributed speech recognition over IP networks. Angel M. Gomez, Juan J. Ramos-Muñoz, Antonio M. Peinado, Victoria E. Sánchez |
| 2006 | Multi-layered summarization of spoken document archives by information extraction and semantic structuring. Lin-Shan Lee, Sheng-yi Kong, Yi-Cheng Pan, Yi-Sheng Fu, Yu-tsun Huang |
| 2006 | Multi-microphone periodicity function for robust F0 estimation in real noisy and reverberant environments. Federico Flego, Maurizio Omologo |
| 2006 | Multi-modal system ICANDO: intellectual computer assistant for disabled operators. Alexey Karpov, Andrey Ronzhin, Alexandre Cadiou |
| 2006 | Multi-source far-distance microphone selection and combination for automatic transcription of lectures. Matthias Wölfel, Christian Fügen, Shajith Ikbal, John W. McDonough |
| 2006 | Multi-stream ASR: an oracle perspective. Hemant Misra, Jithendra Vepa, Hervé Bourlard |
| 2006 | Multi-stream speaker diarization systems for the meetings domain. Ascensión Gallardo-Antolín, Xavier Anguera, Chuck Wooters |
| 2006 | Multilingual non-native speech recognition using phonetic confusion-based acoustic model modification and graphemic constraints. Ghazi Bouselmi, Dominique Fohr, Irina Illina, Jean Paul Haton |
| 2006 | Multimodal authentication using qualitative support vector machines. Fawaz Alsaade, Aladdin M. Ariyaeeinia, L. Meng, Amit S. Malegaonkar |
| 2006 | Multistage convolutive blind source separation for speech mixture. Yanxue Liang, Ichiro Hagiwara |
| 2006 | Multivariate analysis of frame-based acoustic cues of dysperiodicities in connected speech. Abdellah Kacha, Francis Grenez, Jean Schoentgen |
| 2006 | Nasality perception of vowels in different language background. Shahina Haque, Tomio Takara |
| 2006 | Native and nonnative audio-visual perception of English fricatives in quiet and cafe-noise backgrounds. Yue Wang, Dawn M. Behne, Haisheng Jiang, Chad Danyluck |
| 2006 | New 20-word lists for word intelligibility test in Japanese. Shuichi Sakamoto, Tadahiro Yoshikawa, Shigeaki Amano, Yôiti Suzuki, Tadahisa Kondo |
| 2006 | New considerations for vowel nasalization based on separate mouth-nose recording. Gang Feng, Cyril Kotenkoff |
| 2006 | New improvements in decoding speed and latency for automatic captioning. Jian Xue, Rusheng Hu, Yunxin Zhao |
| 2006 | New measures to chart toddlers² speech perception and language development: a test of the lexical restructuring hypothesis. Iris-Corinna Schwarz, Denis Burnham |
| 2006 | Ninth International Conference on Spoken Language Processing, INTERSPEECH-ICSLP 2006, Pittsburgh, PA, USA, September 17-21, 2006 |
| 2006 | Noise robust model-based voice activity detection. Ángel de la Torre, Javier Ramírez, M. Carmen Benítez, José C. Segura, Luz García, Antonio J. Rubio |
| 2006 | Noise update modeling for speech enhancement: when do we do enough? Nitish Krishnamurthy, John H. L. Hansen |
| 2006 | Noise-robust speech recognition of conversational telephone speech. Gang Chen, Hesham Tolba, Douglas D. O'Shaughnessy |
| 2006 | Noisy speech recognition based on selection of multiple noise suppression methods using noise GMMs. Norihide Kitaoka, Souta Hamaguchi, Seiichi Nakagawa |
| 2006 | Non-intrusive speech quality assessment with low computational complexity. Volodya Grancharov, David Yuheng Zhao, Jonas Lindblom, W. Bastiaan Kleijn |
| 2006 | Nonlinear dynamical invariants for speech recognition. S. Prasad, Sundararajan Srinivasan, M. Pannuri, Georgios Y. Lazarou, Joseph Picone |
| 2006 | Normalization of the inter-frame information using smoothing filtering. Luz García, José C. Segura, M. Carmen Benítez, Javier Ramírez, Ángel de la Torre |
| 2006 | Novel entropy based moving average refiners for HMM landmarks. Rahul Chitturi, Mark Hasegawa-Johnson |
| 2006 | Novel method for data clustering and mode selection with application in voice conversion. Jani Nurminen, Jilei Tian, Victor Popa |
| 2006 | Novel time domain multi-class SVMs for landmark detection. Rahul Chitturi, Mark Hasegawa-Johnson |
| 2006 | Objective estimation of suicidal risk using vocal output characteristics. T. Yingthawornsuk, H. Kaymaz Keskinpala, Daniel J. France, D. Mitchell Wilkes, Richard G. Shiavi, Ronald M. Salomon |
| 2006 | Observations of the spoken language acquisition process based on a multimodal infant behavior corpus. Ryo Tsuji, Tomohiko Kasami, Shogo Ishikawa, Shinya Kiriyama, Yoichi Takebayashi, Shigeyoshi Kitazawa |
| 2006 | On a greedy learning algorithm for dPLRM with applications to phonetic feature detection. Tor André Myrvoll, Tomoko Matsui |
| 2006 | On designing context sensitive language models for spoken dialog systems. Vaibhava Goel, Ramesh A. Gopinath |
| 2006 | On speaker-specific prosodic models for automatic dialog act segmentation of multi-party meetings. Jáchym Kolár, Elizabeth Shriberg, Yang Liu |
| 2006 | On speech variation and word type differentiation by articulatory feature representations. Louis ten Bosch, R. Harald Baayen, Mirjam Ernestus |
| 2006 | On the correlation between energy and pitch accent in read English speech. Andrew Rosenberg, Julia Hirschberg |
| 2006 | On the fusion of prosody, voice spectrum and face features for multimodal person verification. M. Farrs, Ainara Garde, Pascual Ejarque, Jordi Luque, Javier Hernando |
| 2006 | On the relation between maximum spectral transition positions and phone boundaries. Sorin Dusan, Lawrence R. Rabiner |
| 2006 | On the sufficiency and redundancy of pitch for TRP projection. Wieneke Wesseling, Rob van Son, Louis C. W. Pols |
| 2006 | On the sufficiency of automatic phonetic transcriptions for pronunciation variation research. Christophe Van Bael, Hans van Halteren |
| 2006 | On the use of Jacobian adaptation in real speaker verification applications. Jan Anguita, Javier Hernando |
| 2006 | On the use of morphological analysis for dialectal Arabic speech recognition. Mohamed Afify, Ruhi Sarikaya, Hong-Kwang Jeff Kuo, Laurent Besacier, Yuqing Gao |
| 2006 | Online speaker change detection by combining BIC with microphone array beamforming. Joerg Schmalenstroeer, Reinhold Haeb-Umbach |
| 2006 | Online speech detection and dual-gender speech recognition for captioning broadcast news. Toru Imai, Shoei Sato, Akio Kobayashi, Kazuo Onoe, Shinichi Homma |
| 2006 | Open-vocabulary spoken document retrieval based on new subword models and subword phonetic similarity. Kohei Iwata, Yoshiaki Itoh, Kazunori Kojima, Masaaki Ishigame, Kazuyo Tanaka, Shi-wook Lee |
| 2006 | Opinion mining in a telephone survey corpus. Nathalie Camelin, Géraldine Damnati, Frédéric Béchet, Renato De Mori |
| 2006 | Optimization of class weights for LDA feature transformations. Andrej Ljolje |
| 2006 | Optimizing components for handheld two-way speech translation for an English-iraqi Arabic system. Roger Hsiao, Ashish Venugopal, Thilo Köhler, Ying Zhang, Paisarn Charoenpornsawat, Andreas Zollmann, Stephan Vogel, Alan W. Black, Tanja Schultz, Alex Waibel |
| 2006 | Pauses as a tool to ensure rhythmic wellformedness. Augustin Speyer |
| 2006 | Perception of fundamental frequency in cochlear implant patients. Ángel de la Torre, Cristina Roldán, Manuel Sainz |
| 2006 | Perceptive and acoustic measurement of average speaking pitch of female and male speakers in German radio news. Sven Grawunder, Ines Bose, Birgit Hertha, Franziska Trauselt, Lutz Christian Anders |
| 2006 | Perceptual identification and phonetic analysis of 6 foreign accents in French. Bianca Vieru-Dimulescu, Philippe Boula de Mareüil |
| 2006 | Performance analysis of various single channel speech enhancement algorithms for automatic speech recognition. Myung-Suk Song, Chang-Heon Lee, Hong-Goo Kang |
| 2006 | Performance evaluation of three features for model-based single channel speech separation problem. Mohammad H. Radfar, Richard M. Dansereau, Abolghasem Sayadiyan |
| 2006 | Performance improvement of dialog speech translation by rejecting unreliable utterances. Toshiyuki Takezawa, Tohru Shimizu |
| 2006 | Perplexity based linguistic model adaptation for speech summarisation. Pierre Chatain, Edward W. D. Whittaker, Joanna Mrozinski, Sadaoki Furui |
| 2006 | Personality factors in human deception detection: comparing human to machine performance. Frank Enos, Stefan Benus, Robin L. Cautin, Martin Graciarena, Julia Hirschberg, Elizabeth Shriberg |
| 2006 | Phone recognition analysis for trajectory HMM. Le Zhang, Steve Renals |
| 2006 | Phone vector DHMM to decode a phone recognizer's output. Bong-Wan Kim, Dae-Lim Choi, Yongnam Um, Yong-Ju Lee |
| 2006 | Phoneme recognition based on fisher weight map to higher-order local auto-correlation. Yasuo Ariki, Shunsuke Kato, Tetsuya Takiguchi |
| 2006 | Phoneme-to-grapheme mapping for spoken inquiries to the semantic web. Axel Horndasch, Elmar Nöth, Anton Batliner, Volker Warnke |
| 2006 | Phonetic research on accented Chinese in three dialectal regions: Shanghai, Wuhan and Xiamen. Aijun Li, Qiang Fang, Ziyu Xiong |
| 2006 | Phonetically enriched labeling in unit selection TTS synthesis. Yeon-Jun Kim, Ann K. Syrdal, Alistair Conkie, Marc C. Beutnagel |
| 2006 | Phrase break prediction using logistic generalized linear model. Lifu Yi, Jian Li, Xiaoyan Lou, Jie Hao |
| 2006 | Physiologically-motivated synchrony-based processing for robust automatic speech recognition. Chanwoo Kim, Yu-Hsiang Bosco Chiu, Richard M. Stern |
| 2006 | Pitch determination using aligned AMDF. M. Shahidur Rahman, Hirobumi Tanaka, Tetsuya Shimamura |
| 2006 | Pitch range and pause duration as markers of discourse hierarchy: perception experiments. Jörg Mayer, Ekaterina Jasinskaja, Ulrike Kölsch |
| 2006 | Pitch resynchronization while recovering from a late frame in a predictive speech decoder. Kyle D. Anderson, Philippe Gournay |
| 2006 | Pitch-scale modification using the modulated aspiration noise source. Daryush D. Mehta, Thomas F. Quatieri |
| 2006 | Posterior based keyword spotting with a priori thresholds. Hamed Ketabdar, Jithendra Vepa, Samy Bengio, Hervé Bourlard |
| 2006 | Potential relevance of audio-visual integration in mammals for computational modeling. Eeva Klintfors, Francisco Lacerda |
| 2006 | Powered cepstral normalization (p-CN) for robust features in speech recognition. Chang-Wen Hsu, Lin-Shan Lee |
| 2006 | Productions in bilinguism, early foreign language learning and monolinguism: a prosodic comparison. Ranka Bijeljac-Babic, Christelle Dodane, Sabine Metta, Claire Gerard |
| 2006 | Prominent words as anchors for TRP projection. Rob van Son, Wieneke Wesseling, Louis C. W. Pols |
| 2006 | Prompt selection with reinforcement learning in an AT&t call routing application. Charles Lewis, Giuseppe Di Fabbrizio |
| 2006 | Pronunciation dependent language models. Andrej Ljolje |
| 2006 | Pronunciation variant-based multi-path HMMs for syllables. Annika Hämäläinen, Louis ten Bosch, Lou Boves |
| 2006 | Pronunciation variation modeling for Mandarin with accent. Chi Zhang, Ji Wu, Xi Xiao, Zuoying Wang |
| 2006 | Pronunciation verification of children²s speech for automatic literacy assessment. Joseph Tepperman, Jorge F. Silva, Abe Kazemzadeh, Hong You, Sungbok Lee, Abeer Alwan, Shrikanth S. Narayanan |
| 2006 | Prosodic boundaries in Czech: an experiment based on delexicalized speech. Tomás Dubeda |
| 2006 | Prosodic feature generation for back-channel prediction. Thamar Solorio, Olac Fuentes, Nigel G. Ward, Yaffa Al Bayyari |
| 2006 | Prosodic features for a maximum entropy language model. Oscar Chan, Roberto Togneri |
| 2006 | Prosodic features for speaker verification. Leena Mary, B. Yegnanarayana |
| 2006 | Prosodic modeling in large vocabulary Mandarin speech recognition. Jui-Ting Huang, Lin-Shan Lee |
| 2006 | Prosody of interrogative and affirmative sentences in vietnamese language: analysis and perceptive results. Minh-Quang Vu, Do Dat Tran, Eric Castelli |
| 2006 | Prototyping a call system for students of Japanese using dynamic diagram generation and interactive hints. Christopher J. Waple, Yasushi Tsubota, Masatake Dantsuji, Tatsuya Kawahara |
| 2006 | QASR: question answering using semantic roles for speech interface. Svetlana Stenchikova, Dilek Hakkani-Tür, Gökhan Tür |
| 2006 | Quality improvement of telephone speech by artificial bandwidth expansion - listening tests in three languages. Hannu Pulakka, Laura Laaksonen, Paavo Alku |
| 2006 | Question answering with discriminative learning algorithms. Junlan Feng |
| 2006 | Quick individual fitting methods of simplified hearing compensation for elderly people. Kengo Fujita, Tsuneo Kato, Hisashi Kawai |
| 2006 | Radiobot-CFF: a spoken dialogue system for military training. Antonio Roque, Anton Leuski, Vivek Kumar Rangarajan Sridhar, Susan Robinson, Ashish Vaswani, Shrikanth S. Narayanan, David R. Traum |
| 2006 | Rapid simulation-driven reinforcement learning of multimodal dialog strategies in human-robot interaction. Thomas Prommer, Hartwig Holzapfel, Alex Waibel |
| 2006 | Rapid speaker adaptation using regression-tree based spectral peak alignment. Shizhen Wang, Xiaodong Cui, Abeer Alwan |
| 2006 | Real vs. acted emotional speech. Janneke Wilting, Emiel Krahmer, Marc Swerts |
| 2006 | Real-life emotions detection with lexical and paralinguistic cues on human-human call center dialogs. Laurence Devillers, Laurence Vidrascu |
| 2006 | Real-time synthesis of Chinese visual speech and facial expressions using MPEG-4 FAP features in a three-dimensional avatar. Zhiyong Wu, Shen Zhang, Lianhong Cai, Helen M. Meng |
| 2006 | Realizations and representations of Thai tones in monomoraic syllables. Rattima Nitisaroj |
| 2006 | Recent advances in phonotactic language recognition using binary-decision trees. Jirí Navrátil |
| 2006 | Recent advances in speech fragment decoding techniques. Jon Barker, André Coy, Ning Ma, Martin Cooke |
| 2006 | Recent advances of IBM's handheld speech translation system. Weizhong Zhu, Bowen Zhou, Charles Prosser, Pavel Krbec, Yuqing Gao |
| 2006 | Recent progress on the discriminative region-dependent transform for speech feature extraction. Bing Zhang, Spyros Matsoukas, Richard M. Schwartz |
| 2006 | Recognition of classroom lectures in european portuguese. Isabel Trancoso, Ricardo Nunes, Luís Neves, Céu Viana, Helena Moniz, Diamantino Caseiro, Ana Isabel Mata |
| 2006 | Recognition of interest in human conversational speech. Björn W. Schuller, Niels Köhler, Ronald Müller, Gerhard Rigoll |
| 2006 | Reconstructing tongue movements from audio and video. Hedvig Kjellström, Olov Engwall, Olle Bälter |
| 2006 | Reducing computation on parallel decoding using frame-wise confidence scores. Tomohiro Hakamata, Akinobu Lee, Yoshihiko Nankaku, Keiichi Tokuda |
| 2006 | Reducing speech coding distortion for speaker identification. Alan McCree |
| 2006 | Redundancy and productivity in the speech technology lexicon - can we do better? Susan Fitt, Korin Richmond |
| 2006 | Respiratory/laryngeal interactions during sustained vowel production in children. Donald S. Finan, Carol A. Boliek |
| 2006 | Robust acoustic-based syllable detection. Zhimin Xie, Partha Niyogi |
| 2006 | Robust automatic speech recognition for accented Mandarin in car environments. Pei Ding, Lei He, Xiang Yan, Jie Hao |
| 2006 | Robust feature extraction based on spectral peaks of group delay and autocorrelation function and phase domain analysis. Gholamreza Farahani, Seyed Mohammad Ahadi, Mohammad Mehdi Homayounpour |
| 2006 | Robust feature space adaptation for telephony speech recognition. Xin Lei, Jon Hamaker, Xiaodong He |
| 2006 | Robust interpretation in dialogue by combining confidence scores with contextual features. Matthew Purver, Florin Ratiu, Lawrence Cavedon |
| 2006 | Robust phone lattice decoding. Kris Demuynck, Dirk Van Compernolle, Hugo Van hamme |
| 2006 | Robust speaker diarization for meetings: ICSI RT06s evaluation system. Xavier Anguera, Chuck Wooters, José M. Pardo |
| 2006 | Robust speech recognition by modifying clean and telephone feature vectors using bidirectional neural network. Mansoor Vali, Seyyed Ali Seyyed Salehi, Kazem Karimi |
| 2006 | Robust speech recognition over mobile networks using combined weighted viterbi decoding and subvector based error concealment. Zheng-Hua Tan, Paul Dalsgaard, Børge Lindberg |
| 2006 | Role of phase estimation in speech enhancement. Benjamin J. Shannon, Kuldip K. Paliwal |
| 2006 | SPAM and full covariance for speech recognition. Daniel Povey |
| 2006 | Saliency parsing for automated directory assistance. Issac Alphonso, Shuangyu Chang |
| 2006 | Scalable and portable web-based multimodal dialogue interaction with geographical databases. Alexander Gruenstein, Stephanie Seneff, Chao Wang |
| 2006 | Segment connection networks for corpus-based speech synthesis. Geert Coorman |
| 2006 | Segmental duration modeling in Turkish. Özlem Öztürk, Tolga Çiloglu |
| 2006 | Selective-LPC based representation of STRAIGHT spectrum and its applications in spectral smoothing. Heng Kang, Wenju Liu |
| 2006 | Semi-automatic extraction of vocal tract movements from cineradiographic data. Julie Fontecave, Frédéric Berthommier |
| 2006 | Sentence boundary detection of spontaneous Japanese using statistical language model and support vector machines. Yuya Akita, Masahiro Saikou, Hiroaki Nanjo, Tatsuya Kawahara |
| 2006 | Sentence boundary detection using sequential dependency analysis combined with CRF-based chunking. Takanobu Oba, Takaaki Hori, Atsushi Nakamura |
| 2006 | Sequence classification for machine translation. Srinivas Bangalore, Patrick Haffner, Stephan Kanthak |
| 2006 | Signal modification incorporating perceptual weighting filter. Joon-Hyuk Chang, Woohyung Lim, Nam Soo Kim |
| 2006 | Significance of formants from difference spectrum for speaker identification. Kishore Prahallad, Varanasi Sudhakar, Veluru Ranganatham, Krishna M. Bharat, S. Roy Debashish |
| 2006 | Silence energy normalization for robust speech recognition in additive noise environment. Chung-fu Tai, Jeih-weih Hung |
| 2006 | Single channel speech enhancement by frequency domain constrained optimization and temporal masking. Wen Jin, Michael S. Scordilis |
| 2006 | Single frame selection for phoneme classification. Tingyao Wu, Dirk Van Compernolle, Jacques Duchateau, Hugo Van hamme |
| 2006 | Single-channel speech separation using sparse non-negative matrix factorization. Mikkel N. Schmidt, Rasmus Kongsgaard Olsson |
| 2006 | Six approaches to limited domain concatenative speech synthesis. Robert J. Utama, Ann K. Syrdal, Alistair Conkie |
| 2006 | Sloparl - slovenian parliamentary speech and text corpus for large vocabulary continuous speech recognition. Andrej Zgank, Tomaz Rotovnik, Matej Grasic, Marko Kos, Damjan Vlaj, Zdravko Kacic |
| 2006 | Soft decision combining for dual channel noise reduction. Timo Gerkmann, Rainer Martin |
| 2006 | Soft margin estimation of hidden Markov model parameters. Jinyu Li, Ming Yuan, Chin-Hui Lee |
| 2006 | Software architectures for incremental understanding of human speech. Gregory Aist, James F. Allen, Ellen Campana, Lucian Galescu, Carlos Gómez Gallo, Scott C. Stoness, Mary D. Swift, Michael K. Tanenhaus |
| 2006 | Solving large margin estimation of HMMS via semidefinite programming. Xinwei Li, Hui Jiang |
| 2006 | Soundbite detection in broadcast news domain. Sameer Maskey, Julia Hirschberg |
| 2006 | Sparseness and speech perception in noise. Guoping Li, Mark E. Lutman |
| 2006 | Speaker adaptation of trajectory HMMs using feature-space MLLR. Heiga Zen, Yoshihiko Nankaku, Keiichi Tokuda, Tadashi Kitamura |
| 2006 | Speaker adaptation using evolutionary-based linear transform. Sid-Ahmed Selouani, Douglas D. O'Shaughnessy |
| 2006 | Speaker cluster based GMM tokenization for speaker recognition. Bin Ma, Donglai Zhu, Rong Tong, Haizhou Li |
| 2006 | Speaker clustered regression-class trees for MLLR adaptation. Arindam Mandal, Mari Ostendorf, Andreas Stolcke |
| 2006 | Speaker diarization for multiple distant microphone meetings: mixing acoustic features and inter-channel time differences. José M. Pardo, Xavier Anguera, Chuck Wooters |
| 2006 | Speaker identification under noisy environments by using harmonic structure extraction and reliable frame weighting. Hiromasa Fujihara, Tetsuro Kitahara, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno |
| 2006 | Speaker independent voiced-unvoiced detection evaluated in different speaking styles. Martin Heckmann, Marco Moebus, Frank Joublin, Christian Goerick |
| 2006 | Speaker localization based on oriented global coherence field. Alessio Brutti, Maurizio Omologo, Piergiorgio Svaizer |
| 2006 | Speaker verification with non-audible murmur segments. Mariko Kojima, Tomoko Matsui, Hiromichi Kawanami, Hiroshi Saruwatari, Kiyohiro Shikano |
| 2006 | Speaking aid system for total laryngectomees using voice conversion of body transmitted artificial speech. Keigo Nakamura, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano |
| 2006 | Speaking faces for face-voice speaker identity verification. Girija Chetty, Michael Wagner |
| 2006 | Specificity and generalizability of spontaneous phonetic imitation. Kuniko Y. Nielsen |
| 2006 | Speech analyzer using a joint estimation model of spectral envelope and fine structure. Hirokazu Kameoka, Jonathan Le Roux, Nobutaka Ono, Shigeki Sagayama |
| 2006 | Speech and speech recognition during dictation corrections. Keith Vertanen |
| 2006 | Speech enhancement based on residual noise shaping. Jong Won Shin, Seung Yeol Lee, Hwan Sik Yun, Nam Soo Kim |
| 2006 | Speech enhancement based on spectral estimation from higher-lag autocorrelation. Benjamin J. Shannon, Kuldip K. Paliwal, Climent Nadeu |
| 2006 | Speech enhancement using modified phase opponency model. Om Deshmukh, Carol Y. Espy-Wilson |
| 2006 | Speech recognition of foreign out-of-vocabulary words using a hierarchical language model. Hirofumi Yamamoto, Gen-ichiro Kikui, Satoshi Nakamura, Yoshinori Sagisaka |
| 2006 | Speech recognition using factorial hidden Markov models for separation in the feature space. Tuomas Virtanen |
| 2006 | Speech recognition with phonological features: some issues to attend. Frederik Stouten, Jean-Pierre Martens |
| 2006 | Speech technology for minority languages: the case of Irish (gaelic). Ailbhe Ní Chasaide, John Wogan, Brian Ó Raghallaigh, Áine Ní Bhriain, Eric Zoerner, Harald Berthelsen, Christer Gobl |
| 2006 | Speech/non-speech discrimination combining advanced feature extraction and SVM learning. Javier Ramírez, Pablo Yélamos, J. M. Górriz, José C. Segura, Luz García |
| 2006 | Spoken language technologies applied to digital talking books. Isabel Trancoso, Carlos Duarte, António Joaquim Serralheiro, Diamantino Caseiro, Luís Carriço, Céu Viana |
| 2006 | Spontaneous Thai speech recognition. Monika Woszczyna, Paisarn Charoenpornsawat, Tanja Schultz |
| 2006 | State-level variable modeling for phoneme classification. Hao-Zheng Li, Douglas D. O'Shaughnessy |
| 2006 | Statistical analysis and performance of DFT domain noise reduction filters for robust speech recognition. Colin Breithaupt, Rainer Martin |
| 2006 | Steady-state suppression in reverberation: a comparison of native and nonnative speech perception. Nao Hodoshima, Dawn M. Behne, Takayuki Arai |
| 2006 | Stochastic vector mapping-based feature enhancement using prior model and environment adaptation for noisy speech recognition. Chia-Hsin Hsieh, Chung-Hsien Wu, Jun-Yu Lin |
| 2006 | Study of time and frequency variability in pathological speech and error reduction methods for automatic speech recognition. Oscar Saz, Antonio Miguel, Eduardo Lleida, Alfonso Ortega, Luis Buera |
| 2006 | Study on speaker verification on emotional speech. Wei Wu, Thomas Fang Zheng, Ming-Xing Xu, Huanjun Bao |
| 2006 | Sub-word unit based non-audible speech recognition using surface electromyography. Matthias Walliczek, Florian Kraft, Szu-Chen Stan Jou, Tanja Schultz, Alex Waibel |
| 2006 | Subspace modeling and selection for noisy speech recognition. Jen-Tzung Chien, Chuan-Wei Ting |
| 2006 | Substitute sounds for ventriloquism and speech disorders. Jörg Metzner, Marcel Schmittfull, Karl Schnell |
| 2006 | Summarization evaluation for text and speech: issues and approaches. Ani Nenkova |
| 2006 | Summarization of spontaneous conversations. Xiaodan Zhu, Gerald Penn |
| 2006 | Super-human multi-talker speech recognition: the IBM 2006 speech separation challenge system. Trausti T. Kristjansson, John R. Hershey, Peder A. Olsen, Steven J. Rennie, Ramesh A. Gopinath |
| 2006 | Syllable-length path mixture hidden Markov models with trajectory clustering for continuous speech recognition. Yan Han, Lou Boves |
| 2006 | Synthesizing breathiness in natural speech with sinusoidal modelling. Brett Matthews, Raimo Bakis, Ellen Eide |
| 2006 | System- versus user-initiative dialog strategy for driver information systems. Chantal Ackermann, Marion Libossek |
| 2006 | TDA: a new trainable trajectory formation system for facial animation. Oxana Govokhina, Gérard Bailly, Gaspard Breton, Paul C. Bagshaw |
| 2006 | Testing the effect of audiovisual cues to prominence via a reaction-time experiment. Emiel Krahmer, Marc Swerts |
| 2006 | Text-independent cross-language voice conversion. David Sündermann, Harald Höge, Antonio Bonafonte, Hermann Ney, Julia Hirschberg |
| 2006 | Text-independent speaker identification in birds. E. J. S. Fox, J. D. Roberts, Mohammed Bennamoun |
| 2006 | The 2006 RWTH parliamentary speeches transcription system. Jonas Lööf, Maximilian Bisani, Christian Gollan, Georg Heigold, Björn Hoffmeister, Christian Plahl, Ralf Schlüter, Hermann Ney |
| 2006 | The IBM 2006 speech transcription system for european parliamentary speeches. Bhuvana Ramabhadran, Olivier Siohan, Lidia Mangu, Geoffrey Zweig, Martin Westphal, Henrik Schulz, Alvaro Soneiro |
| 2006 | The ICSI+ multilingual sentence segmentation system. M. Zimmerman, Dilek Hakkani-Tür, James G. Fung, Nikki Mirghafori, Luke R. Gottlieb, Elizabeth Shriberg, Yang Liu |
| 2006 | The importance of different facial areas for signalling visual prominence. Marc Swerts, Emiel Krahmer |
| 2006 | The role of positional probability in the segmentation of Cantonese speech. Michael C. W. Yip |
| 2006 | The role of prosody in the perception of US native English accents. Ayako Ikeno, John H. L. Hansen |
| 2006 | The segmentation of multi-channel meeting recordings for automatic speech recognition. John Dines, Jithendra Vepa, Thomas Hain |
| 2006 | The target cost formulation in unit selection speech synthesis. Paul Taylor |
| 2006 | The use of Bayesian network for incorporating accent, gender and wide-context dependency information. Sakriani Sakti, Konstantin Markov, Satoshi Nakamura |
| 2006 | The vocal joystick data collection effort and vowel corpus. Kelley Kilanski, Jonathan Malkin, Xiao Li, Richard Wright, Jeff A. Bilmes |
| 2006 | Thesaurus expansion using similar word pairs from patent documents. Yoshimi Suzuki, Fumiyo Fukumoto |
| 2006 | Time-dependent cross-probability model for multi-environment model based LInear normalization. Luis Buera, Eduardo Lleida, Juan Arturo Nolazco-Flores, Antonio Miguel, Alfonso Ortega |
| 2006 | Timing levels in segment-based speech emotion recognition. Björn W. Schuller, Gerhard Rigoll |
| 2006 | Tone recognition of continuous speech of standard Chinese using neural network and tone nucleus model. Keikichi Hirose, Hui Hu, Xiaodong Wang, Nobuaki Minematsu |
| 2006 | Topic-based language modeling with dynamic Bayesian networks. Pascal Wiggers, Léon J. M. Rothkrantz |
| 2006 | Totally data-driven duration modeling based on generalized linear model for Mandarin TTS. Lifu Yi, Jian Li, Xiaoyan Lou, Jie Hao |
| 2006 | Totally data-driven intonation prediction model using a novel F0 contour parametric representation. Lifu Yi, Jian Li, Xiaoyan Lou, Jie Hao |
| 2006 | Towards a comprehensive investigation of factors relevant to peak alignment using a unit selection corpus. Matthias Jilka, Bernd Möbius |
| 2006 | Towards a multimodal topic tracking system for a mobile robot. Jan Frederik Maas, Britta Wrede, Gerhard Sagerer |
| 2006 | Towards an integrated understanding of speaking rate in conversation. Jiahong Yuan, Mark Y. Liberman, Christopher Cieri |
| 2006 | Towards automatic parameter extraction of command-response model for Cantonese. Raymond W. M. Ng, Tan Lee, Wentao Gu |
| 2006 | Towards continuous speech recognition using surface electromyography. Szu-Chen Stan Jou, Tanja Schultz, Matthias Walliczek, Florian Kraft, Alex Waibel |
| 2006 | Tracking and beamforming for multiple simultaneous speakers with probabilistic data association filters. Tobias Gehrig, Ulrich Klee, John W. McDonough, Shajith Ikbal, Matthias Wölfel, Christian Fügen |
| 2006 | Tracking of involuntary formant frequency variations and application to parkinsonian speech. Laurence Cnockaert, Jean Schoentgen, Pascal Auzou, Canan Ozsancak, Francis Grenez |
| 2006 | Tracking of visible vocal tract resonances (VVTR) based on kalman filtering. I. Yücel Özbek, Mübeccel Demirekler |
| 2006 | Training native English speakers to identify Japanese vowel length with fast rate sentences. Yukari Hirata, Elizabeth Whitehurst, Emily Cullings, Jacob Whiton, Carol Glenn |
| 2006 | Training of coarticulation models using dominance functions and visual unit selection methods for audio-visual speech synthesis. Zdenek Krnoul, Milos Zelezný, Ludek Müller, Jakub Kanis |
| 2006 | Two stage transform vector quantization of LSFs for wideband speech coding. Saikat Chatterjee, T. V. Sreenivas |
| 2006 | Two-microphone voice activity detection in the presence of coherent interference. Gibak Kim, Nam Ik Cho |
| 2006 | Two-stage vocabulary-free spoken document retrieval - subword identification and re-recognition of the identified sections. Yoshiaki Itoh, Takayuki Otake, Kohei Iwata, Kazunori Kojima, Masaaki Ishigame, Kazuyo Tanaka, Shi-wook Lee |
| 2006 | Two-step unsupervised speaker adaptation based on speaker and gender recognition and HMM combination. Petr Cerva, Jan Nouza, Jan Silovský |
| 2006 | Underlying quality dimensions of modern telephone connections. Marcel Wältermann, Kirstin Scholz, Alexander Raake, Ulrich Heute, Sebastian Möller |
| 2006 | Unfilled pauses in Japanese sentences read aloud by non-native learners. Hiroko Hirano, Goh Kawai, Keikichi Hirose, Nobuaki Minematsu |
| 2006 | Unifying unit selection and hidden Markov model speech synthesis. Paul Taylor |
| 2006 | Unit selection and its relation to symbolic prosody: a new approach. Daniel Tihelka, Jindrich Matousek |
| 2006 | Unsupervised Spanish dialect classification. Rongqing Huang, John H. L. Hansen |
| 2006 | Unsupervised adaptation for acoustic language identification. Ekaterina Timoshenko, Josef G. Bauer |
| 2006 | Unsupervised detection of whispered speech in the presence of normal phonation. Michael A. Carlin, Brett Y. Smolenski, Stanley J. Wenndt |
| 2006 | Unsupervised language model adaptation based on automatic text collection from WWW. Motoyuki Suzuki, Yasutomo Kajiura, Akinori Ito, Shozo Makino |
| 2006 | Unsupervised language model adaptation for Mandarin broadcast conversation transcription. David Mrva, Philip C. Woodland |
| 2006 | Unsupervised language model adaptation using latent semantic marginals. Yik-Cheung Tam, Tanja Schultz |
| 2006 | Unsupervised learning of HMM topology for text-dependent speaker verification. Ming Liu, Thomas S. Huang |
| 2006 | Unsupervised model adaptation for speaker verification. Alexandre Preti, Jean-François Bonastre |
| 2006 | Unsupervised segmentation of words into morphemes - morpho challenge 2005 application to automatic speech recognition. Mikko Kurimo, Mathias Creutz, Matti Varjokallio, Ebru Arisoy, Murat Saraclar |
| 2006 | Use of incrementally regulated discriminative margins in MCE training for speech recognition. Dong Yu, Li Deng, Xiaodong He, Alex Acero |
| 2006 | User expectations and real experience on a multimodal interactive system. Kristiina Jokinen, Topi Hurtig |
| 2006 | User responses to prosodic variation in fragmentary grounding utterances in dialog. Gabriel Skantze, David House, Jens Edlund |
| 2006 | User simulation for spoken dialogue systems: learning and evaluation. Kallirroi Georgila, James Henderson, Oliver Lemon |
| 2006 | Using SVM and error-correcting codes for multiclass dialog act classification in meeting corpus. Yang Liu |
| 2006 | Using a differential microphone array to estimate the direction of arrival of two acoustic sources. Fotios Talantzis, Anthony G. Constantinides, Lazaros C. Polymenakos |
| 2006 | Using genetic algorithms to weight acoustic features for speaker recognition. Maider Zamalloa, Germán Bordel, Luis Javier Rodríguez, Mikel Peñagarikano, Juan Pedro Uribe |
| 2006 | Using latent semantic indexing for morph-based spoken document retrieval. Ville T. Turunen, Mikko Kurimo |
| 2006 | Using posterior-based features in template matching for speech recognition. Guillermo Aradilla, Jithendra Vepa, Hervé Bourlard |
| 2006 | Using speech recognition technique for constructing a phonetically transcribed taiwanese (min-nan) text corpus. Min-Siong Liang, Ren-yuan Lyu, Yuang-Chin Chiang |
| 2006 | Using system and user performance features to improve emotion detection in spoken tutoring dialogs. Hua Ai, Diane J. Litman, Katherine Forbes-Riley, Mihai Rotaru, Joel R. Tetreault, Amruta Purandare |
| 2006 | Vector taylor series based joint uncertainty decoding. Haitian Xu, Luca Rigazio, David Kryze |
| 2006 | Vector-based spoken language recognition using output coding. Haizhou Li, Bin Ma, Rong Tong |
| 2006 | Visual correlates to prominence in several expressive modes. Jonas Beskow, Björn Granström, David House |
| 2006 | Visual speech segmentation and speaker recognition for transcription of TV news. Josef Chaloupka |
| 2006 | Vocal emotion recognition with cochlear implants. Xin Luo, Qian-Jie Fu, John J. Galvin III |
| 2006 | Voice GMM modelling for FESTIVAL/MBROLA emotive TTS synthesis. Mauro Nicolao, Carlo Drioli, Piero Cosi |
| 2006 | Voice activity detection in personal audio recordings using autocorrelogram compensation. Keansub Lee, Daniel P. W. Ellis |
| 2006 | Voice activity detector based on enhanced cumulant of LPC residual and on-line EM algorithm. David Cournapeau, Tatsuya Kawahara, Kenji Mase, Tomoji Toriyama |
| 2006 | Voice conversion based on mixtures of factor analyzers. Yosuke Uto, Yoshihiko Nankaku, Tomoki Toda, Akinobu Lee, Keiichi Tokuda |
| 2006 | Voice source correlates of prosodic features in american English: a pilot study. Markus Iseli, Yen-Liang Shue, Melissa A. Epstein, Patricia A. Keating, Jody Kreiman, Abeer Alwan |
| 2006 | Voting for two speaker segmentation. Narayanaswamy Balakrishnan, Rashmi Gangadharaiah, Richard M. Stern |
| 2006 | Wavelet ridge track interpretation in terms of formants. Salma Chaari, Kaïs Ouni, Noureddine Ellouze |
| 2006 | Weighted codebook mapping for noisy speech enhancement using harmonic-noise model. Esfandiar Zavarehei, Saeed Vaseghi, Qin Yan |
| 2006 | Within-class covariance normalization for SVM-based speaker recognition. Andrew O. Hatch, Sachin S. Kajarekar, Andreas Stolcke |
| 2006 | Word intelligibility estimation of noise-reduced speech. Takeshi Yamada, Masakazu Kumakura, Nobuhiko Kitawaki |
| 2006 | Word order and tonal shape in the production of focus in short Finnish utterances. Martti Vainio, Juhani Järvikivi, Stefan Werner |
| 2006 | Word structure and tone perception in Mandarin. Hansjörg Mixdorff, Yu Hu |