INTERSPEECH - RankMe

660 papers

Year	Title / Authors
2006	"yeah right": sarcasm recognition for spoken dialogue systems. Joseph Tepperman, David R. Traum, Shrikanth S. Narayanan
2006	/nailon/ - software for online analysis of prosody. Jens Edlund, Mattias Heldner
2006	50 years late: repeating Miller-Nicely 1955. Andrew Lovitt, Jont B. Allen
2006	A DTW-based dissimilarity measure for left-to-right hidden Markov models and its application to word confusability analysis. Qiang Huo, Wei Li
2006	A Spanish speech to sign language translation system for assisting deaf-mute people. Rubén San Segundo, Roberto Barra-Chicote, Luis Fernando D'Haro, Juan Manuel Montero, Ricardo de Córdoba, Javier Ferreiros
2006	A bootstrapping approach for developing language model of new spoken dialogue systems by selecting web texts. Teruhisa Misu, Tatsuya Kawahara
2006	A case study in the identification of prosodic cues to turn-taking: back-channeling in Arabic. Nigel G. Ward, Yaffa Al Bayyari
2006	A clustering approach to semantic decoding. Hui Ye, Steve J. Young
2006	A cohort-UBM approach to mitigate data sparseness for in-set/out-of-set speaker recognition. Vinod Prakash, John H. L. Hansen
2006	A comparative study of Gaussian selection methods in large vocabulary continuous speech recognition. Dirk Gehrig, Thomas Schaaf
2006	A comparison of inter-transcriber reliability for two systems of prosodic annotation: RaP (rhythm and pitch) and ToBI (tones and break indices). Laura Dilley, Mara Breen, Marti Bolivar, John Kraemer, Edward Gibson
2006	A comparison of singing evaluation algorithms. Partha Lal
2006	A computational auditory scene analysis system for robust speech recognition. Soundararajan Srinivasan, Yang Shao, Zhaozhang Jin, DeLiang Wang
2006	A constrained Baum-Welch algorithm for improved phoneme segmentation and efficient training. David Huggins-Daines, Alexander I. Rudnicky
2006	A discriminative method for speaker verification using the difference information. Zhenchun Lei, Yingchun Yang, Zhaohui Wu
2006	A framework for robust MFCC feature extraction using SNR-dependent compression of enhanced mel filter bank energies. Babak Nasersharif, Ahmad Akbari
2006	A hybrid phrase-based/statistical speech translation system. David Stallard, Fred Choi, Kriste Krstovski, Prem Natarajan, Rohit Prasad, Shirin Saleem
2006	A joint intention-based dialogue engine. Rajah Annamalai Subramanian, Philip R. Cohen
2006	A maximum likelihood training approach to irrelevant variability compensation based on piecewise linear transformations. Qiang Huo, Donglai Zhu
2006	A model for the f0 reset in corpus-based intonation approaches. Francisco Campillo Díaz, Jan P. H. van Santen, Eduardo Rodríguez Banga
2006	A model of the regularities underlying speaker variation: evidence from hybrid synthesis. Susan R. Hertz
2006	A multi-pass error detection and correction framework for Mandarin LVCSR. Zhengyu Zhou, Helen M. Meng, Wai Kit Lo
2006	A multi-space distribution (MSD) approach to speech recognition of tonal languages. Huanliang Wang, Yao Qian, Frank K. Soong, Jian-Lai Zhou, Jiqing Han
2006	A multiclass framework for speaker verification within an acoustic event sequence system. Nicolas Scheffer, Jean-François Bonastre
2006	A multilingual embodied conversational agent for tutoring speech and language learning. Dominic W. Massaro, Ying Liu, Trevor H. Chen, Charles Perfetti
2006	A multilingual expectations model for contextual utterances in mixed-initiative spoken dialogue. Hartwig Holzapfel, Alex Waibel
2006	A multipitch tracker for monaural speech segmentation. André Coy, Jon Barker
2006	A new HMM adaptation approach for the case of a hands-free speech input in reverberant rooms. Hans-Günter Hirsch, Harald Finster
2006	A new dual-microphone speech enhancement method for oriented noises. Hamid Reza Abutalebi, Majid Pourahmadi, Masoud Reza Aghabozorgi
2006	A new framework for system combination based on integrated hypothesis space. I-Fan Chen, Lin-Shan Lee
2006	A new set of features for text-independent speaker identification. Carol Y. Espy-Wilson, Sandeep Manocha, Srikanth Vishnubhotla
2006	A new single-ended measure for assessment of speech quality. Timothy Murphy, Dorel Picovici, Abdulhussain E. Mahdi
2006	A new state-dependent phonetic tied-mixture model with head-body-tail structured HMM for real-time continuous phoneme recognition system. Junho Park, Hanseok Ko
2006	A noninvasive, low-cost device to study the velopharyngeal port during speech and some preliminary results. Xiaochuan Niu, Alexander Kain, Jan P. H. van Santen
2006	A novel environment-dependent speech enhancement method with optimized memory footprint. Suhadi Suhadi, Sorel Stan, Tim Fingscheidt
2006	A novel framework of text-independent speaker verification based on utterance transform and iterative cohort modeling. Ming Liu, Huazhong Ning, Thomas S. Huang, Zhengyou Zhang
2006	A phrase-level machine translation approach for disfluency detection using weighted finite state transducers. Sameer Maskey, Bowen Zhou, Yuqing Gao
2006	A pitch marks filtering algorithm based on restricted dynamic programming. Francesc Alías, Carlos Monzo, Joan Claudi Socoró
2006	A probabilistic graphical model for microphone array source separation using rich pre-trained source models. Hagai Thomas Attias
2006	A quality measure method using Gaussian mixture models and divergence measure for speaker identification. Rong Zheng, Shuwu Zhang, Bo Xu
2006	A robust feature extraction based on the MTF concept for speech recognition in reverberant environment. Xugang Lu, Masashi Unoki, Masato Akagi
2006	A robust fusion method for multilingual spoken document retrieval systems employing tiered resources. Murat Akbacak, John H. L. Hansen
2006	A simulated-data adaptation technique for robust speech recognition. Nattanun Thatphithakkul, Boontee Kruatrachue, Chai Wutiwiwatchai, Sanparith Marukatat, Vataya Boonpiam
2006	A simulation based parameter optimization for a coarticulation model. Jianguo Wei, Xugang Lu, Jianwu Dang
2006	A speaker adaptation algorithm using principal curves in noisy environments. Jingying Wang, Zuoying Wang
2006	A spectral clustering approach to speaker diarization. Huazhong Ning, Ming Liu, Hao Tang, Thomas S. Huang
2006	A spectral-temporal method for pitch tracking. Stephen A. Zahorian, Princy Dikshit, Hongbing Hu
2006	A spoken language understanding approach using successive learners. Wei-Lin Wu, Ruzhan Lu, Hui Liu, Feng Gao
2006	A stochastic approach for dialog management based on neural networks. Lluís F. Hurtado, David Griol, Encarna Segarra, Emilio Sanchis
2006	A study of emotional speech articulation using a fast magnetic resonance imaging technique. Sungbok Lee, Erik Bresch, Jason Adams, Abe Kazemzadeh, Shrikanth S. Narayanan
2006	A study on detection based automatic speech recognition. Chengyuan Ma, Yu Tsao, Chin-Hui Lee
2006	A study on lattice rescoring with knowledge scores for automatic speech recognition. Sabato Marco Siniscalchi, Jinyu Li, Chin-Hui Lee
2006	A style control technique for speech synthesis using multiple regression HSMM. Takashi Nose, Junichi Yamagishi, Takao Kobayashi
2006	A successive state and mixture splitting for optimizing the size of models in speech recognition. Soo-Young Suk, Seong-Jun Hahm, Ho-Youl Jung, Hyun-Yeol Chung
2006	A syllable based continuous speech recognizer for Tamil. A. Lakshmi, Hema A. Murthy
2006	A technique for controlling voice quality of synthetic speech using multiple regression HSMM. Makoto Tachibana, Takashi Nose, Junichi Yamagishi, Takao Kobayashi
2006	A text-prompted distributed speaker verification system implemented on a cellular phone and a mobile terminal. Tsuneo Kato, Hisashi Kawai
2006	A TextTiling based approach to topic boundary detection in meetings. Satanjeev Banerjee, Alexander I. Rudnicky
2006	A time-synchronous phonetic decoder for a long-contextual-span hidden trajectory model. Xiaolong Li, Li Deng, Dong Yu, Alex Acero
2006	A tone recognition framework for continuous Mandarin speech. Lei He, Jie Hao
2006	A trajectory mixture density network for the acoustic-articulatory inversion mapping. Korin Richmond
2006	A user simulator based on VoiceXML for evaluation of spoken dialog systems. Akinori Ito, Keisuke Shimada, Motoyuki Suzuki, Shozo Makino
2006	A vector space approach to environment modeling for robust speech recognition. Yu Tsao, Chin-Hui Lee
2006	A wavelet-based parameterization for speech/music segmentation. E. Didiot, Irina Illina, Odile Mella, Dominique Fohr, Jean Paul Haton
2006	A weight estimation method using LDA for multi-band speech recognition. Koji Iwano, Kaname Kojima, Sadaoki Furui
2006	ASR-based corrective feedback on pronunciation: does it really work? Ambra Neri, Catia Cucchiarini, Helmer Strik
2006	Accident - execute: increased activation in nonnative listening. Mirjam Broersma
2006	Acoustic analysis and automatic recognition of spontaneous children's speech. Matteo Gerosa, Diego Giuliani, Shrikanth S. Narayanan
2006	Acoustic characterization of children with speech delay. H. Timothy Bunnell, James B. Polikoff
2006	Acoustic cues for the classification of regular and irregular phonation. Kushan Surana, Janet Slifka
2006	Acoustic model training based on linear transformation and MAP modification for HSMM-based speech synthesis. Katsumi Ogata, Makoto Tachibana, Junichi Yamagishi, Takao Kobayashi
2006	Acoustic modeling for spoken dialogue systems based on unsupervised utterance-based selective training. Tobias Cincarek, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano
2006	Adaptive filtering for attenuating musical noise caused by spectral subtraction. Takahiro Murakami, Yoshihisa Ishida
2006	Adaptive multimodal fusion by uncertainty compensation. Vassilis Pitsikalis, Athanassios Katsamanis, George Papandreou, Petros Maragos
2006	Adaptive speech enhancement for speech separation in diffuse noise. Rong Hu, Yunxin Zhao
2006	Advances in lecture recognition: the ISL RT-06s evaluation system. Christian Fügen, Matthias Wölfel, John W. McDonough, Shajith Ikbal, Florian Kraft, Kornel Laskowski, Mari Ostendorf, Sebastian Stüker, Ken'ichi Kumatani
2006	All-pole model estimation of vocal tract on the frequency domain. Luis Weruaga, Amar Al-Khayat
2006	Amharic speech synthesis using cepstral method with stress generation rule. Tadesse Anberbir, Tomio Takara
2006	An ERB loudness pattern based objective speech quality measure. Guo Chen, Vijay Parsa, Susan Scollie
2006	An HMM-based singing voice synthesis system. Keijiro Saino, Heiga Zen, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda
2006	An MRI based study of the acoustic effects of sinus cavities and its application to speaker recognition. Tarun Pruthi, Carol Y. Espy-Wilson
2006	An acoustic and articulatory study of Lombard speech: global effects on the utterance. Maeva Garnier, Lucie Bailly, Marion Dohen, Pauline Welby, Hélène Loevenbruck
2006	An adaptive sampling procedure for speech perception experiments. Geoffrey Stewart Morrison
2006	An annotation scheme for agreement analysis. Siew Leng Toh, Fan Yang, Peter A. Heeman
2006	An annotation scheme for complex disfluencies. Peter A. Heeman, Andy McMillin, J. Scott Yaruss
2006	An assessment of automatic speech recognition as speech intelligibility estimation in the context of additive noise. Wei Ming Liu, John S. D. Mason, Nicholas W. D. Evans, Keith A. Jellyman
2006	An automatic singing skill evaluation method for unknown melodies using pitch interval accuracy and vibrato features. Tomoyasu Nakano, Masataka Goto, Yuzuru Hiraga
2006	An effective and efficient utterance verification technology using word n-gram filler models. Dong Yu, Yun-Cheng Ju, Alex Acero
2006	An efficient bispectrum phase entropy-based algorithm for VAD. J. M. Górriz, Javier Ramírez, Carlos García Puntonet, José C. Segura
2006	An efficient segment-based speech compression technique for hand-held TTS systems. Chang-Heon Lee, Sung-Kyo Jung, Thomas Eriksson, Won-Suk Jun, Hong-Goo Kang
2006	An improved affine projection algorithm based crosstalk resistant adaptive noise canceller. Guo Chen, Vijay Parsa
2006	An improved mel-Wiener filter for mel-LPC based speech recognition. Md. Babul Islam, Hiroshi Matsumoto, Kazumasa Yamamoto
2006	An incremental algorithm for signal reconstruction from short-time Fourier transform magnitude. Jake V. Bouvrie, Tony Ezzat
2006	An information theoretic tool for investigating speech perception. Bryce E. Lobdell, Jont B. Allen
2006	An integrated approach to improve speech recognition rate for non-native speakers. Yunbin Deng, Xiaokun Li, Chiman Kwan, Roger Xu, Bhiksha Raj, Richard M. Stern, David Williamson
2006	An integrated solution for error concealment in DSR systems over wireless channels. Antonio M. Peinado, Angel M. Gomez, Victoria E. Sánchez, José L. Pérez-Córdoba, Antonio J. Rubio
2006	An investigation of manifold learning for speech analysis. Andrew Errity, John McKenna
2006	An online adaptive filtering algorithm for the vocal joystick. Xiao Li, Jonathan Malkin, Susumu Harada, Jeff A. Bilmes, Richard Wright, James A. Landay
2006	An optimum microphone array post-filter for speech applications. Stamatios Lefkimmiatis, Dimitrios Dimitriadis, Petros Maragos
2006	An unified unit-selection framework for ultra low bit-rate speech coding. V. Ramasubramanian, D. Harish
2006	An user-centered development of an intuitive dialog control for speech-controlled music selection in cars. Stefan Schulz, Hilko Donker
2006	Analysis and detection of speech under sleep deprivation. Tin Lay Nwe, Haizhou Li, Minghui Dong
2006	Analysis of HMM temporal evolution for automatic speech recognition and utterance verification. Marta Casar, José A. R. Fonollosa
2006	Analysis of correlation between audio and visual speech features for clean audio feature prediction in noise. Ibrahim Almajai, Ben Milner, Jonathan Darch
2006	Analysis of Lombard effect under different types and levels of noise with application to in-set speaker ID systems. Vaishnevi S. Varadarajan, John H. L. Hansen
2006	Analysis of nonmodal phonation using minimum entropy deconvolution. Nicolas Malyska, Thomas F. Quatieri
2006	Analysis of overlaps in meetings by dialog factors, hot spots, speakers, and collection site: insights for automatic speech recognition. Özgür Çetin, Elizabeth Shriberg
2006	Analysis of prosodic and linguistic cues of phrase finals for turn-taking and dialog acts. Carlos Toshinori Ishi, Hiroshi Ishiguro, Norihiro Hagita
2006	Analyzing dialogue data for real-world emotional speech classification. Ryuichi Nisimura, Souji Omae, Hideki Kawahara, Toshio Irino
2006	Analyzing reusability of speech corpus based on statistical multidimensional scaling method. Goshu Nagino, Makoto Shozakai
2006	Articulatory features for "meeting" speech recognition. Florian Metze
2006	Assessing the reading level of web pages. Sarah E. Petersen, Mari Ostendorf
2006	Assessment of articulatory sub-systems of dysarthric speech using an isolated-style phoneme recognition system. P. Vijayalakshmi, M. Ramasubba Reddy, Douglas D. O'Shaughnessy
2006	Audio person tracking in a smart-room environment. Alberto Abad, Carlos Segura, Dusan Macho, Javier Hernando, Climent Nadeu
2006	Audio-visual speech recognition in the presence of a competing speaker. Xu Shao, Jon Barker
2006	Auto-segmentation based VAD for robust ASR. Yu Shi, Frank K. Soong, Jian-Lai Zhou
2006	Automatic English stop consonants classification using wavelet analysis and hidden Markov models. Marco Kühne, Roberto Togneri
2006	Automatic Mandarin pronunciation scoring for native learners with dialect accent. Si Wei, Qing-Sheng Liu, Yu Hu, Ren-Hua Wang
2006	Automatic acoustic identification of insects inspired by the speaker recognition paradigm. Ilyas Potamitis, Todor Ganchev, Nikos Fakotakis
2006	Automatic alignment and error correction of human generated transcripts for long speech recordings. Timothy J. Hazen
2006	Automatic assignment of anchoring points on vowel templates for defining correspondence between time-frequency representations of speech samples. Toru Takahashi, Masashi Nishi, Toshio Irino, Hideki Kawahara
2006	Automatic detection of irregular phonation in continuous speech. Srikanth Vishnubhotla, Carol Y. Espy-Wilson
2006	Automatic detection of voice onset time contrasts for use in pronunciation assessment. Abe Kazemzadeh, Joseph Tepperman, Jorge F. Silva, Hong You, Sungbok Lee, Abeer Alwan, Shrikanth S. Narayanan
2006	Automatic emotion recognition of speech signal in Mandarin. Sheng Zhang, P. C. Ching, Fanrang Kong
2006	Automatic generation of statistical language models for interactive voice response applications. Mithun Balakrishna, Cyril Cerovic, Dan I. Moldovan, Ellis Cave
2006	Automatic grammar correction for second-language learners. John Lee, Stephanie Seneff
2006	Automatic initial/final generation for dialectal Chinese speech recognition. Linquan Liu, Thomas Fang Zheng, Wenhu Wu
2006	Automatic language identification using wavelets. Ana Lilia Reyes-Herrera, Luis Villaseñor-Pineda, Manuel Montes-y-Gómez
2006	Automatic metadata generation and video editing based on speech and image recognition for medical education contents. Satoshi Tamura, Koji Hashimoto, Jiong Zhu, Satoru Hayamizu, Hirotsugu Asai, Hideki Tanahashi, Makoto Kanagawa
2006	Automatic phonetic segmentation by using a SPM-based approach for a Mandarin singing voice corpus. Cheng-Yuan Lin, Jyh-Shing Roger Jang
2006	Automatic phonetic transcription of large speech corpora: a comparative study. Christophe Van Bael, Lou Boves, Henk van den Heuvel, Helmer Strik
2006	Automatic recognition of speakers' age and gender on the basis of empirical studies. Christian A. Müller
2006	Automatic removal of typed keystrokes from speech signals. Amarnag Subramanya, Michael L. Seltzer, Alex Acero
2006	Automatic speech recognition experiments with articulatory data. Esmeralda Uraga, Thomas Hain
2006	Automatic speech recognition of Cantonese-English code-mixing utterances. Joyce Y. C. Chan, P. C. Ching, Tan Lee, Houwei Cao
2006	Automatic speech segmentation with multiple statistical models. Seung Seop Park, Jong Won Shin, Nam Soo Kim
2006	Automatic syllable-pattern induction in statistical Thai text-to-phone transcription. Ausdang Thangthai, Chatchawarn Hansakunbuntheung, Rungkarn Siricharoenchai, Chai Wutiwiwatchai
2006	Automatic transcription of Somali language. Abdillahi Nimaan, Pascal Nocera, Jean-François Bonastre
2006	BINSEG: an efficient speaker-based segmentation technique. Jindrich Zdánský
2006	Basque-Spanish language identification using phone-based methods. Víctor G. Guijarrubia, M. Inés Torres
2006	Bayesian decision tree state tying for conversational speech recognition. Rusheng Hu, Yunxin Zhao
2006	Bayesian networks for phonetic classification using time-scale features. Franz Pernkopf, Tuan Van Pham
2006	Boosting HMM performance with a memory upgrade. Mathias De Wachter, Kris Demuynck, Dirk Van Compernolle
2006	Bootstrapping language models for dialogue systems. Karl Weilhammer, Matthew N. Stuttle, Steve J. Young
2006	Building an English speech synthesis system from a Japanese ALS patient's voice. Akemi Iida, Jun Ito, Shimpei Kajima, Tsutomu Sugawara
2006	Building an English-Iraqi Arabic machine translation system for spoken utterances with limited resources. Jason Riesa, Behrang Mohit, Kevin Knight, Daniel Marcu
2006	CASA based speech separation for robust speech recognition. Runqiang Han, Pei Zhao, Qin Gao, Zhiping Zhang, Hao Wu, Xihong Wu
2006	CENSREC2: corpus and evaluation environments for in car continuous digit speech recognition. Satoshi Nakamura, Masakiyo Fujimoto, Kazuya Takeda
2006	CHAT: a conversational helper for automotive tasks. Fuliang Weng, Sebastian Varges, Badri Raghunathan, Florin Ratiu, Heather Pon-Barry, Brian Lathrop, Qi Zhang, Harry Bratt, Tobias Scheideck, Kui Xu, Matthew Purver, Rohit Mishra, Annie Lien, Madhuri Raya, Stanley Peters, Yao Meng, J. Russell, Lawrence Cavedon, Elizabeth Shriberg, Hauke Schmidt, R. Prieto
2006	CLUSTERGEN: a statistical parametric synthesizer using trajectory modeling. Alan W. Black
2006	Call analysis with classification using speech and non-speech features. Yun-Cheng Ju, Ye-Yi Wang, Alex Acero
2006	Category formation and the role of spectral quality in the perception and production of English front vowels. Ricardo Augusto Hoffmann Bion, Paola Escudero, Andréia S. Rauber, Barbara O. Baptista
2006	Characterization of cued speech vowels from the inner lip contour. Noureddine Aboutabit, Denis Beautemps, Laurent Besacier
2006	Chinese input method based on reduced Mandarin phonetic alphabet. Chun-Han Tseng, Chia-Ping Chen
2006	Classified comfort noise generation for efficient voice transmission. Yasheng Qian, Wei-Shou Hsu, Peter Kabal
2006	Classroom success of an intelligent tutoring system for lexical practice and reading comprehension. Michael Heilman, Kevyn Collins-Thompson, Jamie Callan, Maxine Eskénazi
2006	Clean speech feature estimation based on soft spectral masking. Young Joon Kim, Woohyung Lim, Nam Soo Kim
2006	Cluster-based user simulations for learning dialogue strategies. Verena Rieser, Oliver Lemon
2006	Colloquial Iraqi ASR for speech translation. Shirin Saleem, Rohit Prasad, Prem Natarajan
2006	Combining acoustic, lexical, and syntactic evidence for automatic unsupervised prosody labeling. Sankaranarayanan Ananthakrishnan, Shrikanth S. Narayanan
2006	Combining missing-feature theory, speech enhancement and speaker-dependent/-independent modeling for speech separation. Ji Ming, Timothy J. Hazen, James R. Glass
2006	Combining multiple-sized sub-word units in a speech recognition system using baseform selection. T. Nagarajan, P. Vijayalakshmi, Douglas D. O'Shaughnessy
2006	Combining phonetic attributes using conditional random fields. Jeremy Morris, Eric Fosler-Lussier
2006	Compact n-gram models by incremental growing and clustering of histories. Sami Virpioja, Mikko Kurimo
2006	Comparative analysis of formants of British, American and Australian accents. Seyed Ghorshi, Saeed Vaseghi, Qin Yan
2006	Comparative study on contributions of pitch-synchronization and peak-amplitude towards robustness issue of ASR. Muhammad Ghulam, Junsei Horikawa, Tsuneo Nitta
2006	Comparison of Slovak and Czech speech recognition based on grapheme and phoneme acoustic models. Slavomír Lihan, Jozef Juhár, Anton Cizmar
2006	Comparison of acoustic modeling techniques for Vietnamese and Khmer ASR. Viet Bac Le, Laurent Besacier
2006	Comparison of keyword spotting methods for searching in speech. Lubos Smídl, Josef V. Psutka
2006	Comparison of prediction based LSF quantization methods using split VQ. Saikat Chatterjee, T. V. Sreenivas
2006	Comparison of the ITU-T P.85 standard to other methods for the evaluation of text-to-speech systems. Dmitry Sityaev, Katherine M. Knill, Tina Burrows
2006	Computer aided pronunciation learning system using speech recognition techniques. Sherif Mahdy Abdou, Salah Eldeen Hamid, Mohsen A. Rashwan, Abdurrahman Samir, Ossama Abdel-Hamid, Mostafa Shahin, Waleed Nazih
2006	Computer-assisted closed-captioning of live TV broadcasts in French. Gilles Boulianne, Jean-Francois Beaumont, Maryse Boisvert, Julie Brousseau, Patrick Cardinal, Claude Chapdelaine, Michel Comeau, Pierre Ouellet, Frédéric Osterrath
2006	Conceptual decoding from word lattices: application to the spoken dialogue corpus MEDIA. Christophe Servan, Christian Raymond, Frédéric Béchet, Pascal Nocera
2006	Conditional random fields for hierarchical segment selection in text-to-speech synthesis. Christian Weiss, Wolfgang Hess
2006	Consonant and vowel confusions in speech-weighted noise. Sandeep Phatak, Jont B. Allen
2006	Constrained structural maximum a posteriori linear regression for average-voice-based speech synthesis. Yuji Nakano, Makoto Tachibana, Junichi Yamagishi, Takao Kobayashi
2006	Constructing stylistic synthesis databases from audio books. Yong Zhao, Di Peng, Lijuan Wang, Min Chu, Yining Chen, Peng Yu, Jun Guo
2006	Continual on-line monitoring of Czech spoken broadcast programs. Jan Nouza, Jindrich Zdánský, Petr Cerva, Jan Kolorenc
2006	Continuous time-frequency masking method for blind speech separation with adaptive choice of threshold parameter using ICA. Zbynek Koldovský, Jan Nouza, Jan Kolorenc
2006	Conversational help desk: vague callers and context switch. Osamuyimen Stewart, Juan M. Huerta, Ea-Ee Jan, Cheng Wu, Xiang Li, David M. Lubensky
2006	Conversational quality estimation model for wideband IP-telephony services. Hitoshi Aoki, Atsuko Kurashima, Akira Takahashi
2006	Conversion from phoneme based to grapheme based acoustic models for speech recognition. Andrej Zgank, Zdravko Kacic
2006	Cooperation between global and local methods for the automatic segmentation of speech synthesis corpora. Safaa Jarifi, Dominique Pastor, Olivier Rosec
2006	Corpus design based on the Kullback-Leibler divergence for text-to-speech synthesis application. Aleksandra Krul, Géraldine Damnati, François Yvon, Thierry Moudenc
2006	Corpus-based generation of fundamental frequency contours using generation process model and considering emotional focuses. Keikichi Hirose, Yasufumi Asano, Nobuaki Minematsu
2006	Coupling particle filters with automatic speech recognition for speech feature enhancement. Friedrich Faubel, Matthias Wölfel
2006	Cross-language evaluation of voice-to-phoneme conversions for voice-tag application in embedded platforms. Yan Ming Cheng, Changxue Ma, Lynette Melnar
2006	Cross-lingual dialog model for speech to speech translation. Emil Ettelaie, Panayiotis G. Georgiou, Shrikanth S. Narayanan
2006	Cross-system adaptation and combination for continuous speech recognition: the influence of phoneme set and acoustic front-end. Sebastian Stüker, Christian Fügen, Susanne Burger, Matthias Wölfel
2006	Cues for hesitation in speech synthesis. Rolf Carlson, Kjell Gustafson, Eva Strangert
2006	Data-driven design of front-end filter bank for Lombard speech recognition. Hynek Boril, Petr Fousek, Petr Pollák
2006	Decision directed constrained iterative speech enhancement. Amit Das, John H. L. Hansen
2006	Decision tree-based training of probabilistic concatenation models for corpus-based speech synthesis. Shinsuke Sakai, Tatsuya Kawahara
2006	Design and performance analysis of a factoid question answering system for spontaneous speech transcriptions. Mihai Surdeanu, David Dominguez-Sal, Pere Comas
2006	Detecting anger in automated voice portal dialogs. Felix Burkhardt, Jitendra Ajmera, Roman Englert, Joachim Stegmann, Winslow Burleson
2006	Detecting question-bearing turns in spoken tutorial dialogues. Jackson Liscombe, Jennifer J. Venditti, Julia Hirschberg
2006	Detection and separation of speech events in meeting recordings. Futoshi Asano, Jun Ogata
2006	Detection of a third speaker in telephone conversations. Uchechukwu O. Ofoegbu, Ananth N. Iyer, Robert E. Yantorno, Stanley J. Wenndt
2006	Detection of quotations and inserted clauses and its application to dependency structure analysis in spontaneous Japanese. Ryoji Hamabe, Kiyotaka Uchimoto, Tatsuya Kawahara, Hitoshi Isahara
2006	Detection of word fragments in Mandarin telephone conversation. Cheng-Tao Chu, Yun-Hsuan Sung, Yuan Zhao, Daniel Jurafsky
2006	Developing an automatic assessment tool for children's oral reading. Leen Cleuren, Jacques Duchateau, Alain Sips, Pol Ghesquière, Hugo Van hamme
2006	Developing consistent pronunciation models for phonemic variants. Marelie H. Davel, Etienne Barnard
2006	Developing speech dialogs for multimodal HMIs using finite state machines. Silke Goronzy, Raquel Mochales, Nicole Beringer
2006	Development and evaluation of speech database in automotive environments for practical speech recognition systems. Yasunari Obuchi, Nobuo Hataoka
2006	Development of a program for self assessment of Japanese pronunciation by English learners. Chiharu Tsurutani, Yutaka Yamauchi, Nobuaki Minematsu, Dean Luo, Kazutaka Maruyama, Keikichi Hirose
2006	Development of advanced dialog systems with PATE. Norbert Pfleger, Jan Schehl
2006	Development of prototype text-to-speech systems for Northern Sotho. H. J. Oosthuizen, S. T. Phihlela, Madimetja Jonas D. Manamela
2006	Development of Slovak GALAXY/VoiceXML based spoken language dialogue system to retrieve information from the internet. Jozef Juhár, Stanislav Ondás, Anton Cizmar, Milan Rusko, Gregor Rozinaj, Roman Jarina
2006	Dialog act tagging with support vector machines and hidden Markov models. Dinoj Surendran, Gina-Anne Levow
2006	Dialogue act compression via pitch contour preservation. Gabriel Murray, Steve Renals
2006	Discourse structure and speech recognition problems. Mihai Rotaru, Diane J. Litman
2006	Discriminant linear processing of time-frequency plane. Fabio Valente, Hynek Hermansky
2006	Discriminating speech and non-speech with regularized least squares. Ryan Rifkin, Nima Mesgarani
2006	Discriminative MLE training using a product of Gaussian likelihoods. T. Nagarajan, Douglas D. O'Shaughnessy
2006	Discriminative adaptation for speaker verification. Chris Longworth, Mark J. F. Gales
2006	Discriminative kernel-based phoneme sequence recognition. Joseph Keshet, Shai Shalev-Shwartz, Samy Bengio, Yoram Singer, Dan Chazan
2006	Discriminative models for spoken language understanding. Ye-Yi Wang, Alex Acero
2006	Discriminative named entity recognition of speech data using speech recognition confidence. Katsuhito Sudoh, Hajime Tsukada, Hideki Isozaki
2006	Disentangling gestural and auditory contrast accounts of compensation for coarticulation. Navin Viswanathan, James S. Magnuson, Carol A. Fowler
2006	Distance measure between Gaussian distributions for discriminating speaking styles. Goshu Nagino, Makoto Shozakai
2006	Distant-talking continuous speech recognition based on a novel reverberation model in the feature domain. Armin Sehr, Marcus Zeller, Walter Kellermann
2006	Doing research on a deployed spoken dialogue system: one year of Let's Go! experience. Antoine Raux, Dan Bohus, Brian Langner, Alan W. Black, Maxine Eskénazi
2006	Dynamic evidence models in a DBN phone recognizer. William Schuler, Tim Miller, Stephen T. Wu, Andrew Exley
2006	Dynamic extension of a grammar-based dialogue system: constructing an all-recipes knowing robot. Petra Gieselmann, Alex Waibel
2006	Dynamic help generation by estimating user's mental model in spoken dialogue systems. Yuichiro Fukubayashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno
2006	Edge-splitting in a cumulative multimodal system, for a no-wait temporal threshold on information fusion, combined with an under-specified display. Edward C. Kaiser, Paulo Barthelmess
2006	Effect of dynamic information of formants on discrimination of English vowels in consonantal contexts by Japanese listeners. Akiyo Joto
2006	Effect of genre, speaker, and word class on the realization of given and new information. Agustín Gravano, Julia Hirschberg
2006	Effects of familiarity with faces and voices on second-language speech processing: components of memory traces. Debra M. Hardison
2006	Effects of featural similarity and overlap position on lexical confusions and overt similarity judgments. Sarah C. Creel, Delphine Dahan, Daniel Swingley
2006	Effects of frequency shifts on perceived naturalness and gender information in speech. Peter F. Assmann, Sophia Dembling, Terrance M. Nearey
2006	Effects of midline tongue piercing on spectral centroid frequencies of sibilants. Tom Kovacs, Donald S. Finan
2006	Effects of word frequency on the acoustic durations of affixes. Mark Pluymaekers, Mirjam Ernestus, R. Harald Baayen
2006	Efficient Gaussian mixture model evaluation in voice conversion. Jilei Tian, Jani Nurminen, Victor Popa
2006	Efficient VQ techniques and general noise shaping in noise feedback coding. Jes Thyssen, Juin-Hwey Chen
2006	Efficient interactive retrieval of spoken documents with key terms ranked by reinforcement learning. Yi-Cheng Pan, Jia-Yu Chen, Yen-shin Lee, Yi-Sheng Fu, Lin-Shan Lee
2006	Eigenvoice conversion based on Gaussian mixture model. Tomoki Toda, Yamato Ohtani, Kiyohiro Shikano
2006	Emotion detection in infants' cries based on a maximum likelihood approach. Shoichi Matsunaga, S. Sakaguchi, Masaru Yamashita, Sueharu Miyahara, S. Nishitani, Kazuyuki Shinohara
2006	Emotion recognition in spontaneous speech using GMMs. Daniel Neiberg, Kjell Elenius, Kornel Laskowski
2006	Emovoice: a system to generate emotions in speech. João P. Cabral, Luís C. Oliveira
2006	Enhanced dynamic codebook reordering for advanced quantizer structures. Jani Nurminen
2006	Enhancement of harmonic content of speech based on a dynamic programming pitch tracking algorithm. Mark R. Every, Philip J. B. Jackson
2006	Enhancing the performance of a GMM-based speaker identification system in a multi-microphone setup. Andreas Stergiou, Aristodemos Pnevmatikakis, Lazaros C. Polymenakos
2006	Estimation of the quality dimension "directness/frequency content" for the instrumental assessment of speech quality. Kirstin Scholz, Marcel Wältermann, Lu Huo, Alexander Raake, Sebastian Möller, Ulrich Heute
2006	Evaluating a virtual speech cuer. Guillaume Gibert, Gérard Bailly, Frédéric Elisei
2006	Evaluating prosody of Mandarin speech for language learning. Minghui Dong, Haizhou Li, Tin Lay Nwe
2006	Evaluation of a spoken dialogue system with usability tests and long-term pilot studies: similarities and differences. Markku Turunen, Jaakko Hakulinen, Anssi Kainulainen
2006	Evaluation of content presentation strategies for an in-car spoken dialogue system. Heather Pon-Barry, Fuliang Weng, Sebastian Varges
2006	Evaluation of objective measures for speech enhancement. Yi Hu, Philipos C. Loizou
2006	Evaluation of perceptual quality of control point reduction in rule-based synthesis. Kimmo Pärssinen, Marko Moberg
2006	Evaluation of voice activity detection by combining multiple features with weight adaptation. Yusuke Kida, Tatsuya Kawahara
2006	Evolving emotional prosody. Cecilia Ovesdotter Alm, Xavier Llorà
2006	Examining knowledge sources for human error correction. Yongmei Shi, Lina Zhou
2006	Example-based grapheme-to-phoneme conversion for Thai. Paisarn Charoenpornsawat, Tanja Schultz
2006	Expanding phonetic coverage in unit selection synthesis through unit substitution from a donor voice. Alistair Conkie, Ann K. Syrdal
2006	Experiments on Chinese speech recognition with tonal models and pitch estimation using the Mandarin speecon data. Ying Sun, Daniel Willett, Raymond Brueckner, Rainer Gruhn, Dirk Bühler
2006	Exploiting dendritic autocorrelogram structure to identify spectro-temporal regions dominated by a single sound source. Ning Ma, Phil D. Green, André Coy
2006	Exploiting polynomial-fit histogram equalization and temporal average for robust speech recognition. Shih-Hsiang Lin, Yao-Ming Yeh, Berlin Chen
2006	Exploiting semantic relations for a spoken language understanding application. Catherine Kobus, Géraldine Damnati, Lionel Delphin-Poulat, Renato De Mori
2006	Exploring the unknown - collecting 1000 speakers over the internet for the ph@ttsessionz database of adolescent speakers. Christoph Draxler
2006	Expressive prosody for unit-selection speech synthesis. Volker Strom, Robert A. J. Clark, Simon King
2006	Extension and further analysis of higher order cepstral moment normalization (HOCMN) for robust features in speech recognition. Chang-Wen Hsu, Lin-Shan Lee
2006	Extracting formants from short segments of speech using group delay functions. Joseph M. Anand, Sunitha Guruprasad, B. Yegnanarayana
2006	Factors affecting speakers' choice of fillers in Japanese presentations. Michiko Watanabe, Yasuharu Den, Keikichi Hirose, Shusaku Miwa, Nobuaki Minematsu
2006	Farsbayan: a unit selection based Farsi speech synthesizer. Mohammad Mehdi Homayounpour, Majid Namnabat
2006	Fast SVM training based on the choice of effective samples for audio classification. Shilei Zhang, Hongchen Jiang, Shuwu Zhang, Bo Xu
2006	Fast and effective retraining on contrastive vocal characteristics with bidirectional long short-term memory nets. Nicole Beringer
2006	Feature analysis for emotion recognition from Mandarin speech considering the special characteristics of Chinese language. Yi-Hao Kao, Lin-Shan Lee
2006	Feature and model space speaker adaptation with full covariance Gaussians. Daniel Povey, George Saon
2006	Feature combination using linear discriminant analysis and its pitfalls. Ralf Schlüter, András Zolnay, Hermann Ney
2006	Feature extraction for spectral continuity measures in concatenative speech synthesis. Barry Kirkpatrick, Darragh O'Brien, Ronan Scaife
2006	Feature normalization using smoothed mixture transformations. Patrick Kenny, Vishwa Gupta, Gilles Boulianne, Pierre Ouellet, Pierre Dumouchel
2006	Finding the gaps: applying a connectionist model of word segmentation to noisy phone-recognized speech data. C. Anton Rytting
2006	Formant-based English vowel assessment for Chinese in Taiwan. Jiang-Chun Chen, Wei-Tang Hsu, Jyh-Shing Roger Jang, Ren-yuan Lyu, Yuang-Chin Chiang
2006	Forward-backwards training of hybrid HMM/BN acoustic models. Konstantin Markov, Satoshi Nakamura
2006	Frame based system combination and a comparison with weighted ROVER and CNC. Björn Hoffmeister, Tobias Klein, Ralf Schlüter, Hermann Ney
2006	Frequency warping based on mapping formant parameters. Zhiwei Shuang, Raimo Bakis, Slava Shechtman, Dan Chazan, Yong Qin
2006	Frequency warping by linear transformation of standard MFCC. Sankaran Panchapagesan
2006	Friends and enemies: a novel initialization for speaker diarization. Xavier Anguera, Chuck Wooters, Javier Hernando
2006	From pre-recorded prompts to corporate voices: on the migration of interactive voice response applications. Volker Fischer, Siegfried Kunzmann
2006	From reaction to prediction: experiments with computational models of turn-taking. David Schlangen
2006	Further developments in LSM-based boundary training for unit selection TTS. Jerome R. Bellegarda
2006	Further investigations on the relationship between objective measures of speech quality and speech recognition rates in noisy environments. Francisco José Fraga, Carlos Alberto Ynoguti, André Godoi Chiovato
2006	Fusion of phonotactic and prosodic knowledge for language identification. Chi-Yueh Lin, Hsiao-Chuan Wang
2006	GMM-based acoustic modeling for embedded speech recognition. Christophe Lévy, Georges Linarès, Jean-François Bonastre
2006	Gammatone auditory filterbank and independent component analysis for speaker identification. Yushi Zhang, Waleed H. Abdulla
2006	Generalization of the minimum classification error (MCE) training based on maximizing generalized posterior probability (GPP). Qiang Fu, Antonio Moreno-Daniel, Biing-Hwang Juang, Jian-Lai Zhou, Frank K. Soong
2006	Generating German intonation with a trainable prosodic model. Gérard Bailly, Jan Gorisch
2006	Generating complementary systems for speech recognition. Catherine Breslin, Mark J. F. Gales
2006	Generating time-constrained audio presentations of structured information. Brian Langner, Rohit Kumar, Arthur Chan, Lingyun Gu, Alan W. Black
2006	Geometrically constrained permutation-free source separation in an undercomplete speech unmixing scenario. Erik Visser
2006	Glottal closure and opening detection for flexible parametric voice coding. Pamornpol Jinachitra
2006	Grapheme-to-phoneme conversion using automatically extracted associative rules for Korean TTS system. Jinsik Lee, Seungwon Kim, Gary Geunbae Lee
2006	HMM-based MAP prediction of voiced and unvoiced formant frequencies from noisy MFCC vectors. Jonathan Darch, Ben Milner
2006	HMM-based continuous sign language recognition using a fast optical flow parameterization of visual information. Guillermo Cortés, Luz García, M. Carmen Benítez, José C. Segura
2006	HMM-based unit selection using frame sized speech segments. Zhen-Hua Ling, Ren-Hua Wang
2006	Handling convolutional noise in missing data automatic speech recognition. Maarten Van Segbroeck, Hugo Van hamme
2006	Have we met? MDP based speaker ID for robot dialogue. Filip Krsmanovic, Curtis Spencer, Daniel Jurafsky, Andrew Y. Ng
2006	High-quality speech translation in the flight domain. Chao Wang, Stephanie Seneff
2006	High-rate data embedding in unvoiced speech. Konrad Hofbauer, Gernot Kubin
2006	Highly directional multi-beam audio loudspeaker. Dirk Olszewski, Klaus Linhard
2006	Highly noise robust text-dependent speaker recognition based on hypothesized Wiener filtering. V. Ramasubramanian, Deepak Vijaywargiay, Kumar V. Praveen
2006	How auditory and visual prosody is used in end-of-utterance detection. Pashiera Barkhuysen, Emiel Krahmer, Marc Swerts
2006	How to handle gender and number agreement in statistical language models? Caroline Lavecchia, Kamel Smaïli, Jean Paul Haton
2006	Hypothesis spaces for minimum Bayes risk training in large vocabulary speech recognition. Matthew Gibson, Thomas Hain
2006	Hypothesis-based feature combination of multiple speech inputs for robust speech recognition in automotive environments. Yasunari Obuchi, Nobuo Hataoka
2006	Identification of confusion and surprise in spoken dialog using prosodic features. Rohit Kumar, Carolyn P. Rosé, Diane J. Litman
2006	Identification of regional accents in French: perception and categorization. Cécile Woehrling, Philippe Boula de Mareüil
2006	Identify language origin of personal names with normalized appearance number of web pages. Jia-Li You, Yining Chen, Min Chu, Yong Zhao, Jin-Lin Wang
2006	Imperfect transcript driven speech recognition. Benjamin Lecouteux, Georges Linarès, Pascal Nocera, Jean-François Bonastre
2006	Improved hybrid microphone array post-filter by integrating a robust speech absence probability estimator for speech enhancement. Junfeng Li, Masato Akagi, Yôiti Suzuki
2006	Improved language identification using support vector machines for language modeling. Xi Yang, Lu-Feng Zhai, Man-Hung Siu, Herbert Gish
2006	Improved performance evaluation of speech event detectors. Carla Lopes, Fernando Perdigão
2006	Improved source modeling and predictive classification for channel robust speech recognition. Valentin Ion, Reinhold Haeb-Umbach
2006	Improved speech activity detection using cross-channel features for recognition of multiparty meetings. Kofi Boakye, Andreas Stolcke
2006	Improved tone modeling for Mandarin broadcast news speech recognition. Xin Lei, Man-Hung Siu, Mei-Yuh Hwang, Mari Ostendorf, Tan Lee
2006	Improved topic classification over maximum entropy model using k-norm based new objectives. Xiang Li, Ea-Ee Jan, Cheng Wu, David M. Lubensky
2006	Improved warping-invariant features for automatic speech recognition. Jan Rademacher, Matthias Wächter, Alfred Mertins
2006	Improvement speaker clustering using global similarity features. Konstantin Biatov, Joachim Köhler
2006	Improvements to bucket box intersection algorithm for fast GMM computation in embedded speech recognition systems. Min Tang, Aravind Ganapathiraju
2006	Improving Arabic HMM based speech synthesis quality. Ossama Abdel-Hamid, Sherif Mahdy Abdou, Mohsen A. Rashwan
2006	Improving body transmitted unvoiced speech with statistical voice conversion. Mikihiro Nakagiri, Tomoki Toda, Hideki Kashioka, Kiyohiro Shikano
2006	Improving glottal waveform estimation through rank-based glottal quality assessment. Elliot Moore II, Juan F. Torres
2006	Improving perplexity measures to incorporate acoustic confusability. Amit Anil Nanavati, Nitendra Rajput
2006	Improving phrase-based Korean-English statistical machine translation. Jonghoon Lee, Donghyeon Lee, Gary Geunbae Lee
2006	Improving speech recognition accuracy with multi-confidence thresholding. Shuangyu Chang
2006	Improving speech recognition of two simultaneous speech signals by integrating ICA BSS and automatic missing feature mask generation. Ryu Takeda, Shun'ichi Yamamoto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno
2006	Improving the characterization of the alternative hypothesis via kernel discriminant analysis for likelihood ratio-based speaker verification. Yi-Hsiang Chao, Wei-Ho Tsai, Hsin-Min Wang, Ruei-Chuan Chang
2006	Improving the performance of HMM-based voice conversion using context clustering decision tree and appropriate regression matrix format. Long Qin, Yi-Jian Wu, Zhen-Hua Ling, Ren-Hua Wang
2006	Improving the performance of out-of-vocabulary word rejection by using support vector machines. Shilei Huang, Xiang Xie, Jingming Kuang
2006	Improving tone recognition with combined frequency and amplitude modelling. Siwei Wang, Gina-Anne Levow
2006	Incorporating second-order information into two-step major phrase break prediction for Korean. Seungwon Kim, Jinsik Lee, Byeongchang Kim, Gary Geunbae Lee
2006	Incremental learning of MAP context-dependent edit operations for spoken phone number recognition in an embedded platform. Hahn Koo, Yan Ming Cheng
2006	Independent components for acoustic modeling. Jan Trmal, Jan Vanek, Ludek Müller, Jan Zelinka
2006	Individual on-line variance adaptation of frequency filtered parameters for robust ASR. Jesús Vicente-Peña, Fernando Díaz-de-María, W. Bastiaan Kleijn
2006	Infants' ability to extract verbs from continuous speech. Ellen Marklund, Francisco Lacerda
2006	Infinite models for speaker clustering. Fabio Valente
2006	Influence of pause length on listeners' impressions in simultaneous interpretation. Hitomi Tohyama, Shigeki Matsubara
2006	Integrating Festival and Windows. Rhys James Jones, Ambrose Choy, Briony Williams
2006	Integrating phonetic boundary discrimination explicitly into HMM systems. Yu Wang, Eric Fosler-Lussier
2006	Integrating spoken dialog and question answering: the ritel project. Sophie Rosset, Olivier Galibert, Gabriel Illouz, Aurélien Max
2006	Integration of a CELP coder in the ARDOR universal sound codec. Balázs Kövesi, Dominique Massaloux, David Virette, Julien Bensa
2006	Intelligibility of machine translation output in speech synthesis. Laura Mayfield Tomokiyo, Kay Peterson, Alan W. Black, Kevin A. Lenzo
2006	Interleaving and MMSE estimation with VQ replicas for distributed speech recognition over lossy packet networks. Angel M. Gomez, Antonio M. Peinado, Victoria E. Sánchez, José L. Carmona, Antonio J. Rubio
2006	Intonational cues to student questions in tutoring dialogs. Jennifer J. Venditti, Julia Hirschberg, Jackson Liscombe
2006	Intra-speaker variability compensation in speaker verification with limited enrolling data. Claudio Garretón, Néstor Becerra Yoma, Carlos Molina, Fernando Huenupán
2006	Investigating automatic decomposition for ASR in less represented languages. Thomas Pellegrini, Lori Lamel
2006	Investigation on Mandarin broadcast news speech recognition. Mei-Yuh Hwang, Xin Lei, Wen Wang, Takahiro Shinozaki
2006	Investigation on rescoring using minimum verification error (MVE) detectors. Qiang Fu, Biing-Hwang Juang
2006	Investigations of issues for using multiple acoustic models to improve continuous speech recognition. Rong Zhang, Alexander I. Rudnicky
2006	Is ASR accurate enough for automated reading tutors, and how can we tell? Jack Mostow
2006	Is voice quality enough? - study on how the situation and user's awareness influence the utterance features. Shinya Yamada, Toshihiko Itoh, Kenji Araki
2006	Issues with uncertainty decoding for noise robust speech recognition. Hank Liao, Mark J. F. Gales
2006	Joint interpretation of input speech and pen gestures for multimodal human-computer interaction. Pui-Yu Hui, Helen M. Meng
2006	Joint prosodic and segmental unit selection speech synthesis. Robert A. J. Clark, Simon King
2006	LDA based feature estimation methods for LVCSR. Janne Pylkkönen
2006	LINTest: a development tool for testing dialogue systems. Lars Degerstedt, Arne Jönsson
2006	Language model adaptation for tiny adaptation corpora. Dietrich Klakow
2006	Language model adaptation with a word list and a raw corpus. Shinsuke Mori
2006	Language modeling of Chinese personal names based on character units for continuous Chinese speech recognition. Xinhui Hu, Hirofumi Yamamoto, Gen-ichiro Kikui, Yoshinori Sagisaka
2006	Language, gender, speaking style and language proficiency as factors influencing the autonomous vocalic filler production in spontaneous speech. Ioana Vasilescu, Martine Adda-Decker
2006	Latent prosodic modeling (LPM) for speech with applications in recognizing spontaneous Mandarin speech with disfluencies. Che-Kuang Lin, Lin-Shan Lee
2006	Lattice LP filtering for noise reduction in speech signals. Erhard Rank, Gernot Kubin
2006	Lattice extension and rescoring based approaches for LVCSR of Turkish. Ebru Arisoy, Murat Saraclar
2006	Learning from errors in grapheme-to-phoneme conversion. Tatyana Polyakova, Antonio Bonafonte
2006	Learning multi-goal dialogue strategies using reinforcement learning with reduced state-action spaces. Heriberto Cuayáhuitl, Steve Renals, Oliver Lemon, Hiroshi Shimodaira
2006	Lexical stress in continuous speech recognition. Rogier C. van Dalen, Pascal Wiggers, Léon J. M. Rothkrantz
2006	Limitations of MLLR adaptation with Spanish-accented English: an error analysis. Constance Clarke, Daniel Jurafsky
2006	Lingua machinae - an unorthodox proposal. Florian Schiel, Christoph Draxler, Marion Libossek
2006	Linguistic tuple segmentation in n-gram-based statistical machine translation. Adrià de Gispert, José B. Mariño
2006	Local transformation models for speech recognition. Antonio Miguel, Eduardo Lleida, Alfons Juan, Luis Buera, Alfonso Ortega, Oscar Saz
2006	Locating phone boundaries from acoustic discontinuities using a two-staged approach. Pairote Leelaphattarakij, Proadpran Punyabukkana, Atiwong Suchato
2006	Lost speech reconstruction method using speech recognition based on missing feature theory and HMM-based speech synthesis. Shingo Kuroiwa, Satoru Tsuge, Fuji Ren
2006	Low complexity LID using pruned pattern tables of LZW. S. V. Basavaraja, T. V. Sreenivas
2006	Low-complexity and efficient classification of voiced/unvoiced/silence for noisy environments. Tuan Van Pham, Gernot Kubin
2006	Low-resource autodiacritization of abjads for speech keyword search. Patrick Schone
2006	MMSE estimation of complex-valued discrete Fourier coefficients with generalized gamma priors. Jesper Jensen, Richard C. Hendriks, Jan S. Erkelens, Richard Heusdens
2006	Manifold HLDA and its application to robust speech recognition. Toshiaki Kubo, Tetsuji Ogawa, Tetsunori Kobayashi
2006	Map-based adaptation for speech conversion using adaptation data selection and non-parallel training. Chung-Han Lee, Chung-Hsien Wu
2006	Mapping neural networks for bandwidth extension of narrowband speech. A. Shahina, B. Yegnanarayana
2006	Max-Gabor analysis and synthesis of spectrograms. Tony Ezzat, Jake V. Bouvrie, Tomaso A. Poggio
2006	Maximum entropy modeling for diacritization of Arabic text. Ruhi Sarikaya, Ossama Emam, Imed Zitouni, Yuqing Gao
2006	Maximum likelihood voice conversion based on GMM with STRAIGHT mixed excitation. Yamato Ohtani, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano
2006	Measuring and comparing vowel qualities in a Dutch spontaneous speech corpus. Irene Jacobi, Louis C. W. Pols, Jan Stroop
2006	Measuring the acceptable word error rate of machine-generated webcast transcripts. Cosmin Munteanu, Gerald Penn, Ronald Baecker, Elaine G. Toms, David James
2006	Memo: towards automatic usability evaluation of spoken dialogue services by user error simulations. Sebastian Möller, Roman Englert, Klaus-Peter Engelbrecht, Verena Vanessa Hafner, Anthony Jameson, Antti Oulasvirta, Alexander Raake, Norbert Reithinger
2006	Minimum boundary error training for automatic phonetic segmentation. Jen-Wei Kuo, Hsin-Min Wang
2006	Minimum classification error training of hidden Markov models for acoustic language identification. Josef G. Bauer, Ekaterina Timoshenko
2006	Minimum divergence based discriminative training. Jun Du, Peng Liu, Frank K. Soong, Jian-Lai Zhou, Ren-Hua Wang
2006	Minimum generation error criterion for tree-based clustering of context dependent HMMs. Yi-Jian Wu, Wu Guo, Ren-Hua Wang
2006	Missing data mask models with global frequency and temporal constraints. Sébastien Demange, Christophe Cerisara, Jean Paul Haton
2006	Missing feature theory with soft spectral subtraction for speaker verification. Michael T. Padilla, Thomas F. Quatieri, Douglas A. Reynolds
2006	Missing-feature reconstruction for band-limited speech recognition in spoken document retrieval. Wooil Kim, John H. L. Hansen
2006	Modeling of speech signals based on Bessel-like orthogonal transform. Giorgio Biagetti, Paolo Crippa, Claudio Turchetti
2006	Modeling sensory-to-motor mappings using neural nets and a 3d articulatory speech synthesizer. Bernd J. Kröger, Peter Birkholz, Jim Kannampuzha, Christiane Neuschaefer-Rube
2006	Modeling the acoustic correlates of expressive elements in text genres for expressive text-to-speech synthesis. Hongwu Yang, Helen M. Meng, Lianhong Cai
2006	Modeling the precedence effect for binaural sound source localization in noisy and echoic environments. Martin Heckmann, Tobias Rodemann, Björn Schölling, Frank Joublin, Christian Goerick
2006	Modelling aspiration noise during phonation using the LF voice source model. Christer Gobl
2006	Modified phase opponency based solution to the speech separation challenge. Om Deshmukh, Carol Y. Espy-Wilson
2006	Monitoring of the natural voice variations in open and closed phases with frequency warped ARMA modeling. Pedro J. Quintana-Morales, Juan L. Navarro-Mesa, Antonio G. Ravelo-García, Fernando D. Lorenzo-García
2006	Moving speech recognition from software to silicon: the in silico vox project. Edward C. Lin, Kai Yu, Rob A. Rutenbar, Tsuhan Chen
2006	Multi-accent Chinese speech recognition. Yi Liu, Pascale Fung
2006	Multi-domain text-to-speech synthesis by automatic text classification. Francesc Alías, Joan Claudi Socoró, Xavier Sevillano, Ignasi Iriondo Sanz, Xavier Gonzalvo
2006	Multi-flow block interleaving applied to distributed speech recognition over IP networks. Angel M. Gomez, Juan J. Ramos-Muñoz, Antonio M. Peinado, Victoria E. Sánchez
2006	Multi-layered summarization of spoken document archives by information extraction and semantic structuring. Lin-Shan Lee, Sheng-yi Kong, Yi-Cheng Pan, Yi-Sheng Fu, Yu-tsun Huang
2006	Multi-microphone periodicity function for robust F0 estimation in real noisy and reverberant environments. Federico Flego, Maurizio Omologo
2006	Multi-modal system ICANDO: intellectual computer assistant for disabled operators. Alexey Karpov, Andrey Ronzhin, Alexandre Cadiou
2006	Multi-source far-distance microphone selection and combination for automatic transcription of lectures. Matthias Wölfel, Christian Fügen, Shajith Ikbal, John W. McDonough
2006	Multi-stream ASR: an oracle perspective. Hemant Misra, Jithendra Vepa, Hervé Bourlard
2006	Multi-stream speaker diarization systems for the meetings domain. Ascensión Gallardo-Antolín, Xavier Anguera, Chuck Wooters
2006	Multilingual non-native speech recognition using phonetic confusion-based acoustic model modification and graphemic constraints. Ghazi Bouselmi, Dominique Fohr, Irina Illina, Jean Paul Haton
2006	Multimodal authentication using qualitative support vector machines. Fawaz Alsaade, Aladdin M. Ariyaeeinia, L. Meng, Amit S. Malegaonkar
2006	Multistage convolutive blind source separation for speech mixture. Yanxue Liang, Ichiro Hagiwara
2006	Multivariate analysis of frame-based acoustic cues of dysperiodicities in connected speech. Abdellah Kacha, Francis Grenez, Jean Schoentgen
2006	Nasality perception of vowels in different language background. Shahina Haque, Tomio Takara
2006	Native and nonnative audio-visual perception of English fricatives in quiet and cafe-noise backgrounds. Yue Wang, Dawn M. Behne, Haisheng Jiang, Chad Danyluck
2006	New 20-word lists for word intelligibility test in Japanese. Shuichi Sakamoto, Tadahiro Yoshikawa, Shigeaki Amano, Yôiti Suzuki, Tadahisa Kondo
2006	New considerations for vowel nasalization based on separate mouth-nose recording. Gang Feng, Cyril Kotenkoff
2006	New improvements in decoding speed and latency for automatic captioning. Jian Xue, Rusheng Hu, Yunxin Zhao
2006	New measures to chart toddlers' speech perception and language development: a test of the lexical restructuring hypothesis. Iris-Corinna Schwarz, Denis Burnham
2006	Ninth International Conference on Spoken Language Processing, INTERSPEECH-ICSLP 2006, Pittsburgh, PA, USA, September 17-21, 2006
2006	Noise robust model-based voice activity detection. Ángel de la Torre, Javier Ramírez, M. Carmen Benítez, José C. Segura, Luz García, Antonio J. Rubio
2006	Noise update modeling for speech enhancement: when do we do enough? Nitish Krishnamurthy, John H. L. Hansen
2006	Noise-robust speech recognition of conversational telephone speech. Gang Chen, Hesham Tolba, Douglas D. O'Shaughnessy
2006	Noisy speech recognition based on selection of multiple noise suppression methods using noise GMMs. Norihide Kitaoka, Souta Hamaguchi, Seiichi Nakagawa
2006	Non-intrusive speech quality assessment with low computational complexity. Volodya Grancharov, David Yuheng Zhao, Jonas Lindblom, W. Bastiaan Kleijn
2006	Nonlinear dynamical invariants for speech recognition. S. Prasad, Sundararajan Srinivasan, M. Pannuri, Georgios Y. Lazarou, Joseph Picone
2006	Normalization of the inter-frame information using smoothing filtering. Luz García, José C. Segura, M. Carmen Benítez, Javier Ramírez, Ángel de la Torre
2006	Novel entropy based moving average refiners for HMM landmarks. Rahul Chitturi, Mark Hasegawa-Johnson
2006	Novel method for data clustering and mode selection with application in voice conversion. Jani Nurminen, Jilei Tian, Victor Popa
2006	Novel time domain multi-class SVMs for landmark detection. Rahul Chitturi, Mark Hasegawa-Johnson
2006	Objective estimation of suicidal risk using vocal output characteristics. T. Yingthawornsuk, H. Kaymaz Keskinpala, Daniel J. France, D. Mitchell Wilkes, Richard G. Shiavi, Ronald M. Salomon
2006	Observations of the spoken language acquisition process based on a multimodal infant behavior corpus. Ryo Tsuji, Tomohiko Kasami, Shogo Ishikawa, Shinya Kiriyama, Yoichi Takebayashi, Shigeyoshi Kitazawa
2006	On a greedy learning algorithm for dPLRM with applications to phonetic feature detection. Tor André Myrvoll, Tomoko Matsui
2006	On designing context sensitive language models for spoken dialog systems. Vaibhava Goel, Ramesh A. Gopinath
2006	On speaker-specific prosodic models for automatic dialog act segmentation of multi-party meetings. Jáchym Kolár, Elizabeth Shriberg, Yang Liu
2006	On speech variation and word type differentiation by articulatory feature representations. Louis ten Bosch, R. Harald Baayen, Mirjam Ernestus
2006	On the correlation between energy and pitch accent in read English speech. Andrew Rosenberg, Julia Hirschberg
2006	On the fusion of prosody, voice spectrum and face features for multimodal person verification. M. Farrús, Ainara Garde, Pascual Ejarque, Jordi Luque, Javier Hernando
2006	On the relation between maximum spectral transition positions and phone boundaries. Sorin Dusan, Lawrence R. Rabiner
2006	On the sufficiency and redundancy of pitch for TRP projection. Wieneke Wesseling, Rob van Son, Louis C. W. Pols
2006	On the sufficiency of automatic phonetic transcriptions for pronunciation variation research. Christophe Van Bael, Hans van Halteren
2006	On the use of Jacobian adaptation in real speaker verification applications. Jan Anguita, Javier Hernando
2006	On the use of morphological analysis for dialectal Arabic speech recognition. Mohamed Afify, Ruhi Sarikaya, Hong-Kwang Jeff Kuo, Laurent Besacier, Yuqing Gao
2006	Online speaker change detection by combining BIC with microphone array beamforming. Joerg Schmalenstroeer, Reinhold Haeb-Umbach
2006	Online speech detection and dual-gender speech recognition for captioning broadcast news. Toru Imai, Shoei Sato, Akio Kobayashi, Kazuo Onoe, Shinichi Homma
2006	Open-vocabulary spoken document retrieval based on new subword models and subword phonetic similarity. Kohei Iwata, Yoshiaki Itoh, Kazunori Kojima, Masaaki Ishigame, Kazuyo Tanaka, Shi-wook Lee
2006	Opinion mining in a telephone survey corpus. Nathalie Camelin, Géraldine Damnati, Frédéric Béchet, Renato De Mori
2006	Optimization of class weights for LDA feature transformations. Andrej Ljolje
2006	Optimizing components for handheld two-way speech translation for an English-Iraqi Arabic system. Roger Hsiao, Ashish Venugopal, Thilo Köhler, Ying Zhang, Paisarn Charoenpornsawat, Andreas Zollmann, Stephan Vogel, Alan W. Black, Tanja Schultz, Alex Waibel
2006	Pauses as a tool to ensure rhythmic wellformedness. Augustin Speyer
2006	Perception of fundamental frequency in cochlear implant patients. Ángel de la Torre, Cristina Roldán, Manuel Sainz
2006	Perceptive and acoustic measurement of average speaking pitch of female and male speakers in German radio news. Sven Grawunder, Ines Bose, Birgit Hertha, Franziska Trauselt, Lutz Christian Anders
2006	Perceptual identification and phonetic analysis of 6 foreign accents in French. Bianca Vieru-Dimulescu, Philippe Boula de Mareüil
2006	Performance analysis of various single channel speech enhancement algorithms for automatic speech recognition. Myung-Suk Song, Chang-Heon Lee, Hong-Goo Kang
2006	Performance evaluation of three features for model-based single channel speech separation problem. Mohammad H. Radfar, Richard M. Dansereau, Abolghasem Sayadiyan
2006	Performance improvement of dialog speech translation by rejecting unreliable utterances. Toshiyuki Takezawa, Tohru Shimizu
2006	Perplexity based linguistic model adaptation for speech summarisation. Pierre Chatain, Edward W. D. Whittaker, Joanna Mrozinski, Sadaoki Furui
2006	Personality factors in human deception detection: comparing human to machine performance. Frank Enos, Stefan Benus, Robin L. Cautin, Martin Graciarena, Julia Hirschberg, Elizabeth Shriberg
2006	Phone recognition analysis for trajectory HMM. Le Zhang, Steve Renals
2006	Phone vector DHMM to decode a phone recognizer's output. Bong-Wan Kim, Dae-Lim Choi, Yongnam Um, Yong-Ju Lee
2006	Phoneme recognition based on Fisher weight map to higher-order local auto-correlation. Yasuo Ariki, Shunsuke Kato, Tetsuya Takiguchi
2006	Phoneme-to-grapheme mapping for spoken inquiries to the semantic web. Axel Horndasch, Elmar Nöth, Anton Batliner, Volker Warnke
2006	Phonetic research on accented Chinese in three dialectal regions: Shanghai, Wuhan and Xiamen. Aijun Li, Qiang Fang, Ziyu Xiong
2006	Phonetically enriched labeling in unit selection TTS synthesis. Yeon-Jun Kim, Ann K. Syrdal, Alistair Conkie, Marc C. Beutnagel
2006	Phrase break prediction using logistic generalized linear model. Lifu Yi, Jian Li, Xiaoyan Lou, Jie Hao
2006	Physiologically-motivated synchrony-based processing for robust automatic speech recognition. Chanwoo Kim, Yu-Hsiang Bosco Chiu, Richard M. Stern
2006	Pitch determination using aligned AMDF. M. Shahidur Rahman, Hirobumi Tanaka, Tetsuya Shimamura
2006	Pitch range and pause duration as markers of discourse hierarchy: perception experiments. Jörg Mayer, Ekaterina Jasinskaja, Ulrike Kölsch
2006	Pitch resynchronization while recovering from a late frame in a predictive speech decoder. Kyle D. Anderson, Philippe Gournay
2006	Pitch-scale modification using the modulated aspiration noise source. Daryush D. Mehta, Thomas F. Quatieri
2006	Posterior based keyword spotting with a priori thresholds. Hamed Ketabdar, Jithendra Vepa, Samy Bengio, Hervé Bourlard
2006	Potential relevance of audio-visual integration in mammals for computational modeling. Eeva Klintfors, Francisco Lacerda
2006	Powered cepstral normalization (p-CN) for robust features in speech recognition. Chang-Wen Hsu, Lin-Shan Lee
2006	Productions in bilinguism, early foreign language learning and monolinguism: a prosodic comparison. Ranka Bijeljac-Babic, Christelle Dodane, Sabine Metta, Claire Gerard
2006	Prominent words as anchors for TRP projection. Rob van Son, Wieneke Wesseling, Louis C. W. Pols
2006	Prompt selection with reinforcement learning in an AT&T call routing application. Charles Lewis, Giuseppe Di Fabbrizio
2006	Pronunciation dependent language models. Andrej Ljolje
2006	Pronunciation variant-based multi-path HMMs for syllables. Annika Hämäläinen, Louis ten Bosch, Lou Boves
2006	Pronunciation variation modeling for Mandarin with accent. Chi Zhang, Ji Wu, Xi Xiao, Zuoying Wang
2006	Pronunciation verification of children's speech for automatic literacy assessment. Joseph Tepperman, Jorge F. Silva, Abe Kazemzadeh, Hong You, Sungbok Lee, Abeer Alwan, Shrikanth S. Narayanan
2006	Prosodic boundaries in Czech: an experiment based on delexicalized speech. Tomás Dubeda
2006	Prosodic feature generation for back-channel prediction. Thamar Solorio, Olac Fuentes, Nigel G. Ward, Yaffa Al Bayyari
2006	Prosodic features for a maximum entropy language model. Oscar Chan, Roberto Togneri
2006	Prosodic features for speaker verification. Leena Mary, B. Yegnanarayana
2006	Prosodic modeling in large vocabulary Mandarin speech recognition. Jui-Ting Huang, Lin-Shan Lee
2006	Prosody of interrogative and affirmative sentences in Vietnamese language: analysis and perceptive results. Minh-Quang Vu, Do Dat Tran, Eric Castelli
2006	Prototyping a CALL system for students of Japanese using dynamic diagram generation and interactive hints. Christopher J. Waple, Yasushi Tsubota, Masatake Dantsuji, Tatsuya Kawahara
2006	QASR: question answering using semantic roles for speech interface. Svetlana Stenchikova, Dilek Hakkani-Tür, Gökhan Tür
2006	Quality improvement of telephone speech by artificial bandwidth expansion - listening tests in three languages. Hannu Pulakka, Laura Laaksonen, Paavo Alku
2006	Question answering with discriminative learning algorithms. Junlan Feng
2006	Quick individual fitting methods of simplified hearing compensation for elderly people. Kengo Fujita, Tsuneo Kato, Hisashi Kawai
2006	Radiobot-CFF: a spoken dialogue system for military training. Antonio Roque, Anton Leuski, Vivek Kumar Rangarajan Sridhar, Susan Robinson, Ashish Vaswani, Shrikanth S. Narayanan, David R. Traum
2006	Rapid simulation-driven reinforcement learning of multimodal dialog strategies in human-robot interaction. Thomas Prommer, Hartwig Holzapfel, Alex Waibel
2006	Rapid speaker adaptation using regression-tree based spectral peak alignment. Shizhen Wang, Xiaodong Cui, Abeer Alwan
2006	Real vs. acted emotional speech. Janneke Wilting, Emiel Krahmer, Marc Swerts
2006	Real-life emotions detection with lexical and paralinguistic cues on human-human call center dialogs. Laurence Devillers, Laurence Vidrascu
2006	Real-time synthesis of Chinese visual speech and facial expressions using MPEG-4 FAP features in a three-dimensional avatar. Zhiyong Wu, Shen Zhang, Lianhong Cai, Helen M. Meng
2006	Realizations and representations of Thai tones in monomoraic syllables. Rattima Nitisaroj
2006	Recent advances in phonotactic language recognition using binary-decision trees. Jirí Navrátil
2006	Recent advances in speech fragment decoding techniques. Jon Barker, André Coy, Ning Ma, Martin Cooke
2006	Recent advances of IBM's handheld speech translation system. Weizhong Zhu, Bowen Zhou, Charles Prosser, Pavel Krbec, Yuqing Gao
2006	Recent progress on the discriminative region-dependent transform for speech feature extraction. Bing Zhang, Spyros Matsoukas, Richard M. Schwartz
2006	Recognition of classroom lectures in European Portuguese. Isabel Trancoso, Ricardo Nunes, Luís Neves, Céu Viana, Helena Moniz, Diamantino Caseiro, Ana Isabel Mata
2006	Recognition of interest in human conversational speech. Björn W. Schuller, Niels Köhler, Ronald Müller, Gerhard Rigoll
2006	Reconstructing tongue movements from audio and video. Hedvig Kjellström, Olov Engwall, Olle Bälter
2006	Reducing computation on parallel decoding using frame-wise confidence scores. Tomohiro Hakamata, Akinobu Lee, Yoshihiko Nankaku, Keiichi Tokuda
2006	Reducing speech coding distortion for speaker identification. Alan McCree
2006	Redundancy and productivity in the speech technology lexicon - can we do better? Susan Fitt, Korin Richmond
2006	Respiratory/laryngeal interactions during sustained vowel production in children. Donald S. Finan, Carol A. Boliek
2006	Robust acoustic-based syllable detection. Zhimin Xie, Partha Niyogi
2006	Robust automatic speech recognition for accented Mandarin in car environments. Pei Ding, Lei He, Xiang Yan, Jie Hao
2006	Robust feature extraction based on spectral peaks of group delay and autocorrelation function and phase domain analysis. Gholamreza Farahani, Seyed Mohammad Ahadi, Mohammad Mehdi Homayounpour
2006	Robust feature space adaptation for telephony speech recognition. Xin Lei, Jon Hamaker, Xiaodong He
2006	Robust interpretation in dialogue by combining confidence scores with contextual features. Matthew Purver, Florin Ratiu, Lawrence Cavedon
2006	Robust phone lattice decoding. Kris Demuynck, Dirk Van Compernolle, Hugo Van hamme
2006	Robust speaker diarization for meetings: ICSI RT06s evaluation system. Xavier Anguera, Chuck Wooters, José M. Pardo
2006	Robust speech recognition by modifying clean and telephone feature vectors using bidirectional neural network. Mansoor Vali, Seyyed Ali Seyyed Salehi, Kazem Karimi
2006	Robust speech recognition over mobile networks using combined weighted viterbi decoding and subvector based error concealment. Zheng-Hua Tan, Paul Dalsgaard, Børge Lindberg
2006	Role of phase estimation in speech enhancement. Benjamin J. Shannon, Kuldip K. Paliwal
2006	SPAM and full covariance for speech recognition. Daniel Povey
2006	Saliency parsing for automated directory assistance. Issac Alphonso, Shuangyu Chang
2006	Scalable and portable web-based multimodal dialogue interaction with geographical databases. Alexander Gruenstein, Stephanie Seneff, Chao Wang
2006	Segment connection networks for corpus-based speech synthesis. Geert Coorman
2006	Segmental duration modeling in Turkish. Özlem Öztürk, Tolga Çiloglu
2006	Selective-LPC based representation of STRAIGHT spectrum and its applications in spectral smoothing. Heng Kang, Wenju Liu
2006	Semi-automatic extraction of vocal tract movements from cineradiographic data. Julie Fontecave, Frédéric Berthommier
2006	Sentence boundary detection of spontaneous Japanese using statistical language model and support vector machines. Yuya Akita, Masahiro Saikou, Hiroaki Nanjo, Tatsuya Kawahara
2006	Sentence boundary detection using sequential dependency analysis combined with CRF-based chunking. Takanobu Oba, Takaaki Hori, Atsushi Nakamura
2006	Sequence classification for machine translation. Srinivas Bangalore, Patrick Haffner, Stephan Kanthak
2006	Signal modification incorporating perceptual weighting filter. Joon-Hyuk Chang, Woohyung Lim, Nam Soo Kim
2006	Significance of formants from difference spectrum for speaker identification. Kishore Prahallad, Varanasi Sudhakar, Veluru Ranganatham, Krishna M. Bharat, S. Roy Debashish
2006	Silence energy normalization for robust speech recognition in additive noise environment. Chung-fu Tai, Jeih-weih Hung
2006	Single channel speech enhancement by frequency domain constrained optimization and temporal masking. Wen Jin, Michael S. Scordilis
2006	Single frame selection for phoneme classification. Tingyao Wu, Dirk Van Compernolle, Jacques Duchateau, Hugo Van hamme
2006	Single-channel speech separation using sparse non-negative matrix factorization. Mikkel N. Schmidt, Rasmus Kongsgaard Olsson
2006	Six approaches to limited domain concatenative speech synthesis. Robert J. Utama, Ann K. Syrdal, Alistair Conkie
2006	Sloparl - Slovenian parliamentary speech and text corpus for large vocabulary continuous speech recognition. Andrej Zgank, Tomaz Rotovnik, Matej Grasic, Marko Kos, Damjan Vlaj, Zdravko Kacic
2006	Soft decision combining for dual channel noise reduction. Timo Gerkmann, Rainer Martin
2006	Soft margin estimation of hidden Markov model parameters. Jinyu Li, Ming Yuan, Chin-Hui Lee
2006	Software architectures for incremental understanding of human speech. Gregory Aist, James F. Allen, Ellen Campana, Lucian Galescu, Carlos Gómez Gallo, Scott C. Stoness, Mary D. Swift, Michael K. Tanenhaus
2006	Solving large margin estimation of HMMs via semidefinite programming. Xinwei Li, Hui Jiang
2006	Soundbite detection in broadcast news domain. Sameer Maskey, Julia Hirschberg
2006	Sparseness and speech perception in noise. Guoping Li, Mark E. Lutman
2006	Speaker adaptation of trajectory HMMs using feature-space MLLR. Heiga Zen, Yoshihiko Nankaku, Keiichi Tokuda, Tadashi Kitamura
2006	Speaker adaptation using evolutionary-based linear transform. Sid-Ahmed Selouani, Douglas D. O'Shaughnessy
2006	Speaker cluster based GMM tokenization for speaker recognition. Bin Ma, Donglai Zhu, Rong Tong, Haizhou Li
2006	Speaker clustered regression-class trees for MLLR adaptation. Arindam Mandal, Mari Ostendorf, Andreas Stolcke
2006	Speaker diarization for multiple distant microphone meetings: mixing acoustic features and inter-channel time differences. José M. Pardo, Xavier Anguera, Chuck Wooters
2006	Speaker identification under noisy environments by using harmonic structure extraction and reliable frame weighting. Hiromasa Fujihara, Tetsuro Kitahara, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno
2006	Speaker independent voiced-unvoiced detection evaluated in different speaking styles. Martin Heckmann, Marco Moebus, Frank Joublin, Christian Goerick
2006	Speaker localization based on oriented global coherence field. Alessio Brutti, Maurizio Omologo, Piergiorgio Svaizer
2006	Speaker verification with non-audible murmur segments. Mariko Kojima, Tomoko Matsui, Hiromichi Kawanami, Hiroshi Saruwatari, Kiyohiro Shikano
2006	Speaking aid system for total laryngectomees using voice conversion of body transmitted artificial speech. Keigo Nakamura, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano
2006	Speaking faces for face-voice speaker identity verification. Girija Chetty, Michael Wagner
2006	Specificity and generalizability of spontaneous phonetic imitation. Kuniko Y. Nielsen
2006	Speech analyzer using a joint estimation model of spectral envelope and fine structure. Hirokazu Kameoka, Jonathan Le Roux, Nobutaka Ono, Shigeki Sagayama
2006	Speech and speech recognition during dictation corrections. Keith Vertanen
2006	Speech enhancement based on residual noise shaping. Jong Won Shin, Seung Yeol Lee, Hwan Sik Yun, Nam Soo Kim
2006	Speech enhancement based on spectral estimation from higher-lag autocorrelation. Benjamin J. Shannon, Kuldip K. Paliwal, Climent Nadeu
2006	Speech enhancement using modified phase opponency model. Om Deshmukh, Carol Y. Espy-Wilson
2006	Speech recognition of foreign out-of-vocabulary words using a hierarchical language model. Hirofumi Yamamoto, Gen-ichiro Kikui, Satoshi Nakamura, Yoshinori Sagisaka
2006	Speech recognition using factorial hidden Markov models for separation in the feature space. Tuomas Virtanen
2006	Speech recognition with phonological features: some issues to attend. Frederik Stouten, Jean-Pierre Martens
2006	Speech technology for minority languages: the case of Irish (Gaelic). Ailbhe Ní Chasaide, John Wogan, Brian Ó Raghallaigh, Áine Ní Bhriain, Eric Zoerner, Harald Berthelsen, Christer Gobl
2006	Speech/non-speech discrimination combining advanced feature extraction and SVM learning. Javier Ramírez, Pablo Yélamos, J. M. Górriz, José C. Segura, Luz García
2006	Spoken language technologies applied to digital talking books. Isabel Trancoso, Carlos Duarte, António Joaquim Serralheiro, Diamantino Caseiro, Luís Carriço, Céu Viana
2006	Spontaneous Thai speech recognition. Monika Woszczyna, Paisarn Charoenpornsawat, Tanja Schultz
2006	State-level variable modeling for phoneme classification. Hao-Zheng Li, Douglas D. O'Shaughnessy
2006	Statistical analysis and performance of DFT domain noise reduction filters for robust speech recognition. Colin Breithaupt, Rainer Martin
2006	Steady-state suppression in reverberation: a comparison of native and nonnative speech perception. Nao Hodoshima, Dawn M. Behne, Takayuki Arai
2006	Stochastic vector mapping-based feature enhancement using prior model and environment adaptation for noisy speech recognition. Chia-Hsin Hsieh, Chung-Hsien Wu, Jun-Yu Lin
2006	Study of time and frequency variability in pathological speech and error reduction methods for automatic speech recognition. Oscar Saz, Antonio Miguel, Eduardo Lleida, Alfonso Ortega, Luis Buera
2006	Study on speaker verification on emotional speech. Wei Wu, Thomas Fang Zheng, Ming-Xing Xu, Huanjun Bao
2006	Sub-word unit based non-audible speech recognition using surface electromyography. Matthias Walliczek, Florian Kraft, Szu-Chen Stan Jou, Tanja Schultz, Alex Waibel
2006	Subspace modeling and selection for noisy speech recognition. Jen-Tzung Chien, Chuan-Wei Ting
2006	Substitute sounds for ventriloquism and speech disorders. Jörg Metzner, Marcel Schmittfull, Karl Schnell
2006	Summarization evaluation for text and speech: issues and approaches. Ani Nenkova
2006	Summarization of spontaneous conversations. Xiaodan Zhu, Gerald Penn
2006	Super-human multi-talker speech recognition: the IBM 2006 speech separation challenge system. Trausti T. Kristjansson, John R. Hershey, Peder A. Olsen, Steven J. Rennie, Ramesh A. Gopinath
2006	Syllable-length path mixture hidden Markov models with trajectory clustering for continuous speech recognition. Yan Han, Lou Boves
2006	Synthesizing breathiness in natural speech with sinusoidal modelling. Brett Matthews, Raimo Bakis, Ellen Eide
2006	System- versus user-initiative dialog strategy for driver information systems. Chantal Ackermann, Marion Libossek
2006	TDA: a new trainable trajectory formation system for facial animation. Oxana Govokhina, Gérard Bailly, Gaspard Breton, Paul C. Bagshaw
2006	Testing the effect of audiovisual cues to prominence via a reaction-time experiment. Emiel Krahmer, Marc Swerts
2006	Text-independent cross-language voice conversion. David Sündermann, Harald Höge, Antonio Bonafonte, Hermann Ney, Julia Hirschberg
2006	Text-independent speaker identification in birds. E. J. S. Fox, J. D. Roberts, Mohammed Bennamoun
2006	The 2006 RWTH parliamentary speeches transcription system. Jonas Lööf, Maximilian Bisani, Christian Gollan, Georg Heigold, Björn Hoffmeister, Christian Plahl, Ralf Schlüter, Hermann Ney
2006	The IBM 2006 speech transcription system for European parliamentary speeches. Bhuvana Ramabhadran, Olivier Siohan, Lidia Mangu, Geoffrey Zweig, Martin Westphal, Henrik Schulz, Alvaro Soneiro
2006	The ICSI+ multilingual sentence segmentation system. M. Zimmerman, Dilek Hakkani-Tür, James G. Fung, Nikki Mirghafori, Luke R. Gottlieb, Elizabeth Shriberg, Yang Liu
2006	The importance of different facial areas for signalling visual prominence. Marc Swerts, Emiel Krahmer
2006	The role of positional probability in the segmentation of Cantonese speech. Michael C. W. Yip
2006	The role of prosody in the perception of US native English accents. Ayako Ikeno, John H. L. Hansen
2006	The segmentation of multi-channel meeting recordings for automatic speech recognition. John Dines, Jithendra Vepa, Thomas Hain
2006	The target cost formulation in unit selection speech synthesis. Paul Taylor
2006	The use of Bayesian network for incorporating accent, gender and wide-context dependency information. Sakriani Sakti, Konstantin Markov, Satoshi Nakamura
2006	The vocal joystick data collection effort and vowel corpus. Kelley Kilanski, Jonathan Malkin, Xiao Li, Richard Wright, Jeff A. Bilmes
2006	Thesaurus expansion using similar word pairs from patent documents. Yoshimi Suzuki, Fumiyo Fukumoto
2006	Time-dependent cross-probability model for multi-environment model based LInear normalization. Luis Buera, Eduardo Lleida, Juan Arturo Nolazco-Flores, Antonio Miguel, Alfonso Ortega
2006	Timing levels in segment-based speech emotion recognition. Björn W. Schuller, Gerhard Rigoll
2006	Tone recognition of continuous speech of standard Chinese using neural network and tone nucleus model. Keikichi Hirose, Hui Hu, Xiaodong Wang, Nobuaki Minematsu
2006	Topic-based language modeling with dynamic Bayesian networks. Pascal Wiggers, Léon J. M. Rothkrantz
2006	Totally data-driven duration modeling based on generalized linear model for Mandarin TTS. Lifu Yi, Jian Li, Xiaoyan Lou, Jie Hao
2006	Totally data-driven intonation prediction model using a novel F0 contour parametric representation. Lifu Yi, Jian Li, Xiaoyan Lou, Jie Hao
2006	Towards a comprehensive investigation of factors relevant to peak alignment using a unit selection corpus. Matthias Jilka, Bernd Möbius
2006	Towards a multimodal topic tracking system for a mobile robot. Jan Frederik Maas, Britta Wrede, Gerhard Sagerer
2006	Towards an integrated understanding of speaking rate in conversation. Jiahong Yuan, Mark Y. Liberman, Christopher Cieri
2006	Towards automatic parameter extraction of command-response model for Cantonese. Raymond W. M. Ng, Tan Lee, Wentao Gu
2006	Towards continuous speech recognition using surface electromyography. Szu-Chen Stan Jou, Tanja Schultz, Matthias Walliczek, Florian Kraft, Alex Waibel
2006	Tracking and beamforming for multiple simultaneous speakers with probabilistic data association filters. Tobias Gehrig, Ulrich Klee, John W. McDonough, Shajith Ikbal, Matthias Wölfel, Christian Fügen
2006	Tracking of involuntary formant frequency variations and application to parkinsonian speech. Laurence Cnockaert, Jean Schoentgen, Pascal Auzou, Canan Ozsancak, Francis Grenez
2006	Tracking of visible vocal tract resonances (VVTR) based on Kalman filtering. I. Yücel Özbek, Mübeccel Demirekler
2006	Training native English speakers to identify Japanese vowel length with fast rate sentences. Yukari Hirata, Elizabeth Whitehurst, Emily Cullings, Jacob Whiton, Carol Glenn
2006	Training of coarticulation models using dominance functions and visual unit selection methods for audio-visual speech synthesis. Zdenek Krnoul, Milos Zelezný, Ludek Müller, Jakub Kanis
2006	Two stage transform vector quantization of LSFs for wideband speech coding. Saikat Chatterjee, T. V. Sreenivas
2006	Two-microphone voice activity detection in the presence of coherent interference. Gibak Kim, Nam Ik Cho
2006	Two-stage vocabulary-free spoken document retrieval - subword identification and re-recognition of the identified sections. Yoshiaki Itoh, Takayuki Otake, Kohei Iwata, Kazunori Kojima, Masaaki Ishigame, Kazuyo Tanaka, Shi-wook Lee
2006	Two-step unsupervised speaker adaptation based on speaker and gender recognition and HMM combination. Petr Cerva, Jan Nouza, Jan Silovský
2006	Underlying quality dimensions of modern telephone connections. Marcel Wältermann, Kirstin Scholz, Alexander Raake, Ulrich Heute, Sebastian Möller
2006	Unfilled pauses in Japanese sentences read aloud by non-native learners. Hiroko Hirano, Goh Kawai, Keikichi Hirose, Nobuaki Minematsu
2006	Unifying unit selection and hidden Markov model speech synthesis. Paul Taylor
2006	Unit selection and its relation to symbolic prosody: a new approach. Daniel Tihelka, Jindrich Matousek
2006	Unsupervised Spanish dialect classification. Rongqing Huang, John H. L. Hansen
2006	Unsupervised adaptation for acoustic language identification. Ekaterina Timoshenko, Josef G. Bauer
2006	Unsupervised detection of whispered speech in the presence of normal phonation. Michael A. Carlin, Brett Y. Smolenski, Stanley J. Wenndt
2006	Unsupervised language model adaptation based on automatic text collection from WWW. Motoyuki Suzuki, Yasutomo Kajiura, Akinori Ito, Shozo Makino
2006	Unsupervised language model adaptation for Mandarin broadcast conversation transcription. David Mrva, Philip C. Woodland
2006	Unsupervised language model adaptation using latent semantic marginals. Yik-Cheung Tam, Tanja Schultz
2006	Unsupervised learning of HMM topology for text-dependent speaker verification. Ming Liu, Thomas S. Huang
2006	Unsupervised model adaptation for speaker verification. Alexandre Preti, Jean-François Bonastre
2006	Unsupervised segmentation of words into morphemes - Morpho Challenge 2005 application to automatic speech recognition. Mikko Kurimo, Mathias Creutz, Matti Varjokallio, Ebru Arisoy, Murat Saraclar
2006	Use of incrementally regulated discriminative margins in MCE training for speech recognition. Dong Yu, Li Deng, Xiaodong He, Alex Acero
2006	User expectations and real experience on a multimodal interactive system. Kristiina Jokinen, Topi Hurtig
2006	User responses to prosodic variation in fragmentary grounding utterances in dialog. Gabriel Skantze, David House, Jens Edlund
2006	User simulation for spoken dialogue systems: learning and evaluation. Kallirroi Georgila, James Henderson, Oliver Lemon
2006	Using SVM and error-correcting codes for multiclass dialog act classification in meeting corpus. Yang Liu
2006	Using a differential microphone array to estimate the direction of arrival of two acoustic sources. Fotios Talantzis, Anthony G. Constantinides, Lazaros C. Polymenakos
2006	Using genetic algorithms to weight acoustic features for speaker recognition. Maider Zamalloa, Germán Bordel, Luis Javier Rodríguez, Mikel Peñagarikano, Juan Pedro Uribe
2006	Using latent semantic indexing for morph-based spoken document retrieval. Ville T. Turunen, Mikko Kurimo
2006	Using posterior-based features in template matching for speech recognition. Guillermo Aradilla, Jithendra Vepa, Hervé Bourlard
2006	Using speech recognition technique for constructing a phonetically transcribed Taiwanese (Min-nan) text corpus. Min-Siong Liang, Ren-yuan Lyu, Yuang-Chin Chiang
2006	Using system and user performance features to improve emotion detection in spoken tutoring dialogs. Hua Ai, Diane J. Litman, Katherine Forbes-Riley, Mihai Rotaru, Joel R. Tetreault, Amruta Purandare
2006	Vector Taylor series based joint uncertainty decoding. Haitian Xu, Luca Rigazio, David Kryze
2006	Vector-based spoken language recognition using output coding. Haizhou Li, Bin Ma, Rong Tong
2006	Visual correlates to prominence in several expressive modes. Jonas Beskow, Björn Granström, David House
2006	Visual speech segmentation and speaker recognition for transcription of TV news. Josef Chaloupka
2006	Vocal emotion recognition with cochlear implants. Xin Luo, Qian-Jie Fu, John J. Galvin III
2006	Voice GMM modelling for FESTIVAL/MBROLA emotive TTS synthesis. Mauro Nicolao, Carlo Drioli, Piero Cosi
2006	Voice activity detection in personal audio recordings using autocorrelogram compensation. Keansub Lee, Daniel P. W. Ellis
2006	Voice activity detector based on enhanced cumulant of LPC residual and on-line EM algorithm. David Cournapeau, Tatsuya Kawahara, Kenji Mase, Tomoji Toriyama
2006	Voice conversion based on mixtures of factor analyzers. Yosuke Uto, Yoshihiko Nankaku, Tomoki Toda, Akinobu Lee, Keiichi Tokuda
2006	Voice source correlates of prosodic features in American English: a pilot study. Markus Iseli, Yen-Liang Shue, Melissa A. Epstein, Patricia A. Keating, Jody Kreiman, Abeer Alwan
2006	Voting for two speaker segmentation. Narayanaswamy Balakrishnan, Rashmi Gangadharaiah, Richard M. Stern
2006	Wavelet ridge track interpretation in terms of formants. Salma Chaari, Kaïs Ouni, Noureddine Ellouze
2006	Weighted codebook mapping for noisy speech enhancement using harmonic-noise model. Esfandiar Zavarehei, Saeed Vaseghi, Qin Yan
2006	Within-class covariance normalization for SVM-based speaker recognition. Andrew O. Hatch, Sachin S. Kajarekar, Andreas Stolcke
2006	Word intelligibility estimation of noise-reduced speech. Takeshi Yamada, Masakazu Kumakura, Nobuhiko Kitawaki
2006	Word order and tonal shape in the production of focus in short Finnish utterances. Martti Vainio, Juhani Järvikivi, Stefan Werner
2006	Word structure and tone perception in Mandarin. Hansjörg Mixdorff, Yu Hu