INTERSPEECH A

660 papers

Year Title / Authors
2006"yeah right": sarcasm recognition for spoken dialogue systems.
Joseph Tepperman, David R. Traum, Shrikanth S. Narayanan
2006/nailon/ - software for online analysis of prosody.
Jens Edlund, Mattias Heldner
2006 50 years late: repeating Miller-Nicely 1955.
Andrew Lovitt, Jont B. Allen
2006A DTW-based dissimilarity measure for left-to-right hidden Markov models and its application to word confusability analysis.
Qiang Huo, Wei Li
2006A Spanish speech to sign language translation system for assisting deaf-mute people.
Rubén San Segundo, Roberto Barra-Chicote, Luis Fernando D'Haro, Juan Manuel Montero, Ricardo de Córdoba, Javier Ferreiros
2006A bootstrapping approach for developing language model of new spoken dialogue systems by selecting web texts.
Teruhisa Misu, Tatsuya Kawahara
2006A case study in the identification of prosodic cues to turn-taking: back-channeling in Arabic.
Nigel G. Ward, Yaffa Al Bayyari
2006A clustering approach to semantic decoding.
Hui Ye, Steve J. Young
2006A cohort - UBM approach to mitigate data sparseness for in-set/out-of-set speaker recognition.
Vinod Prakash, John H. L. Hansen
2006A comparative study of Gaussian selection methods in large vocabulary continuous speech recognition.
Dirk Gehrig, Thomas Schaaf
2006A comparison of inter-transcriber reliability for two systems of prosodic annotation: RaP (Rhythm and Pitch) and ToBI (Tones and Break Indices).
Laura Dilley, Mara Breen, Marti Bolivar, John Kraemer, Edward Gibson
2006A comparison of singing evaluation algorithms.
Partha Lal
2006A computational auditory scene analysis system for robust speech recognition.
Soundararajan Srinivasan, Yang Shao, Zhaozhang Jin, DeLiang Wang
2006A constrained Baum-Welch algorithm for improved phoneme segmentation and efficient training.
David Huggins-Daines, Alexander I. Rudnicky
2006A discriminative method for speaker verification using the difference information.
Zhenchun Lei, Yingchun Yang, Zhaohui Wu
2006A framework for robust MFCC feature extraction using SNR-dependent compression of enhanced mel filter bank energies.
Babak Nasersharif, Ahmad Akbari
2006A hybrid phrase-based/statistical speech translation system.
David Stallard, Fred Choi, Kriste Krstovski, Prem Natarajan, Rohit Prasad, Shirin Saleem
2006A joint intention-based dialogue engine.
Rajah Annamalai Subramanian, Philip R. Cohen
2006A maximum likelihood training approach to irrelevant variability compensation based on piecewise linear transformations.
Qiang Huo, Donglai Zhu
2006A model for the f0 reset in corpus-based intonation approaches.
Francisco Campillo Díaz, Jan P. H. van Santen, Eduardo Rodríguez Banga
2006A model of the regularities underlying speaker variation: evidence from hybrid synthesis.
Susan R. Hertz
2006A multi-pass error detection and correction framework for Mandarin LVCSR.
Zhengyu Zhou, Helen M. Meng, Wai Kit Lo
2006A multi-space distribution (MSD) approach to speech recognition of tonal languages.
Huanliang Wang, Yao Qian, Frank K. Soong, Jian-Lai Zhou, Jiqing Han
2006A multiclass framework for speaker verification within an acoustic event sequence system.
Nicolas Scheffer, Jean-François Bonastre
2006A multilingual embodied conversational agent for tutoring speech and language learning.
Dominic W. Massaro, Ying Liu, Trevor H. Chen, Charles Perfetti
2006A multilingual expectations model for contextual utterances in mixed-initiative spoken dialogue.
Hartwig Holzapfel, Alex Waibel
2006A multipitch tracker for monaural speech segmentation.
André Coy, Jon Barker
2006A new HMM adaptation approach for the case of a hands-free speech input in reverberant rooms.
Hans-Günter Hirsch, Harald Finster
2006A new dual-microphone speech enhancement method for oriented noises.
Hamid Reza Abutalebi, Majid Pourahmadi, Masoud Reza Aghabozorgi
2006A new framework for system combination based on integrated hypothesis space.
I-Fan Chen, Lin-Shan Lee
2006A new set of features for text-independent speaker identification.
Carol Y. Espy-Wilson, Sandeep Manocha, Srikanth Vishnubhotla
2006A new single-ended measure for assessment of speech quality.
Timothy Murphy, Dorel Picovici, Abdulhussain E. Mahdi
2006A new state-dependent phonetic tied-mixture model with head-body-tail structured HMM for real-time continuous phoneme recognition system.
Junho Park, Hanseok Ko
2006A noninvasive, low-cost device to study the velopharyngeal port during speech and some preliminary results.
Xiaochuan Niu, Alexander Kain, Jan P. H. van Santen
2006A novel environment-dependent speech enhancement method with optimized memory footprint.
Suhadi Suhadi, Sorel Stan, Tim Fingscheidt
2006A novel framework of text-independent speaker verification based on utterance transform and iterative cohort modeling.
Ming Liu, Huazhong Ning, Thomas S. Huang, Zhengyou Zhang
2006A phrase-level machine translation approach for disfluency detection using weighted finite state transducers.
Sameer Maskey, Bowen Zhou, Yuqing Gao
2006A pitch marks filtering algorithm based on restricted dynamic programming.
Francesc Alías, Carlos Monzo, Joan Claudi Socoró
2006A probabilistic graphical model for microphone array source separation using rich pre-trained source models.
Hagai Thomas Attias
2006A quality measure method using Gaussian mixture models and divergence measure for speaker identification.
Rong Zheng, Shuwu Zhang, Bo Xu
2006A robust feature extraction based on the MTF concept for speech recognition in reverberant environment.
Xugang Lu, Masashi Unoki, Masato Akagi
2006A robust fusion method for multilingual spoken document retrieval systems employing tiered resources.
Murat Akbacak, John H. L. Hansen
2006A simulated-data adaptation technique for robust speech recognition.
Nattanun Thatphithakkul, Boontee Kruatrachue, Chai Wutiwiwatchai, Sanparith Marukatat, Vataya Boonpiam
2006A simulation based parameter optimization for a coarticulation model.
Jianguo Wei, Xugang Lu, Jianwu Dang
2006A speaker adaptation algorithm using principal curves in noisy environments.
Jingying Wang, Zuoying Wang
2006A spectral clustering approach to speaker diarization.
Huazhong Ning, Ming Liu, Hao Tang, Thomas S. Huang
2006A spectral-temporal method for pitch tracking.
Stephen A. Zahorian, Princy Dikshit, Hongbing Hu
2006A spoken language understanding approach using successive learners.
Wei-Lin Wu, Ruzhan Lu, Hui Liu, Feng Gao
2006A stochastic approach for dialog management based on neural networks.
Lluís F. Hurtado, David Griol, Encarna Segarra, Emilio Sanchis
2006A study of emotional speech articulation using a fast magnetic resonance imaging technique.
Sungbok Lee, Erik Bresch, Jason Adams, Abe Kazemzadeh, Shrikanth S. Narayanan
2006A study on detection based automatic speech recognition.
Chengyuan Ma, Yu Tsao, Chin-Hui Lee
2006A study on lattice rescoring with knowledge scores for automatic speech recognition.
Sabato Marco Siniscalchi, Jinyu Li, Chin-Hui Lee
2006A style control technique for speech synthesis using multiple regression HSMM.
Takashi Nose, Junichi Yamagishi, Takao Kobayashi
2006A successive state and mixture splitting for optimizing the size of models in speech recognition.
Soo-Young Suk, Seong-Jun Hahm, Ho-Youl Jung, Hyun-Yeol Chung
2006A syllable based continuous speech recognizer for Tamil.
A. Lakshmi, Hema A. Murthy
2006A technique for controlling voice quality of synthetic speech using multiple regression HSMM.
Makoto Tachibana, Takashi Nose, Junichi Yamagishi, Takao Kobayashi
2006A text-prompted distributed speaker verification system implemented on a cellular phone and a mobile terminal.
Tsuneo Kato, Hisashi Kawai
2006A TextTiling based approach to topic boundary detection in meetings.
Satanjeev Banerjee, Alexander I. Rudnicky
2006A time-synchronous phonetic decoder for a long-contextual-span hidden trajectory model.
Xiaolong Li, Li Deng, Dong Yu, Alex Acero
2006A tone recognition framework for continuous Mandarin speech.
Lei He, Jie Hao
2006A trajectory mixture density network for the acoustic-articulatory inversion mapping.
Korin Richmond
2006A user simulator based on VoiceXML for evaluation of spoken dialog systems.
Akinori Ito, Keisuke Shimada, Motoyuki Suzuki, Shozo Makino
2006A vector space approach to environment modeling for robust speech recognition.
Yu Tsao, Chin-Hui Lee
2006A wavelet-based parameterization for speech/music segmentation.
E. Didiot, Irina Illina, Odile Mella, Dominique Fohr, Jean Paul Haton
2006A weight estimation method using LDA for multi-band speech recognition.
Koji Iwano, Kaname Kojima, Sadaoki Furui
2006ASR-based corrective feedback on pronunciation: does it really work?
Ambra Neri, Catia Cucchiarini, Helmer Strik
2006Accident - execute: increased activation in nonnative listening.
Mirjam Broersma
2006Acoustic analysis and automatic recognition of spontaneous children's speech.
Matteo Gerosa, Diego Giuliani, Shrikanth S. Narayanan
2006Acoustic characterization of children with speech delay.
H. Timothy Bunnell, James B. Polikoff
2006Acoustic cues for the classification of regular and irregular phonation.
Kushan Surana, Janet Slifka
2006Acoustic model training based on linear transformation and MAP modification for HSMM-based speech synthesis.
Katsumi Ogata, Makoto Tachibana, Junichi Yamagishi, Takao Kobayashi
2006Acoustic modeling for spoken dialogue systems based on unsupervised utterance-based selective training.
Tobias Cincarek, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano
2006Adaptive filtering for attenuating musical noise caused by spectral subtraction.
Takahiro Murakami, Yoshihisa Ishida
2006Adaptive multimodal fusion by uncertainty compensation.
Vassilis Pitsikalis, Athanassios Katsamanis, George Papandreou, Petros Maragos
2006Adaptive speech enhancement for speech separation in diffuse noise.
Rong Hu, Yunxin Zhao
2006Advances in lecture recognition: the ISL RT-06s evaluation system.
Christian Fügen, Matthias Wölfel, John W. McDonough, Shajith Ikbal, Florian Kraft, Kornel Laskowski, Mari Ostendorf, Sebastian Stüker, Ken'ichi Kumatani
2006All-pole model estimation of vocal tract on the frequency domain.
Luis Weruaga, Amar Al-Khayat
2006Amharic speech synthesis using cepstral method with stress generation rule.
Tadesse Anberbir, Tomio Takara
2006An ERB loudness pattern based objective speech quality measure.
Guo Chen, Vijay Parsa, Susan Scollie
2006An HMM-based singing voice synthesis system.
Keijiro Saino, Heiga Zen, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda
2006An MRI based study of the acoustic effects of sinus cavities and its application to speaker recognition.
Tarun Pruthi, Carol Y. Espy-Wilson
2006An acoustic and articulatory study of Lombard speech: global effects on the utterance.
Maeva Garnier, Lucie Bailly, Marion Dohen, Pauline Welby, Hélène Loevenbruck
2006An adaptive sampling procedure for speech perception experiments.
Geoffrey Stewart Morrison
2006An annotation scheme for agreement analysis.
Siew Leng Toh, Fan Yang, Peter A. Heeman
2006An annotation scheme for complex disfluencies.
Peter A. Heeman, Andy McMillin, J. Scott Yaruss
2006An assessment of automatic speech recognition as speech intelligibility estimation in the context of additive noise.
Wei Ming Liu, John S. D. Mason, Nicholas W. D. Evans, Keith A. Jellyman
2006An automatic singing skill evaluation method for unknown melodies using pitch interval accuracy and vibrato features.
Tomoyasu Nakano, Masataka Goto, Yuzuru Hiraga
2006An effective and efficient utterance verification technology using word n-gram filler models.
Dong Yu, Yun-Cheng Ju, Alex Acero
2006An efficient bispectrum phase entropy-based algorithm for VAD.
J. M. Górriz, Javier Ramírez, Carlos García Puntonet, José C. Segura
2006An efficient segment-based speech compression technique for hand-held TTS systems.
Chang-Heon Lee, Sung-Kyo Jung, Thomas Eriksson, Won-Suk Jun, Hong-Goo Kang
2006An improved affine projection algorithm based crosstalk resistant adaptive noise canceller.
Guo Chen, Vijay Parsa
2006An improved mel-Wiener filter for mel-LPC based speech recognition.
Md. Babul Islam, Hiroshi Matsumoto, Kazumasa Yamamoto
2006An incremental algorithm for signal reconstruction from short-time Fourier transform magnitude.
Jake V. Bouvrie, Tony Ezzat
2006An information theoretic tool for investigating speech perception.
Bryce E. Lobdell, Jont B. Allen
2006An integrated approach to improve speech recognition rate for non-native speakers.
Yunbin Deng, Xiaokun Li, Chiman Kwan, Roger Xu, Bhiksha Raj, Richard M. Stern, David Williamson
2006An integrated solution for error concealment in DSR systems over wireless channels.
Antonio M. Peinado, Angel M. Gomez, Victoria E. Sánchez, José L. Pérez-Córdoba, Antonio J. Rubio
2006An investigation of manifold learning for speech analysis.
Andrew Errity, John McKenna
2006An online adaptive filtering algorithm for the vocal joystick.
Xiao Li, Jonathan Malkin, Susumu Harada, Jeff A. Bilmes, Richard Wright, James A. Landay
2006An optimum microphone array post-filter for speech applications.
Stamatios Lefkimmiatis, Dimitrios Dimitriadis, Petros Maragos
2006An unified unit-selection framework for ultra low bit-rate speech coding.
V. Ramasubramanian, D. Harish
2006An user-centered development of an intuitive dialog control for speech-controlled music selection in cars.
Stefan Schulz, Hilko Donker
2006Analysis and detection of speech under sleep deprivation.
Tin Lay Nwe, Haizhou Li, Minghui Dong
2006Analysis of HMM temporal evolution for automatic speech recognition and utterance verification.
Marta Casar, José A. R. Fonollosa
2006Analysis of correlation between audio and visual speech features for clean audio feature prediction in noise.
Ibrahim Almajai, Ben Milner, Jonathan Darch
2006Analysis of Lombard effect under different types and levels of noise with application to in-set speaker ID systems.
Vaishnevi S. Varadarajan, John H. L. Hansen
2006Analysis of nonmodal phonation using minimum entropy deconvolution.
Nicolas Malyska, Thomas F. Quatieri
2006Analysis of overlaps in meetings by dialog factors, hot spots, speakers, and collection site: insights for automatic speech recognition.
Özgür Çetin, Elizabeth Shriberg
2006Analysis of prosodic and linguistic cues of phrase finals for turn-taking and dialog acts.
Carlos Toshinori Ishi, Hiroshi Ishiguro, Norihiro Hagita
2006Analyzing dialogue data for real-world emotional speech classification.
Ryuichi Nisimura, Souji Omae, Hideki Kawahara, Toshio Irino
2006Analyzing reusability of speech corpus based on statistical multidimensional scaling method.
Goshu Nagino, Makoto Shozakai
2006Articulatory features for "meeting" speech recognition.
Florian Metze
2006Assessing the reading level of web pages.
Sarah E. Petersen, Mari Ostendorf
2006Assessment of articulatory sub-systems of dysarthric speech using an isolated-style phoneme recognition system.
P. Vijayalakshmi, M. Ramasubba Reddy, Douglas D. O'Shaughnessy
2006Audio person tracking in a smart-room environment.
Alberto Abad, Carlos Segura, Dusan Macho, Javier Hernando, Climent Nadeu
2006Audio-visual speech recognition in the presence of a competing speaker.
Xu Shao, Jon Barker
2006Auto-segmentation based VAD for robust ASR.
Yu Shi, Frank K. Soong, Jian-Lai Zhou
2006Automatic English stop consonants classification using wavelet analysis and hidden Markov models.
Marco Kühne, Roberto Togneri
2006Automatic Mandarin pronunciation scoring for native learners with dialect accent.
Si Wei, Qing-Sheng Liu, Yu Hu, Ren-Hua Wang
2006Automatic acoustic identification of insects inspired by the speaker recognition paradigm.
Ilyas Potamitis, Todor Ganchev, Nikos Fakotakis
2006Automatic alignment and error correction of human generated transcripts for long speech recordings.
Timothy J. Hazen
2006Automatic assignment of anchoring points on vowel templates for defining correspondence between time-frequency representations of speech samples.
Toru Takahashi, Masashi Nishi, Toshio Irino, Hideki Kawahara
2006Automatic detection of irregular phonation in continuous speech.
Srikanth Vishnubhotla, Carol Y. Espy-Wilson
2006Automatic detection of voice onset time contrasts for use in pronunciation assessment.
Abe Kazemzadeh, Joseph Tepperman, Jorge F. Silva, Hong You, Sungbok Lee, Abeer Alwan, Shrikanth S. Narayanan
2006Automatic emotion recognition of speech signal in Mandarin.
Sheng Zhang, P. C. Ching, Fanrang Kong
2006Automatic generation of statistical language models for interactive voice response applications.
Mithun Balakrishna, Cyril Cerovic, Dan I. Moldovan, Ellis Cave
2006Automatic grammar correction for second-language learners.
John Lee, Stephanie Seneff
2006Automatic initial/final generation for dialectal Chinese speech recognition.
Linquan Liu, Thomas Fang Zheng, Wenhu Wu
2006Automatic language identification using wavelets.
Ana Lilia Reyes-Herrera, Luis Villaseñor-Pineda, Manuel Montes-y-Gómez
2006Automatic metadata generation and video editing based on speech and image recognition for medical education contents.
Satoshi Tamura, Koji Hashimoto, Jiong Zhu, Satoru Hayamizu, Hirotsugu Asai, Hideki Tanahashi, Makoto Kanagawa
2006Automatic phonetic segmentation by using a SPM-based approach for a Mandarin singing voice corpus.
Cheng-Yuan Lin, Jyh-Shing Roger Jang
2006Automatic phonetic transcription of large speech corpora: a comparative study.
Christophe Van Bael, Lou Boves, Henk van den Heuvel, Helmer Strik
2006Automatic recognition of speakers' age and gender on the basis of empirical studies.
Christian A. Müller
2006Automatic removal of typed keystrokes from speech signals.
Amarnag Subramanya, Michael L. Seltzer, Alex Acero
2006Automatic speech recognition experiments with articulatory data.
Esmeralda Uraga, Thomas Hain
2006Automatic speech recognition of Cantonese-English code-mixing utterances.
Joyce Y. C. Chan, P. C. Ching, Tan Lee, Houwei Cao
2006Automatic speech segmentation with multiple statistical models.
Seung Seop Park, Jong Won Shin, Nam Soo Kim
2006Automatic syllable-pattern induction in statistical Thai text-to-phone transcription.
Ausdang Thangthai, Chatchawarn Hansakunbuntheung, Rungkarn Siricharoenchai, Chai Wutiwiwatchai
2006Automatic transcription of Somali language.
Abdillahi Nimaan, Pascal Nocera, Jean-François Bonastre
2006BINSEG: an efficient speaker-based segmentation technique.
Jindrich Zdánský
2006Basque-Spanish language identification using phone-based methods.
Víctor G. Guijarrubia, M. Inés Torres
2006Bayesian decision tree state tying for conversational speech recognition.
Rusheng Hu, Yunxin Zhao
2006Bayesian networks for phonetic classification using time-scale features.
Franz Pernkopf, Tuan Van Pham
2006Boosting HMM performance with a memory upgrade.
Mathias De Wachter, Kris Demuynck, Dirk Van Compernolle
2006Bootstrapping language models for dialogue systems.
Karl Weilhammer, Matthew N. Stuttle, Steve J. Young
2006Building an English speech synthesis system from a Japanese ALS patient's voice.
Akemi Iida, Jun Ito, Shimpei Kajima, Tsutomu Sugawara
2006Building an English-Iraqi Arabic machine translation system for spoken utterances with limited resources.
Jason Riesa, Behrang Mohit, Kevin Knight, Daniel Marcu
2006CASA based speech separation for robust speech recognition.
Runqiang Han, Pei Zhao, Qin Gao, Zhiping Zhang, Hao Wu, Xihong Wu
2006CENSREC2: corpus and evaluation environments for in car continuous digit speech recognition.
Satoshi Nakamura, Masakiyo Fujimoto, Kazuya Takeda
2006CHAT: a conversational helper for automotive tasks.
Fuliang Weng, Sebastian Varges, Badri Raghunathan, Florin Ratiu, Heather Pon-Barry, Brian Lathrop, Qi Zhang, Harry Bratt, Tobias Scheideck, Kui Xu, Matthew Purver, Rohit Mishra, Annie Lien, Madhuri Raya, Stanley Peters, Yao Meng, J. Russell, Lawrence Cavedon, Elizabeth Shriberg, Hauke Schmidt, R. Prieto
2006CLUSTERGEN: a statistical parametric synthesizer using trajectory modeling.
Alan W. Black
2006Call analysis with classification using speech and non-speech features.
Yun-Cheng Ju, Ye-Yi Wang, Alex Acero
2006Category formation and the role of spectral quality in the perception and production of English front vowels.
Ricardo Augusto Hoffmann Bion, Paola Escudero, Andréia S. Rauber, Barbara O. Baptista
2006Characterization of cued speech vowels from the inner lip contour.
Noureddine Aboutabit, Denis Beautemps, Laurent Besacier
2006Chinese input method based on reduced Mandarin phonetic alphabet.
Chun-Han Tseng, Chia-Ping Chen
2006Classified comfort noise generation for efficient voice transmission.
Yasheng Qian, Wei-Shou Hsu, Peter Kabal
2006Classroom success of an intelligent tutoring system for lexical practice and reading comprehension.
Michael Heilman, Kevyn Collins-Thompson, Jamie Callan, Maxine Eskénazi
2006Clean speech feature estimation based on soft spectral masking.
Young Joon Kim, Woohyung Lim, Nam Soo Kim
2006Cluster-based user simulations for learning dialogue strategies.
Verena Rieser, Oliver Lemon
2006Colloquial Iraqi ASR for speech translation.
Shirin Saleem, Rohit Prasad, Prem Natarajan
2006Combining acoustic, lexical, and syntactic evidence for automatic unsupervised prosody labeling.
Sankaranarayanan Ananthakrishnan, Shrikanth S. Narayanan
2006Combining missing-feature theory, speech enhancement and speaker-dependent/-independent modeling for speech separation.
Ji Ming, Timothy J. Hazen, James R. Glass
2006Combining multiple-sized sub-word units in a speech recognition system using baseform selection.
T. Nagarajan, P. Vijayalakshmi, Douglas D. O'Shaughnessy
2006Combining phonetic attributes using conditional random fields.
Jeremy Morris, Eric Fosler-Lussier
2006Compact n-gram models by incremental growing and clustering of histories.
Sami Virpioja, Mikko Kurimo
2006Comparative analysis of formants of British, American and Australian accents.
Seyed Ghorshi, Saeed Vaseghi, Qin Yan
2006Comparative study on contributions of pitch-synchronization and peak-amplitude towards robustness issue of ASR.
Muhammad Ghulam, Junsei Horikawa, Tsuneo Nitta
2006Comparison of Slovak and Czech speech recognition based on grapheme and phoneme acoustic models.
Slavomír Lihan, Jozef Juhár, Anton Cizmar
2006Comparison of acoustic modeling techniques for Vietnamese and Khmer ASR.
Viet Bac Le, Laurent Besacier
2006Comparison of keyword spotting methods for searching in speech.
Lubos Smídl, Josef V. Psutka
2006Comparison of prediction based LSF quantization methods using split VQ.
Saikat Chatterjee, T. V. Sreenivas
2006Comparison of the ITU-T P.85 standard to other methods for the evaluation of text-to-speech systems.
Dmitry Sityaev, Katherine M. Knill, Tina Burrows
2006Computer aided pronunciation learning system using speech recognition techniques.
Sherif Mahdy Abdou, Salah Eldeen Hamid, Mohsen A. Rashwan, Abdurrahman Samir, Ossama Abdel-Hamid, Mostafa Shahin, Waleed Nazih
2006Computer-assisted closed-captioning of live TV broadcasts in French.
Gilles Boulianne, Jean-Francois Beaumont, Maryse Boisvert, Julie Brousseau, Patrick Cardinal, Claude Chapdelaine, Michel Comeau, Pierre Ouellet, Frédéric Osterrath
2006Conceptual decoding from word lattices: application to the spoken dialogue corpus MEDIA.
Christophe Servan, Christian Raymond, Frédéric Béchet, Pascal Nocera
2006Conditional random fields for hierarchical segment selection in text-to-speech synthesis.
Christian Weiss, Wolfgang Hess
2006Consonant and vowel confusions in speech-weighted noise.
Sandeep Phatak, Jont B. Allen
2006Constrained structural maximum a posteriori linear regression for average-voice-based speech synthesis.
Yuji Nakano, Makoto Tachibana, Junichi Yamagishi, Takao Kobayashi
2006Constructing stylistic synthesis databases from audio books.
Yong Zhao, Di Peng, Lijuan Wang, Min Chu, Yining Chen, Peng Yu, Jun Guo
2006Continual on-line monitoring of Czech spoken broadcast programs.
Jan Nouza, Jindrich Zdánský, Petr Cerva, Jan Kolorenc
2006Continuous time-frequency masking method for blind speech separation with adaptive choice of threshold parameter using ICA.
Zbynek Koldovský, Jan Nouza, Jan Kolorenc
2006Conversational help desk: vague callers and context switch.
Osamuyimen Stewart, Juan M. Huerta, Ea-Ee Jan, Cheng Wu, Xiang Li, David M. Lubensky
2006Conversational quality estimation model for wideband IP-telephony services.
Hitoshi Aoki, Atsuko Kurashima, Akira Takahashi
2006Conversion from phoneme based to grapheme based acoustic models for speech recognition.
Andrej Zgank, Zdravko Kacic
2006Cooperation between global and local methods for the automatic segmentation of speech synthesis corpora.
Safaa Jarifi, Dominique Pastor, Olivier Rosec
2006Corpus design based on the Kullback-Leibler divergence for text-to-speech synthesis application.
Aleksandra Krul, Géraldine Damnati, François Yvon, Thierry Moudenc
2006Corpus-based generation of fundamental frequency contours using generation process model and considering emotional focuses.
Keikichi Hirose, Yasufumi Asano, Nobuaki Minematsu
2006Coupling particle filters with automatic speech recognition for speech feature enhancement.
Friedrich Faubel, Matthias Wölfel
2006Cross-language evaluation of voice-to-phoneme conversions for voice-tag application in embedded platforms.
Yan Ming Cheng, Changxue Ma, Lynette Melnar
2006Cross-lingual dialog model for speech to speech translation.
Emil Ettelaie, Panayiotis G. Georgiou, Shrikanth S. Narayanan
2006Cross-system adaptation and combination for continuous speech recognition: the influence of phoneme set and acoustic front-end.
Sebastian Stüker, Christian Fügen, Susanne Burger, Matthias Wölfel
2006Cues for hesitation in speech synthesis.
Rolf Carlson, Kjell Gustafson, Eva Strangert
2006Data-driven design of front-end filter bank for Lombard speech recognition.
Hynek Boril, Petr Fousek, Petr Pollák
2006Decision directed constrained iterative speech enhancement.
Amit Das, John H. L. Hansen
2006Decision tree-based training of probabilistic concatenation models for corpus-based speech synthesis.
Shinsuke Sakai, Tatsuya Kawahara
2006Design and performance analysis of a factoid question answering system for spontaneous speech transcriptions.
Mihai Surdeanu, David Dominguez-Sal, Pere Comas
2006Detecting anger in automated voice portal dialogs.
Felix Burkhardt, Jitendra Ajmera, Roman Englert, Joachim Stegmann, Winslow Burleson
2006Detecting question-bearing turns in spoken tutorial dialogues.
Jackson Liscombe, Jennifer J. Venditti, Julia Hirschberg
2006Detection and separation of speech events in meeting recordings.
Futoshi Asano, Jun Ogata
2006Detection of a third speaker in telephone conversations.
Uchechukwu O. Ofoegbu, Ananth N. Iyer, Robert E. Yantorno, Stanley J. Wenndt
2006Detection of quotations and inserted clauses and its application to dependency structure analysis in spontaneous Japanese.
Ryoji Hamabe, Kiyotaka Uchimoto, Tatsuya Kawahara, Hitoshi Isahara
2006Detection of word fragments in Mandarin telephone conversation.
Cheng-Tao Chu, Yun-Hsuan Sung, Yuan Zhao, Daniel Jurafsky
2006Developing an automatic assessment tool for children's oral reading.
Leen Cleuren, Jacques Duchateau, Alain Sips, Pol Ghesquière, Hugo Van hamme
2006Developing consistent pronunciation models for phonemic variants.
Marelie H. Davel, Etienne Barnard
2006Developing speech dialogs for multimodal HMIs using finite state machines.
Silke Goronzy, Raquel Mochales, Nicole Beringer
2006Development and evaluation of speech database in automotive environments for practical speech recognition systems.
Yasunari Obuchi, Nobuo Hataoka
2006Development of a program for self assessment of Japanese pronunciation by English learners.
Chiharu Tsurutani, Yutaka Yamauchi, Nobuaki Minematsu, Dean Luo, Kazutaka Maruyama, Keikichi Hirose
2006Development of advanced dialog systems with PATE.
Norbert Pfleger, Jan Schehl
2006Development of prototype text-to-speech systems for Northern Sotho.
H. J. Oosthuizen, S. T. Phihlela, Madimetja Jonas D. Manamela
2006Development of Slovak GALAXY/VoiceXML based spoken language dialogue system to retrieve information from the internet.
Jozef Juhár, Stanislav Ondás, Anton Cizmar, Milan Rusko, Gregor Rozinaj, Roman Jarina
2006Dialog act tagging with support vector machines and hidden Markov models.
Dinoj Surendran, Gina-Anne Levow
2006Dialogue act compression via pitch contour preservation.
Gabriel Murray, Steve Renals
2006Discourse structure and speech recognition problems.
Mihai Rotaru, Diane J. Litman
2006Discriminant linear processing of time-frequency plane.
Fabio Valente, Hynek Hermansky
2006Discriminating speech and non-speech with regularized least squares.
Ryan Rifkin, Nima Mesgarani
2006Discriminative MLE training using a product of Gaussian likelihoods.
T. Nagarajan, Douglas D. O'Shaughnessy
2006Discriminative adaptation for speaker verification.
Chris Longworth, Mark J. F. Gales
2006Discriminative kernel-based phoneme sequence recognition.
Joseph Keshet, Shai Shalev-Shwartz, Samy Bengio, Yoram Singer, Dan Chazan
2006Discriminative models for spoken language understanding.
Ye-Yi Wang, Alex Acero
2006Discriminative named entity recognition of speech data using speech recognition confidence.
Katsuhito Sudoh, Hajime Tsukada, Hideki Isozaki
2006Disentangling gestural and auditory contrast accounts of compensation for coarticulation.
Navin Viswanathan, James S. Magnuson, Carol A. Fowler
2006Distance measure between Gaussian distributions for discriminating speaking styles.
Goshu Nagino, Makoto Shozakai
2006Distant-talking continuous speech recognition based on a novel reverberation model in the feature domain.
Armin Sehr, Marcus Zeller, Walter Kellermann
2006Doing research on a deployed spoken dialogue system: one year of Let's Go! experience.
Antoine Raux, Dan Bohus, Brian Langner, Alan W. Black, Maxine Eskénazi
2006Dynamic evidence models in a DBN phone recognizer.
William Schuler, Tim Miller, Stephen T. Wu, Andrew Exley
2006Dynamic extension of a grammar-based dialogue system: constructing an all-recipes knowing robot.
Petra Gieselmann, Alex Waibel
2006Dynamic help generation by estimating user's mental model in spoken dialogue systems.
Yuichiro Fukubayashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno
2006Edge-splitting in a cumulative multimodal system, for a no-wait temporal threshold on information fusion, combined with an under-specified display.
Edward C. Kaiser, Paulo Barthelmess
2006Effect of dynamic information of formants on discrimination of English vowels in consonantal contexts by Japanese listeners.
Akiyo Joto
2006Effect of genre, speaker, and word class on the realization of given and new information.
Agustín Gravano, Julia Hirschberg
2006Effects of familiarity with faces and voices on second-language speech processing: components of memory traces.
Debra M. Hardison
2006Effects of featural similarity and overlap position on lexical confusions and overt similarity judgments.
Sarah C. Creel, Delphine Dahan, Daniel Swingley
2006Effects of frequency shifts on perceived naturalness and gender information in speech.
Peter F. Assmann, Sophia Dembling, Terrance M. Nearey
2006Effects of midline tongue piercing on spectral centroid frequencies of sibilants.
Tom Kovacs, Donald S. Finan
2006Effects of word frequency on the acoustic durations of affixes.
Mark Pluymaekers, Mirjam Ernestus, R. Harald Baayen
2006Efficient Gaussian mixture model evaluation in voice conversion.
Jilei Tian, Jani Nurminen, Victor Popa
2006Efficient VQ techniques and general noise shaping in noise feedback coding.
Jes Thyssen, Juin-Hwey Chen
2006Efficient interactive retrieval of spoken documents with key terms ranked by reinforcement learning.
Yi-Cheng Pan, Jia-Yu Chen, Yen-shin Lee, Yi-Sheng Fu, Lin-Shan Lee
2006Eigenvoice conversion based on Gaussian mixture model.
Tomoki Toda, Yamato Ohtani, Kiyohiro Shikano
2006Emotion detection in infants' cries based on a maximum likelihood approach.
Shoichi Matsunaga, S. Sakaguchi, Masaru Yamashita, Sueharu Miyahara, S. Nishitani, Kazuyuki Shinohara
2006Emotion recognition in spontaneous speech using GMMs.
Daniel Neiberg, Kjell Elenius, Kornel Laskowski
2006Emovoice: a system to generate emotions in speech.
João P. Cabral, Luís C. Oliveira
2006Enhanced dynamic codebook reordering for advanced quantizer structures.
Jani Nurminen
2006Enhancement of harmonic content of speech based on a dynamic programming pitch tracking algorithm.
Mark R. Every, Philip J. B. Jackson
2006Enhancing the performance of a GMM-based speaker identification system in a multi-microphone setup.
Andreas Stergiou, Aristodemos Pnevmatikakis, Lazaros C. Polymenakos
2006Estimation of the quality dimension "directness/frequency content" for the instrumental assessment of speech quality.
Kirstin Scholz, Marcel Wältermann, Lu Huo, Alexander Raake, Sebastian Möller, Ulrich Heute
2006Evaluating a virtual speech cuer.
Guillaume Gibert, Gérard Bailly, Frédéric Elisei
2006Evaluating prosody of Mandarin speech for language learning.
Minghui Dong, Haizhou Li, Tin Lay Nwe
2006Evaluation of a spoken dialogue system with usability tests and long-term pilot studies: similarities and differences.
Markku Turunen, Jaakko Hakulinen, Anssi Kainulainen
2006Evaluation of content presentation strategies for an in-car spoken dialogue system.
Heather Pon-Barry, Fuliang Weng, Sebastian Varges
2006Evaluation of objective measures for speech enhancement.
Yi Hu, Philipos C. Loizou
2006Evaluation of perceptual quality of control point reduction in rule-based synthesis.
Kimmo Pärssinen, Marko Moberg
2006Evaluation of voice activity detection by combining multiple features with weight adaptation.
Yusuke Kida, Tatsuya Kawahara
2006Evolving emotional prosody.
Cecilia Ovesdotter Alm, Xavier Llorà
2006Examining knowledge sources for human error correction.
Yongmei Shi, Lina Zhou
2006Example-based grapheme-to-phoneme conversion for Thai.
Paisarn Charoenpornsawat, Tanja Schultz
2006Expanding phonetic coverage in unit selection synthesis through unit substitution from a donor voice.
Alistair Conkie, Ann K. Syrdal
2006Experiments on Chinese speech recognition with tonal models and pitch estimation using the Mandarin speecon data.
Ying Sun, Daniel Willett, Raymond Brueckner, Rainer Gruhn, Dirk Bühler
2006Exploiting dendritic autocorrelogram structure to identify spectro-temporal regions dominated by a single sound source.
Ning Ma, Phil D. Green, André Coy
2006Exploiting polynomial-fit histogram equalization and temporal average for robust speech recognition.
Shih-Hsiang Lin, Yao-Ming Yeh, Berlin Chen
2006Exploiting semantic relations for a spoken language understanding application.
Catherine Kobus, Géraldine Damnati, Lionel Delphin-Poulat, Renato De Mori
2006Exploring the unknown - collecting 1000 speakers over the internet for the ph@ttsessionz database of adolescent speakers.
Christoph Draxler
2006Expressive prosody for unit-selection speech synthesis.
Volker Strom, Robert A. J. Clark, Simon King
2006Extension and further analysis of higher order cepstral moment normalization (HOCMN) for robust features in speech recognition.
Chang-Wen Hsu, Lin-Shan Lee
2006Extracting formants from short segments of speech using group delay functions.
Joseph M. Anand, Sunitha Guruprasad, B. Yegnanarayana
2006Factors affecting speakers' choice of fillers in Japanese presentations.
Michiko Watanabe, Yasuharu Den, Keikichi Hirose, Shusaku Miwa, Nobuaki Minematsu
2006Farsbayan: a unit selection based Farsi speech synthesizer.
Mohammad Mehdi Homayounpour, Majid Namnabat
2006Fast SVM training based on the choice of effective samples for audio classification.
Shilei Zhang, Hongchen Jiang, Shuwu Zhang, Bo Xu
2006Fast and effective retraining on contrastive vocal characteristics with bidirectional long short-term memory nets.
Nicole Beringer
2006Feature analysis for emotion recognition from Mandarin speech considering the special characteristics of Chinese language.
Yi-Hao Kao, Lin-Shan Lee
2006Feature and model space speaker adaptation with full covariance Gaussians.
Daniel Povey, George Saon
2006Feature combination using linear discriminant analysis and its pitfalls.
Ralf Schlüter, András Zolnay, Hermann Ney
2006Feature extraction for spectral continuity measures in concatenative speech synthesis.
Barry Kirkpatrick, Darragh O'Brien, Ronan Scaife
2006Feature normalization using smoothed mixture transformations.
Patrick Kenny, Vishwa Gupta, Gilles Boulianne, Pierre Ouellet, Pierre Dumouchel
2006Finding the gaps: applying a connectionist model of word segmentation to noisy phone-recognized speech data.
C. Anton Rytting
2006Formant-based English vowel assessment for Chinese in Taiwan.
Jiang-Chun Chen, Wei-Tang Hsu, Jyh-Shing Roger Jang, Ren-yuan Lyu, Yuang-Chin Chiang
2006Forward-backwards training of hybrid HMM/BN acoustic models.
Konstantin Markov, Satoshi Nakamura
2006Frame based system combination and a comparison with weighted ROVER and CNC.
Björn Hoffmeister, Tobias Klein, Ralf Schlüter, Hermann Ney
2006Frequency warping based on mapping formant parameters.
Zhiwei Shuang, Raimo Bakis, Slava Shechtman, Dan Chazan, Yong Qin
2006Frequency warping by linear transformation of standard MFCC.
Sankaran Panchapagesan
2006Friends and enemies: a novel initialization for speaker diarization.
Xavier Anguera, Chuck Wooters, Javier Hernando
2006From pre-recorded prompts to corporate voices: on the migration of interactive voice response applications.
Volker Fischer, Siegfried Kunzmann
2006From reaction to prediction: experiments with computational models of turn-taking.
David Schlangen
2006Further developments in LSM-based boundary training for unit selection TTS.
Jerome R. Bellegarda
2006Further investigations on the relationship between objective measures of speech quality and speech recognition rates in noisy environments.
Francisco José Fraga, Carlos Alberto Ynoguti, André Godoi Chiovato
2006Fusion of phonotactic and prosodic knowledge for language identification.
Chi-Yueh Lin, Hsiao-Chuan Wang
2006GMM-based acoustic modeling for embedded speech recognition.
Christophe Lévy, Georges Linarès, Jean-François Bonastre
2006Gammatone auditory filterbank and independent component analysis for speaker identification.
Yushi Zhang, Waleed H. Abdulla
2006Generalization of the minimum classification error (MCE) training based on maximizing generalized posterior probability (GPP).
Qiang Fu, Antonio Moreno-Daniel, Biing-Hwang Juang, Jian-Lai Zhou, Frank K. Soong
2006Generating German intonation with a trainable prosodic model.
Gérard Bailly, Jan Gorisch
2006Generating complementary systems for speech recognition.
Catherine Breslin, Mark J. F. Gales
2006Generating time-constrained audio presentations of structured information.
Brian Langner, Rohit Kumar, Arthur Chan, Lingyun Gu, Alan W. Black
2006Geometrically constrained permutation-free source separation in an undercomplete speech unmixing scenario.
Erik Visser
2006Glottal closure and opening detection for flexible parametric voice coding.
Pamornpol Jinachitra
2006Grapheme-to-phoneme conversion using automatically extracted associative rules for Korean TTS system.
Jinsik Lee, Seungwon Kim, Gary Geunbae Lee
2006HMM-based MAP prediction of voiced and unvoiced formant frequencies from noisy MFCC vectors.
Jonathan Darch, Ben Milner
2006HMM-based continuous sign language recognition using a fast optical flow parameterization of visual information.
Guillermo Cortés, Luz García, M. Carmen Benítez, José C. Segura
2006HMM-based unit selection using frame sized speech segments.
Zhen-Hua Ling, Ren-Hua Wang
2006Handling convolutional noise in missing data automatic speech recognition.
Maarten Van Segbroeck, Hugo Van hamme
2006Have we met? MDP based speaker ID for robot dialogue.
Filip Krsmanovic, Curtis Spencer, Daniel Jurafsky, Andrew Y. Ng
2006High-quality speech translation in the flight domain.
Chao Wang, Stephanie Seneff
2006High-rate data embedding in unvoiced speech.
Konrad Hofbauer, Gernot Kubin
2006Highly directional multi-beam audio loudspeaker.
Dirk Olszewski, Klaus Linhard
2006Highly noise robust text-dependent speaker recognition based on hypothesized wiener filtering.
V. Ramasubramanian, Deepak Vijaywargiay, Kumar V. Praveen
2006How auditory and visual prosody is used in end-of-utterance detection.
Pashiera Barkhuysen, Emiel Krahmer, Marc Swerts
2006How to handle gender and number agreement in statistical language models?
Caroline Lavecchia, Kamel Smaïli, Jean Paul Haton
2006Hypothesis spaces for minimum Bayes risk training in large vocabulary speech recognition.
Matthew Gibson, Thomas Hain
2006Hypothesis-based feature combination of multiple speech inputs for robust speech recognition in automotive environments.
Yasunari Obuchi, Nobuo Hataoka
2006Identification of confusion and surprise in spoken dialog using prosodic features.
Rohit Kumar, Carolyn P. Rosé, Diane J. Litman
2006Identification of regional accents in French: perception and categorization.
Cécile Woehrling, Philippe Boula de Mareüil
2006Identify language origin of personal names with normalized appearance number of web pages.
Jia-Li You, Yining Chen, Min Chu, Yong Zhao, Jin-Lin Wang
2006Imperfect transcript driven speech recognition.
Benjamin Lecouteux, Georges Linarès, Pascal Nocera, Jean-François Bonastre
2006Improved hybrid microphone array post-filter by integrating a robust speech absence probability estimator for speech enhancement.
Junfeng Li, Masato Akagi, Yôiti Suzuki
2006Improved language identification using support vector machines for language modeling.
Xi Yang, Lu-Feng Zhai, Man-Hung Siu, Herbert Gish
2006Improved performance evaluation of speech event detectors.
Carla Lopes, Fernando Perdigão
2006Improved source modeling and predictive classification for channel robust speech recognition.
Valentin Ion, Reinhold Haeb-Umbach
2006Improved speech activity detection using cross-channel features for recognition of multiparty meetings.
Kofi Boakye, Andreas Stolcke
2006Improved tone modeling for Mandarin broadcast news speech recognition.
Xin Lei, Man-Hung Siu, Mei-Yuh Hwang, Mari Ostendorf, Tan Lee
2006Improved topic classification over maximum entropy model using k-norm based new objectives.
Xiang Li, Ea-Ee Jan, Cheng Wu, David M. Lubensky
2006Improved warping-invariant features for automatic speech recognition.
Jan Rademacher, Matthias Wächter, Alfred Mertins
2006Improvement speaker clustering using global similarity features.
Konstantin Biatov, Joachim Köhler
2006Improvements to bucket box intersection algorithm for fast GMM computation in embedded speech recognition systems.
Min Tang, Aravind Ganapathiraju
2006Improving Arabic HMM based speech synthesis quality.
Ossama Abdel-Hamid, Sherif Mahdy Abdou, Mohsen A. Rashwan
2006Improving body transmitted unvoiced speech with statistical voice conversion.
Mikihiro Nakagiri, Tomoki Toda, Hideki Kashioka, Kiyohiro Shikano
2006Improving glottal waveform estimation through rank-based glottal quality assessment.
Elliot Moore II, Juan F. Torres
2006Improving perplexity measures to incorporate acoustic confusability.
Amit Anil Nanavati, Nitendra Rajput
2006Improving phrase-based Korean-English statistical machine translation.
Jonghoon Lee, Donghyeon Lee, Gary Geunbae Lee
2006Improving speech recognition accuracy with multi-confidence thresholding.
Shuangyu Chang
2006Improving speech recognition of two simultaneous speech signals by integrating ICA BSS and automatic missing feature mask generation.
Ryu Takeda, Shun'ichi Yamamoto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno
2006Improving the characterization of the alternative hypothesis via kernel discriminant analysis for likelihood ratio-based speaker verification.
Yi-Hsiang Chao, Wei-Ho Tsai, Hsin-Min Wang, Ruei-Chuan Chang
2006Improving the performance of HMM-based voice conversion using context clustering decision tree and appropriate regression matrix format.
Long Qin, Yi-Jian Wu, Zhen-Hua Ling, Ren-Hua Wang
2006Improving the performance of out-of-vocabulary word rejection by using support vector machines.
Shilei Huang, Xiang Xie, Jingming Kuang
2006Improving tone recognition with combined frequency and amplitude modelling.
Siwei Wang, Gina-Anne Levow
2006Incorporating second-order information into two-step major phrase break prediction for Korean.
Seungwon Kim, Jinsik Lee, Byeongchang Kim, Gary Geunbae Lee
2006Incremental learning of MAP context-dependent edit operations for spoken phone number recognition in an embedded platform.
Hahn Koo, Yan Ming Cheng
2006Independent components for acoustic modeling.
Jan Trmal, Jan Vanek, Ludek Müller, Jan Zelinka
2006Individual on-line variance adaptation of frequency filtered parameters for robust ASR.
Jesús Vicente-Peña, Fernando Díaz-de-María, W. Bastiaan Kleijn
2006Infants' ability to extract verbs from continuous speech.
Ellen Marklund, Francisco Lacerda
2006Infinite models for speaker clustering.
Fabio Valente
2006Influence of pause length on listeners' impressions in simultaneous interpretation.
Hitomi Tohyama, Shigeki Matsubara
2006Integrating Festival and Windows.
Rhys James Jones, Ambrose Choy, Briony Williams
2006Integrating phonetic boundary discrimination explicitly into HMM systems.
Yu Wang, Eric Fosler-Lussier
2006Integrating spoken dialog and question answering: the ritel project.
Sophie Rosset, Olivier Galibert, Gabriel Illouz, Aurélien Max
2006Integration of a CELP coder in the ARDOR universal sound codec.
Balázs Kövesi, Dominique Massaloux, David Virette, Julien Bensa
2006Intelligibility of machine translation output in speech synthesis.
Laura Mayfield Tomokiyo, Kay Peterson, Alan W. Black, Kevin A. Lenzo
2006Interleaving and MMSE estimation with VQ replicas for distributed speech recognition over lossy packet networks.
Angel M. Gomez, Antonio M. Peinado, Victoria E. Sánchez, José L. Carmona, Antonio J. Rubio
2006Intonational cues to student questions in tutoring dialogs.
Jennifer J. Venditti, Julia Hirschberg, Jackson Liscombe
2006Intra-speaker variability compensation in speaker verification with limited enrolling data.
Claudio Garretón, Néstor Becerra Yoma, Carlos Molina, Fernando Huenupán
2006Investigating automatic decomposition for ASR in less represented languages.
Thomas Pellegrini, Lori Lamel
2006Investigation on Mandarin broadcast news speech recognition.
Mei-Yuh Hwang, Xin Lei, Wen Wang, Takahiro Shinozaki
2006Investigation on rescoring using minimum verification error (MVE) detectors.
Qiang Fu, Biing-Hwang Juang
2006Investigations of issues for using multiple acoustic models to improve continuous speech recognition.
Rong Zhang, Alexander I. Rudnicky
2006Is ASR accurate enough for automated reading tutors, and how can we tell?
Jack Mostow
2006Is voice quality enough? - study on how the situation and user's awareness influence the utterance features.
Shinya Yamada, Toshihiko Itoh, Kenji Araki
2006Issues with uncertainty decoding for noise robust speech recognition.
Hank Liao, Mark J. F. Gales
2006Joint interpretation of input speech and pen gestures for multimodal human-computer interaction.
Pui-Yu Hui, Helen M. Meng
2006Joint prosodic and segmental unit selection speech synthesis.
Robert A. J. Clark, Simon King
2006LDA based feature estimation methods for LVCSR.
Janne Pylkkönen
2006LINTest: a development tool for testing dialogue systems.
Lars Degerstedt, Arne Jönsson
2006Language model adaptation for tiny adaptation corpora.
Dietrich Klakow
2006Language model adaptation with a word list and a raw corpus.
Shinsuke Mori
2006Language modeling of Chinese personal names based on character units for continuous Chinese speech recognition.
Xinhui Hu, Hirofumi Yamamoto, Gen-ichiro Kikui, Yoshinori Sagisaka
2006Language, gender, speaking style and language proficiency as factors influencing the autonomous vocalic filler production in spontaneous speech.
Ioana Vasilescu, Martine Adda-Decker
2006Latent prosodic modeling (LPM) for speech with applications in recognizing spontaneous Mandarin speech with disfluencies.
Che-Kuang Lin, Lin-Shan Lee
2006Lattice LP filtering for noise reduction in speech signals.
Erhard Rank, Gernot Kubin
2006Lattice extension and rescoring based approaches for LVCSR of Turkish.
Ebru Arisoy, Murat Saraclar
2006Learning from errors in grapheme-to-phoneme conversion.
Tatyana Polyakova, Antonio Bonafonte
2006Learning multi-goal dialogue strategies using reinforcement learning with reduced state-action spaces.
Heriberto Cuayáhuitl, Steve Renals, Oliver Lemon, Hiroshi Shimodaira
2006Lexical stress in continuous speech recognition.
Rogier C. van Dalen, Pascal Wiggers, Léon J. M. Rothkrantz
2006Limitations of MLLR adaptation with Spanish-accented English: an error analysis.
Constance Clarke, Daniel Jurafsky
2006Lingua machinae - an unorthodox proposal.
Florian Schiel, Christoph Draxler, Marion Libossek
2006Linguistic tuple segmentation in n-gram-based statistical machine translation.
Adrià de Gispert, José B. Mariño
2006Local transformation models for speech recognition.
Antonio Miguel, Eduardo Lleida, Alfons Juan, Luis Buera, Alfonso Ortega, Oscar Saz
2006Locating phone boundaries from acoustic discontinuities using a two-staged approach.
Pairote Leelaphattarakij, Proadpran Punyabukkana, Atiwong Suchato
2006Lost speech reconstruction method using speech recognition based on missing feature theory and HMM-based speech synthesis.
Shingo Kuroiwa, Satoru Tsuge, Fuji Ren
2006Low complexity LID using pruned pattern tables of LZW.
S. V. Basavaraja, T. V. Sreenivas
2006Low-complexity and efficient classification of voiced/unvoiced/silence for noisy environments.
Tuan Van Pham, Gernot Kubin
2006Low-resource autodiacritization of abjads for speech keyword search.
Patrick Schone
2006MMSE estimation of complex-valued discrete Fourier coefficients with generalized gamma priors.
Jesper Jensen, Richard C. Hendriks, Jan S. Erkelens, Richard Heusdens
2006Manifold HLDA and its application to robust speech recognition.
Toshiaki Kubo, Tetsuji Ogawa, Tetsunori Kobayashi
2006Map-based adaptation for speech conversion using adaptation data selection and non-parallel training.
Chung-Han Lee, Chung-Hsien Wu
2006Mapping neural networks for bandwidth extension of narrowband speech.
A. Shahina, B. Yegnanarayana
2006Max-Gabor analysis and synthesis of spectrograms.
Tony Ezzat, Jake V. Bouvrie, Tomaso A. Poggio
2006Maximum entropy modeling for diacritization of Arabic text.
Ruhi Sarikaya, Ossama Emam, Imed Zitouni, Yuqing Gao
2006Maximum likelihood voice conversion based on GMM with STRAIGHT mixed excitation.
Yamato Ohtani, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano
2006Measuring and comparing vowel qualities in a Dutch spontaneous speech corpus.
Irene Jacobi, Louis C. W. Pols, Jan Stroop
2006Measuring the acceptable word error rate of machine-generated webcast transcripts.
Cosmin Munteanu, Gerald Penn, Ronald Baecker, Elaine G. Toms, David James
2006Memo: towards automatic usability evaluation of spoken dialogue services by user error simulations.
Sebastian Möller, Roman Englert, Klaus-Peter Engelbrecht, Verena Vanessa Hafner, Anthony Jameson, Antti Oulasvirta, Alexander Raake, Norbert Reithinger
2006Minimum boundary error training for automatic phonetic segmentation.
Jen-Wei Kuo, Hsin-Min Wang
2006Minimum classification error training of hidden Markov models for acoustic language identification.
Josef G. Bauer, Ekaterina Timoshenko
2006Minimum divergence based discriminative training.
Jun Du, Peng Liu, Frank K. Soong, Jian-Lai Zhou, Ren-Hua Wang
2006Minimum generation error criterion for tree-based clustering of context dependent HMMs.
Yi-Jian Wu, Wu Guo, Ren-Hua Wang
2006Missing data mask models with global frequency and temporal constraints.
Sébastien Demange, Christophe Cerisara, Jean Paul Haton
2006Missing feature theory with soft spectral subtraction for speaker verification.
Michael T. Padilla, Thomas F. Quatieri, Douglas A. Reynolds
2006Missing-feature reconstruction for band-limited speech recognition in spoken document retrieval.
Wooil Kim, John H. L. Hansen
2006Modeling of speech signals based on Bessel-like orthogonal transform.
Giorgio Biagetti, Paolo Crippa, Claudio Turchetti
2006Modeling sensory-to-motor mappings using neural nets and a 3d articulatory speech synthesizer.
Bernd J. Kröger, Peter Birkholz, Jim Kannampuzha, Christiane Neuschaefer-Rube
2006Modeling the acoustic correlates of expressive elements in text genres for expressive text-to-speech synthesis.
Hongwu Yang, Helen M. Meng, Lianhong Cai
2006Modeling the precedence effect for binaural sound source localization in noisy and echoic environments.
Martin Heckmann, Tobias Rodemann, Björn Schölling, Frank Joublin, Christian Goerick
2006Modelling aspiration noise during phonation using the LF voice source model.
Christer Gobl
2006Modified phase opponency based solution to the speech separation challenge.
Om Deshmukh, Carol Y. Espy-Wilson
2006Monitoring of the natural voice variations in open and closed phases with frequency warped ARMA modeling.
Pedro J. Quintana-Morales, Juan L. Navarro-Mesa, Antonio G. Ravelo-García, Fernando D. Lorenzo-García
2006Moving speech recognition from software to silicon: the in silico vox project.
Edward C. Lin, Kai Yu, Rob A. Rutenbar, Tsuhan Chen
2006Multi-accent Chinese speech recognition.
Yi Liu, Pascale Fung
2006Multi-domain text-to-speech synthesis by automatic text classification.
Francesc Alías, Joan Claudi Socoró, Xavier Sevillano, Ignasi Iriondo Sanz, Xavier Gonzalvo
2006Multi-flow block interleaving applied to distributed speech recognition over IP networks.
Angel M. Gomez, Juan J. Ramos-Muñoz, Antonio M. Peinado, Victoria E. Sánchez
2006Multi-layered summarization of spoken document archives by information extraction and semantic structuring.
Lin-Shan Lee, Sheng-yi Kong, Yi-Cheng Pan, Yi-Sheng Fu, Yu-tsun Huang
2006Multi-microphone periodicity function for robust F0 estimation in real noisy and reverberant environments.
Federico Flego, Maurizio Omologo
2006Multi-modal system ICANDO: intellectual computer assistant for disabled operators.
Alexey Karpov, Andrey Ronzhin, Alexandre Cadiou
2006Multi-source far-distance microphone selection and combination for automatic transcription of lectures.
Matthias Wölfel, Christian Fügen, Shajith Ikbal, John W. McDonough
2006Multi-stream ASR: an oracle perspective.
Hemant Misra, Jithendra Vepa, Hervé Bourlard
2006Multi-stream speaker diarization systems for the meetings domain.
Ascensión Gallardo-Antolín, Xavier Anguera, Chuck Wooters
2006Multilingual non-native speech recognition using phonetic confusion-based acoustic model modification and graphemic constraints.
Ghazi Bouselmi, Dominique Fohr, Irina Illina, Jean Paul Haton
2006Multimodal authentication using qualitative support vector machines.
Fawaz Alsaade, Aladdin M. Ariyaeeinia, L. Meng, Amit S. Malegaonkar
2006Multistage convolutive blind source separation for speech mixture.
Yanxue Liang, Ichiro Hagiwara
2006Multivariate analysis of frame-based acoustic cues of dysperiodicities in connected speech.
Abdellah Kacha, Francis Grenez, Jean Schoentgen
2006Nasality perception of vowels in different language background.
Shahina Haque, Tomio Takara
2006Native and nonnative audio-visual perception of English fricatives in quiet and cafe-noise backgrounds.
Yue Wang, Dawn M. Behne, Haisheng Jiang, Chad Danyluck
2006New 20-word lists for word intelligibility test in Japanese.
Shuichi Sakamoto, Tadahiro Yoshikawa, Shigeaki Amano, Yôiti Suzuki, Tadahisa Kondo
2006New considerations for vowel nasalization based on separate mouth-nose recording.
Gang Feng, Cyril Kotenkoff
2006New improvements in decoding speed and latency for automatic captioning.
Jian Xue, Rusheng Hu, Yunxin Zhao
2006New measures to chart toddlers' speech perception and language development: a test of the lexical restructuring hypothesis.
Iris-Corinna Schwarz, Denis Burnham
2006Ninth International Conference on Spoken Language Processing, INTERSPEECH-ICSLP 2006, Pittsburgh, PA, USA, September 17-21, 2006
2006Noise robust model-based voice activity detection.
Ángel de la Torre, Javier Ramírez, M. Carmen Benítez, José C. Segura, Luz García, Antonio J. Rubio
2006Noise update modeling for speech enhancement: when do we do enough?
Nitish Krishnamurthy, John H. L. Hansen
2006Noise-robust speech recognition of conversational telephone speech.
Gang Chen, Hesham Tolba, Douglas D. O'Shaughnessy
2006Noisy speech recognition based on selection of multiple noise suppression methods using noise GMMs.
Norihide Kitaoka, Souta Hamaguchi, Seiichi Nakagawa
2006Non-intrusive speech quality assessment with low computational complexity.
Volodya Grancharov, David Yuheng Zhao, Jonas Lindblom, W. Bastiaan Kleijn
2006Nonlinear dynamical invariants for speech recognition.
S. Prasad, Sundararajan Srinivasan, M. Pannuri, Georgios Y. Lazarou, Joseph Picone
2006Normalization of the inter-frame information using smoothing filtering.
Luz García, José C. Segura, M. Carmen Benítez, Javier Ramírez, Ángel de la Torre
2006Novel entropy based moving average refiners for HMM landmarks.
Rahul Chitturi, Mark Hasegawa-Johnson
2006Novel method for data clustering and mode selection with application in voice conversion.
Jani Nurminen, Jilei Tian, Victor Popa
2006Novel time domain multi-class SVMs for landmark detection.
Rahul Chitturi, Mark Hasegawa-Johnson
2006Objective estimation of suicidal risk using vocal output characteristics.
T. Yingthawornsuk, H. Kaymaz Keskinpala, Daniel J. France, D. Mitchell Wilkes, Richard G. Shiavi, Ronald M. Salomon
2006Observations of the spoken language acquisition process based on a multimodal infant behavior corpus.
Ryo Tsuji, Tomohiko Kasami, Shogo Ishikawa, Shinya Kiriyama, Yoichi Takebayashi, Shigeyoshi Kitazawa
2006On a greedy learning algorithm for dPLRM with applications to phonetic feature detection.
Tor André Myrvoll, Tomoko Matsui
2006On designing context sensitive language models for spoken dialog systems.
Vaibhava Goel, Ramesh A. Gopinath
2006On speaker-specific prosodic models for automatic dialog act segmentation of multi-party meetings.
Jáchym Kolár, Elizabeth Shriberg, Yang Liu
2006On speech variation and word type differentiation by articulatory feature representations.
Louis ten Bosch, R. Harald Baayen, Mirjam Ernestus
2006On the correlation between energy and pitch accent in read English speech.
Andrew Rosenberg, Julia Hirschberg
2006On the fusion of prosody, voice spectrum and face features for multimodal person verification.
M. Farrús, Ainara Garde, Pascual Ejarque, Jordi Luque, Javier Hernando
2006On the relation between maximum spectral transition positions and phone boundaries.
Sorin Dusan, Lawrence R. Rabiner
2006On the sufficiency and redundancy of pitch for TRP projection.
Wieneke Wesseling, Rob van Son, Louis C. W. Pols
2006On the sufficiency of automatic phonetic transcriptions for pronunciation variation research.
Christophe Van Bael, Hans van Halteren
2006On the use of Jacobian adaptation in real speaker verification applications.
Jan Anguita, Javier Hernando
2006On the use of morphological analysis for dialectal Arabic speech recognition.
Mohamed Afify, Ruhi Sarikaya, Hong-Kwang Jeff Kuo, Laurent Besacier, Yuqing Gao
2006Online speaker change detection by combining BIC with microphone array beamforming.
Joerg Schmalenstroeer, Reinhold Haeb-Umbach
2006Online speech detection and dual-gender speech recognition for captioning broadcast news.
Toru Imai, Shoei Sato, Akio Kobayashi, Kazuo Onoe, Shinichi Homma
2006Open-vocabulary spoken document retrieval based on new subword models and subword phonetic similarity.
Kohei Iwata, Yoshiaki Itoh, Kazunori Kojima, Masaaki Ishigame, Kazuyo Tanaka, Shi-wook Lee
2006Opinion mining in a telephone survey corpus.
Nathalie Camelin, Géraldine Damnati, Frédéric Béchet, Renato De Mori
2006Optimization of class weights for LDA feature transformations.
Andrej Ljolje
2006Optimizing components for handheld two-way speech translation for an English-Iraqi Arabic system.
Roger Hsiao, Ashish Venugopal, Thilo Köhler, Ying Zhang, Paisarn Charoenpornsawat, Andreas Zollmann, Stephan Vogel, Alan W. Black, Tanja Schultz, Alex Waibel
2006Pauses as a tool to ensure rhythmic wellformedness.
Augustin Speyer
2006Perception of fundamental frequency in cochlear implant patients.
Ángel de la Torre, Cristina Roldán, Manuel Sainz
2006Perceptive and acoustic measurement of average speaking pitch of female and male speakers in German radio news.
Sven Grawunder, Ines Bose, Birgit Hertha, Franziska Trauselt, Lutz Christian Anders
2006Perceptual identification and phonetic analysis of 6 foreign accents in French.
Bianca Vieru-Dimulescu, Philippe Boula de Mareüil
2006Performance analysis of various single channel speech enhancement algorithms for automatic speech recognition.
Myung-Suk Song, Chang-Heon Lee, Hong-Goo Kang
2006Performance evaluation of three features for model-based single channel speech separation problem.
Mohammad H. Radfar, Richard M. Dansereau, Abolghasem Sayadiyan
2006Performance improvement of dialog speech translation by rejecting unreliable utterances.
Toshiyuki Takezawa, Tohru Shimizu
2006Perplexity based linguistic model adaptation for speech summarisation.
Pierre Chatain, Edward W. D. Whittaker, Joanna Mrozinski, Sadaoki Furui
2006Personality factors in human deception detection: comparing human to machine performance.
Frank Enos, Stefan Benus, Robin L. Cautin, Martin Graciarena, Julia Hirschberg, Elizabeth Shriberg
2006Phone recognition analysis for trajectory HMM.
Le Zhang, Steve Renals
2006Phone vector DHMM to decode a phone recognizer's output.
Bong-Wan Kim, Dae-Lim Choi, Yongnam Um, Yong-Ju Lee
2006Phoneme recognition based on fisher weight map to higher-order local auto-correlation.
Yasuo Ariki, Shunsuke Kato, Tetsuya Takiguchi
2006Phoneme-to-grapheme mapping for spoken inquiries to the semantic web.
Axel Horndasch, Elmar Nöth, Anton Batliner, Volker Warnke
2006Phonetic research on accented Chinese in three dialectal regions: Shanghai, Wuhan and Xiamen.
Aijun Li, Qiang Fang, Ziyu Xiong
2006Phonetically enriched labeling in unit selection TTS synthesis.
Yeon-Jun Kim, Ann K. Syrdal, Alistair Conkie, Marc C. Beutnagel
2006Phrase break prediction using logistic generalized linear model.
Lifu Yi, Jian Li, Xiaoyan Lou, Jie Hao
2006Physiologically-motivated synchrony-based processing for robust automatic speech recognition.
Chanwoo Kim, Yu-Hsiang Bosco Chiu, Richard M. Stern
2006Pitch determination using aligned AMDF.
M. Shahidur Rahman, Hirobumi Tanaka, Tetsuya Shimamura
2006Pitch range and pause duration as markers of discourse hierarchy: perception experiments.
Jörg Mayer, Ekaterina Jasinskaja, Ulrike Kölsch
2006Pitch resynchronization while recovering from a late frame in a predictive speech decoder.
Kyle D. Anderson, Philippe Gournay
2006Pitch-scale modification using the modulated aspiration noise source.
Daryush D. Mehta, Thomas F. Quatieri
2006Posterior based keyword spotting with a priori thresholds.
Hamed Ketabdar, Jithendra Vepa, Samy Bengio, Hervé Bourlard
2006Potential relevance of audio-visual integration in mammals for computational modeling.
Eeva Klintfors, Francisco Lacerda
2006Powered cepstral normalization (p-CN) for robust features in speech recognition.
Chang-Wen Hsu, Lin-Shan Lee
2006Productions in bilinguism, early foreign language learning and monolinguism: a prosodic comparison.
Ranka Bijeljac-Babic, Christelle Dodane, Sabine Metta, Claire Gerard
2006Prominent words as anchors for TRP projection.
Rob van Son, Wieneke Wesseling, Louis C. W. Pols
2006Prompt selection with reinforcement learning in an AT&T call routing application.
Charles Lewis, Giuseppe Di Fabbrizio
2006Pronunciation dependent language models.
Andrej Ljolje
2006Pronunciation variant-based multi-path HMMs for syllables.
Annika Hämäläinen, Louis ten Bosch, Lou Boves
2006Pronunciation variation modeling for Mandarin with accent.
Chi Zhang, Ji Wu, Xi Xiao, Zuoying Wang
2006Pronunciation verification of children's speech for automatic literacy assessment.
Joseph Tepperman, Jorge F. Silva, Abe Kazemzadeh, Hong You, Sungbok Lee, Abeer Alwan, Shrikanth S. Narayanan
2006Prosodic boundaries in Czech: an experiment based on delexicalized speech.
Tomás Dubeda
2006Prosodic feature generation for back-channel prediction.
Thamar Solorio, Olac Fuentes, Nigel G. Ward, Yaffa Al Bayyari
2006Prosodic features for a maximum entropy language model.
Oscar Chan, Roberto Togneri
2006Prosodic features for speaker verification.
Leena Mary, B. Yegnanarayana
2006Prosodic modeling in large vocabulary Mandarin speech recognition.
Jui-Ting Huang, Lin-Shan Lee
2006Prosody of interrogative and affirmative sentences in Vietnamese language: analysis and perceptive results.
Minh-Quang Vu, Do Dat Tran, Eric Castelli
2006Prototyping a CALL system for students of Japanese using dynamic diagram generation and interactive hints.
Christopher J. Waple, Yasushi Tsubota, Masatake Dantsuji, Tatsuya Kawahara
2006QASR: question answering using semantic roles for speech interface.
Svetlana Stenchikova, Dilek Hakkani-Tür, Gökhan Tür
2006Quality improvement of telephone speech by artificial bandwidth expansion - listening tests in three languages.
Hannu Pulakka, Laura Laaksonen, Paavo Alku
2006Question answering with discriminative learning algorithms.
Junlan Feng
2006Quick individual fitting methods of simplified hearing compensation for elderly people.
Kengo Fujita, Tsuneo Kato, Hisashi Kawai
2006Radiobot-CFF: a spoken dialogue system for military training.
Antonio Roque, Anton Leuski, Vivek Kumar Rangarajan Sridhar, Susan Robinson, Ashish Vaswani, Shrikanth S. Narayanan, David R. Traum
2006Rapid simulation-driven reinforcement learning of multimodal dialog strategies in human-robot interaction.
Thomas Prommer, Hartwig Holzapfel, Alex Waibel
2006Rapid speaker adaptation using regression-tree based spectral peak alignment.
Shizhen Wang, Xiaodong Cui, Abeer Alwan
2006Real vs. acted emotional speech.
Janneke Wilting, Emiel Krahmer, Marc Swerts
2006Real-life emotions detection with lexical and paralinguistic cues on human-human call center dialogs.
Laurence Devillers, Laurence Vidrascu
2006Real-time synthesis of Chinese visual speech and facial expressions using MPEG-4 FAP features in a three-dimensional avatar.
Zhiyong Wu, Shen Zhang, Lianhong Cai, Helen M. Meng
2006Realizations and representations of Thai tones in monomoraic syllables.
Rattima Nitisaroj
2006Recent advances in phonotactic language recognition using binary-decision trees.
Jirí Navrátil
2006Recent advances in speech fragment decoding techniques.
Jon Barker, André Coy, Ning Ma, Martin Cooke
2006Recent advances of IBM's handheld speech translation system.
Weizhong Zhu, Bowen Zhou, Charles Prosser, Pavel Krbec, Yuqing Gao
2006Recent progress on the discriminative region-dependent transform for speech feature extraction.
Bing Zhang, Spyros Matsoukas, Richard M. Schwartz
2006Recognition of classroom lectures in European Portuguese.
Isabel Trancoso, Ricardo Nunes, Luís Neves, Céu Viana, Helena Moniz, Diamantino Caseiro, Ana Isabel Mata
2006Recognition of interest in human conversational speech.
Björn W. Schuller, Niels Köhler, Ronald Müller, Gerhard Rigoll
2006Reconstructing tongue movements from audio and video.
Hedvig Kjellström, Olov Engwall, Olle Bälter
2006Reducing computation on parallel decoding using frame-wise confidence scores.
Tomohiro Hakamata, Akinobu Lee, Yoshihiko Nankaku, Keiichi Tokuda
2006Reducing speech coding distortion for speaker identification.
Alan McCree
2006Redundancy and productivity in the speech technology lexicon - can we do better?
Susan Fitt, Korin Richmond
2006Respiratory/laryngeal interactions during sustained vowel production in children.
Donald S. Finan, Carol A. Boliek
2006Robust acoustic-based syllable detection.
Zhimin Xie, Partha Niyogi
2006Robust automatic speech recognition for accented Mandarin in car environments.
Pei Ding, Lei He, Xiang Yan, Jie Hao
2006Robust feature extraction based on spectral peaks of group delay and autocorrelation function and phase domain analysis.
Gholamreza Farahani, Seyed Mohammad Ahadi, Mohammad Mehdi Homayounpour
2006Robust feature space adaptation for telephony speech recognition.
Xin Lei, Jon Hamaker, Xiaodong He
2006Robust interpretation in dialogue by combining confidence scores with contextual features.
Matthew Purver, Florin Ratiu, Lawrence Cavedon
2006Robust phone lattice decoding.
Kris Demuynck, Dirk Van Compernolle, Hugo Van hamme
2006Robust speaker diarization for meetings: ICSI RT06s evaluation system.
Xavier Anguera, Chuck Wooters, José M. Pardo
2006Robust speech recognition by modifying clean and telephone feature vectors using bidirectional neural network.
Mansoor Vali, Seyyed Ali Seyyed Salehi, Kazem Karimi
2006Robust speech recognition over mobile networks using combined weighted Viterbi decoding and subvector based error concealment.
Zheng-Hua Tan, Paul Dalsgaard, Børge Lindberg
2006Role of phase estimation in speech enhancement.
Benjamin J. Shannon, Kuldip K. Paliwal
2006SPAM and full covariance for speech recognition.
Daniel Povey
2006Saliency parsing for automated directory assistance.
Issac Alphonso, Shuangyu Chang
2006Scalable and portable web-based multimodal dialogue interaction with geographical databases.
Alexander Gruenstein, Stephanie Seneff, Chao Wang
2006Segment connection networks for corpus-based speech synthesis.
Geert Coorman
2006Segmental duration modeling in Turkish.
Özlem Öztürk, Tolga Çiloglu
2006Selective-LPC based representation of STRAIGHT spectrum and its applications in spectral smoothing.
Heng Kang, Wenju Liu
2006Semi-automatic extraction of vocal tract movements from cineradiographic data.
Julie Fontecave, Frédéric Berthommier
2006Sentence boundary detection of spontaneous Japanese using statistical language model and support vector machines.
Yuya Akita, Masahiro Saikou, Hiroaki Nanjo, Tatsuya Kawahara
2006Sentence boundary detection using sequential dependency analysis combined with CRF-based chunking.
Takanobu Oba, Takaaki Hori, Atsushi Nakamura
2006Sequence classification for machine translation.
Srinivas Bangalore, Patrick Haffner, Stephan Kanthak
2006Signal modification incorporating perceptual weighting filter.
Joon-Hyuk Chang, Woohyung Lim, Nam Soo Kim
2006Significance of formants from difference spectrum for speaker identification.
Kishore Prahallad, Varanasi Sudhakar, Veluru Ranganatham, Krishna M. Bharat, S. Roy Debashish
2006Silence energy normalization for robust speech recognition in additive noise environment.
Chung-fu Tai, Jeih-weih Hung
2006Single channel speech enhancement by frequency domain constrained optimization and temporal masking.
Wen Jin, Michael S. Scordilis
2006Single frame selection for phoneme classification.
Tingyao Wu, Dirk Van Compernolle, Jacques Duchateau, Hugo Van hamme
2006Single-channel speech separation using sparse non-negative matrix factorization.
Mikkel N. Schmidt, Rasmus Kongsgaard Olsson
2006Six approaches to limited domain concatenative speech synthesis.
Robert J. Utama, Ann K. Syrdal, Alistair Conkie
2006Sloparl - Slovenian parliamentary speech and text corpus for large vocabulary continuous speech recognition.
Andrej Zgank, Tomaz Rotovnik, Matej Grasic, Marko Kos, Damjan Vlaj, Zdravko Kacic
2006Soft decision combining for dual channel noise reduction.
Timo Gerkmann, Rainer Martin
2006Soft margin estimation of hidden Markov model parameters.
Jinyu Li, Ming Yuan, Chin-Hui Lee
2006Software architectures for incremental understanding of human speech.
Gregory Aist, James F. Allen, Ellen Campana, Lucian Galescu, Carlos Gómez Gallo, Scott C. Stoness, Mary D. Swift, Michael K. Tanenhaus
2006Solving large margin estimation of HMMs via semidefinite programming.
Xinwei Li, Hui Jiang
2006Soundbite detection in broadcast news domain.
Sameer Maskey, Julia Hirschberg
2006Sparseness and speech perception in noise.
Guoping Li, Mark E. Lutman
2006Speaker adaptation of trajectory HMMs using feature-space MLLR.
Heiga Zen, Yoshihiko Nankaku, Keiichi Tokuda, Tadashi Kitamura
2006Speaker adaptation using evolutionary-based linear transform.
Sid-Ahmed Selouani, Douglas D. O'Shaughnessy
2006Speaker cluster based GMM tokenization for speaker recognition.
Bin Ma, Donglai Zhu, Rong Tong, Haizhou Li
2006Speaker clustered regression-class trees for MLLR adaptation.
Arindam Mandal, Mari Ostendorf, Andreas Stolcke
2006Speaker diarization for multiple distant microphone meetings: mixing acoustic features and inter-channel time differences.
José M. Pardo, Xavier Anguera, Chuck Wooters
2006Speaker identification under noisy environments by using harmonic structure extraction and reliable frame weighting.
Hiromasa Fujihara, Tetsuro Kitahara, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno
2006Speaker independent voiced-unvoiced detection evaluated in different speaking styles.
Martin Heckmann, Marco Moebus, Frank Joublin, Christian Goerick
2006Speaker localization based on oriented global coherence field.
Alessio Brutti, Maurizio Omologo, Piergiorgio Svaizer
2006Speaker verification with non-audible murmur segments.
Mariko Kojima, Tomoko Matsui, Hiromichi Kawanami, Hiroshi Saruwatari, Kiyohiro Shikano
2006Speaking aid system for total laryngectomees using voice conversion of body transmitted artificial speech.
Keigo Nakamura, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano
2006Speaking faces for face-voice speaker identity verification.
Girija Chetty, Michael Wagner
2006Specificity and generalizability of spontaneous phonetic imitation.
Kuniko Y. Nielsen
2006Speech analyzer using a joint estimation model of spectral envelope and fine structure.
Hirokazu Kameoka, Jonathan Le Roux, Nobutaka Ono, Shigeki Sagayama
2006Speech and speech recognition during dictation corrections.
Keith Vertanen
2006Speech enhancement based on residual noise shaping.
Jong Won Shin, Seung Yeol Lee, Hwan Sik Yun, Nam Soo Kim
2006Speech enhancement based on spectral estimation from higher-lag autocorrelation.
Benjamin J. Shannon, Kuldip K. Paliwal, Climent Nadeu
2006Speech enhancement using modified phase opponency model.
Om Deshmukh, Carol Y. Espy-Wilson
2006Speech recognition of foreign out-of-vocabulary words using a hierarchical language model.
Hirofumi Yamamoto, Gen-ichiro Kikui, Satoshi Nakamura, Yoshinori Sagisaka
2006Speech recognition using factorial hidden Markov models for separation in the feature space.
Tuomas Virtanen
2006Speech recognition with phonological features: some issues to attend.
Frederik Stouten, Jean-Pierre Martens
2006Speech technology for minority languages: the case of Irish (Gaelic).
Ailbhe Ní Chasaide, John Wogan, Brian Ó Raghallaigh, Áine Ní Bhriain, Eric Zoerner, Harald Berthelsen, Christer Gobl
2006Speech/non-speech discrimination combining advanced feature extraction and SVM learning.
Javier Ramírez, Pablo Yélamos, J. M. Górriz, José C. Segura, Luz García
2006Spoken language technologies applied to digital talking books.
Isabel Trancoso, Carlos Duarte, António Joaquim Serralheiro, Diamantino Caseiro, Luís Carriço, Céu Viana
2006Spontaneous Thai speech recognition.
Monika Woszczyna, Paisarn Charoenpornsawat, Tanja Schultz
2006State-level variable modeling for phoneme classification.
Hao-Zheng Li, Douglas D. O'Shaughnessy
2006Statistical analysis and performance of DFT domain noise reduction filters for robust speech recognition.
Colin Breithaupt, Rainer Martin
2006Steady-state suppression in reverberation: a comparison of native and nonnative speech perception.
Nao Hodoshima, Dawn M. Behne, Takayuki Arai
2006Stochastic vector mapping-based feature enhancement using prior model and environment adaptation for noisy speech recognition.
Chia-Hsin Hsieh, Chung-Hsien Wu, Jun-Yu Lin
2006Study of time and frequency variability in pathological speech and error reduction methods for automatic speech recognition.
Oscar Saz, Antonio Miguel, Eduardo Lleida, Alfonso Ortega, Luis Buera
2006Study on speaker verification on emotional speech.
Wei Wu, Thomas Fang Zheng, Ming-Xing Xu, Huanjun Bao
2006Sub-word unit based non-audible speech recognition using surface electromyography.
Matthias Walliczek, Florian Kraft, Szu-Chen Stan Jou, Tanja Schultz, Alex Waibel
2006Subspace modeling and selection for noisy speech recognition.
Jen-Tzung Chien, Chuan-Wei Ting
2006Substitute sounds for ventriloquism and speech disorders.
Jörg Metzner, Marcel Schmittfull, Karl Schnell
2006Summarization evaluation for text and speech: issues and approaches.
Ani Nenkova
2006Summarization of spontaneous conversations.
Xiaodan Zhu, Gerald Penn
2006Super-human multi-talker speech recognition: the IBM 2006 speech separation challenge system.
Trausti T. Kristjansson, John R. Hershey, Peder A. Olsen, Steven J. Rennie, Ramesh A. Gopinath
2006Syllable-length path mixture hidden Markov models with trajectory clustering for continuous speech recognition.
Yan Han, Lou Boves
2006Synthesizing breathiness in natural speech with sinusoidal modelling.
Brett Matthews, Raimo Bakis, Ellen Eide
2006System- versus user-initiative dialog strategy for driver information systems.
Chantal Ackermann, Marion Libossek
2006TDA: a new trainable trajectory formation system for facial animation.
Oxana Govokhina, Gérard Bailly, Gaspard Breton, Paul C. Bagshaw
2006Testing the effect of audiovisual cues to prominence via a reaction-time experiment.
Emiel Krahmer, Marc Swerts
2006Text-independent cross-language voice conversion.
David Sündermann, Harald Höge, Antonio Bonafonte, Hermann Ney, Julia Hirschberg
2006Text-independent speaker identification in birds.
E. J. S. Fox, J. D. Roberts, Mohammed Bennamoun
2006The 2006 RWTH parliamentary speeches transcription system.
Jonas Lööf, Maximilian Bisani, Christian Gollan, Georg Heigold, Björn Hoffmeister, Christian Plahl, Ralf Schlüter, Hermann Ney
2006The IBM 2006 speech transcription system for European parliamentary speeches.
Bhuvana Ramabhadran, Olivier Siohan, Lidia Mangu, Geoffrey Zweig, Martin Westphal, Henrik Schulz, Alvaro Soneiro
2006The ICSI+ multilingual sentence segmentation system.
M. Zimmerman, Dilek Hakkani-Tür, James G. Fung, Nikki Mirghafori, Luke R. Gottlieb, Elizabeth Shriberg, Yang Liu
2006The importance of different facial areas for signalling visual prominence.
Marc Swerts, Emiel Krahmer
2006The role of positional probability in the segmentation of Cantonese speech.
Michael C. W. Yip
2006The role of prosody in the perception of US native English accents.
Ayako Ikeno, John H. L. Hansen
2006The segmentation of multi-channel meeting recordings for automatic speech recognition.
John Dines, Jithendra Vepa, Thomas Hain
2006The target cost formulation in unit selection speech synthesis.
Paul Taylor
2006The use of Bayesian network for incorporating accent, gender and wide-context dependency information.
Sakriani Sakti, Konstantin Markov, Satoshi Nakamura
2006The vocal joystick data collection effort and vowel corpus.
Kelley Kilanski, Jonathan Malkin, Xiao Li, Richard Wright, Jeff A. Bilmes
2006Thesaurus expansion using similar word pairs from patent documents.
Yoshimi Suzuki, Fumiyo Fukumoto
2006Time-dependent cross-probability model for multi-environment model based linear normalization.
Luis Buera, Eduardo Lleida, Juan Arturo Nolazco-Flores, Antonio Miguel, Alfonso Ortega
2006Timing levels in segment-based speech emotion recognition.
Björn W. Schuller, Gerhard Rigoll
2006Tone recognition of continuous speech of standard Chinese using neural network and tone nucleus model.
Keikichi Hirose, Hui Hu, Xiaodong Wang, Nobuaki Minematsu
2006Topic-based language modeling with dynamic Bayesian networks.
Pascal Wiggers, Léon J. M. Rothkrantz
2006Totally data-driven duration modeling based on generalized linear model for Mandarin TTS.
Lifu Yi, Jian Li, Xiaoyan Lou, Jie Hao
2006Totally data-driven intonation prediction model using a novel F0 contour parametric representation.
Lifu Yi, Jian Li, Xiaoyan Lou, Jie Hao
2006Towards a comprehensive investigation of factors relevant to peak alignment using a unit selection corpus.
Matthias Jilka, Bernd Möbius
2006Towards a multimodal topic tracking system for a mobile robot.
Jan Frederik Maas, Britta Wrede, Gerhard Sagerer
2006Towards an integrated understanding of speaking rate in conversation.
Jiahong Yuan, Mark Y. Liberman, Christopher Cieri
2006Towards automatic parameter extraction of command-response model for Cantonese.
Raymond W. M. Ng, Tan Lee, Wentao Gu
2006Towards continuous speech recognition using surface electromyography.
Szu-Chen Stan Jou, Tanja Schultz, Matthias Walliczek, Florian Kraft, Alex Waibel
2006Tracking and beamforming for multiple simultaneous speakers with probabilistic data association filters.
Tobias Gehrig, Ulrich Klee, John W. McDonough, Shajith Ikbal, Matthias Wölfel, Christian Fügen
2006Tracking of involuntary formant frequency variations and application to parkinsonian speech.
Laurence Cnockaert, Jean Schoentgen, Pascal Auzou, Canan Ozsancak, Francis Grenez
2006Tracking of visible vocal tract resonances (VVTR) based on Kalman filtering.
I. Yücel Özbek, Mübeccel Demirekler
2006Training native English speakers to identify Japanese vowel length with fast rate sentences.
Yukari Hirata, Elizabeth Whitehurst, Emily Cullings, Jacob Whiton, Carol Glenn
2006Training of coarticulation models using dominance functions and visual unit selection methods for audio-visual speech synthesis.
Zdenek Krnoul, Milos Zelezný, Ludek Müller, Jakub Kanis
2006Two stage transform vector quantization of LSFs for wideband speech coding.
Saikat Chatterjee, T. V. Sreenivas
2006Two-microphone voice activity detection in the presence of coherent interference.
Gibak Kim, Nam Ik Cho
2006Two-stage vocabulary-free spoken document retrieval - subword identification and re-recognition of the identified sections.
Yoshiaki Itoh, Takayuki Otake, Kohei Iwata, Kazunori Kojima, Masaaki Ishigame, Kazuyo Tanaka, Shi-wook Lee
2006Two-step unsupervised speaker adaptation based on speaker and gender recognition and HMM combination.
Petr Cerva, Jan Nouza, Jan Silovský
2006Underlying quality dimensions of modern telephone connections.
Marcel Wältermann, Kirstin Scholz, Alexander Raake, Ulrich Heute, Sebastian Möller
2006Unfilled pauses in Japanese sentences read aloud by non-native learners.
Hiroko Hirano, Goh Kawai, Keikichi Hirose, Nobuaki Minematsu
2006Unifying unit selection and hidden Markov model speech synthesis.
Paul Taylor
2006Unit selection and its relation to symbolic prosody: a new approach.
Daniel Tihelka, Jindrich Matousek
2006Unsupervised Spanish dialect classification.
Rongqing Huang, John H. L. Hansen
2006Unsupervised adaptation for acoustic language identification.
Ekaterina Timoshenko, Josef G. Bauer
2006Unsupervised detection of whispered speech in the presence of normal phonation.
Michael A. Carlin, Brett Y. Smolenski, Stanley J. Wenndt
2006Unsupervised language model adaptation based on automatic text collection from WWW.
Motoyuki Suzuki, Yasutomo Kajiura, Akinori Ito, Shozo Makino
2006Unsupervised language model adaptation for Mandarin broadcast conversation transcription.
David Mrva, Philip C. Woodland
2006Unsupervised language model adaptation using latent semantic marginals.
Yik-Cheung Tam, Tanja Schultz
2006Unsupervised learning of HMM topology for text-dependent speaker verification.
Ming Liu, Thomas S. Huang
2006Unsupervised model adaptation for speaker verification.
Alexandre Preti, Jean-François Bonastre
2006Unsupervised segmentation of words into morphemes - Morpho Challenge 2005 application to automatic speech recognition.
Mikko Kurimo, Mathias Creutz, Matti Varjokallio, Ebru Arisoy, Murat Saraclar
2006Use of incrementally regulated discriminative margins in MCE training for speech recognition.
Dong Yu, Li Deng, Xiaodong He, Alex Acero
2006User expectations and real experience on a multimodal interactive system.
Kristiina Jokinen, Topi Hurtig
2006User responses to prosodic variation in fragmentary grounding utterances in dialog.
Gabriel Skantze, David House, Jens Edlund
2006User simulation for spoken dialogue systems: learning and evaluation.
Kallirroi Georgila, James Henderson, Oliver Lemon
2006Using SVM and error-correcting codes for multiclass dialog act classification in meeting corpus.
Yang Liu
2006Using a differential microphone array to estimate the direction of arrival of two acoustic sources.
Fotios Talantzis, Anthony G. Constantinides, Lazaros C. Polymenakos
2006Using genetic algorithms to weight acoustic features for speaker recognition.
Maider Zamalloa, Germán Bordel, Luis Javier Rodríguez, Mikel Peñagarikano, Juan Pedro Uribe
2006Using latent semantic indexing for morph-based spoken document retrieval.
Ville T. Turunen, Mikko Kurimo
2006Using posterior-based features in template matching for speech recognition.
Guillermo Aradilla, Jithendra Vepa, Hervé Bourlard
2006Using speech recognition technique for constructing a phonetically transcribed Taiwanese (Min-Nan) text corpus.
Min-Siong Liang, Ren-yuan Lyu, Yuang-Chin Chiang
2006Using system and user performance features to improve emotion detection in spoken tutoring dialogs.
Hua Ai, Diane J. Litman, Katherine Forbes-Riley, Mihai Rotaru, Joel R. Tetreault, Amruta Purandare
2006Vector Taylor series based joint uncertainty decoding.
Haitian Xu, Luca Rigazio, David Kryze
2006Vector-based spoken language recognition using output coding.
Haizhou Li, Bin Ma, Rong Tong
2006Visual correlates to prominence in several expressive modes.
Jonas Beskow, Björn Granström, David House
2006Visual speech segmentation and speaker recognition for transcription of TV news.
Josef Chaloupka
2006Vocal emotion recognition with cochlear implants.
Xin Luo, Qian-Jie Fu, John J. Galvin III
2006Voice GMM modelling for FESTIVAL/MBROLA emotive TTS synthesis.
Mauro Nicolao, Carlo Drioli, Piero Cosi
2006Voice activity detection in personal audio recordings using autocorrelogram compensation.
Keansub Lee, Daniel P. W. Ellis
2006Voice activity detector based on enhanced cumulant of LPC residual and on-line EM algorithm.
David Cournapeau, Tatsuya Kawahara, Kenji Mase, Tomoji Toriyama
2006Voice conversion based on mixtures of factor analyzers.
Yosuke Uto, Yoshihiko Nankaku, Tomoki Toda, Akinobu Lee, Keiichi Tokuda
2006Voice source correlates of prosodic features in American English: a pilot study.
Markus Iseli, Yen-Liang Shue, Melissa A. Epstein, Patricia A. Keating, Jody Kreiman, Abeer Alwan
2006Voting for two speaker segmentation.
Narayanaswamy Balakrishnan, Rashmi Gangadharaiah, Richard M. Stern
2006Wavelet ridge track interpretation in terms of formants.
Salma Chaari, Kaïs Ouni, Noureddine Ellouze
2006Weighted codebook mapping for noisy speech enhancement using harmonic-noise model.
Esfandiar Zavarehei, Saeed Vaseghi, Qin Yan
2006Within-class covariance normalization for SVM-based speaker recognition.
Andrew O. Hatch, Sachin S. Kajarekar, Andreas Stolcke
2006Word intelligibility estimation of noise-reduced speech.
Takeshi Yamada, Masakazu Kumakura, Nobuhiko Kitawaki
2006Word order and tonal shape in the production of focus in short Finnish utterances.
Martti Vainio, Juhani Järvikivi, Stefan Werner
2006Word structure and tone perception in Mandarin.
Hansjörg Mixdorff, Yu Hu