| 2008 | "look at the shark": evaluation of student produced standardized sentences of infant- and foreigner-directed speech. Monja A. Knoll, Lisa Scharrer |
| 2008 | "your baby can't hear you": how mothers talk to infants with simulated hearing loss. Christa Lam, Christine Kitamura |
| 2008 | 9th Annual Conference of the International Speech Communication Association, INTERSPEECH 2008, Brisbane, Australia, September 22-26, 2008 |
| 2008 | A 'speechiness' measure to improve speech decoding in the presence of other sound sources. Ning Ma, Phil D. Green |
| 2008 | A 3-d virtual head as a tool for speech therapy for children. Sascha Fagel, Katja Madany |
| 2008 | A 8.32 kb/s embedded wideband speech coding candidate for ITU-t EV-VBR standardization. Changchun Bao, Hai-ting Li, Ze-xin Liu, Rui Fan, Heng Zhu, Mao-shen Jia, Rui Li |
| 2008 | A Bayesian approach to semantic composition for spoken language interpretation. Marie-Jean Meurs, Fabrice Lefèvre, Renato De Mori |
| 2008 | A Japanese CALL system based on dynamic question generation and error prediction for ASR. Hongcui Wang, Tatsuya Kawahara |
| 2008 | A Niuean variant of New Zealand English? Donna Starks, Laura Thompson, Catherine Inez Watson |
| 2008 | A PCM coding noise reduction for ITU-t g.711.1. Jean-Luc Garcia, Claude Marro, Balázs Kövesi |
| 2008 | A browsing system for classroom lecture speech. Shingo Togashi, Seiichi Nakagawa |
| 2008 | A closer look on hierarchical spectro-temporal features (HIST). Martin Heckmann, Xavier Domont, Frank Joublin, Christian Goerick |
| 2008 | A combination of data mining method with decision trees building for speech/music discrimination. Qiong Wu, Qin Yan, Jun Wang, Jun Hong |
| 2008 | A comparative study in automatic recognition of broadcast audio. Stavros Ntalampiras, Nikos Fakotakis |
| 2008 | A comparative study on AM and FM features. Yotaro Kubo, Shigeki Okawa, Akira Kurematsu, Katsuhiko Shirai |
| 2008 | A comparative study on dissyllabic stress patterns of Mandarin and Cantonese. Weixiang Hu, Jin Jian, Aijun Li, Xia Wang |
| 2008 | A comparison of broad phonetic and acoustic units for noise robust segment-based phonetic recognition. Tara N. Sainath, Victor Zue |
| 2008 | A comparison of input entry rates in a multimodal mobile application. Aleksi Melto, Markku Turunen, Jaakko Hakulinen, Anssi Kainulainen, Tomi Heimonen |
| 2008 | A comparison of subspace feature-domain methods for language recognition. William M. Campbell, Douglas E. Sturim, Pedro A. Torres-Carrasquillo, Douglas A. Reynolds |
| 2008 | A comparison of two acoustic measurement approaches to the rhythm continuum of natural Chinese and English speech. Matthew Benton, Liz Dockendorf |
| 2008 | A comparison of voice conversion methods for transforming voice quality in emotional speech synthesis. Oytun Türk, Marc Schröder |
| 2008 | A composite framework for affective sensing. Gordon McIntyre, Roland Göcke |
| 2008 | A comprehensive study on the effects of room reverberation on fundamental frequency estimation. Rico Petrick, Masashi Unoki, Anish Mittal, Carlos Segura, Rüdiger Hoffmann |
| 2008 | A computational model of language acquisition: focus on word discovery. Louis ten Bosch, Hugo Van hamme, Lou Boves |
| 2008 | A computationally efficient approach to warp factor estimation in VTLN using EM algorithm and sufficient statistics. P. T. Akhil, Shakti Prasad Rath, Srinivasan Umesh, D. Rama Sanand |
| 2008 | A corpus-based prosodic study of Alsatian, Belgian and Swiss French. Cécile Woehrling, Philippe Boula de Mareüil, Martine Adda-Decker, Lori Lamel |
| 2008 | A data format enabling interoperation of speech recognition, translation and information extraction engines: the GALE type system. John F. Pitrelli, Burn L. Lewis, Edward A. Epstein, Jerome L. Quinn, Ganesh N. Ramaswamy |
| 2008 | A development of Czech talking head. Zdenek Krnoul, Milos Zelezný |
| 2008 | A dual microphone coherence based method for speech enhancement in headsets. Mohsen Rahmani, Ahmad Akbari, Beghdad Ayad |
| 2008 | A fast speaker adaptation method using aspect model. Seongjun Hahm, Akinori Ito, Shozo Makino, Motoyuki Suzuki |
| 2008 | A feature compensation approach using high-order vector taylor series approximation of an explicit distortion model for noisy speech recognition. Jun Du, Qiang Huo |
| 2008 | A frequency domain approach for speech enhancement with directionality using compact microphone array. Heng Zhang, Qiang Fu, Yonghong Yan |
| 2008 | A generalised derivative kernel for speaker verification. Chris Longworth, Mark J. F. Gales |
| 2008 | A hybrid SVM/MCE training approach for vector space topic identification of spoken audio recordings. Timothy J. Hazen, Fred Richardson |
| 2008 | A hybrid speech signal based algorithm for pitch marking using finite state machines. Hussein Hussein, Matthias Wolff, Oliver Jokisch, Frank Duckhorn, Guntram Strecha, Rüdiger Hoffmann |
| 2008 | A language-modeling approach to inverse text normalization and data cleanup for multimodal voice search applications. Yun-Cheng Ju, Julian Odell |
| 2008 | A long state vector kalman filter for speech enhancement. Stephen So, Kuldip K. Paliwal |
| 2008 | A low-power hardware search architecture for speech recognition. Patrick J. Bourke, Rob A. Rutenbar |
| 2008 | A method for automatic and dynamic estimation of discourse genre typology with prosodic features. Nicolas Obin, Anne Lacheret-Dujour, Christophe Veaux, Xavier Rodet, Anne-Catherine Simon |
| 2008 | A method for automatically estimating F0 model parameters and a speech re-synthesis tool using F0 model and STRAIGHT. Shota Sato, Taro Kimura, Yasuo Horiuchi, Masafumi Nishida, Shingo Kuroiwa, Akira Ichikawa |
| 2008 | A methodology and tool suite for evaluation of accuracy of interoperating statistical natural language processing engines. Uma Murthy, John F. Pitrelli, Ganesh N. Ramaswamy, Martin Franz, Burn L. Lewis |
| 2008 | A minimum classification error based distance measure for template based speech recognition. Mike Matton, Dirk Van Compernolle, Ronald Cools |
| 2008 | A model based investigation of activation patterns of the tongue muscles for vowel production. Qiang Fang, Satoru Fujita, Xugang Lu, Jianwu Dang |
| 2008 | A neural network based nonlinear feature transformation for speech recognition. Hongbing Hu, Stephen A. Zahorian |
| 2008 | A new fast algebraic fixed codebook search algorithm in CELP speech coding. Vaclav Eksler, Redwan Salami, Milan Jelinek |
| 2008 | A non-acoustic approach to crosslingual speech recognition performance prediction. Chen Liu, Lynette Melnar |
| 2008 | A novel approach in continuous speech recognition for Vietnamese, an isolating tonal language. Hong Quang Nguyen, Pascal Nocera, Eric Castelli, Van Loan Trinh |
| 2008 | A novel transcoding algorithm between 3GPP AMR-NB (7.95kbit/s) and ITU-t g.729a (8kbit/s). Hao Xu, Changchun Bao |
| 2008 | A penalized logistic regression approach to detection based phone classification. Sabato Marco Siniscalchi, Torbjørn Svendsen, Chin-Hui Lee |
| 2008 | A phase-averaged model for the relationship between noisy speech, clean speech and noise in the log-mel domain. Friedrich Faubel, John W. McDonough, Dietrich Klakow |
| 2008 | A phonetic assessment of cross-language voice conversion. Kayoko Yanagisawa, Mark A. Huckvale |
| 2008 | A posterior approach for microphone array based speech recognition. Dong Wang, Ivan Himawan, Joe Frankel, Simon King |
| 2008 | A posteriori SNR weighted energy based variable frame rate analysis for speech recognition. Zheng-Hua Tan, Børge Lindberg |
| 2008 | A probabilistic trajectory synthesis system for synthesising visual speech. Barry-John Theobald, Nicholas Wilkinson |
| 2008 | A rank-predicted pseudo-greedy approach to efficient text selection from large-scale corpus for maximum coverage of target units. Wei Li, Qiang Huo |
| 2008 | A real-time text to audio-visual speech synthesis system. Lijuan Wang, Xiaojun Qian, Lei Ma, Yao Qian, Yining Chen, Frank K. Soong |
| 2008 | A reliable technique for detecting the second subglottal resonance and its use in cross-language speaker adaptation. Shizhen Wang, Steven M. Lulich, Abeer Alwan |
| 2008 | A seven-tone dialect in southern China with falling-rising-falling contour: a linguistic acoustic analysis. Xiaonong Zhu, Caicai Zhang |
| 2008 | A shrinkage estimator for speech recognition with full covariance HMMs. Peter Bell, Simon King |
| 2008 | A speech enhancement approach using piecewise linear approximation of an explicit model of environmental distortions. Jun Du, Qiang Huo |
| 2008 | A spoken language interpretation component for a robot dialogue system. Enes Makalic, Ingrid Zukerman, Michael Niemann |
| 2008 | A statistical model-based voice activity detection employing minimum classification error technique. Sang-Ick Kang, Ji-Hyun Song, Kye-Hwan Lee, Yun-Sik Park, Joon-Hyuk Chang |
| 2008 | A study of pitch patterns of Japanese English analyzed via comparative linguistic features of English and Japanese. Tomoko Nariai, Kazuyo Tanaka |
| 2008 | A study of unsupervised clustering techniques for language modeling. Sangyun Hahn, Abhinav Sethy, Hong-Kwang Jeff Kuo, Bhuvana Ramabhadran |
| 2008 | A trainable trajectory formation model TD-HMM parameterized for the LIPS 2008 challenge. Gérard Bailly, Oxana Govokhina, Gaspard Breton, Frédéric Elisei, Christophe Savariaux |
| 2008 | A vowel based approach for acted emotion recognition. Fabien Ringeval, Mohamed Chetouani |
| 2008 | A wavelet based speech enhancement method using noise classification and shaping. Mehdi Mohammadi, Behzad Zamani, Babak Nasersharif, Mohsen Rahmani, Ahmad Akbari |
| 2008 | Abandoning emotion classes - towards continuous emotion recognition with modelling of long-range dependencies. Martin Wöllmer, Florian Eyben, Stephan Reiter, Björn W. Schuller, Cate Cox, Ellen Douglas-Cowie, Roddy Cowie |
| 2008 | Accommodating explicit user expressions of uncertainty in voice search or something like that. Tim Paek, Yun-Cheng Ju |
| 2008 | Acoustic analysis of imitated voice produced by a professional impersonator. Tatsuya Kitamura |
| 2008 | Acoustic cues for the perception of intonation in Cantonese. Joan K.-Y. Ma, Valter Ciocca, Tara L. Whitehill |
| 2008 | Acoustic event classification using a distributed microphone network with a GMM/SVM combined algorithm. Christian Zieger, Maurizio Omologo |
| 2008 | Acoustic modeling based on model structure annealing for speech recognition. Sayaka Shiota, Kei Hashimoto, Heiga Zen, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda |
| 2008 | Acoustic-phonetic approach for automatic evaluation of spoken grammar. Om Deshmukh, Ashish Verma |
| 2008 | Adaptive HMM topology for speech recognition. Chuan-Wei Ting, Kuo-Yuan Lee, Jen-Tzung Chien |
| 2008 | Adaptive beamforming and soft missing data decoding for robust speech recognition in reverberant environments. Marco Kühne, Roberto Togneri, Sven Nordholm |
| 2008 | Adaptive decision tree-based phone cluster models for speaker clustering. Chia-Hsin Hsieh, Chung-Hsien Wu, Han-Ping Shen |
| 2008 | Adaptive filter based prosody modification approach. Qingcai Chen, Shusen Zhou, Dandan Wang, Xiaohong Yang |
| 2008 | Adaptive training using discriminative mapping transforms. Chandra Kant Raut, Kai Yu, Mark J. F. Gales |
| 2008 | Adaptive-order fractional Fourier transform features for speech recognition. Hui Yin, Xiang Xie, Jingming Kuang |
| 2008 | Addressing database mismatch in forensic speaker recognition with Ahumada III: a public real-casework database in Spanish. Daniel Ramos, Joaquin Gonzalez-Rodriguez, Javier Gonzalez-Dominguez, Jose Juan Lucena-Molina |
| 2008 | Addressing the out-of-vocabulary problem for large-scale Chinese spoken term detection. Sha Meng, Jian Shao, Roger Peng Yu, Jia Liu, Frank Seide |
| 2008 | Advances in phonotactic language recognition. Ondrej Glembek, Pavel Matejka, Lukás Burget, Tomás Mikolov |
| 2008 | Advertisement detection in French broadcast news using acoustic repetition and Gaussian mixture models. Vishwa Gupta, Gilles Boulianne, Patrick Kenny, Pierre Dumouchel |
| 2008 | Agglomerative hierarchical speaker clustering using incremental Gaussian mixture cluster modeling. Kyu Jeong Han, Shrikanth S. Narayanan |
| 2008 | Aggregated cross-validation and its efficient application to Gaussian mixture optimization. Takahiro Shinozaki, Sadaoki Furui, Tatsuya Kawahara |
| 2008 | Aggregating distributed STT, MT, and information extraction engines: the GALE interoperability-demo system. John F. Pitrelli, Burn L. Lewis, Edward A. Epstein, Martin Franz, Daniel Kiecza, Jerome L. Quinn, Ganesh N. Ramaswamy, Amit Srivastava, Paola Virga |
| 2008 | Amplitude and amplitude variation of emotional speech. Hartmut R. Pfitzinger, Christian Kaernbach |
| 2008 | An ERP study on categorical perception of lexical tones and nonspeech pitches. Hongying Zheng, William S.-Y. Wang |
| 2008 | An acoustic typology of apraxic speech - toward reliable diagnosis. Jacqueline McKechnie, Kirrie J. Ballard, Donald A. Robin, Adam Jacks, Sallyanne Palethorpe, Kristin M. Rosen |
| 2008 | An acoustic-phonetic comparative analysis of Osaka and Kagoshima Japanese tonal phenomena. Shunichi Ishihara |
| 2008 | An algorithm for multi-pitch tracking in co-channel speech. Srikanth Vishnubhotla, Carol Y. Espy-Wilson |
| 2008 | An analysis of multimodal cues of interruption in dyadic spoken interactions. Chi-Chun Lee, Sungbok Lee, Shrikanth S. Narayanan |
| 2008 | An analysis of vocal tract shaping in English sibilant fricatives using real-time magnetic resonance imaging. Erik Bresch, Daylen Riggs, Louis M. Goldstein, Dani Byrd, Sungbok Lee, Shrikanth S. Narayanan |
| 2008 | An effective microphone array post-filter in arbitrary environments. Ning Cheng, Wenju Liu, Peng Li, Bo Xu |
| 2008 | An ellipsoid constrained quadratic programming perspective to discriminative training of HMMs. Peng Liu, Frank K. Soong |
| 2008 | An empirical analysis of word error rate and keyword error rate. Youngja Park, Siddharth Patwardhan, Karthik Visweswariah, Stephen C. Gates |
| 2008 | An entropy based feature for whisper-island detection within audio streams. Chi Zhang, John H. L. Hansen |
| 2008 | An estimation technique of style expressiveness for emotional speech using model adaptation based on multiple-regression HSMM. Takashi Nose, Yoichi Kato, Makoto Tachibana, Takao Kobayashi |
| 2008 | An evaluation of non-standard features for grapheme-to-phoneme conversion. Gabriel Webster, Norbert Braunschweiler |
| 2008 | An expert system in speaker verification task. Zbynek Zajíc, Lukás Machlica, Ales Padrta, Jan Vanek, Vlasta Radová |
| 2008 | An improved one-to-many eigenvoice conversion system. Yamato Ohtani, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano |
| 2008 | An instrumental measure for end-to-end speech transmission quality based on perceptual dimensions: framework and realization. Marcel Wältermann, Kirstin Scholz, Sebastian Möller, Lu Huo, Alexander Raake, Ulrich Heute |
| 2008 | An interval type-2 fuzzy logic system to translate between emotion-related vocabularies. Abe Kazemzadeh, Sungbok Lee, Shrikanth S. Narayanan |
| 2008 | An intuitive class discriminability measure for feature selection in a speech recognition system. Ladan Golipour, Douglas D. O'Shaughnessy |
| 2008 | An investigation of acoustic models for multilingual code-switching. Christopher M. White, Sanjeev Khudanpur, James K. Baker |
| 2008 | An objective singing evaluation approach by relating acoustic measurements to perceptual ratings. Chuan Cao, Ming Li, Jian Liu, Yonghong Yan |
| 2008 | An on-line adaptation technique for emotional speech recognition using style estimation with multiple-regression HMM. Yusuke Ijima, Makoto Tachibana, Takashi Nose, Takao Kobayashi |
| 2008 | Analysis and perception of speech under physical task stress. Keith W. Godin, John H. L. Hansen |
| 2008 | Analysis of drivers' speech in a car environment. Tomoyuki Kato, Jun Okamoto, Makoto Shozakai |
| 2008 | Analysis of glottal stops in speech signals. B. Yegnanarayana, S. Rajendran, Hussien Seid Worku, N. Dhananjaya |
| 2008 | Analysis of impostor tests with high scores in NIST-SRE context. Salah Eddine Mezaache, Jean-François Bonastre, Driss Matrouf |
| 2008 | Analysis of physiologically-motivated signal processing for robust speech recognition. Yu-Hsiang Bosco Chiu, Richard M. Stern |
| 2008 | Analysis of relationship between impression of human-to-human conversations and prosodic change and its modeling. Ryota Nishimura, Norihide Kitaoka, Seiichi Nakagawa |
| 2008 | Analysis of subspace within-class covariance normalization for SVM-based speaker verification. Liang Lu, Yuan Dong, Xianyu Zhao, Jian Zhao, Chengyu Dong, Haila Wang |
| 2008 | Analysis of voice-quality features of speech that expresses "anger", "joy", and "sadness" uttered by radio actors and actresses. Shoichi Takeda, Yuuri Yasuda, Risako Isobe, Shogo Kiryu, Makiko Tsuru |
| 2008 | Anchor-model fusion for language recognition. Ignacio López-Moreno, Daniel Ramos, Joaquin Gonzalez-Rodriguez, Doroteo T. Toledano |
| 2008 | Anton: an animatronic model of a human tongue and vocal tract. Robin Hofe, Roger K. Moore |
| 2008 | Application and evaluation of speech technologies in language learning: experiments with the Saybot player. Sylvain Chevalier, Zhenhai Cao |
| 2008 | Application of weighted finite-state transducers to improve recognition accuracy for dysarthric speech. Santiago Omar Caballero Morales, Stephen J. Cox |
| 2008 | Applications of virtual-evidence based speech recognizer training. Amarnag Subramanya, Jeff A. Bilmes |
| 2008 | Applying pitch-dependent difference detection and modification to emotional speaker recognition. Ting Huang, Yingchun Yang |
| 2008 | Articulatory control of HMM-based parametric speech synthesis driven by phonetic knowledge. Zhen-Hua Ling, Korin Richmond, Junichi Yamagishi, Ren-Hua Wang |
| 2008 | Aspects of pharyngealized phonemes in Arabic using articulography. Slim Ouni |
| 2008 | Assessing agreement of observer- and self-annotations in spontaneous multimodal emotion data. Khiet P. Truong, Mark A. Neerincx, David A. van Leeuwen |
| 2008 | Assessment of correlation between objective measures and speech recognition performance in the evaluation of speech enhancement. Pei Ding, Jie Hao |
| 2008 | Assessment of objective quality measures for speech intelligibility. Wei Ming Liu, Keith A. Jellyman, Nicholas W. D. Evans, John S. D. Mason |
| 2008 | Assessment of the speech-quality dimension "noisiness" for the instrumental estimation and analysis of telephone-band speech quality. Kirstin Scholz, Christine Kühnel, Marcel Wältermann, Sebastian Möller, Ulrich Heute |
| 2008 | Assigning suitable phrasal tones and pitch accents by sensing affective information from text to synthesize human-like speech. Shaikh Mostafa Al Masum, M. Khademul Islam Molla, Keikichi Hirose |
| 2008 | Audio indexing for an interactive Italian literature management system. Carlo Drioli, Piero Cosi |
| 2008 | Audio-visual multilevel fusion for speech and speaker recognition. Girija Chetty, Michael Wagner |
| 2008 | Auditory-based formant estimation in noise using a probabilistic framework. Claudius Gläser, Martin Heckmann, Frank Joublin, Christian Goerick |
| 2008 | Automatic accent classification using ensemble methods. Fukun Bi, Jian Yang, Dan Xu |
| 2008 | Automatic children's reading tutor on hand-held devices. Xiaolong Li, Li Deng, Yun-Cheng Ju, Alex Acero |
| 2008 | Automatic customer feedback processing: alarm detection in open question spoken messages. Nathalie Camelin, Géraldine Damnati, Frédéric Béchet, Renato De Mori |
| 2008 | Automatic detection of the context of acoustic landmark deletion. Nanette Veilleux, Stefanie Shattuck-Hufnagel |
| 2008 | Automatic estimation of language model parameters for unseen words using morpho-syntactic contextual information. Ciro Martins, António J. S. Teixeira, João Paulo Neto |
| 2008 | Automatic evaluation of characteristic speech disorders in children with cleft lip and palate. Andreas K. Maier, Florian Hönig, Christian Hacker, Maria Schuster, Elmar Nöth |
| 2008 | Automatic generation and pruning of phonetic mispronunciations to support computer-aided pronunciation training. Lan Wang, Xin Feng, Helen M. Meng |
| 2008 | Automatic lip synchronization by speech signal analysis. Goranka Zoric, Aleksandra Cerekovic, Igor S. Pandzic |
| 2008 | Automatic pitch-synchronous phonetic segmentation. Jindrich Matousek, Jan Romportl |
| 2008 | Automatic pronunciation evaluation and classification. Om Deshmukh, Sachindra Joshi, Ashish Verma |
| 2008 | Automatic pronunciation evaluation of language learners' utterances generated through shadowing. Dean Luo, Naoya Shimomura, Nobuaki Minematsu, Yutaka Yamauchi, Keikichi Hirose |
| 2008 | Automatic recognition of anger in spontaneous speech. Daniel Neiberg, Kjell Elenius |
| 2008 | Automatic speech recognition for scientific purposes - webASR. Thomas Hain, Asmaa El Hannani, Stuart N. Wrigley, Vincent Wan |
| 2008 | Automatic word stress marking and syllabification for Catalan TTS. Silvia Rustullet, Daniela Braga, João Nogueira, Miguel Sales Dias |
| 2008 | Automatic-type calibration of traditionally derived likelihood ratios: forensic analysis of australian English /o/ formant trajectories. Geoffrey Stewart Morrison, Yuko Kinoshita |
| 2008 | Automatically learning speaker-independent acoustic subword units. Balakrishnan Varadarajan, Sanjeev Khudanpur |
| 2008 | BUT language recognition system for NIST 2007 evaluations. Pavel Matejka, Lukás Burget, Ondrej Glembek, Petr Schwarz, Valiantsina Hubeika, Michal Fapso, Tomás Mikolov, Oldrich Plchot, Jan Cernocký |
| 2008 | Babble speech: acoustic and perceptual variability. Nitish Krishnamurthy, Ayako Ikeno, John H. L. Hansen |
| 2008 | Backward Viterbi beam search for utilizing dynamic task complexity information. Min Tang, Philippe Di Cristo |
| 2008 | Bag-of-word normalized n-gram models. Abhinav Sethy, Bhuvana Ramabhadran |
| 2008 | Balancing spoken content adaptation and unit length in the recognition of emotion and interest. Bogdan Vlasenko, Björn W. Schuller, Kinfe Tadesse Mengistu, Gerhard Rigoll, Andreas Wendemuth |
| 2008 | Bayesian context clustering using cross valid prior distribution for HMM-based speech recognition. Kei Hashimoto, Heiga Zen, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda |
| 2008 | Bayesian latent topic clustering model. Meng-Sung Wu, Jen-Tzung Chien |
| 2008 | Better nonnative intonation scores through prosodic theory. Joseph Tepperman, Shrikanth S. Narayanan |
| 2008 | Beyond frame independence: parametric modelling of time duration in speaker and language recognition. Alan McCree, Fred Richardson, Elliot Singer, Douglas A. Reynolds |
| 2008 | Beyond linear transforms: efficient non-linear dynamic adaptation for noise robust speech recognition. Steven J. Rennie, Pierre L. Dognin |
| 2008 | Bi-Gaussian score equalization in an audio-visual SVM-based person verification system. Pascual Ejarque, Javier Hernando |
| 2008 | Blind dereverberation based on CMN and spectral subtraction by multi-channel LMS algorithm. Longbiao Wang, Seiichi Nakagawa, Norihide Kitaoka |
| 2008 | Building and combining document and music spaces for music query-by-webpage system. Ryoei Takahashi, Yasunori Ohishi, Norihide Kitaoka, Kazuya Takeda |
| 2008 | Building sleek synthesizers for multi-lingual screen reader. E. Veera Raghavendra, B. Yegnanarayana, Alan W. Black, Kishore Prahallad |
| 2008 | CENSREC-4: development of evaluation framework for distant-talking speech recognition under reverberant environments. Masato Nakayama, Takanobu Nishiura, Yuki Denda, Norihide Kitaoka, Kazumasa Yamamoto, Takeshi Yamada, Satoru Tsuge, Chiyomi Miyajima, Masakiyo Fujimoto, Tetsuya Takiguchi, Satoshi Tamura, Tetsuji Ogawa, Shigeki Matsuda, Shingo Kuroiwa, Kazuya Takeda, Satoshi Nakamura |
| 2008 | Can audio-visual instructions help learners improve their articulation? - an ultrasound study of short term changes. Olov Engwall |
| 2008 | Can visualization of internal articulators support speech perception? Preben Wik, Olov Engwall |
| 2008 | Can you "read tongue movements"? Pierre Badin, Yuliya Tarabalka, Frédéric Elisei, Gérard Bailly |
| 2008 | Cascading appearance-based features for visual speaker verification. David Dean, Sridha Sridharan, Patrick Lucey |
| 2008 | Central vowels in Arrernte: metrical prominence and pitch accent. Marija Tabain, Kristine Rickard, Gavan Breen, Veronica Dobson |
| 2008 | Cepstral domain voice activity detection for improved noise modeling in MMSE feature enhancement for ASR. Svein Gunnar Pettersen, Magne Hallstein Johnsen |
| 2008 | Characterizing speech utterances for speaker verification with sequence kernel SVM. Kong-Aik Lee, Changhuai You, Haizhou Li, Tomi Kinnunen, Donglai Zhu |
| 2008 | Class lecture summarization taking into account consecutiveness of important sentences. Yasuhisa Fujii, Kazumasa Yamamoto, Norihide Kitaoka, Seiichi Nakagawa |
| 2008 | Class-based statistical machine translation for field maintainable speech-to-speech translation. Ian R. Lane, Alex Waibel |
| 2008 | Clustering initialization based on spatial information for speaker diarization of meetings. Jordi Luque, Carlos Segura, Javier Hernando |
| 2008 | Coarticulation in nasal and lateral clusters in Warlpiri. Janet Fletcher, Deborah Loakes, Andrew Butcher |
| 2008 | Cochannel speech separation using multi-pitch estimation and model based voiced sequential grouping. Ming Li, Chuan Cao, Di Wang, Ping Lu, Qiang Fu, Yonghong Yan |
| 2008 | Combination method of bone-conduction speech and air-conduction speech for speaker recognition. Satoru Tsuge, Takashi Osanai, Hisanori Makinae, Toshiaki Kamada, Minoru Fukumi, Shingo Kuroiwa |
| 2008 | Combination of clean and contaminated GMM/SVM for far-field text-independent speaker verification. Christian Zieger, Maurizio Omologo |
| 2008 | Combining continuous progressive model adaptation and factor analysis for speaker verification. Mitchell McLaren, Driss Matrouf, Robbie Vogt, Jean-François Bonastre |
| 2008 | Combining evidence from a generative and a discriminative model in phoneme recognition. Joel Pinto, Hynek Hermansky |
| 2008 | Combining neural network and rule-based systems for dysarthria diagnosis. James Carmichael, Vincent Wan, Phil D. Green |
| 2008 | Combining noise compensation and missing-feature decoding for large vocabulary speech recognition in noise. Jianhua Lu, Ji Ming, Roger F. Woods |
| 2008 | Combining statistical and syntactical systems for spoken language understanding with graphical models. Stefan Schwärzler, Jürgen T. Geiger, Joachim Schenk, Marc A. Al-Hames, Benedikt Hörnler, Günther Ruske, Gerhard Rigoll |
| 2008 | Combining task-dependent information with auditory attention cues for prominence detection in speech. Ozlem Kalinli, Shrikanth S. Narayanan |
| 2008 | Comparative evaluation of different methods for voice activity detection. Hongfei Ding, Koichi Yamamoto, Masami Akamine |
| 2008 | Comparing prosodic models for speaker recognition. Cheung-Chi Leung, Marc Ferras, Claude Barras, Jean-Luc Gauvain |
| 2008 | Comparing text-driven and speech-driven visual speech synthesisers. Barry-John Theobald, Gavin C. Cawley, J. Andrew Bangham, Iain A. Matthews, Nicholas Wilkinson |
| 2008 | Comparing word, character, and phoneme n-grams for subjective utterance recognition. Theresa Wilson, Stephan Raaijmakers |
| 2008 | Comparison of AM-FM based features for robust speech recognition. K. V. S. Narayana, T. V. Sreenivas |
| 2008 | Comparison of input and feature space nonlinear kernel nuisance attribute projections for speaker verification. Xianyu Zhao, Yuan Dong, Jian Zhao, Liang Lu, Jiqing Liu, Haila Wang |
| 2008 | Comparison of variable selection methods and classifiers for native accent identification. Tingyao Wu, Peter Karsmakers, Hugo Van hamme, Dirk Van Compernolle |
| 2008 | Computational language acquisition by statistical bottom-up processing. Okko Johannes Räsänen, Unto K. Laine, Toomas Altosaar |
| 2008 | Confusion-based entropy-weighted decoding for robust speech recognition. Yi Chen, Chia-Yu Wan, Lin-Shan Lee |
| 2008 | Connected speech processes in Warlpiri. John Ingram, Mary Laughren, Jeff Chapman |
| 2008 | Consonant discrimination of degraded speech using an efferent-inspired closed-loop cochlear model. David P. Messing, Lorraine Delhorne, Ed Bruckert, Louis D. Braida, Oded Ghitza |
| 2008 | Consonant enhancement in Lamalama, an initial-dropping language of Cape York Peninsula, North Queensland. Christina Pentland |
| 2008 | Context dependent language model adaptation. Xunying Liu, Mark J. F. Gales, Philip C. Woodland |
| 2008 | Context-dependent phone models and models adaptation for phonotactic language recognition. Mohamed Faouzi BenZeghiba, Jean-Luc Gauvain, Lori Lamel |
| 2008 | Context-sensitive probabilistic phone mapping model for cross-lingual speech recognition. Khe Chai Sim, Haizhou Li |
| 2008 | Continuous phone recognition without target language training data. Dau-Cheng Lyu, Sabato Marco Siniscalchi, Tae-Yoon Kim, Chin-Hui Lee |
| 2008 | Continuous pose-invariant lipreading. Patrick Lucey, Sridha Sridharan, David Dean |
| 2008 | Contrastive utterances make alternatives salient - cross-modal priming evidence. Bettina Braun, Lara Tagliapietra, Anne Cutler |
| 2008 | Control of prosodic focus in corpus-based generation of fundamental frequency based on the generation process model. Keiko Ochi, Keikichi Hirose, Nobuaki Minematsu |
| 2008 | Convergence between SVM-based and distance-based paradigms for speaker recognition. Delphine Charlet, Xianyu Zhao, Yuan Dong |
| 2008 | Correlation of utterance length and segmental duration in Finnish is questionable. Jussi Hakokari, Tuomo Saarni, Jouni Isoaho, Tapio Salakoski |
| 2008 | Correspondence of perception and production boundaries between single and geminate stops in Japanese. Shigeaki Amano, Yukari Hirata |
| 2008 | Covariance modelling for noise-robust speech recognition. Rogier C. van Dalen, Mark J. F. Gales |
| 2008 | Covariance updates for discriminative training by constrained line search. Peter Bell, Simon King |
| 2008 | Covariations of English segmental durations across speakers. Jiahong Yuan |
| 2008 | Cross-dialect Irish prosody: linguistic constraints on Fujisaki modelling. Maria O'Reilly, Ailbhe Ní Chasaide, Christer Gobl |
| 2008 | Cross-language study of vocal correlates of affective states. Irena Yanushevskaya, Ailbhe Ní Chasaide, Christer Gobl |
| 2008 | Cross-lingual portability of MLP-based tandem features - a case study for English and Hungarian. László Tóth, Joe Frankel, Gábor Gosztolya, Simon King |
| 2008 | Cross-lingual sentence extraction for information distillation. Adish Kumar Singla, Dilek Hakkani-Tür |
| 2008 | Crosscorrelation of adjacent spectra enhances fundamental frequency tracking. Philippe Martin |
| 2008 | Czech-to-slovak adapted broadcast news transcription system. Jan Nouza, Jan Silovský, Jindrich Zdánský, Petr Cerva, Martin Kroul, Josef Chaloupka |
| 2008 | DC-constrained linear prediction for glottal inverse filtering. Paavo Alku, Carlo Magi, Tom Bäckström |
| 2008 | DISCO: development and integration of speech technology into courseware for language learning. Catia Cucchiarini, Joost van Doremalen, Helmer Strik |
| 2008 | Data selection and smoothing in an open-source system for the 2008 NIST machine translation evaluation. Holger Schwenk, Yannick Estève |
| 2008 | Data-driven clustered hierarchical tandem system for LVCSR. Shuo-Yiin Chang, Lin-Shan Lee |
| 2008 | Dealing with limited and noisy data in ASR: a hybrid knowledge-based and statistical approach. Abeer Alwan |
| 2008 | Decision tree based frame mode selection for AMR-WB+. Jong Kyu Kim, Seung Seop Park, Chang Woo Han, Nam Soo Kim |
| 2008 | Decoding-time prediction of non-verbalized punctuation. Anoop Deoras, Jürgen Fritsch |
| 2008 | Decomposition of rotational distortion caused by VTL difference using eigenvalues of its transformation matrix. Daisuke Saito, Nobuaki Minematsu, Keikichi Hirose |
| 2008 | Dependency parsing of Japanese spoken monologue based on clause-starts detection. Tomohiro Ohno, Shigeki Matsubara, Hideki Kashioka, Yasuyoshi Inagaki |
| 2008 | Design and formulation for speech interface based on flexible shortcuts. Teppei Nakano, Tomoyuki Kumai, Tetsunori Kobayashi, Yasushi Ishikawa |
| 2008 | Designing a massively multiplayer online role-playing game around text-to-speech. Mike Rozak |
| 2008 | Detection of acoustic events in interactive seminar data with temporal overlaps. Andrey Temko, Climent Nadeu |
| 2008 | Detection of feeling through back-channels in spoken dialogue. Tatsuya Kawahara, Masayoshi Toyokura, Teruhisa Misu, Chiori Hori |
| 2008 | Detection of repetitions in spontaneous speech in dialogue sessions. Mert Cevik, Fuliang Weng, Chin-Hui Lee |
| 2008 | Detection of security related affect and behaviour in passenger transport. Björn W. Schuller, Matthias Wimmer, Dejan Arsic, Tobias Moosmayr, Gerhard Rigoll |
| 2008 | Detection of speech embedded in real acoustic background based on amplitude modulation spectrogram features. Jörn Anemüller, Denny Schmidt, Jörg-Hendrik Bach |
| 2008 | Detection of speech under physical stress: model development, sensor selection, and feature fusion. Sanjay A. Patil, John H. L. Hansen |
| 2008 | Development and evaluation of Polish speech corpus for unit selection speech synthesis systems. Grazyna Demenko, Jolanta Bachan, Bernd Möbius, Katarzyna Klessa, Marcin Szymanski, Stefan Grocholewski |
| 2008 | Development and evaluation of hands-free spoken dialogue system for railway station guidance. Hiroshi Saruwatari, Yu Takahashi, Hiroyuki Sakai, Shota Takeuchi, Tobias Cincarek, Hiromichi Kawanami, Kiyohiro Shikano |
| 2008 | Development of SRI's translation systems for broadcast news and broadcast conversations. Jing Zheng, Wen Wang, Necip Fazil Ayan |
| 2008 | Development of communicative skills in 8- to 16-month-old children: a longitudinal study. Eeva Klintfors, Ulla Sundberg, Francisco Lacerda, Ellen Marklund, Lisa Gustavsson, Ulla Bjursäter, Iris-Corinna Schwarz, Göran Söderlund |
| 2008 | Development of the SRI/nightingale Arabic ASR system. Dimitra Vergyri, Arindam Mandal, Wen Wang, Andreas Stolcke, Jing Zheng, Martin Graciarena, David Rybach, Christian Gollan, Ralf Schlüter, Katrin Kirchhoff, Arlo Faria, Nelson Morgan |
| 2008 | Development of the primary CRIM system for the NIST 2008 speaker recognition evaluation. Patrick Kenny, Najim Dehak, Pierre Ouellet, Vishwa Gupta, Pierre Dumouchel |
| 2008 | Development of tone perception and tone production in Cantonese-learning children aged 2 to 5 years. Valter Ciocca, Vivian W.-K. Ip |
| 2008 | Dialect classification via discriminative training. Yun Lei, John H. L. Hansen |
| 2008 | Dialect recognition using adapted phonetic models. Wade Shen, Nancy F. Chen, Douglas A. Reynolds |
| 2008 | Dialect separation assessment using log-likelihood score distributions. Mahnoosh Mehrabani, John H. L. Hansen |
| 2008 | Dialog management using weighted finite-state transducers. Chiori Hori, Kiyonori Ohtake, Teruhisa Misu, Hideki Kashioka, Satoshi Nakamura |
| 2008 | Different roles of pitch and duration in distinguishing word stress in English. Jiahong Yuan, Stephen Isard, Mark Y. Liberman |
| 2008 | Dimensionality reduction of modulation frequency features for speech discrimination. Maria E. Markaki, Yannis Stylianou |
| 2008 | Discourse prosody context - global F0 and tempo modulations. Chiu-yu Tseng, Zhao-yu Su |
| 2008 | Discovering phrases in machine translation by simulated annealing. Caroline Lavecchia, David Langlois, Kamel Smaïli |
| 2008 | Discrimination of task-related words for vocabulary design of spoken dialog systems. Akinori Ito, Toyomi Meguro, Shozo Makino, Motoyuki Suzuki |
| 2008 | Discriminative classifiers with generative kernels for noise robust ASR. Mark J. F. Gales, Chris Longworth |
| 2008 | Discriminative graph training for ultra-fast low-footprint speech indexing. Upendra V. Chaudhari, Hong-Kwang Jeff Kuo, Brian Kingsbury |
| 2008 | Discriminative model combination and language model selection in a reading tutor for children. Abdurrahman Samir, Jacques Duchateau, Hugo Van hamme |
| 2008 | Discriminative n-gram language modeling for Turkish. Ebru Arisoy, Brian Roark, Izhak Shafran, Murat Saraclar |
| 2008 | Discriminative rescoring based on minimization of word errors for transcribing broadcast news. Akio Kobayashi, Takahiro Oku, Shinichi Homma, Shoei Sato, Toru Imai, Tohru Takagi |
| 2008 | Discriminative training and channel compensation for acoustic language recognition. Valiantsina Hubeika, Lukás Burget, Pavel Matejka, Petr Schwarz |
| 2008 | Discriminative training for complementariness in system combination. Daniel Willett, Chuang He |
| 2008 | Discriminative training of variable-parameter HMMs for noise robust speech recognition. Dong Yu, Li Deng, Yifan Gong, Alex Acero |
| 2008 | Discriminative training using the trusted expectation maximization. Yasser Hifny, Yuqing Gao |
| 2008 | Discrimininative training of narrow band - wide band adapted systems for meeting recognition. Martin Karafiát, Lukás Burget, Thomas Hain, Jan Cernocký |
| 2008 | Distinctive feature fusion for recognition of australian English consonants. Trent W. Lewis, David M. W. Powers |
| 2008 | Do English speakers assimilate Mandarin tones to English prosodic categories? Connie K. So, Catherine T. Best |
| 2008 | Do discourse cues facilitate recall in information presentation messages? Andi Winterboer, Johanna D. Moore, Fernanda Ferreira |
| 2008 | Does the Mcgurk effect rely on processing time constraints? Christian Kroos, Ashlie Dreves |
| 2008 | Domain-specific classification methods for disfluency detection. Sebastian Germesin, Tilman Becker, Peter Poller |
| 2008 | Duration and F0 interval of utterance-final intonation contours in the perception of German sentence modality. Benno Peters, Hartmut R. Pfitzinger |
| 2008 | Duration refinement by jointly optimizing state and longer unit likelihood. Boyang Gao, Yao Qian, Zhizheng Wu, Frank K. Soong |
| 2008 | DySANA: dynamic speech and noise adaptation for voice activity detection. Ron J. Weiss, Trausti T. Kristjansson |
| 2008 | Dysarthric speech database for universal access research. Heejin Kim, Mark Hasegawa-Johnson, Adrienne Perlman, Jon R. Gunderson, Thomas S. Huang, Kenneth L. Watkin, Simone Frame |
| 2008 | Dysphonic voices and the 0-3000 hz frequency band. Gilles Pouchoulin, Corinne Fredouille, Jean-François Bonastre, Alain Ghio, Antoine Giovanni |
| 2008 | Effect of compressing the dynamic range of the power spectrum in modulation filtering based speech enhancement. James G. Lyons, Kuldip K. Paliwal |
| 2008 | Effective acoustic adaptation for a distant-talking interactive TV system. Jing Huang, Mark Epstein, Marco Matassoni |
| 2008 | Effects of allophones on the performance of Korean speech recognition. Hyejin Hong, Sunhee Kim, Minhwa Chung |
| 2008 | Effects of intonational phrase boundaries on pitch-accented syllables in american English. Yen-Liang Shue, Stefanie Shattuck-Hufnagel, Markus Iseli, Sun-Ah Jun, Nanette Veilleux, Abeer Alwan |
| 2008 | Effects of user modeling on POMDP-based dialogue systems. Dongho Kim, Hyeong Seop Sim, Kee-Eung Kim, Jin Hyung Kim, Hyunjeong Kim, Joo Won Sung |
| 2008 | Effects of vocal effort and speaking style on text-independent speaker verification. Elizabeth Shriberg, Martin Graciarena, Harry Bratt, Andreas Kathol, Sachin S. Kajarekar, Huda Jameel, Colleen Richey, Fred Goodman |
| 2008 | Efficient handwriting correction of speech recognition errors with template constrained posterior (TCP). Lijuan Wang, Tao Hu, Peng Liu, Frank K. Soong |
| 2008 | Efficient join cost computation for unit selection based TTS systems. Feng Ding, Jani Nurminen, Jilei Tian |
| 2008 | Efficient representation of throat microphone speech. K. Sri Rama Murty, Saurav Khurana, Yogendra Umesh Itankar, M. R. Kesheorey, B. Yegnanarayana |
| 2008 | Eigen-MLLR environment/speaker compensation for robust speech recognition. Yuan-Fu Liao, Hung-Hsiang Fang, Chi-Hui Hsu |
| 2008 | Eigen-channel compensation and discriminatively trained Gaussian mixture models for dialect and accent recognition. Pedro A. Torres-Carrasquillo, Douglas E. Sturim, Douglas A. Reynolds, Alan McCree |
| 2008 | Emotion conversion using F0 segment selection. Zeynep Inanoglu, Steve J. Young |
| 2008 | Emotion recognition in spontaneous emotional speech for anonymity-protected voice chat systems. Yoshiko Arimoto, Hiromi Kawatsu, Sumio Ohno, Hitoshi Iida |
| 2008 | Emotions and articulatory precision. Martijn Goudbeek, Jean-Philippe Goldman, Klaus R. Scherer |
| 2008 | Energy and entropy based switching algorithm for speech endpoint detection in varying SNR conditions. Krishna Chaitanya, Rohit Sinha |
| 2008 | English word stress as produced by English and dutch speakers: the role of segmental and suprasegmental differences. Bettina Braun, Kristin Lemhofer, Anne Cutler |
| 2008 | Enhancement of noisy speech recordings via blind source separation. Jirí Málek, Zbynek Koldovský, Jindrich Zdánský, Jan Nouza |
| 2008 | Environment mismatch compensation using average eigenspace for speech recognition. Abhishek Kumar, John H. L. Hansen |
| 2008 | Estimation of children's reading ability by fusion of automatic pronunciation verification and fluency detection. Matthew Black, Joseph Tepperman, Sungbok Lee, Shrikanth S. Narayanan |
| 2008 | Estimation of vocal tract area function for Mandarin vowel sequences using MRI. Gaowu Wang, Jianwu Dang, Jiangping Kong |
| 2008 | Evaluating semantic-level confidence scores with multiple hypotheses. Blaise Thomson, Kai Yu, Milica Gasic, Simon Keizer, François Mairesse, Jost Schatzmann, Steve J. Young |
| 2008 | Evaluating spoken language model based on filler prediction model in speech recognition. Kengo Ohta, Masatoshi Tsuchiya, Seiichi Nakagawa |
| 2008 | Evaluation of Finnish unit selection and HMM-based speech synthesis. Hanna Silén, Elina Helander, Jani Nurminen, Moncef Gabbouj |
| 2008 | Evaluation of a live broadcast news subtitling system for portuguese. Hugo Meinedo, Márcio Viveiros, João Paulo Neto |
| 2008 | Evaluation of modulation spectrum equalization techniques for large vocabulary robust speech recognition. Liang-Che Sun, Chang-Wen Hsu, Lin-Shan Lee |
| 2008 | Evaluation of speaking-aid system with voice conversion for laryngectomees toward its use in practical environments. Keigo Nakamura, Tomoki Toda, Yoshitaka Nakajima, Hiroshi Saruwatari, Kiyohiro Shikano |
| 2008 | Evaluation of voice activity and voicing detection. Bojan Kotnik, Pierre Sendorek, Sergey Astrov, Turgay Koç, Tolga Çiloglu, Laura Docío Fernández, Eduardo Rodríguez Banga, Harald Höge, Zdravko Kacic |
| 2008 | Evidence of a near-merger in western sydney australian English vowels. Rikke L. Bundgaard-Nielsen, Catherine T. Best, Michael D. Tyler, Christian Kroos |
| 2008 | Evidence of coarticulation in a phonological feature detection system. Abhijeet Sangwan, Ayako Ikeno, John H. L. Hansen |
| 2008 | Examining pitch-accent variability from an exemplar-theoretic perspective. Michael Walsh, Katrin Schweitzer, Bernd Möbius, Hinrich Schütze |
| 2008 | Expanding vocabulary for recognizing user's abbreviations of proper nouns without increasing ASR error rates in spoken dialogue systems. Masaki Katsumaru, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno |
| 2008 | Experimental evaluation of multi-band position-pitch estimation (m-popi) algorithm for multi-speaker localization. Tania Habib, Lukas Ottowitz, Marián Képesi |
| 2008 | Experiments with the ABI (accents of the british isles) speech corpus. Shona D'Arcy, Martin J. Russell |
| 2008 | Exploiting spatial-temporal feature distribution characteristics for robust speech recognition. Wei-Hau Chen, Shih-Hsiang Lin, Berlin Chen |
| 2008 | Exploiting the ASR n-best by tracking multiple dialog state hypotheses. Jason D. Williams |
| 2008 | Exploring a mechanism of speech sychronization using auditory delayed experiments. Masato Ishizaki, Yasuharu Den, Senshi Fukashiro |
| 2008 | Exploring classification techniques in speech based cognitive load monitoring. Bo Yin, Natalie Ruiz, Fang Chen, Eliathamby Ambikairajah |
| 2008 | Exploring the Uncanny Valley Effect with talking heads. Takaaki Kuratate, Kathryn Ayers, Jeesun Kim, Denis Burnham |
| 2008 | Extended partial distance elimination and dynamic Gaussian selection for fast likelihood computation. Ghazi Bouselmi, Jun Cai |
| 2008 | Extensibility verification of robust domain selection against out-of-grammar utterances in multi-domain spoken dialogue system. Satoshi Ikeda, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno |
| 2008 | Extracting word-pronunciation pairs from comparable set of text and speech. Tetsuro Sasada, Shinsuke Mori, Tatsuya Kawahara |
| 2008 | Extraction and tracking of formant response jitter in the cochlea for objective prediction of SB/SF DAM attributes. Wenliang Lu, Deep Sen |
| 2008 | FM features for automatic forensic speaker recognition. Tharmarajah Thiruvaran, Eliathamby Ambikairajah, Julien Epps |
| 2008 | Factor analysis multi-session training constraint in session compensation for speaker verification. Driss Matrouf, Jean-François Bonastre, Salah Eddine Mezaache |
| 2008 | Factor analysis subspace estimation for speaker verification with short utterances. Robbie Vogt, Brendan Baker, Sridha Sridharan |
| 2008 | Factored translation models for enriching spoken language translation with prosody. Vivek Kumar Rangarajan Sridhar, Srinivas Bangalore, Shrikanth S. Narayanan |
| 2008 | Fast call-classification system development without in-domain training data. Christophe Servan, Frédéric Béchet |
| 2008 | Fast n-gram language model look-ahead for decoders with static pronunciation prefix trees. Marijn Huijbregts, Roeland Ordelman, Franciska de Jong |
| 2008 | Fast search for common segments in speech signals for speaker verification. Michael Gerber, Beat Pfister |
| 2008 | Fast speaker adaptive training for speech recognition. Daniel Povey, Hong-Kwang Jeff Kuo, Hagen Soltau |
| 2008 | Fast speech decoding through phone confusion networks. Nicola Bertoldi, Marcello Federico, Daniele Falavigna, Matteo Gerosa |
| 2008 | Feature adaptation of hearing-impaired lip shapes: the vowel case in the cued speech context. Noureddine Aboutabit, Denis Beautemps, Olivier Mathieu, Laurent Besacier |
| 2008 | Feature space transforms for Czech sign-language recognition. Jan Trmal, Marek Hrúz, Jan Zelinka, Pavel Campr, Ludek Müller |
| 2008 | Feature vector normalization with combined standard and throat microphones for robust ASR. Luis Buera, Antonio Miguel, Oscar Saz, Alfonso Ortega, Eduardo Lleida |
| 2008 | Features for automatic detection of voice bars in continuous speech. N. Dhananjaya, S. Rajendran, B. Yegnanarayana |
| 2008 | Filling acoustic holes through leveraged uncorellated GMMs for in-set/out-of-set speaker recognition. Jun-Won Suh, Pongtep Angkititrakul, John H. L. Hansen |
| 2008 | Finding two-level interpersonal context: proximity and conversation detection from personal audio feature data. Masayuki Okamoto, Naoki Iketani, Keisuke Nishimura, Masaaki Kikuchi, Kenta Cho, Masanori Hattori, Sougo Tsuboi |
| 2008 | Flexible discriminative training based on equal error group scores obtained from an error-indexed forward-backward algorithm. Erik McDermott, Atsushi Nakamura |
| 2008 | Foreign accent identification based on prosodic parameters. Marina Piat, Dominique Fohr, Irina Illina |
| 2008 | Forensic automatic speaker recognition: fiction or science? Joaquin Gonzalez-Rodriguez |
| 2008 | Forensic speaker recognition in Chinese: a multivariate likelihood ratio discrimination on /i/ and /y/. Cuiling Zhang, Geoffrey Stewart Morrison, Philip Rose |
| 2008 | Forensic speaker verification using formant features and Gaussian mixture models. Timo Becker, Michael Jessen, Catalin Grigoras |
| 2008 | Forward optimal modeling of acoustic confusions in Mandarin CALL system. Fengpei Ge, Fuping Pan, Changliang Liu, Bin Dong, Yonghong Yan |
| 2008 | Fragmented context-dependent syllable acoustic models. Kishan Thambiratnam, Frank Seide |
| 2008 | Frame-synchronous and local confidence measures for on-the-fly automatic speech recognition. Joseph Razik, Odile Mella, Dominique Fohr, Jean Paul Haton |
| 2008 | Frequency compression/transposition of fricative consonants for the hearing impaired with high-frequency dead regions. Francisco J. Fraga, Leticia P. Costa S. Prates, Maria Cecilia M. Iorio |
| 2008 | Frequency-domain parameter estimations for binary masked signals. Johan Xi Zhang, Mads Græsbøll Christensen, Joachim Dahl, Søren Holdt Jensen, Marc Moonen |
| 2008 | From 3-d speaker cloning to text-to-audiovisual-speech. Sascha Fagel, Frédéric Elisei, Gérard Bailly |
| 2008 | From domain specification to virtual humans: an integrated approach to authoring tactical questioning characters. Sudeep Gandhe, David DeVault, Antonio Roque, Bilyana Martinovski, Ron Artstein, Anton Leuski, Jillian Gerten, David R. Traum |
| 2008 | Front-end for far-field speech recognition based on frequency domain linear prediction. Sriram Ganapathy, Samuel Thomas, Hynek Hermansky |
| 2008 | Fusion of audio and video modalities for detection of acoustic events. Taras Butko, Andrey Temko, Climent Nadeu, Cristian Canton-Ferrer |
| 2008 | GOOSE on the move: a study of /u/-fronting in Australian news speech. Jennifer Price |
| 2008 | GPU accelerated acoustic likelihood computations. Patrick Cardinal, Pierre Dumouchel, Gilles Boulianne, Michel Comeau |
| 2008 | GPU-accelerated Gaussian clustering for fMPE discriminative training. Yu Shi, Frank Seide, Frank K. Soong |
| 2008 | Gammatone-domain model combination for consonant recognition in noisy environments. Jae Sam Yoon, Ji Hun Park, Hong Kook Kim |
| 2008 | Gender-related differences in the production and perception of emotion. Marc Swerts, Emiel Krahmer |
| 2008 | Generalization of extended baum-welch parameter estimation for discriminative training and decoding. Dimitri Kanevsky, Tara N. Sainath, Bhuvana Ramabhadran, David Nahamoo |
| 2008 | Generalized parametric spectral subtraction using weighted Euclidean distortion. Amit Das, John H. L. Hansen |
| 2008 | Generating intonation from a mixed CART-HMM model for speech synthesis. Cédric Boidin, Olivier Boëffard |
| 2008 | Generating natural F0 trajectory with additive trees. Yao Qian, Hui Liang, Frank K. Soong |
| 2008 | Genetic programming based optimization of class-dependent PCA for extracting robust MFCC. Houman Abbasian, Babak Nasersharif, Ahmad Akbari |
| 2008 | Getting the last laugh: automatic laughter segmentation in meetings. Mary Tai Knox, Nelson Morgan, Nikki Mirghafori |
| 2008 | Glottal spectral separation for parametric speech synthesis. João P. Cabral, Steve Renals, Korin Richmond, Junichi Yamagishi |
| 2008 | Goldman-hodgkin-katz cochlear hair cell models - a foundation for nonlinear cochlear mechanics. Matthew R. Flax, W. Harvey Holmes |
| 2008 | Group delay function for improved gender identification. Kye-Hwan Lee, Sang-Ick Kang, Ji-Hyun Song, Joon-Hyuk Chang |
| 2008 | Growing bottleneck features for tandem ASR. Joe Frankel, Dong Wang, Simon King |
| 2008 | HAC-models: a novel approach to continuous speech recognition. Hugo Van hamme |
| 2008 | HMM adaptation using statistical linear approximation for robust automatic speech recognition. Michael Berkovitch, Ilan D. Shallom |
| 2008 | HMM-based Finnish text-to-speech system utilizing glottal inverse filtering. Tuomo Raitio, Antti Suni, Hannu Pulakka, Martti Vainio, Paavo Alku |
| 2008 | HMM-based estimation of unreliable spectral components for noise robust speech recognition. Bengt J. Borgström, Abeer Alwan |
| 2008 | Hearing at home - communication support in home environments for hearing impaired persons. Jonas Beskow, Björn Granström, Peter Nordqvist, Samer Al Moubayed, Giampiero Salvi, Tobias Herzke, Arne Schulz |
| 2008 | High-level speaker verification via articulatory-feature based sequence kernels and SVM. Shi-Xiong Zhang, Man-Wai Mak |
| 2008 | High-performance low-latency speech recognition via multi-layered feature streaming and fast Gaussian computation. Liang Gu, Jian Xue, Xiaodong Cui, Yuqing Gao |
| 2008 | High-quality analysis/synthesis method based on temporal decomposition for speech modification. Binh Phu Nguyen, Takeshi Shibata, Masato Akagi |
| 2008 | Higher layer coding of non-speech like signals using factorial pulse codebook. Udar Mittal, James P. Ashley, Jonathan Gibbs |
| 2008 | Hilbert envelope based spectro-temporal features for phoneme recognition in telephone speech. Samuel Thomas, Sriram Ganapathy, Hynek Hermansky |
| 2008 | How can you use disfluencies and still sound as a good speaker? Helena Moniz, Ana Isabel Mata, Isabel Trancoso, Céu Viana |
| 2008 | How do the elderly talk to a natural language call routing system? Knut Kvale, Ragnhild Halvorsrud |
| 2008 | How many do we need? exploration of the population size effect on the performance of forensic speaker classification. Shunichi Ishihara, Yuko Kinoshita |
| 2008 | How useful are polynomials for analyzing intonation? Laura E. de Ruiter |
| 2008 | Human speech perception and feature extraction. Bryce E. Lobdell, Mark Hasegawa-Johnson, Jont B. Allen |
| 2008 | Human-like ears versus two-microphone array, which works better for speaker identification? Waleed H. Abdulla, Yushi Zhang |
| 2008 | ICA-based MAP speech enhancement with multiple variable speech distribution models. Xin Zou, Peter Jancovic, Münevver Köküer, Martin J. Russell |
| 2008 | IRSTLM: an open source toolkit for handling large scale language models. Marcello Federico, Nicola Bertoldi, Mauro Cettolo |
| 2008 | Identifying relevant phrases to summarize decisions in spoken meetings. Raquel Fernández, Matthew Frampton, John Dowding, Anish Adukuzhiyil, Patrick Ehlen, Stanley Peters |
| 2008 | Implementation and evaluation of fast on-the-fly WFST composition algorithms. Tasuku Oonishi, Paul R. Dixon, Koji Iwano, Sadaoki Furui |
| 2008 | Implicit state-tying for support vector machines based speech recognition. Daniel Bolaños, Wayne H. Ward |
| 2008 | Improved frame loss recovery using closed-loop estimation of very low bit rate side information. Philippe Gournay |
| 2008 | Improved large vocabulary Mandarin speech recognition by selectively using tone information with a two-stage prosodic model. Li-Wei Cheng, Lin-Shan Lee |
| 2008 | Improved novelty detection for online GMM based speaker diarization. Konstantin Markov, Satoshi Nakamura |
| 2008 | Improvement of eigenvoice-based speaker adaptation by parameter space clustering. Shutaro Tanji, Koichi Shinoda, Sadaoki Furui, Antonio Ortega |
| 2008 | Improvement to a NAM captured whisper-to-speech system. Viet-Anh Tran, Gérard Bailly, Hélène Loevenbruck, Christian Jutten |
| 2008 | Improving Japanese language models using POS information. Langzhou Chen, Hisayoshi Nagae, Matthew N. Stuttle |
| 2008 | Improving consonant identification in noise and reverberation by steady-state suppression as a preprocessing approach. Nao Hodoshima, Wataru Yoshida, Takayuki Arai |
| 2008 | Improving large scale alphanumeric string recognition using redundant information. Ea-Ee Jan, Osamuyimen Stewart, Raymond Co, David M. Lubensky |
| 2008 | Improving mispronunciation detection and diagnosis of learners' speech with context-sensitive phonological rules based on language transfer. Alissa M. Harrison, Wing Yiu Lau, Helen M. Meng, Lan Wang |
| 2008 | Improving preselection in unit selection synthesis. Alistair Conkie, Ann K. Syrdal, Yeon-Jun Kim, Marc C. Beutnagel |
| 2008 | Improving pronunciation modeling for non-native speech recognition. Tien Ping Tan, Laurent Besacier |
| 2008 | Improving searching speed and accuracy of query by humming system based on three methods: feature fusion, candidates set reduction and multiple similarity measurement rescoring. Lei Wang, Shen Huang, Sheng Hu, Jiaen Liang, Bo Xu |
| 2008 | Improving speech systems built from very little data. John Kominek, Sameer Badaskar, Tanja Schultz, Alan W. Black |
| 2008 | Improving the ensemble speaker and speaking environment modeling approach by enhancing the precision of the online estimation process. Yu Tsao, Chin-Hui Lee |
| 2008 | Improving the multigram algorithm by using lattices as input. Joris Driesen, Hugo Van hamme |
| 2008 | In search of models in speech communication research. Hiroya Fujisaki |
| 2008 | In-car speech recognition using model-based wiener filter and multi-condition training. Masanori Tsujikawa, Takayuki Arakawa, Ryosuke Isotani |
| 2008 | Including pitch accent optionality in unit selection text-to-speech synthesis. Leonardo Badino, Robert A. J. Clark, Volker Strom |
| 2008 | Incorporating acoustical modelling of phone transitions in an hybrid ANN/HMM speech recognizer. Alberto Abad, João Paulo Neto |
| 2008 | Incorporating durational modification in voice transformation. Arthur R. Toth, Alan W. Black |
| 2008 | Inductive and example-based learning for text classification. Ye-Yi Wang, Xiao Li, Alex Acero |
| 2008 | Infants' native and nonnative tone perception. Karen Mattock |
| 2008 | Influences on tone in Sepedi, a southern Bantu language. Sabine Zerbian, Etienne Barnard |
| 2008 | Inhibitory processes of Chinese spoken word recognition. Michael C. W. Yip |
| 2008 | Integrating rule and template-based approaches for emotional Malay speech synthesis. Mumtaz Begum, Raja Noor Ainon, Roziati Zainuddin, Zuraidah M. Don, Gerry Knowles |
| 2008 | Integration of TDOA features in information bottleneck framework for fast speaker diarization. Deepu Vijayasenan, Fabio Valente, Hervé Bourlard |
| 2008 | Integration of audiovisual speech and priming effects. Azra Nahid Ali |
| 2008 | Integration of metamodel and acoustic model for speech recognition. Hironori Matsumasa, Tetsuya Takiguchi, Yasuo Ariki, Ichao Li, Toshitaka Nakabayashi |
| 2008 | Intelligibility evaluation of Ramsey-derived interleavers for internet voice streaming with the iLBC codec. Angel M. Gomez, José L. Carmona, Antonio M. Peinado, Victoria E. Sánchez, José A. González |
| 2008 | Intentional voice command detection for completely hands-free speech interface in home environments. Yasunari Obuchi, Masahito Togami, Takashi Sumiyoshi |
| 2008 | Interrelationship between vocal effort and vocal tract acoustics: a pilot study. Maeva Garnier, Joe Wolfe, Nathalie Henrich, John Smith |
| 2008 | Intersession variability in speaker recognition: a behind the scene analysis. Daniel Garcia-Romero, Carol Y. Espy-Wilson |
| 2008 | Intonation modeling of Mandarin Chinese using a superpositional approach. Pablo Daniel Agüero, Antonio Bonafonte, Lu Yu, Juan Carlos Tulli |
| 2008 | Intonational phrases for speech summarization. Sameer Maskey, Andrew Rosenberg, Julia Hirschberg |
| 2008 | Intrinsic consonantal F0 perturbation in 3-way VOT contrast and its implications for aspiration-conditioned tonal split: evidence from Vietnamese. Michael J. Carne |
| 2008 | Introducing a FM based feature to hierarchical language identification. Bo Yin, Tharmarajah Thiruvaran, Eliathamby Ambikairajah, Fang Chen |
| 2008 | Introducing temporal asymmetries in feature extraction for automatic speech recognition. Garimella S. V. S. Sivaram, Hynek Hermansky |
| 2008 | Introducing the compression wave cochlear amplifier. Matthew R. Flax, W. Harvey Holmes |
| 2008 | Investigating festival's target cost function using perceptual experiments. Volker Strom, Simon King |
| 2008 | Investigating morphological decomposition for transcription of Arabic broadcast news and broadcast conversation data. Lori Lamel, Abdelkhalek Messaoudi, Jean-Luc Gauvain |
| 2008 | Investigating perception of places of articulation in sign and speech. Stina Ojala, Olli Aaltonen, Tapio Salakoski |
| 2008 | Investigations into phonological attribute classifier representations for CRF phone recognition. Prateeti Mohapatra, Eric Fosler-Lussier |
| 2008 | Is a speech recognizer useful for characteristic analysis of classroom lecture speech? Kenji Kobayashi, Mitsuhiro Somiya, Hiromitsu Nishizaki, Yoshihiro Sekiguchi |
| 2008 | Iterative language model estimation: efficient data structure & algorithms. Bo-June Paul Hsu, James R. Glass |
| 2008 | Joint Bayesian predictive classification and parallel model combination with prior scaling for robust ASR. Svein Gunnar Pettersen |
| 2008 | Joint time-frequency segmentation for transient decomposition. Charturong Tantibundhit, Gernot Kubin |
| 2008 | LIPS2008: visual speech synthesis challenge. Barry-John Theobald, Sascha Fagel, Gérard Bailly, Frédéric Elisei |
| 2008 | LTS using decision forest of regression trees and neural networks. Tanuja Sarkar, Sachin Joshi, Sathish Chandra Pammi, Kishore Prahallad |
| 2008 | Landmark based recognition of stops: acoustic attributes versus smoothed spectra. Veena Karjigi, Preeti Rao |
| 2008 | Language and genre detection in audio content analysis. Vikramjit Mitra, Daniel Garcia-Romero, Carol Y. Espy-Wilson |
| 2008 | Language experience dependent plasticity for pitch representation in the human brainstem. Ananthanarayan Krishnan, Jackson T. Gandour, Jayaganesh Swaminathan |
| 2008 | Language identification on code-switching utterances using multiple cues. Dau-Cheng Lyu, Ren-yuan Lyu |
| 2008 | Language model adaptation for a speech to sign language translation system using web frequencies and a MAP framework. Luis Fernando D'Haro, Rubén San Segundo, Ricardo de Córdoba, Jan Bungeroth, Daniel Stein, Hermann Ney |
| 2008 | Language modeling for speech recognition of spoken Cantonese. Yu Ting Yeung, Houwei Cao, Nengheng Zheng, Tan Lee, P. C. Ching |
| 2008 | Large margin multinomial mixture model for text categorization. Zhen-Yu Pan, Hui Jiang |
| 2008 | Learning essential speaker sub-space using hetero-associative neural networks for speaker clustering. Shajith Ikbal, Karthik Visweswariah |
| 2008 | Let's go lab: a platform for evaluation of spoken dialog systems with real world users. Maxine Eskénazi, Alan W. Black, Antoine Raux, Brian Langner |
| 2008 | Leveraging emotion detection using emotions from yes-no answers. Narjès Boufaden, Pierre Dumouchel |
| 2008 | Lexical analyses of native and non-native English language instructor speech based on a six-month co-taught classroom video corpus. Noriaki Katagiri, Goh Kawai |
| 2008 | Lexicon expansion using pronunciation variations extracted on the basis of speaker-related deviation in recognition error statistics. Yoshifumi Onishi |
| 2008 | Lightly supervised acoustic model training on EPPS recordings. Matthias Paulik, Alex Waibel |
| 2008 | Linear discriminant feature extraction using weighted classification confusion information. Hung-Shin Lee, Berlin Chen |
| 2008 | Lip synchronization: from phone lattice to PCA eigen-projections using neural networks. Samer Al Moubayed, Michaël De Smet, Hugo Van hamme |
| 2008 | Localization of multiple sound sources based on inter-channel correlation using a distributed microphone system. Kook Cho, Hajime Okumura, Takanobu Nishiura, Yoichi Yamashita |
| 2008 | Long-term spectro-temporal information for improved automatic speech emotion classification. Siqing Wu, Tiago H. Falk, Wai-Yip Chan |
| 2008 | Longitudinal study of ASR performance on ageing voices. Ravichander Vipperla, Steve Renals, Joe Frankel |
| 2008 | Low complexity near-optimal unit-selection algorithm for ultra low bit-rate speech coding based on n-best lattice and Viterbi search. V. Ramasubramanian, D. Harish |
| 2008 | Low-delay voice conversion based on maximum likelihood estimation of spectral parameter trajectory. Takashi Muramatsu, Yamato Ohtani, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano |
| 2008 | MAP and sub-word level t-norm for text-dependent speaker recognition. Doroteo T. Toledano, Daniel Hernández López, Cristina Esteve-Elizalde, Joaquin Gonzalez-Rodriguez, Rubén Fernández Pozo, Luis A. Hernández Gómez |
| 2008 | MASSY speaks English: adaptation and evaluation of a talking head. Sascha Fagel |
| 2008 | MDS-based visualization method for multiple speech corpora. Kimiko Yamakawa, Tomoko Matsui, Shuichi Itahashi |
| 2008 | MUESLI: multiple utterance error correction for a spoken language interface. Federico Cesari, Horacio Franco, Gregory K. Myers, Harry Bratt |
| 2008 | Machine translation in continuous space. Ruhi Sarikaya, Yonggang Deng, Mohamed Afify, Brian Kingsbury, Yuqing Gao |
| 2008 | Making confident speaker verification decisions with minimal speech. Robbie Vogt, Sridha Sridharan, Michael Mason |
| 2008 | Mandarin Chinese tone nucleus detection with landmarks. Siwei Wang, Gina-Anne Levow |
| 2008 | Mandarin connected digits recognition for whispered speech. Tingting Ru, Xiang Xie, Hui Yin, Jingming Kuang |
| 2008 | Mask estimation incorporating time-frequency trajectories for a CASA-based ASR front-end. Ji Hun Park, Jae Sam Yoon, Hong Kook Kim |
| 2008 | Masked speech priming: no priming in dense neighbourhoods. Chris Davis, Jeesun Kim, Angelo Barbaro |
| 2008 | Maximum a posteriori adaptation for many-to-one eigenvoice conversion. Daisuke Tani, Tomoki Toda, Yamato Ohtani, Hiroshi Saruwatari, Kiyohiro Shikano |
| 2008 | Maximum accept and reject (MARS) training of HMM-GMM speech recognition systems. Vivek Tyagi |
| 2008 | Maximum kurtosis beamforming with the generalized sidelobe canceller. Ken'ichi Kumatani, John W. McDonough, Barbara Rauch, Philip N. Garner, Weifeng Li, John Dines |
| 2008 | Maximum mutual information estimation with unlabeled data for phonetic classification. Jui-Ting Huang, Mark Hasegawa-Johnson |
| 2008 | Measuring speech quality impact on tasks performance. Virginie Durin, Laetitia Gros |
| 2008 | Mel-frequency cepstral coefficient-based bandwidth extension of narrowband speech. Amr H. Nour-Eldin, Peter Kabal |
| 2008 | Memo workbench for semi-automated usability testing. Klaus-Peter Engelbrecht, Michael Kruppa, Sebastian Möller, Michael Quade |
| 2008 | Methods to optimize transcription of on-line media. Sarah Conrod, Sara H. Basson, Dimitri Kanevsky |
| 2008 | Metric learning for unsupervised phoneme segmentation. Yu Qiao, Nobuaki Minematsu |
| 2008 | Min-max discriminative training of decoding parameters using iterative linear programming. Brian Mak, Tom Ko |
| 2008 | Minimal training based semantic categorization in a voice activated question answering (VAQA) system. Mithun Balakrishna, Marta Tatu, Dan I. Moldovan |
| 2008 | Minimum generation error training with direct log spectral distortion on LSPs for HMM-based speech synthesis. Yi-Jian Wu, Keiichi Tokuda |
| 2008 | Minimum phone error discriminative training for Mandarin Chinese speaker adaptation. Liang-Yu Chen, Chun-Jen Lee, Jyh-Shing Roger Jang |
| 2008 | Mispronunciation detection for Mandarin Chinese. Chao Huang, Feng Zhang, Frank K. Soong, Min Chu |
| 2008 | Missing-feature method for speaker recognition in band-restricted conditions. Wooil Kim, John H. L. Hansen |
| 2008 | Mobidic - a mobile dictation and notetaking application. Markku Turunen, Aleksi Melto, Anssi Kainulainen, Jaakko Hakulinen |
| 2008 | Modeling Austrian dialect varieties for TTS. Friedrich Neubarth, Michael Pucher, Christian Kranzler |
| 2008 | Modeling prior belief for speaker verification SVM systems. Luciana Ferrer |
| 2008 | Modeling the effects on time-into-utterance on word probabilities. Nigel G. Ward, Alejandro Vega |
| 2008 | Modelling fine-phonetic detail in a computational model of word recognition. Odette Scharenborg |
| 2008 | Modelling rapport in embodied conversational agents. Justine Cassell |
| 2008 | Modulation spectrogram features for improved speaker diarization. Oriol Vinyals, Gerald Friedland |
| 2008 | Monte Carlo model-space noise adaptation for speech recognition. Daniel Povey, Brian Kingsbury |
| 2008 | Multi-accent and accent-independent non-native speech recognition. Ghazi Bouselmi, Dominique Fohr, Irina Illina |
| 2008 | Multi-band and multi-cue analyses of disordered connected speech. Ali Alpan, Youri Maryn, Francis Grenez, Abdellah Kacha, Jean Schoentgen |
| 2008 | Multi-modal recording, analysis and indexing of poster sessions. Tatsuya Kawahara, Hisao Setoguchi, Katsuya Takanashi, Kentaro Ishizuka, Shoko Araki |
| 2008 | Multi-speaker meeting audio segmentation. Tin Lay Nwe, Minghui Dong, Swe Zin Kalayar Khine, Haizhou Li |
| 2008 | Multi-stream spectro-temporal features for robust speech recognition. Sherry Y. Zhao, Nelson Morgan |
| 2008 | Multidimensional features of emotional speech. Tomoko Suzuki, Machiko Ikemoto, Tomoko Sano, Toshihiko Kinoshita |
| 2008 | Multilevel parametric-base F0 model for speech synthesis. Javier Latorre, Masami Akamine |
| 2008 | Multimodal perception of Mandarin tone for cochlear implant users. Damien J. Smith, Denis Burnham |
| 2008 | Multipitch tracking using a factorial hidden Markov model. Michael Wohlmayr, Franz Pernkopf |
| 2008 | N-best based stochastic mapping on stereo HMM for noise robust speech recognition. Xiaodong Cui, Mohamed Afify, Yuqing Gao |
| 2008 | Neural network based regression for robust overlapping speech recognition using microphone arrays. Weifeng Li, John Dines, Mathew Magimai-Doss, Hervé Bourlard |
| 2008 | Noise driven short-time phase spectrum compensation procedure for speech enhancement. Anthony P. Stark, Kamil K. Wójcicki, James G. Lyons, Kuldip K. Paliwal |
| 2008 | Noise reduction through compressed sensing. Jort F. Gemmeke, Bert Cranen |
| 2008 | Noise robust speech dereverberation using constrained inverse filter. Ken'ichi Furuya, Akitoshi Kataoka, Youichi Haneda |
| 2008 | Non-segmental duration feature extraction for prosodic classification. Amy Dashiell, Brian Hutchinson, Anna Margolis, Mari Ostendorf |
| 2008 | Nonlinear mixture autoregressive hidden Markov models for speech recognition. Sundararajan Srinivasan, Tao Ma, Daniel May, Georgios Y. Lazarou, Joseph Picone |
| 2008 | Nonnative speech recognition based on state-candidate bilingual model modification. Qingqing Zhang, Ta Li, Jielin Pan, Yonghong Yan |
| 2008 | Nonverbal responses to social inclusion and exclusion. Emiel Krahmer, Juliette Schaafsma, Marc Swerts, Ad Vingerhoets |
| 2008 | Objective intelligibility assessment of pathological speakers. Catherine Middag, Gwen Van Nuffelen, Jean-Pierre Martens, Marc De Bodt |
| 2008 | On a generalization of margin-based discriminative training to robust speech recognition. Jinyu Li, Chin-Hui Lee |
| 2008 | On estimation of a speaker's confusion matrix from sparse data. Stephen Cox |
| 2008 | On the combination of auditory and modulation frequency channels for ASR applications. Fabio Valente, Hynek Hermansky |
| 2008 | On the development of variable length Teager energy operator (VTEO). Vikrant Tomar, Hemant A. Patil |
| 2008 | On the equivalence of Gaussian and log-linear HMMs. Georg Heigold, Patrick Lehnen, Ralf Schlüter, Hermann Ney |
| 2008 | On the generation of synthetic disfluent speech: local prosodic modifications caused by the insertion of editing terms. Jordi Adell, Antonio Bonafonte, David Escudero Mancebo |
| 2008 | On the impact of alignment on voice conversion performance. Elina Helander, Jan Schwarz, Jani Nurminen, Hanna Silén, Moncef Gabbouj |
| 2008 | On the mask modeling and feature representation in the missing-feature ASR: evaluation on the Consonant Challenge. Peter Jancovic, Münevver Köküer |
| 2008 | On the perceived quality of noise reduced signals. Valérie Gautier-Turbin, Laetitia Gros |
| 2008 | On the properties of a time-varying quasi-harmonic model of speech. Yannis Pantazis, Olivier Rosec, Yannis Stylianou |
| 2008 | On the role of acting skills for the collection of simulated emotional speech. Emiel Krahmer, Marc Swerts |
| 2008 | On the use of a multilingual neural network front-end. Stefano Scanzio, Pietro Laface, Luciano Fissore, Roberto Gemello, Franco Mana |
| 2008 | Online unsupervised pattern discovery in speech using parallelization. Mrugesh R. Gajjar, R. Govindarajan, T. V. Sreenivas |
| 2008 | Online vocabulary adaptation using contextual information and information retrieval. Hagai Aronowitz |
| 2008 | Open-vocabulary spoken-document retrieval based on query expansion using related web documents. Makoto Terao, Takafumi Koshinaka, Shinichi Ando, Ryosuke Isotani, Akitoshi Okumura |
| 2008 | Optimization and evaluation of Gabor feature sets for ASR. Bernd T. Meyer, Birger Kollmeier |
| 2008 | Packing the meeting summarization knapsack. Korbinian Riedhammer, Daniel Gillick, Benoît Favre, Dilek Hakkani-Tür |
| 2008 | Paralinguistic effects on turn-taking behavior in expressive conversation. Hiroki Mori, Hideki Kasuya |
| 2008 | Paralinguistic elements in speech synthesis. Didier Cadic, Lionel Segalen |
| 2008 | Parallel and hierarchical speech feature classification using frame and segment-based methods. Jun Hou, Lawrence R. Rabiner, Sorin Dusan |
| 2008 | Parallelized factor analysis and feature normalization for automatic speaker verification. Jun Luo, Cheung-Chi Leung, Marc Ferras, Claude Barras |
| 2008 | Parameter clustering and sharing in variable-parameter HMMs for noise robust speech recognition. Dong Yu, Li Deng, Yifan Gong, Alex Acero |
| 2008 | Parameter estimation method of F0 control model for singing voices. Yasunori Ohishi, Hirokazu Kameoka, Kunio Kashino, Kazuya Takeda |
| 2008 | Parsing with subdomain instance weighting from raw corpora. Barbara Plank, Khalil Sima'an |
| 2008 | Patterns, prototypes, performance: classifying emotional user states. Dino Seppi, Anton Batliner, Björn W. Schuller, Stefan Steidl, Thurid Vogt, Johannes Wagner, Laurence Devillers, Laurence Vidrascu, Noam Amir, Vered Aharonson |
| 2008 | Pausing and phrase length in two australian languages. Bella Ross |
| 2008 | Penalty function maximization for large margin HMM training. George Saon, Daniel Povey |
| 2008 | Perception and production of /i: /, /i@/ and /e: / in australian English. Robert H. Mannell |
| 2008 | Perception and production of consonant clusters in Japanese-English bilingual and Japanese monolingual speakers. Hinako Masuda, Takayuki Arai |
| 2008 | Perception of dialectal prosody. Adrian Leemann, Beat Siebenhaar |
| 2008 | Perceptual evidence of modern Greek voiced stops as phonological categories. Mark Antoniou, Catherine T. Best, Michael D. Tyler |
| 2008 | Perceptual speaker identification using monosyllabic stimuli - effects of the nucleus vowels and speaker characteristics contained in nasals. Kanae Amino, Takayuki Arai |
| 2008 | Performance improvement of text-independent speaker verification systems based on histogram enhancement in noisy environments. C. H. Kwon, J. K. Choi, Eliathamby Ambikairajah |
| 2008 | Phone recognition from ultrasound and optical video sequences for a silent speech interface. Thomas Hueber, Gérard Chollet, Bruce Denby, Gérard Dreyfus, Maureen Stone |
| 2008 | Phone-based cepstral polynomial SVM system for speaker recognition. Sachin S. Kajarekar |
| 2008 | Phone-duration-dependent long-term dynamic features for a stochastic model-based voice activity detection. Takashi Fukuda, Osamu Ichikawa, Masafumi Nishimura |
| 2008 | Phoneme recognition based on hybrid neural networks with inhibition/enhancement of distinctive phonetic feature (DPF) trajectories. Mohammad Nurul Huda, Kouichi Katsurada, Tsuneo Nitta |
| 2008 | Phonetic and speaker variations in automatic emotion classification. Vidhyasaharan Sethu, Eliathamby Ambikairajah, Julien Epps |
| 2008 | Phonetic confusion analysis and robust phone set generation for Shanghai-accented Mandarin speech recognition. Guo-Hong Ding |
| 2008 | Phonetic query expansion for spoken document retrieval. Jonathan Mamou, Bhuvana Ramabhadran |
| 2008 | Phonetic-acoustic and feature analyses by a neural network to assess speech quality in patients treated for head and neck cancer. Marieke de Bruijn, Irma Verdonck-de Leeuw, Louis ten Bosch, Joop Kuik, Hugo Quené, Lou Boves, Hans Langendijk, C. René Leemans |
| 2008 | Phonetically prestopped laterals in Australian languages: a preliminary investigation of Warlpiri. Deborah Loakes, Andrew Butcher, Janet Fletcher, Hywel Stoakes |
| 2008 | Phonological representations in poor readers. Cecile T. L. Kuijpers, Louis ten Bosch |
| 2008 | Phonotactically well-formed onset clusters as processing units in word recognition. Tom Lentz |
| 2008 | Physical models of the human vocal tract with gel-type material. Takayuki Arai |
| 2008 | Physically embodied conversational agents as health and fitness companions. Markku Turunen, Jaakko Hakulinen, Cameron G. Smith, Daniel Charlton, Li Zhang, Marc Cavazza |
| 2008 | Pitch adaptive features for LVCSR. Giulia Garau, Steve Renals |
| 2008 | Pitch target analysis of Thai tones using quantitative target approximation model and unsupervised clustering. Santitham Prom-on |
| 2008 | Politecnico di Torino system for the 2007 NIST language recognition evaluation. Fabio Castaldo, Emanuele Dalmasso, Pietro Laface, Daniele Colibro, Claudio Vair |
| 2008 | Positional effects on the characterization of ejectives in Waima'a. Mary Stevens, John Hajek |
| 2008 | Predictability of STRFs in auditory cortex neurons depends on stimulus class. Max F. K. Happel, Simon Müller, Jörn Anemüller, Frank W. Ohl |
| 2008 | Predicting ASR errors by exploiting barge-in rate of individual users for spoken dialogue systems. Kazunori Komatani, Tatsuya Kawahara, Hiroshi G. Okuno |
| 2008 | Predicting tongue shapes from a few landmark locations. Chao Qin, Miguel Á. Carreira-Perpiñán, Korin Richmond, Alan Wrench, Steve Renals |
| 2008 | Prelexically-driven perceptual retuning of phoneme boundaries. Anne Cutler, James M. McQueen, Sally Butterfield, Dennis Norris |
| 2008 | Preliminary evaluation of speech/sound recognition for telemedicine application in a real environment. Michel Vacher, Anthony Fleury, Jean-François Serignat, Norbert Noury, Hubert Glasson |
| 2008 | Preparing a corpus of dutch spontaneous dialogues for automatic phonetic analysis. Barbara Schuppler, Mirjam Ernestus, Odette Scharenborg, Lou Boves |
| 2008 | Probabilistic answer selection based on conditional random fields for spoken dialog system. Yoshitaka Yoshimi, Ryota Kakitsuba, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda |
| 2008 | Probabilistic feature mapping based on trajectory HMMs. Heiga Zen, Yoshihiko Nankaku, Keiichi Tokuda |
| 2008 | Probabilistic latent speaker training for large vocabulary speech recognition. Dan Su, Xihong Wu, Huisheng Chi |
| 2008 | Pronunciation error detection techniques for children's speech. Daniel Bolaños, Wayne H. Ward, Barbara Wise, Sarel van Vuuren |
| 2008 | Pronunciation reduction: how it relates to speech style, gender, and age. Helmer Strik, Joost van Doremalen, Catia Cucchiarini |
| 2008 | Pronunciation training: the role of eye and ear. Dominic W. Massaro, Stephanie Bigler, Trevor H. Chen, Marcus Perlman, Slim Ouni |
| 2008 | Pronunciation verification of English letter-sounds in preliterate children. Matthew Black, Joseph Tepperman, Abe Kazemzadeh, Sungbok Lee, Shrikanth S. Narayanan |
| 2008 | Prosodic and spectral features within segment-based acoustic modeling. Björn W. Schuller, Xiaohua Zhang, Gerhard Rigoll |
| 2008 | Prosodic manifestations of confidence and uncertainty in spoken language. Heather Pon-Barry |
| 2008 | Prosodic position effects and function words in English: a pilot study. Mitsuhiro Nakamura |
| 2008 | Prosody boundary detection through context-dependent position models. Yue-Ning Hu, Min Chu, Chao Huang, Yan-Ning Zhang |
| 2008 | Prosody for Mandarin speech recognition: a comparative study of read and spontaneous speech. Yu Ting Yeung, Yao Qian, Tan Lee, Frank K. Soong |
| 2008 | Psychoacoustically-motivated adaptive β-order generalized spectral subtraction based on data-driven optimization. Junfeng Li, Hui Jiang, Masato Akagi |
| 2008 | Quantitative analysis of intonation patterns produced by Cantonese speakers with Parkinson's disease: a preliminary study. Joan K.-Y. Ma, Tara L. Whitehill |
| 2008 | Quantitative prosodic analysis of spontaneous speech. Hansjörg Mixdorff |
| 2008 | Question and answer database optimization using speech recognition results. Shota Takeuchi, Tobias Cincarek, Hiromichi Kawanami, Hiroshi Saruwatari, Kiyohiro Shikano |
| 2008 | Rapid unsupervised speaker adaptation robust in reverberant environment conditions. Randy Gomez, Jani Even, Kiyohiro Shikano |
| 2008 | Rate dependent spectral reduction for voiceless fricatives. Benjamin Weiss |
| 2008 | Realistic facial animation system for interactive services. Kang Liu, Jörn Ostermann |
| 2008 | Recent improvements of the RWTH GALE Mandarin LVCSR system. Christian Plahl, Björn Hoffmeister, Mei-Yuh Hwang, Danju Lu, Georg Heigold, Jonas Lööf, Ralf Schlüter, Hermann Ney |
| 2008 | Recognition of English utterances with grammatical and lexical mistakes for dialogue-based CALL system. Akinori Ito, Ryohei Tsutsui, Shozo Makino, Motoyuki Suzuki |
| 2008 | Recognition of stress in speech using wavelet analysis and Teager energy operator. Ling He, Margaret Lech, Sheeraz Memon, Nicholas B. Allen |
| 2008 | Recognizing and modelling regional varieties of Swedish. Jonas Beskow, Gösta Bruce, Laura Enflo, Björn Granström, Susanne Schötz |
| 2008 | Recognizing named entities in spoken Chinese dialogues with a character-level maximum entropy tagger. Changchun Bao, Weiqun Xu, Yonghong Yan |
| 2008 | Recovering participant identities in meetings from a probabilistic description of vocal interaction. Kornel Laskowski, Tanja Schultz |
| 2008 | Reducing the effect of OOV query words by using morph-based spoken document retrieval. Ville T. Turunen |
| 2008 | Region-based vocal tract length normalization for ASR. Michail G. Maragakis, Alexandros Potamianos |
| 2008 | Regularized non-negative matrix factorization with temporal dependencies for speech denoising. Kevin W. Wilson, Bhiksha Raj, Paris Smaragdis |
| 2008 | Reinforced temporal structure information for embedded utterance-based speaker recognition. Anthony Larcher, Jean-François Bonastre, John S. D. Mason |
| 2008 | Relation between geometry and kinematics of articulatory trajectory associated with emotional speech production. Sungbok Lee, Tsuneo Kato, Shrikanth S. Narayanan |
| 2008 | Reversal of short front vowel raising in Australian English. Felicity Cox, Sallyanne Palethorpe |
| 2008 | Rhythm based music segmentation and octave scale cepstral features for sung language recognition. Namunu Chinthaka Maddage, Haizhou Li |
| 2008 | Rich morphology based n-gram language models for Arabic. Ahmad Emami, Imed Zitouni, Lidia Mangu |
| 2008 | Robust far-field speaker identification under mismatched conditions. Qin Jin, Tanja Schultz |
| 2008 | Robust front end processing for speech recognition in reverberant environments: utilization of speech characteristics. Rico Petrick, Xugang Lu, Masashi Unoki, Masato Akagi, Rüdiger Hoffmann |
| 2008 | Robust signal-to-noise ratio estimation based on waveform amplitude distribution analysis. Chanwoo Kim, Richard M. Stern |
| 2008 | Robust speaker change detection using Kernel-Gaussian model. Jie Gao, Xiang Zhang, Qingwei Zhao, Yonghong Yan |
| 2008 | Robust speaker identification using cross-correlation GTF-ICA feature. Yushi Zhang, Waleed H. Abdulla |
| 2008 | Robust speaker verification using short-time frequency with long-time window and fusion of multi-resolutions. Chien-Lin Huang, Bin Ma, Chung-Hsien Wu, Brian Mak, Haizhou Li |
| 2008 | Robust spoken term detection using combination of phone-based and word-based recognition. Kenji Iwata, Koichi Shinoda, Sadaoki Furui |
| 2008 | Robust voiced/unvoiced speech classification using empirical mode decomposition and periodic correlation model. Md. Khademul Islam Molla, Keikichi Hirose, Nobuaki Minematsu |
| 2008 | Robustness of HMM-based speech synthesis. Junichi Yamagishi, Zhen-Hua Ling, Simon King |
| 2008 | Robustness of prosodic features to voice imitation. Mireia Farrús, Michael Wagner, Jan Anguita, Javier Hernando |
| 2008 | SPRAAK: an open source "SPeech recognition and automatic annotation kit". Kris Demuynck, Jan Roelens, Dirk Van Compernolle, Patrick Wambacq |
| 2008 | Schwa variants in american English. H. Timothy Bunnell, Jason Lilley |
| 2008 | Science workshop with sliding vocal-tract model. Takayuki Arai |
| 2008 | Scripted dialogs versus improvisation: lessons learned about emotional elicitation techniques from the IEMOCAP database. Carlos Busso, Shrikanth S. Narayanan |
| 2008 | Search and classification based language model adaptation. Qin Shi, Stephen M. Chu, Wen Liu, Hong-Kwang Jeff Kuo, Yi Liu, Yong Qin |
| 2008 | Seed models combination and state level mappings of cross-lingual transfer for rapid HMM development: from English to Mandarin. Xufang Zhao, Douglas D. O'Shaughnessy |
| 2008 | Segmentation cues in lexical identification and in lexical acquisition: same or different? Odile Bagou, Ulrich H. Frauenfelder |
| 2008 | Short- and long-term dynamic features for robust speech recognition. Takashi Fukuda, Osamu Ichikawa, Masafumi Nishimura |
| 2008 | Significance of group delay based acoustic features in the linguistic search space for robust speech recognition. R. Ramya, Rajesh M. Hegde, Hema A. Murthy |
| 2008 | Silence feature normalization for robust speech recognition in additive noise environments. Chih-Cheng Wang, Chi-An Pan, Jeih-weih Hung |
| 2008 | Silence models in weighted finite-state transducers. Philip N. Garner |
| 2008 | Similarity between vowels influences response execution in word identification. Jason D. Zevin, Thomas A. Farmer |
| 2008 | Simultaneous conversion of duration and spectrum based on statistical models including time-sequence matching. Kaori Yutani, Yosuke Uto, Yoshihiko Nankaku, Tomoki Toda, Keiichi Tokuda |
| 2008 | Six- and twelve-month-olds' discrimination of native versus non-native between- and within-organ fricative place contrasts. Michael D. Tyler, Catherine T. Best, Louis M. Goldstein, Mark Antoniou, Lidija Krebs-Lazendic |
| 2008 | Soft margin estimation with various separation levels for LVCSR. Jinyu Li, Zhi-Jie Yan, Chin-Hui Lee, Ren-Hua Wang |
| 2008 | Soft missing-feature mask generation for simultaneous speech recognition system in robots. Toru Takahashi, Shun'ichi Yamamoto, Kazuhiro Nakadai, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno |
| 2008 | Sound capture system and spatial filter for small devices. Ivan Tashev, Slavy Mihov, Tyler Gleghorn, Alex Acero |
| 2008 | Source separation based on binaural cues and source model constraints. Ron J. Weiss, Michael I. Mandel, Daniel P. W. Ellis |
| 2008 | Sparse linear predictors for speech processing. Daniele Giacobello, Mads Græsbøll Christensen, Joachim Dahl, Søren Holdt Jensen, Marc Moonen |
| 2008 | Speaker adaptive training using shift-MLLR. Jonas Lööf, Christian Gollan, Hermann Ney |
| 2008 | Speaker identification for whispered speech based on frequency warping and score competition. Xing Fan, John H. L. Hansen |
| 2008 | Speaker identification in noise mismatch conditions based on jump function Kolmogorov analysis in wavelet domain. Tran Huy Dat, Haizhou Li |
| 2008 | Speaker orientation estimation based on hybridation of GCC-PHAT and HLBR. Carlos Segura, Alberto Abad, Javier Hernando, Climent Nadeu |
| 2008 | Speaker recognition based on variational Bayesian method. Tatsuya Ito, Kei Hashimoto, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda |
| 2008 | Speaker recognition in two-wire test sessions. Hagai Aronowitz, Yosef A. Solewicz |
| 2008 | Speaker verification with non-audible murmur segments by combining global alignment kernel and penalized logistic regression machine. Hideki Okamoto, Tomoko Matsui, Hiromichi Kawanami, Hiroshi Saruwatari, Kiyohiro Shikano |
| 2008 | Speaker-independent emotion recognition based on feature vector classification. Jeong-Sik Park, Ji-Hwan Kim, Sang-Min Yoon, Yung-Hwan Oh |
| 2008 | Spectral envelope recovery beyond the nyquist limit for high-quality manipulation of speech sounds. Hideki Kawahara, Masanori Morise, Hideki Banno, Toru Takahashi, Ryuichi Nisimura, Toshio Irino |
| 2008 | Spectral noise shaping: improvements in speech/audio codec based on linear prediction in spectral domain. Sriram Ganapathy, Petr Motlícek, Hynek Hermansky, Harinath Garudadri |
| 2008 | Spectral subtraction in likelihood-maximizing framework for robust speech recognition. Bagher BabaAli, Hossein Sameti, Mehran Safayani |
| 2008 | Spectro-temporal features for robust far-field speaker identification. Tiago H. Falk, Wai-Yip Chan |
| 2008 | Speech analysis using instantaneous frequency deviation. Anthony P. Stark, Kuldip K. Paliwal |
| 2008 | Speech as a means of monitoring cognitive function of elderly speakers. Shona D'Arcy, Viliam Rapcan, Nils Penard, Margaret E. Morris, Ian H. Robertson, Richard B. Reilly |
| 2008 | Speech enhancement based on hypothesized Wiener filtering. V. Ramasubramanian, Deepak Vijaywargi |
| 2008 | Speech enhancement based on novel two-step a priori SNR estimators. Md. Jahangir Alam, Douglas D. O'Shaughnessy, Sid-Ahmed Selouani |
| 2008 | Speech enhancement using a wiener denoising technique and musical noise reduction. Md. Jahangir Alam, Sid-Ahmed Selouani, Douglas D. O'Shaughnessy, Sofia Ben Jebara |
| 2008 | Speech interaction with an emotional robotic dog. Christian Martyn Jones, Andrew Deeming |
| 2008 | Speech recognition for vocalized and subvocal modes of production using surface EMG signals from the neck and face. Geoffrey S. Meltzner, Jason J. Sroka, James T. Heaton, L. Donald Gilmore, Glen Colby, Serge H. Roy, Nancy Chen, Carlo J. De Luca |
| 2008 | Speech recognition in noisy environments using a switching linear dynamic model for feature enhancement. Björn W. Schuller, Martin Wöllmer, Tobias Moosmayr, Gerhard Rigoll |
| 2008 | Speech recognition performance of CJLC: corpus of Japanese lecture contents. Satoru Kogure, Hiromitsu Nishizaki, Masatoshi Tsuchiya, Kazumasa Yamamoto, Shingo Togashi, Seiichi Nakagawa |
| 2008 | Speech recognition using non-linear trajectories in a formant-based articulatory layer of a multiple-level segmental HMM. Hongwei Hu, Martin J. Russell |
| 2008 | Speech recognition using soft decision trees. Jitendra Ajmera, Masami Akamine |
| 2008 | Speech-driven 3d facial animation for mobile entertainment. Juan Yan, Xiang Xie, Hao Hu |
| 2008 | Speech-driven lip motion generation with a trajectory HMM. Gregor Hofer, Junichi Yamagishi, Hiroshi Shimodaira |
| 2008 | Speech-overlapped acoustic event detection for automotive applications. Christian A. Müller, Joan-Isaac Biel, Edward Kim, Daniel Rosario |
| 2008 | Speech/laughter classification in meeting audio. Swe Zin Kalayar Khine, Tin Lay Nwe, Haizhou Li |
| 2008 | Speech/non-speech segments detection based on chaotic and prosodic features. Soheil Shafiee, Farshad Almasganj, Ayyoob Jafari |
| 2008 | Spoken digit recognition using a hierarchical temporal memory. Joost van Doremalen, Lou Boves |
| 2008 | Spoken document retrieval by translating recognition candidates into correct transcriptions. Tomoyosi Akiba, Yusuke Yokota |
| 2008 | Spoken keyword spotting via multi-lattice alignment. Hui Lin, Alex Stupakov, Jeff A. Bilmes |
| 2008 | Spoken language translation systems ************ ASR word lattice translation with exhaustive reordering is possible. Evgeny Matusov, Björn Hoffmeister, Hermann Ney |
| 2008 | Statistical shared plan-based dialog management. Amanda J. Stent, Srinivas Bangalore |
| 2008 | Statistical speech activity detection based on spatial power distribution for analyses of poster presentations. Kentaro Ishizuka, Shoko Araki, Tatsuya Kawahara |
| 2008 | Statistical text-to-speech synthesis with improved dynamics. Stas Tiomkin, David Malah |
| 2008 | Strategies for building a Farsi-English SMT system from limited resources. Andreas Kathol, Jing Zheng |
| 2008 | Stream decoding for simultaneous spoken language translation. Muntsin Kolss, Stephan Vogel, Alex Waibel |
| 2008 | Structure to speech conversion - speech generation based on infant-like vocal imitation. Daisuke Saito, Satoshi Asakawa, Nobuaki Minematsu, Keikichi Hirose |
| 2008 | Structured heterogeneity of English stress variants. Noriko Hattori |
| 2008 | Structured models for joint decoding of repeated utterances. Geoffrey Zweig, Dan Bohus, Xiao Li, Patrick Nguyen |
| 2008 | Studies on estimation of the number of sources in blind source separation. Takaaki Ishibashi, Hidetoshi Nakashima, Hiromu Gotanda |
| 2008 | Study of integration of statistical model-based voice activity detection and noise suppression. Masakiyo Fujimoto, Kentaro Ishizuka, Tomohiro Nakatani |
| 2008 | Study of jacobian compensation using linear transformation of conventional MFCC for VTLN. D. Rama Sanand, Srinivasan Umesh |
| 2008 | Study on "ng, a" type of discourse markers in standard Chinese. Zhigang Yin, Aijun Li, Ziyu Xiong |
| 2008 | Study on manipulation method of voice quality based on the vocal tract area function. Yoshinori Uchimura, Hideki Banno, Fumitada Itakura, Hideki Kawahara |
| 2008 | Study on strained rough voice as a conveyer of rage. Yumiko O. Kato, Yoshifumi Hirose, Takahiro Kamai |
| 2008 | Study on unique pharyngeal and uvular consonants in foreign accented Arabic. Yousef Ajami Alotaibi, Khondaker Abdullah Al Mamun, Muhammad Ghulam |
| 2008 | Subspace based speech enhancement using Gaussian mixture model. Achintya Kundu, Saikat Chatterjee, T. V. Sreenivas |
| 2008 | Sudden noise reduction based on GMM with noise power estimation. Nobuyuki Miyake, Tetsuya Takiguchi, Yasuo Ariki |
| 2008 | Syntactic complexity induces explicit grounding in the Maptask corpus. Martin I. Tietze, Vera Demberg, Johanna D. Moore |
| 2008 | Synthesis by generation and concatenation of multiform segments. Vincent Pollet, Andrew P. Breen |
| 2008 | System combination for spoken language understanding. Stefan Hahn, Patrick Lehnen, Hermann Ney |
| 2008 | T-test distance and clustering criterion for speaker diarization. Trung Hieu Nguyen, Engsiong Chng, Haizhou Li |
| 2008 | T-tilt: a modified tilt model for F0 analysis and synthesis in tonal languages. Ausdang Thangthai, Nattanun Thatphithakkul, Chai Wutiwiwatchai, Anocha Rugchatjaroen, Sittipong Saychum |
| 2008 | Talking heads and pronunciation training: a review. Valérie Hazan |
| 2008 | Tandem processing of fepstrum features. Vivek Tyagi |
| 2008 | Target-oriented phone selection from universal phone set for spoken language recognition. Rong Tong, Bin Ma, Haizhou Li, Engsiong Chng |
| 2008 | Testing a large corpus of natural standard Arabic for rhythm class. Liz Dockendorf, Dalal Almubayei, Matthew Benton |
| 2008 | Text, rhythm and metrical form in an Aboriginal song series. Myfany Turpin |
| 2008 | Text-dependent speaker recognition by efficient capture of speaker dynamics in compressed time-frequency representations of speech. Amitava Das, Gokul Chittaranjan |
| 2008 | Thai named-entity recognition using class-based language modeling on multiple-sized subword units. Kwanchiva Saykhum, Vataya Boonpiam, Nattanun Thatphithakkul, Chai Wutiwiwatchai, Cholwich Nattee |
| 2008 | The CMU-interACT 2008 Mandarin transcription system. Roger Hsiao, Mark C. Fuhs, Yik-Cheung Tam, Qin Jin, Tanja Schultz |
| 2008 | The English pronunciation of successive groups of Maori speakers. Catherine Inez Watson, Margaret Maclagan, Jeanette King, Ray Harlow |
| 2008 | The MITLL NIST LRE 2007 language recognition system. Pedro A. Torres-Carrasquillo, Elliot Singer, William M. Campbell, Terry P. Gleason, Alan McCree, Douglas A. Reynolds, Fred Richardson, Wade Shen, Douglas E. Sturim |
| 2008 | The acoustic to articulation mapping: non-linear or non-unique? Daniel Neiberg, Gopal Ananthakrishnan, Olov Engwall |
| 2008 | The assimilation of L2 australian English vowels to L1 Japanese vowel categories: vocabulary size matters. Rikke L. Bundgaard-Nielsen, Catherine T. Best, Michael D. Tyler |
| 2008 | The best of both worlds: unifying conventional dialog systems and POMDPs. Jason D. Williams |
| 2008 | The case for automatic higher-level features in forensic speaker recognition. Elizabeth Shriberg, Andreas Stolcke |
| 2008 | The effect of auditory and visual degradation on audiovisual perception of native and non-native speakers. Valérie Hazan, Enid Li |
| 2008 | The effect of cognitive load on disfluencies during in-vehicle spoken dialogue. Anders Lindström, Jessica Villing, Staffan Larsson, Alexander Seward, Nina Åberg, Cecilia Holtelius |
| 2008 | The effect of first language (L1) dialects on the identification of Vietnamese word-final stops. Kimiko Tsukada, Thu T. A. Nguyen |
| 2008 | The effect of position on the realization of second occurrence focus. Jason B. Bishop |
| 2008 | The effect of spectral tilt on infants' discrimination of fricatives. Elizabeth Beach, Christine Kitamura, Harvey Dillon, Teresa Ching, Denis Burnham |
| 2008 | The entropy of the articulatory phonological code: recognizing gestures from tract variables. Xiaodan Zhuang, Hosung Nam, Mark Hasegawa-Johnson, Louis M. Goldstein, Elliot Saltzman |
| 2008 | The expression and perception of emotions: comparing assessments of self versus others. Carlos Busso, Shrikanth S. Narayanan |
| 2008 | The impact of language dynamics on the capitalization of broadcast news. Fernando Batista, Nuno J. Mamede, Isabel Trancoso |
| 2008 | The influence of audio presentation style on multitasking during teleconferences. Stuart N. Wrigley, Simon Tucker, Guy J. Brown, Steve Whittaker |
| 2008 | The intelligibility of the English vowel /ʌ/ produced by native speakers of Japanese and its relations to the acoustic characteristics. Akiyo Joto |
| 2008 | The interspeech 2008 consonant challenge. Martin Cooke, Odette Scharenborg |
| 2008 | The linear transformation of LF glottal waveforms for voice conversion. Arantza del Pozo, Steve J. Young |
| 2008 | The meanings carried by interjections in spontaneous speech. Carlos Toshinori Ishi, Hiroshi Ishiguro, Norihiro Hagita |
| 2008 | The non-native consonant challenge for european languages. María Luisa García Lecumberri, Martin Cooke, Francesco Cutugno, Mircea Giurgiu, Bernd T. Meyer, Odette Scharenborg, Wim A. van Dommelen, Jan Volín |
| 2008 | The role of 'delta' features in speaker verification. Ying Liu, Martin J. Russell, Michael J. Carey |
| 2008 | The role of Japanese pitch accent in spoken-word recognition: evidence from middle-aged accentless dialect listeners. Takashi Otake, Marii Higuchi |
| 2008 | The strength of stress-related lexical competition depends on the presence of first-syllable stress. Eva Reinisch, Alexandra Jesse, James M. McQueen |
| 2008 | The value of auditory offset adaptation and appropriate acoustic modeling. Huan Wang, David Gelbart, Hans-Günter Hirsch, Werner Hemmert |
| 2008 | The vowels of Australian Aboriginal English. Andrew Butcher, Victoria Anderson |
| 2008 | Three-sectional-staff characterization of Cantonese level tones. Rerrario Shui-Ching Ho, Yoshinori Sagisaka |
| 2008 | Time and frequency dependent amplification for speech intelligibility enhancement in noisy environments. Henk Brouckxon, Werner Verhelst, Bart De Schuymer |
| 2008 | Time-lag adaptation for semi-synchronous speech and pen input. Yasushi Watanabe, Koichi Shinoda, Sadaoki Furui |
| 2008 | To what extent does tagged-MRI technique allow to infer tongue muscles' activation pattern? a modelling study. Stéphanie Buchaillard, Pascal Perrier, Yohan Payan |
| 2008 | Tone hyperarticulation in Cantonese infant-directed speech. Nan Xu, Denis Burnham |
| 2008 | Topic segmentation and indexation in a media watch system. Rui Amaral, Isabel Trancoso |
| 2008 | Towards a non-parametric acoustic model: an acoustic decision tree for observation probability calculation. Jasha Droppo, Michael L. Seltzer, Alex Acero, Yu-Hsiang Bosco Chiu |
| 2008 | Towards a segmental vocoder driven by ultrasound and optical images of the tongue and lips. Thomas Hueber, Gérard Chollet, Bruce Denby, Gérard Dreyfus, Maureen Stone |
| 2008 | Towards automatic emotional state categorization from speech signals. Arslan Shaukat, Ke Chen |
| 2008 | Towards automatic learning in LVCSR: rapid development of a Persian broadcast transcription system. Christian Gollan, Hermann Ney |
| 2008 | Towards domain independence in machine aided human translation. Aarthi M. Reddy, Richard C. Rose |
| 2008 | Towards flexible speech coding for speech synthesis: an LF + modulated noise vocoder. Yannis Agiomyrgiannakis, Olivier Rosec |
| 2008 | Towards measuring continuous acoustic feature convergence in unconstrained spoken dialogues. Spyros Kousidis, David Dorran, Yi Wang, Brian Vaughan, Charlie Cullen, Dermot Campbell, Ciaran McDonnell, Eugene Coyle |
| 2008 | Towards the integration of automatic speech recognition and information retrieval for spoken query processing. Antonio Moreno-Daniel, Jay G. Wilpon, Biing-Hwang Juang, Sarangarajan Parthasarathy |
| 2008 | Towards unsupervised training of the classifier-based speech translator. Emil Ettelaie, Panayiotis G. Georgiou, Shrikanth S. Narayanan |
| 2008 | Towards vocabulary-independent speech indexing for large-scale repositories. Jian Shao, Roger Peng Yu, Qingwei Zhao, Yonghong Yan, Frank Seide |
| 2008 | Training audio events detectors with a sound effects corpus. Isabel Trancoso, José Portelo, Miguel M. F. Bugalho, João Paulo Neto, António Joaquim Serralheiro |
| 2008 | Transcribing broadcast data using MLP features. Petr Fousek, Lori Lamel, Jean-Luc Gauvain |
| 2008 | Transcription-less call routing using unsupervised language model adaptation. Nicolae Duta |
| 2008 | Traveling wave based group delays for cochlear implant speech processing. Daniel A. Taft, David B. Grayden, Anthony N. Burkitt |
| 2008 | Tree grammars as models of prosodic structure. Joseph Tepperman, Shrikanth S. Narayanan |
| 2008 | Two protocols comparing human and machine phonetic recognition performance in conversational speech. Wade Shen, Joseph P. Olive, Douglas A. Jones |
| 2008 | Two stage iterative Wiener filtering for speech enhancement. Krishna Nand K., T. V. Sreenivas |
| 2008 | Two step speaker segmentation method using Bayesian information criterion and adapted Gaussian mixtures models. Matej Grasic, Marko Kos, Andrej Zgank, Zdravko Kacic |
| 2008 | Two's a crowd: improving speaker diarization by automatically identifying and excluding overlapped speech. Kofi Boakye, Oriol Vinyals, Gerald Friedland |
| 2008 | Two- and three-dimensional visual articulatory models for pronunciation training and for treatment of speech disorders. Bernd J. Kröger, Verena Graf-Borttscheller, Anja Lowit |
| 2008 | Two-stage prosody prediction for emotional text-to-speech synthesis. Hao Tang, Xi Zhou, Matthias Odisio, Mark Hasegawa-Johnson, Thomas S. Huang |
| 2008 | Unsupervised adaptation for HMM-based speech synthesis. Simon King, Keiichi Tokuda, Heiga Zen, Junichi Yamagishi |
| 2008 | Unsupervised language model adaptation based on topic and role information in multiparty meetings. Songfang Huang, Steve Renals |
| 2008 | Unsupervised learning of edit parameters for matching name variants. Daniel Gillick, Dilek Hakkani-Tür, Michael Levit |
| 2008 | Unsupervised re-scoring of observation probability based on maximum entropy criterion by using confidence measure with telephone speech. Carlos Molina, Néstor Becerra Yoma, Fernando Huenupán, Claudio Garretón |
| 2008 | Unsupervised versus supervised training of acoustic models. Jeff Z. Ma, Richard M. Schwartz |
| 2008 | Usability of ASR-based reading training for dyslexics. Jakob Schou Pedersen, Lars Bo Larsen, Børge Lindberg |
| 2008 | Use of spectral centre of gravity for generating speaker invariant features for automatic speech recognition. D. Rama Sanand, V. Balaji, Rani R. Sandhya, Srinivasan Umesh |
| 2008 | Usefulness of text-conditioning and a new database for text-dependent speaker recognition research. Amitava Das, Gokul Chittaranjan, Gopala Krishna Anumanchipalli |
| 2008 | User perception of multi-modal interfaces for mobile applications. Florian Metze, Roman Englert, Udo Bub, Ingmar Kliche, Thomas Scheerbarth |
| 2008 | User study of the Bayesian update of dialogue state approach to dialogue management. Blaise Thomson, Milica Gasic, Simon Keizer, François Mairesse, Jost Schatzmann, Kai Yu, Steve J. Young |
| 2008 | Using KL-based acoustic models in a large vocabulary recognition task. Guillermo Aradilla, Hervé Bourlard, Mathew Magimai-Doss |
| 2008 | Using MAP estimation of feature transformation for speaker recognition. Donglai Zhu, Bin Ma, Haizhou Li |
| 2008 | Using latent Dirichlet allocation to incorporate domain knowledge for topic transition detection. Xiaodan Zhu, Xuming He, Cosmin Munteanu, Gerald Penn |
| 2008 | Using prosody for the improvement of ASR - sentence modality recognition. Klára Vicsi, György Szaszák |
| 2008 | Using syllable nuclei locations to improve automatic speech recognition in the presence of burst noise. Chris D. Bartels, Jeff A. Bilmes |
| 2008 | Utterance-level normalization for relative articulation rate analysis. Tuomo Saarni, Jussi Hakokari, Jouni Isoaho, Tapio Salakoski |
| 2008 | Verifying pronunciation accuracy from speakers with neuromuscular disorders. Shou-Chun Yin, Richard C. Rose, Oscar Saz, Eduardo Lleida |
| 2008 | Visual speech modifies the phoneme restoration effect. Erin Cvejic, Jeesun Kim, Chris Davis |
| 2008 | Vocabulary independent discriminative term frequency estimation. J. Scott Olsson |
| 2008 | Vocal imitation in early language acquisition. Lisa Gustavsson, Francisco Lacerda |
| 2008 | Vocal tract inversion by cepstral analysis-by-synthesis using chain matrices. Sankaran Panchapagesan, Abeer Alwan |
| 2008 | Voice activity detection algorithms using subband power distance feature for noisy environments. Tuan Van Pham, Michael Stadtschnitzer, Franz Pernkopf, Gernot Kubin |
| 2008 | Voice activity detection using modified Wigner-ville distribution. Lakshmish Kaushik, Douglas D. O'Shaughnessy |
| 2008 | Voice commands in home environment - a consumer survey. Hannu Soronen, Markku Turunen, Jaakko Hakulinen |
| 2008 | Voicing influences the saliency of place of articulation in audio-visual speech perception in babble. Magnus Alm, Dawn M. Behne |
| 2008 | Vowel duration, compression and lengthening in stressed syllables in central and southern varieties of standard Italian. John Hajek, Mary Stevens |
| 2008 | Vowel epenthesis, acoustics and phonology patterns in Moroccan Arabic. Azra Nahid Ali, Mohamed Lahrouchi, Michael Ingleby |
| 2008 | Vowel placement during operatic singing: 'come si parla' or 'aggiustamento'? Thomas John Millhouse, Dianna T. Kenny |
| 2008 | Weakly supervised training for parsing Mandarin broadcast transcripts. Wen Wang |
| 2008 | Weighted segmental k-means initialization for SOM-based speaker clustering. Oshry Ben-Harush, Itshak Lapidot, Hugo Guterman |
| 2008 | What makes a good speaker? subject ratings, acoustic measurements and perceptual evaluations. Eva Strangert, Joakim Gustafson |
| 2008 | When calls go wrong: how to detect problematic calls based on log-files and emotions? Ota Herm, Alexander Schmitt, Jackson Liscombe |
| 2008 | Wikispeech - a content management system for speech databases. Christoph Draxler, Klaus Jänsch |
| 2008 | Within-class feature normalization for robust speech recognition. Yuan-Fu Liao, Chi-Hui Hsu, Chi-Min Yang, Jeng-Shien Lin, Sen-Chia Chang |
| 2008 | Word stress placement by native speakers and Japanese learners of English. Keiichi Ishikawa, Jun Nomura |
| 2008 | XMLLR for improved speaker adaptation in speech recognition. Daniel Povey, Hong-Kwang Jeff Kuo |
| 2008 | f-divergence is a generalized invariant measure between distributions. Yu Qiao, Nobuaki Minematsu |
| 2008 | iCNC and iROVER: the limits of improving system combination with classification? Björn Hoffmeister, Ralf Schlüter, Hermann Ney |