INTERSPEECH - RankMe

752 papers

Year	Title / Authors
2007	"polyaural" array processing for automatic speech recognition in degraded environments. Richard M. Stern, Evandro B. Gouvêa, Govindarajan Thattai
2007	8th Annual Conference of the International Speech Communication Association, INTERSPEECH 2007, Antwerp, Belgium, August 27-31, 2007
2007	A Bayesian network classifier for word-level reading assessment. Joseph Tepperman, Matthew Black, Patti Price, Sungbok Lee, Abe Kazemzadeh, Matteo Gerosa, Margaret Heritage, Abeer Alwan, Shrikanth S. Narayanan
2007	A GMM-based probabilistic sequence kernel for speaker verification. Kong-Aik Lee, Changhuai You, Haizhou Li, Tomi Kinnunen
2007	A HMM recognition of consonant-vowel syllables from lip contours: the cued speech case. Noureddine Aboutabit, Denis Beautemps, Jeanne Clarke, Laurent Besacier
2007	A MAP based approach to adaptive speech intelligibility measurements. Trym Holter, Svein Srsdal
2007	A comparative evaluation of the zeros of z transform representation for voice source estimation. Nicolas Sturmel, Christophe d'Alessandro, Boris Doval
2007	A comparative study of speech rate estimation techniques. Tomas Dekens, Mike Demol, Werner Verhelst, Piet Verhoeve
2007	A comparative study on speech summarization of broadcast news and lecture speech. Jian Zhang, Ricky Ho Yin Chan, Pascale Fung, Lu Cao
2007	A comparison of acoustic features for articulatory inversion. Chao Qin, Miguel Á. Carreira-Perpiñán
2007	A comparison of estimated and MAP-predicted formants and fundamental frequencies with a speech reconstruction application. Jonathan Darch, Ben Milner
2007	A comparison of session variability compensation techniques for SVM-based speaker recognition. Mitchell McLaren, Robbie Vogt, Brendan Baker, Sridha Sridharan
2007	A comparison of speaker clustering and speech recognition techniques for air situational awareness. Wade Shen, Douglas A. Reynolds
2007	A computational model for unsupervised word discovery. Louis ten Bosch, Bert Cranen
2007	A conservative aggressive subspace tracker. Koby Crammer
2007	A corpus study of the 3 Yiya Chen, Jiahong Yuan
2007	A data visualization and analysis method for natural language call routing system design. Hong-Kwang Jeff Kuo, Vaibhava Goel
2007	A fast fuzzy keyword spotting algorithm based on syllable confusion network. Jian Shao, Qingwei Zhao, Pengyuan Zhang, Zhaojie Liu, Yonghong Yan
2007	A fast optimization method for large margin estimation of HMMs based on second order cone programming. Yan Yin, Hui Jiang
2007	A fine pitch model for speech. Jasha Droppo, Alex Acero
2007	A flexible spectral modification method based on temporal decomposition and Gaussian mixture model. Binh Phu Nguyen, Masato Akagi
2007	A four-cube FEM model of the extrinsic and intrinsic tongue muscles to simulate the production of vowel /i/. Sayoko Takano, Hiroki Matsuzaki, Kunitoshi Motoki
2007	A framework of reply speech generation for concept-to-speech conversion in spoken dialogue systems. Seiya Takada, Yuji Yagi, Keikichi Hirose, Nobuaki Minematsu
2007	A generic methodology of converting transliterated text to phonetic strings case study: greeklish. Nikos Tsourakis, Vassilios Digalakis
2007	A learning method for Thai phonetization of English words. Ausdang Thangthai, Chai Wutiwiwatchai, Anocha Rugchatjaroen, Sittipong Saychum
2007	A method for evaluating task-oriented spoken dialog translation systems based on communication efficiency. Toshiyuki Takezawa, Masahide Mizushima, Tohru Shimizu, Gen-ichiro Kikui
2007	A methodology for the automatic detection of perceived prominent syllables in spoken French. Jean-Philippe Goldman, Mathieu Avanzi, Anne-Catherine Simon, Anne Lacheret, Antoine Auchlin
2007	A model of glottal flow incorporating viscous-inviscid interaction. Tokihiko Kaburagi, Yosuke Tanabe
2007	A model-based estimation of phonotactic language verification performance. Kakeung Wong, Man-Hung Siu, Brian Mak
2007	A morpho-graphemic approach for the recognition of spontaneous speech in agglutinative languages - like Hungarian. Péter Mihajlik, Tibor Fegyó, Zoltán Tüske, Pavel Ircing
2007	A multiple-model based framework for automatic speech segmentation. Seung Seop Park, Jong Won Shin, Jong Kyu Kim, Nam Soo Kim
2007	A multitask learning perspective on acoustic-articulatory inversion. Korin Richmond
2007	A new approach for phoneme segmentation of speech signals. Ladan Golipour, Douglas D. O'Shaughnessy
2007	A new kernel for SVM MLLR based speaker recognition. Zahi N. Karam, William M. Campbell
2007	A novel 2kb/s waveform interpolation speech coder based on non-negative matrix factorization. Peng Zhang, Changchun Bao
2007	A novel energy distribution comparison approach for robust speech spectrum vector quantization. Ahmed Ismail, Yasser Dakroury, Hazem M. Abbas
2007	A packetization and variable bitrate interframe compression scheme for vector quantizer-based distributed speech recognition. Bengt J. Borgström, Abeer Alwan
2007	A pair-based language model for the robust lexical analysis in Chinese text-to-speech synthesis. Wu Liu, Dezhi Huang, Yuan Dong, Xinnian Mao, Haila Wang
2007	A paradigm for mobile speech-centric services. Lars Bo Larsen, Kasper Løvborg Jensen, Søren Larsen, Morten Højfeldt Rasmussen
2007	A phonetic concatenative approach of labial coarticulation. Vincent Robert, Yves Laprie, Anne Bonneau
2007	A phonetic search approach to the 2006 NIST spoken term detection evaluation. Roy Wallace, Robbie Vogt, Sridha Sridharan
2007	A pitch extraction system based on phase locked loops and consensus decision. Patricia A. Pelle, Claudio Estienne
2007	A portable record player for wax cylinders using a laser-beam reflection method. Tohru Ifukube, Yasuyuki Shimizu
2007	A preselection method based on cost degradation from the optimal sequence for concatenative speech synthesis. Nobuyuki Nishizawa, Hisashi Kawai
2007	A reference model weighting-based method for robust speech recognition. Yuan-Fu Liao, Yh-Her Yang, Chi-Hui Hsu, Cheng-Chang Lee, Jing-Teng Zeng
2007	A robust mel-scale subband voice activity detector for a car platform. Agustín Álvarez-Marquina, Rafael Martínez, Pedro Gómez, Victor Nieto Lluis, V. Rodellar
2007	A robust multi-phase pitch-mark detection algorithm. Milan Legát, Jindrich Matousek, Daniel Tihelka
2007	A robust stopping criterion for agglomerative hierarchical clustering in a speaker diarization system. Kyu Jeong Han, Shrikanth S. Narayanan
2007	A rule-based speech morphing for verifying a expressive speech perception model. Chun-Fang Huang, Masato Akagi
2007	A saliency-based auditory attention model with applications to unsupervised prominent syllable detection in speech. Ozlem Kalinli, Shrikanth S. Narayanan
2007	A semi-automatic approach for speaker mining of tapped telephone conversations. Sandeep Manocha, Carol Y. Espy-Wilson
2007	A semi-supervised learning approach for morpheme segmentation for an Arabic dialect. Mei Yang, Jing Zheng, Andreas Kathol
2007	A semi-supervised method for efficient construction of statistical spoken language understanding resources. Seokhwan Kim, Minwoo Jeong, Gary Geunbae Lee
2007	A smoothing kernel for spatially related features and its application to speaker verification. Luciana Ferrer, M. Kemal Sönmez, Elizabeth Shriberg
2007	A soft-clustering algorithm for automatic induction of semantic classes. Elias Iosif, Alexandros Potamianos
2007	A speech rate related lip movement model for speech animation. Wei Zhou, Zengfu Wang
2007	A statistical method of evaluating pronunciation proficiency for presentation in English. Seiichi Nakagawa, Kei Ohta
2007	A statistical model based post-filtering algorithm for residual echo suppression. Seung Yeol Lee, Jong Won Shin, Hwan Sik Yun, Nam Soo Kim
2007	A straightforward and efficient implementation of the factor analysis model for speaker verification. Driss Matrouf, Nicolas Scheffer, Benoit G. B. Fauve, Jean-François Bonastre
2007	A structured speech model parameterized by recursive dynamics and neural networks. Roberto Togneri, Li Deng
2007	A study on temporal features derived by analytic signal. Yotaro Kubo, Shigeki Okawa, Akira Kurematsu, Katsuhiko Shirai
2007	A study on word detector design and knowledge-based pruning and rescoring. Chengyuan Ma, Chin-Hui Lee
2007	A sub-optimal viterbi-like search for linear dynamic models classification. Dimitris Oikonomidis, Vassilios Diakoloukas, Vassilios Digalakis
2007	A system for transforming the emotion in speech: combining data-driven conversion techniques for prosody and voice quality. Zeynep Inanoglu, Steve J. Young
2007	A tagging algorithm for mixed language identification in a noisy domain. Mike Rosner, Paulseph-John Farrugia
2007	A text-constrained prosodic system for speaker verification. Elizabeth Shriberg, Luciana Ferrer
2007	A text-free approach to assessing nonnative intonation. Joseph Tepperman, Abe Kazemzadeh, Shrikanth S. Narayanan
2007	A trainable excitation model for HMM-based speech synthesis. Ranniery Maia, Tomoki Toda, Heiga Zen, Yoshihiko Nankaku, Keiichi Tokuda
2007	A unified approach to multi-pose audio-visual ASR. Patrick Lucey, Gerasimos Potamianos, Sridha Sridharan
2007	A unified probabilistic generative framework for extractive spoken document summarization. Yi-Ting Chen, Hsuan-Sheng Chiu, Hsin-Min Wang, Berlin Chen
2007	A uniformly most powerful test for statistical model-based voice activity detection. Keun Won Jang, Dong Kook Kim, Joon-Hyuk Chang
2007	A variational approach to robust maximum likelihood estimation for speech recognition. Mohamed Kamal Omar
2007	ASR-based pronunciation training: scoring accuracy and pedagogical effectiveness of a system for dutch L2 learners. Catia Cucchiarini, Ambra Neri, Febe de Wet, Helmer Strik
2007	Accelerating the annotation of lexical data for less-resourced languages. Gerhard B. Van Huyssteen, Martin J. Puttkammer
2007	Accent assignment algorithm in Hungarian, based on syntactic analysis. Anne Tamm, Kálmán Abari, Gábor Olaszy
2007	Accurate marginalization range for missing data recognition. Sébastien Demange, Christophe Cerisara, Jean Paul Haton
2007	Acoustic analysis of the neutral tone in Mandarin. Philippe Martin, Jun Li
2007	Acoustic and affective comparisons of natural and imaginary infant-, foreigner- and adult-directed speech. Monja A. Knoll, Lisa Scharrer
2007	Acoustic correlates of intelligibility enhancements in clearly produced fricatives. Kazumi Maniwa, Allard Jongman, Travis Wade
2007	Acoustic correlates of laryngeal-muscle fatigue: findings for a phonometric prevention of acquired voice pathologies. Victor J. Boucher
2007	Acoustic features of anger utterances during natural dialog. Yoshiko Arimoto, Sumio Ohno, Hitoshi Iida
2007	Acoustic language identification using fast discriminative training. Fabio Castaldo, Daniele Colibro, Emanuele Dalmasso, Pietro Laface, Claudio Vair
2007	Acoustic parameters for the automatic detection of vowel nasalization. Tarun Pruthi, Carol Y. Espy-Wilson
2007	Acoustic-phonetic features for refining the explicit speech segmentation. Antonio Marcos Selmini, Fábio Violaro
2007	Acquisition and synchronization of multimodal articulatory data. Michael Aron, Nicolas Ferveur, Erwan Kerrien, Marie-Odile Berger, Yves Laprie
2007	Acquisition of vowel duration in children speaking american English. Eon-Suk Ko
2007	Active binaural distance estimation for dynamic sources. Yan-Chen Lu, Martin Cooke, Heidi Christensen
2007	Adaptive weighting of microphone arrays for distant-talking F0 and voiced/unvoiced estimation. Federico Flego, Christian Zieger, Maurizio Omologo
2007	Adding noise to improve noise robustness in speech recognition. Nicolás Morales, Liang Gu, Yuqing Gao
2007	Advanced front-end for robust speech recognition in extremely adverse environments. Dimitrios Dimitriadis, José C. Segura, Luz García, Alexandros Potamianos, Petros Maragos, Vassilis Pitsikalis
2007	Advances in Mandarin broadcast speech recognition. Mei-Yuh Hwang, Wen Wang, Xin Lei, Jing Zheng, Özgür Çetin, Gang Peng
2007	Advances in speechfind: transcript reliability estimation employing confidence measure based on discriminative sub-word model for SDR. Wooil Kim, John H. L. Hansen
2007	Age-related changes in fundamental frequency and formants: a longitudinal study of four speakers. Jonathan Harrington, Sallyanne Palethorpe, Catherine I. Watson
2007	Alignment of the second low target in dutch falling-rising pitch contours. Jörg Peters, Judith Hanssen, Carlos Gussenhoven
2007	Always listening to you: creating exhaustive audio database in home environments. Yasunari Obuchi, Akio Amano
2007	Ambient telephony: scenarios and research challenges. Aki Härmä
2007	An 8-32 kbit/s scalable wideband coder extended with MDCT-based bandwidth extension on top of a 6.8 kbit/s narrowband CELP coder. Masahiro Oshikiri, Hiroyuki Ehara, Toshiyuki Morii, Tomofumi Yamanashi, Kaoru Satoh, Koji Yoshida
2007	An HMM acoustic model incorporating various additional knowledge sources. Sakriani Sakti, Konstantin Markov, Satoshi Nakamura
2007	An HMM-based speech synthesis system applied to German and its adaptation to a limited set of expressive football announcements. Sacha Krstulovic, Anna Hunecke, Marc Schröder
2007	An MRI study of european portuguese nasals. Paula Martins, Inês Carbone, Augusto Silva, António J. S. Teixeira
2007	An active approach to speaker and task adaptation based on automatic analysis of vocabulary confusability. Qiang Huo, Wei Li
2007	An analysis of individual differences in the f Hiromi Kawatsu, Sumio Ohno
2007	An approach to efficient generation of high-accuracy and compact error-corrective models for speech recognition. Takanobu Oba, Takaaki Hori, Atsushi Nakamura
2007	An approach to iterative speech feature enhancement and recognition. Stefan Windmann, Reinhold Haeb-Umbach
2007	An approximate solution for perceptually constrained signal subspace speech enhancement method. Adam Borowicz, Alexander A. Petrovsky
2007	An articulatory and acoustic study of "retroflex" and "bunched" american English rhotic sound based on MRI. Xinhui Zhou, Carol Y. Espy-Wilson, Mark Tiede, Suzanne Boyce
2007	An automatic prosody labeling method for Mandarin speech. Chen-Yu Chiang, Hsiu-Min Yu, Yih-Ru Wang, Sin-Horng Chen
2007	An effective initial/final duration prediction method for corpus-based singing voice synthesis of Mandarin Chinese. Cheng-Yuan Lin, Pei-Chi Jao, Jyh-Shing Roger Jang
2007	An empirical investigation of the nonuniqueness in the acoustic-to-articulatory mapping. Chao Qin, Miguel Á. Carreira-Perpiñán
2007	An ensemble modeling approach to joint characterization of speaker and speaking environments. Yu Tsao, Chin-Hui Lee
2007	An evaluation of cross-language adaptation and native speech training for rapid HMM construction based on very limited training data. Xufang Zhao, Douglas D. O'Shaughnessy
2007	An extension 2DPCA based visual feature extraction method for audio-visual speech recognition. Guanyong Wu, Jie Zhu
2007	An improved method for unsupervised training of LVCSR systems. Christian Gollan, Stefan Hahn, Ralf Schlüter, Hermann Ney
2007	An improved speaker diarization system. Rong Fu, Ian D. Benest
2007	An information state based dialogue manager for a mobile robot. Marcelo Quinderé, Luís Seabra Lopes, António J. S. Teixeira
2007	An information theoretic approach to predict speech intelligibility for listeners with normal and impaired hearing. Svante Stadler, Arne Leijon, Björn Hagerman
2007	An integration method of retrieval results using plural subword models for vocabulary-free spoken document retrieval. Yoshiaki Itoh, Kohei Iwata, Kazunori Kojima, Masaaki Ishigame, Kazuyo Tanaka, Shi-wook Lee
2007	An interactive timeline for speech database browsing. Benoît Favre, Jean-François Bonastre, Patrice Bellot
2007	An open-set detection evaluation methodology applied to language and emotion recognition. David A. van Leeuwen, Khiet P. Truong
2007	An optimal speech enhancement under speech uncertainty probability and masking property of auditory system. Xiaoshan Huang, Xiaoqun Zhao
2007	An overview on automatic speech attribute transcription (ASAT). Chin-Hui Lee, Mark A. Clements, Sorin Dusan, Eric Fosler-Lussier, Keith Johnson, Biing-Hwang Juang, Lawrence R. Rabiner
2007	An unsupervised approach to automatic prosodic annotation. Xinqiang Ni, Yining Chen, Frank K. Soong, Min Chu, Ping Zhang
2007	Analysis and classification of speech mode: whispered through shouted. Chi Zhang, John H. L. Hansen
2007	Analysis of communication failures for spoken dialogue systems. Sebastian Möller, Klaus-Peter Engelbrecht, Antti Oulasvirta
2007	Analysis of emotional speech prosody in terms of part of speech tags. Murtaza Bulut, Sungbok Lee, Shrikanth S. Narayanan
2007	Analysis of head motions and speech in spoken dialogue. Carlos Toshinori Ishi, Hiroshi Ishiguro, Norihiro Hagita
2007	Analysis of the impact of analogue telephone channel on MFCC parameters for voice pathology detection. Rubén Fraile, Juan Ignacio Godino-Llorente, Nicolás Sáenz-Lechón, Víctor Osma-Ruiz, Pedro Gómez-Vilda
2007	Analysis of the occurrence of laughter in meetings. Kornel Laskowski, Susanne Burger
2007	Analyzing temporal transition of real user's behaviors in a spoken dialogue system. Kazunori Komatani, Tatsuya Kawahara, Hiroshi G. Okuno
2007	Application of CMLLR in narrow band wide band adapted systems. Martin Karafiát, Lukás Burget, Jan Cernocký, Thomas Hain
2007	Application of shifted delta cepstral features in speaker verification. José R. Calvo, Rafael Fernández, Gabriel Hernández
2007	Application of speech technology in a home based assessment kiosk for early detection of alzheimer's disease. Rachel Coulston, Esther Klabbers, Jacques de Villiers, John-Paul Hosom
2007	Applying word duration constraints by using unrolled HMMs. Ning Ma, Jon Barker, Phil D. Green
2007	Approaches for adaptive database reduction for text-to-speech synthesis. Aleksandra Krul, Géraldine Damnati, François Yvon, Cédric Boidin, Thierry Moudenc
2007	Approximation method of subglottal system using ARMA filter. Nobuhiro Miki, Kyohei Hayashi
2007	Articulatory acoustic feature applications in speech synthesis. Peter Cahill, Daniel Aioanei, Julie Carson-Berndsen
2007	Articulatory feature classifiers trained on 2000 hours of telephone speech. Joe Frankel, Mathew Magimai-Doss, Simon King, Karen Livescu, Özgür Çetin
2007	Articulatory synthesis of singing. Peter Birkholz
2007	Artificial bandwidth extension for speech signals using speech recogniton. Shingo Kuroiwa, Masashi Takashina, Satoru Tsuge, Fuji Ren
2007	Artificial bandwidth extension without side information for ITU-t g.729.1. Bernd Geiser, Hervé Taddei, Peter Vary
2007	Artificial impostor voice transformation effects on false acceptance rates. Jean-François Bonastre, Driss Matrouf, Corinne Fredouille
2007	Aspects of visual speech in Arabic. Slim Ouni, Kaïs Ouni
2007	Assessment of vocal dysperiodicities in connected disordered speech. Ali Alpan, Abdellah Kacha, Francis Grenez, Jean Schoentgen
2007	Attention shift decoding for conversational speech recognition. Raghunandan Kumaran, Jeff A. Bilmes, Katrin Kirchhoff
2007	Attribute-based Mandarin speech recognition using conditional random fields. Chi-Yueh Lin, Hsiao-Chuan Wang
2007	Audio classification using extended baum-welch transformations. Tara N. Sainath, Victor Zue, Dimitri Kanevsky
2007	Audio-based approaches to head orientation estimation in a smart-room. Alberto Abad, Carlos Segura, Climent Nadeu, Javier Hernando
2007	Audio-visual integration for robust speech recognition using maximum weighted stream posteriors. Rowan Seymour, Darryl Stewart, Ji Ming
2007	Audio-visual phoneme classification for pronunciation training applications. Hedvig Kjellström, Olov Engwall, Sherif Mahdy Abdou, Olle Bälter
2007	Audiovisual emotional speech of game playing children: effects of age and culture. Suleman Shahid, Emiel Krahmer, Marc Swerts
2007	Audiovisual speaker identity verification based on lip motion features. Girija Chetty, Michael Wagner
2007	Automated directory assistance system - from theory to practice. Dong Yu, Yun-Cheng Ju, Ye-Yi Wang, Geoffrey Zweig, Alex Acero
2007	Automatic acoustic segmentation for speech recognition on broadcast recordings. Gang Peng, Mei-Yuh Hwang, Mari Ostendorf
2007	Automatic assessment of children's reading level. Jacques Duchateau, Leen Cleuren, Hugo Van hamme, Pol Ghesquière
2007	Automatic building of synthetic voices from large multi-paragraph speech databases. Kishore Prahallad, Arthur R. Toth, Alan W. Black
2007	Automatic detection and classification of disfluent reading miscues in young children's speech for the purpose of assessment. Matthew Black, Joseph Tepperman, Sungbok Lee, Patti Price, Shrikanth S. Narayanan
2007	Automatic estimation of scaling factors among probabilistic models in speech recognition. Tadashi Emori, Yoshifumi Onishi, Koichi Shinoda
2007	Automatic extraction of cue phrases for important sentences in lecture speech and automatic lecture speech summarization. Yasuhisa Fujii, Norihide Kitaoka, Seiichi Nakagawa
2007	Automatic generation of cloze items for prepositions. John Lee, Stephanie Seneff
2007	Automatic head motion prediction from speech data. Gregor Hofer, Hiroshi Shimodaira
2007	Automatic large-scale oral language proficiency assessment. Febe de Wet, Christa van der Walt, Thomas Niesler
2007	Automatic laughter detection using neural networks. Mary Tai Knox, Nikki Mirghafori
2007	Automatic phonetic segmentation of Spanish emotional speech. Ascensión Gallardo-Antolín, Roberto Barra-Chicote, Marc Schröder, Sacha Krstulovic, Juan Manuel Montero
2007	Automatic pitch accent prediction for text-to-speech synthesis. Ian Read, Stephen Cox
2007	Automatic question detection: prosodic-lexical features and crosslingual experiments. Minh-Quang Vu, Laurent Besacier, Eric Castelli
2007	Automatic recognition of connected vowels only using speaker-invariant representation of speech dynamics. Satoshi Asakawa, Nobuaki Minematsu, Keikichi Hirose
2007	Automatic scoring of the intelligibility in patients with cancer of the oral cavity. Andreas K. Maier, Maria Schuster, Anton Batliner, Elmar Nöth, Emeka Nkenke
2007	Automatic speech recognition for an under-resourced language - amharic. Solomon Teferra Abate, Wolfgang Menzel
2007	Automatic speech recognition framework for multilingual audio contents. Hiroaki Nanjo, Yuichi Oku, Takehiko Yoshimi
2007	Automatic speech recognition with a cochlear implant front-end. Waldo Nogueira, Tamás Harczos, Bernd Edler, Jörn Ostermann, Andreas Büchner
2007	Automatic transcription for a web 2.0 service to search podcasts. Jun Ogata, Masataka Goto, Kouichirou Eto
2007	Automatically learning the units of speech by non-negative matrix factorisation. Veronique Stouten, Kris Demuynck, Hugo Van hamme
2007	BECAM tool - a semi-automatic tool for bootstrapping emotion corpus annotation and management. Slim Abdennadher, Mohamed Aly, Dirk Bühler, Wolfgang Minker, Johannes Pittermann
2007	Bayes risk-based optimization of dialogue management for document retrieval system with speech interface. Teruhisa Misu, Tatsuya Kawahara
2007	Behavior models for learning and receptionist dialogs. Hartwig Holzapfel, Alex Waibel
2007	Benchmarking human performance on the acoustic and linguistic subtasks of ASR systems. László Tóth
2007	Bhattacharyya error and divergence using variational importance sampling. Peder A. Olsen, John R. Hershey
2007	Bilingual LSA-based translation lexicon adaptation for spoken language translation. Yik-Cheung Tam, Tanja Schultz
2007	Bit-erasure channel decoding for GMM-based multiple description coding. Yannis Agiomyrgiannakis, Yannis Stylianou
2007	Blind adaptive principal eigenvector beamforming for acoustical source separation. Ernst Warsitz, Reinhold Haeb-Umbach, Dang Hai Tran Vu
2007	Boosting with anti-models for automatic language identification. Xi Yang, Man-Hung Siu, Herbert Gish, Brian Mak
2007	Bootstrapping morphological analysis of gĩkũyũ using unsupervised maximum entropy learning. Guy De Pauw, Peter Waiganjo Wagacha
2007	Building an information retrieval system for serbian - challenges and solutions. Miroslav Martinovic, Srdjdan Vesic, Goran Rakic
2007	Building multiple complementary systems using directed decision trees. Catherine Breslin, Mark J. F. Gales
2007	CALL courseware for learning reactive tokens in face-to-face dialogs. Takafumi Utashiro, Goh Kawai
2007	Can unquantised articulatory feature continuums be modelled? Odette Scharenborg, Vincent Wan
2007	Categorical perception in intonation: a matter of signal dynamics? Oliver Niebuhr
2007	Categorical perception of Cantonese tones in context: a cross-linguistic study. Hongying Zheng, Peter W. M. Tsang, William S.-Y. Wang
2007	Channel selection by class separability measures for automatic transcriptions on distant microphones. Matthias Wölfel
2007	Children's convergence in referring expressions to graphical objects in a speech-enabled computer game. Linda Bell, Joakim Gustafson
2007	Class constrained ROVER based speech enhancement. Amit Das, John H. L. Hansen
2007	Classification of discourse functions of affirmative words in spoken dialogue. Agustín Gravano, Stefan Benus, Julia Hirschberg, Shira Mitchell, Ilia Vovsha
2007	Cluster adaptive training weights as features in SVM-based speaker verification. Hao Yang, Yuan Dong, Xianyu Zhao, Jian Zhao, Liang Lu, Haila Wang
2007	Cluster-based polynomial-fit histogram equalization (CPHEQ) for robust speech recognition. Shih-Hsiang Lin, Yao-Ming Yeh, Berlin Chen
2007	Clustered maximum likelihood linear basis for rapid speaker adaptation. Yun Tang, Richard C. Rose
2007	Clustering-based two-dimensional linear discriminant analysis for speech recognition. Xiao-Bing Li, Douglas D. O'Shaughnessy
2007	Co-training using prosodic and lexical information for sentence segmentation. Ümit Güz, Sébastien Cuendet, Dilek Hakkani-Tür, Gökhan Tür
2007	Collection of empirical data for standardization of generic vocabularies in speech driven ICT devices and services. Rosemary Orr, Bernat González i Llinares, Françoise Petersen, Helge Hüttenrauch, Martin Böcker, Michael Tate
2007	Combination of LSF and pole based parameter interpolation for model-based diphone concatenation. Karl Schnell, Arild Lacroix
2007	Combined acoustic and pronunciation modelling for non-native speech recognition. Ghazi Bouselmi, Dominique Fohr, Irina Illina
2007	Combining frame and turn-level information for robust recognition of emotions within speech. Bogdan Vlasenko, Björn W. Schuller, Andreas Wendemuth, Gerhard Rigoll
2007	Combining length distribution model with decision tree in prosodic phrase prediction. Qin Shi, Danning Jiang, Fanping Meng, Yong Qin
2007	Combining rate and place information for robust pitch extraction. Martin Heckmann, Frank Joublin, Christian Goerick
2007	Combining short-term cepstral and long-term pitch features for automatic recognition of speaker age. Christian A. Müller, Felix Burkhardt
2007	Compact representations of the articulatory-to-acoustic mapping. Blaise Potard, Yves Laprie
2007	Comparing GMM-based speech transformation systems. Larbi Mesbahi, Vincent Barreaud, Olivier Boëffard
2007	Comparing american and palestinian perceptions of charisma using acoustic-prosodic and lexical analysis. Fadi Biadsy, Julia Hirschberg, Andrew Rosenberg, Wisam Dakka
2007	Comparing classifiers for pronunciation error detection. Helmer Strik, Khiet P. Truong, Febe de Wet, Catia Cucchiarini
2007	Comparing praat and snack formant measurements on two large corpora of northern and southern French. Cécile Woehrling, Philippe Boula de Mareüil
2007	Comparison of HMM and DTW methods in automatic recognition of pathological phoneme pronunciation. Robert Wielgat, Tomasz P. Zielinski, Pawel Swietojanski, Piotr Zoladz, Daniel Król, Tomasz Wozniak, Stanislaw Grabias
2007	Comparison of multiple voice source parameters in different phonation types. Matti Airas, Paavo Alku
2007	Comparison of subspace methods for Gaussian mixture models in speech recognition. Matti Varjokallio, Mikko Kurimo
2007	Comparison of two kinds of speaker location representation for SVM-based speaker verification. Xianyu Zhao, Yuan Dong, Hao Yang, Jian Zhao, Liang Lu, Haila Wang
2007	Complementarity and redundancy in multimodal user inputs with speech and pen gestures. Pui-Yu Hui, Zhengyu Zhou, Helen M. Meng
2007	Complementary approaches for voice disorder assessment. Jean-François Bonastre, Corinne Fredouille, Alain Ghio, Antoine Giovanni, Gilles Pouchoulin, Joana Revis, Bernard Teston, P. Yu
2007	Computer-supported human-human multilingual communication. Alex Waibel, Keni Bernardin, Matthias Wölfel
2007	Computerized chironomy: evaluation of hand-controlled intonation reiteration. Christophe d'Alessandro, Albert Rilliard, Sylvain Le Beux
2007	Concept and evaluation of a downward-compatible system for spatial teleconferencing using automatic speaker clustering. Alexander Raake, Sascha Spors, Jens Ahrens, Jitendra Ajmera
2007	Conditional use of word lattices, confusion networks and 1-best string hypotheses in a sequential interpretation strategy. Bogdan Minescu, Géraldine Damnati, Frédéric Béchet, Renato De Mori
2007	Conditionally linear Gaussian models for estimating vocal tract resonances. Daniel Rudoy, Daniel N. Spendley, Patrick J. Wolfe
2007	Confidence measure based unsupervised target model adaptation for speaker verification. Alexandre Preti, Jean-François Bonastre, Driss Matrouf, François Capman, Bertrand Ravera
2007	Confidence measures for voice search applications. Ye-Yi Wang, Dong Yu, Yun-Cheng Ju, Geoffrey Zweig, Alex Acero
2007	Construction and analysis of multiple paths in syllable models. Annika Hämäläinen, Louis ten Bosch, Lou Boves
2007	Construction of a phonotactic dialect corpus using semiautomatic annotation. Reva Schwartz, Wade Shen, Joseph P. Campbell, Shelley Paget, Julie Vonwiller, Dominique Estival, Christopher Cieri
2007	Construction of spoken language model including fillers using filler prediction model. Kengo Ohta, Masatoshi Tsuchiya, Seiichi Nakagawa
2007	Context constrained-generalized posterior probability for verifying phone transcriptions. Hua Zhang, Lijuan Wang, Frank K. Soong, Wenju Liu
2007	Context dependent syllable acoustic model for continuous Chinese speech recognition. Hao Wu, Xihong Wu
2007	Context dependent word modeling for statistical machine translation using part-of-speech tags. Ruhi Sarikaya, Yonggang Deng, Yuqing Gao
2007	Continuous prosodic features and formant modeling with joint factor analysis for speaker verification. Najim Dehak, Patrick Kenny, Pierre Dumouchel
2007	Continuous-speech phone recognition from ultrasound and optical images of the tongue and lips. Thomas Hueber, Gérard Chollet, Bruce Denby, Gérard Dreyfus, Maureen Stone
2007	Contributions of temporal fine structure cues to Chinese speech recognition in cochlear implant simulation. Lin Yang, Jianping Zhang, Yonghong Yan
2007	Control of an articulatory speech synthesizer based on dynamic approximation of spatial articulatory targets. Peter Birkholz
2007	Conversation detection and speaker segmentation in privacy-sensitive situated speech data. Danny Wyatt, Tanzeem Choudhury, Jeff A. Bilmes
2007	Corpus-based generation of prosodic features from text based on generation process model. Keikichi Hirose, Keiko Ochi, Nobuaki Minematsu
2007	Creating multimedia dictionaries of endangered languages using LEXUS. Jacquelijn Ringersma, Marc Kemps-Snijders
2007	Creating spoken dialogue characters from corpora without annotations. Sudeep Gandhe, David R. Traum
2007	Cross-language phonemisation in German text-to-speech synthesis. Jochen Steigner, Marc Schröder
2007	Cross-linguistic analysis of prosodic features for sentence segmentation. James G. Fung, Dilek Hakkani-Tür, Mathew Magimai-Doss, Elizabeth Shriberg, Sébastien Cuendet, Nikki Mirghafori
2007	DFT domain subspace based noise tracking for speech enhancement. Richard C. Hendriks, Jesper Jensen, Richard Heusdens
2007	Degradation-classification assisted single-ended quality measurement of speech. Hua Yuan, Tiago H. Falk, Wai-Yip Chan
2007	Dependence of tone perception on syllable perception. Michael Olsberg, Yi Xu, Jeremy Green
2007	Derivative and parametric kernels for speaker verification. Chris Longworth, Mark J. F. Gales
2007	Design and characterization of the non-native military air traffic communications database (nnMATC). Stéphane Pigeon, Wade Shen, Aaron D. Lawson, David A. van Leeuwen
2007	Design and development of voice controlled aids for motor-handicapped persons. Petr Cerva, Jan Nouza
2007	Design and recording of Czech sign language corpus for automatic sign language recognition. Pavel Campr, Marek Hrúz, Milos Zelezný
2007	Design of a rich multimodal interface for mobile spoken route guidance. Markku Turunen, Jaakko Hakulinen, Anssi Kainulainen, Aleksi Melto, Topi Hurtig
2007	Detecting deception using critical segments. Frank Enos, Elizabeth Shriberg, Martin Graciarena, Julia Hirschberg, Andreas Stolcke
2007	Detecting pitch accent using pitch-corrected energy-based predictors. Andrew Rosenberg, Julia Hirschberg
2007	Detection and removal of switching noise in push-to-talk and voice operated exchange communications systems. Brett Y. Smolenski
2007	Detection of instants of glottal closure using characteristics of excitation source. Sunitha Guruprasad, B. Yegnanarayana, K. Sri Rama Murty
2007	Detection of out-of-vocabulary words in posterior based ASR. Hamed Ketabdar, Mirko Hannemann, Hynek Hermansky
2007	Detection, diarization, and transcription of far-field lecture speech. Jing Huang, Etienne Marcheret, Karthik Visweswariah, Vit Libal, Gerasimos Potamianos
2007	Detection-based ASR in the automatic speech attribute transcription project. Ilana Bromberg, Qian Qian, Jun Hou, Jinyu Li, Chengyuan Ma, Brett Matthews, Antonio Moreno-Daniel, Jeremy Morris, Sabato Marco Siniscalchi, Yu Tsao, Yu Wang
2007	Development of multimodal resources for multilingual information retrieval in the basque context. Nora Barroso, Aitzol Ezeiza, N. Gilisagasti, Karmele López de Ipiña, A. López, Juan Miguel López
2007	Development of preschool children subsystem for ASR and q&a in a real-environment speech-oriented guidance task. Tobias Cincarek, Izumi Shindo, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano
2007	Dimension reduction for speaker identification based on mutual information. Xugang Lu, Jianwu Dang
2007	Dimensionality reduction for speech recognition using neighborhood components analysis. Natasha Singh-Miller, Michael Collins, Timothy J. Hazen
2007	Dimensionality reduction methods applied to both magnitude and phase derived features. Andrew Errity, John McKenna, Barry Kirkpatrick
2007	Dimensionality reduction of speech features using nonlinear principal components analysis. Stephen A. Zahorian, Tara Singh, Hongbing Hu
2007	Direct acoustic feature using iterative EM algorithm and spectral energy for classifying suicidal speech. T. Yingthawornsuk, H. Kaymaz Keskinpala, D. Mitchell Wilkes, Richard G. Shiavi, Ronald M. Salomon
2007	Direct optimisation of a multilayer perceptron for the estimation of cepstral mean and variance statistics. John Dines, Jithendra Vepa
2007	Discrimination and recognition of scaled word sounds. Toshio Irino, Yoshie Aoki, Yoshie Hayashi, Hideki Kawahara, Roy D. Patterson
2007	Discriminative MCE-based speaker adaptation of acoustic models for a spoken lecture processing task. Timothy J. Hazen, Erik McDermott
2007	Discriminative noise adaptive training approach for an environment migration. Byung Ok Kang, Ho-Young Jung, Yunkeun Lee
2007	Discriminative optimization of language adapted HMMs for a language identification system based on parallel phoneme recognizers. Josef G. Bauer, Bernt Andrassy, Ekaterina Timoshenko
2007	Disfluency correction of spontaneous speech using conditional random fields with variable-length features. Jui-Feng Yeh, Chung-Hsien Wu, Wei-Yen Wu
2007	Distinctive phonetic feature (DPF) based phone segmentation using hybrid neural networks. Mohammad Nurul Huda, Muhammad Ghulam, Junsei Horikawa, Tsuneo Nitta
2007	Do different boundary types induce subtle acoustic cues to which French listeners are sensitive? Odile Bagou, Sophie Dufour, Cécile Fougeron, Alain Content, Ulrich H. Frauenfelder
2007	Dual-channel acoustic detection of nasalization states. Xiaochuan Niu, Jan P. H. van Santen
2007	Duration and pauses as boundary-markers in speech: a cross-linguistic study. Li-chiung Yang
2007	Duration and pronunciation conditioned lexical modeling for speaker verification. Gökhan Tür, Elizabeth Shriberg, Andreas Stolcke, Sachin S. Kajarekar
2007	Dynamic integration of multiple feature streams for robust real-time LVCSR. Shoei Sato, Kazuo Onoe, Akio Kobayashi, Shinichi Homma, Toru Imai, Tohru Takagi, Tetsunori Kobayashi
2007	Dynamic language change in MIMUS. Carmen del Solar, Guillermo Pérez-García, Eva Florencio, David Moral, Gabriel Amores Carredano, Pilar Manchón Portillo
2007	Dynamic language model adaptation using presentation slides for lecture speech recognition. Hiroki Yamazaki, Koji Iwano, Koichi Shinoda, Sadaoki Furui, Haruo Yokota
2007	ELAN: a free and open-source multimedia annotation tool. Han Sloetjes, Albert Russel, Alexander Klassmann
2007	EMD based soft-thresholding for speech enhancement. Erhan Deger, Md. Khademul Islam Molla, Keikichi Hirose, Nobuaki Minematsu, Md. Kamrul Hasan
2007	Effect of incomplete glottal closures on estimates of glottal waves via inverse filtering of vowel sounds. Huiqun Deng, Douglas D. O'Shaughnessy
2007	Effect of intensive voice therapy on vocal tremor for parkinson speakers. Laurence Cnockaert, Jean Schoentgen, Canan Ozsancak, Pascal Auzou, Francis Grenez
2007	Effect of number of masking talkers on speech-on-speech masking in Chinese. Xihong Wu, Jing Chen, Zhigang Yang, Qiang Huang, Mengyuan Wang, Liang Li
2007	Effect of unsteady glottal flow on the speech production process. Hideyuki Nomura, Tetsuo Funada
2007	Effect of within- and between-talker variability on word identification in noise by younger and older adults. Huiwen Goy, Kathleen Pichora-Fuller, Pascal van Lieshout, Gurjit Singh, Bruce Schneider
2007	Effects of FE modelled consequences of tonsillectomy on perceptual evaluation of voice. Anne-Maria Laukkanen, Jaromír Horácek, Pavel Svancara, Elina Lehtinen
2007	Effects of non-native dialects on spoken word recognition. Jennifer T. Le, Catherine T. Best, Michael D. Tyler, Christian Kroos
2007	Effects of quiz-style information presentation on user understanding. Ryuichiro Higashinaka, Kohji Dohsaka, Shigeaki Amano, Hideki Isozaki
2007	Effects of testosterone levels on temporal and intonational aspects of speech: more exploratory data. Charles A. Lamoureux, Victor J. Boucher
2007	Efficient estimation of speaker-specific projecting feature transforms. Jonas Lööf, Ralf Schlüter, Hermann Ney
2007	Emotion attribute projection for speaker recognition on emotional speech. Huanjun Bao, Ming-Xing Xu, Thomas Fang Zheng
2007	Emotion clustering using the results of subjective opinion tests for emotion recognition in infants' cries. N. Satoh, Katsuya Yamauchi, Shoichi Matsunaga, Masaru Yamashita, R. Nakagawa, Kazuyuki Shinohara
2007	Empirical evidence for prosodic phrasing: pauses as linguistic annotation in Korean read speech. Hyongsil Cho, Daniel Hirst
2007	English and French speakers' perception of voicing distinctions in non-native lateral consonant syllable onsets. Catherine T. Best, Pierre A. Hallé, Jennifer S. Pardo
2007	Enhancing acoustic-to-EPG mapping with lip position information. Asterios Toutios, Konstantinos G. Margaritis
2007	Enhancing usability of CAPL system for qur'an recitation learning. Abdurrahman Samir, Sherif Mahdy Abdou, Ahmed Husien Khalil, Mohsen A. Rashwan
2007	Environmentally aware voice activity detector. Abhijeet Sangwan, Nitish Krishnamurthy, John H. L. Hansen
2007	Error detection in confusion network. Alexandre Allauzen
2007	Error-tolerant question answering for spoken documents. Tomoyosi Akiba, Hirofumi Tsujimura
2007	Estimating VTLN warping factors by distribution matching. Janne Pylkkönen
2007	Estimation of place of articulation in stop consonants for visual feedback. Milind S. Shah, Prem C. Pandey
2007	Evaluating acoustic distance measures for template based recognition. Mathias De Wachter, Kris Demuynck, Patrick Wambacq, Dirk Van Compernolle
2007	Evaluating and optimizing Japanese tutor system featuring dynamic question generation and interactive guidance. Christopher J. Waple, Hongcui Wang, Tatsuya Kawahara, Yasushi Tsubota, Masatake Dantsuji
2007	Evaluating the temporal structure normalisation technique on the Aurora-4 task. Xiong Xiao, Engsiong Chng, Haizhou Li
2007	Evaluating two versions of the momel pitch modelling algorithm on a corpus of read speech in Korean. Daniel Hirst, Hyongsil Cho, Sunhee Kim, Hyunji Yu
2007	Evaluation of alternatives on speech to sign language translation. Rubén San Segundo, Alicia Pérez, Daniel Ortiz, Luis Fernando D'Haro, M. Inés Torres, Francisco Casacuberta
2007	Evaluation of real-time voice activity detection based on high order statistics. David Cournapeau, Tatsuya Kawahara
2007	Evaluation of syllable stress using single class classifier. Abhinav Parate, Ashish Verma, Jayanta Basak
2007	Evaluation of the combined use of MEMLIN and MLLR on the non-native adaptation task of hiwire project database. Luis Buera, Antonio Miguel, Oscar Saz, Eduardo Lleida, Alfonso Ortega
2007	Event detection of speech signals based on auditory processing with a dynamic compressive gammachirp filterbank. Satomi Tanaka, Minoru Tsuzaki, Hiroaki Kato, Yoshinori Sagisaka
2007	Evolutionary minimum verification error learning of the alternative hypothesis model for LLR-based speaker verification. Yi-Hsiang Chao, Wei-Ho Tsai, Shih-Sian Cheng, Hsin-Min Wang, Ruei-Chuan Chang
2007	Experimental validation of direct and inverse glottal flow models for unsteady flow conditions. Julien Cisonni, Annemie Van Hirtum, Jan Willems, Xavier Pelorson
2007	Experiments on hiwire database using denoising and adaptation with a hybrid HMM-ANN model. Roberto Gemello, Franco Mana, Stefano Scanzio
2007	Exploiting information extraction annotations for document retrieval in distillation tasks. Dilek Hakkani-Tür, Gökhan Tür, Michael Levit
2007	Exploiting phoneme similarities in hybrid HMM-ANN keyword spotting. Joel Pinto, Andrew Lovitt, Hynek Hermansky
2007	Exploiting prosodic features for dialog act tagging in a discriminative modeling framework. Vivek Kumar Rangarajan Sridhar, Srinivas Bangalore, Shrikanth S. Narayanan
2007	Exploiting prosody for PCFGs with latent annotations. Markus Dreyer, Izhak Shafran
2007	Exploiting unlabeled internal data in conditional random fields to reduce word segmentation errors for Chinese texts. Richard Tzong-Han Tsai, Hsi-Chuan Hung, Hong-Jie Dai, Wen-Lian Hsu
2007	Exploring initiative strategies using computer simulation. Fan Yang, Peter A. Heeman
2007	Exploring tonal variations via context-dependent tone models. Yue-Ning Hu, Min Chu, Chao Huang, Yan-Ning Zhang
2007	Extended powered cepstral normalization (p-CN) with range equalization for robust features in speech recognition. Chang-Wen Hsu, Lin-Shan Lee
2007	Extra large vocabulary continuous speech recognition algorithm based on information retrieval. Valeriy Pylypenko
2007	Extracting true speaker identities from transcriptions. Yannick Estève, Sylvain Meignier, Paul Deléglise, Julie Mauclair
2007	F Hiroko Hirano, Keikichi Hirose, Goh Kawai, Wentao Gu, Nobuaki Minematsu
2007	F Rerrario Shui-Ching Ho, Yoshinori Sagisaka
2007	F0 transformation within the voice conversion framework. Zdenek Hanzlícek, Jindrich Matousek
2007	Fast adaptation of GMM-based compact models. Christophe Lévy, Georges Linarès, Jean-François Bonastre
2007	Feasibility of constructing an expressive speech corpus from television soap opera dialogue. Peter Rutten
2007	Feature and distribution normalization schemes for statistical mismatch reduction in reverberant speech recognition. A. M. Toh, Roberto Togneri, Sven Nordholm
2007	Features interpolation domain for distributed speech recognition and performance for ITU-t g.723.1 CODEC. Vladimir Fabregas Surigué de Alencar, Abraham Alcaim
2007	Features of pauses and conjunctions at syntactic and discourse boundaries in Japanese monologues. Michiko Watanabe, Yasuharu Den, Keikichi Hirose, Shusaku Miwa, Nobuaki Minematsu
2007	Fepstrum: an improved modulation spectrum for ASR. Vivek Tyagi
2007	Filtering the unknown: speech activity detection in heterogeneous video collections. Marijn Huijbregts, Chuck Wooters, Roeland Ordelman
2007	Fixed-size kernel logistic regression for phoneme classification. Peter Karsmakers, Kristiaan Pelckmans, Johan A. K. Suykens, Hugo Van hamme
2007	Formal modelling of L1 and L2 perceptual learning: computational linguistics versus machine learning. Paola Escudero, Jelle Kastelein, Klara A. Weiand, R. J. J. H. van Son
2007	Formant-based synthesis of singing. Sten Ternström, Johan Sundberg
2007	Frame alignment method for cross-lingual voice conversion. Daniel Erro, Asunción Moreno
2007	Frame margin probability discriminative training algorithm for noisy speech recognition. Hao-Zheng Li, Douglas D. O'Shaughnessy
2007	Frequency domain correspondence for speaker normalization. Ming Liu, Xi Zhou, Mark Hasegawa-Johnson, Thomas S. Huang, Zhengyou Zhang
2007	Frequency study for the characterization of the dysphonic voices. Gilles Pouchoulin, Corinne Fredouille, Jean-François Bonastre, Alain Ghio, Antoine Giovanni
2007	From one base form to multiple output styles - predicting stylistic dynamics of discourse prosody. Chiu-yu Tseng, Zhao-yu Su
2007	Fused HMM-adaptation of multi-stream HMMs for audio-visual speech recognition. David Dean, Patrick Lucey, Sridha Sridharan, Tim Wark
2007	Fusing acoustic, phonetic and data-driven systems for text-independent speaker verification. Asmaa El Hannani, Dijana Petrovska-Delacrétaz
2007	Fusion of contrastive acoustic models for parallel phonotactic spoken language identification. Khe Chai Sim, Haizhou Li
2007	Fusion of global statistical and segmental spectral features for speech emotion recognition. Hao Hu, Ming-Xing Xu, Wei Wu
2007	G2p conversion of names: what can we do (better)? Henk van den Heuvel, Jean-Pierre Martens, Nanneke Konings
2007	GEMSIS - a novel application of speech recognition to emergency and disaster medicine. Satoshi Tamura, Kunihiko Takamatsu, Shinji Ogura, Satoru Hayamizu
2007	Gaussian mixture optimization for HMM based on efficient cross-validation. Takahiro Shinozaki, Tatsuya Kawahara
2007	Generating small, accurate acoustic models with a modified Bayesian information criterion. Kai Yu, Rob A. Rutenbar
2007	Generative and discriminative algorithms for spoken language understanding. Christian Raymond, Giuseppe Riccardi
2007	Generic class-based statistical language models for robust speech understanding in directed dialog applications. Matthieu Hébert
2007	Getting start with UTDrive: driver-behavior modeling and assessment of distraction for in-vehicle speech systems. Pongtep Angkititrakul, DongGu Kwak, Sangjo Choi, Jeonghee Kim, Anh PhucPhan, Amardeep Sathyanarayana, John H. L. Hansen
2007	Global features for rapid identity verification with dynamic biometric data. Andrew C. Morris, Jacques C. Koreman, B. Ly-Van, Harin Sellahewa, Sabah Jassim, R. Llarena Gómez
2007	Group delay features for emotion detection. Vidhyasaharan Sethu, Eliathamby Ambikairajah, Julien Epps
2007	HMM-based speech recognition using decision trees instead of GMMs. Remco Teunen, Masami Akamine
2007	Handling OOV words in Arabic ASR via flexible morphological constraints. Nguyen Bach, Mohamed Noamany, Ian R. Lane, Tanja Schultz
2007	Handling phonetic context and speaker variation in a structure-based speech recognizer. Dong Yu, Li Deng, Alex Acero
2007	Handling speech input in the ritel QA dialogue system. Boris W. van Schooten, Sophie Rosset, Olivier Galibert, Aurélien Max, Rieks op den Akker, Gabriel Illouz
2007	Hierarchical acoustic modeling based on random-effects regression for automatic speech recognition. Yan Han, Lou Boves
2007	Hierarchical dialogue optimization using semi-Markov decision processes. Heriberto Cuayáhuitl, Steve Renals, Oliver Lemon, Hiroshi Shimodaira
2007	Hierarchical language identification based on automatic language clustering. Bo Yin, Eliathamby Ambikairajah, Fang Chen
2007	Hierarchical neural networks feature extraction for LVCSR system. Fabio Valente, Jithendra Vepa, Christian Plahl, Christian Gollan, Hynek Hermansky, Ralf Schlüter
2007	Hierarchical non-uniform unit selection based on prosodic structure. Jun Xu, Dezhi Huang, Yongxin Wang, Yuan Dong, Lianhong Cai, Haila Wang
2007	High-level feature-based speaker verification via articulatory phonetic-class pronunciation modeling. Shi-Xiong Zhang, Man-Wai Mak, Helen M. Meng
2007	Homograph ambiguity resolution in front-end design for portuguese TTS systems. Daniela Braga, Luís Pinto Coelho, Fernando Gil Vianna Resende Jr.
2007	How predictable is ASR confidence in dialog applications? Xiang Li, Juan M. Huerta
2007	How to access audio files of large data bases using in-car speech dialogue systems. Sandra Mann, André Berton, Ute Ehrlich
2007	How to integrate speech-operated internet information dialogs into a car. André Berton, Peter Regel-Brietzmann, Hans Ulrich Block, Stefanie Schachtl, Manfred Gehrke
2007	How to judge reusability of existing speech corpora for target task by utilizing statistical multidimensional scaling. Goshu Nagino, Makoto Shozakai, Kiyohiro Shikano
2007	How to personalize speech applications for web-based information in a car. Philipp Fischer, Andreas Österle, André Berton, Peter Regel-Brietzmann
2007	Hybrid electroglottograph and speech signal based algorithm for pitch marking. Hussein Hussein, Oliver Jokisch
2007	Hybridizing conversational and clear speech. Akiko Kusumoto, Alexander Kain, John-Paul Hosom, Jan P. H. van Santen
2007	IceNLP: a natural language processing toolkit for icelandic. Hrafn Loftsson, Eiríkur Rögnvaldsson
2007	Identification of natural whistled vowels by non-whistlers. Julien Meyer, Fanny Meunier, Laure Dentel
2007	Impact of various small sound source signals on voice conversion accuracy in speech communication aid for laryngectomees. Keigo Nakamura, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano
2007	Implementation and evaluation of an HMM-based Thai speech synthesis system. Suphattharachai Chomphan, Takao Kobayashi
2007	Improved HMM/SVM methods for automatic phoneme segmentation. Jen-Wei Kuo, Hung-Yi Lo, Hsin-Min Wang
2007	Improved acoustic modeling for transcribing Arabic broadcast data. Lori Lamel, Abdelkhalek Messaoudi, Jean-Luc Gauvain
2007	Improved language recognition using better phonetic decoders and fusion with MFCC and SDC features. Doroteo T. Toledano, Javier Gonzalez-Dominguez, Alejandro Abejón-Gonzalez, Danilo Spada, Ismael Mateos-Garcia, Joaquin Gonzalez-Rodriguez
2007	Improved location features for meeting speaker diarization. Scott Otterson
2007	Improved machine translation of speech-to-text outputs. Daniel Déchelotte, Holger Schwenk, Gilles Adda, Jean-Luc Gauvain
2007	Improved methods for language model based question classification. Andreas Merkel, Dietrich Klakow
2007	Improvements in machine translation for English/iraqi speech translation. Shirin Saleem, Krishna Subramanian, Rohit Prasad, David Stallard, Chia-Lin Kao, Prem Natarajan, Raid Suleiman
2007	Improving phonotactic language recognition with acoustic adaptation. Wade Shen, Douglas A. Reynolds
2007	Improving speaker diarization for CHIL lecture meetings. Jing Huang, Etienne Marcheret, Karthik Visweswariah
2007	Improving speech translation with automatic boundary prediction. Evgeny Matusov, Dustin Hillard, Mathew Magimai-Doss, Dilek Hakkani-Tür, Mari Ostendorf, Hermann Ney
2007	Improving the phase vocoder approach to pitch-shifting. Petko Nikolov Petkov, W. Bastiaan Kleijn
2007	In-context phone posteriors as complementary features for tandem ASR. Hamed Ketabdar, Hervé Bourlard
2007	Increasing prosodic variability of text-to-speech synthesizers. Géza Németh, Márk Fék, Tamás Gábor Csapó
2007	Incremental perception of acted and real emotional speech. Pashiera Barkhuysen, Emiel Krahmer, Marc Swerts
2007	Influence of task duration in text-independent speaker verification. Benoit G. B. Fauve, Nicholas W. D. Evans, Neil Pearson, Jean-François Bonastre, John S. D. Mason
2007	Information retrieval strategies for accessing african audio corpora. Abdillahi Nimaan, Pascal Nocera, Frédéric Béchet, Jean-François Bonastre
2007	Integrating MAP, marginals, and unsupervised language model adaptation. Wen Wang, Andreas Stolcke
2007	Integrating audio and visual cues for speaker friendliness in multimodal speech synthesis. David House
2007	Integrating pitch and localisation cues at a speech fragment level. Heidi Christensen, Ning Ma, Stuart N. Wrigley, Jon Barker
2007	Integration of ASR and machine translation models in a document translation task. Aarthi M. Reddy, Richard C. Rose, Alain Désilets
2007	Intensive gestures in French and their multimodal correlates. Gaëlle Ferré, Roxane Bertrand, Philippe Blache, Robert Espesser, Stéphane Rauzy
2007	Inter-language prosodic style modification experiment using word impression vector for communicative speech generation. Ke Li, Yoko Greenberg, Yoshinori Sagisaka
2007	Intercoder reliability in annotating complex disfluencies. Peter A. Heeman, Andy McMillin, J. Scott Yaruss
2007	Introduction to multilingual corpus-based concatenative speech synthesis. Filip Deprez, Jan Odijk, Jan De Moortel
2007	Investigations into early and late reflections on distant-talking speech recognition toward suitable reverberation criteria. Takanobu Nishiura, Yoshiki Hirano, Yuki Denda, Masato Nakayama
2007	Iraqcomm: a next generation translation system. Kristin Precoda, Jing Zheng, Dimitra Vergyri, Horacio Franco, Colleen Richey, Andreas Kathol, Sachin S. Kajarekar
2007	Irrelevant variability normalization based HMM training using VTS approximation of an explicit model of environmental distortions. Yu Hu, Qiang Huo
2007	Iterative unit selection with unnatural prosody detection. Dacheng Lin, Yong Zhao, Frank K. Soong, Min Chu, Jieyu Zhao
2007	JAAE: the java abstract annotation editor. Ivan Habernal, Miloslav Konopík
2007	Jitter and shimmer measurements for speaker recognition. Mireia Farrús, Javier Hernando, Pascual Ejarque
2007	Joint position-pitch extraction from multichannel audio. Michael Wohlmayr, Marián Képesi
2007	Joint speaker segmentation, localization and identification for streaming audio. Joerg Schmalenstroeer, Reinhold Haeb-Umbach
2007	Kettle hinders cat, shadow does not hinder shed: activation of 'almost embedded' words in nonnative listening. Mirjam Broersma
2007	Knowledge consistent user simulations for dialog systems. Hua Ai, Diane J. Litman
2007	L2 consonant identification in noise: cross-language comparisons. Anne Cutler, Martin Cooke, María Luisa García Lecumberri, Dennis Pasveer
2007	LSA-based language model adaptation for highly inflected languages. Tanel Alumäe, Toomas Kirt
2007	Landmark-based approach to speech recognition: an alternative to HMMs. Carol Y. Espy-Wilson, Tarun Pruthi, Amit Juneja, Om Deshmukh
2007	Language identification based on n-gram frequency ranking. Ricardo de Córdoba, Luis Fernando D'Haro, Fernando Fernández Martínez, Javier Macías Guarasa, Javier Ferreiros
2007	Language identification of person names using CF-IOF based weighing function. Samuel Thomas, Ashish Verma
2007	Language identification using several sources of information with a multiple-Gaussian classifier. Ricardo de Córdoba, Luis Fernando D'Haro, Fernando Fernández Martínez, Juan Manuel Montero, Roberto Barra-Chicote
2007	Language model adaptation using latent dirichlet allocation and an efficient topic inference algorithm. Aaron Heidel, Hung-An Chang, Lin-Shan Lee
2007	Language modeling for automatic turkish broadcast news transcription. Ebru Arisoy, Hasim Sak, Murat Saraclar
2007	Language modeling using PLSA-based topic HMM. Atsushi Sako, Tetsuya Takiguchi, Yasuo Ariki
2007	Large-scale random forest language models for speech recognition. Yi Su, Frederick Jelinek, Sanjeev Khudanpur
2007	Learning dialogue strategies for interactive database search. Verena Rieser, Oliver Lemon
2007	Learning spoken document similarity and recommendation using supervised probabilistic latent semantic analysis. Kishan Thambiratnam, Frank Seide
2007	Learning the inter-frame distance for discriminative template-based keyword detection. David Grangier, Samy Bengio
2007	Learning tone distinctions for Mandarin Chinese. David Weenink, Guangqin Chen, Zongyan Chen, Stefan de Konink, Dennis Vierkant, Eveline van Hagen, R. J. J. H. van Son
2007	Length, ordering preference and intonational phrasing: evidence from pauses. Gerrit Kentner
2007	Lexicon adaptation with reduced character error (LARCE) - a new direction in Chinese language modeling. Yi-Cheng Pan, Lin-Shan Lee
2007	Line cepstral quefrencies and their use for acoustic inventory coding. Guntram Strecha, Matthias Eichner, Rüdiger Hoffmann
2007	Linear and non linear kernel GMM supervector machines for speaker verification. Réda Dehak, Najim Dehak, Patrick Kenny, Pierre Dumouchel
2007	Linear prediction of audio signals. Toon van Waterschoot, Marc Moonen
2007	Linear transformation approach to VTLN using dynamic frequency warping. D. Rama Sanand, D. Dinesh Kumar, Srinivasan Umesh
2007	Lombard speech impact on perceptual speaker recognition. Ayako Ikeno, John H. L. Hansen
2007	Loquendo - Politecnico di torino's 2006 NIST speaker recognition evaluation system. Claudio Vair, Daniele Colibro, Fabio Castaldo, Emanuele Dalmasso, Pietro Laface
2007	MRASTA and PLP in automatic speech recognition. S. R. Mahadeva Prasanna, Hynek Hermansky
2007	Machine learning for spoken dialogue systems. Oliver Lemon, Olivier Pietquin
2007	Management of static/dynamic properties in a multimodal interaction system. Kouichi Katsurada, Yuji Okuma, Makoto Yano, Yurie Iribe, Tsuneo Nitta
2007	Mandarin vowel pronunciation quality evaluation by using formant pattern recognition. Fuping Pan, Qingwei Zhao, Yonghong Yan
2007	Mel sub-band filtering and compression for robust speech recognition. Babak Nasersharif, Ahmad Akbari, Mohammad Mehdi Homayounpour
2007	Memory efficient modeling of polyphone context with weighted finite-state transducers. Emilian Stoimenov, John W. McDonough
2007	Method of LP-based blind restoration for improving intelligibility of bone-conducted speech. Thang Tat Vu, Germine Seide, Masashi Unoki, Masato Akagi
2007	Minimal pairs and functional loads of sound contrasts obtained from a list of modern greek words. Constandinos Kalimeris, Stelios Bakamidis
2007	Minimum rank error training for language modeling. Meng-Sung Wu, Jen-Tzung Chien
2007	Mobile adaptive CALL (MAC): a lightweight speech-based intervention for mobile language learners. Maria Uther, James Uther, Panos Athanasopoulos, Pushpendra Singh, Reiko Akahane-Yamada
2007	Model-based speech separation with single-microphone input. Siu Wa Lee, Frank K. Soong, Pak-Chung Ching
2007	Model-driven detection of clean speech patches in noise. Jonathan Laidler, Martin Cooke, Neil D. Lawrence
2007	Model-space MLLR for trajectory HMMs. Heiga Zen, Yoshihiko Nankaku, Keiichi Tokuda
2007	Modeling context and language variation for non-native speech recognition. Tien Ping Tan, Laurent Besacier
2007	Modeling incompletion phenomenon in Mandarin dialog prosody. Jian Yu, Lixing Huang, Jianhua Tao, Xia Wang
2007	Modeling the statistical behavior of lexical chains to capture word cohesiveness for automatic story segmentation. Shing-kai Chan, Lei Xie, Helen M. Meng
2007	Modeling tones in hakka on the basis of the command-response model. Wentao Gu, Rerrario Shui-Ching Ho, Tan Lee
2007	Modelling confusion matrices to improve speech recognition accuracy, with an application to dysarthric speech. Santiago Omar Caballero Morales, Stephen J. Cox
2007	Modelling prominence and emphasis improves unit-selection synthesis. Volker Strom, Ani Nenkova, Robert A. J. Clark, Yolanda Vazquez-Alvarez, Jason M. Brenier, Simon King, Dan Jurafsky
2007	Modelling the human-machine gap in speech reception: microscopic speech intelligibility prediction for normal-hearing subjects with an auditory model. Tim Jürgens, Thomas Brand, Birger Kollmeier
2007	More on acoustic correlates of stress. Daan Wissing
2007	Morfessor and variKN machine learning tools for speech and language technology. Vesa Siivola, Mathias Creutz, Mikko Kurimo
2007	Morphological pre-processing technique and its applications on speech signal. Hyun Soo Kim
2007	Morphosyntactic processing of n-best lists for improved recognition and confidence measure computation. Stéphane Huet, Guillaume Gravier, Pascale Sébillot
2007	MuLAS: a framework for automatically building multi-tier corpora. Sérgio Paulo, Luís C. Oliveira
2007	Multi-layer kohonen self-organizing feature map for language identification. Liang Wang, Eliathamby Ambikairajah, Eric H. C. Choi
2007	Multi-modal user authentication from video for mobile or variable-environment applications. Timothy J. Hazen, Daniel Schultz
2007	Multi-resolution soft features for channel-robust distributed speech recognition. Valentin Ion, Reinhold Haeb-Umbach
2007	Multi-step linear prediction based speech dereverberation in noisy reverberant environment. Keisuke Kinoshita, Marc Delcroix, Tomohiro Nakatani, Masato Miyoshi
2007	Multi-stream features combination based on dempster-shafer rule for LVCSR system. Fabio Valente, Jithendra Vepa, Hynek Hermansky
2007	Multiband, multisensor robust features for noisy speech recognition. Dimitrios Dimitriadis, Petros Maragos, Stamatios Lefkimmiatis
2007	Multimodal speech recognition with ultrasonic sensors. Bo Zhu, Timothy J. Hazen, James R. Glass
2007	Mutual information and the speech signal. Mattias Nilsson, W. Bastiaan Kleijn
2007	N-best: the northern- and southern-dutch benchmark evaluation of speech recognition technology. Judith M. Kessens, David A. van Leeuwen
2007	Narrowband to wideband feature expansion for robust multilingual ASR. Dusan Macho
2007	Natural-emotion GMM transformation algorithm for emotional speaker recognition. Zhenyu Shan, Yingchun Yang, Ruizhi Ye
2007	Neighborhood density and neighborhood frequency effects in French spoken word recognition. Sophie Dufour, Ulrich H. Frauenfelder
2007	Nepalese retroflex stops: a static palatography study of inter- and intra-speaker variability. Rajesh Khatiwada
2007	Never-ending learning with dynamic hidden Markov network. Konstantin Markov, Satoshi Nakamura
2007	New algorithm for LPC residual estimation from LSF vectors for a voice conversion system. Winston S. Percybrooks, Elliot Moore
2007	New word acquisition using subword modeling. Ghinwa F. Choueiter, Stephanie Seneff, James R. Glass
2007	Noise reduction based on adaptive β-order generalized spectral subtraction for speech enhancement. Junfeng Li, Shuichi Sakamoto, Satoshi Hongo, Masato Akagi, Yôiti Suzuki
2007	Noise robust front-end processing with voice activity detection based on periodic to aperiodic component ratio. Kentaro Ishizuka, Tomohiro Nakatani, Masakiyo Fujimoto, Noboru Miyazaki
2007	Noise robust speech recognition for voice driven wheelchair. Akira Sasou, Hiroaki Kojima
2007	Noise robust voice activity detection based on switching kalman filter. Masakiyo Fujimoto, Kentaro Ishizuka
2007	Noise suppression based on extending a speech-dominated modulation band. Tiago H. Falk, Svante Stadler, W. Bastiaan Kleijn, Wai-Yip Chan
2007	Noise suppression using search strategy with multi-model compositions. Takatoshi Jitsuhiro, Tomoji Toriyama, Kiyoshi Kogure
2007	Noise tracking for speech systems in adverse environments. Nitish Krishnamurthy, John H. L. Hansen
2007	Noise-robust hands-free voice activity detection with adaptive zero crossing detection using talker direction estimation. Yuki Denda, Takamasa Tanaka, Masato Nakayama, Takanobu Nishiura, Yoichi Yamashita
2007	Non-linear spectral contrast stretching for in-car speech recognition. Weifeng Li, Hervé Bourlard
2007	Normalized two stage SVQ for minimum complexity wide-band LSF quantization. Saikat Chatterjee, Thippur V. Sreenivas
2007	Novel eigenpitch-based prosody model for text-to-speech synthesis. Jilei Tian, Jani Nurminen, Imre Kiss
2007	Novel low-band phase representation for low bit-rate speech coding. Ahmed Ismail, Yasser Dakroury, Hazem M. Abbas
2007	Objective analysis of the effect of memory inclusion on bandwidth extension of narrowband speech. Amr H. Nour-Eldin, Peter Kabal
2007	Objective parameters from videokymographic images: a user-friendly interface. Claudia Manfredi, Leonardo Bocchi, Giovanna Cantarella, Giorgio Peretti, Gabriele Guidi, Vincenzo Mezzatesta
2007	Omnidirectional audio-visual talker localizer with dynamic feature fusion based on validity and reliability criteria. Yuki Denda, Takanobu Nishiura, Yoichi Yamashita
2007	On automatic prominence detection for German. Fabio Tamburini, Petra Wagner
2007	On comparing and combining intra-speaker variability compensation and unsupervised model adaptation in speaker verification. Claudio Garretón, Néstor Becerra Yoma, Fernando Huenupán, Carlos Molina
2007	On filled-pauses and prolongations in european portuguese. Helena Moniz, Ana Isabel Mata, Céu Viana
2007	On optimal estimation of compressed speech for hearing aids. Dirk Mauler, Anil M. Nagathil, Rainer Martin
2007	On organic interfaces. Victor Zue
2007	On the categorical nature of the process involved in schwa elision in French. Audrey Bürki, Cécile Fougeron, Cédric Gendrot
2007	On the equivalence of Gaussian HMM and Gaussian HMM-like hidden conditional random fields. Georg Heigold, Ralf Schlüter, Hermann Ney
2007	On the importance of pure prosody in the perception of speaker identity. Elina Helander, Jani Nurminen
2007	On the jointly unsupervised feature vector normalization and acoustic model compensation for robust speech recognition. Luis Buera, Antonio Miguel, Eduardo Lleida, Oscar Saz, Alfonso Ortega
2007	On the limitations of voice conversion techniques in emotion identification tasks. Roberto Barra-Chicote, Juan Manuel Montero, Javier Macías Guarasa, Juana M. Gutiérrez-Arriola, Javier Ferreiros, José Manuel Pardo
2007	On the role of spectral dynamics in unit selection speech synthesis. Barry Kirkpatrick, Darragh O'Brien, Ronan Scaife, Andrew Errity
2007	On the use of time-delay neural networks for highly accurate classification of stop consonants. Jun Hou, Lawrence R. Rabiner, Sorin Dusan
2007	On web-based creation of speech resources for less-resourced languages. Christoph Draxler
2007	Online call quality monitoring for automating agent-based call centers. Woosung Kim
2007	Online vocabulary adaptation using limited adaptation data. C. E. Liu, Kishan Thambiratnam, Frank Seide
2007	Ontology-based multimodal high level fusion involving natural language analysis for aged people home care application. Olga Vybornova, Monica Gemo, Ronald Moncarey, Benoît Macq
2007	Optimization of temporal filters in the modulation frequency domain for constructing robust features in speech recognition. Jeih-weih Hung
2007	Optimization on decoding graphs by discriminative training. Shiuan-Sung Lin, François Yvon
2007	Optimized one-bit quantization for adapted GMM-based speaker verification. Ivy H. Tseng, Olivier Verscheure, Deepak S. Turaga, Upendra V. Chaudhari
2007	Optimizing sentence segmentation for spoken language translation. Sharath Rao, Ian R. Lane, Tanja Schultz
2007	PCA-based feature extraction for fluctuation in speaking style of articulation disorders. Hironori Matsumasa, Tetsuya Takiguchi, Yasuo Ariki, Ichao Li, Toshitaka Nakabayashi
2007	PLSA-based topic detection in meetings for adaptation of lexicon and language model. Yuya Akita, Yusuke Nemoto, Tatsuya Kawahara
2007	Parameter tuning for fast speech recognition. Thomas Colthurst, Tresi Arvizo, Chia-Lin Kao, Owen Kimball, Stephen A. Lowe, David R. H. Miller, Jim Van Sciver
2007	People watcher: a game for eliciting human-transcribed data for automated directory assistance. Tim Paek, Yun-Cheng Ju, Christopher Meek
2007	Perception and production of word-final alveolar stops by brazilian portuguese learners of English. Melissa Bettoni-Techio, Andréia S. Rauber, Rosana Denise Koerich
2007	Perception of disfluency: language differences and listener bias. Catherine Lai, Kyle Gorman, Jiahong Yuan, Mark Y. Liberman
2007	Perceptual equivalence of approximated Cantonese tone contours. Yujia Li, Tan Lee
2007	Perceptual musical noise reduction using critical bands tonality coefficients and masking thresholds. Anis Ben Aicha, Sofia Ben Jebara
2007	Perceptual relevance of pitch contours of Mandarin tones and its efficacy in prosody generation of speech synthesis. Shi-Han Chen, Chih-Chung Kuo
2007	Perceptual-based playout mechanisms for multi-stream voice over IP networks. Chun-Feng Wu, Cheng-Lung Lee, Wen-Whei Chang
2007	Performance evaluation of HMM-based style classification with a small amount of training data. Makoto Tachibana, Keigo Kawashima, Junichi Yamagishi, Takao Kobayashi
2007	Performance evaluation of glottal quality measures from the perspective of vocal tract filter consistency. Juan F. Torres, Elliot Moore
2007	Performance of speaker-dependent wideband speech coding. Ethan Robert Duni, Bhaskar D. Rao
2007	Phone boundary detection using selective refinements and context-dependent acoustic features. Sirinoot Boonsuk, Proadpran Punyabukkana, Atiwong Suchato
2007	Phone-discriminating minimum classification error (p-MCE) training for phonetic recognition. Qian Qian, Xiaodong He, Li Deng
2007	Phoneme confusions in human and automatic speech recognition. Bernd T. Meyer, Matthias Wächter, Thomas Brand, Birger Kollmeier
2007	Phoneme dependent frame selection preference. Tingyao Wu, Jacques Duchateau, Dirk Van Compernolle
2007	Phonetic based sentence level rewriting of questions typed by dyslexic spellers in an information retrieval context. Laurianne Sitbon, Patrice Bellot, Philippe Blache
2007	Phonetic geminates in cypriot greek: the case of voiceless plosives. Christiana Christodoulou
2007	Phonotactic spoken language identification with limited training data. Marius Peche, Marelie H. Davel, Etienne Barnard
2007	Phrases in category-based language models for Spanish and basque ASR. Raquel Justo, M. Inés Torres
2007	Pitch accent versus lexical stress: quantifying acoustic measures related to the voice source. Yen-Liang Shue, Markus Iseli, Nanette Veilleux, Abeer Alwan
2007	Pitch estimation of noisy speech signals using empirical mode decomposition. Md. Khademul Islam Molla, Keikichi Hirose, Nobuaki Minematsu, Md. Kamrul Hasan
2007	Pitch pattern alternation in goshogawara Japanese: evidence for a prosodic phrase above the domain for downstep. Yosuke Igarashi
2007	Pitch period estimation using multipulse model and wavelet transform. Prasanta Kumar Ghosh, Antonio Ortega, Shrikanth S. Narayanan
2007	PocketSUMMIT: small-footprint continuous speech recognition. I. Lee Hetherington
2007	Podcastle: a web 2.0 approach to speech recognition research. Masataka Goto, Jun Ogata, Kouichirou Eto
2007	Pointing to a target while naming it with /pata/ or /tapa/: the effect of consonants and stress position on jaw-finger coordination. Amélie Rochet-Capellan, Jean-Luc Schwartz, Rafael Laboissière, Arturo Galvàn
2007	Predicting focus through prominence structure. Sasha Calhoun
2007	Predicting the consequences of vocalizations in early infancy. Francisco Lacerda, Lisa Gustavsson
2007	Predicting vowel duration in spontaneous canadian French speech. Darcie Williams, François Poiré
2007	Predictive minimum Bayes risk classification for robust speech recognition. Jen-Tzung Chien, Koichi Shinoda, Sadaoki Furui
2007	Prelexical adjustments to speaker idiosyncrasies: are they position-specific? Alexandra Jesse, James M. McQueen
2007	Preliminary experiments toward automatic generation of new TTS voices from recorded speech alone. Ryuki Tachibana, Tohru Nagano, Gakuto Kurata, Masafumi Nishimura, Noboru Babaguchi
2007	Preventing an external acoustic noise from being misrecognized as a speech recognition object by confirming the lip movement image signal. Soo-Jong Lee, Jun Park, Eung-Kyeu Kim
2007	Probabilistic deduction of symbol mappings for extension of lexicons. Rita Singh, Evandro B. Gouvêa, Bhiksha Raj
2007	Probabilistic latent speaker analysis for large vocabulary speech recognition. Dan Su, Xihong Wu, Huisheng Chi
2007	Processing image and audio information for recognising discourse participation status through features of face and voice. Nick Campbell, Damien Douxchamps
2007	Prosody change and response timing analysis in spontaneously spoken dialogs and their modeling in a spoken dialog system. Ryota Nishimura, Norihide Kitaoka, Seiichi Nakagawa
2007	Prosody, emotions, and... 'whatever'. Stefan Benus, Agustín Gravano, Julia Hirschberg
2007	Prosody-enriched lattices for improved syllable recognition. Sankaranarayanan Ananthakrishnan, Shrikanth S. Narayanan
2007	Punctuating confusion networks for speech translation. Roldano Cattoni, Nicola Bertoldi, Marcello Federico
2007	Pushy versus meek - using avatars to influence turn-taking behaviour. Jens Edlund, Jonas Beskow
2007	Quality assessment of speech enhancement systems by separation of enhanced speech, noise, and echo. Tim Fingscheidt, Suhadi Suhadi
2007	Quasi text-independent speaker-verification based on pattern matching. Michael Gerber, René Beutler, Beat Pfister
2007	RAMCESS/handsketch: a multi-representation framework for realtime and expressive singing synthesis. Nicolas D'Alessandro, Thierry Dutoit
2007	Rapid and accurate spoken term detection. David R. H. Miller, Michael Kleber, Chia-Lin Kao, Owen Kimball, Thomas Colthurst, Stephen A. Lowe, Richard M. Schwartz, Herbert Gish
2007	Rapid speaker adaptation by reference model interpolation. Wen Xuan Teng, Guillaume Gravier, Frédéric Bimbot, Frédéric Soufflet
2007	Rapid unsupervised speaker adaptation using single utterance based on MLLR and speaker selection. Randy Gomez, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano
2007	Realisations and alternations in German /r/-realisation. Christiane Ulbrich, Horst Ulbrich
2007	Recent progress in the MIT spoken lecture processing project. James R. Glass, Timothy J. Hazen, D. Scott Cyphers, Igor Malioutov, David Huynh, Regina Barzilay
2007	Recognition of foreign names spoken by native speakers. Frederik Stouten, Jean-Pierre Martens
2007	Reconstructing audio signals from modified non-coherent hilbert envelopes. Joachim Thiemann, Peter Kabal
2007	Recovering punctuation marks for automatic speech recognition. Fernando Batista, Diamantino Caseiro, Nuno J. Mamede, Isabel Trancoso
2007	Reducing recognition error rate based on context relationships among dialogue turns. Hsu-Chih Wu, Stephanie Seneff
2007	Regularized feature-based maximum likelihood linear regression for speech recognition. Mohamed Kamal Omar
2007	Relative evaluation of informativeness in machine generated summaries. BalaKrishna Kolluru, Yoshihiko Gotoh
2007	Resources for new research directions in speaker recognition: the mixer 3, 4 and 5 corpora. Christopher Cieri, Linda Corson, David Graff, Kevin Walker
2007	Rhotic variation and schwa epenthesis in windsor French. Ivan Chow, François Poiré
2007	Rigid vs non-rigid face and head motion in phone and tone perception. Denis Burnham, Jessica Reynolds, Guillaume Vignali, Sandra Bollwerk, Caroline Jones
2007	Robust F0 modeling for Mandarin speech recognition in noise. Sheng Qiang, Yao Qian, Frank K. Soong, Congfu Xu
2007	Robust and high-resolution voiced/unvoiced classification in noisy speech using a signal smoothness criterion. A. Sreenivasa Murthy, S. Chandra Sekhar, Thippur V. Sreenivas
2007	Robust distributed speech recognition using histogram equalization and correlation information. Pedro M. Martinez, José C. Segura, Luz García
2007	Robust location understanding in spoken dialog systems using intersections. Michael L. Seltzer, Yun-Cheng Ju, Ivan Tashev, Alex Acero
2007	Robust voice activity detection based on adaptive sub-band energy sequence analysis and harmonic detection. Yanmeng Guo, Qian Qian, Yonghong Yan
2007	Robust voice activity detection for narrow-bandwidth speaker verification under adverse environments. Tuan Van Pham, Michael Neffe, Gernot Kubin
2007	Robustness of long time measures of fundamental frequency. Jonas Lindh, Anders Eriksson
2007	Robustness of several kernel-based fast adaptation methods on noisy LVCSR. Brian Kan-Wing Mak, Roger Wend-Huu Hsiao
2007	Russian vowels system acoustic features development in ontogenesis. Elena E. Lyakso, Olga V. Frolova
2007	SPICE: web-based tools for rapid language adaptation in speech processing systems. Tanja Schultz, Alan W. Black, Sameer Badaskar, Matthew Hornyak, John Kominek
2007	Score distribution scaling for speaker recognition. Vinod Prakash, John H. L. Hansen
2007	Score fusion for articulatory feature detection. Brian M. Ore, Raymond E. Slyh
2007	Segment deletion in spontaneous speech: a corpus study using mixed effects models with crossed random effects. Christophe Van Bael, R. Harald Baayen, Helmer Strik
2007	Segmentation of speech: child's play? Odette Scharenborg, Mirjam Ernestus, Vincent Wan
2007	Selecting on-topic sentences from natural language corpora. Michael Levit, Elizabeth Boschee, Marjorie Freedman
2007	Selection of optimal dimensionality reduction method using chernoff bound for segmental unit input HMM. Makoto Sakai, Norihide Kitaoka, Seiichi Nakagawa
2007	Self-organization in the evolution of shared systems of speech sounds: a computational study. Pierre-Yves Oudeyer
2007	Semi-supervised learning of speech sounds. Aren Jansen, Partha Niyogi
2007	Sentence level intelligibility evaluation for Mandarin text-to-speech systems using semantically unpredictable sentences. Jian Li, Dmitry Sityaev, Jie Hao
2007	Single channel speech separation using maximum a posteriori estimation. Mohammad H. Radfar, Richard M. Dansereau
2007	Singleton and geminate stops in Finnish - acoustic correlates. Christopher S. Doty, Kaori Idemaru, Susan G. Guion
2007	Smooth soft mel-spectrographic masks based on blind sparse source separation. Marco Kühne, Roberto Togneri, Sven Nordholm
2007	Soft margin feature extraction for automatic speech recognition. Jinyu Li, Chin-Hui Lee
2007	Some evidence on the phonetics and phonology of prosodic phrasing in Russian. Irina Nesterenko, Pavel A. Skrelin
2007	Sparse Gaussian graphical models for speech recognition. Peter Bell, Simon King
2007	Speaker adaptation of language models for automatic dialog act segmentation of meetings. Jáchym Kolár, Yang Liu, Elizabeth Shriberg
2007	Speaker adaptive training for one-to-many eigenvoice conversion based on Gaussian mixture model. Yamato Ohtani, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano
2007	Speaker clustering using direct maximization of a BIC-based score. Wei-Ho Tsai
2007	Speaker diarization using normalized cross likelihood ratio. Viet Bac Le, Odile Mella, Dominique Fohr
2007	Speaker recognition by combining MFCC and phase information. Seiichi Nakagawa, Kouhei Asakawa, Longbiao Wang
2007	Speaker recognition using kernel-PCA and intersession variability modeling. Hagai Aronowitz
2007	Speaker role based structural classification of broadcast news stories. BalaKrishna Kolluru, Yoshihiko Gotoh
2007	Speaker verification with multiple classifier fusion using Bayes based confidence measure. Fernando Huenupán, Néstor Becerra Yoma, Carlos Molina, Claudio Garretón
2007	Speaking rate effects in a landmark-based phonetic exemplar model. Travis Wade, Bernd Möbius
2007	Speaking through a noisy channel - experiments on inducing clarification behaviour in human-human dialogue. David Schlangen, Raquel Fernández
2007	Spectro-temporal analysis of speech using 2-d Gabor filters. Tony Ezzat, Jake V. Bouvrie, Tomaso A. Poggio
2007	Spectro-temporal processing for blind estimation of reverberation time and single-ended quality measurement of reverberant speech. Tiago H. Falk, Hua Yuan, Wai-Yip Chan
2007	Speech based drug information system for aged and visually impaired persons. Géza Németh, Gábor Olaszy, Mátyás Bartalis, Géza Kiss, Csaba Zainkó, Péter Mihajlik
2007	Speech coding and information processing by auditory neurons. Huan Wang, Werner Hemmert
2007	Speech enhancement using PCA and variance of the reconstruction error model identification. Amin Haji Abolhassani, Sid-Ahmed Selouani, Douglas D. O'Shaughnessy, Mohamed Faouzi Harkat
2007	Speech enhancement using multi-reference noise reduction in a vehicle environment. Abderrahman Essebbar, Tristan Poinsard
2007	Speech enhancement with improved a posteriori SNR computation. Suhadi Suhadi, Tim Fingscheidt
2007	Speech feature compensation based on pseudo stereo codebooks for robust speech recognition in additive noise environments. Tsung-hsueh Hsieh, Jeih-weih Hung
2007	Speech fundamental frequency estimation using the alternate comb. Jean-Sylvain Liénard, François Signol, Claude Barras
2007	Speech mining in noisy audio message corpus. Nathalie Camelin, Frédéric Béchet, Géraldine Damnati, Renato De Mori
2007	Speech perception in children with speech sound disorder. H. Timothy Bunnell, N. Carolyn Schanen, Linda D. Vallino, Thierry G. Morlet, James B. Polikoff, Jennette D. Driscoll, James T. Mantell
2007	Speech quality after major surgery of the oral cavity and oropharynx with microvascular soft tissue reconstruction. Irma Verdonck-de Leeuw, Louis ten Bosch, Li Ying Chao, Rico N. P. M. Rinkel, Pepijn A. Borggreven, Lou Boves, C. René Leemans
2007	Speech quality estimation using packet loss effects in CELP-type speech coders. Min-Ki Lee, Kyung-Tae Kim, Hong-Goo Kang, Dae Hee Youn
2007	Speech recognition techniques for a sign language recognition system. Philippe Dreuw, David Rybach, Thomas Deselaers, Morteza Zahedi, Hermann Ney
2007	Speech recognition with factorial-HMM syllabic acoustic models. Gianpaolo Coro, Francesco Cutugno, Fulvio Caropreso
2007	Speech recognition with state-based nearest neighbour classifiers. Thomas Deselaers, Georg Heigold, Hermann Ney
2007	Speech reinforcement based on partial specific loudness. Jong Won Shin, Woohyung Lim, June Sig Sung, Nam Soo Kim
2007	Speech synthesis enhancement in noisy environments. Davide Bonardo, Enrico Zovato
2007	Speech to chant transformation with the phase vocoder. Axel Röbel, Joshua Fineberg
2007	Speech-based annotation and retrieval of digital photographs. Timothy J. Hazen, Brennan Sherry, Mark Adler
2007	Speech-nonspeech discrimination using the information bottleneck method and spectro-temporal modulation index. Maria E. Markaki, Michael Wohlmayr, Yannis Stylianou
2007	Speechindexer in action: managing endangered Formosan languages. Jozsef Szakos, Ulrike Glavitsch
2007	Speeding-up neural network training using sentence and frame selection. Stefano Scanzio, Pietro Laface, Roberto Gemello, Franco Mana
2007	Spoken language identification using score vector modeling and support vector machine. Ming Li, Hongbin Suo, Xiao Wu, Ping Lu, Yonghong Yan
2007	Spoken word recognition of Chinese homophones: a further investigation. Michael C. W. Yip
2007	Spontaneous speech synthesis by pronunciation variant selection - a comparison to natural speech. Steffen Werner, Rüdiger Hoffmann
2007	Stabilised weighted linear prediction - a robust all-pole method for speech processing. Carlo Magi, Tom Bäckström, Paavo Alku
2007	Statistical identification of critical, dependent and redundant articulators. Veena D. Singampalli, Philip J. B. Jackson
2007	Statistical vowelization of Arabic text for speech synthesis in speech-to-speech translation systems. Liang Gu, Wei Zhang, Lazkin Tahir, Yuqing Gao
2007	String and lattice based discriminative training for the corpus of spontaneous Japanese lecture transcription task. Erik McDermott, Atsushi Nakamura
2007	Structural Bayesian language modeling and adaptation. Sibel Yaman, Jen-Tzung Chien, Chin-Hui Lee
2007	Structural assessment of language learners' pronunciation. Nobuaki Minematsu, K. Kamata, Satoshi Asakawa, Takehiko Makino, Tazuko Nishimura, Keikichi Hirose
2007	Structure-based and template-based automatic speech recognition - comparing parametric and non-parametric approaches. Li Deng, Helmer Strik
2007	Study on speaker verification with non-audible murmur segments. Hideki Okamoto, Mariko Kojima, Tomoko Matsui, Hiromichi Kawanami, Hiroshi Saruwatari, Kiyohiro Shikano
2007	Style estimation of speech based on multiple regression hidden semi-Markov model. Takashi Nose, Yoichi Kato, Takao Kobayashi
2007	Subword-based position specific posterior lattices (s-PSPL) for indexing speech information. Yi-Cheng Pan, Hung-Lin Chang, Berlin Chen, Lin-Shan Lee
2007	Support vector regression for speaker verification. Ignacio López-Moreno, Ismael Mateos-Garcia, Daniel Ramos, Joaquin Gonzalez-Rodriguez
2007	Suprasegmental aspects of pre-lexical speech in cochlear implanted children. Øydis Hide, Steven Gillis, Paul Govaerts
2007	Syllable lattices as a basis for a children's speech reading tracker. Daniel Bolaños, Wayne H. Ward, Sarel van Vuuren, Javier Garrido Salas
2007	Syllable timing patterns in Polish: results from annotation mining. Dafydd Gibbon, Jolanta Bachan, Grazyna Demenko
2007	Synthesis of prosodic attitudinal variants in German backchannel ja. Thorsten Stocksmeier, Stefan Kopp, Dafydd Gibbon
2007	System request detection in conversation based on acoustic and speaker alternation features. Tomoyuki Yamagata, Atsushi Sako, Tetsuya Takiguchi, Yasuo Ariki
2007	Tagging syllable boundaries with joint n-gram models. Helmut Schmid, Bernd Möbius, Julia Weidenkaff
2007	Temporal alignment of creaky voice in neutralised realisations of an underlying, post-nasal voicing contrast in German. Tina John, Jonathan Harrington
2007	Temporal downtrends in Czech read speech. Jan Volín, Radek Skarnitzl
2007	Temporal episodic memory model: an evolution of minerva2. Viktoria Maier, Roger K. Moore
2007	Temporal masking for unsupervised minimum Bayes risk speaker adaptation. Matthew Gibson, Thomas Hain
2007	Testing the relevance of speech rate, pitch and a glottal Chink for the perception of age in synthesized speech using formant synthesis. Ralf Winkler
2007	Text island spotting in large speech databases. Benjamin Lecouteux, Georges Linarès, Frédéric Beaugendre, Pascal Nocera
2007	The BBN 2007 displayless English/iraqi speech-to-speech translation system. David Stallard, Fred Choi, Chia-Lin Kao, Kriste Krstovski, Premkumar Natarajan, Rohit Prasad, Shirin Saleem, Krishna Subramanian
2007	The IRST English-Spanish translation system for european parliament speeches. Daniele Falavigna, Nicola Bertoldi, Fabio Brugnara, Roldano Cattoni, Mauro Cettolo, Boxing Chen, Marcello Federico, Diego Giuliani, Roberto Gretter, Deepa Gupta, Dino Seppi
2007	The ISL 2007 English speech transcription system for european parliament speeches. Sebastian Stüker, Christian Fügen, Florian Kraft, Matthias Wölfel
2007	The RWTH 2007 TC-STAR evaluation system for european English and Spanish. Jonas Lööf, Christian Gollan, Stefan Hahn, Georg Heigold, Björn Hoffmeister, Christian Plahl, David Rybach, Ralf Schlüter, Hermann Ney
2007	The SRI/OGI 2006 spoken term detection system. Dimitra Vergyri, Izhak Shafran, Andreas Stolcke, Venkata Ramana Rao Gadde, Murat Akbacak, Brian Roark, Wen Wang
2007	The blame game: performance analysis of speaker diarization system components. Marijn Huijbregts, Chuck Wooters
2007	The buckeye corpus of speech: updates and enhancements. Eric Fosler-Lussier, Laura Dilley, Na'im R. Tyson, Mark A. Pitt
2007	The developmental analysis of demonstrative expression skills utilizing a multimodal infant behavior corpus. Shinya Kiriyama, Ryo Tsuji, Tomohiko Kasami, Shogo Ishikawa, Naofumi Otani, Hiroaki Horiuchi, Yoichi Takebayashi, Shigeyoshi Kitazawa
2007	The duration of speech pauses in a multilingual environment. Mike Demol, Werner Verhelst, Piet Verhoeve
2007	The effect of filled pauses in a lecture speech on impressive evaluation of listeners. Hiromitsu Nishizaki, Mitsuhiro Somiya, Kenji Kobayashi, Yoshihiro Sekiguchi
2007	The effect of highband harmonic structure in the artificial bandwidth expansion of telephone speech. Hannu Pulakka, Paavo Alku, Laura Laaksonen, Päivi Valve
2007	The effect of speech interface accuracy on driving performance. Andrew L. Kun, Tim Paek, Zeljko Medenica
2007	The effect of the additivity assumption on time and frequency domain wiener filtering for speech enhancement. Kamil K. Wójcicki, Stephen So, Kuldip K. Paliwal
2007	The harming part of room acoustics in automatic speech recognition. Rico Petrick, Kevin Lohde, Matthias Wolff, Rüdiger Hoffmann
2007	The harmonic model codec (HMC) framework for voIP. Yannis Agiomyrgiannakis, Yannis Stylianou
2007	The influence of masking words on the prediction of TRPs in a shadowed dialog. Wieneke Wesseling, R. J. J. H. van Son, Louis C. W. Pols
2007	The influence of speech activity detection and overlap on speaker diarization for meeting room recordings. Corinne Fredouille, Nicholas W. D. Evans
2007	The influence of user tailoring and cognitive load on user performance in spoken dialogue systems. Andi Winterboer, Jiang Hu, Johanna D. Moore, Clifford Nass
2007	The influence of utterance chunking on machine translation performance. Christian Fügen, Muntsin Kolss
2007	The influence of vowel quality features on peak alignment. Matthias Jilka, Bernd Möbius
2007	The intelligibility and its relations to acoustic characteristics of English /s/ and /esh/ produced by native speakers of Japanese. Akiyo Joto, Yoshiki Nagase, Seiya Funatsu
2007	The limits of multidimensional category learning. Martijn Goudbeek, Daniel Swingley, Keith R. Kluender
2007	The neural basis of speech perception - a view from functional imaging. Sophie K. Scott
2007	The neutral tone in question intonation in Mandarin. Fang Liu, Yi Xu
2007	The phonetic exponency of phrasal accentuation in French and German. William J. Barry, Bistra Andreeva, Ingmar Steiner
2007	The phonetics and phonology of high and low tones in two falling f0-contours in standard German. Tamara Rathcke, Jonathan Harrington
2007	The relationship between the perception and production of English nasal codas by brazilian learners of English. Denise Cristina Kluge, Andréia S. Rauber, Mara Silvia Reis, Ricardo Augusto Hoffmann Bion
2007	The relevance of feature type for the automatic classification of emotional user states: low level descriptors and functionals. Björn W. Schuller, Anton Batliner, Dino Seppi, Stefan Steidl, Thurid Vogt, Johannes Wagner, Laurence Devillers, Laurence Vidrascu, Noam Amir, Loïc Kessous, Vered Aharonson
2007	The role of intonation and voice quality in the affective speech perception. Ioulia Grichkovtsova, Anne Lacheret, Michel Morel
2007	The role of metrical stress in comprehension and production in dutch children at-risk of dyslexia. Petra van Alphen, Elise de Bree, Paula Fikkert, Frank Wijnen
2007	The role of outer hair cell function in the perception of synthetic versus natural speech. Maria K. Wolters, Pauline Campbell, Christine DePlacido, Amy Liddell, David Owens
2007	The virtual guide: a direction giving embodied conversational agent. Mariët Theune, Dennis Hofs, Marco van Kessel
2007	The voice-rate dialog system for consumer ratings. Geoffrey Zweig, Patrick Nguyen, Yun-Cheng Ju, Ye-Yi Wang, Dong Yu, Alex Acero
2007	The voiceTRAN machine translation system. Jerneja Zganec-Gros, Stanislav Gruden
2007	Thinking outside the cube: modeling language processing tasks in a multiple resource paradigm. Kilian G. Seeber
2007	Time-compressed speech perception with speech and noise maskers. Douglas Brungart, Nandini Iyer
2007	Time-domain blind audio source separation using advanced ICA methods. Zbynek Koldovský, Petr Tichavský
2007	Time-varying pre-emphasis and inverse filtering of speech. Karl Schnell, Arild Lacroix
2007	Time-warping and re-phasing in packet loss concealment. Robert Zopf, Jes Thyssen, Juin-Hwey Chen
2007	Tone production by the speakers of different age-and-gender groups. Wai-Sum Lee
2007	Top-down effects on compensation for coarticulation are not replicable. Holger Mitterer
2007	Topic estimation with domain extensibility for guiding user's out-of-grammar utterances in multi-domain spoken dialogue systems. Satoshi Ikeda, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno
2007	Topic in dialogue: prosodic and syntactic features. Claudia Crocco, Renata Savy
2007	Towards better language modeling for Thai LVCSR. Markpong Jongtaveesataporn, Issara Thienlikit, Chai Wutiwiwatchai, Sadaoki Furui
2007	Towards online speech summarization. Gabriel Murray, Steve Renals
2007	Trainable speaker diarization. Hagai Aronowitz
2007	Translating conversational speech to standard linguistic form. Darren Scott Appling, Nick Campbell
2007	Two-stage system for robust neutral/lombard speech recognition. Hynek Boril, Petr Fousek, Harald Höge
2007	Two-stream emotion recognition for call center monitoring. Purnima Gupta, Nitendra Rajput
2007	Unsupervised HMM classification of F0 curves. Damien Lolive, Nelly Barbot, Olivier Boëffard
2007	Unsupervised categorisation approaches for technical support automated agents. Amparo Albalate, Dimitar Dimitrov, Roberto Pieraccini
2007	Unsupervised re-scoring of observation probability in viterbi based on reinforcement learning by using confidence measure and HMM neighborhood. Carlos Molina, Néstor Becerra Yoma, Fernando Huenupán, Claudio Garretón
2007	Unsupervised training of adaptation rate using q-learning in large vocabulary continuous speech recognition. Masafumi Nishida, Yasuo Horiuchi, Akira Ichikawa
2007	Unsupervised training with directed manual transcription for recognising Mandarin broadcast audio. Kai Yu, Mark J. F. Gales, Philip C. Woodland
2007	Use of lexical and affective prosodic cues to emotion by younger and older adults. Kate Dupuis, Kathleen Pichora-Fuller
2007	Use of syllable center detection for improved duration modeling in Chinese Mandarin connected digits recognition. Sergey Astrov, Joachim Hofer, Harald Höge
2007	Using a small development set to build a robust dialectal Chinese speech recognizer. Linquan Liu, Thomas Fang Zheng, Makoto Akabane, Ruxin Chen, Wenhu Wu
2007	Using direction of arrival estimate and acoustic feature information in speaker diarization. Chin-Wei Eugene Koh, Hanwu Sun, Tin Lay Nwe, Trung Hieu Nguyen, Bin Ma, Engsiong Chng, Haizhou Li, Susanto Rahardja
2007	Using eye movements for online evaluation of speech synthesis. Charlotte van Hooijdonk, Edwin Commandeur, Reinier Cozijn, Emiel Krahmer, Erwin Marsi
2007	Using information state to improve dialogue move identification in a spoken dialogue system. Hua Ai, Antonio Roque, Anton Leuski, David R. Traum
2007	Using inter-lingual triggers for machine translation. Caroline Lavecchia, Kamel Smaïli, David Langlois, Jean Paul Haton
2007	Using multiple strategies to manage spoken dialogue. Shiu-Wah Chu, Ian M. O'Neill, Philip Hanna
2007	Using neutral speech models for emotional speech analysis. Carlos Busso, Sungbok Lee, Shrikanth S. Narayanan
2007	Using phonetic features in unsupervised word decompounding for ASR with application to a less-represented language. Thomas Pellegrini, Lori Lamel
2007	Using prosodic and spectral characteristics for sleepiness detection. Jarek Krajewski, Bernd J. Kröger
2007	Using speech rhythm for acoustic language identification. Ekaterina Timoshenko, Harald Höge
2007	Using waveform matching techniques in the measurement of shimmer in voiced signals. Carlos A. Ferrer-Riesgo, María Esperanza Hernández-Díaz, Eduardo González-Moreira
2007	Utilizing online content as domain knowledge in a multi-domain dynamic dialogue system. Craig Wootton, Michael F. McTear, Terry Anderson
2007	Utterance-final glottalization as a cue for familiar speaker recognition. Tamás Böhm, Stefanie Shattuck-Hufnagel
2007	VOCALOID - commercial singing synthesizer based on sample concatenation. Hideki Kenmochi, Hayato Ohshita
2007	VZ-norm: an extension of z-norm to the multivariate case for anchor model based speaker verification. Delphine Charlet, Mikaël Collet, Frédéric Bimbot
2007	Varying input segmentation for story boundary detection in English, Arabic and Mandarin broadcast news. Andrew Rosenberg, Mehrbod Sharifi, Julia Hirschberg
2007	Vector-quantization based mask estimation for missing data automatic speech recognition. Maarten Van Segbroeck, Hugo Van hamme
2007	Virtual fusion for speaker recognition. Yosef A. Solewicz, Moshe Koppel
2007	Visual analysis of lip coarticulation in VCV utterances. Aseel Turkmani, Adrian Hilton, Philip J. B. Jackson, James D. Edge
2007	Visual information and redundancy conveyed by internal articulator dynamics in synthetic audiovisual speech. Katja Grauwinkel, Britta Dewitt, Sascha Fagel
2007	Visualizing acoustic similarities between emotions in speech: an acoustic map of emotions. Khiet P. Truong, David A. van Leeuwen
2007	Vocabulary selection for a broadcast news transcription system using a morpho-syntactic approach. Ciro Martins, António J. S. Teixeira, João Paulo Neto
2007	Vocal conversion from speaking voice to singing voice using STRAIGHT. Takeshi Saitou, Masataka Goto, Masashi Unoki, Masato Akagi
2007	Vocal tract and area function estimation with both lip and glottal losses. Kaustubh Kalgaonkar, Mark A. Clements
2007	Vocal tract length during speech production. Sorin Dusan
2007	Voice activated powered wheelchair with non-voice rejection algorithm. Soo-Young Suk, Hiroaki Kojima
2007	Voice activity detection based on support vector machine using effective feature vectors. Q-Haing Jo, Yun-Sik Park, Kye-Hwan Lee, Ji-Hyun Song, Joon-Hyuk Chang
2007	Voice activity detection in degraded speech using excitation source information. K. Sri Rama Murty, B. Yegnanarayana, Sunitha Guruprasad
2007	Voice activity detection using the phase vector in microphone array. Gibak Kim, Nam Ik Cho
2007	Voice fatigue and use of speech recognition: a study of voice quality ratings. Christel G. de Bruijn, Sandra P. Whiteside
2007	Voice source and vocal tract variations as cues to emotional states perceived from expressive conversational speech. Hiroki Mori, Hideki Kasuya
2007	Voicepedia: towards speech-based access to unstructured information. J. Sherwani, Dong Yu, Tim Paek, Mary Czerwinski, Yun-Cheng Ju, Alex Acero
2007	Voicing level control with application in voice conversion. Jani Nurminen, Jilei Tian, Victor Popa
2007	Voicing-based codebook in low-rate wideband CELP coding. Driss Guerchi, Tamer Rabie, Abdelrhani Louzi
2007	Vowel production in two occlusal classes. André Araújo, Luis M. T. Jesus, Isabel M. Costa
2007	Vowels and tones in infant directed speech: hyperarticulation for both, but different developmental patterns. Nan Xu, Denis Burnham, Christine Kitamura
2007	Wavelet-based front-end for electromyographic speech recognition. Michael Wand, Szu-Chen Stan Jou, Tanja Schultz
2007	Web-based language modelling for automatic lecture transcription. Cosmin Munteanu, Gerald Penn, Ronald Baecker
2007	Weighted frequency warping for voice conversion. Daniel Erro, Asunción Moreno
2007	What do listeners attend to in hearing prosodic structures? investigating the human speech-parser using short-term recall. Annie C. Gilbert, Victor J. Boucher
2007	Women's vocal aging: a longitudinal approach. Markus Brckl
2007	Word confusability - measuring hidden Markov model similarity. Jia-Yu Chen, Peder A. Olsen, John R. Hershey
2007	Word duration modeling for word graph rescoring in LVCSR. Dino Seppi, Daniele Falavigna, Georg Stemmer, Roberto Gretter
2007	Word stress correlates in spontaneous child-directed speech in German. Katrin Schneider, Bernd Möbius
2007	Word-conditioned HMM supervectors for speaker recognition. Howard Lei, Nikki Mirghafori
2007	Zero-crossing-based ratio masking for sound segregation. Sung Jun An, Young-Ik Kim, Rhee Man Kil
2007	fMPE-MAP: improved discriminative adaptation for modeling new domains. Jing Zheng, Andreas Stolcke
2007	ugloss: a framework for improving spoken language generation understandability. Brian Langner, Alan W. Black