| 2007 | "polyaural" array processing for automatic speech recognition in degraded environments. Richard M. Stern, Evandro B. Gouvêa, Govindarajan Thattai |
| 2007 | 8th Annual Conference of the International Speech Communication Association, INTERSPEECH 2007, Antwerp, Belgium, August 27-31, 2007 |
| 2007 | A Bayesian network classifier for word-level reading assessment. Joseph Tepperman, Matthew Black, Patti Price, Sungbok Lee, Abe Kazemzadeh, Matteo Gerosa, Margaret Heritage, Abeer Alwan, Shrikanth S. Narayanan |
| 2007 | A GMM-based probabilistic sequence kernel for speaker verification. Kong-Aik Lee, Changhuai You, Haizhou Li, Tomi Kinnunen |
| 2007 | A HMM recognition of consonant-vowel syllables from lip contours: the cued speech case. Noureddine Aboutabit, Denis Beautemps, Jeanne Clarke, Laurent Besacier |
| 2007 | A MAP based approach to adaptive speech intelligibility measurements. Trym Holter, Svein Srsdal |
| 2007 | A comparative evaluation of the zeros of z transform representation for voice source estimation. Nicolas Sturmel, Christophe d'Alessandro, Boris Doval |
| 2007 | A comparative study of speech rate estimation techniques. Tomas Dekens, Mike Demol, Werner Verhelst, Piet Verhoeve |
| 2007 | A comparative study on speech summarization of broadcast news and lecture speech. Jian Zhang, Ricky Ho Yin Chan, Pascale Fung, Lu Cao |
| 2007 | A comparison of acoustic features for articulatory inversion. Chao Qin, Miguel Á. Carreira-Perpiñán |
| 2007 | A comparison of estimated and MAP-predicted formants and fundamental frequencies with a speech reconstruction application. Jonathan Darch, Ben Milner |
| 2007 | A comparison of session variability compensation techniques for SVM-based speaker recognition. Mitchell McLaren, Robbie Vogt, Brendan Baker, Sridha Sridharan |
| 2007 | A comparison of speaker clustering and speech recognition techniques for air situational awareness. Wade Shen, Douglas A. Reynolds |
| 2007 | A computational model for unsupervised word discovery. Louis ten Bosch, Bert Cranen |
| 2007 | A conservative aggressive subspace tracker. Koby Crammer |
| 2007 | A corpus study of the 3 Yiya Chen, Jiahong Yuan |
| 2007 | A data visualization and analysis method for natural language call routing system design. Hong-Kwang Jeff Kuo, Vaibhava Goel |
| 2007 | A fast fuzzy keyword spotting algorithm based on syllable confusion network. Jian Shao, Qingwei Zhao, Pengyuan Zhang, Zhaojie Liu, Yonghong Yan |
| 2007 | A fast optimization method for large margin estimation of HMMs based on second order cone programming. Yan Yin, Hui Jiang |
| 2007 | A fine pitch model for speech. Jasha Droppo, Alex Acero |
| 2007 | A flexible spectral modification method based on temporal decomposition and Gaussian mixture model. Binh Phu Nguyen, Masato Akagi |
| 2007 | A four-cube FEM model of the extrinsic and intrinsic tongue muscles to simulate the production of vowel /i/. Sayoko Takano, Hiroki Matsuzaki, Kunitoshi Motoki |
| 2007 | A framework of reply speech generation for concept-to-speech conversion in spoken dialogue systems. Seiya Takada, Yuji Yagi, Keikichi Hirose, Nobuaki Minematsu |
| 2007 | A generic methodology of converting transliterated text to phonetic strings case study: greeklish. Nikos Tsourakis, Vassilios Digalakis |
| 2007 | A learning method for Thai phonetization of English words. Ausdang Thangthai, Chai Wutiwiwatchai, Anocha Rugchatjaroen, Sittipong Saychum |
| 2007 | A method for evaluating task-oriented spoken dialog translation systems based on communication efficiency. Toshiyuki Takezawa, Masahide Mizushima, Tohru Shimizu, Gen-ichiro Kikui |
| 2007 | A methodology for the automatic detection of perceived prominent syllables in spoken French. Jean-Philippe Goldman, Mathieu Avanzi, Anne-Catherine Simon, Anne Lacheret, Antoine Auchlin |
| 2007 | A model of glottal flow incorporating viscous-inviscid interaction. Tokihiko Kaburagi, Yosuke Tanabe |
| 2007 | A model-based estimation of phonotactic language verification performance. Kakeung Wong, Man-Hung Siu, Brian Mak |
| 2007 | A morpho-graphemic approach for the recognition of spontaneous speech in agglutinative languages - like Hungarian. Péter Mihajlik, Tibor Fegyó, Zoltán Tüske, Pavel Ircing |
| 2007 | A multiple-model based framework for automatic speech segmentation. Seung Seop Park, Jong Won Shin, Jong Kyu Kim, Nam Soo Kim |
| 2007 | A multitask learning perspective on acoustic-articulatory inversion. Korin Richmond |
| 2007 | A new approach for phoneme segmentation of speech signals. Ladan Golipour, Douglas D. O'Shaughnessy |
| 2007 | A new kernel for SVM MLLR based speaker recognition. Zahi N. Karam, William M. Campbell |
| 2007 | A novel 2kb/s waveform interpolation speech coder based on non-negative matrix factorization. Peng Zhang, Changchun Bao |
| 2007 | A novel energy distribution comparison approach for robust speech spectrum vector quantization. Ahmed Ismail, Yasser Dakroury, Hazem M. Abbas |
| 2007 | A packetization and variable bitrate interframe compression scheme for vector quantizer-based distributed speech recognition. Bengt J. Borgström, Abeer Alwan |
| 2007 | A pair-based language model for the robust lexical analysis in Chinese text-to-speech synthesis. Wu Liu, Dezhi Huang, Yuan Dong, Xinnian Mao, Haila Wang |
| 2007 | A paradigm for mobile speech-centric services. Lars Bo Larsen, Kasper Løvborg Jensen, Søren Larsen, Morten Højfeldt Rasmussen |
| 2007 | A phonetic concatenative approach of labial coarticulation. Vincent Robert, Yves Laprie, Anne Bonneau |
| 2007 | A phonetic search approach to the 2006 NIST spoken term detection evaluation. Roy Wallace, Robbie Vogt, Sridha Sridharan |
| 2007 | A pitch extraction system based on phase locked loops and consensus decision. Patricia A. Pelle, Claudio Estienne |
| 2007 | A portable record player for wax cylinders using a laser-beam reflection method. Tohru Ifukube, Yasuyuki Shimizu |
| 2007 | A preselection method based on cost degradation from the optimal sequence for concatenative speech synthesis. Nobuyuki Nishizawa, Hisashi Kawai |
| 2007 | A reference model weighting-based method for robust speech recognition. Yuan-Fu Liao, Yh-Her Yang, Chi-Hui Hsu, Cheng-Chang Lee, Jing-Teng Zeng |
| 2007 | A robust mel-scale subband voice activity detector for a car platform. Agustín Álvarez-Marquina, Rafael Martínez, Pedro Gómez, Victor Nieto Lluis, V. Rodellar |
| 2007 | A robust multi-phase pitch-mark detection algorithm. Milan Legát, Jindrich Matousek, Daniel Tihelka |
| 2007 | A robust stopping criterion for agglomerative hierarchical clustering in a speaker diarization system. Kyu Jeong Han, Shrikanth S. Narayanan |
| 2007 | A rule-based speech morphing for verifying a expressive speech perception model. Chun-Fang Huang, Masato Akagi |
| 2007 | A saliency-based auditory attention model with applications to unsupervised prominent syllable detection in speech. Ozlem Kalinli, Shrikanth S. Narayanan |
| 2007 | A semi-automatic approach for speaker mining of tapped telephone conversations. Sandeep Manocha, Carol Y. Espy-Wilson |
| 2007 | A semi-supervised learning approach for morpheme segmentation for an Arabic dialect. Mei Yang, Jing Zheng, Andreas Kathol |
| 2007 | A semi-supervised method for efficient construction of statistical spoken language understanding resources. Seokhwan Kim, Minwoo Jeong, Gary Geunbae Lee |
| 2007 | A smoothing kernel for spatially related features and its application to speaker verification. Luciana Ferrer, M. Kemal Sönmez, Elizabeth Shriberg |
| 2007 | A soft-clustering algorithm for automatic induction of semantic classes. Elias Iosif, Alexandros Potamianos |
| 2007 | A speech rate related lip movement model for speech animation. Wei Zhou, Zengfu Wang |
| 2007 | A statistical method of evaluating pronunciation proficiency for presentation in English. Seiichi Nakagawa, Kei Ohta |
| 2007 | A statistical model based post-filtering algorithm for residual echo suppression. Seung Yeol Lee, Jong Won Shin, Hwan Sik Yun, Nam Soo Kim |
| 2007 | A straightforward and efficient implementation of the factor analysis model for speaker verification. Driss Matrouf, Nicolas Scheffer, Benoit G. B. Fauve, Jean-François Bonastre |
| 2007 | A structured speech model parameterized by recursive dynamics and neural networks. Roberto Togneri, Li Deng |
| 2007 | A study on temporal features derived by analytic signal. Yotaro Kubo, Shigeki Okawa, Akira Kurematsu, Katsuhiko Shirai |
| 2007 | A study on word detector design and knowledge-based pruning and rescoring. Chengyuan Ma, Chin-Hui Lee |
| 2007 | A sub-optimal viterbi-like search for linear dynamic models classification. Dimitris Oikonomidis, Vassilios Diakoloukas, Vassilios Digalakis |
| 2007 | A system for transforming the emotion in speech: combining data-driven conversion techniques for prosody and voice quality. Zeynep Inanoglu, Steve J. Young |
| 2007 | A tagging algorithm for mixed language identification in a noisy domain. Mike Rosner, Paulseph-John Farrugia |
| 2007 | A text-constrained prosodic system for speaker verification. Elizabeth Shriberg, Luciana Ferrer |
| 2007 | A text-free approach to assessing nonnative intonation. Joseph Tepperman, Abe Kazemzadeh, Shrikanth S. Narayanan |
| 2007 | A trainable excitation model for HMM-based speech synthesis. Ranniery Maia, Tomoki Toda, Heiga Zen, Yoshihiko Nankaku, Keiichi Tokuda |
| 2007 | A unified approach to multi-pose audio-visual ASR. Patrick Lucey, Gerasimos Potamianos, Sridha Sridharan |
| 2007 | A unified probabilistic generative framework for extractive spoken document summarization. Yi-Ting Chen, Hsuan-Sheng Chiu, Hsin-Min Wang, Berlin Chen |
| 2007 | A uniformly most powerful test for statistical model-based voice activity detection. Keun Won Jang, Dong Kook Kim, Joon-Hyuk Chang |
| 2007 | A variational approach to robust maximum likelihood estimation for speech recognition. Mohamed Kamal Omar |
| 2007 | ASR-based pronunciation training: scoring accuracy and pedagogical effectiveness of a system for dutch L2 learners. Catia Cucchiarini, Ambra Neri, Febe de Wet, Helmer Strik |
| 2007 | Accelerating the annotation of lexical data for less-resourced languages. Gerhard B. Van Huyssteen, Martin J. Puttkammer |
| 2007 | Accent assignment algorithm in Hungarian, based on syntactic analysis. Anne Tamm, Kálmán Abari, Gábor Olaszy |
| 2007 | Accurate marginalization range for missing data recognition. Sébastien Demange, Christophe Cerisara, Jean Paul Haton |
| 2007 | Acoustic analysis of the neutral tone in Mandarin. Philippe Martin, Jun Li |
| 2007 | Acoustic and affective comparisons of natural and imaginary infant-, foreigner- and adult-directed speech. Monja A. Knoll, Lisa Scharrer |
| 2007 | Acoustic correlates of intelligibility enhancements in clearly produced fricatives. Kazumi Maniwa, Allard Jongman, Travis Wade |
| 2007 | Acoustic correlates of laryngeal-muscle fatigue: findings for a phonometric prevention of acquired voice pathologies. Victor J. Boucher |
| 2007 | Acoustic features of anger utterances during natural dialog. Yoshiko Arimoto, Sumio Ohno, Hitoshi Iida |
| 2007 | Acoustic language identification using fast discriminative training. Fabio Castaldo, Daniele Colibro, Emanuele Dalmasso, Pietro Laface, Claudio Vair |
| 2007 | Acoustic parameters for the automatic detection of vowel nasalization. Tarun Pruthi, Carol Y. Espy-Wilson |
| 2007 | Acoustic-phonetic features for refining the explicit speech segmentation. Antonio Marcos Selmini, Fábio Violaro |
| 2007 | Acquisition and synchronization of multimodal articulatory data. Michael Aron, Nicolas Ferveur, Erwan Kerrien, Marie-Odile Berger, Yves Laprie |
| 2007 | Acquisition of vowel duration in children speaking american English. Eon-Suk Ko |
| 2007 | Active binaural distance estimation for dynamic sources. Yan-Chen Lu, Martin Cooke, Heidi Christensen |
| 2007 | Adaptive weighting of microphone arrays for distant-talking F0 and voiced/unvoiced estimation. Federico Flego, Christian Zieger, Maurizio Omologo |
| 2007 | Adding noise to improve noise robustness in speech recognition. Nicolás Morales, Liang Gu, Yuqing Gao |
| 2007 | Advanced front-end for robust speech recognition in extremely adverse environments. Dimitrios Dimitriadis, José C. Segura, Luz García, Alexandros Potamianos, Petros Maragos, Vassilis Pitsikalis |
| 2007 | Advances in Mandarin broadcast speech recognition. Mei-Yuh Hwang, Wen Wang, Xin Lei, Jing Zheng, Özgür Çetin, Gang Peng |
| 2007 | Advances in speechfind: transcript reliability estimation employing confidence measure based on discriminative sub-word model for SDR. Wooil Kim, John H. L. Hansen |
| 2007 | Age-related changes in fundamental frequency and formants: a longitudinal study of four speakers. Jonathan Harrington, Sallyanne Palethorpe, Catherine I. Watson |
| 2007 | Alignment of the second low target in dutch falling-rising pitch contours. Jörg Peters, Judith Hanssen, Carlos Gussenhoven |
| 2007 | Always listening to you: creating exhaustive audio database in home environments. Yasunari Obuchi, Akio Amano |
| 2007 | Ambient telephony: scenarios and research challenges. Aki Härmä |
| 2007 | An 8-32 kbit/s scalable wideband coder extended with MDCT-based bandwidth extension on top of a 6.8 kbit/s narrowband CELP coder. Masahiro Oshikiri, Hiroyuki Ehara, Toshiyuki Morii, Tomofumi Yamanashi, Kaoru Satoh, Koji Yoshida |
| 2007 | An HMM acoustic model incorporating various additional knowledge sources. Sakriani Sakti, Konstantin Markov, Satoshi Nakamura |
| 2007 | An HMM-based speech synthesis system applied to German and its adaptation to a limited set of expressive football announcements. Sacha Krstulovic, Anna Hunecke, Marc Schröder |
| 2007 | An MRI study of european portuguese nasals. Paula Martins, Inês Carbone, Augusto Silva, António J. S. Teixeira |
| 2007 | An active approach to speaker and task adaptation based on automatic analysis of vocabulary confusability. Qiang Huo, Wei Li |
| 2007 | An analysis of individual differences in the f Hiromi Kawatsu, Sumio Ohno |
| 2007 | An approach to efficient generation of high-accuracy and compact error-corrective models for speech recognition. Takanobu Oba, Takaaki Hori, Atsushi Nakamura |
| 2007 | An approach to iterative speech feature enhancement and recognition. Stefan Windmann, Reinhold Haeb-Umbach |
| 2007 | An approximate solution for perceptually constrained signal subspace speech enhancement method. Adam Borowicz, Alexander A. Petrovsky |
| 2007 | An articulatory and acoustic study of "retroflex" and "bunched" american English rhotic sound based on MRI. Xinhui Zhou, Carol Y. Espy-Wilson, Mark Tiede, Suzanne Boyce |
| 2007 | An automatic prosody labeling method for Mandarin speech. Chen-Yu Chiang, Hsiu-Min Yu, Yih-Ru Wang, Sin-Horng Chen |
| 2007 | An effective initial/final duration prediction method for corpus-based singing voice synthesis of Mandarin Chinese. Cheng-Yuan Lin, Pei-Chi Jao, Jyh-Shing Roger Jang |
| 2007 | An empirical investigation of the nonuniqueness in the acoustic-to-articulatory mapping. Chao Qin, Miguel Á. Carreira-Perpiñán |
| 2007 | An ensemble modeling approach to joint characterization of speaker and speaking environments. Yu Tsao, Chin-Hui Lee |
| 2007 | An evaluation of cross-language adaptation and native speech training for rapid HMM construction based on very limited training data. Xufang Zhao, Douglas D. O'Shaughnessy |
| 2007 | An extension 2DPCA based visual feature extraction method for audio-visual speech recognition. Guanyong Wu, Jie Zhu |
| 2007 | An improved method for unsupervised training of LVCSR systems. Christian Gollan, Stefan Hahn, Ralf Schlüter, Hermann Ney |
| 2007 | An improved speaker diarization system. Rong Fu, Ian D. Benest |
| 2007 | An information state based dialogue manager for a mobile robot. Marcelo Quinderé, Luís Seabra Lopes, António J. S. Teixeira |
| 2007 | An information theoretic approach to predict speech intelligibility for listeners with normal and impaired hearing. Svante Stadler, Arne Leijon, Björn Hagerman |
| 2007 | An integration method of retrieval results using plural subword models for vocabulary-free spoken document retrieval. Yoshiaki Itoh, Kohei Iwata, Kazunori Kojima, Masaaki Ishigame, Kazuyo Tanaka, Shi-wook Lee |
| 2007 | An interactive timeline for speech database browsing. Benoît Favre, Jean-François Bonastre, Patrice Bellot |
| 2007 | An open-set detection evaluation methodology applied to language and emotion recognition. David A. van Leeuwen, Khiet P. Truong |
| 2007 | An optimal speech enhancement under speech uncertainty probability and masking property of auditory system. Xiaoshan Huang, Xiaoqun Zhao |
| 2007 | An overview on automatic speech attribute transcription (ASAT). Chin-Hui Lee, Mark A. Clements, Sorin Dusan, Eric Fosler-Lussier, Keith Johnson, Biing-Hwang Juang, Lawrence R. Rabiner |
| 2007 | An unsupervised approach to automatic prosodic annotation. Xinqiang Ni, Yining Chen, Frank K. Soong, Min Chu, Ping Zhang |
| 2007 | Analysis and classification of speech mode: whispered through shouted. Chi Zhang, John H. L. Hansen |
| 2007 | Analysis of communication failures for spoken dialogue systems. Sebastian Möller, Klaus-Peter Engelbrecht, Antti Oulasvirta |
| 2007 | Analysis of emotional speech prosody in terms of part of speech tags. Murtaza Bulut, Sungbok Lee, Shrikanth S. Narayanan |
| 2007 | Analysis of head motions and speech in spoken dialogue. Carlos Toshinori Ishi, Hiroshi Ishiguro, Norihiro Hagita |
| 2007 | Analysis of the impact of analogue telephone channel on MFCC parameters for voice pathology detection. Rubén Fraile, Juan Ignacio Godino-Llorente, Nicolás Sáenz-Lechón, Víctor Osma-Ruiz, Pedro Gómez-Vilda |
| 2007 | Analysis of the occurrence of laughter in meetings. Kornel Laskowski, Susanne Burger |
| 2007 | Analyzing temporal transition of real user's behaviors in a spoken dialogue system. Kazunori Komatani, Tatsuya Kawahara, Hiroshi G. Okuno |
| 2007 | Application of CMLLR in narrow band wide band adapted systems. Martin Karafiát, Lukás Burget, Jan Cernocký, Thomas Hain |
| 2007 | Application of shifted delta cepstral features in speaker verification. José R. Calvo, Rafael Fernández, Gabriel Hernández |
| 2007 | Application of speech technology in a home based assessment kiosk for early detection of alzheimer's disease. Rachel Coulston, Esther Klabbers, Jacques de Villiers, John-Paul Hosom |
| 2007 | Applying word duration constraints by using unrolled HMMs. Ning Ma, Jon Barker, Phil D. Green |
| 2007 | Approaches for adaptive database reduction for text-to-speech synthesis. Aleksandra Krul, Géraldine Damnati, François Yvon, Cédric Boidin, Thierry Moudenc |
| 2007 | Approximation method of subglottal system using ARMA filter. Nobuhiro Miki, Kyohei Hayashi |
| 2007 | Articulatory acoustic feature applications in speech synthesis. Peter Cahill, Daniel Aioanei, Julie Carson-Berndsen |
| 2007 | Articulatory feature classifiers trained on 2000 hours of telephone speech. Joe Frankel, Mathew Magimai-Doss, Simon King, Karen Livescu, Özgür Çetin |
| 2007 | Articulatory synthesis of singing. Peter Birkholz |
| 2007 | Artificial bandwidth extension for speech signals using speech recogniton. Shingo Kuroiwa, Masashi Takashina, Satoru Tsuge, Fuji Ren |
| 2007 | Artificial bandwidth extension without side information for ITU-t g.729.1. Bernd Geiser, Hervé Taddei, Peter Vary |
| 2007 | Artificial impostor voice transformation effects on false acceptance rates. Jean-François Bonastre, Driss Matrouf, Corinne Fredouille |
| 2007 | Aspects of visual speech in Arabic. Slim Ouni, Kaïs Ouni |
| 2007 | Assessment of vocal dysperiodicities in connected disordered speech. Ali Alpan, Abdellah Kacha, Francis Grenez, Jean Schoentgen |
| 2007 | Attention shift decoding for conversational speech recognition. Raghunandan Kumaran, Jeff A. Bilmes, Katrin Kirchhoff |
| 2007 | Attribute-based Mandarin speech recognition using conditional random fields. Chi-Yueh Lin, Hsiao-Chuan Wang |
| 2007 | Audio classification using extended baum-welch transformations. Tara N. Sainath, Victor Zue, Dimitri Kanevsky |
| 2007 | Audio-based approaches to head orientation estimation in a smart-room. Alberto Abad, Carlos Segura, Climent Nadeu, Javier Hernando |
| 2007 | Audio-visual integration for robust speech recognition using maximum weighted stream posteriors. Rowan Seymour, Darryl Stewart, Ji Ming |
| 2007 | Audio-visual phoneme classification for pronunciation training applications. Hedvig Kjellström, Olov Engwall, Sherif Mahdy Abdou, Olle Bälter |
| 2007 | Audiovisual emotional speech of game playing children: effects of age and culture. Suleman Shahid, Emiel Krahmer, Marc Swerts |
| 2007 | Audiovisual speaker identity verification based on lip motion features. Girija Chetty, Michael Wagner |
| 2007 | Automated directory assistance system - from theory to practice. Dong Yu, Yun-Cheng Ju, Ye-Yi Wang, Geoffrey Zweig, Alex Acero |
| 2007 | Automatic acoustic segmentation for speech recognition on broadcast recordings. Gang Peng, Mei-Yuh Hwang, Mari Ostendorf |
| 2007 | Automatic assessment of children's reading level. Jacques Duchateau, Leen Cleuren, Hugo Van hamme, Pol Ghesquière |
| 2007 | Automatic building of synthetic voices from large multi-paragraph speech databases. Kishore Prahallad, Arthur R. Toth, Alan W. Black |
| 2007 | Automatic detection and classification of disfluent reading miscues in young children's speech for the purpose of assessment. Matthew Black, Joseph Tepperman, Sungbok Lee, Patti Price, Shrikanth S. Narayanan |
| 2007 | Automatic estimation of scaling factors among probabilistic models in speech recognition. Tadashi Emori, Yoshifumi Onishi, Koichi Shinoda |
| 2007 | Automatic extraction of cue phrases for important sentences in lecture speech and automatic lecture speech summarization. Yasuhisa Fujii, Norihide Kitaoka, Seiichi Nakagawa |
| 2007 | Automatic generation of cloze items for prepositions. John Lee, Stephanie Seneff |
| 2007 | Automatic head motion prediction from speech data. Gregor Hofer, Hiroshi Shimodaira |
| 2007 | Automatic large-scale oral language proficiency assessment. Febe de Wet, Christa van der Walt, Thomas Niesler |
| 2007 | Automatic laughter detection using neural networks. Mary Tai Knox, Nikki Mirghafori |
| 2007 | Automatic phonetic segmentation of Spanish emotional speech. Ascensión Gallardo-Antolín, Roberto Barra-Chicote, Marc Schröder, Sacha Krstulovic, Juan Manuel Montero |
| 2007 | Automatic pitch accent prediction for text-to-speech synthesis. Ian Read, Stephen Cox |
| 2007 | Automatic question detection: prosodic-lexical features and crosslingual experiments. Minh-Quang Vu, Laurent Besacier, Eric Castelli |
| 2007 | Automatic recognition of connected vowels only using speaker-invariant representation of speech dynamics. Satoshi Asakawa, Nobuaki Minematsu, Keikichi Hirose |
| 2007 | Automatic scoring of the intelligibility in patients with cancer of the oral cavity. Andreas K. Maier, Maria Schuster, Anton Batliner, Elmar Nöth, Emeka Nkenke |
| 2007 | Automatic speech recognition for an under-resourced language - amharic. Solomon Teferra Abate, Wolfgang Menzel |
| 2007 | Automatic speech recognition framework for multilingual audio contents. Hiroaki Nanjo, Yuichi Oku, Takehiko Yoshimi |
| 2007 | Automatic speech recognition with a cochlear implant front-end. Waldo Nogueira, Tamás Harczos, Bernd Edler, Jörn Ostermann, Andreas Büchner |
| 2007 | Automatic transcription for a web 2.0 service to search podcasts. Jun Ogata, Masataka Goto, Kouichirou Eto |
| 2007 | Automatically learning the units of speech by non-negative matrix factorisation. Veronique Stouten, Kris Demuynck, Hugo Van hamme |
| 2007 | BECAM tool - a semi-automatic tool for bootstrapping emotion corpus annotation and management. Slim Abdennadher, Mohamed Aly, Dirk Bühler, Wolfgang Minker, Johannes Pittermann |
| 2007 | Bayes risk-based optimization of dialogue management for document retrieval system with speech interface. Teruhisa Misu, Tatsuya Kawahara |
| 2007 | Behavior models for learning and receptionist dialogs. Hartwig Holzapfel, Alex Waibel |
| 2007 | Benchmarking human performance on the acoustic and linguistic subtasks of ASR systems. László Tóth |
| 2007 | Bhattacharyya error and divergence using variational importance sampling. Peder A. Olsen, John R. Hershey |
| 2007 | Bilingual LSA-based translation lexicon adaptation for spoken language translation. Yik-Cheung Tam, Tanja Schultz |
| 2007 | Bit-erasure channel decoding for GMM-based multiple description coding. Yannis Agiomyrgiannakis, Yannis Stylianou |
| 2007 | Blind adaptive principal eigenvector beamforming for acoustical source separation. Ernst Warsitz, Reinhold Haeb-Umbach, Dang Hai Tran Vu |
| 2007 | Boosting with anti-models for automatic language identification. Xi Yang, Man-Hung Siu, Herbert Gish, Brian Mak |
| 2007 | Bootstrapping morphological analysis of gĩkũyũ using unsupervised maximum entropy learning. Guy De Pauw, Peter Waiganjo Wagacha |
| 2007 | Building an information retrieval system for serbian - challenges and solutions. Miroslav Martinovic, Srdjdan Vesic, Goran Rakic |
| 2007 | Building multiple complementary systems using directed decision trees. Catherine Breslin, Mark J. F. Gales |
| 2007 | CALL courseware for learning reactive tokens in face-to-face dialogs. Takafumi Utashiro, Goh Kawai |
| 2007 | Can unquantised articulatory feature continuums be modelled? Odette Scharenborg, Vincent Wan |
| 2007 | Categorical perception in intonation: a matter of signal dynamics? Oliver Niebuhr |
| 2007 | Categorical perception of Cantonese tones in context: a cross-linguistic study. Hongying Zheng, Peter W. M. Tsang, William S.-Y. Wang |
| 2007 | Channel selection by class separability measures for automatic transcriptions on distant microphones. Matthias Wölfel |
| 2007 | Children's convergence in referring expressions to graphical objects in a speech-enabled computer game. Linda Bell, Joakim Gustafson |
| 2007 | Class constrained ROVER based speech enhancement. Amit Das, John H. L. Hansen |
| 2007 | Classification of discourse functions of affirmative words in spoken dialogue. Agustín Gravano, Stefan Benus, Julia Hirschberg, Shira Mitchell, Ilia Vovsha |
| 2007 | Cluster adaptive training weights as features in SVM-based speaker verification. Hao Yang, Yuan Dong, Xianyu Zhao, Jian Zhao, Liang Lu, Haila Wang |
| 2007 | Cluster-based polynomial-fit histogram equalization (CPHEQ) for robust speech recognition. Shih-Hsiang Lin, Yao-Ming Yeh, Berlin Chen |
| 2007 | Clustered maximum likelihood linear basis for rapid speaker adaptation. Yun Tang, Richard C. Rose |
| 2007 | Clustering-based two-dimensional linear discriminant analysis for speech recognition. Xiao-Bing Li, Douglas D. O'Shaughnessy |
| 2007 | Co-training using prosodic and lexical information for sentence segmentation. Ümit Güz, Sébastien Cuendet, Dilek Hakkani-Tür, Gökhan Tür |
| 2007 | Collection of empirical data for standardization of generic vocabularies in speech driven ICT devices and services. Rosemary Orr, Bernat González i Llinares, Françoise Petersen, Helge Hüttenrauch, Martin Böcker, Michael Tate |
| 2007 | Combination of LSF and pole based parameter interpolation for model-based diphone concatenation. Karl Schnell, Arild Lacroix |
| 2007 | Combined acoustic and pronunciation modelling for non-native speech recognition. Ghazi Bouselmi, Dominique Fohr, Irina Illina |
| 2007 | Combining frame and turn-level information for robust recognition of emotions within speech. Bogdan Vlasenko, Björn W. Schuller, Andreas Wendemuth, Gerhard Rigoll |
| 2007 | Combining length distribution model with decision tree in prosodic phrase prediction. Qin Shi, Danning Jiang, Fanping Meng, Yong Qin |
| 2007 | Combining rate and place information for robust pitch extraction. Martin Heckmann, Frank Joublin, Christian Goerick |
| 2007 | Combining short-term cepstral and long-term pitch features for automatic recognition of speaker age. Christian A. Müller, Felix Burkhardt |
| 2007 | Compact representations of the articulatory-to-acoustic mapping. Blaise Potard, Yves Laprie |
| 2007 | Comparing GMM-based speech transformation systems. Larbi Mesbahi, Vincent Barreaud, Olivier Boëffard |
| 2007 | Comparing american and palestinian perceptions of charisma using acoustic-prosodic and lexical analysis. Fadi Biadsy, Julia Hirschberg, Andrew Rosenberg, Wisam Dakka |
| 2007 | Comparing classifiers for pronunciation error detection. Helmer Strik, Khiet P. Truong, Febe de Wet, Catia Cucchiarini |
| 2007 | Comparing praat and snack formant measurements on two large corpora of northern and southern French. Cécile Woehrling, Philippe Boula de Mareüil |
| 2007 | Comparison of HMM and DTW methods in automatic recognition of pathological phoneme pronunciation. Robert Wielgat, Tomasz P. Zielinski, Pawel Swietojanski, Piotr Zoladz, Daniel Król, Tomasz Wozniak, Stanislaw Grabias |
| 2007 | Comparison of multiple voice source parameters in different phonation types. Matti Airas, Paavo Alku |
| 2007 | Comparison of subspace methods for Gaussian mixture models in speech recognition. Matti Varjokallio, Mikko Kurimo |
| 2007 | Comparison of two kinds of speaker location representation for SVM-based speaker verification. Xianyu Zhao, Yuan Dong, Hao Yang, Jian Zhao, Liang Lu, Haila Wang |
| 2007 | Complementarity and redundancy in multimodal user inputs with speech and pen gestures. Pui-Yu Hui, Zhengyu Zhou, Helen M. Meng |
| 2007 | Complementary approaches for voice disorder assessment. Jean-François Bonastre, Corinne Fredouille, Alain Ghio, Antoine Giovanni, Gilles Pouchoulin, Joana Revis, Bernard Teston, P. Yu |
| 2007 | Computer-supported human-human multilingual communication. Alex Waibel, Keni Bernardin, Matthias Wölfel |
| 2007 | Computerized chironomy: evaluation of hand-controlled intonation reiteration. Christophe d'Alessandro, Albert Rilliard, Sylvain Le Beux |
| 2007 | Concept and evaluation of a downward-compatible system for spatial teleconferencing using automatic speaker clustering. Alexander Raake, Sascha Spors, Jens Ahrens, Jitendra Ajmera |
| 2007 | Conditional use of word lattices, confusion networks and 1-best string hypotheses in a sequential interpretation strategy. Bogdan Minescu, Géraldine Damnati, Frédéric Béchet, Renato De Mori |
| 2007 | Conditionally linear Gaussian models for estimating vocal tract resonances. Daniel Rudoy, Daniel N. Spendley, Patrick J. Wolfe |
| 2007 | Confidence measure based unsupervised target model adaptation for speaker verification. Alexandre Preti, Jean-François Bonastre, Driss Matrouf, François Capman, Bertrand Ravera |
| 2007 | Confidence measures for voice search applications. Ye-Yi Wang, Dong Yu, Yun-Cheng Ju, Geoffrey Zweig, Alex Acero |
| 2007 | Construction and analysis of multiple paths in syllable models. Annika Hämäläinen, Louis ten Bosch, Lou Boves |
| 2007 | Construction of a phonotactic dialect corpus using semiautomatic annotation. Reva Schwartz, Wade Shen, Joseph P. Campbell, Shelley Paget, Julie Vonwiller, Dominique Estival, Christopher Cieri |
| 2007 | Construction of spoken language model including fillers using filler prediction model. Kengo Ohta, Masatoshi Tsuchiya, Seiichi Nakagawa |
| 2007 | Context constrained-generalized posterior probability for verifying phone transcriptions. Hua Zhang, Lijuan Wang, Frank K. Soong, Wenju Liu |
| 2007 | Context dependent syllable acoustic model for continuous Chinese speech recognition. Hao Wu, Xihong Wu |
| 2007 | Context dependent word modeling for statistical machine translation using part-of-speech tags. Ruhi Sarikaya, Yonggang Deng, Yuqing Gao |
| 2007 | Continuous prosodic features and formant modeling with joint factor analysis for speaker verification. Najim Dehak, Patrick Kenny, Pierre Dumouchel |
| 2007 | Continuous-speech phone recognition from ultrasound and optical images of the tongue and lips. Thomas Hueber, Gérard Chollet, Bruce Denby, Gérard Dreyfus, Maureen Stone |
| 2007 | Contributions of temporal fine structure cues to Chinese speech recognition in cochlear implant simulation. Lin Yang, Jianping Zhang, Yonghong Yan |
| 2007 | Control of an articulatory speech synthesizer based on dynamic approximation of spatial articulatory targets. Peter Birkholz |
| 2007 | Conversation detection and speaker segmentation in privacy-sensitive situated speech data. Danny Wyatt, Tanzeem Choudhury, Jeff A. Bilmes |
| 2007 | Corpus-based generation of prosodic features from text based on generation process model. Keikichi Hirose, Keiko Ochi, Nobuaki Minematsu |
| 2007 | Creating multimedia dictionaries of endangered languages using LEXUS. Jacquelijn Ringersma, Marc Kemps-Snijders |
| 2007 | Creating spoken dialogue characters from corpora without annotations. Sudeep Gandhe, David R. Traum |
| 2007 | Cross-language phonemisation in German text-to-speech synthesis. Jochen Steigner, Marc Schröder |
| 2007 | Cross-linguistic analysis of prosodic features for sentence segmentation. James G. Fung, Dilek Hakkani-Tür, Mathew Magimai-Doss, Elizabeth Shriberg, Sébastien Cuendet, Nikki Mirghafori |
| 2007 | DFT domain subspace based noise tracking for speech enhancement. Richard C. Hendriks, Jesper Jensen, Richard Heusdens |
| 2007 | Degradation-classification assisted single-ended quality measurement of speech. Hua Yuan, Tiago H. Falk, Wai-Yip Chan |
| 2007 | Dependence of tone perception on syllable perception. Michael Olsberg, Yi Xu, Jeremy Green |
| 2007 | Derivative and parametric kernels for speaker verification. Chris Longworth, Mark J. F. Gales |
| 2007 | Design and characterization of the non-native military air traffic communications database (nnMATC). Stéphane Pigeon, Wade Shen, Aaron D. Lawson, David A. van Leeuwen |
| 2007 | Design and development of voice controlled aids for motor-handicapped persons. Petr Cerva, Jan Nouza |
| 2007 | Design and recording of Czech sign language corpus for automatic sign language recognition. Pavel Campr, Marek Hrúz, Milos Zelezný |
| 2007 | Design of a rich multimodal interface for mobile spoken route guidance. Markku Turunen, Jaakko Hakulinen, Anssi Kainulainen, Aleksi Melto, Topi Hurtig |
| 2007 | Detecting deception using critical segments. Frank Enos, Elizabeth Shriberg, Martin Graciarena, Julia Hirschberg, Andreas Stolcke |
| 2007 | Detecting pitch accent using pitch-corrected energy-based predictors. Andrew Rosenberg, Julia Hirschberg |
| 2007 | Detection and removal of switching noise in push-to-talk and voice operated exchange communications systems. Brett Y. Smolenski |
| 2007 | Detection of instants of glottal closure using characteristics of excitation source. Sunitha Guruprasad, B. Yegnanarayana, K. Sri Rama Murty |
| 2007 | Detection of out-of-vocabulary words in posterior based ASR. Hamed Ketabdar, Mirko Hannemann, Hynek Hermansky |
| 2007 | Detection, diarization, and transcription of far-field lecture speech. Jing Huang, Etienne Marcheret, Karthik Visweswariah, Vit Libal, Gerasimos Potamianos |
| 2007 | Detection-based ASR in the automatic speech attribute transcription project. Ilana Bromberg, Qian Qian, Jun Hou, Jinyu Li, Chengyuan Ma, Brett Matthews, Antonio Moreno-Daniel, Jeremy Morris, Sabato Marco Siniscalchi, Yu Tsao, Yu Wang |
| 2007 | Development of multimodal resources for multilingual information retrieval in the basque context. Nora Barroso, Aitzol Ezeiza, N. Gilisagasti, Karmele López de Ipiña, A. López, Juan Miguel López |
| 2007 | Development of preschool children subsystem for ASR and q&a in a real-environment speech-oriented guidance task. Tobias Cincarek, Izumi Shindo, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano |
| 2007 | Dimension reduction for speaker identification based on mutual information. Xugang Lu, Jianwu Dang |
| 2007 | Dimensionality reduction for speech recognition using neighborhood components analysis. Natasha Singh-Miller, Michael Collins, Timothy J. Hazen |
| 2007 | Dimensionality reduction methods applied to both magnitude and phase derived features. Andrew Errity, John McKenna, Barry Kirkpatrick |
| 2007 | Dimensionality reduction of speech features using nonlinear principal components analysis. Stephen A. Zahorian, Tara Singh, Hongbing Hu |
| 2007 | Direct acoustic feature using iterative EM algorithm and spectral energy for classifying suicidal speech. T. Yingthawornsuk, H. Kaymaz Keskinpala, D. Mitchell Wilkes, Richard G. Shiavi, Ronald M. Salomon |
| 2007 | Direct optimisation of a multilayer perceptron for the estimation of cepstral mean and variance statistics. John Dines, Jithendra Vepa |
| 2007 | Discrimination and recognition of scaled word sounds. Toshio Irino, Yoshie Aoki, Yoshie Hayashi, Hideki Kawahara, Roy D. Patterson |
| 2007 | Discriminative MCE-based speaker adaptation of acoustic models for a spoken lecture processing task. Timothy J. Hazen, Erik McDermott |
| 2007 | Discriminative noise adaptive training approach for an environment migration. Byung Ok Kang, Ho-Young Jung, Yunkeun Lee |
| 2007 | Discriminative optimization of language adapted HMMs for a language identification system based on parallel phoneme recognizers. Josef G. Bauer, Bernt Andrassy, Ekaterina Timoshenko |
| 2007 | Disfluency correction of spontaneous speech using conditional random fields with variable-length features. Jui-Feng Yeh, Chung-Hsien Wu, Wei-Yen Wu |
| 2007 | Distinctive phonetic feature (DPF) based phone segmentation using hybrid neural networks. Mohammad Nurul Huda, Muhammad Ghulam, Junsei Horikawa, Tsuneo Nitta |
| 2007 | Do different boundary types induce subtle acoustic cues to which French listeners are sensitive? Odile Bagou, Sophie Dufour, Cécile Fougeron, Alain Content, Ulrich H. Frauenfelder |
| 2007 | Dual-channel acoustic detection of nasalization states. Xiaochuan Niu, Jan P. H. van Santen |
| 2007 | Duration and pauses as boundary-markers in speech: a cross-linguistic study. Li-chiung Yang |
| 2007 | Duration and pronunciation conditioned lexical modeling for speaker verification. Gökhan Tür, Elizabeth Shriberg, Andreas Stolcke, Sachin S. Kajarekar |
| 2007 | Dynamic integration of multiple feature streams for robust real-time LVCSR. Shoei Sato, Kazuo Onoe, Akio Kobayashi, Shinichi Homma, Toru Imai, Tohru Takagi, Tetsunori Kobayashi |
| 2007 | Dynamic language change in MIMUS. Carmen del Solar, Guillermo Pérez-García, Eva Florencio, David Moral, Gabriel Amores Carredano, Pilar Manchón Portillo |
| 2007 | Dynamic language model adaptation using presentation slides for lecture speech recognition. Hiroki Yamazaki, Koji Iwano, Koichi Shinoda, Sadaoki Furui, Haruo Yokota |
| 2007 | ELAN: a free and open-source multimedia annotation tool. Han Sloetjes, Albert Russel, Alexander Klassmann |
| 2007 | EMD based soft-thresholding for speech enhancement. Erhan Deger, Md. Khademul Islam Molla, Keikichi Hirose, Nobuaki Minematsu, Md. Kamrul Hasan |
| 2007 | Effect of incomplete glottal closures on estimates of glottal waves via inverse filtering of vowel sounds. Huiqun Deng, Douglas D. O'Shaughnessy |
| 2007 | Effect of intensive voice therapy on vocal tremor for parkinson speakers. Laurence Cnockaert, Jean Schoentgen, Canan Ozsancak, Pascal Auzou, Francis Grenez |
| 2007 | Effect of number of masking talkers on speech-on-speech masking in Chinese. Xihong Wu, Jing Chen, Zhigang Yang, Qiang Huang, Mengyuan Wang, Liang Li |
| 2007 | Effect of unsteady glottal flow on the speech production process. Hideyuki Nomura, Tetsuo Funada |
| 2007 | Effect of within- and between-talker variability on word identification in noise by younger and older adults. Huiwen Goy, Kathleen Pichora-Fuller, Pascal van Lieshout, Gurjit Singh, Bruce Schneider |
| 2007 | Effects of FE modelled consequences of tonsillectomy on perceptual evaluation of voice. Anne-Maria Laukkanen, Jaromír Horácek, Pavel Svancara, Elina Lehtinen |
| 2007 | Effects of non-native dialects on spoken word recognition. Jennifer T. Le, Catherine T. Best, Michael D. Tyler, Christian Kroos |
| 2007 | Effects of quiz-style information presentation on user understanding. Ryuichiro Higashinaka, Kohji Dohsaka, Shigeaki Amano, Hideki Isozaki |
| 2007 | Effects of testosterone levels on temporal and intonational aspects of speech: more exploratory data. Charles A. Lamoureux, Victor J. Boucher |
| 2007 | Efficient estimation of speaker-specific projecting feature transforms. Jonas Lööf, Ralf Schlüter, Hermann Ney |
| 2007 | Emotion attribute projection for speaker recognition on emotional speech. Huanjun Bao, Ming-Xing Xu, Thomas Fang Zheng |
| 2007 | Emotion clustering using the results of subjective opinion tests for emotion recognition in infants' cries. N. Satoh, Katsuya Yamauchi, Shoichi Matsunaga, Masaru Yamashita, R. Nakagawa, Kazuyuki Shinohara |
| 2007 | Empirical evidence for prosodic phrasing: pauses as linguistic annotation in Korean read speech. Hyongsil Cho, Daniel Hirst |
| 2007 | English and French speakers' perception of voicing distinctions in non-native lateral consonant syllable onsets. Catherine T. Best, Pierre A. Hallé, Jennifer S. Pardo |
| 2007 | Enhancing acoustic-to-EPG mapping with lip position information. Asterios Toutios, Konstantinos G. Margaritis |
| 2007 | Enhancing usability of CAPL system for qur'an recitation learning. Abdurrahman Samir, Sherif Mahdy Abdou, Ahmed Husien Khalil, Mohsen A. Rashwan |
| 2007 | Environmentally aware voice activity detector. Abhijeet Sangwan, Nitish Krishnamurthy, John H. L. Hansen |
| 2007 | Error detection in confusion network. Alexandre Allauzen |
| 2007 | Error-tolerant question answering for spoken documents. Tomoyosi Akiba, Hirofumi Tsujimura |
| 2007 | Estimating VTLN warping factors by distribution matching. Janne Pylkkönen |
| 2007 | Estimation of place of articulation in stop consonants for visual feedback. Milind S. Shah, Prem C. Pandey |
| 2007 | Evaluating acoustic distance measures for template based recognition. Mathias De Wachter, Kris Demuynck, Patrick Wambacq, Dirk Van Compernolle |
| 2007 | Evaluating and optimizing Japanese tutor system featuring dynamic question generation and interactive guidance. Christopher J. Waple, Hongcui Wang, Tatsuya Kawahara, Yasushi Tsubota, Masatake Dantsuji |
| 2007 | Evaluating the temporal structure normalisation technique on the Aurora-4 task. Xiong Xiao, Engsiong Chng, Haizhou Li |
| 2007 | Evaluating two versions of the momel pitch modelling algorithm on a corpus of read speech in Korean. Daniel Hirst, Hyongsil Cho, Sunhee Kim, Hyunji Yu |
| 2007 | Evaluation of alternatives on speech to sign language translation. Rubén San Segundo, Alicia Pérez, Daniel Ortiz, Luis Fernando D'Haro, M. Inés Torres, Francisco Casacuberta |
| 2007 | Evaluation of real-time voice activity detection based on high order statistics. David Cournapeau, Tatsuya Kawahara |
| 2007 | Evaluation of syllable stress using single class classifier. Abhinav Parate, Ashish Verma, Jayanta Basak |
| 2007 | Evaluation of the combined use of MEMLIN and MLLR on the non-native adaptation task of hiwire project database. Luis Buera, Antonio Miguel, Oscar Saz, Eduardo Lleida, Alfonso Ortega |
| 2007 | Event detection of speech signals based on auditory processing with a dynamic compressive gammachirp filterbank. Satomi Tanaka, Minoru Tsuzaki, Hiroaki Kato, Yoshinori Sagisaka |
| 2007 | Evolutionary minimum verification error learning of the alternative hypothesis model for LLR-based speaker verification. Yi-Hsiang Chao, Wei-Ho Tsai, Shih-Sian Cheng, Hsin-Min Wang, Ruei-Chuan Chang |
| 2007 | Experimental validation of direct and inverse glottal flow models for unsteady flow conditions. Julien Cisonni, Annemie Van Hirtum, Jan Willems, Xavier Pelorson |
| 2007 | Experiments on hiwire database using denoising and adaptation with a hybrid HMM-ANN model. Roberto Gemello, Franco Mana, Stefano Scanzio |
| 2007 | Exploiting information extraction annotations for document retrieval in distillation tasks. Dilek Hakkani-Tür, Gökhan Tür, Michael Levit |
| 2007 | Exploiting phoneme similarities in hybrid HMM-ANN keyword spotting. Joel Pinto, Andrew Lovitt, Hynek Hermansky |
| 2007 | Exploiting prosodic features for dialog act tagging in a discriminative modeling framework. Vivek Kumar Rangarajan Sridhar, Srinivas Bangalore, Shrikanth S. Narayanan |
| 2007 | Exploiting prosody for PCFGs with latent annotations. Markus Dreyer, Izhak Shafran |
| 2007 | Exploiting unlabeled internal data in conditional random fields to reduce word segmentation errors for Chinese texts. Richard Tzong-Han Tsai, Hsi-Chuan Hung, Hong-Jie Dai, Wen-Lian Hsu |
| 2007 | Exploring initiative strategies using computer simulation. Fan Yang, Peter A. Heeman |
| 2007 | Exploring tonal variations via context-dependent tone models. Yue-Ning Hu, Min Chu, Chao Huang, Yan-Ning Zhang |
| 2007 | Extended powered cepstral normalization (p-CN) with range equalization for robust features in speech recognition. Chang-Wen Hsu, Lin-Shan Lee |
| 2007 | Extra large vocabulary continuous speech recognition algorithm based on information retrieval. Valeriy Pylypenko |
| 2007 | Extracting true speaker identities from transcriptions. Yannick Estève, Sylvain Meignier, Paul Deléglise, Julie Mauclair |
| 2007 | F Hiroko Hirano, Keikichi Hirose, Goh Kawai, Wentao Gu, Nobuaki Minematsu |
| 2007 | F Rerrario Shui-Ching Ho, Yoshinori Sagisaka |
| 2007 | F0 transformation within the voice conversion framework. Zdenek Hanzlícek, Jindrich Matousek |
| 2007 | Fast adaptation of GMM-based compact models. Christophe Lévy, Georges Linarès, Jean-François Bonastre |
| 2007 | Feasibility of constructing an expressive speech corpus from television soap opera dialogue. Peter Rutten |
| 2007 | Feature and distribution normalization schemes for statistical mismatch reduction in reverberant speech recognition. A. M. Toh, Roberto Togneri, Sven Nordholm |
| 2007 | Features interpolation domain for distributed speech recognition and performance for ITU-t g.723.1 CODEC. Vladimir Fabregas Surigué de Alencar, Abraham Alcaim |
| 2007 | Features of pauses and conjunctions at syntactic and discourse boundaries in Japanese monologues. Michiko Watanabe, Yasuharu Den, Keikichi Hirose, Shusaku Miwa, Nobuaki Minematsu |
| 2007 | Fepstrum: an improved modulation spectrum for ASR. Vivek Tyagi |
| 2007 | Filtering the unknown: speech activity detection in heterogeneous video collections. Marijn Huijbregts, Chuck Wooters, Roeland Ordelman |
| 2007 | Fixed-size kernel logistic regression for phoneme classification. Peter Karsmakers, Kristiaan Pelckmans, Johan A. K. Suykens, Hugo Van hamme |
| 2007 | Formal modelling of L1 and L2 perceptual learning: computational linguistics versus machine learning. Paola Escudero, Jelle Kastelein, Klara A. Weiand, R. J. J. H. van Son |
| 2007 | Formant-based synthesis of singing. Sten Ternström, Johan Sundberg |
| 2007 | Frame alignment method for cross-lingual voice conversion. Daniel Erro, Asunción Moreno |
| 2007 | Frame margin probability discriminative training algorithm for noisy speech recognition. Hao-Zheng Li, Douglas D. O'Shaughnessy |
| 2007 | Frequency domain correspondence for speaker normalization. Ming Liu, Xi Zhou, Mark Hasegawa-Johnson, Thomas S. Huang, Zhengyou Zhang |
| 2007 | Frequency study for the characterization of the dysphonic voices. Gilles Pouchoulin, Corinne Fredouille, Jean-François Bonastre, Alain Ghio, Antoine Giovanni |
| 2007 | From one base form to multiple output styles - predicting stylistic dynamics of discourse prosody. Chiu-yu Tseng, Zhao-yu Su |
| 2007 | Fused HMM-adaptation of multi-stream HMMs for audio-visual speech recognition. David Dean, Patrick Lucey, Sridha Sridharan, Tim Wark |
| 2007 | Fusing acoustic, phonetic and data-driven systems for text-independent speaker verification. Asmaa El Hannani, Dijana Petrovska-Delacrétaz |
| 2007 | Fusion of contrastive acoustic models for parallel phonotactic spoken language identification. Khe Chai Sim, Haizhou Li |
| 2007 | Fusion of global statistical and segmental spectral features for speech emotion recognition. Hao Hu, Ming-Xing Xu, Wei Wu |
| 2007 | G2p conversion of names: what can we do (better)? Henk van den Heuvel, Jean-Pierre Martens, Nanneke Konings |
| 2007 | GEMSIS - a novel application of speech recognition to emergency and disaster medicine. Satoshi Tamura, Kunihiko Takamatsu, Shinji Ogura, Satoru Hayamizu |
| 2007 | Gaussian mixture optimization for HMM based on efficient cross-validation. Takahiro Shinozaki, Tatsuya Kawahara |
| 2007 | Generating small, accurate acoustic models with a modified Bayesian information criterion. Kai Yu, Rob A. Rutenbar |
| 2007 | Generative and discriminative algorithms for spoken language understanding. Christian Raymond, Giuseppe Riccardi |
| 2007 | Generic class-based statistical language models for robust speech understanding in directed dialog applications. Matthieu Hébert |
| 2007 | Getting start with UTDrive: driver-behavior modeling and assessment of distraction for in-vehicle speech systems. Pongtep Angkititrakul, DongGu Kwak, Sangjo Choi, Jeonghee Kim, Anh PhucPhan, Amardeep Sathyanarayana, John H. L. Hansen |
| 2007 | Global features for rapid identity verification with dynamic biometric data. Andrew C. Morris, Jacques C. Koreman, B. Ly-Van, Harin Sellahewa, Sabah Jassim, R. Llarena Gómez |
| 2007 | Group delay features for emotion detection. Vidhyasaharan Sethu, Eliathamby Ambikairajah, Julien Epps |
| 2007 | HMM-based speech recognition using decision trees instead of GMMs. Remco Teunen, Masami Akamine |
| 2007 | Handling OOV words in Arabic ASR via flexible morphological constraints. Nguyen Bach, Mohamed Noamany, Ian R. Lane, Tanja Schultz |
| 2007 | Handling phonetic context and speaker variation in a structure-based speech recognizer. Dong Yu, Li Deng, Alex Acero |
| 2007 | Handling speech input in the ritel QA dialogue system. Boris W. van Schooten, Sophie Rosset, Olivier Galibert, Aurélien Max, Rieks op den Akker, Gabriel Illouz |
| 2007 | Hierarchical acoustic modeling based on random-effects regression for automatic speech recognition. Yan Han, Lou Boves |
| 2007 | Hierarchical dialogue optimization using semi-Markov decision processes. Heriberto Cuayáhuitl, Steve Renals, Oliver Lemon, Hiroshi Shimodaira |
| 2007 | Hierarchical language identification based on automatic language clustering. Bo Yin, Eliathamby Ambikairajah, Fang Chen |
| 2007 | Hierarchical neural networks feature extraction for LVCSR system. Fabio Valente, Jithendra Vepa, Christian Plahl, Christian Gollan, Hynek Hermansky, Ralf Schlüter |
| 2007 | Hierarchical non-uniform unit selection based on prosodic structure. Jun Xu, Dezhi Huang, Yongxin Wang, Yuan Dong, Lianhong Cai, Haila Wang |
| 2007 | High-level feature-based speaker verification via articulatory phonetic-class pronunciation modeling. Shi-Xiong Zhang, Man-Wai Mak, Helen M. Meng |
| 2007 | Homograph ambiguity resolution in front-end design for portuguese TTS systems. Daniela Braga, Luís Pinto Coelho, Fernando Gil Vianna Resende Jr. |
| 2007 | How predictable is ASR confidence in dialog applications? Xiang Li, Juan M. Huerta |
| 2007 | How to access audio files of large data bases using in-car speech dialogue systems. Sandra Mann, André Berton, Ute Ehrlich |
| 2007 | How to integrate speech-operated internet information dialogs into a car. André Berton, Peter Regel-Brietzmann, Hans Ulrich Block, Stefanie Schachtl, Manfred Gehrke |
| 2007 | How to judge reusability of existing speech corpora for target task by utilizing statistical multidimensional scaling. Goshu Nagino, Makoto Shozakai, Kiyohiro Shikano |
| 2007 | How to personalize speech applications for web-based information in a car. Philipp Fischer, Andreas Österle, André Berton, Peter Regel-Brietzmann |
| 2007 | Hybrid electroglottograph and speech signal based algorithm for pitch marking. Hussein Hussein, Oliver Jokisch |
| 2007 | Hybridizing conversational and clear speech. Akiko Kusumoto, Alexander Kain, John-Paul Hosom, Jan P. H. van Santen |
| 2007 | IceNLP: a natural language processing toolkit for icelandic. Hrafn Loftsson, Eiríkur Rögnvaldsson |
| 2007 | Identification of natural whistled vowels by non-whistlers. Julien Meyer, Fanny Meunier, Laure Dentel |
| 2007 | Impact of various small sound source signals on voice conversion accuracy in speech communication aid for laryngectomees. Keigo Nakamura, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano |
| 2007 | Implementation and evaluation of an HMM-based Thai speech synthesis system. Suphattharachai Chomphan, Takao Kobayashi |
| 2007 | Improved HMM/SVM methods for automatic phoneme segmentation. Jen-Wei Kuo, Hung-Yi Lo, Hsin-Min Wang |
| 2007 | Improved acoustic modeling for transcribing Arabic broadcast data. Lori Lamel, Abdelkhalek Messaoudi, Jean-Luc Gauvain |
| 2007 | Improved language recognition using better phonetic decoders and fusion with MFCC and SDC features. Doroteo T. Toledano, Javier Gonzalez-Dominguez, Alejandro Abejón-Gonzalez, Danilo Spada, Ismael Mateos-Garcia, Joaquin Gonzalez-Rodriguez |
| 2007 | Improved location features for meeting speaker diarization. Scott Otterson |
| 2007 | Improved machine translation of speech-to-text outputs. Daniel Déchelotte, Holger Schwenk, Gilles Adda, Jean-Luc Gauvain |
| 2007 | Improved methods for language model based question classification. Andreas Merkel, Dietrich Klakow |
| 2007 | Improvements in machine translation for English/iraqi speech translation. Shirin Saleem, Krishna Subramanian, Rohit Prasad, David Stallard, Chia-Lin Kao, Prem Natarajan, Raid Suleiman |
| 2007 | Improving phonotactic language recognition with acoustic adaptation. Wade Shen, Douglas A. Reynolds |
| 2007 | Improving speaker diarization for CHIL lecture meetings. Jing Huang, Etienne Marcheret, Karthik Visweswariah |
| 2007 | Improving speech translation with automatic boundary prediction. Evgeny Matusov, Dustin Hillard, Mathew Magimai-Doss, Dilek Hakkani-Tür, Mari Ostendorf, Hermann Ney |
| 2007 | Improving the phase vocoder approach to pitch-shifting. Petko Nikolov Petkov, W. Bastiaan Kleijn |
| 2007 | In-context phone posteriors as complementary features for tandem ASR. Hamed Ketabdar, Hervé Bourlard |
| 2007 | Increasing prosodic variability of text-to-speech synthesizers. Géza Németh, Márk Fék, Tamás Gábor Csapó |
| 2007 | Incremental perception of acted and real emotional speech. Pashiera Barkhuysen, Emiel Krahmer, Marc Swerts |
| 2007 | Influence of task duration in text-independent speaker verification. Benoit G. B. Fauve, Nicholas W. D. Evans, Neil Pearson, Jean-François Bonastre, John S. D. Mason |
| 2007 | Information retrieval strategies for accessing african audio corpora. Abdillahi Nimaan, Pascal Nocera, Frédéric Béchet, Jean-François Bonastre |
| 2007 | Integrating MAP, marginals, and unsupervised language model adaptation. Wen Wang, Andreas Stolcke |
| 2007 | Integrating audio and visual cues for speaker friendliness in multimodal speech synthesis. David House |
| 2007 | Integrating pitch and localisation cues at a speech fragment level. Heidi Christensen, Ning Ma, Stuart N. Wrigley, Jon Barker |
| 2007 | Integration of ASR and machine translation models in a document translation task. Aarthi M. Reddy, Richard C. Rose, Alain Désilets |
| 2007 | Intensive gestures in French and their multimodal correlates. Gaëlle Ferré, Roxane Bertrand, Philippe Blache, Robert Espesser, Stéphane Rauzy |
| 2007 | Inter-language prosodic style modification experiment using word impression vector for communicative speech generation. Ke Li, Yoko Greenberg, Yoshinori Sagisaka |
| 2007 | Intercoder reliability in annotating complex disfluencies. Peter A. Heeman, Andy McMillin, J. Scott Yaruss |
| 2007 | Introduction to multilingual corpus-based concatenative speech synthesis. Filip Deprez, Jan Odijk, Jan De Moortel |
| 2007 | Investigations into early and late reflections on distant-talking speech recognition toward suitable reverberation criteria. Takanobu Nishiura, Yoshiki Hirano, Yuki Denda, Masato Nakayama |
| 2007 | Iraqcomm: a next generation translation system. Kristin Precoda, Jing Zheng, Dimitra Vergyri, Horacio Franco, Colleen Richey, Andreas Kathol, Sachin S. Kajarekar |
| 2007 | Irrelevant variability normalization based HMM training using VTS approximation of an explicit model of environmental distortions. Yu Hu, Qiang Huo |
| 2007 | Iterative unit selection with unnatural prosody detection. Dacheng Lin, Yong Zhao, Frank K. Soong, Min Chu, Jieyu Zhao |
| 2007 | JAAE: the java abstract annotation editor. Ivan Habernal, Miloslav Konopík |
| 2007 | Jitter and shimmer measurements for speaker recognition. Mireia Farrús, Javier Hernando, Pascual Ejarque |
| 2007 | Joint position-pitch extraction from multichannel audio. Michael Wohlmayr, Marián Képesi |
| 2007 | Joint speaker segmentation, localization and identification for streaming audio. Joerg Schmalenstroeer, Reinhold Haeb-Umbach |
| 2007 | Kettle hinders cat, shadow does not hinder shed: activation of 'almost embedded' words in nonnative listening. Mirjam Broersma |
| 2007 | Knowledge consistent user simulations for dialog systems. Hua Ai, Diane J. Litman |
| 2007 | L2 consonant identification in noise: cross-language comparisons. Anne Cutler, Martin Cooke, María Luisa García Lecumberri, Dennis Pasveer |
| 2007 | LSA-based language model adaptation for highly inflected languages. Tanel Alumäe, Toomas Kirt |
| 2007 | Landmark-based approach to speech recognition: an alternative to HMMs. Carol Y. Espy-Wilson, Tarun Pruthi, Amit Juneja, Om Deshmukh |
| 2007 | Language identification based on n-gram frequency ranking. Ricardo de Córdoba, Luis Fernando D'Haro, Fernando Fernández Martínez, Javier Macías Guarasa, Javier Ferreiros |
| 2007 | Language identification of person names using CF-IOF based weighing function. Samuel Thomas, Ashish Verma |
| 2007 | Language identification using several sources of information with a multiple-Gaussian classifier. Ricardo de Córdoba, Luis Fernando D'Haro, Fernando Fernández Martínez, Juan Manuel Montero, Roberto Barra-Chicote |
| 2007 | Language model adaptation using latent dirichlet allocation and an efficient topic inference algorithm. Aaron Heidel, Hung-An Chang, Lin-Shan Lee |
| 2007 | Language modeling for automatic turkish broadcast news transcription. Ebru Arisoy, Hasim Sak, Murat Saraclar |
| 2007 | Language modeling using PLSA-based topic HMM. Atsushi Sako, Tetsuya Takiguchi, Yasuo Ariki |
| 2007 | Large-scale random forest language models for speech recognition. Yi Su, Frederick Jelinek, Sanjeev Khudanpur |
| 2007 | Learning dialogue strategies for interactive database search. Verena Rieser, Oliver Lemon |
| 2007 | Learning spoken document similarity and recommendation using supervised probabilistic latent semantic analysis. Kishan Thambiratnam, Frank Seide |
| 2007 | Learning the inter-frame distance for discriminative template-based keyword detection. David Grangier, Samy Bengio |
| 2007 | Learning tone distinctions for Mandarin Chinese. David Weenink, Guangqin Chen, Zongyan Chen, Stefan de Konink, Dennis Vierkant, Eveline van Hagen, R. J. J. H. van Son |
| 2007 | Length, ordering preference and intonational phrasing: evidence from pauses. Gerrit Kentner |
| 2007 | Lexicon adaptation with reduced character error (LARCE) - a new direction in Chinese language modeling. Yi-Cheng Pan, Lin-Shan Lee |
| 2007 | Line cepstral quefrencies and their use for acoustic inventory coding. Guntram Strecha, Matthias Eichner, Rüdiger Hoffmann |
| 2007 | Linear and non linear kernel GMM supervector machines for speaker verification. Réda Dehak, Najim Dehak, Patrick Kenny, Pierre Dumouchel |
| 2007 | Linear prediction of audio signals. Toon van Waterschoot, Marc Moonen |
| 2007 | Linear transformation approach to VTLN using dynamic frequency warping. D. Rama Sanand, D. Dinesh Kumar, Srinivasan Umesh |
| 2007 | Lombard speech impact on perceptual speaker recognition. Ayako Ikeno, John H. L. Hansen |
| 2007 | Loquendo - Politecnico di torino's 2006 NIST speaker recognition evaluation system. Claudio Vair, Daniele Colibro, Fabio Castaldo, Emanuele Dalmasso, Pietro Laface |
| 2007 | MRASTA and PLP in automatic speech recognition. S. R. Mahadeva Prasanna, Hynek Hermansky |
| 2007 | Machine learning for spoken dialogue systems. Oliver Lemon, Olivier Pietquin |
| 2007 | Management of static/dynamic properties in a multimodal interaction system. Kouichi Katsurada, Yuji Okuma, Makoto Yano, Yurie Iribe, Tsuneo Nitta |
| 2007 | Mandarin vowel pronunciation quality evaluation by using formant pattern recognition. Fuping Pan, Qingwei Zhao, Yonghong Yan |
| 2007 | Mel sub-band filtering and compression for robust speech recognition. Babak Nasersharif, Ahmad Akbari, Mohammad Mehdi Homayounpour |
| 2007 | Memory efficient modeling of polyphone context with weighted finite-state transducers. Emilian Stoimenov, John W. McDonough |
| 2007 | Method of LP-based blind restoration for improving intelligibility of bone-conducted speech. Thang Tat Vu, Germine Seide, Masashi Unoki, Masato Akagi |
| 2007 | Minimal pairs and functional loads of sound contrasts obtained from a list of modern greek words. Constandinos Kalimeris, Stelios Bakamidis |
| 2007 | Minimum rank error training for language modeling. Meng-Sung Wu, Jen-Tzung Chien |
| 2007 | Mobile adaptive CALL (MAC): a lightweight speech-based intervention for mobile language learners. Maria Uther, James Uther, Panos Athanasopoulos, Pushpendra Singh, Reiko Akahane-Yamada |
| 2007 | Model-based speech separation with single-microphone input. Siu Wa Lee, Frank K. Soong, Pak-Chung Ching |
| 2007 | Model-driven detection of clean speech patches in noise. Jonathan Laidler, Martin Cooke, Neil D. Lawrence |
| 2007 | Model-space MLLR for trajectory HMMs. Heiga Zen, Yoshihiko Nankaku, Keiichi Tokuda |
| 2007 | Modeling context and language variation for non-native speech recognition. Tien Ping Tan, Laurent Besacier |
| 2007 | Modeling incompletion phenomenon in Mandarin dialog prosody. Jian Yu, Lixing Huang, Jianhua Tao, Xia Wang |
| 2007 | Modeling the statistical behavior of lexical chains to capture word cohesiveness for automatic story segmentation. Shing-kai Chan, Lei Xie, Helen M. Meng |
| 2007 | Modeling tones in hakka on the basis of the command-response model. Wentao Gu, Rerrario Shui-Ching Ho, Tan Lee |
| 2007 | Modelling confusion matrices to improve speech recognition accuracy, with an application to dysarthric speech. Santiago Omar Caballero Morales, Stephen J. Cox |
| 2007 | Modelling prominence and emphasis improves unit-selection synthesis. Volker Strom, Ani Nenkova, Robert A. J. Clark, Yolanda Vazquez-Alvarez, Jason M. Brenier, Simon King, Dan Jurafsky |
| 2007 | Modelling the human-machine gap in speech reception: microscopic speech intelligibility prediction for normal-hearing subjects with an auditory model. Tim Jürgens, Thomas Brand, Birger Kollmeier |
| 2007 | More on acoustic correlates of stress. Daan Wissing |
| 2007 | Morfessor and variKN machine learning tools for speech and language technology. Vesa Siivola, Mathias Creutz, Mikko Kurimo |
| 2007 | Morphological pre-processing technique and its applications on speech signal. Hyun Soo Kim |
| 2007 | Morphosyntactic processing of n-best lists for improved recognition and confidence measure computation. Stéphane Huet, Guillaume Gravier, Pascale Sébillot |
| 2007 | MuLAS: a framework for automatically building multi-tier corpora. Sérgio Paulo, Luís C. Oliveira |
| 2007 | Multi-layer kohonen self-organizing feature map for language identification. Liang Wang, Eliathamby Ambikairajah, Eric H. C. Choi |
| 2007 | Multi-modal user authentication from video for mobile or variable-environment applications. Timothy J. Hazen, Daniel Schultz |
| 2007 | Multi-resolution soft features for channel-robust distributed speech recognition. Valentin Ion, Reinhold Haeb-Umbach |
| 2007 | Multi-step linear prediction based speech dereverberation in noisy reverberant environment. Keisuke Kinoshita, Marc Delcroix, Tomohiro Nakatani, Masato Miyoshi |
| 2007 | Multi-stream features combination based on dempster-shafer rule for LVCSR system. Fabio Valente, Jithendra Vepa, Hynek Hermansky |
| 2007 | Multiband, multisensor robust features for noisy speech recognition. Dimitrios Dimitriadis, Petros Maragos, Stamatios Lefkimmiatis |
| 2007 | Multimodal speech recognition with ultrasonic sensors. Bo Zhu, Timothy J. Hazen, James R. Glass |
| 2007 | Mutual information and the speech signal. Mattias Nilsson, W. Bastiaan Kleijn |
| 2007 | N-best: the northern- and southern-dutch benchmark evaluation of speech recognition technology. Judith M. Kessens, David A. van Leeuwen |
| 2007 | Narrowband to wideband feature expansion for robust multilingual ASR. Dusan Macho |
| 2007 | Natural-emotion GMM transformation algorithm for emotional speaker recognition. Zhenyu Shan, Yingchun Yang, Ruizhi Ye |
| 2007 | Neighborhood density and neighborhood frequency effects in French spoken word recognition. Sophie Dufour, Ulrich H. Frauenfelder |
| 2007 | Nepalese retroflex stops: a static palatography study of inter- and intra-speaker variability. Rajesh Khatiwada |
| 2007 | Never-ending learning with dynamic hidden Markov network. Konstantin Markov, Satoshi Nakamura |
| 2007 | New algorithm for LPC residual estimation from LSF vectors for a voice conversion system. Winston S. Percybrooks, Elliot Moore |
| 2007 | New word acquisition using subword modeling. Ghinwa F. Choueiter, Stephanie Seneff, James R. Glass |
| 2007 | Noise reduction based on adaptive β-order generalized spectral subtraction for speech enhancement. Junfeng Li, Shuichi Sakamoto, Satoshi Hongo, Masato Akagi, Yôiti Suzuki |
| 2007 | Noise robust front-end processing with voice activity detection based on periodic to aperiodic component ratio. Kentaro Ishizuka, Tomohiro Nakatani, Masakiyo Fujimoto, Noboru Miyazaki |
| 2007 | Noise robust speech recognition for voice driven wheelchair. Akira Sasou, Hiroaki Kojima |
| 2007 | Noise robust voice activity detection based on switching kalman filter. Masakiyo Fujimoto, Kentaro Ishizuka |
| 2007 | Noise suppression based on extending a speech-dominated modulation band. Tiago H. Falk, Svante Stadler, W. Bastiaan Kleijn, Wai-Yip Chan |
| 2007 | Noise suppression using search strategy with multi-model compositions. Takatoshi Jitsuhiro, Tomoji Toriyama, Kiyoshi Kogure |
| 2007 | Noise tracking for speech systems in adverse environments. Nitish Krishnamurthy, John H. L. Hansen |
| 2007 | Noise-robust hands-free voice activity detection with adaptive zero crossing detection using talker direction estimation. Yuki Denda, Takamasa Tanaka, Masato Nakayama, Takanobu Nishiura, Yoichi Yamashita |
| 2007 | Non-linear spectral contrast stretching for in-car speech recognition. Weifeng Li, Hervé Bourlard |
| 2007 | Normalized two stage SVQ for minimum complexity wide-band LSF quantization. Saikat Chatterjee, Thippur V. Sreenivas |
| 2007 | Novel eigenpitch-based prosody model for text-to-speech synthesis. Jilei Tian, Jani Nurminen, Imre Kiss |
| 2007 | Novel low-band phase representation for low bit-rate speech coding. Ahmed Ismail, Yasser Dakroury, Hazem M. Abbas |
| 2007 | Objective analysis of the effect of memory inclusion on bandwidth extension of narrowband speech. Amr H. Nour-Eldin, Peter Kabal |
| 2007 | Objective parameters from videokymographic images: a user-friendly interface. Claudia Manfredi, Leonardo Bocchi, Giovanna Cantarella, Giorgio Peretti, Gabriele Guidi, Vincenzo Mezzatesta |
| 2007 | Omnidirectional audio-visual talker localizer with dynamic feature fusion based on validity and reliability criteria. Yuki Denda, Takanobu Nishiura, Yoichi Yamashita |
| 2007 | On automatic prominence detection for German. Fabio Tamburini, Petra Wagner |
| 2007 | On comparing and combining intra-speaker variability compensation and unsupervised model adaptation in speaker verification. Claudio Garretón, Néstor Becerra Yoma, Fernando Huenupán, Carlos Molina |
| 2007 | On filled-pauses and prolongations in european portuguese. Helena Moniz, Ana Isabel Mata, Céu Viana |
| 2007 | On optimal estimation of compressed speech for hearing aids. Dirk Mauler, Anil M. Nagathil, Rainer Martin |
| 2007 | On organic interfaces. Victor Zue |
| 2007 | On the categorical nature of the process involved in schwa elision in French. Audrey Bürki, Cécile Fougeron, Cédric Gendrot |
| 2007 | On the equivalence of Gaussian HMM and Gaussian HMM-like hidden conditional random fields. Georg Heigold, Ralf Schlüter, Hermann Ney |
| 2007 | On the importance of pure prosody in the perception of speaker identity. Elina Helander, Jani Nurminen |
| 2007 | On the jointly unsupervised feature vector normalization and acoustic model compensation for robust speech recognition. Luis Buera, Antonio Miguel, Eduardo Lleida, Oscar Saz, Alfonso Ortega |
| 2007 | On the limitations of voice conversion techniques in emotion identification tasks. Roberto Barra-Chicote, Juan Manuel Montero, Javier Macías Guarasa, Juana M. Gutiérrez-Arriola, Javier Ferreiros, José Manuel Pardo |
| 2007 | On the role of spectral dynamics in unit selection speech synthesis. Barry Kirkpatrick, Darragh O'Brien, Ronan Scaife, Andrew Errity |
| 2007 | On the use of time-delay neural networks for highly accurate classification of stop consonants. Jun Hou, Lawrence R. Rabiner, Sorin Dusan |
| 2007 | On web-based creation of speech resources for less-resourced languages. Christoph Draxler |
| 2007 | Online call quality monitoring for automating agent-based call centers. Woosung Kim |
| 2007 | Online vocabulary adaptation using limited adaptation data. C. E. Liu, Kishan Thambiratnam, Frank Seide |
| 2007 | Ontology-based multimodal high level fusion involving natural language analysis for aged people home care application. Olga Vybornova, Monica Gemo, Ronald Moncarey, Benoît Macq |
| 2007 | Optimization of temporal filters in the modulation frequency domain for constructing robust features in speech recognition. Jeih-weih Hung |
| 2007 | Optimization on decoding graphs by discriminative training. Shiuan-Sung Lin, François Yvon |
| 2007 | Optimized one-bit quantization for adapted GMM-based speaker verification. Ivy H. Tseng, Olivier Verscheure, Deepak S. Turaga, Upendra V. Chaudhari |
| 2007 | Optimizing sentence segmentation for spoken language translation. Sharath Rao, Ian R. Lane, Tanja Schultz |
| 2007 | PCA-based feature extraction for fluctuation in speaking style of articulation disorders. Hironori Matsumasa, Tetsuya Takiguchi, Yasuo Ariki, Ichao Li, Toshitaka Nakabayashi |
| 2007 | PLSA-based topic detection in meetings for adaptation of lexicon and language model. Yuya Akita, Yusuke Nemoto, Tatsuya Kawahara |
| 2007 | Parameter tuning for fast speech recognition. Thomas Colthurst, Tresi Arvizo, Chia-Lin Kao, Owen Kimball, Stephen A. Lowe, David R. H. Miller, Jim Van Sciver |
| 2007 | People watcher: a game for eliciting human-transcribed data for automated directory assistance. Tim Paek, Yun-Cheng Ju, Christopher Meek |
| 2007 | Perception and production of word-final alveolar stops by brazilian portuguese learners of English. Melissa Bettoni-Techio, Andréia S. Rauber, Rosana Denise Koerich |
| 2007 | Perception of disfluency: language differences and listener bias. Catherine Lai, Kyle Gorman, Jiahong Yuan, Mark Y. Liberman |
| 2007 | Perceptual equivalence of approximated Cantonese tone contours. Yujia Li, Tan Lee |
| 2007 | Perceptual musical noise reduction using critical bands tonality coefficients and masking thresholds. Anis Ben Aicha, Sofia Ben Jebara |
| 2007 | Perceptual relevance of pitch contours of Mandarin tones and its efficacy in prosody generation of speech synthesis. Shi-Han Chen, Chih-Chung Kuo |
| 2007 | Perceptual-based playout mechanisms for multi-stream voice over IP networks. Chun-Feng Wu, Cheng-Lung Lee, Wen-Whei Chang |
| 2007 | Performance evaluation of HMM-based style classification with a small amount of training data. Makoto Tachibana, Keigo Kawashima, Junichi Yamagishi, Takao Kobayashi |
| 2007 | Performance evaluation of glottal quality measures from the perspective of vocal tract filter consistency. Juan F. Torres, Elliot Moore |
| 2007 | Performance of speaker-dependent wideband speech coding. Ethan Robert Duni, Bhaskar D. Rao |
| 2007 | Phone boundary detection using selective refinements and context-dependent acoustic features. Sirinoot Boonsuk, Proadpran Punyabukkana, Atiwong Suchato |
| 2007 | Phone-discriminating minimum classification error (p-MCE) training for phonetic recognition. Qian Qian, Xiaodong He, Li Deng |
| 2007 | Phoneme confusions in human and automatic speech recognition. Bernd T. Meyer, Matthias Wächter, Thomas Brand, Birger Kollmeier |
| 2007 | Phoneme dependent frame selection preference. Tingyao Wu, Jacques Duchateau, Dirk Van Compernolle |
| 2007 | Phonetic based sentence level rewriting of questions typed by dyslexic spellers in an information retrieval context. Laurianne Sitbon, Patrice Bellot, Philippe Blache |
| 2007 | Phonetic geminates in cypriot greek: the case of voiceless plosives. Christiana Christodoulou |
| 2007 | Phonotactic spoken language identification with limited training data. Marius Peche, Marelie H. Davel, Etienne Barnard |
| 2007 | Phrases in category-based language models for Spanish and basque ASR. Raquel Justo, M. Inés Torres |
| 2007 | Pitch accent versus lexical stress: quantifying acoustic measures related to the voice source. Yen-Liang Shue, Markus Iseli, Nanette Veilleux, Abeer Alwan |
| 2007 | Pitch estimation of noisy speech signals using empirical mode decomposition. Md. Khademul Islam Molla, Keikichi Hirose, Nobuaki Minematsu, Md. Kamrul Hasan |
| 2007 | Pitch pattern alternation in goshogawara Japanese: evidence for a prosodic phrase above the domain for downstep. Yosuke Igarashi |
| 2007 | Pitch period estimation using multipulse model and wavelet transform. Prasanta Kumar Ghosh, Antonio Ortega, Shrikanth S. Narayanan |
| 2007 | PocketSUMMIT: small-footprint continuous speech recognition. I. Lee Hetherington |
| 2007 | Podcastle: a web 2.0 approach to speech recognition research. Masataka Goto, Jun Ogata, Kouichirou Eto |
| 2007 | Pointing to a target while naming it with /pata/ or /tapa/: the effect of consonants and stress position on jaw-finger coordination. Amélie Rochet-Capellan, Jean-Luc Schwartz, Rafael Laboissière, Arturo Galvàn |
| 2007 | Predicting focus through prominence structure. Sasha Calhoun |
| 2007 | Predicting the consequences of vocalizations in early infancy. Francisco Lacerda, Lisa Gustavsson |
| 2007 | Predicting vowel duration in spontaneous canadian French speech. Darcie Williams, François Poiré |
| 2007 | Predictive minimum Bayes risk classification for robust speech recognition. Jen-Tzung Chien, Koichi Shinoda, Sadaoki Furui |
| 2007 | Prelexical adjustments to speaker idiosyncrasies: are they position-specific? Alexandra Jesse, James M. McQueen |
| 2007 | Preliminary experiments toward automatic generation of new TTS voices from recorded speech alone. Ryuki Tachibana, Tohru Nagano, Gakuto Kurata, Masafumi Nishimura, Noboru Babaguchi |
| 2007 | Preventing an external acoustic noise from being misrecognized as a speech recognition object by confirming the lip movement image signal. Soo-Jong Lee, Jun Park, Eung-Kyeu Kim |
| 2007 | Probabilistic deduction of symbol mappings for extension of lexicons. Rita Singh, Evandro B. Gouvêa, Bhiksha Raj |
| 2007 | Probabilistic latent speaker analysis for large vocabulary speech recognition. Dan Su, Xihong Wu, Huisheng Chi |
| 2007 | Processing image and audio information for recognising discourse participation status through features of face and voice. Nick Campbell, Damien Douxchamps |
| 2007 | Prosody change and response timing analysis in spontaneously spoken dialogs and their modeling in a spoken dialog system. Ryota Nishimura, Norihide Kitaoka, Seiichi Nakagawa |
| 2007 | Prosody, emotions, and... 'whatever'. Stefan Benus, Agustín Gravano, Julia Hirschberg |
| 2007 | Prosody-enriched lattices for improved syllable recognition. Sankaranarayanan Ananthakrishnan, Shrikanth S. Narayanan |
| 2007 | Punctuating confusion networks for speech translation. Roldano Cattoni, Nicola Bertoldi, Marcello Federico |
| 2007 | Pushy versus meek - using avatars to influence turn-taking behaviour. Jens Edlund, Jonas Beskow |
| 2007 | Quality assessment of speech enhancement systems by separation of enhanced speech, noise, and echo. Tim Fingscheidt, Suhadi Suhadi |
| 2007 | Quasi text-independent speaker-verification based on pattern matching. Michael Gerber, René Beutler, Beat Pfister |
| 2007 | RAMCESS/handsketch: a multi-representation framework for realtime and expressive singing synthesis. Nicolas D'Alessandro, Thierry Dutoit |
| 2007 | Rapid and accurate spoken term detection. David R. H. Miller, Michael Kleber, Chia-Lin Kao, Owen Kimball, Thomas Colthurst, Stephen A. Lowe, Richard M. Schwartz, Herbert Gish |
| 2007 | Rapid speaker adaptation by reference model interpolation. Wen Xuan Teng, Guillaume Gravier, Frédéric Bimbot, Frédéric Soufflet |
| 2007 | Rapid unsupervised speaker adaptation using single utterance based on MLLR and speaker selection. Randy Gomez, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano |
| 2007 | Realisations and alternations in German /r/-realisation. Christiane Ulbrich, Horst Ulbrich |
| 2007 | Recent progress in the MIT spoken lecture processing project. James R. Glass, Timothy J. Hazen, D. Scott Cyphers, Igor Malioutov, David Huynh, Regina Barzilay |
| 2007 | Recognition of foreign names spoken by native speakers. Frederik Stouten, Jean-Pierre Martens |
| 2007 | Reconstructing audio signals from modified non-coherent hilbert envelopes. Joachim Thiemann, Peter Kabal |
| 2007 | Recovering punctuation marks for automatic speech recognition. Fernando Batista, Diamantino Caseiro, Nuno J. Mamede, Isabel Trancoso |
| 2007 | Reducing recognition error rate based on context relationships among dialogue turns. Hsu-Chih Wu, Stephanie Seneff |
| 2007 | Regularized feature-based maximum likelihood linear regression for speech recognition. Mohamed Kamal Omar |
| 2007 | Relative evaluation of informativeness in machine generated summaries. BalaKrishna Kolluru, Yoshihiko Gotoh |
| 2007 | Resources for new research directions in speaker recognition: the mixer 3, 4 and 5 corpora. Christopher Cieri, Linda Corson, David Graff, Kevin Walker |
| 2007 | Rhotic variation and schwa epenthesis in windsor French. Ivan Chow, François Poiré |
| 2007 | Rigid vs non-rigid face and head motion in phone and tone perception. Denis Burnham, Jessica Reynolds, Guillaume Vignali, Sandra Bollwerk, Caroline Jones |
| 2007 | Robust F0 modeling for Mandarin speech recognition in noise. Sheng Qiang, Yao Qian, Frank K. Soong, Congfu Xu |
| 2007 | Robust and high-resolution voiced/unvoiced classification in noisy speech using a signal smoothness criterion. A. Sreenivasa Murthy, S. Chandra Sekhar, Thippur V. Sreenivas |
| 2007 | Robust distributed speech recognition using histogram equalization and correlation information. Pedro M. Martinez, José C. Segura, Luz García |
| 2007 | Robust location understanding in spoken dialog systems using intersections. Michael L. Seltzer, Yun-Cheng Ju, Ivan Tashev, Alex Acero |
| 2007 | Robust voice activity detection based on adaptive sub-band energy sequence analysis and harmonic detection. Yanmeng Guo, Qian Qian, Yonghong Yan |
| 2007 | Robust voice activity detection for narrow-bandwidth speaker verification under adverse environments. Tuan Van Pham, Michael Neffe, Gernot Kubin |
| 2007 | Robustness of long time measures of fundamental frequency. Jonas Lindh, Anders Eriksson |
| 2007 | Robustness of several kernel-based fast adaptation methods on noisy LVCSR. Brian Kan-Wing Mak, Roger Wend-Huu Hsiao |
| 2007 | Russian vowels system acoustic features development in ontogenesis. Elena E. Lyakso, Olga V. Frolova |
| 2007 | SPICE: web-based tools for rapid language adaptation in speech processing systems. Tanja Schultz, Alan W. Black, Sameer Badaskar, Matthew Hornyak, John Kominek |
| 2007 | Score distribution scaling for speaker recognition. Vinod Prakash, John H. L. Hansen |
| 2007 | Score fusion for articulatory feature detection. Brian M. Ore, Raymond E. Slyh |
| 2007 | Segment deletion in spontaneous speech: a corpus study using mixed effects models with crossed random effects. Christophe Van Bael, R. Harald Baayen, Helmer Strik |
| 2007 | Segmentation of speech: child's play? Odette Scharenborg, Mirjam Ernestus, Vincent Wan |
| 2007 | Selecting on-topic sentences from natural language corpora. Michael Levit, Elizabeth Boschee, Marjorie Freedman |
| 2007 | Selection of optimal dimensionality reduction method using chernoff bound for segmental unit input HMM. Makoto Sakai, Norihide Kitaoka, Seiichi Nakagawa |
| 2007 | Self-organization in the evolution of shared systems of speech sounds: a computational study. Pierre-Yves Oudeyer |
| 2007 | Semi-supervised learning of speech sounds. Aren Jansen, Partha Niyogi |
| 2007 | Sentence level intelligibility evaluation for Mandarin text-to-speech systems using semantically unpredictable sentences. Jian Li, Dmitry Sityaev, Jie Hao |
| 2007 | Single channel speech separation using maximum a posteriori estimation. Mohammad H. Radfar, Richard M. Dansereau |
| 2007 | Singleton and geminate stops in Finnish - acoustic correlates. Christopher S. Doty, Kaori Idemaru, Susan G. Guion |
| 2007 | Smooth soft mel-spectrographic masks based on blind sparse source separation. Marco Kühne, Roberto Togneri, Sven Nordholm |
| 2007 | Soft margin feature extraction for automatic speech recognition. Jinyu Li, Chin-Hui Lee |
| 2007 | Some evidence on the phonetics and phonology of prosodic phrasing in Russian. Irina Nesterenko, Pavel A. Skrelin |
| 2007 | Sparse Gaussian graphical models for speech recognition. Peter Bell, Simon King |
| 2007 | Speaker adaptation of language models for automatic dialog act segmentation of meetings. Jáchym Kolár, Yang Liu, Elizabeth Shriberg |
| 2007 | Speaker adaptive training for one-to-many eigenvoice conversion based on Gaussian mixture model. Yamato Ohtani, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano |
| 2007 | Speaker clustering using direct maximization of a BIC-based score. Wei-Ho Tsai |
| 2007 | Speaker diarization using normalized cross likelihood ratio. Viet Bac Le, Odile Mella, Dominique Fohr |
| 2007 | Speaker recognition by combining MFCC and phase information. Seiichi Nakagawa, Kouhei Asakawa, Longbiao Wang |
| 2007 | Speaker recognition using kernel-PCA and intersession variability modeling. Hagai Aronowitz |
| 2007 | Speaker role based structural classification of broadcast news stories. BalaKrishna Kolluru, Yoshihiko Gotoh |
| 2007 | Speaker verification with multiple classifier fusion using Bayes based confidence measure. Fernando Huenupán, Néstor Becerra Yoma, Carlos Molina, Claudio Garretón |
| 2007 | Speaking rate effects in a landmark-based phonetic exemplar model. Travis Wade, Bernd Möbius |
| 2007 | Speaking through a noisy channel - experiments on inducing clarification behaviour in human-human dialogue. David Schlangen, Raquel Fernández |
| 2007 | Spectro-temporal analysis of speech using 2-d Gabor filters. Tony Ezzat, Jake V. Bouvrie, Tomaso A. Poggio |
| 2007 | Spectro-temporal processing for blind estimation of reverberation time and single-ended quality measurement of reverberant speech. Tiago H. Falk, Hua Yuan, Wai-Yip Chan |
| 2007 | Speech based drug information system for aged and visually impaired persons. Géza Németh, Gábor Olaszy, Mátyás Bartalis, Géza Kiss, Csaba Zainkó, Péter Mihajlik |
| 2007 | Speech coding and information processing by auditory neurons. Huan Wang, Werner Hemmert |
| 2007 | Speech enhancement using PCA and variance of the reconstruction error model identification. Amin Haji Abolhassani, Sid-Ahmed Selouani, Douglas D. O'Shaughnessy, Mohamed Faouzi Harkat |
| 2007 | Speech enhancement using multi-reference noise reduction in a vehicle environment. Abderrahman Essebbar, Tristan Poinsard |
| 2007 | Speech enhancement with improved a posteriori SNR computation. Suhadi Suhadi, Tim Fingscheidt |
| 2007 | Speech feature compensation based on pseudo stereo codebooks for robust speech recognition in additive noise environments. Tsung-hsueh Hsieh, Jeih-weih Hung |
| 2007 | Speech fundamental frequency estimation using the alternate comb. Jean-Sylvain Liénard, François Signol, Claude Barras |
| 2007 | Speech mining in noisy audio message corpus. Nathalie Camelin, Frédéric Béchet, Géraldine Damnati, Renato De Mori |
| 2007 | Speech perception in children with speech sound disorder. H. Timothy Bunnell, N. Carolyn Schanen, Linda D. Vallino, Thierry G. Morlet, James B. Polikoff, Jennette D. Driscoll, James T. Mantell |
| 2007 | Speech quality after major surgery of the oral cavity and oropharynx with microvascular soft tissue reconstruction. Irma Verdonck-de Leeuw, Louis ten Bosch, Li Ying Chao, Rico N. P. M. Rinkel, Pepijn A. Borggreven, Lou Boves, C. René Leemans |
| 2007 | Speech quality estimation using packet loss effects in CELP-type speech coders. Min-Ki Lee, Kyung-Tae Kim, Hong-Goo Kang, Dae Hee Youn |
| 2007 | Speech recognition techniques for a sign language recognition system. Philippe Dreuw, David Rybach, Thomas Deselaers, Morteza Zahedi, Hermann Ney |
| 2007 | Speech recognition with factorial-HMM syllabic acoustic models. Gianpaolo Coro, Francesco Cutugno, Fulvio Caropreso |
| 2007 | Speech recognition with state-based nearest neighbour classifiers. Thomas Deselaers, Georg Heigold, Hermann Ney |
| 2007 | Speech reinforcement based on partial specific loudness. Jong Won Shin, Woohyung Lim, June Sig Sung, Nam Soo Kim |
| 2007 | Speech synthesis enhancement in noisy environments. Davide Bonardo, Enrico Zovato |
| 2007 | Speech to chant transformation with the phase vocoder. Axel Röbel, Joshua Fineberg |
| 2007 | Speech-based annotation and retrieval of digital photographs. Timothy J. Hazen, Brennan Sherry, Mark Adler |
| 2007 | Speech-nonspeech discrimination using the information bottleneck method and spectro-temporal modulation index. Maria E. Markaki, Michael Wohlmayr, Yannis Stylianou |
| 2007 | Speechindexer in action: managing endangered Formosan languages. Jozsef Szakos, Ulrike Glavitsch |
| 2007 | Speeding-up neural network training using sentence and frame selection. Stefano Scanzio, Pietro Laface, Roberto Gemello, Franco Mana |
| 2007 | Spoken language identification using score vector modeling and support vector machine. Ming Li, Hongbin Suo, Xiao Wu, Ping Lu, Yonghong Yan |
| 2007 | Spoken word recognition of Chinese homophones: a further investigation. Michael C. W. Yip |
| 2007 | Spontaneous speech synthesis by pronunciation variant selection - a comparison to natural speech. Steffen Werner, Rüdiger Hoffmann |
| 2007 | Stabilised weighted linear prediction - a robust all-pole method for speech processing. Carlo Magi, Tom Bäckström, Paavo Alku |
| 2007 | Statistical identification of critical, dependent and redundant articulators. Veena D. Singampalli, Philip J. B. Jackson |
| 2007 | Statistical vowelization of Arabic text for speech synthesis in speech-to-speech translation systems. Liang Gu, Wei Zhang, Lazkin Tahir, Yuqing Gao |
| 2007 | String and lattice based discriminative training for the corpus of spontaneous Japanese lecture transcription task. Erik McDermott, Atsushi Nakamura |
| 2007 | Structural Bayesian language modeling and adaptation. Sibel Yaman, Jen-Tzung Chien, Chin-Hui Lee |
| 2007 | Structural assessment of language learners' pronunciation. Nobuaki Minematsu, K. Kamata, Satoshi Asakawa, Takehiko Makino, Tazuko Nishimura, Keikichi Hirose |
| 2007 | Structure-based and template-based automatic speech recognition - comparing parametric and non-parametric approaches. Li Deng, Helmer Strik |
| 2007 | Study on speaker verification with non-audible murmur segments. Hideki Okamoto, Mariko Kojima, Tomoko Matsui, Hiromichi Kawanami, Hiroshi Saruwatari, Kiyohiro Shikano |
| 2007 | Style estimation of speech based on multiple regression hidden semi-Markov model. Takashi Nose, Yoichi Kato, Takao Kobayashi |
| 2007 | Subword-based position specific posterior lattices (s-PSPL) for indexing speech information. Yi-Cheng Pan, Hung-Lin Chang, Berlin Chen, Lin-Shan Lee |
| 2007 | Support vector regression for speaker verification. Ignacio López-Moreno, Ismael Mateos-Garcia, Daniel Ramos, Joaquin Gonzalez-Rodriguez |
| 2007 | Suprasegmental aspects of pre-lexical speech in cochlear implanted children. Øydis Hide, Steven Gillis, Paul Govaerts |
| 2007 | Syllable lattices as a basis for a children's speech reading tracker. Daniel Bolaños, Wayne H. Ward, Sarel van Vuuren, Javier Garrido Salas |
| 2007 | Syllable timing patterns in Polish: results from annotation mining. Dafydd Gibbon, Jolanta Bachan, Grazyna Demenko |
| 2007 | Synthesis of prosodic attitudinal variants in German backchannel ja. Thorsten Stocksmeier, Stefan Kopp, Dafydd Gibbon |
| 2007 | System request detection in conversation based on acoustic and speaker alternation features. Tomoyuki Yamagata, Atsushi Sako, Tetsuya Takiguchi, Yasuo Ariki |
| 2007 | Tagging syllable boundaries with joint n-gram models. Helmut Schmid, Bernd Möbius, Julia Weidenkaff |
| 2007 | Temporal alignment of creaky voice in neutralised realisations of an underlying, post-nasal voicing contrast in German. Tina John, Jonathan Harrington |
| 2007 | Temporal downtrends in Czech read speech. Jan Volín, Radek Skarnitzl |
| 2007 | Temporal episodic memory model: an evolution of minerva2. Viktoria Maier, Roger K. Moore |
| 2007 | Temporal masking for unsupervised minimum Bayes risk speaker adaptation. Matthew Gibson, Thomas Hain |
| 2007 | Testing the relevance of speech rate, pitch and a glottal Chink for the perception of age in synthesized speech using formant synthesis. Ralf Winkler |
| 2007 | Text island spotting in large speech databases. Benjamin Lecouteux, Georges Linarès, Frédéric Beaugendre, Pascal Nocera |
| 2007 | The BBN 2007 displayless English/iraqi speech-to-speech translation system. David Stallard, Fred Choi, Chia-Lin Kao, Kriste Krstovski, Premkumar Natarajan, Rohit Prasad, Shirin Saleem, Krishna Subramanian |
| 2007 | The IRST English-Spanish translation system for european parliament speeches. Daniele Falavigna, Nicola Bertoldi, Fabio Brugnara, Roldano Cattoni, Mauro Cettolo, Boxing Chen, Marcello Federico, Diego Giuliani, Roberto Gretter, Deepa Gupta, Dino Seppi |
| 2007 | The ISL 2007 English speech transcription system for european parliament speeches. Sebastian Stüker, Christian Fügen, Florian Kraft, Matthias Wölfel |
| 2007 | The RWTH 2007 TC-STAR evaluation system for european English and Spanish. Jonas Lööf, Christian Gollan, Stefan Hahn, Georg Heigold, Björn Hoffmeister, Christian Plahl, David Rybach, Ralf Schlüter, Hermann Ney |
| 2007 | The SRI/OGI 2006 spoken term detection system. Dimitra Vergyri, Izhak Shafran, Andreas Stolcke, Venkata Ramana Rao Gadde, Murat Akbacak, Brian Roark, Wen Wang |
| 2007 | The blame game: performance analysis of speaker diarization system components. Marijn Huijbregts, Chuck Wooters |
| 2007 | The buckeye corpus of speech: updates and enhancements. Eric Fosler-Lussier, Laura Dilley, Na'im R. Tyson, Mark A. Pitt |
| 2007 | The developmental analysis of demonstrative expression skills utilizing a multimodal infant behavior corpus. Shinya Kiriyama, Ryo Tsuji, Tomohiko Kasami, Shogo Ishikawa, Naofumi Otani, Hiroaki Horiuchi, Yoichi Takebayashi, Shigeyoshi Kitazawa |
| 2007 | The duration of speech pauses in a multilingual environment. Mike Demol, Werner Verhelst, Piet Verhoeve |
| 2007 | The effect of filled pauses in a lecture speech on impressive evaluation of listeners. Hiromitsu Nishizaki, Mitsuhiro Somiya, Kenji Kobayashi, Yoshihiro Sekiguchi |
| 2007 | The effect of highband harmonic structure in the artificial bandwidth expansion of telephone speech. Hannu Pulakka, Paavo Alku, Laura Laaksonen, Päivi Valve |
| 2007 | The effect of speech interface accuracy on driving performance. Andrew L. Kun, Tim Paek, Zeljko Medenica |
| 2007 | The effect of the additivity assumption on time and frequency domain wiener filtering for speech enhancement. Kamil K. Wójcicki, Stephen So, Kuldip K. Paliwal |
| 2007 | The harming part of room acoustics in automatic speech recognition. Rico Petrick, Kevin Lohde, Matthias Wolff, Rüdiger Hoffmann |
| 2007 | The harmonic model codec (HMC) framework for voIP. Yannis Agiomyrgiannakis, Yannis Stylianou |
| 2007 | The influence of masking words on the prediction of TRPs in a shadowed dialog. Wieneke Wesseling, R. J. J. H. van Son, Louis C. W. Pols |
| 2007 | The influence of speech activity detection and overlap on speaker diarization for meeting room recordings. Corinne Fredouille, Nicholas W. D. Evans |
| 2007 | The influence of user tailoring and cognitive load on user performance in spoken dialogue systems. Andi Winterboer, Jiang Hu, Johanna D. Moore, Clifford Nass |
| 2007 | The influence of utterance chunking on machine translation performance. Christian Fügen, Muntsin Kolss |
| 2007 | The influence of vowel quality features on peak alignment. Matthias Jilka, Bernd Möbius |
| 2007 | The intelligibility and its relations to acoustic characteristics of English /s/ and /esh/ produced by native speakers of Japanese. Akiyo Joto, Yoshiki Nagase, Seiya Funatsu |
| 2007 | The limits of multidimensional category learning. Martijn Goudbeek, Daniel Swingley, Keith R. Kluender |
| 2007 | The neural basis of speech perception - a view from functional imaging. Sophie K. Scott |
| 2007 | The neutral tone in question intonation in Mandarin. Fang Liu, Yi Xu |
| 2007 | The phonetic exponency of phrasal accentuation in French and German. William J. Barry, Bistra Andreeva, Ingmar Steiner |
| 2007 | The phonetics and phonology of high and low tones in two falling f0-contours in standard German. Tamara Rathcke, Jonathan Harrington |
| 2007 | The relationship between the perception and production of English nasal codas by brazilian learners of English. Denise Cristina Kluge, Andréia S. Rauber, Mara Silvia Reis, Ricardo Augusto Hoffmann Bion |
| 2007 | The relevance of feature type for the automatic classification of emotional user states: low level descriptors and functionals. Björn W. Schuller, Anton Batliner, Dino Seppi, Stefan Steidl, Thurid Vogt, Johannes Wagner, Laurence Devillers, Laurence Vidrascu, Noam Amir, Loïc Kessous, Vered Aharonson |
| 2007 | The role of intonation and voice quality in the affective speech perception. Ioulia Grichkovtsova, Anne Lacheret, Michel Morel |
| 2007 | The role of metrical stress in comprehension and production in dutch children at-risk of dyslexia. Petra van Alphen, Elise de Bree, Paula Fikkert, Frank Wijnen |
| 2007 | The role of outer hair cell function in the perception of synthetic versus natural speech. Maria K. Wolters, Pauline Campbell, Christine DePlacido, Amy Liddell, David Owens |
| 2007 | The virtual guide: a direction giving embodied conversational agent. Mariët Theune, Dennis Hofs, Marco van Kessel |
| 2007 | The voice-rate dialog system for consumer ratings. Geoffrey Zweig, Patrick Nguyen, Yun-Cheng Ju, Ye-Yi Wang, Dong Yu, Alex Acero |
| 2007 | The voiceTRAN machine translation system. Jerneja Zganec-Gros, Stanislav Gruden |
| 2007 | Thinking outside the cube: modeling language processing tasks in a multiple resource paradigm. Kilian G. Seeber |
| 2007 | Time-compressed speech perception with speech and noise maskers. Douglas Brungart, Nandini Iyer |
| 2007 | Time-domain blind audio source separation using advanced ICA methods. Zbynek Koldovský, Petr Tichavský |
| 2007 | Time-varying pre-emphasis and inverse filtering of speech. Karl Schnell, Arild Lacroix |
| 2007 | Time-warping and re-phasing in packet loss concealment. Robert Zopf, Jes Thyssen, Juin-Hwey Chen |
| 2007 | Tone production by the speakers of different age-and-gender groups. Wai-Sum Lee |
| 2007 | Top-down effects on compensation for coarticulation are not replicable. Holger Mitterer |
| 2007 | Topic estimation with domain extensibility for guiding user's out-of-grammar utterances in multi-domain spoken dialogue systems. Satoshi Ikeda, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno |
| 2007 | Topic in dialogue: prosodic and syntactic features. Claudia Crocco, Renata Savy |
| 2007 | Towards better language modeling for Thai LVCSR. Markpong Jongtaveesataporn, Issara Thienlikit, Chai Wutiwiwatchai, Sadaoki Furui |
| 2007 | Towards online speech summarization. Gabriel Murray, Steve Renals |
| 2007 | Trainable speaker diarization. Hagai Aronowitz |
| 2007 | Translating conversational speech to standard linguistic form. Darren Scott Appling, Nick Campbell |
| 2007 | Two-stage system for robust neutral/lombard speech recognition. Hynek Boril, Petr Fousek, Harald Höge |
| 2007 | Two-stream emotion recognition for call center monitoring. Purnima Gupta, Nitendra Rajput |
| 2007 | Unsupervised HMM classification of F0 curves. Damien Lolive, Nelly Barbot, Olivier Boëffard |
| 2007 | Unsupervised categorisation approaches for technical support automated agents. Amparo Albalate, Dimitar Dimitrov, Roberto Pieraccini |
| 2007 | Unsupervised re-scoring of observation probability in viterbi based on reinforcement learning by using confidence measure and HMM neighborhood. Carlos Molina, Néstor Becerra Yoma, Fernando Huenupán, Claudio Garretón |
| 2007 | Unsupervised training of adaptation rate using q-learning in large vocabulary continuous speech recognition. Masafumi Nishida, Yasuo Horiuchi, Akira Ichikawa |
| 2007 | Unsupervised training with directed manual transcription for recognising Mandarin broadcast audio. Kai Yu, Mark J. F. Gales, Philip C. Woodland |
| 2007 | Use of lexical and affective prosodic cues to emotion by younger and older adults. Kate Dupuis, Kathleen Pichora-Fuller |
| 2007 | Use of syllable center detection for improved duration modeling in Chinese Mandarin connected digits recognition. Sergey Astrov, Joachim Hofer, Harald Höge |
| 2007 | Using a small development set to build a robust dialectal Chinese speech recognizer. Linquan Liu, Thomas Fang Zheng, Makoto Akabane, Ruxin Chen, Wenhu Wu |
| 2007 | Using direction of arrival estimate and acoustic feature information in speaker diarization. Chin-Wei Eugene Koh, Hanwu Sun, Tin Lay Nwe, Trung Hieu Nguyen, Bin Ma, Engsiong Chng, Haizhou Li, Susanto Rahardja |
| 2007 | Using eye movements for online evaluation of speech synthesis. Charlotte van Hooijdonk, Edwin Commandeur, Reinier Cozijn, Emiel Krahmer, Erwin Marsi |
| 2007 | Using information state to improve dialogue move identification in a spoken dialogue system. Hua Ai, Antonio Roque, Anton Leuski, David R. Traum |
| 2007 | Using inter-lingual triggers for machine translation. Caroline Lavecchia, Kamel Smaïli, David Langlois, Jean Paul Haton |
| 2007 | Using multiple strategies to manage spoken dialogue. Shiu-Wah Chu, Ian M. O'Neill, Philip Hanna |
| 2007 | Using neutral speech models for emotional speech analysis. Carlos Busso, Sungbok Lee, Shrikanth S. Narayanan |
| 2007 | Using phonetic features in unsupervised word decompounding for ASR with application to a less-represented language. Thomas Pellegrini, Lori Lamel |
| 2007 | Using prosodic and spectral characteristics for sleepiness detection. Jarek Krajewski, Bernd J. Kröger |
| 2007 | Using speech rhythm for acoustic language identification. Ekaterina Timoshenko, Harald Höge |
| 2007 | Using waveform matching techniques in the measurement of shimmer in voiced signals. Carlos A. Ferrer-Riesgo, María Esperanza Hernández-Díaz, Eduardo González-Moreira |
| 2007 | Utilizing online content as domain knowledge in a multi-domain dynamic dialogue system. Craig Wootton, Michael F. McTear, Terry Anderson |
| 2007 | Utterance-final glottalization as a cue for familiar speaker recognition. Tamás Böhm, Stefanie Shattuck-Hufnagel |
| 2007 | VOCALOID - commercial singing synthesizer based on sample concatenation. Hideki Kenmochi, Hayato Ohshita |
| 2007 | VZ-norm: an extension of z-norm to the multivariate case for anchor model based speaker verification. Delphine Charlet, Mikaël Collet, Frédéric Bimbot |
| 2007 | Varying input segmentation for story boundary detection in English, Arabic and Mandarin broadcast news. Andrew Rosenberg, Mehrbod Sharifi, Julia Hirschberg |
| 2007 | Vector-quantization based mask estimation for missing data automatic speech recognition. Maarten Van Segbroeck, Hugo Van hamme |
| 2007 | Virtual fusion for speaker recognition. Yosef A. Solewicz, Moshe Koppel |
| 2007 | Visual analysis of lip coarticulation in VCV utterances. Aseel Turkmani, Adrian Hilton, Philip J. B. Jackson, James D. Edge |
| 2007 | Visual information and redundancy conveyed by internal articulator dynamics in synthetic audiovisual speech. Katja Grauwinkel, Britta Dewitt, Sascha Fagel |
| 2007 | Visualizing acoustic similarities between emotions in speech: an acoustic map of emotions. Khiet P. Truong, David A. van Leeuwen |
| 2007 | Vocabulary selection for a broadcast news transcription system using a morpho-syntactic approach. Ciro Martins, António J. S. Teixeira, João Paulo Neto |
| 2007 | Vocal conversion from speaking voice to singing voice using STRAIGHT. Takeshi Saitou, Masataka Goto, Masashi Unoki, Masato Akagi |
| 2007 | Vocal tract and area function estimation with both lip and glottal losses. Kaustubh Kalgaonkar, Mark A. Clements |
| 2007 | Vocal tract length during speech production. Sorin Dusan |
| 2007 | Voice activated powered wheelchair with non-voice rejection algorithm. Soo-Young Suk, Hiroaki Kojima |
| 2007 | Voice activity detection based on support vector machine using effective feature vectors. Q-Haing Jo, Yun-Sik Park, Kye-Hwan Lee, Ji-Hyun Song, Joon-Hyuk Chang |
| 2007 | Voice activity detection in degraded speech using excitation source information. K. Sri Rama Murty, B. Yegnanarayana, Sunitha Guruprasad |
| 2007 | Voice activity detection using the phase vector in microphone array. Gibak Kim, Nam Ik Cho |
| 2007 | Voice fatigue and use of speech recognition: a study of voice quality ratings. Christel G. de Bruijn, Sandra P. Whiteside |
| 2007 | Voice source and vocal tract variations as cues to emotional states perceived from expressive conversational speech. Hiroki Mori, Hideki Kasuya |
| 2007 | Voicepedia: towards speech-based access to unstructured information. J. Sherwani, Dong Yu, Tim Paek, Mary Czerwinski, Yun-Cheng Ju, Alex Acero |
| 2007 | Voicing level control with application in voice conversion. Jani Nurminen, Jilei Tian, Victor Popa |
| 2007 | Voicing-based codebook in low-rate wideband CELP coding. Driss Guerchi, Tamer Rabie, Abdelrhani Louzi |
| 2007 | Vowel production in two occlusal classes. André Araújo, Luis M. T. Jesus, Isabel M. Costa |
| 2007 | Vowels and tones in infant directed speech: hyperarticulation for both, but different developmental patterns. Nan Xu, Denis Burnham, Christine Kitamura |
| 2007 | Wavelet-based front-end for electromyographic speech recognition. Michael Wand, Szu-Chen Stan Jou, Tanja Schultz |
| 2007 | Web-based language modelling for automatic lecture transcription. Cosmin Munteanu, Gerald Penn, Ronald Baecker |
| 2007 | Weighted frequency warping for voice conversion. Daniel Erro, Asunción Moreno |
| 2007 | What do listeners attend to in hearing prosodic structures? investigating the human speech-parser using short-term recall. Annie C. Gilbert, Victor J. Boucher |
| 2007 | Women's vocal aging: a longitudinal approach. Markus Brckl |
| 2007 | Word confusability - measuring hidden Markov model similarity. Jia-Yu Chen, Peder A. Olsen, John R. Hershey |
| 2007 | Word duration modeling for word graph rescoring in LVCSR. Dino Seppi, Daniele Falavigna, Georg Stemmer, Roberto Gretter |
| 2007 | Word stress correlates in spontaneous child-directed speech in German. Katrin Schneider, Bernd Möbius |
| 2007 | Word-conditioned HMM supervectors for speaker recognition. Howard Lei, Nikki Mirghafori |
| 2007 | Zero-crossing-based ratio masking for sound segregation. Sung Jun An, Young-Ik Kim, Rhee Man Kil |
| 2007 | fMPE-MAP: improved discriminative adaptation for modeling new domains. Jing Zheng, Andreas Stolcke |
| 2007 | ugloss: a framework for improving spoken language generation understandability. Brian Langner, Alan W. Black |