INTERSPEECH A

752 papers

YearTitle / Authors
2007"polyaural" array processing for automatic speech recognition in degraded environments.
Richard M. Stern, Evandro B. Gouvêa, Govindarajan Thattai
20078th Annual Conference of the International Speech Communication Association, INTERSPEECH 2007, Antwerp, Belgium, August 27-31, 2007
2007A Bayesian network classifier for word-level reading assessment.
Joseph Tepperman, Matthew Black, Patti Price, Sungbok Lee, Abe Kazemzadeh, Matteo Gerosa, Margaret Heritage, Abeer Alwan, Shrikanth S. Narayanan
2007A GMM-based probabilistic sequence kernel for speaker verification.
Kong-Aik Lee, Changhuai You, Haizhou Li, Tomi Kinnunen
2007A HMM recognition of consonant-vowel syllables from lip contours: the cued speech case.
Noureddine Aboutabit, Denis Beautemps, Jeanne Clarke, Laurent Besacier
2007A MAP based approach to adaptive speech intelligibility measurements.
Trym Holter, Svein Srsdal
2007A comparative evaluation of the zeros of z transform representation for voice source estimation.
Nicolas Sturmel, Christophe d'Alessandro, Boris Doval
2007A comparative study of speech rate estimation techniques.
Tomas Dekens, Mike Demol, Werner Verhelst, Piet Verhoeve
2007A comparative study on speech summarization of broadcast news and lecture speech.
Jian Zhang, Ricky Ho Yin Chan, Pascale Fung, Lu Cao
2007A comparison of acoustic features for articulatory inversion.
Chao Qin, Miguel Á. Carreira-Perpiñán
2007A comparison of estimated and MAP-predicted formants and fundamental frequencies with a speech reconstruction application.
Jonathan Darch, Ben Milner
2007A comparison of session variability compensation techniques for SVM-based speaker recognition.
Mitchell McLaren, Robbie Vogt, Brendan Baker, Sridha Sridharan
2007A comparison of speaker clustering and speech recognition techniques for air situational awareness.
Wade Shen, Douglas A. Reynolds
2007A computational model for unsupervised word discovery.
Louis ten Bosch, Bert Cranen
2007A conservative aggressive subspace tracker.
Koby Crammer
2007A corpus study of the 3
Yiya Chen, Jiahong Yuan
2007A data visualization and analysis method for natural language call routing system design.
Hong-Kwang Jeff Kuo, Vaibhava Goel
2007A fast fuzzy keyword spotting algorithm based on syllable confusion network.
Jian Shao, Qingwei Zhao, Pengyuan Zhang, Zhaojie Liu, Yonghong Yan
2007A fast optimization method for large margin estimation of HMMs based on second order cone programming.
Yan Yin, Hui Jiang
2007A fine pitch model for speech.
Jasha Droppo, Alex Acero
2007A flexible spectral modification method based on temporal decomposition and Gaussian mixture model.
Binh Phu Nguyen, Masato Akagi
2007A four-cube FEM model of the extrinsic and intrinsic tongue muscles to simulate the production of vowel /i/.
Sayoko Takano, Hiroki Matsuzaki, Kunitoshi Motoki
2007A framework of reply speech generation for concept-to-speech conversion in spoken dialogue systems.
Seiya Takada, Yuji Yagi, Keikichi Hirose, Nobuaki Minematsu
2007A generic methodology of converting transliterated text to phonetic strings case study: greeklish.
Nikos Tsourakis, Vassilios Digalakis
2007A learning method for Thai phonetization of English words.
Ausdang Thangthai, Chai Wutiwiwatchai, Anocha Rugchatjaroen, Sittipong Saychum
2007A method for evaluating task-oriented spoken dialog translation systems based on communication efficiency.
Toshiyuki Takezawa, Masahide Mizushima, Tohru Shimizu, Gen-ichiro Kikui
2007A methodology for the automatic detection of perceived prominent syllables in spoken French.
Jean-Philippe Goldman, Mathieu Avanzi, Anne-Catherine Simon, Anne Lacheret, Antoine Auchlin
2007A model of glottal flow incorporating viscous-inviscid interaction.
Tokihiko Kaburagi, Yosuke Tanabe
2007A model-based estimation of phonotactic language verification performance.
Kakeung Wong, Man-Hung Siu, Brian Mak
2007A morpho-graphemic approach for the recognition of spontaneous speech in agglutinative languages - like Hungarian.
Péter Mihajlik, Tibor Fegyó, Zoltán Tüske, Pavel Ircing
2007A multiple-model based framework for automatic speech segmentation.
Seung Seop Park, Jong Won Shin, Jong Kyu Kim, Nam Soo Kim
2007A multitask learning perspective on acoustic-articulatory inversion.
Korin Richmond
2007A new approach for phoneme segmentation of speech signals.
Ladan Golipour, Douglas D. O'Shaughnessy
2007A new kernel for SVM MLLR based speaker recognition.
Zahi N. Karam, William M. Campbell
2007A novel 2kb/s waveform interpolation speech coder based on non-negative matrix factorization.
Peng Zhang, Changchun Bao
2007A novel energy distribution comparison approach for robust speech spectrum vector quantization.
Ahmed Ismail, Yasser Dakroury, Hazem M. Abbas
2007A packetization and variable bitrate interframe compression scheme for vector quantizer-based distributed speech recognition.
Bengt J. Borgström, Abeer Alwan
2007A pair-based language model for the robust lexical analysis in Chinese text-to-speech synthesis.
Wu Liu, Dezhi Huang, Yuan Dong, Xinnian Mao, Haila Wang
2007A paradigm for mobile speech-centric services.
Lars Bo Larsen, Kasper Løvborg Jensen, Søren Larsen, Morten Højfeldt Rasmussen
2007A phonetic concatenative approach of labial coarticulation.
Vincent Robert, Yves Laprie, Anne Bonneau
2007A phonetic search approach to the 2006 NIST spoken term detection evaluation.
Roy Wallace, Robbie Vogt, Sridha Sridharan
2007A pitch extraction system based on phase locked loops and consensus decision.
Patricia A. Pelle, Claudio Estienne
2007A portable record player for wax cylinders using a laser-beam reflection method.
Tohru Ifukube, Yasuyuki Shimizu
2007A preselection method based on cost degradation from the optimal sequence for concatenative speech synthesis.
Nobuyuki Nishizawa, Hisashi Kawai
2007A reference model weighting-based method for robust speech recognition.
Yuan-Fu Liao, Yh-Her Yang, Chi-Hui Hsu, Cheng-Chang Lee, Jing-Teng Zeng
2007A robust mel-scale subband voice activity detector for a car platform.
Agustín Álvarez-Marquina, Rafael Martínez, Pedro Gómez, Victor Nieto Lluis, V. Rodellar
2007A robust multi-phase pitch-mark detection algorithm.
Milan Legát, Jindrich Matousek, Daniel Tihelka
2007A robust stopping criterion for agglomerative hierarchical clustering in a speaker diarization system.
Kyu Jeong Han, Shrikanth S. Narayanan
2007A rule-based speech morphing for verifying a expressive speech perception model.
Chun-Fang Huang, Masato Akagi
2007A saliency-based auditory attention model with applications to unsupervised prominent syllable detection in speech.
Ozlem Kalinli, Shrikanth S. Narayanan
2007A semi-automatic approach for speaker mining of tapped telephone conversations.
Sandeep Manocha, Carol Y. Espy-Wilson
2007A semi-supervised learning approach for morpheme segmentation for an Arabic dialect.
Mei Yang, Jing Zheng, Andreas Kathol
2007A semi-supervised method for efficient construction of statistical spoken language understanding resources.
Seokhwan Kim, Minwoo Jeong, Gary Geunbae Lee
2007A smoothing kernel for spatially related features and its application to speaker verification.
Luciana Ferrer, M. Kemal Sönmez, Elizabeth Shriberg
2007A soft-clustering algorithm for automatic induction of semantic classes.
Elias Iosif, Alexandros Potamianos
2007A speech rate related lip movement model for speech animation.
Wei Zhou, Zengfu Wang
2007A statistical method of evaluating pronunciation proficiency for presentation in English.
Seiichi Nakagawa, Kei Ohta
2007A statistical model based post-filtering algorithm for residual echo suppression.
Seung Yeol Lee, Jong Won Shin, Hwan Sik Yun, Nam Soo Kim
2007A straightforward and efficient implementation of the factor analysis model for speaker verification.
Driss Matrouf, Nicolas Scheffer, Benoit G. B. Fauve, Jean-François Bonastre
2007A structured speech model parameterized by recursive dynamics and neural networks.
Roberto Togneri, Li Deng
2007A study on temporal features derived by analytic signal.
Yotaro Kubo, Shigeki Okawa, Akira Kurematsu, Katsuhiko Shirai
2007A study on word detector design and knowledge-based pruning and rescoring.
Chengyuan Ma, Chin-Hui Lee
2007A sub-optimal viterbi-like search for linear dynamic models classification.
Dimitris Oikonomidis, Vassilios Diakoloukas, Vassilios Digalakis
2007A system for transforming the emotion in speech: combining data-driven conversion techniques for prosody and voice quality.
Zeynep Inanoglu, Steve J. Young
2007A tagging algorithm for mixed language identification in a noisy domain.
Mike Rosner, Paulseph-John Farrugia
2007A text-constrained prosodic system for speaker verification.
Elizabeth Shriberg, Luciana Ferrer
2007A text-free approach to assessing nonnative intonation.
Joseph Tepperman, Abe Kazemzadeh, Shrikanth S. Narayanan
2007A trainable excitation model for HMM-based speech synthesis.
Ranniery Maia, Tomoki Toda, Heiga Zen, Yoshihiko Nankaku, Keiichi Tokuda
2007A unified approach to multi-pose audio-visual ASR.
Patrick Lucey, Gerasimos Potamianos, Sridha Sridharan
2007A unified probabilistic generative framework for extractive spoken document summarization.
Yi-Ting Chen, Hsuan-Sheng Chiu, Hsin-Min Wang, Berlin Chen
2007A uniformly most powerful test for statistical model-based voice activity detection.
Keun Won Jang, Dong Kook Kim, Joon-Hyuk Chang
2007A variational approach to robust maximum likelihood estimation for speech recognition.
Mohamed Kamal Omar
2007ASR-based pronunciation training: scoring accuracy and pedagogical effectiveness of a system for dutch L2 learners.
Catia Cucchiarini, Ambra Neri, Febe de Wet, Helmer Strik
2007Accelerating the annotation of lexical data for less-resourced languages.
Gerhard B. Van Huyssteen, Martin J. Puttkammer
2007Accent assignment algorithm in Hungarian, based on syntactic analysis.
Anne Tamm, Kálmán Abari, Gábor Olaszy
2007Accurate marginalization range for missing data recognition.
Sébastien Demange, Christophe Cerisara, Jean Paul Haton
2007Acoustic analysis of the neutral tone in Mandarin.
Philippe Martin, Jun Li
2007Acoustic and affective comparisons of natural and imaginary infant-, foreigner- and adult-directed speech.
Monja A. Knoll, Lisa Scharrer
2007Acoustic correlates of intelligibility enhancements in clearly produced fricatives.
Kazumi Maniwa, Allard Jongman, Travis Wade
2007Acoustic correlates of laryngeal-muscle fatigue: findings for a phonometric prevention of acquired voice pathologies.
Victor J. Boucher
2007Acoustic features of anger utterances during natural dialog.
Yoshiko Arimoto, Sumio Ohno, Hitoshi Iida
2007Acoustic language identification using fast discriminative training.
Fabio Castaldo, Daniele Colibro, Emanuele Dalmasso, Pietro Laface, Claudio Vair
2007Acoustic parameters for the automatic detection of vowel nasalization.
Tarun Pruthi, Carol Y. Espy-Wilson
2007Acoustic-phonetic features for refining the explicit speech segmentation.
Antonio Marcos Selmini, Fábio Violaro
2007Acquisition and synchronization of multimodal articulatory data.
Michael Aron, Nicolas Ferveur, Erwan Kerrien, Marie-Odile Berger, Yves Laprie
2007Acquisition of vowel duration in children speaking american English.
Eon-Suk Ko
2007Active binaural distance estimation for dynamic sources.
Yan-Chen Lu, Martin Cooke, Heidi Christensen
2007Adaptive weighting of microphone arrays for distant-talking F0 and voiced/unvoiced estimation.
Federico Flego, Christian Zieger, Maurizio Omologo
2007Adding noise to improve noise robustness in speech recognition.
Nicolás Morales, Liang Gu, Yuqing Gao
2007Advanced front-end for robust speech recognition in extremely adverse environments.
Dimitrios Dimitriadis, José C. Segura, Luz García, Alexandros Potamianos, Petros Maragos, Vassilis Pitsikalis
2007Advances in Mandarin broadcast speech recognition.
Mei-Yuh Hwang, Wen Wang, Xin Lei, Jing Zheng, Özgür Çetin, Gang Peng
2007Advances in speechfind: transcript reliability estimation employing confidence measure based on discriminative sub-word model for SDR.
Wooil Kim, John H. L. Hansen
2007Age-related changes in fundamental frequency and formants: a longitudinal study of four speakers.
Jonathan Harrington, Sallyanne Palethorpe, Catherine I. Watson
2007Alignment of the second low target in dutch falling-rising pitch contours.
Jörg Peters, Judith Hanssen, Carlos Gussenhoven
2007Always listening to you: creating exhaustive audio database in home environments.
Yasunari Obuchi, Akio Amano
2007Ambient telephony: scenarios and research challenges.
Aki Härmä
2007An 8-32 kbit/s scalable wideband coder extended with MDCT-based bandwidth extension on top of a 6.8 kbit/s narrowband CELP coder.
Masahiro Oshikiri, Hiroyuki Ehara, Toshiyuki Morii, Tomofumi Yamanashi, Kaoru Satoh, Koji Yoshida
2007An HMM acoustic model incorporating various additional knowledge sources.
Sakriani Sakti, Konstantin Markov, Satoshi Nakamura
2007An HMM-based speech synthesis system applied to German and its adaptation to a limited set of expressive football announcements.
Sacha Krstulovic, Anna Hunecke, Marc Schröder
2007An MRI study of european portuguese nasals.
Paula Martins, Inês Carbone, Augusto Silva, António J. S. Teixeira
2007An active approach to speaker and task adaptation based on automatic analysis of vocabulary confusability.
Qiang Huo, Wei Li
2007An analysis of individual differences in the f
Hiromi Kawatsu, Sumio Ohno
2007An approach to efficient generation of high-accuracy and compact error-corrective models for speech recognition.
Takanobu Oba, Takaaki Hori, Atsushi Nakamura
2007An approach to iterative speech feature enhancement and recognition.
Stefan Windmann, Reinhold Haeb-Umbach
2007An approximate solution for perceptually constrained signal subspace speech enhancement method.
Adam Borowicz, Alexander A. Petrovsky
2007An articulatory and acoustic study of "retroflex" and "bunched" american English rhotic sound based on MRI.
Xinhui Zhou, Carol Y. Espy-Wilson, Mark Tiede, Suzanne Boyce
2007An automatic prosody labeling method for Mandarin speech.
Chen-Yu Chiang, Hsiu-Min Yu, Yih-Ru Wang, Sin-Horng Chen
2007An effective initial/final duration prediction method for corpus-based singing voice synthesis of Mandarin Chinese.
Cheng-Yuan Lin, Pei-Chi Jao, Jyh-Shing Roger Jang
2007An empirical investigation of the nonuniqueness in the acoustic-to-articulatory mapping.
Chao Qin, Miguel Á. Carreira-Perpiñán
2007An ensemble modeling approach to joint characterization of speaker and speaking environments.
Yu Tsao, Chin-Hui Lee
2007An evaluation of cross-language adaptation and native speech training for rapid HMM construction based on very limited training data.
Xufang Zhao, Douglas D. O'Shaughnessy
2007An extension 2DPCA based visual feature extraction method for audio-visual speech recognition.
Guanyong Wu, Jie Zhu
2007An improved method for unsupervised training of LVCSR systems.
Christian Gollan, Stefan Hahn, Ralf Schlüter, Hermann Ney
2007An improved speaker diarization system.
Rong Fu, Ian D. Benest
2007An information state based dialogue manager for a mobile robot.
Marcelo Quinderé, Luís Seabra Lopes, António J. S. Teixeira
2007An information theoretic approach to predict speech intelligibility for listeners with normal and impaired hearing.
Svante Stadler, Arne Leijon, Björn Hagerman
2007An integration method of retrieval results using plural subword models for vocabulary-free spoken document retrieval.
Yoshiaki Itoh, Kohei Iwata, Kazunori Kojima, Masaaki Ishigame, Kazuyo Tanaka, Shi-wook Lee
2007An interactive timeline for speech database browsing.
Benoît Favre, Jean-François Bonastre, Patrice Bellot
2007An open-set detection evaluation methodology applied to language and emotion recognition.
David A. van Leeuwen, Khiet P. Truong
2007An optimal speech enhancement under speech uncertainty probability and masking property of auditory system.
Xiaoshan Huang, Xiaoqun Zhao
2007An overview on automatic speech attribute transcription (ASAT).
Chin-Hui Lee, Mark A. Clements, Sorin Dusan, Eric Fosler-Lussier, Keith Johnson, Biing-Hwang Juang, Lawrence R. Rabiner
2007An unsupervised approach to automatic prosodic annotation.
Xinqiang Ni, Yining Chen, Frank K. Soong, Min Chu, Ping Zhang
2007Analysis and classification of speech mode: whispered through shouted.
Chi Zhang, John H. L. Hansen
2007Analysis of communication failures for spoken dialogue systems.
Sebastian Möller, Klaus-Peter Engelbrecht, Antti Oulasvirta
2007Analysis of emotional speech prosody in terms of part of speech tags.
Murtaza Bulut, Sungbok Lee, Shrikanth S. Narayanan
2007Analysis of head motions and speech in spoken dialogue.
Carlos Toshinori Ishi, Hiroshi Ishiguro, Norihiro Hagita
2007Analysis of the impact of analogue telephone channel on MFCC parameters for voice pathology detection.
Rubén Fraile, Juan Ignacio Godino-Llorente, Nicolás Sáenz-Lechón, Víctor Osma-Ruiz, Pedro Gómez-Vilda
2007Analysis of the occurrence of laughter in meetings.
Kornel Laskowski, Susanne Burger
2007Analyzing temporal transition of real user's behaviors in a spoken dialogue system.
Kazunori Komatani, Tatsuya Kawahara, Hiroshi G. Okuno
2007Application of CMLLR in narrow band wide band adapted systems.
Martin Karafiát, Lukás Burget, Jan Cernocký, Thomas Hain
2007Application of shifted delta cepstral features in speaker verification.
José R. Calvo, Rafael Fernández, Gabriel Hernández
2007Application of speech technology in a home based assessment kiosk for early detection of alzheimer's disease.
Rachel Coulston, Esther Klabbers, Jacques de Villiers, John-Paul Hosom
2007Applying word duration constraints by using unrolled HMMs.
Ning Ma, Jon Barker, Phil D. Green
2007Approaches for adaptive database reduction for text-to-speech synthesis.
Aleksandra Krul, Géraldine Damnati, François Yvon, Cédric Boidin, Thierry Moudenc
2007Approximation method of subglottal system using ARMA filter.
Nobuhiro Miki, Kyohei Hayashi
2007Articulatory acoustic feature applications in speech synthesis.
Peter Cahill, Daniel Aioanei, Julie Carson-Berndsen
2007Articulatory feature classifiers trained on 2000 hours of telephone speech.
Joe Frankel, Mathew Magimai-Doss, Simon King, Karen Livescu, Özgür Çetin
2007Articulatory synthesis of singing.
Peter Birkholz
2007Artificial bandwidth extension for speech signals using speech recogniton.
Shingo Kuroiwa, Masashi Takashina, Satoru Tsuge, Fuji Ren
2007Artificial bandwidth extension without side information for ITU-t g.729.1.
Bernd Geiser, Hervé Taddei, Peter Vary
2007Artificial impostor voice transformation effects on false acceptance rates.
Jean-François Bonastre, Driss Matrouf, Corinne Fredouille
2007Aspects of visual speech in Arabic.
Slim Ouni, Kaïs Ouni
2007Assessment of vocal dysperiodicities in connected disordered speech.
Ali Alpan, Abdellah Kacha, Francis Grenez, Jean Schoentgen
2007Attention shift decoding for conversational speech recognition.
Raghunandan Kumaran, Jeff A. Bilmes, Katrin Kirchhoff
2007Attribute-based Mandarin speech recognition using conditional random fields.
Chi-Yueh Lin, Hsiao-Chuan Wang
2007Audio classification using extended baum-welch transformations.
Tara N. Sainath, Victor Zue, Dimitri Kanevsky
2007Audio-based approaches to head orientation estimation in a smart-room.
Alberto Abad, Carlos Segura, Climent Nadeu, Javier Hernando
2007Audio-visual integration for robust speech recognition using maximum weighted stream posteriors.
Rowan Seymour, Darryl Stewart, Ji Ming
2007Audio-visual phoneme classification for pronunciation training applications.
Hedvig Kjellström, Olov Engwall, Sherif Mahdy Abdou, Olle Bälter
2007Audiovisual emotional speech of game playing children: effects of age and culture.
Suleman Shahid, Emiel Krahmer, Marc Swerts
2007Audiovisual speaker identity verification based on lip motion features.
Girija Chetty, Michael Wagner
2007Automated directory assistance system - from theory to practice.
Dong Yu, Yun-Cheng Ju, Ye-Yi Wang, Geoffrey Zweig, Alex Acero
2007Automatic acoustic segmentation for speech recognition on broadcast recordings.
Gang Peng, Mei-Yuh Hwang, Mari Ostendorf
2007Automatic assessment of children's reading level.
Jacques Duchateau, Leen Cleuren, Hugo Van hamme, Pol Ghesquière
2007Automatic building of synthetic voices from large multi-paragraph speech databases.
Kishore Prahallad, Arthur R. Toth, Alan W. Black
2007Automatic detection and classification of disfluent reading miscues in young children's speech for the purpose of assessment.
Matthew Black, Joseph Tepperman, Sungbok Lee, Patti Price, Shrikanth S. Narayanan
2007Automatic estimation of scaling factors among probabilistic models in speech recognition.
Tadashi Emori, Yoshifumi Onishi, Koichi Shinoda
2007Automatic extraction of cue phrases for important sentences in lecture speech and automatic lecture speech summarization.
Yasuhisa Fujii, Norihide Kitaoka, Seiichi Nakagawa
2007Automatic generation of cloze items for prepositions.
John Lee, Stephanie Seneff
2007Automatic head motion prediction from speech data.
Gregor Hofer, Hiroshi Shimodaira
2007Automatic large-scale oral language proficiency assessment.
Febe de Wet, Christa van der Walt, Thomas Niesler
2007Automatic laughter detection using neural networks.
Mary Tai Knox, Nikki Mirghafori
2007Automatic phonetic segmentation of Spanish emotional speech.
Ascensión Gallardo-Antolín, Roberto Barra-Chicote, Marc Schröder, Sacha Krstulovic, Juan Manuel Montero
2007Automatic pitch accent prediction for text-to-speech synthesis.
Ian Read, Stephen Cox
2007Automatic question detection: prosodic-lexical features and crosslingual experiments.
Minh-Quang Vu, Laurent Besacier, Eric Castelli
2007Automatic recognition of connected vowels only using speaker-invariant representation of speech dynamics.
Satoshi Asakawa, Nobuaki Minematsu, Keikichi Hirose
2007Automatic scoring of the intelligibility in patients with cancer of the oral cavity.
Andreas K. Maier, Maria Schuster, Anton Batliner, Elmar Nöth, Emeka Nkenke
2007Automatic speech recognition for an under-resourced language - amharic.
Solomon Teferra Abate, Wolfgang Menzel
2007Automatic speech recognition framework for multilingual audio contents.
Hiroaki Nanjo, Yuichi Oku, Takehiko Yoshimi
2007Automatic speech recognition with a cochlear implant front-end.
Waldo Nogueira, Tamás Harczos, Bernd Edler, Jörn Ostermann, Andreas Büchner
2007Automatic transcription for a web 2.0 service to search podcasts.
Jun Ogata, Masataka Goto, Kouichirou Eto
2007Automatically learning the units of speech by non-negative matrix factorisation.
Veronique Stouten, Kris Demuynck, Hugo Van hamme
2007BECAM tool - a semi-automatic tool for bootstrapping emotion corpus annotation and management.
Slim Abdennadher, Mohamed Aly, Dirk Bühler, Wolfgang Minker, Johannes Pittermann
2007Bayes risk-based optimization of dialogue management for document retrieval system with speech interface.
Teruhisa Misu, Tatsuya Kawahara
2007Behavior models for learning and receptionist dialogs.
Hartwig Holzapfel, Alex Waibel
2007Benchmarking human performance on the acoustic and linguistic subtasks of ASR systems.
László Tóth
2007Bhattacharyya error and divergence using variational importance sampling.
Peder A. Olsen, John R. Hershey
2007Bilingual LSA-based translation lexicon adaptation for spoken language translation.
Yik-Cheung Tam, Tanja Schultz
2007Bit-erasure channel decoding for GMM-based multiple description coding.
Yannis Agiomyrgiannakis, Yannis Stylianou
2007Blind adaptive principal eigenvector beamforming for acoustical source separation.
Ernst Warsitz, Reinhold Haeb-Umbach, Dang Hai Tran Vu
2007Boosting with anti-models for automatic language identification.
Xi Yang, Man-Hung Siu, Herbert Gish, Brian Mak
2007Bootstrapping morphological analysis of gĩkũyũ using unsupervised maximum entropy learning.
Guy De Pauw, Peter Waiganjo Wagacha
2007Building an information retrieval system for serbian - challenges and solutions.
Miroslav Martinovic, Srdjdan Vesic, Goran Rakic
2007Building multiple complementary systems using directed decision trees.
Catherine Breslin, Mark J. F. Gales
2007CALL courseware for learning reactive tokens in face-to-face dialogs.
Takafumi Utashiro, Goh Kawai
2007Can unquantised articulatory feature continuums be modelled?
Odette Scharenborg, Vincent Wan
2007Categorical perception in intonation: a matter of signal dynamics?
Oliver Niebuhr
2007Categorical perception of Cantonese tones in context: a cross-linguistic study.
Hongying Zheng, Peter W. M. Tsang, William S.-Y. Wang
2007Channel selection by class separability measures for automatic transcriptions on distant microphones.
Matthias Wölfel
2007Children's convergence in referring expressions to graphical objects in a speech-enabled computer game.
Linda Bell, Joakim Gustafson
2007Class constrained ROVER based speech enhancement.
Amit Das, John H. L. Hansen
2007Classification of discourse functions of affirmative words in spoken dialogue.
Agustín Gravano, Stefan Benus, Julia Hirschberg, Shira Mitchell, Ilia Vovsha
2007Cluster adaptive training weights as features in SVM-based speaker verification.
Hao Yang, Yuan Dong, Xianyu Zhao, Jian Zhao, Liang Lu, Haila Wang
2007Cluster-based polynomial-fit histogram equalization (CPHEQ) for robust speech recognition.
Shih-Hsiang Lin, Yao-Ming Yeh, Berlin Chen
2007Clustered maximum likelihood linear basis for rapid speaker adaptation.
Yun Tang, Richard C. Rose
2007Clustering-based two-dimensional linear discriminant analysis for speech recognition.
Xiao-Bing Li, Douglas D. O'Shaughnessy
2007Co-training using prosodic and lexical information for sentence segmentation.
Ümit Güz, Sébastien Cuendet, Dilek Hakkani-Tür, Gökhan Tür
2007Collection of empirical data for standardization of generic vocabularies in speech driven ICT devices and services.
Rosemary Orr, Bernat González i Llinares, Françoise Petersen, Helge Hüttenrauch, Martin Böcker, Michael Tate
2007Combination of LSF and pole based parameter interpolation for model-based diphone concatenation.
Karl Schnell, Arild Lacroix
2007Combined acoustic and pronunciation modelling for non-native speech recognition.
Ghazi Bouselmi, Dominique Fohr, Irina Illina
2007Combining frame and turn-level information for robust recognition of emotions within speech.
Bogdan Vlasenko, Björn W. Schuller, Andreas Wendemuth, Gerhard Rigoll
2007Combining length distribution model with decision tree in prosodic phrase prediction.
Qin Shi, Danning Jiang, Fanping Meng, Yong Qin
2007Combining rate and place information for robust pitch extraction.
Martin Heckmann, Frank Joublin, Christian Goerick
2007Combining short-term cepstral and long-term pitch features for automatic recognition of speaker age.
Christian A. Müller, Felix Burkhardt
2007Compact representations of the articulatory-to-acoustic mapping.
Blaise Potard, Yves Laprie
2007Comparing GMM-based speech transformation systems.
Larbi Mesbahi, Vincent Barreaud, Olivier Boëffard
2007Comparing american and palestinian perceptions of charisma using acoustic-prosodic and lexical analysis.
Fadi Biadsy, Julia Hirschberg, Andrew Rosenberg, Wisam Dakka
2007Comparing classifiers for pronunciation error detection.
Helmer Strik, Khiet P. Truong, Febe de Wet, Catia Cucchiarini
2007Comparing praat and snack formant measurements on two large corpora of northern and southern French.
Cécile Woehrling, Philippe Boula de Mareüil
2007Comparison of HMM and DTW methods in automatic recognition of pathological phoneme pronunciation.
Robert Wielgat, Tomasz P. Zielinski, Pawel Swietojanski, Piotr Zoladz, Daniel Król, Tomasz Wozniak, Stanislaw Grabias
2007Comparison of multiple voice source parameters in different phonation types.
Matti Airas, Paavo Alku
2007Comparison of subspace methods for Gaussian mixture models in speech recognition.
Matti Varjokallio, Mikko Kurimo
2007Comparison of two kinds of speaker location representation for SVM-based speaker verification.
Xianyu Zhao, Yuan Dong, Hao Yang, Jian Zhao, Liang Lu, Haila Wang
2007Complementarity and redundancy in multimodal user inputs with speech and pen gestures.
Pui-Yu Hui, Zhengyu Zhou, Helen M. Meng
2007Complementary approaches for voice disorder assessment.
Jean-François Bonastre, Corinne Fredouille, Alain Ghio, Antoine Giovanni, Gilles Pouchoulin, Joana Revis, Bernard Teston, P. Yu
2007Computer-supported human-human multilingual communication.
Alex Waibel, Keni Bernardin, Matthias Wölfel
2007Computerized chironomy: evaluation of hand-controlled intonation reiteration.
Christophe d'Alessandro, Albert Rilliard, Sylvain Le Beux
2007Concept and evaluation of a downward-compatible system for spatial teleconferencing using automatic speaker clustering.
Alexander Raake, Sascha Spors, Jens Ahrens, Jitendra Ajmera
2007Conditional use of word lattices, confusion networks and 1-best string hypotheses in a sequential interpretation strategy.
Bogdan Minescu, Géraldine Damnati, Frédéric Béchet, Renato De Mori
2007Conditionally linear Gaussian models for estimating vocal tract resonances.
Daniel Rudoy, Daniel N. Spendley, Patrick J. Wolfe
2007Confidence measure based unsupervised target model adaptation for speaker verification.
Alexandre Preti, Jean-François Bonastre, Driss Matrouf, François Capman, Bertrand Ravera
2007Confidence measures for voice search applications.
Ye-Yi Wang, Dong Yu, Yun-Cheng Ju, Geoffrey Zweig, Alex Acero
2007Construction and analysis of multiple paths in syllable models.
Annika Hämäläinen, Louis ten Bosch, Lou Boves
2007Construction of a phonotactic dialect corpus using semiautomatic annotation.
Reva Schwartz, Wade Shen, Joseph P. Campbell, Shelley Paget, Julie Vonwiller, Dominique Estival, Christopher Cieri
2007Construction of spoken language model including fillers using filler prediction model.
Kengo Ohta, Masatoshi Tsuchiya, Seiichi Nakagawa
2007Context constrained-generalized posterior probability for verifying phone transcriptions.
Hua Zhang, Lijuan Wang, Frank K. Soong, Wenju Liu
2007Context dependent syllable acoustic model for continuous Chinese speech recognition.
Hao Wu, Xihong Wu
2007Context dependent word modeling for statistical machine translation using part-of-speech tags.
Ruhi Sarikaya, Yonggang Deng, Yuqing Gao
2007Continuous prosodic features and formant modeling with joint factor analysis for speaker verification.
Najim Dehak, Patrick Kenny, Pierre Dumouchel
2007Continuous-speech phone recognition from ultrasound and optical images of the tongue and lips.
Thomas Hueber, Gérard Chollet, Bruce Denby, Gérard Dreyfus, Maureen Stone
2007Contributions of temporal fine structure cues to Chinese speech recognition in cochlear implant simulation.
Lin Yang, Jianping Zhang, Yonghong Yan
2007Control of an articulatory speech synthesizer based on dynamic approximation of spatial articulatory targets.
Peter Birkholz
2007Conversation detection and speaker segmentation in privacy-sensitive situated speech data.
Danny Wyatt, Tanzeem Choudhury, Jeff A. Bilmes
2007Corpus-based generation of prosodic features from text based on generation process model.
Keikichi Hirose, Keiko Ochi, Nobuaki Minematsu
2007Creating multimedia dictionaries of endangered languages using LEXUS.
Jacquelijn Ringersma, Marc Kemps-Snijders
2007Creating spoken dialogue characters from corpora without annotations.
Sudeep Gandhe, David R. Traum
2007Cross-language phonemisation in German text-to-speech synthesis.
Jochen Steigner, Marc Schröder
2007Cross-linguistic analysis of prosodic features for sentence segmentation.
James G. Fung, Dilek Hakkani-Tür, Mathew Magimai-Doss, Elizabeth Shriberg, Sébastien Cuendet, Nikki Mirghafori
2007DFT domain subspace based noise tracking for speech enhancement.
Richard C. Hendriks, Jesper Jensen, Richard Heusdens
2007Degradation-classification assisted single-ended quality measurement of speech.
Hua Yuan, Tiago H. Falk, Wai-Yip Chan
2007Dependence of tone perception on syllable perception.
Michael Olsberg, Yi Xu, Jeremy Green
2007Derivative and parametric kernels for speaker verification.
Chris Longworth, Mark J. F. Gales
2007Design and characterization of the non-native military air traffic communications database (nnMATC).
Stéphane Pigeon, Wade Shen, Aaron D. Lawson, David A. van Leeuwen
2007Design and development of voice controlled aids for motor-handicapped persons.
Petr Cerva, Jan Nouza
2007Design and recording of Czech sign language corpus for automatic sign language recognition.
Pavel Campr, Marek Hrúz, Milos Zelezný
2007Design of a rich multimodal interface for mobile spoken route guidance.
Markku Turunen, Jaakko Hakulinen, Anssi Kainulainen, Aleksi Melto, Topi Hurtig
2007Detecting deception using critical segments.
Frank Enos, Elizabeth Shriberg, Martin Graciarena, Julia Hirschberg, Andreas Stolcke
2007Detecting pitch accent using pitch-corrected energy-based predictors.
Andrew Rosenberg, Julia Hirschberg
2007Detection and removal of switching noise in push-to-talk and voice operated exchange communications systems.
Brett Y. Smolenski
2007Detection of instants of glottal closure using characteristics of excitation source.
Sunitha Guruprasad, B. Yegnanarayana, K. Sri Rama Murty
2007Detection of out-of-vocabulary words in posterior based ASR.
Hamed Ketabdar, Mirko Hannemann, Hynek Hermansky
2007Detection, diarization, and transcription of far-field lecture speech.
Jing Huang, Etienne Marcheret, Karthik Visweswariah, Vit Libal, Gerasimos Potamianos
2007Detection-based ASR in the automatic speech attribute transcription project.
Ilana Bromberg, Qian Qian, Jun Hou, Jinyu Li, Chengyuan Ma, Brett Matthews, Antonio Moreno-Daniel, Jeremy Morris, Sabato Marco Siniscalchi, Yu Tsao, Yu Wang
2007Development of multimodal resources for multilingual information retrieval in the basque context.
Nora Barroso, Aitzol Ezeiza, N. Gilisagasti, Karmele López de Ipiña, A. López, Juan Miguel López
2007Development of preschool children subsystem for ASR and q&a in a real-environment speech-oriented guidance task.
Tobias Cincarek, Izumi Shindo, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano
2007Dimension reduction for speaker identification based on mutual information.
Xugang Lu, Jianwu Dang
2007Dimensionality reduction for speech recognition using neighborhood components analysis.
Natasha Singh-Miller, Michael Collins, Timothy J. Hazen
2007Dimensionality reduction methods applied to both magnitude and phase derived features.
Andrew Errity, John McKenna, Barry Kirkpatrick
2007Dimensionality reduction of speech features using nonlinear principal components analysis.
Stephen A. Zahorian, Tara Singh, Hongbing Hu
2007Direct acoustic feature using iterative EM algorithm and spectral energy for classifying suicidal speech.
T. Yingthawornsuk, H. Kaymaz Keskinpala, D. Mitchell Wilkes, Richard G. Shiavi, Ronald M. Salomon
2007Direct optimisation of a multilayer perceptron for the estimation of cepstral mean and variance statistics.
John Dines, Jithendra Vepa
2007Discrimination and recognition of scaled word sounds.
Toshio Irino, Yoshie Aoki, Yoshie Hayashi, Hideki Kawahara, Roy D. Patterson
2007Discriminative MCE-based speaker adaptation of acoustic models for a spoken lecture processing task.
Timothy J. Hazen, Erik McDermott
2007Discriminative noise adaptive training approach for an environment migration.
Byung Ok Kang, Ho-Young Jung, Yunkeun Lee
2007Discriminative optimization of language adapted HMMs for a language identification system based on parallel phoneme recognizers.
Josef G. Bauer, Bernt Andrassy, Ekaterina Timoshenko
2007Disfluency correction of spontaneous speech using conditional random fields with variable-length features.
Jui-Feng Yeh, Chung-Hsien Wu, Wei-Yen Wu
2007Distinctive phonetic feature (DPF) based phone segmentation using hybrid neural networks.
Mohammad Nurul Huda, Muhammad Ghulam, Junsei Horikawa, Tsuneo Nitta
2007Do different boundary types induce subtle acoustic cues to which French listeners are sensitive?
Odile Bagou, Sophie Dufour, Cécile Fougeron, Alain Content, Ulrich H. Frauenfelder
2007Dual-channel acoustic detection of nasalization states.
Xiaochuan Niu, Jan P. H. van Santen
2007Duration and pauses as boundary-markers in speech: a cross-linguistic study.
Li-chiung Yang
2007Duration and pronunciation conditioned lexical modeling for speaker verification.
Gökhan Tür, Elizabeth Shriberg, Andreas Stolcke, Sachin S. Kajarekar
2007Dynamic integration of multiple feature streams for robust real-time LVCSR.
Shoei Sato, Kazuo Onoe, Akio Kobayashi, Shinichi Homma, Toru Imai, Tohru Takagi, Tetsunori Kobayashi
2007Dynamic language change in MIMUS.
Carmen del Solar, Guillermo Pérez-García, Eva Florencio, David Moral, Gabriel Amores Carredano, Pilar Manchón Portillo
2007Dynamic language model adaptation using presentation slides for lecture speech recognition.
Hiroki Yamazaki, Koji Iwano, Koichi Shinoda, Sadaoki Furui, Haruo Yokota
2007ELAN: a free and open-source multimedia annotation tool.
Han Sloetjes, Albert Russel, Alexander Klassmann
2007EMD based soft-thresholding for speech enhancement.
Erhan Deger, Md. Khademul Islam Molla, Keikichi Hirose, Nobuaki Minematsu, Md. Kamrul Hasan
2007Effect of incomplete glottal closures on estimates of glottal waves via inverse filtering of vowel sounds.
Huiqun Deng, Douglas D. O'Shaughnessy
2007Effect of intensive voice therapy on vocal tremor for parkinson speakers.
Laurence Cnockaert, Jean Schoentgen, Canan Ozsancak, Pascal Auzou, Francis Grenez
2007Effect of number of masking talkers on speech-on-speech masking in Chinese.
Xihong Wu, Jing Chen, Zhigang Yang, Qiang Huang, Mengyuan Wang, Liang Li
2007Effect of unsteady glottal flow on the speech production process.
Hideyuki Nomura, Tetsuo Funada
2007Effect of within- and between-talker variability on word identification in noise by younger and older adults.
Huiwen Goy, Kathleen Pichora-Fuller, Pascal van Lieshout, Gurjit Singh, Bruce Schneider
2007Effects of FE modelled consequences of tonsillectomy on perceptual evaluation of voice.
Anne-Maria Laukkanen, Jaromír Horácek, Pavel Svancara, Elina Lehtinen
2007Effects of non-native dialects on spoken word recognition.
Jennifer T. Le, Catherine T. Best, Michael D. Tyler, Christian Kroos
2007Effects of quiz-style information presentation on user understanding.
Ryuichiro Higashinaka, Kohji Dohsaka, Shigeaki Amano, Hideki Isozaki
2007Effects of testosterone levels on temporal and intonational aspects of speech: more exploratory data.
Charles A. Lamoureux, Victor J. Boucher
2007Efficient estimation of speaker-specific projecting feature transforms.
Jonas Lööf, Ralf Schlüter, Hermann Ney
2007Emotion attribute projection for speaker recognition on emotional speech.
Huanjun Bao, Ming-Xing Xu, Thomas Fang Zheng
2007Emotion clustering using the results of subjective opinion tests for emotion recognition in infants' cries.
N. Satoh, Katsuya Yamauchi, Shoichi Matsunaga, Masaru Yamashita, R. Nakagawa, Kazuyuki Shinohara
2007Empirical evidence for prosodic phrasing: pauses as linguistic annotation in Korean read speech.
Hyongsil Cho, Daniel Hirst
2007English and French speakers' perception of voicing distinctions in non-native lateral consonant syllable onsets.
Catherine T. Best, Pierre A. Hallé, Jennifer S. Pardo
2007Enhancing acoustic-to-EPG mapping with lip position information.
Asterios Toutios, Konstantinos G. Margaritis
2007Enhancing usability of CAPL system for qur'an recitation learning.
Abdurrahman Samir, Sherif Mahdy Abdou, Ahmed Husien Khalil, Mohsen A. Rashwan
2007Environmentally aware voice activity detector.
Abhijeet Sangwan, Nitish Krishnamurthy, John H. L. Hansen
2007Error detection in confusion network.
Alexandre Allauzen
2007Error-tolerant question answering for spoken documents.
Tomoyosi Akiba, Hirofumi Tsujimura
2007Estimating VTLN warping factors by distribution matching.
Janne Pylkkönen
2007Estimation of place of articulation in stop consonants for visual feedback.
Milind S. Shah, Prem C. Pandey
2007Evaluating acoustic distance measures for template based recognition.
Mathias De Wachter, Kris Demuynck, Patrick Wambacq, Dirk Van Compernolle
2007Evaluating and optimizing Japanese tutor system featuring dynamic question generation and interactive guidance.
Christopher J. Waple, Hongcui Wang, Tatsuya Kawahara, Yasushi Tsubota, Masatake Dantsuji
2007Evaluating the temporal structure normalisation technique on the Aurora-4 task.
Xiong Xiao, Engsiong Chng, Haizhou Li
2007Evaluating two versions of the momel pitch modelling algorithm on a corpus of read speech in Korean.
Daniel Hirst, Hyongsil Cho, Sunhee Kim, Hyunji Yu
2007Evaluation of alternatives on speech to sign language translation.
Rubén San Segundo, Alicia Pérez, Daniel Ortiz, Luis Fernando D'Haro, M. Inés Torres, Francisco Casacuberta
2007Evaluation of real-time voice activity detection based on high order statistics.
David Cournapeau, Tatsuya Kawahara
2007Evaluation of syllable stress using single class classifier.
Abhinav Parate, Ashish Verma, Jayanta Basak
2007Evaluation of the combined use of MEMLIN and MLLR on the non-native adaptation task of hiwire project database.
Luis Buera, Antonio Miguel, Oscar Saz, Eduardo Lleida, Alfonso Ortega
2007Event detection of speech signals based on auditory processing with a dynamic compressive gammachirp filterbank.
Satomi Tanaka, Minoru Tsuzaki, Hiroaki Kato, Yoshinori Sagisaka
2007Evolutionary minimum verification error learning of the alternative hypothesis model for LLR-based speaker verification.
Yi-Hsiang Chao, Wei-Ho Tsai, Shih-Sian Cheng, Hsin-Min Wang, Ruei-Chuan Chang
2007Experimental validation of direct and inverse glottal flow models for unsteady flow conditions.
Julien Cisonni, Annemie Van Hirtum, Jan Willems, Xavier Pelorson
2007Experiments on hiwire database using denoising and adaptation with a hybrid HMM-ANN model.
Roberto Gemello, Franco Mana, Stefano Scanzio
2007Exploiting information extraction annotations for document retrieval in distillation tasks.
Dilek Hakkani-Tür, Gökhan Tür, Michael Levit
2007Exploiting phoneme similarities in hybrid HMM-ANN keyword spotting.
Joel Pinto, Andrew Lovitt, Hynek Hermansky
2007Exploiting prosodic features for dialog act tagging in a discriminative modeling framework.
Vivek Kumar Rangarajan Sridhar, Srinivas Bangalore, Shrikanth S. Narayanan
2007Exploiting prosody for PCFGs with latent annotations.
Markus Dreyer, Izhak Shafran
2007Exploiting unlabeled internal data in conditional random fields to reduce word segmentation errors for Chinese texts.
Richard Tzong-Han Tsai, Hsi-Chuan Hung, Hong-Jie Dai, Wen-Lian Hsu
2007Exploring initiative strategies using computer simulation.
Fan Yang, Peter A. Heeman
2007Exploring tonal variations via context-dependent tone models.
Yue-Ning Hu, Min Chu, Chao Huang, Yan-Ning Zhang
2007Extended powered cepstral normalization (p-CN) with range equalization for robust features in speech recognition.
Chang-Wen Hsu, Lin-Shan Lee
2007Extra large vocabulary continuous speech recognition algorithm based on information retrieval.
Valeriy Pylypenko
2007Extracting true speaker identities from transcriptions.
Yannick Estève, Sylvain Meignier, Paul Deléglise, Julie Mauclair
2007F
Hiroko Hirano, Keikichi Hirose, Goh Kawai, Wentao Gu, Nobuaki Minematsu
2007F
Rerrario Shui-Ching Ho, Yoshinori Sagisaka
2007F0 transformation within the voice conversion framework.
Zdenek Hanzlícek, Jindrich Matousek
2007Fast adaptation of GMM-based compact models.
Christophe Lévy, Georges Linarès, Jean-François Bonastre
2007Feasibility of constructing an expressive speech corpus from television soap opera dialogue.
Peter Rutten
2007Feature and distribution normalization schemes for statistical mismatch reduction in reverberant speech recognition.
A. M. Toh, Roberto Togneri, Sven Nordholm
2007Features interpolation domain for distributed speech recognition and performance for ITU-t g.723.1 CODEC.
Vladimir Fabregas Surigué de Alencar, Abraham Alcaim
2007Features of pauses and conjunctions at syntactic and discourse boundaries in Japanese monologues.
Michiko Watanabe, Yasuharu Den, Keikichi Hirose, Shusaku Miwa, Nobuaki Minematsu
2007Fepstrum: an improved modulation spectrum for ASR.
Vivek Tyagi
2007Filtering the unknown: speech activity detection in heterogeneous video collections.
Marijn Huijbregts, Chuck Wooters, Roeland Ordelman
2007Fixed-size kernel logistic regression for phoneme classification.
Peter Karsmakers, Kristiaan Pelckmans, Johan A. K. Suykens, Hugo Van hamme
2007Formal modelling of L1 and L2 perceptual learning: computational linguistics versus machine learning.
Paola Escudero, Jelle Kastelein, Klara A. Weiand, R. J. J. H. van Son
2007Formant-based synthesis of singing.
Sten Ternström, Johan Sundberg
2007Frame alignment method for cross-lingual voice conversion.
Daniel Erro, Asunción Moreno
2007Frame margin probability discriminative training algorithm for noisy speech recognition.
Hao-Zheng Li, Douglas D. O'Shaughnessy
2007Frequency domain correspondence for speaker normalization.
Ming Liu, Xi Zhou, Mark Hasegawa-Johnson, Thomas S. Huang, Zhengyou Zhang
2007Frequency study for the characterization of the dysphonic voices.
Gilles Pouchoulin, Corinne Fredouille, Jean-François Bonastre, Alain Ghio, Antoine Giovanni
2007From one base form to multiple output styles - predicting stylistic dynamics of discourse prosody.
Chiu-yu Tseng, Zhao-yu Su
2007Fused HMM-adaptation of multi-stream HMMs for audio-visual speech recognition.
David Dean, Patrick Lucey, Sridha Sridharan, Tim Wark
2007Fusing acoustic, phonetic and data-driven systems for text-independent speaker verification.
Asmaa El Hannani, Dijana Petrovska-Delacrétaz
2007Fusion of contrastive acoustic models for parallel phonotactic spoken language identification.
Khe Chai Sim, Haizhou Li
2007Fusion of global statistical and segmental spectral features for speech emotion recognition.
Hao Hu, Ming-Xing Xu, Wei Wu
2007G2p conversion of names: what can we do (better)?
Henk van den Heuvel, Jean-Pierre Martens, Nanneke Konings
2007GEMSIS - a novel application of speech recognition to emergency and disaster medicine.
Satoshi Tamura, Kunihiko Takamatsu, Shinji Ogura, Satoru Hayamizu
2007Gaussian mixture optimization for HMM based on efficient cross-validation.
Takahiro Shinozaki, Tatsuya Kawahara
2007Generating small, accurate acoustic models with a modified Bayesian information criterion.
Kai Yu, Rob A. Rutenbar
2007Generative and discriminative algorithms for spoken language understanding.
Christian Raymond, Giuseppe Riccardi
2007Generic class-based statistical language models for robust speech understanding in directed dialog applications.
Matthieu Hébert
2007Getting start with UTDrive: driver-behavior modeling and assessment of distraction for in-vehicle speech systems.
Pongtep Angkititrakul, DongGu Kwak, Sangjo Choi, Jeonghee Kim, Anh PhucPhan, Amardeep Sathyanarayana, John H. L. Hansen
2007Global features for rapid identity verification with dynamic biometric data.
Andrew C. Morris, Jacques C. Koreman, B. Ly-Van, Harin Sellahewa, Sabah Jassim, R. Llarena Gómez
2007Group delay features for emotion detection.
Vidhyasaharan Sethu, Eliathamby Ambikairajah, Julien Epps
2007HMM-based speech recognition using decision trees instead of GMMs.
Remco Teunen, Masami Akamine
2007Handling OOV words in Arabic ASR via flexible morphological constraints.
Nguyen Bach, Mohamed Noamany, Ian R. Lane, Tanja Schultz
2007Handling phonetic context and speaker variation in a structure-based speech recognizer.
Dong Yu, Li Deng, Alex Acero
2007Handling speech input in the ritel QA dialogue system.
Boris W. van Schooten, Sophie Rosset, Olivier Galibert, Aurélien Max, Rieks op den Akker, Gabriel Illouz
2007Hierarchical acoustic modeling based on random-effects regression for automatic speech recognition.
Yan Han, Lou Boves
2007Hierarchical dialogue optimization using semi-Markov decision processes.
Heriberto Cuayáhuitl, Steve Renals, Oliver Lemon, Hiroshi Shimodaira
2007Hierarchical language identification based on automatic language clustering.
Bo Yin, Eliathamby Ambikairajah, Fang Chen
2007Hierarchical neural networks feature extraction for LVCSR system.
Fabio Valente, Jithendra Vepa, Christian Plahl, Christian Gollan, Hynek Hermansky, Ralf Schlüter
2007Hierarchical non-uniform unit selection based on prosodic structure.
Jun Xu, Dezhi Huang, Yongxin Wang, Yuan Dong, Lianhong Cai, Haila Wang
2007High-level feature-based speaker verification via articulatory phonetic-class pronunciation modeling.
Shi-Xiong Zhang, Man-Wai Mak, Helen M. Meng
2007Homograph ambiguity resolution in front-end design for portuguese TTS systems.
Daniela Braga, Luís Pinto Coelho, Fernando Gil Vianna Resende Jr.
2007How predictable is ASR confidence in dialog applications?
Xiang Li, Juan M. Huerta
2007How to access audio files of large data bases using in-car speech dialogue systems.
Sandra Mann, André Berton, Ute Ehrlich
2007How to integrate speech-operated internet information dialogs into a car.
André Berton, Peter Regel-Brietzmann, Hans Ulrich Block, Stefanie Schachtl, Manfred Gehrke
2007How to judge reusability of existing speech corpora for target task by utilizing statistical multidimensional scaling.
Goshu Nagino, Makoto Shozakai, Kiyohiro Shikano
2007How to personalize speech applications for web-based information in a car.
Philipp Fischer, Andreas Österle, André Berton, Peter Regel-Brietzmann
2007Hybrid electroglottograph and speech signal based algorithm for pitch marking.
Hussein Hussein, Oliver Jokisch
2007Hybridizing conversational and clear speech.
Akiko Kusumoto, Alexander Kain, John-Paul Hosom, Jan P. H. van Santen
2007IceNLP: a natural language processing toolkit for icelandic.
Hrafn Loftsson, Eiríkur Rögnvaldsson
2007Identification of natural whistled vowels by non-whistlers.
Julien Meyer, Fanny Meunier, Laure Dentel
2007Impact of various small sound source signals on voice conversion accuracy in speech communication aid for laryngectomees.
Keigo Nakamura, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano
2007Implementation and evaluation of an HMM-based Thai speech synthesis system.
Suphattharachai Chomphan, Takao Kobayashi
2007Improved HMM/SVM methods for automatic phoneme segmentation.
Jen-Wei Kuo, Hung-Yi Lo, Hsin-Min Wang
2007Improved acoustic modeling for transcribing Arabic broadcast data.
Lori Lamel, Abdelkhalek Messaoudi, Jean-Luc Gauvain
2007Improved language recognition using better phonetic decoders and fusion with MFCC and SDC features.
Doroteo T. Toledano, Javier Gonzalez-Dominguez, Alejandro Abejón-Gonzalez, Danilo Spada, Ismael Mateos-Garcia, Joaquin Gonzalez-Rodriguez
2007Improved location features for meeting speaker diarization.
Scott Otterson
2007Improved machine translation of speech-to-text outputs.
Daniel Déchelotte, Holger Schwenk, Gilles Adda, Jean-Luc Gauvain
2007Improved methods for language model based question classification.
Andreas Merkel, Dietrich Klakow
2007Improvements in machine translation for English/iraqi speech translation.
Shirin Saleem, Krishna Subramanian, Rohit Prasad, David Stallard, Chia-Lin Kao, Prem Natarajan, Raid Suleiman
2007Improving phonotactic language recognition with acoustic adaptation.
Wade Shen, Douglas A. Reynolds
2007Improving speaker diarization for CHIL lecture meetings.
Jing Huang, Etienne Marcheret, Karthik Visweswariah
2007Improving speech translation with automatic boundary prediction.
Evgeny Matusov, Dustin Hillard, Mathew Magimai-Doss, Dilek Hakkani-Tür, Mari Ostendorf, Hermann Ney
2007Improving the phase vocoder approach to pitch-shifting.
Petko Nikolov Petkov, W. Bastiaan Kleijn
2007In-context phone posteriors as complementary features for tandem ASR.
Hamed Ketabdar, Hervé Bourlard
2007Increasing prosodic variability of text-to-speech synthesizers.
Géza Németh, Márk Fék, Tamás Gábor Csapó
2007Incremental perception of acted and real emotional speech.
Pashiera Barkhuysen, Emiel Krahmer, Marc Swerts
2007Influence of task duration in text-independent speaker verification.
Benoit G. B. Fauve, Nicholas W. D. Evans, Neil Pearson, Jean-François Bonastre, John S. D. Mason
2007Information retrieval strategies for accessing african audio corpora.
Abdillahi Nimaan, Pascal Nocera, Frédéric Béchet, Jean-François Bonastre
2007Integrating MAP, marginals, and unsupervised language model adaptation.
Wen Wang, Andreas Stolcke
2007Integrating audio and visual cues for speaker friendliness in multimodal speech synthesis.
David House
2007Integrating pitch and localisation cues at a speech fragment level.
Heidi Christensen, Ning Ma, Stuart N. Wrigley, Jon Barker
2007Integration of ASR and machine translation models in a document translation task.
Aarthi M. Reddy, Richard C. Rose, Alain Désilets
2007Intensive gestures in French and their multimodal correlates.
Gaëlle Ferré, Roxane Bertrand, Philippe Blache, Robert Espesser, Stéphane Rauzy
2007Inter-language prosodic style modification experiment using word impression vector for communicative speech generation.
Ke Li, Yoko Greenberg, Yoshinori Sagisaka
2007Intercoder reliability in annotating complex disfluencies.
Peter A. Heeman, Andy McMillin, J. Scott Yaruss
2007Introduction to multilingual corpus-based concatenative speech synthesis.
Filip Deprez, Jan Odijk, Jan De Moortel
2007Investigations into early and late reflections on distant-talking speech recognition toward suitable reverberation criteria.
Takanobu Nishiura, Yoshiki Hirano, Yuki Denda, Masato Nakayama
2007Iraqcomm: a next generation translation system.
Kristin Precoda, Jing Zheng, Dimitra Vergyri, Horacio Franco, Colleen Richey, Andreas Kathol, Sachin S. Kajarekar
2007Irrelevant variability normalization based HMM training using VTS approximation of an explicit model of environmental distortions.
Yu Hu, Qiang Huo
2007Iterative unit selection with unnatural prosody detection.
Dacheng Lin, Yong Zhao, Frank K. Soong, Min Chu, Jieyu Zhao
2007JAAE: the java abstract annotation editor.
Ivan Habernal, Miloslav Konopík
2007Jitter and shimmer measurements for speaker recognition.
Mireia Farrús, Javier Hernando, Pascual Ejarque
2007Joint position-pitch extraction from multichannel audio.
Michael Wohlmayr, Marián Képesi
2007Joint speaker segmentation, localization and identification for streaming audio.
Joerg Schmalenstroeer, Reinhold Haeb-Umbach
2007Kettle hinders cat, shadow does not hinder shed: activation of 'almost embedded' words in nonnative listening.
Mirjam Broersma
2007Knowledge consistent user simulations for dialog systems.
Hua Ai, Diane J. Litman
2007L2 consonant identification in noise: cross-language comparisons.
Anne Cutler, Martin Cooke, María Luisa García Lecumberri, Dennis Pasveer
2007LSA-based language model adaptation for highly inflected languages.
Tanel Alumäe, Toomas Kirt
2007Landmark-based approach to speech recognition: an alternative to HMMs.
Carol Y. Espy-Wilson, Tarun Pruthi, Amit Juneja, Om Deshmukh
2007Language identification based on n-gram frequency ranking.
Ricardo de Córdoba, Luis Fernando D'Haro, Fernando Fernández Martínez, Javier Macías Guarasa, Javier Ferreiros
2007Language identification of person names using CF-IOF based weighing function.
Samuel Thomas, Ashish Verma
2007Language identification using several sources of information with a multiple-Gaussian classifier.
Ricardo de Córdoba, Luis Fernando D'Haro, Fernando Fernández Martínez, Juan Manuel Montero, Roberto Barra-Chicote
2007Language model adaptation using latent dirichlet allocation and an efficient topic inference algorithm.
Aaron Heidel, Hung-An Chang, Lin-Shan Lee
2007Language modeling for automatic turkish broadcast news transcription.
Ebru Arisoy, Hasim Sak, Murat Saraclar
2007Language modeling using PLSA-based topic HMM.
Atsushi Sako, Tetsuya Takiguchi, Yasuo Ariki
2007Large-scale random forest language models for speech recognition.
Yi Su, Frederick Jelinek, Sanjeev Khudanpur
2007Learning dialogue strategies for interactive database search.
Verena Rieser, Oliver Lemon
2007Learning spoken document similarity and recommendation using supervised probabilistic latent semantic analysis.
Kishan Thambiratnam, Frank Seide
2007Learning the inter-frame distance for discriminative template-based keyword detection.
David Grangier, Samy Bengio
2007Learning tone distinctions for Mandarin Chinese.
David Weenink, Guangqin Chen, Zongyan Chen, Stefan de Konink, Dennis Vierkant, Eveline van Hagen, R. J. J. H. van Son
2007Length, ordering preference and intonational phrasing: evidence from pauses.
Gerrit Kentner
2007Lexicon adaptation with reduced character error (LARCE) - a new direction in Chinese language modeling.
Yi-Cheng Pan, Lin-Shan Lee
2007Line cepstral quefrencies and their use for acoustic inventory coding.
Guntram Strecha, Matthias Eichner, Rüdiger Hoffmann
2007Linear and non linear kernel GMM supervector machines for speaker verification.
Réda Dehak, Najim Dehak, Patrick Kenny, Pierre Dumouchel
2007Linear prediction of audio signals.
Toon van Waterschoot, Marc Moonen
2007Linear transformation approach to VTLN using dynamic frequency warping.
D. Rama Sanand, D. Dinesh Kumar, Srinivasan Umesh
2007Lombard speech impact on perceptual speaker recognition.
Ayako Ikeno, John H. L. Hansen
2007Loquendo - Politecnico di torino's 2006 NIST speaker recognition evaluation system.
Claudio Vair, Daniele Colibro, Fabio Castaldo, Emanuele Dalmasso, Pietro Laface
2007MRASTA and PLP in automatic speech recognition.
S. R. Mahadeva Prasanna, Hynek Hermansky
2007Machine learning for spoken dialogue systems.
Oliver Lemon, Olivier Pietquin
2007Management of static/dynamic properties in a multimodal interaction system.
Kouichi Katsurada, Yuji Okuma, Makoto Yano, Yurie Iribe, Tsuneo Nitta
2007Mandarin vowel pronunciation quality evaluation by using formant pattern recognition.
Fuping Pan, Qingwei Zhao, Yonghong Yan
2007Mel sub-band filtering and compression for robust speech recognition.
Babak Nasersharif, Ahmad Akbari, Mohammad Mehdi Homayounpour
2007Memory efficient modeling of polyphone context with weighted finite-state transducers.
Emilian Stoimenov, John W. McDonough
2007Method of LP-based blind restoration for improving intelligibility of bone-conducted speech.
Thang Tat Vu, Germine Seide, Masashi Unoki, Masato Akagi
2007Minimal pairs and functional loads of sound contrasts obtained from a list of modern greek words.
Constandinos Kalimeris, Stelios Bakamidis
2007Minimum rank error training for language modeling.
Meng-Sung Wu, Jen-Tzung Chien
2007Mobile adaptive CALL (MAC): a lightweight speech-based intervention for mobile language learners.
Maria Uther, James Uther, Panos Athanasopoulos, Pushpendra Singh, Reiko Akahane-Yamada
2007Model-based speech separation with single-microphone input.
Siu Wa Lee, Frank K. Soong, Pak-Chung Ching
2007Model-driven detection of clean speech patches in noise.
Jonathan Laidler, Martin Cooke, Neil D. Lawrence
2007Model-space MLLR for trajectory HMMs.
Heiga Zen, Yoshihiko Nankaku, Keiichi Tokuda
2007Modeling context and language variation for non-native speech recognition.
Tien Ping Tan, Laurent Besacier
2007Modeling incompletion phenomenon in Mandarin dialog prosody.
Jian Yu, Lixing Huang, Jianhua Tao, Xia Wang
2007Modeling the statistical behavior of lexical chains to capture word cohesiveness for automatic story segmentation.
Shing-kai Chan, Lei Xie, Helen M. Meng
2007Modeling tones in hakka on the basis of the command-response model.
Wentao Gu, Rerrario Shui-Ching Ho, Tan Lee
2007Modelling confusion matrices to improve speech recognition accuracy, with an application to dysarthric speech.
Santiago Omar Caballero Morales, Stephen J. Cox
2007Modelling prominence and emphasis improves unit-selection synthesis.
Volker Strom, Ani Nenkova, Robert A. J. Clark, Yolanda Vazquez-Alvarez, Jason M. Brenier, Simon King, Dan Jurafsky
2007Modelling the human-machine gap in speech reception: microscopic speech intelligibility prediction for normal-hearing subjects with an auditory model.
Tim Jürgens, Thomas Brand, Birger Kollmeier
2007More on acoustic correlates of stress.
Daan Wissing
2007Morfessor and variKN machine learning tools for speech and language technology.
Vesa Siivola, Mathias Creutz, Mikko Kurimo
2007Morphological pre-processing technique and its applications on speech signal.
Hyun Soo Kim
2007Morphosyntactic processing of n-best lists for improved recognition and confidence measure computation.
Stéphane Huet, Guillaume Gravier, Pascale Sébillot
2007MuLAS: a framework for automatically building multi-tier corpora.
Sérgio Paulo, Luís C. Oliveira
2007Multi-layer kohonen self-organizing feature map for language identification.
Liang Wang, Eliathamby Ambikairajah, Eric H. C. Choi
2007Multi-modal user authentication from video for mobile or variable-environment applications.
Timothy J. Hazen, Daniel Schultz
2007Multi-resolution soft features for channel-robust distributed speech recognition.
Valentin Ion, Reinhold Haeb-Umbach
2007Multi-step linear prediction based speech dereverberation in noisy reverberant environment.
Keisuke Kinoshita, Marc Delcroix, Tomohiro Nakatani, Masato Miyoshi
2007Multi-stream features combination based on dempster-shafer rule for LVCSR system.
Fabio Valente, Jithendra Vepa, Hynek Hermansky
2007Multiband, multisensor robust features for noisy speech recognition.
Dimitrios Dimitriadis, Petros Maragos, Stamatios Lefkimmiatis
2007Multimodal speech recognition with ultrasonic sensors.
Bo Zhu, Timothy J. Hazen, James R. Glass
2007Mutual information and the speech signal.
Mattias Nilsson, W. Bastiaan Kleijn
2007N-best: the northern- and southern-dutch benchmark evaluation of speech recognition technology.
Judith M. Kessens, David A. van Leeuwen
2007Narrowband to wideband feature expansion for robust multilingual ASR.
Dusan Macho
2007Natural-emotion GMM transformation algorithm for emotional speaker recognition.
Zhenyu Shan, Yingchun Yang, Ruizhi Ye
2007Neighborhood density and neighborhood frequency effects in French spoken word recognition.
Sophie Dufour, Ulrich H. Frauenfelder
2007Nepalese retroflex stops: a static palatography study of inter- and intra-speaker variability.
Rajesh Khatiwada
2007Never-ending learning with dynamic hidden Markov network.
Konstantin Markov, Satoshi Nakamura
2007New algorithm for LPC residual estimation from LSF vectors for a voice conversion system.
Winston S. Percybrooks, Elliot Moore
2007New word acquisition using subword modeling.
Ghinwa F. Choueiter, Stephanie Seneff, James R. Glass
2007Noise reduction based on adaptive β-order generalized spectral subtraction for speech enhancement.
Junfeng Li, Shuichi Sakamoto, Satoshi Hongo, Masato Akagi, Yôiti Suzuki
2007Noise robust front-end processing with voice activity detection based on periodic to aperiodic component ratio.
Kentaro Ishizuka, Tomohiro Nakatani, Masakiyo Fujimoto, Noboru Miyazaki
2007Noise robust speech recognition for voice driven wheelchair.
Akira Sasou, Hiroaki Kojima
2007Noise robust voice activity detection based on switching kalman filter.
Masakiyo Fujimoto, Kentaro Ishizuka
2007Noise suppression based on extending a speech-dominated modulation band.
Tiago H. Falk, Svante Stadler, W. Bastiaan Kleijn, Wai-Yip Chan
2007Noise suppression using search strategy with multi-model compositions.
Takatoshi Jitsuhiro, Tomoji Toriyama, Kiyoshi Kogure
2007Noise tracking for speech systems in adverse environments.
Nitish Krishnamurthy, John H. L. Hansen
2007Noise-robust hands-free voice activity detection with adaptive zero crossing detection using talker direction estimation.
Yuki Denda, Takamasa Tanaka, Masato Nakayama, Takanobu Nishiura, Yoichi Yamashita
2007Non-linear spectral contrast stretching for in-car speech recognition.
Weifeng Li, Hervé Bourlard
2007Normalized two stage SVQ for minimum complexity wide-band LSF quantization.
Saikat Chatterjee, Thippur V. Sreenivas
2007Novel eigenpitch-based prosody model for text-to-speech synthesis.
Jilei Tian, Jani Nurminen, Imre Kiss
2007Novel low-band phase representation for low bit-rate speech coding.
Ahmed Ismail, Yasser Dakroury, Hazem M. Abbas
2007Objective analysis of the effect of memory inclusion on bandwidth extension of narrowband speech.
Amr H. Nour-Eldin, Peter Kabal
2007Objective parameters from videokymographic images: a user-friendly interface.
Claudia Manfredi, Leonardo Bocchi, Giovanna Cantarella, Giorgio Peretti, Gabriele Guidi, Vincenzo Mezzatesta
2007Omnidirectional audio-visual talker localizer with dynamic feature fusion based on validity and reliability criteria.
Yuki Denda, Takanobu Nishiura, Yoichi Yamashita
2007On automatic prominence detection for German.
Fabio Tamburini, Petra Wagner
2007On comparing and combining intra-speaker variability compensation and unsupervised model adaptation in speaker verification.
Claudio Garretón, Néstor Becerra Yoma, Fernando Huenupán, Carlos Molina
2007On filled-pauses and prolongations in european portuguese.
Helena Moniz, Ana Isabel Mata, Céu Viana
2007On optimal estimation of compressed speech for hearing aids.
Dirk Mauler, Anil M. Nagathil, Rainer Martin
2007On organic interfaces.
Victor Zue
2007On the categorical nature of the process involved in schwa elision in French.
Audrey Bürki, Cécile Fougeron, Cédric Gendrot
2007On the equivalence of Gaussian HMM and Gaussian HMM-like hidden conditional random fields.
Georg Heigold, Ralf Schlüter, Hermann Ney
2007On the importance of pure prosody in the perception of speaker identity.
Elina Helander, Jani Nurminen
2007On the jointly unsupervised feature vector normalization and acoustic model compensation for robust speech recognition.
Luis Buera, Antonio Miguel, Eduardo Lleida, Oscar Saz, Alfonso Ortega
2007On the limitations of voice conversion techniques in emotion identification tasks.
Roberto Barra-Chicote, Juan Manuel Montero, Javier Macías Guarasa, Juana M. Gutiérrez-Arriola, Javier Ferreiros, José Manuel Pardo
2007On the role of spectral dynamics in unit selection speech synthesis.
Barry Kirkpatrick, Darragh O'Brien, Ronan Scaife, Andrew Errity
2007On the use of time-delay neural networks for highly accurate classification of stop consonants.
Jun Hou, Lawrence R. Rabiner, Sorin Dusan
2007On web-based creation of speech resources for less-resourced languages.
Christoph Draxler
2007Online call quality monitoring for automating agent-based call centers.
Woosung Kim
2007Online vocabulary adaptation using limited adaptation data.
C. E. Liu, Kishan Thambiratnam, Frank Seide
2007Ontology-based multimodal high level fusion involving natural language analysis for aged people home care application.
Olga Vybornova, Monica Gemo, Ronald Moncarey, Benoît Macq
2007Optimization of temporal filters in the modulation frequency domain for constructing robust features in speech recognition.
Jeih-weih Hung
2007Optimization on decoding graphs by discriminative training.
Shiuan-Sung Lin, François Yvon
2007Optimized one-bit quantization for adapted GMM-based speaker verification.
Ivy H. Tseng, Olivier Verscheure, Deepak S. Turaga, Upendra V. Chaudhari
2007Optimizing sentence segmentation for spoken language translation.
Sharath Rao, Ian R. Lane, Tanja Schultz
2007PCA-based feature extraction for fluctuation in speaking style of articulation disorders.
Hironori Matsumasa, Tetsuya Takiguchi, Yasuo Ariki, Ichao Li, Toshitaka Nakabayashi
2007PLSA-based topic detection in meetings for adaptation of lexicon and language model.
Yuya Akita, Yusuke Nemoto, Tatsuya Kawahara
2007Parameter tuning for fast speech recognition.
Thomas Colthurst, Tresi Arvizo, Chia-Lin Kao, Owen Kimball, Stephen A. Lowe, David R. H. Miller, Jim Van Sciver
2007People watcher: a game for eliciting human-transcribed data for automated directory assistance.
Tim Paek, Yun-Cheng Ju, Christopher Meek
2007Perception and production of word-final alveolar stops by brazilian portuguese learners of English.
Melissa Bettoni-Techio, Andréia S. Rauber, Rosana Denise Koerich
2007Perception of disfluency: language differences and listener bias.
Catherine Lai, Kyle Gorman, Jiahong Yuan, Mark Y. Liberman
2007Perceptual equivalence of approximated Cantonese tone contours.
Yujia Li, Tan Lee
2007Perceptual musical noise reduction using critical bands tonality coefficients and masking thresholds.
Anis Ben Aicha, Sofia Ben Jebara
2007Perceptual relevance of pitch contours of Mandarin tones and its efficacy in prosody generation of speech synthesis.
Shi-Han Chen, Chih-Chung Kuo
2007Perceptual-based playout mechanisms for multi-stream voice over IP networks.
Chun-Feng Wu, Cheng-Lung Lee, Wen-Whei Chang
2007Performance evaluation of HMM-based style classification with a small amount of training data.
Makoto Tachibana, Keigo Kawashima, Junichi Yamagishi, Takao Kobayashi
2007Performance evaluation of glottal quality measures from the perspective of vocal tract filter consistency.
Juan F. Torres, Elliot Moore
2007Performance of speaker-dependent wideband speech coding.
Ethan Robert Duni, Bhaskar D. Rao
2007Phone boundary detection using selective refinements and context-dependent acoustic features.
Sirinoot Boonsuk, Proadpran Punyabukkana, Atiwong Suchato
2007Phone-discriminating minimum classification error (p-MCE) training for phonetic recognition.
Qian Qian, Xiaodong He, Li Deng
2007Phoneme confusions in human and automatic speech recognition.
Bernd T. Meyer, Matthias Wächter, Thomas Brand, Birger Kollmeier
2007Phoneme dependent frame selection preference.
Tingyao Wu, Jacques Duchateau, Dirk Van Compernolle
2007Phonetic based sentence level rewriting of questions typed by dyslexic spellers in an information retrieval context.
Laurianne Sitbon, Patrice Bellot, Philippe Blache
2007Phonetic geminates in cypriot greek: the case of voiceless plosives.
Christiana Christodoulou
2007Phonotactic spoken language identification with limited training data.
Marius Peche, Marelie H. Davel, Etienne Barnard
2007Phrases in category-based language models for Spanish and basque ASR.
Raquel Justo, M. Inés Torres
2007Pitch accent versus lexical stress: quantifying acoustic measures related to the voice source.
Yen-Liang Shue, Markus Iseli, Nanette Veilleux, Abeer Alwan
2007Pitch estimation of noisy speech signals using empirical mode decomposition.
Md. Khademul Islam Molla, Keikichi Hirose, Nobuaki Minematsu, Md. Kamrul Hasan
2007Pitch pattern alternation in goshogawara Japanese: evidence for a prosodic phrase above the domain for downstep.
Yosuke Igarashi
2007Pitch period estimation using multipulse model and wavelet transform.
Prasanta Kumar Ghosh, Antonio Ortega, Shrikanth S. Narayanan
2007PocketSUMMIT: small-footprint continuous speech recognition.
I. Lee Hetherington
2007Podcastle: a web 2.0 approach to speech recognition research.
Masataka Goto, Jun Ogata, Kouichirou Eto
2007Pointing to a target while naming it with /pata/ or /tapa/: the effect of consonants and stress position on jaw-finger coordination.
Amélie Rochet-Capellan, Jean-Luc Schwartz, Rafael Laboissière, Arturo Galvàn
2007Predicting focus through prominence structure.
Sasha Calhoun
2007Predicting the consequences of vocalizations in early infancy.
Francisco Lacerda, Lisa Gustavsson
2007Predicting vowel duration in spontaneous canadian French speech.
Darcie Williams, François Poiré
2007Predictive minimum Bayes risk classification for robust speech recognition.
Jen-Tzung Chien, Koichi Shinoda, Sadaoki Furui
2007Prelexical adjustments to speaker idiosyncrasies: are they position-specific?
Alexandra Jesse, James M. McQueen
2007Preliminary experiments toward automatic generation of new TTS voices from recorded speech alone.
Ryuki Tachibana, Tohru Nagano, Gakuto Kurata, Masafumi Nishimura, Noboru Babaguchi
2007Preventing an external acoustic noise from being misrecognized as a speech recognition object by confirming the lip movement image signal.
Soo-Jong Lee, Jun Park, Eung-Kyeu Kim
2007Probabilistic deduction of symbol mappings for extension of lexicons.
Rita Singh, Evandro B. Gouvêa, Bhiksha Raj
2007Probabilistic latent speaker analysis for large vocabulary speech recognition.
Dan Su, Xihong Wu, Huisheng Chi
2007Processing image and audio information for recognising discourse participation status through features of face and voice.
Nick Campbell, Damien Douxchamps
2007Prosody change and response timing analysis in spontaneously spoken dialogs and their modeling in a spoken dialog system.
Ryota Nishimura, Norihide Kitaoka, Seiichi Nakagawa
2007Prosody, emotions, and... 'whatever'.
Stefan Benus, Agustín Gravano, Julia Hirschberg
2007Prosody-enriched lattices for improved syllable recognition.
Sankaranarayanan Ananthakrishnan, Shrikanth S. Narayanan
2007Punctuating confusion networks for speech translation.
Roldano Cattoni, Nicola Bertoldi, Marcello Federico
2007Pushy versus meek - using avatars to influence turn-taking behaviour.
Jens Edlund, Jonas Beskow
2007Quality assessment of speech enhancement systems by separation of enhanced speech, noise, and echo.
Tim Fingscheidt, Suhadi Suhadi
2007Quasi text-independent speaker-verification based on pattern matching.
Michael Gerber, René Beutler, Beat Pfister
2007RAMCESS/handsketch: a multi-representation framework for realtime and expressive singing synthesis.
Nicolas D'Alessandro, Thierry Dutoit
2007Rapid and accurate spoken term detection.
David R. H. Miller, Michael Kleber, Chia-Lin Kao, Owen Kimball, Thomas Colthurst, Stephen A. Lowe, Richard M. Schwartz, Herbert Gish
2007Rapid speaker adaptation by reference model interpolation.
Wen Xuan Teng, Guillaume Gravier, Frédéric Bimbot, Frédéric Soufflet
2007Rapid unsupervised speaker adaptation using single utterance based on MLLR and speaker selection.
Randy Gomez, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano
2007Realisations and alternations in German /r/-realisation.
Christiane Ulbrich, Horst Ulbrich
2007Recent progress in the MIT spoken lecture processing project.
James R. Glass, Timothy J. Hazen, D. Scott Cyphers, Igor Malioutov, David Huynh, Regina Barzilay
2007Recognition of foreign names spoken by native speakers.
Frederik Stouten, Jean-Pierre Martens
2007Reconstructing audio signals from modified non-coherent hilbert envelopes.
Joachim Thiemann, Peter Kabal
2007Recovering punctuation marks for automatic speech recognition.
Fernando Batista, Diamantino Caseiro, Nuno J. Mamede, Isabel Trancoso
2007Reducing recognition error rate based on context relationships among dialogue turns.
Hsu-Chih Wu, Stephanie Seneff
2007Regularized feature-based maximum likelihood linear regression for speech recognition.
Mohamed Kamal Omar
2007Relative evaluation of informativeness in machine generated summaries.
BalaKrishna Kolluru, Yoshihiko Gotoh
2007Resources for new research directions in speaker recognition: the mixer 3, 4 and 5 corpora.
Christopher Cieri, Linda Corson, David Graff, Kevin Walker
2007Rhotic variation and schwa epenthesis in windsor French.
Ivan Chow, François Poiré
2007Rigid vs non-rigid face and head motion in phone and tone perception.
Denis Burnham, Jessica Reynolds, Guillaume Vignali, Sandra Bollwerk, Caroline Jones
2007Robust F0 modeling for Mandarin speech recognition in noise.
Sheng Qiang, Yao Qian, Frank K. Soong, Congfu Xu
2007Robust and high-resolution voiced/unvoiced classification in noisy speech using a signal smoothness criterion.
A. Sreenivasa Murthy, S. Chandra Sekhar, Thippur V. Sreenivas
2007Robust distributed speech recognition using histogram equalization and correlation information.
Pedro M. Martinez, José C. Segura, Luz García
2007Robust location understanding in spoken dialog systems using intersections.
Michael L. Seltzer, Yun-Cheng Ju, Ivan Tashev, Alex Acero
2007Robust voice activity detection based on adaptive sub-band energy sequence analysis and harmonic detection.
Yanmeng Guo, Qian Qian, Yonghong Yan
2007Robust voice activity detection for narrow-bandwidth speaker verification under adverse environments.
Tuan Van Pham, Michael Neffe, Gernot Kubin
2007Robustness of long time measures of fundamental frequency.
Jonas Lindh, Anders Eriksson
2007Robustness of several kernel-based fast adaptation methods on noisy LVCSR.
Brian Kan-Wing Mak, Roger Wend-Huu Hsiao
2007Russian vowels system acoustic features development in ontogenesis.
Elena E. Lyakso, Olga V. Frolova
2007SPICE: web-based tools for rapid language adaptation in speech processing systems.
Tanja Schultz, Alan W. Black, Sameer Badaskar, Matthew Hornyak, John Kominek
2007Score distribution scaling for speaker recognition.
Vinod Prakash, John H. L. Hansen
2007Score fusion for articulatory feature detection.
Brian M. Ore, Raymond E. Slyh
2007Segment deletion in spontaneous speech: a corpus study using mixed effects models with crossed random effects.
Christophe Van Bael, R. Harald Baayen, Helmer Strik
2007Segmentation of speech: child's play?
Odette Scharenborg, Mirjam Ernestus, Vincent Wan
2007Selecting on-topic sentences from natural language corpora.
Michael Levit, Elizabeth Boschee, Marjorie Freedman
2007Selection of optimal dimensionality reduction method using chernoff bound for segmental unit input HMM.
Makoto Sakai, Norihide Kitaoka, Seiichi Nakagawa
2007Self-organization in the evolution of shared systems of speech sounds: a computational study.
Pierre-Yves Oudeyer
2007Semi-supervised learning of speech sounds.
Aren Jansen, Partha Niyogi
2007Sentence level intelligibility evaluation for Mandarin text-to-speech systems using semantically unpredictable sentences.
Jian Li, Dmitry Sityaev, Jie Hao
2007Single channel speech separation using maximum a posteriori estimation.
Mohammad H. Radfar, Richard M. Dansereau
2007Singleton and geminate stops in Finnish - acoustic correlates.
Christopher S. Doty, Kaori Idemaru, Susan G. Guion
2007Smooth soft mel-spectrographic masks based on blind sparse source separation.
Marco Kühne, Roberto Togneri, Sven Nordholm
2007Soft margin feature extraction for automatic speech recognition.
Jinyu Li, Chin-Hui Lee
2007Some evidence on the phonetics and phonology of prosodic phrasing in Russian.
Irina Nesterenko, Pavel A. Skrelin
2007Sparse Gaussian graphical models for speech recognition.
Peter Bell, Simon King
2007Speaker adaptation of language models for automatic dialog act segmentation of meetings.
Jáchym Kolár, Yang Liu, Elizabeth Shriberg
2007Speaker adaptive training for one-to-many eigenvoice conversion based on Gaussian mixture model.
Yamato Ohtani, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano
2007Speaker clustering using direct maximization of a BIC-based score.
Wei-Ho Tsai
2007Speaker diarization using normalized cross likelihood ratio.
Viet Bac Le, Odile Mella, Dominique Fohr
2007Speaker recognition by combining MFCC and phase information.
Seiichi Nakagawa, Kouhei Asakawa, Longbiao Wang
2007Speaker recognition using kernel-PCA and intersession variability modeling.
Hagai Aronowitz
2007Speaker role based structural classification of broadcast news stories.
BalaKrishna Kolluru, Yoshihiko Gotoh
2007Speaker verification with multiple classifier fusion using Bayes based confidence measure.
Fernando Huenupán, Néstor Becerra Yoma, Carlos Molina, Claudio Garretón
2007Speaking rate effects in a landmark-based phonetic exemplar model.
Travis Wade, Bernd Möbius
2007Speaking through a noisy channel - experiments on inducing clarification behaviour in human-human dialogue.
David Schlangen, Raquel Fernández
2007Spectro-temporal analysis of speech using 2-d Gabor filters.
Tony Ezzat, Jake V. Bouvrie, Tomaso A. Poggio
2007Spectro-temporal processing for blind estimation of reverberation time and single-ended quality measurement of reverberant speech.
Tiago H. Falk, Hua Yuan, Wai-Yip Chan
2007Speech based drug information system for aged and visually impaired persons.
Géza Németh, Gábor Olaszy, Mátyás Bartalis, Géza Kiss, Csaba Zainkó, Péter Mihajlik
2007Speech coding and information processing by auditory neurons.
Huan Wang, Werner Hemmert
2007Speech enhancement using PCA and variance of the reconstruction error model identification.
Amin Haji Abolhassani, Sid-Ahmed Selouani, Douglas D. O'Shaughnessy, Mohamed Faouzi Harkat
2007Speech enhancement using multi-reference noise reduction in a vehicle environment.
Abderrahman Essebbar, Tristan Poinsard
2007Speech enhancement with improved a posteriori SNR computation.
Suhadi Suhadi, Tim Fingscheidt
2007Speech feature compensation based on pseudo stereo codebooks for robust speech recognition in additive noise environments.
Tsung-hsueh Hsieh, Jeih-weih Hung
2007Speech fundamental frequency estimation using the alternate comb.
Jean-Sylvain Liénard, François Signol, Claude Barras
2007Speech mining in noisy audio message corpus.
Nathalie Camelin, Frédéric Béchet, Géraldine Damnati, Renato De Mori
2007Speech perception in children with speech sound disorder.
H. Timothy Bunnell, N. Carolyn Schanen, Linda D. Vallino, Thierry G. Morlet, James B. Polikoff, Jennette D. Driscoll, James T. Mantell
2007Speech quality after major surgery of the oral cavity and oropharynx with microvascular soft tissue reconstruction.
Irma Verdonck-de Leeuw, Louis ten Bosch, Li Ying Chao, Rico N. P. M. Rinkel, Pepijn A. Borggreven, Lou Boves, C. René Leemans
2007Speech quality estimation using packet loss effects in CELP-type speech coders.
Min-Ki Lee, Kyung-Tae Kim, Hong-Goo Kang, Dae Hee Youn
2007Speech recognition techniques for a sign language recognition system.
Philippe Dreuw, David Rybach, Thomas Deselaers, Morteza Zahedi, Hermann Ney
2007Speech recognition with factorial-HMM syllabic acoustic models.
Gianpaolo Coro, Francesco Cutugno, Fulvio Caropreso
2007Speech recognition with state-based nearest neighbour classifiers.
Thomas Deselaers, Georg Heigold, Hermann Ney
2007Speech reinforcement based on partial specific loudness.
Jong Won Shin, Woohyung Lim, June Sig Sung, Nam Soo Kim
2007Speech synthesis enhancement in noisy environments.
Davide Bonardo, Enrico Zovato
2007Speech to chant transformation with the phase vocoder.
Axel Röbel, Joshua Fineberg
2007Speech-based annotation and retrieval of digital photographs.
Timothy J. Hazen, Brennan Sherry, Mark Adler
2007Speech-nonspeech discrimination using the information bottleneck method and spectro-temporal modulation index.
Maria E. Markaki, Michael Wohlmayr, Yannis Stylianou
2007Speechindexer in action: managing endangered Formosan languages.
Jozsef Szakos, Ulrike Glavitsch
2007Speeding-up neural network training using sentence and frame selection.
Stefano Scanzio, Pietro Laface, Roberto Gemello, Franco Mana
2007Spoken language identification using score vector modeling and support vector machine.
Ming Li, Hongbin Suo, Xiao Wu, Ping Lu, Yonghong Yan
2007Spoken word recognition of Chinese homophones: a further investigation.
Michael C. W. Yip
2007Spontaneous speech synthesis by pronunciation variant selection - a comparison to natural speech.
Steffen Werner, Rüdiger Hoffmann
2007Stabilised weighted linear prediction - a robust all-pole method for speech processing.
Carlo Magi, Tom Bäckström, Paavo Alku
2007Statistical identification of critical, dependent and redundant articulators.
Veena D. Singampalli, Philip J. B. Jackson
2007Statistical vowelization of Arabic text for speech synthesis in speech-to-speech translation systems.
Liang Gu, Wei Zhang, Lazkin Tahir, Yuqing Gao
2007String and lattice based discriminative training for the corpus of spontaneous Japanese lecture transcription task.
Erik McDermott, Atsushi Nakamura
2007Structural Bayesian language modeling and adaptation.
Sibel Yaman, Jen-Tzung Chien, Chin-Hui Lee
2007Structural assessment of language learners' pronunciation.
Nobuaki Minematsu, K. Kamata, Satoshi Asakawa, Takehiko Makino, Tazuko Nishimura, Keikichi Hirose
2007Structure-based and template-based automatic speech recognition - comparing parametric and non-parametric approaches.
Li Deng, Helmer Strik
2007Study on speaker verification with non-audible murmur segments.
Hideki Okamoto, Mariko Kojima, Tomoko Matsui, Hiromichi Kawanami, Hiroshi Saruwatari, Kiyohiro Shikano
2007Style estimation of speech based on multiple regression hidden semi-Markov model.
Takashi Nose, Yoichi Kato, Takao Kobayashi
2007Subword-based position specific posterior lattices (s-PSPL) for indexing speech information.
Yi-Cheng Pan, Hung-Lin Chang, Berlin Chen, Lin-Shan Lee
2007Support vector regression for speaker verification.
Ignacio López-Moreno, Ismael Mateos-Garcia, Daniel Ramos, Joaquin Gonzalez-Rodriguez
2007Suprasegmental aspects of pre-lexical speech in cochlear implanted children.
Øydis Hide, Steven Gillis, Paul Govaerts
2007Syllable lattices as a basis for a children's speech reading tracker.
Daniel Bolaños, Wayne H. Ward, Sarel van Vuuren, Javier Garrido Salas
2007Syllable timing patterns in Polish: results from annotation mining.
Dafydd Gibbon, Jolanta Bachan, Grazyna Demenko
2007Synthesis of prosodic attitudinal variants in German backchannel ja.
Thorsten Stocksmeier, Stefan Kopp, Dafydd Gibbon
2007System request detection in conversation based on acoustic and speaker alternation features.
Tomoyuki Yamagata, Atsushi Sako, Tetsuya Takiguchi, Yasuo Ariki
2007Tagging syllable boundaries with joint n-gram models.
Helmut Schmid, Bernd Möbius, Julia Weidenkaff
2007Temporal alignment of creaky voice in neutralised realisations of an underlying, post-nasal voicing contrast in German.
Tina John, Jonathan Harrington
2007Temporal downtrends in Czech read speech.
Jan Volín, Radek Skarnitzl
2007Temporal episodic memory model: an evolution of minerva2.
Viktoria Maier, Roger K. Moore
2007Temporal masking for unsupervised minimum Bayes risk speaker adaptation.
Matthew Gibson, Thomas Hain
2007Testing the relevance of speech rate, pitch and a glottal Chink for the perception of age in synthesized speech using formant synthesis.
Ralf Winkler
2007Text island spotting in large speech databases.
Benjamin Lecouteux, Georges Linarès, Frédéric Beaugendre, Pascal Nocera
2007The BBN 2007 displayless English/iraqi speech-to-speech translation system.
David Stallard, Fred Choi, Chia-Lin Kao, Kriste Krstovski, Premkumar Natarajan, Rohit Prasad, Shirin Saleem, Krishna Subramanian
2007The IRST English-Spanish translation system for european parliament speeches.
Daniele Falavigna, Nicola Bertoldi, Fabio Brugnara, Roldano Cattoni, Mauro Cettolo, Boxing Chen, Marcello Federico, Diego Giuliani, Roberto Gretter, Deepa Gupta, Dino Seppi
2007The ISL 2007 English speech transcription system for european parliament speeches.
Sebastian Stüker, Christian Fügen, Florian Kraft, Matthias Wölfel
2007The RWTH 2007 TC-STAR evaluation system for european English and Spanish.
Jonas Lööf, Christian Gollan, Stefan Hahn, Georg Heigold, Björn Hoffmeister, Christian Plahl, David Rybach, Ralf Schlüter, Hermann Ney
2007The SRI/OGI 2006 spoken term detection system.
Dimitra Vergyri, Izhak Shafran, Andreas Stolcke, Venkata Ramana Rao Gadde, Murat Akbacak, Brian Roark, Wen Wang
2007The blame game: performance analysis of speaker diarization system components.
Marijn Huijbregts, Chuck Wooters
2007The buckeye corpus of speech: updates and enhancements.
Eric Fosler-Lussier, Laura Dilley, Na'im R. Tyson, Mark A. Pitt
2007The developmental analysis of demonstrative expression skills utilizing a multimodal infant behavior corpus.
Shinya Kiriyama, Ryo Tsuji, Tomohiko Kasami, Shogo Ishikawa, Naofumi Otani, Hiroaki Horiuchi, Yoichi Takebayashi, Shigeyoshi Kitazawa
2007The duration of speech pauses in a multilingual environment.
Mike Demol, Werner Verhelst, Piet Verhoeve
2007The effect of filled pauses in a lecture speech on impressive evaluation of listeners.
Hiromitsu Nishizaki, Mitsuhiro Somiya, Kenji Kobayashi, Yoshihiro Sekiguchi
2007The effect of highband harmonic structure in the artificial bandwidth expansion of telephone speech.
Hannu Pulakka, Paavo Alku, Laura Laaksonen, Päivi Valve
2007The effect of speech interface accuracy on driving performance.
Andrew L. Kun, Tim Paek, Zeljko Medenica
2007The effect of the additivity assumption on time and frequency domain wiener filtering for speech enhancement.
Kamil K. Wójcicki, Stephen So, Kuldip K. Paliwal
2007The harming part of room acoustics in automatic speech recognition.
Rico Petrick, Kevin Lohde, Matthias Wolff, Rüdiger Hoffmann
2007The harmonic model codec (HMC) framework for voIP.
Yannis Agiomyrgiannakis, Yannis Stylianou
2007The influence of masking words on the prediction of TRPs in a shadowed dialog.
Wieneke Wesseling, R. J. J. H. van Son, Louis C. W. Pols
2007The influence of speech activity detection and overlap on speaker diarization for meeting room recordings.
Corinne Fredouille, Nicholas W. D. Evans
2007The influence of user tailoring and cognitive load on user performance in spoken dialogue systems.
Andi Winterboer, Jiang Hu, Johanna D. Moore, Clifford Nass
2007The influence of utterance chunking on machine translation performance.
Christian Fügen, Muntsin Kolss
2007The influence of vowel quality features on peak alignment.
Matthias Jilka, Bernd Möbius
2007The intelligibility and its relations to acoustic characteristics of English /s/ and /esh/ produced by native speakers of Japanese.
Akiyo Joto, Yoshiki Nagase, Seiya Funatsu
2007The limits of multidimensional category learning.
Martijn Goudbeek, Daniel Swingley, Keith R. Kluender
2007The neural basis of speech perception - a view from functional imaging.
Sophie K. Scott
2007The neutral tone in question intonation in Mandarin.
Fang Liu, Yi Xu
2007The phonetic exponency of phrasal accentuation in French and German.
William J. Barry, Bistra Andreeva, Ingmar Steiner
2007The phonetics and phonology of high and low tones in two falling f0-contours in standard German.
Tamara Rathcke, Jonathan Harrington
2007The relationship between the perception and production of English nasal codas by brazilian learners of English.
Denise Cristina Kluge, Andréia S. Rauber, Mara Silvia Reis, Ricardo Augusto Hoffmann Bion
2007The relevance of feature type for the automatic classification of emotional user states: low level descriptors and functionals.
Björn W. Schuller, Anton Batliner, Dino Seppi, Stefan Steidl, Thurid Vogt, Johannes Wagner, Laurence Devillers, Laurence Vidrascu, Noam Amir, Loïc Kessous, Vered Aharonson
2007The role of intonation and voice quality in the affective speech perception.
Ioulia Grichkovtsova, Anne Lacheret, Michel Morel
2007The role of metrical stress in comprehension and production in dutch children at-risk of dyslexia.
Petra van Alphen, Elise de Bree, Paula Fikkert, Frank Wijnen
2007The role of outer hair cell function in the perception of synthetic versus natural speech.
Maria K. Wolters, Pauline Campbell, Christine DePlacido, Amy Liddell, David Owens
2007The virtual guide: a direction giving embodied conversational agent.
Mariët Theune, Dennis Hofs, Marco van Kessel
2007The voice-rate dialog system for consumer ratings.
Geoffrey Zweig, Patrick Nguyen, Yun-Cheng Ju, Ye-Yi Wang, Dong Yu, Alex Acero
2007The voiceTRAN machine translation system.
Jerneja Zganec-Gros, Stanislav Gruden
2007Thinking outside the cube: modeling language processing tasks in a multiple resource paradigm.
Kilian G. Seeber
2007Time-compressed speech perception with speech and noise maskers.
Douglas Brungart, Nandini Iyer
2007Time-domain blind audio source separation using advanced ICA methods.
Zbynek Koldovský, Petr Tichavský
2007Time-varying pre-emphasis and inverse filtering of speech.
Karl Schnell, Arild Lacroix
2007Time-warping and re-phasing in packet loss concealment.
Robert Zopf, Jes Thyssen, Juin-Hwey Chen
2007Tone production by the speakers of different age-and-gender groups.
Wai-Sum Lee
2007Top-down effects on compensation for coarticulation are not replicable.
Holger Mitterer
2007Topic estimation with domain extensibility for guiding user's out-of-grammar utterances in multi-domain spoken dialogue systems.
Satoshi Ikeda, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno
2007Topic in dialogue: prosodic and syntactic features.
Claudia Crocco, Renata Savy
2007Towards better language modeling for Thai LVCSR.
Markpong Jongtaveesataporn, Issara Thienlikit, Chai Wutiwiwatchai, Sadaoki Furui
2007Towards online speech summarization.
Gabriel Murray, Steve Renals
2007Trainable speaker diarization.
Hagai Aronowitz
2007Translating conversational speech to standard linguistic form.
Darren Scott Appling, Nick Campbell
2007Two-stage system for robust neutral/lombard speech recognition.
Hynek Boril, Petr Fousek, Harald Höge
2007Two-stream emotion recognition for call center monitoring.
Purnima Gupta, Nitendra Rajput
2007Unsupervised HMM classification of F0 curves.
Damien Lolive, Nelly Barbot, Olivier Boëffard
2007Unsupervised categorisation approaches for technical support automated agents.
Amparo Albalate, Dimitar Dimitrov, Roberto Pieraccini
2007Unsupervised re-scoring of observation probability in viterbi based on reinforcement learning by using confidence measure and HMM neighborhood.
Carlos Molina, Néstor Becerra Yoma, Fernando Huenupán, Claudio Garretón
2007Unsupervised training of adaptation rate using q-learning in large vocabulary continuous speech recognition.
Masafumi Nishida, Yasuo Horiuchi, Akira Ichikawa
2007Unsupervised training with directed manual transcription for recognising Mandarin broadcast audio.
Kai Yu, Mark J. F. Gales, Philip C. Woodland
2007Use of lexical and affective prosodic cues to emotion by younger and older adults.
Kate Dupuis, Kathleen Pichora-Fuller
2007Use of syllable center detection for improved duration modeling in Chinese Mandarin connected digits recognition.
Sergey Astrov, Joachim Hofer, Harald Höge
2007Using a small development set to build a robust dialectal Chinese speech recognizer.
Linquan Liu, Thomas Fang Zheng, Makoto Akabane, Ruxin Chen, Wenhu Wu
2007Using direction of arrival estimate and acoustic feature information in speaker diarization.
Chin-Wei Eugene Koh, Hanwu Sun, Tin Lay Nwe, Trung Hieu Nguyen, Bin Ma, Engsiong Chng, Haizhou Li, Susanto Rahardja
2007Using eye movements for online evaluation of speech synthesis.
Charlotte van Hooijdonk, Edwin Commandeur, Reinier Cozijn, Emiel Krahmer, Erwin Marsi
2007Using information state to improve dialogue move identification in a spoken dialogue system.
Hua Ai, Antonio Roque, Anton Leuski, David R. Traum
2007Using inter-lingual triggers for machine translation.
Caroline Lavecchia, Kamel Smaïli, David Langlois, Jean Paul Haton
2007Using multiple strategies to manage spoken dialogue.
Shiu-Wah Chu, Ian M. O'Neill, Philip Hanna
2007Using neutral speech models for emotional speech analysis.
Carlos Busso, Sungbok Lee, Shrikanth S. Narayanan
2007Using phonetic features in unsupervised word decompounding for ASR with application to a less-represented language.
Thomas Pellegrini, Lori Lamel
2007Using prosodic and spectral characteristics for sleepiness detection.
Jarek Krajewski, Bernd J. Kröger
2007Using speech rhythm for acoustic language identification.
Ekaterina Timoshenko, Harald Höge
2007Using waveform matching techniques in the measurement of shimmer in voiced signals.
Carlos A. Ferrer-Riesgo, María Esperanza Hernández-Díaz, Eduardo González-Moreira
2007Utilizing online content as domain knowledge in a multi-domain dynamic dialogue system.
Craig Wootton, Michael F. McTear, Terry Anderson
2007Utterance-final glottalization as a cue for familiar speaker recognition.
Tamás Böhm, Stefanie Shattuck-Hufnagel
2007VOCALOID - commercial singing synthesizer based on sample concatenation.
Hideki Kenmochi, Hayato Ohshita
2007VZ-norm: an extension of z-norm to the multivariate case for anchor model based speaker verification.
Delphine Charlet, Mikaël Collet, Frédéric Bimbot
2007Varying input segmentation for story boundary detection in English, Arabic and Mandarin broadcast news.
Andrew Rosenberg, Mehrbod Sharifi, Julia Hirschberg
2007Vector-quantization based mask estimation for missing data automatic speech recognition.
Maarten Van Segbroeck, Hugo Van hamme
2007Virtual fusion for speaker recognition.
Yosef A. Solewicz, Moshe Koppel
2007Visual analysis of lip coarticulation in VCV utterances.
Aseel Turkmani, Adrian Hilton, Philip J. B. Jackson, James D. Edge
2007Visual information and redundancy conveyed by internal articulator dynamics in synthetic audiovisual speech.
Katja Grauwinkel, Britta Dewitt, Sascha Fagel
2007Visualizing acoustic similarities between emotions in speech: an acoustic map of emotions.
Khiet P. Truong, David A. van Leeuwen
2007Vocabulary selection for a broadcast news transcription system using a morpho-syntactic approach.
Ciro Martins, António J. S. Teixeira, João Paulo Neto
2007Vocal conversion from speaking voice to singing voice using STRAIGHT.
Takeshi Saitou, Masataka Goto, Masashi Unoki, Masato Akagi
2007Vocal tract and area function estimation with both lip and glottal losses.
Kaustubh Kalgaonkar, Mark A. Clements
2007Vocal tract length during speech production.
Sorin Dusan
2007Voice activated powered wheelchair with non-voice rejection algorithm.
Soo-Young Suk, Hiroaki Kojima
2007Voice activity detection based on support vector machine using effective feature vectors.
Q-Haing Jo, Yun-Sik Park, Kye-Hwan Lee, Ji-Hyun Song, Joon-Hyuk Chang
2007Voice activity detection in degraded speech using excitation source information.
K. Sri Rama Murty, B. Yegnanarayana, Sunitha Guruprasad
2007Voice activity detection using the phase vector in microphone array.
Gibak Kim, Nam Ik Cho
2007Voice fatigue and use of speech recognition: a study of voice quality ratings.
Christel G. de Bruijn, Sandra P. Whiteside
2007Voice source and vocal tract variations as cues to emotional states perceived from expressive conversational speech.
Hiroki Mori, Hideki Kasuya
2007Voicepedia: towards speech-based access to unstructured information.
J. Sherwani, Dong Yu, Tim Paek, Mary Czerwinski, Yun-Cheng Ju, Alex Acero
2007Voicing level control with application in voice conversion.
Jani Nurminen, Jilei Tian, Victor Popa
2007Voicing-based codebook in low-rate wideband CELP coding.
Driss Guerchi, Tamer Rabie, Abdelrhani Louzi
2007Vowel production in two occlusal classes.
André Araújo, Luis M. T. Jesus, Isabel M. Costa
2007Vowels and tones in infant directed speech: hyperarticulation for both, but different developmental patterns.
Nan Xu, Denis Burnham, Christine Kitamura
2007Wavelet-based front-end for electromyographic speech recognition.
Michael Wand, Szu-Chen Stan Jou, Tanja Schultz
2007Web-based language modelling for automatic lecture transcription.
Cosmin Munteanu, Gerald Penn, Ronald Baecker
2007Weighted frequency warping for voice conversion.
Daniel Erro, Asunción Moreno
2007What do listeners attend to in hearing prosodic structures? investigating the human speech-parser using short-term recall.
Annie C. Gilbert, Victor J. Boucher
2007Women's vocal aging: a longitudinal approach.
Markus Brckl
2007Word confusability - measuring hidden Markov model similarity.
Jia-Yu Chen, Peder A. Olsen, John R. Hershey
2007Word duration modeling for word graph rescoring in LVCSR.
Dino Seppi, Daniele Falavigna, Georg Stemmer, Roberto Gretter
2007Word stress correlates in spontaneous child-directed speech in German.
Katrin Schneider, Bernd Möbius
2007Word-conditioned HMM supervectors for speaker recognition.
Howard Lei, Nikki Mirghafori
2007Zero-crossing-based ratio masking for sound segregation.
Sung Jun An, Young-Ik Kim, Rhee Man Kil
2007fMPE-MAP: improved discriminative adaptation for modeling new domains.
Jing Zheng, Andreas Stolcke
2007ugloss: a framework for improving spoken language generation understandability.
Brian Langner, Alan W. Black