ASRU C

127 papers

YearTitle / Authors
2007A Mandarin lecture speech transcription system for speech summarization.
Ricky Ho Yin Chan, Justin Jian Zhang, Pascale Fung, Lu Cao
2007A compact semidefinite programming (SDP) formulation for large margin estimation of HMMS in speech recognition.
Yan Yin, Hui Jiang
2007A comparisonal study of the multi-layer Kohonen self-organizing feature maps for spoken language identification.
Liang Wang, Eliathamby Ambikairajah, Eric H. C. Choi
2007A constrained line search approach to general discriminative HMM training.
Peng Liu, Cong Liu, Hui Jiang, Frank K. Soong, Ren-Hua Wang
2007A data-centric architecture for data-driven spoken dialog systems.
Sebastian Varges, Giuseppe Riccardi
2007A fast-match approach for robust, faster than real-time speaker diarization.
Yan Huang, Oriol Vinyals, Gerald Friedland, Christian A. Müller, Nikki Mirghafori, Chuck Wooters
2007A language modeling approach to question answering on speech transcripts.
Matthias H. Heie, Edward W. D. Whittaker, Josef R. Novak, Sadaoki Furui
2007A method for evaluating and comparing user simulations: The Cramér-von Mises divergence.
Jason D. Williams
2007A multi-layer architecture for semi-synchronous event-driven dialogue management.
Antoine Raux, Maxine Eskénazi
2007A novel weighting technique for fusing Language Identification systems based on pair-wise performances.
Bo Yin, Eliathamby Ambikairajah, Fang Chen
2007A study of lattice-based spoken term detection for Chinese spontaneous speech.
Sha Meng, Peng Yu, Frank Seide, Jia Liu
2007A study on rescoring using HMM-based detectors for continuous speech recognition.
Qiang Fu, Biing-Hwang Juang
2007A study on soft margin estimation for LVCSR.
Jinyu Li, Zhi-Jie Yan, Chin-Hui Lee, Ren-Hua Wang
2007A system for speech driven information retrieval.
César González Ferreras, Valentín Cardeñoso-Payo
2007Adapting grapheme-to-phoneme conversion for name recognition.
Xiao Li, Asela Gunawardana, Alex Acero
2007Advances in Arabic broadcast news transcription at RWTH.
David Rybach, Stefan Hahn, Christian Gollan, Ralf Schlüter, Hermann Ney
2007Agglomerative information bottleneck for speaker diarization of meetings data.
Deepu Vijayasenan, Fabio Valente, Hervé Bourlard
2007An algorithm for fast composition of weighted finite-state transducers.
John W. McDonough, Emilian Stoimenov, Dietrich Klakow
2007An enhanced minimum classification error learning framework for balancing insertion, deletion and substitution errors.
Yuan-Fu Liao, Jia Jang Tu, Sen-Chia Chang, Chin-Hui Lee
2007Analytical comparison between position specific posterior lattices and confusion networks based on words and subword units for spoken document indexing.
Yi-Cheng Pan, Hung-Lin Chang, Lin-Shan Lee
2007Automatic detection of contrastive elements in spontaneous speech.
Ani Nenkova, Dan Jurafsky
2007Automatic lexical pronunciations generation and update.
Ghinwa F. Choueiter, Stephanie Seneff, James R. Glass
2007Automatic speech recognition based on weighted minimum classification error (W-MCE) training method.
Qiang Fu, Biing-Hwang Juang
2007Bayesian adaptation in HMM training and decoding using a mixture of feature transforms.
Stavros Tsakalidis, Spyros Matsoukas
2007Broad phonetic class recognition in a Hidden Markov model framework using extended Baum-Welch transformations.
Tara N. Sainath, Dimitri Kanevsky, Bhuvana Ramabhadran
2007Building a highly accurate Mandarin speech recognizer.
Mei-Yuh Hwang, Gang Peng, Wen Wang, Arlo Faria, Aaron Heidel, Mari Ostendorf
2007Call classification for automated troubleshooting on large corpora.
Keelan Evanini, David Suendermann, Roberto Pieraccini
2007Combining statistical models with symbolic grammar in parsing.
Junichi Tsujii
2007Comparing one and two-stage acoustic modeling in the recognition of emotion in speech.
Björn W. Schuller, Bogdan Vlasenko, Ricardo Minguez, Gerhard Rigoll, Andreas Wendemuth
2007Consolidation based speech translation.
Chiori Hori, Bing Zhao, Stephan Vogel, Alex Waibel
2007Crosslingual acoustic model development for automatics speech recognition.
Frank Diehl, Asunción Moreno, Enric Monte
2007Data selection for speech recognition.
Yi Wu, Rong Zhang, Alexander I. Rudnicky
2007Dealing with cross-lingual aspects in spoken name recognition.
Frederik Stouten, Jean-Pierre Martens
2007Deriving salient learners' mispronunciations from cross-language phonological comparisons.
Helen Mei-Ling Meng, Yuen Yee Lo, Lan Wang, Wing Yiu Lau
2007Design and implementation of a robot audition system for automatic speech recognition of simultaneous speech.
Shun'ichi Yamamoto, Kazuhiro Nakadai, Mikio Nakano, Hiroshi Tsujino, Jean-Marc Valin, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno
2007Development and portability of ASR and Q&A modules for real-environment speech-oriented guidance systems.
Tobias Cincarek, Hiromichi Kawanami, Hiroshi Saruwatari, Kiyohiro Shikano
2007Development of VAD evaluation framework CENSREC-1-C and investigation of relationship between VAD and speech recognition performance.
Norihide Kitaoka, Kazumasa Yamamoto, Tomohiro Kusamizu, Seiichi Nakagawa, Takeshi Yamada, Satoru Tsuge, Chiyomi Miyajima, Takanobu Nishiura, Masato Nakayama, Yuki Denda, Masakiyo Fujimoto, Tetsuya Takiguchi, Satoshi Tamura, Shingo Kuroiwa, Kazuya Takeda, Satoshi Nakamura
2007Development of a phonetic system for large vocabulary Arabic speech recognition.
Mark J. F. Gales, Frank Diehl, Chandra Kant Raut, Marcus Tomalin, Philip C. Woodland, Kai Yu
2007Development of the 2007 RWTH Mandarin LVCSR system.
Björn Hoffmeister, Christian Plahl, Peter Fritz, Georg Heigold, Jonas Lööf, Ralf Schlüter, Hermann Ney
2007Discriminative language model adaptation for Mandarin broadcast speech transcription and translation.
Xunying Liu, William J. Byrne, Mark J. F. Gales, Adrià de Gispert, Marcus Tomalin, Philip C. Woodland, Kai Yu
2007Discriminative training of multi-state barge-in models.
Andrej Ljolje, Vincent Goffin
2007Dynamic language modeling for a daily broadcast news transcription system.
Ciro Martins, António J. S. Teixeira, João Paulo Neto
2007Dynamic vocabulary prediction for isolated-word dictation on embedded devices.
Jussi Leppänen, Jilei Tian
2007Efficient combination of parametric spaces, models and metrics for speaker diarization
Themos Stafylakis, Vassilis Katsouros, George Carayannis
2007Efficient use of overlap information in speaker diarization.
Scott Otterson, Mari Ostendorf
2007Empirical study of neural network language models for Arabic speech recognition.
Ahmad Emami, Lidia Mangu
2007Error simulation for training statistical dialogue systems.
Jost Schatzmann, Blaise Thomson, Steve J. Young
2007Example-based error recovery strategy for spoken dialog system.
Cheongjae Lee, Sangkeun Jung, Donghyeon Lee, Gary Geunbae Lee
2007Experiments on cross-system acoustic model adaptation.
Diego Giuliani, Fabio Brugnara
2007Exploiting complementary aspects of phonological features in automatic speech recognition.
Parya Momayyez, James Waterhouse, Richard Rose
2007Extensible speech recognition system using proxy-agent.
Teppei Nakano, Shinya Fujie, Tetsunori Kobayashi
2007Factor analysis of acoustic features for streamed hidden Markov modeling.
Chuan-Wei Ting, Jen-Tzung Chien
2007Fast audio search using vector space modelling.
Brett Matthews, Upendra V. Chaudhari, Bhuvana Ramabhadran
2007Generalized linear interpolation of language models.
Bo-June Paul Hsu
2007Graph-based learning for phonetic classification.
Andrei Alexandrescu, Katrin Kirchhoff
2007HMM training based on CV-EM and CV Gaussian mixture optimization.
Takahiro Shinozaki, Tatsuya Kawahara
2007Hierarchical Pitman-Yor language models for ASR in meetings.
Songfang Huang, Steve Renals
2007Hierarchical large-margin Gaussian mixture models for phonetic classification.
Hung-An Chang, James R. Glass
2007High-performance hmm adaptation with joint compensation of additive and convolutive distortions via Vector Taylor Series.
Jinyu Li, Li Deng, Dong Yu, Yifan Gong, Alex Acero
2007IEEE Workshop on Automatic Speech Recognition & Understanding, ASRU 2007, Kyoto, Japan, December 9-13, 2007
Sadaoki Furui, Tatsuya Kawahara
2007Implicit user-adaptive system engagement in speech, pen and multimodal interfaces.
Sharon L. Oviatt
2007Improvements in phone based audio search via constrained match with high order confusion estimates.
Upendra V. Chaudhari, Michael Picheny
2007Improving lecture speech summarization using rhetorical information.
Justin Jian Zhang, Ricky Ho Yin Chan, Pascale Fung
2007Incorporating the voicing information into HMM-based automatic speech recognition.
Peter Jancovic, Münevver Köküer
2007Integrating several annotation layers for statistical information distillation.
Michael Levit, Dilek Hakkani-Tür, Gökhan Tür, Daniel Gillick
2007Interpolation of lost speech segments using LP-HNM model with codebook-mapping post-processing.
Esfandiar Zavarehei, Saeed Vaseghi
2007Interpolative variable frame rate transmission of speech features for distributed speech recognition.
Huiqun Deng, Douglas D. O'Shaughnessy, Jean-Guy Dahan, William F. Ganong III
2007Introduction of the METI project "development of fundamental speech recognition technology".
Sadaoki Furui, Tetsunori Kobayashi
2007Investigating linguistic knowledge in a maximum entropy token-based language model.
Jia Cui, Yi Su, Keith B. Hall, Frederick Jelinek
2007Investigating the use of speech features and their corresponding distribution characteristics for robust speech recognition.
Shih-Hsiang Lin, Yao-Ming Yeh, Berlin Chen
2007Joint decoding of multiple speech patterns for robust speech recognition.
Nishanth Ulhas Nair, T. V. Sreenivas
2007Lattice-based Viterbi decoding techniques for speech translation.
George Saon, Michael Picheny
2007Maximum entropy model parameterization with TF∗IDF weighted vector space model.
Ye-Yi Wang, Alex Acero
2007Minimum mutual information beamforming for simultaneous active speakers.
Ken'ichi Kumatani, Uwe Mayer, Tobias Gehrig, Emilian Stoimenov, John W. McDonough, Matthias Wölfel
2007Mixture Gaussian HMM-trajctory method using likelihood compensation.
Yasuhiro Minami
2007Modulation spectrum equalization for robust speech recognition.
Liang-Che Sun, Chang-Wen Hsu, Lin-Shan Lee
2007Monolingual and crosslingual comparison of tandem features derived from articulatory and phone MLPS.
Özgür Çetin, Mathew Magimai-Doss, Karen Livescu, Arthur Kantor, Simon King, Chris D. Bartels, Joe Frankel
2007Multi-stream dialect classification using SVM-GMM hybrid classifiers.
Rahul Chitturi, John H. L. Hansen
2007Multiple feature combination to improve speaker diarization of telephone conversations.
Vishwa Gupta, Patrick Kenny, Pierre Ouellet, Gilles Boulianne, Pierre Dumouchel
2007Never-ending learning system for on-line speaker diarization.
Konstantin Markov, Satoshi Nakamura
2007Non-native pronunciation variation modeling using an indirect data driven method.
Mina Kim, Yoo Rhee Oh, Hong Kook Kim
2007Non-native speech databases.
Martin Raab, Rainer Gruhn, Elmar Nöth
2007OOV detection by joint word/phone lattice alignment.
Hui Lin, Jeff A. Bilmes, Dimitra Vergyri, Katrin Kirchhoff
2007Phonological feature based variable frame rate scheme for improved speech recognition.
Abhijeet Sangwan, John H. L. Hansen
2007Predictive linear transforms for noise robust speech recognition.
Mark J. F. Gales, Rogier C. van Dalen
2007Random discriminant structure analysis for automatic recognition of connected vowels.
Yu Qiao, Satoshi Asakawa, Nobuaki Minematsu
2007Recognition and understanding of meetings the AMI and AMIDA projects.
Steve Renals, Thomas Hain, Hervé Bourlard
2007Refine bigram PLSA model by assigning latent topics unevenly.
Jiazhong Nie, Runxin Li, Dingsheng Luo, Xihong Wu
2007Regularization, adaptation, and non-independent features improve hidden conditional random fields for phone classification.
Yun-Hsuan Sung, Constantinos Boulis, Christopher D. Manning, Dan Jurafsky
2007Reranking machine translation hypotheses with structured and web-based language models.
Wen Wang, Andreas Stolcke, Jing Zheng
2007Robust speaker clustering strategies to data source variation for improved speaker diarization.
Kyu Jeong Han, Samuel Kim, Shrikanth S. Narayanan
2007Robust speech recognition by properly utilizing reliable frames and segments in corrupted signals.
Yi Chen, Chia-Yu Wan, Lin-Shan Lee
2007Robust speech recognition using noise suppression based on multiple composite models and multi-pass search.
Takatoshi Jitsuhiro, Tomoji Toriyama, Kiyoshi Kogure
2007Robust speech recognition with on-line unsupervised acoustic feature compensation.
Luis Buera, Antonio Miguel, Eduardo Lleida, Oscar Saz, Alfonso Ortega
2007Robust topic inference for latent semantic language model adaptation.
Aaron Heidel, Lin-Shan Lee
2007Roles of high-fidelity acoustic modeling in robust speech recognition.
Li Deng
2007Semantic translation error rate for evaluating translation systems.
Krishna Subramanian, David Stallard, Rohit Prasad, Shirin Saleem, Prem Natarajan
2007Sensei: Spoken language assessment for call center agents.
Abhishek Chandel, Abhinav Parate, Maymon Madathingal, Himanshu Pant, Nitendra Rajput, Shajith Ikbal, Om Deshmukh, Ashish Verma
2007Soundbite identification using reference and automatic transcripts of broadcast news speech.
Feifan Liu, Yang Liu
2007Speech enhancement using PCA and variance of the reconstruction error in distributed speech recognition.
Amin Haji Abolhassani, Sid-Ahmed Selouani, Douglas D. O'Shaughnessy
2007Speech recognition with localized time-frequency pattern detectors.
Ken Schutte, James R. Glass
2007Speech-translation: from domain-limited to domain-unlimited translation tasks.
Stephan Vogel
2007Speechfind for CDP: Advances in spoken document retrieval for the U. S. collaborative digitization program.
Wooil Kim, John H. L. Hansen
2007Spoken document summarization using relevant information.
Yi-Ting Chen, Shih-Hsiang Lin, Hsin-Min Wang, Berlin Chen
2007Spoken language understanding with kernels for syntactic/semantic structures.
Alessandro Moschitti, Giuseppe Riccardi, Christian Raymond
2007Spoken language understanding: a survey.
Renato De Mori
2007State-dependent mixture tying with variable codebook size for accented speech recognition.
Yi Liu, Fang Zheng, Lei He, Yunqing Xia
2007Submodularity and adaptation.
Jeff A. Bilmes
2007The GALE project: A description and an update.
Jordan Cohen
2007The IBM 2007 speech transcription system for European parliamentary speeches.
Bhuvana Ramabhadran, Olivier Siohan, Abhinav Sethy
2007The LIMSI QAst systems: Comparison between human and automatic rules generation for question-answering on speech transcriptions.
Sophie Rosset, Olivier Galibert, Gilles Adda, Éric Bilinski
2007The RWTH Arabic-to-English spoken language translation system.
Oliver Bender, Evgeny Matusov, Stefan Hahn, Sasa Hasan, Shahram Khadivi, Hermann Ney
2007The Titech large vocabulary WFST speech recognition system.
Paul R. Dixon, Diamantino Caseiro, Tasuku Oonishi, Sadaoki Furui
2007Topic identification from audio recordings using word and phone recognition lattices.
Timothy J. Hazen, Fred Richardson, Anna Margolis
2007Towards bottom-up continuous phone recognition.
Sabato Marco Siniscalchi, Torbjørn Svendsen, Chin-Hui Lee
2007Towards robust automatic evaluation of pathologic telephone speech.
Korbinian Riedhammer, Georg Stemmer, Tino Haderlein, Maria Schuster, Frank Rosanowski, Elmar Nöth, Andreas K. Maier
2007Towards spoken-document retrieval for the enterprise: Approximate word-lattice indexing with text indexers.
Frank Seide, Peng Yu, Yu Shi
2007Training data selection for improving discriminative training of acoustic models.
Shih-Hung Liu, Fang-Hui Chu, Shih-Hsiang Lin, Hung-Shin Lee, Berlin Chen
2007Two extensions to ensemble speaker and speaking environment modeling for robust automatic speech recognition.
Yu Tsao, Chin-Hui Lee
2007Type-II dialogue systems for information access from unstructured knowledge sources.
Yi-Cheng Pan, Lin-Shan Lee
2007Uncertainty in training large vocabulary speech recognizers.
Amarnag Subramanya, Chris D. Bartels, Jeff A. Bilmes, Patrick Nguyen
2007Unsupervised state clustering for stochastic dialog management.
Fabrice Lefèvre, Renato De Mori
2007Use of syllable nuclei locations to improve ASR.
Chris D. Bartels, Jeff A. Bilmes
2007Using particle filters to track dialogue state.
Jason D. Williams
2007Variational Kullback-Leibler divergence for Hidden Markov models.
John R. Hershey, Peder A. Olsen, Steven J. Rennie
2007Voice search - Information access via voice queries.
Ye-Yi Wang
2007Voice/audio information retrieval: minimizing the need for human ears.
Mark Clements, Marsal Gavaldà