| 2007 | A Mandarin lecture speech transcription system for speech summarization. Ricky Ho Yin Chan, Justin Jian Zhang, Pascale Fung, Lu Cao |
| 2007 | A compact semidefinite programming (SDP) formulation for large margin estimation of HMMS in speech recognition. Yan Yin, Hui Jiang |
| 2007 | A comparisonal study of the multi-layer Kohonen self-organizing feature maps for spoken language identification. Liang Wang, Eliathamby Ambikairajah, Eric H. C. Choi |
| 2007 | A constrained line search approach to general discriminative HMM training. Peng Liu, Cong Liu, Hui Jiang, Frank K. Soong, Ren-Hua Wang |
| 2007 | A data-centric architecture for data-driven spoken dialog systems. Sebastian Varges, Giuseppe Riccardi |
| 2007 | A fast-match approach for robust, faster than real-time speaker diarization. Yan Huang, Oriol Vinyals, Gerald Friedland, Christian A. Müller, Nikki Mirghafori, Chuck Wooters |
| 2007 | A language modeling approach to question answering on speech transcripts. Matthias H. Heie, Edward W. D. Whittaker, Josef R. Novak, Sadaoki Furui |
| 2007 | A method for evaluating and comparing user simulations: The Cramér-von Mises divergence. Jason D. Williams |
| 2007 | A multi-layer architecture for semi-synchronous event-driven dialogue management. Antoine Raux, Maxine Eskénazi |
| 2007 | A novel weighting technique for fusing Language Identification systems based on pair-wise performances. Bo Yin, Eliathamby Ambikairajah, Fang Chen |
| 2007 | A study of lattice-based spoken term detection for Chinese spontaneous speech. Sha Meng, Peng Yu, Frank Seide, Jia Liu |
| 2007 | A study on rescoring using HMM-based detectors for continuous speech recognition. Qiang Fu, Biing-Hwang Juang |
| 2007 | A study on soft margin estimation for LVCSR. Jinyu Li, Zhi-Jie Yan, Chin-Hui Lee, Ren-Hua Wang |
| 2007 | A system for speech driven information retrieval. César González Ferreras, Valentín Cardeñoso-Payo |
| 2007 | Adapting grapheme-to-phoneme conversion for name recognition. Xiao Li, Asela Gunawardana, Alex Acero |
| 2007 | Advances in Arabic broadcast news transcription at RWTH. David Rybach, Stefan Hahn, Christian Gollan, Ralf Schlüter, Hermann Ney |
| 2007 | Agglomerative information bottleneck for speaker diarization of meetings data. Deepu Vijayasenan, Fabio Valente, Hervé Bourlard |
| 2007 | An algorithm for fast composition of weighted finite-state transducers. John W. McDonough, Emilian Stoimenov, Dietrich Klakow |
| 2007 | An enhanced minimum classification error learning framework for balancing insertion, deletion and substitution errors. Yuan-Fu Liao, Jia Jang Tu, Sen-Chia Chang, Chin-Hui Lee |
| 2007 | Analytical comparison between position specific posterior lattices and confusion networks based on words and subword units for spoken document indexing. Yi-Cheng Pan, Hung-Lin Chang, Lin-Shan Lee |
| 2007 | Automatic detection of contrastive elements in spontaneous speech. Ani Nenkova, Dan Jurafsky |
| 2007 | Automatic lexical pronunciations generation and update. Ghinwa F. Choueiter, Stephanie Seneff, James R. Glass |
| 2007 | Automatic speech recognition based on weighted minimum classification error (W-MCE) training method. Qiang Fu, Biing-Hwang Juang |
| 2007 | Bayesian adaptation in HMM training and decoding using a mixture of feature transforms. Stavros Tsakalidis, Spyros Matsoukas |
| 2007 | Broad phonetic class recognition in a Hidden Markov model framework using extended Baum-Welch transformations. Tara N. Sainath, Dimitri Kanevsky, Bhuvana Ramabhadran |
| 2007 | Building a highly accurate Mandarin speech recognizer. Mei-Yuh Hwang, Gang Peng, Wen Wang, Arlo Faria, Aaron Heidel, Mari Ostendorf |
| 2007 | Call classification for automated troubleshooting on large corpora. Keelan Evanini, David Suendermann, Roberto Pieraccini |
| 2007 | Combining statistical models with symbolic grammar in parsing. Junichi Tsujii |
| 2007 | Comparing one and two-stage acoustic modeling in the recognition of emotion in speech. Björn W. Schuller, Bogdan Vlasenko, Ricardo Minguez, Gerhard Rigoll, Andreas Wendemuth |
| 2007 | Consolidation based speech translation. Chiori Hori, Bing Zhao, Stephan Vogel, Alex Waibel |
| 2007 | Crosslingual acoustic model development for automatics speech recognition. Frank Diehl, Asunción Moreno, Enric Monte |
| 2007 | Data selection for speech recognition. Yi Wu, Rong Zhang, Alexander I. Rudnicky |
| 2007 | Dealing with cross-lingual aspects in spoken name recognition. Frederik Stouten, Jean-Pierre Martens |
| 2007 | Deriving salient learners' mispronunciations from cross-language phonological comparisons. Helen Mei-Ling Meng, Yuen Yee Lo, Lan Wang, Wing Yiu Lau |
| 2007 | Design and implementation of a robot audition system for automatic speech recognition of simultaneous speech. Shun'ichi Yamamoto, Kazuhiro Nakadai, Mikio Nakano, Hiroshi Tsujino, Jean-Marc Valin, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno |
| 2007 | Development and portability of ASR and Q&A modules for real-environment speech-oriented guidance systems. Tobias Cincarek, Hiromichi Kawanami, Hiroshi Saruwatari, Kiyohiro Shikano |
| 2007 | Development of VAD evaluation framework CENSREC-1-C and investigation of relationship between VAD and speech recognition performance. Norihide Kitaoka, Kazumasa Yamamoto, Tomohiro Kusamizu, Seiichi Nakagawa, Takeshi Yamada, Satoru Tsuge, Chiyomi Miyajima, Takanobu Nishiura, Masato Nakayama, Yuki Denda, Masakiyo Fujimoto, Tetsuya Takiguchi, Satoshi Tamura, Shingo Kuroiwa, Kazuya Takeda, Satoshi Nakamura |
| 2007 | Development of a phonetic system for large vocabulary Arabic speech recognition. Mark J. F. Gales, Frank Diehl, Chandra Kant Raut, Marcus Tomalin, Philip C. Woodland, Kai Yu |
| 2007 | Development of the 2007 RWTH Mandarin LVCSR system. Björn Hoffmeister, Christian Plahl, Peter Fritz, Georg Heigold, Jonas Lööf, Ralf Schlüter, Hermann Ney |
| 2007 | Discriminative language model adaptation for Mandarin broadcast speech transcription and translation. Xunying Liu, William J. Byrne, Mark J. F. Gales, Adrià de Gispert, Marcus Tomalin, Philip C. Woodland, Kai Yu |
| 2007 | Discriminative training of multi-state barge-in models. Andrej Ljolje, Vincent Goffin |
| 2007 | Dynamic language modeling for a daily broadcast news transcription system. Ciro Martins, António J. S. Teixeira, João Paulo Neto |
| 2007 | Dynamic vocabulary prediction for isolated-word dictation on embedded devices. Jussi Leppänen, Jilei Tian |
| 2007 | Efficient combination of parametric spaces, models and metrics for speaker diarization Themos Stafylakis, Vassilis Katsouros, George Carayannis |
| 2007 | Efficient use of overlap information in speaker diarization. Scott Otterson, Mari Ostendorf |
| 2007 | Empirical study of neural network language models for Arabic speech recognition. Ahmad Emami, Lidia Mangu |
| 2007 | Error simulation for training statistical dialogue systems. Jost Schatzmann, Blaise Thomson, Steve J. Young |
| 2007 | Example-based error recovery strategy for spoken dialog system. Cheongjae Lee, Sangkeun Jung, Donghyeon Lee, Gary Geunbae Lee |
| 2007 | Experiments on cross-system acoustic model adaptation. Diego Giuliani, Fabio Brugnara |
| 2007 | Exploiting complementary aspects of phonological features in automatic speech recognition. Parya Momayyez, James Waterhouse, Richard Rose |
| 2007 | Extensible speech recognition system using proxy-agent. Teppei Nakano, Shinya Fujie, Tetsunori Kobayashi |
| 2007 | Factor analysis of acoustic features for streamed hidden Markov modeling. Chuan-Wei Ting, Jen-Tzung Chien |
| 2007 | Fast audio search using vector space modelling. Brett Matthews, Upendra V. Chaudhari, Bhuvana Ramabhadran |
| 2007 | Generalized linear interpolation of language models. Bo-June Paul Hsu |
| 2007 | Graph-based learning for phonetic classification. Andrei Alexandrescu, Katrin Kirchhoff |
| 2007 | HMM training based on CV-EM and CV Gaussian mixture optimization. Takahiro Shinozaki, Tatsuya Kawahara |
| 2007 | Hierarchical Pitman-Yor language models for ASR in meetings. Songfang Huang, Steve Renals |
| 2007 | Hierarchical large-margin Gaussian mixture models for phonetic classification. Hung-An Chang, James R. Glass |
| 2007 | High-performance hmm adaptation with joint compensation of additive and convolutive distortions via Vector Taylor Series. Jinyu Li, Li Deng, Dong Yu, Yifan Gong, Alex Acero |
| 2007 | IEEE Workshop on Automatic Speech Recognition & Understanding, ASRU 2007, Kyoto, Japan, December 9-13, 2007 Sadaoki Furui, Tatsuya Kawahara |
| 2007 | Implicit user-adaptive system engagement in speech, pen and multimodal interfaces. Sharon L. Oviatt |
| 2007 | Improvements in phone based audio search via constrained match with high order confusion estimates. Upendra V. Chaudhari, Michael Picheny |
| 2007 | Improving lecture speech summarization using rhetorical information. Justin Jian Zhang, Ricky Ho Yin Chan, Pascale Fung |
| 2007 | Incorporating the voicing information into HMM-based automatic speech recognition. Peter Jancovic, Münevver Köküer |
| 2007 | Integrating several annotation layers for statistical information distillation. Michael Levit, Dilek Hakkani-Tür, Gökhan Tür, Daniel Gillick |
| 2007 | Interpolation of lost speech segments using LP-HNM model with codebook-mapping post-processing. Esfandiar Zavarehei, Saeed Vaseghi |
| 2007 | Interpolative variable frame rate transmission of speech features for distributed speech recognition. Huiqun Deng, Douglas D. O'Shaughnessy, Jean-Guy Dahan, William F. Ganong III |
| 2007 | Introduction of the METI project "development of fundamental speech recognition technology". Sadaoki Furui, Tetsunori Kobayashi |
| 2007 | Investigating linguistic knowledge in a maximum entropy token-based language model. Jia Cui, Yi Su, Keith B. Hall, Frederick Jelinek |
| 2007 | Investigating the use of speech features and their corresponding distribution characteristics for robust speech recognition. Shih-Hsiang Lin, Yao-Ming Yeh, Berlin Chen |
| 2007 | Joint decoding of multiple speech patterns for robust speech recognition. Nishanth Ulhas Nair, T. V. Sreenivas |
| 2007 | Lattice-based Viterbi decoding techniques for speech translation. George Saon, Michael Picheny |
| 2007 | Maximum entropy model parameterization with TF∗IDF weighted vector space model. Ye-Yi Wang, Alex Acero |
| 2007 | Minimum mutual information beamforming for simultaneous active speakers. Ken'ichi Kumatani, Uwe Mayer, Tobias Gehrig, Emilian Stoimenov, John W. McDonough, Matthias Wölfel |
| 2007 | Mixture Gaussian HMM-trajctory method using likelihood compensation. Yasuhiro Minami |
| 2007 | Modulation spectrum equalization for robust speech recognition. Liang-Che Sun, Chang-Wen Hsu, Lin-Shan Lee |
| 2007 | Monolingual and crosslingual comparison of tandem features derived from articulatory and phone MLPS. Özgür Çetin, Mathew Magimai-Doss, Karen Livescu, Arthur Kantor, Simon King, Chris D. Bartels, Joe Frankel |
| 2007 | Multi-stream dialect classification using SVM-GMM hybrid classifiers. Rahul Chitturi, John H. L. Hansen |
| 2007 | Multiple feature combination to improve speaker diarization of telephone conversations. Vishwa Gupta, Patrick Kenny, Pierre Ouellet, Gilles Boulianne, Pierre Dumouchel |
| 2007 | Never-ending learning system for on-line speaker diarization. Konstantin Markov, Satoshi Nakamura |
| 2007 | Non-native pronunciation variation modeling using an indirect data driven method. Mina Kim, Yoo Rhee Oh, Hong Kook Kim |
| 2007 | Non-native speech databases. Martin Raab, Rainer Gruhn, Elmar Nöth |
| 2007 | OOV detection by joint word/phone lattice alignment. Hui Lin, Jeff A. Bilmes, Dimitra Vergyri, Katrin Kirchhoff |
| 2007 | Phonological feature based variable frame rate scheme for improved speech recognition. Abhijeet Sangwan, John H. L. Hansen |
| 2007 | Predictive linear transforms for noise robust speech recognition. Mark J. F. Gales, Rogier C. van Dalen |
| 2007 | Random discriminant structure analysis for automatic recognition of connected vowels. Yu Qiao, Satoshi Asakawa, Nobuaki Minematsu |
| 2007 | Recognition and understanding of meetings the AMI and AMIDA projects. Steve Renals, Thomas Hain, Hervé Bourlard |
| 2007 | Refine bigram PLSA model by assigning latent topics unevenly. Jiazhong Nie, Runxin Li, Dingsheng Luo, Xihong Wu |
| 2007 | Regularization, adaptation, and non-independent features improve hidden conditional random fields for phone classification. Yun-Hsuan Sung, Constantinos Boulis, Christopher D. Manning, Dan Jurafsky |
| 2007 | Reranking machine translation hypotheses with structured and web-based language models. Wen Wang, Andreas Stolcke, Jing Zheng |
| 2007 | Robust speaker clustering strategies to data source variation for improved speaker diarization. Kyu Jeong Han, Samuel Kim, Shrikanth S. Narayanan |
| 2007 | Robust speech recognition by properly utilizing reliable frames and segments in corrupted signals. Yi Chen, Chia-Yu Wan, Lin-Shan Lee |
| 2007 | Robust speech recognition using noise suppression based on multiple composite models and multi-pass search. Takatoshi Jitsuhiro, Tomoji Toriyama, Kiyoshi Kogure |
| 2007 | Robust speech recognition with on-line unsupervised acoustic feature compensation. Luis Buera, Antonio Miguel, Eduardo Lleida, Oscar Saz, Alfonso Ortega |
| 2007 | Robust topic inference for latent semantic language model adaptation. Aaron Heidel, Lin-Shan Lee |
| 2007 | Roles of high-fidelity acoustic modeling in robust speech recognition. Li Deng |
| 2007 | Semantic translation error rate for evaluating translation systems. Krishna Subramanian, David Stallard, Rohit Prasad, Shirin Saleem, Prem Natarajan |
| 2007 | Sensei: Spoken language assessment for call center agents. Abhishek Chandel, Abhinav Parate, Maymon Madathingal, Himanshu Pant, Nitendra Rajput, Shajith Ikbal, Om Deshmukh, Ashish Verma |
| 2007 | Soundbite identification using reference and automatic transcripts of broadcast news speech. Feifan Liu, Yang Liu |
| 2007 | Speech enhancement using PCA and variance of the reconstruction error in distributed speech recognition. Amin Haji Abolhassani, Sid-Ahmed Selouani, Douglas D. O'Shaughnessy |
| 2007 | Speech recognition with localized time-frequency pattern detectors. Ken Schutte, James R. Glass |
| 2007 | Speech-translation: from domain-limited to domain-unlimited translation tasks. Stephan Vogel |
| 2007 | Speechfind for CDP: Advances in spoken document retrieval for the U. S. collaborative digitization program. Wooil Kim, John H. L. Hansen |
| 2007 | Spoken document summarization using relevant information. Yi-Ting Chen, Shih-Hsiang Lin, Hsin-Min Wang, Berlin Chen |
| 2007 | Spoken language understanding with kernels for syntactic/semantic structures. Alessandro Moschitti, Giuseppe Riccardi, Christian Raymond |
| 2007 | Spoken language understanding: a survey. Renato De Mori |
| 2007 | State-dependent mixture tying with variable codebook size for accented speech recognition. Yi Liu, Fang Zheng, Lei He, Yunqing Xia |
| 2007 | Submodularity and adaptation. Jeff A. Bilmes |
| 2007 | The GALE project: A description and an update. Jordan Cohen |
| 2007 | The IBM 2007 speech transcription system for European parliamentary speeches. Bhuvana Ramabhadran, Olivier Siohan, Abhinav Sethy |
| 2007 | The LIMSI QAst systems: Comparison between human and automatic rules generation for question-answering on speech transcriptions. Sophie Rosset, Olivier Galibert, Gilles Adda, Éric Bilinski |
| 2007 | The RWTH Arabic-to-English spoken language translation system. Oliver Bender, Evgeny Matusov, Stefan Hahn, Sasa Hasan, Shahram Khadivi, Hermann Ney |
| 2007 | The Titech large vocabulary WFST speech recognition system. Paul R. Dixon, Diamantino Caseiro, Tasuku Oonishi, Sadaoki Furui |
| 2007 | Topic identification from audio recordings using word and phone recognition lattices. Timothy J. Hazen, Fred Richardson, Anna Margolis |
| 2007 | Towards bottom-up continuous phone recognition. Sabato Marco Siniscalchi, Torbjørn Svendsen, Chin-Hui Lee |
| 2007 | Towards robust automatic evaluation of pathologic telephone speech. Korbinian Riedhammer, Georg Stemmer, Tino Haderlein, Maria Schuster, Frank Rosanowski, Elmar Nöth, Andreas K. Maier |
| 2007 | Towards spoken-document retrieval for the enterprise: Approximate word-lattice indexing with text indexers. Frank Seide, Peng Yu, Yu Shi |
| 2007 | Training data selection for improving discriminative training of acoustic models. Shih-Hung Liu, Fang-Hui Chu, Shih-Hsiang Lin, Hung-Shin Lee, Berlin Chen |
| 2007 | Two extensions to ensemble speaker and speaking environment modeling for robust automatic speech recognition. Yu Tsao, Chin-Hui Lee |
| 2007 | Type-II dialogue systems for information access from unstructured knowledge sources. Yi-Cheng Pan, Lin-Shan Lee |
| 2007 | Uncertainty in training large vocabulary speech recognizers. Amarnag Subramanya, Chris D. Bartels, Jeff A. Bilmes, Patrick Nguyen |
| 2007 | Unsupervised state clustering for stochastic dialog management. Fabrice Lefèvre, Renato De Mori |
| 2007 | Use of syllable nuclei locations to improve ASR. Chris D. Bartels, Jeff A. Bilmes |
| 2007 | Using particle filters to track dialogue state. Jason D. Williams |
| 2007 | Variational Kullback-Leibler divergence for Hidden Markov models. John R. Hershey, Peder A. Olsen, Steven J. Rennie |
| 2007 | Voice search - Information access via voice queries. Ye-Yi Wang |
| 2007 | Voice/audio information retrieval: minimizing the need for human ears. Mark Clements, Marsal Gavaldà |