INTERSPEECH 2008, 9th Annual Conference of the International Speech Communication Association, Brisbane, Australia, September 22-26, 2008

researchr

You are not signed in
Sign in
Sign up

INTERSPEECH 2008, 9th Annual Conference of the International Speech Communication Association, Brisbane, Australia, September 22-26, 2008. ISCA, 2008.

Conference: interspeech2008

Abstract is missing.

In search of models in speech communication researchHiroya Fujisaki. 1-10 [doi]

Dealing with limited and noisy data in ASR: a hybrid knowledge-based and statistical approachAbeer Alwan. 11-15 [doi]

Forensic automatic speaker recognition: fiction or science?Joaquin Gonzalez-Rodriguez. 16-17 [doi]

Modelling rapport in embodied conversational agentsJustine Cassell. 18-19 [doi]

Agglomerative hierarchical speaker clustering using incremental Gaussian mixture cluster modelingKyu Jeong Han, Shrikanth S. Narayanan. 20-23 [doi]

Weighted segmental k-means initialization for SOM-based speaker clusteringOshry Ben-Harush, Itshak Lapidot, Hugo Guterman. 24-27 [doi]

Learning essential speaker sub-space using hetero-associative neural networks for speaker clusteringShajith Ikbal, Karthik Visweswariah. 28-31 [doi]

Two s a crowd: improving speaker diarization by automatically identifying and excluding overlapped speechKofi Boakye, Oriol Vinyals, Gerald Friedland. 32-35 [doi]

T-test distance and clustering criterion for speaker diarizationTrung Hieu Nguyen, Engsiong Chng, Haizhou Li. 36-39 [doi]

Integration of TDOA features in information bottleneck framework for fast speaker diarizationDeepu Vijayasenan, Fabio Valente, Hervé Bourlard. 40-43 [doi]

Low complexity near-optimal unit-selection algorithm for ultra low bit-rate speech coding based on n-best lattice and Viterbi searchV. Ramasubramanian, D. Harish. 44 [doi]

A new fast algebraic fixed codebook search algorithm in CELP speech codingVaclav Eksler, Redwan Salami, Milan Jelinek. 45-48 [doi]

A novel transcoding algorithm between 3GPP AMR-NB (7.95kbit/s) and ITU-t g.729a (8kbit/s)Hao Xu, Changchun Bao. 49-52 [doi]

Mel-frequency cepstral coefficient-based bandwidth extension of narrowband speechAmr H. Nour-Eldin, Peter Kabal. 53-56 [doi]

A PCM coding noise reduction for ITU-t g.711.1Jean-Luc Garcia, Claude Marro, Balázs Kövesi. 57-60 [doi]

An instrumental measure for end-to-end speech transmission quality based on perceptual dimensions: framework and realizationMarcel Wältermann, Kirstin Scholz, Sebastian Möller, Lu Huo, Alexander Raake, Ulrich Heute. 61-64 [doi]

Duration and F0 interval of utterance-final intonation contours in the perception of German sentence modalityBenno Peters, Hartmut R. Pfitzinger. 65-68 [doi]

Contrastive utterances make alternatives salient - cross-modal priming evidenceBettina Braun, Lara Tagliapietra, Anne Cutler. 69 [doi]

Exploring a mechanism of speech sychronization using auditory delayed experimentsMasato Ishizaki, Yasuharu Den, Senshi Fukashiro. 70-73 [doi]

Prosodic manifestations of confidence and uncertainty in spoken languageHeather Pon-Barry. 74-77 [doi]

Identifying relevant phrases to summarize decisions in spoken meetingsRaquel Fernández, Matthew Frampton, John Dowding, Anish Adukuzhiyil, Patrick Ehlen, Stanley Peters. 78-81 [doi]

Recovering participant identities in meetings from a probabilistic description of vocal interactionKornel Laskowski, Tanja Schultz. 82-85 [doi]

Coarticulation in nasal and lateral clusters in WarlpiriJanet Fletcher, Deborah Loakes, Andrew Butcher. 86-89 [doi]

Phonetically prestopped laterals in Australian languages: a preliminary investigation of WarlpiriDeborah Loakes, Andrew Butcher, Janet Fletcher, Hywel Stoakes. 90-93 [doi]

Connected speech processes in WarlpiriJohn Ingram, Mary Laughren, Jeff Chapman. 94 [doi]

Text, rhythm and metrical form in an Aboriginal song seriesMyfany Turpin. 96-98 [doi]

Statistical speech activity detection based on spatial power distribution for analyses of poster presentationsKentaro Ishizuka, Shoko Araki, Tatsuya Kawahara. 99-102 [doi]

A statistical model-based voice activity detection employing minimum classification error techniqueSang-Ick Kang, Ji-Hyun Song, Kye-Hwan Lee, Yun-Sik Park, Joon-Hyuk Chang. 103-106 [doi]

Comparative evaluation of different methods for voice activity detectionHongfei Ding, Koichi Yamamoto, Masami Akamine. 107-110 [doi]

Speech/non-speech segments detection based on chaotic and prosodic featuresSoheil Shafiee, Farshad Almasganj, Ayyoob Jafari. 111-114 [doi]

Acoustic event classification using a distributed microphone network with a GMM/SVM combined algorithmChristian Zieger, Maurizio Omologo. 115-118 [doi]

Intentional voice command detection for completely hands-free speech interface in home environmentsYasunari Obuchi, Masahito Togami, Takashi Sumiyoshi. 119-122 [doi]

Fusion of audio and video modalities for detection of acoustic eventsTaras Butko, Andrey Temko, Climent Nadeu, Cristian Canton-Ferrer. 123-126 [doi]

DySANA: dynamic speech and noise adaptation for voice activity detectionRon J. Weiss, Trausti T. Kristjansson. 127-130 [doi]

A comprehensive study on the effects of room reverberation on fundamental frequency estimationRico Petrick, Masashi Unoki, Anish Mittal, Carlos Segura, Rüdiger Hoffmann. 131-134 [doi]

A hybrid speech signal based algorithm for pitch marking using finite state machinesHussein Hussein, Matthias Wolff, Oliver Jokisch, Frank Duckhorn, Guntram Strecha, Rüdiger Hoffmann. 135-138 [doi]

Parameter estimation method of F0 control model for singing voicesYasunori Ohishi, Hirokazu Kameoka, Kunio Kashino, Kazuya Takeda. 139-142 [doi]

An algorithm for multi-pitch tracking in co-channel speechSrikanth Vishnubhotla, Carol Y. Espy-Wilson. 143-146 [doi]

Multipitch tracking using a factorial hidden Markov modelMichael Wohlmayr, Franz Pernkopf. 147-150 [doi]

Cochannel speech separation using multi-pitch estimation and model based voiced sequential groupingMing Li, Chuan Cao, Di Wang, Ping Lu, Qiang Fu, YongHong Yan. 151-154 [doi]

Crosscorrelation of adjacent spectra enhances fundamental frequency trackingPhilippe Martin. 155-158 [doi]

Enhancement of noisy speech recordings via blind source separationJirí Málek, Zbynek Koldovský, Jindrich Zdánský, Jan Nouza. 159-162 [doi]

Studies on estimation of the number of sources in blind source separationTakaaki Ishibashi, Hidetoshi Nakashima, Hiromu Gotanda. 163-166 [doi]

Speech enhancement based on hypothesized Wiener filteringV. Ramasubramanian, Deepak Vijaywargi. 167-170 [doi]

Psychoacoustically-motivated adaptive β-order generalized spectral subtraction based on data-driven optimizationJunfeng Li, Hui Jiang, Masato Akagi. 171-174 [doi]

Two stage iterative Wiener filtering for speech enhancementKrishna Nand K., T. V. Sreenivas. 175-178 [doi]

Assessment of correlation between objective measures and speech recognition performance in the evaluation of speech enhancementPei Ding, Jie Hao. 179-182 [doi]

Predicting ASR errors by exploiting barge-in rate of individual users for spoken dialogue systemsKazunori Komatani, Tatsuya Kawahara, Hiroshi G. Okuno. 183-186 [doi]

Expanding vocabulary for recognizing user s abbreviations of proper nouns without increasing ASR error rates in spoken dialogue systemsMasaki Katsumaru, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno. 187-190 [doi]

Exploiting the ASR n-best by tracking multiple dialog state hypothesesJason D. Williams. 191-194 [doi]

A spoken language interpretation component for a robot dialogue systemEnes Makalic, Ingrid Zukerman, Michael Niemann. 195-198 [doi]

MUESLI: multiple utterance error correction for a spoken language interfaceFederico Cesari, Horacio Franco, Gregory K. Myers, Harry Bratt. 199-202 [doi]

Methods to optimize transcription of on-line mediaSarah Conrod, Sara H. Basson, Dimitri Kanevsky. 203-206 [doi]

Discrimination of task-related words for vocabulary design of spoken dialog systemsAkinori Ito, Toyomi Meguro, Shozo Makino, Motoyuki Suzuki. 207-210 [doi]

Dialog management using weighted finite-state transducersChiori Hori, Kiyonori Ohtake, Teruhisa Misu, Hideki Kashioka, Satoshi Nakamura. 211-214 [doi]

Probabilistic answer selection based on conditional random fields for spoken dialog systemYoshitaka Yoshimi, Ryota Kakitsuba, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda. 215-218 [doi]

Let s go lab: a platform for evaluation of spoken dialog systems with real world usersMaxine Eskenazi, Alan W. Black, Antoine Raux, Brian Langner. 219 [doi]

The impact of language dynamics on the capitalization of broadcast newsFernando Batista, Nuno J. Mamede, Isabel Trancoso. 220-223 [doi]

Lightly supervised acoustic model training on EPPS recordingsMatthias Paulik, Alex Waibel. 224-227 [doi]

Fast call-classification system development without in-domain training dataChristophe Servan, Frédéric Béchet. 228-231 [doi]

iCNC and iROVER: the limits of improving system combination with classification?Björn Hoffmeister, Ralf Schlüter, Hermann Ney. 232-235 [doi]

System combination for spoken language understandingStefan Hahn, Patrick Lehnen, Hermann Ney. 236-239 [doi]

Multidimensional features of emotional speechTomoko Suzuki, Machiko Ikemoto, Tomoko Sano, Toshihiko Kinoshita. 240 [doi]

Leveraging emotion detection using emotions from yes-no answersNarjès Boufaden, Pierre Dumouchel. 241-244 [doi]

Vowel placement during operatic singing: come si parla or aggiustamento ?Thomas John Millhouse, Dianna T. Kenny. 245-248 [doi]

Study on strained rough voice as a conveyer of rageYumiko O. Kato, Yoshifumi Hirose, Takahiro Kamai. 249-252 [doi]

Integrating rule and template-based approaches for emotional Malay speech synthesisMumtaz Begum, Raja Noor Ainon, Roziati Zainuddin, Zuraidah M. Don, Gerry Knowles. 253-256 [doi]

The expression and perception of emotions: comparing assessments of self versus othersCarlos Busso, Shrikanth S. Narayanan. 257-260 [doi]

On the role of acting skills for the collection of simulated emotional speechEmiel Krahmer, Marc Swerts. 261-264 [doi]

Detection of security related affect and behaviour in passenger transportBjörn Schuller, Matthias Wimmer, Dejan Arsic, Tobias Moosmayr, Gerhard Rigoll. 265-268 [doi]

Soft margin estimation with various separation levels for LVCSRJinyu Li, Zhi-Jie Yan, Chin-Hui Lee, Ren-Hua Wang. 269-272 [doi]

On the equivalence of Gaussian and log-linear HMMsGeorg Heigold, Patrick Lehnen, Ralf Schlüter, Hermann Ney. 273-276 [doi]

Generalization of extended baum-welch parameter estimation for discriminative training and decodingDimitri Kanevsky, Tara N. Sainath, Bhuvana Ramabhadran, David Nahamoo. 277-280 [doi]

An ellipsoid constrained quadratic programming perspective to discriminative training of HMMsPeng Liu, Frank K. Soong. 281-284 [doi]

Discriminative training of variable-parameter HMMs for noise robust speech recognitionDong Yu, Li Deng, Yifan Gong, Alex Acero. 285-288 [doi]

Towards a non-parametric acoustic model: an acoustic decision tree for observation probability calculationJasha Droppo, Michael L. Seltzer, Alex Acero, Yu-Hsiang Bosco Chiu. 289-292 [doi]

Experiments with the ABI (accents of the british isles) speech corpusShona D Arcy, Martin J. Russell. 293-296 [doi]

Politecnico di Torino system for the 2007 NIST language recognition evaluationFabio Castaldo, Emanuele Dalmasso, Pietro Laface, Daniele Colibro, Claudio Vair. 297-300 [doi]

Discriminative training and channel compensation for acoustic language recognitionValiantsina Hubeika, Lukas Burget, Pavel Matejka, Petr Schwarz. 301-304 [doi]

Comparison of variable selection methods and classifiers for native accent identificationTingyao Wu, Peter Karsmakers, Hugo Van Hamme, Dirk Van Compernolle. 305-308 [doi]

A comparison of subspace feature-domain methods for language recognitionWilliam M. Campbell, Douglas E. Sturim, Pedro A. Torres-Carrasquillo, Douglas A. Reynolds. 309-312 [doi]

Context-dependent phone models and models adaptation for phonotactic language recognitionMohamed Faouzi BenZeghiba, Jean-Luc Gauvain, Lori Lamel. 313-316 [doi]

Emotions and articulatory precisionMartijn Goudbeek, Jean Philippe Goldman, Klaus R. Scherer. 317 [doi]

Assessing agreement of observer- and self-annotations in spontaneous multimodal emotion dataKhiet P. Truong, Mark A. Neerincx, David A. van Leeuwen. 318-321 [doi]

Emotion recognition in spontaneous emotional speech for anonymity-protected voice chat systemsYoshiko Arimoto, Hiromi Kawatsu, Sumio Ohno, Hitoshi Iida. 322-325 [doi]

Assigning suitable phrasal tones and pitch accents by sensing affective information from text to synthesize human-like speechShaikh Mostafa Al Masum, M. Khademul Islam Molla, Keikichi Hirose. 326-329 [doi]

Cross-language study of vocal correlates of affective statesIrena Yanushevskaya, Ailbhe Ní Chasaide, Christer Gobl. 330-333 [doi]

Gender-related differences in the production and perception of emotionMarc Swerts, Emiel Krahmer. 334-337 [doi]

The English pronunciation of successive groups of Maori speakersCatherine I. Watson, Margaret Maclagan, Jeanette King, Ray Harlow. 338-341 [doi]

Reversal of short front vowel raising in Australian EnglishFelicity Cox, Sallyanne Palethorpe. 342-345 [doi]

GOOSE on the move: a study of /u/-fronting in Australian news speechJennifer Price. 346 [doi]

The vowels of Australian Aboriginal EnglishAndrew Butcher, Victoria Anderson. 347-350 [doi]

Perception and production of /i: /, /i@/ and /e: / in australian EnglishRobert H. Mannell. 351-354 [doi]

An expert system in speaker verification taskZbynek Zajíc, Lukás Machlica, Ales Padrta, Jan Vanek, Vlasta Radová. 355-358 [doi]

Cascading appearance-based features for visual speaker verificationDavid Dean, Sridha Sridharan, Patrick Lucey. 359-362 [doi]

Improved novelty detection for online GMM based speaker diarizationKonstantin Markov, Satoshi Nakamura. 363-366 [doi]

Analysis of impostor tests with high scores in NIST-SRE contextSalah Eddine Mezaache, Jean-François Bonastre, Driss Matrouf. 367-370 [doi]

Reinforced temporal structure information for embedded utterance-based speaker recognitionAnthony Larcher, Jean-François Bonastre, John S. D. Mason. 371-374 [doi]

Fast search for common segments in speech signals for speaker verificationMichael Gerber, Beat Pfister. 375-378 [doi]

Audio-visual multilevel fusion for speech and speaker recognitionGirija Chetty, Michael Wagner. 379-382 [doi]

Clustering initialization based on spatial information for speaker diarization of meetingsJordi Luque, Carlos Segura, Javier Hernando. 383-386 [doi]

Effect of compressing the dynamic range of the power spectrum in modulation filtering based speech enhancementJames G. Lyons, Kuldip K. Paliwal. 387-390 [doi]

A long state vector kalman filter for speech enhancementStephen So, Kuldip K. Paliwal. 391-394 [doi]

Subspace based speech enhancement using Gaussian mixture modelAchintya Kundu, Saikat Chatterjee, T. V. Sreenivas. 395-398 [doi]

Generalized parametric spectral subtraction using weighted Euclidean distortionAmit Das, John H. L. Hansen. 399-402 [doi]

Sudden noise reduction based on GMM with noise power estimationNobuyuki Miyake, Tetsuya Takiguchi, Yasuo Ariki. 403-406 [doi]

Speech enhancement using a wiener denoising technique and musical noise reductionMd. Jahangir Alam, Sid-Ahmed Selouani, Douglas D. O Shaughnessy, Sofia Ben Jebara. 407-410 [doi]

Regularized non-negative matrix factorization with temporal dependencies for speech denoisingKevin W. Wilson, Bhiksha Raj, Paris Smaragdis. 411-414 [doi]

ICA-based MAP speech enhancement with multiple variable speech distribution modelsXin Zou, Peter Jancovic, Münevver Köküer, Martin J. Russell. 415-418 [doi]

Source separation based on binaural cues and source model constraintsRon J. Weiss, Michael I. Mandel, Daniel P. W. Ellis. 419-422 [doi]

Maximum kurtosis beamforming with the generalized sidelobe cancellerKen ichi Kumatani, John W. McDonough, Barbara Rauch, Philip N. Garner, Weifeng Li, John Dines. 423-426 [doi]

Noise robust speech dereverberation using constrained inverse filterKen ichi Furuya, Akitoshi Kataoka, Youichi Haneda. 427-430 [doi]

A dual microphone coherence based method for speech enhancement in headsetsMohsen Rahmani, Ahmad Akbari, Beghdad Ayad. 431-434 [doi]

Sound capture system and spatial filter for small devicesIvan Tashev, Slavy Mihov, Tyler Gleghorn, Alex Acero. 435-438 [doi]

An effective microphone array post-filter in arbitrary environmentsNing Cheng, Wenju Liu, Peng Li, Bo Xu. 439-442 [doi]

Localization of multiple sound sources based on inter-channel correlation using a distributed microphone systemKook Cho, Hajime Okumura, Takanobu Nishiura, Yoichi Yamashita. 443-446 [doi]

A frequency domain approach for speech enhancement with directionality using compact microphone arrayHeng Zhang, Qiang Fu, YongHong Yan. 447-450 [doi]

Question and answer database optimization using speech recognition resultsShota Takeuchi, Tobias Cincarek, Hiromichi Kawanami, Hiroshi Saruwatari, Kiyohiro Shikano. 451-454 [doi]

Development and evaluation of hands-free spoken dialogue system for railway station guidanceHiroshi Saruwatari, Yu Takahashi, Hiroyuki Sakai, Shota Takeuchi, Tobias Cincarek, Hiromichi Kawanami, Kiyohiro Shikano. 455-458 [doi]

Statistical shared plan-based dialog managementAmanda J. Stent, Srinivas Bangalore. 459-462 [doi]

When calls go wrong: how to detect problematic calls based on log-files and emotions?Ota Herm, Alexander Schmitt, Jackson Liscombe. 463-466 [doi]

Unsupervised learning of edit parameters for matching name variantsDaniel Gillick, Dilek Hakkani-Tür, Michael Levit. 467-470 [doi]

Detection of repetitions in spontaneous speech in dialogue sessionsMert Cevik, Fuliang Weng, Chin-Hui Lee. 471-474 [doi]

Automatic customer feedback processing: alarm detection in open question spoken messagesNathalie Camelin, Géraldine Damnati, Frédéric Béchet, Renato de Mori. 475-478 [doi]

Minimal training based semantic categorization in a voice activated question answering (VAQA) systemMithun Balakrishna, Marta Tatu, Dan I. Moldovan. 479-482 [doi]

User study of the Bayesian update of dialogue state approach to dialogue managementBlaise Thomson, Milica Gasic, Simon Keizer, François Mairesse, Jost Schatzmann, Kai Yu, Steve Young. 483-486 [doi]

Extensibility verification of robust domain selection against out-of-grammar utterances in multi-domain spoken dialogue systemSatoshi Ikeda, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno. 487-490 [doi]

Improving large scale alphanumeric string recognition using redundant informationEa-Ee Jan, Osamuyimen Stewart, Raymond Co, David Lubensky. 491-494 [doi]

SPRAAK: an open source SPeech recognition and automatic annotation kit Kris Demuynck, Jan Roelens, Dirk Van Compernolle, Patrick Wambacq. 495 [doi]

Preliminary evaluation of speech/sound recognition for telemedicine application in a real environmentMichel Vacher, Anthony Fleury, Jean-François Serignat, Norbert Noury, Hubert Glasson. 496-499 [doi]

Mobidic - a mobile dictation and notetaking applicationMarkku Turunen, Aleksi Melto, Anssi Kainulainen, Jaakko Hakulinen. 500-503 [doi]

Automatic speech recognition for scientific purposes - webASRThomas Hain, Asmaa El Hannani, Stuart N. Wrigley, Vincent Wan. 504-507 [doi]

Evaluation of a live broadcast news subtitling system for portugueseHugo Meinedo, Márcio Viveiros, João Paulo Neto. 508-511 [doi]

Recognizing and modelling regional varieties of SwedishJonas Beskow, Gösta Bruce, Laura Enflo, Björn Granström, Susanne Schötz. 512-515 [doi]

Vowel duration, compression and lengthening in stressed syllables in central and southern varieties of standard ItalianJohn Hajek, Mary Stevens. 516-519 [doi]

Acoustic cues for the perception of intonation in CantoneseJoan K. Y. Ma, Valter Ciocca, Tara L. Whitehill. 520-523 [doi]

Perception of dialectal prosodyAdrian Leemann, Beat Siebenhaar. 524-527 [doi]

Does the Mcgurk effect rely on processing time constraints?Christian Kroos, Ashlie Dreves. 528 [doi]

Exploring the Uncanny Valley Effect with talking headsTakaaki Kuratate, Kathryn Ayers, Jeesun Kim, Denis Burnham. 529 [doi]

How do the elderly talk to a natural language call routing system?Knut Kvale, Ragnhild Halvorsrud. 530-533 [doi]

Analysis of relationship between impression of human-to-human conversations and prosodic change and its modelingRyota Nishimura, Norihide Kitaoka, Seiichi Nakagawa. 534-537 [doi]

Utterance-level normalization for relative articulation rate analysisTuomo Saarni, Jussi Hakokari, Jouni Isoaho, Tapio Salakoski. 538-541 [doi]

Syntactic complexity induces explicit grounding in the Maptask corpusMartin I. Tietze, Vera Demberg, Johanna D. Moore. 542 [doi]

Do discourse cues facilitate recall in information presentation messages?Andi Winterboer, Johanna D. Moore, Fernanda Ferreira. 543 [doi]

Structured heterogeneity of English stress variantsNoriko Hattori. 544 [doi]

A method for automatically estimating F0 model parameters and a speech re-synthesis tool using F0 model and STRAIGHTShota Sato, Taro Kimura, Yasuo Horiuchi, Masafumi Nishida, Shingo Kuroiwa, Akira Ichikawa. 545-548 [doi]

Noise driven short-time phase spectrum compensation procedure for speech enhancementAnthony P. Stark, Kamil K. Wójcicki, James G. Lyons, Kuldip K. Paliwal. 549-552 [doi]

A phase-averaged model for the relationship between noisy speech, clean speech and noise in the log-mel domainFriedrich Faubel, John W. McDonough, Dietrich Klakow. 553-556 [doi]

Time and frequency dependent amplification for speech intelligibility enhancement in noisy environmentsHenk Brouckxon, Werner Verhelst, Bart De Schuymer. 557-560 [doi]

A wavelet based speech enhancement method using noise classification and shapingMehdi Mohammadi, Behzad Zamani, Babak Nasersharif, Mohsen Rahmani, Ahmad Akbari. 561-564 [doi]

Speech enhancement based on novel two-step a priori SNR estimatorsMd. Jahangir Alam, Douglas D. O Shaughnessy, Sid-Ahmed Selouani. 565-568 [doi]

A speech enhancement approach using piecewise linear approximation of an explicit model of environmental distortionsJun Du, Qiang Huo. 569-572 [doi]

Articulatory control of HMM-based parametric speech synthesis driven by phonetic knowledgeZhen-Hua Ling, Korin Richmond, Junichi Yamagishi, Ren-Hua Wang. 573-576 [doi]

Minimum generation error training with direct log spectral distortion on LSPs for HMM-based speech synthesisYi-Jian Wu, Keiichi Tokuda. 577-580 [doi]

Robustness of HMM-based speech synthesisJunichi Yamagishi, Zhen-Hua Ling, Simon King. 581-584 [doi]

Improving preselection in unit selection synthesisAlistair Conkie, Ann K. Syrdal, Yeon-Jun Kim, Marc C. Beutnagel. 585-588 [doi]

Efficient join cost computation for unit selection based TTS systemsFeng Ding, Jani Nurminen, Jilei Tian. 589-592 [doi]

A phonetic assessment of cross-language voice conversionKayoko Yanagisawa, Mark Huckvale. 593-596 [doi]

Abandoning emotion classes - towards continuous emotion recognition with modelling of long-range dependenciesMartin Wöllmer, Florian Eyben, Stephan Reiter, Björn Schuller, Cate Cox, Ellen Douglas-Cowie, Roddy Cowie. 597-600 [doi]

Patterns, prototypes, performance: classifying emotional user statesDino Seppi, Anton Batliner, Björn Schuller, Stefan Steidl, Thurid Vogt, Johannes Wagner, Laurence Devillers, Laurence Vidrascu, Noam Amir, Vered Aharonson. 601-604 [doi]

Recognition of stress in speech using wavelet analysis and Teager energy operatorLing He, Margaret Lech, Sheeraz Memon, Nicholas Allen. 605-608 [doi]

Effects of vocal effort and speaking style on text-independent speaker verificationElizabeth Shriberg, Martin Graciarena, Harry Bratt, Andreas Kathol, Sachin S. Kajarekar, Huda Jameel, Colleen Richey, Fred Goodman. 609-612 [doi]

Robustness of prosodic features to voice imitationMireia Farrús, Michael Wagner, Jan Anguita, Javier Hernando. 613-616 [doi]

Phonetic and speaker variations in automatic emotion classificationVidhyasaharan Sethu, Eliathamby Ambikairajah, Julien Epps. 617-620 [doi]

Infants' native and nonnative tone perceptionKaren Mattock. 621 [doi]

Language experience dependent plasticity for pitch representation in the human brainstemAnanthanarayan Krishnan, Jackson T. Gandour, Jayaganesh Swaminathan. 622 [doi]

Development of tone perception and tone production in Cantonese-learning children aged 2 to 5 yearsValter Ciocca, Vivian W.-K. Ip. 623 [doi]

Tone hyperarticulation in Cantonese infant-directed speechNan Xu, Denis Burnham. 624 [doi]

Influences on tone in Sepedi, a southern Bantu languageSabine Zerbian, Etienne Barnard. 625 [doi]

An acoustic-phonetic comparative analysis of Osaka and Kagoshima Japanese tonal phenomenaShunichi Ishihara. 626-629 [doi]

Modulation spectrogram features for improved speaker diarizationOriol Vinyals, Gerald Friedland. 630-633 [doi]

Spectro-temporal features for robust far-field speaker identificationTiago H. Falk, Wai-Yip Chan. 634-637 [doi]

Long-term spectro-temporal information for improved automatic speech emotion classificationSiqing Wu, Tiago H. Falk, Wai-Yip Chan. 638-641 [doi]

A comparative study on AM and FM featuresYotaro Kubo, Shigeki Okawa, Akira Kurematsu, Katsuhiko Shirai. 642-645 [doi]

Dimensionality reduction of modulation frequency features for speech discriminationMaria E. Markaki, Yannis Stylianou. 646-649 [doi]

Spectral envelope recovery beyond the nyquist limit for high-quality manipulation of speech soundsHideki Kawahara, Masanori Morise, Hideki Banno, Toru Takahashi, Ryuichi Nisimura, Toshio Irino. 650-653 [doi]

Adaptive-order fractional Fourier transform features for speech recognitionHui Yin, Xiang Xie, Jingming Kuang. 654-657 [doi]

Robust front end processing for speech recognition in reverberant environments: utilization of speech characteristicsRico Petrick, Xugang Lu, Masashi Unoki, Masato Akagi, Rüdiger Hoffmann. 658-661 [doi]

High-quality analysis/synthesis method based on temporal decomposition for speech modificationBinh Phu Nguyen, Takeshi Shibata, Masato Akagi. 662-665 [doi]

Improved frame loss recovery using closed-loop estimation of very low bit rate side informationPhilippe Gournay. 666-669 [doi]

Predictability of STRFs in auditory cortex neurons depends on stimulus classMax F. K. Happel, Simon Müller, Jörn Anemüller, Frank W. Ohl. 670 [doi]

Higher layer coding of non-speech like signals using factorial pulse codebookUdar Mittal, James P. Ashley, Jonathan Gibbs. 671-674 [doi]

Spectral noise shaping: improvements in speech/audio codec based on linear prediction in spectral domainSriram Ganapathy, Petr Motlícek, Hynek Hermansky, Harinath Garudadri. 675-678 [doi]

Introducing the compression wave cochlear amplifierMatthew R. Flax, W. Harvey Holmes. 679-682 [doi]

Goldman-hodgkin-katz cochlear hair cell models - a foundation for nonlinear cochlear mechanicsMatthew R. Flax, W. Harvey Holmes. 683-686 [doi]

A 8.32 kb/s embedded wideband speech coding candidate for ITU-t EV-VBR standardizationChangchun Bao, Hai Ting Li, Ze-xin Liu, Rui Fan, Heng Zhu, Mao-shen Jia, Rui Li. 687-690 [doi]

Decision tree based frame mode selection for AMR-WB+Jong-Kyu Kim, Seung Seop Park, Chang Woo Han, Nam Soo Kim. 695-698 [doi]

Assessment of objective quality measures for speech intelligibilityWei M. Liu, Keith A. Jellyman, Nicholas W. D. Evans, John S. D. Mason. 699-702 [doi]

Assessment of the speech-quality dimension noisiness for the instrumental estimation and analysis of telephone-band speech qualityKirstin Scholz, Christine Kühnel, Marcel Wältermann, Sebastian Möller, Ulrich Heute. 703-706 [doi]

Intelligibility evaluation of Ramsey-derived interleavers for internet voice streaming with the iLBC codecAngel M. Gomez, José L. Carmona, Antonio M. Peinado, Victoria E. Sánchez, José A. González. 707-710 [doi]

Language identification on code-switching utterances using multiple cuesDau-Cheng Lyu, Ren-Yuan Lyu. 711-714 [doi]

Target-oriented phone selection from universal phone set for spoken language recognitionRong Tong, Bin Ma, Haizhou Li, Engsiong Chng. 715-718 [doi]

The MITLL NIST LRE 2007 language recognition systemPedro A. Torres-Carrasquillo, Elliot Singer, William M. Campbell, Terry P. Gleason, Alan McCree, Douglas A. Reynolds, Fred Richardson, Wade Shen, Douglas E. Sturim. 719-722 [doi]

Eigen-channel compensation and discriminatively trained Gaussian mixture models for dialect and accent recognitionPedro A. Torres-Carrasquillo, Douglas E. Sturim, Douglas A. Reynolds, Alan McCree. 723-726 [doi]

Anchor-model fusion for language recognitionIgnacio Lopez-Moreno, Daniel Ramos, Joaquin Gonzalez-Rodriguez, Doroteo Torre Toledano. 727-730 [doi]

Introducing a FM based feature to hierarchical language identificationBo Yin, Tharmarajah Thiruvaran, Eliathamby Ambikairajah, Fang Chen. 731-734 [doi]

Dialect classification via discriminative trainingYun Lei, John H. L. Hansen. 735-738 [doi]

BUT language recognition system for NIST 2007 evaluationsPavel Matejka, Lukas Burget, Ondrej Glembek, Petr Schwarz, Valiantsina Hubeika, Michal Fapso, Tomas Mikolov, Oldrich Plchot, Jan Cernocký. 739-742 [doi]

Advances in phonotactic language recognitionOndrej Glembek, Pavel Matejka, Lukas Burget, Tomas Mikolov. 743-746 [doi]

Dialect separation assessment using log-likelihood score distributionsMahnoosh Mehrabani, John H. L. Hansen. 747-750 [doi]

Study on unique pharyngeal and uvular consonants in foreign accented ArabicYousef Ajami Alotaibi, Khondaker Abdullah-Al-Mamun, Muhammad Ghulam. 751-754 [doi]

Automatic accent classification using ensemble methodsFukun Bi, Jian Yang, Dan Xu. 755-758 [doi]

Foreign accent identification based on prosodic parametersMarina Piat, Dominique Fohr, Irina Illina. 759-762 [doi]

Dialect recognition using adapted phonetic modelsWade Shen, Nancy F. Chen, Douglas A. Reynolds. 763-766 [doi]

Beyond frame independence: parametric modelling of time duration in speaker and language recognitionAlan McCree, Fred Richardson, Elliot Singer, Douglas A. Reynolds. 767-770 [doi]

Testing a large corpus of natural standard Arabic for rhythm classLiz Dockendorf, Dalal Almubayei, Matthew Benton. 771 [doi]

A comparison of two acoustic measurement approaches to the rhythm continuum of natural Chinese and English speechMatthew Benton, Liz Dockendorf. 772-775 [doi]

A study of pitch patterns of Japanese English analyzed via comparative linguistic features of English and JapaneseTomoko Nariai, Kazuyo Tanaka. 776-779 [doi]

A corpus-based prosodic study of Alsatian, Belgian and Swiss FrenchCécile Woehrling, Philippe Boula de Mareüil, Martine Adda-Decker, Lori Lamel. 780-783 [doi]

Prosodic position effects and function words in English: a pilot studyMitsuhiro Nakamura. 784 [doi]

How useful are polynomials for analyzing intonation?Laura E. de Ruiter. 785-788 [doi]

Adaptive filter based prosody modification approachQingcai Chen, Shusen Zhou, Dandan Wang, Xiaohong Yang. 789-792 [doi]

Speech/laughter classification in meeting audioSwe Zin Kalayar Khine, Tin Lay Nwe, Haizhou Li. 793-796 [doi]

Getting the last laugh: automatic laughter segmentation in meetingsMary Tai Knox, Nelson Morgan, Nikki Mirghafori. 797-800 [doi]

The influence of audio presentation style on multitasking during teleconferencesStuart N. Wrigley, Simon Tucker, Guy J. Brown, Steve Whittaker. 801-804 [doi]

Balancing spoken content adaptation and unit length in the recognition of emotion and interestBogdan Vlasenko, Björn Schuller, Kinfe Tadesse Mengistu, Gerhard Rigoll, Andreas Wendemuth. 805-808 [doi]

Nonverbal responses to social inclusion and exclusionEmiel Krahmer, Juliette Schaafsma, Marc Swerts, Ad Vingerhoets. 809-812 [doi]

Acoustic analysis of imitated voice produced by a professional impersonatorTatsuya Kitamura. 813-816 [doi]

Detection of speech under physical stress: model development, sensor selection, and feature fusionSanjay A. Patil, John H. L. Hansen. 817-820 [doi]

Improving Japanese language models using POS informationLangzhou Chen, Hisayoshi Nagae, Matthew N. Stuttle. 821-824 [doi]

Discriminative n-gram language modeling for TurkishEbru Arisoy, Brian Roark, Izhak Shafran, Murat Saraclar. 825-828 [doi]

Rich morphology based n-gram language models for ArabicAhmad Emami, Imed Zitouni, Lidia Mangu. 829-832 [doi]

Unsupervised language model adaptation based on topic and role information in multiparty meetingsSongfang Huang, Steve Renals. 833-836 [doi]

Context dependent language model adaptationXunying Liu, Mark J. F. Gales, Philip C. Woodland. 837-840 [doi]

Iterative language model estimation: efficient data structure & algorithmsBo-June (Paul) Hsu, James R. Glass. 841-844 [doi]

Phone-based cepstral polynomial SVM system for speaker recognitionSachin S. Kajarekar. 845-848 [doi]

Using MAP estimation of feature transformation for speaker recognitionDonglai Zhu, Bin Ma, Haizhou Li. 849-852 [doi]

Factor analysis subspace estimation for speaker verification with short utterancesRobbie Vogt, Brendan Baker, Sridha Sridharan. 853-856 [doi]

Combining continuous progressive model adaptation and factor analysis for speaker verificationMitchell McLaren, Driss Matrouf, Robbie Vogt, Jean-François Bonastre. 857-860 [doi]

Adaptive decision tree-based phone cluster models for speaker clusteringChia-Hsin Hsieh, Chung-Hsien Wu, Han-Ping Shen. 861-864 [doi]

Speaker recognition in two-wire test sessionsHagai Aronowitz, Yosef A. Solewicz. 865-868 [doi]

The effect of position on the realization of second occurrence focusJason B. Bishop. 869-872 [doi]

Effects of intonational phrase boundaries on pitch-accented syllables in american EnglishYen-Liang Shue, Stefanie Shattuck-Hufnagel, Markus Iseli, Sun-Ah Jun, Nanette Veilleux, Abeer Alwan. 873-876 [doi]

Examining pitch-accent variability from an exemplar-theoretic perspectiveMichael Walsh, Katrin Schweitzer, Bernd Möbius, Hinrich Schütze. 877-880 [doi]

Correlation of utterance length and segmental duration in Finnish is questionableJussi Hakokari, Tuomo Saarni, Jouni Isoaho, Tapio Salakoski. 881-884 [doi]

Different roles of pitch and duration in distinguishing word stress in EnglishJiahong Yuan, Stephen Isard, Mark Liberman. 885 [doi]

Cross-dialect Irish prosody: linguistic constraints on Fujisaki modellingMaria O Reilly, Ailbhe Ní Chasaide, Christer Gobl. 886-889 [doi]

Introducing temporal asymmetries in feature extraction for automatic speech recognitionGarimella S. V. S. Sivaram, Hynek Hermansky. 890-893 [doi]

A closer look on hierarchical spectro-temporal features (HIST)Martin Heckmann, Xavier Domont, Frank Joublin, Christian Goerick. 894-897 [doi]

Multi-stream spectro-temporal features for robust speech recognitionSherry Y. Zhao, Nelson Morgan. 898-901 [doi]

The value of auditory offset adaptation and appropriate acoustic modelingHuan Wang, David Gelbart, Hans-Günter Hirsch, Werner Hemmert. 902-905 [doi]

Optimization and evaluation of Gabor feature sets for ASRBernd T. Meyer, Birger Kollmeier. 906-909 [doi]

A shrinkage estimator for speech recognition with full covariance HMMsPeter Bell, Simon King. 910-913 [doi]

Covariance updates for discriminative training by constrained line searchPeter Bell, Simon King. 914 [doi]

Min-max discriminative training of decoding parameters using iterative linear programmingBrian Mak, Tom Ko. 915-918 [doi]

Discriminative training for complementariness in system combinationDaniel Willett, Chuang He. 919 [doi]

Penalty function maximization for large margin HMM trainingGeorge Saon, Daniel Povey. 920-923 [doi]

Implicit state-tying for support vector machines based speech recognitionDaniel Bolaños, Wayne Ward. 924-927 [doi]

Using KL-based acoustic models in a large vocabulary recognition taskGuillermo Aradilla, Hervé Bourlard, Mathew Magimai-Doss. 928-931 [doi]

Acoustic modeling based on model structure annealing for speech recognitionSayaka Shiota, Kei Hashimoto, Heiga Zen, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda. 932-935 [doi]

Bayesian context clustering using cross valid prior distribution for HMM-based speech recognitionKei Hashimoto, Heiga Zen, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda. 936-939 [doi]

Speech recognition using soft decision treesJitendra Ajmera, Masami Akamine. 940-943 [doi]

GPU-accelerated Gaussian clustering for fMPE discriminative trainingYu Shi, Frank Seide, Frank K. Soong. 944-947 [doi]

Discriminative training using the trusted expectation maximizationYasser Hifny, Yuqing Gao. 948-951 [doi]

Maximum mutual information estimation with unlabeled data for phonetic classificationJui-Ting Huang, Mark Hasegawa-Johnson. 952-955 [doi]

Maximum accept and reject (MARS) training of HMM-GMM speech recognition systemsVivek Tyagi. 956-959 [doi]

Nonlinear mixture autoregressive hidden Markov models for speech recognitionSundar Srinivasan, Tao Ma, Daniel May, Georgios Y. Lazarou, Joseph Picone. 960-963 [doi]

GPU accelerated acoustic likelihood computationsPatrick Cardinal, Pierre Dumouchel, Gilles Boulianne, Michel Comeau. 964-967 [doi]

CENSREC-4: development of evaluation framework for distant-talking speech recognition under reverberant environmentsMasato Nakayama, Takanobu Nishiura, Yuki Denda, Norihide Kitaoka, Kazumasa Yamamoto, Takeshi Yamada, Satoru Tsuge, Chiyomi Miyajima, Masakiyo Fujimoto, Tetsuya Takiguchi, Satoshi Tamura, Tetsuji Ogawa, Shigeki Matsuda, Shingo Kuroiwa, Kazuya Takeda, Satoshi Nakamura. 968-971 [doi]

In-car speech recognition using model-based wiener filter and multi-condition trainingMasanori Tsujikawa, Takayuki Arakawa, Ryosuke Isotani. 972-975 [doi]

Adaptive beamforming and soft missing data decoding for robust speech recognition in reverberant environmentsMarco Kühne, Roberto Togneri, Sven Nordholm. 976-979 [doi]

Spectral subtraction in likelihood-maximizing framework for robust speech recognitionBagher BabaAli, Hossein Sameti, Mehran Safayani. 980-983 [doi]

Front-end for far-field speech recognition based on frequency domain linear predictionSriram Ganapathy, Samuel Thomas, Hynek Hermansky. 984-987 [doi]

Mask estimation incorporating time-frequency trajectories for a CASA-based ASR front-endJi Hun Park, Jae Sam Yoon, Hong Kook Kim. 988-991 [doi]

Soft missing-feature mask generation for simultaneous speech recognition system in robotsToru Takahashi, Shun ichi Yamamoto, Kazuhiro Nakadai, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno. 992-995 [doi]

A posterior approach for microphone array based speech recognitionDong Wang, Ivan Himawan, Joe Frankel, Simon King. 996-999 [doi]

Analysis of physiologically-motivated signal processing for robust speech recognitionYu-Hsiang Bosco Chiu, Richard M. Stern. 1000-1003 [doi]

Evaluation of modulation spectrum equalization techniques for large vocabulary robust speech recognitionLiang-Che Sun, Chang-Wen Hsu, Lin-Shan Lee. 1004-1007 [doi]

Confusion-based entropy-weighted decoding for robust speech recognitionYi Chen, Chia-Yu Wan, Lin-Shan Lee. 1008-1011 [doi]

Cepstral domain voice activity detection for improved noise modeling in MMSE feature enhancement for ASRSvein Gunnar Pettersen, Magne Hallstein Johnsen. 1012-1015 [doi]

Unsupervised re-scoring of observation probability based on maximum entropy criterion by using confidence measure with telephone speechCarlos Molina, Néstor Becerra Yoma, Fernando Huenupán, Claudio Garretón. 1016-1019 [doi]

Within-class feature normalization for robust speech recognitionYuan-Fu Liao, Chi-Hui Hsu, Chi-Min Yang, Jeng-Shien Lin, Sen-Chia Chang. 1020-1023 [doi]

A posteriori SNR weighted energy based variable frame rate analysis for speech recognitionZheng-Hua Tan, Børge Lindberg. 1024-1027 [doi]

Silence feature normalization for robust speech recognition in additive noise environmentsChieh-cheng Wang, Chi-an Pan, Jeih-Weih Hung. 1028-1031 [doi]

Blind dereverberation based on CMN and spectral subtraction by multi-channel LMS algorithmLongbiao Wang, Seiichi Nakagawa, Norihide Kitaoka. 1032-1035 [doi]

Amplitude and amplitude variation of emotional speechHartmut R. Pfitzinger, Christian Kaernbach. 1036-1039 [doi]

Babble speech: acoustic and perceptual variabilityNitish Krishnamurthy, Ayako Ikeno, John H. L. Hansen. 1040-1043 [doi]

On the properties of a time-varying quasi-harmonic model of speechYannis Pantazis, Olivier Rosec, Yannis Stylianou. 1044-1047 [doi]

Extraction and tracking of formant response jitter in the cochlea for objective prediction of SB/SF DAM attributesWenliang Lu, D. Sen. 1048-1051 [doi]

Consonant discrimination of degraded speech using an efferent-inspired closed-loop cochlear modelDavid P. Messing, Lorraine Delhorne, Ed Bruckert, Louis D. Braida, Oded Ghitza. 1052-1055 [doi]

On the development of variable length Teager energy operator (VTEO)Vikrant Tomar, Hemant A. Patil. 1056-1059 [doi]

Metric learning for unsupervised phoneme segmentationYu Qiao, Nobuaki Minematsu. 1060-1063 [doi]

Combining task-dependent information with auditory attention cues for prominence detection in speechOzlem Kalinli, Shrikanth S. Narayanan. 1064-1067 [doi]

Probabilistic feature mapping based on trajectory HMMsHeiga Zen, Yoshihiko Nankaku, Keiichi Tokuda. 1068-1071 [doi]

Simultaneous conversion of duration and spectrum based on statistical models including time-sequence matchingKaori Yutani, Yosuke Uto, Yoshihiko Nankaku, Tomoki Toda, Keiichi Tokuda. 1072-1075 [doi]

Low-delay voice conversion based on maximum likelihood estimation of spectral parameter trajectoryTakashi Muramatsu, Yamato Ohtani, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano. 1076-1079 [doi]

An improved one-to-many eigenvoice conversion systemYamato Ohtani, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano. 1080-1083 [doi]

Study on manipulation method of voice quality based on the vocal tract area functionYoshinori Uchimura, Hideki Banno, Fumitada Itakura, Hideki Kawahara. 1084-1087 [doi]

Incorporating durational modification in voice transformationArthur R. Toth, Alan W. Black. 1088-1091 [doi]

Non-segmental duration feature extraction for prosodic classificationAmy Dashiell, Brian Hutchinson, Anna Margolis, Mari Ostendorf. 1092-1095 [doi]

An ERP study on categorical perception of lexical tones and nonspeech pitchesHongying Zheng, William S.-Y. Wang. 1096 [doi]

The role of Japanese pitch accent in spoken-word recognition: evidence from middle-aged accentless dialect listenersTakashi Otake, Marii Higuchi. 1097-1100 [doi]

Mandarin Chinese tone nucleus detection with landmarksSiwei Wang, Gina-Anne Levow. 1101-1104 [doi]

A comparative study on dissyllabic stress patterns of Mandarin and CantoneseWeixiang Hu, Jin Jian, Aijun Li, Xia Wang. 1105-1108 [doi]

Three-sectional-staff characterization of Cantonese level tonesRerrario Shui-Ching Ho, Yoshinori Sagisaka. 1109-1112 [doi]

A seven-tone dialect in southern China with falling-rising-falling contour: a linguistic acoustic analysisXiaonong Zhu, Caicai Zhang. 1113-1115 [doi]

Pitch target analysis of Thai tones using quantitative target approximation model and unsupervised clusteringSantitham Prom-on. 1116-1119 [doi]

Do English speakers assimilate Mandarin tones to English prosodic categories?Connie K. So, Catherine T. Best. 1120 [doi]

Evidence of a near-merger in western sydney australian English vowelsRikke L. Bundgaard-Nielsen, Catherine T. Best, Michael D. Tyler, Christian Kroos. 1121 [doi]

Central vowels in Arrernte: metrical prominence and pitch accentMarija Tabain, Kristine Rickard, Gavan Breen, Veronica Dobson. 1122 [doi]

Pausing and phrase length in two australian languagesBella Ross. 1123 [doi]

Positional effects on the characterization of ejectives in Waima aMary Stevens, John Hajek. 1124-1127 [doi]

A Niuean variant of New Zealand English?Donna Starks, Laura Thompson, Catherine I. Watson. 1128 [doi]

Phonetic confusion analysis and robust phone set generation for Shanghai-accented Mandarin speech recognitionGuo-Hong Ding. 1129-1132 [doi]

Prosody for Mandarin speech recognition: a comparative study of read and spontaneous speechYu Ting Yeung, Yao Qian, Tan Lee, Frank K. Soong. 1133-1136 [doi]

Improved large vocabulary Mandarin speech recognition by selectively using tone information with a two-stage prosodic modelLi-Wei Cheng, Lin-Shan Lee. 1137-1140 [doi]

Mandarin connected digits recognition for whispered speechTingting Ru, Xiang Xie, Hui Yin, Jingming Kuang. 1141-1144 [doi]

Recognizing named entities in spoken Chinese dialogues with a character-level maximum entropy taggerChangchun Bao, Weiqun Xu, YongHong Yan. 1145-1148 [doi]

A novel approach in continuous speech recognition for Vietnamese, an isolating tonal languageHong-Quang Nguyen, Pascal Nocera, Eric Castelli, Van Loan Trinh. 1149-1152 [doi]

Evaluating semantic-level confidence scores with multiple hypothesesBlaise Thomson, Kai Yu, Milica Gasic, Simon Keizer, François Mairesse, Jost Schatzmann, Steve Young. 1153-1156 [doi]

Structured models for joint decoding of repeated utterancesGeoffrey Zweig, Dan Bohus, Xiao Li, Patrick Nguyen. 1157-1160 [doi]

A Bayesian approach to semantic composition for spoken language interpretationMarie-Jean Meurs, Fabrice Lefevre, Renato de Mori. 1161-1164 [doi]

Accommodating explicit user expressions of uncertainty in voice search or something like thatTim Paek, Yun-Cheng Ju. 1165-1168 [doi]

Effects of user modeling on POMDP-based dialogue systemsDongHo Kim, Hyeong Seop Sim, Kee-Eung Kim, Jin Hyung Kim, Hyunjeong Kim, Joo Won Sung. 1169-1172 [doi]

The best of both worlds: unifying conventional dialog systems and POMDPsJason D. Williams. 1173-1176 [doi]

The assimilation of L2 australian English vowels to L1 Japanese vowel categories: vocabulary size mattersRikke L. Bundgaard-Nielsen, Catherine T. Best, Michael D. Tyler. 1177 [doi]

Vowel epenthesis, acoustics and phonology patterns in Moroccan ArabicAzra N. Ali, Mohamed Lahrouchi, Michael Ingleby. 1178-1181 [doi]

Estimation of vocal tract area function for Mandarin vowel sequences using MRIGaowu Wang, Jianwu Dang, Jiangping Kong. 1182-1185 [doi]

The effect of first language (L1) dialects on the identification of Vietnamese word-final stopsKimiko Tsukada, Thu T. A. Nguyen. 1186-1189 [doi]

Perceptual evidence of modern Greek voiced stops as phonological categoriesMark Antoniou, Catherine T. Best, Michael D. Tyler. 1190 [doi]

The effect of auditory and visual degradation on audiovisual perception of native and non-native speakersValérie Hazan, Enid Li. 1191-1194 [doi]

Quantitative prosodic analysis of spontaneous speechHansjörg Mixdorff. 1195 [doi]

The effect of cognitive load on disfluencies during in-vehicle spoken dialogueAnders Lindström, Jessica Villing, Staffan Larsson, Alexander Seward, Nina Åberg, Cecilia Holtelius. 1196-1199 [doi]

Discourse prosody context - global F0 and tempo modulationsChiu-yu Tseng, Zhao-yu Su. 1200-1203 [doi]

A method for automatic and dynamic estimation of discourse genre typology with prosodic featuresNicolas Obin, Anne Lacheret-Dujour, Christophe Veaux, Xavier Rodet, Anne-Catherine Simon. 1204-1207 [doi]

The meanings carried by interjections in spontaneous speechCarlos Toshinori Ishi, Hiroshi Ishiguro, Norihiro Hagita. 1208-1211 [doi]

Speech interaction with an emotional robotic dogChristian Martyn Jones, Andrew Deeming. 1212-1215 [doi]

Control of prosodic focus in corpus-based generation of fundamental frequency based on the generation process modelKeiko Ochi, Keikichi Hirose, Nobuaki Minematsu. 1216 [doi]

Discrimininative training of narrow band - wide band adapted systems for meeting recognitionMartin Karafiát, Lukas Burget, Thomas Hain, Jan Cernocký. 1217-1220 [doi]

A fast speaker adaptation method using aspect modelSeongjun Hahm, Akinori Ito, Shozo Makino, Motoyuki Suzuki. 1221-1224 [doi]

Probabilistic latent speaker training for large vocabulary speech recognitionDan Su, Xihong Wu, Huisheng Chi. 1225-1228 [doi]

Improvement of eigenvoice-based speaker adaptation by parameter space clusteringShutaro Tanji, Koichi Shinoda, Sadaoki Furui, Antonio Ortega. 1229-1232 [doi]

Study of jacobian compensation using linear transformation of conventional MFCC for VTLND. Rama Sanand, Srinivasan Umesh. 1233-1236 [doi]

Adaptive HMM topology for speech recognitionChuan-Wei Ting, Kuo-Yuan Lee, Jen-Tzung Chien. 1237-1240 [doi]

Minimum phone error discriminative training for Mandarin Chinese speaker adaptationLiang-Yu Chen, Chun-Jen Lee, Jyh-Shing Roger Jang. 1241-1244 [doi]

Fast speaker adaptive training for speech recognitionDaniel Povey, Hong-Kwang Jeff Kuo, Hagen Soltau. 1245-1248 [doi]

Eigen-MLLR environment/speaker compensation for robust speech recognitionYuan-Fu Liao, Hung-Hsiang Fang, Chi-Hui Hsu. 1249-1252 [doi]

Parameter clustering and sharing in variable-parameter HMMs for noise robust speech recognitionDong Yu, Li Deng, Yifan Gong, Alex Acero. 1253-1256 [doi]

A feature compensation approach using high-order vector taylor series approximation of an explicit distortion model for noisy speech recognitionJun Du, Qiang Huo. 1257-1260 [doi]

N-best based stochastic mapping on stereo HMM for noise robust speech recognitionXiaodong Cui, Mohamed Afify, Yuqing Gao. 1261-1264 [doi]

Improving the ensemble speaker and speaking environment modeling approach by enhancing the precision of the online estimation processYu Tsao, Chin-Hui Lee. 1265-1268 [doi]

Combining noise compensation and missing-feature decoding for large vocabulary speech recognition in noiseJianhua Lu, Ji Ming, Roger Woods. 1269-1272 [doi]

Joint Bayesian predictive classification and parallel model combination with prior scaling for robust ASRSvein Gunnar Pettersen. 1273-1276 [doi]

Environment mismatch compensation using average eigenspace for speech recognitionAbhishek Kumar, John H. L. Hansen. 1277-1280 [doi]

Monte Carlo model-space noise adaptation for speech recognitionDaniel Povey, Brian Kingsbury. 1281-1284 [doi]

A speechiness measure to improve speech decoding in the presence of other sound sourcesNing Ma, Phil Green. 1285-1288 [doi]

Feature vector normalization with combined standard and throat microphones for robust ASRLuis Buera, Antonio Miguel, Oscar Saz, Alfonso Ortega, Eduardo Lleida. 1289-1292 [doi]

Phone-duration-dependent long-term dynamic features for a stochastic model-based voice activity detectionTakashi Fukuda, Osamu Ichikawa, Masafumi Nishimura. 1293-1296 [doi]

An on-line adaptation technique for emotional speech recognition using style estimation with multiple-regression HMMYusuke Ijima, Makoto Tachibana, Takashi Nose, Takao Kobayashi. 1297-1300 [doi]

HMM adaptation using statistical linear approximation for robust automatic speech recognitionMichael Berkovitch, Ilan D. Shallom. 1301-1304 [doi]

Beyond linear transforms: efficient non-linear dynamic adaptation for noise robust speech recognitionSteven J. Rennie, Pierre L. Dognin. 1305-1308 [doi]

Rapid unsupervised speaker adaptation robust in reverberant environment conditionsRandy Gomez, Jani Even, Kiyohiro Shikano. 1309-1312 [doi]

Speaker identification for whispered speech based on frequency warping and score competitionXing Fan, John H. L. Hansen. 1313-1316 [doi]

Experimental evaluation of multi-band position-pitch estimation (m-popi) algorithm for multi-speaker localizationTania Habib, Lukas Ottowitz, Marián Képesi. 1317-1320 [doi]

Features for automatic detection of voice bars in continuous speechN. Dhananjaya, S. Rajendran, B. Yegnanarayana. 1321-1324 [doi]

Speaker orientation estimation based on hybridation of GCC-PHAT and HLBRCarlos Segura, Alberto Abad, Javier Hernando, Climent Nadeu. 1325-1328 [doi]

Parallel and hierarchical speech feature classification using frame and segment-based methodsJun Hou, Lawrence R. Rabiner, Sorin Dusan. 1329-1332 [doi]

Automatically learning speaker-independent acoustic subword unitsBalakrishnan Varadarajan, Sanjeev Khudanpur. 1333-1336 [doi]

Human-like ears versus two-microphone array, which works better for speaker identification?Waleed H. Abdulla, Yushi Zhang. 1337-1340 [doi]

Is a speech recognizer useful for characteristic analysis of classroom lecture speech?Kenji Kobayashi, Mitsuhiro Somiya, Hiromitsu Nishizaki, Yoshihiro Sekiguchi. 1341-1344 [doi]

An intuitive class discriminability measure for feature selection in a speech recognition systemLadan Golipour, Douglas D. O Shaughnessy. 1345-1348 [doi]

f-divergence is a generalized invariant measure between distributionsYu Qiao, Nobuaki Minematsu. 1349-1352 [doi]

Sparse linear predictors for speech processingDaniele Giacobello, Mads Græsbøll Christensen, Joachim Dahl, Søren Holdt Jensen, Marc Moonen. 1353-1356 [doi]

Frequency-domain parameter estimations for binary masked signalsJohan Xi Zhang, Mads Græsbøll Christensen, Joachim Dahl, Søren Holdt Jensen, Marc Moonen. 1357-1360 [doi]

Decomposition of rotational distortion caused by VTL difference using eigenvalues of its transformation matrixDaisuke Saito, Nobuaki Minematsu, Keikichi Hirose. 1361-1364 [doi]

Region-based vocal tract length normalization for ASRMichail G. Maragakis, Alexandros Potamianos. 1365-1368 [doi]

Speaker verification with non-audible murmur segments by combining global alignment kernel and penalized logistic regression machineHideki Okamoto, Tomoko Matsui, Hiromichi Kawanami, Hiroshi Saruwatari, Kiyohiro Shikano. 1369-1372 [doi]

Analysis of subspace within-class covariance normalization for SVM-based speaker verificationLiang Lu, Yuan Dong, Xianyu Zhao, Jian Zhao, Chengyu Dong, Haila Wang. 1373-1376 [doi]

Comparison of input and feature space nonlinear kernel nuisance attribute projections for speaker verificationXianyu Zhao, Yuan Dong, Jian Zhao, Liang Lu, Jiqing Liu, Haila Wang. 1377-1380 [doi]

A generalised derivative kernel for speaker verificationChris Longworth, Mark J. F. Gales. 1381-1384 [doi]

Modeling prior belief for speaker verification SVM systemsLuciana Ferrer. 1385-1388 [doi]

Convergence between SVM-based and distance-based paradigms for speaker recognitionDelphine Charlet, Xianyu Zhao, Yuan Dong. 1389-1392 [doi]

High-level speaker verification via articulatory-feature based sequence kernels and SVMShi-Xiong Zhang, Man-Wai Mak. 1393-1396 [doi]

Characterizing speech utterances for speaker verification with sequence kernel SVMKong-Aik Lee, Changhuai You, Haizhou Li, Tomi Kinnunen, Donglai Zhu. 1397-1400 [doi]

Development of the primary CRIM system for the NIST 2008 speaker recognition evaluationPatrick Kenny, Najim Dehak, Pierre Ouellet, Vishwa Gupta, Pierre Dumouchel. 1401-1404 [doi]

Making confident speaker verification decisions with minimal speechRobbie Vogt, Sridha Sridharan, Michael Mason. 1405-1408 [doi]

Parallelized factor analysis and feature normalization for automatic speaker verificationJun Luo, Cheung Chi Leung, Marc Ferras, Claude Barras. 1409-1412 [doi]

Intersession variability in speaker recognition: a behind the scene analysisDaniel Garcia-Romero, Carol Y. Espy-Wilson. 1413-1416 [doi]

Speaker recognition based on variational Bayesian methodTatsuya Ito, Kei Hashimoto, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda. 1417-1420 [doi]

Factor analysis multi-session training constraint in session compensation for speaker verificationDriss Matrouf, Jean-François Bonastre, Salah Eddine Mezaache. 1421-1424 [doi]

The role of delta features in speaker verificationYing Liu, Martin J. Russell, Michael J. Carey 0002. 1425-1428 [doi]

Investigating morphological decomposition for transcription of Arabic broadcast news and broadcast conversation dataLori Lamel, Abdelkhalek Messaoudi, Jean-Luc Gauvain. 1429-1432 [doi]

Transcribing broadcast data using MLP featuresPetr Fousek, Lori Lamel, Jean-Luc Gauvain. 1433-1436 [doi]

Development of the SRI/nightingale Arabic ASR systemDimitra Vergyri, Arindam Mandal, Wen Wang, Andreas Stolcke, Jing Zheng, Martin Graciarena, David Rybach, Christian Gollan, Ralf Schlüter, Katrin Kirchhoff, Arlo Faria, Nelson Morgan. 1437-1440 [doi]

Towards automatic learning in LVCSR: rapid development of a Persian broadcast transcription systemChristian Gollan, Hermann Ney. 1441-1444 [doi]

The CMU-interACT 2008 Mandarin transcription systemRoger Hsiao, Mark C. Fuhs, Yik-Cheung Tam, Qin Jin, Tanja Schultz. 1445-1448 [doi]

Decoding-time prediction of non-verbalized punctuationAnoop Deoras, Jürgen Fritsch. 1449-1452 [doi]

On the impact of alignment on voice conversion performanceElina Helander, Jan Schwarz, Jani Nurminen, Hanna Silén, Moncef Gabbouj. 1453-1456 [doi]

The linear transformation of LF glottal waveforms for voice conversionArantza del Pozo, Steve Young. 1457-1460 [doi]

Maximum a posteriori adaptation for many-to-one eigenvoice conversionDaisuke Tani, Tomoki Toda, Yamato Ohtani, Hiroshi Saruwatari, Kiyohiro Shikano. 1461-1463 [doi]

Improvement to a NAM captured whisper-to-speech systemViet-Anh Tran, Gérard Bailly, Hélène Loevenbruck, Christian Jutten. 1465-1468 [doi]

Speaker identification in noise mismatch conditions based on jump function Kolmogorov analysis in wavelet domainTran Huy Dat, Haizhou Li. 1469-1472 [doi]

Modelling fine-phonetic detail in a computational model of word recognitionOdette Scharenborg. 1473-1476 [doi]

Pronunciation reduction: how it relates to speech style, gender, and ageHelmer Strik, Joost van Doremalen, Catia Cucchiarini. 1477-1480 [doi]

Analysis of glottal stops in speech signalsB. Yegnanarayana, S. Rajendran, Hussien Seid Worku, N. Dhananjaya. 1481-1484 [doi]

The acoustic to articulation mapping: non-linear or non-unique?Daniel Neiberg, G. Ananthakrishnan, Olov Engwall. 1485-1488 [doi]

The entropy of the articulatory phonological code: recognizing gestures from tract variablesXiaodan Zhuang, Hosung Nam, Mark Hasegawa-Johnson, Louis M. Goldstein, Elliot Saltzman. 1489-1492 [doi]

Addressing database mismatch in forensic speaker recognition with Ahumada III: a public real-casework database in SpanishDaniel Ramos, Joaquin Gonzalez-Rodriguez, Javier Gonzalez-Dominguez, Jose Juan Lucena-Molina. 1493-1496 [doi]

FM features for automatic forensic speaker recognitionTharmarajah Thiruvaran, Eliathamby Ambikairajah, Julien Epps. 1497-1500 [doi]

Automatic-type calibration of traditionally derived likelihood ratios: forensic analysis of australian English /o/ formant trajectoriesGeoffrey Stewart Morrison, Yuko Kinoshita. 1501-1504 [doi]

Forensic speaker verification using formant features and Gaussian mixture modelsTimo Becker, Michael Jessen, Catalin Grigoras. 1505-1508 [doi]

The case for automatic higher-level features in forensic speaker recognitionElizabeth Shriberg, Andreas Stolcke. 1509-1512 [doi]

Group delay function for improved gender identificationKye-Hwan Lee, Sang-Ick Kang, Ji-Hyun Song, Joon-Hyuk Chang. 1513-1516 [doi]

Frame-synchronous and local confidence measures for on-the-fly automatic speech recognitionJoseph Razik, Odile Mella, Dominique Fohr, Jean-Paul Haton. 1517-1520 [doi]

Hilbert envelope based spectro-temporal features for phoneme recognition in telephone speechSamuel Thomas, Sriram Ganapathy, Hynek Hermansky. 1521-1524 [doi]

Evidence of coarticulation in a phonological feature detection systemAbhijeet Sangwan, Ayako Ikeno, John H. L. Hansen. 1525-1528 [doi]

Phoneme recognition based on hybrid neural networks with inhibition/enhancement of distinctive phonetic feature (DPF) trajectoriesMohammad Nurul Huda, Kouichi Katsurada, Tsuneo Nitta. 1529-1532 [doi]

A neural network based nonlinear feature transformation for speech recognitionHongbing Hu, Stephen A. Zahorian. 1533-1536 [doi]

Significance of group delay based acoustic features in the linguistic search space for robust speech recognitionR. Ramya, Rajesh M. Hegde, Hema A. Murthy. 1537-1540 [doi]

Genetic programming based optimization of class-dependent PCA for extracting robust MFCCHouman Abbasian, Babak Nasersharif, Ahmad Akbari. 1541-1544 [doi]

Comparison of AM-FM based features for robust speech recognitionK. V. S. Narayana, T. V. Sreenivas. 1545-1548 [doi]

Growing bottleneck features for tandem ASRJoe Frankel, Dong Wang, Simon King. 1549 [doi]

Landmark based recognition of stops: acoustic attributes versus smoothed spectraVeena Karjigi, Preeti Rao. 1550-1553 [doi]

Speech recognition performance of CJLC: corpus of Japanese lecture contentsSatoru Kogure, Hiromitsu Nishizaki, Masatoshi Tsuchiya, Kazumasa Yamamoto, Shingo Togashi, Seiichi Nakagawa. 1554-1557 [doi]

Evaluating spoken language model based on filler prediction model in speech recognitionKengo Ohta, Masatoshi Tsuchiya, Seiichi Nakagawa. 1558-1561 [doi]

Transcription-less call routing using unsupervised language model adaptationNicolae Duta. 1562-1565 [doi]

Large margin multinomial mixture model for text categorizationZhen-Yu Pan, Hui Jiang. 1566-1569 [doi]

Language modeling for speech recognition of spoken CantoneseYu Ting Yeung, Houwei Cao, N. H. Zheng, Tan Lee, P. C. Ching. 1570-1573 [doi]

Discriminative rescoring based on minimization of word errors for transcribing broadcast newsAkio Kobayashi, Takahiro Oku, Shinichi Homma, Shoei Sato, Toru Imai, Tohru Takagi. 1574-1577 [doi]

Search and classification based language model adaptationQin Shi, Stephen M. Chu, Wen Liu, Hong-Kwang Jeff Kuo, Yi Liu, Yong Qin. 1578-1581 [doi]

Fast n-gram language model look-ahead for decoders with static pronunciation prefix treesMarijn Huijbregts, Roeland Ordelman, Franciska de Jong. 1582-1585 [doi]

Thai named-entity recognition using class-based language modeling on multiple-sized subword unitsKwanchiva Saykhum, Vataya Boonpiam, Nattanun Thatphithakkul, Chai Wutiwiwatchai, Cholwich Nattee. 1586-1589 [doi]

Combining statistical and syntactical systems for spoken language understanding with graphical modelsStefan Schwärzler, Jürgen T. Geiger, Joachim Schenk, Marc Al-Hames, Benedikt Hörnler, Günther Ruske, Gerhard Rigoll. 1590-1593 [doi]

Bag-of-word normalized n-gram modelsAbhinav Sethy, Bhuvana Ramabhadran. 1594-1597 [doi]

A study of unsupervised clustering techniques for language modelingSangyun Hahn, Abhinav Sethy, Hong-Kwang Jeff Kuo, Bhuvana Ramabhadran. 1598-1601 [doi]

Automatic estimation of language model parameters for unseen words using morpho-syntactic contextual informationCiro Martins, António J. S. Teixeira, João Paulo Neto. 1602-1605 [doi]

Modeling the effects on time-into-utterance on word probabilitiesNigel G. Ward, Alejandro Vega. 1606-1609 [doi]

Inductive and example-based learning for text classificationYe-Yi Wang, Xiao Li, Alex Acero. 1610-1613 [doi]

Comparing word, character, and phoneme n-grams for subjective utterance recognitionTheresa Wilson, Stephan Raaijmakers. 1614-1617 [doi]

IRSTLM: an open source toolkit for handling large scale language modelsMarcello Federico, Nicola Bertoldi, Mauro Cettolo. 1618-1621 [doi]

Multi-modal recording, analysis and indexing of poster sessionsTatsuya Kawahara, Hisao Setoguchi, Katsuya Takanashi, Kentaro Ishizuka, Shoko Araki. 1622-1625 [doi]

Automatic pitch-synchronous phonetic segmentationJindrich Matousek, Jan Romportl. 1626-1629 [doi]

Two protocols comparing human and machine phonetic recognition performance in conversational speechWade Shen, Joseph P. Olive, Douglas Jones. 1630-1633 [doi]

Analysis of drivers speech in a car environmentTomoyuki Kato, Jun Okamoto, Makoto Shozakai. 1634-1637 [doi]

Preparing a corpus of dutch spontaneous dialogues for automatic phonetic analysisBarbara Schuppler, Mirjam Ernestus, Odette Scharenborg, Lou Boves. 1638-1641 [doi]

Evaluation of voice activity and voicing detectionBojan Kotnik, Pierre Sendorek, Sergey Astrov, Turgay Koc, Tolga Çiloglu, Laura Docío Fernández, Eduardo Rodríguez Banga, Harald Höge, Zdravko Kacic. 1642-1645 [doi]

Wikispeech - a content management system for speech databasesChristoph Draxler, Klaus Jänsch. 1646-1649 [doi]

Development and evaluation of Polish speech corpus for unit selection speech synthesis systemsGrazyna Demenko, Jolanta Bachan, Bernd Möbius, Katarzyna Klessa, Marcin Szymanski, Stefan Grocholewski. 1650-1653 [doi]

A data format enabling interoperation of speech recognition, translation and information extraction engines: the GALE type systemJohn F. Pitrelli, Burn L. Lewis, Edward A. Epstein, Jerome L. Quinn, Ganesh N. Ramaswamy. 1654-1657 [doi]

A rank-predicted pseudo-greedy approach to efficient text selection from large-scale corpus for maximum coverage of target unitsWei Li, Qiang Huo. 1658-1661 [doi]

Memo workbench for semi-automated usability testingKlaus-Peter Engelbrecht, Michael Kruppa, Sebastian Möller, Michael Quade. 1662-1665 [doi]

MDS-based visualization method for multiple speech corporaKimiko Yamakawa, Tomoko Matsui, Shuichi Itahashi. 1666-1669 [doi]

Scripted dialogs versus improvisation: lessons learned about emotional elicitation techniques from the IEMOCAP databaseCarlos Busso, Shrikanth S. Narayanan. 1670-1673 [doi]

Analysis and perception of speech under physical task stressKeith W. Godin, John H. L. Hansen. 1674-1677 [doi]

An analysis of multimodal cues of interruption in dyadic spoken interactionsChi-Chun Lee, Sungbok Lee, Shrikanth S. Narayanan. 1678-1681 [doi]

Paralinguistic effects on turn-taking behavior in expressive conversationHiroki Mori, Hideki Kasuya. 1682 [doi]

Study on ng, a type of discourse markers in standard ChineseZhigang Yin, Aijun Li, Ziyu Xiong. 1683-1686 [doi]

How can you use disfluencies and still sound as a good speaker?Helena Moniz, Ana Isabel Mata, Isabel Trancoso, Céu Viana. 1687 [doi]

What makes a good speaker? subject ratings, acoustic measurements and perceptual evaluationsEva Strangert, Joakim Gustafson. 1688-1691 [doi]

Towards measuring continuous acoustic feature convergence in unconstrained spoken dialoguesSpyros Kousidis, David Dorran, Yi Wang, Brian Vaughan, Charlie Cullen, Dermot Campbell, Ciaran McDonnell, Eugene Coyle. 1692-1695 [doi]

Detection of feeling through back-channels in spoken dialogueTatsuya Kawahara, Masayoshi Toyokura, Teruhisa Misu, Chiori Hori. 1696 [doi]

Adaptive training using discriminative mapping transformsC. K. Raut, Kai Yu, Mark J. F. Gales. 1697-1700 [doi]

Speaker adaptive training using shift-MLLRJonas Lööf, Christian Gollan, Hermann Ney. 1701-1704 [doi]

XMLLR for improved speaker adaptation in speech recognitionDaniel Povey, Hong-Kwang Jeff Kuo. 1705-1708 [doi]

Effective acoustic adaptation for a distant-talking interactive TV systemJing Huang, Mark Epstein, Marco Matassoni. 1709-1712 [doi]

A computationally efficient approach to warp factor estimation in VTLN using EM algorithm and sufficient statisticsP. T. Akhil, S. P. Rath, Srinivasan Umesh, D. Rama Sanand. 1713-1716 [doi]

A reliable technique for detecting the second subglottal resonance and its use in cross-language speaker adaptationShizhen Wang, Steven M. Lulich, Abeer Alwan. 1717-1720 [doi]

Automatic pronunciation evaluation and classificationOm Deshmukh, Sachindra Joshi, Ashish Verma. 1721-1724 [doi]

Pronunciation error detection techniques for children s speechDaniel Bolaños, Wayne Ward, Barbara Wise, Sarel Van Vuuren. 1725-1728 [doi]

Automatic generation and pruning of phonetic mispronunciations to support computer-aided pronunciation trainingLan Wang, Xin Feng, Helen M. Meng. 1729-1732 [doi]

Automatic children s reading tutor on hand-held devicesXiaolong Li, Li Deng, Yun-Cheng Ju, Alex Acero. 1733-1736 [doi]

A Japanese CALL system based on dynamic question generation and error prediction for ASRHongcui Wang, Tatsuya Kawahara. 1737-1740 [doi]

Dysarthric speech database for universal access researchHeejin Kim, Mark Hasegawa-Johnson, Adrienne Perlman, Jon Gunderson, Thomas S. Huang, Kenneth Watkin, Simone Frame. 1741-1744 [doi]

Objective intelligibility assessment of pathological speakersCatherine Middag, Gwen Van Nuffelen, Jean-Pierre Martens, Marc De Bodt. 1745-1748 [doi]

Quantitative analysis of intonation patterns produced by Cantonese speakers with Parkinson s disease: a preliminary studyJoan K. Y. Ma, Tara L. Whitehill. 1749-1752 [doi]

Phonetic-acoustic and feature analyses by a neural network to assess speech quality in patients treated for head and neck cancerMarieke de Bruijn, Irma Verdonck-de Leeuw, Louis ten Bosch, Joop Kuik, Hugo Quené, Lou Boves, Hans Langendijk, C. René Leemans. 1753-1756 [doi]

Automatic evaluation of characteristic speech disorders in children with cleft lip and palateAndreas Maier, Florian Hönig, Christian Hacker, Maria Schuster, Elmar Nöth. 1757-1760 [doi]

Application of weighted finite-state transducers to improve recognition accuracy for dysarthric speechSantiago Omar Caballero Morales, Stephen J. Cox. 1761-1764 [doi]

The interspeech 2008 consonant challengeMartin Cooke, Odette Scharenborg. 1765-1768 [doi]

HMM-based estimation of unreliable spectral components for noise robust speech recognitionBengt J. Borgström, Abeer Alwan. 1769-1772 [doi]

Gammatone-domain model combination for consonant recognition in noisy environmentsJae Sam Yoon, Ji Hun Park, Hong Kook Kim. 1773-1776 [doi]

On the mask modeling and feature representation in the missing-feature ASR: evaluation on the Consonant ChallengePeter Jancovic, Münevver Köküer. 1777-1780 [doi]

The non-native consonant challenge for european languagesMaria Luisa Garcia Lecumberri, Martin Cooke, Francesco Cutugno, Mircea Giurgiu, Bernd T. Meyer, Odette Scharenborg, Wim A. van Dommelen, Jan Volín. 1781-1784 [doi]

Noise reduction through compressed sensingJort F. Gemmeke, Bert Cranen. 1785-1788 [doi]

Speech recognition in noisy environments using a switching linear dynamic model for feature enhancementBjörn Schuller, Martin Wöllmer, Tobias Moosmayr, Gerhard Rigoll. 1789-1792 [doi]

Improving consonant identification in noise and reverberation by steady-state suppression as a preprocessing approachNao Hodoshima, Wataru Yoshida, Takayuki Arai. 1793-1796 [doi]

Human speech perception and feature extractionBryce E. Lobdell, Mark Hasegawa-Johnson, Jont B. Allen. 1797-1800 [doi]

Improving pronunciation modeling for non-native speech recognitionTien Ping Tan, Laurent Besacier. 1801-1804 [doi]

Online vocabulary adaptation using contextual information and information retrievalHagai Aronowitz. 1805-1808 [doi]

Lexicon expansion using pronunciation variations extracted on the basis of speaker-related deviation in recognition error statisticsYoshifumi Onishi. 1809-1812 [doi]

Better nonnative intonation scores through prosodic theoryJoseph Tepperman, Shrikanth S. Narayanan. 1813-1816 [doi]

Silence models in weighted finite-state transducersPhilip N. Garner. 1817-1820 [doi]

Extracting word-pronunciation pairs from comparable set of text and speechTetsuro Sasada, Shinsuke Mori, Tatsuya Kawahara. 1821-1824 [doi]

Synthesis by generation and concatenation of multiform segmentsVincent Pollet, Andrew P. Breen. 1825-1828 [doi]

Glottal spectral separation for parametric speech synthesisJoão P. Cabral, Steve Renals, Korin Richmond, Junichi Yamagishi. 1829-1832 [doi]

Improving speech systems built from very little dataJohn Kominek, Sameer Badaskar, Tanja Schultz, Alan W. Black. 1833-1836 [doi]

Structure to speech conversion - speech generation based on infant-like vocal imitationDaisuke Saito, Satoshi Asakawa, Nobuaki Minematsu, Keikichi Hirose. 1837-1840 [doi]

Statistical text-to-speech synthesis with improved dynamicsStas Tiomkin, David Malah. 1841-1844 [doi]

An evaluation of non-standard features for grapheme-to-phoneme conversionGabriel Webster, Norbert Braunschweiler. 1845-1848 [doi]

Towards flexible speech coding for speech synthesis: an LF + modulated noise vocoderYannis Agiomyrgiannakis, Olivier Rosec. 1849-1852 [doi]

Evaluation of Finnish unit selection and HMM-based speech synthesisHanna Silén, Elina Helander, Jani Nurminen, Moncef Gabbouj. 1853-1856 [doi]

A probabilistic trajectory synthesis system for synthesising visual speechBarry-John Theobald, Nicholas Wilkinson. 1857-1860 [doi]

Paralinguistic elements in speech synthesisDidier Cadic, Lionel Segalen. 1861-1864 [doi]

Building sleek synthesizers for multi-lingual screen readerE. Veera Raghavendra, B. Yegnanarayana, Alan W. Black, Kishore Prahallad. 1865-1868 [doi]

Unsupervised adaptation for HMM-based speech synthesisSimon King, Keiichi Tokuda, Heiga Zen, Junichi Yamagishi. 1869-1872 [doi]

Investigating festival s target cost function using perceptual experimentsVolker Strom, Simon King. 1873-1876 [doi]

Modeling Austrian dialect varieties for TTSFriedrich Neubarth, Michael Pucher, Christian Kranzler. 1877-1880 [doi]

HMM-based Finnish text-to-speech system utilizing glottal inverse filteringTuomo Raitio, Antti Suni, Hannu Pulakka, Martti Vainio, Paavo Alku. 1881-1884 [doi]

LTS using decision forest of regression trees and neural networksTanuja Sarkar, Sachin Joshi, Sathish Chandra Pammi, Kishore Prahallad. 1885-1888 [doi]

Automatic word stress marking and syllabification for Catalan TTSSilvia Rustullet, Daniela Braga, João Nogueira, Miguel Sales Dias. 1889-1892 [doi]

Robust far-field speaker identification under mismatched conditionsQin Jin, Tanja Schultz. 1893-1896 [doi]

Robust speaker verification using short-time frequency with long-time window and fusion of multi-resolutionsChien-Lin Huang, Bin Ma, Chung-Hsien Wu, Brian Mak, Haizhou Li. 1897-1900 [doi]

Performance improvement of text-independent speaker verification systems based on histogram enhancement in noisy environmentsC. H. Kwon, J. K. Choi, Eliathamby Ambikairajah. 1901-1904 [doi]

Filling acoustic holes through leveraged uncorellated GMMs for in-set/out-of-set speaker recognitionJun-Won Suh, Pongtep Angkititrakul, John H. L. Hansen. 1905-1908 [doi]

Missing-feature method for speaker recognition in band-restricted conditionsWooil Kim, John H. L. Hansen. 1909-1912 [doi]

Robust speaker identification using cross-correlation GTF-ICA featureYushi Zhang, Waleed H. Abdulla. 1913-1916 [doi]

Perceptual speaker identification using monosyllabic stimuli - effects of the nucleus vowels and speaker characteristics contained in nasalsKanae Amino, Takayuki Arai. 1917-1920 [doi]

Text-dependent speaker recognition by efficient capture of speaker dynamics in compressed time-frequency representations of speechAmitava Das, Gokul Chittaranjan. 1921-1924 [doi]

Usefulness of text-conditioning and a new database for text-dependent speaker recognition researchAmitava Das, Gokul Chittaranjan, Gopala Krishna Anumanchipalli. 1925-1928 [doi]

Combination method of bone-conduction speech and air-conduction speech for speaker recognitionSatoru Tsuge, Takashi Osanai, Hisanori Makinae, Toshiaki Kamada, Minoru Fukumi, Shingo Kuroiwa. 1929-1932 [doi]

MAP and sub-word level t-norm for text-dependent speaker recognitionDoroteo Torre Toledano, Daniel Hernández López, Cristina Esteve-Elizalde, Joaquin Gonzalez-Rodriguez, Rubén Fernández Pozo, Luis A. Hernández Gómez. 1933-1936 [doi]

Forensic speaker recognition in Chinese: a multivariate likelihood ratio discrimination on /i/ and /y/Cuiling Zhang, Geoffrey Stewart Morrison, Philip Rose. 1937-1940 [doi]

How many do we need? exploration of the population size effect on the performance of forensic speaker classificationShunichi Ishihara, Yuko Kinoshita. 1941-1944 [doi]

Comparing prosodic models for speaker recognitionCheung Chi Leung, Marc Ferras, Claude Barras, Jean-Luc Gauvain. 1945-1948 [doi]

Combination of clean and contaminated GMM/SVM for far-field text-independent speaker verificationChristian Zieger, Maurizio Omologo. 1949-1952 [doi]

English word stress as produced by English and dutch speakers: the role of segmental and suprasegmental differencesBettina Braun, Kristin Lemhofer, Anne Cutler. 1953 [doi]

The strength of stress-related lexical competition depends on the presence of first-syllable stressEva Reinisch, Alexandra Jesse, James M. McQueen. 1954 [doi]

Word stress placement by native speakers and Japanese learners of EnglishKeiichi Ishikawa, Jun Nomura. 1955-1958 [doi]

Schwa variants in american EnglishH. Timothy Bunnell, Jason Lilley. 1959-1962 [doi]

Covariations of English segmental durations across speakersJiahong Yuan. 1963 [doi]

The intelligibility of the English vowel /ʌ/ produced by native speakers of Japanese and its relations to the acoustic characteristicsAkiyo Joto. 1964-1967 [doi]

Rate dependent spectral reduction for voiceless fricativesBenjamin Weiss. 1968 [doi]

Investigating perception of places of articulation in sign and speechStina Ojala, Olli Aaltonen, Tapio Salakoski. 1969 [doi]

Six- and twelve-month-olds discrimination of native versus non-native between- and within-organ fricative place contrastsMichael D. Tyler, Catherine T. Best, Louis M. Goldstein, Mark Antoniou, Lidija Krebs-Lazendic. 1970 [doi]

your baby can t hear you : how mothers talk to infants with simulated hearing lossChrista Lam, Christine Kitamura. 1971 [doi]

Development of communicative skills in 8- to 16-month-old children: a longitudinal studyEeva Klintfors, Ulla Sundberg, Francisco Lacerda, Ellen Marklund, Lisa Gustavsson, Ulla Bjursäter, Iris-Corinna Schwarz, Göran Söderlund. 1972-1975 [doi]

Vocal imitation in early language acquisitionLisa Gustavsson, Francisco Lacerda. 1976-1979 [doi]

Computational language acquisition by statistical bottom-up processingOkko Johannes Räsänen, Unto K. Laine, Toomas Altosaar. 1980-1983 [doi]

Lexical analyses of native and non-native English language instructor speech based on a six-month co-taught classroom video corpusNoriaki Katagiri, Goh Kawai. 1984-1987 [doi]

Perception and production of consonant clusters in Japanese-English bilingual and Japanese monolingual speakersHinako Masuda, Takayuki Arai. 1988-1991 [doi]

On a generalization of margin-based discriminative training to robust speech recognitionJinyu Li, Chin-Hui Lee. 1992-1995 [doi]

Discriminative classifiers with generative kernels for noise robust ASRMark J. F. Gales, Chris Longworth. 1996-1999 [doi]

Covariance modelling for noise-robust speech recognitionRogier C. van Dalen, Mark J. F. Gales. 2000-2003 [doi]

Exploiting spatial-temporal feature distribution characteristics for robust speech recognitionWei-Hau Chen, Shih-Hsiang Lin, Berlin Chen. 2004-2007 [doi]

Study of integration of statistical model-based voice activity detection and noise suppressionMasakiyo Fujimoto, Kentaro Ishizuka, Tomohiro Nakatani. 2008-2011 [doi]

Neural network based regression for robust overlapping speech recognition using microphone arraysWeifeng Li, John Dines, Mathew Magimai-Doss, Hervé Bourlard. 2012-2015 [doi]

Lip synchronization: from phone lattice to PCA eigen-projections using neural networksSamer Al Moubayed, Michael De Smet, Hugo Van Hamme. 2016-2019 [doi]

Building and combining document and music spaces for music query-by-webpage systemRyoei Takahashi, Yasunori Ohishi, Norihide Kitaoka, Kazuya Takeda. 2020-2023 [doi]

Improving searching speed and accuracy of query by humming system based on three methods: feature fusion, candidates set reduction and multiple similarity measurement rescoringLei Wang, Shen Huang, Sheng Hu, Jiaen Liang, Bo Xu. 2024-2027 [doi]

Towards a segmental vocoder driven by ultrasound and optical images of the tongue and lipsThomas Hueber, Gérard Chollet, Bruce Denby, Gérard Dreyfus, Maureen Stone. 2028-2031 [doi]

Phone recognition from ultrasound and optical video sequences for a silent speech interfaceThomas Hueber, Gérard Chollet, Bruce Denby, Gérard Dreyfus, Maureen Stone. 2032-2035 [doi]

Feature space transforms for Czech sign-language recognitionJan Trmal, Marek Hrúz, Jan Zelinka, Pavel Campr, Ludek Müller. 2036-2039 [doi]

Masked speech priming: no priming in dense neighbourhoodsChris Davis, Jeesun Kim, Angelo Barbaro. 2040-2043 [doi]

Integration of audiovisual speech and priming effectsAzra N. Ali. 2044-2047 [doi]

Similarity between vowels influences response execution in word identificationJason D. Zevin, Thomas A. Farmer. 2048-2051 [doi]

Prelexically-driven perceptual retuning of phoneme boundariesAnne Cutler, James M. McQueen, Sally Butterfield, Dennis Norris. 2056 [doi]

Visual speech modifies the phoneme restoration effectErin Cvejic, Jeesun Kim, Chris Davis. 2057 [doi]

An objective singing evaluation approach by relating acoustic measurements to perceptual ratingsChuan Cao, Ming Li, Jian Liu, YongHong Yan. 2058-2061 [doi]

On the perceived quality of noise reduced signalsValérie Gautier-Turbin, Laetitia Gros. 2062-2065 [doi]

A methodology and tool suite for evaluation of accuracy of interoperating statistical natural language processing enginesUma Murthy, John F. Pitrelli, Ganesh N. Ramaswamy, Martin Franz, Burn L. Lewis. 2066-2069 [doi]

An empirical analysis of word error rate and keyword error rateYoungja Park, Siddharth Patwardhan, Karthik Visweswariah, Stephen C. Gates. 2070-2073 [doi]

Measuring speech quality impact on tasks performanceVirginie Durin, Laetitia Gros. 2074-2077 [doi]

Voice commands in home environment - a consumer surveyHannu Soronen, Markku Turunen, Jaakko Hakulinen. 2078-2081 [doi]

Extended partial distance elimination and dynamic Gaussian selection for fast likelihood computationGhazi Bouselmi, Jun Cai. 2082-2085 [doi]

Improving the multigram algorithm by using lattices as inputJoris Driesen, Hugo Van Hamme. 2086-2089 [doi]

Backward Viterbi beam search for utilizing dynamic task complexity informationMin Tang, Philippe Di Cristo. 2090-2093 [doi]

Fast speech decoding through phone confusion networksNicola Bertoldi, Marcello Federico, Daniele Falavigna, Matteo Gerosa. 2094-2097 [doi]

High-performance low-latency speech recognition via multi-layered feature streaming and fast Gaussian computationLiang Gu, Jian Xue, Xiaodong Cui, Yuqing Gao. 2098-2101 [doi]

A low-power hardware search architecture for speech recognitionPatrick J. Bourke, Rob A. Rutenbar. 2102-2105 [doi]

Phonetic query expansion for spoken document retrievalJonathan Mamou, Bhuvana Ramabhadran. 2106-2109 [doi]

Implementation and evaluation of fast on-the-fly WFST composition algorithmsTasuku Oonishi, Paul R. Dixon, Koji Iwano, Sadaoki Furui. 2110-2113 [doi]

Analysis of voice-quality features of speech that expresses anger , joy , and sadness uttered by radio actors and actressesShoichi Takeda, Yuuri Yasuda, Risako Isobe, Shogo Kiryu, Makiko Tsuru. 2114-2117 [doi]

Including pitch accent optionality in unit selection text-to-speech synthesisLeonardo Badino, Robert A. J. Clark, Volker Strom. 2118-2121 [doi]

Emotion conversion using F0 segment selectionZeynep Inanoglu, Steve Young. 2122-2125 [doi]

Generating natural F0 trajectory with additive treesYao Qian, Hui Liang, Frank K. Soong. 2126-2129 [doi]

Generating intonation from a mixed CART-HMM model for speech synthesisCédric Boidin, Olivier Boëffard. 2130-2133 [doi]

Intonation modeling of Mandarin Chinese using a superpositional approachPablo Daniel Agüero, Antonio Bonafonte, Lu Yu, Juan Carlos Tulli. 2134-2137 [doi]

Two-stage prosody prediction for emotional text-to-speech synthesisHao Tang, Xi Zhou, Matthias Odisio, Mark Hasegawa-Johnson, Thomas S. Huang. 2138-2141 [doi]

Prosody boundary detection through context-dependent position modelsYue-Ning Hu, Min Chu, Chao Huang, Yan-ning Zhang. 2142-2145 [doi]

Addressing the out-of-vocabulary problem for large-scale Chinese spoken term detectionSha Meng, Jian Shao, Roger Peng Yu, Jia Liu, Frank Seide. 2146-2149 [doi]

Towards vocabulary-independent speech indexing for large-scale repositoriesJian Shao, Roger Peng Yu, QingWei Zhao, YongHong Yan, Frank Seide. 2150-2153 [doi]

Towards the integration of automatic speech recognition and information retrieval for spoken query processingAntonio Moreno-Daniel, Jay G. Wilpon, Biing-Hwang Juang, S. Parthasarathy. 2154-2157 [doi]

Reducing the effect of OOV query words by using morph-based spoken document retrievalVille T. Turunen. 2158-2161 [doi]

Bayesian latent topic clustering modelMeng-Sung Wu, Jen-Tzung Chien. 2162-2165 [doi]

Spoken document retrieval by translating recognition candidates into correct transcriptionsTomoyosi Akiba, Yusuke Yokota. 2166-2169 [doi]

Audio indexing for an interactive Italian literature management systemCarlo Drioli, Piero Cosi. 2170 [doi]

Open-vocabulary spoken-document retrieval based on query expansion using related web documentsMakoto Terao, Takafumi Koshinaka, Shinichi Ando, Ryosuke Isotani, Akitoshi Okumura. 2171-2174 [doi]

Discriminative graph training for ultra-fast low-footprint speech indexingUpendra V. Chaudhari, Hong-Kwang Jeff Kuo, Brian Kingsbury. 2175-2178 [doi]

A language-modeling approach to inverse text normalization and data cleanup for multimodal voice search applicationsYun-Cheng Ju, Julian Odell. 2179-2182 [doi]

Topic segmentation and indexation in a media watch systemRui Amaral, Isabel Trancoso. 2183-2186 [doi]

Vocabulary independent discriminative term frequency estimationJ. Scott Olsson. 2187-2190 [doi]

Spoken keyword spotting via multi-lattice alignmentHui Lin, Alex Stupakov, Jeff A. Bilmes. 2191-2194 [doi]

Robust spoken term detection using combination of phone-based and word-based recognitionKenji Iwata, Koichi Shinoda, Sadaoki Furui. 2195-2198 [doi]

Language model adaptation for a speech to sign language translation system using web frequencies and a MAP frameworkLuis Fernando D Haro, Rubén San Segundo, Ricardo de Córdoba, Jan Bungeroth, Daniel Stein, Hermann Ney. 2199-2202 [doi]

Hearing at home - communication support in home environments for hearing impaired personsJonas Beskow, Björn Granström, Peter Nordqvist, Samer Al Moubayed, Giampiero Salvi, Tobias Herzke, Arne Schulz. 2203-2206 [doi]

Traveling wave based group delays for cochlear implant speech processingDaniel A. Taft, David B. Grayden, Anthony N. Burkitt. 2207 [doi]

Multimodal perception of Mandarin tone for cochlear implant usersDamien J. Smith, Denis Burnham. 2208 [doi]

Evaluation of speaking-aid system with voice conversion for laryngectomees toward its use in practical environmentsKeigo Nakamura, Tomoki Toda, Yoshitaka Nakajima, Hiroshi Saruwatari, Kiyohiro Shikano. 2209-2212 [doi]

An acoustic typology of apraxic speech - toward reliable diagnosisJacqueline McKechnie, Kirrie J. Ballard, Donald A. Robin, Adam Jacks, Sallyanne Palethorpe, Kristin M. Rosen. 2213 [doi]

Dysphonic voices and the 0-3000 hz frequency bandGilles Pouchoulin, Corinne Fredouille, Jean-François Bonastre, Alain Ghio, Antoine Giovanni. 2214-2217 [doi]

Verifying pronunciation accuracy from speakers with neuromuscular disordersShou-Chun Yin, Richard C. Rose, Oscar Saz, Eduardo Lleida. 2218-2221 [doi]

Multi-band and multi-cue analyses of disordered connected speechAli Alpan, Youri Maryn, Francis Grenez, Abdellah Kacha, Jean Schoentgen. 2222-2225 [doi]

Combining neural network and rule-based systems for dysarthria diagnosisJames Carmichael, Vincent Wan, Phil Green. 2226-2229 [doi]

Speech as a means of monitoring cognitive function of elderly speakersShona D Arcy, Viliam Rapcan, Nils Penard, Margaret E. Morris, Ian H. Robertson, Richard B. Reilly. 2230-2233 [doi]

Integration of metamodel and acoustic model for speech recognitionHironori Matsumasa, Tetsuya Takiguchi, Yasuo Ariki, Ichao Li, Toshitaka Nakabayashi. 2234-2237 [doi]

Frequency compression/transposition of fricative consonants for the hearing impaired with high-frequency dead regionsFrancisco J. Fraga, Leticia P. Costa S. Prates, Maria Cecilia M. Iorio. 2238-2241 [doi]

On the combination of auditory and modulation frequency channels for ASR applicationsFabio Valente, Hynek Hermansky. 2242-2245 [doi]

Tandem processing of fepstrum featuresVivek Tyagi. 2246-2249 [doi]

Data-driven clustered hierarchical tandem system for LVCSRShuo-Yiin Chang, Lin-Shan Lee. 2250-2253 [doi]

Linear discriminant feature extraction using weighted classification confusion informationHung-Shin Lee, Berlin Chen. 2254-2257 [doi]

Use of spectral centre of gravity for generating speaker invariant features for automatic speech recognitionD. Rama Sanand, V. Balaji, Rani R. Sandhya, Srinivasan Umesh. 2258-2261 [doi]

Short- and long-term dynamic features for robust speech recognitionTakashi Fukuda, Osamu Ichikawa, Masafumi Nishimura. 2262-2265 [doi]

Duration refinement by jointly optimizing state and longer unit likelihoodBoyang Gao, Yao Qian, Zhizheng Wu, Frank K. Soong. 2266-2269 [doi]

T-tilt: a modified tilt model for F0 analysis and synthesis in tonal languagesAusdang Thangthai, Nattanun Thatphithakkul, Chai Wutiwiwatchai, Anocha Rugchatjaroen, Sittipong Saychum. 2270-2273 [doi]

Multilevel parametric-base F0 model for speech synthesisJavier Latorre, Masami Akamine. 2274-2277 [doi]

On the generation of synthetic disfluent speech: local prosodic modifications caused by the insertion of editing termsJordi Adell, Antonio Bonafonte, David Escudero Mancebo. 2278-2281 [doi]

A comparison of voice conversion methods for transforming voice quality in emotional speech synthesisOytun Türk, Marc Schröder. 2282-2285 [doi]

Tree grammars as models of prosodic structureJoseph Tepperman, Shrikanth S. Narayanan. 2286-2289 [doi]

Relation between geometry and kinematics of articulatory trajectory associated with emotional speech productionSungbok Lee, Tsuneo Kato, Shrikanth S. Narayanan. 2290-2293 [doi]

Intrinsic consonantal F0 perturbation in 3-way VOT contrast and its implications for aspiration-conditioned tonal split: evidence from VietnameseMichael J. Carne. 2294-2297 [doi]

A model based investigation of activation patterns of the tongue muscles for vowel productionQiang Fang, Satoru Fujita, Xugang Lu, Jianwu Dang. 2298-2301 [doi]

Interrelationship between vocal effort and vocal tract acoustics: a pilot studyMaeva Garnier, Joe Wolfe, Nathalie Henrich, John Smith. 2302-2305 [doi]

Predicting tongue shapes from a few landmark locationsChao Qin, Miguel Á. Carreira-Perpiñán, Korin Richmond, Alan Wrench, Steve Renals. 2306-2309 [doi]

LIPS2008: visual speech synthesis challengeBarry-John Theobald, Sascha Fagel, Gérard Bailly, Frédéric Elisei. 2310-2313 [doi]

Speech-driven lip motion generation with a trajectory HMMGregor Hofer, Junichi Yamagishi, Hiroshi Shimodaira. 2314-2317 [doi]

A trainable trajectory formation model TD-HMM parameterized for the LIPS 2008 challengeGérard Bailly, Oxana Govokhina, Gaspard Breton, Frédéric Elisei, Christophe Savariaux. 2318-2321 [doi]

Comparing text-driven and speech-driven visual speech synthesisersBarry-John Theobald, Gavin C. Cawley, J. Andrew Bangham, Iain Matthews, Nicholas Wilkinson. 2322 [doi]

Automatic lip synchronization by speech signal analysisGoranka Zoric, Aleksandra Cerekovic, Igor S. Pandzic. 2323 [doi]

MASSY speaks English: adaptation and evaluation of a talking headSascha Fagel. 2324 [doi]

From 3-d speaker cloning to text-to-audiovisual-speechSascha Fagel, Frédéric Elisei, Gérard Bailly. 2325 [doi]

A development of Czech talking headZdenek Krnoul, Milos Zelezný. 2326-2329 [doi]

Realistic facial animation system for interactive servicesKang Liu, Jörn Ostermann. 2330-2333 [doi]

Speech-driven 3d facial animation for mobile entertainmentJuan Yan, Xiang Xie, Hao Hu. 2334-2337 [doi]

A real-time text to audio-visual speech synthesis systemLijuan Wang, Xiaojun Qian, Lei Ma, Yao Qian, Yining Chen, Frank K. Soong. 2338-2341 [doi]

Spoken language translation systems ************ ASR word lattice translation with exhaustive reordering is possibleEvgeny Matusov, Björn Hoffmeister, Hermann Ney. 2342-2345 [doi]

Development of SRI s translation systems for broadcast news and broadcast conversationsJing Zheng, Wen Wang, Necip Fazil Ayan. 2346-2349 [doi]

Machine translation in continuous spaceRuhi Sarikaya, Yonggang Deng, Mohamed Afify, Brian Kingsbury, Yuqing Gao. 2350-2353 [doi]

Discovering phrases in machine translation by simulated annealingCaroline Lavecchia, David Langlois, Kamel Smaïli. 2354-2357 [doi]

Towards domain independence in machine aided human translationAarthi Reddy, Richard C. Rose. 2358-2361 [doi]

Class-based statistical machine translation for field maintainable speech-to-speech translationIan R. Lane, Alex Waibel. 2362-2365 [doi]

Nonnative speech recognition based on state-candidate bilingual model modificationQingqing Zhang, Ta Li, Jielin Pan, YongHong Yan. 2366-2369 [doi]

Prosodic and spectral features within segment-based acoustic modelingBjörn Schuller, Xiaohua Zhang, Gerhard Rigoll. 2370-2373 [doi]

Unsupervised versus supervised training of acoustic modelsJeff Z. Ma, Richard M. Schwartz. 2374-2377 [doi]

A comparison of broad phonetic and acoustic units for noise robust segment-based phonetic recognitionTara N. Sainath, Victor Zue. 2378-2381 [doi]

Aggregated cross-validation and its efficient application to Gaussian mixture optimizationTakahiro Shinozaki, Sadaoki Furui, Tatsuya Kawahara. 2382-2385 [doi]

A minimum classification error based distance measure for template based speech recognitionMike Matton, Dirk Van Compernolle, Ronald Cools. 2386-2389 [doi]

A penalized logistic regression approach to detection based phone classificationSabato Marco Siniscalchi, Torbjørn Svendsen, Chin-Hui Lee. 2390-2393 [doi]

Incorporating acoustical modelling of phone transitions in an hybrid ANN/HMM speech recognizerAlberto Abad, João Paulo Neto. 2394-2397 [doi]

Flexible discriminative training based on equal error group scores obtained from an error-indexed forward-backward algorithmErik McDermott, Atsushi Nakamura. 2398-2401 [doi]

Pitch adaptive features for LVCSRGiulia Garau, Steve Renals. 2402-2405 [doi]

Using syllable nuclei locations to improve automatic speech recognition in the presence of burst noiseChris D. Bartels, Jeff A. Bilmes. 2406-2409 [doi]

Effects of allophones on the performance of Korean speech recognitionHyejin Hong, SunHee Kim, Minhwa Chung. 2410-2413 [doi]

Combining evidence from a generative and a discriminative model in phoneme recognitionJoel Pinto, Hynek Hermansky. 2414-2417 [doi]

Fragmented context-dependent syllable acoustic modelsKishan Thambiratnam, Frank Seide. 2418-2421 [doi]

Speech recognition using non-linear trajectories in a formant-based articulatory layer of a multiple-level segmental HMMHongwei Hu, Martin J. Russell. 2422-2425 [doi]

Recent improvements of the RWTH GALE Mandarin LVCSR systemChristian Plahl, Björn Hoffmeister, M.-Y. Hwang, D. Lu, Georg Heigold, Jonas Lööf, Ralf Schlüter, Hermann Ney. 2426-2429 [doi]

Intonational phrases for speech summarizationSameer Maskey, Andrew Rosenberg, Julia Hirschberg. 2430-2433 [doi]

Packing the meeting summarization knapsackKorbinian Riedhammer, Daniel Gillick, Benoît Favre, Dilek Hakkani-Tür. 2434-2437 [doi]

Class lecture summarization taking into account consecutiveness of important sentencesYasuhisa Fujii, Kazumasa Yamamoto, Norihide Kitaoka, Seiichi Nakagawa. 2438-2441 [doi]

Using latent Dirichlet allocation to incorporate domain knowledge for topic transition detectionXiaodan Zhu, Xuming He, Cosmin Munteanu, Gerald Penn. 2443-2445 [doi]

Weakly supervised training for parsing Mandarin broadcast transcriptsWen Wang. 2446-2449 [doi]

Dependency parsing of Japanese spoken monologue based on clause-starts detectionTomohiro Ohno, Shigeki Matsubara, Hideki Kashioka, Yasuyoshi Inagaki. 2454-2457 [doi]

Online unsupervised pattern discovery in speech using parallelizationMrugesh R. Gajjar, R. Govindarajan, T. V. Sreenivas. 2458-2461 [doi]

A comparison of input entry rates in a multimodal mobile applicationAleksi Melto, Markku Turunen, Jaakko Hakulinen, Anssi Kainulainen, Tomi Heimonen. 2462-2465 [doi]

Physically embodied conversational agents as health and fitness companionsMarkku Turunen, Jaakko Hakulinen, Cameron Smith, Daniel Charlton, Li Zhang, Marc Cavazza. 2466-2469 [doi]

User perception of multi-modal interfaces for mobile applicationsFlorian Metze, Roman Englert, Udo Bub, Ingmar Kliche, Thomas Scheerbarth. 2470-2473 [doi]

Design and formulation for speech interface based on flexible shortcutsTeppei Nakano, Tomoyuki Kumai, Tetsunori Kobayashi, Yasushi Ishikawa. 2474-2477 [doi]

Exploring classification techniques in speech based cognitive load monitoringBo Yin, Natalie Ruiz, Fang Chen, Eliathamby Ambikairajah. 2478-2481 [doi]

Finding two-level interpersonal context: proximity and conversation detection from personal audio feature dataMasayuki Okamoto, Naoki Iketani, Keisuke Nishimura, Masaaki Kikuchi, Kenta Cho, Masanori Hattori, Sougo Tsuboi. 2482-2485 [doi]

From domain specification to virtual humans: an integrated approach to authoring tactical questioning charactersSudeep Gandhe, David DeVault, Antonio Roque, Bilyana Martinovski, Ron Artstein, Anton Leuski, Jillian Gerten, David R. Traum. 2486-2489 [doi]

Designing a massively multiplayer online role-playing game around text-to-speechMike Rozak. 2490-2493 [doi]

Robust speaker change detection using Kernel-Gaussian modelJie Gao, Xiang Zhang, QingWei Zhao, YongHong Yan. 2494-2497 [doi]

A comparative study in automatic recognition of broadcast audioStavros Ntalampiras, Nikos Fakotakis. 2498-2501 [doi]

Joint time-frequency segmentation for transient decompositionCharturong Tantibundhit, Gernot Kubin. 2502-2505 [doi]

Language and genre detection in audio content analysisVikramjit Mitra, Daniel Garcia-Romero, Carol Y. Espy-Wilson. 2506-2509 [doi]

An entropy based feature for whisper-island detection within audio streamsChi Zhang, John H. L. Hansen. 2510-2513 [doi]

Two step speaker segmentation method using Bayesian information criterion and adapted Gaussian mixtures modelsMatej Grasic, Marko Kos, Andrej Zgank, Zdravko Kacic. 2514-2517 [doi]

Domain-specific classification methods for disfluency detectionSebastian Germesin, Tilman Becker, Peter Poller. 2518-2521 [doi]

Multi-speaker meeting audio segmentationTin Lay Nwe, Minghui Dong, Swe Zin Kalayar Khine, Haizhou Li. 2522-2525 [doi]

Rhythm based music segmentation and octave scale cepstral features for sung language recognitionNamunu Chinthaka Maddage, Haizhou Li. 2526-2529 [doi]

Robust voiced/unvoiced speech classification using empirical mode decomposition and periodic correlation modelMd. Khademul Islam Molla, Keikichi Hirose, Nobuaki Minematsu. 2530-2533 [doi]

A combination of data mining method with decision trees building for speech/music discriminationQiong Wu, Qin Yan, Jun Wang, Jun Hong. 2534-2537 [doi]

Advertisement detection in French broadcast news using acoustic repetition and Gaussian mixture modelsVishwa Gupta, Gilles Boulianne, Patrick Kenny, Pierre Dumouchel. 2538-2541 [doi]

Parsing with subdomain instance weighting from raw corporaBarbara Plank, Khalil Sima an. 2540 [doi]

A hybrid SVM/MCE training approach for vector space topic identification of spoken audio recordingsTimothy J. Hazen, Fred Richardson. 2542-2545 [doi]

Training audio events detectors with a sound effects corpusIsabel Trancoso, José Portelo, Miguel Bugalho, João Paulo Neto, António Joaquim Serralheiro. 2546-2549 [doi]

Longitudinal study of ASR performance on ageing voicesRavichander Vipperla, Steve Renals, Joe Frankel. 2550-2553 [doi]

Investigations into phonological attribute classifier representations for CRF phone recognitionPrateeti Mohapatra, Eric Fosler-Lussier. 2558-2561 [doi]

Applications of virtual-evidence based speech recognizer trainingAmarnag Subramanya, Jeff A. Bilmes. 2562-2565 [doi]

Spoken digit recognition using a hierarchical temporal memoryJoost van Doremalen, Lou Boves. 2566-2569 [doi]

A computational model of language acquisition: focus on word discoveryLouis ten Bosch, Hugo Van Hamme, Lou Boves. 2570-2573 [doi]

Voice activity detection using modified Wigner-ville distributionLakshmish Kaushik, Douglas D. O Shaughnessy. 2574-2577 [doi]

Energy and entropy based switching algorithm for speech endpoint detection in varying SNR conditionsKrishna Chaitanya, Rohit Sinha. 2578-2581 [doi]

Detection of speech embedded in real acoustic background based on amplitude modulation spectrogram featuresJörn Anemüller, Denny Schmidt, Jörg-Hendrik Bach. 2582-2585 [doi]

Voice activity detection algorithms using subband power distance feature for noisy environmentsTuan Van Pham, Michael Stadtschnitzer, Franz Pernkopf, Gernot Kubin. 2586-2589 [doi]

Speech-overlapped acoustic event detection for automotive applicationsChristian Müller, Joan-Isaac Biel, Edward Kim, Daniel Rosario. 2590-2593 [doi]

Detection of acoustic events in interactive seminar data with temporal overlapsAndrey Temko, Climent Nadeu. 2594-2597 [doi]

Robust signal-to-noise ratio estimation based on waveform amplitude distribution analysisChanwoo Kim, Richard M. Stern. 2598-2601 [doi]

Speech analysis using instantaneous frequency deviationAnthony P. Stark, Kuldip K. Paliwal. 2602-2605 [doi]

Auditory-based formant estimation in noise using a probabilistic frameworkClaudius Gläser, Martin Heckmann, Frank Joublin, Christian Goerick. 2606-2609 [doi]

Efficient representation of throat microphone speechK. Sri Rama Murty, Saurav Khurana, Yogendra Umesh Itankar, M. R. Kesheorey, B. Yegnanarayana. 2610-2613 [doi]

Acoustic-phonetic approach for automatic evaluation of spoken grammarOm Deshmukh, Ashish Verma. 2614-2617 [doi]

On estimation of a speaker s confusion matrix from sparse dataStephen Cox. 2618-2621 [doi]

Talking heads and pronunciation training: a reviewValérie Hazan. 2622 [doi]

Pronunciation training: the role of eye and earDominic W. Massaro, Stephanie Bigler, Trevor H. Chen, Marcus Perlman, Slim Ouni. 2623-2626 [doi]

Can visualization of internal articulators support speech perception?Preben Wik, Olov Engwall. 2627-2630 [doi]

Can audio-visual instructions help learners improve their articulation? - an ultrasound study of short term changesOlov Engwall. 2631-2634 [doi]

Can you read tongue movements ?Pierre Badin, Yuliya Tarabalka, Frédéric Elisei, Gérard Bailly. 2635-2638 [doi]

Two- and three-dimensional visual articulatory models for pronunciation training and for treatment of speech disordersBernd J. Kröger, Verena Graf-Borttscheller, Anja Lowit. 2639-2642 [doi]

A 3-d virtual head as a tool for speech therapy for childrenSascha Fagel, Katja Madany. 2643-2646 [doi]

Anton: an animatronic model of a human tongue and vocal tractRobin Hofe, Roger K. Moore. 2647-2650 [doi]

Physical models of the human vocal tract with gel-type materialTakayuki Arai. 2651-2654 [doi]

Mispronunciation detection for Mandarin ChineseChao Huang, Feng Zhang, Frank K. Soong, Min Chu. 2655-2658 [doi]

Efficient handwriting correction of speech recognition errors with template constrained posterior (TCP)Lijuan Wang, Tao Hu, Peng Liu, Frank K. Soong. 2659-2662 [doi]

Bi-Gaussian score equalization in an audio-visual SVM-based person verification systemPascual Ejarque, Javier Hernando. 2663-2666 [doi]

Speech recognition for vocalized and subvocal modes of production using surface EMG signals from the neck and faceGeoffrey S. Meltzner, Jason J. Sroka, James T. Heaton, L. Donald Gilmore, Glen Colby, Serge H. Roy, Nancy Chen, Carlo J. De Luca. 2667-2670 [doi]

Distinctive feature fusion for recognition of australian English consonantsTrent W. Lewis, David M. W. Powers. 2671-2674 [doi]

Time-lag adaptation for semi-synchronous speech and pen inputYasushi Watanabe, Koichi Shinoda, Sadaoki Furui. 2675-2678 [doi]

Continuous pose-invariant lipreadingPatrick Lucey, Sridha Sridharan, David Dean. 2679-2682 [doi]

Czech-to-slovak adapted broadcast news transcription systemJan Nouza, Jan Silovský, Jindrich Zdánský, Petr Cerva, Martin Kroul, Josef Chaloupka. 2683-2686 [doi]

Continuous phone recognition without target language training dataDau-Cheng Lyu, Sabato Marco Siniscalchi, Tae-Yoon Kim, Chin-Hui Lee. 2687-2690 [doi]

An investigation of acoustic models for multilingual code-switchingChristopher M. White, Sanjeev Khudanpur, James K. Baker. 2691-2694 [doi]

Cross-lingual portability of MLP-based tandem features - a case study for English and HungarianLászló Tóth, Joe Frankel, Gábor Gosztolya, Simon King. 2695-2698 [doi]

Seed models combination and state level mappings of cross-lingual transfer for rapid HMM development: from English to MandarinXufang Zhao, Douglas D. O Shaughnessy. 2699-2702 [doi]

Multi-accent and accent-independent non-native speech recognitionGhazi Bouselmi, Dominique Fohr, Irina Illina. 2703-2706 [doi]

Cross-lingual sentence extraction for information distillationAdish Kumar Singla, Dilek Z. Hakkani-Tür. 2707-2710 [doi]

On the use of a multilingual neural network front-endStefano Scanzio, Pietro Laface, Luciano Fissore, Roberto Gemello, Franco Mana. 2711-2714 [doi]

Context-sensitive probabilistic phone mapping model for cross-lingual speech recognitionKhe Chai Sim, Haizhou Li. 2715-2718 [doi]

A non-acoustic approach to crosslingual speech recognition performance predictionChen Liu, Lynette Melnar. 2719-2722 [doi]

Factored translation models for enriching spoken language translation with prosodyVivek Kumar Rangarajan Sridhar, Srinivas Bangalore, Shrikanth S. Narayanan. 2723-2726 [doi]

Data selection and smoothing in an open-source system for the 2008 NIST machine translation evaluationHolger Schwenk, Yannick Estève. 2727-2730 [doi]

Strategies for building a Farsi-English SMT system from limited resourcesAndreas Kathol, Jing Zheng. 2731-2734 [doi]

Stream decoding for simultaneous spoken language translationMuntsin Kolss, Stephan Vogel, Alex Waibel. 2735-2738 [doi]

Towards unsupervised training of the classifier-based speech translatorEmil Ettelaie, Panayiotis G. Georgiou, Shrikanth S. Narayanan. 2739-2742 [doi]

Aggregating distributed STT, MT, and information extraction engines: the GALE interoperability-demo systemJohn F. Pitrelli, Burn L. Lewis, Edward A. Epstein, Martin Franz, Daniel Kiecza, Jerome L. Quinn, Ganesh N. Ramaswamy, Amit Srivastava, Paola Virga. 2743-2746 [doi]

An interval type-2 fuzzy logic system to translate between emotion-related vocabulariesAbe Kazemzadeh, Sungbok Lee, Shrikanth S. Narayanan. 2747-2750 [doi]

Applying pitch-dependent difference detection and modification to emotional speaker recognitionTing Huang, Yingchun Yang. 2751-2754 [doi]

Automatic recognition of anger in spontaneous speechDaniel Neiberg, Kjell Elenius. 2755-2758 [doi]

An estimation technique of style expressiveness for emotional speech using model adaptation based on multiple-regression HSMMTakashi Nose, Yoichi Kato, Makoto Tachibana, Takao Kobayashi. 2759-2762 [doi]

A vowel based approach for acted emotion recognitionFabien Ringeval, Mohamed Chetouani. 2763-2766 [doi]

A composite framework for affective sensingGordon McIntyre, Roland Göcke. 2767-2770 [doi]

Towards automatic emotional state categorization from speech signalsArslan Shaukat, Ke Chen. 2771-2774 [doi]

Speaker-independent emotion recognition based on feature vector classificationJeong-Sik Park, Ji-Hwan Kim, Sang Min Yoon, Yung-Hwan Oh. 2775-2778 [doi]

Estimation of children s reading ability by fusion of automatic pronunciation verification and fluency detectionMatthew Black, Joseph Tepperman, Sungbok Lee, Shrikanth S. Narayanan. 2779-2782 [doi]

Pronunciation verification of English letter-sounds in preliterate childrenMatthew Black, Joseph Tepperman, Abe Kazemzadeh, Sungbok Lee, Shrikanth S. Narayanan. 2783-2786 [doi]

Improving mispronunciation detection and diagnosis of learners speech with context-sensitive phonological rules based on language transferAlissa M. Harrison, Wing Yiu Lau, Helen M. Meng, Lan Wang. 2787-2790 [doi]

DISCO: development and integration of speech technology into courseware for language learningCatia Cucchiarini, Joost van Doremalen, Helmer Strik. 2791-2794 [doi]

Discriminative model combination and language model selection in a reading tutor for childrenAbdurrahman Samir, Jacques Duchateau, Hugo Van Hamme. 2795-2798 [doi]

Usability of ASR-based reading training for dyslexicsJakob Schou Pedersen, Lars Bo Larsen, Børge Lindberg. 2799-2802 [doi]

A browsing system for classroom lecture speechShingo Togashi, Seiichi Nakagawa. 2803-2806 [doi]

Automatic pronunciation evaluation of language learners utterances generated through shadowingDean Luo, Naoya Shimomura, Nobuaki Minematsu, Yutaka Yamauchi, Keikichi Hirose. 2807-2810 [doi]

Application and evaluation of speech technologies in language learning: experiments with the Saybot playerSylvain Chevalier, Zhenhai Cao. 2811-2814 [doi]

Forward optimal modeling of acoustic confusions in Mandarin CALL systemFengpei Ge, Fuping Pan, Changliang Liu, Bin Dong, YongHong Yan. 2815-2818 [doi]

Recognition of English utterances with grammatical and lexical mistakes for dialogue-based CALL systemAkinori Ito, Ryohei Tsutsui, Shozo Makino, Motoyuki Suzuki. 2819-2822 [doi]

An analysis of vocal tract shaping in English sibilant fricatives using real-time magnetic resonance imagingErik Bresch, Daylen Riggs, Louis M. Goldstein, Dani Byrd, Sungbok Lee, Shrikanth S. Narayanan. 2823-2826 [doi]

Science workshop with sliding vocal-tract modelTakayuki Arai. 2827-2830 [doi]

Segmentation cues in lexical identification and in lexical acquisition: same or different?Odile Bagou, Ulrich H. Frauenfelder. 2831-2834 [doi]

Phonological representations in poor readersCecile T. L. Kuijpers, Louis ten Bosch. 2835-2838 [doi]

To what extent does tagged-MRI technique allow to infer tongue muscles activation pattern? a modelling studyStéphanie Buchaillard, Pascal Perrier, Yohan Payan. 2839-2842 [doi]

Feature adaptation of hearing-impaired lip shapes: the vowel case in the cued speech contextNoureddine Aboutabit, Denis Beautemps, Olivier Mathieu, Laurent Besacier. 2843-2846 [doi]

Automatic detection of the context of acoustic landmark deletionNanette Veilleux, Stefanie Shattuck-Hufnagel. 2847-2850 [doi]

Aspects of pharyngealized phonemes in Arabic using articulographySlim Ouni. 2851 [doi]

The effect of spectral tilt on infants discrimination of fricativesElizabeth Beach, Christine Kitamura, Harvey Dillon, Teresa Ching, Denis Burnham. 2852 [doi]

look at the shark : evaluation of student produced standardized sentences of infant- and foreigner-directed speechMonja A. Knoll, Lisa Scharrer. 2853-2856 [doi]

Vocal tract inversion by cepstral analysis-by-synthesis using chain matricesSankaran Panchapagesan, Abeer Alwan. 2857-2860 [doi]

DC-constrained linear prediction for glottal inverse filteringPaavo Alku, Carlo Magi, Tomas Bäckström. 2861-2864 [doi]

Voicing influences the saliency of place of articulation in audio-visual speech perception in babbleMagnus Alm, Dawn M. Behne. 2865-2868 [doi]

Correspondence of perception and production boundaries between single and geminate stops in JapaneseShigeaki Amano, Yukari Hirata. 2869-2872 [doi]

Inhibitory processes of Chinese spoken word recognitionMichael C. W. Yip. 2873-2876 [doi]

Using prosody for the improvement of ASR - sentence modality recognitionKlára Vicsi, György Szaszák. 2877-2880 [doi]

External Links

Cite Key

Statistics

PDF

Researchr

INTERSPEECH 2008, 9th Annual Conference of the International Speech Communication Association, Brisbane, Australia, September 22-26, 2008

Abstract

Table of Contents