The 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November - 4th December 1998. ISCA 1998 Graeme M. Clark:
Cochlear implants in the second and third millennia.
The use of linguistic hierarchies in speech understanding.
Paul C. Bagshaw:
Unsupervised training of phone duration and energy models for text-to-speech synthesis.
Jerome R. Bellegarda, Kim E. A. Silverman:
Improved duration modeling of English phonemes using a root sinusoidal transformation.
Chilin Shih, Wentao Gu, Jan P. H. van Santen:
Efficient adaptation of TTS duration model to new speakers.
Takayoshi Yoshimura, Keiichi Tokuda, Takashi Masuko, Takao Kobayashi, Tadashi Kitamura:
Duration modeling for HMM-based speech synthesis.
Cameron S. Fordyce, Mari Ostendorf:
Prosody prediction for speech synthesis using transformational rule-based learning.
Susan Fitt, Stephen Isard:
Representing the environments for phonological processes in an accent-independent lexicon for synthesis of English.
Daniel Faulkner, Charles Bryant:
Efficient lexical retrieval for English text-to-speech synthesis.
Robert E. Donovan, Ellen Eide:
The IBM trainable speech synthesis system.
Sarah Hawkins, Jill House, Mark Huckvale, John Local, Richard Ogden:
Prosynth: an integrated prosodic approach to device-independent, natural-sounding speech synthesis.
Jialu Zhang, Shiwei Dong, Ge Yu:
Total quality evaluation of speech synthesis systems.
Gerit P. Sonntag, Thomas Portele:
Comparative evaluation of synthetic prosody with the PURR method.
Richard Sproat, Andrew J. Hunt, Mari Ostendorf, Paul Taylor, Alan W. Black, Kevin A. Lenzo, Mike Edgington:
SABLE: a standard for TTS markup.
H. Timothy Bunnell, Steven R. Hoskins, Debra Yarrington:
Prosodic vs. segmental contributions to naturalness in a diphone synthesizer.
A mixed-excitation frequency domain model for time-scale pitch-scale modification of speech.
Masami Akamine, Takehiko Kagoshima:
Analytic generation of synthesis units by closed loop training for totally speaker driven text to speech system (TOS drive TTS).
Martti Vainio, Toomas Altosaar:
Modeling the microprosody of pitch and loudness for speech synthesis with neural networks.
David T. Chappell, John H. L. Hansen:
Spectral smoothing for concatenative speech synthesis.
Aimin Chen, Saeed Vaseghi, Charles Ho:
MIMIC : a voice-adaptive phonetic-tree speech synthesiser.
Je Hun Jeon, Sunhwa Cha, Minhwa Chung, Jun Park, Kyuwoong Hwang:
Automatic generation of Korean pronunciation variants by multistage applications of phonological rules.
Stephen Cox, Richard Brady, Peter Jackson:
Techniques for accurate automatic annotation of speech waveforms.
Andrew Cronk, Michael W. Macon:
Optimized stopping criteria for tree-based unit selection in concatenative synthesis.
Stéphanie de Tournemire:
Automatic transcription of intonation using an identified prosodic alphabet.
Ignasi Esquerra, Albert Febrer, Climent Nadeu:
Frequency analysis of phonetic units for concatenative synthesis in catalan.
Alex Chengyu Fang, Jill House, Mark Huckvale:
Investigating the syntactic characteristics of English tone units.
Antonio Bonafonte, Ignasi Esquerra, Albert Febrer, José A. R. Fonollosa, Francesc Vallverdú:
The UPC text-to-speech system for Spanish and catalan.
Attila Ferencz, István Nagy, Tunde-Csilla Kovács, Maria Ferencz, Teodora Ratiu:
The new version of the ROMVOX text-to-speech synthesis system based on a hybrid time domain-LPC synthesis technique.
Takehiko Kagoshima, Masahiro Morita, Shigenobu Seto, Masami Akamine:
An F0 contour control model for totally speaker driven text to speech system.
Keikichi Hirose, Hiromichi Kawanami:
On the relationship of speech rates with prosodic units in dialogue speech.
Esther Klabbers, Raymond N. J. Veldhuis:
On the reduction of concatenation artefacts in diphone synthesis.
Chih-Chung Kuo, Kun-Yuan Ma:
Error analysis and confidence measure of Chinese word segmentation.
Jungchul Lee, Donggyu Kang, Sanghoon Kim, Koengmo Sung:
Energy contour generation for a sentence using a neural network learning method.
Yong-Ju Lee, Sook-Hyang Lee, Jong-Jin Kim, Hyun-Ju Ko, Young-Il Kim, Sanghun Kim, Jung-Cheol Lee:
A computational algorithm for F0 contour generation in Korean developed with prosodically labeled databases using k-toBI system.
Kevin A. Lenzo, Christopher Hogan, Jeffrey Allen:
Rapid-deployment text-to-speech in the DIPLOMAT system.
Robert H. Mannell:
Formant diphone parameter extraction utilising a labelled single-speaker database.
Osamu Mizuno, Shin'ya Nakajima:
A new synthetic speech/sound control language.
Ryo Mochizuki, Yasuhiko Arai, Takashi Honda:
A study on the natural-sounding Japanese phonetic word synthesis by using the VCV-balanced word database that consists of the words uttered forcibly in two types of pitch accent.
Vincent Pagel, Kevin A. Lenzo, Alan W. Black:
Letter to sound rules for accented lexicon compression.
Ze'ev Roth, Judith Rosenhouse:
A name announcement algorithm with memory size and computational power constraints.
Frédérique Sannier, Rabia Belrhali, Véronique Aubergé:
How a French TTS system can describe loanwords.
Tomaz Sef, Ales Dobnikar, Matjaz Gams:
Improvements in slovene text-to-speech synthesis.
Shigenobu Seto, Masahiro Morita, Takehiko Kagoshima, Masami Akamine:
Automatic rule generation for linguistic features analysis using inductive learning technique: linguistic features analysis in TOS drive TTS system.
Yoshinori Shiga, Hiroshi Matsuura, Tsuneo Nitta:
Segmental duration control based on an articulatory model.
Text analysis for the bell labs French text-to-speech system.
Jennifer J. Venditti, Jan P. H. van Santen:
Modeling vowel duration for Japanese text-to-speech synthesis.
Ren-Hua Wang, Qingfeng Liu, Yongsheng Teng, Deyu Xia:
Towards a Chinese text-to-speech system with higher naturalness.
Andrew P. Breen, Peter Jackson:
A phonologically motivated method of selecting non-uniform units.
Steve Pearson, Nick Kibre, Nancy Niedzielski:
A synthesis method based on concatenation of demisyllables and a residual excited vocal tract model.
Ann K. Syrdal, Alistair Conkie, Yannis Stylianou:
Exploration of acoustic correlates in speaker selection for concatenative synthesis.
Johan Wouters, Michael W. Macon:
A perceptual evaluation of distance measures for concatenative speech synthesis.
Mike Plumpe, Alex Acero, Hsiao-Wuen Hon, Xuedong Huang:
HMM-based smoothing for concatenative speech synthesis.
Martin Holzapfel, Nick Campbell:
A nonlinear unit selection strategy for concatenative speech synthesis based on syllable level features.
Robert Eklund, Anders Lindström:
How to handle "foreign" sounds in Swedish text-to-speech conversion: approaching the 'xenophone' problem.
Multi-lingual concatenative speech synthesis.
On the use of F0 features in automatic segmentation for speech synthesis.
Atsuhiro Sakurai, Takashi Natsume, Keikichi Hirose:
A linguistic and prosodic database for data-driven Japanese TTS synthesis.
Alexander Kain, Michael W. Macon:
Text-to-speech voice adaptation from sparse training data.
Describing intonation with a parametric model.
Joakim Gustafson, Patrik Elmberg, Rolf Carlson, Arne Jönsson:
An educational dialogue system with a user controllable dialogue manager.
Klaus Failenschmid, J. H. Simon Thornton:
End-user driven dialogue system design: the reward experience.
Yi-Chung Lin, Tung-Hui Chiang, Huei-Ming Wang, Chung-Ming Peng, Chao-Huang Chang:
The design of a multi-domain Mandarin Chinese spoken dialogue system.
Kallirroi Georgila, Anastasios Tsopanoglou, Nikos Fakotakis, George Kokkinakis:
An integrated dialogue system for the automation of call centre services.
An event driven model for dialogue systems.
Cosmin Popovici, Paolo Baggia, Pietro Laface, Loreta Moisa:
Automatic classification of dialogue contexts for dialogue predictions.
Ganesh N. Ramaswamy, Jan Kleindienst:
Automatic identification of command boundaries in a conversational natural language user interface.
Massimo Poesio, Andrei Mikheev:
The predictive power of game structure in dialogue act recognition: experimental results using maximum entropy estimation.
Paul C. Constantinides, Scott Hansma, Chris Tchou, Alexander I. Rudnicky:
A schema based approach to dialog control.
Expanding a time-sensitive conversational architecture for turn-taking to handle content-driven interruption.
Marc Swerts, Hanae Koiso, Atsushi Shimojima, Yasuhiro Katagiri:
On different functions of repetitive utterances.
Hiroaki Noguchi, Yasuharu Den:
Prosody-based detection of the context of backchannel responses.
Lena Strömbäck, Arne Jönsson:
Robust interpretation for spoken dialogue systems.
Yohei Okato, Keiji Kato, Mikio Yamamoto, Shuichi Itahashi:
System-user interaction and response strategy in spoken dialogue system.
Noriko Suzuki, Kazuo Ishii, Michio Okada:
Organizing self-motivated dialogue with autonomous creatures.
Gerhard Hanrieder, Paul Heisterkamp, Thomas Brey:
Fly with the EAGLES: evaluation of the "ACCeSS" spoken language dialogue system.
Maria Aretoulaki, Stefan Harbeck, Florian Gallwitz, Elmar Nöth, Heinrich Niemann, Jozef Ivanecký, Ivo Ipsic, Nikola Pavesic, Václav Matousek:
SQEL: a multilingual and multifunctional dialogue system.
Stefan Kaspar, Achim G. Hoffmann:
Semi-automated incremental prototyping of spoken dialog systems.
Peter A. Heeman, Michael Johnston, Justin Denney, Edward C. Kaiser:
Beyond structured dialogues: factoring out grounding.
Masahiro Araki, Shuji Doshita:
A robust dialogue model for spoken dialogue processing.
Tom Brøndsted, Bo Nygaard Bai, Jesper Østergaard Olsen:
The REWARD service creation environment. an overview.
Matthew Bull, Matthew P. Aylett:
An analysis of the timing of turn-taking in a corpus of goal-oriented dialogue.
Sarah Davies, Massimo Poesio:
The provision of corrective feedback in a spoken dialogue CALL system.
Laurence Devillers, Hélène Bonneau-Maynard:
Evaluation of dialog strategies for a tourist information retrieval system.
Sadaoki Furui, Koh'ichiro Yamaguchi:
Designing a multimodal dialogue system for information retrieval.
Dinghua Guan, Min Chu, Quan Zhang, Jian Liu, Xiangdong Zhang:
The research project of man-computer dialogue system in Chinese.
Kate S. Hone, David Golightly:
Interfaces for speech recognition systems: the impact of vocabulary constraints and syntax on performance.
Tatsuya Iwase, Nigel Ward:
Pacing spoken directions to suit the listener.
Annika Flycht-Eriksson, Arne Jönsson:
A spoken dialogue system utilizing spatial information.
Candace A. Kamm, Diane J. Litman, Marilyn A. Walker:
From novice to expert: the effect of tutorials on user expertise with spoken dialogue systems.
Emergent computational dialogue management architecture for task-oriented spoken dialogue systems.
Tadahiko Kumamoto, Akira Ito:
An analysis of dialogues with our dialogue system through a WWW page.
Michael F. McTear:
Modelling spoken dialogues with state transition diagrams: experiences with the CSLU toolkit.
Michio Okada, Noriko Suzuki, Jacques M. B. Terken:
Situated dialogue coordination for spoken dialogue systems.
Xavier Pouteau, Luis Arévalo:
Robust spoken dialogue systems for consumer products: a concrete application.
Daniel Willett, Arno Romer, Jörg Rottland, Gerhard Rigoll:
A German dialogue system for scheduling dates and meetings by naturally spoken continuous speech.
Chung-Hsien Wu, Gwo-Lang Yan, Chien-Liang Lin:
Spoken dialogue system using corpus-based hidden Markov model.
Peter J. Wyard, Gavin E. Churcher:
A realistic wizard of oz simulation of a multimodal spoken language system.
Yen-Ju Yang, Lin-Shan Lee:
A syllable-based Chinese spoken dialogue system for telephone directory services primarily trained with a corpus.
Hiroyuki Yano, Akira Ito:
How disagreement expressions are used in cooperative tasks.
Tones of a tridialectal: acoustic and perceptual data on ten linguistic tonetic contrasts between lao, nyo and standard Thai.
Napier Guy Ian Thompson:
Tone sandhi between complex tones in a seven-tone southern Thai dialect.
Alexander Robertson Coupe:
The acoustic and perceptual features of tone in the tibeto-burman language ao naga.
The differential status of semivowels in the acoustic phonetic realisation of tone.
Kai Alter, Karsten Steinhauer, Angela D. Friederici:
De-accentuation: linguistic environments and prosodic realizations.
N. Amir, S. Ron:
Towards an automatic classification of emotions in speech.
Marc Schröder, Véronique Aubergé, Marie-Agnès Cathiard:
Can we hear smile?
Matthew P. Aylett, Matthew Bull:
The automatic marking of prominence in spontaneous speech using duration and part of speech information.
JongDeuk Kim, SeongJoon Baek, Myung-Jin Bae:
On a pitch alteration technique in excited cepstral spectrum for high quality TTS.
Jan Buckow, Anton Batliner, Richard Huber, Elmar Nöth, Volker Warnke, Heinrich Niemann:
Dovetailing of acoustics and prosody in spontaneous speech recognition.
Janet E. Cahn:
A computational memory and processing model for prosody.
Convergence of fundamental frequencies in conversation: if it happens, does it matter?
Hiroya Fujisaki, Sumio Ohno, Takashi Yagi, Takeshi Ono:
Analysis and interpretation of fundamental frequency contours of british English in terms of a command-response model.
Frode Holm, Kazue Hata:
Common patterns in word level prosody.
Yasuo Horiuchi, Akira Ichikawa:
Prosodic structure in Japanese spontaneous speech.
An acoustic-phonetic description of word tone in kagoshima Japanese.
Koji Iwano, Keikichi Hirose:
Representing prosodic words using statistical models of moraic transition of fundamental frequency contours of Japanese.
Tae-Yeoub Jang, Minsuck Song, Kiyeong Lee:
Disambiguation of Korean utterances using automatic intonation recognition.
Oliver Jokisch, Diane Hirschfeld, Matthias Eichner, Rüdiger Hoffmann:
Multi-level rhythm control for speech synthesis using hybrid data driven and rule-based approaches.
EGG model of ditoneme in Mandarin.
Geetha Krishnan, Wayne H. Ward:
Temporal organization of speech for normal and fast rates.
A syllable-based generalization of Japanese accentuation.
Non-adjacent segmental effects in tonal realization of accentual phrase in seoul Korean.
Eduardo López, Javier Caminero, Ismael Cortázar, Luis A. Hernández Gómez:
Improvement on connected numbers recognition using prosodic information.
Kazuaki Maeda, Jennifer J. Venditti:
Phonetic investigation of boundary pitch movements in Japanese.
Phonetic and phonological characteristics of paralinguistic information in spoken Japanese.
ToBI accent type recognition.
Hansjörg Mixdorff, Hiroya Fujisaki:
The influence of syllable structure on the timing of intonational events in German.
Osamu Mizuno, Shin'ya Nakajima:
New prosodic control rules for expressive synthetic speech.
Mitsuru Nakai, Hiroshi Shimodaira:
The use of F0 reliability function for prosodic command analysis on F0 contour generation model.
Sumio Ohno, Hiroya Fujisaki, Hideyuki Taguchi:
Analysis of effects of lexical accent, syntax, and global speech rate upon the local speech rate.
Sumio Ohno, Hiroya Fujisaki, Yoshikazu Hara:
On the effects of speech rate upon parameters of the command-response model for the fundamental frequency contours of speech.
Thomas Portele, Barbara Heuft:
The maximum-based description of F0 contours and its application to English.
Perceived prominence and acoustic parameters in american English.
Erhard Rank, Hannes Pirker:
Generating emotional speech with a concatenative synthesizer.
Albert Rilliard, Véronique Aubergé:
A perceptive measure of pure prosody linguistic functions with reiterant sentences.