The 4th International Conference on Spoken Language Processing, Philadelphia, PA, USA, October 3-6, 1996. ISCA 1996 Anne Cutler:
The comparative study of spoken-language processing.
James L. Flanagan:
Natural communication with machines - progress and challenge.
Zhishun Li, Michel Héon, Douglas D. O'Shaughnessy:
New developments in the INRS continuous speech recognition system.
Lori Lamel, Gilles Adda:
On designing pronunciation lexicons for large vocabulary, continuous speech recognition.
Pablo Fetter, Frédéric Dandurand, Peter Regel-Brietzmann:
Word graph rescoring using confidence measures.
Xavier L. Aubert, Peter Beyerlein, Meinhard Ullrich:
A bottom-up approach for handling unseen triphones in large vocabulary continuous speech recognition.
V. Valtchev, Philip C. Woodland, Steve J. Young:
Discriminative optimisation of large vocabulary recognition systems.
Tatsuo Matsuoka, Katsutoshi Ohtsuki, Takeshi Mori, Sadaoki Furui, Katsuhiko Shirai:
Japanese large-vocabulary continuous-speech recognition using a business-newspaper corpus.
David M. Carter, Jaan Kaja, Leonardo Neumeyer, Manny Rayner, Fuliang Weng, Mats Wirén:
Handling compound nouns in a Swedish speech-understanding system.
Javier Macías Guarasa, Ascensión Gallardo-Antolín, Javier Ferreiros, José Manuel Pardo, Luis Villarrubia Grande:
Initial evaluation of a preselection module for a flexible large vocabulary speech recognition system in.
Mamoun Alissali, Paul Deléglise, Alexandrina Rogozan:
Asynchronous integration of visual information in an automatic speech recognition system.
Iain A. Matthews, J. Andrew Bangham, Stephen J. Cox:
Audiovisual speech recognition using multiscale nonlinear image decomposition.
Qin Su, Peter L. Silsbee:
Robust audiovisual integration using semicontinuous hidden Markov models.
Richard P. Schumeyer, Kenneth E. Barner:
The effect of visual information on word initial consonant perception of dysarthric speech.
Devi Chandramohan, Peter L. Silsbee:
A multiple deformable template approach for visual speech recognition.
Piero Cosi, Emanuela Magno Caldognetto, Franco Ferrero, M. Dugatto, Kyriaki Vagges:
Speaker independent bimodal phonetic recognition experiments.
Juergen Luettin, Neil A. Thacker, Steve W. Beet:
Speechreading using shape and intensity information.
Juergen Luettin, Neil A. Thacker, Steve W. Beet:
Speaker identification by lipreading.
David W. Gow Jr., Janis Melvold, Sharon Manuel:
How word onsets drive lexical access and segmentation: evidence from acoustics, phonology and processing.
David van Kuijk, Peter Wittenburg, Ton Dijkstra:
RAW: a real-speech model for human word recognition.
Mehdi Meftah, Sami Boudelaa:
How facilitatory can lexical information be during word recognition? evidence from moroccan arabic.
Alette P. Haveman:
Effects of frequency on the auditory perception of open- versus closed-class words.
Michael S. Vitevitch, Paul A. Luce, Jan Charles-Luce, David Kemmerer:
Phonotactic and metrical influences on adult ratings of spoken nonsense words.
Edward T. Auer, Lynne E. Bernstein:
Lipreading supplemented by voice fundamental frequency: to what extent does the addition of voicing increase lexical uniqueness for the lipreader?
Saskia te Riele, Sieb G. Nooteboom, Hugo Quené:
Strategies used in rhyme-monitoring.
Wilma van Donselaar, Cecile T. L. Kuijpers, Anne Cutler:
How do dutch listeners process words with epenthetic schwa?
Patrick Juola, Philip Zimmermann:
Whole-word phonetic distances and the PGPfone alphabet.
Shuping Ran, J. Bruce Millar, Phil Rose:
Automatic vowel quality description using a variable mapping to an eight cardinal vowel reference set.
Andreas Kipp, Maria-Barbara Wesenick, Florian Schiel:
Automatic detection and segmentation of pronunciation variants in German speech corpora.
Stephanie Seneff, Raymond Lau, Helen M. Meng:
ANGIE: a new framework for speech analysis based on morpho-phonological modelling.
Perceptual contrast in the Korean and English vowel system normalized.
Yong-Ju Lee, Sook-Hyang Lee:
On phonetic characteristics of pause in the Korean read speech.
Sami Boudelaa, Mehdi Meftah:
Cross-language effects of lexical stress in word recognition: the case of Arabic English bilinguals.
Automatic generation of German pronunciation variants.
Maria-Barbara Wesenick, Andreas Kipp:
Estimating the quality of phonetic transcriptions and segmentations of speech signals.
Bojan Petek, Rastislav Sustarsic, Smiljana Komar:
An acoustic analysis of contemporary vowels of the standard slovenian language.
Sandrine Robbe, Anne Bonneau, Sylvie Coste, Yves Laprie:
Using decision trees to construct optimal acoustic cues.
Donna Erickson, Osamu Fujimura:
Maximum jaw displacement in contrastive emphasis.
Rebecca Herman, Mary E. Beckman, Kiyoshi Honda:
Subglottal pressure and final lowering in English.
Cecile T. L. Kuijpers, Wilma van Donselaar, Anne Cutler:
Phonological variation: epenthesis and deletion of schwa in Dutch.
James J. Mahshie:
Feedback considerations for speech training systems.
Clinical applications of computer-based speech training for children with hearing impairment.
Valérie Hazan, Andrew Simpson:
Enhancing information-rich regions of natural VCV and sentence materials presented in noise.
Valérie Hazan, Alan Adlard:
Speech perceptual abilities of children with specific reading difficulty (dyslexia).
Larry D. Paarmann, Michael K. Wynne:
Bimodal perception of spectrum compressed speech.
Dragana Barac-Cikoja, Sally Revoile:
Effect of sentential context on syllabic stress perception by hearing-impaired listeners.
Martin J. Russell, Catherine Brown, Adrian Skilling, Robert W. Series, Julie L. Wallace, Bill Bohnam, Paul Barker:
Applications of automatic speech recognition to speech and language development in young children.
D. R. Campbell:
Sub-band adaptive speech enhancement for hearing aids.
Thomas Portele, Jürgen Krämer:
Adapting a TTS system to a reading machine for the blind.
Modeling of spoken dialogue with and without visual information.
Stephanie Seneff, David Goddeau, Christine Pao, Joseph Polifroni:
Multimodal discourse modelling in a multi-user multi-domain environment.
Kenji Kita, Yoshikazu Fukui, Masaaki Nagata, Tsuyoshi Morimoto:
Automatic acquisition of probabilistic dialogue models.
Paul Heisterkamp, Scott McGlashan:
Units of dialogue management: an example.
Sharon L. Oviatt, Robert VanGent:
Error resolution during multimodal human-computer interaction.
Ramesh R. Sarukkai, Dana H. Ballard:
Improved spontaneous dialogue recognition using dialogue and utterance triggers by adaptive probability boosting.
Kai Hübener, Uwe Jost, Henrik Heine:
Speech recognition for spontaneously spoken German dialogues.
Paul Taylor, Hiroshi Shimodaira, Stephen Isard, Simon King, Jacqueline C. Kowtko:
Using prosodic information to constrain language models for spoken dialogue.
Peter A. Heeman, Kyung-ho Loken-Kim, James F. Allen:
Combining the detection and correction of speech repairs.
Yuji Sagawa, Wataru Sugimoto, Noboru Ohnishi:
Generating spontaneous elliptical utterance.
Gösta Bruce, Marcus Filipsson, Johan Frid, Björn Granström, Kjell Gustafson, Merle Horne, David House, Birgitta Lastow, Paul Touati:
Developing the modelling of Swedish prosody in spontaneous dialogue.
Shimei Pan, Kathleen R. McKeown:
Spoken language generation in a multimedia system.
Keikichi Hirose, Mayumi Sakata, Hiromichi Kawanami:
Synthesizing dialogue speech of Japanese based on the quantitative analysis of prosodic features.
Shuichi Tanaka, Shu Nakazato, Keiichiro Hoashi, Katsuhiko Shirai:
Spoken dialogue interface in a dual task situation.
Yasuhisa Niimi, Yutaka Kobayashi:
A dialogue control strategy based on the reliability of speech recognition.
Alexander I. Rudnicky, Stephen Reed, Eric H. Thayer:
Speechwear: a mobile speech system.
Helen M. Meng, Senis Busayapongchai, James R. Glass, David Goddeau, I. Lee Hetherington, Edward Hurley, Christine Pao, Joseph Polifroni, Stephanie Seneff, Victor Zue:
WHEELS: a conversational system in the automobile classifieds domain.
M. David Sadek, A. Ferrieux, A. Cozannet, Philippe Bretier, Franck Panaget, J. Simonin:
Effective human-computer cooperative spoken dialogue: the AGS demonstrator.
Samir Bennacef, Laurence Devillers, Sophie Rosset, Lori Lamel:
Dialog in the RAILTEL telephone-based system.
Alon Lavie, Lori S. Levin, Yan Qu, Alex Waibel, Donna Gates, Marsal Gavaldà, Laura Mayfield, Maite Taboada:
Dialogue processing in a conversational speech translation system.
Thomas Niesler, Philip C. Woodland:
Combination of word-based and category-based language models.
Francisco J. Valverde-Albacete, José Manuel Pardo:
A multi-level lexical-semantics based language model design for guided integrated continuous speech recognition.
Florian Gallwitz, Elmar Nöth, Heinrich Niemann:
A category based approach for recognition of out-of-vocabulary words.
Kristie Seymore, Ronald Rosenfeld:
Scalable backoff language models.
Rukmini Iyer, Mari Ostendorf:
Modeling long distance dependence in language: topic mixtures vs. dynamic cache models.
Bayesian estimation methods for n-gram language model adaptation.
Man-Hung Siu, Mari Ostendorf:
Modeling disfluencies in conversational speech.
John Miller, Fil Alleva:
Evaluation of a language model using a clustered model backoff.
Antonio Bonafonte, José B. Mariño:
Language modeling using x-grams.
Klaus Ries, Finn Dag Buø, Alex Waibel:
Class phrase models for language modelling.
Introducing linguistic constraints into statistical language modeling.
Jianying Hu, William Turin, Michael K. Brown:
Language modeling with stochastic automata.
Don X. Sun:
Feature dimension reduction using reduced-rank maximum likelihood estimation for hidden Markov models.
Using multi-level segmentation coefficients to improve HMM speech recognition.
Thomas Eisele, Reinhold Haeb-Umbach, Detlev Langmann:
A comparative study of linear feature transformation techniques for automatic speech recognition.
Inclusion of temporal information into features for speech recognition.
Hubert Wassner, Gérard Chollet:
New cepstral representation using wavelet analysis and spectral transformation for robust speech recognition.
Christopher John Long, Sekharajit Datta:
Wavelet based feature extraction for phoneme recognition.
New fast wavelet packet transform algorithms for frame synchronized speech processing.
Srinivasan Umesh, Leon Cohen, Nenad Marinovic, Douglas J. Nelson:
Frequency-warping in speech.
Daisuke Kobayashi, Shoji Kajita, Kazuya Takeda, Fumitada Itakura:
Extracting speech features from human speech-like noise.
Shoji Kajita, Kazuya Takeda, Fumitada Itakura:
Subband-crosscorrelation analysis for robust speech recognition.
Hervé Bourlard, Stéphane Dupont:
A new ASR approach based on independent processing and recombination of partial frequency bands.
Climent Nadeu, José B. Mariño, Javier Hernando, Albino Nogueiras:
Frequency and time filtering of filter-bank energies for HMM speech recognition.
Yves Laprie, Marie-Odile Berger:
Extraction of tongue contours in x-ray images with minimal user interaction.
Didier Demolin, Thierry Metens, Alain Soquet:
Three-dimensional measurement of the vocal tract by MRI.
Philip Gleason, Betty Tuller, J. A. Scott Kelso:
Syllable affiliation of final consonant clusters undergoes a phase transition over speaking rates.
Arthur Lobo, Michael H. O'Malley:
Towards a biomechanical model of the larynx.
Yann Morlec, Gérard Bailly, Véronique Aubergé:
Generating intonation by superposing gestures.
Hideki Kawahara, Hiroko Kato, J. C. Williams:
Effects of auditory feedback on F0 trajectory generation.
Ian S. Burnett, John J. Parry:
On the effects of accent and language on low rate speech coders.
Jeng-Shyang Pan, Fergus R. McInnes, Mervyn A. Jack:
VQ codevector index assignment using genetic algorithms for noisy channels.
Gavin C. Cawley:
An improved vector quantization algorithm for speech transmission over noisy channels.
C. Murgia, Gang Feng, Alain Le Guyader, Catherine Quinquis:
Very low delay and high quality coding of 20 hz-15 khz speech signals at 64 kbit/s.
Carlos M. Ribeiro, Isabel Trancoso:
Application of speaker modification techniques to phonetic vocoding.
Tadashi Yonezaki, Kiyohiro Shikano:
Entropy coded vector quantization with hidden Markov models.
An application of recurrent neural networks to low bit rate speech coding.
Kazuhito Koishida, Keiichi Tokuda, Takao Kobayashi, Satoshi Imai:
CELP coding system based on mel-generalized cepstral analysis.
Cheung-Fat Chan, Wai-Kwong Hui:
Wideband re-synthesis of narrowband CELP-coded speech using multiband excitation model.
Takuya Koizumi, Mikio Mori, Shuji Taniguchi, Mitsutoshi Maruya:
Recurrent neural networks for phoneme recognition.
M. A. Mokhtar, A. Zein-el-Abddin:
A model for the acoustic phonetic structure of arabic language using a single ergodic hidden Markov model.
Yifan Gong, Irina Illina, Jean Paul Haton:
Modelling long term variability information in mixture stochastic trajectory framework.
Thierry Moudenc, Robert Sokol, Guy Mercier:
Segmental phonetic features recognition by means of neural-fuzzy networks and integration in an n-best solutions post-processing.
Irina Illina, Yifan Gong:
Stochastic trajectory model with state-mixture for continuous speech recognition.
Hermann Hild, Alex Waibel:
Recognition of spelled names over the telephone.
Gilles Boulianne, Patrick Kenny:
Optimal tying of HMM mixture densities using decision trees.
Hwan Jin Choi, Yung-Hwan Oh:
Speech recognition using an enhanced FVQ based on a codeword dependent distribution normalization and codeword weighting by fuzzy objective function.
Mikko Kurimo, Panu Somervuo:
Using the self-organizing map to speed up the probability density estimation for speech recognition with mixture density HMMs.
Carrie E. Lang, John J. Ohala:
Temporal cues for vowels and universals of vowel inventories.
Ann K. Syrdal:
Acoustic variability in spontaneous conversational speech of american English talkers.
Raquel Willerman, Patricia K. Kuhl:
Cross-language speech perception: Swedish, English, and Spanish speakers' perception of front rounded vowels.
John C. L. Ingram, See-Gyoon Park:
Inter-language vowel perception and production by Korean and Japanese listeners.
Diane Kewley-Port, Reiko Akahane-Yamada, Kiyoaki Aikawa:
Intelligibility and acoustic correlates of Japanese accented English vowels.
Segmentation strategies for spoken language recognition: evidence from semi-bilingual Japanese speakers of English.
Geunbae Lee, Jong-Hyeok Lee, Kyubong Park, Byung-Chang Kim:
Integrating connectionist, statistical and symbolic approaches for continuous spoken Korean processing.
Hynek Hermansky, Sangita Tibrewala, Misha Pavel:
Towards ASR on partially corrupted speech.
Herbert Gish, Kenney Ng:
Parametric trajectory models for speech recognition.
Kate Knill, M. J. F. Gales, Steve J. Young:
Use of Gaussian selection in large vocabulary continuous speech recognition using HMMs.
Jesper Högberg, Kåre Sjölander:
Cross phone state clustering using lexical stress and context.
Eduardo Lleida-Solano, Richard C. Rose:
Likelihood ratio decoding and confidence measures for continuous speech recognition.
Xiaohui Ma, Yifan Gong, Yuqing Fu, Jiren Lu, Jean Paul Haton:
A study on continuous Chinese speech recognition based on stochastic trajectory models.
Yoshiaki Itoh, Jiro Kiyama, Hiroshi Kojima, Susumu Seki, Ryuichi Oka:
A proposal for a new algorithm of reference interval-free continuous DP for real-time speech or text retrieval.
Akinori Ito, Masaki Kohda:
Language modeling by string pattern n-gram for Japanese speech recognition.
Statistical language modeling using a variable context length.
Finn Tore Johansen:
A comparison of hybrid HMM architectures using global discriminative training.
Wei Wei, Etienne Barnard, Mark A. Fanty:
Improved probability estimation with neural network models.