


Остановите войну!
for scientists:
ICSLP 1996: Philadelphia, PA, USA
- The 4th International Conference on Spoken Language Processing, Philadelphia, PA, USA, October 3-6, 1996. ISCA 1996
Plenary Lectures
- Anne Cutler:
The comparative study of spoken-language processing. - James L. Flanagan:
Natural communication with machines - progress and challenge.
Large Vocabulary
- Zhishun Li, Michel Héon, Douglas D. O'Shaughnessy:
New developments in the INRS continuous speech recognition system. - Lori Lamel, Gilles Adda:
On designing pronunciation lexicons for large vocabulary, continuous speech recognition. - Pablo Fetter, Frédéric Dandurand, Peter Regel-Brietzmann:
Word graph rescoring using confidence measures. - Xavier L. Aubert, Peter Beyerlein, Meinhard Ullrich:
A bottom-up approach for handling unseen triphones in large vocabulary continuous speech recognition. - V. Valtchev, Philip C. Woodland, Steve J. Young:
Discriminative optimisation of large vocabulary recognition systems. - Tatsuo Matsuoka, Katsutoshi Ohtsuki, Takeshi Mori, Sadaoki Furui, Katsuhiko Shirai:
Japanese large-vocabulary continuous-speech recognition using a business-newspaper corpus. - David M. Carter, Jaan Kaja, Leonardo Neumeyer, Manny Rayner, Fuliang Weng, Mats Wirén:
Handling compound nouns in a Swedish speech-understanding system. - Javier Macías Guarasa, Ascensión Gallardo-Antolín, Javier Ferreiros, José Manuel Pardo, Luis Villarrubia Grande:
Initial evaluation of a preselection module for a flexible large vocabulary speech recognition system in.
Multimodal ASR (Face and Lips)
- Mamoun Alissali, Paul Deléglise, Alexandrina Rogozan:
Asynchronous integration of visual information in an automatic speech recognition system. - Iain A. Matthews, J. Andrew Bangham, Stephen J. Cox:
Audiovisual speech recognition using multiscale nonlinear image decomposition. - Qin Su, Peter L. Silsbee:
Robust audiovisual integration using semicontinuous hidden Markov models. - Richard P. Schumeyer, Kenneth E. Barner:
The effect of visual information on word initial consonant perception of dysarthric speech. - Devi Chandramohan, Peter L. Silsbee:
A multiple deformable template approach for visual speech recognition. - Piero Cosi, Emanuela Magno Caldognetto, Franco Ferrero, M. Dugatto, Kyriaki Vagges:
Speaker independent bimodal phonetic recognition experiments. - Juergen Luettin, Neil A. Thacker, Steve W. Beet:
Speechreading using shape and intensity information. - Juergen Luettin, Neil A. Thacker, Steve W. Beet:
Speaker identification by lipreading.
Perception of Words
- David W. Gow Jr., Janis Melvold, Sharon Manuel:
How word onsets drive lexical access and segmentation: evidence from acoustics, phonology and processing. - David van Kuijk, Peter Wittenburg, Ton Dijkstra:
RAW: a real-speech model for human word recognition. - Mehdi Meftah, Sami Boudelaa:
How facilitatory can lexical information be during word recognition? evidence from moroccan arabic. - Alette P. Haveman:
Effects of frequency on the auditory perception of open- versus closed-class words. - Michael S. Vitevitch, Paul A. Luce, Jan Charles-Luce, David Kemmerer:
Phonotactic and metrical influences on adult ratings of spoken nonsense words. - Edward T. Auer, Lynne E. Bernstein:
Lipreading supplemented by voice fundamental frequency: to what extent does the addition of voicing increase lexical uniqueness for the lipreader? - Saskia te Riele, Sieb G. Nooteboom, Hugo Quené:
Strategies used in rhyme-monitoring. - Wilma van Donselaar, Cecile T. L. Kuijpers, Anne Cutler:
How do dutch listeners process words with epenthetic schwa?
Phonetics, Transcription, and Analysis
- Patrick Juola, Philip Zimmermann:
Whole-word phonetic distances and the PGPfone alphabet. - Shuping Ran, J. Bruce Millar, Phil Rose:
Automatic vowel quality description using a variable mapping to an eight cardinal vowel reference set. - Andreas Kipp, Maria-Barbara Wesenick, Florian Schiel:
Automatic detection and segmentation of pronunciation variants in German speech corpora. - Stephanie Seneff, Raymond Lau, Helen M. Meng:
ANGIE: a new framework for speech analysis based on morpho-phonological modelling. - Byunggon Yang:
Perceptual contrast in the Korean and English vowel system normalized. - Yong-Ju Lee, Sook-Hyang Lee:
On phonetic characteristics of pause in the Korean read speech. - Sami Boudelaa, Mehdi Meftah:
Cross-language effects of lexical stress in word recognition: the case of Arabic English bilinguals. - Maria-Barbara Wesenick:
Automatic generation of German pronunciation variants. - Maria-Barbara Wesenick, Andreas Kipp:
Estimating the quality of phonetic transcriptions and segmentations of speech signals. - Bojan Petek, Rastislav Sustarsic, Smiljana Komar:
An acoustic analysis of contemporary vowels of the standard slovenian language. - Sandrine Robbe, Anne Bonneau, Sylvie Coste, Yves Laprie:
Using decision trees to construct optimal acoustic cues. - Donna Erickson, Osamu Fujimura:
Maximum jaw displacement in contrastive emphasis. - Rebecca Herman, Mary E. Beckman, Kiyoshi Honda:
Subglottal pressure and final lowering in English. - Cecile T. L. Kuijpers, Wilma van Donselaar, Anne Cutler:
Phonological variation: epenthesis and deletion of schwa in Dutch.
Spoken Language Processing for Special Populations
- James J. Mahshie:
Feedback considerations for speech training systems. - Anne-Marie Öster:
Clinical applications of computer-based speech training for children with hearing impairment. - Valérie Hazan, Andrew Simpson:
Enhancing information-rich regions of natural VCV and sentence materials presented in noise. - Valérie Hazan, Alan Adlard:
Speech perceptual abilities of children with specific reading difficulty (dyslexia). - Larry D. Paarmann, Michael K. Wynne:
Bimodal perception of spectrum compressed speech. - Dragana Barac-Cikoja, Sally Revoile:
Effect of sentential context on syllabic stress perception by hearing-impaired listeners. - Martin J. Russell, Catherine Brown, Adrian Skilling, Robert W. Series, Julie L. Wallace, Bill Bohnam, Paul Barker:
Applications of automatic speech recognition to speech and language development in young children. - D. R. Campbell:
Sub-band adaptive speech enhancement for hearing aids. - Thomas Portele, Jürgen Krämer:
Adapting a TTS system to a reading machine for the blind.
Dialogue Special Sessions
- Katsuhiko Shirai:
Modeling of spoken dialogue with and without visual information. - Stephanie Seneff, David Goddeau, Christine Pao, Joseph Polifroni:
Multimodal discourse modelling in a multi-user multi-domain environment. - Kenji Kita, Yoshikazu Fukui, Masaaki Nagata, Tsuyoshi Morimoto:
Automatic acquisition of probabilistic dialogue models. - Paul Heisterkamp, Scott McGlashan:
Units of dialogue management: an example. - Sharon L. Oviatt, Robert VanGent:
Error resolution during multimodal human-computer interaction. - Ramesh R. Sarukkai, Dana H. Ballard:
Improved spontaneous dialogue recognition using dialogue and utterance triggers by adaptive probability boosting. - Kai Hübener, Uwe Jost, Henrik Heine:
Speech recognition for spontaneously spoken German dialogues. - Paul Taylor, Hiroshi Shimodaira, Stephen Isard, Simon King, Jacqueline C. Kowtko:
Using prosodic information to constrain language models for spoken dialogue. - Peter A. Heeman, Kyung-ho Loken-Kim, James F. Allen:
Combining the detection and correction of speech repairs. - Yuji Sagawa, Wataru Sugimoto, Noboru Ohnishi:
Generating spontaneous elliptical utterance. - Gösta Bruce, Marcus Filipsson, Johan Frid, Björn Granström, Kjell Gustafson, Merle Horne, David House, Birgitta Lastow, Paul Touati:
Developing the modelling of Swedish prosody in spontaneous dialogue. - Shimei Pan, Kathleen R. McKeown:
Spoken language generation in a multimedia system. - Keikichi Hirose, Mayumi Sakata, Hiromichi Kawanami:
Synthesizing dialogue speech of Japanese based on the quantitative analysis of prosodic features. - Shuichi Tanaka, Shu Nakazato, Keiichiro Hoashi, Katsuhiko Shirai:
Spoken dialogue interface in a dual task situation. - Yasuhisa Niimi, Yutaka Kobayashi:
A dialogue control strategy based on the reliability of speech recognition. - Alexander I. Rudnicky, Stephen Reed, Eric H. Thayer:
Speechwear: a mobile speech system. - Helen M. Meng, Senis Busayapongchai, James R. Glass, David Goddeau, I. Lee Hetherington, Edward Hurley, Christine Pao, Joseph Polifroni, Stephanie Seneff, Victor Zue:
WHEELS: a conversational system in the automobile classifieds domain. - M. David Sadek, A. Ferrieux, A. Cozannet, Philippe Bretier, Franck Panaget, J. Simonin:
Effective human-computer cooperative spoken dialogue: the AGS demonstrator. - Samir Bennacef, Laurence Devillers, Sophie Rosset, Lori Lamel:
Dialog in the RAILTEL telephone-based system. - Alon Lavie, Lori S. Levin, Yan Qu, Alex Waibel, Donna Gates, Marsal Gavaldà, Laura Mayfield, Maite Taboada:
Dialogue processing in a conversational speech translation system.
Language Modeling
- Thomas Niesler, Philip C. Woodland:
Combination of word-based and category-based language models. - Francisco J. Valverde-Albacete, José Manuel Pardo:
A multi-level lexical-semantics based language model design for guided integrated continuous speech recognition. - Florian Gallwitz, Elmar Nöth, Heinrich Niemann:
A category based approach for recognition of out-of-vocabulary words. - Kristie Seymore, Ronald Rosenfeld:
Scalable backoff language models. - Rukmini Iyer, Mari Ostendorf:
Modeling long distance dependence in language: topic mixtures vs. dynamic cache models. - Marcello Federico:
Bayesian estimation methods for n-gram language model adaptation. - Man-Hung Siu, Mari Ostendorf:
Modeling disfluencies in conversational speech. - John Miller, Fil Alleva:
Evaluation of a language model using a clustered model backoff. - Antonio Bonafonte, José B. Mariño:
Language modeling using x-grams. - Klaus Ries, Finn Dag Buø, Alex Waibel:
Class phrase models for language modelling. - Petra Geutner:
Introducing linguistic constraints into statistical language modeling. - Jianying Hu, William Turin, Michael K. Brown:
Language modeling with stochastic automata.
Feature Extraction for Speech Recognition
- Don X. Sun:
Feature dimension reduction using reduced-rank maximum likelihood estimation for hidden Markov models. - Kai Hübener:
Using multi-level segmentation coefficients to improve HMM speech recognition. - Thomas Eisele, Reinhold Haeb-Umbach, Detlev Langmann:
A comparative study of linear feature transformation techniques for automatic speech recognition. - Ben Milner:
Inclusion of temporal information into features for speech recognition. - Hubert Wassner, Gérard Chollet:
New cepstral representation using wavelet analysis and spectral transformation for robust speech recognition. - Christopher John Long, Sekharajit Datta:
Wavelet based feature extraction for phoneme recognition. - Andrzej Drygajlo:
New fast wavelet packet transform algorithms for frame synchronized speech processing. - Srinivasan Umesh, Leon Cohen, Nenad Marinovic, Douglas J. Nelson:
Frequency-warping in speech. - Daisuke Kobayashi, Shoji Kajita, Kazuya Takeda, Fumitada Itakura:
Extracting speech features from human speech-like noise. - Shoji Kajita, Kazuya Takeda, Fumitada Itakura:
Subband-crosscorrelation analysis for robust speech recognition. - Hervé Bourlard, Stéphane Dupont:
A new ASR approach based on independent processing and recombination of partial frequency bands. - Climent Nadeu, José B. Mariño, Javier Hernando, Albino Nogueiras:
Frequency and time filtering of filter-bank energies for HMM speech recognition.
Speech Production - Measurement and Modeling
- Yves Laprie, Marie-Odile Berger:
Extraction of tongue contours in x-ray images with minimal user interaction. - Didier Demolin, Thierry Metens, Alain Soquet:
Three-dimensional measurement of the vocal tract by MRI. - Philip Gleason, Betty Tuller, J. A. Scott Kelso:
Syllable affiliation of final consonant clusters undergoes a phase transition over speaking rates. - Arthur Lobo, Michael H. O'Malley:
Towards a biomechanical model of the larynx. - Yann Morlec, Gérard Bailly, Véronique Aubergé:
Generating intonation by superposing gestures. - Hideki Kawahara, Hiroko Kato, J. C. Williams:
Effects of auditory feedback on F0 trajectory generation.
Speech Coding / HMMs and NNs in ASR
- Ian S. Burnett, John J. Parry:
On the effects of accent and language on low rate speech coders. - Jeng-Shyang Pan, Fergus R. McInnes, Mervyn A. Jack:
VQ codevector index assignment using genetic algorithms for noisy channels. - Gavin C. Cawley:
An improved vector quantization algorithm for speech transmission over noisy channels. - C. Murgia, Gang Feng, Alain Le Guyader, Catherine Quinquis:
Very low delay and high quality coding of 20 hz-15 khz speech signals at 64 kbit/s. - Carlos M. Ribeiro, Isabel Trancoso:
Application of speaker modification techniques to phonetic vocoding. - Tadashi Yonezaki, Kiyohiro Shikano:
Entropy coded vector quantization with hidden Markov models. - Minoru Kohata:
An application of recurrent neural networks to low bit rate speech coding. - Kazuhito Koishida, Keiichi Tokuda, Takao Kobayashi, Satoshi Imai:
CELP coding system based on mel-generalized cepstral analysis. - Cheung-Fat Chan, Wai-Kwong Hui:
Wideband re-synthesis of narrowband CELP-coded speech using multiband excitation model. - Takuya Koizumi, Mikio Mori, Shuji Taniguchi, Mitsutoshi Maruya:
Recurrent neural networks for phoneme recognition. - M. A. Mokhtar, A. Zein-el-Abddin:
A model for the acoustic phonetic structure of arabic language using a single ergodic hidden Markov model. - Yifan Gong, Irina Illina, Jean Paul Haton:
Modelling long term variability information in mixture stochastic trajectory framework. - Thierry Moudenc, Robert Sokol, Guy Mercier:
Segmental phonetic features recognition by means of neural-fuzzy networks and integration in an n-best solutions post-processing. - Irina Illina, Yifan Gong:
Stochastic trajectory model with state-mixture for continuous speech recognition. - Hermann Hild, Alex Waibel:
Recognition of spelled names over the telephone. - Gilles Boulianne, Patrick Kenny:
Optimal tying of HMM mixture densities using decision trees. - Hwan Jin Choi, Yung-Hwan Oh:
Speech recognition using an enhanced FVQ based on a codeword dependent distribution normalization and codeword weighting by fuzzy objective function. - Mikko Kurimo, Panu Somervuo:
Using the self-organizing map to speed up the probability density estimation for speech recognition with mixture density HMMs.
Vowels
- Carrie E. Lang, John J. Ohala:
Temporal cues for vowels and universals of vowel inventories. - Ann K. Syrdal:
Acoustic variability in spontaneous conversational speech of american English talkers. - Raquel Willerman, Patricia K. Kuhl:
Cross-language speech perception: Swedish, English, and Spanish speakers' perception of front rounded vowels. - John C. L. Ingram, See-Gyoon Park:
Inter-language vowel perception and production by Korean and Japanese listeners. - Diane Kewley-Port, Reiko Akahane-Yamada, Kiyoaki Aikawa:
Intelligibility and acoustic correlates of Japanese accented English vowels. - Kiyoko Yoneyama:
Segmentation strategies for spoken language recognition: evidence from semi-bilingual Japanese speakers of English.
NNs and Stochastic Modeling
- Geunbae Lee, Jong-Hyeok Lee, Kyubong Park, Byung-Chang Kim:
Integrating connectionist, statistical and symbolic approaches for continuous spoken Korean processing. - Hynek Hermansky, Sangita Tibrewala, Misha Pavel:
Towards ASR on partially corrupted speech. - Herbert Gish, Kenney Ng:
Parametric trajectory models for speech recognition. - Kate Knill, Mark J. F. Gales, Steve J. Young:
Use of Gaussian selection in large vocabulary continuous speech recognition using HMMs. - Jesper Högberg, Kåre Sjölander:
Cross phone state clustering using lexical stress and context. - Eduardo Lleida-Solano, Richard C. Rose:
Likelihood ratio decoding and confidence measures for continuous speech recognition. - Xiaohui Ma, Yifan Gong, Yuqing Fu, Jiren Lu, Jean Paul Haton:
A study on continuous Chinese speech recognition based on stochastic trajectory models. - Yoshiaki Itoh, Jiro Kiyama, Hiroshi Kojima, Susumu Seki, Ryuichi Oka:
A proposal for a new algorithm of reference interval-free continuous DP for real-time speech or text retrieval.