


EUROSPEECH 1995: Madrid, Spain
- Fourth European Conference on Speech Communication and Technology, EUROSPEECH 1995, Madrid, Spain, September 18-21, 1995. ISCA 1995
Keynotes
- Kenneth N. Stevens:
Applying phonetic knowledge to lexical access. - William A. Ainsworth:
Auditory mechanisms for speech perception. - Hervé Bourlard:
Towards increasing speech recognition error rates. - Sadaoki Furui:
Flexible speech recognition.
Prosody Modelling in ASR
- Takatoshi Jitsuhiro, Tomokazu Yamada, Shigeki Sagayama:
Syllabic duration control for vocabulary-free speech recognition. - Kazuyuki Takagi, Shuichi Itahashi:
Effectiveness of pause information in the content word detection of spoken dialogues. - Kazuhiro Kondo:
Connected Japanese digit recognition with pitch accent-dependent models. - Claude Barras, Marie-José Caraty, Claude Montacié:
Temporal control and training selection for HMM-based system. - Keikichi Hirose, Xinhui Hu:
HMM-based tone recognition of Chinese trisyllables using double codebooks on fundamental frequency and waveform power.
Wideband Coding
- C. Murgia, Gang Feng, Catherine Quinquis, Alain Le Guyader:
Very low delay and high quality coding of 20 Hz - 15 kHz speech at 64 kbit/s. - Shigeaki Sasaki, Akitoshi Kataoka, Takehiro Moriya:
Wideband CELP coder at 16-kbit/s with 10-ms frame. - A. W. Black, Ian A. Atkinson, Ahmet M. Kondoz, Barry G. Evans:
High quality 14.1kb/s wideband speech coder. - Victoria Abreu-Sernández, Domingo Docampo-Amoedo:
A multipulse-deconvolution codec for wideband speech.
Hardware and Systems for Speech Processing
- E. Rohwer:
An advanced multi-DSP platform for speech technology integration in computer telephony applications. - Rafael Ciria, Rafael Sarmiento de Sotomayor, Cristina Aguila, José Parera, Juan Santos:
Voice processing architecture for computer-telephony integration. - Laura Ortiz-Balbuena, Héctor M. Pérez Meana, Alejandro Martínez-González, Luís Nino de Rivera, Mariko Nakano-Miyatake:
Fast convergent analog adaptive filter. - Manuel A. Leandro, Álvaro Villegas, José Manuel Pardo:
Efficient isolated word recognition in Spanish based on static modeling. - Jean-Luc Cochard, Olivier Oppizzi:
Reliability in a multi-agent spoken language recognition system.
Discriminative Training I, II
- Tomoko Matsui, Sadaoki Furui:
A study of speaker adaptation based on minimum classification error training. - Albino Nogueiras Rodríguez, José B. Mariño:
Maximum likelihood based discriminative training of acoustic models. - Hugues Leprieur, Patrick Haffner:
Discriminant learning with minimum memory loss for improved non-vocabulary rejection. - Cesar Martín del Alamo, F. Javier Caminero-Gil, Celinda de la Torre-Munilla, Luis A. Hernández Gómez:
Codebook weights adaptation for discriminative training of SCHMM-based speech recognition systems. - Kyungmin Na, Bumki Jeon, Dong-Il Chang, Soo-Ik Chae, Souguil Ann:
Discriminative training of hidden Markov models using overall risk criterion and reduced gradient method. - Qiang Huo, Chorkin Chan:
Discriminative training of HMM based speech recognizer with gradient projection method. - Javier Hernando Pericas, J. Ayarte, Enric Monte:
Optimization of speech parameter weighting for CDHMM word recognition. - Mazin G. Rahim, Chin-Hui Lee, Biing-Hwang Juang:
Discriminative utterance verification for connected digits recognition. - Antonio M. Peinado, Antonio J. Rubio, José C. Segura, Victoria E. Sánchez, Jesús Esteban Díaz Verdejo:
MCE estimation of VQ parameters for MVQHMM speech recognition. - Wolfgang Reichl, Günther Ruske:
Discriminative training for continuous speech recognition. - Kuldip K. Paliwal, Michiel Bacchiani, Yoshinori Sagisaka:
Minimum classification error training algorithm for feature extractor and pattern classifier in speech recognition.
Auditory Modeling in Speech Recognition
- Stephan Euler:
Integrated optimization of feature transformation for speech recognition. - Andrew C. Morris, José M. Pardo:
Phoneme transition detection and broad classification using a simple model based on the function of onset detector cells found in the cochlear nucleus. - Eric Fragnière, André van Schaik, Eric A. Vittoz:
Linear predictive coding of speech using an analogue cochlear model. - Edward Jones, Eliathamby Ambikairajah:
Pitch extraction of telephone bandwidth speech using a place-temporal approach. - Markus Bodden, Timothy R. Anderson:
A binaural selectivity model for speech recognition. - Cristina Dobrin, Petri Haavisto, Kari Laurila, Jaakko Astola:
Speech recognition experiments in a noisy environment using auditory system modelling.
Applications and Systems
- Frédéric Berthommier, Georg F. Meyer:
Source separation by a functional model of amplitude demodulation. - Vratislav Davidek, Pavel Sovka, Jiri Sika:
Real-time implementation of spectral subtraction algorithm for suppression of acoustic noise in speech. - Bert Van Coile, Hans-Wilhelm Rühl, L. Vogten, M. Thoone, S. Goß, D. Delaey, E. Moons, Jacques M. B. Terken, Jan-Roelof de Pijper, Marianne Kugler, P. Kaufholz, Regina Krüger, Steven Leys, S. Willems:
Speech synthesis for the new pan-European traffic message control system RDS-TMC. - Roberto Pacifici, G. Manca:
Echo cancelling in speech recognition systems. - Teodoro Calonge Cano, Luis Alonso, Rui Ralha, A. L. Sánchez:
Parallel implementation of an hybrid neural network used for speech recognition task. - M. Li, J. T. Proudfoot:
Hardware design of LPC coding for speech feature extraction. - Henning Bergmann, Hans-Hermann Hamer, Andreas Noll, Annedore Paeseler, Horst Tomaschewski:
Modularization in task-specific language modelling. - Carlos Avendaño, Hynek Hermansky, Eric A. Wan:
Beyond Nyquist: towards the recovery of broad-bandwidth speech from narrow-bandwidth speech.
Large Vocabulary
- David Pye, Philip C. Woodland, Steve J. Young:
Large vocabulary multilingual speech recognition using HTK. - Lori Lamel, Martine Adda-Decker, Jean-Luc Gauvain:
Issues in Large Vocabulary, Multilingual Speech Recognition. - James Barnett, Paul G. Bamberg, Martin Held, Juan Huerta, Linda Manganaro, Adam Weiss:
Comparative performance in large-vocabulary isolated-word recognition in five European languages. - Julie Brousseau, Caroline Drouin, George F. Foster, Pierre Isabelle, Roland Kuhn, Yves Normandin, Pierre Plamondon:
French speech recognition in an automatic dictation system for translators: the TransTalk project. - Christian Dugast, Xavier L. Aubert, Reinhard Kneser:
The Philips large-vocabulary recognition system for American English, French, and German. - Sung-Chien Lin, Lee-Feng Chien, Keh-Jiann Chen, Lin-Shan Lee:
A syllable-based very-large-vocabulary voice retrieval system for Chinese databases with textual attributes. - Michael Riley, Andrej Ljolje, Donald Hindle, Fernando Pereira:
The AT&T 60,000 word speech-to-text system. - Tai-Hsuan Ho, Hsin-Min Wang, Lee-Feng Chien, Keh-Jiann Chen, Lin-Shan Lee:
Fast and accurate continuous speech recognition for Chinese language with very large vocabulary. - Zuoying Wang, Jun Wu, Xi Xiao, Jin Quo:
Methods towards the very large vocabulary Chinese speech recognition. - Gary D. Cook, Anthony J. Robinson:
Utterance clustering for large vocabulary continuous speech recognition.
Speech Coding I
- Thomas Eriksson, Jan Linden, Jan Skoglund:
Vector quantization of glottal pulses. - Michele Festa, Daniele Sereno:
A speech coding algorithm based on prototypes interpolation with critical bands and phase coding. - Dionysis E. Tsoukalas, Jiannis Mouropoulos, George Kokkinakis:
Very low-bitrate speech coding using perceptually-derived spectral data. - Lorenzo Piazzo:
A new very low bit rate speech coder: the step decomposition vocoder. - Ian A. Atkinson, Ahmet M. Kondoz, Barry G. Evans:
Time envelope LP vocoder: a new coding technique at very low bit rates. - Dan Stefanoiu, Radwan Kastantin, Gang Feng:
Speech coding based on the discrete-time wavelet transform and human auditory system properties. - F. J. Ancin, M. L. Larreategui, B. L. Burrows, Rolando A. Carrasco:
Wavelets for low bit rate speech coding applications. - Elimberaza Mandridake, Rachid Atay, Mohamed Najim:
Adaptive speech vector coding with a multiresolution hierarchical codebook. - Andrei Popescu, Nicolas Moreau:
Subband analysis-by-synthesis coding. - Clifford I. Parris, Danny Wong, Francois Chambon:
A robust 2.4kb/s LP-MBE with iterative LP modelling. - M. S. Torres-Guijarro, Francisco Javier Casajús-Quirós:
Improved transient representation and quantization for sinusoidal speech coders. - Eric W. M. Yu, Cheung-Fat Chan:
Efficient multiband excitation linear predictive coding of speech at 1.6 kbps. - Bruno Wery, Stephane Deketelaere:
Voice coding in the MSBN satellite communication system. - Barry M. G. Cheetham, Xiaoqin Sun, W. T. K. Wong:
Spectral envelope estimation for low bit-rate sinusoidal speech coders.
Speech Signal Processing / Wavelets
- Israel Cohen, Shalom Raz, David Malah:
Shift-invariant adaptive local trigonometric decomposition. - Paul Micallef, Edward H. S. Chilton:
Spectral envelope of speech using wavelets. - Andrzej Drygajlo, Nicolas Thevoz:
Multiresolution speech analysis using fast time-varying orthogonal wavelet packet transform algorithms. - Maria Rangoussi, Flemming Pedersen:
Second- and third-order Wigner distributions in hierarchical recognition of speech phonemes. - Gaafar M. K. Saleh, Mahesan Niranjan, William J. Fitzgerald:
The use of maximum a posteriori parameters in linear prediction of speech.
Applications of Speech Technology
- William C. G. Ortel:
Observed long-term changes in customer calling patterns in a telephone application using automatic speech recognition. - Ayman Asadi, David M. Lubensky, L. Madhavrao, Jayant M. Naik, Vijay Raman, George Vysotsky:
Combining speech algorithms into a "natural" application of speech technology for telephone network services. - Yevgeny Ludovik, Valeriy Sibirtsev:
Intelligent answering machine-secretary. - Kyung-ho Loken-Kim, Young-Duk Park, Suguru Mizunashi, Laurel Fais, Tsuyoshi Morimoto:
Verbal-gestural behaviors in multimodal spoken language interpreting telecommunications. - Jung-Kuei Chen, Lin-Shan Lee, Frank K. Soong:
Large vocabulary, word-based Mandarin dictation system.
Visual Speech
- Bertrand Le Goff, Thierry Guiard-Marigny, Christian Benoît:
Read my lips... and my jaw! How intelligible are the components of a speaker's face? - Angela Fuster Duran:
McGurk effect in Spanish and German listeners: influences of visual cues in the perception of Spanish and German conflicting audio-visual stimuli. - Jonas Beskow:
Rule-based visual speech synthesis. - Fabio Lavagetto, Paolo Lavagetto:
A new algorithm for visual synthesis of speech. - Harouna Kabré:
Audiovisual speech recognition using the fuzzy shape filters model.
Speaker Recognition I-III
- Jialong He, Li Liu, Günther Palm:
On the use of features from prediction residual signals in speaker identification. - Kai Tat Ng, Haizhou Li, Jean Paul Haton:
Some nonparametric distance measures in speaker verification. - Michael J. Carey, Graham Tattersall, Eluned S. Parris:
Adaptive transforms for speaker recognition. - Kai Tat Ng, Jian Su, Bingzheng Xu:
Speaker recognition with discriminative speaker VQ models. - A. Federico, Andrea Paoloni:
Parametric speaker recognition over large population of telephonic voices. - Toomas Altosaar, Einar Meister:
Speaker recognition experiments in Estonian using multi-layer feed-forward neural nets. - Ivan Magrin-Chagnolleau, Jean-François Bonastre, Frédéric Bimbot:
Effect of utterance duration and phonetic content on speaker identification using second-order statistical methods. - Pavel V. Labulin, Sergey L. Koval, Andrey N. Raev:
Automatic speaker recognition using formants-based nearest-neighbour distance measure. - M. Mehdi Homayounpour, Gérard Chollet:
Discrimination of voices of twins and siblings for speaker verification. - Kay M. Berkling, Etienne Barnard:
Theoretical error prediction for a language identification system using optimal phoneme clustering. - Jesper Ø. Olsen:
Separation of speakers in audio data. - J.-L. Bonifas, I. Hernaez Rioja, B. Etxebarria Gonzalez, S. Saoudi:
Text-dependent speaker verification using dynamic time warping and vector quantization of LSF. - Haizhou Li, Jean Paul Haton, Yifan Gong:
On MMI learning of Gaussian mixture for speaker models. - Yifan Gong:
Evaluation of Bayes decision approach to automatic determination of thresholds for speaker verification. - Daniele Falavigna:
Comparison of different HMM based methods for speaker verification. - J. Sheikhzadegan, M. Tebiani, M. Lotfizad, Mahmood R. Roohani:
Speaker classification by neural network for short utterances using phoneme groups in Farsi. - Jean-Luc Le Floch, Claude Montacié, Marie-José Caraty:
Speaker recognition experiments on the NTIMIT database. - Michael Wagner, John S. Mason, J. Bruce Millar:
Speaker identification using vector quantisation with codeword-specific derivative coding. - Haizhou Li, Jean Paul Haton, Jian Su, Yifan Gong:
Speaker recognition with temporal transition models. - Tomoko Matsui, Tomohito Kanno, Sadaoki Furui:
Speaker recognition using HMM composition in noisy environments. - ChiWei Che, Qiguang Lin:
Speaker recognition using HMM with experiments on the YOHO database. - Kin Yu, John S. Mason, John Oglesby:
Speaker recognition models. - Thierry Artières, Patrick Gallinari:
Multi-state predictive neural networks for text-independent speaker recognition.
Voice Source Analysis and Modelling
- Francesco Beritelli, Salvatore Casale, Marco Russo:
A voiced/unvoiced speech discrimination technique based on fuzzy logic. - Vassilios Darsinos, Christophe d'Alessandro, B. Yegnanarayana:
Evaluation of a periodic/aperiodic speech decomposition algorithm. - Jean Rouat, Yong Chun Liu, Daniel Morissette:
A pitch determination and voiced/unvoiced decision algorithm for noisy speech. - Léonard Janer:
Modulated Gaussian wavelet transform based speech analyser (MGWTSA) pitch detection algorithm (PDA). - M. L. Larreategui, F. J. Ancin, Rolando A. Carrasco:
An improved epoch detection algorithm based on sinusoidal modelling of speech. - Vassilios Darsinos, Dimitrios Galanis, George Kokkinakis:
A method for fully automatic analysis and modelling of voice source characteristics. - Hartmut R. Pfitzinger:
Dynamic vowel quality: a new determination formalism based on perceptual experiments. - Sumio Ohno, Hiroya Fujisaki:
A method for quantitative analysis of the local speech rate.
Voice Personality Characteristics in TTS
- Ki-Seung Lee, Dae Hee Youn, Il-Whan Cha:
Voice personality transformation using an orthogonal vector space conversion. - Makoto Hashimoto, Norio Higuchi:
Spectral mapping for voice conversion using speaker selection and vector field smoothing. - Norio Higuchi, Makoto Hashimoto:
Analysis of acoustic features affecting speaker identification. - Masato Akagi, Taw Ienaga:
Speaker individualities in fundamental frequency contours and its control. - King-fai Lam, Cheung-Fat Chan:
Interpolating MBE V/UV mixture function for high quality synthesis of speech. - Yannis Stylianou, Olivier Cappé, Eric Moulines:
Statistical methods for voice quality transformation. - Yannis Stylianou, Jean Laroche, Eric Moulines:
High-quality speech modification based on a harmonic + noise model. - Sahar E. Bou-Ghazah, John H. L. Hansen:
Source generator based stressed speech perturbation.
Robust Speech Recognition in Noise
- Néstor Becerra Yoma, Fergus R. McInnes, Mervyn A. Jack:
Improved algorithms for speech recognition in noise using lateral inhibition and SNR weighting. - Olivier Siohan, Yifan Gong, Jean Paul Haton:
Noise adaptation using linear regression for continuous noisy speech recognition. - Ruikang Yang, Markku Majaniemi, Petri Haavisto:
Dynamic parameter compensation for speech recognition in noise. - Andrzej Drygajlo, Nathalie Virag, Gregoire Cosendai:
Robust speech recognition in noise using speech enhancement based on masking properties of the auditory system and adaptive HMM. - Dong Yu, Taiyi Huang:
Canonical correlation based compensation approach for robust speech recognition in noisy environment. - Pedro J. Moreno, Bhiksha Raj, Richard M. Stern:
A unified approach for robust speech recognition.
Modelling and Training for Robust Recognition
- Harald Singer, Kuldip K. Paliwal, Tomohiko Beppu, Yoshinori Sagisaka:
Effect of RASTA-type processing for speech recognition with speaking-rate mismatches.