


Speech Communication, Volume 48
Volume 48, Number 1, January 2006
- Sebastian Möller, Jan Felix Krebber, Paula M. T. Smeele: Evaluating the speech output component of a smart-home system. 1-27
- Heungkyu Lee, Hanseok Ko: Competing models-based text-prompted speaker independent verification algorithm. 28-44
- Tomoki Toda, Hisashi Kawai, Minoru Tsuzaki, Kiyohiro Shikano: An evaluation of cost functions sensitively capturing local degradation of naturalness for segment selection in concatenative speech synthesis. 45-56
- Chang Huai You, Soo Ngee Koh, Susanto Rahardja: Masking-based beta-order MMSE speech enhancement. 57-70
- Ka-Yee Leung, Man-Wai Mak, Man-Hung Siu, Sun-Yuan Kung: Adaptive articulatory feature-based conditional pronunciation modeling for speaker verification. 71-84
- Marie A. Roch: Gaussian-selection-based non-optimal search for speaker identification. 85-95
- Kotta Manohar, Preeti Rao: Speech enhancement in nonstationary noise environments using noise properties. 96-109
Volume 48, Number 2, February 2006
- Junfeng Li, Masato Akagi: A noise reduction system based on hybrid noise estimation technique and post-filtering in arbitrary noise environments. 111-126
- Yassine Mami, Delphine Charlet: Speaker recognition by location in the space of reference speakers. 127-141
- Vlasios Doumpiotis, William Byrne: Lattice segmentation and minimum Bayes risk discriminative training for large vocabulary continuous speech recognition. 142-160
- Konstantin Markov, Jianwu Dang, Satoshi Nakamura: Integration of articulatory and spectrum features based on the hybrid HMM/BN modeling framework. 161-175
- Andrea Facco, Daniele Falavigna, Roberto Gretter, Marcello Viganò: Design and evaluation of acoustic and language models for large scale telephone services. 176-190
- Arnaud Martin, Laurent Mauuary: Robust speech/non-speech detection based on LDA-derived parameter and voicing parameter for speech recognition in noisy environments. 191-206
- Cheng-Lung Lee, Wen-Whei Chang, Yuan-Chuan Chiang: Spectral and prosodic transformations of hearing-impaired Mandarin speech. 207-219
- Sundarrajan Rangachari, Philipos C. Loizou: A noise-estimation algorithm for highly non-stationary environments. 220-231
Volume 48, Numbers 3-4, March-April 2006
- Srinivas Bangalore, Dilek Hakkani-Tür, Gökhan Tür: Introduction to the Special Issue on Spoken Language Understanding in Conversational Systems. 233-238
- Patrick Haffner: Scaling large margin classifiers for spoken language understanding. 239-261
- Yulan He, Steve J. Young: Spoken language understanding using the Hidden Vector State Model. 262-275
- Murat Saraclar, Brian Roark: Utterance classification with discriminative language modeling. 276-287
- Christian Raymond, Frédéric Béchet, Renato de Mori, Géraldine Damnati: On the use of finite state transducers for semantic interpretation. 288-304
- Chai Wutiwiwatchai, Sadaoki Furui: A multi-stage approach for Thai spoken language understanding. 305-320
- Ruiqiang Zhang, Gen-ichiro Kikui: Integration of speech recognition and machine translation: Speech recognition word lattice translation. 321-334
- Johan Boye, Joakim Gustafson, Mats Wirén: Robust spoken language understanding in a computer game. 335-353
- Hilda Hardy, Alan W. Biermann, R. Bryce Inouye, Ashley McKenzie, Tomek Strzalkowski, Cristian Ursu, Nick Webb, Min Wu: The Amities system: Data-driven techniques for automated dialogue. 354-373
- Qiang Huang, Stephen J. Cox: Task-independent call-routing. 374-389
- Ye-Yi Wang, Alex Acero: Rapid development of spoken language understanding grammars. 390-416
- Ryuichiro Higashinaka, Katsuhito Sudoh, Mikio Nakano: Incorporating discourse features into confidence scoring of intention recognition results in spoken dialogue systems. 417-436
- Tong Zhang, Mark Hasegawa-Johnson, Stephen E. Levinson: Extraction of pragmatic and semantic salience from spontaneous spoken English. 437-462
Volume 48, Number 5, May 2006
- Christopher Dromey, Shawn L. Nissen, Petrea Nohr, Samuel G. Fletcher: Measuring tongue movements during speech: Adaptation of a magnetic jaw-tracking system. 463-473
- Marián Képesi, Luis Weruaga: Adaptive chirp-based time-frequency analysis of speech signals. 474-492
- Hauke Schramm, Xavier L. Aubert, Bart Bakker, Carsten Meyer, Hermann Ney: Modeling spontaneous speech variability in professional dictation. 493-515
- Atsushi Fujii, Katunobu Itou, Tetsuya Ishikawa: LODEM: A system for on-demand video lectures. 516-531
- Carsten Meyer, Hauke Schramm: Boosting HMM acoustic models in large vocabulary speech recognition. 532-548
- Mark D. Skowronski, John G. Harris: Applied principles of clear and Lombard speech for automated intelligibility enhancement in noisy environments. 549-558
- Diane J. Litman, Katherine Forbes-Riley: Recognizing student emotions and attitudes on the basis of utterances in spoken tutoring dialogues with both human and computer tutors. 559-590
Volume 48, Number 6, June 2006
- SungHee Kim, Robert D. Frisina, Frances M. Mapes, Elizabeth D. Hickman, D. Robert Frisina: Effect of age on binaural speech intelligibility in normal hearing adults. 591-597
- Praveen K. Kakumanu, Anna Esposito, Oscar N. Garcia, Ricardo Gutierrez-Osuna: A comparison of acoustic coding models for speech-driven facial animation. 598-615
- Tong Zhang, Mark Hasegawa-Johnson, Stephen E. Levinson: Cognitive state classification in a spoken tutorial dialogue system. 616-632
- Cynthia G. Clopper, David B. Pisoni: The Nationwide Speech Project: A new corpus of American English dialects. 633-644
- Daniel Recasens, Aina Espinosa: Dispersion and variability of Catalan vowels. 645-666
- Amalia Arvaniti, D. Robert Ladd, Ineke Mennen: Phonetic effects of focus and "tonal crowding" in intonation: Evidence from Greek polar questions. 667-696
- Ben Milner, Xu Shao: Clean speech reconstruction from MFCC vectors and fundamental frequency using an integrated front-end. 697-715
- Min Chu, Yong Zhao, Eric Chang: Modeling stylized invariance and local variability of prosody in text-to-speech synthesis. 716-726
- Leigh D. Alsteris, Kuldip K. Paliwal: Further intelligibility results from human listening tests using the short-time phase spectrum. 727-736
- Junho Park, Hanseok Ko: Achieving a reliable compact acoustic model for embedded speech recognition system with high confusion frequency model handling. 737-745
- Stephen So, Kuldip K. Paliwal: Scalable distributed speech recognition using Gaussian mixture model-based block quantisation. 746-758
Volume 48, Number 7, July 2006
- Frédéric Bimbot, Marcos Faúndez-Zanuy, Renato de Mori: Editorial. 759
- Kevin M. Indrebo, Richard J. Povinelli, Michael T. Johnson: Sub-banded reconstructed phase spaces for speech recognition. 760-774
- Erhard Rank, Gernot Kubin: An oscillator-plus-noise model for speech synthesis. 775-801
- Giampiero Salvi: Dynamic behaviour of connectionist speech recognition with strong latency constraints. 802-818
- Dimitrios Dimitriadis, Petros Maragos: Continuous energy demodulation methods and application to speech analysis. 819-837
- Marcos Faúndez-Zanuy: Speech coding through adaptive combined nonlinear prediction. 838-847
- Laurent Benaroya, Frédéric Bimbot, Guillaume Gravier, Rémi Gribonval: Experiments in audio source separation with one sensor for robust speech recognition. 848-854
- SungHee Kim, Robert D. Frisina, D. Robert Frisina: Effects of age on speech understanding in normal hearing listeners: Relationship between the auditory efferent system and speech intelligibility in noise. 855-862
Volume 48, Number 8, August 2006
- Luis Fernando D'Haro, Ricardo de Córdoba, Javier Ferreiros, Stefan W. Hamerich, Volker Schless, Basilis Kladis, Volker Schubert, Otilia Kocsis, Stefan Igel, José Manuel Pardo: An advanced platform to speed up the design of multilingual dialog applications for multiple modalities. 863-887
- Naveen Srinivasamurthy, Antonio Ortega, Shrikanth S. Narayanan: Efficient scalable encoding for distributed speech recognition. 888-902
- Mohammad Ali Salmani-Nodoushan: A comparative sociopragmatic study of ostensible invitations in English and Farsi. 903-912
- T. Nagarajan, Hema A. Murthy: Language identification using acoustic log-likelihoods of syllable-like units. 913-926
- Yasser Ghanbari, Mohammad Reza Karami-Mollaei: A new approach for speech enhancement based on the adaptive thresholding of the wavelet packets. 927-940
- Francisco Campillo Díaz, Eduardo Rodríguez Banga: A method for combining intonation modelling and speech unit selection in corpus-based speech synthesis systems. 941-956
- Jean-Baptiste Maj, Liesbeth Royackers, Jan Wouters, Marc Moonen: Comparison of adaptive noise reduction algorithms in dual microphone hearing aids. 957-970
- Roberto Togneri, Li Deng: A state-space model with neural-network prediction for recovering vocal tract resonances in fluent speech from Mel-cepstral coefficients. 971-988
- Jinfu Ni, Keikichi Hirose: Quantitative and structural modeling of voice fundamental frequency contours of speech in Mandarin. 989-1008
- Pushkar Patwardhan, Preeti Rao: Effect of voice quality on frequency-warped modeling of vowel spectra. 1009-1023
- Adam Borowicz, Marek Parfieniuk, Alexander A. Petrovsky: An application of the warped discrete Fourier transform in the perceptual speech enhancement. 1024-1036
- Jan Stadermann, Gerhard Rigoll: Hybrid NN/HMM acoustic modeling techniques for distributed speech recognition. 1037-1046
- Ismail Shahin: Enhancing speaker identification performance under the shouted talking condition using second-order circular hidden Markov models. 1047-1055
Volume 48, Number 9, September 2006
- Gerasimos Xydas, Georgios Kouroupetroglou: Tone-Group F0 selection for modeling focus prominence in small-footprint speech synthesis. 1057-1078
- Felicia Roberts, Alexander L. Francis, Melanie Morgan: The interaction of inter-turn silence with prosodic cues in listener perceptions of "trouble" in conversation. 1079-1093
- Fatih Ögüt, Mehmet Akif Kiliç, Erkan Zeki Engin, Rasit Midilli: Voice onset times for Turkish stop consonants. 1094-1099
- Akira Sasou, Futoshi Asano, Satoshi Nakamura, Kazuyo Tanaka: HMM-based noise-robust feature compensation. 1100-1111
- Alejandro Bassi, Néstor Becerra Yoma, Patricio Loncomilla: Estimating tonal prosodic discontinuities in Spanish using HMM. 1112-1125
- Abhinav Sethy, Shrikanth S. Narayanan, S. Parthasarthy: A split lexicon approach for improved recognition of spoken names. 1126-1136
- Teruhisa Misu, Tatsuya Kawahara: Dialogue strategy to clarify user's queries for document retrieval system with speech interface. 1137-1150
- Makoto Hirohata, Yosuke Shinnaka, Koji Iwano, Sadaoki Furui: Sentence-extractive automatic speech summarization and evaluation techniques. 1151-1161
- Dimitrios Ververidis, Constantine Kotropoulos: Emotional speech recognition: Resources, features, and methods. 1162-1181
- Vivek Tyagi, Hervé Bourlard, Christian Wellekens: On variable-scale piecewise stationary spectral analysis of speech signals for ASR. 1182-1191
- Joe Frankel, Simon King: Observation process adaptation for linear dynamic models. 1192-1199
- Mohamed Faouzi BenZeghiba, Hervé Bourlard: User-customized password speaker verification using multiple reference and background models. 1200-1213
- Dong Yu, Li Deng, Alex Acero: A lattice search technique for a long-contextual-span hidden trajectory model of speech. 1214-1226
Volume 48, Number 10, October 2006
- Javier Latorre, Koji Iwano, Sadaoki Furui: New approach to the polyglot speech generation by means of an HMM-based speaker adaptable synthesizer. 1227-1242
- S. R. Mahadeva Prasanna, Cheedella S. Gupta, B. Yegnanarayana: Extraction of speaker-specific excitation information from linear prediction residual of speech. 1243-1261
- Özgül Salor, Mübeccel Demirekler: Dynamic programming approach to voice transformation. 1262-1272
- Zhenyu Xiong, Thomas Fang Zheng, Zhanjiang Song, Frank K. Soong, Wenhu Wu: A tree-based kernel selection approach to efficient Gaussian mixture model-universal background model based speaker identification. 1273-1282
- Gengxin Ning, Shu-hung Leung, Kam-keung Chu, Gang Wei: A dynamic parameter compensation method for noisy speech recognition. 1283-1293
- Zekeriya Tufekci, John N. Gowdy, Sabri Gurbuz, Eric K. Patterson: Applied mel-frequency discrete wavelet coefficients and parallel model compensation for noise-robust speech recognition. 1294-1307
- Rupal Patel, Maria I. Grigos: Acoustic characterization of the question-statement contrast in 4, 7 and 11 year-old children. 1308-1318
- Sacha Krstulovic, Frédéric Bimbot, Olivier Boëffard, Delphine Charlet, Dominique Fohr, Odile Mella: Optimizing the coverage of a speech database through a selection of representative speaker recordings. 1319-1348
- Wen Jin, Michael S. Scordilis: Speech enhancement by residual domain constrained optimization. 1349-1364
- Abdellah Kacha, Francis Grenez, Jean Schoentgen: Estimation of dysperiodicities in disordered speech. 1365-1378
- Jesús Vicente-Peña, Ascensión Gallardo-Antolín, Carmen Peláez-Moreno, Fernando Díaz-de-María: Band-pass filtering of the time sequences of spectral parameters for robust wireless speech recognition. 1379-1398
Volume 48, Number 11, November 2006
- Ben Milner, Christian Wellekens, Børge Lindberg: Special Issue on Robustness Issues for Conversational Interaction. 1399-1401
- Alastair Bruce James, Ben Milner: Towards improving the robustness of distributed speech recognition in packet loss. 1402-1421
- Antonio Cardenal López, Carmen García-Mateo, Laura Docío Fernández: Weighted Viterbi decoding strategies for distributed speech recognition over IP networks. 1422-1434
- Valentin Ion, Reinhold Haeb-Umbach: Uncertainty decoding for distributed speech recognition over error-prone networks. 1435-1446
- Kentaro Ishizuka, Tomohiro Nakatani: A feature extraction method using subband based periodicity and aperiodicity decomposition with noise robust frontend processing for automatic speech recognition. 1447-1457
- Benjamin J. Shannon, Kuldip K. Paliwal: Feature extraction from higher-lag autocorrelation coefficients for robust speech recognition. 1458-1485
- Soundararajan Srinivasan, Nicoleta Roman, DeLiang L. Wang: Binary and ratio time-frequency masks for robust speech recognition. 1486-1501
- Veronique Stouten, Hugo Van hamme, Patrick Wambacq: Model-based feature enhancement with uncertainty decoding for noise robust ASR. 1502-1514
- Tran Huy Dat, Kazuya Takeda, Fumitada Itakura: On-line Gaussian mixture modeling in the log-power domain for signal-to-noise ratio estimation and speech enhancement. 1515-1527
- Vivek Tyagi, Christian Wellekens, Dirk T. M. Slock: Least squares filtering of speech signals for robust ASR. 1528-1544
- Esfandiar Zavarehei, Saeed Vaseghi, Qin Yan: Inter-frame modeling of DFT trajectories of speech and noise for speech enhancement using Kalman filters. 1545-1555
- Jonathan Darch, Ben P. Milner, Saeed Vaseghi: MAP prediction of formant frequencies and voicing class from MFCC vectors in noise. 1556-1572
- Richard C. Rose, Iker Arizmendi: Efficient client-server based implementations of mobile speech recognition services. 1573-1589
- Frederik Stouten, Jacques Duchateau, Jean-Pierre Martens, Patrick Wambacq: Coping with disfluencies in spontaneous speech recognition: Acoustic detection and linguistic context manipulation. 1590-1606
Volume 48, Number 12, December 2006
- Marcos Faúndez-Zanuy, Léonard Janer-García, Josep Roure Alcobé, Frédéric Bimbot, Renato de Mori: Editorial. 1607
- Marcos Faúndez-Zanuy, Martin Hagmüller, Gernot Kubin: Speaker verification security improvement by means of speech watermarking. 1608-1619
- Mohammed Bahoura, Jean Rouat: Wavelet speech enhancement based on time-scale adaptation. 1620-1637
- Juan Manuel Górriz, Javier Ramírez, Elmar Wolfgang Lang, Carlos García Puntonet: Hard C-means clustering for voice activity detection. 1638-1649
- Martin Hagmüller, Gernot Kubin: Poincaré pitch marks. 1650-1665
- Giampiero Salvi: Segment boundary detection via class entropy measurements in connectionist phoneme recognition. 1666-1676
- Sadao Hiroya, Takemi Mochida: Multi-speaker articulatory trajectory formation based on speaker-independent articulatory HMMs. 1677-1690
- Anna Pribilová, Jiri Pribil: Non-linear frequency scale mapping for voice conversion in text-to-speech system with cepstral description. 1691-1703
- Peter J. Murphy: Periodicity estimation in synthesized phonation signals using cepstral rahmonic peaks. 1704-1713
