


default search action
IEEE Transactions on Speech and Audio Processing, Volume 13
Volume 13, Number 1, January 2005
- Muhammad Z. Ikram, Dennis R. Morgan:

Permutation inconsistency in blind speech separation: investigation and solutions. 1-13 - Pere Pujol, Susagna Pol, Climent Nadeu

, Astrid Hagen, Hervé Bourlard:
Comparison and combination of features in a hybrid HMM/MLP and a HMM/GMM speech recognition system. 14-22 - Frank Wessel, Hermann Ney:

Unsupervised training of acoustic models for large vocabulary continuous speech recognition. 23-31 - Michael M. Goodwin, A. J. Hipple, B. Link:

Predicting and preventing unmasking incurred in coded audio post-processing. 32-41 - Joshua M. Sachar, Harvey F. Silverman, William R. Patterson III:

Microphone position and gain calibration for a large-aperture microphone array. 42-52 - Simon Doclo

, Marc Moonen:
Multimicrophone noise reduction using recursive GSVD-based optimal filtering with ANC postprocessing stage. 53-69 - Vikas C. Raykar, Igor Kozintsev, Rainer Lienhart:

Position calibration of microphones and loudspeakers in distributed computing platforms. 70-83 - Stuart N. Wrigley

, Guy J. Brown
, Vincent Wan, Steve Renals
:
Speech and crosstalk detection in multichannel audio. 84-91 - Scott C. Douglas

, Hiroshi Sawada, Shoji Makino
:
Natural gradient multichannel blind deconvolution and speech separation using causal FIR filters. 92-104 - Ville Pulkki

, Toni Hirvonen
:
Localization of virtual sources in multichannel audio reproduction. 105-119 - Herbert Buchner, Robert Aichner, Walter Kellermann:

A generalization of blind source separation algorithms for convolutive mixtures based on second-order statistics. 120-134 - Boaz Rafaely

:
Analysis and design of spherical microphone arrays. 135-143
Volume 13, Number 2, March 2005
- Ted Painter, Andreas Spanias:

Perceptual segmentation and component selection for sinusoidal representations of audio. 149-162 - Fredrik Nordén, Per Hedelin:

Companded quantization of speech MDCT coefficients. 163-173 - Robert E. Schapire, Marie Rochery, Mazin G. Rahim, Narendra K. Gupta:

Boosting with prior knowledge for call classification. 174-181 - Jen-Tzung Chien

:
Decision tree State tying using cluster validity criteria. 182-193 - Dong Kook Kim, Nam Soo Kim:

Rapid online adaptation based on transformation space model evolution. 194-202 - Vincent Wan, Steve Renals

:
Speaker verification using sequence discriminant support vector machines. 203-210 - William J. J. Roberts, Yariv Ephraim, Howard W. Sabrin:

Speaker classification using composite hypothesis testing and list decoding. 211-219 - Renat Vafin, W. Bastiaan Kleijn

:
Entropy-constrained polar quantization and its application to audio coding. 220-232 - Dietrich Fränken, Klaus Meerkötter, Joachim Wassmuth:

Observer-based feedback linearization of dynamic loudspeakers with Ac amplifiers. 233-242 - Lorenzo Turicchia, Rahul Sarpeshkar:

A bio-inspired companding strategy for spectral enhancement. 243-253 - H. K. Jang, Ju Sung Park:

Multiresolution sinusoidal model with dynamic segmentation for timescale modification of polyphonic audio signals. 254-262 - Athanasios Mouchtaris, Shrikanth S. Narayanan, Chris Kyriakakis:

Multichannel audio synthesis by subband-based spectral conversion and parameter adaptation. 263-274 - William A. Sethares, Robin D. Morris, James C. Sethares:

Beat tracking of musical performances using low-level audio features. 275-285 - Mrityunjoy Chakraborty, Hideaki Sakai:

Convergence analysis of a complex LMS algorithm with tonal reference signals. 286-292 - Chul Min Lee, Shrikanth S. Narayanan:

Toward detecting emotions in spoken dialogs. 293-303
Volume 13, Number 3, May 2005
- Tetsuya Hoya

, Toshihisa Tanaka, Andrzej Cichocki
, Takahiro Murakami, Gen Hori, Jonathon A. Chambers:
Stereophonic noise reduction using a combined sliding subspace projection and adaptive signal enhancement. 309-320 - Alexandros Potamianos, Shrikanth S. Narayanan, Giuseppe Riccardi:

Adaptive categorical understanding for spoken dialogue systems. 321-329 - Chung-Hsien Wu

, Gwo-Lang Yan:
Speech act modeling and verification of spontaneous speech with disfluency in a spoken dialogue system. 330-344 - Patrick Kenny, Gilles Boulianne

, Pierre Dumouchel
:
Eigenvoice modeling with sparse training data. 345-354 - Ángel de la Torre

, Antonio M. Peinado
, José C. Segura
, José L. Pérez-Córdoba
, M. Carmen Benítez
, Antonio J. Rubio:
Histogram equalization of speech representation for robust speech recognition. 355-366 - Stavros Tsakalidis, Vlasios Doumpiotis, William J. Byrne:

Discriminative linear transforms for feature normalization and speaker adaptation in HMM estimation. 367-376 - Jen-Tzung Chien

, Sadaoki Furui:
Predictive hidden Markov model selection for speech recognition. 377-387 - Mohamed Afify:

Accurate compensation in the log-spectral domain for noisy speech recognition. 388-398 - Yu Tsao

, Shang-Ming Lee, Lin-Shan Lee:
Segmental eigenvoice with delicate eigenspace for improved speaker adaptation. 399-411 - Li Deng, Jasha Droppo

, Alex Acero
:
Dynamic compensation of HMM variances using the feature enhancement uncertainty computed from a parametric model of speech distortion. 412-421 - Charles D. Creusere:

Understanding perceptual distortion in MPEG scalable audio coding. 422-431 - Mohamed F. Mansour, Ahmed H. Tewfik:

Data embedding in audio using time-scale modification. 432-440 - Changsheng Xu, Namunu Chinthaka Maddage, Xi Shao:

Automatic music classification and summarization. 441-450
Volume 13, Number 4, July 2005
- Isabel Trancoso

:
Editorial. 457 - Michael T. Johnson, Richard J. Povinelli

, Andrew C. Lindgren, Jinjin Ye, Xiaolin Liu, Kevin M. Indrebo:
Time-domain isolated phoneme classification using reconstructed phase spaces. 458-466 - Bowen Zhou, John H. L. Hansen:

Efficient audio stream segmentation via the combined T2 statistic and Bayesian information criterion. 467-474 - Chang Huai You, Soo Ngee Koh, Susanto Rahardja

:
beta-order MMSE spectral amplitude estimation for speech enhancement. 475-486 - Ann Spriet, Marc Moonen, Jan Wouters

:
Robustness analysis of multichannel Wiener filtering and generalized sidelobe cancellation for multimicrophone noise reduction in hearing aid applications. 487-503 - Giuseppe Riccardi, Dilek Hakkani-Tür

:
Active learning: theory and applications to automatic speech recognition. 504-511 - Mukund Padmanabhan, Satya Dharanipragada:

Maximizing information content in feature extraction. 512-519 - Frank Seide:

The use of virtual hypothesis copies in decoding of large-vocabulary continuous speech. 520-533 - Ruhi Sarikaya, Yuqing Gao, Michael Picheny, Hakan Erdogan:

Semantic confidence measurement for spoken dialog systems. 534-545 - Mohamed Afify, Feng Liu, Hui Jiang, Olivier Siohan:

A new verification-based fast-match for large vocabulary continuous speech recognition. 546-553 - Bowen Zhou, John H. L. Hansen:

Rapid discriminative acoustic model based on eigenspace mapping for fast speaker adaptation. 554-564 - Kuo-Hwei Yuo, Tai-Hwei Hwang, Hsiao-Chuan Wang:

Combination of autocorrelation-based features and projection measure technique for speaker identification. 565-574 - B. Yegnanarayana, S. R. Mahadeva Prasanna, Jinu Mariam Zachariah, Cheedella S. Gupta:

Combining evidence from source, suprasegmental and spectral features for a fixed-text speaker verification system. 575-582 - Masafumi Nishida, Tatsuya Kawahara

:
Speaker model selection based on the Bayesian information criterion applied to unsupervised speaker indexing. 583-592 - Harvey F. Silverman, Ying Yu, Joshua M. Sachar, William R. Patterson III:

Performance of real-time source-location estimators for a large-aperture microphone array. 593-606 - Ying Song, Yu Gong, Sen M. Kuo:

A robust hybrid feedback active noise cancellation headset. 607-617 - Ming Zhang, Hui Lan, Wee Ser:

On comparison of online secondary path modeling methods with auxiliary noise. 618-628
Volume 13, Numbers 5-1, September 2005
- Mazin Gilbert, Roger K. Moore

, Geoffrey Zweig:
Introduction to the Special Issue on Data Mining of Speech, Audio, and Dialog. 633-634 - Peng Yu, Kaijiang Chen, Chengyuan Ma, Frank Seide:

Vocabulary-Independent Indexing of Spontaneous Speech. 635-643 - Chien-Chang Lin, Shi-Huang Chen, Trieu-Kien Truong, Yukon Chang:

Audio Classification and Categorization Based on Wavelets and Support Vector Machine. 644-651 - Shona Douglas, Deepak Agarwal, Tirso Alonso, Robert M. Bell, Mazin Gilbert, Deborah F. Swayne, Chris Volinsky:

Mining Customer Care Dialogs for "Daily News". 652-660 - Dong Yu, Alex Acero

:
Semiautomatic Improvements of System-Initiative Spoken Dialog Applications Using Interactive Clustering. 661-671 - Lee Begeja, Harris Drucker, David C. Gibbon, Patrick Haffner, Zhu Liu, Bernard Renger, Behzad Shahraray:

Semantic Data Mining of Short Utterances. 672-680 - Lina Zhou, Yongmei Shi, Jinjuan Feng

, Andrew Sears:
Data Mining for Detecting Errors in Dictation Speech Recognition. 681-688 - Chaug-Ching Huang, Jhing-Fa Wang, Dian-Jia Wu:

Automatic Scene Change Detection for Composed Speech and Music Sound Under Low SNR Noisy Environment. 689-699 - John Grothendieck:

Tracking Changes in Language. 700-711 - John H. L. Hansen, Rongqing Huang, Bowen Zhou, Michael S. Seadle, John R. Deller Jr., Aparna Gurijala, Mikko Kurimo, Pongtep Angkititrakul:

SpeechFind: Advances in Spoken Document Retrieval for a National Gallery of the Spoken Word. 712-730
Volume 13, Numbers 5-2, September 2005
- S. S. Yedlapalli:

Transforming Real Linear Prediction Coefficients to Line Spectral Representations With a Real FFT. 733-740 - Minkyu Lee, Jan P. H. van Santen

, Bernd Möbius
, Joseph P. Olive:
Formant Tracking Using Context-Dependent Phonemic Information. 741-750 - Vikas C. Raykar, B. Yegnanarayana, S. R. Mahadeva Prasanna, Ramani Duraiswami

:
Speaker Localization Using Excitation Source Information in Speech. 751-761 - B.-F. Wu, K.-C. Wang:

Robust Endpoint Detection Algorithm Based on the Adaptive Band-Partitioning Spectral Entropy in Adverse Environments. 762-775 - Om Deshmukh, Carol Y. Espy-Wilson, Ariel Salomon, Jawahar Singh:

Use of Temporal Information: Detection of Periodicity, Aperiodicity, and Pitch in Speech. 776-786 - Jonas Lindblom:

A Sinusoidal Voice Over Packet Coder Tailored for the Frame-Erasure Channel. 787-798 - Mikko Tammi, Milan Jelinek, Vesa T. Ruoppila:

Signal Modification Method for Variable Bit Rate Wide-band Speech Coding. 799-810 - Turaj Zakizadeh Shabestary, Per Hedelin:

LSP Quantization by a Union of Locally Trained Codebooks. 811-820 - Doh-Suk Kim:

ANIQUE: An Auditory Model for Single-Ended Speech Quality Estimation. 821-831 - Kamran Rahbar, James P. Reilly:

A Frequency Domain Method for Blind Source Separation of Convolutive Audio Mixtures. 832-844 - Rainer Martin

:
Speech Enhancement Based on Minimum Mean-Square Error Estimation and Supergaussian Priors. 845-856 - Philipos C. Loizou:

Speech Enhancement Based on Perceptually Motivated Bayesian Estimators of the Magnitude Spectrum. 857-869 - Israel Cohen:

Relaxed Statistical Model for Speech Enhancement and a Priori SNR Estimation. 870-881 - Yiteng Huang, Jacob Benesty

, Jingdong Chen:
A Blind Channel Identification-Based Two-Stage Approach to Separation and Dereverberation of Speech Signals in a Reverberant Environment. 882-895 - Saeed Gazor

, Wei Zhang:
Speech enhancement employing Laplacian-Gaussian mixture. 896-904 - Ting Liu, Saeed Gazor

:
A Variable Step-Size Pre-Filter-Bank Adaptive Algorithm. 905-916 - Alfonso Ortega

, Eduardo Lleida
, Enrique Masgrau
:
Speech Reinforcement System for Car Cabin Communications. 917-929 - Michael Pitz, Hermann Ney:

Vocal Tract Normalization Equals Linear Transformation in Cepstral Space. 930-944 - Hui Jiang, Frank K. Soong, Chin-Hui Lee:

A Dynamic In-Search Data Selection Method With Its Applications to Acoustic Modeling and Utterance Verification. 945-955 - James McAuley, Ji Ming, Darryl Stewart, Philip Hanna:

Subband Correlation and Robust Speech Recognition. 956-964 - K. Li, M. N. S. Swamy, M. Omair Ahmad:

An Improved Voice Activity Detection Using Higher Order Statistics. 965-974 - Yifan Gong:

A Method of Joint Compensation of Additive and Convolutive Distortions for Speaker-Independent Speech Recognition. 975-983 - Brian Mak

, James Tin-Yau Kwok, Simon Ka-Lung Ho:
Kernel Eigenvoice Speaker Adaptation. 984-992 - Brian Kan-Wing Mak

, Kin-Wah Chan:
Pruning Hidden Markov Models With Optimal Brain Surgeon. 993-1003 - S. Kwon, Shri Narayanan:

Unsupervised Speaker Indexing Using Generic Models. 1004-1013 - Gerald Schuller, Jelena Kovacevic, F. Masson, Vivek K. Goyal

:
Robust Low-Delay Audio Coding Using Multiple Descriptions. 1014-1024 - Stanley T. Birchfield, Amarnag Subramanya:

Microphone Array Position Calibration by Basis-Point Classical Multidimensional Scaling. 1025-1034 - Juan Pablo Bello

, Laurent Daudet, Samer A. Abdallah, Chris Duxbury, Mike E. Davies, Mark B. Sandler:
A Tutorial on Onset Detection in Music Signals. 1035-1047 - Christof Faller, Jingdong Chen:

Suppressing Acoustic Echo in a Spectral Envelope Space. 1048-1062 - G. R. Campos, David M. Howard

:
On the Computational Efficiency of Different Waveguide Mesh Topologies for Room Acoustic Simulation. 1063-1072 - Federico Avanzini

, Stefania Serafin
, Davide Rocchesso
:
Interactive Simulation of Rigid Body Interaction With Friction-Induced Sound Generation. 1073-1081 - Muhammad Tahir Akhtar

, Masahide Abe, Masayuki Kawamata:
A New Structure for Feedforward Active Noise Control Systems With Improved Online Secondary Path Modeling. 1082-1088
Volume 13, Number 6, November 2005
- Broneslav A. Kiselman, Vladimir V. Krylov:

Comparative analysis of linear and nonlinear speech signals predictors. 1093-1097 - Iasonas Kokkinos, Petros Maragos:

Nonlinear speech analysis using models for chaotic systems. 1098-1109 - B. Yegnanarayana, S. R. Mahadeva Prasanna, Ramani Duraiswami

, Dmitry N. Zotkin:
Processing of reverberant speech for time-delay estimation. 1110-1118 - Javier Ramírez

, José C. Segura
, M. Carmen Benítez
, Ángel de la Torre
, Antonio J. Rubio:
An effective subband OSF-based VAD with noise reduction for robust speech recognition. 1119-1129 - Geert Rombouts, Marc Moonen:

Fast QRD-lattice-based unconstrained optimal filtering for acoustic noise reduction. 1130-1143 - Scott Axelrod, Vaibhava Goel

, Ramesh A. Gopinath, Peder A. Olsen, Karthik Visweswariah:
Subspace constrained Gaussian mixture models for speech recognition. 1144-1160 - Xiaodong Cui, Abeer Alwan:

Noise robust speech recognition using feature compensation based on polynomial regression of utterance SNR. 1161-1172 - Thomas Hain

, Philip C. Woodland, Gunnar Evermann, Mark J. F. Gales, Xunying Liu, Gareth L. Moore, Daniel Povey, Lan Wang:
Automatic transcription of conversational telephone speech. 1173-1185 - Ascensión Gallardo-Antolín

, Carmen Peláez-Moreno
, Fernando Díaz-de-María
:
Recognizing GSM digital speech. 1186-1205 - Jae-Sik Lee, Jong-Hoon Jeong, Tae-Gyu Chang:

An efficient method of Huffman decoding for MPEG-2 AAC and its performance analysis. 1206-1209 - I. Kauppinen, K. Roth:

Improved noise reduction in audio signals using spectral resolution enhancement with time-domain signal extrapolation. 1210-1216 - Orlando José Tobias, Rui Seara:

Leaky-FXLMS algorithm: stochastic analysis for Gaussian data and secondary path modeling error. 1217-1230 - Per Åhgren:

Acoustic echo cancellation and doubletalk detection using estimated loudspeaker impulse responses. 1231-1237

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














