


default search action
IEEE Transactions on Audio, Speech & Language Processing, Volume 19
Volume 19, Number 1, January 2011
- Tobias May

, Steven van de Par, Armin Kohlrausch:
A Probabilistic Model for Robust Localization Based on a Binaural Auditory Front-End. 1-13 - Stefan Strahl

, Heiko Hansen, Alfred Mertins:
A Dynamic Fine-Grain Scalable Compression Scheme With Application to Progressive Audio Coding. 14-23 - Albertus C. den Brinker

, Harish Krishnamoorthi, E. A. Verbitskiy:
Similarities and Differences Between Warped Linear Prediction and Laguerre Linear Prediction. 24-33 - Konrad Kowalczyk

, Maarten van Walstijn:
Room Acoustics Simulation Using 3-D Compact Explicit FDTD Schemes. 34-46 - Philipos C. Loizou, Gibak Kim:

Reasons why Current Speech-Enhancement Algorithms do not Improve Speech Intelligibility and Suggested Solutions. 47-56 - Mikel Gainza, Eugene Coyle:

Tempo Detection Using a Hybrid Multiband Approach. 57-68 - Takuya Yoshioka, Tomohiro Nakatani, Masato Miyoshi, Hiroshi G. Okuno

:
Blind Separation and Dereverberation of Speech Mixtures by Joint Optimization. 69-84 - Yun Lei, John H. L. Hansen:

Dialect Classification via Text-Independent Training and Testing for Arabic, Spanish, and Chinese. 85-96 - Luis Antonio Azpicueta-Ruiz

, Marcus Zeller, Aníbal R. Figueiras-Vidal, Jerónimo Arenas-García
, Walter Kellermann:
Adaptive Combination of Volterra Kernels and Its Application to Nonlinear Acoustic Echo Cancellation. 97-110 - Jayme G. A. Barbedo

, George Tzanetakis
:
Musical Instrument Classification Using Individual Partials. 111-122 - Maarten Van Segbroeck, Hugo Van hamme

:
Advances in Missing Feature Techniques for Robust Large-Vocabulary Continuous Speech Recognition. 123-137 - Hélène Papadopoulos, Geoffroy Peeters:

Joint Estimation of Chords and Downbeats From an Audio Signal. 138-152 - Tuomo Raitio, Antti Suni

, Junichi Yamagishi, Hannu Pulakka, Jani Nurminen, Martti Vainio
, Paavo Alku
:
HMM-Based Speech Synthesis Utilizing Glottal Inverse Filtering. 153-165 - Mohsen A. Rashwan

, Mohamed Al-Badrashiny, Mohamed Attia, Sherif M. Abdou, Ahmed Rafea
:
A Stochastic Arabic Diacritizer Based on a Hybrid of Factorized and Unfactorized Textual Features. 166-175 - Andre Holzapfel, Yannis Stylianou:

Scale Transform in Rhythmic Similarity of Music. 176-185 - Suhadi Suhadi

, Carsten Last, Tim Fingscheidt
:
A Data-Driven Approach to A Priori SNR Estimation. 186-195 - Ning Wang, P. C. Ching, Nengheng Zheng, Tan Lee

:
Robust Speaker Recognition Using Denoised Vocal Source and Vocal Tract Features. 196-205 - Alexander Krueger, Ernst Warsitz, Reinhold Haeb-Umbach

:
Speech Enhancement With a GSC-Like Structure Employing Eigenvector-Based Transfer Function Ratios Estimation. 206-219
Volume 19, Number 2, February 2011
- Joel Pinto, Garimella S. V. S. Sivaram, Mathew Magimai-Doss, Hynek Hermansky

, Hervé Bourlard:
Analysis of MLP-Based Hierarchical Phoneme Posterior Probability Estimator. 225-241 - Michael Stark, Michael Wohlmayr, Franz Pernkopf

:
Source-Filter-Based Single-Channel Speech Separation Using Pitch Information. 242-255 - Etan Fisher, Boaz Rafaely

:
Near-Field Spherical Microphone Array Processing With Radial Filtering. 256-265 - Weiqiang Zhang

, Liang He
, Yan Deng, Jia Liu, Michael T. Johnson:
Time-Frequency Cepstral Features and Heteroscedastic Linear Discriminant Analysis for Language Recognition. 266-276 - Colin Breithaupt, Rainer Martin

:
Analysis of the Decision-Directed SNR Estimator for Speech Enhancement With Respect to Low-SNR and Transient Conditions. 277-289 - Yannis Pantazis, Olivier Rosec, Yannis Stylianou:

Adaptive AM-FM Signal Decomposition With Application to Speech Analysis. 290-300 - Mitsuko Aramaki, Mireille Besson, Richard Kronland-Martinet

, Sølvi Ystad
:
Controlling the Perceived Material in an Impact Sound Synthesizer. 301-314 - D. K. Kim, Mark J. F. Gales:

Noisy Constrained Maximum-Likelihood Linear Regression for Noise-Robust Speech Recognition. 315-325 - Namgook Cho, C.-C. Jay Kuo

:
Sparse Music Representation With Source-Specific Dictionaries and Its Application to Signal Separation. 326-337 - Ben Milner, Jonathan Darch

:
Robust Acoustic Speech Feature Prediction From Noisy Mel-Frequency Cepstral Coefficients. 338-347 - Joseph Tepperman, Sungbok Lee, Shrikanth S. Narayanan, Abeer Alwan:

A Generative Student Model for Scoring Word Reading Skills. 348-360 - Shefeng Yan, Haohai Sun, U. Peter Svensson, Xiaochuan Ma, J. M. Hovem:

Optimal Modal Beamforming for Spherical Microphone Arrays. 361-371 - Marco Kühne, Roberto Togneri

, Sven Nordholm
:
A New Evidence Model for Missing Data Speech Recognition With Applications in Reverberant Multi-Source Environments. 372-384 - Dinh-Quy Nguyen, Woon-Seng Gan

, Andy W. H. Khong:
Time-Reversal Approach to the Stereophonic Acoustic Echo Cancellation Problem. 385-395 - Fritz Menzer, Christof Faller, Hervé Lissek

:
Obtaining Binaural Room Impulse Responses From B-Format Impulse Responses Using Frequency-Dependent Coherence Matching. 396-405 - Zbynek Koldovský

, Petr Tichavský:
Time-Domain Blind Separation of Audio Sources on the Basis of a Complete ICA Decomposition of an Observation Space. 406-416 - Heiga Zen

, Yoshihiko Nankaku, Keiichi Tokuda:
Continuous Stochastic Feature Mapping Based on Trajectory HMMs. 417-430 - Deepu Vijayasenan, Fabio Valente, Hervé Bourlard:

An Information Theoretic Combination of MFCC and TDOA Features for Speaker Diarization. 431-438
Volume 19, Number 3, March 2011
- Mohammad A. Dmour, Mike E. Davies:

A New Framework for Underdetermined Speech Extraction Using Mixture of Beamformers. 445-457 - Miroslav Zivanovic

, Johan Schoukens:
On The Polynomial Approximation for Time-Variant Harmonic Signal Modeling. 458-467 - Ana I. García-Moral, Rubén Solera-Ureña

, Carmen Peláez-Moreno
, Fernando Díaz-de-María
:
Data Balancing for Efficient Training of Hybrid ANN/HMM Automatic Speech Recognition Systems. 468-481 - Jen-Tzung Chien

, Chuang-Hua Chueh
:
Dirichlet Class Language Models for Speech Recognition. 482-495 - Feipeng Li, Jont B. Allen:

Manipulation of Consonants in Natural Speech. 496-504 - Donglai Zhu, Bin Ma, Haizhou Li

:
Speaker Verification With Feature-Space MAPLR Parameters. 505-515 - Hiroshi Sawada, Shoko Araki

, Shoji Makino
:
Underdetermined Convolutive Blind Source Separation via Frequency Bin-Wise Clustering and Permutation Alignment. 516-527 - Konrad Kowalczyk

, Maarten van Walstijn, Damian T. Murphy:
A Phase Grating Approach to Modeling Surface Diffusion in FDTD Room Acoustics Simulations. 528-537 - Fei Liu, Feifan Liu, Yang Liu:

A Supervised Framework for Keyword Extraction From Meeting Transcripts. 538-548 - Lin Wang, Heping Ding, Fuliang Yin:

A Region-Growing Permutation Alignment Approach in Frequency-Domain Blind Source Separation of Speech Mixtures. 549-557 - L. Anders Ekman

, Volodya Grancharov, W. Bastiaan Kleijn
:
Double-Ended Quality Assessment System for Super-Wideband Speech. 558-569 - Jia Jia, Shen Zhang, Fanbo Meng, Yongxin Wang, Lianhong Cai:

Emotional Audio-Visual Speech Synthesis Based on PAD. 570-582 - Francesco Nesta, Ted S. Wada, Biing-Hwang Juang:

Batch-Online Semi-Blind Source Separation Applied to Multi-Channel Acoustic Echo Cancellation. 583-599 - Prasanta Kumar Ghosh, Andreas Tsiartas, Shrikanth S. Narayanan:

Robust Voice Activity Detection Using Long-Term Signal Variability. 600-613 - Sheng Wu, Xiaojun Qiu

, Ming Wu:
Stereo Acoustic Echo Cancellation Employing Frequency-Domain Preprocessing and Adaptive Filter. 614-623 - Francesco Nesta, Piergiorgio Svaizer

, Maurizio Omologo
:
Convolutive BSS of Short Mixtures by ICA Recursively Regularized Across Frequencies. 624-639 - Juan Andres Morales-Cordovilla, Antonio M. Peinado

, Victoria E. Sánchez, José A. González
:
Feature Extraction Based on Pitch-Synchronous Averaging for Robust Speech Recognition. 640-651 - Miguel Ferrer

, Alberto González
, Maria de Diego
, Gema Piñero
:
Transient Analysis of the Conventional Filtered-x Affine Projection Algorithm for Active Noise Control. 652-657
Volume 19, Number 4, May 2011
- Ivan Himawan

, Iain McCowan, Sridha Sridharan:
Clustered Blind Beamforming From Ad-Hoc Microphone Arrays. 661-676 - Guilin Ma, Fredrik Gran, Finn Jacobsen, Finn T. Agerkvist

:
Adaptive Feedback Cancellation With Band-Limited LPC Vocoder in Digital Hearing Aids. 677-687 - Dong Wang, Simon King

, Joe Frankel:
Stochastic Pronunciation Modeling for Out-of-Vocabulary Spoken Term Detection. 688-698 - Ashutosh Pandey, V. John Mathews:

Low-Delay Signal Processing for Digital Hearing Aids. 699-710 - Charles D. Creusere, Joseph C. Hardin

:
Assessing the Quality of Audio Containing Temporally Varying Distortions. 711-720 - Evgeny Matusov, Hermann Ney:

Lattice-Based ASR-MT Interface for Speech Translation. 721-732 - Rogier C. van Dalen, Mark J. F. Gales:

Extended VTS for Noise-Robust Speech Recognition. 733-743 - Romain Hennequin, Roland Badeau, Bertrand David:

NMF With Time-Frequency Activations to Model Nonstationary Audio Events. 744-753 - Roberto Barra-Chicote

, José Manuel Pardo, Javier Ferreiros
, Juan Manuel Montero
:
Speaker Diarization Based on Intensity Channel Contribution. 754-761 - Yi-Hsuan Yang, Homer H. Chen

:
Ranking-Based Emotion Recognition for Music Organization and Retrieval. 762-774 - Mohammed Ariful Haque, Toufiqul Islam, Md. Kamrul Hasan:

Robust Speech Dereverberation Based on Blind Adaptive Estimation of Acoustic Channels. 775-787 - Najim Dehak

, Patrick Kenny, Réda Dehak, Pierre Dumouchel
, Pierre Ouellet:
Front-End Factor Analysis for Speaker Verification. 788-798 - Michael Wohlmayr, Michael Stark, Franz Pernkopf

:
A Probabilistic Interaction Model for Multipitch Tracking With Factorial Hidden Markov Models. 799-810 - Perry Groot, Tom Heskes

, Tjeerd Dijkstra, James M. Kates:
Predicting Preference Judgments of Individual Normal and Hearing-Impaired Listeners With Gaussian Processes. 811-821 - Ji Ming, Ramji Srinivasan, Danny Crookes:

A Corpus-Based Approach to Speech Enhancement From Nonstationary Noise. 822-836 - Arshia Cont

, Shlomo Dubnov
, Gérard Assayag:
On the Information Geometry of Audio Streams With Applications to Similarity Computing. 837-846 - Hayley Hung, Yan Huang, Gerald Friedland, Daniel Gatica-Perez

:
Estimating Dominance in Multi-Party Meetings Using Speaker Diarization. 847-860 - Kong Aik Lee

, Chang Huai You, Haizhou Li
, Tomi Kinnunen, Khe Chai Sim:
Using Discrete Probabilities With Bhattacharyya Measure for SVM-Based Speaker Verification. 861-870 - Shih-Hsiang Lin, Yao-Ming Yeh, Berlin Chen:

Leveraging Kullback-Leibler Divergence Measures and Information-Rich Cues for Speech Summarization. 871-882 - Chi Zhang, John H. L. Hansen:

Whisper-Island Detection Based on Unsupervised Segmentation With Entropy-Based Speech Feature Processing. 883-894 - Matthew Gibson, William Byrne:

Unsupervised Intralingual and Cross-Lingual Speaker Adaptation for HMM-Based Speech Synthesis Using Two-Pass Decision Tree Construction. 895-904 - Min-Seok Choi, Hong-Goo Kang:

A Two-Channel Noise Estimator for Speech Enhancement in a Highly Nonstationary Environment. 905-915 - Saman Mousazadeh, Israel Cohen:

AR-GARCH in Presence of Noise: Parameter Estimation and Its Application to Voice Activity Detection. 916-926 - Qiang Wu, Liqing Zhang, Guangchuan Shi:

Robust Multifactor Speech Feature Extraction Based on Gabor Analysis. 927-936 - Peifeng Ji, Ee-Leng Tan, Woon-Seng Gan

, Jun Yang
:
A Comparative Analysis of Preprocessing Methods for the Parametric Loudspeaker Based on the Khokhlov-Zabolotskaya-Kuznetsov Equation for Speech Reproduction. 937-946 - Frank Rudzicz

:
Articulatory Knowledge in the Recognition of Dysarthric Speech. 947-960 - Bin Gao, Wai Lok Woo

, Satnam Singh Dlay:
Single-Channel Source Separation Using EMD-Subband Variable Regularized Sparse Features. 961-976 - Daniel Rudoy, Thomas F. Quatieri, Patrick J. Wolfe:

Time-Varying Autoregressions in Speech: Detection Theory and Applications. 977-989 - Hyeon-Jin Jeon, Tae-Gyu Chang, Sungwook Yu, Sen M. Kuo:

A Narrowband Active Noise Control System With Frequency Corrector. 990-1002 - Emiru Tsunoo, George Tzanetakis

, Nobutaka Ono
, Shigeki Sagayama:
Beyond Timbral Statistics: Improving Music Classification Using Percussive Patterns and Bass Lines. 1003-1014 - Matthew P. Black, Joseph Tepperman, Shrikanth S. Narayanan:

Automatic Prediction of Children's Reading Ability for High-Level Literacy Assessment. 1015-1028 - Dongho Kim, Jin H. Kim, Kee-Eung Kim:

Robust Performance Evaluation of POMDP-Based Dialogue Systems. 1029-1040 - Lifu Wu, Hongsen He, Xiaojun Qiu

:
An Active Impulsive Noise Control Algorithm With Logarithmic Transformation. 1041-1044 - Haohai Sun, Shefeng Yan, U. Peter Svensson:

Robust Minimum Sidelobe Beamforming for Spherical Microphone Arrays. 1045-1051
Volume 19, Number 5, July 2011
- Emily Mower

, Maja J. Mataric, Shrikanth S. Narayanan:
A Framework for Automatic Human Emotion Classification Using Emotion Profiles. 1057-1070 - Kai Yu, Steve J. Young:

Continuous F0 Modeling for HMM Based Statistical Parametric Speech Synthesis. 1071-1079 - Gilles Degottex

, Axel Röbel, Xavier Rodet:
Phase Minimization for Glottal Model Estimation. 1080-1090 - Zhaozhang Jin, DeLiang Wang:

HMM-Based Multipitch Tracking for Noisy and Reverberant Speech. 1091-1102 - Ralf Schlüter

, Markus Nußbaum-Thom, Hermann Ney:
On the Relationship Between Bayes Risk and Word Error Rate in ASR. 1103-1112 - Jerome R. Bellegarda:

A Data-Driven Affective Analysis Framework Toward Naturally Expressive Speech Synthesis. 1113-1122 - Yang Lu, Philipos C. Loizou:

Estimators of the Magnitude-Squared Spectrum and Methods for Incorporating SNR Uncertainty. 1123-1137 - Georg Heigold, Hermann Ney, Patrick Lehnen, Tobias Gass, Ralf Schlüter

:
Equivalence of Generative and Log-Linear Models. 1138-1148 - Aastha Gupta, Thushara D. Abhayapala

:
Three-Dimensional Sound Field Reproduction Using Multiple Circular Loudspeaker Arrays. 1149-1159 - Shasha Xie, Yang Liu:

Using N-Best Lists and Confusion Networks for Meeting Summarization. 1160-1169 - Werayuth Charoenruengkit, Nurgun Erdol:

The Effect of Spectral Estimation on Speech Enhancement Performance. 1170-1179 - I. Yücel Özbek

, Mark Hasegawa-Johnson, Mübeccel Demirekler:
Estimation of Articulatory Trajectories Based on Gaussian Mixture Model (GMM) With Audio-Visual Information Fusion and Dynamic Kalman Smoothing. 1180-1195 - Wei-Ho Tsai, Hao-Ping Lin:

Background Music Removal Based on Cepstrum Transformation for Popular Singer Identification. 1196-1205 - José A. González

, Antonio M. Peinado
, Angel M. Gomez
, José L. Carmona:
Efficient MMSE Estimation and Uncertainty Processing for Multienvironment Robust Speech Recognition. 1206-1220 - Shefeng Yan, Haohai Sun, Xiaochuan Ma, U. Peter Svensson, Chaohuan Hou:

Time-Domain Implementation of Broadband Beamformer in Spherical Harmonics Domain. 1221-1230 - Vladimir Britanak:

On Properties, Relations, and Simplified Implementation of Filter Banks in the Dolby Digital (Plus) AC-3 Audio Coding Standards. 1231-1241 - Geoffroy Peeters:

Spectral and Temporal Periodicity Representations of Rhythm for the Automatic Classification of Music Audio Signal. 1242-1252 - C.-Y. Lin, H.-C. Wang:

Burst Onset Landmark Detection and Its Application to Speech Recognition. 1253-1264 - Pejman Mowlaee

, Mads Græsbøll Christensen
, Søren Holdt Jensen:
New Results on Single-Channel Speech Separation Using Sinusoidal Modeling. 1265-1277 - Stas Tiomkin, David Malah

, Slava Shechtman, Zvi Kons:
A Hybrid Text-to-Speech System That Combines Concatenative and Statistical Synthesis Units. 1278-1288 - Han-Ping Shen, Jui-Feng Yeh, Chung-Hsien Wu

:
Speaker Clustering Using Decision Tree-Based Phone Cluster Models With Multi-Space Probability Distributions. 1289-1300 - T. Etame, Régine Le Bouquin-Jeannès

, Catherine Quinquis, Lætitia Gros, Gérard Faucon:
Towards a New Reference Impairment System in the Subjective Evaluation of Speech Codecs. 1301-1315 - Chengyuan Ma, Chin-Hui Lee:

A Regularized Maximum Figure-of-Merit (rMFoM) Approach to Supervised and Semi-Supervised Learning. 1316-1327 - Fabien Ringeval, Jean Demouy, György Szaszák, Mohamed Chetouani

, L. Robel, Jean Xavier, David Cohen, Monique Plaza:
Automatic Intonation Recognition for the Prosodic Assessment of Language-Impaired Children. 1328-1342 - Emanuele Coviello, Antoni B. Chan

, Gert R. G. Lanckriet:
Time Series Models for Semantic Music Annotation. 1343-1359 - Maider Lehr, Izhak Shafran:

Learning a Discriminative Weighted Finite-State Transducer for Speech Recognition. 1360-1367 - Bram Cornelis, Marc Moonen, Jan Wouters

:
Performance Analysis of Multichannel Wiener Filter-Based Noise Reduction in Hearing Aids Under Second Order Statistics Estimation Errors. 1368-1381 - Anthony Griffin

, Toni Hirvonen
, Christos Tzagkarakis, Athanasios Mouchtaris, Panagiotis Tsakalides
:
Single-Channel and Multi-Channel Sinusoidal Audio Coding Using Compressed Sensing. 1382-1395 - Jibran Yousafzai

, Peter Sollich
, Zoran Cvetkovic, Bin Yu:
Combined Features and Kernel Design for Noise Robust Phoneme Classification Using Support Vector Machines. 1396-1407 - Xing Fan, John H. L. Hansen:

Speaker Identification Within Whispered Speech Audio Streams. 1408-1421 - Peter Birkholz

, Bernd J. Kröger
, Christiane Neuschaefer-Rube
:
Model-Based Reproduction of Articulatory Trajectories for Consonant-Vowel Sequences. 1422-1433 - Wooil Kim, John H. L. Hansen:

A Novel Mask Estimation Method Employing Posterior-Based Representative Mean Estimate for Missing-Feature Speech Recognition. 1434-1443 - Kishore Prahallad, Alan W. Black:

Segmentation of Monologues in Audio Books for Building Synthetic Voices. 1444-1449
Volume 19, Number 6, August 2011
- Hiroshi Saruwatari, Yohei Ishikawa, Yu Takahashi

, Takayuki Inoue, Kiyohiro Shikano, Kazunobu Kondo:
Musical Noise Controllable Algorithm of Channelwise Spectral Subtraction and Adaptive Beamforming Based on Higher Order Statistics. 1457-1466 - Andrea Andò:

Conversion of Multichannel Sound Signal Maintaining Physical Properties of Sound in Reproduced Sound Field. 1467-1475 - Antonio Miguel, Alfonso Ortega

, Luis Buera, Eduardo Lleida
:
Bayesian Networks for Discrete Observation Distributions in Speech Recognition. 1476-1489 - Anthony Lombard, Yuanhang Zheng, Herbert Buchner, Walter Kellermann:

TDOA Estimation for Multiple Sound Sources in Noisy and Reverberant Environments Using Broadband Independent Component Analysis. 1490-1503 - Dimitrios Dimitriadis, Petros Maragos, Alexandros Potamianos:

On the Effects of Filterbank Design and Energy Computation on Robust Speech Recognition. 1504-1516 - Ciira Wa Maina

, John MacLaren Walsh:
Joint Speech Enhancement and Speaker Identification Using Approximate Bayesian Inference. 1517-1529 - Stefania Cecchi

, Laura Romoli, Paolo Peretti, Francesco Piazza:
A Combined Psychoacoustic Approach for Stereo Acoustic Echo Cancellation. 1530-1539 - A. Levy, Sharon Gannot

, Emanuël A. P. Habets
:
Multiple-Hypothesis Extended Particle Filter for Acoustic Source Localization in Reverberant Environments. 1540-1555 - H. D. Tran, Haizhou Li

:
Sound Event Recognition With Probabilistic Distance SVMs. 1556-1568 - Stefan Hahn, Marco Dinarelli, Christian Raymond, Fabrice Lefèvre, Patrick Lehnen, Renato de Mori, Alessandro Moschitti

, Hermann Ney, Giuseppe Riccardi:
Comparing Stochastic Approaches to Spoken Language Understanding in Multiple Languages. 1569-1583 - Ronen Talmon, Israel Cohen, Sharon Gannot

:
Transient Noise Reduction Using Nonlocal Diffusion Filters. 1584-1599 - Ke Hu, DeLiang Wang:

Unvoiced Speech Segregation From Nonspeech Interference via CASA and Spectral Subtraction. 1600-1609 - Fabrizio Argenti

, Paolo Nesi
, Gianni Pantaleo
:
Automatic Transcription of Polyphonic Music Based on the Constant-Q Bispectral Analysis. 1610-1630 - Hyunson Seo, Chi-Sang Jung, Hong-Goo Kang:

Robust Session Variability Compensation for SVM Speaker Verification. 1631-1641 - Ibrahim Almajai, Ben Milner:

Visually Derived Wiener Filters for Speech Enhancement. 1642-1651 - Dalei Wu, Yan Yin, Hui Jiang:

Large-Margin Estimation of Hidden Markov Models With Second-Order Cone Programming for Speech Recognition. 1652-1664 - Haitian Xu, Mark J. F. Gales, K. K. Chin:

Joint Uncertainty Decoding With Predictive Methods for Noise Robust Speech Recognition. 1665-1676 - Roy Wallace, Brendan Baker, Robbie Vogt, Sridha Sridharan:

Discriminative Optimization of the Figure of Merit for Phonetic Spoken Term Detection. 1677-1687 - Peter Grosche, Meinard Müller

:
Extracting Predominant Local Pulse Information From Music Recordings. 1688-1701 - Yao Qian, Zhizheng Wu, Boyang Gao, Frank K. Soong:

Improved Prosody Generation by Maximizing Joint Probability of State and Longer Units. 1702-1710 - Yan Jennifer Wu, Thushara D. Abhayapala

:
Spatial Multizone Soundfield Reproduction: Theory and Design. 1711-1720 - Mathieu Parvaix, Laurent Girin:

Informed Source Separation of Linear Instantaneous Under-Determined Audio Mixtures by Source Index Embedding. 1721-1733 - Jacob Benesty

, Constantin Paleologu, Silviu Ciochina:
On Regularization in Adaptive Filtering. 1734-1742 - Mehdi Bekrani, Andy W. H. Khong, Mojtaba Lotfizad:

A Linear Neural Network-Based Approach to Stereophonic Acoustic Echo Cancellation. 1743-1753 - Geoffroy Peeters, Hélène Papadopoulos:

Simultaneous Beat and Downbeat-Tracking Using a Probabilistic Framework: Theory and Large-Scale Evaluation. 1754-1769 - Takayuki Inoue, Hiroshi Saruwatari, Yu Takahashi

, Kiyohiro Shikano, Kazunobu Kondo:
Theoretical Analysis of Musical Noise in Generalized Spectral Subtraction Based on Higher Order Statistics. 1770-1779 - Ki-Seung Lee, Seok-Pil Lee:

A Relevant Distance Criterion for Interpolation of Head-Related Transfer Functions. 1780-1790 - Qi Li, Yan Huang:

An Auditory-Based Feature Extraction Algorithm for Robust Speaker Identification Under Mismatched Conditions. 1791-1801 - Pasi Saari, Tuomas Eerola

, Olivier Lartillot:
Generalizability and Simplicity as Criteria in Feature Selection: Application to Mood Classification in Music. 1802-1812 - Saikat Chatterjee, W. Bastiaan Kleijn

:
Auditory Model-Based Design and Optimization of Feature Vectors for Automatic Speech Recognition. 1813-1825 - Mehdi Bekrani, Andy W. H. Khong, Mojtaba Lotfizad:

A Clipping-Based Selective-Tap Adaptive Filtering Approach to Stereophonic Acoustic Echo Cancellation. 1826-1836 - Hélène Lachambre, Régine André-Obrecht, Julien Pinquier

:
Distinguishing Monophonies From Polyphonies Using Weibull Bivariate Distributions. 1837-1842 - Joshua D. Reiss:

Design of Audio Parametric Equalizer Filters Directly in the Digital Domain. 1843-1848
Volume 19, Number 7, September 2011
- Guruprasad Seshadri, Bayya Yegnanarayana:

Performance of an Event-Based Instantaneous Fundamental Frequency Estimator for Distant Speech Signals. 1853-1864 - Yegui Xiao:

A New Efficient Narrowband Active Noise Control System and its Performance Analysis. 1865-1874 - Sheng-yi Kong, Lin-Shan Lee:

Semantic Analysis and Organization of Spoken Documents Based on Parameters Derived From Latent Topics. 1875-1889 - Taufiq Hasan

, John H. L. Hansen:
A Study on Universal Background Model Training in Speaker Verification. 1890-1899 - Nilesh Madhu

, Rainer Martin
:
A Versatile Framework for Speaker Separation Using a Model-Based Speaker Localization Approach. 1900-1912 - Vikramjit Mitra, Hosung Nam, Carol Y. Espy-Wilson, Elliot Saltzman, Louis Goldstein:

Articulatory Information for Noise Robust Speech Recognition. 1913-1924 - Qiang Huang, Stephen J. Cox:

Inferring the Structure of a Tennis Game Using Audio Information. 1925-1937 - Maria E. Markaki

, Yannis Stylianou:
Voice Pathology Detection and Discrimination Based on Modulation Spectral Features. 1938-1948 - Eleftheria Georganti, Tobias May

, Steven van de Par, Aki Härmä
, John Mourjopoulos:
Speaker Distance Detection Using a Single Microphone. 1949-1961 - Tacksung Choi, Young-Cheol Park, Dae Hee Youn, Seok-Pil Lee:

Virtual Sound Rendering in a Stereophonic Loudspeaker Setup. 1962-1974 - Gil Dobry, Ron M. Hecht, Mireille Avigal, Yaniv Zigel

:
Supervector Dimension Reduction for Efficient Speaker Age Estimation Based on the Acoustic Speech Signal. 1975-1985 - Jesper Kjær Nielsen

, Mads Græsbøll Christensen
, Ali Taylan Cemgil
, Simon J. Godsill, Søren Holdt Jensen:
Bayesian Interpolation and Parameter Estimation in a Dynamic Sinusoidal Model. 1986-1998 - Sungwoong Kim, Sungrack Yun, Chang D. Yoo:

Large Margin Discriminative Semi-Markov Model for Phonetic Recognition. 1999-2012 - Juan Pablo Bello

:
Measuring Structural Similarity in Music. 2013-2025 - Iain McCowan, David Dean, Mitchell McLaren, Robert Vogt, Sridha Sridharan:

The Delta-Phase Spectrum With Application to Voice Activity Detection and Speaker Recognition. 2026-2038 - Chris Hummersone, Russell Mason

, Tim Brookes:
Ideal Binary Mask Ratio: A Novel Metric for Assessing Binary-Mask-Based Sound Source Separation Algorithms. 2039-2045 - Valentin Emiya

, Emmanuel Vincent, Niklas Harlander, Volker Hohmann:
Subjective and Objective Quality Assessment of Audio Source Separation. 2046-2057 - Muhammad Tahir Akhtar

, Wataru Mitsuhashi:
Improving Performance of Hybrid Active Noise Control Systems for Uncorrelated Narrowband Disturbances. 2058-2066 - Jort F. Gemmeke, Tuomas Virtanen

, Antti Hurmalainen:
Exemplar-Based Sparse Representations for Noise Robust Automatic Speech Recognition. 2067-2080 - Brian Roark, Margaret Mitchell, John-Paul Hosom, Kristy Hollingshead, Jeffrey A. Kaye

:
Spoken Language Derived Measures for Detecting Mild Cognitive Impairment. 2081-2090 - Jun Du, Yu Hu, Hui Jiang:

Boosted Mixture Learning of Gaussian Mixture Hidden Markov Models Based on Maximum Likelihood for Speech Recognition. 2091-2100 - Nobutaka Ito, Hikaru Shimizu, Nobutaka Ono, Shigeki Sagayama:

Diffuse Noise Suppression Using Crystal-Shaped Microphone Arrays. 2101-2110 - Serajul Haque, Roberto Togneri

, Anthony Zaknich:
An Auditory Motivated Asymmetric Compression Technique for Speech Recognition. 2111-2124 - Cees H. Taal, Richard C. Hendriks, Richard Heusdens, Jesper Jensen:

An Algorithm for Intelligibility Prediction of Time-Frequency Weighted Noisy Speech. 2125-2136 - Ryouichi Nishimura, Parham Mokhtari, Hironori Takemoto, Hiroaki Kato:

An Attempt to Calibrate Headphones for Reproduction of Sound Pressure at the Eardrum. 2137-2145 - Stefano Papetti

, Federico Avanzini
, Davide Rocchesso
:
Numerical Methods for a Nonlinear Impact Model: A Comparative Study With Closed-Form Corrections. 2146-2158 - Mehrez Souden, Jingdong Chen, Jacob Benesty

, Sofiène Affes
:
An Integrated Solution for Online Multichannel Noise Tracking and Reduction. 2159-2169 - Hannu Pulakka, Paavo Alku

:
Bandwidth Extension of Telephone Speech Using a Neural Network and a Filter Bank Implementation for Highband Mel Spectrum. 2170-2183 - Yi-Hsuan Yang, Homer H. Chen:

Prediction of the Distribution of Perceived Music Emotions Using Discrete Samples. 2184-2196 - Behnaz Ghoraani, Sridhar Krishnan

:
Time-Frequency Matrix Feature Extraction and Classification of Environmental Audio Signals. 2197-2209 - Amitai Koretz, Joseph Tabrikian

:
Maximum A Posteriori Probability Multiple-Pitch Tracking Using the Harmonic Model. 2210-2221 - Laurent Oudre

, Yves Grenier, Cédric Févotte:
Chord Recognition by Fitting Rescaled Chroma Vectors to Chord Templates. 2222-2233 - Boaz Rafaely

, Dima Khaykin:
Optimal Model-Based Beamforming and Independent Steering for Spherical Loudspeaker Arrays. 2234-2238 - Mads Græsbøll Christensen

, Søren Holdt Jensen:
New Results on Perceptual Distortion Minimization and Nonlinear Least-Squares Frequency Estimation. 2239-2244
Volume 19, Number 8, November 2011
- Laurent Oudre

, Cédric Févotte, Yves Grenier:
Probabilistic Template-Based Chord Recognition. 2249-2259 - Jacob Benesty

, Jingdong Chen, Yiteng Huang:
Binaural Noise Reduction in the Time Domain With a Stereo Setup. 2260-2272 - Zengli Yang, Yahong Rosa Zheng

, Steven L. Grant:
Proportionate Affine Projection Sign Algorithms for Network Echo Cancellation. 2273-2284 - Jun Du, Qiang Huo:

A Feature Compensation Approach Using High-Order Vector Taylor Series Approximation of an Explicit Distortion Model for Noisy Speech Recognition. 2285-2293 - J. Reed, C.-H. Lee:

Preference Music Ratings Prediction Using Tokenization and Minimum Classification Error Training. 2294-2303 - Han-Wen Hsu, Chi-Min Liu:

Decimation-Whitening Filter in Spectral Band Replication. 2304-2313 - Theodore Petsatodis, Christos Boukis, Fotios Talantzis, Zheng-Hua Tan

, Ramjee Prasad:
Convex Combination of Multiple Statistical Models With Application to VAD. 2314-2327 - Zhaozhang Jin, DeLiang Wang:

Reverberant Speech Segregation Based on Multipitch Tracking and Classification. 2328-2337 - Dogan Can, Murat Saraclar

:
Lattice Indexing for Spoken Term Detection. 2338-2347 - Mikel Peñagarikano

, Amparo Varona
, Luis Javier Rodríguez-Fuentes
, Germán Bordel
:
Improved Modeling of Cross-Decoder Phone Co-Occurrences in SVM-Based Phonotactic Language Recognition. 2348-2363 - Trevor Burton, Rafik A. Goubran:

A Generalized Proportionate Subband Adaptive Second-Order Volterra Filter for Acoustic Echo Cancellation in Changing Environments. 2364-2373 - Yusuke Hioka

, Kenta Niwa, Sumitaka Sakauchi, Ken'ichi Furuya
, Youichi Haneda:
Estimating Direct-to-Reverberant Energy Ratio Using D/R Spatial Correlation Matrix Model. 2374-2384 - Cyril Joder, Slim Essid, Gaël Richard:

A Conditional Random Field Framework for Robust and Scalable Audio-to-Score Matching. 2385-2397 - Richard E. Turner, Maneesh Sahani:

Demodulation as Probabilistic Inference. 2398-2411 - Giovanni L. Sicuranza

, Alberto Carini
:
A Generalized FLANN Filter for Nonlinear Active Noise Control. 2412-2417 - Qun Feng Tan, Panayiotis G. Georgiou

, Shrikanth Narayanan:
Enhanced Sparse Imputation Techniques for a Robust Speech Recognition Front-End. 2418-2429 - Boaz Rafaely

:
Bessel Nulls Recovery in Spherical Microphone Arrays for Time-Limited Signals. 2430-2438 - Fabio Valente, Mathew Magimai-Doss

, Christian Plahl, Suman V. Ravuri, Wen Wang:
Transcribing Mandarin Broadcast Speech Using Multi-Layer Perceptron Acoustic Features. 2439-2450 - Timothy J. Hazen:

MCE Training Techniques for Topic Identification of Spoken Audio Documents. 2451-2460 - Dong Yu, Jinyu Li

, Li Deng:
Calibration of Confidence Measures in Speech Recognition. 2461-2473 - Cong Liu, Yu Hu, Li-Rong Dai, Hui Jiang:

Trust Region-Based Optimization for Maximum Mutual Information Estimation of HMMs in Speech Recognition. 2474-2485 - Simo Särkkä, Antti Huovilainen:

Accurate Discretization of Analog Audio Filters With Application to Parametric Equalizer Design. 2486-2493 - Deyi Xiong

, Min Zhang, Haizhou Li
:
A Maximum-Entropy Segmentation Model for Statistical Machine Translation. 2494-2505 - Magnus Berggren, Markus Borgh, Christian Schüldt

, Fredric Lindström, Ingvar Claesson:
Low-Complexity Network Echo Cancellation Approach for Systems Equipped With External Memory. 2506-2515 - Leonardo O. Nunes, Luiz W. P. Biscainho

, Bowon Lee, Amir Said, Ton Kalker, Ronald W. Schafer:
Degradation Type Classifier for Full Band Speech Contaminated With Echo, Broadband Noise, and Reverberation. 2516-2526 - Jorge I. Marin-Hurtado

, David V. Anderson:
FFT-Based Block Processing in Speech Enhancement: Potential Artifacts and Solutions. 2527-2537 - Sree Hari Krishnan Parthasarathi, Daniel Gatica-Perez

, Hervé Bourlard, Mathew Magimai-Doss:
Privacy-Sensitive Audio Features for Speech/Nonspeech Detection. 2538-2551 - S. R. Mahadeva Prasanna, Gayadhar Pradhan

:
Significance of Vowel-Like Regions for Speaker Verification Under Degraded Conditions. 2552-2565 - Julio Vargas, Steve McLaughlin

:
Speech Analysis and Synthesis Based on Dynamic Modes. 2566-2578 - Bengt Jonas Borgstrom, Abeer Alwan:

A Unified Framework for Designing Optimal STSA Estimators Assuming Maximum Likelihood Phase Equivalence of Speech and Noise. 2579-2590 - Brian King, Les Atlas:

Single-Channel Source Separation Using Complex Matrix Factorization. 2591-2597 - Tara N. Sainath, Bhuvana Ramabhadran, Michael Picheny, David Nahamoo, Dimitri Kanevsky:

Exemplar-Based Sparse Representation Features: From TIMIT to LVCSR. 2598-2613 - Huijun Ding, Ing Yann Soon, Chai Kiat Yeo

:
A DCT-Based Speech Enhancement System With Pitch Synchronous Analysis. 2614-2623 - Dongwen Ying, Yonghong Yan, Jianwu Dang, Frank K. Soong:

Voice Activity Detection Based on an Unsupervised Learning Framework. 2624-2633

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














