


default search action
IEEE Transactions on Audio, Speech and Language Processing, Volume 21
Volume 21, Number 1, January 2013
- Stanislaw Gorlow, Sylvain Marchand:

Informed Audio Source Separation Using Linearly Constrained Spatial Filters. 1-11 - Florin Ghido, Ioan Tabus

:
Sparse Modeling for Lossless Audio Compression. 12-26 - Xiguang Zheng

, Christian H. Ritz
, Jiangtao Xi
:
Encoding Navigable Speech Sources: A Psychoacoustic-Based Analysis-by-Synthesis Approach. 27-36 - Jwu-Sheng Hu, Ming-Tang Lee, Chia-Hsing Yang:

Robust Adaptive Beamformer for Speech Enhancement Using the Second-Order Extended H∞ Filter. 37-48 - Yi-Chin Huang, Chung-Hsien Wu

, Yu-Ting Chao:
Personalized Spectral and Prosody Conversion Using Frame-Based Codeword Distribution and Adaptive CRF. 49-60 - Nilesh Madhu

, Ann Spriet, Sofie Jansen, Raphael Koning, Jan Wouters
:
The Potential for Speech Intelligibility Improvement Using the Ideal Binary Mask and the Ideal Wiener Filter in Single Channel Noise Reduction Systems: Application to Auditory Prostheses. 61-70 - Zafar Rafii, Bryan Pardo:

REpeating Pattern Extraction Technique (REPET): A Simple Method for Music/Voice Separation. 71-82 - Sree Hari Krishnan Parthasarathi, Hervé Bourlard, Daniel Gatica-Perez

:
Wordless Sounds: Robust Speaker Diarization Using Privacy-Preserving Audio Representations. 83-96 - Feng Huang

, Tan Lee
:
Pitch Estimation in Noisy Speech Using Accumulated Peak Spectrum and Sparse Estimation Technique. 97-107 - Néstor Becerra Yoma, Claudio Garretón, Fernando Huenupán

, Ignacio Catalan, Jorge Wuth Sepúlveda:
On Reducing Harmonic and Sampling Distortion in Vocal Tract Length Normalization. 108-119 - Ke Hu, DeLiang Wang:

An Unsupervised Approach to Cochannel Speech Separation. 120-129 - Ronen Talmon, Israel Cohen, Sharon Gannot

:
Single-Channel Transient Interference Suppression With Diffusion Maps. 130-142 - Nima Yousefian, Philipos C. Loizou:

A Dual-Microphone Algorithm That Can Cope With Competing-Talker Scenarios. 143-153 - Miguel Ferrer

, Alberto González
, Maria de Diego
, Gema Pinero
:
Convex Combination Filtered-X Algorithms for Active Noise Control Systems. 154-165 - Kun Han, DeLiang Wang:

Towards Generalizing Classification Based Speech Separation. 166-175 - Nicolas Sturmel, Laurent Daudet:

Informed Source Separation Using Iterative Reconstruction. 176-183 - Ziqiang Shi, Jiqing Han, Tieran Zheng, Shiwen Deng:

Audio Segment Classification Using Online Learning Based Tensor Representation Feature Discrimination. 184-194 - Hai Son Le, Ilya Oparin, Alexandre Allauzen, Jean-Luc Gauvain, François Yvon

:
Structured Output Layer Neural Network Language Models for Speech Recognition. 195-204 - Zhen-Hua Ling, Korin Richmond

, Junichi Yamagishi:
Articulatory Control of HMM-Based Parametric Speech Synthesis Using Feature-Space-Switched Multiple Regression. 205-217
Volume 21, Number 2, February 2013
- Alexandre Trilla, Francesc Alías

:
Sentence-Based Sentiment Analysis for Expressive Text-to-Speech. 223-233 - Paolo Annibale, Jason Filos, Patrick A. Naylor

, Rudolf Rabenstein:
TDOA-Based Speed of Sound Estimation for Air Temperature and Room Geometry Inference. 234-246 - Jung-Woo Choi

, Yang-Hann Kim:
Sound Field Reproduction of a Virtual Source Inside a Loudspeaker Array With Minimal External Radiation. 247-259 - Wei Wu, Mari Ostendorf:

Graph-Based Query Strategies for Active Learning. 260-269 - Yuxuan Wang, Kun Han, DeLiang Wang:

Exploring Monaural Features for Classification-Based Speech Segregation. 270-279 - Yao Qian, Frank K. Soong, Zhi-Jie Yan:

A Unified Trajectory Tiling Approach to High Quality Speech Rendering. 280-290 - Erinç Dikici

, Murat Semerci
, Murat Saraclar
, Ethem Alpaydin
:
Classification and Ranking Approaches to Discriminative Language Modeling for ASR. 291-300 - Boyan Huang, Yegui Xiao, Jinwei Sun, Guo Wei:

A Variable Step-Size FXLMS Algorithm for Narrowband Active Noise Control. 301-312 - Stefano D'Angelo, Jyri Pakarinen, Vesa Välimäki

:
New Family of Wave-Digital Triode Models. 313-321 - Saeed Mosayyebpour, Morteza Esmaeili, T. Aaron Gulliver:

Single-Microphone Early and Late Reverberation Suppression in Noisy Speech. 322-335 - Chengshi Zheng, Hao Liu, Renhua Peng, Xiaodong Li:

A Statistical Analysis of Two-Channel Post-Filter Estimators in Isotropic Noise Fields. 336-342 - Shmulik Markovich Golan, Sharon Gannot

, Israel Cohen:
Distributed Multiple Constraints Generalized Sidelobe Canceler for Fully Connected Wireless Acoustic Sensor Networks. 343-356 - Ian McGraw, Ibrahim Badr, James R. Glass:

Learning Lexicons From Speech Using a Pronunciation Mixture Model. 357-366 - Jonathan William Dennis, Tran Huy Dat, Engsiong Chng

:
Image Feature Representation of the Subband Power Distribution for Robust Sound Event Classification. 367-377 - Nasim Radmanesh, Ian S. Burnett

:
Generation of Isolated Wideband Sound Fields Using a Combined Two-stage Lasso-LS Algorithm. 378-387 - Dong Yu, Li Deng, Frank Seide:

The Deep Tensor Neural Network With Applications to Large Vocabulary Speech Recognition. 388-396 - Manas A. Pathak, Bhiksha Raj:

Privacy-Preserving Speaker Verification and Identification Using Gaussian Mixture Models. 397-406 - Abigail A. Kressner

, David V. Anderson, Christopher J. Rozell:
Evaluating the Generalization of the Hearing Aid Speech Quality Index (HASQI). 407-415 - Sridhar Krishna Nemala, Kailash Patil, Mounya Elhilali

:
A Multistream Feature Framework Based on Bandpass Modulation Filtering for Robust Speech Recognition. 416-426 - Chiong-Ching Lai, Sven Nordholm

, Yee-Hong Leung:
Design of Steerable Spherical Broadband Beamformers With Flexible Sensor Configurations. 427-438 - Antonio Canclini, Fabio Antonacci

, Augusto Sarti, Stefano Tubaro:
Acoustic Source Localization With Distributed Asynchronous Microphone Networks. 439-443 - Stefano Gaiotto:

A Tuning-Less Approach in Secondary Path Modeling in Active Noise Control Systems. 444-448 - Matt Speed, Damian T. Murphy, David M. Howard

:
Three-Dimensional Digital Waveguide Mesh Simulation of Cylindrical Vocal Tract Analogs. 449-455
Volume 21, Number 3, March 2013
- Hongsen He, Lifu Wu, Jing Lu, Xiaojun Qiu

, Jingdong Chen:
Time Difference of Arrival Estimation Exploiting Multichannel Spatio-Temporal Prediction. 463-475 - Shan Liang, Wenju Liu, Wei Jiang:

A New Bayesian Method Incorporating With Local Correlation for IBM Estimation. 476-487 - Stephan Tassart:

Band-Limited Impulse Train Generation Using Sampled Infinite Impulse Responses of Analog Filters. 488-497 - Xin Chen, Yunxin Zhao:

Building Acoustic Model Ensembles by Data Sampling With Enhanced Trainings and Features. 498-507 - Simone Spagnol

, Michele Geronazzo
, Federico Avanzini
:
On the Relation Between Pinna Reflection Patterns and Head-Related Transfer Function Features. 508-519 - Vipul Arora

, Laxmidhar Behera
:
On-Line Melody Extraction From Polyphonic Audio Using Harmonic Cluster Tracking. 520-530 - Meinard Müller

, Nanzhu Jiang, Peter Grosche:
A Robust Fitness Measure for Capturing Repetitions in Music Recordings With Applications to Audio Thumbnailing. 531-543 - Shi-Xiong Zhang, Mark J. F. Gales:

Structured SVMs for Automatic Speech Recognition. 544-555 - Daniel Erro, Eva Navas

, Inma Hernáez
:
Parametric Voice Conversion Based on Bilinear Frequency Warping Plus Amplitude Scaling. 556-566 - Shuhua Zhang

, Laurent Girin:
Fast and Accurate Direct MDCT to DFT Conversion With Arbitrary Window Functions. 567-578 - Ladan Baghai-Ravary:

The Inherent Temporal Precision of Phoneme Transitions. 579-586 - Matt Shannon, Heiga Zen

, William Byrne:
Autoregressive Models for Statistical Parametric Speech Synthesis. 587-597 - Jesper Kjær Nielsen

, Mads Græsbøll Christensen
, Søren Holdt Jensen:
Default Bayesian Estimation of the Fundamental Frequency. 598-610 - Ziqiang Shi, Jiqing Han, Tieran Zheng, Ji Li:

Identification of Objectionable Audio Segments Based on Pseudo and Heterogeneous Mixture Models. 611-623 - José A. González

, Antonio M. Peinado
, Ning Ma
, Angel M. Gomez
, Jon Barker
:
MMSE-Based Missing-Feature Reconstruction With Temporal Modeling for Robust Speech Recognition. 624-635 - Bassam Jabaian, Laurent Besacier, Fabrice Lefèvre:

Comparison and Combination of Lightly Supervised Approaches for Language Portability of a Spoken Language Understanding System. 636-648 - Renxian Zhang, Wenjie Li

, Dehong Gao, Ouyang You:
Automatic Twitter Topic Summarization With Speech Acts. 649-658 - Weibin Zhang, Pascale Fung:

Sparse Inverse Covariance Matrices for Low Resource Speech Recognition. 659-668 - Jonathan Botts, José Escolano, Ning Xiang:

Design of IIR Filters With Bayesian Model Selection and Parameter Estimation. 669-674 - Kais Khaldi

, Abdel-Ouahab Boudraa
:
Audio Watermarking Via EMD. 675-680
Volume 21, Number 4, April 2013
- Shoichi Koyama

, Ken'ichi Furuya
, Yusuke Hiwasaki, Yoichi Haneda:
Analytical Approach to Wave Field Reconstruction Filtering in Spatio-Temporal Frequency Domain. 685-696 - Xiao-Lei Zhang, Ji Wu:

Deep Belief Networks Based Voice Activity Detection. 697-710 - Emmanuel Ravelli, Vinay Melkote, Tejaswi Nanjundaswamy, Kenneth Rose:

Joint Optimization of Base and Enhancement Layers in Scalable Audio Coding. 711-724 - Cemil Demir, Murat Saraclar

, Ali Taylan Cemgil
:
Single-Channel Speech-Music Separation for Robust ASR With Mixture Models. 725-736 - Athanasia Zlatintsi, Petros Maragos:

Multiscale Fractal Analysis of Musical Instrument Signals With Application to Recognition. 737-748 - Shakeel Ahmed, Muhammad Tahir Akhtar

, Xi Zhang:
Robust Auxiliary-Noise-Power Scheduling in Active Noise Control Systems With Online Secondary Path Modeling. 749-761 - Yakun Hu, Dapeng Wu

, Antonio Nucci:
Fuzzy-Clustering-Based Decision Tree Approach for Large Population Speaker Identification. 762-774 - Nicki Holighaus

, Monika Dörfler
, Gino Angelo M. Velasco
, Thomas Grill:
A Framework for Invertible, Real-Time Constant-Q Transforms. 775-785 - Sabato Marco Siniscalchi

, Torbjørn Svendsen
, Chin-Hui Lee:
A Bottom-Up Modular Search Approach to Large Vocabulary Continuous Speech Recognition. 786-797 - Jihoon Park, Kwang-Ki Kim, Minsoo Hahn:

Vocal Removal From Multiobject Audio Using Harmonic Information for Karaoke Service. 798-805 - John Woodruff, DeLiang Wang:

Binaural Detection, Localization, and Segregation in Reverberant Environments Based on Joint Pitch and Azimuth Cues. 806-815 - Carlos Vaquero

, Alfonso Ortega
, Antonio Miguel, Eduardo Lleida
:
Quality Assessment for Speaker Diarization and Its Application in Speaker Characterization. 816-827 - Alireza Masnadi-Shirazi, Bhaskar D. Rao:

An ICA-SCT-PHD Filter Approach for Tracking and Separation of Unknown Time-Varying Number of Sources. 828-841 - Taufiq Hasan

, John H. L. Hansen:
Acoustic Factor Analysis for Robust Speaker Verification. 842-853 - Gayadhar Pradhan

, S. R. Mahadeva Prasanna:
Speaker Verification by Vowel and Nonvowel Like Segmentation. 854-867 - Shing-Chow Chan, Y. J. Chu, Z. G. Zhang:

A New Variable Regularized Transform Domain NLMS Adaptive Filtering Algorithm - Acoustic Applications and Performance Analysis. 868-878 - Nicolas Ellaham, Christian Giguère, Wail Gueaieb:

Evaluation of the Phase-Inversion Signal Separation Method When Using Nonlinear Hearing Aids. 879-888
Volume 21, Number 5, May 2013
- Guangzhao Bao, Zhongfu Ye, Xu Xu, Yingyue Zhou:

A Compressed Sensing Approach to Blind Separation of Speech Mixture Based on a Two-Layer Sparsity Model. 899-906 - Shing-Chow Chan, Y. J. Chu, Z. G. Zhang, Kai Man Tsui:

A New Variable Regularized QR Decomposition-Based Recursive Least M-Estimate Algorithm - Performance Analysis and Acoustic Applications. 907-922 - Jesper Rindom Jensen

, Mads Græsbøll Christensen
, Søren Holdt Jensen:
Nonlinear Least Squares Methods for Joint DOA and Pitch Estimation. 923-933 - Sandro Cumani, Pietro Laface:

Memory and Computation Trade-Offs for Efficient I-Vector Extraction. 934-944 - Emanuël A. P. Habets

, Jacob Benesty
:
A Two-Stage Beamforming Approach for Noise Reduction and Dereverberation. 945-958 - Marco Liuni, Axel Röbel, Ewa Matusiak

, Marco Romito
, Xavier Rodet:
Automatic Adaptation of the Time-Frequency Resolution for Sound Analysis and Re-Synthesis. 959-970 - Hiroshi Sawada, Hirokazu Kameoka, Shoko Araki

, Naonori Ueda:
Multichannel Extensions of Non-Negative Matrix Factorization With Complex-Valued Data. 971-982 - Robert M. Nickel, Ramón Fernandez Astudillo

, Dorothea Kolossa
, Rainer Martin
:
Corpus-Based Speech Enhancement With Uncertainty Modeling and Cepstral Smoothing. 983-997 - Nasser Mohammadiha, Arne Leijon:

Nonnegative HMM for Babble Noise Derived From Speech HMM: Application to Speech Enhancement. 998-1011 - Wei Rao, Man-Wai Mak:

Boosting the Performance of I-Vector Based Speaker Verification via Utterance Partitioning. 1012-1022 - Ramón Fernandez Astudillo

, Reinhold Orglmeister:
Computing MMSE Estimates and Residual Uncertainty Directly in the Feature Domain of ASR using STFT Domain Speech Distortion Models. 1023-1034 - Petko Nikolov Petkov, Gustav Eje Henter, W. Bastiaan Kleijn

:
Maximizing Phoneme Recognition Accuracy for Enhanced Speech Intelligibility in Noise. 1035-1045 - Maciej Niedzwiecki, Marcin Ciolek:

Elimination of Impulsive Disturbances From Archive Audio Signals Using Bidirectional Processing. 1046-1059 - Li Deng, Xiao Li:

Machine Learning Paradigms for Speech Recognition: An Overview. 1060-1089 - Coskun Mermer

, Murat Saraclar
, Ruhi Sarikaya:
Improving Statistical Machine Translation Using Bayesian Word Alignment and Gibbs Sampling. 1090-1101 - Xiaodan Zhu, Colin Cherry, Gerald Penn

:
A Graph-Partitioning Framework for Aligning Hierarchical Topic Structures to Presentations. 1102-1112 - Romain Serizel, Marc Moonen, Jan Wouters

, Søren Holdt Jensen:
Binaural Integrated Active Noise Control and Noise Reduction in Hearing Aids. 1113-1118
Volume 21, Number 6, June 2013
- Emanuël A. P. Habets

, Jacob Benesty
:
Multi-Microphone Noise Reduction Based on Orthogonal Noise Signal Decompositions. 1123-1133 - Ping Xu, Pascale Fung:

Cross-Lingual Language Modeling for Low-Resource Speech Recognition. 1134-1144 - Dongwen Ying, Yonghong Yan:

Noise Estimation Using a Constrained Sequential Hidden Markov Model in the Log-Spectral Domain. 1145-1157 - Ciprian Chelba, Peng Xu, Fernando Pereira, Thomas Richardson:

Large Scale Distributed Acoustic Modeling With Back-Off ℕ-Grams. 1158-1169 - John Kane, Christer Gobl

:
Wavelet Maxima Dispersion for Breathy to Tense Voice Discrimination. 1170-1179 - Bin Zhang, Alex Marin, Brian Hutchinson

, Mari Ostendorf:
Learning Phrase Patterns for Text Classification. 1180-1189 - Ilker Bayram, Mustafa E. Kamasak

:
A Simple Prior for Audio Signals. 1190-1200 - Lars-Johan Brännmark, Adrian Bahne, Anders Ahlén:

Compensation of Loudspeaker-Room Responses in a Robust MIMO Control Framework. 1201-1216 - Sandro Cumani, Niko Brummer, Lukás Burget

, Pietro Laface, Oldrich Plchot, Vasileios Vasilakakis:
Pairwise Discriminative Speaker Verification in the 𝕀-Vector Space. 1217-1227 - Ki-Seung Lee:

Position-Dependent Crosstalk Cancellation Using Space Partitioning. 1228-1239 - Yusuke Hioka

, Ken'ichi Furuya
, Kazunori Kobayashi, Kenta Niwa, Yoichi Haneda:
Underdetermined Sound Source Separation Using Power Spectrum Density Estimated by Combination of Directivity Gain. 1240-1250 - Benjamin Lecouteux, Georges Linarès, Yannick Estève, Guillaume Gravier:

Dynamic Combination of Automatic Speech Recognition Systems by Driven Decoding. 1251-1260 - Saman Mousazadeh, Israel Cohen:

Voice Activity Detection in Presence of Transient Noise Using Spectral Clustering. 1261-1271 - Hung-yi Lee, Lin-Shan Lee:

Enhanced Spoken Term Detection Using Support Vector Machines and Weighted Pseudo Examples. 1272-1284 - Tom Ko, Brian Mak

:
Eigentriphones for Context-Dependent Acoustic Modeling. 1285-1294 - David Rybach, Hermann Ney, Ralf Schlüter:

Lexical Prefix Tree and WFST: A Comparison of Two Dynamic Search Concepts for LVCSR. 1295-1307
Volume 21, Number 7, 2013
- Charles Verron, Philippe-Aubert Gauthier

, Jennifer Langlois, Catherine Guastavino
:
Spectral and Spatial Multichannel Analysis/Synthesis of Interior Aircraft Sounds. 1317-1329 - Chun-an Chan, Lin-Shan Lee:

Model-Based Unsupervised Spoken Term Detection with Spoken Queries. 1330-1342 - Jingdong Chen, Jacob Benesty

:
On the Time-Domain Widely Linear LCMV Filter for Noise Reduction With a Stereo System. 1343-1354 - Ji Ming, Ramji Srinivasan, Danny Crookes, Ayeh Jafari:

CLOSE - A Data-Driven Approach to Speech Separation. 1355-1368 - Masahito Togami, Yohei Kawaguchi

, Ryu Takeda
, Yasunari Obuchi, Nobuo Nukaga:
Optimized Speech Dereverberation From Probabilistic Perspective for Time Varying Acoustic Transfer Function. 1369-1380 - Yuxuan Wang, DeLiang Wang:

Towards Scaling Up Classification-Based Speech Separation. 1381-1390 - Simon Arberet, Pierre Vandergheynst, Rafael E. Carrillo

, Jean-Philippe Thiran
, Yves Wiaux
:
Sparse Reverberant Audio Source Separation via Reweighted Analysis. 1391-1402 - Shuhua Zhang

, Weibei Dou, Huazhong Yang:
MDCT Sinusoidal Analysis for Audio Signals Analysis and Processing. 1403-1414 - Balaji Vasan Srinivasan, Yuancheng Luo, Daniel Garcia-Romero, Dmitry N. Zotkin, Ramani Duraiswami

:
A Symmetric Kernel Partial Least Squares Framework for Speaker Recognition. 1415-1423 - Xiaoyan Cai, Wenjie Li

:
Ranking Through Clustering: An Integrated Approach to Multi-Document Summarization. 1424-1433 - Stanislaw Gorlow, Joshua D. Reiss:

Model-Based Inversion of Dynamic Range Compression. 1434-1444 - Matthew C. McCallum, Bernard J. Guillemin:

Stochastic-Deterministic MMSE STFT Speech Enhancement With General A Priori Information. 1445-1457 - Ali Hassan

, Robert I. Damper, Mahesan Niranjan
:
On Acoustic Emotion Recognition: Compensating for Covariate Shift. 1458-1468 - Fei Liu, Yang Liu:

Towards Abstractive Speech Summarization: Exploring Unsupervised and Supervised Approaches for Spoken Utterance Compression. 1469-1480 - Vesa Välimäki

, Heidi-Maria Lehtonen, Marko Takanen
:
A Perceptual Study on Velvet Noise and Its Variants at Different Pulse Densities. 1481-1488 - Olivier Derrien

, Roland Badeau, Gaël Richard:
Parametric Audio Coding With Exponentially Damped Sinusoids. 1489-1501 - Danilo Comminiello

, Michele Scarpiniti
, Luis Antonio Azpicueta-Ruiz
, Jerónimo Arenas-García
, Aurelio Uncini
:
Functional Link Adaptive Filters for Nonlinear Acoustic Echo Cancellation. 1502-1512 - Shmulik Markovich Golan, Sharon Gannot

, Israel Cohen:
Performance of the SDW-MWF With Randomly Located Microphones in a Reverberant Enclosure. 1513-1523 - Stefan Bilbao:

Modeling of Complex Geometries and Boundary Conditions in Finite Difference/Finite Volume Time Domain Room Acoustics Simulation. 1524-1533
Volume 21, Number 8, August 2013
- Constantin Paleologu, Jacob Benesty

, Silviu Ciochina:
Study of the General Kalman Filter for Echo Cancellation. 1539-1549 - Anaïk Olivero, Bruno Torrésani

, Richard Kronland-Martinet
:
A Class of Algorithms for Time-Frequency Multiplier Estimation. 1550-1559 - Olaf Schleusing, Tomi Kinnunen, Brad H. Story, Jean-Marc Vesin:

Joint Source-Filter Optimization for Accurate Vocal Tract Estimation Using Differential Evolution. 1560-1572 - Dumidu S. Talagala, Wen Zhang, Thushara D. Abhayapala

:
Broadband DOA Estimation Using Sensor Arrays on Complex-Shaped Rigid Bodies. 1573-1585 - Jiajun Zhang, Feifei Zhai, Chengqing Zong

:
Syntax-Based Translation With Bilingually Lexicalized Synchronous Tree Substitution Grammars. 1586-1597 - Chang Woo Han, Shin Jae Kang, Nam Soo Kim:

Reverberation and Noise Robust Feature Compensation Based on IMM. 1598-1611 - Anoop Deoras, Gökhan Tür

, Ruhi Sarikaya, Dilek Hakkani-Tür
:
Joint Discriminative Decoding of Words and Semantic Tags for Spoken Language Understanding. 1612-1621 - Ville Hautamäki

, Tomi Kinnunen, Filip Sedlak, Kong-Aik Lee
, Bin Ma, Haizhou Li
:
Sparse Classifier Fusion for Speaker Verification. 1622-1631 - Sarthak Khanal, Harvey F. Silverman, Rahul R. Shakya:

A Free-Source Method (FrSM) for Calibrating a Large-Aperture Microphone Array. 1632-1639 - Volker Leutnant, Alexander Krueger, Reinhold Haeb-Umbach

:
Bayesian Feature Enhancement for Reverberation and Noise Robust Speech Recognition. 1640-1652 - Enzo De Sena

, Hüseyin Hacihabiboglu
, Zoran Cvetkovic:
Analysis and Design of Multichannel Systems for Perceptual Sound Field Reconstruction. 1653-1665 - Marcelo F. Caetano

, Xavier Rodet:
Musical Instrument Sound Morphing Guided by Perceptually Motivated Features. 1666-1675 - Bin Cheng, Christian H. Ritz

, Ian S. Burnett
, Xiguang Zheng
:
A General Compression Approach to Multi-Channel Three-Dimensional Audio. 1676-1688 - Weifeng Li, Longbiao Wang, Yicong Zhou

, Hervé Bourlard, Qingmin Liao:
Robust Log-Energy Estimation and its Dynamic Change Enhancement for In-car Speech Recognition. 1689-1698 - Alexey Ozerov, Antoine Liutkus, Roland Badeau, Gaël Richard:

Coding-Based Informed Source Separation: Nonnegative Tensor Factorization Approach. 1699-1712 - David Imseng, Hervé Bourlard, John Dines, Philip N. Garner

, Mathew Magimai-Doss
:
Applying Multi- and Cross-Lingual Stochastic Phone Space Transformations to Non-Native Speech Recognition. 1713-1726 - Eleftheria Georganti, Tobias May

, Steven van de Par, John Mourjopoulos:
Sound Source Distance Estimation in Rooms based on Statistical Properties of Binaural Signals. 1727-1741 - Michael Wohlmayr, Franz Pernkopf

:
Model-Based Multiple Pitch Tracking Using Factorial HMMs: Model Adaptation and Inference. 1742-1754 - Ron M. Hecht, Elad Noor, Gil Dobry, Yaniv Zigel

, Aharon Bar-Hillel
, Naftali Tishby:
Effective Model Representation by Information Bottleneck Principle. 1755-1759 - Luis Weruaga, Leonid Dimitrov:

The Spectral Nature of Maximum Likelihood Noise Compensated Linear Prediction. 1760-1765
Volume 21, Number 9, September 2013
- Zhanyu Ma, Arne Leijon, W. Bastiaan Kleijn

:
Vector quantization of LSF parameters with a mixture of dirichlet distributions. 1777-1790 - Liang Lu, K. K. Chin, Arnab Ghoshal, Stephen Renals

:
Joint Uncertainty Decoding for Noise Robust Subspace Gaussian Mixture Models. 1791-1804 - Dimitrios Giannoulis, Anssi Klapuri:

Musical Instrument Recognition in Polyphonic Audio Using Missing Feature Approach. 1805-1817 - Freddy William, Abhijeet Sangwan, John H. L. Hansen:

Automatic Accent Assessment Using Phonetic Mismatch and Human Perception. 1818-1829 - Stanislaw Andrzej Raczynski, Emmanuel Vincent, Shigeki Sagayama:

Dynamic Bayesian Networks for Symbolic Polyphonic Pitch Modeling. 1830-1840 - Raymond W. M. Ng, Tan Lee

, Cheung-Chi Leung, Bin Ma, Haizhou Li
:
Spoken Language Recognition With Prosodic Features. 1841-1853 - Benoit Fuentes, Roland Badeau, Gaël Richard:

Harmonic Adaptive Latent Component Analysis of Audio and Application to Music Transcription. 1854-1866 - Jose Manuel Gil-Cacho, Marco Signoretto, Toon van Waterschoot

, Marc Moonen, Søren Holdt Jensen:
Nonlinear Acoustic Echo Cancellation Based on a Sliding-Window Leaky Kernel Affine Projection Algorithm. 1867-1878 - Ina Kodrasi

, Stefan Goetze
, Simon Doclo
:
Regularization for Partial Multichannel Equalization for Speech Dereverberation. 1879-1890 - Roman Scharrer, Michael Vorländer

:
Sound Field Classification in Small Microphone Arrays Using Spatial Coherences. 1891-1899 - Muhammad Salman Khan

, Syed M. Naqvi
, Ata ur-Rehman
, Wenwu Wang, Jonathon A. Chambers:
Video-Aided Model-Based Source Separation in Real Reverberant Rooms. 1900-1912 - Mehrez Souden, Shoko Araki

, Keisuke Kinoshita
, Tomohiro Nakatani, Hiroshi Sawada:
A Multichannel MMSE-Based Framework for Speech Source Separation and Noise Reduction. 1913-1928 - Matías Zanartu

, Julio C. Ho, Daryush D. Mehta, Robert E. Hillman, George R. Wodicka:
Subglottal Impedance-Based Inverse Filtering of Voiced Sounds Using Neck Surface Acceleration. 1929-1939 - Alex Southern, Samuel Siltanen, Damian T. Murphy, Lauri Savioja:

Room Impulse Response Synthesis and Validation Using a Hybrid Acoustic Model. 1940-1952 - Futoshi Asano, Hideki Asoh

, Kazuhiro Nakadai:
Sound Source Localization Using Joint Bayesian Estimation With a Hierarchical Noise Model. 1953-1965 - Neil Wachowski, Mahmood R. Azimi-Sadjadi:

Characterization of Multiple Transient Acoustical Sources From Time-Transform Representations. 1966-1978 - Cheng-Yuan Chang, Sen M. Kuo:

Complete Parallel Narrowband Active Noise Control Systems. 1979-1986
Volume 21, Number 10, October 2013
- William Hartmann, Arun Narayanan, Eric Fosler-Lussier, DeLiang Wang:

A Direct Masking Approach to Robust ASR. 1993-2005 - Yow-Bang Wang

, Shang-wen Li, Lin-Shan Lee:
An Experimental Analysis on Integrating Multi-Stream Spectro-Temporal, Cepstral and Pitch Information for Mandarin Speech Recognition. 2006-2014 - Stephen Shum, Najim Dehak

, Réda Dehak, James R. Glass:
Unsupervised Methods for Speaker Diarization: An Integrated and Iterative Approach. 2015-2028 - Zbynek Koldovský

, Jirí Málek, Petr Tichavský, Francesco Nesta:
Semi-Blind Noise Extraction Using Partially Known Position of the Target Source. 2029-2041 - Mads Græsbøll Christensen

:
Accurate Estimation of Low Fundamental Frequencies From Real-Valued Measurements. 2042-2056 - Philippe Esling, Carlos Agón:

Multiobjective Time Series Matching for Audio Classification and Retrieval. 2057-2072 - Chao Zhang, Yi Liu, Yunqing Xia, Xuan Wang, Chin-Hui Lee:

Reliable Accent-Specific Unit Generation With Discriminative Dynamic Gaussian Mixture Selection for Multi-Accent Chinese Speech Recognition. 2073-2084 - Gilles Degottex

, Yannis Stylianou:
Analysis and Synthesis of Speech Using an Adaptive Full-Band Harmonic Model. 2085-2095 - Bilei Zhu, Wei Li, Ruijiang Li, Xiangyang Xue:

Multi-Stage Non-Negative Matrix Factorization for Monaural Singing Voice Separation. 2096-2107 - Sadao Hiroya

:
Non-Negative Temporal Decomposition of Speech Parameters by Multiplicative Update Rules. 2108-2117 - Cyril Joder, Slim Essid, Gaël Richard:

Learning Optimal Features for Polyphonic Audio-to-Score Alignment. 2118-2128 - Zhen-Hua Ling, Li Deng, Dong Yu:

Modeling Spectral Envelopes Using Restricted Boltzmann Machines and Deep Belief Networks for Statistical Parametric Speech Synthesis. 2129-2139 - Nasser Mohammadiha, Paris Smaragdis, Arne Leijon:

Supervised and Unsupervised Speech Enhancement Using Nonnegative Matrix Factorization. 2140-2151 - Sabato Marco Siniscalchi

, Jinyu Li
, Chin-Hui Lee:
Hermitian Polynomial for Speaker Adaptation of Connectionist Speech Recognition Systems. 2152-2161 - Nikolay D. Gaubitch, Mike Brookes

, Patrick A. Naylor
:
Blind Channel Magnitude Response Estimation in Speech Using Spectrum Classification. 2162-2171 - Masayuki Suzuki, Takuya Yoshioka, Shinji Watanabe

, Nobuaki Minematsu, Keikichi Hirose:
Feature Enhancement With Joint Use of Consecutive Corrupted and Noise Feature Vectors With Discriminative Region Weighting. 2172-2181 - Takuya Yoshioka, Tomohiro Nakatani:

Noise Model Transfer: Novel Approach to Robustness Against Nonstationary Noise. 2182-2192 - Despoina Pavlidi

, Anthony Griffin
, Matthieu Puigt
, Athanasios Mouchtaris:
Real-Time Multiple Sound Source Localization and Counting Using a Circular Microphone Array. 2193-2206 - Sefki Kolozali, Mathieu Barthet

, György Fazekas, Mark B. Sandler
:
Automatic Ontology Generation for Musical Instruments Based on Audio Analysis. 2207-2220
Volume 21, Number 11, November 2013
- Stephen J. Wright, Dimitri Kanevsky, Li Deng, Xiaodong He, Georg Heigold, Haizhou Li

:
Optimization Algorithms and Applications for Speech and Language Processing. 2231-2243 - Gillian M. Chin, Jorge Nocedal, Peder A. Olsen, Steven J. Rennie:

Second Order Methods for Optimizing Convex Matrix Functions and Sparse Covariance Clustering. 2244-2254 - Theodoros Tsiligkaridis, Etienne Marcheret, Vaibhava Goel

:
A Difference of Convex Functions Approach to Large-Scale Log-Linear Model Estimation. 2255-2266 - Tara N. Sainath, Brian Kingsbury, Hagen Soltau, Bhuvana Ramabhadran:

Optimization Techniques to Improve Training Speed of Deep Neural Networks for Large Speech Tasks. 2267-2276 - Tuomas Virtanen

, Jort Florent Gemmeke, Bhiksha Raj:
Active-Set Newton Algorithm for Overcomplete Non-Negative Representations of Audio. 2277-2289 - Patrick Cardinal, Pierre Dumouchel, Gilles Boulianne

:
Large Vocabulary Speech Recognition on Parallel Architectures. 2290-2300 - Rémi Mignot, Laurent Daudet, François Ollivier

:
Room Reverberation Reconstruction: Interpolation of the Early Part Using Compressed Sensing. 2301-2312 - Min Zhang, Wenliang Chen, Xiangyu Duan, Rong Zhang:

Improving Graph-Based Dependency Parsing Models With Dependency Language Models. 2313-2323 - Florian Pflug, Tim Fingscheidt

:
Robust Ultra-Low Latency Soft-Decision Decoding of Linear PCM Audio. 2324-2336 - Koji Seto, Tokunbo Ogunfunmi

:
Scalable Speech Coding for IP Networks: Beyond iLBC. 2337-2345 - Kenta Niwa, Yusuke Hioka

, Ken'ichi Furuya
, Yoichi Haneda:
Diffused Sensing for Sharp Directive Beamforming. 2346-2355 - Symeon Delikaris-Manias

, Ville Pulkki
:
Cross Pattern Coherence Algorithm for Spatial Filtering Applications Utilizing Microphone Arrays. 2356-2367 - Bai Ying Lei

, Ing Yann Soon, Ee-Leng Tan:
Robust SVD-Based Audio Watermarking Scheme With Differential Evolution Optimization. 2368-2378 - Nikos Malandrakis, Alexandros Potamianos, Elias Iosif, Shrikanth S. Narayanan:

Distributional Semantic Models for Affective Text Analysis. 2379-2392 - Pasi Pertilä, Matti S. Hämäläinen, Mikael Mieskolainen:

Passive Temporal Offset Estimation of Multichannel Recordings of an Ad-Hoc Microphone Array. 2393-2402 - Liang Vincent Wang, Woon-Seng Gan

, Andy W. H. Khong, Sen M. Kuo:
Convergence Analysis of Narrowband Feedback Active Noise Control System With Imperfect Secondary Path Estimation. 2403-2411 - Chi-Man Pun

, Xiaochen Yuan
:
Robust Segments Detector for De-Synchronization Resilient Audio Watermarking. 2412-2424 - Miranti Indar Mandasari, Rahim Saeidi

, Mitchell McLaren, David A. van Leeuwen:
Quality Measure Functions for Calibration of Speaker Recognition Systems in Various Duration Conditions. 2425-2438 - Fabian Triefenbach, Azarakhsh Jalalvand, Kris Demuynck, Jean-Pierre Martens:

Acoustic Modeling With Hierarchical Reservoirs. 2439-2450 - Donghyeon Lee, Minwoo Jeong, Kyungduk Kim, Seonghan Ryu, Gary Geunbae Lee:

Unsupervised Spoken Language Understanding for a Multi-Domain Dialog System. 2451-2464
Volume 21, Number 12, December 2013
- A. P. Prathosh, T. V. Ananthapadmanabha, A. G. Ramakrishnan:

Epoch Extraction Based on Integrated Linear Prediction Residual Using Plosion Index. 2471-2480 - Tomas Dekens, Werner Verhelst:

Body Conducted Speech Enhancement by Equalization and Signal Fusion. 2481-2492 - Dejan Markovic, Fabio Antonacci

, Augusto Sarti, Stefano Tubaro:
Soundfield Imaging in the Ray Space. 2493-2505 - Partha Lal, Simon King

:
Cross-Lingual Automatic Speech Recognition Using Tandem Features. 2506-2515 - Tomohiro Nakatani, Shoko Araki

, Takuya Yoshioka, Marc Delcroix
, Masakiyo Fujimoto:
Dominance Based Integration of Spatial and Spectral Features for Speech Enhancement. 2516-2531 - Yotam Peled, Boaz Rafaely

:
Linearly-Constrained Minimum-Variance Method for Spherical Microphone Arrays Based on Plane-Wave Decomposition of the Sound Field. 2532-2540 - Jean-Louis Durrieu, Jean-Philippe Thiran

:
Source/Filter Factorial Hidden Markov Model, With Application to Pitch and Formant Tracking. 2541-2553 - Katherine Ellis, Emanuele Coviello, Antoni B. Chan

, Gert R. G. Lanckriet:
A Bag of Systems Representation for Music Auto-Tagging. 2554-2569 - Aroor Dinesh Dileep, C. Chandra Sekhar:

HMM Based Intermediate Matching Kernel for Classification of Sequential Patterns of Speech Using Support Vector Machines. 2570-2582 - Oliver Thiergart, Giovanni Del Galdo

, Maja Taseska, Emanuël A. P. Habets
:
Geometry-Based Spatial Sound Acquisition Using Distributed Microphone Arrays. 2583-2594 - Jesper Rindom Jensen

, Jacob Benesty
, Mads Græsbøll Christensen
, Jingdong Chen:
A Class of Optimal Rectangular Filtering Matrices for Single-Channel Signal Enhancement in the Time Domain. 2595-2606 - Yizhao Ni

, Matt McVicar, Raúl Santos-Rodríguez
, Tijl De Bie:
Understanding Effects of Subjectivity in Measuring Chord Estimation Accuracy. 2607-2615 - Georg Heigold, Hermann Ney, Ralf Schlüter

:
Investigations on an EM-Style Optimization Algorithm for Discriminative Training of HMMs. 2616-2626 - Bruno Defraene, Naim Mansour

, Steven De Hertogh, Toon van Waterschoot
, Moritz Diehl, Marc Moonen:
Declipping of Audio Signals Using Perceptual Compressed Sensing. 2627-2637

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














