


IEEE Transactions on Audio, Speech & Language Processing, Volume 15
Volume 15, Number 1, January 2007
- Paris Smaragdis: Convolutive Speech Bases and Their Application to Supervised Speech Separation. 1-12
- Li Deng, Leo J. Lee, Hagai Attias, Alex Acero: Adaptive Kalman Filtering and Smoothing for Tracking Vocal Tract Resonances Using a Continuous-Valued Hidden Dynamic Model. 13-23
- Ben Milner, Xu Shao: Prediction of Fundamental Frequency and Voicing From Mel-Frequency Cepstral Coefficients for Unconstrained Speech Reconstruction. 24-33
- Patrick A. Naylor, Anastasis Kounoudes, Jón Guðnason, Mike Brookes: Estimation of Glottal Closure Instants in Voiced Speech Using the DYPSA Algorithm. 34-43
- Farshad Lahouti, Amir K. Khandani: Soft Reconstruction of Speech in the Presence of Noise and Packet Loss. 44-56
- Sean A. Ramprashad: Sparse Bit-Allocations Based on Partial Ordering Schemes With Application to Speech and Audio Coding. 57-69
- Taesu Kim, Hagai Thomas Attias, Soo-Young Lee, Te-Won Lee: Blind Source Separation Exploiting Higher-Order Frequency Dependencies. 70-79
- Tomohiro Nakatani, Keisuke Kinoshita, Masato Miyoshi: Harmonicity-Based Blind Dereverberation for Single-Channel Speech Signals. 80-95
- Bertrand Rivet, Laurent Girin, Christian Jutten: Mixing Audiovisual Speech Processing and Blind Source Separation for the Extraction of Speech Signals From Convolutive Mixtures. 96-108
- Guangji Shi, Parham Aarabi, Hui Jiang: Phase-Based Dual-Microphone Speech Enhancement Using A Prior Speech Model. 109-118
- Gwo-hwa Ju, Lin-Shan Lee: A Perceptually Constrained GSVD-Based Approach for Enhancing Speech Corrupted by Colored Noise. 119-134
- Steven J. Rennie, Parham Aarabi, Brendan J. Frey: Variational Probabilistic Speech Separation Using Microphone Arrays. 135-149
- Ian R. Lane, Tatsuya Kawahara, Tomoko Matsui, Satoshi Nakamura: Out-of-Domain Utterance Detection Using Classification Confidences of Multiple Topics. 150-161
- Christian Raymond, Frédéric Béchet, Nathalie Camelin, Renato de Mori, Géraldine Damnati: Sequential Decision Strategies for Machine Interpretation of Speech. 162-171
- Scott Axelrod, Vaibhava Goel, Ramesh A. Gopinath, Peder A. Olsen, Karthik Visweswariah: Discriminative Estimation of Subspace Constrained Gaussian Mixture Models for Speech Recognition. 172-189
- Rajesh M. Hegde, Hema A. Murthy, Venkata Ramana Rao Gadde: Significance of the Modified Group Delay Feature in Speech Recognition. 190-202
- Erik McDermott, Timothy J. Hazen, Jonathan Le Roux, Atsushi Nakamura, Shigeru Katagiri: Discriminative Training for Large-Vocabulary Speech Recognition Using Minimum Classification Error. 203-223
- Satya Dharanipragada, Umit H. Yapanel, Bhaskar D. Rao: Robust Feature Extraction for Continuous Speech Recognition Using the MVDR Spectrum Estimation Method. 224-234
- Michael L. Seltzer, Alex Acero: Training Wideband Acoustic Models Using Mixed-Bandwidth Training Data for Speech Recognition. 235-245
- Joe Frankel, Simon King: Speech Recognition Using Linear Dynamic Models. 246-256
- Chia-Ping Chen, Jeff A. Bilmes: MVA Processing of Speech Features. 257-270
- Haizhou Li, Bin Ma, Chin-Hui Lee: A Vector Space Modeling Approach to Spoken Language Identification. 271-284
- Peter Day, Asoke K. Nandi: Robust Text-Independent Speaker Verification Using Genetic Programming. 285-295
- Youngim Jung, Ae-sun Yoon, Hyuk-Chul Kwon: Grapheme-to-Phoneme Conversion of Arabic Numeral Expressions for Embedded TTS Systems. 296-309
- Jan H. Plasberg, W. Bastiaan Kleijn: The Sensitivity Matrix: Using Advanced Auditory Models in Speech and Audio Processing. 310-319
- Ixone Arroabarren, Alfonso Carlosena: Voice Production Mechanisms of Vocal Vibrato in Male Singers. 320-332
- Kazuyoshi Yoshii, Masataka Goto, Hiroshi G. Okuno: Drum Sound Recognition for Polyphonic Audio Signals by Adaptation and Matching of Spectrogram Templates With Harmonic Structure Suppression. 333-345
- Kishan Thambiratnam, Sridha Sridharan: Rapid Yet Accurate Speech Indexing Using Dynamic Match Lattice Spotting. 346-357
- Paris Smaragdis, Petros Boufounos: Position and Trajectory Learning for Microphone Arrays. 358-368
Volume 15, Number 2, February 2007
- Yannis Agiomyrgiannakis, Yannis Stylianou: Conditional Vector Quantization for Speech Coding. 377-386
- Sorin Dusan, James L. Flanagan, Amod Karve, Mridul Balaraman: Speech Compression by Polynomial Approximation. 387-395
- Guoning Hu, DeLiang Wang: Auditory Segmentation Based on Onset and Offset Analysis. 396-405
- Richard C. Hendriks, Richard Heusdens, Jesper Jensen: An MMSE Estimator for Speech Enhancement Under a Combined Stochastic-Deterministic Speech Model. 406-415
- Yoshifumi Nagata, Toyota Fujioka, Masato Abe: Two-Dimensional DOA Estimation of Sound Sources Based on Weighted Wiener Gain Exploiting Two-Directional Microphones. 416-429
- Marc Delcroix, Takafumi Hikichi, Masato Miyoshi: Precise Dereverberation Using Multichannel Linear Prediction. 430-440
- Sriram Srinivasan, Jonas Samuelsson, W. Bastiaan Kleijn: Codebook-Based Bayesian Speech Enhancement for Nonstationary Environments. 441-452
- Rongqing Huang, John H. L. Hansen, Pongtep Angkititrakul: Dialect/Accent Classification Using Unrestricted Audio. 453-464
- Murat Akbacak, John H. L. Hansen: Environmental Sniffing: Noise Knowledge Estimation for Robust Speech Systems. 465-477
- Jian Wu, Qiang Huo: A Study of Minimum Classification Error (MCE) Linear Regression for Supervised Adaptation of MCE-Trained Continuous-Density Hidden Markov Models. 478-488
- Paul D. Teal: Tracking Wide-Band Targets Having Significant Doppler Shift. 489-497
- Pongtep Angkititrakul, John H. L. Hansen: Discriminative In-Set/Out-of-Set Speaker Recognition. 498-508
- Darko Kirovski, Zeph Landau: Generalized Lempel-Ziv Compression for Audio. 509-518
- Tin Lay Nwe, Haizhou Li: Exploring Vibrato-Motivated Acoustic Features for Singer Identification. 519-530
- Nicola Laurenti, Giovanni De Poli, Daniele Montagner: A Nonlinear Method for Stochastic Spectrum Estimation in the Modeling of Musical Sounds. 531-541
- Sunil Bharitkar, Chris Kyriakakis: Visualization of Multiple Listener Room Acoustic Equalization With the Sammon Map. 542-551
- Damian T. Murphy, Mark Beeson: The KW-Boundary Hybrid Digital Waveguide Mesh for Room Acoustics Applications. 552-564
- Ramani Duraiswami, Dmitry N. Zotkin, Nail A. Gumerov: Fast Evaluation of the Room Transfer Function Using Multipole Expansion. 565-576
- Jack Mullen, David M. Howard, Damian T. Murphy: Real-Time Dynamic Articulations in the 2-D Waveguide Mesh Vocal Tract Model. 577-585
- Xu Sun, Sen M. Kuo: Active Narrowband Noise Control Systems Using Cascading Adaptive Filters. 586-592
- Muhammad Tahir Akhtar, Masahide Abe, Masayuki Kawamata: On Active Noise Control Systems With Online Acoustic Feedback Path Modeling. 593-600
- Daniel Gatica-Perez, Guillaume Lathoud, Jean-Marc Odobez, Iain McCowan: Audiovisual Probabilistic Tracking of Multiple Speakers in Meetings. 601-616
- Simon Doclo, Marc Moonen: Superdirective Beamforming Robust Against Microphone Mismatch. 617-631
- Chang-Heon Lee, Sung-Kyo Jung, Hong-Goo Kang: Applying a Speaker-Dependent Speech Compression Technique to Concatenative TTS Synthesizers. 632-640
- K.-S. Lee: Statistical Approach for Voice Personality Transformation. 641-651
- Xiaodong Cui, Abeer Alwan: Robust Speaker Adaptation by Weighted Model Averaging Based on the Minimum Description Length Criterion. 652-660
- M.-Y. Tsai, F.-C. Chou, L.-S. Lee: Pronunciation Modeling With Reduced Confusion for Mandarin Chinese Using a Three-Stage Framework. 661-675
- Qin Yan, Saeed Vaseghi, Dimitrios Rentzos, Ching-Hsiang Ho: Analysis and Synthesis of Formant Spaces of British, Australian, and American Accents. 676-689
- Dagen Wang, Shrikanth S. Narayanan: An Acoustic Measure for Word Prominence in Spontaneous Speech. 690-701
- Zhiyun Li, Ramani Duraiswami: Flexible and Optimal Design of Spherical Microphone Arrays for Beamforming. 702-714
- Mirko Knaak, Shoko Araki, Shoji Makino: Geometrically Constrained Independent Component Analysis. 715-726
- I. Balmages, Boaz Rafaely: Open-Sphere Designs for Spherical Microphone Arrays. 727-732
- Peter Jancovic: Fast Algorithm for Calculation of the Union-Based Probability. 732-734
- Young-Ik Kim, Rhee Man Kil: Estimation of Interaural Time Differences Based on Zero-Crossings in Noisy Multisource Environments. 734-743
Volume 15, Number 3, March 2007
- Pradeepa Yahampath, Paul Rondeau: Multiple-Description Predictive-Vector Quantization With Applications to Low Bit-Rate Speech Coding Over Networks. 749-755
- Ethan Robert Duni, Bhaskar D. Rao: High-Rate Optimized Recursive Vector Quantization Structures Using Hidden Markov Models. 756-769
- Ethan Robert Duni, Bhaskar D. Rao: A High-Rate Optimal Transform Coder With Gaussian Mixture Companders. 770-783
- Brian Kan-Wing Mak, Roger Wend-Huu Hsiao: Kernel Eigenspace-Based MLLR Adaptation. 784-795
- Bertrand Rivet, Laurent Girin, Christian Jutten: Log-Rayleigh Distribution: A Simple and Efficient Statistical Representation of Log-Spectral Coefficients. 796-802
- Patricia Scanlon, Daniel P. W. Ellis, Richard B. Reilly: Using Broad Phonetic Group Experts for Improved Speech Recognition. 803-812
- Barbara Resch, Mattias Nilsson, L. Anders Ekman, W. Bastiaan Kleijn: Estimation of the Instantaneous Pitch of Speech. 813-822
- Francesco Gianfelici, Giorgio Biagetti, Paolo Crippa, Claudio Turchetti: Multicomponent AM-FM Representations: An Asymptotically Exact Approach. 823-837
- Dima Ruinskiy, Yizhar Lavner: An Effective Algorithm for Automatic Detection and Exact Demarcation of Breath Sounds in Speech and Song Signals. 838-850
- Laurent Girin, Mohammad Firouzmand, Sylvain Marchand: Perceptual Long-Term Variable-Rate Sinusoidal Modeling of Speech. 851-861
- Jesper Jensen, Richard Heusdens: Improved Subspace-Based Single-Channel Speech Enhancement Using Generalized Super-Gaussian Priors. 862-872
- Juho Kontio, Laura Laaksonen, Paavo Alku: Neural Network-Based Artificial Bandwidth Expansion of Speech. 873-881
- David Yuheng Zhao, W. Bastiaan Kleijn: HMM-Based Gain Modeling for Enhancement of Speech in Noise. 882-892
- M. Khademul Islam Molla, Keikichi Hirose: Single-Mixture Audio Source Separation by Subspace Decomposition of Hilbert Spectrum. 893-900
- Karsten Vandborg Sørensen, Søren Vang Andersen: Rayleigh Mixture Model-Based Hidden Markov Modeling and Estimation of Noise in Noisy Speech Signals. 901-917
- Richard C. Hendriks, Rainer Martin: MAP Estimators for Speech Enhancement Under Normal and Rayleigh Inverse Gaussian Distributions. 918-927
- Nikos Chatzichrisafis, Vassilios Diakoloukas, Vassilios Digalakis, Costas Harizakis: Gaussian Mixture Clustering and Language Adaptation for the Development of a New Language Speech Recognition System. 928-938
- Ghinwa F. Choueiter, James R. Glass: An Implementation of Rational Wavelets and Filter Design for Phonetic Classification. 939-948
- Esther Klabbers, Jan P. H. van Santen, Alexander Kain: The Contribution of Various Sources of Spectral Mismatch to Audible Discontinuities in a Diphone Database. 949-956
- Jerome R. Bellegarda: Globally Optimal Training of Unit Boundaries in Unit Selection Text-to-Speech Synthesis. 957-965
- Pim Korten, Jesper Jensen, Richard Heusdens: High-Resolution Spherical Quantization of Sinusoidal Parameters. 966-981
- Hirokazu Kameoka, Takuya Nishimoto, Shigeki Sagayama: A Multipitch Analyzer Based on Harmonic Temporal Structured Clustering. 982-994
- Johannes Nix, Volker Hohmann: Combined Estimation of Spectral Envelopes and Sound Source Direction of Concurrent Voices by Multidimensional Statistical Filtering. 995-1008
- Matthew E. P. Davies, Mark D. Plumbley: Context-Dependent Beat Tracking of Musical Audio. 1009-1020
- Leevi Peltola, Cumhur Erkut, Perry R. Cook, Vesa Välimäki: Synthesis of Hand Clapping Sounds. 1021-1029
- Jean-Marc Valin: On Adjusting the Learning Rate in Frequency Domain Echo Cancellation With Double-Talk. 1030-1034
- James D. Gordy, Rafik A. Goubran: Statistical Analysis of Doubletalk Detection for Calibration and Performance Evaluation. 1035-1043
- Felix Albu, Martin Bouchard, Yuriy V. Zakharov: Pseudo-Affine Projection Algorithms for Multichannel Active Noise Control. 1044-1052
- Jacob Benesty, Jingdong Chen, Yiteng Huang, Jacek Dmochowski: On Microphone-Array Beamforming From a MIMO Acoustic Signal Processing Perspective. 1053-1065
- Tuomas Virtanen: Monaural Sound Source Separation by Nonnegative Matrix Factorization With Temporal Continuity and Sparseness Criteria. 1066-1074
- Carlos Busso, Zhigang Deng, Michael Grimm, Ulrich Neumann, Shrikanth S. Narayanan: Rigid Head Motion in Expressive Speech Animation: Analysis and Synthesis. 1075-1086
- Chen Yang, Frank K. Soong, Tan Lee: Static and Dynamic Spectral Features: Their Noise Robustness and Optimal Weights for ASR. 1087-1097
- Luis Buera, Eduardo Lleida, Antonio Miguel, Alfonso Ortega, Oscar Saz: Cepstral Vector Normalization Based on Stereo Data for Robust Speech Recognition. 1098-1113
- Xianyu Zhao, Zhijian Ou: Closely Coupled Array Processing and Model-Based Compensation for Microphone Array Speech Recognition. 1114-1122
Volume 15, Number 4, May 2007
- Rasool Tahmasbi, Sadegh Rezaei: A Soft Voice Activity Detection Using GARCH Filter and Variance Gamma Distribution. 1129-1134
- Jonathan Le Roux, Hirokazu Kameoka, Nobutaka Ono, Alain de Cheveigné, Shigeki Sagayama: Single and Multiple F0 Contour Estimation Through Parametric Spectrogram Modeling of Speech in Noisy Environments. 1135-1145
- Thomas Eriksson, Frank Norden: Memory-Based Vector Quantization of LSF Parameters by a Power Series Approximation. 1146-1155
- Bengt J. Borgstrom, Mihaela van der Schaar, Abeer Alwan: Rate Allocation for Noncollaborative Multiuser Speech Communication Systems Based on Bargaining Theory. 1156-1166
- Milan Jelinek, Redwan Salami: Wideband Speech Coding Advances in VMR-WB Standard. 1167-1179
- Athanasios Mouchtaris, Jan Van der Spiegel, Paul Mueller, Panagiotis Tsakalides: A Spectral Conversion Approach to Single-Channel Speech Enhancement. 1180-1193
- Esfandiar Zavarehei, Saeed Vaseghi, Qin Yan: Noisy Speech Enhancement Using Harmonic-Noise Model and Codebook-Based Post-Processing. 1194-1203
- Xuechuan Wang, Douglas D. O'Shaughnessy: Environmental Independent ASR Model Adaptation/Compensation by Bayesian Parametric Representation. 1204-1217
- Peter Birkholz, Dietmar Jackèl, Bernd J. Kröger: Simulation of Losses Due to Turbulence in the Time-Varying Vocal System. 1218-1226
- Chung-Hsien Wu, Chi-Chun Hsia, Jiun-Fu Chen, Jhing-Fa Wang: Variable-Length Unit Selection in TTS Using Structural Syntactic Cost. 1227-1235
- Karthikeyan Umapathy, Sridhar Krishnan, R. K. Rao: Audio Signal Feature Extraction and Classification Using Local Discriminant Bases. 1236-1246
- Graham E. Poliner, Daniel P. W. Ellis, Andreas F. Ehmann, Emilia Gómez, Sebastian Streich, Beesuan Ong: Melody Transcription From Music Audio: Approaches and Evaluation. 1247-1256
- Harvey D. Thornburg, Randal J. Leistikow, Jonathan Berger: Melody Extraction and Musical Onset Detection via Probabilistic Models of Framewise STFT Peak Data. 1257-1272
- Emmanuel Vincent, Mark D. Plumbley: Low Bit-Rate Object Coding of Musical Audio Using Bayesian Harmonic Models. 1273-1282
- Corentin Dubois, Manuel Davy: Joint Detection and Tracking of Time-Varying Harmonic Components: A Flexible Bayesian Approach. 1283-1295
- H. M. A. Malik, Rashid Ansari, Ashfaq A. Khokhar: Robust Data Hiding in Audio Using Allpass Filters. 1296-1304
- Yekutiel Avargel, Israel Cohen: System Identification in the Short-Time Fourier Transform Domain With Crossband Filtering. 1305-1319
- Fredric Lindström, Christian Schüldt, Ingvar Claesson: An Improvement of the Two-Path Algorithm Transfer Logic for Acoustic Echo Cancellation. 1320-1326
- Jacek Dmochowski, Jacob Benesty, Sofiène Affes: Direction of Arrival Estimation Using the Parameterized Spatial Correlation Matrix. 1327-1339
- Wolfgang Herbordt, Herbert Buchner, Satoshi Nakamura, Walter Kellermann: Multichannel Bin-Wise Robust Frequency-Domain Adaptive Filtering and Its Application to Adaptive Beamforming. 1340-1351
- Takaaki Hori, Chiori Hori, Yasuhiro Minami, Atsushi Nakamura: Efficient WFST-Based One-Pass Decoding With On-The-Fly Hypothesis Rescoring in Extremely Large Vocabulary Continuous Speech Recognition. 1352-1365
- Xiaodong Cui, Yifan Gong: A Study of Variable-Parameter Gaussian Mixture Hidden Markov Modeling for Noisy Speech Recognition. 1366-1376
- Mathias De Wachter, Mike Matton, Kris Demuynck, Patrick Wambacq, Ronald Cools, Dirk Van Compernolle: Template-Based Continuous Speech Recognition. 1377-1390
- Zheng-Hua Tan, Paul Dalsgaard, Børge Lindberg: Exploiting Temporal Correlation of Speech for Error Robust and Bandwidth Flexible Distributed Speech Recognition. 1391-1403
- Paris Smaragdis, Madhusudana V. S. Shashanka: A Framework for Secure Speech Recognition. 1404-1413
- Xunying Liu, Mark J. F. Gales: Automatic Model Complexity Control Using Marginalized Discriminative Growth Functions. 1414-1424
- Yan Han, Johan de Veth, Lou Boves: Trajectory Clustering for Solving the Trajectory Folding Problem in Automatic Speech Recognition. 1425-1434
- Patrick Kenny, Gilles Boulianne, Pierre Ouellet, Pierre Dumouchel: Joint Factor Analysis Versus Eigenchannels in Speaker Recognition. 1435-1447
- Patrick Kenny, Gilles Boulianne, Pierre Ouellet, Pierre Dumouchel: Speaker and Session Variability in GMM-Based Speaker Verification. 1448-1460
- Wei-Ho Tsai, Shih-Sian Cheng, Hsin-Min Wang: Automatic Speaker Clustering Using a Voice Characteristic Reference Space and Maximum Purity Estimation. 1461-1474
- Yipeng Li, DeLiang Wang: Separation of Singing Voice From Music Accompaniment for Monaural Recordings. 1475-1487
- Stefan Bilbao, Lauri Savioja, Julius O. Smith III: Parameterized Finite Difference Schemes for Plates: Stability, the Reduction of Directional Dispersion and Frequency Warping. 1488-1495
- Angel M. Gomez, Antonio M. Peinado, Victoria E. Sánchez, Antonio J. Rubio: On the Ramsey Class of Interleavers for Robust Speech Recognition in Burst-Like Packet Loss. 1496-1499
Volume 15, Number 5, July 2007
- Scott C. Douglas, Malay Gupta, Hiroshi Sawada, Shoji Makino: Spatio-Temporal FastICA Algorithms for the Blind Separation of Convolutive Mixtures. 1511-1520
- Intae Lee, Te-Won Lee: On the Assumption of Spherical Symmetry and Sparseness for the Frequency-Domain Speech Model. 1521-1528
- Ernst Warsitz, Reinhold Haeb-Umbach: Blind Acoustic Beamforming Based on Generalized Eigenvalue Decomposition. 1529-1539
- Abdeldjalil Aïssa-El-Bey, Karim Abed-Meraim, Yves Grenier: Blind Separation of Underdetermined Convolutive Mixtures Using Their Time-Frequency Representation. 1540-1550
- Zhaoshui He, Shengli Xie, Shuxue Ding, Andrzej Cichocki: Convolutive Blind Source Separation in the Frequency Domain Based on Sparse Representation. 1551-1563
- Alexey Ozerov, Pierrick Philippe, Frédéric Bimbot, Rémi Gribonval: Adaptation of Bayesian Models for Single-Channel Source Separation and its Application to Voice/Music Separation in Popular Songs. 1564-1578
- Ken'ichi Furuya, Akitoshi Kataoka: Robust Speech Dereverberation Using Multichannel Blind Deconvolution With Spectral Subtraction. 1579-1591
- Hiroshi Sawada, Shoko Araki, Ryo Mukai, Shoji Makino: Grouping Separated Frequency Components by Estimating Propagation Model Parameters in Frequency-Domain Blind Source Separation. 1592-1604
- Oscal T.-C. Chen, Chia-Hsiung Liu: Content-Dependent Watermarking Scheme in Compressed Speech With Identifying Manner and Location of Attacks. 1605-1616
- Vesa Siivola, Teemu Hirsimäki, Sami Virpioja: On Growing and Pruning Kneser-Ney Smoothed N-Gram Models. 1617-1624
- Mathieu Lagrange, Sylvain Marchand, Jean-Bernard Rault: Enhancing the Tracking of Partials for the Sinusoidal Modeling of Polyphonic Sounds. 1625-1634
- Mads Græsbøll Christensen, Andreas Jakobsson, Søren Holdt Jensen: Joint High-Resolution Fundamental Frequency and Order Estimation. 1635-1644
- Xinglei Zhu, Gerald Beauregard, Lonce L. Wyse: Real-Time Signal Estimation From Modified Short-Time Fourier Transform Magnitude Spectra. 1645-1653
- Anders Meng, Peter Ahrendt, Jan Larsen, Lars Kai Hansen: Temporal Feature Integration for Music Genre Classification. 1654-1664
- Masahiro Yukawa, Konstantinos Slavakis, Isao Yamada: Adaptive Parallel Quadratic-Metric Projection Algorithms. 1665-1680
- Andy W. H. Khong, Patrick A. Naylor: Selective-Tap Adaptive Filtering With Performance Analysis for Identification of Time-Varying Systems. 1681-1695
- Guillaume Lathoud, Jean-Marc Odobez: Short-Term Spatio-Temporal Clustering Applied to Multiple Moving Speakers. 1696-1710
- Ji Ming, Timothy J. Hazen, James R. Glass, Douglas A. Reynolds: Robust Speaker Recognition in Noisy Conditions. 1711-1723
- Mark D. Skowronski, John G. Harris: Noise-Robust Automatic Speech Recognition Using a Predictive Echo State Network. 1724-1730
- Mohamed Afify, Olivier Siohan: Comments on Vocal Tract Length Normalization Equals Linear Transformation in Cepstral Space. 1731-1732
Volume 15, Number 6, August 2007
- Jan S. Erkelens, Richard C. Hendriks, Richard Heusdens, Jesper Jensen: Minimum Mean-Square Error Estimation of Discrete Fourier Coefficients With Generalized Gamma Priors. 1741-1752
- Chang Huai You, Susanto Rahardja, Soo Ngee Koh: Audible Noise Reduction in Eigendomain for Speech Enhancement. 1753-1765
- Aarthi M. Reddy, Bhiksha Raj: Soft Mask Methods for Single-Channel Speaker Separation. 1766-1776
- Ann Spriet, Geert Rombouts, Marc Moonen, Jan Wouters: Combined Feedback and Noise Suppression in Hearing Aids. 1777-1790
- Marc Delcroix, Takafumi Hikichi, Masato Miyoshi: Dereverberation and Denoising Using Multichannel Linear Prediction. 1791-1801
- Woojay Jeon, Biing-Hwang Juang: Speech Analysis in a Model of the Central Auditory System. 1802-1817
- Nikolaos Mitianoudis, Tania Stathaki: Batch and Online Underdetermined Source Separation Using Laplacian Mixture Models. 1818-1832
- Maurizio Mancini, Roberto Bresin, Catherine Pelachaud: A Virtual Head Driven by Music Expressivity. 1833-1841
- Shantanu Chakrabartty, Yunbin Deng, Gert Cauwenberghs: Robust Speech Feature Extraction by Growth Transformation in Reproducing Kernel Hilbert Space. 1842-1849
- Bertrand Mesot, David Barber: Switching Linear Dynamical Systems for Noise Robust Speech Recognition. 1850-1858
- Amit S. Malegaonkar, Aladdin M. Ariyaeeinia, P. Sivakumaran: Efficient Speaker Change Detection Using Adapted Gaussian Mixture Models. 1859-1869
- Yuan-Fu Liao, Zi-He Chen, Yau-Tarng Juang: Latent Prosody Analysis for Robust Speaker Identification. 1870-1883
- Wai Nang Chan, Nengheng Zheng, Tan Lee: Discrimination Power of Vocal Source and Vocal Tract Related Features for Speaker Segmentation. 1884-1892
- Wei Wu, Thomas Fang Zheng, Mingxing Xu, Frank K. Soong: A Cohort-Based Speaker Model Synthesis for Mismatched Channels in Speaker Verification. 1893-1903
- Jean-Luc Rouas: Automatic Prosodic Variations Modeling for Language and Dialect Discrimination. 1904-1911
- Peter Taraba: Kneser-Ney Smoothing With a Correcting Transformation for Small Data Sets. 1912-1921
- Darko Kirovski, Fabien A. P. Petitcolas, Zeph Landau: The Replacement Attack. 1922-1931
- Kai Yu, Mark J. F. Gales: Bayesian Adaptive Inference and Adaptive Training. 1932-1943
Volume 15, Number 7, September 2007
- Mark A. Przybocki, Alvin F. Martin, Audrey N. Le: NIST Speaker Recognition Evaluations Utilizing the Mixer Corpora - 2004, 2005, 2006. 1951-1959
- Benoit G. B. Fauve, Driss Matrouf, Nicolas Scheffer, Jean-François Bonastre, John S. D. Mason: State-of-the-Art Performance in Text-Independent Speaker Verification Through Open-Source Software. 1960-1968
- Fabio Castaldo, Daniele Colibro, Emanuele Dalmasso, Pietro Laface, Claudio Vair: Compensation of Nuisance Factors for Speaker and Language Recognition. 1969-1978
- Lukás Burget, Pavel Matejka, Petr Schwarz, Ondrej Glembek, Jan Cernocký: Analysis of Feature Extraction and Channel Compensation in a GMM Speaker Recognition System. 1979-1986
- Andreas Stolcke, Sachin S. Kajarekar, Luciana Ferrer, E. Shrinberg: Speaker Recognition With Session Variability Normalization Based on MLLR Adaptation Transforms. 1987-1998
- Shou-Chun Yin, Richard C. Rose, Patrick Kenny: A Joint Factor Analysis Approach to Progressive Model Adaptation in Text-Independent Speaker Verification. 1999-2010
- Xavier Anguera, Chuck Wooters, Javier Hernando: Acoustic Beamforming for Speaker Diarization of Meetings. 2011-2022
- Qin Jin, Tanja Schultz, Alex Waibel: Far-Field Speaker Recognition. 2023-2032
- Hagai Aronowitz, David Burshtein: Efficient Speaker Recognition Using Approximated Cross Entropy (ACE). 2033-2043
- Vinod Prakash, John H. L. Hansen: In-Set/Out-of-Set Speaker Recognition Under Sparse Enrollment. 2044-2052
- Bin Ma, Haizhou Li, Rong Tong: Spoken Language Recognition Using Ensemble Classifiers. 2053-2062
- Yosef A. Solewicz, Moshe Koppel: Using Post-Classifiers to Enhance Fusion of Low- and High-Level Speaker Recognition. 2063-2071
- Niko Brümmer, Lukás Burget, Jan Cernocký, Ondrej Glembek, Frantisek Grézl, Martin Karafiát, David A. van Leeuwen, Pavel Matejka, Petr Schwarz, Albert Strasheim: Fusion of Heterogeneous Speaker Recognition Systems in the STBU Submission for the NIST Speaker Recognition Evaluation 2006. 2072-2084
- William M. Campbell, Joseph P. Campbell, Terry P. Gleason, Douglas A. Reynolds, Wade Shen: Speaker Verification Using Support Vector Machines and High-Level Features. 2085-2094
- Najim Dehak, Pierre Dumouchel, Patrick Kenny: Modeling Prosodic Features With Joint Factor Analysis for Speaker Verification. 2095-2103
- Joaquin Gonzalez-Rodriguez, P. Rose, Daniel Ramos, Doroteo T. Toledano, Javier Ortega-Garcia: Emulating DNA: Rigorous Quantification of Evidential Weight in Transparent and Testable Forensic Speaker Recognition. 2104-2115
- Jason D. Williams, S. Young: Scaling POMDPs for Spoken Dialog Management. 2116-2129
- Soundararajan Srinivasan, DeLiang L. Wang: Transforming Binary Uncertainties for Robust Speech Recognition. 2130-2140
- J. Usher, Jacob Benesty: Enhancement of Spatial Sound Quality: A New Reverberation-Extraction Audio Upmixer. 2141-2150
- Cheng-Yuan Lin, Jyh-Shing Roger Jang: Automatic Phonetic Segmentation by Score Predictive Model for the Corpora of Mandarin Singing Voices. 2151-2159
- Rusheng Hu, Yunxin Zhao: Knowledge-Based Adaptive Decision Tree State Tying for Conversational Speech Recognition. 2160-2168
Volume 15, Number 8, November 2007
- Javier Ramírez, José C. Segura, Juan Manuel Górriz, Luz García: Improved Voice Activity Detection Using Contextual Multiple Hypothesis Testing for Robust Speech Recognition. 2177-2189
- Dagen Wang, Shrikanth S. Narayanan: Robust Speech Rate Estimation for Spontaneous Speech. 2190-2201
- Seung Seop Park, Nam Soo Kim: On Using Multiple Models for Automatic Speech Segmentation. 2202-2212
- Robert I. Damper, Tasanawan Soonklang: Subjective Evaluation of Techniques for Proper Name Pronunciation. 2213-2221
- Tomoki Toda, Alan W. Black, Keiichi Tokuda: Voice Conversion Based on Maximum-Likelihood Estimation of Spectral Parameter Trajectory. 2222-2235
- Te Li, Susanto Rahardja, Rongshan Yu, Soo Ngee Koh: On Integer MDCT for Perceptual Audio Coding. 2236-2248
- Enrique Alexandre, Lucas Cuadra, Manuel Rosa-Zurera, Francisco López-Ferreras: Feature Selection for Sound Classification in Hearing Aids Through Restricted Search Driven by Genetic Algorithms. 2249-2256
- Hari Krishna Maganti, Daniel Gatica-Perez, Iain McCowan: Speech Enhancement and Recognition in Meetings With an Audio-Visual Sensor Array. 2257-2269
- Xiangyang Wang, Wei Qi, Panpan Niu: A New Adaptive Digital Audio Watermarking Based on Support Vector Regression. 2270-2277
- Leslie S. Smith, Steve Collins: Determining ITDs Using Two Microphones on a Flat Panel During Onset Intervals With a Biologically Inspired Spike-Based Technique. 2278-2286
- Harsha I. K. Rao, V. John Mathews, Young-Cheol Park: A Minimax Approach for the Joint Design of Acoustic Crosstalk Cancellation Filters. 2287-2298
- Mohammad H. Radfar, Richard M. Dansereau: Single-Channel Speech Separation Using Soft Mask Filtering. 2299-2310
- Jingyi Zhang, Wai Lok Woo, Satnam Singh Dlay: Blind Source Separation of Postnonlinear Convolutive Mixture. 2311-2330
- Carlos Busso, Shrikanth S. Narayanan: Interrelation Between Speech and Facial Gestures in Emotional Utterances: A Single Subject Study. 2331-2347
- Ari Abramson, Israel Cohen: Simultaneous Detection and Estimation Approach for Speech Enhancement. 2348-2359
- Zohra Yermeche, Nedelko Grbic, Ingvar Claesson: Blind Subband Beamforming With Time-Delay Constraints for Moving Source Speech Enhancement. 2360-2372
- Joseph Keshet, Shai Shalev-Shwartz, Yoram Singer, Dan Chazan: A Large Margin Algorithm for Speech-to-Phoneme and Music-to-Score Alignment. 2373-2382
- Xinwei Li, Hui Jiang: Solving Large-Margin Hidden Markov Model Estimation via Semidefinite Programming. 2383-2392
- Jinyu Li, Ming Yuan, Chin-Hui Lee: Approximate Test Risk Bound Minimization Through Soft Margin Estimation. 2393-2404
- Mohamed Afify, Xinwei Li, Hui Jiang: Statistical Analysis of Minimum Classification Error Learning for Gaussian and Hidden Markov Model Classifiers. 2405-2417
- Srinivasan Umesh, Rohit Sinha: A Study of Filter Bank Smoothing in MFCC Features for Recognition of Children's Speech. 2418-2430
- Haitian Xu, Paul Dalsgaard, Zheng-Hua Tan, Børge Lindberg: Noise Condition-Dependent Training Based on Noise Classification and SNR Estimation. 2431-2443
- Rongqing Huang, John H. L. Hansen: Unsupervised Discriminative Training With Application to Dialect Classification. 2444-2453
- Shizhen Wang, Xiaodong Cui, Abeer Alwan: Speaker Adaptation With Limited Data Using Regression-Tree-Based Spectral Peak Alignment. 2454-2464
- Jérôme Louradour, Khalid Daoudi, Francis R. Bach: Feature Space Mahalanobis Sequence Kernels: Application to SVM Speaker Verification. 2465-2475
- Minho Jin, Frank K. Soong, Chang Dong Yoo: A Syllable Lattice Approach to Speaker Verification. 2476-2484
- Mohamed Chibani, Roch Lefebvre, Philippe Gournay: Fast Recovery for a CELP-Like Speech Codec After a Frame Erasure. 2485-2495
- Bernd Geiser, Peter Jax, Peter Vary, Hervé Taddei, Stefan Schandl, Martin Gartner, Cyril Guillaume, Stéphane Ragot: Bandwidth Extension for Hierarchical Speech and Audio Coding in ITU-T Rec. G.729.1. 2496-2509
- Jacek Dmochowski, Jacob Benesty, Sofiène Affes: A Generalized Steered Response Power Method for Computationally Viable Source Localization. 2510-2526
- Ken'ichi Kumatani, Tobias Gehrig, Uwe Mayer, Emilian Stoimenov, John W. McDonough, Matthias Wölfel: Adaptive Beamforming With a Minimum Mutual Information Criterion. 2527-2541
- K. C. Ho, Ming Sun: An Accurate Algebraic Closed-Form Solution for Energy-Based Source Localization. 2542-2550
- Chien-Lin Huang, Chung-Hsien Wu: Spoken Document Retrieval Using Multilevel Knowledge and Semantic Verification. 2551-2560
- Toon van Waterschoot, Marc Moonen: A Pole-Zero Placement Technique for Designing Second-Order IIR Parametric Equalizer Filters. 2561-2565
