


default search action
IEEE/ACM Transactions on Audio, Speech and Language Processing, Volume 26
Volume 26, Number 1, January 2018
- Dianna Yee, A. Homayoun Kamkar-Parsi, Rainer Martin

, Henning Puder:
A Noise Reduction Postfilter for Binaurally Linked Single-Microphone Hearing Aids Utilizing a Nearby External Microphone. 5-18 - Tom Bäckström

, Johannes Fischer:
Fast Randomization for Distributed Low-Bitrate Coding of Speech and Audio. 19-30 - Jun Deng, Xinzhou Xu, Zixing Zhang, Sascha Frühholz

, Björn W. Schuller
:
Semisupervised Autoencoders for Speech Emotion Recognition. 31-43 - Md. Sahidullah

, Dennis Alexander Lehmann Thomsen, Rosa González Hautamäki, Tomi Kinnunen, Zheng-Hua Tan
, Robert Parts, Martti Pitkänen:
Robust Voice Liveness Detection and Speaker Verification Using Throat Microphones. 44-56 - Gilles Degottex, Pierre Lanchantin, Mark J. F. Gales:

A Log Domain Pulse Model for Parametric Speech Synthesis. 57-70 - Johannes Abel

, Tim Fingscheidt
:
Artificial Speech Bandwidth Extension Using Deep Neural Networks for Wideband Spectral Envelope Estimation. 71-83 - Yuki Saito

, Shinnosuke Takamichi, Hiroshi Saruwatari:
Statistical Parametric Speech Synthesis Incorporating Generative Adversarial Networks. 84-96 - Kristian Timm Andersen, Marc Moonen:

Robust Speech-Distortion Weighted Interframe Wiener Filters for Single-Channel Noise Reduction. 97-107 - Chen-Yu Chiang:

Cross-Dialect Adaptation Framework for Constructing Prosodic Models for Chinese Dialect Text-to-Speech Systems. 108-121 - Bingquan Liu, Zhen Xu, Chengjie Sun, Baoxun Wang

, Xiaolong Wang, Derek F. Wong
, Min Zhang:
Content-Oriented User Modeling for Personalized Response Ranking in Chatbots. 122-133 - Zhiyuan Tang, Dong Wang, Yixiang Chen, Lantian Li

, Andrew Abel:
Phonetic Temporal Neural Model for Language Identification. 134-144 - Soumitro Chakrabarty, Emanuël A. P. Habets:

A Bayesian Approach to Informed Spatial Filtering With Robustness Against DOA Estimation Errors. 145-160 - Kuan-Yu Chen

, Shih-Hung Liu, Berlin Chen, Hsin-Min Wang
:
An Information Distillation Framework for Extractive Summarization. 161-170 - Ma Jin

, Yan Song, Ian McLoughlin
, Li-Rong Dai:
LID-Senones and Their Statistics for Language Identification. 171-183 - Zhehuai Chen

, Jasha Droppo
, Jinyu Li
, Wayne Xiong:
Progressive Joint Modeling in Unsupervised Single-Channel Overlapped Speech Recognition. 184-196 - Shivesh Ranjan

, John H. L. Hansen
:
Curriculum Learning Based Approaches for Noise Robust Speaker Recognition. 197-210
Volume 26, Number 2, February 2018
- Yoshiaki Bando

, Katsutoshi Itoyama, Masashi Konyo
, Satoshi Tadokoro, Kazuhiro Nakadai
, Kazuyoshi Yoshii
, Tatsuya Kawahara
, Hiroshi G. Okuno
:
Speech Enhancement Based on Bayesian Low-Rank and Sparse Decomposition of Multichannel Magnitude Spectrograms. 215-230 - Yu-Ping Ruan

, Qian Chen, Zhen-Hua Ling
:
A Sequential Neural Encoder With Latent Structured Description for Modeling Sentences. 231-242 - Amelia Jane Gully

, Helena Daffern, Damian T. Murphy
:
Diphthong Synthesis Using the Dynamic 3D Digital Waveguide Mesh. 243-255 - Chunyang Wu

, Mark J. F. Gales, Anton Ragni, Penny Karanasou
, Khe Chai Sim:
Improving Interpretability and Regularization in Deep Learning. 256-265 - Kehai Chen

, Tiejun Zhao, Muyun Yang
, Lemao Liu
, Akihiro Tamura
, Rui Wang
, Masao Utiyama, Eiichiro Sumita:
A Neural Approach to Source Dependence Based Context Model for Statistical Machine Translation. 266-280 - Joonas Nikunen

, Aleksandr Diment, Tuomas Virtanen
:
Separation of Moving Sound Sources Using Multichannel NMF and Acoustic Tracking. 281-295 - Johan Sward

, Hongbin Li
, Andreas Jakobsson
:
Off-Grid Fundamental Frequency Estimation. 296-303 - Dylan Menzies

, Marcos F. Simón Gálvez, Filippo Maria Fazi
:
A Low-Frequency Panning Method With Compensation for Head Rotation. 304-317 - Branimir Dropuljic

, Igor Mijic
, Davor Petrinovic
, Tanja Jovanovic
, Kresimir Cosic
:
Vocal Analysis of Acoustic Startle Responses. 318-329 - Philipp Aichinger

, Martin Hagmüller
, Berit Schneider-Stickler, Jean Schoentgen, Franz Pernkopf
:
Tracking of Multiple Fundamental Frequencies in Diplophonic Voices. 330-341 - Anastasios Alexandridis, Athanasios Mouchtaris

:
Multiple Sound Source Location Estimation in Wireless Acoustic Sensor Networks Using DOA Estimates: The Data-Association Problem. 342-356 - Robert Rehr, Timo Gerkmann

:
On the Importance of Super-Gaussian Speech Priors for Machine-Learning Based Speech Enhancement. 357-366 - Sonia Djaziri Larbi, Gaël Mahé, Imen Marrakchi-Mezghani, Monia Turki

, Meriem Jaïdane
:
Watermark-Driven Acoustic Echo Cancellation. 367-378 - Annamaria Mesaros

, Toni Heittola
, Emmanouil Benetos
, Peter Foster, Mathieu Lagrange, Tuomas Virtanen
, Mark D. Plumbley
:
Detection and Classification of Acoustic Scenes and Events: Outcome of the DCASE 2016 Challenge. 379-393 - Cheng-Tao Chung, Lin-Shan Lee:

Unsupervised Discovery of Structured Acoustic Tokens With Applications to Spoken Term Detection. 394-405 - Tobias May

:
Robust Speech Dereverberation With a Neural Network-Based Post-Filter That Exploits Multi-Conditional Training of Binaural Cues. 406-414 - Majid Mirbagheri

, Les Atlas, Adrian K. C. Lee
:
Regression Factor Analysis With an Application to Continuous HRIR Measurement. 415-421 - Jen-Tzung Chien

:
Bayesian Nonparametric Learning for Hierarchical and Sparse Topics. 422-435 - Johannes Stahl

, Pejman Mowlaee
:
A Pitch-Synchronous Simultaneous Detection-Estimation Framework for Speech Enhancement. 436-450
Volume 26, Number 3, March 2018
- César D. Salvador

, Shuichi Sakamoto, Jorge Treviño, Yôiti Suzuki:
Boundary Matching Filters for Spherical Microphone and Loudspeaker Arrays. 461-474 - Ahmed Hussen Abdelaziz

:
Comparing Fusion Models for DNN-Based Audiovisual Continuous Speech Recognition. 475-484 - Satoru Emura

:
Residual Echo Reduction for Multichannel Acoustic Echo Cancelers With a Complex-Valued Residual Echo Estimate. 485-500 - Van Hai Do

, Nancy F. Chen
, Boon Pang Lim, Mark A. Hasegawa-Johnson
:
Multitask Learning for Phone Recognition of Underresourced Languages Using Mismatched Transcription. 501-514 - Mehdi Zohourian

, Gerald Enzner
, Rainer Martin
:
Binaural Speaker Localization Integrated Into an Adaptive Beamformer for Hearing Aids. 515-528 - Yong Xiang

, Iynkaran Natgunanathan
, Dezhong Peng
, Guang Hua
, Bo Liu
:
Spread Spectrum Audio Watermarking Using Multiple Orthogonal PN Sequences and Variable Embedding Strengths and Polarities. 529-539 - Chuanqi Tan

, Furu Wei, Qingyu Zhou
, Nan Yang, Bowen Du
, Weifeng Lv, Ming Zhou:
Context-Aware Answer Sentence Selection With Hierarchical Gated Recurrent Neural Networks. 540-549 - Jie Zhang

, Sundeep Prabhakar Chepuri
, Richard Christian Hendriks
, Richard Heusdens:
Microphone Subset Selection for MVDR Beamformer Based Noise Reduction. 550-563 - Syu-Siang Wang

, Payton Lin
, Yu Tsao
, Jeih-Weih Hung, Borching Su
:
Suppression by Selecting Wavelets for Feature Compression in Distributed Speech Recognition. 564-579 - Yu Wang

, Mike Brookes
:
Model-Based Speech Enhancement in the Modulation Domain. 580-594 - Christian Huemmer

, Christian Hofmann
, Roland Maas, Walter Kellermann:
Estimating Parameters of Nonlinear Systems Using the Elitist Particle Filter Based on Evolutionary Strategies. 595-608 - Daniele Salvati

, Carlo Drioli
, Gian Luca Foresti
:
A Low-Complexity Robust Beamforming Using Diagonal Unloading for Acoustic Source Localization. 609-622 - Jinsong Su

, Jiali Zeng, Deyi Xiong
, Yang Liu
, Mingxuan Wang, Jun Xie:
A Hierarchy-to-Sequence Attentional Neural Machine Translation Model. 623-632 - Waad Ben Kheder

, Driss Matrouf, Moez Ajili, Jean-François Bonastre:
A Unified Joint Model to Deal With Nuisance Variabilities in the i-Vector Space. 633-645 - Gregory Gelly

, Jean-Luc Gauvain:
Optimization of RNN-Based Speech Activity Detection. 646-656 - Maja Taseska

, Emanuël A. P. Habets
:
Blind Source Separation of Moving Sources Using Sparsity-Based Source Detection and Tracking. 657-670 - Liang-Chih Yu

, Jin Wang
, K. Robert Lai
, Xuejie Zhang:
Refining Word Embeddings Using Intensity Scores for Sentiment Analysis. 671-681 - Yuval Dorfan

, Axel Plinge
, Gershon Hazan, Sharon Gannot
:
Distributed Expectation-Maximization Algorithm for Speaker Localization in Reverberant Environments. 682-695
Volume 26, Number 4, April 2018
- Zhili Tan

, Man-Wai Mak
, Brian Kan-Wing Mak
:
DNN-Based Score Calibration With Multitask Learning for Noise Robust Speaker Verification. 700-712 - Ya-Jun Hu

, Zhen-Hua Ling
:
Extracting Spectral Features Using Deep Autoencoders With Binary Distributed Hidden Units for Statistical Parametric Speech Synthesis. 713-724 - Bracha Laufer-Goldshtein, Ronen Talmon

, Sharon Gannot
:
A Hybrid Approach for Speaker Tracking Based on TDOA and Data-Driven Models. 725-735 - Sandro Cumani

, Pietro Laface
:
Speaker Recognition Using e-Vectors. 736-748 - Longting Xu

, Kong-Aik Lee
, Haizhou Li
, Zhen Yang:
Generalizing I-Vector Estimation for Rapid Speaker Recognition. 749-759 - Yaakov Buchris

, Israel Cohen, Jacob Benesty
:
Frequency-Domain Design of Asymmetric Circular Differential Microphone Arrays. 760-773 - Jihui Zhang

, Thushara D. Abhayapala
, Wen Zhang
, Prasanga N. Samarasinghe
, Shouda Jiang:
Active Noise Control Over Space: A Wave Domain Approach. 774-786 - Yi Luo

, Zhuo Chen, Nima Mesgarani
:
Speaker-Independent Speech Separation With Deep Attractor Network. 787-796 - Neethu Mariam Joy

, Sandeep Reddy Kothinti
, Srinivasan Umesh
:
FMLLR Speaker Normalization With i-Vector: In Pseudo-FMLLR and Distillation Framework. 797-805 - Swati Chandna

, Wenwu Wang
:
Bootstrap Averaging for Model-Based Source Separation in Reverberant Conditions. 806-819 - Zhili Tan

, Man-Wai Mak
, Brian Kan-Wing Mak
, Yingke Zhu:
Denoised Senone I-Vectors for Robust Speaker Verification. 820-830 - Kousuke Itakura

, Yoshiaki Bando
, Eita Nakamura
, Katsutoshi Itoyama
, Kazuyoshi Yoshii
, Tatsuya Kawahara
:
Bayesian Multichannel Audio Source Separation Based on Integrated Source and Spatial Models. 831-846
Volume 26, Number 5, May 2018
- Youssef El Baba

, Andreas Walther, Emanuël A. P. Habets
:
3D Room Geometry Inference Based on Room Impulse Response Stacks. 857-872 - Qian Zhang, John H. L. Hansen

:
Language/Dialect Recognition Based on Unsupervised Deep Learning. 873-882 - Zhen-Hua Ling

, Yang Ai
, Yu Gu, Li-Rong Dai:
Waveform Modeling and Generation Using Hierarchical Recurrent Neural Networks for Speech Bandwidth Extension. 883-894 - Marc Delcroix

, Keisuke Kinoshita
, Atsunori Ogawa
, Christian Huemmer
, Tomohiro Nakatani:
Context Adaptive Neural Network Based Acoustic Models for Rapid Adaptation. 895-908 - Linh Thi Thuc Tran, Sven Erik Nordholm

, Henning F. Schepker
, Hai Huyen Dam, Simon Doclo
:
Two-Microphone Hearing Aids Using Prediction Error Method for Adaptive Feedback Control. 909-923 - Jiho Chang, Marton Marschall:

Periphony-Lattice Mixed-Order Ambisonic Scheme for Spherical Microphone Arrays. 924-936 - Nikolaos Dionelis

, Mike Brookes
:
Phase-Aware Single-Channel Speech Enhancement With Modulation-Domain Kalman Filtering. 937-950 - Chengshi Zheng

, Antoine Deleforge, Xiaodong Li
, Walter Kellermann
:
Statistical Analysis of the Multichannel Wiener Filter Using a Bivariate Normal Distribution for Sample Covariance Matrices. 951-966 - Colin Vaz

, Vikram Ramanarayanan
, Shrikanth S. Narayanan
:
Acoustic Denoising Using Dictionary Learning With Spectral and Temporal Regularization. 967-980 - Lin Wang

, Andrea Cavallaro:
Pseudo-Determined Blind Source Separation for Ad-hoc Microphone Networks. 981-994 - Sandro Cumani

, Pietro Laface
:
Scoring Heterogeneous Speaker Vectors Using Nonlinear Transformations and Tied PLDA Models. 995-1009 - Giuliano Bernardi

, Toon van Waterschoot, Jan Wouters
, Marc Moonen
:
Subjective and Objective Sound-Quality Evaluation of Adaptive Feedback Cancellation Algorithms. 1010-1024
Volume 26, Number 6, June 2018
- Hirokazu Kameoka

, Takuya Higuchi
, Mikihiro Tanaka
, Li Li:
Nonnegative Matrix Factorization With Basis Clustering Using Cepstral Distance Regularization. 1025-1036 - Jacob Donley

, Christian H. Ritz
, W. Bastiaan Kleijn
:
Multizone Soundfield Reproduction With Privacy- and Quality-Based Speech Masking Filters. 1037-1051 - Sebastian Braun

, Adam Kuklasinski
, Ofer Schwartz, Oliver Thiergart, Emanuël A. P. Habets, Sharon Gannot
, Simon Doclo
, Jesper Jensen:
Evaluation and Comparison of Late Reverberation Power Spectral Density Estimators. 1052-1067 - Elie-Laurent Benaroya

, Nicolas Obin
, Marco Liuni
, Axel Roebel
, Wilson Raumel, Sylvain Argentieri
:
Binaural Localization of Multiple Sound Sources by Non-Negative Tensor Factorization. 1068-1078 - Nathanaël Perraudin

, Nicki Holighaus
, Piotr Majdak
, Péter Balázs:
Inpainting of Long Audio Segments With Similarity Graphs. 1079-1090 - Paul Magron

, Roland Badeau
, Bertrand David:
Model-Based STFT Phase Recovery for Audio Source Separation. 1091-1101 - Ina Kodrasi

, Simon Doclo
:
Analysis of Eigenvalue Decomposition-Based Late Reverberation Power Spectral Density Estimation. 1102-1114 - Sebastian Braun

, Emanuël A. P. Habets
:
Linear Prediction-Based Online Dereverberation and Noise Reduction Using Alternating Kalman Filters. 1115-1125 - Dhananjay Ram

, Afsaneh Asaei
, Hervé Bourlard:
Sparse Subspace Modeling for Query by Example Spoken Term Detection. 1126-1139 - Martin Krawczyk-Becker

, Timo Gerkmann
:
On Speech Enhancement Under PSD Uncertainty. 1140-1149 - Simon Leglaive

, Roland Badeau
, Gaël Richard:
Student's t Source and Mixing Models for Multichannel Audio Source Separation. 1150-1164
Volume 26, Number 7, July 2018
- Takenori Yoshimura

, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda:
Mel-Cepstrum-Based Quantization Noise Shaping Applied to Neural-Network-Based Speech Waveform Synthesis. 1173-1180 - Qing Wang

, Jun Du
, Li-Rong Dai, Chin-Hui Lee:
A Multiobjective Learning and Ensembling Approach to High-Performance Speech Enhancement With Compact Neural Network Architectures. 1181-1193 - Miguel Ángel del Agua, Adrià Giménez

, Alberto Sanchís
, Jorge Civera
, Alfons Juan
:
Speaker-Adapted Confidence Measures for ASR Using Deep Bidirectional Recurrent Neural Networks. 1194-1202 - Jorge Proença

, Carla Lopes
, Michael Tjalve, Andreas Stolcke, Sara Candeias, Fernando Perdigão
:
Mispronunciation Detection in Children's Reading of Sentences. 1203-1215 - Ljubisa Stankovic

, Milos Brajovic
:
Analysis of the Reconstruction of Sparse Signals in the DCT Domain Applied to Audio Signals. 1216-1231 - João Felipe Santos

, Tiago H. Falk
:
Speech Dereverberation With Context-Aware Recurrent Neural Networks. 1232-1242 - Michele Geronazzo

, Simone Spagnol
, Federico Avanzini
:
Do We Need Individual Head-Related Transfer Functions for Vertical Localization? The Case Study of a Spectral Notch Distance Metric. 1243-1256 - Daniel Marquardt

, Simon Doclo
:
Interaural Coherence Preservation for Binaural Noise Reduction Using Partial Noise Estimation and Spectral Postfiltering. 1257-1270 - Mojtaba Farmani

, Michael Syskind Pedersen, Zheng-Hua Tan
, Jesper Jensen
:
Bias-Compensated Informed Sound Source Localization Using Relative Transfer Functions. 1271-1285 - Fei Tao

, Carlos Busso
:
Gating Neural Network for Large Vocabulary Audiovisual Speech Recognition. 1286-1298
Volume 26, Number 8, August 2018
- Zafar Rafii

, Antoine Liutkus, Fabian-Robert Stöter, Stylianos Ioannis Mimilakis, Derry FitzGerald, Bryan Pardo:
An Overview of Lead and Accompaniment Separation in Music. 1307-1335 - Chien-Yao Wang

, Jia-Ching Wang, Andri Santoso
, Chin-Chin Chiang, Chung-Hsien Wu
:
Sound Event Recognition Using Auditory-Receptive-Field Binary Pattern and Hierarchical-Diving Deep Belief Network. 1336-1351 - Liner Yang, Meishan Zhang

, Yang Liu, Maosong Sun, Nan Yu, Guohong Fu:
Joint POS Tagging and Dependence Parsing With Transition-Based Neural Networks. 1352-1358 - Kai Yu

, Zijian Zhao, Xueyang Wu, Hongtao Lin, Xuan Liu:
Rich Short Text Conversation Using Semantic-Key-Controlled Sequence Generation. 1359-1368 - Bernhard Lehner

, Jan Schlüter, Gerhard Widmer
:
Online, Loudness-Invariant Vocal Detection in Mixed Music Signals. 1369-1380 - Simon Stone

, Michael Marxen
, Peter Birkholz
:
Construction and Evaluation of a Parametric One-Dimensional Vocal Tract Model. 1381-1392 - Tian Tan

, Yanmin Qian
, Hu Hu
, Ying Zhou
, Wen Ding
, Kai Yu
:
Adaptive Very Deep Convolutional Residual Network for Noise Robust Speech Recognition. 1393-1405 - Xin Wang

, Shinji Takaki, Junichi Yamagishi
:
Autoregressive Neural F0 Model for Statistical Parametric Speech Synthesis. 1406-1419 - Cassia Valentini-Botinhao

, Junichi Yamagishi
:
Speech Enhancement of Noisy and Reverberant Speech for Text-to-Speech. 1420-1433 - Andreas I. Koutrouvelis

, Thomas W. Sherson
, Richard Heusdens
, Richard C. Hendriks
:
A Low-Cost Robust Distributed Linearly Constrained Beamformer for Wireless Acoustic Sensor Networks With Arbitrary Topology. 1434-1448
Volume 26, Number 9, September 2018
- Chih-Wei Wu

, Christian Dittmar
, Carl Southall
, Richard Vogl, Gerhard Widmer
, Jason Hockman, Meinard Müller
, Alexander Lerch
:
A Review of Automatic Drum Transcription. 1457-1483 - Christine Evers

, Patrick A. Naylor
:
Acoustic SLAM. 1484-1498 - Clement Laroche

, Matthieu Kowalski
, Hélène Papadopoulos, Gaël Richard:
Hybrid Projective Nonnegative Matrix Factorization With Drum Dictionaries for Harmonic/Percussive Source Separation. 1499-1511 - Julio J. Carabias-Orti

, Joonas Nikunen
, Tuomas Virtanen
, Pedro Vera-Candeas
:
Multichannel Blind Sound Source Separation Using Spatial Covariance Model With Level and Time Differences and Nonnegative Matrix Factorization. 1512-1527 - Meishan Zhang

, Nan Yu, Guohong Fu:
A Simple and Effective Neural Model for Joint Word Segmentation and POS Tagging. 1528-1538 - Dylan Menzies

, Filippo Maria Fazi
:
A Complex Panning Method for Near-Field Imaging. 1539-1548 - Abhinav Misra, John H. L. Hansen

:
Maximum-Likelihood Linear Transformation for Unsupervised Domain Adaptation in Speaker Verification. 1549-1558 - Yukoh Wakabayashi

, Takahiro Fukumori
, Masato Nakayama, Takanobu Nishiura
, Yoichi Yamashita
:
Single-Channel Speech Enhancement With Phase Reconstruction Based on Phase Distortion Averaging. 1559-1569 - Szu-Wei Fu

, Taowei Wang, Yu Tsao
, Xugang Lu, Hisashi Kawai:
End-to-End Waveform Utterance Enhancement for Direct Evaluation Metrics Optimization by Fully Convolutional Neural Networks. 1570-1584 - Ke Xiao

, Supin Wang, Mingxi Wan
, Liang Wu
:
Radiated Noise Suppression for Electrolarynx Speech Based on Multiband Time-Domain Amplitude Modulation. 1585-1593 - Abdullah Fahim

, Prasanga N. Samarasinghe
, Thushara D. Abhayapala
:
PSD Estimation and Source Separation in a Noisy Reverberant Environment Using a Spherical Microphone Array. 1594-1607 - Hongsen He

, Jingdong Chen
, Jacob Benesty
, Tao Yang:
Noise Robust Frequency-Domain Adaptive Blind Multichannel Identification With ℓp-Norm Constraint. 1608-1619 - Weiwei Zhang

, Zhe Chen
, Fuliang Yin
, Qiaoling Zhang:
Melody Extraction From Polyphonic Music Using Particle Filter and Dynamic Programming. 1620-1632 - Chunlei Zhang

, Kazuhito Koishida, John H. L. Hansen
:
Text-Independent Speaker Verification Based on Triplet Convolutional Neural Network Embeddings. 1633-1644 - M. V. Achuth Rao

, Prasanta Kumar Ghosh
:
PSFM - A Probabilistic Source Filter Model for Noise Robust Glottal Closure Instant Detection. 1645-1657 - Manu Airaksinen

, Lauri Juvela
, Bajibabu Bollepalli, Junichi Yamagishi
, Paavo Alku
:
A Comparison Between STRAIGHT, Glottal, and Sinusoidal Vocoding in Statistical Parametric Speech Synthesis. 1658-1670 - Gaël Mahé

, Meriem Jaïdane
:
Perceptually Controlled Reshaping of Sound Histograms. 1671-1683 - Qinghua Huang, Lin Zhang

, Yong Fang:
Two-Step Spherical Harmonics ESPRIT-Type Algorithms and Performance Analysis. 1684-1697
Volume 26, Number 10, October 2018
- DeLiang Wang

, Jitong Chen
:
Supervised Speech Separation Based on Deep Learning: An Overview. 1702-1726 - Rui Wang

, Masao Utiyama, Andrew M. Finch, Lemao Liu
, Kehai Chen
, Eiichiro Sumita:
Sentence Selection and Weighting for Neural Machine Translation Domain Adaptation. 1727-1741 - Faheem Khan

, Ben P. Milner, Thomas Le Cornu
:
Using Visual Speech Information in Masking Methods for Audio Speaker Separation. 1742-1754 - Xiaofei Li

, Sharon Gannot
, Laurent Girin
, Radu Horaud
:
Multichannel Identification and Nonnegative Equalization for Dereverberation and Noise Reduction Based on Convolutive Transfer Function. 1755-1768 - Lutfi Kerem Senel

, Ihsan Utlu, Veysel Yücesoy, Aykut Koç
, Tolga Çukur
:
Semantic Structure and Interpretability of Word Embeddings. 1769-1779 - Yuma Koizumi

, Kenta Niwa
, Yusuke Hioka
, Kazunori Kobayashi, Yoichi Haneda
:
DNN-Based Source Enhancement to Increase Objective Sound Quality Assessment Score. 1780-1792 - Constantin Paleologu

, Jacob Benesty
, Silviu Ciochina:
Linear System Identification Based on a Kronecker Product Decomposition. 1793-1808 - Feifei Xiong

, Stefan Goetze
, Birger Kollmeier, Bernd T. Meyer
:
Exploring Auditory-Inspired Acoustic Features for Room Acoustic Parameter Estimation From Monaural Speech. 1809-1820 - Gaël Le Lan

, Delphine Charlet, Anthony Larcher, Sylvain Meignier:
An Adaptive Method for Cross-Recording Speaker Diarization. 1821-1832 - Wei Xue

, Alastair H. Moore
, Mike Brookes
, Patrick A. Naylor
:
Modulation-Domain Multichannel Kalman Filtering for Speech Enhancement. 1833-1847 - Kai Wu

, Vaninirappuputhenpurayil Gopalan Reju
, Andy W. H. Khong
:
Multisource DOA Estimation in a Reverberant Environment Using a Single Acoustic Vector Sensor. 1848-1859 - Jizhou Huang

, Yaming Sun
, Wei Zhang
, Haifeng Wang
, Ting Liu:
Entity Highlight Generation as Statistical and Neural Machine Translation. 1860-1872 - Quoc Truong Do

, Sakriani Sakti, Satoshi Nakamura
:
Sequence-to-Sequence Models for Emphasis Speech Translation. 1873-1883 - Federico Fontana

, Enrico Bozzo
:
Explicit Fixed-Point Computation of Nonlinear Delay-Free Loop Filter Networks. 1884-1896 - Simon Widmark

:
Causal IIR Audio Precompensator Filters Subject to Quadratic Constraints. 1897-1912 - Fiete Winter

, Hagen Wierstorf, Christoph Hold, Frank Krüger
, Alexander Raake
, Sascha Spors:
Colouration in Local Wave Field Synthesis. 1913-1924 - Asger Heidemann Andersen

, Jan Mark de Haan, Zheng-Hua Tan
, Jesper Jensen
:
Nonintrusive Speech Intelligibility Prediction Using Convolutional Neural Networks. 1925-1939
Volume 26, Number 11, November 2018
- Hossein Hadian

, Hossein Sameti
, Daniel Povey, Sanjeev Khudanpur:
Flat-Start Single-Stage Discriminatively Trained HMM-Based Models for ASR. 1949-1961 - Fabrice Katzberg

, Radoslaw Mazur, Marco Maaß
, Philipp Koch, Alfred Mertins
:
A Compressed Sensing Framework for Dynamic Sound-Field Measurements. 1962-1975 - Sundar Harshavardhan

, Thippur V. Sreenivas, Chandra Sekhar Seelamantula
:
TDOA-Based Multiple Acoustic Source Localization Without Association Ambiguity. 1976-1990 - Reza Sahraeian

, Dirk Van Compernolle
:
Cross-Entropy Training of DNN Ensemble Acoustic Models for Low-Resource ASR. 1991-2001 - Heinrich Dinkel

, Yanmin Qian
, Kai Yu
:
Investigating Raw Wave Deep Neural Networks for End-to-End Speaker Spoofing Detection. 2002-2014 - Jie Zhang

, Richard Heusdens
, Richard Christian Hendriks
:
Rate-Distributed Spatial Filtering Based Noise Reduction in Wireless Acoustic Sensor Networks. 2015-2026 - Michael Heck

, Sakriani Sakti, Satoshi Nakamura
:
Dirichlet Process Mixture of Mixtures Model for Unsupervised Subword Modeling. 2027-2042 - Shuai Nie

, Shan Liang
, Wenju Liu
, Xueliang Zhang
, Jianhua Tao:
Deep Learning Based Speech Separation via NMF-Style Reconstructions. 2043-2055 - Harishchandra Dubey

, Abhijeet Sangwan, John H. L. Hansen
:
Leveraging Frequency-Dependent Kernel and DIP-Based Clustering for Robust Speech Activity Detection in Naturalistic Audio Streams. 2056-2071 - Youngsoo Jang

, Jiyeon Ham, Byung-Jun Lee
, Kee-Eung Kim:
Cross-Language Neural Dialog State Tracker for Large Ontologies Using Hierarchical Attention. 2072-2082 - Gellért Weisz, Pawel Budzianowski

, Pei-Hao Su
, Milica Gasic
:
Sample Efficient Deep Reinforcement Learning for Dialogue Systems With Large Action Spaces. 2083-2097 - Shoufeng Lin

:
Reverberation-Robust Localization of Speakers Using Distinct Speech Onsets and Multichannel Cross Correlations. 2098-2111 - Shamsiah Abidin

, Roberto Togneri
, Ferdous Ahmed Sohel
:
Spectrotemporal Analysis Using Local Binary Pattern Variants for Acoustic Scene Classification. 2112-2121 - Ning Ma

, José A. González
, Guy J. Brown
:
Robust Binaural Localization of a Target Sound Source by Combining Spectral Source Models and Deep Neural Networks. 2122-2131 - Shuangzhi Wu

, Dongdong Zhang
, Zhirui Zhang, Nan Yang, Mu Li, Ming Zhou:
Dependency-to-Dependency Neural Machine Translation. 2132-2141 - Jingjing Xu

, Hangfeng He
, Xu Sun
, Xuancheng Ren
, Sujian Li:
Cross-Domain and Semisupervised Named Entity Recognition in Chinese Social Media: A Unified Model. 2142-2152 - Steven Van Kuyk

, W. Bastiaan Kleijn
, Richard Christian Hendriks
:
An Evaluation of Intrusive Instrumental Intelligibility Metrics. 2153-2166 - Xi Ouyang

, Kang Gu, Pan Zhou:
Spatial Pyramid Pooling Mechanism in 3D Convolutional Network for Sentence-Level Classification. 2167-2179 - Brian McFee

, Justin Salamon
, Juan Pablo Bello
:
Adaptive Pooling Operators for Weakly Labeled Sound Event Detection. 2180-2193 - Isabel Barbancho

, George Tzanetakis
, Ana M. Barbancho
, Lorenzo J. Tardón
:
Discrimination Between Ascending/Descending Pitch Arpeggios. 2194-2203 - Younggwan Kim

, Myung Jong Kim
, Jahyun Goo, Hoirin Kim
:
Learning Self-Informed Feature Contribution for Deep Learning-Based Acoustic Modeling. 2204-2214 - Mert Burkay Çöteli, Orhun Olgun, Hüseyin Hacihabiboglu

:
Multiple Sound Source Localization With Steered Response Power Density and Hierarchical Grid Refinement. 2215-2229 - Junwei Bao

, Yeyun Gong, Nan Duan
, Ming Zhou, Tiejun Zhao:
Question Generation With Doubly Adversarial Nets. 2230-2239 - Bing Bu, Changchun Bao

, Mao-shen Jia
:
Design of a Planar First-Order Loudspeaker Array for Global Active Noise Control. 2240-2250
Volume 26, Number 12, December 2018
- Xing Wang

, Zhaopeng Tu, Min Zhang
:
Incorporating Statistical Machine Translation Word Knowledge Into Neural Machine Translation. 2255-2266 - Yunxin Zhao

, Mili Kuruvilla-Dugdale
, Minguang Song:
Structured Sparse Spectral Transforms and Structural Measures for Voice Conversion. 2267-2276 - Haniyeh Salehi

, David Suelzle, Paula Folkeard
, Vijay Parsa:
Learning-Based Reference-Free Speech Quality Measures for Hearing Aid Applications. 2277-2288 - Gerald Enzner

, Philipp Thüne
:
Bayesian MMSE Filtering of Noisy Speech by SNR Marginalization With Global PSD Priors. 2289-2304 - Gongping Huang

, Jingdong Chen
, Jacob Benesty
:
Insights Into Frequency-Invariant Beamforming With Concentric Circular Microphone Arrays. 2305-2318 - Shiqi Shen, Yun Chen, Cheng Yang

, Zhiyuan Liu
, Maosong Sun:
Zero-Shot Cross-Lingual Neural Headline Generation. 2319-2327 - Sudeep Surendran

, T. Kishore Kumar:
Oblique Projection and Cepstral Subtraction in Signal Subspace Speech Enhancement for Colored Noise Reduction. 2328-2340 - Qiang Li

, Derek F. Wong
, Lidia S. Chao, Muhua Zhu
, Tong Xiao
, Jingbo Zhu, Min Zhang:
Linguistic Knowledge-Aware Neural Machine Translation. 2341-2354 - Wen Zhang

, Christian Hofmann
, Michael Buerger, Thushara Dheemantha Abhayapala
, Walter Kellermann:
Spatial Noise-Field Control With Online Secondary Path Modeling: A Wave-Domain Approach. 2355-2370 - Adrien Meynard

, Bruno Torrésani
:
Spectral Analysis for Nonstationary Audio. 2371-2380 - Irene Martín-Morató

, Maximo Cobos
, Francesc J. Ferri
:
Adaptive Mid-Term Representations for Robust Audio Event Classification. 2381-2392 - Gergely Firtha

, Péter Fiala, Frank Schultz
, Sascha Spors
:
On the General Relation of Wave Field Synthesis and Spectral Division Method for Linear Arrays. 2393-2403 - Peter Birkholz

, Simon Stone
, Klaus Wolf, Dirk Plettemeier:
Non-Invasive Silent Phoneme Recognition Using Microwave Signals. 2404-2411 - Wei-Wei Lin

, Man-Wai Mak
, Jen-Tzung Chien
:
Multisource I-Vectors Domain Adaptation Using Maximum Mean Discrepancy Based Autoencoders. 2412-2422 - Mohammed Abdel-Wahab

, Carlos Busso
:
Domain Adversarial for Acoustic Emotion Recognition. 2423-2435 - Dalia El Badawy

, Ivan Dokmanic
:
Direction of Arrival With One Microphone, a Few LEGOs, and Non-Negative Matrix Factorization. 2436-2446 - Hung-yi Lee

, Pei-Hung Chung
, Yen-Chen Wu, Tzu-Hsiang Lin, Tsung-Hsien Wen:
Interactive Spoken Content Retrieval by Deep Reinforcement Learning. 2447-2459 - Samy Elshamy

, Nilesh Madhu
, Wouter Tirry, Tim Fingscheidt
:
DNN-Supported Speech Enhancement With Cepstral Estimation of Both Excitation and Envelope. 2460-2474 - Yu Bao

, Huawei Chen
:
A Chance-Constrained Programming Approach to the Design of Robust Broadband Beamformers With Microphone Mismatches. 2475-2488 - Haizhou Li

:
Farewell Editorial. 2489

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














