


default search action
WASPAA 2021: New Paltz, NY, USA
- IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2021, New Paltz, NY, USA, October 17-20, 2021. IEEE 2021, ISBN 978-1-6654-4870-3

- Ryan M. Corey

, Andrew C. Singer
:
Adaptive Binaural Filtering for a Multiple-Talker Listening System Using Remote and On-Ear Microphones. 1-5 - Shahan Nercessian:

End-to-End Zero-Shot Voice Conversion Using a DDSP Vocoder. 1-5 - Shoichi Koyama, Tomoya Nishida, Keisuke Kimura, Takumi Abe, Natsuki Ueno, Jesper Brunnström

:
MESHRIR: A Dataset of Room Impulse Responses on Meshed Grid Points for Evaluating Sound Field Analysis and Synthesis Methods. 1-5 - Pranay Manocha

, Anurag Kumar, Buye Xu, Anjali Menon, Israel D. Gebru, Vamsi K. Ithapu, Paul Calamia:
DPLM: A Deep Perceptual Spatial-Audio Localization Metric. 6-10 - Zhixing Liu, Yannan Wang, Gaoxiong Yi, Tao Yu, Fei Chen:

Assessing Segmental Impact for Objective Speech Quality Evaluation. 11-15 - Ahmed Alghamdi, Wai-Yip Chan, Daniel Fogerty, Jesper Jensen:

Improved Intelligibility Prediction in the Modulation Domain. 16-20 - Ryo Tanabe, Harsh Purohit, Kota Dohi, Takashi Endo, Yuki Nikaido, Toshiki Nakamura, Yohei Kawaguchi:

MIMII Due: Sound Dataset for Malfunctioning Industrial Machine Investigation and Inspection with Domain Shifts Due to Changes in Operational and Environmental Conditions. 21-25 - Benjamin Elizalde, Radu Revutchi, Samarjit Das, Bhiksha Raj, Ian R. Lane, Laurie M. Heller

:
Identifying Actions for Sound Event Classification. 26-30 - Krishna Subramani, Paris Smaragdis:

Point Cloud Audio Processing. 31-35 - Yu Wang, Nicholas J. Bryan, Justin Salamon, Mark Cartwright, Juan Pablo Bello

:
Who Calls The Shots? Rethinking Few-Shot Learning for Audio. 36-40 - Zhepei Wang, Jonah Casebeer, Adam Clemmitt, Efthymios Tzinis, Paris Smaragdis:

Sound Event Detection with Adaptive Frequency Selection. 41-45 - Efthymios Tzinis, Jonah Casebeer, Zhepei Wang, Paris Smaragdis:

Separate But Together: Unsupervised Federated Learning for Speech Enhancement from Non-IID Data. 46-50 - Scott Wisdom, Aren Jansen, Ron J. Weiss, Hakan Erdogan, John R. Hershey:

Sparse, Efficient, and Semantic Mixture Invariant Training: Taming In-the-Wild Unsupervised Sound Separation. 51-55 - Zhong-Qiu Wang, Gordon Wichern, Jonathan Le Roux:

Convolutive Prediction for Reverberant Speech Separation. 56-60 - Aurora Cramer, Mark Cartwright, Fatemeh Pishdadian, Juan Pablo Bello

:
Weakly Supervised Source-Specific Sound Level Estimation in Noisy Soundscapes. 61-65 - Ahmed Mustafa, Jan Büthe, Srikanth Korse

, Kishan Gupta, Guillaume Fuchs
, Nicola Pia:
A Streamwise Gan Vocoder for Wideband Speech Coding at Very Low Bit Rate. 66-70 - Santiago Pascual, Joan Serrà, Jordi Pons:

Adversarial Auto-Encoding for Packet Loss Concealment. 71-75 - Daniel T. Jones

, Dushyant Sharma, Stanislav Yu. Kruchinin, Patrick A. Naylor
:
Spatial Coding for Microphone Arrays Using Ipnlms-Based RTF Estimation. 76-80 - Hsuan-Yang Wang, Philip Nelson

, Christine Evers
:
Excitation-Inhibition Cell Activity Patterns for Binaural Source Localisation. 81-85 - Hongmei Hu, Stephan Dieter Ewert:

Speech Intelligibility of Mandarin- and German-Speaking Listeners in Challenging Conditions. 86-90 - Matteo Torcoli

, Jouni Paulus
, Thorsten Kastner, Christian Uhle:
Controlling the Remixing of Separated Dialogue with a Non-Intrusive Quality Estimate. 91-95 - Benjamin Stahl, Alois Sontacchi:

SIDIQ: Computational Quality Assessment of Enhanced Speech Based on Auditory Figure-Ground Segregation, Similarity, and Disturbance. 96-100 - Amir Ivry, Israel Cohen, Baruch Berdugo:

Objective Metrics to Evaluate Residual-Echo Suppression During Double-Talk. 101-105 - Raffaele Malvermi

, Fabio Antonacci, Augusto Sarti, Roberto Corradi:
Prediction of Missing Frequency Response Functions Through Deep Image Prior. 106-110 - Giorgia Cantisani

, Alexey Ozerov, Slim Essid, Gaël Richard:
User-Guided One-Shot Deep Model Adaptation for Music Source Separation. 111-115 - Javier Nistal, Cyran Aouameur, Stefan Lattner, Gaël Richard:

VQCPC-GAN: Variable-Length Adversarial Audio Synthesis Using Vector-Quantized Contrastive Predictive Coding. 116-120 - Christof Weiß, Geoffroy Peeters:

Learning Multi-Pitch Estimation from Weakly Aligned Score-Audio Pairs Using a Multi-Label CTC Loss. 121-125 - Guillaume Carbajal, Julius Richter

, Timo Gerkmann
:
Disentanglement Learning for Variational Autoencoders Applied to Audio-Visual Speech Enhancement. 126-130 - Andreas Brendel

, Walter Kellermann:
Fasteriva: Update Rules for Independent Vector Analysis Based on Negentropy and the Majorize-Minimize Principle. 131-135 - Sebastian Braun, Ivan Tashev:

Low Complexity Online Convolutional Beamforming. 136-140 - Osman Asif Malik

, Venkatalakshmi Vyjayanthi Narumanchi, Stephen Becker, Todd W. Murray:
Superresolution Photoacoustic Tomography Using Random Speckle Illumination and Second Order Moments. 141-145 - Wangyou Zhang, Jing Shi, Chenda Li, Shinji Watanabe

, Yanmin Qian:
Closing the Gap Between Time-Domain Multi-Channel Speech Enhancement on Real and Simulation Conditions. 146-150 - Amy Bastine

, Thushara D. Abhayapala, Jihui Zhang
:
Analysis of Frequency-Dependent Behavior of Room Reflections Using Spherical Microphone Measurements & Von Mises-Fisher Clustering. 156-160 - Yuma Koizumi, Shigeki Karita, Scott Wisdom, Hakan Erdogan, John R. Hershey, Llion Jones, Michiel Bacchiani:

DF-Conformer: Integrated Architecture of Conv-Tasnet and Conformer Using Linear Complexity Self-Attention for Speech Enhancement. 161-165 - Jiaqi Su, Zeyu Jin, Adam Finkelstein:

HiFi-GAN-2: Studio-Quality Speech Enhancement via Generative Adversarial Networks Conditioned on Acoustic Features. 166-170 - Aswin Sivaraman

, Minje Kim:
Zero-Shot Personalized Speech Enhancement Through Speaker-Informed Model Selection. 171-175 - Sunwoo Kim, Minje Kim:

Test-Time Adaptation Toward Personalized Speech Enhancement: Zero-Shot Learning with Knowledge Distillation. 176-180 - Enis Berk Çoban, Ali Raza Syed, Dara Pir, Michael I. Mandel:

Towards Large Scale Ecoacoustic Monitoring with Small Amounts of Labeled Data. 181-185 - Gordon Wichern, Ankush Chakrabarty, Zhong-Qiu Wang, Jonathan Le Roux:

Anomalous Sound Detection Using Attentive Neural Processes. 186-190 - Debottam Dutta, Purvi Agrawal, Sriram Ganapathy:

A Multi-Head Relevance Weighting Framework for Learning Raw Waveform Audio Representations. 191-195 - Donmoon Lee, Kyogu Lee:

Cross-Domain Semi-Supervised Audio Event Classification Using Contrastive Regularization. 196-200 - Vincent W. Neo

, Christine Evers
, Patrick A. Naylor
:
Polynomial Matrix Eigenvalue Decomposition-Based Source Separation Using Informed Spherical Microphone Arrays. 201-205 - Thomas Dietzen

, Enzo De Sena
, Toon van Waterschoot:
Low-Complexity Steered Response Power Mapping Based on Nyquist-Shannon Sampling. 206-210 - Sharath Adavanne

, Archontis Politis
, Tuomas Virtanen
:
Differentiable Tracking-Based Training of Deep Learning Sound Source Localizers. 211-215 - Daniele Salvati

, Carlo Drioli, Gian Luca Foresti:
Spherical Harmonic Diagonal Unloading Beamforming with Ego-Noise Reduction for DOA Estimation from Autonomous Systems. 216-220 - Christian J. Steinmetz, Vamsi Krishna Ithapu, Paul Calamia:

Filtered Noise Shaping for Time Domain Room Impulse Response Estimation from Reverberant Speech. 221-225 - Prerak Srivastava, Antoine Deleforge, Emmanuel Vincent:

Blind Room Parameter Estimation Using Multiple Multichannel Speech Recordings. 226-230 - Jens Ahrens, Hannes Helmholz, David Lou Alon, Sebastià Vicenc Amengual Garí

:
Spherical Harmonic Decomposition of a Sound Field Based on Microphones Around the Circumference of a Human Head. 231-235 - Maximilian Kentgens, Peter Jax:

Ambient-Aware Sound Field Translation Using Optimal Spatial Filtering. 236-240 - Ege Erdem

, Orhun Olgun, Hüseyin Hacihabiboglu
:
Internal Time Delay Calibration of Rigid Spherical Microphone Arrays for Multi-Perspective 6DoF Audio Recordings. 241-245 - Irene Martín-Morató, Manu Harju

, Annamaria Mesaros
:
Crowdsourcing Strong Labels for Sound Event Detection. 246-250 - Eduardo Fonseca, Aren Jansen, Daniel P. W. Ellis, Scott Wisdom, Marco Tagliasacchi, John R. Hershey, Manoj Plakal, Shawn Hershey, R. Channing Moore, Xavier Serra:

Self-Supervised Learning from Automatically Separated Sound Scenes. 251-255 - Jun Deng, Chunhui Gao, Qian Feng, Xinzhou Xu, Zhaopeng Chen:

Adaptive Generalized Cross-Entropy Loss for Sound Event Classification with Noisy Labels. 256-260 - Ryosuke Horiuchi, Shoichi Koyama, Juliano G. C. Ribeiro, Natsuki Ueno, Hiroshi Saruwatari:

Kernel Learning for Sound Field Estimation with L1 and L2 Regularizations. 261-265 - Jingwei Xi, Wen Zhang, Thushara D. Abhayapala:

Magnitude Modelling of Individualized HRTFs Using DNN Based Spherical Harmonic Analysis. 266-270 - Yi Ren

, Yoichi Haneda:
2D Local Exterior Sound Field Reproduction Using an Addition Theorem Based on Circular Harmonic Expansion. 271-275 - Takuma Okamoto:

2D Multizone Sound Field Synthesis with Interior-Exterior Ambisonics. 276-280 - Keisuke Kimura, Shoichi Koyama, Natsuki Ueno, Hiroshi Saruwatari:

Mean-Square-Error-Based Secondary Source Placement in Sound Field Synthesis with Prior Information on Desired Field. 281-285 - Hanwen Bi, Fei Ma, Thushara D. Abhayapala, Prasanga N. Samarasinghe:

Spherical Array Based Drone Noise Measurements and Modelling for Drone Noise Reduction via Propeller Phase Control. 286-290 - Jonah Casebeer, Nicholas J. Bryan, Paris Smaragdis:

Auto-DSP: Learning to Optimize Acoustic Echo Cancellers. 291-295 - Naoki Murata, Yuhta Takida, Tetsu Magariyachi:

Fast Convergent Method for Active Noise Control Over Spatial Region with Causal Constraint. 296-300 - Huiyuan Sun

, Jihui Zhang
, Thushara D. Abhayapala, Prasanga N. Samarasinghe
:
Active Noise Control Over 3D Space with Remote Microphone Technique in the Wave Domain. 301-305 - Tamara Smyth, Devansh Zurale

:
On the Role of Lip Reflection/Transmission in the Relationship Between LPC and Waveguide Vocal Tract Models. 311-315 - Darius Petermann, Seungkwon Beack, Minje Kim:

Harp-Net: Hyper-Autoencoded Reconstruction Propagation for Scalable Neural Audio Coding. 316-320 - François G. Germain:

Periodic Analysis of Nonlinear Virtual Analog Models. 321-325 - Aidan O. T. Hogg, Vincent W. Neo

, Stephan Weiss, Christine Evers
, Patrick A. Naylor
:
A Polynomial Eigenvalue Decomposition Music Approach for Broadband Sound Source Localization. 326-330 - Daniel Aleksander Krause, Archontis Politis

, Annamaria Mesaros
:
Joint Direction and Proximity Classification of Overlapping Sound Events from Binaural Audio. 331-335 - Pierre-Amaury Grumiaux, Srdan Kitic, Prerak Srivastava, Laurent Girin, Alexandre Guérin:

Saladnet: Self-Attentive Multisource Localization in the Ambisonics Domain. 336-340 - Christoph Kirsch, Stephan Dieter Ewert

:
Low-Order Filter Approximation of Diffraction for Virtual Acoustics. 341-345 - Thomas Deppisch

, Jens Ahrens, Sebastià Vicenc Amengual Garí
, Paul Calamia:
Spatial Subtraction of Reflections from Room Impulse Responses Measured with a Spherical Microphone Array. 346-350 - Achille Aknin, Roland Badeau:

Stochastic Reverberation Model with a Frequency Dependent Attenuation. 351-355 - Paula Sánchez López, Paul Callens, Milos Cernak:

A Universal Deep Room Acoustics Estimator. 356-360 - Christoph Hold

, Sebastian J. Schlecht, Archontis Politis
, Ville Pulkki:
Spatial Filter Bank in the Spherical Harmonic Domain: Reconstruction and Application. 361-365 - Stefano Damiano

, Federico Borra, Alberto Bernardini
, Fabio Antonacci, Augusto Sarti:
Soundfield Reconstruction in Reverberant Rooms Based on Compressive Sensing and Image-Source Models of Early Reflections. 366-370 - Leo McCormack, Archontis Politis

, Ville Pulkki:
Rendering of Source Spread for Arbitrary Playback Setups Based on Spatial Covariance Matching. 371-375

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














