


default search action
WASPAA 2023: New Paltz, NY, USA
- IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2023, New Paltz, NY, USA, October 22-25, 2023. IEEE 2023, ISBN 979-8-3503-2372-6

- Ayal Schwartz, Elior Hadad, Sharon Gannot, Shlomo E. Chazan:

Array Configuration Mismatch in Deep DOA Estimation: Towards Robust Training. 1-5 - Bastiaan Tamm

, Rik Vandenberghe, Hugo Van hamme
:
Analysis of XLS-R for Speech Quality Assessment. 1-5 - Aryan Chaudhary, Vinayak Abrol:

Towards on-Device Keyword Spotting using Low-Footprint Quaternion Neural Models. 1-5 - Gal Itzhak, Israel Cohen

:
Region-of-Interest Oriented Constant-Beamwidth Beamforming with Rectangular Arrays. 1-5 - Chang-Bin Jeon, Kyogu Lee:

Music De-Limiter Networks Via Sample-Wise Gain Inversion. 1-5 - Da-Hee Yang

, Donghyun Kim, Joon-Hyuk Chang:
Masked Frequency Modeling for Improving Packet Loss Concealment in Speech Transmission Systems. 1-5 - Kenta Ogawa, Shun Sawada, Kouichi Katsurada, Hidehumi Ohmura:

Automatic Detection of Poor Tone Quality in Classical Guitar Playing Using Deep Anomaly Detection Method. 1-5 - Afagh Farhadi, Laurel H. Carney:

Predicting Thresholds in an Auditory Overshoot Paradigm Using a Computational Subcortical Model with Efferent Feedback. 1-5 - Leny Vinceslas, Matteo Scerbo

, Hüseyin Hacihabiboglu, Zoran Cvetkovic, Enzo De Sena
:
Low-Complexity Higher Order Scattering Delay Networks. 1-5 - Yurii Iotov

, Sidsel Marie Nørholm, Valiantsin Belyi, Mads Græsbøll Christensen
:
Adaptive Sparse Linear Prediction in Fixed-Filter ANC Headphone Applications for Multi-Speaker Speech Reduction. 1-5 - Shuai Tao, Yang Xiang, Himavanth Reddy, Jesper Rindom Jensen

, Mads Græsbøll Christensen
:
Single Channel Speech Presence Probability Estimation based on Hybrid Global-Local Information. 1-5 - Tre DiPassio, Michael C. Heilemann

, Benjamin Thompson, Mark F. Bocko:
Estimating the Direction of Arrival of a Spoken Wake Word Using a Single Sensor on an Elastic Panel. 1-5 - Devansh Zurale

, Shlomo Dubnov:
Learning Sub-Dimensional HRTF Representations Towards Individualization Applications - Traditional and Deep Learning Approaches. 1-5 - Richard Füg, Bernd Edler:

Temporal Noise Shaping on MDCT Subband Signals for Transform Audio Coding. 1-5 - Pablo M. Delgado, Jürgen Herre:

An Improved Metric of Informational Masking for Perceptual Audio Quality Measurement. 1-5 - Dimitrios Bralios

, Efthymios Tzinis, Paris Smaragdis:
Complete and Separate: Conditional Separation with Missing Target Source Attribute Completion. 1-5 - Eric Guizzo, Tillman Weyde, Giacomo Tarroni, Danilo Comminiello:

Quaternion Anti-Transfer Learning for Speech Emotion Recognition. 1-5 - Michael Neri

, Archontis Politis
, Daniel Krause, Marco Carli, Tuomas Virtanen
:
Single-Channel Speaker Distance Estimation in Reverberant Environments. 1-5 - Yuma Koizumi, Heiga Zen, Shigeki Karita, Yifan Ding, Kohei Yatabe

, Nobuyuki Morioka, Yu Zhang, Wei Han, Ankur Bapna, Michiel Bacchiani:
Miipher: A Robust Speech Restoration Model Integrating Self-Supervised Speech and Text Representations. 1-5 - François G. Germain, Gordon Wichern, Jonathan Le Roux:

Hyperbolic Unsupervised Anomalous Sound Detection. 1-5 - Elisa Tengan, Thomas Dietzen

, Filip Elvander
, Toon van Waterschoot:
Multi-Source Direction-of-Arrival Estimation using Group-Sparse Fitting of Steered Response Power Maps. 1-5 - Yoshiki Masuyama, Xuankai Chang, Wangyou Zhang, Samuele Cornell

, Zhong-Qiu Wang, Nobutaka Ono, Yanmin Qian, Shinji Watanabe
:
Exploring the Integration of Speech Separation and Recognition with Self-Supervised Learning Representation. 1-5 - Atsushi Miyashita, Tomoki Toda:

Differentiable Representation of Warping Based on Lie Group Theory. 1-5 - Hong-Goo Kang, Jan Skoglund, W. Bastiaan Kleijn

, Andrew Storus, Hengchin Yeh:
A High-Rate Extension to Soundstream. 1-5 - Jarin Ritu, Ethan Barnes, Riley Martell, Alexandra Van Dine, Joshua Peeples:

Histogram Layer Time Delay Neural Networks for Passive Sonar Classification. 1-5 - James A. King, Arshdeep Singh

, Mark D. Plumbley:
Compressing Audio CNNS with Graph Centrality Based Filter Pruning. 1-5 - Keisuke Kimura, Shoichi Koyama, Hiroshi Saruwatari:

Perceptual Quality Enhancement of Sound Field Synthesis Based on Combination of Pressure and Amplitude Matching. 1-5 - Ivan Shanin, Simon Dixon:

Annotating Jazz Recordings Using Lead Sheet Alignment with Deep Chroma Features. 1-5 - Jean-Marie Lemercier, Simon Welker, Timo Gerkmann:

Diffusion Posterior Sampling for Informed Single-Channel Dereverberation. 1-5 - Ahmed Alghamdi, Leonard Moen, Wai-Yip Chan, Daniel Fogerty, Jesper Jensen:

Correlation Based Glimpse Proportion Index. 1-5 - Yoshiki Masuyama, Natsuki Ueno, Nobutaka Ono:

Signal Reconstruction from Mel-Spectrogram Based on Bi-Level Consistency of Full-Band Magnitude and Phase. 1-5 - Enric Gusó, Joanna Luberadzka

, Martí Baig, Umut Sayin Saraç
, Xavier Serra:
An Objective Evaluation of Hearing AIDS and DNN-Based Binaural Speech Enhancement in Complex Acoustic Scenes. 1-5 - Julia Wilkins, Justin Salamon, Magdalena Fuentes

, Juan Pablo Bello
, Oriol Nieto:
Bridging High-Quality Audio and Video Via Language for Sound Effects Retrieval from Visual Queries. 1-5 - Byeongho Jo, Seungkwon Beack:

Hybrid Noise Shaping for Audio Coding Using Perfectly Overlapped Window. 1-5 - Yinghao Aaron Li, Cong Han, Nima Mesgarani:

SLMGAN: Exploiting Speech Language Model Representations for Unsupervised Zero-Shot Voice Conversion in GANs. 1-5 - Saurjya Sarkar, Louise Thorpe, Emmanouil Benetos, Mark Sandler:

Leveraging Synthetic Data for Improving Chamber Ensemble Separation. 1-5 - Jin Woo Lee

, Hyeong-Seok Choi, Kyogu Lee:
AECSQI: Referenceless Acoustic Echo Cancellation Measures Using Speech Quality and Intelligibility Improvement. 1-5 - Jiarui Hai, Mounya Elhilali:

Diff-Pitcher: Diffusion-Based Singing Voice Pitch Correction. 1-5 - Ricardo Falcón Pérez

, Gordon Wichern, François G. Germain, Jonathan Le Roux:
Location as Supervision for Weakly Supervised Multi-Channel Source Separation of Machine Sounds. 1-5 - Ilyass Moummad, Nicolas Farrugia:

Pretraining Respiratory Sound Representations using Metadata and Contrastive Learning. 1-5 - Vincent Lostanlen, Daniel Haider, Han Han, Mathieu Lagrange, Péter Balázs, Martin Ehler

:
Fitting Auditory Filterbanks with Multiresolution Neural Networks. 1-5 - Menglu Li, Xiao-Ping Zhang:

Robust Audio Anti-Spoofing System Based on Low-Frequency Sub-Band Information. 1-5 - Rajesh R

, Padmanabhan Rajan:
Neural Networks for Interference Reduction in Multi-Track Recordings. 1-5 - Ante Jukic, Jagadeesh Balam, Boris Ginsburg:

Flexible Multichannel Speech Enhancement for Noise-Robust Frontend. 1-5 - Alice Sokolova, Baris Aksanli

, Fred Harris, Harinath Garudadri:
Consolidating Compression and Revisiting Expansion: an Alternative Amplification Rule for Wide Dynamic Range Compression. 1-5 - Bowen Zhi, Alisha Sharma, Dmitry N. Zotkin, Ramani Duraiswami

:
A Differentiable Image Source Model for Room Acoustics Optimization. 1-5 - Archontis Politis

, Lauros Pajunen, Jussi Leppänen, Sujeet Mate, Antti J. Eronen:
Wide-Area 6DOF Rendering of Multi-Point Ambisonic Recordings Based on Interpolation of Spatial Parameters. 1-5 - Mohamed Elminshawi, Srikanth Raj Chetupalli

, Emanuël A. P. Habets:
Slim-Tasnet: A Slimmable Neural Network for Speech Separation. 1-5 - Martin Strauss, Nicola Pia, Nagashree K. S. Rao, Bernd Edler:

SEFGAN: Harvesting the Power of Normalizing Flows and GANs for Efficient High-Quality Speech Enhancement. 1-5 - Maximilian Schäfer, Karolina Prawda

, Rudolf Rabenstein, Sebastian J. Schlecht:
Distribution of Modal Damping in Absorptive Shoebox Rooms. 1-5 - Rui Wang, Tomoki Toda:

Directional Target Speaker Extraction under Noisy Underdetermined Conditions through Conditional Variational Autoencoder with Global Style Tokens. 1-5 - Pil Moo Byun, Jeong-Hwan Choi

, Joon-Hyuk Chang:
Class Activation Mapping-Driven Data Augmentation: Masking Significant Regions for Enhanced Acoustic Scene Classification. 1-5 - Taejun Kim, Juhan Nam:

All-in-One Metrical and Functional Structure Analysis with Neighborhood Attentions on Demixed Audio. 1-5 - Henri Gode, Simon Doclo:

Covariance Blocking and Whitening Method for Successive Relative Transfer Function Vector Estimation in Multi-Speaker Scenarios. 1-5 - Jan Büthe, Jean-Marc Valin, Ahmed Mustafa:

Lace: A Light-Weight, Causal Model for Enhancing Coded Speech Through Adaptive Convolutions. 1-5 - Cyrus Vahidi, Shubhr Singh, Emmanouil Benetos, Huy Phan, Dan Stowell

, György Fazekas, Mathieu Lagrange:
Perceptual Musical Similarity Metric Learning with Graph Neural Networks. 1-5 - Diep Luong

, Minh Tran, Shayan Gharib, Konstantinos Drossos
, Tuomas Virtanen
:
Representation Learning for Audio Privacy Preservation Using Source Separation and Robust Adversarial Learning. 1-5 - Nils L. Westhausen, Bernd T. Meyer:

Low Bit Rate Binaural Link for Improved Ultra Low-Latency Low-Complexity Multichannel Speech Enhancement in Hearing Aids. 1-5 - Shoichi Koyama, Masaki Nakada, Juliano G. C. Ribeiro, Hiroshi Saruwatari:

Kernel Interpolation of Incident Sound Field in Region Including Scattering Objects. 1-5 - Matthew Rice, Christian J. Steinmetz, George Fazekas, Joshua D. Reiss:

General Purpose Audio Effect Removal. 1-5 - Samuel F. Potter, Monte Hoover, Dmitry N. Zotkin, Ramani Duraiswami

:
Computing Acoustic Onsets Via an Eikonal Solver. 1-5 - Andrew Wiggins, Youngmoo E. Kim:

A Differentiable Acoustic Guitar Model for String-Specific Polyphonic Synthesis. 1-5 - Hao-Wen Dong, Xiaoyu Liu, Jordi Pons, Gautam Bhattacharya, Santiago Pascual, Joan Serrà, Taylor Berg-Kirkpatrick, Julian J. McAuley

:
CLIPSonic: Text-to-Audio Synthesis with Unlabeled Videos and Pretrained Language-Vision Models. 1-5 - Amir Ivry, Israel Cohen, Baruch Berdugo:

Deep Adaptation Control for Stereophonic Acoustic Echo Cancellation. 1-5 - Yaakov Buchris, Israel Cohen, Alon Amar:

Design of Frequency-Invariant Beamformers with Sparse Concentric Circular Arrays. 1-5 - Wo Jae Lee, Emanuele Coviello:

A Novel Method to Detect Instrumental Music in a Large Scale Music Catalog. 1-5 - George Close

, Thomas Hain
, Stefan Goetze
:
The Effect of Spoken Language on Speech Enhancement Using Self-Supervised Speech Representation Loss Functions. 1-5 - Carlotta Anemüller, Oliver Thiergart, Emanuël A. P. Habets:

Neural Audio Decorrelation Using Generative Adversarial Networks. 1-5 - Aditya Arie Nugraha, Diego Di Carlo, Yoshiaki Bando, Mathieu Fontaine, Kazuyoshi Yoshii

:
Time-Domain Audio Source Separation Based on Gaussian Processes with Deep Kernel Learning. 1-5 - Zhi Zhong, Hao Shi, Masato Hirano, Kazuki Shimada, Kazuya Tateishi, Takashi Shibuya

, Shusuke Takahashi, Yuki Mitsufuji:
Extending Audio Masked Autoencoders toward Audio Restoration. 1-5 - Ryan M. Corey:

Mixed-Delay Distributed Beamforming for Own-Speech Separation in Hearing Devices with Wireless Remote Microphones. 1-5 - Axel Marmoret, Jérémy E. Cohen, Frédéric Bimbot:

Convolutive Block-Matching Segmentation Algorithm with Application to Music Structure Analysis. 1-5 - Mark R. P. Thomas, Jan-Hendrik Hanschke:

Inverted Cardioid Topology for Multi-Radius Spherical Microphone Arrays. 1-5 - Yutong Wen, You Zhang

, Zhiyao Duan:
Mitigating Cross-Database Differences for Learning Unified HRTF Representation. 1-5 - Christoph Hold

, Leo McCormack, Archontis Politis
, Ville Pulkki:
Optimizing Higher-Order Directional Audio Coding with Adaptive Mixing and Energy Matching for Ambisonic Compression and Upmixing. 1-5 - Benjamin Stahl, Alois Sontacchi:

Multichannel Subband-Fullband Gated Convolutional Recurrent Neural Network for Direction-Based Speech Enhancement with Head-Mounted Microphone Arrays. 1-5 - Davide Berghi

, Philip J. B. Jackson:
Audio Inputs for Active Speaker Detection and Localization Via Microphone Array. 1-5 - Shivam Saini

, Jürgen Peissig:
Blind Room Acoustic Parameters Estimation Using Mobile Audio Transformer. 1-5 - Ernst Seidel

, Pejman Mowlaee, Tim Fingscheidt:
Efficient Deep Acoustic Echo Suppression with Condition-Aware Training. 1-5 - Wiebke Middelberg, Henri Gode, Simon Doclo:

Relative Transfer Function Vector Estimation for Acoustic Sensor Networks Exploiting Covariance Matrix Structure. 1-5 - Sungho Lee, Hyeong-Seok Choi, Kyogu Lee:

Yet Another Generative Model for Room Impulse Response Estimation. 1-5 - Saksham Singh Kushwaha, Irán R. Román

, Magdalena Fuentes
, Juan Pablo Bello
:
Sound Source Distance Estimation in Diverse and Dynamic Acoustic Conditions. 1-5 - Zhepei Wang, Cem Subakan, Krishna Subramani, Junkai Wu, Tiago Tavares, Fábio Ayres, Paris Smaragdis:

Unsupervised Improvement of Audio-Text Cross-Modal Representations. 1-5

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














