EURASIP Journal on Audio, Speech, and Music Processing, Volume 2024, Number 1, December 2024
- Yunfei Shao, Xinxin Ma, Yong Ma, Weiqiang Zhang: Deep semantic learning for acoustic scene classification. 1
- Khomdet Phapatanaburi, Longbiao Wang, Meng Liu, Seiichi Nakagawa, Talit Jumphoo, Peerapong Uthansakul: Significance of relative phase features for shouted and normal speech classification. 2
- Junya Koguchi, Masanori Morise: Neural electric bass guitar synthesis framework enabling attack-sustain-representation-based technique control. 3
- Shangda Wu, Yue Yang, Zhaowen Wang, Xiaobing Li, Maosong Sun: Generating chord progression from melody with flexible harmonic rhythm and controllable harmonic density. 4
- Stijn Kindt, Jenthe Thienpondt, Luca Becker, Nilesh Madhu: Correction: Robustness of ad hoc microphone clustering using speaker embeddings: evaluation under realistic and challenging scenarios. 5
- Gebremichael Kibret Sheferaw, Waweru Mwangi, Michael W. Kimwele, Adane Letta Mamuye: Gated recurrent unit predictor model-based adaptive differential pulse code modulation speech decoder. 6
- Lingyun Xie, Yuehong Wang, Yan Gao: Acoustical feature analysis and optimization for aesthetic recognition of Chinese traditional music. 7
- Sivaramakrishna Yechuri, Sunny Dayal Vanambathina: Sub-convolutional U-Net with transformer attention network for end-to-end single-channel speech enhancement. 8
- Reemt Hinrichs, Kevin Gerkens, Alexander Lange, Jörn Ostermann: Blind extraction of guitar effects through blind system inversion and neural guitar effect modeling. 9
- Priyanka Gupta, Hemant A. Patil, Rodrigo Capobianco Guido: Vulnerability issues in Automatic Speaker Verification (ASV) systems. 10
- Huda Barakat, Oytun Türk, Cenk Demiroglu: Deep learning-based expressive speech synthesis: a systematic review of approaches, challenges, and resources. 11
- Marcos Lazaro Alvarez, Laura Arjona, Miguel Enrique Iglesias Martínez, Alfonso Bahillo: Automatic classification of the physical surface in sound uroflowmetry using machine learning methods. 12
- Zining Liang, Wen Zhang, Thushara D. Abhayapala: Sound field reconstruction using neural processes with dynamic kernels. 13
- Serhat Hizlisoy, Recep Sinan Arslan, Emel Çolakoglu: Singer identification model using data augmentation and enhanced feature conversion with hybrid feature vector and machine learning. 14
- Javier Tejedor, Doroteo T. Toledano: Whisper-based spoken term detection systems for search on speech ALBAYZIN evaluation challenge. 15
- Shivam Saini, Isaac Engel, Jürgen Peissig: An end-to-end approach for blindly rendering a virtual sound source in an audio augmented reality environment. 16
- Luca Comanducci, Fabio Antonacci, Augusto Sarti: Synthesis of soundfields through irregular loudspeaker arrays based on convolutional neural networks. 17
- Rabbia Mahum, Aun Irtaza, Ali Javed, Haitham A. Mahmoud, Haseeb Hassan: DeepDet: YAMNet with BottleNeck Attention Module (BAM) TTS synthesis detection. 18
- Sandeep Reddy Kothinti, Mounya Elhilali: Multi-rate modulation encoding via unsupervised learning for audio event detection. 19
- Zehua Zhang, Lu Zhang, Xuyi Zhuang, Yukun Qian, Mingjiang Wang: Supervised Attention Multi-Scale Temporal Convolutional Network for monaural speech enhancement. 20
- Rabbia Mahum, Aun Irtaza, Ali Javed, Haitham A. Mahmoud, Haseeb Hassan: Correction: DeepDet: YAMNet with BottleNeck Attention Module (BAM) for TTS synthesis detection. 21
- Usama Saqib, Mads Græsbøll Christensen, Jesper Rindom Jensen: Robust acoustic reflector localization using a modified EM algorithm. 22
- Chunxi Wang, Maoshen Jia, Meiran Li, Changchun Bao, Wenyu Jin: Exploring the power of pure attention mechanisms in blind room parameter estimation. 23
- Tomasz Wojnar, Jaroslaw Hryszko, Adam Roman: Mi-Go: tool which uses YouTube as data source for evaluating general-purpose speech recognition machine learning models. 24
- David Gimeno-Gómez, Carlos David Martínez-Hinarejos: Continuous lipreading based on acoustic temporal alignments. 25
- Otto Mikkonen, Alec Wright, Vesa Välimäki: Sampling the user controls in neural modeling of audio devices. 26
- Joanna Luberadzka, Hendrik Kayser, Jörg Lücke, Volker Hohmann: Towards multidimensional attentive voice tracking - estimating voice state from auditory glimpses with regression neural networks and Monte Carlo sampling. 27
- Zhiyong Chen, Zhiqi Ai, Youxuan Ma, Xinnuo Li, Shugong Xu: Optimizing feature fusion for improved zero-shot adaptation in text-to-speech synthesis. 28
- Yunpeng Liu, Xukui Yang, Dan Qu: Exploration of Whisper fine-tuning strategies for low-resource ASR. 29
- Jeremiah Abimbola, Daniel Kostrzewa, Pawel Kasprowski: Music time signature detection using ResNet18. 30
- Marcin Lewandowski: Estimating the first and second derivatives of discrete audio data. 31
- Adam Kujawski, Art J. R. Pelling, Ennes Sarradj: MIRACLE - a microphone array impulse response dataset for acoustic learning. 32
- Shaik Sajiha, Kodali Radha, Dhulipalla Venkata Rao, Nammi Sneha, Gunnam Suryanarayana, Durga Prasad Bavirisetti: Automatic dysarthria detection and severity level assessment using CWT-layered CNN model. 33
- Mengzhen Ma, Ying Hu, Liang He, Hao Huang: GLFER-Net: a polyphonic sound source localization and detection network based on global-local feature extraction and recalibration. 34
- Tahira Kanwal, Rabbia Mahum, AbdulMalik Al-Salman, Mohamed Sharaf, Haseeb Hassan: Fake speech detection using VGGish with attention block. 35
- Xin Feng, Yue Zhao, Wei Zong, Xiaona Xu: Adaptive multi-task learning for speech to text translation. 36
- Yigang Liu, Yue Zhao, Xiaona Xu, Liang Xu, Xubei Zhang, Qiang Ji: Exploring task-diverse meta-learning on Tibetan multi-dialect speech recognition. 37
- Samuel Poirot, Stefan Bilbao, Richard Kronland-Martinet: A simplified and controllable model of mode coupling for addressing nonlinear phenomena in sound synthesis processes. 38
- Ryosuke Sawata, Naoya Takahashi, Stefan Uhlich, Shusuke Takahashi, Yuki Mitsufuji: The whole is greater than the sum of its parts: improving music source separation by bridging networks. 39
- Daiki Mori, Kengo Ohta, Ryota Nishimura, Atsunori Ogawa, Norihide Kitaoka: Recognition of target domain Japanese speech using language model replacement. 40
- Samuel A. Verburg, Filip Elvander, Toon van Waterschoot, Efren Fernandez-Grande: Optimal sensor placement for the spatial reconstruction of sound fields. 41
- Marco Olivieri, Xenofon Karakonstantis, Mirco Pezzoli, Fabio Antonacci, Augusto Sarti, Efren Fernandez-Grande: Physics-informed neural network for volumetric sound field reconstruction of speech signals. 42
- Juliano G. C. Ribeiro, Shoichi Koyama, Hiroshi Saruwatari: Physics-constrained adaptive kernel interpolation for region-to-region acoustic transfer function: a Bayesian approach. 43
- Zijin Li, Wenwu Wang, Kejun Zhang, Mengyao Zhu: Guest editorial: AI for computational audition - sound and music processing. 44
- Martin Jälmby, Filip Elvander, Toon van Waterschoot: Compression of room impulse responses for compact storage and fast low-latency convolution. 45
- Yuma Kinoshita, Nobutaka Ono: End-to-end training of acoustic scene classification using distributed sound-to-light conversion devices: verification through simulation experiments. 46
- Xiao Zeng, Shiyun Xu, Mingjiang Wang: A time-frequency fusion model for multi-channel speech enhancement. 47
- Chaoyang Zhang, Yan Hua: Dance2Music-Diffusion: leveraging latent diffusion models for music generation from dance videos. 48
- Stefano Damiano, Luca Bondi, Andre Guntoro, Toon van Waterschoot: A framework for the acoustic simulation of passing vehicles using variable length delay lines. 49