


Wenwu Wang 0001
Person information

- affiliation: University of Surrey, Guildford, UK
Other persons with the same name
- Wenwu Wang 0002: Qufu Normal University, Qufu, Shandong, China
- Wenwu Wang 0003: Xidian University, Xi'an, China
- Wenwu Wang 0004: Wuhan University, Wuhan, China
- Wenwu Wang 0005: Harbin Institute of Technology, Harbin, China
- Wenwu Wang 0006: Institute of Microelectronics, Chinese Academy of Sciences, Beijing, China
- Wenwu Wang 0007: Sichuan University, School of Mechanical Engineering, Chengdu, China
- Wenwu Wang 0008: Wuhan University of Science and Technology, School of Information Science and Engineering, China
2020 – today
- 2024
- [j103]Yuanbo Hou, Bo Kang, Andrew Mitchell, Wenwu Wang, Jian Kang, Dick Botteldooren:
Cooperative Scene-Event Modelling for Acoustic Scene Classification. IEEE ACM Trans. Audio Speech Lang. Process. 32: 68-82 (2024)
- 2023
- [j102]Mukunthan Tharmakulasingam, Wenwu Wang, Michael Kerby, Roberto La Ragione, Anil Fernando:
TransAMR: An Interpretable Transformer Model for Accurate Prediction of Antimicrobial Resistance Using Antibiotic Administration Data. IEEE Access 11: 75337-75350 (2023)
- [j101]Jian Guan, Youde Liu, Qiuqiang Kong, Feiyang Xiao, Qiaoxi Zhu, Jiantong Tian, Wenwu Wang:
Transformer-based autoencoder with ID constraint for unsupervised anomalous sound detection. EURASIP J. Audio Speech Music. Process. 2023(1): 42 (2023)
- [j100]Yina Guo, Ting Liu, Xiaofei Zhang, Anhong Wang, Wenwu Wang:
End-to-end translation of human neural activity to speech with a dual-dual generative adversarial network. Knowl. Based Syst. 277: 110837 (2023)
- [j99]Jing Dong, Kai Wu, Chang Liu, Xue Mei, Wenwu Wang:
Discriminative analysis dictionary learning with adaptively ordinal locality preserving. Neural Networks 165: 298-309 (2023)
- [j98]Liming Shi, Xinheng Wang, Limin Yu, Wenwu Wang, Zhi Wang, Muddesar Iqbal, Charalampos C. Tsimenidis, Shahid Mumtaz:
A long-range aerial acoustic communication scheme. Phys. Commun. 60: 102135 (2023)
- [j97]Feiyang Xiao, Jian Guan, Qiaoxi Zhu, Wenwu Wang:
Graph Attention for Automated Audio Captioning. IEEE Signal Process. Lett. 30: 413-417 (2023)
- [j96]Yuanbo Hou, Siyang Song, Chuang Yu, Wenwu Wang, Dick Botteldooren:
Audio Event-Relational Graph Representation Learning for Acoustic Scene Classification. IEEE Signal Process. Lett. 30: 1382-1386 (2023)
- [j95]Shidrokh Goudarzi, Seyed Ahmad Soleymani, Wenwu Wang, Pei Xiao:
UAV-Enabled Mobile Edge Computing for Resource Allocation Using Cooperative Evolutionary Computation. IEEE Trans. Aerosp. Electron. Syst. 59(5): 5134-5147 (2023)
- [j94]Yi Li, Yang Sun, Wenwu Wang, Syed Mohsen Naqvi:
U-Shaped Transformer With Frequency-Band Aware Attention for Speech Enhancement. IEEE ACM Trans. Audio Speech Lang. Process. 31: 1511-1521 (2023)
- [j93]Yiming Zhang, Hong Yu, Ruoyi Du, Zheng-Hua Tan, Wenwu Wang, Zhanyu Ma, Yuan Dong:
ACTUAL: Audio Captioning With Caption Feature Space Regularization. IEEE ACM Trans. Audio Speech Lang. Process. 31: 2643-2657 (2023)
- [j92]Weitao Yuan, Shengbei Wang, Jianming Wang, Masashi Unoki, Wenwu Wang:
Unsupervised Deep Unfolded Representation Learning for Singing Voice Separation. IEEE ACM Trans. Audio Speech Lang. Process. 31: 3206-3220 (2023)
- [j91]Cheng Xue, Xionghu Zhong, Minjie Cai, Hao Chen, Wenwu Wang:
Audio-Visual Event Localization by Learning Spatial and Semantic Co-Attention. IEEE Trans. Multim. 25: 418-429 (2023)
- [c185]Qiushi Huang, Yu Zhang, Tom Ko, Xubo Liu, Bo Wu, Wenwu Wang, H. Lilian Tang:
Personalized Dialogue Generation with Persona-Adaptive Attention. AAAI 2023: 12916-12923
- [c184]Özkan Çayli, Xubo Liu, Volkan Kiliç, Wenwu Wang:
Knowledge Distillation for Efficient Audio-Visual Video Captioning. EUSIPCO 2023: 745-749
- [c183]Feiyang Xiao, Qiaoxi Zhu, Jian Guan, Wenwu Wang:
Enhancing Audio Retrieval with Attention-based Encoder for Audio Feature Representation. EUSIPCO 2023: 755-759
- [c182]Yi Yuan, Haohe Liu, Jinhua Liang, Xubo Liu, Mark D. Plumbley, Wenwu Wang:
Leveraging Pre-Trained AudioLDM for Sound Generation: A Benchmark Study. EUSIPCO 2023: 765-769
- [c181]Xinyuan Zhou, Shiyong Lan, Wenwu Wang, Xinyang Li, Siyuan Zhou, Hongyu Yang:
Visual-Haptic-Kinesthetic Object Recognition with Multimodal Transformer. ICANN (7) 2023: 233-245
- [c180]Jian Guan, Youde Liu, Qiaoxi Zhu, Tieran Zheng, Jiqing Han, Wenwu Wang:
Time-Weighted Frequency Domain Audio Representation with GMM Estimator for Anomalous Sound Detection. ICASSP 2023: 1-5
- [c179]Jian Guan, Feiyang Xiao, Youde Liu, Qiaoxi Zhu, Wenwu Wang:
Anomalous Sound Detection Using Audio Representation with Machine ID Based Contrastive Learning Pretraining. ICASSP 2023: 1-5
- [c178]Yuanbo Hou, Yun Wang, Wenwu Wang, Dick Botteldooren:
GCT: Gated Contextual Transformer for Sequential Audio Tagging. ICASSP 2023: 1-5
- [c177]Xubo Liu, Haohe Liu, Qiuqiang Kong, Xinhao Mei, Mark D. Plumbley, Wenwu Wang:
Simple Pooling Front-Ends for Efficient Audio Classification. ICASSP 2023: 1-5
- [c176]Weitao Yuan, Yuren Bian, Shengbei Wang, Masashi Unoki, Wenwu Wang:
An Improved Optimal Transport Kernel Embedding Method with Gating Mechanism for Singing Voice Separation and Speaker Identification. ICASSP 2023: 1-5
- [c175]Xiaoxiao Yin, Shiyong Lan, Weikang Huang, Yitong Ma, Wenwu Wang, Hongyu Yang, Yilin Zheng:
DLAHSD: Dynamic Label Adopted In Auxiliary Head for SAR Detection. ICIP 2023: 3434-3438
- [c174]Wei Ma, Shiyong Lan, Weikang Huang, Wenwu Wang, Hongyu Yang, Yitong Ma, Yongjie Ma:
A Semantics-Aware Normalizing Flow Model for Anomaly Detection. ICME 2023: 2207-2212
- [c173]Haohe Liu, Zehua Chen, Yi Yuan, Xinhao Mei, Xubo Liu, Danilo P. Mandic, Wenwu Wang, Mark D. Plumbley:
AudioLDM: Text-to-Audio Generation with Latent Diffusion Models. ICML 2023: 21450-21474
- [c172]Wenhan Li, Xiongjie Chen, Wenwu Wang, Víctor Elvira, Yunpeng Li:
Differentiable Bootstrap Particle Filters for Regime-Switching Models. SSP 2023: 200-204
- [i97]Haohe Liu, Zehua Chen, Yi Yuan, Xinhao Mei, Xubo Liu, Danilo P. Mandic, Wenwu Wang, Mark D. Plumbley:
AudioLDM: Text-to-Audio Generation with Latent Diffusion Models. CoRR abs/2301.12503 (2023)
- [i96]Wenhan Li, Xiongjie Chen, Wenwu Wang, Víctor Elvira, Yunpeng Li:
Differentiable Bootstrap Particle Filters for Regime-Switching Models. CoRR abs/2302.10319 (2023)
- [i95]Yi Yuan, Haohe Liu, Jinhua Liang, Xubo Liu, Mark D. Plumbley, Wenwu Wang:
Leveraging Pre-trained AudioLDM for Text to Sound Generation: A Benchmark Study. CoRR abs/2303.03857 (2023)
- [i94]Xinhao Mei, Chutong Meng, Haohe Liu, Qiuqiang Kong, Tom Ko, Chengqi Zhao, Mark D. Plumbley, Yuexian Zou, Wenwu Wang:
WavCaps: A ChatGPT-Assisted Weakly-Labelled Audio Captioning Dataset for Audio-Language Multimodal Research. CoRR abs/2303.17395 (2023)
- [i93]Feiyang Xiao, Jian Guan, Qiaoxi Zhu, Wenwu Wang:
Graph Attention for Automated Audio Captioning. CoRR abs/2304.03586 (2023)
- [i92]Jian Guan, Feiyang Xiao, Youde Liu, Qiaoxi Zhu, Wenwu Wang:
Anomalous Sound Detection using Audio Representation with Machine ID based Contrastive Learning Pretraining. CoRR abs/2304.03588 (2023)
- [i91]Jian Guan, Youde Liu, Qiaoxi Zhu, Tieran Zheng, Jiqing Han, Wenwu Wang:
Time-weighted Frequency Domain Audio Representation with GMM Estimator for Anomalous Sound Detection. CoRR abs/2305.03328 (2023)
- [i90]Yi Yuan, Haohe Liu, Xubo Liu, Xiyuan Kang, Mark D. Plumbley, Wenwu Wang:
Latent Diffusion Model Based Foley Sound Generation System For DCASE Challenge 2023 Task 7. CoRR abs/2305.15905 (2023)
- [i89]Jinhua Liang, Xubo Liu, Haohe Liu, Huy Phan, Emmanouil Benetos, Mark D. Plumbley, Wenwu Wang:
Adapting Language-Audio Models as Few-Shot Audio Learners. CoRR abs/2305.17719 (2023)
- [i88]Jianyuan Sun, Xubo Liu, Xinhao Mei, Volkan Kiliç, Mark D. Plumbley, Wenwu Wang:
Dual Transformer Decoder based Features Fusion Network for Automated Audio Captioning. CoRR abs/2305.18753 (2023)
- [i87]Yi Yuan, Haohe Liu, Xubo Liu, Xiyuan Kang, Peipei Wu, Mark D. Plumbley, Wenwu Wang:
Text-Driven Foley Sound Generation With Latent Diffusion Model. CoRR abs/2306.10359 (2023)
- [i86]Xubo Liu, Zhongkai Zhu, Haohe Liu, Yi Yuan, Meng Cui, Qiushi Huang, Jinhua Liang, Yin Cao, Qiuqiang Kong, Mark D. Plumbley, Wenwu Wang:
WavJourney: Compositional Audio Creation with Large Language Models. CoRR abs/2307.14335 (2023)
- [i85]Xubo Liu, Qiuqiang Kong, Yan Zhao, Haohe Liu, Yi Yuan, Yuzhuo Liu, Rui Xia, Yuxuan Wang, Mark D. Plumbley, Wenwu Wang:
Separate Anything You Describe. CoRR abs/2308.05037 (2023)
- [i84]Haohe Liu, Qiao Tian, Yi Yuan, Xubo Liu, Xinhao Mei, Qiuqiang Kong, Yuping Wang, Wenwu Wang, Yuxuan Wang, Mark D. Plumbley:
AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining. CoRR abs/2308.05734 (2023)
- [i83]Jinbo Hu, Yin Cao, Ming Wu, Feiran Yang, Ziying Yu, Wenwu Wang, Mark D. Plumbley, Jun Yang:
META-SELD: Meta-Learning for Fast Adaptation to the new environment in Sound Event Localization and Detection. CoRR abs/2308.08847 (2023)
- [i82]Yuanbo Hou, Siyang Song, Cheng Luo, Andrew Mitchell, Qiaoqiao Ren, Weicheng Xie, Jian Kang, Wenwu Wang, Dick Botteldooren:
Joint Prediction of Audio Event and Annoyance Rating in an Urban Soundscape by Hierarchical Graph Representation Learning. CoRR abs/2308.11980 (2023)
- [i81]Meng Cui, Xubo Liu, Haohe Liu, Zhuangzhuang Du, Tao Chen, Guoping Lian, Daoliang Li, Wenwu Wang:
Multimodal Fish Feeding Intensity Assessment in Aquaculture. CoRR abs/2309.05058 (2023)
- [i80]Haohe Liu, Ke Chen, Qiao Tian, Wenwu Wang, Mark D. Plumbley:
AudioSR: Versatile Audio Super-resolution at Scale. CoRR abs/2309.07314 (2023)
- [i79]Haiyan Lan, Qiaoxi Zhu, Jian Guan, Yuming Wei, Wenwu Wang:
Hierarchical Metadata Information Constrained Self-Supervised Learning for Anomalous Sound Detection Under Domain Shift. CoRR abs/2309.07498 (2023)
- [i78]Yi Yuan, Haohe Liu, Xubo Liu, Qiushi Huang, Mark D. Plumbley, Wenwu Wang:
Retrieval-Augmented Text-to-Audio Generation. CoRR abs/2309.08051 (2023)
- [i77]Feiyang Xiao, Qiaoxi Zhu, Jian Guan, Xubo Liu, Haohe Liu, Kejia Zhang, Wenwu Wang:
Synth-AC: Enhancing Audio Captioning with Synthetic Supervision. CoRR abs/2309.09705 (2023)
- [i76]Jinzheng Zhao, Yong Xu, Xinyuan Qian, Wenwu Wang:
Audio Visual Speaker Localization from EgoCentric Views. CoRR abs/2309.16308 (2023)
- [i75]Yuanbo Hou, Siyang Song, Chuang Yu, Wenwu Wang, Dick Botteldooren:
Audio Event-Relational Graph Representation Learning for Acoustic Scene Classification. CoRR abs/2310.03889 (2023)
- [i74]Yaru Chen, Ruohao Guo, Xubo Liu, Peipei Wu, Guangyao Li, Zhenbo Li, Wenwu Wang:
CM-PIE: Cross-modal perception for interactive-enhanced audio-visual video parsing. CoRR abs/2310.07517 (2023)
- [i73]Jian Guan, Youde Liu, Qiuqiang Kong, Feiyang Xiao, Qiaoxi Zhu, Jiantong Tian, Wenwu Wang:
Transformer-based Autoencoder with ID Constraint for Unsupervised Anomalous Sound Detection. CoRR abs/2310.08950 (2023)
- [i72]Hejing Zhang, Qiaoxi Zhu, Jian Guan, Haohe Liu, Feiyang Xiao, Jiantong Tian, Xinhao Mei, Xubo Liu, Wenwu Wang:
First-Shot Unsupervised Anomalous Sound Detection With Unknown Anomalies Estimated by Metadata-Assisted Audio Generation. CoRR abs/2310.14173 (2023)
- [i71]Jinzheng Zhao, Yong Xu, Xinyuan Qian, Davide Berghi, Peipei Wu, Meng Cui, Jianyuan Sun, Philip J. B. Jackson, Wenwu Wang:
Audio-Visual Speaker Tracking: Progress, Challenges, and Future Directions. CoRR abs/2310.14778 (2023)
- 2022
- [j90]Ting Liu, Wenwu Wang, Xiaofei Zhang, Yina Guo:
One to multiple mapping dual learning: Learning multiple signals from one mixture. Digit. Signal Process. 129: 103686 (2022)
- [j89]Haitao Li, Shuguo Yang, Wenwu Wang:
Improved capsule routing for weakly labeled sound event detection. EURASIP J. Audio Speech Music. Process. 2022(1): 5 (2022)
- [j88]Xinhao Mei, Xubo Liu, Mark D. Plumbley, Wenwu Wang:
Automated audio captioning: an overview of recent progress and new challenges. EURASIP J. Audio Speech Music. Process. 2022(1): 26 (2022)
- [j87]Jian Guan, Jiabei Liu, Pengming Feng, Wenwu Wang:
Multiscale Deep Neural Network With Two-Stage Loss for SAR Target Recognition With Small Training Set. IEEE Geosci. Remote. Sens. Lett. 19: 1-5 (2022)
- [j86]Jing Dong, Liu Yang, Chang Liu, Wei Cheng, Wenwu Wang:
Support vector machine embedding discriminative dictionary pair learning for pattern classification. Neural Networks 155: 498-511 (2022)
- [j85]Lin Dong, Jifeng Qi, Baoshu Yin, Hai Zhi, Delei Li, Shuguo Yang, Wenwu Wang, Hong Cai, Bowen Xie:
Reconstruction of Subsurface Salinity Structure in the South China Sea Using Satellite Observations: A LightGBM-Based Deep Forest Method. Remote. Sens. 14(14): 3494 (2022)
- [j84]Arash Shilandari, Hossein Marvi, Hossein Khosravi, Wenwu Wang:
Speech emotion recognition using data augmentation method by cycle-generative adversarial networks. Signal Image Video Process. 16(7): 1955-1962 (2022)
- [j83]Feiyang Xiao, Jian Guan, Haiyan Lan, Qiaoxi Zhu, Wenwu Wang:
Local Information Assisted Attention-Free Decoder for Audio Captioning. IEEE Signal Process. Lett. 29: 1604-1608 (2022)
- [j82]Kunkun SongGong, Wenwu Wang, Huawei Chen:
Acoustic Source Localization in the Circular Harmonic Domain Using Deep Learning Architecture. IEEE ACM Trans. Audio Speech Lang. Process. 30: 2475-2491 (2022)
- [c171]Haohe Liu, Xubo Liu, Xinhao Mei, Qiuqiang Kong, Wenwu Wang, Mark D. Plumbley:
Segment-Level Metric Learning for Few-Shot Bioacoustic Event Detection. DCASE 2022
- [c170]Yang Xiao, Xubo Liu, James A. King, Arshdeep Singh, Eng Siong Chng, Mark D. Plumbley, Wenwu Wang:
Continual Learning for On-Device Environmental Sound Classification. DCASE 2022
- [c169]Dongchao Yang, Helin Wang, Wenwu Wang, Yuexian Zou:
A Mixed Supervised Learning Framework For Target Sound Detection. DCASE 2022
- [c168]Jianyuan Sun, Xubo Liu, Xinhao Mei, Jinzheng Zhao, Mark D. Plumbley, Volkan Kiliç, Wenwu Wang:
Deep Neural Decision Forest for Acoustic Scene Classification. EUSIPCO 2022: 772-776
- [c167]Jinzheng Zhao, Peipei Wu, Shidrokh Goudarzi, Xubo Liu, Jianyuan Sun, Yong Xu, Wenwu Wang:
Visually Assisted Self-supervised Audio Speaker Localization and Tracking. EUSIPCO 2022: 787-791
- [c166]Özkan Çayli, Volkan Kiliç, Aytug Onan, Wenwu Wang:
Auxiliary Classifier based Residual RNN for Image Captioning. EUSIPCO 2022: 1126-1130
- [c165]Xubo Liu, Xinhao Mei, Qiushi Huang, Jianyuan Sun, Jinzheng Zhao, Haohe Liu, Mark D. Plumbley, Volkan Kilic, Wenwu Wang:
Leveraging Pre-trained BERT for Audio Captioning. EUSIPCO 2022: 1145-1149
- [c164]Özge Taylan Moral, Volkan Kiliç, Aytug Onan, Wenwu Wang:
Automated Image Captioning with Multi-layer Gated Recurrent Unit. EUSIPCO 2022: 1160-1164
- [c163]Wenbo Wang, Jian Guan, Xinyi Che, Wenwu Wang:
MS-MLP: Multi-scale Sampling MLP for ECG Classification. EUSIPCO 2022: 1288-1292
- [c162]Shidrokh Goudarzi, Wenwu Wang, Pei Xiao, Lyudmila Mihaylova, Simon J. Godsill:
UAV-enabled Edge Computing for Optimal Task Distribution in Target Tracking. FUSION 2022: 1-7
- [c161]Tassadaq Hussain, Wenwu Wang, Nidhal Bouaynaya, Hassan M. Fathallah-Shaykh, Lyudmila Mihaylova:
Deep Learning for Audio Visual Emotion Recognition. FUSION 2022: 1-8
- [c160]Weikang Huang, Shiyong Lan, Wenwu Wang, Xuedong Yuan, Hongyu Yang, Piaoyang Li, Wei Ma:
Face Super-Resolution with Spatial Attention Guided by Multiscale Receptive-Field Features. ICANN (1) 2022: 145-157
- [c159]Caiyin Yang, Shiyong Lan, Weikang Huang, Wenwu Wang, Guoliang Liu, Hongyu Yang, Wei Ma, Piaoyang Li:
A Transformer-Based GAN for Anomaly Detection. ICANN (2) 2022: 345-357
- [c158]Dongchao Yang, Helin Wang, Yuexian Zou, Zhongjie Ye, Wenwu Wang:
A Mutual Learning Framework for Few-Shot Sound Event Detection. ICASSP 2022: 811-815
- [c157]Youde Liu, Jian Guan, Qiaoxi Zhu, Wenwu Wang:
Anomalous Sound Detection Using Spectral-Temporal Information Fusion. ICASSP 2022: 816-820
- [c156]Jinzheng Zhao, Peipei Wu, Xubo Liu, Yong Xu, Lyudmila Mihaylova, Simon J. Godsill, Wenwu Wang:
Audio-Visual Tracking of Multiple Speakers Via a PMBM Filter. ICASSP 2022: 5068-5072
- [c155]Peipei Wu, Jinzheng Zhao, Shidrokh Goudarzi, Wenwu Wang:
Partial Arithmetic Consensus based Distributed Intensity Particle Flow SMC-PHD Filter for Multi-Target Tracking. ICASSP 2022: 5078-5082
- [c154]Xinhao Mei, Xubo Liu, Jianyuan Sun, Mark D. Plumbley, Wenwu Wang:
Diverse Audio Captioning Via Adversarial Training. ICASSP 2022: 8882-8886
- [c153]Shiyong Lan, Yitong Ma, Weikang Huang, Wenwu Wang, Hongyu Yang, Pyang Li:
DSTAGNN: Dynamic Spatial-Temporal Aware Graph Neural Network for Traffic Flow Forecasting. ICML 2022: 11906-11917
- [c152]Dongchao Yang, Helin Wang, Zhongjie Ye, Yuexian Zou, Wenwu Wang:
RaDur: A Reference-aware and Duration-robust Network for Target Sound Detection. INTERSPEECH 2022: 1511-1515
- [c151]Xubo Liu, Haohe Liu, Qiuqiang Kong, Xinhao Mei, Jinzheng Zhao, Qiushi Huang, Mark D. Plumbley, Wenwu Wang:
Separate What You Describe: Language-Queried Audio Source Separation. INTERSPEECH 2022: 1801-1805
- [c150]Jinzheng Zhao, Peipei Wu, Xubo Liu, Shidrokh Goudarzi, Haohe Liu, Yong Xu, Wenwu Wang:
Audio Visual Multi-Speaker Tracking with Improved GCF and PMBM Filter. INTERSPEECH 2022: 3704-3708
- [c149]Xinhao Mei, Xubo Liu, Jianyuan Sun, Mark D. Plumbley, Wenwu Wang:
On Metric Learning for Audio-Text Cross-Modal Retrieval. INTERSPEECH 2022: 4142-4146
- [c148]Meng Cui, Xubo Liu, Jinzheng Zhao, Jianyuan Sun, Guoping Lian, Tao Chen, Mark D. Plumbley, Daoliang Li, Wenwu Wang:
Fish Feeding Intensity Assessment in Aquaculture: A New Audio Dataset AFFIA3K and a Deep Learning Algorithm. MLSP 2022: 1-6
- [c147]Buddhiprabha Erabadda, Gosala Kulupana, Thanuja Mallikarachchi, Wenwu Wang, Anil Fernando:
A Hybrid Approach to Blind Video Quality Prediction of User Generated Content. PCS 2022: 307-311
- [i70]Feiyang Xiao, Jian Guan, Qiaoxi Zhu, Haiyan Lan, Wenwu Wang:
Local Information Assisted Attention-free Decoder for Audio Captioning. CoRR abs/2201.03217 (2022)
- [i69]Youde Liu, Jian Guan, Qiaoxi Zhu, Wenwu Wang:
Anomalous Sound Detection using Spectral-Temporal Information Fusion. CoRR abs/2201.05510 (2022)
- [i68]Xubo Liu, Xinhao Mei, Qiushi Huang, Jianyuan Sun, Jinzheng Zhao, Haohe Liu, Mark D. Plumbley, Volkan Kiliç, Wenwu Wang:
Leveraging Pre-trained BERT for Audio Captioning. CoRR abs/2203.02838 (2022)
- [i67]Jianyuan Sun, Xubo Liu, Xinhao Mei, Jinzheng Zhao, Mark D. Plumbley, Volkan Kiliç, Wenwu Wang:
Deep Neural Decision Forest for Acoustic Scene Classification. CoRR abs/2203.03436 (2022)
- [i66]Xubo Liu, Haohe Liu, Qiuqiang Kong, Xinhao Mei, Jinzheng Zhao, Qiushi Huang, Mark D. Plumbley, Wenwu Wang:
Separate What You Describe: Language-Queried Audio Source Separation. CoRR abs/2203.15147 (2022)
- [i65]Xinhao Mei, Xubo Liu, Jianyuan Sun, Mark D. Plumbley, Wenwu Wang:
On Metric Learning for Audio-Text Cross-Modal Retrieval. CoRR abs/2203.15537 (2022)
- [i64]Dongchao Yang, Helin Wang, Yuexian Zou, Wenwu Wang:
A Two-student Learning Framework for Mixed Supervised Target Sound Detection. CoRR abs/2204.02088 (2022)
- [i63]Dongchao Yang, Helin Wang, Zhongjie Ye, Yuexian Zou, Wenwu Wang:
RaDur: A Reference-aware and Duration-robust Network for Target Sound Detection. CoRR abs/2204.02143 (2022)
- [i62]Xinhao Mei, Xubo Liu, Mark D. Plumbley, Wenwu Wang:
Automated Audio Captioning: an Overview of Recent Progress and New Challenges. CoRR abs/2205.05949 (2022)
- [i61]Yang Xiao, Xubo Liu, James A. King, Arshdeep Singh, Eng Siong Chng, Mark D. Plumbley, Wenwu Wang:
Continual Learning For On-Device Environmental Sound Classification. CoRR abs/2207.07429 (2022)
- [i60]Haohe Liu, Xubo Liu, Xinhao Mei, Qiuqiang Kong, Wenwu Wang, Mark D. Plumbley:
Segment-level Metric Learning for Few-shot Bioacoustic Event Detection. CoRR abs/2207.07773 (2022)
- [i59]Haohe Liu, Xubo Liu, Xinhao Mei, Qiuqiang Kong, Wenwu Wang, Mark D. Plumbley:
Surrey System for DCASE 2022 Task 5: Few-shot Bioacoustic Event Detection with Segment-level Metric Learning. CoRR abs/2207.10547 (2022)
- [i58]Arshdeep Singh, James A. King, Xubo Liu, Wenwu Wang, Mark D. Plumbley:
Low-complexity CNNs for Acoustic Scene Classification. CoRR abs/2208.01555 (2022)
- [i57]Xubo Liu, Haohe Liu, Qiuqiang Kong, Xinhao Mei, Mark D. Plumbley, Wenwu Wang:
Simple Pooling Front-ends For Efficient Audio Classification. CoRR abs/2210.00943 (2022)
- [i56]Haohe Liu, Xubo Liu, Qiuqiang Kong, Wenwu Wang, Mark D. Plumbley:
Learning the Spectrogram Temporal Resolution for Audio Classification. CoRR abs/2210.01719 (2022)
- [i55]Jianyuan Sun, Xubo Liu, Xinhao Mei, Mark D. Plumbley, Volkan Kilic, Wenwu Wang:
Automated Audio Captioning via Fusion of Low- and High- Dimensional Features. CoRR abs/2210.05037 (2022)
- [i54]Yuanbo Hou, Yun Wang, Wenwu Wang, Dick Botteldooren:
GCT: Gated Contextual Transformer for Sequential Audio Tagging. CoRR abs/2210.12541 (2022)
- [i53]Qiushi Huang, Yu Zhang, Tom Ko, Xubo Liu, Bo Wu, Wenwu Wang, H. Lilian Tang:
Personalized Dialogue Generation with Persona-Adaptive Attention. CoRR abs/2210.15088 (2022)
- [i52]Yuanbo Hou, Siyang Song, Chuang Yu, Yuxin Song, Wenwu Wang, Dick Botteldooren:
Multi-dimensional Edge-based Audio Event Relational Graph Representation Learning for Acoustic Scene Classification. CoRR abs/2210.15366 (2022)
- [i51]Xubo Liu, Qiushi Huang, Xinhao Mei, Haohe Liu, Qiuqiang Kong, Jianyuan Sun, Shengchen Li, Tom Ko, Yu Zhang, H. Lilian Tang, Mark D. Plumbley, Volkan Kiliç, Wenwu Wang:
Visually-Aware Audio Captioning With Adaptive Audio-Visual Attention. CoRR abs/2210.16428 (2022)
- [i50]Haohe Liu, Qiuqiang Kong, Xubo Liu, Xinhao Mei, Wenwu Wang, Mark D. Plumbley:
Ontology-aware Learning and Evaluation for Audio Tagging. CoRR abs/2211.12195 (2022)
- [i49]