


default search action
ICASSP 2025: Hyderabad, India
- 2025 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2025, Hyderabad, India, April 6-11, 2025. IEEE 2025, ISBN 979-8-3503-6875-8

- Adam Misik, Driton Salihu, Xin Su, Heike Brock, Eckehard G. Steinbach:

HypCAD: Geometry-Enhanced Hyperbolic Contrastive Learning for CAD Model Retrieval. 1-5 - Xuege Hou, Yali Li, Shengjin Wang:

Attention Augmented Structure-centric Bias Mitigation with Feature Disentanglement. 1-5 - Yuwu Lu, Yihan Yang:

Context-Guided Active Domain Adaptation for Blended Target Domain. 1-5 - Jianqi Gao, Jian Cao, Shiyou Qian, Wei Guan:

Enhancing Graph-based Fraud Detection by Adversarial Confidence Reweighting. 1-5 - Hsiang-Chun Yu, Jen-Tzung Chien

:
Attention Disentanglement for Semantic Diffusion Modeling in Text-to-Image Generation. 1-5 - Maixuan Peng, Yuyang Wu, Yang Lu, Mengke Li, Yiqun Zhang, Yiu-Ming Cheung:

Weighted Density for The Win: Accurate Subspace Density Clustering. 1-5 - Jong-Ik Park, Carlee Joe-Wong:

FedTLU: Federated Learning with Targeted Layer Updates. 1-5 - Gongyu Chen, Haomin Zhang, Chaofan Ding, Zihao Chen, Xinhan Di:

Multiple Consistency-guided Test-Time Adaptation for Contrastive Audio-Language Models with Unlabeled Audio. 1-5 - Liyao Wang, Zuzeng Lin, Danni Wu, Zihao Yu, Suzhe Zhang, Zixian Wu, Feng Wang:

Animation Anycolor: Enhancing Line Drawing Colorization with Keypoint Matching. 1-5 - Changzeng Fu, Zelin Fu

, Shaojun Yan, Xiaoyong Lyu, Yuliang Zhao:
In-Context Multitask Learning for Few-shot Fine-tuning of Large Language Models in Traditional Chinese Medicine Tongue Diagnosis. 1-5 - Shuvayan Banerjee, Sudhansh Peddabomma, Radhendushka Srivastava

, James Saunderson, Ajit Rajwade:
Identification and Correction of Permutation Errors in Compressed Sensing-Based Group Testing. 1-5 - Zeyu Wang

, Chen Li, Huiying Xu, Xinzhong Zhu, Xiao Huang, Hongbo Li:
RestorMamba: An Enhanced Synergistic State Space Model for Image Restoration. 1-5 - Dian Huang

, Jianqi Gao, Xiangfeng Luo, Hao Wu
:
Improving Knowledge Base Question Answering via Retrieval Enhancement and Stepwise Reasoning. 1-5 - Zeyu Wang

, Chen Li, Huiying Xu, Xinzhong Zhu, Xiao Huang, Hongbo Li:
MambaInst: Lightweight State Space Model for Real-Time Instance Segmentation. 1-5 - Sen Feng, Mingjie Zhao, Zhanpei Huang, Yuzhu Ji, Yiqun Zhang, Yiu-Ming Cheung:

Robust Qualitative Data Clustering via Learnable Multi-Metric Space Fusion. 1-5 - Amartyaveer

, Saurabh Kumar, Sumit Sharma, Sathvik Udupa, Sandhya Badiger, Abhayjeet Singh, Deekshitha G, Jesuraja Bandekar, Savitha Murthy, Prasanta Kumar Ghosh:
Improving Dialect Identification in Indian Languages Using Multimodal Features from Dialect Informed ASR. 1-5 - Ziyi Li, Wei-Long Zheng, Bao-Liang Lu:

Gram: A Large-Scale General EEG Model for Raw Data Classification and Restoration Tasks. 1-5 - Tingyu Zhao, Bo Peng, Zhenguang Zhang, Daipeng Yang, Xi Wu:

Content-Aware Dynamic Superpixel Segmentation. 1-5 - Yu Liang, Sheng Zhang, Jie Wu:

Online Optimization of Offloading Video Analytics Tasks to Multiple Edges for Accuracy Maximization. 1-5 - Xiaoyan Ma, Peng Zhu, Qinyuan Liu, Zidong Wang:

A Risk Prediction Model for Real Estate Corporations Using High-Target Semantic BERT and Improved GRU. 1-5 - Junbin Zhuang, Jiajia Zhou, Yan Zheng, Yasheng Chang, Suleman Mazhar:

Multi-domain fusion network for underwater image enhancement. 1-5 - Yue Zhu, Haiwen Diao, Shang Gao, Long Chen, Huchuan Lu:

KARST: Multi-Kernel Kronecker Adaptation with Re-Scaling Transmission for Visual Classification. 1-5 - Minghui Chen, Chao Qu, Jiahui Pan:

Meta-MMD Fusion: Enhancing Cross-Subject Motor Imagery Classification. 1-5 - Chengwen Zhang, Yaohui Liu, Bo Cheng:

A MoE Multimodal Graph Attention Network Framework for Multimodal Emotion Recognition. 1-5 - Hongyi Li

, Jiawei Ye, Jie Wu, Lijun Zu:
Efficient and Expandable Token-Level Approach for Multi-Domain Sensitive Information Classification. 1-5 - Beiyuan Zhang, Yue Ma, Chunlei Fu, Xinyang Song, Zhenan Sun, Ziqiang Li:

Follow-Your-MultiPose: Tuning-Free Multi-Character Text-to-Video Generation via Pose Guidance. 1-5 - Sijia Li, Haonan Lou, Xu Zhang, Xin Zeng, Zhixuan Shen, Tianrui Li:

Role-Specific Reward Design with Large Language Model for StarCraft II. 1-5 - Die Hu

, Jingguo Ge, Weitao Tang
, Guoyi Li, Liangxiong Li, Bingzhen Wu:
WebSurfer: Enhancing LLM Agents with Web-Wise Feedback for Web Navigation. 1-5 - Wei Chen, Chen Li, Wenjuan Zhou, Yuhang Li, Tianhang Guo, Yuhua Tang:

Exploiting Foundation Models for Label-Efficient Few-Shot Learning via Feature Coupling: A Case Study of cardiac CT Segmentation. 1-5 - Zhaojun Guo, Guobiao Li, Junqiang Huang, Xinpeng Zhang, Zhenxing Qian, Sheng Li:

Filtering Resistant Large Language Model Watermarking via Style Injection. 1-5 - Runze Chen, Mingyu Xiao, Haiyong Luo, Fang Zhao, Fan Wu, Hao Xiong, Qi Liu, Meng Song:

CSS: Overcoming Pose and Scene Challenges in Crowd-Sourced 3D Gaussian Splatting. 1-5 - Ruiyuan Chen, Zhixin Li

, Han Zeng, Yifan Liu, Tao He, Tiecheng Song:
PDCE: Patch-wise Dynamic Curve Estimation for Low-Light Image Enhancement. 1-5 - Haoge Deng, Xin Dai, Jijin Hu, Yonggang Qi:

AnimateSketches: Animate Sketches with Instance-Aware Mask. 1-5 - Heng Gao, Zhuolin He, Jian Pu:

Detecting OOD Samples via Optimal Transport Scoring Function. 1-5 - Jin Li, Zitong Yu, Ziqiang He, Z. Jane Wang, Xiangui Kang:

PGD-Imp: Rethinking and Unleashing Potential of Classic PGD with Dual Strategies for Imperceptible Adversarial Attacks. 1-5 - Octavian Pascu, Dan Oneata, Horia Cucu, Nicolas M. Müller:

Easy, Interpretable, Effective: openSMILE for voice deepfake detection. 1-5 - Aron Bevelander, Kim Batselier, Nitin Jonathan Myers:

A Divide-and-conquer Approach for Sparse Recovery in High Dimensions. 1-5 - Kepei Zhang, Ge Tong, Xuetao Zhang:

Imitating Human Selective Attention Using Dual Policy Network for Scanpath Prediction. 1-5 - Shijing Si, Haixia Sun, Jiawen Gu:

GPPT: Gaussian Process-infused Prompt Tuning for Vision-language Models. 1-5 - Yiming Wang, Hongxi Wei, Heng Wang:

AS-Net: Adaptive Style-aware Network for Handwritten Text Generation. 1-5 - Yiming Wang, Hongxi Wei, Shiwen Sun:

SmartExp: An Adaptive Data Expansion Strategy for Improving Handwritten Text Recognition. 1-5 - Jianqi Gao, Jian Cao, Jinghua Tang:

Promoting PLM Fine-Tuning through Consistency Adversarial Training. 1-5 - Shugang Hao

, Lingjie Duan:
Algorithm Design for Continual Learning in IoT Networks. 1-5 - Jiazhen Chen, Mingbin Feng, Tony S. Wirjanto:

Harnessing Contrastive Learning and Neural Transformation for Time Series Anomaly Detection. 1-5 - Yongheng Zhang, Danfeng Yan:

Knowledge Distillation for Image Restoration : Simultaneous Learning from Degraded and Clean Images. 1-5 - Changzeng Fu, Yikai Su, Kaifeng Su, Le Yang, Peng Shan, Xiaoyong Lv, Yuliang Zhao:

Hierarchical Similarity Loss Enhanced Depth and Structural Fidelity in Monocular RGB-to-Depth Mapping with Adversarial Training. 1-5 - Hao Wang

, Li Xu, Yuntao Yu, Weiyue Ding, Yiming Xu:
Global Context MambaVision for EEG-based Emotion Recognition. 1-5 - Yunxin Mao, Haotian Wang, Yishuai Cai, Minglong Li, Ji Wang, Wenjing Yang:

Automated Exposure Mapping for Networked Interference. 1-5 - He Deng, Xiaojie Yin, Xianmin Lan:

BiMA: Bidimensional multi-level attention embedded network for single-frame infrared small target detection. 1-5 - Abbaas Alif Mohamed Nishar, Shrinivas Kudekar, Bernard Kintzing, Ashwin Ashok:

Revelio: A Real-World Screen-Camera Communication System with Visually Imperceptible Data Embedding. 1-5 - Liu Yu, Ludie Guo, Ping Kuang, Fan Zhou:

Bridging the Fairness Gap: Enhancing Pre-trained Models with LLM-Generated Sentences. 1-5 - Chenhao Lin, Yanjie Zhu, Yingmao Miao, Zhengyu Zhao, Shuai Liu, Chao Shen:

TGDrag: Adding Semantic Control into Point-based Image Editing via Text Guidance. 1-5 - Hongliang Zeng, Ping Zhang, Fang Li, Qinpeng Yi, Jiahua Wang, Tingyu Ye:

Active Visual Learning for Robots with Dueling Deep Q-Networks and Transformer Encoders. 1-5 - Haoran Liao, Jidong Tian, Shaohua Hu, Zhihao Zhu, Hao He, Yaohui Jin:

Look Before You Leap: Problem Elaboration Prompting Improves Mathematical Reasoning in Large Language Models. 1-5 - Qingyao Wu, Bosheng Chen, Chen Li, Xiaotong Tu, Xinghao Ding, Yue Huang:

Efficient Infrared Image Super-Resolution Reconstruction via Guided Filter Coefficients Estimation with Parallax Attention Mechanism. 1-5 - Zi Ye

, Tianxiang Chen, Ziyang Wang, Hanwei Zhang, Lijun Zhang:
HFE-RWKV: High-Frequency Enhanced RWKV Model for Efficient Left Ventricle Segmentation in Pediatric Echocardiograms. 1-5 - Ashok S. Kumar, Sheetal Kalyani:

Practical Radar Sensing Using Two Stage Neural Network for Denoising OTFS Signals. 1-5 - Jing Jiang, Jiankun Zhu, Zhaopan Xu, Xi Chen, Sicheng Zhao, Hongxun Yao:

Gaussian Constrained Diffeomorphic Deformation Network for Panoramic Semantic Segmentation. 1-5 - Jue Xiao, Zepu Yi

, Hewang Nie
, Zhi Lu, Xueming Tang, Songfeng Lu, Zhiguo Huang, Runqing Zhang:
FedDiT: Federated Learning by Distillation Token Enhanced Vision Transformer. 1-5 - Hao Wang, Cheng Deng, Zhidong Zhao:

Knowledge-Guided Prompt Learning for Deepfake Facial Image Detection. 1-5 - Jiaming Zhou, Shiwan Zhao, Hui Wang, Tian-Hao Zhang, Haoqin Sun, Xuechen Wang, Yong Qin:

Improving Zero-Shot Chinese-English Code-Switching ASR with kNN-CTC and Gated Monolingual Datastores. 1-5 - Yunfei Chen, Zhan Yang

, Jun Long:
Unsupervised Hierarchical Dynamic Similarity Hashing for Multimedia Retrieval. 1-5 - Guangwenjie Zou, Liang Yao, Fan Liu, Chuanyi Zhang, Xin Li, Ning Chen, Shengxiang Xu

, Jun Zhou:
RemoteTrimmer: Adaptive Structural Pruning for Remote Sensing Image Classification. 1-5 - Chenyang Zhou

, Monghjaya, Licheng Wu:
MonTransformer: Self-Supervised Phonetic to Glyph Conversion Leveraging Positional Context for Traditional Mongolian Texts. 1-5 - Jiaming Zhou, Shiwan Zhao, Jiabei He, Hui Wang, Wenjia Zeng, Yong Chen, Haoqin Sun, Aobo Kong, Yong Qin:

M2R-Whisper: Multi-stage and Multi-scale Retrieval Augmentation for Enhancing Whisper. 1-5 - Zhenpeng Li, Xiao Zhao, Jingwei Bian, Biao Liu, Wei Li, Lihua Zhang:

V-Fusion: 2D Detection-enhanced Multimodal 3D BEV Object Detection. 1-5 - Tingyang Lu, Jiayao Tan, Linyan Li, Fuyuan Hu:

Less Over More: Interference Sample Gradient Purification For Parallel Continual Learning. 1-5 - Feng Zhou, Chi Li, Ju Dai, Mengxiao Zhu, Yongmei Zhang, Yu-Kun Lai, Paul L. Rosin:

Chat-Driven 3D Human Pose and Shape Editing with Large Language Models. 1-5 - Wenqi Sun, Ruobing Xie, Junjie Zhang, Zitian Guo, Wayne Xin Zhao, Zhanhui Kang, Ji-Rong Wen:

A Pre-trained Plug-in Mixture-of-LoRAs Model for Transferable Sequential Recommendation. 1-5 - Ziang Yang, Lingwei Wei, Biyu Zhou, Xuehai Tang, Ruixuan Li, Songlin Hu:

Segment-Recurrent Transformer with Multi-Scale Fusion for Long-Term Time Series Forecasting. 1-5 - Zhenbo Shi, Zhidong Yu, Yuxuan Zhang, Shuchang Wang, Xiaoman Liu, Wei Yang, Liusheng Huang:

Tip the Scales: Achieving Balance in Adversarial Examples Across Modalities. 1-5 - Karim El Khoury, Maxime Zanella, Benoît Gérin, Tiffanie Godelaine, Benoît Macq, Saïd Mahmoudi, Christophe De Vleeschouwer, Ismail Ben Ayed:

Enhancing Remote Sensing Vision-Language Models for Zero-Shot Scene Classification. 1-5 - Zhan Su, Jiashuang Huang, Shu Jiang, Mingliang Wang, Weiping Ding

:
SUFT: Sparse and Uncertain Fusion Transformers for Multi-Atlas Brain Network Analysis. 1-5 - Yichi Wang, Jie Zhang, Chengqian Jiang, Weitai Zhang, Zhongyi Ye, Lirong Dai:

Leveraging Boolean Directivity Embedding for Binaural Target Speaker Extraction. 1-5 - Hecheng Wang, Lizhe Qi, Yunquan Sun:

Integrating Failures in Robot Skill Acquisition with Offline Action-Sequence Diffusion RL. 1-5 - Siyu Liu, Zhida Zhang

, Junxian Duan, Jie Cao, Aihua Zheng:
Dual-PST: Dual-Branch SpatioTemporal-Planar Network for Video Forgery Detection. 1-5 - Feng Pan, Wei Wang, Jianing Zhang:

A Robust Online Miscalibration Detection and Correction Method for LiDAR-Camera. 1-5 - Shentong Mo, Zehua Chen, Fan Bao, Jun Zhu:

DiffGAP: A Lightweight Diffusion Module in Contrastive Space for Bridging Cross-Model Gap. 1-5 - Yu Pu, Wei-Qiang Zhang:

Integrating Pause Information with Word Embeddings in Language Models for Alzheimer's Disease Detection from Spontaneous Speech. 1-5 - Jiale Zou, Yan Chen, Peng Zhou, Chao Wen, Liang Du, Yuhua Qian:

Consensus Graph Filter Learning for Multiple Graph Clustering. 1-5 - Yuxuan Wang, Zhen Xing, Zuxuan Wu:

Advancing Dark Action Recognition via Modality Fusion and Dark-to-Light Diffusion Model. 1-5 - Koby Todros:

Robust Detection Based on the K-Score Test. 1-5 - Xianqi Zhang, Wenrui Wang, Shitong Chai, Xingtao Wang, Xiaopeng Fan

:
Training-Free Task Planning by Parsing Language Signals With Common Sense. 1-5 - Madhava Krishna

, A. V. Subramanyam:
Keypoint Aware Masked Image Modelling. 1-5 - Hongyang Chen, Kaisheng Ma:

Enhancing Vision: Harmonizing Frequency for Imaging Quality and Perception Accuracy. 1-5 - Yuwei Zhang, Yan Wu, Yanming Liu, Xinyue Peng:

CPA-Enhancer: Chain-of-Thought Prompted Adaptive Enhancer for Downstream Vision Tasks Under Unknown Degradations. 1-5 - Xiaogang Jia, Songlei Jian, Yusong Tan, Yonggang Che, Wei Chen, Zhengfa Liang:

Gated Cross-Attention Network for Depth Completion. 1-5 - Yhonatan Kvich, Alperen Yasar, Eyyup Tasci, Rabia Tugce Yazicigil, Yonina C. Eldar:

Modulo Sampling and Recovery with Unknown and Time-Varying Folding Parameter. 1-5 - Minju Kim, Joonhyeon Bae, Eunsik Shin, Kyogu Lee:

Synthetic Dataset Generation for String Ensemble Separation. 1-5 - Quanfeng Lv, Jingguo Ge, Yifei Xu, Tong Li, Liangxiong Li:

PaSTS: Parameter-affined Seasonal-Trend Synthesis for Multi-dimensional Long-Term Time Series Forecasting within LLM. 1-5 - Peng Xie, Kani Chen:

Developing a Multilingual Dataset and Evaluation Metrics for Code-Switching: A Focus on Hong Kong's Polylingual Dynamics. 1-5 - Zhaokun Zhou, Jun Niu, Yang Zhang, Li Yuan, Yuesheng Zhu:

Spiking Transformer with Spatial-Temporal Spiking Self-Attention. 1-5 - Zhaokun Zhou, Yijie Lu, Jiaqiyu Zhan, Guibo Luo, Yuesheng Zhu:

SpikingPoint: Rethinking Point as Spike for Efficient 3D Point Cloud Analysis. 1-5 - Shankhanil Mitra, Rajiv Soundararajan:

Vision-Language Model Guided Semi-supervised Learning for No-Reference Video Quality Assessment. 1-5 - Zhiling Zhang, Jie Zhang, Kui Zhang, Wenbo Zhou, Ting Xu, Daiheng Gao, Zixian Guo, Qinglang Guo, Weiming Zhang, Nenghai Yu:

Segue: Side-information Guided Generative Unlearnable Examples for Facial Privacy Protection in Real World. 1-5 - Zhexian Zhou, Liang Xiao, Guo-Sen Xie:

Part in Part Embedding Network for Zero-Shot Learning. 1-5 - Reza Mirzaeifard, Stefan Werner:

Federated Smoothing ADMM for Quantile Regression with Non-Convex Sparse Penalties. 1-5 - Zhiwei Dong, Ruihao Gong

, Yang Yong, Shuo Wu, Yongqiang Yao, Song-Lu Chen, Xu-Cheng Yin:
Tool Playgrounds: A Comprehensive and Analyzable Benchmark for LLM Tool Invocation. 1-5 - Junzheng Zhang, Weijia Guo, Bochao Liu, Ruixin Shi, Yong Li, Shiming Ge:

Distilling Generative-Discriminative Representations for Very Low-Resolution Face Recognition. 1-5 - Yiwen Wang, Zixin Tang, Yexun Hu

, Guisong Liu, Tai-Xiang Jiang:
Spectral Low-Rank Attention with Flow-Based Refinement for Spectral Reconstruction. 1-5 - Riccardo Miccini

, Clément Laroche, Tobias Piechowiak, Luca Pezzarossa:
Scalable Speech Enhancement With Dynamic Channel Pruning. 1-5 - Rang Liu, Ming Li, Qian Liu:

Joint Space-Time Adaptive Processing and Beamforming Design for Cell-Free ISAC Systems. 1-5 - Do June Min, Karel Mundnich, Andy Lapastora, Erfan Soltanmohammadi, Srikanth Ronanki, Kyu J. Han:

Speech Retrieval-Augmented Generation without Automatic Speech Recognition. 1-5 - Ryuhei Takahashi, Pu Wang:

GREST: Ghost Targets Removal Algorithm Using Multipath Angle Estimation. 1-5 - Tsubasa Terada, Toshihiro Ito, Ryuhei Takahashi:

Separate Estimation of Angular Velocity and Angle for Digital Array Radar. 1-5 - Jieru Jia, Jianchao Yang:

SSDViT: Exploring Siamese and Self Distillation in ViTs for Generalizable Person Re-identification. 1-5 - Weijie Xiong, Kai Zhong, Zhiling Xiao, Jingran Lin, Qiang Li:

Secure Analog Beamforming Design for Wireless Communication Systems With Movable Antennas. 1-5 - Zeyu Li, Sheng Yang, Hanxiang Yang, Xiongxin Tang, Fengge Wu, Fanjiang Xu:

BIAWDiff: Enhancing Low-Light Images with Bio-Inspired Attention and Wavelet Diffusion. 1-5 - Jianjian Yin

, Yi Chen, Zhichao Zheng, Junsheng Zhou, Yanhui Gu:
Uncertainty-Participation Context Consistency Learning for Semi-supervised Semantic Segmentation. 1-5 - Yingmao Miao, Chenhao Lin, Zhengyu Zhao, Hang Wang, Shuai Liu, Chao Shen, Xiaohong Guan:

One-Shot Face Avatar Generation in a Single Forward Pass with Identity Preservation. 1-5 - Haorui Li, Jiaqi Liang, Linjing Li, Daniel Zeng:

Conservative Offline Meta-Reinforcement Learning with Task Similarity Measurement. 1-5 - Erjin Bao, Ching-Chun Chang, Hanrui Wang

, Isao Echizen:
Agentic Copyright Watermarking against Adversarial Evidence Forgery with Purification-Agnostic Curriculum Proxy Learning. 1-5 - Xiaoyu Chen, Changde Du, Che Liu, Yizhe Wang, Huiguang He:

BP-GPT: Auditory Neural Decoding Using fMRI-prompted LLM. 1-5 - Zengzhao Chen, Chuanxu Zhao, Zhifeng Wang, Chuan Liu, Qiuyu Zheng, Cheng Zou:

DS-BTIAN: A Novel Deep-Shallow Bidirectional Transformer Interactive Attention Network for Multimodal Emotion Recognition. 1-5 - Haoyin Yan, Jie Zhang, Cunhang Fan, Yeping Zhou, Peiqi Liu:

LiSenNet: Lightweight Sub-band and Dual-Path Modeling for Real-Time Speech Enhancement. 1-5 - Feifei Fu, Zhiwu Lu:

Enhancing Data-Free Class-Incremental Learning via Image-Centric Dual Distillation. 1-5 - He Huang, Wenjie Huang, Qi Yang, Yiling Xu, Zhu Li:

A Hierarchical Compression Technique for 3D Gaussian Splatting Compression. 1-5 - Luyu Zhu, Kai Ye, Jiayu Yao, Chenxi Li, Luwen Zhao, Yuxin Cao, Derui Wang, Jie Hao:

MAID: Model Attribution via Inverse Diffusion. 1-5 - Longsheng Jin, Hua Chen, Jiaxiong Fang, Wei Liu, Ye Tian, Gang Wang:

Fourth-Order Cumulant Based 3-D Near-Field Underdetermined Parameter Estimation With Exact Spatial Propagation Model. 1-5 - Zongyou Yu, Qiang Qu, Xiaoming Chen, Chen Wang:

Can Large Language Models Grasp Event Signals? Exploring Pure Zero-Shot Event-based Recognition. 1-5 - Shibo Wang, Zili Ma, Ka-Hou Chan, Yue Liu, Tong Tong, Qinquan Gao, Guangtao Zhai, Xiaohong Liu, Tao Tan

:
Contrastive Learning via Randomly Generated Deep Supervision. 1-5 - Shuguo Hu, Xudong Zhao, Shuwei Hu, Xuan Gao:

Fusion-OSR: Cross-Domain Contrastive Learning with Weibull Calibration for Time Series Open Set Recognition. 1-5 - Fengyu Lu, Jiaxin Duan, Junfei Liu:

Learning Markup Language Model for Composite Relationships Extraction. 1-5 - Fanxuan Kong

, Jun Lu
:
A Weakly Supervised Semantic Segmentation Model with Enhanced CLIP Feature Extraction. 1-5 - Shun Zou, Zhuo Zhang

, Guangwei Gao:
OCTAMamba: A State-Space Model Approach for Precision OCTA Vasculature Segmentation. 1-5 - Guoming Lu, Heng Yin

, Zhiyong Shu, Jielei Wang
, Guangchun Luo:
BDCKD: Unlocking the Power of Brownian Distance Covariance in Knowledge Distillation. 1-5 - Mainak Chakraborty, Chandan

, Bodhibrata Mukhopadhyay, Sahil Anchal, Subrat Kar:
VibeGait: Enhancing Structural-Vibration based Gait Recognition using Vision. 1-5 - Xiang Xue, Quan Liu, Meilong Shi, Yuchao Jin:

Diverse Collaboration in Multi-Agent Reinforcement Learning via Self-Adaptive Method. 1-5 - Zeren Zhang, Haibo Qin, Jiayu Huang, Jo-Ku Cheng

, Yixin Li, Hui Lin, Yitao Duan, Jinwen Ma:
SwapTalk: Audio-Driven Talking Face Generation with One-Shot Customization in Latent Space. 1-5 - Wenxiang Jiang

, Hanwei Zhang, Weigang Wang
, Zhongwen Guo, Tianao Zhang, Hao Wang:
MPAM-3DGS: Multi-Parametric Adversarial Manipulation for 3D Gaussian Splatting. 1-5 - Niv Cohen, Yhonatan Kvich, Rui Guo, Yonina C. Eldar:

Deep Unfolding of Full Waveform Inversion for Quantitative Ultrasound Imaging. 1-5 - Haoqiu Xiong, Robbert Beerten, Zhuangzhuang Cui, Yang Miao, Sofie Pollin:

BS-Breath: Respiration Sensing with Cell-free Massive MIMO. 1-5 - Yan Sun, Wen-Qin Wang, Maria Sabrina Greco, Fulvio Gini:

Subspace-Based Range-Angle Tracking for Coherent FDA Radar. 1-5 - Weihao Deng, Fei Han, Qinghua Ling, Qing Liu, Henry Han:

Causal fMRI-Mamba: Causal State Space Model for Neural Decoding and Brain Task States Recognition. 1-5 - Guohao Li

, Hongyu Yang, Yifang Men, Di Huang, Weixin Li
, Ruijie Yang, Yunhong Wang:
Generating Editable Head Avatars with 3D Gaussian GANs. 1-5 - Lixiong Qin, Ning Jiang, Yang Zhang, Yuhan Qiu, Dingheng Zeng, Jiani Hu, Weihong Deng:

Towards Interactive Deepfake Analysis. 1-5 - Nikolai Lund Kühne

, Astrid H. F. Kitchena, Marie S. Jensen, Mikkel S. L. Brøndt, Martin Gonzalez, Christophe A. N. Biscio
, Zheng-Hua Tan
:
Detecting and Defending Against Adversarial Attacks on Automatic Speech Recognition via Diffusion Models. 1-5 - Kirill Paramonov, Mete Ozay, Eunju Yang, Jijoong Moon, Umberto Michieli:

Controllable Forgetting Mechanism for Few-Shot Class-Incremental Learning. 1-5 - Jaesung Huh, Juan Azcarreta Ortiz, Anurag Kumar, Ashutosh Pandey, Ali Aroudi, Daniel D. E. Wong, Francesco Nesta, Buye Xu, Jacob Donley:

Advancing Active Speaker Detection for Egocentric Videos. 1-5 - Fangqi Li, Shilin Wang, Lei Yang

:
Rethinking the Fragility and Robustness of Fingerprints of Deep Neural Networks. 1-5 - Guoyi Li, Zhongjiang Yao, Die Hu, Yingrui Xu, Xiaodan Zhang, Honglei Lyu:

Emotion-aware Structural Enhancement Graph Auto-Encoder for Rumor Detection. 1-5 - Bochao Zou, Zizheng Guo, Wenfeng Qin, Xin Li, Kangsheng Wang

, Huimin Ma:
Synergistic Spotting and Recognition of Micro-Expression via Temporal State Transition. 1-5 - Marko Tuononen

, Dani Korpi, Ville Hautamäki:
Interpreting Deep Neural Network-Based Receiver Under Varying Signal-To-Noise Ratios. 1-5 - Fangyuan Xie, Feiping Nie, Weizhong Yu, Xuelong Li:

Efficient Co-clustering via Anchor-refined Label Spreading. 1-5 - Zeyu Wang

, Jiayu Wang, Haiyu Song, Jiawei Feng, Haoran Duan:
Multi-Modal Medical Image Fusion via 3D Manifold Fitting and Dual-Domain Cross-Attention. 1-5 - Shuanglin Li, Siyang Song, Rajesh Nair, Syed Mohsen Naqvi:

A Frequency-aware Augmentation Network for Mental Disorders Assessment from Audio. 1-5 - Luca A. Lanzendörfer, Constantin Pinkl, Nathanaël Perraudin, Roger Wattenhofer:

Bootstrapping Language-Audio Pre-training for Music Captioning. 1-5 - Jaejun Lee, Yoori Oh, Kyogu Lee:

Speaking Without Sound: Multi-speaker Silent Speech Voicing with Facial Inputs Only. 1-5 - Piyush Bagad, Makarand Tapaswi, Cees G. M. Snoek, Andrew Zisserman:

The Sound of Water: Inferring Physical Properties from Pouring Liquids. 1-5 - Zhiling Ye, Liang-Guo Zhang, Dingheng Zeng, Quan Lu, Ning Jiang:

Realistic Real-Time Talking Head Synthesis with Grid Encoding and Progressive Conditioning. 1-5 - Haitao Liu, Xinyi Zhang, Chuanmin Jia, Yanbiao Li, Gaogang Xie:

AKI360: Enabling Highly Interactive 360-degree Video Streaming by Adaptive Keyframe Interval. 1-5 - Kanwardeep Singh Gahlot, Sandeep Joshi, Ke Wang

:
Nonlinear Anisotropic Diffusion-Based Channel Estimation in 5G Wireless Networks. 1-5 - Purnima Kamath, Chitralekha Gupta, Suranga Nanayakkara:

MorphFader: Enabling Fine-grained Controllable Morphing with Text-to-Audio Models. 1-5 - Bowen Hao, Dongliang Zhou, Xiaojie Li, Xingyu Zhang, Liang Xie, Jianlong Wu, Erwei Yin:

LipGen: Viseme-Guided Lip Video Generation for Enhancing Visual Speech Recognition. 1-5 - Hailun Zhang

, Xinrui Wang, Qijun Zhao:
Granularity-Aware Contrastive Learning for Fine-Grained Action Recognition. 1-5 - Chengxin Zhao, Hefei Ling, Jiazhong Chen, Han Fang, Zongyi Li, Sijing Xie:

AD2T: Adversarial Distortion Domain Translation for Robust Watermarking against Non-differentiable Distortions. 1-5 - Yifan Xiong, Dongyue Guo, Lipeng Shen, Wei Mo, Hui Yang, Yi Lin:

Adversarial Feature Disentanglement Framework for Voice Pathology Detection. 1-5 - Tingxuan Chen, Liu Yang, Zidong Wang, Shuai Luo, Jun Long:

Enhancing Extrapolation Reasoning on Temporal Knowledge Graphs with Logic Rules and Queries. 1-5 - Jiaying Gao

, Fausto Giunchiglia, Tongyu Zhao, Chuntao Li, Hao Xu:
Dual-Pyramid Attention Collaborative Network for Oracle Bone Inscription Classification. 1-5 - Yali Bi

, Enyu Che, Yinan Chen, Yuanpeng He, Jingwei Qu:
Multi-Prototype-based Embedding Refinement for Medical Image Segmentation. 1-5 - Yujie Ding, Shenghua Teng, Zuoyong Li, Xiao Chen:

LSU-NET: Lightweight Automatic Organs Segmentation Network for Medical Images. 1-5 - Dong Sun, Wenya Guo, Xumeng Liu, Ying Zhang, Zhaoxiang Hou, Zengxiang Li:

Zero-shot Document Retrieval with Hybrid Pseudo-document Retriever. 1-5 - Jialu Tang, Tong Xia, Yuan Lu, Cecilia Mascolo, Aaqib Saeed:

Electrocardiogram Report Generation and Question Answering via Retrieval-Augmented Self-Supervised Modeling. 1-5 - Xinjue Wang

, Esa Ollila
, Sergiy A. Vorobyov:
Robust Activity Detection for Massive Access using Covariance-based Matching Pursuit. 1-5 - Shan-Ya Yang, Hao-Chung Cheng, Chien-Yao Wang, Jia-Ching Wang, Chun-Yi Lee:

A Key to Effective Multi-task Learning: Separate Query Selection for Task-Synergized Handling and Node Utilization. 1-5 - Zhihua Xie, Haolin Chang:

Micro-expression Spotting based on Multi-modal Hierarchical Semantic-guided Deep Fusion Model. 1-5 - Esther Ramdinmawii, Vinay Kumar Mittal:

Discriminating Mizo Hunting and War Chants using Acoustic Features. 1-5 - Qizao Wang, Xuelin Qian, Bin Li, Lifeng Chen, Yanwei Fu, Xiangyang Xue:

Content and Salient Semantics Collaboration for Cloth-Changing Person Re-Identification. 1-5 - Keyi Liu, Yeqi Luo, Weidong Yang, Jingyi Xu, Zhijun Li, Wen-Ming Chen, Ben Fei:

GS-PT: Exploiting 3D Gaussian Splatting for Comprehensive Point Cloud Understanding via Self-supervised Learning. 1-5 - Changshun Wu

, Wendi Ding, Xiaowei Huang, Saddek Bensalem:
Out-of-Distribution Detectors: Not Yet Primed for Practical Deployment. 1-5 - Shuqi Dai, Yunyun Wang, Roger B. Dannenberg, Zeyu Jin:

Everyone-Can-Sing: Zero-Shot Singing Voice Synthesis and Conversion with Speech Reference. 1-5 - Sachini Piyoni Ekanayake, Daphney-Stavroula Zois:

Instance-wise Feature Acquisition with Classifier Selection Option for Structured Data Instances. 1-5 - Bharath Vishwanath, Yingzhan Xu, Kai Zhang, Li Zhang:

Cross-Component Residual Prediction for Geometry-Based Point Cloud Compression. 1-5 - Natarajan Balaji Shankar

, Zilai Wang, Eray Eren, Abeer A. Alwan:
Selective Attention Merging for low resource tasks: A case study of Child ASR. 1-5 - Nicholas D. Sidiropoulos, Yuanyuan Tang:

Interference-Resilient Hybrid Multi-Antenna ARQ. 1-5 - Yongsung Park, Peter Gerstoft, Seunghyun Yoon, Woojae Seong:

Physics-Informed Neural Networks for Ocean Acoustic Field Prediction with Envelope Smoothing. 1-5 - Jinping Zou, Xiaoge Deng, Tao Sun:

Sharpness-Aware Minimization with Adaptive Regularization for Training Deep Neural Networks. 1-5 - Longshen Ou, Yu Takahashi, Ye Wang:

Lead Instrument Detection from Multitrack Music. 1-5 - Shengchang Xiao, Xueshuai Zhang, Pengyuan Zhang, Yonghong Yan:

Debiased Training For Semi-supervised Sound Event Detection. 1-5 - Chenyang Yu, Xuehu Liu, Ju Dai, Pingping Zhang, Huchuan Lu:

Hierarchical Proxy Learning for Cloth-Changing Person Re-Identification. 1-5 - Wei Chen, Xin Luo, Yulin He, Tianrui Liu, Di Wu, Tianhang Guo, Yuhang Li, Yuhua Tang:

A Cost-effective Solution for Remote Sensing Image Segmentation via Train/Test-Time Adaptation. 1-5 - Gautam Siddharth Kashyap

, Zohaib Hasan Siddiqui, Mohammad Anas Azeez, Rafiq Ali, Shantanu Kumar, Navin Kamuni, Jiechao Gao:
Fooling the Forgers: A Multi-Stage Framework for Audio Deepfake Detection. 1-5 - Wei Zhang, Tian-Hao Zhang, Chao Luo, Hui Zhou, Chao Yang, Xinyuan Qian, Xu-Cheng Yin:

Breaking Through the Spike: Spike Window Decoding for Accelerated and Precise Automatic Speech Recognition. 1-5 - Mingyu Liu, Yijie Wang, Xiaohui Zhou, Yongjun Wang:

Graph Structure Learning via Transfer Entropy for Multivariate Time Series Anomaly Detection. 1-5 - Gautam Siddharth Kashyap

, Harsh Joshi, Manaswi Kulahara, Rajkumar Dhakar, Atul Sajjanhar, Jiechao Gao, Sarthak Jain, Shahab Saquib Sohail:
Can AI See What We Can't? Leveraging Deep Learning and Multi-Temporal Satellite Data to Revolutionize Crop Type Mapping and Yield Prediction. 1-5 - Lingxi Hu, Xiao Wu

, Risa Higashita, Xiaoli Xing, Menglan Zhou, Song Lin, Xiaorong Li, Xiaoling Li, Jinming Duan, Jiang Liu:
Exploring Temporal Constraints for Unsupervised Iris Motion Tracking in AS-OCT Videos. 1-5 - Yung Jer Wong, Teck Khim Ng:

Local Statistics for Generative Image Detection. 1-5 - Yingyu Chen, Ziyuan Yang, Deng Xiong, Yi Zhang:

Modality Modulation and Dual Consistency for Multi-Modality Semi-Supervised Medical Image Segmentation. 1-5 - Bingbing Wang, Geng Tu, Bin Liang, Zhixin Bai, Min Yang, Xi Zeng, Liang Yao, Ruifeng Xu:

Enhancing Emotion Reasoning for Image Multi-Emotion Prediction. 1-5 - Zhiheng Li, Zhimin Weng, Yuehuan Wang:

Multi-view Feature Discrepancy Attack for Single Object Tracking. 1-5 - Haoran Liao, Shaohua Hu, Zhihao Zhu, Hao He, Yaohui Jin:

TopoRefine: Iterative Refinement with Reasoning Topology as High-Level Feedback. 1-5 - Jiawei Zhang, Ziwen Li, Jinpu Zhang, Yuehuan Wang:

Retinex-Based Self-Conditioned Diffusion Model for Low-Light Image Enhancement. 1-5 - Haoran Liao, Zhihao Zhu, Shaohua Hu, Hao He, Yaohui Jin:

Faithful Self-Refinement in Mathematical Reasoning via Progressive Back-Translation. 1-5 - Diwei Huang, Kunyang Lin, Peihao Chen, Qing Du:

Map-Guided Few-Shot Audio-Visual Acoustics Modeling. 1-5 - Yumeng Yang, Guannan Dong, Aichun Zhu, Mingcheng Ni, Yifeng Li:

Unveiling Local Well-posedness Influence for Cross-modal Person Re-Identification. 1-5 - Zhongyu Huang, Yijun Chen, Aizierjiang Aiersilan, Li Li:

G-Depth: An Efficient Graph Method for Robust Depth Completion. 1-5 - Fei Wu, Ruixuan Zhou, Changhui Hu, Qinghua Huang, Xiao-Yuan Jing:

Complementary Graph Learning and Prompt-based Cross-modal Generation for Missing-modality Fake News Detection. 1-5 - Yi-Chiao Wu, Dejan Markovic, Steven Krenn, Israel D. Gebru, Alexander Richard:

ComplexDec: A Domain-robust High-fidelity Neural Audio Codec with Complex Spectrum Modeling. 1-5 - Bao-Hsuan Huang, Po-Chih Kuo, Likai Huang, Chaur-Jong Hu, Cheng-Yu Chen:

Explainable Detection of Alzheimer's Disease Through Analysis of Human Behavior in Video. 1-5 - Yiwei Li, Jiaxin Liu, Lin Yang, Yating Zhang, Kunlin Liu, Ge Zhou, Liangze Yin, Wei Dong:

A Robust Distributed Recurrent Neural Network for Multi-Agent Consensus Control. 1-5 - Sandeep Rao, Rajan Narasimha, Shunqiao Sun:

Signal Processing Challenges in Automotive Radar. 1-5 - Yong Ren, Chenxing Li, Manjie Xu, Wei Liang, Yu Gu, Rilin Chen, Dong Yu:

STA-V2A: Video-to-Audio Generation with Semantic and Temporal Alignment. 1-5 - Zhi Chen

, Yun-Fei Shao
, Yong Ma, Mingsheng Wei, Le Zhang, Wei-Qiang Zhang:
Improving Acoustic Scene Classification in Low-Resource Conditions. 1-5 - Yiyuan Ge, Zhihao Chen, Ziyang Wang, Jiaju Kang, Mingya Zhang:

Spectral Enhancement and Pseudo-Anchor Guidance for Infrared-Visible Person Re-Identification. 1-5 - Xiang Ying, Xiangchuan Xie, Tianyi Xu, Yue Zhao, Zechen Meng, Mankun Zhao:

WMRE: Enhancing Distant Supervised Relation Extraction with Word-level Multi-instance Learning and Multi-hierarchical Feature. 1-5 - Zongye Zhang

, Huanyu Zhou, Qingjie Liu, Yunhong Wang:
SkeletonMix: A Mixup-Based Data Augmentation Framework for Skeleton-Based Action Recognition. 1-5 - Guanglong Zhang, Tianren Wang, Jinjie Guo, Zhiyuan Yang, YiLian Wu, Guixia Kang:

Exploring the Interpretability of EEG-Inception Convolutional Neural Networks for Epilepsy Prediction. 1-5 - Zhangchen Zhu, Jiafeng Li, Ying Wen:

Self-Optimization Training for Weakly Supervised Image Manipulation Localization. 1-5 - Arad Gast, Luc Le Magoarou, Nir Shlezinger:

DCD-MUSIC: Deep-Learning-Aided Cascaded Differentiable MUSIC Algorithm for Near-Field Localization of Multiple Sources. 1-5 - Junkun Hong, Yitian Long, Yueyi Luo, Liujie Hua, Jun Long, Qianqian Qi:

A Reinforcement Learning Agent Controlled Multi-branch Small Object Detection Framework. 1-5 - Liujie Hua, Xiu Su, Yueyi Luo, Shan You, Jun Long:

HieClip: Hierarchical CLIP with Explicit Alignment for Zero-Shot Anomaly Detection. 1-5 - Li Fu, Shanyong Yu, Siqi Li, Fan Lu, Youzheng Wu, Xiaodong He:

UME: Upcycling Mixture-of-Experts for Scalable and Efficient Automatic Speech Recognition. 1-5 - Beichuan Tang, Yimao Sun, Xiantao Heng, Yanbing Yang, Liangyin Chen:

Extending MPR for Locating a Moving Object Based on TDOA and FDOA. 1-5 - Ruya Jiang, Chun Wang

, Weihong Deng:
Seek and Solve Reasoning for Table Question Answering. 1-5 - Chen Liu, Zhaolin Wan, Penghong Wang, Xingtao Wang, Xiaopeng Fan

:
TS-Net: Assembling Task-specific Features from Multiple Feature Levels for Multi-task Learning. 1-5 - Ding Xu, Lingjie Duan, Hongbo Zhu:

Hybrid Content Caching Empowered By AIGC in Wireless Networks. 1-5 - Haoran Shen, Chen Zeng, Jiahui Wang, Qiao Wang:

Reduced Effectiveness of Kolmogorov-Arnold Networks on Functions with Noise. 1-5 - Amit Milstein, Tomer Yablonka, Nir Shlezinger:

Learned Approximated Optimization for Rapid Low-Complexity Hybrid Beamforming Design. 1-5 - Riling Wei, Hanjie Chen, Kelu Yao, Chuanguang Yang, Jun Wang, Chao Li:

ECG-guided individual identification via PPG. 1-5 - Peter Gerstoft, Yongsung Park:

Atom-Constrained Maximum Likelihood Gridless DOA with Wirtinger Gradients. 1-5 - Ke-Han Lu

, Zhehuai Chen, Szu-Wei Fu, Chao-Han Huck Yang, Jagadeesh Balam, Boris Ginsburg, Yu-Chiang Frank Wang, Hung-Yi Lee:
Developing Instruction-Following Speech Language Model Without Speech Instruction-Tuning Data. 1-5 - Wenyu Zhang, Shuo Sun, Bin Wang, Xunlong Zou, Zhuohan Liu, Yingxu He, Geyu Lin, Nancy F. Chen, Ai Ti Aw:

MoWE-Audio: Multitask AudioLLMs with Mixture of Weak Encoders. 1-5 - Xiquan Li, Wenxi Chen, Ziyang Ma, Xuenan Xu, Yuzhe Liang, Zhisheng Zheng, Qiuqiang Kong, Xie Chen:

DRCap: Decoding CLAP Latents with Retrieval-Augmented Generation for Zero-shot Audio Captioning. 1-5 - Xuan Zhou, Zongyong Deng, Qijun Zhao:

Identity-Agnostic Learning for Deepfake Face Detection. 1-5 - Weijie Wang, Yan Wang, Guokun Xu, Zuxin Chen, Siyuan Li, Min Yu, Weiqing Huang, Degang Sun:

VN-GT: Optimizing Virtual Network Deployment via Game Theory. 1-5 - Xiaoli Yao, Jia Tan, Zijian Deng, Deng Xiong, Qijun Zhao, Min Wu:

MUPO-Net: A Multilevel Dual-domain Progressive Enhancement Network with Embedded Attention for CT Metal Artifact Reduction. 1-5 - Shuai Guo, Yang Gu, Yuan Ma, Yingwei Zhang, Weining Weng, Jun Liu, Weiwei Dai, Yiqiang Chen:

Semantic-oriented Visual Prompt Learning for Class Incremental Learning. 1-5 - Qiao Li, Kanlun Tan, Qiao Liu, Di Yuan, Xin Li, Yunpeng Liu:

Efficient Hierarchical Domain Adaptive Thermal Infrared Tracking. 1-5 - Jiang Fang, Haonan He, Chen Guo, Jiyan Sun, Zhaorui Guo, Chao Xu, Mohan Su, Yinlong Liu, Wei Ma:

DASSL: Domain Agnostic Self-Supervised Learning with Multiple Missing Information Reconstruction Branches. 1-5 - Kang He, Yuzhe Ding

, Bobo Li, Haining Wang, Fei Li, Chong Teng, Donghong Ji:
Harnessing Dimensional Contrast and Information Compensation for Sentence Embedding Enhancement. 1-5 - Dezhi Zheng, Kaijun Deng, Jinbao Wang, Linlin Shen:

Dual Encoders for Diffusion-based Image Inpainting. 1-5 - Xiaoxu Ma, Chen Zhao, Minglai Shao, Yujie Lin:

Hypergraph-Based Dynamic Graph Node Classification. 1-5 - Bo Hu, Wenzhi Chen

, Chunyi Li
, Jiaxu Leng, Weisheng Li, Xinbo Gao:
Bidirectional Reference Image Quality Assessment via Content-Quality Correlation Modeling. 1-5 - Wei-Hua Li

, Yu-Hsing Hsieh, Huei-Fang Yang, Chu-Song Chen:
PDSeg: Patch-Wise Distillation and Controllable Image Generation for Weakly-Supervised Histopathology Tissue Segmentation. 1-5 - Prince Arya, Saurabh Kumar, Ashish Agarwal, Nutan Yenneti, Narasimha Pai:

Redefining Well Exposedness for Locally Adaptive Multi-Exposure Fusion. 1-5 - Yanxu Mao, Xiaohui Chen

, Peipei Liu, Tiehan Cui, Zuhui Yue, Zheng Li:
GEGA: Graph Convolutional Networks and Evidence Retrieval Guided Attention for Enhanced Document-level Relation Extraction. 1-5 - Xinlei Huang, Jialiang Tang, Xubin Zheng, Jinjia Zhou, Wenxin Yu, Ning Jiang:

Learn from Balance: Rectifying Knowledge Transfer for Long-Tailed Scenarios. 1-5 - Mattes Ohlenbusch

, Christian Rollwage, Simon Doclo:
Low-Complexity Own Voice Reconstruction for Hearables with an In-Ear Microphone. 1-5 - Xinyu Wang, Kang Chen, Lei Liu, Tao Han, Bin Li, Lei Bai:

Global Tropical Cyclone Intensity Forecasting with Multi-modal Multi-scale Causal Autoregressive Model. 1-5 - Lijian Li, Yuanpeng He, Chi-Man Pun:

An Adaptive Framework for Multi-View Clustering Leveraging Conditional Entropy Optimization. 1-5 - Mengjian Zhang, Guihua Wen, Pei Yang

:
Multi-label body constitution recognition via dual transform MLP-like architecture using tongue images. 1-5 - Renjun Jia, Kaiming Yang, Dawei Cheng, Li Han, Yuqi Liang:

Graph-Driven Insights: Enhancing Stock Market Prediction with Relational Temporal Dynamics. 1-5 - Po-Wei Chen, Von-Wun Soo:

Training Better Embedding With Perturbed Data Augmentation for Automatic Singing Quality Assessment. 1-5 - Xiaoyan Liao, Haoliang Zhao, Fan Yang, Kwokching Cheung, Jun Jiang, Yong Zhao, Jie Chen, Xinan Wang:

RetinaStereo: Dynamic-Volume Stereo Matching Network. 1-5 - Minghao Li, Hechuan Lin, Huiying Xu, Ziying Wang, Xinzhong Zhu, Xiao Huang:

One-step Incomplete Multi-view Clustering based on Bipartite Graph Learning. 1-5 - Dimme de Groot

, Baturalp Karslioglu, Odette Scharenborg
, J. Martinez
:
Loudspeaker Beamforming to Enhance Speech Recognition Performance of Voice Driven Applications. 1-5 - Eleftheria Lydaki, Zheng-Hua Tan

, Jesper Jensen, Meng Guo:
Deep Feedback Cancellation for Hearing Aids with Improved System Stability and Sound Quality. 1-5 - Tao Feng, Jie Zhang, Huashan Liu, Zhijie Wang, Shengyuan Pang:

Towards Efficient Deep Hashing Retrieval: Condensing Your Data via Feature-Embedding Matching. 1-5 - Robin San Roman, Pierre Fernandez, Antoine Deleforge, Yossi Adi, Romain Serizel:

Latent Watermarking of Audio Generative Models. 1-5 - Yu Liang, Sheng Zhang, Jie Wu:

Volatile MAB-based Configuration Selection for Offloading Video Analytics Tasks to Edges. 1-5 - L. Yashvanth, Chandra R. Murthy, Bhaskar D. Rao:

Distributed IRSs Mitigate Spatial Wideband & Beam Split Effects. 1-5 - Xintong Lu, Jiahe Li, Yuchao Zhang, Wendong Wang:

Towards Feature-Consistent Parameter Collaboration for Personalized Federated Learning. 1-5 - Yongqiang Zhao, Zhenyu Li, Zhi Jin, Feng Zhang, Lianwei Wu, Xinhai Xu, Donghong Liu:

Try Before You Buy: Solving Multi-Model Complex Tasks by Model Competitions. 1-5 - Tianyang Wang, Xiaofei Nan, Yunze Wang, Yuhang Yan, Zhenkai Gao, Jingxin Liu:

Enhanced Corneal Endothelial Cell Segmentation via Frequency-Selected Residual Fourier Diffusion Models. 1-5 - Akhil Vasim

, Pankhi Kashyap, Shabnam Choudhury, Biplab Banerjee:
Spatially-Aware Cross-Modal Contrastive Learning for Low-Shot HSI Classification. 1-5 - Juliette Chevallier, Gersende Fort:

Sampling Nonsmooth Log-Concave Densities: A Comparative Study of Primal-Dual Based Proposal Distributions. 1-5 - Xin Li, Feng Xu, Feifei Tao, Yao Tong, Xin Lyu, Jianyi Zhong, André Kaup:

A spectrum-enhanced attention model for semantic segmentation of remote sensing images. 1-5 - Hsi-Ai Tsao, Lei Hsiung, Pin-Yu Chen, Tsung-Yi Ho:

When Does Visual Prompting Outperform Linear Probing for Vision-Language Models? A Likelihood Perspective. 1-5 - Enhui Chai, Xingyu Li, Tianxiang Cui, Zheng Lu, Fiseha Berhanu Tesema

:
Accelerating Convergence in Bounding Box Regression with a Refined IoU Loss Function. 1-5 - Zilong Hu, Yan Qiao, Zidang Cai, Rongyao Hu, Junjie Wang, Meng Li, Zhenchun Wei:

Dual Trajectory Revised Diffusion Model for Time Series Forecasting. 1-5 - Or Berebi, Fabian Brinkmann, Stefan Weinzierl, Boaz Rafaely:

Ambisonics Binaural Rendering via Masked Magnitude Least Squares. 1-5 - An Luo, Kai Hu, Kai Jiang:

Rethinking Dual-Stream Super-Resolution for Enhancing Remote Sensing Object Detection. 1-5 - Lingwei Meng, Shujie Hu, Jiawen Kang, Zhaoqing Li, Yuejiao Wang, Wenxuan Wu, Xixin Wu, Xunying Liu, Helen Meng:

Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile Instructions. 1-5 - Mengting Ma, Mengjiao Zhao, Yizhen Jiang, Xiangdong Li, Wei Zhang:

SSFMamba: Spatial-Spectral Fusion State Space Model for Pansharpening. 1-5 - Yuanchen Shi, Fang Kong:

MSA-ITEI: A Novel Method for Multimodal Analysis of Social Media Stickers. 1-5 - Jiawen Kang, Lingwei Meng, Mingyu Cui, Yuejiao Wang, Xixin Wu, Xunying Liu, Helen Meng:

Disentangling Speakers in Multi-Talker Speech Recognition with Speaker-Aware CTC. 1-5 - Junteng Jia, Gil Keren, Wei Zhou, Egor Lakomkin, Xiaohui Zhang, Chunyang Wu, Frank Seide, Jay Mahadeokar, Ozlem Kalinli:

Efficient Streaming LLM for Speech Recognition. 1-5 - Yun Wu

, John McAllister:
Efficient Co-Approximate Parallel Compressive Depth Reconstruction on FPGA. 1-5 - Zhenhan Huang, Tejaswini Pedapati, Pin-Yu Chen, Jianxi Gao

:
Modular Prompt Learning Improves Vision-Language Models. 1-5 - Yujie Zhu, Xinyi Zhang, Yekai Lu, Guang Yang

, Faming Fang, Guixu Zhang:
First-order State Space Model for Lightweight Image Super-resolution. 1-5 - Srikar Yellapragada, Kowshik Thopalli, Vivek Sivaraman Narayanaswamy, Wesam Sakla, Yang Liu, Yamen Mubarka, Dimitris Samaras, Jayaraman J. Thiagarajan:

Leveraging Registers in Vision Transformers for Robust Adaptation. 1-5 - Tianyou Liang, Xiaoxu Li, Yu Peng, Min Xu:

Segment Any Bone in CT with Partial Supervision. 1-5 - Thomas Aussaguès, Anne Ferréol, Alice Delmer, Pascal Larzabal:

Whitening Effects for ML-DoA Estimation using a Sparse Representation of Array Covariance. 1-5 - Houcheng Su, Bingli Wang

, Daixian Liu, Jiao Li, Chen-Bin Feng, Chi-Man Vong:
Towards Fully Test-Time Adaptation via Variance Balancing and Semantic Augmentation. 1-5 - Xuechen Wang, Shiwan Zhao, Haoqin Sun, Hui Wang, Jiaming Zhou, Yong Qin:

Enhancing Multimodal Emotion Recognition through Multi-Granularity Cross-Modal Alignment. 1-5 - Chien-Yu Huang, Min-Han Shih

, Ke-Han Lu, Chi-Yuan Hsiao, Hung-Yi Lee:
SpeechCaps: Advancing Instruction-Based Universal Speech Models with Multi-Talker Speaking Style Captioning. 1-5 - Chengming Liu, Fan Wu, Lei Shi

:
FasterGold-DETR: An Efficient End-to-End Fire Detection Model via Gather-and-Distribute Mechanism. 1-5 - Yanjun Zhao, Tian Zhou, Chao Chen, Liang Sun, Yi Qian, Rong Jin:

Sparse-VQ Transformer: An FFN-Free Framework with Vector Quantization for Enhanced Time Series. 1-5 - Qing Wang, Jixun Yao, Zhaokai Sun, Pengcheng Guo, Lei Xie, John H. L. Hansen:

DiffAttack: Diffusion-based Timbre-reserved Adversarial Attack in Speaker Identification. 1-5 - Fangzhou Han, Tianyi Yu, Lamei Zhang, Lingyu Si, Yiqi Zhang:

SlotFusion: Object-Centric Audiovisual Feature Fusion with Slot Attention for Remote Sensing Scene Recognition. 1-5 - Xinhao Liu, Zetao Lin, Yingzhao Jiang, Qiao Yan:

Adversarial Knowledge Transfer for Black-Box Model Inversion Attack. 1-5 - Wei Liu, Liuan Wang, Jun Sun:

Efficient Object Placement Via LLM and Diffusion Model. 1-5 - Allen H.-X. Lei, Tianchen Deng, Han Wang, Jianfei Yang, Shenghai Yuan

:
Audio Array-Based 3D UAV Trajectory Estimation with LiDAR Pseudo-Labeling. 1-5 - Mei Yu, Shengkang Dong, Xuewei Li, Zewen Shang, Yingzhou Sun, Zhiqiang Liu:

A Novel Network for Short-Term Wind Speed Prediction: Mitigating Distribution Shift and Feature Loss. 1-5 - Vijaya Yajnanarayana, Philipp Geuer, Satyam Dwivedi:

Indoor Sensing with Measurements. 1-5 - Zhenyuan Xiao

, Huanran Hu, Guili Xu, Junwei He:
TAME: Temporal Audio-based Mamba for Enhanced Drone Trajectory Estimation and Classification. 1-5 - Ben Wan, Tianyi Zheng

, Zhaoyu Chen, Yuxiao Wang, Jia Wang:
Pruning for Sparse Diffusion Models Based on Gradient Flow. 1-5 - Jinfu Wei, Zheng Zhang, Ran Liao, Duan Gao:

UniFaceGAN: High-Quality 3D Face Editing With a Unified Latent Space. 1-5 - Jiawei Yin, Yu Gao, Wenbin Zhang, Tianyi Wang, Mingjun Zhang:

Diffusion Augmentation Sub-center Modeling for Unsupervised Anomalous Sound Detection with Partially Attribute-Unavailable Conditions. 1-5 - Haoyang Li, Jia Qi Yip, Tianyu Fan, Eng Siong Chng:

Speech Enhancement Using Continuous Embeddings of Neural Audio Codec. 1-5 - Wenlong Wang, Dahua Gao, Xinyu Liu:

LLM-Guided Dual-Branch Diffusion Model for Fine-Grained Motion Synthesis. 1-5 - Wenqi Zheng, Jianing Chen, Junze Yang, Chuhao Chen, Wei Li, Rahul Yadav, Xiangxu Meng:

Feature Refinement Decomposition and Relation Preference Enhancement for Remote Sensing Change Detection. 1-5 - Yuchi Ishikawa, Tatsuya Komatsu, Yoshimitsu Aoki:

Pre-training with Synthetic Patterns for Audio. 1-5 - Yufei Zhang, Zheling Meng, Bo Peng, Jing Dong, Beilin Chu

, Wei Wang
:
Partial Reconstruction Error for Deepfake Detection. 1-5 - Yu Liao, Haixin Guan, Shuang Wei, Yanhua Long:

Leveraging Out-of-Domain Noise for Unsupervised Domain Adaptation in Speech Enhancement. 1-5 - Bo-Wei Tseng, Wen-Li Wei, Jen-Chun Lin

:
Birds of a Feather: Learning to Retrieve Dance Poses From Music Via Ground-Truth Annotation Lifting. 1-5 - Alex Agranovich, Eliya Nachmani, Oleg Rybakov, Yifan Ding, Ye Jia, Nadav Bar, Heiga Zen, Michelle Tadmor Ramanovich:

SimulTron: On-Device Simultaneous Speech to Speech Translation. 1-5 - Yanfeng Wu, Chen Hui, Ronghua Liao, Shaohui Liu, Debin Zhao:

Image Compressive Sensing With Adaptive Sampling by Median Filtering. 1-5 - Shaode Yu, Ze Chen

, Zhimu Yang, Jiacheng Gu, Bizu Feng
, Qiurui Sun:
Exploring Kolmogorov-Arnold networks for realistic image sharpness assessment. 1-5 - Wenqi Zheng, Jianing Chen, Junze Yang, Chuhao Chen, Wei Li, Rahul Yadav, Xiangxu Meng:

Improving 5G Positioning Through Signal-to-Noise Ratio Recognition Training. 1-5 - Ziheng Zhang, Zihan Li

, Dandan Shan, Yuehui Qiu, Qingqi Hong, Qingqiang Wu:
An Intra- and Cross-frame Topological Consistency Scheme for Semi-supervised Atherosclerotic Coronary Plaque Segmentation. 1-5 - Tianxiao Gao, Li Guo, Shihao Wang, Shiai Zhu, Dajiang Zhou:

PQNAS: Mixed-precision Quantization-aware Neural Architecture Search with Pseudo Quantizer. 1-5 - Yi He, Lei Yang

, Shilin Wang:
Enhancing Visual Forced Alignment with Local Context-Aware Feature Extraction and Multi-Task Learning. 1-5 - Jie Zhang, Yirong Yao, Wei He, Yiqun Niu, Chongjun Wang:

Regret Optimization Experience Replay in Off-Policy Reinforcement Learning. 1-5 - Kun Bu

, Yuanchao Liu, Wenbo Wang, Ziyi Cao
:
PIN: A Prompt-based Implicit Sentiment Analysis Network for Chinese. 1-5 - Sho Inoue, Shuai Wang, Wanxing Wang, Pengcheng Zhu, Mengxiao Bi, Haizhou Li:

MacST: Multi-Accent Speech Synthesis via Text Transliteration for Accent Conversion. 1-5 - Shengjie Hu, Xiaogang Zhang, Hua Chen, Wenbin Yan:

EGAS: Enhanced Geometry-aware 3D Asset Generation Using Gaussian Splatting. 1-5 - Dominik Semmler, Michael Joham, Wolfgang Utschick:

Nonlinear Precoding in the RIS-Aided MIMO Broadcast Channel. 1-5 - Hanan Beit-On, Vladimir Tourbabin, Boaz Rafaely:

The Importance of Spatial and Spectral Information in Multiple Speaker Tracking. 1-5 - Kewei Li, Hengshun Zhou, Kai Shen, Yusheng Dai, Jun Du:

Phoneme-Level Contrastive Learning for User-Defined Keyword Spotting with Flexible Enrollment. 1-5 - Huawei Sun, Nastassia Vysotskaya, Tobias Sukianto, Hao Feng, Julius Ott, Xiangyuan Peng, Lorenzo Servadei, Robert Wille:

LiRCDepth: Lightweight Radar-Camera Depth Estimation via Knowledge Distillation and Uncertainty Guidance. 1-5 - Even Matencio, Charles Truong, Laurent Oudre:

Covariance Change Point Detection for Graph Signals. 1-5 - Ivica Kopriva:

Robust Kernel Sparse Subspace Clustering. 1-5 - Hao Yu, Haoyu Chen, Guoying Zhao:

Learning Binary-Antithetical Information Bottleneck for Generalizable Face Anti-Spoofing. 1-5 - Qian Huang, Wenting Liu, Xin Li, Yiming Wang:

Learned Video Compression With Refined Adaptive Flow Pyramid And Coordinate-Aware Attention. 1-5 - Shaolong Wei

, Jiashuang Huang, Mingliang Wang, Shu Jiang, Weiping Ding
:
SCNN: Spike Coupling Neural Network for Multimodal Brain Network Analysis. 1-5 - Hangyang Kong, Wenbo Zhou, Xuxiang He, Xiaotong Tu, Xinghao Ding:

Efficient Dataset Distillation through Low-Rank Space Sampling. 1-5 - Rui Xie, Shuzhan Guo, Li Zou, Jiaxiong Liu, Qian Wang, Jun Zhou:

Spatially-variant Blur Degradation Model Based on Depth Estimation. 1-5 - Riran Cheng, Xupeng Wang

, Ferdous Sohel, Hang Lei:
Superpoints Guided Local Explanation For Deep 3D Trackers. 1-5 - Chang Gong, Boyu Yang

, Weiguo Zheng:
HFLR: Optimizing GNN Training via High-Fixed-Low-Resampling. 1-5 - Jianing Chen, Chuhao Chen, Junze Yang, Wei Li, Rahul Yadav, Wenqi Zheng:

GEMD-UNet: Graph Structure Enhanced Multi-dimensional Learning Unet for Cloud Detection. 1-5 - Yize Sui, Wanrong Huang, Wenjing Yang, Chaofan Zhao, Jing Ren, Ji Wang:

Robust CLIP-Guided Deep Thinking: A Two-Stage Optimization Strategy for Enhancing Adversarial Robustness and Reliability in LVLMs. 1-5 - Shota Nakada, Taichi Nishimura, Hokuto Munakata, Masayoshi Kondo, Tatsuya Komatsu:

DETECLAP: Enhancing Audio-Visual Representation Learning with Object Information. 1-5 - Kyungho Kim

, Jaejin Seo, Seongmin Park, Jihwa Lee:
Prompt Crossing: Evaluating Whether LLM Response Stem from Jailbreak or Normal Prompt. 1-5 - Qiankun Pi, Jicang Lu, Yepeng Sun, Qinlong Fan, Xukun Zhou, Shouxin Shang:

Stance Detection for Social Text: Inference-Enhanced Multi-Task Learning with Machine-Annotated Supervision. 1-5 - Keitaro Yamashita, Kazuki Naganuma, Shunsuke Ono:

Controlling the Number of Sample-Contributive Vertices in Generalized Sampling of Graph Signals. 1-5 - Weiming Qu, Tianlin Liu, Jiawei Du, Dingsheng Luo:

CEMSSL: Conditional Embodied Self-Supervised Learning is All You Need for High-precision Multi-solution Inverse Kinematics of Robot Arms. 1-5 - Tristan S. W. Stevens, Oisín Nolan, Jean-Luc Robert, Ruud J. G. van Sloun

:
Sequential Posterior Sampling with Diffusion Models. 1-5 - Xu Chu

, Hanlin Xue, Bingce Wang, Xiaoyang Liu, Weiping Li, Tong Mo, Tuoyu Feng, Zhijie Tan:
Adaptive Spatiotemporal Augmentation for Improving Dynamic Graph Learning. 1-5 - Yongxin Deng, Xihe Qiu, Xiaoyu Tan, Chao Qu, Jing Pan, Yuan Cheng, Yinghui Xu, Wei Chu:

CogniDual Framework: Self-Training Large Language Models within a Dual-System Theoretical Framework for Improving Cognitive Tasks. 1-5 - Tianqi Zhao, Liangrui Peng, Gang Yao, Di Wu, Yao Tao:

Disentangled Representation Learning for Chinese Handwriting Recognition. 1-5 - Yael Segal-Feldman, Aviv Shamsian, Aviv Navon, Gill Hetz, Joseph Keshet

:
Whisper in Medusa's Ear: Multi-head Efficient Decoding for Transformer-based ASR. 1-5 - Zhilu Wang, Peinan Li, Lingbo Zhao, Fengkai Yuan, Rui Hou, Dan Meng:

RanDoctor: System-Level Ransomware Detection with ProbSparse Self-Attention. 1-5 - Jiale Chen, Xuelian Dong, Wenxiu Xie

, Tao Gong, Fu Lee Wang, Tianyong Hao:
Span Attention for Entity-Consistent Task-Oriented Dialogue Response Generation. 1-5 - Hanchen Pei, Kang Chen, Gongping Huang, Jilu Jin, Jacob Benesty, Jingdong Chen:

On the Design of Low-Rank Differential Beamformers with Nonuniform Linear Microphone Arrays. 1-5 - Zijing Zhang, Jianfei Xiao, Bate Liu:

HiLiteMamba: A Lightweight and High-Frequency Aware Network for Single Image Super-Resolution. 1-5 - Weiqiao Shan, Yuhao Zhang

, Yuchen Han, Bei Li, Xiaofeng Zhao, Yuang Li, Min Zhang, Hao Yang, Tong Xiao, Jingbo Zhu:
Optimizing Speech Multi-View Feature Fusion through Conditional Computation. 1-5 - Kunlong Zhao, Xueqin Luo, Jilu Jin, Gongping Huang, Jingdong Chen, Jacob Benesty:

Design of Robust Differential Beamformers with Microphone Arrays of Arbitrary Planar Geometry. 1-5 - Chiming Duan

, Tong Jia, Yong Yang, Guiyang Liu
, Jinbu Liu, Huxing Zhang, Qi Zhou, Ying Li, Gang Huang:
EagerLog: Active Learning Enhanced Retrieval Augmented Generation for Log-based Anomaly Detection. 1-5 - Zihang Guo, Zijian Li, Zhiyong Yang, Zhenping Mou, Jieru Guo:

DDNet: Exploring Dual Dependencies for Long-Term Time Series Forecasting. 1-5 - Gregor Meehan

, Johan Pauwels
:
Evaluating Contrastive Methodologies for Music Representation Learning Using Playlist Data. 1-5 - Zhanbo Feng, Zenan Ling, Xinyu Lu

, Ci Gong, Feng Zhou, Wugedele Bao, Jie Li, Fan Yang, Robert C. Qiu:
Textual and Visual Prompt Fusion for Image Editing via Step-Wise Alignment. 1-5 - Zhongquan Jian, Daihang Wu, Xiangjian Zeng, Junfeng Yao, Meihong Wang, Qingqiang Wu:

Curriculum Contrastive Learning for Aspect-based Sentiment Analysis. 1-5 - Yakov Gusakov, Osvaldo Simeone, Tirza Routtenberg, Nir Shlezinger:

Rapid Online Bayesian Learning for Deep Receivers. 1-5 - Manuele Rusci, Hugo Van hamme, Tinne Tuytelaars:

Self-Incremental Training for Personalized Voice Command Recognition in a Wireless Audio Sensor Network. 1-5 - Yuanyuan Wang, Hangting Chen, Dongchao Yang, Zhiyong Wu, Xixin Wu:

AudioComposer: Towards Fine-grained Audio Generation with Natural Language Descriptions. 1-5 - Chang Hu, Xuyang Teng, Wenpeng Xing, Han Chen, Chenhao Ye, Meng Han:

Distill To Detect: Amplifying Anomalies in Backdoor Models through Knowledge Distillation. 1-5 - Wenbo Yin, Congxuan Zhang

, Zhen Chen, Cheng Feng, Liyue Ge, Zige Wang:
ICIMG-Net: Inject Context Information to Motion Generation for Optical Flow Estimation. 1-5 - Qian Dong, Yuezhou Dong, Ke Qin, Guiduo Duan, Tao He:

Unbiased Multimodal Audio-to-Intent Recognition. 1-5 - Weidan Yan, Wenze Shao, Dengyin Zhang:

LNLFace: Enhanced Blind Face Restoration With Local and Non-local Lookups. 1-5 - Shuo Zhang

, Jiaming Huang
, Wenbing Tang, Lili Tian, Yuang Wei, Jing Liu:
Multi-modal Salient Object Detection via a Unified Diffusion Model. 1-5 - Simin Niu, Xun Liang, Sensen Zhang, Zhiyu Li, Xuan Zhang, Wu Bo, Hanyu Wang, Shichao Song, Mengwei Wang, Jiawei Yang:

When Sparse Graph Representation Learning Falls into Domain Shift: Feature Augmentation for Cross-Domain Graph Meta-Learning. 1-5 - Junqing Zhang, Wen Zhang, Jingdong Chen, Jacob Benesty:

Radiation and Directivity Analysis of a Vibrating Dome-Shaped Radiator Mounted on an Infinite Baffle. 1-5 - Lunke Fei, Jiacheng Yang, Wai-Keung Wong, Shuping Zhao, Anne Toomey, Jiehang Deng:

Palm-vein images reconstruction against adversarial attacks. 1-5 - Xinyu Wang, Haotian Jiang, Haolin Huang, Yu Fang, Mengjie Xu, Qian Wang:

DCIM-AVSR: Efficient Audio-Visual Speech Recognition via Dual Conformer Interaction Module. 1-5 - Pin-Jhao Chen, Woan-Shiuan Chien, Chi-Chun Lee:

Disentangle Heart Rate Signals for Improved Stress Detection. 1-5 - Tomohiro Nakatani, Naoyuki Kamo, Marc Delcroix, Shoko Araki:

A Hybrid Probabilistic-Deterministic Model Recursively Enhancing Speech. 1-5 - Xin-Yu Chen

, Yu-Ming Chen, Chin-Po Chen, Bo-Hao Su, Susan Shur-Fen Gau, Chi-Chun Lee:
SocialRecNet: A Multimodal LLM-Based Framework for Assessing Social Reciprocity in Autism Spectrum Disorder. 1-5 - Shreya G. Upadhyay, Ali N. Salman, Carlos Busso, Chi-Chun Lee:

Mouth Articulation-Based Anchoring for Improved Cross-Corpus Speech Emotion Recognition. 1-5 - Jing-Chun Wang, Woan-Shiuan Chien, Chi-Chun Lee:

A Dynamic Edge-Selection Mechanism in HRV Hypergraph Learning for Improved Stress Detection. 1-5 - Ziqi Liang, Xulong Zhang, Chang Liu, Xiaoyang Qu, Weifeng Zhao, Jianzong Wang:

CycleFlow: Leveraging Cycle Consistency in Flow Matching for Speaker Style Adaptation. 1-5 - Bo-Hao Su, Shreya G. Upadhyay, Chi-Chun Lee:

Toward Zero-Shot Speech Emotion Recognition Using LLMs in the Absence of Target Data. 1-5 - Jinkai Li, Jinghua Wang, Xin Wang

, Liang Yan, Yong Xu:
Component-wise Self-Correction Network for Human Motion Prediction. 1-5 - Sieun Hyeon

, Kyudan Jung, Nam-Joon Kim, Hyun Gon Ryu, Jaeyoung Do:
MathReader : Text-to-Speech for Mathematical Documents. 1-5 - Zheyuan Wang, Ziyao Meng

, Dezhi Wu, Haoran Liao, Tianyi Wang, Hao Shen, Jiajia Li, Haitao Song:
A Progressive Local Variance-guided Strategy for Improving Data Augmentation Reliability. 1-5 - Chun-Chieh Weng, Huan-Yu Chen, Jing-Tong Tzeng

, Ching-Heng Lin, Po-Chih Kuo, Chi-Chun Lee:
Mask Augmentation For Tumor Classification In Medical Images. 1-5 - An-Yan Chang, Jing-Tong Tzeng

, Huan-Yu Chen, Chun-Hsiang Huang, Edward Pei-Chuan Huang, Chi-Chun Lee:
Valve Token Masked Autoencoder for Missing Recordings on Cardiac Abnormality Classification. 1-5 - Hanfang Liang, Yizhuo Yang, Jinming Hu, Jianfei Yang, Fen Liu, Shenghai Yuan

:
Unsupervised UAV 3D Trajectories Estimation with Sparse Point Clouds. 1-5 - Wenlan Kuang, Zhixin Li:

Two-stream Semantic Alignment Networks for Multi-label Image Classification. 1-5 - Peilei Fu, Song Guo

:
FreeLesion: Synthetic Image-Mask Pairs for Fundus Lesion Segmentation via Curriculum Learning and Feature-Loss Guided Filtering. 1-5 - Jinpeng Xu, Chunna Zhao, Jing Yang, Yaqun Huang, Yaoyuan Yang, Lip Yee Por:

FDDSGCN: Fractional Decoupling Dynamic Spatiotemporal Graph Convolutional Network for Traffic Forecasting. 1-5 - Jiayi Zou, Gengyun Jia, Bing-Kun Bao:

Causal Debiasing for Visual Commonsense Reasoning. 1-5 - Wei Zhang, Yi Zhang, Li Zhu, Qianghuai Jia, Feijun Jiang, Hongcheng Guo, Zhoujun Li, Mengping Zhou:

ADC: Enhancing Function Calling Via Adversarial Datasets and Code Line-Level Feedback. 1-5 - Longtao Wang, Qingtian Zeng, Guiyuan Yuan, Hua Duan, Cheng Cheng, Kai Jiang:

Dynamic Graph Multi-granularity Attribute Scene Evolution Sequence Recommendation. 1-5 - Li Liu, Wentao Lei, Wenwu Wang:

Multi-Modal Rhythmic Generative Model for Chinese Cued Speech Gestures Generation. 1-5 - Wentao Lei, Li Liu:

Teaching Others Teaches Yourself: Semi-supervised Ensembled Pseudo-labeling Method for Image Classification. 1-5 - Yuhan Lin, Shengxiang Deng, Xudong Li:

Hypergradient-free Training for Deep Equilibrium Models. 1-5 - Longtao Wang, Qingtian Zeng, Guiyuan Yuan, Hua Duan, Cheng Cheng, Zilong Wang:

Heterogeneous Graph Dual-structure Optimization Based Attribute-aware for Recommendation. 1-5 - Vladimir Malenovsky, Tommy Vaillancourt, Milan Jelinek, Eleni Fotopoulou, Emmanuel Ravelli:

Cross-Talk Detection in the IVAS Stereo Codec Based on GCC-PHAT. 1-5 - Jian Zou

, Jian Xiao
, Qingyu Mao, Shuai Liu, Bohuai Xiao
, Yongsheng Liang:
Deep Receiver for Multi-Layer Data Transmission with Superimposed Pilots. 1-5 - Xun Liang, Simin Niu, Sensen Zhang, Zhiyu Li, Xuan Zhang, Bo Wu, Feiyu Xiong, Bo Tang, Hanyu Wang, Shichao Song, Mengwei Wang, Jiawei Yang:

Retrieval-Augmented Multilingual Citation Generation. 1-5 - Jianbo Zheng, Lida Huang, Tairui Zhang, Bin Jiang, Chao Yang:

Subdomain Uncertainty Optimization for Cross-Speed Fault Diagnosis. 1-5 - Zhiyuan Hu, Julián Tachella, Michael Unser, Jonathan Dong:

Structured Random Model for Fast and Robust Phase Retrieval. 1-5 - Eklavya Sarkar, Mathew Magimai-Doss

:
Comparing Self-Supervised Learning Models Pre-Trained on Human Speech and Animal Vocalizations for Bioacoustics Processing. 1-5 - Jiongge Zhang, Hang Dong, Long Tian, Xiongpeng He, Huimin Sun, Yuan Liu:

Sparse Bayesian Network for Fast Micro-Doppler Analysis. 1-5 - Yuan Chen, Chongju Zhong, Pinyi Huang, Wangyang Cai, Lei Wang:

Improving Micro-expression Recognition using Multi-sequence Driven Face Generation. 1-5 - Ning Jiang, Yanhong Liu, Dingheng Zeng, Yue Feng, Weihong Deng, Ying Li:

Device-aware Optical Adversarial Attack for a Portable Projector-camera System. 1-5 - Hillary Hauger, Philipp Scholl, Gitta Kutyniok:

Robust Identifiability for Symbolic Recovery of Differential Equations. 1-5 - You-Wei Luo, Zhi-Hao Li, Chuan-Xian Ren:

MPOT: Manifold Preserving Optimal Transport for Visual Recognition Under Severe Distribution Shift. 1-5 - Simon W. Penninga

, Hans Van Gorp, Ruud J. G. van Sloun
:
Deep Sylvester Posterior Inference for Adaptive Compressed Sensing in Ultrasound Imaging. 1-5 - Yuexuan Kong, Gabriel Meseguer-Brocal, Vincent Lostanlen, Mathieu Lagrange, Romain Hennequin:

S-KEY: Self-supervised Learning of Major and Minor Keys from Audio. 1-5 - Yu Lu, Ran Wang, Dian Ding, Han Zhang, Liyun Zhang, Lanqing Yang, Yi-Chao Chen, Guangtao Xue:

AMSER: Accelerate Mobile Speech Emotion Recognition with Signal Compression. 1-5 - Guoming Li

, Jian Yang, Shangsong Liang:
ERGNN: Spectral Graph Neural Network With Explicitly-Optimized Rational Graph Filters. 1-5 - Zhaofeng Lin, Naomi Harte:

Uncovering the Visual Contribution in Audio-Visual Speech Recognition. 1-5 - Yi Yuan, Xubo Liu, Haohe Liu, Mark D. Plumbley, Wenwu Wang:

FlowSep: Language-Queried Sound Separation with Rectified Flow Matching. 1-5 - Han Wang, Eduardo Pérez, Iris A. M. Huijben

, Hans Van Gorp, Ruud van Sloun
, Florian Römer:
Learning Structured Compressed Sensing with Automatic Resource Allocation. 1-5 - Yitong Cai, Chengwei Peng, Shu Li, Yuyi Liu, Hongfei Zhang, Binxing Fang:

TRACE: A Robust Framework for Malicious Traffic Detection with Noisy Labels. 1-5 - Shreya G. Upadhyay, Woan-Shiuan Chien, Chi-Chun Lee:

Is It Still Fair? Investigating Gender Fairness in Cross-Corpus Speech Emotion Recognition. 1-5 - Yi Yuan, Dongya Jia, Xiaobin Zhuang, Yuanzhe Chen, Zhuo Chen, Yuping Wang, Yuxuan Wang, Xubo Liu, Xiyuan Kang, Mark D. Plumbley, Wenwu Wang:

Sound-VECaps: Improving Audio Generation with Visually Enhanced Captions. 1-5 - Junding Zhang, Di Rao, Youssef Akoudad

, Wei Gao, Jie Chen:
Lightweight Self-Supervised Monocular Depth Estimation for All-Day Scenes Using Generative Adversarial Network. 1-5 - Martin Benjak

, Jörn Ostermann
:
Exploration of Sequence-wise Optimized Parameters for Low Complexity Enhancement Video Coding (LCEVC) on 4K Content. 1-5 - Yifan Zeng

, Peijia Zheng, Jian Li:
A Federated Learning Network Intrusion Detection System for Multiple Imbalances. 1-5 - Xiaoyu Bie, Xubo Liu, Gaël Richard:

Learning Source Disentanglement in Neural Audio Codec. 1-5 - Shuanglin Li, Zhijie Xie, Syed Mohsen Naqvi:

Efficient Long Speech Sequence Modelling for Time-Domain Depression Level Estimation. 1-5 - Yuzhe Weng, Haotian Wang, Tian Gao, Kewei Li, Shutong Niu, Jun Du:

Enhancing Multimodal Sentiment Analysis for Missing Modality through Self-Distillation and Unified Modality Cross-Attention. 1-5 - Yaowei Guo, Jiazheng Xing, Xiaojun Hou, Shuo Xin, Juntao Jiang, Demetri Terzopoulos, Chenfanfu Jiang, Yong Liu:

CFSum: A Transformer-Based Multi-Modal Video Summarization Framework With Coarse-Fine Fusion. 1-5 - Priyanka Maity, Monali Chakraborty, Suraj Srivastava, Aditya K. Jagannatham:

Hybrid Precoding in mmWave Multiuser MIMO Systems with Delay Alignment Modulation (DAM). 1-5 - Kyudan Jung, Nam-Joon Kim, Hyun Gon Ryu, Sieun Hyeon

, Seung Jun Lee, Hyuk-Jae Lee:
TeXBLEU: Automatic Metric for Evaluate LaTeX Format. 1-5 - Shuirong Cao, Ruoxi Cheng, Zhiqiang Wang:

AGR: Age Group fairness Reward for Bias Mitigation in LLMs. 1-5 - Xavier Juanola, Gloria Haro, Magdalena Fuentes:

A Critical Assessment of Visual Sound Source Localization Models Including Negative Audio. 1-5 - Yihao Wang, Zhongdi Wu, Joseph Nese, Akihito Kamata, Vedant Nilabh, Eric C. Larson:

A Unified Model for Oral Reading Fluency and Student Prosody. 1-5 - Hao Wen, Paul Krause, Lee Gillam:

Continuous-Discrete Differentiable Particle Filters for Irregular Time Series. 1-5 - Jian Cheng, Sam Nguyen

:
Speech Few-Shot Learning for Language Learners' Speech Recognition. 1-5 - Zezhong Jin, Youzhi Tu, Zhe Li, Zilong Huang, Chong-Xin Gan, Man-Wai Mak:

Denoising Student Features with Diffusion Models for Knowledge Distillation in Speaker Verification. 1-5 - Miao Jing, Vidhyasaharan Sethu, Beena Ahmed:

Improved Out-of-domain Detection in VAE Latent Spaces with Boundary-driven Regularisation. 1-5 - Miao Jing, Vidhyasaharan Sethu, Beena Ahmed:

Evidential Neural GPLDA: A Novel Approach to Quantify Prediction Uncertainty in Speaker Verification Systems. 1-5 - Ruxin Zheng, Shunqiao Sun, Hongshan Liu, Holger Caesar, Honglei Chen, Jian Li:

Advancing High-Resolution and Efficient Automotive Radar Imaging through Domain-Informed 1D Deep Learning. 1-5 - Kumari Nishu, Minsik Cho, Devang Naik:

SLiCK: Exploiting Subsequences for Length-Constrained Keyword Spotting. 1-5 - Carter Lyons, Raghu G. Raj, Margaret Cheney:

Unrolled Generative Compound Gaussian Network for Computer Tomography. 1-5 - Wentao Chao, Junli Zhao, Fuqing Duan, Guanghui Wang:

LFSRDiff: Light Field Image Super-Resolution via Diffusion Models. 1-5 - Mengmeng Li, Jinlong Tian, Yongqiang Zhao, Hongmei Li, Xudong Fang:

MVCBRec: Multi-View Contrastive Learning for Bundle Recommendation. 1-5 - Baolu Xue, Hanyuan Zheng, Jiale Zhang, Jiewen Liu, Bing Chen:

FedRPN: An Efficient Framework for Optimizing System Heterogeneity in Federated Learning. 1-5 - R. Gnana Praveen, Jahangir Alam:

LAVViT: Latent Audio-Visual Vision Transformers for Speaker Verification. 1-5 - Kai Yoshida, Masahiro Mizukami, Seiya Kawano, Canasai Kruengkrai, Hiroaki Sugiyama, Koichiro Yoshino:

Training Dialogue Systems by AI Feedback for Improving Overall Dialogue Impression. 1-5 - Xiao Zhang, Haodong Jing, Hui Chen, Yongqiang Ma, Nanning Zheng:

Refiner: Fine-grained Cross-modal Concepts Refinement for Compositional Zero-Shot Learning. 1-5 - Mei Qiu, Lauren Ann Christopher, Stanley Y. P. Chien, Lingxi Li:

Adaptive Aspect Ratios with Patch-Mixup-ViT-based Vehicle ReID. 1-5 - Xinrui Zhang, Yufeng Wang, Shuangkang Fang, Zesheng Wang, Huayu Zhang, Dacheng Qi

, Wenrui Ding:
ASFC-NeRF: Large-Scale Scene Rendering with Adaptive Sampling and Feature-aware Compression. 1-5 - Masato Mimura, Takafumi Moriya, Kohei Matsuura:

Advancing Streaming ASR with Chunk-wise Attention and Trans-chunk Selective State Spaces. 1-5 - Shoko Araki, Nobutaka Ito, Reinhold Haeb-Umbach, Gordon Wichern, Zhong-Qiu Wang, Yuki Mitsufuji:

30+ Years of Source Separation Research: Achievements and Future Challenges. 1-5 - Minghui Li, Lei Yu, Hewen Pan, Shengqing Hu, Longling Zhang, Shengshan Hu, Wei Wan

, Peijin Guo:
An Efficient Residual-based Low-dose PET Reconstruction with Spatial-Frequency Integration. 1-5 - Saidur R. Pavel, Yimin D. Zhang, Batu K. Chalise:

Massive MIMO System Partitioning for Efficient Hybrid Beamformer Optimization. 1-5 - Wenqi Ding, Yuanchao Liu, Zhongjie Wang

, Zheng Chu:
MPFL: A Decentralised Federated Learning Framework Based on Multi-Population Genetic Algorithm. 1-5 - Rui Xu, Huadong Liu, Yongcen Li, Xinchen Ye, Zhihui Wang, Hongkai Wang, Haojie Li, Yi Wang, Dingpin Huang, Fangyi Xu, Yi Gan, Yuan Tu, Hongjie Hu:

2.5D Top-K Ranked Multiple Instance Learning to Classify NSCLC PD-L1 Status on CT Images. 1-5 - Shiyu Teng, Jiaqing Liu, Hao Sun, Shurong Chai, Tomoko Tateyama, Lanfen Lin, Yen-Wei Chen:

Enhanced Multimodal Depression Detection With Emotion Prompts. 1-5 - Zehao Wang, Haobo Yue, Zhicheng Zhang, Da Mu, Jin Tang, Jianqin Yin:

MTDA-HSED: Mutual-Assistance Tuning and Dual-Branch Aggregating for Heterogeneous Sound Event Detection. 1-5 - Da Zhang, Jiazheng Sun

, Chenxiao Xia, Ruinan Ma, Jun Zheng:
ADD: A Detection Method for Image-Processing Adversarial Defenses. 1-5 - Yang Hu

, Jinxia Zhang, Kaihua Zhang, Yin Yuan, Jiale Huang, Zechao Zhan, Xin Wang:
Shifting Spotlight for Co-supervision: A Simple yet Efficient Single-branch Network to See Through Camouflage. 1-5 - Qingyun Xu, Lixiang Liu, Xin Zhou:

Fioma: Towards Open-Set Semi-Supervised Specific Emitter Identification. 1-5 - Zeren Zhang, Jo-Ku Cheng

, Jingyang Deng, Lu Tian, Jinwen Ma, Ziran Qin, Xiaokai Zhang, Na Zhu, Tuo Leng:
Diagram Formalization Enhanced Multi-Modal Geometry Problem Solver. 1-5 - Hui Sun, Yanfeng Ding, Liping Yi, Huidong Ma, Haonan Xie, Gang Wang, Xiaoguang Liu:

Adaptive Lossless Compression for Genomics Data by Multiple (s, k)-mer Encoding and XLSTM. 1-5 - Hongzhou Zhu

, Yuhao Qiu, Renjie Hu, Ang Li
, Shengji Zhu, Lei Wang:
Automatic Numbering and Pathological Recognition of Pediatric Teeth Using CNN and Attention Mechanisms. 1-5 - Ryandhimas E. Zezario, Sabato Marco Siniscalchi, Hsin-Min Wang

, Yu Tsao:
A Study on Zero-shot Non-intrusive Speech Assessment using Large Language Models. 1-5 - Tonmoy Rajkhowa, Amartya Roy Chowdhury, Achyut Mani Tripathi, Sanjeev Sharma, Om Jee Pandey:

Semi-Supervised Knowledge Distillation Framework towards Lightweight Large Language Model for Spoken Language Translation. 1-5 - Jiale Huang, Dehong Gao, Jinxia Zhang, Zechao Zhan, Yang Hu, Xin Wang:

FashionFAE: Fine-grained Attributes Enhanced Fashion Vision-Language Pre-training. 1-5 - Yongheng Zhang, Danfeng Yan:

Soft Knowledge Distillation with Multi-Dimensional Cross-Net Attention for Image Restoration Models Compression. 1-5 - Xudong Jin

, Jianfeng Xu, Kei Kawamura:
Fast inter-frame coding for dynamic meshes via supervoxel-based shape matching. 1-5 - Minglin Wu, Jing Xu, Xueyuan Chen, Helen Meng:

Integrating Potential Pronunciations for Enhanced Mispronunciation Detection and Diagnosis Ability in LLMs. 1-5 - Kaijie Xu, Xilin Dai

, Lin Qiu:
OPFormer: Real-Time Optimal Power Flow with CNN-Based Transformer. 1-5 - Jiale Yan, Bo Zhao, Chunyu Yang:

Stealthy Backdoor Attack against Video Recognition Models. 1-5 - Gallil Maimon, Amit Roth, Yossi Adi:

Salmon: A Suite for Acoustic Language Model Evaluation. 1-5 - Yuang Li, Xiaosong Qiao, Xiaofeng Zhao, Huan Zhao, Wei Tang, Min Zhang, Hao Yang:

Large Language Model Should Understand Pinyin for Chinese ASR Error Correction. 1-5 - Jihao Fan, Liang Huang, Jun Li

, Long Shi, Yuwen Qian:
Quantum Multi-Path Communication Protocol Based on Maximum Flow Theory. 1-5 - Jiabao Wei, Zhiyuan Ma:

DH-VTON: Deep Text-Driven Virtual Try-On via Hybrid Attention Learning. 1-5 - Lior Frankel, Shlomo E. Chazan, Jacob Goldberger:

Automatic Detection of Domain Shifts in Speech Enhancement Systems Using Confidence-Based Metrics. 1-5 - Mufeng Yao, Chao Liu, Lexu Xie, Mingmin Chi:

MambaRF: A Bi-directional Mamba Structure for Radio Frequency Signal Classification of Unmanned Aerial Vehicle. 1-5 - Zhenqiao Cheng, Chongjun Ouyang, Xingqi Zhang:

Movable Antenna Aided Physical Layer Security with No Eavesdropper CSI. 1-5 - Abhijay Ghildyal, Nabajeet Barman, Saman Zadtootaghaj:

Foundation Models Boost Low-Level Perceptual Similarity Metrics. 1-5 - Guanwen Feng, Yilin Zhang

, Yunan Li, Siyu Jin, Qiguang Miao:
Gaussian-Face: Talking Head Generation with Hybrid Density via 3D Gaussian Splatting. 1-5 - Xiaoqin Tang, Chaohui Liu, Guoqiang Xiao:

Advancing Paired Image-Mask Synthesis for Automated Nanoparticle Phenotyping. 1-5 - Shuo Zhang

, Jiaming Huang
, Yan Wu, Tao Hu, Wenbing Tang, Jing Liu:
Seg-diffusion: Text-to-Image Diffusion Model for Open-Vocabulary Semantic Segmentation. 1-5 - Jiajing Zhang, Jiamei Jiang, Linjing Li, Daniel Zeng:

A Novel Decision-Making Model for Playing Board Game Combining Planning and Opponent Behaviors. 1-5 - Derong Kong

, Huaizhang Liao, Jingyuan Xia:
A Cross-Modal Multi-Attitude Framework for the Generation of Space Target ISAR Images. 1-5 - Zakaria Aldeneh, Takuya Higuchi, Jee-Weon Jung, Li-Wei Chen, Stephen Shum, Ahmed Hussen Abdelaziz, Shinji Watanabe

, Tatiana Likhomanenko, Barry-John Theobald:
Speaker-IPL: Unsupervised Learning of Speaker Characteristics with i-Vector based Pseudo-Labels. 1-5 - Guanwen Feng, Yilin Zhang

, Yunan Li, An Liu, Qiguang Miao:
Sign-Mamba: Advanced Mamba-Based Sign Language Generation. 1-5 - Yuanfeng Xu, Yuhao Chen, Zhongzhan Huang, Zijian He, Guangrun Wang, Liang Lin:

Anima2: Cross-Species Animal Animation through Image-to-Video Synthesis with Subject Alignment. 1-5 - Jie Zhang, Chengqian Jiang, Yichi Wang, Haoyin Yan, Miao Sun:

Learning-Based Utility Estimation with Application to Speech Enhancement of a Moving Speaker. 1-5 - Jaehun Kim, Ji-Hoon Kim, Yeunju Choi, Tan Dat Nguyen, Seongkyu Mun, Joon Son Chung:

AdaptVC: High Quality Voice Conversion with Adaptive Learning. 1-5 - Long Zeng, Mingwei Zhu, Kaigui Wu, Zefang Li:

Medical Image Segmentation via Sparse Coding Decoder. 1-5 - Guanwen Feng, Siyu Jin, Zhihao Qian, Yunan Li, Qiguang Miao:

KAN-Face: Efficient Resource Usage and Precision Lip-Sync in Talking Head Generation. 1-5 - Zakaria Aldeneh, Vimal Thilak, Takuya Higuchi, Barry-John Theobald, Tatiana Likhomanenko:

Towards Automatic Assessment of Self-Supervised Speech Models using Rank. 1-5 - Xiaoqiang Zhang, Ying Chen, Guangyao Li, Buwen Liang:

PEDE: Enhance Multi-modal Sarcasm Detection in Videos via Prompted Emotion Distributions. 1-5 - Xin Sun, Boqian Liu, Xinchen Ye, Rui Xu, Haojie Li:

Self-Supervised Monocular Depth Estimation from Videos via Pose-Adaptive Reconstruction. 1-5 - Xiaoyun Han, Jun Wang:

R2-SAC: A Relaxation-and-Refinement SAC Agent for Stock Portfolio Trading. 1-5 - Boyan Gu, Sheng Zheng, Xiaojun Mao, Zhonglei Wang:

Transfer Learning via Functional Balancing in Reproducing Kernel Hilbert Spaces. 1-5 - Hanyu Meng

, Jeroen Breebaart, Jeremy Stoddard, Vidhyasaharan Sethu, Eliathamby Ambikairajah:
Blind Estimation of Sub-band Acoustic Parameters from Ambisonics Recordings using Spectro-Spatial Covariance Features. 1-5 - Xin Sun, Boqian Liu, Xinchen Ye, Guanqiao Chen, Rui Xu, Haojie Li:

Few-shot Image Classification based on Attribute Prediction and Selection. 1-5 - Zhe Xu, Zhipei Lei, Dingyong Gou, Yanlin Wu, Liwen Zhang, Cong Li:

Edge-aware Laplacian Pyramid Network for Efficient Image Deblurring. 1-5 - Botao Sun, Ignacio Roldan

, Francesco Fioranelli:
Automatic Labelling & Semantic Segmentation with 4D Radar Tensors. 1-5 - Jianqi Zhang, Mengxuan Wang, Jingyao Wang, Lingyu Si, Changwen Zheng, Fanjiang Xu:

Less Yet Robust: Crucial Region Selection for Scene Recognition. 1-5 - Kexin Zhang, Yanqing Xu, Tsung-Hui Chang:

Networked ISAC Beamforming Design with Capacity-Limited Fronthaul Links. 1-5 - Xinchen Ye, Xia Mao, Rui Xu, Haojie Li:

Mining Scene Structural Guidance for Thermal Images in Self-Supervised Monocular Depth Estimation. 1-5 - Yujie Lin, Jingyao Liu, Yan Gao, Ante Wang, Jinsong Su:

A Dual-Perspective Metaphor Detection Framework Using Large Language Models. 1-5 - Kanoko Goto, Takumi Karasawa, Takumi Hirose, Rei Kawakami, Nakamasa Inoue:

Multi-Point Positional Insertion Tuning for Small Object Detection. 1-5 - Xinchen Ye, Aokai Zhang

, Rui Xu, Haojie Li:
Delving into Transformer-based Network Architecture for Guided Depth Super-Resolution. 1-5 - Hang Yin, Zhifeng Lin, Xin Liu, Bin Sun, Kan Li:

Do Multimodal Language Models Really Understand Direction? A Benchmark for Compass Direction Reasoning. 1-5 - Shaik Basheeruddin Shah, Nazar T. Ali, Ahmed Altunaiji, Vijay Kumar Chakka

, Mohamed I. AlHajri:
Complex Coprime Frequency Sum Based Signal Representation for Period Estimation. 1-5 - Lulin Li, Ben Chen, Xuechao Zou, Junliang Xing, Pin Tao:

UV-Mamba: A DCN-Enhanced State Space Model for Urban Village Boundary Identification in High-Resolution Remote Sensing Images. 1-5 - Pei-Sze Tan

, Sailaja Rajanala, Yee-Fan Tan, Arghya Pal
, Chun-Ling Tan, Raphaël C.-W. Phan, Huey Fang Ong:
Post-Hoc Adversarial Stickers Against Micro-Expression Leakage. 1-5 - Satwinder Singh, Qianli Wang

, Zihan Zhong, Clarion Mendes, Mark Hasegawa-Johnson, Waleed Abdulla, Seyed Reza Shahamiri
:
Robust Cross-Etiology and Speaker-Independent Dysarthric Speech Recognition. 1-5 - Aamna Zahid Piracha, Bernhard Rinner:

Virtual Leader-based Safe Formation-Switching Control for Dense Environments. 1-5 - Jiahui Pan, Hui Zhang, Xueliang Zhang:

Enhancing Multi-Channel Speech with Limited Microphones via Spherical Harmonic Transform. 1-5 - Zhiyu Li, Xinpei Zhao, Jing Wang, Xinyuan Qian, Xiang Xie:

M2PAIR: A High-Quality Acoustic Impulse Response Computation Model. 1-5 - Wei Xiao, Weibei Dou, Wenlong Wang, Gaoxiong Yi, Jingxin Li, Shidong Shang:

AVS3P10 Standard for Real-time Speech Coding. 1-5 - Jinfeng Wu, Wu Shi:

Improving GAN Performance Using Confidence-Aware Discrimination. 1-5 - Tiesunlong Shen, Jin Wang, Xuejie Zhang, Erik Cambria:

Hop-level Direct Preference Optimization for Knowledge Graph Reasoning with Trees. 1-5 - Xue Wen:

Windowed Quantum Phase Estimation: Signal Processing Approach to a Quantum Algorithm. 1-5 - Jiafeng Qiu

, Huadan Wang, Peihan Yao, Gang Shen
:
BCG data imputation via multimodal feature alignment and semantic sequence prediction. 1-5 - Jiayuan Li, Lei Cui, Jie Zhang, Haiqiang Fei, Yu Chen, Hongsong Zhu:

Steering Large Language Models for Vulnerability Detection. 1-5 - The Chuong Chu, Vu Tuan Dat Pham, Trung Kien Dao, Ngoc Hoang Nguyen, Steven Q. H. Truong:

AdaCS: Adaptive Normalization for Enhanced Code-Switching ASR. 1-5 - Ziyu Tang, Xiren Zhou, Ao Chen, Shikang Liu, Chuyang Wei, Huanhuan Chen:

Inside and Inside: Efficient Anomaly Detection by Fully Capturing the Detailed Dynamics. 1-5 - Sandrine Tornay, Mathew Magimai-Doss

:
Towards Dynamic Skeleton-based Handshape Subunits for Sign Language Assessment. 1-5 - Jianfei Wu, Xubin Wang, Weijia Jia:

Enhancing Text Annotation Through Rationale-Driven Collaborative Few-Shot Prompting. 1-5 - Marco Manzoni

, Francesco Linsalata, Maurizio Magarini, Stefano Tebaldini:
COSMIC waveforms for Integrated Communication and Imaging. 1-5 - Taichi Uchida, Yoshihiro Kanamori, Yuki Endo:

3D View Optimization for Improving Image Aesthetics. 1-5 - Yanbin He

, Geethu Joseph:
Kronecker-structured Sparse Vector Recovery with Application to IRS-MIMO Channel Estimation. 1-5 - Xia Jiang, Linglingzhi Zhu

, Taoli Zheng, Anthony Man-Cho So:
Single-Loop Variance-Reduced Stochastic Algorithm for Nonconvex-Concave Minimax Optimization. 1-5 - Yi Pan, Yujia Zhang, Michael Kampffmeyer, Xiaoguang Zhao:

RefCap: Zero-shot Video Corpus Moment Retrieval Based on Refined Dense Video Captioning. 1-5 - Yichen Zeng, Jilu Jin, Gongping Huang, Jingdong Chen, Jacob Benesty:

DOA Estimation Based on Enhanced SRP-MVDR Using Kronecker Product Decomposition for Large Rectangular Microphone Arrays. 1-5 - Weihao Zhang, Xin Xia, Maopeng Li, Yunbo Zhao:

Vision Mamba-Based Approach for Incomplete Boundary Document Image Rectification. 1-5 - Yi Pan, Yujia Zhang, Xiaoguang Zhao:

FAWL: Weakly-Supervised Video Corpus Moment Retrieval with Frame-Wise Auxiliary Alignment and Weighted Contrastive Learning. 1-5 - Yutong Wang, Xiaofeng Meng, Minhao Zou

, Siyang Leng
:
Multi-hop Self-augmented Graph Contrastive Learning for Node Classification. 1-5 - Limeng Zhang, Zenghui Zhang, Juanping Wu, Weiwei Guo, Tao Zhang, Wenxian Yu:

MMCD: Memory-Based Multimodal Change Detection. 1-5 - Louis Hémadou, Héléne Vorobieva, Ewa Kijak, Frédéric Jurie:

Adapting Without Seeing: Text-Aided Domain Adaptation for Adapting CLIP-like Models to Novel Domains. 1-5 - Jinlong Wang, Xiongxin Tang, Fanjiang Xu, Hanxiang Yang:

Amplitude-Guidance Low-Light Image Enhancement with Frequency-based Channel Attention. 1-5 - Xinyuan Zheng, Xiaojie Li, Canghong Shi, Jia He, Zhan ao Huang, Xian Zhang, Imran Mumtaz:

CAPAST: Content Affinity Preserved Arbitrary Style Transfer. 1-5 - Tingxuan Chen, Liu Yang, Zidong Wang, Guohui Li, Jun Long:

Enhancing Session-Based Recommendation with Hypergraph Motifs and Contrastive Learning. 1-5 - Yaoyun Zhang, Xuenan Xu, Mengyue Wu:

Smooth-Foley: Creating Continuous Sound for Video-to-Audio Generation Under Semantic Guidance. 1-5 - Xiaolei Zhang, Zhaoyu Chen, Guangpu Chen, Xinyu Feng, Qingni Shen, Zhonghai Wu:

RPPFL: Robust and Privacy-Preserving Federated Learning via Trusted Execution Environments. 1-5 - Xupei Zhang, Hanlin Qin, Jingjing Li, Jinni Geng, Zihan Gao, Yue Yu:

Spatial-Frequency Information Interaction Diffusion for SAR Colorization. 1-5 - Xinkai Du, Quanjie Han, Chao Lv, Yan Liu, Yalin Sun, Hao Shu, Hongbo Shan, Maosong Sun:

Improving Generated and Retrieved Knowledge Combination Through Zero-shot Generation. 1-5 - Rebecca Clain, Eduardo Fernandes Montesuma, Fred Ngolè Mboula:

Decentralized Federated Dataset Dictionary Learning for Multi-Source Domain Adaptation. 1-5 - Arijit Biswas, Guanxin Jiang:

RF-GML: Reference-Free Generative Machine Listener. 1-5 - Junhan Wang, Zhangming Wu, Zhuoyue Wang, Lu Dong:

SBA: A Swift and Stealthy Backdoor Attack Framework for Federated Learning. 1-5 - Ünal Ege Gaznepoglu, Nils Peters:

Why disentanglement-based speaker anonymization systems fail at preserving emotions? 1-5 - Qing Chang, Wei Dai, Zhihao Shuai

, Limin Yu, Yutao Yue
:
Spatial-Temporal Perception with Causal Inference for Naturalistic Driving Action Recognition. 1-5 - Fasih Haider, Raven Hickson, Peter Kind, Saturnino Luz:

Automatic recognition of rodent call types using deep supervectors. 1-5 - Maurice Kuschel, Amr Alkhatib, Tanuj Hasija, Henrik Boström:

Explaining Representations in Correlation-based Deep Multiview Representation Learning. 1-5 - Shilong Zhang, Yu Song, Shubin Wang:

FA-GAN: Defense Against Adversarial Attacks in Automatic Modulation Recognition. 1-5 - Samrat Mukherjee, Tanuj Sur, Saurish Seksaria, Subhasis Chaudhuri, Gemma Roig, Biplab Banerjee:

UIDAPLE: Unsupervised Incremental Domain Adaptation through Adaptive Prompt Learning. 1-5 - Cheng Yang, Saikat Chatterjee, Tobias J. Oechtering:

Enhancing Network Calibration for Low-Cost Gas Sensor Networks Through Adaptive Similarity Search. 1-5 - Hong Liu, Xiuxiu Qiu, Yiming Shi, Zelin Zang:

USD: Unsupervised Soft Contrastive Learning for Fault Detection in Multivariate Time Series. 1-5 - Riran Cheng, Xupeng Wang

, Ferdous Sohel, Hang Lei:
RSM: Refined Saliency Map For Explainable 3D Object Tracking. 1-5 - Zhen Li, Jibin Wang, Zhuo Chen, Kun Wu, Meng Ai, Leike An, Liqiang Wang, Haoxuan Li:

Unifying Within and Across: Intra-Modality Multi-View Fusion and Inter-Modality Alignment for Knowledge Graph Completion. 1-5 - Qiuyu Liang

, Weihua Wang, Cunda Wang
, Feilong Bao, Jie Yu:
Hyperbolic Multimodal Knowledge Graph Embedding. 1-5 - Chen Zou, Qingsen Ma, Jia Wang

, Ming Lu, Shanghang Zhang, Zhaofeng He:
GaussianEnhancer: A General Rendering Enhancer for Gaussian Splatting. 1-5 - Shengkui Zhao, Zexu Pan, Kun Zhou, Yukun Ma, Chong Zhang, Bin Ma:

Conditional Latent Diffusion-Based Speech Enhancement via Dual Context Learning. 1-5 - Jinshan Zeng, Yiyang Yuan, Yan Zhang, Yefei Wang

, Xijia Wang:
Learning Stroke-Order Dynamics in Few-Shot Font Generation via Sequential Awareness. 1-5 - Erlei Zhang, Wenxuan Yuan, Xiangsen Liu:

ChannelMixer: A Hybrid CNN-Transformer Framework for Enhanced Multivariate Long-Term Time Series Forecasting. 1-5 - Jiahao Dong, Zuo Zuo, Zongze Wu, Meiqin Liu:

A Scale-Adaptive and Background-Robust Method for Surface Defect Detection. 1-5 - Shijie Nie, Ziqiang Shi, Rujie Liu, Song Guo, Meng Zhang, Mengjiao Wang, Kazuki Osamura, Lina Septiana, Narishige Abe:

Attribute Conditional Diffusion-Augmented Person Re-Identification. 1-5 - Shengkui Zhao, Kun Zhou, Zexu Pan, Yukun Ma, Chong Zhang

, Bin Ma:
HiFi-SR: A Unified Generative Transformer-Convolutional Adversarial Network for High-Fidelity Speech Super-Resolution. 1-5 - Sapta Girish Neelam:

Distributed Interference Alignment Precoding and Detection for MU-MIMO OTSM Downlink in Time-Varying Channels. 1-5 - Yu Xi, Haoyu Li, Xiaoyu Gu, Hao Li, Yidi Jiang, Kai Yu:

Streaming Keyword Spotting Boosted by Cross-layer Discrimination Consistency. 1-5 - Hanene F. Z. Brachemi Meftah, Wassim Hamidouche, Sid Ahmed Fezza, Olivier Déforges, Kassem Kallas

:
Energy Backdoor Attack to Deep Neural Networks. 1-5 - Ryosuke Watanabe, Keisuke Nonaka, Eduardo Pavez, Tatsuya Kobayashi, Antonio Ortega:

No-Reference Point Cloud Quality Assessment Based on Graph Signal Variation. 1-5 - Pengfei Qi, Yifei Zhang, Wenqiang Li, Youwen Hu, Kunlong Bai:

An Attribute-Enriched Dataset and Auto-Annotated Pipeline for Open Detection. 1-5 - Tao Chen, Qun Niu, Ning Liu:

NeRF-VLD: Efficient Visual Landmark Database Construction via Scene Constraints. 1-5 - Lars Nockenberg, Wenxuan Wei, Mariam Navai, Eckehard G. Steinbach:

Deep Learning-Based Perceptual Vibrotactile Codec with Rate Scalability. 1-5 - Liangshan Zhu, Xing Wu, Chengliang Wang, Haidong Wang:

SAM Adaptation with Refocused Attention and Diverse Prompts for Medical Image Segmentation. 1-5 - Nan Wang

, Xiaohan Yan, Xiaowei Song, Zhicheng Wang:
Semantic-Guided Gaussian Splatting with Deferred Rendering. 1-5 - Shuai Liu, Jianyu Ding, Jie Yang, Wei Liu:

Mixed Gaussian Splatting for High-Quality Rendering and Reconstruction. 1-5 - Shulei Qiu, Wanqi Yang, Ming Yang:

Hybrid Feature Collaborative Reconstruction Network for Few-Shot Fine-Grained Image Classification. 1-5 - Ziqi Zhou, Weize Quan, Zhaojin Lu, Dong-Ming Yan:

Diffused Poses and Distilled Expressions for Controllable Audio-driven Talking Face Generation. 1-5 - Ziyang Chen, Dongqin Liu, Jiao Dai, Songlin Hu:

Boosting Open-Vocabulary Object Detection Performance via Class-Agnostic Pseudo-Labels and MultiModal Hybrid Knowledge. 1-5 - Peipei Zhao, Jiaxuan Wang, Zixiang Lu, Qiguang Miao:

MLNet: Mutual Learning Network to Improve Self-Supervised Representation for Fine-Grained Visual Recognition. 1-5 - Tianteng Gu, Bei Liu

, Yanmin Qian:
Efficient Pruning for Large-Scale Seq2Seq Speech Models without Back-Propagation. 1-5 - Malek Khammassi, Virginia Bordignon, Vincenzo Matta, Ali H. Sayed:

Fundamental Social Learning Scaling Law for Tracking Hidden Markov Models. 1-5 - Yiqin Luo, Tianlong Gu, Fengrui Hao, Liang Chang:

BID-Net: Balanced Incremental Distillation Network for Fair Dermatological Disease Diagnosis. 1-5 - Yuxi Zhou, Tao Feng, Yazhuo Gao, Yixuan Wu, Lin Yang, Jiaqi Lin:

Intrusion Detection for Intelligent Transportation Systems: A lightweight interpretable model. 1-5 - Bishal Ghosh, Emma Li, Tanaya Guha:

Active Listener: Continuous Generation of Listener's Head Motion Response in Dyadic Interactions. 1-5 - Sheng Liu, Shiming Zhu, Huilong Pi, Yunchuan Qin, Zhuo Tang, Ruihui Li:

Single-View Reconstruction via Decoupled 3D Gaussian Splatting. 1-5 - Yubiao Yue, Jun Xue, Haihuang Liang, Zhenzhang Li, Yufeng Wang:

MpoxMamba: A Grouped Mamba-based Lightweight Hybrid Network for Mpox Detection. 1-5 - Chen Liu, Xiaohui Rong:

Automated Graph Attention Network for Heterogeneous Entity Resolution. 1-5 - Jingjing Tang, Erica Cooper, Xin Wang, Junichi Yamagishi, György Fazekas:

Towards An Integrated Approach for Expressive Piano Performance Synthesis from Music Scores. 1-5 - Xuyao Deng, Tianjiao Wan, Kele Xu, Tian Gao, Peng Qiao, Dawei Feng, Yong Dou:

Scaling Bioacoustic Signal Pre-training with Million Samples Via Mask-Modeling. 1-5 - Yudong He, Baeck Hyun Woo, Richard Hau Yue So:

A Novel Weighted Sparse Component Analysis for Underdetermined Blind Speech Separation. 1-5 - Yingying Fan, Kaisiyuan Wang, Hang Zhou, Shengyi He, Yu Wu:

RQTalker: Speech-driven 3D Facial Animation via Region-aware Vector Quantization. 1-5 - Haoxuan Wang, Qingdong He, Jinlong Peng, Hao Yang, Mingmin Chi, Yabiao Wang

:
Mamba-YOLO-World: Marrying YOLO-World with Mamba for Open-Vocabulary Detection. 1-5 - Yaqin Li, Chenjian Sun, Yihong Dong:

A Novel Audio-Visual Multimodal Semi-Supervised Model Based on Graph Neural Networks for Depression Detection. 1-5 - Fang Nan, Feng Tian, Ni Zhang, Nian Liu, Haonan Miao, Guang Dai, Mengmeng Wang:

Density-aware and Depth-aware Visual Representation for Zero-Shot Object Counting. 1-5 - Yi Zhu, Xiangyang Liu, Tianqi Pang, Xuncan Xiao, Xiaofan Zhang, Chenyou Fan:

Hybrid Feature Fusion for Enhancing Medical Document Embedding. 1-5 - Wenlong Dong, Qing Zhu, Qirong Mao:

Key Clues Guided Video Character Social Relationship Recognition Enhanced by LLM. 1-5 - Ensieh Khazaei, Bilal Taha, Alireza Esmaeilzehi, Dimitrios Hatzinakos:

OSR: Toward Developing Efficient Federated Learning-based Human Activity Recognition using Optimal Server Representations. 1-5 - Jiatao Chen, Xing Tang

, Tianming Xie, Jing Wang, Wenjing Dong, Bing Shi:
MusicMamba: A Dual-Feature Modeling Approach for Generating Chinese Traditional Music with Modal Precision. 1-5 - Tianyu Fang, Nhan Thanh Nguyen, Markku J. Juntti:

Low-Complexity Cramér-Rao Lower Bound and Sum Rate Optimization in ISAC Systems. 1-5 - Ru Li, Tingting Chai, Samaneh Kouchaki, David A. Clifton, Yang Yang:

Microtitre Plate Image Augmentation with Generative Adversarial Networks. 1-5 - Artem Dementyev, Chandan K. A. Reddy, Scott Wisdom, Navin Chatlani, John R. Hershey, Richard F. Lyon:

Towards Sub-millisecond Latency Real-Time Speech Enhancement Models on Hearables. 1-5 - Tiago Fernandes Tavares, Fábio José Ayres

, Zhepei Wang, Paris Smaragdis:
On Class Separability Pitfalls In Audio-Text Contrastive Zero-Shot Learning. 1-5 - Wanlong Liu, Yichen Xiao, Dingyi Zeng, Hongyang Zhao, Wenyu Chen, Malu Zhang:

Mixed-Precision Graph Neural Quantization for Low Bit Large Language Models. 1-5 - Wenzhi Guo, Lijun Chen:

Proud-SLAM: Neural Point-based Hybrid RGBD Monocular Dense SLAM. 1-5 - Ronghao Yu, Yun Liu

, Xiyue Bai, Rui Yang, Yingna Wu:
3DSignDiff: Towards 3D Sign Language Gesture Generation. 1-5 - Fang Nan, Ni Zhang, Qidong Liu, Wei Jing, Guang Dai, Yan Chen, Feng Tian:

Exploring Triple Knowledge Cues for Zero-Shot Human-Object Interaction Detection. 1-5 - Pei Zhang

, Dong Wang, Chanyue Wu, Jing Yang, Lei Kang, Zongwen Bai, Ying Li, Qiang Shen:
HyperDiff: Masked Diffusion Model with High-efficient Transformer for Hyperspectral Image Cross-Scene Classification. 1-5 - Wangdong Guo, Qing Zhu, Qirong Mao:

Joint Multi-Scale Contextual and Noise Suppression for Group Emotion Recognition. 1-5 - Yongsheng Han, Alberto Natali, Geert Leus:

Graph Topology Identification Based on Covariance Matching. 1-5 - Prabhat Kumar, Chandra R. Murthy:

Multiscale Adaptive Channel Estimation for OTFS. 1-5 - Yiming Liu, Rui Song, Lida Shi, Ling Gao, Hao Xu:

DGJA: Dependency Graph-enhanced Joint Attention Structure for Multimodal Sarcasm Detection. 1-5 - Xuezhi Xiang, Zhushan Ma, Lei Zhang, Denis Ombati, Himaloy Himu, Xiantong Zhen:

LKA-ReID: Vehicle Re-Identification with Large Kernel Attention. 1-5 - Zijia An, Boyu Diao, Libo Huang, Ruiqi Liu, Zhulin An, Yongjun Xu:

IOR: Inversed Objects Replay for Incremental Object Detection. 1-5 - Cheng Zhong, Shaofeng Zhang, Feng Zhu, Rui Zhao, Xiaokang Yang, Junchi Yan:

Graph Pooling via Dropping Task-Irrelevant Nodes. 1-5 - Xiaowen Cai, Daizong Liu, Runwei Guan, Pan Zhou:

Imperceptible Transfer Attack on Large Vision-Language Models. 1-5 - Shutong Duan, Jingyun Yang, Yang Tan, Guoqing Zhang, Yang Li, Xiao-Ping Zhang:

Transfer Risk Map: Mitigating Pixel-level Negative Transfer in Medical Segmentation. 1-5 - Simon Rouard, Robin San Roman, Yossi Adi, Axel Roebel:

MusicGen-Stem: Multi-stem music generation and edition through autoregressive modeling. 1-5 - Junlin Wu, Ning Zhang, Cheng Zhong, Boan Chen, Huanxi Liu, Junchi Yan:

Melody Structure Transfer Network: Generating Music with Separable Self-Attention. 1-5 - Keke Tang, Weiyao Ke, Weilong Peng, Xiaofei Wang, Ziyong Du, Zhize Wu, Peican Zhu, Zhihong Tian:

Imperceptible Adversarial Attacks on Point Clouds Guided by Point-to-Surface Field. 1-5 - Jiannan Chen, Zhizhuo Jiang, Xueqian Wang, Yaowen Li, Huajie Wang

, Yu Liu:
A Novel Split Deep Unfolding Transformer for Pan-Sharpening. 1-5 - Bo Hu, Wei Wang, Chunyi Li

, Lihuo He, Leida Li
, Xinbo Gao:
A Multi-annotated and Multi-modal Dataset for Wide-angle Video Quality Assessment. 1-5 - Tianyi Shi, Siyang Zheng, Zhu Meng, Zhe Cui, Jin Huang, Changrui Ren, Bo Zhang, Zhicheng Zhao:

TELL ME: Tackle Electrocardiogram with Large Language Model Effectively. 1-5 - Cheng Zhong, Junlin Wu, Ziming Feng, Boan Chen, Junchi Yan:

Towards Green VAE: A Light Pixel-weighting Technique to Enhance Variational AutoEncoder. 1-5 - Kai Guo, Seungwon Choi, Jongseong Choi, Lae-Hoon Kim:

A Practical Gated Recurrent Transformer Network Incorporating Multiple Fusions for Video Denoising. 1-5 - Xiaotao Wu, Zhaoxin Fan, Huiguang He, Dinggang Shen:

ThicknessVAE: Learning a Lateral Prior for Clothed Human Body Reconstruction. 1-5 - Hao Ma

, Zhiyuan Peng, Xu Li, Yukai Li, Mingjie Shao, Qiuqiang Kong, Ju Liu:
Language-Queried Target Sound Extraction Without Parallel Training Data. 1-5 


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID