


ICASSP 2022: Virtual and Singapore
- IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2022, Virtual and Singapore, 23-27 May 2022. IEEE 2022, ISBN 978-1-6654-0541-6
- Shibo Zhang, Ebrahim Nemati, Minh Dinh, Nathan Folkman, Tousif Ahmed, Md. Mahbubur Rahman, Jilong Kuang, Nabil Alshurafa, Alex Gao:
Coughtrigger: Earbuds IMU Based Cough Detection Activator Using An Energy-Efficient Sensitivity-Prioritized Time Series Classifier. 1-5 - Hoang Truong, Alessandro Montanari, Fahim Kawsar:
Non-Invasive Blood Pressure Monitoring with Multi-Modal In-Ear Sensing. 6-10 - Xiaolu Zeng, Beibei Wang, Chenshu Wu, Sai Deepika Regani, K. J. Ray Liu:
Intelligent Wi-Fi Based Child Presence Detection System. 11-15 - Wenxuan Li, Dongheng Zhang, Yadong Li, Zhi Wu, Jinbo Chen, Dong Zhang, Yang Hu, Qibin Sun, Yan Chen:
Real-Time Fall Detection Using mmWave Radar. 16-20 - Dae Yon Hwang, Pai Chet Ng, Yuanhao Yu, Yang Wang, Petros Spachos, Dimitrios Hatzinakos, Konstantinos N. Plataniotis:
Hierarchical Deep Learning Model with Inertial and Physiological Sensors Fusion for Wearable-Based Human Activity Recognition. 21-25 - Yu-Chen Lin, Tsun-An Hsieh, Kuo-Hsuan Hung, Cheng Yu, Harinath Garudadri, Yu Tsao, Tei-Wei Kuo:
Speech Recovery For Real-World Self-Powered Intermittent Devices. 26-30 - Ai Okano, Yoshinobu Kajikawa:
Phase Control of Parametric Array Loudspeaker by Optimizing Sideband Weights. 31-35 - Florian Scalvini, Camille Bordeau, Maxime Ambard, Cyrille Migniot, Julien Dubois:
Low-Latency Human-Computer Auditory Interface Based on Real-Time Vision Analysis. 36-40 - Akihiko Sugiyama:
Robust Adaptive Noise Canceller Algorithm with SNR-Based Stepsize Control and Noise-Path Gain Compensation. 41-45 - Chao Liu, Linlin Gao, Ruobing Jiang:
Neartracker: Acoustic 2-D Target Tracking with Nearby Reflector in SISO System. 46-50 - Harinarayanan. E. V, Sachin Ghanekar:
An Efficient Method For Generic DSP Implementation Of Dilated Convolution. 51-55 - Yu-Shan Tai, Chieh-Fang Teng, Cheng-Yang Chang, An-Yeu Andy Wu:
Compression-Aware Projection with Greedy Dimension Reduction for Convolutional Neural Network Activations. 56-60 - Simon Narduzzi, Siavash Arjomand Bigdeli, Shih-Chii Liu, L. Andrea Dunbar:
Optimizing The Consumption Of Spiking Neural Networks With Activity Regularization. 61-65 - Sujan Kumar Gonugondla, Naresh R. Shanbhag:
IMPQ: Reduced Complexity Neural Networks Via Granular Precision Assignment. 66-70 - Youngeun Kim, Hyoungseob Park, Abhishek Moitra, Abhiroop Bhattacharjee, Yeshwanth Venkatesha, Priyadarshini Panda:
Rate Coding Or Direct Coding: Which One Is Better For Accurate, Robust, And Energy-Efficient Spiking Neural Networks? 71-75 - Linghao Song, Yuze Chi, Jason Cong:
PYXIS: An Open-Source Performance Dataset Of Sparse Accelerators. 76-80 - Zuozhou Pan, Zhiping Lin, Yuanjin Zheng, Zong Meng:
Fast Fault Diagnosis Method Of Rolling Bearings In Multi-Sensor Measurement Enviroment. 81-85 - Diaa Badawi, Ishaan Bassi, Sule Ozev, Ahmet Enis Çetin:
Detecting Anomaly in Chemical Sensors via Regularized Contrastive Learning. 86-90 - Cheng Tang, Junkai Ji, Qiuzhen Lin, Yan Zhou:
Evolutionary Neural Architecture Design of Liquid State Machine for Image Classification. 91-95 - Huy Phan, Yi Xie, Jian Liu, Yingying Chen, Bo Yuan:
Invisible and Efficient Backdoor Attacks for Compressed Deep Neural Networks. 96-100 - Cheng-Hung Lo, Pei-Yun Tsai:
Tensor-Based Orthogonal Matching Pursuit with Phase Rotation for Channel Estimation In Hybrid Beamforming MIMO-OFDM Systems. 101-105 - Darius Petermann, Minje Kim:
Spain-Net: Spatially-Informed Stereophonic Music Source Separation. 106-110 - Siyuan Yuan, Zhepei Wang, Umut Isik, Ritwik Giri, Jean-Marc Valin, Michael M. Goodwin, Arvindh Krishnaswamy:
Improved Singing Voice Separation with Chromagram-Based Pitch-Aware Remixing. 111-115 - Haici Yang, Shivani Firodiya, Nicholas J. Bryan, Minje Kim:
Don't Separate, Learn To Remix: End-To-End Neural Remixing With Joint Optimization. 116-120 - Yu Wang, Daniel Stoller, Rachel M. Bittner, Juan Pablo Bello:
Few-Shot Musical Source Separation. 121-125 - Ethan Manilow, Patrick O'Reilly, Prem Seetharaman, Bryan Pardo:
Source Separation By Steering Pretrained Music Models. 126-130 - Xuewen Yao, Megan Micheletti, Mckensey Johnson, Edison Thomaz, Kaya de Barbaro:
Infant Crying Detection In Real-World Environments. 131-135 - Qin Zhang, Qingming Tang, Chieh-Chi Kao, Ming Sun, Yang Liu, Chao Wang:
Wikitag: Wikipedia-Based Knowledge Embeddings Towards Improved Acoustic Event Classification. 136-140 - Magdalena Fuentes, Bea Steers, Pablo Zinemanas, Martín Rocamora, Luca Bondi, Julia Wilkins, Qianyi Shi, Yao Hou, Samarjit Das, Xavier Serra, Juan Pablo Bello:
Urban Sound & Sight: Dataset And Benchmark For Audio-Visual Urban Scene Understanding. 141-145 - Sai Srinadhu Katta, Kide Vuojärvi, Sivaprasad Nandyala, Ulla-Maria Kovalainen, Lauren Baddeley:
Real-World On-Board UAV Audio Data Set For Propeller Anomalies. 146-150 - Yuan Gong, Jin Yu, James R. Glass:
Vocalsound: A Dataset for Improving Human Vocal Sounds Recognition. 151-155 - Kento Nagatomo, Masahiro Yasuda, Kohei Yatabe, Shoichiro Saito, Yasuhiro Oikawa:
Wearable SELD Dataset: Dataset For Sound Event Localization And Detection Using Wearable Devices Around Head. 156-160 - Viet-Anh Nguyen, Anh H. T. Nguyen, Andy W. H. Khong:
Tunet: A Block-Online Bandwidth Extension Model Based On Transformers And Self-Supervised Pretraining. 161-165 - Jinjiang Liu, Xueliang Zhang:
DRC-NET: Densely Connected Recurrent Convolutional Neural Network for Speech Dereverberation. 166-170 - Jean-Marie Lemercier, Joachim Thiemann, Raphael Koning, Timo Gerkmann:
Customizable End-To-End Optimization Of Online Neural Network-Supported Dereverberation For Hearing Devices. 171-175 - Naoyuki Kamo, Rintaro Ikeshita, Keisuke Kinoshita, Tomohiro Nakatani:
Importance of Switch Optimization Criterion in Switching WPE Dereverberation. 176-180 - Ziyu Wang, Dejing Xu, Gus Xia, Ying Shan:
Audio-To-Symbolic Arrangement Via Cross-Modal Music Representation Learning. 181-185 - Shiqi Wei, Gus Xia, Yixiao Zhang, Liwei Lin, Weiguo Gao:
Music Phrase Inpainting Using Long-Term Representation and Contrastive Loss. 186-190 - Yi Zou, Pei Zou, Yi Zhao, Kaixiang Zhang, Ran Zhang, Xiaorui Wang:
Melons: Generating Melody With Long-Term Structure Using Transformers And Structure Graph. 191-195 - Moyu Terao, Yuki Hiramatsu, Ryoto Ishizuka, Yiming Wu, Kazuyoshi Yoshii:
Difficulty-Aware Neural Band-to-Piano Score Arrangement based on Note- and Statistic-Level Criteria. 196-200 - Pedro Ramoneda, Nazif Can Tamer, Vsevolod Eremenko, Xavier Serra, Marius Miron:
Score Difficulty Analysis for Piano Performance Education based on Fingering. 201-205 - Zhipeng Chen, Yiya Hao, Yaobin Chen, Gong Chen, Liang Ruan:
A Neural Network-based Howling Detection Method for Real-Time Communication Applications. 206-210 - Tomer Fireaizen, Saar Ron, Omer Bobrowski:
Alarm Sound Detection Using Topological Signal Processing. 211-215 - Osamu Ichikawa, Yuuto Shima, Takahiro Nakayama, Hajime Shirouzu:
A Method For Estimating The Grouping Of Participants In Classroom Group Work Using Only Audio Information. 216-220 - Yuki Okamoto, Shota Horiguchi, Masaaki Yamamoto, Keisuke Imoto, Yohei Kawaguchi:
Environmental Sound Extraction Using Onomatopoeic Words. 221-225 - Masahiro Yasuda, Yasunori Ohishi, Shoichiro Saito:
Echo-Aware Adaptation of Sound Event Localization and Detection in Unknown Environments. 226-230 - Juncheng B. Li, Shuhui Qu, Xinjian Li, Bernie Po-Yao Huang, Florian Metze:
On Adversarial Robustness Of Large-Scale Audio Visual Learning. 231-235 - Haibin Wu, Po-Chun Hsu, Ji Gao, Shanshan Zhang, Shen Huang, Jian Kang, Zhiyong Wu, Helen Meng, Hung-Yi Lee:
Adversarial Sample Detection for Speaker Verification by Neural Vocoders. 236-240 - Naoya Takahashi, Yuki Mitsufuji:
Amicable Examples for Informed Source Separation. 241-245 - David M. Chan, Shalini Ghosh, Debmalya Chakrabarty, Björn Hoffmeister:
Multi-Modal Pre-Training for Automated Speech Recognition. 246-250 - Ryota Tsunoda, Ryo Aihara, Ryoichi Takashima, Tetsuya Takiguchi, Yoshie Imai:
Speaker-Targeted Audio-Visual Speech Recognition Using a Hybrid CTC/Attention Model with Interference Loss. 251-255 - Yifei Wu, Chenda Li, Jinfeng Bai, Zhongqin Wu, Yanmin Qian:
Time-Domain Audio-Visual Speech Separation on Low Quality Videos. 256-260 - Mhd Modar Halimeh, Walter Kellermann:
Complex-Valued Spatial Autoencoders for Multichannel Speech Enhancement. 261-265 - Zhi-Wei Tan, Anh H. T. Nguyen, Yuan Liu, Andy W. H. Khong:
Multichannel Noise Reduction Using Dilated Multichannel U-Net and Pre-Trained Single-Channel Network. 266-270 - Hassan Taherian, Sefik Emre Eskimez, Takuya Yoshioka, Huaming Wang, Zhuo Chen, Xuedong Huang:
One Model to Enhance Them All: Array Geometry Agnostic Multi-Channel Personalized Speech Enhancement. 271-275 - Cong Han, Emine Merve Kaya, Kyle Hoefer, Malcolm Slaney, Simon Carlile:
Multi-Channel Speech Denoising for Machine Ears. 276-280 - Zhong-Qiu Wang, DeLiang Wang:
Localization based Sequential Grouping for Continuous Speech Separation. 281-285 - Mieszko Fras, Marcin Witkowski, Konrad Kowalczyk:
Convolutional Weighted Minimum Mean Square Error Filter for Joint Source Separation and Dereverberation. 286-290 - Ethan Manilow, Curtis Hawthorne, Cheng-Zhi Anna Huang, Bryan Pardo, Jesse H. Engel:
Improving Source Separation by Explicitly Modeling Dependencies between Sources. 291-295 - Yuichiro Koyama, Naoki Murata, Stefan Uhlich, Giorgio Fabbro, Shusuke Takahashi, Yuki Mitsufuji:
Music Source Separation With Deep Equilibrium Models. 296-300 - Natsuki Akaishi, Kohei Yatabe, Yasuhiro Oikawa:
Harmonic and Percussive Sound Separation Based on Mixed Partial Derivative of Phase Spectrogram. 301-305 - Enric Gusó, Jordi Pons, Santiago Pascual, Joan Serrà:
On Loss Functions and Evaluation Metrics for Music Source Separation. 306-310 - Sangwook Park, Mounya Elhilali:
Time-Balanced Focal Loss for Audio Event Detection. 311-315 - Kazuki Shimada, Yuichiro Koyama, Shusuke Takahashi, Naoya Takahashi, Emiru Tsunoo, Yuki Mitsufuji:
Multi-ACCDOA: Localizing And Detecting Overlapping Sounds From The Same Class With Auxiliary Duplicating Permutation Invariant Training. 316-320 - Arman Zharmagambetov, Qingming Tang, Chieh-Chi Kao, Qin Zhang, Ming Sun, Viktor Rozgic, Jasha Droppo, Chao Wang:
Improved Representation Learning For Acoustic Event Classification Using Tree-Structured Ontology. 321-325 - Sandeep Kothinti, Mounya Elhilali:
Temporal Contrastive-Loss for Audio Event Detection. 326-330 - Xu Wang, Xiangjinzi Zhang, Yunfei Zi, Shengwu Xiong:
A Frame Loss of Multiple Instance Learning for Weakly Supervised Sound Event Detection. 331-335 - Heinrich Dinkel, Zhiyong Yan, Yongqing Wang, Junbo Zhang, Yujun Wang:
Pseudo Strong Labels for Large Scale Weakly Supervised Audio Tagging. 336-340 - Wenyu Jin, Tim Schoof, Henning F. Schepker:
Individualized Hear-Through For Acoustic Transparency Using PCA-Based Sound Pressure Estimation At The Eardrum. 341-345 - Benjamin Lentz, Rainer Martin, Kirsten Oberländer, Christiane Völter:
On Spectral and Temporal Sparsification of Speech Signals for the Improvement of Speech Perception in CI Listeners. 346-350 - Fotios Drakopoulos, Sarah Verhulst:
A Differentiable Optimisation Framework for The Design of Individualised DNN-based Hearing-Aid Strategies. 351-355 - Sefik Emre Eskimez, Takuya Yoshioka, Huaming Wang, Xiaofei Wang, Zhuo Chen, Xuedong Huang:
Personalized Speech Enhancement: New Models and Comprehensive Evaluation. 356-360 - Jinxu Xiang, Yuyang Zhu, Rundi Wu, Ruilin Xu, Yuko Ishiwaka, Changxi Zheng:
Dynamic Sliding Window for Realtime Denoising Networks. 361-365 - Sunwoo Kim, Minje Kim:
Bloom-Net: Blockwise Optimization for Masking Networks Toward Scalable and Efficient Speech Enhancement. 366-370 - Tianrui Wang, Weibin Zhu, Yingying Gao, Junlan Feng, Shilei Zhang:
HGCN: Harmonic Gated Compensation Network for Speech Enhancement. 371-375 - Wenbin Jiang, Zhijun Liu, Kai Yu, Fei Wen:
Speech Enhancement with Neural Homomorphic Synthesis. 376-380 - Yang Xiang, Jesper Lisby Højvang, Morten Højfeldt Rasmussen, Mads Græsbøll Christensen:
A Bayesian Permutation Training Deep Representation Learning Method for Speech Enhancement with Variational Autoencoder. 381-385 - Huajian Fang, Tal Peer, Stefan Wermter, Timo Gerkmann:
Integrating Statistical Uncertainty into Neural Network-Based Speech Enhancement. 386-390 - Viet Anh Trinh, Sebastian Braun:
Unsupervised Speech Enhancement with Speech Recognition Embedding and Disentanglement Losses. 391-395 - Xianke Wang, Wei Xu, Weiming Yang, Wenqing Cheng:
Musicyolo: A Sight-Singing Onset/Offset Detection Framework Based on Object Detection Instead of Spectrum Frames. 396-400 - Yun-Ning Hung, Ju-Chiang Wang, Xuchen Song, Wei Tsung Lu, Minz Won:
Modeling Beats and Downbeats with a Time-Frequency Transformer. 401-405 - Michael Krause, Meinard Müller:
Hierarchical Classification of Singing Activity, Gender, and Type in Complex Music Recordings. 406-410 - Qiqi He, Xiaoheng Sun, Yi Yu, Wei Li:
Deepchorus: A Hybrid Model of Multi-Scale Convolution And Self-Attention for Chorus Detection. 411-415 - Ju-Chiang Wang, Yun-Ning Hung, Jordan B. L. Smith:
To Catch A Chorus, Verse, Intro, or Anything Else: Analyzing a Song with Structural Functions. 416-420 - Mojtaba Heydari, Matthew C. McCallum, Andreas F. Ehmann, Zhiyao Duan:
A Novel 1D State Space for Efficient Music Rhythmic Analysis. 421-425 - Haici Yang, Sanna Wager, Spencer Russell, Mike Luo, Minje Kim, Wontak Kim:
Upmixing Via Style Transfer: A Variational Autoencoder for Disentangling Spatial Images And Musical Content. 426-430 - Ricardo Falcón Pérez, Kazuki Shimada, Yuichiro Koyama, Shusuke Takahashi, Yuki Mitsufuji:
Spatial Mixup: Directional Loudness Modification as Data Augmentation for Sound Event Localization and Detection. 431-435 - Tobias Kabzinski, Peter Jax:
Towards Faster Continuous Multi-Channel HRTF Measurements Based On Learning System Models. 436-440 - Bowen Zhi, Dmitry N. Zotkin, Ramani Duraiswami:
Towards Fast And Convenient End-To-End HRTF Personalization. 441-445 - Mateusz Guzik, Konrad Kowalczyk:
Wishart Localization Prior On Spatial Covariance Matrix In Ambisonic Source Separation Using Non-Negative Tensor Factorization. 446-450 - Jiawen Huang, Emmanouil Benetos, Sebastian Ewert:
Improving Lyrics Alignment Through Joint Pitch Detection. 451-455 - Ilaria Manco, Emmanouil Benetos, Elio Quinton, György Fazekas:
Learning Music Audio Representations Via Weak Language Supervision. 456-460 - David Giuseppe Badiane, Raffaele Malvermi, Sebastian Gonzalez, Fabio Antonacci, Augusto Sarti:
On the Prediction of the Frequency Response of a Wooden Plate from Its Mechanical Parameters. 461-465 - Bo-Yu Chen, Wei-Han Hsu, Wei-Hsiang Liao, Marco A. Martínez Ramírez, Yuki Mitsufuji, Yi-Hsuan Yang:
Automatic DJ Transitions with Differentiable Audio Effects and Generative Adversarial Networks. 466-470 - Han Chen, Yan Song, Li-Rong Dai, Ian McLoughlin, Lin Liu:
Self-Supervised Representation Learning for Unsupervised Anomalous Sound Detection Under Domain Shift. 471-475 - Vasileios Tsouvalas, Aaqib Saeed, Tanir Ozcelebi:
Federated Self-Training for Data-Efficient Audio Recognition. 476-480 - Meng Feng, Chieh-Chi Kao, Qingming Tang, Ming Sun, Viktor Rozgic, Spyros Matsoukas, Chao Wang:
Federated Self-Supervised Learning for Acoustic Event Classification. 481-485 - Kwanghee Choi, Martin Kersner, Jacob Morton, Buru Chang:
Temporal Knowledge Distillation for on-device Audio Classification. 486-490 - Ognjen (Oggi) Rudovic, Akanksha Bindal, Vineet Garg, Pramod Simha, Pranay Dighe, Sachin Kajarekar:
Streaming on-Device Detection of Device Directed Speech from Voice and Touch-Based Invocation. 491-495 - Hiroshi Sawada, Rintaro Ikeshita, Keisuke Kinoshita, Tomohiro Nakatani:
Multi-Frame Full-Rank Spatial Covariance Analysis for Underdetermined BSS in Reverberant Environments. 496-500 - Aditya Arie Nugraha, Kouhei Sekiguchi, Mathieu Fontaine, Yoshiaki Bando, Kazuyoshi Yoshii:
Flow-Based Fast Multichannel Nonnegative Matrix Factorization for Blind Source Separation. 501-505 - Yudong He, He Wang, Qifeng Chen, Richard Hau Yue So:
Harvesting Partially-Disjoint Time-Frequency Information for Improving Degenerate Unmixing Estimation Technique. 506-510 - Shogo Seki, Hirokazu Kameoka, Li Li:
Investigation And Comparison of Optimization Methods for Variational Autoencoder-Based Underdetermined Multichannel Source Separation. 511-515 - Li Li, Hirokazu Kameoka, Shogo Seki:
HBP: An Efficient Block Permutation Solver Using Hungarian Algorithm and Spectrogram Inpainting for Multichannel Audio Source Separation. 516-520 - Chenxing Li, Yang Wang, Feng Deng, Zhuo Zhang, Xiaorui Wang, Zhongyuan Wang:
EAD-Conformer: a Conformer-Based Encoder-Attention-Decoder-Network for Multi-Task Audio Source Separation. 521-525 - Darius Petermann, Gordon Wichern, Zhong-Qiu Wang, Jonathan Le Roux:
The Cocktail Fork Problem: Three-Stem Audio Separation for Real-World Soundtracks. 526-530 - Félix Mathieu, Thomas Courtat, Gaël Richard, Geoffroy Peeters:
Phase Shifted Bedrosian Filterbank: An Interpretable Audio Front-End for Time-Domain Audio Source Separation. 531-535 - Rahil Parikh, Ilya Kavalerov, Carol Y. Espy-Wilson, Shihab A. Shamma:
Harmonicity Plays a Critical Role in DNN Based Versus in Biologically-Inspired Monaural Speech Segregation Systems. 536-540 - Changsheng Quan, Xiaofei Li:
Multi-Channel Narrow-Band Deep Speech Separation with Full-Band Permutation Invariant Training. 541-545 - Cunhang Fan, Zhao Lv, Shengbing Pei, Mingyue Niu:
Csenet: Complex Squeeze-and-Excitation Network for Speech Depression Level Prediction. 546-550 - Ebrahim Nemati, Xuhai Xu, Viswam Nathan, Korosh Vatanparvar, Tousif Ahmed, Md. Mahbubur Rahman, Dan McCaffrey, Jilong Kuang, Alex Gao:
Ubilung: Multi-Modal Passive-Based Lung Health Assessment. 551-555 - Neeraj Kumar Sharma, Srikanth Raj Chetupalli, Debarpan Bhattacharya, Debottam Dutta, Pravin Mote, Sriram Ganapathy:
The Second Dicova Challenge: Dataset and Performance Analysis for Diagnosis of Covid-19 Using Acoustics. 556-560 - Xing-Yu Chen, Qiu-Shi Zhu, Jie Zhang, Li-Rong Dai:
Supervised and Self-Supervised Pretraining Based Covid-19 Detection Using Acoustic Breathing/Cough/Speech Signals. 561-565 - Madhu R. Kamble, Jose Patino, Maria A. Zuluaga, Massimiliano Todisco:
Exploring Auditory Acoustic Features for The Diagnosis of Covid-19. 566-570 - Anton Ratnarajah, Shi-Xiong Zhang, Meng Yu, Zhenyu Tang, Dinesh Manocha, Dong Yu:
Fast-Rir: Fast Neural Diffuse Room Impulse Response Generator. 571-575 - Juliano G. C. Ribeiro, Shoichi Koyama, Hiroshi Saruwatari:
Region-to-Region Kernel Interpolation of Acoustic Transfer Function with Directional Weighting. 576-580 - Philipp Götz, Cagdas Tuna, Andreas Walther, Emanuël A. P. Habets:
Blind Reverberation Time Estimation in Dynamic Acoustic Conditions. 581-585 - Maozhong Fu, Jesper Rindom Jensen, Yuhan Li, Mads Græsbøll Christensen:
Sparse Modeling of The Early Part of Noisy Room Impulse Responses with Sparse Bayesian Learning. 586-590 - Jack Deadman, Jon Barker:
Improved Simulation of Realistically-Spatialised Simultaneous Speech Using Multi-Camera Analysis in The Chime-5 Dataset. 591-595 - Mattia Papa, Clara Borrelli, Paolo Bestagini, Fabio Antonacci, Augusto Sarti, Stefano Tubaro:
A Data-Driven Approach for Acoustic Parameter Similarity Estimation of Speech Recording. 596-600 - Yudong Zhao, György Fazekas, Mark B. Sandler:
Violinist Identification Using Note-Level Timbre Feature Distributions. 601-605 - Hang Zhao, Chen Zhang, Bilei Zhu, Zejun Ma, Kejun Zhang:
S3T: Self-Supervised Pre-Training with Swin Transformer For Music Classification. 606-610 - Morgan Buisson, Pablo Alonso-Jiménez, Dmitry Bogdanov:
Ambiguity Modelling with Label Distribution Learning for Music Classification. 611-615 - Xingjian Du, Ke Chen, Zijie Wang, Bilei Zhu, Zejun Ma:
Bytecover2: Towards Dimensionality Reduction of Latent Embedding for Efficient Cover Song Identification. 616-620 - Ke Chen, Shuai Yu, Cheng-i Wang, Wei Li, Taylor Berg-Kirkpatrick, Shlomo Dubnov:
Tonet: Tone-Octave Network for Singing Melody Extraction from Polyphonic Music. 621-625 - Shuai Yu, Xi Chen, Wei Li:
Hierarchical Graph-Based Neural Network for Singing Melody Extraction. 626-630 - Michel Olvera, Emmanuel Vincent, Gilles Gasso:
On The Impact of Normalization Strategies in Unsupervised Adversarial Domain Adaptation for Acoustic Scene Classification. 631-635 - Tom Denton, Scott Wisdom, John R. Hershey:
Improving Bird Classification with Unsupervised Sound Separation. 636-640 - Francesco Paissan, Alberto Ancilotto, Alessio Brutti, Elisabetta Farella:
Scalable Neural Architectures for End-to-End Environmental Sound Classification. 641-645 - Ke Chen, Xingjian Du, Bilei Zhu, Zejun Ma, Taylor Berg-Kirkpatrick, Shlomo Dubnov:
HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection. 646-650 - You Wang, David V. Anderson:
Hybrid Attention-Based Prototypical Networks for Few-Shot Sound Classification. 651-655 - Karn N. Watcharasupat, Thi Ngoc Tho Nguyen, Woon-Seng Gan, Shengkui Zhao, Bin Ma:
End-to-End Complex-Valued Multidilated Convolutional Neural Network for Joint Acoustic Echo Cancellation and Noise Suppression. 656-660 - Ziteng Wang, Yueyue Na, Biao Tian, Qiang Fu:
NN3A: Neural Network Supported Acoustic Echo Cancellation, Noise Suppression and Automatic Gain Control for Real-Time Communications. 661-665 - Jan Franzen, Tim Fingscheidt:
Deep Residual Echo Suppression and Noise Reduction: A Multi-Input FCRN Approach in a Hybrid Speech Enhancement System. 666-670 - Hao Zhang, DeLiang Wang:
Neural Cascade Architecture for Joint Acoustic Echo and Noise Suppression. 671-675 - Santiago Ruiz, Toon van Waterschoot, Marc Moonen:
Cascade Multi-Channel Noise Reduction and Acoustic Feedback Cancellation. 676-680 - Chenda Li, Lei Yang, Weiqin Wang, Yanmin Qian:
Skim: Skipping Memory LSTM for Low-Latency Real-Time Continuous Speech Separation. 681-685 - Aswin Sivaraman, Scott Wisdom, Hakan Erdogan, John R. Hershey:
Adapting Speech Separation to Real-World Meetings using Mixture Invariant Training. 686-690 - Eisuke Konno, Daisuke Saito, Nobuaki Minematsu:
Quantifying Discriminability between NMF Bases. 691-695 - Hassan Taherian, Ke Tan, DeLiang Wang:
Location-Based Training for Multi-Channel Talker-Independent Speaker Separation. 696-700 - Robin Scheibler:
SDR - Medium Rare with Fast Computations. 701-705 - Hirokazu Kameoka, Shogo Seki, Li Li, Chihiro Watanabe:
Attentionpit: Soft Permutation Invariant Training for Audio Source Separation with Attention Mechanism. 706-710 - Olga Slizovskaia, Gordon Wichern, Zhong-Qiu Wang, Jonathan Le Roux:
Locate This, Not that: Class-Conditioned Sound Event DOA Estimation. 711-715 - Thi Ngoc Tho Nguyen, Douglas L. Jones, Karn N. Watcharasupat, Huy Phan, Woon-Seng Gan:
SALSA-Lite: A Fast and Effective Feature for Polyphonic Sound Event Localization and Detection with Microphone Arrays. 716-720 - Bing Yang, Hong Liu, Xiaofei Li:
SRP-DNN: Learning Direct-Path Phase Difference for Multiple Moving Sound Source Localization. 721-725 - Yonggang Hu, Sharon Gannot:
Closed-Form Single Source Direction-of-Arrival Estimator Using First-Order Relative Harmonic Coefficients. 726-730 - Jianhua Geng, Sifan Wang, Xin Lou:
A Slide-Save Based Framework for Multi-Source DOA Extraction with Closely Spaced Sources. 731-735 - Yu Chen, Bowen Liu, Zijian Zhang, Hun-Seok Kim:
An End-to-End Deep Learning Framework For Multiple Audio Source Separation And Localization. 736-740 - Amir Ivry, Israel Cohen, Baruch Berdugo:
Deep Adaptation Control for Acoustic Echo Cancellation. 741-745 - Amir Ivry, Israel Cohen, Baruch Berdugo:
Off-the-Shelf Deep Integration For Residual-Echo Suppression. 746-750 - Chenggang Zhang, Jinjiang Liu, Xueliang Zhang:
A Complex Spectral Mapping with Inplace Convolution Recurrent Neural Networks For Acoustic Echo Cancellation. 751-755 - Hao Zhang, Srivatsan Kandadai, Harsha Rao, Minje Kim, Tarun Pruthi, Trausti T. Kristjansson:
Deep Adaptive AEC: Hybrid of Deep Learning and Adaptive Acoustic Echo Cancellation. 756-760 - Yurii Iotov, Sidsel Marie Nørholm, Valiantsin Belyi, Mads Dyrholm, Mads Græsbøll Christensen:
Computationally Efficient Fixed-Filter ANC for Speech Based on Long-Term Prediction for Headphone Applications. 761-765 - Thomas Haubner, Andreas Brendel, Walter Kellermann:
End-To-End Deep Learning-Based Adaptation Control for Frequency-Domain Adaptive System Identification. 766-770 - Grigoris Bastas, Stefanos Koutoupis, Maximos A. Kaliakatsos-Papakostas, Vassilis Katsouros, Petros Maragos:
A Few-Sample Strategy for Guitar Tablature Transcription Based on Inharmonicity Analysis and Playability Constraints. 771-775 - Longshen Ou, Ziyi Guo, Emmanouil Benetos, Jiqing Han, Ye Wang:
Exploring Transformer's Potential on Automatic Piano Transcription. 776-780 - Rachel M. Bittner, Juan José Bosch, David Rubinstein, Gabriel Meseguer-Brocal, Sebastian Ewert:
A Lightweight Instrument-Agnostic Model for Polyphonic Note Transcription and Multipitch Estimation. 781-785 - Yu-Hua Chen, Wen-Yi Hsiao, Tsu-Kuang Hsieh, Jyh-Shing Roger Jang, Yi-Hsuan Yang:
Towards Automatic Transcription of Polyphonic Electric Guitar Music: A New Dataset and a Multi-Loss Transformer Model. 786-790 - Xiaoxue Gao, Chitralekha Gupta, Haizhou Li:
Genre-Conditioned Acoustic Models for Automatic Lyrics Transcription of Polyphonic Music. 791-795 - Sangeun Kum, Jongpil Lee, Keunhyoung Luke Kim, Taehyoung Kim, Juhan Nam:
Pseudo-Label Transfer from Frame-Level to Note-Level in a Teacher-Student Framework for Singing Transcription from Polyphonic Music. 796-800 - Noriyuki Tonami, Keisuke Imoto, Ryotaro Nagase, Yuki Okamoto, Takahiro Fukumori, Yoichi Yamashita:
Sound Event Detection Guided by Semantic Contexts of Scenes. 801-805 - Keigo Wakayama, Shoichiro Saito:
CNN-Transformer with Self-Attention Network for Sound Event Detection. 806-810 - Dongchao Yang, Helin Wang, Yuexian Zou, Zhongjie Ye, Wenwu Wang:
A Mutual Learning Framework for Few-Shot Sound Event Detection. 811-815 - Youde Liu, Jian Guan, Qiaoxi Zhu, Wenwu Wang:
Anomalous Sound Detection Using Spectral-Temporal Information Fusion. 816-820 - Yadong Guan, Jiabin Xue, Guibin Zheng, Jiqing Han:
Sparse Self-Attention for Semi-Supervised Sound Event Detection. 821-825 - Hayato Endo, Hiromitsu Nishizaki:
Peer Collaborative Learning for Polyphonic Sound Event Detection. 826-830 - Srikanth Korse, Nicola Pia, Kishan Gupta, Guillaume Fuchs:
PostGAN: A GAN-Based Post-Processor to Enhance the Quality of Coded Speech. 831-835 - Kishan Gupta, Srikanth Korse, Bernd Edler, Guillaume Fuchs:
A DNN Based Post-Filter to Enhance the Quality of Coded Speech in MDCT Domain. 836-840 - Eloi Moliner, Vesa Välimäki:
A Two-Stage U-Net for High-Fidelity Denoising of Historical Recordings. 841-845 - Marvin Borsdorf, Kevin Scheck, Haizhou Li, Tanja Schultz:
Experts Versus All-Rounders: Target Language Extraction for Multiple Target Languages. 846-850 - Guangwei Li, Xuenan Xu, Heinrich Dinkel, Mengyue Wu, Kai Yu:
Category-Adapted Sound Event Enhancement with Weakly Labeled Data. 851-855 - Rubén M. Clavería, Simon J. Godsill:
Sequential MCMC Methods for Audio Signal Enhancement. 856-860 - Tejas Jayashankar, Thilo Köhler, Kaustubh Kalgaonkar, Zhiping Xiu, Jilong Wu, Ju Lin, Prabhav Agrawal, Qing He:
Architecture for Variable Bitrate Neural Speech Codec with Configurable Computation Complexity. 861-865 - Xue Jiang, Xiulian Peng, Chengyu Zheng, Huaying Xue, Yuan Zhang, Yan Lu:
End-to-End Neural Speech Coding for Real-Time Communications. 866-870 - Seungmin Shin, Joon Byun, Youngcheol Park, Jongmo Sung, Seungkwon Beack:
Deep Neural Network (DNN) Audio Coder Using A Perceptually Improved Training Method. 871-875 - Chanwoo Lee, Hyungseob Lim, Jihyun Lee, Inseon Jang, Hong-Goo Kang:
Progressive Multi-Stage Neural Audio Coding with Guided References. 876-880 - Ehab A. AlBadawy, Andrew Gibiansky, Qing He, Jilong Wu, Ming-Ching Chang, Siwei Lyu:
Vocbench: A Neural Vocoder Benchmark for Speech Synthesis. 881-885 - Chandan K. A. Reddy, Vishak Gopal, Ross Cutler:
Dnsmos P.835: A Non-Intrusive Perceptual Objective Speech Quality Metric to Evaluate Noise Suppressors. 886-890 - Pranay Manocha, Zeyu Jin, Adam Finkelstein:
SQAPP: No-Reference Speech Quality Assessment Via Pairwise Preference. 891-895 - Wen-Chin Huang, Erica Cooper, Junichi Yamagishi, Tomoki Toda:
LDNet: Unified Listener Dependent Modeling in MOS Prediction for Synthetic Speech. 896-900 - Marju Purin, Sten Sootla, Mateja Sponza, Ando Saabas, Ross Cutler:
AECMOS: A Speech Quality Assessment Metric for Echo Impairment. 901-905 - Miao Liu, Jing Wang, Shicong Li, Fei Xiang, Yue Yao, Lidong Yang:
MOS Predictor for Synthetic Speech with I-Vector Inputs. 906-910 - Daan Ratering, W. Bastiaan Kleijn, Jean Gonzalez Silva, Riccardo M. G. Ferrari:
Wave-Domain Approach for Cancelling Noise Entering Open Windows. 911-915 - Tobias Gburrek, Joerg Schmalenstroeer, Reinhold Haeb-Umbach:
On Synchronization of Wireless Acoustic Sensor Networks in the Presence of Time-Varying Sampling Rate Offsets and Speaker Changes. 916-920 - Takuya Yoshioka, Xiaofei Wang, Dongmei Wang:
Picknet: Real-Time Channel Selection for Ad Hoc Microphone Arrays. 921-925 - Jarred Barber, Yifeng Fan, Tao Zhang:
End-To-End Alexa Device Arbitration. 926-930 - Natsuki Ueno, Nobutaka Ono:
Instantaneous Linear Dimensionality Reduction of Multichannel Time-Series Signal for Array Signal Processing. 931-935 - Srdan Kitic, Jérôme Daniel:
Generalized Time Domain Velocity Vector. 936-940 - Masaya Kawamura, Tomohiko Nakamura, Daichi Kitamura, Hiroshi Saruwatari, Yu Takahashi, Kazunobu Kondo:
Differentiable Digital Signal Processing Mixture Model for Synthesis Parameter Extraction from Mixture of Harmonic Sounds. 941-945 - Yashish M. Siriwardena, Guilhem Marion, Shihab A. Shamma:
The Mirrornet: Learning Audio Synthesizer Controls Inspired by Sensorimotor Interaction. 946-950 - Hao-Wen Dong, Cong Zhou, Taylor Berg-Kirkpatrick, Julian J. McAuley:
Deep Performer: Score-to-Audio Music Performance Synthesis. 951-955 - Chien-Feng Liao, Jen-Yu Liu, Yi-Hsuan Yang:
KaraSinger: Score-Free Singing Voice Synthesis with VQ-VAE Using Mel-Spectrograms. 956-960 - Jihyun Lee, Hyungseob Lim, Chanwoo Lee, Inseon Jang, Hong-Goo Kang:
Adversarial Audio Synthesis Using a Harmonic-Percussive Discriminator. 961-965 - Jing Yang, Chulhong Min, Akhil Mathur, Fahim Kawsar:
SleepGAN: Towards Personalized Sleep Therapy Music. 966-970 - Xuenan Xu, Mengyue Wu, Kai Yu:
Diversity-Controllable and Accurate Audio Captioning Based on Neural Condition. 971-975 - Andrey Guzhov, Federico Raue, Jörn Hees, Andreas Dengel:
Audioclip: Extending Clip to Image, Text and Audio. 976-980 - Zelin Zhou, Zhiling Zhang, Xuenan Xu, Zeyu Xie, Mengyue Wu, Kenny Q. Zhu:
Can Audio Captions Be Evaluated With Image Caption Metrics? 981-985 - Pablo M. Delgado, Jürgen Herre:
A Data-Driven Cognitive Salience Model for Objective Perceptual Audio Quality Assessment. 986-990 - Ryosuke Sawata, Yosuke Kashiwagi, Shusuke Takahashi:
Improving Character Error Rate is Not Equal to Having Clean Speech: Speech Enhancement for ASR Systems with Black-Box Acoustic Models. 991-995 - Sebastian Braun, Hannes Gamper:
Effect of Noise Suppression Losses on Speech Distortion and ASR Performance. 996-1000 - Alix Jeannerot, Niels de Koeijer, Pablo Martínez-Nuevo, Martin Bo Møller, Jakob Dyreby, Paolo Prandoni:
Increasing Loudness in Audio Signals: A Perceptually Motivated Approach to Preserve Audio Quality. 1001-1005 - Sebastian J. Schlecht, Leonardo Fierro, Vesa Välimäki, Juha Backman:
Audio Peak Reduction Using a Synced allpass Filter. 1006-1010 - Tomoro Tanaka, Kohei Yatabe, Masahiro Yasuda, Yasuhiro Oikawa:
APPLADE: Adjustable Plug-and-Play Audio Declipper Combining DNN with Sparse Optimization. 1011-1015 - Daniel Tompkins, Kshitiz Kumar, Jian Wu:
Maximizing Audio Event Detection Model Performance on Small Datasets Through Knowledge Transfer, Data Augmentation, and Pretraining: an Ablation Study. 1016-1020 - Janek Ebbers, Reinhold Haeb-Umbach, Romain Serizel:
Threshold Independent Evaluation of Sound Event Detection Scores. 1021-1025 - Seyed M. R. Modaresi, Aomar Osmani, Mohammadreza Razzazi, Abdelghani Chibani:
Multimodal Evaluation Method for Sound Event Detection. 1026-1030 - Francesca Ronchini, Romain Serizel:
A Benchmark of State-of-the-Art Sound Event Detection Systems Evaluated on Synthetic Soundscapes. 1031-1035 - Hye-jin Shim, Jee-weon Jung, Ju-ho Kim, Ha-Jin Yu:
Attentive Max Feature Map and Joint Training for Acoustic Scene Classification. 1036-1040 - Hu Hu, Sabato Marco Siniscalchi, Chao-Han Huck Yang, Chin-Hui Lee:
A Variational Bayesian Approach to Learning Latent Variables for Acoustic Knowledge Transfer. 1041-1045 - Christian Bergler, Manuel Schmitt, Andreas K. Maier, Rachael Xi Cheng, Volker Barth, Elmar Nöth:
ORCA-PARTY: An Automatic Killer Whale Sound Type Separation Toolkit Using Deep Learning. 1046-1050 - Mirco Pezzoli, Maximo Cobos, Fabio Antonacci, Augusto Sarti:
Sparsity-Based Sound Field Separation in the Spherical Harmonics Domain. 1051-1055 - Kazuyuki Arikawa, Shoichi Koyama, Hiroshi Saruwatari:
Spatial Active Noise Control Based on Individual Kernel Interpolation of Primary and Secondary Sound Fields. 1056-1060 - Sipei Zhao, Ian S. Burnett:
Time-Domain Acoustic Contrast Control with A Spatial Uniformity Constraint for Personal Audio Systems. 1061-1065 - Liming Shi, Guoli Ping, Xiaoxiang Shen, Mads Græsbøll Christensen:
Generation of Personal Sound Fields in Reverberant Environments Using Interframe Correlation. 1066-1070 - Jesper Brunnström, Shoichi Koyama, Marc Moonen:
Variable Span Trade-Off Filter for Sound Zone Control with Kernel Interpolation Weighting. 1071-1075 - Nara Hahn, Frank Schultz, Sascha Spors:
Time Domain Radial Filter Design for Spherical Waves. 1076-1080 - Junxiao Sun, Ke Zhang, Shuyi Niu, Yan Zhang, Youyong Kong:
Feature Space Message Passing Network for Medical Image Semantic Segmentation. 1081-1085 - Yixin Wang, Zhe Xu, Jiang Tian, Jie Luo, Zhongchao Shi, Yang Zhang, Jianping Fan, Zhiqiang He:
Cross-Domain Few-Shot Learning for Rare-Disease Skin Lesion Segmentation. 1086-1090 - Chen Li, Wei Chen, Xin Luo, Yulin He, Yusong Tan:
Adaptive Pseudo Labeling for Source-Free Domain Adaptation in Medical Image Segmentation. 1091-1095 - Abdullah F. Al-Battal, Imanuel R. Lerman, Truong Q. Nguyen:
Object Detection and Tracking in Ultrasound Scans Using an Optical Flow and Semantic Segmentation Framework Based on Convolutional Neural Networks. 1096-1100 - Dachuan Shi, Ruiyang Liu, Linmi Tao, Chun Yuan:
Heuristic Dropout: An Efficient Regularization Method for Medical Image Segmentation Models. 1101-1105 - Paria Jeihouni, Omid Dehzangi, Annahita Amireskandari, Ali Dabouei, Ali Rezai, Nasser M. Nasrabadi:
Superresolution and Segmentation of OCT Scans Using Multi-Stage Adversarial Guided Attention Training. 1106-1110 - Yusuke Akamatsu, Yoshifumi Onishi, Hitoshi Imaoka:
Heart Rate and Oxygen Saturation Estimation from Facial Video with Multimodal Physiological Data Generation. 1111-1115 - Kuan-Chen Wang, Kai-Chun Liu, Hsin-Min Wang, Yu Tsao:
EMGSE: Acoustic/EMG Fusion for Multimodal Speech Enhancement. 1116-1120 - Sawon Pratiher, Apoorva Srivastava, Yedla Bindu Priyatha, Nirmalya Ghosh, Amit Patra:
A Dilated Residual Vision Transformer for Atrial Fibrillation Detection from Stacked Time-Frequency ECG Representations. 1121-1125 - Crystal T. Wei, Ming-En Hsieh, Chien-Liang Liu, Vincent S. Tseng:
Contrastive Heartbeats: Contrastive Learning for Self-Supervised ECG Representation and Phenotyping. 1126-1130 - Omid Dehzangi, Paria Jeihouni, Jad Ramadan, Victor S. Finomore, Nasser M. Nasrabadi, Ali Rezai:
Ubiquitous Physiological Prediction of SUD Patients' Wellness State Using Memory-Based Convolutional Models. 1131-1135 - Mu Yang, Darpit Dave, Madhav Erraguntla, Gerard L. Coté, Ricardo Gutierrez-Osuna:
Joint Hypoglycemia Prediction and Glucose Forecasting via Deep Multi-Task Learning. 1136-1140 - Siddharth Subramani, Achuth Rao M. V, Anwesha Roy, Prasanna Suresh Hegde, Prasanta Kumar Ghosh:
SegNet-Based Deep Representation Learning for Dysphagia Classification. 1141-1145 - Francois Buet-Golfouse, Hans Roggeman, Islam Utyagulov:
Robust Collaborative Learning for Sequence Modelling. 1146-1150 - Jen-Cheng Hou, Aileen McGonigal, Fabrice Bartolomei, Monique Thonnat:
A Self-Supervised Pre-Training Framework for Vision-Based Seizure Classification. 1151-1155 - Huaiwen Luo, Lu Zhang, Lianyu Zhou, Xu Lin, Zehuai Zhang, Mingjiang Wang:
Design of Real-Time System Based on Machine Learning for Snoring and OSA Detection. 1156-1160 - Kaan Sel, Noah Huerta, Michael S. Sacks, Roozbeh Jafari:
Parametric Modeling of Human Wrist for Bioimpedance-Based Physiological Sensing. 1161-1165 - José Fernando Adrán Otero, Oscar Soláns Caballer, Pere Martí-Puig, Zhe Sun, Toshihisa Tanaka, Jordi Solé-Casals:
Preliminary Results on the Generation of Artificial Handwriting Data Using a Decomposition-Recombination Strategy. 1166-1170 - Suguru Kanoga, Takayuki Hoshino, Mitsunori Tada:
A Style Transfer Mapping and Fine-Tuning Subject Transfer Framework Using Convolutional Neural Networks for Surface Electromyogram Pattern Recognition. 1171-1175 - Chencheng Guo, Hui Qian, Baoling Hong:
Feature-Based Sensing Matrix Design for Analog to Information Converters. 1176-1180 - K. M. Naimul Hassan, Md. Shamiul Alam Hridoy, Naima Tasnim, Atia Faria Chowdhury, Tanvir Alam Roni, Sheikh Tabrez, Arik Subhana, Celia Shahnaz:
ALSNet: A Dilated 1-D CNN for Identifying ALS from Raw EMG Signal. 1181-1185 - Bilal Ahmad, Liana Khamidullina, Alexey Alexandrovich Korobkov, Alla Manina, Jens Haueisen, Martin Haardt:
Joint Model Order Estimation for Multiple Tensors with A Coupled Mode and Applications to the Joint Decomposition of EEG, MEG Magnetometer, and Gradiometer Tensors. 1186-1190 - Zhikang Zhang, Jonathan Zhao, Fengbo Ren:
An Experimental Study on Transferring Data-Driven Image Compressive Sensing to Bioelectric Signals. 1191-1195 - Elahe Rahimian, Soheil Zabihi, Amir Asif, Dario Farina, Seyed Farokh Atashzar, Arash Mohammadi:
Hand Gesture Recognition Using Temporal Convolutions and Attention Mechanism. 1196-1200 - Bo Fang, Junxin Chen, Wei Wang, Yicong Zhou:
Combining Multiple Style Transfer Networks and Transfer Learning For LGE-CMR Segmentation. 1201-1205 - Jaeyoung Huh, Shujaat Khan, Jong Chul Ye:
Multi-Domain Unpaired Ultrasound Image Artifact Removal Using a Single Convolutional Neural Network. 1206-1210 - Xiao Li, Huizhi Liang, Sidhartha Nagala, Jane Chen:
Improving Ultrasound Image Classification with Local Texture Quantisation. 1211-1215 - Tristan S. W. Stevens, Nishith Chennakeshava, Frederik J. de Bruijn, Martin Pekar, Ruud J. G. van Sloun:
Accelerated Intravascular Ultrasound Imaging using Deep Reinforcement Learning. 1216-1220 - Nishith Chennakeshava, Tristan S. W. Stevens, Frederik J. de Bruijn, Andrew Hancock, Martin Pekar, Yonina C. Eldar, Massimo Mischi, Ruud J. G. van Sloun:
Deep Proximal Unfolding For Image Recovery from Under-Sampled Channel Data in Intravascular Ultrasound. 1221-1225 - Gongpeng Cao, Yiping Wang, Manli Zhang, Jing Zhang, Guixia Kang, Xin Xu:
Multiview Long-Short Spatial Contrastive Learning For 3D Medical Image Analysis. 1226-1230 - Khuong Vo, Manoj Vishwanath, Ramesh Srinivasan, Nikil D. Dutt, Hung Cao:
Composing Graphical Models with Generative Adversarial Networks for EEG Signal Modeling. 1231-1235 - David Bethge, Philipp Hallgarten, Tobias Grosse-Puppendahl, Mohamed Kari, Ralf Mikut, Albrecht Schmidt, Ozan Özdenizci:
Domain-Invariant Representation Learning from EEG with Private Encoders. 1236-1240 - Guangyi Zhang, Ali Etemad:
Holistic Semi-Supervised Approaches for EEG Representation Learning. 1241-1245 - Pankaj Pandey, Gulshan Sharma, Krishna P. Miyapuram, Ramanathan Subramanian, Derek Lomas:
Music Identification Using Brain Responses to Initial Snippets. 1246-1250 - Wei Xu, Jing Wang, Ziyu Jia, Zhiqing Hong, Yunze Li, Youfang Lin:
Multi-Level Spatial-Temporal Adaptation Network for Motor Imagery Classification. 1251-1255 - Lies Bollens, Tom Francart, Hugo Van hamme:
Learning Subject-Invariant Representations from Speech-Evoked EEG Using Variational Autoencoders. 1256-1260 - Xinru Dai, Tai Ma, Haibin Cai, Ying Wen:
Unsupervised Hierarchical Translation-Based Model for Multi-Modal Medical Image Registration. 1261-1265 - Zailiang Chen, Hailei Lan, Yongan Meng, Yuchen Xiong, Jing Luo, Hailan Shen:
FAZ-BV: A Diabetic Macular Ischemia Grading Framework Combining Faz Attention Network and Blood Vessel Enhancement Filters. 1266-1270 - Lijuan Lu, Shun Miao, Ling Ye:
Fracture Detection and Localization in Chest X-Rays Using Semi-Supervised Learning with Dynamic Sharpening. 1271-1275 - Ryan Zhang, Jiadai Zhu, Stephen Yang, Mahdi S. Hosseini, Angelo Genovese, Lina Chen, Corwyn Rowsell, Savvas Damaskinos, Sonal Varma, Konstantinos N. Plataniotis:
Histokt: Cross Knowledge Transfer in Computational Pathology. 1276-1280 - Giovana Augusta Benvenuto, Marilaine Colnago, Wallace Casaca:
Unsupervised Deep Learning Network for Deformable Fundus Image Registration. 1281-1285 - Huijuan Yang, Aaron S. Coyner, Feri Guretno, Ivan Ho Mien, Chuan Sheng Foo, J. Peter Campbell, Susan Ostmo, Michael F. Chiang, Pavitra Krishnaswamy:
A Minimally Supervised Approach for Medical Image Quality Assessment in Domain Shift Settings. 1286-1290 - Yanbin He, Zhiyang Lu, Jun Wang, Jun Shi:
A Channel Attention Based MLP-Mixer Network for Motor Imagery Decoding With EEG. 1291-1295 - Miguel Angrick, Maarten C. Ottenhoff, Lorenz Diener, Darius Ivucic, Gabriel Ivucic, Sophocles Goulis, Albert J. Colon, G. Louis Wagner, Dean J. Krusienski, Pieter Leonard Kubben, Tanja Schultz, Christian Herff:
Towards Closed-Loop Speech Synthesis from Stereotactic EEG: A Unit Selection Approach. 1296-1300 - Jaeun Phyo, Wonjun Ko, Eunjin Jeon, Heung-Il Suk:
Enhancing Contextual Encoding With Stage-Confusion and Stage-Transition Estimation for EEG-Based Sleep Staging. 1301-1305 - Hadi Habibzadeh, Kevin J. Long, Ally E. Atkins, Daphney-Stavroula Zois, James J. S. Norton:
Improving BCI-based Color Vision Assessment Using Gaussian Process Regression. 1306-1310 - Shuji Komeiji, Kai Shigemi, Takumi Mitsuhashi, Yasushi Iimura, Hiroharu Suzuki, Hidenori Sugano, Koichi Shinoda, Toshihisa Tanaka:
Transformer-Based Estimation of Spoken Sentences Using Electrocorticography. 1311-1315 - Marzieh Ajirak, Cassandra Heiselman, J. Gerald Quirk, Petar M. Djuric:
Boost Ensemble Learning for Classification of CTG SIGNALS. 1316-1320 - Yifan Wang, Ying Lan:
Multi-View Learning Based on Non-Redundant Fusion for ICU Patient Mortality Prediction. 1321-1325 - Tong Chen, Guanchao Feng, Cassandra Heiselman, J. Gerald Quirk, Petar M. Djuric:
Improving Phase-Rectified Signal Averaging for Fetal Heart Rate Analysis. 1326-1330 - Liu Yang, Cassandra Heiselman, J. Gerald Quirk, Petar M. Djuric:
Unsupervised Clustering and Analysis of Contraction-Dependent Fetal Heart Rate Segments. 1331-1335 - Orestis Apostolou, Vasileios S. Charisis, Georgios K. Apostolidis, Leontios J. Hadjileontiadis:
A Method for Detecting Coronary Artery Disease using Noisy Ultrashort Electrocardiogram Recordings. 1336-1340 - Nele Sophie Brügge, Jan Graßhoff, Arne Weigenand, Philipp Rostalski:
Multi-Task Gaussian Process Regression for the Detection of Sleep Cycles in Premature Infants. 1341-1345 - Silpa Babu, Seyedehsara Nayer, Sajan Goud Lingala, Namrata Vaswani:
Fast Low Rank Column-Wise Compressive Sensing For Accelerated Dynamic MRI. 1346-1350 - Sizhuo Liu, Philip Schniter, Rizwan Ahmad:
MRI Recovery with a Self-Calibrated Denoiser. 1351-1355 - Wanqi Zhang, Lulu Wang, Wei Chen, Yuanyuan Jia, Zhongshi He, Jinglong Du:
3d Cross-Scale Feature Transformer Network for Brain Mr Image Super-Resolution. 1356-1360 - Harsh Singh, Ognjen Arandjelovic:
Data Efficient Support Vector Machine Training Using the Minimum Description Length Principle. 1361-1365 - Yuanpin Zhou, Yao Lu:
Multiple Instance Learning with Task-Specific Multi-Level Features for Weakly Annotated Histopathological Image Classification. 1366-1370 - Guang Li, Ren Togo, Takahiro Ogawa, Miki Haseyama:
Self-Knowledge Distillation based Self-Supervised Learning for Covid-19 Detection from Chest X-Ray Images. 1371-1375 - Rui Xu, Yufeng Wang, Xinchen Ye, Pengcheng Wu, Yen-Wei Chen, Fangyi Xu, Wenchao Zhu, Chao Chen, Yong Zhou, Hongjie Hu, Xiaofeng Qu, Shoji Kido, Noriyuki Tomiyama:
Pixel-Level and Affinity-Level Knowledge Distillation for Unsupervised Segmentation of Covid-19 Lesions. 1376-1380 - Nastaran Enshaei, Moezedin Javad Rafiee, Arash Mohammadi, Farnoosh Naderkhani:
Data Shapley Value for Handling Noisy Labels: An Application in Screening Covid-19 Pneumonia from Chest CT Scans. 1381-1385 - Xiongbiao Luo:
Accurate Multiscale Selective Fusion of CT and Video Images for Real-Time Endoscopic Camera 3D Tracking in Robotic Surgery. 1386-1390 - Ruixiang Geng, Qing Liu, Shuo Feng, Yixiong Liang:
Learning Deep Pathological Features for WSI-Level Cervical Cancer Grading. 1391-1395 - Bowen Xu, Wenqiang Zhang:
Selective Scale Cascade Attention Network for Breast Cancer Histopathology Image Classification. 1396-1400 - Archishman Biswas, Hernando C. Ombao:
Frequency-Specific Non-Linear Granger Causality in a Network of Brain Signals. 1401-1405 - Kosuke Fukumori, Noboru Yoshida, Hidenori Sugano, Madoka Nakajima, Toshihisa Tanaka:
Epileptic Spike Detection by Recurrent Neural Networks with Self-Attention Mechanism. 1406-1410 - Jian Yin, Yuan Wang:
Topological Correlation of Brain Signals. 1411-1415 - Bahman Abdi-Sargezeh, Antonio Valentín, Gonzalo Alarcón, Saeid Sanei:
Online Detection of Scalp-Invisible Mesial-Temporal Brain Interictal Epileptiform Discharges from EEG. 1416-1420 - Yulu Wang, Yiwen Sun, Lei Fang, Changshui Zhang:
Leveraging Sparse Coding for EEG Based Emotion Recognition in Shooting. 1421-1425 - Weilai Li, Lanfeng Zhong, Weixi Xiang, Tongzhou Kang, Dakun Lai:
A Novel Unsupervised Autoencoder-Based HFOs Detector in Intracranial EEG Signals. 1426-1430 - Fei Ye, Zhiqiang Wang, Sheng Zhu, Xuanya Li, Kai Hu:
A Novel Convolutional Neural Network Based on Adaptive Multi-Scale Aggregation and Boundary-Aware for Lateral Ventricle Segmentation on MR images. 1431-1435 - Wentao Liu, Huihua Yang, Tong Tian, Xipeng Pan, Weijin Xu:
Multiscale Attention Aggregation Network for 2D Vessel Segmentation. 1436-1440 - Xinxin Shan, Tai Ma, Anqi Gu, Haibin Cai, Ying Wen:
TCRNet: Make Transformer, CNN and RNN Complement Each Other. 1441-1445 - Ke Zheng, Junhai Xu, Jianguo Wei:
Double Noise Mean Teacher Self-Ensembling Model for Semi-Supervised Tumor Segmentation. 1446-1450 - Siming Yuan, Qing Liu, Shenghui Liao, Fuchang Han, Haitao Wei, Yingqi Zhang:
Rethinking Computer-Aided Pelvis Segmentation. 1451-1455 - Hyunwoo Yu, Jae-hun Shim, Jaeho Kwak, Jou Won Song, Suk-Ju Kang:
Vision Transformer-Based Retina Vessel Segmentation with Deep Adaptive Gamma Correction. 1456-1460 - Yuan Wang, Moo K. Chung, Julius Fridriksson:
Spectral Permutation Test on Persistence Diagrams. 1461-1465 - Isabell Lehmann, Evrim Acar, Tanuj Hasija, Mohammad A. B. S. Akhonda, Vince D. Calhoun, Peter J. Schreier, Tülay Adali:
Multi-Task fMRI Data Fusion Using IVA and PARAFAC2. 1466-1470 - Hanlu Yang, Mohammad A. B. S. Akhonda, Fateme Ghayem, Qunfang Long, Vince D. Calhoun, Tülay Adali:
Independent Vector Analysis Based Subgroup Identification from Multisubject fMRI Data. 1471-1475 - Damian Pascual, Béni Egressy, Nicolas Affolter, Yiming Cai, Oliver Richter, Roger Wattenhofer:
Improving Brain Decoding Methods and Evaluation. 1476-1480 - Xiaofeng Liu, Fangxu Xing, Maureen Stone, Jerry L. Prince, Jangwon Kim, Georges El Fakhri, Jonghye Woo:
Cmri2spec: Cine MRI Sequence to Spectrogram Synthesis via A Pairwise Heterogeneous Translator. 1481-1485 - Wenhan Wang, Youyong Kong, Zhenghua Hou, Chunfeng Yang, Yonggui Yuan:
Spatio-Temporal Attention Graph Convolution Network for Functional Connectome Classification. 1486-1490 - Avrajit Ghosh, Michael T. McCann, Saiprasad Ravishankar:
Bilevel Learning of ℓ1 Regularizers with Closed-Form Gradients (BLORC). 1491-1495 - V. S. Unni, Ruturaj G. Gavaskar, Kunal N. Chaudhury:
Multiband Image Fusion with Controllable Error Guarantees. 1496-1500 - Zhuojie Huang, Shuping Zhao, Lunke Fei, Jigang Wu:
Weighted Graph Embedded Low-Rank Projection Learning for Feature Extraction. 1501-1505 - Vasiliki Kouni, Georgios Paraskevopoulos, Holger Rauhut, George C. Alexandropoulos:
ADMM-DAD Net: A Deep Unfolding Network for Analysis Compressed Sensing. 1506-1510 - Alexander Lin, Andrew H. Song, Berkin Bilgic, Demba E. Ba:
High-Dimensional Sparse Bayesian Learning without Covariance Matrices. 1511-1515 - Baoshun Shi, Yuxin Wang, Qiusheng Lian:
A Trainable Bounded Denoiser Using Double Tight Frame Network for Snapshot Compressive Imaging. 1516-1520 - Seobin Park, Tae Hyun Kim:
Progressive Image Super-Resolution via Neural Differential Equation. 1521-1525 - Yuhui Quan, Xinran Qin, Mingqin Chen, Yan Huang:
High-Quality Self-Supervised Snapshot Hyperspectral Imaging. 1526-1530 - Abderrahim Halimi, Jakeoung Koo, Robert A. Lamb, Gerald S. Buller, Steve McLaughlin:
Robust Bayesian Reconstruction of Multispectral Single-Photon 3D Lidar Data with Non-Uniform Background. 1531-1535 - Quentin Febvre, Ronan Fablet, Julien Le Sommer, Clément Ubelmann:
Joint Calibration and Mapping of Satellite Altimetry Data Using Trainable Variational Models. 1536-1540 - Michalis Giannopoulos, Grigorios Tsagkatakis, Panagiotis Tsakalides:
4D Convolutional Neural Networks for Multi-Spectral and Multi-Temporal Remote Sensing Data Classification. 1541-1545 - Cheick T. Cissé, Ahed Alboody, Matthieu Puigt, Gilles Roussel, Vincent Vantrepotte, Cédric Jamet, Trung-Kien Tran:
A New Deep Learning Method for Multispectral Image Time Series Completion Using Hyperspectral Data. 1546-1550 - Xinyi Wei, Hans Van Gorp, Lizeth Gonzalez-Carabarin, Daniel Freedman, Yonina C. Eldar, Ruud J. G. van Sloun:
Image Denoising with Deep Unfolding And Normalizing Flows. 1551-1555 - Rohit Ranade, Yangwen Liang, Shuangquan Wang, Dongwoon Bai, Jungwon Lee:
3D Texture Super Resolution via the Rendering Loss. 1556-1560 - Changhun Sung, Byungdeok Kim:
Bundle ICP with Virtual Depth for Hand-Held 3D Scanner. 1561-1565 - Julián Tachella, Michael P. Sheehan, Mike E. Davies:
Sketched RT3D: How to Reconstruct Billions of Photons Per Second. 1566-1570 - Naveen Kuruba, Neel Badadare, Vikram Narayan, Satish Putta:
A Generic Method to Estimate Camera Extrinsic Parameters. 1571-1575 - Yash Sanghvi, Abhiram Gnanasambandan, Stanley H. Chan:
Photon-Limited Deblurring Using Algorithm Unrolling. 1576-1580 - Wenpeng Xing, Jie Chen:
NEX+: Novel View Synthesis with Neural Regularisation Over Multi-Plane Images. 1581-1585 - Daniel Nicholls, Alex W. Robinson, Jack Wells, Amirafshar Moshtaghpour, Mounib Bahri, Angus I. Kirkland, Nigel D. Browning:
Compressive Scanning Transmission Electron Microscopy. 1586-1590 - Simon Welker, Tal Peer, Henry N. Chapman, Timo Gerkmann:
Deep Iterative Phase Retrieval for Ptychography. 1591-1595 - Vinayak Killedar, Chandra Sekhar Seelamantula:
Compressive Phase Retrieval Based On Sparse Latent Generative Priors. 1596-1600 - Abdulrahman M. Alanazi, Singanallur V. Venkatakrishnan, Hector J. Santos-Villalobos, Gregery T. Buzzard, Charles A. Bouman:
Model-Based Reconstruction for Collimated Beam Ultrasound Systems. 1601-1605 - Tim Straubinger, Robert Xiao, Helge Rhodin:
Learned Acoustic Reconstruction Using Synthetic Aperture Focusing. 1606-1610 - Guanze Liu, Bo Xu, Han Huang, Cheng Lu, Yandong Guo:
SDETR: Attention-Guided Salient Object Detection with Transformer. 1611-1615 - Kristian Fischer, Markus Hofbauer, Christopher B. Kuhn, Eckehard G. Steinbach, André Kaup:
Evaluation of Video Coding for Machines without Ground Truth. 1616-1620 - Thuc Nguyen Huu, Vinh Van Duong, Jonghoon Yim, Byeungwoo Jeon:
Raw Plenoptic Video Coding Under Hexagonal Lattice Resolution of Motion Vectors. 1621-1624 - Kianoush Jafari, Alireza Aminlou, Miska M. Hannuksela:
Comparison of Boundary Artifact Removal Methods in Coding of Generalized Cubemap Projection Using VVC. 1625-1629 - Shen Wang, Yibing Fu, Chen Zhu, Li Song, Wenjun Zhang:
Low-Complexity Multi-Model CNN in-Loop Filter for AVS3. 1630-1634 - Junyan Huo, Yu Sun, Haixin Wang, Shuai Wan, Fuzheng Yang, Ming Li:
Unified Matrix Coding for NN Originated MIP in H.266/VVC. 1635-1639 - Yuanyuan Xu, Taoyu Yang, Zengjie Tan, Haolun Lan:
FOV-Based Coding Optimization for 360-Degree Virtual Reality Videos. 1640-1644 - Jian Wang, Xinyue Li, Wei Song, Zhichao Zhang, Weiqi Guo:
Multi-Hierarchy Proxy Structure for Deep Metric Learning. 1645-1649 - Michail Kaseris, Ioannis Mademlis, Ioannis Pitas:
Exploiting Caption Diversity for Unsupervised Video Summarization. 1650-1654 - Wanqian Zhang, Dayan Wu, Chule Yang, Bo Li, Weiping Wang:
Clustering and Separating Similarities for Deep Unsupervised Hashing. 1655-1659 - Junying Huang, Fan Chen, Keze Wang, Liang Lin, Dongyu Zhang:
Enhancing Prototypical Few-Shot Learning By Leveraging The Local-Level Strategy. 1660-1664 - Chao Zhou, Miguel R. D. Rodrigues:
Blind Unmixing Using A Double Deep Image Prior. 1665-1669 - Yi Liu, Yanjie Liang, Qiangqiang Wu, Liming Zhang, Hanzi Wang:
A New Framework for Multiple Deep Correlation Filters Based Object Tracking. 1670-1674 - Bo-Hao Chen, Hsiang-Yin Cheng, Jia-Li Yin:
Adaptive Actor-Critic Bilateral Filter. 1675-1679 - Niklas Kämper, Joachim Weickert:
Domain Decomposition Algorithms for Real-Time Homogeneous Diffusion Inpainting in 4K. 1680-1684 - Michiaki Tatsubori, Takao Moriyama, Tatsuya Ishikawa, Paolo Fraccaro, Anne Jones, Blair Edwards, Julian Kuehnert, Sekou L. Remy:
Deep Temporal Interpolation of Radar-Based Precipitation. 1685-1689 - Zikai Sun, Thierry Blu:
A Nonlinear Steerable Complex Wavelet Decomposition of Images. 1690-1694 - Xiang Cao, Haibo Shen, Liangqi Zhang, Yihao Luo, Tianjiang Wang:
Kernel Estimation Network for Blind Super-Resolution. 1695-1699 - Yixiong Zhang, Zhipeng Su, Feng Qi, Jianyang Zhou, Xiao-Ping Zhang:
Terahertz Image Restoration Benchmarking Dataset. 1700-1704 - Xingrun Xing, Yalong Jiang, Baochang Zhang, Wenrui Ding, Yangguang Li, Hongguang Li, Huan Peng:
Binary Dense Predictors for Human Pose Estimation Based on Dynamic Thresholds and Filtering. 1705-1709 - Haidong Zhu, Zhaoheng Zheng, Mohammad Soleymani, Ram Nevatia:
Self-Supervised Learning for Sentiment Analysis via Image-Text Matching. 1710-1714 - Wei-Yu Lee, Jheng-Yu Wang, Yu-Chiang Frank Wang:
Domain-Agnostic Meta-Learning for Cross-Domain Few-Shot Classification. 1715-1719 - Dahyun Kim, Sunjae Yoon, Ji Woo Hong, Chang D. Yoo:
Semantic Association Network for Video Corpus Moment Retrieval. 1720-1724 - Nida Itrat Abbasi, Siyang Song, Hatice Gunes:
Statistical, Spectral and Graph Representations for Video-Based Facial Expression Recognition in Children. 1725-1729 - Nakyeong Yang, Taegwan Kang, Kyomin Jung:
Deriving Explainable Discriminative Attributes Using Confusion About Counterfactual Class. 1730-1734 - Chenghu Du, Feng Yu, Minghua Jiang, Yaxin Zhao, Xiong Wei, Tao Peng, Xinrong Hu:
Realistic Monocular-To-3d Virtual Try-On Via Multi-Scale Characteristics Capture. 1735-1739 - Ehsan Pajouheshgar, Tong Zhang, Sabine Süsstrunk:
Optimizing Latent Space Directions for Gan-Based Local Image Editing. 1740-1744 - Jingning Xu, Benlai Tang, Mingjie Wang, Siyuan Bian, Wenyi Guo, Xiang Yin, Zejun Ma:
Towards Using Clothes Style Transfer for Scenario-Aware Person Video Generation. 1745-1749 - Somi Jeong, Jiyoung Lee, Kwanghoon Sohn:
Multi-Domain Unsupervised Image-to-Image Translation with Appearance Adaptive Convolution. 1750-1754 - Yifan Yuan, Siteng Ma, Junping Zhang:
VR-FAM: Variance-Reduced Encoder with Nonlinear Transformation for Facial Attribute Manipulation. 1755-1759 - George Eskandar, Mohamed Abdelsamad, Karim Armanious, Shuai Zhang, Bin Yang:
Wavelet-Based Unsupervised Label-to-Image Translation. 1760-1764 - Sadid Sahami, Gene Cheung, Chia-Wen Lin:
Fast Graph Sampling for Short Video Summarization Using Gershgorin Disc Alignment. 1765-1769 - Xiaopeng Ke, Boyu Chang, Hao Wu, Fengyuan Xu, Sheng Zhong:
Towards Practical and Efficient Long Video Summary. 1770-1774 - Sunhee Hwang, Minsong Ki, Seung-Hyun Lee, Sanghoon Park, Byoung-Ki Jeon:
Cut And Continuous Paste Towards Real-Time Deep Fall Detection. 1775-1779 - Aditya Singh, Saheb Chhabra, Puspita Majumdar, Richa Singh, Mayank Vatsa:
Mannet: A Large-Scale Manipulated Image Detection Dataset And Baseline Evaluations. 1780-1784 - Laura Kart, Niv Cohen:
Approaches Toward Physical and General Video Anomaly Detection. 1785-1789 - Suiyi Ling, Andreas Pastor, Junle Wang, Patrick Le Callet:
Considering User Agreement in Learning to Predict the Aesthetic Quality. 1790-1794 - Qi Zheng, Zhengzhong Tu, Yibo Fan, Xiaoyang Zeng, Alan C. Bovik:
No-Reference Quality Assessment of Variable Frame-Rate Videos Using Temporal Bandpass Statistics. 1795-1799 - Joel Jung, Alexandre Giraud, Meijia Song, Songnan Li, Xiang Li, Shan Liu:
Towards Joint Frame-Level and MOS Quality Predictions with Low-Complexity Objective Models. 1800-1804 - Satyam Mohla, Anshul Nasery, Biplab Banerjee:
Teaching CNNs to Mimic Human Visual Cognitive Process & Regularise Texture-Shape Bias. 1805-1809 - Shaoguo Wen, Suiyi Ling, Junle Wang, Ximing Chen, Yanqing Jing, Patrick Le Callet:
Subjective And Objective Quality Assessment Of Mobile Gaming Video. 1810-1814 - Yanzhe Zhong, Huadong Pan, Bangjie Tang, Zhonggeng Liu, Yiming Zhu, Jun Yin:
ER-PIQA: A Task-Guided Pedestrian Image Quality Assessment Via Embedding Reconstruction. 1815-1819 - Mohsen Zand, Haleh Damirchi, Andrew Farley, Mahdiyar Molahasani, Michael A. Greenspan, Ali Etemad:
Multiscale Crowd Counting and Localization By Multitask Point Supervision. 1820-1824 - Yu-Zhang Chen, Tsung-Jung Liu, Kuan-Hsien Liu:
Super-Resolution of Satellite Images by two-Dimensional RRDB and Edge-Enhancement Generative Adversarial Network. 1825-1829 - Saurabh Sahu, Palash Goyal:
Leveraging Local Temporal Information for Multimodal Scene Classification. 1830-1834 - Menghao Li, Mingtao Pei, Wei Liang:
Predicting Human Motion Using Key Subsequences. 1835-1839 - Ruxin Ding, Jianfeng Ren, Heng Yu, Jiawei Li:
Dynamic Texture Recognition Using PDV Hashing and Dictionary Learning on Multi-Scale Volume Local Binary Pattern. 1840-1844 - Qing Gao, Mingtao Pei, Hongyu Shen:
Do You Live a Healthy Life? Analyzing Lifestyle by Visual Life Logging. 1845-1849 - Liping Huang, Taizo Suzuki:
Weighted Wavelet-Based Spectral-Spatial Transforms For CFA-Sampled Raw Camera Image Compression Considering Image Features. 1850-1854 - Dongyang Li, Zhenhong Sun, Zhiyu Tan, Xiuyu Sun, Fangyi Zhang, Yichen Qian, Hao Li:
Jmpnet: Joint Motion Prediction for Learning-Based Video Compression. 1855-1859 - Fabian Brand, Christian Herglotz, André Kaup:
A Low-Parametric Model for Bit-Rate Estimation of VVC Residual Coding. 1860-1864 - Vignesh V. Menon, Hadi Amirpour, Mohammed Ghanbari, Christian Timmerer:
OPTE: Online Per-Title Encoding for Live Video Streaming. 1865-1869 - Kedeng Tong, Xin Jin, Chen Wang, Fan Jiang:
SADN: Learned Light Field Image Compression with Spatial-Angular Decorrelation. 1870-1874 - Wenfeng Li, Zongcai Du, Hao He, Jie Tang, Gangshan Wu:
Hierarchical Feature Aggregation Network for Deep Image Compression. 1875-1879 - Tianyou Chen, Xiaoguang Hu, Jin Xiao, Guofeng Zhang, Shaojie Wang:
Accurate Instance Segmentation Via Collaborative Learning. 1880-1884 - Jiehua Zhang, Zhuo Su, Yanghe Feng, Xin Lu, Matti Pietikäinen, Li Liu:
Dynamic Binary Neural Network by Learning Channel-Wise Thresholds. 1885-1889 - Wanyu Wu, Wei Wang, Kui Jiang, Xin Xu, Ruimin Hu:
Self-Supervised Learning on A Lightweight Low-Light Image Enhancement Model with Curve Refinement. 1890-1894 - Jingquan Wang, Jing Xu, Yu Pan, Zenglin Xu:
Semantically Proportional Patchmix for Few-Shot Learning. 1895-1899 - Zhikui Chen, Tiandong Ji, Suhua Zhang, Fangming Zhong:
Noise Suppression for Improved Few-Shot Learning. 1900-1904 - Cheryl Sze Yin Wong, Guo Yang, Arulmurugan Ambikapathi, Savitha Ramasamy:
Online Continual Learning Using Enhanced Random Vector Functional Link Networks. 1905-1909 - Miaohua Zhang, Yongsheng Gao, Jun Zhou:
A Generalized Kernel Risk Sensitive Loss for Robust Two-Dimensional Singular Value Decomposition. 1910-1914 - Xiangling Ding, Pu Huang, Dengyong Zhang, Xianfeng Zhao:
Video Frame Interpolation via Local Lightweight Bidirectional Encoding with Channel Attention Cascade. 1915-1919 - Yue Lv, Wenming Yang, Wangmeng Zuo, Qingmin Liao, Rui Zhu:
Sain: Similarity-Aware Video Frame Interpolation. 1920-1924 - Zejia Fan, Jiaying Liu, Wenhan Yang, Wei Xiang, Zongming Guo:
Self-Learned Video Super-Resolution with Augmented Spatial and Temporal Context. 1925-1929 - Jiahui Liu, Mingcai Zhou, Meng Xiao:
Deformable Convolution Dense Network for Compressed Video Quality Enhancement. 1930-1934 - Siying Liu, Roxana Alexandru, Pier Luigi Dragotti:
Convolutional ISTA Network with Temporal Consistency Constraints for Video Reconstruction from Event Cameras. 1935-1939 - Xuezhi Tong, Rui Wang, Chuan Wang, Sanyi Zhang, Xiaochun Cao:
PMP-NET: Rethinking Visual Context for Scene Graph Generation. 1940-1944 - Feicheng Huang, Zhixin Li:
Improve Image Captioning Via Relation Modeling. 1945-1949 - Lei Cui, Huan Peng, Yangguang Li, Chuming Li, Xingrun Xing:
Equal Loss: A Simple Loss Function for Noise Robust Learning. 1950-1954 - Boyang Wan, Wenhui Jiang, Yuming Fang:
Informative Attention Supervision for Grounded Video Description. 1955-1959 - Jialu Zhang, Qian Zhang, Jianfeng Ren, Yitian Zhao, Jiang Liu:
Spatial-Context-Aware Deep Neural Network for Multi-Class Image Classification. 1960-1964 - Hongjun Wu, Mengzhu Li, Yongcheng Liu, Hongzhe Liu, Cheng Xu, Xuewei Li:
Transtl: Spatial-Temporal Localization Transformer for Multi-Label Video Classification. 1965-1969 - Kyuyeon Kim, Junsik Jung, Woo Jae Kim, Sung-Eui Yoon:
Deep Video Inpainting Guided by Audio-Visual Self-Supervision. 1970-1974 - Guangwei Li, Xuenan Xu, Mengyue Wu, Kai Yu:
Navigating Audio-Visual Event Detection Across Mismatched Modalities. 1975-1979 - Donglai Wei, Chen-Geng Liu, Yang Liu, Jing Liu, Xiao-Guang Zhu, Xinhua Zeng:
Look, Listen and Pay More Attention: Fusing Multi-Modal Information for Video Violence Detection. 1980-1984 - Changsheng Xu, Zhenlong Xu, Yifan He, Shuigeng Zhou, Jihong Guan:
Multi-Modal Learning with Text Merging for TEXTVQA. 1985-1989 - Ping Wang, Yijie Cao, Lei Lu:
A Novel Part Feature Integration and Fusion Method for Fine-Grained Vehicle Recognition. 1990-1994 - Yiqiang Chen, Feng Liu, Ke Pei:
Monocular Vehicle 3D Bounding Box Estimation Using Homograhy and Geometry in Traffic Scene. 1995-1999 - Xin Yi, Bo Ma, Jiahao Wu:
FSM: Feature Sampling Module for Object Detection. 2000-2004 - Senyun Kuang, Shijin Meng, Bo Xiao, Lv Tang, Bo Li:
Rethinking Two-B-Real Net for Real-Time Salient Object Detection. 2005-2009 - Bo Cui, Hui Qu, Xuhui Huang, Shan Yu:
Balanced Ranking and Sorting For Class Incremental Object Detection. 2010-2014 - Yihao Luo, Xiang Cao, Juntao Zhang, Leixilan Pan, Tianjiang Wang, Qi Feng:
Multi-Scale Reinforcement Learning Strategy for Object Detection. 2015-2019 - Zhihao Wu, Chengliang Liu, Chao Huang, Jie Wen, Yong Xu:
Deep Object Detection with Example Attribute Based Prediction Modulation. 2020-2024 - Shanzhi Yin, Chao Li, Youneng Bao, Yongsheng Liang, Fanyang Meng, Wei Liu:
Universal Efficient Variable-Rate Neural Image Compression. 2025-2029 - Bowen Li, Xin Yao, Chao Li, Youneng Bao, Fanyang Meng, Yongsheng Liang:
AdderIC: Towards Low Computation Cost Image Compression. 2030-2034 - Saiping Zhang, Luis Herranz, Marta Mrak, Marc Górriz Blanch, Shuai Wan, Fuzheng Yang:
DCNGAN: A Deformable Convolution-Based GAN with QP Adaptation for Perceptual Quality Enhancement of Compressed Video. 2035-2039 - Anne-Flore Perrin, Yejing Xie, Tao Zhang, Yiting Liao, Junlin Li, Patrick Le Callet:
Specialised Video Quality Model For Enhanced User Generated Content (UGC) With Special Effects. 2040-2044 - Andreas Pastor, Lukás Krasula, Xiaoqing Zhu, Zhi Li, Patrick Le Callet:
Improving Maximum Likelihood Difference Scaling Method To Measure Inter Content Scale. 2045-2049 - Ao-Xiang Zhang, Yuan-Gen Wang:
Texture Information Boosts Video Quality Assessment. 2050-2054 - Keisuke Ozawa:
Plug-and-Play and Relay Regularizations on Noisy Low Rank Tensor Completion for Snapshot Multispectral Image Restoration. 2055-2059 - Ashish Tiwari, Shanmuganathan Raman:
LERPS: Lighting Estimation and Relighting for Photometric Stereo. 2060-2064 - Huiyu Duan, Xiongkuo Min, Wei Shen, Guangtao Zhai:
A Unified Two-Stage Model for Separating Superimposed Images. 2065-2069 - Siyu Huang, Haoyi Xiong, Tianyang Wang, Bihan Wen, Qingzhong Wang, Zeyu Chen, Jun Huan, Dejing Dou:
Parameter-Free Style Projection for Arbitrary Image Style Transfer. 2070-2074 - Yangfan Sun, Zhu Li, Li Li, Shizheng Wang, Wei Gao:
Optimization of Compressive Light Field Display in Dual-Guided Learning. 2075-2079 - Yusuke Matsui, Yoshiki Imaizumi, Naoya Miyamoto, Naoki Yoshifuji:
ARM 4-BIT PQ: SIMD-Based Acceleration for Approximate Nearest Neighbor Search on ARM. 2080-2084 - Chao Wang, Yi Gu, Jie Li, Xinlei He, Zirui Zhang, Yuting Gao, Chentao Wu:
Iterative Learning for Distorted Image Restoration. 2085-2089 - Xiaoyu Zhang, Wei Gao, Hui Yuan, Ge Li:
JE2NET: Joint Exploitation and Exploration in Reinforcement Learning Based Image Restoration. 2090-2094 - Kun Yang, Juan Zhang, Xiaoqi Lang:
Multiple Patch-Aware Network for Faster Real-World Image Dehazing. 2095-2099 - Zhenyu Tang, Long Ma, Xiaoke Shang, Xin Fan:
Learning to Fuse Heterogeneous Features for Low-Light Image Enhancement. 2100-2104 - Jiachun Li, Kunkun Qin, Ruotao Xu, Hui Ji:
Deep Scale-Aware Image Smoothing. 2105-2109 - Yanbo Gao, Menghu Jia, Shuai Li, Xun Cai, Mao Ye, Frédéric Dufaux:
A Multiscale Gradient-Backpropagation Optimization Framework for Deformable Convolution Based Compressed Video Enhancement. 2110-2114 - Tomohiro Hayase, Suguru Yasutomi, Nakamasa Inoue:
Downstream Augmentation Generation For Contrastive Learning. 2115-2119 - Chao Dong, Qi Ye, Wenchao Meng, Kaixiang Yang:
Few-Shot Learning with Improved Local Representations via Bias Rectify Module. 2120-2124 - Pichao Wang, Fan Wang, Hao Li:
Image-to-Video Re-Identification via Mutual Discriminative Knowledge Transfer. 2125-2129 - Fangxin Liu, Wenbo Zhao, Yongbiao Chen, Zongwu Wang, Fei Dai:
DynSNN: A Dynamic Approach to Reduce Redundancy in Spiking Neural Networks. 2130-2134 - Yongsheng Zhang, Qing Liu, Yang Zhao, Yixiong Liang:
MEJIGCLU: More Effective Jigsaw Clustering For Unsupervised Visual Representation Learning. 2135-2139 - Cheng Zhuang, Yunlian Sun:
Ganet: Unary Attention Reaches Pairwise Attention Via Implicit Group Clustering in Light-Weight CNNs. 2140-2144 - Ting-Wei Chang, Wei-Chen Chiu, Ching-Chun Huang:
Find The Way Back: Invertible Kernel Estimator For Blind Image Super-Resolution. 2145-2149 - Haoquan Wang, Gang Zhang, Zhichun Lei:
Fine-Grained Dynamic Loss for Accurate Single-Image Super-Resolution. 2150-2154 - Gongzhe Li, Linwei Qiu, Haopeng Zhang, Fengying Xie, Zhiguo Jiang:
Multi-Frame Super-Resolution With Raw Images Via Modified Deformable Convolution. 2155-2159 - Yan Wang, Yao Lu, Shunzhou Wang, Wenyao Zhang, Zijian Wang:
Local-Global Feature Aggregation for Light Field Image Super-Resolution. 2160-2164 - Hao He, Zongcai Du, Wenfeng Li, Jie Tang, Gangshan Wu:
Pyramid Fusion Attention Network For Single Image Super-Resolution. 2165-2169 - Xian Zhong, Zhuo Zhou, Wenxuan Liu, Kui Jiang, Xuemei Jia, Wenxin Huang, Zheng Wang:
VCD: View-Constraint Disentanglement for Action Recognition. 2170-2174 - Chengming Zou, Ducheng Yuan, Long Lan, Haoang Chi:
Privacy-Preserving Action Recognition. 2175-2179 - Hongcheng Zhang, Xu Zhao:
Spatio-Temporal Motion Aggregation Network for Video Action Detection. 2180-2184 - Yanhao Jing, Feng Wang:
TP-VIT: A Two-Pathway Vision Transformer for Video Action Recognition. 2185-2189 - Yang Liu, Jing Liu, Xiaoguang Zhu, Donglai Wei, Xiaohong Huang, Liang Song:
Learning Task-Specific Representation for Video Anomaly Detection with Spatial-Temporal Attention. 2190-2194 - Mengzhu Li, Hongjun Wu, Yongcheng Liu, Hongzhe Liu, Cheng Xu, Xuewei Li:
W-ART: Action Relation Transformer for Weakly-Supervised Temporal Action Localization. 2195-2199 - Jinpeng Liu, Song Wu, Dehong He, Guoqiang Xiao:
MS-ROCANet: Multi-Scale Residual Orthogonal-Channel Attention Network for Scene Text Detection. 2200-2204 - Shan Liu, Guoqiang Xiao, Xiaohui Xu, Song Wu:
Bi-Directional Normalization and Color Attention-Guided Generative Adversarial Network for Image Enhancement. 2205-2209 - Zhikui Chen, Han Wang, Suhua Zhang, Fangming Zhong:
Dual-Attention Network for Few-Shot Segmentation. 2210-2214 - Jiapeng Li, Ge Li, Thomas H. Li:
Attention Guided Invariance Selection for Local Feature Descriptors. 2215-2219 - Jiahao Wang, Mingdeng Cao, Shuwei Shi, Baoyuan Wu, Yujiu Yang:
Attention Probe: Vision Transformer Distillation in the Wild. 2220-2224 - Bin Jiang, Fangqiang Xu, Jun Xia, Chao Yang, Wei Huang, Yun Huang:
Stacked Multi-Scale Attention Network for Image Colorization. 2225-2229 - Han Wang, Yali Li, Shengjin Wang:
CRPN: Distinguish Novel Categories Via Class-Relevant Region Proposal Network for Few-Shot Object Detection. 2230-2234 - Zhishan Li, Mingmu Chen, Yifan He, Lei Xie, Hongye Su:
An Efficient Framework for Detection and Recognition of Numerical Traffic Signs. 2235-2239 - Zongyao Li, Ren Togo, Takahiro Ogawa, Miki Haseyama:
Divergence-Guided Feature Alignment for Cross-Domain Object Detection. 2240-2244 - Jun Wang, Hefeng Zhou, Xiaohan Yu:
PGTRNET: Two-Phase Weakly Supervised Object Detection with Pseudo Ground Truth Refinement. 2245-2249 - Weijie Liu, Chong Wang, Shenghao Yu, Chenchen Tao, Jun Wang, Jiafei Wu:
Novel Instance Mining with Pseudo-Margin Evaluation for Few-Shot Object Detection. 2250-2254 - Chuang Yang, Mulin Chen, Yuan Yuan, Qi Wang:
BiP-Net: Bidirectional Perspective Strategy Based Arbitrary-Shaped Text Detection Network. 2255-2259 - Tim Heydrich, Yimin Yang, Xiangyu Ma, Yu Liu, Shan Du:
A Novel Lightweight Network for Fast Monocular Depth Estimation. 2260-2264 - Tim Heydrich, Yimin Yang, Shan Du:
A Lightweight Self-Supervised Training Framework for Monocular Depth Estimation. 2265-2269 - Hao Liu, Hui Yuan, Raouf Hamzaoui, Wei Gao, Shuai Li:
PU-Refiner: A Geometry Refiner with Adversarial Learning for Point Cloud Upsampling. 2270-2274 - Bo-Fan Chen, Yang-Ming Yeh, Yi-Chang Lu:
CF-Net: Complementary Fusion Network for Rotation Invariant Point Cloud Completion. 2275-2279 - Zihao Zhang, Nan Sang, Xupeng Wang:
TH-Net: A Method Of Single 3d Object Tracking Based On Transformers And Hausdorff Distance. 2280-2284 - Hengxin Feng, Weifeng Liu, Yanjiang Wang, Baodi Liu:
Enrich Features for Few-Shot Point Cloud Classification. 2285-2289 - Jaewoo Lee, Daeul Park, Dongwook Lee, Daehyun Ji:
Semi-Supervised 360° Depth Estimation from Multiple Fisheye Cameras with Pixel-Level Selective Loss. 2290-2294 - Wei Zhong, Yazhi Yuan, Xinchen Ye, Dian Zheng, Rui Xu:
Underwater Stereo Matching Via Unsupervised Appearance And Feature Adaptation Networks. 2295-2299 - Pei Tang, Liangrui Peng, Ruijie Yan, Haodong Shi, Gang Yao, Changsong Liu, Jie Li, Yuqi Zhang:
Domain Adaptation via Mutual Information Maximization for Handwriting Recognition. 2300-2304 - Ang Li, Jian Hu, Chilin Fu, Xiaolu Zhang, Jun Zhou:
Attribute-Conditioned Face Swapping Network for Low-Resolution Images. 2305-2309 - Ying Bian, Peng Zhang, Jingjing Wang, Chunmao Wang, Shiliang Pu:
Learning Multiple Explainable and Generalizable Cues for Face Anti-Spoofing. 2310-2314 - Bastien Laville, Laure Blanc-Féraud, Gilles Aubert:
Off-The-Grid Covariance-Based Super-Resolution Fluctuation Microscopy. 2315-2319 - Zhiyuan Zha, Bihan Wen, Xin Yuan, Jiantao Zhou, Ce Zhu:
Simultaneous Nonlocal Low-Rank And Deep Priors For Poisson Denoising. 2320-2324 - Yiming Liu, Yanni Zhang, Qiang Li, Jun Kong, Miao Qi, Jianzhong Wang:
Double Closed-Loop Network for Image Deblurring. 2325-2329 - Ying Zhang, Youjun Xiang, Lei Cai, Yuli Fu, Wanliang Huo, Junjun Xia:
Single Image De-Raining with High-Low Frequency Guidance. 2330-2334 - Wu Yang, Wuzhen Shi:
Detail Generation and Fusion Networks for Image Inpainting. 2335-2339 - Hong Liu, Ying Zhu, Guoliang Hua, Weibo Huang, Runwei Ding:
Adaptive Weighted Network With Edge Enhancement Module For Monocular Self-Supervised Depth Estimation. 2340-2344 - Diclehan Karakaya, Oguzhan Ulucan, Mehmet Türkan:
Pas-Mef: Multi-Exposure Image Fusion Based On Principal Component Analysis, Adaptive Well-Exposedness And Saliency Map. 2345-2349 - Miaoju Ban, Runwei Ding, Jian Zhang, Tianyu Guo, Tao Wang:
PDD-Net: A Precise Defect Detection Network Based on Point Set Representation. 2350-2354 - Renhui Zhang, Tiancheng Lin, Rui Zhang, Yi Xu:
Solving The Long-Tailed Problem Via Intra- And Inter-Category Balance. 2355-2359 - Zhanchao Huang, Wei Li, Ran Tao:
Extracting and Distilling Direction-Adaptive Knowledge for Lightweight Object Detection in Remote Sensing Images. 2360-2364 - Xiaoliu Luo, Jing Luo, Zhao Duan, Jin Tan, Taiping Zhang:
Pseudo-Interacting Guided Network for Few-Shot Segmentation. 2365-2369 - Yuehui Wang, Qing Wang, Dongyu Zhang:
Few-Shot Generation By Modeling Stereoscopic Priors. 2370-2374 - Kohei Matsuzaki, Kei Kawamura:
Relative Viewpoint Estimation Based on Structured 3d Representation Alignment. 2375-2379 - Minxiang Ye, Yifei Zhang, Shiqiang Zhu, Anhuan Xie, Dan Zhang:
Deep Markov Clustering for Panoptic Segmentation. 2380-2384 - Libo Liu, Chengjian Huang, Chunsheng Cai, Xiaodong Zhang, Qingmao Hu:
Multi-Task Learning Improves the Brain Stroke Lesion Segmentation. 2385-2389 - Hongyi Wang, Shiao Xie, Lanfen Lin, Yutaro Iwamoto, Xian-Hua Han, Yen-Wei Chen, Ruofeng Tong:
Mixed Transformer U-Net for Medical Image Segmentation. 2390-2394 - Wankang Zeng, Wenkang Fan, Dongfang Shen, Yinran Chen, Xiongbiao Luo:
Contrastive Translation Learning For Medical Image Segmentation. 2395-2399 - Tianfang Meng, Wenqiang Zhang:
Fast Video Object Segmentation via Dynamic YOLACT. 2400-2404 - Tiyu Fang, Zhen Liang, Xiuli Shao, Zihao Dong, Jinping Li:
Depth Removal Distillation for RGB-D Semantic Segmentation. 2405-2409 - Lingzhao Ju, Xu Zhao:
Mask-Based Attention Parallel Network for in-the-Wild Facial Expression Recognition. 2410-2414 - Lifang Zhou, Siqin Li, Yi Wang, Junlin Liu:
SDNET: Lightweight Facial Expression Recognition For Sample Disequilibrium. 2415-2419 - Mengting Wei, Wenming Zheng, Yuan Zong, Xingxun Jiang, Cheng Lu, Jiateng Liu:
A Novel Micro-Expression Recognition Approach Using Attention-Based Magnification-Adaptive Networks. 2420-2424 - Weidong Tian, Housen Zhang, Chen Peng, Zhong-Qiu Zhao:
Lipreading Model Based On Whole-Part Collaborative Learning. 2425-2429 - Ahmed Al-Hindawi, Marcela P. Vizcaychipi, Yiannis Demiris:
What Is The Patient Looking At? Robust Gaze-Scene Intersection Under Free-Viewing Conditions. 2430-2434 - Haoxian Huang, Luqian Ren, Zhuo Yang, Yinwei Zhan, Qieshi Zhang, Jujian Lv:
GAZEATTENTIONNET: Gaze Estimation with Attentions. 2435-2439 - Yang Yang, Yonghua Zhang, Xiaojie Guo:
Low-Light Image Enhancement via Feature Restoration. 2440-2444 - Xiaoyu Zhang, Wei Gao:
HIRL: Hybrid Image Restoration Based on Hierarchical Deep Reinforcement Learning via Two-Step Analysis. 2445-2449 - Chengrong Wang, Chenjie Cao, Yanwei Fu, Xiangyang Xue:
High-Fidelity Portrait Editing Via Exploring Differentiable Guided Sketches from the Latent Space. 2450-2454 - Zhihong Pan:
Learning Adjustable Image Rescaling with Joint Optimization of Perception and Distortion. 2455-2459 - Wenjun Chen, Chunling Yang, Xin Yang:
FSOINET: Feature-Space Optimization-Inspired Network For Image Compressive Sensing. 2460-2464 - Keuntek Lee, Yeong Il Jang, Nam Ik Cho:
Disentangled Feature-Guided Multi-Exposure High Dynamic Range Imaging. 2465-2469 - Peilun Du, Xiaolong Zheng, Liang Liu, Huadong Ma:
Defending Against Universal Attack Via Curvature-Aware Category Adversarial Training. 2470-2474 - Yunjian Zhang, Yanwei Liu, Jinxia Liu, Pengwei Zhan, Liming Wang, Zhen Xu:
SP Attack: Single-Perspective Attack for Generating Adversarial Omnidirectional Images. 2475-2479 - Yachun Li, Ying Lian, Jingjing Wang, Yuhui Chen, Chunmao Wang, Shiliang Pu:
Few-Shot One-Class Domain Adaptation Based On Frequency For Iris Presentation Attack Detection. 2480-2484 - Margarita Geleta, Cristina Punti, Kevin McGuinness, Jordi Pons, Cristian Canton, Xavier Giró-i-Nieto:
Pixinwav: Residual Steganography for Hiding Pixels in Audio. 2485-2489 - Yurui Xie, Ling Guan:
A Semi-Handcrafted Keypoint Detector with Discriminative Feature Encoding. 2490-2494 - Antonio Agudo:
Safari from Visual Signals: Recovering Volumetric 3d Shapes. 2495-2499 - Farshad G. Veshki, Sergiy A. Vorobyov:
Coupled Feature Learning Via Structured Convolutional Sparse Coding for Multimodal Image Fusion. 2500-2504 - Rongtao Xu, Changwei Wang, Bin Fan, Yuyang Zhang, Shibiao Xu, Weiliang Meng, Xiaopeng Zhang:
DOMAINDESC: Learning Local Descriptors With Domain Adaptation. 2505-2509 - Arya Aftab, Alireza Morsali, Shahrokh Ghaemmaghami:
Multi-Head Relu Implicit Neural Representation Networks. 2510-2514 - ZhaoJing Zhou, Yun Zhou, Zhuqing Jiang, Aidong Men, Haiying Wang:
An Efficient Method for Model Pruning Using Knowledge Distillation with Few Samples. 2515-2519 - Guangyu Ren, Tianhong Dai, Tania Stathaki:
Adaptive Intra-Group Aggregation for Co-Saliency Detection. 2520-2524 - Tanmoy Mukherjee, Nikos Deligiannis:
Novel Class Discovery: A Dependency Approach. 2525-2528 - Yanfeng Liu, Qiang Li, Yuan Yuan, Qi Wang:
Single-Shot Balanced Detector for Geospatial Object Detection. 2529-2533 - Ruixin Shi, Junzheng Zhang, Yong Li, Shiming Ge:
Regularized Latent Space Exploration for Discriminative Face Super-Resolution. 2534-2538 - Yi Hou, Chengyang Li, Yuheng Lu, Liping Zhu, Yuan Li, Huizhu Jia, Xiaodong Xie:
Enhancing and Dissecting Crowd Counting by Synthetic Data. 2539-2543 - Chenghu Du, Feng Yu, Minghua Jiang, Xiong Wei, Tao Peng, Xinrong Hu:
Multi-Pose Virtual Try-On Via Self-Adaptive Feature Filtering. 2544-2548 - Jie Zhang, Yi Xiao, Guo Chen, Qingping Sun, Fangqiang Xu, Chi-Sing Leung:
Histogram-Guided Semantic-Aware Colorization. 2549-2553 - Green Rosh K. S, Nikhil Krishnan, B. H. Pawan Prasad, Sachin Deepak Lomte:
Content Preserving Scale Space Network for Fast Image Restoration from Noisy-Blurry Pairs. 2554-2558 - Rong Bao, Yurui Ren, Ge Li, Wei Gao, Shan Liu:
Flow-Based Point Cloud Completion Network with Adversarial Refinement. 2559-2563 - Zezeng Li, Weimin Wang, Na Lei, Rui Wang:
Weakly Supervised Point Cloud Upsampling VIA Optimal Transport. 2564-2568 - Ryosuke Watanabe, Keisuke Nonaka, Haruhisa Kato, Eduardo Pavez, Tatsuya Kobayashi, Antonio Ortega:
Point Cloud Denoising Using Normal Vector-Based Graph Wavelet Shrinkage. 2569-2573 - Anique Akhtar, Zhu Li, Geert Van Der Auwera, Jianle Chen:
Dynamic Point Cloud Interpolation. 2574-2578 - Shashank N. Sridhara, Eduardo Pavez, Antonio Ortega, Ryosuke Watanabe, Keisuke Nonaka:
Point Cloud Attribute Compression Via Chroma Subsampling. 2579-2583 - Lili Zhao, Xuhu Lin, Wenyi Wang, Kai-Kuang Ma, Jianwen Chen:
Rangeinet: Fast Lidar Point Cloud Temporal Interpolation. 2584-2588 - Lianlei Shan, Weiqiang Wang:
MBNet: A Multi-Resolution Branch Network for Semantic Segmentation Of Ultra-High Resolution Images. 2589-2593 - Yuxuan Zhang, Wei Yang:
BSOLO: Boundary-Aware One-Stage Instance Segmentation SOLO. 2594-2598 - Shaoping Jiang, Xiangmin Xu, Fang Liu, Xiaofen Xing, Lin Wang:
CS-GResNet: A Simple and Highly Efficient Network for Facial Expression Recognition. 2599-2603 - Bingxu Lu, Qinghua Hu, Yu Wang, Guosheng Hu:
RCANet: Row-Column Attention Network for Semantic Segmentation. 2604-2608 - Zhaozhi Xie, Hongtao Lu:
Exploring Category Consistency for Weakly Supervised Semantic Segmentation. 2609-2613 - Hyeonbin Hwang, Soyeon Kim, Wei-Jin Park, Jiho Seo, Kyungtae Ko, Hyeon Yeo:
Vision Transformer Equipped With Neural Resizer On Facial Expression Recognition Task. 2614-2618 - Kaining Ying, Zhenhua Wang, Cong Bai, Pengfei Zhou:
ISDA: Position-Aware Instance Segmentation with Deformable Attention. 2619-2623 - Zhenfei Zhang, Ming-Ching Chang, Tien D. Bui:
Improving Class Activation Map for Weakly Supervised Object Localization. 2624-2628 - Ruizhe Chen, Zhenqi Fu, Yue Huang, En Cheng, Xinghao Ding:
A Robust Object Segmentation Network for UnderWater Scenes. 2629-2633 - Leiping Jie, Hui Zhang:
A Fast and Efficient Network for Single Image Shadow Detection. 2634-2638 - Arvi Jonnarth, Michael Felsberg:
Importance Sampling Cams For Weakly-Supervised Segmentation. 2639-2643 - Qingfeng Liu, Hai Su, Mostafa El-Khamy, Kee-Bong Song:
DeepGBASS: Deep Guided Boundary-Aware Semantic Segmentation. 2644-2648 - Talha Hanif Butt, Murtaza Taj:
Camera Calibration Through Camera Projection Loss. 2649-2653 - Christopher Walker, Yuxing Wang, Yawen Lu, Guoyu Lu:
Inferring Camera Intrinsics Based on Surfaces of Revolution: A Single Image Geometric Network Approach for Camera Calibration. 2654-2658 - Sibo Zhang, Jiahong Yuan, Miao Liao, Liangjun Zhang:
Text2video: Text-Driven Talking-Head Video Synthesis with Personalized Phoneme - Pose Dictionary. 2659-2663 - Mohamed Afham, Udith Haputhanthri, Jathurshan Pradeepkumar, Mithunjha Anandakumar, Ashwin De Silva, Chamira U. S. Edussooriya:
Towards Accurate Cross-Domain in-Bed Human Pose Estimation. 2664-2668 - Yu Sun, Tianyu Huang, Qian Bao, Wu Liu, Wenpeng Gao, Yili Fu:
Learning Monocular Mesh Recovery of Multiple Body Parts Via Synthesis. 2669-2673 - Xiyang Liu, Peng Li, Ding Ni, Yan Wang, Hui Xue:
LightPose: A Lightweight and Efficient Model with Transformer for Human Pose Estimation. 2674-2678 - Qier An, Yuan Shen:
On The Observability in Visual Slam Networks. 2679-2683 - Yuxiao Li, Santiago Mazuelas, Yuan Shen:
Variational Bayesian Framework for Advanced Image Generation with Domain-Related Variables. 2684-2688 - Marina Gardella, Tina Nikoukhah, Yanhao Li, Quentin Bammey:
The Impact of JPEG Compression on Prior Image Noise. 2689-2693 - Tin Lay Nwe, Ramanpreet Singh Pahwa, Richard Chang, Oo Zaw Min, Jie Wang, Yiqun Li, Dongyun Lin, Shitala Prasad, Sheng Dong:
On the Use of Component Structural Characteristics for Voxel Segmentation in Semicon 3D Images. 2694-2698 - Zihan Zhang, Thierry Blu:
Blind Source Separation via a Weak Exclusion Principle. 2699-2703 - Yuqi Zhang, Qi Qian, Chong Liu, Weihua Chen, Fan Wang, Hao Li, Rong Jin:
Graph Convolution for Re-Ranking in Person Re-Identification. 2704-2708 - Jing Yang, Canlong Zhang, Zhixin Li, Yanping Tang:
Multi-Level Relation Aware Network for Person Re-Identification. 2709-2713 - Zhaopeng Dou, Zhongdao Wang, Yali Li, Shengjin Wang:
Progressive-Granularity Retrieval Via Hierarchical Feature Alignment for Person Re-Identification. 2714-2718 - Minjung Kim, MyeongAh Cho, Heansung Lee, Suhwan Cho, Sangyoun Lee:
Occluded Person Re-Identification Via Relational Adaptive Feature Correction Learning. 2719-2723 - Shiping Li, Min Cao, Min Zhang:
Learning Semantic-Aligned Feature Representation for Text-Based Person Search. 2724-2728 - Xuezhi Xiang, Ning Lv, Yulong Qiao:
Transformer-Based Person Search Model with Symmetric Online Instance Matching. 2729-2733 - Qingye Zhao, Xin Chen, Zhuoyu Zhao, Enyi Tang, Xuandong Li:
Wassertrain: An Adversarial Training Framework Against Wasserstein Adversarial Attacks. 2734-2738 - Siao Liu, Zhaoyu Chen, Wei Li, Jiwei Zhu, Jiafeng Wang, Wenqiang Zhang, Zhongxue Gan:
Efficient Universal Shuffle Attack for Visual Object Tracking. 2739-2743 - Riran Cheng, Nan Sang, Yinyuan Zhou, Xupeng Wang:
Non-Rigid Transformation Based Adversarial Attack Against 3d Object Tracking. 2744-2748 - Zhengyi Wang, Xupeng Wang, Ferdous Sohel, Mohammed Bennamoun, Yong Liao, Jiali Yu:
Adversary Distillation for One-Shot Attacks on 3D Target Tracking. 2749-2753 - Yin Yin Low, Angeline Tanvy, Raphaël C.-W. Phan, Xiaojun Chang:
AdverFacial: Privacy-Preserving Universal Adversarial Perturbation Against Facial Micro-Expression Leakages. 2754-2758 - Suryabhan Singh Hada, Miguel Á. Carreira-Perpiñán:
Interpretable Image Classification Using Sparse Oblique Decision Trees. 2759-2763 - Zhenqi Fu, Xiaopeng Lin, Wu Wang, Yue Huang, Xinghao Ding:
Underwater Image Enhancement Via Learning Water Type Desensitized Representations. 2764-2768 - Ziyin Ma, Changjae Oh:
A Wavelet-Based Dual-Stream Network for Underwater Image Enhancement. 2769-2773 - Shu Chai, Zhenqi Fu, Yue Huang, Xiaotong Tu, Xinghao Ding:
Unsupervised and Untrained Underwater Image Restoration Based on Physical Image Formation Model. 2774-2778 - Zhenlong Wang, Weifeng Liu, Yanjiang Wang, Baodi Liu:
Agcyclegan: Attention-Guided Cyclegan for Single Underwater Image Restoration. 2779-2783 - Shuhan Qi, Jianjun Du, Mingyan Wu, Hong Yi, Linlin Tang, Tao Qian, Xuan Wang:
Underwater Small Target Detection Based on Deformable Convolutional Pyramid. 2784-2788 - Kaixin Chen, Lin Zhang, Ying Shen, Yicong Zhou:
Towards Controllable and Physical Interpretable Underwater Scene Simulation. 2789-2793 - Yongshan Zhang, Xinxin Wang, Zhenyu Wang, Xinwei Jiang, Yicong Zhou:
Graph Learning Based Autoencoder for Hyperspectral Band Selection. 2794-2798 - Fengchao Xiong, Minchao Ye, Jun Zhou, Jianfeng Lu, Yuntao Qian:
Multitask Sparse Neural Network for Hyperspectral Image Denoising. 2799-2803 - Chen Xiaoyue, Xianghai Cao:
Hyperspectral Image Classification Based on Co-Learning Through Dual-Architecture Ensemble. 2804-2808 - Zhuanfeng Li, Fengchao Xiong, Jianfeng Lu, Jun Zhou, Yuntao Qian:
Material-Guided Siamese Fusion Network for Hyperspectral Object Tracking. 2809-2813 - Xiuheng Wang, Jie Chen, Cédric Richard:
Hyperspectral Image Super-Resolution with Deep Priors and Degradation Model Inversion. 2814-2818 - Na Liu, Wei Li, Ran Tao:
Geometric Low-Rank Tensor Approximation for Remotely Sensed Hyperspectral And Multispectral Imagery Fusion. 2819-2823 - Haoyue Tian, Pan Gao, Ran Wei, Manoranjan Paul:
Dilated Convolutional Neural Network-Based Deep Reference Picture Generation for Video Compression. 2824-2828 - Yanghao Li, Xinyao Chen, Jisheng Li, Jiangtao Wen, Yuxing Han, Shan Liu, Xiaozhong Xu:
Rate Control for Learned Video Compression. 2829-2833 - Xuekai Wei, Mingliang Zhou, Weijia Jia:
Global Optimization Solution for Dynamic Adaptive 360-Degree Streaming. 2834-2838 - Juliano S. Assine, José Cândido Silveira Santos Filho, Eduardo Valle:
Collaborative Object Detectors Adaptive to Bandwidth and Computation. 2839-2843 - Mu Li, Baojiang Zhong, Kai-Kuang Ma:
MA-NET: Multi-Scale Attention-Aware Network for Optical Flow Estimation. 2844-2848 - Yizhuo Li, Cewu Lu:
Modeling Human Memory in Multi-Object Tracking with Transformers. 2849-2853 - Chang-Sheng Lin, Chia-Yi Hsu, Pin-Yu Chen, Chia-Mu Yu:
Real-World Adversarial Examples Via Makeup. 2854-2858 - Joseph Clements, Yingjie Lao:
In Pursuit of Preserving the Fidelity of Adversarial Images. 2859-2863 - Meiling Li, Nan Zhong, Xinpeng Zhang, Zhenxing Qian, Sheng Li:
Object-Oriented Backdoor Attack Against Image Captioning. 2864-2868 - Mohammad Esmaeilpour, Patrick Cardinal, Alessandro Lameiras Koerich:
Towards Robust Speech-to-Text Adversarial Attack. 2869-2873 - Yixiao Xu, Xiaolei Liu, Mingyong Yin, Teng Hu, Kangyi Ding:
Sparse Adversarial Attack For Video Via Gradient-Based Keyframe Selection. 2874-2878 - Hui Zeng, Kang Deng, Biwei Chen, Anjie Peng:
How Secure Are The Adversarial Examples Themselves? 2879-2883 - Xiaohui Zhao, Yang Yu, Rongrong Ni, Yao Zhao:
Exploring Complementarity of Global and Local Spatiotemporal Information for Fake Face Video Detection. 2884-2888 - Edoardo Daniele Cannas, János Horváth, Sriram Baireddy, Paolo Bestagini, Edward J. Delp, Stefano Tubaro:
Panchromatic Imagery Copy-Paste Localization Through Data-Driven Sensor Attribution. 2889-2893 - Lv Chen, Dengpan Ye, Yueyun Shang, Jiaqing Huang:
Robust Video Hashing Based on Local Fluctuation Preserving for Tracking Deep Fake Videos. 2894-2898 - Ping Wang, Kunlin Liu, Wenbo Zhou, Hang Zhou, Honggu Liu, Weiming Zhang, Nenghai Yu:
ADT: Anti-Deepfake Transformer. 2899-2903 - Hui Guo, Shu Hu, Xin Wang, Ming-Ching Chang, Siwei Lyu:
Eyes Tell All: Irregular Pupil Shapes Reveal GAN-Generated Faces. 2904-2908 - Antonio Theophilo, Rafael Padilha, Fernanda A. Andaló, Anderson Rocha:
Explainable Artificial Intelligence for Authorship Attribution on Social Media. 2909-2913 - Guiping Zhu, Mingzhu Ma, Yuwen Huang, Kuikui Wang, Gongping Yang:
Dual-Domain Low-Rank Fusion Deep Metric Learning for Off-the-Person ECG Biometrics. 2914-2918 - Kanghao Zhang, Shan Liang, Shuai Nie, Shulin He, Jiahui Pan, Xueliang Zhang, Haoxin Ma, Jiangyan Yi:
A Robust Deep Audio Splicing Detection Method via Singularity Detection Feature. 2919-2923 - Kuikui Wang, Gongping Yang, Yuwen Huang, Lu Yang, Yilong Yin:
Online Ecg Biometrics Via Hadamard Code. 2924-2928 - Ziyue Xiang, Paolo Bestagini, Stefano Tubaro, Edward J. Delp:
Forensic Analysis and Localization of Multiply Compressed MP3 Audio Using Transformers. 2929-2933 - Chong Liu, Yuqi Zhang, Weihua Chen, Fan Wang, Hao Li, Yi-Dong Shen:
Adaptive Matching Strategy for Multi-Target Multi-Camera Tracking. 2934-2938 - Hanye Huang, Youjun Xiang, Guodong Yang, Lingling Lv, Xianfeng Li, Zichun Weng, Yuli Fu:
Generalized Face Anti-Spoofing via Cross-Adversarial Disentanglement with Mixing Augmentation. 2939-2943 - Taoshan Zhang, Youjun Xiang, Xianfeng Li, Zichun Weng, Zhen Chen, Yuli Fu:
Free Lunch for Cross-Domain Occluded Face Recognition without Source Data. 2944-2948 - Zijun Zhuang, Hongtao Lu:
Coneface: Approximate Pairwise Loss for Face Recognition. 2949-2953 - Jie Jiang, Yunlian Sun:
Depth-Based Ensemble Learning Network For Face Anti-Spoofing. 2954-2958 - Eklavya Sarkar, Pavel Korshunov, Laurent Colbois, Sébastien Marcel:
Are GAN-based morphs threatening face recognition? 2959-2963 - Yulu Jin, Lifeng Lai:
Privacy Protection In Learning Fair Representations. 2964-2968 - Le Feng, Sheng Li, Zhenxing Qian, Xinpeng Zhang:
Stealthy Backdoor Attack with Adversarial Training. 2969-2973 - Dan Zhao, Hong Chen, Suyun Zhao, Ruixuan Liu, Cuiping Li, Xiaoying Zhang:
Fldp: Flexible Strategy For Local Differential Privacy. 2974-2978 - Mohammad Amin Zarrabian, Ni Ding, Parastoo Sadeghi, Thierry Rakotoarivelo:
Enhancing Utility In The Watchdog Privacy Mechanism. 2979-2983 - Michele Cirillo, Mario Di Mauro, Vincenzo Matta, Giuseppe Basileo:
Cyber-Threat Propagation over Network-Slicing Architectures. 2984-2988 - Ecenaz Erdemir, Pier Luigi Dragotti, Deniz Gündüz:
Privacy-Aware Communication over a Wiretap Channel with Generative Networks. 2989-2993 - Ran Shi, Jian Xiong, Tong Qiao:
Encrypted Image Visual Security Index via Non-Local Recognizable Degree Evaluation. 2994-2998 - Lu Miao, Wei Yang, Rong Hu, Lu Li, Liusheng Huang:
Against Backdoor Attacks In Federated Learning With Differential Privacy. 2999-3003 - Xinying Liao, Jiaye Xue, Shengxing Yu, Ximeng Liu, Jiangang Shu:
SecMPNN: 3-Party Privacy-Preserving Molecular Structure Properties Inference. 3004-3008 - Behrooz Razeghi, Shideh Rezaeifar, Sohrab Ferdowsi, Taras Holotyak, Slava Voloshynovskiy:
Compressed Data Sharing Based On Information Bottleneck Model. 3009-3013 - Thibault Maho, Teddy Furon, Erwan Le Merrer:
Randomized Smoothing Under Attack: How Good is it in Practice? 3014-3018 - Chau Yi Li, Andrea Cavallaro:
Training Privacy-Preserving Video Analytics Pipelines by Suppressing Features That Reveal Information About Private Attributes. 3019-3023 - Yulong Wang, Xingshu Chen, Qixu Wang, Run Yang, Bangzhou Xin:
Unsupervised Anomaly Detection for Container Cloud Via BILSTM-Based Variational Auto-Encoder. 3024-3028 - Fusen Wang, Jun Sang, Chunlin Huang, Bin Cai, Hong Xiang, Nong Sang:
Applying Deep Learning to Known-Plaintext Attack on Chaotic Image Encryption Schemes. 3029-3033 - Jiahong Xie, Haibo Cheng, Rong Zhu, Ping Wang, Kaitai Liang:
WordMarkov: A New Password Probability Model of Semantics. 3034-3038 - Cong Li, Qingni Shen, Zhikang Xie, Jisheng Dong, Yuejian Fang, Zhonghai Wu:
Efficient Identity-Based Chameleon Hash for Mobile Devices. 3039-3043 - Xiaoxi He, Haibo Cheng, Jiahong Xie, Ping Wang, Kaitai Liang:
Passtrans: An Improved Password Reuse Model Based on Transformer. 3044-3048