


6th PRCV 2023: Xiamen, China - Part I
- Qingshan Liu, Hanzi Wang, Zhanyu Ma, Weishi Zheng, Hongbin Zha, Xilin Chen, Liang Wang, Rongrong Ji:
Pattern Recognition and Computer Vision - 6th Chinese Conference, PRCV 2023, Xiamen, China, October 13-15, 2023, Proceedings, Part I. Lecture Notes in Computer Science 14425, Springer 2024, ISBN 978-981-99-8428-2
Action Recognition
- Chengguo Yuan, Yu Jin, Zongzhen Wu, Fanting Wei, Yangzirui Wang, Lan Chen, Xiao Wang:
Learning Bottleneck Transformer for Event Image-Voxel Feature Fusion Based Classification. 3-15
- Yang Shu, Wanggen Li, Doudou Li, Kun Gao, Biao Jie:
Multi-scale Dilated Attention Graph Convolutional Network for Skeleton-Based Action Recognition. 16-28
- Wentian Xin, Yi Liu, Ruyi Liu, Qiguang Miao, Cheng Shi, Chi-Man Pun:
Auto-Learning-GCN: An Ingenious Framework for Skeleton-Based Action Recognition. 29-42
- Xiaowei Zhu, Qian Huang, Chang Li, Jingwen Cui, Yingying Chen:
Skeleton-Based Action Recognition with Combined Part-Wise Topology Graph Convolutional Networks. 43-59
- Mingliang Xue, Siwei Wang, Bing Fu, Zhengyang Zhao, Tao Liu, Lingfeng Lai:
Segmenting Key Clues to Induce Human-Object Interaction Detection. 60-71
- Teng Huang, Weiqing Kong, Jiaming Liang, Ziyu Ding, Hui Li, Xi Zhang:
Lightweight Multispectral Skeleton and Multi-stream Graph Attention Networks for Enhanced Action Prediction with Multiple Modalities. 72-83
- Wanchuan Yu, Hanyu Guo, Yan Yan, Jie Li, Hanzi Wang:
Spatio-Temporal Self-supervision for Few-Shot Action Recognition. 84-96
- Jiulin Li, Mengyu Yang, Yang Liu, Gongli Xi, Lanshan Zhang, Ye Tian:
A Fuzzy Error Based Fine-Tune Method for Spatio-Temporal Recognition Model. 97-108
- Jinzhao Luo, Lu Zhou, Guibo Zhu, Guojing Ge, Beiying Yang, Jinqiao Wang:
Temporal-Channel Topology Enhanced Network for Skeleton-Based Action Recognition. 109-119
- Ying Zhou, Yana Zhang, Aiqiu Wu:
HFGCN-Based Action Recognition System for Figure Skating. 120-130
Multi-modal Information Processing
- Zhengyu Li, Yao Wu, Yanyun Qu:
Image Priors Assisted Pre-training for Point Cloud Shape Analysis. 133-145
- Wei Yue:
AMM-GAN: Attribute-Matching Memory for Person Text-to-Image Generation. 146-158
- Liucun Lu, Jinghui Qin, Zequn Jie, Lin Ma, Liang Lin, Xiaodan Liang:
RecFormer: Recurrent Multi-modal Transformer with History-Aware Contrastive Learning for Visual Dialog. 159-171
- Jiancheng Huang, Yifan Liu, Jin Qin, Shifeng Chen:
KV Inversion: KV Embeddings Learning for Text-Conditioned Real Image Action Editing. 172-184
- Jiaer Xia, Haozhe Yang, Yan Zhang, Pingyang Dai:
Enhancing Text-Image Person Retrieval Through Nuances Varied Sample. 185-196
- Yi Zhang, Ce Zhang, Xueting Hu, Zhihai He:
Unsupervised Prototype Adapter for Vision-Language Models. 197-209
- Wenjun Feng, Dazhen Lin, Donglin Cao:
Multimodal Causal Relations Enhanced CLIP for Image-to-Text Retrieval. 210-221
- Longzheng Wang, Chuang Zhang, Hongbo Xu, Yongxiu Xu, Siqi Wang:
Exploring Cross-Modal Inconsistency in Entities and Emotions for Multimodal Fake News Detection. 222-234
- Mengluan Li, Yanqing Guo, Haiyan Fu, Yi Li, Hong Su:
Deep Consistency Preserving Network for Unsupervised Cross-Modal Hashing. 235-246
- Mintu Yang, Xianxu Hou, Hao Li, Linlin Shen, Lixin Fan:
Learning Adapters for Text-Guided Portrait Stylization with Pretrained Diffusion Models. 247-258
- Zikun Song, Pinle Qin, Jianchao Zeng, Shuangjiao Zhai, Rui Chai, JunYi Yan:
EdgeFusion: Infrared and Visible Image Fusion Algorithm in Low Light. 259-270
- Yuanyuan Qiu, Zhenning Yu, Zhenguo Gao:
An Efficient Momentum Framework for Face-Voice Association Learning. 271-283
- Yuan Qing, Naixing Wu, Shaohua Wan, Lixin Duan:
Multi-modal Instance Refinement for Cross-Domain Action Recognition. 284-296
- Yang Xu, Junyi Wu, Yan Yan, Xinsheng Du, Huiji Zhang, Jianqiang Zhao, Zhipeng Gao:
Modality Interference Decoupling and Representation Alignment for Caricature-Visual Face Recognition. 297-308
- Jie Wang, Yixiao Zheng, Ruoyi Du, Yiming Zhang, Kongming Liang, Zhanyu Ma:
Plugging Stylized Controls in Open-Stylized Image Captioning. 309-320
- Taoying Zhang, Hesong Li, Qiankun Liu, Xiaoyong Wang, Ying Fu:
MGT: Modality-Guided Transformer for Infrared and Visible Image Fusion. 321-332
- Chenyu Zhou, Xiuhong Li, Zhe Li, Fan Chen, Xiaofan Wang, Dan Yang, Bin Chen, Songlin Li:
Multimodal Rumor Detection by Using Additive Angular Margin with Class-Aware Attention for Hard Samples. 333-344
- Lingfeng Hu, Si Liu, Hanzi Wang:
An Effective Dynamic Reweighting Method for Unbiased Scene Graph Generation. 345-356
- Zejun Wang, Xinglong Wu, Hongwei Yang, Hui He, Yu Tai, Weizhe Zhang:
Multi-modal Graph and Sequence Fusion Learning for Recommendation. 357-369
- Guoyong Cai, Shunjie Wang, Guangrui Lv:
Co-attention Guided Local-Global Feature Fusion for Aspect-Level Multimodal Sentiment Analysis. 370-382
- Qing Zhang, Haocheng Lv, Jie Liu, Zhiyun Chen, Jianyong Duan, Mingying Xv, Hao Wang:
Discovering Multimodal Hierarchical Structures with Graph Neural Networks for Multi-modal and Multi-hop Question Answering. 383-394
- Chengjie Sun, Weiwei Chen, Lei Lin, Lili Shan:
Enhancing Recommender System with Multi-modal Knowledge Graph. 395-407
- Guoqing Xu, Min Hu, Xiaohua Wang, Jiaoyun Yang, Nan Li, Qingyu Zhang:
Location Attention Knowledge Embedding Model for Image-Text Matching. 408-421
- Dan Liu, Wei Song, Xiaobing Zhao:
Pedestrian Attribute Recognition Based on Multimodal Transformer. 422-433
- Xinyi Wu, Xia Yuan, YanChao Cui, Chunxia Zhao:
RGB-D Road Segmentation Based on Geometric Prior Information. 434-445
- Tingting Han, Yuanxin Lv, Zhou Yu, Jun Yu, Jianping Fan, Liu Yuan:
Contrastive Perturbation Network for Weakly Supervised Temporal Sentence Grounding. 446-460
- Feng Li, Enguang Zuo, Chen Chen, Cheng Chen, Mingrui Ma, Yunling Wang, Xiaoyi Lv, Min Li:
MLDF-Net: Metadata Based Multi-level Dynamic Fusion Network. 461-473
- Ran Yan, Ruiying Du, Kun He, Jing Chen:
Efficient Adversarial Training with Membership Inference Resistance. 474-486
- Hongyu Wang, Pengpeng Qiang, Hongye Tan, Jingchang Hu:
Enhancing Image Comprehension for Computer Science Visual Question Answering. 487-498
- Wei Bao, Jingjing Hu, Meiyu Huang, Xueshuang Xiang:
Cross-Modal Attentive Recalibration and Dynamic Fusion for Multispectral Pedestrian Detection. 499-510
