default search action
Xiang Bai
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2025
- [j134]Ling Fu, Zijie Wu, Yingying Zhu, Yuliang Liu, Xiang Bai:
Enhancing scene text detectors with realistic text image synthesis using diffusion models. Comput. Vis. Image Underst. 250: 104224 (2025) - [j133]Weijia Wu, Yuzhong Zhao, Zhuang Li, Jiahong Li, Hong Zhou, Mike Zheng Shou, Xiang Bai:
A large cross-modal video retrieval dataset with reading comprehension. Pattern Recognit. 157: 110818 (2025) - [j132]Dongliang Luo, Yuliang Liu, Rui Yang, Xianjin Liu, Jishen Zeng, Yu Zhou, Xiang Bai:
Toward real text manipulation detection: New dataset and new solution. Pattern Recognit. 157: 110828 (2025) - 2024
- [j131]Wenzhe Ding, Xiang Bai, Qingwei Wang, Huisheng Yao, Jian Liu, Hong Yang:
Research on the spacecraft ground equivalence test assessment problem: A comprehensive assessment method combining interval-type evaluation and prospect-two-dimensional cloud. Appl. Soft Comput. 166: 111882 (2024) - [j130]Dingyuan Zhang, Dingkang Liang, Hongcheng Yang, Zhikang Zou, Xiaoqing Ye, Zhe Liu, Xiang Bai:
SAM3D: zero-shot 3D object detection via the segment anything model. Sci. China Inf. Sci. 67(4) (2024) - [j129]Wenwen Yu, Yuliang Liu, Xingkui Zhu, Haoyu Cao, Xing Sun, Xiang Bai:
Turning a CLIP Model Into a Scene Text Spotter. IEEE Trans. Pattern Anal. Mach. Intell. 46(9): 6040-6054 (2024) - [j128]Debin Liu, Xiang Bai, Ruonan Zhao, Xianjun Deng, Laurence T. Yang:
Dual-Grained Lightweight Strategy. IEEE Trans. Pattern Anal. Mach. Intell. 46(12): 10228-10245 (2024) - [j127]Weijia Wu, Yiming Zhang, Yefei He, Luoming Zhang, Zhenyu Lou, Hong Zhou, Xiang Bai:
DSText V2: A comprehensive video text spotting dataset for dense and small text. Pattern Recognit. 149: 110177 (2024) - [j126]Mingkun Yang, Biao Yang, Minghui Liao, Yingying Zhu, Xiang Bai:
Class-Aware Mask-guided feature refinement for scene text recognition. Pattern Recognit. 149: 110244 (2024) - [j125]Mingkun Yang, Biao Yang, Minghui Liao, Yingying Zhu, Xiang Bai:
Sequential visual and semantic consistency for semi-supervised text recognition. Pattern Recognit. Lett. 178: 174-180 (2024) - [j124]Wenzhe Ding, Xiang Bai, Qingwei Wang, Fang Long, Hailin Li, Zhengrong Wu, Jian Liu, Huisheng Yao, Hong Yang:
A truncated test scheme design method for success-failure in-orbit tests. Reliab. Eng. Syst. Saf. 243: 109782 (2024) - [j123]Yuxuan Cai, Dingkang Liang, Dongliang Luo, Xinwei He, Xin Yang, Xiang Bai:
A Discrepancy Aware Framework for Robust Anomaly Detection. IEEE Trans. Ind. Informatics 20(3): 3986-3995 (2024) - [c192]Haisu Guan, Huanxin Yang, Xinyu Wang, Shengwei Han, Yongge Liu, Lianwen Jin, Xiang Bai, Yuliang Liu:
Deciphering Oracle Bone Language with Diffusion Models. ACL (1) 2024: 15554-15567 - [c191]Junfeng Wu, Yi Jiang, Qihao Liu, Zehuan Yuan, Xiang Bai, Song Bai:
General Object Foundation Model for Images and Videos at Scale. CVPR 2024: 3783-3795 - [c190]Xin Zhou, Dingkang Liang, Wei Xu, Xingkui Zhu, Yihan Xu, Zhikang Zou, Xiang Bai:
Dynamic Adapter Meets Prompt Tuning: Parameter-Efficient Transfer Learning for Point Cloud Analysis. CVPR 2024: 14707-14717 - [c189]Mingxin Huang, Hongliang Li, Yuliang Liu, Xiang Bai, Lianwen Jin:
Bridging the Gap Between End-to-End and Two-Step Text Spotting. CVPR 2024: 15608-15618 - [c188]Jianqiang Wan, Sibo Song, Wenwen Yu, Yuliang Liu, Wenqing Cheng, Fei Huang, Xiang Bai, Cong Yao, Zhibo Yang:
OMNIPARSER: A Unified Framework for Text Spotting, Key Information Extraction and Table Recognition. CVPR 2024: 15641-15653 - [c187]Zhang Li, Biao Yang, Qiang Liu, Zhiyin Ma, Shuo Zhang, Jingxu Yang, Yabo Sun, Yuliang Liu, Xiang Bai:
Monkey: Image Resolution and Text Label are Important Things for Large Multi-Modal Models. CVPR 2024: 26753-26763 - [c186]Dingyuan Zhang, Dingkang Liang, Zichang Tan, Xiaoqing Ye, Cheng Zhang, Jingdong Wang, Xiang Bai:
Make Your ViT-Based Multi-view 3D Detectors Faster via Token Compression. ECCV (47) 2024: 56-72 - [c185]Zheng Zhang, Yeyao Ma, Enming Zhang, Xiang Bai:
PSALM: Pixelwise SegmentAtion with Large Multi-modal Model. ECCV (34) 2024: 74-91 - [c184]Zhe Liu, Jinghua Hou, Xiaoqing Ye, Tong Wang, Jingdong Wang, Xiang Bai:
SEED: A Simple and Effective 3D DETR in Point Clouds. ECCV (11) 2024: 110-126 - [c183]Jinghua Hou, Tong Wang, Xiaoqing Ye, Zhe Liu, Shi Gong, Xiao Tan, Errui Ding, Jingdong Wang, Xiang Bai:
OPEN: Object-Wise Position Embedding for Multi-view 3D Object Detection. ECCV (26) 2024: 146-162 - [c182]Xudong Xie, Yuzhe Li, Yang Liu, Zhifei Zhang, Zhaowen Wang, Wei Xiong, Xiang Bai:
WAS: Dataset and Methods for Artistic Text Segmentation. ECCV (55) 2024: 237-254 - [c181]Zijie Wu, Chaohui Yu, Yanqin Jiang, Chenjie Cao, Fan Wang, Xiang Bai:
SC4D: Sparse-Controlled Video-to-4D Generation and Motion Transfer. ECCV (13) 2024: 361-379 - [c180]Junyi Li, Junfeng Wu, Weizhi Zhao, Song Bai, Xiang Bai:
PartGLEE: A Foundation Model for Recognizing and Parsing Any Objects. ECCV (75) 2024: 475-494 - [c179]Baole Wei, Minghang He, Liangcai Gao, Duoyou Zhou, Xiang Bai, Zhi Tang:
Maskstr: Guide Scene Text Recognition Models with Masking. ICASSP 2024: 4245-4249 - [c178]Linger Deng, Mingxin Huang, Xudong Xie, Yuliang Liu, Lianwen Jin, Xiang Bai:
Progressive Evolution from Single-Point to Polygon for Scene Text. ICDAR (5) 2024: 111-128 - [c177]Pengjie Wang, Kaile Zhang, Xinyu Wang, Shengwei Han, Yongge Liu, Lianwen Jin, Xiang Bai, Yuliang Liu:
Puzzle Pieces Picker: Deciphering Ancient Chinese Characters with Radical Reconstruction. ICDAR (1) 2024: 169-187 - [c176]Fadila Wendigoundi Douamba, Jianjun Song, Ling Fu, Yuliang Liu, Xiang Bai:
The First Swahili Language Scene Text Detection and Recognition Dataset. ICDAR (5) 2024: 215-226 - [c175]Chenyang Gao, Biao Yang, Wenwen Yu, Yuliang Liu, Xiang Bai:
Knowledge Mining of Scene Text for Referring Expression Comprehension. ICDAR (5) 2024: 245-262 - [c174]Hiba Maryam, Ling Fu, Jiajun Song, Tajrian ABM Shafayet, Qidi Luo, Xiang Bai, Yuliang Liu:
Dataset and Benchmark for Urdu Natural Scenes Text Detection, Recognition and Visual Question Answering. ICDAR (5) 2024: 279-292 - [c173]Shuo Zhang, Biao Yang, Zhang Li, Zhiyin Ma, Yuliang Liu, Xiang Bai:
Exploring the Capabilities of Large Multimodal Models on Dense Text. ICDAR (6) 2024: 281-298 - [i188]Mingxin Huang, Dezhi Peng, Hongliang Li, Zhenghao Peng, Chongyu Liu, Dahua Lin, Yuliang Liu, Xiang Bai, Lianwen Jin:
SwinTextSpotter v2: Towards Better Synergy for Scene Text Spotting. CoRR abs/2401.07641 (2024) - [i187]Haisu Guan, Jinpeng Wan, Yuliang Liu, Pengjie Wang, Kaile Zhang, Zhebin Kuang, Xinyu Wang, Xiang Bai, Lianwen Jin:
An open dataset for the evolution of oracle bone characters: EVOBC. CoRR abs/2401.12467 (2024) - [i186]Kaixin Xiong, Dingyuan Zhang, Dingkang Liang, Zhe Liu, Hongcheng Yang, Wondimu Dikubab, Jianwei Cheng, Xiang Bai:
You Only Look Bottom-Up for Monocular 3D Object Detection. CoRR abs/2401.15319 (2024) - [i185]Pengjie Wang, Kaile Zhang, Yuliang Liu, Jinpeng Wan, Haisu Guan, Zhebin Kuang, Xinyu Wang, Lianwen Jin, Xiang Bai:
An open dataset for oracle bone script recognition and decipherment. CoRR abs/2401.15365 (2024) - [i184]Wei Chen, Hengxu Lin, Qun Zhang, Xiaojin Zhang, Xiang Bai, Xuanjing Huang, Zhongyu Wei:
CauESC: A Causal Aware Model for Emotional Support Conversation. CoRR abs/2401.17755 (2024) - [i183]Dingkang Liang, Xin Zhou, Xinyu Wang, Xingkui Zhu, Wei Xu, Zhikang Zou, Xiaoqing Ye, Xiang Bai:
PointMamba: A Simple State Space Model for Point Cloud Analysis. CoRR abs/2402.10739 (2024) - [i182]Mingkun Yang, Biao Yang, Minghui Liao, Yingying Zhu, Xiang Bai:
Class-Aware Mask-Guided Feature Refinement for Scene Text Recognition. CoRR abs/2402.13643 (2024) - [i181]Mingkun Yang, Biao Yang, Minghui Liao, Yingying Zhu, Xiang Bai:
Sequential Visual and Semantic Consistency for Semi-supervised Text Recognition. CoRR abs/2402.15806 (2024) - [i180]Xin Zhou, Dingkang Liang, Wei Xu, Xingkui Zhu, Yihan Xu, Zhikang Zou, Xiang Bai:
Dynamic Adapter Meets Prompt Tuning: Parameter-Efficient Transfer Learning for Point Cloud Analysis. CoRR abs/2403.01439 (2024) - [i179]Yuliang Liu, Biao Yang, Qiang Liu, Zhang Li, Zhiyin Ma, Shuo Zhang, Xiang Bai:
TextMonkey: An OCR-Free Large Multimodal Model for Understanding Document. CoRR abs/2403.04473 (2024) - [i178]Yuxuan Cai, Xinwei He, Dingkang Liang, Ao Tong, Xiang Bai:
Anomaly Detection by Adapting a pre-trained Vision Language Model. CoRR abs/2403.09493 (2024) - [i177]Zheng Zhang, Yeyao Ma, Enming Zhang, Xiang Bai:
PSALM: Pixelwise SegmentAtion with Large Multi-Modal Model. CoRR abs/2403.14598 (2024) - [i176]Jianqiang Wan, Sibo Song, Wenwen Yu, Yuliang Liu, Wenqing Cheng, Fei Huang, Xiang Bai, Cong Yao, Zhibo Yang:
OmniParser: A Unified Framework for Text Spotting, Key Information Extraction and Table Recognition. CoRR abs/2403.19128 (2024) - [i175]Zijie Wu, Chaohui Yu, Yanqin Jiang, Chenjie Cao, Fan Wang, Xiang Bai:
SC4D: Sparse-Controlled Video-to-4D Generation and Motion Transfer. CoRR abs/2404.03736 (2024) - [i174]Mingxin Huang, Hongliang Li, Yuliang Liu, Xiang Bai, Lianwen Jin:
Bridging the Gap Between End-to-End and Two-Step Text Spotting. CoRR abs/2404.04624 (2024) - [i173]Jingqun Tang, Chunhui Lin, Zhen Zhao, Shu Wei, Binghong Wu, Qi Liu, Hao Feng, Yang Li, Siqi Wang, Lei Liao, Wei Shi, Yuliang Liu, Hao Liu, Yuan Xie, Xiang Bai, Can Huang:
TextSquare: Scaling up Text-Centric Visual Instruction Tuning. CoRR abs/2404.12803 (2024) - [i172]Yuliang Liu, Mingxin Huang, Hao Yan, Linger Deng, Weijia Wu, Hao Lu, Chunhua Shen, Lianwen Jin, Xiang Bai:
VimTS: A Unified Video and Image Text Spotter for Enhancing the Cross-domain Generalization. CoRR abs/2404.19652 (2024) - [i171]Shuo Zhang, Biao Yang, Zhang Li, Zhiyin Ma, Yuliang Liu, Xiang Bai:
Exploring the Capabilities of Large Multimodal Models on Dense Text. CoRR abs/2405.06706 (2024) - [i170]Fadila Wendigoundi Douamba, Jianjun Song, Ling Fu, Yuliang Liu, Xiang Bai:
The First Swahili Language Scene Text Detection and Recognition Dataset. CoRR abs/2405.11437 (2024) - [i169]Jingqun Tang, Qi Liu, Yongjie Ye, Jinghui Lu, Shu Wei, Chunhui Lin, Wanqing Li, Mohamad Fitri Faiz Bin Mahmood, Hao Feng, Zhen Zhao, Yanjie Wang, Yuliang Liu, Hao Liu, Xiang Bai, Can Huang:
MTVQA: Benchmarking Multilingual Text-Centric Visual Question Answering. CoRR abs/2405.11985 (2024) - [i168]Hiba Maryam, Ling Fu, Jiajun Song, Tajrian ABM Shafayet, Qidi Luo, Xiang Bai, Yuliang Liu:
Dataset and Benchmark for Urdu Natural Scenes Text Detection, Recognition and Visual Question Answering. CoRR abs/2405.12533 (2024) - [i167]Haisu Guan, Huanxin Yang, Xinyu Wang, Shengwei Han, Yongge Liu, Lianwen Jin, Xiang Bai, Yuliang Liu:
Deciphering Oracle Bone Language with Diffusion Models. CoRR abs/2406.00684 (2024) - [i166]Pengjie Wang, Kaile Zhang, Xinyu Wang, Shengwei Han, Yongge Liu, Lianwen Jin, Xiang Bai, Yuliang Liu:
Puzzle Pieces Picker: Deciphering Ancient Chinese Characters with Radical Reconstruction. CoRR abs/2406.03019 (2024) - [i165]Xingkui Zhu, Yiran Guan, Dingkang Liang, Yuchao Chen, Yuliang Liu, Xiang Bai:
MoE Jetpack: From Dense Checkpoints to Adaptive Mixture of Experts for Vision Tasks. CoRR abs/2406.04801 (2024) - [i164]Dingkang Liang, Wei Hua, Chunsheng Shi, Zhikang Zou, Xiaoqing Ye, Xiang Bai:
SOOD++: Leveraging Unlabeled Data to Boost Oriented Object Detection. CoRR abs/2407.01016 (2024) - [i163]Wei Xu, Chunsheng Shi, Sifan Tu, Xin Zhou, Dingkang Liang, Xiang Bai:
A Unified Framework for 3D Scene Understanding. CoRR abs/2407.03263 (2024) - [i162]Zhe Liu, Jinghua Hou, Xiaoqing Ye, Tong Wang, Jingdong Wang, Xiang Bai:
SEED: A Simple and Effective 3D DETR in Point Clouds. CoRR abs/2407.10749 (2024) - [i161]Jinghua Hou, Tong Wang, Xiaoqing Ye, Zhe Liu, Shi Gong, Xiao Tan, Errui Ding, Jingdong Wang, Xiang Bai:
OPEN: Object-wise Position Embedding for Multi-view 3D Object Detection. CoRR abs/2407.10753 (2024) - [i160]Junyi Li, Junfeng Wu, Weizhi Zhao, Song Bai, Xiang Bai:
PartGLEE: A Foundation Model for Recognizing and Parsing Any Objects. CoRR abs/2407.16696 (2024) - [i159]Zhe Liu, Jinghua Hou, Xinyu Wang, Xiaoqing Ye, Jingdong Wang, Hengshuang Zhao, Xiang Bai:
LION: Linear Group RNN for 3D Object Detection in Point Clouds. CoRR abs/2407.18232 (2024) - [i158]Xudong Xie, Yuzhe Li, Yang Liu, Zhifei Zhang, Zhaowen Wang, Wei Xiong, Xiang Bai:
WAS: Dataset and Methods for Artistic Text Segmentation. CoRR abs/2408.00106 (2024) - [i157]Mingxin Huang, Yuliang Liu, Dingkang Liang, Lianwen Jin, Xiang Bai:
Mini-Monkey: Multi-Scale Adaptive Cropping for Multimodal Large Language Models. CoRR abs/2408.02034 (2024) - [i156]Tingfeng Huang, Yuxuan Cheng, Jingbo Xia, Rui Yu, Yuxuan Cai, Jinhai Xiang, Xinwei He, Xiang Bai:
Attention-Guided Perturbation for Unsupervised Image Anomaly Detection. CoRR abs/2408.07490 (2024) - [i155]eiyao Zhao, Zhengshuo Li, Jiahui Zhang, Xiang Bai, Jia Su:
Stochastic Real-Time Economic Dispatch for Integrated Electric and Gas Systems Considering Uncertainty Propagation and Pipeline Leakage. CoRR abs/2408.08101 (2024) - [i154]Dingyuan Zhang, Dingkang Liang, Zichang Tan, Xiaoqing Ye, Cheng Zhang, Jingdong Wang, Xiang Bai:
Make Your ViT-based Multi-view 3D Detectors Faster via Token Compression. CoRR abs/2409.00633 (2024) - [i153]Xudong Xie, Liang Yin, Hao Yan, Yang Liu, Jing Ding, Minghui Liao, Yuliang Liu, Wei Chen, Xiang Bai:
PDF-WuKong: A Large Multimodal Model for Efficient Long PDF Reading with End-to-End Sparse Sampling. CoRR abs/2410.05970 (2024) - [i152]Zhuoling Li, Liangliang Ren, Jinrong Yang, Yong Zhao, Xiaoyang Wu, Zhenhua Xu, Xiang Bai, Hengshuang Zhao:
VIRT: Vision Instructed Transformer for Robotic Manipulation. CoRR abs/2410.07169 (2024) - [i151]Dingkang Liang, Tianrui Feng, Xin Zhou, Yumeng Zhang, Zhikang Zou, Xiang Bai:
Parameter-Efficient Fine-Tuning in Spectral Domain for Point Cloud Learning. CoRR abs/2410.08114 (2024) - [i150]Bin Shan, Xiang Fei, Wei Shi, An-Lan Wang, Guozhi Tang, Lei Liao, Jingqun Tang, Xiang Bai, Can Huang:
MCTBench: Multimodal Cognition towards Text-Rich Visual Scenes Benchmark. CoRR abs/2410.11538 (2024) - [i149]Yuxuan Cai, Jiangning Zhang, Haoyang He, Xinwei He, Ao Tong, Zhenye Gan, Chengjie Wang, Xiang Bai:
LLaVA-KD: A Framework of Distilling Multimodal Large Language Models. CoRR abs/2410.16236 (2024) - [i148]Linger Deng, Yuliang Liu, Bohan Li, Dongliang Luo, Liang Wu, Chengquan Zhang, Pengyuan Lyu, Ziyang Zhang, Gang Zhang, Errui Ding, Yingying Zhu, Xiang Bai:
R-CoT: Reverse Chain-of-Thought Problem Generation for Geometric Reasoning in Large Multimodal Models. CoRR abs/2410.17885 (2024) - [i147]Zhenbiao Cao, Yuanlei Zheng, Zhihao Fan, Xiaojin Zhang, Wei Chen, Xiang Bai:
RSL-SQL: Robust Schema Linking in Text-to-SQL Generation. CoRR abs/2411.00073 (2024) - 2023
- [j122]Yajie Chen, Xin Yang, Xiang Bai:
Confidence-weighted mutual supervision on dual networks for unsupervised cross-modality image segmentation. Sci. China Inf. Sci. 66(11) (2023) - [j121]Bin Cao, Tingyong Wu, Xiang Bai:
Stochastic programming based multi-arm bandit offloading strategy for internet of things. Digit. Commun. Networks 9(5): 1200-1211 (2023) - [j120]Dong Wu, Manwen Liao, Weitian Zhang, Xing-Gang Wang, Xiang Bai, Wenqing Cheng, Wen-Yu Liu:
Correction to: YOLOP: You Only Look Once for Panoptic Driving Perception. Mach. Intell. Res. 20(6): 952 (2023) - [j119]Minghui Liao, Zhisheng Zou, Zhaoyi Wan, Cong Yao, Xiang Bai:
Real-Time Scene Text Detection With Differentiable Binarization and Adaptive Scale Fusion. IEEE Trans. Pattern Anal. Mach. Intell. 45(1): 919-931 (2023) - [j118]Hui Zhang, Quanming Yao, James T. Kwok, Xiang Bai:
Searching a High Performance Feature Extractor for Text Recognition Network. IEEE Trans. Pattern Anal. Mach. Intell. 45(5): 6231-6246 (2023) - [j117]Zhe Liu, Tengteng Huang, Bingling Li, Xiwu Chen, Xi Wang, Xiang Bai:
EPNet++: Cascade Bi-Directional Fusion for Multi-Modal 3D Object Detection. IEEE Trans. Pattern Anal. Mach. Intell. 45(7): 8324-8341 (2023) - [j116]Mengshun Hu, Kui Jiang, Zheng Wang, Xiang Bai, Ruimin Hu:
CycMuNet+: Cycle-Projected Mutual Learning for Spatial-Temporal Video Super-Resolution. IEEE Trans. Pattern Anal. Mach. Intell. 45(11): 13376-13392 (2023) - [j115]Mengde Xu, Zheng Zhang, Fangyun Wei, Han Hu, Xiang Bai:
SAN: Side Adapter Network for Open-Vocabulary Semantic Segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 45(12): 15546-15561 (2023) - [j114]Yuliang Liu, Jiaxin Zhang, Dezhi Peng, Mingxin Huang, Xinyu Wang, Jingqun Tang, Can Huang, Dahua Lin, Chunhua Shen, Xiang Bai, Lianwen Jin:
SPTS v2: Single-Point Scene Text Spotting. IEEE Trans. Pattern Anal. Mach. Intell. 45(12): 15665-15679 (2023) - [j113]Kaixin Xiong, Dingyuan Zhang, Dingkang Liang, Zhe Liu, Hongcheng Yang, Wondimu Dikubab, Jianwei Cheng, Xiang Bai:
You Only Look Bottom-Up for Monocular 3D Object Detection. IEEE Robotics Autom. Lett. 8(11): 7464-7471 (2023) - [j112]Cairong Zhao, Zefan Qu, Xinyang Jiang, Yuanpeng Tu, Xiang Bai:
Content-Adaptive Auto-Occlusion Network for Occluded Person Re-Identification. IEEE Trans. Image Process. 32: 4223-4236 (2023) - [j111]Tianyi Shi, Xiaohuan Ding, Wei Zhou, Feng Pan, Zengqiang Yan, Xiang Bai, Xin Yang:
Affinity Feature Strengthening for Accurate, Complete and Robust Vessel Segmentation. IEEE J. Biomed. Health Informatics 27(8): 4006-4017 (2023) - [c172]Zhe Liu, Xiaoqing Ye, Xiao Tan, Errui Ding, Xiang Bai:
StereoDistill: Pick the Cream from LiDAR for Distilling Stereo-Based 3D Object Detection. AAAI 2023: 1790-1798 - [c171]Dingkang Liang, Jiahao Xie, Zhikang Zou, Xiaoqing Ye, Wei Xu, Xiang Bai:
CrowdCLIP: Unsupervised Crowd Counting via Vision-Language Model. CVPR 2023: 2893-2903 - [c170]Mengde Xu, Zheng Zhang, Fangyun Wei, Han Hu, Xiang Bai:
Side Adapter Network for Open-Vocabulary Semantic Segmentation. CVPR 2023: 2945-2954 - [c169]Qihao Liu, Junfeng Wu, Yi Jiang, Xiang Bai, Alan L. Yuille, Song Bai:
InstMove: Instance Motion for Object-centric Video Segmentation. CVPR 2023: 6344-6354 - [c168]Wenwen Yu, Yuliang Liu, Wei Hua, Deqiang Jiang, Bo Ren, Xiang Bai:
Turning a CLIP Model into a Scene Text Detector. CVPR 2023: 6978-6988 - [c167]Zhibo Yang, Rujiao Long, Pengfei Wang, Sibo Song, Humen Zhong, Wenqing Cheng, Xiang Bai, Cong Yao:
Modeling Entities as Semantic Points for Visual Information Extraction in the Wild. CVPR 2023: 15358-15367 - [c166]Wei Hua, Dingkang Liang, Jingyu Li, Xiaolong Liu, Zhikang Zou, Xiaoqing Ye, Xiang Bai:
SOOD: Towards Semi-Supervised Oriented Object Detection. CVPR 2023: 15558-15567 - [c165]Kaixin Xiong, Shi Gong, Xiaoqing Ye, Xiao Tan, Ji Wan, Errui Ding, Jingdong Wang, Xiang Bai:
CAPE: Camera View Position Embedding for Multi-View 3D Object Detection. CVPR 2023: 21570-21579 - [c164]Dingyuan Zhang, Dingkang Liang, Zhikang Zou, Jingyu Li, Xiaoqing Ye, Zhe Liu, Xiao Tan, Xiang Bai:
A Simple Vision Transformer for Weakly Semi-supervised 3D Object Detection. ICCV 2023: 8339-8349 - [c163]Mingxin Huang, Jiaxin Zhang, Dezhi Peng, Hao Lu, Can Huang, Yuliang Liu, Xiang Bai, Lianwen Jin:
ESTextSpotter: Towards Better Scene Text Spotting with Explicit Synergy in Transformer. ICCV 2023: 19438-19448 - [c162]Jianfeng Kuang, Wei Hua, Dingkang Liang, Mingkun Yang, Deqiang Jiang, Bo Ren, Xiang Bai:
Visual Information Extraction in the Wild: Practical Dataset and End-to-End Solution. ICDAR (6) 2023: 36-53 - [c161]Zhuang Liu, Ye Yuan, Zhilong Ji, Jinfeng Bai, Xiang Bai:
Semantic Graph Representation Learning for Handwritten Mathematical Expression Recognition. ICDAR (1) 2023: 152-166 - [c160]Chenyang Gao, Biao Yang, Hao Wang, Mingkun Yang, Wenwen Yu, Yuliang Liu, Xiang Bai:
TextREC: A Dataset for Referring Expression Comprehension with Reading Comprehension. ICDAR (3) 2023: 402-420 - [c159]Weijia Wu, Yuzhong Zhao, Zhuang Li, Jiahong Li, Mike Zheng Shou, Umapada Pal, Dimosthenis Karatzas, Xiang Bai:
ICDAR 2023 Competition on Video Text Reading for Dense and Small Text. ICDAR (2) 2023: 405-419 - [c158]Zhibo Yang, Xiaoge Song, Sibo Song, Tong Lu, Xiang Bai, Cheng-Lin Liu, Fei Huang, Cong Yao:
ICDAR 2023 Competition on Born Digital Video Text Question Answering. ICDAR (2) 2023: 508-521 - [c157]Wenwen Yu, Mingyu Liu, Mingrui Chen, Ning Lu, Yinlong Wen, Yuliang Liu, Dimosthenis Karatzas, Xiang Bai:
ICDAR 2023 Competition on Reading the Seal Title. ICDAR (2) 2023: 522-535 - [c156]Wenwen Yu, Chengquan Zhang, Haoyu Cao, Wei Hua, Bohan Li, Huang Chen, Mingyu Liu, Mingrui Chen, Jianfeng Kuang, Mengjun Cheng, Yuning Du, Shikun Feng, Xiaoguang Hu, Pengyuan Lyu, Kun Yao, Yuechen Yu, Yuliang Liu, Wanxiang Che, Errui Ding, Cheng-Lin Liu, Jiebo Luo, Shuicheng Yan, Min Zhang, Dimosthenis Karatzas, Xing Sun, Jingdong Wang, Xiang Bai:
ICDAR 2023 Competition on Structured Text Extraction from Visually-Rich Document Images. ICDAR (2) 2023: 536-552 - [c155]Chenyang Gao, Yuliang Liu, Shiyu Yao, Jinfeng Bai, Xiang Bai, Lianwen Jin, Cheng-Lin Liu:
ICDAR 2023 Competition on Recognition of Multi-line Handwritten Mathematical Expressions. ICDAR (2) 2023: 566-576 - [c154]Dongliang Luo, Yu Zhou, Rui Yang, Yuliang Liu, Xianjin Liu, Jishen Zeng, Enming Zhang, Biao Yang, Ziming Huang, Lianwen Jin, Xiang Bai:
ICDAR 2023 Competition on Detecting Tampered Text in Images. ICDAR (2) 2023: 587-600 - [c153]Jinghua Hou, Zhe Liu, Dingkang Liang, Zhikang Zou, Xiaoqing Ye, Xiang Bai:
Query-based Temporal Fusion with Explicit Motion for 3D Object Detection. NeurIPS 2023 - [c152]Xin Zhou, Jinghua Hou,