Xiaodan Liang
2020 – today
- 2024
- [j63]Xiaojun Wang, Zichen Lou, Xiaodan Liang:
Optimal operation of integrated electricity and gas networks with risk analysis using downside risk constraints method. Comput. Chem. Eng. 184: 108641 (2024) - [j62]Linfeng Li, Weixing Su, Fang Liu, Maowei He, Xiaodan Liang:
Multi-scale adaptive networks for efficient inference. Int. J. Mach. Learn. Cybern. 15(2): 267-282 (2024) - [j61]Guangrun Wang, Changlin Li, Liuchun Yuan, Jiefeng Peng, Xiaoyu Xian, Xiaodan Liang, Xiaojun Chang, Liang Lin:
DNA Family: Boosting Weight-Sharing NAS With Block-Wise Supervisions. IEEE Trans. Pattern Anal. Mach. Intell. 46(5): 2722-2740 (2024) - [j60]Hanlin Zhang, Shuai Lin, Weiyang Liu, Pan Zhou, Jian Tang, Xiaodan Liang, Eric P. Xing:
Iterative Graph Self-Distillation. IEEE Trans. Knowl. Data Eng. 36(3): 1161-1169 (2024) - [j59]Shuai Lin, Chen Liu, Pan Zhou, Zi-Yuan Hu, Shuojia Wang, Ruihui Zhao, Yefeng Zheng, Liang Lin, Eric P. Xing, Xiaodan Liang:
Prototypical Graph Contrastive Learning. IEEE Trans. Neural Networks Learn. Syst. 35(2): 2747-2758 (2024) - [c225]Xuan Huang, Hanhui Li, Zejun Yang, Zhisheng Wang, Xiaodan Liang:
3D Visibility-Aware Generalizable Neural Radiance Fields for Interacting Hands. AAAI 2024: 2400-2408 - [c224]Hanhui Li, Xiaojian Lin, Xuan Huang, Zejun Yang, Zhisheng Wang, Xiaodan Liang:
Monocular 3D Hand Mesh Recovery via Dual Noise Estimation. AAAI 2024: 3046-3054 - [c223]Luoyang Lin, Zutao Jiang, Xiaodan Liang, Liqian Ma, Michael C. Kampffmeyer, Xiaochun Cao:
PTUS: Photo-Realistic Talking Upper-Body Synthesis via 3D-Aware Motion Decomposition Warping. AAAI 2024: 3441-3449 - [c222]Zhenyu Xie, Yang Wu, Xuehao Gao, Zhongqian Sun, Wei Yang, Xiaodan Liang:
Towards Detailed Text-to-Motion Synthesis via Basic-to-Advanced Hierarchical Diffusion Model. AAAI 2024: 6252-6260 - [c221]Meng Cao, Haoran Tang, Jinfa Huang, Peng Jin, Can Zhang, Ruyang Liu, Long Chen, Xiaodan Liang, Li Yuan, Ge Li:
RAP: Efficient Text-Video Retrieval with Sparse-and-Correlated Adapter. ACL (Findings) 2024: 7160-7174 - [c220]Jiaqi Chen, Bingqian Lin, Ran Xu, Zhenhua Chai, Xiaodan Liang, Kwan-Yee Kenneth Wong:
MapGPT: Map-Guided Prompting with Adaptive Path Planning for Vision-and-Language Navigation. ACL (1) 2024: 9796-9810 - [c219]Yinya Huang, Ruixin Hong, Hongming Zhang, Wei Shao, Zhicheng Yang, Dong Yu, Changshui Zhang, Xiaodan Liang, Linqi Song:
CLOMO: Counterfactual Logical Modification with Large Language Models. ACL (1) 2024: 11012-11034 - [c218]Qingxing Cao, Junhao Cheng, Xiaodan Liang, Liang Lin:
VisDiaHalBench: A Visual Dialogue Benchmark For Diagnosing Hallucination in Large Vision-Language Models. ACL (1) 2024: 12161-12176 - [c217]Xiwen Liang, Liang Ma, Shanshan Guo, Jianhua Han, Hang Xu, Shikui Ma, Xiaodan Liang:
CorNav: Autonomous Agent with Self-Corrected Planning for Zero-Shot Vision-and-Language Navigation. ACL (Findings) 2024: 12538-12559 - [c216]Yinya Huang, Xiaohan Lin, Zhengying Liu, Qingxing Cao, Huajian Xin, Haiming Wang, Zhenguo Li, Linqi Song, Xiaodan Liang:
MUSTARD: Mastering Uniform Synthesis of Theorem and Proof Data. ICLR 2024 - [c215]Renjie Pi, Lewei Yao, Jianhua Han, Xiaodan Liang, Wei Zhang, Hang Xu:
Ins-DetCLIP: Aligning Detection Model to Follow Human-Language Instruction. ICLR 2024 - [c214]Haiming Wang, Huajian Xin, Chuanyang Zheng, Zhengying Liu, Qingxing Cao, Yinya Huang, Jing Xiong, Han Shi, Enze Xie, Jian Yin, Zhenguo Li, Xiaodan Liang:
LEGO-Prover: Neural Theorem Proving with Growing Libraries. ICLR 2024 - [c213]Jing Xiong, Zixuan Li, Chuanyang Zheng, Zhijiang Guo, Yichun Yin, Enze Xie, Zhicheng Yang, Qingxing Cao, Haiming Wang, Xiongwei Han, Jing Tang, Chengming Li, Xiaodan Liang:
DQ-LoRe: Dual Queries with Low Rank Approximation Re-ranking for In-Context Learning. ICLR 2024 - [i262]Xuan Huang, Hanhui Li, Zejun Yang, Zhisheng Wang, Xiaodan Liang:
3D Visibility-aware Generalizable Neural Radiance Fields for Interacting Hands. CoRR abs/2401.00979 (2024) - [i261]Xinpeng Ding, Jianhua Han, Hang Xu, Xiaodan Liang, Wei Zhang, Xiaomeng Li:
Holistic Autonomous Driving Understanding by Bird's-Eye-View Injected Multi-Modal Large Models. CoRR abs/2401.00988 (2024) - [i260]Jiaqi Chen, Bingqian Lin, Ran Xu, Zhenhua Chai, Xiaodan Liang, Kwan-Yee K. Wong:
MapGPT: Map-Guided Prompting for Unified Vision-and-Language Navigation. CoRR abs/2401.07314 (2024) - [i259]Yinya Huang, Xiaohan Lin, Zhengying Liu, Qingxing Cao, Huajian Xin, Haiming Wang, Zhenguo Li, Linqi Song, Xiaodan Liang:
MUSTARD: Mastering Uniform Synthesis of Theorem and Proof Data. CoRR abs/2402.08957 (2024) - [i258]Tao Tang, Guangrun Wang, Yixing Lao, Peng Chen, Jie Liu, Liang Lin, Kaicheng Yu, Xiaodan Liang:
AlignMiF: Geometry-Aligned Multimodal Implicit Field for LiDAR-Camera Joint Synthesis. CoRR abs/2402.17483 (2024) - [i257]Guangrun Wang, Changlin Li, Liuchun Yuan, Jiefeng Peng, Xiaoyu Xian, Xiaodan Liang, Xiaojun Chang, Liang Lin:
DNA Family: Boosting Weight-Sharing NAS with Block-Wise Supervisions. CoRR abs/2403.01326 (2024) - [i256]Bingqian Lin, Yanxin Long, Yi Zhu, Fengda Zhu, Xiaodan Liang, Qixiang Ye, Liang Lin:
Towards Deviation-Robust Agent Navigation via Perturbation-Aware Contrastive Learning. CoRR abs/2403.05770 (2024) - [i255]Bingqian Lin, Yunshuang Nie, Ziming Wei, Jiaqi Chen, Shikui Ma, Jianhua Han, Hang Xu, Xiaojun Chang, Xiaodan Liang:
NavCoT: Boosting LLM-Based Vision-and-Language Navigation via Learning Disentangled Reasoning. CoRR abs/2403.07376 (2024) - [i254]Zicheng Zhang, Tong Zhang, Yi Zhu, Jianzhuang Liu, Xiaodan Liang, Qixiang Ye, Wei Ke:
Language-Driven Visual Consensus for Zero-Shot Semantic Segmentation. CoRR abs/2403.08426 (2024) - [i253]Minbin Huang, Yanxin Long, Xinchi Deng, Ruihang Chu, Jiangfeng Xiong, Xiaodan Liang, Hong Cheng, Qinglin Lu, Wei Liu:
DialogGen: Multi-modal Interactive Dialogue System for Multi-turn Text-to-Image Generation. CoRR abs/2403.08857 (2024) - [i252]Runhui Huang, Kaixin Cai, Jianhua Han, Xiaodan Liang, Renjing Pei, Guansong Lu, Songcen Xu, Wei Zhang, Hang Xu:
LayerDiff: Exploring Text-guided Multi-layered Composable Image Synthesis via Layer-Collaborative Diffusion Model. CoRR abs/2403.11929 (2024) - [i251]Sihao Lin, Pumeng Lyu, Dongrui Liu, Tao Tang, Xiaodan Liang, Andy Song, Xiaojun Chang:
MLP Can Be A Good Transformer Learner. CoRR abs/2404.05657 (2024) - [i250]Lewei Yao, Renjie Pi, Jianhua Han, Xiaodan Liang, Hang Xu, Wei Zhang, Zhenguo Li, Dan Xu:
DetCLIPv3: Towards Versatile Generative Open-vocabulary Object Detection. CoRR abs/2404.09216 (2024) - [i249]Jiehui Huang, Xiao Dong, Wenhui Song, Hanhui Li, Jun Zhou, Yuhao Cheng, Shutao Liao, Long Chen, Yiqiang Yan, Shengcai Liao, Xiaodan Liang:
ConsistentID: Portrait Generation with Multimodal Fine-Grained Identity Preserving. CoRR abs/2404.16771 (2024) - [i248]Junhao Cheng, Baiqiao Yin, Kaixin Cai, Minbin Huang, Hanhui Li, Yuxin He, Xi Lu, Yue Li, Yifei Li, Yuhao Cheng, Yiqiang Yan, Xiaodan Liang:
TheaterGen: Character Management with LLM for Consistent Multi-turn Image Generation. CoRR abs/2404.18919 (2024) - [i247]Xujie Zhang, Ente Lin, Xiu Li, Yuxuan Luo, Michael Kampffmeyer, Xin Dong, Xiaodan Liang:
MMTryon: Multi-Modal Multi-Reference Control for High-Quality Fashion Generation. CoRR abs/2405.00448 (2024) - [i246]Xiaohan Lin, Qingxing Cao, Yinya Huang, Zhicheng Yang, Zhengying Liu, Zhenguo Li, Xiaodan Liang:
ATG: Benchmarking Automated Theorem Generation for Generative Language Models. CoRR abs/2405.06677 (2024) - [i245]Siyu Lou, Yuntian Chen, Xiaodan Liang, Liang Lin, Quanshi Zhang:
Quantifying In-Context Reasoning Effects and Memorization Effects in LLMs. CoRR abs/2405.11880 (2024) - [i244]Huajian Xin, Daya Guo, Zhihong Shao, Zhizhou Ren, Qihao Zhu, Bo Liu, Chong Ruan, Wenda Li, Xiaodan Liang:
DeepSeek-Prover: Advancing Theorem Proving in LLMs through Large-Scale Synthetic Data. CoRR abs/2405.14333 (2024) - [i243]Haiming Wang, Huajian Xin, Zhengying Liu, Wenda Li, Yinya Huang, Jianqiao Lu, Zhicheng Yang, Jing Tang, Jian Yin, Zhenguo Li, Xiaodan Liang:
Proving Theorems Recursively. CoRR abs/2405.14414 (2024) - [i242]Jian Zhao, Lei Jin, Jianshu Li, Zheng Zhu, Yinglei Teng, Jiaojiao Zhao, Sadaf Gulshad, Zheng Wang, Bo Zhao, Xiangbo Shu, Yunchao Wei, Xuecheng Nie, Xiaojie Jin, Xiaodan Liang, Shin'ichi Satoh, Yandong Guo, Cewu Lu, Junliang Xing, Jane Shengmei Shen:
The SkatingVerse Workshop & Challenge: Methods and Results. CoRR abs/2405.17188 (2024) - [i241]Jun Zheng, Fuwei Zhao, Youjiang Xu, Xin Dong, Xiaodan Liang:
VITON-DiT: Learning In-the-Wild Video Try-On from Human Dance Videos via Diffusion Transformers. CoRR abs/2405.18326 (2024) - [i240]Bingqian Lin, Yunshuang Nie, Ziming Wei, Yi Zhu, Hang Xu, Shikui Ma, Jianzhuang Liu, Xiaodan Liang:
Correctable Landmark Discovery via Large Models for Vision-Language Navigation. CoRR abs/2405.18721 (2024) - [i239]Meng Cao, Haoran Tang, Jinfa Huang, Peng Jin, Can Zhang, Ruyang Liu, Long Chen, Xiaodan Liang, Li Yuan, Ge Li:
RAP: Efficient Text-Video Retrieval with Sparse-and-Correlated Adapter. CoRR abs/2405.19465 (2024) - [i238]Junhao Cheng, Xi Lu, Hanhui Li, Khun Loun Zai, Baiqiao Yin, Yuhao Cheng, Yiqiang Yan, Xiaodan Liang:
AutoStudio: Crafting Consistent Subjects in Multi-turn Interactive Image Generation. CoRR abs/2406.01388 (2024) - [i237]Lijun Zhou, Tao Tang, Pengkun Hao, Zihang He, Kalok Ho, Shuo Gu, Wenbo Hou, Zhihui Hao, Haiyang Sun, Kun Zhan, Peng Jia, Xianpeng Lang, Xiaodan Liang:
UA-Track: Uncertainty-Aware End-to-End 3D Multi-Object Tracking. CoRR abs/2406.02147 (2024) - [i236]Gexin Huang, Chenfei Wu, Mingjie Li, Xiaojun Chang, Ling Chen, Ying Sun, Shen Zhao, Xiaodan Liang, Liang Lin:
Predicting Genetic Mutation from Whole Slide Images via Biomedical-Linguistic Knowledge Enhanced Multi-label Classification. CoRR abs/2406.02990 (2024) - [i235]Xiaohan Lin, Qingxing Cao, Yinya Huang, Haiming Wang, Jianqiao Lu, Zhengying Liu, Linqi Song, Xiaodan Liang:
FVEL: Interactive Formal Verification Environment with Large Language Models via Theorem Proving. CoRR abs/2406.14408 (2024) - [i234]Sukmin Yun, Haokun Lin, Rusiru Thushara, Mohammad Qazim Bhat, Yongxin Wang, Zutao Jiang, Mingkai Deng, Jinhong Wang, Tianhua Tao, Junbo Li, Haonan Li, Preslav Nakov, Timothy Baldwin, Zhengzhong Liu, Eric P. Xing, Xiaodan Liang, Zhiqiang Shen:
Web2Code: A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMs. CoRR abs/2406.20098 (2024) - [i233]Jiaqi Chen, Bingqian Lin, Xinmin Liu, Xiaodan Liang, Kwan-Yee K. Wong:
Affordances-Oriented Planning using Foundation Models for Continuous Vision-Language Navigation. CoRR abs/2407.05890 (2024) - [i232]Guian Fang, Wenbiao Yan, Yuanfan Guo, Jianhua Han, Zutao Jiang, Hang Xu, Shengcai Liao, Xiaodan Liang:
HumanRefiner: Benchmarking Abnormal Human Generation and Refining with Coarse-to-fine Pose-Reversible Guidance. CoRR abs/2407.06937 (2024) - [i231]Hao Wang, Pengzhen Ren, Zequn Jie, Xiao Dong, Chengjian Feng, Yinlong Qian, Lin Ma, Dongmei Jiang, Yaowei Wang, Xiangyuan Lan, Xiaodan Liang:
OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion. CoRR abs/2407.07844 (2024) - [i230]Runhui Huang, Xinpeng Ding, Chunwei Wang, Jianhua Han, Yulong Liu, Hengshuang Zhao, Hang Xu, Lu Hou, Wei Zhang, Xiaodan Liang:
HiRes-LLaVA: Restoring Fragmentation Input in High-Resolution Large Vision-Language Models. CoRR abs/2407.08706 (2024) - [i229]Zhicheng Yang, Yinya Huang, Wei Shi, Liang Feng, Linqi Song, Yiwei Wang, Xiaodan Liang, Jing Tang:
Benchmarking LLMs for Optimization Modeling and Enhancing Reasoning via Reverse Socratic Synthesis. CoRR abs/2407.09887 (2024) - [i228]Mingjie Li, Haokun Lin, Liang Qiu, Xiaodan Liang, Ling Chen, Abdulmotaleb Elsaddik, Xiaojun Chang:
Contrastive Learning with Counterfactual Explanations for Radiology Report Generation. CoRR abs/2407.14474 (2024) - [i227]Zheng Chong, Xiao Dong, Haoxiang Li, Shiyue Zhang, Wenqing Zhang, Xujie Zhang, Hanqing Zhao, Xiaodan Liang:
CatVTON: Concatenation Is All You Need for Virtual Try-On with Diffusion Models. CoRR abs/2407.15886 (2024) - [i226]Zhenyu Xie, Haoye Dong, Yufei Gao, Zehua Ma, Xiaodan Liang:
DreamVTON: Customizing 3D Virtual Try-on with Personalized Diffusion Models. CoRR abs/2407.16511 (2024) - [i225]Yuxuan Hu, Minghuan Tan, Chenwei Zhang, Zixuan Li, Xiaodan Liang, Min Yang, Chengming Li, Xiping Hu:
APTNESS: Incorporating Appraisal Theory and Emotion Support Strategies for Empathetic Response Generation. CoRR abs/2407.21048 (2024)
- 2023
- [j58]Qiuyan Wang, Xiaodan Liang, Rize Jin, Yang Yan:
Applications of Strongly Regular Cayley Graphs to Codebooks. IEEE Access 11: 106980-106986 (2023) - [j57]Hang Chen, Bowei Cao, Jiangcun Yang, He Ren, Xingqiu Xia, Xiaowen Zhang, Wei Yan, Xiaodan Liang, Chen Li:
Construction and effect evaluation of prediction model for red blood cell transfusion requirement in cesarean section based on artificial intelligence. BMC Medical Informatics Decis. Mak. 23(1): 213 (2023) - [j56]Linfeng Li, Weixing Su, Fang Liu, Maowei He, Xiaodan Liang:
Knowledge Fusion Distillation: Improving Distillation with Multi-scale Attention Mechanisms. Neural Process. Lett. 55(5): 6165-6180 (2023) - [j55]Boyu Yang, Mingbao Lin, Yunxiao Zhang, Binghao Liu, Xiaodan Liang, Rongrong Ji, Qixiang Ye:
Dynamic Support Network for Few-Shot Class Incremental Learning. IEEE Trans. Pattern Anal. Mach. Intell. 45(3): 2945-2951 (2023) - [j54]Changlin Li, Guangrun Wang, Bing Wang, Xiaodan Liang, Zhihui Li, Xiaojun Chang:
DS-Net++: Dynamic Weight Slicing for Efficient Inference in CNNs and Vision Transformers. IEEE Trans. Pattern Anal. Mach. Intell. 45(4): 4430-4446 (2023) - [j53]Yinya Huang, Lemao Liu, Kun Xu, Meng Fang, Liang Lin, Xiaodan Liang:
Discourse-Aware Graph Networks for Textual Logical Reasoning. IEEE Trans. Pattern Anal. Mach. Intell. 45(10): 11668-11688 (2023) - [j52]Bingqian Lin, Yanxin Long, Yi Zhu, Fengda Zhu, Xiaodan Liang, Qixiang Ye, Liang Lin:
Towards Deviation-Robust Agent Navigation via Perturbation-Aware Contrastive Learning. IEEE Trans. Pattern Anal. Mach. Intell. 45(10): 12535-12549 (2023) - [j51]Xiao Dong, Xunlin Zhan, Yunchao Wei, Xiaoyong Wei, Yaowei Wang, Minlong Lu, Xiaochun Cao, Xiaodan Liang:
Entity-Graph Enhanced Cross-Modal Pretraining for Instance-Level Product Retrieval. IEEE Trans. Pattern Anal. Mach. Intell. 45(11): 13117-13133 (2023) - [j50]Junfan Lin, Keze Wang, Ziliang Chen, Xiaodan Liang, Liang Lin:
Towards Causality-Aware Inferring: A Sequential Discriminative Approach for Medical Diagnosis. IEEE Trans. Pattern Anal. Mach. Intell. 45(11): 13363-13375 (2023) - [j49]Dapeng Feng, Songfang Han, Hang Xu, Xiaodan Liang, Xiaojun Tan:
Point-Guided Contrastive Learning for Monocular 3-D Object Detection. IEEE Trans. Cybern. 53(2): 954-966 (2023) - [j48]Xiao Dong, Gengwei Zhang, Xunlin Zhan, Yi Ding, Yunchao Wei, Minlong Lu, Xiaodan Liang:
Caption-Aided Product Detection via Collaborative Pseudo-Label Harmonization. IEEE Trans. Multim. 25: 1916-1927 (2023) - [j47]Mingjie Li, Rui Liu, Fuyu Wang, Xiaojun Chang, Xiaodan Liang:
Auxiliary signal-guided knowledge encoder-decoder for medical report generation. World Wide Web (WWW) 26(1): 253-270 (2023) - [c212]Runhui Huang, Yanxin Long, Jianhua Han, Hang Xu, Xiwen Liang, Chunjing Xu, Xiaodan Liang:
NLIP: Noise-Robust Language-Image Pre-training. AAAI 2023: 926-934 - [c211]Zutao Jiang, Guansong Lu, Xiaodan Liang, Jihua Zhu, Wei Zhang, Xiaojun Chang, Hang Xu:
3D-TOGO: Towards Text-Guided Cross-Category 3D Object Generation. AAAI 2023: 1051-1059 - [c210]Bingqian Lin, Yi Zhu, Xiaodan Liang, Liang Lin, Jianzhuang Liu:
Actional Atomic-Concept Learning for Demystifying Vision-Language Navigation. AAAI 2023: 1568-1576 - [c209]Haiming Wang, Ye Yuan, Zhengying Liu, Jianhao Shen, Yichun Yin, Jing Xiong, Enze Xie, Han Shi, Yujun Li, Lin Li, Jian Yin, Zhenguo Li, Xiaodan Liang:
DT-Solver: Automated Theorem Proving with Dynamic-Tree Sampling Guided by Proof-level Value Function. ACL (1) 2023: 12632-12646 - [c208]Shida Chen, Xiaodan Liang, Pan Zhao:
Application of Intelligent Mobile Terminal in Virtual Building Construction Training Teaching. ADHIP (2) 2023: 345-360 - [c207]Mengxue Qu, Yu Wu, Yunchao Wei, Wu Liu, Xiaodan Liang, Yao Zhao:
Learning to Segment Every Referring Object Point by Point. CVPR 2023: 3021-3030 - [c206]Kaicheng Yu, Tang Tao, Hongwei Xie, Zhiwei Lin, Tingting Liang, Bing Wang, Peng Chen, Dayang Hao, Yongtao Wang, Xiaodan Liang:
Benchmarking the Robustness of LiDAR-Camera Fusion for 3D Object Detection. CVPR Workshops 2023: 3188-3198 - [c205]Mingjie Li, Bingqian Lin, Zicong Chen, Haokun Lin, Xiaodan Liang, Xiaojun Chang:
Dynamic Graph Enhanced Contrastive Learning for Chest X-Ray Report Generation. CVPR 2023: 3334-3343 - [c204]Xiwen Liang, Minzhe Niu, Jianhua Han, Hang Xu, Chunjing Xu, Xiaodan Liang:
Visual Exemplar Driven Task-Prompting for Unified Perception in Autonomous Driving. CVPR 2023: 9611-9621 - [c203]Yanxin Long, Youpeng Wen, Jianhua Han, Hang Xu, Pengzhen Ren, Wei Zhang, Shen Zhao, Xiaodan Liang:
CapDet: Unifying Dense Captioning and Open-World Detection Pretraining. CVPR 2023: 15233-15243 - [c202]Yihan Zeng, Chenhan Jiang, Jiageng Mao, Jianhua Han, Chaoqiang Ye, Qingqiu Huang, Dit-Yan Yeung, Zhen Yang, Xiaodan Liang, Hang Xu:
CLIP2: Contrastive Language-Image-Point Pretraining from Real-World Point Cloud Data. CVPR 2023: 15244-15253 - [c201]Lewei Yao, Jianhua Han, Xiaodan Liang, Dan Xu, Wei Zhang, Zhenguo Li, Hang Xu:
DetCLIPv2: Scalable Open-Vocabulary Object Detection Pre-training via Word-Region Alignment. CVPR 2023: 23497-23506 - [c200]Zhenyu Xie, Zaiyu Huang, Xin Dong, Fuwei Zhao, Haoye Dong, Xijin Zhang, Feida Zhu, Xiaodan Liang:
GP-VTON: Towards General Purpose Virtual Try-On via Collaborative Local-Flow Global-Parsing Learning. CVPR 2023: 23550-23559 - [c199]Jing Xiong, Jianhao Shen, Ye Yuan, Haiming Wang, Yichun Yin, Zhengying Liu, Lin Li, Zhijiang Guo, Qingxing Cao, Yinya Huang, Chuanyang Zheng, Xiaodan Liang, Ming Zhang, Qun Liu:
TRIGO: Benchmarking Formal Mathematical Proof Reduction for Generative Language Models. EMNLP 2023: 11594-11632 - [c198]Guangyi Liu, Zeyu Feng, Yuan Gao, Zichao Yang, Xiaodan Liang, Junwei Bao, Xiaodong He, Shuguang Cui, Zhen Li, Zhiting Hu:
Composable Text Controls in Latent Space with ODEs. EMNLP 2023: 16543-16570 - [c197]Kaixin Cai, Pengzhen Ren, Yi Zhu, Hang Xu, Jianzhuang Liu, Changlin Li, Guangrun Wang, Xiaodan Liang:
MixReorg: Cross-Modal Mixed Patch Reorganization is a Good Mask Learner for Open-World Semantic Segmentation. ICCV 2023: 1196-1205 - [c196]Zhijian Huang, Sihao Lin, Guiyu Liu, Mukun Luo, Chaoqiang Ye, Hang Xu, Xiaojun Chang, Xiaodan Liang:
FULLER: Unified Multi-modality Multi-task 3D Perception via Multi-level Gradient Calibration. ICCV 2023: 3479-3488 - [c195]Haoyuan Li, Haoye Dong, Hanchao Jia, Dong Huang, Michael C. Kampffmeyer, Liang Lin, Xiaodan Liang:
Coordinate Transformer: Achieving Single-stage Multi-person Mesh Recovery from Videos. ICCV 2023: 8710-8719 - [c194]Cuican Yu, Guansong Lu, Yihan Zeng, Jian Sun, Xiaodan Liang, Huibin Li, Zongben Xu, Songcen Xu, Wei Zhang, Hang Xu:
Towards High-Fidelity Text-Guided 3D Face Generation and Manipulation Using only Images. ICCV 2023: 15280-15291 - [c193]Runhui Huang, Jianhua Han, Guansong Lu, Xiaodan Liang, Yihan Zeng, Wei Zhang, Hang Xu:
DiffDis: Empowering Generative Diffusion Model with Cross-Modal Discrimination Capability. ICCV 2023: 15667-15677 - [c192]Xinchi Deng, Han Shi, Runhui Huang, Changlin Li, Hang Xu, Jianhua Han, James T. Kwok, Shen Zhao, Wei Zhang, Xiaodan Liang:
GrowCLIP: Data-aware Automatic Model Growing for Large-scale Contrastive Language-Image Pre-training. ICCV 2023: 22121-22132 - [c191]Hongguang Zhu, Yunchao Wei, Xiaodan Liang, Chunjie Zhang, Yao Zhao:
CTP: Towards Vision-Language Continual Pretraining via Compatible Momentum Contrast and Topology Preservation. ICCV 2023: 22200-22210 - [c190]Binbin Yang, Yi Luo, Ziliang Chen, Guangrun Wang, Xiaodan Liang, Liang Lin:
LAW-Diffusion: Complex Scene Generation by Diffusion with Layouts. ICCV 2023: 22612-22622 - [c189]Xujie Zhang, Binbin Yang, Michael C. Kampffmeyer, Wenqing Zhang, Shiyue Zhang, Guansong Lu, Liang Lin, Hang Xu, Xiaodan Liang:
DiffCloth: Diffusion Based Garment Synthesis and Manipulation via Structural Cross-modal Semantic Alignment. ICCV 2023: 23097-23106 - [c188]Jiahui Gao, Renjie Pi, Yong Lin, Hang Xu, Jiacheng Ye, Zhiyong Wu, Weizhong Zhang, Xiaodan Liang, Zhenguo Li, Lingpeng Kong:
Self-Guided Noise-Free Data Generation for Efficient Zero-Shot Learning. ICLR 2023 - [c187]Pengzhen Ren, Changlin Li, Hang Xu, Yi Zhu, Guangrun Wang, Jianzhuang Liu, Xiaojun Chang, Xiaodan Liang:
ViewCo: Discovering Text-Supervised Segmentation Masks via Multi-View Semantic Consistency. ICLR 2023 - [c186]Fengda Zhu, Vincent CS Lee, Xiaojun Chang, Xiaodan Liang:
Vision Language Navigation with Knowledge-driven Environmental Dreamer. IJCAI 2023: 1840-1848 - [c185]Mengxue Qu, Yu Wu, Wu Liu, Xiaodan Liang, Jingkuan Song, Yao Zhao, Yunchao Wei:
RIO: A Benchmark for Reasoning Intention-Oriented Objects in Open Environments. NeurIPS 2023 - [c184]Liucun Lu, Jinghui Qin, Zequn Jie, Lin Ma, Liang Lin, Xiaodan Liang:
RecFormer: Recurrent Multi-modal Transformer with History-Aware Contrastive Learning for Visual Dialog. PRCV (1) 2023: 159-171 - [i224]Bingqian Lin, Yi Zhu, Xiaodan Liang, Liang Lin, Jianzhuang Liu:
Actional Atomic-Concept Learning for Demystifying Vision-Language Navigation. CoRR abs/2302.06072 (2023) - [i223]Pengzhen Ren, Changlin Li, Hang Xu, Yi Zhu, Guangrun Wang, Jianzhuang Liu, Xiaojun Chang, Xiaodan Liang:
ViewCo: Discovering Text-Supervised Segmentation Masks via Multi-View Semantic Consistency. CoRR abs/2302.10307 (2023) - [i222]Xiwen Liang, Minzhe Niu, Jianhua Han, Hang Xu, Chunjing Xu, Xiaodan Liang:
Visual Exemplar Driven Task-Prompting for Unified Perception in Autonomous Driving. CoRR abs/2303.01788 (2023) - [i221]Yanxin Long, Youpeng Wen, Jianhua Han, Hang Xu, Pengzhen Ren, Wei Zhang, Shen Zhao, Xiaodan Liang:
CapDet: Unifying Dense Captioning and Open-World Detection Pretraining. CoRR abs/2303.02489 (2023) - [i220]Mingjie Li, Bingqian Lin, Zicong Chen, Haokun Lin, Xiaodan Liang, Xiaojun Chang:
Dynamic Graph Enhanced Contrastive Learning for Chest X-ray Report Generation. CoRR abs/2303.10323 (2023) - [i219]Yihan Zeng, Chenhan Jiang, Jiageng Mao, Jianhua Han, Chaoqiang Ye, Qingqiu Huang, Dit-Yan Yeung, Zhen Yang, Xiaodan Liang, Hang Xu:
CLIP2: Contrastive Language-Image-Point Pretraining from Real-World Point Cloud Data. CoRR abs/2303.12417 (2023) - [i218]Zhenyu Xie, Zaiyu Huang, Xin Dong, Fuwei Zhao, Haoye Dong, Xijin Zhang, Feida Zhu, Xiaodan Liang:
GP-VTON: Towards General Purpose Virtual Try-on via Collaborative Local-Flow Global-Parsing Learning. CoRR abs/2303.13756 (2023) - [i217]Lewei Yao, Jianhua Han, Xiaodan Liang, Dan Xu, Wei Zhang, Zhenguo Li, Hang Xu:
DetCLIPv2: Scalable Open-Vocabulary Object Detection Pre-training via Word-Region Alignment. CoRR abs/2304.04514 (2023) - [i216]Tang Tao, Longfei Gao, Guangrun Wang, Peng Chen, Dayang Hao, Xiaodan Liang, Mathieu Salzmann, Kaicheng Yu:
LiDAR-NeRF: Novel LiDAR View Synthesis via Neural Radiance Fields. CoRR abs/2304.10406 (2023)