default search action
Ming Yan
This is just a disambiguation page, and is not intended to be the bibliography of an actual person. The links to all actual bibliographies of persons of the same or a similar name can be found below. Any publication listed on this page has not been assigned to an actual author yet. If you know the true author of one of the publications listed below, you are welcome to contact us.
Person information
Other persons with the same name
- Ming Yan 0001 — University of California, San Diego, Center for Wireless Communication, CA, USA
- Ming Yan 0002 — Nanyang Technological University, School of Electrical and Electronic Engineering, Singapore
- Ming Yan 0003 — Northwest Institute of Nuclear Technology, Xi'an, China (and 1 more)
- Ming Yan 0004 — Jinan University, Management School, Guangzhou, Chian (and 2 more)
- Ming Yan 0005 — Communication University of China, Beijing, China
- Ming Yan 0006 — Chinese University of Hong Kong, Shenzhen, China (and 2 more)
- Ming Yan 0007 — A*STAR, Centre for Frontier AI Research, Singapore (and 1 more)
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2024
- [j70]Ming Yan, Liang Wang, Meiling Zhang, Pan Shi:
Improved LS-SVM Boiler Combustion Model Based on Affinity Propagation. IEEE Access 12: 35184-35194 (2024) - [j69]Guangrong Li, Chaoying Zhao, Bin Li, Jiuyuan Li, Xiaojie Liu, Jianqi Lou, Ming Yan, Baohang Wang:
Stepwise estimation of height change time series and two-dimensional surface deformation over mountain excavation and City construction region with TS-InSAR technique. Int. J. Appl. Earth Obs. Geoinformation 131: 103982 (2024) - [j68]Xiao He, Ming Yan:
GraphKM: machine and deep learning for KM prediction of wildtype and mutant enzymes. BMC Bioinform. 25(1): 135 (2024) - [j67]Deyu Lin, Ming Yan, Linghe Kong, Ruoxuan Quan, Yong Liang Guan:
A Framework of Real-Time Intelligent Transportation System Based on Hybrid Fog-Cloud Computing. IEEE Commun. Mag. 62(1): 126-132 (2024) - [j66]Zhuo Wu, Zan Wang, Junjie Chen, Hanmo You, Ming Yan, Lanjun Wang:
Stratified random sampling for neural network test input selection. Inf. Softw. Technol. 165: 107331 (2024) - [j65]Ming Yan, Wenhao Guo, Hanbo Zheng, Tuanfa Qin:
Joint NTP-MAPPO and SDN for Energy Trading Among Multi-Base-Station Microgrids. IEEE Internet Things J. 11(10): 18568-18579 (2024) - [j64]Chao Wang, Ming Yan, Junjie Yu:
Sorted L1/L2 Minimization for Sparse Signal Recovery. J. Sci. Comput. 99(2): 32 (2024) - [j63]Ming Yan, Chaoying Zhao, Xiaojie Liu, Baohang Wang:
Sequential SBAS-InSAR Backward Estimation of Deformation Time Series. IEEE Geosci. Remote. Sens. Lett. 21: 1-5 (2024) - [j62]Haomin Tang, Shu Liu, Weijie Tan, Lingling Fu, Ming Yan, Hongchao Feng:
Prediction of midpalatal suture maturation stage based on transfer learning and enhanced vision transformer. BMC Medical Informatics Decis. Mak. 24(1): 232 (2024) - [j61]Ming Yan, Yueli Hu, Haikun Zhang:
Progressive meaningful visual cryptography for secure communication of grayscale medical images. Multim. Tools Appl. 83(11): 33639-33652 (2024) - [j60]Ming Yan, Junjie Chen, Xuejie Cao, Zhuo Wu, Yuning Kang, Zan Wang:
Revisiting deep neural network test coverage from the test effectiveness perspective. J. Softw. Evol. Process. 36(4) (2024) - [j59]Ying Ma, Chuyi Yu, Ming Yan, Arun Kumar Sangaiah, Youke Wu:
Dark-Side Avoidance of Mobile Applications With Data Biases Elimination in Socio-Cyber World. IEEE Trans. Comput. Soc. Syst. 11(4): 4955-4964 (2024) - [j58]Ming Yan, Zengcai Wang, Pusheng Wang, Jie Zhang:
Precise Shearer Positioning Technology Based on Interacting Multiple Model With Adaptivity and Robustness. IEEE Trans. Instrum. Meas. 73: 1-18 (2024) - [j57]Linhui Xiao, Xiaoshan Yang, Fang Peng, Ming Yan, Yaowei Wang, Changsheng Xu:
CLIP-VG: Self-Paced Curriculum Adapting of CLIP for Visual Grounding. IEEE Trans. Multim. 26: 4334-4347 (2024) - [c92]Chaoya Jiang, Wei Ye, Haiyang Xu, Qinghao Ye, Ming Yan, Ji Zhang, Shikun Zhang:
TiMix: Text-Aware Image Mixing for Effective Vision-Language Pre-training. AAAI 2024: 2489-2497 - [c91]Haoran Liu, Ying Ma, Ming Yan, Yingke Chen, Dezhong Peng, Xu Wang:
DiDA: Disambiguated Domain Alignment for Cross-Domain Retrieval with Partial Labels. AAAI 2024: 3612-3620 - [c90]Hongzhan Chen, Hehong Chen, Ming Yan, Wenshen Xu, Gao Xing, Weizhou Shen, Xiaojun Quan, Chenliang Li, Ji Zhang, Fei Huang:
SocialBench: Sociality Evaluation of Role-Playing Conversational Agents. ACL (Findings) 2024: 2108-2126 - [c89]Yuanhang Zheng, Peng Li, Ming Yan, Ji Zhang, Fei Huang, Yang Liu:
Budget-Constrained Tool Learning with Planning. ACL (Findings) 2024: 9039-9052 - [c88]Yang Zhang, Keqin Bao, Ming Yan, Wenjie Wang, Fuli Feng, Xiangnan He:
Text-like Encoding of Collaborative Information in Large Language Models for Recommendation. ACL (1) 2024: 9181-9191 - [c87]An Liu, Zonghan Yang, Zhenhe Zhang, Qingyuan Hu, Peng Li, Ming Yan, Ji Zhang, Fei Huang, Yang Liu:
PANDA: Preference Adaptation for Enhancing Domain-Specific Abilities of LLMs. ACL (Findings) 2024: 10960-10977 - [c86]Ziyue Wang, Chi Chen, Yiqi Zhu, Fuwen Luo, Peng Li, Ming Yan, Ji Zhang, Fei Huang, Maosong Sun, Yang Liu:
Browse and Concentrate: Comprehending Multimodal Content via Prior-LLM Context Fusion. ACL (1) 2024: 11229-11245 - [c85]Chi Chen, Yiyang Du, Zheng Fang, Ziyue Wang, Fuwen Luo, Peng Li, Ming Yan, Ji Zhang, Fei Huang, Maosong Sun, Yang Liu:
Model Composition for Multimodal Large Language Models. ACL (1) 2024: 11246-11262 - [c84]Vadim Grigorev, Jiayu Li, Weizhi Ma, Zhiyu He, Min Zhang, Yiqun Liu, Ming Yan, Ji Zhang:
SiTunes: A Situational Music Recommendation Dataset with Physiological and Psychological Signals. CHIIR 2024: 417-421 - [c83]Haowei Liu, Yaya Shi, Haiyang Xu, Chunfeng Yuan, Qinghao Ye, Chenliang Li, Ming Yan, Ji Zhang, Fei Huang, Bing Li, Weiming Hu:
Semantics-enhanced Cross-modal Masked Image Modeling for Vision-Language Pre-training. LREC/COLING 2024: 14664-14675 - [c82]Haowei Liu, Yaya Shi, Haiyang Xu, Chunfeng Yuan, Qinghao Ye, Chenliang Li, Ming Yan, Ji Zhang, Fei Huang, Bing Li, Weiming Hu:
Unifying Latent and Lexicon Representations for Effective Video-Text Retrieval. LREC/COLING 2024: 17031-17041 - [c81]Rui Zhang, Yukai Huang, Sicheng Liang, Shangyi Sun, Shaonan Ma, Chengying Huan, Lulu Chen, Zhihui Lu, Yang Xu, Ming Yan, Jie Wu:
Revisiting Learned Index with Byte-addressable Persistent Storage. ICPP 2024: 929-938 - [c80]Chenlin Zhao, Jiabo Ye, Yaguang Song, Ming Yan, Xiaoshan Yang, Changsheng Xu:
Part-Aware Prompt Tuning for Weakly Supervised Referring Expression Grounding. MMM (3) 2024: 489-502 - [i88]Hongzhan Chen, Xiaojun Quan, Hehong Chen, Ming Yan, Ji Zhang:
Knowledge Distillation for Closed-Source Language Models. CoRR abs/2401.07013 (2024) - [i87]Weizhou Shen, Chenliang Li, Hongzhan Chen, Ming Yan, Xiaojun Quan, Hehong Chen, Ji Zhang, Fei Huang:
Small LLMs Are Weak Tool Learners: A Multi-LLM Agent. CoRR abs/2401.07324 (2024) - [i86]Junyang Wang, Haiyang Xu, Jiabo Ye, Ming Yan, Weizhou Shen, Ji Zhang, Fei Huang, Jitao Sang:
Mobile-Agent: Autonomous Multi-Modal Mobile Device Agent with Visual Perception. CoRR abs/2401.16158 (2024) - [i85]Zijun Liu, Boqun Kou, Peng Li, Ming Yan, Ji Zhang, Fei Huang, Yang Liu:
Meta Ranking: Less Capable Language Models are Capable for Single Response Judgement. CoRR abs/2402.12146 (2024) - [i84]Ziyue Wang, Chi Chen, Yiqi Zhu, Fuwen Luo, Peng Li, Ming Yan, Ji Zhang, Fei Huang, Maosong Sun, Yang Liu:
Browse and Concentrate: Comprehending Multimodal Content via prior-LLM Context Fusion. CoRR abs/2402.12195 (2024) - [i83]Chi Chen, Yiyang Du, Zheng Fang, Ziyue Wang, Fuwen Luo, Peng Li, Ming Yan, Ji Zhang, Fei Huang, Maosong Sun, Yang Liu:
Model Composition for Multimodal Large Language Models. CoRR abs/2402.12750 (2024) - [i82]An Liu, Zonghan Yang, Zhenhe Zhang, Qingyuan Hu, Peng Li, Ming Yan, Ji Zhang, Fei Huang, Yang Liu:
PANDA: Preference Adaptation for Enhancing Domain-Specific Abilities of LLMs. CoRR abs/2402.12835 (2024) - [i81]Chaoya Jiang, Wei Ye, Mengfan Dong, Hongrui Jia, Haiyang Xu, Ming Yan, Ji Zhang, Shikun Zhang:
Hal-Eval: A Universal and Fine-grained Hallucination Evaluation Framework for Large Vision Language Models. CoRR abs/2402.15721 (2024) - [i80]Yuanhang Zheng, Peng Li, Ming Yan, Ji Zhang, Fei Huang, Yang Liu:
Budget-Constrained Tool Learning with Planning. CoRR abs/2402.15960 (2024) - [i79]Haowei Liu, Yaya Shi, Haiyang Xu, Chunfeng Yuan, Qinghao Ye, Chenliang Li, Ming Yan, Ji Zhang, Fei Huang, Bing Li, Weiming Hu:
Unifying Latent and Lexicon Representations for Effective Video-Text Retrieval. CoRR abs/2402.16769 (2024) - [i78]Haowei Liu, Yaya Shi, Haiyang Xu, Chunfeng Yuan, Qinghao Ye, Chenliang Li, Ming Yan, Ji Zhang, Fei Huang, Bing Li, Weiming Hu:
Semantics-enhanced Cross-modal Masked Image Modeling for Vision-Language Pre-training. CoRR abs/2403.00249 (2024) - [i77]Wei Ye, Chaoya Jiang, Haiyang Xu, Chenhao Ye, Chenliang Li, Ming Yan, Shikun Zhang, Songhang Huang, Fei Huang:
Efficient Vision-and-Language Pre-training with Text-Relevant Image Patch Selection. CoRR abs/2403.07883 (2024) - [i76]Anwen Hu, Haiyang Xu, Jiabo Ye, Ming Yan, Liang Zhang, Bo Zhang, Chen Li, Ji Zhang, Qin Jin, Fei Huang, Jingren Zhou:
mPLUG-DocOwl 1.5: Unified Structure Learning for OCR-free Document Understanding. CoRR abs/2403.12895 (2024) - [i75]Hongzhan Chen, Hehong Chen, Ming Yan, Wenshen Xu, Xing Gao, Weizhou Shen, Xiaojun Quan, Chenliang Li, Ji Zhang, Fei Huang, Jingren Zhou:
RoleInteract: Evaluating the Social Interaction of Role-Playing Agents. CoRR abs/2403.13679 (2024) - [i74]Zonghan Yang, Peng Li, Ming Yan, Ji Zhang, Fei Huang, Yang Liu:
ReAct Meets ActRe: When Language Agents Enjoy Training Data Autonomy. CoRR abs/2403.14589 (2024) - [i73]Ming Yan, Yan Zhang, Shuqiang Cai, Shuqi Fan, Xincheng Lin, Yudi Dai, Siqi Shen, Chenglu Wen, Lan Xu, Yuexin Ma, Cheng Wang:
RELI11D: A Comprehensive Multimodal Human Motion Dataset and Method. CoRR abs/2403.19501 (2024) - [i72]Liang Zhang, Anwen Hu, Haiyang Xu, Ming Yan, Yichen Xu, Qin Jin, Ji Zhang, Fei Huang:
TinyChart: Efficient Chart Understanding with Visual Token Merging and Program-of-Thoughts Learning. CoRR abs/2404.16635 (2024) - [i71]Junyang Wang, Haiyang Xu, Haitao Jia, Xi Zhang, Ming Yan, Weizhou Shen, Ji Zhang, Fei Huang, Jitao Sang:
Mobile-Agent-v2: Mobile Device Operation Assistant with Effective Navigation via Multi-Agent Collaboration. CoRR abs/2406.01014 (2024) - [i70]Chenhao Si, Ming Yan:
Initialization-enhanced Physics-Informed Neural Network with Domain Decomposition (IDPINN). CoRR abs/2406.03172 (2024) - [i69]Yuhao Dan, Junfeng Tian, Jie Zhou, Ming Yan, Ji Zhang, Qin Chen, Liang He:
Modeling Comparative Logical Relation with Contrastive Learning for Text Generation. CoRR abs/2406.09095 (2024) - [i68]Baihan Li, Zeyu Xie, Xuenan Xu, Yiwei Guo, Ming Yan, Ji Zhang, Kai Yu, Mengyue Wu:
DiveSound: LLM-Assisted Automatic Taxonomy Construction for Diverse Audio Generation. CoRR abs/2407.13198 (2024) - [i67]Xuenan Xu, Pingyue Zhang, Ming Yan, Ji Zhang, Mengyue Wu:
Enhancing Zero-shot Audio Classification using Sound Attribute Knowledge from Large Language Models. CoRR abs/2407.14355 (2024) - [i66]Haowei Liu, Xi Zhang, Haiyang Xu, Yaya Shi, Chaoya Jiang, Ming Yan, Ji Zhang, Fei Huang, Chunfeng Yuan, Bing Li, Weiming Hu:
MIBench: Evaluating Multimodal Large Language Models over Multiple Images. CoRR abs/2407.15272 (2024) - 2023
- [j56]Ping Xu, Yang Zhao, Lingyun Xue, Yian Liu, Ming Yan, Lei Zhu, Lin Weng, Shundi Hu, Luhong Wen:
Fentanyl analogs classification via Siamese network and mass spectral library searching. Expert Syst. Appl. 217: 119534 (2023) - [j55]Jie Zhou, Junfeng Tian, Rui Wang, Yuanbin Wu, Ming Yan, Liang He, Xuanjing Huang:
Multi-modal multi-hop interaction network for dialogue response generation. Expert Syst. Appl. 227: 120267 (2023) - [j54]Suhong Wang, Wenhao Guo, Hongmin Sun, Junyu Ren, Ming Yan, Yongle Hu, Tuanfa Qin:
Multi-layer task scheduling and resource allocation schemes considering idle resource and task priority in IoT networks. IET Commun. 17(20): 2319-2334 (2023) - [j53]Jian Chen, Ming Yan, Muhammad Rabea Hanzla Qureshi, Keke Geng:
Estimating the visibility in foggy weather based on meteorological and video data: A Recurrent Neural Network approach. IET Signal Process. 17(1) (2023) - [j52]Lu Liu, Shaohua Yang, Ming Yan, Binkang Li, Yang Guo, Mingan Guo, Gang Li, Errui Zhou:
The effect of photodiode shape on pinning potential for charge transfer in CMOS image sensors. Microelectron. J. 131: 105651 (2023) - [j51]Haoyu Zhang, Meng Liu, Yuhong Li, Ming Yan, Zan Gao, Xiaojun Chang, Liqiang Nie:
Attribute-Guided Collaborative Learning for Partial Person Re-Identification. IEEE Trans. Pattern Anal. Mach. Intell. 45(12): 14144-14160 (2023) - [j50]Min Li, Zheng Liu, Yu Xia, Mingyang He, Kang-wen Yang, Shuai Yuan, Ming Yan, Kun Huang, He-ping Zeng:
Terahertz Time-of-Flight Ranging with Adaptive Clock Asynchronous Optical Sampling. Sensors 23(2): 715 (2023) - [j49]Haikun Zhang, Yueli Hu, Ming Yan:
Thermal Image Super-Resolution Based on Lightweight Dynamic Attention Network for Infrared Sensors. Sensors 23(21): 8717 (2023) - [j48]Haikun Zhang, Yueli Hu, Ming Yan, Bin Ma:
Thermal image super-resolution via multi-path residual attention network. Signal Image Video Process. 17(5): 2073-2081 (2023) - [j47]Ming Yan, Haiyang Xu, Chenliang Li, Junfeng Tian, Bin Bi, Wei Wang, Xianzhe Xu, Ji Zhang, Songfang Huang, Fei Huang, Luo Si, Rong Jin:
Achieving Human Parity on Visual Question Answering. ACM Trans. Inf. Syst. 41(3): 79:1-79:40 (2023) - [c79]Qianglong Chen, Guohai Xu, Ming Yan, Ji Zhang, Fei Huang, Luo Si, Yin Zhang:
Distinguish Before Answer: Generating Contrastive Explanation as Knowledge for Commonsense Question Answering. ACL (Findings) 2023: 13207-13224 - [c78]Ming Yan, Xin Wang, Yudi Dai, Siqi Shen, Chenglu Wen, Lan Xu, Yuexin Ma, Cheng Wang:
CIMI4D: A Large Multimodal Climbing Motion Dataset under Human-scene Interactions. CVPR 2023: 12977-12988 - [c77]Rui Zhang, Yao Wu, Shangyi Sun, Lulu Chen, Yibo Huang, Ming Yan, Jie Wu:
PFtree: Optimizing Persistent Adaptive Radix Tree for PM Systems on eADR Platform. DASFAA (1) 2023: 46-61 - [c76]Xinxing Zhou, Fei Wen, Ming Yan:
An Improved Method for High Speed DUC in Broadband Transmitters. EITCE 2023: 737-741 - [c75]Chenliang Li, He Chen, Ming Yan, Weizhou Shen, Haiyang Xu, Zhikai Wu, Zhicheng Zhang, Wenmeng Zhou, Yingda Chen, Chen Cheng, Hongzhu Shi, Ji Zhang, Fei Huang, Jingren Zhou:
ModelScope-Agent: Building Your Customizable Agent System with Open-source Large Language Models. EMNLP (Demos) 2023: 566-578 - [c74]Jiabo Ye, Anwen Hu, Haiyang Xu, Qinghao Ye, Ming Yan, Guohai Xu, Chenliang Li, Junfeng Tian, Qi Qian, Ji Zhang, Qin Jin, Liang He, Xin Lin, Fei Huang:
UReader: Universal OCR-free Visually-situated Language Understanding with Multimodal Large Language Model. EMNLP (Findings) 2023: 2841-2858 - [c73]Hongzhan Chen, Siyue Wu, Xiaojun Quan, Rui Wang, Ming Yan, Ji Zhang:
MCC-KD: Multi-CoT Consistent Knowledge Distillation. EMNLP (Findings) 2023: 6805-6820 - [c72]Xu Yang, Zhangzikang Li, Haiyang Xu, Hanwang Zhang, Qinghao Ye, Chenliang Li, Ming Yan, Yu Zhang, Fei Huang, Songfang Huang:
Learning Trajectory-Word Alignments for Video-Language Tasks. ICCV 2023: 2504-2514 - [c71]Chaoya Jiang, Haiyang Xu, Wei Ye, Qinghao Ye, Chenliang Li, Ming Yan, Bin Bi, Shikun Zhang, Fei Huang, Songfang Huang:
BUS : Efficient and Effective Vision-language Pre-training with Bottom-Up Patch Summarization. ICCV 2023: 2888-2898 - [c70]Junyang Wang, Yuanhong Xu, Juhua Hu, Ming Yan, Jitao Sang, Qi Qian:
Improved Visual Fine-tuning with Natural Language Supervision. ICCV 2023: 11865-11875 - [c69]Qinghao Ye, Guohai Xu, Ming Yan, Haiyang Xu, Qi Qian, Ji Zhang, Fei Huang:
HiTeA: Hierarchical Temporal-Aware Video-Language Pre-training. ICCV 2023: 15359-15370 - [c68]Shumin Deng, Chengming Wang, Zhoubo Li, Ningyu Zhang, Zelin Dai, Hehong Chen, Feiyu Xiong, Ming Yan, Qiang Chen, Mosha Chen, Jiaoyan Chen, Jeff Z. Pan, Bryan Hooi, Huajun Chen:
Construction and Applications of Billion-Scale Pre-Trained Multimodal Business Knowledge Graph. ICDE 2023: 2988-3002 - [c67]Haiyang Xu, Qinghao Ye, Ming Yan, Yaya Shi, Jiabo Ye, Yuanhong Xu, Chenliang Li, Bin Bi, Qi Qian, Wei Wang, Guohai Xu, Ji Zhang, Songfang Huang, Fei Huang, Jingren Zhou:
mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video. ICML 2023: 38728-38748 - [c66]Ming Yan, Junjie Chen, Hangyu Mao, Jiajun Jiang, Jianye Hao, Xingjian Li, Zhao Tian, Zhichao Chen, Dong Li, Zhangkong Xian, Yanwei Guo, Wulong Liu, Bin Wang, Yuefeng Sun, Yongshun Cui:
Achieving Last-Mile Functional Coverage in Testing Chip Design Software Implementations. ICSE-SEIP 2023: 343-354 - [c65]Junyang Wang, Ming Yan, Yi Zhang, Jitao Sang:
From Association to Generation: Text-only Captioning by Unsupervised Cross-modal Mapping. IJCAI 2023: 4326-4334 - [c64]Yaya Shi, Haowei Liu, Haiyang Xu, Zongyang Ma, Qinghao Ye, Anwen Hu, Ming Yan, Ji Zhang, Fei Huang, Chunfeng Yuan, Bing Li, Weiming Hu, Zheng-Jun Zha:
Learning Semantics-Grounded Vocabulary Representation for Video-Text Retrieval. ACM Multimedia 2023: 4460-4470 - [c63]Chaoya Jiang, Haiyang Xu, Wei Ye, Qinghao Ye, Chenliang Li, Ming Yan, Bin Bi, Shikun Zhang, Fei Huang, Ji Zhang:
COPA : Efficient Vision-Language Pre-training through Collaborative Object- and Patch-Text Alignment. ACM Multimedia 2023: 4480-4491 - [c62]Qinghao Ye, Haiyang Xu, Ming Yan, Chenlin Zhao, Junyang Wang, Xiaoshan Yang, Ji Zhang, Fei Huang, Jitao Sang, Changsheng Xu:
mPLUG-Octopus: The Versatile Assistant Empowered by A Modularized End-to-End Multimodal LLM. ACM Multimedia 2023: 9365-9367 - [c61]Ming Yan, Wei Geng, Pan Hui:
Towards a 3D Evaluation Dataset for User Acceptance of Automated Shuttles. VR Workshops 2023: 89-93 - [i65]Xu Yang, Zhangzikang Li, Haiyang Xu, Hanwang Zhang, Qinghao Ye, Chenliang Li, Ming Yan, Yu Zhang, Fei Huang, Songfang Huang:
Learning Trajectory-Word Alignments for Video-Language Tasks. CoRR abs/2301.01953 (2023) - [i64]Shangyi Sun, Chunpu Huang, Rui Zhang, Lulu Chen, Yukai Huang, Ming Yan, Jie Wu:
A Comprehensive Study on Optimizing Systems with Data Processing Units. CoRR abs/2301.06070 (2023) - [i63]Haiyang Xu, Qinghao Ye, Ming Yan, Yaya Shi, Jiabo Ye, Yuanhong Xu, Chenliang Li, Bin Bi, Qi Qian, Wei Wang, Guohai Xu, Ji Zhang, Songfang Huang, Fei Huang, Jingren Zhou:
mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video. CoRR abs/2302.00402 (2023) - [i62]Ming Yan, Xin Wang, Yudi Dai, Siqi Shen, Chenglu Wen, Lan Xu, Yuexin Ma, Cheng Wang:
CIMI4D: A Large Multimodal Climbing Motion Dataset under Human-scene Interactions. CoRR abs/2303.17948 (2023) - [i61]Junyang Wang, Yuanhong Xu, Juhua Hu, Ming Yan, Jitao Sang, Qi Qian:
Improved Visual Fine-tuning with Natural Language Supervision. CoRR abs/2304.01489 (2023) - [i60]Junfeng Tian, Hehong Chen, Guohai Xu, Ming Yan, Xing Gao, Jianhai Zhang, Chenliang Li, Jiayi Liu, Wenshen Xu, Haiyang Xu, Qi Qian, Wei Wang, Qinghao Ye, Jiejing Zhang, Ji Zhang, Fei Huang, Jingren Zhou:
ChatPLUG: Open-Domain Generative Dialogue System with Internet-Augmented Instruction Tuning for Digital Human. CoRR abs/2304.07849 (2023) - [i59]Junyang Wang, Ming Yan, Yi Zhang, Jitao Sang:
From Association to Generation: Text-only Captioning by Unsupervised Cross-modal Mapping. CoRR abs/2304.13273 (2023) - [i58]Qinghao Ye, Haiyang Xu, Guohai Xu, Jiabo Ye, Ming Yan, Yiyang Zhou, Junyang Wang, Anwen Hu, Pengcheng Shi, Yaya Shi, Chenliang Li, Yuanhong Xu, Hehong Chen, Junfeng Tian, Qian Qi, Ji Zhang, Fei Huang:
mPLUG-Owl: Modularization Empowers Large Language Models with Multimodality. CoRR abs/2304.14178 (2023) - [i57]Xu Yang, Jiawei Peng, Zihua Wang, Haiyang Xu, Qinghao Ye, Chenliang Li, Ming Yan, Fei Huang, Zhangzikang Li, Yu Zhang:
Transforming Visual Scene Graphs to Image Captions. CoRR abs/2305.02177 (2023) - [i56]Chaoya Jiang, Wei Ye, Haiyang Xu, Ming Yan, Shikun Zhang, Jie Zhang, Fei Huang:
Vision Langauge Pre-training by Contrastive Learning with Cross-Modal Similarity Regulation. CoRR abs/2305.04474 (2023) - [i55]Qianglong Chen, Feng Ji, Feng-Lin Li, Guohai Xu, Ming Yan, Ji Zhang, Yin Zhang:
AMTSS: An Adaptive Multi-Teacher Single-Student Knowledge Distillation Framework For Multilingual Language Inference. CoRR abs/2305.07928 (2023) - [i54]Qianglong Chen, Guohai Xu, Ming Yan, Ji Zhang, Fei Huang, Luo Si, Yin Zhang:
Distinguish Before Answer: Generating Contrastive Explanation as Knowledge for Commonsense Question Answering. CoRR abs/2305.08135 (2023) - [i53]Linhui Xiao, Xiaoshan Yang, Fang Peng, Ming Yan, Yaowei Wang, Changsheng Xu:
CLIP-VG: Self-paced Curriculum Adapting of CLIP via Exploiting Pseudo-Language Labels for Visual Grounding. CoRR abs/2305.08685 (2023) - [i52]Haiyang Xu, Qinghao Ye, Xuan Wu, Ming Yan, Yuan Miao, Jiabo Ye, Guohai Xu, Anwen Hu, Yaya Shi, Guangwei Xu, Chenliang Li, Qi Qian, Maofei Que, Ji Zhang, Xiao Zeng, Fei Huang:
Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Dataset for Pre-training and Benchmarks. CoRR abs/2306.04362 (2023) - [i51]Jiabo Ye, Anwen Hu, Haiyang Xu, Qinghao Ye, Ming Yan, Yuhao Dan, Chenlin Zhao, Guohai Xu, Chenliang Li, Junfeng Tian, Qian Qi, Ji Zhang, Fei Huang:
mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding. CoRR abs/2307.02499 (2023) - [i50]Chaoya Jiang, Haiyang Xu, Wei Ye, Qinghao Ye, Chenliang Li, Ming Yan, Bin Bi, Shikun Zhang, Fei Huang, Songfang Huang:
BUS: Efficient and Effective Vision-language Pre-training with Bottom-Up Patch Summarization. CoRR abs/2307.08504 (2023) - [i49]Guohai Xu, Jiayi Liu, Ming Yan, Haotian Xu, Jinghui Si, Zhuoran Zhou, Peng Yi, Xing Gao, Jitao Sang, Rong Zhang, Ji Zhang, Chao Peng, Fei Huang, Jingren Zhou:
CValues: Measuring the Values of Chinese Large Language Models from Safety to Responsibility. CoRR abs/2307.09705 (2023) - [i48]Chaoya Jiang, Haiyang Xu, Wei Ye, Qinghao Ye, Chenliang Li, Ming Yan, Bin Bi, Shikun Zhang, Ji Zhang, Fei Huang:
COPA: Efficient Vision-Language Pre-training Through Collaborative Object- and Patch-Text Alignment. CoRR abs/2308.03475 (2023) - [i47]Chao Wang, Ming Yan, Junjie Yu:
Sorted L1/L2 Minimization for Sparse Signal Recovery. CoRR abs/2308.04125 (2023) - [i46]Ming Yan, Junjie Chen, Jie M. Zhang, Xuejie Cao, Chen Yang, Mark Harman:
COCO: Testing Code Generation Systems via Concretized Instructions. CoRR abs/2308.13319 (2023) - [i45]Junyang Wang, Yiyang Zhou, Guohai Xu, Pengcheng Shi, Chenlin Zhao, Haiyang Xu, Qinghao Ye, Ming Yan, Ji Zhang, Jihua Zhu, Jitao Sang, Haoyu Tang:
Evaluation and Analysis of Hallucination in Large Vision-Language Models. CoRR abs/2308.15126 (2023) - [i44]Chenliang Li, Hehong Chen, Ming Yan, Weizhou Shen, Haiyang Xu, Zhikai Wu, Zhicheng Zhang, Wenmeng Zhou, Yingda Chen, Chen Cheng, Hongzhu Shi, Ji Zhang, Fei Huang, Jingren Zhou:
ModelScope-Agent: Building Your Customizable Agent System with Open-source Large Language Models. CoRR abs/2309.00986 (2023) - [i43]Jiabo Ye, Anwen Hu, Haiyang Xu, Qinghao Ye, Ming Yan, Guohai Xu, Chenliang Li, Junfeng Tian, Qi Qian, Ji Zhang, Qin Jin, Liang He, Xin Alex Lin, Fei Huang:
UReader: Universal OCR-free Visually-situated Language Understanding with Multimodal Large Language Model. CoRR abs/2310.05126 (2023) - [i42]Hongzhan Chen, Siyue Wu, Xiaojun Quan, Rui Wang, Ming Yan, Ji Zhang:
MCC-KD: Multi-CoT Consistent Knowledge Distillation. CoRR