default search action
Ying Shan
Person information
SPARQL queries
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2024
- [j22]Weihao Cheng, Ying Shan:
Learning layout generation for virtual worlds. Comput. Vis. Media 10(3): 577-592 (2024) - [j21]Ziqi Zhang, Zongyang Ma, Chunfeng Yuan, Yuxin Chen, Peijin Wang, Zhongang Qi, Chenglei Hao, Bing Li, Ying Shan, Weiming Hu, Stephen J. Maybank:
Chinese Title Generation for Short Videos: Dataset, Metric and Algorithm. IEEE Trans. Pattern Anal. Mach. Intell. 46(7): 5192-5208 (2024) - [j20]Yihua Huang, Yan-Pei Cao, Yu-Kun Lai, Ying Shan, Lin Gao:
NeRF-Texture: Synthesizing Neural Radiance Field Textures. IEEE Trans. Pattern Anal. Mach. Intell. 46(9): 5986-6000 (2024) - [j19]Yuxin Chen, Ziqi Zhang, Zhongang Qi, Chunfeng Yuan, Jie Wang, Ying Shan, Bing Li, Weiming Hu, Xiaohu Qie, Jianping Wu:
DARTScore: DuAl-Reconstruction Transformer for Video Captioning Evaluation. IEEE Trans. Circuits Syst. Video Technol. 34(4): 2041-2055 (2024) - [j18]Dan Zhang, Wenzheng Feng, Yuandong Wang, Zhongang Qi, Ying Shan, Jie Tang:
DropConn: Dropout Connection Based Random GNNs for Molecular Property Prediction. IEEE Trans. Knowl. Data Eng. 36(2): 518-529 (2024) - [j17]Chen Li, Yixiao Ge, Dian Li, Ying Shan:
Vision-Language Instruction Tuning: A Review and Analysis. Trans. Mach. Learn. Res. 2024 (2024) - [j16]Jiashuo Yu, Junfu Pu, Ying Cheng, Rui Feng, Ying Shan:
Learning Music-Dance Representations Through Explicit-Implicit Rhythm Synchronization. IEEE Trans. Multim. 26: 8454-8463 (2024) - [c144]Weihao Cheng, Yan-Pei Cao, Ying Shan:
SparseGNV: Generating Novel Views of Indoor Scenes with Sparse RGB-D Images. AAAI 2024: 1308-1316 - [c143]Shi-Sheng Huang, Zi-Xin Zou, Yichi Zhang, Yan-Pei Cao, Ying Shan:
SC-NeuS: Consistent Neural Surface Reconstruction from Sparse and Noisy Views. AAAI 2024: 2357-2365 - [c142]Chong Mou, Xintao Wang, Liangbin Xie, Yanze Wu, Jian Zhang, Zhongang Qi, Ying Shan:
T2I-Adapter: Learning Adapters to Dig Out More Controllable Ability for Text-to-Image Diffusion Models. AAAI 2024: 4296-4304 - [c141]Tao Wu, Xuewei Li, Zhongang Qi, Di Hu, Xintao Wang, Ying Shan, Xi Li:
SphereDiffusion: Spherical Geometry-Aware Distortion Resilient Diffusion Model. AAAI 2024: 6126-6134 - [c140]Yiyu Zhuang, Qi Zhang, Xuan Wang, Hao Zhu, Ying Feng, Xiaoyu Li, Ying Shan, Xun Cao:
A Pre-convolved Representation for Plug-and-Play Neural Illumination Fields. AAAI 2024: 7828-7836 - [c139]Zixin Zou, Weihao Cheng, Yan-Pei Cao, Shi-Sheng Huang, Ying Shan, Song-Hai Zhang:
Sparse3D: Distilling Multiview-Consistent Diffusion for Object Reconstruction from Sparse Views. AAAI 2024: 7900-7908 - [c138]Chengyue Wu, Yukang Gan, Yixiao Ge, Zeyu Lu, Jiahao Wang, Ye Feng, Ying Shan, Ping Luo:
LLaMA Pro: Progressive LLaMA with Block Expansion. ACL (1) 2024: 6518-6537 - [c137]Shansong Liu, Atin Sakkeer Hussain, Chenshuo Sun, Ying Shan:
Music Understanding LLaMA: Advancing Text-to-Music Generation with Question Answering and Captioning. ICASSP 2024: 286-290 - [c136]Tianjun Mao, Shansong Liu, Yunxuan Zhang, Dian Li, Ying Shan:
Unified Pretraining Target Based Video-Music Retrieval with Music Rhythm and Video Optical Flow Information. ICASSP 2024: 7890-7894 - [c135]Shansong Liu, Xu Li, Dian Li, Ying Shan:
Humtrans: A Novel Open-Source Dataset for Humming Melody Transcription and Beyond. ICASSP 2024: 7915-7919 - [c134]Binzhu Sha, Xu Li, Zhiyong Wu, Ying Shan, Helen Meng:
Neural Concatenative Singing Voice Conversion: Rethinking Concatenation-Based Approach for One-Shot Singing Voice Conversion. ICASSP 2024: 12577-12581 - [c133]Yuying Ge, Sijie Zhao, Ziyun Zeng, Yixiao Ge, Chen Li, Xintao Wang, Ying Shan:
Making LLaMA SEE and Draw with SEED Tokenizer. ICLR 2024 - [c132]Yingqing He, Shaoshu Yang, Haoxin Chen, Xiaodong Cun, Menghan Xia, Yong Zhang, Xintao Wang, Ran He, Qifeng Chen, Ying Shan:
ScaleCrafter: Tuning-free Higher-Resolution Visual Generation with Diffusion Models. ICLR 2024 - [c131]Chong Mou, Xintao Wang, Jiechong Song, Ying Shan, Jian Zhang:
DragonDiffusion: Enabling Drag-style Manipulation on Diffusion Models. ICLR 2024 - [c130]Haonan Qiu, Menghan Xia, Yong Zhang, Yingqing He, Xintao Wang, Ying Shan, Ziwei Liu:
FreeNoise: Tuning-Free Longer Video Diffusion via Noise Rescheduling. ICLR 2024 - [c129]Jiaxu Zhang, Shaoli Huang, Zhigang Tu, Xin Chen, Xiaohang Zhan, Gang Yu, Ying Shan:
TapMo: Shape-aware Motion Generation of Skeleton-free Characters. ICLR 2024 - [c128]Zhouxia Wang, Ziyang Yuan, Xintao Wang, Yaowei Li, Tianshui Chen, Menghan Xia, Ping Luo, Ying Shan:
MotionCtrl: A Unified and Flexible Motion Controller for Video Generation. SIGGRAPH (Conference Paper Track) 2024: 114 - [c127]Dan Zhang, Yangliao Geng, Wenwen Gong, Zhongang Qi, Zhiyu Chen, Xing Tang, Ying Shan, Yuxiao Dong, Jie Tang:
RecDCL: Dual Contrastive Learning for Recommendation. WWW 2024: 3655-3666 - [i200]Chengyue Wu, Yukang Gan, Yixiao Ge, Zeyu Lu, Jiahao Wang, Ye Feng, Ping Luo, Ying Shan:
LLaMA Pro: Progressive LLaMA with Block Expansion. CoRR abs/2401.02415 (2024) - [i199]Jay Zhangjie Wu, Guian Fang, Haoning Wu, Xintao Wang, Yixiao Ge, Xiaodong Cun, David Junhao Zhang, Jia-Wei Liu, Yuchao Gu, Rui Zhao, Weisi Lin, Wynne Hsu, Ying Shan, Mike Zheng Shou:
Towards A Better Metric for Text-to-Video Generation. CoRR abs/2401.07781 (2024) - [i198]Haoxin Chen, Yong Zhang, Xiaodong Cun, Menghan Xia, Xintao Wang, Chao Weng, Ying Shan:
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models. CoRR abs/2401.09047 (2024) - [i197]Xiaohu Jiang, Yixiao Ge, Yuying Ge, Chun Yuan, Ying Shan:
Supervised Fine-tuning in turn Improves Visual Foundation Models. CoRR abs/2401.10222 (2024) - [i196]Yiyuan Zhang, Xiaohan Ding, Kaixiong Gong, Yixiao Ge, Ying Shan, Xiangyu Yue:
Multimodal Pathway: Improve Transformers with Irrelevant Data from Other Modalities. CoRR abs/2401.14405 (2024) - [i195]Jingyu Zhuang, Di Kang, Yan-Pei Cao, Guanbin Li, Liang Lin, Ying Shan:
TIP-Editor: An Accurate 3D Editor Following Both Text-Prompts And Image-Prompts. CoRR abs/2401.14828 (2024) - [i194]Dan Zhang, Yangliao Geng, Wenwen Gong, Zhongang Qi, Zhiyu Chen, Xing Tang, Ying Shan, Yuxiao Dong, Jie Tang:
RecDCL: Dual Contrastive Learning for Recommendation. CoRR abs/2401.15635 (2024) - [i193]Tianheng Cheng, Lin Song, Yixiao Ge, Wenyu Liu, Xinggang Wang, Ying Shan:
YOLO-World: Real-Time Open-Vocabulary Object Detection. CoRR abs/2401.17270 (2024) - [i192]Xiaoyu Li, Qi Zhang, Di Kang, Weihao Cheng, Yiming Gao, Jingbo Zhang, Zhihao Liang, Jing Liao, Yan-Pei Cao, Ying Shan:
Advances in 3D Generation: A Survey. CoRR abs/2401.17807 (2024) - [i191]Chong Mou, Xintao Wang, Jiechong Song, Ying Shan, Jian Zhang:
DiffEditor: Boosting Accuracy and Flexibility on Diffusion-based Image Editing. CoRR abs/2402.02583 (2024) - [i190]Lanqing Guo, Yingqing He, Haoxin Chen, Menghan Xia, Xiaodong Cun, Yufei Wang, Siyu Huang, Yong Zhang, Xintao Wang, Qifeng Chen, Ying Shan, Bihan Wen:
Make a Cheap Scaling: A Self-Cascade Diffusion Model for Higher-Resolution Adaptation. CoRR abs/2402.10491 (2024) - [i189]Xiuzhe Wu, Xiaoyang Lyu, Qihao Huang, Yong Liu, Yang Wu, Ying Shan, Xiaojuan Qi:
DO3D: Self-supervised Learning of Decomposed Object-aware 3D Motion and Depth from Monocular Videos. CoRR abs/2403.05895 (2024) - [i188]Xuan Ju, Xian Liu, Xintao Wang, Yuxuan Bian, Ying Shan, Qiang Xu:
BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion. CoRR abs/2403.06976 (2024) - [i187]Ang Li, Qiugen Xiao, Peng Cao, Jian Tang, Yi Yuan, Zijie Zhao, Xiaoyuan Chen, Liang Zhang, Xiangyang Li, Kaitong Yang, Weidong Guo, Yukang Gan, Xu Yu, Daniell Wang, Ying Shan:
HRLAIF: Improvements in Helpfulness and Harmlessness in Open-domain Reinforcement Learning From AI Feedback. CoRR abs/2403.08309 (2024) - [i186]Duotun Wang, Hengyu Meng, Zeyu Cai, Zhijing Shao, Qianxi Liu, Lin Wang, Mingming Fan, Ying Shan, Xiaohang Zhan, Zeyu Wang:
HeadEvolver: Text to Head Avatars via Locally Learnable Mesh Deformation. CoRR abs/2403.09326 (2024) - [i185]Tao Wu, Xuewei Li, Zhongang Qi, Di Hu, Xintao Wang, Ying Shan, Xi Li:
SphereDiffusion: Spherical Geometry-Aware Distortion Resilient Diffusion Model. CoRR abs/2403.10044 (2024) - [i184]Tian-Xing Xu, Wenbo Hu, Yu-Kun Lai, Ying Shan, Song-Hai Zhang:
Texture-GS: Disentangling the Geometry and Texture for 3D Gaussian Splatting Editing. CoRR abs/2403.10050 (2024) - [i183]Yujiao Jiang, Qingmin Liao, Xiaoyu Li, Li Ma, Qi Zhang, Chaopeng Zhang, Zongqing Lu, Ying Shan:
UV Gaussians: Joint Learning of Mesh Deformation and Gaussian Textures for Human Avatar Modeling. CoRR abs/2403.11589 (2024) - [i182]Ruyang Liu, Chen Li, Haoran Tang, Yixiao Ge, Ying Shan, Ge Li:
ST-LLM: Large Language Models Are Effective Temporal Learners. CoRR abs/2404.00308 (2024) - [i181]Jiale Xu, Weihao Cheng, Yiming Gao, Xintao Wang, Shenghua Gao, Ying Shan:
InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models. CoRR abs/2404.07191 (2024) - [i180]Yuying Ge, Sijie Zhao, Jinguo Zhu, Yixiao Ge, Kun Yi, Lin Song, Chen Li, Xiaohan Ding, Ying Shan:
SEED-X: Multimodal Models with Unified Multi-granularity Comprehension and Generation. CoRR abs/2404.14396 (2024) - [i179]Bohao Li, Yuying Ge, Yi Chen, Yixiao Ge, Ruimao Zhang, Ying Shan:
SEED-Bench-2-Plus: Benchmarking Multimodal Large Language Models with Text-Rich Visual Comprehension. CoRR abs/2404.16790 (2024) - [i178]Zidong Cao, Zhan Wang, Yexin Liu, Yan-Pei Cao, Ying Shan, Wei Zeng, Lin Wang:
Learning High-Quality Navigation and Zooming on Omnidirectional Images in Virtual Reality. CoRR abs/2405.00351 (2024) - [i177]Yuying Ge, Sijie Zhao, Chen Li, Yixiao Ge, Ying Shan:
SEED-Data-Edit Technical Report: A Hybrid Dataset for Instructional Image Editing. CoRR abs/2405.04007 (2024) - [i176]Chengyue Wu, Yixiao Ge, Qiushan Guo, Jiahao Wang, Zhixuan Liang, Zeyu Lu, Ying Shan, Ping Luo:
Plot2Code: A Comprehensive Benchmark for Evaluating Multi-modal Large Language Models in Code Generation from Scientific Plots. CoRR abs/2405.07990 (2024) - [i175]Chong Mou, Mingdeng Cao, Xintao Wang, Zhaoyang Zhang, Ying Shan, Jian Zhang:
ReVideo: Remake a Video with Motion and Content Control. CoRR abs/2405.13865 (2024) - [i174]Xiangjun Gao, Xiaoyu Li, Yiyu Zhuang, Qi Zhang, Wenbo Hu, Chaopeng Zhang, Yao Yao, Ying Shan, Long Quan:
Mani-GS: Gaussian Splatting Manipulation with Triangular Mesh. CoRR abs/2405.17811 (2024) - [i173]Jinbo Xing, Hanyuan Liu, Menghan Xia, Yong Zhang, Xintao Wang, Ying Shan, Tien-Tsin Wong:
ToonCrafter: Generative Cartoon Interpolation. CoRR abs/2405.17933 (2024) - [i172]Hanchao Liu, Xiaohang Zhan, Shaoli Huang, Tai-Jiang Mu, Ying Shan:
Programmable Motion Generation for Open-Set Motion Control Tasks. CoRR abs/2405.19283 (2024) - [i171]Muyao Niu, Xiaodong Cun, Xintao Wang, Yong Zhang, Ying Shan, Yinqiang Zheng:
MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model. CoRR abs/2405.20222 (2024) - [i170]Sijie Zhao, Yong Zhang, Xiaodong Cun, Shaoshu Yang, Muyao Niu, Xiaoyu Li, Wenbo Hu, Ying Shan:
CV-VAE: A Compatible Video VAE for Latent Generative Video Models. CoRR abs/2405.20279 (2024) - [i169]Shaoshu Yang, Yong Zhang, Xiaodong Cun, Ying Shan, Ran He:
ZeroSmooth: Training-free Diffuser Adaptation for High Frame Rate Video Generation. CoRR abs/2406.00908 (2024) - [i168]Yicheng Xiao, Lin Song, Shaoli Huang, Jiangshan Wang, Siyu Song, Yixiao Ge, Xiu Li, Ying Shan:
GrootVL: Tree Topology is All You Need in State Space Model. CoRR abs/2406.02395 (2024) - [i167]Tao Yang, Yingmin Luo, Zhongang Qi, Yang Wu, Ying Shan, Chang Wen Chen:
PosterLLaVa: Constructing a Unified Multi-modal Layout Generator with LLM. CoRR abs/2406.02884 (2024) - [i166]Xubing Ye, Yukang Gan, Xiaoke Huang, Yixiao Ge, Ying Shan, Yansong Tang:
VoCo-LLaMA: Towards Vision Compression with Large Language Models. CoRR abs/2406.12275 (2024) - [i165]Yaowei Li, Xintao Wang, Zhaoyang Zhang, Zhouxia Wang, Ziyang Yuan, Liangbin Xie, Yuexian Zou, Ying Shan:
Image Conductor: Precision Control for Interactive Video Synthesis. CoRR abs/2406.15339 (2024) - [i164]Xuan Ju, Yiming Gao, Zhaoyang Zhang, Ziyang Yuan, Xintao Wang, Ailing Zeng, Yu Xiong, Qiang Xu, Ying Shan:
MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions. CoRR abs/2407.06358 (2024) - [i163]Zongyang Ma, Ziqi Zhang, Yuxin Chen, Zhongang Qi, Chunfeng Yuan, Bing Li, Yingmin Luo, Xu Li, Xiaojuan Qi, Ying Shan, Weiming Hu:
EA-VTR: Event-Aware Video-Text Retrieval. CoRR abs/2407.07478 (2024) - [i162]Yuxin Chen, Zongyang Ma, Ziqi Zhang, Zhongang Qi, Chunfeng Yuan, Bing Li, Junfu Pu, Ying Shan, Xiaojuan Qi, Weiming Hu:
How to Make Cross Encoder a Good Teacher for Efficient Image-Text Retrieval? CoRR abs/2407.07479 (2024) - [i161]Shuai Yang, Yuying Ge, Yang Li, Yukang Chen, Yixiao Ge, Ying Shan, Yingcong Chen:
SEED-Story: Multimodal Long Story Generation with Large Language Model. CoRR abs/2407.08683 (2024) - [i160]Qinyu Yang, Haoxin Chen, Yong Zhang, Menghan Xia, Xiaodong Cun, Zhixun Su, Ying Shan:
Noise Calibration: Plug-and-play Content-Preserving Video Enhancement using Pre-trained Video Diffusion Models. CoRR abs/2407.10285 (2024) - [i159]Xuan Ju, Junhao Zhuang, Zhaoyang Zhang, Yuxuan Bian, Qiang Xu, Ying Shan:
Image Inpainting Models are Effective Tools for Instruction-guided Image Editing. CoRR abs/2407.13139 (2024) - [i158]Chaolei Tan, Zihang Lin, Junfu Pu, Zhongang Qi, Wei-Yi Pei, Zhi Qu, Yexin Wang, Ying Shan, Wei-Shi Zheng, Jian-Fang Hu:
SynopGround: A Large-Scale Dataset for Multi-Paragraph Video Grounding from TV Dramas and Synopses. CoRR abs/2408.01669 (2024) - 2023
- [j15]Hao Ren, Ziqiang Zheng, Yang Wu, Hong Lu, Yang Yang, Ying Shan, Sai-Kit Yeung:
ACNet: Approaching-and-Centralizing Network for Zero-Shot Sketch-Based Image Retrieval. IEEE Trans. Circuits Syst. Video Technol. 33(9): 5022-5035 (2023) - [j14]Xiao Wang, Weirong Ye, Zhongang Qi, Guangge Wang, Jianping Wu, Ying Shan, Xiaohu Qie, Hanzi Wang:
Task-Aware Dual-Representation Network for Few-Shot Action Recognition. IEEE Trans. Circuits Syst. Video Technol. 33(10): 5932-5946 (2023) - [c126]Yizhen Chen, Jie Wang, Lijian Lin, Zhongang Qi, Jin Ma, Ying Shan:
Tagging before Alignment: Integrating Multi-Modal Tags for Video-Text Retrieval. AAAI 2023: 396-404 - [c125]Lijian Lin, Xintao Wang, Zhongang Qi, Ying Shan:
Accelerating the Training of Video Super-resolution Models. AAAI 2023: 1595-1603 - [c124]Liangbin Xie, Xintao Wang, Shuwei Shi, Jinjin Gu, Chao Dong, Ying Shan:
Mitigating Artifacts in Real-World Video Super-resolution Models. AAAI 2023: 2956-2964 - [c123]Binjie Zhang, Shupeng Su, Yixiao Ge, Xuyuan Xu, Yexin Wang, Chun Yuan, Mike Zheng Shou, Ying Shan:
Darwinian Model Upgrades: Model Evolving with Selective Compatibility. AAAI 2023: 3393-3400 - [c122]Zhihan Yang, Zhiyong Wu, Ying Shan, Jia Jia:
What Does Your Face Sound Like? 3D Face Shape towards Voice. AAAI 2023: 13905-13913 - [c121]Limao Xiong, Jie Zhou, Qunxi Zhu, Xiao Wang, Yuanbin Wu, Qi Zhang, Tao Gui, Xuanjing Huang, Jin Ma, Ying Shan:
A Confidence-based Partial Label Learning Model for Crowd-Annotated Named Entity Recognition. ACL (Findings) 2023: 1375-1386 - [c120]Rui Zheng, Zhiheng Xi, Qin Liu, Wenbin Lai, Tao Gui, Qi Zhang, Xuanjing Huang, Jin Ma, Ying Shan, Weifeng Ge:
Characterizing the Impacts of Instances on Robustness. ACL (Findings) 2023: 2314-2332 - [c119]Songyang Gao, Shihan Dou, Yan Liu, Xiao Wang, Qi Zhang, Zhongyu Wei, Jin Ma, Ying Shan:
DSRM: Boost Textual Adversarial Training with Distribution Shift Risk Minimization. ACL (1) 2023: 12177-12189 - [c118]Songyang Gao, Shihan Dou, Qi Zhang, Xuanjing Huang, Jin Ma, Ying Shan:
On the Universal Adversarial Perturbations for Efficient Data-free Adversarial Detection. ACL (Findings) 2023: 13573-13581 - [c117]Yiming Gao, Yan-Pei Cao, Ying Shan:
SurfelNeRF: Neural Surfel Radiance Fields for Online Photorealistic Reconstruction of Indoor Scenes. CVPR 2023: 108-118 - [c116]Fei Yin, Yong Zhang, Xuan Wang, Tengfei Wang, Xiaoyu Li, Yuan Gong, Yanbo Fan, Xiaodong Cun, Ying Shan, Cengiz Öztireli, Yujiu Yang:
3D GAN Inversion with Facial Symmetry Prior. CVPR 2023: 342-351 - [c115]Youxin Pang, Yong Zhang, Weize Quan, Yanbo Fan, Xiaodong Cun, Ying Shan, Dong-Ming Yan:
DPE: Disentanglement of Pose and Expression for General Video Portrait Editing. CVPR 2023: 427-436 - [c114]Fang Zhao, Zekun Li, Shaoli Huang, Junwu Weng, Tianfei Zhou, Guo-Sen Xie, Jue Wang, Ying Shan:
Learning Anchor Transformations for 3D Garment Animation. CVPR 2023: 491-500 - [c113]Mingdeng Cao, Chong Mou, Fanghua Yu, Xintao Wang, Yinqiang Zheng, Jian Zhang, Chao Dong, Gen Li, Ying Shan, Radu Timofte, Xiaopeng Sun, Weiqi Li, Zhenyu Zhang, Xuhan Sheng, Bin Chen, Haoyu Ma, Ming Cheng, Shijie Zhao, Wanwan Cui, Tianyu Xu, Chunyang Li, Long Bao, Heng Sun, Huaibo Huang, Xiaoqiang Zhou, Yuang Ai, Ran He, Renlong Wu, Yi Yang, Zhilu Zhang, Shuohao Zhang, Junyi Li, Yunjin Chen, Dongwei Ren, Wangmeng Zuo, Qian Wang, Hao-Hsiang Yang, Yi-Chung Chen, Zhi-Kai Huang, Wei-Ting Chen, Yuan-Chun Chiang, Hua-En Chang, I-Hsiang Chen, Chia-Hsuan Hsieh, Sy-Yen Kuo, Zebin Zhang, Jiaqi Zhang, Yuhui Wang, Shuhao Cui, Junshi Huang, Li Zhu, Shuman Tian, Wei Yu, Bingchun Luo:
NTIRE 2023 Challenge on 360° Omnidirectional Image and Video Super-Resolution: Datasets, Methods and Results. CVPR Workshops 2023: 1731-1745 - [c112]Yunpeng Bai, Yanbo Fan, Xuan Wang, Yong Zhang, Jingxiang Sun, Chun Yuan, Ying Shan:
High-fidelity Facial Avatar Reconstruction from Monocular Video with Generative Priors. CVPR 2023: 4541-4551 - [c111]Jinpeng Wang, Yixiao Ge, Rui Yan, Yuying Ge, Kevin Qinghong Lin, Satoshi Tsutsui, Xudong Lin, Guanyu Cai, Jianping Wu, Ying Shan, Xiaohu Qie, Mike Zheng Shou:
All in One: Exploring Unified Video-Language Pre-Training. CVPR 2023: 6598-6608 - [c110]Yue Chen, Xingyu Chen, Xuan Wang, Qi Zhang, Yu Guo, Ying Shan, Fei Wang:
Local-to-Global Registration for Bundle-Adjusting Neural Radiance Fields. CVPR 2023: 8264-8273 - [c109]Wenxuan Zhang, Xiaodong Cun, Xuan Wang, Yong Zhang, Xi Shen, Yu Guo, Ying Shan, Fei Wang:
SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation. CVPR 2023: 8652-8661 - [c108]Yuxin Chen, Zongyang Ma, Ziqi Zhang, Zhongang Qi, Chunfeng Yuan, Ying Shan, Bing Li, Weiming Hu, Xiaohu Qie, Jianping Wu:
ViLEM: Visual-Language Error Modeling for Image-Text Retrieval. CVPR 2023: 11018-11027 - [c107]Hao Ai, Zidong Cao, Yan-Pei Cao, Ying Shan, Lin Wang:
HRDFuse: Monocular 360° Depth Estimation by Collaboratively Learning Holistic-with-Regional Depth Distributions. CVPR 2023: 13273-13282 - [c106]Fanghua Yu, Xintao Wang, Mingdeng Cao, Gen Li, Ying Shan, Chao Dong:
OSRT: Omnidirectional Image Super-Resolution with Distortion-aware Transformer. CVPR 2023: 13283-13292 - [c105]Jiaxu Zhang, Junwu Weng, Di Kang, Fang Zhao, Shaoli Huang, Xuefei Zhe, Linchao Bao, Ying Shan, Jue Wang, Zhigang Tu:
Skinned Motion Retargeting with Residual Perception of Motion Semantics & Geometry. CVPR 2023: 13864-13872 - [c104]Qiangqiang Wu, Tianyu Yang, Ziquan Liu, Baoyuan Wu, Ying Shan, Antoni B. Chan:
DropMAE: Masked Autoencoders with Spatial-Attention Dropout for Tracking Tasks. CVPR 2023: 14561-14571 - [c103]Jiale Xu, Xintao Wang, Weihao Cheng, Yan-Pei Cao, Ying Shan, Xiaohu Qie, Shenghua Gao:
Dream3D: Zero-Shot Text-to-3D Synthesis Using 3D Shape Prior and Text-to-Image Diffusion Models. CVPR 2023: 20908-20918 - [c102]Guangcong Zheng, Xianpan Zhou, Xuewei Li, Zhongang Qi, Ying Shan, Xi Li:
LayoutDiffusion: Controllable Diffusion Model for Layout-to-Image Generation. CVPR 2023: 22490-22499 - [c101]Teng Wang, Yixiao Ge, Feng Zheng, Ran Cheng, Ying Shan, Xiaohu Qie, Ping Luo:
Accelerating Vision-Language Pretraining with Free Language Modeling. CVPR 2023: 23161-23170 - [c100]Shusheng Yang, Yixiao Ge, Kun Yi, Dian Li, Ying Shan, Xiaohu Qie, Xinggang Wang:
RILS: Masked Visual Reconstruction in Language Semantic Space. CVPR 2023: 23304-23314 - [c99]Liang Chen, Yong Zhang, Yibing Song, Ying Shan, Lingqiao Liu:
Improved Test-Time Adaptation for Domain Generalization. CVPR 2023: 24172-24182 - [c98]Wenxi Ma, Tianxiang Hou, Qianji Di, Zhongang Qi, Ying Shan, Hanzi Wang:
ERBNet: An Effective Representation Based Network for Unbiased Scene Graph Generation. ICASSP 2023: 1-5 - [c97]Shaohuan Zhou, Xu Li, Zhiyong Wu, Ying Shan, Helen Meng:
Enhancing the Vocal Range of Single-Speaker Singing Voice Synthesis with Melody-Unsupervised Pre-Training. ICASSP 2023: 1-5 - [c96]Xiaotong Li, Zixuan Hu, Yixiao Ge, Ying Shan, Ling-Yu Duan:
Exploring Model Transferability through the Lens of Potential Energy. ICCV 2023: 5406-5415 - [c95]Yuxin Fang, Shusheng Yang, Shijie Wang, Yixiao Ge, Ying Shan, Xinggang Wang:
Unleashing Vanilla Vision Transformer with Masked Image Modeling for Object Detection. ICCV 2023: 6221-6230 - [c94]Jay Zhangjie Wu, Yixiao Ge, Xintao Wang, Stan Weixian Lei, Yuchao Gu, Yufei Shi, Wynne Hsu, Ying Shan, Xiaohu Qie, Mike Zheng Shou:
Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation. ICCV 2023: 7589-7599 - [c93]Zidong Cao, Hao Ai, Yan-Pei Cao, Ying Shan, Xiaohu Qie, Lin Wang:
OmniZoomer: Learning to Move and Zoom in on Sphere at High-Resolution. ICCV 2023: 12851-12861 - [c92]Zongyang Ma, Ziqi Zhang, Yuxin Chen, Zhongang Qi, Yingmin Luo, Zekun Li, Chunfeng Yuan, Bing Li, Xiaohu Qie, Ying Shan, Weiming Hu:
Order-Prompted Tag Sequence Generation for Video Tagging. ICCV 2023: 15635-15644 - [c91]Chenyang Qi, Xiaodong Cun, Yong Zhang, Chenyang Lei, Xintao Wang, Ying Shan, Qifeng Chen:
FateZero: Fusing Attentions for Zero-shot Text-based Video Editing. ICCV 2023: 15886-15896 - [c90]