


Xinhan Di
2020 – today
- 2025
- [c10] Zhifeng Xie, Hao Li, Huiming Ding, Mengtian Li, Xinhan Di, Ying Cao: HieraFashDiff: Hierarchical Fashion Design with Multi-stage Diffusion Models. AAAI 2025: 8762-8770
- [i29] Shuangtao Li, Shuaihao Dong, Kexin Luan, Xinhan Di, Chaofan Ding: Enhancing Reasoning through Process Supervision with Monte Carlo Tree Search. CoRR abs/2501.01478 (2025)
- [i28] Kristin Qi, Xinhan Di: Attentional Triple-Encoder Network in Spatiospectral Domains for Medical Image Segmentation. CoRR abs/2503.16389 (2025)
- [i27] Haomin Zhang, Sizhe Shan, Haoyu Wang, Zihao Chen, Xiulong Liu, Chaofan Ding, Xinhan Di: Enhance Generation Quality of Flow Matching V2A Model via Multi-Step CoT-Like Guidance and Combined Preference Optimization. CoRR abs/2503.22200 (2025)
- [i26] Yunming Liang, Zihao Chen, Chaofan Ding, Xinhan Di: DeepSound-V1: Start to Think Step-by-Step in the Audio Generation from Videos. CoRR abs/2503.22208 (2025)
- [i25] Haomin Zhang, Chang Liu, Junjie Zheng, Zihao Chen, Chaofan Ding, Xinhan Di: DeepAudio-V1:Towards Multi-Modal Multi-Stage End-to-End Video to Speech and Audio Generation. CoRR abs/2503.22265 (2025)
- [i24] Junjie Zheng, Zihao Chen, Chaofan Ding, Xinhan Di: DeepDubber-V1: Towards High Quality and Dialogue, Narration, Monologue Adaptive Movie Dubbing Via Multi-Modal Chain-of-Thoughts Reasoning Guidance. CoRR abs/2503.23660 (2025)
- 2024
- [j1] Xinkang Zhang, Xiaokun Dai, Ziqun Zhang, Xinhan Di, Xinrong Chen: Hand-Object Pose Estimation and Reconstruction Based on Signed Distance Field and Multiscale Feature Interaction. IEEE Trans. Ind. Informatics 20(9): 11242-11251 (2024)
- [i23] Xinhan Di, Zihao Chen, Yunming Liang, Junjie Zheng, Yihua Wang, Chaofan Ding: Bailing-TTS: Chinese Dialectal Speech Synthesis Towards Human-like Spontaneous Representation. CoRR abs/2408.00284 (2024)
- [i22] Huan Yang, Jiahui Chen, Chaofan Ding, Runhua Shi, Siyu Xiong, Qingqi Hong, Xiaoqi Mo, Xinhan Di: Self-Supervised Learning of Deviation in Latent Representation for Co-speech Gesture Video Generation. CoRR abs/2409.17674 (2024)
- [i21] Shuting Zhao, Chenkang Du, Kristin Qi, Xinrong Chen, Xinhan Di: Towards Full-parameter and Parameter-efficient Self-learning For Endoscopic Camera Depth Estimation. CoRR abs/2410.00979 (2024)
- [i20] Wenmo Qiu, Xinhan Di: OCC-MLLM:Empowering Multimodal Large Language Model For the Understanding of Occluded Objects. CoRR abs/2410.01261 (2024)
- [i19] Shuxin Yang, Xinhan Di: OCC-MLLM-Alpha:Empowering Multi-modal Large Language Model for the Understanding of Occluded Objects with Self-Supervised Test-Time Learning. CoRR abs/2410.01861 (2024)
- [i18] Wenjing Gao, Yuanyuan Yang, Jianrui Wei, Xuntao Yin, Xinhan Di: Multi-Stage Graph Learning for fMRI Analysis to Diagnose Neuro-Developmental Disorders. CoRR abs/2410.05342 (2024)
- [i17] Zihao Chen, Haomin Zhang, Xinhan Di, Haoyu Wang, Sizhe Shan, Junjie Zheng, Yunming Liang, Yihan Fan, Xinfa Zhu, Wenjie Tian, Yihua Wang, Chaofan Ding, Lei Xie: YingSound: Video-Guided Sound Effects Generation with Multi-modal Chain-of-Thought Controls. CoRR abs/2412.09168 (2024)
- [i16] Changqun Li, Chaofan Ding, Kexin Luan, Xinhan Di: Low-Rank Adaptation with Task-Relevant Feature Enhancement for Fine-tuning Language Models. CoRR abs/2412.09827 (2024)
- [i15] Gongyu Chen, Haomin Zhang, Chaofan Ding, Zihao Chen, Xinhan Di: Multiple Consistency-guided Test-Time Adaptation for Contrastive Audio-Language Models with Unlabeled Audio. CoRR abs/2412.17306 (2024)
- [i14] Huchen Jiang, Yangyang Ma, Chaofan Ding, Kexin Luan, Xinhan Di: Towards Intrinsic Self-Correction Enhancement in Monte Carlo Tree Search Boosted Reasoning via Iterative Preference Learning. CoRR abs/2412.17397 (2024)
- 2023
- [c9] Xinhan Di, Xiaokun Dai, Xinkang Zhang, Xinrong Chen: Dual Attention Poser: Dual Path Body Tracking Based on Attention. CVPR Workshops 2023: 2795-2804
- [c8] Xinkang Zhang, Xinhan Di, Xiaokun Dai, Xinrong Chen: An Attention-Based Signed Distance Field Estimation Method for Hand-Object Reconstruction. VR Workshops 2023: 675-676
- 2022
- [c7] Xinhan Di, Pengqian Yu: LWA-HAND: Lightweight Attention Hand for Interacting Hand Reconstruction. ECCV Workshops (3) 2022: 722-738
- [i13] Xinhan Di, Pengqian Yu: LWA-HAND: Lightweight Attention Hand for Interacting Hand Reconstruction. CoRR abs/2208.09815 (2022)
- [i12] Xinhan Di, Pengqian Yu: Hierarchical Reinforcement Learning for Furniture Layout in Virtual Indoor Scenes. CoRR abs/2210.10431 (2022)
- 2021
- [i11] Xinhan Di, Pengqian Yu: Deep Reinforcement Learning for Producing Furniture Layout in Indoor Scenes. CoRR abs/2101.07462 (2021)
- [i10] Xinhan Di, Pengqian Yu: Multi-Agent Reinforcement Learning of 3D Furniture Layout Simulation in Indoor Graphics Scenes. CoRR abs/2102.09137 (2021)
- 2020
- [c6] Xinhan Di, Pengqian Yu, Hong Zhu, Lei Cai, Qiuyan Sheng, Changyu Sun, Ling-Qiang Ran: Structural Plan of Indoor Scenes with Personalized Preferences. ECCV Workshops (4) 2020: 455-468
- [c5] Xinhan Di, Pengqian Yu, Rui Bu, Mingchao Sun: Mutual Information Maximization in Graph Neural Networks. IJCNN 2020: 1-7
- [i9] Yuli Zhang, Yeyang He, Shaowen Zhu, Xinhan Di: The Direction-Aware, Learnable, Additive Kernels and the Adversarial Network for Deep Floor Plan Recognition. CoRR abs/2001.11194 (2020)
- [i8] Xinhan Di, Pengqian Yu, Hong Zhu, Lei Cai, Qiuyan Sheng, Changyu Sun: Towards Adversarial Planning for Indoor Scenes with Rotation. CoRR abs/2006.13527 (2020)
- [i7] Xinhan Di, Pengqian Yu, Hong Zhu, Lei Cai, Qiuyan Sheng, Changyu Sun: Structural Plan of Indoor Scenes with Personalized Preferences. CoRR abs/2008.01323 (2020)
- [i6] Xinhan Di, Pengqian Yu, Danfeng Yang, Hong Zhu, Changyu Sun, YinDong Liu: Deep Layout of Custom-size Furniture through Multiple-domain Learning. CoRR abs/2012.08131 (2020)
- [i5] Xinhan Di, Pengqian Yu, Danfeng Yang, Hong Zhu, Changyu Sun, YinDong Liu: End-to-end Generative Floor-plan and Layout with Attributes and Relation Graph. CoRR abs/2012.08514 (2020)
2010 – 2019
- 2019
- [i4] Xinhan Di, Pengqian Yu, Mingchao Sun, Rui Bu: Neighborhood Enlargement in Graph Neural Networks. CoRR abs/1905.08509 (2019)
- 2018
- [c4] Yangyan Li, Rui Bu, Mingchao Sun, Wei Wu, Xinhan Di, Baoquan Chen: PointCNN: Convolution On X-Transformed Points. NeurIPS 2018: 828-838
- [i3] Xinhan Di, Pengqian Yu, Meng Tian: Towards Adversarial Training with Moderate Performance Improvement for Neural Network Classification. CoRR abs/1807.00340 (2018)
- [i2] Xinhan Di, Pengqian Yu, Meng Tian: Ambient Hidden Space of Generative Adversarial Networks. CoRR abs/1807.00780 (2018)
- 2017
- [c3] Xinhan Di, Pengqian Yu: Max-Boost-GAN: Max Operation to Boost Generative Ability of Generative Adversarial Networks. ICCV Workshops 2017: 1156-1164
- [c2] Xinhan Di, Pengqian Yu: Multiplicative Noise Channel in Generative Adversarial Networks. ICCV Workshops 2017: 1165-1172
- [i1] Xinhan Di, Pengqian Yu: 3D Reconstruction of Simple Objects from A Single View Silhouette Image. CoRR abs/1701.04752 (2017)
- 2016
- [c1] Xinhan Di, Rozenn Dahyot, Mukta Prasad: Deep Shape from a Low Number of Silhouettes. ECCV Workshops (3) 2016: 251-265
last updated on 2025-04-22 21:05 CEST by the dblp team
all metadata released as open data under CC0 1.0 license