Yuan Xie 0001
Person information
- affiliation: Alibaba DAMO Academy
- affiliation (former): University of California at Santa Barbara, CA, USA
- affiliation (2003 - 2013): Pennsylvania State University, University Park, PA, USA
- affiliation (PhD 2002): Princeton University, Princeton, NJ, USA
Other persons with the same name
- Yuan Xie — disambiguation page
- Yuan Xie 0002 — Shanghai Jiao Tong University, China
- Yuan Xie 0003 — California Institute of Technology, Center for Computational Regulatory Genomics, CA, USA
- Yuan Xie 0004 — Sun Yat-sen University, Guangzhou, China
- Yuan Xie 0005 — Indiana University Bloomington, USA
- Yuan Xie 0006 — East China Normal University, Shanghai, China (and 2 more)
- Yuan Xie 0007 — Guangdong University of Technology, Guangzhou, China
- Yuan Xie 0008 — Guilin University of Electronic Technology, School of Computer and Information Security, China
2020 – today
- 2025
- [j167]Ruiyang Ma, Jiayi Huang, Shijian Zhang, Yuan Xie, Guojie Luo:
NoCFuzzer: Automating NoC Verification in UVM. IEEE Trans. Comput. Aided Des. Integr. Circuits Syst. 44(1): 371-384 (2025)
- 2024
- [j166]Yiquan Chen, Yuan Xie, Yijing Wang, Jiexiong Xu, Zhen Jin, Anyu Li, Xiaoyan Fu, Qiang Liu, Wenzhi Chen:
Optimizing NVMe Storage for Large-Scale Deployment: Key Technologies and Strategies in Alibaba Cloud. IEEE Micro 44(5): 47-56 (2024) - [j165]Nan Wu, Yingjie Li, Hang Yang, Hanqiu Chen, Steve Dai, Cong Hao, Cunxi Yu, Yuan Xie:
Survey of Machine Learning for Software-assisted Hardware Design Verification: Past, Present, and Prospect. ACM Trans. Design Autom. Electr. Syst. 29(4): 1-42 (2024) - [c357]Zhaodong Chen, Andrew Kerr, Richard Cai, Jack Kosaian, Haicheng Wu, Yufei Ding, Yuan Xie:
EVT: Accelerating Deep Learning Training with Epilogue Visitor Tree. ASPLOS (3) 2024: 301-316 - [i80]Meng Wu, Mingyu Yan, Wenming Li, Xiaochun Ye, Dongrui Fan, Yuan Xie:
A Comprehensive Survey on GNN Characterization. CoRR abs/2408.01902 (2024)
- 2023
- [j164]Nan Wu, Yuan Xie:
A Survey of Machine Learning for Computer Architecture and Systems. ACM Comput. Surv. 55(3): 54:1-54:39 (2023) - [j163]Fengbin Tu, Yiqi Wang, Zihan Wu, Ling Liang, Yufei Ding, Bongjin Kim, Leibo Liu, Shaojun Wei, Yuan Xie, Shouyi Yin:
ReDCIM: Reconfigurable Digital Computing-In-Memory Processor With Unified FP/INT Pipeline for Cloud AI Acceleration. IEEE J. Solid State Circuits 58(1): 243-255 (2023) - [j162]Fengbin Tu, Zihan Wu, Yiqi Wang, Ling Liang, Liu Liu, Yufei Ding, Leibo Liu, Shaojun Wei, Yuan Xie, Shouyi Yin:
TranCIM: Full-Digital Bitline-Transpose CIM-based Sparse Transformer Accelerator With Pipeline/Parallel Reconfigurable Modes. IEEE J. Solid State Circuits 58(6): 1798-1809 (2023) - [j161]Guiming Wu, Qianwen He, Jiali Jiang, Zhenxiang Zhang, Yunfeng Shi, Xin Long, Linquan Jiang, Shuangchen Li, Yuan Xie, Changzheng Wei, Yuan Zhao, Ying Yan, Hui Zhang, Yinchao Zou:
E-Booster: A Field-Programmable Gate Array-Based Accelerator for Secure Tree Boosting Using Additively Homomorphic Encryption. IEEE Micro 43(5): 88-96 (2023) - [j160]Haiyang Lin, Mingyu Yan, Xiaochun Ye, Dongrui Fan, Shirui Pan, Wenguang Chen, Yuan Xie:
A Comprehensive Survey on Distributed Training of Graph Neural Networks. Proc. IEEE 111(12): 1572-1606 (2023) - [j159]Ling Liang, Jilan Lin, Zheng Qu, Ishtiyaque Ahmad, Fengbin Tu, Trinabh Gupta, Yufei Ding, Yuan Xie:
SPG: Structure-Private Graph Database via SqueezePIR. Proc. VLDB Endow. 16(7): 1615-1628 (2023) - [j158]Fengbin Tu, Yiqi Wang, Ling Liang, Yufei Ding, Leibo Liu, Shaojun Wei, Shouyi Yin, Yuan Xie:
SDP: Co-Designing Algorithm, Dataflow, and Architecture for In-SRAM Sparse NN Acceleration. IEEE Trans. Comput. Aided Des. Integr. Circuits Syst. 42(1): 109-121 (2023) - [j157]Nan Wu, Yuan Xie, Cong Hao:
IronMan-Pro: Multiobjective Design Space Exploration in HLS via Reinforcement Learning and Graph Neural Network-Based Modeling. IEEE Trans. Comput. Aided Des. Integr. Circuits Syst. 42(3): 900-913 (2023) - [j156]Bizhao Shi, Jiaxi Zhang, Zhuolun He, Xuechao Wei, Sicheng Li, Guojie Luo, Hongzhong Zheng, Yuan Xie:
Efficient Super-Resolution System With Block-Wise Hybridization and Quantized Winograd on FPGA. IEEE Trans. Comput. Aided Des. Integr. Circuits Syst. 42(11): 3910-3924 (2023) - [j155]Zhenhua Zhu, Hanbo Sun, Tongxin Xie, Yu Zhu, Guohao Dai, Lixue Xia, Dimin Niu, Xiaoming Chen, Xiaobo Sharon Hu, Yu Cao, Yuan Xie, Huazhong Yang, Yu Wang:
MNSIM 2.0: A Behavior-Level Modeling Tool for Processing-In-Memory Architectures. IEEE Trans. Comput. Aided Des. Integr. Circuits Syst. 42(11): 4112-4125 (2023) - [j154]Yiqi Wang, Fengbin Tu, Leibo Liu, Shaojun Wei, Yuan Xie, Shouyi Yin:
SPCIM: Sparsity-Balanced Practical CIM Accelerator With Optimized Spatial-Temporal Multi-Macro Utilization. IEEE Trans. Circuits Syst. I Regul. Pap. 70(1): 214-227 (2023) - [j153]Ling Liang, Xing Hu, Lei Deng, Yujie Wu, Guoqi Li, Yufei Ding, Peng Li, Yuan Xie:
Exploring Adversarial Attack in Spiking Neural Networks With Spike-Compatible Gradient. IEEE Trans. Neural Networks Learn. Syst. 34(5): 2569-2583 (2023) - [j152]Lei Deng, Yujie Wu, Yifan Hu, Ling Liang, Guoqi Li, Xing Hu, Yufei Ding, Peng Li, Yuan Xie:
Comprehensive SNN Compression Using ADMM Optimization and Activity Regularization. IEEE Trans. Neural Networks Learn. Syst. 34(6): 2791-2805 (2023) - [j151]Yanhong Wang, Tianchan Guan, Dimin Niu, Qiaosha Zou, Hongzhong Zheng, Chuanjin Richard Shi, Yuan Xie:
Accelerating Distributed GNN Training by Codes. IEEE Trans. Parallel Distributed Syst. 34(9): 2598-2614 (2023) - [c356]Zhiyao Li, Jiaxiang Li, Taijie Chen, Dimin Niu, Hongzhong Zheng, Yuan Xie, Mingyu Gao:
Spada: Accelerating Sparse Matrix Multiplication with Adaptive Dataflow. ASPLOS (2) 2023: 747-761 - [c355]Xuanle Ren, Zhaohui Chen, Zhen Gu, Yanheng Lu, Ruiguang Zhong, Wen-Jie Lu, Jiansong Zhang, Yichi Zhang, Hanghang Wu, Xiaofu Zheng, Heng Liu, Tingqiang Chu, Cheng Hong, Changzheng Wei, Dimin Niu, Yuan Xie:
CHAM: A Customized Homomorphic Encryption Accelerator for Fast Matrix-Vector Product. DAC 2023: 1-6 - [c354]Ao Ren, Yuhao Wang, Tao Zhang, Jiaxing Shi, Duo Liu, Xianzhang Chen, Yujuan Tan, Yuan Xie:
HBP: Hierarchically Balanced Pruning and Accelerator Co-Design for Efficient DNN Inference. DAC 2023: 1-6 - [c353]Nan Wu, Yingjie Li, Cong Hao, Steve Dai, Cunxi Yu, Yuan Xie:
Gamora: Graph Learning based Symbolic Reasoning for Large-Scale Boolean Networks. DAC 2023: 1-6 - [c352]Chen Bai, Xuechao Wei, Youwei Zhuo, Yi Cai, Hongzhong Zheng, Bei Yu, Yuan Xie:
Klotski: DNN Model Orchestration Framework for Dataflow Architecture Accelerators. ICCAD 2023: 1-9 - [c351]Siqi Li, Fengbin Tu, Liu Liu, Jilan Lin, Zheng Wang, Yangwook Kang, Yufei Ding, Yuan Xie:
ECSSD: Hardware/Data Layout Co-Designed In-Storage-Computing Architecture for Extreme Classification. ISCA 2023: 58:1-58:14 - [c350]Chen Bai, Jiayi Huang, Xuechao Wei, Yuzhe Ma, Sicheng Li, Hongzhong Zheng, Bei Yu, Yuan Xie:
ArchExplorer: Microarchitecture Exploration Via Bottleneck Analysis. MICRO 2023: 268-282 - [c349]Shulin Zeng, Zhenhua Zhu, Jun Liu, Haoyu Zhang, Guohao Dai, Zixuan Zhou, Shuangchen Li, Xuefei Ning, Yuan Xie, Huazhong Yang, Yu Wang:
DF-GAS: a Distributed FPGA-as-a-Service Architecture towards Billion-Scale Graph-based Approximate Nearest Neighbor Search. MICRO 2023: 283-296 - [c348]Guyue Huang, Zhengyang Wang, Po-An Tsai, Chen Zhang, Yufei Ding, Yuan Xie:
RM-STC: Row-Merge Dataflow Inspired GPU Sparse Tensor Core for Energy-Efficient Sparse Acceleration. MICRO 2023: 338-352 - [c347]Zheng Qu, Dimin Niu, Shuangchen Li, Hongzhong Zheng, Yuan Xie:
TT-GNN: Efficient On-Chip Graph Neural Network Training via Embedding Reformation and Hardware Optimization. MICRO 2023: 452-464 - [c346]Guyue Huang, Yang Bai, Liu Liu, Yuke Wang, Bei Yu, Yufei Ding, Yuan Xie:
ALCOP: Automatic Load-Compute Pipelining in Deep Learning Compiler for AI-GPUs. MLSys 2023 - [c345]Zhaodong Chen, Zheng Qu, Yuying Quan, Liu Liu, Yufei Ding, Yuan Xie:
Dynamic N:M Fine-Grained Structured Sparse Attention Mechanism. PPoPP 2023: 369-379 - [i79]Nan Wu, Yingjie Li, Cong Hao, Steve Dai, Cunxi Yu, Yuan Xie:
Gamora: Graph Learning based Symbolic Reasoning for Large-Scale Boolean Networks. CoRR abs/2303.08256 (2023) - [i78]Yiquan Chen, Zhen Jin, Yijing Wang, Yi Chen, Hao Yu, Jiexiong Xu, Jinlong Chen, Wenhai Lin, Kanghua Fang, Chengkun Wei, Qiang Liu, Yuan Xie, Wenzhi Chen:
High-performance and Scalable Software-based NVMe Virtualization Mechanism with I/O Queues Passthrough. CoRR abs/2304.05148 (2023) - [i77]Yuanwei Fang, Zihao Liu, Yanheng Lu, Jiawei Liu, Jiajie Li, Yi Jin, Jian Chen, Yenkuang Chen, Hongzhong Zheng, Yuan Xie:
NPS: A Framework for Accurate Program Sampling Using Graph Neural Network. CoRR abs/2304.08880 (2023) - [i76]Jianyu Xu, Hanwen Zhang, Ling Liang, Lei Deng, Yuan Xie, Guoqi Li:
NP-Hardness of Tensor Network Contraction Ordering. CoRR abs/2310.06140 (2023)
- 2022
- [j150]Linyong Huang, Zhe Zhang, Shuangchen Li, Dimin Niu, Yijin Guan, Hongzhong Zheng, Yuan Xie:
Practical Near-Data-Processing Architecture for Large-Scale Distributed Graph Neural Network. IEEE Access 10: 46796-46807 (2022) - [j149]Zhaoyang Du, Yijin Guan, Tianchan Guan, Dimin Niu, Hongzhong Zheng, Yuan Xie:
Accelerating CPU-Based Sparse General Matrix Multiplication With Binary Row Merging. IEEE Access 10: 79237-79248 (2022) - [j148]Zhaoyang Du, Yijin Guan, Tianchan Guan, Dimin Niu, Linyong Huang, Hongzhong Zheng, Yuan Xie:
OpSparse: A Highly Optimized Framework for Sparse General Matrix Multiplication on GPUs. IEEE Access 10: 85960-85974 (2022) - [j147]Xinfeng Xie, Peng Gu, Jiayi Huang, Yufei Ding, Yuan Xie:
MPU-Sim: A Simulator for In-DRAM Near-Bank Processing Architectures. IEEE Comput. Archit. Lett. 21(1): 1-4 (2022) - [j146]Mingyu Yan, Mo Zou, Xiaocheng Yang, Wenming Li, Xiaochun Ye, Dongrui Fan, Yuan Xie:
Characterizing and Understanding HGNNs on GPUs. IEEE Comput. Archit. Lett. 21(2): 69-72 (2022) - [j145]Linyong Huang, Zhe Zhang, Zhaoyang Du, Shuangchen Li, Hongzhong Zheng, Yuan Xie, Nianxiong Tan:
EPQuant: A Graph Neural Network compression approach based on product quantization. Neurocomputing 503: 49-61 (2022) - [j144]Jeong-Jun Lee, Wenrui Zhang, Yuan Xie, Peng Li:
SaARSP: An Architecture for Systolic-Array Acceleration of Recurrent Spiking Neural Networks. ACM J. Emerg. Technol. Comput. Syst. 18(4): 68:1-68:23 (2022) - [j143]Zhaodong Chen, Lei Deng, Bangyan Wang, Guoqi Li, Yuan Xie:
A Comprehensive and Modularized Statistical Framework for Gradient Norm Equality in Deep Neural Networks. IEEE Trans. Pattern Anal. Mach. Intell. 44(1): 13-31 (2022) - [j142]Xuanle Ren, Le Su, Zhen Gu, Sheng Wang, Feifei Li, Yuan Xie, Song Bian, Chao Li, Fan Zhang:
HEDA: Multi-Attribute Unbounded Aggregation over Homomorphically Encrypted Database. Proc. VLDB Endow. 16(4): 601-614 (2022) - [j141]Bangyan Wang, Lei Deng, Zheng Qu, Shuangchen Li, Zheng Zhang, Yuan Xie:
Efficient Processing of Sparse Tensor Decomposition via Unified Abstraction and PE-Interactive Architecture. IEEE Trans. Computers 71(2): 266-281 (2022) - [j140]Gongjian Sun, Mingyu Yan, Duo Wang, Han Li, Wenming Li, Xiaochun Ye, Dongrui Fan, Yuan Xie:
Multi-Node Acceleration for Large-Scale GCNs. IEEE Trans. Computers 71(12): 3140-3152 (2022) - [j139]Liu Liu, Zheng Qu, Zhaodong Chen, Fengbin Tu, Yufei Ding, Yuan Xie:
Dynamic Sparse Attention for Scalable Transformer Acceleration. IEEE Trans. Computers 71(12): 3165-3178 (2022) - [j138]Xing Hu, Ling Liang, Xiaobing Chen, Lei Deng, Yu Ji, Yufei Ding, Zidong Du, Qi Guo, Timothy Sherwood, Yuan Xie:
A Systematic View of Model Leakage Risks in Deep Neural Network Systems. IEEE Trans. Computers 71(12): 3254-3267 (2022) - [j137]Zheng Qu, Lei Deng, Bangyan Wang, Hengnu Chen, Jilan Lin, Ling Liang, Guoqi Li, Zheng Zhang, Yuan Xie:
Hardware-Enabled Efficient Data Processing With Tensor-Train Decomposition. IEEE Trans. Comput. Aided Des. Integr. Circuits Syst. 41(2): 372-385 (2022) - [j136]Xiaobing Chen, Yuke Wang, Xinfeng Xie, Xing Hu, Abanti Basak, Ling Liang, Mingyu Yan, Lei Deng, Yufei Ding, Zidong Du, Yuan Xie:
Rubik: A Hierarchical Architecture for Efficient Graph Neural Network Training. IEEE Trans. Comput. Aided Des. Integr. Circuits Syst. 41(4): 936-949 (2022) - [j135]Yuke Wang, Boyuan Feng, Gushu Li, Lei Deng, Yuan Xie, Yufei Ding:
STPAcc: Structural TI-Based Pruning for Accelerating Distance-Related Algorithms on CPU-FPGA Platforms. IEEE Trans. Comput. Aided Des. Integr. Circuits Syst. 41(5): 1358-1370 (2022) - [j134]Ling Liang, Zheng Qu, Zhaodong Chen, Fengbin Tu, Yujie Wu, Lei Deng, Guoqi Li, Peng Li, Yuan Xie:
H2Learn: High-Efficiency Learning Accelerator for High-Accuracy Spiking Neural Networks. IEEE Trans. Comput. Aided Des. Integr. Circuits Syst. 41(11): 4782-4796 (2022) - [c344]Nan Wu, Jiwon Lee, Yuan Xie, Cong Hao:
LOSTIN: Logic Optimization via Spatio-Temporal Information with Hybrid Graph Models. ASAP 2022: 11-18 - [c343]Zheng Qu, Liu Liu, Fengbin Tu, Zhaodong Chen, Yufei Ding, Yuan Xie:
DOTA: detect and omit weak attentions for scalable transformer acceleration. ASPLOS 2022: 14-26 - [c342]Gushu Li, Anbang Wu, Yunong Shi, Ali Javadi-Abhari, Yufei Ding, Yuan Xie:
Paulihedral: a generalized block-wise compiler optimization framework for Quantum simulation kernels. ASPLOS 2022: 554-569 - [c341]Bangyan Wang, Lei Deng, Fei Sun, Guohao Dai, Liu Liu, Yu Wang, Yuan Xie:
A one-for-all and O(V log(V))-cost solution for parallel merge style operations on sorted key-value arrays. ASPLOS 2022: 669-682 - [c340]Zejiang Hou, Minghai Qin, Fei Sun, Xiaolong Ma, Kun Yuan, Yi Xu, Yen-Kuang Chen, Rong Jin, Yuan Xie, Sun-Yuan Kung:
CHEX: CHannel EXploration for CNN Model Compression. CVPR 2022: 12277-12288 - [c339]Nan Wu, Hang Yang, Yuan Xie, Pan Li, Cong Hao:
High-level synthesis performance prediction using GNNs: benchmarking, modeling, and advancing. DAC 2022: 49-54 - [c338]Guohao Dai, Guyue Huang, Shang Yang, Zhongming Yu, Hengrui Zhang, Yufei Ding, Yuan Xie, Huazhong Yang, Yu Wang:
Heuristic adaptability to input dynamics for SpMM on GPUs. DAC 2022: 595-600 - [c337]Haiyang Lin, Mingyu Yan, Duo Wang, Mo Zou, Fengbin Tu, Xiaochun Ye, Dongrui Fan, Yuan Xie:
Alleviating datapath conflicts and design centralization in graph analytics acceleration. DAC 2022: 901-906 - [c336]Guyue Huang, Haoran Li, Minghai Qin, Fei Sun, Yufei Ding, Yuan Xie:
Shfl-BW: accelerating deep neural network inference with tensor-core aware weight pruning. DAC 2022: 1153-1158 - [c335]Ling Liang, Zhaodong Chen, Lei Deng, Fengbin Tu, Guoqi Li, Yuan Xie:
Accelerating Spatiotemporal Supervised Training of Large-Scale Spiking Neural Networks on GPU. DATE 2022: 658-663 - [c334]Sicheng Li, Chen Bai, Xuechao Wei, Bizhao Shi, Yen-Kuang Chen, Yuan Xie:
2022 ICCAD CAD Contest Problem C: Microarchitecture Design Space Exploration. ICCAD 2022: 95:1-95:7 - [c333]Nan Wu, Yuan Xie, Cong Hao:
AI-assisted Synthesis in Next Generation EDA: Promises, Challenges, and Prospects. ICCD 2022: 207-214 - [c332]Xiaolong Ma, Minghai Qin, Fei Sun, Zejiang Hou, Kun Yuan, Yi Xu, Yanzhi Wang, Yen-Kuang Chen, Rong Jin, Yuan Xie:
Effective Model Sparsification by Scheduled Grow-and-Prune Methods. ICLR 2022 - [c331]Zhaoyang Du, Yijin Guan, Tianchan Guan, Dimin Niu, Nianxiong Tan, Xiaopeng Yu, Hongzhong Zheng, Jianyi Meng, Xiaolang Yan, Yuan Xie:
Predicting the Output Structure of Sparse Matrix Multiplication with Sampled Compression Ratio. ICPADS 2022: 483-490 - [c330]Minghai Qin, Tianyun Zhang, Fei Sun, Yen-Kuang Chen, Makan Fardad, Yanzhi Wang, Yuan Xie:
Compact Multi-level Sparse Neural Networks with Input Independent Dynamic Rerouting. ICTAI 2022: 555-562 - [c329]Xin Liu, Mingyu Yan, Lei Deng, Guoqi Li, Xiaochun Ye, Dongrui Fan, Shirui Pan, Yuan Xie:
Survey on Graph Neural Network Acceleration: An Algorithmic Perspective. IJCAI 2022: 5521-5529 - [c328]Jilan Lin, Ling Liang, Zheng Qu, Ishtiyaque Ahmad, Liu Liu, Fengbin Tu, Trinabh Gupta, Yufei Ding, Yuan Xie:
INSPIRE: in-storage private information retrieval via protocol and architecture co-design. ISCA 2022: 102-115 - [c327]Guohao Dai, Zhenhua Zhu, Tianyu Fu, Chiyue Wei, Bangyan Wang, Xiangyu Li, Yuan Xie, Huazhong Yang, Yu Wang:
DIMMining: pruning-efficient and parallel graph mining on near-memory-computing. ISCA 2022: 130-145 - [c326]Anbang Wu, Gushu Li, Hezi Zhang, Gian Giacomo Guerreschi, Yufei Ding, Yuan Xie:
A synthesis framework for stitching surface code with superconducting quantum devices. ISCA 2022: 337-350 - [c325]Shuangchen Li, Dimin Niu, Yuhao Wang, Wei Han, Zhe Zhang, Tianchan Guan, Yijin Guan, Heng Liu, Linyong Huang, Zhaoyang Du, Fei Xue, Yuanwei Fang, Hongzhong Zheng, Yuan Xie:
Hyperscale FPGA-as-a-service architecture for large-scale distributed graph neural network. ISCA 2022: 946-961 - [c324]Dimin Niu, Shuangchen Li, Yuhao Wang, Wei Han, Zhe Zhang, Yijin Guan, Tianchan Guan, Fei Sun, Fei Xue, Lide Duan, Yuanwei Fang, Hongzhong Zheng, Xiping Jiang, Song Wang, Fengguo Zuo, Yubing Wang, Bing Yu, Qiwei Ren, Yuan Xie:
184QPS/W 64Mb/mm² 3D Logic-to-DRAM Hybrid Bonding with Process-Near-Memory Engine for Recommendation System. ISSCC 2022: 1-3 - [c323]Fengbin Tu, Yiqi Wang, Zihan Wu, Ling Liang, Yufei Ding, Bongjin Kim, Leibo Liu, Shaojun Wei, Yuan Xie, Shouyi Yin:
A 28nm 29.2TFLOPS/W BF16 and 36.5TOPS/W INT8 Reconfigurable Digital CIM Processor with Unified FP/INT Pipeline and Bitwise In-Memory Booth Multiplication for Cloud Deep Learning Acceleration. ISSCC 2022: 1-3 - [c322]Haozhe Zhu, Bo Jiao, Jinshan Zhang, Xinru Jia, Yunzhengmao Wang, Tianchan Guan, Shengcheng Wang, Dimin Niu, Hongzhong Zheng, Chixiao Chen, Mingyu Wang, Lihua Zhang, Xiaoyang Zeng, Qi Liu, Yuan Xie, Ming Liu:
COMB-MCM: Computing-on-Memory-Boundary NN Processor with Bipolar Bitwise Sparsity Optimization for Scalable Multi-Chiplet-Module Edge Machine Learning. ISSCC 2022: 1-3 - [c321]Fengbin Tu, Zihan Wu, Yiqi Wang, Ling Liang, Liu Liu, Yufei Ding, Leibo Liu, Shaojun Wei, Yuan Xie, Shouyi Yin:
A 28nm 15.59µJ/Token Full-Digital Bitline-Transpose CIM-Based Sparse Transformer Accelerator with Pipeline/Parallel Reconfigurable Modes. ISSCC 2022: 466-468 - [c320]Wenqin Huangfu, Krishna T. Malladi, Andrew Chang, Yuan Xie:
BEACON: Scalable Near-Data-Processing Accelerators for Genome Analysis near Memory Pool with the CXL Support. MICRO 2022: 727-743 - [c319]Anbang Wu, Hezi Zhang, Gushu Li, Alireza Shabani, Yuan Xie, Yufei Ding:
AutoComm: A Framework for Enabling Efficient Communication in Distributed Quantum Programs. MICRO 2022: 1027-1041 - [c318]Hengrui Zhang, Zhongming Yu, Guohao Dai, Guyue Huang, Yufei Ding, Yuan Xie, Yu Wang:
Understanding GNN Computational Graph: A Coordinated Computation, IO, and Memory Perspective. MLSys 2022 - [c317]Ling Liang, Kaidi Xu, Xing Hu, Lei Deng, Yuan Xie:
Toward Robust Spiking Neural Network Against Adversarial Perturbation. NeurIPS 2022 - [c316]Boyuan Feng, Tianqi Tang, Yuke Wang, Zhaodong Chen, Zheng Wang, Shu Yang, Yuan Xie, Yufei Ding:
Faith: An Efficient Framework for Transformer Verification on GPUs. USENIX ATC 2022: 167-182 - [i75]Nan Wu, Hang Yang, Yuan Xie, Pan Li, Cong Hao:
High-Level Synthesis Performance Prediction using GNNs: Benchmarking, Modeling, and Advancing. CoRR abs/2201.06848 (2022) - [i74]Nan Wu, Jiwon Lee, Yuan Xie, Cong Hao:
Hybrid Graph Models for Logic Optimization via Spatio-Temporal Information. CoRR abs/2201.08455 (2022) - [i73]Xin Liu, Mingyu Yan, Lei Deng, Guoqi Li, Xiaochun Ye, Dongrui Fan, Shirui Pan, Yuan Xie:
Survey on Graph Neural Network Acceleration: An Algorithmic Perspective. CoRR abs/2202.04822 (2022) - [i72]Guohao Dai, Guyue Huang, Shang Yang, Zhongming Yu, Hengrui Zhang, Yufei Ding, Yuan Xie, Huazhong Yang, Yu Wang:
Heuristic Adaptability to Input Dynamics for SpMM on GPUs. CoRR abs/2202.08556 (2022) - [i71]Haiyang Lin, Mingyu Yan, Duo Wang, Mo Zou, Fengbin Tu, Xiaochun Ye, Dongrui Fan, Yuan Xie:
Alleviating Datapath Conflicts and Design Centralization in Graph Analytics Acceleration. CoRR abs/2202.11343 (2022) - [i70]Zhaodong Chen, Yuying Quan, Zheng Qu, Liu Liu, Yufei Ding, Yuan Xie:
Dynamic N:M Fine-grained Structured Sparse Attention Mechanism. CoRR abs/2203.00091 (2022) - [i69]Guyue Huang, Haoran Li, Minghai Qin, Fei Sun, Yufei Ding, Yuan Xie:
Shfl-BW: Accelerating Deep Neural Network Inference with Tensor-Core Aware Weight Pruning. CoRR abs/2203.05016 (2022) - [i68]Zejiang Hou, Minghai Qin, Fei Sun, Xiaolong Ma, Kun Yuan, Yi Xu, Yen-Kuang Chen, Rong Jin, Yuan Xie, Sun-Yuan Kung:
CHEX: CHannel EXploration for CNN Model Compression. CoRR abs/2203.15794 (2022) - [i67]Ling Liang, Kaidi Xu, Xing Hu, Lei Deng, Yuan Xie:
Toward Robust Spiking Neural Network Against Adversarial Perturbation. CoRR abs/2205.01625 (2022) - [i66]Zihao Zhao, Yanhong Wang, Qiaosha Zou, Tie Xu, Fangbo Tao, Jiansong Zhang, Xiaoan Wang, Chuanjin Richard Shi, Junwen Luo, Yuan Xie:
The Spike Gating Flow: A Hierarchical Structure Based Spiking Neural Network for Online Gesture Recognition. CoRR abs/2206.01910 (2022) - [i65]Zhaoyang Du, Yijin Guan, Tianchan Guan, Dimin Niu, Hongzhong Zheng, Yuan Xie:
Accelerating CPU-based Sparse General Matrix Multiplication with Binary Row Merging. CoRR abs/2206.06611 (2022) - [i64]Zhaoyang Du, Yijin Guan, Tianchan Guan, Dimin Niu, Linyong Huang, Hongzhong Zheng, Yuan Xie:
OpSparse: a Highly Optimized Framework for Sparse General Matrix Multiplication on GPUs. CoRR abs/2206.07244 (2022) - [i63]Tianqi Tang, Yuan Xie:
Cost-Aware Exploration for Chiplet-Based Architecture with Advanced Packaging Technologies. CoRR abs/2206.07308 (2022) - [i62]Gongjian Sun, Mingyu Yan, Duo Wang, Han Li, Wenming Li, Xiaochun Ye, Dongrui Fan, Yuan Xie:
Multi-node Acceleration for Large-scale GCNs. CoRR abs/2207.07258 (2022) - [i61]Zhaoyang Du, Yijin Guan, Tianchan Guan, Dimin Niu, Nianxiong Tan, Xiaopeng Yu, Hongzhong Zheng, Jianyi Meng, Xiaolang Yan, Yuan Xie:
Predicting the Output Structure of Sparse Matrix Multiplication with Sampled Compression Ratio. CoRR abs/2207.13848 (2022) - [i60]Mingyu Yan, Mo Zou, Xiaocheng Yang, Wenming Li, Xiaochun Ye, Dongrui Fan, Yuan Xie:
Characterizing and Understanding HGNNs on GPUs. CoRR abs/2208.04758 (2022) - [i59]Zejiang Hou, Fei Sun, Yen-Kuang Chen, Yuan Xie, Sun-Yuan Kung:
MILAN: Masked Image Pretraining on Language Assisted Representation. CoRR abs/2208.06049 (2022) - [i58]Boyuan Feng, Tianqi Tang, Yuke Wang, Zhaodong Chen, Zheng Wang, Shu Yang, Yuan Xie, Yufei Ding:
Faith: An Efficient Framework for Transformer Verification on GPUs. CoRR abs/2209.12708 (2022) - [i57]Guyue Huang, Yang Bai, Liu Liu, Yuke Wang, Bei Yu, Yufei Ding, Yuan Xie:
Enabling Data Movement and Computation Pipelining in Deep Learning Compiler. CoRR abs/2210.16691 (2022) - [i56]Haiyang Lin, Mingyu Yan, Xiaochun Ye, Dongrui Fan, Shirui Pan, Wenguang Chen, Yuan Xie:
A Comp