![](https://dblp.dagstuhl.de/img/logo.320x120.png)
![search dblp search dblp](https://dblp.dagstuhl.de/img/search.dark.16x16.png)
![search dblp](https://dblp.dagstuhl.de/img/search.dark.16x16.png)
default search action
Zhongzhi Luan
Person information
Refine list
![note](https://dblp.dagstuhl.de/img/note-mark.dark.12x12.png)
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2024
- [j40]Mingzhen Li, Yi Liu, Bangduo Chen, Hailong Yang, Zhongzhi Luan, Depei Qian:
Building a domain-specific compiler for emerging processors with a reusable approach. Sci. China Inf. Sci. 67(1) (2024) - [j39]Mingzhen Li, Changxi Liu
, Jianjin Liao, Xuegui Zheng, Hailong Yang, Rujun Sun, Jun Xu, Lin Gan, Guangwen Yang, Zhongzhi Luan, Depei Qian:
Towards optimized tensor code generation for deep learning on sunway many-core processor. Frontiers Comput. Sci. 18(2): 182101 (2024) - [j38]Yizhen Li, Zhongzhi Luan, Yixing Liu, Heyuan Liu, Jiaxing Qi, Dongran Han:
Automated information extraction model enhancing traditional Chinese medicine RCT evidence extraction (Evi-BERT): algorithm development and validation. Frontiers Artif. Intell. 7 (2024) - [j37]Jiaxing Qi
, Zhongzhi Luan
, Shaohan Huang
, Carol J. Fung
, Hailong Yang
:
LogSay: An Efficient Comprehension System for Log Numerical Reasoning. IEEE Trans. Computers 73(7): 1809-1821 (2024) - [j36]Jiaxing Qi
, Zhongzhi Luan
, Shaohan Huang
, Carol J. Fung
, Hailong Yang
, Depei Qian
:
SpikeLog: Log-Based Anomaly Detection via Potential-Assisted Spiking Neuron Network. IEEE Trans. Knowl. Data Eng. 36(12): 9322-9335 (2024) - [j35]Qingxiao Sun, Yi Liu, Hailong Yang, Zhonghui Jiang, Zhongzhi Luan, Depei Qian:
Adaptive Auto-Tuning Framework for Global Exploration of Stencil Optimization on GPUs. IEEE Trans. Parallel Distributed Syst. 35(1): 20-33 (2024) - [j34]Siqi Wang
, Tianyu Feng
, Hailong Yang
, Xin You
, Bangduo Chen
, Tongxuan Liu
, Zhongzhi Luan
, Depei Qian
:
AtRec: Accelerating Recommendation Model Training on CPUs. IEEE Trans. Parallel Distributed Syst. 35(6): 750-763 (2024) - [j33]Jiaxing Qi
, Wencong Xiao, Mingzhen Li
, Chaojie Yang, Yong Li, Wei Lin, Hailong Yang
, Zhongzhi Luan
, Depei Qian
:
ElasticBatch: A Learning-Augmented Elastic Scheduling System for Batch Inference on MIG. IEEE Trans. Parallel Distributed Syst. 35(10): 1708-1720 (2024) - [c133]Ting Jiang, Shaohan Huang, Zhongzhi Luan, Deqing Wang, Fuzhen Zhuang:
Scaling Sentence Embeddings with Large Language Models. EMNLP (Findings) 2024: 3182-3196 - [c132]Shaohan Huang
, Zhongzhi Luan
:
Semantic-Aware Log Understanding and Analysis. HPDC 2024: 413-416 - [c131]Siyu Wu
, Hailong Yang
, Xin You
, Ruihao Gong
, Yi Liu
, Zhongzhi Luan
, Depei Qian
:
PRoof: A Comprehensive Hierarchical Profiling Framework for Deep Neural Networks with Roofline Analysis. ICPP 2024: 822-832 - [c130]Kaige Zhang
, Xiaoyan Liu
, Hailong Yang
, Tianyu Feng
, Xinyu Yang
, Yi Liu
, Zhongzhi Luan
, Depei Qian
:
Jigsaw: Accelerating SpMM with Vector Sparsity on Sparse Tensor Core. ICPP 2024: 1124-1134 - [c129]Xiaoyan Liu
, Xuegui Zheng
, Hailong Yang
, Zhongzhi Luan
, Depei Qian
:
Tetris: Accelerating Sparse Convolution by Exploiting Memory Reuse on GPU. PPoPP 2024: 229-242 - [c128]Xiaoyan Liu, Xinyu Yang, Kejie Ma, Shanghao Liu, Kaige Zhang, Hailong Yang, Yi Liu, Zhongzhi Luan, Depei Qian:
Moirae: Generating High-Performance Composite Stencil Programs with Global Optimizations. SC 2024: 20 - [c127]Xin You, Zhibo Xuan, Hailong Yang, Zhongzhi Luan, Yi Liu, Depei Qian:
GVARP: Detecting Performance Variance on Large-Scale Heterogeneous Systems. SC 2024: 57 - [c126]Shaohan Huang, Yi Liu, Jiaxing Qi, Jing Shang, Zhiwen Xiao, Carol J. Fung, Zhihui Wu, Hailong Yang, Zhongzhi Luan, Depei Qian:
Gloss: Guiding Large Language Models to Answer Questions from System Logs. SANER 2024: 91-101 - [i18]Siqi Wang, Hailong Yang, Xuezhu Wang, Tongxuan Liu, Pengbo Wang, Xuning Liang, Kejie Ma, Tianyu Feng, Xin You, Yongjun Bao, Yi Liu, Zhongzhi Luan, Depei Qian:
Minions: Accelerating Large Language Model Inference with Adaptive and Collective Speculative Decoding. CoRR abs/2402.15678 (2024) - [i17]Yizhen Li, Shaohan Huang, Jiaxing Qi, Lei Quan, Dongran Han, Zhongzhi Luan:
Exploring the Comprehension of ChatGPT in Traditional Chinese Medicine Knowledge. CoRR abs/2403.09164 (2024) - [i16]Yiqing Wang, Xiaoyan Liu, Hailong Yang, Xinyu Yang, Pengbo Wang, Yi Liu, Zhongzhi Luan, Depei Qian:
INSPIRIT: Optimizing Heterogeneous Task Scheduling through Adaptive Priority in Task-based Runtime Systems. CoRR abs/2404.03226 (2024) - [i15]Jiaxing Qi, Zhongzhi Luan, Shaohan Huang, Carol J. Fung, Hailong Yang, Depei Qian:
FDLoRA: Personalized Federated Learning of Large Language Model via Dual LoRA Tuning. CoRR abs/2406.07925 (2024) - [i14]Daixuan Cheng, Shaohan Huang, Ziyu Zhu, Xintong Zhang, Wayne Xin Zhao, Zhongzhi Luan, Bo Dai, Zhenliang Zhang:
On Domain-Specific Post-Training for Multimodal Large Language Models. CoRR abs/2411.19930 (2024) - 2023
- [j32]Biao Sun, Mingzhen Li, Hailong Yang
, Jun Xu, Zhongzhi Luan, Depei Qian:
Adapting combined tiling to stencil optimizations on sunway processor. CCF Trans. High Perform. Comput. 5(3): 322-333 (2023) - [j31]Hailong Yang
, Yi Liu
, Zhongzhi Luan
, Lin Gan
, Guangwen Yang, Depei Qian
:
Input-Aware Sparse Tensor Storage Format Selection for Optimizing MTTKRP. Computer 56(8): 4-7 (2023) - [j30]Xiaoyan Liu, Yi Liu, Bohong Yin, Hailong Yang, Zhongzhi Luan, Depei Qian:
swSpAMM: optimizing large-scale sparse approximate matrix multiplication on Sunway Taihulight. Frontiers Comput. Sci. 17(4): 174104 (2023) - [j29]Shaohan Huang
, Yi Liu
, Carol J. Fung
, He Wang, Hailong Yang
, Zhongzhi Luan
:
Improving Log-Based Anomaly Detection by Pre-Training Hierarchical Transformers. IEEE Trans. Computers 72(9): 2656-2667 (2023) - [j28]Pengyu Mu
, Yi Liu
, Rui Wang
, Guoxiang Liu
, Zhonghao Sun, Hailong Yang
, Zhongzhi Luan, Depei Qian
:
HAOTuner: A Hardware Adaptive Operator Auto-Tuner for Dynamic Shape Tensor Compilers. IEEE Trans. Computers 72(11): 3178-3190 (2023) - [j27]Jiaxing Qi
, Zhongzhi Luan
, Shaohan Huang
, Carol J. Fung
, Hailong Yang
, Hanlu Li, Danfeng Zhu, Depei Qian:
LogEncoder: Log-Based Contrastive Representation Learning for Anomaly Detection. IEEE Trans. Netw. Serv. Manag. 20(2): 1378-1391 (2023) - [c125]Xin You, Hailong Yang, Kelun Lei
, Zhongzhi Luan, Depei Qian:
VClinic: A Portable and Efficient Framework for Fine-Grained Value Profilers. ASPLOS (2) 2023: 892-904 - [c124]Hongrui Liu, Kelun Lei, Hailong Yang, Zhongzhi Luan, Depei Qian:
Towards Optimized Hydrological Forecast Prediction of WRF-Hydro on GPU. HPCC/DSS/SmartCity/DependSys 2023: 138-145 - [c123]Jiaxing Qi, Shaohan Huang, Zhongzhi Luan, Shu Yang, Carol J. Fung, Hailong Yang, Depei Qian, Jing Shang, Zhiwen Xiao, Zhihui Wu:
LogGPT: Exploring ChatGPT for Log-Based Anomaly Detection. HPCC/DSS/SmartCity/DependSys 2023: 273-280 - [c122]Junlin Chen, Chaojing Liu, Zhongzhi Luan, Ming Gong, Qingfeng Li, Depei Qian:
Large-Scale Parallelization and Optimization of Lattice QCD on Tianhe New Generation Supercomputer. HPCC/DSS/SmartCity/DependSys 2023: 499-506 - [c121]Zhibo Xuan, Hailong Yang, Pengbo Wang, Xin Sun, Jiwei Hao, Shenglin Duan, Yongfeng Shi, Zhongzhi Luan, Depei Qian:
gGMED: Towards GPU Accelerated Geometric Modeling Evaluation and Derivative Processes. ICA3PP (3) 2023: 378-397 - [c120]Kelun Lei, Shaokang Du, Xin You, Zhibo Xuan, Haoran Kong, Hailong Yang, Jing Shang, Zhiwen Xiao, Zhihui Wu, Zhongzhi Luan, Depei Qian:
Accelerating Big Data Application by Eliminating Redundancy on Hadoop Cluster. ICPADS 2023: 751-756 - [c119]Shaokang Du, Xin You, Hailong Yang, Jing Shang, Zhiwen Xiao, Zhihui Wu, Zhongzhi Luan, Depei Qian:
Efficient Deep Molecular Dynamic Model Training on Heterogeneous System. ICPADS 2023: 1869-1876 - [c118]Mingzhen Li
, Hailong Yang
, Shanjun Zhang
, Fengwei Yu
, Ruihao Gong
, Yi Liu
, Zhongzhi Luan
, Depei Qian
:
Exploiting Subgraph Similarities for Efficient Auto-tuning of Tensor Programs. ICPP 2023: 786-796 - [c117]Kelun Lei
, Xin You
, Hailong Yang
, Zhongzhi Luan
, Depei Qian
:
BiRFIA: Selective Binary Rewriting for Function Interception on ARM. ICS 2023: 87-98 - [c116]Jianjin Liao, Mingzhen Li, Hailong Yang, Qingxiao Sun, Biao Sun, Jiwei Hao, Tianyu Feng
, Fengwei Yu, Shengdong Chen, Ye Tao, Zicheng Zhang, Zhongzhi Luan, Depei Qian:
Exploiting Input Tensor Dynamics in Activation Checkpointing for Efficient Training on GPU. IPDPS 2023: 156-166 - [c115]Mingzhen Li
, Wencong Xiao
, Hailong Yang
, Biao Sun
, Hanyu Zhao
, Shiru Ren
, Zhongzhi Luan
, Xianyan Jia
, Yi Liu
, Yong Li
, Wei Lin
, Depei Qian
:
EasyScale: Elastic Training with Consistent Accuracy and Improved Utilization on GPUs. SC 2023: 55:1-55:14 - [c114]Xin You
, Hailong Yang
, Kelun Lei
, Zhongzhi Luan
, Depei Qian
:
TrivialSpy: Identifying Software Triviality via Fine-grained and Dataflow-based Value Profiling. SC 2023: 90:1-90:13 - [i13]Shaohan Huang, Yi Liu, Carol J. Fung, Jiaxing Qi, Hailong Yang, Zhongzhi Luan:
LogQA: Question Answering in Unstructured Logs. CoRR abs/2303.11715 (2023) - [i12]Ting Jiang, Shaohan Huang, Zhongzhi Luan, Deqing Wang, Fuzhen Zhuang:
Scaling Sentence Embeddings with Large Language Models. CoRR abs/2307.16645 (2023) - [i11]Jiaxing Qi, Shaohan Huang, Zhongzhi Luan, Carol J. Fung, Hailong Yang, Depei Qian:
LogGPT: Exploring ChatGPT for Log-Based Anomaly Detection. CoRR abs/2309.01189 (2023) - 2022
- [j26]Xin You, Hailong Yang, Zhongzhi Luan, Depei Qian:
Accelerating the cryo-EM structure determination in RELION on GPU cluster. Frontiers Comput. Sci. 16(3): 163102 (2022) - [j25]Qingxiao Sun, Liu Yi, Hailong Yang
, Mingzhen Li, Zhongzhi Luan
, Depei Qian:
QoS-aware dynamic resource allocation with improved utilization and energy efficiency on GPU. Parallel Comput. 113: 102958 (2022) - [j24]Qingxiao Sun
, Yi Liu
, Hailong Yang
, Ming Dun, Zhongzhi Luan
, Lin Gan
, Guangwen Yang, Depei Qian
:
Input-Aware Sparse Tensor Storage Format Selection for Optimizing MTTKRP. IEEE Trans. Computers 71(8): 1968-1981 (2022) - [j23]Xiaoyan Liu, Yi Liu, Hailong Yang
, Ming Dun, Bohong Yin, Zhongzhi Luan, Depei Qian:
Accelerating approximate matrix multiplication for near-sparse matrices on GPUs. J. Supercomput. 78(9): 11464-11491 (2022) - [j22]Shaozhi Dai
, Zhongzhi Luan
, Shaohan Huang, Carol J. Fung, He Wang, Hailong Yang
, Depei Qian
:
REVAL: Recommend Which Variables to Log With Pretrained Model and Graph Neural Network. IEEE Trans. Netw. Serv. Manag. 19(4): 4045-4057 (2022) - [c113]Shaohan Huang, Yi Liu, Carol J. Fung, Hailong Yang, Zhongzhi Luan:
Black-box Attacks to Log-based Anomaly Detection. CNSM 2022: 310-316 - [c112]Jiwei Hao, Hailong Yang, Qingxiao Sun, Huaitao Zhang, Zhongzhi Luan, Depei Qian:
Towards Optimized Streaming Tensor Completion on multiple GPUs. HPCC/DSS/SmartCity/DependSys 2022: 1123-1128 - [c111]Xin You, Changxi Liu
, Hailong Yang, Pengbo Wang, Zhongzhi Luan, Depei Qian:
Vectorizing SpMV by Exploiting Dynamic Regular Patterns. ICPP 2022: 53:1-53:12 - [c110]Xiaoyan Liu, Yi Liu, Hailong Yang, Jianjin Liao, Mingzhen Li, Zhongzhi Luan, Depei Qian:
Toward accelerated stencil computation by adapting tensor core unit on GPU. ICS 2022: 28:1-28:12 - [c109]Qingxiao Sun, Yi Liu, Hailong Yang, Zhonghui Jiang, Zhongzhi Luan, Depei Qian:
StencilMART: Predicting Optimization Selection for Stencil Computations across GPUs. IPDPS 2022: 875-885 - [c108]Xin You, Hailong Yang, Zhibo Xuan, Zhongzhi Luan, Depei Qian:
PowerSpector: Towards Energy Efficiency with Calling-Context-Aware Profiling. IPDPS 2022: 1272-1282 - [c107]Jiaxing Qi, Zhongzhi Luan, Shaohan Huang, Yukun Wang, Carol J. Fung, Hailong Yang, Depei Qian:
Adanomaly: Adaptive Anomaly Detection for System Logs with Adversarial Learning. NOMS 2022: 1-5 - [c106]Qingxiao Sun, Yi Liu, Hailong Yang, Ruizhe Zhang, Ming Dun, Mingzhen Li, Xiaoyan Liu, Wencong Xiao, Yong Li, Zhongzhi Luan, Depei Qian:
CoGNN: Efficient Scheduling for Concurrent GNN Training on GPUs. SC 2022: 39:1-39:15 - [i10]Shanjun Zhang, Mingzhen Li, Hailong Yang, Yi Liu, Zhongzhi Luan, Depei Qian:
FamilySeer: Towards Optimized Tensor Codes by Exploiting Computation Subgraph Similarity. CoRR abs/2201.00194 (2022) - [i9]Mingzhen Li, Wencong Xiao, Biao Sun, Hanyu Zhao, Hailong Yang, Shiru Ren, Zhongzhi Luan, Xianyan Jia, Yi Liu, Yong Li, Depei Qian, Wei Lin:
EasyScale: Accuracy-consistent Elastic Training for Deep Learning. CoRR abs/2208.14228 (2022) - [i8]Jianjin Liao, Mingzhen Li, Qingxiao Sun, Jiwei Hao, Fengwei Yu, Shengdong Chen, Ye Tao, Zicheng Zhang, Hailong Yang, Zhongzhi Luan, Depei Qian:
Mimose: An Input-Aware Checkpointing Planner for Efficient Training on GPU. CoRR abs/2209.02478 (2022) - 2021
- [j21]Ming Dun, Yunchun Li, Qingxiao Sun, Hailong Yang, Wei Li, Zhongzhi Luan, Lin Gan, Guangwen Yang, Depei Qian:
Towards efficient canonical polyadic decomposition on sunway many-core processor. Inf. Sci. 549: 221-248 (2021) - [j20]Qingchang Han, Hailong Yang
, Ming Dun, Zhongzhi Luan, Lin Gan, Guangwen Yang, Depei Qian:
Towards efficient tile low-rank GEMM computation on sunway many-core processors. J. Supercomput. 77(5): 4533-4564 (2021) - [j19]Mingzhen Li
, Yi Liu
, Xiaoyan Liu, Qingxiao Sun, Xin You, Hailong Yang
, Zhongzhi Luan
, Lin Gan, Guangwen Yang, Depei Qian
:
The Deep Learning Compiler: A Comprehensive Survey. IEEE Trans. Parallel Distributed Syst. 32(3): 708-727 (2021) - [c105]Ruiyuan Gao, Hailong Yang, Shaohan Huang, Ming Dun, Mingzhen Li, Zerong Luan, Zhongzhi Luan, Depei Qian:
PriPro: Towards Effective Privacy Protection on Edge-Cloud System running DNN Inference. CCGRID 2021: 334-343 - [c104]Qingxiao Sun, Yi Liu, Hailong Yang, Zhonghui Jiang, Xiaoyan Liu, Ming Dun, Zhongzhi Luan, Depei Qian:
csTuner: Scalable Auto-tuning Framework for Complex Stencil Computation on GPUs. CLUSTER 2021: 192-203 - [c103]Xin You, Hailong Yang, Zhonghui Jiang, Zhongzhi Luan, Depei Qian:
DRStencil: Exploiting Data Reuse within Low-order Stencil on GPU. HPCC/DSS/SmartCity/DependSys 2021: 63-70 - [c102]Mingzhen Li
, Yi Liu, Hailong Yang, Yongmin Hu, Qingxiao Sun, Bangduo Chen, Xin You, Xiaoyan Liu, Zhongzhi Luan, Depei Qian:
Automatic Code Generation and Optimization of Large-scale Stencil Computation on Many-core Processors. ICPP 2021: 34:1-34:12 - [c101]Ming Dun, Yunchun Li, Hailong Yang, Qingxiao Sun, Zhongzhi Luan, Depei Qian:
An optimized tensor completion library for multiple GPUs. ICS 2021: 417-430 - [c100]Tianyu Feng
, Siyan Chen, Xin You, Shuzhang Zhong, Hailong Yang, Zhongzhi Luan, Depei Qian:
dgQuEST: Accelerating Large Scale Quantum Circuit Simulation through Hybrid CPU-GPU Memory Hierarchies. NPC 2021: 16-27 - [i7]Xiaoyan Liu, Yi Liu, Ming Dun, Bohong Yin, Hailong Yang, Zhongzhi Luan, Depei Qian:
Accelerating Sparse Approximate Matrix Multiplication on GPUs. CoRR abs/2103.13042 (2021) - 2020
- [j18]Shaohan Huang
, Yi Liu
, Carol J. Fung, Rong He, Yining Zhao, Hailong Yang
, Zhongzhi Luan
:
HitAnomaly: Hierarchical Transformers for Anomaly Detection in System Log. IEEE Trans. Netw. Serv. Manag. 17(4): 2064-2076 (2020) - [j17]Lan Gao
, Yunlong Xu
, Rui Wang
, Zhongzhi Luan
, Zhibin Yu
, Depei Qian
:
Thread-Level Locking for SIMT Architectures. IEEE Trans. Parallel Distributed Syst. 31(5): 1121-1136 (2020) - [j16]Yongmin Hu, Hailong Yang
, Zhongzhi Luan
, Lin Gan, Guangwen Yang, Depei Qian
:
Massively Scaling Seismic Processing on Sunway TaihuLight Supercomputer. IEEE Trans. Parallel Distributed Syst. 31(5): 1194-1208 (2020) - [j15]Mingzhen Li
, Yi Liu
, Hailong Yang
, Zhongzhi Luan
, Lin Gan, Guangwen Yang, Depei Qian
:
Accelerating Sparse Cholesky Factorization on Sunway Manycore Architecture. IEEE Trans. Parallel Distributed Syst. 31(7): 1636-1650 (2020) - [c99]Bangduo Chen, Mingzhen Li, Hailong Yang, Zhongzhi Luan, Lin Gan, Guangwen Yang, Depei Qian:
swRodinia: A Benchmark Suite for Exploiting Architecture Properties of Sunway Processor. Bench 2020: 22-38 - [c98]Shaohan Huang, Yi Liu, Carol J. Fung, Rong He, Yining Zhao, Hailong Yang, Zhongzhi Luan:
Transfer Log-based Anomaly Detection with Pseudo Labels. CNSM 2020: 1-5 - [c97]Yi Wei, Xin You, Hailong Yang, Zhongzhi Luan, Depei Qian:
Towards GPU Acceleration of Phonon Computation with ShengBTE. HPC Asia 2020: 32-42 - [c96]Shaohan Huang, Yi Liu, Carol J. Fung, Wanhe An, Rong He, Yining Zhao, Hailong Yang, Zhongzhi Luan:
A Gated Few-shot Learning Model For Anomaly Detection. ICOIN 2020: 505-509 - [c95]Qingchang Han, Yongmin Hu, Fengwei Yu, Hailong Yang, Bing Liu, Peng Hu, Ruihao Gong
, Yanfei Wang, Rui Wang, Zhongzhi Luan, Depei Qian:
Extremely Low-bit Convolution Optimization for Quantized Neural Network on Modern Computer Architectures. ICPP 2020: 38:1-38:12 - [c94]Shaohan Huang, Yi Liu, Carol J. Fung, Rong He, Yining Zhao, Hailong Yang, Zhongzhi Luan:
Paddy: An Event Log Parsing Approach using Dynamic Dictionary. NOMS 2020: 1-8 - [c93]Qingxiao Sun, Yi Liu, Ming Dun, Hailong Yang, Zhongzhi Luan, Lin Gan, Guangwen Yang, Depei Qian:
SpTFS: sparse tensor format selection for MTTKRP via deep learning. SC 2020: 18 - [c92]Xin You, Hailong Yang, Zhongzhi Luan, Depei Qian, Xu Liu:
ZeroSpy: exploring software inefficiency with redundant zeros. SC 2020: 29 - [c91]Bohong Yin, Yunchun Li, Ming Dun, Xin You, Hailong Yang, Zhongzhi Luan, Depei Qian:
swGBDT: Efficient Gradient Boosted Decision Tree on Sunway Many-Core Processor. SCFA 2020: 67-86 - [i6]Ruiyuan Gao, Ming Dun, Hailong Yang, Zhongzhi Luan, Depei Qian:
Privacy for Rescue: A New Testimony Why Privacy is Vulnerable In Deep Models. CoRR abs/2001.00493 (2020) - [i5]Mingzhen Li, Yi Liu, Xiaoyan Liu, Qingxiao Sun, Xin You, Hailong Yang, Zhongzhi Luan, Depei Qian:
The Deep Learning Compiler: A Comprehensive Survey. CoRR abs/2002.03794 (2020)
2010 – 2019
- 2019
- [j14]Guang Wei
, Depei Qian
, Hailong Yang
, Zhongzhi Luan
, Lin Wang
:
FPowerTool: A Function-Level Power Profiling Tool. IEEE Access 7: 185710-185719 (2019) - [j13]Xiaogang Zhong, Hailong Yang
, Zhongzhi Luan, Lin Gan, Guangwen Yang, Depei Qian:
swTensor: accelerating tensor decomposition on Sunway architecture. CCF Trans. High Perform. Comput. 1(3-4): 161-176 (2019) - [j12]Depei Qian, Zhongzhi Luan:
High Performance Computing Development in China: A Brief Review and Perspectives. Comput. Sci. Eng. 21(1): 6-16 (2019) - [j11]Lin Wang, Depei Qian, Rui Wang, Zhongzhi Luan, Hailong Yang, Huaxiang Zhang:
A novel index system describing program runtime characteristics for workload consolidation. Frontiers Comput. Sci. 13(3): 489-499 (2019) - [j10]Lan Gao
, Yunlong Xu, Rui Wang, Hailong Yang, Zhongzhi Luan, Depei Qian:
Accelerating in-memory transaction processing using general purpose graphics processing units. Future Gener. Comput. Syst. 97: 836-848 (2019) - [c90]Shaohan Huang, Yu Wu, Furu Wei, Zhongzhi Luan:
Dictionary-Guided Editing Networks for Paraphrase Generation. AAAI 2019: 6546-6553 - [c89]Jiaming Zhou, Yuqiao Tian, Weicheng Li, Rui Wang, Zhongzhi Luan, Depei Qian:
LADet: A Light-weight and Adaptive Network for Multi-scale Object Detection. ACML 2019: 912-923 - [c88]Qingchang Han, Hailong Yang, Zhongzhi Luan, Depei Qian:
Accelerating tile low-rank GEMM on sunway architecture: POSTER. CF 2019: 295-297 - [c87]Qingxiao Sun, Yi Liu, Hailong Yang, Zhongzhi Luan, Depei Qian:
SMQoS: Improving Utilization and Energy Efficiency with QoS Awareness on GPUs. CLUSTER 2019: 1-5 - [c86]Xin You, Hailong Yang, Zhongzhi Luan, Depei Qian:
L-DAG: Enabling Loopy Workflow in Scientific Application with Automatic DAG Transformation. DASC/PiCom/DataCom/CyberSciTech 2019: 946-953 - [c85]Lu Xu, Zhongzhi Luan, Carol J. Fung, Da Ye, Depei Qian:
Anomaly Detection Models Based on Context-Aware Sequential Long Short-Term Memory Learning. GLOBECOM 2019: 1-6 - [c84]Xianya Fu, Rui Wang, Peixuan Zuo, Jiaming Zhou, Jia Zhai, Xiaodan Xie, Zhongzhi Luan, Depei Qian:
FLONet: Fewer Labeling Cost Active Learning for Deep Neural Network. HPCC/SmartCity/DSS 2019: 289-296 - [c83]Ming Dun, Yunchun Li, Hailong Yang, Wei Li, Zhongzhi Luan, Depei Qian:
swCPD: Optimizing Canonical Polyadic Decomposition on Sunway Manycore Architecture. HPCC/SmartCity/DSS 2019: 1320-1327 - [c82]Lan Gao, Yunlong Xu, Chongyang Xu, Rui Wang, Hailong Yang, Zhongzhi Luan, Depei Qian:
Towards a General and Efficient Linked-List Hash Table on GPUs. HPCC/SmartCity/DSS 2019: 1452-1460 - [c81]Zehui Jin, Ming Dun, Xin You, Hailong Yang, Yunchun Li, Yingchun Lin, Zhongzhi Luan, Depei Qian:
Improving the Parallelism of CESM on GPU. ICA3PP (2) 2019: 11-18 - [c80]Chongyang Xu, Zhongzhi Luan, Lan Gao, Rui Wang, Han Zhang, Lianyi Zhang, Yi Liu, Depei Qian:
Multiple Algorithms Against Multiple Hardware Architectures: Data-Driven Exploration on Deep Convolution Neural Network. NPC 2019: 371-375 - [c79]Guang Wei, Depei Qian, Hailong Yang, Zhongzhi Luan:
Modeling Power Consumption of The Code Execution Using Performance Counters Statistics. PDCAT 2019: 381-385 - [c78]Xin You, Hailong Yang, Zhongzhi Luan, Yi Liu, Depei Qian:
Performance Evaluation and Analysis of Linear Algebra Kernels in the Prototype Tianhe-3 Cluster. SCFA 2019: 86-105 - [i4]Changxi Liu, Hailong Yang, Rujun Sun, Zhongzhi Luan, Lin Gan, Guangwen Yang, Depei Qian:
swTVM: Exploring the Automated Compilation for Deep Learning on Sunway Architecture. CoRR abs/1904.07404 (2019) - [i3]Weicheng Li, Rui Wang, Zhongzhi Luan, Di Huang, Zidong Du, Yunji Chen, Depei Qian:
CompactNet: Platform-Aware Automatic Optimization for Convolutional Neural Networks. CoRR abs/1905.11669 (2019) - [i2]Yongmin Hu, Hailong Yang, Zhongzhi Luan, Depei Qian:
Massively Scaling Seismic Processing on Sunway TaihuLight Supercomputer. CoRR abs/1907.11678 (2019) - [i1]Changxi Liu, Hailong Yang, Xu Liu, Zhongzhi Luan, Depei Qian:
Intelligent-Unrolling: Exploiting Regular Patterns in Irregular Applications. CoRR abs/1910.13346 (2019) - 2018
- [j9]Changxi Liu
, Hailong Yang, Rui Wang, Zhongzhi Luan, Depei Qian:
T1000: Mitigating the memory footprint of convolution neural networks with decomposition and re-fusion. Future Gener. Comput. Syst. 84: 1-10 (2018) - [j8]