


default search action
CCF Transactions on High Performance Computing, Volume 5
Volume 5, Number 1, March 2023
- Jiachang Sun, Huiyuan Li, Wenjing Ma:

Editorial for the special issue on new algorithms and software for E-scale high performance computing. 1-2 - Chaofeng Hou

, Aiqi Zhu, Shuai Zhang, Mingcan Zhao, Yanhao Ye, Ji Xu, Wei Ge:
Atomistic simulation of low-dimensional nanostructures toward extreme-scale supercomputing. 3-11 - Lian Duan, Chuanfu Xiao, Min Li, Mingshuo Ding, Chao Yang

:
a-Tucker: fast input-adaptive and matricization-free Tucker decomposition of higher-order tensors on GPUs. 12-25 - Xinming Qin

, Junshi Chen, Zhaolong Luo, Lingyun Wan, Jielan Li, Shizhe Jiao, Zhenlin Zhang, Qingcai Jiang, Wei Hu
, Hong An, Jinlong Yang:
High performance computing for first-principles Kohn-Sham density functional theory towards exascale supercomputers. 26-42 - Kan Liu, Xinliang Wang, Wei Xue

:
Model guided algorithm optimization for tridiagonal solver on many-core architectures. 43-55 - Fangfang Liu

, Wenjing Ma, Yuwen Zhao, Daokun Chen, Yi Hu, Qinglin Lu
, Wanwang Yin, Xinhui Yuan, Lijuan Jiang, Hao Yan, Min Li, Hongsen Wang, Xinyu Wang, Chao Yang:
xMath2.0: a high-performance extended math library for SW26010-Pro many-core processor. 56-71 - Xiaowen Xu

, Xiaoqiang Yue, Runzhang Mao, Yuntong Deng, Silu Huang, Haifeng Zou, Xiao Liu, Shaoliang Hu, Chunsheng Feng, Shi Shu, Zeyao Mo:
JXPAMG: a parallel algebraic multigrid solver for extreme-scale numerical simulations. 72-83 - Qiao Sun, Wenjing Ma, Jiachang Sun, Huiyuan Li:

Evolving the HPL benchmark towards multi-GPGPU clusters. 84-96 - Fangfang Liu

, Wenjing Ma, Yuwen Zhao, Daokun Chen, Yi Hu, Qinglin Lu
, Wanwang Yin, Xinhui Yuan, Lijuan Jiang, Hao Yan, Min Li, Hongsen Wang, Xinyu Wang, Chao Yang:
Publisher Correction: xMath2.0: a high-performance extended math library for SW26010-Pro many-core processor. 97
Volume 5, Number 2, June 2023
- Weifeng Liu

, Guangming Tan, Xiaowen Xu:
Editorial for the special issue on architecture, algorithms and applications of high performance sparse matrix computations. 99-101 - Y. R. Annie Bessant

, J. Grace Jency
, K. Martin Sagayam, A. Amir Anton Jone, Digvijay Pandey
, Binay Kumar Pandey:
Improved parallel matrix multiplication using Strassen and Urdhvatiryagbhyam method. 102-115 - Shengguo Li

, Xia Liao
, Yutong Lu, José E. Román, Xiaoqiang Yue:
A parallel structured banded DC algorithm for symmetric eigenvalue problems. 116-128 - Zhengyang Lu, Weifeng Liu

:
TileSpTRSV: a tiled algorithm for parallel sparse triangular solve on GPUs. 129-143 - Li Zhao

, Shizhe Li, Chen-Song Zhang, Chunsheng Feng, Shi Shu:
An improved multistage preconditioner on GPUs for compositional reservoir simulation. 144-159 - Jiaquan Gao

, Xinyue Chu, Yizhou Wang:
HeuriSPAI: a heuristic sparse approximate inverse preconditioning algorithm on GPU. 160-170 - Yu Li, Zijing Wang, Hehu Xie

:
GCGE: a package for solving large scale eigenvalue problems by parallel block damping inverse power method. 171-190 - Chuanying Li

, Stef Graillat, Zhe Quan, Tongxiang Gu, Hao Jiang, Kenli Li:
XHYPRE: a reliable parallel numerical algorithm library for solving large-scale sparse linear equations. 191-209 - Genghan Zhang, Yuetong Zhao, Yanting Tao, Zhongming Yu, Guohao Dai, Sitao Huang

, Yuan Wen, Pavlos Petoumenos, Yu Wang:
Sgap: towards efficient sparse tensor algebra compilation for GPU. 210-227
Volume 5, Number 3, September 2023
- Liang Yuan, Junmin Xiao:

SI on parallel system and algorithm optimization. 229-230 - Yongtao Luo

, Bo Yang, Jie Liu, Ruibo Wang, Jinmin Wen, Tiaojie Xiao, Xuguang Chen, Chunye Gong:
MT-office: parallel password recovery program for office on domestic heterogeneous multi-core processor. 231-244 - Xiaojun Lei, Tongxiang Gu, Xiaowen Xu:

ddRingAllreduce: a high-precision RingAllreduce algorithm. 245-257 - Kexing Zhou, Yong Dong, Juan Chen, Yuhan Cao, Zekai Li, Rongyu Deng, Yifei Guo, Zhixin Ou:

Processor power forecasting through model sample analysis and clustering. 258-276 - Yuan Zhang, Huawei Cao

, Yan Liang, Jie Zhang, Junying Huang, Xiaochun Ye, Xuejun An:
FSGraph: fast and scalable implementation of graph traversal on GPUs. 277-291 - Xiaohui Wei, Xinyang Zheng

, Chenyang Wang, Guangli Li, Hengshan Yue
:
FASS-pruner: customizing a fine-grained CNN accelerator-aware pruning framework via intra-filter splitting and inter-filter shuffling. 292-303 - Jie Lou, Yiming Sun

, Jie Zhang, Huawei Cao
, Yuan Zhang, Ninghui Sun:
ArkGPU: enabling applications' high-goodput co-location execution on multitasking GPUs. 304-321 - Biao Sun, Mingzhen Li, Hailong Yang

, Jun Xu, Zhongzhi Luan, Depei Qian:
Adapting combined tiling to stencil optimizations on sunway processor. 322-333 - Songwen Pei

, Jie Luo, Sheng Liang, Haonan Ding, Xiaochun Ye, Mingsong Chen:
Carbon Emissions Reduction of Neural Network by Discrete Rank Pruning. 334-346
Volume 5, Number 4, December 2023
- Bin Zhao

, Jiangkai Hu, Dapeng Wang, Bo Zhang, Fajing Chen, Ziwei Wan, Siyuan Sun:
The GRAPES evaluation tools based on Python (GetPy). 347-359 - Jiaxu Guo

, Yidan Xu, Haohuan Fu, Wei Xue, Lin Gan, Mengxuan Tan, Tingye Wu, Yutong Shen, Xianwei Wu, Liang Hu, Xilong Che:
GEO-WMS: an improved approach to geoscientific workflow management system on HPC. 360-373 - D. Sirisha

, S. Sambhu Prasad:
MPEFT: a makespan minimizing heuristic scheduling algorithm for workflows in heterogeneous computing systems. 374-389 - Faezeh Mollasalehi, Ehsan Mousavi Khaneghah

, Amirhosein Reyhani ShowkatAbad, Seyed Alireza Seyednejad, Faeze Gholamrezaie:
ExaLB: a mathematical framework for load balancing to support distributed exascale computing environments. 390-415 - Pouria Fakhri, Ehsan Mousavi Khaneghah

, Zohreh Esmaeili Bidhendi, Araz R. Aliev
:
ExaSU: a mathematical model for selecting the structured or unstructured resource discovery mechanism in distributed exascale computing environments. 416-428 - Yan Zeng

, Yong Ding, Dongyang Ou, Jilin Zhang, Yongjian Ren, Yunquan Zhang:
MP-DPS: adaptive distributed training for deep learning based on node merging and path prediction. 429-441 - Fang Lin

, Yi Liu, Xin Wang, Xueyan Gai:
Leveraging simulation of high performance computing systems with node simulation using architecture simulator. 442-464 - Ou Wu

, Binbin Huang, Shanshan Li, Yanze Wang, Haoming Li:
A performance evaluation method of queuing theory based on Cosmos cross-chain platform. 465-485

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














