


Taisuke Boku
2020 – today
- 2024
- [j24]Yutaka Watanabe, Miwako Tsuji, Hitoshi Murai, Taisuke Boku, Mitsuhisa Sato:
Design and performance evaluation of UCX for the Tofu Interconnect D on Fugaku towards efficient multithreaded communication. J. Supercomput. 80(14): 20715-20742 (2024) - [j23]Yutaka Watanabe, Miwako Tsuji, Hitoshi Murai, Taisuke Boku, Mitsuhisa Sato:
Correction: Design and performance evaluation of UCX for the Tofu Interconnect D on Fugaku towards efficient multithreaded communication. J. Supercomput. 80(17): 25710 (2024) - [c139]Toshihiro Hanawa, Kengo Nakajima, Yohei Miki, Takashi Shimokawabe, Kazuya Yamazaki, Shinji Sumimoto, Osamu Tatebe, Taisuke Boku, Daisuke Takahashi, Akira Nukada, Norihisa Fujita, Ryohei Kobayashi, Hiroto Tadano, Akira Naruse:
Preliminary Performance Evaluation of Grace-Hopper GH200. CLUSTER Workshops 2024: 184-185 - [c138]Wentao Liang, Norihisa Fujita, Ryohei Kobayashi, Taisuke Boku:
Using SYCLomatic to Migrate CUDA Code to oneAPI Adapting NVIDIA GPU. CLUSTER Workshops 2024: 192-193 - [c137]Kaito Kitazume, Norihisa Fujita, Ryohei Kobayashi, Taisuke Boku:
Preliminary Evaluation of Kyokko for Inter-FPGA Communication Framework CIRCUS. CLUSTER Workshops 2024: 194-195 - [c136]Norihisa Fujita, Beau Johnston, Narasinga Rao Miniskar, Ryohei Kobayashi, Mohammad Alaul Haque Monil, Keita Teranishi, Seyong Lee, Jeffrey S. Vetter, Taisuke Boku:
CHARM-SYCL & IRIS: A Tool Chain for Performance Portability on Extremely Heterogeneous Systems. e-Science 2024: 1-10 - [c135]Wentao Liang, Norihisa Fujita, Ryohei Kobayashi, Taisuke Boku:
Using Intel oneAPI for Multi-hybrid Acceleration Programming with GPU and FPGA Coupling. HPC Asia Workshops 2024: 69-76 - [c134]Taisuke Boku, Masatake Sugita, Ryohei Kobayashi, Shinnosuke Furuya, Takuya Fujie, Masahito Ohue, Yutaka Akiyama:
Improving Performance on Replica-Exchange Molecular Dynamics Simulations by Optimizing GPU Core Utilization. ICPP 2024: 1082-1091 - 2023
- [j22]Riadh Ben Abdelhamid, Yoshiki Yamaguchi, Taisuke Boku:
A Scalable Many-core Overlay Architecture on an HBM2-enabled Multi-Die FPGA. ACM Trans. Reconfigurable Technol. Syst. 16(1): 15:1-15:33 (2023) - [c133]Yuka Sano, Taisuke Boku, Mitsuhisa Sato, Miwako Tsuji, Norihisa Fujita, Ryohei Kobayashi:
Performance improvement by enhancing spatial parallelism on FPGA for HPC applications. CLUSTER Workshops 2023: 58-59 - [c132]Kohei Kikuchi, Norihisa Fujita, Ryohei Kobayashi, Taisuke Boku:
Implementation and Performance Evaluation of Collective Communications Using CIRCUS on Multiple FPGAs. HPC Asia Workshops 2023: 15-23 - [c131]Ryohei Kobayashi, Norihisa Fujita, Yoshiki Yamaguchi, Taisuke Boku, Kohji Yoshikawa, Makito Abe, Masayuki Umemura:
GPU-FPGA-accelerated Radiative Transfer Simulation with Inter-FPGA Communication. HPC Asia 2023: 117-125 - [c130]Norihisa Fujita, Beau Johnston, Ryohei Kobayashi, Keita Teranishi, Seyong Lee, Taisuke Boku, Jeffrey S. Vetter:
CHARM-SYCL: New Unified Programming Environment for Multiple Accelerator Types. SC Workshops 2023: 1651-1661 - [c129]Taisuke Boku, Ryuta Tsunashima, Ryohei Kobayashi, Norihisa Fujita, Seyong Lee, Jeffrey S. Vetter, Hitoshi Murai, Masahiro Nakao, Miwako Tsuji, Mitsuhisa Sato:
OpenACC Unified Programming Environment for Multi-hybrid Acceleration with GPU and FPGA. ISC Workshops 2023: 662-674 - 2022
- [j21]Yuta Hirokawa, Atsushi Yamada, Shunsuke Yamada, Masashi Noda, Mitsuharu Uemoto, Taisuke Boku, Kazuhiro Yabana:
Large-scale ab initio simulation of light-matter interaction at the atomic scale in Fugaku. Int. J. High Perform. Comput. Appl. 36(2): 182-197 (2022) - [j20]Ryohei Kobayashi, Kento Miura, Norihisa Fujita, Taisuke Boku, Toshiyuki Amagasa:
An Open-source FPGA Library for Data Sorting. J. Inf. Process. 30: 766-777 (2022) - [c128]Kento Miura, Ryohei Kobayashi, Toshiyuki Amagasa, Hiroyuki Kitagawa, Norihisa Fujita, Taisuke Boku:
An FPGA-based Accelerator for Regular Path Queries over Edge-labeled Graphs. IEEE Big Data 2022: 415-422 - [c127]Norihisa Fujita, Ryohei Kobayashi, Yoshiki Yamaguchi, Taisuke Boku:
Implementation and Performance Evaluation of Memory System Using Addressable Cache for HPC Applications on HBM2 Equipped FPGAs. Euro-Par Workshops 2022: 121-132 - [c126]Yuka Sano, Ryohei Kobayashi, Norihisa Fujita, Taisuke Boku:
Performance Evaluation on GPU-FPGA Accelerated Computing Considering Interconnections between Accelerators. HEART 2022: 10-16 - [c125]Ryuta Kashino, Ryohei Kobayashi, Norihisa Fujita, Taisuke Boku:
Multi-hetero Acceleration by GPU and FPGA for Astrophysics Simulation on oneAPI Environment. HPC Asia 2022: 84-93 - [c124]Taisuke Boku, Norihisa Fujita, Ryohei Kobayashi, Osamu Tatebe:
Cygnus - World First Multihybrid Accelerated Cluster with GPU and FPGA Coupling. ICPP Workshops 2022: 8:1 - [c123]Yutaka Watanabe, Mitsuhisa Sato, Miwako Tsuji, Hitoshi Murai, Taisuke Boku:
Design and Performance Evaluation of UCX for Tofu-D Interconnect with OpenSHMEM-UCX on Fugaku. PAW-ATM@SC 2022: 52-61 - [c122]Ryohei Kobayashi, Norihisa Fujita, Yoshiki Yamaguchi, Taisuke Boku, Kohji Yoshikawa, Makito Abe, Masayuki Umemura:
Accelerating Radiative Transfer Simulation on NVIDIA GPUs with OpenACC. PDCAT 2022: 344-358 - [c121]Taisuke Boku:
How FPGA can contribute to HPC ? VLSI-DAT 2022: 1 - 2021
- [j19]Riadh Ben Abdelhamid, Yoshiki Yamaguchi, Taisuke Boku:
A Highly-Efficient and Tightly-Connected Many-Core Overlay Architecture. IEEE Access 9: 65277-65292 (2021) - [c120]Norihisa Fujita, Ryohei Kobayashi, Yoshiki Yamaguchi, Taisuke Boku:
HBM2 Memory System for HPC Applications on an FPGA. CLUSTER 2021: 783-786 - [c119]Naoya Umezu, Yoshiki Yamaguchi, Taisuke Boku:
An FPGA-based storage control with load balancing. CLUSTER 2021: 791-794 - [c118]Kazuki Furukawa, Ryohei Kobayashi, Tomoya Yokono, Norihisa Fujita, Yoshiki Yamaguchi, Taisuke Boku, Kohji Yoshikawa, Masayuki Umemura:
An efficient RTL buffering scheme for an FPGA-accelerated simulation of diffuse radiative transfer. FPT 2021: 1-9 - [c117]Ryohei Kobayashi, Kento Miura, Norihisa Fujita, Taisuke Boku, Toshiyuki Amagasa:
A Sorting Library for FPGA Implementation in OpenCL Programming. HEART 2021: 10:1-10:6 - [c116]Ryuta Kashino, Ryohei Kobayashi, Norihisa Fujita, Taisuke Boku:
Performance Evaluation of OpenCL-Enabled Inter-FPGA Optical Link Communication Framework CIRCUS and SMI. HPC Asia 2021: 23-31 - [c115]Koei Watanabe, Kohei Kikuchi, Taisuke Boku, Takuto Sato, Hiroyuki Kusaka:
High Resolution of City-Level Climate Simulation by GPU with Multi-physical Phenomena. NPC 2021: 3-15 - [p4]Akihiro Tabuchi, Hitoshi Murai, Masahiro Nakao, Tetsuya Odajima, Taisuke Boku:
XcalableACC: An Integration of XcalableMP and OpenACC. XcalableMP PGAS Programming Language 2021: 123-146 - [p3]Keisuke Tsugane, Taisuke Boku, Hitoshi Murai, Mitsuhisa Sato, William Tang, Bei Wang:
Hybrid-View Programming of Nuclear Fusion Simulation Code in XcalableMP. XcalableMP PGAS Programming Language 2021: 181-203 - [p2]Miwako Tsuji, Hitoshi Murai, Taisuke Boku, Mitsuhisa Sato, Serge G. Petiton, Nahid Emad, Thomas Dufaud, Joachim Protze, Christian Terboven, Matthias S. Müller:
Multi-SPMD Programming Model with YML and XcalableMP. XcalableMP PGAS Programming Language 2021: 219-243 - 2020
- [j18]Ryohei Kobayashi, Norihisa Fujita, Yoshiki Yamaguchi, Taisuke Boku, Kohji Yoshikawa, Makito Abe, Masayuki Umemura:
Multi-Hybrid Accelerated Simulation by GPU and FPGA on Radiative Transfer Simulation in Astrophysics. J. Inf. Process. 28: 1073-1089 (2020) - [c114]Ryohei Kobayashi, Norihisa Fujita, Yoshiki Yamaguchi, Taisuke Boku, Kohji Yoshikawa, Makito Abe, Masayuki Umemura:
Accelerating Radiative Transfer Simulation with GPU-FPGA Cooperative Computation. ASAP 2020: 9-16 - [c113]Riadh Ben Abdelhamid, Yoshiki Yamaguchi, Taisuke Boku:
Condensing an overload of parallel computing ingredients into a single architecture recipe. ASAP 2020: 25-28 - [c112]Taisuke Boku:
AsHES 2020 Keynote Speaker (5: 30 pm CDT). IPDPS Workshops 2020: 431 - [c111]Norihisa Fujita, Ryohei Kobayashi, Yoshiki Yamaguchi, Tomohiro Ueno, Kentaro Sano, Taisuke Boku:
Performance Evaluation of Pipelined Communication Combined with Computation in OpenCL Programming on FPGA. IPDPS Workshops 2020: 450-459 - [c110]Daisuke Tsuji, Taisuke Boku, Ryosaku Ikeda, Takuto Sato, Hiroto Tadano, Hiroyuki Kusaka:
Parallelized GPU Code of City-Level Large Eddy Simulation. ISPDC 2020: 76-83 - [c109]Norihisa Fujita, Ryohei Kobayashi, Yoshiki Yamaguchi, Taisuke Boku, Kohji Yoshikawa, Makito Abe, Masayuki Umemura:
OpenCL-enabled Parallel Raytracing for Astrophysical Application on Multiple FPGAs with Optical Links. H2RC@SC 2020: 48-55 - [p1]Joachim Protze, Miwako Tsuji, Christian Terboven, Thomas Dufaud, Hitoshi Murai, Serge G. Petiton, Nahid Emad, Matthias S. Müller, Taisuke Boku:
MYX: Runtime Correctness Analysis for Multi-Level Parallel Programming Paradigms. Software for Exascale Computing 2020: 545-567 - [i1]Roman Iakymchuk, Daichi Mukunoki, Artur Podobas, Fabienne Jézéquel, Toshiyuki Imamura, Norihisa Fujita, Jens Huthmann, Shuhei Kudo, Yiyu Tan, Jens Domke, Kai Torben Ohlhus, Takeshi Fukaya, Takeo Hoshi, Yuki Murakami, Maho Nakata, Takeshi Ogita, Kentaro Sano, Taisuke Boku:
White Paper from Workshop on Large-scale Parallel Numerical Computing Technology (LSPANC 2020): HPC and Computer Arithmetic toward Minimal-Precision Computing. CoRR abs/2004.04628 (2020)
2010 – 2019
- 2019
- [j17]Masashi Noda, Shunsuke A. Sato, Yuta Hirokawa, Mitsuharu Uemoto, Takashi Takeuchi, Shunsuke Yamada, Atsushi Yamada, Yasushi Shinohara, Maiku Yamaguchi, Kenji Iida, Isabella Floss, Tomohito Otobe, Kyung-Min Lee, Kazuya Ishimura, Taisuke Boku, George F. Bertsch, Katsuyuki Nobusada, Kazuhiro Yabana:
SALMON: Scalable Ab-initio Light-Matter simulator for Optics and Nanoscience. Comput. Phys. Commun. 235: 356-365 (2019) - [j16]Masahiro Nakao, Hitoshi Murai, Hidetoshi Iwashita, Taisuke Boku, Mitsuhisa Sato:
Implementation and evaluation of the HPC challenge benchmark in the XcalableMP PGAS language. Int. J. High Perform. Comput. Appl. 33(1) (2019) - [j15]Masahiro Nakao, Tetsuya Odajima, Hitoshi Murai, Akihiro Tabuchi, Norihisa Fujita, Toshihiro Hanawa, Taisuke Boku, Mitsuhisa Sato:
Evaluation of XcalableACC with tightly coupled accelerators/InfiniBand hybrid communication on accelerated cluster. Int. J. High Perform. Comput. Appl. 33(5) (2019) - [c108]Riadh Ben Abdelhamid, Yoshiki Yamaguchi, Taisuke Boku:
MITRACA: Manycore Interlinked Torus Reconfigurable Accelerator Architecture. ASAP 2019: 38 - [c107]Iman Firmansyah, Changdao Du, Norihisa Fujita, Yoshiki Yamaguchi, Taisuke Boku:
FPGA-based Implementation of Memory-Intensive Application using OpenCL. HEART 2019: 16:1-16:4 - [c106]Miwako Tsuji, Taisuke Boku, Mitsuhisa Sato:
Scalable communication performance prediction using auto-generated pseudo MPI event trace. HPC Asia 2019: 53-62 - [c105]Norihisa Fujita, Ryohei Kobayashi, Yoshiki Yamaguchi, Taisuke Boku:
Parallel Processing on FPGA Combining Computation and Communication in OpenCL Programming. IPDPS Workshops 2019: 479-488 - [c104]Ryohei Kobayashi, Norihisa Fujita, Yoshiki Yamaguchi, Ayumi Nakamichi, Taisuke Boku:
GPU-FPGA Heterogeneous Computing with OpenCL-Enabled Direct Memory Access. IPDPS Workshops 2019: 489-498 - [c103]Riadh Ben Abdelhamid, Yoshiki Yamaguchi, Taisuke Boku:
MITRACA: A Next-Gen Heterogeneous Architecture. MCSoC 2019: 304-311 - [c102]Thomas Steinke, Estela Suarez, Taisuke Boku, Nalini Kumar, David E. Martin:
Using FPGAs to Accelerate HPC and Data Analytics on Intel-Based Systems. ISC Workshops 2019: 561-566 - 2018
- [c101]Norihisa Fujita, Ryohei Kobayashi, Yoshiki Yamaguchi, Yuma Oobata, Taisuke Boku, Makito Abe, Kohji Yoshikawa, Masayuki Umemura:
Accelerating Space Radiative Transfer on FPGA using OpenCL. HEART 2018: 6:1-6:7 - [c100]Akihiro Tabuchi, Masahiro Nakao, Hitoshi Murai, Taisuke Boku, Mitsuhisa Sato:
Performance evaluation for a hydrodynamics application in XcalableACC PGAS language for accelerated clusters. HPC Asia Workshops 2018: 1-10 - [c99]Masahiro Nakao, Hitoshi Murai, Taisuke Boku, Mitsuhisa Sato:
Linkage of XcalableMP and Python languages for high productivity on HPC cluster system: application to graph order/degree problem. HPC Asia Workshops 2018: 39-47 - [c98]Masahiro Nakao, Hitoshi Murai, Taisuke Boku, Mitsuhisa Sato:
Performance evaluation for omni XcalableMP compiler on many-core cluster system based on knights landing. HPC Asia Workshops 2018: 52-58 - [c97]Masashi Horikoshi, Larry Meadows, Tom Elken, Pradeep Sivakumar, Edward Mascarenhas, James Erwin, Dmitry Durnov, Alexander Sannikov, Toshihiro Hanawa, Taisuke Boku:
Scaling collectives on large clusters using Intel(R) architecture processors and fabric. HPC Asia Workshops 2018: 59-62 - [c96]Larry Meadows, Ken-Ichi Ishikawa, Taisuke Boku, Masashi Horikoshi:
Multiple endpoints for improved MPI performance on a lattice QCD code. HPC Asia Workshops 2018: 67-70 - [c95]Yuta Hirokawa, Taisuke Boku, Shunsuke A. Sato, Kazuhiro Yabana:
Performance Evaluation of Large Scale Electron Dynamics Simulation under Many-core Cluster based on Knights Landing. HPC Asia 2018: 183-191 - [c94]Ryohei Kobayashi, Yuma Oobata, Norihisa Fujita, Yoshiki Yamaguchi, Taisuke Boku:
OpenCL-ready High Speed FPGA Network for Reconfigurable High Performance Computing. HPC Asia 2018: 192-201 - [c93]Balazs Gerofi, Rolf Riesen, Masamichi Takagi, Taisuke Boku, Kengo Nakajima, Yutaka Ishikawa, Robert W. Wisniewski:
Performance and Scalability of Lightweight Multi-kernel Based Operating Systems. IPDPS 2018: 116-125 - [c92]Yutaka Watanabe, Jinpil Lee, Taisuke Boku, Mitsuhisa Sato:
Trade-Off of Offloading to FPGA in OpenMP Task-Based Programming. IWOMP 2018: 96-110 - [c91]Kazuaki Matsumura, Mitsuhisa Sato, Taisuke Boku, Artur Podobas, Satoshi Matsuoka:
MACC: An OpenACC Transpiler for Automatic Multi-GPU Use. SCFA 2018: 109-127 - [c90]Yuta Hirokawa, Taisuke Boku, Mitsuharu Uemoto, Shunsuke A. Sato, Kazuhiro Yabana:
Performance Optimization and Evaluation of Scalable Optoelectronics Application on Large Scale KNL Cluster. ISC 2018: 205-225 - 2017
- [c89]Akihiro Tabuchi, Masahiro Nakao, Hitoshi Murai, Taisuke Boku, Mitsuhisa Sato:
Implementation and Evaluation of One-sided PGAS Communication in XcalableACC for Accelerated Clusters. CCGrid 2017: 625-634 - [c88]Masahiro Nakao, Hitoshi Murai, Hidetoshi Iwashita, Akihiro Tabuchi, Taisuke Boku, Mitsuhisa Sato:
Implementing Lattice QCD Application with XcalableACC Language on Accelerated Cluster. CLUSTER 2017: 429-438 - [c87]Taisuke Boku, Ken-Ichi Ishikawa, Yoshinobu Kuramashi, Lawrence Meadows:
Mixed Precision Solver Scalable to 16000 MPI Processes for Lattice Quantum Chromodynamics Simulations on the Oakforest-PACS System. CANDAR 2017: 362-368 - [c86]Hiroki Nakamura, Hirotaka Takayama, Yoshiki Yamaguchi, Taisuke Boku:
Thorough analysis of PCIe Gen3 communication. ReConFig 2017: 1-6 - [c85]Joachim Protze, Christian Terboven, Matthias S. Müller, Serge G. Petiton, Nahid Emad, Hitoshi Murai, Taisuke Boku:
Runtime Correctness Checking for Emerging Programming Paradigms. CORRECTNESS@SC 2017: 21-27 - 2016
- [j14]Keisuke Tsugane, Taisuke Boku, Hitoshi Murai, Mitsuhisa Sato, William M. Tang, Bei Wang:
Hybrid-view programming of nuclear fusion simulation code in the PGAS parallel programming language XcalableMP. Parallel Comput. 57: 37-51 (2016) - [c84]Iman Firmansyah, Yoshiki Yamaguchi, Taisuke Boku:
Performance evaluation of Stratix V DE5-Net FPGA board for high performance computing. IC3INA 2016: 23-27 - [c83]Yuta Hirokawa, Taisuke Boku, Shunsuke A. Sato, Kazuhiro Yabana:
Electron Dynamics Simulation with Time-Dependent Density Functional Theory on Large Scale Symmetric Mode Xeon Phi Cluster. IPDPS Workshops 2016: 1202-1211 - [c82]Akihiro Tabuchi, Yasuyuki Kimura, Sunao Torii, Hideo Matsufuru, Tadashi Ishikawa, Taisuke Boku, Mitsuhisa Sato:
Design and Preliminary Evaluation of Omni OpenACC Compiler for Massive MIMD Processor PEZY-SC. IWOMP 2016: 293-305 - [c81]Kazuya Matsumoto, Norihisa Fujita, Toshihiro Hanawa, Taisuke Boku:
Implementation and Evaluation of NAS Parallel CG Benchmark on GPU Cluster with Proprietary Interconnect TCA. VECPAR 2016: 135-145 - 2015
- [c80]Toshihiro Hanawa, Yuetsu Kodama, Taisuke Boku, Hideharu Amano, Hitoshi Murai, Masayuki Umemura, Mitsuhisa Sato:
Towards Unification of Accelerated Computing and Interconnection For Extreme-Scale Computing. ARC 2015: 463-474 - [c79]Toshihiro Hanawa, Hisafumi Fujii, Norihisa Fujita, Tetsuya Odajima, Kazuya Matsumoto, Yuetsu Kodama, Taisuke Boku:
Improving Strong-Scaling on GPU Cluster Based on Tightly Coupled Accelerators Architecture. CLUSTER 2015: 88-91 - [c78]Tetsuya Odajima, Taisuke Boku, Toshihiro Hanawa, Hitoshi Murai, Masahiro Nakao, Akihiro Tabuchi, Mitsuhisa Sato:
Hybrid Communication with TCA and InfiniBand on a Parallel Programming Language XcalableACC for GPU Clusters. CLUSTER 2015: 627-634 - [c77]Toshihiro Hanawa, Hisafumi Fujii, Norihisa Fujita, Tetsuya Odajima, Kazuya Matsumoto, Taisuke Boku:
Evaluation of FFT for GPU Cluster Using Tightly Coupled Accelerators Architecture. CLUSTER 2015: 635-641 - [c76]Kazuya Matsumoto, Toshihiro Hanawa, Yuetsu Kodama, Hisafumi Fujii, Taisuke Boku:
Implementation of CG Method on GPU Cluster with Proprietary Interconnect TCA for GPU Direct Communication. IPDPS Workshops 2015: 647-655 - 2014
- [j13]Yukihiro Hasegawa, Jun-ichi Iwata, Miwako Tsuji, Daisuke Takahashi, Atsushi Oshiyama, Kazuo Minami, Taisuke Boku, Hikaru Inoue, Yoshito Kitazawa, Ikuo Miyoshi, Mitsuo Yokokawa:
Performance evaluation of ultra-large-scale first-principles electronic structure calculation code on the K computer. Int. J. High Perform. Comput. Appl. 28(3): 335-355 (2014) - [j12]Masashi Noda, Kazuya Ishimura, Katsuyuki Nobusada, Kazuhiro Yabana, Taisuke Boku:
Massively-parallel electron dynamics calculations in real-time and real-space: Toward applications to nanostructures of more than ten-nanometers in size. J. Comput. Phys. 265: 145-155 (2014) - [j11]Yuetsu Kodama, Toshihiro Hanawa, Taisuke Boku, Mitsuhisa Sato:
PEACH2: An FPGA-based PCIe network device for Tightly Coupled Accelerators. SIGARCH Comput. Archit. News 42(4): 3-8 (2014) - [c75]Norihisa Fujita, Hisafumi Fujii, Toshihiro Hanawa, Yuetsu Kodama, Taisuke Boku, Yoshinobu Kuramashi, Mike Clark:
QCD Library for GPU Cluster with Proprietary Interconnect for GPU Direct Communication. Euro-Par Workshops (1) 2014: 251-262 - [c74]Takuya Kuhara, Takahiro Kaneda, Toshihiro Hanawa, Yuetsu Kodama, Taisuke Boku, Hideharu Amano:
A Preliminarily Evaluation of PEACH3: A Switching Hub for Tightly Coupled Accelerators. CANDAR 2014: 377-381 - [c73]Keisuke Tsugane, Hideo Nuga, Taisuke Boku, Hitoshi Murai, Mitsuhisa Sato, William M. Tang, Bei Wang:
Hybrid-view programming of nuclear fusion simulation code in the PGAS parallel programming language XcalableMP. ICPADS 2014: 640-647 - [c72]Norihisa Fujita, Hideo Nuga, Taisuke Boku, Yasuhiro Idomura:
Nuclear Fusion Simulation Code Optimization and Performance Evaluation on GPU Cluster. IPDPS Workshops 2014: 1266-1274 - [c71]Masahiro Nakao, Hitoshi Murai, Takenori Shimosaka, Akihiro Tabuchi, Toshihiro Hanawa, Yuetsu Kodama, Taisuke Boku, Mitsuhisa Sato:
XcalableACC: extension of XcalableMP PGAS language using OpenACC for accelerator clusters. WACCPD@SC 2014: 27-36 - 2013
- [c70]Takaaki Miyajima, Takuya Kuhara, Toshihiro Hanawa, Hideharu Amano, Taisuke Boku:
Task level pipelining with PEACH2: An FPGA switching fabric for high performance computing. FPT 2013: 466-469 - [c69]Toshihiro Hanawa, Yuetsu Kodama, Taisuke Boku, Mitsuhisa Sato:
Interconnection Network for Tightly Coupled Accelerators Architecture. Hot Interconnects 2013: 79-82 - [c68]Tetsuya Odajima, Taisuke Boku, Mitsuhisa Sato, Toshihiro Hanawa, Yuetsu Kodama, Raymond Namyst, Samuel Thibault, Olivier Aumage:
Adaptive Task Size Control on High Level Programming for GPU/CPU Work Sharing. ICA3PP (2) 2013: 59-68 - [c67]Norihisa Fujita, Hideo Nuga, Taisuke Boku, Yasuhiro Idomura:
Nuclear Fusion Simulation Code Optimization on GPU Clusters. ICPADS 2013: 420-421 - [c66]Toshihiro Hanawa, Yuetsu Kodama, Taisuke Boku, Mitsuhisa Sato:
Tightly Coupled Accelerators Architecture for Minimizing Communication Latency among Accelerators. IPDPS Workshops 2013: 1030-1039 - 2012
- [c65]Masahiro Nakao, Jinpil Lee, Taisuke Boku, Mitsuhisa Sato:
Productivity and Performance of Global-View Programming with XcalableMP PGAS Language. CCGRID 2012: 402-409 - [c64]Tetsuya Odajima, Taisuke Boku, Toshihiro Hanawa, Jinpil Lee, Mitsuhisa Sato:
GPU/CPU Work Sharing with Parallel Language XcalableMP-dev for Parallelized Accelerated Computing. ICPP Workshops 2012: 97-106 - [c63]Takuma Nomizu, Daisuke Takahashi, Jinpil Lee, Taisuke Boku, Mitsuhisa Sato:
Implementation of XcalableMP Device Acceleration Extention with OpenCL. IPDPS Workshops 2012: 2394-2403 - 2011
- [j10]Jack J. Dongarra, Peter H. Beckman, Terry Moore, Patrick Aerts, Giovanni Aloisio, Jean-Claude Andre, David Barkai, Jean-Yves Berthou, Taisuke Boku, Bertrand Braunschweig, Franck Cappello, Barbara M. Chapman, Xuebin Chi, Alok N. Choudhary, Sudip S. Dosanjh, Thom H. Dunning, Sandro Fiore, Al Geist, Bill Gropp, Robert J. Harrison, Mark Hereld, Michael A. Heroux, Adolfy Hoisie, Koh Hotta, Zhong Jin, Yutaka Ishikawa, Fred Johnson, Sanjay Kale, Richard Kenway, David E. Keyes, Bill Kramer, Jesús Labarta, Alain Lichnewsky, Thomas Lippert, Bob Lucas, Barney Maccabe, Satoshi Matsuoka, Paul Messina, Peter Michielse, Bernd Mohr, Matthias S. Müller, Wolfgang E. Nagel, Hiroshi Nakashima, Michael E. Papka, Daniel A. Reed, Mitsuhisa Sato, Edward Seidel, John Shalf, David Skinner, Marc Snir, Thomas L. Sterling, Rick Stevens, Frederick H. Streitz, Bob Sugar, Shinji Sumimoto, William M. Tang, John A. Taylor, Rajeev Thakur, Anne E. Trefethen, Mateo Valero, Aad J. van der Steen, Jeffrey S. Vetter, Peg Williams, Robert W. Wisniewski, Katherine A. Yelick:
The International Exascale Software Project roadmap. Int. J. High Perform. Comput. Appl. 25(1): 3-60 (2011) - [j9]Sugako Otani, Hiroyuki Kondo, Itaru Nonomura, Toshihiro Hanawa, Shin'ichi Miura, Taisuke Boku:
Peach: A Multicore Communication System on Chip with PCI Express. IEEE Micro 31(6): 39-50 (2011) - [c62]Sugako Otani, Hiroyuki Kondo, Itaru Nonomura, Atsuyuki Ikeya, Minoru Uemura, Katsushi Asahina, Kazutami Arimoto, Shin'ichi Miura, Toshihiro Hanawa, Taisuke Boku, Mitsuhisa Sato:
An 80 Gbps dependable multicore communication SoC with PCI express I/F and intelligent interrupt controller. COOL Chips 2011: 1-3 - [c61]Shin'ichi Miura, Toshihiro Hanawa, Taisuke Boku, Mitsuhisa Sato:
XMCAPI: Inter-core Communication Interface on Multi-chip Embedded Systems. EUC 2011: 397-402 - [c60]Wolfgang Karl, Samuel Thibault, Stanimire Tomov, Taisuke Boku:
Introduction. Euro-Par (2) 2011: 399-400 - [c59]Jinpil Lee, Minh Tuan Tran, Tetsuya Odajima, Taisuke Boku, Mitsuhisa Sato:
An Extension of XcalableMP PGAS Lanaguage for Multi-node GPU Clusters. Euro-Par Workshops (1) 2011: 429-439 - [c58]Toshihiro Hanawa, Taisuke Boku, Shin'ichi Miura, Mitsuhisa Sato, Kazutami Arimoto:
PEARL and PEACH: A Novel PCI Express Direct Link and Its Implementation. IPDPS Workshops 2011: 871-879 - [c57]Sugako Otani, Hiroyuki Kondo, Itaru Nonomura, Atsuyuki Ikeya, Minoru Uemura, Yasushi Hayakawa, Takeshi Oshita, Satoshi Kaneko, Katsushi Asahina, Kazutami Arimoto, Shin'ichi Miura, Toshihiro Hanawa, Taisuke Boku, Mitsuhisa Sato:
An 80Gb/s dependable communication SoC with PCI express I/F and 8 CPUs. ISSCC 2011: 266-268 - [c56]