


10th VECPAR 2012: Kobe, Japan
- Michel J. Daydé, Osni Marques, Kengo Nakajima: High Performance Computing for Computational Science - VECPAR 2012, 10th International Conference, Kobe, Japan, July 17-20, 2012, Revised Selected Papers. Lecture Notes in Computer Science 7851, Springer 2013, ISBN 978-3-642-38717-3

Invited Presentations
- Horst D. Simon: Barriers to Exascale Computing. 1-3
- Richard W. Vuduc, Kenneth Czechowski: Toward a Theory of Algorithm-Architecture Co-design. 4-8
- Takashi Furumura: Visualization of Strong Ground Motion from the 2011 Off Tohoku, Japan (Mw=9.0) Earthquake Obtained from Dense Nation-Wide Seismic Network and Large-Scale Parallel FDM Simulation. 9-16
- Ryutaro Himeno: Grand Challenge in Life Science on K Computer. 17-22
- Kenji Ono, Tomohiro Kawanabe, Toshio Hatada: HPC/PF - High Performance Computing Platform: An Environment That Accelerates Large-Scale Simulations. 23-27

GPU Computing
- Jakub Kurzak, Piotr Luszczek, Mathieu Faverge, Jack J. Dongarra: Programming the LU Factorization for a Multicore System with Accelerators. 28-35
- Rohit Gupta, Martin B. van Gijzen, Cornelis Vuik: Efficient Two-Level Preconditioned Conjugate Gradient Method on the GPU. 36-49
- Andrés Tomás, Zhaojun Bai, Vicente Hernández: Parallelization of the QR Decomposition with Column Pivoting Using Column Cyclic Distribution on Multicore and GPU Processors. 50-58
- Toshiyuki Imamura, Susumu Yamada, Masahiko Machida: A High Performance SYMV Kernel on a Fermi-core GPU. 59-71
- Ahmad Abdelfattah, Jack J. Dongarra, David E. Keyes, Hatem Ltaief: Optimizing Memory-Bound SYMV Kernel on GPU Hardware Accelerators. 72-79

Applications
- Hajime Yamamoto, Shinichi Nanai, Keni Zhang, Pascal Audigane, Christophe Chiaberge, Ryusei Ogata, Noriaki Nishikawa, Yuichi Hirokawa, Satoru Shingu, Kengo Nakajima: Numerical Simulation of Long-Term Fate of CO2 Stored in Deep Reservoir Rocks on Massively Parallel Vector Supercomputer. 80-92
- Jinfang Gao, Huilin Xing: High Performance Simulation of Complicated Fluid Flow in 3D Fractured Porous Media with Permeable Material Matrix Using LBM. 93-104
- M. L. L. Wijerathne, Muneo Hori, Tsuyoshi Ichimura, Seizo Tanaka: Parallel Scalability Enhancements of Seismic Response and Evacuation Simulations of Integrated Earthquake Simulator. 105-117
- Anthony Scemama, Michel Caffarel, Emmanuel Oseret, William Jalby: QMC=Chem: A Quantum Monte Carlo Program for Large-Scale Simulations in Chemistry at the Petascale Level and beyond. 118-127

Finite Element Method from Various Viewpoints
- Niclas Jansson: Optimizing Sparse Matrix Assembly in Finite Element Solvers with One-Sided Communication. 128-139
- Satoshi Ohshima, Masae Hayashi, Takahiro Katagiri, Kengo Nakajima: Implementation and Evaluation of 3D Finite Element Method Application for CUDA. 140-148
- Alberto F. De Souza, Lucas de Paula Veronese, Leonardo Muniz de Lima, Claudine Badue, Lucia Catabriga: Evaluation of Two Parallel Finite Element Implementations of the Time-Dependent Advection Diffusion Problem: GPU versus Cluster Considering Time and Energy Consumption. 149-162

Cloud and Visualization
- Germán Moltó, Amanda Calatrava, Vicente Hernández: A Service-Oriented Architecture for Scientific Computing on Cloud Infrastructures. 163-176
- Alexandre Solon Nery, Nadia Nedjah, Felipe M. G. França, Lech Józwiak: Interactive Volume Rendering Based on Ray-Casting for Multi-core Architectures. 177-186

Performance
- Franz Franchetti, Yevgen Voronenko, Gheorghe Almási: Automatic Generation of the HPC Challenge's Global FFT Benchmark for BlueGene/P. 187-200
- Edgar Solomonik, James Demmel: Matrix Multiplication on Multidimensional Torus Networks. 201-215

Methods and Tools for Advanced Scientific Computing
- Babak Hejazialhosseini, Christian Conti, Diego Rossinelli, Petros Koumoutsakos: High Performance CPU Kernels for Multiphase Compressible Flows. 216-225
- Yasunori Futamura, Tetsuya Sakurai, Shinnosuke Furuya, Jun-ichi Iwata: Efficient Algorithm for Linear Systems Arising in Solutions of Eigenproblems and Its Application to Electronic-Structure Calculations. 226-235
- Takahiro Katagiri, Takao Sakurai, Mitsuyoshi Igai, Satoshi Ohshima, Hisayasu Kuroda, Ken Naono, Kengo Nakajima: Control Formats for Unsymmetric and Symmetric Sparse Matrix-Vector Multiplications on OpenMP Implementations. 236-248

Algorithms and Data Analysis
- Sandrine Mouysset, Ronan Guivarch: Sparsification on Parallel Spectral Clustering. 249-260
- Prasanna Balaprakash, Stefan M. Wild, Paul D. Hovland: An Experimental Study of Global and Local Search Algorithms in Empirical Performance Tuning. 261-269
- Aleksandr Drozd, Naoya Maruyama, Satoshi Matsuoka: A Multi GPU Read Alignment Algorithm with Model-Based Performance Optimization. 270-277

Parallel Iterative Solvers on Multicore Architectures
- Masae Hayashi, Kengo Nakajima: OpenMP/MPI Hybrid Parallel ILU(k) Preconditioner for FEM Based on Extended Hierarchical Interface Decomposition for Multi-core Clusters. 278-291
- Masatoshi Kawai, Takeshi Iwashita, Hiroshi Nakashima, Osni Marques: Parallel Smoother Based on Block Red-Black Ordering for Multigrid Poisson Solver. 292-299
- Vincent Heuveline, Sven Janko, Wolfgang Karl, Björn Rocker, Martin Schindewolf: Software Transactional Memory, OpenMP and Pthread Implementations of the Conjugate Gradients Method - A Preliminary Evaluation. 300-313

The Seventh International Workshop on Automatic Performance Tuning
- Takahiro Katagiri, Pierre-Yves Aquilanti, Serge G. Petiton: A Smart Tuning Strategy for Restart Frequency of GMRES(m) with Hierarchical Cache Sizes. 314-328
- Lu Li, Usman Dastgeer, Christoph W. Kessler: Adaptive Off-Line Tuning for Optimized Composition of Components for Heterogeneous Many-Core Systems. 329-345
- Diego Fabregat-Traver, Paolo Bientinesi: A Domain-Specific Compiler for Linear Algebra Operations. 346-361
- Bryan Marker, Jack Poulson, Don S. Batory, Robert A. van de Geijn: Designing Linear Algebra Algorithms by Transformation: Mechanizing the Expert Developer. 362-378
- Hiroki Toyokawa, Hiroyuki Ishigami, Kinji Kimura, Masami Takata, Yoshimasa Nakamura: Accelerating the Reorthogonalization of Singular Vectors with a Multi-core Processor. 379-390
- Jeffrey Morlan, Shoaib Kamil, Armando Fox: Auto-tuning the Matrix Powers Kernel with SEJITS. 391-403
- Tatsuya Abe, Mitsuhisa Sato: Auto-tuning of Numerical Programs by Block Multi-color Ordering Code Generation and Job-Level Parallel Execution. 404-419
- Ayumu Tomiyama, Reiji Suda: Automatic Parameter Optimization for Edit Distance Algorithm on GPU. 420-434
- Kengo Nakajima: Automatic Tuning of Parallel Multigrid Solvers Using OpenMP/MPI Hybrid Parallel Programming Models. 435-450
- Andreas Schäfer, Dietmar Fey: A Predictive Performance Model for Stencil Codes on Multicore CPUs. 451-466
