


9th ICS 1995: Barcelona, Spain
- Mateo Valero: Proceedings of the 9th international conference on Supercomputing, ICS 1995, Barcelona, Spain, July 3-7, 1995. ACM 1995, ISBN 0-89791-728-6
- Thomas Stricker, James M. Stichnoth, David R. O'Hallaron, Susan Hinrichs, Thomas R. Gross: Decoupling Synchronization and Data Transfer in Message Passing Systems of Parallel Computers. 1-10
- Alain Kägi, Nagi Aboulenein, Doug Burger, James R. Goodman: Techniques for Reducing Overheads of Shared-Memory Multiprocessing. 11-20
- Wei Li: Compiler Cache Optimizations for Banded Matrix Problems. 21-30
- Alexandre E. Eichenberger, Edward S. Davidson, Santosh G. Abraham: Optimum Modulo Schedules for Minimum Register Requirements. 31-40
- Welf Löwe, Wolf Zimmermann: Upper Time Bounds for Executing PRAM-Programs on the LogP-Machine. 41-50
- Thomas Fahringer, Matthew Haines, Piyush Mehrotra: On the Utility of Threads for Data Parallel Programming. 51-59
- Sartaj Sahni: The DMBC: Architecture and Fundamental Operations. 60-66
- Chau-Wen Tseng, Jennifer-Ann M. Anderson, Saman P. Amarasinghe, Monica S. Lam: Unified Compilation Techniques for Shared and Distributed Address Space Machines. 67-76
- Tzi-cker Chiueh, Manish Verma: A Compiler-Directed Distributed Shared Memory System. 77-86
- David A. Garza-Salazar, A. P. Wim Böhm: Reducing Communication by Honoring Multiple Alignments. 87-96
- Xiaodong Zhang, Zhichen Xu: Multiprocessor Scalability Predictions Through Detailed Program Execution Analysis. 97-106
- José A. Gregorio, Fernando Vallejo, Ramón Beivide, Carmen Carrión: Petri Net Modeling of Interconnection Networks for Massively Parallel Architectures. 107-116
- Manuel Ujaldon, Emilio L. Zapata: Efficient Resolution of Sparse Indirections in Data-Parallel Compilers. 117-126
- Andreas Müller, Roland Rühl: Extending High Performance Fortran for the Support of Unstructured Computations. 127-136
- Lawrence Rauchwerger, Nancy M. Amato, David A. Padua: Run-Time Methods for Parallelizing Partially Parallel Loops. 137-146
- Chien-Min Wang, Chiu-Yu Ku: A Near-Optimal Broadcasting Algorithm in All-Port Wormhole-Routed Hypercubes. 147-153
- Kiran Raghavendra Desai, Kanad Ghose: A Comparative Study of Single Hop WDM Interconnections for Multiprocessors. 154-163
- R. Knecht, Gregory Allen Kohring: Dynamic Load Balancing for the Simulation of Granular Materials. 164-169
- Mounir Hamdi, Chi-kin Lee: Dynamic Load Balancing of Data Parallel Applications on a Distributed Network. 170-179
- Ken Kennedy, Nenad Nedeljkovic, Ajay Sethi: Efficient Address Generation for Block-Cyclic Distributions. 180-184
- Hiroyuki Sato, Takeshi Nanri, Masaaki Shimasaki: Using Asynchronous and Bulk Communications to Construct an Optimizing Compiler for Distributed-Memory Machines with Consideration Given to Communications Costs. 185-189
- Tatsuya Shindo, Hidetoshi Iwashita, Tsunehisa Doi, Junichi Hagiwara, Shaun Kaneshiro: HPF Compiler for the AP1000. 190-194
- Nasser Elmasri, Herbert H. J. Hum, Guang R. Gao: The Threaded Communication Library: Preliminary Experiences on a Multiprocessor with Dual-Processor Nodes. 195-199
- Hesham Keshk, Shin-ichiro Mori, Hiroshi Nakashima, Shinji Tomita: Amon: A Parallel Slice Algorithm for Wire Routing. 200-208
- Reiji Suda, Yoshio Oyanagi: Implementation of Sparta, a Highly Parallel Circuit Simulator by the Preconditioned Jacobi Method, on a Distributed Memory Machine. 209-217
- Daniel González-Morales, José L. Roda, Francisco Almeida, Casiano Rodríguez, F. García: Integral Knapsack Problems: Parallel Algorithms and Their Implementations on Distributed Systems. 218-226
- Donna Bergmark: Optimization and Parallelization of a Commodity Trade Model for the IBM SP1/2, using Parallel Programming Tools. 227-236
- C. J. Tan: Deep Blue: Computer Chess and Massively Parallel Systems (extended abstract). 237-239
- Feng-Hsiung Hsu, Murray Campbell, A. Joseph Hoane Jr.: Deep Blue System Overview. 240-244
- Nathalie Drach: Hardware Implementation Issues of Data Prefetching. 245-254
- David A. Koufaty, Xiangfeng Chen, David K. Poulsen, Josep Torrellas: Data Forwarding in Scalable Shared-Memory Multiprocessors. 255-264
- Vadim Maslov: Enhancing Array Dataflow Dependence Analysis with On-demand Global Value Propagation. 265-269
- Hiroshi Ohta, Yasuhiko Saito, Masahiro Kainaga, Hiroyuki Ono: Optimal Tile Size Adjustment in Compiling General DOACROSS Loop Nests. 270-279
- Mounir Hamdi, Siang W. Song: Efficient Embeddings into the Hypercube Using Matrix Transformations. 280-288
- Chao-Wei Ou, Manoj Gunwani, Sanjay Ranka: Architecture-independent Locality-improving Transformations of Computational Graphs Embedded in k-Dimensions. 289-298
- Jacques Jorda, Abdelaziz Mzoughi, Daniel Litaize: Semi-linear and Bi-base Storage Schemes Classes: General Overview and Case Study. 299-307
- Shigeru Kusakabe, Taku Nagai, Yoshihiro Yamashita, Rin-Ichiro Taniguchi, Makoto Amamiya: A Dataflow Language with Object-based Extension and its Implementation on a Commercially Available Parallel Machine. 308-317
- Michael F. P. O'Boyle, François Bodin: Compiler Reduction of Synchronisation in Shared Virtual Memory Systems. 318-327
- Hayato Yamana, Mitsuhisa Sato, Yuetsu Kodama, Hirofumi Sakane, Shuichi Sakai, Yoshinori Yamaguchi: A Macrotask-level Unlimited Speculative Execution on Multiprocessors. 328-337
- Antonio González, Carlos Aliagas, Mateo Valero: A Data Cache with Multiple Caching Strategies Tuned to Different Types of Locality. 338-347
- Yasuhiko Nakashima, Toshiaki Kitamura, Hideo Tamura, Masaaki Takiuchi, Ken'ichi Miura: Scalar Processor of the VPP500 Parallel Supercomputer. 348-356
- Robert van Engelen, Lex Wolters: A Comparison of Parallel Programming Paradigms and Data Distributions for a Limited Area Numerical Weather Forecast Routine. 357-364
- Marios D. Dikaiakos, Daphne Manoussaki, Calvin Lin, Diana E. Woodward: The Portable Parallel Implementation of Two Novel Mathematical Biology Algorithms in ZPL. 365-374
- Tzi-cker Chiueh: Performance Optimization for Parallel Tape Arrays. 375-384
- James V. Huber Jr., Andrew A. Chien, Christopher L. Elford, David S. Blumenthal, Daniel A. Reed: PPFS: a High Performance Portable Parallel File System. 385-394
- Rajesh Bordawekar, Alok N. Choudhary: Communication Strategies for Out-of-Core Programs on Distributed Memory Machines. 395-403
- Sandra Johnson Baylor, Caroline Benveniste, Yarsun Hsu: Performance Evaluation of a Parallel I/O Architecture. 404-413
- Peng Tu, David A. Padua: Gated SSA-based Demand-Driven Symbolic Analysis for Parallelizing Compilers. 414-423
- Ernesto Su, Antonio Lain, Shankar Ramaswamy, Daniel J. Palermo, Eugene W. Hodges IV, Prithviraj Banerjee: Advanced Compilation Techniques in the PARADIGM Compiler for Distributed-memory Multicomputers. 424-433
- Peiyi Tang, Nianshu Gao: Vectorization beyond Data Dependences. 434-443
- William M. Pottenger, Rudolf Eigenmann: Idiom Recognition in the Polaris Parallelizing Compiler. 444-448
