


default search action
23rd PPOPP 2018: Vienna, Austria
- Andreas Krall, Thomas R. Gross:

Proceedings of the 23rd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPoPP 2018, Vienna, Austria, February 24-28, 2018. ACM 2018, ISBN 978-1-4503-4982-6 - Haosen Wen, Joseph Izraelevitz, Wentao Cai

, H. Alan Beadle, Michael L. Scott:
Interval-based memory reclamation. 1-13 - Maya Arbel-Raviv, Trevor Brown:

Harnessing epoch-based reclamation for efficient range queries. 14-27 - Michal Friedman

, Maurice Herlihy, Virendra J. Marathe, Erez Petrank:
A persistent lock-free queue for non-volatile memory. 28-40 - Linnan Wang, Jinmian Ye, Yiyang Zhao, Wei Wu

, Ang Li, Shuaiwen Leon Song, Zenglin Xu, Tim Kraska:
Superneurons: dynamic GPU memory management for training deep neural networks. 41-53 - Mehmet E. Belviranli, Seyong Lee

, Jeffrey S. Vetter, Laxmi N. Bhuyan:
Juggler: a dependence-aware task-based execution framework for GPUs. 54-67 - Maria Kotsifakou, Prakalp Srivastava, Matthew D. Sinclair, Rakesh Komuravelli, Vikram S. Adve, Sarita V. Adve:

HPVM: heterogeneous parallel virtual machine. 68-80 - Adrien Guatto, Sam Westrick

, Ram Raghunathan, Umut A. Acar, Matthew Fluet:
Hierarchical memory management for mutable state. 81-93 - Yue Zhao, Jiajia Li

, Chunhua Liao
, Xipeng Shen
:
Bridging the gap between deep learning and sparse matrix format selection. 94-108 - Zhen Jia, Aleksandar Zlateski, Frédo Durand, Kai Li:

Optimizing N-dimensional, winograd-based convolution for manycore CPUs. 109-123 - Xiongchao Tang, Jidong Zhai, Xuehai Qian, Bingsheng He

, Wei Xue, Wenguang Chen:
vSensor: leveraging fixed-workload snippets of programs for performance variance detection. 124-136 - Aleksandar Prokopec

:
Cache-tries: concurrent lock-free hash tries with constant-time operations. 137-151 - Milind Chabbi, Shasha Wen, Xu Liu:

Featherlight on-the-fly false-sharing detection. 152-167 - Prashant Singh Rawat, Fabrice Rastello, Aravind Sukumaran-Rajam

, Louis-Noël Pouchet, Atanas Rountev, P. Sadayappan
:
Register optimizations for stencils on GPUs. 168-182 - Da Zheng, Disa Mhembere, Joshua T. Vogelstein, Carey E. Priebe, Randal C. Burns

:
FlashR: parallelize and scale R for machine learning using SSDs. 183-194 - Harshitha Menon, Kathryn M. Mohror

:
DisCVar: discovering critical variables using algorithmic differentiation for transient faults. 195-206 - Dana Drachsler-Cohen, Martin T. Vechev, Eran Yahav:

Practical concurrent traversals in search trees. 207-218 - Lukas Gianinazzi

, Pavel Kalvoda, Alessandro De Palma, Maciej Besta, Torsten Hoefler:
Communication-avoiding parallel minimum cuts and connected components. 219-232 - Artem Khyzha

, Hagit Attiya
, Alexey Gotsman, Noam Rinetzky:
Safe privatization in transactional memory. 233-245 - Samuel Grossman, Heiner Litz, Christos Kozyrakis:

Making pull-based graph processing performant. 246-260 - Abhinav Jangda, Uday Bondhugula:

An effective fusion and tile size model for optimizing image processing pipelines. 261-275 - Lei Wang, Liangji Zhuang, Junhang Chen, Huimin Cui, Fang Lv, Ying Liu, Xiaobing Feng:

Lazygraph: lazy data coherency for replicas in distributed graph-parallel computation. 276-289 - Yihan Sun, Daniel Ferizovic, Guy E. Blelloch:

PAM: parallel augmented maps. 290-304 - Zhouwang Fu, Tao Song

, Zhengwei Qi, Haibing Guan:
Efficient shuffle management with SCache for DAG computing frameworks. 305-316 - Xueqi Li

, Guangming Tan, Bingchen Wang, Ninghui Sun:
High-performance genomic analysis framework with in-memory computing. 317-328 - Yang Liu, Jianguo Wang, Steven Swanson:

Griffin: uniting CPU and GPU in information retrieval systems for intra-query parallelism. 327-337 - Xinliang Wang, Weifeng Liu

, Wei Xue, Li Wu:
swSpTRSV: a fast sparse triangular solve with sparse level tile layout on sunway architectures. 338-353 - James R. Wilcox

, Cormac Flanagan, Stephen N. Freund
:
VerifiedFT: a verified, high-performance precise dynamic race detector. 354-367 - Yifan Xu, I-Ting Angelina Lee, Kunal Agrawal:

Efficient parallel determinacy race detection for two-dimensional dags. 368-380 - Umut A. Acar, Vitaly Aksenov

, Arthur Charguéraud, Mike Rainey:
Performance challenges in modular parallel programs. 381-382 - Mahwish Arif, Hans Vandierendonck:

Reducing the burden of parallel loop schedulers for many-core processors. 383-384 - Nachshon Cohen, Erez Petrank, James R. Larus:

Reducing transaction aborts by looking to the future. 385-386 - Andreia Correia, Pedro Ramalhete:

Strong trylocks for reader-writer locks. 387-388 - Yao Dong, Ana L. Milanova, Julian Dolby:

SecureMR: secure mapreduce using homomorphic encryption and program partitioning. 389-390 - Jesun Sahariar Firoz, Marcin Zalewski, Andrew Lumsdaine

:
A scalable distance-1 vertex coloring algorithm for power-law graphs. 391-392 - Koby Hayashi

, Grey Ballard
, Yujie Jiang, Michael J. Tobia
:
Shared-memory parallelization of MTTKRP for dense tensors. 393-394 - Peng Jiang, Gagan Agrawal:

Revealing parallel scans and reductions in sequential loops through function reconstruction. 395-396 - Changwan Hong, Aravind Sukumaran-Rajam

, Jinsung Kim
, Prashant Singh Rawat, Sriram Krishnamoorthy
, Louis-Noël Pouchet, Fabrice Rastello, P. Sadayappan:
Performance modeling for GPUs using abstract kernel emulation. 397-398 - Herbert Jordan, Bernhard Scholz, Pavle Subotic:

Two concurrent data structures for efficient datalog query processing. 399-400 - Bernhard Kerbl

, Joerg H. Mueller, Michael Kenzel, Dieter Schmalstieg, Markus Steinberger
:
A scalable queue for work distribution on GPUs. 401-402 - Johannes de Fine Licht, Michaela Blott, Torsten Hoefler:

Designing scalable FPGA architectures using high-level synthesis. 403-404 - Bo Liu

, Wenbin Jiang
, Hai Jin, Xuanhua Shi, Yang Ma:
Layrub: layer-centric GPU memory reuse and data migration in extreme-scale deep learning systems. 405-406 - Junhong Liu

, Xin He, Weifeng Liu
, Guangming Tan:
Register-based implementation of the sparse general matrix-matrix multiplication on GPUs. 407-408 - Girish Mururu, Ada Gavrilovska, Santosh Pande

:
Quantifying and reducing execution variance in STM via model driven commit optimization. 409-410 - Jung-Ho Park, Hyungmin Cho, Wookeun Jung, Jaejin Lee:

Transparent GPU memory management for DNNs. 411-412 - Manuel Pöter, Jesper Larsson Träff:

Stamp-it, amortized constant-time memory reclamation in comparison to five other schemes. 413-414 - Stefan Reif, Wolfgang Schröder-Preikschat

:
A predictable synchronisation algorithm. 415-416 - Heinrich Riebler

, Gavin Vaz, Tobias Kenter, Christian Plessl
:
Automated code acceleration targeting heterogeneous openCL devices. 417-418 - Isaac Sánchez Barrera

, Marc Casas
, Miquel Moretó
, Eduard Ayguadé, Jesús Labarta, Mateo Valero:
Graph partitioning applied to DAG scheduling to reduce NUMA effects. 419-420 - Vasily Volkov:

A microbenchmark to study GPU performance models. 421-422 - Tuowen Zhao

, Mary W. Hall
, Protonu Basu, Samuel Williams
, Hans Johansen
:
SIMD code generation for stencils on brick decompositions. 423-424

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














