default search action
24th HiPC 2017: Jaipur, India
- 24th IEEE International Conference on High Performance Computing, HiPC 2017, Jaipur, India, December 18-21, 2017. IEEE Computer Society 2017, ISBN 978-1-5386-2293-3
Keynote 1
- Parthasarathy Ranganathan:
End of Moore's Law: Or, a Computer Architect's Mid-life Crisis? 1
Technical Session 1: Graph Algorithms
- Devavret Makkar, David A. Bader, Oded Green:
Exact and Parallel Triangle Counting in Dynamic Graphs. 2-12 - Humayun Kabir, Kamesh Madduri:
Shared-Memory Graph Truss Decomposition. 13-22 - Ajay Panyala, Omer Subasi, Mahantesh Halappanavar, Ananth Kalyanaraman, Daniel G. Chavarría-Miranda, Sriram Krishnamoorthy:
Approximate Computing Techniques for Iterative Graph Algorithms. 23-32 - Subhadeep Karan, Jaroslaw Zola:
Scalable Exact Parent Sets Identification in Bayesian Networks Learning with Apache Spark. 33-41 - Md. Vasimuddin, Srinivas Aluru:
Parallel Exact Dynamic Bayesian Network Structure Learning with Application to Gene Networks. 42-51 - Thejaka Amila Kanewala, Marcin Zalewski, Andrew Lumsdaine:
Parallel Asynchronous Distributed-Memory Maximal Independent Set Algorithm with Work Ordering. 52-61
Technical Session 2: Architecture and Communication
- Mingzhe Li, Xiaoyi Lu, Hari Subramoni, Dhabaleswar K. Panda:
Designing Registration Caching Free High-Performance MPI Library with Implicit On-Demand Paging (ODP) of InfiniBand. 62-71 - George Michelogiannakis, John Shalf:
Last Level Collective Hardware Prefetching For Data-Parallel Applications. 72-83 - Jahanzeb Maqbool Hashmi, Khaled Hamidouche, Hari Subramoni, Dhabaleswar K. Panda:
Kernel-Assisted Communication Engine for MPI on Emerging Manycore Processors. 84-93 - Bilge Acun, Eun Kyung Lee, Yoonho Park, Laxmikant V. Kalé:
Support for Power Efficient Proactive Cooling Mechanisms. 94-103 - Ayan Palchaudhuri, Anindya Sundar Dhar:
Redundant Arithmetic Based High Speed Carry Free Hybrid Adders with Built-In Scan Chain on FPGAs. 104-113 - Dharanidhar Dang, Jyotikrishna Dass, Rabi N. Mahapatra:
ConvLight: A Convolutional Accelerator with Memristor Integrated Photonic Computing. 114-123
Technical Session 3: Algorithms
- Guy E. Blelloch, Phillip B. Gibbons, Harsha Vardhan Simhadri:
Provably Efficient Scheduling of Dynamically Allocating Programs on Parallel Cache Hierarchies. 124-133 - Michael Orr, Oliver Sinnen:
Further Explorations in State-Space Search for Optimal Task Scheduling. 134-141 - Dineshkumar Rajagopal, Daniele Tafani, Yiannis Georgiou, David Glesser, Michael Ott:
A Novel Approach for Job Scheduling Optimizations Under Power Cap for ARM and Intel HPC Systems. 142-151 - Mulya Agung, Muhammad Alfian Amrizal, Kazuhiko Komatsu, Ryusuke Egawa, Hiroyuki Takizawa:
A Memory Congestion-Aware MPI Process Placement for Modern NUMA Systems. 152-161 - Pooja Aggarwal, Smruti R. Sarangi:
Expander: Lock-Free Cache for a Concurrent Data Structure. 162-171 - Maxime Schmitt, Philippe Helluy, Cédric Bastoul:
Adaptive Code Refinement: A Compiler Technique and Extensions to Generate Self-Tuning Applications. 172-181
Keynote 2
- Rajeev Rastogi:
Machine Learning @ Amazon. 182
Technical Session 4: Big Data, Machine Learning and Optimization
- Sunwoo Lee, Dipendra Jha, Ankit Agrawal, Alok N. Choudhary, Wei-keng Liao:
Parallel Deep Convolutional Neural Network Training by Exploiting the Overlapping of Computation and Communication. 183-192 - Adeesha Wijayasiri, Tania Banerjee, Sanjay Ranka, Sartaj Sahni, Mark S. Schmalz:
Parallel Dynamic Data Driven Approaches for Synthetic Aperture Radar. 193-202 - Jayanth Kalyanasundaram, Yogesh Simmhan:
ARM Wrestling with Big Data: A Study of Commodity ARM64 Server for Big Data Workloads. 203-212 - Shashank Gugnani, Xiaoyi Lu, Franco Pestilli, Cesar F. Caiafa, Dhabaleswar K. Panda:
MPI-LiFE: Designing High-Performance Linear Fascicle Evaluation of Brain Connectome with MPI. 213-222 - Sidharth Kumar, Duong Hoang, Steve Petruzza, John Edwards, Valerio Pascucci:
Reducing Network Congestion and Synchronization Overhead During Aggregation of Hierarchical Data. 223-232 - Jianwei Xiao, Ming Gu, Julien Langou:
Fast Parallel Randomized QR with Column Pivoting Algorithms for Reliable Low-Rank Matrix Approximations. 233-242
Technical Session 5: Graph Algorithms and GPU
- Miyuru Dayarathna, Sathya Bandara, Nandula Jayamaha, Mahen Herath, Achala Madhushan, Sanath Jayasena, Toyotaro Suzumura:
An X10-Based Distributed Streaming Graph Database Engine. 243-252 - Sreeram Potluri, Anshuman Goswami, Davide Rossetti, Chris J. Newburn, Manjunath Gorentla Venkata, Neena Imam:
GPU-Centric Communication on NVIDIA GPU Clusters with InfiniBand: A Case Study with OpenSHMEM. 253-262 - Alind Khare, Vikram Goyal, Srikanth Baride, Sushil K. Prasad, Michael McDermott, Dhara Shah:
Distributed Algorithm for High-Utility Subgraph Pattern Mining Over Big Data Platforms. 263-272 - Kartik Lakhotia, Shreyas G. Singapura, Rajgopal Kannan, Viktor K. Prasanna:
ReCALL: Reordered Cache Aware Locality Based Graph Processing. 273-282 - Süreyya Emre Kurt, Vineeth Thumma, Changwan Hong, Aravind Sukumaran-Rajam, P. Sadayappan:
Characterization of Data Movement Requirements for Sparse Matrix Computations on GPUs. 283-293 - Sangkeun Lee, Sudharshan S. Vazhkudai, Raghul Gunasekaran:
Applying Graph Analytics to Understand Compute Core Usage and Publication Trends in a Petascale Supercomputing Facility. 294-305
Keynote 3
- Ian T. Foster:
Computing Just What You Need: Online Data Analysis and Reduction at Extreme Scales. 306
Technical Session 6: System Software
- Zhihao Jia, Sean Treichler, Galen M. Shipman, Michael Bauer, Noah Watkins, Carlos Maltzahn, Patrick S. McCormick, Alex Aiken:
Integrating External Resources with a Task-Based Programming Model. 307-316 - Edward Chuah, Arshad Jhumka, Samantha Alt, Theodoros Damoulas, Nentawe Gurumdimma, Marie-Christine Sawley, William L. Barth, Tommy Minyard, James C. Browne:
Enabling Dependability-Driven Resource Use and Message Log-Analysis for Cluster System Diagnosis. 317-327 - Changsu Kim, Juhyun Kim, Juwon Kang, Jae W. Lee, Hanjun Kim:
Context-Aware Memory Profiling for Speculative Parallelism. 328-337 - Harenome Razanajato, Cédric Bastoul, Vincent Loechner:
Lifting Barriers Using Parallel Polyhedral Regions. 338-347 - Seyed Hessam Mirsadeghi, Jesper Larsson Träff, Pavan Balaji, Ahmad Afsahi:
Exploiting Common Neighborhoods to Optimize MPI Neighborhood Collectives. 348-357 - Arpith Chacko Jacob, Alexandre E. Eichenberger, Hyojin Sung, Samuel F. Antão, Gheorghe-Teodor Bercea, Carlo Bertolli, Alexey Bataev, Tian Jin, Tong Chen, Zehra Sura, Georgios Rokos, Kevin O'Brien:
Efficient Fork-Join on GPUs Through Warp Specialization. 358-367
Technical Session 7: GPU Frameworks and Applications
- Ajai V. George, Sankar Manoj, Sanket R. Gupte, Sayantan Mitra, Santonu Sarkar:
Thrust++: Extending Thrust Framework for Better Abstraction and Performance. 368-377 - Harshil Shah, Siddharth Kamaria, Riddhesh Markandeya, Miral Shah, Bhaskar Chaudhury:
A Novel Implementation of 2D3V Particle-in-Cell (PIC) Algorithm for Kepler GPU Architecture. 378-387 - Dharma Teja Vooturi, Kishore Kothapalli, Upinder Singh Bhalla:
Parallelizing Hines Matrix Solver in Neuron Simulations on GPU. 388-397 - Esteban Rangel, Nicholas Frontiere, Salman Habib, Katrin Heitmann, Wei-keng Liao, Ankit Agrawal, Alok N. Choudhary:
Building Halo Merger Trees from the Q Continuum Simulation. 398-407 - Andrew Todd, Marziyeh Nourian, Michela Becchi:
A Memory-Efficient GPU Method for Hamming and Levenshtein Distance Similarity. 408-418
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.