Concurrent B+trees have been widely used in many systems from file systems to databases. With the requests expanding exponentially, the systems are facing tremendous performance pressure. GPUs have shown their potential to accelerate the concurrent B+trees operations with their high volume of parallel computing resources and large memory bandwidth. In concurrent B+tree, conflict detection and handling complicates the request processing logic, increases the number of memory accesses and leads to execution path divergence. That leads to performance degradation and increased response time variance. In this paper, we present a GPU concurrent B+tree. To reduce the performance impact introduced by conflict detection, an optimization is proposed to detect and resolve key conflicts before concurrent requests are issued to the execution phase. Then another optimization is used to reduce structure conflicts and concurrent control overheads. Finally, an optimization is used to reduce structure conflicts and improve memory performance by exploiting the locality among requests. Evaluations on a TITAN V GPU show that Eirene is efficient with a throughput of 1.23 billion per second. Compared to a state-of-the-art GPU B+tree system, it can achieve 5.1X speedup and reduce the response time variance from 38% to 4%.
Tue 5 AprDisplayed time zone: Eastern Time (US & Canada) change
14:00 - 15:25 | |||
14:00 5mTalk | POSTER: Automatic Synthesis of Parallel Unix Commands and Pipelines with KumQuat Main Conference Jiasi Shen Massachusetts Institute of Technology, Martin C. Rinard Massachusetts Institute of Technology, Nikos Vasilakis Massachusetts Institute of Technology | ||
14:05 5mTalk | POSTER: Towards OmpSs-2 and OpenACC Interoperation Main Conference Orestis Korakitis Barcelona Supercomputing Center (BSC), Simon Garcia De Gonzalo Barcelona Supercomputing Center (BSC), Nicolas Guidotti INESC-ID, Instituto Superior Técnico, University of Lisbon, João Barreto INESC-ID, José C. Monteiro INESC-ID, Instituto Superior Técnico, University of Lisbon, Antonio J. Peña Barcelona Supercomputing Center (BSC) | ||
14:10 5mTalk | POSTER: LB-HM: Load Balance-Aware Data Placement on Heterogeneous Memory for Task-Parallel HPC Applications Main Conference | ||
14:15 5mTalk | POSTER: Hardening Selective Protection across Multiple Program Inputs for HPC Applications Main Conference Yafan Huang University of Iowa, Shengjian Guo Baidu USA, Sheng Di Argonne National Laboratory, Guanpeng Li University of Iowa, Franck Cappello Argonne National Laboratory | ||
14:20 5mTalk | POSTER: A Parallel Branch-and-Bound Algorithm with History-Based Domination Main Conference Taspon Gonggiatgul California State University, Sacramento, Ghassan Shobaki California State University, Sacramento, Pınar Muyan-Özçelik California State University, Sacramento | ||
14:25 5mTalk | POSTER: Remote OpenMP Offloading Main Conference | ||
14:30 5mTalk | POSTER: High Performance GPU Concurrent B+tree Main Conference Weihua Zhang Fudan University, Chuanlei Zhao Fudan University, Lu Peng Louisiana State University, Yuzhe Lin Fudan University, Fengzhe Zhang Fudan University, Jinhu Jiang Fudan University | ||
14:35 5mTalk | POSTER: The Problem-Based Benchmark Suite (PBBS), V2 Main Conference Daniel Anderson Carnegie Mellon University, Guy E. Blelloch Carnegie Mellon University, USA, Laxman Dhulipala University of Maryland, College Park, Magdalen Dobson Carnegie Mellon University, Yihan Sun University of California, Riverside | ||
14:40 5mTalk | POSTER: An LLVM-based Open-Source Compiler for NVIDIA GPUs Main Conference Da Yan Hong Kong University of Science and Technology, Wei Wang Hong Kong University of Science and Technology, Xiaowen Chu Data Science and Analytics Thrust, HKUST(GZ) | ||
14:45 5mTalk | POSTER: ParGeo: A Library for Parallel Computational Geometry Main Conference Yiqiu Wang Massachusetts Institute of Technology, Shangdi Yu Massachusetts Institute of Technology, Laxman Dhulipala University of Maryland, College Park, Yan Gu UC Riverside, Julian Shun MIT | ||
14:50 5mTalk | POSTER: Parallel Algorithms for Masked Sparse Matrix-Matrix Products Main Conference Srđan Milaković Rice University, Oguz Selvitopi Lawrence Berkeley National Laboratory, Israt Nisa AWS AI, Zoran Budimlić Rice University, Aydin Buluc Lawrence Berkeley National Laboratory | ||
14:55 5mTalk | POSTER: Rethinking Graph Data Placement for Graph Neural Network Training on Multiple GPUs Main Conference | ||
15:00 5mTalk | POSTER: Optimizing Consistency for Partially Replicated Data Stores Main Conference Ivan Kuraj MIT CSAIL, USA, Armando Solar-Lezama Massachusetts Institute of Technology, Nadia Polikarpova University of California at San Diego | ||
15:05 5mTalk | POSTER: Optimizing Sparse Computations Jointly Main Conference Kazem Cheshmi University of Toronto, Michelle Strout University of Arizona, Maryam Mehri Dehnavi University of Toronto | ||
15:10 5mTalk | POSTER: wCQ: A Fast Wait-Free Queue with Bounded Memory Usage Main Conference | ||
15:15 5mTalk | POSTER: Automatic Differentiation of Parallel Loops with Formal Methods Main Conference | ||
15:20 5mTalk | POSTER: A W-cycle Algorithm for Efficient Batched SVD on GPUs Main Conference Junmin Xiao Institute of Computing Technology of Chinese Academy of Sciences, Qing Xue Institute of Computing Technology, Chinese Academy of Sciences, Hui Ma Institute of Computing Technology, Chinese Academy of Sciences, Xiaoyang Zhang Institute of Computing Technology, Chinese Academy of Sciences, Guangming Tan Chinese Academy of Sciences(CAS) |