GPU implementation of Boruvka's algorithm to Euclidean minimum spanning tree based on Elias method
Affiliation auteurs | !!!! Error affiliation !!!! |
Titre | GPU implementation of Boruvka's algorithm to Euclidean minimum spanning tree based on Elias method |
Type de publication | Journal Article |
Year of Publication | 2019 |
Auteurs | Qiao W-bao, Creput J-C |
Journal | APPLIED SOFT COMPUTING |
Volume | 76 |
Pagination | 105-120 |
Date Published | MAR |
Type of Article | Article |
ISSN | 1568-4946 |
Mots-clés | Breadth first search, Data parallel, Decentralized control, EMST, Euclidean minimum spanning tree, GPU parallel BFS, GPU parallel EMST |
Résumé | We present both sequential and data parallel approaches to build hierarchical minimum spanning forest (MSF) or tree (MST) in Euclidean space (EMSF/EMST) for applications whose input N points are uniformly or boundedly distributed in Euclidean space. Each iteration of the sequential approach takes O(N) time complexity through combining Boruvka's algorithm with an improved component-based neighborhood search algorithm, namely sliced spiral search, which is a newly proposed improvement to Bentley's spiral search for finding a component graph's closest outgoing point on the plane. It works based on the uniqueness property in Euclidean space, and allows O(1) time complexity for one search from a query point to find the component's closest outgoing point at different iterations of Boruvka's algorithm. The data parallel approach includes a newly proposed two-direction breadth-first search (BFS) implementation on graphics processing unit (GPU) platform, which is specialized for selecting a spanning tree's shortest outgoing edge. This GPU two-direction parallel BFS enables a tree traversal operation to start from any one of its vertex acting as root. These GPU parallel implementations work by assigning N threads with one thread associated to one input point, one thread occupies O(1) local memory and the whole algorithm occupies O(N) global memory. Experiments are conducted on both uniformly distributed data sets and TSPLIB database. We evaluate computation time of the proposed approaches on more than 80 benchmarks with size N growing up to 106 points on personal laptop. (C) 2018 Elsevier B.V. All rights reserved. |
DOI | 10.1016/j.asoc.2018.10.046 |