Massive 2-opt and 3-opt Moves with High Performance GPU Local Search to Large-Scale Traveling Salesman Problem

Affiliation auteurs!!!! Error affiliation !!!!
TitreMassive 2-opt and 3-opt Moves with High Performance GPU Local Search to Large-Scale Traveling Salesman Problem
Type de publicationConference Paper
Year of Publication2019
AuteursQiao W-bao, Creput J-C
EditorBattiti R, Brunato M, Kotsireas I, Pardalos PM
Conference NameLEARNING AND INTELLIGENT OPTIMIZATION, LION 12
PublisherUniv Florida, Ctr Appl Optimizat; Wilfrid Laurier Univ, Comp Algebra Res Grp Lab; Inst Advancement Phys & Math
Conference LocationGEWERBESTRASSE 11, CHAM, CH-6330, SWITZERLAND
ISBN Number978-3-030-05348-2; 978-3-030-05347-5
Mots-clésMassive 2-opt moves, Optimization High performance GPU computing, Parallel 2-opt
Résumé

2-opt, 3-opt or k-opt heuristics are classical local search algorithms for traveling salesman problems (TSP) in combinatorial optimization area. This paper introduces a judicious decision making methodology of offloading which part of the k-opt heuristic works in parallel on Graphics Processing Unit (GPU) while which part remains sequential, called ``multiple k-opt evaluation, multiple k-opt moves'', in order to simultaneously execute, without interference, massive 2-/3-opt moves that are globally found on the same TSP tour or the same Euclidean space for many edges, as well as keep high performance for GPU massive k-opt evaluation. We prove the methodology is judicious and valuable because of our originally proposed sequential non-interacted 2-/3-exchange set partition algorithm taking linear time complexity and a new TSP tour representation, array of ordered coordinates-index, in order unveil how to use GPU on-chip shared memory to achieve the same goal as using doubly linked list and array of ordered coordinates for parallel k-opt implementation. We test this methodology on 22 national TSP instances with up to 71009 cities and with brute initial tour solution. Average maximum 997 non-interacted 2-opt moves are found and executed on the same tour of ch71009.tsp instance in one iteration of our proposed method. Experimental comparisons show that our proposed methodology gets huge acceleration over both classical sequential and a possible current fastest state-of-the-art GPU parallel 2-opt implementation.

DOI10.1007/978-3-030-05348-2_8