Skip to main navigation Skip to search Skip to main content

An improved implementation of parallel selection on GPUs

  • Cleveland State University
  • Department of Computer Science

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

The computing power of current Graphical Processing Units (GPUs) has increased rapidly over the years. They offer much more computational power than recent CPUs by providing a vast number of simple, data parallel, multithreaded cores. In this paper, we proposed an improved implementation of parallel selection and compare the performance of different parallel selection algorithms on the current generation of NVIDIA GPUs. That is, given a massively large array of elements, we were interested in how we could use a GPU to efficiently select those elements that meet certain criteria and then store them into a target array for further processing. The optimization techniques used and implementation issues encountered are discussed in detail. Furthermore, the experimental results show that our advanced implementation performs an average of 2.88 times faster than Thrust, an open-source parallel algorithms library.
Original languageEnglish
Title of host publicationProceedings of the IASTED International Symposium on Software Engineering and Applications, SEA 2015
Place of Publicationusa
PublisherActa [email protected]
Pages63-69
Number of pages7
ISBN (Electronic)9780889869776
DOIs
StatePublished - Jan 1 2015
Event2015 IASTED International Symposium on Software Engineering and Applications, SEA 2015 - Marina del Rey, United States
Duration: Oct 26 2015Oct 27 2015

Conference

Conference2015 IASTED International Symposium on Software Engineering and Applications, SEA 2015
Country/TerritoryUnited States
CityMarina del Rey
Period10/26/1510/27/15

Keywords

  • CUDA
  • GPU
  • Optimization Techniques
  • Parallel Selection
  • SIMT
  • Thrust Library

Cite this