Fast Four-Way Parallel Radix Sorting on GPUs
Linh Ha, Jens Kruger, Claudio T. Silva
In Computer Graphics Forum, 28(8), 2009.
Abstract: Efficient sorting is a key requirement for many computer science algorithms. Acceleration of existing techniques as well as developing new sorting approaches is crucial for many real-time graphics scenarios, database systems, and numerical simulations to name just a few. It is one of the most fundamental operations to organize and filter the ever growing massive amounts of data gathered on a daily basis. While optimal sorting models for serial execution on a single processor exist, efficient parallel sorting remains a challenge. In this paper, we present a hardware-optimized parallel implementation of the radix sort algorithm that results in a significant speed up over existing sorting implementations. We outperform all known General Processing Unit (GPU) based sorting systems by about a factor of two and eliminate restrictions on the sorting key space. This makes our algorithm not only the fastest, but also the first general GPU sorting solution.
Keyword(s): I.3.1: Parallel processing on Graphic Processing Units, G.1.0: Parallel algorithms
Article URL: http://dx.doi.org/10.1111/j.1467-8659.2009.01542.x
BibTeX format:
@article{CGF:CGF1542,
  author = {Linh Ha and Jens Kruger and Claudio T. Silva},
  title = {Fast Four-Way Parallel Radix Sorting on GPUs},
  journal = {Computer Graphics Forum},
  volume = {28},
  number = {8},
  pages = {2368--2378},
  year = {2009},
}
Search for more articles by Linh Ha.
Search for more articles by Jens Kruger.
Search for more articles by Claudio T. Silva.

Return to the search page.


graphbib: Powered by "bibsql" and "SQLite3."