Fast sorting for exact OIT of complex scenes
Pyarelal Knowles, Geoff Leach, Fabio Zambetta
In The Visual Computer, 30(6-8), June 2014.
Abstract: Exact order-independent transparency (OIT) techniques capture all fragments during rasterization. The fragments are then sorted per-pixel by depth and composited in order using alpha transparency. The sorting stage is a bottleneck for high depth complexity scenes, taking 70–95% of the total time for those investigated. In this paper, we show that typical shader-based sorting speed is impacted by local memory latency and occupancy. We present and discuss the use of both registers and an external merge sort in register-based block sort to better use the memory hierarchy of the GPU for improved OIT rendering performance. This approach builds upon backwards memory allocation, achieving an OIT rendering speed up to 1.7× that of the best previous method and 6.3× that of the common straight forward OIT implementation. In some cases, the sorting stage is reduced to no longer be the dominant OIT component.
Article URL: http://dx.doi.org/10.1007/s00371-014-0956-z
BibTeX format:
@article{Knowles:2014:FSF,
  author = {Pyarelal Knowles and Geoff Leach and Fabio Zambetta},
  title = {Fast sorting for exact OIT of complex scenes},
  journal = {The Visual Computer},
  volume = {30},
  number = {6-8},
  pages = {603--613},
  month = jun,
  year = {2014},
}
Search for more articles by Pyarelal Knowles.
Search for more articles by Geoff Leach.
Search for more articles by Fabio Zambetta.

Return to the search page.


graphbib: Powered by "bibsql" and "SQLite3."