NUMA-aware image compositing on multi-GPU platform
Pan Wang, Zhiquan Cheng, Ralph Martin, Huahai Liu, Xun Cai, Sikun Li
In The Visual Computer, 29(6--8), June 2013.
Abstract: Sort-last parallel rendering is widely used. Recent GPU developments mean that a PC equipped with multiple GPUs is a viable alternative to a high-cost supercomputer: the Fermi architecture of a single GPU supports uniform virtual addressing, providing a foundation for non-uniform memory access (NUMA) on multi-GPU platforms. Such hardware changes require the user to reconsider the parallel rendering algorithms. In this paper, we thoroughly investigate the NUMA-aware image compositing problem, which is the key final stage in sort-last parallel rendering. Based on a proven radix-k strategy, we find one optimal compositing algorithm, which takes advantage of NUMA architecture on the multi-GPU platform. We quantitatively analyze different image compositing modes for practical image compositing, taking into account peer-to-peer communication costs between GPUs. Our experiments on various datasets show that our image compositing method is very fast, an image of a few megapixels can be composited in about 10 ms by eight GPUs.
Article URL: http://dx.doi.org/10.1007/s00371-013-0803-7
BibTeX format:
@article{Wang:2013:NIC,
  author = {Pan Wang and Zhiquan Cheng and Ralph Martin and Huahai Liu and Xun Cai and Sikun Li},
  title = {NUMA-aware image compositing on multi-GPU platform},
  journal = {The Visual Computer},
  volume = {29},
  number = {6--8},
  pages = {639--649},
  month = jun,
  year = {2013},
}
Search for more articles by Pan Wang.
Search for more articles by Zhiquan Cheng.
Search for more articles by Ralph Martin.
Search for more articles by Huahai Liu.
Search for more articles by Xun Cai.
Search for more articles by Sikun Li.

Return to the search page.


graphbib: Powered by "bibsql" and "SQLite3."