GPU-efficient recursive filtering and summed-area tables
Diego Nehab, André Maximo, Rodolfo S. Lima, Hugues Hoppe
In ACM Transactions on Graphics, 30(6), December 2011.
Abstract: Image processing operations like blurring, inverse convolution, and summed-area tables are often computed efficiently as a sequence of 1D recursive filters. While much research has explored parallel recursive filtering, prior techniques do not optimize across the entire filter sequence. Typically, a separate filter (or often a causal-anticausal filter pair) is required in each dimension. Computing these filter passes independently results in significant traffic to global memory, creating a bottleneck in GPU systems. We present a new algorithmic framework for parallel evaluation. It partitions the image into 2D blocks, with a small band of additional data buffered along each block perimeter. We show that these perimeter bands are sufficient to accumulate the effects of the successive filters. A remarkable result is that the image data is read only twice and written just once, independent of image size, and thus total memory bandwidth is reduced even compared to the traditional serial algorithm. We demonstrate significant speedups in GPU computation.
Article URL: http://dx.doi.org/10.1145/2070781.2024210
BibTeX format:
@article{Nehab:2011:GRF,
  author = {Diego Nehab and André Maximo and Rodolfo S. Lima and Hugues Hoppe},
  title = {GPU-efficient recursive filtering and summed-area tables},
  journal = {ACM Transactions on Graphics},
  volume = {30},
  number = {6},
  pages = {176:1--176:12},
  month = dec,
  year = {2011},
}
Search for more articles by Diego Nehab.
Search for more articles by André Maximo.
Search for more articles by Rodolfo S. Lima.
Search for more articles by Hugues Hoppe.

Return to the search page.


graphbib: Powered by "bibsql" and "SQLite3."