Publication Type |
pre-print |
School or College |
College of Engineering |
Department |
Computing, School of |
Creator |
Kirby, Robert Michael |
Other Author |
King, James |
Title |
A scalable, efficient scheme for evaluation of stencil computations over unstructured meshes |
Date |
2013-01-01 |
Description |
Stencil computations are a common class of operations that appear in many computational scientific and engineering applications. Stencil computations often benefit from compile-time analysis, exploiting data-locality, and parallelism. Post-processing of discontinuous Galerkin (dG) simulation solutions with B-spline kernels is an example of a numerical method which requires evaluating computationally intensive stencil operations over a mesh. Previous work on stencil computations has focused on structured meshes, while giving little attention to unstructured meshes. Performing stencil operations over an unstructured mesh requires sampling of heterogeneous elements which often leads to inefficient memory access patterns and limits data locality/reuse. In this paper, we present an efficient method for performing stencil computations over unstructured meshes which increases data-locality and cache efficiency, and a scalable approach for stencil tiling and concurrent execution. We provide experimental results in the context of post-processing of dG solutions that demonstrate the effectiveness of our approach. |
Type |
Text |
Publisher |
Institute of Electrical and Electronics Engineers (IEEE) |
Issue |
79 |
Language |
eng |
Bibliographic Citation |
King, J., & Kirby, R. M. (2013). A scalable, efficient scheme for evaluation of stencil computations over unstructured meshes. International Conference for High Performance Computing, Networking, no. 79. |
Rights Management |
(c) 2013 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE. |
Format Medium |
application/pdf |
Format Extent |
724,181 bytes |
Identifier |
uspace,18704 |
ARK |
ark:/87278/s6tj1wrg |
Setname |
ir_uspace |
ID |
712579 |
Reference URL |
https://collections.lib.utah.edu/ark:/87278/s6tj1wrg |