Application scientists are interested in another metric: the time it takes to solve the system. Comparing several different partitioning algorithms on matrices derived from irregular grid PDE problems, results indicate that the number of EC is a poor predictor of solver run-time. This is because current graph partitioning algorithms are not tailored for the data distribution problem; the linear system solver, preconditioner, matrix data structures, and computational environment are generally ignored.
A simple simulation-based approximate execution time (AET) metric that estimates the performance of a linear system solver based on combining cost estimates of the solver's computational, communications, and synchronization kernels is presented. Simulation of kernel performance provides a comparatively easy way to model solvers and preconditioners, while giving reasonably accurate estimates in a reasonable amount of simulator execution time. AET estimates have a relative error of 13.1% or less compared to the actual time per iteration of the HMPE linear system solver, and typically is within 5%. A data distribution algorithm is proposed using the AET metric as its cost function.