Parallel implementation of compact numerical schemes

Published Date:2001
Details:

Personal Authors:

Corporate Authors:National Centers for Environmental Prediction (U.S.) ; Weather Research Program (U.S.) ; United States, Federal Aviation Administration, ; ... More ▼

Keywords:

Series:Office note (National Centers for Environmental Prediction (U.S.)) ; 434

Document Type:

Description:The formulation of compact schemes in a parallel environment is described in detail for the case of differencing. Compact numerical schemes are spatially implicit methods for obtaining spatial derivatives and partial integrations, interpolating from one grid to its staggered counterpart, and so on, where the target results occupy a regular grid as dense as the regular grid of source data. Among the numerical schemes to perform such operations at a given formal order of accuracy, the compact forms usually yield the smallest principal truncation error coefficients and, when properly coded for a sharedmemory machine, require essentially minimal extra computational cost. However, the inherently recursive nature of the application of compact schemes poses a problem for their efficient implementation on distributedmemory machines, which we set out to address in this note. Generally, a compact scheme is expressed for an entire line of data simultaneously by a linear system characterized by banded matrices on both sides of the equation. The solution involves a preliminary "LDU" decomposition of the system matrix on the lefthand side of the equation, where L and U are lower and upper triangular banded matrices with unit main diagonals, D is a diagonal matrix. For the common case in which the original system matrix is symmetric, factors L and U are the transposes of each other and the decomposition is a modified Cholesky factorization. The components of the resulting factors provide the coefficients for a pair of onesided recursions required by the subsequent application of the scheme to lines of data. For unbounded (effectively infinite or cyclic) lines of data on a uniform grid, the recursions simplify to the extent of having constant coefficients and it is convenient to retain this simplicity, by appropriate manipulations of the scheme's end conditions, even when finite boundaries intrude. We show, using three different implementation strategies, that each such constantcoefficient recursion can then be carried out in a reasonably efficient manner across several processors of a distributedmemory computer although the cost now becomes significantly greater than the cost of applying the conventional difference scheme of the corresponding order of accuracy. Nevertheless, the compact schemes remain viable within a massively parallel atmospheric model where highorder numerics are desired. Spatial filters with recursive components, such as the various Butterworth filters, involve similar numerical considerations and, as we show, may be handled in a parallel computing environment in the same way.

Supporting Files:No Additional Files
You May Also Like: