An efficient higher order accurate parallel algorithm for aeroacoustic applications

Thomas S. Chyczewski, Lyle Norman Long

Research output: Contribution to conferencePaper

3 Citations (Scopus)

Abstract

In this paper the problems associated with achieving good performance from a Computational Aeroacoustics (CAA) code on the CM-5 (in an entirely data parallel approach) are addressed. The CAA algorithm requires solving the full 3-D Navier Stokes equations using high order spatial and temporal differencing and nonreflecting boundary conditions, among other things, to preserve the integrity of the acoustic wave solution. The spatial differencing is accomplished with sixth order central differences. A fourth order Runge-Kutta scheme is used for time advancement. Each grid point needs data from eighteen neighboring points to perform the spatia! differencing. Near the boundaries biased stencils are required to maintain high order accuracy. These large, varied stencils present two problems from a parallel processing point of view. One is the large amount of data that has to be communicated and the associated communication time. The other is the large number of stencils required to maintain high accuracy near the boundaries. Specialized treatment of each of the different stencils would provide for a very poorly load balanced (and consequently inefficient) code. The communication time overhead is significantly decreased by organizing the data into blocks and communicating on a block basis instead of an element basis. The specialized boundary treatment problem is all but eliminated by using generalized global stencil arrays so that for a given physical boundary condition all points can be evaluated in parallel. Results indicate a significant performance improvement over a code that doesn’t use the methodologies proposed here. These results are presented in the form of a detailed timing breakdown Per iteration.

Original languageEnglish (US)
StatePublished - Jan 1 1994
EventAIAA Fluid Dynamics Conference, 1994 - Colorado Springs, United States
Duration: Jun 20 1994Jun 23 1994

Other

OtherAIAA Fluid Dynamics Conference, 1994
CountryUnited States
CityColorado Springs
Period6/20/946/23/94

Fingerprint

Computational aeroacoustics
Aeroacoustics
Parallel algorithms
Boundary conditions
Communication
Navier Stokes equations
Acoustic waves
Processing

All Science Journal Classification (ASJC) codes

  • Aerospace Engineering
  • Engineering (miscellaneous)

Cite this

Chyczewski, T. S., & Long, L. N. (1994). An efficient higher order accurate parallel algorithm for aeroacoustic applications. Paper presented at AIAA Fluid Dynamics Conference, 1994, Colorado Springs, United States.
Chyczewski, Thomas S. ; Long, Lyle Norman. / An efficient higher order accurate parallel algorithm for aeroacoustic applications. Paper presented at AIAA Fluid Dynamics Conference, 1994, Colorado Springs, United States.
@conference{c20a2cddf9e2420797aec9184c0f648f,
title = "An efficient higher order accurate parallel algorithm for aeroacoustic applications",
abstract = "In this paper the problems associated with achieving good performance from a Computational Aeroacoustics (CAA) code on the CM-5 (in an entirely data parallel approach) are addressed. The CAA algorithm requires solving the full 3-D Navier Stokes equations using high order spatial and temporal differencing and nonreflecting boundary conditions, among other things, to preserve the integrity of the acoustic wave solution. The spatial differencing is accomplished with sixth order central differences. A fourth order Runge-Kutta scheme is used for time advancement. Each grid point needs data from eighteen neighboring points to perform the spatia! differencing. Near the boundaries biased stencils are required to maintain high order accuracy. These large, varied stencils present two problems from a parallel processing point of view. One is the large amount of data that has to be communicated and the associated communication time. The other is the large number of stencils required to maintain high accuracy near the boundaries. Specialized treatment of each of the different stencils would provide for a very poorly load balanced (and consequently inefficient) code. The communication time overhead is significantly decreased by organizing the data into blocks and communicating on a block basis instead of an element basis. The specialized boundary treatment problem is all but eliminated by using generalized global stencil arrays so that for a given physical boundary condition all points can be evaluated in parallel. Results indicate a significant performance improvement over a code that doesn’t use the methodologies proposed here. These results are presented in the form of a detailed timing breakdown Per iteration.",
author = "Chyczewski, {Thomas S.} and Long, {Lyle Norman}",
year = "1994",
month = "1",
day = "1",
language = "English (US)",
note = "AIAA Fluid Dynamics Conference, 1994 ; Conference date: 20-06-1994 Through 23-06-1994",

}

Chyczewski, TS & Long, LN 1994, 'An efficient higher order accurate parallel algorithm for aeroacoustic applications' Paper presented at AIAA Fluid Dynamics Conference, 1994, Colorado Springs, United States, 6/20/94 - 6/23/94, .

An efficient higher order accurate parallel algorithm for aeroacoustic applications. / Chyczewski, Thomas S.; Long, Lyle Norman.

1994. Paper presented at AIAA Fluid Dynamics Conference, 1994, Colorado Springs, United States.

Research output: Contribution to conferencePaper

TY - CONF

T1 - An efficient higher order accurate parallel algorithm for aeroacoustic applications

AU - Chyczewski, Thomas S.

AU - Long, Lyle Norman

PY - 1994/1/1

Y1 - 1994/1/1

N2 - In this paper the problems associated with achieving good performance from a Computational Aeroacoustics (CAA) code on the CM-5 (in an entirely data parallel approach) are addressed. The CAA algorithm requires solving the full 3-D Navier Stokes equations using high order spatial and temporal differencing and nonreflecting boundary conditions, among other things, to preserve the integrity of the acoustic wave solution. The spatial differencing is accomplished with sixth order central differences. A fourth order Runge-Kutta scheme is used for time advancement. Each grid point needs data from eighteen neighboring points to perform the spatia! differencing. Near the boundaries biased stencils are required to maintain high order accuracy. These large, varied stencils present two problems from a parallel processing point of view. One is the large amount of data that has to be communicated and the associated communication time. The other is the large number of stencils required to maintain high accuracy near the boundaries. Specialized treatment of each of the different stencils would provide for a very poorly load balanced (and consequently inefficient) code. The communication time overhead is significantly decreased by organizing the data into blocks and communicating on a block basis instead of an element basis. The specialized boundary treatment problem is all but eliminated by using generalized global stencil arrays so that for a given physical boundary condition all points can be evaluated in parallel. Results indicate a significant performance improvement over a code that doesn’t use the methodologies proposed here. These results are presented in the form of a detailed timing breakdown Per iteration.

AB - In this paper the problems associated with achieving good performance from a Computational Aeroacoustics (CAA) code on the CM-5 (in an entirely data parallel approach) are addressed. The CAA algorithm requires solving the full 3-D Navier Stokes equations using high order spatial and temporal differencing and nonreflecting boundary conditions, among other things, to preserve the integrity of the acoustic wave solution. The spatial differencing is accomplished with sixth order central differences. A fourth order Runge-Kutta scheme is used for time advancement. Each grid point needs data from eighteen neighboring points to perform the spatia! differencing. Near the boundaries biased stencils are required to maintain high order accuracy. These large, varied stencils present two problems from a parallel processing point of view. One is the large amount of data that has to be communicated and the associated communication time. The other is the large number of stencils required to maintain high accuracy near the boundaries. Specialized treatment of each of the different stencils would provide for a very poorly load balanced (and consequently inefficient) code. The communication time overhead is significantly decreased by organizing the data into blocks and communicating on a block basis instead of an element basis. The specialized boundary treatment problem is all but eliminated by using generalized global stencil arrays so that for a given physical boundary condition all points can be evaluated in parallel. Results indicate a significant performance improvement over a code that doesn’t use the methodologies proposed here. These results are presented in the form of a detailed timing breakdown Per iteration.

UR - http://www.scopus.com/inward/record.url?scp=84997830957&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84997830957&partnerID=8YFLogxK

M3 - Paper

AN - SCOPUS:84997830957

ER -

Chyczewski TS, Long LN. An efficient higher order accurate parallel algorithm for aeroacoustic applications. 1994. Paper presented at AIAA Fluid Dynamics Conference, 1994, Colorado Springs, United States.