TY - JOUR
T1 - Implementation of implicit finite element methods for incompressible flows on the CM-5
AU - Kennedy, J. G.
AU - Behr, M.
AU - Kalro, V.
AU - Tezduyar, T. E.
N1 - Funding Information:
This researchw as sponsoredb y NASA-JSC underg rantNAG 9-449,b y NSF under grantsC TS-8796352a ndA SC-9211083b,y ARPA underN IST contrac6t 0NANB2DI272a, ndby ARO under grant DAAH04-93-G-0514P.a rtial supportf or this work has also come from the ARO contractn umber DAAL03-89-C.003w8 ith the AHPCRC at the Universityo f Minnesota. We are indebtedt o Zdenek Johan for helpful commentsa nd providinga ccesst o his CM-5 implementationsb ooft hthe RSB algorithmfo r datad ecompositioann dthe two-stepg athera ndscatter algorithmsW. e are also indebtedto Kapil Mathurf or helpfulc ommentasn d his contributions tthoe two-stepg athera nd scattera lgorithms.
PY - 1994/11
Y1 - 1994/11
N2 - A parallel implementation of an implicit finite element formulation for incompressible fluids on a distributed-memory massively parallel computer is presented. The dominant issue that distinguishes the implementation of finite element problems on distributed-memory computers from that on traditional shared-memory scalar or vector computers is the distribution of data (and hence workload) to the processors and the non-uniform memory hierarchy associated with the processors, particularly the non-uniform costs associated with on-processor and off-processor memory references. Accessing data stored in a remote processor requires computing resources an order of magnitude greater than accessing data locally in a processor. This distribution of data motivates the development of alternatives to traditional algorithms and data structures designed for shared-memory computers, which must now account for distributed-memory architectures. Data structures as well as data decomposition and data communication algorithms designed for distributed-memory computers are presented in the context of high level language constructs from High Performance Fortran. The discussion relies primarily on abstract features of the hardware and software environment and should be applicable, in principle, to a variety of distributed-memory system. The actual implementation is carried out on a Connection Machine CM-5 system with high performance communication functions.
AB - A parallel implementation of an implicit finite element formulation for incompressible fluids on a distributed-memory massively parallel computer is presented. The dominant issue that distinguishes the implementation of finite element problems on distributed-memory computers from that on traditional shared-memory scalar or vector computers is the distribution of data (and hence workload) to the processors and the non-uniform memory hierarchy associated with the processors, particularly the non-uniform costs associated with on-processor and off-processor memory references. Accessing data stored in a remote processor requires computing resources an order of magnitude greater than accessing data locally in a processor. This distribution of data motivates the development of alternatives to traditional algorithms and data structures designed for shared-memory computers, which must now account for distributed-memory architectures. Data structures as well as data decomposition and data communication algorithms designed for distributed-memory computers are presented in the context of high level language constructs from High Performance Fortran. The discussion relies primarily on abstract features of the hardware and software environment and should be applicable, in principle, to a variety of distributed-memory system. The actual implementation is carried out on a Connection Machine CM-5 system with high performance communication functions.
UR - http://www.scopus.com/inward/record.url?scp=0028534248&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0028534248&partnerID=8YFLogxK
U2 - 10.1016/0045-7825(94)00078-6
DO - 10.1016/0045-7825(94)00078-6
M3 - Article
AN - SCOPUS:0028534248
SN - 0374-2830
VL - 119
SP - 95
EP - 111
JO - Computer Methods in Applied Mechanics and Engineering
JF - Computer Methods in Applied Mechanics and Engineering
IS - 1-2
ER -