A Newton-Krylov Solver for the Euler Equat ions on ......braster of AppM Science Graduate Department of Aerospace Science and Engineering University of Toronto 2001 A Newton-Krylov

A Newton-Krylov Solver for the Euler Equat ions on Unstructured Grids

Edward Wehner

A thesis subrnitted in conformity with the requirements for the degree of Master of Applied Science

Graduate Department of Aerospace Science and Engineering University of Toronto

@ Copyright by Edward Wehner 2001

BiWiithéaue nationale du Canada

Aquisitions and AcquSions et Bibliogaphii Semices senrices bibliographiques

The author has granted a non- L'auteur a accordé me licence non exclusive licence aliowing the exclusive permettant à la National Library of Canada to Bibliothèque nationale du Canada de reproduce, loan, distn'bute or sell reproduire. prêter, distri'buer ou copies of this thesis in mkroform, veridre des copies de cette thèse sous paper or electronic formats. la forme de microfiche/f'ilm, de

reproduction sur papier ou sur format électronique.

The author retains ownership of the L'auteur conserve la propriété du copyright in this thesis. Neither the droit d'auteur qui protège cette thèse. thesis nor substantial extracts fiom it Ni la thèse ni des extraits substantiels may be printed or othemke de celle-ci ne doivent être imprimés reproduced without the author's ou autrement reproduits sans son permission. autorisation.

Abstrad

A Newton-Krylov Solver for the

Euler Equations on Unstructureci Grids

Edward Wehner

braster of A p p M Science

Graduate Department of Aerospace Science and Engineering

University of Toronto

2001

A Newton-Krylov solver for the Euler equations that govern the flow of a compressible invis-

cid flow, with application to Bows over aerodynamic conûgwations, is presented. The Euler

equations and a nonhear artifiicial dissipation scheme are discretid in space using a node

based filnite-volume formulation on arbitrary polygonal unstructured Pr;&. The resulting

systern of nonlinear algebraic equations is solved using an inexact Newton strategy in which

the Linear prablern wising at each Newton iteration is solved iterativdy using the matrix-

free GMRES algorit hm preconditiond by an apprmimate Jacobian matrix subjected to an

incompIete factorisation and reordering. implicit Euler IocaI tirne stepping is applied when

solving transonic flows.

The solver is tested with single and multi-element aidoils in the subsonic and

transonic fhw regimes. Various GMRES tolerances, preconditioner fiu-in leveis, and pre

condit ioned tderances are investigated. Sets of optimum base parame ters are determineci.

Results show the solver to be cornpetitive with other solvers. Recommendations are made

for aspects of the solver that are worthy of further study.

Acknowledgement s

1 wish to thank my supervisor, Prof. D. W. Zingg, for his knowfedge, guidance and support

during my research. Eiis approachabiity and respect for self-initiative were always conducive

to a cornfortable cesearch atmosphere.

1 am gratefd to my coiieagues Luis Mamano, Jason Lassatine, and Peter Blaser,

with whom it was always a pleasure to coiiaborate on the Newton-Krylov project of the

UTIAS CFD group, and Todd Chishoim and Marian Nemec, who provided helpfd insights

into the Newton-Krylov topic.

Words cannot express my gratitude to my mother for everything a motber does

for her son.

The Government of Ontario / Victoria Noakes Graduate Scholarship in Science

and Technology, and the Graduate Scholarship / Loan provided by the Consejo Nacional de

Ciencia y Tecnologia (CONACYT) of Mexico are &O appreciated.

Contents

List of Figures Uc

List of Tables xi

List of Symbols and Abbreviations xiii

1 Introduction 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1.1 Motivation 1

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1.2 Objective 3 1.3 Outiine . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3

2 Equation Formulation 5 2.1 The Euler Equations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2.2 Spatial Discretisation 6

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2.3 Artsciai Dissipation 8

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2.4 Bomdary Conditions 9 . . . . . . . . . . . . . . . . . . . . . . . . . 2.4.1 Body Surface Bowdary 10

. . . . . . . . . . . . . . . . . . . . . . . . . . . . 2.4.2 Far-field Boundary 11

3 Algorithm Formulation 13 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3.1 The Residuai b c t i o n 13

. . . . . . . . . . . . . . . . . . . . . . . . . . . 3.2 An Inexact Newton Method 14 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3.3 Matrix-free GMRES 15 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3.4 L U Preconditioning 16

. . . . . . . . . . . . . . . . . . . 3.4.1 The Approxirnate Jacobian Matrix 17 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3.5 RCM Reordering 19 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3.6 Startup Strategy 19

4 Results ai . . . . . . . . . . . . . . . . . . . . . . . . . . . 4.1 Singldement Airfoil Cases 22

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4.1.1 Subsonic FIow 23

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4.1.2 Transonic Flow 27 . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4.2 Multielement Airfoii Case 31

5 Conclusions

Appendices

A Aigorithm Flow Diagram

B Examples of the Jacobians for

References

List of Figures

A control volume . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . A representative interior ceU . . . . . . . . . . . . . . . . . . . . . . . . . . . A representative boundary ceii . . . . . . . . . . . . . . . . . . . . . . . . .

Structure of the approximate Jacubian rnatrix (3.63 1.node grid) . . . . . . . Approximate Jacobian matrix with RCM reordering (3.63 1.node grid) . . . . 3,631 and 14.19 h o d e triangui ai. grids about NACA 0012 airfoii . . . . . . . Mach number contours and Cp plot (Mm = 0.63. a = 2.0'). . . . . . . . . . Convergence history using base parameters (Mm = 0.63. a = 2.0.). . . . . . Cornparison of a d u e s (Ma = 0.63. a = 2.0'). . . . . . . . . . . . . . . . . Cornparison of GMmS tolerances. G (Mm = 0.63. a = 2.0.). . . . . . . . . Cornparison of LLU @) Ni leveh (Mm = 0.63. a = 2.0'). . . . . . . . . . . . Cornparison of preconditioner fceezing tolerances. P (M, = 0.63, a = 2.0"). O t her single-eIement subsonic cases . . . . . . . . . . . . . . . . . . . . . . . Mach number contours and Cp plot (Mm = 0.8. a = 1.0'). . . . . . . . . . . Convergence history using base parameters (Mm = 0.8. a = 1.0.). . . . . . Cornparison of a values (Mm = 0.8. a = 1.0'). . . . . . . . . . . . . . . . . Compacison of GMRES tolerances. G (Mm = 0.8. a = 1.0.). . . . . . . . . Cornparison of [LU@) f i l levels (M, = 0.8. a = 1.0.). . . . . . . . . . . . . Cornparison of preconditioner &ezing tolerances. P (Mm = 0.8, a = 1.0.). O t her singleelement transonic cases . . . . . . . . . . . . . . . . . . . . . . . 17.89 4node triangular grid about multi-element airfoii . . . . . . . . . . . . . .

4.17 Mach number contours and Cp plot (Mm = 0.2, a = 5.0"). . . . . . . . . . . 4.18 Convergence Mary using base parameters (M, = 0.2. a = 5.0.). . . . . . 4.19 Cornparison of a values (M, = 0.2. a = 5.0.). . . . . . . . . . . . . . . . . 4.20 Cornparison of GMRES tolerances. G [M, = 0.2, a = 5.0"). . . . . . . . . 4.21 Cornparison of IL&) fiil IeveIs (M, = 0.2. a = 5.0"). . . . . . . . . . . . . 4.22 Cornparison of prmnditioner k s i n g tolerances. P (M, = 0.2, a = 5.0").

List of Tables

. . . . . . . . . . . . . . . . 2.1 Far-field d o w and outflow boundary conditions 12

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4.1 Suite of test cases 21 . . . . . . . . . . . . . . . . . 4.2 Base parameters for single-element airfoi1 cases 23

4.3 Drag, lift and moment coefficients (hla = 0.63, cr = 2.0"). . . . . . . . . . . 23 4.4 Drag. liA and moment coeficients (Mm = 0.8. a = 1.0"). . . . . . . . . . . . 28

. . . . . . . . . . . . . 4.5 Base parameters for muiti-element high-lift airfoi1 case 31 4.6 Drag, Lift and moment coefficients (Mm = 0.2. a = 5.0"). . . . . . . . . . . . 33

List of Symbols and Abbreviations

Generic matrix in hear system of equations Drag coefficient Lift coefficient Moment coefficient Pressure coeEcient Flux tensor GMRES tolerance Totai enthaipy Krylov subspace of dimension m Mach number Preconditioner tolerance Vector of conservecl variables Global solution vector Residual h c t i o n Density residuai Riemann invariants Interior scheme portion of residud h c t i o n Boundary scheme portion of residuaI hinction Flow velocity Normal and tangentid flow velocity components

Speed of sound Generic I&-hand side vector in hear system of equations Total energy per unit mass Tme step Unit vecton in the x and y directions GMRES iteration number Newton iteration number Local normal vector Normal vector components m the x and y directions Summation iimit for cell edges Summation b i t for boundary celi faces

Pressure Level of E U W-in Residuai at the mth GMRES iteration Position vector Local tangent vector Tangent vector components in the x and y directions VelociS component in the x and y directions Generic solution vector in linear system of equatioas

Angle of at ta& Perturbation constant for matrix-h.ee GMRES Second and fourth-dXerence art Scia1 dissipation switches SpecXc heat ratio of gas Second and fourth-difference artScid dissipation controt constants Art Sciai dissipation scaling factor Vector of matrix-kee GMRES vector product Surface area Surface contour Density Fourth-dXérence artificial dissipation control constant

1 Identity matrix JE, 3.4 Exact and appraximate Jacobian matrices

b Subscript to indicate a boundary value e Symbol and s u E . p t to indicate an extrapolated d u e it k Subscripts for arbitrary celIs m Subscript to indicate a boundary uode 00 Symbd and subscript to indicate freestream condition - Superscript t O indicate non-dimensionalised variable O Subscript to indicate stagnation condition

CFD CFL CPU GMRES ILU PETSc RCM RHS

Computatiod FIuid Dynamics Couraut-fiecirich-Lewy (number) Central Processing Unit Generalized Minimum Residual (algorith) Incomplete Lower-Upptx (factorisation) Portable, ExtensibIe Toolkit for ScientSc Computation Reverse Cuthill-McKee (morde-) Right-Band Side

Chapter 1

Introduction

1. i Motivation

A principal interest in the field of Computationai Fluid Dynamics (CFD) is the

developrnent of efficient solution methods for the Navier-Stokes equations, which mode1 the

behaviour of fluid Bows. This interest is strongly motivated by the application of such

methods in the aerodynamic design cycle.

Newton-Krylov solvers are a family of implicit solvers that apply quasi-Newton

methods to solve the Navier-Stokes equations. As with other solvers, they are usuaIly f i t

deveioped for the Euler equations, since, without friction and thermal conductivity terms,

the Euler equations are simpler than the Navier-Stokes equations but retain the non-iinear

behaviour and much of the physical signiscance of the superset equations. Some Newton-

Krylov solven use structureci grids to ttansform phpicd space into computational -, whiie othen use unstructured grids. Structured grids are simpler, but unstructured grids

conform better to complex geometries and are easier to adapt and refine as needed for

increased accufacy.

Application of an exact Newton method to solve large systems of partid Mer-

entiai equations is impracticai due to the memory and CPU resources required- in their

work of 1982, Dembo, Eisenstat, and Steihaug [l] showed that, if properly implemented,

inexact-Newton methods can save on these resources considerably by not perfoming each

Newton iteration exactly, while sti.1 offering quadratic convergence. Since then, a variety of

independent efforts have used inexact Newton methods in CFD. Vanka [21 deveioped New-

ton solvers for the Navier-Stokes equatiom. Venkatakrishnan [3], Orkwis [4j, and Whitfieid

2 Chapter 1. Introduction

and Taylor [SI applied Newton-like methods to solve transonic flows. These efforts used

direct methods or stationary iterative methods to solve the linear probIem that arises at

each Newton iteration, and expiicitly formed the Jacobian matrix,

The replacement of direct methods with Krylov iterative methods, and of explicit

Jacobian matrix formation with rnatrix-fiee formulations, origiaates in works by Gear and

Saad [6], Chan and Jackson [?], B r m and Hindmarsh [8], and Brown and Saad [9]. It may

have been in this last work that the term "Newton-Krylov" appeared for the first t h e . The

first use in CFD of the Generalized Minimum Residual (GMRES) [IO] Krylov method can be

traced to Wigton, Yu, and Young [ll], and Johan, Hughes, and Shakib (12, 131. Venkatakr-

ishnan and Mavriplis (141 showed Newton-Krylov methods with Lncomplete Upper-Lower

(ILU) factorisation preconditioning to be cornpetitive with rnultigrid methods. Keyes [15]

showed the same with a matrix-free formulation. Tidriri (161 studied preconditioning tech-

niques for matrïx-free Newton-Krylov methods. Other works that have investigated partic-

ular aspects of rnatr ix-h inexact Newton-Krylov solvers include [L?, 18, 19. 20,21,22,23].

The immediate predecessors of the present thesis, within the University of Toronto

[nstitute for Aerospace Studies (UTLAS) CFD group, are two inexact Newton-Kqylov solvers

written by Blanco (24, 251, and by Pueyo [26, 271. The work of Blanco aolves the Euler

equations and is based on unstructured triangular grids using a node-based, centred finite

volume discretisation, wMe the work of Pueyo solves both the Euler and Navier-Stokes

equations and is based on Stni~tUred "Cn grids using second-order centred-dinerence oper-

aton. These works appIy second and fourth-ciifference scalar artificial dissipation, and use

the GMRES algorithm, preconditioned by the Jacobian matrix subjected to an ILU factori-

sation and Reverse Cuthiii-McKee (RCM) reordering, to solve the iinear system that arises

at each Newton iteration. Blanco has tested both matrix-present and matrix-free GMRES,

wiiile Pueyo has concentrateci on a mtarted version of the latter. In th& use of rnatrix-

free GMRES, both reseacchers have used an approximation to the Jacobian mat& for the

preconditioner- In order to overcome the instabilities that can develop during the iimt iter-

ations of transanic cases, as a start-up strategy BIanco has applied matrix-present GMRES

with a îust-order Jacobian, while Pueyo has used the approximately factored algorithm of

ARC2D in diagonal form with two IeveIs of mesh sequmcing.

Section 1.2. Objective

1.2 Objective

The objective of this thesis projet is to develop an efiicient twdimensionai

Newton-Krylov solver for the Euler equations, with application to subsonic and transonic

Bows over aerodynamic configurations. The solver uses a node-based fite-volume formu-

iation on arbitrary polygonal unstructureci grids. The solver applies second and fourth-

ciifference scalar artificiai dissipation, and resolves the h e a r system that arises at each

Newton iteration using matrix-Etee GMRES with an LU-factorised, RCM-reordered a p

proximate Jacobian preconditioner. An implicit Euler local time step strategy is used for

the start-up of transonic cases. Tests with single and multi-element airfoih are run to

ver@ the proper functioning of the solver and determine the parameters that optimise its

performance.

1.3 Outline

This document is divided into five chapten. Foliowing on the introduction of the

first chapter, the second chapter describes the Euler equations and their spatial discretisa-

tion, including artiûcial dissipation and boundary conditions. The third chapter sets forth

the methodology appiied to solve the Euier equations as cast in the second chapter. The

fourth chapter is an exposition of test results. Based on these results, the 6fth and final

chapter presents conclusions and suggestions for future work related to the present solver.

Chapter 1. Introduction

Chapter 2

Equat ion Formulation

This chapter presents the Euler equations and their spatial discretisation, inchding

artificial dissipation and boundary conditions.

2.1 The Euler Equations

The integrai conservative form of the Euler equations for an unsteady, two-dimensional,

compressibIe inviscid flow aci.oss a surface 0, can be expresseci as:

where Q is the vector of conserveci variables and F(Q) is the inviscid flux tensor:

e

The dimensiouai variables, density (b), docity (6, 6) and total energy (é), are

non-dimensionalised using b s t r e a m (00) dues:

Chapter 2. Equation Formulation

Figure 2.1: A control volume

where a is the speed of souad, which for ideal fluids is a = m. Pressure is rehted to

the conserved variables by the equation of state for a perfect gas:

which provides clmure for Eq. 2.1, by relating the thermodynamic variables of the system.

2.2 Spatial Discretisat ion

The computational domain is discretised using a hitevolume formuiation in which

the control volumes are arbitrary polygonal ceh. These celis compose a dual grid, formeci

by comecting the centroids of of a trïangdar primary grid, as shown in Fig. 2-1.

A representative ceii i fiom the interior of the domain, whose ne edges border on

ceils such as representative neighbour ceii k, is shom in Fig. 2.2. The c d has an area

and outward normais = [Ayik -Azik]. The four conserved variables of the solution

vector Qi are stored at the c d node.

Section 2.2. Spatial Dimetisa tion

Figure 2.2: A representative interior c d

h m Gauss' theorern in two dimensions, which transforms surface integrals into

line integrals, Eq. 2.1 can be rewritten as:

where aR is the contour enclosing surface R, and n is a vector outwardly normai to the

surface. For ceiI à, this equation is applied as:

on the assumption that the d u e s of the conserveci variables of Qi are constant throughout

the c d and equal to t heir d u e s at the ceil node. The flux tensor Fik is taken as the average

of the flux tensors of ceils i and k, thus Eq. 2.7 can be written as:

d 1 % RZQi = -îC (Fi + FI) - nik

k=L

For interior ceiis, C Fi -% = O, since it is a sum over a closed contour.

Chapber 2 Eipation Formulation

2.3 Artificial Dissipation

A scaiar artificial dissipation term is added to the right-hand side of Eq. 2.8, to

provide numerid stability. The term consists of a scalar dissipation c&cient, a second-

Merence Laplacian operator, which is applied near shocks, and a weaker fourth-dinerence

biharmonic operator, which is used everywhere else in the computationai domain.

The second-difference Laplacian operator is:

where

The coefficient &' uses the pressure gradient acroni ce& i and k to detect shocks and thus

trigger the second-difference dissipation. d2) is a user-dhed constant that controls the

strength of the dissipation. For each cell edge contribution, the average of d2) for neighbour

c e h i and k is used.

The fourth-dEerence biharmonic operator is:

The coefficieut EP is a switch that tunui on the fourth-difference dissipation whenew the

second-ciifference dissipation is turned off. d') is a user-dehd constant that controls the

strength of the dissipation.

So that the artificial dissipation is properiy s d e d with respect to solution values,

it is muitiplied by a scaling factor obtained h m the largest eigenvaiue of the flux Jambian:

Section 2.4. Boundary Conditions

Figure 2.3: A representative boundary ceii

where u. v and a are calculateci h e d on the average of the solution vectors of ceh i and

Qik = Qi + Qk

2 C o m b ' i the scaling factor and the second and fourth-difference operaton, the complete

scalar artificid dissipation vector is:

This vector applies only to c d edges; no artificid dissipation is applied at the boundary

ceU faces described in Section 2.4.

2.4 Boundary Conditions

The cornputationai domain presents twu types of boundaries: the body surface

boundary of the solid aerodynamic con6guration placed in the domain, and the far-field

boundary at the outer M t of the domain itseif- in order to properly mode1 the 0ow in the

domain, certain conditions must be imposed at the bouudaries-

A representative boundary ceii is shom in Fig. 2.3. In addition to the centre

node, there is a node at the mid-point m of each of nf boundary faces. A vector Qb,

containing four boundaq vaiues, is stored at each fhe node. Some of the bomdary d u e s

are caicuiated bom freestream vaiues, whiIe others are extrapolateci to zeroth or ûmt order

10 Chap ter 2. Equation Fornufation

h m the conserveci variables stored at the centre node. The extrapolations are based on

the Taylor expansion: Zemth Order

and

A r = r z i + r y j

The Eux contribution hom the boundary faces is:

where now F (Qb (Qc (Qi))), inatead of simply F(Q) as at aii other nodes in the compu-

tationai domain.

2.4.1 Body Sudace Boundary

Given that an aerodynamic configuration such as an airfoi1 presents a rigid surface

to the Bow, at body surface boundaries the nornial component of the flow velocity is zero.

That is:

V, = V - n = ( u i + v j ) - ( n , i + % j ) = u % + v % = O (2.21)

where n is the local unit vector normal to the bady surface. The tangentid component of

the flow velocity, given by

where t is the local unit vector tangent to the body surface, is extrôpoiated to fmt order.

Pressure p is also extrapoiated to k t order, and stagnation enthalpy [E = (e fp) fp] is set

to £tee-strearn dues .

In view of these considerations, for body sdàce bonndaries, density and energy

are expresseci as:

Section 2.4. Boundary Conditions

The velocities u, u, Vn, and & are related by:

Thus, considering Eq. 2.21, and substituthg Eqs. 2.22, 2.23, and 2.24 in Eq. 2.2, at body

surface boundaries there is:

The flow over the aerodynamic configuration placed in the computational domain

generates disturbances in the b s t r e a m conditions. These disturbanm convect and diffk

throughout the domain, and when they reach the far-field boundary, instead of exiting,

t hey can be reflected back into the domain, producing large oscillations that hinder solver

convergence. To ensure that the disturbances exit the domain, non-reflecting boundary

conditions, which use the four Riemann invariants, are applied. Flow through the far-

field boundary is taken to be l o d y normal to the boundary, so onôdimensional Riemann

invariants are used. They are:

in terms of these Riemann invariants, there is:

such that density and energy are:

where u and v are obtained considering Eqs. 2.21, 2.22, 2.25, 2.30, and 2.31.

The Riemann boundary conditions are calculateci fiom fiee-stream d u e s or ex-

trapolated to zeroth order, depending on the Ereestrearn Mach number M, and whether

the boundary is of infiow or outflow. Table 2.1 summarises how the Eu-field boundary

conditions are calculateci. As an example, substituthg Eqs. 2.27 to 2.34 in Eq. 2.2, and

applying the d e s of Table 2.1 for a far-field inflow boundary with M, < 1, there is:

where

Table 2.1: Far-field inflow and outfiow boundary conditions.

INFLOW OUTPLOW

Mm < l m

RI R2 Ri3 % e o o o o o o

> l o o o o m o o e

RI R2 R3 % e e e e e e

Chapter 3

Algorit hm Formulation

This chapter explicitly states the equation resolved by the present solver and de-

scribes the methodoiogy used to obtain the solution. The methodology involves an inexact

Newton met hod, the matrix-Eree Generalized Minimum Residual (GMRES) algorithm, In-

complete Lower-Upper (EU) preconditioning, Reverse Cuthiil-McKee (RCM) reordering,

and Locai Time Stepping. The solution aigorithm is written in the C++ cornputer tan-

guage and uses routines kom the Portable, Extensible Toolkit for Scientific Computation

(PETSc) [28]. A fiow diagram of the algorithm is provideci in Appendix A.

3.1 The Residual Function

The discretised Bux integrah, artificial dissipation term and boundary condition

equations cast in the previous cbapter can be combined into a single Eunction for the entire

computationai domain. The function, referred to as the residual function, is a vector of

cells four-entry vectors and l ads to the non-hear system of equations:

It is this equation that is resolved by the present solver, hding the solution vector Q that

satisûes the system.

The ith equation of the system of Eq. 3.1 is obtained from Eq. 2.8, whose t h e

derivative is identically zero at steady state, and expressions 2.16 and 2.20. It can be

expressed as:

&(QI = %(Q) + T~(Q> = 0 ( 3 4

where

3.2 An Inexact Newton Method

The present solver appiies an inexact Newton method to solve Eq. 3.1.

Newton's method is based on Newton's Linearisation to first order which, applied

to Eq. 3.1, is:

~ [ Q l n + i ) ) R ( Q ( ~ ) ) + j f ) @ ( n ) = O (3.5)

where A Q ( ~ ) E Q("+') - ~ ( ~ 1 , and fi, the exact Jacobian matrix of R, is:

Hence, to first-order accuracy, the non-linear system of Eq. 3.1 can be repiaced by the linear

system of equations:

R(Q(~) ) + $)AQ(~) = O (3.7)

which it is customary to mite as:

with the right-hand term being referred to as Ythe right-hand side" or RnS.

The solution vector Q that satisfies Eq. 3.1 is found by an iterative process, started

by proposing an initial guess vector, @O), based on, for exampIe, freestream dues . At

each iteration, refmed to as a Wewton iteration*, the solution vector is updated using the

definition of A Q ( ~ ) as: Q(~+L) = ~ ( n ) + A Q ( ~ ) (3.9)

where the updating vector AQ(R) is found by solving Eq. 3.8 using the method aesmibed

in Section 3.3. At the nth Newton iteration, there is:

Section 3.3. Ma trix-tiee GMRES 15

which represents an inexact Newton method where the solution vector Q(") does not zero

the residual function of Eq. 3.8, but rather reduces it by a factor of @. In the limit, when

rl(n) = O, Eq. 3.10 reverts to Eq. 3.8 of Newton's method.

To determine the level of convergence of the residual function at the nth Newton

iteration, the present soIver caldates a residual R(") given by the Li-utim of the k t

equation of the residual Function, correspondhg to conservation of mass, normalised by the

same Lz-nom taken at the b t Newton iteration:

This residud is refmed to as the "density residual". The present solver stops the Newton

iterations when the density residud is equal to or d e r than 10-LO.

3.3 Matrix-fFee GMRES

The linear system of Eq. 3.8 is solved for AQ at each Newton iteration using the

GMRES aigorithm, an iterative solver belonging to the family of Krylov solvers.

For any linear system of equations of the form:

the GMFES aigorithm h d s the iterate x, E {XO f Km) that minimises the L1 n o m of

the residuai r, = b - &, where xo is an initial guess, Km is a Krylov subspace of the

form:

Km = s p z n { v , ~ v t , A'VI, .. . , A ~ - ' U ~ ) (3.13)

and

When solving Eq. 3.8, at the rnth GMRES iteration of the nth Newton iteration

the GMRES residual k rm = -R(Q(~)) - ~ ( " 1 A -

E ( Q)m (3.15)

If rm # 0, then (AQ), is still not the exact solution of Eq. 3.8. However' Pueyo [26,27]

has shown that soiving Eq. 3.8 accurateiy does not necessarily equaiiy reduce the residd

function of the non-Iinear system of Eq. 3.1. Therefore, to gave CPU time, the present

16 Cbapter 3. Aigorithm Formulation

solver stops the GMRES iterations when the GMRES residuai has decreased by G orders

of magnitude dehed by the user. In addition, since the storage requirements of GMRES

increase ünearly and the CPU expense inmeases quadraticdy with the number of vectors

in the KryIov subspace, the preseni solver terminates GMRES at m = 40, and restarts it

irsing the Iatest value of AQ to caiculate the new initiai residuai ro. This restarted form of

the GMRES aigorithm is denoted GMRES(40).

The matrix-vector products required by the GMRES algorithm to form the Krylov

subspace of Eq. 3.13 can be approximated using finitedifferences:

where E is obtained from:

E Ilvll2 - 6 with the present solver setting E, to machine zero. This matrix-free GMRES strategy

eliminates the need to explicitly form the exact Jacobian m a t h JE, thus producing a

savings in matrix storage.

3.4 ILU Preconditioning

In solving systerns of the form of Eq. 3.12, the GMRES algorithm can converge

sIowly or no& at aU if the system is iii-conditioned. The exact Jacobian matrix JE is off-

diagonaüy dominant, causing the system of Eq. 3.8 to bc iii-conditioned and thus susceptible

to this shortcoming. To alleviate this difiicuity, the p r e n t solver right prmnditions the

system of Eq. 3.8.

Applied to a system of the form of Eq. 3.12, right-preconditioning c m be e x p d

as:

(AM-L, (MX) = b

where M is a matrix that makes this system better conditioned than the original one. The

m a t e M used by this solver is obtained from an incomplete lower-upper factorisation of

an approotimate Jacobian matrix TA, describeci in the foiiowing Snbeection 3.4.1. With

right-preconditioning, Eq. 3.8 becomes:

Section 3.4. iL U Preconditioning 17

The factorisation of the approxhate Jacobian matrix JA is denoted ILU(p), where

the parameter p refen to the level of fill-in aiiowed in the factorisation. A 6li-in l e d of

p = O indicates that no 6ii-in beyond the original matrix pattern of nonzeros is aiiowed in

the L and U factor matrices. As p grows, more nonzero elements are added to the L and U

factors. thus making the factorisation more complete. A more complete factorisation should

reduce the number of GMRES iterations needed to converge the GMRES residual, but this

happens at the expense of inmeased storage and time per iteration.

It has been shown by Pueyo [26,27] that the number of GMRES iterations at each

Newton iteration does not increase if the preconditioner matrix JA is not recalcuiated at

each Newton iteration. Thus, the present solver freezes the preconditioner when the density

residual of Eq. 3.11 has decreased to a user-defined tolerance, P.

3.4.1 The Approximate Jacobian Matrix

The approximate Jacobian matrix JA on which the premnditioning of Eq. 3.8 is

baaed is a non-syrnrnetric sparse block matrix with block dimensions celis x cells. As the

positionof the off-diagonal blocks depends on the ordering of the ceii nodes, the unstructureci

nature of the grid of the present solver produces a matrix with a large bandwidth. An

example of the mat& structute is shown in Fig. 3.1.

Figure 3.1: Structure of the approximate Jacobian matrix (3,631-node grid).

18 Chapter 3. Algorithm Formulation

Each nonzero entry of J.4 is a 4 x 4 Jacobian m a t h corresponding to a partial

derivative of Eq. 3.2 with respect to Qi (diagonal blocks) or Qk (off-diagonal blocks).

including three approximations, the diagonal terrns are of the form:

(3.20)

and off-diagonal terms are of the form:

where, since C Fi - nik = O for interior celis, the fkit sum of Eq. 3.20 is nonzero only for

boundary celis. Examples of the Jacobians contained in Eqs. 3.20 and 3.21 are provided in

Appendix B.

The three approximations incorporated in Eqs. 3.20 and 3.21 are:

1. The scalar dissipation coefficient Xik is treated as if it were independent of Q.

2. The fourth-difierence biharmonic operator (2.11) is neglected and the coefficient E!:)

of the second-Merence Laplacian operator (2.9) is replaced by a linear combination

of the second and fourth-difierence dissipation coefficients of the RHS, that is:

where a is a user-defined control constant. Since the fourth-Merence operator in-

volves neighbour-of-neighbour nodes, its exclusion reduces the off-diagonal dominance

of the apprcallmate Jacobian matrix and decreases the number of blocks per row, thus

economising matrix storage,

3. Given that Fm (Q6 (Qe (Qi))), applying the chah d e to Eq. 3.4 gives:

However, for the approxhate Jacobian matrix, it is taken that Q, = Qi, correspond-

ing to a zeroth-order extrapolation, a s shown in Eq. 2.17, and thus:

where Z is the identity matrix. This reduction of the boundary condition treatrnent

from fÙst order to zeroth order economises matrix storage.

Section 3.5. RCM Reordering

Figure 3.2: Approximate Jacobian matrix with RCM reordering (3,631-node grid).

3.5 RCM Reordering

The large bandwidth of the approximate Jacobian matrix JA can affect the quality

of the preconditioning matrix. This is because o&diagonal matrix elements produce many

nonzero elements in the L and U factor matrices of a compIete LU imtorisation, and such

elements may be Iost in the incomplete LU factorisation used to iorm the preconditioning

mat& The present solver addresses this difhiculty by reordering the approximate Jacobian

mat* using the Reverse Cuthill-McKee (RCM) bandwidth reduction strategy. Fig. 3.2

shows the stmcture of the matrix of Fig. 3.1 aiter RCM reordering.

3.6 Startup Strategy

The initial Newton iterations are susceptible to converge slowly or even to diverge

if the initial guess vector @O) is far fiom the final solution vector. This is particularly the

case for flows with shocks.

The present solver appiies an implicit Euler start up strategy to attenuate this

difiiculty when solving transonic flows, in which keestream Mach nmnbers are in the range

0.7 < 1M, < 1.2. A time step h is added to the diagonai elements of the appmDcimate

Jacobian matrix, thus increasing the diagonal dominance of the mat*

Chapter 3. Algonthm Formulation

With the time step, Eq. 3.8 becomes:

The time step is calculated IocalIy at each ceii hm:

where the Coutant-F'riedrich-Lewy (CE) number is a user-defined value. It is incorporated

into both the exact Jacobian JE and the a p p r h t e preconditioner Jacobian $A, but is

not included for boundary ceii faces. The Z/h tenn of Eq. 3.25 is multiplieci by the density

residuai R(") of Eq. 3.1 1, which acts as a scaling coefficient. As the La-nom of the density

residuai tends to zero, so does the scaluig coefficient, and the term Z/h tends to zero. in

the limit, as I/R(")II, approaches zero, Eq. 3.25 approaches Eq. 3.8.

Chapter 4

Results

This chapter presents the resuits of tests to veri& and optimise the performance

of the Newton-Krylov solver.

The tests were conducted on a suite of eight cases, hvolving three subsonic and

four transonic flows around the singl~lement NACA 0012 airfoil, and one subsonic Bow

about a multi-element high-lift airfoil.

Airfoil

Mult i-&ment

Flow regime

Subeonic

Transonic

Table 4.1: Suite of test cases.

The three cases shown in 'm2d type in Table 4.1, one for each flow regime, were

taken as primary validation cases. They were tested using a set of base parameters deter-

niined to be optimum for each one. The five rPmRining cases of the suite were tested in

less detail with the set of base parameten used for the primary validation case of their flow

regime.

The sets of base parameten include the fourth-Merence dissipation control con-

stant, u (Subsection 3.4.1); the artificial dissipation control constants 6 2 and ~4 (Section

2.3); the tokrance G applied to the GMRES residual, r, (Section 3.3); the level of ILU

BU-in, p (Section 3.4); and the tolerance P appiied to the density residud R (Section 3.2)

for keezing the preconditioner matrix (Section 3.4). For the transonic cases, a local tirne

step, h (Section 3.6) was applied using a CFL value determined to be optimum.

Results for the primary validation cases include Mach number contours, pressure

coefficient (C,,) plots, and force and moment coefficients, to verify the performance of the

solver. These resdts are complemented wit h convergence histories for tests that compare

the performance of the solver when one of the base parameter values is varied. Resuits for

the rernaining cases of the suite only show the convergence history for a test nin with the

corresponding base parameten.

Convergence histories are reported in terms of the CPU t h e required for the

density residual R to converge to 10-'O. For these results to be independent of the cornputer

or compiler used, the CPU tirne is normaliseci by the cost of one evaluation of the RHS. This

cost includes aU flux Jacobian evaluations, the pressure field update, and the calculation of

artificiai dissipation. The convergence histories are complemented with graphs showhg the

number of GMRES iterations per Newton iteration.

All tests were run on a Compaq Alpha ES40 server with 667 MHz Alpha 21264A

processon.

4.1 Single-element Airfoil Cases

The singIe-eIement airfoil cases were run on the 3,631 and 14,193-node trianguIar

grids shown in Fig. 4.1, with the optimum base parameters indicated in Table 4.2. UnIess

specificaüy noted, aü of the single-element airfoii test results reported are for the 14,193-

node grid.

Section 4.1. Siogleelemen t Airfoil Cases

Figure 4.1: 3,631 and 14,193-node triangular grids about NACA 0012 airioil.

Table 4.2: Base parameters for singlôelement airfoi1 cases.

Flow Regime

Subsonic Tkansonic

4.1.1 Subsonic Flow

The primary validation case for subsonic Bow over a single-eiement airfoii

with M, = 0.63 and a = 2.0".

1%

0.63 0.8

is a case

The Mach number contours and Cp plot for this case are shown in Fig. 4.2. As

expected for an inviscid flow without shocks, the figures show a smooth flow field. The force

and moment coefficients of Table 4.3 agree weli with published resuits. in particulart the

drag mefEicient (CD), which should be zero, is sufficiently small to be attributed to numerical

error. Such error may be due to the use of zeroth instead of firstsrder extrapolation at the

airfoil boundaries.

a ("1

2.0 1.0

Table 4.3: Drag, lift and moment coetücients (M, = 0.63, a = 2.0").

o

5 4

K*

0.0 20.0

~4

0.3 0.3

CFL

- 100

G

IO-' IO-'

p

5 7

P

10-' IO-*

Figure 4.2: Mach number contours and C,, plot (Mm = 0.63, a = 2.0').

Fig. 4.3 shows the RHS evaluations for this primary validation case run with the

optimum base parameters. Each symbol represents a Newton iteration. On the 3,631-node

grid, this case ran in approximately 20 s, with 0.12 s of CPU t h e spent per RHS evaluation,

and an average of 9.6 GMRES iterations per Newton iteration. On the 14,193-node grid,

the case ran in approximately 2 m 448 s, with 0.53 s of CPU tirne ven t per RHS evaluation,

and an average of 15.1 GMRES iterations per Newton iteration.

The choice of optimum values for the base parameters of this primary validation

RHÇ Evah&na Nirtmlmmm

Figure 4.3: Convergence history using base parameters (Mm = 0.63, a = 2.0").

Section 4.1. Single-element Airfoi1 Cases

o=l - 0 9 - a s - &- : = . -7 -

-9

-10 -

Figure 4.4: Comparison of o values (Mm = 0.63, a = 2.0').

case is corroborated by the results of Figs. 4.4 to 4.7.

Fig. 4.4 cornpaxes solver performance when the fourth-difference dissipation contro!

constant is varied. Optimum resuits are obtained when o = 5. For a = 1, the number of

RHS evaluations more than doubles, and for each of the other values of a, except a = 7,

the difference in the number of RHS evduations is practicaiiy indistinguishable. The first

four values of o required 6 4 to be raised in the set of base parameters from 0.3 to 0.4 for

u = 2,3,4, and to 0.6 for a = 1.

Fig. 4.5 compares solver performance when the orders of magnitude that the GM-

RES residual is reduced at each Newton iteration are varied. The results c o h that it is

counterproductive to reduce the GMRES residuai by more than 1 order of magnitude. The

cases in which the GMRES residual was reduced by 2 and 3 orders of magnitude required

a local tirne step with CFL = 100, sol aithough not required, for consistency the 1-order

case was also nui with this time step. Thecefore, a true performance cornparison is one

made between the more than 3,000 RHS evaluations taken by the h r d e r reduction case of

Fig. 4.5, and the 320 RHS evaluations of the 14,193-node case of Fig. 4.3.

Fig. 4.6 compares solver performance when the Hi-in level of the preconditioner

matrix is varied. The factorisation of ILU(0) is too incomplete, Ieading to a vecy Iarge

number of RHS evaluations. An ILU(1) factorisation offérs a signiûcant improvement,

but the best resuits are obtained for ILU(4) and above, for which the ddierences in the

number of RHS evahations are aimost indiscemible. An ILU(5) factorisation was chosen

2 Oroen - - 30rders -

-8 - -10 -

O 5 0 0 1 0 M ) l S O O a M O 2 M O 3 W O 3 5 0 0 RHS Evalunbm

Figure 4.5: Cornparison of GMRES tolerances, G (Mm = 0.63, a = 2.0").

RHS

Figure 4.6: Comparison of ILU@) fill levels (Mm = 0.63, a = 2.0").

as optimum, because it offers a slight advantage in the number of RHS evaIuations, and

can be nin with the ~4 value of the base parameters, whereas ILU(6) and ILU(7) require

K4 = 0.4.

Fig, 4.7 compares solver performance when the toletance applied to the density

residual for heezing the preconditioner rnatrix is varied. Optimum d t s are obtained

when the preconditioner matrix is fiozen as soon as the density residd is equal to or less

than 10-'.

Appiication of the base parameters to the cases of Mm = 0.3; a = 5.0°, and

Mm = 0.5; a = 3.0" yields the r d t s ofFig. 4.8. The convergence history for the Mm = 0.5;

Section 4.1. Single-element Airfoi1 Cases

Figure 4.7: Cornparison O€ preconditioner €reezing tolerances, P (Mm = 0.63, o! = 2.0").

Figure 4.8: Other singleelement subsonic cases.

a = 3.0" case is very siraiiar to that shown in Fig. 4.3 for the primary validation case.

However, as expected, the Mm = 0.3; a = 5.0' case requires more RHS iterations given the

combination of lower keestream Mach number and higher angle of attack.

4.1.2 Tkansonic Flow

The primary validation case for transonic flow over a singie-element airioil is a

case with Mm = 0.8 and a = 1.0'.

The Mach number contours and Cp plot for this case are shown in Fig. 4.9. As

expected for this case, they show the formation of a shock on the upper surface of the aiaoil.

The force and moment coefficients of Table 4.4 agree weil with pubiished resuits. Given

that the present solver considers the Bow to be inviscid, the fact that the value of CD is

Figure 4.9: Mach number contours and Cp plot (Mm = 0.8, a = l.OO).

Table 4.4: Drag, iift and moment coeficients (Mm = 0.8, a = 1.0").

approximately ten times greater than for the subsonic validation case is attributed solely

to the pressure discontinuity acrw the shock.

Fig. 4.10 shows the RHS evaluations for this primary validation case nin with the

optimum base parameters. Each symbol represents a Newton iteration. On the 3,631-node

grid, this case ran in approximately 42 s, with 0.1 s of CPU tirne spent per RHS evaluation,

and an average of 6 GMRES iterations per Newton iteration. On the 14,193-node grid, the

case ran in approximately 7 m 9 s, with 0.45 s of CPU t h e spent per RBS evaluation,

and an average of 8.3 GMRES iterations per Newton iteration. The appearance of "spikesn

during convergence is attributable to a poor initial guess Q(O) that tends to mislead the

solver, especiaily during the first Newton iterations.

The choice of optimum d u e s for the base parameters of this primary validation

case is corroborated by the resdts of Figs. 4.11 to 4.14.

Fig. 4.11 compares solver performance when the fourth-diffefence dissipation con-

Section 4.1. Single-elmen t Airfoil C m 29

RHS Nirmnllvdan

Figure 4.10: Convergence history using base parameters (Mm = 0.8, a = 1.0").

Figure 4.11: Cornparison of o d u e s (Mm = 0.8, a = 1.0').

trol constant is varied. Optimum resuits are obtained when a = 4. The case with a = 2

did not converge.

Fig. 4.12 compares solver performance when the orders of magnitude that the

GMRES residual is reduced at each Newton iteration are varieci. Again, as with the subsonic

validation case, the results confina that it is counterproductive to reduce the GMRES

residuai by more than 1 order of magnitude at each Newton iteration. The 2 and h r d e r

cases were run with (3% d u e s of 30 and 10 respectively.

Fig. 4.13 compares solver performance when the fdi-in Ievel of the preconditioner

matnx is varied. The factorisations with 6il-in IeveIs below 4 are too incomplete, causing

Figure 4.12: Comparison of GMRES tolerances, G (Ma = 0.8, a = l.OO),

Figure 4.13: Comparison of ILLT(p) 6ii levets (M, = 0.8, a = 1.0").

the solver to not converge. The optimum convergence rate is obtained with ILU(7), which

is higher than for the single-element subsonic validation me.

Fig. 4-14 compares solver performance when the tolerance appiied to the density

residuai for k i n g the preconditioner matrix is varied. Optimum resdts are obtained if

the preconditioner is frozen as soon as the density m i d d is equai to or d e r than IO-*.

Freezing the preconditioner below this tolerance has no e f f i on the nmber of GMRES

iterations per Newton iteration.

Application of the base parameters to the cases of M, = 0.729; a = 2.31°,

M, = 0.8; a = 1.25", and M, = 0.95; a = 1.0" yieIds the resntts of Fig. 4.15.

Section 4.2. Multi-element Airfoil Case 3 1

RHS Evdwbm Hmaiimdm

Figure 4.14: Comparison of preconditioner beezing tolerances, P (Mm = 0.8, a = 1.0").

-2

a " I - 1 a 4

6 :: 4 -

-9

-IO

Figure 4.15: Other single-element transonic cases.

4.2 Multi-element Airfoil Case

O Z a 400 600 a00 l m , lm 1400 isa, lm t 2 3 4 5 6 7 s s 10 11 12 13 14 1s 1s 17 10 19 m

'IO-! 1- ..n - . 10.' - -?Y- ' . . 10' . * . -

ïS* :

\, ,-.-. .. ...,, ; .

'\

.

- '. .

The validation case for subsonic iif'ting flow over a multi-element airfoil is a case

with M, = 0.2 and cr = 5.0". It was nui on the 17,8%node triangdar grid shown in

Fig. 4.16, with the optimum base parameters indicated in Table 4.5.

The Mach number contours and Cp plot for this d d a t i o n case are shom in

40

35

30

12s a ,'= # 15

10

5

O

Table 4.5: Base parameters for multi-element high-lift airfoil case-

Flow Regirne

Subsonic lifting

Mm

0.2

cr [O]

5.0

a

5

~2

0.0

~4

0.4

G

IO-L

CFL

- p ( P

4 IO-^

Figure 4.16: 17,894node trianguiar grid about mdti-element airfoil.

Figure 4.17: Mach number contours and Cp plot (Ma = 0.2, a = 5.0°).

Fig. 4.17. The force and moment coefficients are shown in Table 4.6.

Fii. 4.18 shows the RHS evduations for this validation case run with the optimum

base parameters. Each symbol represents a Newton iteration. The case ran in approximatdy

Section 4.2. Multi-dement Moi1 Case

Table 4.6: Drag, lift and moment coeficients (Mm = 0.2, ct = 5.0").

Leading-edge slat Wing section Trahgedge fiap 1 Trailing-edge flap 2

Figure 4.18: Convergence history using base parameters (M, = 0.2, a = 5.0").

0.2904 0.1859 0.3694 1.138 0.4261 0.8490 3.755 -0.0854 -0.3930

0.5003 -0.4313 -0.2753

11 m 49 s, with 0.82 s of CPU tirne spent per RHS evaluation, and an average of 37.2

GMRES iterations per Newton iteration. These values are higher than for the subsonic and

transonic single-element airfoil cases. This is to be expected &en the greater complexity

of the aerodynamic co~guration.

The choice of optimum values for the base parameters of this validation case is

corroborated by the results of Figs. 4.19 to 4.22.

Fig. 4.19 compares solver performance when the fourth-difference dissipation con-

trol constant is varied. As with the singIeeIement validation cases, optimum results are

obtained when a = 5. For this case, however, the solver does not converge for any value of

a lower than this.

Fig. 4.20 compares solver performance when the orders of magnitude that the

GLIRES residual is reduced at each Newton iteration are varied. The resdts CO& that

it is counterproductive to reduce the GMRES residnat by more thau 1 order of magnitude.

The cases in which the GMRES residual was reduced by 2 and 3 orders of magnitude

Chapter 4. Results

RHS Evaluaüom

Figure 4.19: Comparison of u values (M, = 0.2, a = 5.0").

Figure 4.20: Comparison of GMRES tolerances, G (M, = (3.2, cr = 5.0").

required a locd tirne step with CFL = 100, so, although not required, for consistency the

l-order case was also nui with this tirne step. Therefore, a true performance cornparison

is one made between the more than 5,000 RHS evaiuations taken by the 3-order reduction

case of Fig. 4.20, and the 700 RHS evaiuations of Fig. 4.18.

Fig. 4.21 compares solver performance when the fiii-in Ievel of the preconditioner

matrix is varieci. As in the previous validation cases, the factorisation of ILU(0) is too

incomplete, leading to a very large number of RHS evaiuations. A . ILU(4) factorisation

Section 4.2. Multi-element Airfoil Case

Figure 4.21: Cornparison of ILU(p) fiii levels (Mm = 0.2, a = 5.0").

Figure 4.22: Cornparison of preconditioner b i n g tolerances, P (Mm = 0.2, a = 5.0').

offers optimum resuits. The solver does not converge for higher ievels of fiii-in.

Fig. 4.22 compares soiver performance when the tolerance applied to the density

residual for fieezing the preconditioner rnatrix is varieci. Optimum resuits are obtained

when the preconditioner matrix is fiozen as mon as the density residuai is equal to or less

than IO-*. Recalculating the preconditioner matrix for fnrther Newton iterations produces

no benefit.

Chapter 5

Conclusions

A node-based finite-volume Newton-Krylov solver has been implemented to solve

the Euler equations on arbitrary polygonal unstructured grids about aerodynarnic conlig-

urations. The solver applies an inexact Newton method to solve the non-iinear system of

equations that results Çom the spatial discretisation of the Euler equations. A matrix-kee

GMRES strategy is used to solve the hear system arising at each Newton iteration. This

system is right-preconditioned using an approximate Jacobian matrix subjected to ILU

factorisation and RCM reordering.

The solver ha, been tested with a single-element airfoil in the subsonic and tran-

sonic Bow regirnes, and with a multi-element aiFfoil in a subsonic high-iift case. The tests

successfully validate the functioning of the solver and investigate the &ect on convergence

of vaying grid size, scalar artiûciai dissipation parameters, preconditioner fiil-in, and GM-

RES and preconditioner-hing tolemnces. Sets of base parameters that optimise the

performance of the solver have been determined.

The solver is ready to be extendeci £iom the Eder equations to the fdi Navier-

Stokes equations; however, some of its present aspects are worthy of further attention- A

full hearisation of artificial dissipation terms and an extension h m zeroth to fitstsrder

extrapolation at boundaries codd be appiied to determine if an improved preconditioner can

be obtained without incurring excessive speed and memory penalties. Matrix dissipation,

instead of the current scalar dissipation, could be appiied in an attempt to lower the values of

the second and fourth-clifference dissipation terms, which might improve accuracy.Difkent

levels of GMRES restart codd be tested to determine their impact on solver performance.

Finaiiy, start-up strategies, such as mesh sequencing, codd be attempted to provide the

38 Chapter S. Conclusions

solver with an improved initial guess vector, which would improve the speed of the solver,

especiaüy in subsonic lifting cases and transonic flow cases with shocks.

Appendix A

Algorit hm Flow Diagrarn

Propose initial guess

Update solution vector &" = &' + &l

B"" €Y'

Is Wrn 5 1 Crt0

Forni J,"' matrix YES YES [lLU(n) & RCW to precondiion

J,~'&=-R(&) M = J&-

M=J/

Apply GMRES to solve for& in (J,"M-')(M& )=-R(&) Z Advance one GMRES iteraüm i+

Appendix A. Aigori th Flow Diagram

Appendix B

Examples of the Jacobians for JA

interior Cells

For interior ceiis i and k, the flux Jacobians of Eqs. 3.20 and 3.21 are of the form:

aF -= aQ

Rom Eqs. 2.2, 2.5, and 2.3, there is:

and

Appendix B. Exanples of the Jacobians for JA

Tberefore. the flux Jacobians are:

Boundary Faces

For boundary faces rn, andogously, the flux Jacobians of Eqs. 3.20 and 3.21 are of the fom:

For the body surface boundary case of Subsection 2.4.1, the flux tensor of Eq. 2.3 is ex-

presseci in ternis of Qb of Eq. 2.26, where the pressure p and the tangentia.1 velocity com-

ponent V; are extrapolateci values expressed in terms of the variables q ,~ , ~ ~ 2 , qe3, and qe4.

Similarly, for each of the four far- field botmdary cases oE Subsection 2.4.2, the ftmc tensor of

Eq. 2.3 is e x p d in terms of the corresponding Qa vector; the example vector of Eq. 2.35

corresponds to the subsonic inftow case- Dependhg on the case, a, p, p, Va, and & are

extrapolateci values, expressed in terms of the variables q,~, qe2, q d , and q d .

The Q vector Jacobians of Eq. 3.20 are of the form:

There are five such Jacobians: one for the body surface boundary Qb vector of Eq.2.26, and

four for the far-field boundary Qb vecton, of which Eq. 2.35 is the k t . The Jacobian for

the supersonic inûow case is null. As an example, for the body surface houndary case:

These Jacobians are dculated using the M~ple [29] symbolic mathematical corn-

putation system. As an example, the code for generating the C code for the Jacobian of

Eq. B.7 is:

wit h(lina1g);

Appendix 8. Examples of the Jacobians for J..

References

[Il Dembo, R. S., S. C. Eisenstat, and T. Steihaug, "Inexact Newton Methods," SIAM

Journal on Numerical Analysis, Vol, 19, No. 2, 1982, pp. 4ûû-408.

[2] Vanka, S. P., "BIock-Implicit Calculation of Steady Turbulent Recircuiating Flows,"

international Journal of Heat and Mass ïhnsfer, Vol. 28, 1985, pp. 2093-2103.

[3] Venkatakrishnan, V.. "Newton Solution of Inviscid and Viscous Problems," AiAA Pa-

pet 880413, January 1988.

[4] Orkwis, P. D.. "Cornparison of Newton's and Quasi-Newton's Method SoLvers for the

Navier-S tokes Equations." AIAA Journal, Vol. 31, No. 27, 1993, pp. 832-836.

[5] Whitfield, D. L., and L. K. Taylor, W k m t i z e d Newton-Relaxation SoIution of High

Resolution Flux-DWerence Split Schemes," Pmceedings of the AIAA Tenfh Annual

Computational Fluad Dynamics Conference, 1991, pp. 134145.

[6] Gear; C. W., and Y. Saad, "Iterative Solution of Linear Equations in ODE codes,"

SIAM Journal on Scientific and Statistical Computing, Vol, 4, 1983, pp. 583-601.

[?] Chan, T. F., and K- R. Jacksou, "Nonliueariy Preconditicrned Krylov Subspace Meth-

ods for Discrete Nehan Algorith," SIAM Journal on Scient+ and S t ~ t i ~ c a l Cm-

puting, Vol. 5 , 1984, pp. 535-542.

f8j Brown, P. N., and A- C. Hindmarsh, "Matrix-hm Methods for StSSystems oiODEs,"

SIAM Journal un Numericol AnalysîP, Vol. 23, L986, pp. 610-638.

[9] Brown, P. N., and Y. Saad, 'Kybnd Krylov Methods for Nonlinear Systems of Equa-

tions: SIAM Journal on Scientific and Statistid Computing, Vol. 11, 1990, pp. 450-

481.

[IO] Saad, Y., and M. H. Schultz, "GMRES: A Generaiized Minimum Residuai Algorithm

for Solving Nonsymmetric Linear Systems," SIAM Journal on Scientific and Statisticul

Computing, Vol. 7, No. 3, 1986, pp. 856869.

[Il] Wigton, L. B., N. J. Yu, and D. P. Yomg, "GMRES Acceleration of Computational

Fluid Dynamics Codes," AIAA Paper 851494, Juiy 1985.

[12] Johan, Z., T, J. R. Hughes, and F. Shakib, "A Globdy Convergent Matrix-Free Ai-

gorithm for Impiicit Time-Marching Schemes Arising in Finite EIement Anaiysis in

Fluids," Computational Methods in Applied Liechanics and Engineering, Vol. 87, 1991,

pp. 281-304.

[13] Shakib, F., T. J. R. Hughes, and 2. Johan, "Element-by-element Algorithms for Non-

symmetric Matrix Problems Arising in Fluids," Superlarge Problems in Computational

Mechanics, Plenum, 1987, pp. 1-34.

[14] Venkamkrishnan, V., and D. J. Mavriplis, 'Impiicit Solvers for Unstructureci Meshes,"

AIAA paper 91-1537, Jwie 1991.

[15] Keyes. D. E., "Aerodynamic Applications of Newton-Kryiov-Schwarz Solvers," Proceed-

ings of the 14th International Conference on Numerimi Methods in Fluid Pynamics,

Springer, 1995, pp. 1-20.

f16] Tidriri, M. D., "Efficient Preconditioning of Newton-Krylov Matrix-Free Algorithms

for Cor ipressible Flows," Journal of Computational Physics, Vol, 132, 1997, pp. 51-61.

[17] Ajmani , K., W. F. Ng, and M. S. Liou, "Preconditioned Conjugate Gradient Methods

for the Compressible Navier-Stokes Equations," Journal of Computational Physics,

Vol. 11 1, 1994, pp. 68-81.

[18] Barth, T. J., and S. W. Linton, "An Unstructureci Mesh Newton Solver for Com-

pressible Fluid Flow and its Paralle1 Implementation," AIAA paper 95-0221, January

1995.

[191 Jiang, H., and P. A. Forsyth, "Robust Linear and Nonlinex Strategies for Solution of

the lkausonic Euler Equations," Cornputers and Fluiàs, Vol. 24, 1995, pp. 753470.

[20] McHugh, P. R., and D. A. Knoil, "CompaRson of Standard and Matrix-Ekee Imple-

mentations of Several Newton-Kry Iov Solvers," AIAA Journal, Vol. 12, pp. 23942400,

December 1994.

[21] Mavripiis, D. J., "On Convergence Acceleration Techniques for UtI~tnictUred Meshes,"

AIAA Technical Report 982966, 1998.

[22] Nielsen, E. J., W. K. Anderson, R. W. Wdters, and D. E. Keyes, "Application

of Newton-Krylov Met hodology to a Three-Dimensional Unstmct ured Euler Code,"

AWA Paper 951733, June 1995.

[23] Tidriri, M. D., "Krylov Methods for Compressible Flows," ICASE Technical Report

95-48. June 1995.

[24] Blanco, M.. and D. W. Zingg, "Fast Newton-Krylov Method for C'nstmctured Grïds,"

AIAA Journal. Vol. 36, No. 4, 1998, pp. 607-612.

[251 Blanco, M., An Implicit Solution Method for the Euler Equations on Uns!;uctured

ïEangular Grids, Master's t hesis, University of Toronto, Sep tember 1995.

[26] Pueyo, A., and Zingg, D.W.' "Efficient Newton-Krylov Solver for Aerodynarnic Com-

putations," AIAA Journal, Vol. 36, No. 11, 1998, pp. 1991-1997.

[2?] Pueyo, A., -4n Eficient Newton-Krylov Method for the Euler and Nauier-Stokes Equa-

tions, Ph.D. thesis, University of Toronto, December 1997.

[28] Baiay, S., K. Buschelman, W. D. Gropp, D. Kaushik, L. C. Mcinnes, and B. F. Smith,

"PETSc home page," http://~.mcs.anl.gm/petsc, 2001.

i291 Char, B. W., K. O. Geddes, G. H, Gonnet, B. L. Leong, M. B. Monagan, and S. M.

Watt, First Leaues: A ïùtorial Introduction fo Mople V , Springer-VerIag, New York,

1992.

Documents

A Newton-Krylov Solver for the Euler Equat ions on ......braster of AppM Science Graduate Department of Aerospace Science and Engineering University of Toronto 2001 A Newton-Krylov