Upload
others
View
1
Download
0
Embed Size (px)
Citation preview
1,;u f/lElTON tt1.tr# C>CPf" UL S-,,\,N bJC
LA -:S.? lie..
INTERNATIONAL SYMPOSIUM
ON
OPERATOR THEORY OF NETWORKS
AND SYSTEMS
Volume 2 EDITORS:
N. LEVAN R. SAEKS
AUGUST 17-19, 1977
SPONSOR ED BY
TEXAS TECH. UNIVERSITY U.S. AIR FORCE OFFICE
OF SCIENTIFIC RESEARCH UNDER GRANT 77·3382
Hilton Inn -lubbock, Texas
OPERATOR THEORY OF NETWORKS AND SYSTEMS. INTERNATIONAL SYMPOSIUM ON.
Vol. 1. Concordia University, Montreal, Canada. August 12-14, 1975. 153 pages. Paper ......... :....................................... $22.00
CONTENTS (partial):
• Structure Result for Nonlinear Passive Systems • Frequency Response Methods in Multivariable Infinite Dimensional
Linear Systems • A Walsh Operational Matrix for Solving Variational Problems • The Feedback Interconnection of Multivariable Systems: Simplifying
Theorems for Stability • Linear Hilbert Networks Containing Finitely Many Nonlinear Elements • Linear Network Synthesis Using Iteration Methods • A Note on the Nagy-Foias Lossy and Lossless Space • An Output Control Problems Containing Input Derivatives • Contractive Transfer Ratios of Operator Network
(Standing Orders Accepted)
Additional Copies Available From:
WESTERN PERIODICALS COMPANY 13000 RAYMER STREET
NORTH HOLLYWOOD, CALIFORNIA 91605
Copyright © 1977 by Western Periodicals Company 13000 Raymer Street North Hollywood, California 91605
ii
PRE F ACE
The International Symposium on the Operator Theory of Networks and Systems is a biannual
conference which is guided, ~rom year to year, by a Steering Committee composed of
representatives from a number of universities. The current symposium is the second in
the series, and the first which Texas Tech University has had the honor of hosting.
Although originally conceived as a forum for the presentation of research results in the
area of operator theoretic techniques applied to network and system theory, the scope
of the symposium has grown considerably over the years. The present volumn therefore
encompasses research spanning the entire field of Mathematical Ne~work and Systems
Theory. Mathematical tools include algebraic techniques, algebriac topology, and
differential geometry in addition to the operator theoretic and functional analytic
techniques to which the symposium was originally directed.
The proceedings clearly convey the international interests in the area of Mathematical
Network and Systems Theory. Contributions of Scientists from three continents - Asia,
Eroupe, and North America are presented.
The Symposium committee is please to acknowledge the support of the Department of
Electrical Engineering, the College of Engineering, and the Graduate School of Texas
Tech University. Moreover, the symposium could not have been conducted without the
financial support of the U. S. Air Force Office of Scientific Research.
III
SECOND INTERNATIONAL SYMPOSIUM ON THE OPERATOR THEORY OF
NETWORKS AND SYSTEMS
STEERING COMMITTEE
C. A. Desoer University of Calif. at Berkeley Berkeley, CA.
J. W. Helton University of Calif. at San Diego La Jolla, CA.
N. Levan University of Calif. at Los Angeles Los Angeles, CA.
W. A. Porter Louisiana State University Baton Rouge LA.
R. Saeks Texas Tech University Lubbock, TX.
A. H. Zemanian State university of New York Stony Brook, N.Y.
R. W. Newcomb (Chairman) University of Maryland College Park, MA.
SYMPOSIUM ORGANIZING COMMITTEE
R. M. DeSantis (Program Chairman) University of Montreal
N. Levan (Co-Chairman) University of Calif. at Los Angeles Los Angeles, CA. Montreal Quebec
Canada
R. Saeks (Co-Chairman) Texas Tech University
Lubbock, TX.
IV
TAB LEO F CON TEN T S
1: COLLOQUIUM LECTURE
"Input-Output Properties of Interconnected Systems: Part I"
C. A. Desoer, Univ. of Calif. at Berkeley ----------------------
2: GENERAL SESSION I
"Bernstein Systems for Approximation and Realization"
W. A. Porter, Louisiana State Univ. ----------------------
"A Linear Systems Theory in Multidimensional Time"
A. V. Balakrishnan, UCLA ---------
"Optimal Control in Hilbert Space" M. Steinberger, M. Schumitzky, and L. M. Silverman, Univ. of Southern Calif. ---------------
"Solvability and Linerzation of Monotone Hilbert Networks"
V. Dolezal, SUNY at Stony Brook ----------------------------
4: SESSION ON FUNCTION ANALYTIC TECHNIQUES
"Time-Varying Input-Output Systems Whose Signals are Banach-SpaceValued Distributions"
A. N. Zemanian, SUNY at Stony Brook ----------------------------
"Causality and C Operators" A. Feintuch, °Ben Gurion Univ. ----------------------------
"Wiener-Hopf Techniques in Resolution Space"
L. Tung, and R. Saeks, Texas Tech Univ. -----------------------
"Approximate Controllability and Weak State Stabilizability"
C. Benchimol, UCLA ---------------
"A Modified Discrete Convolution Operator for Simulation of Linear Continuous Systems"
H. B. Kekre and D. B. Phatak, Indian Inst. of Tech. at Bombay
5: SESSION ON DYNAMICAL SYSTEMS
"Differential Systems on Alternative Algebras"
R. W. Newcomb, Univ. of
2
6
.7
13
17
21
28
34
35
__ Maryland -------------------------- 36
v
"Lagrangians with Integrals: An Approach to the Variational Theory of Dissipative Networks"
V. M. Fatic, Tri-State Univ., and W. A. Blackwell, Virginia Polytechnic Inst. and State 38 Univ. -----------------------------
"An Estimation of Parameters in Parabolic Equation with SpaciallyVarying Coefficients"
Z. Jacyno, Univ. of Quebec in 56 Montreal --------------------------
"The Singularity Expansion Method _ in Electromagnetic Scattering" ~9'l
D. R. Wilton, Univ. of Miss. ------~.
"Variational Principels for Mechanical and Structural Systems with Applications to Optimality of Design"
V. Komkov, Texas Tech Univ. -------- 63
"A Numerical Calculation Method for Simultaneous Ordinary Differential Equation of High Order by the Momentary Diagonalized Modal Property"
S. Azuma, Ibaraki Univ., and B. F. Womack, Univ. of Texas at Austin -------------------------- 66
6: SESSION ON CONTROL
"On Structurally Stable Nonlinear Regulation with Step Inputs"
W. M. Wonham, Univ. of Toronto ----- 72
"Frequency Domain Stability for a Class of Partial Differential Equations"
D. Wexler, Univ. Notre Dame de la Paix ------------------------- 76
"Densensitizing Observer Design for Optimal Feedback Control"
M. M. Missaghie, Sentrol Systems Ltd. _______________________________ 81
"Grassman Manifolds and Global Properties of the Riccati Equation"
C. Martin, NASA/AMES Research Center _____________________________ 82
"Generalized Operator and Optimal Control"
S. M. Yousif, Calif. State 86 Univ. at Sacramento ----------------
"Absolute Invariant Compensators: Concepts, Properties and'Applications"
R. M. DeSantis, Univ. of Montreal --------------------------- 90
8: GENERAL SESSION II
"Stability Tests for One, Two, and Multidimensional Linear Systems"
E. I. Jury, Univ. of Calif. at Berkeley _______________________ 91
"A Darlington Realization Theory of Optimal Linear Predictors"
P. DeWilde, T.H. Delft.
"Operator Theory Techniques for Finite Dimensional Problems"
92
J. W •. Helton, Univ. of Calif. at San Diego ______________________ 96
"The Relation Between Network Theory, Vector Calculus and Theoretical Physics"
F. H. Branin, IBM Corp. ----------- 97
9: SESSION ON LINEAR NETWORKS AND SYSTEMS
"Detached Coefficients Representation and Degree Functor of a Polynomial Matrix with Application to Linear Systems"
Y. S. Ho and P. H. Roe, Univ. of Waterloo _______________________ 103
"A Representation of Impedance Function in Terms of the Poles and Zeros for Transmission Lines"
F. Kato and M. Saito, Univ. of Tokyo --------------------------109
"Evaluation of Constituent Matrices of an Analytic Matrix Function"
F. C. Chang and S. R. Pulufani, Alabama A&M univ. -----------------113
"On the Losless Scattering Matrix Synthesis via State Space Techniques"
A. L. Dobruck and M. S. Piekarski, Wroclaw Technical Univ. -----------116
"Algebraic Characterizat:.ion of Matrices whose Multivariable Characteristic Polynomial is Hurwitzian"
M. S. Piekarski, Wroclaw Tech. Univ. -----------------------------121
10: SESSION ON OPERATOR METHODS
"Contraction Operator of Class C and the Structure of a class of 0 Infinite Dimensional System"
D. Hedberg, Hughes Aircraft Co., and N. Levan, UCLA -----------127
"Discrete-Time System Operators on Resolution Sets of Sequences"
R. J. Leake and B. Swanimathan, Univ. of Notre Dame ---------------128
VI
"Some Aspects in a Theory of General Linear Systems"
R. H. Foulker, Youngstown State Univ. -----------------------
"Certain Aspects of Inverse Filters"
V. P. Sinha, Indian Inst. of Tech. at Kanpui, and H. S. Sekhon, Punjab Argicultural Univ. -----------------------------
"Adaptive Antenna Polarization Schemes for Clutter Suppression and Target Identification"
G. Ioannids and D. Hammers, ITT Gilfillan Inc. ----------------
"On Limitations Based on Properties of the Strum-Liouville Operators in the Synthesis Procedures of Nonuniform Lines"
Z. Trazaska, Inst. of the Theory of Elec. Engrg. and Elec. Measurements of Warsaw -------------------
11: SESSION ON NONLINEAR SYSTEMS
"A Theory of Best Appro~imation of Nonlinear Functionals and Operators by Volterra Expansions"
133
134
135
'140
L. V. Zyla and R.J.P. de Figueiredo, Rice Univ. - ____________ 143
"Continued Fraction Describing Functions for Bilinear and Multiplicative Nonlinear Systems"
C. F. Chen, Univ. of Houston, and R. E. Yates, u.S. Army Research Lab. at Redstone Arsenal ____________________________ 144
"Nonlinear Analysis of Gyrator Networks: A Numerical Example"
M. B. Waldron and M. A. Smithers, Univ. of Houston ___________________ 148
"Lie Series and the Power System Stability Problems"
R. K. Bansal and R. Subramanina, Pubjab Argicultural Univ. ---------- 152
"A Study of Varying Efficiency Multiserver Queue Models"
H. B. Kekre, R. D. Kumar, and H. M. Srivastava, Indian Inst. of Tech. at Bombay ----------------- 156
"Modular Design of the Network which Realizes Original jProgram"
S. P. Kartashev,. Univ. of Nebraska, S. 1. Kartashev, DCA Assoc. ________ 161
INPUT-OUTPUT PROPERTIES OF INTERCONNECTED
SYSTEMS
C.A. Desoer Dept. of Electrical Engineering and Computer Science
University of California Berkeley, Ca. 94720
ABSTRACT
The purpose of this survey paper is to review some recent results on the input-output properties of both linear and nonlinear interconnected systems. The results presented deal primarily with qualitative system properties such as stability and parameter sensitivity with emphasis being placed on the robustness of these properties.
BERNSTEIN SYSTEMS FOR APPROXIMATION AND REALIZATION*
William A. Porter**
1. INTRODUCTION
This summary deals with the approximation of non
linear systems by polynomic operators. For per
spective it is helpful to consider the familiar
Volterra series expansion on L2 given by
p(x) ,= kO + S k~ (a)x(a)da +Hk2 (a,B)x(a)x(B)dadB + J~!k3(a,B,y)x(a)x(B)x(Y)dadBdY + ...
where the kernels kO' kl, ••. ,kn
satisfy properties
suitable to an operator on L2
. For the obvious
reasons we refer to each term on the right hand
side as a power function. If the number of terms
is finite then p is said to be a polynomic opera
tor. Our interest in polynomic operators centers
on their use as approximates of the more general
nonlinear functions on L2
•
The relevant lit~rature may be grouped into two
subcategories. -First-there is the analytic theory
in which f is assumed to have derivatives (Frechet
or Gateau) of all orders and p arises as a power
series expansion on a bounded domain. This line of
devElopment was initiated by Volterra [1] and was
first e.pplied in a systems setting by Weiner and in
ensuing years several others including [2], [3],
and [0]. Mere recently [5], [6], [7], [8] have
investigated the Volterra expansion of solutions to
nonlinear differEntial equations with current
emphasis on computation and convergence problems.
The analytic theory identifies the power func
tions with the deri vat:iv'~s of the system function.
f, to be approximated. The requisite differenti
ability is a severe condition, however, a fringe
benefit accrued is that the causality of the power
functions is dictated by the causality of f.
In an independent line of development the polynomic
approximation problem has been approached as a
generalization of the classic Weierstrass result.
In this setting the function, f, to be approximated
need not be differentiable. Emphasis is placed on
uniformly approximating f, by the polynomial p,
over an arbitrary compact set. In this approach
the causality issue is less trivial. The computa
tion of the power functions, which no longer repre
sent derivatives, has here to fore been obsure. In
summary we introduce the concept of a Bernstein
differential system. This system provides one con
structive realization of the Weierstrass approach.
An incidental bonus of the Bernstein system is a
pseudo sampling-theorem for systems. In short,
given an arbitrary system with a prescribed contin
uity modulus, the density with which one must input
output s~~ple in order to be able to approximately
reconstruct the system is established.
For efficiency of presentation we shall in many
cases be overly restrictive in the assumptions made,
for example we consider only Hilbert spaces. Also
the development is purposely c0nstrained so as to
ilse classical results on the Bernstein polynomials.
*Supported in par .. by the United States Air Force Office of Scientific Research, Grant No. 77-0352.
**Department of Electri_al Engineering, Louisiana State University, Baton Rouge, Louisiana.
2
In closing this introduction it is noted that the
admittance characteristic of the enhancement mode
MOSFET transistor provides an almost exact square
law (see [9]). It is easily shown that squaring
devices can be used to construct general polynomic
operators of the type necessary to realize the
Bernstein system. Thus it appears that the topic
of polynomic system approximation may have a ready
practicality in terms of microcircuit technology.
2. WEIERSTRASS APPROXIMATION
As the technical work of this study deals primar
ily with the Weierstrass approximation it is use
ful to comment in somewhat more detail on the
existing literature. In this regard we cite first
the original contribution of Weierstrass [10]
whose fundamental result in contemporary form
reads as follows.
Let f be an arbitrary continuous function on R,
the real line. Let D cR be an arbitrary compact
set. The~for every E > 0 there exists a finite
polynomial p, such that sup {If(x) - p(x)l:
XED} < E. The Weierstrass result, over the years,
has drawn the attention of several distinguished
mathematicians including Frechet [11], Bernstein
[12] and Stone [13] who investigated the relation
ship to power series expansions, constructive
methods for finding the polynomial and extended
the result to Rn among other things.
More recently Prenter [14] considered a real sep
arable Hilbert space H, and showed that if K c:: H
is compact, £ > 0 and f continuous on H then there
exists a finite polynomic operator p, such that
~~~llf(X) - p(x)11 < E
In a similar effort Prenter [15] and Ahmed [16]
were able to use normed linear spaces.
The Prenter result [14], for example, states that
if f is a continuous function on real L2
(a,b) then
on every compact subset there exists a finite
number of kernels kO' kl, ... ,kn such that p of
equation 1 is an E-approximat~on for f. Actually
we would suspect more, namely, that if f is causal
then each kernel ki is causal, tha: is for instance
3
0, T > t. More generally, can a caus-
ality structure be superimposed on the function
and its approximation? This question is answered
affirmatively in [17] and [18].
The setting for [17] and [18] is a Hilbert resolu
tion space* {H,pt } where H is real and separable.
The set K c:: H is always compact. The sets: C, SC,
M, C(K), and P denote the causal, strictly causal,
memoryless, continuous on K c: H, and polynomic
functions, respectively, on {H,pt }. For brevity
we shall say that P is dense in C(K) in the sense
of Prenter's theorem.
The results of [17] include the following. The set
P" SC is dense in elK) n SC. In L2 the stronger
result that pO SC is dense in C(K) A C is also
established. This last result does not abstract.
In t2 it is known [18] that P n SC is not dense in
C(K) " c.
All of the above results are nonconstructive in
that they give no clue as to finding the polynomic
approximate of a given function. On the real line,
however, several constructive forms of the Weier
strass result do exist and the Bernstein polyno
mials constitute one of the more intriguing
approaches to such constructions. In the study we
develop a generalization of the Bernstein poly
nomials to real Hilbert space. Using a causal data
interpolation scheme identification of the p E P~SC
that approximates f E C(K)~C results.
To be more explicit let {H,P}be any Hilbert resolu
tion space. The set QCH is compact and E > 0 is
arbitrary, ~ denotes a tuplet of indicies. fC .!l
denotes a causal polynomial constructed explicitely.
We summarize our first result in the following
theorem
Theorem (1) If causal f is bounded on n then
for each X£Q and £>0 there exists .!l such that
I If(x) - fC(x)1 I < E at every continuity point n
of f. If f is continuous on n then .!l exists
such that ~~~I If(x) - f~(x)1 I < E.
The explicit nature of theorem (2) has answered the
central theoretical questions in rather complete
form. It is of interest, however, to give a real
ization of the causal Bernstein function, fC
, in ~
state variable form. For this attention was fo-
cused on the real 12 [a,d] for O<d<oo e~uipped with
the usual inner product and the resolution of the
identity taken to be the familiar truncation
operators.
A second result which has been established is
summarized in the theorem
Theorem (2) In 12
, for every f E C(K)~ C and
E > a, there~ists a differential system
~(t) A(t)z(t) + B(t)u(t), z(O) = 0,
wet) ~(z(t),t)
where ~({·),t) is polynomic, such that the
map w = p(u) satisfies
s~PI If(u) _ p(u)1 I < E.
For obvious reasons the e~uations of this
theorem are called a Bernstein system.
4
REFERENCES
[1] Volterra, V., Theory of Functionals, Dover Publications Inc., New York, 1959.
[2] Bedrosian, E. and S. O. Rice, "The Output Properties of Volterra Systems Driven by Harmonic and Gaussian Inputs," Proc. IEEE, Vol. 59, pp. 1688-1707, 1971.
[3] Parente, R. B., "Nonlinear Differential Equations and Analytic Systems Theory," SIAM J. Appl. Math., Vol. 18, pp. 41-66,1970.
[4] Van Trees, H. L., "Functional Techniques for the Analysis of the Nonlinear Behavio~ of Phase-Locked Loops," Proc. IEEE, Vol. 52, pp. 894-911.
[5] Brockett, R. W., Volterra Series and Geometric Control Theory, Automatica, Vol. 12, 1976.
[6] Bruni, C., DiPillo, G., Koch, G., "On the Mathematical Models of Bilinear Systems," Ricerche di Automatica, Vol. 2, pp. 11-26, 1976.
[7] Krener, A. H., "Linearization and Bilinearization of Control Systems," Proc. 1974 Allerton Conf. on Cir. and Sys. Th., pp. 834-843, 1974.
[8] Gilbert, E. G., "Volterra Series and the Response of Nonlinear Differential Systems," Trans. Conf. on Inf. Sci. and Systems, Johns Hopkins University, March 1976.
[9] R. S. Cobbold, "Theory and Applications of Field-Effect Transistors," Wiley-Interscience, New York, 1970, Section 7.1.3.
[10] K. Weierstrass, "Uber die Analytische Darshell-bankeit Sogenannter Willkurlicher Funktionen Reeler Argumente," Math. Werke, III Bd., 1903.
[11] M. Frechet, "S~r les Fonctionelles Continues," Ann. de l'Ecole Normale Sup., Third Series, Vol. 27, 1910.
[12] S. Bernstein, Demonstrqtion du Theoreme de Weierstrass, fondee sur Ie calcul des probabilities, Com. Soc. Math., Kharkow, (2), 13, 1912-13.
5
[13] M. H. Stone, "The Generalized Weierstrass Approximation Theorem," Math. Mag., Vol. 21, pp. 167-183, 1948.
[14] P. M. Prenter, "A Weierstrass Theomem for real Separable Helbert Spaces," J. Approximation Theory, Vol. 3, No.4, pp. 341-351, Dec. 1970.
[15] P. M. Prenter, "A Weierstrass Theorem for Real Normed Linear Space," Bull. American Math. Soc., Vol. 75, pp. 860-862,1969.
[16] N. W. Ahmed, "An Approximation Theorem for Continuous Functions on Lo spaces," Univ. of Ottawa, Ottawa, Can., TR-73-l8, Nov. 191 .
[17] W. A. Porter, T. M. Clark, and R. M. DeSantis, "Causality Structure and the Weierstrass Theorem," J. Math. Anal. Appli., Vol. 52, No.2, Nov. 1975.
[18] W. A. Porter, "The Common Causality Structure of Multilinear Maps and their Multipower Forms," J. Math, Anal. Appl., Vol.. 52, No.2, Nov. 1975.
A LINEAR SYSTEMS THEORY IN MULTIDIMENSIONAL TIME
A. V. Balakrishnan Department of System Science
university of California Los Angeles, Ca. 90024
ABSTRACT
A theory of linear systems which do not have a time-like parameter is formulated. The input-output properties of such systems are studied and a notion of state is developed. The resultant theory is applied to the study of Markov random fields.
6
I _
M.
OPTIMAL CONTROL IN HILBERT SPACE
* * * Steinberger, A. Schumitzky and L. Silverman * Department of Electrical Engineering ':'Department of Mathematics
University of Southern California Los Angeles, California 90007
Abstract
Using the key concepts of causal factorization and state space under Nerode equivalence, we show that for a bounded, linear, strictly causal operator on Hilbert resolution spaces with quadratic cost functional, the optimal control may be expressed in memoryless state feedback form provided the forcing function is expressed as an initial state. We further conjecture that there is no causal feedback which realizes the optimal control for a larger subspace of forcing functions.
I. IN:TRODUCTION
In this paper will will examine the optimal control of a linear system with respect to a quadratic performance criterion. In particular, we will be interested in the cases in which the optimal control can be put in causal state feedback form. This problem was solved for the case of finite dimensional differential systems of the form
input-output point of view rather than from an explicit representation for the dynamics of the plant to be controlled. That is, we follow Porter [9] in studying systems of the form
y = Tu + f
where T is a linear, bounded, strictly causal operator from one Hilbert resolution space HI to another, HZ; and f is a given forcing function. Using the concept of Nerode equivalence promoted by Kalman [10] and since adopted by many other authors, we construct a state space for T. We then go on to construct a Hilbert resolution space of state trajectories. Given this space of state trajectories, we then show that if the operator (I + T*T) admits a strictly causal factorization (in the manner of Gohberg and Krein (11]) and f is generated by an initial state, then the optimal control can be realized in memoryless state feedback form. Furthermore, the optimal feedback operator can be extracted fairly directly from the factorization of (I+T*T).
x Ax + Bu
y Cx
by Kalman [1] with highly successful results. Subs equent efforts have been directed toward expanding Kalman's results to cover systems described by generalized versions of the above equations (see Delfour and Mitter [Z], Lions [3], Lukes and Russell [4]. Datko [5], Pritchard [6], and Curtain and 1critchard [7]).
Recently, we have written a paper [8] in which we attack the optimal control problem from an
This work was supported in part by the National Science Foundation under Grants ENG 76-14379, GP - Z0130 and by JSEP through AFOSR/AFSC under Contract F446Z0-7l-C-0067.
7
In this paper we will review the pertinent details of [8]. Section II will contain the necessary mathematical background and major supporting theorems, Section III will present the actual optimal control results, and Section IV will give some brief concluding remarks.
II. MA THEMA TICAL BACKGROUND
II. 1 Resolution of the Identity
A Hilbert space H is said to be equipped with a resolution of the identity if for every t e [to' t..,] a closed subset of IR, 3: pt:H"'H 3
to (i) p u 0 Vue H
(ii)
(iii)
(iv)
to:> p u u
ptp" = p" pt = pt
t t (u l , P uZ)H = (P u l ' uZ)H
Vue H
V ,. ~ t
Furthermore, we can define the complementary projection Pt: H'" H, Pt '" I - pt from wh ich we have I = pt + Pt' motivating the term "resolution
of the identity."
In the rest of this paper, we will associate with HI (HZ) the family of projections pt(i5
t).
A more complete treatment of this subject and the subject of causality can be found in the excellent article by DeSantis (13].
II. 2 Causality
In common usage, an operator is said to be causal if and only if past outputs are only effected by past inputs. In mathematical terms
we have:
Definition Z.l: An operator T is ~ iff
ptTPtu = ptTu , V t e [to' to>], u e HI
Similarly we have:
Definition 2. Z: An operator T is anticausal iff
An operator which is both causal and anticausal is called memoryless since its present output depends not on the past or the future but on the present input. Under reasonable conditions, any causal operator can be additively decomposed into the sum of a memoryless operator and an operator which has no memoryless part. An operator which has no memoryless part is said to be strictly causal. We can define
strict causality more precisely using partitions
of the time set [to' t." ] •
Let 0 be a countable family of finite partitions of the time set [to, to>] with the following
properties:
8
(i) 11 cO VnelN n n+I
(ii) for every t e [to' to>] and e > 0 3: n, k :;I
o < tk
- t < e n
t k+l
For notational simplicity let t, k - P k P n n t
t k +l ~k ~ ~ n (:, - Pk P
n t n
causality:
n
Then we can define strict
Definition Z.3: An operator T is strictly causal
iff T is causal and
lim N(n) ~k k :0 [:; T (:,
n n o
k=O
for any family as defined above. This limit is
taken in the uniform topology.
The following results are immediate:
Pt T Pt = T Pt
pt T pt = T pt
(3) If S:Hr-+Hz is strictly causal and T: HZ'" H3 is causal and bounded, then ST is strictly causal.
(4) If T: HI ... H Z is causal and bounded, and S: HZ'" H3 is strictly causal, then TS is
strictly causal.
For convenience, we will hereafter call an operator T an LBSC operator if it is linear,
bounded, and strictly causal.
II. 3 The State Trajectory Space
In order to talk about state feedback in a Hilbert resolution space setting, we first need a space of state trajectories which is a Hilbert resolution space. Such a space' is constructed
in [8].
The most basic concept underlying our construction of a state space is the concept of Nerode equivalence, which, in our setting is defined as follows:
Definition 2.4: Two inputs U 1,u2 €H I are Nerode equivalent at time t with respect to operator T, written u I , T': t u 2 , iff
~ t ~ t P
t T P u
l P
t T P u 2
Clearly "T, t" is an equivalence relation over Hi' so that it is possible to form equivalence classes of HI under "T' t". This set of equivalence classes, XT(t),' will be called the "state space of T at time t." Using a fundamental theorem of modern algebra, Saeks [12] has shown that ESt T pt can be decomposed into the product of two mappings, kT(t) and gT(t) in such a way that the following diagram com-mutes:
H -H
lk~ ~)2 XT(t)
We then go on to form, for any LBSC operator T, a Hilbert resolution space of state trajectories which may be thought of informally as being of the form
x( .) where x(t) € XT(t) for each t € [to' tex> ]
We call this space XT
and define kT and gT es sentially as
(kT u)(t)
and
We prove that gT is memoryless (as has been stated by Saeks [12]), kT is strictly causal, and T = gT kT • The reader wishing a more rigorous treatment should refer directly to [8] as we have glossed over many important details here.
Having the required tools, we then go on to prove a theorem concerning the relationship of open loop control to state feedback control,
9
motivated by an analogous theorem by Hautus and Heymann [14] in the discrete time case. In order to quote this theorem, we will need a lemma due to DeSantis [13] which states that if W is LBSC, the (I+W)-I=I+V where V is LBSC. Then we have
Theor ern I: Given k T : HI -+ X T , kT an LBSC operator, and a dynamic input transformation (I+W):HI-+H I , where W is LBSC (therefore let (I+W)-I=I+V where V is LBSC).
Then there exists linear bounded memoryless state feedback F: X T -+ HI such that
v u + Fx
x and
v (I+W)u iff
The above condition may be interpreted as saying that the state of V may be found as a part of the state of T for any input at any time. The systems governed by the above equations are diagramed below.
u x
( a )
u x
( b )
Figure
What Theorem I says is that when the state of V can be found in the state of T, the open loop controller of Figure I b can be implemented as a state feedback controller of the type shown in Figure lao The solution to the optimal control problem will es s entially consist of deriving the appropriate open. loop controller and then showing that Theorem I applies, thus giving us the desired state feedback control.
III. THE OPTIMAL CONTROL PROBLEM
With the background given in the previous sec-
p
tion, we are in a position to state in full the optimal control problem we wish to solve.
The t Problem: Given
(1) Two Hilbert resolution spaces HI and
HZ;
(Z) An LBSC operator
(3)
(4)
find
T : HI -t HZ
t~
A forcing function f = T P u;
Z A cost functional J(u, t, f} = I P t u I H +
IPt(Tu+f}l~ ; 1 Z
Uo e: Pt
HI :3 J(u, t, f} is minimized.
Note that the assumption f = T pt;: is equivalent to the assumption that the initial conditions of this problem are given as an initial state. In [8], we motivate the conjecture that all problems of this type for which an optimal state feedback law exists can be expressed in this form by first noting that f can be decomposed
as
and that the optimal control for fZ' is identically 0, and then using Theorem I to suggest that a causal feedback solution only exists when f=Tpt;:.
The t-problem may be easily solved using the projection theorem (15] to yield
However, since T*T is neither causal nor anticausal, the causality properties of this solution are still very much in doubt. Much can be resolved, however, through the use of causal factorization theory.
We will now assume that (I+T*T) admits a strictly causal factorization; that is, we will as sume that there exists an LBSC operator V
such that
,-1+ T T
(therefore let (1 + V}-l = I + W where W is LBSC.) Then we have
-Pt(V + V* + V'''V} pt~ _p (V + V '''V) pt~
t
(V'" is anticausal)
t~
PT(I+V)UO
= -Pt YP u
(by multiplying by Pt(I+ W"-))
For notational Simplicity, we define
~
as the operator which maps the forcing input u
onto its optimal control uO •
Next we will prove that the (I + W) derived from factorization theory is exactly the dynamic input transformation needed to apply Theorem 1 to the optimal control problem.
10
Consider the feedback system of Figure Z
t~ P u
t~ u+P u
Figure Z
along with its associated equation
u t~
-Pt
V(u+ P u)
t~ (I + P
t V) u -P
t Y P u
but, since ue:PtH I , we have
t~ ~
P t u::: - (I + ~Nl P t V P u = Pt@(t) u
Thus the above system solves the t-problem for t~
f=TP u.
The following lemma will be found to be the central reason for the use of the notion of "state" in the solution of the optimal control
problem.
Lemma 1: Assume the strictly causal factorizati';,n 1+ T*T = (1+ V'''}(I + V). Then V t e: [to' tee ];
ul T,t uZ-'ul V':t uZ·
Proof: t t
Pt
T P ul
= P t T P U z ~, t ," t
Pt
T Pt
T P u l = Pt
T P t T P Uz
P T*T pt u = P T'~T pt u tIt 2
(T* is anticausal)
'" * t * * t Pt(VtV tV V)P ul
= Pt(VtV tV V)Pu2
* t * t Pt(V tV V) P u l = Pt(VtV V) P u 2
(V* is anticausal)
~ t * t Pt(ItV~)PtVP u l = Pt(ItV )PtVP u 2
or, multiplying by Pt(I t W*),
t t P
t V P u
l = P
t V P u
2
Further insight can be gained by multiplying the above equation by (I t W) to get the equation
Although it is not used in the theorem to follow, it is the heart of the concept. In words, this equation states that the initial state carries all the infor mation nec es sary for the derivation of the subsequent optimal control. This is exactly why state feedback optimal control is possible.
Note, ~ow, that by Lemma 1 and the fact that (I t W)-l = (I t V), Theorem 2 applies to the dynamic input transformation (I t W). Thus we have proved the following theorem:
Theorem 2: For any system of the form
y = Tu t f
wher e T: HI" H 2 is a bounded, linear, strictly causal operator on Hilbert resolution spaces, and I + T*T admits a strictly causal factorization, there exists a bounded· memoryless linear
t~ state feedback F: 'X T" H; such that V f = T P u, ~ e HI' t e [to,tCDj, the control Uo whic~ minimizes J(u, t, f) = \ :t(T P t u + f)\ H2 t \ P t U\H
l over
the clas s of admls sable controls u e P t H 1 satisfies
Review of Proof:
(1) P Uo minimizes J(u, t, f) over the class of admissable controls u e PtH l where f = T ptu:
iff PtUO = Pt@(t)U:.
(2) PtuO=!\@(t)\I: if£~PtUO=-PtV(PtUot pt \1:) where I t T~T = (I t V~)(I + V).
(3) Using Theorem 1 and Lemma I, there exists F linear, bounded and memoryless such that -V = FkT; 1. e., the following diagram commutes:
II
where F = -gv h •
IV. CONCLUSIONS
Using the key concepts of causal factorization and state space under Nerode equivalence, we showed that for a bounded, linear, strictly causal operator on Hilbert resolution spaces with quadratic cost functional, the optimal control may be expressed in memoryless state feedback form provided the forcing function is expressed as an initial state. We further conjecture that there is no causal feedback which realizes the optimal control for a larger subspace of forcing functions.
[1]
[2]
[3]
[4]
[5]
[6]
(7]
BIBLIOGRAPHY
R. E. Kalman, "Contributions to the Theory of Optimal Control:' Bol. Soc. Mat. Mexi~, 5 (1960), pp. 102-119.
M. C. Delfour and S. K. Mitter, "Controllability, Observability and Optimal Feedback Control of Affine Hereditary Differential Systems,"SIAM J. Contr., 10 (1972), pp. 298-327.
J. L. Lions, "Optimal Control of Systems Governed by Partial Differential Equations," Springer-Verlag, Berlin, 1971.
D. L. Lukes and D. L. Russell, "The Quadratic Criterion for Distributed Systems," SIAM J. Contr., 7 (1969), pp. 101-121.
R. Datko, "A Linear Control Problem in Abstract Hilbert Space," J. Diff. Equ., 9 (1971), pp. 346-359.
A. J. Pritchard, "Stability and Control of Distributed Parameter Systems Governed by Wave Equations," IFAC Conference on Distributed Parameter Systems, Banff, Canada, 1971.
R. Curtain and A. J. Pritchard, "The Infinite-Dimensional Riccati Equation for Systems Defined by Evolution Operators," SIAM J. Contr., 14 (1976), pp. 951-983.
[8] M. Steinberger, A. Schurnitzky and L. M. Silverman," Optimal Causal Feedback Control of Linear Infinite Dimensional Systems," submitted to SIAM J. Contr.
[9] W.A. Porter, "A Basic Optimization Problem in Linear Systems," ~ Sys. Theory, 5 (1971), pp. 20-44.
(10] R. E. Kalman, "Lectures on Controllability and Observability," CIME Seminar on Controllability and Obs ervability, Bologna,
Italy (1968).
[11] I. C. Gohberg and M. G. Krein, "On the Factorization of Operators in Hilbert Space," Am. Math. Soc. Trans., Ser. 2,
51 (1966), pp. 155-188.
[12] R. Saeks, "Resolution Space Operators and Systems," Springer-Verlag, New York, 1973.
[13J R. M. DeSantis, "Causality Theory in Systems Analysis," IEEE Proc., 64 (1966), pp. 155-188.
[I4] M.l... J. Hautus and M. Heymann, "Linear Feedback - An Algebraic Approach," Center for Mathematical System Theory, UniverSity of Florida, 1976.
[15J D. G. Luenberger, Optimization by Vector Space Methods, Wiley, New York, 1969.
12
., - / I
SOLVABILITY AND LINEARIZATION OF MONOTONE HILBERT NETWORKS
Vaclav Dolezal State University of New York at Stony Brook
Stony Brook, New York
Abstract
Sufficient conditions for solvability of a class of monotone Hilbert networks are given. Moreover, a linearization of a nonlinear monotone Hilbert network in a neighborhood of an operating point is discussed.
1. INTRODUCTION
In the first part of the paper we give sufficient
conditions for solvability of a Hilbert network
some elements of which are described by monotone
operators defined only on subsets of the underly-
ing Hilbert space. As a special case we consider
a finite nonlinear LRC-network, whose inductors
are linear, time-varying.
In the second part we discuss a linearization of
a nonlinear monotone Hilbert network in a given
neighborhood of an operating point, which is sub-
optimal in a certain sense. An estimate for the
difference between the exact and approximate
current distribution is established.
2. SOLVABILITY
A Hilbert network ~ is called solvable, if for any
excitation by EMF and/or current sources there
exists in ~ a current distribution obeying
Kirchhoff laws. Known effective results concern-
ing solvability ([1] or ThEDrems 4, 5 in [2J) make
the assumption that the operators describing
\3
network elements are defined on the entire under-
lying Hilbert space H. Consequently, these re
sults are unapplicable, if the network contains
d1fferentiators.
The theorem given below attempts to fill this gap.
It is based on properties of maximal monotone
operators [3J, [4J and has the following physical
interpretation:
If all elements in a (finite or infinite) monotone
network ~ which are described by operators defined
on the entire space are removed and a unit resis
tor is inserted in every branch, and if the net-~
work ~l thus obtained is solvable, then ~ is also
solvable.
To avoid repetition of definitions, we will use
the notation and concepts introduced in the survey
paper [2J.
Theorem 2.1. Let H be a real Hilbert space, let G
be a locally finite oriented graph having
c2
:s; ~O branches, and let D CH c2 , N; n D f ¢.
A C2 zl:n-H is a monotone operator,
A C2 C2 Z2: H - H is a hemicontinuous operator
such that A A IP
(Z2Xl - Z2x2' Xl - x2> ~ c II xl - x21
for all x~,
and p >1,
x e HC2 with some fixed 2
(2.1 )
c >0
(iii) the Hilbert network 711 = (21
+I,G) possesses C2
a solution for any e e H
Then, for any e eHc2 , the network ~= (Zl +Z2,G)
possesses a unique solution in B corresponding to
e.
Using Theorem 2.1, we can prove a solvability re
sult for a finite LRC-network with linear time-
varying inductors, whose underlying space is the
real space 12
[0,TJ (we will write 12 in the
sequel).
Definition. Let G be a finite oriented graph hav-
ing c2
branches, and let 1 S k S c2
•
(i) there exist loops sl, s2,
Assume that k
(ii )
•• , S such
that, for each m = 1, 2, ., k, the loop
Sm contains b and does not contain any m
other branch in the set tbl
, b2
, ••• , bk }, t(t) is a symmetric k x k matrix having a
continuous derivative t'(t) on [O,TJ such
that t(t) and t'(t) is positive definite
and positive semi-definite, respectively,
for each t e: [0, T J, A C c
(iii) T· 1 2 -1 2 is an operator • . 2 2A k c -k c
Let the operator 1 : K x 1 2 - 1 2 be defined by 2 2
(Lx)(t) = (1(t)x(t)} I, (2.2)
where 1(t) = Lr t(t) OJ is a c x c matrix, and K ° ° 2 2
is the space of all absolutely continuous functions
on [O,TJ. Then the Hilbert network 11 = (L+T,G)
will be called 1-proper.
Clearly, this definition requires that all induc
tors (and possible mutual couplings) are confined
only to the branch~s bl
, b2
, ••• , bk
• Also note
that the operator T describes the behavior of both the resistors and the capacitors in the network.
Theorem 2.2. Let G be a finite oriented graph
14
having c2
branches, let lSk S C2
and let j , j , A c c l 2
••• , jk be real numbers. Let T: 1/-122 be a
hemicontinuous operator such that, for some c > 0
and p >1, we have
A A liP (TXl - TX2 , Xl - x2) ~ c Ilxl - x2 C2 A A A
for all Xl' x2
e:12
, and let 71=(1+T,G) be 1-c A
proper. Then, for any ee:12
2 , '1lpossesses a
unique solution i = [i l corresponding to e n-o k c2-k 0 () 0
1 e:K X12
and 1 0 =J for m=l, 2, •. m m
such 1h1:t
• , k.
Observe that Theorem 2.2 has the following
physical interpretation: if a network '1l satis
fies the hypothesis, then for any EMF's el
, e2
,
••.• ' ec e: 12 and any values jl' j2' ••. , jk
there exi~ts in ~ a unique current distribution
il
, i2
, ••• , i e 12 such that, for m = 1, 2, c2
., k, im is absolutely continuous and satis-
fies the initial condition i (O)=j • m m
3. 1INEARIZATION
The problem dealt with in this part resembles the
small-signal analysis of nonlinear networks [5J,
but differs from it in several aspects. While in
the small-signal analysis a "strictly local" ap
proximation (Frechet derivative) is used for ob-
taining an approximate solution, in the present
approach we use a certain "global" approximation.
Also, the Hilbert network setting is, of course,
more general.
To explain the underlying idea, consider a Hilbert
network ~= (i,G). Assume that we know the current
distribution iO in '1l corresponding to some EMF
vector eO (operating point), and that we seek dis
tributions ie*' which correspond to excitations
eO +e*, where the e*'s satisfy the inequality
Ile*!1 Sr with some given r >0. To find approxima
tions to the ie*'s, we linearize the network '1l
in a vicinity of iO
• More specifically, defining ,... I ,. I '" A
the operator Z by Z S = Z( iO +S) - ZiO' we assume
that we can find a linear operator Zo which satis
fies the inequality liz 'e: -zosii sail e: lion a certain
ball centered at the origin, where a >0 is not
* t~~ large. If ie* is the s~lution of the linear
netw~rk ~o= (Zo,G) corresp~nding to e*, we will
take io + i* * as an appr~ximati~n to the solution A e
i * ~f 'TI. e
The overall relative error
A = sup II i *- (iO+i** )11·lle* i1-1
r lIe*ll~r e e e*f 0
(3.1)
depends, of course, ~n the choice ~f ZO' and con
sequently, on the constant a. However, in speci
fic cases ~f networks it is usually not difficult
to c~nstruct Zo s~ that, for a given r >0, a is
small en~ugh. In this context, let us emphasize
the fact that taking the Frechet derivative of
Z at iO f~r Zo (a counterpart of the small-signal
analysis) need not lead to the smallest values of
a.
It turns out that if Z is strongly monotone, we
can give a simple upper bound for Ar' which is
roughly prop~rti~nal to a for a small.
The theorems that follow use again the notation
introduced in [lJ or [2J. Without loss of gen
erality we assume that the operating point eO and
the c~rresponding solution iO of ~ are zero. Also
note that Z is not required to be single-valued.
Theorem 3.1. Let H be a real Hilbert space, let
~ = (Z ,G) be a Hilbert network with Z being 'a set c
mapping defined on a nonempty subset D CH 2 such
that 0 € D, and let r >0. Furthermore, let A c c
F=X*(NAnD)CH 0, and let W :F ... e(H 0) be defined a A*A.A
by W = X zt.. Assume that
(i) there exists b >0 such that
(Yl -Y2' xl -x2)~b Ilxl - x211
2
for all xi €F, Yi €WXi ' i=l, 2,
(ii) there exists a linear bounded operator
(3.2)
c c ZO:H 2"' H 2 and a constant a with O<a<b
such that
15
I .... * .... ,. for all x €F nB and y eWx, where Wo =X ZOX,
c R -1 B~= (x:x€H 0, IIxll~R} and R=b r,
(iii) F nB~ is dense in B~.
If i €Hc
2 is the (unique) solution of ~ correspond-c A *
ing to some e* € H 2 with Ilx*e*1I ~r, and i is the
(unique) solution of "b = (Zo,G) corresponding to
e*, then
II i - i* II :s;ab-l(b -aflllx*e* 11, -1 -1
i.e., A ~ab (b-a) • r
(3.4)
Moreover, in (ii) the requirement that (3.3) holds
can be replaced by the stronger assumption that
r c2 for all v € D nBR
and u e Zv, where BR = tV : v € H ,
11 v II :S;R}.
A
The assumption concerning D and Zo made above can
be modified. Indeed, we have
Theorem 3.2. The Theorem 3.1 remains true, if its
conditions (ii) and (iii) are replaced by the
following assumptions:
(ii)* BRCD for R=b-lr, A C2 c2
(iii)* there exists a linear operator Zo : H "'H
and a constant a with O<a <b such that (3.3)
holds for all x € B~ and yeWx.
If, in addition, c
operator on H 0,
* c2 for any e e H ,
c D = H 2 and W is a hemicontinuous
A
then ~ possesses a unique solution c
i.e., (3.4) holds for any e*eH 2
Note that still another (but weaker) result like
Theorem 3.1 can be proved, provided it is assumed c
that D is a linear subspace of H 2 and ~ is
solvable. However, we omit the details.
References
[lJ V. Dolezal, Nonlinear Networks, Elsevier Scientific Publishing Co., 1977.
[2J , Basic Properties of Hilbert Networks, IEEE Trans. CAS, vol. CAS-23, #8, (1976), pp. 490-497.
[~l H. Brezis, Monotonicity Methods in Hilbert
Spaces and Some Applications to Nonlinear Partial Differential Equations, Contributions to Nonlinear Functional Analysis, E. Zarantonello (ed.), Acad. Press, 1971, pp. 101-156.
[4J R. T. Rockafellar, On the Maximality of Suma of Nonlinear Monotone Operators, Tran& Amer. Math. Soc., 149 (1970), pp. 75-88.
[5J B. Feikari, Fundamentals of Network Analysis and Synthesis, Prentice-Hall, Inc., 1974.
16
j
TIME-VARYING INPUT-OUTPUT SYSTEMS WHOSE SIGNALS
ARE BANACH-SPACE-VALUED DISTRIBUTIONS
A. H. Zemanian State University of New York
Stony Brook , N. Y. 11794
Abstract
The composition of an operator-valued distribution and a Banach-space-valued distribution is established by extending Schwartz's kernel theorem in such a fashion that it has the form of the Cristescu-Marinescu composition of scalar distributions. This provides a representation for many linear continuous timevarying systems.
1. INTRODUCTION
The idea of a time-varying Banach system, which
was defined and analyzed in a prior work [10], led
to a study of composition operators acting on
spaces of distributions that take their values in
Banach spaces. Two types of composition were con
sidered in [10]. The first, which was called
"composi tion • " makes use of Schwartz's kernel
theorem [8], [11] and its extension to Banach
space-valued distributions. It provides an explic
it representation n = f. for every continuous
linear mappingnof £leA) into [t); B] by means of
the composition product f-v, where .f is a distri
bution on the real plane taking its values in
[A; B] and v~D(A). Here, A and B are complex
Banach spaces, [A; B] is the space of continuous
linear mappings of A into B,1)(A) is the space of
infinitely differentiable (i.e., smooth) A-valued
functions of compact support on the real line
supplied with its customary topology, 1:> = D(C), C
being the complex plane, and PD; B] is the space
of B-valued distributions equipped with the topo
logy of uniform convergence on the bounded sets
of 1). A shortcoming of this representation for n is that it is defined only for certain suitably
restricted continuous functions v and not for
singular distributions v.
17
The second type of composition, which was called
"composition 0" in [10], is an extension to Banach
space-valued distributions of a composition product
first introduced by Cristescu [2] and subsequently'
developed by Cristescu and Marinescu [3], Sabac
[7], Wexler [9], Cioranescu [1], Pondelicek [6],
and Dolezal [4]. In contrast to composition. ,
not all continuous linear mappings ofi)(A) into
[D; B] can be represented by a compositiono opera
tor. However, composition 0 has the virtue that,
when it does exist, it can be applied to singular
distributions.
The present work is aimed at this gap between com
posi tion. and composition o· . A formula is given
for extending composition. , which we henceforth
refer to simply as "composition", onto singular
Banach-space-valued distributions by using
Cristescu's form of composition. The idea is as
follows. For the sake of simplicity let us con
sider the case where f is a scalar distribution on
the real plane and v and ~ are members of~. Let
us also assume that f is of finite order so that
f(t,x) = (-D )i(_D )rh(t,x) where h is a continuous t x
function on the real plane and Dt
= J /d t. (These
assumptions will not be imposed subsequently.)
Then, by the composition arising from Schwartz's
£i£ ;is
kernel theorem, we formally write
<'f·v,q,> = Hh(t'X)D~cj>(t)D:V(x)dtdX
= S(D~V(x)Jh(t,X)D!q,(t)dtdX
= <V(X),(~Dx)rJh(t,X)D~q,(t)dt>
= (v(x) '(Yx(t) , Ht»>
(1.1)
where Yx is a mapping of!) into C and hopefully a
distribution depending on the parameter x. The
important thing is that the right~hand side of
(1.1) is precisely Cristescu's form of composition,
and therefore, if it turns out that (Yx,q,) is a
sufficiently smooth function of x, then (1.1) pro
vides a means of extending the operator f· onto
some space of singular distributions v. This
paper gives a rigorous presentation of this idea
and overcomes the complications that arise when
the assumptions that f is of finite order and
that f and v are scalar-valued are dropped.
The present work extends the discussion in [10] in
still other ways. For instance, this discussion
encompasses the larger class of composition oper
ators f - that are defined only on finite-order
distributions v [5], in contrast to the smaller
class considered in [10] of operators defined on
infinite-order distributions v. In addition, our
present results imply an estimate on the order of
the resulting composition product, something that
was not available in [10]. Another generalization
is that distributions on multidimensional euclid
ean spaces are now allowed, whereas [10] was re
stricted to distributions v on the real line.
It should also be pointed out that, although
Schwartz's extensive work [8] discusses the com~
position product f-v of vector-valued f and
scalar-valued v (see pages 124-126 of volume 7 in
[8]), it does not discuss the problem attacked
herein, namely, that of defining f.v, where f is
an operator-valued distribution and v is a Banach
space-valued distribution, both of which may be
singular and of arbitrarily large order.
2. NOTATIONS
The notations used in this work are the same as
those of [11]. The reader should refer to that
18
work (especially to Section 1.2 and the Index of
Symbols) for the definitions of any symbols not
defined herein. There is one exception however.
In this work we will assign a more specialized
meaning to the symbol RS as follows. m will always e denote an ordered s-triple {ml ,m2,···,ms }' each
component of which is either a nonnegative integer
or 00 For example, {2,~,0} is such a 3-tuple
RS denotes the collection of such s-tuples e
Throughout this work we will always have m~R: and . n JERe'
pS is s-dimensional real euclidean space. K will
be a compact interval in RS , and K will be its in-
terior. If HRI or T=CO~ the notation [T] denotes
the s-tuple all of whose compontents are equal to
T. However, [0] is denoted simply by O. A and B
always denote complex Banach spaces with norms
V-itA and f{ '/IB
respectively. If U and V are
two topological linear spaces, the symbol [U; V] denotes the linear space of all continuous linear
mappings of U into V. Unless the opposite is
explicitly indicated, we always assign to [U; V] the topology of uniform convergence on the bound
ed sets of U, which we call the "bounded topology".
Thus, for instance, [A; B] is assigned its opera
tor-norm topology.
Let q, be a function from RS into some
Banach space. When we say that q, is continuous
or has a derivative, it will always be understood
that the continuity and derivative are with re
spect to the norm topology of the Banach space.
Thus, for example, if the Banach space happens to
be [A; B], the said continuity is with respect to
the operator-norm topology of [A; B]. Let
k = {k ,"',k } be a nonnegative integer in RS
I s (i.e., every component kv is a nonnegative in-
teger.) Any {partial) derivative
J Ikl q, -k k-
d t 1 ,)t s s I k I = k 1 + ... +k s
. k Ikl of q, wlil be denoted by D q, = q, . We will also
write D~ q,(t) = q,(k) (t). We shall refer to k
(and not Ikl)as the order of the differential
operator Dk. The notation Ikl should not be
confused with the magnitude notation for the
members of RS
•
The support of any function or distribution f on
RS is denoted by supp f.
We will use the standard function spaces ~~lA), .DR~(A), c~~(a), and (?~(A), equipped with their
customary topologies. All of these are defined in
[10] and also in [11]. When m=oo, we drop the
superscript m. Similarly, we drop the subscript
RS to write 1J meA) = nm(A) and E. meA) ;: [meA) AS R'
whenever there is no need to specify the euclidean
space RS•
Schwartz's Kernel Theorem. n is a sequentially
continuous linear mapping of b,(A) into [DR"; B]
if and only if there exists an 'f € [.DR"+s (A); B] such that flv = f.v for every v~ D (A). Here, f is
R' uniquely determined byn, and conversely. The
composition product fov is defined by
<f.v,~> = <f(t,x), ~(t)v(x» (2.1 )
where ~fi 1) , Vii:!> •. (A), ttRn
, and xliRs
. R" R
A proof of this theorem is given in [11; Chapter
4] .
3, COMPOSITION OPERATORS ON [em ;A] INTO [~j ;B]. . RS R"
Lemma 1. Given any v£[em; A], define q by . v
(q , Fe) = F(v, e> v
(3.1)
where F~[A; B] and eED. Then, there exists a
unique Vf[em+[2] ([A; B]) whose restriction to the
elements of J)( [A; B]) of the form Fe, where
F€[A; B] and e~~ coincides with qv' Moreover,
supp v = supp v. In addi tion v ~V is a sequent
ially continuous linear injection of [em; A] into
[Em+ [2] ([A; B]); B].
The proof of Lemma 1 is quite similar to that of
Lemma 4-2 of [10]. The assertion concerning the
supports is an easy consequence of the Hahn-Banach
theorem.
Next, let VE£m(A). Then, the same arguments as
those used for Lemma I show that there is a unique
V€[~m+[2]([A; Bl); B] which coincides with q on v
all elements of the form Fe. Furthermore, v gen-
erates a unique member of [~o; A], which we also
denote by v, through the definition:
(v, ~> : J v(x)~(x)dx ~EDo. It'
19
(Here, the superscript ° in ~O denotes the s-tuple
[0].) Moreover, v defines an [[A; B]; B]-valued
function v' on RS by the definition:
v'(x)F = Fv(x) Fe[A; B], x~Rs. By the Hahn-Banach theorem, supp v' supp v and,
for each x, v(x)~v'(x) is injective. Some
straight-forward arguments establish
Lemma 2. Let v€~(A). Then, for every integer
k€Rs with O~k~m, we have that Dkv' exists, is
continuous from RS into [[A: B]; B], and therefore
generates a regular member of [.Do ([A; B]); B].
Moreover,
k k D v'(x)F = FD vex)
for every F€[A; B]. Finally, v'=v in the sense of
equali ty in [1)[2] ([A; B]); BJ, and supp v' = supp v. = supp v'. Now, let fEr .DRn,,$(A); B]. Choose any two compact
intervals IeRn and VCRS. It can be shown that
there exist a jeRn not depending on I, an e
hfe~KL ([A; B]), and two nonnegative inte~ers itRn
and n;Rs with i::;j such that, for all ~Et>i and
ve.f)L (A) ,
<~.v,~> = <f(t,x), ~(t)v(x»
• SISLh(t,X).(i) (t)v(r) (x)dtdx.
1'.'e May now convert the integral on the right-hand
5i~e into a reneated inteRral:
S,[Srh(t,x)cl> (i) (t)dt]v(r) (x)dx.
Assume that the inner integral has continuous de
rivatives on L of at least order r+[2]. Then, we
mar integrate by parts Irl times to obtain
Sr,fn:rrh(t,x)+(i) (t)dt]v(x)dx
jir ~ (-1) 'rl pr x x
" Let us now assume in addition that supp vC L. By
virtue of Lemma 2 and its notation, we may write
<f.v,,> = S"v' (x)n: SIh(t,X)cl>(i) (t)dtdx
= <~(x), n:Jrh(t.x),(i) (t)dt)
Tri s result mot ivates the following assumption
and definition, which provide a means of extend
inp the definition (2.1) of the composition oper
ator fo: vl+f.v onto the space [E~; A]. ~
u
Assumption. AssuMe tl·at c{)rresllondi nr to a .given n I m£P~ f"('r
f"t.{DR" .. CA); P.] tt..ere is a j€Pe am an e
Nhich the f"ollol-'inr condition!': arC' .<;atisfjecl.
Ct... 01' ce 04' the comnact ; nterva1 s TCpn ~r ~C~ n
1 Icns tj1ere exi st an rEer I (fA: TIl) aNl h:O anl " , .X . . r 1 pS • th .... h nonnepative inte?er~ lEP anr rE. ~1 • l.) suc
i trat, for all ¢E.flr and all vd\(,h),
(f(t ,x), Ht)vex» r') rr)
"STSyHt,xH·'·(t)v (x)rltdx
and, in ad,H tiol',
(n I'l+l+{'" Srh(t")<I>' (t)r1t€(\ .. ({.i\; nl).
nefinition.
Assumption. o
choose T nne! L ~UC1, t'·~t ,w'n ¢cT :m(~ SlInn veT,
Finn11", c"oose h, i, anel l' in accorchnc(' ",it"
tl,e ~s~l1fl'ntior. ,,'(' def"'ne (4'.v, <1» I,V
'A ~,.. J 'i) () > (r.v,¢> ~ (v(x), ''x T'·(t,xH t"t,
A "1+ r21 , .. here vis t.h:Jt !'1enh'r of rt <;' • (rA: nJ); l'\]
n' corre<;p{)n'~ ing to v i.n accordancp vi th LCT'lMa 1.
T1'coren. l'J1<'rT t "'(' ,\SSI1T'lnt ion a!'f1 nef"inition,
(f.v,¢) is inliepen,'ent {)I' t 1,I' C"nices 01' tl·c
paraneters T, L. 1~, i, and r amI is consistent
,dtl' (::.1). "orcover, tl,c onerator f.: \'~f.v is
a seouentially contimlous linear T'lanninp of"
[ & m. AJ into f.t).i; pl. cR' ' , R"
The proof of" this theoreM is nuite cxtensive an"
ldll appear dsel\'l'ere.
After maling annro"riate assUT'lntions on the SllP
nort of" 4', "'e cnn refine comn~~ition onerators
f". that man [.oR;: /I] into r~1I.~: P], ~E~:: A' into • 1'1 1 m
{E '. "J Tf)" /11 into r t .: "1, and rf) ; '\) ~., . ! . ~.' . II" , -
into (.(t J ; Al. )forrover, convolution can he
shown to h~ a special ca~e of ollr cOl"no~ition,
Tris ~.'orl ,va~ s\lnnorte~ 'w t"e "'<tt;.onal Science
Foundation under r.rant "re:; 75-"52('8 .~n:;,
r 1] I. rioranescu, 'Tami 1 i i eompozahile de oper
atori", St. r~TC. "at., vol. 1~ (1967),
nn. 44('\-454.
P] R. Cristescll, 'Tamille5 comnosaMe5 de dis
triblltjon~ et ~vstemes rr.ysiques lineaires",
!lev. Pown. "ath, Pures ~t "pn1., vol. 9 (1%4)
1'p. 703-711.
f3] R. Cristescu and r. 'larinescu, "lInele
Aplicatii Ale Teori~i nistrilmtiilor",
Fclitllra Ikademiei "enuhl icii e:;ociali5te
Romania, Rucuresti, 1~66.
r4] V. f'olezal and .J. Sanborn, "A representation
ct linear continuous operators on testin~
functions and distributions", Siam .. J. !lath,
Anal., vol. I (1970), np. ~Ol-S"(.
[5] .J. I'orvath, Topological Vector Spaces and
Distrihutions, vol. T, Addison-"'esley,
l!eadinr" /lass., 191'i6.
P,. Pondel ice~:, "A contri hilt i on to the foun
dation of networl' theory usinr distrihution
theorv", rzechos lova~' Hatr. ,~., vol. I!' r94)
(IQ69), nn. 6!'7-710.
.. C;ahac, ''ramilii COr:1nOzabi1e de distrihutii
si transformata Fourier:, "St, Cerc. ~!at.,
vol. 17 (1 0 ('5), ~p. 607-613.
rllJ L. .C;crwartz, "Tllforie des distributions a
valeurs vectod elIes", .~nnal es de 1 'T nst i tut
Fourier, tOT'll' 7 (1!'!i7) , nry. 1-130 , and tome 8
20
(IQ5!'), nn. 1-206.
[Q] l~. "'~xler, "Solutions periodioues et presoue
np.riodic des systcl'1cS d'enuations differen
tielles lineaires en di strihutions", ~
nifferent ial Fouat ions, vol. 2 (1966),
nn. 12-32.
nn) A. II. ?PT'lanian, "I\anach systems, llilhert
ports, and ""-norts", in :\etwork TheorY,
R. Roite (~d.), Gordon and Breac", ~ew York,
1972,
fIll A. II. Zemanian, Pealizahility Theorv for
rontinuous Linear Systems, Academic Press,
~!ew York., 1972,
Causality and Co Operators
Avraham Feintuch Ben Gurion University of the 'legev
Beersheva, Israel
Abstract It i~ shown that the class of C Contractions appear in a natural way in the algebra of causal
linear operators. Stability propertigs of such operators are studied.
1. Introduction:
The appearance of linear transformations 'J
which operate On infinite dimensional Hilbert
space has become quite common in systems theory
today. This is true both in the state space
theory and the input-output theory. In particular,
a significant number of authors ([1], [Il], [10],
[11], [14]) have noticed the relevance of the
Nagy-Foias model theory for contractions in
systems theory. An important suhclass of these
operators is the family of Co contractions. These
seem to be a quite natural ,generalization of
finite dimensional operators and such basic
finite dimensional concepts such as minimal
polynomials and .Jordan c,anonical forms have a
natural extension to the infinite dimensional
case.
The usefullness of these operators and the
unilateral shi ft operator to \~hich they are
closely related has been noticed by a number of
authors. Common to all of them is a state-space
approach.
My purpose here is to show how these
operators appear in a natural way in causality
structures and play an important role in the
theory of stability of linear feedhack systems.
~fost of the results presented here have appeared
in the literature in an operator theoretic setting.
However, this is, to my knOldedge, the first time
that they are presented from the point of view of
systems theory.
2. Causality:
Causality is usually descrihed on an complex
Hilhert space II in terms of a resolution space
21
structure. This is descrihed in detail in [17].
Here \~e present a slightly different formal ism
first descrihed in [17]. This will make it simpler
to translate operator theoretic results directly
into a systems theory framework.
'" Definition 2.1: A family:~ of suhspaces of II is a
nest if it is totally ordered hy inclusion. q is
complete if
(i)
(ii)
.A. {®}, fL'E N
forNoc~, n N :-.ltNo
and VN NtNo
arc in N.
for any 'I t ~l, ~I wi 11 denote the subspace
V (L : L t 'N and L c ~I).
with (0) = (0). If ~ #~, N is called the
predecessor of ~I.
1\ Definition 2.2: A nest N is maximal if
"\ (i) ~ is complete
(ii) for il E ~, dimel e '1)< 1
Definition 2.3: A nest space is a pair (H, ~) consisting of a lIilhert space II and a maximal
nest :-.l of suhspaces of 11.
PM will denote the orthogonal projection
with range '1.
The nest space structure allows us to define
causality in an ahstract setting.
" Definition 2.4: Let (II, 'J) he a nest space. A
bounded operator T on II is causal if 1':/ = 1'~ITP: I A
fOT all n E '\. T is ant i-causa 1 if Tl'q = 1':'ITP:'1
and memorvless if P.IT = TP~I for all :'f E ~'i.
rxaMple 2.1: The classical context of causality
is h'hen II = L2(-oo, co) and ~ = 1.2
(-<0, t) i'~' E iR). A
It is easily ch()cked that (H,N) is a nest space
and that the definition of causality given ahove
i;
coincides with the classical definition. It should /to
be noted that if H £ ~,~I ~I. ~ests which have
this property are called continuous. Throughout
this paper we will consider continuous nests
unless stated otherwise. This is done since the
corresponding results in the discrete case are
quite easily obtained.
3. Shift Invariant Subspaces:
Let m denote measure on the unit circle n
in the complex plane normalized so that men) 1.
L 2 will denote L 2 (1jI). Every f t L2 has a Fouries
f(e iO) = ! ina which converges to f in the n"-CO c e
2 2 n L -norm. H will denote the subspace.
112 = (f t L2 : f(e iO ) = ~o c eine
) . n- n
i. e. f t H2 if and only if its negative Fourier
coefficients are zero. The unilateral shift on 112
is the operator.
(Sf)(eiO) = eiof(eiS ).
. The basic facts about H2 and S can be found in
[12]. Any 112 function f has an analytic extrnsion
to the unit disc D = (z : Izl < 1) whose value. at
z will be denoted by fez).
The invariant subspaces of S were described in a
well-known theorem of Beurlinp,. For this we need
the notion of an inner function.
Definition 3.1: A non-constant function q analytic
in n is called inner if Iq(z)1 ~ 1 and Iq (eiS)I=1
almost everywhere on ~
Inner functions playa major role in the theory
of Co contractions and I will therefore briefly
discuss their structure. An elegant and complete
treatment can be found in [12].
If is inner then it can be factored as
rj>fil) = exB(A) s(A), Inl = 1
\~here B(A) is the Blaschke product determined by
the zeroes of rf> inside the unit disc and SeA) is
of the form _f2~eit+A
exp "t dUt , o e -1 + A S (A)
u being a non-negati.ve finite singular measure.
It is now possible tostate Beurling's Theorem
and to di.scuss its importance for cansality.
TZZT
Theorem 3.2: Let S be the unilateral shift on H2.
Then ~I c H2 is an invariant subspace for S if
and only if there exists an inner function rf> such
that ~I = rf>H2 = (rf>f : f t H2).
It will be more conventient to work with S* A
the adjoint of S. Every continuous chain N of
invariant suhspaces of S* has the form
~ft = (rf>t 1I2)"l where rf>t is inner. Moreover, each
rf>t is a singular inner function since the presence
of a Blaschke product would introduce a jump "-
discontinuity intoN.
The importance of such chains is that, up to
unitary equivalence, they are universal.
Theorem 3.2: [13]: Every continuous chain of
suhspaces (in a separable Hilbert space) is
unitarily equivalent to a continuous chain of
invariant subspaces of S*.
22
A
Thus given a nest space (H, N) with H
separable, infinite-dimensional we may as well 2 " 2 .l} assume that II = H and that N = {(rf>t II)' for
some chain {rf>t} of inner functions.
It is worth noting that the basic theory of
inner functions leads to the fact that
rf>l H2 c rf>2 H2 if and only if rf>2 divides 4>1' Thus
(rf>l 112)1;.:::> (4)2 112).l if and only if rf>2 divides 4>1.
It is then quite natural to use the expression
"a chain of inner functions". /'.
The identification of (H, N) with
(112, {(rf>t H2).l}) will allow as to show that the
algebra of causal operators contains many Co
operators.
4. Co Contractions:
In this section we present the relevant
material from the theory of Co contractions. No
attempt at completeness is made. A complete
treatment is given in [19, Ch. 3].
Let T be a contraction (IITII ~ 1) on H. Then
9[19], Ch. 1) there corresponds a decomposition
of II into an orthogonal sum of two subspaces
reducing T, say II = "0 III HI' such that the part
of T on "0 is unitary and the part of T on "1 is
completely non-unitary. This means that Ttlll has
no non-zero reducing subspace L for which T\L is
a unitary operator. This decomposition is unique
and 110 or HI may equal the trivial subspace (0).
We will concern ourselves with the case that
110 = (0); i. e. T is completely non-unitary.
By,,"" we denote that algebra of bounded
analytic functions on n with the usual norm (or 0;,
equivalenty, the subalr,ebra of L consisting of
those functions whose negative Fourier
coefficients vanish). Then if T is'c.n.u.
(completely non-unitary) it is possihle to define 00
the operator Ijl(T) for all ¢ t II
Oefinition: Co is the class of completely non
unitary contractions T for which there exists a
non-zero function UtII'''' such that u(T) = o. u £ I~ can he factored into u u. with u an
e 1 e outer function and u. an inner function. It is
1
shown in [19] that if u(T) = n then Hi (T) = O.
\lso if T is of class COlT has a minimal function
mT
Ivhuch divides every flinction u such that
ueT) = O. This function is determined up to a
constant factor of modulus 1. In the sequel the
spectrum of T will play an important role. This
is completely determined hy mT
. Let
m,.().) = fl(A)S(X) be the factorization of mT
•
Theorem 4.1: [19, p. 126]: Let fTIT he the minimal
function and aCT) the spectrum of the contraction
T of class Cf)' Let 5r he the set consisting of
the zeroes of mT in the open unit disc n rl.nd of
the compliment, in the unit circle lT of the union
of nrc! of lTon which mT is analytic. Then
aCT) = ST' The simplest examples of Co operators are
given in the following theorem.
Theorem 4.2 [19, p. 124]: Let he a non-constant
inner function and let 14 = (dlH2).L in 112. Then
the operators p~lsl~1 and S*I~1 he long to Co and
their minimal function is
~low we return to callsa 1 i tv. A.s seen ahove, 1\ . •
if (II, ~) is a nest space, we Crl.n assume it is
(112 , {(¢t ,,2):-}).
the
2'" suppose ~I = (<Pt
II) for some t and consider
operator T .... = P~ISP~I' By Theorem 4.2 T is Co'
23
It is easily checked that T is causal and T* is
anti-causal. Thlls the algehra of causal operators
is rich in Co operators.
I I~ould like to end this section hy giving an
example of a Co contraction in a more traditional
contex for feedback systems.
Example 4.3:
2 Let II = L (_00, (0) Id th the usual nest of " 2 suhspaces N = {I. (_00, a) _00 < a < oo}. Let F be
the convolution operator on II with kernel
x < ()
2e -x x > 0 f(x) =~
Let P he t projection on L2(0, 1) and consider
the operator I - PF. This is the return difference
from the feedhack system with input-output map F
and feedhack P (which is just a truncation). It
is sho\Vn in ([181) that I - PF is unitarilly
equivalent to P~ISP~1 where M = (tj>1I2).L with
z+l ¢ = exp(z_l)'
S. Feedback Sta~ility
Let (H, N) be a nest space and consider the
feerlhack system described hy the equations
y Ke + d
e = ry + u (1)
"here K and F are causal operators on (H, N) IVhich
are, respectively, called the plant and feedback
operators.
Conceptually, u is the input signal, d is the
disturbance in the output of the system; u and d
are considered the independent variables of the
feedhack system. The dependent variables are the
output y and the plant input e.
Comhining the abOVe equations, we obtain
(I - KF) Y Ku + d
(I FK) e u + rd
\Vhich relate the dependent variables to the
independent ones. It is clear that for y and e to
be uniquelY determined hy u and d \Ve need that the
inverses of I - FK and I - FK to exist. KF is
called the open loop gain and I - KF is the return
difference of the system. It is easy to see that
(I - KF)-l exists if and only if (I - FK)-l exists
and one is causal if and only if the other is.
Stability of feedback system is usually
defined in terms of extension spaces (see [17],
p. 65). Fortunately an important theorem of
IHllems ([20]) allows us to avoid this approach.
IHllems result will be used as our definition of
stability. What is lost in the intuitiveness of
the extension spaces is more than made up for in
mathematical simplicity.
Definition 5.1:
rhe feedback system (1) is well posed and -1 -1 stable if (I - KF) (equivalently (I - FK) )
exists as a bounded operator on " and is causal on
(II, N).
Here we consider the case where the return
difference ( I - KF~ is a Co ·contraction. While
we do not have a complete solution the result to
be presented.
Definition 5.2:
A contraction r on a separable IIilhert space
is essentially unitary if hoth I - r*r and
I - TT* are compact.
I~ is worth making some remarks ahout the
operators I - r*r and I - TT* for a contraction T.
rhe square roots Dr = (I - r*T)1/2 and
Dr * = (I - TT*)1/2 are called the defect operators
for r, the closure of their ranges DT, Dr * are
called the defect spaces and dr = dim DT
,
~* = dim Dr * are called the defect indices of T.
It is easy to see that dr = 0 characterizes
the isompetric operators and dT = dr
* = 0
characterizes the unitary operators. Thus, the
defect indices measure, in a sense, the deviation
of the contraction T from heing unitary.
Another way of looking at this is the
following. The unilateral shift defined in 3 is in
a certain sense a universal operator. If T is a
contraction such that rn ~ 0 strongly then T is
unitarily equivalent to a compression of some
multiplicity (possihly infinite) unilateral shift
24
to some co-invariant suhspace. ([19]).
Our restriction means that T is related to
a unilateral shift of essentially multiplicity.
In the case of essentially unitary Co contractions we always have l~el1 posedness and
stability ([17J).
Theorem 5.3: I f (I - KF)-l is hounded and I - KF
is an essentially unitary Co contraction, then
the system (1) is well posed and stahle.
6. Strict Causality:
It turns out that for most desired properties
of feedhack system the assumption of causality of
K and F is not enough. Thus the concept of
causality was strenghed in various directions hy
a number of authors (see, for example [21J, [2J,
r33]). One particularly useful direction is the
concept of strict causality introduced by
De Santis, Porter and Saeks ([2J, [4J, [17J). One
of the rather surprising aspects of strict
causality is tlJat Nhile it is in some sense
natural for linear systems it turns out to be
important for non-linear systems as well ([12],
(5]) .
The "property of causality for an operator
means, in the finite dimensional case, that it has
a lONer triangular matrix representation. Strict
causality Nill reduce, in this case, to a
strictly lower triangular matrix repersentation.
We present the concept very hriefly and refer the
interested reader to [17J.
Let E he the set of projections onto the
memhers of N. By a partition P of E is meant a
finite suhset.
P = {Ei : 0 < i ~ n}
of E such that
... < E = I. n
Ai will denote the projection El - Ei _l • A
partition PI is a refinment. P ~ Pl' Note that the
partitions of E form a directed set under refinment.
Let T he a hounded operator on II and P any
partition of N. Form the sum.
Lp(T) =i~l Ei _l TAi
Theorem 6.1 ([6], [17]): T is causal if and only , ,
for any E > 0, there ,exists a partition P of such
that for any refinement PI of P
IIJJp (T) II < s: • 1
It is worth noting that this is wquivalent
to strong convergence of Lp(T) to zero; i.e.
causality = strong causality. This will not be
the case for strict causality which we define now.
Definition 6.2: Let T be a causal operator on
" n (H,N). Let Vp(T) = E ~. T~. i=l 1 1
T is strictly cousal if for any E > 0 there
exists a partition P of E such that for any
refinement PI of P,
IIVp (T) II < E. 1
Intuitively this means that the elements of
the diagonal of the matrix of T with respect to
any partition are zero.
It is of interest that the property of strict
causality defines the spectrum of T.
Theorem 6.3 [6]: If is strictly causal, then T is
quasinilpotent; i.e. the spectrum of T consists
of the point {oJ.
An immediate consequence of this is that if
the open loop gain of the system (1) is strictly
causal, then the system is well posed and stable.
We now return to Co contractions and consider
the problem of classifying the strictly causal
operators of this class. As mentioned above. the
spectral properties of such operators play an
important role. These were summarized in Theorem
4.1. The only quasinilpotent Co contractions are
the nilpotent ones. Of equal interest is the case
where T has a spectrum consisting of a single
point which is not necessarily O. An examination
of the spectrum of T allows us (up to a similarity)
to two possibilities:
(i) aCT) 0
(ii) aCT) 1.
In the first case the minimal function of T is
just u(t) = zn. In the second it is
. z+l u(z) = exp a(~).
Both situations are included in the next
theorem.
Theorem 6.4:(£7]) If T is caulal and I - T*T is
compact. then A - T is strictly causal if and
only if aCT) = A. At this point I would like to mention two
open prohlems in strict causality, the first in
the context of Co contractions and the second in
more general context.
(1) Can Theorem 6.4 he extended; i.e. Can the
condition I - T*T be dropped? I conjecture that
the answer in general is negative though I'm not
sure why.
(2) Suppose T is a nilpotent causal operator.
Is T strictly causal. The answer is yes ifAthe
nest fJ· is discrete (dim M 9 ~I = 1 for ~1 E N).
7. Strong Strict Causality:
Definition 7.1: Let T he a causal operator on
(H,~)
for a partition P. T is strongly strictly causal
if for any E > 0 and x E II there exists an
partition P of E such that for any refinemeT.t PI
of P,
IIVp (T)xll < E. 1
Clearly strict causality inplies strong
strict causality. The converse does no~ hold.
An example that shows this as well as the fact
that strong strict callsality is a more natural
concept will he given helow. First however, I
would like to mention where this concept prove<
useful in Systems Theory.
C Causality plays a role not only in stability
prohlems hut also in state decomposition for
non-time-invariant systems (see [17]). ,\s ~':lS
shown hy Saeks ([17], p. 130) in the case ~here
T is strongly strictly causal, a minimal state
decomposition gives a complete set of invariants
for T.
Example 7.2: Let II = [2(0.00) the space of
sequences {an}oon=o such that r la 12 < 00.
n=O n
For sequence (a O' aI' 8 2 , ... ) let s(ao' aI' a 2 ,
) = (0, aO' aI' ••• ). This is just the
previously mentioned unilateral shift in a
different context.
25
n Let N - {V 8 i - M } where e i is the i-th
i-O n
co-ordinate vector. Then S is causal. It is not
strictly causal since a (5) • h I I z I " 1l. lIowever
S is strongly strictly causal. For the matrix of S
with respect to {e } is n
/ 0 ..........
f 1 0
\ 0 0
L~t E ~ 0, and x = (aO
' al' ••• ,) • Then choose .. ~ .. uch that :.\a \2 < c.
~ n
Let P be the partition {o'. PI' ••• , PN, J} where 1
Pi is the projection on ~ ek •
n Then r ~.S~. has the matrix
i= 1 1 1
n ....... ..
o
o o
where the 0 in the upper left corner represents an ~ x 'i hlick.
If dim /I < 00 then aCT) • {O},
Note that for T a Co-contraction any
eigenvalue A of T mllst have IAI < 1.
This is not always the case. One need only
take 5* which is strongly strictly anti-causal
(n dual property) and has a large point spectrum
{z I I z 1 < I}. I'le wi 11 present what we hope is a
reasonable hypothesis.
Prohlems 7.3:(1) If T is strongly strictly causal
then the point spectrum of T has only one
component; Le. T has no more than one isolated
eihenvalue:
(2) If T is Co with I -T*T compact and
o(T) c {Izl I} then T is strongly strictly
cousal.
(3) Can the assumption I -T*T compact be
sropped?
8. Conclusion:
have tried to show that Co cOntractions
are an important class of operators for the study
of problems in linear feedback systems. Under the
assumption that the defect operators of such a
contraction are sufficiently small we have seen
that systems involving Co coptractions are well
behaved. I have also mentioned a numver of open
quest ions I~hich I feel are worth considering and
are basic to the understanding of the behavior of
linear feedback systems.
Bibliographr
[1) ? .J.S. Baras and R.W. Brockett, "H--functions
Thus n and infinite dimensional realizat ion theory", l: fli S fli x = (0, .... aN' aN+I , ... ).
i= I SIAl-I .J. Control. n
It follows that I I l: fl. S fl. xl I < E. This also ;=, 1 1
dearly holds for any refinement of P.
~e note that for strongly strictly causal
operators the spectrum can be quite large. 1I0liever
the propert), that we would hope carries over from
the finite-dimensional space is that the point s
spectrum can not consist of more than one point.
This is in fact the case for the class of Co
contractions that we considered ([S»).
Theorem 7.3: Suppose T is a Co-contraction which
is strongly strictly causal. and su~h that I-T*T
i~ compact. If dim H = "". then o(T" el' z I = 1 }.
(2) R.H. DeSantis, "Causality for nonliflear
systems in llilbert space". ~Iath. Sys. tho 7, No.4
(1974), 323-337.
[3] , "Causality, strict causality,
and invertibility for systems in lIilhert resolution
space, SIAII .J. Control 12. No.3 (1974),536-553.
[4] R.H. DeSantis and N.A. Porter. "On time
related properties of non-linear system, SIAH .J.
App!. Hath. 24, :-lo. 2 (1973)'. 188-206.
[5) --------------, "On the analysis of feedhack systems with a polynomial
plant", Int . .J. Control 21. No.1 (1975), 159-175.
26
[6] A. Felntuch, "Causal alld strictly causal operators, to appear •
[7] ------, "on stability", to appear.
Co operators and feedbaCk1 I
[8] ------, "On strong strict causality for operators", to appear.
[9] P.A. Fuhrmann, "Realization theory in Hilbert
space for a class of transfer functions, J. Funct.
Anal. 18, No.4 (1975), 338-349.
[10] ------, "On realization of linear
systems and applications to some questions of
stability", Mat. Sys. Th .• 8, No.2 (1974), 132-141.
[11] J.W. IIilton, "Discrete time systems,
operator models and scattering theory" .J. Funct.
Anal. 16 (1974), 15-38.
[12] K. 1I0ffman, "Banach Spaces of Analytic
Functions", Prentice lIa11, Englewood Cliffs, N.J. 1962.
(13] T.L. Kriete, "Fourier transforms and chains
of inner functions", Duke Math. J. 40, No.1
(1973), 131-143.
(14] N. Levan, "The Nagy-Foias Operato~ ~iodels,
Networks, and Systems",
IEEE Trans. on Circuits and Systems, Vol. CAS-23,
No.6 (1976), 335-343.
[IS] W.A.Porter, "The conunon causality structure
of multilinear maps and their multipower forms",
J. Math. Analysis and Applic. 57 (1977).
[16] W.A. Porter and R.M. De Santis, "Linear
systems with mult iplicative control", Int. J.
Control 20, No.2 (1974), 257-266.
[17] R. Sacks, "Resolution Space Operators and
Systems", Lecture notes in Economics and Hath.
Systems 82, Springer-Verlag 1973.
[18] D.Sarason, "A remark on the Volterra operator,
J. Hath. Analysis and Applic. 12 (1965), 244-246.
[19] B. Sz.-Nagy and e. Foias, "lIarmonic Analysis
of Operators on Hilbert Space", North-Holland,
American Elsevier, New York 1970.
[20] J.e. l1i 11 ems , "Stability, instability,
invertibility and causality", SIAM J. Cont. 7
(1968), 645-671.
[21] , Analysis of Feedback Systemd, Ca,brodge, ~nT Press, 1971.
27
WIENER-HOPF TECHNIQUES IN RESOLUTION SPACE
L. Tung and R. Saeks Dept. of Electrical Engineering
Texas Tech University Lubbock, Texas 79409
I. INTRODUCTION
Wiener-Hopf filtering is a widely used
technique in certain kinds of optimization
problems. The purpose of this paper is to
formulate Wiener-Hopf filtering in abstract
spaces (reflexive Banach resolution spaces)
and to examine problems involved for the
formulation and the solving of the Wiener
Hopf filter.
Referring to what has been done in the fre
quency domain of the classical Wiener-Hopf
filteringl, we've found five major problems
for the formulation of Wiener-Hopf filter
ing in abstract spaces. They are
i. Random variables in abstract spaces
ii. Causality
iii. Operator factorization
iv. Operator decomposition
v. Optimization.
These problems are briefly introduced as
follows:
i. Random process can be thought of
as a random variable which takes values in
a function space. In order to do so, we
need an adequate probability measure over
the space involved. Fortunately, this
kind of probability measure has been de-. 2 F fined over metr1c space or our pur-
poses, we assume that the space involved
is reflexive Banach space, not only be
cause this kind of space possesses nice
properties but also because stochastic con
cepts such as "mean" and "variance opera-
28
tion" can be defined therein. Random vari
ables taking values in reflexive Banach
space is discussed in section II with pro
bability measure assumed implicitly.
ii. Concepts of causality have been
introduced into Hilbert space-the so-called
Hilbert resolution space3 • In section III,
we extend the works done for Hilbert. space
to Banach space. Concepts of causality,
such as causal, anti-causal, miniphase and
maxiphase, are defined. Emphases are given
to reflexive Banach resolution space.
iii. Operators to be factorized in the
form of KK*, where K* denotes the adjoint
of K, have to be "positive"and "self-ad
joint". These commonly-used properties
among operators on Hilbert space can be
extended to operators which map reflexive
Banach space to its dual space. Factoriza
tion theorem is given in section IV. Fac
tor operator K is required to be left-mini
phase.
iv. The decomposition of operators over
Hilbert spaces is treaded in Ref. 3. For
operators over Banach spaces, this problem
is still under research. For our conveni
ence, operators are restricted to those
which guarantee the decomposition.
v. As in the classical Wiener-Hopf
filtering, we would like to minimize the
variance of the error. However, when Wiener
Hopf filtering is formulated in reflexive
Banach space, the variance of the error is
a positive and self-adjoint operator which
can only be minimized in the partial order
ing of the positive operators.' This subject
is treaded in section V.
II. BANACH SPACE VALUED RANDOM
VARIABLES
The theory of Banach space valued random
variables has been studied in Ref. 2. For
our purpose, we discuss reflexive Banach
space valued random variables with proba
bility measure over the space assumed im
plicitly. The development follows that of
Parthasarathy (2) and Balakrishnan (4); the
reader is referred to these works for the
details.
Let p, TI denote finitely additive random
variables taking values in a reflexive
Banach space B. For such random variables,
we assume
* * * E{ I (p, x ) I} < CD , for all x £ B
* (2.1)
E{(p, x )} is continuous in x*
Here E{.} denotes the expected value of a
scalar valued random variable with respect
to the probability space underlying p. For
random variables satisfy condition (2.1),
there is a unique vector mp in B satisfy
ing
* * * * E{(p, x )} = (mp, x ), for x £ B •
mp is termed as the mean of random variable
p. As in most stochastic processes, mean
is not our prime concern. Therefore, in
the sequel we only deal with zero-mean ran
dom variables. For such random variables,
we further assume
E{ I (p , * * ) I } x ) (TI , Y < CD , * for all x , y* £ B*
(2.2) E{ (p , x*) (TI , y*) }
is continuous in x* and y*
It can be shown that condition (2.2) implies
condition (2.1). Now let's take a look at
E{(p,x*)
E{ (p, x*)
(TI, y*)}. If we fix y*, then
(TI, y*)} is a bounded linear
29
functional on B* (so an element of B**=B).
This means that there exists a unique Py*
in B such that E{ (p, x*) (TI, y*)}
= (Py*' x*) for x* £ B*.
QPTI
= B* + B, by QPTI
y*
Define a mapping
Py*.
Hence E{ (p, x*) (TI, y*)} (QPTI y*, x*).
It can be easily proved that Q is linear. PTI
Moreover, Q is PTI
bounded. Q is termed PTI
as the covariance operator of random vari-
abIes p and TI. Covariance operators satis
fy following conditions:
i. Q(Lp) (MTI) = L QPTIM* , where Land are linear bounded operator on B.
ii. Define Qp Qpp then
iii.
iv.
QP
+TI = Qp + QPTI + QTIP + QTI •
Qp is called the variance operator
of p.
Q* in particular Q = Q * TIP , P P
Q is positive in the sense that p
(Qp y* , y*)
for all y* £ B*.
These conditions result from straight for
ward manipulation of the defining equation
for the covariance operator. Using QPTI'
we say that p and TI are independent if
QPTI
= O.
III. BANACH RESOLUTION SPACE
By a Banach resolution space, we mean a
2-tuple, (B, BF), where B is a Banach space
and BF is the so-called resolution of iden
tity in B, which is defined in the following:
(A) Resolution of identity
Definition 3.1. Let B be a Banach space.
By a resolution of identity, BF, in B, we
mean a family of linear bounded operators,
BF(~), on B defined for each Borel subset,
~, of the real number set R, satisfying
the followings:
i. BF (R) = IB-identity operator on B
ii. BF(~Il . BF(~2) = BF(~lM2)' for
all ~l' ~2 £ S (R)- the set of all
Bo el subsets of R.
iii. n n n
BF(?~i) = I BF(~l)' where {~l}l is
a finite set of disjoint Borel sub
sets of R.
iv. IIBF(~) x II 2. Ilxll, for all
~ £ B (R) and x £ B (Equivalent
statement: Norm of BF(~) is either
o or 1)
The subscript on the left in the notation,
BF , is to notify that the resolution of
identity is defined over space B and will
be dropped if no ambiguity would result.
Working with a Banach resolution space,
(B'BF), it is natural and important to ask
whether we can define a resolution of
identity in B*, the dual space of B. The
following theorem gives us the answer.
Theorem 3.1. Let (B, BF) be a Banach re-
* solution space, then {BF (~) I~ £ B(6)}
is a resolution of identity in B*-termed
as the induced resolution space, (B*'BF*).
With the resolution of identity defined as
above, we'd like to point out that although
Hilbert space is a special case of Banach
space, Definition 3.1 does not lead to a
Hilbert resolution space3 • In Hilbert re
solution space, the resolution of identity,
{E(~) I~ £ B(R)}, satisfies an additional
condition, i.e. E*(~) = E(~).
Example. Let p, q £ R, such that
lip + l/q = 1. Then Lp is a
reflexive Banach space with dual
space Lq . For each f £ Lp ' def-
ine (f ,q) = f""f (t) g (t) dt. Let
F(M f(t) = X (~) f(t) = 0 , tlb.
f(t),tF:~
It is easy to show that {F(~) ~£B(R)} is
a resolution of identity in Land F*(6) p
= X (~) for all ~ £ B (R).
(B) Concepts of causality
Definition 3.2. Let (X,xF), (Y'yF) be
30
Banach resolution spaces. T: X+Y, is a
linear bounded operator
(i) T is causal, if
Ft ~ Ft T Ft T X x 2 Y· Xl = Y x2 , t
where BF = BF(-w,t) , B = X, Y.
(ii) T is anti-causal, if
XFt Xl = xFt x 2 => yFt T Xl = yFt T x 2 ,
where BFt = BF (t,w), B = X, y.
(iii) T is memoryless, if T is causal and
anti-causal.
(iv) T is left-miniphase, if
(v) T is left-maxiphase, if
(vi) T is right-miniphase, if
(vii) T is right-maxiphase, if
According to above definitions, we've found
the following results:
(1) Miniphase, left-or right-, implies
causality.
(2) Maxiphase, left-or right-, implies
anti-causality.
(3) When X and Yare reflexive, we have
(a) T is causal <=> T* is anti-causal
(b) T is left-miniphase <=> T* is
right-maxiphase.
(4) When X and Yare reflexive and T is
invertable, we have
(a) Miniphases are equivalent, so
are maxiphases.
(b) is miniphase and -1
T => T T causal
is maxiphase and -1 . T => T T antl.-
causal.
Readers are referred to Ref. 5 for the de
tails.
IV. OPERATOR FACTORIZATION
Not every operator over arbitrary Banach
spaces can be factorized in form desired.
For our purpose the desired form of factori
zation is K K*. Operator to be factorized
in this form has to be positive and self
adjoint. These two commonly-used proper
ties for operators over Hilbert space can,
be extended to operators that map from re
flexive Banach space to its dual space.
They are defined as follows:
Defini Hon 4.1.
(i) Let B be a reflexive Banach space.
(ii) Q B+B*, is linear and bounded.
Q is said to be positive if
(x, Qx) ~ 0, for each x E B.
Q is said to be self-adjoint if
Q* = Q.
Note that Q* : B**=B + B* 'so it
makes sense to compare Q with Q*.
For positive and self-adjoint operators, we
have the following theorem:
Theorem 4.1.
(i) Let B be a reflexive Banach space.
(ii) Q : B+B*, is linear, bounded,
positive and self-adjoint.
Then there exist a Hilbert space H
Theorem 4.2.
(i) Let (B,F) be a reflexive Banach re
solution space. (B*, F*) denotes
the induced resolution space.
(ii) Q = (B,F) + (B*, F*), is linear,
bounded, positive and self-adjoint.
Then there exist a Hilbert resolution
space (H,E)
K = (H, E)
1. Q
and a linear bounded operator
+ (B*, F*) such that
K k*
2. K is a left-miniphase
3. The factorization is unique up to
a memoryless unitary transforma
tion.
For the proof of this theory, please refer
to Ref. 5.
V. WIENER-HOPF FILTERING FORMULATED
IN REFLEXIVE BANACH SPACE
With the preparation of sections II, III
and IV, now we are ready for the formula
tion of Wiener-Hopf filtering. The for
mulation is done as follows:
Let X, n be random variables taking values
in a reflexive Banach resolution space
(B, F). X denotes the signal and n the
and a linear bounded operator K : H+B, noise. Both X and n satisfy condition
such that Q = K K*6. (2.2) in section II and they are assumed
When dealing with Banach resolution spaces,
the usefulness of operator factorization is
limited unless the factor operator possesses
certain causal properties. Referring to
factorization of the spectral density in
classical Wiener-Hopf filtering, we have
found what we need is a factorization theo
rem which gives a causal operator and quar
antees a causal inverse once the existance
of a inverse is granted, i.e. a theorem
that gives a miniphase factorization. Based
on Theorem 4.1, we construct the resolution
of identities in spaces involved and we come
up with the following theorem.
3J
to be zero-mean and independent. As such,
X and n have Qx and Qn
as their variance
operators respectively. The propbem we are
facing is to find a filter, T B+B, linear
and causal, to operate on X + n such
that the error, defined as x-y, where y is
the output of T, has a variance operator
that is minimal in the partial ordering of
the positive operators. We will describe
this ordering right after we find the vari
ance of the error.
Let e deonte the error and Qe denote its
variance operator. Since
def e = X-Y = X - T(X+n) = (I~T)X + Tn
we have
* Qe = (I-T) Qx (I-T) * + T Qn
T , following
from the results in section II. Rearrang
ing terms in Q , we get e
Qe = T(QX+Q~)T* - Qx T* - T Qx + Qx·
Q is dependent on T. e notify the dependence.
We write Qe(T) to
Q (T ) is said to e 0
be minimal for some filter To' if
Q (T) < Q (T ) => Q (T) = Q (T ) e - e 0 e e 0
(A ~ B if (B-A) is positive).
In the equation for Q , Q + Q represents e x n
the variance operator of X+n, hence is
positive and self-adjoint. Therefore, by
Theorem 4.2, there exists a resolution
space (H,E) and a linear bounded operator
K = (H,E) + (B,F), such that (a) Qe = K K*
(b) K is a left-miniphase. Without further
assumptions, the formulation would be stuck
right here. At this point, what we need
is an invertable factor operator K. The
invertability of K can be guaranteed by
the invertability of Qx+Qn. There are
several ways to secure the invertability
of Qx+Qn. One way is to assume that Qx+
Q in onto and Q is positive definite. n n With an invertable factor K, Q can be e rewritten an
Qe = T K K* T* - Qx T* - K Qx + Qx
[TK-Q (K~)-l] [K~T*-T-IQ ] + Qx X X
_ Q (K*)-l K- l Q X X
[TK-Q (K*)-l] [TK-Q (K*)-l]~+ Qx X X
_ Q (KK*)-lQ X X
The last two terms in the above equation, -1
Qx and Qx(KK*) Qx' are positive and in-
dependent of T. Hence, to find the mini
mal of Q is the same as to find the mini-e -1 -1 *
mal of [TK-QX(K*) ] [TK-QX(K*) ] - de-
noted as Q(T) in the sequel. Right now,
we are facing the same kind of problem as
in classical Wiener-Hopf filtering. Mini-
32
-1 -1 mal Q(t) occurs when T = QX(K*) K , but "it
does not represent a causal system in gene
ral. In order to get a possible optimal -1
causal filter, can we decompose QX(K*)
into "causal part" and "strictly anti
causal" (a term to be generalized in Banach
resolution space) and under what conditions
can we do so? This subject has been,treated
in Ref. 3 and Ref. 7 when the reflexive
Banach space happens to be a Hilbert space.
However in reflexive Banach resolution
spaces, the subject is still under research.
While we follow the same pattern as that
of classical wiener-Hopf filtering in fre
quency domain, we would like to ask whether
this decomposition would work and how it
would. The same question in classical
Wiener-Hopf filtering is not directly ans
wered in frequency domain. In order to
find the answer, let's assume the decom
position. Let
Q (K*)-l = C + A, where C is the X
-1 causal part of QX(K*) and A is the "stri-
ctly anti-causal part" (a term to be gener
alized in Banach resolution space). Then
Q(T) [TK-C-A] [TK-C-A] * * * [TK-C] [TK-C] - A[TK-C]
- [TK-C]A* + AA*.
To claim TK-C=O is the condition for mini
mal Q(T), we should demonstrate that those * cross terms, [TK-C]A* and A[TK-C] , have
no effect on the ordering of Q(T). Again
when the reflexive Banach resolution space
is Hilbert resolution space, we've found
two ways to achieve this. The first one
is to take the trace of Q{T). Surely, work
has to be done to guarantee Q{T) being
nuclear. The second one is to take the
memoryless part of Q{T). This is justified
once the decomposition is given. However
we've also found advantages and disadvant
ages to each way. For the method of taking
trace, it gives a minimal variance operator
once the maximum of the trace is found, but
we have to restrict certain operators, and Systems, New York, Springer-Verlag,
such as Qx and T, to be Hilbert-Schmidt. 1973.
On the other hand, the method of taking 4. Balakrishnan, A-V., Introduction to Op-
memoryless part works for a broader class timization Theory in a Hilbert Space,
of operators-operators have decompositions, New York, Springer-Verlag, 1971.
but it does not give a minimal variance
operator. The best we can have is a vari
ance operator that has a minimal memory
less part. However, there is an important
aspect for taking the memory less part.
This method allows us to generalize the
idea in reflexive Banach space, while the
other method does not. The reason is quite
simple, for it does not make sense to talk
about the eigenvalue of an operator that
maps from Banach space to its dual, not
to mention the trace of such an operator,
while it does make sense to take the
memoryless part given the decomposition.
Readers are referred to Ref. S for the
destils of Wiener-Hopf filtering formulated
in Hilbert resolution space. When all the
problems mentioned above are solved, we
would come up with the optimal filter
T C K- l , a causal system o
VI. CONCLUS ION
Wiener-Hopf filtering has been formulated
and solved in Hilbert resolution spaceS.
In this paper, we outlined the formulation
in reflexive Banach resolution space and
the possible way of solving it. Generali
zation would be accomplished once the
theory of operator decomposition in Banach
resolution space is completed.
REFERENCES
1. Cooper, G.R. and Mcgillem, C.D.,
Probabilistic Methods of Signal and
System Analysis, New York, Holf,
Rinehart and Winston, Inc., 1971.
2. Parthasarathy, K.R., Probability
Measures on Metric Spaces, Academic
Press, 1967.
3. Saeks, R., Resolution Space, Operators
33
S. Tung, L.J., "Random Variables, Wiener
Hofp filtering and Control Formulated
in Abstract Spaces", Ph.D. Thesis,
Texas Tech University, to appear.
6. Masani, P., "An Explicit Treatment of
Dialation Theory", University of Pitts
burgh, published notes.
7. DeSantis, R.M., "Causality Structure of
Engineering Systems", Ph.D. Thesis,
University of Michigan, Sept. 1971.
APPROXI}~TE CONTROLLABILITY
AND WEAK STABILIZABILITY
Claude D. Benchimol System Science Department
University of California at Los Angeles Los Angeles, California 90024
Abstract
First using Sz. Nagy, C. Foias approach for contraction operators, we prove the following theorem: Let H be a Hilbert space, and T(t} a C contraction semigroup in H. Then H can be dec~mposed into three or~hogonal subspaces Hcnu ' Wu and W , all reduc~ng T(t} and T*(t}, such that
Wu + W Hu
Wu + Wcnu W
such that, On H ,T(t} is completely non unitary, and weakly stable On Wc~uT(t} is unitary and weakly stable On WU , T(t) is unitary, and x w, T(t)x / 0 at t + • W is called the "weakly stable subspace" W is called the "weakly unstable subspace". Hu reduces T(t) to a unitary group. H reduces T(t} to a completely non unitary contraction sgb\Ygroup.
Next, we define the controllable subspace to be C = Range T(t)B
t 0 and the uncontrollable subspace
C N(B*T*(t)}. t 0
Or main result is: Let A be the infinitesimal generator of a C contraction semigroup T(t} in a Hilbert space H, and B a bounded Rperator mapping another Hilbert space H. into H. Then, the system x = Ax + Bu is weakly stabilizable if~and only if the "weakly unstable states" of T(t} are approximately controllable, or equivalently W C. Furthermore K = -B* is a stabilizing feedback gain. As a corollary, the above condition is necessary and sufficient for strong stabilizability, when A has a compact resolvent. These results generalize the well known finite dimensional condition (see Wonham) and considerably strengthen the results of Slemrod, where only a sufficient condition for weak stabilizability was given, assuming that the resolvent of A was compact.
34
A MODIFIED DISCRETE CONVOLUTION OPERATOR FOR
SIMULATION OF LINEAR CONTINUOUS SYSTEMS
H. B. Kekre and D. B. Phatak Indian Institute of Technology
Bombay, India
SU~Y
Linear continuous systems are characterized by the convolution integral which takes the form of convolution sum for discrete systems. When it is desired to obtain a discrete model for simulating a continuous system, the traditional time-domain methods make use of the sampling process as it can be most conveniently implemented. However, there is a one to one correspondance between the convolution integral and the convolution sum only for band limited systems and signals, with sampling rate greater than the Nyquist rate.
Steiglitz has derived an isomorphism which preserves convolution in the above sense. The discrete sequences representing the continuous signals are obtained as coefficients of signal expansion on a continuous domain basis {A (t)} ~, where A (t) are La-n n=-~ n guerre functions. Oppenheim has given a generalized form of such basis functions. But the coefficient computation requires analog filtering and hence is not very attractive from digital computer simulation point of view.
This paper presents a new relationship between the two domains which retains the sampling process and removes all approximations attached to the sampling interval T, for the so called "unbandlimited" square integrable signals and linear time invariant causal systems. It is based on the expansion of functions on a real exponential basis. A modified discrete convolution operator is defined which is shown to correspond to the convolution integral.
* DIFFF,Rl~;NTIAL SYSTEHS ON ALmRNATIV~ ALG~BHg)
Robert W. Newcomb Electrical B~ngineering Department University of l'laryland College Park, 11aryland 20742
A 00 tract
Continuing a previous study, which gave a universal square - law canonical for~ for differential systems by embedding the description in an attached commutative nonassociative algebra, we show here that when the algebra is alternative the description can further be reduced to zero.
"It is my wish that the Treasury shall make a chronicle setting forth the g~neolo~."
[1, p.1J 1. Introduction
Previous1~ [2] we have shown that any system
g described in the state variable form x ~ f(x,t),
with f reasonably behaved, can be reduced to the
canonical form
x(O) .. x given - -0
where 2t(t) is a vector in an attached algebra
0(8) associated with g and ~ = dz/dt. This alge-
bra is commutative rut not necessarily associative.
Amo~g the more extensively studied nonassociative
algebras are the alternative algebras [3, Chap. 3].
Here we show that if O(g) is an alternative alge
bra the canonical form can be further reduced to
the ultimate form of zero.
2. ~lain Result
We first recall that an algebra is a vector
space in which multiplication of any two vectors
is properly defined [4, p. 144J. An alternative
alge bra J, is an a1ge bra in which for all ~, X E a
o
(?a)
and (?b)
Since the attached algebras for (1) are assumerl
commutative, there is no difference between (2a)
and (2b) in the considerations here. Too, as is
known [5, p.3l9], we may assume that U has a unity
element~, since otherwise one may be adjoined
while preserving the alternative nature of the
algebra. Now, given an alternative algebra a(g)
associated with a system g to yield its repre
sentation x = x.x, that is, representation by (1)
within the algebra, we can proceed somewhat as
with Riccati differential equations [6, p.12] by
introducing a new vector variable y through . r '" ~'l (J)
Then differentiation of (J), using (1), yields
.y = -x.y-x.y = -(x.x).y + x.(x.y). On usinF, (2a) -- ---- --- ---this gives our main result
'y = 0 - - (4)
* This work was supported in part by the US Natioal Science Foundation under Grant i:SF gNG 75-CJJ227 and in part by a ]i'ul bright - Hays Grant to 11alaysia.
36
In other words, any system Hhose att?ched al[iehra
is alternative can be reduced further Hithin the
a1ge bra to Y = .2.
3. ~;olution
:';quation (4) ca,n l:€ lnterr,ratod to yield a
solution: . y = a (5a)
y(t)"'at+b - - a /" b constant (Sb)
Sul:Etitution in (3) gives, -1 1 if y and then b-
exist,
«(,a)
( 6b)
As the initial conditions are
~(o)
He obtain
== x -0
-1 -~'t (6c)
Hhich ar;rees Hith [:, ::q. (JA)] previously obtained
for division a1r;ebras. :;ince only the ratio of .<::.
to b is important, in (5b) He may all"ays t.ako
a == ~, £ == ,£. In this case 12 is clearly nonr3in[;U
lar, and, hence (as Z(!) =):2 = .§.), by continuity
~ t) [or sufficiently [;mall t. By analytic contin
uation (7) is then sr,en to t:e the solution Hithin
the a,ttached al terna ti ve al;cc bra.
;!e have S(Jen hcre tht!.t ir thl) attached al,w bra
U(o,) of a SylltO[;l 3 h~ eJter112-tive, thc canonical
equations ~ = ~.~ take tho very simple linearized
form .i = 2 Hithin Lhe '>3.'1e 2.lso bra. This rurther
loacls to a sir:IJllc solution x( t) = x LO - tx rl ,.- ' 0 - . 0
r;iven the initial conoitions 2<0 ?:CU:), irri~r;nc,ctjve
of Hhether or not the alt8rnative alc;ebra. h; a
divbion a]i~r·L'Ta. 1t is of interest to Clotr" thai,
alternative cliv:!,[,ion illr:ebras then~~olves ilTe lTell
studi()d and are either il,sGociativc or isonorphic to
an ei~ht - dinensional ~aylny - Dickson d~vision
algr; bra over thd,r center L'7, 'P. :;l!l:.
To H. :J. H. ,Tanalullail llith rene"lbrar.ce oT
!';alaysian developments.
37
"Hhf)n he hnc'.nl the HOrd of lib Highner;s, hr; tool-: t.he cO;;J'llanu lljlOll hb h"ad ami hib linbG Here boHed h)T)eath the Heirht 0[' it." .,
LI, PP.J-?:
l1ercre!lces:
L'll" 1 A I" (" " '1 ) (' , ' ,.a ay nna s "C Jarall ,'·0 ayll , '~. '. "rmrn,
L.3l
tran"lator, Oxrord Univerr;it.y rre~:.~. ;'lal'J
;,umpur, 1970.
:i. ;1. LeHcomb, ";.onlincar Li:~I'()rnnti8Jjyr;tel'l~~: II Canonic, i'iul tivariable Thecry," l'roceedinf's of the L';;~:;, Vol. (,5, [.0. (., June 1977.
il. ~). :,chai'er, "An Introul)ction to .,ona2.sccia.tive l\l~:,p,bras." Acacierr,ic J-re~)f;, .,r.~; YorI", 19(/.
L. J. raic;e, " Jcrua.I: A13eb:r:'u.5," in ",)t~J.f!ies jn l:od ern Ale:'> bra," cd i ted by A. A. j,] lx,rt, 'i'hr· Lathe:latical Association of' A!1erica, 19(,3, r:m. JlfLf - 18G.
L51 j. •• A. Albert, liOn Rie;ht Alte~";rJ-LiV('; Alf':pr)}·;::l.~:, 11
Anm'.ls of Lathem2.tics, ·.jol. 5'~" ,D. 2, Anril 191f9, PP. 318 - 320.
L6l ::. '1'. i'lcid, "::lic',],t:i, ljE;'nronti~J ''111ation~~,'' AC8,(1'_8nic Pres~J, 1:(:H '{or~~, 1')72.
L.7J '. j(lfdnI'cld., "A C!hiJTact"rization 'l:' th;) ';ayl~y :·;urrbers,11 in "~)tHcJiGG in ;.odern t\ll/~l:r.a, II ed Hwl by A. A. Allxrt, 'l'he ,.a.thc'ni',t ir.iJJ ASGoci.3,tion or Ar~erica, 19('3, pp. J?:. - Ih3.
- r 'J, '-, '
~r ... • I ~) • (. ;;;.
LAGRANGIANS WITH INTEGRALS.
AN APPROACH TO THE VARIATIONAL THEORY OF DISSIPATIVE SYSTEMS
Vuk M. Fati6 Electrical Engineering Department
Tri-State University Angola, Indiana 46703
Abstract
William A. Blackwell Electrical Engineering Department Virginia Polytechnic Institute
and State University Blacksburg, Virginia 24061
Some fundamental results on Hamilton's principle with a class of Lagrangians which depend of path dependent integrals are derived. It is shown that dynamical equations of linear dissipative systems could never be identified with the stationarity conditions for this class of action functionals. An equivalence, however, is established between dynamical equations of purely dissipative linear systems and stationarity conditions for an action functional with a Langrangian containing integrals, and a Hamilton's principle for purely dissipative linear systems is thus established. Exponential Lagrangians for one-dimensional systems are derived as a special case of the theory.
1. INTRODUCTION
Network and system theory is occasionally dressed up in the formalism of analytical mechanics, but no satisfactory method is known yet for incorporation of dissipative forces in Lagrange-Hamilton's theory. Rayleigh's dissipative function does not resolve the problem, because it destroys the link between Lagrange's equations and Hamilton's principle. The difficulty lies in incompatibility of Lagrange's equations with dynamical equations of systems with dissipation. For linear systems
Rund's conditions reduce to: (a) ~ and ~ should be symmetric matrices, and (b) ~ should be a skewsymmetric matrix. These conditions are obviously met in the case of conservative (B = 0) and gyroscopic (~= _~T) systems, but conditi~n (b) excludes dissipative systems, which have symmetric ~ ~~. For the special case of systems with one degree of freedom it means that ~ - term cannot appear in Lagrangian equation.
H. Rund (1) has shown that equations
are identical with a system of Euler-Lagrange's equations:
(1)
(2)
for a L(t,gA) iff ~(t) is a symmetric matrix, B(t) is a skew-symmetric matrix, and B(t) = K(t) -~T(t). When coefficient matrices are-consta~t,
38
Rund's theorem, however, presumes the identity
which is not necessary to satisfy in order to establish a variational principle of Hamilton's type:
tl 6 J L(t,.9..,g)dt = 0 (4)
to
Existence of Hamilton's principle requires only an equivalence of equations (1) and (2), i.e. identity of their solutions. As a matter of fact, if a Lagrangian L(t,~,~) could generate
d aL aL dt ag: - a~ -
= !(t,~,g) .{ -It [~Jt)§.] + ~(t)§. + !(t)~ - Q.(t)}
with matrix! nonsingular in [to,tlJ ' equations (2) would be equivalent to (1), and the variational principle (4) would be demonstrated. This idea has already been explored: Havas(2), Rohrer(3), and Denman(4)-(6) examined its various aspects, and it is sUfficient to mention here that equation
Mq + Bq + ~(q) = 0 ( 5)
could be generated from Lagrangian:
~t [ ] L(t,q,q) = eM ~2 - ~~(X)dX qo
(6)
while the equation
Mq ± Bq2 + ~(q) = 0 (7)
follows from the Lagrangian:
M ±2~ lq L(t,q,q) = 2q 2e -
±2Bx e W ~(x)dx (8)
qo
However, Van der Vaart(7) proved that the multiplier method cannot be generalized to systems with n>l degrees of freedom (unless some very special conditions are satisfied, or system is uncoupled). Lagranians of this type have been used in some quantum-mechanical problems (Buch-Denman(8), Kerner(9), Besieris-Fatic(lO~, and the multiplier method has been recentl) extended to some continuous systems (Denman(ll , Fatic-Blackwell (12».
The assumption (3) can also be circumvented, as it was done in the "image" method (Morse-Feshbach(13~ Leech(14», by adjoining an "image" system with negative dissipation (-~) and generalized coordinates ~ to the given system. The requirement (3) can then be replaced (for time-invariant systems) by
39
It it - ~~ = ~q + ~9. + ]Sq - Q.( t)
:4 aL - ~ = Mr - Br + Kr - R(t) ut ag: a~ -- -- -- -
which is satisfied by a Langrangian, bilinear in ~ and~. Applications in network theory and generalization to time-varying systems are presented in (15)-(17)
Both of these methods have their shortcomings: the multiplier method is limited to systems with one variable (discrete or continuous), while the "image" method requires additional auxiliary variables; canonical formalism in both cases does not lead to physically meaningful results.
Another radically different approach was proposed by Huang and Blackwell(18): instead of modifying Lagrange's equations by Rayleigh function, dissipated energy can be included in Lagrangian, together with other energy functions. But in contrast to other energy functions, which depend on the state of the system only, dissipated energy WD depends on the whole path 9.{t) in[to,tJ, so that
WD = J\_T(-r)~g{t)dt to
(9)
is a path-dependent integral. Presence of a pathdependent integral in Lagrangian raises a question of whether Lagrangian equations (2) can be used as stationarity conditions any more. In order to clarify the situation, we have developed some fundamental results concerning variational principles with Lagrangians which contain one general class of path-dependent integrals. These results, interesting in themselves as a new variational problem, may eventually prove to be useful for variational theory of dissioative systems. At least, Rund's theorem does not apply to Lagrangians with integrals.
2. LAGRANGIANS WITH INTEGRALS
Lagrangians composed exclusively of integrals appear i~ nonlinear system theory (White-Woodson(19), Stern(20), Meisel(21), MacFarlane(22),(23), JonesEvans(24», where all relevant energy and co-energy functions are expressed as integrals. Lagrangians
-(6) and (8) also contain integrals. All these integrals. however. are by their very nature pathindependent; they represent state functions, which depend only on initial and final states of the system. In order to get suitable mathematical framework for variational theory of dissipative systems. integrals which are truly functionals need to be installed in Lagrangian.
Fairly general class of Lagrangians containing integrals is represented by
L = L [t,g.<t), 9..(t), ~(t)] (10)
where
1 ~ t f CT. 9.(-r), 9..(T)] dt (11);
to
9.(t). 9..(t) and 9(t) are n X 1, n X 1 and m X 1 matrices (vectors) respectively:
T g(t) = {ql~t). q2(t)' .....• qn(t)}
g(t) ={ql(t), 42(t),·····'qn(t)}T
T 9(t) = {9l(t). 92(t)' ..... '~m(t)}
Dissipated energy belongs to this class (f=gT~). as well as Lagrangians (6) and (8): f = ~(q)q for
(6). and f = e±2l q ~(q) q for (8).
It is important to realize the difference between classical Lagrangian
L = L [t, 9.(t) , 9..(t)] (12 )
and Lagrangian (10). L in (12) is a function of t. g(t). and g(t) in the ordinary sense, i.e. it depends on t. 9.(t), and 9..(t) pointwise. On the other hand. L in (10) contains terms q which depend on the path 9.(t) , taken by the system in the time interval [to.t]. Therefore, L in (10) contains integral functionals together with the point functions 9.(t) and 9.. (t). Lagrangians containing functionals ~(t), with functional nature of 9(t) spelled out - -
L = l{t, 9.(t) , 9..(t), 9 [9.(T): to$ t ~ t]} (13)
may nevertheless be point functions of its variables t. 9.(t) , 9..(t) , and 9(t). For convenience (which will be apparent soon) we shall restrict
40
consideration to this simpler class; Lagrangians (6) and (8) are typical examples of this type.
Lagranqians (13) should be distinguished from the case
L = l{t, 9.(t) , 9..(t), F[t, 9.(t), 9..(t)]} (14)
where L depends on t, 9.(t) and 9..(t) explicitly and implicitly by the way of a pOintwise function F: (14) is actually a classical Lagrangian of the same type as (12), only more complicated. The term 9(t) in (13) is not a pointwise function of t, 9.(t) and 9..(t). but a functional, which depends on the whole path ~(t) between to and t,i.e. on the history of the system.
Consideration of Lagrangians (13) leads to a fundamental question: what is a necessary condition for stationarity of the action functional
J ~ Itl L[t, 9.(t) , 9..(t), q(t)] dt (15) to !
with fixed endpoints to and tl? The assumption taken in (5), (18) and (24) that
~ ltl u L[t, 9.(t), 9..(t), ~(t)J dt = 0
to
is equivalent to
(16)
(17)
is not apriori justified, because L is not of the type (12), which has been assumed to be the case in the derivation of Euler-Lagrange's equations (17). Therefore, it becomes necessary to derive an appropriate equation from (16).
2.1. STATIONARITY OF THE ACTION FUNCTIONAL
In the class of functionals (15), with q given by (11), the simplest case is when -
( 18)
Introducing a new variable ~ ~ q(t) , ~ = 9.(t) and i = 9..(t) , and (10) becomes a fu~ction of r's: L = L [t, .t.<t) , i(t), .dt)]. For this ty;e of
Lagrangian Euler-Lagrange's equations are:
Returning to the variables ~, (19) reads
E.!:. - ~ E.!:. + ~ E..l;. = 0 (20) a9 dt a~ dt2 a~ -
This is the necessary condition for stationarity of J for Land q given by (15) and (18) respectively. In a more-general case former substitution procedure is not feasible, so that the derivation must be carried out directly calculating the variation of action functional in terms of ~~.
The first variation of J with fixed endpoints is:
t oJ = J 1 oL [t, ~(t), 9.(t), 9(t)J dt
to -(21)
The integrand, being a function of t, ~(t), ~(t), and q(t), can be expanded by Taylor's theorem into
By the well-known property of isochronous varia
tions:
(23) ,
so that after integration by parts
tl eJ = J [(E.!:._ d.u. )\n(t) +(E.!:. )Teq]dt(24)
to a~ Of a~:L a~.:.
(the endpoints are fixed). Now oq(t) should also be expressed in terms of e~:
J tl T Jt . = (;~) 15 f[T, ~(T), 9.(T)]dT dt (24)
to .:. to
Using commutativity of variation and integration and Taylor's theorem again
tl t
eJ1 = J (;~)T J [~~ e~(T) + * e~(T)J dT dt (25) to .:. to
41
Integrating the last term in (25) by parts
t t (26) l ~i e9.(T)dT = (*)T=t oq(t) -l d~*Oq(T)dT because o~( to) = Q., and o~( t) 1 Q.. Therefore
It1 T
oJ 1 = t (~~) (~) t o~( t) dt + o .:.
t1 T t
+ I (~~) I ( ~~ -d~ * )e~(TldT dt (27)
to .:. to
Changing the order of integration in the last term (Fig. 1) it becomes
t
Fig. 1: Integration region in (27).
Interchanging dummy variables t and T the last expression turns into
It1 tl T
[J (~~) dTJ( ~~ - a%*)o~(t)dt to t .:.
(28)
so that:
oJ = /tl{( ;~)T - a%( ~l + (~~r(*) + to .:.
(29)
Due to arbitrariness of e~(t), stationarity condi-
.. ,
tion oJ = 0 is equivalent to
(30)
Transposing this matrix equation we get:
This is the necessary condition for stationarity of J , which will be called EULER'S EQUATION in order to distinguish it from LAGRANGES EQUATIONS (17) .
When f = ~ /see (18)/ Euler's equation becomes
tl
!h.-.J!~+J!h.d =0 aB. dt aB. aq T t !
which· evidently reduces to (20) after differentiation with respect to t.
2.1.1. Discussion of Euler's Equation
Euler's equation (31) is a differential--integral equation, whose the most conspicuous feature is the presence of an integral from t to tl' The interval (t,t l ] is the "future" with respect to the moment t; hence Euler's equation apparently states that future behaviour of the trajectory (q(T) for t < T ~ t 1) influences its present state q(t). This is a feature typical of anticipative systems, not of dissipative ones. Therefore, Euler's equation seems in principle unsuitable for dissipative systems.
However, the other terms in Euler's equation are also functionals in general: they involve q(t) terms (aL/aq also depends on q in general), 'i.e. they depend!on the past behavior of the system -of its history. Now we see that Euler's equation is a functional equation which relates the past and the future behaviour of the system via its
42
present state. This type of equation does not necessarily violate causality; for instance
tl itt) 1 HT, !lh), itT)] dT = 0
t
is equivalent to
.9. + Ht, !l(t) , 9..(t)] = 0
which is just a system of ordinary differential equati ons.
To show that Euler's equation is not disqualified as a mathematical model for real systems, all we have to prove is that (31) can be reduced to an equation which does not contai1tl aL Indeed,
aq clT. t .
introducing the notation:
6 aL d aL + (af.)T aL ~(t) = a!l - dt.ag: 8! a~ ~xl
~xn
_C(t) 6 ~ = aq ~xl
for the sake of brevity, Euler's equation reads
(32)
For a coordinate qi (i = 1, 2, .... , n) this is an
algebraic equation in J(tl I(T) dT:
Differentiating (33) m times we get:
1.e.
k
+ L (~) d:: ,Q,=O
T dk-,Q, B. -;:-;;- I{ t) + -1 dt"'-,Q,
dk T Jl +-..B. C(r)dT=O dt~ -1 t -
(34)
(k = 1,2, ... ,m). This is a system of algebraic
Itl
equations in t ~(T) dT, which can be solved if
S.(t) is nonsingular (mxm matrix S.{t) is defined -1 k -1
by Skj ~ d!k Bji ) and the solutions substituted
in (33). The result is the eliminant
where ~i{t) is the vector mxl, whose components are the first two terms in (34) for k = 1,2, .. ,m. The determinant (35) does not contain (tl any more, although it is still a differential-i~egral equation in ge~eral, because it may contain q. But they are all~t, so that Euler's equatio~ (3l) is
actually not l~mited to anticipative systems, but also suitable for hereditary processes (25) or
systems with memory.
2.1.2. Complementary Case of Euler's Equation
If q in (lO) is defined as !.
tl ~(t) ~f f[T,9JT),~{T)J dT
t
(36)
necessary condition for stationarity of J can be derived in the same fashion as (3l). The difference begins in {26}, which reads now:
tl = - (!f.) 09.{t) -J ~ ll:. 09.(r) dT (37) a9. T=t dT a~
t
(because o~tl) = ~), so that (27) becomes
tl T oJ l = - tJ (~~) ( *)t oq{t) dt +
o
(38)
Double integral in the second term of (38) is now taken over the region shown in Fig. 2., so that changing the order of integration second term in
(38) changes into
43
t
Fig. 2: Integration region in (38).
i tl iT T [ (:~) dt] (~~ - i*)o9.{T) dT
to to !.
Interchanging the variables t and T it turns into
tl t T J [J (;~) dT] (;i -d~ *)Oq{t) dt to to &
Now evidently Euler's equation is t
aL d aL ( at)T aL + (U, d .!!.)T J aL a9. - dt ag: - ag: a~ a9. - dt ag, a~ dT = ~
- to - (39)
It can be proved in the same way as in 2.1.1. that~: may be eliminated from (39); the result is a differential-integral equation with tftl only, so that (39) seems to be generally suitable for anticipative systems.
2.2. RELATIONS BETWEEN EULER'S AND LAGRANGIAN EQUATIONS
Euler's equation is obviously a generalization of Lagrangian equation: when f = ~, i.e. when Lagrangian does not contain integrals, (31) reduces to (17). There are several other situations in which Euler's equation is related to Lagrangian equation.
I
2.2.1. Transformation of Euler's Equation to Lagrangi an Form.
Lagrangians (10) depend on ~ and i explicitly and implicitly via q; one may expect, therefore, that necessary condition for stationarity of (15) is of the form
DL d DL _ ng:-Clt~-Q. (40)
where partial derivatives D/D~ and D/Di take into account both explicit and implicit dependence of L on ~ and i.
In order to verify this conjecture, we need expressions for Dl/D~ and Dl/D.9.; they can be obtained if one can separate contributions of increments of ~ and .9. to the increment of L. This separation can be achieved by avoiding elimination of 0.9. in the derivation of Euler's equation.
Changing the order of integration in (25) we get (Fig. 1)
t} t}
oJ} = f [f ( ~~ f dtJ (~i o~h) + * 0.9.( T) ) dT
to T
and interchanging dummy variables t and T
so that
(41)
The multiplying factors of o~ and oi (in square brackets) are measures of the rate of increment of L due to the increments of ~ and i. It seems reasonable, therefore, to define them as global par-tial derivatives: t
T } Dl/:,.£.!:.+(af.}J .£idT (42) D~ = a~ a~ aq
t .:.
Then (41) reads as
t}
oJ = J [( ~~)TOq(t) + ( ~)\gJt)J dt to
Integrating second terms by parts
tl
oJ = J (~~ -d~ ~rOq(t) dt to
Comparing (45) with (29) we see that
_ DL d DL = D~ - CIt og:
(43)
(44)
(45)
(46)
Therefore, with the global partial derivatives defined as in (42) - (43), Euler's equation can be put in the lagrangian form.
The artificial form in which DL/D~ and DL/Di appear in (42) - (43), however, points out that the reduction of Euler's equation to an equation in Lagrangian form is achieved by force. As a matter of fact, Euler's and lagrangian equations are two different equations, which is in agreement with the well known fact that stationarity conditions for the action functional have additional terms, besides those in (17), if Lagrangian contains higher derivatives of q than the first.
2.2.2. Equivalence between Lagrangians containing Integral Functiona1s and Lagrangians without them.
Although Euler's equation cannot be reduced to Lagranian equation with the same Lagrangian L, it does not prohibit the possibility that there is another Lagrangian L' such that left-hand side of
aL' d aL' a~ - CIt aq = Q. (47)
coincides with the left-hand side of Euler's equa-
tion (31). An insight into th~se equations makes the following guess plausible:
Theorem 1: If Lagrangian is of the form
L(t.9..9..~) = Ll (t.9. • .9.) + hl(t)9 (48).
Euler's equation (31) is identical to (47) with
Proof:
~ _ ~ ~ = ~ _ ~ ~ + (.£1 d af)T a9. dt a9. a9. dt a9. a9. - dt a9: X
tl tl
J h2{T) dT - (*)T d~J h2h) dr =
t t
=':':l_~aLl (af)T (af daf)T a9. dt ar- + aB: h2{t) + a9. - dt ~ X
The expression on the right is actually the Euler's expression for L given by (48) because ~~ = h2(t). Q.E.D. ..
Corollary: Instead of applying Euler's equation to Lagrangian (48). the same result can be obtained if Lagt'angian equation is applied to the Lagrangi an (49).
2.2.3. Path-independent Integrals and Euler's Equation
If Hamilton's principle is to be found from (16),
Euler's equation (31) ought to be equivalent to dynamical equations; the simplest form of equivalence is the identity:
tl
.£h _ ~ .u. + (~) T .£h + ( at. _ ~ g ) T J a L -a9. dt a9. a9. aq a9. dt a9. tqdr = . .
t -
(50)
In selecting a proper Lagrangian, f and aL/aq can be chosen independently of each other; in otfier
words. integrand of 9 and the functional dependence of L on 9 can be chosen separately. The most attractive-choice of f would be one which annihilates the last term in Euler's equation. one which has anticipative character. Such f is to satisfy the
equation af. d af _ a9. - dt aB: = Q.
(51)
identically, i.e. for every ~(t)
Theorem 2.: f is an identical solution of (51) iff
f = it f.{t.9.)
where f. is an arbitrary function of t and 9.. ~: If f is of the form (52), i.e.
(52)
(53)
for i = 1.2, ...• n (the summation convention implied),
then
so that (when mixed derivatives are continuous)
afi d afi _ aq-; - Of a"Cik = 0
~i ,k = 1.2, ...• n, which is tantamount to (51). If f satisfies (51) i.e. (54), then
2 2 2
(54)
af i a f1 a f i . a fi .. 3Cik - ataClk - aqja~k qj - a<ijaClk qj :: 0 (55)
The first three terms in (55) do not depend on q .• J hence the coefficients of q. must be zeroes:
J
so that
and
af.
2 a f.
---.-,--..l = 0 aqjaqk
a< = <Pik{t.9.)
~.j
fi = <Pi~(t,9.)q~ + ~i(t,9.)
(56)
(57)
(running ~).where <P'k and ~. are arbitrary differen-1 1
tiable functions of t and 9.. Inserting (56) and
45
F
II
(57) into (55) we get
a~i~. a~i a~ik a~ik. aqk q~ + aq; -at - aqj qj '= 0
i. e. ~--- q +----=0 ( a~i' a~ik). a~i a~ik aqk aqj j aqk at-
This identity is possible only if (~j = 1.2 •..• n)
a~ij a~ik _ a~i a~ik - (58) - -- = 0 and - - -- = 0 aqk aqj aqk at
Whenever the conditions (58) are satisfied. ~.dt + , + ~ikdqk is an exact differential. Then a function Fi(t.~) exists such that
aF. aFi ~i = ~ and ~ik = aqk
Substituting (59) in (57) we get
aF. aF. f ,. +-' i = aq- q~ at
~
( 59)
¥i = 1.2 •...• n. which is the same as (53). Q.E.D.
Therefore. every f satisfying (51) identically has to be of the form (52). But then
t (60)
q(t) J c& £[T.~(·r)] dT = £[t.~(t)]-E[to.~(to)] to
so that Lagrangian does not contain integral functiona1s any more. and reduces to the classical case (14). which - according to the Rund's Theorem - cannot serve the purpose.
t
+ J ( ~i -if * )c~(T)dT to t
c~(t) = * c~(t) + J( *- -if * )o~(T) dT to
(61)
because c~(to) = Q (see Fig. 3)
'I.
~--~t~o------~~~--~t----~~
Fig. 3. Variation of the path
The first term in (61) represents the contribution due to variation at T = t; it vanishes when o~(t) = Q (Fig. 4). Integral term in (61) collects all the contributions due to variations between to and t.
The path-independence of q(t) means that its value is the same no matter which path is followed between two fixed points (Fig. 4). If
Equation (60) at the same time proves sufficiency t of the statement in
Theorem 3: q(t) is independent of the path (i.e. q is a state-function) iff f is given by (52).
Proof of necessity: If q(t) is independent of the path. its increment. oq.-corresponding to an arbitrary variation of the path between two fixed points. is zero: oq(t) = Q . The first variation of q(t) is: .:.
.:. t
Fig. 4. Independence of q of the path we set oq(t) = ~ and o~(t) = ~ in (61). path inde-
.:.
46
F
pendence is assured if
t .
J (~; -i if) 09.(T )dT = Q to
for arbitrary 69.(.). This can be true iff
al "' ~ ~ = a 39. dT 09. -
for every 1: E [to, tJ and every 9.(T), i.e. if f satisfies the equation (51). But then by Theorem 2. f is given by (52). Q.E.D.
From Theorems 2 and 3 follows Corollary 1: Integral (10, is path-independent iff f satisfies (51).
Corollary 2:The most general state function of the form (11) has f given by (52):
d [ ] aE 3E. f = at f. t,9.(t) = at + 39. 9. (62)
which is linear in~. (Of course, any function linear in 9.:
(63)
with arbitrary i and 1 does not always generate a state function). Theorems 2 and 3 show that one can get rid of the last (anticipative) term of Euler's equation iff integral q is path-independent. But then one can eliminate-integrals from ~agrangian, because f is given by (52), and q represents a function of t and 9.. as (60) show~. Euler's equation is then reduced to
and, since
~ _ ...!! a~ + (2.t.)T ~ = a a~ dt 39. 39. a~ -
u. = aE a9. a9.
Isee (62)/. eventually to:
~ _ d~ + (1E.}T ~ = a 3.9. at a 9. Cl9. a9 -
(64)
(65)
On the other hand. in this case Lagrangian is a classical one, of the type (14), so that Lagrangian equations also apply. L depends on 9. both explicitly ~n~ implicitly, through f.(t,9.). Using the symbol a/a9. as partial derivative which takes
47
into account both explicit and implicit dependence on 9.. we get:
aL daL_aL+(<lE)ToL d3L 39. - dt IT - a9. 09. a9 - at aI
so that Lagrangian equation A A
aL d aL 39. - df N = Q. (66)
gives the same result as Euler's equation. This shows that in the case when lagrangian contains path-independent integrals. one can use Lagrangian equation (66) rather than Euler's equation. If any of the integrals is path-dependent. Lagrangian equations are no more a)propriate, and one must use Euler's equation. (* Following examples will illustrate this pOint.
For dissipative systems Lagrangians with path-independent integrals are no better than Lagrangians without integrals; according to the Rund's theorem, they are both unable to generate identity (3).
2.2.4. Examples
~. Lagrangians (6) and (8) contain integrals with insegrand f = ~(q)~ in the first case, and
f = e±2~~(q)q in the second.They both belong to the
class f A G(q)q, which is a special case of (52) with q
F( t,q) = J G(x)dx qo
One can easily verify that they both satisfy (51).
Integrals from (6) and (8) are, thus, path-independent, which justifies the use of Lagrangian equations in (5).
On the other hand, the integrals could be eliminated from (6) and (8). Theorem 1. generates an equivalent lagrangian
B tl B
L' = e~t ~2 + ~(q)q Jr (_eMr ) d. = t
* Another special case when one can use Lagrangian equations, even if 9 is path-dependent, is the class of Lagrangians (48). but then, according to Theorem 1. L has to be replaced by an equivalent Lagrangian L'.
t ; j;
B t in the first case; the tenn ~ ~(q)q eP; 1 may be
omitted as an identical solution of Lagrangian equation, and finally
= /~[~q2 _ .p(q)q (t 1 - t)]
±~ • once again. the tenn e .p(q)q tl may be omitted, and finally
±2~ L' = e [ ~q2 + .p(q)q t] (68)
Lagranglans (67) and (68) are equivalent to (6) and (a) respectively, and at the same time they are simpler.
Ex. 2. If an external force is applied to the system, equations (5) and (7) are to be replaced by
Mij + 82q + .p(q) = Q(t)
and Mij + Blq2 + .p{q} = Q(t)
Appropriate lagrangians are (15)
q
(~q2 - / <l;(x)dx + Q(t)q] qo
B -2t
L I = eM [~ 42 + ;2 q.p(q) + Q(t)q]
for (69), and
(69)
( 70)
(71)
(72)
B q
L = e 2w1
q ~ q2 - f B B (73) ~lX 2~q
qo e 4>(x)dx + 2~1 e Q(t)
B
L I = e 2;{lq [~cj2 + q.p(q)t + 2~1 Q(t)] (74)
If both linear and quadratic dissipation are present:
Mq + Blq2 + B2q + .p(q) = Q(t) (75)
with Bl; 0 and B2 ; 0, it was shown in (15) that
B B B ~ +;t-~ . it X
L = e 2 q2 - e
B B B q 2p;l x ~q+i"t Je .p(x)dx + ~ Q(t)e
qo .
(76)
and B B
~q + -2t rl ] L' = e M l~q2 + ~2 q.p(q) + 2~1 Q(t) (77)
L' can be derived from L applying Theorem 1.:
B tl B .
+ e~ q 4>(q)q J _etf T dT t .
This is equal to the expression in (77) if one drops the term containing t l , which satisfies Lagrangian equation identically.
Ex. 3. With the multiplier method in mind, it seems reasonable to investigate the lagrangian:
L(t.q,q,~) = e~Ll(t,q.q) (78)
When substituted in Euler's equation (31) it gives
aL 1 d ( all) af eq - - -=- eq .,......-- + ~ eqL l + • aq ut . aq aq'
tl
+ { ~~ - d~ *)J e9 Ll dT = 0 t
(79)
for (70). Corresponding Lagrangians Land L' are ob. Now taking viously related by Theorem 1. f = ~ (80)
48
It is easy to check that Lagrangian equation with L'
Ll = ~ q2(t) - ~ q2(t) + Q(t)q (79) becomes:
(81) from (84) gives the same result (83) as Euler's equation with L from (82).
~t-to) r, B ] e LQ(t) - Kq - Mq - MMq = 0
which is equivalent to
Mq + Bq + Kq = Q(t)
With the choice (80) - (81) Lagrangian (78) is equal (except for the insignificant constant factor
B :M to e ) to the Lagrangian (71) with ~(q) = Kq and
qo = O.
Taking f = ~~i~ and the same Ll as in (81) the result can obviously be extended to the case of time variable coefficients(3).
The multiplier method is thus incorporated in our theory.
Ex. 4. Lagrangian given in(18), apart from its topological complexities, is essentially of the form t t (82)
L l·T M' 1 T K 1 f· T • J T( ).( = 2S _9 - II _9 -"2 9 ~9 dT + Q. T 9.: T)dT to to
This Lagrangian contains two path-dependent inte-
The very fact that there is an equivalent Lagrangian without integrals is an evidence that Lagrangian (82) does not generate dynamical equations for linear dissipative systems; the result only confirms thi s concl us i on.
The last term in (83) does not have any physical significance. Therefore, Lagrangian (82) does not fulfill the expectations of (15). At the same time, Lagrangian (84) has the same effect, but its structure is simpler --- formally and in principle.
3. CAN EULER'S EQUATION GENERATE DYNAMICAL EQUATION OF LINEAR DISSIPATIVE SYSTEM?
Negative result in Ex. 4. is not incidental, i.e. due to an unfortunate choice of Lagrangian. As a matter of fact, it is a necessary consequence of
Theorem 4: There is no Lagrangian function of t, q, q, and ~, where ~ is defined by (11), which generates the identity tl d aL aL aL af + ( d af af )/ aL
Of aq - ail - ail aq Of aq - ail ail dT -. t .
= Mq + Bq + Kq - Q(t) (85) grals, because their integrands do not satisfy (51). Therefore, in using this Lagrangian one must apply Proof: Using handy notation Lq ~ ~~. etc., and taking
Euler's equation, rather than Lagrangian equation. total time derivatives:
In this case:
aL ail = H-] f=
so that
(*)T = [at (~T ~ g)! at (Q.T~)] = [2~g \ Q.(t)]
and Euler's equation gives:
- ~9 - Mg - B9 + g(t) + [~g - ~(t)](tl - t) = Q. (83)
It is possible to formulate for Lagrangian (82) an equivalent Lagrangian, which does not contain integrals; by Theorem 1.:
(84)
49
d dt fq = fqqC:i + fqqq + fqt
(85) becomes~tl
(Lqq + fqq J LqdT) t .
+ Lq' t - L + L· f - L f· + (f. -q q~ ~ q qt
= Mq + Bq + Kq - Q(t) ( 86)
The only terms involving q are the first two terms on both sides of (86), so that we must have
tl
L.. + f.. J Lq dT = M (87) qq qq t
Differentiating (87) with respect to t we get:
d L. ·-M Lq = df~ (88)
when fqq 1- 0
Lemma 1: (88) is equivalent to (87) iff
Lqq = M at t = tl ( 89)
Proof: (8l) satisfies (89) and at the same time implies (88). On the other hand, from (88)
t
Lqq-M I -Lgg-M/ = f f.. f .. qq t=t qq t=t* t*
for any t*; especially for t=tl
tl L .• -M L· ·-M J ~f J - ~f I = L dT .. .. * q qq t=tl qq t=t t* .
Subtracting the last two equations, and taking (89)
into account we get:
t tl t tl L··-M / J ~:= L dT - L dT:= f.. q q qq t*· t*·
f Lq dT = -J Lq dT t 1 • t·
which is identical to (87). Q.E.D.
We can safely take the assumption fqq 1- 0, because the case fqq = 0 is of no interest, as Lemma 2 proves.
Lemma 2: If fqq = 0 the Euler's equation does not generate Sq - term.
Proof: If f·. = 0, then L .. = M ¥to In this case: qq qq
f q = A( t,q)
f = A(t,q)q + S(t,q)
Lq = Mq + F(t.q,~) _ 1 ·2 ( ). ) L - 2 Mq + F t,q,~ q + G(t,q,~
where A, S, F, and G are arbitrary functions of their respective variables. Euler's equation then
50
reduces to
None of these terms contain q, so that the case
fqq = 0 can be excluded from further considerations. (Especially when At-B = 0 q is path-independent, q • and then we know that Euler's equation reduces to Lagrangian equation, which cannot produce dissipative term). Q.E.D.
The condition (87) eliminates the first terms in both sides of (86), and al.so helps to eliminate integrals of Lq:
M-L .• (L. + f· ~) q + L· t - L + L. f-f.L + qq qq qq q q q~ q ~
M-L .. + (f· t - f ) ~ = Bq + Kq - Q(t) q q qq
From (88) we have:
L· .-M L =(2+·2+" -!+ f.L).=.gg-.:..:. q at q aq q aq aq f .. . . qq
But Lq does not depend on q, so that
and
From (91):
so that:
L· ·-M ~ = F (t,q,~)
L·· = M + f·· F (t,q,q) qq qq .
(90)
(91)
(92)
L· := Mq + f· F(t,q,q),+ G(t,q,q) (93) qq. . and:
L = } Mq2 + f F + qG + H (94)
where f=f(t,q,q), while F, G, and H are arbitrary functions of t, q, and q.
r I
I !
These results simplifY (90):
(f. F + f·F + G - f· F) ~ + f·tF + f.F t + Gt -qq q q q qq q q
- f F - f F - ~G - H + (f·F + G ) f -q q q q q q q
- f· (Ft + F q + F f) - (f' t - f ) F = Bq + Kq-Q(t) q q q q q i.e. Gt - fF - H + G f = Bq + Kq - Q(t) q q q (95)
Taking derivative of L in (94) with respect to q, and comparing it with (92) we get:
f F + ~ G + H = F t + ~ F + f Fq 9 9 9 q
i.e. (G - F ) q + H - F
t = 0
9 q 9 which is possible only if
Gq = Fq , Hq Ft
But then (95) is reduced to
Gt - Hq = Bq + Kq - Q(t)
.
(96)
(97)
which is impossible because neither G nor H depend on~. Q.E.D.
Remark 1: There is no need to prove Theorem 4. for multidimensional case; one-dimensional case should not be an exception.
Remark 2: The same conclusion holds for the equation
(99)
where ~, ~, y, and Q are arbitrary given functions of their variables, as can be seen glancing at the the former proof. This means that Van der Poh1 equation
q + y(q2_1} q + q = 0 ( 100)
for example, cannot be generated by Euler'S equation.
The Theorem 4., therefore, shows that Euler's equation, just like Lagrangian equation, cannot be identified with dynamical equations of linear (and some nonlinear) dissipative systems. This, however, does not mean that Hamilton's principle for dissi-pative systems cannot be found in the form (16),
type of equivalence - with a multiplier - was already considered in Ex. 1-3. Different type of equivalence is based on observation that Euler's equation can generate the term B~, but accompanied by parasitic terms, like Bq(t1-t} in Ex. 4., which may not influence the solution. This idea is pursued in the next section.
4. HAMILTON'S PRINCIPLE AND MINIMUM THEOREMS IN NETWORK THEORY
For a network composed of capacitors and ideal voltage sources ~(t) only, Lagrangian equations (2) with the Lagrangian
L = !- 9.T ~ 9. - ~T (t) 9. (101)
generate the correct equati ons ~ 9. = ~(t). (~i s the inverse capacitance matrix). Therefore, q's are distributed among capacitors so that
( 102)
has stationary value. In static equilibrium with ~ = const (102) reduces to the well-known Thomson's theorem.
Corresponding theorem for resistive network with ideal voltage sources ~(t) states (26), (27) that
the functi on
(103)
(~- loop currents, ~ - loop resistance matrix) has minimum for the actual distribution of currents. compared with all hypothetical distributions which satisfy Kirchoff's current law. When resistive network with ideal current sources let) is considered, corresponding function is
(104)
where ~ is the node conductance matrix, and ~ nodal voltages. Components of ~ are generalized velocities, while generalized coordinates are the components of flux
t
t(t) ~ Jr ~ (T) dT because Hamilton's principle does not require the identity (85), but only a sort of equivalence. One so that ~=i.
51
The proof could be based on the modified form of Lagrangian equation
d -ll.- ~+ ~= n(t) Of c9. c9. c9. !t
with the Rayleigh's function
R~l9.·TB9.· - 2 -
which reduces for L = 0 to
i.e.
lB. = Q(t) c9. -
( 105)
(105) is the necessary condition for a minimum of R - !IT(t)~, which is only in notation different from (103) and (104).
Although close in spirit to Hamilton's principle, the former theorems for resistive networks have no direct relation to it. The connection with Hamilton's principle could be established if the Lagrangian
t
L = J [t ~T R ~ - IT(,:)~J dT (106) to
is considered. This is a Lagrangian with the integral, actually composed of only one integral with
( • ) 1· T . T() . f T ,9.,9. = '2 9. R 9. - I T 9. ( 107}
Stationarity condition for the action functional
tl t
J J [1 • T . T()'J ( J = '2 9. R 9. - I T 9. dT dt 108) to to
is the Euler's equation (31), which reduces in this case to
(Ri - I) - ~ (R~ - I) (t l - t) = 0 (109)
This is not the equation
R ~ - I( t) = 0 (110 )
for the flow in resistive networks, but it can be shown that (109) and (110) are equivalent, i.e. they have the same solutions for the same initial
52,
conditions.
Proof: Equation (110) obviously implies (109). On the other hand, introducing
I(t) ~ R~ - I(t)
for brevi ty, (109) has the form
I - (~ - t) ~t = Q.
Now for the kth component of I
dXk Xk - (t1 - t) c.rt = 0
i.e. Xk (t) t
J d~~ = J Xk(tO) to
so that the solution of (109) is
i.e.
(111 )
( 112)
( 113)
Initial values i(to), common to (109) and (110), satisfy (110):
( 114)
so that I(to) = Q.. (113) then implies I(t) = Q. in [to,t l ], which means that (109) implies (110). Q.E.D.
Therefore, Hamilton's principle for resistive networks with ideal voltage Sources is
tl t
IS f /[t p"T R ~ - IT(T)i] dT dt = 0 (115) to to
Similarly, for resistive networks with ideal current sources
tl t
IS J J [~yT R y - IT(T)y] dT dt = 0 (116) to to
It should be noted that the Lagrangian (106) is a special case of Lagrangian (82) with the conservative part (first two terms) omitted.
Lagrangian (106) also belongs to the class (48), with Ll = 0 and h2= 1, and therefore can be replaced
by an equivalent Lagrangian without integrals:
When (117) is substituted in Lagrangian equation (47), it generates the equation
( 118)
which is identical to (109).
At the same time (118) supplies an additional proof of equivalence between (109) and (110): (118) im
plies
(t1-t)(!!.9.. - g) = const
and the constant is zero, which follows from (114) for t=to. or directly for t=tl; but
(tl-t)(! 9.. - g)= Q.
can be true for every t in [to.tl ] only if
!!.9.. - Q(t) = Q. Q.E.D.
Hamilton's principles (115) and (116) can. there
fore, be replaced by
iational principle with Lagrangians containing integrals exist. ensuing analytical formalism can bear only an indirect relationship to dynamical equations for dissipative systems.
However. all those difficulties might have been caused by an insufficiently general framework. Integrals considered here may not have been general
enough. Integrals 1 ike
t
~(t) = f ~ (t,T) f [T.!l(T).9..(·d] dT - to
( 121)
or even t
~(t) =J f [t.!l(t).gJt);T.!l(T),g.<-d] dT
to
(122)
may prove to be more suitable for dissipative systems. This seems to be the case indeed: recent developments (28)-(30) in variational operator theory (31)-(32) lead to Lagrangians with integrals as a result. The integrals which appear there are of more general nature than those admitted in Lagrangian h~re. Hence
tl the result presented here need to be generalized in () f (tl-t) [t 9..T !!. 9.. - IT(t) .9.] dt = 0 (119) order to make a meaningful comparison.
to On the other hand. the idea of equivalence between Euler'S equation and dynamical equations is worth pursuing further in order to generalize the result
of Section 4.
and tl () f (tl-t)
to
respectively (in (120) ~ = 1)· CONCLUSION
Apart from its possible use in the theory of dissipative systems, the theory of Lagrangians with inte-grals offers variational formulation for a class of
Although the framework of variational calculus was differential-integral equations (31) and (39). and extended by introduction of Lagrangians with inte- variational principles for systems described by such grals. Hamilton's principle was established only for equations. like some hereditary systems (25)
purely dissipative systems. Even in this case Lagrangians with integrals turned out to be unnecessary: the same equations could be generated by Lagrangians without integrals. Furthermore, the result was achieved only indirectly. through an equivalence between dynamical equations and Euler's equation; it could not be otherwise, in the light of Theorem 4. and Rund's theorem. Variational formulation, simple and direct as one known for conservative systems, cannot be designed for dissipative systems using Lagrangians with integrals. If a var-
53
REFERENCES
1. Rund, H.: SYSTEMS OF FIRST-ORDER LINEAR ORDINARY DIFFERENTIAL EQUATIONS AS CANONICAL OR EULER-LAGRANGE EQUATIONS, "J. Math. Physics". Vol. 8. No. 10. Oct. 1967. pp. 2059-2062.
2. Havas, P.: THE RANGE OF APPLICATION OF THE LAGRANGE FORMALISM-I. "Nuovo Cilllento Supplemento". Vol. 5. 1956, pp. 363-388.
3. Rohrer, R. A.: ANALYTICAL MECHANICS AND THE ANALYSIS OF LINEAR TIME-VARYING NETWORKS, "J.
= I ;;i: == --
Franklin lnst." Vol. 266, 1958, No.1, pp. 27-38.
4. Denman, H. H.: ON LINEAR FRICTION IN LAGRANGE'S EQUATION, "Am. J. Phys.". Vol 34, 1966. No. 12. pp. 1147-1149.
5. Denman. H. H.: TIME-TRANSLATIONAL INVARIANCE FOR CERTAIN DISSIPATIVE CLASSICAL SYSTEMS. "Am. J. Physics", Vol. 36, 1968. No.6, pp.
516-519.
6. Denman. H. H. & Buch. L. H.: SOLUTION OF THE HAMILTON-JACoeI EQUATION FOR CERTAIN DISSIPATIVE CLASSICAL MECHANICAL SYSTEMS, "J. Math. Physics", Vol. 14, 1973. No.3, pp. 326-329.
7. Van der Vaart. R. H.: VARIATIONAL PRINCIPLE FOR CERTAIN NONCONSERVATIVE SYSTEMS. "Am. J. Phys. II Vol. 35. 1967. No.5, pp. 419-423.
8. Buch. L. H. & Denman, H. H.: SOLUTION OF THE SCHRODINGER EQUATION FOR SOME ELECTRIC FIELD PROBLEMS. "Am. J. Physics". Vol. 42, April 1974. No.4, pp. 304-309.
9. Kerner, E. H.: NOTE ON THE FORCED AND DAMPED OSCILLATOR IN QUANTUM MECHANICS, "Can. J. Phys.". Vol. 36, 1958, pp. 371-377.
10. Besieris, I. M. & Fatic. V.: QUANTUM MECHANICAL DISTRIBUTION FUNCTIONS FOR A SPECIAL CLASS OF DISSIPATIVE SYSTEMS, "Bulletin of the Amer; can Phys i ca 1 Soci ety". Ser. II, Vol. 19. No. 10, 1974, p. 1128.
11. Denman, H. H.: LAGRANGIAN FORMALISM FOR VIBRATIONS IN SOME DAMPED CONTINUOUS SYSTEMS, "Journal of Sound and Vibration". Vol. 36, 1974. No.4, pp. 485-489.
12. Fatic. V. M. & Blackwell. W. A.: HAMILTON'S PRINCIPLE FOR THE LOSSY TRANSMISSION LINE, "Proc. 1976 Conf. on Information Sciences and Systems", The John Hopkins Unlv., Baltimore, Md., March 1976, pp. 279-285.
13. Morse,~. M. & Feshbach, H.: METHODS OF THEORETICAL PHYSICS, Vol. I. McGraW-Hill, 1953.
14. Leech, T. W.: CLASSICAL MECHANICS, Methuen,
1965.
15. Fatic. V. M.: ON VARIATIONAL PRINCIPLES FOR DISSIPATIVE NETWORKS AND SYSTEMS. Ph.D. Dissertation, VPI & SU 1976
16. Fatic. V. M. & BlaCKwell, W. A.: VARIATIONAL MODELS FOR CONSERVATIVE NETWORKS WITH IDEAL CURRENT AND VOLTAGE SOURCES, "Proc. VII Ann. SE Symp. System Theory". Auburn Univ. and Tuskeegee Inst., AL, March 1975, pp. 268-274.
17. Fatic. V. M. & Blackwell. W. A.: COMPOSITE VARIATIONAL PRINCIPLES FOR LINEAR TIME-VARYING LUMPED-PARAMETER SYSTEMS WITH DISSIPATION, "Proc. 8th Ann. SE Symp. on System Theory", Univ. of Tennessee, Knoxville, Tn., April 1976. pp. 95-101.
18. Huang, H. L. & Blackwell, W. A.: NEW LAGRANGIAN AND HAMILTONIAN FUNCTIONS FOR LINEAR DISSIPATIVE PHYSICAL SYSTEMS WITH IDEAL DRIVERS, "IEEE Trans. on Circuit Theory", Vol. CT-1B. 1971, No.4, pp. 461-463.
19. White, D. C. & Woodson, H. H.: ELECTROMECHANICAL ENERGY CONVERSION, Wiley, 1959.
20. Stern, T. E.: THEORY OF NONLINEAR NETWORKS AND SYSTEMS, Addison-Wesley, 1965.
21. Meisel, J.: PRINCIPLES OF ELECTROMECHANICAL ENERGY CONVERSION, MCGraw-Hill, 1966.
22. MacFarlane, A. G. J.: DYNAMICAL SYSTEM MODELS. George G. Harrap & Co .• Ltd., 1970.
23. MacFarlane, A. G. J.: FORMULATION OF THE STATESPACE EQUATIONS FOR NONLINEAR NETWORKS, "Int. J.
Control", Vol. 5, 1967, No.2, pp. 145-161.
24. Jones, D. L. & Evans, F. J.: VARIATIONAL ANALYSIS OF ELECTRICAL NETWORKS, "J. Franklin Inst~
Vol. 293, March 1972. No.3, p. 9-23.
25. Volterra. V.: THEORY OF FUNCTIONALS AND OF INTEGRAL AND INTEGRO-DIFFERENTIAL EQUATIONS, Dover, 1959.
26. Jeans, J.: THE MATHEMATICAL THEORY OF ELECTRICITY AND MAGNETISM, 5th ed., Cambridge at the University Press, 1933.
27. Desoer, C. A. - Kuh, E. S.: BASIC CIRCUIT
···~--~~~·c.~ ______________ ~ __ ~_~
THEORY, McGraw-Hill, 1969.
28. Tonti, E.: A SYSTEMATIC APPROACH TO THE SEARCH FOR VARIATIONAL PRINCIPLES, An article
in the book: "International Conference on Variational Methods in Engineering", Vol. I-II, pp. 1/1 - 1/12. Ed. by Brebbia, C. A. &
Tottenham, H., Southampton University, 1972.
29. Magri, F.: VARIATIONAL FORMULATION FOR EVERY LINEAR EQUATION, "International Journal of Engineering Science", 12(1974), 6, 537-549.
30. Atherton, R. W. & Homsy, G. M.: ON THE EXISTENCE AND FORMULATION OF VARIATIONAL PRINCIPLES FOR NONLINEAR 01 FFERENTIAL EQUATIONS, "Studi es in App 1 i ed Mathemati cs," VoL LIV, March 1975, No.1, pp. 31-60.
31. Vainberg, M. M.: VARIATIONAL METHODS FOR THE STUDY OF NONLINEAR OPERATORS, Holden-Day, Inc.
1964.
32. Mikhlin, S. G.: VARIATIONAL METHODS IN MATHEMATICAL PHYSICS, Pergamon Press, Oxford,
1964.
BIOGRAPHIES
VUK M. FATIC was born in Pancevo, Yugoslavia, on March 22, 1932. He received the Dip1. Ing. degree from Belgrade University, Belgrade, Yugoslavia, and the M.Sc. and Ph.D. degrees from Virginia Polytechnic Institute & State University, Blacksburg, VA, in 1960. 1973. and 1976 respectively. all in electrical p.ngineering. He was a research engineer with "Elektroinstitut" and INTDI. Belgrade. from 1960 to 1968. and an assistant in control engineering at Novi Sad University. Yugoslavia. from 1968 to 1970. During the 1975-1976 academic year he was a Visiting Assistant Professor at Union College. Schenectady. N.Y., and since 1976 he has been an Assistant Professor of electrical engineering at Tri-State University. Angola. IN. He has worked in control, power, laser, and IR engineering. His special area of interest is application of variational calculus in system. network. and field theory. in which he has contributed six papers together with W. A. Blackwell. He is a member of honor societies ~K~ and HKN. and a student member of IEEE.
WILLIAM A. BLACKWELL was born in Fort Worth. Texas, on May 17. 1920. He received the B.S. degree from Texas Technological College. Lubbock, in 1949. the M.Sc. degree from the University of Illinois. Urbana, in 1952. and the Ph.D. degree from the Michigan State University. East Lansing. in 1959. all in electrical engineering. He has been at Virginia Polytechnic Institute and State University since 1966. where he is presently Professor and
Head of the Electrical Engineering Department. He has also been a member of the electrical engineering faculty at five other universities. and has industrial experience with General Dynamics and General Electric corporations. He is a senior member 6f ASEE and NSPE. and is a registered professional engineer in the state of Texas. He has published two texts and numerous papers in his area of specialty, network theory and control engineering.
55
iiji , ,;'11
r :\_
AN ESTIMATION OF PARAMETERS IN A PARABOLIC EQUATION
WITH SPATIALLY-VARYING COEFFICIENTS
Z. Jacyno Department of PhYSics
University of Quebec in Montreal Montreal, Canada
Abstract
An estimator is derived for distributed parameter systems described by a parabolic equation with spatially varying coefficients, based upon the variational approach with usual least-square criterion and the observation data accompanied by noise. As a result the iterative procedure follows with the adjoint nonhomogeneous state equation. The'evaluation of probabilistic properties of the estimator shows the blased'results obtained even with the use of unbiased measurement data.
1. INTRODUCTION The distributed parameter system considered in this paper is described by a parabolic partial differential equation with spatially-varying conductivity coefficient being a known function of space coordinate. Thus the estimator has to provide the values of unknown constants only. The considered estimating procedure uses variational approach, from which an iterative scheme results. It is based upon the noisy 'input data gathered from the system. As a consequence the adjoint state equation becomes non-homogeneous with a "forcing" factor given by the measurements. The estimator provides the best estimation from each set of observations available, charged with noise and measurement errors. For the purpose of probabilistic optimization the procedure needs to be repeated for a statist; ca 11y representati ve number of input data sets. This produces a set of estimated parameters, related in some way to probabilistic characteristics of input noise. Supposing the normal distribution for input noise, the evaluation of the estimator from the probabilistic point of view is sought.
56
2. DETERMINISTIC ESTIMATOR The distributed parameter system under consideration is described by the parabolic partial differential equation Yt=(aYx)x+~(t,x); (x,t)€Qx]O,TC, Q=]O.l[cR, (l)
with a being a spatially varying coefficient, i.e. a=a(x) and the following initial
y{O.x) '" Yo(x); (x, t)€S1XO (2)
and boundary
Yxlx=o -aoh(t) , YxlX=l = 0 , h(t) H '" const.
conditions. This system can be interpreted as onedimensional rod heated along x and at one boundary point and perfectly insulated at another, with y being its temperature. The parameters to be estimated are the spatiallyvarying thermal conductivity coefficient of the form
a '"
with al const., a2. = const.
(4)
(5)
(6)
and the heat exchange coefficient
ao = const. (7)
The estimator $.* = [a~ at a~]T (8)
uses the input data obtained from the k-th set of observations of the system
m(t,x) = y(t,x) t n(t,x), (9) n(t,x) - measurement error and noise
and the usual least-square criterion T 1
J(A) =jf(m-y)2 dt dx (10)
where o 0
A, Y - estimated parameter and state, to provide the best approximation A* of A from the information available at the k-th step, i.e. to minimize J
J(A*) s J(A), A E Aadm (11 )
The estimating procedure G is based on a variational approach [1]. The variation of the state equation (1)
0Yt = o[(aYxh] , (12) after multiplication by an adjoint state vector v and following substitutions
vOYt = (voY)t - vtoy , vo[(ayx)x] = [vo(ayx)]x - vxo(ayx) , avxoyx = (avxoy)x - (avx)xoy ,
becomes (voY)t = [Vtt(avx)x] Oy-vxyxoat(avoyx-avxoyt
tvyxoa)x • (13) The variations of the initial and boundary conditions are
~YI t= 0 = 0 ,
oYxlx= 0 = -h(t)oao
(14 )
(15 )
Green's theorem applied to (13) over the domain of independant variables gives
f (voy) I i~b d\"l= I![vtt(avx)x]oy dt dn -non J
- ~fvxyxoa dt dntJ (avoyx-avxoy t o n IX-l 0 t vyxoa ) x:O dt . (16 )
Thus the variation of augmented criterion with the initial condition
v(T) = 0 ; XEn, (17)
imposed upon the adjoint state vector and the initial and boundary conditions for y taken into
account leads to
OJ~vtt(aVx)x-2(m-y)]oy dt d\"l~~Vxyxoa dt dn+
ilavxoYI X= 1 t[avxoyt(aovh-vyx)oa] I x=O } dt. (18) o
57
This allows to find the adjoint state equation Vt = -(avx)x + 2(m-y) , (19)
with initial condition (17) and boundary conditions
vx Ix=o = 0 Vx IX=l = 0
(x, t)Eo\"lX]O, n, (20)
T bringin
f18) to
~J=- jvxYxoa o n
dt dntj(aovh-vyx)oalx=o dt .(21) o
Introducing the function
1
1, x=O ~(O) ~
0, Xo'O (22)
(21) takes the final form T
oJ=~~-VXyx+~(O)(aoVh-VYx)J oA dt dn . (23) o n
The estimator G consists of the following steps: 1) initialize the procedure by making the first estimation of A, 2) find the state vector from (1)-(3), 3) compute the criterion (10) using the first set of observation data available, 4) find the adjoint state vector from (19), (20), (17) ,
5) from (23), find oA applying one of the known methods (steepest descent, conjugate gradient) to assure the decrease of (10), 6) repeat step 2 until the decrease (23) has atteined the value lesser than that initially imposed. It provides the best estimation A*(k) based upon a set of observation data m(k). The procedures then continues for m(ktl). As a result, the estimation produces a set of the best estimates . {A*(k)} , being a random variable with a certain
probability distribution PG ,related to the characteristics of input random variable n(t,x), true value of the coefficient A and the nature of the estimator G.
3. PROBABILISTIC PROPERTIES OF THE ESTIMATOR
iii
I'
\ I: I'
The noise n(t,x) represents a local random variable at given XEQ. We suppose that it has the normal probability distribution with both the local mean
Il(X) = E[n(x)J = 0, Y xdl (24)
and variance a2 (x) = E[n(x) nT(x)J. Y XEQ (25)
being known. The total values of these parameters
over all the domain Q are
II = !1l(X)dX = 0 ,
a2 = [a2(X)dX
The estimator defined in the previous chapter, symbolically denoted here as
A*(k) :: G [m(k)J ,
(26)
(27)
(28)
can then be characterized, [2J, by its mean value
A ~ E {A*(k) f :: jA*PG(A*)dA*=fi(m)p(m (29)
here as j(A) = (m-y)2 ,
can be also given as j(A) :: (y-y)2+n2+2n(y-y) ,
where jd(A) = (y_y)2
(34)
(34a)
(35) represents (34) for the deterministic estimator. The expectation
E[j(A)J :: (y_y)2+E(n2) , (36) extended over all the domain of independant varia-
bles ;r E[J(A)J = Jd(A)+ ~ a2 dt , (37)
o with
Jd(A) - deterministic least-square criterion, shows that the criterion used is biased with the bias
E[J(A)J - Jd(A) = o2T . (38) and covariance matrix
V~ E (A*-A)(A*-A) T:: j[G(m)-AJ Thus at the ~-th iteration, based upon m(k)
[G(m)-A]Tp(mIA)dm.(30) J~[A(k)]:: J~_l[A(k)] + 6J~ (39) the estimated parameters are biased because of (38).
The condition (24) imposed upon (9) means that the observations are unbiased
E[m(t,x)] :: y(t,x). (31)
To evaluate the estimator G in this regard, each of its steps needs to be considered. The state equation, rewritten expressly in terms
Ofy:h~ [::]f'r:;:Jx . f , (32)
shows the expected value of the state estimation 9 at each iteration linearly dependant upon the expected value of A. The characteristics of m do not intervene here explicitly. The same conclusion holds for the adjoint state
equation vt= - ra1]T [ v] + 2(m-y) , (33)
La2 4> vx x when (24) is taken into consideration. The step 5 uses the results provided by (32) and (33). The statistical characteristics of the observations react here through the adjoint state, appearing as linear function. Consequently it is not affected by m. Finally let us consider the step 3 having an essential importance in providing the best estimation of $.* through (11). The criterion function,denoted
58
The bias can be evaluated through the sensitivity function (23). Consequently the estimator G provides, as a whole, biased results for A. The same approach can be used for the evaluation of the variance.
ACKNOWLEDGMENT The author wishes to acknowledge partial support obtained from the Canadian National Research Council through grant No. A 9516.
REFERENCES [1] Goodson R.E., Polis M. (editors). "Identifica
tion of Parameters in Distributed Systems", The American Society of Mechanical Engineers, New York, 1974.
[2] Bard Y .• "Nonlinear Parameter Estimation", Academic Press, New York and London, 1974.
!
THE SINGULARITY EXPANSION METHOD IN ELECTROMAGNETIC SCATTERING
D. R. Wilton Depa~tment of Electrical Engineering
'Uni~ersity of Mississippi University, MS 38677
Abstract
1. INTRODUCTION
The advent of the high speed, large storage digital
computer has made possible the numerical solution
of electromagnetic scattering and radiation pro
blems which were previously intractable. Most suc
cessful of the various alternative numerical ap
proaches for conducting scatterers has been to first
formulate the problem as an integral equation for
the induced current on the scatterer, and then to
solve the integral equation by the so-called method
of moments (MoM).(l) Formulation is usually in the
frequency domain so that frequency appears as a pa
rameter in the integral operator. If time domain
results are required, they are generally obtained
by standard Fourier transform methods. More recent
ly, however, it has been realized that a more com
pact representation of the solution for both fre
quency and time domain applications can be obtained
by determining the complex frequency-domain singu
larities of the resolvent kernel. This procedure
has been termed the singularity expansion method
(SEM).(2) In the following sections we outline the
method for conducting scatterers, present two sim
ple examples, and summarize some pertinent questions
related to the method and some of its extensions.
2. FORMULATION OF THE SINGULARITY EXPANSION METHOD
2.1 INTEGRAL EQUATION FOR CONDUCTING SCATTERERS
Assuming an exp(st) time dependence, the scattered
59
field at a point r due to a conducting scatterer S
whose induced current is J(r) is given by
-~ Ir-r' 1 -s - -1 (S2 ~ E (r) =-- - - 'V'V.
41TSE 2 e c -----dS' (1)
c S
where c is the speed of light in a medium with per
mittivity E. If the incident field is denoted by -i -E (r), then the application of the boundary condi-
tion, which requires the total tangential electric
field to vanish on the scatterer surface, yields
the operator equation
(2)
-i -i where Et is the component of E tangential to Sand
3 is a linear operator defined by
= r~ (S2 - w'UJ(r~ 41TSE 2 c S
s 1- - 1 - - r-r'
~ c dS'J ,r on S. Ir-r' 1 t
(3) A numerical treatment of (2) amounts to discretiz-
ing the surface current J on.the object and enforc
ing the equality in (2) at discrete points on the
object. A set of linear equations results which (1)
are then solved for the current on the scatterer.
2.2 SPECTRAL REPRESENTATION OF THE OPERATOR AND ITS INVERSE
By the electromagnetic reciprocity theorem,~ is a
complex symmetric operator. The spectral represen-
f filii
','I
"
'I Ii
i: i I
tat ion of the kernel r can be shown to be
1'(I.,r' ,s) ::: ~ A (s) C (r,s) C (r' ,s) L.. n n n n
where {A } and {C } are sets of eigenvalues and n n
(4)
~igenvectors, respectively, defined by the eigen-
value problem
~ C ::: A C . (5) iT n n n The inverse or resolvent kernel is then given by
n Hence, the inverse operator is
J ::: ~-l E;
<r-l(r,r'), E;(r' ~ (.7)
2.3 SINGULARITY EXPANSION OF THE INVERSE OPERATOR
It has been shown that r is an entire function of
the complex frequency s and, hence, that r 1 is
h · (3) I 1 h meromorp 1C. ts po es occur at t e zeros of
An(s), all of which must be in the left half plane.
Thus if s . denotes the ith zero of An(s), then n1
A(s.):::O. n n1
(8)
For three dimensional conducting scatterers, it
appears that a partial fraction expansion of the
form
1 I"TsT n I
i
[A'(s .)]-1 n1
s-s . n1 (9)
is always possible, where the prime denotes differ-
ention with respect to s and the sum is over all
the zeros (possibly infinite in number) of An(s).
The residues in (9) can alternatively be expressed
as
where C . n1
A'(s.) =<C., r,(s.)· C.) n n1 n1 n1 n1
_ C (s .). Thus (6) becomes n n1
~l - -r (r,r' ,s) L C (r) C (r')
::: -/;~C -,1'-:-' ",::,,~-c-.->-=a_(S-_S-) \: a a a a
a
(10 )
(11)
where the two indices on the natural frequencies
have been replaced by a single index a. The nor
malized quantities Ca(r) are called the "modal
currents" and are source-free solutions at the
60
frequency sa'
In a numerical solution, the various quantities ap
pearing in (11) are found as follows. First, since
the determinant of the matrix approximation to the
operator must also vanish at a pole, the complex
resonant frequencies are found by searching the
complex frequency plane for the zeros of the deter
minant. The corresponding modal current is then
readily found as a homogeneous solution of the
matrix equation. Finally, the scale factors
(Ca.r~ • Ca )-1 are computed for each pole.
2.4 TIME AND FREQUENCY DOMAIN REPRESENTATIONS OF THE SOLUTION
From (7) and (11) the complex frequency domain solu
tion for the current is
J(r,s)
a
n(s) c (I.) a
(s-s ) a
(12)
where the "coupling coefficient" n(s) is defined as
n(s) (13)
Eq. (12) constitutes an expansion of the current in
terms of singularities of the integral operator. If
the response due to time harmonic excitationexp(iwt)
is desired, one need only substitute s = iw.
The surface current in the time domain is found from
the Laplace inversion integral applied to (7); atiOO
)(r,t) =2;i J <r-l(r,r' ,s), E!(r,s» estds
a-ioo (14)
The integration along the Bromwich contour in (14)
is along a line to the right of and parallel to the
iw axis in the s-plane. If one assumes that
~l _ _ _ -sTL(r,r') r (r,r',s) ~ D(s) e Re s ~ _00 (15)
where D(s) is an algebraic dyadic function of s,
then Jordan's lemma may be used to close the con
tour. Where the contour is closed in the right
half of the s plane, there will be no contribution
to the integral because there can be no poles in
the right half plane for a passive scatterer. If
the contour can be closed in the left half plane,
then, by the residue theorem,
s t ]"er,t) = I r\/r,t) Cae a
a -i -+ pole terms from E (r,s) (16)
The factor 110. in (16) might be termed a "general-. d" l' ., (4) I . . 11 ~ze coup ~ng coeff~c~ent. t ~s essent~a y an
inner product of the modal current C with Ei and :;.:.:.:.;.;;.;;.-~-...... ..;......;.;.....;;.;;;.;;.".; __ .. ~_ __ a a a gating function in the integrand which depends on
t, TL(r,r') and the form of the incident field,
i.e., all the terms determining the exponential
behavior of the integrand in (14). For sufficiently
long times, the integration will be over the whole
object, resulting in
-i -+ pole terms due to E (r,s) (17)
where, in contrast to (16), the coefficients of
the (damped) exponential terms exp(sat) are time
independent. Obviously, the determination of the
gating behavior of the integrand, which is known
once TL(r,r') is known, is crucial to the SEM.
This determination has not been satisfactorily
solved to date.
3. EXAMPLES
3.1 THE WIRE LOOP
A conducting wire loop is a simple example of a
scatterer or radiator which can be reduced to a
scalar problem. For thin wires, the total current
flowing parallel to the wire axis can be shown to
satisfy the integral equation
1T
J H¢' )K(¢-¢' )d¢' (18a)
-1T
where 110 is the impedance of free space '" 1201T
ohms, and ¢ is the angle between the center and two
points on the loop. In operator notation, (18a)
can be succinctly written as
(l8b)
where E! is the component of Ei in the direction
of increasing ¢ at the wire surface.(5) Because
61
of the rotational symmetry of the structure, the
eigenvectors of the operator defined by (18) (and
hence the modal currents, since they are inde\Jend
ent of s) are proportional to exp(-in¢); i.e., the
current is representable as the Fourier series
00
I(¢ ) I I (s) e- in</> n
(19)
n=-oo
where
-i < in</> i ? I
e ,E¢e¢,S)
n 11 1T a (s) o n (20)
Note that the term illo1T an(s) is just the eigen
value of the operator in (18) corresponding to the
modal current exp(-in¢). an(s) has been defined in
terms of integrals of Anger-Weber functions and its
zeros have been determined for a wide range of loop . (5,6). 1 fl' h s~zes. Dom~nant po es or a oop w~t
n = 2~n(21Tb/a) = 15.0, where b is the loop radius
and a is the wire radius, are shown in Figure 1.
Time domain currents have been computed for a'pulse
excited loop antenna and a plane wave excited loop ( 5)
scatterer by the SEM.
3.2 THE THIN LINEAR DIPOLE
For a thin linear dipole oriented along the z-axis,
the integral equation for the total wire current
has the form
~I = E! where Ei is the component of incident electric field
z along the wire axis. Tesche has computed poles and
associated modal currents for the dipole and some
results are given in Figures 2 and 3.( 7) The
principal difference here as compared with the loop
is that the modal currents (i.e., the operator's
eigenvectors) are not known ~ priori so that the
operator cannot be diagonalized before numerically
searching for the poles. Tesche also illustrates
time domain current calculations.
4. CONCLUSIONS AND RECOMMENDATIONS
The SEM is a useful technique in electromagnetic
scattering primarily because of the compactness of
the representation. The factorization of the.repre
sentation into time-varying and spatially-varying
parts and the explicit exhibition of the manner in
I rni~ Ii"
which the object couples to the incident field are
also important in conceptually understanding the
scattering mechanism. It also appears that the
SEM will lead to the development of analysis and
synthesis techniques for loaded scatterers as well . .' . f (8) as ~n obtaining equ~valent Clrcu~ts or antennas.
It has also been shown that pole information for
antennas or scatterers in free space also complete
ly determines the response of objects immmersed in
d. (5)
homogeneous lossy me lao
Further developments in SEM and extensions such as
the above are hampered by a lack of knowledge in
the following areas:
(1) Further information is needed on the
possible orders and the distribution of
the poles in the complex plane. The
(2)
influence of object topology on the kinds
and distribution of poles is also needed.
An asymptotic form =-1 - -of r (1'.1" ,s) is
needed in order to determine the correct
form of the coupling coefficient in
Eq. (16). The necessity for and the form
of the coupling coefficient is a subject
much in debate at present.
(3) For synthesis and equivalent circuit
problems, it is desirable to know. for
example, if eigenvalues of the integral
operator are positive real. Other prop
erties must be known if synthesis is to
b d • . 1 d' (6) e one us~ng passlve oa lng,
It is hoped that new results pertaining to some of
these problems will be forthcoming in the near
future.
REFERENCES
1. R.F. Harrington, Field Computation by Moment Hethods, Macmillan. N.Y., 1968.
2.
3.
4.
5.
L.B. Felsen, Ed .• Transient Electromagnetic Fields, Ch. 3. C.E. Baum, "The Singularity Expansion Method," Springer-Verlag, N.Y., 1976.
L. Marin and R.W. Latham, "Natural-mode representation of transient scattered fields," IEEE Trans. Ant. and Prop., AP-21, pp. 809-818,---Nov., 1973.
L.W. Pearson, "The Singularity Expansion Representation of the Transient Electromagnetic Coupling through a Rectangular Aperture," Ph.D. Thesis, Univ. of Ill., 1976.
K.R. Umashankar. "The Calculation of Electromagnetic Transient Currents on Thin Perfectly Conducting Bodies Using Singularity Expansion
62
Method," Ph.D. Thesis, Univ. of Miss., 1974.
6. R.F. Blackburn, "Analysis and Synthesis of an Impedance-Loaded Loop Antenna Using the Singulari~y Expansion Method," Ph.D. Thesis, Univ. of M1SS., 1976.
7. F.M. Tesche, "On the analysis of scattering and antenna problems using the singularity expansion technique," IEEE Trans, Ant. and Prop., AP-21, pp. 52-63, Jan. 1973.
8. C.E. Bauro, "Emerging technology for transient and broad-band analysis and synthesis of antennas and scatterers," Proc. of IEEE. Vol. 64, No. 11, pp. 1598-1616, Nov. 1976.
BIOGRAPHY
Donald R. Wilton received his B.S •• M.S. and Ph.D.
degrees in 1964, 1966. and 1970, respectively, all
from the University of Illinois. Since 1970 he has
been with the University of Mississippi where he
holds the rank of associate professor.
iwb r--r--r--~-'---r--..... -r 2l a r '''r/j~ ::r·,,)., "'"i .
t/', ',I, : "!. /.0 60~ 'j', /'" : ,
f /., "1~' 30[!, " r " : I "! , Figure 2, Poles in s-plane
. 0.0 I ' /' 'L' ~;J!)j for linear dipole. wire diam-Lcb -6.0 -~.O 0.0 eter / leugth " 0.01 (from
, (7) . Tesche Figure 1. Poles ln
s-plane for wire loop.
- - n*2;.,L.1 (b)
(a) "3,101
Figure 3. Modal currents for wire of Figure 2.
l
VARIATIONAL PRINCIPLES FOR MECHANICAL AND STRUCTURAL
SYSTEMS WITH APPLICATIONS TO OPTIMALITY OF DESIGN
V. Kornkov Dept. of Mathematics Texas Tech University Lubbock, Texas 79409
ABSTRACT
Traditionally design of engineering systems was the prerogative of "a design engineer" who drawing on his experience produced some kind of an acceptable design. Then some changes or improvements would be made, after an analysis of the performance of the preliminary design disclosed some undesirable features, or hon-complience with performance criteria.
Only in the last decade were some efforts made to optimize the design by introducing some mathematical theory and defining the concept of optimality. On the surface the problem is similar to some classical problems of control theory.
The behavior of the system is governed by a set of differential equations of the form
L(z) = f
subject to some initial and boundary conditions.
Here z denotes the state vector, f is a given forcing term, L is a differential or integral operator.
For the time being the domain of L, and the spaces of functions, or generalized functions to which z and f belong, will be left deliberately vague.
There is of course a perfectly good explanation. All of the above must be determined by physical considerations rather then mathematical convenience.
Since the design is not fixed, the design vector ~ is introduced, u £ U, where U is the set of admissible designs, 1.e. designs which ~re physically feasible and satisfy a priori imposed constraints.
Unlike control theory problems, not only ~ and possible ~ depend on ~, but also the differential operator L depends on~. Tnis is easily 1llustrated by offering a very simple structural design problem. Suppose we wish to design a beam of minimum weight, supporting a given distributed weight over a span of length 1.
Maximum permissible stress, and maximum deflection are given. For simplicity of discussion we restrict the design to the I-beam shape welded from strips of constant thickness h. The design vector,
63
contains three components u l ' u 2 ' u 3 as shown.
L~'777':~
TI-~2~~1 ul
~ ~ T~3~
The cross-sectional area A (u), the moment of inertia about the neutral axis I(u), the deflection, ~(~), and the prescribed constraints all clearly~depend on the design vector ~ = (u
l' u 2 ' u
3). The state
variable w satisfies the differentia! equa~ion
2 2 ~2(EI(u)d w2 ) = f(x) dx dw (A)
If the elastic support is assumed, the inhomogeneous term f also becomes a function of the design vector~. The optimal design problem can now be formulated.
Minimize J(~) = p J~ A(~) dx, subject to constraints ~w) = 4>0 '
o(u, f(u) < 00
•
The state equation (a) has to be regarded as a constraint condition imposed on w (~) .
The existence of Fieclet derivatives for the extended problem of minimization of the functional
10J(U) + 11(4) - 4>0) + 12(0 - 00
) = ~(~, w(~), a"(~),~.l
has been shown in [2] (Haug and Ko~k~v), under the hypothesis that w is an element of Sobolev space W2 '. The stability (i.e. sensitivity) analysis is much harder but partial results were also obtained in [2] for the structural case. For basic theory concerning sufficiency conditions see [5].
The results of [2] are somewhat restricted by specific assumptions made on purely physical grounds, which are applicable to structural analysis. So far no general stability theory exists.
A much more serious defect is the absence of existence and "un"iqueness theorems. While reasonable copying of parallel results in control theory would offer some existence results, and theorems of this type are quoted in this paper, they are not immediately applicable to mechanical or structural systems. In fact a simple example of nonexistence was offered by Komkov and Coleman in [1], and for the sake of completeness it is well known to be non-unique, but certain functions of such designs are unique in analogy with the control problems. (see Komkov [3], or [4], chapter 2.)
Taking a completely new approach to problems of optimal design, it is shown here that in the physically unrealistic case of relaxed designs an optimal solution always exists, and consquently approximately optimal admissible designs exist. This is illustrated on a specific example of structural design with a dynamic load. Some recent results considering variational principles for dissipative systems are included in the derivation of sufficiency conditions for optimality of design (See [6]).
64
REFERENCES
[1] V. Komkov and N.P. Coleman, "Optimality of design and sensitivity analysis of beam theory", International J. Control, Vol. 17, #3, (1972) p. 455-463.
[2] V. Komkov, "Optimal Control Theory for the Damping of Vibrations of Simple Elastic Systems", Springer Verlag, Berlin (1973), Lecture Notes in Math. #253.
[4] Russian translation, revised and corrected, foreward by Se1ezov, Mir, Moscow, 1976.
[5] V. Komkov and N.P. Coleman, "An analytic approach to some problems of optimal engineering design", Archives of Mechanics, Vol. 27 #4, (1975) p. 565-575.
[6] V. Komkov, "Another look at dual variational principles", to appear in J. Math. Anal. App1. in 1978.
6S
" ':
A NUMERICAL CALCULATION METHOD FOR SIMULTANEOUS ORDINARY DIFFERENTIAL EQUATION
OF HIGHER ORDER BY THE MOMENTARY DIAGONALIZED MODAL PROPERTY*
Baxter F. Womack Department of Electrical Engineering
and Electronics Research Center The University of Texas at Austin
Austin, Texas 78712
Sadao Azuma Ibaraki University
4-12-1 Nakanarusawa-cho Hitachi-shi, Ibaraki-ken
JAPAN 316
Abstract A new numerical calculation method is treated for solving simultaneous ordinary differential equations of higher order as initial value problems. In this method of numerical calculation, the concept of a diagonalized modal property is introduced to achieve the simplification.
1. INTRODUCTION
For the transient analysis of a control system,
the analog-hybrid computer is generally used. It
has drawbacks, however, including limited accuracy
of calculation, the determination of many con
stants, and the settings of nonlinear elements
that are necessary to achieve proper scale factor.
This requires much work and skill on the side of a
user. The digital computer, on the other hand,
has its own advantages, even though it is not so
suitable for calculating dynamic behavior of a
system. If the program of simulating the dynamic
process of control elements is incorporated, a
solution in the required analysis can be readily
obtained simply by giving as the input the infor
mation constituting the control system, not
troubled by such problems as scale factor. Digi
tal simulation is then suited to the analysis of
a complicated control system, and, for example,
can be useful in the design of an optimum
controller.
*This research was supported in part by the DoD Joint Services Electronics Program through the Air Force Office of Scientific Research (AFSC) Contract F49620-77-C-OlOl, and the Japanese Ministry of Education.
66
A control system in general is made up by several
combinations of simple elements described by trans
fer functions or by simultaneous linear ordinary
differential equations of higher order. Several
types of digital simulators are now available for
carrying out the integrations and numerical calcu
lations. Which to use is thus optional.
Numerical solutions for the transient analysis of
a given system are of two types, however, as
follows: First, in the case of an extremely gra
dual change of the functions and the requirement
for high accuracy, for example, in cosmic space
dynamics, the usage of ultrahigh accuracy formula
is advantageous. Second, in the cases of an
extremely rapid change of functions, a large scale
with high complexity, or the simulation of a com
plex control system, the employment of perfect and
tough formula, though of lower order, is useful.
In the present study, from the viewpoint above, an
attempt is made to develop an improved numerical
calculation method in digital simulation for simul
taneous linear (or nonlinear) differential equations
of order n which describe a control system with
and without delay time.
2. FORMULATION OF NUXERICAL CALCULATION METHOD
2.1 DIFFERENCE FORM OF THE STATE SPACE EQUATION
The nonlinear. nonautonomous concentrated-parameter
continuous system may be given as
x = f(X.U. t) (1)
where on the right-hand side. fi(X.U.t) is a vector
function with i=1.2 •...• n. in columns. X is an
n x 1 vector; U is m x 1.
The hyper-curved surface fi(X.U.t) intersecting a
representative point of the (n+m+l) dimensional
space at the existing point of time tN' that is.
the static relation of nonlinear multi-variables is
schemed to momentary linearization as follows:
i 1.2 ••••• n. j = 1.2 •...• n. k
Equatid., (1) can be described as
it -i
n n
L: aij~j + L: bi~ j=l k=l
i 1.2 ••..• n
Rewrite Eq. (3) as
it = -i
m
l3 i = k bi~k
(2)
1.2 •...• m
(3)
(4)
Accordingly. Eq. (4) can be denoted by the follo
wing momentary linear state space equation.
X A*X + 'U
where x -1 '\ 0
~2 0 Ct2
~3 0 0
X = ~ A* 0 0
~ 0 0
['~ 132.!! u= 13k
S U k = 1.2 ••••• m
0
0
~ 0
0
0
0
0
Ct4
0
U =
(5)
o o o o
a n
[J 67
Assume that each output. Yi is a momentary function
of the state vector X and the input vector U and is
determined by Eq. (6) including t explicitly.
y = F(X.U.t)
The momentary output equation corresponding to
Eq. (5) is given as
y = CX + DU
where C is the ~ x n matrix and D is the ~ x m
coefficient matrix.
(6)
(7)
To substitute a difference form for the momentary
linear state space equation given as Eq. (5). with
the initial value X(tN)=~' we use the solution as
i "'t X("'t) = exp(A*"'t)~ + 0 exp{A*("'t-T)}k(T)dT (8)
Assume the inputU(T) to be constant U in the N
integral section and let X("'t) = ~+l be the state
quantity "'t time afterwards; the solution then
takes the difference form:
-1 X 1 = exp(A*"'t)X + {exp(A*"'t)-I}A* U N+ -~ N
Define G* and H* as follows:
G*
H*
exp(A*"'t) }
(G-I)A*-l
The difference equation becomes
~+l G*~ + H*UN where
gl 0 0 0 hI 0
0 g2 0 0 0 h2
0 0 g3 0 0 0 G*= H*=
0 0 0 g4 0 0 0
0 0 0 0 g 0 0
i 1,2, •.• ,n
0 0
0 0
h3 0
0 h4
0 0
-1 Cti
(9)
(10)
(11)
0
0
0
0
h n
In Eq. (11). since matrix A* is diagonal, G* and H*
take the same diagonal form •. Equation (11), there
fore, is finally separated into n-scalar linear
difference equations.
(12)
i = 1,2, ••. ,n
I
i i
2.2 DIAGONALIZED MODAL ELEMENTS
In the process described of introducing the dif
ference equation, each element ~i of the diagonal
matrix ~* which determines the gi and hi values
has been given only as
(13)
i = 1,2, ••• ,n
Ordinarily, however, the values of elements ~i are
calculated independently of the modal matrix for
diagonalization. The ~i values themselves contain
information concerning the direction of an eigen
vector and also the mode of determining exp(~i~t)
an instant after. It thus varies successively
with the transition of each state quantity.
The value of coefficient ~ .. in the infinitesimal ~J
interval ~t is taken to be constant due to the
assumption in Eq. (3). There is then the need that
the respective state quantities in Eq. (13), deter
mining the direction of state trajectory, be
expressed as ~xi' in the same time interval.
In consequence, the relation is,
(14)
i 1,2, .•. ,n ;
the infinitesimal change of each state variable,
though it is an unknown quantity, satisfies
approximately the relation ~Xj/~Xi ~ Xj/xi'
The 'relation (14) can thus take the form:
a .. x. ~J J
i 1,2, ... ,n
~i' formed as in Eq. (15), will be called the
diagonalized modal elements" or "diagonalized
modal coefficients," and A*, the "diagonalized
modal matrix."
2.3 INTEGRAL MODAL COEFFICIENTS
(15)
Consider that the diagonalized modal elements ~ i
are all determined by the value of X, and join the
values in the difference equation for convenience
68
in numerical calculation~ then Eq. (11) can be
rewritten as
~+l = ~ + H*~ (16)
The difference form in Eq. (16) will be compared
with the existing numerical calculation formulae
from the simplest Euler's method generally used
for the solution of an ordinary differential equa
tion of first order to the predictor-corrector type.
In the difference equation (16) derived here, H* in
the second term on the right-hand side is given as
a quantity successively changing. In Euler's
method, however, it is introduced throughout as a
constant in ~t. And then, in most other integral
formulae, ingenuity is exerted in the part corres
ponding to the second term on the right-hand side
of Eq. (16).
When the difference equation takes the form of
Eq. (11), G* gives the rate change for ~+l calcu
lated on the basis of ~, as follows:
exp(A*~t) (17)
Assume UN to be constant in ~t, then H* can be a
coefficient for UN as a result of the integral in
Eq. (8).
In Eq. (16) as rewritten, the essential charac
teristic of diagonal matrix G* which is determined
by the diagonalized modal matrix A* is expressed
in the second term on the right-hand side. Then,
H* determines the integrated value in relation
with ~ and also the increment of ~ in an infinite
simal time interval ~t. It is thus the most
important element.
H* gives directly the change of quantity XN in
relation with XN
• In this sense, it will be called
the "integral modal matrix," and the diagonal ele
ments of H*, hi' the "integral modal elements" or
"integral modal coefficients."
3. MODIFIED DIFFERENCE EQUATION
The numerical calculation is such that when the
state trajectory is described along a hyper-curved
surface in state space, the hyper-plane passing
through a given representative point at a point in
time is first determined. And then, the next
representative point is obtained in this hyper
plane. Therefore, in using Eq. (16) as the ori
ginal form for the successive calculation, when
the state trajectory spins starting at the origin
in state space, the error of calculation tends to
increase in a region of large value of the
varying elements ai in the diagonalized modal
matrix A*. In the following, a method will be
introduced to calculate the diagonalized modal
coefficient with high accuracy, in the whole
region of calculation even when the coefficient is
almost infinitely large.
3.1 APPLICATION OF A CORRECTOR-TYPE FORMULA
The difference equation (16) given initially can
be expressed in scalar form as
(xi)N+l = (xi)N + hi(xi)N
hi {exp(aillt) - l}/ai
i = 1,2, ... ,n
(18)
If the state space equation is linear and non
oscillatory, the solution can be very close to
the true value, with little error, owing to
successive calculations with Eq. (18). If then
the state space equation is oscillatory, the dis
crete error can be held to 0(lIt3).
For example, when the numerical solution is desired
to be within an overall averaged error in the ana
lysis of an oscillatory system, a corrector-type
formula such as Eq. (19) is applied only once.
n m
(xi)N+l = ~ aij(xi)N+l + ~ bik(uk)N+l j=l k=l
(xi)N+l = (xi)N + II; {(xi)N+(xi)N+l}
i = 1,2, ... ,n
3.2 INTRODUCTION OF THE APPROXIMATE CALCULATION FORMULAE FOR h.
1
(19)
If the system is stable and the calculated diago
nalized modal coefficients all take a negative
sign, the integral modal coefficient hi in Eq. (18)
is expressed as
hi' = {I - exp ( - '1. t. t) } I ai
Equation (20) can be transformed as follows:
(20)
69
(21)
When the value of t.t is assumed to be sufficiently
small, it is possible that p(aillt) "'t.t/2.
In this case, the portion in { } on the right-hand
side of Eq. (21) corresponds with the trapezoidal
formula. It can be considered, therefore, that the
scalar difference equation (21) is A-stable and at
the same time a formula by the implicit method has
the least error of discretization. In [41, several
refinements of (21) are given but are omitted here
because of space limitations.
4. PROCEDURE OF THE NUMERICAL CALCULATION
The numerical calculation formula obtained thus,
in advance solution process, can be termed an
implicit formula where the value at the next point
of time is expressed as an implicit function of
the already obtained value. The implicit formula,
in general, is considered to be advantageous in
stability and accuracy. It is necessary, however,
to solve a numerical equation at each step. If
the differential equation is nonlinear, the corres
ponding numerical equation also becomes nonlinear.
It is generally rare to solve a single differential
equation in an operation with a computer. Since
simultaneous differential equations are treated, the
numerical equations also take the simultaneous form.
In this case, to solve the numerical equations the
Gauss' or Crout's elimination method is used for a
linear problem; for a nonlinear problem, the
Newton's method is used.
In the present method of numerical calculation, the
concept of a diagonalized modal property is intro
duced to achieve simplification. The tremendous
amount of elimination for the numerical equations,
necessary in a single calculation of the simul
taneous ordinary differential equations, is thus
eliminated, so the treatment can be highly efficient
regradless of the linear or nonlinear problem.
Detailed computation steps are given in [41.
....
5. NUMERICAL SOLUTION OF THE GENERAL, SIMULTANEOUS ORDINARY DIFFERENTIAL EQUATIONS OF HIGHER ORDER
In a transient solution of the oscillatory system
equation, successive calculations are possible by
choosing the degree in the approximate formulae
according to the purpose. To make the numerical
solution applicable to higher order ordinary dif
ferential equations in a general for~ the numerical
calculation formulae introduced thus far will be
rearranged as follows:
When the system state space equation is linear
simultaneous with order n and there exist extremely
large differences among the n values of ai' for
the purpose of an optimal numerical calculation
the degree of ~ as approximation in the function
of (ai~t) should be higher. With the degree of
about 10, however, the computing time for exponen
tial function as approximation becomes nearly the
same as that for the exponential function as an
intrinsic function of the computer. In this case,
by utilizing the intrinsic function, a reasonable
value of exponential function can be calculated,
and the calculated value of hi can also approach
the theoretical one from the relation of this ex
ponential value and the approximate function ~.
6. NUMERICAL CALCULATION METHOD OF THE ORDINARY DIFFERENTIAL EQUATION WITH DEAD TIME
The numerical calculation method can be applied
easily to a given system which also includes dead
time besides being in a nonlinear form.
7. APPLICATIONS AND RESULTS
In performing the calculation in a transient ana
lysis, the usage of an explicit formula such as the
Runge-Kutta method is problematic. This is because,
due to the undominant eigen-value (for a large
absolute value), the time interval ~t in successive
calculation is limited, so that there occurs the
need for a tremendous number of steps to observe
the overall dynamic behavior of the system.
That the difference equation is A-stable means that
the solution always converges for any ~t regardless
of the eigen-value if only the sign is "minus."
The implicit formula has the advantage that its
calculation can be done easily with the ~t
determined in consideration of the dominant eigen
value even when there are extreme differences be
tween some eigen-values of the state space equatio~
Several examples are given in [4].
Whether it is nonlinear or time-varying, the state
space equation can be considered locally as a linear
state space equation with constant coefficients
when the variation is momentary in an infinitesimal
time interval. Then in order to advance one step
in the infinitesimal time interval ~t, it is pos
sible to apply a conventional, numeric calculation
formula.
To obtain a calculation approaching the rigorous
solution in the course of successive calculations,
an attempt was made to improve the numerical calcu
lation formulae. When the state space equation is
linear, the value of exp(A~t) plays an important
role in the rigorous solution, so the matrix calcu
lation is generally required. To eliminate the need
of matrix calculation for a linear equation or to
facilitate solution of the nonlinear state space
equation, the system state equation is treated as a
momentary state space equation in an extremely small
time interval. And further, by introducing the
concept of a momentary diagonalized modal matrix
into the momentary state space equation, the dif
ference equation necessary for the solution is given
in a simple form of a scalar equation.
As this results, it is understood that the numerical
calculation formula (the difference equation) becomes
one of the implicit methods which is A-stable with
small discretization error. Simplification of the
numerical calculation itself is thus possible by
switching over to the simple technique of calculating
the diagonalized modal coefficient from the complex
process rather than having to solve the numerical
equation in each step for an implicit formula.
And furthermore, in order to achieve the numerical
calculation formula suitable for the originally
oscillatory solution, the implicit formulae are
modified and expanded.
70
The programming of this numerical calculation method
is simple, and the method is designed for digital
simulation. The desired solution can be attained
by giving as the data the order of a state space
equation, the number of input variables, coef
ficients a ij and bik (time-varying, the form in
the case of nonlinearity), and the initial value
X(O+) of a state variable. The calculating con
ditions needed are the initial time, final time,
and time interval 6t.
As shown in [4], the concept of a diagonalized
modal property is incorporated into the group
of solutions based on difference, which is an
effective numerical solution of the ordinary
differential equation. Due to this introduction,
the procedure of calculations is more systematic
and purely numerical. And, formation and exten
sion of the basic numerical-calculation algorithm
should lead to the design of a numerical calcu
lation method with higher accuracy.
71
REFERENCES
1. Richard S. Varga, Matrix Iterative Analysis, Prentice-Hall, Inc., New Jersey, U.S.A., 1963.
2. Baxter F. Womack and Troy F. Henson, "Time Domain Desensitized Specific Optimal System Design," Texas Biannual of Electronics Research, No. 20, 1974.
3. Hayato Togawa, Numerical Calculation for Differential Equation (Finite Element Method and Difference Method), Ohm Inc., Tokyo, Japan, 1974.
4. B. F. Womack and S. Azuma, "A Numerical Calculation 11ethod for Simultaneous Ordinary Differential Equation of Higher Order by the Momentary Diagonalized Modal Property," Texas Biannual of Electronics Research, No. 23, May 1976.
Ii
II' I
ON STRUCTURALLY STABLE NONLINEAR
REGULATION \HTH STEP INPUTS
w. M. Nonharnl
Systems Control Group Dept. of Electrical Engineering
University of Toronto Toronto, Canada M5S lA4
ABSTRACT
The internal model principle (IMP) is developed for nonlinear regulators with step reference and disturbance signals. The regulator structure is derived from a requirement of structural stability and a corresponding transversality property.
1. 2
Regulation: a general framework
In this note a system is a pair of
form (X,~) where X is a real, finite-dimen
sional, COO differential manifold and
~:X + TX is a vector field on X. The no
tation follows Lang [1]. We call (X,~) the
regulator (system). Next we bring in an
auxiliary system (V,n) called the exosystem.
The exosystem is envisaged as a factor sy
stem of the regulator that drives it,
namely there is a submersion ~:X + V such
that the diagram below commutes.
X __ Z.~_~) TX
V __ ...:.n'--_-)o> TV
The regulator (X,~) is assumed to
1 This research was partially supported by The National Research Council of Canada, Grant No. A-7399.
2 For the system-theoretic background and motivation the reader is referred to [4] and [5].
72
be amenable to (V,n) in the following sense.
There exists a unique embedding v:V + X
such that
'lToV = id, (1)
Thus we have the diagram below. The amen
ability property can be thought of as evi
dence of willingness on the part of the
regulator to respond faithfully to the
exosystem: namely (V,v) is an 'induced'
submanifold of X to which the vector field
~ is tangent and on which the restriction
~lv(V) copies the action of n on V.
V n ) TV I
VI I : v*
id ~
~ '" X -4 TX id
~l !l
1 ~* V ~ TV",---
In the application to actual regulator
systems, v(V) would be required to have the
property of being a stable attractor ~or
the flow a of ~; namely with ~ complete,
v(V) is Liapunov stable and
a(t,x) ~ v(V) as t ~ 00 (2)
for all XEX. We do not explicitly need
such a property, although without it the
setup would probably be of little interest.
Let (K,K) be a closed submanifold3 of
X thought of as good, namely a desirable
place for the state x of the regulator to
reside. For instance, K(K)cX might be -1
given as 0 (zo) for some output map
o:X ~ Z, the point ZOEZ representing zero
error, i.e. ideal regulation. In view of
(2) the condition of output regulation
that we shall impose is
v(V)cK(K) • (3)
Thus, a(t,x) would tend to become good as
t~oo, this property being the geometric
counterpart of the usual control engineer
ing requirement of zero steady-state error.
Since (3) is true we know (by Warner [2],
Th. 1.32) that v factors smoothly through
K, namely there exists a morphism l:V ~ K,
in the present case necessarily an immer
sion, such that
Kol = V • (4)
So we have the diagram:
This diagram might Nell have ~erved
as our starting point, as n plays no fur
ther essential role. From the diagram
there results
(5)
We think of (5) as a version of the output
regulation condition.
Let us suppose now that TX is given
the structure
-3 All submanifolds are declared to be embedded.
73
(6)
where the subbundles Eo ~ X, Ec ~ X are
identified with an object to be controlled
and a controller respectively, and E ~ X e
is identified with the exosystem; in par-
ticular
(7)
Accordingly we have a unique decomposition
~ = (~o'~c'~e) := ~o e ~c e ~e (8)
and the relation non = n*o~e .
From now on ~c' ~e' nand K are as
sumed to be fixed; on the other hand, we
think of the section ~ :X ~ E as variable 00
00
in the space of smooth sections r (Eo)' on
the realistic assumption that our control
led object depends on physical parameters
that need not be susceptible to direct
measurement or control. Of course if ~o
is varied to ~o' ~say, then v will in
general vary to v, in accordance with (1).
Following [3] we adopt the Whitney COO
topology for spaces of morphisms of mani
folds. The output regulation condition
will be said to be structurally stable at
~ Eroo(E ) provided o 0
v = Kol , l:V ~ K
(~ ,~ ,~ ) with f: in some o ceo
(9)
for all f: nbhd of ~o. The general problem before us
is to explore the implications for the
control section ~ :X ~ E of the require-c c -ment of structural stability. Before pro-
ceeding it should be remarked that the
suitability of the topology chosen in our
definition is open to some doubt, as it is
not clear in general that interesting re
gulators exist that are structurally stable
in the stated, strong sense. However, in
the special case to be treated next this
problem does not arise.
I l'
2. Regulation: step inputs
We denote the zero section of a
subbundle E.7 X by s. and write Z = s. (E.).
By regulation under the action of step
inputs we mean the special case of the
foregoing setup obtained when
n (v) '" 0v E Tv (V)
for all VEV. Thus
n*~ (x) On (x)' XEX.
(10)
(11)
In this case our condition of structural
stability reduces to
~OKO~(V) '" O\)(v) , VEV, (12)
for all ~ = (~ ,~ ,~ ) with ~ in some nbhd o ceo
N of ~ . o 0 There follows, in the notation
of zero sections,
~OKO~{V)CZX , (13)
which yields
~ OKo~(V)CZ o 0
(l4a)
and
~ OKo~(V)eZ c c
(14b)
be
Let the dimensions of our manifolds
dim X n, dim V
dim K
m, dim Ec
n-q.
Also, we impose the harmless technical re
quirement4 that, when restricted to K(K),
~c is transverse to Zc:
~ OK iii Z c c (15)
Furthermore, by the results of ([3], Sec.
II. 4) we know that there exists in No a
section ~ such that ~ oK iii Z ; indeed, o 0 000
such sections are dense in r (E). It o follows that the inverse images of Zc' Zo
under ~coK,
folds of K.
~ oK respectively are submanio With ~c fixed we may even
4 The requirement is harmless as long as dim K + dim Zc ~ dim Ec, namely (n-q)+n ~ n+nc ' or q s n-nc = no+m ; but this will always be true in the application to regulators.
74
choose ~o such that
~ -1 -1 (~OOK) (Zo) ili (~coK) (Zc)' (16)
so the intersection is again a submanifold
of K. Then (14) yields
dim [(~ OK)-l(Z ) n (~ OK)-l{Z )] o 0 c c
~ dim ~(V)
dim V
m (17)
Write dim (~ OK)-l(Z ) '" n. In view of 000
(14) and the transversality relations,
there follows
n dim K - dim ~ OK (K) 0 0
dim K - dim K /I dim E 0
0 v (dim K - dim E 0)
0 v [n - q - (n - n - m)] c
0 v (nc
+ m - q) . (18)
Writing dim -1
also (~COK) (Zc) = r we
have, by transversality,
dim [( ~ 0 K) -1 (Z ) n (~ 0 K) -1 (Z )] o 0 c c
o v [no + r - (n - q)]
o v [0 v{nc + m - q) +
+ r - (n - q)]
and by (17) there follows
o v (nc + m - q) + r - n + q ~ m .
(19)
Now if nc+m-q s 0 this gives r ~ n-q+m,
which is impossible since (~ OK)-l{Z )eK. c c
Therefore nc+m-q > 0 and we get finally
r ~ n - nco (20)
Since r s dim K n-q this shows in addit-
ion that
n ~ q (21) c
Our results may be paraphrased by
saying that the control section reduces
to zero on some submanifold of K, say K , c
of dimension at least r ~ n-nc
. By (21)
j
at least q 'components' of control vanish
on Kc' and this fact has the engineeering
interpretation that the inputs to at least
q 'integrators' in the control loop reduce
to zero when the 'error' is zero. So our
result is precisely analogous to that for
the linear case, and globalizes the local
structure theory developed in [4]. Con
structing a tubular neighborhood of K(K ) c
in X, we may parametrize it as usual by
the fibers, and the n-r ~ q 'components'
of the fiber vectors provide a specific
realization of the required feedback vari
ables.
3. References
[1] S. Lang. Differential Manifolds. Addison-Wesley: Reading, Mass., 1972.
[2] F.W. Warner. Foundations of Differentiable Manifolds and Lie Groups. Scott, Foresman: Glenview, Illinois, 1971.
[3] M. Golubitsky, V. Guillemin. Stable Mappings and Their Singularities. Springer-Verlag; New York, 1973.
[4] B.A. Francis,W.M. Wonham. The internal model principle of control theory. Automatica 12(5), 1976, pp. 457-465.
[5] t"l.M. Wonham. Towards an abstract internal model principle. IEEE Trans. SMC ~(ll), 1976, pp. 735-740.
75
FREQUENCY DOMAIN STABILITY FOR A
CLASS OF PARTIAL DIFFERENTIAL EQUATIONS
D. Wexler Facultes Universitaires N.D. de la Paix,
Namur, Belgium
1. I NTRODUCTl ON
In recent years, an increasing interest has been taken in the system of integro-differential equations
( 1.1)
ddt T ( t, C;) = ddC; l? 1 ( c;) ddC; T ( t, E; J +
+ P2(E;) T(t,E;) + ¢(a(t» a(E;)
d Y2 at a(t) = f b(E;) T(t,E;) dE; Yl
for all tE]O,+""[ and almost all E;E)Yl'Y2[ ,
subject to boundary conditions
( 1.2) tE]O,+oo[,
and"to initial conditions
a(O) = ao (1. 3)
where the real constants OJ verify
the nonlinear function ¢: R + R is continuous with r ¢(r) > 0 for all r E R , r ~ 0 and a , b are elements in the space L2(Yl,Y2) of realvalued square-integrable on ]Yl'Y2[ (classes of) functions. Also conditions on 8. and on the
J
76
real functions PI, P2 were required in order to insure the associated to (1.1) , (1.2) SturmLiouville operator L ,
( d d 1.5) L x(C;) = ~ [PI (C;) ~ x(E;)] + P2(E;) x(E;)
to be self-adjoint and negative in L2(Yl'Y2) .
Systems of this type arise as dynamic models of one-dimensional continuous medium nuclear reactors and one is interested in the asymptotic behaviour of the solutions as t + +"" , mainly in L iapunov type stability. The above problem has been studied by means of Volterra integral equations, energy functions,transform methods, Galerkin methods, semi group theory; for these approaches and more comp 1 ete references, we refer the reader to [1, 2 chapter 4 and It) .
In this contribution, we transform (1.1) , (1.2) into an abstract problem which we discuss by means of Popov type frequency domain methodS. We consider the system of differentiel equations
(1.6 ) du at = A u + ¢(o) a do at = <b ,u>
where A is a linear (possibly unbounded) operator with domain D(A) and range R(A) in a real Hilbert space H , ¢: R + R is a given nonlinear locally Lipschitz function verifying r ¢(r) > 0 , for all r E R , r F 0, a, b are given elements in Hand <.,. > denotes the inner product in H. System (1.6) is viewed in the Hilbert space J( = H x lR. It will also be assumed that A generates a differentiable exponentially stable
Co-semi group on H .
System (1.6) may serve as an abstract version for a large class of significant partial differential equations, among which (1.1) , (1.2) is only a special example ( H = L2(Y1,Y2) , A = L). In our opinion, the advantage of this abst~act setting when discussing frequency domain stability for partial differential equations does not consist only in the generality we gain, but mainly in the fact that, in this way, we are somewhat guided by the methods and results which are now classical for the case when H is finite-dimensional (i.e. (1.6) is a system of ordinary differential equations).
The approach we choose here to study system (1.6) is an extension to the Hilbert setting of the approach used previously by Corduneanu for ordinary differential equations in finite-dimensional spaces 12, chapter 3]. In this approach, an essential point is the use of the frequency domain criteria for Volterra integral or integro-differential equations established in 12 chapter 3,6, 7]. The additional technical difficulties which arise in the infinite-dimensional case may be overcome for a large class of unbounded operators A including significant partial differential operators.
In section 2, we state our main results and make a few comments; proofs will apprear elsewhere. In section 3, we apply the abstract results to problem
(1.1) , (1.2).
2. MAIN RESULTS
In this section, we assume that A generates a Co-semi group S on H which satisfies the following conditions:
(2.1) S is differentiable (i.e. S(t) He D(A)
for all t > 0 )
and there exist M> 1 , a > 0 such that
I I -at (2.2) S(t) ~(H) ~ M e for all t ~ 0
where ~(H) denotes the Banach space of bounded 1 inear operators from H to H; for the theory of semi groups of linear operators, we refer the reader
It is usefull to consider also the complexification HC of H ; the elements of HC will be written as x + i y , X E H , Y E H and the inner product of HC will be denoted by <','>HC' We denote by I the identity operator on H ,by IC the identity operator on HC and by AC the linear operator
defined as
AC (x + i y) = A x + A y
with domain
D(AC) = D(A) + i D(A)
An important case when our assumptions on A hold is the case in which, for some a> 0 , the operator AC + a IC generates a bounded holomorphic semigroup (which is equivalent to saying that A is densely defined, there exists w > 0 such that the resolvent set P(AC) of AC contains the sector larg()..+w)1 < 2-131 +w and, for each £ E ]O,wl there exists ~ ~ 0 such that
c c -1 -1 I ().. I - A) It ( HC) ~ K£ I).. + a I .
for all )..EQ; with larg(Ha)1 ~ 2-1:n- + £ ) •
This condition is in turn satisfied when, for some a > 0 , the operator -(AC + a IC) is m-sectorial with vertex 0 15, pp. 490-491] (note that significant differential operators are m-sectorial 15, p. 280] ). The latter condition holds in particular when A is selfadjoint and there exists a> 0 such that A + a I is negative.
The function (u,o) from 10,+001 to jf is said to be a solution of (1.6) with initial data (uo,oo) E jf if it satisfies the following conditions:
(i) u is of class C1 on 10,+001, continuous at 0 , u(O) = Uo and
u(t) E D(A) , ¥t(t) = A u(t) + </>(o(t» a
for all t ~ 0
(ii) ° is of class C1 on 10,+001, 0(0) =00 and
~(t) = <b,u(t» for all t ~ 0
to 15, chapter IX and 8, chapter IX] . The zero solution of (1.6) is sais to be stable in
77
, !
the 1 arge if : (i) for each (uo,ao) EX, there exists a uni
que solution of (1.6) with initial data (uo,ao) ; and
(ii) there exists a continuous strictly increasing function IT from [O,+oo[ to [O,+oo[ with IT(O) = 0 such that, for any solution (u,a) with initial data (uo,ao) and any
r> OJ
I (uo,ao) Ix';;; r implies
I (u(t) ,a(t)) Ix';;; IT(r) for all t;;;. 0
The zero solution is said to be uniformly asymptotically stable in the large if it is stable in the large and if, for any bounded set S in X , the solution (u,a) with initial data (uo,ao) tends to 0 as t -->- + 00, uniformly with respect to (uo,ao) EtS .
We now state our main result.
Theorem 1. Assume that
and there exists q;;;. 0
(2.3) such that Re(l- isq)<b,(isl c - AC)-lA-la>Hc -
- q<b,A- 1a> .;;; 0 for all s;;;' 0 .
Then, the zero solution of (1.6) is uniformly asymptotically stable in the large.
Exponential estimates may be obtained by 1inearisation :
Corollary 1. Assume the conditions (2.3) hold and denote by l' ( . )( Uo ,a 0) the so 1 uti on of (1. 6) with initial data (Uo,ao) . Assume moreover that ¢ is differentiable at 0 with d ~(O)/d r> 0 . Then there exists S > 0 such that, for each bounded se t IS in}{, we may fi nd C;;;' 1 with
I T(t)(uo,oo)1 .;;; C e-St \(uo,ao)1 x x
for all (uo,ao) E <B and t;;;. 0
A question which seems of importance when (1.6) serves as a mathematical model of a physical system is the sensitivity of its stability properties with respect to small perturbations in parameters.
78
Theorem 2 below furnishes a partial answer to this question. For each y > 0 , denote by Gy the set of couples (a,b) E H x H satisfying
-1 <b,A a> > 0 and there exists q;;;. 0 (2.4)
such that Re(l - isq)<b,(isl c - Ac)-lA-1a>HC -
- q<b,A-1a> < y-1 forall s;;;' 0 ,
and denote by f'(" the class of locally Lipschitz real function ~ satisfying 0 < r ~(r) < y r2 , for all r E ~ , r f: 0 .
Theorem 2. Let y be a strictly positive number. Then Gy is open and, for each (a,b) E Gy , and each ~ E~y , the zero solution of (1.6) is uniformly asymptotically stable in the large.
Consider now the important case when A is selfadjoint, A + a I is negative for some a> 0 and A-I is compact (these conditions hold for some elliptic operators A). There exists then a Hilbert basis (en)nE IN of H such that, for each n E IN, en is an eigenvector of A and the sequence (>.'n)nE IN of associated eigenvalues is decreasing; moreover, Ao"": 0 and lim An = _00
[3]. Denote the Fourier coefficients of a and b with respect to en by an and bn respectively. Then the frequency domain conditions (2.3)
may be written r A-I an bn > 0 and there n=O n exists q;;;' 0 such that
n!O (1 +q An)(s +A~f1 an bn ;;;' 0 for all s;;;. 0 .
These conditions hold with q = \Ao\-l if, for all n EN, we have an bn .;;; 0 and there exists m ElN such that am bm < 0 .
To check the frequency domain conditions (2.3) requires the knowledge of the resolvent of AC
which, in the infinite-dimensional case, is far from being an easy matter. When A = L is defined by (1.5), it amounts to solving a Sturm-Liouville problem for an ordinary second order differential operator. That is why we state below some sufficient and easy to check conditions for (2.3) to hold.
Proposition 1. Assume that, for some a> 0 and
k < 0 • the operator _(Ac + a IC) is m-sectorial with vertex 0 and b = k a '1 0 . Then. conditions (2.3) hold.
It may be shown that the asymptotic stability result established previously in [41 for system (1.6) with selfadjoint operator A by using the theory of nonlinear semi groups and some Liapunov functions is closely related to a specialization of Corollary 1.
Finally. we note that frequency domain methods may be applied to discuss also some other control systems in infinite-dimensional spaceS. as. for instance. system (1.6) under the assumption that o is a simple. isolated eigenvalue of AC • which is also of interest in reactor dynamics.
3. APPLICATION TO PROBLEM (1.1) • (1.2)
We assume that the parameters of system (1.1) • (1.2) satisfy the following conditions: PI E
Cl([Yl'Y21) and PI> 0 on [Y1'Y21; P2 E C([Yl'Y2 1) and P2":;0 on [Yl'Y2]; 15 1 c2~0
and 15 3 154>0; either P2$0 .or 11511+11531>0. Put H = L2(Yl'Y2) and define L by (1.5) with domain D(L) consisting of functions x E C1([ Yl'Y2 1) such that d2 x / d E;2E L2(Yl'Y2) and
dx . dx v °1 x(Yl) +15 2 ~(Yl) = 0 , 15 3 x('("2) +15 4 ~(.2) = O.
It is well-known that, under the above assumptions. L is selfadjoint. L-1 is compact and L - AO I is negative. where AO < 0 is the first eigenva-1 ue of L [31 .
Clearly, system (2.1) with A = L may then be viewed as a1\ L2-version of problem (1.1) • (1.2) and. since L generates a Co-semi group which satisfies (2.1) and (2.2). we may apply to (2.1) the results obtained in section 2.
Theorems 1 and 2 and Corollary 1 furnish stability results for the L2- vers ion of problem (1.1) • (1.2) under the norm of L2(Y1'Y2) . Most of the previously established stability results for problem (1.1) • (1.2) are in terms of classical solutions and stronger norms. That is why it seems of interest to show that, in fact, the solutions of the L2-version verify (1.1) • (1.2) in some classical
79
sense and stability results under a stronger norm hold. Given (To.oo) in JC = L2(Yl'Y2) x IR ,we say that the couple of functions (T.o) :
-T: ]0.+oo[x[Yl'Y21+1R • 0: [O.+oo[+lR.
is a classical solution of (1.1) • (1.2) with initial data (To.oo) if the following conditions hold: TEC100.+oo[X[Y}'Y2]) and OECl([O.+oo[); for each t > 0 • one has a2 T ( t •. ) / a E;2 E
L2(Y1'Y2) ; (T.o) verifies (1.1) • (1.2) and (1.3).
We limit ourselves to state a "classical" version for Theorem 1, but such versions may be obtained also for Theorem 2 and Corollary 1.
Theorem 3. Assume conditions (2.3) with A = L hold. Let (To.oo) be an arbitrary element in JC
and denote by (u.o) the solution of the L2-version (1.6) with initial data (To.oo) . Then, (T.o) • where T(t.·) = u(t) • is the classical solution of problem (1.1) • (1.2) with initial data (To.oo). Moreover. for any bounded set ~ c JC. the solution (T ,0) of (1.1) • (1.2) ~Iith initial data (To.oo) satisfies :
do d2a ott) + 0 • Qf(t) + 0 • ~(t) + 0 , dt
sup I T(t.E;) I + O. sup I ~i(t,O I + 0 E; E [y l'Y 21 f,; E [ y l'Y 21
. sup 1~~(t.f,;)1 + 0 and <21 Ig(t.OI dE; + 0 E;E[Yl'Y2] af,;
as t + +00 • uniformly with respect to (To.oo) E6.
REFERENCES
[1] T.A. Bronikowski. J.E. Hall. J.A. Nohel, Quantitative Estimates for a Nonlinear System of Integrodifferential Equations Arising in Reactor Dynamics. SIAM J. Math. Anal., 3 (1972), pp. 567-588.
[ 2] C. Corduneanu, Integral Equations and Stability of Feedback Systems. Academic Press, 1973.
[ 3] R. Courant and D. Hilbert, Methods of Mathematical Physics, vol. 1. Interscience. 1953.
(4) E.F. Infante and J.A. Walker. On the Stability
, I
Properties of an Equation Arising in Reactor
Dynamics,J.Math.Anal. Appl. 55(1976),p~ 112-124.
[5) T. Kato, Perturbation Theory for Linear Operators, Springer Verlag, 1966.
[6) J.A. Nohel and D.F.. Shea, Frequency Domain Methods for Volterra Equations, Advances in Mathematics 22 (1976) 3, pp. 278-304.
[7) O. Staffans, Positive Definite Measures with Applications to a Volterra Equation, Trans. Amer. Math. Soc., 218 (1976), pp. 219-237.
[8) K. Yosida, Functional Analysis, Springer Verlag 1974.
aD
DESENSITIZING OBSERVER DESIGN
FOR
OPTIMAL FEEDBACK CONTROL
Manuchehr M. Missaghie Sentro1 Systems Ltd.
4401 Steeles Avenue West Downsview, Ontario
Canada
SUMMARY
This paper considers the problem of designing the observer in an observer based optimal feedback control scheme so as to offset the effect of plant parameter error. The optimal observer gain is determined as the solution of an equation which is formed by equating to zero the first variation of a linear combination of the usual performance index and the norms of the sensitivity matrices of that performance index with respect to the plant parameter matrices. This equation is solved iteratively by the method of successive approximation. The computations involved in each iteration are linear and require the solution of four matrix Lyapunov equations. The results are tested on a single input, single output second order system and a number of observations regarding the effect of the desensitizing observer on the feedback control system is noted.
81
GRASSMAN MANIFOLDS AND GLOBAL PROPERTIES OF THE RICCATI EQUATION
Clyde Martin* Ames Research Center (NASA) Moffett Field, California
Abstract
Riccati differential equations are studied as vector fields on a Grassman manifold. Such manifolds serve as a compactifi~ation for the Riccati equations.
1. INTRODUCTI ON
The Ricoati equations arise in many areas
of systems theory, the most familiar be
ing in the linear quadratic optimization
problem. This particular class of Riccati
equations has received a great deal of
study and there is little additional that
needs to be said for the purposes of op
timization. However, there are other
classes of Riccati differential equations
and other classes of algebraic Riccati
equations. The purpose of this paper is
to describe a framework which encompasses
all finite dimensional Riccati equations and gives a means of systematically study
ing them. The basic idea is to compactify
the domain of the Riccati equation in such a way that the operator becomes a COO vec
tor field and hence has an integral curve
for all t £ m. The resulting manifold is a Grassman manifold and as such has sev
eral useful representations. The properties of the manifold will be exploited to
obtain several properties of the Riccati
equation. A more detailed and more Lie
theoretic account will appear in a later
publication.
2. GRASSMAN MANIFOLDS
Let V be an n dimensional complex vector
space and let p be an integer less than or
equal to n. The set GP(V) will be the set
of all p-dimensional subspaces of V. The object of this section is to describe a
manifold topology for GP(V).
Let U and W be fixed subspaces of V with the following properties:
W £ GP(V), U £ Gn-p(V); (2.1) W 81 U = V. (2.2)
Let L(X,Y) denote the set of al1 linear maps from a vector space X to a vector
space Y. Now for A £ L(W,D) define UA £ GP(V) by
UA ='{w + Aw:A £ L(W,U), w £ W}.(2.3)
UA is clearly a wel1 defined element of
GP(V) and also note that UA = UB implies
*National Research Council Senior Research Associate
82
that A = B. Define E L(W,U)} (Z.4)
We see
r (U) =' {U A: A
that r(U) is in one to one corre-
spondence with L(W,U). Choosing a basis in Wand U we can then define a one to
one and onto function ~U from r(U) to a:(n-p)p by
(z. 5)
where A is the matrix representation of A E L(W,U). We can give r(U) the topology of a:(n-p)p. Now let S E GP(V) and let
Uo
E Gn-p(V) be such that S ti Uo = V and
W @ Uo = V. (The existence of such a Uo is an elementary exercise in linear algebra.) Let'{si} be a basis for S. There is a unique w. and u. such that s. = III
W. + u. with u. E U. Defin A E L(W U ) 1 1 1 0 ' 0
by Aw. = u .. It follows that S E r(U ) 1 1 0 •
Thus we have that GP (V) = U r (U) • (Z .6)
UEGn - p (V) It is actually easy to verify that GP(V) is the union of a finite set of r(U). With the topology induced by the r(U) GP(V) is called a Grassman manifold.
Thus we have that (r(U),~u) is a candidate for a chart and chart map for a differentiable structure for GP(V). The analyticity of the map
-1 ~u 0 ~u' (Z.7)
needs to be verified.
Let S E r(U') n r(U) and suppose that S = UA. We need to determine a T such that S = UTe Each element of u' £ U' can be written uniquely as u' = u + w with u E U and w E W. Define Al and AZ by Al(u') = u and AZ(u') = w. The uniqueness of the decomposition determines that Al E L(U',U) and AZ £ L(U' ,W). In fact it is easy to show that Al is invertible. (Show it is one to one and note that dimU = dimU'). Now we have that UA =' {w + Aw : w £ W, A £ L(W,U')}
='{w + AlA\\' + AZAw : w E IV, A£L(\.;,II')}
83
='{(I + AZA)w + AIAw : (I + AZA) E L(W,W)AIA E L(W,U)}
=' {w + AIA(I + AZA)-lw : AIA(I + AZA)-l
E L(w,u)} U 1 (Z.8)
AIA(I+AZA) -
Thus the map Z.7 is such that
~u 0 ~~~ (A) = AIA(I + AZA)-l
and is analytic on ~u' (r (U' )) n ~u (r (U)) . Thus we have an atlas on GP(V). That it is Hausdorf follows from the fact that given two elements of GP(V) there is a
chart r(U') that contains both.
That GP(V) is a compact manifold follows from the fact that it can be represented as a set of left cosets of the group of orthogonal complex matrices--a compact Lie group. The proof is elementary but slightly involved and can be found in [IJ.
3. RICCATI EQUATIONS AS VECTOR FIELDS ON GP(V)
Let G(n,a:) be the Lie group of complex invertible matrices. For v E GP(V) and for a E G~(n,a:) define an action of G~(n,a:) on GP(V) by a(V) =' {ax:x E v}-the natural action of G~(n,a:) on GP(V). We will consider this action in local coordinates on r(U). Partition a as
[::: :::] where all E L(W,W), a lZ £ L(U,W), a ZI £ L(W,U) and a ZZ E L(U,U). Then v E r(U) implies that v = U A'
UA ='{w + Aw:A £ L(W,U)}
and
aUA =' {allw + alZAw + aZlw + aZZAw A E L(W,U)}
='{(a ll + alZA)w + (a ZI + aZZA)w
(3.1)
A E L(W,U)} (3.Z)
If aUA £ r(U) then all + alZA is invertible and we have that
(3.3)
Thus a acts on the parameter space as-
a : A + (a Zl + azZA) (all + a A)-l lZ (3.4)
a generalized linear fractional transfor-mation.
Let aCt) be a one parameter subgroup of
G~(n,[) with infinitesimal generator B, da -1 ~ = B (3.5)
and assume B is a constant matrix. For UA € r(U) we have that a(t)(UA) = U
(aZl(t)+a2Z(t)A)(all(t) -1 +a lZ (t)A)
UX(t) (3.6)
We will show that X(t) is the solution of a Riccati differential equation. The
derivatives of a .. (t) can be obtained 1J
from 3.5 in terms of the partitioned B matrix. Calculating d at X(t) X' (t)
(aZl(t)+a2Z(t)A)(all(t) -1
+a1Z(t)A) - (aZl(t)+aZZ(t)A)
• (all(t)+alZA)-l(ailCt)+aiz(t)A) -1 (all Ct) +a lZ (t)A)
= (aZl(t)+azz(t)A)(all(t) -1 +alZ(t)A)
- X(t) (ail (t)+aiZCt)A) -1 (all(t)+alZ(t)A) •
Substituting the values obtained from 3.5 one obtains:
£t X(t) = BZlall+BZZaZl + (BZla lZ -1 + BZZaZZ)X(t)(all+alZA)
-X(t)Bllall + BlZa lZ -1 +(BllalZ+B1ZaZZ)X(t) (all+alZA)
BZl + BZZ X(t) - X(t)
• (Bll +B12 X(t)) (3.7)
Thus the action of a one parameter group~ locally represented by a Riccati differ
ential equation and each Riccati equation
84
corresponds to a vector field on GP(V).
For a slightly different development see [ZJ.
Consider some of the immediate conse
quences of this linearization. Let S be
an equilibrium point of the equation 3.5.
Then S is a fixed point of aCt) for all t,
i.e. a(t)S = S. It is easy to show that
a(t)S = S iff BS ~ S. This result is
essentially what Potter observed in [3J
o obtain his solution of the algebraic
Riccati equation except that we obtain
slightly more in that we have determined
all solutions of algebraic Riccati equa
tions, even those at "infinity." The
results of Section Z can be used to obtain
the results of [3J and the slightly more general results of [4J.
4. LINEARIZATION OF RICCATI DIFFERENTIAL EQUATIONS
Consider the Grassman manifold GP(V), the
one parameter group aCt) and its associ
ated vector field B. If a and B are con
sidered as matrices then their representa
tion is with respect to a given basis of
a given decomposition of V, i.e. V = W ~ U
and a basis is chosen as' {wI"" ,wp,ul
' .•• u }. Thus a change of representation n-p of a and B is equivalent to a change of charts in GP(V).
Let V = W' ~ U' be another representation of V and let y be the appropriate change
of basis. Then the change of representation of a is just the usual change of
-1 basis with a' = yay. As usual we have that
g, = -1 y Bya' (4. I)
and the vector field B changes as a change of basis. On the other hand, if U
A '{w+Aw:A € L(W,U)} then U
A =
U' -1 U'A' is the (YZl+YZZA) (Y1I+YlZA)
representation in terms of W' and U'.
Thus a change in charts is represented as
a similarity transformation in the global
representation of the vector field and as a linear fractional transforcation in the
local representation.
Let
and
Y = ~2l + ~22Y - Y(~ll + ~12Y) (4.3)
Define 4.Z to be equivalent to 4.3 iff -1
X = (YZl+YZZY)(Yll+YlZY) for some
y = (Yll Y12) £ GR.(V). YZI YZZ
That is, two Riccati equations are equi
valent iff ~heir solution curves are
related by a generalized linear frac
tional transformation. We have immed
iately the following theorem.
Theorem 4.1. Every Riccati equation is
equivalent to a linear differential equa
tion.
Proof: There exists a similarity trans
formation that transforms B to its Jordan form. The resulting Riccati equation is
[:' :,] where the representation is with respect to the positive and negative eigenspaces
of 4.4. We can assume that BZ is stable
and Bl is unstable. Then letting W be the positive eigenspace and U the negative
eigenspace we hdve that u(t)UA = U and hence as t goes to
expBZ(t)Aexp-B1(t) infinity a(t)UA approaches Wand so W is the unique stable equilibrium point in the
chart r(U). Thus all of the other equili
brium points of 4.4 lie in the complement
of a canonical chart. It is possible to show that this set is an algebraic subset
of GP(V) and further is an invariant sub
manifold of GP(V) for 4.4. It can be further decomposed to obtain a nested set
of invariant submanifolds of decreasing
dimension. It appears that certain re
sults of [5J on the partial ordering of solutions are related to these invariant
submanifolds. A detailed study of this is beyond the scope of this paper.
REFERENCES
linear. 1. Griffiths, P. and Adams, J., Topics
Note, though, that not every Riccati equa
tion is equivalent to a homo1eneous lin-o C °d B 0 0] B 0 ear equatIon. onsl er = 1 O. IS
not similar to a diagonal matrix.
Consider the familiar LQG problem and
assume that it is properly posed. Let P = -Q - A'P - PA + PBR-IB'P (4.3)
be ~ Riccati equation with Ham-
iltonian vector field
B = fA -Q
(4.4)
Since the LQG problem was assumed to be well posed we have that B is equivalent
to a matrix
85
in Algebraic and Analytic Geometry, Princeton University Press, Princeton,
New Jersey, 1974. Z. Hermann, R., Interdisciplinary Mathe
matics, Vol. III, 1973, Math Sci
Press, Brookline, MA. 3; Potter, J. E., Matrix Quadratic Solu
tions, SIAM J. Appl. Math., Vol. 14,
pp. 496-501, 1966. 4. Martensson, K., On the Matrix Riccati
Equation, Inform. Sci., Vol. 3,
pp. 17 - 49, 1971. 5. Willems, J. C., Least Squares Station
ary Optimal Control and the Algebraic
Riccati Equation, IEEE Trans. Aut.
Con. 16 (1971), 621-634.
• GENERALIZED OPERATOR AND OPTIMAL CotlTFClL
Salah M. Yousif California State University, Sacramento
ABSTRACT
This paper presents the optimal control problem in terms of linear operator theoretic foundations. The optimal control that minimizes a specified obj ecti ve functional is derived utili zing the concept of generali zed inverse of linear operator. The operators are assumed to be defined on Hilbert spaces. The representation of the control and the operator inverse in fini te dimensional minimal energy control problem is treated, and examp les for thi s case wi 11 be gi ven .
I. INTRODUCTIQ~
In many control problems, large number of systems can
be represented as operators on vector spaces. ~Iore
precisely gi\"en, a linear dynamic system
~lt) = :\(t1x(t) + B(t)u(t) (1)
the zero state response of this system is given by t
xlt) I ¢l(t, s)B(s)u(s)ds (Z) o
Idlere ~(t, s) the transition matrix. A general class
of control problems may be described by determining the
control \"ector u(t) that steers the state from the
origin at time to 0 to final state x(t f ) at the final
time t f and minimizes the performance index J(u) given
by
., IZ J(u) = Ilull~ + Ilx(t f ) - xl (3)
I,here x is some specified state in the state space.
The norms given in (3) are the usual Eucleadean norms.
If I,e let the space of the controls to be Hilbert space
HI and the other space another Hilbert space HZ' then
equation (2) represents a linear operator from HI into
H ~ and can be wri t ten in the form
(Lu) (t) = x(t) ( 4)
iie Id 11 assume L to be bounded linear operator from HI
into H2• The perfol:lllance index J (u) becomes
J(u) = lIulJZ + IILu - xli Z (5)
where the norms Ilull and IILu - xii are defined res
pecti vely in terms of the inner products in HI and HZ'
If I,e require HI to be LZ[O, I], then the inner product
in this case is defined by
T * (xl' xZ) = IXI (t) Xz (t) dt (6)
o
The control function u(t) may be assumed to lie in a
convex set in HI' This .condition may be necessary for
existence of the optimal control. The class of control
86
problems that fits the above description includes mini
mum energy control, regulator problems and tracking
control systems. For the last two problems, J(u) may
assume the well known quadratic performance index given
by
J(u) = lIull~ + IILu - x,,~ (6)
where R is positive definite, and Q is at least positive
semidefinite. Similar problems were treated in the
literature. Balakrishnan [3] obtained a solution of
similar problem by generating sequence of control
function un that converges to the optimal control of
minimal norm. He also computed the optimal control
function using steepest descent method. Porter treated
a general problem in linear system optimization" des
cribed by
x = Lu + TZ ( 7)
where L: Hl~ HZ and T: HZ~ H3 and Land T are bounded
linear transformations between Hilbert spaces [12].
The performance index J(u) is given by
( 8)
and the problem is to find for a fixed element, Xl in
HZ a control Uo in HI such that J(u) is minimum. This
control Uo was given by
u = o -(I + L*L)-lL*TX
1
where L* denotes the adjoint of A.
(9)
In fact, the closed
form solution obtained in (9) is similar to one obtained
by Balakrishnan in terms of convergent sequence of
control functions in HI' Although these solutions are
of interest to us, our main interest is to treat ~pe
cial problems in this general form and their represen
tation in finite dimensional spaces. Therefore, we
will proceed in section two to treat and solve these
problems; and in section three, we will present solu
tion for the finite dimensional discrete minimal energy
control problem. In our approach, the pseudoinverse
of linear operator plays a principal role in the solu
tions and synthesis of the optimal control.
II. GENERALIZED CONTROL PROBLEM
The optimal control problem described previously can
be modified to the following generalized form. We
are given a control system as in equation (4), that
is,
tLu)(t) = x(t) (4)
It is required to find the control uO(t) of minimum
norm which lies in a convex set in Hilbert space HI
that steers the system (4) from the origin to a state
which lies as close as possible to a specified fixed
state Xl in Hilbert space H2 . We will assume L here
to be bounded linear operator from HI into H2 with
close range R(L). This formulation of the control
problem in hand becomes similar to the linear least
square problem in Hilbert space. The solution of the
above problem that yields the optimal control uO(t)
is given by the following theorem.
Theorem 2.1 Let L be as described above. The optimal
control uO(t) of minimum norm that minimizes
is given by
uO
(t)
where L+ is the pseudoinverse operator of L.
(10)
(11)
Proof. The proof we will provide here is similar to
the proof of theorem 7.4.2 in reference [6]. We let
Xl = Y + z where ye:R(L) and y e:R(L) . Since Uo
=
+ Luo= + +. + h L Xl' then LL Xl = LL y, Slnce LL = PR(L)w ere
PR(A) is the projection of H2 on R(A). Therefore,
IILuo - XIII = Ily - XIII ·llzll·
Let ul be any element in HI' Lule:R(L). Thence,
IILUI - xll12 = IILw - Yl12 + Ilzl12~IILuo - xll12 (12)
which proves that uO(t) minimizes I ILu - XII 12.
To prove uO(t) has minimum norm, we let u ' be any
solution that minimizes I ILu - XII 12. That is, u ' is
least square solution. Then, L*Lu ' = L*x l and
u2 = u ' - UO is in the null space, N(L*L) of L*L. +
Since N(L*L) = N(L) and L xle:R(L*) N(A). Therefore,
IIu l l12 = II u211
2 + Iluol12~lluol12 (13)
Consequently, uO(t) has mlnlmum norm, which means in
engineering terms: the optimal control uO(t) that
steers the system to a state closest, in the least
square sense, to the target state Xl it steers it
with minimum energy. Since all finite dimensional
spaces are complete and therefore Hilbert spaces,
every matrix is a bounded linear operator with closed
range. Therefore, the above generalized control pro
blem in finite dimensional spaces becomes more tangible
for computation since the pseudoinverse of a matrix is
computable by a variety of computation methods [2].
In the finite dimensional case, L becomes an mxm matrix
from En into Em' the finite dimensional Eucleadean
spaces of dimensions n and m respectively. Consequently,
theorem 2.1 will have the following equivalent version
in finite dimensional spaces.
Theorem 2.2. Let L be an nxm continuous matrix from
,Em to En as described by (4). The minimum norm uO that
minimizes (10) is given by
87
o + u = L Xl (14)
where L+ is the pseudoinverse matrix of L.
Proof. The proof of this theorem is similar to that
of theorem 2.1 and is provided in reference [IS].
The above theorem, as it will be seen in section three,
will be used to derive the optimal minimum energy
control of linear finite dimensional time invariant
control system.
The least square version of theorem ~.2 is the follo
wing theorem [2].
Theorem 2. 3. Let L be an nxm continuous matrix from
Em into En described by (4).
which
The vector uOe:Rm for
and
are minimi zed, is given by
o u = GXl
where G satisfies
LGL = L, (QLG) T = QLG, (RGL) T = RGL
(15)
(18)
Proof. The proof of this theorem is proyided in refe
rence [2]. The basic ingredients for the proof of
theorem 2.3 are the following trans formations which
yield the equivalent unweighted least squares pr~blem
given by theorem 2.2.
The above transformations imply
II Lu - XIII ~ = II Lu - xll12
and
(20)
(21)
In writing (19), we use the fact that every positive
definite matrix 0 has a unique positive definite square
root matrix 01
/2 and the inverse of 0 1/ 2 is denoted by
D- 1I2•
III. MINIMUM ENERGY CONTROL
In this section. we solve the problem of minimum energy
control of finite-dimensional. time-invariant discrete
control systems. The method of solution will depend
mainly on the material provided in section two. The
discrete control system will be described by
x(k + 1) = Ax(k) + Bu(k). k = 0, 1. 2 •••• (22)
where x (k) e:En and u(k) e:Ern and A and B are matrices of
appropriate dimensions. For this problem. we will not
assume that the system is completely controllable.
although we assume that the uncontrollable states are
stable. These assumptions and the nature of the solu
tion make the approach for obtaining the control func
tion less restrictive and more general. It is desired
to find a control sequence {u(k)}N-l of minimum energy o
that steers the system (22) to a state as close as
possible to the origin, in finite time N, starting
from initial state x(O). That is, N-l .
o = x(N) = ANx(O) + L A{N-I-lJBu(i) (23) i=o
or
x(O) N-l .
- L A-1-IBu(i) i=o
u{O)
u(1)
u(N-l)
where the submatrices ~ are given by
~ = A-k-lB, k = 0, 1, ••• , N-l.
Equation (24) may be written in the compact form.
LU = xeO)
L -{LO' Ll ••••• ~-l]
is the n x (mxn) controllability matrix
UI = [u(O), u(l), ••• , u(N_I)]T
(24)
(25)
(26)
(27)
(28)
is an Nm-row vector. In general. we will not assume
the rank of L to equal n. Theorem 2.2 yields the
unique optimal control Uo. That is.
UO
= L+ x(O} (29)
UO is the control vector in ENxm that yields the
closest vector in the range of L to x(O). That is.
UO minimizes \\Lu - x(O)1 \2 over all Ue:ENxm
' Moreover,
88
UO
is of minimum Eucleadean norm. The pseudoinverse
matrix L+ may be computed by many of the efficient
computational methods that exist to compute the pseudo
inverse [2. 7]. The optimal control UO is still an
open loop control. This control can be synthesized by
the following straightforward method. It is evident
that the minimum energy control that steers the state
x(k) to the origin in N steps. using 28, is given by
uO{k)
uO(k+l)
uO(N+k-l)
(30)
Equation (30) implies that the control UO(k) is given by
uO
(k) = F x(k) (31)
where the matrix F is the first (mxn) submatrix of the
(Nxm)xn matrix L+. That is. F consists exactly of the
m rows of the matrix L+ That is.
o ] L+ m-n (32)
where 1m is the mxm unity matrix and 0m_n is the mx
(m-n) zero matrix. It is clear that F is to be com
puted only once in each control action. and the minimum
energy feedback control is given by (31). Since the
synthesis procedure depends on the computation of the
pseudoinverse L+ of L. the following iterative proce
dure can be used to compute L+. The procedure consists of the fOllowing steps {7].
(i) Let i k denote the kth column of L and let Lk
denote the submatrix consisting of the first columns of L. That is,
(33)
k = 1. 2 •.••• M ; M = number of columns of L.
(ii) Compute the following two vectors
(34)
we have
o otherwise (35)
(iii) Compute the row vector hk
(36)
(i v)
(v)
+ Compute the matrix Lk by
c; [::-1 -,>hk 1
Finally,
+ + L = LM
(37)
(38)
The above procedure is easy to implement since it does
not require any matrix inversion.
IV. CONCLUSION
The optimal minimum energy control problem was solved
in terms of an operator and its pseudoinverse in
Hilbert spaces. The finite dimensional version was
solved in terms of the pseudoinverse of matrix in the
discrete control systems case. A procedure based on
computing the matrix pseudoinverse was used to synthe
size the optimal control that led to feedback optimal
minimum energy control.
REFERENCES
1. Albert, Arthur. Regression on the Moore-Penrose Pseudo Inverse, Academic Press, New York, 1972.
2. Ben-Israel, Adi and Thomas N. E. Greville. Generalized Inverses: Theory and Applications, John Wiley, New York, 1974.
3. Balakrishnan, A. V., "An operator theoretic formulation of a class of control problems and a steepest descent method of solution," J. SIAM, Control, Ser. A, Vol. 1, No.2, 1963.
4. Balakrishnan, A. V., "Linear systems with infinite dimensional state space," Symp. On System Theory Proc., Polytechnic Institute of Brooklyn, 1965.
5. Balakrishnan, A. V., "Optimal Control Problems in Banach Spaces," J. SIAM, Control, Ser. A, Vol. 3, No.1, 1965.
6. Blum, E. K. Numerical Analysis and Computation, Addison Wesley, New York, 1972.
7. Bollion, L. T. and L. P. Odell. Generalized Inverse Matrices, John Wiley-Interscience, New York, 1971.
8. Desoer, C. A. Notes for a Second Course on Linear Systems, Van Nostrand Reinhold, 1970.
9. Goldstein, A. A •. Constructive Real Analysis, Harper and Row, New York, 1967.
10. Kalman, R. E., Y. C. Ho and K. S. Narendra, "Controllability of Linear Dynamic Systems," Contribution to Differential Equations, Vol. 1, No.2, John Wiley, New York, 1961.
11. Luenberger, D. G. "A Generalized Maximum Principle," Recent Advances in Optimization Techniques, 1968.
12. Porter, W. A., "A basic optimization problem in linear systems," Math. System Theory, pp. 20-44, 1971.
89
13.
14.
15.
Porter, W. A. Modern Foundation of System Engineering, Macmillan, New York, 1966.
Porter, W. A. and J. P. Williams, "Extension of the Minimum Effort Control Problem," J. Math. Anal. Appl., 13, 1966.
Yousif, S. M., "Generalized Minimum Energy Control of Discrete System," Eighth Asi lomar Conference Proc. on Circuits, Systems and Computers, 1974.
16. Yousif, S. M., "On approximate controllability of linear dynamic systems," 18th Midwest Symposium on Circuits and Systems Proc., 1975.
17. Zadeh, L. A. and C. A. Desoer, Linear System Theory, McGraw-Hill, New York, 1963.
I, Iii
ABSOLUTE INVARIANT COMPENSATORS : CONCEPTS, PROPERTIES
AND APPLICATIONS
Romano M. DeSantis Ecole Poly technique de Montreal
University· of Montreal Montreal, Canada
ABSTRACT
The available theory concerning the concepts of absolute invariant compensators is extended by studying the structural properties of these compensators with respect to stability, feedback and comparative sensitivity. It is shown that the interconnections between these properties and the Cruz-Perkins comparative sensitivity matrix provides a natural vehicle to extend the application of the classical Nyquist plot approach to the design of multivariable feedback compensators. The engineering significance of the development is illustrated via the design of a multivariable frequency-voltage compensa-tor for an interconnected power system. .
STABILITY TESTS FOR ONE, TWO AND MULTIDIMENSIONAL LINEAR SYSTEMS
E. I. Jury Department of Electrical Engineering and Computer Sciences
and the Electronics Research Laboratory University of California, Berkeley, California 94720
Abstract
This talk will review analytical stability tests for one-dimensional linear systems since the early tests of E. J. Routh in his famous Adams Prize Essay of 1877. The historical background of Routh's stability test and criterion, as well as Fuller's conjecture on its simplification, will be mentioned. In this historical review, the works of Hermite, Sylvester, Maxwell and others as related to the stability problem will also be discussed. This review will provide the context for a discussion of recent stability tests obtained for two dimensional and multidimensional linear systems. These tests will be described and their computational complexity will be discussed in detail. In addition, the applications of stability testing to the study of two and multidimensional digital filters, numerical analysis of stiff-differential equations, realization of mixed lumped and distributed parameter systems, and the design of output feedback systems will be briefly mentioned. Comments on future research in this area will conclude the lecture.
Research sponsored by the .National Science Foundation Grant ENG76-2l8l6.
91
A DARLINGTON REALIZATION THEORY
OF OPTIMAL LINEAR PREDICTORS.
P. Dewilde T. H. Delft
The Netherlands
Abstract
A systematic junction of Darlington Synthesis theory, optimal linear prediction and stochastic modelling is discussed in this paper. Key elements in the theory are J-unitary coprime factorization, and the theory of orthogonal polynomials on the unit circle.
1. INTRODUCTION
In recent years, the similarity between time invariant prediction and transmission line theory both leading to Sturm-Liouville type problems, has been discussed by a number of authors (1,2,3). Also, the relation between Wiener-Hopf type equations and scattering problems has been observed by a number of authors (for a survey, see 1). It is clear that (non-uniform) transmission line theory is but a special case of the more general Chandrasekhar-Sobolev theory for non-uniform scattering and transmission in an optical medium. On the other hand, (non-uniform) transmission line synthesis can be viewed as a special case of Darlington synthesis wherby the transmission eigenstructure is located at infinity in the complex plane. The Darlington synthesis is, in another direction than the Chandrasekhar theory, more general also and has a definite bearing on prediction or modelling theory. The simplest instance of this circle of arguments is the predictive time discrete filtering using orthogonal polynomials -or an equivalent Levinson type algorithm (4,5,2). This case also is but a special case of Darlington synthesis, and we will start with this fact in the next paragraph. In the third paragraph we shall show how the Darlington synthesis generalizes the Levinson algorithm and produces a novel identification algorithm. In the fourth paragraph, we shall introduce the time-continuous case and show its analogy with the previous discussion.
2. THE LEVINSON ALGORITHM, ORI'HOGONAL POLYNOMIALS
AND DARLINGTON SYNTHESIS.
The relationship between orthogonal polynomials on the unit circle and Darlington synthesis is well know (see e.g. 7). Also, the relation between the Levinson algorithm and orthogonal polynomials
92
has been discussed in the literature (1). In this paragraph, we shall show in what manner the Levinson algorithm and the theory of orthogonal polynomials appear as a special case of Darlington synthesis. The general Darlington synthesis will be used in the next paragraph to obtain a generalization. Let x(k) be a stationary Gaussian time discrete (k = 0,1, ... ) stochastic process with zero mean and c~ance function r (k), whereb~ r (k) = Ex(n)x(n-k) for all suitable n. Let x(n) be the optimal least square linear prediction of x(n) in function of th~ data x(k) (k = O,l, .• ,n-l). Writing E(n) x(n) - x(n)
= - ~ ank x(n-k) k=l
for the nth step innovation, and
4> (z) n
where
EE (n)E\ri1
n z + n n-k I: a kZ
k=l n (1)
(2)
we have the following well-known and easy to derive facts : (1) the 4>k form a set of orthogonal polynomials on the unit circle with respect to the spectral measure dF derived from the covariance sequence r(k) by Bochner's theorem. (2) They obey a two-tier recursion formula, similar to the Levinson algorithm, and given by (5)
whereby
* 4>k(z) (4)
and £
~+1 (5)
00 -k (3) defining Zl (z) = reO) + 2 E r(k)z analytic
k=l outside the unit circle, and Wk(z) PZ 1 (z)~k(z)
where the operator P denotes projection on the analytic part inside the unit cir~le, we have that Z is approximated by W (z)/~ (z) in a weak sense.
1 k k The adjoint polynomials Wk satisfy a recurs~on similar to (3) but with uk replaced by - uk·
(4) ~ (z) and W (z) are polynomial with zeros insid~ the unitkcircle, and satisfying an "energy conservation relationshipll(supposing reO) = 1 and ~*(z) = (~(lrz»-):
~kWk71- + Wk~k* = 2
It follows that Z2 l/~k satisfies
1 Z2* Z2 = 2" (Zl* + Zl)
(6)
(7)
which is also the relation between the input impedance and the transfer impedance of a lossless network terminated in unit resistors and pictured in fig. 1.
fig.l Network interpretation of eq. (7).
It should be remarked, as pointed out in (5), that jjl jjl
Z2~ (e )Z2(e )d e is a weak approximation of dF,
at least if certain Szeg6 type regularity conditions are satisfied.
Proposition 1 The recursion for 1> k+l and W k+l can ·be written in the following form :
z
where
and z
1(1-1 uk
+ 1 12) -uk+1 z
(9)
is a J-unitary passive chain scattering matrix (6) of the first degree with transmission zero at infinity (a Darlington section). Writing the recursion out one obtains :
.'nl -h: ::1- li k=n
o k
(10)
A lossless circuit realization for this is given in fig. 2 :
~B---=Gl=~ o 01 b !L n ___ . ~
93
fig. 2 Circuit realization of the product (10).
whereby
(n)
o [::1 ( III
A "running" innovations filter is then oJjtained by taking a
2=b
2=1 so that, in this case, fig 3 produ
ces a "wfiitening" filter :
fig. 3 : The "running" whitening filter.
If, on the other hand, a modelling filter is desired, one can put a
1=1, and in that case the
filter of fig.3 with a 2=b2 will produce the transfer 1/~ and act as an approximate modelling filter. This fi£ter, connected as in fig. 4, has output impedanRe Zo = Wn/~n' and inverse transfer impedance Zit = z N n ,
fig. 4 : The modelling filter.
and all its transmission zeros are at zero.
4. SYNTHESIS PROCEDURES
Possibly the easiest way to convert (10) into a synthesis procedure, is the following:
Let us temporarily assume that the model is autoregressive, i.e. that there is an n such that the
spectral density is given by : s(eie ) = 1/1~ (eiS
) 12 n
-1 and thus S(z) exists and S(z) = ~n(z)~n~z). From (10) we have
En An] = [1 oJ G(n) (12)
where G(n) is polynomial, J-unitary and passive. (12) can also be written as
~n E+zo 1-Z0J = G o ] G(n) (13)
It follows that e(n) can be obtained by extraction of the pole at infinity from ~n E+zo 1-Z0] at
the right. We proceed to show (1) that this extraction procedure is independent from ~n in such a way
that e.g.
zn E+zo (14)
would produce the same J-factor to the right, and (2) that the procedure is essentially unique and leads to the factored form (10). Indeed, the right hand side of (13) is easily seen to be a coprime factorization of the left hand side with factors analytic outside the unit circle. This fac!~rization is known to be uni~uT except for a z -unimodular factor. Since G n is J-unitary, this unimodular factor can only be constant Junitary. Moreover the same is true for (14) . This completes the proof. It should be noted that (13) can be performed on the time series Zo
immediately without conversion to the frequency domain. The procedure is then not only a network synthesis bUt also an identification i.e. it identifies the system from its time series.
Suppose next that it is not known whether the system producing Zo is indeed autoregressive. The Levinson procedure (or the equivalent procedure using orthogonal polynomials) then produces a filter with all its transmission zeros at infinity which gives (1) the innovations at each stage and (2) - when n is sufficiently large - an autoregressive approximation to the modelling filter (5). Both can be obtained in exactly the same way as described previously : again Z is known, and a factorization of (14) with SUf~iciently large n will produce the predictive filter. More generally, in the multivariable case, we have that a facto-rization of
u(n) 11+zo 1-Z
0] Gll 6 1J G(n) (15)
where u(n) is a (judiciously chosen) unitary matrix
94
with poles at infinity of degree n and ern) J-unitary and passive, will produce a multi variable version of the innovations filter.
The previous procedure can now be generalized to generate innovations filters and modelling filters which are not restricted in their transmission zero . location. E.g. suppose that such a filter is desired with n transmission zeros at infinity and m at a given point a with lal>l, then a factorization of :
n m r,. z (z-a) [.+ZO 6 ] e(n+m)
12 (16)
. (n+m) . w~ll produce the answer where G ~s chosen as a J-unitary passive matrix having apole of order n at infinity and a pole of order m at a, and the factorization (16) is required to be coprime. 6
12 can be so determined that
-n -m. z (z-a) 6 12 (17)
where 6'12 side the
is suitably bounded and analytiC outunit circle.
More generally, (15) can be used with any u(n) unitary and having singularities only outside the unit circle. (17) then generalizes to :
(18)
where 6'12 is analytic outside the unit circle.
Again, the factorization (15) can be performed on the time series and does not require any Fourier transformation. In this case, time invariant linear combinations of data in the input stream will be estimated and the "innovations" obtained will be orthogonal on such(cynbinations. The choice of unitary matrices U n is n~t)arbitrary. In fact, one must require that U n .. 0 in some weak sense. This is the case e.g. for zn due to the Lebesgue lemma. From a numerical point of view, the modelling and innova~iyns filter will be "best" if the zeros of U n are chosen as close as possible to' the zeros of the (unknown) system which generates the covariance. It is known that the identification problem of a covariance is generally ill-conditione~.)Introducing the information contained in U n adds information on the probable transmission properties of the system and makes the problemt~ell-posed. In the Levinson filter of the n order, only the n last data are stored, and all other past information is lost. In the generalized Levinson filter obtained by factorization of (15) time-invariant linear combinations of past data are stored. These combinations may be much more efficient for predictive purposes - especially if the filter generating the data has finite zeros.
5. THE CONTINUOUS-TIME ANALOG
Let x(t) be a time-invariant Gaussian stochastic process with covariance function r(t) and spectral measure given by (Bochner's theorem) :
oCt) + K(t)
In this case the innovation-(8) is given by
E: (t) x(t) - It h(t,T) X(T) dT o
where h(t,T) obeys the Wiener-Hopf equation
h(t,T) + Ig h(t,ul K(u-T)du = K(t-T)
(19)
(20)
(21 )
The Fourier transform (with respect to T) of 6(t-T) h(t,T):
P(t,w) -iwt = e (22)
obeys the following set of differential equations (2) :
a at P(t,w) -iwP(t,w) P 1Ir (t,w)h(t,O) (23a)
() at plt- (t,w) = -P(t,w)h(t,O) (23b)
where P -IE (t,w) = e -iwt P"("t";Wj (24)
Eqs. (23) are analogous to eq. (3). They describe a non-reciprocal transmission line with J-skew transition matrix
. ~iW A(t) =
:h(t,O) (25)
and gen~rate a transfer matrix 0 t given by a product integral :
o t
t
I o
(26)
and analogous to (10). The transfer function (22) so obtained has a finite impulse response just as in the previous paragraph. The 0 theory there is equally valid here, except that a ne w type of section - consisting of transmission line structure - has to be introduced. Again, a coprime factorization of type (15) provides for alternative approximations and a generalization of the method.
6. BIBLIOGRAPHY
1. T. Kailath, "A view of Three Decades of Linear Filtering theory", IEEE Trans. on Info. Theory, vol. IT-20, No.2, pp. 145-181, March 1974.
2. T. Kailath, A. Vieira, M. Morf, "Inverses of Toeplitz Operators, Innovations, and orthogonal polynomials", Informations System Lab. Stanford University.
95 , ' . ..1
3. M.G. Krein, "The Continuous analog of Theorems on Polynomials Orthogonal on the Unit Circle", Dokl. Akad. Nauk. SSSR, Vol. 104, pp. 637-640, 1955.
4. N. Levinson, "The Wiener RMS (Root Mean Square) Error Criterium in Filter Design and Prediction", Jour. of Math. and Phys., Vol. XXV, No.4, Jan.1947 pp. 261-278.
5. L.Ya.Geronimus, Orthogonal Polynomials, Transl. Consultants Bureau, New York 1961.
6. P. Dewilde, "Input-Output Description of Roomy Systems", SIAM J. on Control and Optimization, Vol. 14, No.4, July 1976.
7. J.D. Rhodes, P.C. Marston, D.C. Youla, "Explicit Solution for the Synthesis of Two-Variable Transmission Line Networks", IEEE Trans. on Circuit Theory, Vol. CT-20, No.5, Sept. 1973.
8. T. Kailath, "An Innovations Approach to LeastSquare Estimation - Part I : Linear Filtering in Additive White Noise", IEEE Trans. on Antom. Con. Vol. AC-13, No.6, Dec. 1968.
BIOGRAPHY
Patrick Dewilde was born in Korbeek-Lo, Belgium, on Jan. 17th, 1943. He received the engineer degree from the University of Louvain, Belgium, in 1966, and the Ph.D. degree from Stanford University, in 1970. He has held research and teaching positions with the University of Louvain, the University of California at Berkeley, and the University of Lagos, Nigeria. He is presently professor of Network Theory at the T.H. Delft, the Netherlands. His main interests are in applied algebra (systems an network synthesis, factorization properties for matrix functions, scatterin,g matrix theory, numerical analysis and fast algorithms) and in teaching Network Theory.
OPERATOR THEORY TECHNIQUES FOR FINITE DIMENSIONAL PROBLEMS
J. W. Helton Dept. of Mathematics
Univ. of Calif. at San Diego San Diego, Calif. 92110
ABSTRACT
Problems with matrix valued rational functions can become complicated very quickly (once one starts chasing various sub-spaces corresponding two different orders of vanshing). Operator Theory provides techniques for handling such problems which are conceptual but which still have considerable power. This talk describes how one can use operator theory on the problem of broad band impedance matching. The (rather powerful) commutant lifting theorem allows one to obtain some results for input where none were previously available. Furthermore, it helps to complete the classical theory for one ports.
96
THE RELATION BETWEEN NETWORK THEORY,
VECTOR CALCULUS, AND THEORETICAL PHYSICS
Franklin H. Branin, Jr. System Communications Division Laboratory
IBM Corporation Kingston, N.Y.
Abstract
This tutorial paper is a brief review of previously published work by Roth, Branin, and Tonti relating to the underlying nature of network analogies and their use in engineering and physics. Roth's algebraic topological characterization of the l-network (linear graph) problem laid the foundation for Branin's treatment of the 3-network problem and its relation to the vector calculus and Maxwell's equations. Tonti has extended these ideas to encompass a significant portion of theoretical physics.
1. SUMMARY lem -- as well as many related network prob-
The purpose of this tutorial paper is to lems -- involves a l-network, or linear
point out the intimate relation that exists graph, consisting of a set of intercon
between network theory, the vector calculus, nected O-dimensional objects [points, nodes,
and much of theoretical physics -- in short vertices, O-cells] and l-dimensional ob-
to show the mathematical basis for network
analogies and the implications thereof. The paper summarizes contributions made by Roth (1) , Branin(2,6) , and Tonti. (3,4,5,7)
jects [lines, branches, edges, or l-cells].
However, many physical systems require the
use of surface and volume elements [2-cells
and 3-cells] as well as points and lines to Most familiarly, network analogies have characterize them properly. Without a clear
evolved from a comparison of the basic equa- understanding of the related 3-network prob-
tions of some physical phenomenon with the
equations describing an electrical network.
This heuristic approach has been fruitful,
but it has failed to identify the underly
ing justification for many of these anal
ogies. The reason for this failure is the
fact that the electrical network problem is
of lower dimension than many of the physi
cal phenomena for which network analogies
have been sought.
Specifically, the electrical network prob-
lem, then, the full implications of the net
work analogies for such systems cannot be
appreciated. Indeed, some of the inherent
richness of the mathematical theory that
describes these phenomena is missed.
In a fundamental contribution to network theory, Roth(l) rigorously characterized
the l-network problem from a completely
abstract algebraic topological point of
view, devoid of any necessary physical
interpretation. In so doing, he laid the
97
foundation for understanding fully just
what conditions are necessary for the ex
istence of any particular physical example
of a I-network problem.
It is well known that the I-network or lin
ear graph can have "through" and "across"
variables associated with its nodes, its
branches, and its loops. As a consequence
of interconnecting the branches, these
through and across variables are subjected
to topological constraints that force the
through variables to sum to zero at the
nodes and the across variables to sum to
zero around the loops of the I-network.
Topological matrices (A, the branch-node
incidence matrix and C, the branch-loop
or circuit matrix) are used to express the
topological constraints. In addition, the
constitutive relations between the through
and across variables in each branch can
In this diagram, the through variables ap
pear in the uppermost sequence of transform
ations, the mesh (loop) variables i' and
branch variables i being responses. The
variables I associated with the branches
are quite arbitrary and correspond in an
electrical network to independent current
sources. The across variables, appearing
in the lower transformation sequence, in
clude the node variables e' and branch vari
ables e, both of which are responses; the
branch variables E are again arbitrary and
correspond electrically to independent volt
age sources.
The topological constraints on the through
variables comprise the equations
and/or i = C i' (1)
and
(2)
be expressed in matrix form. The topologi- while those on the across variables are
cal constraints and constitutive relations
together completely describe the I-network
problem from an engineering or physical
point of view.
and
and/or e = A e' (3)
(4) Roth's abstract view of this problem is more The constitutive relations involve the
penetrating and is summarized by the trans- composite through and across variables as-
formation diagram of Fig. 1. socia ted with the branches, namely
c "ill Zlly
------ 0 At r:l ---•• L!J
[rJ .... -----Ct I:I ... -_A_- GJ o •• -----
(mesh) ( branch) (node)
Fig. 1 Roth's Transformatjon Diagram for the I-Network Problem
E+e = Z(I+i) and/or I+i = Y(E+e) (5)
(In Eqs. (1), (3), and (5), the "and/or"
signifies that the two alternative expres
sions are mathematically equivalent.)
Rather than considering Eqs. (1) to (4) in
terms of topological matrix operations,
Roth regards them in terms of boundary and
coboundary operations since these have much
deeper significance topologically. For the
same reason, he treats the Z and Y matrices
as de~ining isomorphisms rather than simple
matrix operations.
There is much more substance in the detail
of Roth's paper than we can discuss here, but the main implication of his work is the fact that the through and across variables in a I-network problem can belong to any
98
vector space. Accordingly, any physical and "across" are no longer appropriate for
phenomenon in which through and across vari- describing variables associated with 2- and
abIes exist, obey the constitutive and topo- 3-cells, the terms 0-, 1-, 2-, 3-chains and
logical constraint laws, and belong to some 0-, 1-, 2-, 3-cochains will be used instead.
vector space, can be treated as an example
of a 1-network problem. In particular, this includes structural analysis(8)where the
force and displacement variables are 6-vec
tors and mechanical systems where they are
3-vectors. In the spirit of Roth's approach, Kron(9)
derived a transformation diagram purporting
to explain Maxwell's equations. However,
the present author found Kron's results to
be unacceptable and sought an alternative
In the chain sequence, appearing at the top
of Fig. 2, the symbols Ml and M2 represent
responses while the symbols P1 and P2 stand
for independently assignable variables. In
the cochain sequence, Ml and M2 represent
responses while pI and p2 are arbitrary.
Moreover, the symbols COl' C12 , and C23 stand for "connection matrices" (incidence
matrices) between the 0- and I-cells, the
1- and 2-cells, and the 2- and 3-cells. Al
ternatively, these matrices may be regarded explanation. By extending Roth's ideas, he as boundary operators and their transposes
developed an analogous transformation dia- as coboundary operators. (Incidentally,
gram that describes the algebraic structure C~l is identical with the A matrix of Fig. 1.).
of the 3-network problem, as shown in Fig.2. Finally, in place of the single Z/Y isomorph-
3- chains 2-chains .i-chains O-chains
Chain
~ C23
~ Sequence • • 0
P2 C12
~ • • 0
t 11 t I S211 T 2
Pi C 01 B (~3S2 C23' ( C23 S2 C23 '- •
511 f T 1 (COIT1 colf11 (CoiT 1 C~11 Cochain ~ ..
~ Sequence t
0 .. C23
~ ..
C;2 C61 E] 0 M1 .. .. 3-cochains 2- cochains 1. - cochains O-cochoins
Fig. 2 ism of the I-network problem, there are ~
in the 3-network problem, namely the SllTl Here, the underlying topological structure isomorphism between the I-chains and l-co
is a set of interconnected 0-, 1-, 2-, and chains and the S2/T2 isomorphism between
3-cells [points, lines, surface, and volume the 2-chains and 2-cochains.
Branin's Transformation Diagram for the 3-Network Problem
elements] having two sets of variables assoc-Now the 3-network problem definable on this iated with them, just as in the I-network algebraic structure has the very peculiar
case. However, since the terms "through" feature that if the matrices (boundary and
99
'L lilll --"I
coboundary operators) COl' C12 ' and C23 and
the isomorphisms SllTl and S2/T2 are given, only Pl and p2 need be specified in order
1 for all the responses -- AND P and P2 --
to be determined. The original motivation for defining this
3-network problem, of course, was to find
an explanation for Maxwell's equations -
which involve the vector calculus opera
tors grad, curl, and div. These operators
turn out to be the exact counterparts in
the continuum of the coboundary operators t t d c t . th h . COl' C12 ' an 23 ~n e coc a~n sequence
of Fig. 2. However, there is no counter
part of the chain sequence in vector cal
culus. What is more, a full analogy be
tween the structure of Fig. 2 and that of
the vector calculus requires two cochain
sequences.
In order to satisfy this requirement, the
concept of a dual 3-network was introduced.
Here, the 0-, 1-, 2-, and 3-cells of the
primal 3-network can be made isomorphic to
the 3-, 2-, 1-, and O-cells of the dual 3-
network. As a consequence, the primal
chain sequence in Fig. 2 can be replaced
by the (isomorphic) dual cochain sequence,
as shown in Fig. 3.
Dual O-cochoins l-cochoins
Cochoin C23 •
The 3-network problem associated with this
new transformation diagram has exactly the
same structure as before -- except that the
dual cochain sequence symbols (superscrip
ted and underscored) replace the primal
chain sequence symbols (subscripted and not
underscored. )
At this stage, the state of affairs is pre
cisely equivalent to that of the earliest
stage in the classical derivation of the
vector calculus operators grad, curl, and
div. In other words, both in the classical
treatment and in our 3-network, we have
established algebraic relations between
certain quantities associated with discrete
points in space, finite line segments, fin
ite surface elements, and finite volume ele
ments. Moreover, these algebraic relations
reflect the topological connectedness of
these 0-, 1-, 2-, and 3-cells in exactly
the same way in the classical treatment
and in our more formal 3-network approach.
If, now, we let these finite 1-, 2-, and
3-cells approach the limit of zero size,
we find -- not surprisingly, in view of
the results of this limiting process in the
classical derivation -- that the coboundary
operators in both cochain sequences of Fig. 3
2-cochoins
• 0 ca ~ Sequence Cl2
3-cochoins
~ el .. • 0
t H t )-1 COl ra Dual
(C23S2C23) (CZ3SZC23 S2HT2 p2 • 3-Complex
St 1fT! Primal
~ ~ t -11 r t Cochain • (C01TIC01) (COITI C01) Sequence t
C23 0 • •
~ 3-cochoins t C!2 t B Mt.
cO! Primal 0 • • 3-Complex Z-cochains
l-cochoins O-cochains Fig. 3 Transformation Diagram for the
Primal-Dual 3-Network Problem
100
become identical to the vector calculus
operators. In particular, C~l becomes equal to -grad, ci2 becomes equal to ~, and C~3 becomes equal to div. The net result, then, is the transformation diagram of Fig. 4 which describes the vector cal
culus relations in the continuum.
0- cochains
Dual Cochain Sequence
9 -CJrad
(- div 5 CJrad) (-div s CJrad)-l
Primal Cochain Sequence
3- cochains
dlv 4
1- cochains
u
s
w
2-cochains
Fig. 4 Transformation Diagram for the vector Calculus
curl
curl 4
ELECTROMAGNETIC FIELD
dual dual l-cochoins dual
On the basis of these vector calculus relationships, the corresponding transformation diagram for Maxwell's equations can be
derived as show in Fig. 5. Here it is worth remarking that the dimensionalities of all the physical variables are fully
consistent with their topological roles.
2 - cochains
J
z y
v 4
1- cochains
3- cochains
div p Dual
3-complex
(-div Y CJrad)-l (-div Y CJrod)
- CJrad Primal 3-complex
0- cochains
O-cochoins r--, -grad
GJ 2-cochains
1 1 I 1 ----- ..... ------ ..... L_..J curl
• 1+ I I
-I' ~, II II I I +1
r--, I 1 1 I L_..J
o
r-' 1 I I I 1 I
I-al 4 .. ----
+-------div ..
primal curl
3- cochains o .... 1------primal 2-cochoins
Fig. 5 Transformation Diagram for Maxwell's Equations
0 dual 3-cochoins
ill -----.. 0
div ---0 11 ~+C :t
~ . -,,,' primal l-cochains
101
primal O-cochains
This diagram highlights the fact that two
different types of vector are involved,
namely line-density vectors such as E and
H, associated with I-cells, and surface
density vectors such as B and I, associ
ated with 2-cel1s. Clearly, it is topo
logically possible to take the curl of
the first and the divergence of the second
type of vector, but not vice versa. This
point is apparently not well recognized.
The missing variable associated with the
primal 3-cells in Fig. 5 would be magnetic
charge density. But if the diagram is ro
tated 1800 about its center, the corres
ponding relations would, in effect, de
scribe a fully consistent "magnetoelectric"
field. Page and Adams (10) point out that
this is quite possible, both mathematically
and physically.
Tonti(3,4,5,7)has greatly extended and re-
fined the ideas embodied in the 3-network.
He has treated in some detail the connect
~vity properties of 0-, 1-, 2-, and 3-cells
2 • REFERENCES
1. J. P. Roth, "An Application of Algebra
ic Topology to Numerical Analysis: On the
Existence of a SOlution to the Network Prob
lem", Proc. Nat'l. Acad. Sci., vol. 41, pp.
599-600, 1955.
2. F. H. Branin, Jr., "The Algebraic~Topo
logical Basis for Network Analogies and the
Vector Calculus", Proc. Symp. on Generalized
Networks, vol. 16, Microwave Res. Inst. Symp.
Ser., Polytechnic Inst. of Brooklyn, pp. 453-
491, 1966.
3. E. Tonti, "On the Mathematical Structure
of a Large Class of Physical Theories",
Rend. Acc. Lincei, vol. 52, pp. 48-56, 1972.
4. E. Tonti, "A Mathematical Model for
Physical Theories", Rend. Acc. Lincei, vol.
52, pp. 175-181 (Part I)~ pp. 350-356 (Part
II), 1972.
5. E. Tonti, "On the Formal Structure of
Physical Theories", Quad. dei Gruppi di
Ricerca Matematica, Istituto di Matematica
del Politechnico Milano, 1975.
and shown numerous examples of how the asso- 6. F. H. Branin, Jr., "The Network Concept
ciated variables relate to physical quanti- as a Unifying Principle in Engineering and
ties. He has introduced concepts from the
mu1tivector calculus and tensor calculus,
from the Grassman and Clifford algebras,
and from the theory of external different-
the Physical Sciences", in Problem Analysis
in Science and Engineering, F. H. Branin
and K. Huseyin, Eds., New York: Academic
Press, to be published.
ial forms to supplement the algebraic-topo- 7. E. Tonti, "The Reason of the Analogies
logical principles already involved. Indeed,in Physics", in Problem Analysis in Science
he has laid a very broad foundation for a and Engineering, F. H. Branin and K. Huseyin,
comprehensive approach to theoretical phys- Eds., New York: Academic Press, to be pub
ics involving 1-, 2-, and 3-network models. lished.
A partial (but representative) list of the 8. S. J. Fenves and F. H. Branin, Jr., "A
systems treated by Tonti in this way is as Network-Topological Formulation of Struc-
follows: classical and relativistic part- tural Analysis", Journ. Struct. Div., Amer.
icle dynamics, vibrations of strings and Soc. Civil Engr., vol. 89, pp. 483-514,1963.
rods, acoustics and fluid dynamics, thermo- 9. G. Kron, Diakoptics: The Piecewise Solu-
statics and irreversible thermodynamics, tion of Large-Scale Systems, London: Macdon-
geometrical optics, the Klein-Gordon and aId, 1963.
Schroedinger equations, classical and rela- 10. L. Page and N. I. Adams, Electrodynam-
tivistic gravitational theory, the wave ics, pp. 210-211, New York: Van Nostrand,
equation, and the Dirac and Proca equations. 1940.
Tonti's work is indeed a remarkable and
comprehensive contribution.
102
DETACHED COEFFICIENTS REPRESENTATION AND DEGREE FUNCTOR
OF A POL YNOUIAL MATRIX \lITH APPLICATION TO LINEAR SYSTEMS
Y • S. Ho P • H • Roe Department of Systems Design
University of IJaterloo Waterloo, Ontario, Canada
Abstract
The zeros of a polynomial matrix are classified as explicit infinite zeros, implicit infinite zeros and zeros in the finite complex plane. Using detached coefficients method and an operator called the degree functor of the matrix, explicit formulae to calculate the multiplicity of the explicit infinite zero and the number of finite zero in the complex plane are derived.
1. INTRODUCTION number of the explicit zeros and the number of
In the analysis of linear systems, we encounter very zeros in the finite complex plane are derived.
often computation in the polynomial ring F[x] where These lead to the determination of relative prime-
F is either the field of real numbers R or complex ness and number of common zeros of two polynomial
numbers C, and x is a formal indeterminate [1,2].
An important question has been asked [3]: Is it
possible to bypass the machinery of polynomial al
gebra and relate everything to standard matrix com
putation, such as the determination of rank? This
question is of some interest from the viewpoint of
pure mathematics, since it concerns the representa
tion of polynomial algebra via matrices in the sense
analogous to group representations [17]. Even more
interesting perhaps are the implications on numeri
cal analysis and computing art in general, since
the computations in F[x] are rather awkward. In
this paper, we attempt to answer partially the
above mentioned questions.
The zeros of a rectangular polynomial matrix of full
rank are classified as explicit infinite zeros, im
plicit infinite zeros and zeros in the finite com
plex plane. Using the "detached coefficients
method" and an operator called the "degree functor"
of the matrix, explicit formulae to calculate the
matrices of different sizes. The results are used
to express unifying the conditions of existence,
controllability, observability and invertibility of
a linear time-invariant dynamical system in terms
of the number of zeros in the finite complex plane
of appropriate polynomial matrices derived from the
equations representing the system.
2. MATHEMATICAL PRELIMINARIES
We consider first two polynomials in R[s]:
laCs) ,b(s)] [a ,b ]sp + [a l,b l]sP-l + p p p- p-
+ [ao,bo] (2.1)
where ap
I 0, bp
= bp
_l = ... = bq+l = 0, bq I 0,
and p ~ q. We called p the degree of the matrix
[a(s),b(s)] and denoted it as d[a(s),b(s)].
The resultant matrix R[(a) ,(b) ] of [a(s),b(s)] is p p
defined as follows:
103
+-p columns -"I+- P columns ....
R{ (a) , (b) ] a b p p 0 0
a1 b
l
O a b 0 2 P 0 0 rows a a
1 bl
bo 0
-1 a b p P
a p-l a b b P P p-l
a 0 q b P P
The classical resultant matrix R[(a) ,(b) ] can be q p
obtained by deleting the first (p-q) columns and
the last (p-q) rows of R[(a) ,(b) J. The subre-p p
sultant matrix R[ (a) j' (b) .] of [a (s), b (s)] is p- P-]
defined as the matrix obtained from R[(a) ,(b) J by P p
deleting the first and last j columns, and the first
and last j rows. The subresultant matrix
R[(a) j,(b) j] can be similarly defined. q- p-
Lemma 2.1 [6]: The greatest common divisor of
(a(s),b(s)], denoted as G.C.D[a(s),b(s»), has
degree greater than zero if and only if the result
ant R{(a) ,(b) } ~ det R[(a) ,(b) 1 is zero, q P q P
Lemma 2.2 [6,7,8]: Degree of G,C.D[a(s),b(s)]
d[C.C.D[a(s).b(s)l}, is j if and only if
R{(a) ,(b) }=R{{a) l,(b) l} q p q- p-
and
R{(a) j,(b) j} f O. q- p-
In order to facilitate and motivate the discussions
in this paper, we relabel the columns of R[(a) , p
(b) ] and denote the reSUlting matrix as: p
Rix2
(a,b) R;X2(a,h} Rlx~(a,b) p-
R1x2(a b) = ao b 0 0 p , 0
a l hI a b 0 0
a b 0 0
a h a p-l b
p-l P P
0 0 a b p p
a b p P
where the lx2 denotes the order of the matrix laCs) ,b(s) J.
(2.2)
With the definition of Rlx2 (a,b). we can generate p
b:2 Rj (a,b) as follows:
(a) If j<P. R~X2(a,b) is defined as the matrix ob
tained by deleting the last (p-j) rows and the'.last lx2 2(p-j) columns vf R (a,b). p
lx2 (b) If j>p, R (a,b) is defined as the matrix obp tained by bordering (j-p) rows to the bottom and
2(j-p) columns to the right of R1x2(a,b) in the p
lx2 lx2 same fashion as R2 (a, b) is obtained from 1): (a, b).
Definition 2.1: Let A be an nxm matrix over R, the
Taking into account a + 0, we have
p ~[AJ respectively. We call n-rank[A]. the nullity Theorem 2.1: d[C.C.D[a(s),b(s)]] = j if and only if of the row rank of A and denote it as ~[AJr'
rank and ullity of A are denoted as rank[A] and
R{{a) ,(b) } = R{(a) l,(b) I} = .•• p p p- p-
and
R{(a) j,(b) .}fO. p- P-J
Theorem 2.2 (22):
then for j>l
104
r. lx2 1 ~LRi (a,b)j'
r
ThElorem 2.3 [22]: d[G.C.D.[a(s),b(s)]] = j, if and
only if there exists an i, such that with
T][R~X2(a,b)t ~ 1 x d[a(s),b(s)]
r, lx2( J [ lx2 ] T]~i+l a,b) = 11 Ri (a.b) = j r r
(2.3)
3. POLYNOMIAL MATRICES
Definition 3.1: Let pes) be an nxm matrix over
R[s] and let
_ p p-l pes) - P s + P IS + ••• + PIs + P (3.1)
p p- 0
where the coefficient matrices Pi : i=O.1,2, ••• ,p
are nxm matrices over R. pes) is assum~d to be of
full rank and n:::m unless otherwise stated.
(1) P is called the leading coefficient matrix p
of pes);
(2) pes) is of degree p, denoted as d[P(s)] = p,
if its leading coefficient matrix P is not a p
(3)
(4)
zero matrix;
pes) is called proper, if its leading coeffi-
cient matrix P p is of full rank;
The rank r of pes) is equal to the dimension
of the largest minor of pes) which is not a
zero polynomial. If r<n. pes) is called
identically singular; otherwise. it is
called full or pes) has full rank;
(5) Any matrix formed by the coefficient matrices
of pes) is called an associated matrix of
pes) ;
(6) pes) is called non-singular if and only if
pes) is full and m=n.
Let A(s) be an nxn non-singular matrix over R[s].
If A(s) is proper. det[A(s)] is a polynomial over
R of degree nxd[A(s)]. It has nxd[A(s)] zeros
(counting multiplicities) over C. If A(s) is im
proper, d[detA(s)] is less than nxd[A(s). In fact
d[detA(s)] is the number of zero of de~(s~ in the
finite complex plane : C'''''.
Definition 3.2: (a) A number AEC ''''' is said to
a zero of pes) if and only if
rank[P(A)] over C < rank[P(s)] over R[s].
(b) Let 6r
(s) be the G.C.D. of all minors of order
r of pes). 6 (s) is called the characteristic n
polynomial of pes).
Definition 3.2 leads directly to
Theorem 3.1: AEC' "" is a zero of P (s) if and only
if A is a zero of 6n
(s).
Definition 3.3: Let K = max{ d [ai (s)] I ai (s) is a
minor of order n of pes) where i=1.2 •••.• C:}.*
(a) n",,(P) ~ n x d[P(s)] - d[6n (s)] is called the
multiplicity of the infinite zero of pes) in
C; (b) n~(P) ~ n x d[P(s)] - K is called the multi
plicity of the explicit infinite zero of pes)
in C; (c) ni(p) ~ K - d[6 (s)] is called the multiplicity
"" n of the implicit infinite zero of pes) in C.
Theorem 3.2 [22]: If pes) is improper. there exists
a non-singular polynomial matrix T(s), which is the' ** product of elementary row transformation matrices.
such that T (s) x P (s) ~ p* (s) is proper and
* d[P(s)] = d[P (s)] if and only if pes) is of full
rank.
Corollary 3.1: det T(s) = as~
where a is real and ~ = n:(P) •
Remark 3.1: The explicit infinite zero of pes) has
* been normalized to zero via T(s). That is P (s) e will contain additional n",,(P) number of zeros at
the origin of C besides the zeros of pes) in C'''''.
D fi i i 3 4 L Mnxm(p) Hnxm(p) and Rnxm(p) e n t on .: et p+j , p+j j be associated with pes) as follows:
* m Cn
is the combination of m objects taken n at a time.
** We mean the usual constant'elementary row transformations together with Tii(s): whose off-diagonal elements are zero and whose diagonal elements are unity except for the ith. which is equal to s.
105
Mnxm(p) = p+j P
P P p-l PI Po ° I .
° P Pz PI P I p 0 I
I P Ip P
PI p-l 0
11 =
o
where ~xm(P) is of order (ixn) x [(p+i)xmJ,
H~xm(P) is of order (ixn) x (ixm) and it consists of
the first (ixm) columns of M~xm(P), and R~xm(P) is
of order [(p+j)xn] x (jxm). The ~xm(P), H~xm(P) nxm
and Ri (P) are defined only for i>O.
Theorem 3.3 [ZZJ: If pes) is of full rank, then
the associ~ted matrices M~xm(P) : i=l,2, •.. of pes)
are of full rank.
Theorem 3.4 [22J: PCs) is of full rank if and only
if there exists an i>O, such that with nxm J 11 rank[Ho {P} = 0
rank [H~:(P) J = rank [H~(P)] + n ••• (3.2)
Corollary 3.2:
(a) If p is the first index such that (3.2) is
satisfied, then
rank[H~(p)] rank[H~xm(P)J + nj; j>l
(b) D[T(s)] = p
(c) T][H~xm(p)l = T][H~(P)J= d[det T(s)]
(d) K = d[l1n (s)] + n:(P) = n[dIP(s)]-p)
+ rank [H~xm(P) J . m Definition 3.5: Let Qi : 1.2, •.•• r=Cn
be the sub-
matrices of order nxn of PCs) and let
aiCs) = det Qi(s). Let R~xe(A). ~xt(A) and
Hkxt (A) be associated with an kxt (k ~ t) matrix i
106
A(s} over R[sl as were defined in Definition 3.4. kxe kxt
Rl (A) and MI (A) are called the detached coeffi-
cient representations of A(s). An operator 6
called the degree functor of pes) is defined with
the following propert.ies:
if and only if there exists a 0 which is the smal
lest index that exists, such that with
(3.3)
where A(s) and B(s} are respectively nxm and nxe
matrices over R[sl.
Theorem 3.5:
Proof: This is all but obvious if we note that the nxn nxn nxn( )] matrix [Rl (Ql).Rl (Q2}' .••• Rl Q
r and the
matrix [R~xm(P)] will give the same 0 and same j in
(3.3).
Theorems (Z.3).(3.5) and Definition 3.5 lead direct
ly to:
Theorem 3.6: With [a(s).b(s)] as in (2.1). we have
(a) ':{RiXl(a)] d[a(s)]
(b) 6[RiXZ
(a,b)J 6[RixlCa).RiXI(b)]
[Theorem 3.51
deC.C.D. of [a(s) ,b(s) 1]
[Theorem 2.3]
The importance of Theorem (3.6) lies in the fact lxZ that when 0 is operated on R1 (a.b) of [a(s),b(s)],
it turns out to be the degree of the G.C.D. of
laCs) ,b(s)].
Theorem 3.7: Let a(s) [al (s),a2(s), ••.• a
r(s)],
where a i : i=1.2 •••• ,4 can be any r polynomials.
Then
d[G.C.D.[a(s)]] = O[Rixr(a)].
Proof: We prove the theorem by induction on r.
That the theorem is true for r = 1 follows from
Part (a) of Theorem (3.6). Suppose the theorem is
true for n<r. and let
then
0.4)
Consider
[Theorem 3.6(b)]
(3.4)
[Theorem 3.5]
[Theorem 3.5]
The induction is completed.
Theorem 3.8: If pes) is square; Le •• n=m. then
(3.5)
Proof: (a) If pes) is proper. Definition 3.5 shows
that 6[Rnxn (P)] = n x d[P(s)] and Part (c) of 1 e
Corollary (3.2) shows that n~(P) = o. (b) If pes) is improper. Theorem (3.4) shows that
fJ[R~xn(P)]r=n x d[P(s)]. and Parts (c) and Cd) of
Corollary (3.2) show that
d[det pes)] = n x d[P(s)] - fJ[H~xn(P)] e
n x d[P(s)] - n~(P)
Theorem (3.5) follows by invoking Definition 3.5.
Theorem 3.9: If pes) is an nxm matrix over R[5]
of full rank and n<m. then
= d [G.C.D. of [al (s) .a2 (s) •••• a/s) J [Definition 3.5]
6 [Rixr (a) ] [Theorem 3.7]
r, lxl lxl lxl 1 6LRl (al).Rl (a2).···.Rl (ar )
[Theorem 3.5]
r, nxn nxn nxn 1 6 LRI (Ql) .Rl (Q2)'··· .Rl (Qr)
_ min{n!(Qi) I i=1,2, ••• r} [Theorem 3.8]
[Theorem 3.5]
Corollary 3.3: Let pes) = [A(s) ,B(s). where A(s) and
B(s) are matrices over R[s] of orders nxm and nxt
respectively. Then the following propositions are
equivalent.
(a) A(s) and B(s) are relatively (left) prime
(b) 6 [R~X(mxt) (p) ] = n! (p)
(c) Rank[P(s)] = n for all s in C"~.
Definition 3.6: Let pes) = [A(s).B(s)], where A(s)
and B(s) are of orders nxm and nxt respectively and
rank[P(s)] =n.
(a) 6[R~X(m+t)(P)] - n!(p)
is called the common zero of A(s) and B(s)
(b) the zeros of pes) which are zeros of A(s) and
Bes) are called the explicit common zeros of
A(s) and h(s). otherwise they are called the
implicit common zeros of A(s) and B(s).
Let
rQ(s)
Lv(s)
4. LINEAR SYSTEMS
U(S)][x] [01 W(s) u y J
be the system matrix in polynomial form [I], where
x.u and yare the vectors of system variables, inputs and outputs. The following results can be
derived using the results derived in Section 3.
1. The system is completely controlable if and
only if
Proof: J r. ] 6rRnlx (n+t) (Q.U) - n:(Q,U).
d [fln (8)] = d I.G.C .0. of [detQl (s), detQ2 (s) •.. ,detQr (s)] l -[Definition 3.5] 2. The system is completely observable if and only
107
•
if
oG~(n+m)(QT,VT}J = U:(QT,VT).
3. The system is completely observable and control
able if and only if
8 [a~x(nxt} (Q,U)] + 0 [R~X(n+m) (QT, VT)]
- 8 [R~X(n+t+m) (Q,U, VT) J e e T e T = noo(Q,U) + noo(Q,V ) - uoo(Q,U,V ).
4.
5.
The system is invertible if and only if the
number of zeros of the system matrix is finite.
The system is solvable if and only if the
number of zeros of Q(s} is finite.
5. DISCUSSION
The scope of application of the theory developed in
this paper is not confined only to the topics just
mentioned. Due to lack of space, the application
to rational function matrices is omitted.
Interested readers can refer to [22J. Recent in
terest in coprime factorization of a regular trans
fer function matrix [21] and the stability studies
conducted using inners [23) pointed out that the
study of "functors" may open up a promising
frontier of research.
REFERENCES
1. H.H. Rosenbrock, State Space and Multivariab1e I~rory. New York : Wi~f!' ~~lO.
2. L.A. Zadeh and C.A. Desoer, Linear System Theory. New York: McGraw-Hill Book Co., 1963.
3. R.E. Kalman, "Some computational problems and methods related to invariant factors and control theory", John Leech edited, Computational Problems in Abstract Algebra. Hungary: Pergamon Press, 1970, pp.390.393.
4. R.E. Kalman, "Irreducible realizations and the degree of a matrix of rational functions". SIAM J. Appl. Math., Vol.13, No.2, June 1965, pp.520-544.
5. R.E. Kalman, P.L. Falb and M.A. Arbib, Topics in Mathematical System Theory. McGraw-Hill, 1969.
6. S. Barnett, Matrices in Control Theory with Application to Linear Programming. London: Van Nostrand Reinhold Company, 1971.
7. A.S. Householder, "Bigradients, and the problem of Routh and Hurwitz", SIAM Review, 10, 1968, pp.56-66.
108
8. J.M. Thmnas, Differential Systems. American Mathematical Society Colloquium Publications, Vo1.21. Baltimore, Md. : Waverly Press, 1937.
9. J.M. Thomas, Systems and Roots. Richmond, Va. The William Byrd Press, Inc., 1962, pp.2l-49.
10. Y.S. Ho and P.H. Roe, "Degree of polynomial matrix and explicit formula for the order of complexity of linear active networks". Proceedings, IEEE International Symposium on Circuits and Systems, San Fransisco, April 22-25, 1974.
11. W.A. Wolovich, "On determining the zeros of state-space systems", IEEE Trans. Aut. Control, Vol.AC-18, No.5, October, 1973, pp.542-544.
12. C.T. Chen, "Irreducibility of dynamic equation realizations of sets of differential equations", IEEE Trans. Aut. Control, Vo1.AC-13, 1970, p.13L
13. B. McMillan, "Introduction to formal realizability theory", Bell System Tech. J., 31, 1952, pp.217-279,541-600.
14. F.R. Gantmakher, The Theory of Matrices and Its Applications, Vol.I & II. New York: Chelsea, 1959.
15. V. Belevitch, Classical Network Theory. San Fransisco : Holden-Day, 1968.
16. E.G. Gilbert, "Controllability and observability in multivariable control system.:", SIAM J. Control, Ser.A, Vol.2, No.1, 1963, pp.128-151.
17. M. Newman, Matrix Representations of Groups. National Bureau of Standard Applied Mathematics, Ser.No.60. Washington, D.C. : U.S. Government Printing Office, July, 196B.
18. S.H. Wang and E.J. Davison, "A new invertibility criteria for linear multivariable systems", IEEE Trans. Aut. Control, Vo1.AC-IB, No.5, October, 1973, p.538.
19. M. Heymann, "The prime structure of linear dynamic systems", SIAM J. Control, Vo1.l0, No.3, August, 1972, pp.460-469.
20. T. Ohtsuki and L.K. Cheung, "A matrix decomposition-reduction procedure for the pole-zero calculation of transfer functions", IEEE Trans. CT, Vol.CT-20, No.3, May, 1973, pp.262-271.
21. C.A. Desoer and J.D. Schulman, "Zeros and poles of matrix transfer functions and their dynamical interpretation", IEEE Trans. Circuits and Systems, Vol.CAS-2I, No.1, January, 1974, pp.3-7.
22. Y.S. Ho, Detached Coefficients Representations and Degree Functor of a Polynomial Matrix with Application to Linear Systems, Ph.D. Thesis, Department of Systems Design, University of Waterloo, Ontario, Canada, March, 1974.
23. E.I. Jury, Inners and Stability of Dynamic Systems. New York: John Wiley & Sons, 1974.
A REPRESENTATION OF THE IMPEDANCE FUNCTION
IN TERMS OF THE POLES AND ZEROS FOR TRANSMISSION LINES
F. Kato and M. Saito University of Tokyo
Tokyo, Japan
Abstract
An expression of the input impedance is derived in terms of the poles and zeros for a nonuniform lossless transmission line terminated in an inductance. From the expression, residues at the poles are determined and Gel'fand-Levitan's theory is shown to be applicable to realize a line with the given poles and zeros. Some relations among poles and zeros of the four-terminal parameters are also given.
1. INTRODUCTION
Consider a nonuniform lossless transmission
line of finite length terminated in an inductance.
When the poles and residues of the input impedance
are given, the synthesis problem is reduced to a
linear integral equation by means of Gel'fand
Levitan's algorithm for the inverse Sturm-Liouville
problem [1,5,6]. Two sequences of poles corres
ponding to two different terminating inductances
can be treated in a similar way [2,4]. On the
other hand, poles and zeros of the input impedance
may be sometimes specified in, e.g., the phase
matching problem for optical circuits with symme-
trical structures. Although Marcenko's algorithm
[3,4] is available for this kind of problems, it
has a difficulty in that a nonlinear integral equa-
tion should be solved. It is the problem that is
investigated in this paper.
The organization of the paper is as follows.
Sections 2 and 3 are devoted to the preliminary
presentation of basic equations and some properties
of the solution, respectively. In Section 4, we
obtain the expression of the input impedance in
terms of the poles and zeros and the characteristic
impedance at the input terminal. In Section 5,
109
the residues of the impedance at the poles are ob
tained from which we get the asymptotic behavior of
the residues. The result allows us to apply
Gel'fand-Levitan's theory to the problem. Finally,
in Section 6, some collateral formulas are derived
for the poles and zeros of the four-terminal para-
meters.
2. BASIC EQUATIONS
Let a loss less transmission line have nonuni
form distributed inductance L(x»O and distributed
capacitance C(x»O, then the telegraphists' equa
tions are given by dV(x2!:!)
dx -pL(x)I (x, p) (1)
dI(x1E) - pC (x)V(x, p) (2) dx
where p is the complex frequency variable, O<x< t
is the spatial variable, and I(x,p) and V(x,p) are
the current and voltage, respectively. The elimi
nation of I(x,p) between (1) and (2) results in
_iJldV(x,E)}_p2C(x)V(x,P)=O. (3) dxlL dx
When output terminal is terminated in an inductance
Lt , boundary condition
V' (t,p) + Lit) V(t,p) = 0 t
(4)
is to be met. In addition, we impose the norma1i-
zing condition to V(x,p)
JL(t) (5) V{t,p) ~ 4 CU.) .
The input impedance is, in terms of the solution
V{x,p), expressed as Z () ~~~_ pL{O)V(O,p) in p I(O,p) V' (O,p) (6)
Suppose that the characteristic impedance
ZO(x) =JL(x)/C{x) (7)
is twice continuously differentiable. By the
Liouville substitution
~(x) '" \JL(X)C(X) dx (8)
A=~(l!,) .(9)
yO(O '" l/ZO(~) (10)
_......l-. d 2J'YOfIT q (0 - )10 (~) d~2 (11)
y(~,p) =JYO(~) V(~.p) (12) Eqs.(3)-(6) are transformed, respectively, to
y"(!;,p) -{ p2 +q(O}y(~,p) '" 0 (13)
where
y' (A,p) + H y(A,p) 0
y(A,p) =1 pZO(O)y(O,p)
Zin(P) "'-y'(O,p) -hy(O,p}
3. PROPERTIES OF THE SOLUTION
(14)
(IS)
(16)
(17) , (18)
Concerning the solution V(x,p) of (3)-(5),
V(O,p) and V'(O,p) are even entire functions of p
and can be written as w 2
V(O,p) C11T(1+~) k=O zk co 2
V'(O,p) =C2(p2+p~) TT(l+~)
(19)
(20) k=l P
where C1 and C2 are constants, and jz:and jpk are
a zero of V(O,p) and of V'(O,p), respectively.
Two sequences of non-negative numbers {zk} and {Pk}
separate each other and asymptotic formulas
Pk=ak+O(-k1). zk=a(k+.!.2)+O(-k1 ) (2) ) , 1 ~ (22
hold, where
a=rr/A. (23) For real p, we can write the solution y(~.p)
of Eqs.(13)-(15) as
Y(!;,p) = cosh p(A-I',;) + J; K(~, t) cosh p (A-t) dt (24)
110
where the kernel K(~,t) is independent of p.
4. THE INPUT IMPEDANCE
In this section, we derive an expression of
the input impedance in terms of the poles and zeros
for the transmission line. From (16) and (24), it
can be seen that
(25)
On the other hand, substitution of (19) and (20)
into (6) yields it ( 1 + p2/z~) k=:;:..O ___ _
(26) TT'" 2 2 •
( 1 + l' /Pk
) k=l
Evidently. {jpk} and {jzk} are poles and zeros,
respectively, of Zi (p). Making use of the formu-
la n TI{1+x2/( k+!)2} k=O 2
coth lTX =: co
Eq.(26) is rewritten as
the series
Thus
lim Zi (p) p-.oo n
1fX IT (1+x2/k2 ) k=l
(27)
(30)
(33)
(34) Consequently, the input impedance is, from (25),
(28) and (34), given by
(35)
where
2 2 2 • (36»)(37) p + a (k+ 1/2 )
The leading term ZO(O)coth pl'l in (35) coincides
with the input impedance of the uniform line with
the same electrical length 1'1, and the infinite
product represents the effects of the deviation of
poles and zeros from those of the corresponding
uniform line.
5. THE ASYMPTOTIC ESTIMATE OF RESIDUES
The residue of Zin(P) at the pole jPn: 2 2
h(n)= lim (p +Pn) Zin(P) P+jPn 2p
is calculated from (35) without difficulties.
The result is as follows.
(38)
1 Case 1: Pn=a(k+Z )' ak, for any non-negative
integer k. 2 2 2
( ) ZO(O) p - a n 2 2 h n = 2 n cotp 1'I/T<I>k(-P )TI'I'k(-P ).(39)
Pn n kfon n k n Case 2: Pn = an.
Z (0) 2 TT 2 h(n)= ~lir<l>k(-P) 'I'k(-Pn)·
Ll k+n n k (40)
Case 3: p = am. for non-negative integer m = n. n 2 2 2
(n)_ ZO(O) Pn- a n 2 2 h - 1'1 2 2 IT <l>k(-Pn)n'l'k(-P ). (41)
Pn- Pm k+m,n k n
Case 4: Pn=~(m~1~2), for non-negetive integer m. Z (0) 1I p - a n
h(n)=_O __ n (z2_ p2)JT<I> (_p2)lT'I' (_p2) 4 2 m n k+n k n k k n·
In any case, th~nresidue hen) is positive. (42)
Suppose that the poles {p } and the zeros n
{z } have the asymptotic behavior n
a O a l 1 Pn= an+n +3+ 0(4)
n n
1 bO b l 1 z =a(n+z ) +1+--1-3 +0(4)· n Il+-2 (n~) n
(43)
(44)
respectively, where aO' al , bO and bl are const
ants. Applying similar computations as [2], we
obtain the asymptotic estimate of hen): 3
h(n)=Z (O)[.!+l:..t~A-tO+ l'Iao)l]+o(l:..) (45) o 1'1 2 2 21T 3 1 3
n 1T n where
00 00
A= }(p2_a2k2_2aa )_L:{z2_ a2(k+.!)2_2ab} ~k 0k=Ok 2 0
2 bO 1T2 2 7 +1T bo(ao-T) -Tao-zaaO. (46)
Equation (45) is compatible with the asymptotic re
quirement imposed on the normalizing constants by
Gel'fand-Levitan's theory. Thus, there exists a
transmission line with a finite length whose input
impedance has the specified poles {jp } and {jz }. n n
Furthermore, ,the resulting q(~) is absolutely conti-
nuous [2].
6. FOUR-TERMINAL PARAMETERS
Let the output terminal electrically open. We
denote the poles and zeros of Zll(P) by {jPn} and
{jzn}' respectively, and the zeros of Z22(P) by
{jw}. In a similar manner as Section 4, we obtain n
the following expressions:
Z11 (p) = ZO(O) coth pl'l TT<I>k(p2) IT 'I'k(p2) k k
Z12(p) = 4Z0(O)ZO(I'I) cosech pl'l IT <l>k(p2) k
2 TT- 2 Z22(P) =ZO(l'I) cothPl'llT<I>k(P) 'I'k(P) k k
where 2 2
(47)
(48)
(49)
- 2 P +wk 'I' k (p ) = 2 2 1 2 (50)
p+a(k+Z ) From (48), the residue of Z12(P) at the pole jPn is
given by r:c--7"""::-J",,, 2 2 2
( ) JZO(O)ZO(I'I) p - a n 2 hl~ 2 n cosec p 1'1 TT <l>k(-P ) (51)
Pn n k+n n provided Pn = ak, for any non-negative integer k.
Here, we notice the relation zll (p)
zI2 (p) -=:--=---=-~ IT (l+p2/z~) . k
Invoking Eq.(39), it can be seen
2 2 2 2
(52)
( ) Zo(O) p -a n TT 2:TT zk h n ____ n __ cosec P (0, 4>k(-Pn) I I 2 I 2· (53)
12 2 Pn n k ka(k+l)
Comparison between (51) and (53) yields 2
ZO(l'I) = TT zk
ZO(O) k a 2( k+.!)2 Analogously, making use of 2
Z22(P) Z (p) =-=-=-~
12 rr(l+p2/w~) k
2 1 2 ZO(I'I) = na (k+ Z)
we obtain
(54)
(55)
(56) ZO(O) k w~
The residue of Zll(P) at the pole jPn is given by
III
(39) • On the other hand, it follows from 2 2
l+p /zk Zl1(p) =Z22(P)TI 2 2
k l+p /wk
in all cases cited in Section 4. The formulas
(57)
correspond to the fact that four-terminal characte
ristics of a transmission line is completely deter
mined by Zll(P) alone and that fwn} can not be
independent of {p } and {z }. n n
7. CONCLUSION
For a nonuniform lossless transmission line
terminated in an inductance, the input impedance
has been explicitly determined by the poles, zeros
and the characteristic impedance at the input ter
minal. The residue at each pole can be determined
from the expression of the impedance. It has been
shown that the sequence of residues meets the asym
ptotic requirement for the normalizing constants of
the Sturm-Liouville equation. The result of
Gel'fand and Levitan for the inverse Sturm-Liouville
operators can be applied to arrive at a nonuniform
line with the specified poles and zeros. Expre-
ssions for four-terminal network parameters Zll,Z12
and Z22 have also been obtained in a similar manner.
Corresponding to the fact that only one of Zll and
Z22 can arbitrarily be specified, a series of rela
tions among poles and zeros of Zll~ and zeros of
Z22 has been derived.
REFERENCES
[1] LM.Gel 'fand and B.M.Levitan, "On the Determina
tion of a Differential Equation from its Spec
tral Function," Amer.Math.Soc.Transl., Ser.2,
Vol.l, pp.253-304(1955).
[2] B.M.Levitan and M.G.Gasymov,"Determination of a
Differential Equation by Two of its Spectra,"
Usp.Mat.Nauk., pp.1-63(1964).
112
[3] V.A.Marcenko,"Some Questions in the Theory of
One-Dimensional Linear Differential Operators
of the Second Order, I," Amer.Math.Soc.Transl.,
Ser.2, Vol.101, pp.l-104(1973).
[4J M.R.Wohlers,"A Realizability Theory for Smooth
Lossless Transmission Lines," IEEE Trans. on
Circuit Theory, Vol.CT-13, No.4, pp.356-363,
Dec. 1966.
[5] M.R.Wohlers,"A Realizability Theory for Smooth
Lossless Transmission Lines - Part II," IEEE
Trans. on Circuit Theory, Vol.CT-14, pp.442-444,
Dec. 1967.
[6] K. Horiuchi , K.Kawakita and H.Watanabe,"A Note
on the Synthesis of Terminated Nonuniform Trans
mission Lines," CT Res. Group, lnst.Elect.Comm.
Eng.Jap., No.CT68-l6, July 1968.
BIOGRAPHIES
Fumio Kato was born in Fukushima, Japan, on Novem-
ber 13, 1950. He received the B.S. degree from
Yokohama National University, Yokohama, Japan, in
1973, and the M.S. degree from University of Tokyo,
Tokyo, Japan, in 1975, both in electrical engineer
ing.
At present, he is studying toword the Ph.D.
degree in the Graduate School of University of
Tokyo, specializing distributed-constant networks.
Mr.Kato is a member of the Institute of Elec
tronics and Communication Engineers of Japan.
Masao Saito (M 1962) was born in 1933. He obtain-
ed B. of Engng. in 1956, M. of Engng. in 1958 and
D. of Engng. in 1962 from University of Tokyo.
Working for some time in Faculty of Engineering in
University of Tokyo, he is now serving as a profes
sor of medical engineering in Faculty of Medicine
in the same university.
His principal interest is in circuit and sys
tems theory, especially its application to biologi
cal and medical systems.
He is now a member of the board of Japan Soci
ety of Medical Electronics and Biological Engineer
ing, Vice-President of the International Federation
for Medical and Biological Engineering, Vice-Presi
dent of the World Association of Medical Inform
atics.
EVALUATION OF CONSTITUENT MATRICES
OF AN ANALYTIC MATRIX FUNCTION
Feng-cheng Chang Surender Pulusani
Alabama A&M University Normal. Alabama
ABSTRACT
An analytic matrix function of a given arbitrary square matrix is expressed in terms of constituent matrices when all eigenvalues with multicities are known. A simple and practicable approach for computing the constituent matrices are then formulated which requires only straightforward matrix multiplications.
In the theory of linear time-invariant systems,
solutions are often found to be expressible by
analytic function of a matrix A. such as eAt
and sin (A%t). If A is an n)( n arbitrary
constant matrix and f(s) is an analytic func
tion of complex variable s. then the analytic
matrix function f(A) of A is given by the
fundamental formula [1] [2] [3]
The calculation of constituent matrices is gener
ally very involved. especially for a large n.
However. if all the eigenvalues are known before-.
hand, a simple and practicable method for comput
ing Zkh may be developed as follows.
f{A) (1)
where eigenvalues sk with multiplicities rk are obtained from the characteristic polynomial
of A.
c(s) - det (sI - A)
~ sn-p ~ cp
m-l
II k-O
and constituent matrices Zkh' depenent on A
but not on f. are to be determined.
(2)
Let
and
m-l
{s - s {·rr (8 k k'-O
kr,.tt
k .. O ..... m-l; .t - o ..... rk-l.
k-O ..... m-l; t.-O ..... rk-l.
(3)
(4)
By direct differentiation of (3) we obtain the fol
lowing:
113
V.' ;;'f, ',rifl11
'lll (,' "
, ~ I
, h
k' * k, h' 0, " . , rk,l k' k, h' 0, ... , t -1
(5)
k' = k. h' = Q, , rk-l
where
1
j
(6)
are the coefficients of Taylor series expansion of
at
c(s) ckO(s) r
k (s - sk) n-rk
j
1
{ dkO
i 0
bki i-I d (10)
-L: ki-p
bkp ' i 1, 2,
p=O dkO
are found to be the coefficients of Taylor series expansion of l/ckO(s) at s = sk'
1
L (11) i=O
Since the matrices Cki
can be directly computed
from (4) and the scalars. bki can be easily eva
luated by (10) and (6) or some other methods [2]
[3] when all the eigenvalues are known, Eq. (9)
seems to be a very convenient formula for comput
ing constituent matrices of a given matrix with
mUltiple eigenvalues. It does require some ma-
2: dkj (s - sk) .
j=O
If we set successively in (1), f(s) = ck(s),
~ = 0, 1, .. , , rk-l, for every k, then from (5) we obtain
(7) trix multiplications, however, it does not need
any matrix inversions as done by some other ap
proaches [4] [5] [6]
(8)
k = 0, '" , m-l; ~ = 0, ... , rk-l.
The constituent matrices Zkh are therefore found by solving the rk simultaneous linear algebric equations (8) for every k
(9)
k 0, .••• m-l;
where
114
If we substitute (4) into (9), we find
m-l
-IT k'= 0
k' * k
k = 0, ••• , m-l;
(12)
which is also another useful and direct formula
for computing constituent matrices [7].
A complete computer program for finding the con
stituent matrices of a given arbitrary constant
matrix will be developed in the future contribution.
REFERENCES
1. F. R. Gantmacer, The Theory of Matrices, ___ Vol. I, New York: Chelsea, 1959.
2. J. S. Frame, "Matrix Functions and Applications --- Part IV: Matrix functions and constituent matrices," IEEE Spectrum, vol. I, pp. 123-131, June 1964.
3. F. C. Chang and H. Mott, "On the matrix related to the partial fraction expansion of a proper rational function," Proc. IEEE, vol. 62, pp. 1162-1163, Aug. 1974.
4. c. F. Chen and R. E. Yates, "A new approach to matrix Heaviside expansion," Int. J. Control, vol. 11, pp. 431-448, Mar 1970.
5. F. C. Chang, "Evaluation of an analytical function of a companion matrix with multiple eigenvalues," Proc. IEEE, vol. 63, pp. 818-820, May 1975. 6. F. C. Chang, "Evaluation of an analytical function of an arbitrary matrix with multiple eigenvalues," Proc. IEEE, vol. 65, (accepted to be published).
7. F. C. Chang, "A direct approach to the constituent matrices of an arbitrary matrix with multiple eigenvalues," Proc. IEEE, (accepted to be published).
BIOGRAPHIES
Feng-cheng Chang
Assistant Professor, Alabama A&H University Ph. D., University of Alabama, 1972.
Surender Pulusani
Assistant Professor, Alabama A&M University.
115
ON THE LOSSLESS SCATTERING MATRIX
SYNTHESIS VIA STATE-SPACE TECHNIQUES
Andrzej L. Dobrucki and Marian S. Piekarski
Institute of Telecommunication and Acoustics WrocXaw Technical University, Wroclaw, Poland
Abstract
In the paper a new direct algebraic method for a loss less scattering matrix synthesis is developed. The method rest mainly on the minimum state-variable realization theory, but it is more efficient from a computational point of view than other proposed methods. The synthesis algorithm relies on a direct calculation one of the possible scattering matrices describing 10ssless nondynamic part of realized network.
1. INTRODUCTION
LNN Sa
L-______ ~----____ ~ f---<> )1 S(p)
Fig. 1. Synthesis model of S (p)
2. PRELIMINARIES
1')+1
0+1<:.
1
H 1
H
Recently the state-space approach to the
network synthesis has been considered qui
te extensively in scientific literature
/see list of references in the book 2/. In
the field of integrated circuits, the me
thod has some significant app1ications,be
cause fundamental requirements of integra
ted technology are satisfied4 . In the pa
per3 ,4,5 a new direct procedure for 10ss
less admittance matrix synthesis has been
proposed, which is more efficient from a
computational point of vie than other me
thods. In this paper, by revising and ex
tending of the above procedure, a synthe
sis method of lumped linear finite statio
nary 10ssless /LLFSL/ n-port networks de
scribed by a scattering matrix S (p) is
presented. The method leads to a model
consisting of a 10ssless nondynamic net
work /LNN/ with /n+k+1/ terminals and
It is well known 1 ,2,6 that the necessary
and sufficient conditions for the realiza
tion of the nxn scattering matrix S(p) by
a LLFSL n-port network are
/n+k/ portS,which is terminated by groun
ded capacitors at its last k terminals as
shown in Fig. 1. The number of used capa
citor is minimal and equal to the McMil
lan's degree 2 of the scattering matrix S(p)
116
a. s(p) is rational an real for real p,
b. S (p) is analytic in Re p>O,
c. s(p) st(_p) = In. (1)
An nxn matrix. S(p) possesing the above pro
perties is said to be bounded real regular
and para-unitary and can be written in the
t r-i 1_0Bi P
form
. g(p) (2 a)
where the Bi are real constant matrices
and the least common denominator of the
entries in S (p)
g(p) =
is Hurwitzian6 .
The scattering matrix s(p) seen at the
first n ports of the networks shown in
Fig. 1 is given by
S(p)
where Sa' the scattering matrix of the LN
coupling network is given by
n k
(4)
The S(p) may be also regarded as a funct
ion of the new complex variable s:
(5)
According to a well-known result2 , the ma
trix (1) has a minimal realization IA,B,C,
01 and can be expanded in the form
where k = b [s (p)] is the MCMillan's de
gree of S (p) .
Comparison (5) and (6) imply the following
relation
Now, the loss less condition of S (s S t = )
a a a = In+k can be expressed in the following
equivalent conditions
A + At = -ctc, (8 a)
B -cto, (sb)
~Ot = 1 . (8 c) n
Hence the problem is to chose a minimal rea
lisation /A,B,C,O/ to satisfy equation (6) and at the same time guarantee that equa
tion (7) describes the scattering matrix of
the LNN.
3. SYNTHESIS ALGORITHM
The scattering matrix S (~ given by (2) can be expanded in the neighborhood of
p = c:c as
S (p) = S_l + (9)
By equating the right sides of (2 a) and (9)
r CIJ L r-i g (p) (S_l S.
). (10) Bip = +L ~
1=0 1=0 pi+l
Equating coefficients of like powers of p
on both sides of (10) ,
B. ]
o, ... ,r (lla)
117
and
r
L: bkS i - k - l for i ~ r+l. k=O
(ll~
The equations (ll~ give a simple recurrent
method to calculating of S .• ~
Substi tuting (9) to (2) and identyfying co-
efficients of like powers of PI
(12a)
Thus the real rational matrix given by (2)
is the scattering matrix of a LLF5L n
-port if and only if (12) is satisfied
and g (p) is Hurwitzian. Expanding the right side of equation (6)
in the neighborhood of p =00 as
cI) •
""' CA 1.B 8 (p) = D + L.. i + 1
i=OP
and equating with (9), we identify
(13)
Equation (14a) uniquely defines the ma
trix D satysfying the lossless condition
(8c), but the equations Cl4b) do not have
a unique solution for A,BtC satysfying
(8a) and C8b). For the purpose of choos
ing a proper matrices A, Band C, it is
convenient to define the block matrix
where
H e
is a Henkel
t -s -1
On S
Xe
On °
= H X e e
block
-8 t 0
t -1
n
matrix and
-5 t e-l
S t
e-2
(-1) e+1S_1
is a triangular block matrix.
(17)
t
T blocks of matrix Te may be written as ij
t;j ( )k+j-l t T .. = -1 Si+j-k-lSk-2 .
J.] =1 (18)
118
Combining (8), (14b) and (18),
Then
(20)
where
Equations (20) and (21) suggest that a way
of obtaining a pair A, C would be to form
the matrix Tel factor it in the form of
equation (20), and then try to identify A
and C from these factors. We do not know
in advance if the matrix Te formed from
the expansion coefficients of S(p) about
p (:lO can always be factored as indicated
in equation (20) ; hence we first study the
properties of Te' to see if it can be fac
tored in the desired form. Equation ~2a)
implies that the matrix S_l
Hence the matrix Xe is also
is nonsingular.
nonsingular.
::::nih:h::n:O:ee=~r::~ He' It is well
Hence
Using
Tij
rank Hr _l '" k "" b[ s(p)). (22)
rank Te "" rank Tr - 1 = k. (12b) and (18) it can be t. T T t
Tji ' ~.e. r-l r-l'
shown that
The generalized companion matrix f2 has the form2 ;
n -b 1 ].
,1 n
The characteristic polynomial of f2 is gi
ven by det(lnrs - n.) [g(s)]n, i.e. all eigenvalues of !1 lie in the left half pla
ne (Res.<O). ~
Equations (23), (IS) and (12b) imply the
following relation
(24)
where
(25)
It suggests that Tr _l
is the solution of
(24). Since Res i <: 0, the equation (24)
has the unique solution2 , which is symme
tric and positive semidefinite matrix.
Therefore Tr - l is symmetric positive se
midefinite matrix of the rank k. Such a
matrix can always be factored in the
form 7
T = MMt r-l
where M is nrxk real matrix. M can be
partitioned into nxk blocks Mi
t [ t t t ] M = Mo ,M l , ... ,Mr
_l
. (27)
Now by comparison of (27) with (21) we can immediately identify a suitable C as
C M (2S) 0
and
P r-l = M. (29)
Then (Sb) gives
B -CtD t -Mo S_1" (30)
To find a suitable A, if we define Td as
(31)
from (15), (IS) , (19), (20) ,(21) and (23) we
see that A must satisfy
119
(32)
The equation (32) has the unique solution
given by
(33)
where M+ is the Moore-Pemrose pseudoinver
se matrix to M7.
It is easy to prove that the matrix A gi
ven by (3~ satisfies the loss less condi
tion (Sa). From (27), (2S) and (30) it -fol
lows that (14b) is satisfied for i = 0, 1,
, ... ,r-l. To see that (14b) is also satis
fied for i ~r we can calculate
Then
r b.Ar - i L 1
1-0
r
L b.Ar - l 1
1=1
and hence (12b) gives
CArB
for i = r, r+l, ...
Concluding, if the minimal realisation lA,
B,C,DI is calculated from (2S) , (30) and
(3~ then the lossless conditions (S) are
satisfied and Sa given by (~ is the scat
tering matrix of LNN.
4. CONCLUSIONS
The synthesis algorithm presented in this
paper relies on a direct calculation of a
minimal realisation /A,B,C,D/ of the sca
ttering matrix S(p), where /A,B,C,D/ sa
tisfies the lossless conditions. If the
loss less minimal realisation of s(p) is
found, it is very simple to calculate one
of the possible scattering matrices Sa de
scribing LNN.
In the method presented here, in contrast
to methods proposed by Youla and Tissil
and also by Anderson and vongpanitlerd2 ,
it is not necessary to solve any matrix
equation. A factorization of a real con
stant symmetric positive semidefinite ma
trix Tr - l into a form MMt is only needed.
5. REFERENCES
1. D. C, Youla, P. Tissi, N-Port Synthe
sis Via Reactance Extraction - Part I,
IEEE Intern. Conv. Rec., 1966, pp.
183-208.
2. B. D. Anderson, S. Vongpanitlerd, Net
work Analysis and Synthesis - A Mo
dern Systems Theory Approach, Prentice
Hall, New York 1973.
120
3. M. S. Piekarski, Synthesis of Lossless
N-Port Networks with Reference to In
tegrated Circuits lin Polish/, I Natio
nal Symposium of Radio Science /URSI/,
Warsaw, Poland, 17-18 February 1975,
pp. 141-143.
4. M. S. Piekarski, Selected Synthesis Pro
blems of linear Microelectronic Circu
its lin Polish/, Pro Nauk. Inst. Tele
kom. i Akustyki, Politechnika Wroclawska,
Nr 24, Seria: Monografie Nr 9, Wroclaw
1976.
5. M. S. Piekarski, Lossless Integrated
N-Port Synthesis Via State-Space Tech
niques lin Polish/, Archiwum Elektro
techniki, t. XXV, z. 2, 1976, pp. 343-
357.
6. R. W. Newcomb, Linear Multiport Syn
thesis, McGraw-Hill, New York 1966.
7. F. R. Gantmacher, The Theory of Ma
trices, Nauka, Moscow 1966 lin Rus
sian edition/.
Q
ALGEBRAIC CHARACTERIZATION OF MATRICES WHOSE
MULTIVARIABLE CHARACTERISTIC POLYNOMIAL IS HURWITZ IAN
Marian S. Piekarski
Institute of Telecommunication and Acoustics, Wroclaw Technical University, poland
Abstract
Suppose g(Al
, A2
, ••. Ar} = det(A n - A } is multivariable characnr
teristic polynomial of an arbitrar§ nr x nr complex matrix An ,whe-
re A All + + ... + A I is an nr x nr diagona1 nr ml r mr
matrix with diagonal complex variables Al' In the paper the necessary and sufficient conditions are given for
the matrix Anr
to have Hurwitz multivariable characteristic polyno-
mial, i.e., to have g( Al
, A2 , ••• , A r} ~ 0 in the polydomain
Re Ai Q 0 (1 " i ~ r) . This is generalization of well-known Theorem given by Lyapunov to
the multivariable case.
1. INTRODUCTION
In the paper the following symbols will
be used:
W '7 0, W~O, W<..O, W~O - positive, nonne
gative, nonpositive definite matrix, res
pectively,
1m - identity matrix of order m,
+ - direct sum of matrices. A i' Ai - complex and complex conjugate
variable, respectively,
ReA i - real part of Ai' g( A
l, A 2' ••• , Ar} - a polynomial g in
>'1' A2 ,···, A r'
An - n x n complex matrix,
t -A , A, A* - transpose, complex conjugate
and complex conjugate trans
pose of A, respectively,
det A - determinant of A,
x = [Xl,x2 , ... ,xn ]t - a column vector,
diag [al,a
2, ••• ,ak] - a diagonal matrix
with diagonal elements
W* m
a l , a 2 ,··· ,ak ,
Wm - m x m Hermitian matrix,
+
. .+ A rlm
r be a diagonal matrix with diago-
121
nal complex variables Ai (i = 1,2, ... ,r),
where ml + m2 + ... + m. + . .. + m ~ r
= nr (1' mi
( nr
, r (. n r ). Let A be an arnr
bitrary n x n complex matrix and consir r
der the multivariable characteristic poly-
nomial
In the case ml takes the form
(1 )
n this polynomial
det( An 1
det(A1
I -A ) n n
(2)
and its properties have been studied for
many years, especialy, with regard to the
stability problem of differential equat
ions in normal form.
It is well-known that in a study of the
stability problem the following idea of
Hurwitz polynomials is important l ,2
Definition 1
A polynomial g(A 1 ) of one independent com
plex variable is called a Hurwitz polyno
mial if g(A1
) 1 0 in the closed domain
ReAl ~ 0.
The important algebraic characterization
of matrices whose one variable characte
ristic polynomial is Hurwitzian has been
given by the following Theorem of Lyapu
nov 3 :
Theorem 1 (Theorem of Lyapunov)
Let An be an n x n complex matrix. Then
the characteristic polynomial g(A l ) = = det(A I - A ) is Hurwitzian if and anInn ly if, for any positive definite n x n
Hermitian matrix K , there exists a posin tive n x n Hermitian matrix Wn which sa-
tisfies the matrix equation
W A + A* W n n n n -K
n (3)
As the name implies, this is originally
due to Lyapunov, but we shall use a sligh
tly modified version that is more conve
nient in further consideration than the
original Theorem.
Theorem 2
Let An be an n x n complex matrix. Then
the characteristic polynomial g"( A 1) = = det(A 1I - A ) is Hurwitzian if and ann n ly if there exists a positive definite
n x n Hermitian matrix Wn such that
W A + A* W < 0, n n n n
(4)
122
A proof of the above Theorem follows direc
tly from the Theorem 1 and will be omitted
here.
Recently, multivariab1e rational functions
and matrices have been finding increasing
applications in analysiS, synthesis and
stability problems of networks and systems . 1,2,3,4,5,7 I (see list of references 1n . n
a study of the above problems the follo
wing idea of mu1tivariable Hurwitz poly
nomial has important application.
Definition 2
A polynomial g(A 1 , A2 , ... , Ar) of r inde
pendent complex variables is called a Hur
witz polynomial in r variables or to be
Hurwitzian, if g(A l , A2 , ... , Ar) 10 in
the closed polydomain ReA i 1 ° (1~ i~ r).
This paper is devoted to the establish
ment of the algebraic characterization of
matrices whose multivariable characteris
tic polynomial is Hurwitzian.
2. PRINCIPAL RESULTS
Let A be an arbitrary n x n complex man trix. Initially, we consider the multiva-
riable characteristic polynomial
(5)
where An = diag [AI' A2 ,···, An1' The above polynomial is a special case of
the polynomial given by (1) when r = nr =
= n, i.e., when ml = m2 = ... = mn = 1.
In order to establish a basis for charac
terization of the general case, and for
simplicity, we shall prove the following
Theorem in terms of the above special case:
Theorem 3
The multivariable characteristic polyno
mial g(A l , A2 ,.·., An) = det( An - An) is
Hurwitzian if and only if there exists
a positive definite diagonal matrix
Wn = diag [wl , w2 ' '" ,wn], (wi) 0;
i 1,2, ... ,n) such that
(6)
Proof:
Sufficiency. Let us suppose that there
exists a positive definite diagonal ma
trix Wn = diag [wl ,w2 ' ••• , wn], (Wi '> o~ i = 1,2, ... ,n), such that Tn = WnAn +
+ A*nWn< 0, Le., X*TnX(O for any complex
n-vector x t 0, and det(An - An) is not
Hurwitzian.
By Definition 2 there exist n complex num
bers Al , A2 , ... , An with nonnegative
real parts (~i + Ai)O~ i = 1,2, ... ,n),
so that the linear system
° (7)
possesses a nontrivial solution xo' From
(7) we have
a contradiction. Thus the condition of
the Theorem is sufficient.
Necessity. For n = 1 this is obvious. For
arbitrary n we proceed by induction, that
is to say we assume the result to be true
for n = k - 1 and show that this implies
its truth for n = k.
Partitioning Ak and A k as follows:
A k
[--~'!.~-~£-J k-l Ac I ad 1
I
k-l 1
k-l 1
(8)
123
it is easily seen that the polynomial det
(Ak - Ak ) can be factored in the following
forms:
1. if det( Ak _ l - Aa) t 0, then
Ab]det(Ak _ l - AaJ , (9a)
2. if (~k - ad) t 0, then
(A k - ad)det(Ak _ l - Aa
AbAc ) . (9b)
Let the polynomial det(Ak
- Ak
) be Hurwi
tzian. Then, according to the Definition 2
and (9a), for each A k-l such that
det(A k-l - Aa) to and ReAi~ ° (l(,i~ k-l),
the equation
° (10)
is satisfied only for Ak such that ReAk<O.
In particular, when A i 00 (1 ~ i ~ k-l) ,
(10) gives Ak - ad ° for Ak such that
ReAk (0, Le.,
in the domain ReAk ~O. Furthermore, (9b)
and (11) imply
in the polydoma.:ln ReAi') ° (l 'i~ k). Hence,
the matrix
A + a
in the domain ReA k ) 0, and the matrix
Cl2a)
(12b)
generate characteristic polynomials which are hurwitzian.
If Dk_l = diag [dl'd2 ,··. ,dk_l] matrix, then
is Hermitian matrix.
is real
(13 )
Hk- l , as a Hermitian matrix, has only real eigenvalues 3 , It is also clear from (12a)
and (13) that for a fixed Ak the maximal
eigenvalue Pm of Hk- l and Hk_ l and Hk
-l
itself are functions of Ak and Dk- l •
Therefore, for each Dk_l , there exists
Ak = Ako = h(Dk_ l ), (ReAko ) 0), such that
Pm reaches a maximal value Prnrn in the domain ReAk ) O. Hence,
in the domain ReAk'q O.
f(hCDk
_l
) ,Dk
_l
)
{l4}
Since Bk- l generates characteristic poly
nomial which is Hurwitzian, by the induc
tion hypothesis, there exists a matrix
-Dk _ 1 == Wk - 1 ) 0 such that
(11) implies
ad + ad < O.
(15)
(16)
According to a well-known Theorem that a
Hermitian matrix is negative definite if
and only if its maximal eigenvalue is negative3 , (14) and (15) imply
and
124
in the domain Re ~k q O.
Summarizing, it is shown that there exists
a constant diagonal matrix Wk- l > 0 such that
A* A* c b)W (0
- - k-l Ak - ad
for any Ak in the domain ReAk
) O.
Putting
Ta Wk_lAa + A*aWk_l'
t2 = A*c'
it is seen that (17) takes the form
t2t*1 + ----"'-_- < 0
Ak - ad
in the domain ReAk
) O.
(17 )
( ISa)
(ISb)
(lSc)
(19)
In particular, when Ak - 00 then Bk
_l
= Aa' Hk_1 Ta and therefore
Ta (0. (20)
Let
Wk· [-~~=l-t--i;] :-' k-l I
where T is given by (18a) and a
anci
k-l
1
The object of course is to prove that the
re exists dk = wk " a such that Tk < a. Be
cuase T a < a, it is only necessary3 to es
tablish that det Tk and det Ta have the
oposite signs at dk = wk ) a.
Since
det T = det[-:.:'l-l::~--J= k T* IT
bl d
= det [~~-L~::-~~~~~~~~-J T* IT -T* (T IT)
bl d b a b
= det [~~_l ____ ~_-----J T* I T T* -IT
b I d- bT b
(Td - T* T- l b a .
I a
it is only neccessary to establish that -1
Td - T*bTa Tb < a or equivalently that
(21)
at dk
= wk
.., a. Let us consider four following cases:
Case 1. tl = a
Then for any dk w
k from the interval
ad + ad
t* T -It 2 a 2
inequality (21) is satisfied.
Case 2. t2 a
Then for any dk
= wk from the interval
t* T -It 1 a 1 ( w
k <.00 a (
inequality (21) is satisfied.
Case 3. tl = t2 a.
This is a trivial case in which inequali
ty (21) is satisfied for any dk= wk ) a.
Case 4. tl ~ a and t2 ~ a.
Let x be any complex (k-l) vector. Then,
from (19) it follows that
P x*Hk
_l
x x*T x + x*t l t*2x
+ a Ak - ad
+ x*t2t*lx
< a (22) A -k ad
for any x ~ a and Ak in the domain ReAk~a.
It is easy to show that, at a fixed x ~ a,p
reaches a maximal value Pmax in the domain ~--'= _
ReAk )/ a for
Re .:id1m y ) ,
Re y + IYI (23a)
where
y=x*tlt*2x • (23b)
125
Substituting (23) into (22) and using the
identity 2 (Rely) 2 = Re y + \y\' one can
obtain
_, i 2 (Re lX*tlt*2x)
P x *T x - 4 -----=---=::..-.-- (a (24 ) max a
ad + ad
for any x ~ a.
Because x*TaX < a for any x ~ a and ad +
+ ad < a (inequalities (16) and (20)), (24}
gives
for any x ~ a.
Let
x = T -let + bt2),
a 1
where
b t* T -It
1 a 1
t* T- l t2 2 a
(25)
(26)
'7 a , (27)
Using (26) and (27), (25) may be transfor
med into the form (t* +t* b)T -let + t 2b) _-=:.1_.-:2=-----=a:..-.--.::.l--'~ < a •
b (28)
Multiplying both sides of (28) by b, (28)
takes the final form
(ad + ad)b - (t*l + t*2b )Ta - l (t l + t 2b)(0
(29)
Now, it is very easy to observe that
(21) and (29) are identical for dk = wk:
= b. It means that det Tk and det Ta have the opposite signs for wk ~ b. The proof
is now completed by induction from n ~ 1,
Q.E.D.
Now, we consider the general case of the
multivariable characteristic polynomial
r n L m. and r ~ n .
r i"1 1. r In this case, the following
- A ) n ' r
Theorem gi-
ves the algebraic characterization of ma
trices whose multivariable characteristic
polynomial is Hurwitzian:
Theorem 4
The multivariable characteristic polyno-
mial g( Al , A2 , ... , A) :: det( A -A ) r nr nr
is Hurwitzian if and only if there
exists a positive definite Hermitian ma-
Wm + W .;. .. '+Wmr
(W . 1 m2 mi
W* ) OJ i = 1,2, .•. ,r) such that mi
(30)
A proof of the above Theorem follows from
the Theorems 2 and 3, but the considera
tions are rather long and will not be
presented here6 •
3. REFERENCES
1. N. K. Bose, E. I. Jury, Positivity
and stability tests for multidimen
sional filters (discrete-continuous),
126
IEEE Trans. on Acoustics, Speech,
and Signal, ASSP-22, 1974, pp. 174-180
2. N. K. Bose, R. W. Newcomb, Tellegen's
theorem and multivariable realizabili
ty theory, Int. J. Electronics, 36,
1974, pp. 417-425
3. F. R. Gantmacher, The Theory of Matri
ces, Nauka, Moscow, 1966 (in Russian
edition)
4. T. Koga, Synthesis of finite passive
n-ports with prescribed positive real
matrices of several variables, IEEE,
Trans. on Circuit Theory, CT-15, 1968,
pp. 2-22
5. M. S. Piekarski, Absolute stability of
linear n-ports, Proc. 1975 IEEE Inter
national Symp. on Circuits and Systems
Boston, April 21-23, 1975, pp. 112-
115
6. M. S. Piekarski, Algebraic characteri
zation of matrices whose multivariable
characteristic polynomial is Hurwi
tzian(to be published)
7. H. L.Van Trees, Synthesis of Optimum
Non-linear Control Systems, The M.l.T.
Press, Cambridge, Massachussetts,1962
CONTRACTION OPERATORS OF CLASS C o
AND THE STRUCTURE OF
A CLASS OF INFINITE DIMENSIONAL SYSTEMS
David J. Hedberg Hughes Aircraft Company
and
N. Levan Department of System Science
University of California, Los Angeles
SUMMARY
In this paper we identify a class of infinite dimensional linear systems which behave in a sense like finite dimensional ones. Specifically, we consider two types; 1) discrete time systems for which the state operator A is a C contraction on a Hilbert space, and 2) "compatible" continuous timg systems for which A is the infinitessimal generator of a contraction semigroup on a Hilbert space and whose cogenerator is a C contraction. This class of systems includes finite dimensional systgms as a particular case. It is shown that controllability and observabi1ity for these systems can be characterized in terms of the "cyclic vectors" of the decomposition of a Co contraction.
The task of finding the multiplicity and the cyclic vectors for a given infinite dimensional contraction operator is generally difficult. We propose in this paper alternate criteria for contro11abi1ty and observabi1ity involving known "cyclic subspaces". The characterization of these subspaces is a result of the Nagy-Poias theory of contraction operations. sufficient conditions for controllability and observabi1ity of the above systems are thus obtained.
Finally, we cite physical examples illustrating some aspects of this class of (nearly finite dimensional) systems.
127
J !
'I,
DISCRETE-TIME SYSTEM OPERATORS
ON RESOLUTION SETS OF SEQUENCES
R. J. Leake and B. Swaminathan Department of Electrical Engineering
University of Notre Dame Notre Dame, Indiana
Abstract
This paper introduces the concept of a resolution set of sequences as a natural consequence of the Hilbert Space resolution theory of Saeks [1] and others.
1. INTRODUCTION
Much of the theoretical work being carried out on the study of discrete-time systems adheres to the consideration of l spaces which seem to have
p all the "right" properties one needs to carry out system analysis. It is felt, however, that occasions often arise in which some special non-l
p class of sequences are of particular interest, and that the "right" properties should be carefully defined. In this paper we apply certain resolution space [I} ideas to sequences and give an outline of how these ideas can be carried forth in the study of discrete-time operators and systems. It is hoped that the reader will bear with us on the terminology. We should use "sequence" as a prefix to the term resolution but drop it for simplicity.
2. SHIFT INVARIANT RESOLUTION SETS
Let s denote the linear space of sequences of complex numbers with typical elements denoted by
(1)
We deal here with the truncated subspace s+ of consisting of sequences which are zero in value for negative arguments with typical elements denoted by
x = ( ..• ,0,0,xO,xl
,x2 ' ... ) (2)
or, presuming that (2) is understood, by x = (xo'xl ,x2'···) (3)
s
128
Introducing the delay shift operator D and
the truncation operators PN
and pN for N = 0,1,2, ...
by
Dx =(0,xO,xl ,x2 ' .•. )
PNx =(0, ..• ,0,xN,xN+l ' ... ) (4)
+ +. we say that a subset of sequences a C:s ~s
(i) Shift invariant iff DNa+ C a + all N=0,1,2, ...
(ii) pN invariant iff N + +
PaC a all N=0,1,2, ...
(iii)PN
invariant iff PNa +
Ca + all N=0,1,2, •..
It is easy to construct examples to show that these conditions are generally unrelated in that any combination of the three conditions may hold without the others being true. Motivated+by [1] we define a resolution set as a subset of s which is both
N P and PN
invariant. Notice that
N PNx + P x = x
or (5)
I
and the operators thus form a resolution of the + identity on a .
The starting time of a subset 'a+ C s+ is the
least+integer K such that YK fO for some sequence YEa .
Notice that if a+ is a resolution set with starting time K then the "impulse" sequence
a(K) = (0, .•. ,0,1,0, ... ) (6)
Kth
is in a+ since if y E a+, YK
fO then
(7)
A shift invariant resolution set is one for which condition (i), (ii) and (iii) hold simultaneously.
A resolution space is a resolution set a+ which is a complex subspace of the vector space s+. That is, for complex c and any x, y E a+ we have
(8)
Theorem 1. A shift invariant subspace a+ of -+ s with starting time K is a resolution
a(K) E a+. space if and only if
Proof: If a+ is a resolution space then + a(K) E a trivally from (6) and (7) above.
+ On the other hand, suppose a(K) Ea. Then + if yEa ,
and hence we also have
Corollary 1.
Corollary 2.
For vector spaces, pN invariance is equivalent to PN invariance.
Because o(K) can be shifted, multiplied, and added over any finite range, all shift invariant resolu-
tion spaces a+ with the same starting time K are equivalent over any finite range. That is
a)
b) DK sc+ C a+ , where sc+ denotes sequences of compact support.
Examples of shift invariant resolution spaces are
+ s+ and 0 + sc , the ~ spaces. p
The theory can be enriched through the z-transform, defined here as a formal series
+ specified by each sequence YES by
-1 -2 y(z)=YO+ylz +Y2z + ... (9)
and it is easy to see that the delay shift of y
corresponds to multiplication of A(Z) by z-l. A partial hierarchy of shift invariant spaces and their isomorphic images with starting time K = 0 is indicated below, with most of the spacial designations given in [2], [3], or [4].
Tr + c. C + C. A + C C + HI c P + C D + C F + I I I I I r I I
sc+ Co rd+ c: ll+' c + C + + + +
al
c loo c. sg c s
+ For example, F denotes the set of all formal
series, D+ denotes the set of (one sided) periodic + distributions, Tr is the set of polynomials in
z-l To illustrate the use of the z-transform we have
129
Theorem 2.' Any shift invariant resolution
subspace of l2+ which is closed in the l2+
topology is of the form DKl2+·
Proof: By Beurling's Theorem [5], every
closed shift invariant subspace of H2 is of
the form {F(z) I F(z) = g-KB(z)S(z)G(z), -K where G(z)EH
2}z B(z) is a Blaschke product
and S(z) is a singular inner function. But
Theorem 1 implies that
, for some G(z)EH2, so G(z) = l/B(z)S(z)
But l/B(z) is not in any H space, nor is p l/S(z) [3,p28] except in the trivial case
,B(z) = S(z) = 1. Hence the shift invariant
resolution subspace must be of the form z~~2 in the Z-domain or DKl
2+ in the time sequence
domain.
Many interesting problems concerning resolution sets and spaces remain to be investigated, but let us turn now to investigate their interaction with operators and systems. Henceforth,
a+ and b+ will denote shift invariant resolution
sets, and if an operator T:a+ ~ b+ is designated
as being linear, it will be implicitly assumed
that a+ and b+ are vector spaces. We have the following designations for T.
Causal: pNx pNx' implies pNTx pNTx'
Anticausal: PNx PNx' implies PNTx PNTx'
Time invariant:DNT P TDN N
(10)
The presence of the PN in the time invariant
condition is, perhaps, bothersome. It is present because we are dealing with truncated sequence spaces rather than uniform sequence sequences. This will be discussed later.
3. MATRIX REPRESENTATION OF LINEAR OPERATORS
In this section, we consider various conditions under which linear operators can be represented by matrices, and how matrices of certain forms characterize the operators they represent. First, we review some results on causality and time invariance of linear operators on resolution spaces. Proofs are omitted for the most part but can be found in [6J.
Theorem 3.
A linear operator G: a+ ~ b+ is causal iff anyone of the following equivalent conditions is satisfied for N = 0,1,2, ..•
a) pNx ~Nx' implies pNGx = pNGx'
b) N o implies pNGx = 0 (11) P x
c) GPN PNGPN
d) pNG pNGpN
Theorem 4.
A linear operator G: a+ ~ b+ is time invariant and causal iff DG = GD.
Theorem 5.
An operator T: a + ~ b + is time invariant iff
DT (12)
Now, we say that a linear operator
G: a+ ~ b+ is represented by a (semi-infinite) matrix.
goo gal g02
glO gll g12 (gka)
g20 g2l g22 {13)
130
iffy Gx is equivalent for k 0,1,2, ... to
(14)
An example of an operator G : a+ ~ s+ which is not representable by a matrix is given by y = Gx with
lim N-><x>
N 1 N+l l:
a=O x
ct
Yk = 0 for k = 1,2,3, ...
(15)
Note: a+ is the shift invariant resolution space for which the limit in (15) exists.
In general, it is not known when linear maps
G : a+ ~ b+ are representable by matrices, but some partial results follow:
Theorem 6.
A semi-infinite matrix (gkct) represents a
linear map G : s+ ~ s+ iff each row of the matrix has only a finite number of non-zero elements.
Theorem 7.
E b d d 1 0+ ~ 0+ is very oun e inear operator G : ~l ~ ~l
representable by a matrix (gka)' Each such matrix satisfies
sup l: ct k=O
Ig 1< "" ka (16)
Furthermore, a matrix (gka) represents a
b d d I' G .,e.+ ~ ,e.+ iff (16) is satis-oun e 1near map : 1 1
fied.
Theorem 8.
A matrix (gka) represents a bounded linear
map G : ,e.: ~ ,e.: if
Theorem 9.
sup l:: k a=O
Ig I < <lO ka (17)
+ + Each causal linear map G : a ~ b determines, and is determined by a lower triangular matrix
goo o 0
glO (gkct)
g20 (18)
o
Theorem 10.
Suppose a linear map G : a+ + b+ is repre
sented by a matrix (gka)' then G is time invar
ient iff there exists a sequence g such that
go g-l g-2
(gka) (gk-a) gl go g-l
(19) g2 gl go
Matrices of this form are called Toeplitz matrices.
The Toeplitz form (gk-a) is a natural form
to associate with the notion of time invariant operators.
(gk-a)
and if x' then y' Hence, xk
Note that ifG is represented by
1 1 0 0 0
0 1 1 0 0
0 0 1 1 0 (20)
(1 1 0 0 0 .•. ).x = (0 1 100 •.• ) (2 1 000 ... ).y = (1 2 100 ..• ). xk_l for all k and Yk = Yk-l for all
k > 1 so that the definition of the invariance (12) is satisfied. It is not true that ~ = ~-l for all k implies Yk = Yk-l for all k. This
would be the usual requirement for spaces running from minus infinity to plus infinity. A universal definition for time invariance could be the requirement (17)
(21)
which corresponds to (lO)because DND*N = PN'
where D*. the adjoint of D is defined by the advance shift relation
(22)
4. NORN. INVARIANCE. BOUNDEDNESS. CONTINUITY AND GAIN
Let a+ be a normal shift invariant resolution space. With starting time k. we make the following definitions for an operator T:s+ + s+
Finite Norm: The a+ norm of T. IITII IITxl1
sup+TIxIT' x£x
xlO
T is of finite a+ norm (FN) iff I ITI I <
Invariant: T is a+ invariant (INV) if Tx £ a+ for + all x £ a
Bounded: T is a+ bounded (BD) if it is of a+ finite norm and if TO = O.
Continuous: T is a + continuous (CON) if it is
any sequence x(n) £ +
invariant and if for a +
converging to x £ a • that is I Ix(n)-xl I + O.
have II Tx (n) -Tx II + O.
+ a
we
Finite Gain: T is a+ finite gain if there exists
Y. 0 < Y < 00 such that IlpNTx11 2. Y IlpNxl1 for
N = 0.1.2 •.•. and all x £ s~
Comments: Note that T s+ + s+ is a+ bounded iff
there exists 0 < y < such that IITxl1 2. y Ilxll
for all x £ a+ I iTI I is the infimum of all such y.
Theorem 11.
+ Given a particular a spac e. the classes of
maps T : s+ + s+ are related as follows:
131
Goneral Causal Linear Col~&l
S. MEMORY AND SYSTEM CONCEPTS
In this section we consider operators
T: a+ + b+ where a+ and b+ are shift invariant resolution sets. and discrete time systems. As a preliminary definition we say that T is memoryless if it is both causal and anti-causal. First we show that memoryless operators are completely specified by ordinary functions of a complex (or real. in the real case) variable.
+ + Lemma 1. T: a + b is memoryless iff xk = ~ implies (Tx)k = (Tx')k for k = 0.1.2 ••••
Lemma 2. T: a+ + b+ is memoryless iff there exist
functions Gk
: C + C k = 0.1.2 •••• such that
, ,I I
,il
I, !
for all x £ a+
(Tx\ = Gk (xk)
Theorem 12.
and all k 0,1,2, ...
T: a+ ~ b+ is time invariant and memoryless
iff there exists a function G:C ~ C such that for
all x £ a+ and all k = 0,1,2, ... (Tx)k = G(xk).
Corollary. T: a+ 7 b+ is linear, time-invariant,
and memory less iff there exists a homogeneous function G: C ~ C, G(ab) = aG(b), such that for
+ all x £ a and k = 0,1,2, ... (Tx)k = G(xk).
In order to delve further into the concept of memory it is convenient to introduce a discrete-time-system defined by the state equations
(23)
where the impact xk
£ X, the state sk £ S, and
the output Yk £ Y. (Here assume X and Yare
subsets of C, the complex numbers.) Now for any given value of s the discrete-time system (23)
o + + + induces a causal map T: a 7 b where a is a + resolution set with sequence values in X and b is a resolution set with sequence values in Y. The induced system operator is said to be of finite memory N if there exists a function G such that
(TX)k = Yk = G(Yk_l'···'Yk_N,xk,··,xk_N,k) (24)
for k~N, and of finite input memory N if ( (Tx)k = Yk = G(xk,··,xk_N,k) (25)
for k~N. These definitions can be carried+ove¥ directly to arbitrary causal operators T:a ~b ' and we see now that a "memory less" operator T is simply any causal T with zero input memory. It is well known that linear time invariant finite dimensional discrete-time system with a rational transfer function induce operators T of finite memory, whereas, if the_iransfer function happens to be a polynomial in z ,T will be of finite input memory.
The connection between causal operators T and discrete-time systems as defined above is strengthened further if we note that every such 0
operator induces, and thus can be represented by a discrete-time system with So = (0,0,0, ... ) and
(xk,O,O, ..• ) (26)
+
132
6. ACKNOWLEDGEMENT
This work was supported by the United States Air Force Office of Scientific Research under grant number AFOSR 76 3036.
7. REFERENCES
[1] R. Saeks, "Resolution Space Operators and Systems," Springer-Verlag, 1973.
[2] R. E. Edwards,"Fourier Series," Vol. II, Holt, Rinehart, and Winston, 1967.
[3] P. 1. Duren, "Theory of Hp Spaces,"
Academic Press, 1970.
[4] N. Dunford, J. T. Schwartz,"Linear Operators," Interscience, 1958.
[5] W. Rudin, "Real and Complex Analysis," McGraw Hill, 1966.
[6] B. Swaminathan, "Characterization of System Operators on Truncated Sequence Spaces," M.S.E.E. Thesis, Electrical Engineering Dept., University of Notre Dame, May 1977.
[7] R. Saeks, R. J. Leake," On Semi-Uniform Resolution Space," Proc. Midwest Circuit Theory Symposium, 1971.
SOME ASPECTS IN A THEORY OF
GENERAL LINEAR SYSTEMS
Robert H. Foulkes, Jr. Electrical Engineering
Youngstown State University Youngstown, Ohio
Abstract
The work discussed here deals with a theory of general linear systems. In a previous work, an exiomatic approach to such a theory was presented. Using set theory and abstract algebra, the usual notions of a linear state space, response separation, and superposition, as well as standard results about controllability and observability, were developed in an unusually abstract setting. The work discussed. here concerns an extension of the superpositon and response separation properties and some results concerning finite automata within the axiomatic framework.
133
CERTAIN ASPECTS OF INVERSE FILTERS
V. P. Sinha Electrical Engineering Dept.
Indian Institute of Technology Kanpur, India
H. s. Sekhon Electrical Engineering Dept.
Punjab Agricultural University Ludhiana 141004, India
Abstract
The problem of inverse filters is often encountered in instrumentation and geophysics. The existing techniques, being based on Laplace transform approach, lend into difficulty when the transfer function has zeros in the right half of the s-plane. Also, they do not adequately cover the practically more significant case of finite duration and/or discrete signals. The inverse filtering problem is re-formulated in terms of the operator techniques so as to include finite duration and/or discrete signals.
-'
11
. , , '
./ /
ADAPTIVE ANTENNA POLARIZATION SCHEMES FOR
CLUTTER SUPPRESSION AND TARGET IDENTIFICATION
George loannidis and David E. Hammers ITT Gilfillan
Van Nuys, California
Abstract
Optimum transmitter and receiver antenna polarizations for target discrimination in the presence of background clutter are obtained. The analysis uses the concept of Stokes vectors and Stokes target operators in a constrained maximization of the ratio of two bilinear forms, represt:nting the signal to interference powt:r ratio.
INTRODUCTION
Polarization characteristics of targets art: described in terms of
the spacial orientation of tht: electric !kld vector in the
im:ident and retlected electromagnetic waves. In current radar
technology designers take advantage of a target's polarization
properties by selecting an antenna polarization (transmittt:d
electric field orientation), which maximizes tht: t:nergy in the
backscattered wave for some assumed target orientation.
Uowevt:r, most targets do not hold their orientation constant
so that the backscattered ent:rgy is not always what it could
be. Designers have also attempted to exploit the polarization
characteristics of background reflections (clutter) by affixing
the antenna polarization such that background contributions
to the retlected energy are cancelled. For example, by taking
advantage of spherical properties of raindrops, which upon
retlection reverse the polarization of an orthogonally polarized
wave, rain echo is cancelled by antennas employing circular
polarization I I I, (2], (3),
Recent antenna developments make it reasonable to control
the transit/receive polarization in a more dynamic manner.
That is, rather than just applying a fixed, horizontal, vertical,
or circular reft:rence, the antenna's polarization can be con
tinuously adjusted to optimize target detectability in a partic
ular target and interference environment. Furthermore, the
problem can be set-up in an operator theoretic manner so
that optimization theory can be applied. In what follows we
do this by use of the Stokes Polarization vectors and Stokes
Target Scattering Operators, defined in the next section, and
then proceed to derive an optimum algorithm for selection of
antenna polarizations, which can be adaptively applied to
the problem.
POLARIZATION DESCRIPTION IN TERMS OF STOKES VECTORS
The scattering of plane polarized electromagnetic waves by
complex body scatterers such as aircraft targets, ground
clutter, rain, or chaff is governed by a 2 X 2 complex
symmetric operator known as the target scattering matrix
U given by
(I)
where hxx ' hyy, hxy = hyx are complex variables describing
the relative amplitude and phase of the x-plane and y-plane
polarized components of the incident and scattered waves.
, In terms of the scattering matrix, H, the voltage induced in a
receiving antenna with a polarization described by the electric
field vector.
135
e/ = (Er' Ey) r is given by
V = e/ He! (2)
where et' = EX' Ey t is the electric field vector of the trans
mitting antenna, the superscript' indicates matrix or vector
transposition and EX ' Ey are complex numbers representing the
relative amplitude and phase of the x and y components of the
electric field.
In radar applications where the target's aspect angle varies
continuously, the quantity of interest is the average received
power for given transmitter and receiver antenna polarizations,
which is given by
Ii \
1\
i
p (V·V), (3)
where the bar indicates time averaging and the superscript*
indicates complex conjugation. Substitution of Equation (2)
into (3) gives
(4)
By rearranging terms in the above equation, Kennaugh [II has
shown that the average received power can be expressed as
p = y' A X (5)
where X and Yare four dimensional vectors, whose elements
are the four Stokes parameters (Kennaugh [II. Poelman [21,
Born and Wolf [3)) describing the transmitter and receiver
antenna polarizations respectively, and A is known as the
average Stokes scattering operator for the target (see
Kennaugh [I]).
The elements of the Stokes vectors X and Y satisfy the
constraint
Xo 2 - x 12 - X22 - X32 = 0 xo>O
and
Y02_YI2_yl-Y32=0 Yo>O
(6)
(7)
The average Stokes scattering operator A is 4 X 4 real symmetric
matrix and its diagonal elements satisfy the equation.
(8)
Furthermore it can be shown that if the vector Z is given by
Z=AX (9)
where X is a Stokes vector then the elements of Z satisfy
(10)
If we introduce a matrix R given by
R = (:.;.::) o 0 0 ·1 (11 )
then Equations (6), (7), and (10) can be expressed as
(12)
We note that Equation (12) defines a convex subset of the
Euclidean 4-space which is invariant under the Stokes target
operator A.
From the above relations it follows that the bilinear form
p = y'A X
is always positive, when X and Yare Stokes vectors and A is a
Stokes operator, an expected result since P is an expression
for the average received power.
136
Optimum Transmitter-Receiver Polarization Combination for Target Detection in the Presence of Background Clutter
When the average Stokes scattering matrices A and e for the
target and clutter have been determined. transmitter and
receiver Stokes polarization vectors are selected so as to
maximize the signal to clutter power ratio.
Y'AX -- = r Y' ex 03)
with the restrictions
y'RY = 0
and
y'RX = 0 Xo >0
where the matrix R is given by Equation (II).
The bars denoting average Stokes matrices have been dropped
here for ease of notation.
Since the vectors X and Y in Equation (22) can be normalized
by appropriate constants that force Xo > 0 and Yo > 0 with
out changing the value of the ratio of Equation (22). the only
significant constraints are
X' RX = 0 (14) and
Y' RY = 0 (15)
Since the bilinear forms Y' AX and y' ex are non-negative
under the restriction that X and Y be Stokes vectors. one could
consider two cases as follows: the first case applies to clutter
suppression by itself and considers possible existence of a pair
of Stokes vectors X and Y such that
y'ex = 0
Y' AX> 0
(i)
(ii)
The second case assumes that there are no Stokes vectors X and
Y that satisfy Equation (i) and proceeds to maximize the ratio
of Equation (13).
Here we present the solution to the second case only.
Maximization of SignaI-to-CIutter Power Ratio
When there are no Stokes vectors X and Y such that Y' CX = 0
and Y' AX> 0, then maximization of the ratio of Equation (13)
under the restrictions of Equations (14) and (IS) is
equivalent to the maximization of
y' AX
subject to the restrictions
y' ex = k (constant)
Y'RY=O,yo>O
and
x' RX = 0, Xo > 0
Introducing Lagrangian multipliers A, and J.I. we obtain that the
optimum Stokes polarization vectors X and Yare given by the
solution of the system of equations
AX = p CX + J.I. RY, (16)
and
A Y = p CY + A RX . (17)
The solution vectors of the above equations are given by the
eigenvectors of
(RA - PRC)2 X = s2 X
where
s2 = AJ.!.
and s2 is not necessarily a positive number.
(18)
After some algebraic manipulation the above equation can be
expressed in the form
where B has the form
f30
is a scalar, b is a three-dimensional vector and B I is a
symmetric 3 X3 matrix.
(19)
(20)
From Equation (18) we obtain that the eigenvalues s2 are
solutions of
(21)
If for some value of s2 = sl2 satisfying Equation (21) the
determinant
then application of the constraint X' RX = 0 to the resulting
solution gives
b' (81 + s2 11 )-2 b = I
where I I is the 3 X 3 identity matrix. If relation (22) is
satisfied one can show (see Brauer and Nohel (41) that
(23)
D(p,s2) = Is21-BI = [(po-s2) - b'(B I +s21lrlbllBI +s2111.
(24)
From the above we obtain that Equations (21), (22). (23), and
(24) imply that s2 must satisfy
and
D(p.s2) = 0
0D(p.s2) = 0
0(s2)
i.e., s2 must be a double root of Equation (21).
If
(25)
C~6)
(27)
137
and also
(28)
then one can show that the vector b defined in
Equation (20) is either equal to zero or it is orthogonal to at
least one vector in the null space of the operator (B I + s 12 I I)'
Therefore there is always an eigenvector of the form
Since this is not a Stokes vector, the existence of a Stokes
vector solution requires that there are at least two eigenvectors
corresponding to the same eigenvalue, i.e., the root s2 = s 12 of
Equation (25) must have at least multiplicity two.
From the preceding discussion we conclude that a necessary.
although not sufficient, condition for an eigenvector of the
matrix (RA-pRC)2 to be a Stokes vector is that the deter
minant of Equation (21) have s2 as a double root. This can
happen if either s = sl is a double root of
-f(p,SI) = IR(A-pC) - Sill = 0 (29)
or if both s = sl and s = -sl are roots of the above equation.
Since the trace of (RA-pRC) is always zero as it follows from
Equation (8) we obtain that
o . (30)
This equation has one double root if
(31 )
and two equal and opposite roots if
(32)
Since the sum of the roots is zero. the above include the case
of two pairs of repeated roots. From the above discussion the
positive values of p satisfying the above equations include the
extrema of p under the constraint that X and Y be Stokes
vectors. a search among these values will yield the ones that
result in X and Y being Stokes vectors. Since s2 must be real.
only those resulting in real values of s2 need to be examined.
Additional criteria can be obtained but will not be discussed
here due to lack of space.
We also note that since Equation (18) can be written as
[R(A-pC) - 51 • [R(A-pC) + 51 X = 0 (33)
If s = sl and s = -sl are eigenvalues of (RA-pRn. then since
the two matrix operators in the brackets in Equation (33)
commute X is given by
x = kl UI + k2 U2 (34)
and from Equations (16) and (17) follows that
(35)
where k I and k2 are constants and U I and U2 are the eigen
vectors of (RA-pRC) corresponding to the eigenvalues sl and
-s I' respectively.
NUMERICAL APPLICA nON
As an application of the above method of selecting optimum
antenna polarizations we consider the problem of detecting a
small (radius of the order of the wavelength), horizontally
oriented, cylindrical target in the presence of a chaff cloud
consisting of a collection of randomly oriented dipole scatterers.
The Stokes scattering operator C for the dipole cloud is given by
c",o' (~ ; : D where 0
02 is a constant depending on the wavelength and
number of dipoles. If one assumes that the dimensions of the
cylinder are such so that its reflection matrix is given by
H=oc (2 0) . 0 I
where' ° c depends on the size of the cylinder and the wave
length, we obtain that the Stokes operator A for this
cylindrical target is given by
A=~~(: ~ ~ ~) 2 0 0 4 0
o 0 0-4
In maximizing the ratio of Equation (13) the constants 0 02
and 0c 2 can be neglected. From the determinant equation
f(p,s) = I s I -R(A-pC) I = 0
we obtain f(p,s) = s4 _ [3p2 _ 19p + 32) s2 + p2 [2p-lI) s
-4(p-4) (2p2 - ISp + 16) = 0 .
If we let dip) equal to the coefficient of s2, d I (p) equal to
the coefficient of s, and do(p) equal to the constant term in
the above polynomial in s we obtain that the values of p
satisfying d I (p) = 0, are
138
PI =0 and P2 = 5.5
The values of p satisfying
are
P3 = 2.13, P4 = 1.34
PS,6 9.2 ± j 0.9
and
P7,8 = S.l7 ± j 1.6 .
The maximum value of the above Pj's is P2 = 5.5. The
corresponding values of s are sl = 4, s2 = -4, s3 = 1.5, and
s4 = -1.5. The eigenvectors corresponding to s = ± 1.5 do not yield Stokes vectors. From the eigenvectors U l' U2 (see
Equations (34) and (35)) corresponding to s = ± 4, we obtain
that the optimum pair of Stokes vectors X and Yare given by
and
where k I and k2 are constants chosen to satisfy the constraints
X' RX = ° and Y 'R Y = 0 .
The resulting values of X and Yare given by
I
2 2 -3 3
X and Y 0 0
ys -0 3 3
which correspond to left-handed and right-handed elliptical
polarizations respectively with a 9-to-4 ratio of the horizontal
relative to the vertical field amplitudes.
CONCLUSIONS
From the above analysis and example it is obvious that the
concept of Stokes vectors and target operators is an extremely
powerful and useful tool in the problem of target discrimina
tion in the presence of background clutter. In [5) Hammers
and MacKinnon developed an operator theoretic approach for
generating probing waveforms to identify target operators. The
Stokes operators and the above analysis can be used to extend
their analysis to include the selection of optimum antenna
polarization for target discrimination and identification.
REFERENCES
[I J Kennaugh, E.M., "Effects on the Type of Polarization on
Echo Characteristics". Final Engr. Rpt. Antenna Lab.,
Ohio State University, Vol. AF contract 28(099) -90
June 1951, AMC3.51, Griffiss AFB, New York.
[2 J Poeiman, AJ., Reconsideration of the target detection
criterioll based on adaptive antenna polarizations.
AGARD Symposium Proceedings, 1976.
[3 J Born, M., and E. Wolf, Principles of Optics. Pergamon
Press, New York, 1965.
[4 J Brauer, F., and J.A. Nohel, Qualitative Theory of
Ordinary Differelltial Equations. W.A. Benjamin, Inc.,
New York, 1968.
[5 J Hammers, D.E., and A.J. MacKinnon, Radar Target
R('wgnitioll. An Operator Theoretic Approach Presented
at Operator Theory of Networks and Systems International
Symposium, Montreal, Canada, August 1975.
BIOGRAPHIES
George A. Ioannidis was born in Athens, Greece on February 14,
1945. He received a B.Eng. degree from Stevens Institute of
139
Technology, Hoboken, N.J., in 1968; M.S. in Electrical Engi
neering in 1971, and PhD. in Electrical Engineering in 1973,
both from Cornell University, Ithaca, N.Y. He has been a
research assistant at the Center of Radiophysics and Space
Research, Cornell University, and at the National Astronomy
and Ionosphere Center in Arecibo, Puerto Rico. He was a
research associate in the Department of Electrical Engineering, .
Cornell University, and the Institute of Geophysics and
Planetary Physics UCLA at Los Angeles. Currently he is senior
design engineer at ITT Gilfillan, Van Nuys, Califor?ia.
Dr. Hammers was born in Chicago, Illinois in 1937. He received
a BS in Engineering in 1960 from Loyola University of
Los Angeles, MSEE from the University of Southern California
in 1963, and PhD. in Engineering in 1973 from the University
of California at Los Angeles, majoring in Systems Optimization
Theory. He joined ITT Gilfillan in 1963 and participated in the
early design and development of a digital Automatic Target
Detection system for the SPS-48 radar. Since then he has
performed systems analysis and design for processing systems
in various 20 and 3D air defense radars, battlefield surveillance
radars including mortar and artillery detecting systems and
Precision Approach Control radars such as the TPN-22 currently
under development. His present position is Director of Systems
Analysis and Programming.
! Ii
ON LIMITATIONS BASED ON PROPERTIES OF THE STURH-LIOUVILLE
OPllliATOnS IN TIlli SYNTIlESIS PHOCEDUIlli OF NC)NeUNIFOml LINBS
Zdzisl.al" ii. Trzasl,a
1{arsal .. Technical. Uni versi ty
IfARSAIY, Pol.and
Abstract
The method for the evaluation of taper functions of a non-uniform l.ine is proposed in this paper. Appl.ication of the Sturl:l-Liouvil.le
op0rator theory permite to calculate the inductance distribution al.ong the llon-uniform lossless line lvi th the prescribed totul capacitance.The desired expressions are obtained by thfJ usc only the N+l eigenvalues of the appropriate operator \dth given boundary condi tions •
1. INTRODUCTION
Recently, synthesis of discret and distri
huted parameter systems has been carried
on throu~h various methods. One of the
pOlverful.l tool.s for the network synthesis
is the differentia1 and inte(..Tal opera tor
theory.
The cenera1 aspects of the app1ications
of the operator theory to the network
synthesis are given in [7,11]. Conditions for the rea1ization a non-uni
for,m parameter dis trihuted netw'ork with
short circuits at its ends have been C-i
ved in the paper i~10]. This method of
synthesis is based on a prescribed imped~
nce function of the network and can be re
garded as an extension of the synthesis
procedure of l.adder networks.
A procedure for synthesizinG' very smooth
l.osslesS non-uniform lines has been
proposed in [9J. The application of this
method is usefu1l. only for 1ines whose
characteristic impedance has very sma11
variations in the propagation direction.
140
Distributions of voltac-es al.onc- a non
uniform 10ssl.ess line using notations of
the Sturm-Liouville operator theory in
the frequency domain' can be expressed as
L V(x,s = s2V(x,s (1) d 2
where L = - di2 + q (x) and q(x) denote
the Sturm-Liouvil.l.e operator and the ta
per function, respectivel.y. The space
coordina te x f (0, l.) is measured from the
beginning of the l.ine. Horeover we denote
by S (a, b), where - 00 <a, b <00, the q spectrum of the line in the case of the
fol.l.owinz boundary
V(O,s)
V (1, s)
condi tions
+ aV (O,s) = 0 x + bVx\l.,s) = 0
Ifhere subindex x denotes the derivative of
the voltage in respect to the space coor
dinate x.
G. Borg has proved in [11 that for two
functions q(x) and q~x) and some values
of a1
and a2
and b the equal.ity Sq(a,b) = S +,b)invol.ves the equal.ity q(x~ :: q~x)
q for Xf(0,1). V.A. Marcenko has extended
in ~ 8] this reaul. ts for the dependence
of the function q x and the values of
ai
and b on the appropriate spectra
Sq(ai,b) for i = 1 and 2. It is well known that the values of diffe
rent spectra Sq(ai,b) for i = 1 and 2
for the same line interlace, e.g. they
form an infinite sequence as follows 2 ·,2 2,2 ( ) sl~sl ~s2 ~s2'" _ 3
As stated in H.Krein papers [5, 6J the
knowledge of two spectra S (a,b) and q Sq{a,b) is sufficient to evaluation the
taper function q (x) and the values a i and b.
However from the practical point of view
only finite number of the values of' the
line spectrum may be taken into account.
Thus a followin~ question appeares. Has
such approximation a great influence on
the obtained results? This problem we can
transforme into following one. How' much
di verge two taper ftUlctiollS ql (x) and
Q2\x) for the same only N+l values of the
appropria te line spectra. I-Ioreover it is
alsO interesting to ans,,,er to the follo
wing question. Are there any limitations
in the procedure for evaluations of the
line parameters?
These problems focuse author's attention
in this paper. \
2. CALCTlLYl':fC,I"- ()F THE TAPBR FUNCTION
In this section a mean for calculations
of' a taper function on the base of N+l
eiGenvalues of an appropriate Sturm-Liou
ville operator f'or the non-unifor'l1 lOSS!
eSS line is demonstrated.
We introduce the follo,V'ing notations.
Let c(x) and d(x) denote h"o continuous
non-decreasing functions of the sp::tce
coordinate x. By u(x)we denote a class of
all operators L for that follmdne; rela
tions are fulfilled )iq(Y~dY ~e(x) , llq'(y)ldY (d(x) (4) o
for 0 ~x " 1. Now we assume operators L j ,,,ith taper
conditions functions qj(x) and boundary
V~(O,s} = 0, VJ(l,S) = 0
vj(o,s) = 0, Vj(l,s) = 0
for j = 1,2. }1oreover ,.e take into account N+ 1 equal
eigenvalues of such two operators L. ,~beJ
re 1 N ~ 711M, H = J[* d(l)+ iqo + 5c
2(1)j l
, ,71 qo = n?ax max q. (x)
J=l,~ xE(o,i) For these operators ",e have the follo,V'ing
estimation of the appropriate t::tper funct
( 8)
where
p(x)
and
141
+ i [2Mx + 2d 1 + Q(x) _ 2c(1) x -~s;tJ!I)l
+ t (x2 + lOx +_lO)C(lB +
+ (10M!. +8c) expl2d(X)] .
The proof of the above relation (S)can be
easy constructed on the base of the theory
H.Krein [5, ~ and given by, among others,
T.Kato [4J. The estimation (S)permits to ev::tluate the
desired taper function, for example, ql~) from the q2(x) and the N+l eic;envalues of
the appropriate sturm-Liouville operators.
To evaluate the desired function q(x) \1e
may use the follo",ine; procedure.
Let be given a sequence of the eigenvulues
uf t,vo operators as stated by relation (3~ Then lV'e may evaluate the so called central
function H (x) as follo,,,V ~l) V 12') H(X) = H(l) + COllst v
2'4")- V:R') (9)
for of (:x ,l)",here v1(x) alld V2(x) are
the solutions of the equations L.V(x) =0 J
(j = 1, 2) wi th the cOlleli tions (5) and (6). solutions of the above equations for v(x) e;i ve the follO\vinG
V (x) = COllS t ( -
Finally \,c obtain
formula), _ -21 cJ}.: x dx
2 e; (x) = V - (x) d vex)
dx2 ( 11)
The limitations in the above procedure
are followinG.
a/ Functions M(x) and their first and se
cond derivatives must be absolutely
continuous.
b/ D " t" dN(x) erl va 1 ve ctX must not vanish.
The central flmction M(x; may be taken
as the inductance distribution alon!3" the
line "'hose capacitance distribution is
I:n01m.
There are also additional limitations for
the central ftmction ~l (x), namely. If for
the given ftmction q(x) = q,(xJconditions
for the Krein's equation are not fulfilled
then the centr::tl function H x Growth to
infinity ""hile the taper ftmction (So to
zero. In the second case the central. feme
tion H Lx) is limited l"hil.e the lenGth of
the the l.in0 tends to infinity.
J. CONCLUSIONS
The conditions for the evaluation of the
Part I. IEEE Trans. on CT, vol. CT-14
1967 , No 4, pp.394-408
4. T.Kato: Perturbation Theory fo~ Linear
Operators. Sprine;er, Berlin, 1976
5. M.G. Krein: Solution of the Inverse
sturm6Liouville Problem. Dokl. Akad.
Nauk SSSR, vol..76 1959, pp.21-24
6. M.G. Krein: On a method of effective
solution of an inverse botmdary prob
lem. ibid., vol.94 1954 , pp.987-990
7. N. Levan: Operator Theory of Network
Synthesis. Proc. 1974 huropean Conf.
on CTD, London, 1974, pp. 234-238
8. V.A. Marcenko: Concerning the theory
of a differential operators of the
second order. Dokl. Akad. Nauk SSSR,
vol. 72 1950 , pp. 457-460
9. S. Ridella: On a variational synthe
sis of lossless non-uniform lines. Al
ta Frequenza, vol. 40 1971 , No 6, pp.
527-533
10. E.N. Protonotarios and O. Wing: Theory
of NOn-tmiform Lines. IEEE Trans. on
CT, vol. CT-14 1967 No 1, pp.2-20
~rbitrary t~per function for a non-tmiforln 11. R. Saeks: Synthesis of general linear
line are shown in the paper. The expre- Network. SIAM Journ. Appl.ted Mathem.,
ssions for the error estimation of tho ea- vol.16, 1968 , NoS, pp. 924-930
lculated taper ftmction on the base of
the N+1 eir,-cnvalues of the appropriate
Sturm-Liouville operntore are given. So
I!!C kinrl of' lin:i te<. tions in the presented
~lrocudere ure demonstrated. The l!lore de-.
taill.ed investi(Sations are tmder develo
ppmcnts.
1. G. Llorg: Bine Uml<hertmG dor Sturm
Liovvilleschen 1!:icen"ertaufeabe .. \cta
Hath., vol. '78 19 1f6 , pp. 1-96
2. N. Dunford and J. T. Sch.,ar tz: Lincrrr
Oper::.tnrs. Part 2. Inter. Publ., New
forl~, 1963
3. D.S. Eeim and C.B. Sharpe: The synth~
sis of Non-uniform Lines of Finite Le n.;th.
142
12. Z.Trzaska: Use of the Sturm-Liouville
operators to the study transient phe
nomena in a non-uniform long transmi
ssion 1.ine. Proc. 1974 Inter. Confer.
CTD, London, 1974, pp. 199-204
13. !-t.R. ,vohlers: A Realizability Theory
for Smooth Lossles Transmission Lines.
IEEE Trans. on CT, vol. CT-1 J 1966, No
4, pp.J56-368
Zdzislaw TRZASKA 1YaS born in Nagnajow,
Poland, on Aue;ust 10,1939. He received
the dee;ree in Blectrical Engineering
in 1970 from the Technical University
of \warsaw. In 1963 he joined Technical.
Universi ty of War sa .. , 1vhere he is pre
sently Director of the Institute of the TIIeory of Electrical Engineering.
A THEORY OF BEST APPROXIMATION OF NONLINEAR FUNCTIONALS
AND OPERATORS BY VOLTERRA EXPANSIONS
L. V. Zyla and R. P. de Figueiredo Dept. of Mathematical Sciences
Rice University Houston, Texas 77001
ABSTRACT
Let H be a real Hilbert space and Fp(H), p > 0, denote the Fock space of order p associated with H. Let T be a nonlinear operator from H into the Sobolev space Hl(i) of real functions on an interval I of the real line. T is such that the functional u+(Tu) (t) , u E H, is an element of Fp(H) for every t E I. Finally, assume that 1 IT(t) 1 1 Fp(H) is Lebesgue measurable as a function of t and
£1 IT(t) 1 12FP (H)dt < 00. The space of such operators is known as the
Bochner space B2 (I,Fp(H)) of Hilbert valued functions on I.
T is viewed as the input-output map of a general nonlinear dynamical system and a specific representation for T is in terms of a Volterra expansion. A particular class of operators of this type arise from differential equations of the form x(t)=f(x,t)+u(t)·g(x,t), x(O) = xO·
The approximation problem we consider is the reconstruction of the output x of T based on the set of observations {~l(u), ~2(u), ... ,
~ (u)} E Rn on the input u (x = TU).- The functional ~l' ... ,~ n n
are not necessarily linear but we do assume that ~lEFp(H), i = 1,
... , n. Using the Hilbert space structures of B2 (I;Fp(H)) and
Fp(H) we derive approximations of T based on ~l' .•. '~n which are
optimal with respect to various criteria, and obtain results for "best" choice of the reconstruction subspace for x in Hl(I) as well as the optimal choice of observation functionals ~l' .•. , ~ for a nonlinear T. n
Supported by the NSF Grant ENG 74-17955.
143
, , h
CONTINUED FRACTION DESCRIBING FUNCTIONS FOR
BILINEAR AND MULTIPLICATIVE NONLINEAR SYSTEMS
C.F. Chen Electrical Engineering Department University of Houston Houston, Texas 77004
R.E. Yates U.S. Army Missile Research and
Development Command Redstone Arsenal, Alabama 35809
Abstract
A harmonic balance method is developed for finding the describing functions of bilinear and multiplicative nonlinear systems. The results belong to the classeof continued fraction approximations which are flexible and easy to use. We can truncate the function as we like to obtain various appropriate describing functions from very crude to very accurate ones.
1. I NTRODUCTI ON
When a nonlinear element Is slngle-v.lued t the Input-output relationship can be expressed an.lytlcal IV In the following ways.
(a) PolynomIal
(b) PIecewIse llne.r
(c) Transcendental functions.
To obtain the descrIbing function of such a nonl1nearlty. one needs to evaluate an Integral whIch determines the fundamental coefficients of a Fourier series. The. derivations of certain descrlbt[l9 functions 'rom the piecewise linear representation are wet I known [ IJ. Those derived from the polynomIal expressIon whIch represents the non I Inearlty Ire In the form of garrwna functlons[2]. When the nonlInearity Is expressed by .. transcendental function, for example, a segment of a sine wave for the saturatIon nonlInear ease, the correspondIng descr.blng function formula 15 In terms of Besset functlons[31.
There Is a class of nont Inearltles which Is encountered frequently, but has no known expllclt describing functIon shown In the literature. ThIs class Is bilinear In particular end multiplicative nonlinear In general.
We wJll develop a harmonic balance method for finding the describIng functions for bilinear and multlpl ieatlve nonl1neer elements.
2. DERIVATION OF THE GENERAL FORMULA
ConsIder the multiplicative system shown 'n FIgure 1.
FIGURE 1. EXAIIPLE IIULTlPL1CATlVE SYSTEM
144
It 1s characterIzed by the dIfferentIal .quotlon[~l:
(1)
where xO(t} and XI (t) are the output and Input of the system respec
tIvely, 0 - ~ ,and fl (0) ~ I - " .••• 4 are polynomial or r.tlonal
functions of D. Assume that the system has dead InItIal states (zero
InItIal conditIons) or Dl xo(t) .0, for 1 • I, 2, 3, .... If F~(D) ~ 0,
Eq. (1) can be sImplIfied as
(2)
where
(3)
end
(~)
Assume that the Impulse responses of Fp(D) end FlfD) are contained In l2. Taking the Fourier transfom of Eq. (l) and applv1nq the frequency convolution theorem to the product term xO(t} x,(t}. we have
where
Xo("') • F[xoft))
XI ("') • Frx i (t)]
let us assume that the Input Signal Is given by
(5)
(6)
(7)
(8)
The 5teady state Input Is
XI(t) -"0 +"1 cos At
Assume that the output signal In the steady state Is
The corresponding Fourier transforms of Eqs. (9) and (10) are
and
From Eqs. (11) and (12). we evaluate the complex convolution term In Eq. (5):
-J+ , + e n{~[w + (n + I)A) + ~[w + ( n - I)A]}J
Substituting Eqs. (II), (12) and (13) Into Eq. (5), we obtain
where
c -. H* 6{w + A) + • ~ H 6 (w + nA) n n ~ n
and
Define
and from Eqs. (15), (18), (19) and (ZO), we have
(10)
(II)
(12)
(13)
•
(14)
(15)
(16)
(17)
(18)
(19)
(zo)
(zl)
(ZZ)
(23)
145
(Z4)
Eq. (24) can be rewri tten as
(ZS)
If Bn ~ 0, we have
(Z6)
Eq. (26) can be expressed In continued fraction form:
_8n __ 1 _ ~ F UnA) - a - _____ ..>..!...L_ ----,;---- (Z7)
[ (;4 1 8n a l 0 0 ( \ Z _. _ f., Fo[{J(n+I)A] 0 FO[j(n+Z)A] - a
O
. From Eq. (Z3) we have
81 Therefore, the describing function defined by N ., - 15 given by
"I
"I _ a _ r
o 81 S;
Substituting Eq. (ZO) for n - Z Into Eq. (30), we get
bO + FZ(jA) N---------~--~-______ _
G(jA) __________ ~h'_Z ________ _
hZ G(Zj.) - ---------'~-------
hZ
G(Jl·) - -----....!!..-----
where
(z8)
(Z9)
(30)
a l h -"2 (33)
Slnee the DC term bO is unknown and must be determined from Eqs. (22) and .(31). we separate N as follows:
where
and
N - N N " D
ND ----------~---------hZ
G(l') - --------"'--------
hZ G(ZjA) - -----!!------
hZ G(3l·) - -----....!!..----
(3~)
I3S)
Bl By definition N .. - , therefore
a l
Let uS def i ne
where FIR' FII
, NDR
, Npi are all real functions of L Ther. from Eqs.
(34) and (37) we have
and therefore
or
Eqs. (41), (1,2) and (43) are the general forms of the DC term and the describing functions of the muhlplicative system.
3. CDNSIDERATIONS OF SOME SPECIAL CASES
Ca) ',The Input signal h.5 no DC component or
ao - 0
then
and
(b) FO(O) - 0:
b __ 2aO FI(O) + a~[NOR FIR - HOI F II ]
o 2 2.0 + at NOR
(c) 80th.O - 0 .nd Fo(O) - 0:
(37)
(40)
(4t)
(42)
(43)
(44)
(45)
(46)
(48)
146
We note that:
(a) When FO(O) - 0 and F2(j),) Is frequency Independent (a constant or zero), then FlU).) = FI(O) - FIR and FII = O. From Eqs. (46) and (47) we have bO ... -F, and N .. 0 which is a trivial case.
(b) When Fj (j).) Is a real function of ). (for example, F, (0) _ 02) then aO and FO(O can not both be zero. Otherwise, from Eq. (49), N .. 0 which is also is trivial case,
(c) N is frequency dependent in general, as Is bOo For the case
where both aO and FO(O) are zero the phase angle of N Is always r [from
Eq. (49) J.
4. ILLUSTRATIVE EXAMPLE
Consider the servo-system with multiplicative feedback as shown In FIgure 2. let us evaluate the descrIbing functIon of the part of the system in the dashed line bloc.k tn FIgure 2.
FIGURE 2. SERVO-SYSTEM WITH MULTIPLICATIVE FEEDBACK
The input-output relation of that part of the system Inside the dashed 1 Tnes Is given by:
Assume that the Input signal has no DC part or aO - 0, then
FI (jw) - ~[w2 + 40 + tSJwJ w + 400
From Eqs. (4S) and (49), we have
10K NOI 2 bO - ---[tSw - - w - 40]
w2 + 400 NOR
.nd
where NO' NOR' and NOI are defined by Eqs. (36) and (39).
N T and bO
versus frequency ware shown In Figures 3 and 4 respec-
tIvely. Some selected data for the non 1 'near system and the corresponding values from Eqs. (53) and (54) are shown for compartson fn Table I. The results are quIte satTsfactory.
w
1.0
1.5
2.0
2.5
TABLE I
COt\PARISON OF DESCRIBING FUNCTION FREQUENCY RESPONSE AND NONLINEAR ANALYTICAL RESULTS FOR a l - 1.0
N[Eq. 54] N [Ana t yt I cal J bO[Eq. 53J bO[Analytlcal]
INI L!! INI L!! 0.37 -90" 0.37 -90" -0.74 -0.75
0.24 -90" 0.24 -90" -0.S9 -0.69
O. tS -90· O.IS -90· -1.09 -t .09
o. t~ -90· O.I~ -90· -1. 3~ -t .22
(50)
(51)
(52)
(53)
(54)
•
50
~O
-'00
1r
.
0.02 0.05 0.' 0.' 0.5 '.0
--"---
~
.; .; 0 0
· · · . • · • •
~ (
I-- (
\
.; 0 0 .; . · · .
• • · • I
FIGURE 3. DESCRIBING FUNCTION FREqUENCY RESPONSE FOR VAR I OUS VALUES OF I NruT SIGNAL
0.02 0,05 0.' 0.' 0.5
--"----5
t·· ~
J 0 0
.; .; . · . . • • • .-•
•
\ \ 0
" t-- ~ ~ -.
1\ -,
-J
-. .; .; 0
-)0 . · . . .- .,?
-)5 1'-
'.0
FIGURE~. DC COEFFICIENT FOR VARIOUS VALUES OF INPUT SIGNAL
REFERENCES
[1] J.E. Gibson, "Honllnear Control System,lI "'cGraw Hili Co., 1965.
[2] R. Sridhar. "A General Method for Deriving The Describing Functions for A Certain Class of Honllnearitles," IRE Trans. on A.C •• Vol. AC-5, 1960, pp. 135-141.
[3] L. A. Pipes, "Applied "'athematlcs for Engineers and Physiclsts,1I Hew York: McGraw Hill Book Co., 1946, p. 321.
[J.,] R. lanber, "A New Method to Derive The Describing Function by Certain Nonlinear Transfer Systems,1I IFAC, 1964, p. 14.
{51 R.J. Kochenburger, "A Frequency Response Method for Analyzing and Symbolizing Contactor Serovmechanlsms,I' Trans. AlEE, Vol. 69, Part I, 1950, pp. 270-284.
[6] L.C. Goldfarh, liOn Some Nonlfnear Phenomena in Regulating Systems," Automation and Remote Control, Vol. 8, 1947, pp. 349-J8)i English Translation, National Bureau of Standards, Washington, D.C., 1952.
[71 Chen, C.F. and I.J. Haas, tlAn Extension of Oppelt CrTterion,lI IEEE Transaction on Automatic Control. 1965.
[8] J.E. Gibson, "Nonlinear Control System," McGraw Hill Co. 1965.
[9] J."'. Leoh, "Frequency Response,lI Edited by R. Oldenburger, New York: Hacmlllan, 1956, pp. 260-268.
[10] N. Minorsky, "Theory of NonlInear Control Systems," New York: McGraw HI", 1969, pp. 64-70.
C.F. Chen has been Professor of Electrical
Engineering, University of Houston, Houston,
Texas since 1966. He receIved the Ph.D. Degree
from Cambridge UnIversIty. He co-authored a
147
book on Control Systems AnalysIs and has authored
more than 50 papers on IdentIficatIon, stabIlity
and model reductIon. He Is a consultant for the
U.S. Army MIssile Research and Development Com
mand, Redstone Arsenal, Alabama.
Robert E. Yates was born In CecIlIa, Kentucky
In 1936. He receIved the 8.S.E.E. and M.S.E.E •
Degrees from the University of Tennessee In 1960
and 1963 respectIvely. He receIved the Ph.D •
Degree In Electrical EngineerIng from the
UnIversIty of Houston In 1972. He has worked In
the analysis and desIgn of control systems for
IndustrIal and mIlItary systems sInce 1960. He
Is presently a Research Aerospace EngIneer wIth
the U.S. Army MIssIle Research and Development
Command at Redstone Arsenal, Alabama where he
Is presently Involved In large scale simula
tIons of termInally guIded tactIcal mIssIles.
NONLINEAR ANALYSIS OF GYRATOR NETWORKS:
A NUMERICAL EXAMPLE
Miles A. Smither and Manjula B. Waldron Electrical Engineering Department
University of Houston Houston. Texas 77004
Abstract Gyrator circuits have traditionally been analysed with linear transistor models. Such an analysis cannot uncover the bias constraints inherent in a given circuit topology. A numerical analysis based on the equations of canonical form nonlinear networks is described for a simple gyrator circuit which clearly shows the operating constraints imposed by the bias requirements.
1. INTRODUCTION
Several gyrator networks have been proposed (1). (2) which demonstrate gyrator properties when analyzed with linear transistor models. The linear models are obtained at the expense of bias information and are. therefore. incapable of detecting the restrictions which transistor nonlinearities and bias requirements place on the range of useful port variables. This paper treats a simple gyrator network (3) using the nonlinear Ebers-Moll transistor model (4) and establishes the allowable operating range for the circuit. As a result of the analysis. an alternative circuit which is useful over a wider range of port variables is proposed.
2 • BACKGROUND
Lin~ar Analysis. Integrable gyrator circuits have been studied in detail using various linear transistor models (5).(6). Figure 1 shows a proposed gyrator circuit (3) which will be analysed in this paper. If the transistors are replaced with linear models. the following relationships are obtained (for Rl =R2):
il~vl/hfe2Rl + v2/Ra
i2~-vl/~ +v2(1/Rc + l/hfelRa )·
This can be written as R -1
a -1 (hfel R)
which. to the extent that Ra «hfe 2Rl •
~ «hfel Ra II Rc
148
and
Ra=~ describes the operation of a gyrator.
The linear analysis cannot show the restrictions which a particular circuit topology places on the port variables. These restrictions can be quite severe. For example. in the circuit of Figure 1 the collector-base junction of T requires that Vl~V2 at all times. Further restrictions are uncovered with the nonlinear analysis.
Nonlinear canonical form networks. Equations based on nonlinear canonical form networks have been developed recently (7).(8). These require that the circuit be treated in two separate parts. linear and nonlinear. The circuit in Figure 1 is in canonical form where all of the nonlinear elements are to the left of the dotted line. Such networks can be represented in the form (9)
A F(v) + 1!. v = C. (1) In eq~atiO'n (1),- -
!!, ~ = -,Q,,!, + ~ T T (2) where ~=(vl.v2 •••.• v6) • .!. = (il .i2 •.•.• i 6) are the port variables. C describes the sources within the linear portion. and matrices!!, and ,Q, describe the linear network. The transistors are described in the !E(~) term. The port relations to the left of the dotted line are
i = I. F (v) (3) with I a blO'ck-diagonal matrix of 3 (one for each transistor) 2X2 blocks (which result from the cross
OOUPl1~fi:~:r' Eb.,,-.,11 mod.l)
where the a's are the reverse and forward o's in the transistor model and E(~) has the form
T F(y)=(fl(vl),f2(v2),···,f6(v6» .
The functions fl, f 2 , •.. ,f6 contain the nonline
arities, the re ationship oetween the transistor terminal voltages and currents.
Combining equations (2) and (3) results in ~ y = -,Q, I I(Y) +.f
or equation (1) with h, = ,Q, I·
3. NUMERICAL ANALYSIS
Equation (1) can be solved for the v vector as -1
Y = ~ {.f - h,I (y)}
which allows an iterative solution to the network equations. The gyrator circuit operating point is found in this manner for each combination of port variables (Vl,VZ) studied. The following parameters were used for the numerical analysis:
Collector saturation current ICBS
=5pA
<1 =.5 r
<1f =·004
V =15V cc Ra=~=lKn
Rc=Rl=R2=10Kn
Note that the emitter saturation current is obtained from (4)
IEBS=ICBS Clr/Cl f · Figure 2 shows the results obtained by setting V2=13 volts and sweeping VI from 5.5 to 10 volts. Tfie figure illustrates a severe bias constraint at V1",7.8 volts. At this level T2 saturates, changing tfie sign of the response of I Z as a function of VI and rendering the circuit useless as a gyrator. For Vl <7.8 volts the circuit is properly biased and behaves as expected. The slope of 12 indicates a gyration impedance of -ll06n, a reasonable result for the circuit values assumed. The response of 11 to VI (for Vl <7.8V) is very weak as expected.
Similar results were obtained by setting Vl =7 volts and sweeping V2 between 7 and 13 volts. Over this range the slope of 11 indicates a gyration impedance of 1014n. I as a function of V is nonlinear in the vicinity ot V =7 volts, a result of the zero collector-base Eias voltage for Tl . Over the remaining range of V2 ' 12 indicates an undesired input impedance at port I of 8.83K due, of course, mainly to Rc'
4. CONCLUSION
This paper has demonstrated that it is relatively straightforward to cast a given transistor circuit into a form which is amenable to numerical solution. The advantage of this type of analysis is the complete characterization of the circuit, the results relating to both bias considerations and small signal performance. The small signal parameters at the ports of the gyrator circuit can be directly calculated from the numerical results. Of greater
J49
practical importance, however, is the ready identification of allowed operating regions for the circuit. For the circuit of Figure 1 it is possible to identify bias constraints by inspection. For more complex circuits a complete nonlinear circuit evaluation is essential. The use of canonical form equations is one method of obtaining such a complete evaluation. By considering the bias constraints of the circuit in Figure 1 a circuit with much wider operating margins can be obtained. The circuit of Figure 3 has several advantages as a straight gyrator circuit. The inputs can operate around ground allowing direct signal coupling if desired. The signal level at one port has no effect on the level requirements at the other port. It is clear that for Rl=R2=R3/2 and for the usual approximations for the transIstors as current mirrors (T3-T
5, T4-T6) and current sources
(T7-T
8-T
9) the operafion of the circuit is that of
a gyrator over very wide ranges of e l and e2 . The numerical analysis of this circuit could be handled with the canonical form network equations or with a special purpose computer program such as SCEPTRE (10). The special purpose program has the obvious advantage of relative ease of use but cannot provide the insight into the circuit that comes from developing the equations as outlined above.
5 • REFERENCES
1. W. Heinlein and H. Holmes, Active Filters for Integrated Circuits, New York, New York:' SpringerVerlag, 1974, p. 306.
2. J. A. Miller and R. W. Newcomb, An Annotated Bibliography on Gyrators in Network Theory, R-72-01, University of Maryland.
3. W. New and R. W. Newcomb, "An Integrable TimeVariable Gyrator," Proceedings of the IEEE (Correspondence), Vol. 53, pp. 2161-2162, Dec. 1965.
4. W. Heinlein and H. Holmes, ~ cit., pp. 236-237.
5. B. D. O. Anderson, D. A. Spaulding and R. W. Newcomb, "Useful Time-Variable Circuit-Element Equivalences," Electronics Letters, Vol. 1, No.3, pp. 56-57, May 1965.
6. B. D. O. Anderson and R. W. Newcomb, "A Capacitor-Transformer Gyrator Realization," Proceedings of the IEEE, Vol. 53, No. 10, p. 1640, Oct. 1965.
7. 1. W. Sandberg and A. N. Willson, Jr., "Some Theorems on Properties of DC Equations of Nonlinear Networks," Bell System Technical Journal, Vol. 48, pp. 1-34, Jan. 1969.
8. A. N. Willson, Jr., "New Theorems on the Equations of Nonlinear DC Transistor Networks," Bell System Technical Journal, Vol. 49, pp. 1713-1738, Oct. 1970.
9. A. N. Willson, Jr., "Some Aspects of the Theory of Nonlinear Networks," Proceedings of the IEEE, Vol. 61, pp. 1092-1113, Aug. 1973.
10. H. W. Mathers, S. R. Sedore, and J. R. Sents, "Automated Digital Computer Program for Determining Responses of Electronic Circuits to Transient Nuclear Radiation (SCEPTRE)," IBM Space Guidance Center, Oswego, N. Y., IBM File 66-928-611, Feb. 1967.
6. BIOGRAPHIES
Miles A. Smither received the BSEE (cum laude) and MSEE degress from the University of Houston in 1967 and 1968 respectively. From 1968 to 1975 he worked for General Electric and Geosource Incorporated. Since 1975 he has been an instructor in the Electrical Engineering Department at the University of Houston and a solid state circuit consultant to local industry. Mr. Smither is a PhD candidate at the University of Houston and is a member of the IEEE, Sigma Xi, Tau Beta Pi, Eta Kappa Nu, and Phi Kappa phi. His current research is in the area of electron beam-material interaction.
Manju1a B. Waldron received her B.Sc(Hous) in Physics and B.E. degrees in India and M.S. and Ph. D. degrees in Electrical Engineering from Stanford University, CA in 1962, 1965, 1968 and 1971 respectively. From 1970 to 1974 she was a lecturer in the School of Electrical Engineering at the University of New South Wales, Australia. She joined the faculty at the University of Houston in 1975. Dr. Waldron is a member of IEEE, Sigma Xi, and *SEE. Her research interests in electrical engineering are Nonlinear systems modeling, speech and psycholinguistic processes of the deaf, bioengineering and engineering and engineering education for women and the hearing handicapped.
150
R2 ~+---~+~--1--------v~~+VCC
151
+Figure 1. A Simple Gyrator Circuit in Canonical Form.
<l 9 <l E E N ... ...
8 -1.0
7 -2.0
Vz= 13 VOLTS 6 L-__ -'--__ --'-__ -'-_..=......_---''------.J - 3.0
5 6 7 8 9 10 II VI (VOLTS)
Figure 2. Nonlinear Analysis Results from the Circuit of Figure 1.
Figure 3. Proposed Gyrator Circuit with Relaxed Bias Constraints.
,I
I'
i'
,I, ,i
LIE S:ERlE3 AND THE roH:ER SYSTEl·; SUmLITY FROBLOO
R. K. Bansal Assistant Professor Electrical Engineering Departnent LA.U., Ludhiana (India)
R. Subramanian Assistant Professor Elect. EnGg. Department I.r.T., Kanpur (Irxiia)
Abstract
The moethd of Lie-series and pattern recoCnition technique are used to estimate the stability domains for po./er system problems. Feasibility of the proposoo method is demonstrated by considering wo munerice.l exar,1pl.es.
1. urmODUCTION
Several approaches have been proposEd to determine
the stability domain for power system problems
during the last decade. Amonr; these, the method
aimed towards a solution of Zubov's partial
differential equation has been considera:l recen~
by Yu & Vongsurya[n and DeSarkar and DharllIl
Rao [2 J. Theoretica]ly the Zubov' s method always
yields a Iuapumv function and if a closed form
of solution for the Zubov's partial differential
Equation can be obtained, the stability domains
are exact. However, in general, it is not possible
to find a closoo form solution. Hence a truncated
series form of solution is us(xi which leads to an
approximate stability boundary. The technique
used for the solution, utilizes a power series
method which does not have uniform convergence
and a large munber of terms are generally
necessary before one obtains the final solution.
A recent work by furnand and Sarlos[3] suggects a
nSoi method of solving the Zubov' s partial
differential equation by the use of Lie-series.
In this paper, the application of this method to
power system stability problems has been
investigated. PO"Ter system problems consisting of
a single ~chine connectoo to an infinite rus
(Wi th governer and flux decay effects includa:l)
152
have been investigated am the results obtainoo
are quite encouraeing.
2. LIE SIDlES TH:lr.:1QUE APPLICATION
2.1 NA.TlIE.:ATICAL BAcJ:anomm Lie series solution for the differential Equations
are essentially the power-series solutions and the
expansion co-ei'ficimts are derived from alrebraic
recl:rsion relations.
The Lie series is definoo as
exlta) x = ~ ... (1) - k....-Q
where ~ is, in general, a vector variable and D
is a linear operator defined by
I' a D = E 9.(x1, Xz, ••• , x ) -,,---
j= 1 J r x J
... (2)
where Q _ are an~tic functions of the variahle x J j
The solution to the differnntial equation
~ = f(~) is then obtained as:
?!(t) = ex:/ tD) ~(O)
2.2 Z1.~OV' S PARTIAL DI:r'FEnE:rrIAL lllUATION SOLU'ITON
... (3)
• .. (1,.)
Zubov's partial differential equation is r;iven
hy:
Lie_series technique can be a:)rJlied to obtain the
solution of this equation as follows:
Define a function ~(~) such that
V(~) = 1 - e'vi(~) ••• (6)
which implies that
~(~) = ct (!) ••• (7)
Thus 0 ~ V(~) (1 correspoIrls to 0 ~ w(~) > - CD,
and t-J(~!) = - mat the stability boundary.
Equation (3) and (7) can now be considered
simultaneously and the sol.utions ~(t) and Vl(~(t»
can be obtained by recursive computation.
The solutions H(x(t» and x(t) are expresse:l as: tD - -
w(x( t» = e W(~(O» ••• (8)
••• (9)
The operator D for this system is
D = £(~)"'lx + + (~) : w ••• (10)
3. Bi. TI' Elm RECOGl;ITION CONCEPTS
Pattern recognition algorithms are use:l to
generate an analytical expression for the hyper
surface that se~retes two sets of points in the
Iilase space. One set of points is obtainoo. f'rom
recursive apIllication of Li&-series technique
indice.ted earlier. This set reprfo.-Sents ( for
sufficien~ large negative W) a close approxi
mation to the actual boundary of the recion of
attraction. A seconi set of points is obtained
by arbi trarily choosing a small constant radial
dis~canent from the first set towards the stable
equilibrium point (or towards interiol' of
stabili ty domain). The s eparcting surface, in
effect, acts as a decision sur.f£.ce and provides
a guaranteEd region of stability.
The decision surface can be approximated by an ~ degree polynomial F(~) as follows:
F(!)= CO+P1(~)+ P2(!)+ ••• + Pr(~) .•• (11)
ware P. (x) is the :i:th degree homot>"eneous
1. - '" polynomial of ~.
The problem involved is in ccmpu~inC the constants
fer the pol:rnomials, which depend ulJOn the ~ber
of pattern classes, di."'lensions of pcttern s~ce
ani the decree or polynomial F(~).
These constants are evaluated by usinc Ho-Y.ashyap
algori thIJ, details of which can be found in [4].
/+. l!XiI.1TLES
(a) A third order system repreJSentine a lll[,chine
with const.ant dat1pin[ and a ve10ci ty governor will
be inv6stigated. The suill[: tquation under post
fault conditions is givE-n by [0]:
d\ ib dt3 + .943 dt2 + (. 714+cos6 ) ~~ + .667 sin 6
-.196=0
The singularities of interest are
••• (12) ,
1) Stable(focus-node) b =17.1°, b= i""= 0
2) Unstable(focus-node) b =162.90 , b= b= 0
After shiftinc the stable Sin...,'·lUllrit:· to the
origin through the coordinate transformation
° x1 = b -17.1. The state variable form of
description is
*1 = x2 *2 = x3 13 = - .943 x3 -(.714 +cos(x1+ 17.1°» x2
-.667 sin (x1+'7.10
) + .196 ••• (13)
Dharma P.ao [7J had observed while studying a
second order SystOOl that the fault trajE:Ctory in
genaral is r(':stricted to the first two quadrants
of the ~-X:2 luane. It was accordinGly felt that
for the considera::! third order system, it may
prove sufficient to obtain that segment of the
decision surface in the x,-x.z-x3 f-h.:'.se space w'··ich
lies in hcl.f s~ce X:2 > 0, x1-~ arbitrary. This
feature allor-,s si~:nificant reduction in computation
time arxl is to be advoc.::.ted especially for higher
order systems. Initial values of xl'x2,x3 woro
chosen to be (1450~ 0,0). The Lie seri~~
integration routine was EllCecu ted and various data.
points obtdnoo. at intervals of At = 0.1. From
the various solution points, twenty equispaced
points were selected in the recion of interest and
153
thes e were considered r;!erhcrs of clc.ss '1'.
'·(;flbors of (!lass '2 1 were grneratErl by a
displ",:anent {) = .03. r
A second deerE·e polynomial ~TaS considerErl as a
rossiblc dt.'Cision surface and the r(JSc'.lting
decision surf£'.ce obtained is:
F(~) = -67.53+39.89 x1+05.C6 x2+52.1~ Xj-5.) x~
-17.42 Xzx1-12.13 x1x)-9.8) ~
-11 • 13 Xzx) -1 • 88 x~ = 0 ••• (14.)
:.'he critical clearing tiJTle obt..?inErl ~J
in~~ecratin.z the caul te:: systCL1 equation and
notinG time for first crossire F(~) = 0 provides
a value of 11 .55 unit:). 'rIns compe.res very well
with the exact vclu(l of 11.65 units [b].
(?) flux decay effects and variable danpin;:: in
the synchronous machine are considerod. ilie
machine e:rua tions are [2] :
2 E' V ,f. (x -xci) 1:.!i1> = F.- ....9.-Sin6 + q sin 26
dt2 ~ xd l 2xd' Xq
-DC 0) £.Q.. dt
dZ E E ("c - x ) allli -s t = ~ _ ~ + • d d 1 V cosb ••• ( 15)
. dt TdO I Td , xd TdO'
,.here v<:criables J';, Pi etc Imve usual meaning and
are definErl as given in [2J. liurnerical data arH
borrowt.'C! frov{2 J and the state variable fom of
eqllations are
Xl = Xz *z = 2r.b1-8/;.99CL18+x
3) sin(x'+'/'Z8)
+2'.5)Sin2(X1+.478)-Xz("O~ sin~(xl+.478) +.G~2cos (x
1+.478))
*3= .36-.621(1.18+X;) +./;21 cos(x1+.478) ••• (16)
Equations(16) have been obtainro after shifting
the stable sineulari ty locf'. tcrl at b =27 .L;.o ,6 = 0,
E , =1.1(; to the ori{!in. The unstable q
sin[.w..ari ty is loca too at c= 13)°, ~ = 0,
E =.115. '!he Lie series solution is 0 1
c~nerated using initial values of x1 'Xz,~ as
(104.2°, 0, -1.061). It was noticed, however,
that for this example, while iL1IiLementina the
TIo-Yaslwap algorithm, convereence faile:l to occur.
:. possible reason coulrl be attri buted w the
nature of the eenerated dI.. ta points. In viEM of
154
this dirficd t.y it was decided to obtain wo
decision ='faces, one in the x1-Xz plane and the
other in the Xl-X) plane. Second decree
polynomials were considered as decision stu'flces
and the dec:i.sion surfB.ce obtained in the x1-Xz
plane is
Fl(~)= -172.3-76.5 x1 + 1.02 Xz + 94.69 x;
2 + 5.94 xl~ + 1.1,15 Xz = 0 ••• (17)
ilie decision surface in the x,-x3
plane is
Fz(~)= 55.23-15.92 Xl + 207.12 x3+4.95 x~
-67.04 x,x3 + 43.48 ~ = 0 ••• (18)
futh Fl and F2 define decision surfaces that are
confinOO w a two dimensional plane and the
follOWinG procedure was adopted to determine the
cri ti cal cl earing time.
step 1 : The faultErl system is intecratoo and the
set of points (x1,x2,x
3) defirJ.ng the
faulte:d trc,jectoI"J are obtained.
step 2 : At rech tiro.c instant t=k ~t, the value
of F 1 and F2 {!iven by(17), (If:) are
comp.lt 00.
step 3 The faulted traj ector.r integeration is
terminated as soon as either of F1
" or
F2 becomes zero.
Step II': The particulnr time instant t = k Llt
obtained in step 3 furnishes an estimate
of cri tical clearine time.
The value of the cri tical clearing time thus
obtainOO Was 0.28 seconds. This is com~rable
wi th the exact value of 0.32 seconds [b J . 5. CONCV':-SIOH
~~oosibili ty of application of Lie series and
pattern recogni tion me~hods have been
deI!Dnstr: ted for some third ordEr power system
problems. The stability regions compare
favourably as evidenced ~J COl'lIXlrison of critical
clearine title wit:. the exact· ... ' values.
b. REFi-RBICES
Y.lI.Yu and E. VOllo""Surya, "1:onlina:r power system stability suty by J~lapunov i'clnction and Zubov's method", IEEE Trans. PAS,vol.<:6 1~.12, pp 1420-14£5.
1 I
2 De-Sarkar, AS. and Dharma Rao,n. "Zubov's r.le~;hod and trnlSient stability problems of power systems", Proc. lEE, 1971, 118(8). pp 1035-1040 •
.3 :Jurnand,G. and Sarloz , G., "Determination of the domain of stability" Journal. of Hath antlysis ond ap:)lication, 2.3, 71/v-722, 1968.
4 funsal, R.I.. and Subramanian,R. ,"stability analysis of power systcr:JS nsing Lie series and ~ttern recor,nition techniques" Prof.lEE.1974, 121, (7), pp b23-629.
5 Li,C.C. and [orD1an:i.~"J., "Decision surface estiwate of no~lincar system stability domain by Lie-series I!Icthod", l~:E TJans. AG-17 1972, ;:n 66&-608.
b Tl..'1rlSal, H.r.,"EBticw.tion of stability domains for for the transient stability investigation of ?JCJWCI' system", i'h.D. ThesiS, Etectrical EnGZ· Daptt, l.l.T. }:~mpur, Aucust 1975.
7 Dharma Hac, [/., "Generation of I,yapunov functiOns for the transient stability problem" tranS. ll1(:C. Instt. of Can da, 19l1S, C-.3 (11).
7. BIOGRI.FRY
Dr. R. K. j}.nsa.l cO!:i,lleted his S.E. and ; .. E.
e in £].ectrical Engineering fro!:\ Birla Institute
,)f Tcchnolo{!y and Science, Pilani, India in 1909
and 1971 respectively. COI:Jplctai his Ph.D.
in Electrical Engineer1ng fror.l Indian Institute
of TechnoloCY, ICanpur, India in 1977. Presently
workine as j,ssistant ?rofessor of Electrical
Eh&im:erine at P.A.U. Ludhiana, lIdia. l-ain
aren of interest is application of system theory
techniqulll to power systB':1 problems.
155
11
Ii I:
, I
:il ,
-- i·
tl.b.Kekre, li..I.:.t.umar and .Il.h.~rivastavR
Co:nputer Centre
Indian Instttute of ~echnology, .oombay, India
A varyinL efficnency rnultiberver queue model has b:en ana.ly::>ed under l'~rkovian absumptions for service time and arrIval pattern. 1quution for probability den~ity func~ion of the waiting ~ime has been obtained as a function of system demand rate and maXImum number of servers. Average 0:; standard deviation of the v'tli ting time from this equation are - in agreement witt- simulation resul ts.
1. INThCDUC'l' IOl~
1'lul ti server queu£:; mO<.1els stud ied so far
(1) aSstune constant server
Lo~ever, in Some practical
t~e server efficiency is
efficiency.
si tuations,
d ep endan t on
the number of active servers in the sys
OP'a. A mul tiprop;rammed computer system
m:.'y be one of the examples where this
5i tuatiO(l (;xists if each active slot in
thcl mul tiprobrammj.ng system is assumed
to be a server. In a multiprogra~med
computer system, more "than one j ODS sha
re the resources of the syste"l so as to
ach ieve concurren t operat ion of CPU and
its peripheral devices; the number of
active jobs define the level of multipr
ogramming. In the model studied, tt.e
eificiency of each server varies and is
gi ven by Ui/i where Ui is the Sum of the
efr'iciencies of i ~ctive oervers. This
notation has been used so that the appl
ication to the multiprogrammed computer
system becomes straight forward. Ui
wou
ld represent the overall efficiency at
multiprograllll!ling level 1. :,rhe mo,lel is
156
analysed under ~arkovinn assumptlons for
system demand rate and Rrri val mecr.,mi far ..
2. QUEUE MODJ:"l.J
The model is shown in l:'ig.l. Each serv
er works at an efficjency ot' Ui/i wten i
:.;ervers are active &: U, is the cum of I
efficiencies of i Berver~. The Gervicp
time requirements of the custom0rs to
the system has ben sf:sumed to hnvc a
negative exponent ial d tstri but10n wi ttL
an average of t. Th'~ arri val of cUbtc:rn
ers is aHsumed to b~ a Poi~son process
wi th an average of 1\ arri. val b per unt t
time. The queueing iiscipline 1s first
come first served (YCt~). The actual
servi ce time will v~ry rlepen'iing upon
the m.unt;er of active :;erven:l, tt,e maxim
um number of serverb oeing i". Therefore,
a term 'system deman(i rate' (::lDh) hru;
been used in place of traffic inten~ity;
system demand rate halo been d(~finerl t?
be the fraction of the time one server
w 111 be kept busy if the "f fici eney of
the server is unity. ior the systHm to
be p.rgod ic, averaf:e i.)U.l:t I.3hould b e les~
t;, ~n aVf>rage b~rvice ti me &.vailable • .r'or
t\,") above merlti oned j·jarkcvian a~su'Ilptio
lle;, average ;;)lJh (~=~) and tP-,rvice rate u i.s U . where lJ is thp SUll! of effici.enceo
lV. !V! 01 sp.rverG ... hen Rll ,., t;"'l'verll !ire bUf'Y·
'lherlHClre, for tr.t -..,ystcl:I to tJe E'rgodici
should be 1 .. S8 than U,,,. 1'1
',( hi;, monel cm' UP a.l'alY.Jed uj the well
k Hown td 1'1, 1 ar. a d ea t·; proce GS \ 1) llnd cr
thd ad umpti or, tha t C'H,tom'::l's not being
servea can fl'(,~1 queuE: 01' infi nj te length.
' .. h,,: stat., tl'ansi t<or; riiabI'am t'al: bE."n
shown ill .r i ,:;. ~~. jo'or all. k > 1'1, LJk==U~l &.b
then the :.lel;',,' ...:erver~J are N. 'lhe ste-
:~d'y iO .. 8 ;,:; :,uu,ysi G gi Vt-!S
1-' n = .t' n-1 0 < n '1-, , .. ( 1 )
~ 1" n> ji. U1'1 l1-i
[
~ ~c: ~l'1-1 1 + U 1 + u~ U 2 +... l.i 1 U 2 ' , • UM_ 1
~L, 1 -1 , . + 77"'"'r.,_._--.-. ---1 ' .. (C!)
0,d.., ... U" \ 1- !u!) c: H 'I
wf,ere'" repr, ;sO.tf: the rrol>abi.li,ty of 11
findJng n .ioliB in the- cystern.
~.1 ~L!-; nglime:
If a C1)~;tcm€r needing serv'l ce time wi th
probar;ili ty dellsi ty flJl1cti Oll \ I'df) of
U exp \-ut), on Hrrival finis there arE
alrdady n cu ... t,'mol's jn t.Le <:iYbte"'l then;
a) if n ('" where .(~ i8 'the max imU!Il number
iUlfneul",tely wtth an efficiellC:f of
~erved U
n1-1 ll+1'
And pd!, of tid: v.ait::llis time will ('e
uU n -rl uU 1 = exp \- --ll±- tj n:;r' n1-1
LJ) if n )J'I" t.he cUbtomer wi. 11 have 'to
wai t in tht-' qUf'ue for, (n-1'.+l) CU~)1,oro'~rl,
to li. ~Hrved uefore bPtn~ served. The
pdf of th~ waiting time in
Id be convolution of pdf's
spent in the queup and the
ring actually bein~ served.
tbiR
of
ti me
The
Cuse wou-
tbe ti[r;(-
~·.pent d Il-
n'H' of the t;me when the queue a,Jvance" by 1 i <;
uU1>'l expl-uUr'l tJ • And thUb pdf for (n-H-+1)
advances in qU8UR will be In-M+1 )th conv-
t n - h n-~+l olution Le. In_MTllu'Ur-l) eXPl-uU1,!t). The pdf of the tir:lt.' actually 't:;()ing served
uUJI'i . uUM will be M expl- -M t), so the pdf of wa-
itin~ in this C8Le wtll be
t 'rn-I1,
f (n-l'i)T ()
UU~ UU~I 1 T exp \ (- -f,j-) • ~ t-l ) } d~. .. (4)
If tr.e prouatJilitv of finding n cudomen;
in the ..,.y'Etl~n· is Pn
, t,HlCl the gt>ncr9.l pdf
of waiting time will De 1'1 uU
i uUi
\1'( t) :- L -1' exp\-i- tj .21 _ 1 i=1
t
+ ;f"P j +JIl f'rj~IlUl,)j+1exp(-uD • .T). "v 0 '1 - ,'I
157
uU .. j. uU1· -
r exp ~(- ~'l~)\'t-T){d~ y' 1" )
a ti on lfl. tel' and rp place
reducps to:
P j+i'j uy
P.q Wi t~. or:
... (5)
... (7)
The first mom~nt of the waiting time wi
II be
1.1:' 1-1 PM ~ 1 -M1 ~+-. 2 u i MUl\'-UM+~1 \l-f/UM)
••• (8)
And the second moment of' the Waiting ti
me Will be:
2 1\12 :; ~ [ w i. s )] .
de s=o
••• l9)
3. APP.LICAT 1011 TO MULT IPHO<.HtAMMING
~quation ,tl) ~ (9) wer~ used to compute
th~ values of average and standard devi
ation of response time for a multlprogr
ammed computer system. Ui would repres
ent the overall efficiency of the system
at mul tiprcgramm1ng level i and 1>1 would
rp.prE:l:;,ent the maximum multiprogramming
level allowed. The values of Ui
were
computed as below (2)
U ::: i
w . ~1-W)1
1- i
i! L: (-L) j L j=O 1-w • j !
... (10)
'" hel'e w is IIG wait at uniprogramming and has Oeen assumed to be 0.65 ~3). The
valud3 of average and stR.ndard d~via tiO
na of the re3pon:oe time for dtfff,orent
system detrand rate and maximum multipro
gramming level have been SrJOWn in b'lg.3.
~his mojel ~~s al~o si~ulated on EC-l030 (;ompLl tel' to check the validi ty of the
ar.alytica,l result:;;. For 'a s~ple of 4000 jobti • .l:'oi ;.>80n arri valt: and negative
\58
exponential jobtimea were generated using
transform technique with equiprobable ra
ndom number generator. These jobs were
run on the Simulated model and their res
ponse times found out. The results obta
ined thereof are portrayed in Fig.,.
4. CQbCLUaIONS
A varying efficiency multi server queue
model has been studied and equation for
pdf of waiting time has been developed.
The results of an application to multipr
ogrammed computer are in agreement with
those obtained by Simulation, thus valid
ating the analytical analysis. The vary
ing efficiency multiserver model is quite
general and could be s'li table for all mu-
1 tj server systems where idle servers help
the busy servers thUd efl'ectively varying
the eff:l.ciency of' service.
rtEFErtENCES
~ 1) .L.JUienrock, 'i,jueueing i.lystem Vol. 1 :
'.cheory' John dley M: 1975
~ 2) S.R.fI'.a.dnic.lq J.J .Donovan, 'Operatj ng
oy Gt em s' i'lauraw hill 1974 (3) l·I.N.J.lehmanj J ..... i:tosenfeld, 'Performa
nce of a Simula ted ftIul t iprogrammi ng
l::lystem' .Proc Ai<'IPd 1968,.l"J(;C. Vol.32.
pp 1431-1442.
bIOGRAPHIES
Lr. H.D • .rI,,;kre was born on April 4. 1935. He received his graduation in Tp.lecotrJllun
lcatian Engineering from JaOalpur Univer
Eity in 195d. de Jid his A.Tech. in Ind
lutrial Elec troni Cf; in 1960 from Ind ian
Institute of 'l:echnology, J:;omoay. In 196;,
he wati awarded Conmonweal th i;}cholarsbip
by lfovt. of Canada and completed hi s .{'II. ~c
}:ngi.nee ri ng in Con trol clys t e~na from the
Univel's.l -;,y of uttawa in 1965. After com
ing back to Inaia he compl€'ted hie Ph.D.
:In 1:)70 on .;lYbte'TI Identificat:lon. He is
on tr,f: teachin~ faculty of Indian InsUt
ute of Technclogy, borr:oay from 1960. At
preHent he i::; guiding a large number of
re~earct Echolars for their ~h.D. on va
riuuH fidrjs suct. ae ..Jysiem I'jentificat
iOll, oJimulation, :"igitCil filter", dpeech
prrJcessing, l'icture procebsinb, !~online
Kr ciyGtc~s, Computer communication syst
eroL, etc. Two cr the reueBrch schclars
hbvp. already comr,leted their rh.Dls. he
h&; .. pUlil.ished over fifty re"earch papers
in vc;.rioub international journals.
hr. h.D. l.uoar "a" born on i.Jept.7, 1941. He gradua twi in Electrical Engineering
from Lanar~G rtindu Univer~ity in 1964
an~ ot.tained hi..: masters deeree in Appl
ied Lleci,ronics in 1967. In 1973, he
wa,_ aloo:o1rded (,;oflffion;%altb Jc;,olqrship and
com[llrted his N. ;;,c. in Comp'-lter :';cience
fro~ ~lliversity of ~Hnchester in 1975.
QUEUE
ne is on the tcachine faculty of Indian
Insti tute of Technology, .dombay since
1968.
h .11. iJri vaBtava wah born on 12th Ivlarch
1939. He received the Jj.~C. degree from
AllahabFFi UniverlJity, .!:l.E. in :r.lectrical
Eng! neE" rio ng from hoorke e University,
l'I.'l'ecIJ. in Applied Electronicti ar.d Ph.L.
from Ll.?, bombay in 1957,1960, 1967
and 1976 respectively. ~rom 1960 to
1967 he worked ~s Lecturer and from 1967
to 1976 as tteatier in Govt. :r:ngg.(;ollege
Jabalp~r. ne worked as ~roject Engineer
in AltEii Project at L 1.'1'., ilomoay from·
~arch 1976 to June 1977 in a special ati
sigr.mlmt. his subjects of interests are
~ueueing theory application to Comoutpr
bystems, otochastic process etc.
I I
I I I UI/I I L _____ -.J COM PUTER SERVICE
UI- OVERALL UTILISATION FACTOR OF CP 4 AT NUL TIPROGRAMNING LEVEL
FIG.1. QUEUEING MODEL
Adt Adt Adt Adt
FIG.2. STATE TRANSITION DIAGRAM
159
IIJ ::E ~
IIJ III Z 0 II. III IIJ II: IIJ o,!)
'" a: !oJ >
'"
z 0 ... ~ > IIJ 0
0 a::
" 0 z « ... III
100
!oJ ::E i= III III Z ~ 10 III III a:: III o,!)
" a:: IIJ >
'"
1~ __ ~ ____ ~ __ ~ ____ ~ __ ~ ____ ~
·7 ·75 ·8 ·85 ·9 '95 ',0 SYSTEM DEMAND RATE
FIG. 3 Q. RESPONSE TIME (ANALYTICAL)
100
1101 = 10
1101 = 8
~7~---. .I..75""-'--I,.8---~. 8""5----"""9--·.L..9-5-...J, • a
SYSTEM DEMAND RATE
FIG.3c.STANDARD DEVIATION OF RESPONSE TIME (ANALYTICAL)
160
~~7---'~7-5---'~8----~'8-5-~'9---~.9-5---J,,0 SYSTEM DEMAND RATE
FIG3b. RESPONSE TIME (SIMULATION)
10
z 0 j:
~ > M :10 ~ 10 1101:8 0 a:: C§ Z
:5 III
lL-__ -L ____ +-__ ~ ____ ~ __ ~_~ ·7 ·75 ·8 ·85 ·9 ·95 1.0
SYSTEM DEMAND RATE
FIG.3d STANDARD DEVIATION OF RESPONSE TIME (SIMULATED)
1
MODULAR DESIGN
OF THE NETVI0RK WHICH REALIZES ORIGINAL PROGRAM
Svetlana P. Kartashev University of Nebraska - Lincoln,
and Steven I. Kartashev
DCA Association, Lincoln, Nebraska
Abstract
The paper considers modular design of the network which is assembled from n unit modules, and realizes given program. The network contains given number p (1 < p < n) of the distinct module types, and is assembled with the use of only pin-to-pin connections.
1. INTRODUCTION
Current trend of large scale integration is to augment the size of one module (IC package). It is therefore expedient to give a broader interpretation of a unit module used as a basic building block =or modular logical realizations.
In the literature on modular logic design it is assumed that a unit module implements either solely a logical function (1-4), or a logial function and th~ flip-
~f1op excited by the function (5-9). The next step in this approach is to assign each unit module with control and executional abilities, i.e. to let it act as a computer module (CH) and contain a control sequencer (sequential machine) and the operations enabled by the sequencer. This means that one output of the sequencer has to enable execution of one operation implemented in the executional portion of the CH. It is then said that this C:I executes an instruction.
This paper continues earlier research pub-
161
1ished in (7-10). Its difference from published works is that a CH is provided with control and executional abilities. Thus it functions as a computing module which executes the portion of an original program. This portion is implemented as the sequencer contained in the CU.
The following problem is solved: for an original program P, a decomposition algorithm is found which decomposes Pinto, n subprogram~. each of which is implemented inside one CM as the transition diagram of an asynchronous sequential machine. Of the total n sequential machine P is decom~osed into. only p(l~p<n) are described by distinct transition diagrams. The consequence of this decomposition is that of the total n CH's which compute P" only p unit module types. CH(l). CH(2), ... , CU(p) are different. This means that CMel) is copied k1 times. CH(2) is copied k2 times, ...• and CH(p) is copied kp times. Therefore. the overall number n of CH's is:
n =:' kl + k2 + ... + kp .
I' I'
II I' ,
[ I
i!
Ii
To achieve computation of the P program,
n CM's are assembled into tLle parallel
network, by connecting input pins of one
CH to the output pins of ot:1er CH's. It
is silOwn that the network N realizes pro
gram P if the input pins of every CII are
connected with the output pins of otDer
CM's in accordance with a so-called shift
register principle first studied in (9)
for modular logical realizations. Thus
the proposed decomposition algorithm
partitions an original proGram Pinto n
sequential machines working in parallel
and gives the rule of connections among
n CM's into a parallel network ~ w~icri
realizes P. The decomposition algorithm
was completely computerized. It was used
for computer-aided design of trie network
N whicri realized 8 programs containing
altogether 343 vertices. T~e network N
wa~ formed from 28 CH's of two types
CH(l) and CM(2), of whicri CM(l) was copied
12 times and CM(2) was copied 16 times.
2. DESCRIPTION OF TdE PROG&hl1 P
The program P realized by the network N
is presented by the program graph, where
each vertex of the grapri identifies one
conventional program instruction and
arrows identify sequencing among program
instructions. There exist two types of
transitions in the program grap~: uncon-
Figure 1.
Program directed graph.
162
ditional and conditional. Each unconditional transition may be eit~er simple
or conplex. By a simple unconditional
transition a ~ b we mean execution of the
b instruction at the next clock period
after the a instruction. By a complex unconditional transition a ~ b, c, d, we
mean concurrent executions of b, c, U in
structions in distinct CH's at the next
clock period after the a instruction.
There exist two types of conditional transitions: simple and complex. By a simple
conditional transition a ~ b; a i c, we mean execution of the b instruction at the
next clock period after the a instruction
if the x sienal is 1 (x = 1) and execution of c instruction at the next clock period after the a instruction if the x signal is o (x = 1).
By a complex conditional transition a ~ b, c, d·, a ~ e, f k n h f 11 , ,~, we mean teo ow-ing: if x signal is I, then execution of a instruction is succeeded at the next
clock period by concurrent execution of b, c, d instructions in pairwise distinct
Cl1' s. If x siBnal is 0 (x = 1), the ex
ecution of a instruction is succeeded at the next clock period by concurrent execu
t~on 0-: e, f, k, £, instructions in pairwise distinct CH' s. [In the program directed graph (Fig. 1) the d + f transition from vertex 4 to vertex 2 is th~ simple
unconditional transition; the f + d, a *
transition from vertex 3 to vertices 4 and 1 is the complex unconditional transttion· the a'~ ~ f, a)~ & u transition from vertex'
6 to vertices 3 and 8 is the simple conditional transition; the b* ~ a b* ~ w W Of J ,
w transition from vertex 10 to vertices 28, 13, 24, 22 is the complex conditional transition. ]
3. DESCRIPTION OF THE NET\m~ N
By network N we mean a set of interconnect
ed CM's which work in parallel over compu-
tation of given program P. Each CM is one IC package and is described by the hardware diagram of Fig. 2. It consists of
the control part, CP, and executional, part, EP. The CP contains the control sequencer C realized as an asynchronous
(Hoore) sequential machine, one identifi
cation flip-flop v and the logical circuit Lv which switches v. The EP contains the processor, memory for data word storage
and input/output device.
Every CM works only if the respective
identification flip-flop v=l. If v=O, CM is idle. Hence parallel execution of k
program instructions in k CM's (k ~ n) is represented by an n-dimensional binary
vector ~ = (VI' v2 ' ... , vn ) having k ones in which each ... oordinate vi £ {O,l} identifies the working state of CM.. Vector
l. ~ is called a network state. In the end of each clock period, network N performs ~ + ~I transition from the network state
~to the network state ~'. As a rule,
v I: ~'.
[Let N contain five CM's, i.e., N =
{CMl , CM2 , CM3 , CM4 , CM5 }. If N performs the 1 1 0 1 0 + 0 1 1 1 0 transition,' then
h .l . + 1 2 3 4 5 ~ h w 1. e 1n v l = 1 1 0 1 0, h as CMl , CM2 and CM4 working over execution of program instruction say a, band c, respectively, whereas CM
3 and CM5 are idle. At the next
clock period N establishes the new network·
state ~2 = 0 1 1 1 0, which means that CMl and CM5 are idle whereas CU2 ' CM3 , CM4 execute new program instructions, for
example d, e, f, respectively.]
Thus for each ~ + ~' transition, one of four types of transitions may occur in
every CMi :
(a) vi = 1 for both ~ and ~'; CHi executes one instruction in ~ and one instruction
+' in v. This type of transition is called
the 1 + 1 transition. + +' (b) Vi = 0 for both v and v; CMi works
163
output dynamic signals
I ,egls,e,l· ··1 ,egltter 1 EP CP , , ~ ..
1 ... r-...
lAdder (J I: T P,ocellor
I (Jcontrol dtcodein C [I 0
IVO
T V, ) Signals from
Identlficotlon fl • .. h F F ~,- ,.glsteFl · · Vn_1 flip-flops I r.glster ... il regl.ter I + Lv
(I next-llate IOftien .
} Rondomr oeceu C""o..
I
~Adl m."!ory , lJi\i \,.,J
• word, : warde
, ,
word, worde I doto regl.ter t •
( 1I0)devlce
I
1 · · ·
1j 12
~ "-/ 1
}
input dynamic signols
Figure 2.
Hardware diagram of one eM.
neither in ~ nor in ~~ i.e. it executes the 0+0 transition.
(c) Vi 0 for ~ and Vi = 1 for ~'; CMi executes nO instruction in ~ and one in-
+' struction in v, or it performs the 0+1
transition.
(d) ,Vi = 1 for ~ and Vi = 0 for ~'; CMi executes one instruction in ~ and no in-
+' atruction in v, or it performs the 1+0
transition.
Every 1+1 and 0+1 transitions in CHi are performed under a special signal x.
l.
called a dynamic signal. The Xi signal enables the C-sequencer transition into a new state (new code in the R. register)
1.
which corresponds to a new prop,ram in-
struction executed by CMi ·
Every 0+0 and 1+0 transitions in CMi mean
that identification flip-flop Vi = 0
164
blocks decoding of the state stored in the Ri register. Thus CMi executes no instructions during the clock periods
when Vi = O. Every CHi may generate two types of dynamic Signals: unconditional
and conditional.
The unconditional dynamic signals are generated by the control decoder of every
C-sequencer and they are used for execution of unconditional transitions from one program instruction to another. The conditional dynamic signals are generated by the EP-circuit of the CM.. One may
l. regard as a conditional dynamic signal either an adder overflow, or a counter
overflow. or a comparison signal, etc. The conditional dynamic signals are used for execution of conditiona~ transitions from one program instruction to another.
The dynamic signals generated by CM are
used for switching the C-sequencer of the same CN. Furthermore, the same signals may be fed to other CH's where they switch the respective C-sequencers into new states. {Suppose, instruction a executed in CH2 calls for comparison of two words A and B. The result of comparison is identified by signal 3 (A = B) or S (A! B). The 8 signal is produced by the comparator of CM2 , and conceived as a conditional dynamic signal. The 6 = 1, enables three concurrent program transitions a ~ b, c, d, from the a instruction to b, c, d instructions, executed in parallel in CM2 , CM3 , CM4 respectively. The B = I also enables three concurrent program transitions a ~. e, f, h, also executed in parallel in Cl12 , Clf3 , CH4 . Thus in CM2 which executes the 1+1 transition, two alternative program ~ransitions are executed a ~ b or a ~ e. The B-signal (8 or S) is fed to CH3 and CM4 . The CU3 performs the 0-.. 1 transition. This means, that if 8=1, it executes c instruction, if B=l, it executes f instruction. Since in previous clock period CM3 was idle, the program transitions executed by CH3 will be denoted as ~.c, and ~'f. The CM4 also executes the 0+1 transition interpreted as either ~.d or ~'h.l Thus it follows from our description, that when N assumes a network state V, each CM. having v. = 1 computes one program
J. J.
instruction. If v has t ones in positions 1, 2, ... t, N computes concurrently pro
gram instructions aI' aZ ' ... , at in CMl , CM2 , ... CM~, respectively. Concurrent computation by N of the instructions aI' aZ' ... , a v during one clock period is thought of as an instruction vector A = (a l .a2 , ...• a t ). Thus the computation of program P by N is conceived as the sequence of executed instruction vectors
A. ft •... , f ..... n, etc.
Note: One distinguishing quality of N is
165
that during execution of the same program p. N can establish the same network state several times, each time computing a new instruction vector. For example, in the
beginning when the network state vI =
I I 0 I I is established first, N =
{CMI , CM2 • CM3 . CM4 • CM5} may compute the instruction vector A = (a.b.O.f.k); in another portion of the program p. establishment of the same vIII 0 I 1 may lead to computing by N of another instruction vector Bl = (c,d,O,m,t), etc. For all instruction vectors induced by the same system state vI' their zero positions
coincide with those of VI' + +,
For each network transition v -+ v. one must switch to the working mode(enable) only those CM's which correspond to the ones positions of the state v'. To do this
one must generate the set of dynamic signals, each of which enables one or several cH' s. A collection of dynamic signals which causes one network transition v -+ v' is called a vector of dy
namic signals. [Let VI = 1 1 0 1 0 and v2 = 0 III O. For I 1 ° 1 0 +0 1 1 1 0,
one has to enable CM2 • CM3 and CM4 using the vector of dynamic signals x = (O.x2' x3 ,x4 ,0). Note: A portion of dynamic signals x2 ,x3 ,x4 might be external with respect to some CM's.] Also, network N may perfom several times the same transition v + v'each time generating a new vector of dynamic signals x. Let the same v + v'be executed at two distinct moments of time t and t'. We assume that at the moment t, it is enabled by the vec~r i 1 of dynamic signals, i.e., V ---- v', and at the moment t' it is enabled by another~vector x2 of dynamic signals. 1. e., v __ Z_ v~~ Then at the moment t, N. executes A __ 1_ ~ transi~on and at the moment t' it executes ~ ---- ~ transition, where A. ~, C, Dare instcuction vectors. In future, each
': I
,I
1
~ transition will be called a pro-gram vector transition.
Since working or nonworking state of each CHi is identified by the identification flip-flop v. (v. - IVO)
l l '
vided with the interface each Vi is profunction F. im-
l
plemented with the logic Lv (Fig. 2). Any function Fi is the function of a dynamic signal Xi fed to or generated inside of CMi and the present system state; stored in the identification flip-flop; vl ,v2 '
... ,vn : Fi f (xi';)'
Each CM is synchronized by two synchroni
zation sequences '1 and '2' New meaning of the flip-flop v is established at the
moment of Tl-pulse. A transition of the C sequencer to the next state is executed at the moment of T2 pulse, i.e., during the time when all identification flipflops v l ,v2 ' ... ,vn assumed their new meanings, thus forming a new network state. Any program instruction executed by each CHi lasts one clock period TO which is measured by two consecutive pulses '2'
4. THE CYCLIC SHIFT REGISTER PRINCIPLE USED FOR MINIMIZATION OF THE NUMBER
OF PAIRWISE DISTINCT CM'S.
Since each CMi is placed on a single integrated package, one has to minimize the
fill
.1Il
v,
number of distinct module types used for fabrication of N. This minimization is conceived as using only a limited number p of pairwise distinct computer modules CM(1),CM(2), ... ,CM(p) (otherwise called basic CM's) i.e., N uses kl replicas of one basic CM(l) , k2 replicas of another basic CM(2) , ... ,kp replicas of the pth CM(p). [Network N of Fig. 3 has 5 CM's, N
{CM(1)l,CM(1)2,CM(2)3,CM(2)4,CM(2)S} where CM(l)l and CM(1)2 are the replicas of basic
CM(l) , and CM(2)3' CM(2)4' CM(2)5 are the replicas of basic CM(2).] To fabricate k.
l
computer modules by replication of the same basic CM(i) , one has to achieve the total hardware identity in their control and executional circuits. Let us show how this may be accomplished.
The initial program P to be computed by the MCS must be modularly decomposed by a special modular decomposition algo'rithm. As a result, computation of initial program P is reduced to the parallel computation
of n subprograms PI'.·.' Pj , ... , P n' where each P. is computed by CM.. Since N con-
J J tains only p basic CM's, all hardware identical eM's must compute identical subprograms. Thus the whole set {PI'" .,Pn } of subprograms must be partitioned into p subsets p. such that each p. subset con-
l l
Os CM(2)4 Os CM(2l5
0, CL[31
0, (1[41
09 (1[5:]
aDI 0.(41 Ci(5]
°'0 °'0 0'0
Oil gDI
°Il (41
0" V(51
·3 '. V5
Figure 3. Connection diagram of the network N.
166
tains only mutua~y identical subprograms. In further treatment, a Pi subset of identical subprograms (p. {P., P. , ... ,
~ ~l 1.2
P . . J) will be called the p:-Modular Pro-l.k· :!:
gram (Pi - MP).
Let us show that minimization of the num
ber of distinct. Pi - MP's results in minimization of the number of basic CM's, i.e. minimization of distinct module types which one has to produce to fabricate N. Suppose that the realization of the
Pi - MP calls for the realization of w(Pi) distinct microoperations. This means that
execution of the same Pi - MP in CM's, say, CM I , CM2 , eM3 necessitates that the executional portions EP of these modules contain only w(Pi) micro-operations. Consequently, if CMI , CM2 , CM3 realize the same p. - HP their executional parts are
~
pairwise identical, i.e., EP I = EP2 = EP 3 .
We now find in what way one can organize the hardware identity of the C sequencers
of the CH' s. The mere fact that CHI' Ct1Z '
CM3 compute the same Pi - HP does not mean that their sequencers CI , CZ ' C3 are pairwise identical. This originates from the fact that at the same clock period CI •
CZ' C3 sequencers may assume distinct states, enablin~ CMI , CMZ' CH3 for execution of different instructions. For the unconditional transition a ~ b, c, d,
executed in CHI' CM2 , CM3 let the sequencer of the CHI perform the a .... b transition under the dynamic signal x generated by the same CMI , i.e., symbolically,
a[ll~ btl]. The x[ll dynamic signal is fed to CMZ and CM3 to enable execution of the c instruction in CM2 and d instruction in CM3 . Thus the Cz sequencer of CMZ
performs x[II~C[2] and the C3
sequencer
of eM3
performs x[I) •• d/3J (Fig 4a).
167
Thus to implement a/l]~b[l), c[Z],
d[3], the CI , CZ' C3 sequencers call for distinct hardware. Thus to achieve the
hardware identity of CI ' eZ' C3 sequencers, it is necessary that these sequencers contain the same transitions. It may be achieved by supplementing PI' PZ' P3 subprograms by the needed program transitions. Indeed, for our case the Pz subprogram must be supplemented by the a~ b transition to be executed in CH2 but at another clock period, i.e.,
a[2] x[21, b[Z) (Fig 4b). The x[2) dynam
ic signal generated by a[Z] must imitate the x[l] signal in what is seen as the transition of the CI sequencer into the d
instruction ( x[2] •• d[l]) and the tran-
sition of the C3 sequencer into the c
instruction ( x[2] hc[3)). Similarly,
one has to implement the a .... b transition
x(3) in the CH3
, 1. e., a [ 3] .. b [3). The x[3] dynamic signal generat~d by a[3] has
to perform x13] .'c[l] in GMI and
x [3 J II. d [2] in CHZ (Fig 4c). Therefore,
the hardware identity of the CI , C2 , C3 s~quencers can be achieved only if the
PI' P2' P3 subprograms contain ing transitions: for, say, ~l work transition, one performs
the follow-+
... v2 net-
a [1 J x [1 J • b [1], c [2 J, d [3 J for another
network transition, say, ~14 ~ VIS' one
performs a [ 2 J x [Z] .. b [Z J, c [ 3 J, d [ 1), and
for a third network transition, for in
stance, v30 + v32 one performs
a[3] x[3] .. b[3], c[l], d[2].
The set Ei (a + b) = {a[ill + brill, ariZ] ~ b[i2l, ... , a[ikil ~ b[iki1} containing ki replicas of the a + b transi
tion belonging to Pi-MP, where a[i1 ] +
brill belongs to Pil , a[i2l ~ b[ belongs
to'Pi2 , ",a[ikJ "+ b[ikJ belongs to Pik
, will
be called the complete modular expansion.
For the a "+ b, c, d transition considered
earlier and assigned to PI - MP {PI' PZ' P3}, the Pz and P3 subprograms are to be supplemented by the wanted program transitions. This results in obtaining the following modular expansion EI(a ~ b, c, d);
t [ I J ..2Ull. b [ I] ,c [ 2 J ,d [ .1]
El (a ~ b,c,d) == [2J~ b[2] ,c[3] ,del] (1) [3]2lll b[3] ,c[l] ,d[~J
It follows from the above that any modular expansion E. (a -+ b, c, d) contains k.; mem-1. . ~
bers 1f [i l ], 11 [i2],."., 11 [1.kil, where each 11[j] denotes the program transition a[j] x[j)b[g], crt], d[k] implemented in
the P. subprogram (which is executed by J
the CM.). The index j of the member 1T[j] s~ows the position of the CMj which executes the predecessor instruction a[j]. The a[j] instruction induces the dynamic signal x[j]. For every member 1T [j], each instruction f must be assigned with position ~(i.e. ,f [£]), which shows that the f instruction is executed in CM~.
f----C~ ___ + ___ C_M.::..2 __ -+ ___ C_M-=3:.....-_---1
I I ~ I I
(0)1 ~ I I ~ I I I I I ~--------~--------~----~
I I ~ ~ I
(b) :
I
I 1 I
( c)
Figure 4.
~ ~ ~
Steps of achieving the hardware identity in CMI , CM2 , CM3
",
168
Similar position numbers are assigned to
the dynamic signals.
Assignment of positions for each instruc
tion and dynamic signal contained in the
member rr[j] £ E. (a ~ b.c.d) is governed ~
by a so-called cyclic shift operator ~.
For the modular expansion EI(a ~ b.c.d)
{ rr[l]. TI[2]. TIr3]} [given by (1)], one
can easily notice that the a instruction
contained in TI[l]. TI[2]. TI[3] is assigned
with positions described by the shifting
rule 1 ~ 2 + 3. The same rule is used
in assigning positions for x dynamic
signal and b instruction. For the c
instruction. the position assignment is
described by the rule 2 + 3 -+ 1 which is
one bit shifted with respect to 1 -+ 2 -+ 3.
The 2 + 3 + 1 rule instead of 1 -+ 2 -+ 3
originates from the fact that the c in
struction is executed not in CMI but in
CM2
. For the d instruction the positions
are assigned under the rule 3 -+ 1 -+ 2.
two bit shifted with respect to 1 -+ 2 -+ 3;
since the d instruction is executed in
CM3 ·
Thus for the modular expansion E1(a -+ b,
c.d) the position assignment is described
by the cyclic shift operator ~l =
1 + 2 -+ 3. Each cyclic shift operator ~ ~i may assume k i distinct states: generic state ~.O [which identifies positions of --- ~ instructions and dynamic signals in gener-
ic member rr[i l ] of the modular expan-
sion E. (a + b. c. d)] and derivative ~ 1 2 ki-l hI. states ~ .• ~ .•...• ~. • w ere~; ~s
~~~ ~ ~ ~ ~
the state of ~ .• which is 1 bit shifted ~ h . 0 \1/ 2 . with respect to t e gener~c ~i ; Ti ~s
the state of ~. which is two bit shifted ~
with respect to ~.O. etc. For the example ~
considered earlier. ~l = l~ forms the followin8 states: generic state
~l 0 = [(1). (2). (3)] -- because accord
ing to (1). the generic TI[l] € El(a -+ b. c. d) enables instruc
tions and dynamic signals in CMl • CM2 •
1 CM3 -- and two derivative states ~l and 2 1 - ~l • where ~l = [(1) ~ 2, (2) -+ 3, (3) -+
2 1] and ~l = [(1) ~ 3, (2) ~ 1, (3) -+ 2].
where each (p) ~ w shows that a position
p belonging to the generic state ~.O must ~
be replaced by the position w in the
derivitave state. Therefore for the modu
lar expansion EI (a ~ b. c. d), containing
the members TI[l]. TI[2]. TI[3]. we obtain 1 2 that TI[2] = ~l (TI[l]). TI[3] = ~l (TI[l]).
Furthermore. the cyclic shift operator ~i
which describes a modular expansion E.(a ~ b) must be the same for any modular ~
169
expansion Ei(c ~ d. e, f) built for the
same Pi - MP. Thus each Pi - HP is described by its own cyclic shift operator
~i' Therefore. if the initial program P is described by I modular programs
PI - MP •...• Pi - MP..... Pe - MP. then for any Pi - ~W there exists the cyclic shift operator ~. which describes all mod-
]-
ular expansions of this Pi - HP.
Henceforth. for any modular expansion Ei
assignment of the positions to instruc
tions and dynamic signals for every member
TI[j] (j = i l • i 2 •...• i ki ) must be governed by the following rule: Closure of
Ej(a ~ c. d. e. f) with respect to o/i.
Each modular expansion Ei(a ~ c. d. e. f)=
{TI[i l ]. TI[i2 ].···. TI[iki ]} bu{~t for the
Pi - MP must be closed with respect toP i ,
i.e .• for any TI[j] € Ei(a -+ c, d, e, f), 0/ .(TI [j]) = TI [m], where TI [m] is also a mem-~
ber of the same Ei . [For El(a -+ b, c.d)=
{TI.[l], TI[2], TI[3]} given by (1), o/ll(TI[l]) = TI[2] and TI[2] € El(a -+ b. C;
d), ~/(TI[I]) = rr[3], 'l'13(rr[l]) = TI[1].]
Let us show that the Closure Rule can be
applied for organization of the pin-topin connection between k i identical CM's.
In our example for the modular expansion
El(a + b, c, d) built for the CM1 , CM2 , CM3 , respectively, and described by
~l l~, the Cl , C2 , C3 sequencers are transferred to the states c and d by
external dynamic signals. In CMl , CM2 , Cli3 , let the x dynamic signal fed through
the pin 0c transfer each C sequencer into the state c. Transition of the same C
sequencer into the d state is performed by
the dynamic signal fed through the pin 0d' Then, as follows from (1). in CMl the Cl sequencer assumes the d state under x[2]
signal fed to 0d; the same sequencer assumes the c state under x[3] signal fed
to 0d' etc. Therefore the following matrix describes feeding of dynamic sig
nals x[l], x[Z], x[3] to the pins 0c and
°d'
2 f[21
ia ..... Id > .... S; Itt > i!l
..... -~ ,.....
7 5 ;::; 0
..... ::t
27 u[2]. zUJ
(0)
° °d c CMI x [3] x[2J
CMZ x[ 1J xl3]
CM3 x[Z] xlI]
Since for El(a ~ b, c, d) given by (1) the positions of dynamic signals are described
by ~l = 1~3, feeding of the dynamic signals to the pins 0c and 0d is established as follows.
Let the generic state ~O specify the posi
tions of dynamic signals fed to the pins
0c and 0d in the CMl • i.e., ~O = «3),
(2». Then for CM2 , one has to feed, to
the pins 0c and 0d' the dynamic signals which positions are identified by ~l «3) + 1, (2) + 3), and for CM3 , the posi-
HI)
~ ...... tt > .... S; ..... Itt ~ > E2 -!!:!. @
8 4 '6
..... N ..... :::lI
9 uUl z[2]
(b)
Figure 5. Sub-programs computed by CM(l)l and CH(l)2'
170
tions of the dynamic signals· fed to Dc and 0d are identified by ~2 = «3) ~ 2, (2) ~ 1) .
4. MODULAR DECOMPOSITION
The modular decompostion algorithm con
sists of the following stages:
Stage 1: Form sets of identical instruc
tions otherwise called generic sets
[For program P given by Fie. l.the follow
ing generic sets are formed: a*(1,6),
f(2,3), d(4,S), u(7,8), b*(19,20,lO), e(11,23,2l), c(lS,17,16), g(12,2S,28),
w(13,24,22), h(18,26,14), z(27,9»)
StaRe 2: Select n, p, and ki(i=l, ... ,p) where n is the overall number of CM's, p is the number of CM's types, k. - is the
~
number of copies of one CM(i) type.
Algorithm: Of all generic sets, form p blocks Bi , each block containing generic sets of the same size ki(i=l, ... ,p). Then:
n = kl + k2 + ... + kp ' P is the number of
g[41
d.[41v0L5J
(0) (b)
distinct blocks Bi . [For our case, all generic sets have sizes 2 or 3, therefore
p = 2, kl = 2, k2 = 3, n = S. Bl = {a* (1,6), f(2,3), d(4,S), u(7,8), z(27,9)1.
B2 = {b*(19,20,lO), e(11,23,2l), c(lS,17, 16), g(lS,2S,28), w(13,24,22), h(18,26,
14) }. )
Stage 3: (1) For every k., give arbitrar-~ .
ily the shift-register operator ~i : i l ~
i2 ~ ... ~ ~k. [For our case, kl = 2, ~ k2 = 3. Ther~fore, ~l shifts positions 1 and 2 (l~), ~2 shifts positions
3, 4, S (3~).)
(2) For each generic set a(i l , i 2 , ... ,
i k ) containing k. members, assign arbi~ . ~
tr~rily one element a(j) to one position j £ {i l , i 2 , ... , i k ) [For a*(1,6) =
{a*(l), a*(6)}, a*(I) is assigned to
CM(l)l' (a*(l) = a*[l)), a*(6) = a*[2); for f(2,3) = {f(2), f(3)}, f(2) = f[l),
f(3) = f[2). etc. Fig's S,6 show all the
remaining assignments made for Bl and B2)
9 (5] g[31
ii [5Jvc1[3J c1[31vG[41
(c)
Figure 6.
Subprograms computed by CM(2)3' CH(2)4' CI1(2)S·
171
I'
Stage 4:
(1) For each original program transition
a + b construct a complete modular expan
sion E(a + b) which satisfies the closure
rule with respect to ~i. If E(a + b)
contains only original program transitions,
no modifications are necessary. If
E(a + b) contains supplemented program
transitions the modifications indicated
in p. 2 are to be performed:
(2a) For every CMi , introduce position
code i which binary value is one fewer
than i [For CM1 , i = 000, for CM2 ,
i = 001, ... , for CMS' i = 100.]
(2b) Equip every position code with de
coder containing n outputs r l , r 2 ,···, r n ,
thereafter called recognizers. For every
CM., only r. [i] = 1, the remaining recog-1. 1.
nizers, rj[i] = O. [For CM1 , decoder
contains S outputs, rl-rS . Of those,
only rl[l] = 1, since it decodes i = 00
stored in CM1
. The remaining recognizers
r2[l], r
3[l], r 4 [l], rS[l] are zeros.
For CM2
, only r 2 [2] = 1, etc.]
(2c) For the modular expansion E(a + b)
containing, say, only one original pro
gram transition, it is required that re
maining program transitions (which war
rant the closure rule) be supplemented.
[Let for E(a + b) = {a[3] + b[3],
a[4] + b[4], a[S] + b[S]}, described by
~ : 3 + 4 + S, only a[3] + b[3] be origi
nal. Since it is executed in CM3
, enable
it with valid recognizer r3
[3] = 1:
a[3] r3[3] )b[3]. Enable a[4] + b[4]
with recognizer r 3 [4] = ~(r3[3]):
a[4] r3[4] ,b[4]. Enable a[S] + b[S]
with recognizer r3
[S] = ~2(r3[3]), namely,
a[S] r 3 (S] I b[S]. Since r3
[4] = r3
[S]
0, two supplemental program transitions
will never be executed.
Therefore, introduced procedure of supple
mentation is such that no supplemented
172
transition is actually executed. Conse
quently program P will be executed cor
rectly. Fie. 's Sand 6 contain all five
subprograms, the original P is decomposed
into.
CONCLUSIONS
This paper presents new mathematical ideas
which allow one to perform a total formal
ization in designing network U destined
to realize an original program P. It is
shown that the principle of shiftinp, op
erator f = (~l' ~2'···' ~p) whic~ opera
tion is equivalent to p concurrently
working shift-registers with total size in
n bits results in the network N containing
n CM' s, of which only p Ct1' s are distinct.
The contributed ideas affect the hardware
identity among modules at the expense of
equivalent supplementation of P.
REfoERE,ICES
1. F. P. Preparata and D. E. Muller,
"Generation of near-optimal universal
Boolean functions", J. Comput. Syst.
Sci. vol. 4, pp. 93-102, Apr. 1970.
2. F. P. Preparata, "On the design of
universal Boolean functions", IEEE
Transactions, Comp., vol. C-20,
pp. 418-423, Apr. 1971.
3. S. S. Yau, and C. K. Tang, "Universal
lop;ic modules and their applications",
IEEE Trans. Comput., vol. C-19, pp.
141-149, Feb. 1970.
4. T. F. Tabloski and F. J. Mowle, "A Nu
merical Expansion Technique and its
Application to Minimal Multiplexer Lo
gic Circuits" IEEE Trans. on Comput.,
vol. C-2S, pp. 684-703.
S. P. Heiner and J. E. Hopcroft, "Hodular
decomposition of synchronous sequential
machines", in Proc. 8th Annu. IEEE Symp.
Switching and Automata Theory, 1967,
~p. 233-239.
6. T. F. Arnold, C. J. Tau, and M. M. New
born, "Iteratively realized sequential
circuits", IEEE Trans. Comput., vol.
C-19, pp. 54-66.
7. S. p. Kartashev, "Methods for realizing
sequential machines on identical inte
grated circuits", Proc. Symp. Comput.
and Automata, Poly tech. Inst. of Brook
lyn, April 13-15, 1971. 8. S. P. Kartashev, "State Assignment for
Realizin8 Modular Input-Free Sequential
Logical Networks without Invertors",
Journal of Comput. and System Sciences,
vol. 7, no. 5, 1973.
Svetlana P. Kartashev was born in Kiev,
USSR. She received the B.S. and M.S. de
grees both in electrical engineering from
the Kiev Poly technical Institute, Kiev, in
1960 and 1961, respectively. In 1969 she
received the Ph.D. degree in computer sci
ences from the Institute of Cybernetics,
Kiev.
From 1961 to 1969 she worked, respectively,
as a Research Engineer and a Research
Associate in the Institute of Cybernetics.
Becoming a permanent resident of the Unit
ed States, from 1970 to 1972 she was a Re
search Associate of the Department of
Computer Science, at The Johns Hopkins University. She is currently an Associate
Professor of Computer Science at the Univ
ersity of Nebraska, Lincoln.
173
9. S. P. Kartashev, "Theory and Implemen
tation of p-Multiple Sequential Ma
chines", IEEE Transactions on Computers,
May 1974, pp. 500-523.
10. S. P. Kartashev, "Parallel Computation
in Modular Computing Systems", Proceed
ings of International Symposium on Uni
formly Structured Automata and Logic,
Japan, Aug. 21-23, 1975, pp. 184-191.
Steven I. Kartashev was born in 1934 in
the USSR. He received the B.S., the M.S.,
and Ph.D. degrees from the Institute of
Cybernetics, Kiev, all in computer sciences
(computer systems and architecture) in
1958, 1959, and 1966, respectively.
From 1961 to 1965 he was with the Insti
tute of Cybernetics, where he developed
and computerized formal design techniques
for constructin8 the control units which
vIere widely used for large computer sys
tems. From 1966 to 1969, he was an Associate Professor with the Computer Sci
ence Department in the Kiev Civil Aviation
Institute. Since 1969 he has been a per
manent resident of the U.S. From 1970 to
1972, he was a Research Scientist in the
Computer Science Department, The Johns Hop
kins University, Baltimore, MD. Concurrent··
ly, he was a Consultant of the Defense,
Space, and Special Systems Groups, Bur
roughs Corp., Paoli, PA. From 1973 to
1974, he was Chief Investigato: of the LSI multicomputer system project with the firm
Dynamic Computer Architecture, Lincoln,
NE. He is currently President of this
firm.
, I'
I'
i I I \
I I'
Azuma, S. ------------------------Balakrishnan, A. V. --------------Bansal, R. K. --------------------
_ Benchimol, C. -------------------Blackwell, W. A. -----------------Branin, F. H. --------------------Chang, F. C. ---------------------de Figueriredo, R.J.P. ----------DeDantis, R. M. ------------------Desoer, C. A. --------------------Dewilde, D. ----------------------Dobruck, A. L. -------------------Dolezal, V. ---------------------Fatic, V. M. ---------------------
../ Feintuch, A. ---------------------Foulker, R. H. -------------------Haas, W. H. ---------------------Hedberg, D. ----------------------Helton, J. W. --------------------Ho, Y. S. ------------------------Ioannids, G. ---------------------Jacyno, A. ----------------------Jury, E. I. ----------------------Kartashev, S. I. ----------------Kartashev, S. P. -----------------Kekre, H. B. --------------------Komkov, V. ----------------------Kumor, R. D. --------------------Leake, R. J. --------------------Levan, N. ------------------------Liberty, S. R. ------------------Lindquist, C. S. -----------------
-Martin, C. -----------------------Mishra, K.L.P. ------------------Missaghie, M. M. -----------------Naylor, A. -----------------------Newcomb, R. W. -------------------Phatak, D. B. --------------------Piekarski, M. S. -----------------Porter, W. A. --------------------Pu1ufani, S. R. ------------------Rajski, C. ----------------------Ramarajan, S. --------------------Reichert, J. D. ------------------Roe, P. H. ----------------------Saeks, R. -~---------------------Saito, M. ------------------------Schumitzkum, M. ------------------Sekhon, H. S. --------------------
~Silverman, L. M. -----------------Sinha, V. P. ---------------------Smithers, M. A. -----------------Srivastava, H. M. ---------------Steinberger, M. -----------------Subramanian, R. -----------------Swanimathan, B. ------------------Trzaska, A. ---------------------Tung, L. -------------------------Waldron, M. B. -------------------Wexler, D. -----------------------Wilton, D. R. --------------------Womack, B. F. -------------------W~nham, W. M. --------------------
AUTHOR INDEX
5 2
11 4 5 8 9
11 6
1,7 8 9 2 5 4
10 9
10 8 9
10 5 8
11 11 11
5 11 10
1 3 9 6
11 6
10 2,5
4 9
2,7 9 5
11 3 9 4 9 2
10 2
10 11 11
2 11 10 10
4 11
6 5 5 6
Yates, R. E. ----------------------Yousif, S. M. ---------------------Zames, G. -------------------------Zemanian, A. H. -------------------Zyla, L. U. -----------------------
174
11 6 4 4
11