3.2.2 Inhomogeneous Linear Evolution Equations
It may be possible that the linear evolution equations have an inhomogeneous structure,
\dot{Y} = A Y + B w + F ,   (3.68)
where F(t) is an additional generalized force. This problem can be solved by a transformation of the state vector Y → Ỹ = Y − θ, where θ satisfies the equation
\dot{\theta} = A\theta + F ,   (3.69)
so that the new evolution equation for Ỹ,
\dot{\tilde{Y}} = A\tilde{Y} + Bw ,   (3.70)
remains. Furthermore, the transformation modifies the original performance functional (3.43) into
J[\tilde{Y}, w] = \frac{1}{2}\int_0^T dt\, \left[(\tilde{Y}(t)+\theta(t))^T Q(t)\, (\tilde{Y}(t)+\theta(t)) + w^T(t) R(t) w(t)\right] + (\tilde{Y}(T)+\theta(T))^T S\, (\tilde{Y}(T)+\theta(T)) .   (3.71)
This result suggests that the class of linear quadratic control problems with
inhomogeneous linear evolution equations can be mapped onto the class of
tracking problems.
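As a minimal illustration of this mapping (a sketch, not taken from the text; the matrices, the force F, and the horizon are chosen ad hoc), one can integrate the auxiliary equation (3.69) for θ and verify numerically that the shifted state Ỹ = Y − θ indeed follows the homogeneous dynamics (3.70):

```python
import numpy as np
from scipy.integrate import solve_ivp
from scipy.linalg import expm

# Hypothetical system data (not from the text): a damped 2D system with a
# constant generalized force F; the control w is switched off for brevity.
A = np.array([[0.0, 1.0], [-1.0, -0.3]])
F = np.array([0.5, 0.0])
T = 5.0
Y0 = np.array([1.0, 0.0])
theta0 = np.zeros(2)                      # any particular solution of (3.69) works

# Step 1: auxiliary equation (3.69), theta_dot = A theta + F.
theta = solve_ivp(lambda t, th: A @ th + F, (0.0, T), theta0,
                  dense_output=True, rtol=1e-10, atol=1e-12)

# Step 2: original inhomogeneous dynamics (3.68) with w = 0.
Y = solve_ivp(lambda t, y: A @ y + F, (0.0, T), Y0,
              dense_output=True, rtol=1e-10, atol=1e-12)

# Check: the shifted state Ytilde = Y - theta obeys the homogeneous
# equation (3.70), so it must equal exp(A t) (Y0 - theta0).
t_check = 3.0
Ytilde_shifted = Y.sol(t_check) - theta.sol(t_check)
Ytilde_direct = expm(A * t_check) @ (Y0 - theta0)
print(np.allclose(Ytilde_shifted, Ytilde_direct))   # True
```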
3.2.3 Scalar Problems
A special class of linear quadratic problems concerns the evolution in a one-dimensional phase space. In this case all vectors and matrices degenerate to simple scalar values. In particular, the differential Riccati equation is now given by
\dot{G} + 2AG - \frac{B^2}{R}\, G^2 + Q = 0  with  G(T) = \Omega .   (3.72)
This equation is the scalar Riccati equation, originally introduced by J.F. Riccati (1676–1754). A general solution of (3.72) is unknown. But if a particular solution G^{(0)} of (3.72) is available, the Riccati equation can be transformed by the map G → G^{(0)} + g into a Bernoulli equation,
\dot{g} + 2\left(A - \frac{B^2}{R}\, G^{(0)}\right) g - \frac{B^2}{R}\, g^2 = 0 ,   (3.73)
which we can solve in general. This is helpful whenever an analytical or numerical solution of (3.72) for a special initial condition is available.
We remark that some special elementary integrable solutions are available
[10, 11, 12]. Two simple checks should be done before one starts a numerical
solution [13]:
• If B^2\alpha^2 = 2\alpha\beta A R + \beta^2 Q R for a suitable pair of constants (α, β), then α/β is a special solution of the Riccati equation, and it can be transformed into a Bernoulli equation.
• If \frac{d(QR)}{dt}\, B - 2QR\,\frac{dB}{dt} + 4ABQR = 0, the general solution reads
G(t) = \sqrt{\frac{QR}{B^2}}\; \tanh\!\left( C - \int_0^t \sqrt{\frac{Q}{R}}\, |B|\, d\tau \right) .   (3.74)
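As a quick numerical sanity check of the second criterion (a sketch with arbitrarily chosen constants, not from the text): for constant A, B, R, the choice Q(t) = Q_0 e^{-4At} fulfills the condition, and the closed form (3.74) should then solve the Riccati equation (3.72) up to discretization error:

```python
import numpy as np

# Hypothetical constants (not from the text); Q(t) = Q0*exp(-4*A*t) fulfills
# d(QR)/dt * B - 2*Q*R*dB/dt + 4*A*B*Q*R = 0 for constant A, B, R.
A, B, R, Q0, C = 0.5, 1.0, 1.0, 2.0, 1.5

def Q(t):
    return Q0 * np.exp(-4.0 * A * t)

def G(t):
    # closed-form solution (3.74); the integral of sqrt(Q/R)*|B| is analytic here
    integral = abs(B) * np.sqrt(Q0 / R) * (1.0 - np.exp(-2.0 * A * t)) / (2.0 * A)
    return np.sqrt(Q(t) * R) / abs(B) * np.tanh(C - integral)

# residual of the scalar Riccati equation (3.72) at a few sample times
for t in [0.3, 1.0, 2.7]:
    h = 1e-6
    Gdot = (G(t + h) - G(t - h)) / (2.0 * h)           # numerical derivative
    residual = Gdot + 2.0 * A * G(t) - (B**2 / R) * G(t)**2 + Q(t)
    print(f"t = {t}, residual = {residual:.1e}")       # close to zero
```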
An instructive example of a scalar problem is the temperature control in a homogeneous thermostat. The temperature follows the simple law
\dot{\vartheta} = -\kappa\vartheta + u ,   (3.75)
where ϑ is the temperature difference between the system and its environment, u is the externally adjustable heating rate, and κ is the effective heat conductivity. A possible optimal control is a certain stationary state given by u* = κϑ*. Uncertainties in the preparation of the initial state lead to a possible initial deviation Y(0) = ϑ(0) − ϑ*, which should be gradually suppressed during the time interval [0, T] by a slightly changed control u = u* + w. Thus, we have the linear evolution equation \dot{Y} = -\kappa Y + w, i.e., A = −κ and B = 1.
A progressive control means that the accuracy of the current temperature with respect to the desired value ϑ* should increase with increasing time. This can be modeled by Q = αt/T, R = 1, and Ω = 0. We obtain the Riccati equation
\dot{G} - 2\kappa G - G^2 + \frac{\alpha t}{T} = 0  with  G(T) = 0 .   (3.76)
The solution is a rational expression of Airy functions,
G(t) = -\left(\frac{\alpha}{T}\right)^{1/3} \frac{\tilde{\kappa}\,\mathrm{Ai}(x) + \mathrm{Ai}'(x) - C\left[\tilde{\kappa}\,\mathrm{Bi}(x) + \mathrm{Bi}'(x)\right]}{\mathrm{Ai}(x) - C\,\mathrm{Bi}(x)} ,   (3.77)
with Ai and Bi the Airy functions of the first and second kind, \tilde{\kappa} = \kappa (T/\alpha)^{1/3}, and x = \tilde{\kappa}^2 + (\alpha/T)^{1/3}\, t. The boundary condition defines the constant C,
C = \frac{\tilde{\kappa}\,\mathrm{Ai}(\tilde{\kappa}^2 + \alpha^{1/3} T^{2/3}) + \mathrm{Ai}'(\tilde{\kappa}^2 + \alpha^{1/3} T^{2/3})}{\tilde{\kappa}\,\mathrm{Bi}(\tilde{\kappa}^2 + \alpha^{1/3} T^{2/3}) + \mathrm{Bi}'(\tilde{\kappa}^2 + \alpha^{1/3} T^{2/3})} .   (3.78)
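A minimal numerical cross-check (a sketch; the parameter values follow the left panel of Fig. 3.5, all function and variable names are ad hoc): integrate the Riccati equation (3.76) backward from G(T) = 0 and compare with the Airy-function expression (3.77) with C from (3.78):

```python
import numpy as np
from scipy.integrate import solve_ivp
from scipy.special import airy            # returns (Ai, Ai', Bi, Bi')

kappa, alpha, T = 0.0, 1.0, 1.0           # parameters of the left panel of Fig. 3.5

# backward integration of the Riccati equation (3.76) from G(T) = 0
riccati = lambda t, G: 2.0 * kappa * G + G**2 - alpha * t / T
num = solve_ivp(riccati, (T, 0.0), [0.0], dense_output=True,
                rtol=1e-10, atol=1e-12)

# Airy-function expression (3.77) with the constant C from (3.78)
kt = kappa * (T / alpha) ** (1.0 / 3.0)                 # kappa tilde
x = lambda t: kt**2 + (alpha / T) ** (1.0 / 3.0) * t
AiT, dAiT, BiT, dBiT = airy(x(T))
C = (kt * AiT + dAiT) / (kt * BiT + dBiT)

def G_airy(t):
    Ai, dAi, Bi, dBi = airy(x(t))
    num_ = kt * Ai + dAi - C * (kt * Bi + dBi)
    den_ = Ai - C * Bi
    return -(alpha / T) ** (1.0 / 3.0) * num_ / den_

for t in [0.0, 0.25, 0.5, 0.75]:
    print(t, float(num.sol(t)[0]), G_airy(t))           # the two G columns agree
```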
In order to understand the corresponding control law w* = −GY* and the optimal relaxation behavior of the temperature difference to the nominal state, see Fig. 3.5, we must be aware that the performance integral initially suppresses a strong heating or cooling. In other words, a very fast reaction to an initial disturbance cannot be expected. The first stage of the control regime is dominated by a natural relaxation following approximately \dot{Y} = -\kappa Y, because the contributions of the temperature deviations, QY*^2 ∼ tY*^2, to the performance are initially small in comparison to the contributions of the control function, Rw*^2. The dominance of this mechanism increases with increasing heat conductivity κ. The subsequent stage is mainly the result of the control via (3.77). We remark that the final convergence of G(t) to zero is a consequence of the corresponding boundary condition.
Fig. 3.5. Scalar thermostat: optimal control functions w* (top) and optimal temperature relaxation Y* (bottom) for different time horizons T = 1, 2, 3, and 5. The initial deviation from the nominal temperature is Y(0) = 1. The parameters are κ = 0, α = 1 (left), κ = 0, α = 10 (center), and κ = 10, α = 10 (right)
The consideration of a nonvanishing end point contribution to the performance also allows other functional structures.
3.3 The Optimal Regulator
3.3.1 Algebraic Riccati Equation
A linear quadratic problem with an infinite time horizon and with both the
parameters of the linear system and the parameters of the performance func-
tional being time-invariant is called a linear regulator problem [14]. Obvi-
ously, the resulting problem is a special case of the previously discussed linear
quadratic problems. The time independence of the system parameters offers a substantial simplification of the required mathematical calculus. Hence,
optimal regulator problems are well established in different scientific fields
and commercial applications [7, 15].
The mathematical formulation of the optimal regulator problem starts
from the performance functional with the infinitely large control horizon
J_0[Y, w] = \frac{1}{2}\int_0^{\infty} dt\, \left[Y^T(t) Q Y(t) + w^T(t) R w(t)\right] \to \inf   (3.79)
to be minimized and the linear evolution equations (3.41) with constant co-
efficients
\dot{Y}(t) = A Y(t) + B w(t) .   (3.80)
By no means can the extension of a linear quadratic problem with a finite horizon to the corresponding problem with an infinitely large horizon be interpreted as a special limit case. The lack of a well-defined upper border also implies the lack of an endpoint contribution. To overcome these problems, we first consider a general performance
J[Y, w, t_0, T] = \frac{1}{2}\int_{t_0}^{T} dt\, \left[Y^T(t) Q Y(t) + w^T(t) R w(t)\right] + \frac{1}{2} Y^T(T)\, \Omega\, Y(T)   (3.81)
with finite start and end points t_0 and T instead of the functional (3.79). We
may follow the same way as in Sect. 3.1.4 in order to obtain the control
law (3.55), the evolution equations for the optimum trajectory (3.56), and
the differential Riccati equation (3.53). The value of the performance at the
optimum trajectory using (3.55) becomes
J^* = J[Y^*, w^*, t_0, T]
    = \frac{1}{2}\int_{t_0}^{T} dt\, \left[Y^{*T} Q Y^* + w^{*T} R w^*\right] + \frac{1}{2} Y^{*T}(T)\, \Omega\, Y^*(T)
    = \frac{1}{2}\int_{t_0}^{T} dt\, Y^{*T}\left[Q + G B R^{-1} B^T G\right] Y^* + \frac{1}{2} Y^{*T}(T)\, \Omega\, Y^*(T) .   (3.82)
From here, we obtain with (3.53) and (3.56)
J^* = \frac{1}{2}\int_{t_0}^{T} dt\, Y^{*T}\left[-\dot{G} - A^T G - G A + 2 G B R^{-1} B^T G\right] Y^* + \frac{1}{2} Y^{*T}(T)\, \Omega\, Y^*(T)
    = -\frac{1}{2}\int_{t_0}^{T} dt\, \left[Y^{*T}\dot{G} Y^* + \dot{Y}^{*T} G Y^* + Y^{*T} G \dot{Y}^*\right] + \frac{1}{2} Y^{*T}(T)\, \Omega\, Y^*(T)
    = -\frac{1}{2}\int_{t_0}^{T} dt\, \frac{d}{dt}\left[Y^{*T} G Y^*\right] + \frac{1}{2} Y^{*T}(T)\, \Omega\, Y^*(T)
    = \frac{1}{2} Y^{*T}(t_0)\, G(t_0)\, Y^*(t_0) ,   (3.83)
where the last step follows from the boundary condition (3.54). We remark that this result is also valid for the general linear quadratic problem with time-dependent matrices. We need (3.54) for the application of a time-symmetry argument. The performance of the optimal regulator may be written as
J_0[Y^*, w^*] = J[Y^*, w^*, 0, \infty] .   (3.84)
Since the performance of the optimal regulator is invariant against a translation in time, we have
J_0[Y^*, w^*] = J[Y^*, w^*, 0, \infty] = J[Y^*, w^*, \tau, \infty]   (3.85)
for all initial times τ if uniform initial conditions, Y(τ) = Y_0, are considered.
Thus we obtain from (3.83) the relation
Y_0^T\, G(\tau)\, Y_0 = \mathrm{const}  for  -\infty < \tau < \infty .   (3.86)
Hence, we conclude that the transformation matrix G is time-independent. This requires that the differential Riccati equation (3.53) degenerates to a so-called algebraic Riccati equation [6],
A^T G + G A - G B R^{-1} B^T G + Q = 0 ,   (3.87)
and the optimal control as well as the optimal trajectory is described by (3.55) and (3.56) with completely time-independent coefficients. Therefore, the optimal regulator can also be interpreted as the mathematical realization of a static feedback strategy.
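A small numerical illustration of this degeneration (a sketch; the system matrices are arbitrary examples, not from the text): integrating the differential Riccati equation backward over a long but finite horizon, G(t) settles at the constant solution of the algebraic equation (3.87), which SciPy can also compute directly:

```python
import numpy as np
from scipy.integrate import solve_ivp
from scipy.linalg import solve_continuous_are

# Hypothetical time-invariant data (not from the text).
A = np.array([[0.0, 1.0], [2.0, -1.0]])
B = np.array([[0.0], [1.0]])
Q = np.eye(2)
R = np.array([[1.0]])
T = 20.0                                    # long but finite horizon
Omega = np.zeros((2, 2))                    # end-point weight

def riccati_rhs(t, g):
    # differential Riccati equation: G_dot = -A^T G - G A + G B R^{-1} B^T G - Q
    G = g.reshape(2, 2)
    dG = -(A.T @ G + G @ A - G @ B @ np.linalg.solve(R, B.T @ G) + Q)
    return dG.ravel()

sol = solve_ivp(riccati_rhs, (T, 0.0), Omega.ravel(), rtol=1e-10, atol=1e-12)
G_finite = sol.y[:, -1].reshape(2, 2)       # G(0) of the finite-horizon problem

G_algebraic = solve_continuous_are(A, B, Q, R)
print(np.max(np.abs(G_finite - G_algebraic)))   # tiny: the two solutions coincide
```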
3.3.2 Stability of Optimal Regulators
If the algebraic Riccati equation is solved, the dynamics of an optimal regulator is completely defined by the control law (3.55) and the dynamics of the state of the system (3.41). These two equations lead to the equation of motion of the optimal trajectory (3.56). An initially disturbed system should converge to its nominal state for sufficiently long times, i.e., we expect Y → 0 for t → ∞. This behavior has comprehensive consequences. If we adjust a regulator in such a manner that (3.55) holds, the initial deviation as well as any later spontaneously appearing perturbation decreases gradually. The necessary condition for this intrinsic stability of the regulator is that the evolution equation of the optimal trajectory (3.56) is stable. That means the so-called
transfer matrix D of the linear differential equation system
\dot{Y}^* = \left[A - B R^{-1} B^T G\right] Y^* = D Y^*   (3.88)
must be stable, i.e., all eigenvalues of D must have negative real parts.
Let us study the inverted, frictionless pendulum as an instructive example.
The pendulum consists of a cart of mass M and a homogeneous rod of mass m, inertia J, and length 2l hinged on the cart (Fig. 3.6).

Fig. 3.6. The inverted pendulum problem

The cart may move frictionlessly along a straight line under the external control force F. Denoting by ϑ the angle between the rod and the vertical axis and by x the position of the cart, the equations of motion are given by
(M + m)\,\ddot{x} = m l\,(\dot{\vartheta}^2 \sin\vartheta - \ddot{\vartheta}\cos\vartheta) + F ,   (3.89)
(J + m l^2)\,\ddot{\vartheta} = m g l \sin\vartheta - m l\, \ddot{x}\cos\vartheta .   (3.90)
The stationary but unstable solution of this problem, ϑ* = ϑ̇* = ẋ* = F* = 0 and x* = const., may be our optimum solution. Now, we are interested in the control of small perturbations. To this end we introduce the dimensionless quantities
y = \frac{M + m}{m l}\,(x - x^*) , \qquad \tau = \sqrt{\frac{m (M + m) g l}{(J + m l^2)(M + m) - m^2 l^2}}\; t ,   (3.91)
and
w = \frac{F}{m g}\, \sqrt{\frac{J/l^2 + m}{M + m}} ,   (3.92)
and the system parameter
\varepsilon = \sqrt{\frac{M + m}{m}\; \frac{J + m l^2}{m l^2}} .   (3.93)
Thus, the linearized equations of motion are now \ddot{y} = -\vartheta + \varepsilon w and \ddot{\vartheta} = \vartheta - w/\varepsilon. This leads us to the state vector Y = (y, v, ϑ, ω) with v = ẏ and ω = ϑ̇. The control has only one component, namely w. Hence, we get the matrices
A = \begin{pmatrix} 0 & 1 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & 1 \\ 0 & 0 & 1 & 0 \end{pmatrix}  and  B = \begin{pmatrix} 0 \\ \varepsilon \\ 0 \\ -\varepsilon^{-1} \end{pmatrix} .   (3.94)
The matrix A is unstable, i.e., it possesses a positive eigenvalue. Although this example seems to be very simple, a numerical solution [16] of the algebraic Riccati equation is required for a reasonable structure of the quadratic performance functional. The main problem is that the nonlinear Riccati equation usually has more than one real solution. However, the criterion to decide which solution is reasonable follows from the eigenvalues of the transfer matrix D = A - B R^{-1} B^T G.
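A minimal numerical sketch of this selection procedure (the weights Q, R and the value of ε are illustrative choices, not from the text): solve the algebraic Riccati equation for the pendulum matrices (3.94) and check the eigenvalues of the closed-loop matrix D:

```python
import numpy as np
from scipy.linalg import solve_continuous_are

eps = 1.2                                    # illustrative value of the parameter (3.93)
A = np.array([[0.0, 1.0,  0.0, 0.0],
              [0.0, 0.0, -1.0, 0.0],
              [0.0, 0.0,  0.0, 1.0],
              [0.0, 0.0,  1.0, 0.0]])
B = np.array([[0.0], [eps], [0.0], [-1.0 / eps]])
Q = np.eye(4)                                # illustrative weights, not from the text
R = np.array([[1.0]])

G = solve_continuous_are(A, B, Q, R)         # stabilizing solution of (3.87)
D = A - B @ np.linalg.solve(R, B.T @ G)      # closed-loop matrix of (3.88)

print(np.linalg.eigvals(A))                  # contains +1: the open loop is unstable
print(np.linalg.eigvals(D).real)             # all negative for the correct solution G
```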
Inverted pendulum systems are classical control test rigs for verification
and practice of different control methods with wide ranging applications from
chemical engineering to robotics [17]. Of course, the applicability of the linear
regulator concept is restricted to small deviations from the nominal behavior.
It is a typical feature of linear optimal regulators that they can control the un-
derlying system only in a sufficiently close neighborhood of the equilibrium or
of another nominal state. However, the inverted pendulum or several modifi-
cations [18, 19], e.g., the rotational inverted pendulum, the two-stage inverted
pendulum, the triple-stage inverted pendulum or, more generally, a multi-link pendulum, are also popular candidates for testing several nonlinear control methods. However, the investigation of such problems is beyond the scope
of this book. For more information, we refer the reader to the comprehensive
literature [20, 21, 22, 23, 24, 25].
In principle, one can also invert the optimal regulator problem, i.e., we ask for the performance which makes a given controller an optimal regulator. The first step, of course, is now the creation of a regulator as an abstract or real technological device. We assume that the regulator stabilizes the system. This is not at all a trivial task, but this problem concerns the wide field of modern engineering [1, 26, 27, 28, 29]. The knowledge of the regulator is equivalent to the knowledge of the transfer matrix D = A - B R^{-1} B^T G.
The remaining problem now consists in finding the performance index with respect to which
the control law of the control instrument is optimal. This problem makes sense
because the structure of Q allows us to determine the weight of the degrees
of freedom involved in the control process [30].
3.4 Control of Linear Oscillations and Relaxations
3.4.1 Integral Representation of State Dynamics
Oscillations
Oscillations are a very frequently observed type of movement. In principle, most physical models with a well-defined ground state can be approximated by the so-called harmonic limit. This is, roughly speaking, the expansion of the potential of the system in terms of the phase space coordinates X = {X_1, X_2, ..., X_N} up to second order around the ground state or equilibrium state⁴. This physically pronounced state can be interpreted as the nominal state of a possible control theory. Without any restriction we may identify the origin of the coordinate system with the ground state, X* = 0.
This expansion leads to a linear system of second-order differential equations,
\ddot{X}_\alpha + \sum_{\beta=1}^{N} \Omega_{\alpha\beta} X_\beta = 0  for  \alpha = 1, \ldots, N ,   (3.95)
or, in a more compact notation, Ẍ + ΩX = 0 with the frequency matrix Ω⁵. Of course, this linearization is an idealization of the real object. However, the linearized motion has been thoroughly studied because of its wide applications. The harmonic theory is a sufficient and suitable approximation in many scientific fields, e.g., molecular physics, solid state physics, or engineering. The influence of external forces f_α requires the consideration of an inhomogeneous term in (3.95). Thus, this equation can be extended to the more generalized case
\ddot{X}_\alpha + \sum_{\beta=1}^{N} \Omega_{\alpha\beta} X_\beta = f_\alpha .   (3.96)
The force f = {f_1, f_2, ..., f_N} can be interpreted as a superposition of driving forces from external, but noncontrollable, sources ψ_α(t) acting on each degree of freedom α and the contributions of N′ possible control functions u = {u_1, u_2, ..., u_{N′}} linearly coupled to the equations of motion,
f_\alpha(t) = \psi_\alpha(t) + \sum_{\beta=1}^{N'} B_{\alpha\beta}\, u_\beta ,   (3.97)
where B is a matrix of type N × N′ with usually time-independent coefficients (Fig. 3.7). In principle, system (3.96) can be extended to the generalized system of linear differential equations
\hat{D} X(t) = \hat{M} f(t)   (3.98)
with the differential operators⁶
\hat{D} = \sum_{k=0}^{n} a_k \frac{d^k}{dt^k}  and  \hat{M} = \sum_{k=0}^{n} b_k \frac{d^k}{dt^k} .   (3.99)
⁴ Or another sufficiently strongly pronounced stationary state.
⁵ The frequency matrix is sometimes also denoted as the dynamical matrix.
⁶ Of course, we may reduce the higher derivatives to first-order derivatives, but this requires an extension of the phase space by velocities, accelerations, etc. This prolongation method is the standard procedure discussed in the previous chapters. However, in the present case such an extension of the phase space is not desirable.
Fig. 3.7. External driving forces and control forces
The time-independent coefficients⁷ a_k and b_k are matrices of the order N × N.
For instance, a vibrational system with the linear Newtonian friction has the
operator
\hat{D} = \frac{d^2}{dt^2} + \Lambda \frac{d}{dt} + \Omega ,   (3.100)
where the matrix Λ contains the friction coefficients. Equations of type (3.98)
can be formally solved. The result is a superposition of a solution with zero
external forces considering the initial state and a solution with a zero initial
state considering the external forces
X(t) = \sum_{k=1}^{n} H_k(t)\, \left. \frac{d^{k-1} X(t)}{dt^{k-1}} \right|_{t=0} + \int_0^t H(t-\tau)\, f(\tau)\, d\tau .   (3.101)
The functions H_k(t) and H(t) are called the response functions of the system. These quantities are straightforwardly obtainable, for example, by application of the Laplace transform
A(p) = \int_0^{\infty} dt\, \exp\{-p t\}\, A(t) ,   (3.102)
which especially yields a polynomial representation of the differential operators,
D(p) = \sum_{k=0}^{n} a_k\, p^k  and  M(p) = \sum_{k=0}^{n} b_k\, p^k .   (3.103)
⁷ A more generalized theory can be obtained for time-dependent coefficients. For the sake of simplicity we focus here only on constant coefficients.
From here we conclude that the Laplace transformed response functions are simple algebraic ratios of two polynomials, e.g., H(p) = D(p)^{-1} M(p).
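For a scalar damped oscillator (a sketch with ad hoc numbers; the operator D(p) = p² + λp + ω² and M(p) = 1 are illustrative assumptions), the response function H(p) = D(p)^{-1}M(p) and the convolution in (3.101) can be evaluated, e.g., with scipy.signal:

```python
import numpy as np
from scipy import signal

# Hypothetical scalar example (not from the text): one damped oscillator with
# D(p) = p^2 + lam*p + om2 and M(p) = 1, hence H(p) = 1/(p^2 + lam*p + om2).
lam, om2 = 0.4, 4.0
H = signal.TransferFunction([1.0], [1.0, lam, om2])

t = np.linspace(0.0, 20.0, 2001)
t_imp, h = signal.impulse(H, T=t)            # response function H(t)

f = np.sin(1.5 * t)                          # an arbitrary external force
t_out, x, _ = signal.lsim(H, U=f, T=t)       # X(t) = int_0^t H(t - tau) f(tau) dtau
print(h[:3], x[:3])
```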
Relaxation Processes
Obviously, (3.101) can be extended to all processes following generalized kinetic equations of the type
\hat{D} X(t) + \int_0^t K(t-\tau)\, X(\tau)\, d\tau = \hat{M} f(t)   (3.104)
with a suitable memory kernel K(t). Physically, the convolution term in (3.104) can be interpreted as a generalized friction indicating the hidden interaction of the relevant degrees of freedom of the system, collected in the state vector X, and other degrees of freedom constituting a thermodynamic bath. The causality of real physical processes always requires the upper limit t of the integral. A general difference between (3.104) and the time-local equation (3.98) is that the latter can always be transformed into a structure of type (3.104), but a time-local representation of (3.104) cannot be obtained except in special cases. However, the integral representation of the solution of (3.104) is again (3.101), with the exception that the Laplace transform of the response function H(t) is now given by
H(p) = \left[D(p) + K(p)\right]^{-1} M(p) .   (3.105)
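A minimal time-domain sketch of such a memory equation (the exponential kernel, the constants, and the crude discretization are illustrative assumptions, not from the text), for a scalar degree of freedom with \hat{D} = d/dt and \hat{M} = 1:

```python
import numpy as np

# Hypothetical scalar relaxation model (not from the text):
# dX/dt + int_0^t K(t - s) X(s) ds = f(t),  with K(t) = k * exp(-t / tau_m).
k, tau_m = 2.0, 0.5
dt, n = 0.01, 2000
t = dt * np.arange(n)
K = k * np.exp(-t / tau_m)
f = np.ones(n)                              # constant external force
X = np.zeros(n)

# explicit Euler step with a rectangle-rule approximation of the memory integral
for i in range(n - 1):
    conv = dt * np.sum(K[i::-1] * X[:i + 1])     # int_0^{t_i} K(t_i - s) X(s) ds
    X[i + 1] = X[i] + dt * (f[i] - conv)

# the long-time limit is f / int_0^inf K(s) ds = 1 / (k * tau_m) = 1
print(X[-1])
```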
Evolution equations with memory terms are very popular in several fields of modern physics, for example, condensed matter science, hydrodynamics, and the theory of complex systems. The processes underlying the dynamics of glasses [32, 33, 34] or the folding of proteins [35] are typical examples with a pronounced memory. In particular, we can observe a stretched exponential decay,
K(t) \sim \exp\{-\lambda t^{\gamma}\}  with  \gamma < 1 ,   (3.106)
close to the glass transition of supercooled liquids [31, 36, 37].
The memory kernel can be determined by several theoretical and experi-
mental methods. Well established theoretical concepts are perturbation tech-
niques in the framework of the linear response theory [38, 40] or the calculus
of Green’s functions [39], or mode-coupling approaches [31, 33, 36], while var-
ious dielectric [42] and mechanical [41] methods as well as x-ray or neutron
scattering [43] are available for the experimental detection of memory effects.
Fractional Derivatives and Integrals
A very compact representation of a special class of memory kernels is provided
by the fractional calculus [44]. Under certain conditions fractional integrals