# Full text of "Feedback Control and the Arrow of Time"

## See other formats

Feedback Control and the Arrow of Time Tryphon T. Georgiou and Malcolm C. Smith Abstract The purpose of this paper is to highlight the central role that the time asymmetry of stability plays in feedback control. We show that this provides a new perspective on the use of doubly-infinite or semi-infinite time axes for signal spaces in control theory. We then focus on the implication of this time asymmetry in modeling uncertainty, regulation and robust control. We point out that modeling uncertainty and the ease of control depend critically on the direction of time. We also discuss the relationship of this control-based time-arrow with the well known arrows of time in physics. I. Introduction The origin and implications of the "arrow of time" is one of the deepest and least understood subjects of physics. The "arrow" is an intrinsic part of the world as we know it. Yet its emergence in thermodynamics and cosmology, from physical laws which are apparently impervious to it, remains a controversial subject [29]. At first sight, this subject may seem unconnected with the theory of feedback control. However, starting from the very basic fact that our notion of stability in the sense of Lyapunov is time-asymmetric, we argue that the "arrow of time" does have important implications on modeling and uncertainty, robustness of stability, as well as on the topology for the study of the dynamics of feedback interconnections . The circle of ideas that gave rise to this paper began in a short note published by the authors thirteen years ago [8]. There, it was pointed out that the doubly-infinite time axis presents some "intrinsic difficulties" for developing a suitable input-output systems theory — difficulties that are not present in the semi-infinite time axis setting. These difficulties are not mere mathematical technicalities. Rather, they relate fundamentally to the consistency of the theory of stabilizability across different frameworks. Subsequently, a number of papers were written which shed light on the problem [22], [23], [24], [15], [16], [17]. The present paper takes a fresh look and traces the origin of the "puzzle" to the arrow of Lyapunov stability, and then, explores the relevance of this arrow to the topology of dynamical systems and feedback theory. The relationship of the modern theory of dynamical systems with classical physics and thermodynamics is a developing one. A classical contribution by Nyquist and Johnson [28], [18] is a derivation of the electromotive force due to thermal agitation in conductors. In [4] the issue of irreversibility is treated from the point of view of stochastic control theory. More This work was partially supported by the National Science Foundation. T.T. Georgiou is with Department of Electrical and Computer Engineering, University of Minnesota, Minneapolis, MN 55455; tryphon@ece.umn.edu. M.C. Smith is with the Department of Engineering, University of Cambridge, Cambridge, CB2 1PZ, U.K.; mcs@eng.cam.ac.uk recently [13] has sought to formalize classical thermodynamics in the mathematical language of modern dynamical systems (see also [5]). In [27] information flow and entropy have been studied in the context of the Kalman filter. In [31] it is shown that a linear macroscopic dissipative system can be approximated by a linear lossless microscopic system over arbitrary long time intervals. Our point of view here is influenced by [29] and is somewhat different to the above references in that our main goal is to highlight a time-asymmetry, point out its implications, and discuss its relationship to other well-known asymmetries. The present paper begins by providing a new explanation of the issues raised in [8] with regard to an input-output theory for the doubly-infinite time axis. In Section [III] we introduce the time-conjugation operator and discuss the implications of the time-arrow in optimal control problems. In Section|IV]we analyse the effect of the time-arrow on modelling uncertainty; we show that dynamical systems which are close in the usual sense, that a common controller can stabilise and give similar closed-loop responses for either, may not be close when the time-arrow is reversed. Then, in Section |V] we further illuminate the inherent time-asymmetry in our ability to control a dynamical system with two specific examples. These can be thought of as examples of time irreversible feedback phenomena (see Section IV-BD . In Section [VI] we briefly discuss the arrow of time in physics and its relation to the time-arrow of feedback stability. Finally, in Section IVIII we consider feedback loops with small time delays and discuss the contrasting effects of delays and predictors and the connection with the arrow of time. A. Input-output and Lyapunov stability We focus on finite-dimensional linear dynamical systems which, for the most part, are assumed to be time-invariant. The dimensions of input, state and output (column) vectors, as well as the consistent sizes of transformation matrices in state-space models, are suppressed for notational simplicity. The following result is basic and well-known, cf. [38, p. 52-53], Proposition 1: Let P be a linear time-invariant finite-dimensional system which is controllable and observable and is specified by with an initial condition x(0) = 0. Then y e £ 2 [0, oo) for all u e £ 2 [0, oo) if and only if the matrix A is Hurwitz. Moreover, if this condition holds, y is determined uniquely by y(s) = (C(sl - Ay x B + D)u(s), where * denotes the Laplace transform. Many variants and extensions of the result are familiar: signal spaces with different norms can also be used; there is a finite-gain property relating the £ 2 -norms of y and u; even with x(0) ^ the main equivalence in the proposition still holds. Here we would like to highlight the fact that the result establishes an equivalence between stability defined in terms of the forced response and stability defined in terms of the free response, i.e. an II. Time-asymmetry and stability [12, p. 82]. x = Ax + Bu, y = Cx + Du, (1) (2) equivalence between bounded-input/bounded-output (BIBO) stability and Lyapunov stability for a system operating on the positive time-axis. Asymptotic stability in the sense of Lyapunov is obviously a time- asymmetric concept since convergence of the state vector is required as t tends to PLUS infinity, starting from an arbitrary initial condition at t = 0. In itself, BIBO stability does not appear to have this asymmetry, yet it is implicit in the formulation of Proposition [Q To further illustrate the point we can write down the following obvious corollary of Proposition [H obtained by running time backwards from to — oo. By changing the support of the signal spaces from the positive half-line to the negative half-line stability defined through the forced response (BIBO stability) becomes equivalent to asymptotic stability in the sense of Lyapunov for the reversed time-direction as t tends to MINUS infinity. Proposition 2: Let P be a linear system as in Proposition Q] with x(0) = 0. Then y e C 2 (-oo, 0] for all u e C 2 (-oo, 0] if and only if the matrix -A is Hurwitz. We now turn to the situation where inputs and outputs may have support on the doubly- infinite time-axis. In this case the following holds, e.g. see [44, p. 101]. Proposition 3: Let P be a linear system as in Proposition QJ Then there exists y e C 2 (-oo, oo) for all u e £ 2 (-oo, oo) if and only if A has no imaginary-axis eigenvalues. Moreover, if this condition holds, y is determined uniquely by y(s) = (C(sl - Ay 1 B + D)u(s). We remark that Proposition [3] is the natural generalisation of Proposition \T\ when systems are viewed as operators. A linear system in Proposition \T\ becomes a multiplication operator on the Fourier transformed spaces. The operator is bounded if and only if the "symbol" (the transfer-function) belongs to H^, which under the controllability and observability assumption is equivalent to A being Hurwitz. On the double-axis a multiplication operator on the Fourier transformed spaces is bounded if and only if the symbol belongs to — which for rational symbols excludes only poles on the imaginary axis. In Proposition [3] there is no longer any relationship between a notion of BIBO stability and Lyapunov stability (in either time-direction). Clearly, both A and —A may fail to be Hurwitz. Since only the existence of some y E C 2 { — oo, oo) is required for a given u E C 2 (— oo, oo), and the free motion solutions of ([]]) are ignored, this is not surprising. Propositions Q] and [2] by contrast, establish a connection between BIBO stability and Lya- punov stability as t — > +oo (respectively, t — > — oo) without putting in explicit requirements on the free motion solutions. We now consider the feedback interconnection in the form of Fig. \T\ where P and C are linear systems. The existence of signals u h yj E {1,2}) in C 2 [0, oo) which satisfy the feedback equations for a given pair of external inputs u , y in C 2 [0, oo), for a given set of initial conditions, is a well-known and natural definition of stability in terms of the forced response. From Proposition [T] stability in this sense is equivalent to asymptotic stability in the sense of Lyapunov of the combined state- space (assuming minimal realizations for P and C and well-posedness). Again, BIBO stability inherits the required time- asymmetry from the asymmetry of the support interval [0, oo). U + Ui Mg) — Fig. 1. Standard feedback configuration. It is apparent that the corresponding definition of BIBO stability for this feedback interconnection with £ 2 (— oo, oo) signals, generalising Proposition [3l will not correspond to a sensible notion of closed-loop stability. Indeed, we can easily check that a system P with transfer funtion P(s) = l/(s — 1) is "stabilised" by any of the controllers with C(s) = 2, C(s) = 0, or C(s) = — 0.5/(s + 1). (In conventional terms the controllers give closed-loop poles which are in the open left-half plane (LHP), the open right-half plane (RHP), and in both half planes, respectively.) We can summarize the points so far as follows. Stability is a time-asymmetric concept — the requirement of an asymptotic property as t tends to PLUS infinity defines a time arrow. If stability is defined by requiring bounded outputs in response to bounded inputs then a time arrow is not obviously implied. However, for signal spaces with support on a positive (resp. negative) half-line, the definition turns out to imply a positive (resp. negative) time arrow. On the other hand, a bounded-input bounded-output definition of stability for signals with support on the doubly-infinite time-axis does not define a preferred time arrow. Stable systems defined by bounded "multiplication operators" may be stable in the sense of Lyapunov in the positive time-direction, in the negative time-direction or in neither direction. B. The two-sided time axis and causality The fact that the doubly-infinite time axis causes problems for the analysis of stability and of stabilisation was pointed out in [8]. The explanation given there is consistent with that of Section HH but the overall argument was somewhat different. We now summarize the reasoning of [8]. Two systems P; (i = 1,2) defined by convolution operators were considered: /oo hi{t — r)u(r)dT = hi* u -oo where hi(t) = e t for t > and zero otherwise, and h 2 (t) = — e* for t < and zero otherwise, respectively. Each system has (double-sided Laplace) transfer function equal to l/(s — 1), but with differing regions of convergence. The first system is unstable and causal and the second is stable and non-causal (in fact anticausal) according to the usual definitions. When viewed on C 2 (—oo, oo), P 2 is a bounded operator and hence is a stable system in an input-output sense. On the other hand, it was shown in [8] that P x fails to be stabilisable on £ 2 (— oo, oo). This is a counterintuitive result since Pi is stabilisable in the ordinary way on any positive half-line. The proof that Pi fails to be stabilisable on the doubly-infinite time-axis reduces to the observation that the graph of Pi fails to be closed. It was also pointed out in [8] that the closure of the graph of P 1 coincides with the graph of P 2 . Once the graph is closed there appears to be no problem with stabilisation. But in closing the graph "anti-causal" trajectories are brought in which are inconsistent with the convolution representation of the system, so this was considered inadmissible. Another possible remedy discussed in [8] was to consider the underlying differential equation representations rather than the convolution representations. In fact both systems are defined by the same differential equation V = y + u. (3) More precisely, the trajectories of both Pi and P 2 satisfy this equation. In terms of "flow of time" thinking, P 1 appears to arise by solving this equation forwards in time while P 2 is obtained by solving it backwards. This suggestion seems to make stronger the argument to consider Pi and P 2 to be the same system. But this was considered unnatural in [8] on the grounds that it appears to abandon any notion of causality, or that it leaves the direction of time undefined. The discussion of Section [TT] allows the difficulties pointed out in [8] to be explained in a new way. Let us suppose we are willing to accept the closure of the graph of Pi which makes it "stabilizable" on the double-axis in a bounded-input/bounded-output sense. As explained, Pi and P 2 can now be thought of as one and the same system defined by © — a state-space description as in CE© solved forwards or backwards as desired. Does the closure of the graph resolve the difficulty pointed out in [8]? The answer is no, since the notion of stability does not correspond to the usual notions. As is made clear by Proposition |3l the feedback system may turn out to be stable in a conventional sense in the forward, backward or neither time-directions. C. The work of Makila, Partington and Jacob A number of interesting observations and contributions have followed from [8] which we would like to comment on here. The fact that a causal system on the double-axis can have a non-causal closure has led to a study of "closability" and "causal closability" as questions in their own right. Makila [25] has shown that the lack of causal closability for the example of [8] extends to general L v spaces on the double-axis. Jacob and Partington [17] give general characterisations of the graphs of time-invariant systems and derive necessary and sufficient conditions for the closure of a closable system to be causal. In [22] Makila and Partington consider weighted £ 2 -spaces on the double-axis and show that, when signals have very rapid decrease to zero towards — oo, causal convolution operators may be closed operators. (So there is no issue of causality being lost due to the operation of closure.) On the question of stabilization on the double time-axis, Jacob [15] has made an interesting suggestion. We have seen already that closing the graph and applying the BIBO stability definition fails to recover the usual concept of stability. Jacob proposed that causality of the closed-loop operators of the feedback system be added as an extra requirement. Jacob showed that the resulting characterisation of stability agrees with the usual definitions for linear time-invariant systems. In the context of the present paper we can re-interpret this result by saying that the causality condition forces the positive time-arrow into feedback system stability. We can understand this as follows. In [17] it is shown that a closed linear time-invariant system is causal on £ 2 (— 00,00) if and only if the corresponding transfer function belongs to a certain Smirnov class. For finite dimensional systems this is equivalent to the transfer function having no right half-plane poles. Thus, in Proposition [31 if P is required to be causal, BIBO stability agrees with Lyapunov stability with the positive time- arrow. Makila and Partington in [22] make an interesting observation on the possible extension of Jacob's idea to the time-varying situation. They consider a causal, convolution operator derived from the underlying differential equation y{t) + a(t)y(t) = u(t) (4) where a(t) = — 1 for t < and a(t) = +1 for t > and point out that the closure of the £2 (—00, 00) graph of the convolution system is not the graph of an operator. Essentially this boils down to the fact that there are free motion solutions y(t) = ce~'*', u(t) = 0, where c is a constant, which can be approximated arbitrarily closely by elements of the graph. This raises the question of whether the approach of Jacob can recover a theory of stabilization which is consistent with the single-axis case. At the same time it is pointed out that the system is stabilizable in a Lyapunov sense by the feedback u(t) = -2y(t). (5) In the present context this example highlights the care that is needed in defining stability for time-varying systems, even in the conventional sense. The open-loop system © is Lyapunov stable in the forward time-direction for any initial condition specified at any time (either positive or negative), but not uniformly so. Incidentally, the same is true for stability in the backwards time-direction. With the feedback © in force the system becomes uniformly stable in the sense of Lyapunov in the forward time-direction and unstable in reverse. In the perspective of the present paper, any method to force agreement between BIBO stability on the double-axis and conventional notions (such as requiring causality of the closed loop operators) might be seen as tantamount to directly imposing the desired time arrow within the stability definition. In several papers (e.g., [24], [22], [23]) Makila and Partington have advocated the use of a two-operator model for systems on the doubly infinite time-axis in the form Ay = Bu, where A, B are causal, bounded operators, in contrast to a single-operator model y = Pu, where P is causal and possibly unbounded. Closed-loop stability is defined as the existence of a causal, bounded inverse of the feedback system operator mapping system inputs to exogenous disturbances. Since this definition incorporates a causality requirement on the closed-loop system there is evidently a close relationship between this idea and the approach of Jacob. III. Time-Asymmetry and Optimal Regulation This section focusses on the time-asymmetry of the definition of stability and its im- plications in the context of optimal regulation. Firstly, a time-conjugation operator will be defined as well as the concepts of f-stability and b-stability. Then the finite-horizon quadratic regulator problem will be considered for a system running forwards in time and backwards in time, and it will be shown that the optimal cost is generally different. The infinite-horizon (asymptotic) regulator will also be considered in the same way. It will be shown that the optimal cost can be expressed in terms of the two extremal solutions of the appropriate algebraic Riccati equation. The result shows that ease of optimal regulation depends on the time-direction. A. The time-conjugation operator, f-stability and b-stability Let P denote a dynamical system described by the state-space equations in CE©, initialized at time zero and running forwards in time. Let Z denote the operation on P which corresponds to solving £Dl2]) backwards from t — followed by a flip of the time axis (so the new system runs forward again). More specifically we set t\ = —t, so that d __ d dt dt\ ' and then replace t\ by t which results in —x = Ax + Bu, with x(0) = x y = Cx + Du for the system 3(P). The effect on the transfer function is as follows: if P has transfer function P(s), then 3(P) has transfer function P(—s). Define the system P to be f-stable if A is Hurwitz, and define P to be b-stable if 3(P) is f-stable, or equivalently, if —A is Hurwitz. It is immediately obvious that a linear time-invariant system of the type CD© can never be both f-stable and b-stable. Similarly, a controller which makes £Dl2]) f-stable cannot make it b-stable as well. B. The finite-horizon linear quadratic regulator Let P be a linear time-invariant system which is controllable and observable and de- scribed by CEH2K as before, with D = and x(0) = x . Consider the problem of the regulation of P with criterion J= t (y(t)'Qy(t)+u{t)'Ru(t))dt + x{T)'Hx(T). Jo This has solution u (t) = -R^B'S^xit) (6) where - S(t) = S(t)A + A'S{t) - SfyBR^B'Sit) + C'QC (7) and S(T) = H, with optimal cost J^t = x' S(0)xo [1]. When P runs backwards in time from x(0) = xo with cost J= ! (y(t)'Qy(t)+u(t)'Ru(t))dt + x(-T)'Hx(-T) J-T we can check that the optimal control is still given by © where S(t) satisfies © with S(—T) = —H, and that the optimal cost is J^t = —x' S(0)x . It can be readily verified that S(0) (forward case) is in general different from —S(0) (backward case), and so the optimal cost is different in the two cases, e.g., if A = B = C = Q = R = T=1 and H = 10, then 5(0) = 2.5415 in the forward case and -S(0) = 0.5495 in the backward case. C. The infinite-horizon linear-quadratic regulator Again let P be a linear time-invariant system which is controllable and observable and described by CD© with D = and x(0) = x . It is well-known [1] that POO J= (y(t)'Qy(t)+u(t)'Ru(t))dt (8) Jo has a minimum given by J f oo = x' S + x where S + is the unique positive-definite solution to the algebraic Riccati equation A'S + SA- SBR^B'S + C'QC = 0. (9) It is also well-known that S + is the unique solution of © for which A — BR~ l B'S has all its eigenvalues in the open LHP. In the language of the present paper we can say that S + is the unique solution of © which makes the system (H© f-stable with the controller u = -R- l B'Sx. What happens if we require the minimisation of J= I (y(t)'Qy(t)+u(t)'Ru(t))dt (10) J — oo for CD© running backwards in time? This is the same as the conventional problem for the system 3(P). It is easy to see that the minimum is given by Jb i00 = — ^o^Xo where is the unique negative-definite solution to ©. It is also well-known that S- is the unique solution of © for which A — BR~ l B'S has all its eigenvalues in the open RHP [44]. In the language of the present paper we can say that £_ is the unique solution of © which makes the system (d© b-stable with the controller u = —R~ l B'Sx. In general Jf j0O = x' S + x and Jb,oo = —x' S-X are different. This shows that "difficulty of control" is time-asymmetric for the standard linear-quadratic regulator on the infinite horizon. The difference can be significant, e.g. if A = 1, B = e, C = 1, Q = 1 and R = 1 then S + = 2/e 2 + 1/2 + 0(e 2 ) and 5_ = -1/2 + 0(e 2 ) for e small. IV. Time-Asymmetry and Modelling Uncertainty In this section we look at the topology for uncertainty in feedback control and how this is affected by the time arrow. We will see that dynamical systems which are close in the usual sense, that a common controller can stabilise them and give a similar closed-loop behaviour, may not be close if time is reversed. A. The gap metric and robustness of stability Zames and El-Sakkary [43] introduced a metric on dynamical systems for the purpose of assessing robustness. This was based on the gap metric used in functional analysis to study invertibility of operators [19], [32]. Specifically, systems are considered to be operators on £2[0, oo) with a graph which is a closed subspace of £2(0, 00). Consider two linear systems Pi (i — 1, 2) with transfer functions Pi(s) = rii(s) (mi(s)y 1 where n^s) and m^s) are coprime polynomials or, more generally, right-coprime polyno- mial matrices. Let (n i (-s)) T ni(s) + (m i (-s)) T m i (s) = (^(-s)) T d^s) with det (di(s)) a Hurwitz polynomial and ( ) T representing matrix transpose — the exis- tence of such a polynomial (matrix) di(s) is a standard result in the theory of canonical factorization [41]. Then, /miOOWs))- 1 ^ Gpi,H2 '■= \7~L2 ■= Gi(s)H 2 V^sXdiOO)- 1 J is (the Fourier transform of) the graph of P i5 for i — 1,2. Thus, the graph symbol Gi(s) generates the graph of P; as its range. Then the gap between Pi and P 2 is defined to be 8ft 2 (Pi, P2) := ||n<? p H2 — rig P2 H2 1| where 11^ denotes orthogonal projection onto a closed subspace /C. Let the feedback configuration of Fig. [Qbe denoted by [P, C], where P and C are linear systems defined as operators on £2(0, 00) which may possibly be unbounded. Define H P ,c:= (p) (I - PC)" 1 (I -C) to be the operator mapping ( Uq ) T to ( uj y[ ) T . The following are basic robustness results for gap metric uncertainty. Proposition 4: [9] Assume that the closed-loop system [P, C] is f-stable. Then, [Pi, C] is f-stable for all Pi such that 5 H2 (P, Pi) < b if and only if b < 6 PiC where Proposition 5: [43] Assume that the closed-loop system [P,C] is f-stable. Then, the following are equivalent: (i) 5n 2 (P n , P) -> as n -> oo. (ii) Hp njC is f-stable for sufficiently large n and ||H Pn)C - H PiC ||oo ^ as n ^ oo. Proposition [5] was the primary justification for the claim in [43] that the gap metric defines the "correct" topology for robustness of feedback systems. In the present context, it can be seen that the choice of a signal space with support on the positive half-line is essential in achieving an appropriate topology. To emphasize the point, if £ 2 [0, oo) were replaced by £ 2 (— oo, 0] then the above proposition would hold with f-stability replaced by b- stability. Let us consider the case where systems are defined on £ 2 (— oo, oo). Then we define 5£ 2 (Pl> P 2) := ||ng Pi £2 -ng P2 £ J| where Qp t ,C2 Gi(s)C 2 and £ 2 := ^2(— joo, joo). With this definition, Qv,c 2 ls always closed, but may contain "non-causal" input-output pairs (as pointed out in [8] — see also Section III-BI) . It is easy to construct examples to demonstrate that convergence of <5c 2 (P n ,P) to zero does not allow any closed-loop stability prediction, e.g., [P, C] f-stable does not imply [P n , C] f-stable for sufficiently large n. In [36] Vinnicombe introduced a new metric 5 V (-, •) on dynamical systems which defines the same topology as 5n 2 (-, •)» an( ^ wn i cn satisfies the following inequality: Wv) <£,(•,•)< <Wv)- The v-gap between P 1 and P 2 is defined as follows: f ^(Px.Pa) if ^(P 1; P 2 ) := I wno(det(G 2 (- S ) T G 1 ( S ))) =0, (11) I 1 otherwise, where wno(<?(s)) denotes the winding number about the origin of g(s), as s traces the standard Nyquist D-contour [36], [37]. A simple expression for 5c 2 (-, ■) can be obtained using left fractional representations — let Pi(s) = (m i (s))~ 1 h i (s) be a left-coprime polynomial fraction, d{ the Hurwitz polynomial matrices which satisfy fk{s) (hi(-s)) T + rhi(s) (rhi(-s)) T = d i (s)(d i ( y -s)) T , and define Gi(s) := {-(diis))- 1 ^), {diis))- 1 ™^)) for i = 1,2. The graph of Pj is the kernel of multiplication by Gi(s) (in the respective space of signals H 2 or ^ 2 ). The £ 2 -gap can now be expressed as 5 £2 (Pi,P 2 ) := \\G 2 (s)G 1 (s)\\ 0O . It turns out that Propositions |4] and [5] both hold with 5u 2 replaced by 5 V (see [36]). Since 3c 2 — $v when wno(det(G 2 (-s) T G'i(s))) = (12) holds, this condition effectively imposes a positive time-arrow on the double-axis graph which forces f-stability to be retained under small perturbations in 5 V (-, •). This is illustrated by the following result (which can be readily derived from [36, Theorem 4.2]; see also [10]). Proposition 6: Let [P, C] be f-stable and suppose S C2 (P n , P) — > as n -> oo. Then [P n , C] is f-stable for all sufficiently large n if and only if wno(det(G n (-s) T G(s))) = for all sufficiently large n. B. The effect of the time-arrow on gap distances We define a forward and a backward v-gap as follows, 5„ j/ (P 1 ,P 2 ) := <f„(Pi,P 2 ) ^ 6 (Px,P 2 ) := ^(a(Pi),a(p 2 )). It is straightforward to see that <y £2 (Pi,p 2 ) = 5£ 2 (a(Pi),a(p 2 )), so any difference between 5 v j(P\, P 2 ) and S v> b(Pi, P 2 ) lies in the winding number condition in (fTTI) . Let us examine this more closely. Note that det(G 2 (- S ) T G 1 ( S )) - ^ (S) det(d 2 (-s))det(cii(s)) where h(s) := det(m 2 (— s) T mx(s) + n 2 (— s) T ni(s)). (13) If 5c 2 (Pi, P 2 ) < 1 then it can be shown that wno(det(G 2 (— s) T Gi(s))) is well-defined [36], in which case h(s) admits a canonical factorization h(s) = h+{s)h-{s) (14) where h + (s) and h-(—s) are Hurwitz polynomials. Thus, wno(det(G 2 (— s) T Gi(s))) = if and only if deg(/i + (s)) = deg(det(di(s))), or equivalently deg(A_(s)) = deg(det(d 2 (s))) > It can be shown that the degree of det(c£j(s)) is equal to the McMillan degree of Pj (e.g. using the uniqueness of normalised coprime factors over up to a constant unitary transformation and the corresponding state-space realisations [26], [35], [44]). Determining the graph symbol for 3(P«) requires a canonical factorization (ni(s)) T rii(-s) + (nii(s)) T nii(-s) = (di(-s)) T di(s) with det(c?j(s)) a Hurwitz polynomial. Again it can be shown that the degree of det(c?j(s)) is equal to the McMillan degree of Pj. The corresponding winding number condition in 5„ j6 (Pi, P 2 ) can now be expressed as wno(det((rf 2 (-s)" 1 )/i(-s)(J 1 (s)- 1 ))) = which is equivalent to deg(/t_(s)) being equal to the McMillan degree of Pi. We therefore obtain the following result. Proposition 7: Let Pj(s) {i = 1,2) be the rational transfer functions of linear time- invariant dynamical systems as above, with McMillan degrees and with h,h + ,h^ as in H13H14[) . Assume that 8 C2 (P U ~P 2 ) < 1. 1) The following are equivalent: a) <M p i, p 2) < 1. b) deg(/i + (s)) = /ii, c) deg(/i_(s)) = // 2 . 2) The following are equivalent: a) <y Wi6 (Pi,P 2 ) < !. b) deg(/i_(s)) = nx, c) deg(/i+(s)) = // 2 . 3) The following are equivalent: a) ^ / (P 1 ,P 2 ) = ^ j6 (P 1) P 2 ) < 1, b) /ii = yu 2 = deg(/i + (s)) = deg(/i_(s)). In the above proposition, ij expresses the zero winding number condition in (fTTT) in an equivalent form, while 2) does the same for (^(Pi, P 2 ). It is interesting that when the two conditions are combined as in 3) the result is a very stringent requirement which includes the necessity that P\(s) and P 2 (s) have the same McMillan degree. This serves to highlight the fact that "unmodelled dynamics" which may account for a small error in 5 v j (and which may be neglected in the design of a robust controller) will inevitably account for a substantial error in 5 V: b- Example 8: Consider two systems with different McMillan degrees, e.g. P\{s) = 1, P 2 (s) = 1/s. It can be computed that <5„j(Pi,P 2 ) = l/y/2. Proposition [7] then tells us immediately that ^^(Pi, P 3 ) = 1 where Ps(s) = —1/s. Similarly, if Pi(s) = 1/s 2 , then Proposition [7] tells us that both <5„j(Pi, P 4 ) = <5^ i fe(Pi,P 4 ) = 1 since Pi(s) = P 4 (— s). V. Time-asymmetry and robust control This section addresses the implications of the time-asymmetry in the theory of robust control. In particular, we will also see that a system which is "easy" to control in one direction of time may be far from easy to control in the opposite direction. A. Optimal robustness and difficulty of control In [11] it was shown that bp c could be maximised over all stabilising C and that this amounts to solving a Nehari problem [44]. This optimum value, which we denote by &opt,f(P), can be interpreted as a measure of ease/difficulty of control, where a value near to 1 means the plant is "easy to control" and a value near means the plant is "hard to control". With the understanding that & optjf (P) has the meaning of "ease of control" with respect to the forward time-arrow for stabilty, it is interesting to define AiptM-T ) ■— «opt,f (3(P)), which represents "ease of control" with respect to the backwards time-arrow. Our main purpose in defining 6 pt,b(P) is to highlight the influence of the time-arrow in feedback regulation. Let P be a controllable and observable system which is described by the state-space equations in CQIU), as before. Then, following [11], [9], 6opt,f(P) = a/1 - A max (r+x + ) where Y + is the positive definite solution of the Riccati equation A Y + YA* - YCR~ X C*Y +B(I - D*R- 1 D*)B* = (15) where A = A — BD*R~ 1 C and R = I + DD*, and X + is the corresponding solution to the (F-dependent) Lyapunov equation (A - YC^R^Cyx + X{A - YC'BT^C) +C*R- 1 C = (16) for Y = Y + . Similarly, it can be seen that &o P t,b(P) = v 7 ! " A max (YlX_) (17) where Y- is the negative definite solution of the Riccati equation (U51) while X_ is the corresponding solution to (fT6l) for Y = YL. In the following two examples we will see situations where 6 O pt,f(0 and 6 O pt,b(0 arc very different. Example 9: Near pole-zero cancellations. Consider P(s) = 1 + Letting -A = C = D = 1 and B = e in equations (fl5 |) - ([T7T) gives J op t,f (P) 'I + e + e 2 /2- 1 -e/2 2 x /l + e + e 2 /2 It follows that for small values of e, 6 op t,f(P) = 1 1 e 2 + 0(e 3 ) 32 and hence, 6 opt f (P) — ► 1 as e — ► 0. On the other hand, which leads to WP) = 7N+0(e 2 ) for small values of e, and hence 6 op t,b(P) —> as e — > 0. This is accounted for by the fact that P(s) has a near pole-zero cancellation in the LHP, which is innocuous for f-stabilisation, but highly challenging for b-stabilisation. The latter is equivalent to f-stabilisation of P(—s), which has a troublesome near pole-zero cancellation in the RHP. Example 10: Riding Bicycles. A feedback stability problem in everyday experience is bicycle riding. An elementary model to study rider-bicycle stability is given in [2] which gives the following transfer function from steering angle input to tilt angle: where a,/3, 7 are positive constants and V is the forward speed. This model has one RHP pole, but the zero is in the LHP. As such, this plant is not too difficult to control. Let us consider what happens if we try to ride the bicycle backwards in time. This corresponds to trying to stabilise the plant P(—s) forwards in time. The model still has one RHP pole, but the zero is also in the RHP, which makes stabilisation much more difficult. Indeed if V(3 = ^7 the plant is technically not stabilisable. It is interesting to note that an experimental bicycle with the steered wheel at the rear instead of the front has a transfer function from steering angle input to tilt angle given by [2] (see also [21]) This is exactly the transfer function for the conventional bicycle ridden backwards in time. Figure [2] shows the value of 6 pt,f an d b op t,b versus V with parameter values a = 1/3, (5 — 2 and 7 = 9 (which are deemed reasonably realistic). Recall that 6 op t,b is the same as &opt,f f° r the rear- wheel steered bicycle model (TT9b at the same V . It can be observed that fropt,b is less than 6 op t,f for any V. Also, 5 op t,b is very small for low V, indicating difficulty of control, and zero at V = 1.5 m/s. For larger V, & op t,b increases, indicating that control becomes easier. These results are equivalent to the rear-wheeled steered bicycle being more difficult to ride than the front-wheel steered one, but still being reasonably controllable at higher speeds [2]. aV (18) aV s + f3V s 2 — 7 (19) V in m/s Fig. 2. 6 op t,f and 6 op t,b versus V for the bicycle model of d!8t with a = 1/3, /3 = 2, and 7 = 9. 5. Tzme irreversible feedback phenomena The concept of ease or difficulty of control gives a thought-provoking perspective on reversibility. Systems which in a limiting situation are very difficult to control (in the sense that & pt,f (P) tends to zero) are unlikely to be observed in nature or technology. Nevertheless, such a system may be easy to control in the time-reversed direction (see Examples [9] and [TOl) . This is independent of the fact that the underlying differential equation can be integrated equally well in either time-direction. This is reminiscent of phenomena (such as a bottle falling from the table and shattering into many pieces) that appear to be associated with an intrinsic direction of time even though classical physics would also allow the reversed motion as a solution (see Section [VI] for a further discussion). We expand this point in the context of Example [TOl The loss of stabilizability of the rear- wheeled steered bicycle at V = 1/7//? has the following interesting consequence. Imagine a video of a rear-wheeled steered bicycle being ridden stably at this critical speed. Let us assume that it is possible to verify from the video the actual speed (e.g., by knowing the frame-rate and observing markings on the ground). An observer with a good grounding in control theory would be led to the inescapable conclusion that the video had been made when the said bicycle was actually being ridden backwards in space (i.e., with a negative V) and then played backwards in time as well, giving the impression of a forward motion. VI. The arrow of time in physics The subject of the "arrow of time" is a well-known conundrum in physics. The second law of thermodynamics states that the entropy of a system increases with time. It is the time- asymmetry in this law which gives rise to the notion of the "thermodynamic arrow of time". The classical derivation of the second law in statistical mechanics due to Boltzmann is connected with a famous puzzle known as Loschmidt's paradox [40]. This essentially points out that the laws of mechanics used in the derivation of the second law are time- symmetric whereas the conclusion is not. Evidently the time-asymmetry creeps in through the statistical assumptions. An illuminating discussion of this issue is given in [29]. Other arrows of time have also been defined, for example (i) the "psychological arrow" — the direction in which time passes as perceived by a sentient being [14], [33], (ii) the "cosmological arrow" — the direction of time in which the universe is expanding. Hawking [14] argues that the thermodynamic and psychological arrows are always aligned with each other but these need not always be aligned with the cosmological arrow (though they are at present). In this paper we have described the time- asymmetry in the definition of control systems stability as a time-arrow. In the theory of dynamical systems there is also the notion of passivity, which again defines a time-arrow. For electrical circuits the time-arrow of passivity can be seen in the behaviour of the resistor, in contrast to the inductor and capacitor which are time- symmetric in their operation. If the electrical resistor were to operate backwards in time one would observe a resistor gathering low-grade heat from the environment and charging up a battery. This behaviour would be recognised as a violation of the second law of thermodynamics (see [20, pages 260, 390-2]). In a similar way, an ideal linear damper operating backwards in time extracts low-grade heat from the environment to create mechanical work, in violation of the second law. It seems that the arrow of time in passive systems or circuits coincides with, or is the same as, the thermodynamic arrow. How does the arrow of time for control system stability relate to other time arrows? It is highly unlikely that a control engineer who is designing a control system for a plant will give even a moment's thought to the preferred time arrow for control. Without expressing the thought, the designer will seek decaying free motion solutions in the direction in which time is perceived to be passing. In this way the arrow of time for control could be said to coincide with the psychological arrow. On the other hand, in biological systems, active control is ubiquitous. It is less obvious that, for example, homeostasis in a cell is aligned with the psychological arrow. Here we will be content to raise the question of whether the stability arrow for control systems in general can be directly related to the thermodynamic arrow, e.g. by considering information flow or the effect of internal energy sources. Finally, from a purely mathematical point of view, we observe that the arrow of time for control systems stability appears identical with the arrow of time for passivity. This supports the conclusion that the arrow of time for control systems stability always coincides with the thermodynamic (and psychological) arrow. VII. Feedback loops and time delays Let D T : x(t) h- > x(t — r) denote a time delay operator. It seems superfluous to say that D T is physically realisable for r > 0. Indeed the delay is a common feature of communication and control systems. For r < 0, D T is the ideal predictor which is not believed to be physically realisable as a "real-time" device. At first sight this "fact" appears to be self-evident, but its subtlety is revealed on closer examination — indeed, a rigorous justification appears not to be available at present. An insightful discussion of the issue of "causation" and its connection with the arrow of time is given in Price [29, Chapter 6]. Price's suggestion that the asymmetry of causation "is a projection of our own temporal asymmetry as agents in the world" [29, page 264] is similar to the view expressed by Bertrand Russell: "The law of causality, I believe, like much that passes muster among philosophers, is a relic of a bygone age, surviving, like the monarchy, only because it is erroneously supposed to do no harm". This prevalence in physics and philosophy of an anthropocentric explanation of causation sits in opposition to the belief of the unrealizability of a "prediction machine" out of physical components and processes, and suggests that a deeper analysis of the question is needed. In this paper we will not attempt to further debate the origin and explanation of causation. In the next section we will simply highlight the striking difference in behaviour of feedback loops with small delays versus predictors and confirm the difference using the forward-time gap metric. A. Feedback stability, delays and predictors Consider a feedback system which consists of an integrator in series with a time delay and negative unity feedback. The governing equation is where d(t) denotes an external disturbance. We set d(t) = and consider the totality of all free motion solutions of the system equations. If all solutions decay as t — > +00 we say the system is f-stable. This definition agrees with the one given in Section IIII-AI for finite-dimensional systems. For r > we can verify that (l20l) is f-stable. Taking Laplace transforms in (1201) gives We can verify that all zeros of s + e ST = are in the LHP so the system is f-stable. Now consider the case where r < 0. Note that this corresponds to an integrator with a predictor in negative feedback, which we would not expect to be realizable in the forward time direction. In fact, s + e~ ST has infinitely many zeros in the RHP for any r < and hence the system fails to be f-stable. It is evident that the system displays a discontinuity in the asymptotic (as t — > 00) behaviour of the free motion at the point r = 0. Let us now consider the closeness of the systems involved using the v-gap metric. Let P denote the integrator and P r denote the integrator in series with D r . Regarding these as operators on £2(0, 00) we have the graph: x(t) + x(t - t) = d(t) (20) 1 d(s). s + e for r > 0. Then s (l-e-)ll 00 which tends to zero as r 0. Also so providing |t| < n there are no crossings of the negative real axis of this function when s = juj. Hence, wno(G2(— s) T G\(s)) = 0, for r sufficiently small. This implies for t > and sufficiently small and 8 v j(P, P T ) — > as r — > 0. Now consider the case of P T with r < 0. Again regarding P r as an operator on £2(0, 00) which behaves like e ST for large s, so the winding number of this function is not zero and (J t)i/ (P,P T ) = lforr<0. The above analysis with the gap agrees with the earlier conclusion on f-stability. For t > 0, f-stability was retained for sufficiently small r, but lost for any r < 0. Now we have seen that, as long as t > 0, there is a small error in 5 v j, but for any r < 0, 5 v j(P, P T ) = 1. Finally, it is interesting to mention that the tolerance of feedback loops to small time- delays is guaranteed by a well-known sufficient condition that the high-frequency loop-gain of the feedback loop is smaller than one ([3], [6], [39], [42]) — a condition routinely met in practice. It is easy to check that robustness to an arbitrarily small "parasitic predictor" in the loop would be guaranteed theoretically by the loop-gain being greater than one at arbitrarily high frequencies — a condition that appears impossible to achieve in a real feedback system. 1) Stability is a time-asymmetric concept. The requirement of an asymptotic property as t tends to PLUS infinity defines a time arrow. 2) A stability definition which requires bounded outputs in response to bounded inputs does not obviously imply a time arrow. For signal spaces with support on a positive (resp. negative) half-line, the definition turns out to imply a positive (resp. negative) time arrow. 3) A bounded-input bounded-output definition of stability for signals with support on the doubly-infinite time-axis does not define a preferred time arrow. Stable systems defined by bounded multiplication operators may be stable in the sense of Lyapunov in the positive time direction, in the negative time direction or in neither direction. 4) The fact that the closure of the graph of an unstable causal system may coincide with the graph of a stable anti-causal system on the doubly-infinite time-axis need not be a fundamental obstacle in developing a usable control theory on the doubly-infinite time-axis. 5) Any method which modifies the BIBO definition of stability on the doubly-infinite time-axis to agree with conventional stability notions could be interpreted as the imposition of a positive time-arrow. < 5 B>/ (P,P T ) = ( 5 jCa (P,P T ) we have: and5 £2 (P,P T ) VIII. Synopsis 6) A time-conjugation operator on systems was defined as well as the concepts of f- stability and b-stability. 7) Both the finite-horizon and infinite-horizon quadratic regulators give a different optimal cost for a system running forwards in time and backwards in time. In the infinite horizon case the optimal cost can be expressed in terms of the two extremal solutions of the appropriate algebraic Riccati equation. 8) The role of the positive time arrow in the gap metric measure of uncertainty for dynamical systems was highlighted. The usual 7i 2 -gap metric inherits the positive time arrow by virtue of systems being defined as operators on the positive half-line. The £ 2 -gap metric, which is well known to define an inappropriate topology for robust control, does not have a preferred time-direction due to the underlying operators being defined on the double-axis. The v-gap metric may be interpreted as the C 2 -gap with an imposed time-arrow. 9) A time-conjugated v-gap metric was defined to measure closeness for robust b-stabilisation. It was seen that closeness of systems in the forward and backwards directions is a strong condition which includes the requirement of equal McMillan degrees. 10) It was seen that ease or difficulty of control as measured by optimal robustness in the gap metric is a property that depends on the time-arrow. 11) The situation of a plant which is easy to control in one time-direction but impossible to control in the other shows that irreversibility can be intimately related to control. 12) An engineering perspective of control suggests a close link between the control system stability arrow and the psychological arrow. Unified mathematical frameworks for passive circuits and feedback control suggest a close link between the control system stability arrow and the thermodynamic arrow. The question was raised whether the stability arrow for control systems can be directly related to the thermodynamic arrow. 13) The issue of the non-realizability of the pure predictor as a "real-time" device and the connection with the arrow of time was highlighted as well as the difficulty of es- tablishing non-realizability rigorously. The strongly contrasting behaviour of feedback loops in the presence of arbitrarily small time-delays or predictors was pointed out. IX. Acknowledgement We are grateful to Jan Willems for helpful comments on an earlier draft. References [1] B.D.O. Anderson and J.B. Moore, Optimal control: linear quadratic methods, Prentice-Hall, 1990. [2] K.J. Astrom, R.E. Klein, and A. Lennartsson, "Bicycle dynamics and control: adapted bicycles for education and research," IEEE Control Systems Magazine, 25 (4): 26-47, August 2005. [3] J.F. Barman, EM. Callier, and C.A. Desoer, "L 2 -stability and L 2 -instability of linear time-invariant distributed feedback systems perturbed by a small delay in the loop," IEEE Trans, on Automatic Contr. , 18(5): 479-484, October 1973. [4] R. W. Brockett and J. C. Willems, "Stochastic control and the second law of thermodynamics," in the Proc. of the IEEE Conference on Decision and Control, San Diego, California, pp. 1007-1011, 1978 [5] H. Sandberg, J.C. Delvenne, and J.C. Doyle, "Linear-quadratic-Gaussian heat engines," in the Proc. of the IEEE Conference on Decision and Control, pages 3102-3107, December 2007. [6] T.T. Georgiou and M.C. Smith, "w-Stability of feedback systems," Systems & Control Letters, 13 (4): 271-277, November 1989. [7] T.T. Georgiou and M.C. Smith, "Graphs, causality and stabilizability: linear, shift-invariant systems on I/2[0, oo)," Math, of Control Signals and Systems, 6, 195-223, 1993. [8] T.T. Georgiou and M.C. Smith, "Intrinsic difficulties in using the doubly-infinite time axis for input-output systems theory," IEEE Trans, on Automatic Contr. , 40(3): 516-518, March 1995. [9] T.T. Georgiou and M.C. Smith, "Optimal robustness in the gap metric," IEEE Trans, on Automat. Control, 35, 673-686, 1990. [10] T.T. Georgiou, C. Shankwitz and M.C. Smith, "Identification of linear systems: a stochastic approach based on the graph," Proceedings of the 1992 American Control Conference, Chicago, June 1992, pp. 307-312. [11] K. Glover and D. McFarlane, "Robust stabilization of normalized coprime factor plant descriptions with TJoo- bounded uncertainty," IEEE Trans, on Automat. Contr, vol. 34, pp. 821-830, 1989. [12] M. Green and D.J.N. Limebeer, Linear Robust Control, Prentice Hall, 1995. [13] W. M. Haddad, V. S. Chellaboina, and S. G. Nersesov, Thermodynamics: A Dynamical Systems Approach, Princeton University Press, 2005. [14] S.W. Hawking, A brief history of time, Bantam Books, 1988. [15] B. Jacob, "What is the better signal space for discrete-time systems: &j(Z) or ^(No)?" S1AM J. Contr. and Opt., 43 (4): 1521-1534, 2004. [16] B. Jacob, "An operator theoretical approach towards systems over the signal space ^(Z)", Integral Equations and Operator Theory, 46 (2): 189-214, June 2003. [17] B. Jacob, J.R. Partington, "Graphs, closability, and causality of linear time-invariant discrete-time systems," International J. on Control, 73 (11): 1051-1060, July 2000. [18] J. B. Johnson, "Thermal agitation of electricity in conductors," Phys. Rev., 32: 97-109, July 1928. [19] M.G. Krein and MA. Krasnosel'skii, "Fundamental theorems concerning the extension of Hermitian operators and some of their applications to the theory of orthogonal polynomials and the moment problem (in Russian)," Uspekhi Mat. Nauk., vol. 2, pp. 60-106, 1947. [20] D. Kondepudi and I. Prigogine, Modern thermodynamics: from heat engines to dissipative structures, John Wiley & Sons, 1998. [21] D.J.N. Limebeer and R.S. Sharp, "Bicycles, Motorcycles, and Models," IEEE Control Systems Magazine, 26 (5) 34-61, October 2006. [22] P.M. Makila and J.R. Partington, "A two-operator approach to robust stabilization of linear systems on K," International J. on Control, 79 (9): 1026-1038 Sept. 2006. [23] P.M. Makila, J.R. Partington, "Input-output stabilization of linear systems on Z," IEEE Trans, on Automat. Control, 49 (11): 1916-1928, November 2004. [24] P.M. Makila, J.R. Partington, "Input-output stabilization on the doubly-infinite time axis," International J. on Control, 75 (13): 981-987, September 2002. [25] P.M. Makila, "When is a linear convolution system stabilizable?" Systems & Control Letters, 46 (5): 371-378, August 2002. [26] D.G. Meyer and G.F. Franklin, "A connection between normalized coprime factorizations and linear quadratic regulator theory", IEEE Trans, on Automat. Contr, 32, 227-228, 1987. [27] S.K. Mitter and N.J. Newton, "Information and entropy flow in the Kalman-Bucy filter," J. of Statistical Physics, 118: 145-176, 2005. [28] H. Nyquist, "Thermal agitation of electric charge in conductors," Phys. Rev., 32: 110-113, July 1928. [29] H. Price, Time's Arrow and Archimedes' Point, Oxford University Press, New York, 1996. [30] B. Russell, "On the Notion of Cause", Proceedings of the Aristotelian Society, 13 (1913), pp. 1-26. [31] H. Sandberg, J.C. Delvenne, and J.C. Doyle, "The Statistical Mechanics of Fluctuation-Dissipation and Measurement Back Action," in the Proc. of the American Control Conference, 2007, available at |http://arxiv.or g/abs/math.DS/ d"611628| [32] B. Sz.-Nagy, "Perturbations des transformations autoadjointes dans l'espace de Hilbert," Comm. Math. Helv., vol. 19, pp. 347-366, 1947. [33] L.S. Schulman, Time's arrow and quantum measurement, Cambridge University Press, 1997. [34] B.W. Schumacher, "Demonic Heat Engines", in Physical Origins of Time Asymmetry, Eds. J.J. Halliwell, J. Perez- Mercader, and W.H. Zurek, Cambridge University Press, 1994. [35] M. Vidyasagar, "Normalised coprime factorizations for nonstrictly proper systems", IEEE Trans, on Automat. Contr, 33, 300-301, 1988. [36] G. Vinnicombe, "Frequency domain uncertainty and the graph topology," IEEE Trans, on Automat. Control, 38, 1371-1383, 1993. [37] G. Vinnicombe, Uncertainty and Feedback: H^, loop-shaping and the v-gap metric, Imperial College Press, 2001. [38] J.L. Willems, Stability Theory of Dynamical Systems, Thomas Nelson and Sons Ltd., London, 1970. [39] J. C. Willems, The Analysis of feedback systems, MIT Press, 1971. [40] http://en.wikipedia.org/wiki/Loschmidt's_paradox [41] D.C. Youla, "On the factorization of rational matrices," IRE Transactions of Information Theory, 7, 172-189, 1961. [42] G. Zames, "Realizability Condition for Nonlinear Feedback Systems," IEEE Trans, on Circuits Theory, 11(2): 186- 194, June 1964. [43] G. Zames and A.K. El-Sakkary, "Unstable systems and feedback: The gap metric," Proceedings of the Allerton Conference, pp. 380-385, October 1980. [44] K. Zhou, J.C. Doyle and K. Glover, Robust and optimal control, Prentice-Hall, 1995.