Discrete-Time Linear-Quadratic Regulator

Infinite horizon

For a linear time-invariant discrete-time system $x_{k + 1} = \underbrace{A x_{k} + B u_{k}}_{f (x_{k}, u_{k}, k)}$ and a quadratic total cost $J (x_{0}, \{u_{i}\}_{i = 0}^{\infty}, 0) = \sum_{i = 0}^{\infty} \underbrace{x_{i}^{⊤} Q x_{i} + u_{i}^{⊤} R u_{i}}_{l (x_{i}, u_{i}, i)}, \quad Q ⪰ 0, \ R ≻ 0$ of its trajectory $\{x_{i}\}_{i = 0}^{\infty}$, the optimal controller can be derived based on the assumption that the value function takes the form $V (x_{k}, k) = x_{k}^{⊤} S x_{k}, \quad S ≻ 0.$

Substituting this form into the Bellman equation together with the system's dynamics, we obtain $x_{k}^{⊤} S x_{k} = \min_{u_{k}} \left\{ x_{k}^{⊤} Q x_{k} + u_{k}^{⊤} R u_{k} + (A x_{k} + B u_{k})^{⊤} S (A x_{k} + B u_{k}) \right\}. \qquad (1)$ To find the minimum, we take the gradient of the argument (which is by design quadratic and convex) with respect to $u_{k}$, set it to zero, and solve for the optimal control input $u_{k}^{*} = - (R + B^{⊤} S B)^{- 1} B^{⊤} S A x_{k}.$
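The gradient step can be spelled out as follows; this is a sketch that assumes $S$ and $R$ are symmetric, and the symbol $g$ is introduced here only for illustration. Let $g (u_{k}) = x_{k}^{⊤} Q x_{k} + u_{k}^{⊤} R u_{k} + (A x_{k} + B u_{k})^{⊤} S (A x_{k} + B u_{k})$ denote the argument of the minimum in (1). Then $\nabla_{u_{k}} g = 2 R u_{k} + 2 B^{⊤} S (A x_{k} + B u_{k}).$ Setting the gradient to zero gives the linear equation $(R + B^{⊤} S B) u_{k} = - B^{⊤} S A x_{k}.$ Since $R ≻ 0$ and $S ≻ 0$, the matrix $R + B^{⊤} S B$ is positive definite and therefore invertible, and the Hessian $\nabla_{u_{k}}^{2} g = 2 (R + B^{⊤} S B) ≻ 0$ confirms that the stationary point is the unique minimizer.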

The optimal input can then be substituted back into (1). As the equation must hold for all $x_{k}$, basic manipulations yield the discrete-time algebraic Riccati equation (DARE) $S = Q + A^{⊤} S A - A^{⊤} S B (B^{⊤} S B + R)^{- 1} B^{⊤} S A, \quad S ≻ 0.$
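One simple way to get a numerical feel for the DARE is to iterate its right-hand side until it stops changing. The sketch below does this with NumPy. The function names (`lqr_gain`, `riccati_step`, `solve_dare`), the tolerance, and the double-integrator example system are all illustrative assumptions, not taken from the text. The iteration is assumed to converge, which is the case when $(A, B)$ is stabilizable and $(A, Q^{1/2})$ is detectable. Dedicated solvers are generally preferable in practice.

```python
import numpy as np


def lqr_gain(A, B, R, S):
    """Return K such that u* = -K x, i.e. K = (R + B'SB)^{-1} B'SA."""
    BtS = B.T @ S
    return np.linalg.solve(R + BtS @ B, BtS @ A)


def riccati_step(A, B, Q, R, S):
    """Evaluate the DARE right-hand side Q + A'SA - A'SB (B'SB + R)^{-1} B'SA."""
    K = lqr_gain(A, B, R, S)
    return Q + A.T @ S @ A - A.T @ S @ B @ K


def solve_dare(A, B, Q, R, tol=1e-10, max_iter=10_000):
    """Fixed-point iteration on the DARE, started from S = Q (hypothetical helper)."""
    S = Q.copy()
    for _ in range(max_iter):
        S_next = riccati_step(A, B, Q, R, S)
        if np.max(np.abs(S_next - S)) < tol:
            return S_next
        S = S_next
    raise RuntimeError("DARE iteration did not converge")


# Illustrative example (an assumption): a discrete-time double integrator.
A = np.array([[1.0, 1.0], [0.0, 1.0]])
B = np.array([[0.0], [1.0]])
Q = np.eye(2)
R = np.array([[1.0]])

S = solve_dare(A, B, Q, R)
K = lqr_gain(A, B, R, S)  # optimal state feedback u_k = -K x_k

# The closed-loop matrix A - BK should have all eigenvalues inside the unit circle.
closed_loop_eigs = np.linalg.eigvals(A - B @ K)
print("S =\n", S)
print("K =", K)
print("|eig(A - BK)| =", np.abs(closed_loop_eigs))
```

The gain `K` computed from the converged `S` is the matrix appearing in $u_{k}^{*} = - (R + B^{⊤} S B)^{- 1} B^{⊤} S A x_{k}$, so the same helper serves both the iteration and the final controller.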

Finite horizon

For a linear time-invariant discrete-time system $x_{k + 1} = \underbrace{A x_{k} + B u_{k}}_{f (x_{k}, u_{k}, k)}$ and a quadratic total cost $J (x_{0}, \{u_{i}\}_{i = 0}^{N - 1}, 0) = \underbrace{x_{N}^{⊤} Q_{N} x_{N}}_{Φ (x_{N})} + \sum_{i = 0}^{N - 1} \underbrace{x_{i}^{⊤} Q x_{i} + u_{i}^{⊤} R u_{i}}_{l (x_{i}, u_{i}, i)}, \quad Q_{N} ⪰ 0, \ Q ⪰ 0, \ R ≻ 0$ of its trajectory $\{x_{i}\}_{i = 0}^{N}$, the optimal controller can be derived based on the assumption that the value function takes the form $V (x_{k}, k) = x_{k}^{⊤} S_{k} x_{k}, \quad S_{k} ≻ 0.$

Substituting this form into the Bellman equation together with the system's dynamics, we obtain $x_{k}^{⊤} S_{k} x_{k} = \min_{u_{k}} \left\{ x_{k}^{⊤} Q x_{k} + u_{k}^{⊤} R u_{k} + (A x_{k} + B u_{k})^{⊤} S_{k + 1} (A x_{k} + B u_{k}) \right\}. \qquad (1)$ To find the minimum, we take the gradient of the argument (which is by design quadratic and convex) with respect to $u_{k}$, set it to zero, and solve for the optimal control input $u_{k}^{*} = - (R + B^{⊤} S_{k + 1} B)^{- 1} B^{⊤} S_{k + 1} A x_{k}.$

The optimal input can then be substituted back into (1). As the equation must hold for all $x_{k}$, basic manipulations yield the discrete-time dynamic Riccati equation (DDRE) $S_{k} = Q + A^{⊤} S_{k + 1} A - A^{⊤} S_{k + 1} B (B^{⊤} S_{k + 1} B + R)^{- 1} B^{⊤} S_{k + 1} A, \quad S_{N} = Q_{N}$ for a finite horizon $N \in \mathbb{N}$, which is solved backward in time for $k = N - 1, \dots, 0$.
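The backward recursion can be sketched in a few lines of NumPy. The function name `finite_horizon_lqr`, the horizon, the weights, and the double-integrator example are illustrative assumptions, not part of the text. The sketch also simulates the closed loop and compares the accumulated cost with $x_{0}^{⊤} S_{0} x_{0}$, which the assumed form of the value function predicts to be equal.

```python
import numpy as np


def finite_horizon_lqr(A, B, Q, R, Q_N, N):
    """Run the DDRE backward from S_N = Q_N; return lists S[0..N] and K[0..N-1]."""
    S = [None] * (N + 1)
    K = [None] * N
    S[N] = Q_N
    for k in range(N - 1, -1, -1):
        BtS = B.T @ S[k + 1]
        # Time-varying gain, u_k* = -K[k] x_k
        K[k] = np.linalg.solve(R + BtS @ B, BtS @ A)
        S[k] = Q + A.T @ S[k + 1] @ A - A.T @ S[k + 1] @ B @ K[k]
    return S, K


# Illustrative example (an assumption): a discrete-time double integrator.
A = np.array([[1.0, 1.0], [0.0, 1.0]])
B = np.array([[0.0], [1.0]])
Q = np.eye(2)
R = np.array([[1.0]])
Q_N = 10.0 * np.eye(2)
N = 20

S, K = finite_horizon_lqr(A, B, Q, R, Q_N, N)

# Simulate the closed loop and accumulate the total cost J.
x0 = np.array([1.0, 0.0])
x = x0.copy()
cost = 0.0
for k in range(N):
    u = -K[k] @ x
    cost += x @ Q @ x + u @ R @ u
    x = A @ x + B @ u
cost += x @ Q_N @ x  # terminal cost

predicted = x0 @ S[0] @ x0  # V(x_0, 0) = x_0' S_0 x_0
print("simulated cost:", cost)
print("x0' S_0 x0:    ", predicted)
```

Unlike the infinite-horizon case, the gain here depends on the time step, so the controller has to store the whole sequence `K[0], ..., K[N-1]` rather than a single matrix.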
