Question

Consider the unconstrained objective[latex]\ f(C) = 3C^2_1 + 2C_1C_2 + 3C^2_2 - 16C_1 - 8C_2 [/latex]. Since f(C) is quadratic, its discriminant B² - 4AC = 2² - (4)(3)(3) = -32 is negative, and the squared terms have positive coefficients, its graph is shaped like a bowl with elliptical cross-sections. Accordingly, there is one unique global minimum.

The exact minimum can be found by setting the two partial derivatives equal to zero and solving for[latex]\ C_1 [/latex] and[latex]\ C_2 [/latex]:

[latex]\ \frac{\partial f}{\partial C_1}=6C_1+2C_2-16=0,\\[0.3cm] \frac{\partial f}{\partial C_2}=2C_1+6C_2-8=0. [/latex] (10.20)

Solving Equations (10.20) simultaneously gives[latex]\ C = (\frac{5}{2}, \frac{1}{2}) [/latex]. With elementary calculus, this is a trivial problem. However, assuming that we have no access to derivative information, solve the problem numerically using the cyclic coordinates algorithm with the initial trial point at[latex]\ C^{(0)} = (0,0). [/latex]
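As a check on the analytic solution, the discriminant and the solution of Equations (10.20) can be evaluated with exact rational arithmetic. This is only a minimal sketch: the variable names (`A`, `B`, `Cq`, `a11`, and so on) are illustrative and do not come from the text, and Cramer's rule is used simply because the system is 2 × 2.

```python
from fractions import Fraction

# Quadratic part of f: A*C1^2 + B*C1*C2 + Cq*C2^2 (illustrative names).
A, B, Cq = 3, 2, 3
discriminant = B * B - 4 * A * Cq  # negative => elliptical contours

# Equations (10.20) written as a 2x2 linear system:
#   6*C1 + 2*C2 = 16
#   2*C1 + 6*C2 = 8
a11, a12, b1 = Fraction(6), Fraction(2), Fraction(16)
a21, a22, b2 = Fraction(2), Fraction(6), Fraction(8)

# Cramer's rule for the stationary point.
det = a11 * a22 - a12 * a21
c1 = (b1 * a22 - a12 * b2) / det
c2 = (a11 * b2 - b1 * a21) / det

print("discriminant:", discriminant)
print("stationary point:", c1, c2)
```

Because the arithmetic is rational, the stationary point comes out as exact fractions rather than rounded decimals.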

Accepted Answer

The projection of objective f onto the[latex]\ C_1-C_2 [/latex] plane (commonly referred to as a contour plot) is shown in Figure 10.18. Note that all directions traveled in the cyclic coordinate search are parallel to the[latex]\ C_1 [/latex] and[latex]\ C_2 [/latex] axes. The first direction is parallel to the[latex]\ C_1 [/latex] axis, so the initial direction vector is[latex]\ v^{(0)} = (1,0) [/latex]. The update to the next point[latex]\ C^{(1)} [/latex] is

[latex]\ C^{(1)}=C^{(0)}+\xi v^{(0)}=(0,0)+\xi (1,0)=(\xi ,0), [/latex]

where[latex]\ \xi = \xi^{(1)} [/latex] for simplicity of notation. It follows that the objective function at the new point is[latex]\ f(C^{(1)}) =f(\xi, 0) = 3\xi^2 - 16\xi [/latex]. Instead of using a line search, such as the golden ratio algorithm, we can find the minimum by setting the derivative of f equal to zero. Since f is now a function only of the variable ξ,

[latex]\ \frac{d}{d\xi}f(C^{(1)})=6\xi-16=0,\quad \xi=\frac{8}{3}. [/latex]

Therefore,[latex]\ C^{(1)} = (\frac{8}{3}, 0). [/latex]
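If the derivative with respect to ξ were not available, the same first step could be found with a golden ratio line search. The following is a minimal sketch of that alternative, not the method used in this solution. The bracket [0, 5] and the tolerance are assumptions chosen for illustration, and the function name `golden_section` is hypothetical.

```python
import math

def f(c1, c2):
    """Objective from the example."""
    return 3 * c1**2 + 2 * c1 * c2 + 3 * c2**2 - 16 * c1 - 8 * c2

def golden_section(phi, lo, hi, tol=1e-6):
    """Minimize a unimodal one-variable function phi on [lo, hi]."""
    ratio = (math.sqrt(5) - 1) / 2  # golden ratio conjugate, about 0.618
    x1 = hi - ratio * (hi - lo)
    x2 = lo + ratio * (hi - lo)
    f1, f2 = phi(x1), phi(x2)
    while hi - lo > tol:
        if f1 < f2:
            # Minimum lies in [lo, x2]; the old x1 becomes the new x2.
            hi, x2, f2 = x2, x1, f1
            x1 = hi - ratio * (hi - lo)
            f1 = phi(x1)
        else:
            # Minimum lies in [x1, hi]; the old x2 becomes the new x1.
            lo, x1, f1 = x1, x2, f2
            x2 = lo + ratio * (hi - lo)
            f2 = phi(x2)
    return (lo + hi) / 2

# Search along v = (1, 0) from C = (0, 0); the bracket [0, 5] is assumed.
xi = golden_section(lambda s: f(s, 0.0), 0.0, 5.0)
print(xi)  # close to 8/3
```

Each pass reuses one of the two interior function values, so only one new evaluation of f is needed per iteration.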

The question now is whether or not the process terminates. The Cauchy criterion compares[latex]\ C^{(1)} [/latex] against[latex]\ C^{(0)} [/latex] by simply considering the Euclidean two-norm, which is the same as the Euclidean distance between the two points. In this case,

[latex]\ \left\|C^{(1)}-C^{(0)}\right\|=\left\|(\frac{8}{3},0)-(0,0)\right\| =\frac{8}{3}. [/latex]

Since the norm is relatively large, the iterate has moved a significant distance, and thus we continue on to the next iteration. However, this time we go in the[latex]\ C_2 [/latex] direction:[latex]\ v^{(1)} = (0, 1) [/latex]. Proceeding as in the first iteration,

[latex]\ C^{(2)}=C^{(1)}+\xi v^{(1)}=(\frac{8}{3},0)+\xi (0,1)=(\frac{8}{3},\xi),\\[0.3cm] f(C^{(2)}) =f(\frac{8}{3},\xi) = 3\xi^2 - \frac{8}{3}\xi-\frac{64}{3}, \\[0.3cm] \frac{d}{d\xi}f(C^{(1)}+\xi v^{(1)})=6\xi-\frac{8}{3}=0. [/latex]

It follows from this that[latex]\ \xi=\frac{4}{9} [/latex] and[latex]\ C^{(2)} =(\frac{8}{3}, \frac{4}{9}). [/latex] The norm defining the stopping criterion is now

[latex]\ \left\|C^{(2)}-C^{(1)}\right\|=\left\|(\frac{8}{3},\frac{4}{9})-(\frac{8}{3},0)\right\| =\frac{4}{9}, [/latex]

but this is still not "small".

Therefore, we continue - but we cycle back to the vector[latex]\ v^{(2)} = (1,0). [/latex] This gives

[latex]\ f(C^{(2)}+\xi v^{(2)})=3\xi^2+\frac{8}{9}\xi-\frac{592}{27}. [/latex]

This is minimized when[latex]\ \xi=-\frac{4}{27}\approx -0.1481, [/latex] which implies that

[latex]\ C^{(3)}=C^{(2)}-\frac{4}{27}v^{(2)}\approx (2.5185,0.4444). [/latex]

It follows that the norm is[latex]\ \left\|C^{(3)}-C^{(2)}\right\| \approx 0.1481 [/latex], which we might consider a sufficiently small change in position, and therefore terminate the process. Table 10.5 lists the sequence of iterates to two decimal places. Note that even though the intermediate values are rounded, convergence to the correct solution (2.5, 0.5) is still accomplished in a timely manner. Also, we observe that in the case of cyclic coordinates, the norm is simply |ξ|.
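The third line minimization can be checked exactly. For a quadratic objective with constant Hessian H, the restriction to a line is f(C + ξv) = ½(vᵀHv)ξ² + (∇f(C)·v)ξ + f(C), which is minimized at ξ = -(∇f(C)·v)/(vᵀHv). The sketch below applies this identity at C⁽²⁾ with rational arithmetic; the helper name `line_quadratic` is hypothetical and introduced only for illustration.

```python
from fractions import Fraction as F

H = ((6, 2), (2, 6))  # constant Hessian of f

def f(c):
    c1, c2 = c
    return 3 * c1**2 + 2 * c1 * c2 + 3 * c2**2 - 16 * c1 - 8 * c2

def grad(c):
    c1, c2 = c
    return (6 * c1 + 2 * c2 - 16, 2 * c1 + 6 * c2 - 8)

def line_quadratic(c, v):
    """Coefficients (a, b, k) of f(c + xi*v) = a*xi^2 + b*xi + k."""
    g = grad(c)
    Hv = (H[0][0] * v[0] + H[0][1] * v[1],
          H[1][0] * v[0] + H[1][1] * v[1])
    a = F(v[0] * Hv[0] + v[1] * Hv[1], 2)  # half of v^T H v
    b = g[0] * v[0] + g[1] * v[1]          # directional derivative at c
    return a, b, f(c)

c2_point = (F(8, 3), F(4, 9))  # C^(2) from the second iteration
v = (1, 0)                     # back to the C1 direction

a, b, k = line_quadratic(c2_point, v)
xi = -b / (2 * a)              # minimizer of the restricted quadratic
c3_point = (c2_point[0] + xi * v[0], c2_point[1] + xi * v[1])

print(a, b, k)                 # coefficients of the restricted quadratic
print(xi, c3_point, float(c3_point[0]))
```

Since only one coordinate changes per step, the distance between successive iterates is just the absolute value of `xi`.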

TABLE 10.5 Sequence of Iterations for Example 10.9 Using the Cyclic Coordinates Algorithm.

[latex]\ \begin{array}{ccccc} \hline i&C^{(i)}&v^{(i)}&\xi&f(C^{(i)})\\ \hline 0&(0.00,0.00)&(1,0)&2.67&0.000\\ 1&(2.67,0.00)&(0,1)&0.44&-21.333\\ 2&(2.67,0.44)&(1,0)&-0.15&-21.923\\ 3&(2.52,0.44)&(0,1)&0.05&-21.990\\ 4&(2.52,0.49)&(1,0)&-0.02&-22.000\\ 5&(2.50,0.49)&(0,1)&0.01&-22.000\\ 6&(2.50,0.50)&(1,0)&0.00&-22.000\\ \hline \end{array} [/latex]
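The whole iteration can be sketched in a few lines. This is a hedged illustration rather than the text's implementation: each line minimization uses the closed-form root of the corresponding partial derivative, the stopping tolerance of 0.01 is an assumption (the text leaves "small" unspecified), and the function names are hypothetical. Values are not rounded at intermediate steps, so the printed f values can differ from the table in the third decimal place.

```python
import math

def f(c):
    c1, c2 = c
    return 3 * c1**2 + 2 * c1 * c2 + 3 * c2**2 - 16 * c1 - 8 * c2

def coordinate_step(c, axis):
    """Exact minimizer of f along one axis, holding the other coordinate fixed."""
    c1, c2 = c
    if axis == 0:
        # df/dC1 = 6*C1 + 2*C2 - 16 = 0
        return ((16 - 2 * c2) / 6, c2)
    # df/dC2 = 2*C1 + 6*C2 - 8 = 0
    return (c1, (8 - 2 * c1) / 6)

def cyclic_coordinates(c0, tol=1e-2, max_iter=100):
    """Alternate between the C1 and C2 directions until the step is below tol."""
    history = [c0]
    c = c0
    for i in range(max_iter):
        new = coordinate_step(c, i % 2)
        step = math.hypot(new[0] - c[0], new[1] - c[1])  # Cauchy criterion
        c = new
        history.append(c)
        if step < tol:
            break
    return history

history = cyclic_coordinates((0.0, 0.0))
for i, c in enumerate(history):
    print(i, f"({c[0]:.2f}, {c[1]:.2f})", f"{f(c):.3f}")
```

With the assumed tolerance, the loop stops after six line minimizations, which corresponds to rows 0 through 6 of the table.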
