Научная статья на тему 'State-feedback control principles for inverted pendulum with hysteresis in suspension'

State-feedback control principles for inverted pendulum with hysteresis in suspension Текст научной статьи по специальности «Физика»

CC BY
66
12
i Надоели баннеры? Вы всегда можете отключить рекламу.
Ключевые слова
ГИСТЕРЕЗИСНАЯ НЕЛИНЕЙНОСТЬ / HYSTERETIC NONLINEARITY / ЛЮФТ / BACKLASH / ПРИНЦИПЫ ОБРАТНОЙ СВЯЗИ / FEEDBACK PRINCIPLES / ОПТИМАЛЬНОЕ УПРАВЛЕНИЕ / OPTIMAL CONTROL

Аннотация научной статьи по физике, автор научной работы — Semenov Mikhail E., Abbas Zainib Hatif, Ishchuk Igor N., Kanishcheva Olesya I., Meleshenko Peter A.

In this paper we consider the mathematical model of the inverted pendulum with the hysteretic nonlinearity (in the form of a backlash) under state feedback control. The analytic results for the stability criteria as well as for the solution of the linearized equation are observed and analyzed. The theorems that determine the stabilization of the considered system are formulated and discussed. The question on the optimal control of the system under consideration is also discussed and the corresponding theorem is considered.

i Надоели баннеры? Вы всегда можете отключить рекламу.
iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.
i Надоели баннеры? Вы всегда можете отключить рекламу.

Управление перевернутым маятником с гистерезисом в точке подвеса с использованием принципов обратной связи

В представленной статье рассматривается математическая модель управления перевернутым маятником с гистерезисной нелинейностью в виде люфта, основыванная на принципах обратной связи. Получены и проанализированы аналитические результаты для критериев устойчивости, а также решения линеаризованного уравнения. Сформулированы и доказаны теоремы, определяющие стабилизацию рассматриваемой системы. Изучен вопрос об оптимальном управлении данной системы и сформулирована соответствующая теорема.

Текст научной работы на тему «State-feedback control principles for inverted pendulum with hysteresis in suspension»

УДК 51-74, 517.9, 531.536

State-feedback Control Principles for Inverted Pendulum with Hysteresis in Suspension

Mikhail E. Semenov*

Zhukovsky-Gagarin Air Force Academy, Starykh Bolshevikov, 54-A, Voronezh, 394064 Voronezh State University, Universitetskaya, 1, Voronezh, 394006 Voronezh State University of Architecture and Civil Engineering XX-letiya Oktyabrya, 84, Voronezh, 394006

Russia

Zainib Hatif Abbas^

Voronezh State University of Architecture and Civil Engineering XX-letiya Oktyabrya st. 84, Voronezh, 394006

Russia

Igor N. Ishchuk*

Zhukovsky-Gagarin Air Force Academy, Starykh Bolshevikov, 54-A, Voronezh, 394064 Siberian Federal University, Svobodny, 79, Krasnoyarsk, 660041

Russia

Olesya I. Kanishcheva§

Zhukovsky-Gagarin Air Force Academy, Starykh Bolshevikov, 54-A, Voronezh, 394064 Voronezh State University of Architecture and Civil Engineering XX-letiya Oktyabrya, 84, Voronezh, 394006

Russia

Peter A. Meleshenko^

Zhukovsky-Gagarin Air Force Academy, Starykh Bolshevikov, 54-A, Voronezh, 394064 Voronezh State University, Universitetskaya, 1, Voronezh, 394006

Russia

Received 27.04.2016, received in revised form 20.06.2016, accepted 15.08.2016 In this paper we consider the mathematical model of the inverted pendulum with the hysteretic nonlinearity (in the form of a backlash) under state feedback control. The analytic results for the stability criteria as well as for the solution of the linearized equation are observed and analyzed. The theorems that determine the stabilization of the considered system are formulated and discussed. The question on the optimal control of the system under consideration is also discussed and the corresponding theorem is considered.

Keywords: hysteretic nonlinearity, backlash, feedback principles, optimal control. DOI: 10.17516/1997-1397-2016-9-4-498-509.

* mkl150@mail.ru

tzainab.hatiff@gmail.com

£ boerby@rambler.ru

§ oleka_olesya@mail.ru ^ melechp@yandex.ru © Siberian Federal University. All rights reserved

Introduction

The problem of the inverted pendulum is of more than 100 years history [1-3], however it remains relevant even in the present days [4-8]. As is well known the model of the inverted pendulum plays the central role in the control theory [4,7,9-12]. Moreover, it is well established benchmark problem that provides many challenging problems to control design. Because of their nonlinear nature pendulums have maintained their usefulness and they are now used to illustrate many of the ideas emerging in the field of nonlinear control [13]. Typical examples are feedback stabilization, variable structure control, passivity based control , back-stepping and forwarding, nonlinear observers, friction compensation, and nonlinear model reduction.

It should also be noted that the model of the inverted pendulum (especially, under various kinds of control of motion of the suspension point) is widely used in the various fields of physics, applied mathematics, engineer sciences, neuroscience, economics and others. In order to make an adequately description of the dynamics of real physical and mechanical systems it is necessary to take into account the effects of hysteretic nature such as "backlash", '"tops" etc. [14]. The mathematical models of such nonlinearities according to the classical patterns of Krasnosel'skii and Pokrovskii [15], reduce to operators that are treated as converters in an appropriate function spaces. The dynamics of such converters are described by the relation of "input-state" and "stateoutput".

As is known, most of the real physical and technical systems contain a various kind of parts that can be represented as a cylinder with a piston [14]. The backlashes appear in such systems during its long operation due to the "aging" of the materials. As was mentioned above such backlashes are of hysteretic nature and the analysis of such nonlinearities is quiet important and actual problem [16,17]. In this paper we investigate the problem of the state feedback control of the inverted pendulum with the hysteretic nonlinearity in the form of backlash.

1. Inverted pendulum with hysteretic nonlinearity

In this section we briefly describe the mathematical model of the inverted pendulum with the horizontal moving suspension point. Also, in terms of the in-out converter we mathematically describe such a nonlinearity as backlash.

1.1. Mathematical model

The equations of motion of the inverted pendulum with the horizontal moving suspension point together with the initial conditions (see the Fig. 1) can be written in the following form:

Ap = mgl sin p — mul cos p,

(1)

p(0) = po, p(0) = w(0) = wo,

u(t) = r[u0,h]x(t), (2)

where A is a general moment of inertia of the pendulum, p(t) is the angle of vertical deviation of the pendulum, u(t) is a law of motion for the cylinder of the length h, x(t) is a law of motion for the piston which can be interpreted as a control parameter, r[u0, h] is defined below.

The equation (1) describes the so-called in-out relations of the hysteretic converter in the form of backlash.

y, i # m

-k k i

1 w

1

P C X

Fig. 1. General view of the inverted pendulum with the suspension point in the form of a cylinder C with a piston P

In the following we will consider the case, when the acceleration of the piston is constant, namely

\x\ = k = const.

Let us assume also that the deviations of the pendulum are small so we can rewrite the equation (1) in the linearized form:

A< = mgl< — mul,

(3)

<(0) = <0, <¿>(0) = = w0.

1.2. Backlash as hysteresis non-linearity

The mathematical models of the hysteresis non-linearities according to the classical patterns of Krasnosel'skii and Pokrovskii [15], reduce to operators that are treated as converters in an appropriate function spaces. The dynamics of such converters are described by the relation of "input-state" and "state-output".

The out state of the converter in the form of backlash (such an out state is considered on the monotonic inputs) can be described by the following relation

{no, for uo ^ x(t) ^ uo + h, x(t), for x(t) < u0, (4)

x(t) — h, for u0 + h < x(t).

This relation can be illustrated by the Fig. 2.

With a special limit construction and using the semigroup identity in the form

r[u(ti), h]x(t) = r [r[u0, hjx(ti), h] x(t), (5)

the r-operator can be applied to all continuous inputs. It should also be noted that the presence of hysteresis-type operator in the equation (3) complicates the stabilization of the pendulum as a whole. In general, the control impact for such a system will be retarded (we should "predict" the future position of the pendulum).

Fig. 2. Schematic view of the backlash's (4) action

2. Stabilization of inverted pendulum with hysteresis non-linearity

In this section we consider the state feedback control of the inverted pendulum with hysteresis non-linearity in the form of backlash. Namely, we obtain the analytic expression for the stability zones of such a system as well as we formulate the theorems that determine the stabilization of the considered system.

2.1. Dynamical characteristics: analytic results

Let us consider the state feedback control of the inverted pendulum, i.e., we assume that the input state of the hysteretic converter obeys the following relation:

X = k sign(ap + w), (6)

where a > 0 and sign(z) is the usual signum function.

The linearized equation (3) can be rewritten in the equivalent matrix form we determine the

general moment of inertia as A = ml2 and use the notation B = \ as follows:

(7)

U)=vO+*

where

0) = (

(0 M

\ B 2 0 ) ' u(t) = r[uo, h]x(t), X = + w),

p(0) = po, p(0) = w(0) = wo. The eigenvalues of the matrix V are B and —B so that the corresponding eigenvectors are 1 and ( J" ) respectively. Here, it should be noted that if the phase coordinates of the

B J \ B

system under consideration at some time moment will be placed on the line B<p + w = 0, then in the future (in the next time moments), in the absence of control, the phase coordinates will asymptotically tend to zero. Therefore, the control should be arranged (on the conceptual level) in such a manner as to "preserve" the phase coordinates in the vicinity of this line.

On each of the interval where the function u is constant the system (7) can be solved and the result is:

($)=««( :)+(8)

Here 1

. ,,, coshBt — sinhBt \

A(i) = B ,

V B sinh Bt cosh Bt J

( --(coshBt - 1) N (9)

u(t)

g B

--sinh Bt

\ g

¥>o and w0 are initial deviation and frequency of the pendulum respectively, u0 is an acceleration of the cylinder on the interval where the function u is constant.

The behavior of the system (8) on the whole time interval can be represented by the following recurrent relation:

CSH' - '-■>( W^) + U(t - ™

Here tk are the moments at which the control changes, ftk-1 and wtk-1 are the values of angle and angle velocity at the moment tk-1 respectively, utk-1 is an acceleration of the cylinder on the interval [tk-1,tk ].

2.2. Dissipative motion

The equation (3) is called dissipative if there exists a limited domain Q on the product of the phase space of the system (7) and the state space of the hysteretic converter (4) that for any initial values (^0, w0, u) e Q, the solutions of the equation (3) remain uniformly limited. In other words, this system is called dissipative if there exists a region in the phase space and matching region in the state space of the hysteresis converter that the solution which began in this region do not go to infinity.

Let us formulate the following theorem:

Theorem 2.1. The sufficient condition for existence of the dissipative regime of the pendulum's motion in a vicinity of the upper position is:

kB

g

(11)

2h

where t = W ^ is the time for which the piston passes through the cylinder.

Proof. As is followed from the equation (10), the movement of the phase parameters on the line Bf + w = 0 (such a line corresponds to stabilization of the system) occurs during the time t:

iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.

-lnf-2-■ (12)

B \Bfo + wo - f) y '

Let the piston passes through the cylinder during the time t . Then, the phase parameters of a pendulum are:

b^t + wT = eBT (B^o + wo). (13)

t

After substitution of (13) in to (12) one gets:

t = — In

Bk

g

B WBt(B<po + wq) - Bk

This equation has a real value if

Bk

eBT(bpo + wq) - bgk

> 0,

or

eBT\b<pq + wq| <

Bk

g

Let us note that the inequality (11) determines the stability zones of the system under consideration. □

2.3. Non-ideal relay in feedback

As is known, the measuring devices of any mechanical systems do not always work perfectly. So, let us consider the stabilization problem for inverted pendulum in the case when the uncertainty in the control takes place. Let us assume that this uncertainty is fixed, then the acceleration of the suspension point (control parameter) corresponds to the output state of the non-ideal relay converter:

y(t) = By(t) + w(t), u = kR [-£, £, sign (u(to)), yo] y(t),

(14)

where e > 0. The action of the presented converter R [-£,£, sign («(to)), yo] can be illustrated as is shown in the Fig. 3. Detailed description of this converter is given in [15].

g

Fig. 3. Schematic view of the non-ideal relay's R [-£,£, sign(ii(io)), yo] action

The parameter £ can be considered as an uncertainty in the measurement of the value B<p+w. Let us assume also that the following inequality takes place:

kB £ < -.

g

Otherwise the stabilization of the pendulum can not be observed.

Dynamics of the system with non-ideal relay in the feedback described by the equations:

+

U)=Kr)+*

U = kR [-e, e, sign (x(t°)), y°] y(t),

(15)

<¿>(0) = <0, w(0) = wo■

Let us assume that at initial time y(t0) = e. The time for which the phase coordinates of the system (9) under influence of the control will move to the position y(tc) = -e can be found using the following expressions:

— >-

y(tc),

or, in the full form:

where

-£ = B

( B 1 ) A(tc)( 1°) + ( B 1 )

kv(tc),

u(t)

Making the transformations we obtain

( -^(coshBt - 1) g B

--sinh Bt

\ g

—e = e

Bt

kB

°e - — (■ g

Btc

- 1) ,

and

(g + e \ l^gB - ej

(16)

(17)

A similar result takes place if y(t0) = -e. Thus, the total period of the control (14) is T = 2tc and y(2tc) = y(t0) = e. If y(t0) = e and y(t0) = -e, then for a finite time the phase coordinates of the system (under the control (14)) will move to the position y(t) = e or y(t) = -e.

Using the results presented above we can consider the question on the asymptotical (Lya-punov) stability of solutions of the system (15). Let us recall some facts from the Lyapunov stability theory. Namely, the definition of the asymptotic (Lyapunov) stability has the form:

Solution s\t,t0, tions take place:

Wq

of the system (15) is called Lyapunov stable if the following condi-

1. There exists a positive number p, that if

fi

)-( ÏÏ)

< p then the solution

s(t,t0,(^ )) is defined for all values of t > t0;

2. For any positive number e we can choose a positive number S < p, that if

fi Wi

fo

Wq

< S then the inequality

(tto ^rO) - sMs))

< e

takes place at t > t0.

t

c

s

Lyapunov stable solution s ( t, t0 positive number a < p, that if

(: )

( ^^ -( P \ V "1 / \ wo /

M s )) -• M : ))

of (9) is an asymptotically stable if there exists a

< a we have:

^ 0 at t ^

Now, we can formulate the following theorem:

Theorem 2.2. The system (15) has an asymptotically (Lyapunov) stable solution in the form of closed loop:

) V(0)

V "(0)

)-(

at 0 < 0 <tc

) № ) = ( V "(0) ) \

cosh B0 — sinh B0 B

B sinh B0 cosh B0

cosh B0 — sinh B0 B

B sinh B0 cosh B0

)(Î )

+

iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.

/ k

' — (cosh B0 - 1) ^ Bk

--sinh B0

\ g

(18)

)( ° )

+

k

' - (cosh B0 - 1)

g Bk

-sinh B0

g

(19)

at tc < 0 < 2t c with the attraction domain for solution |Bpo + "o| ^

kB

g

Proof. In order to prove this theorem it is needed to validate that for any initial conditions from the attraction region the following relations will take place:

lim ly(ntc)l - e, lim p(ntc) — 0.

(20) (21)

It is evident that the condition (20) is executed for any n (this fact can be proved by a direct substitution). The condition (21) for p(ntc) determines by the following equality (this equality can be obtained by a simple but cumbersome calculations):

P(ntc) —

Of course,

(i + f)

( 1-1 f )

Po .

0.

As an illustration of this result in the Fig. 4 we present the phase portrait of the system (15). The simulation parameters are: m = 1 kg, k = 0.2m • sec-2, g = 9.8m • sec-2, l = 0.3m, £ = 0.02 sec-1, p0 = -0.01, w0 = 0.0771 sec-1.

The case when measuring devices have the random uncertainty in the measurements (desyn-chronization in the control) is of particular interest. Our numerical experiments show that increasing of the simulation time leads to the fact that the probability of the stabilization of the system decreases and tends to zero. This means that the pendulum can not save upright position under desynchronization.

s

Fig. 4. Phase portrait of the system (15). Blue straight lines limit the zone of dissipative motion; light blue dashed lines are Bf + u = e and Bf + u = —e respectively; red line is —Bf + u = 0.

Simulation parameters are k = 0.2 m • sec 2, g = 9.8 m • sec 2, l = 0.3 m, e V0 = -0.01, w0 = 0.0771 sec"1

0.02 sec~

3. Optimal control

In many technical problems the question on stabilization has a general interest. However, together with the stabilization of the system there is the problem of optimal control (this problem corresponds to asymptotically optimal characteristics of the system). In the considered problem of stabilization of the inverted pendulum the problem of optimal control corresponds to minimizing of the functional which determines the deviation of the pendulum from the vertical position. Let us consider a functional (the so-called objective functional) as follows:

1 I'1

J = 2T Jt V + W) dt.

(22)

When the equations describing the dynamics of the system (2) are executed it is necessary to achieve the minimization of the functional (22). Let us note also that the law of stabilization should be sought only in the set of functions that stabilize the system (7), i.e., when the following phase restrictions take place:

Bk

|Bf(t) + u(t)| <

g

(23)

Solution of the posed problem can be expressed in the form of the following theorem on the optimal control of the pendulum :

Theorem 3.1. Let a system of equations (7) is given together with the initial conditions that correspond to a dissipative regime of motion of the pendulum. Then, under the control (4), the

functional (22) will be minimized, and the trajectory of the pendulum ( f

u

the dissipativeness region in a vicinity of the upper position (23).

will li

lie entirely in

1

Proof. The proof of this theorem is based on the Pontryagin's maximum principle as well as on the analysis of a zero-dynamics set [18]. Following the Pontryagin maximum principle, let us write the Hamiltonian for the system of equations (7) and the functional (22):

H(p, u, A) = -(V + B+ AlW + A2 (BV - y) •

Let us form also the system of coupled equations:

Here T means the transposition procedure and

-=(i o ).q=(0:)■

We will find a solution of (25) in the following form [18]:

(£)=-(:) •

P = PT > o-

(24)

A= -ATA + T Qx• (25)

(26)

The last condition means that the matrix P corresponds to a nonnegative-definite quadratic form.

Let us introduce the row vector C — (Ci, C2), which, together with the matrix P, determines the solution of algebraic equations that can be obtained by substitution of (26) in to (25). Such a substitution leads to the following system:

AT P + PA — -Q + C TC, (27)

PM = 0,

(28)

here M =

(ï>

Solving the equations (27) and (28), we obtain the following:

P

( Bo )■

(■ B )•

(29)

(30)

Let us introduce a virtual output of the system in the form y = Cx. Using property 8.1 in [18], we obtain the form of control that minimizes the functional (22), namely:

u = r[wo, h]

k sign ^p + Buj

(31)

Let us note that the line p + — w = 0 should be a set of zero dynamics. Of course this fact is

B

obeyed. So the proof of the theorem is completed. □

Here we would like to note that the law of optimal control (following the presented theorem), "moves" the phase coordinates of the system (7) to the position y(t) = 0, and then stops. Such a control stabilizes the pendulum in the upper position. But, it is absolutely clear that in real systems it is difficult to get such a position, so it is necessary to consider the problem of finding the optimal control in another way.

One way of finding the optimal control is based on the assumption that the measuring instruments (in most cases) laid "errors" (or, uncertainties) and, thus, the switching of control is based on the principle of non-ideal relay.

Conclusions

Here we would like to formulate the following concluding remarks. The obtained in this paper results not only accurately predict the behavior of a pendulum under hysteretic control, but also allow to determine the possibility of the dissipative motion in the vicinity of the top position. The existence of the dissipative motion depends on the initial deviation of the pendulum's position as well as on the physical parameters of the system under consideration. Introduction of non-ideal relay in the state feedback control allows us to describe the periodic modes of the system (7). However, it should also be noted that the results obtained for the presence of the non-ideal relay in the state feedback control can be used for description of the real physical (and, in particular, mechanical) systems because the parameters of such systems can be measured with the inevitable uncertainties only.

Also, our numerical experiments show that the presence of the backlash with the nonzero step in the feedback control of the inverted pendulum leads to dissipative motion only and asymptotic convergence to an upright position is fundamentally unattainable. We have also considered the question on the optimal control of the system under consideration. Namely the theorem on the optimal control of the pendulum has been formulated and discussed.

This work is supported by the RFBR, grants 16-08-00312-a and 15-08-02611-a.

References

[1] A.Stephenson, On an induced stability, Phil. Mag., 15(1908), 233.

[2] P.L.Kapitza, Dynamic stability of a pendulum when its point of suspension vibrates, Soviet Phys. JETP, 21(1951), 588.

[3] P.L.Kapitza, Pendulum with a vibrating suspension, Usp. Fiz. Nauk, 44(1951), 7 (in Russian).

[4] E.I.Butikov, Subharmonic resonances of the parametrically driven pendulum, J. Phys. A: Math. Theor., 35(2002), 6209.

[5] A.Arinstein, M.Gitterman, Eur. J. Phys., 29(2008), 355.

[6] E.I.Butikov, An improved criterion for Kapitza's pendulum stability, J. Phys. A: Math. Theor., 44(2011), 295202.

[7] J.J.Wang, Simulation studies of inverted pendulum based on PID controllers, Simul. Model. Pract. Th, 19(2011), 440.

[8] E.I.Butikov, Oscillations of a simple pendulum with extremely large amplitudes, Eur. J. Phys., 33(2012), 1555.

[9] F.L.Chernous'ko, S.A. Reshmin, Time-Optimal Swing-Up Feedback Control of a Pendulum, Nonlinear Dynam., 47(2007), 65.

10] S.A.Reshmin, F.L.Chernous'ko, A Time-Optimal Control Synthesis for a Nonlinear Pendulum, J. Comput. Sys. Sc. Int. 46(2007), 9.

11] K.D.Kim, P.Kumar, Real-Time Middleware for Networked Control Systems and Application to an Unstable System, IEEE Transactions on Control Systems Technology, 21(2013), 1898.

12] J.Huang, et al., Modeling and Velocity Control for a Novel Narrow Vehicle Based on Mobile Wheeled Inverted Pendulum, IEEE Transactions on Control Systems Technology, 21(2013), 1607.

iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.

13] K.J.Astrom, K. Furuta, Swinging up a pendulum by energy control, Automatica, 36(2000), 287.

14] R.A.Nelepin (ed.), Methods of investigation of automatic control nonlinear systems, Nauka, Moscow, 1975 (in Russian).

15] M.A.Krasnosel'skii, A.V.Pokrovskii, Systems with Hysteresis, Springer Verlag, 1989.

16] M.E.Semenov, D.V.Shevlyakova, P.A.Meleshenko, Inverted pendulum under hysteresis control: stability zones and periodic solutions, Nonlinear Dynam., 75(2014), 247.

17] M.E.Semenov, A.M.Solovyov, P.A. Meleshenko, Elastic inverted pendulum with backlash in suspension: stabilization problem, Nonlinear Dynam., 82(2015), 677.

18] I.V.Miroshnik, Automatic control theory, Piter, St.Peterburg, 2006 (in Russian).

Управление перевернутым маятником с гистерезисом в точке подвеса с использованием принципов обратной связи

Михаил Е. Семенов

Военно-воздушная академия Старых Большевиков, 54а, Воронеж, 394064

Россия

Зейнаб Хатиф Аббас Игорь Н. Ищук Олеся И. Канищева Петр А. Мелешенко

В представленной статье 'рассматривается математическая модель управления перевернутым маятником с гистерезисной нелинейностью в виде люфта, основыванная на принципах обратной связи. Получены и проанализированы аналитические результаты для критериев устойчивости, а также решения линеаризованного уравнения. Сформулированы и доказаны теоремы, определяющие стабилизацию ^рассматриваемой системы. Изучен вопрос об оптимальном управлении данной системы и сформулирована соответствующая теорема.

Ключевые слова: гистерезисная нелинейность, люфт, принципы обратной связи, оптимальное управление.

i Надоели баннеры? Вы всегда можете отключить рекламу.