From Model-based to Data-driven Discrete-time Iterative Learning Control.
Record Type:
Electronic resources : Monograph/item
Title/Author:
From Model-based to Data-driven Discrete-time Iterative Learning Control.
Author:
Song, Bing.
Published:
Ann Arbor : ProQuest Dissertations & Theses, 2019.
Description:
151 p.
Notes:
Source: Dissertations Abstracts International, Volume: 80-05, Section: B.
Contained By:
Dissertations Abstracts International, 80-05B.
Subject:
Mechanical engineering.
Online resource:
http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=10976059
ISBN:
9780438634275
LDR
:04242nmm a2200313 4500
001
2263141
005
20200214113156.5
008
220629s2019 ||||||||||||||||| ||eng d
020
$a
9780438634275
035
$a
(MiAaPQ)AAI10976059
035
$a
(MiAaPQ)columbia:14985
035
$a
AAI10976059
040
$a
MiAaPQ
$c
MiAaPQ
100
1
$a
Song, Bing.
$3
1944044
245
1 0
$a
From Model-based to Data-driven Discrete-time Iterative Learning Control.
260
1
$a
Ann Arbor :
$b
ProQuest Dissertations & Theses,
$c
2019
300
$a
151 p.
500
$a
Source: Dissertations Abstracts International, Volume: 80-05, Section: B.
500
$a
Publisher info.: Dissertation/Thesis.
500
$a
Advisor: Longman, Richard W.
502
$a
Thesis (Ph.D.)--Columbia University, 2019.
506
$a
This item must not be sold to any third party vendors.
520
$a
This dissertation presents a series of new results in iterative learning control (ILC) that progress from model-based ILC algorithms to data-driven ILC algorithms. ILC is a trial-and-error algorithm that learns through repetitions in practice to follow a pre-defined finite-time maneuver with high tracking accuracy. Mathematically, ILC constructs a contraction mapping between the tracking errors of successive iterations, and aims to converge to a tracking accuracy approaching the reproducibility level of the hardware. It produces feedforward commands based on measurements from previous iterations to eliminate tracking errors caused by the bandwidth limitations of feedback controllers, transient responses, model inaccuracies, unknown repeating disturbances, etc. Generally, ILC uses an a priori model to form the contraction mapping that guarantees monotonic decay of the tracking error. However, unmodeled high-frequency dynamics may destabilize the control system. Existing infinite impulse response filtering techniques that stop the learning at such frequencies have initial-condition issues that can cause an otherwise stable ILC law to become unstable. A circulant form of zero-phase filtering for finite-time trajectories is proposed here to avoid such issues. This work addresses the possible lack of stability robustness when ILC uses an imperfect a priori model. Besides the computation of feedforward commands, measurements from previous iterations can also be used to update the dynamic model; in other words, as the learning progresses, the model is developed iteratively from data. This leads to adaptive ILC methods. An indirect adaptive linear ILC method to speed up the desired maneuver is presented here. The updates of the system model are realized by embedding an observer in ILC to estimate the system Markov parameters. This method can be used to increase productivity or to produce high tracking accuracy when the desired trajectory is too fast for feedback control to be effective. For nonlinear ILC, data are used to update a progression of models along a homotopy, i.e., the ILC method presented in this thesis uses data to repeatedly create bilinear models in a homotopy approaching the desired trajectory. The improvement here makes use of Carleman bilinearized models to capture more nonlinear dynamics, with the potential for faster convergence compared to existing methods based on linearized models. The last work presented here uses model-free reinforcement learning (RL) to eliminate the need for an a priori model. It is analogous to direct adaptive control, using data to directly produce the gains in the ILC law without use of a model. An off-policy RL method is first developed by extending a model-free model predictive control method and is then applied in the trial domain for ILC. Adjustments of the ILC learning law and the RL recursion equation for state-value function updates allow the collection of enough data while improving tracking accuracy without significant safety concerns. This algorithm can be seen as a first step toward bridging ILC and RL to address nonlinear systems.
590
$a
School code: 0054.
650
4
$a
Mechanical engineering.
$3
649730
690
$a
0548
710
2
$a
Columbia University.
$b
Mechanical Engineering.
$3
1684265
773
0
$t
Dissertations Abstracts International
$g
80-05B.
790
$a
0054
791
$a
Ph.D.
792
$a
2019
793
$a
English
856
4 0
$u
http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=10976059
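The 520 abstract above describes the core ILC mechanism: a feedforward command that is updated from the previous iteration's tracking error through a contraction mapping, so the error decays over repeated trials. Below is a minimal, generic Python sketch of such a discrete-time ILC update law. It is an illustration only, not the dissertation's specific algorithms; the first-order plant, the learning gain, and the desired trajectory are assumptions made for the example.

import numpy as np

# Generic discrete-time ILC sketch (illustration only; not the algorithms of this dissertation).
# Assumed plant for illustration: y[k+1] = a*y[k] + b*u[k].
a, b = 0.9, 0.5
N = 50                                # number of time steps in the finite-time maneuver
t = np.arange(N)
y_des = np.sin(2 * np.pi * t / N)     # pre-defined finite-time desired trajectory (assumed)

def run_trial(u):
    """Simulate one iteration (trial) of the maneuver and return the measured output."""
    y = np.zeros(N)
    for k in range(N - 1):
        y[k + 1] = a * y[k] + b * u[k]
    return y

u = np.zeros(N)                       # feedforward command, refined from iteration to iteration
L = 0.8 / b                           # scalar learning gain (assumed; chosen so the error map contracts)
for iteration in range(30):
    y = run_trial(u)
    e = y_des - y                     # tracking error of the current iteration
    # ILC update: the next iteration's command is built from this iteration's error.
    # The one-step delay from u[k] to y[k+1] shifts the error index by one.
    u[:-1] += L * e[1:]
    print(f"iteration {iteration:2d}  RMS tracking error = {np.sqrt(np.mean(e**2)):.4f}")

In the lifted (supervector) form this update is a contraction mapping on the tracking error when the gain is chosen appropriately, which is the monotonic-decay property the abstract refers to.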
Items (1 record)
Inventory Number: W9415375
Location Name: Electronic resources (電子資源)
Item Class: 11. Online reading (11.線上閱覽_V)
Material type: E-book (電子書)
Call number: EB
Usage Class: Normal use (一般使用)
Loan Status: On shelf
No. of reservations: 0