TWI809592B - Model prediction control system and method thereof - Google Patents
Model prediction control system and method thereof Download PDFInfo
- Publication number
- TWI809592B TWI809592B TW110145943A TW110145943A TWI809592B TW I809592 B TWI809592 B TW I809592B TW 110145943 A TW110145943 A TW 110145943A TW 110145943 A TW110145943 A TW 110145943A TW I809592 B TWI809592 B TW I809592B
- Authority
- TW
- Taiwan
- Prior art keywords
- predictive control
- model predictive
- model
- parameters
- control unit
- Prior art date
Links
Images
Classifications
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/04—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
- G05B13/042—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators in which a parameter or coefficient is automatically adjusted to optimise the performance
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02P—CLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
- Y02P90/00—Enabling technologies with a potential contribution to greenhouse gas [GHG] emissions mitigation
- Y02P90/02—Total factory control, e.g. smart factories, flexible manufacturing systems [FMS] or integrated manufacturing systems [IMS]
Landscapes
- Engineering & Computer Science (AREA)
- Software Systems (AREA)
- General Physics & Mathematics (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Evolutionary Computation (AREA)
- Medical Informatics (AREA)
- Health & Medical Sciences (AREA)
- Physics & Mathematics (AREA)
- Artificial Intelligence (AREA)
- Automation & Control Theory (AREA)
- Feedback Control In General (AREA)
- Radar Systems Or Details Thereof (AREA)
- Monitoring And Testing Of Nuclear Reactors (AREA)
- Control Of Eletrric Generators (AREA)
- Testing And Monitoring For Control Systems (AREA)
Abstract
Description
本揭示係關於一種控制系統及控制方法,特別有關於一種用於預測並控制目標物之行為的模型預測控制系統及其方法。 The present disclosure relates to a control system and a control method, in particular to a model predictive control system and method for predicting and controlling the behavior of an object.
模型預測控制機制係根據目標物之動態行為建立目標物之動態模型,並根據動態模型以預測目標物的動態變化,而後藉由最佳化演算法計算控制決策輸入值的最佳解,以使得目標物的行為能滿足預定目標。 The model predictive control mechanism is to establish the dynamic model of the target object according to the dynamic behavior of the target object, and predict the dynamic change of the target object according to the dynamic model, and then use the optimization algorithm to calculate the optimal solution of the control decision input value, so that The behavior of the object satisfies the intended purpose.
然而,模型預測控制機制的最佳化演算法涉及維度較大的最佳化求解過程,當目標物之動態變化較快或目標物之動態模型較複雜時,最佳化演算法將耗費大量運算時間而導致無法在有限時間的決策週期內完成最佳化求解,此將造成目標物的控制決策延遲而影響目標物之效能表現甚至引發意外事故。 However, the optimization algorithm of the model predictive control mechanism involves a large-dimensional optimization solution process. When the dynamic change of the target object is fast or the dynamic model of the target object is complex, the optimization algorithm will consume a lot of calculations As a result, the optimal solution cannot be completed within the limited time decision-making cycle, which will cause the delay of the control decision of the target object, affect the performance of the target object and even cause accidents.
因此,本技術領域之相關產業之技術人員係致力於改良模型預測控制機制,可於各種應用情境(例如金融交易,自駕車,無人機,工廠產線,等)中快速計算出適合於控制目標的控制決策輸入值,而能夠避免控制決策延遲;並且,改良的模型預測控制機制亦可大幅減少運算量以降低硬體規格需求。 Therefore, technicians in related industries in this technical field are committed to improving the model predictive control mechanism, which can quickly calculate the suitable control target in various application scenarios (such as financial transactions, self-driving cars, drones, factory production lines, etc.) The input value of the control decision can avoid the delay of the control decision; moreover, the improved model predictive control mechanism can also greatly reduce the amount of calculation to reduce the hardware specification requirements.
本揭示係提供一種模型預測控制系統,包括模型模擬單元、監測單元及代理模型預測控制單元。模型模擬單元用於根據目標物之行為模型以及多個前狀態的控制決策參數以模擬此目標物之動態行為,且對應產生多個實際狀態參數。監測單元,用於監測此些前一狀態的控制決策參數及此些實際狀態參數,並根據此些前一狀態的控制決策參數及此些實際狀態參數產生多個估計狀態參數。代理模型預測控制單元,用於根據此些估計狀態參數及多個目標參數執行近似的模型預測控制運算,以得到多個目前狀態的控制決策參數,並將此些目前狀態的控制決策參數回傳至此模型模擬單元。其中,此代理模型預測控制單元執行此近似的模型預測控制運算以取代運算複雜度較高的模型預測控制運算。 The disclosure provides a model predictive control system, including a model simulation unit, a monitoring unit and an agent model predictive control unit. The model simulation unit is used for simulating the dynamic behavior of the target according to the behavior model of the target and a plurality of control decision parameters of the previous state, and correspondingly generates a plurality of actual state parameters. The monitoring unit is configured to monitor the control decision parameters of the previous state and the actual state parameters, and generate a plurality of estimated state parameters according to the control decision parameters of the previous state and the actual state parameters. The agent model predictive control unit is used to perform approximate model predictive control operations according to the estimated state parameters and multiple target parameters, so as to obtain multiple control decision parameters of the current state, and return the control decision parameters of the current state So far the model simulates the unit. Wherein, the proxy model predictive control unit executes the approximate model predictive control operation to replace the model predictive control operation with high computational complexity.
本揭示亦提供一種模型預測控制方法,包括以下步驟。建立目標物之行為模型。根據此目標物之此行為模型以及多個前狀態的控制決策參數以模擬此目標物之動態行為。對應產生多個實際狀態參數。監測此些前一狀態的控制決策參數及此些實際狀態參數。根據此些前一狀態的控制決策參數及此些實際狀態參數產生多個估計狀態參數。設置代理模型預測控制單元。根據此些估計狀態參數及多個目標參數藉由此代理模型預測控制單元執行近似的模型預測控制運算,以取代運算複雜度較高的模型預測控制運算。以及,將此些目前狀態的控制決策參數回傳至此模型模擬單元。 The disclosure also provides a model predictive control method, including the following steps. Create a behavioral model of the target. The dynamic behavior of the target is simulated according to the behavior model of the target and multiple control decision parameters of the previous state. Correspondingly generate a plurality of actual state parameters. Such previous state control decision parameters and such actual state parameters are monitored. A plurality of estimated state parameters are generated according to the control decision parameters of the previous state and the actual state parameters. Set up the surrogate model predictive control unit. Based on the estimated state parameters and multiple target parameters, the proxy model predictive control unit performs approximate model predictive control operations to replace model predictive control operations with high computational complexity. And, return the control decision parameters of these current states to the model simulation unit.
透過閱讀以下圖式、詳細說明以及申請專利範圍, 可見本揭示之其他方面以及優點。 By reading the following drawings, detailed descriptions and scope of patent application, Other aspects and advantages of the present disclosure can be seen.
1000:模型預測控制系統 1000: Model Predictive Control Systems
100:模型模擬單元 100:Model simulation unit
200:監測單元 200: Monitoring unit
300:最佳化模型預測控制單元 300: Optimizing Model Predictive Control Units
400:代理模型預測控制單元 400: Proxy Model Predictive Control Unit
500:更新單元 500: update unit
700:目標物 700: target
2000:工廠產線 2000: Factory production line
u1,u2,u3:控制決策參數 u1, u2, u3: control decision parameters
y1,y2:實際狀態參數 y1, y2: actual state parameters
x1,x2:估計狀態參數 x1, x2: Estimated state parameters
S:目標參數集合 S: set of target parameters
s1~sN:目標參數 s1~sN: target parameters
track1:目標移動軌跡 track1: target movement track
track2:實際移動軌跡 track2: actual moving track
u1_mean,u2_mean:平均值 u1_mean, u2_mean: average value
u1_var,u2_var:變異值 u1_var, u2_var: variation value
2100:饋入量 2100: Feed amount
2200:蒸氣量 2200: steam volume
2300:再沸騰溫度或壓力 2300: Reboiling temperature or pressure
S110~S180:步驟 S110~S180: steps
第1圖為本揭示一實施例之模型預測控制系統的方塊圖。 FIG. 1 is a block diagram of a model predictive control system according to an embodiment of the present disclosure.
第2圖為本揭示之模型預測控制系統應用於自駕車之實施例的示意圖。 Fig. 2 is a schematic diagram of an embodiment of the application of the model predictive control system disclosed in the present disclosure to a self-driving car.
第3圖為本揭示之代理模型預測控制單元執行模型預測控制運算的示意圖。 FIG. 3 is a schematic diagram of an agent model predictive control unit performing model predictive control calculations according to the present disclosure.
第4圖為本揭示之模型預測控制系統應用於工廠產線之實施例的示意圖。 FIG. 4 is a schematic diagram of an embodiment of applying the model predictive control system of the present disclosure to a factory production line.
第5圖為本揭示一實施例之模型預測控制方法的流程圖。 FIG. 5 is a flowchart of a model predictive control method according to an embodiment of the present disclosure.
本說明書的技術用語係參照本技術領域之習慣用語,如本說明書對部分用語有加以說明或定義,該部分用語之解釋係以本說明書之說明或定義為準。本揭露之各個實施例分別具有一或多個技術特徵。在可能實施的前提下,本技術領域具有通常知識者可選擇性地實施任一實施例中部分或全部的技術特徵,或者選擇性地將這些實施例中部分或全部的技術特徵加以組合。 The technical terms in this specification refer to the customary terms in this technical field. If some terms are explained or defined in this specification, the explanations or definitions of these terms shall prevail. Each embodiment of the disclosure has one or more technical features. On the premise of possible implementation, those skilled in the art may selectively implement some or all of the technical features in any embodiment, or selectively combine some or all of the technical features in these embodiments.
第1圖為本揭示一實施例之模型預測控制(model prediction control,MPC)系統1000的方塊圖。請參見第1圖,模型預測控制系統1000包括模型模擬(model simulation)單元100、監測(observation)單元200、最佳化模型預測控制(optimal
model prediction control)單元300、代理模型預測控制(proxy model prediction control)單元400、更新單元500。
FIG. 1 is a block diagram of a model prediction control (MPC)
模型模擬單元100可建立目標物700的行為模型以模擬目標物700的動態行為。在一種示例中,目標物700(或可稱為「目標對象」、「操作標的」,泛指模型預測控制系統1000進行預測控制的控制標的)例如為自駕車,模型模擬單元100可建立自駕車的行為模型以模擬自駕車的「車速」、「行進方向」等等的動態行為。其中,自駕車的行為「車速」關聯的控制對象為自駕車的「油門」,而「行進方向」關聯的控制對象為自駕車的「方向盤」。在運作上,模型模擬單元100可接收複數個控制決策參數,其中的一個控制決策參數u1例如為「油門大小」,其關聯的控制對象為「油門」;模型模擬單元100可根據控制決策參數u1模擬控制自駕車的「油門」以調整自駕車的行為「車速」。並且,模型模擬單元100可模擬計算出「車速」的實際測量值(measurement),即,「車速」的實際狀態參數y1;換言之,實際狀態參數y1表示自駕車的「實際車速」。
The
另一方面,模型模擬單元100的另一個控制決策參數u2例如為「方向盤旋轉量」,其關聯的控制對象為「方向盤」;模型模擬單元100可根據控制決策參數u2模擬控制自駕車的「方向盤」以調整自駕車的行為「行進方向」。並且,模型模擬單元100可模擬計算出「行進方向」的實際測量值,即,實際狀態參數y2;換言之,實際狀態參數y2表示自駕車的「實際行進方向」。
On the other hand, another control decision parameter u2 of the
綜上所述,模型模擬單元100可根據控制決策參數u1「油門大小」模擬控制自駕車的控制對象「油門」以調整自駕車的行為「車速」,模型模擬單元100並對應輸出實際狀態參數y1「實際車速」。另一方面,模型模擬單元100可根據控制決策參數u2「方向盤旋轉量」模擬控制自駕車的控制對象「方向盤」以調整自駕車的行為「行進方向」,模型模擬單元100並對應輸出實際狀態參數y2「實際行進方向」。
To sum up, the
監測單元200連接於模型模擬單元100,監測單元200可從模型模擬單元100接收實際狀態參數y1、y2,並根據控制決策參數u1、u2產生估計(estimation)狀態參數x1、x2。其中,估計狀態參數x1表示自駕車的「估計車速」,估計狀態參數x2表示自駕車的「估計行進方向」。更具體而言,監測單元200可接收目前狀態(或稱為:目前疊代(iteration),即,第k個疊代)的實際狀態參數y1(k)、y2(k),並且接收前一個狀態(或稱為:前一個疊代,即,第(k-1)個疊代)的控制決策參數u1(k-1)、u2(k-1)。監測單元200根據目前疊代的實際狀態參數y1(k)、y2(k)與前一個疊代的控制決策參數u1(k-1)、u2(k-1)產生目前疊代的估計狀態參數x1(k)、x2(k)。
The
並且,最佳化模型預測控制單元300連接於監測單元200,最佳化模型預測控制單元300可從監測單元200接收估計狀態參數x1(k)、x2(k)。此外,最佳化模型預測控制單元300可接收目標參數集合S,目標參數集合S可包括複數個目標參數s1、
s2、...、sN而可表示為S={s1、s2、...、sN}。請配合參見第2圖,其繪示本揭示之模型預測控制系統1000應用於自駕車之實施例的示意圖;目標參數集合S可表示使用者期待目標物700(自駕車)應達成的目標行為,第2圖所示自駕車應達成的目標移動軌跡track1即為目標參數集合S。並且,自駕車的目標移動軌跡track1的每一點的座標即為目標參數集合S的各個目標參數s1、s2、...、sN。根據目標參數集合S以及估計狀態參數x1(k)、x2(k),最佳化模型預測控制單元300可執行模型預測控制以計算出對應的控制決策參數u1(k)、u2(k)。換言之,為了使自駕車的行為滿足目標參數集合S(即,使自駕車的實際移動軌跡track2滿足目標移動軌跡track1),最佳化模型預測控制單元300根據估計狀態參數x1(k)、x2(k)(即,自駕車的「估計車速」與「估計行進方向」)計算出對應的控制決策參數u1(k)、u2(k)(即,自駕車的「油門大小」及「方向盤旋轉量」)以分別控制自駕車的「油門」及「方向盤」,據以調整自駕車的行為「車速」及「行進方向」。調整後的「車速」及「行進方向」並可經由模型模擬單元100的模擬而反映在實際狀態參數y1、y2(即,自駕車的「實際車速」及「實際行進方向」)。
Moreover, the optimized model predictive control unit 300 is connected to the
在本實施例中,最佳化模型預測控制單元300根據最佳化的演算法以執行模型預測控制,據以計算出可滿足目標參數集合S的控制決策參數u1、u2的最佳解(optimal solution)。然而,若考量自駕車的其他更多的控制對象(例如,排檔、引擎溫 度,等),則最佳化模型預測控制單元300必須對於數量較多的控制決策參數u1、u2、u3、...、uM進行最佳化的運算,此將耗費大量運算時間而無法在較短的決策週期(例如,自駕車的行進駕駛的決策週期可短至約0.1秒)內計算得到控制決策參數u1、u2、u3、...、uM的最佳解,因而無法達到即時控制,其可能造成自駕車的性能減損或安全事故。 In this embodiment, the optimized model predictive control unit 300 executes the model predictive control according to the optimized algorithm, so as to calculate the optimal solution (optimal solution). However, if considering other more control objects of self-driving cars (for example, gear, engine temperature degree, etc.), then the optimization model predictive control unit 300 must perform optimization operations on a large number of control decision parameters u1, u2, u3, ..., uM, which will consume a lot of computing time and cannot be used in The optimal solution of the control decision parameters u1, u2, u3, ..., uM can be calculated in a short decision cycle (for example, the decision cycle of self-driving car driving can be as short as about 0.1 second), so real-time control cannot be achieved , which may cause performance impairment or safety accidents of self-driving cars.
針對於此,可採用預先建立的代理模型預測控制單元400取代最佳化模型預測控制單元300。係以代理模型預測控制單元400執行運算複雜度較低(因而決策週期較短)的模型預測控制的運算,以取代最佳化模型預測控制單元300執行的運算複雜度較高(因而決策週期較長)的模型預測控制的運算。並且,代理模型預測控制單元400雖非執行最佳化的運算,然而代理模型預測控制單元400仍能夠得到近似最佳解的結果,因而仍能有效調整控制目標物的行為。在本實施例中,可用離線(off-line)方式預先建立代理模型預測控制單元400的模型(即,離線建模),建立模型的方式可包括模仿學習(imitation learning)等之機器學習方式。並且,參見第3圖繪示的代理模型預測控制單元400執行模型預測控制運算的示意圖;代理模型預測控制單元400可根據隨機程序運算以執行模型預測控制,例如可根據高斯程序(Gaussian Process,GP)的運算執行模型預測控制。仍然以自駕車的兩個控制決策參數u1、u2(即,自駕車的「油門大小」及「方向盤旋轉量」)為例,代理模型預測控制單元400根據估計狀態參數x1、x2 以及目標參數集合S執行高斯程序的運算以計算出控制決策參數u1、u2各自的平均值u1_mean、u2_mean以及變異值u1_var、u2_var。在一示例中,係以平均值u1_mean、u2_mean作為最終輸出的控制決策參數u1、u2(即,控制決策參數u1=平均值u1_mean,控制決策參數u2=平均值u2_mean)。另一方面,係以控制決策參數u1、u2的變異值u1_var、u2_var作為模型可靠度(reliability)指標rindex,據以判斷是否需要更新代理模型預測控制單元400。 In view of this, the pre-established agent model predictive control unit 400 can be used to replace the optimized model predictive control unit 300 . The model predictive control unit 400 is used as an agent to perform model predictive control operations with lower computational complexity (thus the decision cycle is shorter) to replace the computational complexity performed by the optimized model predictive control unit 300 (thus the decision cycle is shorter) Long) operation of model predictive control. Moreover, although the surrogate model predictive control unit 400 does not perform optimal calculations, the surrogate model predictive control unit 400 can still obtain a result close to the optimal solution, and thus can effectively adjust the behavior of the control target. In this embodiment, the model of the agent model prediction control unit 400 can be pre-established in an off-line manner (ie, offline modeling), and the manner of establishing the model can include imitation learning and other machine learning methods. Also, refer to the schematic diagram of the agent model predictive control unit 400 shown in FIG. 3 performing model predictive control operations; the agent model predictive control unit 400 can operate according to a random program to perform model predictive control, for example, according to a Gaussian process (Gaussian Process, GP ) to perform model predictive control. Still taking the two control decision parameters u1, u2 of the self-driving car (that is, the "accelerator size" and "steering wheel rotation" of the self-driving car) as an example, the agent model prediction control unit 400 estimates the state parameters x1, x2 And the target parameter set S executes the operation of the Gaussian program to calculate the respective mean values u1_mean, u2_mean and variation values u1_var, u2_var of the control decision parameters u1, u2. In one example, the control decision parameters u1 and u2 are output with mean values u1_mean and u2_mean (ie, control decision parameter u1=mean value u1_mean, control decision parameter u2=mean value u2_mean). On the other hand, the variation values u1_var and u2_var of the control decision parameters u1 and u2 are used as the model reliability index rindex to determine whether to update the proxy model predictive control unit 400 .
承上,更新單元500可設定模型可靠度指標rindex的閥值t1,並且比較模型可靠度指標rindex與閥值t1。當模型可靠度指標rindex大於閥值t1(即,控制決策參數u1、u2的變異值u1_var、u2_var大於閥值t1)時,表示目前採用的代理模型預測控制單元400的可靠度較低而需進行適應性(adaptive)更新。在一示例中,可用線上(on-line)方式收集取得各項操作數據(例如自駕車的「車速」、「行進方向」關聯的操作數據),並以此些操作數據對於代理模型預測控制單元400進行適應性更新。
Continuing from the above, the
由上,可藉由離線方式預先建立代理模型預測控制單元400的模型(即,離線建模),並藉由線上方式對於代理模型預測控制單元400進行適應性更新(即,線上更新)。並且,當代理模型預測控制單元400正在進行適應性更新時,代理模型預測控制單元400無法即時計算控制決策參數u1、u2,在此狀況下則回歸至使用最佳化模型預測控制單元300執行最佳化的模型預測 控制(即,運算複雜度較高(因而決策週期較長)的模型預測控制)以計算控制決策參數u1、u2。 From the above, the model of the agent model predictive control unit 400 can be pre-established offline (ie, offline modeling), and the agent model predictive control unit 400 can be adaptively updated online (ie, online update). Moreover, when the agent model predictive control unit 400 is performing adaptive updating, the agent model predictive control unit 400 cannot calculate the control decision parameters u1, u2 in real time. Optimized model predictions control (that is, model predictive control with high computational complexity (and therefore long decision-making period)) to calculate the control decision parameters u1, u2.
第4圖繪示本揭示之模型預測控制系統1000應用於工廠產線2000的實施例的示意圖,如第4圖所示,欲控制的目標物為工廠產線2000中的「蒸餾塔」,模型預測控制系統1000的控制對象包括整個蒸餾塔系統之中的各個PID控制器設定點的控制變數,例如「饋入(feed)量」2100、「蒸氣(steam)量」2200、「再沸騰(reboiler)溫度或壓力」2300,等。模型預測控制系統1000的控制決策參數u1、u2、u3分別對應於上述各個控制對象的PID控制器設定點,例如:控制決策參數u1為「饋入量」2100的PID控制器設定點。控制決策參數u2為「蒸氣(steam)量」2200的PID控制器設定點。並且,控制決策參數u3為「再沸騰溫度或壓力」2300的PID控制器設定點。
Figure 4 shows a schematic diagram of an embodiment of the model
並且,目標物「蒸餾塔」的目標參數s1例如為「蒸餾液的單位時間產出量」:為了達成使用者期望的單位時間產出量,可藉由模型預測控制系統1000的代理模型預測控制單元400執行近似的模型預測控制,以計算出近似於最佳解的控制決策參數u1、u2、u3,並將控制決策參數u1、u2、u3導入實際的工廠產線2000中的整個蒸餾塔系統的對應PID控制器設定點。上述之工廠產線2000(以蒸餾塔系統的化學工廠為例)的決策週期約為15秒~30秒,可藉由代理模型預測控制單元400執行較為快速的
近似模型預測控制,而能夠在15秒~30秒的決策週期內計算出近似於最佳解的控制決策參數u1、u2、u3。
In addition, the target parameter s1 of the target "distillation tower" is, for example, "the output per unit time of the distillate": in order to achieve the output per unit time desired by the user, the proxy model predictive control of the model
此外,在工廠產線2000中,本實施例之代理模型預測控制系統1000亦可選擇性的配合控制迴路性能評估(CLPA)系統、全廠擾動源追朔(PDT)系統、控制器參數調諧(PID tunner)系統或系統識別(建模)系統以控制調整工廠產線2000中的各項變數。
In addition, in the
第5圖為本揭示一實施例之模型預測控制方法的流程圖。請參見第5圖,模型預測控制方法可配合第1圖之模型預測控制系統1000及第3圖之代理模型預測控制單元400而實施。首先,在步驟S110中,藉由模型模擬單元100接收複數個控制決策參數,例如接收自駕車的兩個控制決策參數u1「油門大小」及控制決策參數u2「方向盤旋轉量」。並且,藉由模型模擬單元100根據控制決策參數u1、u2模擬產生實際狀態參數y1「實際車速」及實際狀態參數y2「實際行進方向」。
FIG. 5 is a flowchart of a model predictive control method according to an embodiment of the present disclosure. Please refer to FIG. 5 , the model predictive control method can be implemented in cooperation with the model
而後,在步驟S120中,模型模擬單元100將目前狀態的實際狀態參數y1(k)、y2(k)傳送至監測單元200,並且監測單元200更接收前一個狀態的控制決策參數u1(k-1)、u2(k-1)。據此,監測單元200根據實際狀態參數y1(k)、y2(k)及控制決策參數u1(k-1)、u2(k-1)產生估計狀態參數x1(k)「估計車速」及估計狀態參數x2(k)「估計行進方向」。
Then, in step S120, the
而後,在步驟S130中,判斷是否處於模型預測控制系統1000的決策週期之內;以自駕車為例,決策週期約為0.1秒。若仍處於決策週期之內,則執行步驟S135,使用代理模型預測控制單元400執行近似模型預測控制;其中,代理模型預測控制單元400接收目標參數集合S以及估計狀態參數x1(k)、x2(k)並據以產生控制決策參數u1、u2各自的平均值u1_mean、u2_mean以及變異值u1_var、u2_var,並將變異值u1_var、u2_var作為模型可靠度指標rindex。產生的控制決策參數u1、u2可再提供至步驟S110的模型模擬單元100執行模擬。
Then, in step S130, it is judged whether it is within the decision-making period of the model
而後,在步驟S140中,設定模型可靠度指標rindex的閥值t1;並且判斷模型可靠度指標rindex是否大於閥值t1。若模型可靠度指標rindex小於閥值t1(即,變異值u1_var、u2_var小於閥值t1),則執行步驟S150,將平均值u1_mean、u2_mean作為最終輸出的控制決策參數u1、u2(即,控制決策參數u1=平均值u1_mean,控制決策參數u2=平均值u2_mean)。 Then, in step S140, a threshold t1 of the model reliability index rindex is set; and it is judged whether the model reliability index rindex is greater than the threshold t1. If the model reliability index rindex is less than the threshold t1 (that is, the variation values u1_var and u2_var are less than the threshold t1), then execute step S150, and use the average values u1_mean and u2_mean as the final output control decision parameters u1 and u2 (that is, the control decision parameter u1=mean value u1_mean, control decision parameter u2=mean value u2_mean).
若模型可靠度指標rindex大於閥值t1(即,變異值u1_var、u2_var大於閥值t1),表示必須更新代理模型預測控制單元400,則執行步驟S160,進行代理模型預測控制單元400更新的預備。而後,執行步驟S170,使用最佳化模型預測控制單元300計算控制決策參數u1、u2。而後,執行步驟S180,執行更新代理模型預測控制單元400;例如,可使用線上適應性更新方式,從線上收集關聯於目標物的行為模型的各項操作數據,藉 由此些數據對於代理模型預測控制單元400進行更新。更新後,可執行步驟S150,藉由更新後的代理模型預測控制單元400計算控制決策參數u1、u2。 If the model reliability index rindex is greater than the threshold t1 (that is, the variation values u1_var and u2_var are greater than the threshold t1), it means that the proxy model predictive control unit 400 must be updated, and step S160 is performed to prepare for updating the proxy model predictive control unit 400. Then, step S170 is executed, and the optimal model predictive control unit 300 is used to calculate the control decision parameters u1 and u2 . Then, step S180 is executed to update the agent model predictive control unit 400; for example, an online adaptive update method can be used to collect various operational data related to the behavior model of the target object from the line, by The agent model predictive control unit 400 is updated from these data. After the update, step S150 can be executed to predict the control unit 400 to calculate the control decision parameters u1 and u2 through the updated agent model.
綜上所述,本揭示之模型預測控制系統1000配合模型預測控制方法之實施,係以離線建模的模仿學習或機器學習的方式預先建立代理模型預測控制單元400;並使用代理模型預測控制單元400執行近似的模型預測控制運算,以取代最佳化模型預測控制單元300執行運算複雜度較高(因而決策週期較長)的最佳化模型預測控制的運算。因此,代理模型預測控制單元400可大幅減少運算時間,而能夠在較短的決策週期(特別是針對於自駕車約為0.1秒的較短決策週期)內計算出近似最佳解的控制決策參數。並且,代理模型預測控制單元400根據高斯程序模型執行模型預測控制運算以得到控制決策參數的平均值與變異值,並將控制決策參數的變異值作為模型可靠度指標。若模型可靠度指標大於閥值則對於代理模型預測控制單元400進行線上適應性更新。
To sum up, the model
雖然本發明已以較佳實施例及範例詳細揭露如上,可理解的是,此些範例意指說明而非限制之意義。可預期的是,所屬技術領域中具有通常知識者可想到多種修改及組合,其多種修改及組合落在本發明之精神以及後附之申請專利範圍之範圍內。 Although the present invention has been disclosed above in detail with preferred embodiments and examples, it should be understood that these examples are meant to be illustrative rather than limiting. It is expected that those skilled in the art can conceive various modifications and combinations, and the various modifications and combinations fall within the spirit of the present invention and the scope of the appended patent application.
1000:模型預測控制系統 1000: Model Predictive Control Systems
100:模型模擬單元 100:Model simulation unit
200:監測單元 200: Monitoring unit
300:最佳化模型預測控制單元 300: Optimizing Model Predictive Control Units
400:代理模型預測控制單元 400: Proxy Model Predictive Control Unit
500:更新單元 500: update unit
700:目標物 700: target
u1,u2:控制決策參數 u1, u2: control decision parameters
y1,y2:實際狀態參數 y1, y2: actual state parameters
x1,x2:估計狀態參數 x1, x2: Estimated state parameters
S:目標參數集合 S: set of target parameters
s1~sN:目標參數 s1~sN: target parameters
Claims (20)
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
TW110145943A TWI809592B (en) | 2021-12-08 | 2021-12-08 | Model prediction control system and method thereof |
CN202111547953.6A CN116243599A (en) | 2021-12-08 | 2021-12-16 | Model predictive control system and method thereof |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
TW110145943A TWI809592B (en) | 2021-12-08 | 2021-12-08 | Model prediction control system and method thereof |
Publications (2)
Publication Number | Publication Date |
---|---|
TW202324329A TW202324329A (en) | 2023-06-16 |
TWI809592B true TWI809592B (en) | 2023-07-21 |
Family
ID=86624745
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
TW110145943A TWI809592B (en) | 2021-12-08 | 2021-12-08 | Model prediction control system and method thereof |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN116243599A (en) |
TW (1) | TWI809592B (en) |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
TW201510688A (en) * | 2013-06-07 | 2015-03-16 | Scient Design Co | System and method for monitoring a process |
US20190310650A1 (en) * | 2018-04-09 | 2019-10-10 | SafeAI, Inc. | Techniques for considering uncertainty in use of artificial intelligence models |
CN111587407A (en) * | 2017-11-10 | 2020-08-25 | 辉达公司 | System and method for safe and reliable autonomous vehicle |
CN112639834A (en) * | 2018-09-05 | 2021-04-09 | 赛多利斯司特蒂姆数据分析公司 | Computer-implemented method, computer program product, and system for data analysis |
-
2021
- 2021-12-08 TW TW110145943A patent/TWI809592B/en active
- 2021-12-16 CN CN202111547953.6A patent/CN116243599A/en active Pending
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
TW201510688A (en) * | 2013-06-07 | 2015-03-16 | Scient Design Co | System and method for monitoring a process |
CN111587407A (en) * | 2017-11-10 | 2020-08-25 | 辉达公司 | System and method for safe and reliable autonomous vehicle |
US20190310650A1 (en) * | 2018-04-09 | 2019-10-10 | SafeAI, Inc. | Techniques for considering uncertainty in use of artificial intelligence models |
CN112639834A (en) * | 2018-09-05 | 2021-04-09 | 赛多利斯司特蒂姆数据分析公司 | Computer-implemented method, computer program product, and system for data analysis |
Also Published As
Publication number | Publication date |
---|---|
CN116243599A (en) | 2023-06-09 |
TW202324329A (en) | 2023-06-16 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN112668235B (en) | Robot control method based on off-line model pre-training learning DDPG algorithm | |
Waegeman et al. | Feedback control by online learning an inverse model | |
Hester et al. | RTMBA: A real-time model-based reinforcement learning architecture for robot control | |
CN105700538B (en) | Track follower method based on neural network and pid algorithm | |
Meier et al. | Towards robust online inverse dynamics learning | |
Narendra et al. | Fast Reinforcement Learning using multiple models | |
Kukker et al. | Genetic algorithm-optimized fuzzy lyapunov reinforcement learning for nonlinear systems | |
CN117114226B (en) | Intelligent dynamic optimization and process scheduling system of automation equipment | |
Ma et al. | Neural network model-based reinforcement learning control for auv 3-d path following | |
KR102266620B1 (en) | Learning system for robotic object manipulation | |
CN104898426A (en) | Room temperature loop control method based on gradient descent method and generalized prediction control | |
CN101276207A (en) | Multivariable non-linear system prediction function control method based on Hammerstein model | |
TWI809592B (en) | Model prediction control system and method thereof | |
Kim et al. | TOAST: trajectory optimization and simultaneous tracking using shared neural network dynamics | |
Li et al. | A model-free predictive control method based on polynomial regression | |
Deng et al. | Sparse online Gaussian process impedance learning for multi-DoF robotic arms | |
Farias et al. | Position control of a mobile robot using reinforcement learning | |
CN110888323A (en) | Control method for intelligent optimization of switching system | |
CN112801558B (en) | Optimization method and device of process parameter adjustment action decision model | |
Sieberg et al. | Computation time optimization of a model-based predictive roll stabilization by neuro-fuzzy systems | |
CN113219842A (en) | Mechanical arm optimal tracking control method, system, processing equipment and storage medium based on self-adaptive dynamic programming | |
JP2023027611A5 (en) | ||
Fernandez-Gauna et al. | Effect of initial conditioning of reinforcement learning agents on feedback control tasks over continuous state and action spaces | |
Samsani et al. | Rapid Autonomous Vehicle Drifting with Deep Reinforcement Learning | |
CN111781821B (en) | Parameter self-tuning method of SISO (SISO) compact-format model-free controller based on Attention mechanism cyclic neural network |