TWI829895B

TWI829895B - Model monitoring system based on health and method thereof

Info

Publication number: TWI829895B
Application number: TW109109423A
Authority: TW
Inventors: 張立昇; 李函穎
Original assignee: 中華電信股份有限公司
Priority date: 2020-03-20
Filing date: 2020-03-20
Publication date: 2024-01-21
Also published as: TW202137237A

Abstract

The present invention relates to a model monitoring system based on health and a method thereof, the method comprises: providing a prediction model based on big data; receiving a test set generated by collecting test data of the big data; formulating model health weights and calculating various indicators of the prediction model and system; summing the various indicators based on the model health weights to generate the health of the prediction model; and monitoring the prediction model, determining the status of the prediction model or the need for reconstruction based on the health of the prediction model. The invention refers to various indicators of the system and the model, calculates the health of the model and monitors the model, and can more effectively and appropriately select the best model and provide the user with the best application service.

Description

Health-based model monitoring system and method

本發明係關於模型監控之技術，尤指一種基於健康度之模型監控系統及方法。 The present invention relates to model monitoring technology, and in particular, to a health-based model monitoring system and method.

習知的模型監控系統通常是以模型準確度或是其它單一評估指標進行模型評估監控，據以挑選模型或決定重新訓練模型，惟上述習知系統採用單一模型評估指標進行模型監控，在模型評估實作層面，多以特定模型評估指標作為衡量模型之依據，但因未考量系統及模型等多重評估因子，予以監控及評估此預測模型，據以決定是否於系統中留用，故無法適應多種不同的需求條件，提供其餘使用者呼叫模型之應用服務。是以，習用之單一指標模型評估系統，在複雜的情況下，因未能參照系統及模型等多項指標，便難以從模型平台有效且合適地挑選最佳模型，實非一良善之設計，亟待予以改良。 Conventional model monitoring systems usually use model accuracy or other single evaluation indicators for model evaluation and monitoring to select models or decide to retrain models. However, the above-mentioned conventional system uses a single model evaluation index for model monitoring. In model evaluation, At the implementation level, specific model evaluation indicators are often used as the basis for measuring models. However, because multiple evaluation factors such as the system and model are not considered to monitor and evaluate the prediction model to decide whether to retain it in the system, it cannot adapt to a variety of different models. Based on the demand conditions, it provides application services for other users to call the model. Therefore, in complex situations, the commonly used single-index model evaluation system cannot effectively and appropriately select the best model from the model platform because it fails to refer to multiple indicators such as the system and the model. It is not a good design and is in urgent need of be improved.

由上可知，若能找出一種模型監控機制，特別是能多重評估因子系統化並且能自動化計算健康度，藉此持續進行模型監控，此將成為本技術領域人員急欲追求解決方案之目標。 It can be seen from the above that if a model monitoring mechanism can be found, especially one that can systemize multiple evaluation factors and automatically calculate health, so as to continuously monitor the model, this will become a goal that those in the technical field are eager to pursue a solution.

本發明之目的係提出一種模型監控機制，透過採用更廣義之模型評估方式，參照多種系統及模型相關重要指標，系統化、自動化計算模型健康度並據以監控模型，並因應複雜情況有效且合適挑選最佳模型，藉以改善現有技術僅以模型單一評估指標監控模型的缺點。 The purpose of the present invention is to propose a model monitoring mechanism that uses a broader model evaluation method and refers to a variety of systems and model-related important indicators to systematically and automatically calculate the health of the model and monitor the model accordingly, and is effective and appropriate in response to complex situations. Select the best model to improve the shortcomings of the existing technology that only monitors the model with a single evaluation index.

為達到上述目的與其他目的，本發明係提出一種基於健康度之模型監控系統，包括：模型管理模組，用以基於大數據產生預測模型；資料蒐集模組，用以蒐集大數據測試資料以生成測試集；模型健康度模組，係用以接收來自該模型管理模組之該預測模型及來自該資料蒐集模組之該測試集，制定模型健康度權重以及計算該預測模型及系統之各項指標，將該各項指標依據該模型健康度權重加總以產生該預測模型之健康度；以及模型監控模組，係連結該模型健康度模組，用以接收該預測模型之健康度，以據之監控該預測模型、決定該預測模型之狀態或重建需求。 In order to achieve the above objects and other objects, the present invention proposes a health-based model monitoring system, including: a model management module to generate a prediction model based on big data; a data collection module to collect big data test data to Generate a test set; the model health module is used to receive the prediction model from the model management module and the test set from the data collection module, formulate model health weights and calculate various parameters of the prediction model and the system. An indicator is used to sum up the indicators according to the health degree weight of the model to generate the health degree of the prediction model; and the model monitoring module is connected to the model health degree module to receive the health degree of the prediction model, This is used to monitor the prediction model and determine the status or reconstruction needs of the prediction model.

於上述系統中，該模型健康度權重係依據其他外部數據制定，或是自行制定，或是採取上述兩者混合之方式制定。 In the above system, the model health weight is formulated based on other external data, or is formulated by itself, or a mixture of the above two methods is adopted.

於上述系統中，該各項指標包括模型評估指標、系統效能指標或熱門評估指標或其它評估指標，且該模型健康度模組係採用上述至少二個指標以進行健康度計算。 In the above system, the various indicators include model evaluation indicators, system performance indicators, popular evaluation indicators or other evaluation indicators, and the model health module uses at least two of the above indicators to calculate health.

於上述系統中，該模型評估指標係透過該預測模型及該測試集，使用管制圖、準確率或其它方式之至少一種方式評估該預測模型，以產生模型評估結果；該系統效能指標係於系統中紀錄並計算該預測模型之平均反應時間、吞吐量、穩定性或其它系統效能之其中至少一者，以產生系統效能評估結果；該熱門評估指標係於系統中紀錄並計算該預測模型之呼叫次數、使用人數、服務時間或其它有關模型熱門程度之其中至少一者，以產生熱門評估結果。 In the above system, the model evaluation index is to evaluate the prediction model through the prediction model and the test set using at least one method of control chart, accuracy or other methods to generate a model evaluation result; the system performance index is in the system Record and calculate at least one of the average response time, throughput, stability or other system performance of the prediction model to generate System performance evaluation results; the popularity evaluation indicator records and calculates at least one of the number of calls, number of users, service time or other related model popularity in the system to generate popularity evaluation results.

本發明復提出一種基於健康度之模型監控方法，包括：提供基於大數據所產生之預測模型；接收蒐集大數據測試資料所產生之測試集；制定模型健康度權重以及計算該預測模型及系統之各項指標；將該各項指標依據該模型健康度權重加總以產生該預測模型之健康度；以及依據該預測模型之健康度監控該預測模型、決定該預測模型之狀態或重建需求。 The present invention proposes a health-based model monitoring method, which includes: providing a prediction model based on big data; receiving a test set generated by collecting big data test data; formulating model health weights and calculating the prediction model and system Each indicator; sum up each indicator according to the health degree weight of the model to generate the health degree of the prediction model; and monitor the prediction model according to the health degree of the prediction model and determine the status or reconstruction needs of the prediction model.

於上述方法中，該制定模型健康度權重之步驟係包括依據其他外部數據以制定該各項指標之模型健康度權重，或是自行制定該各項指標之模型健康度權重，或是採取上述兩者混合之方式以制定該各項指標之模型健康度權重。 In the above method, the step of formulating the model health weight includes formulating the model health weight of each indicator based on other external data, or formulating the model health weight of each indicator by oneself, or adopting the above two methods. A mixed method is used to formulate the model health weight of each indicator.

於上述方法中，該各項指標包括模型評估指標、系統效能指標或熱門評估指標或其它指標，且該模型健康度模組係採用上述至少二個指標以進行健康度計算。 In the above method, the various indicators include model evaluation indicators, system performance indicators, popular evaluation indicators or other indicators, and the model health module uses at least two of the above indicators to calculate health.

於上述方法中，該模型評估指標係透過該預測模型及該測試集，使用管制圖、準確率或其它方式之至少一種方式評估該預測模型，以產生模型評估結果；該系統效能指標係於系統中紀錄並計算該預測模型之平均反應時間、吞吐量、穩定性或其它系統效能之其中至少一者，以產生系統效能評估結果；該熱門評估指標係於系統中紀錄並計算該預測模型之呼叫次數、使用人數、服務時間或其它有關模型熱門程度之其中至少一者，以產生熱門評估結果。 In the above method, the model evaluation index is to evaluate the prediction model through the prediction model and the test set using at least one method of control chart, accuracy or other methods to generate a model evaluation result; the system performance index is based on the system Record and calculate at least one of the average response time, throughput, stability or other system performance of the prediction model to generate system performance evaluation results; the popular evaluation indicators are recorded in the system Record and calculate at least one of the number of calls, number of users, service time or other related model popularity of the prediction model to generate popularity evaluation results.

綜上可知，本發明所述之基於健康度之模型監控系統及其方法，透過模型管理模組產生預測模型並上線，運用資料蒐集模組蒐集測試集，當累積足夠的測試集，經由制定模型健康度權重以及計算各項指標，將各項指標依模型健康度權重加總以產生該預測模型之健康度，再運用模型監控模組依據該預測模型之健康度來監控模型，直到取得新的測試集，再進行下一輪次之健康度計算及模型監控，由上可知，本發明參照系統及模型等各項指標，計算模型健康度並據以監控模型，更能有效且合適地挑選最佳模型，提供使用者最佳的應用服務，本發明改善現有技術僅以模型單一評估指標監控模型的缺點，且本發明在考量系統及模型等不同需求條件下，參照多重評估因子系統化、自動化計算健康度，據以持續進行模型監控，並能因應複雜情況有效且合適挑選最佳模型。 In summary, it can be seen that the health-based model monitoring system and method of the present invention generates a prediction model through the model management module and puts it online, and uses the data collection module to collect test sets. When sufficient test sets are accumulated, the model is formulated Health weight and calculation of various indicators, sum up the indicators according to the model health weight to generate the health of the prediction model, and then use the model monitoring module to monitor the model according to the health of the prediction model until a new Test set, and then perform the next round of health calculation and model monitoring. From the above, it can be seen that the present invention refers to various indicators such as the system and model to calculate the model health and monitor the model accordingly, which can more effectively and appropriately select the best model to provide users with the best application services. The present invention improves the shortcomings of the existing technology that only monitors the model with a single evaluation index. Furthermore, the present invention systematically and automatically calculates multiple evaluation factors while considering different requirements such as the system and the model. Healthy, based on which the model can be continuously monitored and the best model can be effectively and appropriately selected in response to complex situations.

1:基於健康度之模型監控系統 1: Health-based model monitoring system

11:模型管理模組 11:Model management module

12:資料蒐集模組 12:Data collection module

13:模型健康度模組 13: Model health module

14:模型監控模組 14: Model monitoring module

S21~S25:步驟 S21~S25: steps

S31~S38:流程 S31~S38: Process

S41~S44:流程 S41~S44: Process

第1圖為本發明之基於健康度之模型監控系統的系統架構圖； Figure 1 is a system architecture diagram of the health-based model monitoring system of the present invention;

第2圖為本發明之基於健康度之模型監控方法的步驟圖； Figure 2 is a step diagram of the health-based model monitoring method of the present invention;

第3圖為本發明之基於健康度之模型監控方法於一實施例的流程圖；以及 Figure 3 is a flow chart of the health-based model monitoring method in one embodiment of the present invention; and

第4圖為本發明之基於健康度之模型監控方法中運用管制圖呈現模型評估結果的流程圖。 Figure 4 is a flow chart of using control charts to present model evaluation results in the health-based model monitoring method of the present invention.

以下藉由特定的具體實施形態說明本發明之技術內容，熟悉此技藝之人士可由本說明書所揭示之內容輕易地瞭解本發明之優點與功效。然本發明亦可藉由其他不同的具體實施形態加以施行或應用。 The following describes the technical content of the present invention through specific embodiments. Those familiar with the art can easily understand the advantages and effects of the present invention from the content disclosed in this specification. However, the present invention can also be implemented or applied through other different specific implementation forms.

第1圖為本發明之基於健康度之模型監控系統的系統架構圖。如圖所示，本發明之基於健康度之模型監控系統1可參照多種系統及模型相關重要指標，系統化、自動化計算模型健康度並據以監控模型，其中，基於健康度之模型監控系統1係包括模型管理模組11、資料蒐集模組12、模型健康度模組13以及模型監控模組14。 Figure 1 is a system architecture diagram of the health-based model monitoring system of the present invention. As shown in the figure, the health-based model monitoring system 1 of the present invention can refer to a variety of systems and model-related important indicators to systematically and automatically calculate the model health and monitor the model accordingly. Among them, the health-based model monitoring system 1 The system includes a model management module 11, a data collection module 12, a model health module 13 and a model monitoring module 14.

模型管理模組11用以基於大數據產生預測模型。簡言之，模型管理模組11係產生待預測之模型並上線，並提供模型訓練評估及部署上線之功能。 The model management module 11 is used to generate prediction models based on big data. In short, the model management module 11 generates the model to be predicted and puts it online, and provides the functions of model training, evaluation and deployment.

資料蒐集模組12用以蒐集大數據測試資料以生成測試集。資料蒐集模組12主要用於蒐集大數據測試資料，並檢視是否累積足夠的測試集決定流程，若已累積足夠的測試集時，則啟動模型健康度模組13執行後續程序，反之，若未能累積足夠的測試集時，則由資料蒐集模組12持續蒐集測試集。 The data collection module 12 is used to collect big data test data to generate a test set. The data collection module 12 is mainly used to collect big data test data and check whether sufficient test sets have been accumulated to determine the process. If enough test sets have been accumulated, the model health module 13 is started to perform subsequent procedures. Otherwise, if not, When enough test sets can be accumulated, the data collection module 12 continues to collect test sets.

模型健康度模組13用以接收來自該模型管理模組11之預測模型及來自該資料蒐集模組12之測試集，並透過制定模型健康度權重以及計算該預測模型及系統之各項指標，將該各項指標依據該模型健康度權重加總以產生該預測模型之健康度。 The model health module 13 is used to receive the prediction model from the model management module 11 and the test set from the data collection module 12, and by formulating the model health weight and calculating various indicators of the prediction model and system, The various indicators are summed according to the healthiness weight of the model to generate the healthiness of the prediction model.

於一實施例中，模型健康度權重可依據其他外部數據制定，或是自行制定，或是採取上述兩者混合之方式制定。詳言之，模型健康度權重之制定可包括客觀方式和預設方式，客觀方式表示依據實際情況，像是其他外部數據，例如參酌google進行搜尋的統計結果，進行模型健康度權重之制定，而預設方式則自行進行模型健康度權重之制定。 In one embodiment, the model health weight can be formulated based on other external data, or can be formulated by itself, or a mixture of the above two methods can be used. Specifically, the formulation of model health weights can include objective methods and preset methods. The objective method means that the model health weights are formulated based on actual conditions, such as other external data, such as referring to the statistical results of Google searches, and The default method is to set the model health weight by yourself.

於一實施例中，各項指標可包括模型評估指標、系統效能指標、熱門評估指標或其他評估指標，該模型健康度模組13係採用上述至少二個指標以進行健康度計算。 In one embodiment, various indicators may include model evaluation indicators, system performance indicators, popular evaluation indicators or other evaluation indicators. The model health module 13 uses at least two of the above indicators to perform health calculations.

前述之模型評估指標係透過該預測模型及該測試集，使用管制圖、準確率或其它方式之至少一種方式評估該預測模型，以產生模型評估結果；前述之系統效能指標係於系統中紀錄並計算該預測模型之平均反應時間、吞吐量、穩定性或其它系統效能之其中至少一者，以產生系統效能評估結果；另外，前述之熱門評估指標係於系統中紀錄並計算該預測模型之呼叫次數、使用人數、服務時間或其它有關模型熱門程度之其中至少一者，以產生熱門評估結果。 The aforementioned model evaluation indicators are used to evaluate the prediction model through the prediction model and the test set, using at least one method of control chart, accuracy or other methods to generate model evaluation results; the aforementioned system performance indicators are recorded in the system and Calculate at least one of the average response time, throughput, stability or other system performance of the prediction model to generate system performance evaluation results; in addition, the aforementioned popular evaluation indicators are recorded in the system and calculate the calls of the prediction model At least one of the number of times, number of users, service time or other related model popularity to generate popularity evaluation results.

模型監控模組14連結該模型健康度模組13，係用以接收該預測模型之健康度，並據以監控該預測模型、決定該預測模型之狀態或重建需求。 The model monitoring module 14 is connected to the model health module 13 and is used to receive the health of the prediction model, monitor the prediction model, and determine the status or reconstruction needs of the prediction model.

由上可知，本發明之基於健康度之模型監控系統主要由模型健康度模組13及模型監控模組14來進行模型健康度評估，其中，模型健康度模組13傳遞模型健康度至模型監控模組14，當模型監控模組14發出健康度計算之請求時，模型健康度模組13將回傳健康度計算結果，其中，模型健康度權重可依據外部數據制定，或是自行制定，亦可採取上述兩者混合方式制定，計算各項指標包括系統及模型之各項評估指標，例如為模型評估指標、系統效能指標、熱門評估指標或其它評估指標，並採計至少二指標進行健康度計算。 It can be seen from the above that the health-based model monitoring system of the present invention mainly uses the model health module 13 and the model monitoring module 14 to evaluate the model health, wherein the model health module 13 transmits the model health to the model monitoring Module 14, when the model monitoring module 14 sends a request for health calculation, the model health module 13 will return the health calculation result, where, The model health weight can be formulated based on external data, or it can be formulated by itself, or it can be formulated using a mixture of the above two methods. The calculated indicators include various evaluation indicators of the system and model, such as model evaluation indicators, system performance indicators, popular Evaluation indicators or other evaluation indicators, and adopt at least two indicators for health calculation.

第2圖為本發明之基於健康度之模型監控方法的步驟圖。 Figure 2 is a step diagram of the health-based model monitoring method of the present invention.

於步驟S21，係提供基於大數據所產生之預測模型。於本步驟中，可透過如第1圖之模型管理模組依據大數據產生預測模型並上線。 In step S21, a prediction model generated based on big data is provided. In this step, the prediction model can be generated based on big data through the model management module as shown in Figure 1 and put online.

於步驟S22，係接收蒐集大數據測試資料所產生之測試集。於本步驟中，可透過如第1圖之資料蒐集模組執行大數據測試資料之蒐集，以產生測試集。 In step S22, a test set generated by collecting big data test data is received. In this step, the big data test data can be collected through the data collection module as shown in Figure 1 to generate a test set.

於步驟S23，係制定模型健康度權重以及計算該預測模型及系統之各項指標。於本步驟中，可透過如第1圖之模型健康度模組來制定模型健康度權重，並計算預測模型及系統之各項指標。 In step S23, the model health weight is formulated and various indicators of the prediction model and system are calculated. In this step, the model health weight can be formulated through the model health module as shown in Figure 1, and various indicators of the prediction model and system can be calculated.

於一實施例中，該各項指標可為模型評估指標、系統效能指標或熱門評估指標，且該模型健康度模組係採用上述至少二個指標以進行健康度計算。具體來說，模型評估指標可透過預測模型及測試集，使用管制圖、準確率或其它方式之至少一種方式評估該預測模型，以產生模型評估結果；系統效能指標可於系統中紀錄並計算該預測模型之平均反應時間、吞吐量、穩定性或其它系統效能之其中至少一者，以產生系統效能評估結果；熱門評估指標可於系統中紀錄並計算該預測模型之呼叫次數、使用人數、服務時間或其它有關模型熱門程度之其中至少一者，以產生熱門評估結果。 In one embodiment, the various indicators may be model evaluation indicators, system performance indicators or popular evaluation indicators, and the model health module uses at least two of the above indicators to perform health calculations. Specifically, the model evaluation index can use at least one method of control chart, accuracy or other methods to evaluate the prediction model through the prediction model and the test set to generate the model evaluation result; the system performance index can be recorded and calculated in the system. At least one of the average response time, throughput, stability or other system performance of the prediction model is used to generate system performance evaluation results; popular evaluation indicators can be recorded and calculated in the system such as the number of calls, number of users, and services of the prediction model At least one of time or other related model popularity to generate popularity evaluation results.

具體來說，本步驟中所述之制定模型健康度權重，可包括依據實際狀況(例如其他外部數據)以制定該各項指標之模型健康度權重，或是自行制定該各項指標之模型健康度權重，或是採取上述兩者混合之方式以制定該各項指標之模型健康度權重。 Specifically, formulating the model health weights described in this step may include formulating the model health weights for each indicator based on actual conditions (such as other external data), or formulating the model health weights for each indicator by ourselves. degree weight, or a mixture of the above two methods is used to formulate the model health weight of each indicator.

於步驟S24，係將該各項指標依據該模型健康度權重加總以產生該預測模型之健康度。於本步驟中，透過計算系統及模型之各項指標，依模型健康度權重加總以及利用模型健康度計算公式，最終計算出模型健康度，關於模型健康度計算公式後面會再詳述。 In step S24, the various indicators are summed according to the healthiness weight of the model to generate the healthiness of the prediction model. In this step, the model health is finally calculated by calculating various indicators of the system and model, summing up the model health weights, and using the model health calculation formula. The model health calculation formula will be described in detail later.

於步驟S25，係依據該預測模型之健康度監控該預測模型、決定該預測模型之狀態或重建需求。於本步驟中，透過如第1圖之模型監控模組依健康度來監控模型，進而達到監控預測模型之目的，或是進一步決定模型狀態及模型重建需求，即可達到有效且合適挑選最佳模型之功效。 In step S25, the prediction model is monitored according to the health of the prediction model, and the status or reconstruction needs of the prediction model are determined. In this step, the model monitoring module as shown in Figure 1 is used to monitor the model according to its health, so as to achieve the purpose of monitoring the prediction model, or to further determine the model status and model reconstruction needs, so as to achieve effective and appropriate selection of the best model. The efficacy of the model.

另外，可視是否取得新的測試集決定流程，當已取得新的測試集時，可啟動如第1圖之資料蒐集模組持續蒐集測試集，並執行新的模型評估。 In addition, the decision process depends on whether a new test set is obtained. When a new test set is obtained, the data collection module as shown in Figure 1 can be started to continuously collect the test set and perform new model evaluation.

第3圖為本發明之基於健康度之模型監控方法於一實施例的流程圖。 Figure 3 is a flow chart of the health-based model monitoring method in one embodiment of the present invention.

於流程S31，係模型管理模組產生模型並上線。本流程係產生待預測之模型，並提供模型訓練評估及部署上線之功能。 In process S31, the model management module generates the model and goes online. This process generates the model to be predicted and provides the functions of model training, evaluation, and deployment.

於流程S32，係資料蒐集模組蒐集測試集。本流程係蒐集大數據測試資料，接著進入流程S33，將視是否累積足夠的測試集決定流程，當已累積足夠的測試集時，將啟動模型健康度模組，執行後續的模型健康度權重之制定以及各項指標之計算，反之，若未能累積足夠的測試集時，將啟動資料蒐集模組持續蒐集測試集。 In process S32, the data collection module collects the test set. This process collects big data test data, and then enters process S33. The process will be determined based on whether sufficient test sets have been accumulated. When enough test sets have been accumulated, the model health module will be started to perform subsequent model health. The formulation of degree weights and the calculation of various indicators. On the contrary, if insufficient test sets cannot be accumulated, the data collection module will be activated to continue collecting test sets.

於流程S34，係模型健康度模組制定模型健康度權重。於本流程中，制定模型健康度權重可依客觀方式制定各項指標之模型健康度權重，或自行制定各模型健康度權重，亦或採取上述兩者之混合方式制定各模型健康度權重，這裡所述之客觀方式即是依據實際情況進行設定。 In process S34, the model health module formulates the model health weight. In this process, the model health weight can be formulated by objectively formulating the model health weight of each indicator, or by formulating the health weight of each model by oneself, or by using a mixture of the above two methods to formulate the health weight of each model. Here The objective method described is based on the actual situation.

於流程S35，係模型健康度模組計算各項指標。於本流程中，各項指標係指系統及模型之各項評估指標，可為模型評估指標、系統效能指標、熱門評估指標或其它評估指標，並採計至少二指標進行健康度計算，藉由參照系統及模型等各項指標，因應複雜情況，達到有效且合適地挑選最佳模型之功效，可提供使用者最佳的應用服務。 In process S35, the model health module calculates various indicators. In this process, each indicator refers to various evaluation indicators of the system and model, which can be model evaluation indicators, system performance indicators, popular evaluation indicators or other evaluation indicators, and at least two indicators are used for health calculation. By referring to various indicators such as systems and models, we can effectively and appropriately select the best model in response to complex situations, and provide users with the best application services.

於一實施例中，關於前述之模型評估指標，可透過設置用以接收基於大數據產生之預測模型及蒐集而得之測試集，評估此預測模型並產生模型評估結果，其中，模型評估結果可使用管制圖、準確率或其它方式其中至少一種方式產生評估結果。 In one embodiment, the aforementioned model evaluation indicators can be configured to receive a prediction model generated based on big data and a test set collected, evaluate the prediction model and generate a model evaluation result, where the model evaluation result can be The evaluation results are generated using at least one of control charts, accuracy, or other methods.

於另一實施例中，關於前述之系統效能指標，可透過設置用以接收基於大數據產生之預測模型，於系統中紀錄並計算有關此模型預測之平均反應時間、吞吐量、穩定性或其它系統效能的其中至少一者。 In another embodiment, the aforementioned system performance indicators can be configured to receive a prediction model generated based on big data, and the average response time, throughput, stability or other predictions related to this model can be recorded and calculated in the system. At least one of system performance.

於再一實施例中，關於前述之熱門評估指標，可透過設置用以接收基於大數據產生之預測模型，於系統中紀錄並計算有關此模型預測之呼叫次數、使用人數、服務時間或其它有關模型熱門程度的其中至少一者。 In yet another embodiment, the aforementioned popular evaluation indicators can be configured to receive a prediction model generated based on big data, and the system can record and calculate the number of calls, number of users, service time or other related predictions predicted by this model. At least one of the model popularity levels.

於流程S36，係模型健康度模組依權重加總產生健康度。於本步驟中，可透過計算系統及模型之各項指標，依健康度權重加總，並利用模型健康度計算公式產生模型健康度。 In process S36, the model health module generates health based on weighted summation. In this step, various indicators of the system and model can be calculated, summed according to the health weight, and the model health calculation formula can be used to generate the model health.

於流程S37，係模型監控模組依健康度監控模型。於本步驟中，模型監控模組設置用以接收預測模型之模型健康度，據以監控此預測模型，決定模型狀態及模型重建需求。 In process S37, the model monitoring module monitors the model according to the health level. In this step, the model monitoring module is configured to receive the model health of the prediction model, monitor the prediction model accordingly, and determine the model status and model reconstruction requirements.

上述六個流程中，流程S35透過模型健康度模組計算系統及模型等各項指標，搭配流程S34之制定模型健康度權重，並由流程S36自動化產生健康度，由於此一模型健康度係參照多種系統及模型相關重要指標，即可因應複雜情況，系統化、自動化計算健康度，最後，透過流程S37之模型監控模組依健康度監控模型，即可達到有效且合適挑選最佳模型之功效。 Among the above six processes, process S35 calculates various indicators such as system and model through the model health module, and cooperates with the formulating model health weight in process S34, and the health degree is automatically generated by process S36, because this model health is based on the reference A variety of system and model-related important indicators can be used to systematically and automatically calculate health in response to complex situations. Finally, through the model monitoring module of process S37, the model is monitored based on health, which can achieve the effect of effectively and appropriately selecting the best model. .

最後，上述流程可依序執行，直到取得新的測試集，再進行下一輪次之健康度計算及模型監控，亦即於流程S38，將視是否取得新的測試集決定流程，當已取得新的測試集時，將啟動資料蒐集模組持續蒐集測試集，當未能取得新的測試集時，將結束整體流程。 Finally, the above process can be executed in sequence until a new test set is obtained, and then the next round of health calculation and model monitoring is performed. That is, in process S38, the process will be decided based on whether a new test set has been obtained. When a new test set has been obtained, When the test set is obtained, the data collection module will be started to continuously collect the test set. When a new test set cannot be obtained, the entire process will be ended.

以下揭露本發明之基於健康度之模型監控方法的實施例，請一併搭配第3圖。本實施例採用監督式機器學習演算法，使用決策樹(Decision tree)演算法進行模型訓練並上線，考慮模型評估、系統效能及熱門程度等需求條件，分別訂定模型評估指標、系統效能指標、熱門評估指標等三項重要指標，以提升預測準確率、降低模型反應時間及掌握熱門模型為目標，線性加權計算而得到健康度，據以監控模型，以期在模型準確率下降、系統效能下降、熱門程度下降或準確率上升但系統效能下降等複雜的組合條件下，仍能下架不合適的模型，有效提升模型評估監控之效能。 The following discloses an embodiment of the health-based model monitoring method of the present invention, please refer to Figure 3 together. This embodiment adopts a supervised machine learning algorithm and uses a decision tree algorithm to train the model and put it online. Taking into account demand conditions such as model evaluation, system performance, and popularity, model evaluation indicators, system performance indicators, and Three important indicators, including popular evaluation indicators, are aimed at improving prediction accuracy, reducing model response time, and mastering popular models. The health degree is obtained by linear weighting calculation, which is used to monitor the model in order to ensure that the model is accurate. Under complex combination conditions such as declining rate, declining system performance, declining popularity, or rising accuracy but declining system performance, inappropriate models can still be removed from the shelves, effectively improving the performance of model evaluation and monitoring.

首先，如流程S31，由模型管理模組產生模型並上線，瀏覽與管理訓練後的模型，進而部署上線。 First, as shown in process S31, the model management module generates a model and goes online, browses and manages the trained model, and then deploys and goes online.

接著，如流程S32，透過資料蒐集模組蒐集測試集，持續蒐集新的測試集資料並即時方式上傳測試集，舉例來說，可利用即時方式於線上設定評估演算法，系統會立即呼叫模型健康度模組制定模型健康度權重及計算各項指標，以利健康度計算及模型監控，其中，即時方式上傳測試集亦可改採定期方式，利用定期方式設定系統排程所需之評估演算法、評估起迄時間及頻率，並指定定期更新測試集後，系統亦會將上述資訊提供模型健康度模組。另外，如流程S33，將視是否累積足夠的測試集決定流程，當已累積足夠的測試集時，將啟動模型健康度模組制定模型健康度權重及計算各項指標，當未能累積足夠的測試集時，將由資料蒐集模組持續蒐集測試集。 Then, as in process S32, the test set is collected through the data collection module, new test set data is continuously collected, and the test set is uploaded in real time. For example, the evaluation algorithm can be set online in real time, and the system will immediately call the model health The degree module formulates the model health weight and calculates various indicators to facilitate health calculation and model monitoring. Among them, the real-time method can also be used to upload the test set to the periodic method, and the periodic method can be used to set the evaluation algorithm required for the system schedule. , evaluate the start and end time and frequency, and specify that the test set should be updated regularly, the system will also provide the above information to the model health module. In addition, as in process S33, the decision process will be based on whether sufficient test sets have been accumulated. When enough test sets have been accumulated, the model health module will be started to formulate model health weights and calculate various indicators. When sufficient test sets are not accumulated, When testing the set, the data collection module will continue to collect the test set.

本實施例的數據屬於監督式機器學習演算法中的分類預測，於此蒐集12組穩定的測試集(S ₁,S ₂,...,S ₁₂)，其中每一組測試集S _i均包含大量數據，後續將使用預測模型，每當累積一組足夠的測試集時，即依線上即時方式取得一個模型評估結果Weighted F1 score，同時系統會立即呼叫模型健康度模組計算模型評估指標，以利健康度計算及模型監控。詳言之，Weighted F1 score為望大品質特性，故當Weighted F1 score越高時，表示模型評估結果較佳。據此，本實施例將依序獲得12個模型評估結果Weighted F1 score(X ₁,X ₂,...,X ₁₂)，以供後續計算模型評估指標使用。 The data in this embodiment belongs to the classification prediction in the supervised machine learning algorithm. Here, 12 sets of stable test sets ( S ₁ , S ₂ ,..., S ₁₂ ) are collected, in which each set of test sets S _i Contains a large amount of data, and the prediction model will be used in the future. Whenever a sufficient test set is accumulated, a model evaluation result Weighted F1 score will be obtained online in real time. At the same time, the system will immediately call the model health module to calculate the model evaluation index. To facilitate health calculation and model monitoring. Specifically, the Weighted F1 score is a high-quality characteristic, so when the Weighted F1 score is higher, it means that the model evaluation result is better. Accordingly, this embodiment will sequentially obtain 12 model evaluation results Weighted F1 score ( X ₁ , X ₂ ,..., X ₁₂ ) for subsequent calculation of model evaluation indicators.

接著，如流程S34，模型健康度模組制定模型健康度權重，設置用以接收一基於大數據產生之預測模型及蒐集而得之測試集，並採用客觀方式制定各項指標之模型健康度權重，其中各指標值及各權重值均介於0、1之間，且各權重值總和為1，制定方式可改為採用預設方式，即依據各項指標之重要程度來制定各模型健康度權重，亦或是採用客觀及預設之混合方式制定各模型健康度權重，藉由參照系統及模型等各項指標，因應複雜情形，達到有效且合適地挑選最佳模型之功效，可提供使用者最佳的應用服務。 Next, as in process S34, the model health module formulates the model health weight, is configured to receive a prediction model generated based on big data and the collected test set, and uses an objective method to formulate the model health weight of each indicator. , where each indicator value and each weight value are between 0 and 1, and the sum of each weight value is 1. The formulation method can be changed to the default method, that is, the health of each model is formulated according to the importance of each indicator. Weights, or a mixed method of objective and preset methods to determine the health weight of each model. By referring to various indicators such as the system and model, in response to complex situations, the effect of effectively and appropriately selecting the best model can be provided. the best application services.

模型健康度模組計算之各項指標可包括模型評估指標、系統效能指標、熱門評估指標等三指標，制定模型健康度權重共有三個權重值W ₁ 、W ₂ 、W ₃，並依客觀方式制定一組預設權重值(W ₁,W ₂,W ₃)，其中，各權重值W ₁ 、W ₂ 、W ₃均介於0、1之間且其權重值總和為1，可供後續依據模型健康度權重以線性加總來產生健康度。本實施例係運用Google網站分別使用model monitor、average response time、api call等三關鍵字蒐尋，獲蒐尋次數分別為665000000次、362000000次、522000000次，標準化後權重值為(W ₁,W ₂,W ₃)=(0.43,0.23,0.34)，後續可依此權重值組合計算模型健康度，上述是採用客觀方式來制定；於又一實施例中，系統可依預設方式來制定一組預設權重值(W ₁,W ₂,W ₃)=(0.64,0.12,0.24)；於另一實施例中，亦可客觀權重值及預設權重值混合，舉例來說：客觀權重值(W ₁,W ₂,W ₃)=(0.43,0.23,0.34)，預設權重值(W ₁,W ₂,W ₃)=(0.64,0.12,0.24)，採用比例=0.6：0.4，針對客觀權重值及預設權重值進行加權，即健康度權重=(0.43,0.23,0.34)×0.6+(0.64,0.12,0.24)×0.4=(0.51,0.19,0.30)。 The various indicators calculated by the model health module can include three indicators such as model evaluation indicators, system performance indicators, and popular evaluation indicators. There are three weight values W ₁ , W ₂ , and W ₃ for formulating the model health weight, and based on an objective method Develop a set of preset weight values ( W ₁ , W ₂ , W ₃ ), in which each weight value W ₁ , W ₂ , W ₃ is between 0 and 1 and the sum of their weight values is 1, which can be used for subsequent Healthiness is generated by linear summation based on model health weights. In this embodiment, the Google website is used to search using three keywords such as model monitor, average response time, and api call. The number of searches obtained are 665,000,000 times, 362,000,000 times, and 522,000,000 times respectively. The weight values after normalization are ( W ₁ , W ₂ , W ₃ )=(0.43,0.23,0.34), the model health can be calculated based on this combination of weight values later. The above is formulated in an objective way; in another embodiment, the system can formulate a set of presets in a preset way. Assume that the weight value ( W ₁ , W ₂ , W ₃ ) = (0.64, 0.12, 0.24); in another embodiment, the objective weight value and the default weight value can also be mixed, for example: the objective weight value ( W ₁ , W ₂ , W ₃ )=(0.43,0.23,0.34), the default weight value ( W ₁ , W ₂ , W ₃ )=(0.64,0.12,0.24), using the ratio=0.6:0.4, for the objective weight Values and preset weight values are weighted, that is, health weight = (0.43, 0.23, 0.34) × 0.6 + (0.64, 0.12, 0.24) × 0.4 = (0.51, 0.19, 0.30).

接著，如流程S35，模型健康度模組計算各項指標，其中各項指標分別為模型評估指標、系統效能指標、熱門評估指標，藉由參照系統及模型等各項指標，因應複雜情況，達到有效且合適地挑選最佳模型之功效，可提供使用者最佳的應用服務。下面將依序揭露本實施例中模型評估指標、系統效能指標、熱門評估指標等三項之計算方式。 Then, as in process S35, the model health module calculates various indicators, among which the indicators are model evaluation indicators, system performance indicators, and popular evaluation indicators. By referring to various indicators such as the system and model, in response to complex situations, we can achieve Effectively and appropriately selecting the best model can provide users with the best application services. The calculation methods of the model evaluation index, system performance index, and popular evaluation index in this embodiment will be disclosed in sequence below.

當計算模型評估指標時，可設置用以接收一基於大數據產生之預測模型及蒐集而得之測試集，評估此預測模型並產生模型評估結果，其中，模型評估結果可使用管制圖、準確率或其它方式其中至少一種方式產生評估結果。 When calculating model evaluation indicators, it can be configured to receive a prediction model generated based on big data and a collected test set, evaluate the prediction model and generate model evaluation results. The model evaluation results can use control charts, accuracy rates or other methods, at least one of which produces evaluation results.

參照模型評估指標並據以計算健康度即能有效地挑選最佳模型。系統會針對穩定的測試集，根據預測模型取得模型評估結果，使用管制圖且採計特定筆數資料，參照統計製程管制(Statistical Process Control，SPC)選用合適的管制圖進行繪製。另外，定期修正管制界限請參照第4圖，於流程S41，係判斷管制圖判讀規則，也就是根據管制圖判讀規則選擇至少一項以判斷異常值，於流程S42，係屏除異常值，即依據管制圖判讀規則判斷異常值予以屏除，於流程S43，係計算管制界限，即依據屏除異常值後的模型評估結果重新計算管制界限，於流程S44，係繪製管制圖，也就是將模型評估結果及管制界限繪製於管制圖中。接著，根據模型評估結果為望大品質特性或望小品質特性，計算此實施例之模型評估指標。上述採用特定筆數之作法亦可採計上線至今之筆數。 The best model can be effectively selected by referring to the model evaluation metrics and calculating health based on them. The system will obtain model evaluation results based on the prediction model for a stable test set, use control charts and collect specific number of data, and select appropriate control charts for drawing based on Statistical Process Control (SPC). In addition, please refer to Figure 4 for regular correction of control limits. In process S41, the control chart interpretation rules are judged, that is, at least one item is selected to determine abnormal values according to the control chart interpretation rules. In process S42, abnormal values are screened out, that is, based on The control chart interpretation rules determine that the abnormal values should be eliminated. In process S43, the control limits are calculated, that is, the control limits are recalculated based on the model evaluation results after eliminating the outliers. In process S44, the control chart is drawn, that is, the model evaluation results and Regulatory boundaries are drawn on control charts. Then, according to whether the model evaluation result is a large quality characteristic or a small quality characteristic, the model evaluation index of this embodiment is calculated. The above-mentioned method of using a specific number of transactions can also adopt the number of transactions since the launch.

當流程S32運用資料蒐集模組蒐集測試集且蒐集r組穩定的測試集後，使用分類預測模型來取得模型評估結果Weighted F1 score(X _i，其中i=1,2,…,r)，並選定計算管制界限所採計的資料筆數n筆，根據模型評估結果Weighted F1 score屬於計量型數據，且取樣大小為單一觀測值，參照SPC選擇X-MR管制圖，以其各前後移動資料絕對值差距的平均(即

，其中R _i=|X _i-X _i-1|)，乘以E ₂(參照管制圖係數表，由於移動資料R _i組內樣本數為2，故選用E ₂=2.66)，當作3個標準差的認定標準，即X-MR管制圖之中心線(CL)與管制上限(UCL)及管制下限(LCL)之距離皆為3個標準差。X-MR管制圖是以資料的平均值

做為管制圖的中心線(CL)，即

，如此，X-MR管制圖的管制上限(UCL)及管制下限(LCL)，分別為：

以及

。 When process S32 uses the data collection module to collect test sets and collects r sets of stable test sets, use the classification prediction model to obtain the model evaluation result Weighted F1 score ( X _i , where i=1,2,…,r), and Select the number n of data collected to calculate the control limit. According to the model evaluation results, the Weighted F1 score belongs to measurement data, and the sampling size is a single observation value. Refer to the SPC to select the X-MR control chart, with each forward and backward moving data absolute The average of the value differences (i.e.

, _where R _i ₌ | _ _{_} _ _{_} _ _{_} The identification standard of three standard deviations is that the distance between the center line (CL) of the X-MR control chart and the upper control limit (UCL) and lower control limit (LCL) is 3 standard deviations. The X-MR control chart is based on the average of the data

As the center line (CL) of the control chart, that is

, so, the upper control limit (UCL) and lower control limit (LCL) of the X-MR control chart are respectively:

as well as

.

本實施例蒐集12組穩定的測試集，使用分類預測模型來取得12個模型評估結果Weighted F1 score(X _i)分別為X ₁=0.75、X ₂=0.71、X ₃=0.63、X ₄=0.79、X ₅=0.68、X ₆=0.75、X ₇=0.77、X ₈=0.68、X ₉=0.76、X ₁₀=0.78、X ₁₁=0.41、X ₁₂=0.71。本實施例採計10個模型評估結果Weighted F1 score(X _i)並計算R _i分別為：R ₂=|X ₂-X ₁|=|0.71-0.75|=0.04、R ₃=|X ₃-X ₂|=|0.63-0.71|=0.08、R ₄=|X ₄-X ₃|=|0.79-0.63|=0.16、R ₅=|X ₅-X ₄|=|0.68-0.79|=0.11、R ₆=|X ₆-X ₅|=|0.75-0.68|=0.07、R ₇=|X ₇-X ₆|=|0.77-0.75|=0.02、R ₈=|X ₈-X ₇|=|0.68-0.77|=0.09、R ₉=|X ₉-X ₈|=|0.76-0.68|=0.08、R ₁₀=|X ₁₀-X ₉|=|0.78-0.76|=0.02，故

，且

。如此，X-MR管制圖的管制上限(UCL)及管制下限(LCL)，分別為：

0.73+2.66×0.074=0.927以及

0.533。 This embodiment collects 12 sets of stable test sets and uses a classification prediction model to obtain 12 model evaluation results. Weighted F1 score ( X _i ) are X ₁ =0.75, X ₂ =0.71, X ₃ = _0.63 , and , X ₅ =0.68, X ₆ =0.75, X ₇ =0.77, X ₈ = 0.68, X ₉ = _0.76 , X 10 ₌ _0.78 , This _embodiment collects 10 model _evaluation results Weighted F1 score ( X _i ) and calculates R _i _as follows _: R ₂ =| X ₂ |=|0.63-0.71| = 0.08, R ₄ =| X ₄ - X ₃ _| =|0.79-0.63 _| = _0.16 , R 5 =| R ₆ ₌ | _ _{_} _ _{_} _ _{_} _ _{_} _ _{_} _ _{_} _ _{_} 0.68-0.77| = 0.09, R ₉ =| X ₉ - X ₈ |= | _0.76-0.68 | ₌ _0.08 , R 10 =|

,and

. In this way, the upper control limit (UCL) and lower control limit (LCL) of the X-MR control chart are respectively:

0.73+2.66×0.074=0.927 and

0.533.

後續視測試集蒐集情形，定期依據管制圖判讀規則修正管制界限並重新繪製管制圖，此時，模型健康度模組根據模型評估結果為望大品質特性，計算此一模型之模型評估指標I ₁為：

，其中，

Subsequently, depending on the test set collection situation, the control limits are regularly corrected according to the control chart interpretation rules and the control chart is redrawn. At this time, the model health module calculates the model evaluation index I ₁ of this model based on the model evaluation results to maximize the quality characteristics. for:

,in,

續前一實施例，由於

，因而f(X ₁₁)=X ₁₁=0.41，再者X ₁₂=0.71>0.533=LCL，因而f(X ₁₂)=LCL=0.533。 Continuing the previous embodiment, since

, so f ( X ₁₁ ₎ ₌ _ _ _{_} _

當模型評估結果為望小品質特性，計算此一模型之模型評估指標I ₁為：

，其中，

，

。 When the model evaluation result is a small quality characteristic, the model evaluation index I ₁ of this model is calculated as:

,in,

,

.

本實施例之Weighted F1 score是屬於望大目標，其中，σ=

，故模型評估指標

The Weighted F1 score in this embodiment is a large target, where σ=

, so the model evaluation index

另外，在此實施例中，模型評估指標需介於0、1之間，根據

及

均小於0，模型評估指標I ₁可設定

或

，也可以設定

或

、或其它轉換函數。 In addition, in this embodiment, the model evaluation index needs to be between 0 and 1, according to

and

are all less than 0, the model evaluation index I ₁ can be set

or

, you can also set

or

, or other conversion functions.

當計算系統效能指標時，可設置用以接收一基於大數據產生之預測模型，於系統中紀錄並計算有關此模型預測之平均反應時間或其它系統效能其中至少一者。 When calculating the system performance index, it can be configured to receive a prediction model generated based on big data, and record and calculate at least one of the average response time or other system performance predicted by the model in the system.

由於系統效能指標衡量模型預測之平均反應時間，參照系統效能指標並據以計算健康度更能合適地挑選最佳模型，亦即，系統會紀錄所有上線模型之平均反應時間(t _i，其中i=1,2,…,m)，選定採計時段為特定範圍或上線至今，依據平均反應時間之歷史數據，設定平均反應時間的規格上限USL _{平均反應時間}，並計算製程能力指標Cpk _{平均反應時間}，由於平均反應時間只設定規格上限USL _{平均反應時間}屬於單邊規格，故Cpk _{平均反應時間}=

，其中，

，

Since the system performance index measures the average response time predicted by the model, it is more appropriate to select the best model by referring to the system performance index and calculating the health accordingly. That is, the system will record the average response time of all online models ( t _i , where i =1,2,…,m), select the acquisition period as a specific range or go online to date, set the upper specification limit of the average reaction time USL _{average reaction time} based on the historical data of the average reaction time, and calculate the process capability index Cpk _{average reaction time} , since the average reaction time only sets the upper limit of the specification, the USL _{average reaction time} is a unilateral specification, so Cpk _{average reaction time} =

,in,

,

本實施例使用運用模型健康度模組計算系統效能指標，紀錄並計算有關此模型預測之平均反應時間10筆，分別為t ₁=13、t ₂=17、t ₃=16、t ₄=13、t ₅=21、t ₆=18、t ₇=17、t ₈=20、t ₉=14、t ₁₀=15。設定平均反應時間(單位：ms)的規格上限USL _{平均反應時間}=17，故

，S _{平均反應時間}=

，因此，Cpk _{平均反應時間}=

This embodiment uses the model health module to calculate system performance indicators, and records and calculates 10 average response times related to this model prediction, which are t ₁ =13, t ₂ =17, t ₃ =16, and t ₄ =13 respectively. , t ₅ =21, t ₆ =18, t ₇ =17, t ₈ =20, t ₉ =14, t ₁₀ =15. Set the upper specification limit of average response time (unit: ms) USL _{average response time} = 17, so

, S _{average reaction time} =

, therefore, Cpk _{average reaction time} =

訂定製程能力指標門閥值

後，系統效能指標I ₂設定為： Develop process capability indicator thresholds

Afterwards, the system performance index I ₂ is set to:

本實施例Cpk _反應時間=0.073，設定

，由於

，故系統效能指標I ₂

In this example, Cpk _{reaction time} = 0.073, set

,due to

, so the system performance index I ₂

另外，在此實施例中，系統效能指標需介於0、1之間，根據Cpk _{平均反應時間}介於-∞、∞之間，系統效能指標I ₂可設定為上述函數或其它轉換函數。 In addition, in this embodiment, the system performance index needs to be between 0 and 1. According to the Cpk _{average response time} , which is between -∞ and ∞, the system performance index I ₂ can be set to the above function or other conversion function.

當計算熱門評估指標時，可設置用以接收一基於大數據產生之預測模型，於系統中紀錄並計算有關此模型預測之呼叫次數或其它有關模型熱門程度其中至少一者。由於熱門評估指標衡量模型預測之呼叫次數，參照熱門評估指標並據以計算健康度更能合適地挑選最佳模型，系統會紀錄所有上線模型之呼叫次數，選定採計時段為上線至今，比較得到所有上線模型之呼叫次數之最大值，模型健康度模組計算此一模型之熱門評估指標I ₃，即為此模型預測之呼叫次數與上述最大值之比值，本實施例中，該模型預測之呼叫次數為1452次，且所有上線模型之呼叫次數之最大值為1875次，故熱門評估指標

，由此可見，熱門評估指標之值介於0、1之間。其中選定採計時段可為特定範圍。 When calculating the popularity evaluation index, it can be configured to receive a prediction model generated based on big data, and record and calculate at least one of the number of calls predicted by this model or the popularity of other related models in the system. Since the popular evaluation indicators measure the number of calls predicted by the model, referring to the popular evaluation indicators and calculating the health based on them can more appropriately select the best model. The system will record the number of calls of all online models, and the selected acquisition period is the online to date. Comparison results The maximum value of the number of calls of all online models. The model health module calculates the popular evaluation index I ₃ of this model, which is the ratio of the number of calls predicted by this model to the above maximum value. In this embodiment, the number of calls predicted by this model is The number of calls is 1452, and the maximum number of calls for all online models is 1875, so the popular evaluation indicator

, it can be seen that the value of the popular evaluation index is between 0 and 1. The selected collection period can be a specific range.

接著，如流程S36，模型健康度模組依權重加總產生健康度，根據上述實施例採計模型評估指標I ₁、系統效能指標I ₂、熱門評估指標I ₃共三指標，制定模型健康度權重w ₁、w ₂、w ₃共三個權重值，其中w ₁、w ₂、w ₃之總和為1，依據模型健康度權重以線性加總產生健康度，即依下列健康度公式：

，其中，健康度權重w ₁、w ₂、w ₃介於0、1之間，w ₁、w ₂、w ₃之總和為1，且I ₁、I ₂、I ₃亦介於0、1之間，推得模型健康度Health亦介於0、1之間。由上可知，本實施例之模型健康度權重組合(w ₁,w ₂,w ₃)=(0.51,0.19,0.30)，(I ₁,I ₂,I ₃)=(0.155,0.055,0.822)，則健康度

，顯然，健康度Health=0.34亦介於0、1之間。 Then, as in process S36, the model health module generates a health degree based on the weighted sum. According to the above embodiment, three indicators, namely the model evaluation index I ₁ , the system performance index I ₂ and the popular evaluation index I ₃ are collected to formulate the model health degree. There are three weight values w ₁ , w ₂ , and w ₃ , among which the sum of w ₁ , w ₂ , and w ₃ is 1. The health degree is generated by linear summation according to the model health weight, that is, according to the following health formula:

, where the health weights w ₁ , w ₂ , and w ₃ are between 0 and 1, the sum of w ₁ , w ₂ , and w ₃ is 1, and I ₁ , I ₂ , and I ₃ are also between 0 and 1. Between, the model health degree Health is also estimated to be between 0 and 1. It can be seen from the above that the model health weight combination of this embodiment ( w ₁ , w ₂ , w ₃ )=(0.51,0.19,0.30), ( I ₁ , I ₂ , I ₃ )=(0.155,0.055,0.822) , then the health degree

, obviously, the health degree Health =0.34 is also between 0 and 1.

此健康度Health=0.34係屬最新數據，其歷史數據依時間舊至新排序分別為：0.21,0.22,0.24,0.21,0.25,0.28,0.28,0.29,0.32,0.33,0.33,0.34。 This health degree Health =0.34 is the latest data. The historical data are sorted from oldest to newest: 0.21, 0.22, 0.24, 0.21, 0.25, 0.28, 0.28, 0.29, 0.32, 0.33, 0.33, 0.34.

總體而言，歷史模型健康度所對應之指標應可反映趨勢，使用者可由模型健康度的最新數據，立即掌握系統與模型之健康程度，並據此決策此一預測模型是否持續上線提供相關應用服務。由上述模型健康度數據分析，其數值具緩步向上趨勢，表示系統與模型之健康程度越來越好，此實施例最終決定持續使用此上線模型提供應用服務。 In general, the indicators corresponding to the historical model health should reflect trends. Users can immediately grasp the health of the system and model from the latest data of the model health, and based on this, decide whether this prediction model will continue to be online to provide relevant applications. service. According to the above model health data analysis, the value has a slowly upward trend, indicating that the health of the system and model is getting better and better. This embodiment finally decided to continue to use this online model to provide application services.

本實施例依據模型評估指標、系統效能指標、熱門評估指標共三指標，線性加總產生模型健康度，由於此模型健康度反映模型評估指標、系統效能指標、熱門評估指標等三個向度，相較於傳統僅參照模型準確度或其它單一模型評估指標，本發明參照系統效能及訓練模型等各項指標，因應複雜情況，更能有效且合適地挑選最佳模型，提供使用者最佳的應用服務。 This embodiment generates model health based on three indicators: model evaluation index, system performance index, and popular evaluation index. Linear summation generates model health. Since this model health reflects the three dimensions of model evaluation index, system performance index, and popular evaluation index, Compared with the traditional method that only refers to model accuracy or other single model evaluation indicators, the present invention refers to various indicators such as system performance and training model, and can more effectively and appropriately select the best model in response to complex situations, providing users with the best application services.

接著，如流程S37，模型監控模組依健康度監控模型，由健康度數值監控預測模型相對測試集之健康程度，使用者更可由模型健康度數值，掌握模型是否已偏離最新的測試集等監控狀況，並據以決定是否於平台中留用或重新訓練模型。同時，如流程S38，資料蒐集模組累積足夠的測試集後，會再觸發模型健康度模組及模型監控模組進行下一輪次之健康度計算及模型監控。 Then, as in process S37, the model monitoring module monitors the model according to the health degree, and uses the health degree value to monitor the health degree of the prediction model relative to the test set. The user can also use the model health degree value to understand whether the model has deviated from the latest test set, etc. status and decide whether to retain or retrain the model in the platform. At the same time, as in process S38, after the data collection module accumulates enough test sets, it will trigger the model health module and the model monitoring module to perform the next round of health calculation and model monitoring.

綜上所述，本發明透過多種系統及模型相關重要指標並依權重加總，制定一模型健康度計算公式，據以監控評估模型之健康程度，決定選用該預測模型或是否有重新訓練模型之需求，相較於傳統僅參照模型準確度或其它單一模型評估指標，本發明參照系統及模型等各項指標，更能選擇到最佳的預測模型。 To sum up, the present invention formulates a model health calculation formula by summing up a variety of system and model-related important indicators according to weights, based on which the health of the model is monitored and evaluated, and the decision is made to select the prediction model or whether to retrain the model. requirements. Compared with the traditional method that only refers to model accuracy or other single model evaluation indicators, the present invention can select the best prediction model by referring to various indicators such as the system and model.

本發明的系統、模組等包括微處理器及記憶體或是於包括包括微處理器及記憶體的機器中執行，而演算法、資料、程式等係儲存記憶體或晶片內，微處理器可從記憶體載入資料或演算法或程式進行資料分析或計算等處理，在此不予贅述。例如本發明之模型管理模組、資料蒐集模組、模型健康度模組以及模型監控模組包括有微處理器與記憶體等，且各模組內以此執行分析運算，因而本發明所述之系統其硬體細部結構亦可以相同實現方式。 The system, module, etc. of the present invention include a microprocessor and a memory or are executed in a machine including a microprocessor and a memory. Algorithms, data, programs, etc. are stored in the memory or chip. The microprocessor Data can be loaded from the memory or algorithms or programs can be used to perform data analysis or calculations, which will not be described in detail here. For example, the model management module, data collection module, model health module and model monitoring module of the present invention include a microprocessor and a memory, and each module uses this to perform analysis operations. Therefore, the present invention The hardware details of the system can also be implemented in the same way.

上述實施形態僅例示性說明本發明之原理及其功效，而非用於限制本發明。任何熟習此項技藝之人士均可在不違背本發明之精神及範疇下，對上述實施形態進行修飾與改變。因此，本發明之權利保護範圍，應如後述之申請專利範圍所列。 The above embodiments are only illustrative to illustrate the principles and effects of the present invention, but are not intended to limit the present invention. Anyone skilled in this art can modify and change the above embodiments without departing from the spirit and scope of the invention. Therefore, the scope of protection of the present invention is It should be as listed in the patent application scope mentioned below.

S21~S25:步驟 S21~S25: steps

Claims

A model monitoring system based on health, including: a model management module, used to generate a prediction model based on big data; a data collection module, used to collect big data test data to generate a test set; a model health module, To receive the prediction model from the model management module and the test set from the data collection module, by formulating the model health weight and calculating various indicators of the prediction model and system, the indicators are based on the model The health weights are summed to generate the health of the prediction model, in which the various indicators include model evaluation indicators, system performance indicators or popular evaluation indicators, and the model health module uses at least two of the above indicators to perform health assessment. Degree calculation; and the model monitoring module is connected to the model health module to receive the health of the prediction model, in order to monitor the prediction model and determine the status or reconstruction needs of the prediction model based on it.

For example, in the health-based model monitoring system described in Item 1 of the patent application, the model health weight is formulated based on external data or is formulated by itself, or a mixture of the above two methods is adopted.

A health-based model monitoring system as described in Item 1 of the patent application, wherein the model evaluation index is to evaluate the prediction model through the prediction model and the test set, using at least one method of control chart and accuracy to evaluate the prediction model. Produce model evaluation results.

A health-based model monitoring system as described in item 1 of the patent application, wherein the system performance index is recorded in the system and calculated as the average response time, throughput, stability or various system performance of the prediction model. At least one to produce system performance evaluation results.

A health-based model monitoring system as described in item 1 of the patent application, wherein the popularity evaluation index is one of the number of calls, number of users, service time or the popularity of the relevant model that is recorded and calculated in the system At least one, to produce a hot evaluation result.

A health-based model monitoring method is executed by a computer device with a microprocessor and a memory to perform model monitoring. The method includes: the computer device provides a prediction model generated based on big data; the computer device receives The test set generated by collecting big data test data; the computer equipment formulates the model health weight and calculates various indicators of the prediction model and system; the computer equipment sums the indicators according to the model health weight to generate the The health of the prediction model, wherein the indicators include model evaluation indicators, system performance indicators or popular evaluation indicators, and the computer equipment uses at least two of the above indicators to calculate the health of the prediction model; and the computer equipment Monitor the prediction model according to the health of the prediction model, and determine the status or reconstruction needs of the prediction model.

For example, in the health-based model monitoring method described in Item 6 of the patent application, the step of formulating the model health weight includes formulating the model health weight of each indicator based on external data, or formulating the model health weight by oneself. The model health weight of each indicator, or a mixture of the above two methods is used to formulate the model health weight of each indicator.

The health-based model monitoring method described in Item 6 of the patent application scope, wherein the model evaluation index is to evaluate the prediction model through the prediction model and the test set, using at least one method of control chart and accuracy to evaluate the prediction model. Produce model evaluation results.

The health-based model monitoring method described in item 6 of the patent application, wherein the system performance index is recorded in the system and calculated as the average response time, throughput, stability or various system performance of the prediction model At least one to produce system performance evaluation results.

For example, in the health-based model monitoring method described in Item 6 of the patent application, the popularity evaluation indicator is recorded in the system and calculated among the number of calls, number of users, service time of the prediction model, or the popularity of the relevant model. At least one, to produce a hot evaluation result.