WO2023090825A1

WO2023090825A1 - Ai model drift monitoring device and method

Info

Publication number: WO2023090825A1
Application number: PCT/KR2022/018011
Authority: WO
Inventors: 서창수; 김현진; 김영주; 배경숙
Original assignee: 에스케이 주식회사
Priority date: 2021-11-19
Filing date: 2022-11-15
Publication date: 2023-05-25
Also published as: KR20230073632A

Abstract

The present invention relates to an AI model drift monitoring device and method for monitoring data drift of a distributed AI model over time and, particularly, to an AI model drift monitoring device and method in which a prediction data set and a reference data set for evaluating performance of an AI model are provided, and data drift is detected by contrasting statistical differences between the two data sets. An AI model drift monitoring device according to an embodiment of the present invention comprises: a monitoring configuration unit for selecting and configuring a reference data set and a prediction data set, and configuring an execution period for monitoring of data drift of an AI model; a monitoring result analysis unit for analyzing the reference data set and the prediction data set according to the monitoring period, and comparing statistical differences between the two data sets, thereby determining data drift according to whether data drifts out of a preconfigured reference range; and a monitoring execution correction unit for updating the monitoring configuration and the execution period for data drift according to a result of the analysis of the monitoring result analysis unit.

Description

AI model drift monitoring device and method

The present invention is an AI model drift monitoring apparatus and method for monitoring data drift over time of a distributed AI model, and in particular, a reference data set and a prediction data set are configured to evaluate the performance of an AI model, and two data An AI model drift monitoring device and method for detecting data drift by preparing statistical differences between sets.

AI (Artificial Intelligence) models are developed using various languages, data, and libraries, and there are many restrictions in distributing and operating the developed AI models suitable for various use environments.

In addition, AI models subtly change inference results of machine learning depending on applied data, features, etc., and data drift occurs over time. Here, data drift means a case where the predicted data calculated by the AI model based on the input data is out of a preset range compared to the reference data, that is, out of a threshold value.

If the data drift of the AI model occurs, the AI model should be updated with an optimized AI model by re-executing machine learning using new training data.

Therefore, there is a need for research and development on a method for detecting data drift of an AI model and analyzing data drift more precisely at the feature level constituting data.

The present invention was created in view of the above circumstances, and an object of the present invention is to configure a reference data set and a prediction data set to detect performance degradation of an AI model, and compare the statistical difference between the two data sets to reduce data drift. It is to provide an AI model drift monitoring device and method for monitoring.

An AI model drift monitoring device according to an embodiment of the present invention selects and configures a reference data set and a prediction data set, and sets a monitoring setting unit for setting an execution cycle for monitoring data drift of an AI model, the reference data set and A monitoring result analysis unit that analyzes a prediction data set according to a monitoring cycle, compares the statistical difference between the two data sets, and determines data drift according to whether or not it deviate from a preset reference range, and according to the analysis result of the monitoring result analysis unit It includes a monitoring execution modification unit that updates monitoring settings and execution cycles for data drift.

In one embodiment, the monitoring result analyzer may determine data drift by applying a drift evaluation metric, which is an algorithm for calculating a difference in probability distributions of data constituting the reference data set and the prediction data set.

In an embodiment, the monitoring setting unit may include a target feature setting unit that selects target features to be analyzed from among features constituting data and sets a threshold value for each feature.

In an embodiment, the monitoring result analysis unit may include a feature analysis unit that monitors and analyzes a drift trend of the target features over time.

In one embodiment, the feature analyzer converts a categorical feature into a probability distribution such that the sum of all probabilities is 1 for normalization of target features of a plurality of data constituting the data set, and the numerical features are It can be analyzed by assuming a Gaussian distribution so that the sum of the probabilities is 1.

In an embodiment, the monitoring execution correction unit may include a threshold value correction unit for each feature that resets an appropriate threshold value for each feature according to an analysis result of the feature analysis unit.

In one embodiment, the monitoring setting unit may include an alarm setting unit configured to set a slack channel to notify corresponding information when data drift occurs as a result of analysis of the monitoring result analysis unit and predicted data deviate from a preset reference range. there is.

In one embodiment, the reference data set may be a learning data set composed of data used when generating an AI model, or a past data set composed of data input after serving the AI model.

An AI model drift monitoring method according to an embodiment of the present invention includes: (a) selecting and configuring a reference data set and a prediction data set, and setting an execution cycle for monitoring data drift of an AI model; (b) analyzing the reference data set and the prediction data set according to a monitoring period, comparing statistical differences between the two data sets, and determining data drift according to whether they deviate from a preset reference range; and (c) setting data drift monitoring and updating an execution cycle according to the determination result.

In an embodiment, step (a) may include selecting target features to be analyzed among features constituting data and setting a threshold value for each feature.

In one embodiment, step (b) may include monitoring and analyzing a drift trend of the target features over time.

In one embodiment, in step (b), categorical features are converted into probability distributions so that the sum of all probabilities is 1 for normalization of target features of a plurality of data constituting the data set, and numerical features may include assuming a Gaussian distribution such that the sum of all probabilities is 1.

In an embodiment, the step (c) may include resetting an appropriate threshold value for each feature according to the analysis result of the step (b).

In one embodiment, the step (a) includes setting a slack channel to notify the corresponding information when data drift occurs as the predicted data deviate from a preset reference range as a result of the analysis in the step (b). can do.

In one embodiment, the step (b) may include determining data drift by applying a drift evaluation metric, which is an algorithm for calculating a difference in probability distributions of data constituting the reference data set and the prediction data set. .

The AI model drift monitoring apparatus and method according to the present invention monitors data drift due to the degradation of the predictive performance of the deployed AI model, and compares the statistical difference between the two data sets after configuring a reference data set and a prediction data set. By visualizing it as a graph, data drift can be easily identified.

In addition, the AI model drift monitoring apparatus and method according to the present invention analyzes features that cause data drift by providing feature-level drift scores, monitors drift trends for each feature, and identifies factors affecting data drift. There are possible effects.

1 is a block diagram of an AI model drift monitoring device according to an embodiment of the present invention.

2 is a diagram showing the distribution of data constituting the reference data set and the prediction data set, and FIG. 3 is a graph showing the calculation result by applying KLD, a probability distribution difference calculation algorithm.

4 and 5 show example screens of an AI model drift monitoring list.

6 and 7 show exemplary screens for registering and correcting drift monitoring of an AI model.

8 and 9 show exemplary screens related to drift monitoring results of the AI model.

10 and 11 show exemplary screens related to drift information of an AI model.

12 is a flowchart of an AI model drift monitoring method according to an embodiment of the present invention.

Hereinafter, one embodiment of the present invention will be described in detail through exemplary drawings. In adding reference numerals to components of each drawing, it should be noted that the same components have the same numerals as much as possible even if they are displayed on different drawings. In addition, in describing the present invention, if it is determined that a detailed description of a related known configuration or function may obscure the gist of the present invention, the detailed description will be omitted.

Referring to FIG. 1 , the AI model drift monitoring device 10 of the present invention includes a monitoring setting unit 100, a monitoring result analysis unit 200, and a monitoring execution correction unit 300.

The monitoring setting unit 100 selects and configures a reference data set and a prediction data set, and sets an execution cycle for monitoring data drift.

Here, the reference "data" set may be a learning data set or a past data set. The training data set may be all or a subset of the training data used when creating the AI model, and the training data are refined for learning the AI model from measurement values sensed by sensors, for example. ) can be data.

The past data set may be all or a partial set of actual data input after the generated AI model is distributed and the AI model is served. It is referred to as a past data set because it is the data entered in the 'past' at the time of executing the monitoring analysis for the AI model.

The prediction data set may be all or a partial set of prediction data calculated by the AI model based on past data input after serving the AI model.

In one embodiment of the present invention, the monitoring setting unit 100 includes a target feature setting unit 110 that selects target features to be analyzed among features constituting data and sets a threshold value for each feature. can do.

In addition, the monitoring setting unit 100 sets up a slack channel to notify the corresponding information when data drift occurs as a result of analysis by the monitoring result analysis unit 200, as the predicted data deviate from a preset reference range. It may include an alarm setting unit 120 to set.

The monitoring result analysis unit 200 analyzes the reference data set and the prediction data set according to the monitoring period, compares the statistical difference between the two data sets, and determines whether data drift occurs according to whether the data drifts out of a preset reference range. .

When the reference data set is a learning data set, the monitoring result analysis unit 200 includes a learning data set that is all or a partial set of learning data used to generate the AI model, and actual data (past data) input after serving the AI model. , For example, a statistical difference is compared with a prediction data set that is all or a partial set of prediction data calculated from an AI model based on measurement values sensed through a sensor.

When the reference data set is a past data set, the monitoring result analysis unit 200 sets a past data set, which is all or a partial set of past data input to the AI model one cycle prior to the monitoring execution, as the reference data set. , Comparison of the prediction data set (past data set) with the prediction data set, which is all or a partial set of prediction data calculated from the AI model based on the past data input at the point in time after the reference data set (past data set) is set do.

The monitoring "result" analysis unit 200 determines "data" drift by applying "drift" evaluation "metric" which is an "algorithm" that calculates "probability distribution" difference of "data constituting" the "standard" data set and the "prediction" data set.

As an algorithm for calculating the probability distribution difference, commercially available Jensen Shannon Divergence (JSD) and/or Kullback Leibler Divergence (KLD) may be used, but is not limited thereto.

The probability distribution difference calculated through JSD or KLD can be graphed. KLD has a value from 0 to infinity, and JSD has a value between 0 and 1.

KLD is not commutative when calculating the difference in distribution between two data sets, whereas KLD(A,B)≠KLD(B,A), JSD is commutative, so JSD(A,B)= JSD(B,A), JSD is evaluated as a more stable calculation method.

In one embodiment of the present invention, the monitoring result analysis unit 200 may include a feature analysis unit 210 that monitors and analyzes drift trends of target features set by the monitoring setting unit 100 over time. there is.

The feature analysis unit 210 converts the categorical features into a probability distribution so that the sum of all probabilities is 1 for normalization of the target features of a plurality of data constituting the data set, and the numerical features are　 Total 　 Probability 　 Sum It can be analyzed by assuming that it is a Gaussian distribution so that this becomes 　1. The feature-analysis unit 210 may analyze the drift of the numerical features by sampling N pieces, for example, 100 pieces from the estimated Gaussian distribution.

A categorical feature means a feature whose feature value has a specific range, and a numerical feature means a feature whose feature value has a specific numerical value.

In addition, the monitoring result analysis unit 200 may include a graphic processing unit 220 that visualizes the drift trend of data or features and displays it as a graph. The graphic processing unit 220 may display data values or feature values over time based on the monitoring period as a graph.

In addition, the monitoring result analysis unit 200 may include a drift history inquiry unit 230 that provides drift history information recorded and stored for each monitoring period.

The monitoring execution correcting unit 300 sets monitoring for data drift and updates an execution cycle according to the analysis result of the monitoring result analysis unit 200 . The setting and execution cycle of monitoring can be adjusted in consideration of the usage environment to which the AI model is applied, the possible frequency of data drift, and the severity of damage caused by data drift.

In an embodiment of the present invention, the monitoring execution correction unit 300 may include a threshold value correction unit 310 for each feature that resets an appropriate threshold value for each feature according to the analysis result of the feature analysis unit 210. .

Since each feature constituting the data has a different influence on data drift, a drift trend may be monitored by setting a different threshold for each feature, and an adjustment operation may be performed to reset the threshold according to the monitoring result.

Referring to FIG. 2, A represents the distribution of data constituting the reference data set, and B represents the distribution of data constituting the prediction data set. The data of A and B may be displayed on the XY plane as shown in FIG. 2 based on the numerical values of the features constituting the data. The present invention compares the statistical difference between A and B to determine whether data drift has occurred.

Referring to FIG. 3, a result calculated by applying KLD to a reference data set and a prediction data set is shown. As a result of the calculation, C is a state in which data drift has not occurred because the result value appears within the preset threshold for data drift, and D is a state in which data drift has occurred because the result value has deviated from the threshold value am.

4 and 5 show example screens of an AI model drift monitoring list.

Referring to FIG. 4 , a data drift monitoring list of the deployed AI model is displayed. The monitoring list may include, for example, analysis period of the corresponding AI model, drift, model status, reference data, latest execution date and next execution date and time information. The monitoring registration button 102 may move to a drift registration screen to monitor drift of the AI model.

The AI model displayed in FIG. 4 represents an AI model registered in the drift monitoring list, and whether or not drift is indicated as N means that data drift has not occurred in the corresponding AI model.

Referring to FIG. 5, it indicates a case where the distributed AI model is not registered in the data drift monitoring list, and since it is not registered, a 'No data available' message may be displayed.

Referring to FIG. 6 , in the monitoring description item 104 , drift types, shapes, and special matters of the AI model may be written. The reference time item 106 sets a reference time for monitoring the AI model. If the reference data applied to the AI model is training data, the reference time, that is, the first execution date of monitoring, must be the time when one analysis cycle has passed from the service start time of the AI model so that monitoring can be executed at the same time interval.

The notification setting item 122 activates a slack channel to transmit a notice when a value exceeding a threshold value is detected as a result of monitoring execution of the AI model.

The analysis period item 108 sets the analysis period of AI model monitoring. For example, in the drawing, the left side shows the basic method of setting the analysis period, and the right side shows the Corn expression.

In the basic method, the analysis cycle of AI model monitoring can be set in units of days/weeks/months. The Corn expression can be composed of a string string composed of unit expressions divided into 6 input items, and second/minute/hour/day/month/day information can be subdivided and input into each input item. It can be separated by spaces. That is, the Corn expression can set the monitoring period by subdividing it into seconds/minutes/hours compared to the basic method.

Referring to FIG. 7 , a registration screen for monitoring AI model drift is displayed, and the type of data applied to machine learning of the AI model is selected from the reference data item 130 .

If the selected data is training data, the path is automatically set, and schema information contained in the path of the training data used when creating the AI model, for example, information such as target features and feature types of the training data is automatically called. and displayed on the screen.

For AI model drift monitoring, the reference time, that is, the first execution time of monitoring, must be set. The reference time of monitoring can be changed and reset.

The schema setting item 132 may be automatically called and set if the learning data of the AI model is registered, and if the learning data is not registered, the schema may be manually set.

In one embodiment, when moving to the AI model editing screen and registering the training data for automatic calling of the training data of the AI model, the target feature, feature type, etc. related to the training data are automatically displayed in the schema setting item 132. can be called

The schema setting item 132 may be divided into, for example, target feature, type, and threshold value items, and the user may select a specific target feature and manually set the type and threshold value. The data type may be, for example, numerical or categorical, but is not limited thereto.

When the user checks the threshold value collective setting item 134 and inputs a % value, the corresponding % value is collectively applied to the threshold value of the schema setting item 132 .

In an embodiment of the present invention, a specific % value may be set as a threshold value using values of target features of training data used to generate an AI model as reference values.

As a result of monitoring the AI model, when a result value exceeding a threshold value of a specific target feature is detected, a notice may be transmitted to the slack channel as described above.

Referring to FIG. 8 , the drift information summary item 202 may include monitoring ID, monitoring status, execution time, and drift information. The monitoring ID indicates the ID of the monitoring subject, and the monitoring status may indicate whether monitoring is activated.

Drift may be indicated by Y when the data drift of the AI model is detected, and N when not detected.

The top drift feature item 212 displays top features having a large drift magnitude and their values. In the drawing, three top features are configured to be displayed, but the user is not limited thereto, and various numbers of top features may be displayed in categories necessary for the AI model analysis.

The refresh button 204 updates AI model drift result information and displays it on the screen.

The drift trend item 222 may set a monitoring period, and a drift trend may be calculated as a graph in the set period.

The feature selection item 214 displays a list of features registered for drift monitoring, and when a specific feature is selected, a drift trend for the feature may be calculated as a graph.

Referring to FIG. 9 , a graph area 224 shows the drift trend for each feature selected through the feature selection item 214, and the change trend for each date/time/feature is displayed. When a specific viewpoint is clicked on the graph, information may be displayed while a table related to a feature at the corresponding viewpoint is exposed as a pop-up.

In the drift history item 232, drift sizes, threshold values, and drifting of all target features may be displayed.

The user may modify the existing drift monitoring list by clicking the drift monitoring correction item 206 .

10 and 11 show exemplary screens related to drift information of an AI model.

Referring to FIG. 10 , the drift description item 302 indicates the type of drift and the reference monitoring time.

The notification setting item 304 may be configured to display information of a slack channel to which a notification is to be transmitted when a value exceeding a threshold value is detected as a result of executing monitoring, and to enable modification of information of the slack channel.

The analysis period item 306 sets a drift analysis period, and may set the period in units of days/weeks/months.

The reference data item 308 indicates a location where learning data is stored, and it is impossible to modify the reference data for accuracy of drift monitoring, but it is possible to change the reference time of monitoring.

11 shows drift information of the AI model, and when drift monitoring registration of the AI model is executed in FIG. 7, a screen as shown in FIG. 11 may be displayed. When the user clicks the registration modification button 312, the screen is converted to the screen shown in FIG. 7, and target feature, type, and threshold information can be changed.

Referring to FIG. 12, according to the AI model drift monitoring method of the present invention, the AI model drift monitoring device 10 selects and configures a reference data set and a prediction data set, and sets an execution cycle for monitoring data drift of the AI model. setting (S100), analyzing the reference data set and prediction data set according to the monitoring cycle, comparing the statistical difference between the two data sets, and determining data drift according to whether or not it deviated from the preset reference range (S200), S200 According to the determination result of the step, data drift monitoring is set and an execution cycle is updated (S300).

In one embodiment of the present invention, the AI model drift monitoring apparatus 10 may select target features to be analyzed from among features constituting data and set a threshold value for each feature in step S100.

Each feature has a different degree of influence on data drift, and the AI model drift monitoring device 10 analyzes features that cause data drift based on feature-level drift scores, and analyzes features that cause data drift for each feature. By monitoring drift trends, factors affecting data drift can be identified.

In step S100, the AI model drift monitoring device 10 may set a slack channel to notify the information when data drift occurs because the predicted data deviate from a preset reference range as a result of the analysis in step S200. When a notice about data drift is delivered through a slack channel, follow-up actions can be taken to update the AI model, such as running machine learning on the AI model using new training data.

The AI model drift monitoring device 10 may monitor and analyze drift trends of target features over time in step S200, and furthermore, for normalization of target features of a plurality of data constituting a data set Categorical features are converted into probability distributions so that the sum of all probabilities is 1, and numeric features can be analyzed by assuming a Gaussian distribution so that the sum of all probabilities is 1.

The AI model drift monitoring apparatus 10 may reset an appropriate threshold value for each feature in step S300 according to the drift trend analysis result of the target features executed in step S200.

Here, a specific % value may be reset as a feature-by-feature threshold using values of target features of training data used to generate an AI model as reference values. Since each feature has a different sensitivity to data drift, the threshold value for each feature may be set differently.

The AI model drift monitoring device 10 may determine data drift by applying a drift evaluation metric, which is an algorithm for calculating a difference in probability distribution of data constituting the reference data set and the prediction data set, in step S300. As an algorithm for calculating the probability distribution difference, commercially available Jensen Shannon Divergence (JSD) and/or Kullback Leibler Divergence (KLD) can be used.

The embodiments described above may be implemented as hardware components, software components, and/or a combination of hardware components and software components. For example, the devices, methods, and components described in the embodiments may include, for example, a processor, a controller, an arithmetic logic unit (ALU), a digital signal processor, a microcomputer, a field programmable gate (FPGA). array), programmable logic units (PLUs), microprocessors, or any other device capable of executing and responding to instructions. A processing device may run an operating system (OS) and one or more software applications running on the operating system. A processing device may also access, store, manipulate, process, and generate data in response to execution of software. For convenience of understanding, there are cases in which one processing device is used, but those skilled in the art will understand that the processing device includes a plurality of processing elements and/or a plurality of types of processing elements. It can be seen that it can include. For example, a processing device may include a plurality of processors or a processor and a controller. Also, other processing configurations are possible, such as parallel processors.

Software may include a computer program, code, instructions, or a combination of one or more of the foregoing, which configures a processing device to operate as desired or processes independently or collectively. You can command the device. Software and/or data may be any tangible machine, component, physical device, virtual equipment, computer storage medium or device, intended to be interpreted by or provide instructions or data to a processing device. , or may be permanently or temporarily embodied in a transmitted signal wave. Software may be distributed on networked computer systems and stored or executed in a distributed manner. Software and data may be stored on one or more computer readable media.

The method according to the embodiment may be implemented in the form of program instructions that can be executed through various computer means and recorded on a computer readable medium. The computer readable medium may include program instructions, data files, data structures, etc. alone or in combination. Program commands recorded on the medium may be specially designed and configured for the embodiment, or may be known and usable to those skilled in computer software. Examples of computer-readable recording media include magnetic media such as hard disks, floppy disks and magnetic tapes, optical media such as CD-ROMs and DVDs, and magnetic media such as floptical disks. - includes hardware devices specially configured to store and execute program instructions, such as magneto-optical media, and ROM, RAM, flash memory, and the like. Examples of program instructions include high-level language codes that can be executed by a computer using an interpreter, as well as machine language codes such as those produced by a compiler. The hardware devices described above may be configured to operate as one or more software modules to perform the operations of the embodiments, and vice versa.

Although the present invention has been described in detail with reference to preferred embodiments, the present invention is not limited to the above embodiments, and the technical field to which the present invention belongs without departing from the gist of the present invention claimed in the following claims. Anyone skilled in the art will extend the technical spirit of the present invention to the extent that various variations or modifications are possible.

Claims

A monitoring setting unit that selects and configures a reference data set and a prediction data set and sets an execution cycle for monitoring data drift of the AI model;

a monitoring result analysis unit that analyzes the reference data set and the prediction data set according to a monitoring period, compares statistical differences between the two data sets, and determines data drift according to whether the data drifts out of a preset reference range; and

AI model drift monitoring device characterized in that it comprises a; monitoring execution correction unit for updating the setting and execution cycle of monitoring for data drift according to the analysis result of the monitoring result analysis unit.
According to claim 1,

The monitoring result analysis unit,

AI model drift monitoring device, characterized in that for determining data drift by applying a drift evaluation metric, which is an algorithm that calculates a difference in probability distribution of data constituting the reference data set and the prediction data set.
According to claim 1,

The monitoring setting unit,

An AI model drift monitoring device comprising: a target feature setting unit that selects target features to be analyzed among features constituting the data and sets a threshold value for each feature.
According to claim 3,

The monitoring result analysis unit,

An AI model drift monitoring device comprising: a feature analyzer that monitors and analyzes drift trends of the target features over time.
According to claim 4,

The feature analysis unit,

For normalization of target features of a plurality of data constituting the data set, categorical features are converted into a probability distribution so that the sum of all probabilities is 1, and numerical features are converted into a Gaussian distribution so that the sum of all probabilities is 1. An AI model drift monitoring device, characterized in that it is hypothesized and analyzed.
According to claim 4,

The monitoring execution correction unit,

An AI model drift monitoring device comprising: a feature-by-feature threshold value correction unit that resets an appropriate threshold value for each feature according to the analysis result of the feature analysis unit.
According to claim 1,

The monitoring setting unit,

AI model drift monitoring device comprising: an alarm setting unit for setting a slack channel to notify the corresponding information when data drift occurs as a result of the analysis of the monitoring result analysis unit and the predicted data deviate from a preset reference range; .
According to claim 1,

The reference data set is

AI model drift monitoring device, characterized in that the learning data set consisting of data used when generating the AI model, or the past data set consisting of data input after serving the AI model.
(a) selecting and configuring a reference data set and a prediction data set, and setting an execution cycle for monitoring data drift of the AI model;

(b) analyzing the reference data set and the prediction data set according to a monitoring period, comparing statistical differences between the two data sets, and determining data drift according to whether they deviate from a preset reference range; and

(c) setting monitoring for data drift and updating an execution cycle according to the determination result; AI model drift monitoring method characterized in that it comprises a.
According to claim 9,

In step (a),

An AI model drift monitoring method comprising the steps of selecting target features to be analyzed from among features constituting data and setting a threshold value for each feature.
According to claim 10,

In step (b),

and monitoring and analyzing drift trends of the target features over time.
According to claim 11,

In step (b),

For normalization of target features of a plurality of data constituting the data set, categorical features are converted into a probability distribution so that the sum of all probabilities is 1, and numerical features are converted into a Gaussian distribution so that the sum of all probabilities is 1. AI model drift monitoring method comprising the step of assuming.
According to claim 11,

In step (c),

The AI model drift monitoring method comprising the step of resetting an appropriate threshold value for each feature according to the analysis result of step (b).
According to claim 9,

In step (a),

As a result of the analysis in step (b), if data drift occurs because the predicted data deviate from a preset reference range, setting a slack channel to notify the corresponding information; AI model drift monitoring method comprising the.
According to claim 9,

In step (b),

AI model drift monitoring method comprising: determining data drift by applying a drift evaluation metric, which is an algorithm that calculates a difference in probability distribution of data constituting the reference data set and the prediction data set.
According to claim 9,

The reference data set is

AI model drift monitoring method, characterized in that the learning data set consisting of data used when generating the AI model, or the past data set consisting of data input after serving the AI model.