WO2019139189A1 - 스트리밍 데이터 처리시스템의 이상동작 분석 장치 및 그 방법 - Google Patents
스트리밍 데이터 처리시스템의 이상동작 분석 장치 및 그 방법 Download PDFInfo
- Publication number
- WO2019139189A1 WO2019139189A1 PCT/KR2018/000611 KR2018000611W WO2019139189A1 WO 2019139189 A1 WO2019139189 A1 WO 2019139189A1 KR 2018000611 W KR2018000611 W KR 2018000611W WO 2019139189 A1 WO2019139189 A1 WO 2019139189A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- abnormal
- value
- information
- output
- data processing
- Prior art date
Links
Images
Classifications
-
- F—MECHANICAL ENGINEERING; LIGHTING; HEATING; WEAPONS; BLASTING
- F04—POSITIVE - DISPLACEMENT MACHINES FOR LIQUIDS; PUMPS FOR LIQUIDS OR ELASTIC FLUIDS
- F04D—NON-POSITIVE-DISPLACEMENT PUMPS
- F04D29/00—Details, component parts, or accessories
- F04D29/70—Suction grids; Strainers; Dust separation; Cleaning
- F04D29/701—Suction grids; Strainers; Dust separation; Cleaning especially adapted for elastic fluid pumps
- F04D29/703—Suction grids; Strainers; Dust separation; Cleaning especially adapted for elastic fluid pumps specially for fans, e.g. fan guards
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B01—PHYSICAL OR CHEMICAL PROCESSES OR APPARATUS IN GENERAL
- B01D—SEPARATION
- B01D46/00—Filters or filtering processes specially modified for separating dispersed particles from gases or vapours
- B01D46/42—Auxiliary equipment or operation thereof
-
- F—MECHANICAL ENGINEERING; LIGHTING; HEATING; WEAPONS; BLASTING
- F04—POSITIVE - DISPLACEMENT MACHINES FOR LIQUIDS; PUMPS FOR LIQUIDS OR ELASTIC FLUIDS
- F04D—NON-POSITIVE-DISPLACEMENT PUMPS
- F04D25/00—Pumping installations or systems
- F04D25/02—Units comprising pumps and their driving means
- F04D25/08—Units comprising pumps and their driving means the working fluid being air, e.g. for ventilation
-
- G—PHYSICS
- G08—SIGNALLING
- G08B—SIGNALLING OR CALLING SYSTEMS; ORDER TELEGRAPHS; ALARM SYSTEMS
- G08B21/00—Alarms responsive to a single specified undesired or abnormal condition and not otherwise provided for
- G08B21/18—Status alarms
- G08B21/182—Level alarms, e.g. alarms responsive to variables exceeding a threshold
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/23—Processing of content or additional data; Elementary server operations; Server middleware
- H04N21/24—Monitoring of processes or resources, e.g. monitoring of server load, available bandwidth, upstream requests
- H04N21/2407—Monitoring of transmitted content, e.g. distribution time, number of downloads
Definitions
- the present invention relates to an apparatus and method for analyzing abnormal operation of a streaming data processing system, and more particularly, to a system and method for analyzing an abnormal operation of a streaming data processing system, and more particularly, The present invention relates to an apparatus and method for analyzing an output value of a monitoring apparatus to be calculated.
- the amount of video data to be transmitted and received on the Internet has been dramatically increasing. Users using moving image data have a characteristic of preferring to enjoy the moving image data through a streaming method rather than storing the moving image data permanently in a storage device , A system for collecting and processing streaming data must have a stable high-speed processing capability for streaming data.
- a streaming data processing system In order to maintain high-speed processing performance, a streaming data processing system generally operates with a monitoring device that monitors performance degradation or occurrence of a failure, and a system administrator uses a streaming data processing system Can be efficiently monitored.
- the monitoring device In order to determine whether the streaming data processing system is normally operating, the monitoring device needs a reference.
- the system administrator specifies and manages the reference value, it is necessary to efficiently cope with a change in streaming data throughput or a system failure situation And the burden on the system administrator is heavy.
- a monitoring device that actively adjusts a reference value for determining whether the streaming data processing system is operating normally according to the flow of time and system conditions, thereby minimizing unnecessary intervention of the system administrator .
- the streaming data processing system can actively adjust the reference value for determining whether the system is operating normally according to the flow of time and the state of the system,
- a technology for identifying a system as an active threshold that actively changes whether the system is operating normally has not been widely known. Therefore, it is natural that the system can be operated at predetermined time intervals based on output values output from the system There is no technology that tells how long it has been working normally as a representative value.
- the present invention has been made in view of the above problems, and it is an object of the present invention to provide a system and a method for processing streaming data by calculating a reference value for determining a performance degradation of a processing system for processing streaming data through a stored performance index, And it is an object of the present invention to provide an apparatus and method for accurately determining whether or not an operation is performed.
- an apparatus for detecting an abnormal operation of a system based on a result of comparing an output value periodically output by a system with an active threshold value varying with time, And generating an abnormal operation alarm when it is detected comprising: comparing the variable threshold value for a predetermined time range with the output value to determine whether the anomaly action occurred, the output value according to the viewpoint information, An abnormal data storage unit for storing a variable threshold value corresponding to an output value; A period information calculation unit for calculating an abnormal period information on anomaly duration that is continuously detected based on the viewpoint information; An abnormality intensity calculating unit for calculating a relative abnormality intensity and an absolute abnormality abnormality based on the output value and the variable threshold value; And a representative value calculating unit for calculating an abnormal representative value for the time range on the basis of the result of synthesizing the abnormal period information, the relative abnormal intensity, and the absolute abnormal intensity.
- a method for operating an abnormal operation of a system based on a result of comparing an output value periodically output by a system with an active threshold value varying with time comprising the steps of: comparing the variable threshold value for a predetermined time range with the output value to generate information on the time at which the anomaly action occurred, An abnormal data storage step of storing an output value according to information and a variable threshold value corresponding to the output value; A period information calculating step of calculating abnormal period information on anomaly duration detected by continuous abnormal operation based on the viewpoint information; An anomaly calculating step of calculating a relative anomaly intensity and an absolute anomaly intensity based on the output value and the variable threshold value; And a representative value calculating step of calculating an abnormal representative value for the time range on the basis of the result of synthesizing the abnormal period information, the relative abnormal intensity, and the absolute abnormal intensity.
- a second reference rate which is a threshold value compared with a processing result of a streaming data processing system for processing big streaming data, is actively calculated and applied It is possible to monitor the performance change state of the streaming data processing system while minimizing the intervention of the system administrator, thereby reducing the work load of the system administrator.
- the monitoring apparatus according to the present invention when applied to a streaming data processing system, it is possible to achieve a system monitoring effect equal to or greater than the previous one even if less manpower and time are input.
- FIG. 1 is a diagram schematically showing an overall configuration of a system according to the present invention
- FIG. 2 is a block diagram of an example of an abnormal motion analyzing apparatus according to the present invention.
- FIG. 3 is a diagram schematically illustrating a second reference speed calculated according to the present invention.
- FIG. 4 shows an example in which the second reference speed calculation method according to the present invention is implemented in pseudo-code.
- FIG. 5 is a flowchart illustrating an example of a method for monitoring a streaming data processing system according to the present invention.
- FIG. 6 is a flowchart illustrating an example of a method for analyzing an anomaly of a streaming data processing system according to the present invention.
- FIG. 7 is a diagram showing nodes and edges generated by the representative value calculating unit.
- an apparatus for detecting an abnormal operation of a system based on a result of comparing an output value periodically output by a system with an active threshold value varying with time, And generating an abnormal operation alarm when it is detected comprising: comparing the variable threshold value for a predetermined time range with the output value to determine whether the anomaly action occurred, the output value according to the viewpoint information, An abnormal data storage unit for storing a variable threshold value corresponding to an output value; A period information calculation unit for calculating an abnormal period information on anomaly duration that is continuously detected based on the viewpoint information; An abnormality intensity calculating unit for calculating a relative abnormality intensity and an absolute abnormality abnormality based on the output value and the variable threshold value; And a representative value calculating unit for calculating an abnormal representative value for the time range on the basis of the result of synthesizing the abnormal period information, the relative abnormal intensity, and the absolute abnormal intensity.
- the period information calculation unit may calculate the abnormal period information using a fast Fourier transform.
- the period information calculation unit may calculate the abnormal period information from the sampled data according to a weight set based on an extent to which the output value is out of a variable threshold value corresponding to the output value.
- the period information calculation unit may calculate the abnormal period information based on a result obtained by matching the sampled data to an exponential distribution.
- the abnormality intensity calculating section may calculate the relative abnormality intensity based on a numerical value obtained by integrating the difference between the output value and a variable threshold value corresponding to the output value.
- the abnormality intensity calculating unit may normalize the output value to a value between 0 and 1 based on a maximum output value and a minimum output value included in the set of output values, and calculate the absolute abnormal intensity And a second calculation unit.
- the apparatus comprising: a representative value transmitter for collectively transmitting the abnormal representative value and past abnormal representative values for a time range earlier than the time range to the user terminal; And a representative value adjuster that receives the adjustment value and changes the abnormal representative value based on the received adjustment value.
- a method for operating an abnormal operation of a system based on a result of comparing an output value periodically output by a system with an active threshold value varying with time comprising the steps of: comparing the variable threshold value for a predetermined time range with the output value to generate information on the time at which the anomaly action occurred, An abnormal data storage step of storing an output value according to information and a variable threshold value corresponding to the output value; A period information calculating step of calculating abnormal period information on anomaly duration detected by continuous abnormal operation based on the viewpoint information; An anomaly calculating step of calculating a relative anomaly intensity and an absolute anomaly intensity based on the output value and the variable threshold value; And a representative value calculating step of calculating an abnormal representative value for the time range on the basis of the result of synthesizing the abnormal period information, the relative abnormal intensity, and the absolute abnormal intensity.
- the period information calculation step may be characterized by calculating the abnormal period information using a fast Fourier transform.
- the period information calculating step may calculate the abnormal period information from the sampled data according to a weight set based on the degree of deviation of the output value from the variable threshold value corresponding to the output value .
- the period information calculating step may calculate the abnormal period information based on the result of fitting the sampled data to the exponential distribution.
- the abnormal-strength calculating step may calculate the relative abnormal-intensity based on a numerical value obtained by integrating the difference between the output value and a variable threshold value corresponding to the output value.
- the abnormality intensity calculating step may include normalizing the output value to a value between 0 and 1 based on a maximum output value and a minimum output value of the set of output values, and calculating the absolute abnormality intensity based on the normalized value .
- the representative value transmission step of collectively transmitting the abnormal representative value and the past abnormal representative value for a time range earlier than the time range to the user terminal collectively; And a representative value adjusting step of receiving the adjustment value corresponding to the transmitted abnormal representative value and the past abnormal representative value and changing the abnormal representative value based on the received adjustment value.
- One embodiment of the present invention can provide a computer readable recording medium storing a program for implementing the method according to the above method.
- FIG. 1 is a diagram schematically showing an overall configuration of a system according to the present invention
- the system according to the present invention includes a streaming data transmission system 110, a streaming data processing system 130, a monitoring apparatus 200, an abnormal operation analysis apparatus 300, and an administrator terminal 400
- the streaming data transmission system 110 is connected to the streaming data processing system 130 through the communication network 150.
- the abnormal operation analysis apparatus 300 and the administrator terminal 400 are connected through the communication network 150 It can be seen that various data are transmitted and received.
- the streaming data transmission system 110 stores streaming data.
- the streaming data transmission system 110 receives a request to transmit streaming data from the streaming data processing system 130
- the streaming data transmission system 110 transmits the streaming data to the streaming data processing system 130 via the communication network 150 And transmits the data.
- the streaming data refers to various kinds of video and audio data that can be transmitted by dividing only a part of the entire data, and can play a part of the streaming data on the side receiving a part of the streaming data.
- the streaming data processing system 130 sends a streaming data transmission request to the streaming data transmission system 110 and receives and processes the streaming data.
- the streaming data processing system 130 includes a storage device capable of semi-permanently storing streaming data received from the streaming data transmission system 110, a graphic control device that processes streaming data and outputs the streaming data through an output device, Devices, and various processors.
- the streaming data processing system 130 may record the history of processing the streaming data in a log by unit time or event.
- the monitoring apparatus 200 performs a function of monitoring whether the performance of the streaming data processing system 130 is degraded or an error code is generated.
- the monitoring device 200 may receive the data processing result directly from the streaming data processing system 130 or may analyze the log of the streaming data processing system 130 to determine whether the streaming data processing system 130 is experiencing a performance degradation or failure code .
- the monitoring apparatus 200 may visually output the detected result when the streaming data processing system 130 detects that the performance degradation or the failure code has occurred.
- the monitoring device 200 may be wired to the streaming data processing system 130 or physically or logically included in the streaming data processing system 130 to operate.
- the abnormal operation analyzing apparatus 300 detects an abnormality of the streaming data processing system 130 in the predetermined period based on the active threshold value determined by the monitoring apparatus 200, And transmits the calculated value to the administrator terminal 400 via the communication network 150.
- FIG. The abnormal motion analyzer 300 may also be connected to the streaming data processing system 130 in a wired manner or may be physically or logically included in the streaming data processing system 130, such as the monitoring device 200.
- the manager terminal 400 receives an abnormal representative value from the abnormal operation analyzer 300 via the communication network and transmits the adjusted representative value to the abnormal operation analyzer 300 again.
- the adjustment value will be described later with reference to FIG.
- the administrator terminal 400 is a terminal used by an administrator who manages the overall system according to the present invention.
- the administrator terminal 400 is depicted as a personal computer (PC), but a tablet personal computer or a smart phone And is not limited to a specific type of terminal.
- the streaming data transmission system 110, the streaming data processing system 130, the abnormal operation analysis apparatus 300 and the administrator terminal 400 transmit and receive various data through the communication network 150, A general telephone network, a data network, and a mobile communication network.
- FIG. 2 is a block diagram of an example of an abnormal motion analyzing apparatus according to the present invention.
- the monitoring apparatus 200 of the streaming data processing system 130 will be described together with the abnormal operation analysis apparatus 300 according to the present invention for convenience of explanation.
- the monitoring apparatus 200 and the abnormal operation analyzing apparatus 300 of FIG. 2 can be implemented as an independent apparatus, or may be implemented in a form that is physically or logically included in the streaming data processing system 130 .
- the abnormal motion analysis apparatus 300 may be physically or logically included in the monitoring apparatus 200.
- the monitoring apparatus 200 includes an output data storage unit 210, an under-speed information acquisition unit 230, a relation information calculation unit 250, a second reference speed calculation unit 270,
- the abnormal operation analyzing apparatus 300 includes an abnormal data storing unit 310, a period information calculating unit 320, an abnormal power calculating unit 330, a representative value calculating unit 340 ), A representative value transmission unit 350, and a representative value adjustment unit 360, and will be described with reference to Fig. 1 for convenience of explanation.
- the output data storage unit 210 stores the processing speed information of the data processed up to the first time point and the failure occurrence information by unit time.
- the first time point refers to the time zone nearest to the present time of the past time point when the streaming data processing system 130 processed the streaming data. For example, if the streaming data processing system 130 processes the streaming data from 2 pm to 3 pm, the first time may be at 3 pm.
- the processing rate information refers to information on the value of the rate at which the streaming data processing system 130 processes data.
- the processing speed information may be generated based on information recorded in units of time in the log of the streaming data processing system 130.
- the data processing speed of the streaming data processing system 130 depends on the amount of resources of the system when processing the data, the occurrence status of the failure, the capacity of the streaming data, the type of the streaming data (meaning data format, extension, etc.) ,
- the processing rate information is information in which various pieces of information including the various data processing rates are recorded by unit time. In general, slowing the data processing speed of a system is interpreted as a degraded state of the system.
- the failure occurrence information refers to information about various faults occurring in the process of processing data by the streaming data processing system 130.
- the fault occurrence information includes not only a state where a fault code preset in the streaming data processing system 130 is generated but also a state where a fault code that has not been set in advance is generated.
- the streaming data processing system 130 Even if the streaming data processing system 130 outputs a failure code, the data processing speed of the streaming data processing system 130 may not change at all. That is, the occurrence of a failure and a performance degradation in the streaming data processing system 130 are a separate problem. For example, when the data processing result of the streaming data processing system 130 violates a SLO (Service Level Objective) preset in the streaming data processing system 130, the streaming data processing system 130 can output a failure code have.
- the SLO refers to a constant reference required for the processing result of the streaming data processing system 130, and the reference may be a reference not related to the data processing speed of the streaming data processing system 130.
- the failure occurrence information may include reference failure information at a time when a predetermined reference failure code has occurred and non-reference failure information at a time other than a time point when the reference failure code occurs.
- the point of time other than the reference fault code and the point of time when the fault code does not occur are included at a time point other than the point when the reference fault code occurs.
- the failure occurrence information is information output from the streaming data processing system 130 for each unit time.
- Reference failure information and non-reference failure information since the failure corresponding to the failure code preset in the streaming data processing system 130 does not occur every moment, Reference failure information and non-reference failure information.
- the reference failure information may be represented by 1 and the non-reference failure information may be represented by 0.
- the output data storage unit 210 may previously store a comparison code that can distinguish whether the fault code output by the system is a reference fault code.
- the output data storage unit 210 analyzes the data processing result output from the streaming data processing system 130 to determine whether there is a reference failure code that matches the comparison code and stores reference failure information or non- And a processor for controlling the processor. For example, if the failure code F6009 is output from the streaming data processing system 130 at time t1 and F6009 is included in the comparison code, the failure occurrence information at the time t1 stored in the output data storage unit 210 is the reference failure Information.
- Equation (1) shows an example of failure occurrence information from 0 to t seconds.
- the failure occurrence information from 0 second to t seconds as shown in Equation (1) can be expressed by a vector of t + 1 dimension.
- the processing speed information and the trouble occurrence information are all recorded by unit time.
- the unit time is not limited to a specific time, it may be a unit of 10 seconds, 1 minute, 10 minutes, 1 hour, or the like. Since both the processing speed information and the failure occurrence information are information recorded by unit time, the output data storage unit 210 can map and store the processing speed information and the failure occurrence information. For example, assuming that the unit time is one minute and streaming data is continuously processed from 12:00 pm on January 1, the output data storage unit 210 stores processing speed information and error Occurrence information can be stored in association with each other. When the output data storage unit 210 is searched at 12: 1 PM on Jan. 1, the output data storage unit 210 can output the processing speed information and the failure occurrence information corresponding thereto at once.
- the under-speed information acquiring unit 230 recognizes the under-speed information at the point in time when the data is processed slower than the first reference speed in the process speed information stored in the output data storage 210.
- the first reference speed is a speed value stored in advance in the under-speed information acquisition unit 230 or stored in the output data storage unit 210 and then transmitted to the under-speed information acquisition unit 230.
- the first reference speed is compared with the data processing speed per unit time included in the processing speed information. If the data processing speed of the processing speed information at the specific time is slower than the first reference speed, the under-speed information acquiring unit 230 grasps the processing speed information as under-speed information. If the data processing speed of the processing speed information is slower than the first reference speed, the processing speed information includes information on the processing time, processed data, and processing speed of the streaming data processing system 130, All information included in the speed information becomes information included in the under speed information. That is, the first reference speed is a reference speed for the streaming data processing system 130, which is the rate at which the streaming data processing system 130 processes data, to be.
- the first reference speed may be set differently for each unit time.
- the processing speed of different streaming data It is possible to minimize the possibility that the monitoring device 200 is uniformly determined to be degraded.
- the processing speed of the data in the streaming data processing system 130 may vary over time, and even if the processing speed slows at a certain point in time, it is merely a temporary delay caused by factors other than performance degradation According to the monitoring apparatus 200 according to the present invention, the system administrator can determine whether the system is degraded in consideration of such factors.
- the under-speed information acquiring unit 230 is capable of distinguishing the under-speed information from the speed achievement information other than the under-speed information through the above-described process, and is capable of distinguishing between the under- The under-speed information can be calculated for each unit time.
- Equation (3) shows an example of the speed underflow information from 0 to t seconds.
- y (1) which is the under-velocity information at the 1-second time point when the data is processed slower than the first reference speed
- Y (2) which is the under-speed information of the speed of the vehicle is zero.
- 0 and 1 may be reversely applied to the under-speed information acquiring unit 230 according to a preset value.
- the relationship information calculation unit 250 calculates correlation information between the processing speed information and the failure occurrence information.
- Correlation information refers to information expressed as a formula indicating how the processing speed information and the fault occurrence information have a correlation. Since the streaming data processing system 130 has a large number of results of processing data up to the first time and the failure occurrence information has a binary variable characteristic (whether a preset failure code has occurred or not) It is possible to approximate the correlation with the failure occurrence information by a single equation.
- Fail code output Fail code output Below the first reference speed x y Achieving the first reference speed (exceeding) u v
- Table 1 is a table showing the ratio of data processing results output from the streaming data processing system 130. Since the data processing result output by the streaming data processing system 130 is any one of the four types shown in Table 1, when x, y, u, and v are added together,
- Equations 4 to 7 can be defined using x, y, u, v in Table 1.
- the positive predictive value (PPV) in Equation (4) is obtained by processing data from the streaming data processing system 130 output from the streaming data processing system 130 from 0 to t at a rate lower than the first reference rate, Quot; information "
- cPPV complementary PPV represents information of a time point at which data was processed slower than the first reference speed among data processing results output from the streaming data processing system 130 from 0 to t, .
- Negative Predictive Value is a value obtained by multiplying NPV (Negative Predictive Value) between 0 and t in the data processing result output by the streaming data processing system 130, Means the ratio of information.
- cNPV complementary NPV processes data faster than the first reference speed among the data processing results output from the streaming data processing system 130 from 0 to t, Ratio.
- the system administrator sets the PPV and the NPV so that the calculated PPV and NPV are the specific values desired by the system administrator. And NPV are set in advance.
- exemplary PPV and NPV set by the system administrator will be referred to as? (Alpha) and? (Beta), respectively.
- the PPV and NPV calculated from the data processing results of the streaming data processing system 130 converge to alpha and beta, respectively, over time. That is, when the PPV and the NPV of the data processing result output from the streaming data processing system 130 are alpha and beta, respectively, the system administrator operates the streaming data processing system 130 for a sufficiently long period of time and remains in a stable state for a long time
- the system can be regarded as a monotonic system.
- the present invention proposes a method of calculating a second reference speed that can be compared with a data processing speed of the streaming data processing system 130 at a point in time after monoculture is established as described above. .
- the relation information calculation unit 250 may normalize the processing speed information into a relational expression based on the failure occurrence information and the first reference speed.
- Equation (8) represents the processing speed information as a linear relation based on the failure occurrence information and the first reference speed.
- Y (t) is a vector for processing speed information
- ⁇ (t) is a vector for a first reference speed
- a (t) is a vector for failure occurrence information
- the processing speed information more specifically means the speed lowering information.
- the under-speed information is information indicating whether the data processing speed of the streaming data processing system 130, which is different for each unit time, is faster or slower than the first reference speed, and is information that can be represented by a binary variable.
- the under-speed information is included in the processing speed information, or, in accordance with the embodiment, after the relation information calculating unit 250 receives the processing speed information, Whether or not the information can be obtained.
- the fault occurrence information is information that can be represented by a binary variable as described above.
- the first reference velocity may have a fixed constant value, but it has been explained through the equation (2) that it may have a different value every time according to the embodiment.
- Equation (8) since the response variable Y (t) is a binary variable, the following equation (8) can be obtained by applying the regression modeling to the line form as shown in Equation (8) You can not use the same general line format.
- Equation (8) can not be used is that Y (t) can exceed 1 when x (t) in Equation (8) has a sufficiently large value.
- the second reason that Equation (8) can not be used is because Y (t) is only 0 or 1, so that the precondition for using the linear regression equation is not satisfied.
- the prerequisite for using the linear regression equation is that the residual, which is the test of the significance of the regression coefficient, must follow a normal distribution.
- a (t) is also a binary variable having a value of 0 or 1
- ⁇ (t) is a variable that can have various values
- the linear regression equation such as Equation 8 can not be used in the present invention, Logistic Regression) should be used.
- Equation (9) represents a vector p (x) for using a regression equation of Logit Transformation.
- p (x) is defined as a probability that the data processing speed of the streaming data processing system 130 is slower than the first reference speed when the vector x is defined as the first reference speed and the failure occurrence information.
- Equation (10) is a regression equation calculated by applying the logit transformation to Equation (9). Since each term is defined as a vector of the same dimension, the regression coefficients b1, b2, and c can be obtained by using the maximum likelihood estimation (Maximal Likelihood Estimation).
- the maximum likelihood estimation method is an effective estimation method when the nonlinear statistical model is analyzed based on the binary data when the number of samples is sufficiently secured. Since the maximum likelihood estimation method is already known, The calculation process is omitted.
- Table 2 shows Y (t), A (t), and ⁇ (t) from 1 second to 15 seconds. Assuming that vectors having the same dimensions as those in Table 2 are grouped into vectors having the same dimension, and the summated vectors are substituted into Equation 10, when applying the maximum likelihood estimation method, b1 is 0.8445968, b2 is -0.01878321, and c is 1.507379. Since Table 2 is an exemplary value, b1, b2, and c may be different if Y (t), A (t), and ⁇ (t)
- the second reference velocity calculator 270 receives the ratio information between the under-velocity information and the failure occurrence information, and based on the ratio information and the correlation information calculated by the relation information calculator 250, And calculates the second reference speed at the second time point.
- the ratio information between the under-speed information and the failure occurrence information means the ratio information between the value representing the under-speed information and the value representing the failure occurrence information.
- ⁇ can be ratio information between under-speed information and failure occurrence information.
- alpha is set so that the data processing speed of the streaming data processing system 130 is slower than the first reference speed and the predetermined fault code is output from among the data processing results output from the streaming data processing system 130 when the time t has sufficiently elapsed Is defined as the information on the ratio when With the above logic, ⁇ , PPV, cPPV, NPV, cNPV at a specific point in time can also be ratio information between the under-speed information and the failure occurrence information.
- the second reference speed calculator 270 may receive the rate information of the under-speed information and the fault occurrence information from the system manager or may use a value stored in advance in the output data storage unit 210.
- the correlation information received from the relationship information calculation unit 250 indicates information indicating the relationship between the processing speed information and the failure occurrence information, not only the equation (10), but also the regression coefficients b1, b2, .
- the second reference velocity calculating section 270 calculates the second reference velocity on the basis of the regression coefficients b1, b2, c, which are one example of correlation information, and? And? .
- Equation 10 is expressed with respect to the vector p (x).
- a (t + 1) is a binary variable, which is 0 or 1. That is, a (t + 1) becomes 1 when a preset fault code is output, and a (t + 1) becomes 0 unless a preset fault code is output.
- Equation (13) is a probability at a time t + 1 at which a preset failure code is output
- Equation (14) is a probability at a time t + 1 at which a preset failure code is not output.
- Equation (15) shows the results obtained by summarizing equations (13) and (14). That is, the second reference rate, which is compared with the data processing rate of the streaming data processing system 130 at the time t + 1, can be calculated if the values of?,?, And regression coefficients b1, b2, c are all included. Since the calculation of the second reference velocity in the manner of deriving the equation (15) from the equations (11) to (14) is an example of the method of calculating the second reference velocity, the second reference velocity is calculated based on the ratio information and the correlation information It can be included in the scope of the present invention without using the same formulas as the equations (11) to (15).
- FIG. 3 is a diagram schematically illustrating a second reference speed calculated according to the present invention.
- FIG. 3 represents the first reference speed, the under-velocity information, and the failure occurrence information according to Table 2 as vectors and substitutes the ratio information (?
- the second reference speed at the time point is shown in a graph form.
- Equation (15) the values of b1, b2, and c in Equation (15) are assumed to be b1, b2, and c according to Table 2, and ⁇ is 0.95 and ⁇ is 0.9.
- b1 0.8445968
- b2 is -0.01878321
- c 1.507379.
- the reference speed at the point in time after t + 2 can be calculated by repeating the above-described method.
- FIG. 4 shows an example in which the second reference speed calculation method according to the present invention is implemented in pseudo-code.
- ⁇ and ⁇ are set to 0.95 and 0.9, respectively, and the first reference velocity at the time t is 0 is also assigned a predetermined value.
- ⁇ (t) denotes the data processing rate of the streaming data processing system 130, and other variables are the same as those used in the above description.
- the second reference velocity calculator 270 repeats the calculation until the PPV and the NPV of the present time have sufficiently elapsed and has monoculture, it can be seen that the second reference speed for the time point t + 1 is calculated.
- 'isViolated' is assumed to be a function that receives SLO value as an input and returns a value of 0 or 1.
- the SLO value is an example of failure occurrence information, I have explained.
- the last expression means (15).
- the second reference velocity at time t + 1 calculated through equation (15) becomes the first reference velocity when calculating the second reference velocity at time t + 2.
- the alarm output unit 290 outputs a performance abnormality alarm if the data processing speed at the second time point of the streaming data processing system 130 is slower than the second reference speed.
- the second time point refers to a time point after the first time point, which means a time point after the streaming data processing system 130 shows a monotone after a sufficient time has elapsed. That is, immediately after the unit time of one unit elapses after the first time point, the second time point can be obtained. For example, if the stream data processing system 130 stores information of data processed up to 60 seconds in the output data storage unit 210 and the unit time is 1 second, the 61-second time point may be the second time point .
- the performance abnormality alarm is information that the monitoring apparatus 200 according to the present invention notifies the system manager directly that the performance degradation has occurred in the streaming data processing system 130.
- the system administrator checks the performance abnormality alert, The streaming data processing system 130 can be checked as necessary.
- the abnormal motion analyzing apparatus 300 will be described.
- the data processing situation of the streaming data processing system Can be used as a synonym for an active threshold value.
- variable threshold value may be a value proportional to the second reference velocity, though not equal to the second reference velocity.
- the variable threshold may be 60 or may be 40, which is the predetermined reference value 100 minus the second reference speed. This numerical characteristic is a part that can be arbitrarily changed for the sake of brevity of the algorithm for processing data in the abnormal motion analyzer 300.
- the manager may set an abnormal operation to occur when the output value output from the streaming data processing system exceeds a variable threshold value, or an abnormal operation occurs when the output value output from the streaming data processing system falls below a variable threshold value
- the variable threshold value may be defined as a value equal to the second reference speed or a value obtained by subtracting the second reference speed from the preset value each time.
- the anomaly data storage unit 310 compares a variable threshold value for a predetermined time range with an output value output by the streaming data processing system, and stores the anomaly action time information, the output value according to the view information, And stores a variable threshold value.
- the output value output by the streaming data processing system is the output value output by the streaming data processing system in proportion to the speed at which the streaming data processing system processes the streaming data itself or its speed
- the abnormal operation is a streaming data processing system Which means that the processing speed of data is lower than a certain level. More specifically, when the output value output by the streaming data processing system is lower (or higher) than the variable threshold value, the streaming data processing system can be discriminated by the monitoring device 200 that it is in abnormal operation.
- the predetermined time range refers to a specific period in the past when the streaming data processing system was processing the streaming data.
- the abnormal data storage unit 310 stores the output value, the viewpoint information of the output value, and the variable threshold (second reference rate) output from the stream data processing system from the output data storage unit 210 and the second reference speed calculation unit 270, respectively Transfer it to database. Since the data stored in the abnormal data storage unit 310 are sorted and stored in chronological order, when one abnormal data storage unit 310 is searched for at one time, the output value of the streaming data processing system at that point in time, Lt; / RTI > can be retrieved in a set.
- the streaming data processing system outputs the output value every second, the predetermined time range is from 2:30 pm to 3:00 pm, and the streaming data processing system continues to operate abnormally for a predetermined time range, The output value and the variable threshold value become 1800 sets within the time range.
- the period information calculator 320 calculates information on anomaly durations in which the abnormal operation is continuously detected based on the viewpoint information of the output values.
- the information on the abnormal period will be abbreviated as the abnormal period information.
- the abnormal period means time information for a period in which the abnormal operation of the streaming data processing system is continuously detected.
- the continuous period means that not only the abnormal operation is detected in two consecutive time points but also the streaming data And a case where the output value of the processing system continuously exceeds (or is below) the variable threshold value. For example, even if all of the output values output by the streaming data processing system in three consecutive times exceed the variable threshold value, or if the preset continuous value is five times or the output data of the streaming data processing system exceeds the variable threshold value twice, If the outputted temporal interval exceeds the preset interval value, the information output from the streaming data processing system is not regarded as information on the abnormal period.
- the abnormal period information can be understood as information on how long the streaming data processing system has been behaving abnormally, and the period information calculating unit 320 can calculate the abnormal period information based on the variable threshold corresponding to the output value and the output value, (Consecutive value, interval value) for calculating the time period.
- the period information calculator 320 may calculate the abnormal period information using Fast Fourier Transform (FFT).
- FFT Fast Fourier Transform
- the period information calculator 320 receives the output value of the streaming data processing system, the viewpoint information of the output value, and the variable threshold value corresponding to each output value from the abnormal data storage unit 310, And outputs the abnormal period information.
- the time period information calculation unit 320 arranges the time point information of the output values received from the error data storage unit 310 in chronological order and associates the output value and the variable threshold value for each time point information so that the output value and the variable threshold value are also sorted in chronological order.
- the period information calculating unit 320 calculates the degree of the output value from the variable threshold value (RDR), and determines a weight for each viewpoint information in proportion to the degree. That is, a higher weight value is determined as the output value deviates from the variable threshold value, and a lower weight value is determined if the output value does not greatly differ from the variable threshold value.
- RDR variable threshold value
- the period information calculator 320 applies a weight determined differently for each output value to each output value received from the aberration data storage 310, sets a set of output values to which the weight is applied as one data set, Apply the transformation.
- the weighted output values that make up the data set may be referred to as sampled data.
- the fast Fourier transform is applied to the sampled data, so that the sampled data is divided into at least one frequency unit (component) by the entire data set, As shown in Fig.
- the period information calculator 320 can determine that the output value of the streaming data processing system is the sum of a plurality of abnormal signals (waves) if different peak values exist. That is, if the result of the fast Fourier transform is two or more frequency components showing different peak values, it means that the cause of the abnormal operation of the streaming data processing system is at least two or more.
- the period information calculator 320 recognizes that the cause of the abnormal operation of the streaming data processing system is at least one through the above process and adapts the output values to which the weight constituting the data set is applied to the exponential distribution.
- Equation (16) and Equation (17) show an example of a mathematical expression for an exponential distribution for adapting weighted output values constituting a data set.
- Equation (16) is a probability density function
- Equation (17) is a cumulative distribution function, and is calculated by integrating a probability density function.
- lambda is defined as a parameter that is calculated when the data set is fitted to the exponential distribution, and the fit is determined using the maximum likelihood estimation method described in Equation (10).
- the maximum likelihood estimation method is an effective estimation method when the nonlinear statistical model is analyzed based on the binary data when the number of samples is sufficiently secured. Since the maximum likelihood estimation method is a widely known method, the data sampled through the maximum likelihood estimation method To the exponential distribution will be omitted. According to the above procedure, 1 / ⁇ is the average of the data set, 1 divided by the seven squares of ⁇ is the variance of the data set.
- the new ideal data (a set of output values to which weights are applied) is input after the data set consisting of the output values to which the weighted values are applied is fit to the exponential distribution, Which corresponds to which part of the cumulative distribution function that is obtained. That is, as the duration of the abnormal time of the inputted abnormal data becomes longer, the abnormal time of the abnormal data observed before tends to be higher than that of the abnormal data of the presently observed abnormal data.
- the period information calculation unit 320 calculates a high score corresponding to the probability value, and the calculated score Period information.
- the information is calculated from the period information calculation unit 320 as information represented by numbers.
- the period information calculation unit 320 may calculate the abnormal period information based on the number of times the streaming data processing system per unit fixed time period has performed an abnormal operation.
- the unit fixed time is a constant set in advance in the period information calculation unit 320, and can be changed by an administrator. For example, if the unit fixed time is 1 hour, the number of abnormal operations during the first period is 5, and the number of abnormal operations during the second period is 10, the abnormal period information of the second period, It becomes twice the information of the abnormal period of one period.
- both of the first period and the second period are the same time difference (1 hour) as the unit fixed time, and the method of calculating the abnormal period information based on the number of abnormal operations in each period is limited to the above- I never do that.
- the period information calculation unit 320 may correspond to at least one processor or may include at least one processor for performing the above-described operation processing. Accordingly, the period information calculation unit 320 can be operated in a form included in other hardware devices such as a microprocessor or a general-purpose computer system.
- the anomaly strength calculation unit 330 calculates the relative anomaly intensity and the absolute error intensity based on the output value and the variable threshold value.
- the anomaly calculating unit 330 calculates the first relative value by integrating the difference between the output value of the first period and the variable threshold and measuring the degree of integration. Then, the anomaly-strength calculation unit 330 calculates the second relative value by integrating the difference between the output value of the second period and the variable threshold value and measuring the degree. Finally, the anomaly calculating unit 330 compares the first relative value and the second relative value, and calculates a score proportional to the comparison result.
- the first relative value and the second relative value are the relative abnormal intensities of the first period and the second period, respectively, and the score finally output by the anomaly calculating unit 330 is the relative abnormal intensity of the first period and the second period It is proportional to the result of comparing the relative error intensity.
- the anomaly calculating unit 330 includes a score dictionary referred to calculate a score according to a result of comparing the relative abnormal intensities of the first period and the second period, The results of comparing the relative abnormalities of the second period and the corresponding scores are matched at 1: 1.
- the streaming data processing system according to the present invention has a variable threshold that is a value essential for discriminating an abnormal operation according to the flow of time and the data processing state of the system, Therefore, it is necessary to quantify the intensity of the abnormal operation of the streaming data processing system through a method different from the existing method.
- the absolute anomaly amplitude does not pay attention to the relative anomality between the output value of the streaming data processing system and the variable threshold value but also how far the output value of the streaming data processing system itself is based on the entire output value range Is defined as a numerical value.
- the anomaly strength calculation unit 330 normalizes the output value of the streaming data processing system to a value between 0 and 1 based on the maximum output value and the minimum output value, and calculates the absolute error intensity based on the normalized value. More specifically, the anomaly calculating unit 330 calculates the absolute abnormal intensities in the first period and the second period, which are the same as the relative abnormal intensities with respect to the absolute anomaly, and then, based on the comparison result of the two calculated values .
- the following is an example in which the abnormal-strength calculating unit 330 calculates the absolute abnormal-strength.
- Equation (18) represents an example of a mathematical expression for calculating the absolute error intensity.
- (X) is the minimum value in the set of output values
- max (x) is the maximum value in the set of output values
- xi is the absolute error intensity at a specific point in time
- xi is the output value of the streaming data processing system at a specific point in time
- the absolute error intensity always has a value between 0 and 1.
- the anomaly-strength calculation unit 330 can determine the maximum value and the minimum value from the set of output values of the streaming data processing system, and calculate the absolute error intensity based on the maximum value and the minimum value. For example, assuming that the output values of the streaming data processing system include four output values of 58, 109, 60, 120, and so on, the anomaly calculator 330 detects the maximum value 120 and the minimum value 58, By applying the equation (18), 0, 0.823, 0.032, and 1 are respectively calculated, and the absolute error intensity representing the first period is calculated on the basis of 0.823 and 0.032 excluding 0 and 1.
- the average value of 0.823 and 0.032 can be the absolute abnormal intensity in the first period.
- the abnormality intensity calculating unit 330 receives the set of output values of the second period, calculates the absolute abnormal intensity for the second period through the above process, and then calculates the absolute abnormal intensity of the first period and the second period Score the results of the comparison.
- the abnormal period information, the relative abnormal intensity, and the absolute abnormal intensity calculated by the period information calculating unit 320 and the abnormal intensity calculating unit 330 are transmitted to the representative value calculating unit 340.
- the relative error intensity and the absolute error intensity can be transmitted to the representative value calculation section 340 based on the score calculated based on the relative error intensity and the score calculated based on the absolute error intensity have.
- the representative value calculating unit 340 calculates an abnormal representative value for a predetermined time range on the basis of the result of synthesizing the abnormal period information, the relative abnormal intensity, and the absolute abnormal intensity.
- the ideal representative value is a numerical value representative of the entire predetermined time range, and is defined as a numerical value that shows how the streaming data processing system operates in a predetermined time range.
- the representative value calculating unit 340 may calculate the abnormal representative value by summing or averaging the results calculated by the period information calculating unit 320 and the abnormality intensity calculating unit 330 and may calculate the abnormal representative value Is not limited to a specific method. Therefore, if there is a method for calculating different representative values including all other configurations of the present invention, the method does not depart from the scope of the present invention.
- the representative value calculating unit 340 calculates the abnormal representative value representing the first period, and the predetermined time range thereafter does not overlap with the first period, 1 period, the representative value calculating unit 340 calculates an abnormal representative value representative of the second period.
- the manager can easily compare the abnormality representative value of the first period and the abnormality representative value of the second period so that the abnormality occurrence of the streaming data processing system can be grasped on a period basis rather than every time.
- a conventional technique of scoring abnormal operation characteristics for each module not only requires a large amount of data to be collected and computed, It is difficult to understand intuitively.
- a plurality of modules constituting a streaming data processing system are not operated independently but are influenced while operating in association with each other, and that the streaming data processing system is variable It is possible to calculate a representative value representative of a predetermined time range (first period or second period) by simultaneously taking into consideration the characteristics of detecting the abnormal operation with reference to the threshold value, The transition of the frequency of occurrence of abnormal operation of the system is made by period It is possible to identify.
- the representative value calculating unit 340 May calculate an abnormal representative value for the streaming data processing system by considering the anomaly representative value of each module and the association relation of each module.
- the streaming data processing system does not have a single output value associated with the data processing speed, but rather a streaming data processing system is composed of a plurality of modules, each of which outputs an output value related to the streaming data processing speed
- the present invention is characterized in that an abnormal representative value of the entire system reflecting the relation of each module is calculated.
- Module number Abnormal period information Relative Abnormal Strength Absolute anomaly Abnormal representative value per module One a1 a2 a3 A 2 b1 b2 b3 B 3 c1 c2 c3 C
- Table 3 is a table for explaining a process of calculating an abnormal representative value of the streaming data processing system in this optional embodiment.
- the streaming data processing system is composed of a total of three modules, and the abnormal period information, relative error intensity, absolute error intensity, and abnormal representative value for each module are as shown in Table 3.
- the representative value calculating unit 340 regards the abnormal representative values of the three modules as nodes and establishes a relational expression by considering the association between the modules as an edge, The representative value is calculated.
- the representative value calculation unit 340 grasps an abnormal representative value of each module and maps one node to each abnormal representative value. Subsequently, the representative value calculating unit 340 calculates the edge value between each module on the basis of the abnormal period information, the relative abnormal intensity, and the absolute abnormal intensity of each module.
- the representative value calculating unit 340 sets the abnormal period information, the relative error intensity, and the absolute error intensity of each module as a vector component representing each module, and calculates a Pearson correlation coefficient Expression can be used.
- the Pearson correlation coefficient equation is a method for deriving a correlation value between two vectors based on the components of two vectors of the same dimension, and is a widely known method, and a detailed description thereof will be omitted.
- FIG. 7 is a diagram showing nodes and edges generated by the representative value calculating unit.
- nodes A to C are connected via edges rAB to rCA.
- a to C are abnormal representative values of each node (module), and rAB to rCA are values calculated by the representative value calculating unit 340, and have a size between -1 and 1 according to the Pearson's equation .
- the representative value calculating unit 340 calculates the relationship between the respective modules of the streaming data processing system through the nodes and the edges, An abnormal representative value of the entire system can be calculated.
- Equations (19) and (20) represent equations for the representative value calculation unit 340 to calculate an abnormal representative value.
- NN is the number of nodes
- NE is the number of edges
- N is the number of nodes
- N is the number of nodes
- NA represents the number of edges (nodes) between nodes (modules) in which an abnormal operation has occurred (Number of Anomalous Edges).
- N2 denotes the average of node values
- NT denotes the number of total edges for each node.
- the representative value calculating unit 340 can calculate the abnormal representative value of the streaming data processing system including the modules 1 to 3 through Equation (21) have.
- Equation (21) is an example of a mathematical expression that the representative value calculation unit 340 uses to calculate an abnormal representative value of the streaming data processing system.
- Ntotal denotes an abnormal representative value of a streaming data processing system including at least two modules.
- a plurality of modules constituting the streaming data processing system are not operated independently but are influenced by each other in association with each other And a representative value in which the streaming data processing system simultaneously takes into consideration the characteristics of detecting abnormal operation with reference to the variable threshold value.
- the administrator can grasp the trend of the abnormal operation occurrence frequency of the system at a glance only by confirming the representative values calculated by the predetermined time range.
- the anomaly analysis apparatus 300 may further include a representative value transmission unit 350 and a representative value adjustment unit 360.
- the representative value transmission unit 350 transmits to the user terminal a past abnormality representative value calculated by the representative value calculation unit 340 and a past abnormality representative value for a time range earlier than the preset time range.
- the user terminal is considered to mean the administrator terminal 400 of FIG.
- the representative value adjustment unit 360 receives the adjustment value corresponding to the abnormal representative value and the past abnormality representative value transmitted by the representative value transmission unit 350 and changes the abnormal representative value based on the adjustment value.
- the abnormal operation analyzing apparatus 300 further includes a representative value transmitting unit 350 and a representative value adjusting unit 360.
- the abnormal operation analyzing apparatus 300 receives an adjustment value from the administrator terminal 400, You can change the value. That is, when the manager observes through the manager terminal of the abnormal representative value, the abnormal operation is detected and the high abnormal representative value is calculated (false positive) even though the abnormal operation is not detected in the streaming data processing system,
- the terminal 400 may generate an adjustment value capable of arbitrarily lowering the abnormal representative value and transmit the adjustment value to the representative value adjustment unit 360.
- the administrator may cause the administrator terminal 400 to generate an adjustment value for lowering the abnormal representative value and transmit the adjustment value to the representative value adjustment unit 360.
- the abnormality representative value at a specific point in time is adjusted in a manner that reflects administrator feedback, so that the operation of the abnormal operation analysis apparatus 300 is smoothly performed after the adjusted point in time , It becomes possible to monitor the streaming data processing system more efficiently for a long time and to analyze the abnormal operation with less cost.
- FIG. 5 is a flowchart illustrating an example of a method for monitoring a streaming data processing system according to the present invention.
- the method according to FIG. 5 can be implemented by the monitoring apparatus 200 monitoring the streaming data processing system according to FIG. 2, and therefore will be described with reference to FIG. 2, and a duplicate description of FIG. 2 will be omitted .
- the under-speed information acquiring unit 230 refers to the output data storage 210 and recognizes the under-speed information at the time when the data is processed slower than the first reference speed in the process speed information (S510).
- the output data storage unit 210 stores the processing speed information of the data processed at the first time point and the failure occurrence information by unit time.
- the first reference speed may be set differently for each unit time.
- the failure occurrence information stored in the output data storage unit 210 in step S510 may include reference failure information at a time when a predetermined reference failure code has occurred and non-reference failure information at a time when only a failure code other than the reference failure code has occurred .
- the relationship information calculation unit 250 calculates correlation information between the processing speed information and the failure occurrence information (S530).
- the relation information calculation unit 250 may calculate correlation information using a maximum likelihood method.
- the relationship information calculation unit 250 may calculate correlation information using Logit Transformation.
- the second reference speed calculating unit 270 receives the ratio information of the under-velocity information and the failure occurrence information, calculates the second reference speed at the second point in time after the first point on the basis of the correlation information and the ratio information (S550).
- the rate information received by the second reference rate calculator 270 may be information on the ratio between the under-speed information and the reference failure information.
- the alarm output unit 290 compares the data processing speed of the streaming data processing system 130 at the second time with the second reference speed (S570). If the data processing speed of the streaming data processing system 130 at the second time point is slower than the second reference speed, the alarm output unit 290 outputs a performance abnormality alarm (S590). In step S590, the performance abnormality alarm output by the alarm output unit 290 is information that directly informs the system administrator that a performance degradation has occurred in the streaming data processing system 130, The reporting streaming data processing system 130 may be checked.
- a second reference rate which is a threshold value compared with a processing result of a streaming data processing system for processing big streaming data, is actively calculated and applied It is possible to monitor the performance change state of the streaming data processing system while minimizing the intervention of the system administrator, thereby reducing the work load of the system administrator.
- the monitoring apparatus according to the present invention when applied to a streaming data processing system, it is possible to achieve a system monitoring effect equal to or greater than the previous one even if less manpower and time are input.
- FIG. 6 is a flowchart illustrating an example of a method for analyzing an anomaly of a streaming data processing system according to the present invention.
- the method according to FIG. 6 can be implemented by the abnormal operation analyzing apparatus 300 for analyzing abnormal operation of the streaming data processing system according to FIG. 2, and will be described with reference to FIG. 2, Description will be omitted.
- the abnormal data storage unit 310 selects and stores the system output value and the variable threshold value in a predetermined time range (S610).
- the period information calculation unit 320 calculates information on an abnormal period in which an abnormal operation is detected (S630).
- the anomaly calculating unit 330 calculates the relative anomaly and the absolute anomaly based on the output value and the variable threshold (S650).
- the representative value calculating unit 340 calculates an abnormal representative value representative of a predetermined time range based on the information on the abnormal period, the relative abnormal intensity, and the absolute abnormal intensity (S670).
- the abnormal operation analyzing apparatus 300 implementing the method according to the present embodiment may further include the representative value transmitting unit 350 and the representative value adjusting unit 360, Therefore, it is omitted.
- the embodiments of the present invention described above can be embodied in the form of a computer program that can be executed on various components on a computer, and the computer program can be recorded on a computer-readable medium.
- the medium may be a magnetic medium such as a hard disk, a floppy disk and a magnetic tape, an optical recording medium such as CD-ROM and DVD, a magneto-optical medium such as a floptical disk, , A RAM, a flash memory, and the like, which are specifically configured to store and execute program instructions.
- the computer program may be designed and configured specifically for the present invention or may be known and used by those skilled in the computer software field.
- Examples of computer programs may include machine language code such as those produced by a compiler, as well as high-level language code that may be executed by a computer using an interpreter or the like.
- connections or connecting members of the lines between the components shown in the figures are illustrative of functional connections and / or physical or circuit connections, which may be replaced or additionally provided by a variety of functional connections, physical Connection, or circuit connections. Also, unless explicitly mentioned, such as " essential ", " importantly ", etc., it may not be a necessary component for application of the present invention.
- the present invention can be applied to a big streaming data processing system that processes a large amount of streaming data.
Landscapes
- Engineering & Computer Science (AREA)
- Mechanical Engineering (AREA)
- General Engineering & Computer Science (AREA)
- Business, Economics & Management (AREA)
- Emergency Management (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Chemical & Material Sciences (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Debugging And Monitoring (AREA)
Abstract
Description
장애코드 출력 | 장애코드 불출력 | |
제1기준속도 미달 | x | y |
제1기준속도 달성(초과) | u | v |
Time | A(t) | Γ(t) | Y(t) |
1 | 0 | 58 | 0 |
2 | 1 | 109 | 1 |
3 | 1 | 126 | 0 |
4 | 0 | 60 | 1 |
5 | 1 | 120 | 1 |
6 | 0 | 69 | 0 |
7 | 1 | 147 | 1 |
8 | 0 | 85 | 1 |
9 | 0 | 58 | 0 |
10 | 1 | 116 | 0 |
11 | 1 | 131 | 0 |
12 | 1 | 104 | 1 |
13 | 1 | 149 | 0 |
14 | 0 | 60 | 1 |
15 | 0 | 64 | 1 |
모듈번호 | 이상기간정보 | 상대적이상강도 | 절대적이상강도 | 모듈별이상대표값 |
1 | a1 | a2 | a3 | A |
2 | b1 | b2 | b3 | B |
3 | c1 | c2 | c3 | C |
Claims (15)
- 시스템이 주기적으로 출력하는 출력값과 시간에 따라 변화하는 가변임계치(active threshold value)를 비교한 결과를 기초로 시스템의 이상동작이 파악되면 이상동작알람을 발생시키는 스트리밍 데이터 처리 시스템에 있어서,미리 정해진 시간범위에 대한 상기 가변임계치와 상기 출력값을 비교하여, 이상동작(anomaly action)이 발생한 시점정보, 상기 시점정보에 따른 출력값, 상기 출력값에 대응되는 가변임계치를 저장하는 이상데이터저장부;상기 시점정보를 기초로 하여, 이상동작이 연속되어 감지된 이상기간(anomaly duration)에 대한 이상기간정보를 산출하는 기간정보산출부;상기 출력값 및 상기 가변임계치를 기초로 하여, 상대적이상강도 및 절대적이상강도를 산출하는 이상강도산출부; 및상기 이상기간정보, 상기 상대적이상강도 및 상기 절대적이상강도를 종합한 결과를 기초로 하여 상기 시간범위에 대한 이상대표값을 산출하는 대표값산출부를 포함하는 스트리밍 데이터 처리 시스템의 이상동작 분석 장치.
- 제1항에 있어서,상기 기간정보산출부는,고속푸리에변환을 이용하여 상기 이상기간정보를 산출하는 것을 특징으로 하는 스트리밍 데이터 처리 시스템의 이상동작 분석 장치.
- 제1항에 있어서,상기 기간정보산출부는,상기 출력값이 상기 출력값과 대응되는 가변임계치를 벗어난 정도를 기초로 설정되는 가중치에 따라 샘플링된 데이터로부터 상기 이상기간정보를 산출하는 것을 특징으로 하는 스트리밍 데이터 처리 시스템의 이상동작 분석 장치.
- 제3항에 있어서,상기 기간정보산출부는,상기 샘플링된 데이터를 지수분포에 적합시킨 결과를 기초로 하여, 상기 이상기간정보를 산출하는 것을 특징으로 하는 스트리밍 데이터 처리 시스템의 이상동작 분석 장치.
- 제1항에 있어서,상기 이상강도산출부는,상기 출력값과 상기 출력값에 대응되는 가변임계치와의 차이를 적분하여 산출된 수치를 기초로 하여 상기 상대적이상강도를 산출하는 것을 특징으로 하는 스트리밍 데이터 처리 시스템의 이상동작 분석 장치.
- 제1항에 있어서,상기 이상강도산출부는,상기 출력값들의 집합에 포함된 최대출력값 및 최소출력값을 기초로 상기 출력값을 0에서 1사이의 값으로 정규화하고,상기 정규화한 값을 기초로 상기 절대적이상강도를 산출하는 것을 특징으로 하는 스트리밍 데이터 처리 시스템의 이상동작 분석 장치.
- 제1항에 있어서,상기 이상대표값 및 상기 시간범위보다 더 이전 시간범위에 대한 과거이상대표값을 일괄적으로 사용자단말에 송신하는 대표값송신부 및상기 송신된 이상대표값 및 과거이상대표값에 대응하는 조정값을 수신하고, 상기 수신된 조정값을 기초로 상기 이상대표값을 변경하는 대표값조정부를 포함하는 것을 특징으로 하는 스트리밍 데이터 처리 시스템의 이상동작 분석 장치.
- 시스템이 주기적으로 출력하는 출력값과 시간에 따라 변화하는 가변임계치(active threshold value)를 비교한 결과를 기초로 시스템의 이상동작이 파악되면 이상동작알람을 발생시키는 스트리밍 데이터 처리 시스템의 이상동작 분석 방법에 있어서,미리 정해진 시간범위에 대한 상기 가변임계치와 상기 출력값을 비교하여, 이상동작(anomaly action)이 발생한 시점정보, 상기 시점정보에 따른 출력값, 상기 출력값에 대응되는 가변임계치를 저장하는 이상데이터저장단계;상기 시점정보를 기초로 하여, 이상동작이 연속되어 감지된 이상기간(anomaly duration)에 대한 이상기간정보를 산출하는 기간정보산출단계;상기 출력값 및 상기 가변임계치를 기초로 하여, 상대적이상강도 및 절대적이상강도를 산출하는 이상강도산출단계; 및상기 이상기간정보, 상기 상대적이상강도 및 상기 절대적이상강도를 종합한 결과를 기초로 하여 상기 시간범위에 대한 이상대표값을 산출하는 대표값산출단계를 포함하는 스트리밍 데이터 처리 시스템의 이상동작 분석 방법.
- 제8항에 있어서,상기 기간정보산출단계는,고속푸리에변환을 이용하여 상기 이상기간정보를 산출하는 것을 특징으로 하는 스트리밍 데이터 처리 시스템의 이상동작 분석 방법.
- 제8항에 있어서,상기 기간정보산출단계는,상기 출력값이 상기 출력값과 대응되는 가변임계치를 벗어난 정도를 기초로 설정되는 가중치에 따라 샘플링된 데이터로부터 상기 이상기간정보를 산출하는 것을 특징으로 하는 스트리밍 데이터 처리 시스템의 이상동작 분석 방법.
- 제10항에 있어서,상기 기간정보산출단계는,상기 샘플링된 데이터를 지수분포에 적합시킨 결과를 기초로 하여, 상기 이상기간정보를 산출하는 것을 특징으로 하는 스트리밍 데이터 처리 시스템의 이상동작 분석 방법.
- 제8항에 있어서,상기 이상강도산출단계는,상기 출력값과 상기 출력값에 대응되는 가변임계치와의 차이를 적분하여 산출된 수치를 기초로 하여 상기 상대적이상강도를 산출하는 것을 특징으로 하는 스트리밍 데이터 처리 시스템의 이상동작 분석 방법.
- 제8항에 있어서,상기 이상강도산출단계는,상기 출력값들의 집합의 최대출력값 및 최소출력값을 기초로 상기 출력값을 0에서 1사이의 값으로 정규화하고,상기 정규화한 값을 기초로 상기 절대적이상강도를 산출하는 것을 특징으로 하는 스트리밍 데이터 처리 시스템의 이상동작 분석 방법.
- 제8항에 있어서,상기 이상대표값 및 상기 시간범위보다 더 이전 시간범위에 대한 과거이상대표값을 일괄적으로 사용자단말에 송신하는 대표값송신단계; 및상기 송신된 이상대표값 및 과거이상대표값에 대응하는 조정값을 수신하고, 상기 수신된 조정값을 기초로 상기 이상대표값을 변경하는 대표값조정단계를 포함하는 것을 특징으로 하는 스트리밍 데이터 처리 시스템의 이상동작 분석 방법.
- 제8항 내지 제14항 중 어느 한 항에 따른 방법을 구현시키기 위한 프로그램을 저장하고 있는 컴퓨터 판독가능한 기록매체.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR10-2018-0003841 | 2018-01-11 | ||
KR1020180003841A KR101869490B1 (ko) | 2018-01-11 | 2018-01-11 | 스트리밍 데이터 처리시스템의 이상동작 분석 장치 및 그 방법 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2019139189A1 true WO2019139189A1 (ko) | 2019-07-18 |
Family
ID=62806588
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/KR2018/000611 WO2019139189A1 (ko) | 2018-01-11 | 2018-01-12 | 스트리밍 데이터 처리시스템의 이상동작 분석 장치 및 그 방법 |
Country Status (2)
Country | Link |
---|---|
KR (1) | KR101869490B1 (ko) |
WO (1) | WO2019139189A1 (ko) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114265359A (zh) * | 2021-12-15 | 2022-04-01 | 昆船智能技术股份有限公司 | 一种输送设备运行时间异常的智能检测系统及方法 |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR102232203B1 (ko) | 2019-08-12 | 2021-03-25 | 국방과학연구소 | 반능동 레이저 탐지 시스템에서 적응형 임계값 설정방법 및 그 시스템 |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2011108439A1 (ja) * | 2010-03-03 | 2011-09-09 | 三菱電機株式会社 | 通信制御装置および通信制御方法 |
KR101108020B1 (ko) * | 2010-03-08 | 2012-01-25 | 한국과학기술원 | 이동단말기에 구비된 가속도 센서를 이용한 위치값 인식 업데이트 주기 조절 장치 및 방법과 그 방법을 수행하는 명령어를 포함하는 컴퓨터 판독가능 기록매체 |
KR101218927B1 (ko) * | 2011-12-28 | 2013-01-21 | 주식회사 엘지유플러스 | 웹 서비스의 사용자 체감 성능 모니터링 방법과 이를 위한 프로그램이 기록된 기록매체 및 컴퓨팅 장치 |
WO2016075924A1 (ja) * | 2014-11-10 | 2016-05-19 | 日本電気株式会社 | 情報処理システム及び遅延計測方法 |
KR20160069444A (ko) * | 2014-12-08 | 2016-06-16 | 엔트릭스 주식회사 | 클라우드 스트리밍 서비스를 위한 서비스 품질 모니터링 시스템 및 방법, 그리고 컴퓨터 프로그램이 기록된 기록매체 |
-
2018
- 2018-01-11 KR KR1020180003841A patent/KR101869490B1/ko active IP Right Grant
- 2018-01-12 WO PCT/KR2018/000611 patent/WO2019139189A1/ko active Application Filing
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2011108439A1 (ja) * | 2010-03-03 | 2011-09-09 | 三菱電機株式会社 | 通信制御装置および通信制御方法 |
KR101108020B1 (ko) * | 2010-03-08 | 2012-01-25 | 한국과학기술원 | 이동단말기에 구비된 가속도 센서를 이용한 위치값 인식 업데이트 주기 조절 장치 및 방법과 그 방법을 수행하는 명령어를 포함하는 컴퓨터 판독가능 기록매체 |
KR101218927B1 (ko) * | 2011-12-28 | 2013-01-21 | 주식회사 엘지유플러스 | 웹 서비스의 사용자 체감 성능 모니터링 방법과 이를 위한 프로그램이 기록된 기록매체 및 컴퓨팅 장치 |
WO2016075924A1 (ja) * | 2014-11-10 | 2016-05-19 | 日本電気株式会社 | 情報処理システム及び遅延計測方法 |
KR20160069444A (ko) * | 2014-12-08 | 2016-06-16 | 엔트릭스 주식회사 | 클라우드 스트리밍 서비스를 위한 서비스 품질 모니터링 시스템 및 방법, 그리고 컴퓨터 프로그램이 기록된 기록매체 |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114265359A (zh) * | 2021-12-15 | 2022-04-01 | 昆船智能技术股份有限公司 | 一种输送设备运行时间异常的智能检测系统及方法 |
CN114265359B (zh) * | 2021-12-15 | 2023-08-25 | 昆船智能技术股份有限公司 | 一种输送设备运行时间异常的智能检测系统及方法 |
Also Published As
Publication number | Publication date |
---|---|
KR101869490B1 (ko) | 2018-06-21 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2020242275A1 (en) | Root cause analysis and automation using machine learning | |
WO2020231193A1 (en) | Beam management method, apparatus, electronic device and computer readable storage medium | |
WO2021215720A1 (en) | Apparatus and method for identifying network anomalies in cellular network | |
WO2022114871A1 (ko) | 배터리 진단 장치, 배터리 진단 방법, 배터리 팩 및 자동차 | |
WO2022071727A1 (en) | Method for sharing spectrum resources, apparatus, electronic device and storage medium | |
WO2020048047A1 (zh) | 系统故障的预警方法、装置、设备及存储介质 | |
WO2021071189A1 (ko) | 중전기기 건전성 분석 플랫폼 및 이를 이용하는 분석 방법 | |
WO2018205544A1 (zh) | 软件项目管理方法、装置、终端及计算机存储介质 | |
WO2021012481A1 (zh) | 系统性能监控方法、装置、设备及存储介质 | |
WO2020101108A1 (ko) | 인공지능 모델 플랫폼 및 인공지능 모델 플랫폼 운영 방법 | |
WO2019139189A1 (ko) | 스트리밍 데이터 처리시스템의 이상동작 분석 장치 및 그 방법 | |
WO2016006977A1 (en) | Apparatus, server, system and method for energy measuring | |
WO2021006405A1 (ko) | 인공지능 서버 | |
WO2020036297A1 (en) | Electronic apparatus and controlling method thereof | |
WO2017206883A1 (zh) | 一种应用处理方法、装置、存储介质及电子设备 | |
WO2023153821A1 (en) | Method of compressing neural network model and electronic apparatus for performing the same | |
WO2009145449A2 (ko) | 노이지 음성 신호의 처리 방법과 이를 위한 장치 및 컴퓨터 판독 가능한 기록매체 | |
WO2018164304A1 (ko) | 잡음 환경의 통화 품질을 개선하는 방법 및 장치 | |
WO2022065623A1 (en) | Method of implementing self-organizing network for plurality of access network devices and electronic device for performing the same | |
WO2020168606A1 (zh) | 广告视频优化方法、装置、设备及计算机可读存储介质 | |
WO2021132798A1 (en) | Method and apparatus for data anonymization | |
WO2010050734A2 (ko) | Fhss 시스템에서 간섭 잡음을 회피하기 위한 장치 및 그 방법 | |
WO2009123412A1 (ko) | 노이지 음성 신호의 처리 방법과 이를 위한 장치 및 컴퓨터 판독 가능한 기록매체 | |
WO2020180027A1 (en) | Home appliance and method for controlling thereof | |
WO2011068315A4 (ko) | 최대 개념강도 인지기법을 이용한 최적의 데이터베이스 선택장치 및 그 방법 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 18899327 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 18899327 Country of ref document: EP Kind code of ref document: A1 |
|
32PN | Ep: public notification in the ep bulletin as address of the adressee cannot be established |
Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 04.02.2021) |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 18899327 Country of ref document: EP Kind code of ref document: A1 |