CN112199898A

CN112199898A - Instrument and equipment fault prediction and health management algorithm based on big data

Info

Publication number: CN112199898A
Application number: CN202011254305.7A
Authority: CN
Inventors: 王兆君; 金震; 李明; 曹朝辉
Original assignee: Beijing SunwayWorld Science and Technology Co Ltd
Current assignee: Beijing SunwayWorld Science and Technology Co Ltd
Priority date: 2020-11-11
Filing date: 2020-11-11
Publication date: 2021-01-08
Anticipated expiration: 2040-11-11
Also published as: CN112199898B

Abstract

The invention provides an instrument and equipment fault prediction and health management algorithm based on big data, which comprises the following steps: acquiring signal data of instrument equipment through a stress sensor, an acoustic sensor and a vibration sensor, transmitting the signal data to an equipment management center, and transmitting real-time operation parameters of the instrument equipment to the equipment management center through a machine tool interface; preprocessing the signal data and the operation parameters sent to the equipment management center to obtain preprocessed data; carrying out simulation analysis on the digital twin equipment by using a simulation tool according to the processed data to obtain simulation data; extracting data characteristics according to the simulation data and actual data of the instrument equipment to obtain data characteristics; and predicting the loss and the fault of the instrument and/or the key component of the instrument by utilizing a feed-forward neural network algorithm according to the data characteristics to obtain a prediction result.

Description

Instrument and equipment fault prediction and health management algorithm based on big data

Technical Field

The invention provides an instrument and equipment fault prediction and health management algorithm based on big data, and belongs to the technical field of equipment management.

Background

In the instrument and equipment management process, enterprises mostly maintain major repair, minor repair, spot inspection and the like of equipment according to experience, and some enterprises predict equipment faults by using a major data technology, firstly collect static data and dynamic data of the equipment, then establish an association relation between the equipment state and various data, and finally predict the faults through the obtained real-time data, wherein the methods are based on historical data and experience. The existing equipment management method still has the following problems:

1. the difference between the prediction result of the fault prediction method based on historical data and experience and the actual fault occurrence situation is large, so that the spare part stock is excessive or insufficient;

2. after the instrument equipment breaks down, the achievement rate of the production task is influenced, and the comprehensive utilization rate of the equipment is still low.

Disclosure of Invention

The invention provides an instrument and equipment fault prediction and health management algorithm based on big data, which is used for solving the problems that the difference between a fault prediction result and the actual fault occurrence situation is larger and the comprehensive utilization rate of equipment is lower, and adopts the following technical scheme:

a big-data based instrument failure prediction and health management algorithm, the method comprising:

acquiring signal data of instrument equipment through a stress sensor, an acoustic sensor and a vibration sensor, transmitting the signal data to an equipment management center, and transmitting real-time operation parameters of the instrument equipment to the equipment management center through an interface;

preprocessing the signal data and the operation parameters sent to the equipment management center to obtain preprocessed data;

carrying out simulation analysis on the digital twin equipment by using a simulation tool according to the processed data to obtain simulation data;

extracting data characteristics according to the simulation data and actual data of the instrument equipment to obtain data characteristics;

and predicting the loss and the fault of the instrument and/or the key component of the instrument by utilizing a feed-forward neural network algorithm according to the data characteristics to obtain a prediction result.

Further, the device management center stores the signal data and the operation parameters through a Kafka tool, and the method comprises the following steps:

receiving data through a Kafka tool, and sending the data to a database cache region through a Kafka message bus;

after receiving the data, the database cache region classifies the signal data and the operation parameters based on data types, and sorts the signal data and the operation parameters in respective corresponding data types according to the time sequence of data generation to obtain a data set;

the database cache region extracts the storage space value occupied by the data of the data set, and sends a storage request and the storage space value required by each data set to each database according to the request sending time interval;

after receiving a storage request and a storage space value required by each data set, each database detects whether the database completes the last data storage action, and if the database completes the data storage action, the time and the storage space residual quantity value used by the database for last data storage are sent to the database cache region; if the data storage action is not finished, sending a storage suspension request to the data cache region, and after the data storage action is finished, storing the data for a long time;

the database cache region takes the database corresponding to the returned storage space surplus as a candidate storage database, extracts the storage space surplus returned last time by the candidate storage database, and compares the storage space surplus currently returned by the candidate storage database with the storage space surplus returned last time to obtain a surplus difference value;

canceling the candidate qualification of the candidate storage database when the residual difference exceeds a first preset difference threshold; when the residual difference value exceeds a second preset difference threshold value and does not exceed a first preset difference threshold value, or the residual difference value does not exceed a second preset difference threshold value, reserving the candidate qualification of the candidate storage database;

the data buffer area compares the number of the data sets with the number of the candidate storage databases, and stores data according to the comparison result and the storage rule;

after receiving the time used for storing the data of the same batch fed back by all the databases, the data buffer adjusts the request sending time interval according to the following formula:

wherein T represents a transmission request time interval; m represents the number of the databases; t is₀Indicating an initial default value for the transmission request interval; t is_iThe data storage time for storing the data in the same batch fed back by the ith database is represented; t is_maxThe maximum time value used by the database for storing data in the same batch of storage processes is represented; t is_minAnd the minimum time value used by the database for data storage in the same batch of storage processes is represented.

Further, the storage rule is:

when the number of the data sets is smaller than that of the candidate database, screening out the candidate databases with the number corresponding to the number of the data sets according to the sequence from small to large of the residual difference, and sequentially storing each data set into the candidate databases arranged according to the sequence from small to large of the residual difference according to the sequence from large to small of the storage space occupied by the data;

when the number of the data sets is larger than that of the candidate storage databases, sequentially arranging the data sets according to the sequence of the storage space occupied by the data from large to small to obtain a data set sequence; storing the first A data sets in the data set sequence into a candidate database of which the residual difference value does not exceed a second preset difference threshold value, storing the B data sets in sequence into a candidate database of which the residual difference value exceeds the second preset difference threshold value and does not exceed a first preset difference threshold value, and storing the C data sets into a candidate database of which the residual difference value is minimum after finishing storing the A data sets and the B data sets;

wherein, A corresponds to the number of the candidate databases of which the residual difference value does not exceed a second preset difference threshold, B corresponds to the number of the candidate databases of which the residual difference value exceeds the second preset difference threshold and does not exceed a first preset difference threshold, and C corresponds to the difference value between the number of the data sets and the number of the candidate storage databases.

Further, the signal data and the operation parameters sent to the equipment management center are preprocessed to obtain preprocessed data, and the preprocessing comprises the following steps:

checking the data type of the signal data and the data corresponding to the operating parameters;

carrying out data type verification on the signal data and the data corresponding to the operating parameters;

performing data cleaning on the signal data and the operation parameters after the data type checking and the data type checking are completed to obtain cleaned data;

and carrying out data integration, data transformation and data specification processing on the cleaned data to obtain preprocessed data.

Further, performing simulation analysis on the digital twin device by using a simulation tool according to the processed data to obtain simulation data, wherein the simulation data comprises:

carrying out simulation analysis on the digital twin equipment by using an APDLANSYS virtual simulation method aiming at the processed data, and simulating the running condition of the instrument equipment in real time to obtain simulated data;

and performing consistency comparison on the simulation data and the operation data of the instrument equipment in real time, and keeping consistency between the simulation data and the operation data of the instrument equipment.

Further, extracting data features according to the simulation data and the actual data of the instrument and equipment to obtain the data features, including:

performing data feature extraction on the simulation data and actual data of the instrument equipment by using a Spark tool;

and inputting the data extracted by the Spark tool into the self-encoder, and processing the input data by using the self-encoder to acquire data characteristics.

Further, the device management center includes:

the data acquisition and transmission module is used for acquiring signal data of instrument equipment through the stress sensor, the acoustic sensor and the vibration sensor, transmitting the signal data to an equipment management center, and simultaneously transmitting real-time operation parameters of the instrument equipment to the equipment management center through a machine tool interface;

the device comprises a preprocessing module, a data processing module and a data processing module, wherein the preprocessing module is used for preprocessing signal data and operation parameters sent to a device management center to obtain preprocessed data;

the simulation module is used for carrying out simulation analysis on the digital twin equipment by utilizing a simulation tool aiming at the processed data to obtain simulation data;

the characteristic extraction module is used for extracting data characteristics according to the simulation data and actual data of the instrument equipment to obtain data characteristics;

and the prediction module is used for predicting the loss and the fault of the instrument and/or the key component of the instrument by utilizing a feedforward neural network algorithm according to the data characteristics to obtain a prediction result.

Further, the preprocessing module comprises:

the data type checking module is used for checking the data types of the data corresponding to the signal data and the operation parameters;

the data type checking module is used for carrying out data type checking on the signal data and the data corresponding to the operating parameters;

the data cleaning module is used for cleaning the signal data and the operation parameters which are used for finishing the data type checking and the data type checking to obtain the cleaned data;

and the data processing module is used for carrying out data integration, data transformation and data protocol processing on the cleaned data to obtain the preprocessed data.

Further, the simulation module includes:

the simulation analysis module is used for performing simulation analysis on the digital twin equipment according to the processed data by utilizing an APDLANSYS virtual simulation method, simulating the running condition of the instrument equipment in real time and obtaining simulated data;

and the consistency comparison module is used for carrying out consistency comparison on the simulation data and the operation data of the instrument equipment in real time and keeping consistency between the simulation data and the operation data of the instrument equipment.

Further, the feature extraction module comprises:

the data feature acquisition module is used for extracting data features of the simulation data and actual data of the instrument equipment by utilizing a Spark tool;

the sending module is used for inputting the data extracted by the Spark tool into the self-encoder;

and the self-encoder is used for processing the input data and acquiring data characteristics.

The invention has the beneficial effects that:

compared with the traditional instrument and equipment health management and fault diagnosis, the instrument and equipment fault prediction and health management algorithm based on the big data can realize the real-time interaction and omnibearing state comparison of physical and virtual equipment, simulate the physical state of the equipment in real time, capture more comprehensive equipment operation characteristics, diagnose and predict faults more accurately and verify maintenance strategies more accurately. Therefore, the demand of spare parts can be predicted more accurately, and shortage or excess can be avoided. The more targeted maintenance strategy can reduce the fault downtime of the equipment and provide the comprehensive utilization rate of the equipment.

Drawings

FIG. 1 is a flow chart of an algorithm according to the present invention;

FIG. 2 is a schematic diagram of the algorithm of the present invention.

Detailed Description

The preferred embodiments of the present invention will be described in conjunction with the accompanying drawings, and it will be understood that they are described herein for the purpose of illustration and explanation and not limitation.

The embodiment of the invention provides an instrument and equipment fault prediction and health management algorithm based on big data, and as shown in fig. 1 and fig. 2, the method comprises the following steps:

s1, collecting signal data of instrument equipment through a stress sensor, an acoustic sensor and a vibration sensor, sending the signal data to an equipment management center, and sending real-time operation parameters of the instrument equipment to the equipment management center through a machine tool interface;

s2, preprocessing the signal data and the operation parameters sent to the equipment management center to obtain preprocessed data;

s3, carrying out simulation analysis on the digital twin equipment by using a simulation tool according to the processed data to obtain simulation data;

s4, extracting data characteristics according to the simulation data and actual data of the instrument equipment to obtain data characteristics;

and S5, predicting the loss and the fault of the instrument and/or the key components of the instrument by utilizing a feed-forward neural network algorithm according to the data characteristics to obtain a prediction result.

The working principle of the technical scheme is as follows: firstly, data acquisition is carried out, various sensors such as a stress sensor, an acoustic sensor, a vibration sensor and the like are additionally arranged on instrument equipment, and signal data collected by the sensors are transmitted to an equipment management system in real time; the operation data of the equipment, including power, current, feed amount, work task, work duration and the like, is acquired through an interface of the equipment, and the acquired information is transmitted to an equipment management system in real time. Then, data processing is performed on the acquired data, namely, the acquired data is preprocessed, wherein the preprocessing comprises data type checking, data cleaning, data transformation and the like, and the preprocessing process is realized based on spark streaming. The data after preprocessing can be used for simulation analysis. Finally, carrying out simulation analysis according to the preprocessed data, which specifically comprises the following steps: and performing simulation analysis on the digital twin equipment by using ANSYS, wherein the simulation analysis data is derived from real-time equipment static and dynamic data, the running condition of the physical equipment can be simulated in real time by using a simulation tool, and the running condition is compared with the running data consistency of the physical equipment in real time, so that the reliability of a simulation result is ensured. Then, data feature extraction is carried out according to the simulation data and the actual data of the physical equipment, and the feature extraction can be completed by applying methods such as continuous wavelet transformation, maximum stress simulation and the like during extraction. Meanwhile, the extraction process of the data features is divided into a plurality of stages, the first stage is to use a Spark tool to extract the features, and the second stage is to extract the features based on a self-encoder, so that the dimension reduction of the data, the redundancy reduction and the deeper understanding of the data are realized. And finally, predicting the fault and the service life according to the simulation data and the data characteristics extracted from the actual data of the physical equipment.

The effect of the above technical scheme is as follows: compared with the traditional instrument and equipment health management and fault diagnosis, the instrument and equipment fault prediction and health management algorithm based on the big data can realize the real-time interaction and omnibearing state comparison of physical and virtual equipment, simulate the physical state of the equipment in real time, capture more comprehensive equipment operation characteristics, diagnose and predict faults more accurately and verify maintenance strategies more accurately. Therefore, the demand of spare parts can be predicted more accurately, and shortage or excess can be avoided. The more targeted maintenance strategy can reduce the fault downtime of the equipment and provide the comprehensive utilization rate of the equipment.

In an embodiment of the present invention, the device management center stores the signal data and the operation parameters by using a Kafka tool, including:

step 1, receiving data through a Kafka tool, and sending the data to a database cache region through a Kafka message bus;

step 2, after receiving the data, the database cache region classifies the signal data and the operation parameters based on data types, and sorts the signal data and the operation parameters in respective corresponding data types according to the time sequence of data generation to obtain a data set;

step 3, the database cache region extracts the storage space value occupied by the data of the data set, and sends a storage request and the storage space value required by each data set to each database according to the request sending time interval;

step 4, after receiving the storage request and the storage space value required by each data set, each database detects whether the database completes the last data storage action, and if the data storage action is completed, the time and the storage space residual quantity value used by the database for the last data storage are sent to the database cache region; if the data storage action is not finished, sending a storage suspension request to the data cache region, and after the data storage action is finished, storing the data for a long time;

step 5, the database cache region takes the database corresponding to the returned storage space surplus as a candidate storage database, extracts the storage space surplus returned last time by the candidate storage database, and compares the storage space surplus currently returned by the candidate storage database with the storage space surplus returned last time to obtain a surplus difference value;

step 6, when the residual difference value exceeds a first preset difference value threshold value, canceling the candidate qualification of the candidate storage database; when the residual difference value exceeds a second preset difference threshold value and does not exceed a first preset difference threshold value, or the residual difference value does not exceed a second preset difference threshold value, reserving the candidate qualification of the candidate storage database;

step 7, the data buffer area compares the number of the data sets with the number of the candidate storage databases, and data storage is carried out according to the comparison result and the storage rule;

step 8, after receiving the time used for storing the data of the same batch fed back by all the databases, the data buffer adjusts the request sending time interval according to the following formula:

Wherein the storage rule is:

The effect of the above technical scheme is as follows: by the method and the storage rule for data storage, the saturation rate of the storage space can be effectively reduced, meanwhile, the balance of the storage space surplus among the databases can be improved, and the problem that the data is unreasonably used due to the fact that the storage occupation difference among the data storage spaces is too large is avoided. Meanwhile, the data storage efficiency can be effectively improved through the arrangement of the database cache region and the storage steps, the timeliness and the speed of data storage are improved in the dynamic data storage process of big data, and the problems of reduction of the storage efficiency caused by overhigh data generation frequency and overhigh storage requirement frequency and data overstock caused by overhigh data waiting amount are effectively avoided. Meanwhile, through the self-adaptive adjustment of the sending time interval and the setting of the adjustment formula, the data backlog in the buffer area of the database can be effectively reduced, the data entry amount and the data output amount (namely the data amount sent to the database) in the buffer area of the database are kept highly balanced and reasonable, the problem of data backlog in the buffer area of the database is effectively reduced, and the data storage efficiency is improved. Meanwhile, the sending time interval of the data storage is adjusted according to the data storage time of the database, so that the balance between the data storage and the data storage sending can be effectively improved according to the actual data storage situation.

In an embodiment of the present invention, preprocessing signal data and operation parameters sent to an equipment management center to obtain preprocessed data includes:

s201, checking the data type of the signal data and the data corresponding to the operation parameters;

s202, carrying out data type verification on the signal data and the data corresponding to the operating parameters;

s203, performing data cleaning on the signal data and the operation parameters after the data type checking and the data type checking are completed, and obtaining cleaned data;

and S204, performing data integration, data transformation and data specification processing on the cleaned data to obtain preprocessed data.

The working principle of the technical scheme is as follows: firstly, checking the data type of the signal data and the data corresponding to the operation parameters; then, carrying out data type verification on the signal data and the data corresponding to the operating parameters; then, carrying out data cleaning on the signal data and the operation parameters after the data type checking and the data type checking are finished, and obtaining cleaned data; and finally, performing data integration, data transformation and data specification processing on the cleaned data to obtain preprocessed data.

The effect of the above technical scheme is as follows: the method effectively reduces noise signals in the signal data and the operation parameters, improves the accuracy of subsequent processing of the signal data and the operation parameters, further improves the accuracy of equipment fault prediction, and reduces the difference between a fault prediction result and the actual fault occurrence condition.

In an embodiment of the present invention, performing simulation analysis on the digital twin device by using a simulation tool with respect to the processed data to obtain simulation data, includes:

s301, carrying out simulation analysis on the digital twin equipment according to the processed data by utilizing an APDLANSYS virtual simulation method, and simulating the running condition of the instrument equipment in real time to obtain simulated data;

s302, performing consistency comparison on the simulation data and the operation data of the instrument in real time, and keeping consistency between the simulation data and the operation data of the instrument.

The working principle of the technical scheme is as follows: firstly, carrying out simulation analysis on the digital twin equipment by using an APDLANSYS virtual simulation method aiming at the processed data, and simulating the running condition of the instrument equipment in real time to obtain simulated data; and then, carrying out consistency comparison on the simulation data and the operation data of the instrument equipment in real time, and keeping consistency between the simulation data and the operation data of the instrument equipment.

The effect of the above technical scheme is as follows: the reliability of the simulation result is effectively improved, the accuracy of equipment fault prediction is further improved, and the difference between the fault prediction result and the actual fault occurrence condition is reduced.

According to an embodiment of the present invention, extracting data features according to the simulation data and the actual data of the instrument and device to obtain data features includes:

s401, data feature extraction is carried out on the simulation data and actual data of the instrument equipment by using a Spark tool;

s402, inputting the data extracted by the Spark tool into a self-encoder, and processing the input data by the self-encoder to acquire data characteristics.

The working principle of the technical scheme is as follows: firstly, performing data feature extraction on the simulation data and actual data of the instrument equipment by using a Spark tool; then, the data extracted by the Spark tool is input into the self-encoder, and the self-encoder is used for processing the input data to acquire data characteristics. The Spark tool performs targeted extraction on different characteristics of data in modes of continuous wavelet transformation, kurtosis index acquisition, effective value acquisition, maximum stress simulation, energy simulation, stress cycle times and the like.

The effect of the above technical scheme is as follows: by means of the data feature extraction, accuracy and feature comprehensiveness of data feature extraction can be effectively improved, accuracy of equipment fault prediction is further improved, and difference between a fault prediction result and an actual fault occurrence condition is reduced. Meanwhile, through the feature extraction processing of the two stages, the dimension reduction of the data can be effectively realized, the redundancy is reduced, the equipment management center can deeply know the data in the data processing process, and the prediction accuracy of the equipment fault can be effectively improved when the fault prediction is carried out by utilizing a feedforward neural network algorithm in the follow-up process.

In an embodiment of the present invention, the device management center includes:

The working principle of the technical scheme is as follows: firstly, a data acquisition and transmission module is used for acquiring signal data of instrument equipment through a stress sensor, an acoustic sensor and a vibration sensor, transmitting the signal data to an equipment management center, and simultaneously transmitting real-time operation parameters of the instrument equipment to the equipment management center through a machine tool interface; then, preprocessing the signal data and the operation parameters sent to the equipment management center through a preprocessing module to obtain preprocessed data; then, a simulation module is adopted to carry out simulation analysis on the digital twin equipment aiming at the processed data by utilizing a simulation tool to obtain simulation data; then, a feature extraction module is adopted to extract data features according to the simulation data and actual data of the instrument equipment to obtain data features; and finally, predicting the loss and the fault of the instrument and/or the key component of the instrument by utilizing a feed-forward neural network algorithm through a prediction module according to the data characteristics to obtain a prediction result.

In one embodiment of the present invention, the preprocessing module comprises:

The working principle of the technical scheme is as follows: firstly, checking the data type of the signal data and the data corresponding to the operating parameters through a data type checking module; then, carrying out data type verification on the signal data and the data corresponding to the operating parameters by using a data type verification module; then, data cleaning is carried out on the signal data and the operation parameters which are subjected to data type checking and data type checking through a data cleaning module, and cleaned data are obtained; and finally, performing data integration, data transformation and data specification processing on the cleaned data by adopting a data processing module to obtain the preprocessed data.

In one embodiment of the present invention, the simulation module includes:

The working principle of the technical scheme is as follows: firstly, a simulation analysis module carries out simulation analysis of digital twin equipment on the processed data by utilizing an APDL ANSYS virtual simulation method, and simulates the running condition of the instrument equipment in real time to obtain simulated data; and then, carrying out consistency comparison on the simulation data and the operation data of the instrument equipment in real time by using a consistency comparison module, and keeping consistency between the simulation data and the operation data of the instrument equipment.

In one embodiment of the present invention, the feature extraction module includes:

The working principle of the technical scheme is as follows: firstly, data feature extraction is carried out on the simulation data and actual data of the instrument equipment by a data feature acquisition module through a Spark tool; then, inputting the data extracted by the Spark tool into a self-encoder by adopting a sending module; and finally, processing the input data through an autoencoder to obtain data characteristics.

It will be apparent to those skilled in the art that various changes and modifications may be made in the present invention without departing from the spirit and scope of the invention. Thus, if such modifications and variations of the present invention fall within the scope of the claims of the present invention and their equivalents, the present invention is also intended to include such modifications and variations.

Claims

1. An instrument equipment fault prediction and health management algorithm based on big data, characterized in that the method comprises:

acquiring signal data of instrument equipment through a stress sensor, an acoustic sensor and a vibration sensor, transmitting the signal data to an equipment management center, and transmitting real-time operation parameters of the instrument equipment to the equipment management center through a machine tool interface;

2. The algorithm of claim 1, wherein the device management center stores the signal data and the operating parameters via a Kafka tool, comprising:

wherein T represents a transmission request time interval; m represents the number of the databases; t is₀Indicating an initial default value for the transmission request interval; t is_iThe data storage time for storing the data in the same batch fed back by the ith database is represented; t is_maxRepresenting the same batch of storage processes, the database is enteredA maximum time value for line data storage; t is_minAnd the minimum time value used by the database for data storage in the same batch of storage processes is represented.

3. The algorithm of claim 2, wherein the storage rule is:

4. The algorithm of claim 1, wherein pre-processing the signal data and the operating parameters sent to the device management center to obtain pre-processed data comprises:

5. The algorithm of claim 1, wherein performing simulation analysis of the digital twin device on the processed data using a simulation tool to obtain simulation data comprises:

carrying out simulation analysis on the digital twin equipment by using an APDL ANSYS virtual simulation method aiming at the processed data, and simulating the running condition of the instrument equipment in real time to obtain simulated data;

6. The algorithm of claim 1, wherein performing data feature extraction according to the simulation data and actual data of the instrument device to obtain data features comprises:

7. The algorithm of claim 1, wherein the device management center comprises:

8. The algorithm of claim 6, wherein the preprocessing module comprises:

9. The algorithm of claim 6, wherein the simulation module comprises:

the simulation analysis module is used for performing simulation analysis on the digital twin equipment according to the processed data by utilizing an APDL ANSYS virtual simulation method, simulating the running condition of the instrument equipment in real time and obtaining simulated data;

10. The algorithm of claim 6, wherein the feature extraction module comprises: