CN112518425B - Intelligent machining cutter wear prediction method based on multi-source sample migration reinforcement learning - Google Patents

Intelligent machining cutter wear prediction method based on multi-source sample migration reinforcement learning Download PDF

Info

Publication number
CN112518425B
CN112518425B CN202011462218.0A CN202011462218A CN112518425B CN 112518425 B CN112518425 B CN 112518425B CN 202011462218 A CN202011462218 A CN 202011462218A CN 112518425 B CN112518425 B CN 112518425B
Authority
CN
China
Prior art keywords
wear
source
task
state
cutter
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202011462218.0A
Other languages
Chinese (zh)
Other versions
CN112518425A (en
Inventor
杨文安
刘学为
郭宇
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nanjing University of Aeronautics and Astronautics
Original Assignee
Nanjing University of Aeronautics and Astronautics
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nanjing University of Aeronautics and Astronautics filed Critical Nanjing University of Aeronautics and Astronautics
Priority to CN202011462218.0A priority Critical patent/CN112518425B/en
Publication of CN112518425A publication Critical patent/CN112518425A/en
Application granted granted Critical
Publication of CN112518425B publication Critical patent/CN112518425B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • BPERFORMING OPERATIONS; TRANSPORTING
    • B23MACHINE TOOLS; METAL-WORKING NOT OTHERWISE PROVIDED FOR
    • B23QDETAILS, COMPONENTS, OR ACCESSORIES FOR MACHINE TOOLS, e.g. ARRANGEMENTS FOR COPYING OR CONTROLLING; MACHINE TOOLS IN GENERAL CHARACTERISED BY THE CONSTRUCTION OF PARTICULAR DETAILS OR COMPONENTS; COMBINATIONS OR ASSOCIATIONS OF METAL-WORKING MACHINES, NOT DIRECTED TO A PARTICULAR RESULT
    • B23Q17/00Arrangements for observing, indicating or measuring on machine tools
    • B23Q17/09Arrangements for observing, indicating or measuring on machine tools for indicating or measuring cutting pressure or for determining cutting-tool condition, e.g. cutting ability, load on tool
    • B23Q17/0952Arrangements for observing, indicating or measuring on machine tools for indicating or measuring cutting pressure or for determining cutting-tool condition, e.g. cutting ability, load on tool during machining
    • B23Q17/0957Detection of tool breakage

Abstract

The method for predicting the wear of the intelligent machining cutter based on the multi-source sample transfer reinforcement learning comprises the following steps: acquiring a plurality of source tasks and a target task according to different wear curves of several cutters; initializing model parameters and maximum iteration time; detecting the wear state of the current cutter, performing feature extraction and dimension reduction on wear data, constructing a training sample set of a weighted extreme learning machine, and training the weighted extreme learning machine; executing a machining action, observing the abrasion state of the current cutter, and calculating state similarity and return similarity between each source task and each target task; calculating the probability of each sample in the source task belonging to a target sample set; acquiring task similarity, and transferring a fixed number of samples from each source sample set to a target sample set; updating the Q value by using a Q learning mechanism based on a weighted extreme learning machine, and adding new tool wear data into a target sample set; and (3) constructing a state equation and an observation equation of the particle filter model.

Description

Intelligent machining cutter wear prediction method based on multi-source sample migration reinforcement learning
Technical Field
The disclosure belongs to the field of tool wear detection of numerical control manufacturing equipment, and particularly relates to an intelligent machining tool wear prediction method based on multi-source sample migration reinforcement learning.
Background
The most common faults in the digital processing process are the abrasion and the damage of the cutter, which not only directly affects the manufacturing cost, the product quality and the production efficiency, but also causes great equipment shutdown accidents in serious cases. Therefore, the detection of the cutter state has important significance in the aspects of ensuring the safety of a processing system, smoothly processing, reducing the production cost, improving the production efficiency and the like. However, since the machining process involves physical reactions, conversion and transfer of substances and energy, and the complexity and uncertainty of the machining process cause difficulty in detecting the state parameters of the tool during the machining process, the premise for realizing the method is that the state information of the tool during the machining process can be effectively obtained and reflected, regardless of the specific implementation of advanced machining process control algorithms and strategies during the machining process control, or the parameter optimization, the state monitoring and fault diagnosis of the tool during the machining process. Due to the limitation of the development level of the machining process detection technology, a plurality of advanced machining process control algorithms and strategies can only stay in theoretical discussion at present and are difficult to apply to actual machining, and a plurality of machining production systems cannot improve the safety and reliability of system operation by means of measures such as fault diagnosis and state monitoring. To solve the above problems, virtual measurement techniques have been developed.
The Chinese patent application of the invention discloses a method for monitoring the abrasion of a numerical control machine tool cutter based on deep learning (CN 201711117628), which is characterized in that three-phase current signals of a spindle motor of the numerical control machine tool are collected, current signals corresponding to the cutter to be detected are intercepted from the three-phase current signals, each section of current signals are normalized, each section of current signals after normalization is input into a sparse automatic coding network for training, and further the detection of the abrasion of the cutter is realized.
The Chinese patent application for 'a cutter wear monitoring method' (CN 103465107A) optimizes an initial weight value and a threshold value of a neural network through a genetic algorithm by acquiring acoustic emission signals of a cutter in various wear states, current signals of a spindle motor and a feed motor in a machine tool, cutting speed, cutting depth and feed amount, and predicts the cutter wear degree by using a trained neural network. However, the method has the disadvantages of large data volume, unbalanced data distribution, long time for training the neural network and low accuracy, and the obtained neural network can only detect a single cutter type, so that the method is not strong in applicability.
The european patent application "Tool monitor" (EP 0334341 A2) uses an acoustic emission sensor to collect an acoustic emission signal of a Tool of a manufacturing equipment, and combines a band-pass filter to preprocess the signal, and obtains a Tool state by comparing the acoustic emission signal of the Tool of the manufacturing equipment collected in real time with the acoustic emission signal when the Tool is broken. However, the method cannot acquire the implicit characteristics of the acoustic emission signal and the cutter state, so that the real-time state of the cutter cannot be completely reflected by the filtered acoustic emission signal, the simple comparison and judgment need to be carried out in an off-line mode, the judgment result mainly depends on the past experience, and the accuracy of the detection result is insufficient.
The european patent application "Cutting tool wear monitor" (EP 0165745 A2) detects the wear state of a rotary tool by collecting short-circuit current, open-circuit voltage and power signals generated on a workpiece during machining in combination with empirical analysis, and the generated current, voltage and power gradually increase as the tool wears until a sharp increase indicates that the tool is damaged due to excessive wear or breakage. However, the current and the voltage are easily affected by external factors, so that the detection result cannot accurately represent the real-time wear state of the cutter, and only whether the cutter is worn or not can be judged through the change of the current and the voltage, but the real-time wear amount of the cutter cannot be judged, and the requirement of real-time detection cannot be met.
Disclosure of Invention
Aiming at least one aspect of the problems, the method for predicting the wear of the intelligent machining cutter based on the multi-source sample transfer reinforcement learning is designed, the multi-source transfer reinforcement learning model and the particle filter model are utilized, and a virtual detection method is combined, so that the problems that the detection target of a cutter wear detection system is single and the accuracy is insufficient in the current machining process are effectively solved.
Specifically, aiming at the defects of the existing cutter wear detection method, the disclosed method aims to provide an intelligent machining cutter wear prediction method which is used for detecting different types of cutters and is based on multi-source sample migration reinforcement learning, has good accuracy and strong data processing capacity, utilizes the idea of migration learning to reduce the difference between wear state data of different cutters, enables the model to detect the wear states of different cutters, reduces the required training data amount, enables the model to train more rapidly, combines a Weighted Extreme Learning Machine (WELM) algorithm to adjust the imbalance among training samples, enables the training samples to be more effective, and improves the accuracy of the model.
The embodiment of the disclosure is realized by adopting the following technical scheme:
the intelligent machining cutter wear prediction method based on multi-source sample migration reinforcement learning is characterized by comprising the following steps:
the method comprises the following steps: constructing an intelligent machining cutter wear prediction system based on multi-source migration reinforcement learning;
step two: acquiring a plurality of source tasks and a target task according to different wear curves of several cutters;
step three: initializing model parameters and maximum iteration time;
step four: detecting the wear state of the current cutter, performing feature extraction and dimension reduction on wear data, constructing a training sample set of a weighted extreme learning machine, and training the weighted extreme learning machine;
step five: executing a machining action, observing the wear state of the current cutter, simultaneously observing the next wear state and return of the cutter, and calculating the state similarity and return similarity between each source task and each target task;
step six: calculating the probability that each sample in the source task belongs to the target sample set, and performing descending order on the samples of each cutter abrasion source task based on the probability;
step seven: acquiring task similarity, and transferring a fixed number of samples from each source sample set to a target sample set;
step eight: updating the Q value by using a Q learning mechanism based on a weighted extreme learning machine, adding new tool wear data to a target sample set, and then rolling a time window forward to discard the oldest sample;
step nine: constructing a state equation and an observation equation of a particle filter model; and
step ten: and predicting the wear state of the cutter for intelligent machining.
According to the embodiment of the disclosure, the system for predicting the wear of the intelligent machining tool in the first step comprises a tool state virtual detection object, a virtual perception model, an empirical model and a particle filter model. The virtual detection objects of the cutter state are different cutters, and a source task and a target task are respectively obtained from the virtual detection objects; the virtual perception model is mainly used for determining an observation equation of a particle filter model and comprises four parts, namely tool wear data acquisition, tool wear data feature selection, tool wear data feature dimension reduction and multi-source migration reinforcement learning based on WELM, wherein the tool wear data are acquired online through a sensor and are acquired offline through a microscope respectively, the tool wear data are subjected to feature selection through statistical features, time domain features and frequency domain features, the selected features are subjected to feature dimension reduction through Kernel Principal Component Analysis (KPCA) method, and the multi-source migration reinforcement learning based on WELM consists of a sample space migration stage, a task space migration stage and a WELM-based Q learning stage; the empirical model is mainly used for determining a state equation of the particle filter model and consists of various tool wear empirical models, such as a tool wear rate model, a tool wear loss model and the like; the particle filter model consists of an observation equation and a state equation and completes virtual detection of the tool wear state of the target task.
According to the embodiment of the disclosure, the model parameters in the third step comprise a WELM learning rate, a WELM discount factor, a Q value updating probability, a WELM hidden layer neuron number L, a Gaussian kernel width parameter and a matching degree control parameter Z P And Z Q Probability control parameter Z X Source task sample number and target task sample number.
According to the embodiment of the disclosure, in the fourth step, the wear state of the current tool is detected by using an online and offline measurement method, and the wear data is subjected to feature extraction and dimension reduction and then used for constructing a training sample set.
For example, in the fourth step, the sensor is used for realizing online measurement of the tool state, the microscope is used for realizing offline measurement of the tool wear state, different feature extraction and dimension reduction methods are used for preprocessing the wear data, and the preprocessed data are used as a training sample set of the WLEM.
According to the embodiment of the disclosure, in the fifth step, the obtained state similarity and the return similarity are used for quantifying the similarity between each sample of the source task and the target task; in the sixth step, the state similarity and the return similarity are used for calculating the probability that each sample in the source task belongs to the target sample set.
According to the embodiment of the disclosure, the method for calculating the state similarity and the return similarity in the step five is as follows:
suppose that
Figure BDA0002829034020000051
And
Figure BDA0002829034020000052
respectively representing the sample sets of the kth source task and the target task, wherein the jth source sample and the ith target sample are respectively represented as
Figure BDA0002829034020000053
And
Figure BDA0002829034020000054
calculated by equations (1) and (2)
Figure BDA0002829034020000055
And with
Figure BDA0002829034020000056
State similarity between them
Figure BDA0002829034020000057
And reporting similarity
Figure BDA0002829034020000058
Figure BDA0002829034020000059
Figure BDA00028290340200000510
In the formula (I), the compound is shown in the specification,
Figure BDA00028290340200000511
being similar weights, δ s ,δ q And delta sa Is the width parameter of the gaussian kernel. Usually will be delta s ,δ q And delta sa Is set to correspond to the molecule | | s' i ,s′ j ||,|Q i -Q j I and I(s) j ,a j ),(s i ,a i ) The same order of magnitude of | this may be
Figure BDA00028290340200000512
And w ij The value of (b) is limited to a reasonable range. It is clear that the higher the similarity between two samples,
Figure BDA00028290340200000513
the larger. Also, as can be seen from equation (2), two Q values Q i And Q j The closer the distance is to each other,
Figure BDA00028290340200000514
the larger.
According to the embodiment of the present disclosure, in the sixth step, the probability that each sample in the source task belongs to the target sample set is used as a sample migration weight, and the value determines the possibility of sample migration.
According to the embodiment of the disclosure, the probability calculation method that each sample in the source task in the sixth step belongs to the target sample set is as follows:
Figure BDA00028290340200000515
in the formula (I), the compound is shown in the specification,
Figure BDA00028290340200000516
referred to as the sample migration weight, is,
Figure BDA00028290340200000517
and
Figure BDA00028290340200000518
respectively represent
Figure BDA00028290340200000519
And
Figure BDA00028290340200000520
the definition of which is similar to equations (1) and (2).
According to the embodiment of the disclosure, in the seventh step, the task similarity is calculated according to the bayesian probability analysis theory, and the value is used for determining the number of samples migrated from each source task to the target task set.
For example, task similarity in step seven
Figure BDA0002829034020000061
The calculation method of (2) is as follows:
from the above analysis, we can calculate the matching degree of the status and the return, as follows:
Figure BDA0002829034020000062
Figure BDA0002829034020000063
in the formula, Z P And Z Q Indicating the control parameter, Z may be set without overflowing the normal value P And Z Q Is set to 1, i.e. Z P= Z Q =1. By multiplying equation (4) by equation (5), the ith target sample can be obtained
Figure BDA0002829034020000064
And source task S k Total degree of match therebetween, i.e.
Figure BDA0002829034020000065
Thus, a likelihood ratio X between the source task and the target task can be obtained k
Figure BDA0002829034020000066
In the formula, P (S) k ) Is a model S k A priori of, and Z X Is a control parameter of the possibility. Z is a linear or branched member X And Z P And Z Q With the same effect, Z can be adjusted without overflowing the usual values X Is set to 1, i.e. Z X=1 . We can normalize the likelihood ratios of all active tasks
Figure BDA0002829034020000067
To obtain a source task S k Task similarity with T, while also requiring migration from M source tasks (n) S -n T ) One sample is used to supplement the target sample.
According to an embodiment of the present disclosure, in the step eight, the maximum Q value or the random Q value is selected with different probabilities to update the Q value.
In the step eight, the rolling time window is used for updating the target sample set, so that the condition that the learning speed of the weighted extreme learning machine is too low due to the fact that the sample set is too large is avoided.
According to the embodiment of the disclosure, the palm-based Q learning mechanism in the step eight is as follows: in an embodiment of the present disclosure, the WELM is used to approximate a reinforcement learning Q-value function.
Suppose s l =[s l1 ,s l2 ,...,s lm ] T ∈R m Represents a system state having m dimensions, and a l e.R represents the action of a Q learning agent, and the input vector of WELM is a state-action pair x l =(s l ,a l ) T =[s l1 ,s l2 ,...,s lm ,a l ] T ∈R m+1 And the output of the WELM is estimated to correspond to(s) l ,a l ) Is given by the formula (i), wherein L =1, 2. Alpha is alpha 1 =[α 11 ,α 12 ,...,α 1(m+1 )]And ω = [ ] 1 ,ω 2 ,..,ω L ] T Is the input and output weight vector, β = [ beta ] 1 ,β 2 ,...,β L ] T Is the deviation of the hidden layer node. To ensure the learning effect of WELMThe number of hidden layer nodes should be equal to the number of training samples, i.e. the number of hidden layer nodes is L.
According to an embodiment of the present disclosure, the architecture of the multi-source migration learning algorithm in step eight is as follows: task space Γ = (S, T) = (S) for multi-source migration reinforcement learning in this disclosure 1 ,...,S k ,..,S M T), the formula includes M source tasks and an unknown target task T. Each source task containing n s A sample and the target task contains n T And (4) sampling. On the one hand, the cost of collecting the sample is high, and n T The sample does not train task T well. On the other hand, there is less of a cost to use M source tasks and extract the required samples than to resample from the current environment. Therefore, we apply a Knowledge Transfer (KT) technique to the WELM-based Q learning to improve the learning speed of the target task T. The mapping of KT is described as follows:
Figure BDA0002829034020000071
in the formula
Figure BDA0002829034020000072
And
Figure BDA0002829034020000073
representing the sample sets corresponding to the source task and the target task, respectively, and Λ represents the optimal transmission sample set.
Compared with the prior art, the technical scheme provided by the embodiment of the disclosure has at least the following beneficial effects:
(1) The migration learning idea adopted by the embodiment of the disclosure has at least the following advantages:
1) The demand for training data volume is smaller when the cutter detection model is trained, and the advantage of the model training speed is obviously improved;
2) The generalization capability of the model obtained by utilizing the transfer learning training is stronger, the capability of the model on classifying well on non-training data is enhanced, and the applicability of the model is improved;
3) The transfer learning training process is more robust, the number of trainable parameters can be reduced by 100%, so that the training is more stable, the debugging is easier, and the accuracy of the tool detection model is improved.
(2) The reinforcement learning model based on the weighted extreme learning machine adopted by the embodiment of the disclosure has at least the following advantages:
1) The tool wear virtual detection system has strong learning capacity, and can learn enough abnormal tool wear states to be communicated;
2) The cutter wear virtual detection system can train unbalanced data, and the applicability and accuracy of the system are improved;
3) The tool wear virtual detection system has continuous self-improvement capability, and the understanding and comprehension of the tool wear condition in the intelligent machining process are gradually improved.
(3) The virtual measurement technology adopted by the embodiment of the disclosure has at least the following advantages:
1) The cutter wear virtual detection system has the functions of error compensation and fault diagnosis, so that the precision and the reliability of cutter wear virtual measurement are improved;
2) The virtual detection system for the cutter wear can comprehensively use a plurality of measurable cutter state information to carry out state estimation, diagnosis and trend analysis on the wear state of the cutter to be detected;
3) The tool wear virtual detection system can acquire microcosmic real-time state information of the detected tool on line so as to meet the requirement of tool wear virtual detection;
therefore, in the embodiment of the disclosure, the intelligent machining tool wear prediction method based on multi-source sample migration reinforcement learning at least solves the problems that the tool state information existing in the intelligent machining process cannot be detected on line in real time, so that real-time control is difficult to perform and the machining quality is difficult to ensure.
Drawings
The accompanying drawings, which are included to provide a further understanding of the disclosure and are incorporated in and constitute a part of this disclosure, illustrate embodiments of the disclosure and together with the description serve to explain the disclosure and not to limit the disclosure. In the drawings:
FIG. 1 is a system block diagram of a method for intelligent machining tool wear prediction based on multi-source sample migration reinforcement learning according to an embodiment of the present disclosure; and
fig. 2 is a block diagram of a wellness learning system based on a wellm in accordance with an embodiment of the present disclosure.
Detailed Description
The technical solution of the present disclosure is further specifically described below by way of examples and with reference to the accompanying drawings. In the specification, the same or similar reference numerals denote the same or similar components. The following description of the embodiments of the present disclosure with reference to the accompanying drawings is intended to explain the general inventive concept of the present disclosure and should not be construed as limiting the present disclosure.
Furthermore, in the following detailed description, for purposes of explanation, numerous specific details are set forth in order to provide a thorough understanding of the embodiments of the disclosure. It may be evident, however, that one or more embodiments may be practiced without these specific details.
The virtual measurement technology is an automatic measurement technology developed along with a computer technology and a modern measurement technology, realizes measurement of a process variable to be measured by utilizing an easily-measured process variable through various mathematical calculation and estimation methods according to a mathematical relationship between the easily-measured process variable and the process variable to be measured which is difficult to directly measure, overcomes the defects of sealing, lack of flexibility, low response speed and the like of a hardware detection technology, and has the advantages of functionalization, modularization, data sharing, low cost and the like. The virtual measurement technology is applied to cutter abrasion detection, cutter abrasion after actual processing can be predicted, cutter abrasion and damage problems possibly occurring in the actual processing can be found in advance, a feasible modification scheme is provided in a targeted manner, processing parameters are optimized, and the virtual measurement technology has important significance for ensuring the quality of processed products, greatly reducing the processing cost, shortening the processing period and improving the processing efficiency. The virtual measurement technology for the tool wear mainly comprises a tool wear state equation and a tool wear observation equation, wherein the state equation represents the inherent state change of the tool in the machining process and describes the evolution behavior of the tool degradation condition. The observation equation represents the mapping from the current tool state to the tool wear state, but because the intelligent machining process has the characteristics of multiple variables, variable type mixing and strong nonlinear coupling among variables, and the tool wear state in the machining process is interfered by various uncertain factors such as workpiece materials, machining working conditions, machining equipment states and the like, the mapping relationship from the current tool state to the tool wear state is difficult to describe by a mathematical model. The existing solution is to use an intelligent learning model to represent a nonlinear mapping relationship from a current tool state to a tool wear state, so that the accuracy of a virtual detection result of the tool wear state mainly depends on the construction of the intelligent learning model.
The tool wear intelligent learning model established by the traditional machine learning method usually needs data to be in the same distribution, so that more data are needed to be trained when a batch of the same workpieces are processed, but the data of the workpieces produced by a single piece in small batch and processed by the same parameters are less, and the training requirements cannot be met. In the embodiment of the disclosure, data under other parameters are considered to assist training, and variable parameter tool wear detection is realized, which is different from tool wear detection of the same parameter, and a signal acquired by variable parameter tool wear detection is influenced by two factors, namely the actual wear condition of the tool and the change of the cutting parameter. Therefore, data under different parameters are distributed in different data, in order to detect the state of a parameter tool by using tool wear data of other parameters, in the embodiment of the disclosure, a transfer learning method is used to solve the problem of insufficient data in the single-piece small-batch production process, because the transfer learning can help the tool state detection model of a certain parameter to learn by using signal data under other parameters of different distributions.
However, the tool wear sample data obtained in practical applications is always of unbalanced class, i.e. the sample size of one class is much larger than that of another or some classes. For the unbalanced classification problem caused by the classification problem of class data, the traditional machine learning method is mainly based on an intelligent learning model with minimized experience risk or minimized structure risk for classification. The least empirical risk is that the classification error rate of the model on the training set is expected to be as low as possible, and the samples in a few classes are subjected to a large amount of error classification; the minimum structural risk is the desire to maximize the inter-class distance, and the influence of the majority class on the classifier will generally push the inter-class spacing surface to the side of the minority class, thereby causing the classifier to degrade the recognition performance of the minority class. For a large-scale data set, particularly a tool wear data set, the traditional algorithm cannot conveniently and effectively extract data features, so that the classification error rate is improved, and the time cost for searching a classification spacing surface by the algorithm is high.
In this document, so-called migration learning is to use existing knowledge to learn new knowledge, and the core is to find the similarity between the existing knowledge and the new knowledge. In the transfer learning, the existing knowledge is called a source domain, the new knowledge to be learned is called a target domain, the source domain and the target domain are different but have certain correlation, the distribution difference between the source domain and the target domain needs to be reduced, and the knowledge transfer is carried out, so that the data calibration is realized. For example, the source domain target domain distinction is expressed as follows: the target domain can be different in data distribution, feature dimension and model output variation condition relative to the source domain, and the target domain is better modeled by organically utilizing the knowledge in the source domain. In addition, under the condition that target domain calibration data is lack, the migration learning can well utilize the calibrated data in the related field to finish the calibration of the data. It should be noted that if the similarity between the source domain and the target domain is not sufficient, the migration result is not ideal, and a so-called negative migration situation occurs. Therefore, finding the source domain and the target domain with the highest similarity is the most important prerequisite for the whole migration process.
Further, herein, the expression "domain" may include data features and feature distributions, which are subjects of machine learning; the expression "source domain" is a domain that is already known; the expression "target domain" is a domain to be learned; the expression "task" includes an objective function and a learning result, which is a result of learning, and can be understood as a classifier.
As shown in fig. 1, a system block diagram of an intelligent machining tool wear prediction method based on multi-source sample migration reinforcement learning according to an embodiment of the present disclosure is schematically shown. Referring to fig. 1, the intelligent machining tool wear prediction method based on multi-source sample migration reinforcement learning according to an embodiment of the present disclosure may include the following steps.
In the first step, an intelligent machining tool wear prediction system based on multi-source migration reinforcement learning is constructed, as shown in fig. 1.
Specifically, in step one, the tool wear prediction system (also referred to as a tool wear state virtual detection system) may include a tool state virtual detection object, a virtual perception model, an empirical model and a particle filter model. The virtual detection objects of the cutter state are different cutters, and a source task and a target task are respectively obtained from the virtual detection objects; the virtual perception model is mainly used for determining an observation equation of a particle filter model and comprises four parts, namely tool wear data acquisition, tool wear data feature selection, tool wear data feature dimension reduction and multi-source migration reinforcement learning based on a Weighted Extreme Learning Machine (WELM), wherein the tool wear data are acquired online through a sensor and acquired offline through a microscope respectively, the tool wear data are subjected to feature selection by utilizing statistical features, time domain features and frequency domain features, the selected features are subjected to feature dimension reduction by utilizing a Kernel Principal Component Analysis (KPCA) method, and the multi-source migration reinforcement learning based on the WELM comprises a sample space migration stage, a task space migration stage and a Q learning stage based on the WELM; the empirical model is mainly used for determining a state equation of the particle filter model and consists of various tool wear empirical models, such as a tool wear rate model, a tool wear loss model and the like; the particle filter model consists of an observation equation and a state equation and completes virtual detection of the tool wear state of the target task.
In step two, a plurality of source tasks S are obtained according to different wear curves of several tools 1 ,S 2 ,...,S M And a target task T.
In step three, model parameters and maximum iteration time are initialized.
Specifically, in step three, the model parameters to be initialized include a WELM learning rate eta, a WELM discount factor gamma, and a Q value update probability epsilon 0 Welm hidden layer neuron number L, width parameter of Gaussian kernel δ s 、δ sa And δ q, matching degree control parameter Z P And Z Q Probability control parameter Z X Number of samples n of source task S And a target number of task samples n T
And in the fourth step, detecting the wear state of the current cutter, performing feature extraction and dimension reduction on the wear data, constructing a WELM training sample set D, and training the WELM.
Specifically, in the fourth step, the sensor is used for realizing online measurement of the tool state, the microscope is used for realizing offline measurement of the tool wear state, different feature extraction and dimension reduction methods are used for preprocessing wear data, and the preprocessed data are used as a WLEM training sample set.
And step five, executing the machining action, observing the current tool wear state, simultaneously observing the wear state and return of the next tool, and calculating the state similarity and return similarity between each source task and the target task.
Specifically, the method for calculating the state similarity and the return similarity in step five is as follows:
suppose that
Figure BDA0002829034020000121
And with
Figure BDA0002829034020000122
Respectively representing the sample sets of the kth source task and the target task, wherein the jth source sample and the ith target sample are respectively represented as
Figure BDA0002829034020000123
And
Figure BDA0002829034020000124
calculated by equations (1) and (2)
Figure BDA0002829034020000125
And
Figure BDA0002829034020000126
similarity of state therebetween
Figure BDA0002829034020000127
And reporting the similarity
Figure BDA0002829034020000128
Figure BDA0002829034020000129
Figure BDA00028290340200001210
In the formula (I), the compound is shown in the specification,
Figure BDA00028290340200001211
being similar weights, δ s ,δ q And delta sa Is the width parameter of the gaussian kernel. Usually will be delta s ,δ q And delta sa Is set to the corresponding molecule | | | s' i ,s′ j ||,|Q i -Q j I and I(s) j ,a j ),(s i ,a i ) The same order of magnitude of | this may be
Figure BDA00028290340200001212
And w ij The value of (b) is limited to a reasonable range. It is clear that the higher the similarity between two samples,
Figure BDA00028290340200001213
the larger. Also, as can be seen from equation (2), two Q values Q i And Q j The closer the distance is to the point of departure,
Figure BDA00028290340200001214
the larger.
And in the sixth step, calculating the probability that each sample in the source task belongs to the target sample set, and performing descending order on the samples of each tool wear virtual detection source task based on the probability.
Specifically, in the sixth step, the probability calculation method that each sample in the source task belongs to the target sample set is as follows:
Figure BDA0002829034020000131
in the formula (I), the compound is shown in the specification,
Figure BDA0002829034020000132
referred to as the sample-migration weight,
Figure BDA0002829034020000133
and
Figure BDA0002829034020000134
respectively represent
Figure BDA0002829034020000135
And
Figure BDA0002829034020000136
the definition of which is similar to equations (1) and (2).
In step seven, task similarity is obtained, and a fixed number of samples are migrated from each source sample set to the target sample set.
Specifically, the task similarity in the step seven
Figure BDA0002829034020000137
The calculation method of (2) is as follows:
from the above analysis, we can calculate the matching degree of the status and the return, as follows:
Figure BDA0002829034020000138
Figure BDA0002829034020000139
in the formula, Z P And Z Q Indicating the control parameter, Z may be set without overflowing the normal value P And Z Q Is set to 1, i.e. Z P= Z Q=1 . Multiplying equation (4) by equation (5) may obtain the ith target sample
Figure BDA00028290340200001310
And source task S k The total degree of match between, i.e.
Figure BDA00028290340200001311
Thus, a likelihood ratio X between the source task and the target task can be obtained k
Figure BDA00028290340200001312
In the formula, P (S) k ) Is a model S k A priori of (a) and Z X Is a control parameter of the possibility. Z X And Z P And Z O With the same effect, Z can be adjusted without overflowing the usual values X Is set to 1, i.e. Z X1 . We can normalize the likelihood ratios of all active tasks
Figure BDA00028290340200001313
To obtain a source task S k Task similarity with T, while also requiring migration from M source tasks (n) S -n T ) One sample is used to supplement the target sample.
In step eight, update the Q value using the palm-based Q learning mechanism, add new tool wear data to the target sample set, and then scroll the time window forward to discard the oldest samples.
Specifically, the Q learning mechanism based on the palm in step eight is as follows:
in this disclosure, WELM is used to approximate a reinforcement learning Q-value function.
Suppose s l =[s l1 ,s l2 ,...,s lm ] T ∈R m Represents a system state having m dimensions, and a l e.R represents the action of a Q learning agent, and the input vector of WELM is a state-action pair x l =(s l ,a l ) T =[s l1 ,s l2 ,...,s lm ,a l ] T ∈R m+1 And the output of the WELM is estimated to correspond to(s) l ,a l ) Is given by L =1, 2.. And L is the index of the training sample. Alpha (alpha) ("alpha") 1 =[α 11 ,α 12 ,..,α 1(m+1 )]And ω = [ ω ] 1 ,ω 2 ,...,ω L ] T Is the input and output weight vector, β = [ beta ] 1 ,β 2 ,...,β L ] T Is the deviation of the hidden layer node. In order to ensure the learning efficiency of the wellm, the number of hidden layer nodes should be equal to the number of training samples, i.e. the number of hidden layer nodes is L.
Specifically, the architecture of the multi-source migration learning algorithm in step eight is as follows, as shown in fig. 2. Task space Γ = (S, T) = (S) for multi-source migration reinforcement learning in this disclosure 1 ,...,S k ,..,S M T), the formula includes M source tasks and an unknown target task T. Each source task containing n s A sample and the target task contains n T And (4) sampling. On the one hand, the cost of collecting the sample is high, and n T The sample does not train task T well. On the other hand, the cost of using M source tasks and extracting the required samples is small compared to resampling from the current environment. Therefore, we apply a Knowledge Transfer (KT) technique to the WELM-based Q learning to improve the learning speed of the target task T. The mapping of KT is described as follows:
Figure BDA0002829034020000141
in the formula
Figure BDA0002829034020000142
And
Figure BDA0002829034020000143
the sample sets corresponding to the source task and the target task are represented, respectively, and Λ represents the optimal transmission sample set.
In the ninth step, a state equation and an observation equation of the particle filter model are constructed.
In step ten, the tool wear state of the target task is virtually detected.
Embodiments of the present disclosure also provide an intelligent machining tool wear prediction system based on multi-source sample migration reinforcement learning, the machinable tool wear prediction system including a memory and a processor, the memory having instructions stored thereon that, when executed by the processor, implement the method as described above.
The foregoing detailed description has set forth numerous embodiments of the present disclosure via the use of diagrams, block diagrams, flowcharts, and/or examples. Insofar as such schematics, flowcharts, and/or examples contain one or more functions and/or operations, it will be understood by those within the art that each function and/or operation within such schematics, flowcharts, or examples can be implemented, individually and/or collectively, by a wide range of structures, hardware, software, firmware, or virtually any combination thereof. In one embodiment, portions of the subject matter described in embodiments of the present disclosure may be implemented by Application Specific Integrated Circuits (ASICs), field Programmable Gate Arrays (FPGAs), digital Signal Processors (DSPs), or other integrated circuits. However, those skilled in the art will recognize that some aspects of the embodiments disclosed herein, in whole or in part, can be equivalently implemented in integrated circuits, as one or more computer programs running on one or more computers (e.g., as one or more programs running on one or more computer systems), as one or more programs running on one or more processors (e.g., as one or more programs running on one or more microprocessors), as firmware, or as virtually any combination thereof, and that designing the circuitry and/or writing the code for the software and or firmware would be well within the skill of one of skill in the art in light of this disclosure. In addition, those skilled in the art will appreciate that the mechanisms of the subject matter described herein are capable of being distributed as a program product in a variety of forms, and that an illustrative embodiment of the subject matter described herein applies regardless of the particular type of signal bearing media used to actually carry out the distribution. Examples of signal bearing media include, but are not limited to: recordable type media such as floppy disks, hard disk drives, compact Disks (CDs), digital Versatile Disks (DVDs), digital magnetic tape, computer memory, etc.; and a transmission type medium such as a digital and/or an analog communication medium (e.g., a fiber optic cable, a waveguide, a wired communications link, a wireless communication link, etc.).
While the foregoing specification illustrates and describes the practice of the disclosure, it is to be understood that the disclosure is not limited to the forms disclosed herein, but is not intended to be exhaustive or to limit the disclosure to other embodiments, and may be used in various other combinations, modifications, and environments and is capable of changes within the scope of the inventive concept as described herein, commensurate with the above teachings, or the skill or knowledge of the relevant art. And that modifications and variations may be effected by those skilled in the art without departing from the spirit and scope of the disclosure, which is to be protected by the following claims.

Claims (7)

1. An intelligent machining cutter wear prediction method based on multi-source sample migration reinforcement learning is characterized by comprising the following steps:
the method comprises the following steps: constructing an intelligent machining cutter wear prediction system based on multi-source migration reinforcement learning, wherein the intelligent machining cutter wear prediction system comprises cutter state virtual detection objects, and the cutter state virtual detection objects are different cutters;
step two: acquiring a plurality of source tasks and a target task according to different wear curves of several cutters;
step three: initializing model parameters and maximum iteration time;
step four: detecting the wear state of the current cutter, performing feature extraction and dimension reduction on wear data, constructing a training sample set of a weighted extreme learning machine, and training the weighted extreme learning machine;
step five: executing a machining action, observing the wear state of the current cutter, simultaneously observing the next wear state and return of the cutter, and calculating the state similarity and return similarity between each source task and each target task;
step six: calculating the probability that each sample in the source task belongs to the target sample set, and performing descending order on the samples of each cutter abrasion source task based on the probability;
step seven: acquiring task similarity, and transferring a fixed number of samples from each source sample set to a target sample set;
step eight: updating the Q value by using a Q learning mechanism based on a weighted extreme learning machine, adding new tool wear data to a target sample set, and then rolling a time window forward to discard the oldest sample;
step nine: constructing a state equation and an observation equation of a particle filter model; and
step ten: the wear state of the cutter for intelligent processing is predicted,
in the fourth step, the wear state of the current cutter is detected by using an online and offline measurement method, and the wear data is subjected to feature extraction and dimension reduction and then used for constructing a training sample set.
2. The intelligent machining tool wear prediction method of claim 1, wherein in the fifth step, the similarity between each sample of the source task and the target task is quantified by using the obtained state similarity and the return similarity; in the sixth step, the state similarity and the return similarity are used for calculating the probability that each sample in the source task belongs to the target sample set.
3. The method for predicting the wear of the intelligent machining tool according to claim 1, wherein in the sixth step, the probability that each sample in the source task belongs to the target sample set is used as a sample migration weight, and the value determines the possibility of sample migration.
4. The intelligent machining tool wear prediction method of claim 1 wherein in step seven, task similarity is calculated according to bayesian probability analysis theory, this value being used to determine the number of samples to migrate from each source task to the target task set.
5. The intelligent machining tool wear prediction method of claim 1 wherein in step eight, the maximum Q value or a random Q value is selected with different probabilities to update the Q value.
6. The intelligent machining tool wear prediction method of claim 1, wherein in step eight, the rolling time window is used to update the target sample set to avoid too large a sample set resulting in too slow a learning rate of the weighted extreme learning machine.
7. An intelligent machining tool wear prediction system based on multi-source sample migration reinforcement learning, characterized in that the machinable tool wear prediction system comprises a memory and a processor, the memory having stored thereon instructions that, when executed by the processor, implement the method according to any one of claims 1-6.
CN202011462218.0A 2020-12-10 2020-12-10 Intelligent machining cutter wear prediction method based on multi-source sample migration reinforcement learning Active CN112518425B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011462218.0A CN112518425B (en) 2020-12-10 2020-12-10 Intelligent machining cutter wear prediction method based on multi-source sample migration reinforcement learning

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011462218.0A CN112518425B (en) 2020-12-10 2020-12-10 Intelligent machining cutter wear prediction method based on multi-source sample migration reinforcement learning

Publications (2)

Publication Number Publication Date
CN112518425A CN112518425A (en) 2021-03-19
CN112518425B true CN112518425B (en) 2022-10-04

Family

ID=74999346

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011462218.0A Active CN112518425B (en) 2020-12-10 2020-12-10 Intelligent machining cutter wear prediction method based on multi-source sample migration reinforcement learning

Country Status (1)

Country Link
CN (1) CN112518425B (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113103068B (en) * 2021-04-19 2022-03-29 大连理工大学 Cutter state monitoring method based on deep migration learning
CN115365890A (en) * 2022-09-23 2022-11-22 深圳职业技术学院 Method and device for online predicting tool wear value, intelligent terminal and storage medium
CN115351601A (en) * 2022-09-29 2022-11-18 哈尔滨工业大学 Tool wear monitoring method based on transfer learning

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8725456B1 (en) * 2009-05-05 2014-05-13 The United States of America as represented by the Administrator of the National Aeronautics & Space Administration (NASA) Decomposition technique for remaining useful life prediction
CN111639461A (en) * 2020-05-26 2020-09-08 浙江大学 Tool wear state detection method aiming at industrial unbalanced data
CN111782624A (en) * 2020-06-16 2020-10-16 中译语通科技(青岛)有限公司 Machine tool cutter vibration monitoring and analyzing method based on artificial intelligence and big data

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101055563A (en) * 2007-05-21 2007-10-17 北京理工大学 Particle filter method based on multiple advice distribution
CN103955750B (en) * 2014-04-04 2017-02-15 西安交通大学 Rolling bearing remaining life prediction method based on feature fusion and particle filtering
US10139311B2 (en) * 2014-09-26 2018-11-27 Palo Alto Research Center Incorporated Computer-implemented method and system for machine tool damage assessment, prediction, and planning in manufacturing shop floor
CN104792529A (en) * 2015-04-12 2015-07-22 北京化工大学 Rolling bearing life prediction method based on state-space model
CN109108730A (en) * 2018-06-01 2019-01-01 深圳精匠云创科技有限公司 Detection system, detection method and storage equipment
CN109048492B (en) * 2018-07-30 2020-04-21 北京航空航天大学 Tool wear state detection method, device and equipment based on convolutional neural network
CN109212966B (en) * 2018-08-14 2020-04-10 西安交通大学 Multi-working-condition dynamic benchmarking mechanical equipment residual life prediction method
CN109753923A (en) * 2018-12-29 2019-05-14 晋西车轴股份有限公司 Monitoring method, system, equipment and the computer readable storage medium of tool abrasion

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8725456B1 (en) * 2009-05-05 2014-05-13 The United States of America as represented by the Administrator of the National Aeronautics & Space Administration (NASA) Decomposition technique for remaining useful life prediction
CN111639461A (en) * 2020-05-26 2020-09-08 浙江大学 Tool wear state detection method aiming at industrial unbalanced data
CN111782624A (en) * 2020-06-16 2020-10-16 中译语通科技(青岛)有限公司 Machine tool cutter vibration monitoring and analyzing method based on artificial intelligence and big data

Also Published As

Publication number Publication date
CN112518425A (en) 2021-03-19

Similar Documents

Publication Publication Date Title
CN112518425B (en) Intelligent machining cutter wear prediction method based on multi-source sample migration reinforcement learning
Chen et al. Predicting tool wear with multi-sensor data using deep belief networks
CN110153802B (en) Tool wear state identification method based on convolution neural network and long-term and short-term memory neural network combined model
Madhusudana et al. Condition monitoring of face milling tool using K-star algorithm and histogram features of vibration signal
CN111562108A (en) Rolling bearing intelligent fault diagnosis method based on CNN and FCMC
CN110561191B (en) Numerical control machine tool cutter abrasion data processing method based on PCA and self-encoder
CN109597315B (en) Method, equipment and system for identifying health degradation state of mechanical equipment
CN111178553A (en) Industrial equipment health trend analysis method and system based on ARIMA and LSTM algorithms
Liang et al. Data-driven anomaly diagnosis for machining processes
CN114619292B (en) Milling cutter wear monitoring method based on fusion of wavelet denoising and attention mechanism with GRU network
CN113255848A (en) Water turbine cavitation sound signal identification method based on big data learning
CN113496312A (en) Cutter residual life prediction method and equipment based on multi-dimensional feature extraction fusion and long-term and short-term memory network and storage medium
CN113798920B (en) Cutter wear state monitoring method based on variational automatic encoder and extreme learning machine
CN113780153A (en) Cutter wear monitoring and predicting method
Qin et al. Tool wear identification and prediction method based on stack sparse self-coding network
Zhang et al. A novel data-driven method based on sample reliability assessment and improved CNN for machinery fault diagnosis with non-ideal data
CN112207631B (en) Method for generating tool detection model, method, system, device and medium for detecting tool detection model
CN112650146A (en) Fault diagnosis optimization method, system and equipment of numerical control machine tool under multiple working conditions
CN111126255A (en) Numerical control machine tool cutter wear value prediction method based on deep learning regression algorithm
Ge et al. An improved PF remaining useful life prediction method based on quantum genetics and LSTM
CN113467375A (en) Method for predicting remaining usable life of intelligent numerical control machining tool based on virtual measurement
CN116871978A (en) Drilling tool state monitoring method based on multi-sensor fusion
Zegarra et al. Comparison of CNN and CNN-LSTM architectures for tool wear estimation
Tian et al. Prediction of surface roughness using fuzzy broad learning system based on feature selection
CN114354184A (en) Deep learning-based method and device for establishing health early warning model of main shaft of large-scale rotating equipment

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant