WO2020224433A1 - 基于机器学习的目标对象属性预测方法及相关设备 - Google Patents
基于机器学习的目标对象属性预测方法及相关设备 Download PDFInfo
- Publication number
- WO2020224433A1 WO2020224433A1 PCT/CN2020/086007 CN2020086007W WO2020224433A1 WO 2020224433 A1 WO2020224433 A1 WO 2020224433A1 CN 2020086007 W CN2020086007 W CN 2020086007W WO 2020224433 A1 WO2020224433 A1 WO 2020224433A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- feature
- target object
- detection
- neural network
- regularity
- Prior art date
Links
Images
Classifications
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16H—HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
- G16H50/00—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
- G16H50/50—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for simulation or modelling of medical disorders
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16H—HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
- G16H50/00—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
- G16H50/20—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for computer-aided diagnosis, e.g. based on medical expert systems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/042—Knowledge-based neural networks; Logical representations of neural networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/044—Recurrent networks, e.g. Hopfield networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16H—HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
- G16H10/00—ICT specially adapted for the handling or processing of patient-related medical or healthcare data
- G16H10/40—ICT specially adapted for the handling or processing of patient-related medical or healthcare data for data related to laboratory analysis, e.g. patient specimen analysis
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16H—HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
- G16H50/00—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
- G16H50/70—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for mining of medical data, e.g. analysing previous cases of other patients
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/048—Activation functions
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computing arrangements using knowledge-based models
- G06N5/04—Inference or reasoning models
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16H—HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
- G16H10/00—ICT specially adapted for the handling or processing of patient-related medical or healthcare data
- G16H10/60—ICT specially adapted for the handling or processing of patient-related medical or healthcare data for patient-specific data, e.g. for electronic patient records
Definitions
- This application relates to the field of data prediction technology, in particular to target object attribute prediction based on machine learning.
- EHR (electronic health records) data can record every consultation record of the target object.
- more and more clinical diagnosis estimation models can simulate the diagnosis process of doctors based on the EHR data of patients. To predict the future incidence of users.
- the process of predicting the future morbidity of a user can be: taking the medical coding data in the EHR data as the attributes of the patient and inputting it into the clinical diagnosis estimation model.
- the clinical diagnosis estimation model trains the medical coding data. Output the predicted diagnosis result.
- the clinical diagnosis estimation model carries out the process of training the medical coding data, which can characterize the clinical diagnosis estimation model to simulate the diagnosis process of the doctor, so that the subsequent diagnosis results predicted by the clinical diagnosis estimation model can be used to evaluate the patient’s future incidence. prediction.
- the embodiments of the present application provide a method and related equipment for predicting the attributes of a target object based on machine learning, which can solve the problem of low accuracy of the diagnosis result predicted by the clinical diagnosis estimation model.
- the technical solution is as follows:
- a method for predicting the attributes of a target object based on machine learning is provided, which is executed by a computer device, and the method includes:
- the first neural network outputs a first regular feature and a second regular feature different from the first regular feature after two different time sequence calculations, where all The first regular feature represents the historical change law of the detection feature, and the second regular feature represents the future change law of the detection feature;
- the second neural network extracts and outputs at least one local feature of the target object from the global feature
- a device for predicting attributes of a target object based on machine learning includes:
- An acquisition module configured to determine the detection characteristics of the target object according to the detection data of the target object and the attributes corresponding to the detection data
- the calculation module is used to input the detection features into the first neural network; for the detection features in each time sequence of the detection features, the first neural network outputs the first regular feature and different features after two different time sequence calculations In the second regularity feature of the first regularity feature, wherein the first regularity feature represents the historical change law of the detection feature, and the second regularity feature represents the future change law of the detection feature;
- the acquisition module is further configured to determine the global characteristics of the target object based on the first regular feature and the second regular feature;
- An extraction module configured to input the global feature into a second neural network; the second neural network extracts and outputs at least one local feature of the target object from the global feature;
- the prediction module is configured to predict the attributes of the target object based on at least one local feature of the target object.
- a computer device includes: a processor; a memory for storing a computer program; wherein the processor is used for executing the computer program stored on the memory to realize the above-mentioned target object based on machine learning The operation performed by the attribute prediction method.
- a computer-readable storage medium stores a computer program, which when executed by a processor, realizes the operations performed by the above-mentioned method for predicting the attributes of a target object based on machine learning.
- a computer program product including instructions, which when run on a computer, causes the computer to perform the operations performed by the above-mentioned method for predicting the attributes of a target object based on machine learning.
- the global feature of the target object based on the regular features that represent the history of the detection feature and the changing law in the future, and refine the global feature to obtain at least one local feature of the target object. Then, the refined local feature is more
- the characteristics of the target object can be reflected, and the attributes of the target object can be predicted based on local features. Therefore, the accuracy of the predicted attributes can be improved.
- the attributes of the target object are the predicted diagnosis results, the accuracy of the predicted diagnosis results can be improved.
- FIG. 1 is a schematic diagram of an implementation environment provided by an embodiment of the present application.
- FIG. 2 is a flowchart of a method for predicting attributes of a target object based on machine learning provided by an embodiment of the present application
- FIG. 3 is a schematic diagram of a diagnosis estimation model provided by an embodiment of the present application.
- FIG. 4 is a schematic structural diagram of a device for predicting attributes of a target object based on machine learning according to an embodiment of the present application
- Fig. 5 is a schematic structural diagram of a computer device provided by an embodiment of the present application.
- the process of predicting the future morbidity of the user can be: taking the medical coding data in the EHR data as the attributes of the patient and inputting it into the clinical diagnosis estimation model, and the clinical diagnosis estimation model trains the medical coding data, The predicted diagnosis result can be output.
- the process of clinical diagnosis estimation model training on medical coding data can represent the clinical diagnosis estimation model simulating the doctor's diagnosis process, so that the subsequent diagnosis results predicted by the trained clinical diagnosis estimation model can be used to determine the patient's future disease The situation is predicted.
- the input of the clinical diagnosis estimation model is the medical code data. Since the medical code data includes data for thousands of diseases, it is very difficult for one patient. In other words, it is only possible to suffer from one or several diseases, and it is not possible to suffer from various diseases. Therefore, the useful data in the medical coding data is distributed relatively sparsely and discretely in the medical coding data. The coded data can only indicate the disease that the patient suffers, but not the overall physical state of the patient. Then, after the clinical diagnosis estimation model is trained with such medical coded data, the accuracy of the output predicted diagnosis result is low. The predicted diagnosis result determines the patient’s future morbidity is not accurate.
- the present application provides a method for predicting the attributes of a target object based on machine learning.
- the method specifically includes: determining the detection characteristics of the target object according to the detection data of the target object and the attributes corresponding to the detection data;
- the detection features are input into the first neural network, and for the detection features in each time sequence in the detection features, the first neural network outputs the first regular feature and the second regular feature different from the first regular feature after two different sequence calculations.
- the first law feature represents the historical change law of the detection feature
- the second law feature represents the future change law of the detection feature
- the global feature of the target object is determined; the global feature is input to the second neural network ,
- the second neural network extracts and outputs at least one local feature of the target object from the global feature; and predicts the attribute of the target object based on the at least one local feature of the target object.
- the global characteristics of the target object are determined based on regular features that represent the history of detection features and the law of future changes, and the global features are refined to Obtain at least one local feature of the target object, then the refined local features can better reflect the characteristics of the target object, and then predict the attributes of the target object based on the local features. Therefore, the accuracy of the predicted attributes can be improved. When it is a predicted diagnosis result, the accuracy of the predicted diagnosis result can be improved.
- Machine Learning is a multi-field interdisciplinary subject, involving probability theory, statistics, approximation theory, convex analysis, algorithm complexity theory and other subjects. Specializing in the study of how computers simulate or realize human learning behaviors in order to acquire new knowledge or skills, and reorganize the existing knowledge structure to continuously improve its own performance.
- Machine learning is the core of artificial intelligence, the fundamental way to make computers intelligent, and its applications cover all fields of artificial intelligence.
- Machine learning and deep learning usually include artificial neural networks, belief networks, reinforcement learning, transfer learning, inductive learning, and teaching learning techniques.
- AI Artificial Intelligence
- artificial intelligence is a theory, method, technology and application system that uses digital computers or machines controlled by digital computers to simulate, extend and expand human intelligence, perceive the environment, acquire knowledge, and use knowledge to obtain the best results.
- artificial intelligence is a comprehensive technology of computer science, which attempts to understand the essence of intelligence and produce a new kind of intelligent machine that can react in a similar way to human intelligence.
- Artificial intelligence is to study the design principles and implementation methods of various intelligent machines, so that the machines have the functions of perception, reasoning and decision-making.
- Artificial intelligence technology is a comprehensive discipline, covering a wide range of fields, including both hardware-level technology and software-level technology.
- Basic artificial intelligence technologies generally include technologies such as sensors, dedicated artificial intelligence chips, cloud computing, distributed storage, big data processing technologies, operation/interaction systems, and mechatronics.
- Artificial intelligence software technology mainly includes computer vision technology, speech processing technology, natural language processing technology, and machine learning/deep learning.
- the method for predicting the attributes of a target object based on machine learning relates to artificial intelligence, and particularly relates to machine learning in artificial intelligence.
- the method for predicting the attributes of target objects based on machine learning can be applied to computer devices that can perform data processing, such as terminal devices, servers, etc.; wherein the terminal devices can specifically be smart phones, computers, and personal digital devices.
- Assistant Personal Digital Assitant, PDA
- the server can be an application server or a web server. In actual deployment, the server can be an independent server or a cluster server.
- the terminal device can directly perform the attribute prediction of the target object according to the detection data of the target object input by the user and the attributes corresponding to the detection data. Forecast and display the forecast results for users to view.
- the server first predicts the attributes of the target object according to the detection data of the target object uploaded by the terminal device and the attributes corresponding to the detection data. Obtain the prediction result; then send the prediction result to the terminal device so that the terminal device can display the received prediction result for the user to view.
- the following is an example of applying the method for predicting the attributes of a target object based on machine learning provided by the embodiments of the present application to a terminal device.
- the applicable application scenarios are exemplified.
- the application scenario includes: a terminal device and a user; wherein the terminal device is used to execute the machine learning-based
- the target object attribute prediction method is to predict the attribute of the target object to obtain the prediction result for the user to view.
- the terminal device After the terminal device receives the attribute prediction instruction triggered by the user, the terminal device can determine the detection characteristics of the target object according to the detection data of the target object and the attributes corresponding to the detection data; input the detection characteristics into the first neural network, and target each of the detection characteristics. Time-series detection features, the first neural network outputs the first regular feature and the second regular feature different from the first regular feature after two different time-series calculations.
- the first regular feature represents the historical change rule of the detection feature
- the second The regular feature represents the future change law of the detection feature
- the global feature of the target object is determined; the global feature is input to the second neural network, and the second neural network extracts and outputs the target object’s At least one local feature; based on at least one local feature of the target object, predict the attribute of the target object to obtain a prediction result, so that the terminal device can display the prediction result to the user.
- the method for predicting attributes of a target object based on machine learning provided in the embodiments of the present application can also be applied to a server.
- the application scenario includes: a server, a terminal device, and a user.
- the terminal device After the terminal device receives the attribute prediction instruction triggered by the user, the terminal device generates an attribute prediction request according to the attribute prediction instruction, and sends the attribute prediction request to the server, so that the server can receive the attribute prediction request sent by the terminal device Then, according to the detection data of the target object and the corresponding attributes of the detection data, the detection feature of the target object is determined; the detection feature is input into the first neural network, and the first neural network adopts two different types of detection features for each time sequence in the detection feature. After calculating the time series, output the first law feature and the second law feature different from the first law feature.
- the first law feature represents the historical change law of the detection feature
- the second law feature represents the future change law of the detection feature
- the feature and the second regular feature determine the global feature of the target object; the global feature is input to the second neural network, and the second neural network extracts and outputs at least one local feature of the target object from the global feature; based on the at least one local feature of the target object,
- the prediction result is obtained by predicting the attributes of the target object, so that the server can feed back the obtained prediction result to the terminal device, so that the user can view the prediction result on the terminal device.
- Fig. 1 is a schematic diagram of an implementation environment provided by an embodiment of the present application.
- the environment includes a target object attribute prediction system 100 based on machine learning.
- the target object attribute prediction system based on machine learning includes a preprocessing module 101, a detection feature extraction module 102, a regular feature extraction module 103, and a prediction module 104.
- the preprocessing module 101 is used to process the detection data of the target object and the attributes corresponding to the detection data, and specifically convert the detection data of the user and the attributes corresponding to the detection data into data that can be calculated by the detection feature extraction module 102.
- the detection feature extraction module 102 is configured to extract a mixed feature of a feature of the detection data and an attribute corresponding to the detection data, and the extracted mixed feature can be used as a target object detection feature. Specifically, the detection feature extraction module 102 may first extract the features of the attributes and the features of the detection data based on the data processed by the preprocessing module 101, then splice the features of the extracted attributes and the features of the detection data, and finally, the detection The feature extraction module 102 extracts detection features based on the splicing result.
- the regular feature extraction module 103 is used to extract regular features and generate global features of the target object.
- the regular features are used to represent the global change law of the detection feature.
- the regular feature extraction module 103 may first extract based on the detection feature extraction module 102 extracting Then, the law feature extraction module 103 can obtain the global change law of the detection feature based on the historical change law and future change law of the detection feature, and finally according to the law feature representing the global change law , To determine the global characteristics of the target object.
- the prediction module 104 is used to predict the attributes of the target object. Specifically, the prediction module 104 can refine the global features generated by the law feature extraction module 103 through a neural network to obtain the local features of the target object, and then the prediction module 104 uses The target local features are expressed in a concentrated expression of the acquired multiple local features. Finally, the prediction module 104 predicts the attributes of the target object based on the target local features.
- each module in the target object attribute prediction system 100 based on machine learning can be implemented by one computer device or multiple computer devices.
- the embodiments of the present application implement the functions of each module.
- the number of computer equipment is not specifically limited.
- Figure 1 introduces the respective functions of each module in the target object attribute prediction system based on machine learning.
- Figure 2 which is an embodiment of the present application.
- the computer device determines the detection feature of the target object according to the detection data of the target object and the attributes corresponding to the detection data.
- the computer device can be any computer device.
- the target object can be any user.
- the detection data of the target object may include the detection data of each detection of the target object in the historical time period.
- the detection data of each detection corresponds to a detection time. Therefore, the detection data of each detection may include the detection data related to the target object.
- Various types of data taking the detection of the physical signs of the target object as an example, the data detected at one time, heartbeat data, blood pressure data, and other types of data, for any type of data, each detection may detect a lot of data. Multiple data detected at one time can form a timing sequence related to the detection time of this time. Therefore, a detection time may correspond to multiple timing sequences, and each type of timing sequence can be labeled to distinguish between a detection time. Many types of inspection data.
- the embodiment of the present application does not specifically limit the detection time, and does not specifically limit the time interval between two detection times.
- the embodiment of the present application does not limit the aforementioned historical time period, and the aforementioned historical time period may refer to any time period before the computer device uses the attribute prediction method to predict the attribute of the target object.
- the detection data when the detection data is the physical sign data of the target object, the detection data can be the time series data stored in the EHR data.
- the time series data includes the time of each consultation of the target object and the time of each consultation.
- the consultation time is also the detection time.
- the above attribute is used to indicate at least one state of the target object.
- Each position of the attribute corresponds to a state.
- the state identifier can be used to indicate whether the target object has a corresponding state.
- the state identifier may include a first state identifier and a second state identifier. , Where the first state identifier is used to indicate that the target object has a corresponding state, and the second state identifier is used to indicate that the target object does not have a corresponding state, for example, when any position within the attribute has the first state identifier , It means that the target object has a state corresponding to the position. When any position in the attribute has a second state identifier, it means that the target object does not have a state corresponding to the position.
- different character strings may be used to represent the first status identifier and the second status identifier, and the embodiment of the present application does not specifically limit the character string representing the first status identifier or the second status identifier.
- the current detection data will be obtained. Based on the current detection data, the attributes of the target object can be determined, and the detection time corresponding to the determined target object’s attributes is the corresponding detection data of this time The inspection time. It can be seen that in the embodiment of the present application, one attribute corresponds to one detection time, and if the detection data of the target object includes at least one detection time, the detection data of the target object corresponds to at least one attribute.
- the aforementioned at least one state may be the diseased state of the target object, or other states, and the embodiment of the present application does not specifically limit the at least one state.
- the attribute corresponding to the detection data can be medical coding data.
- a medical coding data can consist of 0 and 1. Each position in the medical coding data corresponds to a disease. When the data in a position is 0, it means that the target object does not have the disease corresponding to that position. When the data in any position is 1, it means that the target object has the disease corresponding to the position. It is understandable that here 0 is equivalent to the second state identifier, and 1 here is equivalent to the first state identifier.
- the computer device can first preprocess the detection data of the target object and the attributes corresponding to the detection data, so that the detection data and the attributes corresponding to the detection data conform to the format required for subsequent calculations, and then perform the processing on the processed data Feature extraction to obtain the detection feature of the target object.
- this step 201 can be implemented through the process shown in the following steps 2011-2014.
- Step 2011 The computer device inputs the attribute corresponding to the detection data into the fully connected neural network, filters out the target state in the attribute through the fully connected neural network, performs weighting processing on the target state, and outputs the attribute corresponding to the detection data feature.
- the target object Since multiple states are stored in the attributes of the target object, the target object has some states in the attributes, and the state of the target object is regarded as the target state.
- the computer equipment preprocesses the attributes corresponding to the detection data.
- the computer equipment can use the mulit-hot (multi-hot) vector to express the attributes corresponding to the detection data.
- the mulit-hot vector is composed of 0 and 1. 0 represents that the target object does not have a corresponding state, and 1 represents that the target object has a corresponding state.
- the computer device can input the mulit-hot vector into the fully connected neural network, and the fully connected network filters out the target state of the target object through the coding matrix, and weights the selected target state, and outputs the attributes corresponding to the detection data Characteristics.
- this application performs weighting processing on the filtered target states, so that the processed results can concentrate the characteristics of the attributes corresponding to the mulit-hot vector.
- W T is the coding feature matrix
- the W T can be a pre-trained matrix, or a matrix trained in the process of calculating the attributes of the full neural network
- j is the number of the jth detection time
- x j Is the mulit-hot vector of the attribute corresponding to the j-th detection time, that is, x j represents the attribute mulit-hot vector corresponding to the j-th detection time
- ⁇ j is the feature vector of the attribute corresponding to the j-th detection time, this feature
- the vector represents the feature of the attribute corresponding to the jth detection, j is an integer greater than or equal to 1
- b ⁇ represents the deviation parameter
- b ⁇ can be a pre-trained parameter, or in the process of calculating the attribute
- the computer device uses the mulit-hot vector of the attribute to represent the attribute, the dimensions of the mulit-hot vector may be more high.
- the dimension of the feature ⁇ of the attribute can be reduced compared to the dimension of the mulit-hot vector, so that the process shown in step 2011 can also be regarded as a dimensionality reduction process. Conducive to subsequent calculations.
- Step 2012 The computer device inputs the detection data into a time series analysis tool, extracts the characteristics of each type of data in the detection data at each time series through the time series analysis tool, and outputs a feature set.
- the time series analysis tool can be the HCTSA (highly comparative time-series) code base.
- the characteristics of each type of data in each time series can include the data distribution, entropy, and scaling attributes of this type of data. It can be seen that these features can represent the autocorrelation structure of this type of data, and because these features are based on actual detection Data is obtained, so these characteristics are interpretable.
- the time series analysis tool can extract the characteristics of each type of data in the jth detection data based on the preset feature extraction rules among them, Represents the timing characteristics of the k-th data type during the j-th detection, z is a positive integer, k is the number of the data type, j is the number of the j-th detection, that is, the number of the j-th detection time, and j is a positive integer , 1 ⁇ j ⁇ J, J represents the total number of detections of the target object. It should be noted that the embodiment of the present application does not specifically limit the preset feature extraction rule.
- the time series analysis tool stores the extracted features of each detection data in a feature set Finally, the time series analysis tool can output the feature set, so that the computer device can obtain the feature set.
- the computer device Since the features in the feature set can only represent the autocorrelation structure of these various types of data, and cannot reflect the features of all detection data, the computer device also needs to perform the following step 2013 to obtain the features of the detection data.
- step 2013, the computer device inputs the feature set into the deep cross network, and cross-processes each time sequence feature in the feature set through the deep cross neural network, and outputs the feature of the detection data.
- DCN deep&cross network, deep cross network
- the computing device can input the feature set into the cross network and the deep network respectively.
- multiple timing features in the feature set can be crossed and output
- the common features of all features in the feature set are extracted through the deep network.
- the crossed features output by DCN are combined with the common features extracted by the deep network to obtain the features of the detection data.
- the embodiment of the present application does not specifically limit the execution order of steps 2011-203.
- the computing device may perform step 2011 first, and then perform steps 2012 and 2013; it may also perform steps 2012 and 2013 first, and then perform step 2011; it may also perform steps 2012 and 2013 while performing step 2011.
- Step 2014 The computer device inputs the feature of the attribute and the feature of the detection data into a deep neural network, extracts the mixed feature of the detection data and the attribute corresponding to the detection data through the deep neural network, and outputs the detection feature.
- the computer device can first splice the characteristics of the attribute and the characteristics of the detection data to obtain a spliced feature, and then input the merged feature into the deep nerve.
- Each node in the deep neural network can calculate the data in ⁇ j according to the following second formula, so that The deep neural network can output the detection feature ⁇ j at the jth detection time, and the second formula can be expressed as:
- W x is the first weight matrix
- each weight of W x is used to represent the importance of each element in ⁇ j , through ⁇ j can be achieved in each element of the weighting process, further elements may be integrated in the ⁇ j, ReLU (rectified linear unit rectifying linear) function can be related to data mining preferably between features, thus, strong expression function RELU , Use the ReLU function to After processing, the processed result ⁇ j can express the characteristics of ⁇ j . Therefore, ⁇ j can be used as the detection feature of the jth detection.
- ReLU rectifified linear unit rectifying linear
- the computer device can then extract the detection characteristics at each detection time through the deep neural network.
- each detection time The above detection features are called sub-detection features. Therefore, the detection feature finally output by the deep neural network includes at least one sub-detection feature.
- the detection feature Since the characteristics of the detection data and the attributes of the attributes include the characteristics of various types of data in various time series, the detection feature has a multi-modality, and the detection feature can be regarded as a multi-modal feature.
- the detection feature in the embodiment of the present application can better reflect the feature in the detection process of the target object; and,
- the detection data is the actual detected data, which can be used as an objective basis to make the acquired detection features interpretable.
- the attributes corresponding to the detection data are the result of subjective judgment. Therefore, based on the attributes and detection data, the acquired detection features are The accuracy is high.
- FIG. 3 is a schematic diagram of a diagnostic estimation model provided by an embodiment of the present application, in the multi-modal feature extraction part.
- the medical coding data of the computer equipment that is, the attributes corresponding to the detection data
- the mulit-hot vector is input to the fully connected neural network
- the fully connected neural network can output the medical coding data through calculation
- the feature that is, the feature of the attribute
- the feature process of the fully connected neural network that can output the medical code data through calculation is the process of embedding the medical code data.
- the computer equipment performs feature extraction on the time series data (that is, the detection data) to obtain a feature set.
- the computer equipment inputs the feature set to DCN, and DCN outputs cross-multiple time series feature mixture, that is, It is the feature of the detection data.
- the computer equipment mixes the multi-time-series cross-mixing feature with the attribute feature, and obtains the multi-modal feature (that is, the detection feature) based on the mixed feature.
- the computer device can also adopt other neural networks to obtain the characteristics of the detection data.
- the computer device inputs the detection feature into the first neural network, and for the detection features in each time sequence in the detection feature, the first neural network outputs the first regular feature and the feature different from the first regular feature after two different sequence calculations
- the second regularity feature of, the first regularity feature represents the historical change law of the detection feature
- the second regularity feature represents the future change law of the detection feature.
- the first neural network may be BiRNN (bidirectional recurrent neural networks) with an attention mechanism.
- the BiRNN may be composed of a first sub-network and a second sub-network, where the first sub-network Used to obtain the first regular feature, and the second sub-network is used to obtain the second regular feature.
- the computer device inputs the detection feature into the first sub-network of the first neural network according to the reverse sequence sequence, and performs reverse sequence calculation on the detection feature through the first sub-network, Obtain the first law feature; according to the forward sequence sequence, input the detection feature into the second sub-network of the first neural network, and perform forward sequence calculation on the detection feature through the first sub-network to obtain the second law feature.
- the computer device can input the detection feature into the first sub-network in a forward time sequence, and the computer device can input the detection feature into the first sub-network in a reverse time sequence.
- the detection feature is input into the second sub-network.
- the first sub-network can calculate each sub-feature in the detection feature based on the calculation rules preset in the first sub-network, and finally, the first sub-network
- the sub-network can output the first regular feature Among them, b is used to indicate the reverse direction, and the embodiment of the present application does not specifically limit the preset calculation rule in the first sub-network.
- the input layer of the node input specifically, the computer device inputs ⁇ 1 into the first node of the input layer of the first sub-network, and inputs ⁇ 2 into the second node of the input layer of the first sub-network,...( And so on).
- the second sub-network can calculate each sub-feature in the detection feature based on the calculation rules preset in the second sub-network, and finally, the second sub-network
- the sub-network can output the second regular feature
- f represents the forward direction
- the embodiment of the application does not specifically limit the preset calculation rule in the second sub-network.
- the computer device determines the global feature of the target object based on the first regularity feature and the second regularity feature.
- the computer device can first obtain the law feature representing the global change law of the detection feature based on the first law feature and the second law feature, and then according to This regular feature obtains the global feature.
- this step 203 can be implemented through the process shown in the following steps 2031-2033.
- Step 2031 the computer device splices the first regularity feature and the second regularity feature to obtain a third regularity feature.
- Step 2032 The computer device performs weighting processing on the third regularity feature to obtain a fourth regularity feature.
- the fourth regularity feature is used to represent the global change law of the detected feature.
- the computing device can perform weighting processing on the third regular feature through the attention mechanism in the first neural network.
- this step 2032 can be implemented through the process shown in steps 11-13.
- Step 11 The computer device performs weight learning based on the first attention mechanism and the third regularity feature to obtain at least one first weight, and the first weight is used to indicate that a piece of detection data corresponds to the piece of detection data The importance of the attribute.
- the first attention mechanism is any attention mechanism in the first neural network
- the computer device can perform weight learning based on the weight learning strategy in the first attention mechanism
- the weight learning strategy It can be a location-based attention weight learning strategy
- the location-based attention weight learning strategy can be expressed as: Among them, W ⁇ is the second weight vector, W ⁇ T is the transpose matrix of W ⁇ , b ⁇ is the second deviation parameter, It is the first weight corresponding to the jth detection.
- weight learning strategy in the first attention mechanism may also be other attention weight learning strategies, and the embodiment of the application does not specifically limit the weight learning strategy in the first attention mechanism.
- Step 12 The computer device performs normalization processing on the at least one first weight to obtain at least one second weight.
- the at least one first weight value in step 11 is a value obtained through mathematical calculation, the at least one first weight value may be too large or too small.
- the at least one first weight value may be normalized , So that the size of each second weight value obtained after processing is moderate, then, when the at least one first weight value is too large, the at least one first weight value can be reduced proportionally, when the at least one first weight value If the weight value is too small, the at least one first weight value may be proportionally enlarged to realize the normalization processing of the at least one first weight value.
- each second weight value is only the result of normalizing a first weight value
- the second weight value has the same function as the first weight value, and it is used to indicate that a detection data corresponds to the detection data. The importance of the attribute.
- Step 13 The computer device performs weighting processing on the third regularity feature based on at least one second weight value to obtain the fourth regularity feature.
- the computer device uses the at least one second weight Substituting the third formula into the third formula, the output of the third formula is used as the fourth rule feature to implement the weighting process for the third rule feature, the third formula can be expressed as Among them, c is the fourth law feature, It is the second weight corresponding to the jth detection, J is the total number of detections of the target object.
- the third law feature is weighted by at least one second weight, and the third law feature is more concentrated expression. Therefore, the fourth regular feature can represent the global change regularity of the detection feature.
- the first law feature and the second law feature are weighted, the first law feature and the second law feature can be integrated and represented by the fourth law feature, so the fourth law feature can represent the first law feature
- the historical change law of can also represent the future change law represented by the first law feature. Therefore, the fourth feature law can represent the global change law of the detection feature.
- Step 2033 The computer device determines the global feature based on the third regularity feature and the fourth regularity feature.
- the computer device may determine the last detection time of the third regular feature
- the corresponding regularity feature and the fourth regularity feature are substituted into the fourth formula, and the output of the fourth formula is taken as the global feature, where the fourth formula can be expressed as: among them, Is the global feature, h J is the regular feature corresponding to the last detection time of the target object in the third regular feature, [h J ,c] is the vector after h J and c are joined, W d is the third weight matrix, Is the transposed matrix of W d , b d is the third deviation parameter, and ReLU() refers to the linear rectification function.
- the fourth law feature represents the global change law of the detection feature
- the first law feature can represent the history change law of the detection feature
- the second law feature can represent the future change law of the detection feature. Therefore, the three regular features are weighted. , The result obtained can represent the global characteristics of the target object.
- the computer device inputs the global feature into a second neural network, and the second neural network extracts and outputs at least one local feature of the target object from the global feature.
- the second neural network may be HMCN (hierarchical multi-label classification networks), and the second neural network may extract local features from the input global features. Since the global features cannot represent the details of the target object, the details of the target object can be extracted through the second neural network. Specifically, the second neural network can extract the details of the target object level by level, so that the final extracted details can satisfy the requirements. Demand for attribute forecasting.
- HMCN hierarchical multi-label classification networks
- Each layer of the second neural network can output one local feature.
- the computing device inputs the global feature to the second neural network
- the global feature can be input from the input layer to the output layer of the second neural network.
- the second neural network can calculate the global feature layer by layer.
- it can calculate the hierarchical characteristics of the first target layer and the local characteristics of the target object in the first target layer based on the output data of the second target layer.
- the first target layer Is any layer of the second neural network
- the second target layer is the upper layer of the first target layer in the second neural network
- the level feature is used to indicate that the global feature is in the network layer of the second neural network
- the hierarchical characteristics of the first target layer are determined by the global characteristics and the hierarchical characteristics of the second target layer.
- the second target layer of the second neural network After the second target layer of the second neural network generates the hierarchical characteristics of the second target layer and the local characteristics of the target object in the second target layer, the second target layer can output the second target layer's Hierarchical features and the global features (output data of the second target layer), so that the first target layer can receive the hierarchical features of the second target layer and the global features. Then, following the global features output by each network layer of the second neural network, the global feature can be input to each network layer of the second neural network.
- the first target layer A target layer may calculate the hierarchical characteristics of the first target layer based on the hierarchical characteristics of the second target layer and the global characteristics.
- the hierarchical characteristics of the i-th layer of the second neural network It can be expressed as: Among them, G represents the overall situation, Is the fourth weight matrix, Is the hierarchical feature of layer i-1, Indicates the hierarchical characteristics of the i-th layer, b G is the fourth deviation parameter, and the i-th layer can be regarded as the first target layer.
- the nodes on the first target layer may obtain the local characteristics of the target object on the first target layer based on the hierarchical characteristics of the first target layer and the global characteristics.
- the local feature of the target object in the i-th layer It can be expressed as: Among them, L represents the network layer, Is the fifth weight matrix, b T is the fifth deviation parameter.
- each layer of the second neural network is calculated based on the hierarchical features and global features of the previous layer, the local features of the target object on each layer of the second neural network are affected by the local features of the previous layer.
- the influence of features since the level expression of each layer is determined by the level features of this layer, the local features generated by any network layer in the second neural network can be used as the parent of the local features generated by the next network layer.
- the second neural network can extract the details of the target object level by level.
- the computer device predicts the attribute of the target object based on at least one local feature of the target object.
- a local feature can represent different levels of details of the target object, considering that there are more details, these can be processed intensively to obtain more detailed local features, and then attribute predictions are made based on this more detailed local feature.
- this step 205 can be implemented through the process shown in the following steps 2051-2052.
- Step 2051 The computing device performs weighting processing on at least one local feature of the target object to obtain the target local feature.
- the target local feature is also a more detailed local feature.
- the computing device can perform weighting processing on the local features of the target object through the attention mechanism in the second neural network. In a possible implementation manner, this step 2051 can be implemented through the process shown in steps 21-22.
- Step 21 The computer device performs weight learning based on the second attention mechanism and the at least one local feature to obtain at least one third weight, and one third weight is used to indicate the importance of a local feature.
- the second attention mechanism is any attention mechanism in the second neural network
- the computer computing device performs weight learning based on the second attention mechanism and the at least one local feature
- the second attention mechanism can be
- the weight learning strategy in the second attention mechanism, the learning weight, the weight learning strategy in the second attention mechanism can be expressed as:
- W ⁇ is the sixth weight matrix
- b ⁇ is the sixth deviation parameter
- M represents the number of local features.
- weight learning strategy in the second attention mechanism may also be other attention weight learning strategies, and the embodiment of the application does not specifically limit the weight learning strategy in the second attention mechanism.
- Step 22 The computer device performs weighting processing on at least one local feature of the target object based on the at least one third weight to obtain the target local feature.
- the computer device substitutes at least one third weight and at least one local feature of the target object into the fifth formula, and uses the output of the fifth formula as the target local feature to implement weighting processing on at least one local feature of the target object.
- the fifth formula can be expressed as:
- a G is the attribute corresponding to the detection data corresponding to the current time series
- N is the number of layers of the second neural network.
- At least one local feature is weighted by at least one third weight, so that the obtained target local feature is more detailed.
- Step 2052 Predict the attributes of the target object based on the local features of the target.
- the computer device can substitute the local feature of the target into a sixth formula to predict the attribute of the target object.
- the sixth formula is used to predict the attribute of the target object.
- the sixth formula can be expressed as: among them, It is the attribute corresponding to the predicted target object in the J+1th detection data, W G is the seventh weight matrix, and b G is the seventh deviation parameter.
- the second neural network may determine whether to output the currently predicted attribute according to the global loss and the local loss in the second neural network.
- the second neural network after the global feature is input to the second neural network, if the global loss and the local loss in the second neural network meet preset conditions, the second neural network outputs the current prediction Attribute, otherwise, the second neural network adjusts the weight matrix in the second neural network until the global loss and local loss in the second neural network meet the preset conditions, the local loss is the second neural network in each layer The difference between the expected output data and the actual output data, and the global loss is the difference between the expected final output data of the second neural network and the actual final output data.
- the second neural network can predict the local features of any layer at the next detection time (referred to as predicted local features), then, the predicted local features of the i-th layer It can be expressed as
- the second neural network can predict the attributes of at least one target object.
- the second neural network can use a cross-entropy strategy to calculate the The local loss of any layer, then, the i-th local loss L li can be expressed as:
- Q is the number of target objects, Based on the global characteristics of the Q-th target object, the actual output data of the i-th layer, To predict the output data of the i-th layer based on the global characteristics of the Q-th target object.
- the second neural network can calculate the attributes of at least one target object in the next detection. Then, the The second neural network can predict the attributes of at least one target object at the next detection, and the cross-entropy strategy is used to calculate the global loss L G , where L G can be expressed as:
- the global loss and local loss in the second neural network meet the preset conditions, it means that the difference between the local features generated by each layer of the second neural network and the expected local features reaches the preset accuracy, which can ensure The accuracy of the local features of each layer of the second neural network is relatively high, so that the accuracy of the predicted attributes can be improved.
- the computer device since the second neural network is calculated based on numerical values, and each state in the attributes of the target object is actually represented by a state identifier, the computer device also needs to actualize the second neural network.
- the output data is converted into attributes composed of state identifiers.
- the actual output data of the second neural network may include at least one probability value. Each probability value corresponds to a state in the attribute of the target object.
- the computer device When any probability value is greater than the target value , Indicating that the target object has the target state corresponding to any probability, the computer device stores the first state identifier in the position of the target state in the attribute; when any probability value is less than or equal to the target value, the target If the object does not have the target state corresponding to any one of the probabilities, the computer device stores the second state identifier in the position of the target state in the attribute. Then, by judging each probability value, the actual expression of the attribute can be obtained.
- the embodiment of the application does not specifically limit the target value.
- the attention loop network is equivalent to the first neural network.
- the computing device inputs global features to each layer of the second neural network in the neural-level multi-label modeling part, and the first layer generates the first-level hierarchical features based on the global features
- the computer equipment is calculating the local loss L l1 of the first layer, and the first layer will Output to the second layer, so that the second layer can perform a calculation process similar to the first layer, and finally all the layers of the second neural network can get a Computer equipment will be M Output to the attention set (attentional ensembel), in the attention set, based on the second attention mechanism, generate predicted output data And then based on the predicted output data Generate a global loss L G. Then, when the loss of the global and local loss L G L i satisfies a preset condition, the second neural network output can be
- the method provided by the embodiment of the present application determines the global feature of the target object based on the regular feature representing the history of the detection feature and the future change rule, and refines the global feature to obtain at least one local feature of the target object, then ,
- the refined local features can better reflect the features of the target object, and then predict the attributes of the target object based on the local features. Therefore, the accuracy of the predicted attributes can be improved.
- the prediction can be improved The accuracy of the diagnosis results.
- the detection feature in the embodiment of the present application can better reflect the feature in the detection process of the target object, compared to the feature of the target object obtained only based on the detection data.
- the detection data is the actual detected data, which can be used as an objective basis to make the acquired detection features interpretable, and the attributes corresponding to the detection data are the result of subjective judgment. Therefore, based on the attributes and detection data, the acquired detection The accuracy of features is high.
- the global loss and local loss in the second neural network meet the preset conditions, it means that the local features generated by each layer of the second neural network have reached the expected value, so that the output layer of the second neural network can be guaranteed The accuracy of the output local features is high.
- FIG. 4 is a schematic structural diagram of a device for predicting attributes of a target object based on machine learning according to an embodiment of the present application, and the device includes:
- the acquiring module 401 is configured to determine the detection characteristics of the target object according to the detection data of the target object and the attributes corresponding to the detection data;
- the calculation module 402 is configured to input the detection features into a first neural network; for the detection features in each time sequence of the detection features, the first neural network outputs the first regular feature after two different time sequence calculations And a second regularity feature that is different from the first regularity feature, wherein the first regularity feature represents a historical change law of the detection feature, and the second regularity feature represents a future change law of the detection feature;
- the acquiring module 401 is further configured to determine the global characteristics of the target object based on the first regular feature and the second regular feature;
- the extraction module 403 is configured to input the global feature into a second neural network; the second neural network extracts and outputs at least one local feature of the target object from the global feature;
- the prediction module 404 is configured to predict the attributes of the target object based on at least one local feature of the target object.
- the obtaining module 401 is used to:
- the feature of the attribute and the feature of the detection data are input into a deep neural network, and a mixed feature of the detection data and the attribute corresponding to the detection data is extracted through the deep neural network, and the detection feature is output.
- the obtaining module 401 is used to:
- the global feature is determined.
- the obtaining module 401 is specifically configured to:
- weight learning is performed to obtain at least one first weight, and one of the first weights is used to represent the attribute of a piece of detection data corresponding to the piece of detection data Importance;
- weighting is performed on the third regularity feature to obtain the fourth regularity feature.
- the prediction module 404 includes:
- a processing unit configured to perform weighting processing on at least one local feature of the target object to obtain the target local feature
- the prediction unit is configured to predict the attributes of the target object based on the local features of the target.
- the processing unit is used for;
- weighting is performed on the at least one local feature to obtain the target local feature.
- each layer of the second neural network outputs one local feature.
- the device further includes an output module, configured to, after the global feature is input to the second neural network, if the global loss and the local loss in the second neural network meet a preset condition, then The second neural network outputs the current predicted attributes, the local loss is the difference between the expected output data of the second neural network in each layer and the actual output data, and the global loss is the second neural network The difference between the final output data expected by the network and the actual final data.
- an output module configured to, after the global feature is input to the second neural network, if the global loss and the local loss in the second neural network meet a preset condition, then The second neural network outputs the current predicted attributes, the local loss is the difference between the expected output data of the second neural network in each layer and the actual output data, and the global loss is the second neural network The difference between the final output data expected by the network and the actual final data.
- the device further includes a generating module for generating the local features output by the first target layer based on the hierarchical features of the first target layer in the second neural network and the local features generated by the second target layer,
- the hierarchical feature of the first target layer is used to indicate the state of the global feature in the first target layer
- the second target layer is the upper layer of the first target layer in the second neural network .
- the hierarchical characteristics of the first target layer are determined by the global characteristics and the hierarchical characteristics of the second target layer.
- FIG. 5 is a schematic structural diagram of a computer device provided by an embodiment of the present application.
- the computer device 500 may have relatively large differences due to different configurations or performance, and may include one or more CPUs (central processing units, processors) 501 And one or more memories 502, wherein at least one instruction is stored in the memory 502, and the at least one instruction is loaded and executed by the processor 501 to achieve the machine learning-based goals provided by the foregoing method embodiments Object attribute prediction method.
- the computer device 500 may also have components such as a wired or wireless network interface, a keyboard, and an input/output interface for input and output.
- the computer device 500 may also include other components for implementing device functions, which will not be repeated here.
- a computer-readable storage medium such as a memory including instructions, which can be executed by a processor in a terminal to complete the method for predicting attributes of a target object based on machine learning in the foregoing embodiment.
- the computer-readable storage medium may be ROM (read-only memory, read-only memory), RAM (random access memory, random access memory), CD-ROM (compact disc read-only memory, CD-ROM) , Magnetic tapes, floppy disks and optical data storage devices.
- a computer program product including instructions, which when run on a computer, cause the computer to execute the method for predicting the attributes of a target object based on machine learning in the foregoing embodiments.
- the device for predicting attributes of the target object based on machine learning provided in the above embodiment, only the division of the above functional modules is used as an example for illustration. In actual applications, the above function can be assigned differently according to needs.
- the function module is completed, that is, the internal structure of the device is divided into different function modules to complete all or part of the functions described above.
- the device for predicting the attributes of a target object based on machine learning provided in the above embodiments belongs to the same concept as the embodiments of the method for predicting attributes of a target object based on machine learning. The specific implementation process is detailed in the method embodiments, and will not be repeated here.
Landscapes
- Engineering & Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Health & Medical Sciences (AREA)
- Biomedical Technology (AREA)
- Data Mining & Analysis (AREA)
- Public Health (AREA)
- Medical Informatics (AREA)
- Computing Systems (AREA)
- Life Sciences & Earth Sciences (AREA)
- Molecular Biology (AREA)
- Computational Linguistics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- Biophysics (AREA)
- Evolutionary Computation (AREA)
- Artificial Intelligence (AREA)
- Epidemiology (AREA)
- Primary Health Care (AREA)
- Pathology (AREA)
- Databases & Information Systems (AREA)
- Image Analysis (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
Description
Claims (16)
- 一种基于机器学习的目标对象属性预测方法,由计算机设备执行,所述方法包括:根据目标对象的检测数据以及所述检测数据对应的属性,确定所述目标对象的检测特征;将所述检测特征输入第一神经网络;针对所述检测特征中各个时序上的检测特征,所述第一神经网络通过两种不同的时序计算后输出第一规律特征及不同于所述第一规律特征的第二规律特征,其中,所述第一规律特征表示所述检测特征的历史变化规律,所述第二规律特征表示所述检测特征的未来变化规律;基于所述第一规律特征和第二规律特征,确定所述目标对象的全局特征;将所述全局特征输入第二神经网络;所述第二神经网络从所述全局特征提取并输出所述目标对象的至少一个局部特征;基于所述目标对象的至少一个局部特征,对所述目标对象的属性进行预测。
- 根据权利要求1所述的方法,所述根据目标对象的检测数据以及所述检测数据对应的属性,确定所述目标对象的检测特征,包括:将所述检测数据对应的属性输入全连接神经网络,通过所述全连接神经网络筛选出所述属性中的目标状态,对所述目标状态进行加权处理,输出所述属性的特征;将所述检测数据输入时间序列分析工具,通过时间序列分析工具提取所述检测数据中每一类数据在各个时序的特征,输出特征集合;将特征集合输入深交叉神经网络,通过深交叉神经网络对所述特征集合内的各个时序特征进行交叉处理,得到所述检测数据的特征;将所述属性的特征以及所述检测数据的特征输入深度神经网络,通过所述深度神经网络提取所述检测数据和所述检测数据对应的属性的混合特征,输出所述检测特征。
- 根据权利要求1所述的方法,所述基于所述第一规律特征和第二规律 特征,确定所述目标对象的全局特征,包括:将所述第一规律特征和第二规律特征进行拼接,得到第三规律特征;对所述第三规律特征进行加权处理,得到第四规律特征,所述第四规律特征用于表示所述检测特征的全局变化规律;基于所述第三规律特征和所述第四规律特征,确定所述全局特征。
- 根据权利要求3所述的方法,所述对所述第三规律特征进行加权处理,得到第四规律特征,包括:基于第一注意力机制以及所述第三规律特征,进行权值学习,得到至少一个第一权值,一个所述第一权值用于表示一个检测数据与所述一个检测数据对应的属性的重要程度;对所述至少一个第一权值进行归一化处理,得到至少一个第二权值;基于所述至少一个第二权值,对所述第三规律特征进行加权处理,得到所述第四规律特征。
- 根据权利要求1所述的方法,所述基于所述目标对象的至少一个局部特征,对所述目标对象的属性进行预测,包括:对所述目标对象的至少一个局部特征进行加权处理,得到目标局部特征;基于所述目标局部特征,对所述目标对象的属性进行预测。
- 根据权利要求5所述的方法,所述对所述目标对象的至少一个局部特征进行加权处理,得到目标局部特征,包括:基于第二注意力机制以及所述至少一个局部特征,进行权值学习,得到至少一个第三权值,一个第三权值用于表示一个局部特征的重要程度;基于所述至少一个第三权值,对所述至少一个局部特征进行加权处理,得到所述目标局部特征。
- 根据权利要求1所述的方法,所述第二神经网络的每一层输出一个所述局部特征。
- 根据权利要求1所述的方法,所述方法还包括:当将所述全局特征输入至所述第二神经网络后,若所述第二神经网络中的全局损失以及局部损失满足预设条件,则所述第二神经网络输出当前预测的属 性,所述局部损失为所述第二神经网络在每一层期望的输出数据与实际的输出数据的差值,所述全局损失为所述第二神经网络期望的最终输出数据与实际的最终数据的差值。
- 根据权利要求1所述的方法,所述方法还包括:基于所述第二神经网络中第一目标层的层级特征以及第二目标层生成的局部特征,生成所述第一目标层输出的局部特征,第一目标层的所述层级特征用于表示所述全局特征在第一目标层的中的状态,所述第二目标层为所述第二神经网络中所述第一目标层的上一层。
- 根据权利要求9所述的方法,所述第一目标层的层级特征由所述全局特征以及所述第二目标层的层级特征来决定。
- 一种基于机器学习的目标对象属性预测装置,所述装置包括:获取模块,用于根据目标对象的检测数据以及所述检测数据对应的属性,确定所述目标对象的检测特征;计算模块,用于将所述检测特征输入第一神经网络;针对所述检测特征中各个时序上的检测特征,所述第一神经网络通过两种不同的时序计算后输出第一规律特征及不同于所述第一规律特征的第二规律特征,其中,所述第一规律特征表示所述检测特征的历史变化规律,所述第二规律特征表示所述检测特征的未来变化规律;所述获取模块,还用于基于所述第一规律特征和第二规律特征,确定所述目标对象的全局特征;提取模块,用于将所述全局特征输入第二神经网络;所述第二神经网络从所述全局特征提取并输出所述目标对象的至少一个局部特征;预测模块,用于基于所述目标对象的至少一个局部特征,对所述目标对象的属性进行预测。
- 根据权利要求11所述的装置,所述获取模块用于:将所述检测数据对应的属性输入全连接神经网络,通过所述全连接神经网络删除对所述检测数据对应的属性中的非必要因素,得到所述检测数据对应的属性的特征;将所述检测数据输入时间序列分析工具,通过时间序列分析工具提取所述检测数据中每一类数据在各个时序的特征,输出特征集合;将特征集合输入深交叉神经网络,通过深交叉神经网络对所述特征集合内的各个时序特征进行交叉处理,得到所述检测数据的特征;将所述属性的特征以及所述检测数据的特征输入深度神经网络,通过所述深度神经网络提取所述检测数据和所述检测数据对应的属性的混合特征,输出所述检测特征。
- 根据权利要求11所述的装置,所述获取模块用于:将所述第一规律特征和第二规律特征进行拼接,得到第三规律特征;对所述第三规律特征进行加权处理,得到第四规律特征,所述第四规律特征用于表示所述检测特征的全局变化规律;基于所述第三规律特征和所述第四规律特征,确定所述全局特征。
- 一种计算机设备,其特征在于,所述计算机设备包括一个或多个处理器和一个或多个存储器,所述一个或多个存储器中存储有至少一条指令,所述指令由所述一个或多个处理器加载并执行以实现如权利要求1至权利要求10任一项所述的基于机器学习的目标对象属性预测方法所执行的操作。
- 一种计算机可读存储介质,其特征在于,所述存储介质中存储有至少一条指令,所述指令由处理器加载并执行以实现如权利要求1至权利要求10任一项所述的基于机器学习的目标对象属性预测方法所执行的操作。
- 一种包括指令的计算机程序产品,当其在计算机上运行时,使得所述计算机执行权利要求1至10任意一项所述的基于机器学习的目标对象属性预测方法所执行的操作。
Priority Applications (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020217025407A KR20210113336A (ko) | 2019-05-09 | 2020-04-22 | 머신 러닝에 기반한 타깃 객체 속성 예측 방법 및 관련 디바이스 |
JP2021562367A JP7191443B2 (ja) | 2019-05-09 | 2020-04-22 | 機械学習に基づくターゲットオブジェクト属性予測方法、関連機器及びコンピュータプログラム |
EP20801439.9A EP3968337A4 (en) | 2019-05-09 | 2020-04-22 | Target object attribute prediction method based on machine learning and related device |
US17/469,270 US20210406687A1 (en) | 2019-05-09 | 2021-09-08 | Method for predicting attribute of target object based on machine learning and related device |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910386448.4 | 2019-05-09 | ||
CN201910386448.4A CN110111885B (zh) | 2019-05-09 | 2019-05-09 | 属性预测方法、装置、计算机设备及计算机可读存储介质 |
Related Child Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US17/469,270 Continuation US20210406687A1 (en) | 2019-05-09 | 2021-09-08 | Method for predicting attribute of target object based on machine learning and related device |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2020224433A1 true WO2020224433A1 (zh) | 2020-11-12 |
Family
ID=67489143
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2020/086007 WO2020224433A1 (zh) | 2019-05-09 | 2020-04-22 | 基于机器学习的目标对象属性预测方法及相关设备 |
Country Status (6)
Country | Link |
---|---|
US (1) | US20210406687A1 (zh) |
EP (1) | EP3968337A4 (zh) |
JP (1) | JP7191443B2 (zh) |
KR (1) | KR20210113336A (zh) |
CN (1) | CN110111885B (zh) |
WO (1) | WO2020224433A1 (zh) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114925806A (zh) * | 2022-03-30 | 2022-08-19 | 北京达佳互联信息技术有限公司 | 信息处理方法、信息处理模型训练方法及装置 |
Families Citing this family (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110111885B (zh) * | 2019-05-09 | 2023-09-19 | 腾讯科技(深圳)有限公司 | 属性预测方法、装置、计算机设备及计算机可读存储介质 |
CN112133441B (zh) * | 2020-08-21 | 2024-05-03 | 广东省人民医院 | 一种mh术后裂孔状态预测模型的建立方法和终端 |
CN115994632A (zh) * | 2023-03-24 | 2023-04-21 | 北京搜狐新动力信息技术有限公司 | 一种点击率预测方法、装置、设备及可读存储介质 |
CN116072298B (zh) * | 2023-04-06 | 2023-08-15 | 之江实验室 | 一种基于层级标记分布学习的疾病预测系统 |
CN116824281B (zh) * | 2023-08-30 | 2023-11-14 | 浙江大学 | 一种隐私保护的图像分类方法及装置 |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20080010024A1 (en) * | 2003-09-23 | 2008-01-10 | Prediction Sciences Llp | Cellular fibronectin as a diagnostic marker in cardiovascular disease and methods of use thereof |
CN106778014A (zh) * | 2016-12-29 | 2017-05-31 | 浙江大学 | 一种基于循环神经网络的患病风险预测方法 |
CN108648829A (zh) * | 2018-04-11 | 2018-10-12 | 平安科技(深圳)有限公司 | 疾病预测方法及装置、计算机装置及可读存储介质 |
CN109698017A (zh) * | 2018-12-12 | 2019-04-30 | 中电健康云科技有限公司 | 医疗病历数据生成方法及装置 |
CN110111885A (zh) * | 2019-05-09 | 2019-08-09 | 腾讯科技(深圳)有限公司 | 属性预测方法、装置、计算机设备及计算机可读存储介质 |
Family Cites Families (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP3302081B2 (ja) * | 1993-03-23 | 2002-07-15 | 北陸電力株式会社 | 電力需要量予測装置 |
JP3767954B2 (ja) * | 1996-11-07 | 2006-04-19 | 富士通株式会社 | 需要予測装置 |
US6171256B1 (en) * | 1998-04-30 | 2001-01-09 | Physio-Control Manufacturing Corporation | Method and apparatus for detecting a condition associated with acute cardiac ischemia |
US7634360B2 (en) * | 2003-09-23 | 2009-12-15 | Prediction Sciences, LL | Cellular fibronectin as a diagnostic marker in stroke and methods of use thereof |
WO2014055718A1 (en) * | 2012-10-04 | 2014-04-10 | Aptima, Inc. | Clinical support systems and methods |
JP6276681B2 (ja) | 2014-11-14 | 2018-02-07 | アオイ電子株式会社 | 試料固定装置および試料分析装置 |
US10699151B2 (en) * | 2016-06-03 | 2020-06-30 | Miovision Technologies Incorporated | System and method for performing saliency detection using deep active contours |
CN106777874A (zh) * | 2016-11-18 | 2017-05-31 | 中国科学院自动化研究所 | 基于循环神经网络构建预测模型的方法 |
CN110532571B (zh) * | 2017-09-12 | 2022-11-18 | 腾讯科技(深圳)有限公司 | 文本处理方法及相关装置 |
US11101029B2 (en) * | 2018-07-17 | 2021-08-24 | Petuum Inc. | Systems and methods for predicting medications to prescribe to a patient based on machine learning |
CN109599177B (zh) * | 2018-11-27 | 2023-04-11 | 华侨大学 | 一种基于病历的深度学习预测医疗轨迹的方法 |
-
2019
- 2019-05-09 CN CN201910386448.4A patent/CN110111885B/zh active Active
-
2020
- 2020-04-22 EP EP20801439.9A patent/EP3968337A4/en active Pending
- 2020-04-22 JP JP2021562367A patent/JP7191443B2/ja active Active
- 2020-04-22 WO PCT/CN2020/086007 patent/WO2020224433A1/zh unknown
- 2020-04-22 KR KR1020217025407A patent/KR20210113336A/ko active IP Right Grant
-
2021
- 2021-09-08 US US17/469,270 patent/US20210406687A1/en active Pending
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20080010024A1 (en) * | 2003-09-23 | 2008-01-10 | Prediction Sciences Llp | Cellular fibronectin as a diagnostic marker in cardiovascular disease and methods of use thereof |
CN106778014A (zh) * | 2016-12-29 | 2017-05-31 | 浙江大学 | 一种基于循环神经网络的患病风险预测方法 |
CN108648829A (zh) * | 2018-04-11 | 2018-10-12 | 平安科技(深圳)有限公司 | 疾病预测方法及装置、计算机装置及可读存储介质 |
CN109698017A (zh) * | 2018-12-12 | 2019-04-30 | 中电健康云科技有限公司 | 医疗病历数据生成方法及装置 |
CN110111885A (zh) * | 2019-05-09 | 2019-08-09 | 腾讯科技(深圳)有限公司 | 属性预测方法、装置、计算机设备及计算机可读存储介质 |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114925806A (zh) * | 2022-03-30 | 2022-08-19 | 北京达佳互联信息技术有限公司 | 信息处理方法、信息处理模型训练方法及装置 |
Also Published As
Publication number | Publication date |
---|---|
EP3968337A1 (en) | 2022-03-16 |
CN110111885B (zh) | 2023-09-19 |
JP2022530868A (ja) | 2022-07-04 |
EP3968337A4 (en) | 2022-06-29 |
CN110111885A (zh) | 2019-08-09 |
JP7191443B2 (ja) | 2022-12-19 |
KR20210113336A (ko) | 2021-09-15 |
US20210406687A1 (en) | 2021-12-30 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2020224433A1 (zh) | 基于机器学习的目标对象属性预测方法及相关设备 | |
CN109559300A (zh) | 图像处理方法、电子设备及计算机可读存储介质 | |
CN106295186A (zh) | 一种基于智能推理的辅助疾病诊断的方法与系统 | |
CN116994709B (zh) | 一种个性化的饮食与运动推荐方法、系统及电子设备 | |
Sharma et al. | Application of fuzzy logic and genetic algorithm in heart disease risk level prediction | |
CN110363090A (zh) | 智能心脏疾病检测方法、装置及计算机可读存储介质 | |
CN113707323B (zh) | 基于机器学习的疾病预测方法、装置、设备及介质 | |
CN113673244B (zh) | 医疗文本处理方法、装置、计算机设备和存储介质 | |
Xu et al. | Intelligent emotion detection method based on deep learning in medical and health data | |
CN113722474A (zh) | 文本分类方法、装置、设备及存储介质 | |
CN109935337A (zh) | 一种基于相似性度量的病案查找方法及系统 | |
CN116434976A (zh) | 一种融合多源知识图谱的药物重定位方法和系统 | |
CN116994695A (zh) | 报告生成模型的训练方法、装置、设备及存储介质 | |
El-Magd et al. | An interpretable deep learning based approach for chronic obstructive pulmonary disease using explainable artificial intelligence | |
CN112216379A (zh) | 一种基于智能联合学习的疾病诊断系统 | |
CN114191665A (zh) | 机械通气过程中人机异步现象的分类方法和分类装置 | |
CN116612843B (zh) | 一种心理测评掩饰性行为识别方法及系统 | |
Hasan et al. | DEVELOPMENT OF HEART ATTACK PREDICTION MODEL BASED ON ENSEMBLE LEARNING. | |
CN109361539A (zh) | 一种基于发育网络的性能告警方法和装置 | |
Alazwari et al. | Improved Coyote Optimization Algorithm and Deep Learning Driven Activity Recognition in Healthcare | |
Ritul et al. | Identifying Alzheimer’s Dementia Patients Using MRI Clinical Data | |
Singh et al. | Malaria parasite recognition in thin blood smear images using squeeze and excitation networks | |
WO2024216763A1 (zh) | 图像处理方法、装置、电子设备及存储介质 | |
Haritha et al. | Parkinson Disease Identification Using Hand Written Images | |
Li et al. | Artificial Intelligence-Based Assessment of Physical Education and Training Effectiveness |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 20801439 Country of ref document: EP Kind code of ref document: A1 |
|
ENP | Entry into the national phase |
Ref document number: 20217025407 Country of ref document: KR Kind code of ref document: A |
|
ENP | Entry into the national phase |
Ref document number: 2021562367 Country of ref document: JP Kind code of ref document: A |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
ENP | Entry into the national phase |
Ref document number: 2020801439 Country of ref document: EP Effective date: 20211209 |