WO2023050668A1 - Procédé de construction de modèle de regroupement basé sur une inférence causale et procédé de traitement de données médicales - Google Patents

Procédé de construction de modèle de regroupement basé sur une inférence causale et procédé de traitement de données médicales Download PDF

Info

Publication number
WO2023050668A1
WO2023050668A1 PCT/CN2022/074389 CN2022074389W WO2023050668A1 WO 2023050668 A1 WO2023050668 A1 WO 2023050668A1 CN 2022074389 W CN2022074389 W CN 2022074389W WO 2023050668 A1 WO2023050668 A1 WO 2023050668A1
Authority
WO
WIPO (PCT)
Prior art keywords
sample
patient
model
data
loss
Prior art date
Application number
PCT/CN2022/074389
Other languages
English (en)
Chinese (zh)
Inventor
徐卓扬
孙行智
赵婷婷
胡岗
Original Assignee
平安科技(深圳)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 平安科技(深圳)有限公司 filed Critical 平安科技(深圳)有限公司
Publication of WO2023050668A1 publication Critical patent/WO2023050668A1/fr

Links

Images

Classifications

    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H50/00ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
    • G16H50/20ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for computer-aided diagnosis, e.g. based on medical expert systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/084Backpropagation, e.g. using gradient descent
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computing arrangements using knowledge-based models
    • G06N5/04Inference or reasoning models
    • G06N5/041Abduction
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H50/00ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
    • G16H50/70ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for mining of medical data, e.g. analysing previous cases of other patients

Definitions

  • the embodiment of the present application relates to the technical field of artificial intelligence, and in particular to a method for constructing a grouping model based on causal inference and a method for processing medical data.
  • the embodiment of the present application provides a method for constructing a grouping model based on causal inference, a system, a computer device, a computer-readable storage medium, and a medical data processing method, which are used to solve the problems of learning and processing of existing deep reinforcement learning models.
  • the grouping model trained based on the deep reinforcement learning model has low accuracy and unreasonable problems in making patient grouping decisions.
  • One aspect of the present application provides a method for constructing a grouping model based on causal inference, including:
  • Obtain multiple sample data of multiple sample patients, and multiple sample data of each sample patient includes multiple basic data, multiple patient historical follow-up data and sample patient grouping result data;
  • Input a plurality of sample data of the plurality of sample patients into the model to be trained, output the propensity score of each sample patient for its corresponding sample patient grouping result data through the model to be trained, and use the model to be trained
  • the model outputs each sample patient corresponding to the sample expected cumulative reward value of each model patient grouping result data in the model to be trained, wherein the propensity score represents the probability that the sample patient corresponds to the sample patient grouping result data;
  • the model parameters in the model to be trained are adjusted to optimize the grouping model.
  • Another aspect of the embodiment of the present application provides a system for constructing a grouping model based on causal inference, including:
  • the first acquisition module is used to acquire multiple sample data of multiple sample patients, and the multiple sample data of each sample patient includes multiple basic data, multiple patient historical follow-up data and sample patient grouping result data;
  • the first model processing module is used to input multiple sample data of the multiple sample patients into the model to be trained, and output the propensity score of each sample patient for its corresponding sample patient grouping result data through the model to be trained value, and the sample expected cumulative reward value of each sample patient corresponding to each model patient grouping result data in the model to be trained is output by the model to be trained, wherein the propensity score indicates that the sample patient corresponds to the sample patient Probability of patient cohort outcome data;
  • the first determining module is used to determine the target sample expected cumulative reward value corresponding to each sample patient from the sample expected cumulative reward value of each sample patient;
  • the optimization module is used to adjust the model parameters in the model to be trained based on the preset loss function, the propensity score of each sample patient and the corresponding expected cumulative reward value of the target sample, so as to optimize the clustering model.
  • Another aspect of the embodiment of the present application provides a medical data processing method, including:
  • the plurality of patient data including a plurality of basic data, a plurality of patient historical follow-up data, and a patient current follow-up data;
  • the target expected cumulative reward value determine the corresponding model patient grouping result data as the target patient grouping result data corresponding to the target patient.
  • a medical data processing system including:
  • the second acquiring module is used to acquire multiple patient data of the target patient, the multiple patient data including multiple basic data, multiple patient historical follow-up data and patient current follow-up data;
  • the second model processing module is used to input the plurality of basic data, the plurality of patient history follow-up data and the patient current follow-up data into the above-mentioned grouping model, and output the target through the grouping model The patient's expected cumulative reward value corresponding to the patient grouping result data of each model;
  • the second determining module is used to determine the largest expected cumulative reward value as a target expected cumulative reward value from among multiple expected cumulative reward values.
  • the third determination module is configured to determine the corresponding model patient grouping result data as the target patient grouping result data corresponding to the target patient according to the target expected cumulative reward value.
  • an embodiment of the present application further provides a computer device, the computer device includes a memory, a processor, and a computer program stored in the memory and operable on the processor, and the processor executes the The following steps are also performed when the computer program:
  • Obtain multiple sample data of multiple sample patients, and multiple sample data of each sample patient includes multiple basic data, multiple patient historical follow-up data and sample patient grouping result data;
  • Input a plurality of sample data of the plurality of sample patients into the model to be trained, output the propensity score of each sample patient for its corresponding sample patient grouping result data through the model to be trained, and use the model to be trained
  • the model outputs each sample patient corresponding to the sample expected cumulative reward value of each model patient grouping result data in the model to be trained, wherein the propensity score represents the probability that the sample patient corresponds to the sample patient grouping result data;
  • the model parameters in the model to be trained are adjusted to optimize the grouping model.
  • an embodiment of the present application further provides a computer-readable storage medium, where a computer program is stored in the computer-readable storage medium, and the computer program can be executed by at least one processor, so that the at least A processor performs the following steps:
  • Obtain multiple sample data of multiple sample patients, and multiple sample data of each sample patient includes multiple basic data, multiple patient historical follow-up data and sample patient grouping result data;
  • Input a plurality of sample data of the plurality of sample patients into the model to be trained, output the propensity score of each sample patient for its corresponding sample patient grouping result data through the model to be trained, and use the model to be trained
  • the model outputs each sample patient corresponding to the sample expected cumulative reward value of each model patient grouping result data in the model to be trained, wherein the propensity score represents the probability that the sample patient corresponds to the sample patient grouping result data;
  • the model parameters in the model to be trained are adjusted to optimize the grouping model.
  • the causal inference-based grouping model construction method, system, computer equipment, computer-readable storage medium, and medical data processing method use the multiple basic data of the multiple sample patients and the multiple patient historical follow-up data and the sample patient grouping result data are input into the model to be trained, and the propensity score of each sample patient for its corresponding sample patient grouping result data and each sample patient corresponding to each model patient grouping result data are output by the model to be trained
  • the sample expected cumulative reward value of each sample patient from the sample expected cumulative reward value of each sample patient, determine the target sample expected cumulative reward value corresponding to each sample patient; and based on the preset loss function, the propensity score of each sample patient value and the corresponding expected cumulative reward value of the target sample, adjust the model parameters in the model to be trained to optimize the clustering model; train multiple sample data by combining the model to be trained with causal inference analysis, and eliminate the need for patient clustering result data
  • the selection bias makes the model fit more reasonable, and the trained model has a higher application accuracy.
  • Fig. 1 is the flow chart of the steps of the grouping model construction method based on causal inference in Embodiment 1 of the present application;
  • Fig. 2 is the flow chart of the steps of the grouping model construction method based on causal inference in Embodiment 1 of the present application;
  • Fig. 3 is a flow chart of the steps of the method for constructing a grouping model based on causal inference in Embodiment 1 of the present application;
  • FIG. 4 is a schematic diagram of program modules of a system for constructing a grouping model based on causal inference in Embodiment 2 of the present application;
  • FIG. 5 is a flow chart of the steps of the medical data processing method of Embodiment 3 of the present application.
  • FIG. 6 is a schematic diagram of the program modules of the medical data processing system according to Embodiment 4 of the present application.
  • FIG. 7 is a schematic diagram of a hardware structure of a computer device according to Embodiment 5 of the present application.
  • FIG. 1 shows a flow chart of the steps of the method for constructing a grouping model based on causal inference according to an embodiment of the present application. It can be understood that the flowchart in this method embodiment is not used to limit the sequence of execution steps.
  • the following is an exemplary description taking computer equipment as the execution subject, as follows:
  • the method for constructing a grouping model based on causal inference may include steps S100 to S106, wherein:
  • step S100 a plurality of sample data of a plurality of sample patients is acquired, and the plurality of sample data of each sample patient includes a plurality of basic data, a plurality of patient history follow-up data and sample patient grouping result data.
  • the plurality of sample patients may be a plurality of diabetic patients.
  • the historical follow-up data of multiple diabetic patients are collected in chronological order, the basic data of a diabetic patient, the data of each follow-up visit, and the corresponding sample patient grouping result data.
  • the diabetic patient is used as a sample data.
  • multiple basic data include but are not limited to age, gender, place of work, frequently visited places, etc.; follow-up data include: medication history, medical test reports from third-party platforms or medical systems, expert/doctor prescription information, etc. data.
  • the method further includes: performing preprocessing on multiple sample data, specifically including performing feature merging on multiple basic data of multiple sample patients and performing historical follow-up data on multiple patients through feature engineering.
  • the features are combined to obtain the training data. For example, through feature engineering, the first feature primitives of each basic data and the second feature primitives of each patient's historical follow-up data are obtained, and the first feature primitives corresponding to each basic data are respectively performed based on the sample patient grouping result data.
  • Step S102 input multiple sample data of the multiple sample patients into the model to be trained, output the propensity score of each sample patient for its corresponding sample patient grouping result data through the model to be trained, and pass the The model to be trained outputs the sample expected cumulative reward value of each sample patient corresponding to the patient grouping result data of each model in the model to be trained, wherein the propensity score indicates that the sample patient corresponds to the sample patient grouping result data probability.
  • the model to be trained can be a deep reinforcement learning model (Deep Q Network, DQN model).
  • the preprocessed sample data input into the deep reinforcement learning model is defined as state (state); multiple model patient grouping result data are defined as action (action), according to the sample patient in states (multiple The result information obtained after taking action under sample data) defines reward (reward).
  • Action is the one-hot encoding of patient grouping result data, and reward includes long-term reward and short-term reward.
  • the long-term reward can be positioned as: sign (whether there is a complication in the last follow-up)*5; the short-term reward can be defined as: sign (whether the glycated hemoglobin reaches the target in the next follow-up)*1.
  • step S300 performing random allocation on multiple sample data of the multiple sample patients, to obtain a plurality of training sample data and a plurality of control sample data
  • step S302 input the plurality of training sample data and a plurality of control sample data into the model to be trained, and use the model to be trained to Logistic regression is performed on the plurality of training sample data and the plurality of control sample data, and the propensity score of each sample patient to its corresponding sample patient grouping result data is calculated.
  • a plurality of training sample data of a plurality of first sample patients and a plurality of control sample data of a plurality of second sample patients are randomly assigned, and according to each first sample patient from a plurality of second sample Among the patients, the second sample patient is determined for control, which can be understood as being based on the third similarity between the training sample data of the first sample patient and the control sample data of each second sample patient, from each Screen out one or more second sample patients corresponding to a third preset threshold value from the plurality of third similarities corresponding to the first sample patient, and determine the sample patient from the screened one or more second sample patients
  • the second sample patients with inconsistent data in the clustering results were used for control analysis, so as to conduct causal analysis based on randomly assigned sample data through the model.
  • multiple training sample data of the first sample patient are positive sample data
  • each control sample data of one or more second sample patients screened out are negative sample data.
  • the DQN model combines the propensity represented by the sample data to make the expected reward output by the model more accurate.
  • the model to be trained includes an input layer, an output layer, at least four NN layers (hidden layers) and a classification layer, wherein the input layer is used for Receive a plurality of sample data of a plurality of sample patients, the hidden layer is used to analyze and process the plurality of sample data, the output layer includes a plurality of output nodes, and each output node outputs the corresponding model patient grouping of the node The score of the result data; the classification layer is used to convert the score corresponding to each output node into the sample expected cumulative reward value of the patient grouping result data of each model.
  • the input layer is used for Receive a plurality of sample data of a plurality of sample patients
  • the hidden layer is used to analyze and process the plurality of sample data
  • the output layer includes a plurality of output nodes, and each output node outputs the corresponding model patient grouping of the node The score of the result data
  • the classification layer is used to convert the score corresponding to each output node into the sample expected cumulative reward value of the patient grouping
  • a plurality of sample data (states) of the plurality of sample patients are input into the input layer of the model to be trained, and after being processed by two layers of hidden layers, the propensity score of each sample patient for its sample patient grouping result data is output.
  • value g and other eigenvalues, and other eigenvalues are input to the rest of the hidden layer, and output the sample expected cumulative reward values Q 0 , Q 1 , Q of each sample patient corresponding to each model patient grouping result data (action) through the output layer 2 , . . . , Q n .
  • g represents the probability of doctors or experts taking the corresponding sample patient grouping result data under states.
  • g can be expressed as p(a 1
  • s) 1-p(a 0
  • Step S104 from the sample expected cumulative reward value of each sample patient, determine the target sample expected cumulative reward value corresponding to each sample patient.
  • the target sample expected cumulative reward value corresponding to the sample patient is determined to be the largest sample expected cumulative reward value from the multiple sample expected cumulative reward values of each sample patient.
  • Step S106 based on the preset loss function, the propensity score of each patient sample and the corresponding expected cumulative reward value of the target sample, adjust the model parameters in the model to be trained to optimize the clustering model.
  • Loss Q_loss+ ⁇ 1 *g_loss+ ⁇ 2 *reg_loss;
  • Loss is represented as a loss value
  • reg_loss is represented as a regression loss value
  • g_loss is represented as a first loss value
  • Q_loss is represented as a second loss value
  • ⁇ 1 and ⁇ 2 are adjustable hyperparameters of the model to be trained.
  • the model to be trained is repeatedly trained through the following loss function, the Loss is calculated through the loss function, the gradient is calculated for the Loss, the model parameters of the model are adjusted by using the gradient descent algorithm to backpropagate the Loss, and the training is repeated until the Loss is no longer , the grouping model is obtained.
  • the sample data is organized into a quadruple form such as (st t , a t , r, st t+1 ), where st t represents the state at time t, and a t represents the grouping scheme of doctors at time t ( action), r and s t+1 represent the reward obtained after taking a t under s t and the next state to transfer to.
  • the loss function is as follows:
  • the first loss function includes:
  • reg_loss Q(s t , a tmax )-Q(s t , a t );
  • reg_loss is the regression loss value, which is used to prevent overestimation of the Q value.
  • Q represents the expected cumulative reward value corresponding to the sample patient
  • s t represents multiple sample data at time t
  • a t represents the corresponding value of the sample patient at time t.
  • Sample patient grouping result data, Q( st , atmax ) represents the largest sample expected cumulative reward value among multiple sample expected cumulative reward values output by the model to be trained
  • Q(st t , at t ) represents the sample patient The actual expected cumulative reward value of the sample patient clustering result data determined in the s t state;
  • the second loss function includes:
  • g_loss CrossEntropy(g(s t ), to_one_hot(a t ));
  • the third loss function includes:
  • Q_loss (Q(s t , a t )-( ⁇ +max a ( ⁇ *Q(s t+1 , a t+1 )))) 2 ;
  • the discount factor is used to indicate the attenuation ratio of the expected cumulative reward value of the target sample at the next time t+1 discounted to the expected cumulative reward value of the target sample corresponding to the time t;
  • Q_loss is the second loss value.
  • the deep reinforcement learning model is combined with causal inference analysis to train multiple sample data, decoupling the tendency of patient grouping result data representation, eliminating the deviation of patient grouping result data selection, and the model fitting is more reasonable;
  • FIG. 4 shows a schematic diagram of the program modules of the causal inference-based grouping model building system of the present application.
  • the grouping model construction system 40 based on causal inference may include or be divided into one or more program modules, and one or more program modules are stored in a storage medium and executed by one or more processors. Execute to complete the application and realize the above-mentioned method for constructing a grouping model based on causal inference.
  • the program module referred to in the embodiment of the present application refers to a series of computer program instruction segments capable of accomplishing specific functions, which is more suitable than the program itself to describe the execution process of the causal inference-based grouping model construction system 40 in the storage medium. The following description will specifically introduce the functions of each program module of the present embodiment:
  • the said grouping model construction system 40 based on causal inference includes:
  • the first acquiring module 400 is configured to acquire multiple sample data of multiple sample patients, and the multiple sample data of each sample patient includes multiple basic data, multiple patient historical follow-up data and sample patient grouping result data;
  • the first model processing module 402 is configured to input multiple sample data of the multiple sample patients into the model to be trained, and output the tendency of each sample patient to its corresponding sample patient grouping result data through the model to be trained Score and each sample patient corresponding to the sample expected cumulative reward value of each model patient clustering result data, wherein the propensity score represents the probability that the sample patient corresponds to the sample patient clustering result data;
  • the first determining module 404 is configured to determine the target sample expected cumulative reward value corresponding to each sample patient from the sample expected cumulative reward value of each sample patient;
  • the optimization module 406 is used to adjust the model parameters in the model to be trained based on the preset loss function, the propensity score of each sample patient and the corresponding expected cumulative reward value of the target sample, so as to optimize the grouping model.
  • the preset loss function includes a first loss function, a second loss function, and a third loss function; the optimization module 406 is further configured to: based on the first loss function and the The expected cumulative reward value of the target sample corresponding to each sample patient is calculated to obtain the regression loss value; based on the second loss function and the propensity score of each sample patient, the first propensity score corresponding to the propensity score is calculated.
  • Loss value based on the third loss function and the target sample expected cumulative reward value corresponding to each sample patient, calculate the second loss value corresponding to the target sample expected cumulative reward value; for the regression loss value, the Summing the first loss value and the second loss value to obtain a loss value; modifying the model parameters in the model to be trained according to the loss value to obtain a modified model to be trained; and
  • the modified model to be trained performs group training on the multiple sample data of the multiple sample patients, and stops the training when the modified model parameters reach the preset number of modifications and the loss value does not decrease, and the current The model to be trained is marked as the grouping model.
  • the first loss function includes:
  • reg_loss Q(s t , a tmax )-Q(s t , a t );
  • reg_loss is the regression loss value, which is used to prevent overestimation of the Q value.
  • Q represents the expected cumulative reward value corresponding to the sample patient
  • s t represents multiple sample data at time t
  • a t represents the corresponding value of the sample patient at time t.
  • Sample patient grouping result data, Q( st , atmax ) represents the largest sample expected cumulative reward value among multiple sample expected cumulative reward values output by the model to be trained
  • Q(st t , at t ) represents the sample patient The actual expected cumulative reward value of the sample patient clustering result data determined in the s t state;
  • the second loss function includes:
  • g_loss CrossEntropy(g(s t ), to_one_hot(a t ));
  • the third loss function includes:
  • Q_loss (Q(s t , a t )-( ⁇ +max a ( ⁇ *Q(s t+1 , a t+1 )))) 2 ;
  • the discount factor is used to indicate the attenuation ratio of the expected cumulative reward value of the target sample at the next time t+1 discounted to the expected cumulative reward value of the target sample corresponding to the time t;
  • Q_loss is the second loss value.
  • the model to be trained is a deep reinforcement learning model.
  • the first model processing module 402 is further configured to: randomly distribute the multiple sample data of the multiple sample patients to obtain multiple training sample data and multiple control samples data; and input the plurality of training sample data and the plurality of control sample data into the model to be trained, and perform logic on the plurality of training sample data and the plurality of control sample data through the model to be trained Regression, calculating the propensity score of each sample patient for its corresponding sample patient grouping result data.
  • FIG. 5 shows a flow chart of the steps of the medical data processing method according to the embodiment of the present application. It can be understood that the flowchart in this method embodiment is not used to limit the sequence of execution steps.
  • the following is an exemplary description taking computer equipment as the execution subject, as follows:
  • the medical data processing method may include steps S500-S506, wherein:
  • Step S500 acquiring a plurality of patient data of the target patient, the plurality of patient data including a plurality of basic data, a plurality of patient history follow-up data and a patient current follow-up data;
  • Step S502 input the plurality of basic data, the plurality of patient history follow-up data and the patient current follow-up data into the grouping model as described above, and output the target patient corresponding to each model through the grouping model The expected cumulative reward value of patient grouping result data;
  • Step S504 from the multiple expected cumulative reward values, determine the largest expected cumulative reward value as the target expected cumulative reward value;
  • Step S506 according to the target expected cumulative reward value, determine the corresponding model patient grouping result data as the target patient grouping result data corresponding to the target patient.
  • FIG. 6 shows a schematic diagram of program modules of the medical data processing system of the present application.
  • the medical data processing system 60 may include or be divided into one or more program modules, and one or more program modules are stored in a storage medium and executed by one or more processors to complete
  • the program module referred to in the embodiment of this application refers to a series of computer program instruction segments capable of completing specific functions, which is more suitable for describing the execution process of the medical data processing system 60 in the storage medium than the program itself.
  • the following description will specifically introduce the functions of each program module of the present embodiment:
  • the medical data processing system includes:
  • the second acquiring module 600 is configured to acquire multiple patient data of the target patient, the multiple patient data including multiple basic data, multiple patient historical follow-up data, and patient current follow-up data;
  • the second model processing module 602 is configured to input the multiple basic data, the multiple patient historical follow-up data and the patient current follow-up data into the grouping model according to any one of claims 1-5, through
  • the grouping model outputs the expected cumulative reward value corresponding to each model patient grouping result data of the target patient;
  • the second determining module 604 is configured to determine the largest expected cumulative reward value as a target expected cumulative reward value from among multiple expected cumulative reward values;
  • the third determination module 606 is configured to determine the corresponding model patient grouping result data as the target patient grouping result data corresponding to the target patient according to the target expected cumulative reward value.
  • the computer device 2 is a device capable of automatically performing numerical calculation and/or information processing according to preset or stored instructions.
  • the computer device 2 may be a rack server, a blade server, a tower server or a cabinet server (including an independent server, or a server cluster composed of multiple servers) and the like.
  • the computer device 2 at least includes, but is not limited to, a memory 21 , a processor 22 , a network interface 23 , and a causal inference-based grouping model building system 40 that can communicate with each other through a system bus. in:
  • the memory 21 includes at least one type of computer-readable storage medium, and the readable storage medium includes flash memory, hard disk, multimedia card, card-type memory (for example, SD or DX memory, etc.), random access memory ( RAM), static random access memory (SRAM), read only memory (ROM), electrically erasable programmable read only memory (EEPROM), programmable read only memory (PROM), magnetic memory, magnetic disk, optical disk, etc.
  • the memory 21 may be an internal storage unit of the computer device 2 , such as a hard disk or memory of the computer device 2 .
  • the memory 21 can also be an external storage device of the computer device 2, such as a plug-in hard disk equipped on the computer device 2, a smart memory card (Smart Media Card, SMC), a secure digital (Secure Digital, SD) card, flash memory card (Flash Card), etc.
  • the storage 21 may also include both the internal storage unit of the computer device 2 and its external storage device.
  • the memory 21 is usually used to store the operating system and various application software installed in the computer device 2, such as the program codes of the causal inference-based grouping model construction system 40 of the above-mentioned embodiment.
  • the memory 21 can also be used to temporarily store various types of data that have been output or will be output.
  • the processor 22 may be a central processing unit (Central Processing Unit, CPU), a controller, a microcontroller, a microprocessor, or other data processing chips.
  • the processor 22 is generally used to control the overall operation of the computer device 2 .
  • the processor 22 is used to run the program code stored in the memory 21 or process data, for example, run the causal inference-based grouping model construction system 40, so as to implement the causal inference-based grouping model construction method of the above-mentioned embodiment.
  • the network interface 23 may include a wireless network interface or a wired network interface, and the network interface 23 is generally used to establish a communication connection between the computer device 2 and other electronic devices.
  • the network interface 23 is used to connect the computer device 2 with an external terminal through a network, and establish a data transmission channel and a communication connection between the computer device 2 and an external terminal.
  • the network can be an enterprise intranet (Intranet), Internet (Internet), Global System of Mobile communication (Global System of Mobile communication, GSM), broadband code division multiple access (Wideband Code Division Multiple Access, WCDMA), 4G network, 5G Internet, Bluetooth (Bluetooth), Wi-Fi and other wireless or wired networks.
  • FIG. 7 only shows the computer device 2 having components 21-23 and a causal inference-based grouping model building system 40, but it should be understood that it is not required to implement all the components shown, and can be replaced by Implement more or fewer components.
  • the causal inference-based grouping model construction system 40 stored in the memory 21 can also be divided into one or more program modules, and the one or more program modules are stored in the memory 21, And it is executed by one or more processors (processor 22 in this embodiment) to complete the application.
  • FIG. 4 shows a schematic diagram of the program modules of Embodiment 2 of the system 40 for constructing a grouping model based on causal inference.
  • the system for building a grouping model 40 based on causal inference can be divided into the first acquisition Module 400 , first model processing module 402 , first determination module 404 and optimization module 406 .
  • the program module referred to in this application refers to a series of computer program instruction segments capable of completing specific functions, which is more suitable than a program to describe the execution process of the causal inference-based grouping model construction system 40 in the computer device 2 .
  • the specific functions of the program modules 400-406 have been described in detail in the second embodiment, and will not be repeated here.
  • This embodiment also provides a computer-readable storage medium, such as flash memory, hard disk, multimedia card, card-type memory (for example, SD or DX memory, etc.), random access memory (RAM), static random access memory (SRAM), only Read memory (ROM), electrically erasable programmable read-only memory (EEPROM), programmable read-only memory (PROM), magnetic memory, magnetic disk, optical disk, server, App application store, etc., on which computer programs are stored, The corresponding functions are realized when the program is executed by the processor.
  • the computer-readable storage medium may be non-volatile or volatile.
  • the computer-readable storage medium of this embodiment is used to store the system 40 for constructing a grouping model based on causal inference, and when executed by a processor, realizes the method for constructing a grouping model based on causal inference in the above-mentioned embodiment.

Abstract

Procédé de construction de modèle de regroupement basé sur une inférence causale, consistant : à entrer une pluralité de données d'échantillon de multiples patients d'échantillon dans un modèle à former, et à fournir en sortie, au moyen dudit modèle, un indice de tendance de chaque patient d'échantillon pour des données de résultat de regroupement de patients d'échantillon correspondantes et de multiples valeurs de récompense cumulatives attendues d'échantillon correspondant à chaque patient d'échantillon ; à déterminer, parmi les multiples valeurs de récompense cumulatives attendues d'échantillon, une valeur de récompense cumulative attendue d'échantillon cible de chaque patient d'échantillon ; à régler des paramètres de modèle dans ledit modèle sur la base d'une fonction de perte prédéfinie, de l'indice de tendance de chaque patient d'échantillon et d'une valeur de récompense cumulative attendue d'échantillon cible correspondante, de façon à obtenir un modèle de regroupement. Une pluralité de données d'échantillon est formée au moyen de la combinaison d'un modèle à former avec une analyse d'inférence causale, de l'élimination d'un écart de sélection pour des données de résultat de regroupement de patients, de telle sorte que l'ajustement de modèle est plus raisonnable et que la précision d'application d'un modèle formé est plus élevée.
PCT/CN2022/074389 2021-09-30 2022-01-27 Procédé de construction de modèle de regroupement basé sur une inférence causale et procédé de traitement de données médicales WO2023050668A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202111156355.6A CN113782192A (zh) 2021-09-30 2021-09-30 基于因果推断的分群模型构建方法和医疗数据处理方法
CN202111156355.6 2021-09-30

Publications (1)

Publication Number Publication Date
WO2023050668A1 true WO2023050668A1 (fr) 2023-04-06

Family

ID=78854417

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/074389 WO2023050668A1 (fr) 2021-09-30 2022-01-27 Procédé de construction de modèle de regroupement basé sur une inférence causale et procédé de traitement de données médicales

Country Status (2)

Country Link
CN (1) CN113782192A (fr)
WO (1) WO2023050668A1 (fr)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113782192A (zh) * 2021-09-30 2021-12-10 平安科技(深圳)有限公司 基于因果推断的分群模型构建方法和医疗数据处理方法

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109858630A (zh) * 2019-02-01 2019-06-07 清华大学 用于强化学习的方法和设备
US20190303765A1 (en) * 2018-03-30 2019-10-03 Visa International Service Association Method, System, and Computer Program Product for Implementing Reinforcement Learning
CN111785366A (zh) * 2020-06-29 2020-10-16 平安科技(深圳)有限公司 患者治疗方案的确定方法、装置及计算机设备
CN112115322A (zh) * 2020-09-25 2020-12-22 平安科技(深圳)有限公司 用户分群方法、装置、电子设备及存储介质
CN113255735A (zh) * 2021-04-29 2021-08-13 平安科技(深圳)有限公司 患者用药方案的确定方法及确定装置
CN113270189A (zh) * 2021-05-19 2021-08-17 复旦大学附属肿瘤医院 一种基于强化学习的肿瘤治疗辅助决策方法
CN113421653A (zh) * 2021-06-23 2021-09-21 平安科技(深圳)有限公司 医疗信息的推送方法及装置、存储介质、计算机设备
CN113782192A (zh) * 2021-09-30 2021-12-10 平安科技(深圳)有限公司 基于因果推断的分群模型构建方法和医疗数据处理方法

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111696661A (zh) * 2020-05-13 2020-09-22 平安科技(深圳)有限公司 患者分群模型构建方法、患者分群方法及相关设备

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20190303765A1 (en) * 2018-03-30 2019-10-03 Visa International Service Association Method, System, and Computer Program Product for Implementing Reinforcement Learning
CN109858630A (zh) * 2019-02-01 2019-06-07 清华大学 用于强化学习的方法和设备
CN111785366A (zh) * 2020-06-29 2020-10-16 平安科技(深圳)有限公司 患者治疗方案的确定方法、装置及计算机设备
CN112115322A (zh) * 2020-09-25 2020-12-22 平安科技(深圳)有限公司 用户分群方法、装置、电子设备及存储介质
CN113255735A (zh) * 2021-04-29 2021-08-13 平安科技(深圳)有限公司 患者用药方案的确定方法及确定装置
CN113270189A (zh) * 2021-05-19 2021-08-17 复旦大学附属肿瘤医院 一种基于强化学习的肿瘤治疗辅助决策方法
CN113421653A (zh) * 2021-06-23 2021-09-21 平安科技(深圳)有限公司 医疗信息的推送方法及装置、存储介质、计算机设备
CN113782192A (zh) * 2021-09-30 2021-12-10 平安科技(深圳)有限公司 基于因果推断的分群模型构建方法和医疗数据处理方法

Also Published As

Publication number Publication date
CN113782192A (zh) 2021-12-10

Similar Documents

Publication Publication Date Title
CN112071425B (zh) 一种数据处理方法、装置、计算机设备及存储介质
AU2012245343B2 (en) Predictive modeling
US11250954B2 (en) Patient readmission prediction tool
US9443002B1 (en) Dynamic data analysis and selection for determining outcomes associated with domain specific probabilistic data sets
US20200082918A1 (en) System and methd of social-behavioral roi calculation and optimization
US20170004279A1 (en) Long-term healthcare cost predictions using future trajectories & machine learning
WO2021151327A1 (fr) Procédé et appareil de traitement de données de triage, et dispositif et support
CN111144658B (zh) 医疗风险预测方法、装置、系统、存储介质与电子设备
WO2022227198A1 (fr) Procédé et dispositif permettant de déterminer un régime médicamenteux d'un patient
CN111696661A (zh) 患者分群模型构建方法、患者分群方法及相关设备
US20200058408A1 (en) Systems, methods, and apparatus for linking family electronic medical records and prediction of medical conditions and health management
Hilbert et al. Using decision trees to manage hospital readmission risk for acute myocardial infarction, heart failure, and pneumonia
WO2023050668A1 (fr) Procédé de construction de modèle de regroupement basé sur une inférence causale et procédé de traitement de données médicales
CN112447270A (zh) 一种用药推荐方法、装置、设备及存储介质
WO2020089732A2 (fr) Prédicteur d'utilisation de ressources de traitement de patient
US20150019255A1 (en) Systems and methods for primary admissions analysis
CN115295115A (zh) 基于深度学习的丙戊酸钠血药浓度预测方法及装置
CN114694355B (zh) 远程预警方法及系统
WO2020188043A1 (fr) Procédé et système d'évaluation des coûts basée sur l'activité dicté par le temps dans un établissement de soins de santé de manière efficace et évolutive
WO2023178789A1 (fr) Procédé et appareil d'optimisation de réseau d'estimation de risque de maladie, support et dispositif
CN111368412B (zh) 用于护理需求预测的仿真模型构建方法和装置
Katz-Rogozhnikov et al. Toward Comprehensive Attribution of Healthcare Cost Changes
CN113782216B (zh) 一种失能权重确定方法及装置、电子设备、存储介质
US20230020908A1 (en) Machine learning models for automated selection of executable sequences
US20230105348A1 (en) System for adaptive hospital discharge

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22874071

Country of ref document: EP

Kind code of ref document: A1