WO2022227294A1 - 一种基于多模态融合的疾病风险预测方法和系统 - Google Patents
一种基于多模态融合的疾病风险预测方法和系统 Download PDFInfo
- Publication number
- WO2022227294A1 WO2022227294A1 PCT/CN2021/106860 CN2021106860W WO2022227294A1 WO 2022227294 A1 WO2022227294 A1 WO 2022227294A1 CN 2021106860 W CN2021106860 W CN 2021106860W WO 2022227294 A1 WO2022227294 A1 WO 2022227294A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- data
- features
- risk prediction
- disease risk
- unstructured
- Prior art date
Links
- 230000004927 fusion Effects 0.000 title claims abstract description 71
- 201000010099 disease Diseases 0.000 title claims abstract description 69
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 title claims abstract description 69
- 238000000034 method Methods 0.000 title claims abstract description 50
- 238000013058 risk prediction model Methods 0.000 claims abstract description 27
- 238000012545 processing Methods 0.000 claims abstract description 13
- 238000000605 extraction Methods 0.000 claims description 54
- 238000004140 cleaning Methods 0.000 claims description 25
- 238000004590 computer program Methods 0.000 claims description 15
- 238000011176 pooling Methods 0.000 claims description 10
- 230000015654 memory Effects 0.000 claims description 8
- 238000012549 training Methods 0.000 claims description 7
- 238000013075 data extraction Methods 0.000 claims description 6
- 230000007246 mechanism Effects 0.000 claims description 5
- 238000013528 artificial neural network Methods 0.000 claims description 4
- 238000012217 deletion Methods 0.000 claims description 4
- 230000037430 deletion Effects 0.000 claims description 4
- 238000007781 pre-processing Methods 0.000 claims description 2
- 230000002159 abnormal effect Effects 0.000 claims 1
- 238000009826 distribution Methods 0.000 description 8
- 238000011160 research Methods 0.000 description 6
- 238000010586 diagram Methods 0.000 description 5
- 230000036541 health Effects 0.000 description 4
- 230000002457 bidirectional effect Effects 0.000 description 3
- 238000013527 convolutional neural network Methods 0.000 description 3
- 238000011161 development Methods 0.000 description 3
- 230000018109 developmental process Effects 0.000 description 3
- 230000006870 function Effects 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 230000011218 segmentation Effects 0.000 description 3
- 238000004458 analytical method Methods 0.000 description 2
- 238000013135 deep learning Methods 0.000 description 2
- 239000003814 drug Substances 0.000 description 2
- 238000005065 mining Methods 0.000 description 2
- 238000010606 normalization Methods 0.000 description 2
- 208000004998 Abdominal Pain Diseases 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 238000013480 data collection Methods 0.000 description 1
- 238000007418 data mining Methods 0.000 description 1
- 238000013499 data model Methods 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 230000007812 deficiency Effects 0.000 description 1
- 230000006866 deterioration Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 230000010365 information processing Effects 0.000 description 1
- 230000001788 irregular Effects 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000001537 neural effect Effects 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 238000003672 processing method Methods 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 230000006403 short-term memory Effects 0.000 description 1
- 208000024891 symptom Diseases 0.000 description 1
- 230000009897 systematic effect Effects 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 230000001755 vocal effect Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16H—HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
- G16H50/00—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
- G16H50/30—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for calculating health indices; for individual health risk assessment
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16H—HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
- G16H10/00—ICT specially adapted for the handling or processing of patient-related medical or healthcare data
- G16H10/60—ICT specially adapted for the handling or processing of patient-related medical or healthcare data for patient-specific data, e.g. for electronic patient records
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/35—Clustering; Classification
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16H—HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
- G16H50/00—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
- G16H50/20—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for computer-aided diagnosis, e.g. based on medical expert systems
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16H—HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
- G16H50/00—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
- G16H50/70—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for mining of medical data, e.g. analysing previous cases of other patients
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02A—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
- Y02A90/00—Technologies having an indirect contribution to adaptation to climate change
- Y02A90/10—Information and communication technologies [ICT] supporting adaptation to climate change, e.g. for weather forecasting or climate simulation
Definitions
- the present application relates to the field of medical big data information processing, and in particular, to a disease risk prediction method and system based on multimodal fusion.
- EHRs Electronic health records
- Digitization and subsequent analysis of medical records constitute an area of digital transformation aimed at collecting multiple medical information about patients in the form of EHRs, including digitized measurements (lab results), verbal descriptions (symptoms and notes, vital signs, etc.) , images (X-rays, CT and MR scans, etc.) and record the patient's treatment process. This digitization creates opportunities to mine health records to improve quality of care and clinical outcomes.
- Electronic health records contain structured and unstructured data with important research and clinical value.
- EHR data With the standardization and digitization of a large amount of EHR data, it is necessary to mine a large amount of multi-source heterogeneous data to establish a risk prediction model to achieve personalized medicine. much needed.
- Most previous attempts have been built on structured EHR fields, and a lot of information in unstructured text data is lost.
- the problem of unbalanced number and distribution of data sets Data collection without purpose often results in that the integrity, accuracy and granularity of recorded data cannot form a systematic system, resulting in missing and irregular data. Therefore, it takes a certain amount of manpower and material resources to collect data. Limited by time and financial resources, the number of good samples that can be obtained is limited. For example, in some embodiments of the present invention, the number of good samples is only 1300, and the distribution of positive and negative samples is not balanced, which will greatly affect the deep neural network. Learning and training of networks.
- the present invention uses the stacked Transformer encoder modules to effectively vectorize the text medical records, which can effectively capture the rich semantic relations contained in the word sequence before and after the long text, and provide medical entities. make the correct representation. Then, the multi-source heterogeneous data is fused at the feature level, and the characteristics of different modal data are fully considered, and then the patient outcome is predicted.
- the present invention provides a method for processing EHR data (including structured data and unstructured data), constructs a disease risk prediction model based on multimodal fusion, a method and system for prediction using the model, and Software devices that implement these functions, etc.
- the invention improves the predictability of the patient's outcome by fully integrating and mining the information of the patient's demographic information, treatment information, diagnostic information, laboratory information and related text treatment medical records, and can effectively help doctors to provide effective reference information. Predict the development of the patient's condition, assist the doctor to formulate the corresponding treatment plan, help the treatment in time, and prevent the disease from developing in the direction of deterioration. At the same time, patients can be shown the development direction of the disease after personalized treatment at each clinical visit to improve their enthusiasm for treatment.
- Multimodal data refers to data collected on multiple different devices or scenarios. Data sets in the real world are often multimodal, for example: a story can be described by text narration as well as images or audio; a document can be represented by multiple different languages or user reviews, etc. .
- the establishment of the multimodal database aims to obtain its important features and representative retrieval labels by analyzing and processing multimodal data, and based on this, establish a database that is convenient for subsequent data retrieval.
- Unstructured data refers to data without a fixed structure, such as office documents, text, pictures, various reports, images, and audio and video information in all formats.
- Unstructured data in medicine includes medical images, electrocardiograms, text medical records, etc.
- Structured data traditional relational data model, row data, data stored in the database, data that can be represented by a two-dimensional table structure, for example, data stored in csv, excel, two-dimensional table.
- the present invention provides the following technical features, and the combination of one or more of the following technical features constitutes the technical solution of the present invention.
- the present invention provides a disease risk prediction method based on multimodal fusion, the method comprising:
- the data includes structured data and unstructured data; in the embodiment of the present invention, the unstructured data especially refers to text;
- the disease risk prediction model further includes the step of performing data cleaning before extracting the structured data features and the unstructured data features;
- the data cleaning includes replacing outliers, using the mean to complete missing values, and deleting dirty data.
- a fully convolutional network (Fully Convolutional Networks, FCN) is used to extract structured data features.
- BERT Bidirectional Encoder Representations from Transformers
- the operation of extracting the fusion feature includes: parallelizing the unstructured data feature and the structured data feature along a specified dimension, and adopting a synthetic minority oversampling technique (Synthetic Minority Oversampling Technique, SMOTE)
- SMOTE Synthetic Minority Oversampling Technique
- the fused features are input as input to fully connected dence layers, and then disease risk prediction is performed by Softmax classifier.
- the present invention adopts the weighting of the cross-entropy loss and the hinge loss to jointly constrain the model.
- Cross-entropy loss can measure the degree of difference between two different probability distributions in the same random variable. The smaller the value of cross-entropy loss, the closer the two probability distributions are.
- Hinge loss is specially used for binary classification problems. It not only requires the classification to be correct, but also the loss will be as small as possible when the confidence is high enough. Since the hinge loss not only measures the fitting degree of the model to the training data, but also measures the complexity of the model itself by adding a regularization term, so the fitting risk can be greatly reduced.
- the present invention provides a method for processing EHR data, comprising:
- EHR data including structured data and unstructured data
- the extracted fusion feature data is used as the data to be identified for medical purposes.
- the data cleaning includes the replacement of outliers, the use of mean values to complete missing values, and the deletion of dirty data; preferably, the unstructured data is text.
- FCN is used to extract structured data features
- BERT is used to extract unstructured features
- the operation of extracting the fusion feature includes: parallelizing the unstructured data feature and the structured data feature along a specified dimension, and using SMOTE to analyze the minority class sample data and newly generate the class The method of sample to reduce the imbalance rate, and then use the segmentation pooling operation to extract the fusion features.
- a method for constructing a disease risk prediction model of the present invention includes:
- EHR data including structured and unstructured data, of patients with known disease risk outcomes; construct datasets, including structured and unstructured data, with known EHR data The final result builds the label set;
- Constructing a disease risk prediction network including: constructing a feature extraction module for structured data extraction, a feature extraction module for unstructured data extraction, and a feature fusion module, structured data feature extraction module and unstructured data feature extraction After the modules are connected in parallel, they are connected in series at the decision layer of the feature fusion module; the disease risk prediction network is implemented based on the Pytorch framework;
- the label set as the label, use the data set (structured data set and unstructured data set) to train the constructed disease risk prediction network, and construct the disease risk prediction model;
- the data cleaning before constructing the data set, it further includes a step of data cleaning on the acquired EHR data, and the data cleaning includes replacing outliers, complementing missing values with mean, and deleting dirty data.
- the feature extraction module for extracting structured data is an FCN module; the feature extraction module for extracting unstructured data is a BERT module (transformer module).
- the feature fusion module executes: parallelizing unstructured data features and structured data features along a specified dimension, and using SMOTE to analyze minority class sample data and generate new samples of this class. Reduce the imbalance rate, and then use the segmented pooling operation to extract the fusion features;
- the fused feature when using the dataset for training, is used as input to the fully connected layer to train the Softmax classifier.
- the present invention also includes the multimodal fusion-based disease risk prediction model constructed by the third aspect.
- the present invention provides a risk prediction system based on multimodal fusion, the system comprising:
- a feature extraction module which is used to perform feature extraction on EHR data to obtain unstructured data features and structured data features
- a feature fusion module which is used to fuse unstructured data features and structured data features and extract fused features
- Classification module which takes the extracted fusion features as input, and obtains disease risk prediction results.
- the feature extraction module includes a structured data feature extraction module and an unstructured data feature extraction module
- the structured data feature extraction module uses the preprocessed structured data as the input of the FCN, maps the data to each latent semantic node, and obtains the structured data features.
- the unstructured data feature extraction module uses BERT to perform feature extraction on the preprocessed unstructured data; preferably, the BERT is composed of a BERT Encoder, and the BERT Encoder is composed of multiple layers of BERT Layers, and the BERT Layer of each layer is composed of Both are Encoder Blocks in Transformer; each encoder layer contains two layers, namely the self-attention mechanism layer and the feedforward neural network layer.
- the feature fusion module parallelizes the unstructured data features and the structured data features along a specified dimension, and adopts SMOTE to analyze the minority class sample data and generate new samples of this class. Reduce the imbalance rate, and then use the segmented pooling operation to extract the fusion features.
- the classification module inputs the fusion feature as an input to the fully connected layer, and then performs classification through the Softmax classifier to obtain a disease risk prediction result.
- the system further includes a data acquisition module for acquiring EHR data.
- the system further includes a data cleaning module, which is configured to preprocess the EHR data after acquiring the EHR data and before performing feature extraction on the EHR data, the preprocessing includes The EHR data cleaning module performs the operations of replacing outliers and using the mean to complete missing values and delete dirty data.
- the system further includes a result output module for outputting disease risk prediction results.
- the present invention provides a computer device, comprising a memory and a processor, the memory stores a computer program, and the processor implements the above-mentioned first aspect and the present invention when the processor executes the computer program. /or the steps of the method of any one of the second aspect and/or the third aspect.
- the present invention provides a computer-readable storage medium on which computer program instructions are stored, and when the program instructions are executed by a processor, implement the above-mentioned first and/or second aspects of the present invention and/or the steps of any of the methods of the third aspect.
- the invention provides an end-to-end patient outcome prediction model.
- the read data is used as the input of the model, and the corresponding data is mined and analyzed in combination with the deep learning method, and the output is the prediction. event outcomes of patients. It can effectively help doctors provide effective reference information, predict the development of the patient's condition, and help in timely treatment. At the same time increase the enthusiasm of patients to cooperate with treatment.
- the invention adopts the bidirectional language model BERT to perform feature extraction on medical texts, and can perform parallel computation on multiple sets of inputs to capture different subspace information.
- the attention mechanism is introduced to help the model obtain contextual information more effectively, learn the word dependencies within the sentence, and capture the internal structure of the sentence.
- data such as Chinese medical question and answer, Chinese medical encyclopedia and Chinese electronic medical record are used, and medical entities such as "abdominal pain" can be represented by more effective vectorization.
- the invention adopts multimodal fusion technology to preprocess, analyze and mine data such as electronic medical records of patients, past medical history information, and text records of patient medical records, and constructs a risk prediction model for predicting patient outcomes, which is for the utilization of clinical real data, Disease outcome assessment provides an aid to help physicians personalize treatment options for each patient.
- FIG. 1 is a flowchart of a method for processing EHR data in a first embodiment of the present invention.
- FIG. 2 is a structural diagram of a system for processing EHR data in the first embodiment of the present invention.
- FIG. 3 is a functional flowchart of a feature fusion module in one or more embodiments of the present invention.
- FIG. 4 is a flowchart of a method for predicting disease risk based on multimodal fusion in a third embodiment of the present invention.
- Figure 5 is a functional flow diagram of a disease risk prediction model in one or more embodiments of the present invention.
- FIG. 6 is a structural diagram of a risk prediction system based on multimodal fusion in a fourth embodiment of the present invention.
- FIG. 7 is a structural diagram of a risk prediction system based on multimodal fusion in a fourth embodiment of the present invention.
- FIG. 8 is a structural diagram of a risk prediction system based on multimodal fusion in a fourth embodiment of the present invention.
- first, second, third, etc. may be used in this application to describe various information, such information should not be limited by these terms. These terms are only used to distinguish the same type of information from each other.
- first information may also be referred to as the second information, and similarly, the second information may also be referred to as the first information, without departing from the scope of the present application.
- word “if” as used herein can be interpreted as "at the time of” or “when” or “in response to determining.”
- first embodiment may be combined with a second embodiment, so long as the particular features, structures, functions or characteristics associated with these embodiments or specific implementations are not mutually exclusive.
- the present invention provides a method for processing EHR data, comprising: acquiring EHR data, the data includes structured data and unstructured data;
- EHR data The processing flow of EHR data is shown in Figure 1, including: data processing of structured data and unstructured data respectively, including data cleaning of structured data and unstructured data respectively, and cleaning of the cleaned data.
- Feature extraction is performed on structured data and unstructured data respectively, and the unstructured data features and structured data features extracted respectively are fused to extract fusion features;
- the extracted fusion feature data is used as the data to be identified for medical purposes.
- the present invention also provides a system for processing EHR data, the core modules of which include: a feature extraction module and a feature fusion module;
- the system may further include a data cleaning module, as shown in FIG. 2 .
- the data cleaning module performs the operations of replacing outliers and using the mean to complete missing values and deleting dirty data. For example, you can first screen the data for outliers, replace the outliers with null values, then perform a weighted average of the data, and use the average to replace outliers and missing values, and spss can be used to clean the data.
- the feature extraction module performs feature extraction on structured data and unstructured data (such as text) contained in the EHR data; optionally, the feature extraction module includes a structured data feature extraction module and an unstructured data feature extraction module.
- the structured data feature extraction module uses the cleaned structured data as the input of the FCN, maps the data to each implicit semantic node, and obtains the structured data features; in this embodiment, the structured data feature extraction module passes through the Dence layer. To learn the weight W, and then obtain the reset feature of the structured data, due to the discreteness of the data, the position information between the features has little influence on the decision-making, so in this process, the position information can be chosen to be discarded.
- the unstructured data feature extraction module uses BERT to extract features from the cleaned unstructured text data.
- the BERT is composed of BERT Encoder, BERT Encoder is composed of multiple layers of BERT Layer, and the BERT Layer of each layer is the Encoder Block in the Transformer; each encoder layer contains two layers, which are the self-attention mechanism layer and the feedforward neural layer. Network layer.
- the stacked Transformer encoder module is used, and the word embedding tensor, sentence block tensor and position encoding tensor are obtained respectively to extract the semantics of medical text data. information, sentence information and location information, and the vectorized representation of the text medical records is calculated.
- connection layer parallels the structured data features and unstructured data features along the specified dimension, and uses SMOTE to analyze the minority class sample data and generate new samples to reduce the Unbalance rate, and extract important information from different structured data according to different data types by adding segmentation pooling operation. Since medical data usually has a small sample size, batch normalization will be affected by the size of batch_size. Therefore, in the embodiment of the present invention, the output of each sub-layer adopts layer normalization.
- the present invention provides a method for constructing a disease risk prediction model, comprising:
- EHR data of patients with known disease risk outcomes includes structured data and unstructured data, and unstructured data mainly refers to text; construct data sets (structured data sets and text data sets) from their EHR data ), construct the label set with its final outcome;
- data cleaning is performed on the obtained EHR data, and the data cleaning includes the replacement of outliers, the use of mean values to complete missing values, and the deletion of dirty data;
- Constructing a disease risk prediction network including: constructing a feature extraction module (FCN) for extracting structured data, a feature extraction module (transformer module) for extracting unstructured data, a feature fusion module, structured data feature extraction module and After the unstructured data feature extraction modules are connected in parallel, they are connected in series at the decision layer of the feature fusion module, and the model architecture is implemented based on the Pytorch framework;
- FCN feature extraction module
- transformer module for extracting unstructured data
- feature fusion module for extracting unstructured data
- structured data feature extraction module After the unstructured data feature extraction modules are connected in parallel, they are connected in series at the decision layer of the feature fusion module, and the model architecture is implemented based on the Pytorch framework
- the disease risk prediction network constructed above is trained with the data set, and the disease risk prediction model is constructed; in this embodiment, the disease risk outcome is used as the label, and the fusion feature is used as input to the fully connected layer to train Softmax Classifiers to build disease risk prediction models.
- cross-entropy loss can measure the degree of difference between two different probability distributions in the same random variable. The smaller the value of cross-entropy loss, the closer the two probability distributions are.
- Hinge loss is specially used for binary classification problems. It not only requires the classification to be correct, but also the loss will be as small as possible when the confidence is high enough. Since the hinge loss not only measures the fitting degree of the model to the training data, but also measures the complexity of the model itself by adding a regularization term, so the fitting risk can be greatly reduced.
- the present invention provides a disease risk prediction method based on multimodal fusion, as shown in FIG. 4 , which includes:
- EHR data of patients to be predicted can include structured data and unstructured data (text);
- the execution steps of the disease risk prediction model include:
- the fusion features are fusion features of unstructured data features and structured data features
- a weighting of the cross-entropy loss and the hinge loss is employed to jointly constrain the model.
- Cross-entropy loss can measure the degree of difference between two different probability distributions in the same random variable. The smaller the value of cross-entropy loss, the closer the two probability distributions are.
- Hinge loss is specially used for binary classification problems. It not only requires the classification to be correct, but also the loss will be as small as possible when the confidence is high enough. Since the hinge loss not only measures the fitting degree of the model to the training data, but also measures the complexity of the model itself by adding a regularization term, so the fitting risk can be greatly reduced.
- the present invention provides a risk prediction system based on multimodal fusion, as shown in FIG. 6 , including: a feature extraction module, a feature fusion module and a classification module.
- the feature extraction module includes: a structured data extraction module and an unstructured data extraction module, as shown in Figure 7.
- the risk prediction system based on multimodal fusion may further include a data acquisition module and/or a data cleaning module and/or a result output module.
- the system may be as shown in FIG. 8 .
- the data cleaning module preprocesses the EHR data, including the analysis of outliers. Replacing and taking the mean fills in missing values and removes dirty data.
- the cleaned and processed unstructured data such as text data
- the core of the model consists of BERT Encoder.
- BERT Encoder consists of multiple layers of BERT Layer.
- the BERT Layer of each layer is actually an Encoder Block in Transformer.
- Each encoder layer contains two layers, a self-attention mechanism layer and a feed-forward neural network layer.
- the cleaned structured data is subjected to feature extraction in the structured data feature extraction module, wherein the cleaned structured data is used as the input of the FCN, and the original data is mapped to each latent semantic node to obtain structured data features.
- the fusion module splices and parallelizes the features of the structured data and the text data along the specified dimension, and uses SMOTE to analyze the minority class sample data and generate this class of samples to reduce the imbalance rate. Then, the segmentation pooling operation is used to extract important information of different structural data to obtain fusion features.
- the classification module takes the fused features extracted after fusion as input to the fully connected layer, and then predicts the patient's outcome through the Softmax classifier.
- the predicted solution obtained by the classification module can be output through the result output module.
- the system described in this embodiment can implement the disease risk prediction method based on multimodal fusion described in the third embodiment.
- the present invention provides a computer device, including a memory and a processor, the memory stores a computer program, and the processor implements the computer program described in the first embodiment when the processor executes the computer program. the steps of the method;
- the present invention provides a computer-readable storage medium on which computer program instructions are stored, and when the program instructions are executed by a processor, implement the steps of the method described in the first embodiment;
- program instructions when executed by the processor, implement the steps of the method described in the third embodiment.
- the device embodiments described above are only illustrative, wherein the units described as separate components may or may not be physically separated, and the components shown as units may or may not be physical units, that is, they may be located in One place, or it can be distributed over multiple network elements. Some or all of the modules may be selected according to actual needs to achieve the purpose of this embodiment. Those of ordinary skill in the art can understand and implement it without creative effort.
Landscapes
- Engineering & Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Medical Informatics (AREA)
- Public Health (AREA)
- General Health & Medical Sciences (AREA)
- Primary Health Care (AREA)
- Epidemiology (AREA)
- Data Mining & Analysis (AREA)
- Biomedical Technology (AREA)
- Databases & Information Systems (AREA)
- Pathology (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Medical Treatment And Welfare Office Work (AREA)
Abstract
Description
Claims (10)
- 一种基于多模态融合的疾病风险预测方法,其特征在于,所述方法包括:获取患者的EHR数据,包括结构化数据和非结构化数据;将EHR数据输入疾病风险预测模型,得到疾病风险预测结果;输出疾病风险预测结果;其中,疾病风险预测模型执行步骤,包括:提取结构化数据特征和非结构化数据特征;融合结构化数据特征和非结构化数据特征,提取融合特征;对融合特征进行决策,得到疾病风险预测结果。
- 根据权利要求1所述的方法,其特征在于,采用全卷积网络提取结构化数据特征;优选地,采用BERT提取非结构化特征。
- 根据权利要求1或2所述的方法,其特征在于,所述提取融合特征的操作包括:将非结构化数据特征和结构化数据特征沿指定维度进行并联,采用SMOTE通过对少数类样本数据进行分析并新生成该类样本的方法来降低不平衡率,然后采用分段池化操作,提取得到融合特征;优选地,进行预测时,将融合特征作为input输入到全连接层,然后通过Softmax分类器进行疾病风险预测;优选地,采用交叉熵损失和合页损失的加权来共同约束疾病风险预测模型。
- 根据权利要求1所述的方法,其特征在于,所述疾病风险预测模型在提取结构化数据特征和非结构化数据特征前还包括执行数据清洗的步骤;优选地,所述数据清洗包括对异常值的替换、采用均值对缺失值进行补全,以及删除脏数据;优选地,所述非结构化数据为文本。
- 一种基于多模态融合的风险预测系统,其特征在于,所述系统包括:特征提取模块,其用于对EHR数据进行特征提取,得到非结构化数据特征和结构化数据特征;特征融合模块,其用于对非结构化数据特征和结构化数据特征进行融合处理并提取得到融合特征;分类模块,其以提取的融合特征作为input,得到疾病风险预测结果。
- 根据权利要求5所述的系统,其特征在于,所述特征提取模块包括结构化数据特征提取模块和非结构化数据特征提取模块;其中,所述结构化数据特征提取模块以结构化数据作为FCN的input,将数据映射到各个隐语义节点,得到结构化数据特征;其中,所述非结构化数据特征提取模块采用BERT对非结构化数据进行特征提取;优选地,BERT由BERT Encoder组成,BERT Encoder由多层BERT Layer组成,每一层的BERT Layer均为Transformer中的Encoder Block;每一个encoder层包含两层,分别为自注意力机制层和前馈神经网络层;优选地,所述特征融合模块将非结构化数据特征和结构化数据特征沿指定维度进行并联,采用SMOTE通过对少数类样本数据进行分析并新生成该类样本的方法来降低不平衡率,然后采用分段池化操作,提取得到融合特征;优选地,分类模块将融合特征或结构化数据作为input输入到全连接层,然后通过Softmax分类器对患者的结局进行预测;优选地,所述系统还包括数据获取模块,其用于获取EHR数据;优选地,所述系统还包括数据清洗模块,其用于在获取EHR数据后、在对EHR数据进行特征提取前预处理EHR数据,所述预处理包括对所述EHR数据清洗模块执行对异常值替 换和采用均值对缺失值进行补全并删除脏数据的操作;优选地,所述系统还包括结果输出模块,其用于输出疾病风险预测结果。
- 一种处理EHR数据的方法,其特征在于,包括:获取EHR数据,所述数据包括结构化数据和非结构化数据;对结构化数据和非结构化数据分别进行数据处理,包括对结构化数据和非结构化数据分别进行数据清洗,对清洗后的结构化数据和非结构化数据分别进行特征提取,将分别提取得到的非结构化数据特征和结构化数据特征进行融合处理后提取融合特征;以提取的融合特征数据作为待识别数据用于医疗用途;优选地,所述数据清洗包括对异常值的替换、采用均值对缺失值进行补全,以及删除脏数据;优选地,所述非结构化数据为文本;优选地,提取结构化数据特征采用FCN;优选地,提取非结构化特征采用BERT;优选地,所述提取融合特征的操作包括:将非结构化数据特征和结构化数据特征沿指定维度进行并联,采用SMOTE通过对少数类样本数据进行分析并新生成该类样本的方法来降低不平衡率,然后采用分段池化操作,提取得到融合特征。
- 一种疾病风险预测模型的构建方法,其特征在于,包括:获取已知疾病风险结局的患者的EHR数据,所述数据包括结构化数据和非结构化数据;以获取的EHR数据构建数据集,包括结构化数据集和非结构化数据集,以已知的最终结局构建标签集;构建疾病风险预测网络,包括:构建对于结构化数据进行提取的特征提取模块、对于非结构化数据进行提取的特征提取模块、和特征融合模块,结构化数据特征提取模块和非结构化数据特征提取模块并联连接后在特征融合模块决策层进行串联连接;所述疾病风险 预测网络基于Pytorch框架实现;以标签集为标签,利用数据集训练构建的疾病风险预测网络,构建疾病风险预测模型;优选地,构建数据集前包括对获取的EHR数据进行数据清洗的步骤,数据清洗包括对异常值的替换、采用均值对缺失值进行补全,以及删除脏数据;优选地,对于结构化数据进行提取的特征提取模块为FCN模块;优选地,对于非结构化数据进行提取的特征提取模块为BERT模块;优选地,特征融合模块执行:将非结构化数据特征和结构化数据特征沿指定维度进行并联,采用SMOTE通过对少数类样本数据进行分析并新生成该类样本的方法来降低不平衡率,然后采用分段池化操作,提取得到融合特征;优选地,利用数据集训练时,以融合特征作为input输入到全连接层,训练Softmax分类器。
- 一种计算机设备,包括存储器和处理器,所述存储器存储有计算机程序,其特征在于,所述处理器执行所述计算机程序时实现权利要求1至4中任一项所述方法的步骤;和/或,所述处理器执行所述计算机程序时实现权利要求7中所述方法的步骤;和/或,所述处理器执行所述计算机程序时实现权利要求8中所述方法的步骤。
- 一种计算机可读存储介质,其上存储有计算机程序指令,其特征在于,该程序指令被处理器执行时实现权利要求1至4中任一项所述方法的步骤;和/或,该程序指令被处理器执行时实现权利要求7中所述方法的步骤;和/或,该程序指令被处理器执行时实现权利要求8中所述方法的步骤。
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US17/910,556 US20240203599A1 (en) | 2021-04-30 | 2021-07-16 | Method and system of for predicting disease risk based on multimodal fusion |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110486200.2 | 2021-04-30 | ||
CN202110486200.2A CN113241135B (zh) | 2021-04-30 | 2021-04-30 | 一种基于多模态融合的疾病风险预测方法和系统 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2022227294A1 true WO2022227294A1 (zh) | 2022-11-03 |
Family
ID=77131993
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2021/106860 WO2022227294A1 (zh) | 2021-04-30 | 2021-07-16 | 一种基于多模态融合的疾病风险预测方法和系统 |
Country Status (3)
Country | Link |
---|---|
US (1) | US20240203599A1 (zh) |
CN (1) | CN113241135B (zh) |
WO (1) | WO2022227294A1 (zh) |
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN115424724A (zh) * | 2022-11-04 | 2022-12-02 | 之江实验室 | 一种多模态图森林的肺癌淋巴结转移辅助诊断系统 |
CN116049397A (zh) * | 2022-12-29 | 2023-05-02 | 北京霍因科技有限公司 | 基于多模态融合的敏感信息发现并自动分类分级方法 |
CN116246774A (zh) * | 2023-03-15 | 2023-06-09 | 北京医准智能科技有限公司 | 一种基于信息融合的分类方法、装置及设备 |
CN117409930A (zh) * | 2023-12-13 | 2024-01-16 | 江西为易科技有限公司 | 基于ai技术实现的医疗康复数据处理方法及系统 |
CN117438023A (zh) * | 2023-10-31 | 2024-01-23 | 灌云县南岗镇卫生院 | 基于大数据的医院信息管理方法及系统 |
CN117992925A (zh) * | 2024-04-03 | 2024-05-07 | 成都新希望金融信息有限公司 | 基于多源异构数据和多模态数据的风险预测方法及装置 |
CN118213058A (zh) * | 2024-05-22 | 2024-06-18 | 山东拓庄医疗科技有限公司 | 一种基于数据分析的医疗器械故障检测管理系统及方法 |
Families Citing this family (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113241135B (zh) * | 2021-04-30 | 2023-05-05 | 山东大学 | 一种基于多模态融合的疾病风险预测方法和系统 |
CN113707309B (zh) * | 2021-08-31 | 2024-08-20 | 平安科技(深圳)有限公司 | 基于机器学习的疾病预测方法及装置 |
CN114067935B (zh) * | 2021-11-03 | 2022-05-20 | 广西壮族自治区通信产业服务有限公司技术服务分公司 | 一种流行病调查方法、系统、电子设备及存储介质 |
CN114203295B (zh) * | 2021-11-23 | 2022-05-20 | 国家康复辅具研究中心 | 脑卒中风险预测干预方法及系统 |
TWI829065B (zh) * | 2022-01-06 | 2024-01-11 | 沐恩生醫光電股份有限公司 | 資料融合系統及其操作方法 |
CN114566275A (zh) * | 2022-02-21 | 2022-05-31 | 山东大学齐鲁医院 | 一种基于混合现实的院前急救辅助系统 |
CN114463825B (zh) * | 2022-04-08 | 2022-07-15 | 北京邮电大学 | 基于多模态融合的人脸预测方法及相关设备 |
CN114822880B (zh) * | 2022-06-30 | 2023-02-28 | 北京超数时代科技有限公司 | 一种基于国产自主可控的医院诊疗信息系统 |
CN115131642B (zh) * | 2022-08-30 | 2022-12-27 | 之江实验室 | 一种基于多视子空间聚类的多模态医学数据融合系统 |
CN115862875B (zh) * | 2023-02-27 | 2024-02-09 | 四川大学华西医院 | 基于多类型特征融合的术后肺部并发症预测方法及系统 |
CN115844348A (zh) * | 2023-02-27 | 2023-03-28 | 山东大学 | 一种基于可穿戴设备的心脏骤停分级响应预警方法和系统 |
CN116612886A (zh) * | 2023-05-06 | 2023-08-18 | 广东省人民医院 | 一种脑卒中早期辅助诊断方法、系统、装置及存储介质 |
CN116798625A (zh) * | 2023-06-26 | 2023-09-22 | 清华大学 | 卒中风险筛查装置、电子设备和存储介质 |
CN117217807B (zh) * | 2023-11-08 | 2024-01-26 | 四川智筹科技有限公司 | 一种基于多模态高维特征的不良资产估值方法 |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20160342764A1 (en) * | 2015-05-19 | 2016-11-24 | Universidad De Vigo | System, computer-implemented method and computer program product for individualized multiple-disease quantitative risk assessment |
CN109117864A (zh) * | 2018-07-13 | 2019-01-01 | 华南理工大学 | 基于异构特征融合的冠心病风险预测方法、模型及系统 |
CN111916207A (zh) * | 2020-08-07 | 2020-11-10 | 杭州深睿博联科技有限公司 | 一种基于多模态融合的疾病识别方法及装置 |
CN113241135A (zh) * | 2021-04-30 | 2021-08-10 | 山东大学 | 一种基于多模态融合的疾病风险预测方法和系统 |
Family Cites Families (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108428478B (zh) * | 2018-02-27 | 2022-03-29 | 东北师范大学 | 基于异质医疗数据挖掘的甲状腺癌风险预测方法 |
CN109119130A (zh) * | 2018-07-11 | 2019-01-01 | 上海夏先机电科技发展有限公司 | 一种基于云计算的大数据健康管理系统及方法 |
CN111260209B (zh) * | 2020-01-14 | 2022-03-11 | 山东大学 | 电子病历与医学影像结合的心血管疾病风险预测评估系统 |
CN111680169A (zh) * | 2020-06-03 | 2020-09-18 | 国网内蒙古东部电力有限公司 | 一种基于bert模型技术的电力科技成果数据抽取方法 |
CN112199425A (zh) * | 2020-09-16 | 2021-01-08 | 北京好医生云医院管理技术有限公司 | 基于混合数据库结构的医疗大数据中心及其建设方法 |
CN112182243B (zh) * | 2020-09-27 | 2023-11-28 | 中国平安财产保险股份有限公司 | 基于实体识别模型构建知识图谱的方法、终端及存储介质 |
CN112365987B (zh) * | 2020-10-27 | 2023-06-06 | 平安科技(深圳)有限公司 | 诊断数据异常检测方法、装置、计算机设备及存储介质 |
CN112463922A (zh) * | 2020-11-25 | 2021-03-09 | 中国测绘科学研究院 | 一种风险用户识别方法及存储介质 |
CN112652386A (zh) * | 2020-12-25 | 2021-04-13 | 平安科技(深圳)有限公司 | 分诊数据处理方法、装置、计算机设备及存储介质 |
CN112633426B (zh) * | 2021-03-11 | 2021-06-15 | 腾讯科技(深圳)有限公司 | 处理数据类别不均衡的方法、装置、电子设备及存储介质 |
-
2021
- 2021-04-30 CN CN202110486200.2A patent/CN113241135B/zh active Active
- 2021-07-16 WO PCT/CN2021/106860 patent/WO2022227294A1/zh active Application Filing
- 2021-07-16 US US17/910,556 patent/US20240203599A1/en active Pending
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20160342764A1 (en) * | 2015-05-19 | 2016-11-24 | Universidad De Vigo | System, computer-implemented method and computer program product for individualized multiple-disease quantitative risk assessment |
CN109117864A (zh) * | 2018-07-13 | 2019-01-01 | 华南理工大学 | 基于异构特征融合的冠心病风险预测方法、模型及系统 |
CN111916207A (zh) * | 2020-08-07 | 2020-11-10 | 杭州深睿博联科技有限公司 | 一种基于多模态融合的疾病识别方法及装置 |
CN113241135A (zh) * | 2021-04-30 | 2021-08-10 | 山东大学 | 一种基于多模态融合的疾病风险预测方法和系统 |
Non-Patent Citations (1)
Title |
---|
LAILA RASMY; YANG XIANG; ZIQIAN XIE; CUI TAO; DEGUI ZHI: "Med-BERT: pre-trained contextualized embeddings on large-scale structured electronic health records for disease prediction", ARXIV.ORG, 22 May 2020 (2020-05-22), pages 1 - 23, XP081683869 * |
Cited By (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN115424724A (zh) * | 2022-11-04 | 2022-12-02 | 之江实验室 | 一种多模态图森林的肺癌淋巴结转移辅助诊断系统 |
CN116049397A (zh) * | 2022-12-29 | 2023-05-02 | 北京霍因科技有限公司 | 基于多模态融合的敏感信息发现并自动分类分级方法 |
CN116049397B (zh) * | 2022-12-29 | 2024-01-02 | 北京霍因科技有限公司 | 基于多模态融合的敏感信息发现并自动分类分级方法 |
CN116246774A (zh) * | 2023-03-15 | 2023-06-09 | 北京医准智能科技有限公司 | 一种基于信息融合的分类方法、装置及设备 |
CN116246774B (zh) * | 2023-03-15 | 2023-11-24 | 浙江医准智能科技有限公司 | 一种基于信息融合的分类方法、装置及设备 |
CN117438023A (zh) * | 2023-10-31 | 2024-01-23 | 灌云县南岗镇卫生院 | 基于大数据的医院信息管理方法及系统 |
CN117438023B (zh) * | 2023-10-31 | 2024-04-26 | 灌云县南岗镇卫生院 | 基于大数据的医院信息管理方法及系统 |
CN117409930A (zh) * | 2023-12-13 | 2024-01-16 | 江西为易科技有限公司 | 基于ai技术实现的医疗康复数据处理方法及系统 |
CN117409930B (zh) * | 2023-12-13 | 2024-02-13 | 江西为易科技有限公司 | 基于ai技术实现的医疗康复数据处理方法及系统 |
CN117992925A (zh) * | 2024-04-03 | 2024-05-07 | 成都新希望金融信息有限公司 | 基于多源异构数据和多模态数据的风险预测方法及装置 |
CN118213058A (zh) * | 2024-05-22 | 2024-06-18 | 山东拓庄医疗科技有限公司 | 一种基于数据分析的医疗器械故障检测管理系统及方法 |
Also Published As
Publication number | Publication date |
---|---|
CN113241135A (zh) | 2021-08-10 |
US20240203599A1 (en) | 2024-06-20 |
CN113241135B (zh) | 2023-05-05 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2022227294A1 (zh) | 一种基于多模态融合的疾病风险预测方法和系统 | |
CN111316281B (zh) | 基于机器学习的自然语言情境中数值数据的语义分类方法以及系统 | |
Tayarani | Applications of artificial intelligence in battling against covid-19: A literature review | |
CN108831559B (zh) | 一种中文电子病历文本分析方法与系统 | |
US11610678B2 (en) | Medical diagnostic aid and method | |
CN109460473B (zh) | 基于症状提取和特征表示的电子病历多标签分类方法 | |
US20200303072A1 (en) | Method and system for supporting medical decision making | |
WO2023078025A1 (zh) | 一种基于任务分解策略的发热待查辅助鉴别诊断系统 | |
Carchiolo et al. | Medical prescription classification: a NLP-based approach | |
CN113284572B (zh) | 多模态异构的医学数据处理方法及相关装置 | |
Kaswan et al. | AI-based natural language processing for the generation of meaningful information electronic health record (EHR) data | |
CN111477320B (zh) | 治疗效果预测模型的构建系统、治疗效果预测系统及终端 | |
Motwani et al. | Enhanced framework for COVID-19 prediction with computed tomography scan images using dense convolutional neural network and novel loss function | |
CN115862875B (zh) | 基于多类型特征融合的术后肺部并发症预测方法及系统 | |
CN116884612A (zh) | 疾病风险等级的智能分析方法、装置、设备及存储介质 | |
CN117542467B (zh) | 基于患者数据的专病标准数据库自动构建方法 | |
JP2023510667A (ja) | キャラクタ取得、ページ処理と知識グラフ構築方法及び装置、媒体 | |
CN113990489A (zh) | 一种中医药临床证候诊疗智能数据处理和分析挖掘系统 | |
CN114191665A (zh) | 机械通气过程中人机异步现象的分类方法和分类装置 | |
US20210174027A1 (en) | Assertion Detection in Multi-Labelled Clinical Text using Scope Localization | |
CN113658688A (zh) | 基于无分词深度学习的临床决策支持方法 | |
CN117079834A (zh) | 利用大型语言模型监控多中心临床评价执行偏差的方法 | |
CN116313141A (zh) | 一种基于知识图谱的不明原因发热智能问诊方法 | |
Basha et al. | Deep learning neural network (DLNN)-based classification and optimization algorithm for organ inflammation disease diagnosis | |
Agoston | Big data, artificial intelligence, and machine learning in neurotrauma |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
WWE | Wipo information: entry into national phase |
Ref document number: 17910556 Country of ref document: US |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 21938761 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 21938761 Country of ref document: EP Kind code of ref document: A1 |
|
32PN | Ep: public notification in the ep bulletin as address of the adressee cannot be established |
Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 19/04/2024) |