CN114795114B

CN114795114B - Carbon monoxide poisoning delayed encephalopathy prediction method based on multi-modal learning

Info

Publication number: CN114795114B
Application number: CN202210348106.5A
Authority: CN
Inventors: 邱航; 杨萍; 王利亚; 周德嘉; 胡智栩
Original assignee: University of Electronic Science and Technology of China
Current assignee: University of Electronic Science and Technology of China
Priority date: 2022-03-29
Filing date: 2022-03-29
Publication date: 2023-05-02
Anticipated expiration: 2042-03-29
Also published as: CN114795114A

Abstract

The invention belongs to the technical field of medical information, and particularly relates to a carbon monoxide poisoning delayed encephalopathy prediction method based on multi-modal learning, which constructs a characteristic dependency graph by combining a Bayesian network structure learning algorithm and doctor suggestions; based on the characteristic dependency graph, embedding the building graph into a feedforward neural network, and integrating the dependency relationship among the characteristics into a neural network structure to obtain a vector representation of the structural characteristics; and designing a feature fusion structure, obtaining vector representation fused with multi-mode information, inputting the vector into a nonlinear neural network layer, and finally obtaining a prediction result of whether a patient with carbon monoxide poisoning will have delayed encephalopathy. The invention fully fuses the multi-mode diagnosis and treatment information, only needs structured data as model input when being applied to early prediction, thus reducing the requirement of a prediction model on the integrity of the multi-mode data in practical application to a certain extent and solving the problem that accurate prediction cannot be realized under the condition of data loss.

Description

A prediction method for delayed encephalopathy in carbon monoxide poisoning based on multimodal learning

技术领域technical field

本发明属于医疗信息技术领域，尤其涉及一种基于多模态学习的一氧化碳中毒迟发性脑病预测方法。The invention belongs to the technical field of medical information, and in particular relates to a method for predicting delayed encephalopathy in carbon monoxide poisoning based on multimodal learning.

背景技术Background technique

一氧化碳中毒是日常生活中常见的意外突发事件。一氧化碳中毒患者在中毒症状缓解后，经过2-60天的假愈期后，会有10％-30％的几率发生以痴呆、精神症状和锥体外系症状为主的精神系统疾病，即迟发性脑病。迟发性脑病知晓率低，病情重且病程长，一旦发生迟发性脑病，患者将丧失劳动能力，日常生活需要他人护理，多数治疗时间长达半年或1年甚至更长，严重者会遗留认知功能障碍等永久性神经功能残疾，从而影响生活质量，加重经济负担，甚至“因病致贫”。因此，实现一氧化碳中毒迟发性脑病风险早期预测将有利于疾病的及时干预，降低其发病频率，有效降低该病对家庭及社会的负担。Carbon monoxide poisoning is a common accident in daily life. After the symptoms of carbon monoxide poisoning are relieved, after a 2-60 day false recovery period, there will be a 10%-30% chance of developing mental system diseases mainly dementia, mental symptoms and extrapyramidal symptoms, that is, delayed onset encephalopathy. The awareness rate of delayed encephalopathy is low, the condition is severe and the course of the disease is long. Once delayed encephalopathy occurs, the patient will lose the ability to work and needs other people's care in daily life. Most of the treatment time is as long as half a year or 1 year or even longer. Cognitive dysfunction and other permanent neurological disabilities, which affect the quality of life, increase the economic burden, and even "disease impoverishment". Therefore, realizing early prediction of the risk of CO poisoning delayed encephalopathy will facilitate the timely intervention of the disease, reduce its incidence frequency, and effectively reduce the burden of the disease on the family and society.

目前，已有的迟发性脑病预测主要采用传统医学统计方法对历史数据进行拟合，发现迟发性脑病患者在一氧化碳中毒早期的危险因素，但该方法单一，缺少准确性。At present, the existing delayed encephalopathy prediction mainly adopts the traditional medical statistical method to fit historical data, and finds the risk factors of patients with delayed encephalopathy in the early stage of carbon monoxide poisoning, but this method is single and lacks accuracy.

目前，临床检查指标、脑电图(EEG)、脑CT和脑MRI是发现一氧化碳中毒患者异常体征的有效手段。充分利用多模态数据(包括临床指标等结构化数据、脑电图、脑CT和脑MRI)能够有效提高迟发性脑病风险预测的准确性。然而，在实际应用中，由于在一氧化碳中毒早期往往存在患者部分检查数据缺失的问题，这将影响到迟发性脑病风险预测的准确性。At present, clinical examination indicators, electroencephalogram (EEG), brain CT and brain MRI are effective means to find abnormal signs in patients with carbon monoxide poisoning. Making full use of multimodal data (including structured data such as clinical indicators, EEG, brain CT and brain MRI) can effectively improve the accuracy of risk prediction for delayed encephalopathy. However, in practical applications, due to the lack of some examination data of patients in the early stage of carbon monoxide poisoning, this will affect the accuracy of the risk prediction of delayed encephalopathy.

发明内容Contents of the invention

为了解决上述现有技术中存在的技术问题，本发明提供了一种基于多模态学习的一氧化碳中毒迟发性脑病预测方法，拟解决背景技术中提到的存在检查数据丢失影响迟发性脑病风险预测的准确性的技术问题。In order to solve the technical problems existing in the above-mentioned prior art, the present invention provides a method for predicting delayed encephalopathy in carbon monoxide poisoning based on multimodal learning, which intends to solve the problem mentioned in the background technology that the loss of inspection data affects delayed encephalopathy Technical issues of the accuracy of risk predictions.

为解决上述技术问题，本发明采用的技术方案如下：In order to solve the problems of the technologies described above, the technical scheme adopted in the present invention is as follows:

一种基于多模态学习的一氧化碳中毒迟发性脑病预测方法，包括以下步骤：A method for predicting delayed encephalopathy in carbon monoxide poisoning based on multimodal learning, comprising the following steps:

步骤1：获取一氧化碳中毒患者的多模态数据，并对多模态数据进行预处理，得到一氧化碳中毒患者的数据集；Step 1: Obtain multimodal data of patients with carbon monoxide poisoning, and preprocess the multimodal data to obtain a data set of patients with carbon monoxide poisoning;

步骤2：基于步骤1中获取的数据集，通过贝叶斯网络结构学习算法以及医生建议构建特征依赖图，特征依赖图中，节点代表特征，边代表特征间的关系，具体体现为特征间的依赖关系；然后删除特征依赖图中表示一氧化碳中毒迟发性脑病标签的节点，保留代表特征的节点；将特征依赖图中出现的所有特征构成集合F，并基于特征依赖图的结构，得到对应的邻接矩阵A；Step 2: Based on the data set obtained in step 1, construct a feature dependency graph through the Bayesian network structure learning algorithm and doctor's advice. In the feature dependency graph, nodes represent features, and edges represent the relationship between features, which is specifically reflected in the relationship between features. Dependency relationship; then delete the node representing the label of carbon monoxide poisoning delayed encephalopathy in the feature dependency graph, and retain the node representing the feature; form a set F of all the features appearing in the feature dependency graph, and based on the structure of the feature dependency graph, get the corresponding adjacency matrix A;

步骤3：基于数据集中的数据类型，分别建立特征提取模型，基于特征提取模型提取多模态数据特征向量；Step 3: Based on the data types in the data set, respectively establish feature extraction models, and extract multimodal data feature vectors based on the feature extraction models;

步骤4：构建基于特征依赖图的图嵌入前馈神经网络；Step 4: Construct a graph embedding feed-forward neural network based on feature-dependent graphs;

步骤5：对多模态数据特征向量进行融合，得到融合向量C1和C2；Step 5: Fusion the multimodal data feature vectors to obtain fusion vectors C1 and C2;

步骤6：基于图嵌入前馈神经网络以及融合向量C1和C2建立一氧化碳中毒迟发性脑病预测模型；Step 6: Based on the graph embedding feed-forward neural network and the fusion vectors C1 and C2, a prediction model for CO poisoning delayed encephalopathy is established;

步骤7：对一氧化碳中毒迟发性脑病预测模型进行测试，选择符合预测性能要求的参数作为一氧化碳中毒迟发性脑病预测模型的参数。Step 7: Test the CO poisoning delayed encephalopathy prediction model, and select parameters that meet the prediction performance requirements as parameters of the CO poisoning delayed encephalopathy prediction model.

本发明所述的一氧化碳中毒迟发性脑病预测模型在训练时，充分融合多模态的诊疗信息，应用与早期预测时，仅需要结构化的数据作为模型输入，因此一定程度上降低了实际应用中预测模型对多模态数据完整性的需求，解决了在部分检查数据缺失的情况下无法实现准确预测的问题。The carbon monoxide poisoning delayed encephalopathy prediction model of the present invention fully integrates multimodal diagnosis and treatment information during training, and only needs structured data as model input during application and early prediction, thus reducing practical application to a certain extent The demand for multi-modal data integrity in the forecasting model solves the problem that accurate forecasting cannot be achieved when some inspection data is missing.

优选的，所述多模态数据包括图像、波形图和结构化数据；Preferably, the multimodal data includes images, waveform diagrams and structured data;

所述图像数据包括脑CT和脑MRI；所述波形图数据包括脑电图；所述结构化数据CO中毒首次就诊信息、辅助检查以及治疗情况。The image data includes brain CT and brain MRI; the waveform image data includes EEG; and the structured data CO poisoning first visit information, auxiliary examination and treatment conditions.

优选的，步骤1中若所获取的患者存在某一模态下的数据缺失，则进行缺失标记；Preferably, in step 1, if there is data missing in a certain modality in the acquired patient, mark the missing data;

所述步骤1还包括获取一氧化碳中毒患者的随访资料信息，构建预测标签；若患者在一氧化碳中毒后的一段时间内发生迟发性脑病，则预测标签为1，否则预测标签为0。The step 1 also includes obtaining follow-up information of patients with carbon monoxide poisoning, and constructing a prediction label; if the patient develops delayed encephalopathy within a period of time after carbon monoxide poisoning, the prediction label is 1, otherwise the prediction label is 0.

优选的，步骤3中所述的特征提取模型包括：特征提取模型M1、特征提取模型M2以及特征提取模型M3；Preferably, the feature extraction model described in step 3 includes: feature extraction model M1, feature extraction model M2 and feature extraction model M3;

所述特征提取模型M1由EEGNet的卷积模块构成，所述卷积模块由三个卷积层组成，依次是Conv2D，DepthwiseConv2D和SeparableConv2D；The feature extraction model M1 is composed of a convolution module of EEGNet, and the convolution module is composed of three convolution layers, followed by Conv2D, DepthwiseConv2D and SeparableConv2D;

所述特征提取模型M2和特征提取模型M3分别由第一卷积神经网络和第二卷积神经网络组成。The feature extraction model M2 and the feature extraction model M3 are respectively composed of a first convolutional neural network and a second convolutional neural network.

优选的，所述特征提取模型M1、特征提取模型M2和特征提取模型M3均从数据集中划分出的训练集1中训练得到；分别利用数据集中脑电图、脑CT和脑MRI数据训练得到三个分类器，调整特征提取模型的参数使得对应的分类器的分类性能达到预定的要求，再去除分类器中的sigmoid分类层，抽取三种模态下符合预定条件的分类器的卷积结构，构成特征提取模型M1、M2以及M3；所述特征提取模型M1、M2以及M3分别用于提取脑电图特征向量E2、脑CT特征向量E3和脑MRI特征向量E4。Preferably, the feature extraction model M1, the feature extraction model M2 and the feature extraction model M3 are all trained from the training set 1 divided in the data set; respectively use the EEG, brain CT and brain MRI data training in the data set to obtain three classifier, adjust the parameters of the feature extraction model so that the classification performance of the corresponding classifier meets the predetermined requirements, then remove the sigmoid classification layer in the classifier, and extract the convolution structure of the classifier that meets the predetermined conditions under the three modes, Feature extraction models M1, M2 and M3 are formed; the feature extraction models M1, M2 and M3 are respectively used to extract EEG feature vector E2, brain CT feature vector E3 and brain MRI feature vector E4.

优选的，步骤4中所述的图嵌入前馈神经网络的定义如下：Preferably, the graph embedding feedforward neural network described in step 4 is defined as follows:

E1＝MLP(σ(x(W·A)+b))；E1=MLP(σ(x(W·A)+b));

式中：A为特征依赖图的邻接矩阵；x为患者的结构化特征的取值构成的特征向量，所述患者的结构化特征包括步骤2中所述的特征集合F中出现的特征；W和b为训练参数；·表示哈达玛积运算；σ表示激活函数；MLP为多层感知机。In the formula: A is the adjacency matrix of the feature dependency graph; x is the feature vector composed of the values of the patient's structural features, and the patient's structural features include the features that appear in the feature set F described in step 2; W and b are training parameters; · means Hadamard product operation; σ means activation function; MLP means multi-layer perceptron.

本发明的图嵌入前馈神经网络部分嵌入了步骤2中所述的特征依赖图，以集成特征间的依赖关系，进一步的构建了结构化特征的特征向量。The graph embedding feedforward neural network of the present invention partially embeds the feature dependency graph described in step 2 to integrate the dependencies between features, and further constructs the feature vector of structured features.

优选的，所述步骤5包括以下步骤：Preferably, said step 5 includes the following steps:

步骤5.1：将步骤3中提取到的多模态数据特征向量作为训练样本[x,E2,E3,E4,y]；结构化特征向量x通过图嵌入神经网络编码，得到特征向量E1；Step 5.1: Use the multimodal data feature vector extracted in step 3 as the training sample [x, E2, E3, E4, y]; the structured feature vector x is encoded by the graph embedding neural network to obtain the feature vector E1;

步骤5.2：基于第一层编码器-解码器将特征向量E1转换为脑电图特征向量E2，并在转换过程中产生用于捕获特征向量E1和脑电图特征向量E2之间的联合信息的融合向量C：Step 5.2: Convert the feature vector E1 to the EEG feature vector E2 based on the first layer of encoder-decoder, and generate the joint information for capturing the joint information between the feature vector E1 and the EEG feature vector E2 during the conversion process Fusion vector C:

C＝f_E-D(E1,E2)；C = f _ED (E1, E2);

式中：f_E-D()表示第一层的编码器-解码器单元，获取融合向量表示C；其中，编码器-解码器单元可选用神经网络结构；In the formula: f _ED () represents the encoder-decoder unit of the first layer, and obtains the fusion vector representation C; wherein, the encoder-decoder unit can choose a neural network structure;

步骤5.3：融合后的向量C输入到第二层的编码器-解码器单元，实现融合向量C和脑CT和脑MRI特征向量E3和E4的转换。第二层的编码器-解码器单元中，融合向量C到脑CT和脑MRI的转换使用一个编码器和两个单独的解码器，在编码解码过程中，分别产生融合向量表示C1和C2：Step 5.3: The fused vector C is input to the encoder-decoder unit of the second layer to realize the conversion of the fused vector C and brain CT and brain MRI feature vectors E3 and E4. In the encoder-decoder unit of the second layer, the conversion of the fusion vector C to brain CT and brain MRI uses an encoder and two separate decoders. During the encoding and decoding process, the fusion vector representations C1 and C2 are generated respectively:

C1＝f_E1-D1(C,E3)；C1=f _E1-D1 (C, E3);

C2＝f_E1-D2(C,E4)；C2=f _E1-D2 (C, E4);

式中：f_E1-D1()、f_E1-D2()表示第二层的编码器-解码器单元，分别用于获取融合向量表示C1和C2；其中，编码器-解码器单元可选用神经网络结构。In the formula: f _E1-D1 (), f _E1-D2 () represent the encoder-decoder unit of the second layer, which are used to obtain fusion vector representations C1 and C2 respectively; among them, the encoder-decoder unit can choose neural network structure.

本发明的特征融合设计了一个两层的编码器-解码器单元，分别融合脑电图特征和脑图像特征(包括脑CT和脑MRI)，使得能够充分的挖掘特定于一氧化碳中毒迟发性脑病预测任务的融合特征，增强一氧化碳中毒迟发性脑病预测模型的适用性和稳定性。The feature fusion of the present invention designs a two-layer encoder-decoder unit, which fuses EEG features and brain image features (including brain CT and brain MRI) respectively, so that it can fully mine the delayed encephalopathy specific to carbon monoxide poisoning Fusion features of a prediction task to enhance the applicability and stability of a predictive model for delayed-onset encephalopathy in carbon monoxide poisoning.

优选的，所述步骤6具体如下所述：Preferably, the step 6 is specifically as follows:

对融合向量C1和C2进行拼接，将融合向量C1和C2拼接后，输入到全连接层，经过非线性拟合，输出一氧化碳中毒患者发生迟发性脑病概率：Splicing the fusion vectors C1 and C2, after splicing the fusion vectors C1 and C2, input to the fully connected layer, after nonlinear fitting, output the probability of delayed encephalopathy in patients with carbon monoxide poisoning:

式中：||表示向量拼接，W、b为训练参数，g表示激活函数，

表示是否具有迟发性脑病风险的预测输出；In the formula: || means vector splicing, W and b are training parameters, g means activation function,

Indicates the predicted output of whether there is a risk of delayed encephalopathy;

一氧化碳中毒迟发性脑病预测模型的损失函数定义为：The loss function of the CO poisoning delayed encephalopathy prediction model is defined as:

L＝λ₁L_E2+λ₂L_E3+λ₃L_E4+λ_pL_p；L＝λ ₁ L _E2 +λ ₂ L _E3 +λ ₃ L _E4 +λ _p L _p ;

式中：L_Ei(i＝2,3,4)表示特征融合过程中产生的误差，L_p表示分类预测产生的误差。λ_i(i＝1,2,3)为超参数，可根据实际情况权衡不同部分损失函数的占比；In the formula: L _Ei (i=2,3,4) represents the error generated during the feature fusion process, and L _p represents the error generated by classification prediction. λ _i (i=1,2,3) is a hyperparameter, which can weigh the proportion of different parts of the loss function according to the actual situation;

具体地，预测部分的损失L_p的计算方法：Specifically, the calculation method of the loss L _p of the prediction part:

L_p＝-y log p-(1-y)log(1-p)；L _p =-y log p-(1-y)log(1-p);

式中：y为一氧化碳中毒患者样本的真实标签，发生迟发性脑病，标签为1，否则标签为0；p表示一氧化碳中毒患者预测为发生迟发性脑病的概率；In the formula: y is the real label of the sample of the patient with carbon monoxide poisoning, if delayed encephalopathy occurs, the label is 1, otherwise the label is 0; p represents the probability that the patient with carbon monoxide poisoning is predicted to develop delayed encephalopathy;

融合部分的损失L_Ei的计算方法：The calculation method of the loss L _Ei of the fusion part:

式中，

分别表示步骤5.2和步骤5.3中所述的解码器的输出，Ei(i＝2,3,4)分别表示步骤3所述的脑电图、脑CT和脑MRI的向量表示。In the formula,

represent the outputs of the decoders described in step 5.2 and step 5.3, respectively, and Ei (i=2, 3, 4) represent the vector representations of the EEG, brain CT and brain MRI described in step 3, respectively.

优选的，将测试集中的结构化特征向量x输入到一氧化碳中毒迟发性脑病预测模型中，基于一氧化碳中毒迟发性脑病预测模型的患者是否会发生迟发性脑病的预测结果，通过准确率、召回率和F1指标评估一氧化碳中毒迟发性脑病预测模型的预测性能。Preferably, the structured feature vector x in the test set is input into the carbon monoxide poisoning delayed encephalopathy prediction model, whether the patients based on the carbon monoxide poisoning delayed encephalopathy prediction model will have delayed encephalopathy prediction results, through accuracy, Recall and F1 metrics to assess the predictive performance of a carbon monoxide poisoning delayed encephalopathy prediction model.

优选的，将步骤1中所述的数据集划分为训练集和测试集，将所述训练集划分为训练集1和训练集2，训练集1用于训练特征提取模型和特征依赖图；训练集2用于训练一氧化碳中毒迟发性脑病预测模型，所述训练集中无数据缺失，测试集中能够存在脑CT、脑MRI或脑电图等数据的缺失。Preferably, the data set described in step 1 is divided into a training set and a test set, and the training set is divided into a training set 1 and a training set 2, and the training set 1 is used to train a feature extraction model and a feature dependency graph; training Set 2 is used to train a carbon monoxide poisoning delayed encephalopathy prediction model. There is no missing data in the training set, and there may be missing data such as brain CT, brain MRI, or EEG in the test set.

本发明的有益效果包括：The beneficial effects of the present invention include:

1.特征依赖图能够挖掘特征间依赖关系，做为先验知识增强一氧化碳中毒迟发性脑病预测模型的预测准确性。图嵌入前馈神经网络基于特征依赖图的结构对特征进行选择，使前馈神经网络输入层到隐藏层之间的连接关系变得稀疏。1. The feature dependency graph can mine the dependencies between features and use it as prior knowledge to enhance the prediction accuracy of the CO poisoning delayed encephalopathy prediction model. The graph embedding feedforward neural network selects features based on the structure of the feature-dependent graph, so that the connection relationship between the input layer and the hidden layer of the feedforward neural network becomes sparse.

2.本发明的多模态特征融合设计了一个两层的编码器-解码器单元，挖掘特定于一氧化碳中毒迟发性脑病预测任务的融合特征。相比于通过注意力机制等方法简单组合不同模态特征来说，本发明中的一氧化碳中毒迟发性脑病预测预测模型设计了专门的多模态特征学习过程，一部分模态转换产生的损失参与模型的优化，能够更有效地利用多模态信息。2. The multimodal feature fusion of the present invention designs a two-layer encoder-decoder unit to mine fusion features specific to the CO poisoning delayed encephalopathy prediction task. Compared with the simple combination of different modal features through the attention mechanism and other methods, the carbon monoxide poisoning delayed encephalopathy prediction model in the present invention is designed with a special multi-modal feature learning process, and part of the loss caused by modal conversion is involved in The optimization of the model can make more effective use of multimodal information.

3.通过特征依赖图的集成和多模态信息的融合，构建稳健准确的一氧化碳中毒迟发性脑病预测模型。本发明所述的一氧化碳中毒迟发性脑病预测模型在训练时，充分融合多模态的诊疗信息；而应用于早期预测时，仅需结构化的数据做为一氧化碳中毒迟发性脑病预测模型的输入。因此，一定程度上降低了实际应用中一氧化碳中毒迟发性脑病预测模型对多模态数据完整性的需求，解决了在部分检查数据缺失的情况下无法实现准确可靠预测的问题。3. Construct a robust and accurate CO poisoning delayed encephalopathy prediction model through the integration of feature-dependency graphs and the fusion of multimodal information. The carbon monoxide poisoning delayed encephalopathy prediction model of the present invention fully integrates multi-modal diagnosis and treatment information during training; when applied to early prediction, only structured data is needed as the carbon monoxide poisoning delayed encephalopathy prediction model enter. Therefore, to a certain extent, the demand for multimodal data integrity of the CO poisoning delayed encephalopathy prediction model in practical applications has been reduced, and the problem of inability to achieve accurate and reliable predictions in the absence of some inspection data has been solved.

附图说明Description of drawings

图1为本发明的不同模态特征向量的表示示意图。FIG. 1 is a schematic representation of different modal eigenvectors of the present invention.

图2为本发明的一氧化碳中毒迟发性脑病预测模型训练示意图。Fig. 2 is a schematic diagram of training the carbon monoxide poisoning delayed encephalopathy prediction model of the present invention.

图3为本发明的一氧化碳中毒迟发性脑病预测模型的训练及测试流程示意图。Fig. 3 is a schematic diagram of the training and testing process of the carbon monoxide poisoning delayed encephalopathy prediction model of the present invention.

图4为本发明的整体流程示意图。Fig. 4 is a schematic diagram of the overall process of the present invention.

图5为本发明的特征依赖图示意图。FIG. 5 is a schematic diagram of a feature dependency graph of the present invention.

具体实施方式Detailed ways

为使本申请实施例的目的、技术方案和优点更加清楚，下面将结合本申请实施例中附图，对本申请实施例中的技术方案进行清楚、完整地描述，显然，所描述的实施例仅是本申请一部分实施例，而不是全部的实施例。通常在此处附图中描述和示出的本申请实施例的组件可以各种不同的配置来布置和设计。因此，以下对在附图中提供的本申请的实施例的详细描述并非旨在限制要求保护的本申请的范围，而是仅仅表示本申请的选定实施例。基于本申请的实施例，本领域技术人员在没有做出创造性劳动的前提下所获得的所有其他实施例，都属于本申请保护的范围。In order to make the purpose, technical solutions and advantages of the embodiments of the present application clearer, the technical solutions in the embodiments of the present application will be clearly and completely described below in conjunction with the drawings in the embodiments of the present application. Obviously, the described embodiments are only It is a part of the embodiments of this application, not all of them. The components of the embodiments of the application generally described and illustrated in the figures herein may be arranged and designed in a variety of different configurations. Accordingly, the following detailed description of the embodiments of the application provided in the accompanying drawings is not intended to limit the scope of the claimed application, but merely represents selected embodiments of the application. Based on the embodiments of the present application, all other embodiments obtained by those skilled in the art without making creative efforts belong to the scope of protection of the present application.

下面结合附图1和附图5对本发明作进一步的详细说明：Below in conjunction with accompanying drawing 1 and accompanying drawing 5 the present invention is described in further detail:

所述多模态数据包括图像、波形图和结构化数据；The multimodal data includes images, waveforms, and structured data;

具体的，所述CO中毒首次就诊信息包括：年龄、性别、职业、中毒方式、中毒原料、中毒地点、中毒前劳动类型、文化程度、既往病史、中毒时间、就诊时间、首次高压氧治疗时间、中毒时临床表现(头痛、呕吐、四肢无力、昏迷、心悸、呼吸困难等)、昏迷时间、昏迷时Glasgow评分、第一次高压氧时意识状态；所述辅助检查及治疗情况信息包括：血碳氧血红蛋白、血气、血常规、C反应蛋白、D二聚体、肝功能、血脂、血糖及一些特异性生物标志物检测、相关药物治疗(是否使用激素、改善脑功能、脱水降颅压、促醒等)及其他治疗(高压氧治疗方案、康复治疗)。Specifically, the CO poisoning first visit information includes: age, gender, occupation, poisoning method, poisoning raw material, poisoning place, type of work before poisoning, education level, past medical history, time of poisoning, time of visit, time of first hyperbaric oxygen treatment, Clinical manifestations at the time of poisoning (headache, vomiting, weakness of limbs, coma, palpitations, dyspnea, etc.), coma time, Glasgow score at the time of coma, state of consciousness at the first hyperbaric oxygenation; the auxiliary examination and treatment information includes: blood carbon dioxide Oxyhemoglobin, blood gas, blood routine, C-reactive protein, D-dimer, liver function, blood lipid, blood sugar and some specific biomarkers, related drug treatment (whether to use hormones, improve brain function, reduce intracranial pressure by dehydration, promote awakening, etc.) and other treatments (hyperbaric oxygen therapy program, rehabilitation).

步骤1中若所获取的患者存在某一模态下的数据缺失，则进行缺失标记；In step 1, if there is missing data in a certain modality in the acquired patient, mark the missing data;

将步骤1中所述的数据集划分为训练集和测试集，将所述训练集划分为训练集1和训练集2，训练集1用于训练特征提取模型和特征依赖图；训练集2用于训练一氧化碳中毒迟发性脑病预测模型，所述训练集中无数据缺失，测试集中能够存在脑CT、脑MRI或脑电图数据的缺失。The data set described in step 1 is divided into training set and test set, and described training set is divided into training set 1 and training set 2, and training set 1 is used for training feature extraction model and feature dependency graph; Training set 2 uses For training a carbon monoxide poisoning delayed encephalopathy prediction model, there is no missing data in the training set, and there may be missing brain CT, brain MRI or EEG data in the testing set.

步骤2：基于步骤1中获取的数据集，通过贝叶斯网络结构学习算法以及医生建议构建特征依赖图，并删除特征依赖图中表示一氧化碳中毒迟发性脑病标签的节点，保留代表特征的节点；将特征依赖图中出现的所有特征构成集合F，并基于特征依赖图的结构，得到对应的邻接矩阵A；Step 2: Based on the data set obtained in step 1, construct a feature dependency graph through the Bayesian network structure learning algorithm and doctor's advice, and delete the nodes representing the label of carbon monoxide poisoning delayed encephalopathy in the feature dependency graph, and keep the nodes representing the features ; Construct a set F of all the features appearing in the feature dependency graph, and obtain the corresponding adjacency matrix A based on the structure of the feature dependency graph;

参见附图5所示，所述的特征依赖图是一个有向无环图，其中节点代表特征，边代表节点间的依赖关系，有向边指向的节点依赖于有向边的出发节点。Referring to Figure 5, the feature dependency graph is a directed acyclic graph, where nodes represent features, edges represent dependencies between nodes, and the nodes pointed by the directed edges depend on the starting nodes of the directed edges.

特征依赖图中出现的所有特征构成集合F，例如，图5所示，特征依赖图的特征集合F＝{职业，年龄，中毒方式，中毒原料，既往病史-心血管疾病，昏迷时间，高压氧治疗}。根据特征依赖图的结构，可以得到对应的邻接矩阵A。All the features appearing in the feature-dependent graph form a set F, for example, as shown in Figure 5, the feature set F of the feature-dependent graph = {occupation, age, poisoning method, poisoning raw material, past medical history-cardiovascular disease, coma time, hyperbaric oxygen treat}. According to the structure of the feature dependency graph, the corresponding adjacency matrix A can be obtained.

参见附图1所示，步骤3中所述的特征提取模型包括：特征提取模型M1、特征提取模型M2以及特征提取模型M3；Referring to shown in accompanying drawing 1, the feature extraction model described in step 3 comprises: feature extraction model M1, feature extraction model M2 and feature extraction model M3;

所述特征提取模型M1、特征提取模型M2和特征提取模型M3均从数据集中划分出的训练集1中训练得到；分别利用数据集中脑电图、脑CT和脑MRI数据训练得到三个分类器，调整特征提取模型的参数使得对应的分类器的分类性能达到预定的要求，再去除分类器中的sigmoid分类层，抽取三种模态下符合预定条件的分类器的卷积结构，构成特征提取模型M1、M2以及M3；所述特征提取模型M1、M2以及M3分别用于提取脑电图特征向量E2、脑CT特征向量E3和脑MRI特征向量E4。The feature extraction model M1, the feature extraction model M2 and the feature extraction model M3 are all trained from the training set 1 divided in the data set; three classifiers are obtained by training the EEG, brain CT and brain MRI data in the data set respectively , adjust the parameters of the feature extraction model so that the classification performance of the corresponding classifier meets the predetermined requirements, then remove the sigmoid classification layer in the classifier, and extract the convolution structure of the classifier that meets the predetermined conditions under the three modes to form a feature extraction Models M1, M2, and M3; the feature extraction models M1, M2, and M3 are used to extract EEG feature vector E2, brain CT feature vector E3, and brain MRI feature vector E4, respectively.

可选的，本发明可先采用预训练策略初始化特征提取模型参数，从而加快训练速度，提高特征提取模型的精度。Optionally, the present invention may first use a pre-training strategy to initialize the parameters of the feature extraction model, thereby speeding up the training speed and improving the accuracy of the feature extraction model.

参见附图2，步骤4中所述的图嵌入前馈神经网络的定义如下：Referring to accompanying drawing 2, the graph embedding feed-forward neural network described in step 4 is defined as follows:

E1＝MLP(σ(x(W·A)+b))；E1=MLP(σ(x(W·A)+b));

参见附图2，所述步骤5包括以下步骤：Referring to accompanying drawing 2, described step 5 comprises the following steps:

C＝f_E-D(E1,E2)；C = f _ED (E1, E2);

C1＝f_E1-D1(C,E3)；C1=f _E1-D1 (C, E3);

C2＝f_E1-D2(C,E4)；C2=f _E1-D2 (C, E4);

本发明的特征融合设计了一个两层的编码器-解码器单元，分别融合脑电图特征和脑图像特征(包括脑CT和脑MRI)，使得能够充分的挖掘特定于一氧化碳中毒迟发性脑病预测任务的融合特征，增强一种一氧化碳中毒迟发性脑病预测模型的适用性和稳定性。The feature fusion of the present invention designs a two-layer encoder-decoder unit, which fuses EEG features and brain image features (including brain CT and brain MRI) respectively, so that it can fully mine the delayed encephalopathy specific to carbon monoxide poisoning Fusion features of prediction tasks to enhance the applicability and stability of a carbon monoxide poisoning delayed encephalopathy prediction model.

参见附图2，将融合向量C1和C2拼接后，输入到全连接层，经过非线性拟合，输出一氧化碳中毒患者发生迟发性脑病概率：See Figure 2, after the fusion vectors C1 and C2 are spliced, they are input to the fully connected layer, and after nonlinear fitting, the probability of delayed encephalopathy in patients with carbon monoxide poisoning is output:

式中：||表示向量拼接，W、b为训练参数，g表示激活函数，

预测模型的损失函数定义为：The loss function of the predictive model is defined as:

L_p＝-y log p-(1-y)log(1-p)；L _p =-y log p-(1-y)log(1-p);

式中，

本发明所述的一氧化碳中毒迟发性脑病预测模型通过优化特征融合损失和分类损失的加权和更新所述一氧化碳中毒迟发性脑病预测模型的参数，从而得到更加精准的一氧化碳中毒迟发性脑病预测模型。The carbon monoxide poisoning delayed encephalopathy prediction model of the present invention updates the parameters of the carbon monoxide poisoning delayed encephalopathy prediction model by optimizing the weighted sum of feature fusion loss and classification loss, thereby obtaining more accurate carbon monoxide poisoning delayed encephalopathy prediction Model.

将测试集中的结构化特征向量x输入到一氧化碳中毒迟发性脑病预测模型中，基于一氧化碳中毒迟发性脑病预测模型的患者是否会发生迟发性脑病的预测结果，通过准确率、召回率和F1指标评估一氧化碳中毒迟发性脑病预测模型的预测性能。Input the structured feature vector x in the test set into the carbon monoxide poisoning delayed encephalopathy prediction model. Based on the prediction results of whether the patients with carbon monoxide poisoning delayed encephalopathy prediction model will develop delayed encephalopathy, the accuracy, recall rate and The F1 index evaluates the predictive performance of the CO poisoning delayed encephalopathy prediction model.

以上所述实施例仅表达了本申请的具体实施方式，其描述较为具体和详细，但并不能因此而理解为对本申请保护范围的限制。应当指出的是，对于本领域的普通技术人员来说，在不脱离本申请技术方案构思的前提下，还可以做出若干变形和改进，这些都属于本申请的保护范围。The above-mentioned embodiments only express the specific implementation manners of the present application, and the descriptions thereof are relatively specific and detailed, but should not be construed as limiting the protection scope of the present application. It should be noted that those skilled in the art can make several modifications and improvements without departing from the concept of the technical solution of the present application, and these all belong to the protection scope of the present application.

Claims

1. A method for predicting carbon monoxide poisoning delayed encephalopathy based on multi-modal learning is characterized by comprising the following steps:

step 1: acquiring multi-mode data of a patient suffering from carbon monoxide poisoning, and preprocessing the multi-mode data to obtain a data set of the patient suffering from carbon monoxide poisoning;

the multi-modal data includes an image, a waveform diagram, and structured data;

the images include brain CT and brain MRI; the waveform map comprises an electroencephalogram; the structured data comprises first diagnosis information, auxiliary examination and treatment conditions of CO poisoning;

step 2: based on the data set obtained in the step 1, constructing a feature dependency graph through a Bayesian network structure learning algorithm and doctor suggestions, wherein nodes represent features in the feature dependency graph, and edges represent the relationship among the features and represent the dependency relationship among the features; then deleting the nodes of the characteristic dependency graph, which represent the delayed encephalopathy labels of the carbon monoxide poisoning, and reserving the nodes representing the characteristics; all features appearing in the feature dependency graph form a set F, and a corresponding adjacency matrix A is obtained based on the structure of the feature dependency graph;

step 3: respectively establishing a feature extraction model based on the data types in the data set, and extracting multi-mode data feature vectors based on the feature extraction model;

step 4: constructing a graph embedded feedforward neural network based on a characteristic dependency graph, wherein the definition of the graph embedded feedforward neural network is as follows:

E1 = MLP(σ(x(W·A) + b)) ；

wherein: a is an adjacency matrix of the feature dependency graph; x is a feature vector formed by the values of the structural features of the patient, wherein the structural features of the patient comprise the features appearing in the feature set F in the step 2; w and b are training parameters; representing the Hadamard product operation; sigma represents an activation function; MLP is a multi-layer perceptron;

step 5: fusing the multi-mode data feature vectors to obtain fusion vectors C1 and C2, wherein the detailed steps comprise:

step 5.1: taking the multi-mode data feature vector extracted in the step 3 as a training sample [ x, E2, E3, E4, y ]; the structural feature vector x is embedded into a neural network code through a graph to obtain a feature vector E1;

step 5.2: the feature vector E1 is converted into an electroencephalogram feature vector E2 based on the first layer encoder-decoder, and a fusion vector C for capturing joint information between the feature vector E1 and the electroencephalogram feature vector E2 is generated in the conversion process:

C = fE-D (E1, E2) ；

wherein: fE-D () represents the encoder-decoder unit of the first layer, obtaining a fusion vector representation C; wherein the encoder-decoder unit may be of neural network architecture;

step 5.3: inputting the fusion vector C to a coder-decoder unit of a second layer to realize the conversion of the fusion vector C and brain CT and brain MRI eigenvectors E3 and E4; in the encoder-decoder unit of the second layer, the conversion of fusion vector C into brain CT and brain MRI uses one encoder and two separate decoders, which in the encoding and decoding process produce fusion vector representations C1 and C2, respectively:

C1 = fE1-D1 (C, E3)；

C2 = fE1-D2 (C, E4) ；

wherein: fE1-D1 (), fE1-D2 () represent encoder-decoder units of the second layer for obtaining fusion vector representations C1 and C2, respectively; wherein the encoder-decoder unit adopts a neural network structure;

step 6: based on the graph embedded feedforward neural network and fusion vectors C1 and C2, a carbon monoxide poisoning delayed encephalopathy prediction model is established, fusion vectors C1 and C2 are spliced, fusion vectors C1 and C2 are input into a full-connection layer after being spliced, and the probability of delayed encephalopathy occurrence of a carbon monoxide poisoning patient is output through nonlinear fitting:

yˆ = g(W *(C1 || C2) + b) ；

wherein: i represents vector stitching, W, b is a training parameter, g represents an activation function, y ˆ represents a predicted output of whether there is a risk of tardive encephalopathy;

the loss function of the predictive model of delayed encephalopathy in carbon monoxide poisoning is defined as:

L = λ1LE 2 + λ2 LE3 + λ3 LE 4 + λp Lp ；

wherein: LEi; i=2, 3, 4; representing errors generated in the feature fusion process, wherein Lp represents errors generated by classification prediction; λi; i=1, 2, 3; the duty ratio of the loss functions of different parts can be weighed according to actual conditions as super parameters;

specifically, the calculation method of the loss Lp of the predicted portion:

Lp = -y log p - (1- y) log(1- p) ；

wherein: y is the real label of a patient sample with carbon monoxide poisoning, delayed encephalopathy occurs, the label is 1, otherwise, the label is 0; p represents the probability that a patient suffering from carbon monoxide poisoning is predicted to develop a delayed encephalopathy;

calculation method of loss LEi of fusion part:

LEi =|| Ei - Eˆi || ² , i = 2, 3, 4 ；

wherein E ˆ i, i=2, 3, 4; the outputs of the decoders described in step 5.2 and step 5.3, ei, respectively; i=2, 3, 4;

vector representations of the electroencephalogram, brain CT and brain MRI described in step 3 are represented respectively;

step 7: and testing the carbon monoxide poisoning delayed encephalopathy prediction model, and selecting parameters meeting the prediction performance requirements as parameters of the carbon monoxide poisoning delayed encephalopathy prediction model.

2. The method for predicting carbon monoxide poisoning delayed encephalopathy based on multimodal learning according to claim 1, which

Is characterized in that in the step 1, if the acquired patient has data missing in a certain mode, the missing mark is carried out;

the step 1 further comprises the steps of obtaining follow-up information of the patient suffering from carbon monoxide poisoning and constructing a prediction label; if the patient suffers from delayed encephalopathy within a period of time after carbon monoxide poisoning, the predictive label is 1, otherwise, the predictive label is 0.

3. The method for predicting carbon monoxide poisoning delayed encephalopathy according to claim 1, wherein the feature extraction model in step 3 comprises: a feature extraction model M1, a feature extraction model M2, and a feature extraction model M3;

the feature extraction model M1 is composed of a convolution module of EEGNet, wherein the convolution module is composed of three convolution layers, namely Conv2D, depthwiseConv2D and SeparableConv2D in sequence; the feature extraction model M2 and the feature extraction model M3 are respectively composed of a first convolutional neural network and a second convolutional neural network.

4. The method for predicting carbon monoxide poisoning delayed encephalopathy based on multi-modal learning according to claim 3, wherein the feature extraction model M1, the feature extraction model M2 and the feature extraction model M3 are all obtained by training from a training set 1 divided from a dataset; respectively utilizing electroencephalogram, brain CT and brain MRI data in a data set to train to obtain three classifiers, adjusting parameters of a feature extraction model to enable classification performance of the corresponding classifier to meet preset requirements, removing a sigmoid classification layer in the classifier, extracting convolution structures of the classifier meeting preset conditions under three modes, and forming feature extraction models M1, M2 and M3; the feature extraction models M1, M2 and M3 are used to extract an electroencephalogram feature vector E2, a brain CT feature vector E3 and a brain MRI feature vector E4, respectively.

5. The method for predicting carbon monoxide poisoning late-onset encephalopathy based on multi-modal learning according to claim 1, wherein the structured feature vector x in the test set is input into a model for predicting carbon monoxide poisoning late-onset encephalopathy, and the prediction result of whether a patient based on the model for predicting carbon monoxide poisoning late-onset encephalopathy will develop late-onset encephalopathy.

6. The method for predicting carbon monoxide poisoning delayed encephalopathy based on multimodal learning according to claim 1, which

The method is characterized in that the data set in the step 1 is divided into a training set and a testing set, the training set is divided into a training set 1 and a training set 2, and the training set 1 is used for training a feature extraction model and a feature dependency graph; the training set 2 is used for training a carbon monoxide poisoning delayed encephalopathy prediction model, no data is missing in the training set, and the missing of brain CT, brain MRI or electroencephalogram data can exist in the test set.