CN112331332A - Disease prediction method and system based on multi-granularity feature fusion

Info

Publication number: CN112331332A
Application number: CN202011095993.7A
Authority: CN
Inventors: 赵青; 李建强; 徐春
Original assignee: Beijing University of Technology
Current assignee: Beijing University of Technology
Priority date: 2020-10-14
Filing date: 2020-10-14
Publication date: 2021-02-05

Abstract

Embodiments of the invention provide a disease prediction method and system based on multi-granularity feature fusion. The method comprises: acquiring fusion features based on a disease to be predicted; and inputting the fusion features into a trained disease prediction model to obtain a classification result of the disease type, wherein the disease prediction model is obtained by training a parallel adaptive convolutional neural network model on the fusion features of multiple diseases. By adopting a multi-granularity feature fusion prediction method, the embodiments use not only fine-grained word and concept features but also coarser-grained concept relationship and attribute-value features to fully capture the semantic information in medical text, thereby improving the disease prediction performance of the model.

Description

Disease prediction method and system based on multi-granularity feature fusion

Technical Field

The invention relates to the technical field of computers, in particular to a disease prediction method and system based on multi-granularity feature fusion.

Background

Disease prediction uses existing semantic analysis techniques to automatically assign diseases to different categories. It can help doctors or patients quickly understand the patient's current stage of disease and, by anticipating likely interventions, supports the scheduling and coordination of key medical resources.

To date, methods for constructing prediction models fall mainly into two types: hypothesis-driven methods and data-driven methods. The former start from hypotheses that clinical experts form from observation and clinical experience, then look for facts in medical data and verify the hypotheses by deductive reasoning. The prediction model is derived from the set of verified hypotheses. In general, hypothesis-driven methods do not take full advantage of the valuable information contained in medical data. Data-driven methods train machine learning models on fully labeled medical data sets to achieve disease prediction. Traditional machine learning models require domain experts to specify clinical features in an ad hoc manner, and the success of the final prediction model depends largely on carefully supervised, manually designed feature selection. For example, "Effective Heart Disease Prediction Using Hybrid Machine Learning Techniques", published by Senthilkumar Mohan et al. in 2019, proposed a hybrid random forest with a linear model for heart disease prediction. Deep learning, which reduces the complexity of traditional feature selection by automatically learning deeper features from data, has become the main approach for prediction models today. Deep-learning-based disease prediction methods usually adopt word or concept vectors as the main feature representation of medical text. For example, in the article "training Embedding with Domain Knowledge for Oral Disease Diagnosis Prediction", published at SmartCom 2018, Guangkai Li, Songmao Zhang et al. learn concepts related to symptoms and diagnoses from a domain ontology and use a neural network to learn concept features in electronic medical records in order to construct an oral disease prediction model.

However, because the feature granularity of word or concept vectors is too small, relying on them alone is likely to extract the semantic information in medical text insufficiently, so that a correct medical decision cannot be provided.

Disclosure of Invention

The embodiment of the invention provides a disease prediction method and system based on multi-granularity feature fusion, which are used for overcoming the defects in the prior art.

In a first aspect, an embodiment of the present invention provides a disease prediction method based on multi-granularity feature fusion, including:

acquiring fusion characteristics based on a disease to be predicted;

inputting the fusion characteristics into a disease prediction model obtained by training to obtain a classification result of the disease types; the disease prediction model is obtained by training fusion characteristics of various diseases based on a parallel self-adaptive convolutional neural network model.

Further, the disease prediction model is obtained by the following steps:

acquiring a text to be processed, and preprocessing the text to be processed to obtain a preprocessed text;

extracting the features of the preprocessed text to obtain extracted features;

fusing the extracted features based on multi-granularity features to obtain fused features of the various diseases;

and acquiring a parallel self-adaptive convolutional neural network model, inputting the fusion characteristics of the various diseases into the parallel self-adaptive convolutional neural network model for training to obtain the disease prediction model.

Further, the obtaining of the text to be processed and the preprocessing of the text to be processed to obtain the preprocessed text specifically include:

manually labeling the medical text data according to the target category to be predicted, and loading a domain ontology, to obtain the text to be processed;

and segmenting the text to be processed into Chinese character strings according to punctuation marks, numbers and spaces, and removing stop words to obtain the preprocessed text.

Further, the extracting features of the preprocessed text to obtain extracted features specifically includes:

performing feature extraction on the preprocessed text through concept feature extraction, word feature extraction, concept relationship feature extraction and attribute-value feature extraction to obtain the extracted features.

Further, the extracting features of the preprocessed text by extracting concept features, extracting word features, extracting concept relationship features, and extracting attribute and value features specifically includes:

mapping the preprocessed text to the domain ontology to obtain text data, segmenting the text data into a semantic set by a maximum matching method, converting the concept self features and concept type features that can be matched from the domain ontology into vector form using a word2vec model, and combining the concept self features with the concept type features to extract the concept features;

converting the semantic units for which no matching concept can be found in the domain ontology into vector form using the word2vec model, to extract the word features;

extracting relation trigger words between concepts by combining the word features, position features and negation word features, and combining the concept features with the relation trigger words to represent the concept relationship features;

and further representing the concept features as disease-time results, whose values are of numerical type, and detection-examination results, whose values are of numerical type or category type, to obtain the attribute-value features.

Further, the fusing the extracted features based on multi-granularity features to obtain fused features of the multiple diseases specifically includes:

directly performing vector concatenation on the extracted features when the categories of the prediction target differ greatly, or fusing the extracted features with a weight-based feature fusion method when the categories of the prediction target are highly similar, to obtain the fusion features of the multiple diseases.

Further, the obtaining a parallel adaptive convolutional neural network model, inputting the fusion characteristics of the multiple diseases into the parallel adaptive convolutional neural network model for training, and obtaining the disease prediction model specifically includes:

segmenting a sentence into different parts according to the differences between the concept relationship features and the attribute-value features, to extract the semantic information contained in the sentence;

and fusing the semantic information with the concept features and the word features to train the parallel adaptive convolutional neural network model, applying dropout and zero padding in the convolutional layer to preserve the validity of the sentence, to obtain the disease prediction model.

In a second aspect, an embodiment of the present invention further provides a disease prediction system based on multi-granularity feature fusion, including:

the acquisition module is used for acquiring fusion characteristics based on the disease to be predicted;

the processing module is used for inputting the fusion characteristics to a disease prediction model obtained by training to obtain a classification result of the disease types; the disease prediction model is obtained by training fusion characteristics of various diseases based on a parallel self-adaptive convolutional neural network model.

In a third aspect, an embodiment of the present invention further provides an electronic device, including a memory, a processor, and a computer program stored in the memory and executable on the processor, where the processor executes the program to implement the steps of the method for predicting a disease based on multi-granular feature fusion as described in any one of the above.

In a fourth aspect, the present invention further provides a non-transitory computer-readable storage medium, on which a computer program is stored, where the computer program, when executed by a processor, implements the steps of the multi-granular feature fusion based disease prediction method as described in any one of the above.

According to the disease prediction method and system based on multi-granularity feature fusion, provided by the embodiment of the invention, by adopting the multi-granularity feature fusion prediction method, not only are words and concept features of fine granularity adopted, but also semantic information in a medical text is fully understood by adopting concept relations and attribute-value features of larger granularity, so that the performance of model disease prediction is improved.

Drawings

In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, and it is obvious that the drawings in the following description are some embodiments of the present invention, and those skilled in the art can also obtain other drawings according to the drawings without creative efforts.

FIG. 1 is a schematic flow chart of a disease prediction method based on multi-granularity feature fusion according to an embodiment of the present invention;

FIG. 2 is a breakdown diagram of the process modules according to an embodiment of the present invention;

FIG. 3 is a schematic structural diagram of a disease prediction system based on multi-granularity feature fusion according to an embodiment of the present invention;

FIG. 4 is a schematic structural diagram of an electronic device according to an embodiment of the present invention.

Detailed Description

In order to make the objects, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are some, but not all, embodiments of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.

To address the problems in the prior art, an embodiment of the invention provides a disease prediction method based on multi-granularity feature fusion. The method extracts features of different granularities from an existing medical ontology and a labeled corpus and fuses them to train a disease prediction model. The trained model outputs the category corresponding to the prediction target and can be used in disease prediction applications such as disease type prediction or disease severity prediction.

Fig. 1 is a schematic flow chart of a disease prediction method based on multi-granularity feature fusion according to an embodiment of the present invention, as shown in fig. 1, including:

S1, acquiring fusion characteristics based on the disease to be predicted;

S2, inputting the fusion characteristics into a disease prediction model obtained by training to obtain a classification result of disease types; the disease prediction model is obtained by training fusion characteristics of various diseases based on a parallel self-adaptive convolutional neural network model.

Specifically, fusion features related to the disease to be predicted are obtained and input into a pre-trained disease prediction model to obtain the final classification result of the disease type. The disease prediction model is built on a parallel adaptive convolutional neural network and is trained with the fusion features of multiple diseases.

According to the embodiment of the invention, by adopting a multi-granularity feature fusion prediction method, not only are fine-granularity words and conceptual features adopted, but also the concept relationship and attribute-value features with larger granularity are adopted to fully understand semantic information in a medical text, so that the performance of model disease prediction is improved.

Based on the above embodiment, the disease prediction model is obtained by the following steps:

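acquiring a text to be processed, and preprocessing the text to be processed to obtain a preprocessed text;

extracting the features of the preprocessed text to obtain extracted features;

fusing the extracted features based on multi-granularity features to obtain fused features of the various diseases;

and acquiring a parallel self-adaptive convolutional neural network model, inputting the fusion characteristics of the various diseases into the parallel self-adaptive convolutional neural network model for training to obtain the disease prediction model.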

Specifically, as shown in Fig. 2, when training the disease prediction model, data preprocessing 1 is first performed, using the domain ontology, to obtain the preprocessed text. Feature extraction 2 is then performed on the preprocessed text, covering concept features 21, word features 22, concept relationship features 23 and attribute-value features 24, to obtain the extracted features. The extracted features are then fused 3 based on multi-granularity features, either by direct vector concatenation 31 or by the feature-weight-based fusion method 32, to obtain the fusion features of the multiple diseases. The parallel adaptive convolutional neural network model is trained on these fusion features to obtain the trained disease prediction model 4, and finally the trained model is used for disease type classification 5.

Based on any of the above embodiments, the obtaining of the text to be processed and the preprocessing of the text to be processed to obtain the preprocessed text specifically include:

Specifically, the medical text data is manually labeled according to the target category to be predicted, and a domain ontology is then loaded; the text to be processed is segmented into Chinese character strings according to punctuation marks, numbers and spaces, and stop words are removed to obtain the preprocessed text.
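The segmentation and stop-word removal described above can be illustrated with a minimal Python sketch. The stop-word set, the delimiter pattern and the function name `preprocess` are assumptions made for illustration only; the embodiment does not specify them, and removing stop words by simple string replacement is a deliberately crude simplification.

```python
import re

# Hypothetical stop-word list; a real system would load a domain-specific one.
STOP_WORDS = {"的", "了", "和"}

# Split on ASCII and Chinese punctuation, digits, and whitespace.
SPLIT_PATTERN = re.compile(r"[\s\d，。；：、？！,.;:?!()（）]+")


def preprocess(text, stop_words=STOP_WORDS):
    """Split text into Chinese character strings and drop stop words."""
    segments = [seg for seg in SPLIT_PATTERN.split(text) if seg]
    cleaned = []
    for seg in segments:
        # Remove every stop word that occurs inside the segment.
        for stop in stop_words:
            seg = seg.replace(stop, "")
        if seg:
            cleaned.append(seg)
    return cleaned


sample = "患者头痛3天，伴有发热和咳嗽。"
print(preprocess(sample))
```

In this sketch the digit "3" and the punctuation act as delimiters, so the sample sentence is cut into three character strings before the stop word is removed.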

Based on any of the above embodiments, the extracting features of the preprocessed text to obtain extracted features specifically includes:

Specifically, the features of the preprocessed text are extracted in four steps: concept feature extraction, word feature extraction, concept relationship feature extraction and attribute-value feature extraction.

The concept features include a concept self feature and a concept type feature. Firstly, the preprocessed text is mapped to the domain ontology, and the text data is segmented into a semantic set $\{Y_1, \ldots, Y_n\} \in D$ by a maximum matching method, where $D$ is the text data. The semantic set contains the concept set $\{C_1, \ldots, C_n\} \in Y$ matched from the domain ontology, with the corresponding concept types $\{C_{1type}, \ldots, C_{Ntype}\}$. Secondly, the concepts and concept types are converted into d-dimensional vectors using a word2vec model. Finally, the concept features are extracted by combining the concept self features with the concept type features, denoted

$$e = \{e_1, \ldots, e_n\}, \quad e_i \in e, \quad e_i = c_i \oplus c_{itype}$$

where $c_i$ is a concept self feature belonging to the concept set $\{C_1, \ldots, C_N\}$, $c_{itype}$ is the type of the concept $c_i$ and belongs to $\{C_{1type}, \ldots, C_{Ntype}\}$, and $\oplus$ is the vector concatenation operation.
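As an illustration of this step, the following Python sketch segments a string by greedy forward maximum matching against a toy ontology and then builds each concept feature by concatenating the concept vector with its concept-type vector. The ontology entries, the two-dimensional vectors standing in for word2vec output, the choice of the forward variant of maximum matching and all function names are assumptions for illustration; the embodiment does not specify them.

```python
# Hypothetical toy ontology: concept -> concept type.
ONTOLOGY = {"头痛": "symptom", "发热": "symptom", "高血压": "disease"}

# Hypothetical 2-dimensional embeddings standing in for word2vec vectors.
EMBEDDINGS = {
    "头痛": [0.1, 0.2], "发热": [0.3, 0.4], "高血压": [0.5, 0.6],
    "symptom": [1.0, 0.0], "disease": [0.0, 1.0],
}


def forward_max_match(text, vocabulary, max_len=4):
    """Greedy forward maximum matching; unmatched characters become single units."""
    units, i = [], 0
    while i < len(text):
        # Try the longest candidate first and shrink until a match is found.
        for size in range(min(max_len, len(text) - i), 0, -1):
            candidate = text[i:i + size]
            if size == 1 or candidate in vocabulary:
                units.append(candidate)
                i += size
                break
    return units


def concept_feature(concept):
    """Concatenate the concept vector with its concept-type vector."""
    return EMBEDDINGS[concept] + EMBEDDINGS[ONTOLOGY[concept]]


units = forward_max_match("患者头痛发热", ONTOLOGY)
concepts = [u for u in units if u in ONTOLOGY]
features = [concept_feature(c) for c in concepts]
print(units)
print(features)
```

Units that are not found in the ontology ("患" and "者" here) are the ones that would be treated as word features rather than concept features.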

Word features refer to semantic units for which no matching concept can be found in the domain ontology, written as $\{W_1, \ldots, W_n\} \in Y$. Similarly, word2vec is used to convert the words into d-dimensional vectors, denoted $w = \{w_1, \ldots, w_n\}$.

Relation trigger words between concepts are extracted by combining the word features, position features and negation word features. Combined with the concept features, the concept relationship features are expressed as triples, denoted $p_i = (e_i, r_i, e_o)$, $p = \{p_1, \ldots, p_n\}$, $p_i \in p$, where $e_i$ and $e_o$ represent concept features and $r_i$ represents the relation trigger word between the concepts. Let $\{s_1, \ldots, s_i, \ldots, s_n\} \in D$ be the sentences, where $s_i$ consists of m semantic units, $s_i = \{w_1, \ldots, e_i, \ldots, e_o, \ldots, w_m\}$, in which $e_i$ and $e_o$ denote the concepts contained in $s_i$ and $\{w_1, \ldots, w_m\}$ are the words of the sentence $s_i$.

Each word has two relative distances, one to the concept feature $e_i$ and one to $e_o$, which are recorded as the position features [formula not reproduced].

Since a negation word can change the meaning of a word, the negation word features are extracted by loading a negation word dictionary and are denoted $\{n_1, \ldots, n_m\} \in w$, where $w$ represents the set of word features. Finally, the relation trigger word between concepts is represented by an expression [formula not reproduced],

in which the concept features $e_i$ and $e_o$ and the relation trigger word $r_i$ lie in the same spatial dimension [formula not reproduced].
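A minimal Python sketch of the position and negation word features follows. It assumes that the two relative distances are signed token offsets to the two concepts and that the negation feature is a binary flag per token; the negation word list and the function names are hypothetical, and the trigger word in the example triple is picked by hand rather than computed, since the corresponding expression is not available here.

```python
# Hypothetical negation-word list.
NEGATION_WORDS = {"无", "否认", "未见"}


def position_features(tokens, idx_a, idx_b):
    """Return, for each token, its signed distances to the two concept positions."""
    return [(j - idx_a, j - idx_b) for j in range(len(tokens))]


def negation_features(tokens, negation_words=NEGATION_WORDS):
    """Return 1 for tokens that are negation words and 0 otherwise."""
    return [1 if tok in negation_words else 0 for tok in tokens]


tokens = ["患者", "否认", "高血压", "引起", "头痛"]
positions = position_features(tokens, idx_a=2, idx_b=4)
negations = negation_features(tokens)

# Concept relationship triple in the form (concept, trigger word, concept).
triple = (tokens[2], tokens[3], tokens[4])
print(positions)
print(negations)
print(triple)
```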

Attribute-value features fall into two classes: disease-time and detection-examination results. The attribute refers to a concept feature. The value in disease-time is of numerical type only, while the value in a detection-examination result may be of numerical type or category type. For the numerical type, both the value and its corresponding unit symbol are considered: given the value $V_i$ and its corresponding unit symbol $U_i$, the updated numerical value is calculated by a formula [not reproduced],

where $u_i$ represents the vector form of the unit symbol. The disease-time feature is denoted $t_i = (e_o, v_m)$. For numerical values in detection-examination results, an index level feature also needs to be extracted: given concepts $C = \{C_1, C_2, \ldots, C_n\}$, values $v = \{v_1, v_2, \ldots, v_n\}$ and index levels $L = \{L_1, L_2, L_3\}$, the value of an examination result can be represented as a triple $z_i = (e_i, v_i, l_i)$, where $e_o$ and $e_i$ are concept features, $\{v_m, v_i\} \in v$, and $l_i$ is the index level vector. The category type has no unit symbol and usually consists of character strings such as "negative" or "positive". Negation word features therefore need to be extracted so that the semantics of the text are expressed accurately. For a category type without negation words, its category vector is extracted directly; a category type with negation words is expressed by combining the category feature with the negation word feature [formula not reproduced],

where $b_m$ is the category feature and $n_m$ is the negation word vector. The category type of a detection-examination result can therefore be represented as $k_i = (e_m, g_m)$, where $e_m$ represents a concept feature. Finally, $t_i, z_i, k_i \in q_i$, $q = \{q_1, \ldots, q_n\}$, $q_i \in q$, where $q$ represents the set of attribute-value features.
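The three kinds of attribute-value features can be sketched in Python as plain tuples. Everything below is an illustrative assumption rather than the embodiment's implementation: the regular expression for splitting a value from its unit, the reading of the three index levels as below, within and above a reference range, the reference range itself, and the representation of a category value as a (string, negated) pair.

```python
import re
from typing import NamedTuple, Optional


class NumericValue(NamedTuple):
    value: float
    unit: Optional[str]


# A number followed by an optional unit made of non-digit, non-space characters.
VALUE_PATTERN = re.compile(r"(\d+(?:\.\d+)?)\s*([^\d\s]*)")


def parse_numeric(text):
    """Split a string such as '3天' or '150 mmHg' into value and unit."""
    match = VALUE_PATTERN.fullmatch(text.strip())
    if match is None:
        raise ValueError(f"not a numeric value: {text!r}")
    number, unit = match.groups()
    return NumericValue(float(number), unit or None)


def index_level(value, low, high):
    """Map a measurement to a hypothetical three-level index (L1, L2, L3)."""
    if value < low:
        return "L1"
    if value > high:
        return "L3"
    return "L2"


duration = parse_numeric("3天")
disease_time = ("头痛", duration)                    # (concept, value)
pressure = parse_numeric("150 mmHg")
exam_result = ("收缩压", pressure, index_level(pressure.value, 90, 140))
category_result = ("尿蛋白", ("阴性", False))         # (concept, category value)
print(disease_time, exam_result, category_result)
```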

Based on any of the above embodiments, the fusing the extracted features based on multi-granularity features to obtain the fused features of the multiple diseases specifically includes:

Specifically, different feature fusion methods are adopted for different prediction targets. When the categories of the prediction target differ greatly, the extracted features can be concatenated directly; when the categories of the prediction target are highly similar, the weight-based feature fusion method is adopted. The two methods are described as follows.

When vector concatenation is performed directly on the extracted features, the formula [not reproduced] combines $e_i$, $w_i$, $p_i$ and $q_i$,

where $e_i$ represents a concept feature, $w_i$ a word feature, $p_i$ a concept relationship feature and $q_i$ an attribute-value feature.
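Direct vector concatenation of the four feature types can be sketched in a few lines of Python. The vector values and dimensions are made up for illustration, and the order of concatenation is an assumption.

```python
def concatenate(*vectors):
    """Concatenate feature vectors end to end into one fused vector."""
    fused = []
    for vec in vectors:
        fused.extend(vec)
    return fused


e_i = [0.1, 0.2, 1.0, 0.0]   # concept feature
w_i = [0.7, 0.3]             # word feature
p_i = [0.4, 0.5, 0.6]        # concept relationship feature
q_i = [3.0, 1.0]             # attribute-value feature

fused = concatenate(e_i, w_i, p_i, q_i)
print(len(fused), fused)
```

Concatenation keeps every feature dimension intact, which is why it suits prediction targets whose categories are already easy to separate.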

In the weight-based feature fusion method, different weights are first set for each class of features according to their importance. For example, four weights are set by a formula [not reproduced] over $e_i$, $w_i$, $p_i$ and $q_i$,

where $e_i$ represents a concept feature, $w_i$ a word feature, $p_i$ a concept relationship feature and $q_i$ an attribute-value feature, and $\alpha_i \in [0, 1]$ subject to a further constraint [not reproduced].

Next, the weight-based feature values are calculated by combining the weights obtained above with the feature vectors [formula not reproduced],

where $CE_i$ represents the weight-based concept feature, $WE_i$ the weight-based word feature, $RE_i$ the weight-based concept relationship feature and $VE_i$ the weight-based attribute-value feature.

Accordingly, the weight-based concept features, word features, concept relationship features and attribute-value features are fused and used as the input of the parallel adaptive convolutional neural network to train the disease prediction model.
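The weight-based fusion can be sketched as follows. Because the weighting formulas are not available here, the sketch assumes that the four weights are obtained by normalizing non-negative importance scores so that they sum to 1, and that each feature vector is scaled by its weight before concatenation. Both choices, the scores and the function names are illustrative assumptions.

```python
def normalize_weights(raw_scores):
    """Scale non-negative importance scores so they lie in [0, 1] and sum to 1."""
    total = sum(raw_scores)
    if total <= 0:
        raise ValueError("importance scores must have a positive sum")
    return [score / total for score in raw_scores]


def weighted_fusion(vectors, weights):
    """Scale each feature vector by its weight, then concatenate the results."""
    if len(vectors) != len(weights):
        raise ValueError("one weight is required per feature vector")
    fused = []
    for vec, alpha in zip(vectors, weights):
        fused.extend(alpha * x for x in vec)
    return fused


# Hypothetical importance scores for concept, word, relationship, attribute-value.
weights = normalize_weights([1.0, 1.0, 3.0, 5.0])
features = [[1.0, 2.0], [2.0, 4.0], [1.0, 1.0], [2.0, 0.0]]
fused = weighted_fusion(features, weights)
print(weights)
print(fused)
```

Giving the coarser-grained features larger weights, as in this example, is one way to emphasize the information that distinguishes highly similar categories.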

Based on any of the above embodiments, the obtaining a parallel adaptive convolutional neural network model, inputting the fusion characteristics of the multiple diseases into the parallel adaptive convolutional neural network model for training, and obtaining the disease prediction model specifically includes:

Specifically, a parallel adaptive convolutional neural network is used to train the disease prediction model, as follows.

Convolutional layer: consider a sentence $s_i = \{w_1, w_2, \ldots, w_m\}$, where $w_j$ is the j-th word vector of the sentence $s_i$ and $h$ is the length of the convolution kernel, meaning that the kernel covers $h$ words. The convolution operation for the j-th word is:

$$c_j = f(k \cdot w_{j:j+h-1} + b)$$

where $k$ is the convolution kernel matrix, $b$ is the bias, $w_{j:j+h-1}$ denotes the concatenation of the word vectors from the j-th to the (j+h-1)-th, $f(\cdot)$ is a non-linear activation function, usually ReLU, and $c_j$ is the feature map value after the convolution operation. The feature map of the sentence $s_i$ is formed from these values [formula not reproduced].

Suppose there are $l$ convolution kernels of length $h$, indexed by $i$ with $1 < i < l$; the feature maps are then expressed as [formula not reproduced].
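The convolution over a sentence can be written out in plain Python to make the sliding window explicit. This is a minimal sketch for a single kernel without zero padding, assuming the kernel is an h-by-d matrix applied to h consecutive d-dimensional word vectors; the example values are made up.

```python
def relu(x):
    """Rectified linear unit."""
    return max(0.0, x)


def convolve_sentence(word_vectors, kernel, bias, activation=relu):
    """Slide an h x d kernel over the sentence and return the feature map."""
    h = len(kernel)
    feature_map = []
    for j in range(len(word_vectors) - h + 1):
        window = word_vectors[j:j + h]
        # Element-wise product of kernel and window, summed over all entries.
        total = sum(
            k_val * w_val
            for k_row, w_row in zip(kernel, window)
            for k_val, w_val in zip(k_row, w_row)
        )
        feature_map.append(activation(total + bias))
    return feature_map


sentence = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [2.0, 0.0]]  # m = 4, d = 2
kernel = [[1.0, -1.0], [0.5, 0.5]]                            # h = 2
feature_map = convolve_sentence(sentence, kernel, bias=0.0)
print(feature_map)
```

Without padding, a sentence of m words and a kernel of length h yield m - h + 1 feature map values, here three.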

Parallel adaptive pooling layer: the sentence is first divided into different parts according to the differences between the concept relationship features and the attribute-value features, and the two kinds of features are learned in parallel.

Concept relationship features: according to the positions of the concept pair in the sentence, $c_j$ is divided into three parts $[c_{j1}, c_{j2}, c_{j3}]$. The most important information in the sentence is then obtained by taking the maximum value of each part [formula not reproduced].

Finally, all the feature maps after the convolution operation are concatenated to obtain the feature vector of the sentence $s_i$, $b_{sp} = \mathrm{ReLU}(v)$.

Attribute-value features: according to the position of the concept in the sentence, $c_j$ is divided into two parts $[c_{j1}, c_{j2}]$. The information about the value most related to the concept relationship in the sentence is then obtained by taking the maximum value of each part [formula not reproduced].

All the feature maps after the convolution operation are concatenated to obtain the feature vector $b_{sq}$ of the sentence $s_i$. Finally, the sentence feature vectors of the concept relationship and the attribute-value are combined to obtain the final feature vector of the sentence [formula not reproduced].
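The two pooling branches can be sketched as a piecewise max pooling over one feature map. The cut points, the handling of empty segments and the final combination by concatenation are assumptions for illustration; in particular, how the cut indices are derived from the concept positions is not specified here.

```python
def piecewise_max_pool(feature_map, cut_points):
    """Split a feature map at the given indices and take the max of each part."""
    bounds = [0] + sorted(cut_points) + [len(feature_map)]
    pooled = []
    for start, end in zip(bounds, bounds[1:]):
        segment = feature_map[start:end]
        # An empty segment (concept at the sentence edge) contributes 0.0.
        pooled.append(max(segment) if segment else 0.0)
    return pooled


feature_map = [0.2, 0.9, 0.1, 0.4, 0.7, 0.3]

# Concept relationship branch: two concept positions give three parts.
relation_pooled = piecewise_max_pool(feature_map, cut_points=[2, 4])

# Attribute-value branch: one concept position gives two parts.
attribute_pooled = piecewise_max_pool(feature_map, cut_points=[3])

# Assumed combination of the two branches by concatenation.
sentence_vector = relation_pooled + attribute_pooled
print(relation_pooled, attribute_pooled, sentence_vector)
```

Compared with a single global maximum, pooling each part separately keeps some information about where in the sentence a strong response occurred relative to the concepts.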

Finally, the extracted concept relationship features, attribute-value features, concept features and word features are combined and fed into the classification layer of the parallel adaptive convolutional neural network, and the final classification result of the disease type is generated by a softmax classifier. Depending on the feature fusion method, the output of the classifier is given as follows.

(1) Direct vector concatenation of the extracted features:

$$O = \mathrm{softmax}(W_o h_i + b_s)$$

$$r_s = \arg\max(O)$$

where $e_i$ is the concept feature, $w_i$ the word feature, $p_i$ the concept relationship feature, $q_i$ the attribute-value feature, $b_s$ the feature vector of the sentence $s_i$ and $W_o$ the weight; $O \in [1, n]$ indicates that there are n relationship types, and $r_s$ is the final relationship category label.

(2) Weight-based feature fusion:

$$D = \mathrm{softmax}(W_o f_i + b_s)$$

$$r_s = \arg\max(D)$$

where $CE_i$ represents the weight-based concept feature, $WE_i$ the weight-based word feature, $RE_i$ the weight-based concept relationship feature and $VE_i$ the weight-based attribute-value feature, $b_s$ is the feature vector of the sentence $s_i$ and $W_o$ is the weight; $D \in [1, n]$ indicates that there are n relationship types, and $r_s$ is the final relationship category label.
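The classification step, a linear map followed by softmax and argmax, can be sketched in plain Python. The weight matrix, the additive term and the fused input vector below are made-up values; the sketch treats the additive term as an ordinary per-class bias, which is an assumption.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def classify(features, weight_rows, biases):
    """Return (probabilities, predicted class index) for one fused feature vector."""
    scores = [
        sum(w * x for w, x in zip(row, features)) + b
        for row, b in zip(weight_rows, biases)
    ]
    probs = softmax(scores)
    return probs, max(range(len(probs)), key=probs.__getitem__)


fused = [0.5, 1.0, -0.5]                                   # fused feature vector
W_o = [[1.0, 0.0, 0.0], [0.0, 2.0, 0.0], [0.0, 0.0, 1.0]]  # one row per class
b = [0.0, 0.0, 0.0]
probs, label = classify(fused, W_o, b)
print(probs, label)
```

The same function applies to both fusion variants, since only the input vector differs between them.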

The disease prediction system based on multi-granularity feature fusion provided by the embodiment of the invention is described below. The system described below and the method described above may be referred to in correspondence with each other.

Fig. 3 is a schematic structural diagram of a disease prediction system based on multi-granularity feature fusion according to an embodiment of the present invention, as shown in fig. 3, including: an acquisition module 31 and a processing module 32; wherein:

the obtaining module 31 is used for obtaining fusion characteristics based on the disease to be predicted; the processing module 32 is configured to input the fusion features into a disease prediction model obtained through training, so as to obtain a classification result of a disease type; the disease prediction model is obtained by training fusion characteristics of various diseases based on a parallel self-adaptive convolutional neural network model.

Fig. 4 illustrates the physical structure of an electronic device. As shown in Fig. 4, the electronic device may include a processor 410, a communication interface 420, a memory 430 and a communication bus 440, wherein the processor 410, the communication interface 420 and the memory 430 communicate with each other via the communication bus 440. The processor 410 may invoke logic instructions in the memory 430 to perform a disease prediction method based on multi-granularity feature fusion, the method comprising: acquiring fusion characteristics based on a disease to be predicted; inputting the fusion characteristics into a disease prediction model obtained by training to obtain a classification result of the disease types; the disease prediction model is obtained by training fusion characteristics of various diseases based on a parallel self-adaptive convolutional neural network model.

In addition, the logic instructions in the memory 430 may be implemented in the form of software functional units and stored in a computer readable storage medium when the software functional units are sold or used as independent products. Based on such understanding, the technical solution of the present invention may be embodied in the form of a software product, which is stored in a storage medium and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device) to execute all or part of the steps of the method according to the embodiments of the present invention. And the aforementioned storage medium includes: a U-disk, a removable hard disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk or an optical disk, and other various media capable of storing program codes.

In another aspect, an embodiment of the present invention further provides a computer program product, where the computer program product includes a computer program stored on a non-transitory computer-readable storage medium, the computer program includes program instructions, and when the program instructions are executed by a computer, the computer is capable of executing the method for predicting diseases based on multi-granularity feature fusion provided by the above-mentioned method embodiments, where the method includes: acquiring fusion characteristics based on a disease to be predicted; inputting the fusion characteristics into a disease prediction model obtained by training to obtain a classification result of the disease types; the disease prediction model is obtained by training fusion characteristics of various diseases based on a parallel self-adaptive convolutional neural network model.

In yet another aspect, an embodiment of the present invention further provides a non-transitory computer-readable storage medium, on which a computer program is stored, where the computer program, when executed by a processor, performs the disease prediction method based on multi-granularity feature fusion provided in the foregoing embodiments, the method including: acquiring fusion characteristics based on a disease to be predicted; inputting the fusion characteristics into a disease prediction model obtained by training to obtain a classification result of the disease types; the disease prediction model is obtained by training fusion characteristics of various diseases based on a parallel self-adaptive convolutional neural network model.

The above-described embodiments of the apparatus are merely illustrative, and the units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution of the present embodiment. One of ordinary skill in the art can understand and implement it without inventive effort.

Through the above description of the embodiments, those skilled in the art will clearly understand that each embodiment can be implemented by software plus a necessary general hardware platform, and certainly can also be implemented by hardware. With this understanding in mind, the above-described technical solutions may be embodied in the form of a software product, which can be stored in a computer-readable storage medium such as ROM/RAM, magnetic disk, optical disk, etc., and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device, etc.) to execute the methods described in the embodiments or some parts of the embodiments.

Finally, it should be noted that: the above examples are only intended to illustrate the technical solution of the present invention, but not to limit it; although the present invention has been described in detail with reference to the foregoing embodiments, it will be understood by those of ordinary skill in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some technical features may be equivalently replaced; and such modifications or substitutions do not depart from the spirit and scope of the corresponding technical solutions of the embodiments of the present invention.

Claims

1. A disease prediction method based on multi-granularity feature fusion is characterized by comprising the following steps:

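acquiring fusion characteristics based on a disease to be predicted;

inputting the fusion characteristics into a disease prediction model obtained by training to obtain a classification result of the disease types; the disease prediction model is obtained by training fusion characteristics of various diseases based on a parallel self-adaptive convolutional neural network model.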

2. The method for predicting diseases based on multi-granularity feature fusion according to claim 1, wherein the disease prediction model is obtained by the following steps:

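acquiring a text to be processed, and preprocessing the text to be processed to obtain a preprocessed text;

extracting the features of the preprocessed text to obtain extracted features;

fusing the extracted features based on multi-granularity features to obtain fused features of the various diseases;

and acquiring a parallel self-adaptive convolutional neural network model, inputting the fusion characteristics of the various diseases into the parallel self-adaptive convolutional neural network model for training to obtain the disease prediction model.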

3. The disease prediction method based on multi-granularity feature fusion as claimed in claim 2, wherein the obtaining of the text to be processed and the preprocessing of the text to be processed to obtain the preprocessed text specifically comprises:

manually labeling the medical text data according to the target category to be predicted, and loading a domain ontology, to obtain the text to be processed;

and segmenting the text to be processed into Chinese character strings according to punctuation marks, numbers and spaces, and removing stop words to obtain the preprocessed text.

4. The multi-granularity feature fusion-based disease prediction method according to claim 2, wherein the extracting features from the preprocessed text to obtain extracted features specifically comprises:

performing feature extraction on the preprocessed text through concept feature extraction, word feature extraction, concept relationship feature extraction and attribute-value feature extraction to obtain the extracted features.

5. The multi-granularity feature fusion-based disease prediction method according to claim 4, wherein the extracting features are obtained by performing feature extraction on the preprocessed text through conceptual feature extraction, word feature extraction, conceptual relationship feature extraction, and attribute and value feature extraction, and specifically comprises:

mapping the preprocessed text to the domain ontology to obtain text data, segmenting the text data into a semantic set by a maximum matching method, converting the concept self features and concept type features that can be matched from the domain ontology into vector form using a word2vec model, and combining the concept self features with the concept type features to extract the concept features;

converting the semantic units for which no matching concept can be found in the domain ontology into vector form using the word2vec model, to extract the word features;

extracting relation trigger words between concepts by combining the word features, position features and negation word features, and combining the concept features with the relation trigger words to represent the concept relationship features;

and further representing the concept features as disease-time results, whose values are of numerical type, and detection-examination results, whose values are of numerical type or category type, to obtain the attribute-value features.

6. The multi-granularity feature fusion-based disease prediction method according to claim 2, wherein the fusion of the extracted features based on the multi-granularity features to obtain the fusion features of the plurality of diseases specifically comprises:

directly performing vector concatenation on the extracted features when the categories of the prediction target differ greatly, or fusing the extracted features with a weight-based feature fusion method when the categories of the prediction target are highly similar, to obtain the fusion features of the multiple diseases.

7. The method according to claim 5, wherein the obtaining of the parallel adaptive convolutional neural network model and the inputting of the fusion features of the multiple diseases into the parallel adaptive convolutional neural network model for training to obtain the disease prediction model specifically comprises:

segmenting a sentence into different parts according to the differences between the concept relationship features and the attribute-value features, to extract the semantic information contained in the sentence;

and fusing the semantic information with the concept features and the word features to train the parallel adaptive convolutional neural network model, applying dropout and zero padding in the convolutional layer to preserve the validity of the sentence, to obtain the disease prediction model.

8. A disease prediction system based on multi-granular feature fusion, comprising:

the acquisition module is used for acquiring fusion characteristics based on the disease to be predicted;

the processing module is used for inputting the fusion characteristics to a disease prediction model obtained by training to obtain a classification result of the disease types; the disease prediction model is obtained by training fusion characteristics of various diseases based on a parallel self-adaptive convolutional neural network model.

9. An electronic device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, wherein the processor when executing the program implements the steps of the multi-granular feature fusion based disease prediction method according to any one of claims 1 to 7.

10. A non-transitory computer readable storage medium, on which a computer program is stored, which, when being executed by a processor, performs the steps of the multi-granular feature fusion based disease prediction method according to any one of claims 1 to 7.