Knowledge mapping analysis method and device based on medical information
Technical field
The present invention relates to the technical fields of data processing, analyze more particularly, to a kind of knowledge mapping based on medical information
Method and device.
Background technique
Knowledge mapping is also known as mapping knowledge domains, be explicit knowledge's development process and structural relation it is a series of it is various not
With figure, knowledge resource and its carrier described with visualization technique, excavate, analysis, building, draw and explicit knowledge and they
Between connect each other.By by the theory of the subjects such as applied mathematics, graphics, Information Visualization Technology, information science and just
Method and the methods of meterological citation analysis, Co-occurrence Analysis combine, and the core of subject is visually shown using visual map
Structure, developing history, Disciplinary Frontiers and whole Knowledge framework reach the modern theory of Multidisciplinary Integration purpose.For disciplinary study
Practical, valuable reference is provided.Knowledge mapping is the semantic knowledge-base of structuring, for describing physics with sign format
Concept and its correlation in the world, basic composition unit are entity-relationship-entity triple and entity and its phase
Attribute-value pair is closed, is interconnected between entity by relationship, the webbed structure of knowledge of structure.
Currently, knowledge mapping is very widely used, in the field of medicine, construct medical knowledge map, can by illness,
Complicated relationship between disease and diagnosis and treatment means is built into database by knowledge mapping, so as to be medical staff
Good auxiliary diagnosis means are provided.Based on this, occurs the application of a large amount of online question and answer at this stage, these applications receive patient
Information description, return to the disease of a patient in conjunction with the domain knowledge map constructed in advance using Sentence analysis and related algorithm
Sick diagnostic result.
The problem of existing technical solution is generally by search question and answer library with patient's description similarity highest, returns to institute
Corresponding disease outcome.This method cannot reflect conditions of patients completely, and obtained disease outcome accuracy is poor, and can only obtain
The disease that patient is suffered from can not predict the case where disease.
Summary of the invention
In view of this, the purpose of the present invention is to provide a kind of knowledge mapping analysis method and dress based on medical information
It sets, it is poor to alleviate the disease outcome accuracy that existing method obtains, and the technology that can not be predicted the case where disease is asked
Topic.
In a first aspect, the embodiment of the invention provides a kind of knowledge mapping analysis method based on medical information, the side
Method includes:
It obtains and segments word included in patient medical information, wherein the patient medical information includes: the base of patient
This information, the state of an illness information of patient, the treatment information of patient, the social information of patient;
The participle word is analyzed based on default medical knowledge base, obtains target corresponding to the participle word
Standard medical data;
The target criteria medical data is matched with default disease knowledge map, wherein the default disease is known
Know the relevance map that map is various diseases factor associated therewith;
The disease of the patient is determined according to matching result;
Disease based on the patient medical information and the patient constructs the feature vector of the patient, wherein described
Feature vector includes: genius morbi vector, treats feature vector, relationship characteristic vector;
Deterioration probability calculation is carried out to described eigenvector by disease progression machine learning model, obtains the patient's
Disease progression probability, wherein the disease progression machine learning model is to be carried out by K-MEANS algorithm to a large amount of feature vectors
The model that training obtains.
With reference to first aspect, the embodiment of the invention provides the first possible embodiments of first aspect, wherein base
Include: in the feature vector that the disease of the patient medical information and the patient construct the patient
To the essential information of the patient, the state of an illness information of the patient and the disease of the patient are calculated, and are obtained
To the genius morbi vector;
The disease for the treatment of information and the patient to the patient calculates, and obtains the treatment feature vector;
The social information of disease and the patient to the patient calculates, and obtains the relationship characteristic vector.
With reference to first aspect, the embodiment of the invention provides second of possible embodiments of first aspect, wherein right
The essential information of the patient, the state of an illness information of the patient and the disease of the patient calculate, and obtain the disease
Feature vector includes:
Genius morbi is determined according to the disease of the patient;
The genius morbi range of disease progression crowd and the disease spy of stable disease crowd are obtained based on the genius morbi
Levy range;
The genius morbi of essential information based on the patient and patient described in the state of an illness information extraction of the patient;
In conjunction with the genius morbi range of the disease progression crowd, the genius morbi range of the stable disease crowd and
The genius morbi of the patient determines Crowds Distribute belonging to the patient;
The genius morbi vector is calculated based on Crowds Distribute belonging to the patient.
With reference to first aspect, the embodiment of the invention provides the third possible embodiments of first aspect, wherein right
The treatment information of the patient and the disease of the patient calculate, and obtain the treatment feature vector and include:
Therapeutic scheme is determined according to the disease of the patient, wherein includes multiple treatment features in the therapeutic scheme;
The treatment feature of patient described in treatment information extraction based on the patient;
The treatment feature of the patient is matched with the treatment feature in the therapeutic scheme;
The treatment feature vector of the patient is determined according to matching result.
With reference to first aspect, the embodiment of the invention provides the 4th kind of possible embodiments of first aspect, wherein right
The social information of the disease of the patient and the patient calculate, and obtain the relationship characteristic vector and include:
The social information of the disease of the patient and the patient are associated with the foundation of historical relation map, wherein described
Historical relation map is the relation map obtained according to historic patient medical information;
The historical relation map is calculated by community discovery algorithm, obtains social group belonging to the patient
Body;
The weighted value that side in the historical relation map is updated by the disease progression risk of the social groups, obtains more
Relation map after new, wherein the disease progression risk of the social groups is to be obtained according to the historic patient medical information
's;
The updated relation map is calculated by Random Walk Algorithm and node2vector, is obtained described
Relationship characteristic vector.
With reference to first aspect, the embodiment of the invention provides the 5th kind of possible embodiments of first aspect, wherein base
The participle word is analyzed in default medical knowledge base, obtains target criteria medicine number corresponding to the participle word
According to including:
It is obtained and candidate criteria medical data corresponding to the participle word based on the default medical knowledge base;
According in the incidence relation and the medical knowledge base of the participle word between each standard medical data
Topological relation, the target criteria medical data is determined from the candidate criteria medical data.
With reference to first aspect, the embodiment of the invention provides the 6th kind of possible embodiments of first aspect, wherein
After obtaining the disease progression probability of the patient, the method also includes:
It is that the patient recommends therapeutic regimen according to the disease progression probability of the patient.
With reference to first aspect, the embodiment of the invention provides the 7th kind of possible embodiments of first aspect, wherein obtains
The participle word is taken included in patient medical information to include:
Obtain the patient medical information;
The cluster probability of word and word in the patient medical information is counted using condition random field algorithm;
The patient medical information is segmented according to the cluster probability, obtains the participle word.
Second aspect, the embodiment of the invention also provides a kind of knowledge mapping analytical equipment based on medical information are described
Device includes:
Module is obtained, segments word included in patient medical information for obtaining, wherein the patient medical information
It include: the essential information of patient, the state of an illness information of patient, the treatment information of patient, the social information of patient;
Analysis module obtains the participle word for analyzing based on default medical knowledge base the participle word
Target criteria medical data corresponding to language;
Matching module, for matching the target criteria medical data with default disease knowledge map, wherein institute
State the relevance map that default disease knowledge map is various diseases factor associated therewith;
Determining module, for determining the disease of the patient according to matching result;
Construct module, for the disease based on the patient medical information and the patient construct the feature of the patient to
Amount, wherein described eigenvector includes: genius morbi vector, treats feature vector, relationship characteristic vector;
Computing module, for carrying out deterioration probability calculation to described eigenvector by disease progression machine learning model,
Obtain the disease progression probability of the patient, wherein the disease progression machine learning model is by K-MEANS algorithm to big
The model that measure feature vector is trained.
In conjunction with second aspect, the embodiment of the invention provides the first possible embodiments of second aspect, wherein institute
Stating building module includes:
First computing unit, for the essential information to the patient, the state of an illness information of the patient and the patient
Disease calculated, obtain the genius morbi vector;
Second computing unit, the disease for treatment information and the patient to the patient are calculated, are obtained
The treatment feature vector;
Third computing unit, the social information for disease and the patient to the patient calculate, and obtain institute
State relationship characteristic vector.
The embodiment of the present invention brings following the utility model has the advantages that a kind of knowledge mapping analysis method and dress based on medical information
It sets, segments word included in patient medical information this method comprises: obtaining, wherein patient medical information includes: patient's
Essential information, the state of an illness information of patient, the treatment information of patient, the social information of patient;Based on default medical knowledge base to point
Word word is analyzed, and target criteria medical data corresponding to participle word is obtained;By target criteria medical data and preset
Disease knowledge map is matched, wherein default disease knowledge map is the relevance map of various diseases factor associated therewith;
The disease of patient is determined according to matching result;The feature vector of patient is constructed based on the disease of patient medical information and patient,
In, feature vector includes: genius morbi vector, treats feature vector, relationship characteristic vector;Pass through disease progression machine learning mould
Type carries out deterioration probability calculation to feature vector, obtains the disease progression probability of patient, wherein disease progression machine learning model
For the model being trained by K-MEANS algorithm to a large amount of feature vectors.
In the prior art, generally by search question and answer library with patient's description similarity highest the problem of, it is right to return to institute
The disease outcome answered, the disease outcome accuracy that this method obtains is poor, and can not predict the case where disease.With it is existing
Method is compared, can be according to patient medical information in the knowledge mapping analysis method based on medical information of the embodiment of the present invention
In participle word determine its corresponding target criteria medical data, then target criteria medical data and default disease are known again
Know map to be matched, the disease of patient is determined, after obtaining disease outcome, additionally it is possible to disease and patient doctor based on patient
The feature vector for learning information architecture patient carries out feature vector eventually by disease progression machine learning model to deteriorate probability meter
It calculates, obtains the disease progression probability of patient.This method really considers patient medical information, and obtained disease outcome is more quasi-
Really, and the case where disease, can be predicted, obtains the probability of disease progression, alleviate the disease that existing method obtains
As a result accuracy is poor, and the technical issues of can not predict the case where disease.
Other features and advantages of the present invention will illustrate in the following description, also, partly become from specification
It obtains it is clear that understand through the implementation of the invention.The objectives and other advantages of the invention are in specification, claims
And specifically noted structure is achieved and obtained in attached drawing.
To enable the above objects, features and advantages of the present invention to be clearer and more comprehensible, preferred embodiment is cited below particularly, and cooperate
Appended attached drawing, is described in detail below.
Detailed description of the invention
It, below will be to specific in order to illustrate more clearly of the specific embodiment of the invention or technical solution in the prior art
Embodiment or attached drawing needed to be used in the description of the prior art be briefly described, it should be apparent that, it is described below
Attached drawing is some embodiments of the present invention, for those of ordinary skill in the art, before not making the creative labor
It puts, is also possible to obtain other drawings based on these drawings.
Fig. 1 is a kind of flow chart of the knowledge mapping analysis method based on medical information provided in an embodiment of the present invention;
Fig. 2 is the flow chart that word is segmented included in acquisition patient medical information provided in an embodiment of the present invention;
Fig. 3 is the feature vector provided in an embodiment of the present invention that patient is constructed based on the disease of patient medical information and patient
Flow chart;
Fig. 4 is the essential information provided in an embodiment of the present invention to patient, the state of an illness information of patient and the disease of patient
It is calculated, obtains the flow chart of genius morbi vector;
Fig. 5 calculates the treatment information of patient and the disease of patient to be provided in an embodiment of the present invention, is controlled
Treat the flow chart of feature vector;
Fig. 6 calculates the disease of patient and the social information of patient to be provided in an embodiment of the present invention, obtains relationship
The flow chart of feature vector;
Fig. 7 is a kind of structural block diagram of the knowledge mapping analytical equipment based on medical information provided in an embodiment of the present invention.
Icon:
11- obtains module;12- analysis module;13- matching module;14- determining module;15- constructs module;16- calculates mould
Block.
Specific embodiment
In order to make the object, technical scheme and advantages of the embodiment of the invention clearer, below in conjunction with attached drawing to the present invention
Technical solution be clearly and completely described, it is clear that described embodiments are some of the embodiments of the present invention, rather than
Whole embodiments.Based on the embodiments of the present invention, those of ordinary skill in the art are not making creative work premise
Under every other embodiment obtained, shall fall within the protection scope of the present invention.
For convenient for understanding the present embodiment, first to a kind of based on medical information disclosed in the embodiment of the present invention
Knowledge mapping analysis method describes in detail.
Embodiment one:
A kind of knowledge mapping analysis method based on medical information, with reference to Fig. 1, this method comprises:
Word is segmented included in S102, acquisition patient medical information, wherein patient medical information includes: patient's
Essential information, the state of an illness information of patient, the treatment information of patient, the social information of patient;
In embodiments of the present invention, the executing subject for being somebody's turn to do the knowledge mapping analysis method based on medical information is server,
Server can obtain and segment word included in patient medical information.Specifically, patient medical data includes: the basic of patient
Information, the state of an illness information of patient, the treatment information of patient, the social information of patient.
The essential information of patient includes: the name of patient, the gender of patient, the age of patient, the identification card number etc. of patient
Deng the embodiment of the present invention is to it without concrete restriction.
The state of an illness information of patient includes: the symptom of patient, the medical history of patient, the duration of symptom, and cause symptom can
The event of doubting etc., the embodiment of the present invention is to it without concrete restriction.
The treatment information of patient includes: the time for the treatment of patients, mode (such as drug therapy, the treatment of needle liquid, gastroscope
Etc.), therapeutic scheme (such as drug noun eaten in drug therapy, dosage etc.) etc., the embodiment of the present invention to its not into
Row concrete restriction.
Content is again described in detail the process for obtaining participle word included in patient medical information below, herein
It repeats no more.
S104, participle word is analyzed based on default medical knowledge base, obtains target mark corresponding to participle word
Quasi- medical data;
After obtaining participle word included in patient medical information, inventor is not special in view of certain participle words
The medical terminology of industry, in order to which disease is more accurately determined and predicted, so needing based on default medical knowledge base pair
Participle word is analyzed, and target criteria medical data corresponding to participle word is obtained.Hereinafter the process is carried out again detailed
Thin description, details are not described herein.
S106, target criteria medical data is matched with default disease knowledge map, wherein default disease knowledge figure
Spectrum is the relevance map of various diseases factor associated therewith;
After obtaining target criteria medical data, by target criteria medical data and the progress of default disease knowledge map
Match, wherein default disease knowledge map is the relevance map of various diseases factor associated therewith.
S108, the disease that patient is determined according to matching result;
After the completion of matching, the disease of patient is determined according to matching result.Specifically, the target criteria medicine number such as obtained
According to being dizzy, rhinorrhea, cough is had a throat-ache, by these target criteria medical datas and the progress of default disease knowledge map
Match, is given a mark according to matched matching degree to possible disease, disease corresponding to score highest is determined as patient's
Disease, the disease as determined by above-mentioned target criteria medical data are flu.The embodiment of the present invention is only carried out with a citing
Explanation, does not limit the method in the present invention.
S110, the feature vector based on the disease of patient medical information and patient building patient, wherein feature vector packet
Include: genius morbi vector treats feature vector, relationship characteristic vector;
After obtaining the disease of patient, be based further on the feature of the disease building patient of patient medical information and patient to
Amount.Since the social information of the essential information of patient, the state of an illness information of patient, the treatment information of patient, patient is not structure
Change data, be not used to the computation in later period, so needing these non-structured data carrying out structuring, obtains patient
Feature vector, specifically included genius morbi vector, treated feature vector, relationship characteristic vector.Hereinafter again to the process
It describes in detail.
S112, deterioration probability calculation is carried out to feature vector by disease progression machine learning model, obtains the disease of patient
Disease deteriorates probability, wherein disease progression machine learning model is to be trained by K-MEANS algorithm to a large amount of feature vectors
The model arrived.
After obtaining the feature vector of patient, deterioration probability is carried out to feature vector by disease progression machine learning model
It calculates, it will be able to obtain the disease progression probability of patient.
Disease progression machine learning model in the embodiment of the present invention be first pass through in advance K-MEANS algorithm to big measure feature to
The model being trained is measured, K-MEANS algorithm is one of unsupervised machine learning algorithm, is using K- in the present invention
When MEANS algorithm, K therein is obtained after being analyzed by Gaussian function a large amount of historic patient medical informations.
After the completion of the disease progression machine learning model is established, it will be able to carry out online real-time learning, be carried out to the model
Continuous iteration optimization.
In the prior art, generally by search question and answer library with patient's description similarity highest the problem of, it is right to return to institute
The disease outcome answered, the disease outcome accuracy that this method obtains is poor, and can not predict the case where disease.With it is existing
Method is compared, can be according to patient medical information in the knowledge mapping analysis method based on medical information of the embodiment of the present invention
In participle word determine its corresponding target criteria medical data, then target criteria medical data and default disease are known again
Know map to be matched, the disease of patient is determined, after obtaining disease outcome, additionally it is possible to disease and patient doctor based on patient
The feature vector for learning information architecture patient carries out feature vector eventually by disease progression machine learning model to deteriorate probability meter
It calculates, obtains the disease progression probability of patient.This method really considers patient medical information, and obtained disease outcome is more quasi-
Really, and the case where disease, can be predicted, obtains the probability of disease progression, alleviate the disease that existing method obtains
As a result accuracy is poor, and the technical issues of can not predict the case where disease.
In embodiments of the present invention, after obtaining the disease progression probability of patient, this method further include:
It is that patient recommends therapeutic regimen according to the disease progression probability of patient.
Specifically, be directed to different diseases, different disease progression probability there are corresponding therapeutic regimen,
After obtaining disease progression probability, it will be able to according to therapeutic regimen of the deterioration determine the probability corresponding to it, by the treatment side
Case is pushed to corresponding patient.
The brief introduction that above content carries out the method in the present invention below carries out the particular content being directed to
It is discussed in detail.
Optionally, with reference to Fig. 2, obtaining participle word included in patient medical information includes:
S201, patient medical information is obtained;
In embodiments of the present invention, patient medical information can be text information, or voice messaging, when for voice
When information, first voice messaging is identified, the text information corresponding to voice messaging is obtained, then carries out subsequent mistake again
Journey.
The essential information of patient is mainly that patient is defeated in such a way that voice input or text input when this method is used
The information entered, the state of an illness information of patient include the history state of an illness information and current state of an illness information of patient, and current state of an illness information is
The information that patient states in such a way that voice describes, or with the information etc. of graphic form publication, the history state of an illness of patient
Information is the history state of an illness information (such as passing case) relevant to current state of an illness information acquired automatically by interface,
Similarly, the treatment information of patient also includes the Current therapeutic information of patient, also includes the historical therapeutic information of patient, obtains
Mode is similar to above-mentioned mode, and details are not described herein.
The social information of patient is the information crawled by way of web crawlers, certainly, the embodiment of the present invention pair
It is without concrete restriction.
S202, the cluster probability that word and word in patient medical information are counted using condition random field algorithm;
It after obtaining patient medical information, needs first to pre-process patient medical information, specifically, pretreatment packet
Include: impurity elimination processing, classification processing, impurity elimination processing do not meet the information of preset field requirement to remove in patient medical information, point
Impurity elimination treated information is is classified by class processing according to classification belonging to information, such as symptom class, time class, medicine
Species etc..
There are many kinds of the methods segmented to patient medical information, and condition random field algorithm is used in the embodiment of the present invention
Segmented, count patient medical information in word and word cluster probability, then according to when join together probability be less than predetermined probabilities when,
Word segmentation processing is carried out between two words.
S203, patient medical information is segmented according to cluster probability, obtains participle word.
After obtaining participle word, participle word is analyzed based on default medical knowledge base.
In one optionally embodiment, participle word is analyzed based on default medical knowledge base, is segmented
Target criteria medical data corresponding to word includes:
(1) it obtains based on default medical knowledge base and segments candidate criteria medical data corresponding to word;
Specifically, the preparatory manual sorting of medical profession can be asked to go out or sort out by artificial intelligence technology various
Standard medical data in first input database, are then established search index, and relevant knowledge entry cluster is got up, are obtained
Medical knowledge base.It is, being stored with various standard medical data and its topological relation in medical knowledge base, group reticulates knot
Structure facilitates storage and calling.For the ease of inquiry, medical knowledge base can also increase intelligent word processing and search function.Its
In, generally there are two source, the clinical experiences of medical literature and a certain domain expert for medical knowledge.
Medical knowledge base can be understood as a primary medical knowledge map by putting and side forms, wherein point is used to
Each standard medical data in medical knowledge base are described, for example various symptoms, various organs and tissue etc.;Side is used to describe each mark
Relationship between quasi- medical data, such as " being located at ", "comprising" and " quantity ".
With the continuous development of the medical technologies such as medical test, medical image, clinical diagnosis and rehabilitation, medicine is known
Knowing constantly to enrich, and in order to give full play to the effect of medical knowledge base, can constantly acquire newborn medical knowledge, update medicine
Knowledge base.
Participle word is calculated at a distance from word in default medical knowledge base using distance metric method, when distance is less than in advance
If when value, extracting corresponding candidate criteria medical data (i.e. data of the distance less than preset value).The candidate criteria medicine number
According to quantity may be one, can also be able to be multiple.
It should be noted that the distance metric method includes but is not limited to COS distance, Euclidean distance or other distances
Measure.
(2) it is closed according to the topology in the incidence relation and medical knowledge base of participle word between each standard medical data
System determines target criteria medical data from candidate criteria medical data.
After obtaining candidate criteria medical data, further according in the incidence relation and medical knowledge base of participle word
Topological relation between each standard medical data determines target criteria medical data from candidate criteria medical data.
It is that the cooccurrence relation of statistical medicine word is realized when specific implementation.Assuming that the participle word in patient medical information
Both occurred symptom word " loss of appetite " in language, and disease word " gastritis " also occurred, and laboratory indexes " white blood cell count(WBC) is exceeded ",
With drug " sanjiu weitai capsules ", then it represents that " loss of appetite " and " gastritis " has cooccurrence relation, and " loss of appetite " and " white blood cell count(WBC) is super
Mark " has cooccurrence relation, and " loss of appetite " and " sanjiu weitai capsules " has cooccurrence relation.
Further, if " loss of appetite " and the co-occurrence quantity of " gastritis " are very high, " loss of appetite " and " stomach can be indicated
It is scorching " there is very strong cooccurrence relation.The very strong keyword of cooccurrence relation, referred to as neighbours.In this embodiment, the neighbours of " loss of appetite "
It is " gastritis ", " white blood cell count(WBC) is exceeded ", " sanjiu weitai capsules ".
Topological relation between each standard medical data can be inquired from default medical knowledge base.If it was found that symptom
Word " receive difference " and disease " gastritis ", symptom word " receive difference " and laboratory indexes " white blood cell count(WBC) is exceeded ", symptom word " receive difference " and medicine
Product " sanjiu weitai capsules ", between have direct or indirect company side, then can be determined as " receive difference " and " gastritis " has topological relation, " receives
Difference " has topological relation with " white blood cell count(WBC) is exceeded ", and " receive difference " has topological relation with " sanjiu weitai capsules ".
Point in each medical knowledge base for having topological relation is also referred to as neighbours.In the example above, medical knowledge base
In the neighbours of " receive difference " be " gastritis ", " white blood cell count(WBC) is exceeded ", " sanjiu weitai capsules ".
Thus judge, " receive difference " is much like with the neighbours of " loss of appetite ", so, " receive difference " and " loss of appetite " have compared with
Strong replacement possibility.
Why above-mentioned process is carried out, be because if when patient is described using voice, it is possible to the language recognized
Message breath is incorrect.For example word wrong in the example above " receive difference " is recognized, it can analyze to obtain by this way correct
Word.
The process of building patient characteristic vector is described below:
In one optionally embodiment, with reference to Fig. 3, based on the disease of patient medical information and patient building patient's
Feature vector includes:
S301, the essential information to patient, the state of an illness information of patient and the disease of patient calculate, and obtain disease spy
Levy vector;
Specifically, genius morbi vector is the essential information according to patient, the state of an illness information of patient and the disease of patient
It is calculated, hereinafter specific calculating process is described in detail again, details are not described herein.
S302, the treatment information of patient and the disease of patient are calculated, obtain medical treatment feature vector;
Specifically, treatment feature vector is calculated according to the treatment information of patient and the disease of patient, hereafter
In specific calculating process is described in detail again, details are not described herein.
S303, the disease of patient and the social information of patient are calculated, obtains relationship characteristic vector.
Specifically, relationship characteristic vector is calculated according to the disease of patient and the social information of patient, under same
The process is described in detail in Wen Zhongzai, and details are not described herein.
Optionally, with reference to Fig. 4, to the essential information of patient, the state of an illness information of patient and the disease of patient are calculated,
Obtaining genius morbi vector includes:
S401, genius morbi is determined according to the disease of patient;
Specifically, corresponding genius morbi is also different for different diseases, these genius morbis are specially medical expert
By the representative feature that the medical practice of many years is summarized, these genius morbis can be one, or
It is multiple, it is no longer illustrated in embodiments of the present invention.
The disease of S402, the genius morbi range that disease progression crowd is obtained based on genius morbi and stable disease crowd are special
Levy range;
After obtaining genius morbi, it is based further on genius morbi and obtains corresponding disease progression human diseases characteristic range
With the genius morbi range of stable disease crowd.These ranges are also that expert obtains by the summary of experience of many years.Certainly, also
It may include the genius morbi range of disease improvement crowd, be also no longer illustrated here.
The genius morbi of the state of an illness information extraction patient of S403, the essential information based on patient and patient;
After obtaining the genius morbi range of genius morbi range and stable disease crowd of above-mentioned disease progression crowd,
Extract the genius morbi of patient.
S404, the genius morbi range in conjunction with disease progression crowd, the genius morbi range of stable disease crowd and trouble
The genius morbi of person determines Crowds Distribute belonging to patient;
After obtaining the genius morbi of patient, by the genius morbi range of the genius morbi of patient and disease progression crowd,
The genius morbi range of stable disease crowd matches, and determines Crowds Distribute belonging to patient.
S405, genius morbi vector is calculated based on Crowds Distribute belonging to patient.
Specifically, point patient belonging to after Crowds Distribute, it will be able to obtain genius morbi vector.Specifically, patient
Genius morbi in the genius morbi range of disease progression crowd, the position of the genius morbi range of stable disease crowd is different,
Obtained numerical value is not also identical, which is genius morbi vector.
Optionally, with reference to Fig. 5, the disease for the treatment of information and patient to patient is calculated, the feature that obtains medical treatment to
Amount includes:
S501, therapeutic scheme is determined according to the disease of patient, wherein include multiple treatment features in therapeutic scheme;
In embodiments of the present invention, therapeutic scheme is determined according to the disease of patient, includes multiple treatments spies in therapeutic scheme
Sign.
S502, the treatment feature for treating information extraction patient based on patient;
After the treatment feature for the scheme of obtaining medical treatment, the treatment feature of the treatment information extraction patient based on patient.Specifically
, treatment feature include at least: treatment time, therapeutic modality and with treatment-related other information.For example, therapeutic modality is
Drug therapy, then just including the species characteristic of drug, the dose characteristics of drug in treatment feature.
S503, the treatment feature of patient is matched with the treatment feature in therapeutic scheme;
After the treatment feature for the scheme that obtains medical treatment and the treatment feature of patient, by the treatment feature and therapeutic scheme of patient
In treatment feature matched.
S504, the treatment feature vector that patient is determined according to matching result.
Optionally, with reference to Fig. 6, the social information of disease and patient to patient is calculated, and obtains relationship characteristic vector
Include:
S601, the social information of the disease of patient and patient are associated with the foundation of historical relation map, wherein historical relation
Map is the relation map obtained according to historic patient medical information;
In embodiments of the present invention, historic patient medical information is stored in server, so, it is corresponding that there is also history
Relation map.After the social information of the disease and patient that obtain patient, those information are associated with the foundation of historical relation map.
For example, there are a user A, also having a user A in the social information of patient in historical relation map, just capable of establishing trouble
The disease of person, the social information of patient and being associated with for historical relation map.
S602, historical relation map is calculated by community discovery algorithm, obtains social groups belonging to patient;
Establish with after being associated with of historical relation map, by community discovery algorithm to the historical relation figure established after being associated with
Spectrum is calculated, and social groups belonging to patient are obtained.
S603, the weighted value that side in historical relation map is updated by the disease progression risk of social groups, are updated
Relation map afterwards, wherein the disease progression risk of social groups is to be obtained according to historic patient medical information;
The weighted value that side in historical relation map is updated by the disease progression risk of social groups, obtains updated pass
It is map.Wherein, known to the disease progression risk of social groups.
S604, updated relation map is calculated by Random Walk Algorithm and node2vector, obtains relationship
Feature vector.
After obtaining updated relation map, by Random Walk Algorithm and node2vector to updated relationship
Map is calculated, and relationship characteristic vector is obtained.
Why such calculating is carried out, be because some diseases are group's bursty natures, which (closes
It is feature vector) it is also required to account for.
The invention discloses a kind of knowledge mapping analysis method and device based on medical information, to unsupervised machine learning
The technological innovation of model is applied in medical data processing, by the essential information to patient, the state of an illness information of patient, patient's
Information is treated, the vectorization of the social information of patient calculates, disease progression machine learning model is constructed by K-MEANS algorithm,
Disease progression probability can be determined in real time, patient and medical staff are carried out to the disease event of patient deep
Solution improves cure rate to take effective therapeutic scheme in time.
Embodiment two:
A kind of knowledge mapping analytical equipment based on medical information, with reference to Fig. 7, which includes:
Module 11 is obtained, segments word included in patient medical information for obtaining, wherein patient medical information's packet
It includes: the essential information of patient, the state of an illness information of patient, the treatment information of patient, the social information of patient;
It is right to obtain participle word institute for analyzing based on default medical knowledge base participle word for analysis module 12
The target criteria medical data answered;
Matching module 13, for matching target criteria medical data with default disease knowledge map, wherein default
Disease knowledge map is the relevance map of various diseases factor associated therewith;
Determining module 14, for determining the disease of patient according to matching result;
Module 15 is constructed, the feature vector for the disease building patient based on patient medical information and patient, wherein special
Levying vector includes: genius morbi vector, treats feature vector, relationship characteristic vector;
Computing module 16 is obtained for carrying out deterioration probability calculation to feature vector by disease progression machine learning model
To the disease progression probability of patient, wherein disease progression machine learning model is by K-MEANS algorithm to a large amount of feature vectors
The model being trained.
It, can be according in patient medical information in the knowledge mapping analytical equipment based on medical information of the embodiment of the present invention
Participle word determine its corresponding target criteria medical data, then again by target criteria medical data and default disease knowledge
Map is matched, and the disease of patient is determined, after obtaining disease outcome, additionally it is possible to disease and patient medical based on patient
The feature vector of information architecture patient carries out feature vector eventually by disease progression machine learning model to deteriorate probability meter
It calculates, obtains the disease progression probability of patient.The device really considers patient medical information, and obtained disease outcome is more quasi-
Really, and the case where disease, can be predicted, obtains the probability of disease progression, alleviate the disease that existing method obtains
As a result accuracy is poor, and the technical issues of can not predict the case where disease.
Optionally, building module includes:
First computing unit, for the essential information to patient, the state of an illness information of patient and the disease of patient are counted
It calculates, obtains genius morbi vector;
Second computing unit, the disease for treatment information and patient to patient calculate, and obtain medical treatment feature
Vector;
Third computing unit, the social information for disease and patient to patient calculate, obtain relationship characteristic to
Amount.
Optionally, the first computing unit includes:
First determines subelement, for determining genius morbi according to the disease of patient;
Subelement is obtained, for obtaining genius morbi range and the stable disease people of disease progression crowd based on genius morbi
The genius morbi range of group;
First extracts subelement, and the disease of the state of an illness information extraction patient for essential information and patient based on patient is special
Sign;
Second determines subelement, for combining the genius morbi range of disease progression crowd, the disease of stable disease crowd
Characteristic range and the genius morbi of patient, determine Crowds Distribute belonging to patient;
First computation subunit, for calculating genius morbi vector based on Crowds Distribute belonging to patient.
Optionally, the second computing unit includes:
Third determines subelement, for determining therapeutic scheme according to the disease of patient, wherein comprising multiple in therapeutic scheme
Treat feature;
Second extracts subelement, the treatment feature for the treatment information extraction patient based on patient;
Coupling subelement, for matching the treatment feature of patient with the treatment feature in therapeutic scheme;
4th determines subelement, for determining the treatment feature vector of patient according to matching result.
Optionally, third computing unit includes:
Association subelement is established, is closed for establishing the social information of the disease of patient and patient and historical relation map
Connection, wherein historical relation map is the relation map obtained according to historic patient medical information;
Second computation subunit obtains patient institute for calculating by community discovery algorithm historical relation map
The social groups of category;
Subelement is updated, the weight on side in historical relation map is updated for the disease progression risk by social groups
Value, obtains updated relation map, wherein the disease progression risk of social groups is to be obtained according to historic patient medical information
's;
Third computation subunit, for by Random Walk Algorithm and node2vector to updated relation map into
Row calculates, and obtains relationship characteristic vector.
Optionally, analysis module includes:
First acquisition unit, for obtaining based on default medical knowledge base and segmenting candidate criteria medicine corresponding to word
Data;
Determination unit, between each standard medical data in the incidence relation and medical knowledge base according to participle word
Topological relation, from candidate criteria medical data determine target criteria medical data.
Optionally, device further include:
Recommending module, for being that patient recommends therapeutic regimen according to the disease progression probability of patient.
Optionally, obtaining module includes:
Second acquisition unit, for obtaining patient medical information;
Statistic unit, for the cluster probability using word and word in condition random field algorithm statistics patient medical information;
Participle unit obtains participle word for segmenting according to cluster probability to patient medical information.
Specifically describing in the embodiment two can be with reference to the detailed content in above-described embodiment one, and details are not described herein.
The computer program of knowledge mapping analysis method and device based on medical information provided by the embodiment of the present invention
Product, the computer readable storage medium including storing program code, the instruction that said program code includes can be used for executing
Previous methods method as described in the examples, specific implementation can be found in embodiment of the method, and details are not described herein.
It is apparent to those skilled in the art that for convenience and simplicity of description, the system of foregoing description
It with the specific work process of device, can refer to corresponding processes in the foregoing method embodiment, details are not described herein.
In addition, in the description of the embodiment of the present invention unless specifically defined or limited otherwise, term " installation ", " phase
Even ", " connection " shall be understood in a broad sense, for example, it may be being fixedly connected, may be a detachable connection, or be integrally connected;It can
To be mechanical connection, it is also possible to be electrically connected;It can be directly connected, can also can be indirectly connected through an intermediary
Connection inside two elements.For the ordinary skill in the art, above-mentioned term can be understood at this with concrete condition
Concrete meaning in invention.
It, can be with if the function is realized in the form of SFU software functional unit and when sold or used as an independent product
It is stored in a computer readable storage medium.Based on this understanding, technical solution of the present invention is substantially in other words
The part of the part that contributes to existing technology or the technical solution can be embodied in the form of software products, the meter
Calculation machine software product is stored in a storage medium, including some instructions are used so that a computer equipment (can be a
People's computer, server or network equipment etc.) it performs all or part of the steps of the method described in the various embodiments of the present invention.
And storage medium above-mentioned includes: that USB flash disk, mobile hard disk, read-only memory (ROM, Read-Only Memory), arbitrary access are deposited
The various media that can store program code such as reservoir (RAM, Random Access Memory), magnetic or disk.
In the description of the present invention, it should be noted that term " center ", "upper", "lower", "left", "right", "vertical",
The orientation or positional relationship of the instructions such as "horizontal", "inner", "outside" be based on the orientation or positional relationship shown in the drawings, merely to
Convenient for description the present invention and simplify description, rather than the device or element of indication or suggestion meaning must have a particular orientation,
It is constructed and operated in a specific orientation, therefore is not considered as limiting the invention.In addition, term " first ", " second ",
" third " is used for descriptive purposes only and cannot be understood as indicating or suggesting relative importance.
Finally, it should be noted that embodiment described above, only a specific embodiment of the invention, to illustrate the present invention
Technical solution, rather than its limitations, scope of protection of the present invention is not limited thereto, although with reference to the foregoing embodiments to this hair
It is bright to be described in detail, those skilled in the art should understand that: anyone skilled in the art
In the technical scope disclosed by the present invention, it can still modify to technical solution documented by previous embodiment or can be light
It is readily conceivable that variation or equivalent replacement of some of the technical features;And these modifications, variation or replacement, do not make
The essence of corresponding technical solution is detached from the spirit and scope of technical solution of the embodiment of the present invention, should all cover in protection of the invention
Within the scope of.Therefore, the protection scope of the present invention shall be subject to the protection scope of the claims.