CN106682411A - Method for converting physical examination diagnostic data into disease label - Google Patents

Method for converting physical examination diagnostic data into disease label Download PDF

Info

Publication number
CN106682411A
CN106682411A CN201611198280.7A CN201611198280A CN106682411A CN 106682411 A CN106682411 A CN 106682411A CN 201611198280 A CN201611198280 A CN 201611198280A CN 106682411 A CN106682411 A CN 106682411A
Authority
CN
China
Prior art keywords
disease
physical examination
word
diagnostic data
converted
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201611198280.7A
Other languages
Chinese (zh)
Other versions
CN106682411B (en
Inventor
吴健
周立水
顾盼
邱奇波
邓水光
李莹
尹建伟
吴朝晖
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhejiang University ZJU
Original Assignee
Zhejiang University ZJU
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhejiang University ZJU filed Critical Zhejiang University ZJU
Priority to CN201611198280.7A priority Critical patent/CN106682411B/en
Publication of CN106682411A publication Critical patent/CN106682411A/en
Application granted granted Critical
Publication of CN106682411B publication Critical patent/CN106682411B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H50/00ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
    • G16H50/70ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for mining of medical data, e.g. analysing previous cases of other patients

Abstract

The invention discloses a method for converting physical examination diagnostic data into a disease label. The method comprises the steps that 1, the physical examination diagnostic data is subjected to text word segmentation and new word finding processing, and a word sequence is obtained; 2, in the word sequence, needed disease vocabularies are extracted, and disease names are obtained; 3, synonyms in the disease names are merged, and merged disease names are obtained; 4, the disease names are clustered, and a disease category tree is set up; 5, disease marking is carried out according to the merged disease names and the disease category tree, and the disease label is obtained. According to the method, multiple natural language processing technologies are adopted for mining disease results in the physical examination diagnostic data, a disease classification structure is extracted, code digitization is carried out, the disease name label is provided for physical examination records, and therefore the physical examination result is more directly described, and other big data medical treatment can be served.

Description

A kind of method that physical examination diagnostic data is converted into disease label
Technical field
The invention belongs to big data medical field, and in particular to a kind of side that physical examination diagnostic data is converted into disease label Method.
Background technology
With the improvement of people ' s living standards, the enhancing of health care consciousness, health examination is increasingly becoming a kind of social fashion, people Changed the traditional concept of the Cai Qu hospitals only when falling ill, have regular physical checkups and received by most people.Therefore, Hospital have accumulated the electronics physical examination data of magnanimity.However, in primary electronics physical examination data, there is substantial amounts of non-structured text This physical examination data.These text datas often unordered mixed and disorderly redundancy, comes from the hand of the doctor of the various customs of each level, has very Big limitation, it is impossible to used well.
Big data medical treatment is a current focus, refers to the data that medical field is analyzed by big data correlation technique And knowledge therein is excavated so as to increase substantially medical services.In the past few decades, big data influences dearly Each enterprise, including health care industry.Nowadays, substantial amounts of data can allow health care more efficient, more individual character Change.Natural language processing is the closely related technology of big data.Under the auxiliary of big data, natural language processing is also relied on greatly Data have more development, occur in that such as word2vec, LDA new technology.
At present, after each physical examination terminates, the physical examination diagnostic data of acquisition is the diagnosis text for having complex form, and Diagnostic result is not converted to the disease name of summary, it is difficult to by user's fast understanding and be difficult to by follow-up big data medical treatment should With being used to analyze medical data.Therefore, on the basis of it can obtain complete original physical examination diagnostic data, proposition is examined doctor The method that disconnected result is analyzed, and then generation of classifying and carries out digital numeralization at disease classification tree, then in actual data Feed back the disease label of physical examination diagnostic result, be that there is its practical Research Significance and application prospect.
The content of the invention
In view of above-mentioned, the present invention proposes a kind of method that physical examination diagnostic data is converted into disease label, main to use Various natural language processing techniques excavate disease outcome in physical examination diagnostic data, extract wherein classification of diseases structure and are encoded Digitlization, for physical examination record standard disease name label is provided so that more directly description physical examination result and can be other Big data medical services.
The purpose of the present invention is that doctor, to the diagnostic result of patient's physical examination data, finds out disease therein in analyzing physical examination data Name of disease claims, and carries out cluster search, sets up the classification tree of disease, so that provide one is converted into disease by original diagnostic result The method of label.
A kind of method that physical examination diagnostic data is converted into disease label, comprises the following steps:
(1) text participle and new word discovery treatment are carried out to physical examination diagnostic data, word sequence is obtained;
(2) in word sequence, the disease vocabulary required for extracting obtains disease name;
(3) merger is carried out to the synonym in disease name, obtains the disease name of merger;
(4) disease is clustered, sets up disease classification tree;
(5) disease name and disease classification tree according to merger carries out disease marker, obtains disease label.
Step (1) concretely comprises the following steps:
(1-1) carries out participle to diagnostic result primary in physical examination diagnostic data, obtains text word sequence;
(1-2) carries out word merging according to the relation between word in text word sequence, obtains new term;
(1-3) carries out participle again to new term, obtains word sequence.
In step (1-1), for diagnostic result primary in physical examination diagnostic data, first, dictionary creation is used Trie trees carry out participle matching, then, do further index to the result of its participle matching and count, and obtain maximum probability point Word result, referred to as text word sequence.
When carrying out participle matching using Trie trees, there can be various participle modes often for same character string, as It is preferred that, the maximum probability word segmentation result of character string is solved using Dynamic Programming.
In step (1-2), in physical examination diagnostic data, due to the particularity of physical examination diagnostic text, it is difficult to for the first time Participle is that can obtain optimal effect, and the related vocabulary of many diseases may be torn open by mistake, therefore, according to word in word segmentation result Between relation carry out word merging, obtain new term, be easy to subsequent use.Preferably, being referred to by counting point mutual information Mark to judge the relation between word.Point mutual information (Point-wise Mutual Information, PMI) is to weigh two The index of the correlation between things (such as two words), is widely used in machine learning field, its principle such as below equation institute Show:
In probability theory, if x is uncorrelated with y, p (x, y)=p (x) p (y).The two correlation is bigger, then p (x, y) Difference with p (x) p (y) is bigger.Equally, the conditional probability p (x | y) that x occurs in the case where y occurs occurs in itself divided by x Probability p (x) also illustrate that degrees of correlation of the x with y.For adjacent word, for example:The probability for P (thyroid nodule) occur is long-range When the probability of P (thyroid gland) P (tubercle), it is believed that thyroid nodule is the neologisms in a corpus.
In step (2), the keyword (disease name) in word sequence is extracted using statistical method TF-IDF, and combination stops With dictionary, the as basic label for obtaining physical examination diagnostic result text, disease name.Statistical method TF-IDF (term Frequency-inverse document frequency) it is used to assess a words for a text set or a corpus In a copy of it text significance level.The importance of words is directly proportional increase with the number of times that it occurs in the text, but Can be inversely proportional decline with the frequency that it occurs in corpus simultaneously.Word frequency (term frequency, TF) refers to some The number of times that given word occurs in the text.Inverse text frequency (inverse document frequency, IDF) is by total Divided by the number of the text comprising the word, then the business that will be obtained takes the logarithm and obtains text number.
In step (3), there is substantial amounts of synonym in all disease vocabulary of acquisition, such as After Cataract possesses not Name aftercataract and secondary cataract, they are required for being integrated into a disease.Herein for each disease word of discovery The synonym of remittance, mathematic vector is abstracted into using word2vec algorithms by each disease name, reuses the distance of mathematic vector Calculate synonym.
In step (4), in order to more understand the relation between each disease, all extracted in all disease names and finished Afterwards, disease name is clustered, obtains the classification relation between disease.Present invention employs the distribution of implicit Di Li Crays (Latent Dirichlet Allocation, LDA) carries out disease cluster.LDA is a kind of document subject matter generation model, also referred to as It is three layers of bayesian probability model, comprising word, theme and document three-decker.
The inventive method be mainly analysis physical examination data in doctor to the diagnostic result of patient's physical examination data, find out therein Disease name, and cluster search is carried out, the classification tree of disease is set up, so as to original diagnostic result is converted into disease label. The method have the advantage that for:
(1) new word discovery is carried out to medical examination diagnostic result using PMI, it can be found that many undiscovered new disease words Converge.
(2) for a large amount of disease vocabulary, synonym therein is found using word2vec algorithms, in standardization disease name While decrease the word of redundancy.
(3) cluster of disease is carried out using LDA, the contact between the disease of each classification can be easily obtained.
Brief description of the drawings
Fig. 1 is the disease tag system structure chart based on physical examination diagnostic data;
Fig. 2 is that synonym finds algorithm word2vec schematic diagrames;
Fig. 3 is local disease's label classification tree exemplary plot.
Specific embodiment
In order to more specifically describe the present invention, below in conjunction with the accompanying drawings and specific embodiment is to technical scheme It is described in detail.
As shown in figure 1, the present invention comprises the following steps the method that physical examination diagnostic data is converted into disease label:
Step 1, text participle:Participle is carried out to diagnostic result primary in physical examination diagnostic data, text word sequence is obtained.
Text participle needs dictionary to obtain accuracy rate higher, and being collected from outside corpus for selecting here is obtained Basic dictionary combines related medical dictionary.For diagnostic result primary in physical examination data, the Trie trees of dictionary creation are used Participle matching is carried out, further index is done to the result of its participle matching and is counted, obtain maximum probability word segmentation result, referred to as Text word sequence.
Trie trees, also known as word lookup tree, are a kind of tree structures, for preserving substantial amounts of character string.Its advantage is: Memory space is saved using the common prefix of character string.The speed of participle can be increased substantially using Trie trees participle.It is right There can be various participle modes often in same character string, the maximum probability participle of character string is solved used here as Dynamic Programming As a result.
Step 2, new word discovery:Word merging is carried out according to the relation between word in text word sequence, new term is obtained.
In this step, each statistical information after being terminated according to step 1 participle, emphasis is divided adjacent word Analysis.The relation between word is judged using PMI indexs, for adjacent word, the probability for P (thyroid nodule) such as occur is long-range When the probability of P (thyroid gland) P (tubercle), it is believed that thyroid nodule is the neologisms in a corpus.
Step 3, text participle:Participle is carried out again to new term, word sequence is obtained.
The method that this step carries out participle is identical with the segmenting method in step 1, is all using the Trie trees of dictionary creation Participle matching is carried out, the maximum probability word segmentation result of character string is solved using Dynamic Programming.
Step 4, disease is extracted:Required disease vocabulary is extracted in word sequence, disease name is obtained.
The disease name in word sequence is extracted used here as statistical method TF-IDF, and combines deactivation dictionary, it is basic to obtain The label of physical examination diagnostic result text, as disease name.For each diagnostic text data, TF-IDF can find out wherein Occur more and occur and few word inside other diagnostic text data.
Step 5, synonymous conflation of words:Merger is carried out to the synonym in disease name, the disease name of merger is obtained;
Herein, each disease name is abstracted into the vector of mathematics using word2vec algorithms, mathematic vector is reused Distance calculates synonym.As shown in Fig. 2 CBOW and Skip-gram are two kinds of realizations of word2vec algorithms, wherein, w (t) is Refer to a word of optional position, and w (t-1) is then the previous word of its correspondence position, w (t+1) is the latter of its correspondence position Individual word, CBOW algorithms are to can be inferred that current word according to former and later two words of a word, and Skip-gram algorithms are according to one Individual word, can be inferred that the possibility environment of its context.The basic thought of word2vec algorithms is reflected each word by training K dimension real numbers vector (K is the hyper parameter in model) are penetrated into, by the distance between word (such as cosine similarities, Euclidean distance Deng) judge the semantic similarity between them.It using one three layers of neutral net, input layer-hidden layer-output layer, this Individual three-layer neural network is in itself that language model is modeled, but also obtains a kind of table of word in vector space simultaneously Show, and this side effect is only the real target of Word2vec.When obtain each word term vector after, can according to term vector it Between distance find a synonym for word.
Step 6, disease cluster:Disease is clustered, disease classification tree is set up;
Disease cluster is carried out using LDA in this step, to obtain the classification relation between disease, classification tree is set up, Fig. 3 is The Local map of the classification tree for establishing.For every document in corpus, LDA defines following generating process:
(1) to each document, a theme is extracted from theme distribution;
(2) a word is extracted from the word distribution corresponding to the above-mentioned theme being pumped to;
(3) said process is repeated, until each word in traversal document.
Assuming that after such document structure tree mode, ask for the parameters of this model, so as to obtain each theme and Word distribution inside theme.For physical examination diagnostic data base, the word inside each theme is the title of disease, and each Theme is exactly a disease for classification.
Step 7, tag extraction:Disease name and disease classification tree according to merger carry out disease marker, obtain every individuality The inspection unique disease label of text.
Above-described specific embodiment has been described in detail to technical scheme and beneficial effect, Ying Li Solution is to the foregoing is only presently most preferred embodiment of the invention, is not intended to limit the invention, all in principle model of the invention Interior done any modification, supplement and equivalent etc. are enclosed, be should be included within the scope of the present invention.

Claims (8)

1. a kind of method that physical examination diagnostic data is converted into disease label, comprises the following steps:
(1) text participle and new word discovery treatment are carried out to physical examination diagnostic data, word sequence is obtained;
(2) in word sequence, the disease vocabulary required for extracting obtains disease name;
(3) merger is carried out to the synonym in disease name, obtains the disease name of merger;
(4) disease name is clustered, sets up disease classification tree;
(5) disease name and disease classification tree according to merger carries out disease marker, obtains disease label.
2. physical examination diagnostic data is converted into the method for disease label according to claim 1, it is characterised in that:Step (1) Concretely comprise the following steps:
(1-1) carries out participle to diagnostic result primary in physical examination diagnostic data, obtains text word sequence;
(1-2) carries out word merging according to the relation between word in text word sequence, obtains new term;
(1-3) carries out participle again to new term, obtains word sequence.
3. physical examination diagnostic data is converted into the method for disease label according to claim 2, it is characterised in that:In step In (1-1), for diagnostic result primary in physical examination diagnostic data, first, participle matching is carried out using Trie trees plus dictionary, so Afterwards, further index is done to the result of its participle matching and is counted, obtain maximum probability word segmentation result, referred to as text word order Row.
4. physical examination diagnostic data is converted into the method for disease label according to claim 3, it is characterised in that:Using dynamic The maximum probability word segmentation result of programming evaluation character string.
5. physical examination diagnostic data is converted into the method for disease label according to claim 2, it is characterised in that:In step In (1-2), the relation between word is judged by counting point mutual information index.
6. physical examination diagnostic data is converted into the method for disease label according to claim 1, it is characterised in that:In step (2) in, the disease name in word sequence is extracted using statistical method TF-IDF, and combines deactivation dictionary, obtain physical examination diagnosis knot The disease name of fruit text.
7. physical examination diagnostic data is converted into the method for disease label according to claim 1, it is characterised in that:In step (3) in, each disease name is abstracted into by mathematic vector using word2vec algorithms, the distance for reusing mathematic vector calculates same Adopted word.
8. physical examination diagnostic data is converted into the method for disease label according to claim 1, it is characterised in that:In step (4) in, disease cluster is carried out using the distribution of implicit Di Li Crays.
CN201611198280.7A 2016-12-22 2016-12-22 A method of disease label is converted by physical examination diagnostic data Active CN106682411B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201611198280.7A CN106682411B (en) 2016-12-22 2016-12-22 A method of disease label is converted by physical examination diagnostic data

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201611198280.7A CN106682411B (en) 2016-12-22 2016-12-22 A method of disease label is converted by physical examination diagnostic data

Publications (2)

Publication Number Publication Date
CN106682411A true CN106682411A (en) 2017-05-17
CN106682411B CN106682411B (en) 2019-04-16

Family

ID=58871285

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201611198280.7A Active CN106682411B (en) 2016-12-22 2016-12-22 A method of disease label is converted by physical examination diagnostic data

Country Status (1)

Country Link
CN (1) CN106682411B (en)

Cited By (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107515908A (en) * 2017-08-11 2017-12-26 新智数通(北京)技术服务有限公司 A kind of data processing method and device
CN107894976A (en) * 2017-10-12 2018-04-10 北京知道未来信息技术有限公司 A kind of mixing language material segmenting method based on Bi LSTM
CN107977354A (en) * 2017-10-12 2018-05-01 北京知道未来信息技术有限公司 A kind of mixing language material segmenting method based on Bi-LSTM-CNN
CN107992467A (en) * 2017-10-12 2018-05-04 北京知道未来信息技术有限公司 A kind of mixing language material segmenting method based on LSTM
CN109243618A (en) * 2018-09-12 2019-01-18 腾讯科技(深圳)有限公司 Construction method, disease label construction method and the smart machine of medical model
CN109582675A (en) * 2018-11-29 2019-04-05 北京达佳互联信息技术有限公司 Tag match method, apparatus, server and storage medium
CN109994201A (en) * 2019-03-18 2019-07-09 浙江大学 A kind of diabetes based on deep learning and hypertension method for calculating probability
CN110297933A (en) * 2019-07-01 2019-10-01 山东浪潮人工智能研究院有限公司 A kind of theme label recommended method and tool based on deep learning
CN110299206A (en) * 2018-03-21 2019-10-01 华东师范大学 A kind of more contents imply Di Li Cray distributed model and interpretation of the cause, onset and process of an illness method for digging is implied in Chinese medicine case
CN110335684A (en) * 2019-06-14 2019-10-15 电子科技大学 The intelligent dialectical aid decision-making method of Chinese medicine based on topic model technology
CN110580942A (en) * 2019-08-23 2019-12-17 和宇健康科技股份有限公司 novel physical examination report generation method, device, medium and terminal equipment
WO2020124856A1 (en) * 2018-12-18 2020-06-25 众安信息技术服务有限公司 Diagnosis standardization method and device based on word vectors
CN112002416A (en) * 2020-08-23 2020-11-27 吾征智能技术(北京)有限公司 Disease symptom prediction system based on urine character self-learning
CN112735475A (en) * 2020-12-25 2021-04-30 北京博瑞彤芸科技股份有限公司 Method and system for searching disease knowledge through voice
WO2021114632A1 (en) * 2020-05-13 2021-06-17 平安科技(深圳)有限公司 Disease name standardization method, apparatus, device, and storage medium
CN112992376A (en) * 2021-03-04 2021-06-18 山东大学 Disease name matching method and system based on weight adjustment
WO2021155678A1 (en) * 2020-02-03 2021-08-12 腾讯科技(深圳)有限公司 Label marking method and apparatus, and device and readable storage medium
CN113838579A (en) * 2021-09-29 2021-12-24 平安医疗健康管理股份有限公司 Medical data anomaly detection method, device, equipment and storage medium
CN114822865A (en) * 2022-06-27 2022-07-29 天津幸福生命科技有限公司 Diagnostic data identification method and device, electronic equipment and storage medium

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102662976A (en) * 2012-03-12 2012-09-12 浙江工业大学 Text feature weighting method based on supervision
CN102760134A (en) * 2011-04-28 2012-10-31 北京百度网讯科技有限公司 Method and device for mining synonyms
CN102915493A (en) * 2011-08-05 2013-02-06 株式会社东芝 Information processing apparatus and method
CN102930009A (en) * 2012-10-30 2013-02-13 北京奇虎科技有限公司 Individual website navigation system
CN103020212A (en) * 2012-12-07 2013-04-03 合一网络技术(北京)有限公司 Method and device for finding hot videos based on user query logs in real time
CN103559178A (en) * 2013-05-31 2014-02-05 武汉中文百科网络有限公司 System and method for switching between simplified Chinese characters and traditional Chinese characters on Internet
CN103902570A (en) * 2012-12-27 2014-07-02 腾讯科技(深圳)有限公司 Text classification feature extraction method, classification method and device
CN104424177A (en) * 2013-08-26 2015-03-18 高德软件有限公司 Method and device for extracting core words
CN104598642A (en) * 2015-02-13 2015-05-06 杜雨阳 Standard disease name checking method and system

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102760134A (en) * 2011-04-28 2012-10-31 北京百度网讯科技有限公司 Method and device for mining synonyms
CN102915493A (en) * 2011-08-05 2013-02-06 株式会社东芝 Information processing apparatus and method
CN102662976A (en) * 2012-03-12 2012-09-12 浙江工业大学 Text feature weighting method based on supervision
CN102930009A (en) * 2012-10-30 2013-02-13 北京奇虎科技有限公司 Individual website navigation system
CN103020212A (en) * 2012-12-07 2013-04-03 合一网络技术(北京)有限公司 Method and device for finding hot videos based on user query logs in real time
CN103902570A (en) * 2012-12-27 2014-07-02 腾讯科技(深圳)有限公司 Text classification feature extraction method, classification method and device
CN103559178A (en) * 2013-05-31 2014-02-05 武汉中文百科网络有限公司 System and method for switching between simplified Chinese characters and traditional Chinese characters on Internet
CN104424177A (en) * 2013-08-26 2015-03-18 高德软件有限公司 Method and device for extracting core words
CN104598642A (en) * 2015-02-13 2015-05-06 杜雨阳 Standard disease name checking method and system

Cited By (22)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107515908A (en) * 2017-08-11 2017-12-26 新智数通(北京)技术服务有限公司 A kind of data processing method and device
CN107894976A (en) * 2017-10-12 2018-04-10 北京知道未来信息技术有限公司 A kind of mixing language material segmenting method based on Bi LSTM
CN107977354A (en) * 2017-10-12 2018-05-01 北京知道未来信息技术有限公司 A kind of mixing language material segmenting method based on Bi-LSTM-CNN
CN107992467A (en) * 2017-10-12 2018-05-04 北京知道未来信息技术有限公司 A kind of mixing language material segmenting method based on LSTM
CN110299206A (en) * 2018-03-21 2019-10-01 华东师范大学 A kind of more contents imply Di Li Cray distributed model and interpretation of the cause, onset and process of an illness method for digging is implied in Chinese medicine case
CN109243618B (en) * 2018-09-12 2020-06-16 腾讯科技(深圳)有限公司 Medical model construction method, disease label construction method and intelligent device
CN109243618A (en) * 2018-09-12 2019-01-18 腾讯科技(深圳)有限公司 Construction method, disease label construction method and the smart machine of medical model
CN109582675A (en) * 2018-11-29 2019-04-05 北京达佳互联信息技术有限公司 Tag match method, apparatus, server and storage medium
WO2020124856A1 (en) * 2018-12-18 2020-06-25 众安信息技术服务有限公司 Diagnosis standardization method and device based on word vectors
CN109994201B (en) * 2019-03-18 2021-06-11 浙江大学 Diabetes and hypertension probability calculation system based on deep learning
CN109994201A (en) * 2019-03-18 2019-07-09 浙江大学 A kind of diabetes based on deep learning and hypertension method for calculating probability
CN110335684A (en) * 2019-06-14 2019-10-15 电子科技大学 The intelligent dialectical aid decision-making method of Chinese medicine based on topic model technology
CN110297933A (en) * 2019-07-01 2019-10-01 山东浪潮人工智能研究院有限公司 A kind of theme label recommended method and tool based on deep learning
CN110580942A (en) * 2019-08-23 2019-12-17 和宇健康科技股份有限公司 novel physical examination report generation method, device, medium and terminal equipment
WO2021155678A1 (en) * 2020-02-03 2021-08-12 腾讯科技(深圳)有限公司 Label marking method and apparatus, and device and readable storage medium
WO2021114632A1 (en) * 2020-05-13 2021-06-17 平安科技(深圳)有限公司 Disease name standardization method, apparatus, device, and storage medium
CN112002416A (en) * 2020-08-23 2020-11-27 吾征智能技术(北京)有限公司 Disease symptom prediction system based on urine character self-learning
CN112735475A (en) * 2020-12-25 2021-04-30 北京博瑞彤芸科技股份有限公司 Method and system for searching disease knowledge through voice
CN112735475B (en) * 2020-12-25 2023-02-21 北京博瑞彤芸科技股份有限公司 Method and system for searching disease knowledge through voice
CN112992376A (en) * 2021-03-04 2021-06-18 山东大学 Disease name matching method and system based on weight adjustment
CN113838579A (en) * 2021-09-29 2021-12-24 平安医疗健康管理股份有限公司 Medical data anomaly detection method, device, equipment and storage medium
CN114822865A (en) * 2022-06-27 2022-07-29 天津幸福生命科技有限公司 Diagnostic data identification method and device, electronic equipment and storage medium

Also Published As

Publication number Publication date
CN106682411B (en) 2019-04-16

Similar Documents

Publication Publication Date Title
CN106682411A (en) Method for converting physical examination diagnostic data into disease label
CN111414393B (en) Semantic similar case retrieval method and equipment based on medical knowledge graph
CN109378053B (en) Knowledge graph construction method for medical image
Qayyum et al. Medical image retrieval using deep convolutional neural network
CN106776711B (en) Chinese medical knowledge map construction method based on deep learning
Xiong et al. Reinforced transformer for medical image captioning
CN107480125B (en) Relation linking method based on knowledge graph
CN108628824A (en) A kind of entity recognition method based on Chinese electronic health record
CN105404632B (en) System and method for carrying out serialized annotation on biomedical text based on deep neural network
CN107679110A (en) The method and device of knowledge mapping is improved with reference to text classification and picture attribute extraction
CN112232065A (en) Method and device for mining synonyms
CN111191456B (en) Method for identifying text segments by using sequence labels
CN116860987A (en) Domain knowledge graph construction method and system based on generation type large language model
Leng et al. Bi-level artificial intelligence model for risk classification of acute respiratory diseases based on Chinese clinical data
Saranya et al. Intelligent medical data storage system using machine learning approach
Hayat et al. Modeling subjective affect annotations with multi-task learning
CN113903422A (en) Medical image diagnosis report entity extraction method, device and equipment
CN113343680A (en) Structured information extraction method based on multi-type case history texts
Maulana et al. The Scientific Progress and Prospects of Artificial Intelligence for Cancer Detection: A Bibliometric Analysis
CN108984699A (en) Merge the drug poisonous substance adverse reaction intelligent answer method of multichannel text feature
WO2016090625A1 (en) Scalable web data extraction
CN115831380A (en) Intelligent medical data management system and method based on medical knowledge graph
CN111597330A (en) Intelligent expert recommendation-oriented user image drawing method based on support vector machine
Abu et al. Biodiversity image retrieval framework for monogeneans
Süleyman Medical data analysis for different data types

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant