CN112331352A - Intelligent information matching system based on dengue fever - Google Patents
Intelligent information matching system based on dengue fever Download PDFInfo
- Publication number
- CN112331352A CN112331352A CN202011219107.7A CN202011219107A CN112331352A CN 112331352 A CN112331352 A CN 112331352A CN 202011219107 A CN202011219107 A CN 202011219107A CN 112331352 A CN112331352 A CN 112331352A
- Authority
- CN
- China
- Prior art keywords
- dengue
- vector set
- information
- conversion rule
- matched
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 206010012310 Dengue fever Diseases 0.000 title claims abstract description 97
- 208000001490 Dengue Diseases 0.000 title claims abstract description 96
- 208000025729 dengue disease Diseases 0.000 title claims abstract description 96
- 239000013598 vector Substances 0.000 claims abstract description 80
- 238000006243 chemical reaction Methods 0.000 claims abstract description 49
- 238000004364 calculation method Methods 0.000 claims abstract description 23
- 238000000034 method Methods 0.000 claims abstract description 9
- 238000010586 diagram Methods 0.000 description 11
- 201000010099 disease Diseases 0.000 description 9
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 9
- 210000002966 serum Anatomy 0.000 description 8
- 208000010201 Exanthema Diseases 0.000 description 7
- 201000005884 exanthem Diseases 0.000 description 7
- 230000014509 gene expression Effects 0.000 description 7
- 206010037844 rash Diseases 0.000 description 7
- 208000032843 Hemorrhage Diseases 0.000 description 5
- 206010037660 Pyrexia Diseases 0.000 description 5
- 230000001154 acute effect Effects 0.000 description 5
- 206010018999 Haemorrhage subcutaneous Diseases 0.000 description 4
- 210000000056 organ Anatomy 0.000 description 4
- 208000000112 Myalgia Diseases 0.000 description 3
- 230000000740 bleeding effect Effects 0.000 description 3
- 238000000605 extraction Methods 0.000 description 3
- 208000006820 Arthralgia Diseases 0.000 description 2
- 206010063659 Aversion Diseases 0.000 description 2
- 206010006002 Bone pain Diseases 0.000 description 2
- 208000014644 Brain disease Diseases 0.000 description 2
- 206010051625 Conjunctival hyperaemia Diseases 0.000 description 2
- 241000725619 Dengue virus Species 0.000 description 2
- 241000255925 Diptera Species 0.000 description 2
- 208000032274 Encephalopathy Diseases 0.000 description 2
- 206010015958 Eye pain Diseases 0.000 description 2
- 206010018276 Gingival bleeding Diseases 0.000 description 2
- 206010062713 Haemorrhagic diathesis Diseases 0.000 description 2
- 208000000616 Hemoptysis Diseases 0.000 description 2
- 206010019842 Hepatomegaly Diseases 0.000 description 2
- 206010028813 Nausea Diseases 0.000 description 2
- 208000003251 Pruritus Diseases 0.000 description 2
- 206010037870 Rash morbilliform Diseases 0.000 description 2
- 206010037890 Rash scarlatiniform Diseases 0.000 description 2
- 208000009714 Severe Dengue Diseases 0.000 description 2
- 208000032023 Signs and Symptoms Diseases 0.000 description 2
- 206010040844 Skin exfoliation Diseases 0.000 description 2
- 206010046788 Uterine haemorrhage Diseases 0.000 description 2
- 206010046910 Vaginal haemorrhage Diseases 0.000 description 2
- 206010047700 Vomiting Diseases 0.000 description 2
- 239000000427 antigen Substances 0.000 description 2
- 102000036639 antigens Human genes 0.000 description 2
- 108091007433 antigens Proteins 0.000 description 2
- 238000011888 autopsy Methods 0.000 description 2
- 230000002902 bimodal effect Effects 0.000 description 2
- 210000004027 cell Anatomy 0.000 description 2
- 238000003759 clinical diagnosis Methods 0.000 description 2
- 230000035618 desquamation Effects 0.000 description 2
- 206010014599 encephalitis Diseases 0.000 description 2
- 208000001780 epistaxis Diseases 0.000 description 2
- 210000003743 erythrocyte Anatomy 0.000 description 2
- 206010016256 fatigue Diseases 0.000 description 2
- 208000006750 hematuria Diseases 0.000 description 2
- 208000031169 hemorrhagic disease Diseases 0.000 description 2
- 210000000265 leukocyte Anatomy 0.000 description 2
- 210000004698 lymphocyte Anatomy 0.000 description 2
- 230000003387 muscular Effects 0.000 description 2
- 230000008693 nausea Effects 0.000 description 2
- 210000005259 peripheral blood Anatomy 0.000 description 2
- 239000011886 peripheral blood Substances 0.000 description 2
- 210000002381 plasma Anatomy 0.000 description 2
- 208000018316 severe headache Diseases 0.000 description 2
- 230000035939 shock Effects 0.000 description 2
- 208000024891 symptom Diseases 0.000 description 2
- 238000012360 testing method Methods 0.000 description 2
- 206010043554 thrombocytopenia Diseases 0.000 description 2
- 230000008673 vomiting Effects 0.000 description 2
- 241000256118 Aedes aegypti Species 0.000 description 1
- 241000256173 Aedes albopictus Species 0.000 description 1
- 238000012935 Averaging Methods 0.000 description 1
- 208000035473 Communicable disease Diseases 0.000 description 1
- 208000012671 Gastrointestinal haemorrhages Diseases 0.000 description 1
- 206010019233 Headaches Diseases 0.000 description 1
- 241000700605 Viruses Species 0.000 description 1
- 208000022531 anorexia Diseases 0.000 description 1
- 238000013473 artificial intelligence Methods 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 208000034158 bleeding Diseases 0.000 description 1
- 238000004159 blood analysis Methods 0.000 description 1
- 238000009534 blood test Methods 0.000 description 1
- 230000019771 cognition Effects 0.000 description 1
- 230000007423 decrease Effects 0.000 description 1
- 206010061428 decreased appetite Diseases 0.000 description 1
- 201000002950 dengue hemorrhagic fever Diseases 0.000 description 1
- 201000009892 dengue shock syndrome Diseases 0.000 description 1
- 238000003745 diagnosis Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000011010 flushing procedure Methods 0.000 description 1
- 208000030304 gastrointestinal bleeding Diseases 0.000 description 1
- 230000007160 gastrointestinal dysfunction Effects 0.000 description 1
- 208000011759 gum bleeding Diseases 0.000 description 1
- 231100000869 headache Toxicity 0.000 description 1
- 208000015181 infectious disease Diseases 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000003757 reverse transcription PCR Methods 0.000 description 1
- 230000011218 segmentation Effects 0.000 description 1
- 230000000405 serological effect Effects 0.000 description 1
- 230000009885 systemic effect Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16H—HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
- G16H50/00—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
- G16H50/70—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for mining of medical data, e.g. analysing previous cases of other patients
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/22—Matching criteria, e.g. proximity measures
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/205—Parsing
- G06F40/216—Parsing using statistical methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/279—Recognition of textual entities
- G06F40/289—Phrasal analysis, e.g. finite state techniques or chunking
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/30—Semantic analysis
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16H—HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
- G16H10/00—ICT specially adapted for the handling or processing of patient-related medical or healthcare data
- G16H10/20—ICT specially adapted for the handling or processing of patient-related medical or healthcare data for electronic clinical trials or questionnaires
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16H—HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
- G16H50/00—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
- G16H50/80—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for detecting, monitoring or modelling epidemics or pandemics, e.g. flu
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02A—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
- Y02A90/00—Technologies having an indirect contribution to adaptation to climate change
- Y02A90/10—Information and communication technologies [ICT] supporting adaptation to climate change, e.g. for weather forecasting or climate simulation
Landscapes
- Engineering & Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Theoretical Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Public Health (AREA)
- Physics & Mathematics (AREA)
- Medical Informatics (AREA)
- Data Mining & Analysis (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Artificial Intelligence (AREA)
- Primary Health Care (AREA)
- Epidemiology (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- Pathology (AREA)
- Databases & Information Systems (AREA)
- Biomedical Technology (AREA)
- Evolutionary Computation (AREA)
- Evolutionary Biology (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Bioinformatics & Computational Biology (AREA)
- Probability & Statistics with Applications (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The invention provides an intelligent matching system for information based on dengue fever. The method comprises the following steps: the acquisition module is used for acquiring a dengue characteristic information text and establishing a first characteristic vector set according to the dengue characteristic information text; the acquisition module is used for acquiring a characteristic information text of the dengue to be matched, extracting characteristic information to be matched in the characteristic information text of the dengue to be matched through TF-IDF, and establishing a second characteristic vector set; the calculation module is used for setting an original meaning conversion rule and a symbol conversion rule, converting the first feature vector set and the second feature vector set according to the original meaning conversion rule and the symbol conversion rule and calculating the similarity between the two converted feature vector sets; and the weighting module is used for carrying out weighted summation according to the similarity, obtaining the result of the weighted summation and generating a corresponding matching report according to the result. The invention greatly improves the accuracy of intelligent information matching and simultaneously improves the speed of information matching through the TF-IDF algorithm and weighted summation.
Description
Technical Field
The invention relates to the field of artificial intelligence, in particular to an intelligent matching system for information based on dengue fever.
Background
Dengue Fever (DF) is an acute infectious disease caused by dengue virus, primarily transmitted by aedes aegypti and aedes albopictus, with the major clinical manifestations of fever, rash, headache, myalgia, anorexia, gastrointestinal dysfunction and systemic failure. In addition, dengue virus can cause dengue hemorrhagic fever and dengue shock syndrome, and the latter two types of clinical symptoms are critical and have higher fatality rate.
Dengue fever is generally diagnosed through routine and biochemical examination, serological examination, virus isolation, reverse transcription polymerase chain reaction, and related symptoms and epidemic history, and the diagnosis depends on the judgment of a clinician, which not only has large workload, but also may cause certain errors, so that the improvement of the existing method is urgently needed.
The above is only for the purpose of assisting understanding of the technical aspects of the present invention, and does not represent an admission that the above is prior art.
Disclosure of Invention
In view of this, the invention provides an intelligent matching system for information based on dengue fever, and aims to solve the technical problem that the accuracy of intelligent matching of information cannot be improved by acquiring feature information for multiple times and weighting in the prior art.
The technical scheme of the invention is realized as follows:
in one aspect, the present invention provides an intelligent matching system for dengue fever-based information, comprising:
the acquisition module is used for acquiring a dengue characteristic information text and establishing a first characteristic vector set according to the dengue characteristic information text;
the acquisition module is used for acquiring a characteristic information text of the dengue to be matched, extracting characteristic information to be matched in the characteristic information text of the dengue to be matched through TF-IDF, and establishing a second characteristic vector set according to the characteristic information to be matched;
the calculation module is used for setting and defining an original conversion rule and a symbol conversion rule, converting the first feature vector set and the second feature vector set according to the original conversion rule and the symbol conversion rule, and calculating the similarity between the two converted feature vector sets;
and the weighting module is used for carrying out weighted summation according to the similarity, obtaining the result of the weighted summation and generating a corresponding matching report according to the result.
On the basis of the above technical solution, preferably, the obtaining module includes a first feature vector set establishing module, configured to obtain a dengue feature information text, where the dengue feature information text includes: the method comprises the steps of establishing a first feature vector set according to epidemiological history feature information, clinical sign feature information and etiological feature information, and storing the epidemiological history feature information and the etiological feature information into the first feature vector set.
On the basis of the above technical scheme, preferably, the acquisition module includes a second feature vector set establishment module, configured to establish a TF-IDF algorithm, set a word frequency range, acquire the feature information text of the dengue fever to be matched, calculate the word frequency of each word in the feature information text of the dengue fever to be matched through the TF-IDF algorithm, determine the feature information word of the feature information text of the dengue fever to be matched through the word frequency range, and establish a corresponding second feature vector set.
On the basis of the above technical solution, preferably, the calculation module includes a conversion module, configured to set an original conversion rule and a symbol conversion rule, and convert the first feature vector set and the second feature vector set according to the primitive conversion rule and the symbol conversion rule to obtain a first basic primitive description, other basic primitive descriptions, a relation primitive description, and a relation symbol description.
On the basis of the above technical solution, preferably, the calculation module includes a similarity calculation module, configured to calculate a similarity between the first feature vector set and the second feature vector set according to the first basic semantic description, the other basic semantic descriptions, the relational semantic description, and the relational symbol description.
On the basis of the above technical solution, preferably, the weighting module includes a weighted average module for setting a similarity threshold, obtaining a final similarity by weighted averaging the similarity, comparing the final similarity with the similarity threshold, and generating a corresponding matching report according to a comparison result.
Still further preferably, the dengue fever based information intelligent matching device comprises:
the acquisition unit is used for acquiring a dengue feature information text and establishing a first feature vector set according to the dengue feature information text;
the acquisition unit is used for acquiring a characteristic information text of the dengue to be matched, extracting characteristic information to be matched in the characteristic information text of the dengue to be matched through TF-IDF, and establishing a second characteristic vector set according to the characteristic information to be matched;
the calculation unit is used for setting an original conversion rule and a symbol conversion rule, converting the first feature vector set and the second feature vector set according to the original conversion rule and the symbol conversion rule, and calculating the similarity between the two converted feature vector sets;
and the weighting unit is used for carrying out weighted summation according to the similarity, obtaining the result of the weighted summation and generating a corresponding matching report according to the result.
Compared with the prior art, the intelligent matching system for information based on dengue has the following beneficial effects:
(1) by accurately extracting the characteristic information text of the dengue fever by using the TF-IDF, the accuracy of a subsequent information matching process is improved, and the time of the information matching process is saved;
(2) by utilizing the original conversion rule and the symbol conversion rule of the definition, the comparison between characters is converted into the comparison between vectors, the text information can be accurately matched, and the matching speed of the system is improved.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, it is obvious that the drawings in the following description are only some embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to the drawings without creative efforts.
FIG. 1 is a block diagram of a first embodiment of the intelligent matching system for dengue-based information of the present invention;
FIG. 2 is a block diagram of a second embodiment of the intelligent matching system for dengue-based information according to the present invention;
FIG. 3 is a block diagram illustrating a third exemplary embodiment of the intelligent matching system for dengue-based information;
FIG. 4 is a block diagram illustrating a fourth embodiment of the intelligent matching system for dengue-based information according to the present invention;
FIG. 5 is a block diagram illustrating a fifth embodiment of the intelligent matching system for dengue-based information according to the present invention;
FIG. 6 is a block diagram of the dengue fever based information intelligent matching device of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be obtained by a person skilled in the art without any inventive step based on the embodiments of the present invention, are within the scope of the present invention.
As shown in fig. 1, fig. 1 is a block diagram illustrating a first embodiment of an intelligent matching system for dengue fever based information according to the present invention. Wherein the intelligent matching system for information based on dengue fever comprises: the device comprises an acquisition module 10, an acquisition module 20, a calculation module 30 and a weighting module 40.
The acquiring module 10 is configured to acquire a dengue feature information text, and establish a first feature vector set according to the dengue feature information text;
the acquisition module 20 is used for acquiring a feature information text of dengue to be matched, extracting feature information to be matched in the feature information text of dengue to be matched through TF-IDF, and establishing a second feature vector set according to the feature information to be matched;
the calculation module 30 is configured to set an original conversion rule and a symbol conversion rule, convert the first feature vector set and the second feature vector set according to the original conversion rule and the symbol conversion rule, and calculate a similarity between the two converted feature vector sets;
and the weighting module 40 is configured to perform weighted summation according to the similarity, obtain a result of the weighted summation, and generate a corresponding matching report according to the result.
Further, as shown in fig. 2, a structural block diagram of a second embodiment of the intelligent matching system for dengue fever based information according to the foregoing embodiments is provided, in this embodiment, the obtaining module 10 further includes:
a first feature vector set establishing module 101, configured to obtain a dengue feature information text, where the dengue feature information text includes: the method comprises the steps of establishing a first feature vector set according to epidemiological history feature information, clinical sign feature information and etiological feature information, and storing the epidemiological history feature information and the etiological feature information into the first feature vector set.
It should be understood that, in this embodiment, the system first obtains a text of dengue fever feature information, where the text of dengue fever feature information includes: the method comprises the following steps of establishing a first feature vector set according to epidemiological history feature information, clinical sign feature information and etiological feature information, and storing the epidemiological history feature information and the etiological feature information into the first feature vector set, wherein the epidemiological history feature information, the clinical sign feature information and the etiological feature information comprise: epidemiological characteristic information: living in dengue fever epidemic areas or in areas of 15 days which are in the epidemic areas, and 5-9 days before the disease occurs, the people have a history of being bitten by mosquitoes; clinical performance characteristic information: the disease suddenly starts. Aversion to cold and fever (39-40 ℃ in 24-36 h. a few patients present with bimodal fever); fatigue, nausea, vomiting, etc.; with more severe headache, orbital pain, and muscular, joint, and skeletal pain; with flushing of the face, neck and chest, conjunctival congestion; superficial lymphadenectasis. In addition, rashes may occur: the disease course is 5-7 days, and the disease is manifested as various rashes (measles-like rashes, scarlet fever-like rashes), subcutaneous bleeding spots and the like. The rash is distributed on the limbs, trunk or head and face, and is mostly itchy without desquamation for 3-5 days; a small number of patients can present with encephalitis-like encephalopathy symptoms and signs; bleeding tendency (positive arm restraint test) is caused, and gingival bleeding, epistaxis, gastrointestinal bleeding, subcutaneous bleeding, hemoptysis, hematuria, vaginal bleeding or pleuroperitoneal bleeding generally occur within 5-8 d of the course of disease; multiple organ profuse hemorrhage; hepatomegaly; with shock, etc.; laboratory examination: peripheral blood examination: thrombocytopenia (100X 109/L of hypotony). The total number of white blood cells is reduced, and the differential count of lymphocytes and single child cells is relatively increased; the volume of the red blood cells is increased by more than 20 percent; a single serum specific IgG antibody positive; positive serum specific IgM antibody; the serum specific IgG antibody in the convalescent phase is increased by 4 times or more than that in the acute phase; DV is isolated from the serum, plasma, blood cell layer or autopsy organ of acute stage patient, or DV antigen is detected.
Further, as shown in fig. 3, a structural block diagram of a third embodiment of the intelligent matching system for dengue fever based information according to the foregoing embodiments is provided, in this embodiment, the acquiring module 20 further includes:
the second feature vector set establishing module 201 is configured to establish a TF-IDF algorithm, set a word frequency range, acquire a feature information text of the dengue fever to be matched, calculate a word frequency of each word in the feature information text of the dengue fever to be matched through the TF-IDF algorithm, determine a feature information word of the feature information text of the dengue fever to be matched through the word frequency range, and establish a corresponding second feature vector set.
It should be understood that the system will establish a TF-IDF algorithm, set a word frequency range, simultaneously acquire a feature information text of the dengue to be matched, calculate the word frequency of each word in the feature information text of the dengue to be matched through the TF-IDF algorithm, determine the feature information word of the feature information text of the dengue to be matched through the word frequency range, and establish a corresponding second feature vector set, and it should be noted that since most of epidemiological history, clinical signs and the like to be diagnosed by a patient are descriptive texts, feature information extraction, that is, feature participle extraction, needs to be performed on the text. The characteristic items are selected mainly by using TF-IDF, characteristic information or characteristic word segmentation contained in an epidemiological history and a clinical sign text to be identified is acquired, and a vector set of the epidemiological history and the clinical sign characteristic information of Dengue Fever (DF) is established respectively so as to facilitate subsequent calculation and cognition.
Further, as shown in fig. 4, a block diagram of a fourth embodiment of the intelligent matching system for dengue fever based information according to the foregoing embodiments is provided, in this embodiment, the calculating module 30 includes:
the conversion module 301 is configured to set an original conversion rule and a symbol conversion rule, and convert the first feature vector set and the second feature vector set according to the primitive conversion rule and the symbol conversion rule to obtain a first basic primitive description, other basic primitive descriptions, a relationship primitive description, and a relationship symbol description.
A similarity calculation module 302, configured to calculate a similarity between the first feature vector set and the second feature vector set according to the first basic semantic description, the other basic semantic descriptions, the relationship semantic descriptions, and the relationship symbol descriptions.
It should be understood that, based on the result of the above feature information extraction, similarity calculation is performed on any type of feature information vector set (including epidemiological history, clinical signs, and etiological data) in the first feature information vector set and the second feature information vector set, respectively. Regarding the calculation of the similarity, several existing basic methods are based on a Vector (Vector), that is, the distance between two vectors is calculated, and the closer the distance is, the greater the similarity is. The specific calculation model and calculation method are as follows:
because the above-mentioned characteristic information representation (word or phrase) is not organized in a tree-like hierarchy, but a kind of network structure; concepts may thus be described by means of semaphores and symbols. For two entries w1(dengue epidemiological history to be screened, clinical sign trait information vector set) and w2(vector set of dengue epidemiological history, clinical sign criteria signature information), w1There are n characteristic information semantic items (concepts, participles or phrases): s11,s12,...,s1n,w2There is an m-feature information semantic (concept, word or phrase): s11,s12,...,s1mThen, w1And w2The similarity of (2) is the maximum value of the similarity of each feature information semantic item (concept, word or phrase), that is:
it should be understood that, in the above calculation model, in order to calculate the semantic similarity between them more accurately, we can express their description as a feature structure containing the following four features:
the first basic semantic description: the value is a basic meaning, and the similarity of the parts of the two concepts is marked as Sim 1(s)1,s2);
Other basic meanings describe: corresponding to all the basic semantic meaning description expressions except the first basic semantic meaning description expression in the semantic expression, the value of the basic semantic meaning description expression is a set of basic semantic meanings, and the similarity of the part of the two concepts is marked as Sim 2(s)1,s2);
Description of relationship semantics: corresponding to all relation-meaning descriptors in the semantic expression, the value of the relation-meaning descriptor is a characteristic structure, and for each characteristic of the characteristic structure, the attribute of the relation-meaning descriptor is a relation-meaning, and the value of the relation-meaning descriptor is a basic meaning or a specific word. The similarity of this part of the two concepts is denoted as Sim 3(s)1,s2);
The relationship notation describes: corresponding to all relational symbolic descriptors in the semantic expression, the value of which is also a feature structure, for each feature of which the attribute is a relational primitive, the attribute of which is a relational primitiveA value is a set, the elements of which are a basic semantic, or a specific word. The similarity of this part of the two concepts is denoted as Sim 4(s)1,s2);
It can be seen that, because the hierarchies of the respective meanings are different, the influence degrees of the respective meanings on the word similarity are different, that is, the weight of the partial similarity in the overall similarity is different, and the weight (percentage) is represented by β, so the overall similarity of the concepts can be expressed as:
wherein β (1. ltoreq. i. ltoreq.4) is an adjustable parameter and has: beta is a1+β2+β3+β4=1,β1≥β2≥β3≥β4. The latter reflects Sim1(s1,s2) To Sim4(s1,s2) The effect on the overall similarity decreases in turn. Since the first independent semantic expression reflects the most important feature of a concept, the weight should be defined to be relatively large, generally above 0.5, and in the above calculation, when finally calculating the weighted average, all parts take equal weight. Thus, the similarity problem between two words is reduced to the similarity problem between two concepts.
Further, as shown in fig. 5, a block diagram of a fifth embodiment of the intelligent matching system for dengue fever based information according to the foregoing embodiments is provided, in this embodiment, the weighting module 40 includes:
the weighted average module 401 is configured to set a similarity threshold, obtain a final similarity by performing weighted average on the similarity, compare the final similarity with the similarity threshold, and generate a corresponding matching report according to a comparison result.
It should be understood that, finally, the system sets a similarity threshold, obtains a final similarity by performing weighted average on the similarities, compares the final similarity with the similarity threshold, and generates a corresponding matching report according to a comparison result, for example: 1. epidemiological data: living in dengue fever epidemic areas or in areas of 15 days which are in the epidemic areas, and 5-9 days before the disease occurs, the people have a history of being bitten by mosquitoes; 2. the clinical manifestations are as follows: 2.1 sudden onset of disease. Aversion to cold and fever (39-40 ℃ in 24-36 h. few patients present with bimodal fever). Fatigue, nausea, vomiting, etc.; 2.2 with more severe headache, orbital pain, and muscular, joint, and skeletal pain; 2.3 with flush of face, neck and chest, conjunctival congestion; 2.4 superficial lymphadenectasis; 2.5 rash: the disease course is 5-7 days, and the disease is manifested as various rashes (measles-like rashes, scarlet fever-like rashes), subcutaneous bleeding spots and the like. The rash is distributed on the limbs, trunk or head and face, and is mostly itchy without desquamation for 3-5 days; 2.6 a minority of patients can present with encephalitis-like encephalopathy symptoms and signs; 2.7 has bleeding tendency (positive arm restraint test), and generally gum bleeding, epistaxis, alimentary tract bleeding, subcutaneous bleeding, hemoptysis, hematuria, vaginal bleeding or pleuroperitoneal bleeding are performed in the course of 5-8 d; 2.8 multiple organ profuse hemorrhage; 2.9 hepatomegaly; 2.10 with shock; 3. laboratory examination: 3.1 peripheral blood test: thrombocytopenia (100X 109/L of hypotony). The total number of white blood cells is reduced, and the differential count of lymphocytes and single child cells is relatively increased; 3.2 the volume of the red blood cells is increased by more than 20 percent; 3.3 a single serum specific IgG antibody positive; 3.4 positive serum specific IgM antibody; 3.5 serum specific IgG antibody in convalescent phase is increased by 4 times or more than that in acute phase; 3.6 isolating DV from the serum, plasma, blood cell layer or autopsy organ of acute stage patient or detecting DV antigen; 4. case classification: 4.1 suspected cases: has at least one of 3.1, 3.2.1, 3.2.2, and 3.2.3 to 3.2.7; 4.2 clinical diagnosis cases: suspected cases plus 3.3.1 (dengue epidemics have been established) or plus 3.3.3 (sporadic cases or epidemics have not been established); 4.3 confirmed cases: dengue fever: any one of 3.3.4, 3.3.5 and 3.3.6 is added in clinical diagnosis cases.
The above description is only for illustrative purposes and does not limit the technical solutions of the present application in any way.
As can be easily found from the above description, the present embodiment provides an intelligent matching system for dengue-based information, which includes: the acquisition module is used for acquiring a dengue characteristic information text and establishing a first characteristic vector set according to the dengue characteristic information text; the acquisition module is used for acquiring a characteristic information text of the dengue to be matched, extracting characteristic information to be matched in the characteristic information text of the dengue to be matched through TF-IDF, and establishing a second characteristic vector set according to the characteristic information to be matched; the calculation module is used for setting and defining an original conversion rule and a symbol conversion rule, converting the first feature vector set and the second feature vector set according to the original conversion rule and the symbol conversion rule, and calculating the similarity between the two converted feature vector sets; and the weighting module is used for carrying out weighted summation according to the similarity, obtaining the result of the weighted summation and generating a corresponding matching report according to the result. According to the embodiment, the accuracy of intelligent information matching is greatly improved through the TF-IDF algorithm and the weighted summation, and meanwhile, the speed of information matching is improved.
In addition, the embodiment of the invention also provides information intelligent matching equipment based on dengue fever. As shown in fig. 6, the dengue-based information intelligent matching device includes: an acquisition unit 10, an acquisition unit 20, a calculation unit 30 and a weighting unit 40.
The acquiring unit 10 is configured to acquire a dengue feature information text, and establish a first feature vector set according to the dengue feature information text;
the acquisition unit 20 is used for acquiring a feature information text of dengue to be matched, extracting feature information to be matched in the feature information text of dengue to be matched through TF-IDF, and establishing a second feature vector set according to the feature information to be matched;
a calculating unit 30, configured to set an original conversion rule and a symbol conversion rule, convert the first feature vector set and the second feature vector set according to the original conversion rule and the symbol conversion rule, and calculate a similarity between the two converted feature vector sets;
and the weighting unit 40 is configured to perform weighted summation according to the similarity, obtain a result of the weighted summation, and generate a corresponding matching report according to the result.
In addition, it should be noted that the above-described embodiments of the apparatus are merely illustrative, and do not limit the scope of the present invention, and in practical applications, a person skilled in the art may select some or all of the modules to implement the purpose of the embodiments according to actual needs, and the present invention is not limited herein.
In addition, the technical details that are not elaborated in this embodiment may be referred to the intelligent matching system for dengue fever based information provided in any embodiment of the present invention, and are not described herein again.
The above description is only for the purpose of illustrating the preferred embodiments of the present invention and is not to be construed as limiting the invention, and any modifications, equivalents, improvements and the like that fall within the spirit and principle of the present invention are intended to be included therein.
Claims (7)
1. An intelligent matching system for dengue fever-based information, which is characterized by comprising:
the acquisition module is used for acquiring a dengue characteristic information text and establishing a first characteristic vector set according to the dengue characteristic information text;
the acquisition module is used for acquiring a characteristic information text of the dengue to be matched, extracting characteristic information to be matched in the characteristic information text of the dengue to be matched through TF-IDF, and establishing a second characteristic vector set according to the characteristic information to be matched;
the calculation module is used for setting and defining an original conversion rule and a symbol conversion rule, converting the first feature vector set and the second feature vector set according to the original conversion rule and the symbol conversion rule, and calculating the similarity between the two converted feature vector sets;
and the weighting module is used for carrying out weighted summation according to the similarity, obtaining the result of the weighted summation and generating a corresponding matching report according to the result.
2. The dengue-based information intelligent matching system of claim 1, wherein: the obtaining module comprises a first feature vector set establishing module and is used for obtaining a dengue feature information text, wherein the dengue feature information text comprises: the method comprises the steps of establishing a first feature vector set according to epidemiological history feature information, clinical sign feature information and etiological feature information, and storing the epidemiological history feature information and the etiological feature information into the first feature vector set.
3. The dengue-based information intelligent matching system of claim 2, wherein: the acquisition module comprises a second feature vector set establishment module for establishing a TF-IDF algorithm, setting a word frequency range, acquiring a characteristic information text of the dengue to be matched, calculating the word frequency of each word in the characteristic information text of the dengue to be matched through the TF-IDF algorithm, determining the characteristic information word of the characteristic information text of the dengue to be matched through the word frequency range, and establishing a corresponding second feature vector set.
4. The dengue-based information intelligent matching system of claim 3, wherein: the calculation module comprises a conversion module which is used for setting a definition original conversion rule and a symbol conversion rule, and converting the first characteristic vector set and the second characteristic vector set according to the semantic original conversion rule and the symbol conversion rule to obtain a first basic semantic description, other basic semantic descriptions, a relation semantic description and a relation symbol description.
5. The dengue-based information intelligent matching system of claim 4, wherein: the calculation module comprises a similarity calculation module used for calculating the similarity between the first feature vector set and the second feature vector set according to the first basic semantic description, the other basic semantic descriptions, the relation semantic descriptions and the relation symbol descriptions.
6. The dengue-based information intelligent matching system of claim 5, wherein: the weighting module comprises a weighted average module which is used for setting a similarity threshold, obtaining the final similarity by carrying out weighted average on the similarity, comparing the final similarity with the similarity threshold and generating a corresponding matching report according to the comparison result.
7. An intelligent matching device for information based on dengue fever, characterized in that the intelligent matching device for information based on dengue fever comprises:
the acquisition unit is used for acquiring a dengue feature information text and establishing a first feature vector set according to the dengue feature information text;
the acquisition unit is used for acquiring a characteristic information text of the dengue to be matched, extracting characteristic information to be matched in the characteristic information text of the dengue to be matched through TF-IDF, and establishing a second characteristic vector set according to the characteristic information to be matched;
the calculation unit is used for setting an original conversion rule and a symbol conversion rule, converting the first feature vector set and the second feature vector set according to the original conversion rule and the symbol conversion rule, and calculating the similarity between the two converted feature vector sets;
and the weighting unit is used for carrying out weighted summation according to the similarity, obtaining the result of the weighted summation and generating a corresponding matching report according to the result.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011219107.7A CN112331352A (en) | 2020-11-04 | 2020-11-04 | Intelligent information matching system based on dengue fever |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011219107.7A CN112331352A (en) | 2020-11-04 | 2020-11-04 | Intelligent information matching system based on dengue fever |
Publications (1)
Publication Number | Publication Date |
---|---|
CN112331352A true CN112331352A (en) | 2021-02-05 |
Family
ID=74316150
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202011219107.7A Pending CN112331352A (en) | 2020-11-04 | 2020-11-04 | Intelligent information matching system based on dengue fever |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN112331352A (en) |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110688493A (en) * | 2019-09-26 | 2020-01-14 | 京东方科技集团股份有限公司 | Incidence relation construction method and device and electronic equipment |
CN111653359A (en) * | 2020-05-30 | 2020-09-11 | 吾征智能技术(北京)有限公司 | Intelligent prediction model construction method and prediction system for hemorrhagic diseases |
CN111816321A (en) * | 2020-07-09 | 2020-10-23 | 武汉东湖大数据交易中心股份有限公司 | System, apparatus and storage medium for intelligent infectious disease identification based on legal diagnostic criteria |
-
2020
- 2020-11-04 CN CN202011219107.7A patent/CN112331352A/en active Pending
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110688493A (en) * | 2019-09-26 | 2020-01-14 | 京东方科技集团股份有限公司 | Incidence relation construction method and device and electronic equipment |
CN111653359A (en) * | 2020-05-30 | 2020-09-11 | 吾征智能技术(北京)有限公司 | Intelligent prediction model construction method and prediction system for hemorrhagic diseases |
CN111816321A (en) * | 2020-07-09 | 2020-10-23 | 武汉东湖大数据交易中心股份有限公司 | System, apparatus and storage medium for intelligent infectious disease identification based on legal diagnostic criteria |
Non-Patent Citations (1)
Title |
---|
李少波: "制造大数据技术与应用", vol. 1, 31 January 2018, 华中科技大学出版社, pages: 93 - 94 * |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN109036467B (en) | TF-LSTM-based CFFD extraction method, voice emotion recognition method and system | |
CN109949929A (en) | A kind of assistant diagnosis system based on the extensive case history of deep learning | |
CN106202543A (en) | Ontology Matching method and system based on machine learning | |
CN105138864B (en) | Protein interactive relation data base construction method based on Biomedical literature | |
CN111666350B (en) | Medical text relation extraction method based on BERT model | |
CN111814460B (en) | External knowledge-based drug interaction relation extraction method and system | |
CN112686040B (en) | Event reality detection method based on graph recurrent neural network | |
CN111061951A (en) | Recommendation model based on double-layer self-attention comment modeling | |
CN116304114B (en) | Intelligent data processing method and system based on surgical nursing | |
US11024329B2 (en) | Word repetition in separate conversations for detecting a sign of cognitive decline | |
CN111090724B (en) | Entity extraction method capable of judging relevance between text content and entity based on deep learning | |
TWI389100B (en) | Method for classifying speech emotion and method for establishing emotional semantic model thereof | |
CN112562725A (en) | Mixed voice emotion classification method based on spectrogram and capsule network | |
CN112331352A (en) | Intelligent information matching system based on dengue fever | |
CN113012776A (en) | Large-scale unbalanced diabetes electronic medical record parallel classification neighborhood evidence Spark method | |
CN112786192A (en) | Intelligent cognitive system, equipment and storage medium for hand-foot-and-mouth disease | |
CN116110594B (en) | Knowledge evaluation method and system of medical knowledge graph based on associated literature | |
Gulati | LungAI: A Deep Learning Convolutional Neural Network for Automated Detection of COVID-19 from Posteroanterior Chest X-Rays | |
CN116168824A (en) | Multi-modal mental disorder assessment method, computer device, and storage medium | |
Phursule et al. | Machine learning-based IoT-enabled perspective model for prediction of COVID-19 test in early stage | |
AU2021300304B2 (en) | Adaptive neural networks for analyzing medical images | |
RU2737598C1 (en) | Method of generating operational information based on formalized conceptual model of domain | |
Mat-Isa et al. | Classification of cervical cancer cells using HMLP network with confidence percentage and confidence level analysis | |
CN114817497A (en) | Mixed question-answering method based on intention recognition and template matching | |
Nematzadeh et al. | Interaction of word learning and semantic category formation in late talking |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |