CN107203686B - Medicine information difference processing method and system - Google Patents

Medicine information difference processing method and system Download PDF

Info

Publication number
CN107203686B
CN107203686B CN201710209408.3A CN201710209408A CN107203686B CN 107203686 B CN107203686 B CN 107203686B CN 201710209408 A CN201710209408 A CN 201710209408A CN 107203686 B CN107203686 B CN 107203686B
Authority
CN
China
Prior art keywords
difference
value
reference item
drug
similarity value
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201710209408.3A
Other languages
Chinese (zh)
Other versions
CN107203686A (en
Inventor
李登高
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Suzhou Iron Technology Co Ltd
Original Assignee
Suzhou Iron Information Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Suzhou Iron Information Technology Co ltd filed Critical Suzhou Iron Information Technology Co ltd
Priority to CN201710209408.3A priority Critical patent/CN107203686B/en
Publication of CN107203686A publication Critical patent/CN107203686A/en
Application granted granted Critical
Publication of CN107203686B publication Critical patent/CN107203686B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2455Query execution
    • G06F16/24564Applying rules; Deductive queries

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Medical Treatment And Welfare Office Work (AREA)

Abstract

The disclosure relates to a medicine information difference processing method and system. The method comprises the following steps: acquiring a matching pair consisting of drug record data and standard drug data; calculating and matching similarity values of all reference items of the drug record data and the standard drug data in the pair based on the drug information codes; calculating the difference value of each reference item based on the similarity value of each reference item and the judgment threshold value corresponding to each reference item; and carrying out weighted average on the difference values of the reference items to obtain a comprehensive matching score of the matching pair. The method and the device can effectively distinguish the matching pairs with approximate matching scores, so that the matching accuracy can be improved, and the matching efficiency is improved.

Description

Medicine information difference processing method and system
Technical Field
The present disclosure relates to the field of big data, and in particular, to a medicine information difference processing method, a medicine information difference processing system, an electronic device, and a computer-readable storage medium.
Background
With the development of internet technology, the medical industry is accelerating medical informatization construction, and drug information management is an important component of medical informatization construction.
At present, the medicine information data in the database used by the medical institution has a large data volume, for example, there are more than 17.2 thousands of medicine information in the medicine information data, each kind of medicine information also includes multiple reference system information, and because the language format of the medical institution personnel is complex and changeable, a large amount of nonstandard and non-uniform medicine information data exists in the medical information system. In the prior art, referring to fig. 1, in order to identify non-standard drug record data as standard drug data, m pieces of non-standard drug record data and n pieces of standard drug data need to form a matching pair one by one, and then whether the non-standard drug record data and the standard drug data in the matching pair are matched is determined by calculating a matching score of the non-standard drug record data and the standard drug data in the matching pair.
However, with prior art solutions the following often occurs: the difference between the matching scores of the non-standard drug record data and the standard drug data is too small to be distinguished effectively, which not only easily causes matching errors, but also reduces matching efficiency.
It is to be noted that the information disclosed in the above background section is only for enhancement of understanding of the background of the present disclosure, and thus may include information that does not constitute prior art known to those of ordinary skill in the art.
Disclosure of Invention
An object of the present disclosure is to provide a medicine information difference processing method and a medicine information difference processing system, thereby overcoming, at least to some extent, one or more problems caused by the limitations and disadvantages of the related art.
According to an aspect of the present disclosure, there is provided a medicine information difference processing method including:
acquiring a matching pair consisting of drug record data and standard drug data;
calculating similarity values of the drug record data and reference items of the standard drug data in the matching pair based on drug information codes;
calculating the difference value of each reference item based on the similarity value of each reference item and the judgment threshold value corresponding to each reference item; and
and carrying out weighted average on the difference values of the reference items to obtain a comprehensive matching score of the matching pair.
In an exemplary embodiment of the present disclosure, calculating the difference value of each reference item based on the similarity value of each reference item and the judgment threshold corresponding to each reference item includes:
comparing the similarity value of the reference item with a judgment threshold corresponding to the reference item;
and obtaining the difference value of the reference item by combining the comparison result with the corresponding judgment threshold, the corresponding compatibility probability and the corresponding rejection probability and adopting an exponential amplification mode on the similar value.
In an exemplary embodiment of the present disclosure, comparing the similarity value of the reference item with the judgment threshold corresponding to the reference item includes:
and comparing the similarity value of the reference item with a result obtained after the judgment threshold value corresponding to the reference item is corrected by the correction factor.
In an exemplary embodiment of the present disclosure, obtaining a difference value of the reference item by performing an exponential amplification on the similarity value according to a comparison result in combination with a corresponding judgment threshold, a corresponding compatibility probability, and a corresponding rejection probability includes:
when the similarity value of the reference item is larger than the corresponding judgment threshold value, the similarity value is exponentially amplified through the following formula to obtain the difference value of each reference item:
difference value log (c/d, n) e
Wherein f is a judgment threshold, c is a compatibility probability, d is a repulsion probability, n is a difference factor number, and e is an amplification factor; and
when the similarity value of the reference item is smaller than the corresponding judgment threshold value, the similarity value is exponentially reduced through the following formula to obtain the difference value of each reference item:
difference value log ((1-c)/(1-d), n) e
Wherein f is a judgment threshold, c is a compatibility probability, d is an exclusion probability, n is the number of difference factors, and e is an amplification factor.
In an exemplary embodiment of the present disclosure, the medicine information code includes one or more of a character code, a font code, and a pronunciation code.
According to an aspect of the present disclosure, there is provided a medicine information difference processing system including:
a matching pair acquisition unit for acquiring a matching pair composed of the drug record data and the standard drug data;
a similarity value calculation unit for calculating a similarity value of each reference item of the drug record data and the standard drug data in the matching pair based on drug information coding;
the difference value calculating unit is used for calculating the difference value of each reference item based on the similarity value of each reference item and the judgment threshold value corresponding to each reference item; and
and the matching score calculating unit is used for carrying out weighted average on the difference values of the reference items to obtain the comprehensive matching score of the matching pair.
In an exemplary embodiment of the present disclosure, calculating the difference value of each reference item based on the similarity value of each reference item and the judgment threshold corresponding to each reference item includes:
comparing the similarity value of the reference item with a judgment threshold corresponding to the reference item;
and obtaining the difference value of the reference item by combining the comparison result with the corresponding judgment threshold, the corresponding compatibility probability and the corresponding rejection probability and adopting an exponential amplification mode on the similar value.
In an exemplary embodiment of the present disclosure, comparing the similarity value of the reference item with the judgment threshold corresponding to the reference item includes:
and comparing the similarity value of the reference item with a result obtained after the judgment threshold value corresponding to the reference item is corrected by the correction factor.
According to an aspect of the present disclosure, there is provided an electronic apparatus including
A processor; and
a memory having stored thereon a computer program that, when executed by the processor, implements any of the drug information difference processing methods described above.
According to an aspect of the present disclosure, there is provided a computer-readable storage medium having stored thereon a computer program which, when executed by a processor, implements a drug information difference processing method according to any one of the above.
In the drug information difference processing method and the drug information difference processing system in an exemplary embodiment of the present disclosure, the similarity value of each reference item in the acquired matching pair is calculated by the drug information code; calculating the difference value of each reference item based on the similarity value of each reference item and a judgment threshold value; and carrying out weighted average on the difference values of the reference items to obtain a comprehensive matching score of the matching pair. According to the medicine information difference processing system in the present exemplary embodiment, on one hand, the similarity value of each reference item in the obtained matching pair is calculated through the medicine information code, and the similarity value of each reference item in the matching pair can be accurately obtained; on the other hand, the difference value of each reference item is calculated based on the similarity value of each reference item and the judgment threshold, and the similarity value can be subjected to differential amplification processing according to the comparison result of the similarity value and the judgment threshold, so that the similarity value of each reference item can be effectively distinguished; on the other hand, the difference values of the reference items are weighted and averaged to obtain the comprehensive matching score of the matching pairs, so that the difference between the matching scores of the matching pairs can be obviously improved, the matching pairs can be effectively distinguished, the matching result is more accurate, and the matching efficiency is further improved.
It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory only and are not restrictive of the disclosure.
Drawings
The above and other features and advantages of the present disclosure will become more apparent by describing in detail exemplary embodiments thereof with reference to the attached drawings.
Fig. 1 schematically shows a schematic diagram of a matching pair consisting of m pieces of drug record data and n pieces of standard drug data;
FIG. 2 schematically shows a flow chart of a drug information difference processing method according to an exemplary embodiment of the present disclosure;
FIG. 3 schematically illustrates drug information data having multiple reference frames according to an exemplary embodiment of the present disclosure;
FIG. 4 is a schematic diagram illustrating a comparison of a composite match score of a matching pair calculated according to an exemplary embodiment of the present disclosure with a composite match score of a matching pair calculated according to one aspect; and
fig. 5 schematically illustrates a block diagram of a drug information difference processing system according to an exemplary embodiment of the present disclosure.
Detailed Description
Example embodiments will now be described more fully with reference to the accompanying drawings. Example embodiments may, however, be embodied in many different forms and should not be construed as limited to the embodiments set forth herein; rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the concept of example embodiments to those skilled in the art. The same reference numerals denote the same or similar parts in the drawings, and thus, a repetitive description thereof will be omitted.
Furthermore, the described features, structures, or characteristics may be combined in any suitable manner in one or more embodiments. In the following description, numerous specific details are provided to give a thorough understanding of embodiments of the disclosure. One skilled in the relevant art will recognize, however, that the embodiments of the disclosure can be practiced without one or more of the specific details, or with other methods, components, materials, devices, steps, and so forth. In other instances, well-known structures, methods, devices, implementations, materials, or operations are not shown or described in detail to avoid obscuring aspects of the disclosure.
The block diagrams shown in the figures are functional entities only and do not necessarily correspond to physically separate entities. That is, these functional entities may be implemented in the form of software, or in one or more software-hardened modules, or in different networks and/or processor devices and/or microcontroller devices.
In the present exemplary embodiment, a drug information difference processing method is first provided. Referring to fig. 2, the medicine information difference processing method may include the steps of:
s210, acquiring a matching pair consisting of the drug record data and the standard drug data;
s220, calculating similarity values of the drug record data and reference items of the standard drug data in the matching pair based on drug information codes;
step S230, calculating difference values of the reference items based on the similarity values of the reference items and the judgment threshold values corresponding to the reference items; and
and S240, carrying out weighted average on the difference values of the reference items to obtain a comprehensive matching score of the matching pair.
According to the medicine information difference processing method in the present exemplary embodiment, on one hand, the similarity value of each reference item in the obtained matching pair is calculated through the medicine information code, and the similarity value of each reference item in the matching pair can be accurately obtained; on the other hand, the difference value of each reference item is calculated based on the similarity value of each reference item and the judgment threshold, and the similarity value can be subjected to differential amplification processing according to the comparison result of the similarity value and the judgment threshold, so that the similarity value of each reference item can be effectively distinguished; on the other hand, the difference values of the reference items are weighted and averaged to obtain the comprehensive matching score of the matching pairs, so that the difference between the matching scores of the matching pairs can be obviously improved, the matching pairs can be effectively distinguished, the matching result is more accurate, and the matching efficiency is further improved.
Next, the medicine information difference processing method in the present exemplary embodiment will be described in detail.
In step S110, a matching pair consisting of the drug record data and the standard drug data is acquired.
In the present exemplary embodiment, the drug record data may be acquired from a medical information system such as a HIS (hospital information system) or an EMR (electronic medical record) of a hospital, but in the exemplary embodiment of the present disclosure, the drug record data may also be acquired from a CIS (clinical information system) of a hospital or a drug information database of a hospital, which is not particularly limited by the present disclosure. In the present exemplary embodiment, the standard drug data may be standard drug data established according to "chinese pharmacopoeia", or may be standard drug data established according to other medical standards, for example, according to "united states pharmacopoeia", "european pharmacopoeia", "WHO international pharmacopoeia", traditional Chinese medicine patent drug preparations issued by the ministry of health "and" national Chinese patent drug standards compilation ", and the disclosure is not particularly limited thereto.
In the present exemplary embodiment, a unified interface may be designed for different types of databases in the hospital data system, such as MySQL, SQL Server, Oracle, DB2, and the like, and the drug record data and the standard drug data in each database may be acquired through the unified interface.
Further, in the present exemplary embodiment, the drug record data and the standard data each contain a plurality of reference items, each corresponding to a reference frame. Referring to fig. 3, in fig. 3, light areas indicate nonstandard drug record data to be processed, and dark areas indicate standard drug data. Each piece of drug record data in fig. 3 may include 7 reference items, each corresponding to a reference frame, for example, the reference frames corresponding to the 7 reference items in fig. 3 are: drug number, approved letter number, drug name (Chinese), drug name (English), formulation specification, drug formulation, manufacturer name. It should be noted that, although the above-mentioned 7 reference frames are shown in the present exemplary embodiment, the reference frames in the exemplary embodiment of the present disclosure are not limited thereto, and for example, the reference frames may also be reference frames of drug components, drug functions, storage modes, and the like, which is also within the protection scope of the present disclosure.
Next, in step S220, similarity values of the reference items of the drug record data and the standard drug data in the matching pair are calculated based on drug information codes.
In the present exemplary embodiment, the medicine information code may include one or more of a character code, a font code, and a pronunciation code, and in an exemplary embodiment of the present disclosure, the medicine information code may further include a font code, a phrase code, and the like, which are also within the scope of the present disclosure.
Further, in the present exemplary embodiment, the similarity value of each reference item in the matching pair may be calculated by editing the distance, but the exemplary embodiments of the present disclosure are not limited thereto, and for example, the similarity value of each reference item in the matching pair may also be calculated by an algorithm such as an N-Gram algorithm and a Soundex algorithm, which is not particularly limited by the present disclosure.
Specifically, table 1 below shows similarity values of the respective reference items calculated by the edit distance. In table 1 below, the list where the NID is located represents standard drug data, the list where the MID is located represents non-standard drug record data, and R1 to R6 represent different reference frames, which may be, for example, drug number, approved character number, drug name (chinese), drug name (english), formulation specification, and drug dosage form.
TABLE 1 similarity values for reference terms in matching pairs
NID MID R1 R2 R3 R4 R5 R6
N102 M185 91.06427 83.02689 55.76598 85.38511 42.87573 83.00242
N102 M423 67.88041 67.07605 89.16404 75.7294 75.68885 60.30778
N102 M902 86.84194 63.75839 88.71935 58.84218 66.24005 65.45528
N102 M580 76.63916 84.76704 62.67677 64.97011 72.45706 62.17329
N102 M1022 56.88255 93.85251 49.15134 91.16981 82.73499 49.01752
N102 M276 45.39508 49.52274 80.3929 39.7603 73.90614 40.16648
N102 M986 56.07566 47.0751 48.23597 55.15449 57.2735 56.68381
N102 M556 21.67067 50.90629 21.7255 57.97105 29.45841 32.19147
N102 M851 38.69177 22.67075 32.35278 36.74417 39.38383 34.78763
N102 M265 13.63181 75.12384 23.11756 60.24734 5.905576 24.16077
N102 M897 28.86901 84.51688 16.08536 16.22755 17.17475 24.42942
Next, in step S230, a difference value of each reference item is calculated based on the similarity value of each reference item and the judgment threshold corresponding to each reference item.
In the present exemplary embodiment, in order to better perform the distinctive processing on the similarity value of each reference item, the judgment threshold value of the similarity value of each reference system may be preset according to the importance degree of the content of each reference system, and furthermore, the judgment threshold value corresponding to each reference system may be corrected according to the processing result of the medicine information difference processing method of the present exemplary embodiment, for example, the judgment threshold value corresponding to the reference system may be corrected by setting a correction coefficient according to the processing result. After the judgment threshold of the similarity value of each reference system is preset, the similarity value of each reference item may be compared with the judgment threshold corresponding to the reference item, when the similarity value of the reference item is greater than the corresponding judgment threshold, the similarity value of the reference item is subjected to a positive amplification process, and when the similarity value of the reference item is less than the corresponding judgment threshold, the similarity value of the reference item is subjected to a negative amplification process.
For example, in the present exemplary embodiment, assuming that the judgment threshold of the reference item is f, the correction coefficient is h, the compatibility probability is c, the repulsion probability is d, the number of the difference factors is n, and the amplification coefficient is e, the similarity value of the reference item may be directly compared with the judgment threshold f corresponding to the reference item, or the similarity value of the reference item may be compared with the sum of the corresponding judgment threshold and the correction coefficient, for example, f + h, or the similarity value of the reference item may be compared with the value of the expression f + h (f + h), which is also within the protection scope of the present disclosure. Table 2 below shows preset values of the judgment threshold f, the correction coefficient h, the compatibility probability c, and the repulsion probability d corresponding to each reference frame according to the present exemplary embodiment:
reference item Judging threshold value Correction factor Probability of compatibility Probability of rejection
R1 53.05839 0.05031834 0.7703145 0.2933824
R2 65.66332 -0.1913443 0.8428367 0.0861728
R3 51.58069 -0.0240676 0.9510112 0.0806468
R4 58.38195 -0.0771713 0.9676309 0.1304032
R5 51.19081 -0.0369349 0.785452 0.2617031
R6 48.39781 0.03709961 0.9866528 0.3030836
It should be noted that the preset values are only examples in the present exemplary embodiment, and the preset values of the judgment threshold f, the correction coefficient h, the compatibility probability c, and the rejection probability d corresponding to each reference frame may also be modified according to the processing result, which is also within the protection scope of the present disclosure.
Further, after the similarity value of the reference item is compared with the corresponding judgment threshold, the difference value of the reference item can be obtained by combining the corresponding judgment threshold, the compatibility probability and the rejection probability according to the comparison result and adopting an exponential amplification mode for the similarity value. For example, when the similarity value of a reference item is greater than the corresponding judgment threshold, the similarity value is exponentially amplified by the following formula to obtain the difference value of each reference item:
difference value log (c/d, n) e (1)
Wherein f is a judgment threshold, c is a compatibility probability, d is a repulsion probability, n is a difference factor number, and e is an amplification factor; and
when the similarity value of the reference item is smaller than the corresponding judgment threshold value, the similarity value is exponentially reduced through the following formula to obtain the difference value of each reference item:
difference value log ((1-c)/(1-d), n) e (2)
Wherein f is a judgment threshold, c is a compatibility probability, d is an exclusion probability, n is the number of difference factors, and e is an amplification factor.
For example, table 3 below shows the difference values of the reference items obtained after exponentially amplifying the similarity values of the reference items. In table 3 below, the list where the NID is located represents standard drug data, the list where the MID is located represents non-standard drug record data, and R1 to R6 represent different reference frames, such as drug number, approved literature, drug name (chinese), drug name (english), formulation specification, and drug dosage form.
TABLE 3 difference values obtained after exponential amplification of the similarity values
NID MID R1 R2 R3 R4 R5 R6
N102 M185 126.8221 273.154268 198.5142 246.8891 -76.4432 141.3387
N102 M423 94.5347 220.676804 317.4037 218.9698 120.0116 102.6937
N102 M902 120.9418 209.761874 315.8207 170.1408 105.0297 111.459
N102 M580 106.7327 278.879278 223.115 187.8596 114.8873 105.8703
N102 M1022 79.21836 308.769993 -207.915 263.6154 131.1839 -279.712
N102 M276 -73.5977 -125.77079 286.1804 -188.768 117.185 -229.205
N102 M986 78.09464 -119.55462 -204.043 159.478 90.81239 96.52271
N102 M556 -35.134 -129.28452 -91.901 167.622 -52.5214 -183.697
N102 M851 -62.7298 -57.575931 -136.855 -174.449 -70.2174 -198.511
N102 M265 -22.1008 247.15363 -97.7895 174.2038 -10.5291 -137.87
N102 M897 -46.8045 278.056272 -68.0426 -77.0429 -30.6209 -139.403
Next, in step S240, the difference values of the reference items are weighted and averaged to obtain a comprehensive matching score of the matching pair.
In this exemplary embodiment, the average value of the difference values of the reference items may be directly calculated to obtain the comprehensive matching score of the matching pair, or the difference values of the reference items may be weighted and averaged to obtain the comprehensive matching score of the matching pair, and the weight of the difference value of each reference item may be determined according to the importance degree of the content of each reference system.
Specifically, table 4 below shows a comprehensive match score obtained by directly averaging the similarity values of the reference items and a comprehensive match score obtained by directly averaging the difference values of the reference items. In table 4 below, the list of NIDs indicates standard drug data, the list of MIDs indicates non-standard drug record data, and R1 to R6 indicate different reference systems, for example, the reference systems may be drug numbers, approved characters, drug names (chinese), drug names (english), formulation specifications, and drug dosage forms, AVG is the average value of the similarity values of the reference items, i.e., the comprehensive match score, and POW is the average value of the difference values of the reference items, i.e., the comprehensive match score. As can be seen from table 4, the difference between the comprehensive matching scores calculated by using the drug information difference processing method of the present exemplary embodiment is significantly enlarged, so that each matching pair can be effectively distinguished, the matching accuracy can be improved, and the matching efficiency is further improved.
TABLE 4 comparison of the match score POW calculated using the present exemplary embodiment with the match score AVG of a solution technique
NID MID R1 R2 R3 R4 R5 R6 AVG POW
N102 M185 91.06427 83.02689 55.76598 85.38511 42.87573 83.00242 73.52007 151.7125
N102 M423 67.88041 67.07605 89.16404 75.7294 75.68885 60.30778 72.64109 179.0484
N102 M902 86.84194 63.75839 88.71935 58.84218 66.24005 65.45528 71.64287 172.1923
N102 M580 76.63916 84.76704 62.67677 64.97011 72.45706 62.17329 70.6139 169.5574
N102 M1022 56.88255 93.85251 49.15134 91.16981 82.73499 49.01752 70.46812 49.19343
N102 M276 45.39508 49.52274 80.3929 39.7603 73.90614 40.16648 54.85727 -35.6627
N102 M986 56.07566 47.0751 48.23597 55.15449 57.2735 56.68381 53.41642 16.88505
N102 M556 21.67067 50.90629 21.7255 57.97105 29.45841 32.19147 35.6539 -54.1526
N102 M851 38.69177 22.67075 32.35278 36.74417 39.38383 34.78763 34.10515 -116.723
N102 M265 13.63181 75.12384 23.11756 60.24734 5.905576 24.16077 33.69782 25.51128
N102 M897 28.86901 84.51688 16.08536 16.22755 17.17475 24.42942 31.21716 -13.9763
More intuitively, as shown in fig. 4, a relatively gentle curve in fig. 4 represents the average AVG of the similarity values of the reference items, and a relatively steep curve represents the average POW of the difference values of the reference items, so that with the medicine information difference processing method in the present exemplary embodiment, the difference between the matching scores of the matching pairs can be significantly increased, and the accuracy of matching can be improved.
Further, in the present exemplary embodiment, the matching pairs may also be sorted according to the magnitude of the obtained composite matching score of the matching pairs, so that the standard drug data with the highest matching score to the non-standard drug record data may be quickly obtained.
It should be noted that although the various steps of the methods of the present disclosure are depicted in the drawings in a particular order, this does not require or imply that these steps must be performed in this particular order, or that all of the depicted steps must be performed, to achieve desirable results. Additionally or alternatively, certain steps may be omitted, multiple steps combined into one step execution, and/or one step broken down into multiple step executions, etc.
Further, in the present exemplary embodiment, there is also provided a computer-readable storage medium having stored thereon a computer program which, when executed by a processor, implements the drug information difference processing method according to any one of the above-described embodiments.
Further, in the present exemplary embodiment, a medicine information difference processing system is also provided. Referring to fig. 5, the medicine information difference processing system 500 may include: a matching pair acquisition unit 510, a similarity value calculation unit 520, a difference value calculation unit 530, and a matching score calculation unit 540. Wherein:
the matching pair obtaining unit 510 is configured to obtain a matching pair composed of the drug record data and the standard drug data;
the similarity value calculation unit 520 is configured to calculate similarity values of the reference items of the drug record data and the standard drug data in the matching pair based on drug information codes;
the difference value calculating unit 530 is configured to calculate a difference value of each reference item based on the similarity value of each reference item and the judgment threshold corresponding to each reference item; and
the matching score calculating unit 540 is configured to perform weighted average on the difference values of the reference items to obtain a comprehensive matching score of the matching pair.
Further, in this example embodiment, calculating the difference value of each reference item based on the similarity value of each reference item and the judgment threshold corresponding to each reference item may include: comparing the similarity value of the reference item with a judgment threshold corresponding to the reference item; and obtaining the difference value of the reference item by combining the comparison result with the corresponding judgment threshold, the corresponding compatibility probability and the corresponding rejection probability and adopting an exponential amplification mode on the similar value.
Further, in the present exemplary embodiment, comparing the similarity value of the reference item with the judgment threshold corresponding to the reference item may include: and comparing the similarity value of the reference item with a result obtained after the judgment threshold value corresponding to the reference item is corrected by the correction factor.
Since each functional module of the medicine information difference processing system 500 of the exemplary embodiment of the present disclosure corresponds to the step of the exemplary embodiment of the medicine information difference processing method described above, it is not described herein again.
Further, in the present exemplary embodiment, there is also provided an electronic apparatus including a processor; and a memory on which is stored a computer program that, when executed by the processor, implements the drug information difference processing method of any one of the above embodiments.
It should be noted that although several modules or units of the drug information difference processing system are mentioned in the above detailed description, such division is not mandatory. Indeed, the features and functionality of two or more modules or units described above may be embodied in one module or unit, according to embodiments of the present disclosure. Conversely, the features and functions of one module or unit described above may be further divided into embodiments by a plurality of modules or units.
Through the above description of the embodiments, those skilled in the art will readily understand that the exemplary embodiments described herein may be implemented by software, or by software in combination with necessary hardware. Therefore, the technical solution according to the embodiments of the present disclosure may be embodied in the form of a software product, which may be stored in a non-volatile storage medium (which may be a CD-ROM, a usb disk, a removable hard disk, etc.) or on a network, and includes several instructions to enable a computing device (which may be a personal computer, a server, a touch terminal, or a network device, etc.) to execute the method according to the embodiments of the present disclosure.
Other embodiments of the disclosure will be apparent to those skilled in the art from consideration of the specification and practice of the disclosure disclosed herein. This application is intended to cover any variations, uses, or adaptations of the disclosure following, in general, the principles of the disclosure and including such departures from the present disclosure as come within known or customary practice within the art to which the disclosure pertains. It is intended that the specification and examples be considered as exemplary only, with a true scope and spirit of the disclosure being indicated by the following claims.
It will be understood that the present disclosure is not limited to the precise arrangements described above and shown in the drawings and that various modifications and changes may be made without departing from the scope thereof. The scope of the present disclosure is limited only by the appended claims.

Claims (4)

1. A medicine information difference processing method is characterized by comprising the following steps:
acquiring a matching pair consisting of drug record data and standard drug data;
calculating a similarity value of each reference item of the drug record data and the standard drug data in the matching pair based on drug information codes, wherein the drug information codes comprise one or more of character codes, font codes, pronunciation codes, sound-shape codes and phrase codes;
comparing the similarity value of the reference item with a result obtained after correcting the judgment threshold value corresponding to the reference item through the correction factor;
obtaining a difference value of the reference item by combining the comparison result with the corresponding judgment threshold, the corresponding compatibility probability and the corresponding rejection probability and adopting an exponential amplification mode on the similar value; and
carrying out weighted average on the difference values of the reference items to obtain a comprehensive matching score of the matching pair;
the obtaining of the difference value of the reference item by the similarity value in an exponential amplification manner according to the comparison result in combination with the corresponding judgment threshold, the corresponding compatibility probability and the corresponding rejection probability comprises:
when the similarity value of the reference item is larger than the corresponding judgment threshold value, the similarity value is exponentially amplified through the following formula to obtain the difference value of each reference item:
difference = similarity value log (c/d, n) e
Wherein c is the compatibility probability, d is the repulsion probability, n is the number of difference factors and e is the amplification factor; and
when the similarity value of the reference item is smaller than the corresponding judgment threshold value, the similarity value is exponentially reduced through the following formula to obtain the difference value of each reference item:
difference = similarity value log ((1-c)/(1-d), n) e
Where c is the compatibility probability, d is the repulsion probability, n is the number of difference factors, and e is the amplification factor.
2. A drug information difference processing system, comprising:
a matching pair acquisition unit for acquiring a matching pair composed of the drug record data and the standard drug data;
a similarity value calculation unit for calculating a similarity value of each reference item of the drug record data and the standard drug data in the matching pair based on drug information codes, wherein the drug information codes comprise one or more of character codes, font codes, pronunciation codes, sound-shape codes and phrase codes;
the difference value calculation unit is used for comparing the similarity value of the reference item with a result obtained after the judgment threshold value corresponding to the reference item is corrected through the correction factor; obtaining a difference value of the reference item by combining the comparison result with the corresponding judgment threshold, the corresponding compatibility probability and the corresponding rejection probability and adopting an exponential amplification mode on the similar value; and
the matching score calculating unit is used for carrying out weighted average on the difference values of all the reference items to obtain the comprehensive matching score of the matching pair;
the difference value calculation unit is configured to:
when the similarity value of the reference item is larger than the corresponding judgment threshold value, the similarity value is exponentially amplified through the following formula to obtain the difference value of each reference item:
difference = similarity value log (c/d, n) e
Wherein c is the compatibility probability, d is the repulsion probability, n is the number of difference factors and e is the amplification factor; and
when the similarity value of the reference item is smaller than the corresponding judgment threshold value, the similarity value is exponentially reduced through the following formula to obtain the difference value of each reference item:
difference = similarity value log ((1-c)/(1-d), n) e
Where c is the compatibility probability, d is the repulsion probability, n is the number of difference factors, and e is the amplification factor.
3. An electronic device, comprising:
a processor; and
a memory having stored thereon a computer program which, when executed by the processor, implements the drug information difference processing method according to claim 1.
4. A computer-readable storage medium on which a computer program is stored, which program, when executed by a processor, implements the drug information difference processing method according to claim 1.
CN201710209408.3A 2017-03-31 2017-03-31 Medicine information difference processing method and system Active CN107203686B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710209408.3A CN107203686B (en) 2017-03-31 2017-03-31 Medicine information difference processing method and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710209408.3A CN107203686B (en) 2017-03-31 2017-03-31 Medicine information difference processing method and system

Publications (2)

Publication Number Publication Date
CN107203686A CN107203686A (en) 2017-09-26
CN107203686B true CN107203686B (en) 2021-04-20

Family

ID=59905567

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710209408.3A Active CN107203686B (en) 2017-03-31 2017-03-31 Medicine information difference processing method and system

Country Status (1)

Country Link
CN (1) CN107203686B (en)

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109313709A (en) * 2017-12-29 2019-02-05 深圳配天智能技术研究院有限公司 A kind of measure of similarity, device and storage device
CN110929477B (en) * 2018-09-03 2023-04-28 阿里巴巴集团控股有限公司 Keyword variant determination method and device
CN110543505B (en) * 2019-09-06 2022-02-18 北京远舢智能科技有限公司 Monitoring system based on time series data
CN111798969A (en) * 2020-06-29 2020-10-20 平安国际智慧城市科技股份有限公司 Medical medicine matching method and device, electronic equipment and storage medium
CN113130038A (en) * 2021-04-30 2021-07-16 康键信息技术(深圳)有限公司 Medicine data matching method, device, equipment and storage medium
CN113223657A (en) * 2021-06-01 2021-08-06 联仁健康医疗大数据科技股份有限公司 Medicine information processing method and device, electronic equipment and storage medium

Family Cites Families (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN100576207C (en) * 2007-05-29 2009-12-30 北大方正集团有限公司 Remove the method for repeating objects based on metadata
CN101727535A (en) * 2008-10-30 2010-06-09 北大方正集团有限公司 Cross indexing method for patients crossing system and system thereof
US9355105B2 (en) * 2012-12-19 2016-05-31 International Business Machines Corporation Indexing of large scale patient set
CN104121918A (en) * 2013-04-25 2014-10-29 北京搜狗信息服务有限公司 Real-time path planning method and system
CN103488783A (en) * 2013-09-29 2014-01-01 方正国际软件有限公司 Patient information matching system and patient information matching method
CN103530334B (en) * 2013-09-29 2018-01-23 北大医疗信息技术有限公司 Based on the data matching system and method for comparing template
CN103514285A (en) * 2013-09-29 2014-01-15 方正国际软件有限公司 System and method for generating optimal record data
CN103473373A (en) * 2013-09-29 2013-12-25 方正国际软件有限公司 Threshold matching model-based similarity analysis system and threshold matching model-based similarity analysis method
CN105447474B (en) * 2015-12-15 2019-08-06 3M创新有限公司 Portrait identification method and Identification of Images device
CN106372249B (en) * 2016-09-23 2018-04-13 北京三快在线科技有限公司 A kind of clicking rate predictor method, device and electronic equipment

Also Published As

Publication number Publication date
CN107203686A (en) 2017-09-26

Similar Documents

Publication Publication Date Title
CN107203686B (en) Medicine information difference processing method and system
CN107103048B (en) Medicine information matching method and system
AU2017408801B2 (en) User keyword extraction device and method, and computer-readable storage medium
WO2018120889A1 (en) Input sentence error correction method and device, electronic device, and medium
CN107783950B (en) Method and device for processing drug instruction
Pruss et al. Zika discourse in the Americas: A multilingual topic analysis of Twitter
CN108628975B (en) Database SQL statement optimization method and device
US8793120B1 (en) Behavior-driven multilingual stemming
US20190325021A1 (en) Method and Device for Creating Hyperlink
CN103473373A (en) Threshold matching model-based similarity analysis system and threshold matching model-based similarity analysis method
CN108427702B (en) Target document acquisition method and application server
US20180260390A1 (en) Translation assistance system, translation assitance method and translation assistance program
JP2017532661A (en) Reorganization and presentation of data fields with incorrect inputs
US20220100887A1 (en) Privacy Protection Query Language PQL and System Thereof
JP6237632B2 (en) Text information monitoring dictionary creation device, text information monitoring dictionary creation method, and text information monitoring dictionary creation program
CN110909168A (en) Knowledge graph updating method and device, storage medium and electronic device
CN111986759A (en) Method and system for analyzing electronic medical record, computer equipment and readable storage medium
CN111091883B (en) Medical text processing method, device, storage medium and equipment
CN111104481A (en) Method, device and equipment for identifying matching field
CN114036914A (en) Table data processing method and device, electronic equipment and storage medium
CN110909121A (en) Method and system for medical industry data standardization
EP3425531A1 (en) System, method, electronic device, and storage medium for identifying risk event based on social information
US9495341B1 (en) Fact correction and completion during document drafting
CN106844325B (en) Medical information processing method and medical information processing apparatus
US20120265520A1 (en) Text processor and method of text processing

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
TR01 Transfer of patent right
TR01 Transfer of patent right

Effective date of registration: 20240306

Address after: 215000 No.27 Xinfa Road, Suzhou Industrial Park, Suzhou area, China (Jiangsu) pilot Free Trade Zone, Suzhou City, Jiangsu Province

Patentee after: SUZHOU IRON TECHNOLOGY Co.,Ltd.

Country or region after: China

Address before: 10 / F, Ailong building, 27 Xinfa Road, Suzhou Industrial Park, 215000 Jiangsu Province

Patentee before: SUZHOU IRON INFORMATION TECHNOLOGY CO.,LTD.

Country or region before: China