CN104063567A - Establishment method of patient identity source cross reference - Google Patents

Establishment method of patient identity source cross reference Download PDF

Info

Publication number
CN104063567A
CN104063567A CN201310089099.2A CN201310089099A CN104063567A CN 104063567 A CN104063567 A CN 104063567A CN 201310089099 A CN201310089099 A CN 201310089099A CN 104063567 A CN104063567 A CN 104063567A
Authority
CN
China
Prior art keywords
patient
identity
attribute
source
matching
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201310089099.2A
Other languages
Chinese (zh)
Other versions
CN104063567B (en
Inventor
陈文娟
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai United Imaging Healthcare Co Ltd
Wuhan United Imaging Healthcare Co Ltd
Original Assignee
Shanghai United Imaging Healthcare Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shanghai United Imaging Healthcare Co Ltd filed Critical Shanghai United Imaging Healthcare Co Ltd
Priority to CN201310089099.2A priority Critical patent/CN104063567B/en
Publication of CN104063567A publication Critical patent/CN104063567A/en
Application granted granted Critical
Publication of CN104063567B publication Critical patent/CN104063567B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Landscapes

  • Medical Treatment And Welfare Office Work (AREA)

Abstract

The invention provides an establishment method of a patient identity source cross reference. The method comprises steps as follows: one or more identity attributes of a patient are defined as stop attributes; a stop attribute of an ith patient in a patient identity source A is extracted and matched with a stop attribute of a patient in a patient identity source B, and the matched patients in the patient identity source B are recorded as a to-be-matched set Q; one or more patient identity attributes are defined as recognition attributes, the matching ratio of the ith patient in the patient identity source A and each patient in the set Q is calculated according to the recognition attributes, and the cross reference is established for the patient identity source A and the patient identity source B by comparing the matching ratio with a designed threshold value. According to the method, the cross reference can be accurately established under the condition that the patient identity attributes are defaulted, weight is not required to be allocated to each identity attribute, the process is simplified, and the cross reference establishing efficiency and the practicability are improved.

Description

Method for establishing patient identity source cross index
Technical Field
The invention relates to a medical information technology, in particular to a method for establishing a patient identity source cross index.
Background
With the rapid development of information technology and the deep innovation of medical system, the process of informatization construction of the medical industry in China is gradually accelerated. Medical information systems with specific functions, such as Hospital Information Systems (HIS), Picture Archiving and Communication Systems (PACS), Radiology Information Systems (RIS), etc., are becoming essential technical supports and infrastructures for modern healthcare facility operations. However, the informatization construction of the medical industry in China lacks systematicness and continuity, each medical information system only plays a role in an independent service range, and forms an information isolated island which is used as a camp, so that medical information of patients in the generation of heterogeneous medical information systems is difficult to effectively communicate and share.
In order to realize the transmission and sharing of cross-regional medical Information and the integration of medical service resources, the organization related to the North American radiology of North American architecture (RSNA) and the American Society for medical Information and Management Systems (HIMSS) have established an integrated medical Enterprise health (IHE) concept together with equipment manufacturers. Wherein, a patient identity cross-indexing (PIX) technical framework provides guidance for integrating the heterogeneous medical information system. The IHE PIX technical framework defines three roles: a PIX manager, a patient identity source, and a PIX user. The PIX manager is responsible for creating the unique identity identifier of the patient in the global scope, namely a patient primary index (master patient index), and establishing a mapping relation between the patient primary index and the local identity identifiers of different patient identity sources. Once the PIX manager establishes a cross-index between heterogeneous information systems, a PIX user can obtain a complete view of patient information.
Patient identification is an important cornerstone for building heterogeneous information system cross-indexing. Only by establishing a set of reliable patient identity uniqueness recognition mechanism, the diagnosis information of the same patient in different patient identity sources can be found out, so that the local identity identifiers of the patient identity sources are associated. Currently, the unique identification mechanism of patient identity adopted in the medical field is usually based on a deterministic matching strategy. The basic principle of the deterministic matching strategy is as follows: defining patient attributes for identity recognition, and manually specifying the weight of each attribute; and then calculating the similarity between the reference patient and the patient to be matched item by item, and judging whether to establish cross index for the reference patient and the patient to be matched according to the weighted average value of the similarity. The deterministic matching method has the disadvantages that: the patients in the heterogeneous information system are matched pairwise, the time complexity of the algorithm is in the square level, and the establishing efficiency of the cross index is seriously influenced; the sum of the weights of all the attributes must be 1, when a patient stored in the medical information system is absent, the weights must be distributed to all the attributes again, and the actual operability is poor; the boundary of the weighted average value of the weight and the similarity of each attribute is manually appointed, and the accuracy and the reliability of cross indexing are easily interfered by subjective factors.
Disclosure of Invention
The invention solves the problem of providing a method for establishing a patient identity source cross index, which not only improves the efficiency of establishing the cross index, but also can accurately establish the cross index under the condition that the identity attribute of a patient is default, does not need to distribute weight to each identity attribute, simplifies the flow and improves the practicability.
In order to solve the above problems, the present invention provides a method for establishing a patient identity source cross-index, comprising:
(1) defining one or more identity attributes of the patient as blocking attributes;
(2) extracting the blocking attribute of the ith patient in the patient identity source A, matching the blocking attribute with the blocking attribute of the patient in the patient identity source B, and recording the matched patient in the patient identity source B as a set Q to be matched;
(3) matching the identity attribute of the ith patient in the patient identity source A with the identity attribute of each patient in the set Q to obtain a matching ratio;
(4) according to the matching ratio, establishing a cross index between the ith patient in the patient identity source A and the patient identity source B;
(5) and (5) circularly executing the steps (2) to (4) until all the patient identity sources A and B are cross-indexed.
The method for establishing the patient identity source cross index comprises the following steps:
(1) determining blocking attribute information key of ith patient in the patient identity source Ai(ii) a Determining blocking attribute information key of jth patient in the patient identity source BjAnd calculates the blocking attribute information keyiAnd keyjThe similarity value of (a);
(2) and if the similarity value is larger than a specified threshold value, putting the jth patient in the patient identity source B into the set Q to be matched until the patient identity source B is traversed.
The method for establishing the patient identity source cross index comprises the following steps:
(1) setting N identity attributes of the patient as attributes for identifying the identity of the patient, wherein N is more than or equal to 1;
(2) according to the attribute for identifying the identity of the patient, comparing the ith patient in the patient identity source A with each patient in the set Q to be matched to obtain a similarity value of the identity attribute of the identified patient;
(3) creating a corresponding binary space vector C according to the similarity valueij={Cij(1),Cij(2),…,Cij(N), if the similarity value of the identity attribute of the ith patient in the patient identity source A and the identity attribute of the kth patient in the set Q to be matched is greater than a specified threshold value, Cij(k) 1 is ═ 1; otherwise, then Cij(k) 0; wherein k is more than or equal to 1 and less than or equal to N;
(4) calculating a probability distribution of the binary space vectorWherein, FreqlL is more than or equal to 1 and less than or equal to 2 and is the frequency of the binary space vector appearing in the l-th stateN
(5) And respectively calculating the matching ratio of the ith patient in the patient identity source A and each patient in the set Q to be matched according to the probability distribution.
The method for establishing the patient identity source cross index comprises the steps of identifying the identity attribute of the patient, wherein the identity attribute is an identification number, a driver license number, a social security number, a birth date, a telephone number and/or a contact address.
The method for establishing the patient identity source cross index comprises the following steps of:
(1) creating an identification identity attribute vector S of the ith patient in the patient identity source Ai
(2) Creating an identification identity attribute vector T of the jth patient in the set Q to be matchedj
(3) Respectively from the said identity attribute vector SiAnd TjTake out the k-th elementAnd Tj k(ii) a And calculate saidAnd Tj kThe similarity value of (a).
The above method for establishing a patient identity source cross index, wherein the formula for calculating the similarity value is as follows:
<math> <mrow> <mi>similarity</mi> <mrow> <mo>(</mo> <msubsup> <mi>S</mi> <mi>i</mi> <mi>k</mi> </msubsup> <mo>,</mo> <msubsup> <mi>T</mi> <mi>j</mi> <mi>k</mi> </msubsup> <mo>)</mo> </mrow> <mo>=</mo> <mi>&theta;</mi> <mrow> <mo>(</mo> <msubsup> <mi>S</mi> <mi>i</mi> <mi>k</mi> </msubsup> <mo>,</mo> <msubsup> <mi>T</mi> <mi>j</mi> <mi>k</mi> </msubsup> <mo>)</mo> </mrow> </mrow> </math>
wherein,is the similarity value; theta is a character string matching function; k is more than or equal to 1 and less than or equal to N, and N is the number of the identification identity attributes.
The method for establishing the patient identity source cross index includes the following steps of:
(1) obtaining a matching threshold vector m-m through a maximum expectation algorithm according to the probability distribution1,m2,…,mNAnd a mismatch threshold vector, u ═ u1,u2,…,uNN is the number of the identity attributes;
(2) initializing values of the match probability and the mismatch probability;
(3) according to the value and the initialization value of each element in the binary space vector, performing iterative computation to obtain a matching probability and a mismatching probability;
(4) and calculating to obtain a matching ratio according to the matching probability and the mismatching probability.
The method for establishing the patient identity source cross index, wherein the formula for calculating the matching ratio is as follows:
Ratio ij = log 2 ( p 1 j / p 2 j )
wherein RatioijIs the match ratio;is the match probability;is the mismatch probability.
The method for establishing the patient identity source cross index includes the following steps:
(1) calculating the matching Ratio according to a statistical modelijUpper bound T ofupAnd a lower bound Tlow
(3) If the matching Ratio of the jth patient in the data set Q isijGreater than said upper bound TupThen, the jth patient and the ith patient in the identity source A are subjected to cross indexing; if the matching Ratio of the jth patient in the data set Q isijGreater than or equal to the lower bound TlowAnd is less than or equal to the upper bound TupThen, carrying out manual treatment; if the matching Ratio of the jth patient in the data set Q isijLess than said lower bound TlowThen no processing is performed.
Compared with the prior art, the method introduces the blocking attribute, greatly reduces the matching times and improves the efficiency of establishing the cross index.
Further, compared with the existing method for calculating the identity attribute weighted similarity, the technical scheme of the invention not only can accurately establish the cross index under the condition that the identity attribute of the patient is in default, but also does not need to manually distribute the weight to each attribute, thereby simplifying the actual operation process and avoiding the interference of subjective factors.
Furthermore, the invention introduces the false rejection probability and the false extraction probability, and improves the objectivity, the reliability and the practical value of the cross index.
Drawings
FIG. 1 is a schematic flow chart illustrating cross-indexing of patient identity sources according to an embodiment of the present invention;
FIG. 2 is a schematic flow chart illustrating the matching ratio obtaining according to the embodiment of the present invention;
FIG. 3 is a schematic flow chart illustrating a process of obtaining a similarity value of an identity attribute according to an embodiment of the present invention;
FIG. 4 is a schematic flow chart illustrating the calculation of the matching ratio between the ith patient in the patient identity source A and each patient in the set Q to be matched according to the embodiment of the present invention;
fig. 5 is a schematic flow chart illustrating the process of cross-indexing between the ith patient in the patient identity source a and the patient in the patient identity source B according to the embodiment of the present invention.
Detailed Description
In the following description, numerous specific details are set forth in order to provide a thorough understanding of the present invention. This invention may, however, be embodied in many different forms and should not be construed as limited to the embodiments set forth herein, but rather construed as limited to the embodiments set forth herein.
Next, the present invention is described in detail by using schematic diagrams, and when the embodiments of the present invention are described in detail, the schematic diagrams are only examples for convenience of description, and the scope of the present invention should not be limited herein.
The present invention will be described in detail below with reference to the drawings and examples. As shown in fig. 1, first, step S1 is executed to define one or more identity attributes of a patient as blocking attributes. Specifically, in this embodiment, the name of the patient is used as the blocking attribute, the blocking information of the blocking attribute is the pinyin of the name, and the name and the identification number of the patient may also be used as the blocking attribute.
Next, step S2 is executed to extract the blocking attribute of the ith patient in the patient identity source a, match the blocking attribute with the blocking attribute of the patient in the patient identity source B, and record the matching patient in the patient identity source B as the set Q to be matched. Specifically, the blocking attribute information key of the ith patient in the patient identity source A is analyzediIf the patient name is used as the blocking attribute, the blocking attribute information key of the ith patient in the patient identity source A is obtained in step S1iSpelling the patient's name; then, the blocking attribute information key of the jth patient in the patient identity source B is analyzedjI.e. the pinyin for the patient's name; and calculating the blocking attribute information key by a character string similarity calculation functioniAnd keyjSimilarity value of (3) similarity _ name (key)i,keyj) If the similarity value is greater than a specified threshold v0Then put the jth patient in the patient identity source B into the set Q to be matched. In this embodiment, if the ith patient in the patient identity source a is the 1 st patient in the patient identity source a, the blocking attribute information of the patient is analyzed, then the blocking attribute information of the 1 st patient in the patient identity source B is analyzed, the similarity between the blocking attribute information and the blocking attribute information is calculated, and the similarity and the specified threshold v are calculated0Making comparison, if it is greater than specified threshold value v0Then putting the 1 st patient in the patient identity source B into the set Q to be matched; then analyzing the blocking attribute information of the 2 nd patient in the patient identity source B, calculating the similarity value of the blocking attribute information of the 1 st patient in the patient identity source A and the blocking attribute information of the 2 nd patient in the patient identity source B, and similarly, comparing the similarity value with a specified threshold value v0Making comparison, if it is greater than specified threshold value v0Then put the 2 nd patient in the patient identity source B into the set Q; and the rest is repeated until all patients in the patient identity source B are traversed, and all patients in the patient identity source B which are matched with the 1 st patient in the patient identity source A are put into the set Q. In the present embodiment, the specified threshold v0The value of (A) is 0.8.
Next, step S3 is executed to match the identity attribute of the ith patient in the patient identity source a with the identity attribute of each patient in the set Q, so as to obtain a matching ratio. Specifically, as shown in FIG. 2, step S201 is executed first, and N identity attributes of the patient are set as attributes for identifying the identity of the patient, where N ≧ 1. It should be noted that the identity attribute of the patient may be an identification number, a driver license number, a social security number, a date of birth, a telephone number and/or a contact address, or may be a name. In the present embodiment, 3 identity attributes (identification number, social security number, and date of birth) of the patient are set to identify the patient identity attribute.
Then, step S202 is executed, according to the attribute of identifying the patient identity, the ith patient in the patient identity source a is compared with each patient in the set Q to be matched, so as to obtain the similarity value of the attribute of identifying the patient identity. Specifically, as shown in fig. 3, step S301 is executed first, and an identification identity attribute vector S of the ith patient in the patient identity source a is createdi. In this embodiment, the 1 st patient identification attribute vector S in the patient identification source A is created1. Then, step S302 is executed to create an identification identity attribute vector T of the jth patient in the to-be-matched set Qj. In this embodiment, an identifying identity attribute vector T is created for the first patient in the set Q1. Then, step S303 is executed to respectively identify the identity attribute vector SiAnd TjTake out the k-th elementAnd Tj kAnd calculate saidAnd Tj kThe similarity value of (a). The formula for calculating the similarity is as follows:
<math> <mrow> <mi>similarity</mi> <mrow> <mo>(</mo> <msubsup> <mi>S</mi> <mi>i</mi> <mi>k</mi> </msubsup> <mo>,</mo> <msubsup> <mi>T</mi> <mi>j</mi> <mi>k</mi> </msubsup> <mo>)</mo> </mrow> <mo>=</mo> <mi>&theta;</mi> <mrow> <mo>(</mo> <msubsup> <mi>S</mi> <mi>i</mi> <mi>k</mi> </msubsup> <mo>,</mo> <msubsup> <mi>T</mi> <mi>j</mi> <mi>k</mi> </msubsup> <mo>)</mo> </mrow> <mo>-</mo> <mo>-</mo> <mo>-</mo> <mrow> <mo>(</mo> <mn>1</mn> <mo>)</mo> </mrow> </mrow> </math>
wherein,is the similarity value; theta is a character string matching function; k is more than or equal to 1 and less than or equal to N, and N is the number of the identification identity attributes. In this embodiment, as shown in step S201, if there are 3 id attribute vectors, the vector S1Sum vector T1The three elements are respectively an identification number, a social security number and a birth date. Then respectively from the vector S1Sum vector T1Taking out the 1 st elementAnd T1 1The element is an identity card number and is calculated by the formula (1)And T1 1I.e. the similarity between the identity number of the 1 st patient in the patient identity source a and the first patient in the set Q. Next, the 2 nd element (social security number) is extracted, the similarity value of the identity attribute is obtained by the same method as the 1 st element, and by analogy, the similarity value of the identity attribute defined in step S201 is obtained, and in this embodiment, the similarity values of the three attributes obtained by calculation are 0.9, 0.85, and 0.8, respectively. Next, steps S302 and S303 are repeated, i.e., the 2 nd in the set Q is createdPatient's identity attribute vector T2And calculating the similarity values of the identification identity attributes of the patient and the 1 st patient in the patient identity source A, and repeating the steps until the whole set Q to be matched is traversed, namely calculating the similarity values of the identification identity attributes of the 1 st patient in the patient identity source A and all the patients in the set Q to be matched.
Then, step S203 is executed to create a corresponding binary space vector C according to the similarity valueij={Cij(1),Cij(2),…,Cij(N), if the similarity value of the identity attribute of the ith patient in the patient identity source A and the identity attribute of the kth patient in the set Q to be matched is greater than a specified threshold value, Cij(k) 1 is ═ 1; otherwise, then Cij(k) 0; wherein k is more than or equal to 1 and less than or equal to N, namely when the identification identity attribute of the ith patient in the patient identity source A or the kth patient in the jth patient in the set Q to be matched is missing, Cij(k) 0. Specifically, first, a given threshold vector v ═ v is determined1,v2,…,vN}. In the present embodiment, the threshold vector is specified to be (0.85, 0.84, 0.81). Then, in step S303, the similarity value between the identification identity attributes of the 1 st patient in the patient identity source a and the 1 st patient in the set to be matched Q is (0.9, 0.85.0.8), and then, the corresponding binary space vector is (1, 1, 0). Then, according to the similarity values of all the identification identity attributes calculated in step S303, the binary space vectors corresponding to the similarity values of the 1 st patient in the patient identity source a and each patient in the set Q to be matched can be respectively known.
Then, step S204 is executed to calculate the probability distribution of the binary space vectorWherein, FreqlL is more than or equal to 1 and less than or equal to 2 and is the frequency of the binary space vector appearing in the l-th stateN. Specifically, a series of binary space vectors is known from step S203, and the probability of the occurrence of the vector (1, 1, 0) in the series of binary space vectors and the other shapes of the binary vectors are calculatedThe probability of occurrence in a state, such as (0, 0, 1), (0, 1, 0), etc., thereby obtaining a probability distribution of the binary space vector.
Then, step S205 is executed to calculate the matching ratio between the ith patient in the patient identity source a and each patient in the set Q to be matched. Specifically, as shown in fig. 4, step S401 is executed first, and according to the probability distribution, a matching threshold vector m ═ m is obtained by a maximum expectation algorithm1,m2,…,mNAnd a mismatch threshold vector, u ═ u1,u2,…,uNAnd N is the number of the identity attributes. Next, step S402 is performed to initialize the values of the match probability and the mismatch probability. Specifically, the match probabilities are initializedProbability of mismatchWherein j is the jth patient in the set Q to be matched, i.e. the initial matching probability and the mismatching probability corresponding to each patient in the set Q to be matched are both 1. Then, step S403 is executed, and a matching probability and a mismatching probability are obtained through iterative computation according to the value of each element in the binary space vector and the initialization value. Specifically, for the match probability, if Cij(k) When the value is equal to 0, thenIf C is presentij(k) 1, thenFor the probability of mismatch, if Cij(k) When the value is equal to 0, thenIf C is presentij(k) 1, thenIn this embodiment, the disease is calculated firstAs can be seen from the above, the matching probability and the mismatching probability between the 1 st patient in the human identity source a and the 1 st patient in the set Q to be matched correspond to the corresponding identification identity attribute if the corresponding binary space vector is (1, 1, 0), and the matching probability and the mismatching probability can be obtained. By analogy, the matching probability and the mismatching probability of the 1 st patient in the patient identity source A and each patient in the set Q to be matched can be obtained. Then, step S404 is executed to calculate a matching ratio according to the matching probability and the mismatching probability. Wherein, the calculation formula is:
Ratio ij = log 2 ( p 1 j / p 2 j ) .
next, step S4 is executed to cross-index the ith patient in the patient identity source a and the patient identity source B according to the matching ratio. Specifically, as shown in fig. 5, step S501 is executed first, and the set Q to be matched is sorted in ascending order according to the matching ratio to obtain a data set Q'. Specifically, the patients in the set Q are sorted in ascending order according to the magnitude of the matching ratio calculated in the above step S404. Then, step S502 is executed to calculate the matching Ratio according to a statistical modelijUpper bound T ofupAnd a lower bound Tlow. Specifically, the specified probability of refusal is alpha, the false probability is beta, wherein alpha is more than 0 and less than or equal to 0.1, and beta is more than 0 and less than or equal to 0.1. If the x patient is satisfied in the data set QThen the upper bound T of the match ratio is reachedup=RatioixWherein x is more than 0 and less than M,m is the number of elements in the data set Q', and i is the ith patient in the patient identity source. If the y patient is present in the data set QThe lower bound T of the match ratiolow=RatioiyWherein y is more than 0 and less than x. In this example, the statistical model used was the Fellegi-Sunter model.
Next, step S503 is executed to determine the matching Ratio of the jth patient in the data set QijThe size of (2). If Ratioij>TμpThen step S504 is executed to establish a cross index between the jth patient and the ith patient in the identity source a; if Tlow≤Ratioij≤TμpIf yes, executing step S505 to perform manual processing; if Ratioij≤TλThen, in step S506, no processing is performed. Specifically, the matching Ratio of the first patient in the data set Q is first determined11Comparing the upper and lower limits, and determining the matching Ratio of the second patient12Until the entire data set Q is traversed.
Then, step S5 is executed, and steps S2-S4 are executed in a loop until all patient identity sources a and B are cross-indexed. Specifically, as with the method of establishing the cross-index of the 1 st patient in the patient identity source a and the patient identity source B, the cross-index of the 2 nd patient in the patient identity source a and the patient identity source B is established until the cross-index is established for all patients in the patient identity source a and the patient identity source B.
Although the present invention has been described with reference to the preferred embodiments, it is not intended to limit the present invention, and those skilled in the art can make variations and modifications of the present invention without departing from the spirit and scope of the present invention by using the methods and technical contents disclosed above.

Claims (9)

1. A method for establishing a patient identity source cross index is characterized by comprising the following steps:
(1) defining one or more identity attributes of the patient as blocking attributes;
(2) extracting the blocking attribute of the ith patient in the patient identity source A, matching the blocking attribute with the blocking attribute of the patient in the patient identity source B, and recording the matched patient in the patient identity source B as a set Q to be matched;
(3) matching the identity attribute of the ith patient in the patient identity source A with the identity attribute of each patient in the set Q to obtain a matching ratio;
(4) according to the matching ratio, establishing a cross index between the ith patient in the patient identity source A and the patient identity source B;
(5) and (5) circularly executing the steps (2) to (4) until all the patient identity sources A and B are cross-indexed.
2. The method of claim 1, wherein the step of determining the set Q to be matched comprises:
(1) determining blocking attribute information key of ith patient in the patient identity source Ai(ii) a Determining blocking attribute information key of jth patient in the patient identity source BjAnd calculates the blocking attribute information keyiAnd keyjThe similarity value of (a);
(2) and if the similarity value is larger than a specified threshold value, putting the jth patient in the patient identity source B into the set Q to be matched until the patient identity source B is traversed.
3. The method of claim 1, wherein the matching ratio is obtained by:
(1) setting N identity attributes of the patient as attributes for identifying the identity of the patient, wherein N is more than or equal to 1;
(2) according to the attribute for identifying the identity of the patient, comparing the ith patient in the patient identity source A with each patient in the set Q to be matched to obtain a similarity value of the identity attribute of the identified patient;
(3) creating a corresponding binary space vector C according to the similarity valueij={Cij(1),Cij(2),…,Cij(N), if the similarity value of the identity attribute of the ith patient in the patient identity source A and the identity attribute of the kth patient in the set Q to be matched is greater than a specified threshold value, Cij(k) 1 is ═ 1; otherwise, then Cij(k) 0; wherein k is more than or equal to 1 and less than or equal to N;
(4) calculating a probability distribution of the binary space vector <math> <mrow> <mi>Freq</mi> <mo>=</mo> <mo>{</mo> <msub> <mi>Freq</mi> <mn>1</mn> </msub> <mo>,</mo> <msub> <mi>Freq</mi> <mn>2</mn> </msub> <mo>,</mo> <mo>&CenterDot;</mo> <mo>&CenterDot;</mo> <mo>&CenterDot;</mo> <mo>,</mo> <msub> <mi>Freq</mi> <msup> <mn>2</mn> <mi>N</mi> </msup> </msub> <mo>}</mo> <mo>,</mo> </mrow> </math> Wherein, FreqlL is more than or equal to 1 and less than or equal to 2 and is the frequency of the binary space vector appearing in the l-th stateN
(5) And respectively calculating the matching ratio of the ith patient in the patient identity source A and each patient in the set Q to be matched according to the probability distribution.
4. The method of claim 3, wherein the patient identification attribute is an identification number, a driver license number, a social security number, a date of birth, a telephone number, and/or a contact address.
5. The method of claim 3, wherein the identity attribute similarity value is obtained by:
(1) creating an identification identity attribute vector S of the ith patient in the patient identity source Ai
(2) Creating an identification identity attribute vector T of the jth patient in the set Q to be matchedj
(3) Respectively from the said identity attribute vector SiAnd TjTake out the k-th elementAnd Tj k(ii) a And calculate saidAnd Tj kThe similarity value of (a).
6. The method of claim 5, wherein the similarity value is calculated by the formula:
<math> <mrow> <mi>similarity</mi> <mrow> <mo>(</mo> <msubsup> <mi>S</mi> <mi>i</mi> <mi>k</mi> </msubsup> <mo>,</mo> <msubsup> <mi>T</mi> <mi>j</mi> <mi>k</mi> </msubsup> <mo>)</mo> </mrow> <mo>=</mo> <mi>&theta;</mi> <mrow> <mo>(</mo> <msubsup> <mi>S</mi> <mi>i</mi> <mi>k</mi> </msubsup> <mo>,</mo> <msubsup> <mi>T</mi> <mi>j</mi> <mi>k</mi> </msubsup> <mo>)</mo> </mrow> </mrow> </math>
wherein,is the similarity value; theta is a character string matching function; k is more than or equal to 1 and less than or equal to N, and N is the number of the identification identity attributes.
7. The method of claim 3, wherein the step of calculating the matching ratio between the ith patient in the patient identity source A and each patient in the set Q to be matched comprises:
(1) obtaining a matching threshold vector m-m through a maximum expectation algorithm according to the probability distribution1,m2,…,mNAnd a mismatch threshold vector, u ═ u1,u2,…,uNN is the number of the identity attributes;
(2) initializing values of the match probability and the mismatch probability;
(3) according to the value and the initialization value of each element in the binary space vector, performing iterative computation to obtain a matching probability and a mismatching probability;
(4) and calculating to obtain a matching ratio according to the matching probability and the mismatching probability.
8. The method of claim 7, wherein said matching ratio is calculated by the formula:
Ratio ij = log 2 ( p 1 j / p 2 j )
wherein RatioijIs the match ratio;is the match probability;is the mismatch probability.
9. The method of claim 1, wherein the i-th patient in the patient identity source A and the patient in the patient identity source B establish the cross-index by the following steps:
(1) calculating the matching RatioijUpper bound T ofupAnd a lower bound Tlow
(2) If the jth patient in the data set QMatching Ratio ofijGreater than said upper bound TupThen, the jth patient and the ith patient in the identity source A are subjected to cross indexing; if the matching Ratio of the jth patient in the data set Q isijGreater than or equal to the lower bound TlowAnd is less than or equal to the upper bound TupThen, carrying out manual treatment; if the matching Ratio of the jth patient in the data set Q isijLess than or equal to the lower bound TlowThen no processing is performed.
CN201310089099.2A 2013-03-20 2013-03-20 A kind of method for building up of patient status source cross-index Active CN104063567B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310089099.2A CN104063567B (en) 2013-03-20 2013-03-20 A kind of method for building up of patient status source cross-index

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310089099.2A CN104063567B (en) 2013-03-20 2013-03-20 A kind of method for building up of patient status source cross-index

Publications (2)

Publication Number Publication Date
CN104063567A true CN104063567A (en) 2014-09-24
CN104063567B CN104063567B (en) 2019-06-21

Family

ID=51551279

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310089099.2A Active CN104063567B (en) 2013-03-20 2013-03-20 A kind of method for building up of patient status source cross-index

Country Status (1)

Country Link
CN (1) CN104063567B (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104978322A (en) * 2014-04-03 2015-10-14 武汉联影医疗科技有限公司 Basic data sharing method and basic data sharing system
CN105096028A (en) * 2014-11-20 2015-11-25 北京航天金盾科技有限公司 Intelligent matching method of population data
CN107066803A (en) * 2017-01-25 2017-08-18 济南德健信息技术有限公司 A kind of cleaning method of patient's master index data
CN109522331A (en) * 2018-10-16 2019-03-26 易保互联医疗信息科技(北京)有限公司 Compartmentalization various dimensions health data processing method and medium centered on individual
CN110176282A (en) * 2019-05-22 2019-08-27 浙江大学 A kind of patient identity matching process based on EM algorithm
CN110970136A (en) * 2019-12-02 2020-04-07 北京六元空间信息科技有限责任公司 Clinical study-driven chronic disease management system and method

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070299697A1 (en) * 2004-10-12 2007-12-27 Friedlander Robert R Methods for Associating Records in Healthcare Databases with Individuals
CN101189607A (en) * 2005-03-29 2008-05-28 英国电讯有限公司 Schema matching
CN101727535A (en) * 2008-10-30 2010-06-09 北大方正集团有限公司 Cross indexing method for patients crossing system and system thereof
CN101833725A (en) * 2010-04-15 2010-09-15 刘洪利 Method for matching and comparing user ID and ID matching system
CN102947832A (en) * 2010-06-17 2013-02-27 皇家飞利浦电子股份有限公司 Identity matching of patient records

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070299697A1 (en) * 2004-10-12 2007-12-27 Friedlander Robert R Methods for Associating Records in Healthcare Databases with Individuals
CN101189607A (en) * 2005-03-29 2008-05-28 英国电讯有限公司 Schema matching
CN101727535A (en) * 2008-10-30 2010-06-09 北大方正集团有限公司 Cross indexing method for patients crossing system and system thereof
CN101833725A (en) * 2010-04-15 2010-09-15 刘洪利 Method for matching and comparing user ID and ID matching system
CN102947832A (en) * 2010-06-17 2013-02-27 皇家飞利浦电子股份有限公司 Identity matching of patient records

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104978322A (en) * 2014-04-03 2015-10-14 武汉联影医疗科技有限公司 Basic data sharing method and basic data sharing system
CN104978322B (en) * 2014-04-03 2019-10-08 武汉联影医疗科技有限公司 A kind of sharing method and its system of basic data
CN105096028A (en) * 2014-11-20 2015-11-25 北京航天金盾科技有限公司 Intelligent matching method of population data
CN107066803A (en) * 2017-01-25 2017-08-18 济南德健信息技术有限公司 A kind of cleaning method of patient's master index data
CN107066803B (en) * 2017-01-25 2021-02-09 济南德健信息技术有限公司 Cleaning method for patient main index data
CN109522331A (en) * 2018-10-16 2019-03-26 易保互联医疗信息科技(北京)有限公司 Compartmentalization various dimensions health data processing method and medium centered on individual
CN109522331B (en) * 2018-10-16 2021-04-16 易保互联医疗信息科技(北京)有限公司 Individual-centered regionalized multi-dimensional health data processing method and medium
CN110176282A (en) * 2019-05-22 2019-08-27 浙江大学 A kind of patient identity matching process based on EM algorithm
CN110970136A (en) * 2019-12-02 2020-04-07 北京六元空间信息科技有限责任公司 Clinical study-driven chronic disease management system and method

Also Published As

Publication number Publication date
CN104063567B (en) 2019-06-21

Similar Documents

Publication Publication Date Title
US11816078B2 (en) Automatic entity resolution with rules detection and generation system
CN104063567B (en) A kind of method for building up of patient status source cross-index
CN109583332B (en) Face recognition method, face recognition system, medium, and electronic device
WO2023015935A1 (en) Method and apparatus for recommending physical examination item, device and medium
CN111564223B (en) Infectious disease survival probability prediction method, and prediction model training method and device
CN111968746A (en) Cerebral apoplexy risk prediction method and device based on hybrid deep transfer learning
Ghayvat et al. AI-enabled radiologist in the loop: novel AI-based framework to augment radiologist performance for COVID-19 chest CT medical image annotation and classification from pneumonia
CN109829072A (en) Construct atlas calculation and relevant apparatus
CN106845058A (en) The standardized method of disease data and modular station
CN113539409B (en) Treatment scheme recommendation method, device, equipment and storage medium
CN113192605B (en) Medical image classification method, medical image retrieval method and device
JP6908977B2 (en) Medical information processing system, medical information processing device and medical information processing method
CN112334070A (en) Medical image processing apparatus and method, machine learning system, program, and storage medium
JP2021500692A (en) Genealogy entity resolution system and method
Chykeyuk et al. Class-specific regression random forest for accurate extraction of standard planes from 3D echocardiography
CN112634889A (en) Electronic case logging method, device, terminal and medium based on artificial intelligence
CN110990591A (en) Method and system for auditing transcoding quality of medical data
CN114330267A (en) Structural report template design method based on semantic association
CN115858886A (en) Data processing method, device, equipment and readable storage medium
US11075004B2 (en) Discrimination result providing apparatus, operating method of discrimination result providing apparatus, discrimination result providing program, and discrimination result providing system
KR102275819B1 (en) Method, device and system for automatic documentation of architectural design drawing
CN109616165A (en) Medical information methods of exhibiting and device
CN109906487A (en) The system and method that structuring Finding Object (SFO) for carrying out workflow sensitivity for clinical care continuity is recommended
CN111279424B (en) Apparatus, system and method for optimizing image acquisition workflow
CN116313148A (en) Drug sensitivity prediction method, device, terminal equipment and medium

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
ASS Succession or assignment of patent right

Owner name: WUHAN LIANYING MEDICAL TECHNOLOGY CO., LTD.

Free format text: FORMER OWNER: SHANGHAI UNITED IMAGING HEALTHCARE CO., LTD.

Effective date: 20150717

Owner name: SHANGHAI UNITED IMAGING HEALTHCARE CO., LTD.

Effective date: 20150717

C41 Transfer of patent application or patent right or utility model
TA01 Transfer of patent application right

Effective date of registration: 20150717

Address after: 430206 B1-7 layer, No. 818, hi tech Avenue, East Lake New Technology Development Zone, Hubei, Wuhan

Applicant after: Wuhan Lian Ying medical science and technology company limited

Applicant after: Shanghai United Imaging Healthcare Co., Ltd.

Address before: 201815 Shanghai City, Jiading District Jiading Industrial Zone No. 1180 Building 8 Hing Road

Applicant before: Shanghai United Imaging Healthcare Co., Ltd.

C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CP01 Change in the name or title of a patent holder
CP01 Change in the name or title of a patent holder

Address after: 430206 b1-7 / F, 818 Gaoxin Avenue, Donghu New Technology Development Zone, Wuhan, Hubei Province

Patentee after: WUHAN UNITED IMAGING MEDICAL TECHNOLOGY Co.,Ltd.

Patentee after: Shanghai Lianying Medical Technology Co., Ltd

Address before: 430206 b1-7 / F, 818 Gaoxin Avenue, Donghu New Technology Development Zone, Wuhan, Hubei Province

Patentee before: WUHAN UNITED IMAGING MEDICAL TECHNOLOGY Co.,Ltd.

Patentee before: SHANGHAI UNITED IMAGING HEALTHCARE Co.,Ltd.

CP02 Change in the address of a patent holder
CP02 Change in the address of a patent holder

Address after: 430206 Lianying medical headquarters base, No. 99, gaokeyuan Road, Donghu high tech Development Zone, Wuhan, Hubei Province

Patentee after: WUHAN UNITED IMAGING HEALTHCARE Co.,Ltd.

Patentee after: Shanghai Lianying Medical Technology Co., Ltd

Address before: 430206 b1-7 / F, 818 Gaoxin Avenue, Donghu New Technology Development Zone, Wuhan, Hubei Province

Patentee before: WUHAN UNITED IMAGING HEALTHCARE Co.,Ltd.

Patentee before: Shanghai Lianying Medical Technology Co., Ltd