CN111192689A - Patient identification method based on medical data - Google Patents
Patient identification method based on medical data Download PDFInfo
- Publication number
- CN111192689A CN111192689A CN201811361095.4A CN201811361095A CN111192689A CN 111192689 A CN111192689 A CN 111192689A CN 201811361095 A CN201811361095 A CN 201811361095A CN 111192689 A CN111192689 A CN 111192689A
- Authority
- CN
- China
- Prior art keywords
- same
- medical record
- record data
- point
- patient
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 16
- 238000000605 extraction Methods 0.000 claims abstract description 5
- 206010033307 Overweight Diseases 0.000 claims description 7
- 238000012216 screening Methods 0.000 claims description 5
- 239000008280 blood Substances 0.000 claims description 2
- 210000004369 blood Anatomy 0.000 claims description 2
- 238000012790 confirmation Methods 0.000 claims description 2
- 230000007547 defect Effects 0.000 abstract description 3
- 238000010586 diagram Methods 0.000 description 8
- 238000012545 processing Methods 0.000 description 2
- 238000007429 general method Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16H—HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
- G16H50/00—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
- G16H50/70—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for mining of medical data, e.g. analysing previous cases of other patients
Abstract
The invention provides a patient identification method based on medical data, which comprises the following steps: A. performing point location extraction in each medical record data, and judging whether the extracted point locations are the same; B. classifying the medical record data with the same point location; C. and D, classifying the medical record data classified in the step B into medical record data of the same patient. According to the technical scheme, whether the point locations are the same or not is directly judged, and medical records with the same point locations are initially classified into the same type, so that the same patient with different medical records is finally screened. Different from the prior art that probability error classification may exist in calculating the similarity between point locations, and the absolute point locations are the same for judgment, so that the accuracy of patient identification can be improved, and the defects in the prior art are eliminated to a certain extent.
Description
Technical Field
The invention relates to the technical field of medical big data, in particular to a patient identification method based on medical data.
Background
During medical data processing, a patient is the basis for medical analysis. But patient association between multiple pieces of patient data is inaccurate in most data processing scenarios.
As shown in fig. 6, the conventional general method is: and extracting the upper point positions of the same data source data from different medical record data to judge the similarity, judging that the same person is the same person when the similarity exceeds a threshold value, and judging that the person is two persons otherwise. The step of extracting the upper point position of the same data source data from different medical record data comprises the step of extracting information such as names, identity card numbers, mobile phone numbers, discharge dates, birthdays and the like from different medical record data.
However, the above method is very easy to have recognition errors, and cannot split data with recognized errors. Thus, the results of medical analysis based on inaccurate data will also be inaccurate. The use of these analysis results in medical treatment is a potential problem.
Disclosure of Invention
The invention mainly aims to provide a patient identification method based on medical data, which comprises the following steps:
A. performing point location extraction in each medical record data, and judging whether the extracted point locations are the same;
B. associating the medical record data with the same point location;
C. and D, classifying the medical record data classified in the step B into medical record data of the same patient.
According to the technical scheme, whether the point locations are the same or not is directly judged, and medical records with the same point locations are initially classified into the same type, so that the same patient with different medical records is finally screened. Different from the prior art that probability error classification may exist in calculating the similarity between point locations, and the absolute point locations are the same for judgment, so that the accuracy of patient identification can be improved, and the defects in the prior art are eliminated to a certain extent.
The step B comprises the following steps:
setting different weights for each point;
and if at least one high-weight point position is the same in different medical record data, associating the medical record data.
Therefore, the absolute point positions are judged to be the same, the accuracy of patient identification can be improved, and the defects of the prior art are eliminated to a certain extent.
The step B comprises the following steps:
setting different weights for each point;
and if at least two low-weight point positions are the same in different medical record data, associating the medical record data.
Thus, for the case of lack of high-weight points, the correct rate of patient identification can be maintained by matching of multiple low-weight points.
The step B comprises the following steps:
setting different weights for each point;
and if only one low-weight point position is the same in different medical record data, carrying out manual screening prompt, and associating the medical record data after confirmation.
Therefore, when the matching of the single low-weight point is generated, the condition of error identification is avoided by informing the manual screening.
The step C further comprises the following steps:
and D, splitting the medical record data which cannot be associated in the step B into the medical record data in the step A.
Therefore, when the medical record data can not be classified, the medical record data is split, and the aim of keeping the accuracy of patient identification is also fulfilled.
The point location includes at least one of: name, identification number, mobile phone number, birthday, sex, blood type, hospital number, department, date of admission and date of discharge.
Therefore, whether the medical record data are the same or not is judged through different information.
And D, identifying the identity of the patient according to the point location.
Drawings
FIG. 1 is a flow chart of the present invention;
FIG. 2 is a schematic diagram of point location determination for different medical records;
FIG. 3 is a diagram illustrating the initial classification of medical records having the same point location;
FIG. 4 is a corresponding diagram of associated medical records and a patient;
FIG. 5-1 is a schematic diagram illustrating point location extraction performed synchronously on different medical record data and determining whether the point locations are the same;
FIG. 5-2 is a schematic diagram of an initialization categorization of medical records having the same point location;
5-3 are schematic diagrams of tests for initializing classifications;
FIGS. 5-4 are schematic diagrams of the patient to which each medical record after examination is assigned;
fig. 6 is a flow chart of the prior art.
Detailed Description
The method for identifying a patient based on medical data according to the present invention will be described in detail with reference to fig. 1 to 5.
As shown in fig. 1, the method comprises the following steps:
s100: and synchronously extracting point positions from different medical record data.
Compared with the prior art that point locations are extracted in batches aiming at medical record data, the embodiment adopts the step of synchronously extracting the point locations from different medical record data.
As shown in fig. 2, the extracted points include, but are not limited to, names, identification numbers, mobile phone numbers, birthdays, and the like. The extraction of the point locations is performed in four medical records (corresponding to the first to fourth medical records in fig. 2) simultaneously. In addition, the point location may also include a hospital number, a hospital, a department, a date of admission and a date of discharge, etc., which are not described in detail herein.
S200: it is determined whether the points extracted in step S100 are the same.
Different from the similarity-based judgment in the prior art, in the embodiment, the patient identification is performed based on whether the point locations are the same.
Referring to fig. 2, the identification numbers in the first and second medical records are the same as the mobile phone number; the names and the birthdays in the second and third medical records are the same; the name, the ID card number and the birthday in the third and fourth calendars are the same.
S300: and initializing and classifying all medical records with the same point positions.
Referring to fig. 2 and 3, there are at least two same points between the four medical records. Then it can be used as the basis for initializing the classification, and assuming that the first, second, and third medical records belong to the first patient, the fourth medical record and the third medical record have the same point location, so that the four medical records are related. The association is such that the first to fourth schedules point to the same patient, i.e. the fourth schedule is also identified as the first patient when the classification is initiated.
Further, by the initialization classification, a preliminary corresponding relation between each medical record and the patient can be established. As shown in fig. 4, when two medical records have the same point, the two medical records are considered to have a relationship, i.e., may be the same patient. In fig. 4, the same combinations of the first to third medical records are summarized as the first patient, and the same points of the fourth to seventh medical records are summarized as the second patient.
S400: and (5) checking the initialization classification, if the initialization classification passes the checking, entering the step S500, otherwise, splitting the initialization classification into an original state, and returning to the step S200.
Different weights are set for each point or combination of points, for example, if the coincidence rate of the combination information such as name + birthday is higher, the combination information is set to be a lower weight, and if the combination coincidence rate of the hospital number + hospital + department is higher, the combination information is set to be a lower weight. The combination of the identity card number and the mobile phone number has uniqueness, and the combination is set as the highest weight. Of course, in the actual judgment process, the combinations of the same points are various, which is not listed here.
When the two medical records have the same combination with high weight, the same patient is considered. For example, in the first medical record and the second medical record in fig. 2, if the identification number is consistent with the highest weighted combination of the mobile phone numbers, the two medical records are identified as the same patient. In the second medical record and the third medical record, although the name and the date of birth are the same, since the weight of the combination is low, even if the two medical records are the same, the two medical records may not belong to the same patient.
In addition, because the mobile phone number may be used by the patient B after the patient A logs out, the weight of the mobile phone number may be set to be lower than that of the identity card number, or to be lower than that of the name and the birthday. When the mobile phone numbers are the same and the identity card number is absent, at least two points with the same number including the same mobile phone number can be taken as judgment conditions, so that under the condition of lacking the identity card number, a plurality of medical records can be accurately associated with the patient. The number of points to be matched is not limited herein.
The example is described in connection with fig. 4. And when the eighth medical record exists, the eighth medical record and the third medical record respectively have the condition that the highest weight combination of the identity card number and the mobile phone number is the same. Meanwhile, if the eighth medical record is the same as the fifth medical record in the low-weight combination of name + birthday + hospital + department, there is a possibility that the first patient and the second patient are actually the same patient.
Based on the situation, if the high weight is adopted for distinguishing, the association with the same low weight between the eighth medical record and the fifth medical record is ignored, namely the medical records belong to different patients A and B, and the medical records corresponding to the patient A are the first medical record, the third medical record and the eighth medical record; if the low weight is used for resolution, the first patient and the second patient are considered to be the same patient.
In the case of FIG. 5, A to F represent 6 different medical records, respectively. Wherein the number 1 indicates that the point locations are the same and the number 0 indicates that the point locations are different. The dashed boxes between the case history C and the case history E represent low weight matching, and if the point locations of the other case histories are the same, the case histories are all high weight matching.
Fig. 5-1 and 5-2 present the initialized categorization between medical records in the form of a table and a block diagram, respectively.
FIG. 5-3 shows that the medical records A-D are classified as patient A and the medical records E-F are classified as patient B according to a high weight matching relationship.
And the case of low-weight matching exists between the medical record C and the medical record E, and no other reference basis exists. Prompting is carried out for the situation so as to carry out manual screening.
S500: and classifying the medical records successfully checked in the step S400 into the same patient.
Fig. 5-4 show the results of the determination after manual screening, i.e., patient a and patient b are the same person. And the identity recognition of the patient can be realized according to the point positions.
The technical scheme of the application is different from the prior art that the similarity between the point positions is calculated, and whether the point positions are the same or not is directly judged. Medical records with the same point location initially fall into the same category. The same patient with different medical records is finally screened by examining the initially summarized classes.
The above description is only for the purpose of illustrating the preferred embodiments of the present invention and is not to be construed as limiting the invention, and any modifications, equivalents, improvements and the like that fall within the spirit and principle of the present invention are intended to be included therein.
Claims (7)
1. A method for identifying a patient based on medical data, comprising the steps of:
A. performing point location extraction in each medical record data, and judging whether the extracted point locations are the same;
B. associating the medical record data with the same point location;
C. and D, classifying the medical record data classified in the step B into medical record data of the same patient.
2. The method of claim 1, wherein step B comprises:
setting different weights for each point;
and if at least one high-weight point position is the same in different medical record data, associating the medical record data.
3. The method of claim 1, wherein step B comprises:
setting different weights for each point;
and if at least two low-weight point positions are the same in different medical record data, associating the medical record data.
4. The method of claim 1, wherein step B comprises:
setting different weights for each point;
and if only one low-weight point position is the same in different medical record data, carrying out manual screening prompt, and associating the medical record data after confirmation.
5. The method of claim 1, wherein step C further comprises:
and D, splitting the medical record data which cannot be associated in the step B into the medical record data in the step A.
6. The method of any one of claims 1 to 5, wherein the point locations comprise at least one of: name, identification number, mobile phone number, birthday, sex, blood type, hospital number, department, date of admission and date of discharge.
7. The method of claim 1, further comprising a step D of identifying the identity of the patient from the point location.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811361095.4A CN111192689B (en) | 2018-11-15 | 2018-11-15 | Patient identification method based on medical data |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811361095.4A CN111192689B (en) | 2018-11-15 | 2018-11-15 | Patient identification method based on medical data |
Publications (2)
Publication Number | Publication Date |
---|---|
CN111192689A true CN111192689A (en) | 2020-05-22 |
CN111192689B CN111192689B (en) | 2023-11-24 |
Family
ID=70708890
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201811361095.4A Active CN111192689B (en) | 2018-11-15 | 2018-11-15 | Patient identification method based on medical data |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN111192689B (en) |
Citations (21)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20040128163A1 (en) * | 2002-06-05 | 2004-07-01 | Goodman Philip Holden | Health care information management apparatus, system and method of use and doing business |
US20050267782A1 (en) * | 2004-05-28 | 2005-12-01 | Gudrun Zahlmann | System for processing patient medical data for clinical trials and aggregate analysis |
JP2007287027A (en) * | 2006-04-19 | 2007-11-01 | Fujifilm Corp | Medical planning support system |
US20080212847A1 (en) * | 2007-01-08 | 2008-09-04 | Michael Davies | Method and system for identifying medical sample information source |
JP2009146345A (en) * | 2007-12-18 | 2009-07-02 | Mitsubishi Electric Information Systems Corp | Electronic medical chart system |
CN101727535A (en) * | 2008-10-30 | 2010-06-09 | 北大方正集团有限公司 | Cross indexing method for patients crossing system and system thereof |
JP2010167042A (en) * | 2009-01-21 | 2010-08-05 | Canon Inc | Medical diagnostic support apparatus and control method of the same and program |
CN102576431A (en) * | 2009-10-06 | 2012-07-11 | 皇家飞利浦电子股份有限公司 | Autonomous linkage of patient information records stored at different entities |
JP2015114721A (en) * | 2013-12-09 | 2015-06-22 | 株式会社東芝 | Medical information processing device |
KR20150086089A (en) * | 2014-01-17 | 2015-07-27 | 주식회사 라이브존 | System and method of managing medical image using electronic medical record |
JP2015230631A (en) * | 2014-06-06 | 2015-12-21 | 富士ゼロックス株式会社 | Information processing device and information processing program |
CN105303499A (en) * | 2015-09-16 | 2016-02-03 | 西部天使(北京)健康科技有限公司 | Automatic medical record imputation method and system |
US20160085914A1 (en) * | 2014-09-23 | 2016-03-24 | Practice Fusion, Inc. | Aggregating a patient's disparate medical data from multiple sources |
JP2016099810A (en) * | 2014-11-21 | 2016-05-30 | 日本調剤株式会社 | Pharmacy information management system |
CN106295182A (en) * | 2016-08-10 | 2017-01-04 | 依据数据(湖南)科技有限公司 | A kind of personal identification method based on patient biological information |
CN106682439A (en) * | 2016-12-30 | 2017-05-17 | 广州慧扬信息系统科技有限公司 | Investigational follow-up based medical record screening method |
CN106778021A (en) * | 2016-12-31 | 2017-05-31 | 深圳市前海康启源科技有限公司 | Medical diagnosis information management system and method |
US20170169168A1 (en) * | 2015-12-11 | 2017-06-15 | Lifemed Id, Incorporated | Patient identification systems and methods |
CN107038336A (en) * | 2017-03-21 | 2017-08-11 | 科大讯飞股份有限公司 | A kind of electronic health record automatic generation method and device |
CN107193919A (en) * | 2017-05-15 | 2017-09-22 | 清华大学深圳研究生院 | The search method and system of a kind of electronic health record |
CN108352196A (en) * | 2015-10-30 | 2018-07-31 | 皇家飞利浦有限公司 | There is no hospital's matching in the health care data library for going mark of apparent standard identifier |
-
2018
- 2018-11-15 CN CN201811361095.4A patent/CN111192689B/en active Active
Patent Citations (21)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20040128163A1 (en) * | 2002-06-05 | 2004-07-01 | Goodman Philip Holden | Health care information management apparatus, system and method of use and doing business |
US20050267782A1 (en) * | 2004-05-28 | 2005-12-01 | Gudrun Zahlmann | System for processing patient medical data for clinical trials and aggregate analysis |
JP2007287027A (en) * | 2006-04-19 | 2007-11-01 | Fujifilm Corp | Medical planning support system |
US20080212847A1 (en) * | 2007-01-08 | 2008-09-04 | Michael Davies | Method and system for identifying medical sample information source |
JP2009146345A (en) * | 2007-12-18 | 2009-07-02 | Mitsubishi Electric Information Systems Corp | Electronic medical chart system |
CN101727535A (en) * | 2008-10-30 | 2010-06-09 | 北大方正集团有限公司 | Cross indexing method for patients crossing system and system thereof |
JP2010167042A (en) * | 2009-01-21 | 2010-08-05 | Canon Inc | Medical diagnostic support apparatus and control method of the same and program |
CN102576431A (en) * | 2009-10-06 | 2012-07-11 | 皇家飞利浦电子股份有限公司 | Autonomous linkage of patient information records stored at different entities |
JP2015114721A (en) * | 2013-12-09 | 2015-06-22 | 株式会社東芝 | Medical information processing device |
KR20150086089A (en) * | 2014-01-17 | 2015-07-27 | 주식회사 라이브존 | System and method of managing medical image using electronic medical record |
JP2015230631A (en) * | 2014-06-06 | 2015-12-21 | 富士ゼロックス株式会社 | Information processing device and information processing program |
US20160085914A1 (en) * | 2014-09-23 | 2016-03-24 | Practice Fusion, Inc. | Aggregating a patient's disparate medical data from multiple sources |
JP2016099810A (en) * | 2014-11-21 | 2016-05-30 | 日本調剤株式会社 | Pharmacy information management system |
CN105303499A (en) * | 2015-09-16 | 2016-02-03 | 西部天使(北京)健康科技有限公司 | Automatic medical record imputation method and system |
CN108352196A (en) * | 2015-10-30 | 2018-07-31 | 皇家飞利浦有限公司 | There is no hospital's matching in the health care data library for going mark of apparent standard identifier |
US20170169168A1 (en) * | 2015-12-11 | 2017-06-15 | Lifemed Id, Incorporated | Patient identification systems and methods |
CN106295182A (en) * | 2016-08-10 | 2017-01-04 | 依据数据(湖南)科技有限公司 | A kind of personal identification method based on patient biological information |
CN106682439A (en) * | 2016-12-30 | 2017-05-17 | 广州慧扬信息系统科技有限公司 | Investigational follow-up based medical record screening method |
CN106778021A (en) * | 2016-12-31 | 2017-05-31 | 深圳市前海康启源科技有限公司 | Medical diagnosis information management system and method |
CN107038336A (en) * | 2017-03-21 | 2017-08-11 | 科大讯飞股份有限公司 | A kind of electronic health record automatic generation method and device |
CN107193919A (en) * | 2017-05-15 | 2017-09-22 | 清华大学深圳研究生院 | The search method and system of a kind of electronic health record |
Non-Patent Citations (1)
Title |
---|
罗立刚等: "从医疗记录中提取结构化数据的双阅读/录入系统及其应用", 药物流行病学杂志, vol. 26, no. 6, pages 406 - 409 * |
Also Published As
Publication number | Publication date |
---|---|
CN111192689B (en) | 2023-11-24 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110910976A (en) | Medical record detection method, device, equipment and storage medium | |
CN109272002B (en) | Bone age tablet classification method and device | |
CN109637605B (en) | Electronic medical record structuring method and computer-readable storage medium | |
EP2908282A1 (en) | Forensic system, forensic method, and forensic program | |
US7949156B2 (en) | Biometric remediation of datasets | |
CN105843889B (en) | Credibility-based data acquisition method and system for big data and common data | |
WO2006073951B1 (en) | Adaptive fingerprint matching method and apparatus | |
CN111210402A (en) | Face image quality scoring method and device, computer equipment and storage medium | |
US20180196924A1 (en) | Computer-implemented method and system for diagnosis of biological conditions of a patient | |
CN111883253A (en) | Disease data analysis method and lung cancer risk prediction system based on medical knowledge base | |
CN103443772B (en) | The method of the individual sex checking based on multi-modal data analysis | |
CN112989990A (en) | Medical bill identification method, device, equipment and storage medium | |
CN110135684B (en) | Capability assessment method, capability assessment device and terminal equipment | |
CN111180060A (en) | Automatic coding method and device for disease diagnosis | |
CN109036506B (en) | Internet medical inquiry supervision method, electronic device and readable storage medium | |
CN111192689B (en) | Patient identification method based on medical data | |
CN111816318A (en) | Heart disease data queue generation method and risk prediction system | |
CN112116976A (en) | Method and device for processing medicine information and computer readable storage medium | |
CN110837494B (en) | Method and device for identifying unspecified diagnosis coding errors of medical record home page | |
CN113807256A (en) | Bill data processing method and device, electronic equipment and storage medium | |
CN112863602A (en) | Chromosome abnormality detection method, chromosome abnormality detection device, computer device, and storage medium | |
KR101793185B1 (en) | Method for identifying patient personal information | |
CN113626591A (en) | Electronic medical record data quality evaluation method based on text classification | |
US20240104178A1 (en) | Information processing apparatus, information processing method, matching system, program, and storage medium | |
US20090216139A1 (en) | Method for collecting and assigning patient data in a clinical trial |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |