CN111192689A - Patient identification method based on medical data - Google Patents

Patient identification method based on medical data Download PDF

Info

Publication number
CN111192689A
CN111192689A CN201811361095.4A CN201811361095A CN111192689A CN 111192689 A CN111192689 A CN 111192689A CN 201811361095 A CN201811361095 A CN 201811361095A CN 111192689 A CN111192689 A CN 111192689A
Authority
CN
China
Prior art keywords
same
medical record
record data
point
patient
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201811361095.4A
Other languages
Chinese (zh)
Other versions
CN111192689B (en
Inventor
罗立刚
张旸
陈超
王飞
周剑
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Linkdoc Technology Beijing Co ltd
Original Assignee
Linkdoc Technology Beijing Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Linkdoc Technology Beijing Co ltd filed Critical Linkdoc Technology Beijing Co ltd
Priority to CN201811361095.4A priority Critical patent/CN111192689B/en
Publication of CN111192689A publication Critical patent/CN111192689A/en
Application granted granted Critical
Publication of CN111192689B publication Critical patent/CN111192689B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H50/00ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
    • G16H50/70ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for mining of medical data, e.g. analysing previous cases of other patients

Abstract

The invention provides a patient identification method based on medical data, which comprises the following steps: A. performing point location extraction in each medical record data, and judging whether the extracted point locations are the same; B. classifying the medical record data with the same point location; C. and D, classifying the medical record data classified in the step B into medical record data of the same patient. According to the technical scheme, whether the point locations are the same or not is directly judged, and medical records with the same point locations are initially classified into the same type, so that the same patient with different medical records is finally screened. Different from the prior art that probability error classification may exist in calculating the similarity between point locations, and the absolute point locations are the same for judgment, so that the accuracy of patient identification can be improved, and the defects in the prior art are eliminated to a certain extent.

Description

Patient identification method based on medical data
Technical Field
The invention relates to the technical field of medical big data, in particular to a patient identification method based on medical data.
Background
During medical data processing, a patient is the basis for medical analysis. But patient association between multiple pieces of patient data is inaccurate in most data processing scenarios.
As shown in fig. 6, the conventional general method is: and extracting the upper point positions of the same data source data from different medical record data to judge the similarity, judging that the same person is the same person when the similarity exceeds a threshold value, and judging that the person is two persons otherwise. The step of extracting the upper point position of the same data source data from different medical record data comprises the step of extracting information such as names, identity card numbers, mobile phone numbers, discharge dates, birthdays and the like from different medical record data.
However, the above method is very easy to have recognition errors, and cannot split data with recognized errors. Thus, the results of medical analysis based on inaccurate data will also be inaccurate. The use of these analysis results in medical treatment is a potential problem.
Disclosure of Invention
The invention mainly aims to provide a patient identification method based on medical data, which comprises the following steps:
A. performing point location extraction in each medical record data, and judging whether the extracted point locations are the same;
B. associating the medical record data with the same point location;
C. and D, classifying the medical record data classified in the step B into medical record data of the same patient.
According to the technical scheme, whether the point locations are the same or not is directly judged, and medical records with the same point locations are initially classified into the same type, so that the same patient with different medical records is finally screened. Different from the prior art that probability error classification may exist in calculating the similarity between point locations, and the absolute point locations are the same for judgment, so that the accuracy of patient identification can be improved, and the defects in the prior art are eliminated to a certain extent.
The step B comprises the following steps:
setting different weights for each point;
and if at least one high-weight point position is the same in different medical record data, associating the medical record data.
Therefore, the absolute point positions are judged to be the same, the accuracy of patient identification can be improved, and the defects of the prior art are eliminated to a certain extent.
The step B comprises the following steps:
setting different weights for each point;
and if at least two low-weight point positions are the same in different medical record data, associating the medical record data.
Thus, for the case of lack of high-weight points, the correct rate of patient identification can be maintained by matching of multiple low-weight points.
The step B comprises the following steps:
setting different weights for each point;
and if only one low-weight point position is the same in different medical record data, carrying out manual screening prompt, and associating the medical record data after confirmation.
Therefore, when the matching of the single low-weight point is generated, the condition of error identification is avoided by informing the manual screening.
The step C further comprises the following steps:
and D, splitting the medical record data which cannot be associated in the step B into the medical record data in the step A.
Therefore, when the medical record data can not be classified, the medical record data is split, and the aim of keeping the accuracy of patient identification is also fulfilled.
The point location includes at least one of: name, identification number, mobile phone number, birthday, sex, blood type, hospital number, department, date of admission and date of discharge.
Therefore, whether the medical record data are the same or not is judged through different information.
And D, identifying the identity of the patient according to the point location.
Drawings
FIG. 1 is a flow chart of the present invention;
FIG. 2 is a schematic diagram of point location determination for different medical records;
FIG. 3 is a diagram illustrating the initial classification of medical records having the same point location;
FIG. 4 is a corresponding diagram of associated medical records and a patient;
FIG. 5-1 is a schematic diagram illustrating point location extraction performed synchronously on different medical record data and determining whether the point locations are the same;
FIG. 5-2 is a schematic diagram of an initialization categorization of medical records having the same point location;
5-3 are schematic diagrams of tests for initializing classifications;
FIGS. 5-4 are schematic diagrams of the patient to which each medical record after examination is assigned;
fig. 6 is a flow chart of the prior art.
Detailed Description
The method for identifying a patient based on medical data according to the present invention will be described in detail with reference to fig. 1 to 5.
As shown in fig. 1, the method comprises the following steps:
s100: and synchronously extracting point positions from different medical record data.
Compared with the prior art that point locations are extracted in batches aiming at medical record data, the embodiment adopts the step of synchronously extracting the point locations from different medical record data.
As shown in fig. 2, the extracted points include, but are not limited to, names, identification numbers, mobile phone numbers, birthdays, and the like. The extraction of the point locations is performed in four medical records (corresponding to the first to fourth medical records in fig. 2) simultaneously. In addition, the point location may also include a hospital number, a hospital, a department, a date of admission and a date of discharge, etc., which are not described in detail herein.
S200: it is determined whether the points extracted in step S100 are the same.
Different from the similarity-based judgment in the prior art, in the embodiment, the patient identification is performed based on whether the point locations are the same.
Referring to fig. 2, the identification numbers in the first and second medical records are the same as the mobile phone number; the names and the birthdays in the second and third medical records are the same; the name, the ID card number and the birthday in the third and fourth calendars are the same.
S300: and initializing and classifying all medical records with the same point positions.
Referring to fig. 2 and 3, there are at least two same points between the four medical records. Then it can be used as the basis for initializing the classification, and assuming that the first, second, and third medical records belong to the first patient, the fourth medical record and the third medical record have the same point location, so that the four medical records are related. The association is such that the first to fourth schedules point to the same patient, i.e. the fourth schedule is also identified as the first patient when the classification is initiated.
Further, by the initialization classification, a preliminary corresponding relation between each medical record and the patient can be established. As shown in fig. 4, when two medical records have the same point, the two medical records are considered to have a relationship, i.e., may be the same patient. In fig. 4, the same combinations of the first to third medical records are summarized as the first patient, and the same points of the fourth to seventh medical records are summarized as the second patient.
S400: and (5) checking the initialization classification, if the initialization classification passes the checking, entering the step S500, otherwise, splitting the initialization classification into an original state, and returning to the step S200.
Different weights are set for each point or combination of points, for example, if the coincidence rate of the combination information such as name + birthday is higher, the combination information is set to be a lower weight, and if the combination coincidence rate of the hospital number + hospital + department is higher, the combination information is set to be a lower weight. The combination of the identity card number and the mobile phone number has uniqueness, and the combination is set as the highest weight. Of course, in the actual judgment process, the combinations of the same points are various, which is not listed here.
When the two medical records have the same combination with high weight, the same patient is considered. For example, in the first medical record and the second medical record in fig. 2, if the identification number is consistent with the highest weighted combination of the mobile phone numbers, the two medical records are identified as the same patient. In the second medical record and the third medical record, although the name and the date of birth are the same, since the weight of the combination is low, even if the two medical records are the same, the two medical records may not belong to the same patient.
In addition, because the mobile phone number may be used by the patient B after the patient A logs out, the weight of the mobile phone number may be set to be lower than that of the identity card number, or to be lower than that of the name and the birthday. When the mobile phone numbers are the same and the identity card number is absent, at least two points with the same number including the same mobile phone number can be taken as judgment conditions, so that under the condition of lacking the identity card number, a plurality of medical records can be accurately associated with the patient. The number of points to be matched is not limited herein.
The example is described in connection with fig. 4. And when the eighth medical record exists, the eighth medical record and the third medical record respectively have the condition that the highest weight combination of the identity card number and the mobile phone number is the same. Meanwhile, if the eighth medical record is the same as the fifth medical record in the low-weight combination of name + birthday + hospital + department, there is a possibility that the first patient and the second patient are actually the same patient.
Based on the situation, if the high weight is adopted for distinguishing, the association with the same low weight between the eighth medical record and the fifth medical record is ignored, namely the medical records belong to different patients A and B, and the medical records corresponding to the patient A are the first medical record, the third medical record and the eighth medical record; if the low weight is used for resolution, the first patient and the second patient are considered to be the same patient.
In the case of FIG. 5, A to F represent 6 different medical records, respectively. Wherein the number 1 indicates that the point locations are the same and the number 0 indicates that the point locations are different. The dashed boxes between the case history C and the case history E represent low weight matching, and if the point locations of the other case histories are the same, the case histories are all high weight matching.
Fig. 5-1 and 5-2 present the initialized categorization between medical records in the form of a table and a block diagram, respectively.
FIG. 5-3 shows that the medical records A-D are classified as patient A and the medical records E-F are classified as patient B according to a high weight matching relationship.
And the case of low-weight matching exists between the medical record C and the medical record E, and no other reference basis exists. Prompting is carried out for the situation so as to carry out manual screening.
S500: and classifying the medical records successfully checked in the step S400 into the same patient.
Fig. 5-4 show the results of the determination after manual screening, i.e., patient a and patient b are the same person. And the identity recognition of the patient can be realized according to the point positions.
The technical scheme of the application is different from the prior art that the similarity between the point positions is calculated, and whether the point positions are the same or not is directly judged. Medical records with the same point location initially fall into the same category. The same patient with different medical records is finally screened by examining the initially summarized classes.
The above description is only for the purpose of illustrating the preferred embodiments of the present invention and is not to be construed as limiting the invention, and any modifications, equivalents, improvements and the like that fall within the spirit and principle of the present invention are intended to be included therein.

Claims (7)

1. A method for identifying a patient based on medical data, comprising the steps of:
A. performing point location extraction in each medical record data, and judging whether the extracted point locations are the same;
B. associating the medical record data with the same point location;
C. and D, classifying the medical record data classified in the step B into medical record data of the same patient.
2. The method of claim 1, wherein step B comprises:
setting different weights for each point;
and if at least one high-weight point position is the same in different medical record data, associating the medical record data.
3. The method of claim 1, wherein step B comprises:
setting different weights for each point;
and if at least two low-weight point positions are the same in different medical record data, associating the medical record data.
4. The method of claim 1, wherein step B comprises:
setting different weights for each point;
and if only one low-weight point position is the same in different medical record data, carrying out manual screening prompt, and associating the medical record data after confirmation.
5. The method of claim 1, wherein step C further comprises:
and D, splitting the medical record data which cannot be associated in the step B into the medical record data in the step A.
6. The method of any one of claims 1 to 5, wherein the point locations comprise at least one of: name, identification number, mobile phone number, birthday, sex, blood type, hospital number, department, date of admission and date of discharge.
7. The method of claim 1, further comprising a step D of identifying the identity of the patient from the point location.
CN201811361095.4A 2018-11-15 2018-11-15 Patient identification method based on medical data Active CN111192689B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811361095.4A CN111192689B (en) 2018-11-15 2018-11-15 Patient identification method based on medical data

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811361095.4A CN111192689B (en) 2018-11-15 2018-11-15 Patient identification method based on medical data

Publications (2)

Publication Number Publication Date
CN111192689A true CN111192689A (en) 2020-05-22
CN111192689B CN111192689B (en) 2023-11-24

Family

ID=70708890

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811361095.4A Active CN111192689B (en) 2018-11-15 2018-11-15 Patient identification method based on medical data

Country Status (1)

Country Link
CN (1) CN111192689B (en)

Citations (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040128163A1 (en) * 2002-06-05 2004-07-01 Goodman Philip Holden Health care information management apparatus, system and method of use and doing business
US20050267782A1 (en) * 2004-05-28 2005-12-01 Gudrun Zahlmann System for processing patient medical data for clinical trials and aggregate analysis
JP2007287027A (en) * 2006-04-19 2007-11-01 Fujifilm Corp Medical planning support system
US20080212847A1 (en) * 2007-01-08 2008-09-04 Michael Davies Method and system for identifying medical sample information source
JP2009146345A (en) * 2007-12-18 2009-07-02 Mitsubishi Electric Information Systems Corp Electronic medical chart system
CN101727535A (en) * 2008-10-30 2010-06-09 北大方正集团有限公司 Cross indexing method for patients crossing system and system thereof
JP2010167042A (en) * 2009-01-21 2010-08-05 Canon Inc Medical diagnostic support apparatus and control method of the same and program
CN102576431A (en) * 2009-10-06 2012-07-11 皇家飞利浦电子股份有限公司 Autonomous linkage of patient information records stored at different entities
JP2015114721A (en) * 2013-12-09 2015-06-22 株式会社東芝 Medical information processing device
KR20150086089A (en) * 2014-01-17 2015-07-27 주식회사 라이브존 System and method of managing medical image using electronic medical record
JP2015230631A (en) * 2014-06-06 2015-12-21 富士ゼロックス株式会社 Information processing device and information processing program
CN105303499A (en) * 2015-09-16 2016-02-03 西部天使(北京)健康科技有限公司 Automatic medical record imputation method and system
US20160085914A1 (en) * 2014-09-23 2016-03-24 Practice Fusion, Inc. Aggregating a patient's disparate medical data from multiple sources
JP2016099810A (en) * 2014-11-21 2016-05-30 日本調剤株式会社 Pharmacy information management system
CN106295182A (en) * 2016-08-10 2017-01-04 依据数据(湖南)科技有限公司 A kind of personal identification method based on patient biological information
CN106682439A (en) * 2016-12-30 2017-05-17 广州慧扬信息系统科技有限公司 Investigational follow-up based medical record screening method
CN106778021A (en) * 2016-12-31 2017-05-31 深圳市前海康启源科技有限公司 Medical diagnosis information management system and method
US20170169168A1 (en) * 2015-12-11 2017-06-15 Lifemed Id, Incorporated Patient identification systems and methods
CN107038336A (en) * 2017-03-21 2017-08-11 科大讯飞股份有限公司 A kind of electronic health record automatic generation method and device
CN107193919A (en) * 2017-05-15 2017-09-22 清华大学深圳研究生院 The search method and system of a kind of electronic health record
CN108352196A (en) * 2015-10-30 2018-07-31 皇家飞利浦有限公司 There is no hospital's matching in the health care data library for going mark of apparent standard identifier

Patent Citations (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040128163A1 (en) * 2002-06-05 2004-07-01 Goodman Philip Holden Health care information management apparatus, system and method of use and doing business
US20050267782A1 (en) * 2004-05-28 2005-12-01 Gudrun Zahlmann System for processing patient medical data for clinical trials and aggregate analysis
JP2007287027A (en) * 2006-04-19 2007-11-01 Fujifilm Corp Medical planning support system
US20080212847A1 (en) * 2007-01-08 2008-09-04 Michael Davies Method and system for identifying medical sample information source
JP2009146345A (en) * 2007-12-18 2009-07-02 Mitsubishi Electric Information Systems Corp Electronic medical chart system
CN101727535A (en) * 2008-10-30 2010-06-09 北大方正集团有限公司 Cross indexing method for patients crossing system and system thereof
JP2010167042A (en) * 2009-01-21 2010-08-05 Canon Inc Medical diagnostic support apparatus and control method of the same and program
CN102576431A (en) * 2009-10-06 2012-07-11 皇家飞利浦电子股份有限公司 Autonomous linkage of patient information records stored at different entities
JP2015114721A (en) * 2013-12-09 2015-06-22 株式会社東芝 Medical information processing device
KR20150086089A (en) * 2014-01-17 2015-07-27 주식회사 라이브존 System and method of managing medical image using electronic medical record
JP2015230631A (en) * 2014-06-06 2015-12-21 富士ゼロックス株式会社 Information processing device and information processing program
US20160085914A1 (en) * 2014-09-23 2016-03-24 Practice Fusion, Inc. Aggregating a patient's disparate medical data from multiple sources
JP2016099810A (en) * 2014-11-21 2016-05-30 日本調剤株式会社 Pharmacy information management system
CN105303499A (en) * 2015-09-16 2016-02-03 西部天使(北京)健康科技有限公司 Automatic medical record imputation method and system
CN108352196A (en) * 2015-10-30 2018-07-31 皇家飞利浦有限公司 There is no hospital's matching in the health care data library for going mark of apparent standard identifier
US20170169168A1 (en) * 2015-12-11 2017-06-15 Lifemed Id, Incorporated Patient identification systems and methods
CN106295182A (en) * 2016-08-10 2017-01-04 依据数据(湖南)科技有限公司 A kind of personal identification method based on patient biological information
CN106682439A (en) * 2016-12-30 2017-05-17 广州慧扬信息系统科技有限公司 Investigational follow-up based medical record screening method
CN106778021A (en) * 2016-12-31 2017-05-31 深圳市前海康启源科技有限公司 Medical diagnosis information management system and method
CN107038336A (en) * 2017-03-21 2017-08-11 科大讯飞股份有限公司 A kind of electronic health record automatic generation method and device
CN107193919A (en) * 2017-05-15 2017-09-22 清华大学深圳研究生院 The search method and system of a kind of electronic health record

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
罗立刚等: "从医疗记录中提取结构化数据的双阅读/录入系统及其应用", 药物流行病学杂志, vol. 26, no. 6, pages 406 - 409 *

Also Published As

Publication number Publication date
CN111192689B (en) 2023-11-24

Similar Documents

Publication Publication Date Title
CN110910976A (en) Medical record detection method, device, equipment and storage medium
CN109272002B (en) Bone age tablet classification method and device
CN109637605B (en) Electronic medical record structuring method and computer-readable storage medium
EP2908282A1 (en) Forensic system, forensic method, and forensic program
US7949156B2 (en) Biometric remediation of datasets
CN105843889B (en) Credibility-based data acquisition method and system for big data and common data
WO2006073951B1 (en) Adaptive fingerprint matching method and apparatus
CN111210402A (en) Face image quality scoring method and device, computer equipment and storage medium
US20180196924A1 (en) Computer-implemented method and system for diagnosis of biological conditions of a patient
CN111883253A (en) Disease data analysis method and lung cancer risk prediction system based on medical knowledge base
CN103443772B (en) The method of the individual sex checking based on multi-modal data analysis
CN112989990A (en) Medical bill identification method, device, equipment and storage medium
CN110135684B (en) Capability assessment method, capability assessment device and terminal equipment
CN111180060A (en) Automatic coding method and device for disease diagnosis
CN109036506B (en) Internet medical inquiry supervision method, electronic device and readable storage medium
CN111192689B (en) Patient identification method based on medical data
CN111816318A (en) Heart disease data queue generation method and risk prediction system
CN112116976A (en) Method and device for processing medicine information and computer readable storage medium
CN110837494B (en) Method and device for identifying unspecified diagnosis coding errors of medical record home page
CN113807256A (en) Bill data processing method and device, electronic equipment and storage medium
CN112863602A (en) Chromosome abnormality detection method, chromosome abnormality detection device, computer device, and storage medium
KR101793185B1 (en) Method for identifying patient personal information
CN113626591A (en) Electronic medical record data quality evaluation method based on text classification
US20240104178A1 (en) Information processing apparatus, information processing method, matching system, program, and storage medium
US20090216139A1 (en) Method for collecting and assigning patient data in a clinical trial

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant