CN112863626A - Multi-platform similar medical data removing method, device and equipment - Google Patents

Multi-platform similar medical data removing method, device and equipment Download PDF

Info

Publication number
CN112863626A
CN112863626A CN202110248486.0A CN202110248486A CN112863626A CN 112863626 A CN112863626 A CN 112863626A CN 202110248486 A CN202110248486 A CN 202110248486A CN 112863626 A CN112863626 A CN 112863626A
Authority
CN
China
Prior art keywords
patient
information
basic information
basic
platform
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202110248486.0A
Other languages
Chinese (zh)
Inventor
杨开轶
包培文
侯文利
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Guanxin Medical And Health Software Technology Co ltd
Original Assignee
Beijing Guanxin Medical And Health Software Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Guanxin Medical And Health Software Technology Co ltd filed Critical Beijing Guanxin Medical And Health Software Technology Co ltd
Priority to CN202110248486.0A priority Critical patent/CN112863626A/en
Publication of CN112863626A publication Critical patent/CN112863626A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H10/00ICT specially adapted for the handling or processing of patient-related medical or healthcare data
    • G16H10/60ICT specially adapted for the handling or processing of patient-related medical or healthcare data for patient-specific data, e.g. for electronic patient records
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H50/00ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
    • G16H50/70ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for mining of medical data, e.g. analysing previous cases of other patients

Landscapes

  • Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Medical Informatics (AREA)
  • Public Health (AREA)
  • Epidemiology (AREA)
  • General Health & Medical Sciences (AREA)
  • Primary Health Care (AREA)
  • Data Mining & Analysis (AREA)
  • Biomedical Technology (AREA)
  • Databases & Information Systems (AREA)
  • Pathology (AREA)
  • Medical Treatment And Welfare Office Work (AREA)

Abstract

The invention relates to a method, a device and equipment for removing multi-platform similar medical data, belonging to the technical field of data filtering, wherein the method comprises the steps of obtaining basic information of a patient through a plurality of target medical platforms and establishing a main patient index of the basic information of the patient; locating diagnostic information for a single patient via the patient master index; calculating the similarity between the diagnosis information according to the Mahalanobis distance; and judging the similarity and the size of a similarity threshold value, and acquiring the basic information of the patient with the similarity higher than the similarity threshold value. According to the invention, a large amount of similar medical data generated when multiple platforms in a hospital are interconnected and intercommunicated is effectively judged by expanding a patient main index establishing mode and a content similarity algorithm, and is removed or modified according to business needs, so that the accuracy of data analysis is ensured, and the technical problems that the data washing is not thorough and a large amount of repeated data still exists in the repeated data washing method in the prior art are solved.

Description

Multi-platform similar medical data removing method, device and equipment
Technical Field
The invention belongs to the technical field of data filtering, and particularly relates to a method, a device and equipment for removing multi-platform similar medical data.
Background
By establishing a Patient Master Index (EMPI) in each system in a hospital, an islanding effect in medical informatization is broken through to realize regional and even cross-regional medical information integration, and particularly important is that a large amount of repeated data can be positioned and cleaned.
In the existing medical and health industry, duplicate data is usually removed through EMPI, and the method mainly comprises the steps of firstly judging whether a patient is the same person and then carrying out diagnosis content matching. There are 3 kinds of technical schemes for judging whether the same patient is currently in the mainstream: collecting natural information of a patient, such as name, birth date and accommodation address; collecting patient identification information, such as an identity card, a social security card, a passport, a military personal identification card and the like; and collecting service records, such as an outpatient serial number, an inpatient serial number and the like. For content duplication determination, verification is usually performed by using conventional schemes such as keyword matching and ICD (international Classification of diseases) code matching.
Although the prior art can remove part of the repeated data, the repeated data with similar contents can only be identified manually when different systems differentiate disease descriptions, and diagnosticians manually input abbreviations or patients understand different cases. Therefore, the method for cleaning the repeated data in the prior art has the technical problems that the data cleaning is not thorough and a great amount of repeated data still exists.
Disclosure of Invention
The invention provides a method, a device and equipment for removing multi-platform similar medical data, which are used for effectively judging a large amount of similar medical data generated when interconnection and intercommunication are carried out on multiple platforms in a hospital by expanding a patient main index establishing mode and by a content similarity algorithm, and clearing or modifying the similar medical data according to business requirements to ensure the accuracy of data analysis so as to solve the technical problems that the repeated data cleaning method in the prior art is not thorough in data cleaning and still has a large amount of repeated data.
The technical scheme provided by the invention is as follows:
in one aspect, a multi-platform similar medical data removal method includes:
acquiring basic information of a patient through a plurality of target medical platforms, and establishing a patient main index of the basic information of the patient, wherein the patient main index comprises: a base patient primary index and an extended patient primary index, the patient base information comprising: natural information, identification information and service information; the establishing of the patient primary index of the patient basic information comprises the following steps: judging whether the target medical platform is associated with biological characteristic acquisition equipment or not; if the target medical platform is not associated with biological characteristic acquisition equipment, establishing a basic patient main index of the patient basic information; if the target medical platform is associated with biological characteristic acquisition equipment, establishing an expanded patient main index of the patient basic information;
positioning diagnosis information of a single patient through the patient main index, wherein the diagnosis information comprises diagnosis and treatment information and auxiliary diagnosis and treatment information;
calculating the similarity between the diagnosis information according to the Mahalanobis distance;
and judging the similarity and the similarity threshold value, and acquiring the diagnosis information with the similarity higher than the similarity threshold value so as to enable the user to manually remove the similar diagnosis information of the patient.
Optionally, the determining whether the target medical platform is associated with a biometric acquisition device includes: judging whether the basic information of the patient contains biological characteristic information, wherein the biological characteristic information comprises: palm vein information and fingerprint information;
if the target medical platform is not associated with a biological characteristic acquisition device, establishing a basic patient master index of the patient basic information, including: if the patient basic information does not contain the biological characteristic information, establishing a basic patient main index of the patient basic information;
if the target medical platform is associated with a biological feature acquisition device, establishing an extended patient primary index of the patient basic information, including: and if the basic information of the patient comprises the biological characteristic information, establishing an expanded main index of the basic information of the patient.
Optionally, the calculating the similarity between the diagnostic information according to the mahalanobis distance further includes:
judging whether a pre-constructed inverse matrix is full-rank;
and if the pre-constructed inverse matrix is not full-rank, reducing the dimension of the basic information of the patient through principal component analysis.
Optionally, the method further includes: based on the test set, the similarity threshold is determined.
In yet another aspect, a multi-platform similar medical data removal device includes: the device comprises an acquisition module, a positioning determination module, a calculation module and a judgment module;
the acquiring module is configured to acquire patient basic information through a plurality of target medical platforms and establish a patient primary index of the patient basic information, where the patient primary index includes: a base patient primary index and an extended patient primary index, the patient base information comprising: natural information, identification information and service information; the system is used for judging whether the target medical platform is associated with biological characteristic acquisition equipment or not; if the target medical platform is not associated with biological characteristic acquisition equipment, establishing a basic patient main index of the patient basic information; if the target medical platform is associated with biological characteristic acquisition equipment, establishing an expanded patient main index of the patient basic information;
the positioning determination module is used for positioning diagnosis information of a single patient through the patient main index, wherein the diagnosis information comprises diagnosis and treatment information and auxiliary diagnosis and treatment information;
the calculation module is used for calculating the similarity between the diagnosis information according to the Mahalanobis distance;
the judging module is used for judging the similarity and the size of a similarity threshold value and acquiring the diagnosis information with the similarity higher than the similarity threshold value so as to enable a user to manually remove the similar diagnosis information of the patient.
Optionally, the positioning determining module is configured to determine whether the patient basic information includes biometric information, where the biometric information includes: palm vein information and fingerprint information; if the patient basic information does not contain the biological characteristic information, establishing a basic patient main index of the patient basic information; and if the basic information of the patient comprises the biological characteristic information, establishing an expanded main index of the basic information of the patient.
Optionally, the calculating module is configured to determine whether a pre-constructed inverse matrix is of a full rank; and if the pre-constructed inverse matrix is not full-rank, reducing the dimension of the basic information of the patient through principal component analysis.
In yet another aspect, a multi-platform similar medical data removal device includes: a processor, and a memory coupled to the processor;
the memory is configured to store a computer program for performing at least the multi-platform similar medical data removal method of any of the above;
the processor is used for calling and executing the computer program in the memory.
The invention has the beneficial effects that:
according to the method, the device and the equipment for removing the multi-platform similar medical data, provided by the embodiment of the invention, the basic information of a patient is obtained through a plurality of target medical platforms, and a patient main index of the basic information of the patient is established, wherein the basic information of the patient comprises the following steps: natural information, identification information and service information; positioning diagnosis information of a single patient through the patient main index, wherein the diagnosis information comprises diagnosis and treatment information and auxiliary diagnosis and treatment information; calculating the similarity between the diagnosis information according to the Mahalanobis distance; and judging the similarity and the size of a similarity threshold value, and acquiring the basic information of the patient with the similarity higher than the similarity threshold value. According to the invention, a large amount of similar medical data generated when multiple platforms in a hospital are interconnected and intercommunicated is effectively judged by expanding a patient main index establishing mode and a content similarity algorithm, and is removed or modified according to business needs, so that the accuracy of data analysis is ensured, and the technical problems that the data washing is not thorough and a large amount of repeated data still exists in the repeated data washing method in the prior art are solved.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, it is obvious that the drawings in the following description are only some embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to the drawings without creative efforts.
Fig. 1 is a schematic flowchart of a multi-platform similar medical data removal method according to an embodiment of the present invention;
FIG. 2 is a schematic structural diagram of a multi-platform similar medical data removal device according to an embodiment of the present invention;
fig. 3 is a schematic structural diagram of a multi-platform similar medical data removing apparatus according to an embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, the technical solutions of the present invention will be described in detail below. It is to be understood that the described embodiments are merely exemplary of the invention, and not restrictive of the full scope of the invention. All other embodiments, which can be derived by a person skilled in the art from the examples given herein without any inventive step, are within the scope of the present invention.
In a hospital, a plurality of platforms generally exist, when each platform records basic information of a patient, the adopted recording mode may be different, and similar data is usually removed in order to ensure the neatness and accuracy of the data, however, when the conditions that different platform systems are different in disease description, a diagnostician manually enters abbreviations or a medical record encoding person understands a medical record are met, some repeated data with similar contents can only be manually identified. Therefore, the method for cleaning the repeated data in the prior art has the technical problems that the data cleaning is not thorough and a great amount of repeated data still exists.
In order to at least solve the technical problem provided by the present invention, an embodiment of the present invention provides a method for removing multi-platform similar medical data.
Fig. 1 is a schematic flow chart of a multi-platform similar medical data removing method according to an embodiment of the present invention, and referring to fig. 1, the method according to the embodiment of the present invention may include the following steps:
and S11, acquiring basic information of the patient through a plurality of target medical platforms, and establishing a main patient index of the basic information of the patient.
Wherein the patient basis information includes: natural information, identification information, and service information. The patient master index, comprising: a base patient primary index and an extended patient primary index.
In a specific implementation process, a plurality of medical platforms in a certain hospital or a plurality of hospitals can be defined as target medical platforms, and the plurality of target medical platforms are communicated, so that basic information of patients is collected in the plurality of target medical platforms, wherein the basic information of the patients can include natural information, identity recognition information and service information.
The patient basic information of different platforms can be collected through an ETL module, wherein ETL is a process of loading data of a business system to a data warehouse after extraction, cleaning and conversion, and aims to integrate scattered, disordered and standard non-uniform data of each platform together for extraction (extract), conversion (transform) and loading (load) to a destination end.
In some real-time examples, optionally, establishing a patient master index of patient basis information includes: judging whether the target medical platform is associated with biological characteristic acquisition equipment or not; if the target medical platform is not associated with the biological characteristic acquisition equipment, establishing a basic patient main index of the patient basic information; and if the target medical platform is associated with the biological characteristic acquisition equipment, establishing an expanded patient main index of the patient basic information.
In some real-time examples, optionally, determining whether the target medical platform is associated with a biometric acquisition device includes: judging whether the basic information of the patient contains biological characteristic information, wherein the biological characteristic information comprises: palm vein information and fingerprint information; if the target medical platform is not associated with the biological characteristic acquisition equipment, establishing a basic patient main index of the patient basic information, comprising: if the basic information of the patient does not contain the biological characteristic information, establishing a basic patient main index of the basic information of the patient; if the target medical platform is associated with the biological characteristic acquisition equipment, establishing an expanded patient main index of the patient basic information, wherein the expanded patient main index comprises the following steps: and if the basic information of the patient contains the biological characteristic information, establishing an expanded main index of the basic information of the patient.
For example, it may be determined whether the existing patient basic information includes information acquired by a biometric acquisition device such as a palm vein and a fingerprint, and in a medical institution with patient biometric acquisition capability, an EMPI of patient basic information + biometric may be generated; and generating the EMPI of the basic information of the patient by the medical unit without the biological characteristics of the patient.
And S12, positioning the diagnosis information of the single patient through the main index of the patient, wherein the diagnosis information comprises diagnosis and treatment information and auxiliary diagnosis and treatment information.
For example, the diagnosis information located to a single patient through the EMPI may include medical record first page information, clinic, and/or hospital discharge information, and various auxiliary medical information, and the user may add the diagnosis information according to the diagnosis specification of each hospital, which is not limited herein.
And S13, calculating the similarity between the diagnosis information according to the Mahalanobis distance.
After the diagnosis information of the patient is obtained, calculating the content similarity between the diagnosis information according to the Mahalanobis distance:
F=(F1,F2,…,Fm)=UTX
μF=(μ1,μ2,…,μm)
(F-μF)=UT(X-μx)
wherein the original diagnostic information is set to F (F)1,F2,…,Fm) U is a rotation matrix obtained by factorization using SVD (singular value decomposition) algorithm, and X is a new sample information matrix obtained by rotation.
After transformation, the dimensions of the new sample are linearly independent, and the variance of each dimension is a characteristic value, so that the following formula is satisfied:
Figure RE-GDA0003032792060000091
wherein muFMean value of original diagnostic information, muxAs a mean of the new information after rotation, lambda1To lambda4For the characteristic value of the new information dimension, sigmaxFeature matrices for new information dimensions.
Obtaining a calculation similarity formula:
Figure RE-GDA0003032792060000101
wherein f isiIs the new sample information value after rotation.
According to the calculation formula of DM described above, the similarity to the diagnostic information can be calculated.
In some embodiments, optionally, the method further includes: judging whether a pre-constructed inverse matrix is full-rank; and if the pre-constructed inverse matrix is not full-rank, reducing the dimension of the basic information of the patient through principal component analysis.
For example, after positioning the diagnostic information of a single patient by the patient main index, dimension reduction may be performed on the diagnostic information of these different dimensions, dimension reduction may be performed by Principal Component dimension reduction (PCA), the data after dimension reduction is taken as new data, and the similarity between the new data is calculated according to the mahalanobis distance. Can also solve
Figure RE-GDA0003032792060000102
In the process of the inverse matrix, the incomplete rank is found, and the PCA principal component dimensionality reduction is carried out firstly, so that the subsequent DM solving process cannot be influenced.
And S14, judging the similarity and the size of the similarity threshold, and acquiring the diagnosis information with the similarity higher than the similarity threshold so as to enable the user to manually remove the diagnosis information similar to the patient.
After the similarity among the diagnostic information is calculated, the similarity is compared with a similarity threshold value, the basic information of the patient with the similarity higher than the similarity threshold value is obtained, and recommendation is carried out, so that the medical staff can remove repeated information from the basic information of the patient with the higher similarity, and the difficulty of manual checking is reduced.
For example, the similarity threshold may be preset in advance, e.g., the similarity threshold may be 75%. When the setting of the similar threshold is carried out, the verification can be carried out through the marked test set, and when the accuracy of the test result of the test set is higher than 85%, the similar threshold at the moment is considered to be a feasible similar threshold.
In one specific implementation scenario, for example, the A, B platform in a hospital records well-known diagnosis information, wherein the diagnosis information includes: the name, blood pressure, blood sugar, disease information, medication, etc. are recorded in the a-platform for the case of a small and clear antibacterial drug, and in the B-platform for the case of a small and clear antibacterial drug, but the AB-platform is recorded in different recording methods (e.g., recording method 1 and recording method 2). At this time, similar data are obtained by applying the multi-platform similar medical data removal method provided by the application, and if the similarity of the contents in the recording mode 1 and the recording mode 2 is calculated to be higher than the similarity threshold, it is indicated that the two corresponding contents may be the same content, and the contents are recommended to the user for manual identification. If the same information can not be filtered, 2 times of antibacterial drug administration are recorded simultaneously, and the dosage is judged to exceed the standard.
The multi-platform similar medical data removing method provided by the embodiment of the invention obtains basic information of a patient through a plurality of target medical platforms, and establishes a patient main index of the basic information of the patient, wherein the basic information of the patient comprises the following steps: natural information, identification information and service information; positioning diagnosis information of a single patient through a patient main index, wherein the diagnosis information comprises diagnosis information and auxiliary diagnosis information; calculating the similarity between the diagnostic information according to the Mahalanobis distance; and judging the similarity and the size of the similarity threshold value, and acquiring the basic information of the patient with the similarity higher than the similarity threshold value. According to the invention, a large amount of similar medical data generated when multiple platforms in a hospital are interconnected and intercommunicated is effectively judged by expanding a patient main index establishing mode and a content similarity algorithm, and is removed or modified according to business needs, so that the accuracy of data analysis is ensured, and the technical problems that the data washing is not thorough and a large amount of repeated data still exists in the repeated data washing method in the prior art are solved. When different platforms of medical units are interconnected and intercommunicated, the problem that repeated contents with high content similarity are missed only by depending on manual work and keyword matching is solved.
Based on one general inventive concept, the embodiment of the invention also provides a multi-platform similar medical data removing device.
Fig. 2 is a schematic structural diagram of a multi-platform similar medical data removing device according to an embodiment of the present invention, and referring to fig. 2, the device according to an embodiment of the present invention may include the following structures: an acquisition module 21, a positioning determination module 22, a calculation module 23 and a judgment module 24.
The obtaining module 21 is configured to obtain basic information of a patient through a plurality of target medical platforms, and establish a patient master index of the basic information of the patient, where the patient master index includes: the basic patient main index and the extended patient main index, and the patient basic information comprises: natural information, identification information and service information; the system is used for judging whether the target medical platform is associated with biological characteristic acquisition equipment or not; if the target medical platform is not associated with biological characteristic acquisition equipment, establishing a basic patient main index of the patient basic information; if the target medical platform is associated with biological characteristic acquisition equipment, establishing an expanded patient main index of the patient basic information;
the positioning determination module 22 is used for positioning diagnosis information of a single patient through a patient main index, wherein the diagnosis information comprises diagnosis information and auxiliary diagnosis information;
a calculating module 23, configured to calculate similarity between the diagnosis information according to the mahalanobis distance;
and the judging module 24 is configured to judge the similarity and the size of the similarity threshold, and acquire the diagnosis information with the similarity higher than the similarity threshold, so that the user manually removes the similar diagnosis information of the patient.
Optionally, the positioning determining module 22 is configured to determine whether the patient basic information includes biometric information, where the biometric information includes: palm vein information and fingerprint information; if the basic information of the patient does not contain the biological characteristic information, establishing a basic patient main index of the basic information of the patient; and if the basic information of the patient contains the biological characteristic information, establishing an expanded main index of the basic information of the patient.
Optionally, the calculating module 23 is configured to determine whether a pre-constructed inverse matrix is of a full rank; and if the pre-constructed inverse matrix is not full-rank, reducing the dimension of the basic information of the patient through principal component analysis.
With regard to the apparatus in the above-described embodiment, the specific manner in which each module performs the operation has been described in detail in the embodiment related to the method, and will not be elaborated here.
The multi-platform similar medical data removing device provided by the embodiment of the invention acquires basic information of a patient through a plurality of target medical platforms, and establishes a patient main index of the basic information of the patient, wherein the basic information of the patient comprises: natural information, identification information and service information; positioning diagnosis information of a single patient through a patient main index, wherein the diagnosis information comprises diagnosis information and auxiliary diagnosis information; calculating the similarity between the diagnostic information according to the Mahalanobis distance; and judging the similarity and the size of the similarity threshold value, and acquiring the basic information of the patient with the similarity higher than the similarity threshold value. According to the invention, a large amount of similar medical data generated when multiple platforms in a hospital are interconnected and intercommunicated is effectively judged by expanding a patient main index establishing mode and a content similarity algorithm, and is removed or modified according to business needs, so that the accuracy of data analysis is ensured, and the technical problems that the data washing is not thorough and a large amount of repeated data still exists in the repeated data washing method in the prior art are solved.
Based on one general inventive concept, embodiments of the present invention also provide a multi-platform similar medical data removing apparatus.
Fig. 3 is a schematic structural diagram of a multi-platform similar medical data removing apparatus according to an embodiment of the present invention, and referring to fig. 3, the multi-platform similar medical data removing apparatus according to the embodiment of the present invention includes: a processor 31, and a memory 32 connected to the processor.
The memory 32 is used for storing a computer program, and the computer program is at least used for the multi-platform similar medical data removing method described in any of the above embodiments;
the processor 31 is used to invoke and execute the computer program in the memory.
Embodiments of the present invention also provide a storage medium based on one general inventive concept.
A storage medium storing a computer program which, when executed by a processor, performs the steps of the above-described multi-platform similar medical data removal method.
The above description is only for the specific embodiments of the present invention, but the scope of the present invention is not limited thereto, and any person skilled in the art can easily conceive of the changes or substitutions within the technical scope of the present invention, and all the changes or substitutions should be covered within the scope of the present invention. Therefore, the protection scope of the present invention shall be subject to the protection scope of the appended claims.
It is understood that the same or similar parts in the above embodiments may be mutually referred to, and the same or similar parts in other embodiments may be referred to for the content which is not described in detail in some embodiments.
It should be noted that the terms "first," "second," and the like in the description of the present invention are used for descriptive purposes only and are not to be construed as indicating or implying relative importance. Further, in the description of the present invention, the meaning of "a plurality" means at least two unless otherwise specified.
Any process or method descriptions in flow charts or otherwise described herein may be understood as representing modules, segments, or portions of code which include one or more executable instructions for implementing specific logical functions or steps of the process, and alternate implementations are included within the scope of the preferred embodiment of the present invention in which functions may be executed out of order from that shown or discussed, including substantially concurrently or in reverse order, depending on the functionality involved, as would be understood by those reasonably skilled in the art of the present invention.
It should be understood that portions of the present invention may be implemented in hardware, software, firmware, or a combination thereof. In the above embodiments, the various steps or methods may be implemented in software or firmware stored in memory and executed by a suitable instruction execution system. For example, if implemented in hardware, as in another embodiment, any one or combination of the following techniques, which are known in the art, may be used: a discrete logic circuit having a logic gate circuit for implementing a logic function on a data signal, an application specific integrated circuit having an appropriate combinational logic gate circuit, a Programmable Gate Array (PGA), a Field Programmable Gate Array (FPGA), or the like.
It will be understood by those skilled in the art that all or part of the steps carried by the method for implementing the above embodiments may be implemented by hardware related to instructions of a program, which may be stored in a computer readable storage medium, and when the program is executed, the program includes one or a combination of the steps of the method embodiments.
In addition, functional units in the embodiments of the present invention may be integrated into one processing module, or each unit may exist alone physically, or two or more units are integrated into one module. The integrated module can be realized in a hardware mode, and can also be realized in a software functional module mode. The integrated module, if implemented in the form of a software functional module and sold or used as a stand-alone product, may also be stored in a computer readable storage medium.
The storage medium mentioned above may be a read-only memory, a magnetic or optical disk, etc.
In the description herein, references to the description of the term "one embodiment," "some embodiments," "an example," "a specific example," or "some examples," etc., mean that a particular feature, structure, material, or characteristic described in connection with the embodiment or example is included in at least one embodiment or example of the invention. In this specification, the schematic representations of the terms used above do not necessarily refer to the same embodiment or example. Furthermore, the particular features, structures, materials, or characteristics described may be combined in any suitable manner in any one or more embodiments or examples.
Although embodiments of the present invention have been shown and described above, it is understood that the above embodiments are exemplary and should not be construed as limiting the present invention, and that variations, modifications, substitutions and alterations can be made to the above embodiments by those of ordinary skill in the art within the scope of the present invention.

Claims (8)

1. A multi-platform similar medical data removal method is characterized by comprising the following steps:
acquiring basic information of a patient through a plurality of target medical platforms, and establishing a patient main index of the basic information of the patient, wherein the patient main index comprises: a base patient primary index and an extended patient primary index, the patient base information comprising: natural information, identification information and service information;
the establishing of the patient primary index of the patient basic information comprises the following steps: judging whether the target medical platform is associated with biological characteristic acquisition equipment or not; if the target medical platform is not associated with biological characteristic acquisition equipment, establishing a basic patient main index of the patient basic information; if the target medical platform is associated with biological characteristic acquisition equipment, establishing an expanded patient main index of the patient basic information;
positioning diagnosis information of a single patient through the patient main index, wherein the diagnosis information comprises diagnosis and treatment information and auxiliary diagnosis and treatment information;
calculating the similarity between the diagnosis information according to the Mahalanobis distance;
and judging the similarity and the similarity threshold value, and acquiring the diagnosis information with the similarity higher than the similarity threshold value so as to enable the user to manually remove the similar diagnosis information of the patient.
2. The method of claim 1, wherein the determining whether the target medical platform is associated with a biometric acquisition device comprises: judging whether the basic information of the patient contains biological characteristic information, wherein the biological characteristic information comprises: palm vein information and fingerprint information;
if the target medical platform is not associated with a biometric acquisition device, establishing the basic patient master index of the patient basic information, including: if the patient basic information does not contain the biological characteristic information, establishing a basic patient main index of the patient basic information;
if the target medical platform is associated with a biological feature acquisition device, establishing the extended patient master index of the patient basic information, including: and if the basic information of the patient comprises the biological characteristic information, establishing an expanded main index of the basic information of the patient.
3. The method for multi-platform removal of similar medical data according to claim 1, wherein said calculating similarities between said diagnostic information according to mahalanobis distance further comprises:
judging whether a pre-constructed inverse matrix is full-rank;
and if the pre-constructed inverse matrix is not full-rank, reducing the dimension of the basic information of the patient through principal component analysis.
4. The multi-platform similar medical data removal method of claim 1, further comprising: based on the test set, the similarity threshold is determined.
5. A multi-platform similar medical data removal device, comprising: the device comprises an acquisition module, a positioning determination module, a calculation module and a judgment module;
the acquiring module is configured to acquire patient basic information through a plurality of target medical platforms and establish a patient primary index of the patient basic information, where the patient primary index includes: a base patient primary index and an extended patient primary index, the patient base information comprising: natural information, identification information and service information; the system is used for judging whether the target medical platform is associated with biological characteristic acquisition equipment or not; if the target medical platform is not associated with a biological feature acquisition device, establishing the basic patient master index of the patient basic information; if the target medical platform is associated with biological characteristic acquisition equipment, establishing the extended patient main index of the patient basic information;
the positioning determination module is used for positioning diagnosis information of a single patient through the patient main index, wherein the diagnosis information comprises diagnosis and treatment information and auxiliary diagnosis and treatment information;
the calculation module is used for calculating the similarity between the diagnosis information according to the Mahalanobis distance;
the judging module is used for judging the similarity and the size of a similarity threshold value and acquiring the diagnosis information with the similarity higher than the similarity threshold value so as to enable a user to manually remove the similar diagnosis information of the patient.
6. The multi-platform similar medical data removal device of claim 5, wherein the location determination module is configured to determine whether the patient basis information includes biometric information, the biometric information comprising: palm vein information and fingerprint information; if the patient basic information does not contain the biological characteristic information, establishing a basic patient main index of the patient basic information; and if the basic information of the patient comprises the biological characteristic information, establishing an expanded main index of the basic information of the patient.
7. The multi-platform similar medical data removal device according to claim 5, wherein the calculation module is configured to determine whether a pre-constructed inverse matrix is full rank; and if the pre-constructed inverse matrix is not full-rank, reducing the dimension of the basic information of the patient through principal component analysis.
8. A multi-platform similar medical data removal device, comprising: a processor, and a memory coupled to the processor;
the memory is used for storing a computer program at least for executing the multi-platform similar medical data removal method of any one of claims 1-4;
the processor is used for calling and executing the computer program in the memory.
CN202110248486.0A 2021-03-08 2021-03-08 Multi-platform similar medical data removing method, device and equipment Pending CN112863626A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110248486.0A CN112863626A (en) 2021-03-08 2021-03-08 Multi-platform similar medical data removing method, device and equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110248486.0A CN112863626A (en) 2021-03-08 2021-03-08 Multi-platform similar medical data removing method, device and equipment

Publications (1)

Publication Number Publication Date
CN112863626A true CN112863626A (en) 2021-05-28

Family

ID=75994510

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110248486.0A Pending CN112863626A (en) 2021-03-08 2021-03-08 Multi-platform similar medical data removing method, device and equipment

Country Status (1)

Country Link
CN (1) CN112863626A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113742348A (en) * 2021-09-07 2021-12-03 上海柯林布瑞信息技术有限公司 Patient data matching method in CDR system, main index establishing method and device
CN113821503A (en) * 2021-09-23 2021-12-21 北京金山云网络技术有限公司 Medical data processing method and device and edge server

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106845056A (en) * 2015-12-04 2017-06-13 北大医疗信息技术有限公司 The integration method and integrating apparatus of medical information
WO2018120668A1 (en) * 2016-12-31 2018-07-05 深圳市易特科信息技术有限公司 Medical big data association and storage system and method
CN109522302A (en) * 2018-11-09 2019-03-26 南京医渡云医学技术有限公司 Medical data processing method, device, electronic equipment and computer-readable medium
CN110600092A (en) * 2019-08-13 2019-12-20 万达信息股份有限公司 Method and system for generating personnel main index applied to medical field
CN111785341A (en) * 2020-06-30 2020-10-16 平安国际智慧城市科技股份有限公司 Patient main index data merging method and device based on similarity

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106845056A (en) * 2015-12-04 2017-06-13 北大医疗信息技术有限公司 The integration method and integrating apparatus of medical information
WO2018120668A1 (en) * 2016-12-31 2018-07-05 深圳市易特科信息技术有限公司 Medical big data association and storage system and method
CN109522302A (en) * 2018-11-09 2019-03-26 南京医渡云医学技术有限公司 Medical data processing method, device, electronic equipment and computer-readable medium
CN110600092A (en) * 2019-08-13 2019-12-20 万达信息股份有限公司 Method and system for generating personnel main index applied to medical field
CN111785341A (en) * 2020-06-30 2020-10-16 平安国际智慧城市科技股份有限公司 Patient main index data merging method and device based on similarity

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
耿娅琼: "基于IHE PIX 的患者主索引系统的设计与实现", 《中国优秀硕士学位论文全文数据库 信息科技辑》 *

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113742348A (en) * 2021-09-07 2021-12-03 上海柯林布瑞信息技术有限公司 Patient data matching method in CDR system, main index establishing method and device
CN113742348B (en) * 2021-09-07 2024-05-17 上海柯林布瑞信息技术有限公司 Patient data matching method in CDR system, main index establishing method and device
CN113821503A (en) * 2021-09-23 2021-12-21 北京金山云网络技术有限公司 Medical data processing method and device and edge server

Similar Documents

Publication Publication Date Title
CN110880361B (en) Personalized accurate medication recommendation method and device
CN107610779B (en) Disease evaluation and disease risk evaluation method and device
Wosiak et al. Integrating correlation-based feature selection and clustering for improved cardiovascular disease diagnosis
Al Jarullah Decision tree discovery for the diagnosis of type II diabetes
CN111710420B (en) Complication onset risk prediction method, system, terminal and storage medium based on electronic medical record big data
JP6907831B2 (en) Context-based patient similarity methods and equipment
CN104115144A (en) System, method and computer program for multi-dimensional temporal and relative data mining framework, analysis and sub-grouping
CN112863626A (en) Multi-platform similar medical data removing method, device and equipment
WO2019146357A1 (en) Medical image processing device, method, and program, and diagnosis assistance device, method, and program
CN111180024A (en) Data processing method and device based on word frequency and inverse document frequency and computer equipment
CN110752027B (en) Electronic medical record data pushing method, device, computer equipment and storage medium
CN114048343A (en) Classification platform covering medical image information of patients in whole disease course
Farid et al. Applying artificial intelligence techniques for prediction of neurodegenerative disorders: A comparative case-study on clinical tests and neuroimaging tests with Alzheimer’s Disease
WO2019062186A1 (en) Diabetes analysis method, application server and computer readable storage medium
Desai et al. Application of ensemble classifiers in accurate diagnosis of myocardial ischemia conditions
Zhao et al. A general framework for medical data mining
CN112908493A (en) Online inquiry processing method, device, medium and terminal equipment
CN109522331B (en) Individual-centered regionalized multi-dimensional health data processing method and medium
CN112102351A (en) Medical image analysis method and device, electronic equipment and readable storage medium
JP2003024281A (en) Apparatus for detecting incentive of disease developing in seizure state
CN115631823A (en) Similar case recommendation method and system
CN115171830A (en) Patient data-based service package generation method, device, equipment and storage medium
CN112101021A (en) Method, device and equipment for realizing standard word mapping
CN111986815A (en) Project combination mining method based on co-occurrence relation and related equipment
CN111667922A (en) Clinical diagnosis and treatment data entry system and method

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20210528