CN110289058A - A kind of electronic health record standardization matching process and device - Google Patents

A kind of electronic health record standardization matching process and device Download PDF

Info

Publication number
CN110289058A
CN110289058A CN201910489480.5A CN201910489480A CN110289058A CN 110289058 A CN110289058 A CN 110289058A CN 201910489480 A CN201910489480 A CN 201910489480A CN 110289058 A CN110289058 A CN 110289058A
Authority
CN
China
Prior art keywords
data
drug
health record
electronic health
dictionary library
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910489480.5A
Other languages
Chinese (zh)
Inventor
段建平
袁明明
王炳亮
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing MetarNet Technologies Co Ltd
Original Assignee
Beijing MetarNet Technologies Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing MetarNet Technologies Co Ltd filed Critical Beijing MetarNet Technologies Co Ltd
Priority to CN201910489480.5A priority Critical patent/CN110289058A/en
Publication of CN110289058A publication Critical patent/CN110289058A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35Clustering; Classification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/36Creation of semantic tools, e.g. ontology or thesauri
    • G06F16/374Thesaurus
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H10/00ICT specially adapted for the handling or processing of patient-related medical or healthcare data
    • G16H10/60ICT specially adapted for the handling or processing of patient-related medical or healthcare data for patient-specific data, e.g. for electronic patient records
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H50/00ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
    • G16H50/70ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for mining of medical data, e.g. analysing previous cases of other patients

Landscapes

  • Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Theoretical Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Databases & Information Systems (AREA)
  • Public Health (AREA)
  • Medical Informatics (AREA)
  • General Engineering & Computer Science (AREA)
  • Epidemiology (AREA)
  • General Health & Medical Sciences (AREA)
  • General Physics & Mathematics (AREA)
  • Primary Health Care (AREA)
  • Physics & Mathematics (AREA)
  • Biomedical Technology (AREA)
  • Pathology (AREA)
  • Computational Linguistics (AREA)
  • Medical Treatment And Welfare Office Work (AREA)

Abstract

The disclosure is directed to a kind of electronic health record standardization matching process, device, electronic equipment and computer readable storage mediums.Wherein, this method comprises: establishing the dictionary library comprising standard pharmaceutical data based on the total office data of state food pharmaceuticals administration;According to preset algorithm, unique identification coding is generated to the drug data in the dictionary library;According to preset rules, data mining is carried out to the electric power case history of nonstandardized technique, is acquired in the electronic health record to specification drug data;By the standard pharmaceutical date comprision in specification drug data and the dictionary library, the matching to specification drug data and the standard pharmaceutical data is completed.The standardization processing of disclosure electronic health record drug data realizes information resources share, data exchange, statistical analysis between different departments, reduces supervision cost.

Description

A kind of electronic health record standardization matching process and device
Technical field
This disclosure relates to electronic data algorithm field, in particular to a kind of electronic health record standardization matching process, dress It sets, electronic equipment and computer readable storage medium.
Background technique
Current National is difficult to reality due to lacking unified drug construction criteria, each province and city medical procurement information early period at this stage It now interconnects and resource-sharing, the policy of country and each province and city is pushed, effect is implemented in information-based supervision and drug control system Fruit assessment is very unfavorable.Currently, going back situations such as implementing in drug data acquisition, data cleansing, data analysis and relevant policies There are some problems, outstanding behaviours is in no a set of unified drug construction criteria (being coding standard in the present invention).At For the bottleneck for restricting pharmaceuticals industry informatization and development.
The HIS system of hospital (carried out the calculating of information management and on-line operation in recent years in hospital management and curative activity Machine application system), the ERP system (Enterprise Resources Plan) of pharmaceutical manufacturer had the development advanced by leaps and bounds, the level of IT application It is relatively high.But as the development of project and in-depth study find that there is also some for current pharmaceuticals industry informatization Urgent problem to be solved, outstanding behaviours are lacking classifying drugs standard that adapt to pharmaceuticals industry development need, that industry is unified. HIS system, the ERP system of enterprise of most of hospital are done things in his own way, data standard disunity, be cannot achieve data exchange, are total to It enjoys, causes a large amount of repeated construction and the wasting of resources.
From the above, it can be seen that, it is desirable to provide one or more technical solutions for being at least able to solve the above problem.
It should be noted that information is only used for reinforcing the reason to the background of the disclosure disclosed in above-mentioned background technology part Solution, therefore may include the information not constituted to the prior art known to persons of ordinary skill in the art.
Summary of the invention
The disclosure is designed to provide a kind of electronic health record standardization matching process, device, electronic equipment and calculating Machine readable storage medium storing program for executing, so overcome caused by the limitation and defect due to the relevant technologies at least to a certain extent one or The multiple problems of person.
According to one aspect of the disclosure, a kind of electronic health record standardization matching process is provided, comprising:
Dictionary library establishment step is established based on the total office data of state food pharmaceuticals administration comprising standard pharmaceutical data Dictionary library;
Drug data coding step generates unique identification to the drug data in the dictionary library and compiles according to preset algorithm Code;
Electronic health record data collection steps carry out data mining to the electric power case history of nonstandardized technique, adopt according to preset rules Collect in the electronic health record to specification drug data;
Electronic health record Data Matching step, by the standard pharmaceutical data in specification drug data and the dictionary library The matching to specification drug data and the standard pharmaceutical data is completed in comparative analysis.
In a kind of exemplary embodiment of the disclosure, the dictionary library establishment step further include:
Standard pharmaceutical data in the dictionary library include Domestic Drugs product character string, import drugs character string, pharmaceutical production Enterprise's character string, drug distributor's character string, GMP authentication string, GSP authentication string, the character string is according to default Rule splicing.
In a kind of exemplary embodiment of the disclosure, after the character string is spliced according to preset rules, by editor away from Calculate whether the drug data is newly-increased data from algorithm, if so, in deposit dictionary library.
In a kind of exemplary embodiment of the disclosure, the drug data coding step further include:
It is with " People's Republic of China (PRC) health industry standard YY0252---1997 chemicals (raw material, preparation) standard " Basic coding defers to the principle of one yard of an object, to the drug of certain multiple uses, only assigns a unique code.
In a kind of exemplary embodiment of the disclosure, the drug data coding step further include:
Volume is further extended according to ATC sorting code number, chemicals and biological products level Four sorting code number table relationship Code;
Specified value is encoded to level code structure, totally 9 layers of 20 composition, with every group 12,16,18,20 numbers not etc. Font formula exists, wherein the personal characteristics of drug representated by medicine coding include the classification of drug, title, dosage form, preparation specification, Packing specification.
In a kind of exemplary embodiment of the disclosure, the electronic health record data collection steps further include to unstructured The mining analysis of free text data:
Free text mining analysis is realized by the medicine text feature library and terminology bank analysis engine of establishing clinical document, It is updated by borrowing-word lexicon iteration, specification electronic health record medical terminology, realizes key vocabularies classification, semantic context point Analysis, map analysis, and then reach the expected structure standard of data;
Pass through the general data of the clinical element model of analogy SHARPn, the generic data model of PCORI, OMOP/OHDSI Model reaches the structuring of drug data.
In a kind of exemplary embodiment of the disclosure, the electronic health record Data Matching step further include:
By intelligent Compare System, by the standard pharmaceutical data comparison in specification drug data and the dictionary library Analysis obtains the similarity to specification drug data and the standard pharmaceutical data, and then completes described to specification drug The matching of data and the standard pharmaceutical data.
In one aspect of the present disclosure, a kind of electronic health record standardization coalignment is provided, comprising:
Dictionary library establishes module, includes standard pharmaceutical number for being established based on the total office data of state food pharmaceuticals administration According to dictionary library;
Drug data coding module, for generating unique mark to the drug data in the dictionary library according to preset algorithm Know coding;
Electronic health record data acquisition module, for carrying out data digging to the electric power case history of nonstandardized technique according to preset rules Pick, acquires in the electronic health record to specification drug data;
Electronic health record data match module, for by the standard pharmaceutical in specification drug data and the dictionary library Date comprision completes the matching to specification drug data and the standard pharmaceutical data.
In one aspect of the present disclosure, a kind of electronic equipment is provided, comprising:
Processor;And
Memory is stored with computer-readable instruction on the memory, and the computer-readable instruction is by the processing The method according to above-mentioned any one is realized when device executes.
In another aspect of the disclosure, a kind of computer readable storage medium is provided, computer program is stored thereon with, The method according to above-mentioned any one is realized when the computer program is executed by processor.
Electronic health record standardization matching process in the exemplary embodiment of the disclosure, is based on state food drug surveilance pipe It manages total office data and establishes the dictionary library comprising standard pharmaceutical data;According to preset algorithm, to the drug data in the dictionary library Generate unique identification coding;According to preset rules, data mining is carried out to the electric power case history of nonstandardized technique, acquires the electronics disease To specification drug data in going through;By the standard pharmaceutical date comprision in specification drug data and the dictionary library, Complete the matching to specification drug data and the standard pharmaceutical data.On the one hand, disclosure normal dictionary library data are come Data on regular crawl CFDA, can ensure the accuracy and authority of data in a certain range, establish unified number Drug data in the institute of each hospital is subjected to unified specification according to standard;On the other hand, the disclosure is to ensure that doctor's advice information is read The basis take, analyzed, reinforcement basis for IT application work, promotes infrastructure, promotes information sharing, is drug application, audit, scoops up It closes, trade offer conveniently, more enough information resources shares realized between different authorities, data exchange, statistical analysis reduce Supervise cost.
It should be understood that above general description and following detailed description be only it is exemplary and explanatory, not The disclosure can be limited.
Detailed description of the invention
Its example embodiment is described in detail by referring to accompanying drawing, the above and other feature and advantage of the disclosure will become It is more obvious.
Fig. 1 shows the flow chart according to the electronic health record of one exemplary embodiment of disclosure standardization matching process;
Fig. 2 shows standardized the schematic block diagram of coalignment according to the electronic health record of one exemplary embodiment of the disclosure;
Fig. 3 diagrammatically illustrates the block diagram of the electronic equipment according to one exemplary embodiment of the disclosure;And
Fig. 4 diagrammatically illustrates the schematic diagram of the computer readable storage medium according to one exemplary embodiment of the disclosure.
Specific embodiment
Example embodiment is described more fully with reference to the drawings.However, example embodiment can be real in a variety of forms It applies, and is not understood as limited to embodiment set forth herein;On the contrary, thesing embodiments are provided so that the disclosure will be comprehensively and complete It is whole, and the design of example embodiment is comprehensively communicated to those skilled in the art.Identical appended drawing reference indicates in figure Same or similar part, thus repetition thereof will be omitted.
In addition, described feature, structure or characteristic can be incorporated in one or more implementations in any suitable manner In example.In the following description, many details are provided to provide and fully understand to embodiment of the disclosure.However, It will be appreciated by persons skilled in the art that can be with technical solution of the disclosure without one in the specific detail or more It is more, or can be using other methods, constituent element, material, device, step etc..In other cases, it is not shown in detail or describes Known features, method, apparatus, realization, material or operation are to avoid fuzzy all aspects of this disclosure.
Block diagram shown in the drawings is only functional entity, not necessarily must be corresponding with physically separate entity. I.e., it is possible to realize these functional entitys using software form, or these are realized in the module of one or more softwares hardening A part of functional entity or functional entity, or realized in heterogeneous networks and/or processor device and/or microcontroller device These functional entitys.
In this exemplary embodiment, a kind of electronic health record standardization matching process is provided firstly;With reference to shown in Fig. 1, Electronic health record standardization matching process may comprise steps of:
Dictionary library establishment step S110, being established based on the total office data of state food pharmaceuticals administration includes standard pharmaceutical number According to dictionary library;
Drug data coding step S120 generates unique mark to the drug data in the dictionary library according to preset algorithm Know coding;
Electronic health record data collection steps S130 carries out data digging to the electric power case history of nonstandardized technique according to preset rules Pick, acquires in the electronic health record to specification drug data;
Electronic health record Data Matching step S140, by the standard pharmaceutical in specification drug data and the dictionary library Date comprision completes the matching to specification drug data and the standard pharmaceutical data.
On the one hand, disclosure normal dictionary library data source, can be in a certain range in the data on periodically crawl CFDA The accuracy and authority for ensureing data, establish what unified data standard was unified drug data in the institute of each hospital Specification;On the other hand, the disclosure is the basis for ensureing doctor's advice information and reading, analyzing, and reinforces basis for IT application work, promotes basis Construction promotes information sharing, for drug application, audit, bring together, trade provides convenience, between the different authorities of more enough realizations Information resources share, data exchange, statistical analysis, reduce supervision cost.
In the following, by the electronic health record standardization matching process in this example embodiment is further detailed.
In dictionary library establishment step S110, it can be established based on the total office data of state food pharmaceuticals administration comprising mark The dictionary library of quasi drug data.
In this exemplary embodiment, the dictionary library establishment step further include:
Standard pharmaceutical data in the dictionary library include Domestic Drugs product character string, import drugs character string, pharmaceutical production Enterprise's character string, drug distributor's character string, GMP authentication string, GSP authentication string, the character string is according to default Rule splicing.
In this exemplary embodiment, after the character string is spliced according to preset rules, pass through editing distance algorithm (Levenshtein Distance) calculates whether the drug data is newly-increased data, if so, in deposit dictionary library, no It is disconnected abundant, improve the data in normal dictionary library.
It, can be raw to the drug data in the dictionary library according to preset algorithm in drug data coding step S120 It is encoded at unique identification.
In this exemplary embodiment, the drug data coding step further include:
It is with " People's Republic of China (PRC) health industry standard YY0252---1997 chemicals (raw material, preparation) standard " Basic coding defers to the principle of one yard of an object, to the drug of certain multiple uses, only assigns a unique code.
In this exemplary embodiment, the drug data coding step further include:
Volume is further extended according to ATC sorting code number, chemicals and biological products level Four sorting code number table relationship Code;
Specified value is encoded to level code structure, totally 9 layers of 20 composition, with every group 12,16,18,20 numbers not etc. Font formula exists, wherein the personal characteristics of drug representated by medicine coding include the classification of drug, title, dosage form, preparation specification, Packing specification.
In electronic health record data collection steps S130, can according to preset rules, to the electric power case history of nonstandardized technique into Row data mining acquires in the electronic health record to specification drug data.
In this exemplary embodiment, since the information on EMR (Electronic Medical Record) is mostly with text This form storage, and the information of text description usually has ambiguity and many nonstandardized techniques description, need by text mining and Some medical data models are converted into structural data.
In this exemplary embodiment, the electronic health record data collection steps further include to unstructured free textual data According to mining analysis:
Free text mining analysis is realized by the medicine text feature library and terminology bank analysis engine of establishing clinical document, It is updated by borrowing-word lexicon iteration, specification electronic health record medical terminology, realizes key vocabularies classification, semantic context point Analysis, map analysis, and then reach the expected structure standard of data;
Pass through the general data of the clinical element model of analogy SHARPn, the generic data model of PCORI, OMOP/OHDSI Model reaches the structuring of drug data.
In this exemplary embodiment, the structuring that some more famous models reach drug data: SHARPn is used for reference The data normalization process (having used " clinical element model (Clinical Element Model) ") of project development, PCORI " the general data mould that " generic data model (the PCORNET common data model) " proposed, OMOP/OHDSI are proposed Type (OMOP common data model) ".
It, can be by described in specification drug data and the dictionary library in electronic health record Data Matching step S140 Standard pharmaceutical date comprision completes the matching to specification drug data and the standard pharmaceutical data.
In this exemplary embodiment, the electronic health record Data Matching step further include:
By intelligent Compare System, by the standard pharmaceutical data comparison in specification drug data and the dictionary library Analysis obtains the similarity to specification drug data and the standard pharmaceutical data, and then completes described to specification drug The matching of data and the standard pharmaceutical data.
In this exemplary embodiment, normal dictionary library data source, can be certain in the data on periodically crawl CFDA The accuracy and authority that data are ensured in range, establish unified data standard and carry out drug data in the institute of each hospital Unified specification.This method is the basis for ensureing doctor's advice information and reading, analyzing, and reinforces basis for IT application work, basis is promoted to build If promote information sharing, for drug application, audit, bring together, trade provide convenience;Realize the information between different authorities Resource-sharing, data exchange, statistical analysis reduce supervision cost.
It should be noted that although describing each step of method in the disclosure in the accompanying drawings with particular order, This does not require that or implies must execute these steps in this particular order, or have to carry out step shown in whole Just it is able to achieve desired result.Additional or alternative, it is convenient to omit multiple steps are merged into a step and held by certain steps Row, and/or a step is decomposed into execution of multiple steps etc..
In addition, in this exemplary embodiment, additionally providing a kind of electronic health record standardization coalignment.Referring to shown in Fig. 2, Electronic health record standardization coalignment 200 may include: that dictionary library establishes module 210, drug data coding module 220, electricity Sub- medical record data acquisition module 230 and electronic health record data match module 240.Wherein:
Dictionary library establishes module 210, includes standard medicine for being established based on the total office data of state food pharmaceuticals administration The dictionary library of product data;
Drug data coding module 220, for being generated to the drug data in the dictionary library unique according to preset algorithm Identification code;
Electronic health record data acquisition module 230, for carrying out data to the electric power case history of nonstandardized technique according to preset rules It excavates, acquires in the electronic health record to specification drug data;
Electronic health record data match module 240, for by the standard in specification drug data and the dictionary library The matching to specification drug data and the standard pharmaceutical data is completed in drug data comparative analysis.
The detail of each electronic health record standardization coalignment module is in corresponding electronic health record specification among the above Change and be described in detail in matching process, therefore details are not described herein again.
It should be noted that although being referred to several moulds of electronic health record standardization coalignment 200 in the above detailed description Block or unit, but this division is not enforceable.In fact, according to embodiment of the present disclosure, above-described two A or more module or the feature and function of unit can embody in a module or unit.Conversely, above description A module or unit feature and function can with further division be embodied by multiple modules or unit.
In addition, in an exemplary embodiment of the disclosure, additionally providing a kind of electronic equipment that can be realized the above method.
Person of ordinary skill in the field it is understood that various aspects of the invention can be implemented as system, method or Program product.Therefore, various aspects of the invention can be embodied in the following forms, it may be assumed that complete hardware embodiment, completely Software implementation (including firmware, microcode etc.) or hardware and software in terms of combine embodiment, may be collectively referred to as here Circuit, " module " or " system ".
The electronic equipment 300 of this embodiment according to the present invention is described referring to Fig. 3.The electronics that Fig. 3 is shown is set Standby 300 be only an example, should not function to the embodiment of the present invention and use scope bring any restrictions.
As shown in figure 3, electronic equipment 300 is showed in the form of universal computing device.The component of electronic equipment 300 can wrap It includes but is not limited to: at least one above-mentioned processing unit 310, at least one above-mentioned storage unit 320, the different system components of connection The bus 330 of (including storage unit 320 and processing unit 310), display unit 340.
Wherein, the storage unit is stored with program code, and said program code can be held by the processing unit 310 Row, so that various according to the present invention described in the execution of the processing unit 310 above-mentioned " illustrative methods " part of this specification The step of exemplary embodiment.For example, the processing unit 310 can execute step S110 as shown in fig. 1 to step S140。
Storage unit 320 may include the readable medium of volatile memory cell form, such as Random Access Storage Unit (RAM) 3201 and/or cache memory unit 3202, it can further include read-only memory unit (ROM) 3203.
Storage unit 320 can also include program/utility with one group of (at least one) program module 3205 3204, such program module 3205 includes but is not limited to: operating system, one or more application program, other program moulds It may include the realization of network environment in block and program data, each of these examples or certain combination.
Bus 330 can be to indicate one of a few class bus structures or a variety of, including storage unit bus or storage Cell controller, peripheral bus, graphics acceleration port, processing unit use any bus structures in a variety of bus structures Local bus.
Electronic equipment 300 can also be with one or more external equipments 370 (such as keyboard, sensing equipment, bluetooth equipment Deng) communication, can also be enabled a user to one or more equipment interact with the electronic equipment 300 communicate, and/or with make Any equipment (such as the router, modulation /demodulation that the electronic equipment 300 can be communicated with one or more of the other calculating equipment Device etc.) communication.This communication can be carried out by input/output (I/O) interface 350.Also, electronic equipment 300 can be with By network adapter 360 and one or more network (such as local area network (LAN), wide area network (WAN) and/or public network, Such as internet) communication.As shown, network adapter 360 is communicated by bus 330 with other modules of electronic equipment 300. It should be understood that although not shown in the drawings, other hardware and/or software module can not used in conjunction with electronic equipment 300, including but not Be limited to: microcode, device driver, redundant processing unit, external disk drive array, RAID system, tape drive and Data backup storage system etc..
By the description of above embodiment, those skilled in the art is it can be readily appreciated that example embodiment described herein It can also be realized in such a way that software is in conjunction with necessary hardware by software realization.Therefore, implemented according to the disclosure The technical solution of example can be embodied in the form of software products, which can store in a non-volatile memories In medium (can be CD-ROM, USB flash disk, mobile hard disk etc.) or on network, including some instructions are so that a calculating equipment (can To be personal computer, server, terminal installation or network equipment etc.) it executes according to the method for the embodiment of the present disclosure.
In an exemplary embodiment of the disclosure, a kind of computer readable storage medium is additionally provided, energy is stored thereon with Enough realize the program product of this specification above method.In some possible embodiments, various aspects of the invention can be with It is embodied as a kind of form of program product comprising program code, it is described when described program product is run on the terminal device Program code is for executing the terminal device described in above-mentioned " illustrative methods " part of this specification according to the present invention The step of various exemplary embodiments.
Refering to what is shown in Fig. 4, the program product 400 for realizing the above method of embodiment according to the present invention is described, It can using portable compact disc read only memory (CD-ROM) and including program code, and can in terminal device, such as It is run on PC.However, program product of the invention is without being limited thereto, in this document, readable storage medium storing program for executing, which can be, appoints What include or the tangible medium of storage program that the program can be commanded execution system, device or device use or and its It is used in combination.
Described program product can be using any combination of one or more readable mediums.Readable medium can be readable letter Number medium or readable storage medium storing program for executing.Readable storage medium storing program for executing for example can be but be not limited to electricity, magnetic, optical, electromagnetic, infrared ray or System, device or the device of semiconductor, or any above combination.The more specific example of readable storage medium storing program for executing is (non exhaustive List) include: electrical connection with one or more conducting wires, portable disc, hard disk, random access memory (RAM), read-only Memory (ROM), erasable programmable read only memory (EPROM or flash memory), optical fiber, portable compact disc read only memory (CD-ROM), light storage device, magnetic memory device or above-mentioned any appropriate combination.
Computer-readable signal media may include in a base band or as carrier wave a part propagate data-signal, In carry readable program code.The data-signal of this propagation can take various forms, including but not limited to electromagnetic signal, Optical signal or above-mentioned any appropriate combination.Readable signal medium can also be any readable Jie other than readable storage medium storing program for executing Matter, the readable medium can send, propagate or transmit for by instruction execution system, device or device use or and its The program of combined use.
The program code for including on readable medium can transmit with any suitable medium, including but not limited to wirelessly, have Line, optical cable, RF etc. or above-mentioned any appropriate combination.
The program for executing operation of the present invention can be write with any combination of one or more programming languages Code, described program design language include object oriented program language-Java, C++ etc., further include conventional Procedural programming language-such as " C " language or similar programming language.Program code can be fully in user It calculates and executes in equipment, partly executes on a user device, being executed as an independent software package, partially in user's calculating Upper side point is executed on a remote computing or is executed in remote computing device or server completely.It is being related to far Journey calculates in the situation of equipment, and remote computing device can pass through the network of any kind, including local area network (LAN) or wide area network (WAN), it is connected to user calculating equipment, or, it may be connected to external computing device (such as utilize ISP To be connected by internet).
In addition, above-mentioned attached drawing is only the schematic theory of processing included by method according to an exemplary embodiment of the present invention It is bright, rather than limit purpose.It can be readily appreciated that the time that above-mentioned processing shown in the drawings did not indicated or limited these processing is suitable Sequence.In addition, be also easy to understand, these processing, which can be, for example either synchronously or asynchronously to be executed in multiple modules.
Those skilled in the art after considering the specification and implementing the invention disclosed here, will readily occur to its of the disclosure His embodiment.This application is intended to cover any variations, uses, or adaptations of the disclosure, these modifications, purposes or Adaptive change follow the general principles of this disclosure and including the undocumented common knowledge in the art of the disclosure or Conventional techniques.The description and examples are only to be considered as illustrative, and the true scope and spirit of the disclosure are by claim It points out.
It should be understood that the present disclosure is not limited to the precise structures that have been described above and shown in the drawings, and And various modifications and changes may be made without departing from the scope thereof.The scope of the present disclosure is only limited by the attached claims.

Claims (10)

  1. The matching process 1. a kind of electronic health record standardizes, which is characterized in that the described method includes:
    Dictionary library establishment step establishes the dictionary comprising standard pharmaceutical data based on the total office data of state food pharmaceuticals administration Library;
    Drug data coding step generates unique identification coding to the drug data in the dictionary library according to preset algorithm;
    Electronic health record data collection steps carry out data mining to the electric power case history of nonstandardized technique, acquire institute according to preset rules It states in electronic health record to specification drug data;
    Electronic health record Data Matching step, by the standard pharmaceutical data comparison in specification drug data and the dictionary library The matching to specification drug data and the standard pharmaceutical data is completed in analysis.
  2. 2. the method as described in claim 1, which is characterized in that the dictionary library establishment step further include:
    Standard pharmaceutical data in the dictionary library include Domestic Drugs product character string, import drugs character string, pharmaceutical producing enterprise Character string, drug distributor's character string, GMP authentication string, GSP authentication string, the character string is according to preset rules Splicing.
  3. 3. method according to claim 2, which is characterized in that after the character string is spliced according to preset rules, pass through editor Distance algorithm calculates whether the drug data is newly-increased data, if so, in deposit dictionary library.
  4. 4. the method as described in claim 1, which is characterized in that the drug data coding step further include:
    Based on " People's Republic of China (PRC) health industry standard YY0252---1997 chemicals (raw material, preparation) standard " Coding, defers to the principle of one yard of an object, to the drug of certain multiple uses, only assigns a unique code.
  5. 5. method as claimed in claim 4, which is characterized in that the drug data coding step further include:
    Coding is further extended according to ATC sorting code number, chemicals and biological products level Four sorting code number table relationship;
    Specified value is encoded to level code structure, totally 9 layers of 20 composition, with every group 12,16,18,20 digital shapes not etc. Formula exists, and wherein the personal characteristics of drug representated by medicine coding includes the classification of drug, title, dosage form, preparation specification, packaging Specification.
  6. 6. the method as described in claim 1, which is characterized in that the electronic health record data collection steps further include to non-structural Change the mining analysis of free text data:
    Free text mining analysis is realized by the medicine text feature library and terminology bank analysis engine of establishing clinical document, is passed through Borrowing-word lexicon iteration updates, specification electronic health record medical terminology, realization key vocabularies are classified, semantic context is analyzed, Map analysis, and then reach the expected structure standard of data;
    By the clinical element model of analogy SHARPn, the generic data model of PCORI, OMOP/OHDSI generic data model Reach the structuring of drug data.
  7. 7. the method as described in claim 1, which is characterized in that the electronic health record Data Matching step further include:
    By intelligent Compare System, by the standard pharmaceutical data comparison point in specification drug data and the dictionary library Analysis obtains the similarity to specification drug data and the standard pharmaceutical data, and then completes described to specification drug number According to the matching with the standard pharmaceutical data.
  8. The coalignment 8. a kind of electronic health record standardizes, which is characterized in that described device includes:
    Dictionary library establishes module, for being established based on the total office data of state food pharmaceuticals administration comprising standard pharmaceutical data Dictionary library;
    Drug data coding module, for generating unique identification to the drug data in the dictionary library and compiling according to preset algorithm Code;
    Electronic health record data acquisition module, for carrying out data mining to the electric power case history of nonstandardized technique, adopting according to preset rules Collect in the electronic health record to specification drug data;
    Electronic health record data match module, for by the standard pharmaceutical data in specification drug data and the dictionary library The matching to specification drug data and the standard pharmaceutical data is completed in comparative analysis.
  9. 9. a kind of electronic equipment, which is characterized in that including
    Processor;And
    Memory is stored with computer-readable instruction on the memory, and the computer-readable instruction is held by the processor Method according to any one of claim 1 to 7 is realized when row.
  10. 10. a kind of computer readable storage medium, is stored thereon with computer program, the computer program is executed by processor Shi Shixian is according to claim 1 to any one of 7 the methods.
CN201910489480.5A 2019-06-06 2019-06-06 A kind of electronic health record standardization matching process and device Pending CN110289058A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910489480.5A CN110289058A (en) 2019-06-06 2019-06-06 A kind of electronic health record standardization matching process and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910489480.5A CN110289058A (en) 2019-06-06 2019-06-06 A kind of electronic health record standardization matching process and device

Publications (1)

Publication Number Publication Date
CN110289058A true CN110289058A (en) 2019-09-27

Family

ID=68003445

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910489480.5A Pending CN110289058A (en) 2019-06-06 2019-06-06 A kind of electronic health record standardization matching process and device

Country Status (1)

Country Link
CN (1) CN110289058A (en)

Cited By (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110727710A (en) * 2019-10-12 2020-01-24 平安医疗健康管理股份有限公司 Data analysis method and device, computer equipment and storage medium
CN111125076A (en) * 2019-12-17 2020-05-08 武汉海云健康科技股份有限公司 Big data based medicine universal name cleaning method and system, server and medium
CN111161817A (en) * 2019-12-31 2020-05-15 医渡云(北京)技术有限公司 Medical data standardization processing method, device, medium and electronic equipment
CN111180087A (en) * 2020-01-02 2020-05-19 中国中医科学院中医药信息研究所 Marketing medicine information standardization method, equipment, server and storage medium
CN111400296A (en) * 2020-03-16 2020-07-10 北京大学深圳医院 Kidney pathology immunofluorescence data processing method and device and related equipment
CN111625542A (en) * 2020-05-25 2020-09-04 泰康保险集团股份有限公司 Allergy information database establishing method and device, storage medium and electronic equipment
CN111933244A (en) * 2020-08-17 2020-11-13 医渡云(北京)技术有限公司 Medicine data encoding method and device, computer readable medium and electronic equipment
CN111986754A (en) * 2020-08-21 2020-11-24 南通大学 Electronic medical record management model construction method based on diabetes
CN112527970A (en) * 2020-12-24 2021-03-19 上海浦东发展银行股份有限公司 Data dictionary standardization processing method, device, equipment and storage medium
CN112925819A (en) * 2020-12-21 2021-06-08 上海药慧信息技术有限公司 Method and device for mining bid winning information of medicines
CN113130038A (en) * 2021-04-30 2021-07-16 康键信息技术(深圳)有限公司 Medicine data matching method, device, equipment and storage medium
CN114461866A (en) * 2022-03-23 2022-05-10 百芯智能制造科技(深圳)有限公司 Data normalization processing method and electronic equipment
CN116453637A (en) * 2023-03-20 2023-07-18 杭州市卫生健康事业发展中心 Health data management method and system based on regional big data
CN117763129A (en) * 2024-02-22 2024-03-26 神州医疗科技股份有限公司 medical record retrieval system based on generated pre-training model
CN111523309B (en) * 2020-04-17 2024-07-16 北京懿医云科技有限公司 Drug information normalization method and device, storage medium and electronic equipment

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103678435A (en) * 2013-07-08 2014-03-26 重庆绿色智能技术研究院 Drug specification data similarity matching method
CN104268137A (en) * 2013-07-31 2015-01-07 深圳市华傲数据技术有限公司 Method and device for matching pharmaceutical name data
CN105005683A (en) * 2015-06-17 2015-10-28 北京锐易特软件技术有限公司 Caching system and method for solving data normalization problem of regional medical system
CN106383853A (en) * 2016-08-30 2017-02-08 刘勇 Realization method and system for electronic medical record post-structuring and auxiliary diagnosis
CN106777165A (en) * 2016-12-21 2017-05-31 广东技术师范学院 A kind of medicine information base construction method based on web crawlers
CN107480425A (en) * 2017-07-14 2017-12-15 广东医睦科技有限公司 A kind of medicine information processing method based on medicine coding
CN107784611A (en) * 2017-04-11 2018-03-09 平安医疗健康管理股份有限公司 medicine coding method and device
CN108538395A (en) * 2018-04-02 2018-09-14 上海市儿童医院 A kind of construction method of general medical disease that calls for specialized treatment data system

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103678435A (en) * 2013-07-08 2014-03-26 重庆绿色智能技术研究院 Drug specification data similarity matching method
CN104268137A (en) * 2013-07-31 2015-01-07 深圳市华傲数据技术有限公司 Method and device for matching pharmaceutical name data
CN105005683A (en) * 2015-06-17 2015-10-28 北京锐易特软件技术有限公司 Caching system and method for solving data normalization problem of regional medical system
CN106383853A (en) * 2016-08-30 2017-02-08 刘勇 Realization method and system for electronic medical record post-structuring and auxiliary diagnosis
CN106777165A (en) * 2016-12-21 2017-05-31 广东技术师范学院 A kind of medicine information base construction method based on web crawlers
CN107784611A (en) * 2017-04-11 2018-03-09 平安医疗健康管理股份有限公司 medicine coding method and device
CN107480425A (en) * 2017-07-14 2017-12-15 广东医睦科技有限公司 A kind of medicine information processing method based on medicine coding
CN108538395A (en) * 2018-04-02 2018-09-14 上海市儿童医院 A kind of construction method of general medical disease that calls for specialized treatment data system

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
王若佳 等: "中文电子病历的分词及实体识别研究", 《图书情报工作》 *

Cited By (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110727710B (en) * 2019-10-12 2023-02-07 平安医疗健康管理股份有限公司 Data analysis method and device, computer equipment and storage medium
CN110727710A (en) * 2019-10-12 2020-01-24 平安医疗健康管理股份有限公司 Data analysis method and device, computer equipment and storage medium
CN111125076A (en) * 2019-12-17 2020-05-08 武汉海云健康科技股份有限公司 Big data based medicine universal name cleaning method and system, server and medium
CN111161817A (en) * 2019-12-31 2020-05-15 医渡云(北京)技术有限公司 Medical data standardization processing method, device, medium and electronic equipment
CN111161817B (en) * 2019-12-31 2023-09-19 医渡云(北京)技术有限公司 Medical data standardized processing method, device, medium and electronic equipment
CN111180087A (en) * 2020-01-02 2020-05-19 中国中医科学院中医药信息研究所 Marketing medicine information standardization method, equipment, server and storage medium
CN111400296A (en) * 2020-03-16 2020-07-10 北京大学深圳医院 Kidney pathology immunofluorescence data processing method and device and related equipment
CN111523309B (en) * 2020-04-17 2024-07-16 北京懿医云科技有限公司 Drug information normalization method and device, storage medium and electronic equipment
CN111625542A (en) * 2020-05-25 2020-09-04 泰康保险集团股份有限公司 Allergy information database establishing method and device, storage medium and electronic equipment
CN111933244A (en) * 2020-08-17 2020-11-13 医渡云(北京)技术有限公司 Medicine data encoding method and device, computer readable medium and electronic equipment
CN111986754A (en) * 2020-08-21 2020-11-24 南通大学 Electronic medical record management model construction method based on diabetes
CN112925819A (en) * 2020-12-21 2021-06-08 上海药慧信息技术有限公司 Method and device for mining bid winning information of medicines
CN112925819B (en) * 2020-12-21 2023-05-19 上海药慧信息技术有限公司 Method and device for mining bid-winning information of medicine
CN112527970A (en) * 2020-12-24 2021-03-19 上海浦东发展银行股份有限公司 Data dictionary standardization processing method, device, equipment and storage medium
CN113130038A (en) * 2021-04-30 2021-07-16 康键信息技术(深圳)有限公司 Medicine data matching method, device, equipment and storage medium
CN114461866A (en) * 2022-03-23 2022-05-10 百芯智能制造科技(深圳)有限公司 Data normalization processing method and electronic equipment
CN116453637A (en) * 2023-03-20 2023-07-18 杭州市卫生健康事业发展中心 Health data management method and system based on regional big data
CN116453637B (en) * 2023-03-20 2023-11-07 杭州市卫生健康事业发展中心 Health data management method and system based on regional big data
CN117763129A (en) * 2024-02-22 2024-03-26 神州医疗科技股份有限公司 medical record retrieval system based on generated pre-training model
CN117763129B (en) * 2024-02-22 2024-05-28 神州医疗科技股份有限公司 Medical record retrieval method and system based on generated pre-training model

Similar Documents

Publication Publication Date Title
CN110289058A (en) A kind of electronic health record standardization matching process and device
Wang et al. A survey on knowledge graph embeddings for link prediction
Shi et al. A survey of data semantization in internet of things
CA3046247C (en) Data platform for automated data extraction, transformation, and/or loading
Chen et al. Data mining for the internet of things: literature review and challenges
US11232365B2 (en) Digital assistant platform
Winfield et al. On formal specification of emergent behaviours in swarm robotic systems
Wu et al. Towards a semantic web of things: a hybrid semantic annotation, extraction, and reasoning framework for cyber-physical system
CN104794151A (en) Spatial knowledge service system building method based on collaborative plotting technology
US12093253B2 (en) Summarized logical forms based on abstract meaning representation and discourse trees
Vieira et al. Towards resilient and sustainable rail and road networks: A systematic literature review on digital twins
US12106054B2 (en) Multi case-based reasoning by syntactic-semantic alignment and discourse analysis
CN104050223A (en) Pivot facets for text mining and search
CN111164704B (en) Derivation of mechanism of action for prediction of drug candidate adverse reactions
Kumar et al. The beginning of a new era: artificial intelligence in healthcare
Salman et al. Big data management in drug–drug interaction: a modern deep learning approach for smart healthcare
González García et al. What is (not) Big Data based on its 7Vs challenges: A survey
Wang et al. Ethereum smart contract vulnerability detection model based on triplet loss and BiLSTM
Panduman et al. A Survey of AI Techniques in IoT Applications with Use Case Investigations in the Smart Environmental Monitoring and Analytics in Real-Time IoT Platform
Levshun et al. Design of secure microcontroller-based systems: application to mobile robots for perimeter monitoring
Jin et al. Ontology-Based Semantic Modeling of Coal Mine Roof Caving Accidents
Azemi et al. Uncertainty in internet of things: a review
Wang et al. Product Innovation Design Process Model Based on Functional Genes Extraction and Construction
Horak et al. Data Integration from Heterogeneous Control Levels for the Purposes of Analysis within Industry 4.0 Concept
Barrera et al. An extension of iStar for Machine Learning requirements by following the PRISE methodology

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20190927