CN105005683A - Caching system and method for solving data normalization problem of regional medical system - Google Patents

Caching system and method for solving data normalization problem of regional medical system Download PDF

Info

Publication number
CN105005683A
CN105005683A CN201510337211.9A CN201510337211A CN105005683A CN 105005683 A CN105005683 A CN 105005683A CN 201510337211 A CN201510337211 A CN 201510337211A CN 105005683 A CN105005683 A CN 105005683A
Authority
CN
China
Prior art keywords
data
business
template
metadata
normalization
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201510337211.9A
Other languages
Chinese (zh)
Inventor
李轶强
马国耀
蔡军
肖华
杨帆
孙勇韬
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
BEIJING REALESOFT SOFTWARE TECHNOLOGY Co Ltd
Original Assignee
BEIJING REALESOFT SOFTWARE TECHNOLOGY Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by BEIJING REALESOFT SOFTWARE TECHNOLOGY Co Ltd filed Critical BEIJING REALESOFT SOFTWARE TECHNOLOGY Co Ltd
Priority to CN201510337211.9A priority Critical patent/CN105005683A/en
Publication of CN105005683A publication Critical patent/CN105005683A/en
Pending legal-status Critical Current

Links

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention provides a caching system and method for solving a data normalization problem of a regional medical system. The system comprises a template management unit, a data processing unit and a data management unit. The method comprises: defining a data structure normalization template and a service data model in the template; carrying out data structure normalization processing on acquired regional medical data by using the defined data structure normalization template and enabling the regional medical data to accord with service specifications in the aspect of structure; carrying out correlation verification, repeated data processing and deficient data complementing processing on the acquired regional medical data by using the defined data structure normalization template; and caching the data in a normalization process of the data structure normalization template and outputting the processed normalization regional medical data. According to the system and the method provided by the invention, a data caching function based on the data structure normalization template is realized; and the normalization processing of medical service data is realized, the data quality is improved, and the data accords with service archiving requirements.

Description

A kind of caching system and method solving area medical system data conventions problem
Technical field
The invention belongs to medical industry Data cache technology field, be specifically related to a kind of caching system and the method that solve area medical system data conventions problem.
Background technology
In the processing procedure of medical data, the processing procedure of data can be divided into gathering generally cleans and analysis and utilization two Main Stage, and collection wash phase main body completes the collection of data, cleaning, conversion, verification; In analysis and utilization stage main body complete paired data excavation, analyze, represent.Analysis phase based on the data gathering wash phase and provide, and gathers wash phase using analysis and utilization stage definitions demand data as acquisition condition, and the two-stage interdepends.Due to the diversity of collected medical data form, before carrying out analysis and utilization, need to carry out caching process to data, using the data resource as follow-up cleaning, conversion and verification.Caching system, as a data cushion, is deployed in large data acquisition wash phase, the result data that buffer memory collection, cleaning, switch process perform.
And the construction of this caching system is based on on agreeing with property of medical profession basis, the medical profession data model obtained, this model has the feature of medical profession standardization and stability, meet the structure of hospital system database on the one hand, meet the requirement to data content when reporting data on the other hand.
Summary of the invention
For prior art Problems existing, the invention provides a kind of caching system and the method that solve area medical system data conventions problem.
Solve a caching system for area medical system data conventions problem, comprising:
Template Manager unit: for defining and the business data model in management data structures standardization template and template, utilize the normalization of data structure template of definition to carry out normalization of data structure process to the area medical data gathered, make area medical data fit business norms from structure;
Data processing unit: utilize the normalization of data structure template of definition to carry out association verification, repeating data process and data incomplete completion process to the area medical data collected;
Data Management Unit: for carrying out buffer memory to the data in normalization of data structure Templates specifications process and normalized area medical data after output processing.
Described Template Manager unit comprises business model administration module and metadata management module;
Business data model administration module: realize the definition to business data model structure, to comprise between business and incidence relation between business and metadata, the business defined in business data model and the incidence relation of service metadata provide foundation to the data handling procedure performed in data processing unit;
Metadata management module: realize the definition to service metadata content, service metadata is the minimum unit in business data model, service metadata is on the one hand for forming normalization of data structure template, and another aspect is for defining the data cached library structure in Data Management Unit.
Described data processing unit comprises data check module, data deduplication module and Supplementing Data module;
Data check module: association checking treatment is carried out to the data of Data Management Unit buffer memory according to the business data model that Template Manager unit provides, whether verification current cache data meet business need;
Data deduplication module: data redundancy verification is carried out to the data of Data Management Unit buffer memory, and deleting duplicated data;
Supplementing Data module: carry out completion to the data of Data Management Unit buffer memory, carries out completion one by one by the data content of defect according to data dictionary, business norms, makes data meet integrity demands.
Described Data Management Unit comprises sending module, memory module and cache database;
Sending module: perform and send data task in cache database;
Memory module: the store tasks performing data in cache database;
Cache database: for being buffered in the data processing execution result obtained the data processing each stage in normalization of data structure Templates specifications process.
Described Template Manager unit realizes defining business data model and service metadata structure and managing;
Wherein, the data structure that service metadata structure has in execution data process of caching for cache database in Data Management Unit, service metadata structure defines with two-dimensional chain table structure based on cache database two-dimentional relation.
The business association relation of described business data model administration module definition business data model structure, use two-dimensional chain table structure describes the incidence relation between traffic item and traffic item, specifies the metadata structure of associated traffic item and associated use.
Described memory module is used for building cache database according to business data model and service metadata structure, and the storage realized for data processing unit data processing each phase data processing execution result, send to cache database and perform data storage request, and ensure the correct execution that data store.
Described data check module, according to the business data model provided in Template Manager unit and service metadata structure, carries out business association verification to the data of buffer memory in cache database.
Described Supplementing Data module is according to the incidence relation of the business defined in business data model and service metadata, and the business information that usage data dictionary and business data model define, carries out completion to data that are incomplete in cache database or disappearance.
The caching system of the solution area medical system data conventions problem described in employing carries out the method for medical data buffer memory, comprises the following steps:
Step 1: definition data structure standardization template and template in business data model;
Step 1-1: definition service metadata structure, whether whether whether whether the element in service metadata structure comprises that table name claims, version number, literary name name section, electronic health record authentication code field, table field description, major key, business major key, be empty, index, management fields title;
Step 1-2: the business data model in definition data structure standardization template;
To the definition of business data model structure, to comprise between business and incidence relation between business and metadata, the business defined in business data model and the incidence relation of service metadata, provide foundation to the data handling procedure performed in data processing unit;
Step 2: utilize the normalization of data structure template of definition to carry out normalization of data structure process to the area medical data gathered, make area medical data fit business norms from structure;
Step 2-1: according to the service metadata structure defined in step 1-1, creates the database table structure in this service metadata structure in cache database;
Step 2-2: the area medical data of collection are stored according to the list structure created in step 2-1;
Step 3: utilize the normalization of data structure template of definition to carry out association verification, repeating data process and data incomplete completion process to the area medical data collected;
Step 3-1: the data in cache database are verified, to judge whether the data of preserving in cache database meet the business association verification of business data model in normalization of data structure template, and generate quality of data verification report and the report of service template structure accordance;
Step 3-2: after step 3-1 execution is errorless, first progressively the data that Data Management Unit stores are verified according to business data model, according to the business association relation defined in business model structure, defect verification is carried out to quoted metadata structure content, according to the definition of data dictionary and business model structure, completion is carried out to data incomplete content;
Step 3-3: after step 3-2 execution is errorless, according to the management fields in metadata structure, the data record repeated is verified, the record of repetition is deleted from cache database;
Step 4: buffer memory is carried out to the data in normalization of data structure Templates specifications process and normalized area medical data after output processing.
Beneficial effect:
Caching system of the present invention realizes the data buffer storage function based on normalization of data structure template, realizes the standardization processing to medical profession data, promotes the quality of data, makes data fit business file requirement.Setting up of normalization of data structure template is that standard criterion based on medical profession creates, and this specification comprises business norms in medical industry national standard, institute, comprises data encoding convention and data memory format specification equally.Due to the singularity that medical industry relies on specification, make the normalized value of medical data particularly outstanding in actual applications.
The present invention carry the standardization storage problem that the caching system solving area medical system data conventions problem is intended to solve isomeric data in medical system, data storage is made to meet business norms and norm constraint, make the data of storage have stronger business accordance and standardization, thus promote the quality of data.The caching system of this solution area medical system data conventions problem and method successful implementation are at present in the processing item of medical information, business datum is made to obtain high-quality process and storage, thus promote the Construction and management of health care system, for the structure of area medical system provides strong support.
Accompanying drawing explanation
Fig. 1 is the caching system structured flowchart of the solution area medical system data conventions problem of one embodiment of the present invention;
Fig. 2 is one embodiment of the present invention module management cellular construction schematic block diagram;
Fig. 3 is one embodiment of the present invention Data Management Unit structural schematic block diagram;
Fig. 4 is one embodiment of the present invention data processing unit structural schematic block diagram;
Fig. 5 is one embodiment of the present invention service metadata structural drawing;
Fig. 6 is one embodiment of the present invention business data model structural drawing;
Fig. 7 is one embodiment of the present invention caching system application structure schematic diagram;
Fig. 8 is that one embodiment of the present invention outpatient service business ER schemes;
Fig. 9 is one embodiment of the present invention patient basis service metadata structural drawing;
Figure 10 is one embodiment of the present invention outpatient service diagnosis and treatment business data model structural drawing;
Figure 11 is the caching method process flow diagram that one embodiment of the present invention solves area medical system data conventions problem.
Embodiment
Below in conjunction with accompanying drawing, the specific embodiment of the present invention is elaborated.
This method realizes the standardization caching process to hygiene medical treatment data, promotes the quality of data, makes it meet hygienic practice standard.Utilize in process in hygiene medical treatment data, medical data is due to the singularity of its place application system and mechanism, to each other not there is versatility and consistance, and regional health platform is needed to the medical profession data obtaining standard universal, thus grasp the medical institutions' operation conditions in a region.Caching system object of the present invention is the On The Standardization solving medical data in region, consistance and the unitarity of heterogeneous medical data in feasible region is come by the definition of business data model and metadata structure, it is made to meet the requirement of regional health platform to data standard, for the application of medical data provides support.
The present invention, in implementation process, is also applicable to medical institutions' data source and changes, and the implementation condition that regional health platform standard changes.When medical institutions' data source changes, the field contents of service metadata need be adjusted; And business data model structure need be adjusted for the change of regional health platform standard, and enrich data dictionary content.Due to business data model due to its following and supporting business norms, the probability making this caching system need to adjust in actual application is extremely low, thus has higher system stability and extremely low construction cost.
In the present embodiment, for the standardization process of caching to Outpatient Department data, normalization of data structure template is outpatient service service template, realizes the standardization process of caching of hospital's end Outpatient Department data.The business datum collected in heterogeneous medical mechanism, according to regional health medical standard that platform defines, is carried out standardization processing and buffer memory by system of the present invention.
Data in medical system are carried out standardization caching process by the business rule of system of the present invention described by business model structure, and the data after process are reported to regional health platform.
The area medical data normalization problem that the present invention solves, its process based in data model to the definition of metadata and entity thereof.Metadata schema defines the data structure can carrying out acquisition operations with medical system, and the elementary cell of metadata is associated metadata elements, and associated metadata elements is the data set contents obtained from medical system acquisition is abstract decomposition and description.When a group metadata element has identical traffic performance, this group metadata element may be defined as metadata entity, and metadata entity can comprise 1 or multiple metadata entity, the traffic performance of metadata entity derives from (medical treatment) profession standard a nd norm.Cache database as data model actual physical storage medium and exist, its data stored are the actual mappings to database table of associated metadata elements and metadata entity.The data that cache database stores finally are reported to regional platform, simultaneously cache database also buffer memory data carry out the process data that verifies, duplicate removal, completion processing procedure produce, and after data stabilization, data are sent to regional platform.And the associated metadata elements defined and metadata entity, be build the condition of cache database and principle, be also simultaneously data are verified, duplicate removal, completion process foundation.Its essence of data handling procedure make data in terms of content more accurately and reduce redundancy, structurally completely consistent with associated metadata elements and metadata entity is principle, therefore the processing procedure of data be one progressively to the process of metadata convergence.
In the present invention, the definition of metadata entity is based on the practical business of health care, and the present embodiment introduces constructive process and the business datum standardization process of caching of business data model of the present invention for outpatient service business.
Data in area medical system are carried out standardization caching process by the business rule of system of the present invention described by business data model, and the data after process are reported to regional health platform.Application structure of the present invention as shown in Figure 7.
The area medical system data conventions problem that the present invention solves, its process based in business data model to the definition of service metadata and entity thereof.Service metadata defines the data structure can carrying out acquisition operations with area medical system, and the elementary cell of service metadata is associated metadata elements, and associated metadata elements is the data set contents obtained from area medical system acquisition is abstract decomposition and description.When a group metadata element has identical traffic performance, this group metadata element may be defined as metadata entity, and metadata entity can comprise 1 or multiple metadata entity, the traffic performance of metadata entity derives from (medical treatment) profession standard a nd norm.Cache database as business data model actual physical storage medium and exist, its data stored are the actual mappings to database table of associated metadata elements and metadata entity.The data that cache database stores finally are reported to area medical platform, simultaneously cache database also buffer memory data carry out the process data that verifies, duplicate removal, completion processing procedure produce, and after data stabilization, data are sent to area medical platform.And the associated metadata elements defined and metadata entity, be build the condition of cache database and principle, be also simultaneously data are verified, duplicate removal, completion process foundation.Its essence of data handling procedure make data in terms of content more accurately and reduce redundancy, structurally completely consistent with associated metadata elements and metadata entity is principle, therefore the processing procedure of data be one progressively to the process of metadata convergence.
As shown in Figure 1, a kind of caching system solving area medical system data conventions problem, comprising:
Template Manager unit: for defining and the business data model in management data structures standardization template and template, utilize the normalization of data structure template of definition to carry out normalization of data structure process to the area medical data gathered, make area medical data fit business norms from structure;
Data processing unit: utilize the normalization of data structure template of definition to carry out association verification, repeating data process and data incomplete completion process to the area medical data collected;
Data Management Unit: for carrying out buffer memory to the data in normalization of data structure Templates specifications process and normalized area medical data after output processing.
As shown in Figure 2, Template Manager unit comprises business model administration module and metadata management module;
Business data model administration module: realize the definition to business data model structure, to comprise between business and incidence relation between business and metadata, the business defined in business data model and the incidence relation of service metadata provide foundation to the data handling procedure performed in data processing unit; First define traffic item and traffic item incidence relation to each other, traffic item associated afterwards and the metadata of associated use are specified.
Metadata management module: realize the definition to service metadata content, service metadata is the minimum unit in business data model, service metadata is on the one hand for forming normalization of data structure template, and another aspect is for defining the data cached library structure in Data Management Unit.
Template Manager unit realizes defining business data model and service metadata structure and managing;
Wherein, the data structure that service metadata structure has in execution data process of caching for cache database in Data Management Unit, service metadata structure defines with two-dimensional chain table structure based on cache database two-dimentional relation.
The business association relation of business data model administration module definition business data model structure, use two-dimensional chain table structure describes the incidence relation between traffic item and traffic item, specifies the metadata structure of associated traffic item and associated use.
The service metadata structure of present embodiment as shown in Figure 5, service metadata structure describes to the minimum particle size of regional health platform institute reported data, its to contain information be the content finally required to regional health platform, be also basis and the foundation of carrying out Supplementing Data and duplicate removal process.As shown in Figure 6, wherein, business data model is definition and the description of incidence relation between business datum to the business data model structure of present embodiment, is basis and the foundation of data being carried out to School Affairs process.In the present embodiment, in the definition reference zone health platform of business data model and service metadata structure, this traffic criteria is carried out.
As shown in Figure 4, data processing unit comprises data check module, data deduplication module and Supplementing Data module;
Data check module: association checking treatment is carried out to the data of Data Management Unit buffer memory according to the business data model that Template Manager unit provides, whether verification current cache data meet business need;
Data deduplication module: data redundancy verification is carried out to the data of Data Management Unit buffer memory, and deleting duplicated data;
Supplementing Data module: carry out completion to the data of Data Management Unit buffer memory, carries out completion one by one by the data content of defect according to data dictionary, business norms, makes data meet integrity demands.
Data processing unit verifies cache data content, duplicate removal and completion process, makes data meet business norms in terms of content.Data check module, first according to the business data model provided in Template Manager unit and metadata, carries out business association verification to data temporary in cache database.Afterwards, Supplementing Data module is according to data service incidence relation, and the business information that usage data dictionary and business model define, carries out completion to data that are incomplete in cache database or disappearance.Finally, data deduplication module carries out repeatability verification to the data meeting business association rule in cache database, deleting, the abnormal information existed being carried out log recording simultaneously in duplication check by repeating record.
As shown in Figure 3, Data Management Unit comprises sending module, memory module and cache database;
Sending module: perform and send data task in cache database;
Memory module: the store tasks performing data in cache database;
Cache database: for being buffered in the data processing execution result obtained the data processing each stage in normalization of data structure Templates specifications process.
Memory module is used for building cache database according to business data model and service metadata structure, and the storage realized for data processing unit data processing each phase data processing execution result, send to cache database and perform data storage request, and ensure the correct execution that data store.
Data check module, according to the business data model provided in Template Manager unit and service metadata structure, carries out business association verification to the data of buffer memory in cache database.
Supplementing Data module is according to the incidence relation of the business defined in business data model and service metadata, and the business information that usage data dictionary and business data model define, carries out completion to data that are incomplete in cache database or disappearance.
Below the title involved by this method is defined.
(1) service metadata: the data defining and describe other data; Service metadata is the minimum unit in business data model, and service metadata is on the one hand for forming normalization of data structure template, and another aspect is for defining the data cached library structure in Data Management Unit;
(2) data set: have certain topic, can identify and the data acquisition that can be subsequently can by computer;
(3) associated metadata elements: the elementary cell of metadata;
(4) metadata entity: the associated metadata elements of a group profile data same characteristic features.One or more metadata entity can be comprised;
(5) metadata subsets: the subclass of metadata, closes associated metadata elements by the metadata subsets of being correlated with and forms.
Adopt the caching system solving area medical system data conventions problem to carry out the method for medical data buffer memory, as shown in figure 11, comprise the following steps:
Step 1: definition data structure standardization template and template in business data model;
Usually can carry out abstract description by the mode of ER figure to the description of outpatient service business in medical institutions' system, the description of these business relations as shown in Figure 8.This ER figure describes the storage relation of HIS system database and table in area medical system, relation described by it is that business data model constructed by the present invention and metadata structure have effective directive significance, but ER figure is a kind of description form of relation, and the input of Structure Creating process involved by the present invention is not limited only to ER figure.
Step 1-1: definition service metadata structure, whether whether the element in service metadata structure comprise table name and claim (TName), version number (Version), literary name name section (Fields), electronic health record authentication code field (DE_Fields), table field description (F_Desc), whether major key (Primary), whether business major key (B_Primary), be empty (isNull), index (isIndex), management fields title (Mana_F); The service metadata structure of present embodiment as shown in Figure 5.
Definition outpatient service metadata structure, M outpatient service business=< m patient basis, m out-patient registration, m outpatient clinic, m prescriptions for Out-patients, m ambulatory expenses is detailed, m ambulatory expenses invoice>, M are metadata structure, according to the definition of Fig. 5 service metadata structure, for " patient basis ", its m patient basispatient basis's metadata structure as shown in Figure 9 can be defined as.
Patient basis's metadata constructive process take metadata structure as frame constraint, with data item in actual heterogeneous system for content.For Fig. 9, the two-dimensional chain table structure of this metadata describes this patient basis, comprises table name and claims TName to be R_MPI_PATIENTINFO; Version number Version is 2_0; Literary name name section Fields is AUTO_ID, LP_MPI_PATIENT, PATIENT_ID, NAME etc.; Table field description F_Desc explains the associated description of Fields; DE_Fields is electronic health record authentication code corresponding to this field, electronic health record authentication code is for reporting in process carrying out data, necessarily require the field reporting and process, and electronic health record mark is content described in the WS445.2-2014 series standard drafted according to CNS GB/T1.1-2009; And wherein whether major key Primary, whether business major key B_Primary, whether be sky isNull, whether index inIndex is arranged according to the characteristic of Fields; Management fields title Mana_F is RECORD_DTIME, RECORD_UPDATE_DTIME, for recording content metadata rise time working days and update time.
Build all the other metadata in outpatient service diagnosis and treatment business, m out-patient registration, m outpatient clinic, m prescriptions for Out-patients, m ambulatory expenses is detailed, m ambulatory expenses invoice.Metadata business association relation to each other, is described by business data model structure, for the constructive process of business data model structure, on the one hand based on metadata item content, on the other hand based on data ER graph of a relation.
Outpatient service diagnosis and treatment business data model structure describes outpatient service service metadata incidence relation to each other, record its service metadata title quoted by model master meter name item TN in business data model structure, and associate list item LTN and associate field item LKCol and have recorded and the table associated by TN item and literary name section.Meanwhile, master-salve table incidence relation is described by item L and item R, describes the incidence relation of (1:1) one to one between its principal and subordinate's contingency table or one-to-many (1:N).
The table name defined in business data model reference metadata structure claims the information such as TName, literary name section Fields, does not create new metadata item information in business data model, but the business association relation between descriptive metadata.
Step 1-2: the business data model in definition data structure standardization template;
To the definition of business data model structure, to comprise between business and incidence relation between business and metadata, the business defined in business data model and the incidence relation of service metadata, provide foundation to the data handling procedure performed in data processing unit;
Service metadata in business data model is quoting and recombinating element in service metadata in step 1-1, business data model structure and a series of combination with the metadata of identical services feature, i.e. metadata entity.At the definition procedure of this metadata entity, object is carried out organizing and associating according to traffic performance by the metadata structure defined in step 1-1, the element wherein in entity comprise model describe (Desc), master meter name (TN), contingency table (LTN), table associate field (LKCol), with master meter incidence relation (1 or N), with contingency table incidence relation (1 or N).
The table name defined in business data model structure reference metadata structure claims the information such as TName, literary name section Fields, does not create new metadata item information in business data model, but the business association relation between descriptive metadata.
After completing the definition to outpatient service diagnosis and treatment business data model, native system is according to the information structure normalization of data structure template defined in metadata and business data model.In the present system, metadata structure and business data model are undertaken storing and describing by the mode of Excel form, and define metadata title with paging Sheet in Excel, the cell Cell in Sheet carrys out the information of record cast middle term.Certainly, the characteristic of Excel list two relations is utilized to be one mode the most easily in actual applications to record the mode of data model, other modes can be used to carry out recording and describing simultaneously, not limit at this, only illustrate in mode the most conventional in native system.
Metadata management module in the Template Manager unit of native system and business model administration module realize administering and maintaining above-mentioned metadata and business data model, file access interface is provided in above-mentioned two modules, complete and operate with the I/O of Excel file, data in file are carried out extracting and verifying, to judge between metadata and business data model, whether association description is accurate simultaneously.
Step 2: utilize the normalization of data structure template of definition to carry out normalization of data structure process to the area medical data gathered, make area medical data fit business norms from structure;
Step 2-1: according to the service metadata structure defined in step 1-1, creates the database table structure in this service metadata structure in cache database;
Step 2-2: the area medical data of collection are stored according to the list structure created in step 2-1;
Metadata structure, according to the definition of service metadata structure in Template Manager unit, is converted to the version of database table, and is created in cache database by Data Management Unit.
Step 3: utilize the normalization of data structure template of definition to carry out association verification, repeating data process and data incomplete completion process to the area medical data collected;
Step 3-1: the data in cache database are verified, to judge whether the data of preserving in cache database meet the business association verification of business data model in normalization of data structure template, and generate quality of data verification report and the report of service template structure accordance;
Checking procedure is divided into data major key to associate School Affairs patient business datum completeness check, namely data major key association verification verifies incidence relation between business datum major key, to judge whether to occur annular association or cross correlation, to prevent from occurring bulk redundancy information when carrying out Descartes and calculating; Patient's business datum completeness check is the business verification of a class based on business scenario, its verification is core with patient data information, comprise the verification to cost information, patient's business information completeness check, whether meet to verify patient data the business association relation that business data model defines.
The process of business verification is undertaken by data check module, and it is that condition is carried out that its processing procedure is closed with the business association that business model structure in Template Manager unit defines.
Step 3-2: after step 3-1 execution is errorless, progressively the data that Data Management Unit stores are verified according to business data model, according to the business association relation defined in business model structure, defect verification is carried out to quoted metadata structure content, according to the definition of data dictionary and business model structure, completion is carried out to data incomplete content;
Supplementing Data comprises three class Supplementing Data operations, dictionary completion, business completion and project completion, and incomplete data is carried out completion according to data dictionary content by field completion; Business completion, according to the incidence relation defined in business data model, carries out completion to the content metadata that there is data incompleteness, and business completion process can call dictionary completion process; Project completion, be carry out completion with the missing data under the project of certain customization or business specific background, this completion process is carried out according to the data characteristics under specific background, and project completion process can call business completion and dictionary completion process.
The business information of a series of solidification of data dictionary record, carries out completion for logic contained by data item to data incomplete.Data dictionary is kept in Data Management Unit with the form of database table.
Step 3-3: after step 3-2 execution is errorless, according to the management fields (Mana_F) in metadata structure, the data record repeated is verified, the record of repetition is deleted from cache database;
The duplicate removal process of data is that the repeating data existed in gatherer process, business data processing process native system carries out deletion cleaning, comprises and extracts repeatability cleaning and the data scrubbing of business repeatability.Extract repeatability cleaning, refer to that the data deduplication process performed by it is carried out according to the medical profession data redundancy collected, the management fields content according to Mana_F item definition in metadata structure processes.The data scrubbing of business repeatability, be that the business major key of B_Primary item definition processes, business major key is the unique identification of every bar business datum.
For duplicate removal process in verification obtain repeat record, its scale removal process can be recorded in the daily record of native system, to realize the participation of manual type, carries out rollback in time, and adjust duplicate removal processing execution process to the duplicate removal process that there is mistake.
Step 4: buffer memory is carried out to the data in normalization of data structure Templates specifications process and normalized area medical data after output processing.
Data after being disposed to data, report, and empty business datum content in cache database simultaneously.
Purport of the present invention realizes the data buffer storage function based on normalization of data structure template, realize the medical data process for regional health platform and buffer memory, medical data quality of data in process of caching effectively can be promoted, realize planning and the standardization of data.Business data model structure builds based on the medical data feature of area medical system, according to medical profession flow characteristic, metadata in each for medical profession flow process implementation and incidence relation are mapped in business data model and metadata structure, makes it have stronger business accordance and standardization.Carrying out in medical profession data schema process of caching, its standardized feature of business model and its acquisition target heterogeneous medical system and report subject area health platform, all can realize to the dependence of benchmark service with associate, there is again its singularity simultaneously, and business data model and caching system object are that carrying out definition to business datum general between area medical system and regional health platform and incidence relation thereof describes, and the specific details shielded each other between system, thus by the data decoupler between native system feasible region medical system and area medical platform, data processing between differentiation system and data and transmission is realized with general structure.
Area medical system and area medical platform are carried out the implementation of decoupling zero by system described in the present invention, carrying out in medical data acquisition process, its business data model and metadata structure can be applied in the standardization construction scene of multiple medical data acquisition, while saving construction cost, the traffic performance of model also will obtain supplementing and optimizing in actual use, thus reach eucyclic target.
Although the foregoing describe the specific embodiment of the present invention, the those skilled in the art in this area should be appreciated that these only illustrate, can make various changes or modifications, and do not deviate from principle of the present invention and essence to these embodiments.Scope of the present invention is only defined by the appended claims.

Claims (10)

1. solve a caching system for area medical system data conventions problem, it is characterized in that, comprising:
Template Manager unit: for defining and the business data model in management data structures standardization template and template, utilize the normalization of data structure template of definition to carry out normalization of data structure process to the area medical data gathered, make area medical data fit business norms from structure;
Data processing unit: utilize the normalization of data structure template of definition to carry out association verification, repeating data process and data incomplete completion process to the area medical data collected;
Data Management Unit: for carrying out buffer memory to the data in normalization of data structure Templates specifications process and normalized area medical data after output processing.
2. the caching system of solution area medical system data conventions problem according to claim 1, it is characterized in that, described Template Manager unit comprises business model administration module and metadata management module;
Business data model administration module: realize the definition to business data model structure, to comprise between business and incidence relation between business and metadata, the business defined in business data model and the incidence relation of service metadata provide foundation to the data handling procedure performed in data processing unit;
Metadata management module: realize the definition to service metadata content, service metadata is the minimum unit in business data model, service metadata is on the one hand for forming normalization of data structure template, and another aspect is for defining the data cached library structure in Data Management Unit.
3. the caching system of solution area medical system data conventions problem according to claim 1, it is characterized in that, described data processing unit comprises data check module, data deduplication module and Supplementing Data module;
Data check module: association checking treatment is carried out to the data of Data Management Unit buffer memory according to the business data model that Template Manager unit provides, whether verification current cache data meet business need;
Data deduplication module: data redundancy verification is carried out to the data of Data Management Unit buffer memory, and deleting duplicated data;
Supplementing Data module: carry out completion to the data of Data Management Unit buffer memory, carries out completion one by one by the data content of defect according to data dictionary, business norms, makes data meet integrity demands.
4. the caching system of solution area medical system data conventions problem according to claim 1, it is characterized in that, described Data Management Unit comprises sending module, memory module and cache database;
Sending module: perform and send data task in cache database;
Memory module: the store tasks performing data in cache database;
Cache database: for being buffered in the data processing execution result obtained the data processing each stage in normalization of data structure Templates specifications process.
5. the caching system of solution area medical system data conventions problem according to claim 1, is characterized in that, described Template Manager unit realizes defining business data model and service metadata structure and managing;
Wherein, the data structure that service metadata structure has in execution data process of caching for cache database in Data Management Unit, service metadata structure defines with two-dimensional chain table structure based on cache database two-dimentional relation.
6. the caching system of solution area medical system data conventions problem according to claim 2, it is characterized in that, the business association relation of described business data model administration module definition business data model structure, use two-dimensional chain table structure describes the incidence relation between traffic item and traffic item, specifies the metadata structure of associated traffic item and associated use.
7. the caching system of solution area medical system data conventions problem according to claim 4, it is characterized in that, described memory module is used for building cache database according to business data model and service metadata structure, and the storage realized for data processing unit data processing each phase data processing execution result, send to cache database and perform data storage request, and ensure the correct execution that data store.
8. the caching system of solution area medical system data conventions problem according to claim 3, it is characterized in that, described data check module, according to the business data model provided in Template Manager unit and service metadata structure, carries out business association verification to the data of buffer memory in cache database.
9. the caching system of solution area medical system data conventions problem according to claim 3, it is characterized in that, described Supplementing Data module is according to the incidence relation of the business defined in business data model and service metadata, the business information that usage data dictionary and business data model define, carries out completion to data that are incomplete in cache database or disappearance.
10. adopt the caching system of solution area medical system data conventions problem according to claim 1 to carry out the method for medical data buffer memory, it is characterized in that, comprise the following steps:
Step 1: definition data structure standardization template and template in business data model;
Step 1-1: definition service metadata structure, whether whether whether whether the element in service metadata structure comprises that table name claims, version number, literary name name section, electronic health record authentication code field, table field description, major key, business major key, be empty, index, management fields title;
Step 1-2: the business data model in definition data structure standardization template;
To the definition of business data model structure, to comprise between business and incidence relation between business and metadata, the business defined in business data model and the incidence relation of service metadata, provide foundation to the data handling procedure performed in data processing unit;
Step 2: utilize the normalization of data structure template of definition to carry out normalization of data structure process to the area medical data gathered, make area medical data fit business norms from structure;
Step 2-1: according to the service metadata structure defined in step 1-1, creates the database table structure in this service metadata structure in cache database;
Step 2-2: the area medical data of collection are stored according to the list structure created in step 2-1;
Step 3: utilize the normalization of data structure template of definition to carry out association verification, repeating data process and data incomplete completion process to the area medical data collected;
Step 3-1: the data in cache database are verified, to judge whether the data of preserving in cache database meet the business association verification of business data model in normalization of data structure template, and generate quality of data verification report and the report of service template structure accordance;
Step 3-2: after step 3-1 execution is errorless, first progressively the data that Data Management Unit stores are verified according to business data model, according to the business association relation defined in business model structure, defect verification is carried out to quoted metadata structure content, according to the definition of data dictionary and business model structure, completion is carried out to data incomplete content;
Step 3-3: after step 3-2 execution is errorless, according to the management fields in metadata structure, the data record repeated is verified, the record of repetition is deleted from cache database;
Step 4: buffer memory is carried out to the data in normalization of data structure Templates specifications process and normalized area medical data after output processing.
CN201510337211.9A 2015-06-17 2015-06-17 Caching system and method for solving data normalization problem of regional medical system Pending CN105005683A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510337211.9A CN105005683A (en) 2015-06-17 2015-06-17 Caching system and method for solving data normalization problem of regional medical system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510337211.9A CN105005683A (en) 2015-06-17 2015-06-17 Caching system and method for solving data normalization problem of regional medical system

Publications (1)

Publication Number Publication Date
CN105005683A true CN105005683A (en) 2015-10-28

Family

ID=54378355

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510337211.9A Pending CN105005683A (en) 2015-06-17 2015-06-17 Caching system and method for solving data normalization problem of regional medical system

Country Status (1)

Country Link
CN (1) CN105005683A (en)

Cited By (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105550511A (en) * 2015-12-11 2016-05-04 北京锐软科技股份有限公司 Data quality evaluation system and method based on data verification technique
CN105631044A (en) * 2016-01-29 2016-06-01 四川长虹电器股份有限公司 Convergence method of network video resources
CN106066929A (en) * 2016-05-25 2016-11-02 中南大学 A kind of novel clinical medical object method for organizing
CN106777970A (en) * 2016-12-15 2017-05-31 北京锐软科技股份有限公司 The integrated system and method for a kind of medical information system data template
CN107016561A (en) * 2016-10-28 2017-08-04 阿里巴巴集团控股有限公司 A kind of information processing method and device
CN107103196A (en) * 2017-04-26 2017-08-29 成都中医药大学 A kind of tcm clinical practice data cleaning method
CN107729556A (en) * 2017-11-08 2018-02-23 山东浪潮云服务信息科技有限公司 A kind of business datum archiving method and system
CN108268462A (en) * 2016-12-30 2018-07-10 广东精点数据科技股份有限公司 A kind of data quality checking system of relation integraity
CN108877920A (en) * 2018-06-15 2018-11-23 申艳莉 Diagnosis and treatment data managing method and system
CN109144990A (en) * 2018-09-03 2019-01-04 国网浙江省电力有限公司信息通信分公司 A kind of power communication big data method for quality control based on metadata driven
CN109582666A (en) * 2018-09-29 2019-04-05 阿里巴巴集团控股有限公司 Data major key generation method, device, electronic equipment and storage medium
CN109582286A (en) * 2018-07-04 2019-04-05 福州震旦计算机技术有限公司 Data standard method of calibration and its device based on Freemarker technology
CN109616180A (en) * 2018-11-07 2019-04-12 平安科技(深圳)有限公司 Data analysing method, device, terminal and storage medium
CN110289058A (en) * 2019-06-06 2019-09-27 北京市天元网络技术股份有限公司 A kind of electronic health record standardization matching process and device
CN110309124A (en) * 2019-05-23 2019-10-08 深圳宏崎达技术有限公司 Data managing method and system
CN111651442A (en) * 2020-05-15 2020-09-11 京东数字科技控股有限公司 Data reporting method and device, electronic equipment and storage medium
CN111988896A (en) * 2020-08-05 2020-11-24 薛亮 Internet of things equipment management method based on edge computing gateway and big data cloud platform
CN112286912A (en) * 2020-08-12 2021-01-29 上海柯林布瑞信息技术有限公司 Medical data quality checking method and device, terminal and storage medium
CN112328576A (en) * 2020-11-13 2021-02-05 浙江卡易智慧医疗科技有限公司 Representation method of universal data model based on multiple data sources
CN112631785A (en) * 2020-12-31 2021-04-09 新奥数能科技有限公司 Business data processing method and device based on BPMN
CN116860741A (en) * 2023-08-31 2023-10-10 成都智慧锦城大数据有限公司 Automatic data standard checking and synchronizing system and method based on message queue

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101452503A (en) * 2008-11-28 2009-06-10 上海生物信息技术研究中心 Isomerization clinical medical information shared system and method
CN102509012A (en) * 2011-11-04 2012-06-20 厦门市智业软件工程有限公司 Method for mapping contents of electronic medical record into electronic medical record standard database
CN104361221A (en) * 2014-10-31 2015-02-18 沈阳锐易特软件技术有限公司 Heterogeneous system data mapping template-based medical data acquisition system and method

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101452503A (en) * 2008-11-28 2009-06-10 上海生物信息技术研究中心 Isomerization clinical medical information shared system and method
CN102509012A (en) * 2011-11-04 2012-06-20 厦门市智业软件工程有限公司 Method for mapping contents of electronic medical record into electronic medical record standard database
CN104361221A (en) * 2014-10-31 2015-02-18 沈阳锐易特软件技术有限公司 Heterogeneous system data mapping template-based medical data acquisition system and method

Cited By (27)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105550511B (en) * 2015-12-11 2018-02-09 北京锐软科技股份有限公司 A kind of quality of data evaluation system and method based on data check technology
CN105550511A (en) * 2015-12-11 2016-05-04 北京锐软科技股份有限公司 Data quality evaluation system and method based on data verification technique
CN105631044A (en) * 2016-01-29 2016-06-01 四川长虹电器股份有限公司 Convergence method of network video resources
CN106066929B (en) * 2016-05-25 2018-10-02 中南大学 A kind of clinical medicine object tissue method based on metanetwork
CN106066929A (en) * 2016-05-25 2016-11-02 中南大学 A kind of novel clinical medical object method for organizing
CN107016561A (en) * 2016-10-28 2017-08-04 阿里巴巴集团控股有限公司 A kind of information processing method and device
CN106777970A (en) * 2016-12-15 2017-05-31 北京锐软科技股份有限公司 The integrated system and method for a kind of medical information system data template
CN106777970B (en) * 2016-12-15 2018-12-07 北京锐软科技股份有限公司 A kind of integrated system and method for medical information system data template
CN108268462A (en) * 2016-12-30 2018-07-10 广东精点数据科技股份有限公司 A kind of data quality checking system of relation integraity
CN107103196A (en) * 2017-04-26 2017-08-29 成都中医药大学 A kind of tcm clinical practice data cleaning method
CN107729556A (en) * 2017-11-08 2018-02-23 山东浪潮云服务信息科技有限公司 A kind of business datum archiving method and system
CN108877920A (en) * 2018-06-15 2018-11-23 申艳莉 Diagnosis and treatment data managing method and system
CN109582286A (en) * 2018-07-04 2019-04-05 福州震旦计算机技术有限公司 Data standard method of calibration and its device based on Freemarker technology
CN109582286B (en) * 2018-07-04 2021-11-26 福州震旦计算机技术有限公司 Freemarker technology-based data normalization verification method and device
CN109144990A (en) * 2018-09-03 2019-01-04 国网浙江省电力有限公司信息通信分公司 A kind of power communication big data method for quality control based on metadata driven
CN109582666A (en) * 2018-09-29 2019-04-05 阿里巴巴集团控股有限公司 Data major key generation method, device, electronic equipment and storage medium
CN109616180A (en) * 2018-11-07 2019-04-12 平安科技(深圳)有限公司 Data analysing method, device, terminal and storage medium
CN110309124A (en) * 2019-05-23 2019-10-08 深圳宏崎达技术有限公司 Data managing method and system
CN110309124B (en) * 2019-05-23 2021-12-03 深圳宏崎达技术有限公司 Data management method and system
CN110289058A (en) * 2019-06-06 2019-09-27 北京市天元网络技术股份有限公司 A kind of electronic health record standardization matching process and device
CN111651442A (en) * 2020-05-15 2020-09-11 京东数字科技控股有限公司 Data reporting method and device, electronic equipment and storage medium
CN111988896A (en) * 2020-08-05 2020-11-24 薛亮 Internet of things equipment management method based on edge computing gateway and big data cloud platform
CN112286912A (en) * 2020-08-12 2021-01-29 上海柯林布瑞信息技术有限公司 Medical data quality checking method and device, terminal and storage medium
CN112328576A (en) * 2020-11-13 2021-02-05 浙江卡易智慧医疗科技有限公司 Representation method of universal data model based on multiple data sources
CN112631785A (en) * 2020-12-31 2021-04-09 新奥数能科技有限公司 Business data processing method and device based on BPMN
CN116860741A (en) * 2023-08-31 2023-10-10 成都智慧锦城大数据有限公司 Automatic data standard checking and synchronizing system and method based on message queue
CN116860741B (en) * 2023-08-31 2023-11-10 成都智慧锦城大数据有限公司 Automatic data standard checking and synchronizing system and method based on message queue

Similar Documents

Publication Publication Date Title
CN105005683A (en) Caching system and method for solving data normalization problem of regional medical system
CN103279542B (en) Data import processing method and data processing equipment
WO2021000494A1 (en) Blockchain-based operation logging method and apparatus, device, and storage medium
CN103377100B (en) A kind of data back up method, network node and system
CN106503912A (en) A kind of data service system
CN106777970A (en) The integrated system and method for a kind of medical information system data template
CN107038162A (en) Real time data querying method and system based on database journal
CN105144080A (en) System for metadata management
CN106164865A (en) Affairs batch processing for the dependency perception that data replicate
CN104641614A (en) Systems and methods for scalable structured data distribution
CN102760206A (en) System and method for sharing cross-regional medical image information
CN104933173A (en) Data processing method and device used for heterogeneous multiple data sources, and server
CN108228755A (en) The data of MySQL database based on daily record analytic technique to Hadoop platform synchronize clone method
JP6328768B2 (en) Metadata automation system
KR102141784B1 (en) System for managing ontology data of power grid
CN104991785A (en) Standardized clinical data service support system and method
KR20220013108A (en) System for providing intergration platform for collecting, processing and storaging of bigdata
US8099663B2 (en) Apparatus and method for document synchronization
CN113641659A (en) Medical characteristic database construction method, device, equipment and storage medium
Xu et al. Research on diagnostic information of smart medical care based on big data
CN103729455B (en) Master data storage method based on primary copy storage pattern
Zhu et al. Data modeling for big data
KR101543506B1 (en) Data Warehouse System and Construction Method Thereof
CN103488693A (en) Data processing device and data processing method
CN115456413A (en) Method, device and equipment for matching personnel with posts and storage medium

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20151028

RJ01 Rejection of invention patent application after publication