CN109189769A - Data standardization processing method, device, computer equipment and storage medium - Google Patents
Data standardization processing method, device, computer equipment and storage medium Download PDFInfo
- Publication number
- CN109189769A CN109189769A CN201810925040.5A CN201810925040A CN109189769A CN 109189769 A CN109189769 A CN 109189769A CN 201810925040 A CN201810925040 A CN 201810925040A CN 109189769 A CN109189769 A CN 109189769A
- Authority
- CN
- China
- Prior art keywords
- field
- initial
- standardization
- initial table
- type
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000003672 processing method Methods 0.000 title claims abstract description 25
- 238000013507 mapping Methods 0.000 claims abstract description 74
- 238000000034 method Methods 0.000 claims abstract description 28
- 238000006243 chemical reaction Methods 0.000 claims abstract description 22
- 238000004590 computer program Methods 0.000 claims description 25
- 238000012545 processing Methods 0.000 claims description 23
- 238000010606 normalization Methods 0.000 claims description 18
- 238000002790 cross-validation Methods 0.000 claims description 12
- 238000001514 detection method Methods 0.000 claims description 11
- 230000000717 retained effect Effects 0.000 claims description 7
- 238000000605 extraction Methods 0.000 claims description 4
- 238000012795 verification Methods 0.000 claims description 3
- 238000003745 diagnosis Methods 0.000 description 9
- 229940079593 drug Drugs 0.000 description 8
- 239000003814 drug Substances 0.000 description 8
- 238000010586 diagram Methods 0.000 description 6
- 230000002159 abnormal effect Effects 0.000 description 5
- 241001269238 Data Species 0.000 description 4
- 238000013459 approach Methods 0.000 description 4
- 239000000284 extract Substances 0.000 description 4
- 238000007689 inspection Methods 0.000 description 4
- 238000012546 transfer Methods 0.000 description 4
- 230000008859 change Effects 0.000 description 3
- 238000004891 communication Methods 0.000 description 3
- 230000003068 static effect Effects 0.000 description 3
- 241000208340 Araliaceae Species 0.000 description 2
- 235000005035 Panax pseudoginseng ssp. pseudoginseng Nutrition 0.000 description 2
- 235000003140 Panax quinquefolius Nutrition 0.000 description 2
- 230000005856 abnormality Effects 0.000 description 2
- 238000013479 data entry Methods 0.000 description 2
- 235000008434 ginseng Nutrition 0.000 description 2
- 239000000203 mixture Substances 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- 230000001360 synchronised effect Effects 0.000 description 2
- 238000013519 translation Methods 0.000 description 2
- 238000010276 construction Methods 0.000 description 1
- 230000002708 enhancing effect Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q40/00—Finance; Insurance; Tax strategies; Processing of corporate or income taxes
- G06Q40/08—Insurance
Abstract
This application involves a kind of data standardization processing method based on data resource, device, computer equipment and storage mediums.It include primary data in initial table the described method includes: obtaining initial table;The critical field of the primary data is extracted from initial table;Obtain the mapping relations between initial table and standard scale;It include criteria field in standard scale;According to mapping relations, critical field is converted into criteria field;Standardization table corresponding with initial table is generated using multiple criteria fields after conversion.It can be realized the standardization between the data in multiple areas using this method, and provided convenience for the update of the data of different regions and arrangement etc..
Description
Technical field
This application involves technical field of data processing, more particularly to a kind of data standardization processing method, device, calculating
Machine equipment and storage medium.
Background technique
In existing medical data and insurance data, the table structure in each city, field, the value condition of same field are not
Unanimously.For example, some insured insurance types are provided with static form, by directly acquiring existing insured guarantor from database
Dangerous type, the insured insurance type in some cities is provided with dynamic-form, by the way that acquisition is constantly updated in real time from database
Insured insurance type, the insurance type of insurant has the insured section of multiple and different correspondences, therefore, in order to improve to multiple
Medical data and the acquisition of insurance data and synchronous efficiency in area, it is desirable to provide multiple numbers medical from different places can be achieved
According to the unified approach between insurance data.
In traditional data normalization, the two-way maximum matching participle based on medical skill term dictionary is usually utilized to calculate
Method segments medical text data, obtains structural data, simple realization medical data construction standard.But due to not
Data are had differences between the medical data in area, data structure difference is not limited only to, further includes the word of different data
Segment difference is different and value difference, and in traditional data normalization method, is also not directed to unification relevant to settlement of insurance claim data
Method, therefore be not particularly suited for realizing standardization between medical data and insurance data in multiple areas.
Summary of the invention
Based on this, it is necessary to which in view of the above technical problems, providing one kind can be realized medical data and guarantor in multiple areas
Data standardization processing method, device, computer equipment and the storage medium of dangerous data normalization.
A kind of data standardization processing method, which comprises
Initial table is obtained, includes primary data in the initial table;
The critical field of the primary data is extracted from the initial table;
Obtain the mapping relations between the initial table and standard scale;It include criteria field in the standard scale;
According to the mapping relations, the critical field is converted into criteria field;
Standardization table corresponding with the initial table is generated using multiple criteria fields after conversion.
In one of the embodiments, before the acquisition initial table, further includes:
Establish the connection with third party database;
The initial table is obtained from the third party database, the initial table is labeled as original table;
The initial table is initially verified using the original table;
When by carrying out completeness check to multiple critical fielies in the initial table when initially verifying.
The critical field includes user identifier in one of the embodiments,;The method also includes:
The corresponding type of the standardization table is obtained, the type includes medical type and Claims Resolution type;
The standardization table of corresponding medical type and the standardization table for type of settling a claim are obtained according to user identifier;
The standardization table of the standardization table of the medical type and type of settling a claim is subjected to cross validation, identifies medical class
Variance data between the standardization table of type and the standardization table for type of settling a claim.
In one of the embodiments, before the mapping relations obtained between the initial table and standard scale, institute
State method further include:
The major key and external key in the initial table are obtained, and obtains the corresponding relationship between the major key and the external key;
The major key and external key in the standard scale are obtained, and obtains the corresponding relationship between the major key and the external key;
According to the major key of major key and the standard scale in the initial table, establish between the initial table and the standard scale
Mapping relations;
According to the corresponding relationship and the standard scale between external key, the major key and the external key in the initial table
In external key, the corresponding relationship between the major key and the external key, establish between the critical field and the criteria field
Mapping relations.
In one of the embodiments, the method also includes:
When in the standard scale without criteria field corresponding with critical field, corresponding mark is added in the standard scale
Quasi- field, and standard value is set for the criteria field;
When in the initial table without critical field corresponding with criteria field, the criteria field is retained to the mark
In standardization table, and by the standard value of the criteria field, it is set as the standard value of corresponding field in the standardization table.
A kind of data normalization processing unit, described device include:
It includes primary data in the initial table that initial table, which obtains module for obtaining initial table,;
Critical field extraction module, for extracting the critical field of the primary data from the initial table;
Mapping relations obtain module, for obtaining the mapping relations between the initial table and standard scale;The standard scale
In include criteria field;
Field conversion module, for according to the mapping relations, the critical field to be converted to criteria field;
Table generation module is standardized, for generating mark corresponding with the initial table using multiple criteria fields after conversion
Standardization table.
Described device in one of the embodiments, further include:
First detection module, for establishing and the connection of third party database;To described in third party database acquisition
The initial table is labeled as original table by initial table;The initial table is initially verified using the original table;When passing through
When initial verification, completeness check is carried out to multiple critical fielies in the initial table.
Described device in one of the embodiments, further include:
Second detection module, for obtaining the corresponding type of the standardization table, the type includes medical type and reason
Pay for type;The standardization table of corresponding medical type and the standardization table for type of settling a claim are obtained according to user identifier;It will be described
The standardization table of medical type and the standardization table for type of settling a claim carry out cross validation, identify the standardization table of medical type with
And the variance data between the standardization table of Claims Resolution type.
A kind of computer equipment, including memory and processor, the memory are stored with computer program, the processing
Device performs the steps of when executing the computer program
Initial table is obtained, includes primary data in the initial table;
The critical field of the primary data is extracted from the initial table;
Obtain the mapping relations between the initial table and standard scale;It include criteria field in the standard scale;
According to the mapping relations, the critical field is converted into criteria field;
Standardization table corresponding with the initial table is generated using multiple criteria fields after conversion.
A kind of computer readable storage medium, is stored thereon with computer program, and the computer program is held by processor
It is performed the steps of when row
Initial table is obtained, includes primary data in the initial table;
The critical field of the primary data is extracted from the initial table;
Obtain the mapping relations between the initial table and standard scale;It include criteria field in the standard scale;
According to the mapping relations, the critical field is converted into criteria field;
Standardization table corresponding with the initial table is generated using multiple criteria fields after conversion.
Above-mentioned data standardization processing method, device, computer equipment and storage medium extract initial number by initial table
According to critical field, and the mapping relations between initial table and standard scale are obtained, due to the mapping between initial table and standard scale
Relationship, can embody the corresponding relationship of the criteria field between the critical field and standard scale in initial table, therefore can will be according to reflecting
It penetrates relationship and critical field is converted into criteria field, generate standard corresponding with initial table using multiple criteria fields after conversion
Change table, due to may be implemented it is multiple area in data between standardization, can for different regions data update and
Arrangement etc. provides convenience.
Detailed description of the invention
Fig. 1 is the application scenario diagram of data standardization processing method in one embodiment;
Fig. 2 is the flow diagram of data standardization processing method in one embodiment;
Fig. 3 is the flow diagram of data standardization processing method in another embodiment;
Fig. 4 is the flow diagram of data standardization processing method in further embodiment;
Fig. 5 is the structural block diagram of data normalization processing unit in one embodiment;
Fig. 6 is the internal structure chart of computer equipment in one embodiment.
Specific embodiment
It is with reference to the accompanying drawings and embodiments, right in order to which the objects, technical solutions and advantages of the application are more clearly understood
The application is further elaborated.It should be appreciated that specific embodiment described herein is only used to explain the application, not
For limiting the application.
Data standardization processing method provided by the present application can be applied in application environment as shown in Figure 1.Wherein,
Terminal 102 is communicated with server 104 by network by network.Server 104 obtains just from the database of terminal 102
Beginning table includes primary data in initial table, and the critical field of primary data is extracted from the initial table of acquisition, obtains initial table
Mapping relations between standard scale include criteria field in standard scale, according to mapping relations, critical field are converted to mark
Quasi- field generates standardization table corresponding with initial table using multiple criteria fields after conversion.Wherein, terminal 102 can with but
It is not limited to various personal computers, laptop, smart phone, tablet computer and portable wearable device, server
104 can be realized with the server cluster of the either multiple server compositions of independent server.
In one embodiment, as shown in Fig. 2, providing a kind of data standardization processing method, it is applied in this way
It is illustrated for server in Fig. 1, comprising the following steps:
S202 obtains initial table, includes primary data in initial table.
Wherein, initial table is the table before not being standardized in disparate databases, the medical treatment including multiple areas
Data and insurance data.Primary data is the data that the initial table of different regions includes, and can be different critical field, closes
The different values of the definition of key field and corresponding critical field.
Specifically, server obtains the table not being standardized, i.e. initial table from the database of different regions,
It include the medical data and insurance data in multiple areas, wherein medical data includes the personal information of user, such as gender, year
Age, height and weight etc., the medical records including user, such as state of an illness diagnosis and treatment, medication, expense, treatment time and treatment place
Deng.Insurance data includes the insured information of user, i.e., the essential information of insured people, and the premium information including insured people is that is, insured
Payment information, the relevant information of Claims Resolution and the payment information of Claims Resolution including insured people etc..
S204 extracts the critical field of primary data from initial table.
Wherein, initial table includes multiple primary datas, and primary data includes the definition of different critical fielies, critical field
And the different values of corresponding critical field.
Specifically, server extracts the corresponding critical field of multiple primary datas, initial table packet from the initial table obtained
It includes: insured people's essential information initial table, insured information initial table, premium information initial table, medical information initial table, payment information
Initial table and the thin item information initial table of Claims Resolution.
Further, insured people's essential information initial table includes the essential information of insured people, and corresponding critical field includes:
Gender, age, height, weight of insured people etc., insured information initial table include the insured information of insured people, corresponding keyword
Section includes: ID card information, account information, work unit and contact method etc., and premium information initial table includes insured people
Payment information, corresponding critical field includes: insured people, payment time, payment approach and payment number etc., at the beginning of medical information
Beginning table includes the insured relevant information for ruling treatment by men, corresponding critical field include: diagnosis and treatment, go out admission time, medication, medical fee with
And treatment place etc., payment information initial table include the payment information of insured people Claims Resolution, corresponding critical field include: insurance kind,
Claims Resolution timeliness and settling fee etc., thin item information initial table of settling a claim include the thin item information of insured people's Claims Resolution, corresponding key
Field includes: diagnosis and treatment, consumption cost and hospitalization cost etc..
S206 obtains the mapping relations between initial table and standard scale;It include criteria field in standard scale.
Wherein, server needs to pre-establish the mapping relations between initial table and standard scale, and is stored in database, needs
When wanting normalized operation, server obtains mark from the mapping relations read between initial table and standard scale in database
Criteria field in quasi- table.
Specifically, server obtains the major key and external key in initial table, and obtains the correspondence between major key and the external key
Relationship;The major key and external key in institute's standard scale are obtained, and obtains the corresponding relationship between major key and the external key;According to initial table
In major key and standard scale major key, establish the mapping relations between initial table and the standard scale;According in initial table external key,
The corresponding relationship between the external key in corresponding relationship and standard scale, major key and external key between major key and external key, establishes critical field
Mapping relations between criteria field, server obtain the mapping relations between the initial table and standard scale pre-established.
Critical field is converted to criteria field according to mapping relations by S208.
Wherein, server is established between initial table and the standard scale according to the major key of major key and standard scale in initial table
Mapping relations;According to external key, the major key in the corresponding relationship and standard scale between external key, major key and the external key in initial table
Corresponding relationship between external key establishes the mapping relations between critical field and criteria field.
Specifically, standard scale includes criteria field, and initial table includes critical field, and server is according to initial table and standard scale
Between mapping relations and critical field and criteria field between mapping relations, critical field is converted into criteria field,
And the value of critical field is converted into the value of criteria field.
Further, between the initial table and standard scale of different regions there are different mapping relations, a ground wherein
Area, there are one-to-one relationships between initial table and standard scale, for example, the A01 table in initial table correspond to it is insured in standard scale
People's essential information standard scale, initial table A02 correspond to insured information standard table, i.e. corresponding relationship between initial table and standard scale is
Fixed, and there is also in one-to-one relationship, such as initial table A01 for the critical field between corresponding initial table and standard scale
The insured people's name Nam of critical field, key in initial table A02 corresponding with the name in insured people's essential information standard scale
The insured people's ID number IDN of field is corresponding with the ID number in insured information standard table.
Regional at another, the field between initial table and standard scale, there are different corresponding relationships, for example, insured letter
Cease A02 table and A03 table of the information in initial table in standard scale.Wherein, insurance kind type origin is in initial table AC02
Insurance kind type field, identity category then derives from the identity category field in initial table A03 table.In this case, it can relate to
And to initial table to the adjustment of the table structure of standard scale, also, what is recorded in A02 table is to extract the static letter at moment in data
Breath, and then had recorded in A03 table insurant identity category and the corresponding period.For example, IDN in the data of Shaoxing
There are two records in A03 for the insurant of " 0000002044 ", before on December 7th, 2010, with rank-and-file employee's identity
Insured, then insured with civil servant's identity later, there is also identical situations in the data of Jilin.In this case, then it is assumed that A02
All insured periods of the insurance kind type suitable for the insurant A03 table in table.
S210 generates standardization table corresponding with initial table using multiple criteria fields after conversion.
Specifically, standardization table is the initial table by standardization, is reflected due to existing between standard scale and initial table
Relationship is penetrated, and there are mapping relations between critical field and criteria field, therefore critical field is converted into criteria field, and will
After the value of critical field is converted into the value of criteria field, standardization table corresponding with initial table can be obtained.
In above-mentioned data standardization processing method, the critical field of primary data is extracted by initial table, and is obtained initial
Mapping relations between table and standard scale can embody the pass in initial table due to the mapping relations between initial table and standard scale
The corresponding relationship of criteria field between key field and standard scale, therefore will can be converted into marking by critical field according to mapping relations
Quasi- field generates standardization table corresponding with initial table using multiple criteria fields after conversion, due to may be implemented multiplely
The standardization between data in area, therefore can be provided convenience for the data update and arrangement etc. of different regions.
In one embodiment, as shown in figure 3, providing a kind of data standardization processing method, obtain initial table it
Before, this method further include:
S302 establishes the connection with third party database.
Specifically, third party database includes the database of different platform or different regions, such as the insured letter in somewhere
Storing data library is ceased, for storing the insured information of this area difference personnel or the user information storing data library of some hospital,
Essential information including user, such as gender, age, height and weight further include medical information, medication, between treatment and expense
Etc. information.Server can pass through calling interface or the connection of network communication foundation and third party database.
S304, obtains initial table from third party database, and initial table is labeled as original table.
Specifically, server obtains multiple initial tables from multiple third party databases, and different initial tables include different
Critical field, and different critical fielies have different values.Corresponding mark is added for different initial tables, it will be initial according to mark
List notation is original table.
S306 initially verifies initial table using original table.
Wherein, initial verify includes:
(1) server counts the data strip mesh number that multiple initial tables include, and obtains the data entry in original table
Number, the data strip mesh number in initial table and the entry number in original table are compared, when entry number in initial table and original
In table in entry number allowed band in the same size or in error size, error range can be set to [- 10,10], also
It is to say, the entry number of initial table can be greater than the entry thing that original table entry number is also smaller than original table, and range is [- 10,10].
Specifically, data strip mesh number includes total entry number and divides year entry number, can determine whether corresponding city according to entry number
In data it is whether abnormal, such as total entry number with divide year entry number total value differ or the entry number of the previous year with it is latter
The numerical value difference of the entry number in year is very big, illustrates that the data in the city are in abnormality.
(2) server, which obtains in initial table and original table, includes, regional, the corresponding size of population of data cover and
Corresponding relationship between critical field, and according to the size of population of data institute covering area and the data strip mesh number of this area it
Between whether correspond to and whether the size of population consistent with the size of critical field value, judge whether data abnormal in initial table.
Specifically, when the size of population of data institute covering area and the data strip mesh number of this area is in the same size or population
Quantity and critical field value it is in the same size when, the data in the initial table of corresponding area are in normal condition, on the contrary then locate
In abnormality.
(3) server carries out value condition corresponding with each critical field based on the critical field in multiple initial tables
Detection, when the corresponding value of critical field is in preset zone of reasonableness, corresponding value is effective status.Work as critical field
Value when being not in preset zone of reasonableness or value and lacking, the value of corresponding critical field is invalid state.
Specifically, when the admission time of people insured in medical information initial table is on 2 10th, 1900, discharge time is
On March 2nd, 1900, it is not belonging to preset time range, for example herein can set preset time range to nineteen ninety to 2018
Year, then illustrate in this medical information initial table, the corresponding admission time out of insured people is ineffective time, belongs to invalid data, can
This invalid data is deleted, or is supplemented according to the business hours, such as the relevant consultation hours of diagnosis and treatment, as go out be admitted to hospital
Time.
(4) server obtains the access situation in multiple areas, when repeatedly access occur, and data coincidence occur, in time
The value of critical field each in initial table is updated, legacy data is revised as updated data.For example, in the area A first
When secondary access, the size of population of acquisition is 500,000, and when fetching for the second time, the size of population of acquisition is 510,000, then needs first
The data of secondary acquisition are updated to the data of second of acquisition, realize the update processing repeated when fetching.
It further include obtaining the region of access covering and access time range every time when judging certain city for repeatedly access,
Timely update data.For example, having lacked student data when the area A is fetched for the first time, first is not covered when fetching for the second time
The region of secondary access, lacking in information needs be replenished in time, by fetch and expand again former scope operation realize.
S308, when by carrying out completeness check to multiple critical fielies in institute's initial table when initially verifying.
Specifically, completeness check is to carry out verification operation, packet to the integrality of multiple critical fielies in initial table
Include the integrality of field integrality and field value.
Wherein, field integrality include the multiple critical fielies having in each initial table are identified and are classified, and
Preset field rule list acquired multiple critical fielies and preset field rule list is compared, when in initial table
When the number for the critical field being had meets the number recorded in preset field rule list, show the keyword in each initial table
Section is in good working condition.
The integrality of field value includes that critical field is divided into three types and is checked respectively, including numeric type, word
Symbol type and date type obtain multiple field values of above-mentioned three kinds of field types respectively, and according to multiple field values, respectively
Value distribution map corresponding with field type is generated, value distribution map is extracted, value range is generated according to distribution map, according to value
Range can determine whether, the value condition of corresponding field, the respective field in some initial table, and value goes beyond the scope or do not include taking
When being worth most values in range, shows that the value of the field is imperfect or there are invalid values, invalid value or root can be deleted
Value is supplemented according to business rule, realizes that field value obtains integrality.
Above-mentioned data standardization processing method, server utilize original table pair by the way that initial table is labeled as original table
Initial table is initially verified, including to the statistics of data strip mesh number, data in initial table, whether abnormal, critical field value is
No invalid and with the presence or absence of the inspection that data are overlapped, by also needing to carry out integrity check after initially verifying, including field is complete
The inspection of the integrality of whole property and field value can be realized more by before critical field is converted into criteria field in initial table
The data detection in orientation reduces the inflow of invalid data, reduces field amount of translation, improves transfer efficiency.
In another embodiment, as shown in figure 4, providing a kind of data standardization processing method, this method further include:
S402, obtains the corresponding type of standardization table, and type includes medical type and Claims Resolution type.
Wherein, the critical field in initial table is converted into criteria field, and can using multiple criteria fields after conversion
Generate standardization table corresponding with initial table.
Specifically, standardization table includes the standardization table of medical type and the standardization table of Claims Resolution type, wherein medical class
The standardization table of type includes: insured people's essential information standardization table, the essential information including insured people, corresponding critical field packet
It includes: gender, age, height, weight of insured people etc. and medical information standardization table, including the insured related letter for ruling treatment by men
Breath, corresponding critical field include: diagnosis and treatment, out admission time, medication, medical fee and treatment place etc..
The standardization table for type of settling a claim includes: insured information standardization table, the insured information including insured people, corresponding pass
Key field includes: ID card information, account information, work unit and contact method etc., and premium information standardizes table, including ginseng
The payment information of guarantor, corresponding critical field include: insured people, payment time, payment approach and payment number etc., and
Payment information standardize table, including insured people Claims Resolution payment information, corresponding critical field include: insurance kind, Claims Resolution timeliness with
And settling fee etc., and the thin item information standardization table of Claims Resolution, the thin item information including insured people Claims Resolution, corresponding critical field packet
It includes: diagnosis and treatment, consumption cost and hospitalization cost etc..
S404 obtains the standardization table of corresponding medical type and the standardization table for type of settling a claim according to user identifier.
Specifically, user identifier and insured people correspond, and can obtain medical treatment corresponding with insured people according to user identifier
The standardization table of the standardization table of type and type of settling a claim, including insured people's essential information standardize table, medical information standard
Change table, insured information standardization table, premium information standardization table, payment information standardization table and the thin item information standardization of Claims Resolution
Table.
S406, by the standardization table of medical type and the standardization table for type of settling a claim carry out cross validation, identification doctor
Treat the variance data between the standardization table of type and the standardization table for type of settling a claim.
Specifically, multiple critical fielies in the standardization table of medical type, including insured people's essential information standard are obtained
Change each critical field in table and medical information standardization table, obtains multiple keys in the standardization table of Claims Resolution type
Field, including insured information standardization table, premium information standardization table, payment information standardize table, and the thin item information of Claims Resolution
Multiple critical fielies in table are standardized, and obtain the value of different critical fielies, to the pass in different types of standardization table
Key field value carries out cross validation, judges whether the value of the same critical field in various criterion table is consistent, works as value
When consistent, show that the value of the critical field is effective value.
Above-mentioned data standardization processing method by the way that standardization table is divided into medical type and Claims Resolution type, and is distributed and obtains
The value of each critical field in different types of standardization table is taken, and to the critical field in different types of standardization table
Value carries out cross validation, judges whether the value of the same critical field in various criterion table is consistent, when value is consistent,
The value for showing the critical field is effective value, improves the validity of field value.
In one embodiment, a kind of data standardization processing method is provided, is being obtained between initial table and standard scale
Mapping relations before, this method further include:
Server obtains the major key and external key in initial table, and obtains the corresponding relationship between major key and external key;Obtain mark
Major key and external key in quasi- table, and obtain the corresponding relationship between major key and external key;According to the major key and standard scale in initial table
Major key, establish the mapping relations between initial table and standard scale;According to the corresponding pass between external key, major key and the external key in initial table
System and the corresponding relationship between the external key in standard scale, major key and external key, establish the mapping between critical field and criteria field
Relationship.
Specifically, server obtains major key and external key in different initial tables, such as initial for insured people's essential information
The processing of table, major key therein are the gender of insured people, and external key includes age, height and weight of insured people etc., insured information
Major key in initial table includes the ID card information of insured people, and external key includes account information, work unit and the connection of insured people
It is mode etc., wherein insured human nature can there are corresponding relationships with insured people's ID card information, that is to say, that insured person part
Card information includes the gender of insured people, and the age of insured people, is existed with the ID card information and account information of insured people
Corresponding relationship.
Similarly, server obtains the major key and external key in various criterion table, for example medical information standard scale is carried out
Processing, major key are the medical information of insured people, and external key includes admission time, medication, medical fee and treatment place etc., reason
The major key for paying for thin item information standard table is the medical information of insured people, and external key includes consumption cost and hospitalization cost etc..
Wherein, pair of major key and external key in the corresponding relationship and each standard scale of major key and external key in each initial table is obtained
It should be related to, the corresponding relationship between initial table and standard scale can be established, according to pair between external key, major key and the external key in initial table
It should be related to and the corresponding relationship between the external key in standard scale, major key and external key, establish between critical field and criteria field
Mapping relations.
Above-mentioned data standardization processing method, server are built by the major key according to major key and standard scale in initial table
Mapping relations between vertical initial table and standard scale, according to the corresponding relationship between external key, major key and the external key in initial table, Yi Jibiao
External key, major key in quasi- table and the corresponding relationship between external key, establish the mapping relations between critical field and criteria field, can be
Critical field is converted to criteria field, provides direct corresponding relationship, improves the accuracy rate and transfer efficiency of conversion.
In one embodiment, a kind of data standardization processing method, this method are provided further include:
When in standard scale without criteria field corresponding with critical field, server adds corresponding standard in standard scale
Field, and standard value is set for criteria field;When in initial table without critical field corresponding with criteria field, server will be marked
Quasi- field retains into the standardization table, and by the standard value of criteria field, is set as the mark of corresponding field in standardization table
Quasi- value.
Specifically, when nothing criteria field corresponding with critical field in standard scale, that is to say, that closed present in initial table
Key field, without criteria field corresponding with the critical field in standard scale, missing mark corresponding with critical field in standard scale
Quasi- literary name section, server add criteria field corresponding with critical field in standard scale, and according to business rule to be added
Criteria field be arranged standard value.
When nothing critical field corresponding with criteria field in initial table, that is to say, that the criteria field in standard scale, first
Without critical field corresponding with the criteria field in beginning table, the critical field in initial table be in miss status, and server will mark
Quasi- field retains into the standardization table, and by the standard value of criteria field, is set as the mark of corresponding field in standardization table
Quasi- value.
Above-mentioned data standardization processing method is being marked in time in the case where field missing occur in initial table or standard scale
Criteria field corresponding with critical field is added in quasi- table, and is that standard is arranged in added criteria field according to business rule
Value, or criteria field is retained into the standardization table, and by the standard value of criteria field, it is set as in standardization table corresponding
The standard value of field solves the case where initial table or standard literary name section lack before normalized processing, improves standardization
Treatment effeciency.
It should be understood that although each step in the flow chart of Fig. 2-4 is successively shown according to the instruction of arrow,
These steps are not that the inevitable sequence according to arrow instruction successively executes.Unless expressly stating otherwise herein, these steps
Execution there is no stringent sequences to limit, these steps can execute in other order.Moreover, at least one in Fig. 2-4
Part steps may include that perhaps these sub-steps of multiple stages or stage are not necessarily in synchronization to multiple sub-steps
Completion is executed, but can be executed at different times, the execution sequence in these sub-steps or stage is also not necessarily successively
It carries out, but can be at least part of the sub-step or stage of other steps or other steps in turn or alternately
It executes.
In one embodiment, as shown in figure 5, providing a kind of data normalization processing unit, comprising: initial table obtains
Module 502, critical field extraction module 504, mapping relations obtain module 506, and field conversion module 508 and standardization table generate
Module 510, in which:
It includes primary data in initial table that initial table, which obtains module 502 for obtaining initial table,.
Wherein, initial table is the table before not being standardized in disparate databases, the medical treatment including multiple areas
Data and insurance data.Primary data is the data that the initial table of different regions includes, and can be different critical field, closes
The different values of the definition of key field and corresponding critical field.
Specifically, server obtains the table not being standardized, i.e. initial table from the database of different regions,
It include the medical data and insurance data in multiple areas, wherein medical data includes the personal information of user, such as gender, year
Age, height and weight etc., the medical records including user, such as state of an illness diagnosis and treatment, medication, expense, treatment time and treatment place
Deng.Insurance data includes the insured information of user, i.e., the essential information of insured people, and the premium information including insured people is that is, insured
Payment information, the relevant information of Claims Resolution and the payment information of Claims Resolution including insured people etc..
Critical field extraction module 504, for extracting the critical field of primary data from initial table.
Wherein, initial table includes multiple primary datas, and primary data includes the definition of different critical fielies, critical field
And the different values of corresponding critical field.
Specifically, server extracts the corresponding critical field of multiple primary datas, initial table packet from the initial table obtained
It includes: insured people's essential information initial table, insured information initial table, premium information initial table, medical information initial table, payment information
Initial table and the thin item information initial table of Claims Resolution.
Mapping relations obtain module 506, for obtaining the mapping relations between initial table and standard scale;Include in standard scale
Criteria field.
Wherein, server needs to pre-establish the mapping relations between initial table and standard scale, and is stored in database, needs
When wanting normalized operation, server obtains mark from the mapping relations read between initial table and standard scale in database
Criteria field in quasi- table.
Field conversion module 508, for according to mapping relations, critical field to be converted to criteria field.
Wherein, server is established between initial table and the standard scale according to the major key of major key and standard scale in initial table
Mapping relations;According to external key, the major key in the corresponding relationship and standard scale between external key, major key and the external key in initial table
Corresponding relationship between external key establishes the mapping relations between critical field and criteria field.
Specifically, standard scale includes criteria field, and initial table includes critical field, and server is according to initial table and standard scale
Between mapping relations and critical field and criteria field between mapping relations, critical field is converted into criteria field,
And the value of critical field is converted into the value of criteria field.
Table generation module 510 is standardized, for generating mark corresponding with initial table using multiple criteria fields after conversion
Standardization table.
Specifically, standardization table is the initial table by standardization, is reflected due to existing between standard scale and initial table
Relationship is penetrated, and there are mapping relations between critical field and criteria field, therefore critical field is converted into criteria field, and will
After the value of critical field is converted into the value of criteria field, standardization table corresponding with initial table can be obtained.
Above-mentioned data normalization processing unit, the critical field of primary data is extracted by initial table, and obtains initial table
Mapping relations between standard scale can embody the key in initial table due to the mapping relations between initial table and standard scale
The corresponding relationship of criteria field between field and standard scale, therefore critical field can will be converted into standard according to mapping relations
Field generates standardization table corresponding with initial table using multiple criteria fields after conversion, since multiple areas may be implemented
Standardization between interior data, therefore can be provided convenience for the data update and arrangement etc. of different regions.
In one embodiment, a kind of data normalization processing unit is provided, further includes:
First with regard to detection module, for establishing and the connection of third party database;Initial table is obtained from third party database,
Initial table is labeled as original table;Initial table is initially verified using original table;When by when initially verifying, to it is initial
Multiple critical fielies in table carry out completeness check.
Specifically, third party database includes the database of different platform or different regions, such as the insured letter in somewhere
Storing data library is ceased, for storing the insured information of this area difference personnel or the user information storing data library of some hospital,
Essential information including user, such as gender, age, height and weight further include medical information, medication, between treatment and expense
Etc. information.Server can pass through calling interface or the connection of network communication foundation and third party database.
Server obtains multiple initial tables from multiple third party databases, and different initial tables include different keywords
Section, and different critical fielies have different values.Corresponding mark is added for different initial tables, according to mark by initial list notation
For original table.
Wherein, initial verify includes:
(1) server counts the data strip mesh number that multiple initial tables include, and obtains the data entry in original table
Number, the data strip mesh number in initial table and the entry number in original table are compared, when entry number in initial table and original
In table in entry number allowed band in the same size or in error size, error range can be set to [- 10,10], also
It is to say, the entry number of initial table can be greater than the entry thing that original table entry number is also smaller than original table, and range is [- 10,10].
(2) server, which obtains in initial table and original table, includes, regional, the corresponding size of population of data cover and
Corresponding relationship between critical field, and according to the size of population of data institute covering area and the data strip mesh number of this area it
Between whether correspond to and whether the size of population consistent with the size of critical field value, judge whether data abnormal in initial table.
(3) server carries out value condition corresponding with each critical field based on the critical field in multiple initial tables
Detection, when the corresponding value of critical field is in preset zone of reasonableness, corresponding value is effective status.Work as critical field
Value when being not in preset zone of reasonableness or value and lacking, the value of corresponding critical field is invalid state.
(4) server obtains the access situation in multiple areas, when repeatedly access occur, and data coincidence occur, in time
The value of critical field each in initial table is updated, legacy data is revised as updated data.For example, in the area A first
When secondary access, the size of population of acquisition is 500,000, and when fetching for the second time, the size of population of acquisition is 510,000, then needs first
The data of secondary acquisition are updated to the data of second of acquisition, realize the update processing repeated when fetching.
Wherein, field integrality include the multiple critical fielies having in each initial table are identified and are classified, and
Preset field rule list acquired multiple critical fielies and preset field rule list is compared, when in initial table
When the number for the critical field being had meets the number recorded in preset field rule list, show the keyword in each initial table
Section is in good working condition.
The integrality of field value includes that critical field is divided into three types and is checked respectively, including numeric type, word
Symbol type and date type obtain multiple field values of above-mentioned three kinds of field types respectively, and according to multiple field values, respectively
Value distribution map corresponding with field type is generated, value distribution map is extracted, value range is generated according to distribution map, according to value
Range can determine whether, the value condition of corresponding field, the respective field in some initial table, and value goes beyond the scope or do not include taking
When being worth most values in range, shows that the value of the field is imperfect or there are invalid values, invalid value or root can be deleted
Value is supplemented according to business rule, realizes that field value obtains integrality.
Above-mentioned data normalization processing unit, server utilize original table pair by the way that initial table is labeled as original table
Initial table is initially verified, including to the statistics of data strip mesh number, data in initial table, whether abnormal, critical field value is
No invalid and with the presence or absence of the inspection that data are overlapped, by also needing to carry out integrity check after initially verifying, including field is complete
The inspection of the integrality of whole property and field value can be realized more by before critical field is converted into criteria field in initial table
The data detection in orientation reduces the inflow of invalid data, reduces field amount of translation, improves transfer efficiency.
In one embodiment, a kind of data normalization processing unit is provided, further includes:
Second detection module, for obtaining the corresponding type of standardization table, type includes medical type and Claims Resolution type;Root
The standardization table of corresponding medical type and the standardization table for type of settling a claim are obtained according to user identifier;By the standard of medical type
The standardization table for changing table and type of settling a claim carries out cross validation, identifies the mark of the standardization table of medical type and type of settling a claim
Variance data between standardization table.
Specifically, standardization table includes the standardization table of medical type and the standardization table of Claims Resolution type, wherein medical class
The standardization table of type includes: insured people's essential information standardization table, the essential information including insured people, corresponding critical field packet
It includes: gender, age, height, weight of insured people etc. and medical information standardization table, including the insured related letter for ruling treatment by men
Breath, corresponding critical field include: diagnosis and treatment, out admission time, medication, medical fee and treatment place etc..
The standardization table for type of settling a claim includes: insured information standardization table, the insured information including insured people, corresponding pass
Key field includes: ID card information, account information, work unit and contact method etc., and premium information standardizes table, including ginseng
The payment information of guarantor, corresponding critical field include: insured people, payment time, payment approach and payment number etc., and
Payment information standardize table, including insured people Claims Resolution payment information, corresponding critical field include: insurance kind, Claims Resolution timeliness with
And settling fee etc., and the thin item information standardization table of Claims Resolution, the thin item information including insured people Claims Resolution, corresponding critical field packet
It includes: diagnosis and treatment, consumption cost and hospitalization cost etc..
Multiple critical fielies in the standardization table of medical type are obtained, including insured people's essential information standardizes table, with
And each critical field in medical information standardization table, obtain multiple critical fielies in the standardization table of Claims Resolution type, packet
Include insured information standardization table, premium information standardization table, payment information standardization table, and the thin item information standardization table of Claims Resolution
In multiple critical fielies, and obtain the value of different critical fielies, the critical field in different types of standardization table taken
Value carries out cross validation, judges whether the value of the same critical field in various criterion table is consistent, when value is consistent, table
The value of the bright critical field is effective value.
Above-mentioned data normalization processing unit by the way that standardization table is divided into medical type and Claims Resolution type, and is distributed and obtains
The value of each critical field in different types of standardization table is taken, and to the critical field in different types of standardization table
Value carries out cross validation, judges whether the value of the same critical field in various criterion table is consistent, when value is consistent,
The value for showing the critical field is effective value, improves the validity of field value.
In one embodiment, a kind of data normalization processing unit is provided, further includes:
Mapping relations establish module, for obtaining major key and external key in initial table, and obtain major key and the external key it
Between corresponding relationship;The major key and external key in institute's standard scale are obtained, and obtains the corresponding relationship between major key and the external key;Root
According to the major key of major key and standard scale in initial table, the mapping relations between initial table and the standard scale are established;According to initial table
In external key, the external key in corresponding relationship and standard scale between major key and external key, the corresponding relationship between major key and external key, build
Mapping relations between vertical critical field and criteria field.
Specifically, server obtains major key and external key in different initial tables, such as initial for insured people's essential information
The processing of table, major key therein are the gender of insured people, and external key includes age, height and weight of insured people etc., insured information
Major key in initial table includes the ID card information of insured people, and external key includes account information, work unit and the connection of insured people
It is mode etc., wherein insured human nature can there are corresponding relationships with insured people's ID card information, that is to say, that insured person part
Card information includes the gender of insured people, and the age of insured people, is existed with the ID card information and account information of insured people
Corresponding relationship.
Above-mentioned data normalization processing unit, server are built by the major key according to major key and standard scale in initial table
Mapping relations between vertical initial table and standard scale, according to the corresponding relationship between external key, major key and the external key in initial table, Yi Jibiao
External key, major key in quasi- table and the corresponding relationship between external key, establish the mapping relations between critical field and criteria field, can be
Critical field is converted to criteria field, provides direct corresponding relationship, improves the accuracy rate and transfer efficiency of conversion.
In one embodiment, a kind of data normalization device, the device are provided further include:
Third detection module, for adding when in standard scale without criteria field corresponding with critical field in standard scale
Add corresponding criteria field, and standard value is set for criteria field;When nothing critical field corresponding with criteria field in initial table
When, criteria field is retained into the standardization table, and by the standard value of criteria field, is set as corresponding to word in standardization table
The standard value of section.
Specifically, when nothing criteria field corresponding with critical field in standard scale, that is to say, that closed present in initial table
Key field, without criteria field corresponding with the critical field in standard scale, missing mark corresponding with critical field in standard scale
Quasi- literary name section, server add criteria field corresponding with critical field in standard scale, and according to business rule to be added
Criteria field be arranged standard value.
When nothing critical field corresponding with criteria field in initial table, that is to say, that the criteria field in standard scale, first
Without critical field corresponding with the criteria field in beginning table, the critical field in initial table be in miss status, and server will mark
Quasi- field retains into the standardization table, and by the standard value of criteria field, is set as the mark of corresponding field in standardization table
Quasi- value.
Above-mentioned data normalization processing unit is being marked in time in the case where field missing occur in initial table or standard scale
Criteria field corresponding with critical field is added in quasi- table, and is that standard is arranged in added criteria field according to business rule
Value, or criteria field is retained into the standardization table, and by the standard value of criteria field, it is set as in standardization table corresponding
The standard value of field solves the case where initial table or standard literary name section lack before normalized processing, improves standardization
Treatment effeciency.
Specific restriction about data normalization processing unit may refer to above for data standardization processing method
Restriction, details are not described herein.Modules in above-mentioned data normalization processing unit can be fully or partially through software, hard
Part and combinations thereof is realized.Above-mentioned each module can be embedded in the form of hardware or independently of in the processor in computer equipment,
It can also be stored in a software form in the memory in computer equipment, execute the above modules in order to which processor calls
Corresponding operation.
In one embodiment, a kind of computer equipment is provided, which can be server, internal junction
Composition can be as shown in Figure 6.The computer equipment include by system bus connect processor, memory, network interface and
Database.Wherein, the processor of the computer equipment is for providing calculating and control ability.The memory packet of the computer equipment
Include non-volatile memory medium, built-in storage.The non-volatile memory medium is stored with operating system, computer program and data
Library.The built-in storage provides environment for the operation of operating system and computer program in non-volatile memory medium.The calculating
The database of machine equipment is for storing medical data and insurance data.The network interface of the computer equipment is used for and external end
End passes through network connection communication.To realize a kind of data standardization processing method when the computer program is executed by processor.
It will be understood by those skilled in the art that structure shown in Fig. 6, only part relevant to application scheme is tied
The block diagram of structure does not constitute the restriction for the computer equipment being applied thereon to application scheme, specific computer equipment
It may include perhaps combining certain components or with different component layouts than more or fewer components as shown in the figure.
In one embodiment, a kind of computer equipment, including memory and processor are provided, which is stored with
Computer program, the processor perform the steps of when executing computer program
Initial table is obtained, includes primary data in initial table;
The critical field of primary data is extracted from initial table;
Obtain the mapping relations between initial table and standard scale;It include criteria field in standard scale;
According to mapping relations, critical field is converted into criteria field;
Standardization table corresponding with initial table is generated using multiple criteria fields after conversion.
In one embodiment, it is also performed the steps of when processor executes computer program
Establish the connection with third party database;
The initial table is obtained from third party database, initial table is labeled as original table;
Initial table is initially verified using original table;
When by carrying out completeness check to multiple critical fielies in initial table when initially verifying.
In one embodiment, it is also performed the steps of when processor executes computer program
The corresponding type of standardization table is obtained, type includes medical type and Claims Resolution type;
The standardization table of corresponding medical type and the standardization table for type of settling a claim are obtained according to user identifier;
The standardization table of the standardization table of medical type and type of settling a claim is subjected to cross validation, identifies medical type
Standardize the variance data between table and the standardization table for type of settling a claim.
In one embodiment, it is also performed the steps of when processor executes computer program
The major key and external key in initial table are obtained, and obtains the corresponding relationship between major key and the external key;
The major key and external key in institute's standard scale are obtained, and obtains the corresponding relationship between major key and the external key;
According to the major key of major key and standard scale in initial table, the mapping relations between initial table and the standard scale are established;
According to external key, the major key and outer in the corresponding relationship and standard scale between external key, major key and the external key in initial table
Corresponding relationship between key establishes the mapping relations between critical field and criteria field.
In one embodiment, it is also performed the steps of when processor executes computer program
When in standard scale without criteria field corresponding with critical field, corresponding criteria field is added in standard scale,
And standard value is set for criteria field;
When in initial table without critical field corresponding with criteria field, criteria field is retained to the standardization table
In, and by the standard value of criteria field, it is set as the standard value of corresponding field in standardization table.
In one embodiment, a kind of computer readable storage medium is provided, computer program is stored thereon with, is calculated
Machine program performs the steps of when being executed by processor
Initial table is obtained, includes primary data in initial table;
The critical field of primary data is extracted from initial table;
Obtain the mapping relations between initial table and standard scale;It include criteria field in standard scale;
According to mapping relations, critical field is converted into criteria field;
Standardization table corresponding with initial table is generated using multiple criteria fields after conversion.
In one embodiment, it is also performed the steps of when computer program is executed by processor
Establish the connection with third party database;
The initial table is obtained from third party database, initial table is labeled as original table;
Initial table is initially verified using original table;
When by carrying out completeness check to multiple critical fielies in initial table when initially verifying.
In one embodiment, it is also performed the steps of when computer program is executed by processor
The corresponding type of standardization table is obtained, type includes medical type and Claims Resolution type;
The standardization table of corresponding medical type and the standardization table for type of settling a claim are obtained according to user identifier;
The standardization table of the standardization table of medical type and type of settling a claim is subjected to cross validation, identifies medical type
Standardize the variance data between table and the standardization table for type of settling a claim.
In one embodiment, it is also performed the steps of when computer program is executed by processor
The major key and external key in initial table are obtained, and obtains the corresponding relationship between major key and the external key;
The major key and external key in institute's standard scale are obtained, and obtains the corresponding relationship between major key and the external key;
According to the major key of major key and standard scale in initial table, the mapping relations between initial table and the standard scale are established;
According to external key, the major key and outer in the corresponding relationship and standard scale between external key, major key and the external key in initial table
Corresponding relationship between key establishes the mapping relations between critical field and criteria field.
In one embodiment, it is also performed the steps of when computer program is executed by processor
When in standard scale without criteria field corresponding with critical field, corresponding criteria field is added in standard scale,
And standard value is set for criteria field;
When in initial table without critical field corresponding with criteria field, criteria field is retained to the standardization table
In, and by the standard value of criteria field, it is set as the standard value of corresponding field in standardization table.
Those of ordinary skill in the art will appreciate that realizing all or part of the process in above-described embodiment method, being can be with
Relevant hardware is instructed to complete by computer program, the computer program can be stored in a non-volatile computer
In read/write memory medium, the computer program is when being executed, it may include such as the process of the embodiment of above-mentioned each method.Wherein,
To any reference of memory, storage, database or other media used in each embodiment provided herein,
Including non-volatile and/or volatile memory.Nonvolatile memory may include read-only memory (ROM), programming ROM
(PROM), electrically programmable ROM (EPROM), electrically erasable ROM (EEPROM) or flash memory.Volatile memory may include
Random access memory (RAM) or external cache.By way of illustration and not limitation, RAM is available in many forms,
Such as static state RAM (SRAM), dynamic ram (DRAM), synchronous dram (SDRAM), double data rate sdram (DDRSDRAM), enhancing
Type SDRAM (ESDRAM), synchronization link (Synchlink) DRAM (SLDRAM), memory bus (Rambus) direct RAM
(RDRAM), direct memory bus dynamic ram (DRDRAM) and memory bus dynamic ram (RDRAM) etc..
Each technical characteristic of above embodiments can be combined arbitrarily, for simplicity of description, not to above-described embodiment
In each technical characteristic it is all possible combination be all described, as long as however, the combination of these technical characteristics be not present lance
Shield all should be considered as described in this specification.
The several embodiments of the application above described embodiment only expresses, the description thereof is more specific and detailed, but simultaneously
It cannot therefore be construed as limiting the scope of the patent.It should be pointed out that coming for those of ordinary skill in the art
It says, without departing from the concept of this application, various modifications and improvements can be made, these belong to the protection of the application
Range.Therefore, the scope of protection shall be subject to the appended claims for the application patent.
Claims (10)
1. a kind of data standardization processing method, which comprises
Initial table is obtained, includes primary data in the initial table;
The critical field of the primary data is extracted from the initial table;
Obtain the mapping relations between the initial table and standard scale;It include criteria field in the standard scale;
According to the mapping relations, the critical field is converted into criteria field;
Standardization table corresponding with the initial table is generated using multiple criteria fields after conversion.
2. the method according to claim 1, wherein before the acquisition initial table, further includes:
Establish the connection with third party database;
The initial table is obtained from the third party database, the initial table is labeled as original table;
The initial table is initially verified using the original table;
When by carrying out completeness check to multiple critical fielies in the initial table when initially verifying.
3. the method according to claim 1, wherein the critical field includes user identifier;The method is also
Include:
The corresponding type of the standardization table is obtained, the type includes medical type and Claims Resolution type;
The standardization table of corresponding medical type and the standardization table for type of settling a claim are obtained according to user identifier;
The standardization table of the standardization table of the medical type and type of settling a claim is subjected to cross validation, identifies medical type
Standardize the variance data between table and the standardization table for type of settling a claim.
4. according to claim 1 to method described in 3 any one, which is characterized in that obtain the initial table and mark described
Before mapping relations between quasi- table, the method also includes:
The major key and external key in the initial table are obtained, and obtains the corresponding relationship between the major key and the external key;
The major key and external key in the standard scale are obtained, and obtains the corresponding relationship between the major key and the external key;
According to the major key of major key and the standard scale in the initial table, reflecting between the initial table and the standard scale is established
Penetrate relationship;
According in the corresponding relationship and the standard scale between external key, the major key and the external key in the initial table
Corresponding relationship between external key, the major key and the external key, establishes the mapping between the critical field and the criteria field
Relationship.
5. according to claim 1 to method described in 3 any one, which is characterized in that the method also includes:
When in the standard scale without criteria field corresponding with critical field, corresponding standard word is added in the standard scale
Section, and standard value is set for the criteria field;
When in the initial table without critical field corresponding with criteria field, the criteria field is retained to the standardization
In table, and by the standard value of the criteria field, it is set as the standard value of corresponding field in the standardization table.
6. a kind of data normalization processing unit, which is characterized in that described device includes:
It includes primary data in the initial table that initial table, which obtains module for obtaining initial table,;
Critical field extraction module, for extracting the critical field of the primary data from the initial table;
Mapping relations obtain module, for obtaining the mapping relations between the initial table and standard scale;It is wrapped in the standard scale
Criteria field is included;
Field conversion module, for according to the mapping relations, the critical field to be converted to criteria field;
Table generation module is standardized, for generating standardization corresponding with the initial table using multiple criteria fields after conversion
Table.
7. device according to claim 6, which is characterized in that described device further include:
First detection module, for establishing and the connection of third party database;The third party database is obtained described initial
The initial table is labeled as original table by table;The initial table is initially verified using the original table;When by initial
When verification, completeness check is carried out to multiple critical fielies in the initial table.
8. device according to claim 6, which is characterized in that described device further include:
Second detection module, for obtaining the corresponding type of the standardization table, the type includes medical type and Claims Resolution class
Type;The standardization table of corresponding medical type and the standardization table for type of settling a claim are obtained according to user identifier;By the medical treatment
The standardization table of type and the standardization table for type of settling a claim carry out cross validation, identify the standardization table and reason of medical type
Pay for the variance data between the standardization table of type.
9. a kind of computer equipment, including memory and processor, the memory are stored with computer program, feature exists
In the step of processor realizes any one of claims 1 to 5 the method when executing the computer program.
10. a kind of computer readable storage medium, is stored thereon with computer program, which is characterized in that the computer program
The step of method described in any one of claims 1 to 5 is realized when being executed by processor.
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810925040.5A CN109189769A (en) | 2018-08-14 | 2018-08-14 | Data standardization processing method, device, computer equipment and storage medium |
PCT/CN2019/099402 WO2020034873A1 (en) | 2018-08-14 | 2019-08-06 | Data standardization method and apparatus, computer device, and storage medium |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810925040.5A CN109189769A (en) | 2018-08-14 | 2018-08-14 | Data standardization processing method, device, computer equipment and storage medium |
Publications (1)
Publication Number | Publication Date |
---|---|
CN109189769A true CN109189769A (en) | 2019-01-11 |
Family
ID=64921727
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201810925040.5A Pending CN109189769A (en) | 2018-08-14 | 2018-08-14 | Data standardization processing method, device, computer equipment and storage medium |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN109189769A (en) |
WO (1) | WO2020034873A1 (en) |
Cited By (26)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109739864A (en) * | 2019-01-24 | 2019-05-10 | 易保互联医疗信息科技(北京)有限公司 | The acquisition of people society data and sharing method, computer storage medium and computer equipment |
CN109902083A (en) * | 2019-02-26 | 2019-06-18 | 北京明略软件系统有限公司 | Method, apparatus, computer storage medium and the terminal of a kind of pair of mark processing |
CN110263016A (en) * | 2019-05-20 | 2019-09-20 | 平安普惠企业管理有限公司 | Data processing method, terminal device and computer storage medium |
CN110457323A (en) * | 2019-08-08 | 2019-11-15 | 北京明略软件系统有限公司 | The processing method and processing device of tables of data |
CN110569236A (en) * | 2019-09-03 | 2019-12-13 | 北京明略软件系统有限公司 | Data management method and device |
CN110597786A (en) * | 2019-09-03 | 2019-12-20 | 北京明略软件系统有限公司 | Structured data management method and device |
CN110727710A (en) * | 2019-10-12 | 2020-01-24 | 平安医疗健康管理股份有限公司 | Data analysis method and device, computer equipment and storage medium |
WO2020034873A1 (en) * | 2018-08-14 | 2020-02-20 | 平安医疗健康管理股份有限公司 | Data standardization method and apparatus, computer device, and storage medium |
CN111008523A (en) * | 2019-11-21 | 2020-04-14 | 中科鼎富(北京)科技发展有限公司 | Information extraction method and device and server |
CN111026757A (en) * | 2019-12-10 | 2020-04-17 | 首都医科大学附属北京友谊医院 | Method, device, equipment and storage medium for generating standard statistical format data |
CN111046035A (en) * | 2019-10-29 | 2020-04-21 | 三盟科技股份有限公司 | Data automation processing method, system, computer equipment and readable storage medium |
CN111061733A (en) * | 2019-12-10 | 2020-04-24 | 北京明略软件系统有限公司 | Data processing method and device, electronic equipment and computer readable storage medium |
CN111340636A (en) * | 2020-02-27 | 2020-06-26 | 平安医疗健康管理股份有限公司 | Data validity detection method and device, computer equipment and storage medium |
CN111369370A (en) * | 2020-03-31 | 2020-07-03 | 中国建设银行股份有限公司 | Estimation table processing method, device, server and storage medium |
CN111488327A (en) * | 2019-01-29 | 2020-08-04 | 卓望数码技术(深圳)有限公司 | Data standard management method and system |
CN111984654A (en) * | 2020-08-31 | 2020-11-24 | 平安医疗健康管理股份有限公司 | Method and device for standardized storage of medical insurance data and computer equipment |
CN112069204A (en) * | 2020-09-30 | 2020-12-11 | 北京百度网讯科技有限公司 | Processing method and device for operator service, intelligent workstation and electronic equipment |
CN112069774A (en) * | 2020-09-03 | 2020-12-11 | 微医云(杭州)控股有限公司 | Data mapping method and device, electronic terminal and storage medium |
CN112270222A (en) * | 2020-10-14 | 2021-01-26 | 招商银行股份有限公司 | Information standardization processing method, equipment and computer readable storage medium |
CN112380214A (en) * | 2020-11-13 | 2021-02-19 | 北京神州泰岳智能数据技术有限公司 | Data processing method and device and electronic equipment |
CN112527970A (en) * | 2020-12-24 | 2021-03-19 | 上海浦东发展银行股份有限公司 | Data dictionary standardization processing method, device, equipment and storage medium |
CN112653756A (en) * | 2020-12-20 | 2021-04-13 | 国网山东省电力公司临沂供电公司 | Intelligent data processing system and method for Internet of things |
CN112651296A (en) * | 2020-11-23 | 2021-04-13 | 安徽继远软件有限公司 | Method and system for automatically detecting data quality problem without prior knowledge |
CN113704250A (en) * | 2021-07-16 | 2021-11-26 | 杭州医康慧联科技股份有限公司 | Data batch processing method suitable for medical data |
CN113971993A (en) * | 2021-10-22 | 2022-01-25 | 浙江太美医疗科技股份有限公司 | Clinical test data conversion method and device, computer equipment and storage medium |
CN115905455A (en) * | 2022-12-31 | 2023-04-04 | 北京和兴创联健康科技有限公司 | Method for standardizing hospital database based on automatic detection technology |
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20040139421A1 (en) * | 2002-12-09 | 2004-07-15 | Tekelec | Automated methods and systems for generating and updated user-specific industry standards compliance reporting software |
CN101990208A (en) * | 2009-07-31 | 2011-03-23 | 中国移动通信集团公司 | Automatic data checking method, system and equipment |
CN103729337A (en) * | 2013-12-27 | 2014-04-16 | 金蝶软件(中国)有限公司 | Report conversion method and device |
CN104991929A (en) * | 2015-06-30 | 2015-10-21 | 李海军 | Traffic flow data collection method and system |
CN107295039A (en) * | 2016-03-31 | 2017-10-24 | 阿里巴巴集团控股有限公司 | Data access treating method and apparatus |
CN107743153A (en) * | 2017-05-19 | 2018-02-27 | 贵州白山云科技有限公司 | A kind of IP address database generation method and device |
CN107783950A (en) * | 2017-04-11 | 2018-03-09 | 平安医疗健康管理股份有限公司 | Package insert processing method and processing device |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8806322B2 (en) * | 2011-11-28 | 2014-08-12 | Google Inc. | System and method for obtaining a structured address by geocoding unstructured address information |
CN107767929B (en) * | 2017-11-13 | 2024-04-05 | 医渡云(北京)技术有限公司 | Case report form filling method and device, electronic equipment and storage medium |
CN107909493B (en) * | 2017-12-04 | 2020-07-17 | 泰康保险集团股份有限公司 | Policy information processing method and device, computer equipment and storage medium |
CN109189769A (en) * | 2018-08-14 | 2019-01-11 | 平安医疗健康管理股份有限公司 | Data standardization processing method, device, computer equipment and storage medium |
-
2018
- 2018-08-14 CN CN201810925040.5A patent/CN109189769A/en active Pending
-
2019
- 2019-08-06 WO PCT/CN2019/099402 patent/WO2020034873A1/en active Application Filing
Patent Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20040139421A1 (en) * | 2002-12-09 | 2004-07-15 | Tekelec | Automated methods and systems for generating and updated user-specific industry standards compliance reporting software |
CN101990208A (en) * | 2009-07-31 | 2011-03-23 | 中国移动通信集团公司 | Automatic data checking method, system and equipment |
CN103729337A (en) * | 2013-12-27 | 2014-04-16 | 金蝶软件(中国)有限公司 | Report conversion method and device |
CN104991929A (en) * | 2015-06-30 | 2015-10-21 | 李海军 | Traffic flow data collection method and system |
CN107295039A (en) * | 2016-03-31 | 2017-10-24 | 阿里巴巴集团控股有限公司 | Data access treating method and apparatus |
CN107783950A (en) * | 2017-04-11 | 2018-03-09 | 平安医疗健康管理股份有限公司 | Package insert processing method and processing device |
CN107743153A (en) * | 2017-05-19 | 2018-02-27 | 贵州白山云科技有限公司 | A kind of IP address database generation method and device |
Cited By (34)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2020034873A1 (en) * | 2018-08-14 | 2020-02-20 | 平安医疗健康管理股份有限公司 | Data standardization method and apparatus, computer device, and storage medium |
CN109739864B (en) * | 2019-01-24 | 2021-03-23 | 易保互联医疗信息科技(北京)有限公司 | Human-social data acquisition and sharing method, computer storage medium and computer equipment |
CN109739864A (en) * | 2019-01-24 | 2019-05-10 | 易保互联医疗信息科技(北京)有限公司 | The acquisition of people society data and sharing method, computer storage medium and computer equipment |
CN111488327A (en) * | 2019-01-29 | 2020-08-04 | 卓望数码技术(深圳)有限公司 | Data standard management method and system |
CN111488327B (en) * | 2019-01-29 | 2023-08-22 | 卓望数码技术(深圳)有限公司 | Data standard management method and system |
CN109902083A (en) * | 2019-02-26 | 2019-06-18 | 北京明略软件系统有限公司 | Method, apparatus, computer storage medium and the terminal of a kind of pair of mark processing |
CN110263016A (en) * | 2019-05-20 | 2019-09-20 | 平安普惠企业管理有限公司 | Data processing method, terminal device and computer storage medium |
CN110457323A (en) * | 2019-08-08 | 2019-11-15 | 北京明略软件系统有限公司 | The processing method and processing device of tables of data |
CN110569236A (en) * | 2019-09-03 | 2019-12-13 | 北京明略软件系统有限公司 | Data management method and device |
CN110597786A (en) * | 2019-09-03 | 2019-12-20 | 北京明略软件系统有限公司 | Structured data management method and device |
CN110727710A (en) * | 2019-10-12 | 2020-01-24 | 平安医疗健康管理股份有限公司 | Data analysis method and device, computer equipment and storage medium |
CN110727710B (en) * | 2019-10-12 | 2023-02-07 | 平安医疗健康管理股份有限公司 | Data analysis method and device, computer equipment and storage medium |
CN111046035A (en) * | 2019-10-29 | 2020-04-21 | 三盟科技股份有限公司 | Data automation processing method, system, computer equipment and readable storage medium |
CN111008523A (en) * | 2019-11-21 | 2020-04-14 | 中科鼎富(北京)科技发展有限公司 | Information extraction method and device and server |
CN111061733A (en) * | 2019-12-10 | 2020-04-24 | 北京明略软件系统有限公司 | Data processing method and device, electronic equipment and computer readable storage medium |
CN111061733B (en) * | 2019-12-10 | 2024-01-19 | 北京明略软件系统有限公司 | Data processing method, device, electronic equipment and computer readable storage medium |
CN111026757B (en) * | 2019-12-10 | 2023-10-10 | 首都医科大学附属北京友谊医院 | Method, device, equipment and storage medium for generating standard statistical format data |
CN111026757A (en) * | 2019-12-10 | 2020-04-17 | 首都医科大学附属北京友谊医院 | Method, device, equipment and storage medium for generating standard statistical format data |
CN111340636A (en) * | 2020-02-27 | 2020-06-26 | 平安医疗健康管理股份有限公司 | Data validity detection method and device, computer equipment and storage medium |
CN111369370A (en) * | 2020-03-31 | 2020-07-03 | 中国建设银行股份有限公司 | Estimation table processing method, device, server and storage medium |
CN111369370B (en) * | 2020-03-31 | 2024-03-19 | 中国建设银行股份有限公司 | Evaluation list processing method, device, server and storage medium |
CN111984654A (en) * | 2020-08-31 | 2020-11-24 | 平安医疗健康管理股份有限公司 | Method and device for standardized storage of medical insurance data and computer equipment |
CN112069774A (en) * | 2020-09-03 | 2020-12-11 | 微医云(杭州)控股有限公司 | Data mapping method and device, electronic terminal and storage medium |
CN112069204A (en) * | 2020-09-30 | 2020-12-11 | 北京百度网讯科技有限公司 | Processing method and device for operator service, intelligent workstation and electronic equipment |
CN112270222A (en) * | 2020-10-14 | 2021-01-26 | 招商银行股份有限公司 | Information standardization processing method, equipment and computer readable storage medium |
CN112380214A (en) * | 2020-11-13 | 2021-02-19 | 北京神州泰岳智能数据技术有限公司 | Data processing method and device and electronic equipment |
CN112651296A (en) * | 2020-11-23 | 2021-04-13 | 安徽继远软件有限公司 | Method and system for automatically detecting data quality problem without prior knowledge |
CN112653756B (en) * | 2020-12-20 | 2022-09-06 | 国网山东省电力公司临沂供电公司 | Intelligent data processing system and method for Internet of things |
CN112653756A (en) * | 2020-12-20 | 2021-04-13 | 国网山东省电力公司临沂供电公司 | Intelligent data processing system and method for Internet of things |
CN112527970A (en) * | 2020-12-24 | 2021-03-19 | 上海浦东发展银行股份有限公司 | Data dictionary standardization processing method, device, equipment and storage medium |
CN113704250A (en) * | 2021-07-16 | 2021-11-26 | 杭州医康慧联科技股份有限公司 | Data batch processing method suitable for medical data |
CN113971993A (en) * | 2021-10-22 | 2022-01-25 | 浙江太美医疗科技股份有限公司 | Clinical test data conversion method and device, computer equipment and storage medium |
CN115905455A (en) * | 2022-12-31 | 2023-04-04 | 北京和兴创联健康科技有限公司 | Method for standardizing hospital database based on automatic detection technology |
CN115905455B (en) * | 2022-12-31 | 2023-09-29 | 北京和兴创联健康科技有限公司 | Method for normalizing hospital database based on automatic detection technology |
Also Published As
Publication number | Publication date |
---|---|
WO2020034873A1 (en) | 2020-02-20 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN109189769A (en) | Data standardization processing method, device, computer equipment and storage medium | |
CN109474578B (en) | Message checking method, device, computer equipment and storage medium | |
CN109800335A (en) | Generation method, device, computer equipment and the storage medium of enterprise's map | |
CN109523153A (en) | Acquisition methods, device, computer equipment and the storage medium of illegal fund collection enterprise | |
CN108876133A (en) | Risk assessment processing method, device, server and medium based on business information | |
CN109359939A (en) | Business datum method of calibration, device, computer equipment and storage medium | |
CN109659013A (en) | Illness point is examined and method for optimizing route, device, equipment and storage medium | |
CN109445842A (en) | Rule generating method, device, computer equipment and storage medium | |
CN109033058B (en) | Contract text verification method, apparatus, computer device and storage medium | |
CN109816327A (en) | Contract dataset processing method, device, computer equipment and storage medium | |
CN108629567A (en) | Declaration information processing method, device, computer equipment and storage medium | |
WO2019052221A1 (en) | Insurance data checking method and apparatus, computer device, and readable storage medium | |
CN110111208A (en) | Declaration form data processing method, device, computer equipment and storage medium | |
CN109871445A (en) | Fraudulent user recognition methods, device, computer equipment and storage medium | |
CN111984654A (en) | Method and device for standardized storage of medical insurance data and computer equipment | |
CN110990390A (en) | Data cooperative processing method and device, computer equipment and storage medium | |
CN109325868A (en) | Questionnaire data processing method, device, computer equipment and storage medium | |
CN114298804A (en) | Intelligent account checking method, system and computer readable storage medium | |
CN109767226A (en) | Suspicious transaction statistical views generation method and device based on big data | |
CN110472895B (en) | Financial system wind control method and device, computer equipment and storage medium | |
CN112669140A (en) | Financial account sales processing method and device, computer equipment and storage medium | |
CN110275703A (en) | Assignment method, device, computer equipment and the storage medium of key-value pair data | |
CN111324375A (en) | Code management method and device, computer equipment and storage medium | |
CN113888299A (en) | Wind control decision method and device, computer equipment and storage medium | |
CN109360111A (en) | Questionnaire data modification method, device, computer equipment and storage medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
TA01 | Transfer of patent application right |
Effective date of registration: 20220525 Address after: 518048 China Aviation Center 2901, No. 1018, Huafu Road, Huahang community, Huaqiang North Street, Futian District, Shenzhen, Guangdong Province Applicant after: Shenzhen Ping An medical and Health Technology Service Co.,Ltd. Address before: Room 12G, Area H, 666 Beijing East Road, Huangpu District, Shanghai 200001 Applicant before: PING AN MEDICAL AND HEALTHCARE MANAGEMENT Co.,Ltd. |
|
TA01 | Transfer of patent application right | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20190111 |
|
RJ01 | Rejection of invention patent application after publication |