CN110209891A - A kind of zipper table generating method, device, equipment and medium - Google Patents

A kind of zipper table generating method, device, equipment and medium Download PDF

Info

Publication number
CN110209891A
CN110209891A CN201910532415.6A CN201910532415A CN110209891A CN 110209891 A CN110209891 A CN 110209891A CN 201910532415 A CN201910532415 A CN 201910532415A CN 110209891 A CN110209891 A CN 110209891A
Authority
CN
China
Prior art keywords
data
current
historgraphic
character string
record
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910532415.6A
Other languages
Chinese (zh)
Inventor
杨得力
杨晨
李杨
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Henan Zhongyuan Consumption Finance Co Ltd
Original Assignee
Henan Zhongyuan Consumption Finance Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Henan Zhongyuan Consumption Finance Co Ltd filed Critical Henan Zhongyuan Consumption Finance Co Ltd
Priority to CN201910532415.6A priority Critical patent/CN110209891A/en
Publication of CN110209891A publication Critical patent/CN110209891A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/21Design, administration or maintenance of databases
    • G06F16/219Managing data history or versioning
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/23Updating
    • G06F16/2358Change logging, detection, and notification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/901Indexing; Data structures therefor; Storage structures
    • G06F16/9024Graphs; Linked lists

Abstract

The invention discloses a kind of zipper table generating method, device, equipment and media.The step of this method includes: historgraphic data recording corresponding history feature value of the target matrix under the historical juncture read in data warehouse;Obtain target matrix current data record corresponding with historgraphic data recording under current time;By current data record in the data content of each field be spliced into the second character string, and hash algorithm operation is carried out to the second character string and generates current characteristic value;Judge history feature value and current characteristic value with the presence or absence of difference;If it is, generating the zipper table that record has historgraphic data recording.The whole of the calculation resources of cluster device is occupied in this method relative reduction zipper table generating process, and then ensures the overall operation stability of big data platform and reduces O&M pressure.In addition, the present invention also provides a kind of zipper table creating device, equipment and medium, beneficial effect are same as above.

Description

A kind of zipper table generating method, device, equipment and medium
Technical field
The present invention relates to database fields, more particularly to a kind of zipper table generating method, device, equipment and medium.
Background technique
With the arrival of big data era, each large enterprises often require to build the big data platform of itself, and based on big Data warehouse on data platform is one of application important under big data platform.Data warehouse is for all ranks of enterprise Decision-making process the strategy set that all types data are supported is provided, which is subject-oriented, integrated, time-varying , it is non-volatile.
It is recorded due to being often stored with magnanimity data in actual scene, in the tables of data of data warehouse, in tables of data The content of data record will also tend to generate variation over time, and in the application process to tables of data, user The data record in tables of data is inscribed when it is generally necessary to a certain before tracing, therefore is just needed to tables of data under different historical periods In data record stored.
Biggish waste is caused to memory space in order to avoid the data record in full dose data table memory, currently often Changed data record in tables of data is only saved under historical period by way of zipper table.The purpose of zipper table is to save Data record is before content change in tables of data, until the information of all changes of current state, zipper table is usually reconciliation The history of family information changes the result that content is retained.When being currently generated zipper table, the moment before generally requiring to get The tables of data of tables of data and current time, and corresponding data between the tables of data at moment and the tables of data at current time before comparing Data in record in each respective field, and then sent when certain data of tables of data under moment before and current time records When content change, by before when the data record inscribed save to zipper table.Due to currently carrying out adjacent moment data It is field progress content comparison one by one when the comparison that corresponding data records between table, and in actual scene, in tables of data The field that data record is included is often more, therefore currently in the ratio for carrying out corresponding data record between adjacent moment tables of data Clock synchronization needs to occupy a large amount of calculation resources of cluster device, it is difficult to ensure the overall operation stability of big data platform is easily made At biggish O&M pressure.
It can be seen that provide a kind of zipper table generating method, in relative reduction zipper table generating process to cluster device The whole of calculation resources occupy, and then ensure the overall operation stability of big data platform and reduce O&M pressure, be ability Field technique personnel's technical issues that need to address.
Summary of the invention
The object of the present invention is to provide a kind of zipper table generating method, device, equipment and media, with relative reduction zipper table The entirety of the calculation resources of cluster device is occupied in generating process, and then ensures the overall operation stability of big data platform simultaneously Reduce O&M pressure.
In order to solve the above technical problems, the present invention provides a kind of zipper table generating method, comprising:
Read historgraphic data recording corresponding history feature value of the target matrix in data warehouse under the historical juncture; Wherein, history feature value be by the way that the data content of field each in historgraphic data recording is spliced into the first character string in advance, and Hash algorithm operation generation is carried out to the first character string;
Obtain target matrix current data record corresponding with historgraphic data recording under current time;
By current data record in the data content of each field be spliced into the second character string, and the second character string is carried out Hash algorithm operation generates current characteristic value;
Judge history feature value and current characteristic value with the presence or absence of difference;
If it is, generating the zipper table that record has historgraphic data recording.
Preferably, historgraphic data recording corresponding history of the target matrix under the historical juncture in data warehouse is read Characteristic value, comprising:
The corresponding history feature value of historgraphic data recording is read in preset middle table;Wherein, middle table is based on target The field of tables of data has additional the feature value fields of log history characteristic value.
It preferably, include the data record effective date of storing data record effective date in the field of target matrix Field, zipper table include data record effective date field and data record Expiration Date field.
Preferably, target matrix current data record corresponding with historgraphic data recording, packet under current time are obtained It includes:
Obtain target matrix current data identical with the major key field content of historgraphic data recording under current time Record.
Preferably, hash algorithm includes MD5 hash algorithm.
Preferably, data warehouse includes Hive data warehouse.
Preferably, the data type of the first character string and the second character string is character string type.
In addition, the present invention also provides a kind of zipper table creating devices, comprising:
History feature obtains module, for reading historical data of the target matrix in data warehouse under the historical juncture Record corresponding history feature value;Wherein, history feature value is by advance will be in the data of field each in historgraphic data recording Appearance is spliced into the first character string, and carries out hash algorithm operation generation to the first character string;
Current data obtains module, and for obtaining, target matrix is corresponding with historgraphic data recording under current time to work as Preceding data record;
Current signature computing module, the data content for each field in recording current data are spliced into the second character String, and hash algorithm operation is carried out to the second character string and generates current characteristic value;
Diversity judgement module, for judging history feature value and current characteristic value with the presence or absence of difference, if it is, calling Zipper table generation module;
Zipper table generation module, the zipper table for having historgraphic data recording for generating record.
In addition, the present invention also provides a kind of zipper table generating devices, comprising:
Memory, for storing computer program;
Processor is realized when for executing computer program such as the step of above-mentioned zipper table generating method.
In addition, being stored with meter on computer readable storage medium the present invention also provides a kind of computer readable storage medium Calculation machine program is realized when computer program is executed by processor such as the step of above-mentioned zipper table generating method.
Zipper table generating method provided by the present invention, target matrix in data warehouse first under the reading historical juncture The corresponding history feature value of historical data, wherein by advance by each field in historgraphic data recording when history feature value Data content is spliced into the first character string, and carries out hash algorithm operation generation to the first character string;And then obtain number of targets According to the historgraphic data recording corresponding current data record of the table under current time, by current data record in each field Data content is spliced into the second character string, and carries out identical hash algorithm operation to the second character string and generate current characteristic value, History feature value and current characteristic value are finally judged with the presence or absence of difference, if it is, generating record has the historgraphic data recording Zipper table.This method is that the history of hash algorithm operation generation is carried out by comparing the whole historgraphic data recording of target matrix Whether deposited between characteristic value and the current characteristic value of the whole current data record progress hash algorithm operation generation of target matrix In the mode of difference, the comparison that whether there is difference between the corresponding data record to target matrix under different moments is realized, Since history feature value and current characteristic value can represent the data content of the record of the overall data in target matrix, nothing The comparison that field one by one carries out content need to be carried out between data record, the consistent of history feature value and current characteristic value can be passed through Property learns whether data record changes, to the whole of the calculation resources of cluster device in relative reduction zipper table generating process Body occupies, and then ensures the overall operation stability of big data platform and reduce O&M pressure.In addition, the present invention also provides A kind of zipper table creating device, equipment and medium, beneficial effect are same as above.
Detailed description of the invention
In order to illustrate the embodiments of the present invention more clearly, attached drawing needed in the embodiment will be done simply below It introduces, it should be apparent that, drawings in the following description are only some embodiments of the invention, for ordinary skill people For member, without creative efforts, it is also possible to obtain other drawings based on these drawings.
Fig. 1 is a kind of flow chart of zipper table generating method disclosed by the invention;
Fig. 2 is a kind of structural schematic diagram of zipper table creating device disclosed in the present application.
Specific embodiment
Following will be combined with the drawings in the embodiments of the present invention, and technical solution in the embodiment of the present invention carries out clear, complete Site preparation description, it is clear that described embodiments are only a part of the embodiments of the present invention, rather than whole embodiments.Based on this Embodiment in invention, those of ordinary skill in the art are without making creative work, obtained every other Embodiment belongs to the scope of the present invention.
Biggish waste is caused to memory space in order to avoid the data record in full dose data table memory, currently often Changed data record in tables of data is only saved under historical period by way of zipper table.The purpose of zipper table is to save Data record is before content change in tables of data, until the information of all changes of current state, zipper table is usually reconciliation The history of family information changes the result that content is retained.When being currently generated zipper table, the moment before generally requiring to get The tables of data of tables of data and current time, and corresponding data between the tables of data at moment and the tables of data at current time before comparing Data in record in each respective field, and then sent when certain data of tables of data under moment before and current time records When content change, by before when the data record inscribed save to zipper table.Due to currently carrying out adjacent moment data It is field progress content comparison one by one when the comparison that corresponding data records between table, and in actual scene, in tables of data The field that data record is included is often more, therefore currently in the ratio for carrying out corresponding data record between adjacent moment tables of data Clock synchronization needs to occupy a large amount of calculation resources of cluster device, it is difficult to ensure the overall operation stability of big data platform is easily made At biggish O&M pressure.
For this purpose, core of the invention is to provide a kind of zipper table generating method, in relative reduction zipper table generating process The whole of calculation resources of cluster device is occupied, and then ensures the overall operation stability of big data platform and reduces O&M pressure Power.
In order to enable those skilled in the art to better understand the solution of the present invention, with reference to the accompanying drawings and detailed description The present invention is described in further detail.
Shown in Figure 1, the embodiment of the invention discloses a kind of zipper table generating methods, comprising:
Step S10: read that historgraphic data recording of the target matrix under the historical juncture in data warehouse is corresponding to be gone through History characteristic value.
Wherein, history feature value is by the way that the data content of field each in historgraphic data recording is spliced into the first word in advance Symbol string, and hash algorithm operation generation is carried out to the first character string.
It should be noted that this step is to read go through corresponding with the historgraphic data recording of target matrix under the historical juncture History characteristic value, history feature value are spliced into the first character string by the data content of resource each in historgraphic data recording in advance, and right First character string carries out hash algorithm operation generation.
Hash algorithm in this method is essentially hash function, is that the input of arbitrary data length is passed through hashing algorithm It is transformed into the output of regular length, which is exactly hashed value.This conversion is a kind of compression mapping, it is, hashed value Space is generally much less than the space inputted, and different inputs may hash to identical output, it is impossible to from hashed value Determine unique input value, hash function is briefly exactly a kind of message compression by random length to a certain regular length Eap-message digest function.The first character string in this step is the hashed value of complete history data record, can characterize one Complete history data record, therefore when the data content of field any in historgraphic data recording changes, the first character string Content can change therewith.
In addition, it is necessary to illustrate, historgraphic data recording and current data record in this method refer both in itself Be data record in tables of data, so-called data record is exactly the data in tables of data, and each data record is equal It include the preset field of tables of data.
Step S11: target matrix current data record corresponding with historgraphic data recording under current time is obtained.
It is emphasized that since whether this method is to be changed and determined according to the data record in target matrix Whether the target matrix corresponding zipper table is generated, therefore the current data record being compared should be answered with historgraphic data recording Obtained for corresponding same data record, and then in this step target matrix under current time with historgraphic data recording Corresponding current data record, for being compared in the next steps with historgraphic data recording.
Step S12: by current data record in the data content of each field be spliced into the second character string, and to the second character String carries out hash algorithm operation and generates current characteristic value.
Due to this method focus on by historgraphic data recording and current data record between characteristic value comparison with Substitution historgraphic data recording and current data record the comparison between each corresponding field, therefore during this step records current data The data content of each field is spliced into the second character string, and executes hash algorithm identical with the first character string to the second character string Operation generates current characteristic value.
Step S13: history feature value and current characteristic value are judged with the presence or absence of difference, if so, thening follow the steps S14.
Step S14: the zipper table that record has historgraphic data recording is generated.
It should be noted that since history feature value can characterize respectively historgraphic data recording with current characteristic value and work as Preceding data record, therefore historgraphic data recording can be learned with the presence or absence of difference by comparing history feature value and current characteristic value And whether current data record occurs the variation of data content, and then when history feature value has differences with current characteristic value When, then the zipper table that record has historgraphic data recording is generated, changed data record is recorded with this.
Zipper table generating method provided by the present invention, target matrix in data warehouse first under the reading historical juncture The corresponding history feature value of historical data, wherein by advance by each field in historgraphic data recording when history feature value Data content is spliced into the first character string, and carries out hash algorithm operation generation to the first character string;And then obtain number of targets According to the historgraphic data recording corresponding current data record of the table under current time, by current data record in each field Data content is spliced into the second character string, and carries out identical hash algorithm operation to the second character string and generate current characteristic value, History feature value and current characteristic value are finally judged with the presence or absence of difference, if it is, generating record has the historgraphic data recording Zipper table.This method is that the history of hash algorithm operation generation is carried out by comparing the whole historgraphic data recording of target matrix Whether deposited between characteristic value and the current characteristic value of the whole current data record progress hash algorithm operation generation of target matrix In the mode of difference, the comparison that whether there is difference between the corresponding data record to target matrix under different moments is realized, Since history feature value and current characteristic value can represent the data content of the record of the overall data in target matrix, nothing The comparison that field one by one carries out content need to be carried out between data record, the consistent of history feature value and current characteristic value can be passed through Property learns whether data record changes, to the whole of the calculation resources of cluster device in relative reduction zipper table generating process Body occupies, and then ensures the overall operation stability of big data platform and reduce O&M pressure.
On the basis of the above embodiments, the present invention also provides a series of preferred embodiments.
As a preferred embodiment, reading history number of the target matrix under the historical juncture in data warehouse According to recording corresponding history feature value, comprising:
The corresponding history feature value of historgraphic data recording is read in preset middle table.
Wherein, middle table has additional the feature value fields of log history characteristic value based on the field of target matrix.
It should be noted that present embodiment focuses on introducing middle table, middle table includes target matrix In field also have additional the feature value fields of log history characteristic value and on the basis of the field of target matrix, it is special Value indicative field is used for the corresponding history feature value of historgraphic data recording under the pre-recorded historical juncture.Due in present embodiment Between table on the basis of record has target matrix historgraphic data recording under the historical juncture, also record have the historgraphic data recording History feature value, and generate by operation and the history feature value that is recorded in middle table can be adjusted repeatedly and efficiently With, therefore being capable of the opposite whole efficiency for improving zipper table generating process.
In addition, as a preferred embodiment, including that storing data record comes into force in the field of target matrix The data record effective date field on date, zipper table include data record effective date field and data record expiry date Phase field.
It should be noted that including the storing data record effective date in the target data literary name section of present embodiment Data record effective date field, zipper table include data record effective date field and data record Expiration Date word Section, it is therefore an objective to the date come into force definitely can be recorded by target matrix storing data, target matrix is worked as with this In data record when changing and generating the new data record effective date, can be according to coming into force historgraphic data recording Content of the date as the data record effective date field in zipper table, and using the effective date of current data record as drawing The content of data record Expiration Date field in chained list, so it is more detailed by zipper table store historical data record once Date section through coming into force, convenient for according to zipper table more efficiently to the data record of the tables of data in certain time period before It is traced.
In addition, as a preferred embodiment, obtain target matrix under current time with historgraphic data recording Corresponding current data record, comprising:
Obtain target matrix current data identical with the major key field content of historgraphic data recording under current time Record.
It should be noted that due to consideration that in tables of data in the unique mark data table of major key field energy of data record Every a line, by major key field can pressure data table entity integrity, therefore the major key field of tables of data can be passed through Data content characterizes the data record locating for it, and due in practical applications, the data content of major key field in tables of data It is often constant, therefore present embodiment is using the data content of the major key field of historgraphic data recording as acquisition current data The foundation of record, i.e. acquisition target matrix are identical with the major key field content of historgraphic data recording current under current time Data record, can be accurate corresponding between historgraphic data recording and current data record with respect to ensure, and then guarantees zipper table Content reliability.
In addition, as a preferred embodiment, hash algorithm includes MD5 hash algorithm.
It should be noted that MD5 hash algorithm is a kind of One-way encryption algorithm, the information of input can be encrypted and be converted The information of random length is inputted for the integrality in inspection data transmission process for the hashed value of 128 regular lengths, is passed through Processing is crossed, output is all 128 values of information, therefore the history feature value that opposite can ensure to obtain based on MD5 hash algorithm And current characteristic value can have less byte number, be compared between relative reduction history feature value and current characteristic value Compared with when to overall overhead caused by hardware resource.In addition, MD5 hash algorithm has faster calculating speed, it can be relatively high History feature value and current characteristic value is calculated in effect, further improves the formation efficiency of zipper table.
In addition, as a preferred embodiment, data warehouse includes Hive data warehouse.
Hive data warehouse is the data warehouse based on the building of Hadoop big data framework, can be collected simply by increasing The mode extension storage magnitude of group node, therefore more data records can be stored by tables of data, it is mentioned so as to opposite The overall utility of high zipper table.
On the basis of a series of above-mentioned embodiments, as a preferred embodiment, the first character string and The data type of two character strings is character string type.
It should be noted that due to consideration that there may be words in the data record of tables of data in practical application scene The data content of section is empty situation, in order to guarantee each row of data when the data content of field is empty, in target matrix Record is unique by the characteristic value that hash algorithm operation generates, and present embodiment is by the data content of each field of target matrix After carrying out the conversion of character string (String) type, and then by way of adding separator between the data content in adjacent fields Or the mode of direct splicing splices the data content of all fields in the table of source, generates the first character string and the second character with this String opposite can be avoided when content change occurs for corresponding historgraphic data recording and current data record, the first character string with Second character string is identical to be happened, and then ensures the content reliability of zipper table.
Shown in Figure 2, the embodiment of the invention also discloses a kind of zipper table creating devices, comprising:
History feature obtains module 10, for reading history number of the target matrix in data warehouse under the historical juncture According to recording corresponding history feature value;Wherein, history feature value is by advance by the data of field each in historgraphic data recording Content is spliced into the first character string, and carries out hash algorithm operation generation to the first character string;
Current data obtains module 11, corresponding with historgraphic data recording under current time for obtaining target matrix Current data record;
Current signature computing module 12, the data content for each field in recording current data are spliced into the second character String, and hash algorithm operation is carried out to the second character string and generates current characteristic value;
Diversity judgement module 13, for judging history feature value and current characteristic value with the presence or absence of difference, if it is, adjusting With zipper table generation module 14;
Zipper table generation module 14, the zipper table for having historgraphic data recording for generating record.
Zipper table creating device provided by the present invention, target matrix in data warehouse first under the reading historical juncture The corresponding history feature value of historical data, wherein by advance by each field in historgraphic data recording when history feature value Data content is spliced into the first character string, and carries out hash algorithm operation generation to the first character string;And then obtain number of targets According to the historgraphic data recording corresponding current data record of the table under current time, by current data record in each field Data content is spliced into the second character string, and carries out identical hash algorithm operation to the second character string and generate current characteristic value, History feature value and current characteristic value are finally judged with the presence or absence of difference, if it is, generating record has the historgraphic data recording Zipper table.The present apparatus is that the history of hash algorithm operation generation is carried out by comparing the whole historgraphic data recording of target matrix Whether deposited between characteristic value and the current characteristic value of the whole current data record progress hash algorithm operation generation of target matrix In the mode of difference, the comparison that whether there is difference between the corresponding data record to target matrix under different moments is realized, Since history feature value and current characteristic value can represent the data content of the record of the overall data in target matrix, nothing The comparison that field one by one carries out content need to be carried out between data record, the consistent of history feature value and current characteristic value can be passed through Property learns whether data record changes, to the whole of the calculation resources of cluster device in relative reduction zipper table generating process Body occupies, and then ensures the overall operation stability of big data platform and reduce O&M pressure.
In addition, the embodiment of the present invention also provides a kind of zipper table generating device, comprising:
Memory, for storing computer program;
Processor is realized when for executing computer program such as the step of above-mentioned zipper table generating method.
Zipper table generating device provided by the present invention, target matrix in data warehouse first under the reading historical juncture The corresponding history feature value of historical data, wherein by advance by each field in historgraphic data recording when history feature value Data content is spliced into the first character string, and carries out hash algorithm operation generation to the first character string;And then obtain number of targets According to the historgraphic data recording corresponding current data record of the table under current time, by current data record in each field Data content is spliced into the second character string, and carries out identical hash algorithm operation to the second character string and generate current characteristic value, History feature value and current characteristic value are finally judged with the presence or absence of difference, if it is, generating record has the historgraphic data recording Zipper table.This equipment is that the history of hash algorithm operation generation is carried out by comparing the whole historgraphic data recording of target matrix Whether deposited between characteristic value and the current characteristic value of the whole current data record progress hash algorithm operation generation of target matrix In the mode of difference, the comparison that whether there is difference between the corresponding data record to target matrix under different moments is realized, Since history feature value and current characteristic value can represent the data content of the record of the overall data in target matrix, nothing The comparison that field one by one carries out content need to be carried out between data record, the consistent of history feature value and current characteristic value can be passed through Property learns whether data record changes, to the whole of the calculation resources of cluster device in relative reduction zipper table generating process Body occupies, and then ensures the overall operation stability of big data platform and reduce O&M pressure.
In addition, the embodiment of the present invention also provides a kind of computer readable storage medium, deposited on computer readable storage medium Computer program is contained, is realized when computer program is executed by processor such as the step of above-mentioned zipper table generating method.
Computer readable storage medium provided by the present invention, target data in data warehouse first under the reading historical juncture The corresponding history feature value of the historical data of table, wherein by advance by each word in historgraphic data recording when history feature value The data content of section is spliced into the first character string, and carries out hash algorithm operation generation to the first character string;And then obtain mesh Mark tables of data under current time current data corresponding with the historgraphic data recording record, by current data record in each word The data content of section is spliced into the second character string, and carries out identical hash algorithm operation to the second character string and generate current signature Value finally judges history feature value and current characteristic value with the presence or absence of difference, if it is, generating record has historical data note The zipper table of record.This computer readable storage medium is to carry out Hash calculation by comparing the whole historgraphic data recording of target matrix The history feature value that method operation generates carries out the current of hash algorithm operation generation with the whole current data record of target matrix Between characteristic value whether there is difference mode, realize under different moments target matrix corresponding data record between whether The comparison having differences, since history feature value and current characteristic value can represent the record of the overall data in target matrix Data content carries out the comparison of content there is no need to carry out between data record field one by one, can by history feature value with The consistency of current characteristic value learns whether data record changes, and sets in relative reduction zipper table generating process to cluster The whole of standby calculation resources occupies, and then ensures the overall operation stability of big data platform and reduce O&M pressure.
A kind of zipper table generating method provided by the present invention, device, equipment and medium are described in detail above. Each embodiment is described in a progressive manner in specification, the highlights of each of the examples are with other embodiments not Same place, the same or similar parts in each embodiment may refer to each other.For the device disclosed in the embodiment, due to it It corresponds to the methods disclosed in the examples, so being described relatively simple, reference may be made to the description of the method.It should It points out, it for those skilled in the art, without departing from the principle of the present invention, can also be to this hair Bright some improvement and modification can also be carried out, and these improvements and modifications also fall within the scope of protection of the claims of the present invention.
It should also be noted that, in the present specification, relational terms such as first and second and the like be used merely to by One entity or operation are distinguished with another entity or operation, without necessarily requiring or implying these entities or operation Between there are any actual relationship or orders.Moreover, the terms "include", "comprise" or its any other variant meaning Covering non-exclusive inclusion, so that the process, method, article or equipment for including a series of elements not only includes that A little elements, but also including other elements that are not explicitly listed, or further include for this process, method, article or The intrinsic element of equipment.In the absence of more restrictions, the element limited by sentence "including a ...", is not arranged Except there is also other identical elements in the process, method, article or apparatus that includes the element.

Claims (10)

1. a kind of zipper table generating method characterized by comprising
Read historgraphic data recording corresponding history feature value of the target matrix in data warehouse under the historical juncture;Its In, the history feature value is by the way that the data content of field each in the historgraphic data recording is spliced into the first character in advance String, and hash algorithm operation generation is carried out to first character string;
Obtain target matrix current data record corresponding with the historgraphic data recording under current time;
The data content of each field in current data record is spliced into the second character string, and to second character string into The row hash algorithm operation generates current characteristic value;
Judge the history feature value and the current characteristic value with the presence or absence of difference;
If it is, generating the zipper table that record has the historgraphic data recording.
2. zipper table generating method according to claim 1, which is characterized in that the number of targets read in data warehouse According to the corresponding history feature value of historgraphic data recording of the table under the historical juncture, comprising:
The corresponding history feature value of the historgraphic data recording is read in preset middle table;Wherein, the middle table Field based on the target matrix has additional the feature value fields for recording the history feature value.
3. zipper table generating method according to claim 1, which is characterized in that include in the field of the target matrix There is the data record effective date field of storing data record effective date, the zipper table includes the data record effective date Field and data record Expiration Date field.
4. zipper table generating method according to claim 1, which is characterized in that the acquisition target matrix is being worked as Current data record corresponding with the historgraphic data recording is inscribed when preceding, comprising:
It is identical with the major key field content of the historgraphic data recording described under current time to obtain the target matrix Current data record.
5. zipper table generating method according to claim 1, which is characterized in that the hash algorithm includes that MD5 Hash is calculated Method.
6. zipper table generating method according to claim 1, which is characterized in that the data warehouse includes Hive data bins Library.
7. according to claim 1 to zipper table generating method described in 6 any one, which is characterized in that first character string And the data type of second character string is character string type.
8. a kind of zipper table creating device characterized by comprising
History feature obtains module, for reading historgraphic data recording of the target matrix in data warehouse under the historical juncture Corresponding history feature value;Wherein, the history feature value is by advance by the number of field each in the historgraphic data recording It is spliced into the first character string according to content, and hash algorithm operation generation is carried out to first character string;
Current data obtains module, corresponding with the historgraphic data recording under current time for obtaining the target matrix Current data record;
Current signature computing module, for the data content of each field in current data record to be spliced into the second character String, and the hash algorithm operation is carried out to second character string and generates current characteristic value;
Diversity judgement module, for judging that the history feature value and the current characteristic value whether there is difference, if it is, Call zipper table generation module;
The zipper table generation module, the zipper table for having the historgraphic data recording for generating record.
9. a kind of zipper table generating device characterized by comprising
Memory, for storing computer program;
Processor realizes zipper table as described in any one of claim 1 to 7 generation side when for executing the computer program The step of method.
10. a kind of computer readable storage medium, which is characterized in that be stored with computer on the computer readable storage medium Program, the computer program realize zipper table generating method as described in any one of claim 1 to 7 when being executed by processor The step of.
CN201910532415.6A 2019-06-19 2019-06-19 A kind of zipper table generating method, device, equipment and medium Pending CN110209891A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910532415.6A CN110209891A (en) 2019-06-19 2019-06-19 A kind of zipper table generating method, device, equipment and medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910532415.6A CN110209891A (en) 2019-06-19 2019-06-19 A kind of zipper table generating method, device, equipment and medium

Publications (1)

Publication Number Publication Date
CN110209891A true CN110209891A (en) 2019-09-06

Family

ID=67793610

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910532415.6A Pending CN110209891A (en) 2019-06-19 2019-06-19 A kind of zipper table generating method, device, equipment and medium

Country Status (1)

Country Link
CN (1) CN110209891A (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111078672A (en) * 2019-12-20 2020-04-28 中国建设银行股份有限公司 Data comparison method and device for database
CN111143350A (en) * 2019-11-27 2020-05-12 深圳壹账通智能科技有限公司 Enterprise data monitoring method and device, computer equipment and storage medium
CN112735144A (en) * 2020-12-28 2021-04-30 浙江大华技术股份有限公司 Fake plate identification method and device, computer equipment and storage medium
CN112749167A (en) * 2021-01-18 2021-05-04 中国邮政储蓄银行股份有限公司 Method and device for determining broken link data and nonvolatile storage medium
CN112905805A (en) * 2021-03-05 2021-06-04 北京中经惠众科技有限公司 Knowledge graph construction method and device, computer equipment and storage medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101916262A (en) * 2010-07-29 2010-12-15 北京用友政务软件有限公司 Acceleration method of financial element matching
US20170026356A1 (en) * 2015-07-22 2017-01-26 Here Global B.V. Method and apparatus for generating an intelligent primary key facilitating faster object retrieval
CN107193985A (en) * 2017-05-27 2017-09-22 郑州云海信息技术有限公司 A kind of slide fastener table design method of record data change histories
CN109446205A (en) * 2017-08-28 2019-03-08 中国电信股份有限公司 Judge the device and method of data mode and the device and method that data update

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101916262A (en) * 2010-07-29 2010-12-15 北京用友政务软件有限公司 Acceleration method of financial element matching
US20170026356A1 (en) * 2015-07-22 2017-01-26 Here Global B.V. Method and apparatus for generating an intelligent primary key facilitating faster object retrieval
CN107193985A (en) * 2017-05-27 2017-09-22 郑州云海信息技术有限公司 A kind of slide fastener table design method of record data change histories
CN109446205A (en) * 2017-08-28 2019-03-08 中国电信股份有限公司 Judge the device and method of data mode and the device and method that data update

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111143350A (en) * 2019-11-27 2020-05-12 深圳壹账通智能科技有限公司 Enterprise data monitoring method and device, computer equipment and storage medium
CN111078672A (en) * 2019-12-20 2020-04-28 中国建设银行股份有限公司 Data comparison method and device for database
CN111078672B (en) * 2019-12-20 2023-06-02 中国建设银行股份有限公司 Data comparison method and device for database
CN112735144A (en) * 2020-12-28 2021-04-30 浙江大华技术股份有限公司 Fake plate identification method and device, computer equipment and storage medium
CN112749167A (en) * 2021-01-18 2021-05-04 中国邮政储蓄银行股份有限公司 Method and device for determining broken link data and nonvolatile storage medium
CN112905805A (en) * 2021-03-05 2021-06-04 北京中经惠众科技有限公司 Knowledge graph construction method and device, computer equipment and storage medium
CN112905805B (en) * 2021-03-05 2023-09-15 北京中经惠众科技有限公司 Knowledge graph construction method and device, computer equipment and storage medium

Similar Documents

Publication Publication Date Title
CN110209891A (en) A kind of zipper table generating method, device, equipment and medium
Bennett et al. Malstone: towards a benchmark for analytics on large data clouds
CN105787128B (en) A method of restoring Java and serializes file data
CN109919691B (en) Data processing system, method and device
CN106682077A (en) Method for storing massive time series data on basis of Hadoop technologies
CN106844682A (en) Method for interchanging data, apparatus and system
CN107037978A (en) Data Migration bearing calibration and system
WO2021057482A1 (en) Method and device for generating bloom filter in blockchain
CN106484734A (en) A kind of data query caching method and system
Eppstein et al. Separator based sparsification: I. Planarity testing and minimum spanning trees
CN104965835B (en) A kind of file read/write method and device of distributed file system
CN109344268A (en) Method, electronic equipment and the computer readable storage medium of graphic data base write-in
CN110119947B (en) Method and apparatus for shared workload proof computing power generation of symbiotic blockchains
CN110134511A (en) A kind of shared storage optimization method of OpenTSDB
CN103488755B (en) A kind of file system access method and apparatus
CN107368404A (en) A kind of method of auditing administration and system
TWI522827B (en) Real-time storage and real-time reading of huge amounts of data for non-related databases
He et al. SLC-index: A scalable skip list-based index for cloud data processing
CN112381583A (en) Power consumption calculation method and device based on distributed memory calculation technology
CN102521451B (en) A kind of electric network model file, generation method and the system of supporting accelerated model to splice
CN111538804A (en) HBase-based graph data processing method and equipment
CN110245148A (en) A kind of date storage method, device, system and medium
He Construction of Teaching Management Platform for Universities Based on Big Data
CN113626438B (en) Data table management method, device, computer equipment and storage medium
Liu et al. Digital preservation and presentation of institution photo archives: the Anhui University Memory Project Experience

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20190906