WO2019052112A1 - Incremental data synchronization method, readable storage medium, terminal device, and apparatus - Google Patents


Info

Publication number
WO2019052112A1
Authority
WO
WIPO (PCT)
Prior art keywords
incremental
incremental data
data
record
records
Prior art date
Application number
PCT/CN2018/074869
Other languages
French (fr)
Chinese (zh)
Inventor
覃娆
覃无愧
Original Assignee
平安科技(深圳)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 平安科技(深圳)有限公司
Publication of WO2019052112A1

Links

Classifications

    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04L: TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00: Network arrangements or protocols for supporting network services or applications
    • H04L67/01: Protocols
    • H04L67/10: Protocols in which an application is distributed across nodes in the network
    • H04L67/1095: Replication or mirroring of data, e.g. scheduling or transport for data synchronisation between network nodes

Definitions

  • The present application belongs to the field of computer technologies, and in particular relates to an incremental data synchronization method, a computer readable storage medium, a terminal device, and an apparatus.
  • Current financial business systems are large in scale and complex in structure, and an entire system is often divided into multiple service subsystems according to service classification. As a result, a user's incremental data often needs to be synchronized across multiple service subsystems.
  • The current approach is mainly to record incremental data in real time to multiple different service subsystems through interface calls. With this approach, the consistency of data synchronization is difficult to guarantee and the error rate is high.
  • In view of this, the embodiments of the present application provide an incremental data synchronization method, a computer readable storage medium, a terminal device, and an apparatus, to solve the problem that, when incremental data is recorded in real time to multiple different service subsystems through interface calls, the consistency of data synchronization is difficult to guarantee and the error rate is high.
  • A first aspect of the embodiments of the present application provides an incremental data synchronization method, which may include:
  • obtaining first incremental data from each service subsystem, where the first incremental data is the user data actually added by each service subsystem in a preset first time period;
  • summarizing the obtained first incremental data to obtain second incremental data, where the second incremental data is the user data that should be added by each service subsystem in the first time period;
  • comparing each first incremental data with the second incremental data to obtain the incremental records missing from each service subsystem, where an incremental record is the basic unit of incremental data and each incremental record contains a piece of business information;
  • adding the missing incremental records to the corresponding service subsystems to complete synchronization of the incremental data of each service subsystem.
  • A second aspect of the embodiments of the present application provides a computer readable storage medium storing computer readable instructions that, when executed by a processor, implement the following steps:
  • obtaining first incremental data from each service subsystem, where the first incremental data is the user data actually added by each service subsystem in a preset first time period;
  • summarizing the obtained first incremental data to obtain second incremental data, where the second incremental data is the user data that should be added by each service subsystem in the first time period;
  • comparing each first incremental data with the second incremental data to obtain the incremental records missing from each service subsystem, where an incremental record is the basic unit of incremental data and each incremental record contains a piece of business information;
  • adding the missing incremental records to the corresponding service subsystems to complete synchronization of the incremental data of each service subsystem.
  • A third aspect of the embodiments of the present application provides an incremental data synchronization terminal device comprising a memory, a processor, and computer readable instructions stored in the memory and executable on the processor. The processor implements the following steps when executing the computer readable instructions:
  • obtaining first incremental data from each service subsystem, where the first incremental data is the user data actually added by each service subsystem in a preset first time period;
  • summarizing the obtained first incremental data to obtain second incremental data, where the second incremental data is the user data that should be added by each service subsystem in the first time period;
  • comparing each first incremental data with the second incremental data to obtain the incremental records missing from each service subsystem, where an incremental record is the basic unit of incremental data and each incremental record contains a piece of business information;
  • adding the missing incremental records to the corresponding service subsystems to complete synchronization of the incremental data of each service subsystem.
  • A fourth aspect of the embodiments of the present application provides an incremental data synchronization apparatus, which may include:
  • an incremental data obtaining module, configured to obtain first incremental data from each service subsystem, where the first incremental data is the user data actually added by each service subsystem in a preset first time period;
  • an incremental data summary module, configured to summarize the obtained first incremental data to obtain second incremental data, where the second incremental data is the user data that should be added by each service subsystem in the first time period;
  • a first comparison module, configured to compare each first incremental data with the second incremental data to obtain the incremental records missing from each service subsystem, where an incremental record is the basic unit of incremental data and each incremental record contains a piece of business information;
  • a missing record adding module, configured to add the missing incremental records to the corresponding service subsystems to complete synchronization of the incremental data of each service subsystem.
  • The embodiments of the present application have the following beneficial effects. Incremental data is first obtained from each service subsystem and then summarized, and the summarized incremental data serves as a synchronization standard. The incremental data of each service subsystem is compared with this standard to obtain the records in which it differs from the standard. Because these differences are all obtained against the same synchronization standard, the incremental data of the service subsystems is consistent once each service subsystem synchronizes its own incremental data accordingly. Because the synchronization standard summarizes the data of every service subsystem, its accuracy is well assured, and using it as the reference can greatly reduce the error rate during synchronization.
  • FIG. 1 is a schematic diagram of an implementation environment in an embodiment of the present application.
  • FIG. 2 is a flowchart of an embodiment of an incremental data synchronization method in an embodiment of the present application.
  • FIG. 3 is a flowchart of an embodiment of an incremental data synchronization method in a scenario in which a data error exists in the embodiment of the present application;
  • FIG. 4 is a schematic flowchart of determining a system to be checked in an embodiment of the present application.
  • FIG. 5 is a schematic block diagram of an incremental data synchronization terminal device according to an embodiment of the present application.
  • FIG. 6 is a block diagram of an incremental data synchronization computer readable instruction according to an embodiment of the present application.
  • FIG. 1 is a schematic diagram of an implementation environment of an embodiment of the present application.
  • The execution entity of the embodiments of the present application is the synchronization processing system shown in the figure. The synchronization processing system is connected to each service subsystem, reads the incremental data of each service subsystem, and summarizes it. It then compares the incremental data of each service subsystem with the summarized incremental data to obtain the incremental records missing from each service subsystem, and finally adds the missing incremental records to the corresponding service subsystems to complete the synchronization of the incremental data of each service subsystem.
  • an embodiment of an incremental data synchronization method in the embodiment of the present application may include:
  • Step S201: acquire first incremental data from each service subsystem.
  • The first incremental data is the user data newly added by each service subsystem in a preset first time period.
  • The synchronization processing system may, at a preset first moment, read the first incremental data of each service subsystem from a preset first storage location. The first incremental data is extracted by each service subsystem at a preset second moment and stored in the first storage location.
  • For example, a company routinely synchronizes incremental data in the early morning of each day. Each service subsystem starts a scheduled task at 00:05:00 on day T+1 (i.e., the second moment), extracts the incremental data of day T in its own system (i.e., the first incremental data), and stores it in the first storage location.
  • The synchronization processing system starts a scheduled task at 02:00:00 on day T+1 (i.e., the first moment) and reads the first incremental data from the first storage location.
  • The first moment should be later than the second moment, and the interval between the first moment and the second moment should be greater than a preset time interval.
  • The specific value of the time interval can be set according to actual conditions, provided that it leaves each service subsystem enough time to complete data processing and storage.
  • Step S202: summarize the acquired first incremental data to obtain second incremental data.
  • The second incremental data is the user data that should be added by each service subsystem during the first time period.
  • Incremental data is composed of multiple incremental records. An incremental record is the basic unit of incremental data, and each incremental record contains a piece of business information. For example, each incremental record can represent a binding relationship between a user and a product account.
  • Each first incremental data can be regarded as a set with its incremental records as elements, and the union of these sets is the second incremental data. Specifically, all incremental records may first be extracted and combined into one incremental data; then, among incremental records that contain the same business information, only one is retained and the other duplicates are deleted.
  • For example, if the first incremental data of service subsystem 1 includes incremental record 1, incremental record 2, incremental record 3, and incremental record 4, and the first incremental data of service subsystem 2 includes incremental record 1, incremental record 4, incremental record 5, and incremental record 6, then the second incremental data obtained after summarization includes incremental record 1, incremental record 2, incremental record 3, incremental record 4, incremental record 5, and incremental record 6.
  • Step S203: compare each first incremental data with the second incremental data to obtain the incremental records missing from each service subsystem.
  • Continuing the example, comparing the first incremental data of service subsystem 1 with the second incremental data shows that its missing incremental records are incremental record 5 and incremental record 6; comparing the first incremental data of service subsystem 2 with the second incremental data shows that its missing incremental records are incremental record 2 and incremental record 3.
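  • The summarization of step S202 and the comparison of step S203 can be sketched with set operations. This is a minimal illustration, not the applicant's implementation: the function names and the dictionary layout are hypothetical, and each incremental record is assumed to be representable as a hashable value.

```python
# Hypothetical sketch of steps S202 and S203; names are illustrative only.

def summarize(first_increments):
    """Return the second incremental data: the union of every
    subsystem's first incremental data, with duplicates collapsed."""
    second = set()
    for records in first_increments.values():
        second |= set(records)
    return second


def missing_records(first_increments):
    """Map each subsystem to the records present in the summary
    but absent from that subsystem's own first incremental data."""
    second = summarize(first_increments)
    return {
        name: second - set(records)
        for name, records in first_increments.items()
    }


# Incremental records are modelled as integers 1..6, as in the example.
first_increments = {
    "subsystem_1": {1, 2, 3, 4},
    "subsystem_2": {1, 4, 5, 6},
}
missing = missing_records(first_increments)
print(sorted(missing["subsystem_1"]))  # [5, 6]
print(sorted(missing["subsystem_2"]))  # [2, 3]
```

  • Using a set makes the "retain one copy of duplicate records" rule automatic, provided that records with the same business information compare equal.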
  • Step S204: add the missing incremental records to the corresponding service subsystems to complete synchronization of the incremental data of each service subsystem.
  • The synchronization processing system may store the missing incremental records in a preset second storage location at a preset third moment, so that each service subsystem, at a preset fourth moment, reads its own missing incremental records from the second storage location and adds them to its first incremental data to obtain synchronized incremental data.
  • The third moment should be later than the first moment, and the interval between the first moment and the third moment should be greater than a certain time interval, so that the synchronization processing system has enough time to complete the reading, summarizing, and comparison processes. The fourth moment should be later than the third moment, and the interval between the third moment and the fourth moment should be greater than a certain time interval, so that the synchronization processing system has enough time to complete the storage process.
  • For example, the synchronization processing system starts the scheduled task for reading, summarizing, and comparing at 02:00:00 on day T+1 (i.e., the first moment) and starts the scheduled task for storing at 05:00:00 on day T+1 (i.e., the third moment); each service subsystem starts its scheduled task for reading and adding at 06:00:00 on day T+1 (i.e., the fourth moment).
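  • The ordering constraints on the four moments can be checked with a short sketch. The function name, the single shared `min_gap`, the 30-minute value, and the calendar date are all assumptions made for illustration; the text only requires that each interval exceed some time interval set according to actual conditions.

```python
from datetime import datetime, timedelta

# Hypothetical check of the moment ordering: second < first < third < fourth,
# with every consecutive pair separated by more than min_gap.

def schedule_is_valid(second, first, third, fourth, min_gap):
    ordered = [second, first, third, fourth]
    return all(
        later - earlier > min_gap
        for earlier, later in zip(ordered, ordered[1:])
    )


day = datetime(2018, 1, 2)  # arbitrary stand-in for day T+1
ok = schedule_is_valid(
    second=day.replace(hour=0, minute=5),  # subsystems extract day-T data
    first=day.replace(hour=2),             # sync system reads and compares
    third=day.replace(hour=5),             # sync system stores missing records
    fourth=day.replace(hour=6),            # subsystems read and add them
    min_gap=timedelta(minutes=30),         # assumed minimum gap
)
print(ok)  # True
```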
  • In a scenario in which data errors exist, an embodiment of the incremental data synchronization method may include:
  • Step S301: acquire first incremental data from each service subsystem.
  • Step S301 is the same as step S201 in the foregoing embodiment.
  • Step S302: summarize the acquired first incremental data to obtain second incremental data.
  • Step S302 is the same as step S202 in the foregoing embodiment. For details, refer to the description in the foregoing embodiment.
  • Step S303: calculate the credibility of each incremental record in the second incremental data according to the preset data credibility of each service subsystem.
  • The data credibility of each service subsystem is determined from historical statistics. The more modification operations (including adding incremental records and deleting incremental records) a service subsystem has performed on its incremental data in the historical statistics, the higher the probability that its data is wrong, and the lower its data credibility.
  • The data credibility of a service subsystem lies in the range [0, 1]. A value of 0 means that its data is completely untrustworthy, and a value of 1 means that its data is completely trustworthy.
  • The credibility of an incremental record can be calculated by summing the data credibility of every service subsystem in which the incremental record occurs and dividing the sum by the total number of service subsystems. The resulting quotient is the credibility of the incremental record. The credibility of an incremental record likewise lies in the range [0, 1]. A value of 0 means that the record is completely untrustworthy, and a value of 1 means that the record is completely trustworthy.
  • For example, the data credibility values of the service subsystems may be 0.9, 0.8, and 0.7.
  • Step S304: extract, from the incremental records in the second incremental data, the preferred records whose credibility is greater than a preset second threshold.
  • The value of the second threshold may be set according to the actual situation and is not specifically limited in this embodiment. Note that the larger the value, the fewer preferred records are extracted; the smaller the value, the more preferred records are extracted.
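  • Steps S303 and S304 can be sketched as follows. The credibility values 0.9, 0.8, and 0.7 are taken from the text; the subsystem names, the record names, which records occur in which subsystem, and the second threshold of 0.5 are assumptions introduced only to make the sketch runnable.

```python
# Hypothetical sketch of credibility scoring (S303) and filtering (S304).

def record_credibility(record, first_increments, subsystem_credibility):
    """Sum the credibility of every subsystem containing the record,
    then divide by the total number of subsystems."""
    total = sum(
        subsystem_credibility[name]
        for name, records in first_increments.items()
        if record in records
    )
    return total / len(first_increments)


def preferred_by_credibility(first_increments, subsystem_credibility, threshold):
    """Keep the records whose credibility is greater than the threshold."""
    second = set().union(*first_increments.values())
    return {
        record
        for record in second
        if record_credibility(record, first_increments, subsystem_credibility) > threshold
    }


subsystem_credibility = {"a": 0.9, "b": 0.8, "c": 0.7}
first_increments = {
    "a": {"r1"},        # assumed occurrence pattern
    "b": {"r1"},
    "c": {"r2"},
}
r1 = record_credibility("r1", first_increments, subsystem_credibility)  # (0.9 + 0.8) / 3
r2 = record_credibility("r2", first_increments, subsystem_credibility)  # 0.7 / 3
preferred = preferred_by_credibility(first_increments, subsystem_credibility, 0.5)
print(preferred)  # {'r1'}
```

  • Because the divisor is the total number of service subsystems rather than the number in which the record occurs, a record seen in only one subsystem scores low even if that subsystem is highly credible.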
  • As an alternative to step S303 and step S304, the total number of occurrences of each incremental record of the second incremental data across the first incremental data may be counted, and the preferred records whose total count is greater than a preset first threshold are extracted from the incremental records in the second incremental data, where the first threshold is determined by the total number of service subsystems. For example, suppose the total number of service subsystems is 10 and the first threshold is set to half of that total, that is, 5. If incremental record 1 occurs 4 times in total in the first incremental data and incremental record 2 occurs 6 times, then incremental record 2 is a preferred record and incremental record 1 is not.
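  • The count-based selection can be sketched with `collections.Counter`. The function name and the list-of-sets layout are hypothetical; the numbers (10 subsystems, a threshold of 5, counts of 4 and 6) follow the example in the text.

```python
from collections import Counter

# Hypothetical sketch of count-based preferred-record extraction.

def preferred_by_count(first_increments, threshold):
    """Keep the records that occur in more than `threshold` subsystems."""
    counts = Counter(
        record
        for records in first_increments
        for record in set(records)  # count each subsystem at most once
    )
    return {record for record, n in counts.items() if n > threshold}


# Ten subsystems: record_1 occurs in 4 of them, record_2 in 6 of them.
first_increments = [set() for _ in range(10)]
for records in first_increments[:4]:
    records.add("record_1")
for records in first_increments[:6]:
    records.add("record_2")

threshold = len(first_increments) / 2  # half the number of subsystems
preferred = preferred_by_count(first_increments, threshold)
print(preferred)  # {'record_2'}
```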
  • Step S305: combine the extracted preferred records into the preferred second incremental data.
  • Step S306: compare each first incremental data with the preferred second incremental data to obtain the missing incremental records and the redundant incremental records of each service subsystem.
  • For example, suppose the first incremental data of service subsystem 1 includes incremental record 1, incremental record 2, incremental record 3, and incremental record 4; the first incremental data of service subsystem 2 includes incremental record 1, incremental record 2, incremental record 3, and incremental record 5; and the preferred second incremental data includes incremental record 1, incremental record 2, incremental record 3, and incremental record 5.
  • Comparing the first incremental data of service subsystem 1 with the preferred second incremental data shows that its missing incremental record is incremental record 5 and its redundant incremental record is incremental record 4.
  • Comparing the first incremental data of service subsystem 2 with the preferred second incremental data shows that the two are identical, so there is neither a missing incremental record nor a redundant incremental record.
  • Step S307: add the missing incremental records to the corresponding service subsystems, and delete the redundant incremental records from the corresponding service subsystems.
  • The synchronization processing system may store the missing incremental records and the redundant incremental records in a preset second storage location, so that each service subsystem reads its own missing incremental records and redundant incremental records from the second storage location, adds the missing incremental records to its first incremental data, and deletes the redundant incremental records from its first incremental data to obtain synchronized incremental data.
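  • Steps S306 and S307 amount to two set differences followed by an add and a delete. The sketch below uses hypothetical function names and models incremental records as integers, matching the example for service subsystems 1 and 2.

```python
# Hypothetical sketch of steps S306 and S307; names are illustrative only.

def diff_against_preferred(first, preferred):
    """Return (missing, redundant) for one subsystem."""
    first, preferred = set(first), set(preferred)
    missing = preferred - first    # should be added to the subsystem
    redundant = first - preferred  # should be deleted from the subsystem
    return missing, redundant


def apply_sync(first, missing, redundant):
    """Add the missing records and delete the redundant ones."""
    return (set(first) | missing) - redundant


preferred = {1, 2, 3, 5}
subsystem_1 = {1, 2, 3, 4}
subsystem_2 = {1, 2, 3, 5}

missing_1, redundant_1 = diff_against_preferred(subsystem_1, preferred)
missing_2, redundant_2 = diff_against_preferred(subsystem_2, preferred)
synced_1 = apply_sync(subsystem_1, missing_1, redundant_1)

print(missing_1, redundant_1)  # {5} {4}
print(synced_1 == preferred)   # True
```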
  • Step S401: separately count the total number of historical operation records of each service subsystem.
  • A historical operation record is a record of an operation (including adding an incremental record and deleting an incremental record) performed by a service subsystem during synchronization processing within a preset second time period. To avoid the influence of short-term bursts, the second time period should be sufficiently long, for example one week or one month.
  • Step S402: determine each service subsystem whose total number of historical operation records is greater than a preset third threshold to be a to-be-checked system.
  • The value of the third threshold may be set according to the actual situation and is not specifically limited in this embodiment. Note that the larger the value, the fewer to-be-checked systems are determined; the smaller the value, the more to-be-checked systems are determined.
  • Step S403: notify designated staff to check the to-be-checked system.
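  • Steps S401 and S402 reduce to a threshold filter over per-subsystem operation counts. In the sketch below the function name, the subsystem names, the counts, and the third threshold of 100 are all made-up values for illustration; the notification of step S403 is left out because the text does not say how staff are notified.

```python
# Hypothetical sketch of steps S401 and S402; all values are illustrative.

def systems_to_check(history_counts, third_threshold):
    """Return the subsystems whose total number of historical
    operation records is greater than the third threshold."""
    return sorted(
        name
        for name, total in history_counts.items()
        if total > third_threshold
    )


# Assumed totals of add/delete operations over the second time period.
history_counts = {"subsystem_1": 12, "subsystem_2": 340, "subsystem_3": 95}
to_check = systems_to_check(history_counts, third_threshold=100)
print(to_check)  # ['subsystem_2']
```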
  • In the embodiments of the present application, incremental data is first obtained from each service subsystem and then summarized, and the summarized incremental data serves as a synchronization standard. The incremental data of each service subsystem is compared with this standard to obtain the records in which it differs from the standard. Because these differences are all obtained against the same synchronization standard, the incremental data of the service subsystems is consistent once each service subsystem synchronizes its own incremental data accordingly. Because the synchronization standard summarizes the data of every service subsystem, its accuracy is well assured, and using it as the reference can greatly reduce the error rate during synchronization.
  • FIG. 5 is a schematic block diagram of the incremental data synchronization terminal device provided by the embodiment of the present application. For convenience of description, only the parts related to the embodiment of the present application are shown.
  • the incremental data synchronization terminal device may be a computing device such as a desktop computer, a notebook, a palmtop computer, or a cloud server.
  • the incremental data synchronization terminal device can include a processor 50, a memory 51, and computer readable instructions 52 stored in the memory 51 and executable on the processor 50.
  • The processor 50 can be a central processing unit (CPU), another general-purpose processor, a digital signal processor (DSP), an application-specific integrated circuit (ASIC), a field-programmable gate array (FPGA) or other programmable logic device, a discrete gate or transistor logic device, a discrete hardware component, or the like.
  • The general-purpose processor may be a microprocessor, or the processor may be any conventional processor or the like.
  • the memory 51 may be an internal storage unit of the incremental data synchronization terminal device 5, such as a hard disk or a memory of the incremental data synchronization terminal device 5.
  • The memory 51 may also be an external storage device of the incremental data synchronization terminal device 5, for example a plug-in hard disk, a Smart Media Card (SMC), a Secure Digital (SD) card, or a flash card provided on the incremental data synchronization terminal device 5.
  • the memory 51 may also include both an internal storage unit of the incremental data synchronization terminal device 5 and an external storage device.
  • the memory 51 is used to store the computer readable instructions and other instructions and data required by the incremental data synchronization terminal device 5.
  • the memory 51 can also be used to temporarily store data that has been output or is about to be output.
  • FIG. 6 is a block diagram of computer readable instructions 52 provided by an embodiment of the present application.
  • The computer readable instructions 52 may be partitioned into one or more modules, which are stored in the memory 51 and executed by the processor 50 to implement the present application.
  • The computer readable instructions 52, i.e., the incremental data synchronization computer readable instructions, may be partitioned into an incremental data obtaining module 601, an incremental data summary module 602, a first comparison module 603, and a missing record adding module 604.
  • The functions of the modules 601-604 are described below.
  • The incremental data obtaining module 601 is configured to obtain first incremental data from each service subsystem, where the first incremental data is the user data actually added by each service subsystem in a preset first time period;
  • the incremental data summary module 602 is configured to summarize the obtained first incremental data to obtain second incremental data, where the second incremental data is the user data that should be added by each service subsystem in the first time period;
  • the first comparison module 603 is configured to compare each first incremental data with the second incremental data to obtain the incremental records missing from each service subsystem, where an incremental record is the basic unit of incremental data and each incremental record contains a piece of business information;
  • the missing record adding module 604 is configured to add the missing incremental records to the corresponding service subsystems to complete synchronization of the incremental data of each service subsystem.
  • The computer readable instructions 52 may further include:
  • a quantity statistics module, configured to separately count the total number of occurrences of each incremental record of the second incremental data in the first incremental data;
  • a first preferred record extraction module, configured to extract, from the incremental records in the second incremental data, the preferred records whose total count is greater than a preset first threshold, where the first threshold is determined by the total number of service subsystems;
  • a first combining module, configured to combine the extracted preferred records into the preferred second incremental data.
  • The computer readable instructions 52 may further include:
  • a credibility calculation module, configured to calculate the credibility of each incremental record in the second incremental data according to the preset data credibility of each service subsystem;
  • a second preferred record extraction module, configured to extract, from the incremental records in the second incremental data, the preferred records whose credibility is greater than a preset second threshold;
  • a second combining module, configured to combine the extracted preferred records into the preferred second incremental data.
  • The computer readable instructions 52 may further include:
  • a second comparison module, configured to compare each first incremental data with the preferred second incremental data to obtain the redundant incremental records of each service subsystem;
  • a redundant record deletion module, configured to delete the redundant incremental records from the corresponding service subsystems.
  • The computer readable instructions 52 may further include:
  • an operation statistics module, configured to separately count the total number of historical operation records of each service subsystem, where a historical operation record is a record of an operation performed by a service subsystem during synchronization processing within a preset second time period;
  • a to-be-checked system determining module, configured to determine each service subsystem whose total number of historical operation records is greater than a preset third threshold to be a to-be-checked system;
  • a troubleshooting module, configured to notify designated staff to check the to-be-checked system.
  • modules, units, and/or method steps of the various embodiments described in connection with the embodiments disclosed herein can be implemented in electronic hardware or a combination of computer software and electronic hardware. Whether these functions are performed in hardware or software depends on the specific application and design constraints of the solution. A person skilled in the art can use different methods to implement the described functions for each particular application, but such implementation should not be considered to be beyond the scope of the present application.
  • the disclosed system, apparatus, and method may be implemented in other manners.
  • the device embodiments described above are merely illustrative.
  • The division of the units is only a division by logical function.
  • Other division manners are possible; for example, multiple units or components may be combined or integrated into another system, or some features may be ignored or not executed.
  • the mutual coupling or direct coupling or communication connection shown or discussed may be an indirect coupling or communication connection through some interface, device or unit, and may be in an electrical, mechanical or other form.
  • the units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, may be located in one place, or may be distributed to multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of the embodiment.
  • each functional unit in each embodiment of the present application may be integrated into one processing unit, or each unit may exist physically separately, or two or more units may be integrated into one unit.
  • the above integrated unit can be implemented in the form of hardware or in the form of a software functional unit.
  • The integrated unit, if implemented in the form of a software functional unit and sold or used as a standalone product, may be stored in a computer readable storage medium.
  • The computer readable storage medium includes a number of instructions that cause a computer device (which may be a personal computer, a server, a network device, or the like) to perform all or part of the steps of the methods described in the embodiments of the present application.
  • The foregoing storage medium includes: a USB flash drive, a removable hard disk, a read-only memory (ROM), and a random access memory (RAM).

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The present application relates to the technical field of computers, and in particular, to an incremental data synchronization method, a computer-readable storage medium, a terminal device, and an apparatus. The method comprises: separately obtaining first incremental data from each service subsystem, the first incremental data being user data actually added to each service subsystem within a preset first period of time; aggregating the obtained first incremental data to obtain second incremental data, the second incremental data being user data that should be added to each service subsystem within the first period of time; separately comparing the first incremental data with the second incremental data to obtain a missing incremental record of each service subsystem; and separately adding the missing incremental record to the corresponding service subsystem to complete synchronization of the incremental data of each service subsystem. By means of the present application, consistency between incremental data between service subsystems is ensured.

Description

Incremental data synchronization method, readable storage medium, terminal device, and apparatus
This application claims priority to Chinese Patent Application No. CN 201710825934.2, filed with the Chinese Patent Office on September 14, 2017 and entitled "Incremental data synchronization method and terminal device", the entire contents of which are incorporated herein by reference.
技术领域Technical field
本申请属于计算机技术领域,尤其涉及一种增量数据同步方法、计算机可读存储介质、终端设备及装置。The present application belongs to the field of computer technologies, and in particular, to an incremental data synchronization method, a computer readable storage medium, a terminal device, and a device.
背景技术Background technique
目前的金融业务子系统规模庞大、结构复杂,往往会按照业务分类将整个系统划分为多个业务子系统,因此,用户的增量数据往往需要在多个业务子系统中进行同步,目前的方式主要是通过接口调用方式向多个不同的业务子系统实时记录增量数据,数据同步的一致性难以保证,出错率较高。The current financial business subsystem is large in scale and complex in structure, and the entire system is often divided into multiple service subsystems according to service classification. Therefore, the incremental data of users often needs to be synchronized in multiple service subsystems. The main purpose is to record incremental data in real time to multiple different service subsystems through interface calling. The consistency of data synchronization is difficult to guarantee and the error rate is high.
Technical Problem
In view of this, the embodiments of the present application provide an incremental data synchronization method, a computer-readable storage medium, a terminal device, and an apparatus, to solve the problem that, when incremental data is recorded in real time to multiple different service subsystems through interface calls, the consistency of data synchronization is difficult to guarantee and the error rate is high.
Technical Solution
A first aspect of the embodiments of the present application provides an incremental data synchronization method, which may include:
separately obtaining first incremental data from each service subsystem, the first incremental data being the user data actually added to each service subsystem within a preset first time period;
aggregating the obtained first incremental data to obtain second incremental data, the second incremental data being the user data that should have been added to each service subsystem within the first time period;
comparing each first incremental data with the second incremental data to obtain the incremental records missing from each service subsystem, an incremental record being the basic unit of incremental data, and each incremental record containing one piece of service information;
adding the missing incremental records to the corresponding service subsystems, to complete synchronization of the incremental data of each service subsystem.
A second aspect of the embodiments of the present application provides a computer-readable storage medium storing computer-readable instructions which, when executed by a processor, implement the following steps:
separately obtaining first incremental data from each service subsystem, the first incremental data being the user data actually added to each service subsystem within a preset first time period;
aggregating the obtained first incremental data to obtain second incremental data, the second incremental data being the user data that should have been added to each service subsystem within the first time period;
comparing each first incremental data with the second incremental data to obtain the incremental records missing from each service subsystem, an incremental record being the basic unit of incremental data, and each incremental record containing one piece of service information;
adding the missing incremental records to the corresponding service subsystems, to complete synchronization of the incremental data of each service subsystem.
A third aspect of the embodiments of the present application provides an incremental data synchronization terminal device, including a memory, a processor, and computer-readable instructions stored in the memory and executable on the processor, the processor implementing the following steps when executing the computer-readable instructions:
separately obtaining first incremental data from each service subsystem, the first incremental data being the user data actually added to each service subsystem within a preset first time period;
aggregating the obtained first incremental data to obtain second incremental data, the second incremental data being the user data that should have been added to each service subsystem within the first time period;
comparing each first incremental data with the second incremental data to obtain the incremental records missing from each service subsystem, an incremental record being the basic unit of incremental data, and each incremental record containing one piece of service information;
adding the missing incremental records to the corresponding service subsystems, to complete synchronization of the incremental data of each service subsystem.
A fourth aspect of the embodiments of the present application provides an incremental data synchronization apparatus, which may include:
an incremental data obtaining module, configured to separately obtain first incremental data from each service subsystem, the first incremental data being the user data actually added to each service subsystem within a preset first time period;
an incremental data aggregation module, configured to aggregate the obtained first incremental data to obtain second incremental data, the second incremental data being the user data that should have been added to each service subsystem within the first time period;
a first comparison module, configured to compare each first incremental data with the second incremental data to obtain the incremental records missing from each service subsystem, an incremental record being the basic unit of incremental data, and each incremental record containing one piece of service information;
a missing record adding module, configured to add the missing incremental records to the corresponding service subsystems, to complete synchronization of the incremental data of each service subsystem.
Beneficial Effects
Compared with the prior art, the embodiments of the present application have the following beneficial effects. Incremental data is first obtained from each service subsystem and then aggregated, and the aggregated incremental data serves as the synchronization standard. The incremental data of each service subsystem is compared against this standard to obtain the records in which it differs from the standard. Because all of these differences are derived from the same standard, the incremental data of the service subsystems is guaranteed to be consistent once each subsystem synchronizes its own incremental data accordingly. Moreover, because the standard aggregates the data of every service subsystem, its accuracy is well supported, and using it as the reference can greatly reduce the error rate of synchronization.
Brief Description of the Drawings
FIG. 1 is a schematic diagram of an implementation environment in an embodiment of the present application;
FIG. 2 is a flowchart of one embodiment of an incremental data synchronization method in an embodiment of the present application;
FIG. 3 is a flowchart of one embodiment of an incremental data synchronization method in an embodiment of the present application, for a scenario in which data errors may exist;
FIG. 4 is a schematic flowchart of determining the systems to be inspected in an embodiment of the present application;
FIG. 5 is a schematic block diagram of an incremental data synchronization terminal device provided by an embodiment of the present application;
FIG. 6 is a module diagram of the incremental data synchronization computer-readable instructions provided by an embodiment of the present application.
Embodiments of the Invention
FIG. 1 is a schematic diagram of an implementation environment of an embodiment of the present application. The executing entity in the embodiments is the synchronization processing system shown in the figure. The synchronization processing system is connected to each service subsystem. It reads the incremental data of each service subsystem and aggregates it, then compares the incremental data of each service subsystem with the aggregated incremental data to obtain the incremental records missing from each service subsystem, and finally adds the missing incremental records to the corresponding service subsystems, to complete synchronization of the incremental data of each service subsystem.
Referring to FIG. 2, one embodiment of an incremental data synchronization method in the embodiments of the present application may include:
Step S201: separately obtain first incremental data from each service subsystem.
The first incremental data is the user data actually added to each service subsystem within a preset first time period. In this embodiment, the synchronization processing system may read the first incremental data of each service subsystem from a preset first storage location at a preset first moment. The first incremental data is extracted by each service subsystem at a preset second moment and stored in the first storage location.
For example, a company routinely synchronizes incremental data once in the early hours of every day. At 00:05:00 on day T+1 (the second moment), each service subsystem starts a scheduled task that extracts the incremental data (the first incremental data) of its own system from 00:00:00 to 24:00:00 on day T (the first time period) and stores it in the first storage location. At 02:00:00 on day T+1 (the first moment), the synchronization processing system starts a scheduled task that reads the first incremental data from the first storage location. To ensure that the synchronization processing system can read the data successfully, and to avoid reading before any data has been stored in the first storage location, the first moment should be later than the second moment, and the interval between the first moment and the second moment should be greater than a preset time interval. The specific value of this time interval can be set according to actual conditions, and should be long enough for every service subsystem to complete data extraction and storage.
Step S202: aggregate the obtained first incremental data to obtain second incremental data.
The second incremental data is the user data that should have been added to each service subsystem within the first time period.
In this embodiment, incremental data consists of a number of incremental records. An incremental record is the basic unit of incremental data, and each incremental record contains one piece of service information. For example, each incremental record may represent one binding relationship between a user and a product account.
When aggregating the first incremental data that has been read, each incremental data can be treated as a set whose elements are incremental records, and the union of these sets is then computed. This union is the second incremental data. Specifically, all incremental records may first be extracted and combined into one incremental data; then, for multiple incremental records containing the same service information, only one is kept and the other duplicates are deleted.
For example, if the first incremental data of service subsystem 1 includes incremental record 1, incremental record 2, incremental record 3, and incremental record 4, and the first incremental data of service subsystem 2 includes incremental record 1, incremental record 4, incremental record 5, and incremental record 6, then the second incremental data obtained by aggregation includes incremental record 1, incremental record 2, incremental record 3, incremental record 4, incremental record 5, and incremental record 6.
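The aggregation in step S202 can be sketched as a plain set union. This is only an illustrative sketch: the function name `aggregate`, the subsystem names, and the use of integers to stand for incremental records are assumptions made here, not part of the application.

```python
def aggregate(first_increments):
    """Return the union of the per-subsystem record sets (the second incremental data).

    first_increments maps a (hypothetical) subsystem name to the set of
    incremental records that subsystem actually added in the first time period.
    """
    second_increment = set()
    for records in first_increments.values():
        # Set union drops records carrying the same service information.
        second_increment |= set(records)
    return second_increment


# Data from the example in the text: records are represented by their numbers.
first_increments = {
    "subsystem_1": {1, 2, 3, 4},
    "subsystem_2": {1, 4, 5, 6},
}
second_increment = aggregate(first_increments)
print(sorted(second_increment))  # [1, 2, 3, 4, 5, 6]
```

Representing records as hashable values is what lets the union remove duplicates; in a real system the record would need a key that identifies "the same service information".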
Step S203: compare each first incremental data with the second incremental data to obtain the incremental records missing from each service subsystem.
In the example above, comparing the first incremental data of service subsystem 1 with the second incremental data shows that the missing incremental records are incremental record 5 and incremental record 6, and comparing the first incremental data of service subsystem 2 with the second incremental data shows that the missing incremental records are incremental record 2 and incremental record 3.
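Under the same set-based view, the comparison in step S203 is a set difference between the second incremental data and each first incremental data. The sketch below reuses the numbers from the example; `find_missing` and the subsystem names are hypothetical.

```python
def find_missing(first_increments, second_increment):
    """For each subsystem, return the records in the aggregate that it lacks."""
    return {
        name: second_increment - set(records)
        for name, records in first_increments.items()
    }


first_increments = {
    "subsystem_1": {1, 2, 3, 4},
    "subsystem_2": {1, 4, 5, 6},
}
second_increment = {1, 2, 3, 4, 5, 6}

missing = find_missing(first_increments, second_increment)
print(sorted(missing["subsystem_1"]))  # [5, 6]
print(sorted(missing["subsystem_2"]))  # [2, 3]
```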
Step S204: add the missing incremental records to the corresponding service subsystems, to complete synchronization of the incremental data of each service subsystem.
Specifically, the synchronization processing system may store the missing incremental records in a preset second storage location at a preset third moment, so that each service subsystem can read its own missing incremental records from the second storage location at a preset fourth moment and add them to its first incremental data, obtaining the synchronized incremental data.
It is easy to see that the third moment should be later than the first moment, and that the interval between the first moment and the third moment should be greater than a certain time interval, long enough for the synchronization processing system to complete reading, aggregation, and comparison. Likewise, the fourth moment should be later than the third moment, and the interval between the third moment and the fourth moment should be greater than a certain time interval, long enough for the synchronization processing system to complete the storing process.
For example, the synchronization processing system starts a scheduled task at 02:00:00 on day T+1 (the first moment) to perform reading, aggregation, and comparison, and starts a scheduled task at 05:00:00 on day T+1 (the third moment) to perform storing. Each service subsystem starts a scheduled task at 06:00:00 on day T+1 (the fourth moment) to perform reading and adding.
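The ordering constraints on the four moments can be checked mechanically. The following sketch assumes a single minimum gap applied between every pair of consecutive moments, which is a simplification (the text allows a different interval for each pair); the function name, the gap values, and the calendar date are illustrative assumptions.

```python
from datetime import datetime, timedelta


def schedule_is_valid(moments, min_gap):
    """Check that each moment is later than the previous one by more than min_gap."""
    return all(
        later - earlier > min_gap
        for earlier, later in zip(moments, moments[1:])
    )


# Arbitrary date standing in for day T+1; only the times of day come from the text.
day = datetime(2018, 1, 2)
moments = [
    day.replace(hour=0, minute=5),  # second moment: subsystems extract and store
    day.replace(hour=2),            # first moment: sync system reads, aggregates, compares
    day.replace(hour=5),            # third moment: sync system stores the differences
    day.replace(hour=6),            # fourth moment: subsystems read and apply them
]

print(schedule_is_valid(moments, timedelta(minutes=30)))  # True
print(schedule_is_valid(moments, timedelta(hours=2)))     # False
```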
The embodiment above rests on the premise that the data of every service subsystem is trustworthy. In practice, however, the data of a service subsystem may contain errors, and this needs to be taken into account. Referring to FIG. 3, one embodiment for a scenario in which data errors may exist may include:
Step S301: separately obtain first incremental data from each service subsystem.
Step S301 is the same as step S201 in the embodiment above. For details, refer to the description of that embodiment, which is not repeated here.
Step S302: aggregate the obtained first incremental data to obtain second incremental data.
Step S302 is the same as step S202 in the embodiment above. For details, refer to the description of that embodiment, which is not repeated here.
Step S303: calculate the credibility of each incremental record in the second incremental data according to the preset data credibility of each service subsystem.
The data credibility of each service subsystem is determined from historical statistics. The more modification operations a service subsystem has performed on its incremental data in the historical statistics (including operations that add incremental records and operations that delete incremental records), the higher the probability that its data contains errors, and the lower its data credibility. The data credibility of a service subsystem takes a value in the range [0, 1]: 0 means its data is completely untrustworthy, and 1 means its data is completely trustworthy.
In this embodiment, the credibility of an incremental record can be calculated as follows: sum the data credibility of every service subsystem in which the record appears, then divide by the total number of service subsystems. The resulting quotient is the credibility of the incremental record. Similarly, the credibility of an incremental record takes a value in the range [0, 1]: 0 means the record is completely untrustworthy, and 1 means it is completely trustworthy.
For example, suppose there are three service subsystems, service subsystem 1, service subsystem 2, and service subsystem 3, with data credibility 0.9, 0.8, and 0.7 respectively. If incremental record 1 appears in service subsystem 2 and service subsystem 3 but not in service subsystem 1, then the credibility of incremental record 1 is (0.8+0.7)/3=0.5.
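The credibility formula of step S303 can be written out as a short sketch. The names `record_credibility`, `subsystem_credibility`, and `"record_1"` are illustrative assumptions; the credibility values and the worked result come from the example in the text.

```python
def record_credibility(first_increments, subsystem_credibility):
    """Credibility of each record: sum of the credibility of the subsystems
    that contain it, divided by the total number of subsystems."""
    total_subsystems = len(subsystem_credibility)
    sums = {}
    for name, records in first_increments.items():
        for record in set(records):
            sums[record] = sums.get(record, 0.0) + subsystem_credibility[name]
    return {record: s / total_subsystems for record, s in sums.items()}


subsystem_credibility = {"subsystem_1": 0.9, "subsystem_2": 0.8, "subsystem_3": 0.7}
first_increments = {
    "subsystem_1": set(),          # record_1 is absent here
    "subsystem_2": {"record_1"},
    "subsystem_3": {"record_1"},
}

credibility = record_credibility(first_increments, subsystem_credibility)
# (0.8 + 0.7) / 3, i.e. about 0.5 up to floating-point rounding
print(round(credibility["record_1"], 6))
```

The preferred records of step S304 would then be those whose value in `credibility` exceeds the chosen second threshold.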
Step S304: from the incremental records in the second incremental data, extract the preferred records, that is, those whose credibility is greater than a preset second threshold.
The value of the second threshold can be set according to actual conditions and is not specifically limited in this embodiment. Note, however, that the larger the value, the fewer preferred records are extracted, and the smaller the value, the more preferred records are extracted.
Optionally, the following processing can be used as a simplification of steps S303 and S304: for each incremental record in the second incremental data, count the total number of times it appears across the first incremental data; then, from the incremental records in the second incremental data, extract as preferred records those whose total count is greater than a preset first threshold, the first threshold being determined by the total number of service subsystems. For example, if the total number of service subsystems is 10 and the first threshold is set to half of that total, namely 5, and incremental record 1 appears 4 times in the first incremental data while incremental record 2 appears 6 times, then incremental record 2 is a preferred record and incremental record 1 is not.
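The count-based simplification can be sketched with a counter over the per-subsystem record sets. The function name and the way the ten example subsystems are constructed below are assumptions made for illustration; only the counts (4 and 6) and the threshold (5) come from the text.

```python
from collections import Counter


def preferred_by_count(first_increments, first_threshold):
    """Keep records that appear in more than first_threshold subsystems."""
    counts = Counter(
        record
        for records in first_increments
        for record in set(records)  # count each subsystem at most once per record
    )
    return {record for record, n in counts.items() if n > first_threshold}


# Ten subsystems: record_1 appears in 4 of them, record_2 in 6 of them.
first_increments = (
    [{"record_1", "record_2"}] * 4
    + [{"record_2"}] * 2
    + [set()] * 4
)
first_threshold = len(first_increments) // 2  # half of 10, i.e. 5

print(preferred_by_count(first_increments, first_threshold))  # {'record_2'}
```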
Step S305: combine the extracted preferred records into the preferred second incremental data.
Compared with the initial second incremental data, the preferred second incremental data keeps only the incremental records with higher credibility, so the synchronization result based on it is more accurate.
Step S306: compare each first incremental data with the preferred second incremental data to obtain the missing incremental records and the redundant incremental records of each service subsystem.
For example, suppose the first incremental data of service subsystem 1 includes incremental record 1, incremental record 2, incremental record 3, and incremental record 4, the first incremental data of service subsystem 2 includes incremental record 1, incremental record 2, incremental record 3, and incremental record 5, and the preferred second incremental data includes incremental record 1, incremental record 2, incremental record 3, and incremental record 5.
Comparing the first incremental data of service subsystem 1 with the preferred second incremental data shows that the missing incremental record is incremental record 5 and the redundant incremental record is incremental record 4. Comparing the first incremental data of service subsystem 2 with the preferred second incremental data shows that the two are identical, with neither missing nor redundant incremental records.
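Step S306 amounts to two set differences per subsystem, one in each direction. The sketch below reproduces the example; `diff_against_preferred` and the dictionary layout of the result are hypothetical choices.

```python
def diff_against_preferred(first_increments, preferred):
    """For each subsystem, return the records to add and the records to delete."""
    result = {}
    for name, records in first_increments.items():
        records = set(records)
        result[name] = {
            "missing": preferred - records,    # in the standard, absent locally
            "redundant": records - preferred,  # present locally, not in the standard
        }
    return result


first_increments = {
    "subsystem_1": {1, 2, 3, 4},
    "subsystem_2": {1, 2, 3, 5},
}
preferred = {1, 2, 3, 5}

diff = diff_against_preferred(first_increments, preferred)
print(diff["subsystem_1"])  # {'missing': {5}, 'redundant': {4}}
print(diff["subsystem_2"])  # {'missing': set(), 'redundant': set()}
```

Applying both differences to a subsystem (add the missing records, delete the redundant ones) leaves it equal to the preferred second incremental data, which is the effect step S307 describes.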
Step S307: add the missing incremental records to the corresponding service subsystems, and delete the redundant incremental records from the corresponding service subsystems.
Specifically, the synchronization processing system may store the missing incremental records and the redundant incremental records in a preset second storage location, so that each service subsystem can read its own missing and redundant incremental records from the second storage location, add the missing incremental records to its first incremental data, and delete the redundant incremental records from its first incremental data, obtaining the synchronized incremental data.
Preferably, service subsystems with low data credibility should be inspected in order to reduce the probability of errors. Specifically, this may include the steps shown in FIG. 4:
Step S401: count the total number of historical operation records of each service subsystem.
The historical operation records are the records of operations performed by the service subsystem during synchronization within a preset second time period (including operations that add incremental records and operations that delete incremental records). To avoid the influence of short-term bursts, the second time period should be sufficiently long, for example one week or one month.
Step S402: determine the service subsystems whose total number of historical operation records is greater than a preset third threshold as the systems to be inspected;
The larger the total number of historical operation records, the higher the probability that the subsystem's data contains errors. The value of the third threshold can be set according to actual conditions and is not specifically limited in this embodiment. Note, however, that the larger the value, the fewer systems to be inspected are identified, and the smaller the value, the more systems to be inspected are identified.
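Steps S401 and S402 reduce to counting and thresholding. In the sketch below, the operation counts and the third threshold are invented purely for illustration, and `systems_to_inspect` is a hypothetical name.

```python
def systems_to_inspect(operation_counts, third_threshold):
    """Return the subsystems whose historical operation count exceeds the threshold."""
    return sorted(
        name
        for name, count in operation_counts.items()
        if count > third_threshold
    )


# Hypothetical totals of add/delete operations over the second time period.
operation_counts = {
    "subsystem_1": 12,
    "subsystem_2": 340,
    "subsystem_3": 75,
}

print(systems_to_inspect(operation_counts, third_threshold=100))  # ['subsystem_2']
print(systems_to_inspect(operation_counts, third_threshold=50))   # ['subsystem_2', 'subsystem_3']
```

The two calls illustrate the trade-off noted in the text: raising the threshold selects fewer subsystems for inspection, and lowering it selects more.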
Step S403: notify the designated staff to inspect the systems to be inspected.
In summary, in the embodiments of the present application, incremental data is first obtained from each service subsystem and then aggregated, and the aggregated incremental data serves as the synchronization standard. The incremental data of each service subsystem is compared against this standard to obtain the records in which it differs from the standard. Because all of these differences are derived from the same standard, the incremental data of the service subsystems is guaranteed to be consistent once each subsystem synchronizes its own incremental data accordingly. Moreover, because the standard aggregates the data of every service subsystem, its accuracy is well supported, and using it as the reference can greatly reduce the error rate of synchronization.
Corresponding to the incremental data synchronization method described in the embodiments above, FIG. 5 shows a schematic block diagram of the incremental data synchronization terminal device provided by an embodiment of the present application. For ease of description, only the parts related to the embodiments of the present application are shown.
In this embodiment, the incremental data synchronization terminal device may be a computing device such as a desktop computer, a notebook, a palmtop computer, or a cloud server. The incremental data synchronization terminal device may include a processor 50, a memory 51, and computer-readable instructions 52 stored in the memory 51 and executable on the processor 50.
The processor 50 may be a Central Processing Unit (CPU), or another general-purpose processor, a Digital Signal Processor (DSP), an Application Specific Integrated Circuit (ASIC), a Field-Programmable Gate Array (FPGA) or other programmable logic device, a discrete gate or transistor logic device, a discrete hardware component, or the like. The general-purpose processor may be a microprocessor, or the processor may be any conventional processor.
The memory 51 may be an internal storage unit of the incremental data synchronization terminal device 5, such as a hard disk or memory of the incremental data synchronization terminal device 5. The memory 51 may also be an external storage device of the incremental data synchronization terminal device 5, such as a plug-in hard disk, a Smart Media Card (SMC), a Secure Digital (SD) card, or a flash card fitted to the incremental data synchronization terminal device 5. Further, the memory 51 may include both an internal storage unit and an external storage device of the incremental data synchronization terminal device 5. The memory 51 is used to store the computer-readable instructions and other instructions and data required by the incremental data synchronization terminal device 5. The memory 51 can also be used to temporarily store data that has been output or is about to be output.
请参阅图6,是本申请实施例提供的计算机可读指令52的模块图。在本实施例中,所述的计算机可读指令52可以被分割成一个或多个模块,所述一个或者多个模块被存储于所述存储器51中,并由所述处理器50所执行,以完成本申请。例如,在图6中,所述的计算机可读指令52,也即增量数据同步计算机可读指令可以被分割成增量数据获取模块601、增量数据汇总模块602、第一比对模块603、缺失记录添加模块604。以下描述将具体介绍所述模块601-604的功能。Please refer to FIG. 6, which is a block diagram of computer readable instructions 52 provided by an embodiment of the present application. In the present embodiment, the computer readable instructions 52 may be partitioned into one or more modules, the one or more modules being stored in the memory 51 and executed by the processor 50, To complete this application. For example, in FIG. 6, the computer readable instructions 52, ie, incremental data synchronization computer readable instructions, may be segmented into an incremental data acquisition module 601, an incremental data summary module 602, and a first alignment module 603. The missing record adding module 604. The following description will specifically describe the functions of the modules 601-604.
The incremental data acquisition module 601 is configured to obtain first incremental data from each service subsystem, the first incremental data being the user data actually added by that service subsystem within a preset first time period;
The incremental data aggregation module 602 is configured to aggregate the obtained first incremental data to obtain second incremental data, the second incremental data being the user data that each service subsystem should have added within the first time period;
The first comparison module 603 is configured to compare each piece of first incremental data with the second incremental data to obtain the incremental records missing from each service subsystem, an incremental record being the basic unit of incremental data, each incremental record containing one piece of service information;
The missing record adding module 604 is configured to add the missing incremental records to the corresponding service subsystems, thereby completing synchronization of the incremental data of the service subsystems.
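As a rough illustration of how modules 601 to 604 fit together, the following Python sketch treats each incremental record as a hashable identifier and assumes that aggregation means taking the set union of all first incremental data. Both points are assumptions made for illustration, and the names `find_missing_records` and `synchronize` are hypothetical rather than taken from the application.

```python
from typing import Dict, Set


def find_missing_records(first: Dict[str, Set[str]]) -> Dict[str, Set[str]]:
    """Return, per subsystem, the records it lacks relative to the aggregate."""
    # Second incremental data: assumed here to be the union of all
    # first incremental data (what every subsystem should have added).
    second: Set[str] = set().union(*first.values())
    # Comparing each first incremental data set with the aggregate
    # yields the records missing from that subsystem.
    return {name: second - records for name, records in first.items()}


def synchronize(first: Dict[str, Set[str]]) -> Dict[str, Set[str]]:
    """Return a copy of the data with the missing records added."""
    missing = find_missing_records(first)
    return {name: records | missing[name] for name, records in first.items()}


# Hypothetical subsystem names and record identifiers.
first_data = {"loans": {"r1", "r2"}, "insurance": {"r2", "r3"}}
synced = synchronize(first_data)
```

In this sketch every subsystem ends up holding the same set of records, which is the synchronized state the four modules aim for.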
Optionally, the computer readable instructions 52 may further include:
An occurrence counting module, configured to count, for each incremental record in the second incremental data, the total number of times the record appears in the first incremental data;
A first preferred record extraction module, configured to extract, from the incremental records in the second incremental data, preferred records whose total count is greater than a preset first threshold, the first threshold being determined by the total number of service subsystems;
A first composition module, configured to form the preferred second incremental data from the extracted preferred records.
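The count-based selection can be sketched in Python as follows. The text only states that the first threshold is determined by the total number of service subsystems; the default used here, half the number of subsystems (a simple majority), is an assumption for illustration, and `select_by_count` is a hypothetical name.

```python
from collections import Counter
from typing import Dict, Optional, Set


def select_by_count(
    first: Dict[str, Set[str]], first_threshold: Optional[float] = None
) -> Set[str]:
    """Keep records reported by more subsystems than the first threshold."""
    # Each subsystem holds a set, so a record is counted once per subsystem.
    counts = Counter(rec for records in first.values() for rec in records)
    if first_threshold is None:
        # Assumed rule: threshold is half the number of subsystems.
        first_threshold = len(first) / 2
    return {rec for rec, n in counts.items() if n > first_threshold}


# Hypothetical example with three subsystems.
sample = {"A": {"r1", "r2"}, "B": {"r1", "r2", "r3"}, "C": {"r1"}}
preferred = select_by_count(sample)
```

Here `r3` is reported by only one of three subsystems, so under the assumed majority rule it is left out of the preferred second incremental data.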
Optionally, the computer readable instructions 52 may further include:
A credibility calculation module, configured to calculate the credibility of each incremental record in the second incremental data according to the preset data credibility of each service subsystem;
A second preferred record extraction module, configured to extract, from the incremental records in the second incremental data, preferred records whose credibility is greater than a preset second threshold;
A second composition module, configured to form the preferred second incremental data from the extracted preferred records.
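A minimal Python sketch of the credibility-based selection follows. This passage does not say how a record's credibility is derived from the subsystem credibilities, so the rule used here, summing the credibility of every subsystem that reports the record, is purely an assumption, as are the name `select_by_credibility` and the sample weights.

```python
from typing import Dict, Set


def select_by_credibility(
    first: Dict[str, Set[str]],
    subsystem_credibility: Dict[str, float],
    second_threshold: float,
) -> Set[str]:
    """Keep records whose accumulated credibility exceeds the threshold."""
    scores: Dict[str, float] = {}
    for name, records in first.items():
        for rec in records:
            # Assumed rule: a record's credibility is the sum of the
            # credibilities of the subsystems that contain it.
            scores[rec] = scores.get(rec, 0.0) + subsystem_credibility[name]
    return {rec for rec, score in scores.items() if score > second_threshold}


# Hypothetical subsystems, weights, and records.
reported = {"A": {"r1"}, "B": {"r1", "r2"}, "C": {"r2", "r3"}}
weights = {"A": 0.5, "B": 0.3, "C": 0.2}
trusted = select_by_credibility(reported, weights, second_threshold=0.6)
```

Unlike the count-based variant, this weighting lets a record reported by one highly trusted subsystem outrank a record reported by several less trusted ones.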
Further, the computer readable instructions 52 may further include:
A second comparison module, configured to compare each piece of first incremental data with the preferred second incremental data to obtain the redundant incremental records of each service subsystem;
A redundant record deletion module, configured to delete the redundant incremental records from the corresponding service subsystems.
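Redundant record detection can be read as the mirror image of missing record detection: instead of asking what the aggregate has that a subsystem lacks, it asks what a subsystem has that the preferred second incremental data lacks. The Python sketch below assumes records are hashable identifiers; the function names are hypothetical.

```python
from typing import Dict, Set


def find_redundant_records(
    first: Dict[str, Set[str]], preferred_second: Set[str]
) -> Dict[str, Set[str]]:
    """Return, per subsystem, the records absent from the preferred set."""
    return {name: records - preferred_second for name, records in first.items()}


def delete_redundant_records(
    first: Dict[str, Set[str]], preferred_second: Set[str]
) -> Dict[str, Set[str]]:
    """Return a copy of the data with redundant records removed."""
    return {name: records & preferred_second for name, records in first.items()}


# Hypothetical data: "r3" did not survive preferred-record selection.
held = {"A": {"r1", "r2"}, "B": {"r1", "r3"}}
kept = {"r1", "r2"}
redundant = find_redundant_records(held, kept)
cleaned = delete_redundant_records(held, kept)
```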
Further, the computer readable instructions 52 may further include:
An operation statistics module, configured to count the total number of historical operation records of each service subsystem, a historical operation record being a record of an operation performed by the service subsystem during synchronization processing within a preset second time period;
A to-be-investigated system determining module, configured to determine each service subsystem whose total number of historical operation records is greater than a preset third threshold to be a system to be investigated;
An investigation notification module, configured to notify designated personnel to investigate the system to be investigated.
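The operation statistics step can be sketched in Python as below. The representation of a historical operation record as a `(subsystem, timestamp)` pair and the function name `systems_to_investigate` are assumptions for illustration; the notification step is omitted because this passage does not specify a delivery channel.

```python
from collections import Counter
from typing import Iterable, List, Tuple


def systems_to_investigate(
    operations: Iterable[Tuple[str, float]],
    window_start: float,
    window_end: float,
    third_threshold: int,
) -> List[str]:
    """List subsystems with too many sync operations in the time window."""
    # Count only operations that fall inside the second time period.
    counts = Counter(
        name for name, ts in operations if window_start <= ts <= window_end
    )
    return sorted(name for name, n in counts.items() if n > third_threshold)


# Hypothetical operation log: (subsystem name, timestamp in seconds).
log = [("a", 1.0), ("a", 2.0), ("a", 3.0), ("b", 2.0), ("a", 99.0)]
flagged = systems_to_investigate(log, 0.0, 10.0, third_threshold=2)
```

A subsystem that repeatedly needs records added or deleted during synchronization is likely to have an underlying fault, which is why a high operation count is used as the trigger for investigation.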
Those skilled in the art will clearly understand that, for convenience and brevity of description, the specific working processes of the systems, apparatuses, and units described above may be found in the corresponding processes of the foregoing method embodiments and are not repeated here.
In the above embodiments, the description of each embodiment has its own emphasis. For parts that are not detailed or recorded in one embodiment, refer to the related descriptions of the other embodiments.
Those of ordinary skill in the art will appreciate that the modules, units, and/or method steps of the embodiments disclosed herein can be implemented in electronic hardware, or in a combination of computer software and electronic hardware. Whether these functions are performed in hardware or software depends on the specific application and design constraints of the technical solution. A skilled person may use different methods to implement the described functions for each particular application, but such implementations should not be considered beyond the scope of the present application.
In the several embodiments provided by the present application, it should be understood that the disclosed systems, apparatuses, and methods may be implemented in other ways. For example, the apparatus embodiments described above are merely illustrative. The division into units is only a division by logical function, and other divisions are possible in an actual implementation: multiple units or components may be combined or integrated into another system, or some features may be omitted or not executed. In addition, the mutual coupling, direct coupling, or communication connection shown or discussed may be an indirect coupling or communication connection through some interfaces, apparatuses, or units, and may be electrical, mechanical, or of another form.
The units described as separate components may or may not be physically separate, and the components displayed as units may or may not be physical units; that is, they may be located in one place or distributed across multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of this embodiment.
In addition, the functional units in the embodiments of the present application may be integrated into one processing unit, each unit may exist physically on its own, or two or more units may be integrated into one unit. The integrated unit may be implemented in the form of hardware or in the form of a software functional unit.
If the integrated unit is implemented in the form of a software functional unit and sold or used as a standalone product, it may be stored in a computer readable storage medium. Based on this understanding, the technical solution of the present application in essence, or the part that contributes to the prior art, or all or part of the technical solution, may be embodied in the form of a software product. The computer software product is stored in a storage medium and includes a number of instructions that cause a computer device (which may be a personal computer, a server, a network device, or the like) to perform all or some of the steps of the methods described in the embodiments of the present application. The storage medium includes any medium that can store computer readable instructions, such as a USB flash drive, a removable hard disk, a read-only memory (ROM), a random access memory (RAM), a magnetic disk, or an optical disc.
The above embodiments are intended only to illustrate the technical solutions of the present application, not to limit them. Although the present application has been described in detail with reference to the foregoing embodiments, those of ordinary skill in the art should understand that the technical solutions described in the foregoing embodiments may still be modified, or some of their technical features may be replaced by equivalents, and that such modifications or replacements do not cause the essence of the corresponding technical solutions to depart from the spirit and scope of the technical solutions of the embodiments of the present application.

Claims (20)

  1. An incremental data synchronization method, comprising:
    obtaining first incremental data from each service subsystem, the first incremental data being the user data actually added by that service subsystem within a preset first time period;
    aggregating the obtained first incremental data to obtain second incremental data, the second incremental data being the user data that each service subsystem should have added within the first time period;
    comparing each piece of first incremental data with the second incremental data to obtain the incremental records missing from each service subsystem, an incremental record being the basic unit of incremental data, each incremental record containing one piece of service information; and
    adding the missing incremental records to the corresponding service subsystems, thereby completing synchronization of the incremental data of the service subsystems.
  2. The incremental data synchronization method according to claim 1, further comprising, after the second incremental data is obtained:
    counting, for each incremental record in the second incremental data, the total number of times the record appears in the first incremental data;
    extracting, from the incremental records in the second incremental data, preferred records whose total count is greater than a preset first threshold, the first threshold being determined by the total number of service subsystems; and
    forming the preferred second incremental data from the extracted preferred records.
  3. The incremental data synchronization method according to claim 1, further comprising, after the second incremental data is obtained:
    calculating the credibility of each incremental record in the second incremental data according to the preset data credibility of each service subsystem;
    extracting, from the incremental records in the second incremental data, preferred records whose credibility is greater than a preset second threshold; and
    forming the preferred second incremental data from the extracted preferred records.
  4. The incremental data synchronization method according to claim 2 or 3, further comprising:
    comparing each piece of first incremental data with the preferred second incremental data to obtain the redundant incremental records of each service subsystem; and
    deleting the redundant incremental records from the corresponding service subsystems.
  5. The incremental data synchronization method according to any one of claims 1 to 3, further comprising:
    counting the total number of historical operation records of each service subsystem, a historical operation record being a record of an operation performed by the service subsystem during synchronization processing within a preset second time period;
    determining each service subsystem whose total number of historical operation records is greater than a preset third threshold to be a system to be investigated; and
    notifying designated personnel to investigate the system to be investigated.
  6. A computer readable storage medium storing computer readable instructions, wherein the computer readable instructions, when executed by a processor, implement the following steps:
    obtaining first incremental data from each service subsystem, the first incremental data being the user data actually added by that service subsystem within a preset first time period;
    aggregating the obtained first incremental data to obtain second incremental data, the second incremental data being the user data that each service subsystem should have added within the first time period;
    comparing each piece of first incremental data with the second incremental data to obtain the incremental records missing from each service subsystem, an incremental record being the basic unit of incremental data, each incremental record containing one piece of service information; and
    adding the missing incremental records to the corresponding service subsystems, thereby completing synchronization of the incremental data of the service subsystems.
  7. The computer readable storage medium according to claim 6, wherein, after the second incremental data is obtained, the steps further comprise:
    counting, for each incremental record in the second incremental data, the total number of times the record appears in the first incremental data;
    extracting, from the incremental records in the second incremental data, preferred records whose total count is greater than a preset first threshold, the first threshold being determined by the total number of service subsystems; and
    forming the preferred second incremental data from the extracted preferred records.
  8. The computer readable storage medium according to claim 6, wherein, after the second incremental data is obtained, the steps further comprise:
    calculating the credibility of each incremental record in the second incremental data according to the preset data credibility of each service subsystem;
    extracting, from the incremental records in the second incremental data, preferred records whose credibility is greater than a preset second threshold; and
    forming the preferred second incremental data from the extracted preferred records.
  9. The computer readable storage medium according to claim 7 or 8, wherein the steps further comprise:
    comparing each piece of first incremental data with the preferred second incremental data to obtain the redundant incremental records of each service subsystem; and
    deleting the redundant incremental records from the corresponding service subsystems.
  10. The computer readable storage medium according to any one of claims 6 to 8, wherein the steps further comprise:
    counting the total number of historical operation records of each service subsystem, a historical operation record being a record of an operation performed by the service subsystem during synchronization processing within a preset second time period;
    determining each service subsystem whose total number of historical operation records is greater than a preset third threshold to be a system to be investigated; and
    notifying designated personnel to investigate the system to be investigated.
  11. An incremental data synchronization terminal device, comprising a memory, a processor, and computer readable instructions stored in the memory and executable on the processor, wherein the processor, when executing the computer readable instructions, implements the following steps:
    obtaining first incremental data from each service subsystem, the first incremental data being the user data actually added by that service subsystem within a preset first time period;
    aggregating the obtained first incremental data to obtain second incremental data, the second incremental data being the user data that each service subsystem should have added within the first time period;
    comparing each piece of first incremental data with the second incremental data to obtain the incremental records missing from each service subsystem, an incremental record being the basic unit of incremental data, each incremental record containing one piece of service information; and
    adding the missing incremental records to the corresponding service subsystems, thereby completing synchronization of the incremental data of the service subsystems.
  12. The incremental data synchronization terminal device according to claim 11, wherein, after the second incremental data is obtained, the steps further comprise:
    counting, for each incremental record in the second incremental data, the total number of times the record appears in the first incremental data;
    extracting, from the incremental records in the second incremental data, preferred records whose total count is greater than a preset first threshold, the first threshold being determined by the total number of service subsystems; and
    forming the preferred second incremental data from the extracted preferred records.
  13. The incremental data synchronization terminal device according to claim 11, wherein, after the second incremental data is obtained, the steps further comprise:
    calculating the credibility of each incremental record in the second incremental data according to the preset data credibility of each service subsystem;
    extracting, from the incremental records in the second incremental data, preferred records whose credibility is greater than a preset second threshold; and
    forming the preferred second incremental data from the extracted preferred records.
  14. The incremental data synchronization terminal device according to claim 12 or 13, wherein the steps further comprise:
    comparing each piece of first incremental data with the preferred second incremental data to obtain the redundant incremental records of each service subsystem; and
    deleting the redundant incremental records from the corresponding service subsystems.
  15. The incremental data synchronization terminal device according to any one of claims 11 to 13, wherein the steps further comprise:
    counting the total number of historical operation records of each service subsystem, a historical operation record being a record of an operation performed by the service subsystem during synchronization processing within a preset second time period;
    determining each service subsystem whose total number of historical operation records is greater than a preset third threshold to be a system to be investigated; and
    notifying designated personnel to investigate the system to be investigated.
  16. An incremental data synchronization apparatus, comprising:
    an incremental data acquisition module, configured to obtain first incremental data from each service subsystem, the first incremental data being the user data actually added by that service subsystem within a preset first time period;
    an incremental data aggregation module, configured to aggregate the obtained first incremental data to obtain second incremental data, the second incremental data being the user data that each service subsystem should have added within the first time period;
    a first comparison module, configured to compare each piece of first incremental data with the second incremental data to obtain the incremental records missing from each service subsystem, an incremental record being the basic unit of incremental data, each incremental record containing one piece of service information; and
    a missing record adding module, configured to add the missing incremental records to the corresponding service subsystems, thereby completing synchronization of the incremental data of the service subsystems.
  17. The incremental data synchronization apparatus according to claim 16, further comprising:
    an occurrence counting module, configured to count, for each incremental record in the second incremental data, the total number of times the record appears in the first incremental data;
    a first preferred record extraction module, configured to extract, from the incremental records in the second incremental data, preferred records whose total count is greater than a preset first threshold, the first threshold being determined by the total number of service subsystems; and
    a first composition module, configured to form the preferred second incremental data from the extracted preferred records.
  18. The incremental data synchronization apparatus according to claim 16, further comprising:
    a credibility calculation module, configured to calculate the credibility of each incremental record in the second incremental data according to the preset data credibility of each service subsystem;
    a second preferred record extraction module, configured to extract, from the incremental records in the second incremental data, preferred records whose credibility is greater than a preset second threshold; and
    a second composition module, configured to form the preferred second incremental data from the extracted preferred records.
  19. The incremental data synchronization apparatus according to claim 17 or 18, further comprising:
    a second comparison module, configured to compare each piece of first incremental data with the preferred second incremental data to obtain the redundant incremental records of each service subsystem; and
    a redundant record deletion module, configured to delete the redundant incremental records from the corresponding service subsystems.
  20. The incremental data synchronization apparatus according to any one of claims 16 to 18, further comprising:
    an operation statistics module, configured to count the total number of historical operation records of each service subsystem, a historical operation record being a record of an operation performed by the service subsystem during synchronization processing within a preset second time period;
    a to-be-investigated system determining module, configured to determine each service subsystem whose total number of historical operation records is greater than a preset third threshold to be a system to be investigated; and
    an investigation notification module, configured to notify designated personnel to investigate the system to be investigated.
PCT/CN2018/074869 2017-09-14 2018-01-31 Incremental data synchronization method, readable storage medium, terminal device, and apparatus WO2019052112A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201710825934.2 2017-09-14
CN201710825934.2A CN107689998B (en) 2017-09-14 2017-09-14 Incremental data synchronization method, terminal equipment and computer readable storage medium

Publications (1)

Publication Number Publication Date
WO2019052112A1 true WO2019052112A1 (en) 2019-03-21

Family

ID=61156265

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2018/074869 WO2019052112A1 (en) 2017-09-14 2018-01-31 Incremental data synchronization method, readable storage medium, terminal device, and apparatus

Country Status (2)

Country Link
CN (1) CN107689998B (en)
WO (1) WO2019052112A1 (en)

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108573042B (en) * 2018-04-10 2022-06-10 平安科技(深圳)有限公司 Report synchronization method, electronic equipment and computer readable storage medium
CN113016166B (en) * 2018-11-14 2023-01-06 华为技术有限公司 Multimedia service transmission method and device
CN110781197B (en) * 2019-09-29 2023-09-26 北京淇瑀信息科技有限公司 Hive offline synchronous verification method and device and electronic equipment
CN110837537B (en) * 2019-10-12 2024-05-31 平安银行股份有限公司 Order data synchronization method, order data synchronization device, computer readable storage medium and server
CN110716939B (en) * 2019-10-16 2023-05-09 深圳市网心科技有限公司 Data management method, electronic device, system and medium
CN114116842B (en) * 2021-11-25 2023-05-19 上海柯林布瑞信息技术有限公司 Multidimensional medical data real-time acquisition method and device, electronic equipment and storage medium

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101577817A (en) * 2009-06-10 2009-11-11 中兴通讯股份有限公司 Distributed video monitoring system and method thereof for keeping data consistency
CN102096685A (en) * 2009-12-11 2011-06-15 阿里巴巴集团控股有限公司 Method and device for synchronizing distributive data into data warehouse
CN103823884A (en) * 2014-03-06 2014-05-28 国家电网公司 Maintenance right-based loose consistency synchronization method for grid calculating data

Also Published As

Publication number Publication date
CN107689998B (en) 2020-02-14
CN107689998A (en) 2018-02-13

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 18856726

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205 DATED 24/09/2020)

122 Ep: pct application non-entry in european phase

Ref document number: 18856726

Country of ref document: EP

Kind code of ref document: A1