US20220156278A1 - Database Data Replication Tool - Google Patents

Publication number
US20220156278A1
Authority
US
United States
Prior art keywords
data
api
replication
integrated
manager engine
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US17/099,059
Inventor
Yoann Baumert
Clement Delaunay
Daniel-Florin Chis
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
ADP Inc
Original Assignee
ADP Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by ADP Inc
Priority to US17/099,059 (US20220156278A1)
Assigned to ADP, LLC: assignment of assignors' interest (see document for details). Assignors: BAUMERT, Yoann; CHIS, Daniel-Florin; DELAUNAY, Clement
Priority to PCT/US2021/046404 (WO2022103461A1)
Assigned to ADP, INC.: change of name (see document for details). Assignors: ADP, LLC
Publication of US20220156278A1
Priority to US18/113,982 (US20240020314A1)
Current legal status: Abandoned

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/27Replication, distribution or synchronisation of data between databases or within a distributed database system; Distributed database system architectures therefor
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/22Indexing; Data structures therefor; Storage structures
    • G06F16/2282Tablespace storage structures; Management thereof
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/23Updating
    • G06F16/2358Change logging, detection, and notification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/25Integrating or interfacing systems involving database management systems
    • G06F16/258Data format conversion from or to a database
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/44Arrangements for executing specific programs
    • G06F9/455Emulation; Interpretation; Software simulation, e.g. virtualisation or emulation of application or operating system execution engines
    • G06F9/45533Hypervisors; Virtual machine monitors
    • G06F9/45558Hypervisor-specific management and integration aspects
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5005Allocation of resources, e.g. of the central processing unit [CPU] to service a request
    • G06F9/5011Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resources being hardware resources other than CPUs, Servers and Terminals
    • G06F9/5016Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resources being hardware resources other than CPUs, Servers and Terminals the resource being the memory
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5083Techniques for rebalancing the load in a distributed system
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/44Arrangements for executing specific programs
    • G06F9/455Emulation; Interpretation; Software simulation, e.g. virtualisation or emulation of application or operating system execution engines
    • G06F9/45533Hypervisors; Virtual machine monitors
    • G06F9/45558Hypervisor-specific management and integration aspects
    • G06F2009/45583Memory management, e.g. access or allocation

Definitions

  • The present disclosure relates generally to an improved computer system and, in particular, to replicating data for communication between a database and another database or messaging system.
  • Moving data across heterogeneous systems typically requires replication, filtering, and transformation of the data.
  • the heterogeneous systems might comprise different databases.
  • Specialized data replication tools can be employed to facilitate this data replication between systems.
  • these replication tools are often complex, expensive, and require continual maintenance work.
  • An illustrative embodiment provides a computer-implemented method for data replication between databases.
  • the method comprises receiving, through an application programming interface (API) integrated into a data replication tool, a request from a system of record (SoR) application for a replication setup and initialization.
  • a number of data instances are imported through the API from a source database associated with the SoR.
  • a manager engine integrated into the data replication tool initiates a separate replication process for each data instance.
  • the manager engine is configured to support multiple data instances on a single virtual machine instance. If a difference between a data description language (DDL) of the replicated data instances and a DDL of a target database is detected, the DDL of the replicated data instances is automatically changed to the DDL of the target database.
  • the replicated data instances are then exported to the target database.
  • the system comprises a storage device configured to store program instructions and one or more processors operably connected to the storage device and configured to execute the program instructions to cause the system to: receive, through an application programming interface (API) integrated into a data replication tool, a request from a system of record (SoR) application for a replication setup and initialization; import, through the API, a number of data instances from a source database associated with the SoR; initiate, by a manager engine integrated into the data replication tool, a separate replication process for each data instance imported through the API, wherein the manager engine is configured to support multiple data instances on a single virtual machine instance; if a difference between a data description language (DDL) of the replicated data instances and a DDL of a target database is detected, automatically change the DDL of the replicated data instances to the DDL of the target database; and export the replicated data instances to the target database.
  • the computer program product comprises a computer-readable storage medium having program instructions embodied thereon to perform the steps of: receiving, through an application programming interface (API) integrated into a data replication tool, a request from a system of record (SoR) application for a replication setup and initialization; importing, through the API, a number of data instances from a source database associated with the SoR; initiating, by a manager engine integrated into the data replication tool, a separate replication process for each data instance imported through the API, wherein the manager engine is configured to support multiple data instances on a single virtual machine instance; if a difference between a data description language (DDL) of the replicated data instances and a DDL of a target database is detected, automatically changing the DDL of the replicated data instances to the DDL of the target database; and exporting the replicated data instances to the target database.
  • FIG. 1 is a pictorial representation of a network of data processing systems in which illustrative embodiments may be implemented;
  • FIG. 2 is an illustration of a block diagram of a data replication system in accordance with an illustrative embodiment;
  • FIG. 3 depicts the data replication tool responding to requests from an SoR application in accordance with an illustrative embodiment;
  • FIG. 4 depicts the operation of a data stream through the data replication tool in accordance with an illustrative embodiment;
  • FIG. 5 depicts an orchestrator in accordance with an illustrative embodiment;
  • FIG. 6 depicts a user interface in accordance with an illustrative embodiment;
  • FIG. 7 depicts a flowchart for a process of data replication in accordance with an illustrative embodiment; and
  • FIG. 8 is an illustration of a block diagram of a data processing system in accordance with an illustrative embodiment.
  • The illustrative embodiments recognize and take into account one or more different considerations. For example, the illustrative embodiments recognize and take into account that moving data across heterogeneous systems such as databases typically requires specialized tools for the replication, filtering, and transformation of the data.
  • the illustrative embodiments also recognize and take into account that these specialized data replication tools are often complex, expensive, and require continual maintenance work.
  • the illustrative embodiments provide a data replication tool that can replicate data in real time and replicate DDL changes.
  • The replication tool is manageable through an integrated application programming interface (API) and user interface (UI).
  • Data replication, onboarding, and orchestration processes can use the API to push notifications to a messaging system to provide information, warnings, and alerts in real time regarding events requiring attention.
  • Network data processing system 100 is a network of computers in which the illustrative embodiments may be implemented.
  • Network data processing system 100 contains network 102 , which is the medium used to provide communications links between various devices and computers connected together within network data processing system 100 .
  • Network 102 might include connections, such as wire, wireless communication links, or fiber optic cables.
  • server computer 104 and server computer 106 connect to network 102 along with storage unit 108 .
  • client devices 110 connect to network 102 .
  • server computer 104 provides information, such as boot files, operating system images, and applications to client devices 110 .
  • Client devices 110 can be, for example, computers, workstations, or network computers.
  • client devices 110 include client computers 112 , 114 , and 116 .
  • Client devices 110 can also include other types of client devices such as mobile phone 118 , tablet computer 120 , and smart glasses 122 .
  • Server computer 104 , server computer 106 , storage unit 108 , and client devices 110 are network devices that connect to network 102 , which is the communications medium for these network devices.
  • client devices 110 may form an Internet of things (IoT) in which these physical devices can connect to network 102 and exchange information with each other over network 102 .
  • Client devices 110 are clients to server computer 104 in this example.
  • Network data processing system 100 may include additional server computers, client computers, and other devices not shown.
  • Client devices 110 connect to network 102 utilizing at least one of wired, optical fiber, or wireless connections.
  • Program code located in network data processing system 100 can be stored on a computer-recordable storage medium and downloaded to a data processing system or other device for use.
  • the program code can be stored on a computer-recordable storage medium on server computer 104 and downloaded to client devices 110 over network 102 for use on client devices 110 .
  • network data processing system 100 is the Internet with network 102 representing a worldwide collection of networks and gateways that use the Transmission Control Protocol/Internet Protocol (TCP/IP) suite of protocols to communicate with one another.
  • network data processing system 100 also may be implemented using a number of different types of networks.
  • network 102 can be comprised of at least one of the Internet, an intranet, a local area network (LAN), a metropolitan area network (MAN), or a wide area network (WAN).
  • FIG. 1 is intended as an example, and not as an architectural limitation for the different illustrative embodiments.
  • the hardware may take the form of a circuit system, an integrated circuit, an application-specific integrated circuit (ASIC), a programmable logic device, or some other suitable type of hardware configured to perform a number of operations.
  • the device may be configured to perform the number of operations.
  • the device may be reconfigured at a later time or may be permanently configured to perform the number of operations.
  • Programmable logic devices include, for example, a programmable logic array, programmable array logic, a field programmable logic array, a field programmable gate array, and other suitable hardware devices.
  • the processes may be implemented in organic components integrated with inorganic components and may be comprised entirely of organic components, excluding a human being. For example, the processes may be implemented as circuits in organic semiconductors.
  • With reference to FIG. 2 , a block diagram of a data replication system is depicted in accordance with an illustrative embodiment.
  • Data replication system 200 might be implemented in network data processing system 100 in FIG. 1 .
  • Data replication system 200 comprises a data replication tool 202 that replicates data between heterogeneous source 226 and target 234 databases.
  • the databases might comprise, e.g., a human capital management (HCM) database 226 and datacloud 234 , but data replication system 200 can be used with other types of databases and messaging systems.
  • Data replication tool 202 might be integrated with a live stream chat system (e.g., WebEx) and used to push live information about a user's status and database status.
  • Manager engine 204 within data replication tool 202 is able to support multiple data instances 228 , 230 , 232 on a single virtual machine (VM) server or container 206 .
  • a VM is a virtual machine that emulates a computer system in software to provide the functionality of a physical computer. VMs can run Java programs as well as programs written in other languages that are compiled to Java bytecode.
  • the user can define the scope of instances managed by manager engine 204 on VM instance 206 .
  • The scope might comprise all data instances within a specified category (e.g., Production), or it might identify X specific data instances.
  • Control of data replication occurs through an application programming interface (API) 212 that is integrated with data replication tool 202 .
  • API 212 might run on container 210 .
  • API 212 receives requests for replication setup and initialization from a system of record (SoR) 222 and imports data instances 228 , 230 , 232 from the source database 226 associated with SoR 222 .
  • A system administrator 224 is able to access API 212 through user interface (UI) 216 , which is also integrated into data replication tool 202 and might run on its own container 214 .
  • The system administrator 224 is also able to monitor data analytics 218 (e.g., Splunk), which receives replication logs from manager engine 204 in real time during data replication operations. Replication logs might also be provided to data analytics 218 by UI 216 .
  • Non-relational (i.e. non-SQL) database 220 stores the last time a data replication was successful for a specific table.
  • System administrator 224 can use UI 216 to instruct the data replication tool 202 to start a new replication from this previous time/date by changing the date and clicking update.
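The last-success timestamp described above can be illustrated with a minimal sketch. It assumes an in-memory dictionary in place of the non-relational database and a hypothetical `modified_at` field on each row; the class and method names are illustrative only and do not come from the disclosure.

```python
from datetime import datetime, timezone


class WatermarkStore:
    """In-memory stand-in (assumption) for the store of last-success times."""

    def __init__(self):
        # table name -> time of the last successful replication
        self._last_success = {}

    def record_success(self, table, when):
        self._last_success[table] = when

    def override(self, table, when):
        # Mirrors an administrator changing the date in the UI and clicking update.
        self._last_success[table] = when

    def rows_to_replicate(self, table, rows):
        """Return rows modified after the stored time (all rows if none is stored)."""
        since = self._last_success.get(table)
        if since is None:
            return list(rows)
        return [row for row in rows if row["modified_at"] > since]


store = WatermarkStore()
rows = [
    {"id": 1, "modified_at": datetime(2020, 11, 1, tzinfo=timezone.utc)},
    {"id": 2, "modified_at": datetime(2020, 11, 10, tzinfo=timezone.utc)},
]
store.record_success("employees", datetime(2020, 11, 5, tzinfo=timezone.utc))
pending = store.rows_to_replicate("employees", rows)
```

Moving the stored time backward with `override` causes the next replication to pick up older rows again, which is one plausible reading of restarting replication from a previous date.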
  • Data replication tool 202 might also comprise an internal orchestrator 208 that plans and executes a specified list of actions by manager engine 204 under specified conditions (e.g., at a specified time) and that can be managed from UI 216 .
  • manager engine 204 might be one of a number of manager engines in a load balancing cluster in which a number of tasks are distributed across a set of engines to avoid situations in which some engines are overloaded while leaving other engines idle.
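The load balancing idea above can be sketched as a greedy least-loaded assignment. The disclosure does not specify a balancing algorithm, so the heuristic, the task costs, and the engine names below are all assumptions made for illustration.

```python
import heapq


def distribute(tasks, engine_names):
    """Assign each (task, cost) pair to the engine with the smallest current load."""
    heap = [(0, name) for name in engine_names]
    heapq.heapify(heap)
    assignment = {name: [] for name in engine_names}
    # Place the most expensive tasks first so that loads even out (greedy heuristic).
    for task, cost in sorted(tasks, key=lambda item: -item[1]):
        load, name = heapq.heappop(heap)
        assignment[name].append(task)
        heapq.heappush(heap, (load + cost, name))
    return assignment


# Hypothetical tasks with made-up relative costs.
tasks = [("instance-228", 5), ("instance-230", 3), ("instance-232", 2), ("reload", 4)]
plan = distribute(tasks, ["engine-a", "engine-b"])
```

With these sample costs both engines end up with a total load of 7, so neither is overloaded while the other sits idle.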
  • Data replication system 200 can be implemented in software, hardware, firmware or a combination thereof.
  • the operations performed by data replication system 200 can be implemented in program code configured to run on hardware, such as a processor unit.
  • When firmware is used, the operations performed by data replication system 200 can be implemented in program code and data and stored in persistent memory to run on a processor unit.
  • the hardware may include circuits that operate to perform the operations in data replication system 200 .
  • the hardware may take a form selected from at least one of a circuit system, an integrated circuit, an application specific integrated circuit (ASIC), a programmable logic device, or some other suitable type of hardware configured to perform a number of operations.
  • the device can be configured to perform the number of operations.
  • the device can be reconfigured at a later time or can be permanently configured to perform the number of operations.
  • Programmable logic devices include, for example, a programmable logic array, a programmable array logic, a field programmable logic array, a field programmable gate array, and other suitable hardware devices.
  • the processes can be implemented in organic components integrated with inorganic components and can be comprised entirely of organic components excluding a human being.
  • the processes can be implemented as circuits in organic semiconductors.
  • A processor is comprised of hardware circuits, such as those on an integrated circuit, that respond to and process instructions and program code that operate a computer.
  • The number of processors is one or more processors, which can be on the same computer or on different computers. In other words, the process can be distributed between processors on the same or different computers in a computer system.
  • The number of processors can be of the same type or different types of processors. For example, a number of processors can be selected from at least one of a single core processor, a dual-core processor, a multi-processor core, a general-purpose central processing unit (CPU), a graphics processing unit (GPU), a digital signal processor (DSP), or some other type of processor.
  • A computer system is a physical hardware system and includes one or more data processing systems.
  • When more than one data processing system is present in the computer system, those data processing systems are in communication with each other using a communications medium.
  • the communications medium can be a network.
  • the data processing systems can be selected from at least one of a computer, a server computer, a tablet computer, or some other suitable data processing system.
  • FIG. 3 depicts the data replication tool responding to requests from an SoR application in accordance with an illustrative embodiment.
  • Architecture 300 might be an example of a more detailed view of data replication tool 202 in FIG. 2 .
  • the database owner can use SoR application 330 to make several requests to API 314 . These requests might include new onboarding of data, requests to start or stop data replication, and requests for status of data replication in real-time.
  • API 314 might comprise a representational state transfer (REST) software architecture 316 that can be constructed within a Spring framework 318 .
  • API 314 enables any remote component to start actions on the replication side. Examples of these actions might include client onboarding (the setup process that builds the replication, e.g., schema creation, initial load, CDC enablement, etc.), data replication administrative tasks (start/stop, cancel, setup, etc.), and providing real-time onboarding and replication status.
  • API 314 provides control actions to a manager 304 within the manager engine 302 .
  • Manager 304 controls the actions of manager engine 302 including setup 306 , replication 308 , and other operations 310 .
  • Replicated data from replication 308 can be forwarded to target database 334 .
  • manager engine 302 might comprise actions that can be performed automatically by manager 304 , manual actions requested by system administrator 326 or SoR 330 , and time-scheduled actions to be executed at specific times of day, which can be monitored by orchestrator 336 .
  • Examples of actions performed automatically by the manager include managing the cluster to redistribute load between cluster nodes, analyzing whether any replications are up and running, controlling the source performance consumed by the replication process (e.g., statistics, alerts, and reports if the source cannot handle the connections in time), and monitoring the health of the system, such as RAM, load averages, disk space, and CPU function.
  • Examples of manual actions include client setup, client deletion, full data reloads, partial data reloads on selected tables, and optimized reloads only on tables used by the clients.
  • Some actions might be executed automatically by the manager 304 at specific times according to authorized time slots, but administrator 326 or SoR 330 can manually send a request for an action to be executed immediately. Examples of this type of action might include starting/stopping replication, controlling and fixing change data capture (CDC) problems, checking data compliance on tables being replicated and correcting any mismatches, analyzing and updating SQL server connection strings, and custom actions by the SoR.
  • API 314 also receives persist records from manager 304 and persists them on a non-relational database (e.g., MongoDB) 328 .
  • Data analytics forwarder 322 forwards replication logs from manager engine 302 to data analytics (e.g., Splunk) 324 in real time.
  • Data analytics 324 can be accessed by system administrator 326 for monitoring, alerts, and reporting.
  • System administrator 326 also has administrative access to API 314 through integrated UI 320 .
  • UI 320 might also be used to provide replication logs to data analytics 324 .
  • UI 320 can be used to plan and create scenarios that comprise lists of tasks to be performed under specified conditions.
  • Orchestrator 336 in manager engine 302 regularly checks if any of the planned scenarios must be run. If manager engine 302 executes a scenario, it can inform the system administrator 326 in real-time over email or messaging system 338 through API 314 . Manager engine 302 can send notifications to system messaging 338 to provide system administrator 326 information, warnings, and alerts in real time regarding events requiring attention.
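The request flow described above, in which an API passes control actions to a manager, can be sketched as a plain dispatcher. This is not the REST/Spring implementation named in the disclosure; the `Manager` class, the request shape, and the status strings are hypothetical and chosen only to show the routing idea.

```python
class Manager:
    """Hypothetical stand-in for the manager; tracks per-database state."""

    def __init__(self):
        self.status = {}

    def _require_onboarded(self, database):
        if database not in self.status:
            raise ValueError(f"{database} has not been onboarded")

    def onboard(self, database):
        self.status[database] = "onboarded"
        return self.status[database]

    def start(self, database):
        self._require_onboarded(database)
        self.status[database] = "replicating"
        return self.status[database]

    def stop(self, database):
        self._require_onboarded(database)
        self.status[database] = "stopped"
        return self.status[database]

    def get_status(self, database):
        return self.status.get(database, "unknown")


def handle_request(manager, request):
    """Route an API request (a dict, by assumption) to the matching manager action."""
    routes = {
        "onboard": manager.onboard,
        "start": manager.start,
        "stop": manager.stop,
        "status": manager.get_status,
    }
    handler = routes.get(request.get("action"))
    database = request.get("database")
    if handler is None or database is None:
        return {"ok": False, "error": "unsupported or incomplete request"}
    try:
        return {"ok": True, "status": handler(database)}
    except ValueError as exc:
        return {"ok": False, "error": str(exc)}


manager = Manager()
requests = [
    {"action": "start", "database": "hcm"},    # rejected: not onboarded yet
    {"action": "onboard", "database": "hcm"},
    {"action": "start", "database": "hcm"},
    {"action": "status", "database": "hcm"},
]
responses = [handle_request(manager, request) for request in requests]
```

Keeping the routing table separate from the manager means new actions can be exposed without changing how requests are validated.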
  • FIG. 4 depicts the operation of a data stream through the data replication tool in accordance with an illustrative embodiment.
  • Data stream operation 400 can be implemented in architecture 300 shown in FIG. 3 .
  • application instance 1 430 and application instance 2 432 send JavaScript object notation (JSON) SQL files to API 314 .
  • Stream import 404 in manager engine 302 receives the JSON data through the API 314 .
  • Stream import 404 parses the schema and data description language (DDL) of the JSON data, replaces it if necessary, and stores the data in temporary folder 408 .
  • Stream export 406 then pulls files to be loaded from ready folder 410 and pushes the data to target database 334 . Stream export 406 then moves files to processed folder 412 after they have been exported to target database 334 .
  • API 314 persists the database configuration information on non-relational database 328 .
  • The database configuration might comprise, e.g., source/target information to replicate, status of the databases, database owner, the last time an action was performed on the database and by whom, and the amount of data at the source and target, which is used to detect mismatches.
  • Manager engine 302 forwards logs to data analytics 324 that can be monitored by system administrator 326 and acted upon through UI 320 .
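The folder-based flow of stream import and stream export can be sketched as follows. The disclosure does not state how files move from the temporary folder to the ready folder, so the simple rename used here is an assumption, as are the function names, the file name, and the use of a Python list as the target database.

```python
import json
import tempfile
from pathlib import Path


def stream_import(payload, temp_dir, ready_dir, name):
    """Write incoming JSON to the temporary folder, then promote it to ready."""
    temp_file = temp_dir / name
    temp_file.write_text(json.dumps(payload))
    ready_file = ready_dir / name
    temp_file.rename(ready_file)  # assumption: promotion is a plain move
    return ready_file


def stream_export(ready_dir, processed_dir, target):
    """Load each ready file into the target, then move it to processed."""
    exported = []
    for path in sorted(ready_dir.glob("*.json")):
        target.extend(json.loads(path.read_text()))
        path.rename(processed_dir / path.name)
        exported.append(path.name)
    return exported


with tempfile.TemporaryDirectory() as root:
    base = Path(root)
    temp_dir, ready_dir, processed_dir = (
        base / folder for folder in ("temp", "ready", "processed")
    )
    for folder in (temp_dir, ready_dir, processed_dir):
        folder.mkdir()
    target_rows = []  # stand-in for the target database
    stream_import([{"id": 1}], temp_dir, ready_dir, "batch1.json")
    exported = stream_export(ready_dir, processed_dir, target_rows)
    remaining = list(ready_dir.iterdir())
    processed = [path.name for path in processed_dir.iterdir()]
```

Writing to a temporary folder first means the export step never sees a partially written file, which is a common reason for this kind of staging layout.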
  • FIG. 5 depicts an orchestrator in accordance with an illustrative embodiment.
  • Orchestrator 500 might be an example of orchestrator 208 in FIG. 2 and orchestrator 336 in FIG. 3 and allows planning the actions of the data replication tool.
  • a set of tasks to be executed according to criteria such as time and validation can be organized as a scenario 502 .
  • Orchestrator 500 can manage communication of planned actions, schedule actions based on date, and validate and control whether a particular scenario 502 can continue actions.
  • Orchestrator 500 allows the system administrator to manage scenarios through the integrated UI (i.e. UI 320 ). Orchestrator 500 can also resume a scenario after a server failure.
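A scenario with scheduled, validated tasks and a resumable checkpoint can be sketched as below. The `Scenario` class, its fields, and the validation callback are hypothetical; the disclosure states only that scenarios are scheduled, validated, and resumable after a server failure, not how that is implemented.

```python
from dataclasses import dataclass, field
from datetime import datetime


@dataclass
class Scenario:
    name: str
    run_at: datetime
    tasks: list                 # zero-argument callables, run in order
    next_task: int = 0          # checkpoint: index of the next task to run
    log: list = field(default_factory=list)

    def due(self, now):
        """A scenario is due once its time has passed and tasks remain."""
        return now >= self.run_at and self.next_task < len(self.tasks)

    def run(self, validate=lambda result: True):
        """Run remaining tasks; stop early if validation rejects a result."""
        while self.next_task < len(self.tasks):
            result = self.tasks[self.next_task]()
            self.log.append(result)
            self.next_task += 1  # persisting this index would allow a resume
            if not validate(result):
                break
        return self.log


def check(scenarios, now):
    """Return the names of scenarios that should run now."""
    return [scenario.name for scenario in scenarios if scenario.due(now)]


scenario = Scenario(
    name="nightly-reload",
    run_at=datetime(2020, 11, 16, 2, 0),
    tasks=[lambda: "stop", lambda: "reload", lambda: "start"],
)
before = check([scenario], datetime(2020, 11, 16, 1, 0))
after = check([scenario], datetime(2020, 11, 16, 3, 0))
# First run halts after "reload" because validation rejects that result.
first_run = list(scenario.run(validate=lambda result: result != "reload"))
# A later run resumes from the checkpoint instead of starting over.
second_run = list(scenario.run())
```

In a real deployment the checkpoint would have to be stored outside the process for a resume after server failure to work; here it lives in memory only.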
  • FIG. 6 depicts a user interface in accordance with an illustrative embodiment.
  • UI 600 might be an example of UI 216 in FIG. 2 and UI 320 in FIGS. 3 and 4 .
  • UI 600 comprises menu 602 that allows the user (i.e. system administrator) to select specific information and actions.
  • UI 600 is displaying a list of databases 604 being managed by the data replication tool. This list includes information about the databases 604 including schema, environment, replication status, and onboarding status. It also includes control buttons 606 , 608 , 610 that allow the user to remove replication, start/stop onboarding, and delete the onboarded databases, respectively.
  • UI 600 directly interacts with the API integrated into the data replication tool and provides an administrative console to perform onboarding and replication actions (start/stop, cancel, setup, etc.), follow live database progressions, and add or delete DDLs.
  • FIG. 7 depicts a flowchart for a process of data replication in accordance with an illustrative embodiment.
  • the process in FIG. 7 can be implemented in hardware, software, or both.
  • The process can take the form of program code that is run by one or more processor units located in one or more hardware devices in one or more computer systems.
  • Process 700 might be implemented in data replication system 200 shown in FIG. 2 .
  • Process 700 begins by receiving, through an API integrated into a data replication tool, a request from a SoR application for a replication setup and initialization (step 702 ).
  • the data replication tool imports a number of data instances through the API from a source database associated with the SoR (step 704 ).
  • the replicated data instances might comprise JSON files.
  • A manager engine integrated into the data replication tool initiates a separate replication process for each data instance imported through the API (step 706 ).
  • the manager engine is configured to support multiple data instances on a single virtual machine instance.
  • the manager engine might also detect if a table in the replicated data instances requires a load.
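Step 706 can be illustrated with a sketch in which one replication job is started per data instance on a single host. Worker threads are used here purely for brevity; the disclosure speaks of separate replication processes and does not say how they are created. The `replicate` function and the instance layout are assumptions.

```python
from concurrent.futures import ThreadPoolExecutor


def replicate(instance):
    """Placeholder replication job: copy the rows of one data instance."""
    return list(instance["rows"])


def run_manager(instances, max_workers=4):
    """Start one replication job per data instance and collect the results."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        futures = {
            instance["name"]: pool.submit(replicate, instance)
            for instance in instances
        }
        return {name: future.result() for name, future in futures.items()}


# Hypothetical data instances with made-up rows.
instances = [
    {"name": "instance-228", "rows": [{"id": 1}]},
    {"name": "instance-230", "rows": [{"id": 2}, {"id": 3}]},
    {"name": "instance-232", "rows": []},
]
replicated = run_manager(instances)
```

Because each instance has its own job, a slow or failing instance does not block the others from completing.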
  • the data replication tool determines if there is a difference between a data description language (DDL) of the replicated data instances and a DDL of a target database (step 708 ). If a difference is detected, the data replication tool automatically changes the DDL of the replicated data instances to the DDL of the target database (step 710 ).
  • The data replication tool might also automatically translate a table in the replicated data instance to a new structure.
  • A translation model might be stored as SQL in a database. The translation model specifies exactly how the target should receive the information.
  • The model might be built by the SoR or target owner, or it can be directly created by the data replication tool. Alternatively, the data replication tool can adapt the replication on the fly without a translation model.
  • Process 700 then ends.
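Steps 708 and 710 can be sketched by treating a DDL as a mapping from column name to type. This representation, the cast table, and the sample columns are all assumptions made for illustration; the disclosure does not describe how the DDL comparison or conversion is performed.

```python
def ddl_diff(source_ddl, target_ddl):
    """Return {column: (source_type, target_type)} for columns that differ."""
    columns = set(source_ddl) | set(target_ddl)
    return {
        column: (source_ddl.get(column), target_ddl.get(column))
        for column in sorted(columns)
        if source_ddl.get(column) != target_ddl.get(column)
    }


def conform_rows(rows, target_ddl, casts):
    """Reshape each row to the target's columns and types."""
    conformed = []
    for row in rows:
        conformed.append({
            column: casts[column_type](row[column])
            if row.get(column) is not None else None
            for column, column_type in target_ddl.items()
        })
    return conformed


# Hypothetical DDLs and a minimal type-cast table.
source_ddl = {"id": "INT", "salary": "VARCHAR"}
target_ddl = {"id": "INT", "salary": "DECIMAL", "region": "VARCHAR"}
casts = {"INT": int, "DECIMAL": float, "VARCHAR": str}
rows = [{"id": 1, "salary": "5200.50"}]

diff = ddl_diff(source_ddl, target_ddl)            # step 708: detect differences
if diff:
    rows = conform_rows(rows, target_ddl, casts)   # step 710: adopt the target DDL
```

Columns that exist only in the target are filled with `None` in this sketch; a real tool would need a policy for defaults and for columns that exist only in the source.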
  • Data processing system 800 may be used to implement one or more computers shown in FIG. 1 (e.g., client devices 110 and server computers 104 , 106 ) and data replication system 200 in FIG. 2 .
  • data processing system 800 includes communications framework 802 , which provides communications between processor unit 804 , memory 806 , persistent storage 808 , communications unit 810 , input/output unit 812 , and display 814 .
  • communications framework 802 may take the form of a bus system.
  • Processor unit 804 serves to execute instructions for software that may be loaded into memory 806 .
  • Processor unit 804 may be a number of processors, a multi-processor core, or some other type of processor, depending on the particular implementation.
  • processor unit 804 comprises one or more conventional general-purpose central processing units (CPUs).
  • processor unit 804 comprises one or more graphics processing units (GPUs).
  • Memory 806 and persistent storage 808 are examples of storage devices 816 .
  • a storage device is any piece of hardware that is capable of storing information, such as, for example, without limitation, at least one of data, program code in functional form, or other suitable information either on a temporary basis, a permanent basis, or both on a temporary basis and a permanent basis.
  • Storage devices 816 may also be referred to as computer-readable storage devices in these illustrative examples.
  • Memory 806 , in these examples, may be, for example, a random access memory or any other suitable volatile or non-volatile storage device.
  • Persistent storage 808 may take various forms, depending on the particular implementation.
  • persistent storage 808 may contain one or more components or devices.
  • persistent storage 808 may be a hard drive, a flash memory, a rewritable optical disk, a rewritable magnetic tape, or some combination of the above.
  • the media used by persistent storage 808 also may be removable.
  • a removable hard drive may be used for persistent storage 808 .
  • Communications unit 810 in these illustrative examples, provides for communications with other data processing systems or devices. In these illustrative examples, communications unit 810 is a network interface card.
  • Input/output unit 812 allows for input and output of data with other devices that may be connected to data processing system 800 .
  • input/output unit 812 may provide a connection for user input through at least one of a keyboard, a mouse, or some other suitable input device. Further, input/output unit 812 may send output to a printer.
  • Display 814 provides a mechanism to display information to a user.
  • Instructions for at least one of the operating system, applications, or programs may be located in storage devices 816 , which are in communication with processor unit 804 through communications framework 802 .
  • the processes of the different embodiments may be performed by processor unit 804 using computer-implemented instructions, which may be located in a memory, such as memory 806 .
  • program code, computer-usable program code, or computer-readable program code that may be read and executed by a processor in processor unit 804 .
  • the program code in the different embodiments may be embodied on different physical or computer-readable storage media, such as memory 806 or persistent storage 808 .
  • Program code 818 is located in a functional form on computer-readable media 820 that is selectively removable and may be loaded onto or transferred to data processing system 800 for execution by processor unit 804 .
  • Program code 818 and computer-readable media 820 form computer program product 822 in these illustrative examples.
  • computer-readable media 820 may be computer-readable storage media 824 or computer-readable signal media 826 .
  • computer-readable storage media 824 is a physical or tangible storage device used to store program code 818 rather than a medium that propagates or transmits program code 818 .
  • Computer-readable storage media 824 , as used herein, is not to be construed as being transitory signals per se, such as radio waves or other freely propagating electromagnetic waves, electromagnetic waves propagating through a waveguide or other transmission media (e.g., light pulses passing through a fiber-optic cable), or electrical signals transmitted through a wire.
  • program code 818 may be transferred to data processing system 800 using computer-readable signal media 826 .
  • Computer-readable signal media 826 may be, for example, a propagated data signal containing program code 818 .
  • Computer-readable signal media 826 may be at least one of an electromagnetic signal, an optical signal, or any other suitable type of signal. These signals may be transmitted over at least one of communications links, such as wireless communications links, optical fiber cable, coaxial cable, a wire, or any other suitable type of communications link.
  • the different components illustrated for data processing system 800 are not meant to provide architectural limitations to the manner in which different embodiments may be implemented.
  • the different illustrative embodiments may be implemented in a data processing system including components in addition to or in place of those illustrated for data processing system 800 .
  • Other components shown in FIG. 8 can be varied from the illustrative examples shown.
  • the different embodiments may be implemented using any hardware device or system capable of running program code 818 .
  • the phrase “a number” means one or more.
  • the phrase “at least one of”, when used with a list of items, means different combinations of one or more of the listed items may be used, and only one of each item in the list may be needed. In other words, “at least one of” means any combination of items and number of items may be used from the list, but not all of the items in the list are required.
  • the item may be a particular object, a thing, or a category.
  • “at least one of item A, item B, or item C” may include item A, item A and item B, or item C. This example also may include item A, item B, and item C or item B and item C. Of course, any combinations of these items may be present. In some illustrative examples, “at least one of” may be, for example, without limitation, two of item A; one of item B; and ten of item C; four of item B and seven of item C; or other suitable combinations.
  • each block in the flowcharts or block diagrams may represent at least one of a module, a segment, a function, or a portion of an operation or step.
  • one or more of the blocks may be implemented as program code.
  • the function or functions noted in the blocks may occur out of the order noted in the figures.
  • two blocks shown in succession may be performed substantially concurrently, or the blocks may sometimes be performed in the reverse order, depending upon the functionality involved.
  • other blocks may be added in addition to the illustrated blocks in a flowchart or block diagram.
  • a component may be configured to perform the action or operation described.
  • the component may have a configuration or design for a structure that provides the component an ability to perform the action or operation that is described in the illustrative examples as being performed by the component.
  • Many modifications and variations will be apparent to those of ordinary skill in the art.
  • different illustrative embodiments may provide different features as compared to other desirable embodiments. The embodiment or embodiments selected are chosen and described in order to best explain the principles of the embodiments, the practical application, and to enable others of ordinary skill in the art to understand the disclosure for various embodiments with various modifications as are suited to the particular use contemplated.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Software Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Computing Systems (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

Data replication between databases is provided. The method comprises receiving, through an API integrated into a data replication tool, a request from a system of record (SoR) application for a replication setup and initialization. A number of data instances are imported through the API from a source database associated with the SoR. A manager engine integrated into the data replication tool initiates a separate replication process for each data instance. The manager engine is configured to support multiple data instances on a single virtual machine instance. If a difference between a data description language (DDL) of the replicated data instances and a DDL of a target database is detected, the DDL of the replicated data instances is automatically changed to the DDL of the target database. The replicated data instances are then exported to the target database.

Description

  • BACKGROUND INFORMATION
  • 1. Field
  • The present disclosure relates generally to an improved computer system and, in particular, to replicating data for communication between a database and another database or messaging system.
  • 2. Background
  • Moving data across heterogeneous systems typically requires replication, filtering, and transformation of the data. The heterogeneous systems might comprise different databases. Specialized data replication tools can be employed to facilitate this data replication between systems. However, these replication tools are often complex, expensive, and require continual maintenance work.
  • SUMMARY
  • An illustrative embodiment provides a computer-implemented method for data replication between databases. The method comprises receiving, through an application programming interface (API) integrated into a data replication tool, a request from a system of record (SoR) application for a replication setup and initialization. A number of data instances are imported through the API from a source database associated with the SoR. A manager engine integrated into the data replication tool initiates a separate replication process for each data instance. The manager engine is configured to support multiple data instances on a single virtual machine instance. If a difference between a data description language (DDL) of the replicated data instances and a DDL of a target database is detected, the DDL of the replicated data instances is automatically changed to the DDL of the target database. The replicated data instances are then exported to the target database.
  • Another illustrative embodiment provides a system for data replication between databases. The system comprises a storage device configured to store program instructions and one or more processors operably connected to the storage device and configured to execute the program instructions to cause the system to: receive, through an application programming interface (API) integrated into a data replication tool, a request from a system of record (SoR) application for a replication setup and initialization; import, through the API, a number of data instances from a source database associated with the SoR; initiate, by a manager engine integrated into the data replication tool, a separate replication process for each data instance imported through the API, wherein the manager engine is configured to support multiple data instances on a single virtual machine instance; if a difference between a data description language (DDL) of the replicated data instances and a DDL of a target database is detected, automatically change the DDL of the replicated data instances to the DDL of the target database; and export the replicated data instances to the target database.
  • Another illustrative embodiment provides a computer program product for data replication between databases. The computer program product comprises a computer-readable storage medium having program instructions embodied thereon to perform the steps of: receiving, through an application programming interface (API) integrated into a data replication tool, a request from a system of record (SoR) application for a replication setup and initialization; importing, through the API, a number of data instances from a source database associated with the SoR; initiating, by a manager engine integrated into the data replication tool, a separate replication process for each data instance imported through the API, wherein the manager engine is configured to support multiple data instances on a single virtual machine instance; if a difference between a data description language (DDL) of the replicated data instances and a DDL of a target database is detected, automatically changing the DDL of the replicated data instances to the DDL of the target database; and exporting the replicated data instances to the target database.
  • The features and functions can be achieved independently in various embodiments of the present disclosure or may be combined in yet other embodiments in which further details can be seen with reference to the following description and drawings.
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • The novel features believed characteristic of the illustrative embodiments are set forth in the appended claims. The illustrative embodiments, however, as well as a preferred mode of use, further objectives and features thereof, will best be understood by reference to the following detailed description of an illustrative embodiment of the present disclosure when read in conjunction with the accompanying drawings, wherein:
  • FIG. 1 is a pictorial representation of a network of data processing systems in which illustrative embodiments may be implemented;
  • FIG. 2 is an illustration of a block diagram of a data replication system in accordance with an illustrative embodiment;
  • FIG. 3 depicts the data replication tool responding to requests from an SoR application in accordance with an illustrative embodiment;
  • FIG. 4 depicts the operation of a data stream through the data replication tool in accordance with an illustrative embodiment;
  • FIG. 5 depicts an orchestrator in accordance with an illustrative embodiment;
  • FIG. 6 depicts a user interface in accordance with an illustrative embodiment;
  • FIG. 7 depicts a flowchart for a process of data replication in accordance with an illustrative embodiment; and
  • FIG. 8 is an illustration of a block diagram of a data processing system in accordance with an illustrative embodiment.
  • DETAILED DESCRIPTION
  • The illustrative embodiments recognize and take into account one or more different considerations. For example, the illustrative embodiments recognize and take into account that moving data across heterogeneous systems such as databases typically requires specialized tools for the replication, filtering, and transformation of the data.
  • The illustrative embodiments also recognize and take into account that these specialized data replication tools are often complex, expensive, and require continual maintenance work.
  • The illustrative embodiments provide a data replication tool that can replicate data in real time and replicate DDL changes. The replication tool is manageable through an integrated application programming interface (API) and user interface (UI). Data replication, onboarding, and orchestration processes can use the API to push notifications to a messaging system to provide information, warnings, and alerts in real time regarding events requiring attention.
  • With reference to FIG. 1, a pictorial representation of a network of data processing systems is depicted in which illustrative embodiments may be implemented. Network data processing system 100 is a network of computers in which the illustrative embodiments may be implemented. Network data processing system 100 contains network 102, which is the medium used to provide communications links between various devices and computers connected together within network data processing system 100. Network 102 might include connections, such as wire, wireless communication links, or fiber optic cables.
  • In the depicted example, server computer 104 and server computer 106 connect to network 102 along with storage unit 108. In addition, client devices 110 connect to network 102. In the depicted example, server computer 104 provides information, such as boot files, operating system images, and applications to client devices 110. Client devices 110 can be, for example, computers, workstations, or network computers. As depicted, client devices 110 include client computers 112, 114, and 116. Client devices 110 can also include other types of client devices such as mobile phone 118, tablet computer 120, and smart glasses 122.
  • In this illustrative example, server computer 104, server computer 106, storage unit 108, and client devices 110 are network devices that connect to network 102 in which network 102 is the communications media for these network devices. Some or all of client devices 110 may form an Internet of things (IoT) in which these physical devices can connect to network 102 and exchange information with each other over network 102.
  • Client devices 110 are clients to server computer 104 in this example. Network data processing system 100 may include additional server computers, client computers, and other devices not shown. Client devices 110 connect to network 102 utilizing at least one of wired, optical fiber, or wireless connections.
  • Program code located in network data processing system 100 can be stored on a computer-recordable storage medium and downloaded to a data processing system or other device for use. For example, the program code can be stored on a computer-recordable storage medium on server computer 104 and downloaded to client devices 110 over network 102 for use on client devices 110.
  • In the depicted example, network data processing system 100 is the Internet with network 102 representing a worldwide collection of networks and gateways that use the Transmission Control Protocol/Internet Protocol (TCP/IP) suite of protocols to communicate with one another. At the heart of the Internet is a backbone of high-speed data communication lines between major nodes or host computers consisting of thousands of commercial, governmental, educational, and other computer systems that route data and messages. Of course, network data processing system 100 also may be implemented using a number of different types of networks. For example, network 102 can be comprised of at least one of the Internet, an intranet, a local area network (LAN), a metropolitan area network (MAN), or a wide area network (WAN). FIG. 1 is intended as an example, and not as an architectural limitation for the different illustrative embodiments.
  • In the illustrative examples, the hardware may take the form of a circuit system, an integrated circuit, an application-specific integrated circuit (ASIC), a programmable logic device, or some other suitable type of hardware configured to perform a number of operations. With a programmable logic device, the device may be configured to perform the number of operations. The device may be reconfigured at a later time or may be permanently configured to perform the number of operations. Programmable logic devices include, for example, a programmable logic array, programmable array logic, a field programmable logic array, a field programmable gate array, and other suitable hardware devices. Additionally, the processes may be implemented in organic components integrated with inorganic components and may be comprised entirely of organic components, excluding a human being. For example, the processes may be implemented as circuits in organic semiconductors.
  • Turning to FIG. 2, a block diagram of a data replication system is depicted in accordance with an illustrative embodiment. Data replication system 200 might be implemented in network data processing system 100 in FIG. 1.
  • Data replication system 200 comprises a data replication tool 202 that replicates data between heterogeneous source 226 and target 234 databases. The databases might comprise, e.g., a human capital management (HCM) database 226 and datacloud 234, but data replication system 200 can be used with other types of databases and messaging systems. Data replication tool 202 might be integrated with a live stream chat system (e.g., WebEx) and used to push live information about a user's status and database status.
  • Manager engine 204 within data replication tool 202 is able to support multiple data instances 228, 230, 232 on a single virtual machine (VM) server or container 206. A VM is a virtual machine that emulates a computer system in software to provide the functionality of a physical computer. VMs can run Java programs as well as programs written in other languages that are compiled to Java bytecode.
  • The user can define the scope of instances managed by manager engine 204 on VM instance 206. For example, the scope might comprise all data instances within a specified category (e.g., Production) or it might define X specific data instances.
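  • The scope selection described above can be illustrated with a minimal sketch. The instance records, the field names `name` and `category`, and the function `instances_in_scope` are hypothetical and are used only to show selection by category or by an explicit set of instances.

```python
def instances_in_scope(instances, category=None, names=None):
    """Select instance names by category, by an explicit set of names, or both."""
    selected = []
    for inst in instances:
        if category is not None and inst["category"] != category:
            continue  # outside the requested category
        if names is not None and inst["name"] not in names:
            continue  # not one of the explicitly listed instances
        selected.append(inst["name"])
    return selected


# Hypothetical inventory of data instances.
instances = [
    {"name": "hcm_prod_01", "category": "Production"},
    {"name": "hcm_prod_02", "category": "Production"},
    {"name": "hcm_test_01", "category": "Test"},
]

production = instances_in_scope(instances, category="Production")
pinned = instances_in_scope(instances, names={"hcm_test_01"})
```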
  • Control of data replication occurs through an application programming interface (API) 212 that is integrated with data replication tool 202. API 212 might run on container 210. API 212 receives requests for replication setup and initialization from a system of record (SoR) 222 and imports data instances 228, 230, 232 from the source database 226 associated with SoR 222.
  • A system administrator 224 is able to access API 212 through user interface (UI) 216, which is also integrated into data replication tool 202 and might run on its own container 214. The system administrator 224 is also able to monitor data analytics 218 (e.g., Splunk), which receives replication logs from manager engine 204 in real time during data replication operations. Replication logs might also be provided to data analytics 218 by UI 216.
  • Non-relational (i.e. non-SQL) database 220 stores the last time a data replication was successful for a specific table. System administrator 224 can use UI 216 to instruct the data replication tool 202 to start a new replication from this previous time/date by changing the date and clicking update.
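  • The per-table checkpoint described above can be sketched as follows. This is an illustration under stated assumptions, not the tool's implementation: an in-memory dictionary stands in for non-relational database 220, each row is assumed to carry a `modified_at` timestamp, and the names `CheckpointStore`, `record_success`, `override`, and `rows_to_replicate` are hypothetical.

```python
from datetime import datetime, timezone


class CheckpointStore:
    """In-memory stand-in for the store of last successful replication times."""

    def __init__(self):
        self._last_success = {}  # table name -> datetime of last success

    def record_success(self, table, when):
        self._last_success[table] = when

    def override(self, table, when):
        # Mirrors the administrator changing the date in the UI and clicking update.
        self._last_success[table] = when

    def last_success(self, table):
        return self._last_success.get(table)


def rows_to_replicate(rows, store, table):
    """Return rows modified after the table's checkpoint (all rows if none exists)."""
    since = store.last_success(table)
    if since is None:
        return list(rows)
    return [row for row in rows if row["modified_at"] > since]


store = CheckpointStore()
rows = [
    {"id": 1, "modified_at": datetime(2020, 11, 1, tzinfo=timezone.utc)},
    {"id": 2, "modified_at": datetime(2020, 11, 10, tzinfo=timezone.utc)},
]
store.record_success("employee", datetime(2020, 11, 5, tzinfo=timezone.utc))
pending = rows_to_replicate(rows, store, "employee")

# Moving the checkpoint back makes the next run pick up both rows again.
store.override("employee", datetime(2020, 10, 1, tzinfo=timezone.utc))
replayed = rows_to_replicate(rows, store, "employee")
```

  • In this sketch, moving the checkpoint to an earlier date is what causes older rows to be replicated again, which corresponds to the administrator restarting replication from a previous time.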
  • Data replication tool 202 might also comprise an internal orchestrator 208 to plan and execute a specified list of actions by manager engine 204 under specified conditions (e.g., at a specified time). Orchestrator 208 can be managed from UI 216.
  • In an embodiment, manager engine 204 might be one of a number of manager engines in a load balancing cluster in which a number of tasks are distributed across a set of engines to avoid situations in which some engines are overloaded while leaving other engines idle.
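  • One way such a distribution could work is a greedy least-loaded assignment, sketched below. The disclosure does not specify a balancing algorithm, so the strategy, the per-task cost figures, and the names `distribute`, `engine_1`, and `engine_2` are assumptions made for illustration.

```python
import heapq


def distribute(tasks, engine_names):
    """Assign each task to the engine with the lowest accumulated cost."""
    # Heap entries are (accumulated cost, engine name); ties break by name.
    heap = [(0, name) for name in sorted(engine_names)]
    heapq.heapify(heap)
    assignment = {name: [] for name in engine_names}
    # Placing the most expensive tasks first tends to give a more even spread.
    for task, cost in sorted(tasks.items(), key=lambda kv: (-kv[1], kv[0])):
        load, name = heapq.heappop(heap)
        assignment[name].append(task)
        heapq.heappush(heap, (load + cost, name))
    return assignment


# Hypothetical replication tasks with relative cost estimates.
tasks = {"instance_a": 5, "instance_b": 3, "instance_c": 2, "instance_d": 2}
plan = distribute(tasks, ["engine_1", "engine_2"])
loads = {name: sum(tasks[t] for t in assigned) for name, assigned in plan.items()}
```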
  • Data replication system 200 can be implemented in software, hardware, firmware or a combination thereof. When software is used, the operations performed by data replication system 200 can be implemented in program code configured to run on hardware, such as a processor unit. When firmware is used, the operations performed by data replication system 200 can be implemented in program code and data and stored in persistent memory to run on a processor unit. When hardware is employed, the hardware may include circuits that operate to perform the operations in data replication system 200.
  • In the illustrative examples, the hardware may take a form selected from at least one of a circuit system, an integrated circuit, an application specific integrated circuit (ASIC), a programmable logic device, or some other suitable type of hardware configured to perform a number of operations. With a programmable logic device, the device can be configured to perform the number of operations. The device can be reconfigured at a later time or can be permanently configured to perform the number of operations. Programmable logic devices include, for example, a programmable logic array, a programmable array logic, a field programmable logic array, a field programmable gate array, and other suitable hardware devices. Additionally, the processes can be implemented in organic components integrated with inorganic components and can be comprised entirely of organic components excluding a human being. For example, the processes can be implemented as circuits in organic semiconductors.
  • As used herein, a processor is comprised of hardware circuits, such as those on an integrated circuit, that respond to and process instructions and program code that operate a computer. When a number of processors execute instructions for a process, the number of processors is one or more processors that can be on the same computer or on different computers. In other words, the process can be distributed between processors on the same or different computers in a computer system. Further, the number of processors can be of the same type or different types of processors. For example, a number of processors can be selected from at least one of a single core processor, a dual-core processor, a multi-processor core, a general-purpose central processing unit (CPU), a graphics processing unit (GPU), a digital signal processor (DSP), or some other type of processor.
  • These components can be located in a computer system, which is a physical hardware system and includes one or more data processing systems. When more than one data processing system is present in the computer system, those data processing systems are in communication with each other using a communications medium. The communications medium can be a network. The data processing systems can be selected from at least one of a computer, a server computer, a tablet computer, or some other suitable data processing system.
  • FIG. 3 depicts the data replication tool responding to requests from an SoR application in accordance with an illustrative embodiment. Architecture 300 might be an example of a more detailed view of data replication tool 202 in FIG. 2.
  • The database owner can use SoR application 330 to make several requests to API 314. These requests might include new onboarding of data, requests to start or stop data replication, and requests for status of data replication in real-time.
  • API 314 might comprise a representational state transfer (REST) software architecture 316 that can be constructed within a Spring framework 318. API 314 enables any remote component to start actions on the replication side. Examples of these actions might include client onboarding (setup process to build the replication, e.g., schema creation, Init load, CDC enablement, etc.), data replication administrative tasks (start/stop, cancel, setup, etc.), and providing onboarding and replication real-time status.
  • API 314 provides control actions to a manager 304 within the manager engine 302. Manager 304 controls the actions of manager engine 302 including setup 306, replication 308, and other operations 310. Replicated data from replication 308 can be forwarded to target database 334.
  • The other operations 310 performed by manager engine 302 might comprise actions that can be performed automatically by manager 304, manual actions requested by system administrator 326 or SoR 330, and time-scheduled actions to be executed at specific times of day, which can be monitored by orchestrator 336.
  • Examples of actions performed automatically by the manager include managing the cluster to redistribute load between cluster nodes, analyzing whether any replications are up and running, controlling source performance consumed by the replication process (e.g., statistics, alerts, and reports if the source cannot handle the connections in time), and controlling the health of the system such as RAM, load averages, disk space, and CPU function.
  • Examples of manual actions include client setup, client deletion, full data reloads, partial data reloads on selected tables, and optimized reloads only on tables used by the clients.
  • Some actions might be executed automatically by the manager 304 at specific times according to authorized time slots, but administrator 326 or SoR 330 can manually send a request for an action to be executed immediately. Examples of this type of action might include starting/stopping replication, controlling and fixing connected device configuration (CDC) problems, controlling data compliance on tables being replicated and correcting any mismatches, analyzing and updating SQL server connection strings, and custom actions by the SoR.
  • API 314 also receives persist records from manager 304 and persists them on a non-relational database (e.g., MongoDB) 328.
  • Data analytics forwarder 322 forwards replication logs from manager engine 302 to data analytics (e.g., Splunk) 324 in real-time. Data analytics 324 can be accessed by system administrator 326 for monitoring, alerts, and reporting. System administrator 326 also has administrative access to API 314 through integrated UI 320. UI 320 might also be used to provide replication logs to data analytics 324.
  • UI 320 can be used to plan and create scenarios that comprise lists of tasks to be performed under specified conditions. Orchestrator 336 in manager engine 302 regularly checks if any of the planned scenarios must be run. If manager engine 302 executes a scenario, it can inform the system administrator 326 in real-time over email or messaging system 338 through API 314. Manager engine 302 can send notifications to system messaging 338 to provide system administrator 326 information, warnings, and alerts in real time regarding events requiring attention.
  • FIG. 4 depicts the operation of a data stream through the data replication tool in accordance with an illustrative embodiment. Data stream operation 400 can be implemented in architecture 300 shown in FIG. 3.
  • In the present example, application instance 1 430 and application instance 2 432 send JavaScript object notation (JSON) SQL files to API 314.
  • Stream import 404 in manager engine 302 receives the JSON data through the API 314. Stream import 404 parses the schema and data description language (DDL) of the JSON data, replaces it if necessary, and stores the data in a temporary folder 408. When the data is ready for export to the target database 434, stream import 404 pushes the files to ready folder 410.
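  • The schema check performed during import can be sketched as a comparison of column definitions. This is a simplified illustration: representing a DDL as a mapping from column name to type is an assumption, and the names `diff_ddl` and `align_to_target` are hypothetical.

```python
def diff_ddl(replicated, target):
    """Compare two {column: type} maps and report how the replicated side differs."""
    shared = set(replicated) & set(target)
    return {
        "missing": sorted(set(target) - set(replicated)),
        "extra": sorted(set(replicated) - set(target)),
        "retyped": sorted(col for col in shared if replicated[col] != target[col]),
    }


def align_to_target(replicated, target):
    """Return the replicated column map changed to match the target, plus the diff."""
    changes = diff_ddl(replicated, target)
    if not any(changes.values()):
        return dict(replicated), changes  # already identical, nothing to change
    return dict(target), changes


# Hypothetical column definitions for one replicated table and its target.
replicated = {"id": "INT", "name": "VARCHAR(50)", "legacy_code": "CHAR(4)"}
target = {"id": "BIGINT", "name": "VARCHAR(50)", "hired_on": "DATE"}
aligned, changes = align_to_target(replicated, target)
```

  • Returning the diff alongside the aligned definition keeps a record of what was changed, which could be written to the replication logs.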
  • Stream export 406 then pulls files to be loaded from ready folder 410 and pushes the data to target database 334. Stream export 406 then moves files to processed folder 412 after they have been exported to target database 334.
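  • The hand-off between the temporary, ready, and processed folders can be sketched with ordinary file moves. The sketch below is an assumption-laden illustration: the folder names, the file name `batch_001.json`, the payload layout, and the functions `stage_import` and `export_ready` are hypothetical, and a Python list stands in for the target database.

```python
import json
import tempfile
from pathlib import Path


def stage_import(payload, name, temp_dir, ready_dir):
    """Write a payload to the temporary folder, then publish it to the ready folder."""
    staged = temp_dir / name
    staged.write_text(json.dumps(payload))
    # Moving only after the write completes keeps half-written files out of ready.
    staged.rename(ready_dir / name)


def export_ready(ready_dir, processed_dir, load):
    """Load every ready file into the target, then move it to the processed folder."""
    exported = []
    for path in sorted(ready_dir.glob("*.json")):
        load(json.loads(path.read_text()))
        path.rename(processed_dir / path.name)
        exported.append(path.name)
    return exported


root = Path(tempfile.mkdtemp())
temp_dir, ready_dir, processed_dir = (root / n for n in ("temp", "ready", "processed"))
for folder in (temp_dir, ready_dir, processed_dir):
    folder.mkdir()

target_rows = []  # stand-in for the target database
stage_import({"table": "employee", "rows": [[1, "Ada"]]}, "batch_001.json",
             temp_dir, ready_dir)
exported = export_ready(ready_dir, processed_dir, target_rows.append)
```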
  • API 314 persists the database configuration information on non-relational database 328. The database configuration might comprise, e.g., source/target information to replicate, status of the databases, database owner, last time an action has been done on the database and by whom, and number of data at source and target to detect mismatches. Manager engine 302 forwards logs to data analytics 324 that can be monitored by system administrator 326 and acted upon through UI 320.
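  • Mismatch detection from the persisted configuration can be sketched as a comparison of counts. The sketch assumes that "number of data" means a per-table row count; the record layout, the field names, and the function `find_mismatches` are hypothetical.

```python
def find_mismatches(source_counts, target_counts):
    """Return tables whose row counts differ between source and target."""
    mismatches = {}
    for table in sorted(set(source_counts) | set(target_counts)):
        src = source_counts.get(table, 0)
        tgt = target_counts.get(table, 0)
        if src != tgt:
            mismatches[table] = {"source": src, "target": tgt}
    return mismatches


# Hypothetical configuration record as it might be persisted for one database.
config = {
    "source": "hcm_db",
    "target": "datacloud",
    "owner": "payroll_team",
    "source_counts": {"employee": 1200, "payslip": 48000},
    "target_counts": {"employee": 1200, "payslip": 47950},
}
mismatches = find_mismatches(config["source_counts"], config["target_counts"])
```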
  • FIG. 5 depicts an orchestrator in accordance with an illustrative embodiment. Orchestrator 500 might be an example of orchestrator 208 in FIG. 2 and orchestrator 336 in FIG. 3 and allows planning the actions of the data replication tool. A set of tasks to be executed according to criteria such as time and validation can be organized as a scenario 502. Orchestrator 500 can manage communication of planned actions, schedule actions based on date, and validate and control whether a particular scenario 502 can continue actions.
  • Orchestrator 500 allows the system administrator to manage scenarios through the integrated UI (i.e. UI 320). Orchestrator 500 can also resume a scenario after a server failure.
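The scenario handling described above can be sketched in Python as a periodic check over planned scenarios. This is a minimal illustration under stated assumptions, not the orchestrator's actual design: the `Scenario` and `Orchestrator` classes, the `tick` method, the `next_task` progress marker, and the notification callback are all hypothetical names.

```python
from dataclasses import dataclass, field
from datetime import datetime
from typing import Callable, List


@dataclass
class Scenario:
    name: str
    run_at: datetime                      # scheduling criterion
    tasks: List[Callable[[], None]]       # ordered list of actions
    can_continue: Callable[[], bool] = lambda: True  # validation gate
    next_task: int = 0  # progress marker; if persisted, a restart can resume here

    def is_due(self, now: datetime) -> bool:
        return now >= self.run_at and self.next_task < len(self.tasks)


@dataclass
class Orchestrator:
    notify: Callable[[str], None]
    scenarios: List[Scenario] = field(default_factory=list)

    def tick(self, now: datetime) -> None:
        """One periodic check: run every due scenario from its saved position."""
        for scenario in self.scenarios:
            if not scenario.is_due(now):
                continue
            while scenario.next_task < len(scenario.tasks):
                if not scenario.can_continue():
                    self.notify(f"{scenario.name}: paused before task {scenario.next_task}")
                    break
                scenario.tasks[scenario.next_task]()
                scenario.next_task += 1
            else:
                # Reached only when the loop finished without a pause.
                self.notify(f"{scenario.name}: completed")
```

Because progress is tracked per task rather than per scenario, a scenario that was paused by validation, or interrupted by a failure, continues from the first task that has not yet run on the next check.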
  • FIG. 6 depicts a user interface in accordance with an illustrative embodiment. UI 600 might be an example of UI 212 in FIG. 2 and UI 320 in FIGS. 3 and 4.
  • UI 600 comprises menu 602 that allows the user (i.e., system administrator) to select specific information and actions. In the present example, UI 600 is displaying a list of databases 604 being managed by the data replication tool. This list includes information about the databases 604 including schema, environment, replication status, and onboarding status. It also includes control buttons 606, 608, 610 that allow the user to remove replication, start/stop onboarding, and delete the onboarded databases, respectively.
  • UI 600 directly interacts with the API integrated into the data replication tool and provides an administrative console to perform onboarding and replication actions (start/stop, cancel, setup, etc.), follow live database progressions, and add or delete DDLs.
  • FIG. 7 depicts a flowchart for a process of data replication in accordance with an illustrative embodiment. The process in FIG. 7 can be implemented in hardware, software, or both. When implemented in software, the process can take the form of program code that is run by one or more processor units located in one or more hardware devices in one or more computer systems. Process 700 might be implemented in data replication system 200 shown in FIG. 2.
  • Process 700 begins by receiving, through an API integrated into a data replication tool, a request from a SoR application for a replication setup and initialization (step 702).
  • The data replication tool imports a number of data instances through the API from a source database associated with the SoR (step 704). The replicated data instances might comprise JSON files.
  • A manager engine integrated into the data replication tool initiates a separate replication process for each data instance imported through the API (step 706). The manager engine is configured to support multiple data instances on a single virtual machine instance. The manager engine might also detect if a table in the replicated data instances requires a load.
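Step 706 can be pictured as a manager that starts one independent job per imported data instance on a single host. The Python sketch below is a simplified illustration: `replicate_instance` and `run_manager` are hypothetical names, the job only counts rows, and worker threads are used as a stand-in for the separate replication processes (Python's `ProcessPoolExecutor` offers the same interface for true processes).

```python
from concurrent.futures import ThreadPoolExecutor


def replicate_instance(instance: dict) -> tuple:
    """Hypothetical per-instance replication job; here it only counts rows."""
    return instance["name"], len(instance["rows"])


def run_manager(instances: list, max_workers: int = 4) -> dict:
    """Start one job per imported data instance and collect the results by name."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return dict(pool.map(replicate_instance, instances))
```

Isolating each instance in its own job means a slow or failing instance does not block the others that share the same machine.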
  • The data replication tool determines if there is a difference between a data description language (DDL) of the replicated data instances and a DDL of a target database (step 708). If a difference is detected, the data replication tool automatically changes the DDL of the replicated data instances to the DDL of the target database (step 710). The data replication tool might also automatically translate a table in the replicated data instance to a new structure. A translation model might be stored as SQL in a database. The translation model specifies exactly how the target should receive the information. The model might be built by the SoR or target owner, or it can be directly created by the data replication tool. Alternatively, the data replication tool can adapt the replication on the fly without a translation model.
  • If there is no difference in DDL between the source database and target database, or after the DDL has been changed in step 710, the data replication tool exports the replicated data instances to the target database (step 712). Process 700 then ends.
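Steps 708 through 712 can be sketched in Python by treating a DDL as a mapping from column name to type. This is an illustrative simplification, not the tool's method: the functions `diff_ddl`, `conform_rows`, and `replicate` are hypothetical, real DDL covers far more than column types, and no type conversion of values is attempted here.

```python
def diff_ddl(source: dict, target: dict) -> dict:
    """Compare two column-to-type mappings and report what differs (step 708)."""
    shared = sorted(set(source) & set(target))
    return {
        "missing_in_source": sorted(set(target) - set(source)),
        "extra_in_source": sorted(set(source) - set(target)),
        "type_changes": {
            col: (source[col], target[col])
            for col in shared
            if source[col] != target[col]
        },
    }


def conform_rows(rows: list, target: dict) -> list:
    """Reshape rows to the target columns (step 710): drop extras, fill gaps with None."""
    return [{col: row.get(col) for col in target} for row in rows]


def replicate(rows: list, source_ddl: dict, target_ddl: dict) -> list:
    """Adapt rows only when a difference is detected, then return them for export (step 712)."""
    changes = diff_ddl(source_ddl, target_ddl)
    if any(changes.values()):
        rows = conform_rows(rows, target_ddl)
    return rows
```

When the two mappings are identical, every entry in the diff is empty and the rows pass through unchanged, which mirrors the branch that skips step 710.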
  • Turning now to FIG. 8, an illustration of a block diagram of a data processing system is depicted in accordance with an illustrative embodiment. Data processing system 800 may be used to implement one or more computers shown in FIG. 1 (e.g., client devices 110 and server computers 104, 106) and data replication system 200 in FIG. 2. In this illustrative example, data processing system 800 includes communications framework 802, which provides communications between processor unit 804, memory 806, persistent storage 808, communications unit 810, input/output unit 812, and display 814. In this example, communications framework 802 may take the form of a bus system.
  • Processor unit 804 serves to execute instructions for software that may be loaded into memory 806. Processor unit 804 may be a number of processors, a multi-processor core, or some other type of processor, depending on the particular implementation. In an embodiment, processor unit 804 comprises one or more conventional general-purpose central processing units (CPUs). In an alternate embodiment, processor unit 804 comprises one or more graphics processing units (GPUs).
  • Memory 806 and persistent storage 808 are examples of storage devices 816. A storage device is any piece of hardware that is capable of storing information, such as, for example, without limitation, at least one of data, program code in functional form, or other suitable information either on a temporary basis, a permanent basis, or both on a temporary basis and a permanent basis. Storage devices 816 may also be referred to as computer-readable storage devices in these illustrative examples. Memory 806, in these examples, may be, for example, a random access memory or any other suitable volatile or non-volatile storage device. Persistent storage 808 may take various forms, depending on the particular implementation.
  • For example, persistent storage 808 may contain one or more components or devices. For example, persistent storage 808 may be a hard drive, a flash memory, a rewritable optical disk, a rewritable magnetic tape, or some combination of the above. The media used by persistent storage 808 also may be removable. For example, a removable hard drive may be used for persistent storage 808. Communications unit 810, in these illustrative examples, provides for communications with other data processing systems or devices. In these illustrative examples, communications unit 810 is a network interface card.
  • Input/output unit 812 allows for input and output of data with other devices that may be connected to data processing system 800. For example, input/output unit 812 may provide a connection for user input through at least one of a keyboard, a mouse, or some other suitable input device. Further, input/output unit 812 may send output to a printer. Display 814 provides a mechanism to display information to a user.
  • Instructions for at least one of the operating system, applications, or programs may be located in storage devices 816, which are in communication with processor unit 804 through communications framework 802. The processes of the different embodiments may be performed by processor unit 804 using computer-implemented instructions, which may be located in a memory, such as memory 806.
  • These instructions are referred to as program code, computer-usable program code, or computer-readable program code that may be read and executed by a processor in processor unit 804. The program code in the different embodiments may be embodied on different physical or computer-readable storage media, such as memory 806 or persistent storage 808.
  • Program code 818 is located in a functional form on computer-readable media 820 that is selectively removable and may be loaded onto or transferred to data processing system 800 for execution by processor unit 804. Program code 818 and computer-readable media 820 form computer program product 822 in these illustrative examples. In one example, computer-readable media 820 may be computer-readable storage media 824 or computer-readable signal media 826.
  • In these illustrative examples, computer-readable storage media 824 is a physical or tangible storage device used to store program code 818 rather than a medium that propagates or transmits program code 818. Computer-readable storage media 824, as used herein, is not to be construed as being transitory signals per se, such as radio waves or other freely propagating electromagnetic waves, electromagnetic waves propagating through a waveguide or other transmission media (e.g., light pulses passing through a fiber-optic cable), or electrical signals transmitted through a wire.
  • Alternatively, program code 818 may be transferred to data processing system 800 using computer-readable signal media 826. Computer-readable signal media 826 may be, for example, a propagated data signal containing program code 818. For example, computer-readable signal media 826 may be at least one of an electromagnetic signal, an optical signal, or any other suitable type of signal. These signals may be transmitted over at least one of communications links, such as wireless communications links, optical fiber cable, coaxial cable, a wire, or any other suitable type of communications link.
  • The different components illustrated for data processing system 800 are not meant to provide architectural limitations to the manner in which different embodiments may be implemented. The different illustrative embodiments may be implemented in a data processing system including components in addition to or in place of those illustrated for data processing system 800. Other components shown in FIG. 8 can be varied from the illustrative examples shown. The different embodiments may be implemented using any hardware device or system capable of running program code 818.
  • As used herein, the phrase “a number” means one or more. The phrase “at least one of”, when used with a list of items, means different combinations of one or more of the listed items may be used, and only one of each item in the list may be needed. In other words, “at least one of” means any combination of items and number of items may be used from the list, but not all of the items in the list are required. The item may be a particular object, a thing, or a category.
  • For example, without limitation, “at least one of item A, item B, or item C” may include item A, item A and item B, or item C. This example also may include item A, item B, and item C or item B and item C. Of course, any combinations of these items may be present. In some illustrative examples, “at least one of” may be, for example, without limitation, two of item A; one of item B; and ten of item C; four of item B and seven of item C; or other suitable combinations.
  • The flowcharts and block diagrams in the different depicted embodiments illustrate the architecture, functionality, and operation of some possible implementations of apparatuses and methods in an illustrative embodiment. In this regard, each block in the flowcharts or block diagrams may represent at least one of a module, a segment, a function, or a portion of an operation or step. For example, one or more of the blocks may be implemented as program code.
  • In some alternative implementations of an illustrative embodiment, the function or functions noted in the blocks may occur out of the order noted in the figures. For example, in some cases, two blocks shown in succession may be performed substantially concurrently, or the blocks may sometimes be performed in the reverse order, depending upon the functionality involved. Also, other blocks may be added in addition to the illustrated blocks in a flowchart or block diagram.
  • The description of the different illustrative embodiments has been presented for purposes of illustration and description and is not intended to be exhaustive or limited to the embodiments in the form disclosed. The different illustrative examples describe components that perform actions or operations. In an illustrative embodiment, a component may be configured to perform the action or operation described. For example, the component may have a configuration or design for a structure that provides the component an ability to perform the action or operation that is described in the illustrative examples as being performed by the component. Many modifications and variations will be apparent to those of ordinary skill in the art. Further, different illustrative embodiments may provide different features as compared to other desirable embodiments. The embodiment or embodiments selected are chosen and described in order to best explain the principles of the embodiments, the practical application, and to enable others of ordinary skill in the art to understand the disclosure for various embodiments with various modifications as are suited to the particular use contemplated.

Claims (42)

What is claimed is:
1. A computer-implemented method of data replication between databases, the method comprising:
using a number of processors and random access memory allocation to perform the steps of:
receiving, through an application programming interface (API) integrated into a data replication tool, a request from a system of record (SoR) application for a replication setup and initialization;
importing, through the API, a number of data instances from a source database associated with the SoR;
initiating, by a manager engine integrated into the data replication tool, a separate replication process for each data instance imported through the API, wherein the manager engine is configured to support multiple data instances on a single virtual machine instance;
if a difference between a data description language (DDL) of the replicated data instances and a DDL of a target database is detected, automatically changing the DDL of the replicated data instances to the DDL of the target database; and
exporting the replicated data instances to the target database.
2. The method of claim 1, wherein the replicated data instances comprise JavaScript Object Notation files.
3. The method of claim 1, further comprising persisting, through the API, database configuration information on a non-relational database.
4. The method of claim 1, further comprising forwarding a data replication log in real-time from the manager engine to a data analytics service.
5. The method of claim 1, further comprising forwarding a data replication log in real-time from a user interface integrated into the data replication tool to a data analytics service.
6. The method of claim 1, wherein the API runs on a container.
7. The method of claim 1, further comprising receiving a replication start or stop request from the SoR through the API.
8. The method of claim 1, further comprising receiving a request for real-time status from the SoR through the API.
9. The method of claim 1, further comprising detecting, by the manager engine, if a table in the replicated data instances requires a load.
10. The method of claim 1, further comprising automatically translating, by the manager engine, a table in the replicated data instance to a new structure.
11. The method of claim 1, wherein a system administrator communicates with the API through a user interface integrated into the data replication system.
12. The method of claim 1, wherein the manager engine is one of a number of manager engines in a load balancing cluster.
13. The method of claim 1, wherein the data replication tool is integrated with a live stream chat system.
14. The method of claim 1, wherein an orchestrator integrated into the data replication system executes a specified list of actions by the manager engine under specified conditions.
15. A system for data replication between databases, the system comprising:
a storage device configured to store program instructions; and
one or more processors operably connected to the storage device and configured to execute the program instructions to cause the system to:
receive, through an application programming interface (API) integrated into a data replication tool, a request from a system of record (SoR) application for a replication setup and initialization;
import, through the API, a number of data instances from a source database associated with the SoR;
initiate, by a manager engine integrated into the data replication tool, a separate replication process for each data instance imported through the API, wherein the manager engine is configured to support multiple data instances on a single virtual machine instance;
if a difference between a data description language (DDL) of the replicated data instances and a DDL of a target database is detected, automatically change the DDL of the replicated data instances to the DDL of the target database; and
export the replicated data instances to the target database.
16. The system of claim 15, wherein the replicated data instances comprise JavaScript Object Notation files.
17. The system of claim 15, wherein the processors further execute instructions to persist, through the API, database configuration information on a non-relational database.
18. The system of claim 15, wherein the processors further execute instructions to forward a data replication log in real-time from the manager engine to a data analytics service.
19. The system of claim 15, wherein the processors further execute instructions to forward a data replication log in real-time from a user interface integrated into the data replication tool to a data analytics service.
20. The system of claim 15, wherein the API runs on a container.
21. The system of claim 15, wherein the processors further execute instructions to receive a replication start or stop request from the SoR through the API.
22. The system of claim 15, wherein the processors further execute instructions to receive a request for real-time status from the SoR through the API.
23. The system of claim 15, wherein the processors further execute instructions to detect, by the manager engine, if a table in the replicated data instances requires a load.
24. The system of claim 15, wherein the processors further execute instructions to automatically translate, by the manager engine, a table in the replicated data instance to a new structure.
25. The system of claim 15, wherein a system administrator communicates with the API through a user interface integrated into the data replication system.
26. The system of claim 15, wherein the manager engine is one of a number of manager engines in a load balancing cluster.
27. The system of claim 15, wherein the data replication tool is integrated with a live stream chat system.
28. The system of claim 15, wherein an orchestrator integrated into the data replication system executes a specified list of actions by the manager engine under specified conditions.
29. A computer program product for data replication between databases, the computer program product comprising:
a computer-readable storage medium having program instructions embodied thereon to perform the steps of:
receiving, through an application programming interface (API) integrated into a data replication tool, a request from a system of record (SoR) application for a replication setup and initialization;
importing, through the API, a number of data instances from a source database associated with the SoR;
initiating, by a manager engine integrated into the data replication tool, a separate replication process for each data instance imported through the API, wherein the manager engine is configured to support multiple data instances on a single virtual machine instance;
if a difference between a data description language (DDL) of the replicated data instances and a DDL of a target database is detected, automatically changing the DDL of the replicated data instances to the DDL of the target database; and
exporting the replicated data instances to the target database.
30. The computer program product of claim 29, wherein the replicated data instances comprise JavaScript Object Notation files.
31. The computer program product of claim 29, further comprising instructions for persisting, through the API, database configuration information on a non-relational database.
32. The computer program product of claim 29, further comprising instructions for forwarding a data replication log in real-time from the manager engine to a data analytics service.
33. The computer program product of claim 29, further comprising instructions for forwarding a data replication log in real-time from a user interface integrated into the data replication tool to a data analytics service.
34. The computer program product of claim 29, wherein the API runs on a container.
35. The computer program product of claim 29, further comprising instructions for receiving a replication start or stop request from the SoR through the API.
36. The computer program product of claim 29, further comprising instructions for receiving a request for real-time status from the SoR through the API.
37. The computer program product of claim 29, further comprising instructions for detecting, by the manager engine, if a table in the replicated data instances requires a load.
38. The computer program product of claim 29, further comprising instructions for automatically translating, by the manager engine, a table in the replicated data instance to a new structure.
39. The computer program product of claim 29, wherein a system administrator communicates with the API through a user interface integrated into the data replication system.
40. The computer program product of claim 29, wherein the manager engine is one of a number of manager engines in a load balancing cluster.
41. The computer program product of claim 29, wherein the data replication tool is integrated with a live stream chat system.
42. The computer program product of claim 29, wherein an orchestrator integrated into the data replication system executes a specified list of actions by the manager engine under specified conditions.
US17/099,059 2020-11-16 2020-11-16 Database Data Replication Tool Abandoned US20220156278A1 (en)

Priority Applications (3)

Application Number Priority Date Filing Date Title
US17/099,059 US20220156278A1 (en) 2020-11-16 2020-11-16 Database Data Replication Tool
PCT/US2021/046404 WO2022103461A1 (en) 2020-11-16 2021-08-18 Database data replication tool
US18/113,982 US20240020314A1 (en) 2020-11-16 2023-02-24 Database data replication tool

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US17/099,059 US20220156278A1 (en) 2020-11-16 2020-11-16 Database Data Replication Tool

Related Child Applications (1)

Application Number Title Priority Date Filing Date
US18/113,982 Continuation US20240020314A1 (en) 2020-11-16 2023-02-24 Database data replication tool

Publications (1)

Publication Number Publication Date
US20220156278A1 true US20220156278A1 (en) 2022-05-19

Family

ID=77726544

Family Applications (2)

Application Number Title Priority Date Filing Date
US17/099,059 Abandoned US20220156278A1 (en) 2020-11-16 2020-11-16 Database Data Replication Tool
US18/113,982 Pending US20240020314A1 (en) 2020-11-16 2023-02-24 Database data replication tool

Family Applications After (1)

Application Number Title Priority Date Filing Date
US18/113,982 Pending US20240020314A1 (en) 2020-11-16 2023-02-24 Database data replication tool

Country Status (2)

Country Link
US (2) US20220156278A1 (en)
WO (1) WO2022103461A1 (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090089619A1 (en) * 2007-09-27 2009-04-02 Kung-Shiuh Huang Automatic detection of functional defects and performance bottlenecks in network devices
US20150249769A1 (en) * 2014-03-03 2015-09-03 Canon Kabushiki Kaisha Image forming apparatus, method for controlling image forming apparatus, and storage medium
US20160292250A1 (en) * 2013-12-12 2016-10-06 Huawei Technologies Co., Ltd. Data replication method and storage system
US20190057163A1 (en) * 2017-08-18 2019-02-21 Sap Se Classification of log entry types
US20200117680A1 (en) * 2017-03-28 2020-04-16 Gb Gas Holdings Limited Data replication system

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2452279A1 (en) * 2009-07-09 2012-05-16 Xeround Systems Ltd. Database system with query interception and redirection
US8818949B2 (en) * 2011-12-30 2014-08-26 Bmc Software, Inc. Systems and methods for migrating database data
US9690834B2 (en) * 2014-11-27 2017-06-27 Siemens Product Lifecycle Management Software Inc. Representation, comparison, and troubleshooting of native data between environments

Also Published As

Publication number Publication date
WO2022103461A1 (en) 2022-05-19
US20240020314A1 (en) 2024-01-18

Legal Events

Date Code Title Description
AS Assignment

Owner name: ADP, LLC, NEW JERSEY

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:BAUMERT, YOANN;DELAUNAY, CLEMENT;CHIS, DANIEL-FLORIN;REEL/FRAME:054378/0428

Effective date: 20201111

AS Assignment

Owner name: ADP, INC., NEW JERSEY

Free format text: CHANGE OF NAME;ASSIGNOR:ADP, LLC;REEL/FRAME:058959/0729

Effective date: 20200630

STPP Information on status: patent application and granting procedure in general

Free format text: NON FINAL ACTION MAILED

STPP Information on status: patent application and granting procedure in general

Free format text: RESPONSE TO NON-FINAL OFFICE ACTION ENTERED AND FORWARDED TO EXAMINER

STPP Information on status: patent application and granting procedure in general

Free format text: FINAL REJECTION MAILED

STPP Information on status: patent application and granting procedure in general

Free format text: NON FINAL ACTION MAILED

STPP Information on status: patent application and granting procedure in general

Free format text: RESPONSE TO NON-FINAL OFFICE ACTION ENTERED AND FORWARDED TO EXAMINER

STPP Information on status: patent application and granting procedure in general

Free format text: FINAL REJECTION MAILED

STPP Information on status: patent application and granting procedure in general

Free format text: ADVISORY ACTION MAILED

STCB Information on status: application discontinuation

Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION