CN112182080A - Data integration system and data processing method based on data integration system - Google Patents

Publication number
CN112182080A
CN112182080A CN202011003208.0A CN202011003208A CN112182080A CN 112182080 A CN112182080 A CN 112182080A CN 202011003208 A CN202011003208 A CN 202011003208A CN 112182080 A CN112182080 A CN 112182080A
Authority
CN
China
Prior art keywords
data
database
project
data processing
acquisition
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202011003208.0A
Other languages
Chinese (zh)
Inventor
林文奇
苏志锋
张峰
郑积聪
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China Construction Bank Corp
Original Assignee
China Construction Bank Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China Construction Bank Corp
Priority to CN202011003208.0A
Publication of CN112182080A
Legal status: Pending

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/25 Integrating or interfacing systems involving database management systems
    • G06F16/252 Integrating or interfacing systems involving database management systems between a Database Management System and a front-end application
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/22 Indexing; Data structures therefor; Storage structures
    • G06F16/2282 Tablespace storage structures; Management thereof
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/25 Integrating or interfacing systems involving database management systems
    • G06F16/254 Extract, transform and load [ETL] procedures, e.g. ETL data flows in data warehouses

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Software Systems (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a data integration system and a data processing method based on the data integration system, and relates to the technical field of computers. The system comprises a resource isolation module, a data acquisition module, a data integration module and a system database. The resource isolation module is used for setting the rights of the system and of at least one project contained in the system, so as to realize resource isolation. The data acquisition module is used for calling a data acquisition interface to acquire data and landing the acquired data in the system database. The data integration module is used for accessing data in the system database and processing the accessed data according to a data processing rule. This embodiment integrates the data acquisition and data integration processes, so a user does not need to develop a background service for each application, which reduces labor cost and code maintenance cost.

Description

Data integration system and data processing method based on data integration system
Technical Field
The invention relates to the technical field of computers, in particular to a data integration system and a data processing method based on the data integration system.
Background
In the data era, data processing is at the core of system applications, and most system applications go through the processes of data acquisition, data processing and data interface calling. To handle the differing data of different businesses, an application must develop a separate application service back end for each business, producing multiple sets of largely identical code that do the same kind of work, which results in high labor cost and code maintenance cost.
Disclosure of Invention
In view of this, embodiments of the present invention provide a data integration system and a data processing method based on the data integration system, which can integrate data acquisition and data integration processes, and reduce labor cost and code maintenance cost, because a user does not need to develop a background service for each application.
To achieve the above object, according to an aspect of an embodiment of the present invention, a data integration system is provided.
A data integration system according to an embodiment of the present invention includes: a resource isolation module, a data acquisition module, a data integration module and a system database; wherein the resource isolation module is configured to: set the rights of the system and of at least one project contained in the system to realize resource isolation; the data acquisition module is configured to: call a data acquisition interface to acquire data and land the acquired data in the system database; the data integration module is configured to: access the data in the system database, and process the accessed data according to a data processing rule.
Optionally, the resource isolation module is further configured to: manage system roles, wherein the system roles include: a super administrator, a project creator of the at least one project, and a project user of the at least one project.
Optionally, the resource isolation module is further configured to: grant the super administrator the rights to manage the system menu, configure traffic thresholds, configure timeout thresholds, and view the project log of the at least one project; grant the project creator of the at least one project the rights corresponding to the at least one project; and control, through the project creator of the at least one project, the rights of the project user of the at least one project to the project menu corresponding to the at least one project.
Optionally, the resource isolation module is further configured to: performing resource isolation on the system database through the unique identifier of the at least one project to determine an authorization database corresponding to the at least one project; wherein the system database comprises: a common data area and an external data source.
Optionally, the data acquisition module is further configured to: receive a data acquisition instruction, and determine a landing database and an acquisition table structure according to the data acquisition instruction, wherein the landing database is a database in the system database; and call the data acquisition interface to acquire data, and store the acquired data in the landing database according to the acquisition table structure.
Optionally, the data acquisition module is further configured to: parse the data acquisition instruction to obtain a target project corresponding to the data acquisition instruction; select at least one authorization database managed by the target project from the system database according to the unique identifier of the target project; determine whether the acquisition table structure exists in the at least one authorization database; if so, determine the database in which the acquisition table structure exists as the landing database; if not, select one database from the at least one authorization database as the landing database, and create the acquisition table structure.
Optionally, the data acquisition module is further configured to: configure acquisition table basic information through the target project, wherein the acquisition table basic information comprises at least one of the following options: table name, table remark, field name, field remark and field type; and convert the acquisition table basic information into a CREATE statement to create the acquisition table structure.
Optionally, the input fields of the data acquisition interface include: the unique identifier corresponding to the data acquisition instruction, and the acquired data.
Optionally, the data acquisition module is further configured to: parse the acquired data to obtain at least one insert statement; execute the at least one insert statement according to a preset anti-collision mode so as to store the acquired data in the landing database; wherein the preset anti-collision mode comprises at least one of the following options: identification-field mode, same-statement no-operation mode, and update mode.
Optionally, the data integration module is further configured to: receive the data processing request, and identify the project corresponding to the data processing request and the data processing rule to be executed; call a data processing interface corresponding to the data processing rule to be executed, and access the system database according to the unique identifier of the identified project to obtain data to be processed; and process the data to be processed according to the data processing rule to be executed, and return a processing result through the called data processing interface.
Optionally, the data integration module is further configured to: configuring at least one data processing rule in a front-end and back-end separation mode, and providing a data processing interface corresponding to the at least one data processing rule; wherein the data processing rule to be executed is a rule in the at least one data processing rule.
Optionally, the at least one data processing rule supports multiple development languages; and the data integration module loads a plurality of data source driver packages so as to establish connections with the system database.
Optionally, the system further comprises a system operation and maintenance module configured to: provide full and incremental import and export functions for the at least one project, so as to keep data consistent between the test environment and the production environment; set a log viewing right corresponding to the at least one project; and monitor the traffic, the timeout, and the maximum number of returned records of the at least one project.
Optionally, the system operation and maintenance module is further configured to: receive an export requirement, and obtain, in the test environment and according to the export requirement, the data to be exported in full or incrementally, so that the exported data can be imported in the production environment; wherein the exported data comprises: project information, role information, and interface information.
To achieve the above object, according to still another aspect of the embodiments of the present invention, there is provided a data processing method based on a data integration system.
The data processing method based on the data integration system comprises the following steps: receiving a data acquisition instruction, analyzing the data acquisition instruction, calling a data acquisition interface to acquire data, and landing the acquired data to a system database; and receiving a data processing request, accessing data in the system database according to the data processing request, and processing the accessed data according to a data processing rule.
Optionally, the receiving a data acquisition instruction, parsing the data acquisition instruction, calling a data acquisition interface to acquire data, and landing the acquired data in a system database includes: receiving the data acquisition instruction, parsing the data acquisition instruction, and determining a landing database and an acquisition table structure, wherein the landing database is a database in the system database; and calling the data acquisition interface to acquire data, and storing the acquired data in the landing database according to the acquisition table structure.
Optionally, the parsing the data acquisition instruction to determine a landing database and an acquisition table structure includes: parsing the data acquisition instruction to obtain a target project corresponding to the data acquisition instruction; selecting at least one authorization database managed by the target project from the system database according to the unique identifier of the target project; determining whether the acquisition table structure exists in the at least one authorization database; if so, determining the database in which the acquisition table structure exists as the landing database; if not, selecting one database from the at least one authorization database as the landing database, and creating the acquisition table structure.
Optionally, the creating the acquisition table structure includes: configuring acquisition table basic information through the target project, wherein the acquisition table basic information comprises at least one of the following options: table name, table remark, field name, field remark and field type; and converting the acquisition table basic information into a CREATE statement to create the acquisition table structure.
Optionally, the input fields of the data acquisition interface include: the unique identifier corresponding to the data acquisition instruction, and the acquired data.
Optionally, the storing the acquired data in the landing database includes: parsing the acquired data to obtain at least one insert statement; executing the at least one insert statement according to a preset anti-collision mode so as to store the acquired data in the landing database; wherein the preset anti-collision mode comprises at least one of the following options: identification-field mode, same-statement no-operation mode, and update mode.
Optionally, the receiving a data processing request, accessing data in the system database according to the data processing request, and processing the accessed data according to a data processing rule includes: receiving the data processing request, and identifying the project corresponding to the data processing request and the data processing rule to be executed; calling a data processing interface corresponding to the data processing rule to be executed, and accessing the system database according to the unique identifier of the identified project to obtain data to be processed; and processing the data to be processed according to the data processing rule to be executed, and returning a processing result through the called data processing interface.
Optionally, the method further comprises: configuring at least one data processing rule in a front-end and back-end separation mode, and providing a data processing interface corresponding to the at least one data processing rule; wherein the data processing rule to be executed is a rule in the at least one data processing rule.
Optionally, the at least one data processing rule supports multiple development languages; and the data integration module loads a plurality of data source driver packages so as to establish connections with the system database.
To achieve the above object, according to another aspect of an embodiment of the present invention, there is provided an electronic apparatus.
An electronic device of an embodiment of the present invention includes: one or more processors; and a storage device for storing one or more programs which, when executed by the one or more processors, cause the one or more processors to implement the data processing method based on the data integration system of an embodiment of the present invention.
To achieve the above object, according to still another aspect of an embodiment of the present invention, there is provided a computer-readable medium.
A computer-readable medium of an embodiment of the present invention stores thereon a computer program, and the computer program, when executed by a processor, implements a data processing method based on a data integration system of an embodiment of the present invention.
One embodiment of the above invention has the following advantages or benefits: the data integration system provided by the embodiment of the invention has a data acquisition module that acquires data and lands it in the system database, and a data integration module that processes the data in the system database, so that user requirements can be met and the required data can be provided to the user. In addition, the resource isolation module of the data integration system can set the rights of the system and of at least one project contained in the system, realizing resource isolation at both the system level and the project level. Faced with the complex requirements of data applications, the data integration system provided by the embodiment of the invention integrates the data acquisition, data integration, and system operation and maintenance processes, so a user does not need to develop a background service for each application; this reduces labor cost and code maintenance cost and simplifies operation and maintenance work.
Further effects of the optional, non-conventional implementations described above are explained below in connection with the specific embodiments.
Drawings
The drawings are included to provide a better understanding of the invention and are not to be construed as unduly limiting the invention. Wherein:
FIG. 1 is a schematic diagram of a data integration system according to an embodiment of the present invention;
FIG. 2 is a schematic diagram of the main steps of a data processing method based on a data integration system according to an embodiment of the present invention;
FIG. 3 is a schematic diagram of a data acquisition process according to an embodiment of the invention;
FIG. 4 is a schematic diagram of a process for storing acquired data in a landing database according to an embodiment of the invention;
FIG. 5 is a schematic diagram of a data processing process according to an embodiment of the invention;
FIG. 6 is an exemplary system architecture diagram in which embodiments of the present invention may be employed;
FIG. 7 is a schematic block diagram of a computer system suitable for use in implementing a terminal device or server of an embodiment of the invention.
Detailed Description
Exemplary embodiments of the present invention are described below with reference to the accompanying drawings, in which various details of embodiments of the invention are included to assist understanding, and which are to be considered as merely exemplary. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the invention. Also, descriptions of well-known functions and constructions are omitted in the following description for clarity and conciseness.
Fig. 1 is a schematic structural diagram of a data integration system according to an embodiment of the present invention. As shown in fig. 1, the data integration system 100 may include: a resource isolation module 101, a data acquisition module 102, a data integration module 103, and a system database 104.
The resource isolation module 101 is configured to: set the rights of the system and of at least one project contained in the system to realize resource isolation; the data acquisition module 102 is configured to: call a data acquisition interface to acquire data, and land the acquired data in the system database; the data integration module 103 is configured to: access data in the system database, and process the accessed data according to a data processing rule.
In the data era, most system applications go through the processes of data acquisition, data processing and data interface calling. To handle the differing data of different businesses, an application must develop a separate application service back end for each business, producing multiple sets of largely identical code that do the same kind of work, which results in high labor cost and code maintenance cost. The data integration system provided by the embodiment of the invention has a data acquisition module that acquires data and lands it in the system database, and a data integration module that processes the data in the system database, so that user requirements can be met and the required data can be provided to the user. In addition, in the embodiment of the present invention, the data integration system is the basic operating environment that supports the running of each service, and it manages specific services in units of projects, where each project corresponds to a specific service requirement. The resource isolation module of the data integration system can set the rights of the system and of at least one project contained in the system, realizing resource isolation at both the system level and the project level.
Therefore, in the face of complex requirements of data applications, the data integration system provided by the embodiment of the invention can integrate the data acquisition and data integration processes, a user does not need to develop background services for each application, and the labor cost and the code maintenance cost are reduced.
In a reference embodiment of the present invention, the resource isolation module 101 is further configured to manage system roles. The system roles include: a super administrator, a project creator of the at least one project, and a project user of the at least one project.
The super administrator has the highest authority in the data integration system and is responsible for operating and maintaining the whole system. Specifically, the super administrator may manage the system menu, configure traffic thresholds, configure timeout thresholds, and view the project log of at least one project. Managing the system menu may include the following: (1) the super administrator may manage at least one project contained in the system, for example by adding a project A, after which the system can satisfy the specific service requirement corresponding to project A (note that a service requirement here means a data-related service requirement); (2) the super administrator may manage the system database, for example by increasing its storage capacity or adding database types; (3) the super administrator may manage the data processing rules contained in the system; (4) the super administrator may manage the application program interfaces (APIs) contained in the system; (5) the super administrator may also manage the operation and maintenance state of the system. In addition, the super administrator can configure the traffic threshold of the system and of each project contained in the system, configure the timeout threshold of the system and of each project, and query the project log of each project.
Each project has a corresponding project creator, who can be regarded as the project administrator and holds the highest rights at the project level. Project users can be regarded as ordinary users, and the project creator of each project grants them specific rights. That is, the resource isolation module 101 of the data integration system isolates resources by project: it controls the role of each user and, taking the project as the unit, isolates the menu and data interface rights corresponding to that project. Different roles are created for each project as required, and menus such as data source, API, project go-live and project log are isolated by role, so that different roles can have different menu rights.
In a reference embodiment of the present invention, the resource isolation module 101 is further configured to: perform resource isolation on the system database through the unique identifier of at least one project to determine the authorization database corresponding to the at least one project; wherein the system database includes a common data area and an external data source. The common data area is a storage area provided by the system, such as an Oracle cluster, and the external data source may be provided by the project itself. Each project may manage one or more authorized databases; databases that have not been authorized are not accessible to the project. In the embodiment of the invention, the data integration system realizes resource isolation of the system database by using the project unique identifier as the identifier for data resource isolation.
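The project-scoped isolation described above can be illustrated with a minimal Python sketch. It is not the patented implementation: the class `DatabaseRegistry`, its method names, and the project and database names are all hypothetical, and the grants are held in memory purely for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class DatabaseRegistry:
    """Maps a project's unique identifier to the databases it may access."""

    # project_id -> names of authorized databases (all names are hypothetical)
    _grants: dict[str, set[str]] = field(default_factory=dict)

    def authorize(self, project_id: str, database: str) -> None:
        """Record that the project is authorized to use the database."""
        self._grants.setdefault(project_id, set()).add(database)

    def authorized_databases(self, project_id: str) -> set[str]:
        """Return a copy so that callers cannot mutate the stored grants."""
        return set(self._grants.get(project_id, set()))

    def check_access(self, project_id: str, database: str) -> None:
        """Raise PermissionError if the database is not authorized for the project."""
        if database not in self._grants.get(project_id, set()):
            raise PermissionError(
                f"project {project_id!r} is not authorized for {database!r}"
            )


registry = DatabaseRegistry()
registry.authorize("project-a", "common_area_oracle")        # common data area
registry.authorize("project-a", "project_a_external_mysql")  # external data source
registry.authorize("project-b", "common_area_oracle")
```

In this sketch, `registry.check_access("project-b", "project_a_external_mysql")` raises `PermissionError`, because that database was never authorized for project B.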
Furthermore, the resource isolation module 101 may use the Apache Shiro framework (a powerful and easy-to-use Java security framework that performs authentication, authorization, cryptography, and session management), using the @RequiresPermissions annotation to retrieve the user's records from the permission record table and check the user's permissions. By default, a user has minimal rights and can only perform viewing operations; the project creator can open further rights to the user as circumstances require.
The data acquisition module 102 can call a data acquisition interface to acquire data and land the acquired data in the system database 104, that is, store the acquired data in the system database 104.
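As a rough analogue of an annotation-based permission check, the following Python sketch uses a decorator. It does not use Apache Shiro and is not the embodiment's code: `PERMISSION_RECORDS`, `requires_permission`, `update_api`, the user names and the permission strings are all hypothetical.

```python
import functools

# Hypothetical permission record table: user -> granted permission strings.
PERMISSION_RECORDS = {
    "alice": {"project-a:api:view", "project-a:api:edit"},  # rights opened by the project creator
    "bob": {"project-a:api:view"},                          # default user: view only
}


def requires_permission(permission):
    """Reject the call unless the calling user holds the given permission."""
    def decorator(func):
        @functools.wraps(func)
        def wrapper(user, *args, **kwargs):
            # Look up the user's records and check the required permission.
            if permission not in PERMISSION_RECORDS.get(user, set()):
                raise PermissionError(f"{user!r} lacks {permission!r}")
            return func(user, *args, **kwargs)
        return wrapper
    return decorator


@requires_permission("project-a:api:edit")
def update_api(user, api_name):
    """A protected operation; only users with the edit permission reach this body."""
    return f"{api_name} updated by {user}"
```

Here `update_api("alice", "orders")` succeeds, while `update_api("bob", "orders")` raises `PermissionError` because bob holds only the view permission.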
In a reference embodiment of the present invention, the data acquisition module 102 is further configured to: receive a data acquisition instruction, and determine a landing database and an acquisition table structure according to the data acquisition instruction; and call a data acquisition interface to acquire data, and store the acquired data in the landing database according to the acquisition table structure. The data acquisition instruction is an acquisition instruction set according to project requirements; for example, data acquisition for project A can be set to run every 10 minutes. The landing database is the specific database in which the acquired data is to be stored, and it is one of the databases in the system database. The acquisition table structure is the structure in which the acquired data is stored in the landing database. The data acquisition module 102 determines the landing database and the acquisition table structure from the data acquisition instruction, and can then call the data acquisition interface and store the acquired data in the landing database in the form given by the acquisition table structure.
In a reference embodiment of the present invention, the data acquisition module 102 is further configured to: parse the data acquisition instruction to obtain the target project corresponding to the data acquisition instruction; select at least one authorization database managed by the target project from the system database according to the unique identifier of the target project; determine whether the acquisition table structure exists in the at least one authorization database; if so, determine the database in which the acquisition table structure exists as the landing database; if not, select one database from the at least one authorization database as the landing database, and create the acquisition table structure.
The data acquisition instruction is an acquisition instruction set according to project requirements, so the target project corresponding to the data acquisition instruction can be obtained by parsing the instruction. As described above, the resource isolation module 101 may perform resource isolation on the system database through the unique identifier of the at least one project to determine the authorized database corresponding to the at least one project. Therefore, after the target project is obtained, at least one authorized database managed by the target project can be selected from the system database according to the unique identifier of the target project. It is then determined whether an authorized database managed by the target project contains a data table in the form of the acquisition table structure. If it does, the database containing that table is determined to be the landing database, and the acquired data can be stored in it directly. If not, one of the authorized databases is selected as the landing database; the selection may be random or may depend on the working state of the authorized databases.
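The selection logic above can be sketched as follows. This is an illustration under stated assumptions, not the embodiment's code: the function name `choose_landing_database` and the input shape (a mapping from database name to the set of table names it contains) are hypothetical, and the fallback picks the first database by name, whereas the text allows a random choice or one based on working state.

```python
def choose_landing_database(authorized, table_name):
    """Pick the landing database for an acquisition table.

    authorized: dict mapping each authorized database name to the set of
                table names that already exist in it (hypothetical shape).
    Returns (database_name, needs_create), where needs_create is True when
    the acquisition table structure still has to be created.
    """
    if not authorized:
        raise ValueError("the target project manages no authorized database")
    # Prefer a database that already contains the acquisition table.
    for name in sorted(authorized):
        if table_name in authorized[name]:
            return name, False
    # Otherwise pick one database; this sketch simply takes the first by name.
    return sorted(authorized)[0], True


project_a_databases = {"db_b": {"orders"}, "db_a": {"users"}}
```

With these inputs, asking for the table `orders` selects `db_b` without creating anything, while asking for a table that exists nowhere selects `db_a` and signals that the table must be created.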
If no authorized database managed by the target project contains a data table in the form of the acquisition table structure, the acquisition table structure needs to be created. Therefore, in a reference embodiment of the present invention, the data acquisition module 102 is further configured to: configure acquisition table basic information through the target project, wherein the acquisition table basic information comprises at least one of the following options: table name, table remark, field name, field remark and field type; and convert the acquisition table basic information into a CREATE statement to create the acquisition table structure.
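Converting table basic information into a CREATE statement might look like the sketch below. The dictionary layout, the function name `build_create_statement`, and the allowed type list are assumptions made for illustration; SQLite is used only because it ships with Python. Remarks are left out of the generated statement because SQLite has no column comment syntax, and identifiers are validated because they cannot be bound as SQL parameters.

```python
import re
import sqlite3

_IDENTIFIER = re.compile(r"^[A-Za-z_][A-Za-z0-9_]*$")
_ALLOWED_TYPES = {"INTEGER", "REAL", "TEXT"}  # assumed subset of field types


def build_create_statement(table_info):
    """Turn acquisition table basic information into a CREATE TABLE statement."""
    table = table_info["table_name"]
    if not _IDENTIFIER.match(table):
        raise ValueError(f"invalid table name: {table!r}")
    columns = []
    for field_info in table_info["fields"]:
        name = field_info["name"]
        field_type = field_info["type"].upper()
        if not _IDENTIFIER.match(name):
            raise ValueError(f"invalid field name: {name!r}")
        if field_type not in _ALLOWED_TYPES:
            raise ValueError(f"unsupported field type: {field_type!r}")
        columns.append(f"{name} {field_type}")
    if not columns:
        raise ValueError("at least one field is required")
    return f"CREATE TABLE IF NOT EXISTS {table} ({', '.join(columns)})"


table_info = {
    "table_name": "app_user",
    "table_remark": "users collected for project A",  # kept as metadata only
    "fields": [
        {"name": "user_id", "remark": "unique user id", "type": "integer"},
        {"name": "name", "remark": "display name", "type": "text"},
    ],
}
statement = build_create_statement(table_info)

# Execute the statement against an in-memory database to create the table.
connection = sqlite3.connect(":memory:")
connection.execute(statement)
```

For the example input, `statement` is `CREATE TABLE IF NOT EXISTS app_user (user_id INTEGER, name TEXT)`.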
The data acquisition module 102 calls a data acquisition interface to acquire data. The data acquisition interface may be a standard application program interface that accepts a long array and loads the data in batches. Two fields need to be passed in: the unique identifier corresponding to the data acquisition instruction, and the acquired data. The unique identifier corresponding to the data acquisition instruction can be used to track the landing state of the data.
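A minimal sketch of such an interface, with the two input fields and status tracking keyed by the instruction identifier, is shown below. The function `collect`, the `landing_status` dictionary, the status strings and the `store` callback are hypothetical names chosen for illustration.

```python
# instruction identifier -> landing state (hypothetical in-memory tracker)
landing_status = {}


def collect(instruction_id, rows, store):
    """Accept a batch of rows and land them via the given store callable.

    instruction_id: unique identifier of the data acquisition instruction.
    rows:           list of records (the acquired data), loaded as one batch.
    store:          callable that persists the whole batch; an assumption here.
    """
    landing_status[instruction_id] = "received"
    try:
        store(rows)
    except Exception:
        # Record the failure so the landing state can be traced later.
        landing_status[instruction_id] = "failed"
        raise
    landing_status[instruction_id] = f"landed:{len(rows)}"
    return landing_status[instruction_id]
```

A caller could pass `list.extend` of a plain list as `store` to try the function out, then look up `landing_status[instruction_id]` to see whether the batch landed.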
The data acquisition module 102 may land the acquired data in the system database 104 as follows: parse the acquired data to obtain at least one insert statement; execute the at least one insert statement according to a preset anti-collision mode so as to store the acquired data in the landing database; wherein the preset anti-collision mode comprises at least one of the following options: identification-field mode, same-statement no-operation mode, and update mode. In the identification-field mode, records with the same identifier are landed by an update operation. In the same-statement no-operation mode, identical data is not landed again. In the update mode, a specific field is preset; when data with the same value of that field is imported again, the existing data is deleted and reinserted to complete the update. For example, if user_id of the table user is the specific field and data with user_id 1 already exists in the landing database, then the next time data with the same user_id is imported, the existing data is deleted and the new data is inserted.
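The three anti-collision modes can be sketched against SQLite as follows. This is one possible reading of the modes, not the embodiment's code: the function `land_rows`, the mode names as strings, and the table `app_user` with columns `user_id` and `name` are assumptions.

```python
import sqlite3


def land_rows(conn, rows, mode):
    """Store rows in app_user(user_id, name) under one anti-collision mode."""
    cur = conn.cursor()
    insert_sql = "INSERT INTO app_user (user_id, name) VALUES (?, ?)"
    for row in rows:
        uid, name = row["user_id"], row["name"]
        if mode == "identification_field":
            # A record with the same identifier is updated in place.
            cur.execute("UPDATE app_user SET name = ? WHERE user_id = ?", (name, uid))
            if cur.rowcount == 0:
                cur.execute(insert_sql, (uid, name))
        elif mode == "same_statement_noop":
            # An identical record that is already stored is not landed again.
            cur.execute(
                "SELECT 1 FROM app_user WHERE user_id = ? AND name = ?", (uid, name)
            )
            if cur.fetchone() is None:
                cur.execute(insert_sql, (uid, name))
        elif mode == "update":
            # Delete by the preset specific field, then reinsert.
            cur.execute("DELETE FROM app_user WHERE user_id = ?", (uid,))
            cur.execute(insert_sql, (uid, name))
        else:
            raise ValueError(f"unknown anti-collision mode: {mode!r}")
    conn.commit()


conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE app_user (user_id INTEGER, name TEXT)")
land_rows(conn, [{"user_id": 1, "name": "Ann"}], "update")
land_rows(conn, [{"user_id": 1, "name": "Ann B."}], "update")  # deleted, then reinserted
```

After the two calls, the table holds a single row for `user_id` 1 with the newer name, which mirrors the user_id example in the text.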
The data integration module 103 can process the data in the system database so as to meet user requirements and provide the user with the required data. Specifically, the data integration module 103 is further configured to: receive a data processing request, and identify the project corresponding to the data processing request and the data processing rule to be executed; call the data processing interface corresponding to the data processing rule to be executed, and access the system database according to the unique identifier of the identified project to obtain the data to be processed; and process the data to be processed according to the data processing rule to be executed, and return the processing result through the called data processing interface.
The data processing request is an externally sent request, and may be a request sent by a service application. After receiving the data processing request, the data integration module 103 may identify the project corresponding to the data processing request and the data processing rule to be executed. In the embodiment of the invention, a project can correspond to a specific service requirement, and because the data processing request is sent by the service application, the specific project and the data processing rule to be executed can be determined from the data processing request. Then, the data processing interface corresponding to the data processing rule to be executed is called, the data to be processed is obtained by accessing the system database, the data to be processed is processed according to the data processing rule to be executed, and finally the processing result is returned through the called data processing interface.
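The request flow described above (identify the project and the rule, access the data by the project's unique identifier, apply the rule, return the result) can be sketched as follows. Everything named here is hypothetical: the request keys `project_id` and `rule`, the `RULES` registry, the `total_amount` rule and the dictionary standing in for the system database are assumptions made for illustration.

```python
RULES = {}  # rule name -> callable taking the rows to be processed


def rule(name):
    """Register a data processing rule under a name (hypothetical helper)."""
    def register(func):
        RULES[name] = func
        return func
    return register


@rule("total_amount")
def total_amount(rows):
    """Example rule: sum the amount column of the accessed rows."""
    return sum(row["amount"] for row in rows)


# Stand-in for the system database: project unique identifier -> visible rows.
SYSTEM_DATABASE = {
    "proj-a": [{"amount": 10}, {"amount": 5}],
    "proj-b": [{"amount": 7}],
}


def handle_request(request):
    """Identify the project and rule, fetch the project's data, apply the rule."""
    project_id = request["project_id"]
    rule_name = request["rule"]
    if rule_name not in RULES:
        raise KeyError(f"no data processing rule named {rule_name!r}")
    if project_id not in SYSTEM_DATABASE:
        raise PermissionError(f"project {project_id!r} has no accessible data")
    rows = SYSTEM_DATABASE[project_id]
    return {"project_id": project_id, "rule": rule_name, "result": RULES[rule_name](rows)}
```

Because the lookup is keyed by the project identifier, a request for one project never reads the rows of another project in this sketch.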
To process data as required, the data integration module 103 holds several data processing rules. Thus, in a reference embodiment of the present invention, the data integration module 103 is further configured to: configure at least one data processing rule in a front-end and back-end separation mode, and provide a data processing interface corresponding to the at least one data processing rule; the data processing rule to be executed is one of the at least one data processing rule. The data integration module 103 can adopt a front-end and back-end separation mode: the front end adopts the Vue framework (a lightweight front-end framework) to provide a visual interface on which research and development personnel can debug configuration code, and the back end adopts the Spring Boot framework and parses jar packages in combination with the Groovy dynamic language. The data processing rules configured by the data integration module 103 can therefore support multiple development languages, such as SQL, Java and Python. In addition, the data integration module 103 may load various data source driver packages to establish connections with the system database 104.
In the prior art, in order to process the differing data of different services, a separate application service back end is developed for each service, so that multiple sets of basically identical code are developed and a large amount of system operation and maintenance work of the same nature is required, which makes the labor cost and the code maintenance cost high. Thus, as shown in FIG. 1, the data integration system 100 further includes a system operation and maintenance module 105.
The system operation and maintenance module 105 is configured to: provide full and incremental import and export functions corresponding to at least one project so as to keep the data of the test environment and the production environment consistent; set a log viewing permission corresponding to the at least one project; and monitor the traffic, timeout time and maximum number of returns of the at least one project.
The system operation and maintenance module 105 may provide full and incremental import and export functions for projects. This function keeps production and test data consistent and avoids having to repeat in the production environment the operations already carried out in the test environment. Specifically, the data integration system 100 receives an export requirement, and exports the required data from the test environment in full or incrementally according to the export requirement, so that the exported data can be imported into the production environment; wherein the exported data comprises: project information, role information, and interface information. The system operation and maintenance module 105 may export data from the test environment in full or incrementally according to specific project requirements and import the data into the production environment, thereby achieving data consistency between test and production. Full means importing and exporting the entire project content, and incremental means importing and exporting selected content.
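The difference between a full and an incremental export can be sketched as below. The section names follow the three kinds of exported data listed above, while the JSON payload format and the function names `export_project` and `import_project` are assumptions made for this example.

```python
import copy
import json

# The three kinds of exported data named in the description.
EXPORT_SECTIONS = ("project_info", "role_info", "interface_info")


def export_project(project, sections=None):
    """Export a project from the test environment.

    sections=None performs a full export; a list of section names performs
    an incremental (selective) export.
    """
    chosen = EXPORT_SECTIONS if sections is None else tuple(sections)
    unknown = set(chosen) - set(EXPORT_SECTIONS)
    if unknown:
        raise ValueError(f"unknown sections: {sorted(unknown)}")
    return json.dumps({name: project[name] for name in chosen}, sort_keys=True)


def import_project(target, payload):
    """Import a payload into the production copy.

    Only the sections present in the payload are overwritten; the target
    passed in is left unmodified and a merged copy is returned.
    """
    merged = copy.deepcopy(target)
    merged.update(json.loads(payload))
    return merged
```

For instance, `import_project(prod, export_project(test, ["interface_info"]))` brings only the interface information of production in line with the test environment, whereas omitting the section list synchronizes the whole project.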
In addition, the system operation and maintenance module 105 may set log viewing permissions corresponding to at least one project in combination with the ELK technology. ELK is an abbreviation for three open-source software products, Elasticsearch, Logstash and Kibana, which together form a lightweight log collection and processing tool set. Elasticsearch is an open-source distributed search engine; Logstash is mainly used for collecting, parsing and filtering logs; Kibana is also an open-source, free tool that provides a friendly Web interface for log analysis on top of Logstash and Elasticsearch and helps to aggregate, analyze and search important log data.
The system operation and maintenance module 105 may further monitor the traffic of at least one project in combination with Sentinel (a highly available traffic protection component oriented to distributed service architectures, which takes traffic as its entry point and helps developers ensure the stability of microservices from multiple dimensions such as rate limiting, traffic shaping, circuit breaking and degradation, system load protection, and hotspot protection). The system operation and maintenance module 105 may also control the database connection timeout time and the maximum number of returns.
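To make the idea of per-project traffic limiting and a maximum number of returns concrete, here is a small sketch. It does not use or imitate the Sentinel API; it swaps in a plain fixed-window counter, and the class `ProjectGuard` with its parameters is a hypothetical construct for illustration. Database connection timeouts are left out because they depend on the driver in use.

```python
import time


class ProjectGuard:
    """Fixed-window request limiter per project plus a cap on returned rows."""

    def __init__(self, max_requests, window_seconds, max_rows, clock=time.monotonic):
        self.max_requests = max_requests
        self.window_seconds = window_seconds
        self.max_rows = max_rows
        self._clock = clock      # injectable so the window can be tested
        self._windows = {}       # project id -> (window start, request count)

    def allow(self, project_id):
        """Return True if the project may issue another request in this window."""
        now = self._clock()
        start, count = self._windows.get(project_id, (now, 0))
        if now - start >= self.window_seconds:
            start, count = now, 0  # the previous window has expired
        if count >= self.max_requests:
            self._windows[project_id] = (start, count)
            return False
        self._windows[project_id] = (start, count + 1)
        return True

    def cap(self, rows):
        """Truncate a result set to the maximum number of returned rows."""
        return list(rows)[: self.max_rows]
```

Each project has its own counter, so a burst of requests from one project does not consume the allowance of another.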
In the face of complex requirements of data applications, the data integration system provided by the embodiment of the invention can integrate the data acquisition, data integration and system operation and maintenance processes, a user does not need to develop background services for each application, the labor cost and the code maintenance cost are reduced, and the operation and maintenance workload is simplified.
Fig. 2 is a schematic diagram of main steps of a data processing method based on a data integration system according to an embodiment of the present invention. As shown in fig. 2, the main steps of the data processing method based on the data integration system may include:
step S201, receiving a data acquisition instruction, analyzing the data acquisition instruction, calling a data acquisition interface to acquire data, and landing the acquired data on a system database;
step S202, receiving a data processing request, accessing data in a system database according to the data processing request, and processing the accessed data according to a data processing rule.
The data integration system can be used for data acquisition and data processing. Step S201 describes the data acquisition process, in which the acquired data is landed in the system database; step S202 describes the data processing process, in which the accessed data is processed according to the data processing rule.
FIG. 3 is a schematic diagram of a data acquisition process according to an embodiment of the invention. As shown in fig. 3, the data acquisition process may include:
step S2011, receiving a data acquisition instruction, analyzing the data acquisition instruction, and determining a landing database and an acquisition table structure, wherein the landing database is a database in a system database;
and step S2012, calling a data acquisition interface to acquire data, and storing the acquired data into the landing database according to the acquisition table structure.
In step S2011, analyzing the data acquisition command to determine the structure of the landing database and the acquisition table may include:
(1) analyzing the data acquisition instruction to obtain a target item corresponding to the data acquisition instruction;
(2) selecting at least one authorization database managed by the target project from the system database according to the unique identifier of the target project;
(3) judging whether at least one authorization database has an acquisition table structure;
(4) if yes, determining the database with the acquisition table structure as a landing database;
(5) if not, selecting one database from at least one authorization database as a landing database, and creating an acquisition table structure.
In step (5), if the authorization database managed by the target project does not contain the data table in the form of the collection table structure, the collection table structure needs to be created. Specifically, the creating of the collection table structure may be: configuring basic information of an acquisition table through a target project, wherein the basic information of the acquisition table comprises at least one of the following options: table name, table remark, field name, field remark and field type; and converting the basic information of the acquisition table into a Create statement to Create an acquisition table structure.
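Steps (2) to (5) above can be sketched with in-memory SQLite connections standing in for the databases of the system database. The mapping from project identifiers to authorized database names, the choice of the first authorized database in step (5), and the function names are assumptions of this sketch; the source does not state how the one database is selected.

```python
import sqlite3


def table_exists(conn, table_name):
    """Check whether a table with the given name exists in a SQLite database."""
    row = conn.execute(
        "SELECT 1 FROM sqlite_master WHERE type = 'table' AND name = ?",
        (table_name,),
    ).fetchone()
    return row is not None


def resolve_landing_database(system_databases, authorizations, project_id,
                             table_name, create_sql):
    """Return the name of the landing database for a target project.

    system_databases: database name -> sqlite3 connection
    authorizations:   project unique identifier -> list of authorized names
    """
    # Step (2): select the authorization databases managed by the project.
    authorized = [
        name for name in authorizations.get(project_id, [])
        if name in system_databases
    ]
    if not authorized:
        raise PermissionError(f"project {project_id!r} has no authorization database")
    # Steps (3) and (4): prefer a database that already has the table structure.
    for name in authorized:
        if table_exists(system_databases[name], table_name):
            return name
    # Step (5): otherwise pick one (here simply the first) and create the table.
    chosen = authorized[0]
    system_databases[chosen].execute(create_sql)
    return chosen
```

The `create_sql` argument would be the CREATE statement converted from the basic information of the acquisition table.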
In the data acquisition method, a data acquisition interface is required to be called for acquisition, and an incoming field corresponding to the data acquisition interface comprises: the unique identification corresponding to the data acquisition instruction and the acquired data. The unique identifier corresponding to the data acquisition instruction can be used for tracking the landing state of the data.
Fig. 4 is a schematic diagram of a process of storing acquired data in the landing database according to an embodiment of the present invention. As shown in fig. 4, the process of storing the acquired data in the landing database may include:
step S401, parsing the acquired data to obtain at least one insert statement;
step S402, executing the at least one insert statement according to a preset anti-collision mode, so as to store the acquired data in the landing database.
Wherein the preset anti-collision mode comprises at least one of the following options: identification field identification mode, same statement non-operation mode and updating mode. The preset anti-collision mode has been described in detail above and is not described again here.
Fig. 5 is a schematic diagram of a data processing procedure according to an embodiment of the present invention. As shown in fig. 5, the data processing procedure may include:
step S2021, receiving a data processing request, and identifying the project corresponding to the data processing request and the data processing rule to be executed;
step S2022, calling a data processing interface corresponding to the data processing rule to be executed, and accessing a system database according to the unique identifier of the identified project to obtain data to be processed;
step S2023, processing the data to be processed according to the data processing rule to be executed, and returning the processing result through the called data processing interface.
A project may correspond to a specific service requirement, and because the data processing request is sent by the service application, the specific project and the data processing rule to be executed can be determined from the data processing request. Then, the data processing interface corresponding to the data processing rule to be executed is called, the data to be processed is obtained by accessing the system database, the data to be processed is processed according to the data processing rule to be executed, and finally the processing result is returned through the called data processing interface.
The data processing rule to be executed is one of at least one data processing rule, so the data processing rules need to be configured in advance. Specifically, the following method may be adopted: configuring at least one data processing rule in a front-end and back-end separation mode, and providing a data processing interface corresponding to the at least one data processing rule. In addition, the at least one data processing rule supports multiple development languages, such as SQL, Java and Python.
In the face of complex requirements of data applications, the data integration system provided by the embodiment of the invention can execute the data acquisition method and the data processing method, can integrate the data acquisition and data integration processes, and reduces the labor cost and the code maintenance cost because a user does not need to develop background services for each application.
Fig. 6 shows an exemplary system architecture 600 of a data integration system or data processing method based on a data integration system to which embodiments of the present invention may be applied.
As shown in fig. 6, the system architecture 600 may include terminal devices 601, 602, 603, a network 604, and a server 605. The network 604 serves as a medium providing communication links between the terminal devices 601, 602, 603 and the server 605. The network 604 may include various types of connections, such as wired or wireless communication links, or fiber optic cables.
A user may use the terminal devices 601, 602, 603 to interact with the server 605 via the network 604 to receive or send messages or the like. The terminal devices 601, 602, 603 may be various electronic devices having a display screen and supporting web browsing, including but not limited to smart phones, tablet computers, laptop portable computers, desktop computers, and the like. The server 605 may be a server providing various services.
It should be noted that the data processing method based on the data integration system provided by the embodiment of the present invention is generally executed by the server 605, and accordingly, the data integration system is generally disposed in the server 605.
It should be understood that the number of terminal devices, networks, and servers in fig. 6 is merely illustrative. There may be any number of terminal devices, networks, and servers, as desired for implementation.
Referring now to FIG. 7, shown is a block diagram of a computer system 700 suitable for use with a terminal device implementing an embodiment of the present invention. The terminal device shown in fig. 7 is only an example, and should not bring any limitation to the functions and the scope of use of the embodiments of the present invention.
As shown in fig. 7, the computer system 700 includes a Central Processing Unit (CPU)701, which can perform various appropriate actions and processes in accordance with a program stored in a Read Only Memory (ROM)702 or a program loaded from a storage section 708 into a Random Access Memory (RAM) 703. In the RAM 703, various programs and data necessary for the operation of the system 700 are also stored. The CPU 701, the ROM 702, and the RAM 703 are connected to each other via a bus 704. An input/output (I/O) interface 705 is also connected to bus 704.
The following components are connected to the I/O interface 705: an input portion 706 including a keyboard, a mouse, and the like; an output section 707 including a display such as a Cathode Ray Tube (CRT), a Liquid Crystal Display (LCD), and the like, and a speaker; a storage section 708 including a hard disk and the like; and a communication section 709 including a network interface card such as a LAN card, a modem, or the like. The communication section 709 performs communication processing via a network such as the internet. A drive 710 is also connected to the I/O interface 705 as needed. A removable medium 711 such as a magnetic disk, an optical disk, a magneto-optical disk, a semiconductor memory, or the like is mounted on the drive 710 as necessary, so that a computer program read out therefrom is mounted into the storage section 708 as necessary.
In particular, according to the embodiments of the present disclosure, the processes described above with reference to the flowcharts may be implemented as computer software programs. For example, embodiments of the present disclosure include a computer program product comprising a computer program embodied on a computer readable medium, the computer program comprising program code for performing the method illustrated in the flow chart. In such an embodiment, the computer program can be downloaded and installed from a network through the communication section 709, and/or installed from the removable medium 711. The computer program performs the above-described functions defined in the system of the present invention when executed by the Central Processing Unit (CPU) 701.
It should be noted that the computer readable medium shown in the present invention can be a computer readable signal medium or a computer readable storage medium or any combination of the two. A computer readable storage medium may be, for example, but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any combination of the foregoing. More specific examples of the computer readable storage medium may include, but are not limited to: an electrical connection having one or more wires, a portable computer diskette, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing. In the present invention, a computer readable storage medium may be any tangible medium that can contain, or store a program for use by or in connection with an instruction execution system, apparatus, or device. In the present invention, however, a computer readable signal medium may include a propagated data signal with computer readable program code embodied therein, for example, in baseband or as part of a carrier wave. Such a propagated data signal may take many forms, including, but not limited to, electro-magnetic, optical, or any suitable combination thereof. A computer readable signal medium may also be any computer readable medium that is not a computer readable storage medium and that can communicate, propagate, or transport a program for use by or in connection with an instruction execution system, apparatus, or device. Program code embodied on a computer readable medium may be transmitted using any appropriate medium, including but not limited to: wireless, wire, fiber optic cable, RF, etc., or any suitable combination of the foregoing.
The flowchart and block diagrams in the figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods and computer program products according to various embodiments of the present invention. In this regard, each block in the flowchart or block diagrams may represent a module, segment, or portion of code, which comprises one or more executable instructions for implementing the specified logical function(s). It should also be noted that, in some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. It will also be noted that each block of the block diagrams or flowchart illustration, and combinations of blocks in the block diagrams or flowchart illustration, can be implemented by special purpose hardware-based systems which perform the specified functions or acts, or combinations of special purpose hardware and computer instructions.
The modules described in the embodiments of the present invention may be implemented by software or hardware. The described modules may also be provided in a processor, which may be described as: a processor includes a resource isolation module, a data acquisition module, a data integration module, and a system database. In some cases the names of the modules do not limit the modules themselves; for example, the resource isolation module may also be described as a module for setting the permissions corresponding to the system and to at least one project contained in the system so as to realize resource isolation.
As another aspect, the present invention also provides a computer-readable medium that may be contained in the apparatus described in the above embodiments, or may exist separately without being incorporated into the apparatus. The computer-readable medium carries one or more programs which, when executed by a device, cause the device to perform: receiving a data acquisition instruction, parsing the data acquisition instruction, calling a data acquisition interface to acquire data, and landing the acquired data in a system database; and receiving a data processing request, accessing data in the system database according to the data processing request, and processing the accessed data according to the data processing rule.
According to the data integration system provided by the embodiment of the invention, a data acquisition module with a data acquisition function is provided, the acquired data is landed in the system database, and the data in the system database is processed by the data integration module, so that user requirements can be met and the required data can be provided to the user. In addition, the resource isolation module of the data integration system can set the permissions corresponding to the system and to at least one project contained in the system, thereby realizing resource isolation at the system level and the project level. In the face of complex requirements of data applications, the data integration system provided by the embodiment of the invention can integrate the data acquisition, data integration and system operation and maintenance processes, so that a user does not need to develop back-end services for each application, the labor cost and the code maintenance cost are reduced, and the operation and maintenance workload is simplified.
The above-described embodiments should not be construed as limiting the scope of the invention. Those skilled in the art will appreciate that various modifications, combinations, sub-combinations, and substitutions can occur, depending on design requirements and other factors. Any modification, equivalent replacement, and improvement made within the spirit and principle of the present invention should be included in the protection scope of the present invention.

Claims (21)

1. A data integration system, comprising: a resource isolation module, a data acquisition module, a data integration module and a system database; wherein
the resource isolation module is configured to: setting permissions corresponding to the system and to at least one project contained in the system to realize resource isolation;
the data acquisition module is used for: calling a data acquisition interface to acquire data and landing the acquired data to the system database;
the data integration module is used for: and accessing the data in the system database, and processing the accessed data according to the data processing rule.
2. The system of claim 1, wherein the resource isolation module is further configured to:
managing system roles, wherein the system roles include: a super administrator, a project creator of the at least one project, and a project user of the at least one project.
3. The system of claim 2, wherein the resource isolation module is further configured to:
setting the super administrator to have the rights to manage the system menu, configure a traffic threshold, configure a timeout threshold, and view a project log of the at least one project;
setting the project creator of the at least one project to have the authority corresponding to the at least one project, and controlling the authority of the project user of the at least one project to the project menu corresponding to the at least one project through the project creator of the at least one project.
4. The system of claim 1, wherein the resource isolation module is further configured to:
performing resource isolation on the system database through the unique identifier of the at least one project to determine an authorization database corresponding to the at least one project; wherein
the system database includes: a common data area and an external data source.
5. The system of claim 1, wherein the data acquisition module is further configured to:
receiving a data acquisition instruction, and determining a landing database and an acquisition table structure according to the data acquisition instruction, wherein the landing database is a database in the system database;
and calling the data acquisition interface to acquire data, and storing the acquired data into the landing database according to the acquisition table structure.
6. The system of claim 5, wherein the data acquisition module is further configured to:
analyzing the data acquisition instruction to obtain a target project corresponding to the data acquisition instruction;
selecting at least one authorization database managed by the target project from the system databases according to the unique identifier of the target project;
judging whether the acquisition table structure exists in the at least one authorization database;
if so, determining the database with the acquisition table structure as the landing database;
if not, selecting one database from the at least one authorization database as the landing database, and creating the acquisition table structure.
7. The system of claim 6, wherein the data acquisition module is further configured to:
configuring acquisition table basic information through the target project, wherein the acquisition table basic information comprises at least one of the following options: table name, table remark, field name, field remark and field type;
and converting the acquisition table basic information into a CREATE statement to create the acquisition table structure.
8. The system according to any one of claims 5-7, wherein the incoming field corresponding to the data acquisition interface comprises: the unique identification corresponding to the data acquisition instruction and the acquired data.
9. The system of claim 8, wherein the data acquisition module is further configured to:
analyzing the acquired data to obtain at least one insert statement;
executing the at least one insert statement according to a preset anti-collision mode so as to store the acquired data into the landing database; wherein
the preset anti-collision mode comprises at least one of the following options: identification field identification mode, same statement non-operation mode and updating mode.
10. The system of claim 1, wherein the data integration module is further configured to:
receiving the data processing request, and identifying the project corresponding to the data processing request and the data processing rule to be executed;
calling a data processing interface corresponding to the data processing rule to be executed, and accessing the system database according to the unique identifier of the identified project to obtain data to be processed;
and processing the data to be processed according to the data processing rule to be executed, and returning a processing result through the called data processing interface.
11. The system of claim 10, wherein the data integration module is further configured to:
configuring at least one data processing rule in a front-end and back-end separation mode, and providing a data processing interface corresponding to the at least one data processing rule; wherein
the data processing rule to be executed is a rule of the at least one data processing rule.
12. The system of claim 11, wherein the at least one data processing rule supports a plurality of development languages; and
the data integration module loads a plurality of data source driver packages so as to establish connections with the system database.
13. The system of claim 1, further comprising a system operation and maintenance module configured to:
providing full and incremental import and export functions corresponding to the at least one project so as to realize data consistency of the test environment and the production environment;
setting a log viewing permission corresponding to the at least one project;
monitoring the traffic, the timeout time, and the maximum number of returns of the at least one project.
14. The system of claim 13, wherein the system operation and maintenance module is further configured to:
receiving an export requirement, and exporting the required data from a test environment in full or incrementally according to the export requirement, so as to import the exported data into a production environment; wherein
the exported data includes: project information, role information, and interface information.
15. A data processing method based on a data integration system is characterized by comprising the following steps:
receiving a data acquisition instruction, analyzing the data acquisition instruction, calling a data acquisition interface to acquire data, and landing the acquired data to a system database;
and receiving a data processing request, accessing data in the system database according to the data processing request, and processing the accessed data according to a data processing rule.
16. The method of claim 15, wherein receiving the data acquisition command, parsing the data acquisition command, invoking a data acquisition interface for data acquisition, and landing the acquired data on a system database comprises:
receiving the data acquisition instruction, analyzing the data acquisition instruction, and determining a landing database and an acquisition table structure, wherein the landing database is a database in the system database;
and calling the data acquisition interface to acquire data, and storing the acquired data into the landing database according to the acquisition table structure.
17. The method of claim 16, wherein parsing the data acquisition instructions to determine a landing database and acquisition table structure comprises:
analyzing the data acquisition instruction to obtain a target project corresponding to the data acquisition instruction;
selecting at least one authorization database managed by the target project from the system databases according to the unique identifier of the target project;
judging whether the acquisition table structure exists in the at least one authorization database;
if so, determining the database with the acquisition table structure as the landing database;
if not, selecting one database from the at least one authorization database as the landing database, and creating the acquisition table structure.
18. The method of claim 15, wherein receiving a data processing request, accessing data in the system database based on the data processing request, and processing the accessed data according to data processing rules comprises:
receiving the data processing request, and identifying a project corresponding to the data processing request and a data processing rule to be executed;
calling a data processing interface corresponding to the data processing rule to be executed, and accessing the system database according to the unique identifier of the identified project to obtain data to be processed;
and processing the data to be processed according to the data processing rule to be executed, and returning a processing result through the called data processing interface.
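The request-handling flow of claim 18 can be sketched as follows. This is a hedged illustration only: the `SYSTEM_DB` dictionary, the `RULES` registry, the rule names, and the request fields (`project_id`, `rule`) are hypothetical and do not come from the patent text.

```python
from typing import Callable, Dict, List

# Hypothetical system database, keyed by the project's unique identifier.
SYSTEM_DB: Dict[str, List[dict]] = {
    "proj-001": [{"amount": 10}, {"amount": 32}],
}

# Hypothetical registry: each configured rule is exposed under its own name,
# standing in for the per-rule data processing interface.
RULES: Dict[str, Callable[[List[dict]], object]] = {
    "sum_amount": lambda rows: sum(r["amount"] for r in rows),
    "count": lambda rows: len(rows),
}


def handle_processing_request(request: dict) -> dict:
    """Identify the project and rule, fetch the project's data, apply the rule."""
    project_id = request["project_id"]
    rule_name = request["rule"]
    rule = RULES[rule_name]  # raises KeyError for a rule that is not configured
    rows = SYSTEM_DB.get(project_id, [])  # data to be processed for this project
    return {"project_id": project_id, "rule": rule_name, "result": rule(rows)}
```

Because data is looked up by project identifier, a request can only reach the data of the project it names; a real system would also need to authenticate the caller, which is outside this sketch.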
19. The method of claim 18, further comprising:
configuring at least one data processing rule in a front-end and back-end separation mode, and providing a data processing interface corresponding to the at least one data processing rule; wherein
the data processing rule to be executed is a rule of the at least one data processing rule.
20. An electronic device, comprising:
one or more processors;
a storage device storing one or more programs which,
when executed by the one or more processors, cause the one or more processors to implement the method of any one of claims 15-19.
21. A computer-readable medium on which a computer program is stored, wherein the program, when executed by a processor, implements the method of any one of claims 15-19.
CN202011003208.0A 2020-09-22 2020-09-22 Data integration system and data processing method based on data integration system Pending CN112182080A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011003208.0A CN112182080A (en) 2020-09-22 2020-09-22 Data integration system and data processing method based on data integration system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011003208.0A CN112182080A (en) 2020-09-22 2020-09-22 Data integration system and data processing method based on data integration system

Publications (1)

Publication Number Publication Date
CN112182080A true CN112182080A (en) 2021-01-05

Family

ID=73955766

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011003208.0A Pending CN112182080A (en) 2020-09-22 2020-09-22 Data integration system and data processing method based on data integration system

Country Status (1)

Country Link
CN (1) CN112182080A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113157659A (en) * 2021-05-14 2021-07-23 中国建设银行股份有限公司 Log processing method and device

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050144025A1 (en) * 2003-12-29 2005-06-30 Veritas Operating Corporation Using technical performance metrics for business and usage analysis and cost allocation
CN102054025A (en) * 2010-12-01 2011-05-11 北京中交通信科技有限公司 Traffic information resource integration processing method and system
CN105761010A (en) * 2016-02-24 2016-07-13 国网山东省电力公司 Method and system for real-time monitoring of group enterprise audit based on real-time data acquisition
CN106982231A (en) * 2017-05-12 2017-07-25 王振辉 A kind of inside threat real-time detection method based on Agent
CN110135682A (en) * 2019-04-03 2019-08-16 中交一公局第七工程有限公司 Danger large construction project monitoring management method based on BIM technology
CN110275890A (en) * 2019-06-27 2019-09-24 中国石油集团东方地球物理勘探有限责任公司 A kind of data query method and system

Similar Documents

Publication Publication Date Title
CN109992454B (en) Method, device and storage medium for fault location
CN111190888A (en) Method and device for managing graph database cluster
CN111125042A (en) Method and device for determining risk operation event
CN110895534A (en) Data splicing method, device, medium and electronic equipment
CN112861529A (en) Method and device for managing error codes
CN116244387A (en) Entity relationship construction method, device, electronic equipment and storage medium
CN112445691B (en) Non-invasive intelligent contract performance detection method and device
CN113836237A (en) Method and device for auditing data operation of database
CN113282591A (en) Authority filtering method and device, computer equipment and storage medium
CN112182080A (en) Data integration system and data processing method based on data integration system
CN116450622B (en) Method, apparatus, device and computer readable medium for data warehouse entry
CN112559024A (en) Method and device for generating transaction code change list
CN112580065A (en) Data query method and device
CN113360182B (en) Method and apparatus for system performance diagnostics
CN116204428A (en) Test case generation method and device
CN115033574A (en) Information generation method, information generation device, electronic device, and storage medium
CN114281586A (en) Fault determination method and device, electronic equipment and computer readable storage medium
CN113515306A (en) System transplanting method and device
CN113010174A (en) Service monitoring method and device
CN112965747B (en) Method, apparatus, device and computer readable medium for mining code loopholes
CN114266547A (en) Method, device, equipment, medium and program product for identifying business processing strategy
CN112784247A (en) Authority verification method and device for application program
CN115955327A (en) Interception authentication method and device based on document system
CN117112606A (en) Data query method, device, electronic equipment and computer readable medium
CN117149568A (en) Index collection method, device, electronic equipment and computer readable medium

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination