CN106921755B - Enterprise data integration cloud console, implementation method and system - Google Patents

Enterprise data integration cloud console, implementation method and system Download PDF

Info

Publication number
CN106921755B
CN106921755B CN201710339811.8A CN201710339811A CN106921755B CN 106921755 B CN106921755 B CN 106921755B CN 201710339811 A CN201710339811 A CN 201710339811A CN 106921755 B CN106921755 B CN 106921755B
Authority
CN
China
Prior art keywords
server
servers
data
cloud console
cluster
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201710339811.8A
Other languages
Chinese (zh)
Other versions
CN106921755A (en
Inventor
林木
高冉
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Inspur Software Co Ltd
Original Assignee
Inspur Software Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Inspur Software Co Ltd filed Critical Inspur Software Co Ltd
Priority to CN201710339811.8A priority Critical patent/CN106921755B/en
Publication of CN106921755A publication Critical patent/CN106921755A/en
Application granted granted Critical
Publication of CN106921755B publication Critical patent/CN106921755B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/10Protocols in which an application is distributed across nodes in the network
    • H04L67/1001Protocols in which an application is distributed across nodes in the network for accessing one among a plurality of replicated servers
    • H04L67/1004Server selection for load balancing
    • H04L67/1008Server selection for load balancing based on parameters of servers, e.g. available memory or workload
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/02Protocols based on web technology, e.g. hypertext transfer protocol [HTTP]
    • H04L67/025Protocols based on web technology, e.g. hypertext transfer protocol [HTTP] for remote control or remote monitoring of applications
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/10Protocols in which an application is distributed across nodes in the network

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Computer Hardware Design (AREA)
  • General Engineering & Computer Science (AREA)
  • Debugging And Monitoring (AREA)
  • Computer And Data Communications (AREA)

Abstract

The invention discloses an enterprise data integration cloud console, an implementation method and a system, which comprise a cloud console and a server cluster, wherein the cloud console adopts the cloud console structure, the server cluster consists of a master server and a plurality of slave servers, each server in the server cluster is provided with an open source tool button, the cloud console controls all operation tasks to be distributed to the slave servers through the master server, and after all the servers finish the operation tasks, the operation information of relevant servers is displayed in the master server. Compared with the prior art, the enterprise data integration cloud control console, the implementation method and the system overcome the processing bottleneck of a single server, monitor the operation condition of the server in real time, discover and solve problems in advance, greatly reduce the learning cost of operation and maintenance developers, can quickly and effectively complete data extraction, cleaning and storage work, provide the accuracy of integral data, and provide convenience for large data analysis.

Description

Enterprise data integration cloud console, implementation method and system
Technical Field
The invention relates to the technical field of server clusters, in particular to an enterprise data integration cloud console, an implementation method and an implementation system.
Background
In terms of enterprise informatization, information system construction is usually characterized by stage and distribution, which results in the existence of an information island phenomenon. The information isolated island refers to the situation that data information among different software, especially among different departments cannot be shared, so that a large amount of redundant data and junk data exist in a system, the consistency of the data cannot be guaranteed, and the whole process of enterprise informatization construction is seriously hindered. To solve this problem, people are beginning to pay attention to enterprise data integration research. Data in different systems are attempted to be reprocessed, so that an integrated analysis-oriented environment is formed, and rules can be mined from massive information, knowledge can be extracted, and decision making can be assisted. The appearance of the button can provide corresponding help undoubtedly, but a processing bottleneck exists in comparison with a single server with mass data, and the interface is extremely unfriendly and cannot be remotely controlled, so that the learning cost of operation and maintenance developers is increased, the accuracy of data is reduced, and the overall IT cost is increased.
Based on the method, the invention provides an enterprise data integration cloud console, an implementation method and an implementation system. The cloud console technology provided by the invention can dynamically expand and manage the server cluster in a remote console, and horizontally expand the operation in the server cluster, so that the processing bottleneck of a single server is overcome, the operation condition of the server is monitored in real time, the problems are found and solved in advance, the learning cost of operation and maintenance developers is greatly reduced, the data extraction, cleaning and storage work can be rapidly and effectively completed, the accuracy of the whole data is provided, and the convenience is provided for the analysis of big data.
Disclosure of Invention
The technical task of the invention is to provide an enterprise data integration cloud console, an implementation method and an implementation system aiming at the defects.
An enterprise data integration cloud console structurally comprises the following five modules: the operation expansion, operation display, operation start and stop, operation monitoring and operation statistics are used for realizing remote control of the server cluster and completing operation tasks, wherein,
the operation expansion module is used for dynamically expanding the configuration server cluster and appointing one operation to different servers to complete the operation;
the operation showing module is used for showing all operation information operated by the current server, including operation description creation time, an operation flow chart and operation execution log information;
the operation starting and stopping module is used for remotely controlling the starting and stopping of the operation task;
the operation monitoring module is used for acquiring and displaying the log of the operation task in real time;
and the operation counting module is used for counting the completion condition of the whole operation task.
The cloud console monitors the servers completing the operation tasks in real time, the monitored information comprises network state, CPU utilization rate, process number and disk space information, and early warning notification is carried out in a mode of short messages and mails.
When the operation start-stop module remotely controls the start-stop of the operation task, the relevant information of data acquisition, cleaning and storage in the server bearing the acquisition work is accurately acquired in real time.
An enterprise data integration cloud console implementation method is based on a cloud console and a server cluster, and the implementation process is as follows:
the cloud control console establishes a data acquisition special cluster in the server cluster, wherein the data acquisition special cluster comprises a main server and a plurality of slave servers;
installing an open source tool button for each server of the data acquisition special cluster, and sequentially starting all the servers;
the cloud control console controls all the operation tasks to be distributed into the slave servers through the master server, and after all the servers complete the operation tasks, the operation information of the relevant servers is displayed in the master server.
The cloud console monitors the server bearing the acquisition work in real time, the monitored information comprises network state, CPU utilization rate, process number and disk space information, and early warning notification is carried out in a mode of short messages and mails.
The cloud control console can remotely control the start and stop of the operation task, and accurately acquires relevant information of data acquisition, cleaning and storage in a server bearing acquisition work in real time.
The cloud console is provided with an operation expansion module, the operation expansion module is used for dynamically expanding and configuring a special data acquisition cluster, assigning an operation task to different servers to be completed cooperatively, the master server controls all the slave servers to complete data acquisition cooperatively, and obtains the network state, the CPU utilization rate, the process number and the disk space information of all the servers in real time and gives an early warning in time.
After the job task is completed, job information can be displayed in the cloud console, the job information comprises job description creation time, a job flow chart and job execution log information, and all job information is acquired.
The cloud control platform adopts the cloud control platform structure, the server cluster is composed of a master server and a plurality of slave servers, each server in the server cluster is provided with an open source tool button, the cloud control platform controls all operation tasks to be distributed into the slave servers through the master server, and after all the servers complete the operation tasks, the operation information of the relevant servers is displayed in the master server.
All servers in the server cluster cooperatively complete a certain job task, and a main server acquires the running information of related servers; the cloud console remote control main server extracts data dispersed to heterogeneous data sources of different service systems to a temporary middle layer, then carries out cleaning, conversion and integration, and finally loads the data to a data warehouse or a data mart, wherein the data comprises relationship data and a plane data file.
Compared with the prior art, the enterprise data integration cloud console, the implementation method and the system have the following beneficial effects:
according to the enterprise data integration cloud console, the implementation method and the system, the operation of the button is remotely controlled through a cloud control technology, firstly, the bottleneck of single server processing is overcome, the problem of concurrency of a large number of jobs can be solved in a cluster mode, the complex jobs can be rapidly and effectively carried out, in addition, the robustness of the whole job processing process is increased, and the whole stable operation is not influenced by the downtime of a single server; and secondly, the operation distribution of the graphical interface showing relevant information and remote control operation reduces the learning cost of operation and maintenance developers, the keys can be controlled quickly and effectively, the stability and reliability of whole data extraction are ensured, the practicability is high, and the application range is wide.
Drawings
FIG. 1 is a schematic diagram of the implementation of the method of the present invention.
Detailed Description
The invention is further described with reference to the following figures and specific embodiments.
The enterprise data integration mainly refers to a process of performing re-concentration and re-unified management on business data of an enterprise-distributed information system, and the process comprises data acquisition, cleaning, processing, storage and the like.
The Cloud Control Platform (CCP) is a remote Control Platform with elastically scalable processing capability, and the management mode of the remote Control Platform is simpler and more efficient than that of a physical server based on the Platform for processing data integration work.
The invention aims to remotely control data extraction, cleaning and storage work by using a cloud console technology, overcome the processing bottleneck of a single server, reduce the difficulty of system development and operation and maintenance, reduce the generation of error data and provide more accurate reference for enterprise operation decision.
As shown in fig. 1, an enterprise data integration cloud console structurally includes the following five modules: the operation expansion, operation display, operation start and stop, operation monitoring and operation statistics are used for realizing remote control of the server cluster and completing operation tasks, wherein,
and (3) operation expansion:
the cloud control console can dynamically expand a configuration server cluster, one operation is assigned to different servers for carrying out, the master server controls all the slave servers to cooperatively complete data acquisition work, the processing bottleneck of a single server is overcome, information such as the network state, the CPU utilization rate, the process number and the disk space of the server is acquired in real time, and early warning can be timely carried out.
And (3) operation exhibition:
and displaying all the job information operated by the current server, including job description creation time, job flow chart, job execution log information and the like, and acquiring all the job information quickly.
Operation start and stop:
the start and stop of the operation can be remotely controlled, and the operation can be effective in real time by one key.
Operation monitoring:
the logs of all the operations can be acquired and displayed in real time, and the operation and maintenance personnel can be assisted to troubleshoot problems quickly.
Operation statistics:
through the whole operation statistics, the circulation of different theme data can be shown on a macroscopic level, and the whole situation of all operations can be integrally mastered.
When the operation start-stop module remotely controls the start-stop of the operation task, the relevant information of data acquisition, cleaning and storage in the server bearing the acquisition work is accurately acquired in real time.
An enterprise data integration cloud console implementation method adopts a cloud console technology and can rapidly establish a data acquisition server cluster. The acquisition work can be horizontally expanded, and the basic operation of the acquisition work is supported: starting, stopping, modifying, and replacing servers. The whole work can be simultaneously operated on a plurality of servers, the main server controls all the work to be equally distributed to different servers, and the work is completed in a coordinated mode, so that the purpose of load balancing is achieved. In addition, the server carrying the acquisition work monitors information such as network state, CPU utilization rate, process number, disk space and the like in real time, and can give early warning notification in time, including short messages, mails and the like. The data acquisition and processing tool is remotely controlled, the related interfaces of the data acquisition tool are called, and the purpose of remotely acquiring the information of the running condition of related operation in real time is achieved. By adopting the cloud console technology, a more stable and safer application system can be constructed, the difficulty of system development and operation and maintenance is reduced, the service innovation is concentrated, and meanwhile, the overall IT cost is greatly reduced.
The method is based on a cloud console and a server cluster, and the implementation process is as follows:
the cloud control console establishes a data acquisition special cluster in the server cluster, wherein the data acquisition special cluster comprises a main server and a plurality of slave servers;
installing an open source tool button for each server of the data acquisition special cluster, and sequentially starting all the servers;
the cloud control console controls all the operation tasks to be distributed into the slave servers through the master server, and after all the servers complete the operation tasks, the operation information of the relevant servers is displayed in the master server.
The cloud console monitors the server bearing the acquisition work in real time, the monitored information comprises network state, CPU utilization rate, process number and disk space information, and early warning notification is carried out in a mode of short messages and mails.
The cloud control console can remotely control the start and stop of the operation task, and accurately acquires relevant information of data acquisition, cleaning and storage in a server bearing acquisition work in real time.
The cloud console is provided with an operation expansion module, the operation expansion module is used for dynamically expanding and configuring a special data acquisition cluster, assigning an operation task to different servers to be completed cooperatively, the master server controls all the slave servers to complete data acquisition cooperatively, and obtains the network state, the CPU utilization rate, the process number and the disk space information of all the servers in real time and gives an early warning in time.
After the job task is completed, job information can be displayed in the cloud console, the job information comprises job description creation time, a job flow chart and job execution log information, and all job information is acquired.
The cloud control platform adopts the cloud control platform structure, the server cluster is composed of a master server and a plurality of slave servers, each server in the server cluster is provided with an open source tool button, the cloud control platform controls all operation tasks to be distributed into the slave servers through the master server, and after all the servers complete the operation tasks, the operation information of the relevant servers is displayed in the master server.
All servers in the server cluster cooperatively complete a certain job task, and a main server acquires the running information of related servers; the cloud control console remotely manages a plurality of server clusters, remotely controls a data acquisition tool button to realize horizontal expansion of work in the server clusters, each server cluster consists of a main server and a plurality of slave servers, and the main server is used as a controller of one cluster. And the servers cooperatively complete a certain job task and acquire the running information of the related servers. Data such as relationship data, plane data files and the like which are dispersed to different business systems and in heterogeneous data sources are remotely, quickly and effectively extracted to a temporary middle layer, then cleaning, converting and integrating are carried out, and finally the data are loaded into a data warehouse or a data mart, so that the whole process is real-time and clearly visible.
The cloud console technology mainly adopts an open source tool (Kettle), clusters are deployed on a plurality of servers, one of the servers is a Master, the other servers are Slave, a specific Master server and a Slave server are designated through a configuration file, all the servers are sequentially started, the clusters are set in a graphical interface, the Master server and the Slave server can communicate with each other through an Http open protocol, operation level expansion is achieved, operation is evenly distributed to Slave servers through a Master server, and a plurality of servers cooperatively complete operation tasks. The detailed information of each running server, including information such as network state, CPU utilization rate, process number and disk space, can be acquired through the SlaveServer interface. The operation start and stop are remotely controlled through interfaces such as SlaveServer, TransPainter and KettleDatabaseReposology, and relevant information of data acquisition, cleaning and storage is accurately acquired in real time.
In the invention, firstly, a graphical interface dynamically configures a button server cluster, thereby realizing the horizontal expansion of the operation and solving the problem of slow jamming of the running of complex operation. And secondly, the graphical interface remotely controls the operation of the data acquisition tool button, and the operation information is acquired and displayed in real time, so that the learning cost of operation and maintenance developers is greatly reduced, and convenience is brought to enterprise data integration.
The present invention can be easily implemented by those skilled in the art from the above detailed description. It should be understood, however, that the intention is not to limit the invention to the particular embodiments described. On the basis of the disclosed embodiments, a person skilled in the art can combine different technical features at will, thereby implementing different technical solutions.
In addition to the technical features described in the specification, the technology is known to those skilled in the art.

Claims (4)

1. An enterprise data integration cloud console implementation method is based on an enterprise data integration cloud console, and comprises the following modules: the system comprises operation extension, operation display, operation monitoring and operation statistics, is used for realizing remote control of a server cluster and completing operation tasks, and is characterized in that the realization process comprises the following steps:
the method comprises the following steps that firstly, a cloud control console builds a special data acquisition cluster in a server cluster, wherein the special data acquisition cluster comprises a main server and a plurality of slave servers;
step two, installing an open source tool button for each server of the special data acquisition cluster, and starting all the servers in sequence;
thirdly, the cloud console controls all the operation tasks to be distributed to the slave servers through the master server, the operation expansion module dynamically expands the configuration data acquisition special cluster, one operation task is assigned to different servers to be completed cooperatively, the master server controls all the slave servers to complete data acquisition cooperatively, the network state, the CPU utilization rate, the process number and the disk space information of all the servers are obtained in real time, and early warning is carried out in time;
the operation monitoring module acquires logs of operation tasks in real time, monitors a server bearing acquisition work in real time, and carries out early warning notification in a mode of short messages and mails, wherein the monitored information comprises network state, CPU utilization rate, process number and disk space information;
the operation showing module shows all operation information operated by the current server, wherein the operation information comprises operation description creation time, an operation flow chart and operation execution log information;
and the operation counting module counts the completion condition of the whole operation task, and displays the running information of the relevant server in the main server after all the servers complete the operation task.
2. The method for implementing the enterprise data integration cloud console according to claim 1, wherein an operation start/stop module is further configured in the cloud console, and is used for remotely controlling start/stop of the operation task, and when the operation start/stop module remotely controls start/stop of the operation task, relevant information of data acquisition, cleaning and storage in a server carrying acquisition work is accurately acquired in real time.
3. An enterprise data integration cloud console system is characterized by comprising a cloud console and a server cluster, wherein the cloud console adopts the cloud console structure of claim 1 or 2, the server cluster is composed of a master server and a plurality of slave servers, each server in the server cluster is provided with an open source tool button, the cloud console controls all operation tasks to be distributed into the slave servers through the master server, and after all the servers complete the operation tasks, operation information of relevant servers is displayed in the master server.
4. The enterprise data integration cloud console system of claim 3, wherein each server in the server cluster cooperatively completes a certain job task, and the main server obtains the operation information of the relevant server; the cloud console remote control main server extracts data dispersed to heterogeneous data sources of different service systems to a temporary middle layer, then carries out cleaning, conversion and integration, and finally loads the data to a data warehouse or a data mart, wherein the data comprises relationship data and a plane data file.
CN201710339811.8A 2017-05-15 2017-05-15 Enterprise data integration cloud console, implementation method and system Active CN106921755B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710339811.8A CN106921755B (en) 2017-05-15 2017-05-15 Enterprise data integration cloud console, implementation method and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710339811.8A CN106921755B (en) 2017-05-15 2017-05-15 Enterprise data integration cloud console, implementation method and system

Publications (2)

Publication Number Publication Date
CN106921755A CN106921755A (en) 2017-07-04
CN106921755B true CN106921755B (en) 2020-04-28

Family

ID=59567773

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710339811.8A Active CN106921755B (en) 2017-05-15 2017-05-15 Enterprise data integration cloud console, implementation method and system

Country Status (1)

Country Link
CN (1) CN106921755B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111435344B (en) * 2019-01-15 2023-03-21 中国石油集团川庆钻探工程有限公司长庆钻井总公司 Big data-based drilling acceleration influence factor analysis model

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104391989A (en) * 2014-12-16 2015-03-04 浪潮电子信息产业股份有限公司 Distributed ETL (extract transform load) all-in-one machine system
CN104573071A (en) * 2015-01-26 2015-04-29 湖南大学 Intelligent school situation analysis system and method based on megadata technology
CN106549829A (en) * 2016-10-28 2017-03-29 北方工业大学 Big data calculating platform monitoring system and method

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20120310875A1 (en) * 2011-06-03 2012-12-06 Prashanth Prahlad Method and system of generating a data lineage repository with lineage visibility, snapshot comparison and version control in a cloud-computing platform

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104391989A (en) * 2014-12-16 2015-03-04 浪潮电子信息产业股份有限公司 Distributed ETL (extract transform load) all-in-one machine system
CN104573071A (en) * 2015-01-26 2015-04-29 湖南大学 Intelligent school situation analysis system and method based on megadata technology
CN106549829A (en) * 2016-10-28 2017-03-29 北方工业大学 Big data calculating platform monitoring system and method

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
基于Hadoop的非结构化文本数据ETL系统设计与实现;习云峰;《中国优秀硕士学位论文全文数据库》;20170331;第1章-第4章 *

Also Published As

Publication number Publication date
CN106921755A (en) 2017-07-04

Similar Documents

Publication Publication Date Title
CN107294772B (en) Dynamic management monitoring service system combined with Docker
CN110794800B (en) Intelligent factory information management monitoring system
CN112600891B (en) Information physical fusion-based edge cloud cooperative system and working method
CN113569987A (en) Model training method and device
CN111459763B (en) Cross-kubernetes cluster monitoring system and method
TWI644534B (en) Cloud platform monitoring method and cloud platform monitoring system
CN108009258B (en) Data acquisition and analysis platform capable of being configured online
CN107992392B (en) Automatic monitoring and repairing system and method for cloud rendering system
CN111865682B (en) Method and device for handling faults
CN103324715B (en) Disaster recovery backup system availability detection method and device
CN109656690A (en) Scheduling system, method and storage medium
CN108647886B (en) Scientific computing process management system
CN114143220B (en) Real-time data visualization platform
CN105162632A (en) Automatic processing system for server cluster failures
US20140136668A1 (en) Real-time self-optimizing system for multitenant based cloud infrastructure
CN102857371A (en) Dynamic allocation management method for cluster system
CN113176948A (en) Edge gateway, edge computing system and configuration method thereof
CN110619014A (en) ETL-based data extraction method
CN106921755B (en) Enterprise data integration cloud console, implementation method and system
CN114691050B (en) Cloud native storage method, device, equipment and medium based on kubernets
CN107153679B (en) Extraction statistical method and system for semi-structured big data
CN104166584A (en) Server virtualization cluster double-layer redundant architecture and construction method thereof
CN106155859B (en) monitoring management system, information processing method and high-density server
CN115826955A (en) Application publishing method, system, electronic device and storage medium
CN112055086B (en) IIS site, release and management method of Windows service, operation and maintenance system and platform

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant