CN108322548B

CN108322548B - Industrial process data analysis platform based on cloud computing

Info

Publication number: CN108322548B
Application number: CN201810185964.6A
Authority: CN
Inventors: 葛志强; 张鑫宇
Original assignee: Zhejiang University ZJU
Current assignee: Zhejiang University ZJU
Priority date: 2018-03-07
Filing date: 2018-03-07
Publication date: 2020-06-05
Anticipated expiration: 2038-03-07
Also published as: CN108322548A

Abstract

The invention discloses an industrial process data analysis platform based on cloud computing, which comprises a cloud service end, an industrial site end, a client and a monitoring end, wherein the cloud service end consists of a system host and n sub-clusters, each sub-cluster comprises a sub-cluster host and a plurality of slave machines connected with the host, the system host controls the sub-cluster hosts, and the sub-cluster hosts control the slave machines connected with the sub-cluster hosts; the industrial field end reads the data of the industrial object in real time and sends the data to the cloud server end for analysis and processing; the client is used for remotely connecting the cloud server, reading and displaying a cloud server computing result in real time and remotely sending an instruction to the cloud server; the monitoring terminal is used for monitoring the cloud service terminal, managing user information and industrial object information and sending a control instruction to the industrial site terminal in real time through the cloud service terminal. The platform can realize remote monitoring of the industrial process, has high operation efficiency, and can bear large-scale users for simultaneous use.

Description

Industrial process data analysis platform based on cloud computing

Technical Field

The invention belongs to the field of industrial process control, and particularly relates to an industrial process data analysis platform based on cloud computing.

Background

In industrial process control, an industrial process data analysis software platform often plays an important role, the platform mainly can collect and store process data in real time, and corresponding modeling algorithms are used for mainly realizing data-driven industrial process monitoring and soft measurement of process variables and quality variables so as to keep stable operation and improve product quality.

The industrial process monitoring based on data driving is to establish a mathematical model through actually measured process data so as to estimate and acquire the running state of the process, finally identify the abnormal behavior of the process, realize the timely diagnosis of the fault and ensure the safe running of the system; the process variable and quality variable soft measurement adopts a proper model, and the process variable and quality variable which are difficult to be directly measured by a sensor are predicted by using process variable data which are easy to measure, so that the product quality can be better controlled, and the production efficiency is improved.

At present, most industrial process data analysis software platforms are located in local computers of industrial fields, a client and an algorithm program are mutually nested, algorithms are calculated on one computer, and a data storage mode of the algorithms is that a certain computer is used as a database for storage, so that the calculation efficiency of the algorithms and the data storage capacity are limited. However, with the continuous development of modern industry, more and more industrial data can be stored, a series of industrial big data problems are formed, and with the continuous rise of internet and internet of things, the mode of realizing remote monitoring based on network is also the inevitable trend of modern industry by using mobile clients and other modes of fusing 'internet + industry'.

However, the traditional industrial process data analysis platform cannot meet the requirements of modern industry under the drive of the internet and ultra-large-scale data, firstly, the traditional system cannot store the ultra-large-scale data, and if the traditional system still uses less data to carry out operation, larger deviation is generated; secondly, due to excessive data quantity, the traditional single-machine algorithm operation mode cannot meet the real-time performance and stability of calculation; meanwhile, the client form of the system is single, and the client and the algorithm program cannot realize remote monitoring and control on site; finally, the algorithm in the traditional system has no shareability and reusability, a set of process monitoring and soft measurement algorithm must be independently written for each industrial object, the algorithm utilization rate is low, and the algorithm is not easy to manage. In summary, the conventional industrial process data analysis platform needs to be improved in terms of both algorithms and software platforms.

Disclosure of Invention

Aiming at the defects of the prior art, the invention provides an industrial process data analysis platform based on cloud computing, which can realize high independence of various algorithms, high sharing of data and high operation efficiency, and the specific technical scheme is as follows:

the industrial process data analysis platform based on cloud computing is characterized by comprising a cloud service end, an industrial site end, a client and a monitoring end;

the cloud service end is composed of a system host and n sub-clusters, the system host and each sub-cluster are positioned in the same local area network, wherein a Web server for providing service for the monitoring end, a data server for driving a system host database and a load balancer for connecting and managing each sub-cluster are installed in the system host; each sub-cluster comprises a sub-cluster host and a plurality of slave machines connected with the host, the system host controls the plurality of sub-cluster hosts, and the sub-cluster hosts control the slave machines connected with the sub-cluster hosts;

the industrial site end comprises a plurality of data processing modules, each data processing module is bound with a sensor of a corresponding industrial object, reads data of the industrial object in the working process in real time and sends the data to the cloud service end for analysis and processing; the data processing module is also used for remotely receiving control instructions sent by the client and the monitoring end to realize the start and stop of data collected by the data processing module, and the data processing module can also set corresponding industrial object control operation according to actual field conditions;

the client is used for remotely connecting the cloud server, reading a calculation result of the cloud server in real time, carrying out visual display, remotely sending an instruction to the cloud server according to needs, and remotely controlling the start and stop of data acquisition work of a data acquisition module of an industrial field end and the control operation of an industrial object under the condition allowed by the field by the cloud server;

the monitoring terminal is used for monitoring the working state of the cloud server and the industrial object operation data, managing user information and industrial object information, and sending a control instruction to the industrial site terminal in real time through the cloud server.

Further, the interaction process of the industrial site side, the cloud service side and the client side is as follows:

the industrial field side logs in a system host of the cloud server side, and the system host distributes corresponding calculation sub-clusters to the system host; the method comprises the steps that an industrial field terminal obtains variable data in real time from an industrial field local database and transmits the variable data to a corresponding sub-cluster of a cloud server according to a certain period to carry out algorithm analysis, the cloud server stores data analysis results into a database of a system host, and a client side polls the database of the system host to read the data analysis results and carry out visualization;

the client sends an instruction to the cloud server, the zone bit of the corresponding field in the system host database is modified, the industrial field side polls the corresponding field of the system host database, the zone bit is read, and corresponding control operation is started.

Further, the interaction between the monitoring terminal, the cloud service terminal and the industrial site terminal includes:

the monitoring terminal polls a system host database of the cloud service terminal and visualizes data so as to monitor the running state of the industrial site terminal; the monitoring end accesses a system host database in the cloud server end, reads user information and realizes the addition, deletion, modification and check of the user information; the monitoring end reads the working parameters of the system host and the sub-clusters in real time and dynamically displays the working parameters;

and the monitoring end sends an instruction to the cloud server end, modifies the zone bit of the corresponding field of the system host database, and the industrial field end polls the corresponding field of the system host database, reads the zone bit and starts corresponding control operation.

Further, the load balancer balances the load as follows:

the load balancer reads the working state information of the connected sub-groups at the same time interval, the working state information comprises the static configuration of each sub-group node and the dynamic performance in the working process, the dynamic performance comprises the CPU frequency, the memory occupation condition, the disk I/O reading rate and the communication network bandwidth use condition, the number of users distributed to each sub-group at the moment is dynamically calculated by adopting a load balancing algorithm after the dynamic performance is read, then the field of the IP address of each client-side connection sub-group node stored in a system host database is changed, and the client-side reads the IP address of the distributed computing node by polling the field and is connected with the node, so that the users are newly allocated to each sub-group.

Further, the load balancer adopts a dynamic load balancing algorithm based on load weight probability to control the load distribution of the sub-clusters, and the algorithm comprises the following steps:

the method comprises the following steps: defining an integrated static performance index C(s) describing compute nodes in a sub-cluster_ij) And a real-time dynamic performance index L(s)_ij)：

C(s_ij)＝k₁×n_ij×C(c_ij)+k₂×C(m_ij)+k₃×C(d_ij)+k₄×C(p_ij)

L(s_ij)＝k₁×L(c_ij)+k₂×L(m_ij)+k₃×L(d_ij)+k₄×L(p_ij)

Wherein, i is 1, …, N, j is 1, …, M_iN is the number of sub-clusters connected to the system host, M_iThe number of computing nodes connected under the ith sub-cluster, s_ijIs the j computing node under the i sub-cluster, n_ijIs the number of CPU cores of the compute node, c_ijRepresenting the usage of the node CPU, C (C)_ij) Representing a preconfigured static frequency, L (c)_ij) Representing real-time dynamic CPU frequency during operation; m is_ijRepresents the use case of node memory, wherein C (m)_ij) Representing a preconfigured memory capacity, L (m)_ij) Representing the real-time dynamic memory occupancy rate during work; d_ijRepresenting the use of node disk I/O, C (d)_ij) Representing a preconfigured disk I/O read rate, L (d)_ij) Representing the I/O occupancy rate of the real-time dynamic disk in work; p is a radical of_ijRepresents the network usage of each node, where C (p)_ij) Representing a preconfigured node network throughput, L (p)_ij) Representing real-time occupancy of the node network bandwidth, where k₁,k₂,k₃,k₄Is a weight coefficient, and k₁+k₂+k₃+k₄＝1；

Step two: defining a load weight W(s) of a node from static and dynamic performance indicators of the computing node_ij)

Step three: the load weight values of all the nodes are summed and averaged to obtain the load weight value W (S) of each sub-cluster_i)

Step four: the maximum load weight value in each sub-cluster at a certain moment is represented as W_max(S) bringing it into contact therewithLoad weights of all the sub-clusters are differenced to obtain a load difference value delta Q (S)_i)

ΔQ(S_i)＝W_max(S)-W(S_i)

Step five: defining the probability P (S) that the system host assigns users to each sub-cluster_i)

Step six: multiplying the current user number using the platform by the user distribution probability to obtain the number of users to be distributed to each sub-cluster

R(S₁)＝floor(P(S₁)×Num)

R(S₂)＝floor(P(S₂)×Num)

R(S_k)＝0

Where Num is the number of people currently using the platform, floor is rounding down, R (S)_i) The number of users assigned to each sub-cluster.

Further, the system host database is used for storing basic information of the user, industrial object information bound by the user, control instructions sent by the client and the monitoring terminal, and analysis data acquired from the sub-cluster.

Furthermore, the sub-cluster is configured with a distributed file storage architecture and a Map-Reduce-based distributed parallel computing framework, the sub-cluster host is a scheduling node, the sub-cluster host is connected with the sub-cluster host and is a data node and a computing node, all process monitoring and variable soft measurement algorithms are written on the computing node of the sub-cluster slave in each sub-cluster in the form of Map-Reduce, a functional interface is formed for each algorithm based on an MVC programming model and is stored in each sub-cluster host, and data is also stored in a distributed manner in a database of each computing node.

Furthermore, the client runs on a PC and/or a mobile terminal, and the monitoring terminal runs in a webpage form.

Furthermore, the process monitoring and variable soft measurement algorithms configured in the computing nodes of the sub-cluster of the cloud server side have a distributed parallel mode and a single machine mode.

Furthermore, the client can also directly upload local data to the cloud server for data analysis.

The invention has the following beneficial effects:

according to the industrial process data analysis platform based on cloud computing, a plurality of clusters are configured in a cloud server, each cluster is provided with a distributed parallel computing frame, all distributed industrial monitoring and soft measurement algorithms are compiled in the cloud server, a calling interface of each algorithm is formed, and meanwhile, a multithreading technology is adopted to realize that all the algorithms can be used at the same time; and the industrial site end acquires data in real time and remotely sends the data to the cloud end for calculation, and the client end reads a calculation result from the cloud server and visualizes the calculation result. The process monitoring and soft measurement algorithms are highly independent and highly shared, and the operation efficiency is improved.

Drawings

FIG. 1 is an overall architecture diagram of a cloud computing-based industrial process data analytics platform of the present invention;

FIG. 2 is a schematic diagram of a cloud server;

FIG. 3 is a schematic diagram of the operation of a load balancer at a cloud server;

FIG. 4 is a diagram illustrating a distributed storage architecture for files in a sub-cluster;

FIG. 5 is a block diagram of a distributed computing framework in a sub-cluster;

FIG. 6 is a schematic diagram illustrating an interaction process between an industrial site and a cloud server or a client;

FIG. 7 is a schematic diagram illustrating an interaction process between a client and a cloud server or an industrial site;

fig. 8 is a schematic view of an interaction process between a monitoring terminal and a cloud server terminal and between a monitoring terminal and an industrial site terminal.

Detailed Description

The present invention will be described in detail below with reference to the accompanying drawings and preferred embodiments, and the objects and effects of the present invention will become more apparent, and the present invention will be further described in detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.

As shown in fig. 1, an industrial process data analysis platform based on cloud computing includes a cloud service end, an industrial site end, a client end and a monitoring end;

the cloud service end consists of a system host and n sub-clusters, wherein the system host and each sub-cluster are positioned in the same local area network, and a Web server for providing service for the monitoring end, a data server for driving a system host database and a load balancer for connecting and managing each sub-cluster are arranged in the system host; each sub-cluster comprises a sub-cluster host and a plurality of slave machines connected with the host, the system host controls the plurality of sub-cluster hosts, and the sub-cluster hosts control the slave machines connected with the sub-cluster hosts; the sub-cluster is provided with a distributed file storage architecture and a Map-Reduce-based distributed parallel computing framework, the sub-cluster host is a scheduling node, the slave machines connected with the sub-cluster host are a data node and a computing node, all process monitoring and variable soft measurement algorithms are written on the computing nodes of the slave machines in each sub-cluster in a Map-Reduce mode, a functional interface is formed for each algorithm based on an MVC programming model and stored in each sub-cluster host, and meanwhile, data are also stored in a distributed mode in a database of each computing node.

The industrial site end is presented in a software form and comprises a plurality of data processing modules, each data processing module is bound with a sensor of a corresponding industrial object, reads data of the industrial object in the working process in real time and sends the data to the cloud service end for analysis and processing; the data processing module is also used for remotely receiving control instructions sent by the client and the monitoring end to realize the start and stop of data collected by the data processing module, and the data processing module can also set corresponding industrial object control operation according to actual field conditions;

the client is used for remotely connecting the cloud server, reading a calculation result of the cloud server in real time, carrying out visual display, remotely sending an instruction to the cloud server according to needs, and remotely controlling the start and stop of data acquisition work of a data acquisition module of the industrial field end and the control operation of an industrial object under the condition allowed by the field by the cloud server; the client runs on a PC and/or a mobile terminal.

The monitoring terminal adopts a webpage form and is used for monitoring the working state of the cloud service terminal and the operation data of the industrial object, managing user information and industrial object information, editing registration information of users, adding and cancelling the users, checking the industrial object information bound by the current cloud service terminal, and sending a control instruction to the industrial site terminal in real time through the cloud service terminal. The administrator can log in the monitoring end webpage at any time and place for management.

The specific working principles and functions of the platform cloud server, the industrial site, the client and the monitoring terminal are described in detail below.

1. Cloud server

A specific architecture of the cloud server is shown in fig. 2, and specific functions and working processes of each module in the cloud server are specifically described below.

1.1 System host

1.1.1 data Server

The data server is mainly used for driving the system host database and obtaining the algorithm analysis result stored in the distributed database in the sub-cluster. The user information data table in the system host database is firstly used for storing information of user such as login, registration information, use duration, purchase time and the like, and secondly storing industrial object information bound by the user, control sent by the client and the monitoring terminalInstructions and analytical data obtained from the sub-clusters, (e.g. using PCA (principal component analysis) for process monitoring, the analytical data obtained from the sub-clusters is T²And SPE statistics and control limits of both) for the client and the monitoring side to read data and send instructions using the data server.

1.1.2Web Server

The Web server is a server used for driving the monitoring end Web page, namely, all programs of the monitoring end Web site are executed in the Web server in the system host. Since the monitoring end needs to send a command to the cloud end in real time, read and manage user information, industrial object information, cloud end analysis data and the like stored in the system host database, the monitoring end needs to access the database of the system host through the Web server to perform the above dynamic operation on the system host database.

1.1.3 load balancer

The purpose of realizing load balancing of each sub-cluster in the system host is to solve the problem of uneven load caused when a large number of users access a certain sub-cluster at the same time. When the workload of a certain sub-cluster is already large, the computing capability and the computing speed of the certain sub-cluster are both greatly reduced, and the loads of other sub-clusters are possibly not large, so that the load (user) of the large-load sub-cluster needs to be distributed to other sub-clusters, each sub-cluster keeps the best working state, and the overall computing speed of the cloud server is ensured.

The working principle of the load balancer designed in the platform is shown in fig. 3, the load balancer is installed in a system host, the load balancer reads the working state information of the connected sub-clusters at the same time, including the static configuration of each sub-cluster node and the dynamic performance in the working process, the dynamic performance comprises CPU frequency, memory occupation, disk I/O reading rate and communication network bandwidth use condition, the number of users assigned to each sub-cluster at that time is dynamically calculated using a load balancing algorithm after reading, and then changing the field of the IP address of each client connection sub-cluster node stored in the system host database, and the client reads the IP address of the distributed computing node by polling the field and is connected with the node, thereby realizing the re-allocation of users for each sub-cluster.

The load balancer designed in the platform has an internal load balancing strategy of a dynamic load balancing algorithm based on load weight probability, and the algorithm has the following principle:

C(s_ij)＝k₁×n_ij×C(c_ij)+k₂×C(m_ij)+k₃×C(d_ij)+k₄×C(p_ij)

L(s_ij)＝k₁×L(c_ij)+k₂×L(m_ij)+k₃×L(d_ij)+k₄×L(p_ij)

Step four: the maximum load weight value in each sub-cluster at a certain moment is represented as W_max(S), the load weight values of all the sub-clusters are subtracted to obtain a load difference value delta Q (S)_i)

ΔQ(S_i)＝W_max(S)-W(S_i)

R(S₁)＝floor(P(S₁)×Num)

R(S₂)＝floor(P(S₂)×Num)

R(S_k)＝0

1.2 sub-clusters

1.2.1 File distributed storage architecture in a sub-cluster

From the above, in order to improve the storage and extraction speed of mass data, the platform adopts a distributed file storage architecture, and the user data volume is less and only needs to be stored in the database attached to the system host, while a large amount of data used for calculation and algorithm modeling needs to be stored in each sub-cluster in a parallel distribution manner.

For each sub-cluster, the cluster master may be considered a scheduling node and the slaves it is connected to may be considered data nodes. When the file is stored, the scheduling node in the cluster host firstly divides the data into blocks according to a certain size, then informs the data nodes in the slave of the distribution result of each block of data, and finally stores the obtained data blocks into the disk of the computer or the server by each data node.

When the file is read, the scheduling node in the cluster host informs each data node of the file to be extracted, and after the notification is completed, each data node extracts and splices corresponding data blocks into a new file in parallel according to the requirement and sends the new file to the client.

The file distributed storage process is shown in fig. 4.

1.2.2 distributed computing framework in a sub-Cluster

The part of the platform mainly completing algorithm operation is a computing node in a sub-cluster, and in order to improve the capability of the algorithm in the process of processing big data, the platform is provided with a distributed parallel computing framework based on MapReduce in each sub-cluster.

The distributed parallel computing framework is similar to the file distributed storage framework, the operation process of each computing node is managed and scheduled by the cluster host, and the task or algorithm parallel is realized by simultaneously computing data blocks by a plurality of computing nodes under the control of the cluster host.

The MapReduce is one kind of distributed parallel computation, and mainly extracts and processes data blocks stored in each computing node in parallel through a Map link, maps the data blocks into key value pairs and stores the key value pairs into a disk of each computing node, and the Reduce link extracts the key value pairs generated by the Map in parallel, reduces the key value pairs according to data with the same key value and processes the reduced data.

The MapReduce process is performed in the computing nodes, and the cluster host is still responsible for scheduling among processes in the process, so that the distributed computing framework in the sub-cluster is shown in fig. 5.

1.2.3 Process monitoring and Soft measurement Algorithm library

Each sub-cluster comprises a process monitoring and soft measurement algorithm library which comprises a distributed parallel algorithm and a single machine algorithm from an operation mode; from a functional perspective, process monitoring algorithms as well as soft measurement algorithms are included. The codes of all algorithms are stored in the computing nodes of the sub-cluster, and a calling interface for transmitting parameters and data in and transmitting results out is formed.

When a plurality of users use the same algorithm at the same time, the computing node can process two requests at the same time in a multi-thread mode, and an exclusive folder is formed for each user, so that all algorithms can be used by large-scale users at the same time.

The user can also directly upload local data to the cloud server through the client to perform data analysis.

The interaction process between the various ends of the platform is described below.

The interaction process of the industrial site end, the cloud server and the client is shown in fig. 6, the industrial site end logs in a system host of the cloud server, and the system host distributes corresponding computing sub-clusters to the system host; the method comprises the steps that an industrial field terminal obtains variable data in real time from an industrial field local database and transmits the variable data to a corresponding sub-cluster of a cloud server according to a certain period to carry out algorithm analysis, the cloud server stores data analysis results into a database of a system host, and a client side polls the database of the system host to read the data analysis results and carry out visualization;

Fig. 7 shows an interaction process between the client and the cloud server and an industrial site, and a method for reading data of the cloud server in real time by the client still adopts a polling technology for a database of a host of the cloud server system, that is, an analysis result is read from the database of the host of the cloud server system at regular intervals, and is visually displayed on the client. Meanwhile, when the client sends an instruction to the industrial object, the relevant field of the system host database is updated, and the industrial field end can poll the system host database to read the field, so that the industrial field end updates the local database of the industrial monitoring software to complete control operation.

The interaction between the monitoring end and the cloud service end and the interaction between the monitoring end and the industrial site end are shown in fig. 8. The monitoring end is presented in a webpage form and mainly has the functions of monitoring the working state of the cloud server, managing user information, managing industrial object information, monitoring industrial object operation data, sending simple control instructions and the like.

The monitoring terminal polls a system host database of the cloud service terminal and visualizes data so as to monitor the running state of the industrial site terminal; the monitoring end accesses a system host database in the cloud server end, reads user information and realizes the addition, deletion, modification and check of the user information; the monitoring end reads working parameters of a system host and a sub-cluster in real time, dynamically displays the working parameters and draws a real-time dynamic curve;

It will be understood by those skilled in the art that the foregoing is only a preferred embodiment of the present invention, and is not intended to limit the invention, and although the invention has been described in detail with reference to the foregoing examples, it will be apparent to those skilled in the art that various changes in the form and details of the embodiments may be made and equivalents may be substituted for elements thereof. All modifications, equivalents and the like which come within the spirit and principle of the invention are intended to be included within the scope of the invention.

Claims

1. The industrial process data analysis platform based on cloud computing is characterized by comprising a cloud service end, an industrial site end, a client and a monitoring end;

the monitoring terminal is used for monitoring the working state of the cloud server and the industrial object operation data, managing user information and industrial object information, and sending a control instruction to the industrial field terminal in real time through the cloud server;

the interaction process of the industrial site end, the cloud service end and the client is as follows:

2. The cloud computing-based industrial process data analytics platform of claim 1, wherein the interaction of the monitor, cloud server and industrial site comprises:

3. The cloud computing-based industrial process data analytics platform of claim 1, wherein the load balancer balances the load as follows:

4. The cloud computing-based industrial process data analytics platform of claim 3, wherein said load balancer uses a dynamic load balancing algorithm based on load weight probability to control the load distribution of said sub-clusters, said algorithm comprising the steps of:

C(s_ij)＝k₁×n_ij×C(c_ij)+k₂×C(m_ij)+k₃×C(d_ij)+k₄×C(p_ij)

L(s_ij)＝k₁×L(c_ij)+k₂×L(m_ij)+k₃×L(d_ij)+k₄×L(p_ij)

ΔQ(S_i)＝W_max(S)-W(S_i)

R(S₁)＝floor(P(S₁)×Num)

R(S₂)＝floor(P(S₂)×Num)

R(S_k)＝0

5. The cloud computing-based industrial process data analytics platform of claim 1, wherein the system host database is configured to store basic information of a user, industrial object information bound by the user, control instructions sent by the client and the monitor, and analysis data obtained from the sub-cluster.

6. The cloud-computing-based industrial process data analytic platform of claim 1, wherein the sub-clusters are configured with a distributed file storage architecture and a Map-Reduce-based distributed parallel computing architecture, the sub-cluster hosts are scheduling nodes, the slaves connected to the sub-cluster hosts are data nodes and computing nodes, all process monitoring and variable soft measurement algorithms are written on the computing nodes of the slaves in each sub-cluster in a Map-Reduce form, a functional interface is formed for each algorithm based on an MVC programming model and stored in each sub-cluster host, and data are also stored in a database of each computing node in a distributed manner.

7. The cloud computing-based industrial process data analytics platform of claim 1, wherein said client runs on a PC and/or a mobile terminal and said monitor runs in the form of a web page.

8. The cloud computing-based industrial process data analytic platform of claim 1, wherein the process monitoring and variable soft measurement algorithms configured in the compute nodes of the sub-cluster of the cloud server have a distributed parallel mode and a stand-alone mode.

9. The cloud computing-based industrial process data analytics platform as claimed in claim 1, wherein said client can also directly upload local data to a cloud server for data analysis.