CN106452899A - Distributed data mining system and method - Google Patents

Distributed data mining system and method Download PDF

Info

Publication number
CN106452899A
CN106452899A CN201610957904.2A CN201610957904A CN106452899A CN 106452899 A CN106452899 A CN 106452899A CN 201610957904 A CN201610957904 A CN 201610957904A CN 106452899 A CN106452899 A CN 106452899A
Authority
CN
China
Prior art keywords
server
control server
task
information
calculation
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201610957904.2A
Other languages
Chinese (zh)
Other versions
CN106452899B (en
Inventor
丁贤
金焰
王备
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Industrial and Commercial Bank of China Ltd ICBC
Original Assignee
Industrial and Commercial Bank of China Ltd ICBC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Industrial and Commercial Bank of China Ltd ICBC filed Critical Industrial and Commercial Bank of China Ltd ICBC
Priority to CN201610957904.2A priority Critical patent/CN106452899B/en
Publication of CN106452899A publication Critical patent/CN106452899A/en
Application granted granted Critical
Publication of CN106452899B publication Critical patent/CN106452899B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06Management of faults, events, alarms or notifications
    • H04L41/0654Management of faults, events, alarms or notifications using network fault recovery
    • H04L41/0663Performing the actions predefined by failover planning, e.g. switching to standby network elements
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/10Protocols in which an application is distributed across nodes in the network

Abstract

The invention provides a distributed data mining system and method, and relates to the technical field of data mining. The distributed data mining system is characterized in that a first control server is used as a work node; and a second control server is used for real-time monitoring, and sending a work node replacing request to a prepositive server to complete replacing of the work node when the second control servers determines that the first control server malfunctions. Therefore, after completing switching of the work node, the normal working of the whole distributed data mining system is recovered. The distributed data mining system and method can avoid the problem that once the current JobTracker server as the main control node malfunctions, the task scheduling system of the whole hadoop framework becomes paralysed so that the task scheduling and processing cannot be finished.

Description

A kind of distributed data digging system and method
Technical field
The present invention relates to data mining technology field, more particularly to a kind of distributed data digging system and method.
Background technology
In recent years, the rise of the generation information technology with big data, cloud computing, mobile Internet etc. as representative is in the whole world In the range of started " revolution of third time IT ".With the arrival in big data epoch, data analysis capabilities how are lifted further, deep The business for entering mining data is worth, and closely drives the change of products innovation, operation flow and management system, realizes real " with number It is said that words ", become a new problem.The essence of data mining be excavate from mass data implicit, decision-making is had potential The relation, pattern of value and trend.Due to being related to the calculating of mass data, the framework for realizing system is had higher requirements.
At present the big data digging system of main flow be using distributed mining structure, will a big data analysis task Decomposed, then by multiple servers parallel computation, finally result of itemizing is collected.For example based on hadoop framework Distributed system basic framework to be represented is widely used.In hadoop framework, JobTracker server undertakes total tune The big data analysis operation that the key player of degree, i.e. receive user terminal submit to, realizes task according to map/reduce algorithm Decompose, then according to the busy degree of calculation server (TaskTracker), task is distributed on the calculation server of free time. JobTracker server monitors the implementation status of the upper task of calculation server (TaskTracker) simultaneously, if tasks carrying has Abnormal, then task is redistributed.There is an apparent defect, the i.e. risk of Single Point of Faliure in this framework.As master The task scheduling system for causing whole hadoop framework is absorbed in if breaking down by the JobTracker server of control node Paralysis.Being embodied in user terminal the submission of task cannot be completed because of the main controlled node fault as service entrance;Computing The implementation procedure of server (TaskTracker) cannot be monitored and cause execution failed tasks redistribute;Simultaneously Calculation server (TaskTracker) makes resource in idle state because losing task distributor.
Content of the invention
Embodiments of the invention provide a kind of distributed data digging system and method, current as main controlled node to solve JobTracker server if breaking down, the task scheduling system for causing whole hadoop framework is paralysed, no Method completes task scheduling and the problem for processing.
For reaching above-mentioned purpose, the present invention is adopted the following technical scheme that:
A kind of distributed data digging system, including:User terminal group, front server group, the first control server, the Two control servers and calculation server group;The user terminal group includes multiple user terminals;The front server group Including multiple front servers;The calculation server group includes multiple calculation servers;The user terminal group with described before Put server group communication connection;The front server group respectively with described first control server, second control server and The calculation server group communication connection;
The user terminal, for sending data mining task request to the front server;
The front server, for parsing the domain-name information of data mining task request, digs according to the data Data mining task request is submitted to the domain-name information of pick task requests the first control server as working node;
The first control server, for asking corresponding data mining task to carry out point the data mining task Solution, forms multiple data mining subtasks;The plurality of data mining subtask is sent to the front server;
The front server, is additionally operable to be assigned at multiple calculation servers by the plurality of data mining subtask Row is processed, and receives the task feedback information of calculation server, and the task feedback information is sent to the described first control Server;
The first control server, is additionally operable to for the task feedback information real-time synchronization to be sent to the second control service At device;
The second control server, for the first control server described in monitor in real time, is confirming first control During server fail, working node is sent to the front server and substitute request;
The front server, is additionally operable to substitute request according to the working node, the second control clothes described in more new record The network address of business device, so that the second control server is used as working node;
The second control server, is additionally operable to ask to each calculation server broadcasting tasks information;
The calculation server, is additionally operable to when broadcasting tasks information request is listened to, to the preposition clothes Business device feedback task situation information;
The front server, is additionally operable to for the task situation information to be sent to the described second control server;
The second control server, it is right to be additionally operable to carry out the task situation information and the task feedback information Than, determine the different information of the task situation information and the task feedback information, and according to the process strategy for pre-setting, The different information is processed.
Further, the front server, is additionally operable to obtain the fortune of other front servers in front server group Row status information;When the running status of other front servers is malfunction, receive with other front servers even The connection request of the user terminal for connecing, and set up communication connection.
Further, the front server, is additionally operable to record the network of the first control server as working node Address or the network address of the second control server as working node.
Additionally, the front server, specifically for receiving the heartbeat message of calculation server;The calculation server Heartbeat message includes that calculation server processing data excavates the cpu resource letter of the task feedback information of subtask and calculation server Breath;The heartbeat message of the calculation server is sent to the described first control server.
Additionally, described first control server, specifically for multiple data mining subtasks are sent to described preposition During server, data synchronization information is sent to the second control server;The data synchronization information includes data mining subtask The corresponding calculation server of mission number and each data mining subtask IP address;
After the heartbeat message for receiving calculation server, the heartbeat message real-time synchronization of calculation server is sent to At two control servers.
Additionally, the second control server, specifically for being taken to the described first control with prefixed time interval timing Business device sends heartbeat request;If continuous n time sends after heartbeat request to the described first control server, the first control is not all received The heart beating response message of control server, it is determined that the first control server fail, sends to the front server Working node substitutes request;Wherein n is the frequency threshold value for pre-setting.
Additionally, the second control server, specifically for:
According to the task situation information and the task feedback information, two parts of task list lists are generated;The task Inventory list includes the cpu resource information of the IP address of calculation server and calculation server;
According to two parts of task list lists, different information is determined;
If the different information has been distributed to due to the first control server after calculation server for the first control server Fault, is not synchronized to the task of the second control server, controls the number of server according to task situation information updating second According to synchronizing information;
If the different information is the first control server has distributed to calculation server, and calculation server process task After failure, due to the first control server failure, the task of the second control server is not synchronized to, from the task situation information Middle acquisition mission failure information, and corresponding for mission failure information data mining subtask is redistributed;
If the different information is the first control server still unappropriated data mining subtask, will still unappropriated number Distributed at calculation server by front server according to excavation subtask and processed.
Additionally, the first control server, specifically for the cpu resource information according to calculation server, by data Excavate the calculation server that cpu resource maximum in each calculation server is distributed in subtask.
A kind of distributed data digging method, is applied to above-mentioned distributed data digging system, and the system includes:User Set of terminal, front server group, the first control server, the second control server and calculation server group;User's end End group includes multiple user terminals;The front server group includes multiple front servers;The calculation server group includes Multiple calculation servers;The user terminal group is communicated to connect with the front server group;The front server group is respectively With the described first control server, the second control server and calculation server group communication connection;
Methods described includes:
User terminal sends data mining task request to the front server;
Front server parses the domain-name information of the data mining task request, is asked according to the data mining task Domain-name information using the data mining task request be submitted to as working node first control server;
The data mining task is asked corresponding data mining task to be decomposed by the first control server, is formed many Individual data mining subtask, and the plurality of data mining subtask is sent to the front server;
The front server is assigned to the plurality of data mining subtask at multiple calculation servers and is processed, And the task feedback information of calculation server is received, and the task feedback information is sent to the described first control server;
The first control server is sent to the task feedback information real-time synchronization at the second control server;
First control server described in the second control server real-time monitoring, is confirming the first control server When breaking down, working node is sent to the front server and substitute request;
The front server substitutes request according to the working node, the net of the second control server described in more new record Network address, so that the second control server is used as working node;
The second control server is asked to each calculation server broadcasting tasks information;
The calculation server is fed back to the front server when broadcasting tasks information request is listened to Task situation information;
The task situation information is sent to the described second control server by the front server;
The second control server is contrasted to the task situation information and the task feedback information, determines institute The different information of task situation information and the task feedback information is stated, and according to the process strategy for pre-setting, to the difference Different information is processed.
Additionally, described distributed data digging method, also includes:
The front server obtains the running state information of other front servers in front server group;Described When the running status of other front servers is malfunction, the connection of the user terminal being connected with other front servers is received Request, and set up communication connection.
Additionally, described distributed data digging method, also includes:
The front server record is as the network address of the first control server of working node or as work section The network address of the second control server of point.
Specifically, the plurality of data mining subtask is assigned at multiple calculation servers by the front server Row is processed, and receives the task feedback information of calculation server, and the task feedback information is sent to the described first control Server, including:
The front server receives the heartbeat message of calculation server;The heartbeat message of the calculation server includes fortune Calculate the task feedback information of server process data mining subtask and the cpu resource information of calculation server;
The heartbeat message of the calculation server is sent to the described first control server.
In addition described distributed data digging method, also includes:
Described first controls server when multiple data mining subtasks are sent to the front server, to second Control server sends data synchronization information;The data synchronization information includes the mission number of data mining subtask and each number According to the IP address for excavating the corresponding calculation server in subtask;
The first control server is sent to the task feedback information real-time synchronization at the second control server, bag Include:
The heart beating of calculation server is believed after the heartbeat message for receiving calculation server by the first control server Breath real-time synchronization is sent at the second control server.
Specifically, the first control server described in the second control server real-time monitoring, is confirming first control When control server breaks down, working node is sent to the front server and substitute request, including:
The second control server sends heart beating with prefixed time interval timing to the described first control server please Ask;
If continuous n time sends after heartbeat request to the described first control server, the first control server is not all received Heart beating response message, it is determined that described first control server fail, to the front server send working node Substitute request;Wherein n is the frequency threshold value for pre-setting.
Specifically, the second control server carries out right to the task situation information and the task feedback information Than, determine the different information of the task situation information and the task feedback information, and according to the process strategy for pre-setting, The different information is processed, including:
The second control server generates two parts of tasks according to the task situation information and the task feedback information Inventory list;The task list list includes the IP address of calculation server and the cpu resource information of calculation server;
The second control server determines different information according to two parts of task list lists;
If the different information has been distributed to due to the first control server after calculation server for the first control server Fault, is not synchronized to the task of the second control server, controls the number of server according to task situation information updating second According to synchronizing information;
If the different information is the first control server has distributed to calculation server, and calculation server process task After failure, due to the first control server failure, the task of the second control server is not synchronized to, from the task situation information Middle acquisition mission failure information, and corresponding for mission failure information data mining subtask is redistributed;
If the different information is the first control server still unappropriated data mining subtask, will still unappropriated number Distributed at calculation server by front server according to excavation subtask and processed.
Further, described distributed data digging method, also includes:
One data mining subtask is distributed by the first control server according to the cpu resource information of calculation server Calculation server to cpu resource maximum in each calculation server.
A kind of distributed data digging system and method provided in an embodiment of the present invention, is controlling as the first of working node When control server breaks down, the second control server can substitute the described first control server becomes new working node, So as to, after working node switching is completed, recover the normal work of whole distributed data digging system.Certainly, when the second control During server fail, the first control server can also substitute the described second control server, and two control servers can The general layout hot standby to realize principal and subordinate, it is to avoid the current JobTracker server as main controlled node will if breaking down The task scheduling system of whole hadoop framework is caused to paralyse, it is impossible to complete task scheduling and the problem for processing.
Description of the drawings
In order to be illustrated more clearly that the embodiment of the present invention or technical scheme of the prior art, below will be to embodiment or existing Accompanying drawing to be used needed for technology description is had to be briefly described, it should be apparent that, drawings in the following description are only this Some embodiments of invention, for those of ordinary skill in the art, without having to pay creative labor, may be used also To obtain other accompanying drawings according to these accompanying drawings.
Fig. 1 is a kind of structural representation one of distributed data digging system provided in an embodiment of the present invention;
Fig. 2 is a kind of flow chart one of distributed data digging method provided in an embodiment of the present invention;
Fig. 3 is a kind of flowchart 2 of distributed data digging method provided in an embodiment of the present invention.
Specific embodiment
Below in conjunction with the accompanying drawing in the embodiment of the present invention, the technical scheme in the embodiment of the present invention is carried out clear, complete Site preparation is described, it is clear that described embodiment is only a part of embodiment of the present invention, rather than whole embodiments.It is based on Embodiment in the present invention, it is every other that those of ordinary skill in the art are obtained under the premise of creative work is not made Embodiment, belongs to the scope of protection of the invention.
As shown in figure 1, the embodiment of the present invention provides a kind of distributed data digging system 10, including:User terminal group 11, Front server group 12, first controls server 13, second to control server 14 and calculation server group 15;User's end End group 11 includes multiple user terminals 111;The front server group 12 includes multiple front servers 121;The computing clothes Business device group 15 includes multiple calculation servers 151;The user terminal group 11 is communicated to connect with the front server group 12;Institute State front server group 12 and control server 14 and the calculation server group with the described first control server 13, second respectively 15 communication connections.
Wherein, the user terminal 111, for sending data mining task request to the front server 121.
Herein, the user terminal 111 can run procedure script of hadoop client or data mining task etc..
The front server 121, for parsing the domain-name information of data mining task request, according to the data Data mining task request is submitted to the domain-name information of mining task request the first control service as working node Device 13.
The first control server 13, for asking corresponding data mining task to carry out the data mining task Decompose, form multiple data mining subtasks.The plurality of data mining subtask is sent to the front server 121.
The front server 121, is additionally operable to for the plurality of data mining subtask to be assigned to multiple calculation servers Processed at 151, and the task feedback information of calculation server 151 is received, and the task feedback information is sent to institute State the first control server 13.
The first control server 13, is additionally operable to for the task feedback information real-time synchronization to be sent to the second control clothes At business device 14.
The second control server 14, for the first control server 13 described in monitor in real time, is confirming described first When control server 13 breaks down, working node is sent to the front server 121 and substitute request.
The front server 121, is additionally operable to substitute request according to the working node, the second control described in more new record The network address of server 14, so that the second control server 14 is used as working node.
The second control server 14, is additionally operable to ask to 151 broadcasting tasks information of each calculation server.
The calculation server 151, is additionally operable to when broadcasting tasks information request is listened to, to described preposition Server 121 feeds back task situation information.
The front server 121, is additionally operable to for the task situation information to be sent to the described second control server 14.
The second control server 14, it is right to be additionally operable to carry out the task situation information and the task feedback information Than, determine the different information of the task situation information and the task feedback information, and according to the process strategy for pre-setting, The different information is processed.
Further, the front server 121, is additionally operable to obtain other the preposition services in 121 groups of front server The running state information of device 121.When the running status of other front servers 121 is malfunction, receive and other The connection request of the user terminal that front server 121 connects, and set up communication connection.
Further, the front server 121, is additionally operable to record as the first control server 13 of working node The network address or the network address of the second control server 14 as working node.
Additionally, the front server 121, specifically for receiving the heartbeat message of calculation server 151.The computing clothes The heartbeat message of business device 151 includes that 151 processing data of calculation server excavates the task feedback information of subtask and computational service The cpu resource information of device 151.The heartbeat message of the calculation server 151 is sent to the described first control server 13.
Additionally, described first control server 13, specifically for multiple data mining subtasks are sent to described before When server 121 is put, data synchronization information is sent to the second control server 14.The data synchronization information includes data mining The IP address of the corresponding calculation server 151 of the mission number of subtask and each data mining subtask.
After the heartbeat message for receiving calculation server 151, the heartbeat message real-time synchronization of calculation server 151 is sent out It is sent at the second control server 14.
Additionally, the second control server 14, specifically for prefixed time interval timing to the described first control Server 13 sends heartbeat request.If continuous n time sends after heartbeat request to the described first control server 13, all do not receive The heart beating response message of the first control server 13, it is determined that the first control server 13 breaks down, to described preposition Server 121 sends working node and substitutes request.Wherein n is the frequency threshold value for pre-setting.
Additionally, the second control server 14, specifically for:
According to the task situation information and the task feedback information, two parts of task list lists are generated.The task Inventory list includes the cpu resource information of the IP address of calculation server 151 and calculation server 151.
According to two parts of task list lists, different information is determined.
If the different information has been distributed to as the first control takes after calculation server 151 for the first control server 13 Business 13 fault of device, is not synchronized to the task of the second control server 14, according to the control clothes of task situation information updating second The data synchronization information of business device 14.
If the different information is the first control server 13 has distributed to calculation server 151, and calculation server 151 After process task failure, due to the first control 13 fault of server, the task of the second control server 14 is not synchronized to, from described Mission failure information is obtained in task situation information, and corresponding for mission failure information data mining subtask is divided again Join.
If the different information is the first control server 13 still unappropriated data mining subtask, will be still unappropriated Data mining subtask is distributed to by front server 121 and is processed at calculation server 151.
Additionally, the first control server 13, specifically for the cpu resource information according to calculation server 151, by one The calculation server 151 of cpu resource maximum in each calculation server 151 is distributed in data mining subtask.
A kind of distributed data digging system provided in an embodiment of the present invention, in the first control service as working node When device breaks down, second control server can substitute described first control server become new working node, so as to After completing working node switching, recover the normal work of whole distributed data digging system.Certainly, when second controls server When breaking down, the first control server can also substitute the described second control server, and two control servers can be realized The hot standby general layout of principal and subordinate, it is to avoid the current JobTracker server as main controlled node will cause whole if breaking down The task scheduling system of individual hadoop framework paralyses, it is impossible to complete task scheduling and the problem for processing.
As shown in Fig. 2 the embodiment of the present invention provides a kind of distributed data digging method, it is applied to shown in above-mentioned Fig. 1 Distributed data digging system, methods described includes:
Step 201, user terminal send data mining task request to the front server.
Step 202, front server parse the domain-name information of the data mining task request, according to the data mining Data mining task request is submitted to the domain-name information of task requests the first control server as working node.
The data mining task is asked corresponding data mining task to carry out point by step 203, the first control server Solution, forms multiple data mining subtasks, and the plurality of data mining subtask is sent to the front server.
Step 204, the front server are assigned to the plurality of data mining subtask at multiple calculation servers Processed, and the task feedback information of calculation server is received, and the task feedback information is sent to the described first control Control server.
The task feedback information real-time synchronization is sent to the second control clothes by step 205, the first control server At business device.
First control server described in step 206, the second control server real-time monitoring, is confirming first control When control server breaks down, working node is sent to the front server and substitute request.
Step 207, the front server substitute request according to the working node, the second control clothes described in more new record The network address of business device, so that the second control server is used as working node.
Step 208, the second control server are asked to each calculation server broadcasting tasks information.
Step 209, the calculation server listen to the broadcasting tasks information request when, to the preposition clothes Business device feedback task situation information.
The task situation information is sent to the described second control server by step 210, the front server.
Step 211, described second control server the task situation information and the task feedback information are carried out right Than, determine the different information of the task situation information and the task feedback information, and according to the process strategy for pre-setting, The different information is processed.
A kind of distributed data digging method provided in an embodiment of the present invention, in the first control service as working node When device breaks down, second control server can substitute described first control server become new working node, so as to After completing working node switching, recover the normal work of whole distributed data digging system.Certainly, when second controls server When breaking down, the first control server can also substitute the described second control server, and two control servers can be realized The hot standby general layout of principal and subordinate, it is to avoid the current JobTracker server as main controlled node will cause whole if breaking down The task scheduling system of individual hadoop framework paralyses, it is impossible to complete task scheduling and the problem for processing.
In order that those skilled in the art is better understood by the present invention, a more detailed embodiment is set forth below, As shown in figure 3, the embodiment of the present invention provides a kind of distributed data digging method, including:
Step 301, user terminal and a front server set up communication connection.
Step 302, each front server obtain the running state information of other front servers in front server group; When the running status of other front servers is malfunction, the user terminal being connected with other front servers is received Connection request, and set up communication connection.
Herein, it is the load balancing of realizing front server group, the present invention is using ZooKeeper as Distributed Application journey Sequence coordination service, Zookeeper is generally made up of multiple nodes (being each front server) herein, and each node leads to each other Cross heart beating and obtain others' running state information, and in the internal memory of each node in store full dose data, node single-point therefore Barrier can't affect the service ability of whole cluster.In the present invention, each front server is used as a section of Zookeeper Point, after client is connected on certain front server, client will remain this connection, and by this connection come Send request, event notice is obtained, and heart beating is sent, if be connected breaks down, then client can be connected to automatically In addition, on available node, finally the service cluster of one High Availabitity of composition, realizes the association between multiple distributed variable-frequencypump Biconditional operation.
Step 303, user terminal send data mining task request to the front server.
Step 304, front server parse the domain-name information of the data mining task request, according to the data mining Data mining task request is submitted to the domain-name information of task requests the first control server as working node.
Wherein, the front server record has the network address of the first control server as working node.Now, Second control server is used as secondary node.
The data mining task is asked corresponding data mining task to carry out point by step 305, the first control server Solution, forms multiple data mining subtasks, and the plurality of data mining subtask is sent to the front server.
Step 306, first control server when multiple data mining subtasks are sent to the front server, to Second control server sends data synchronization information;The data synchronization information include data mining subtask mission number and The IP address of the corresponding calculation server in each data mining subtask.
Step 307, the front server are assigned to the plurality of data mining subtask at multiple calculation servers Processed.
Step 308, front server receive the heartbeat message of calculation server, and the heart beating of the calculation server is believed Breath is sent to the described first control server.
Wherein, the heartbeat message of the calculation server includes that the task of calculation server processing data excavation subtask is anti- The cpu resource information of feedforward information and calculation server.
Step 309, the first control server can continue a data mining according to the cpu resource information of calculation server The calculation server of cpu resource maximum in each calculation server is distributed in subtask.
Step 310, first control server after the heartbeat message for receiving calculation server, by the heart of calculation server Hop-information real-time synchronization is sent at the second control server.
Step 311, the second control server send the heart with prefixed time interval timing to the described first control server Request is jumped, if continuous n time sends after heartbeat request to the described first control server, does not all receive the first control server Heart beating response message, it is determined that the first control server fail, sends working node more to the front server For request.
Wherein n is the frequency threshold value for pre-setting.
Step 312, the front server substitute request according to the working node, the second control clothes described in more new record The network address of business device, so that the second control server is used as working node.
Now, the front server needs record as the network address of the second control server of working node.
Step 313, the second control server are asked to each calculation server broadcasting tasks information.
Step 314, the calculation server listen to the broadcasting tasks information request when, to the preposition clothes Business device feedback task situation information.
The task situation information is sent to the described second control server by step 315, the front server.
Step 316, the second control server are generated according to the task situation information and the task feedback information Two parts of task list lists.
The task list list includes the IP address of calculation server and the cpu resource information of calculation server.
Step 317, the second control server determine different information according to two parts of task list lists.
After step 317, can be with execution step 318,319 or step 320.
If step 318 different information has been distributed to due to the first control after calculation server for the first control server Control server fault, is not synchronized to the task of the second control server, according to the control clothes of task situation information updating second The data synchronization information of business device.
If step 319 different information is the first control server has distributed to calculation server, and calculation server After process task failure, due to the first control server failure, the task of the second control server is not synchronized to, from the task Mission failure information is obtained in situation information, and corresponding for mission failure information data mining subtask is redistributed.
If step 320 different information is the first control server still unappropriated data mining subtask, will not yet The data mining subtask of distribution is distributed at calculation server by front server and is processed.
After step 318, step 319 or step 320 are completed, the second control server has been disposed as working node Finish, by the second control server as working node, return to step 304 continues whole flow process, and now second controls server Work process is identical with the first control server, and follow-up monitoring second controls the work of server by the first control server Carry out, two such controls server cycle alternation, constitutes the hot standby general layout of principal and subordinate.What deserves to be explained is, the first control clothes " first ", " second " in business device and the second control server is just for the sake of two control servers of difference, and two controls The 26S Proteasome Structure and Function of server is essentially identical, and when one of them is as working node, another is used as secondary node.
A kind of distributed data digging method provided in an embodiment of the present invention, in the first control service as working node When device breaks down, second control server can substitute described first control server become new working node, so as to After completing working node switching, recover the normal work of whole distributed data digging system.Certainly, when second controls server When breaking down, the first control server can also substitute the described second control server, and two control servers can be realized The hot standby general layout of principal and subordinate, it is to avoid the current JobTracker server as main controlled node will cause whole if breaking down The task scheduling system of individual hadoop framework paralyses, it is impossible to complete task scheduling and the problem for processing.
Those skilled in the art are it should be appreciated that embodiments of the invention can be provided as method, system or computer program Product.Therefore, the present invention can adopt complete hardware embodiment, complete software embodiment or with reference to software and hardware in terms of reality Apply the form of example.And, the present invention can be adopted in one or more computers for wherein including computer usable program code The upper computer program that implements of usable storage medium (including but not limited to disk memory, CD-ROM, optical memory etc.) is produced The form of product.
The present invention is the flow process with reference to method according to embodiments of the present invention, equipment (system) and computer program Figure and/or block diagram are describing.It should be understood that can be by computer program instructions flowchart and/or each stream in block diagram Journey and/or the combination of square frame and flow chart and/or the flow process in block diagram and/or square frame.These computer programs can be provided The processor of general purpose computer, special-purpose computer, Embedded Processor or other programmable data processing device is instructed to produce A raw machine so that produced for reality by the instruction of computer or the computing device of other programmable data processing device The device of the function of specifying in present one flow process of flow chart or multiple flow processs and/or one square frame of block diagram or multiple square frames.
These computer program instructions may be alternatively stored in and can guide computer or other programmable data processing device with spy Determine in the computer-readable memory that mode works so that the instruction being stored in the computer-readable memory is produced to be included to refer to Make the manufacture of device, the command device realize in one flow process of flow chart or multiple flow processs and/or one square frame of block diagram or The function of specifying in multiple square frames.
These computer program instructions can be also loaded in computer or other programmable data processing device so that in meter Series of operation steps is executed on calculation machine or other programmable devices to produce computer implemented process, so as in computer or The instruction for executing on other programmable devices is provided for realizing in one flow process of flow chart or multiple flow processs and/or block diagram one The step of function of specifying in individual square frame or multiple square frames.
Apply specific embodiment in the present invention to be set forth the principle of the present invention and embodiment, above example Explanation be only intended to help and understand the method for the present invention and its core concept;Simultaneously for one of ordinary skill in the art, According to the thought of the present invention, all will change in specific embodiments and applications, in sum, in this specification Appearance should not be construed as limiting the invention.

Claims (16)

1. a kind of distributed data digging system, it is characterised in that include:User terminal group, front server group, the first control Server, the second control server and calculation server group;The user terminal group includes multiple user terminals;Described preposition Server group includes multiple front servers;The calculation server group includes multiple calculation servers;The user terminal group Communicate to connect with the front server group;The front server group is controlled with the described first control server, second respectively Server and calculation server group communication connection;
The user terminal, for sending data mining task request to the front server;
The front server, for parsing the domain-name information of data mining task request, appoints according to the data mining Data mining task request is submitted to the domain-name information of business request the first control server as working node;
The first control server, for the data mining task is asked corresponding data mining task to be decomposed, Form multiple data mining subtasks;The plurality of data mining subtask is sent to the front server;
The front server, is additionally operable to the plurality of data mining subtask is assigned at multiple calculation servers and is located Reason, and the task feedback information of calculation server is received, and the task feedback information is sent to the described first control service Device;
The first control server, is additionally operable to for the task feedback information real-time synchronization to be sent to the second control server Place;
The second control server, for the first control server described in monitor in real time, is confirming the first control service When device breaks down, working node is sent to the front server and substitute request;
The front server, is additionally operable to substitute request according to the working node, the second control server described in more new record The network address so that described second control server as working node;
The second control server, is additionally operable to ask to each calculation server broadcasting tasks information;
The calculation server, is additionally operable to when broadcasting tasks information request is listened to, to the front server Feedback task situation information;
The front server, is additionally operable to for the task situation information to be sent to the described second control server;
The second control server, is additionally operable to contrast the task situation information and the task feedback information, really The fixed task situation information and the different information of the task feedback information, and according to the process strategy for pre-setting, to institute State different information to be processed.
2. distributed data digging system according to claim 1, it is characterised in that the front server, is additionally operable to Obtain the running state information of other front servers in front server group;Operation shape in other front servers The connection request of the user terminal being connected with other front servers when state is malfunction, is received, and sets up communication connection.
3. distributed data digging system according to claim 2, it is characterised in that the front server, is additionally operable to Record the net of the second control server as the first of the working node network address for controlling server or as working node Network address.
4. distributed data digging system according to claim 3, it is characterised in that the front server, concrete uses In the heartbeat message for receiving calculation server;The heartbeat message of the calculation server includes that calculation server processing data is excavated The task feedback information of subtask and the cpu resource information of calculation server;The heartbeat message of the calculation server is sent To the described first control server.
5. distributed data digging system according to claim 4, it is characterised in that the first control server, tool Body is used for, when multiple data mining subtasks are sent to the front server, sending data to the second control server same Step information;The data synchronization information includes the corresponding fortune of the mission number of data mining subtask and each data mining subtask Calculate the IP address of server;
After the heartbeat message for receiving calculation server, the heartbeat message real-time synchronization of calculation server is sent to the second control At control server.
6. distributed data digging system according to claim 5, it is characterised in that the second control server, tool Body is used for sending heartbeat request with prefixed time interval timing to the described first control server;If continuous n time to described the After one control server sends heartbeat request, the heart beating response message of the first control server is not all received, it is determined that described First control server fail, sends working node to the front server and substitutes request;Wherein n pre-sets Frequency threshold value.
7. distributed data digging system according to claim 6, it is characterised in that the second control server, tool Body is used for:
According to the task situation information and the task feedback information, two parts of task list lists are generated;The task list List includes the cpu resource information of the IP address of calculation server and calculation server;
According to two parts of task list lists, different information is determined;
If the different information has been distributed to due to the first control server failure after calculation server for the first control server, The task of the second control server is not synchronized to, controls the data syn-chronization of server according to task situation information updating second Information;
If the different information is the first control server has distributed to calculation server, and the failure of calculation server process task Afterwards, due to the first control server failure, the task of the second control server is not synchronized to, is obtained from the task situation information Mission failure information is taken, and corresponding for mission failure information data mining subtask is redistributed;
If the different information is the first control server still unappropriated data mining subtask, still unappropriated data will dig Pick subtask is distributed at calculation server by front server and is processed.
8. distributed data digging system according to claim 7, it is characterised in that the first control server, tool Body is used for the cpu resource information according to calculation server, and CPU money in each calculation server is distributed in a data mining subtask The calculation server of source maximum.
9. a kind of distributed data digging method, it is characterised in that be applied to distributed described in any one of claim 1 to 8 Data digging system, the system includes:User terminal group, front server group, the first control server, the second control server And calculation server group;The user terminal group includes multiple user terminals;The front server group includes multiple preposition Server;The calculation server group includes multiple calculation servers;The user terminal group is logical with the front server group Letter connection;The front server group controls server and the computational service with the described first control server, second respectively Device group is communicated to connect;
Methods described includes:
User terminal sends data mining task request to the front server;
Front server parses the domain-name information of the data mining task request, according to the domain of data mining task request Data mining task request is submitted to name information the first control server as working node;
The data mining task is asked corresponding data mining task to be decomposed by the first control server, forms many numbers According to excavation subtask, and the plurality of data mining subtask is sent to the front server;
The front server is assigned to the plurality of data mining subtask at multiple calculation servers and is processed, and connects The task feedback information of calculation server is received, and the task feedback information is sent to the described first control server;
The first control server is sent to the task feedback information real-time synchronization at the second control server;
First control server described in the second control server real-time monitoring, is confirming the first control server generation During fault, working node is sent to the front server and substitute request;
The front server substitutes request according to the working node, the network ground of the second control server described in more new record Location, so that the second control server is used as working node;
The second control server is asked to each calculation server broadcasting tasks information;
The calculation server feeds back task when broadcasting tasks information request is listened to the front server Situation information;
The task situation information is sent to the described second control server by the front server;
The second control server is contrasted to the task situation information and the task feedback information, determines described appointing Business situation information and the different information of the task feedback information, and according to the process strategy for pre-setting, the difference is believed Breath is processed.
10. distributed data digging method according to claim 9, it is characterised in that also include:
The front server obtains the running state information of other front servers in front server group;Described other When the running status of front server is malfunction, the connection for receiving the user terminal being connected with other front servers please Ask, and set up communication connection.
11. distributed data digging methods according to claim 10, it is characterised in that also include:
The front server record is as the network address of the first control server of working node or as working node The network address of the second control server.
12. distributed data digging methods according to claim 11, it is characterised in that the front server will be described Multiple data mining subtasks are assigned at multiple calculation servers and are processed, and receive the task feedback letter of calculation server Breath, and the task feedback information is sent to the described first control server, including:
The front server receives the heartbeat message of calculation server;The heartbeat message of the calculation server includes that computing takes Business device processing data excavates the task feedback information of subtask and the cpu resource information of calculation server;
The heartbeat message of the calculation server is sent to the described first control server.
13. distributed data digging methods according to claim 12, it is characterised in that also include:
The first control server controls to second when multiple data mining subtasks are sent to the front server Server sends data synchronization information;The data synchronization information includes that the mission number of data mining subtask and each data are dug The IP address of the corresponding calculation server in pick subtask;
The first control server is sent to the task feedback information real-time synchronization at the second control server, including:
Described first controls server after the heartbeat message for receiving calculation server, by the heartbeat message reality of calculation server When synchronized transmission to second control server at.
14. distributed data digging methods according to claim 13, it is characterised in that the second control server reality When monitor described first control server, confirm described first control server fail when, to the front server Send working node and substitute request, including:
The second control server sends heartbeat request with prefixed time interval timing to the described first control server;
If continuous n time sends after heartbeat request to the described first control server, the heart of the first control server is not all received Jumping response message, it is determined that the first control server fail, working node replacement is sent to the front server Request;Wherein n is the frequency threshold value for pre-setting.
15. distributed data digging methods according to claim 14, it is characterised in that the second control server pair The task situation information and the task feedback information are contrasted, and determine the task situation information and task feedback The different information of information, and according to the process strategy for pre-setting, the different information is processed, including:
The second control server generates two parts of task lists according to the task situation information and the task feedback information List;The task list list includes the IP address of calculation server and the cpu resource information of calculation server;
The second control server determines different information according to two parts of task list lists;
If the different information has been distributed to due to the first control server failure after calculation server for the first control server, The task of the second control server is not synchronized to, controls the data syn-chronization of server according to task situation information updating second Information;
If the different information is the first control server has distributed to calculation server, and the failure of calculation server process task Afterwards, due to the first control server failure, the task of the second control server is not synchronized to, is obtained from the task situation information Mission failure information is taken, and corresponding for mission failure information data mining subtask is redistributed;
If the different information is the first control server still unappropriated data mining subtask, still unappropriated data will dig Pick subtask is distributed at calculation server by front server and is processed.
16. distributed data digging methods according to claim 15, it is characterised in that also include:
One data mining subtask is distributed to respectively by the first control server according to the cpu resource information of calculation server The calculation server of cpu resource maximum in calculation server.
CN201610957904.2A 2016-10-27 2016-10-27 A kind of distributed data digging system and method Active CN106452899B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610957904.2A CN106452899B (en) 2016-10-27 2016-10-27 A kind of distributed data digging system and method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610957904.2A CN106452899B (en) 2016-10-27 2016-10-27 A kind of distributed data digging system and method

Publications (2)

Publication Number Publication Date
CN106452899A true CN106452899A (en) 2017-02-22
CN106452899B CN106452899B (en) 2019-09-06

Family

ID=58179730

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610957904.2A Active CN106452899B (en) 2016-10-27 2016-10-27 A kind of distributed data digging system and method

Country Status (1)

Country Link
CN (1) CN106452899B (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107025288A (en) * 2017-04-14 2017-08-08 四川九鼎瑞信软件开发有限公司 Distributed data digging method and system
CN109033196A (en) * 2018-06-28 2018-12-18 北京奇虎科技有限公司 A kind of distributed data scheduling system and method
CN110298549A (en) * 2019-05-30 2019-10-01 广东民航机场建设有限公司 Project task processing method, device, equipment and the medium of airport passenger throughput
CN110503431A (en) * 2019-08-07 2019-11-26 广州子牛区块链技术有限公司 A kind of personal node type dividend cell system based on block chain technology

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060149591A1 (en) * 2004-12-30 2006-07-06 Thomas Hanf System and methods for distributed analysis of patient records
CN102567396A (en) * 2010-12-30 2012-07-11 中国移动通信集团公司 Method, system and device for data mining on basis of cloud computing
CN102841822A (en) * 2011-06-23 2012-12-26 腾讯科技(深圳)有限公司 Method and system for performing crash protection on jobTracker hosts
CN103220183A (en) * 2013-05-02 2013-07-24 杭州电子科技大学 Implement method of Hadoop high-availability system based on double-main-engine warm backup
US9456049B2 (en) * 2011-11-22 2016-09-27 Netapp, Inc. Optimizing distributed data analytics for shared storage

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060149591A1 (en) * 2004-12-30 2006-07-06 Thomas Hanf System and methods for distributed analysis of patient records
CN102567396A (en) * 2010-12-30 2012-07-11 中国移动通信集团公司 Method, system and device for data mining on basis of cloud computing
CN102841822A (en) * 2011-06-23 2012-12-26 腾讯科技(深圳)有限公司 Method and system for performing crash protection on jobTracker hosts
US9456049B2 (en) * 2011-11-22 2016-09-27 Netapp, Inc. Optimizing distributed data analytics for shared storage
CN103220183A (en) * 2013-05-02 2013-07-24 杭州电子科技大学 Implement method of Hadoop high-availability system based on double-main-engine warm backup

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
刘明刚: "《MapReduce故障容错研究与作业调度器优化》", 《中国优秀硕士学位论文全文数据库 信息科技辑》 *

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107025288A (en) * 2017-04-14 2017-08-08 四川九鼎瑞信软件开发有限公司 Distributed data digging method and system
CN109033196A (en) * 2018-06-28 2018-12-18 北京奇虎科技有限公司 A kind of distributed data scheduling system and method
CN110298549A (en) * 2019-05-30 2019-10-01 广东民航机场建设有限公司 Project task processing method, device, equipment and the medium of airport passenger throughput
CN110298549B (en) * 2019-05-30 2022-05-06 广东省机场管理集团有限公司工程建设指挥部 Project task processing method, device, equipment and medium for airport construction engineering
CN110503431A (en) * 2019-08-07 2019-11-26 广州子牛区块链技术有限公司 A kind of personal node type dividend cell system based on block chain technology

Also Published As

Publication number Publication date
CN106452899B (en) 2019-09-06

Similar Documents

Publication Publication Date Title
CN106452899A (en) Distributed data mining system and method
CN108985937B (en) Computing resource sharing method based on block chain technology and block chain system
CN106331098A (en) Server cluster system
CN103369054B (en) A kind of acquisition tasks management method and system
JP2005209201A (en) Node management in high-availability cluster
CN104199957A (en) Redis universal agent implementation method
CN107959705A (en) The distribution method and control server of streaming computing task
CN106936620B (en) Alarm event processing method and processing device
CN110855737A (en) Consistency level controllable self-adaptive data synchronization method and system
CN111209301A (en) Method and system for improving operation performance based on dependency tree splitting
JP2013105381A (en) Communication system, method, and heartbeat proxy server
Guerraoui et al. Can 100 machines agree?
CN112492022A (en) Cluster, method, system and storage medium for improving database availability
CN103108000A (en) Task synchronization method and system and host node and work nodes in system
US20100262871A1 (en) Method for implementing highly available data parallel operations on a computational grid
CN113259177B (en) Reconstruction method and device of virtual network slice
Ali et al. Probabilistic normed load monitoring in large scale distributed systems using mobile agents
CN106230939A (en) A kind of application operation and monitoring method and system
CN113312059B (en) Service processing system, method and cloud native system
CN115391058A (en) SDN-based resource event processing method, resource creating method and system
CN107005434A (en) A kind of method, device and the equipment of synchronous virtual network function VNF states
CN114760304A (en) Computing power information processing method and system and computing power gateway
Stanik et al. Failover pattern with a self-healing mechanism for high availability cloud solutions
CN110647440A (en) Big data task processing method and system based on state machine
CN107040388A (en) Charge system and method

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant