CN1256684C - Machine group management system of large scale machine group and its information processing method - Google Patents

Publication number
CN1256684C
Authority
CN
China
Prior art keywords
node
client
next stage
sub
server end
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN 02159906
Other languages
Chinese (zh)
Other versions
CN1512371A (en)
Inventor
赵玉萍
肖利民
杨帆
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Lenovo Beijing Ltd
Original Assignee
Lenovo Beijing Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Lenovo Beijing Ltd
Priority to CN 02159906
Publication of CN1512371A
Application granted
Publication of CN1256684C
Anticipated expiration
Expired - Fee Related

Landscapes

  • Computer And Data Communications (AREA)

Abstract

The present invention discloses a cluster management system. In addition to one client serving as the top-level node and more than one server end serving as bottom-level nodes, the system further comprises one or more intermediate levels, each comprising more than one sub-service node; each sub-service node in turn comprises a sub-service node server end and a sub-service node client. The top-level node, the sub-service nodes of each level, and the bottom-level nodes are connected in sequence: the client of each node is connected to the server ends of the nodes at the next level down, and the clients of those nodes are in turn connected to the server ends of the nodes one level further down. The invention also discloses an information processing method for this cluster management system. The invention makes it possible to manage a large-scale cluster system.

Description

A cluster management system for a large-scale cluster and an information processing method thereof
Technical field
The present invention relates to computer cluster management technology, and in particular to a cluster management system and an information processing method thereof.
Background art
As a tool that can compute quickly and process data in complex ways, the computer plays an important role in current scientific and technological development and in daily life. As demands on computing speed and processing power keep rising, a single computer is increasingly unable to keep up. On the one hand, faster supercomputers are therefore being actively developed; on the other hand, increasingly mature network technology makes it possible to link a large number of ordinary computers together, share their resources, and use their computing power in a unified way, thereby achieving higher processing capability. Computer cluster systems emerged for this reason.
A cluster system is a network system formed by a group of mutually independent computers interconnected by a high-speed network. Once a large number of computers are linked together, the question is how to bring them into play in a concentrated and unified way so as to achieve wider resource sharing and more complex processing; this requires a cluster management system that manages the cluster system. A cluster management system is a logical concept: by dividing work across the cluster system and running the corresponding management software on each individual computer, it makes the computers cooperate within the whole cluster system, so that the resources and processing power of each computer can be fully used and more complex computations can be carried out through parallel processing.
Most present cluster management systems adopt a distributed software architecture, namely a client-server structure. In this structure the client receives a user's request and sends it to each server end; each server end accepts the request, executes the command, and returns the execution result to the client, which processes it and feeds it back to the user. Several communication mechanisms are available between the client and the server ends, for example Java RMI, the CORBA communication mechanism, and socket-based communication. When the number of nodes is large, a socket-based communication mechanism is generally adopted at present, in view of factors such as system resource usage and the response speed, reliability, and stability of commands.
In the socket communication mechanism, each communicating host has an IP address that is unique within its network environment. A host often runs several communicating programs, each of which occupies a communication port. An IP address together with a port therefore determines the location of a communicating program, that is, a certain program on a certain IP address occupying a certain port. Of two socket programs communicating with each other, one is called the client program (client) and the other the service program (server); the service program always waits passively for connections. After the service program starts and finishes initializing, it enters the waiting-for-connection state. After the client program starts, it first sends a connection request to the remote service program; on receiving the request, the service program establishes the connection according to certain rules, and once the connection is complete the two programs have set up a virtual data communication link. The programs can then read and write data directly through the sockets they have opened, without concern for the actual data link.
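The client and server roles described above can be sketched with Python's standard `socket` module. This is only an illustration on the loopback interface; the names `serve_once` and `run_demo`, the `uptime` command string, and the `done:` reply format are assumptions made for the example and do not come from the patent.

```python
import socket
import threading


def serve_once(listener: socket.socket) -> None:
    """Passive role: wait for one connection, answer it, then close."""
    conn, _addr = listener.accept()
    with conn:
        request = conn.recv(1024)
        conn.sendall(b"done: " + request)


def run_demo() -> bytes:
    # Service program: bind to an (IP address, port) pair and wait passively.
    listener = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
    listener.bind(("127.0.0.1", 0))  # port 0 lets the OS pick a free port
    listener.listen(1)
    host, port = listener.getsockname()

    worker = threading.Thread(target=serve_once, args=(listener,))
    worker.start()

    # Client program: actively connect to that (IP address, port) pair,
    # then simply write to and read from its own socket.
    with socket.create_connection((host, port)) as client:
        client.sendall(b"uptime")
        reply = client.recv(1024)

    worker.join()
    listener.close()
    return reply
```

Calling `run_demo()` starts the listener, connects to it, and returns the server's reply to the client.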
As shown in Fig. 1, the distributed structure of a cluster management system using the socket communication mechanism usually adopts a concurrent programming model of a single client and multiple server ends. This structure has only one client for receiving requests from the user, and a request is normally a command to be executed in parallel on multiple nodes of the cluster. After the client receives such a multi-node parallel command request, it creates one socket connection for each requested node and requests service from the server process on that node. After the last socket has received its response, the results are processed and returned to the upper-layer program.
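The single-client, multi-server pattern can be sketched as follows. This is a minimal illustration rather than the patent's implementation: `run_on_node` is a hypothetical stand-in for opening a socket to one node and waiting for its reply, and the thread pool uses one worker per node to mirror the one-connection-per-node design.

```python
from concurrent.futures import ThreadPoolExecutor


def run_on_node(node_ip: str, command: str) -> str:
    # Hypothetical stand-in: a real client would open a socket to node_ip,
    # send the command, and block until the server process replies.
    return f"{node_ip}: {command} ok"


def parallel_command(node_ips, command):
    """Send one command to every node concurrently and collect the replies."""
    # One worker (and, in a real system, one socket) per requested node.
    with ThreadPoolExecutor(max_workers=len(node_ips)) as pool:
        replies = list(pool.map(lambda ip: run_on_node(ip, command), node_ips))
    # pool.map yields replies in the same order as node_ips.
    return dict(zip(node_ips, replies))
```

In this sketch the number of workers grows linearly with the number of nodes, which is the property that makes the flat structure hard to scale.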
This programming model suits a small number of nodes. When the number of nodes is very large, the structure cannot let many nodes execute commands concurrently, because the operating system limits the number of files that may be open at the same time and the socket communication mechanism is limited in the number of threads. Management systems of this kind can therefore provide only basic system management and lack both extensibility and the ability to manage the nodes of a large-scale cluster. As science, technology, and networks develop and demands on computing speed and processing power rise further, a cluster system needs to include more computers, so the number of nodes in the system grows. Present cluster management systems then fall short in managing larger and more complex cluster systems, which seriously affects the application of large-scale clusters and thus holds back further progress.
Summary of the invention
In view of this, and in order to overcome the shortcomings of the prior art, one object of the present invention is to provide an extensible cluster management system capable of managing a large-scale computer cluster.
Another object of the present invention is to provide a method by which the above cluster management system performs information processing.
The above objects are achieved by the following technical solutions:
A cluster management system comprises one client serving as the top-level node and more than one server end serving as bottom-level nodes, and further comprises one or more levels, each comprising more than one sub-service node, wherein each sub-service node further comprises a sub-service node server end and a sub-service node client. The top-level node, the sub-service nodes of each level, and the bottom-level nodes are connected in sequence in such a way that the client of a node at one level is connected to the server ends of its corresponding nodes at the next level, and the client of each next-level node is connected to the server ends of its corresponding nodes one level further down.
In the above cluster management system, there may be a single level of sub-service nodes between the top-level node and the bottom-level nodes, and the number of next-level nodes connected to each upper-level node may be equal. The connections between nodes of adjacent levels may be socket communication connections.
A method of information processing based on the above cluster management system comprises the following steps:
a. After the client of the top-level node receives a user request packet, it extracts the IP address of the destination server end contained in the packet and determines the destination sub-service node at each level through which the packet is to be sent;
b. The top-level node sends the packet to the server end of the next-level node determined in step a; that server end forwards the packet to the client of the same node, which forwards it to the server end of a node at the next level down; in this way the packet is forwarded level by level until it reaches the server end of the bottom-level node;
c. The server end of the bottom-level node executes the command, and the packet containing the execution result is forwarded to the client of the top-level node in the reverse of the order in which the packet was forwarded in step b.
In the above information processing method, a tree-shaped mapping table from the top-level node to the bottom-level nodes may be set up in advance in the client of the top-level node when the cluster management system is established, and the destination sub-service node at each level in step a is determined from this table.
As the technical solution shows, sub-service nodes are added to the existing cluster management system, so the server processes running on the nodes of the cluster are no longer independent of one another: they are divided into several groups, and each group communicates with the top-level node through a sub-service node. The grouped server processes no longer communicate directly with the client process of the top-level node but with the sub-service node of their group. The sub-service node of each group has a dual identity while a parallel command is carried out: it is the client of the server ends of the bottom-level nodes and also the server end for the client of the top-level node. As a server end it receives the connection request and the command sent by the client of the top-level node, but it does not execute the command. Instead, acting as a client, it connects to the server processes in its group as the request from the top-level client requires and passes the command on to them. Each server process actually executes the command and returns the result to the sub-service node of its group, which gathers the results and returns them to the client of the top-level node, that is, the client of the prior art. In this way the number of socket connections that the top-level node must establish is greatly reduced, which keeps it within the operating system's limit on simultaneously open files and the socket communication mechanism's limit on the number of threads. More nodes can execute commands concurrently, and the ability of the cluster management system to manage a large-scale cluster system is greatly improved.
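Rough arithmetic illustrates the reduction in connections. The sketch below assumes a single level of sub-service nodes and groups of at most `group_size` servers; the function names and the figures in the usage note are illustrative assumptions, not values from the patent.

```python
import math


def main_node_connections(server_count: int, group_size: int) -> int:
    """Sockets the top-level client opens: one per sub-service node."""
    return math.ceil(server_count / group_size)


def busiest_node_connections(server_count: int, group_size: int) -> int:
    """Largest number of sockets held by any single node in the tree."""
    groups = main_node_connections(server_count, group_size)
    # A sub-service node holds one upstream socket plus one per server in its group.
    per_sub_node = 1 + min(group_size, server_count)
    return max(groups, per_sub_node)
```

Under these assumptions, 1024 servers in groups of 32 need 32 connections at the top-level client instead of 1024, and no single node holds more than 33.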
Moreover, because the invention extends system functionality by adding sub-service nodes, it occupies few system resources and largely preserves the command execution speed of the original system; performance is enhanced without a large increase in resource usage or a large drop in processing speed. Because the way sub-service nodes connect to server ends and clients is unchanged, the internal change is transparent both to the user above the client and to the commands below the server end, which makes the system easier to maintain. The invention allows several levels of sub-service nodes, so the scale of the managed cluster can be extended arbitrarily, greatly improving the extensibility of the system.
Description of drawings
Fig. 1 is a schematic diagram of a cluster management system with a single-client, multi-server structure according to the prior art;
Fig. 2 is a schematic diagram of a cluster management system with a grouped single-client, multi-server structure according to the present invention;
Fig. 3 is a flow chart of information processing in the cluster management system according to the present invention.
Detailed description of the embodiments
The present invention is described in detail below with reference to the drawings and specific embodiments.
The present invention extends the prior-art cluster management system with a grouped structure. Fig. 2 is a schematic diagram of a cluster management system with a grouped single-client, multi-server structure according to the invention; for ease of explanation, Fig. 2 shows the case of a single level of sub-service nodes. As shown in Fig. 2, the original client is called the client of the top-level node, also referred to as the client of the main service node, and the original server ends are called the server ends of the bottom-level nodes. While the original client and server ends are retained, sub-service nodes are added; each sub-service node has its own server end and client, called the server end of the sub-service node and the client of the sub-service node respectively. The server ends of the bottom-level nodes are divided into groups, and the number of groups equals the number of sub-service nodes. The server ends in one group each establish a socket connection with the client of the corresponding sub-service node, and the server end of that sub-service node establishes a socket connection with the client of the main service node. Connecting every group of bottom-level server ends, the corresponding sub-service nodes, and the client of the main service node in this way forms the cluster management system of the invention. When a user's parallel command request is carried out, the client role is thus no longer played by a single node but jointly by the main service node and the sub-service nodes, and the main service node, the sub-service nodes, and the many server ends form a tree structure.
The number of sub-service nodes to set up, and the number of server ends that establish socket connections with each sub-service node, are decided according to the scale of the cluster system. Usually the number of server ends connected to each sub-service node should be the same or roughly the same; that is, server ends are preferably grouped evenly. Packets from the main service node are then distributed roughly evenly among the sub-service nodes, which avoids unbalanced load on the sub-service nodes and keeps the system working smoothly.
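One simple way to obtain groups of equal or nearly equal size is round-robin assignment, sketched below. The patent does not prescribe a particular grouping algorithm, so this choice and the name `group_servers` are assumptions made for illustration.

```python
def group_servers(server_ips, sub_node_count):
    """Split server addresses into sub_node_count groups whose sizes differ by at most one."""
    groups = [[] for _ in range(sub_node_count)]
    for index, ip in enumerate(server_ips):
        # Deal the servers out in turn, like cards, so no group gets far ahead.
        groups[index % sub_node_count].append(ip)
    return groups
```

For example, ten servers split across three sub-service nodes yield groups of four, three, and three servers.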
Having introduced the structure of the cluster management system, the following describes how it processes information. Once sub-service nodes have been added as described and the tree structure between the main service node, the sub-service nodes, and the server ends has been established, a mapping table between them is first set up at the main service node. The table records through which sub-service node each server end is connected to the main service node, so that when the main service node receives a packet containing a server end's IP address, it knows which sub-service node to deliver the packet to. The subsequent processing steps are shown in Fig. 3.
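The mapping table can be pictured as a dictionary from server IP address to sub-service node. The sketch below is an assumption about one possible representation; the node names, the addresses, and the `target_ip` packet field are hypothetical.

```python
from typing import Dict, List


def build_mapping(groups: Dict[str, List[str]]) -> Dict[str, str]:
    """Invert {sub-service node: [server IPs]} into {server IP: sub-service node}."""
    mapping: Dict[str, str] = {}
    for sub_node, server_ips in groups.items():
        for ip in server_ips:
            mapping[ip] = sub_node
    return mapping


def destination_sub_node(packet: Dict[str, str], mapping: Dict[str, str]) -> str:
    """Look up which sub-service node serves the packet's destination server."""
    return mapping[packet["target_ip"]]


# Example topology: two sub-service nodes, each fronting two servers.
example_groups = {
    "sub-1": ["10.0.1.1", "10.0.1.2"],
    "sub-2": ["10.0.2.1", "10.0.2.2"],
}
example_mapping = build_mapping(example_groups)
```

Building the table once, when the tree is set up, keeps the per-packet routing decision to a single dictionary lookup.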
In the cluster management system of the invention, after the main service node receives a request packet from the user in step S301, it first extracts, in step S302, the IP address of the destination server end contained in the packet. In step S303 it uses the mapping table to determine which sub-service node has a socket connection with the server end at that IP address, and thereby determines the destination sub-service node for the packet. In step S304 the main service node sends the packet to the server end of that sub-service node.
After the server end of the sub-service node receives the user request packet sent by the client of the main service node, it does not process the packet as an ordinary server end would. Instead, in step S305, it forwards the packet directly to the client located in the same sub-service node, and in step S306 that client sends the packet to the server end at the IP address contained in the packet.
After receiving the packet from the sub-service node, the server end executes the command in step S307, as in the prior art, and in step S308 returns a packet containing the execution result to the client of the sub-service node connected to it. The client of the sub-service node likewise does not process this packet but forwards it directly to the server end of the sub-service node in step S309, and in step S310 the server end of the sub-service node sends the packet to the client of the main service node.
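Steps S301 to S310 can be traced with a small in-process simulation in which method calls stand in for socket sends and return values stand in for the reply path. All class names, field names, and addresses below are hypothetical; this is a sketch of the flow, not the patent's code.

```python
class Server:
    """Bottom-level server end: the only place a command is executed."""

    def __init__(self, ip):
        self.ip = ip

    def handle(self, packet):
        # S307: execute the command. S308: return a packet holding the result.
        return {"target_ip": self.ip,
                "result": f"{packet['command']} executed on {self.ip}"}


class SubServiceNode:
    """Holds a server end (facing the main node) and a client (facing servers)."""

    def __init__(self, name, servers):
        self.name = name
        self.servers = {server.ip: server for server in servers}

    def server_end(self, packet):
        # S305: pass the request to this node's own client without processing it.
        # S309/S310: the reply travels back up through the same call chain.
        return self.client_end(packet)

    def client_end(self, packet):
        # S306: send the packet to the server end at the contained IP address.
        return self.servers[packet["target_ip"]].handle(packet)


class MainServiceNode:
    def __init__(self, sub_nodes):
        # Mapping table: server IP -> sub-service node that fronts it.
        self.mapping = {ip: node for node in sub_nodes for ip in node.servers}

    def request(self, packet):
        target_ip = packet["target_ip"]      # S301/S302: receive, extract the IP
        sub_node = self.mapping[target_ip]   # S303: look up the sub-service node
        return sub_node.server_end(packet)   # S304: send to its server end


sub_1 = SubServiceNode("sub-1", [Server("10.0.1.1"), Server("10.0.1.2")])
sub_2 = SubServiceNode("sub-2", [Server("10.0.2.1")])
main_node = MainServiceNode([sub_1, sub_2])
reply = main_node.request({"target_ip": "10.0.2.1", "command": "uptime"})
```

Note that only `Server.handle` touches the command; both halves of the sub-service node merely relay packets, as the description requires.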
The above is the process by which the invention executes one user command. The client of the main service node can receive several user commands at once, and several such information processing flows can run in the cluster management system simultaneously. Because the main service node no longer connects directly to each server end but to the sub-service nodes, the numbers of files and threads that must be opened at the main service node are greatly reduced, and most files and threads can be opened at the sub-service nodes instead. Compared with the prior art, the cluster management system of the invention therefore lets the main service node accept more user commands and carry out more complex computations.
The cluster management system and method shown in the embodiment have only one level of sub-service nodes, but those skilled in the art will readily appreciate that, if the system has more nodes, two, three, or more levels of sub-service nodes may be provided. It suffices to connect the client of each upper-level node to the server ends of the next-level nodes, and the clients of those nodes to the server ends of the nodes one level further down; connecting the levels in sequence in this way forms a larger tree structure. A system built this way can be applied to larger cluster systems and can process more, and more complex, commands in parallel. The embodiment shown is therefore to be understood as merely illustrating the spirit of the invention and not as limiting the invention.
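With several levels, the route to a server is the chain of nodes between the root and that server. The sketch below assumes the tree is stored as a child-to-parent dictionary; that representation, the function name, and the node names are illustrative assumptions.

```python
def path_to_server(parent, server_ip):
    """Return the nodes a packet passes through, from the top-level node down to server_ip."""
    path = [server_ip]
    # Walk upward until reaching a node with no recorded parent (the top-level node).
    while path[-1] in parent:
        path.append(parent[path[-1]])
    path.reverse()
    return path


# Two levels of sub-service nodes between the main node and the servers.
example_parent = {
    "10.0.0.1": "sub-1a",
    "10.0.0.2": "sub-1a",
    "sub-1a": "sub-1",
    "sub-1": "main",
}
```

The request follows this path downward and the result packet follows it in reverse, whatever the number of levels.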

Claims (6)

1. A cluster management system comprising one client serving as the top-level node and more than one server end serving as bottom-level nodes, characterized in that the system further comprises one or more levels, each comprising more than one sub-service node; each said sub-service node further comprises a sub-service node server end and a sub-service node client; and said top-level node, the sub-service nodes of each level, and the bottom-level nodes are connected in sequence in such a way that the client of a node at one level is connected to the server ends of its corresponding nodes at the next level, and the client of each next-level node is connected to the server ends of its corresponding nodes one level further down.
2. The cluster management system according to claim 1, wherein there is one level of sub-service nodes between said top-level node and the bottom-level nodes.
3. The cluster management system according to claim 1, characterized in that the number of said next-level nodes connected to each upper-level node is equal.
4. The cluster management system according to claim 1, characterized in that the connections between said nodes of each level are socket communication connections.
5. A method of information processing performed by the cluster management system of claim 1, comprising the following steps:
a. After the client of the top-level node receives a user request packet, it extracts the IP address of the destination server end contained in the packet and determines the destination sub-service node at each level through which the packet is to be sent;
b. The top-level node sends the packet to the server end of the next-level node determined in step a; said server end forwards the packet to the client of the same node, which forwards it to the server end of a node at the next level down; in this way the packet is forwarded level by level until it reaches the server end of the bottom-level node;
c. The server end of the bottom-level node executes the command, and the packet containing the execution result is forwarded to the client of the top-level node in the reverse of the order in which the packet was forwarded in step b.
6. The information processing method according to claim 5, characterized in that a tree-shaped mapping table from the top-level node to the bottom-level nodes is set up in advance in the client of the top-level node when the cluster management system is established, and the destination sub-service node at each level in step a is determined from said tree-shaped mapping table.

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN 02159906 CN1256684C (en) 2002-12-27 2002-12-27 Machine group management system of large scale machine group and its information processing method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN 02159906 CN1256684C (en) 2002-12-27 2002-12-27 Machine group management system of large scale machine group and its information processing method

Publications (2)

Publication Number Publication Date
CN1512371A CN1512371A (en) 2004-07-14
CN1256684C true CN1256684C (en) 2006-05-17

Family

ID=34237681

Family Applications (1)

Application Number Title Priority Date Filing Date
CN 02159906 Expired - Fee Related CN1256684C (en) 2002-12-27 2002-12-27 Machine group management system of large scale machine group and its information processing method

Country Status (1)

Country Link
CN (1) CN1256684C (en)

Also Published As

Publication number Publication date
CN1512371A (en) 2004-07-14

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20060517

Termination date: 20201227
