CN1592231A - Maintaining unit structure of high extending internet superserver and its method - Google Patents
Maintaining unit structure of high extending internet superserver and its method Download PDFInfo
- Publication number
- CN1592231A CN1592231A CN 200410064294 CN200410064294A CN1592231A CN 1592231 A CN1592231 A CN 1592231A CN 200410064294 CN200410064294 CN 200410064294 CN 200410064294 A CN200410064294 A CN 200410064294A CN 1592231 A CN1592231 A CN 1592231A
- Authority
- CN
- China
- Prior art keywords
- server
- adss
- blade
- internet
- database
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Images
Landscapes
- Computer And Data Communications (AREA)
Abstract
The present invention discloses a maintenance unit structure of the high-expandability internet super sever and the method, and when the ADSS server has a malfunction the high-expandability internet super server dynamically redistributes a server to operate. The first and the second ADSS servers image to each other and comprise a corresponding data base including redundant data, a field host machine control protocol server, an XML interface and a monitoring timer. The ADSS server is connected with at least one server operating system and a storing switchboard; and the storing switchboard is connected with at least one storing unit. The second ADSS server detects the malfunction of the first ADSS server with a heartbeat monitoring arithmetic, and the automatic start malfunction backup switches the function to the second ADSS server. The structure also comprises a monitoring data management setting which is composed of considerable servers that are connected with the stellate setting arrangement of the data management units and are rearranged.
Description
Technical field
The present invention relates to the data processing commercial kitchen area, be meant a kind of maintaining unit structure and method of high extending internet superserver especially.
Background technology
Commercialization ISP and server service provider appear at the fast development that has promoted the Internet to a great extent, for example Internet service provider (ISPs), ASP (ASPs), stand alone software merchant (ISVs), the scheme consulting developer (ESPs) of enterprise and Management Advisory Services developer (MSPs) or the like.About above these services, here not clear and definite definition, but as a rule, the service that these service suppliers and main process equipment merchant are provided will be catered to exactly, most of even whole clients' demand, these demands are mode then about host application, website exploitation, managing eBusiness and server allotment with earning construction cost or periodic service expense.For example, at the server allocation process, expense mainly results from according on client's special demands and the hardware and software specification for its application and website appointment setting.As purpose of the present invention, term " host services " intention is provided by the various dissimilar service that is provided by service supplier in this field and main process equipment merchant.For simplicity, we be referred to as these service suppliers and main process equipment merchant for " server service provider " (HSP).
Just as providing the mode of line, commercial HSPs to offer one of user by international telephony network, telephone operator can enter the passage that network host is used between their client.HSPs is used to provide the computer equipment of host application and service, is commonly referred to server.In the simplest mode, this server can be one and be connected to PC on the Internet by socket that it can move according to the custom-designed special software of the customer requirements of this server.Employed various mode when providing host services for HSPs, most of HSPs will use one group of server set that connects with internal network.This server set is exactly our usually said " server farm " (server farm).In this " group ", each server can be finished its unique task, also can share multinomial different task by several servers, for example mailbox server, the webserver, certificate server and can take into account management server.When providing host services for the World Wide Web website, for example single webserver is generally a lot of small-sized World Wide Web websites gatherings and provides support, and big website then need be supported operation by the special webserver.
Along with the increase day by day to the Internet service demand, the market space of Internet industry is also just increasing, needs bigger capacity to satisfy this type of demand.A kind of method that satisfies this class market demand is exactly to utilize the computer system of bigger ability as server.Large-scale main frame and medium-sized computer system begin to be used to do the server of large-scale internet sites and common network, most HSPs is owing to the high cost of considering these systems, complexity and lack flexibility and be not inclined to the use large computer system, these HSPs are ready to use " server farm " be made up of many PC servers (serverfarm) to support to move on the contrary, these servers are connected on the shared the Internet line or modulator-demodulator group, also can enter in one group of disc driver sometimes.When HSP increased a host services client, one or more PC servers manually was increased among the HSP " server farm ", and the client has installed specific software and data for this reason, as Web content.In this way, the hardware of HSP configuration certain level only is in order to support its current client's demand.It is also important that, HSP can to the client collect the early stage installation cost to pay the prime cost of this hardware.
For HSP, a large amount of account softwares can be used for collecting the expense of these metering service, for example the HSP Power of the XaCCT of rens.com and inovaware.com.Other software program of having developed is in order to auxiliary HSP network management, for example network service management of IP Magic, the resonate.com of lightspeedsystem.com and the MAMBA of laminate.com.By making in this way, the expense of the large computer system that HSP needn't the subsidiary bulk redundancy capacity of prepayment, and these expenses are can not be HSP generation income immediately.Compare for different clients provides support with mainframe of use, " server farm " provides a cover simple method, and this method can be guaranteed the fail safe and the data integrity of customer data in the running environment of different clients' coexistences.If software that server loaded and data only are the particular customer service, then there is no question about in the fail safe of customer information.If the server that is a customer service only loads this client's software, and only be connected to this client's data, customer information thereby obtain independent process then, its fail safe is protected.The management of HSP and operation have become the theme of each paper and seminar, as Hursti, Jani and " management that access internet and service provide " of the network interconnection seminar held on April 19th, 1999.The representative instance that disposes various hardware, software, maintenance and support for the commercial rank that internet access and mainframe network website are provided about the HSP every month rackspace.com that can browse web sites.
When the client need increase or reduce the quantity of service, HSP will manually add or delete server and add to the HSP server farm or from the HSP server farm or the deletion server, and this server farm is directly connected on the storage and network interconnection of client web site.When adding service, key step is as follows: (1) receives the order of change service from the host services customers' place, (2) HSP obtains new server hardware to satisfy required change, (3) the HSP professional is at server farm position build-in services device hardware, (4) adding server hardware is wired on the storage and network connectivity of this website, (5) be the server hardware load software, the HSP professional is by a series of initialization steps, by customer requirement this software is configured, (6) will newly install and join in the server farm, for the client provides host services through the server of complete configuration.In either case, each server farm is assigned to a particular customer, and server farm must be configured to satisfy to greatest extent client's demand for services.
At first, must restart the part or all of existing server of managing in the group and finish said process, reflect that new server adds the situation in the server farm to because pointer in the existing server and form need manually more newly arrive.This demand regulation is only regularly to change server hardware in well-defined server window, for example in late into the night in certain evening on Sunday.In the recent period, as MicrosoftWindows 2000, Microsoft Cluster Server, Oracle Parallel Server, WindowsNetwork Load Balancing Service softwares such as (NLB) and similar program have developed, and expand to the new server of the automatic permission time in office and join in the existing group, and need not in these well-defined windows, manually to carry out.
This type of server is integrated to have high efficiency, and a service groups workload is excessive especially therein, and another service groups workload is when too small.Under the sort of situation, server can be switched to another service groups from a service groups.The patent No. 5,951,694 have described the execution path of software on the concrete management server, and it is balanced more to guarantee the request of management group in different service groups that its working load equalization scheme is revised the mapping form.
A plurality of patents described single troop or the management group in server between carry out the workload equilibrium technology.U.S. Patent number 6,006,259 have described the safety that is included under the master server control and the software of heartbeat setting is trooped, and all members in trooping have distributed common IP address, and load balancing is just in the middle execution of trooping.U.S. Patent number 5,537,542,5,948,065 and 5,974,462 have described balanced setting of various workloads of the multisystem computer treatment system that possesses shared data space.Can between client and server, insert an intermediate system in addition and finish distribution work between server.U.S. Patent number 6,097,882 have described the dubbing system between client and the server, this dubbing system IP packet that changes its course on the basis of server availability and workload.
A weak point of management server and computer hardware is the possibility that nextport hardware component NextPort breaks down.In this case, well-known, server system enters the fault backup mode.The fault backup mode is a kind of backup operation pattern, and in this pattern, because fault or the machine of delaying be when causing the one-level component failures, the level two assembly will be carried out the function of a level assembly (as processor, server, network or database).Automatic Program sends the unloading task to the back-up system assembly, so that seamless as far as possible concerning the end user.In network internal, the fault backup can be applied to any assembly or component system, for example access path, memory device or the webserver.
U.S. Patent number 5,615,329 have described the method for automatic eliminating network internal hardware component failures, it comprises that redundant hardware is set carries out remote data mirroring, firsts and seconds computer system realizes by using specially independently for this, and wherein level two is taken over the function of carrying out a level system when a level system breaks down.The problem of these mirror image settings is to cost an arm and a leg and waste that resource, particularly level two are in idle standby mode when wait one level system is made mistakes.
U.S. Patent number 5,696,895 have described another solution, and promptly each server is carried out the task of itself, other servers break down but each server all is assigned with as the backup of a server in other server.This makes that being carried out by two servers of task can be continued on backup server, but performance can be demoted.Other example of this type of solution has work allocation server node (POD) server design and USI integrated network service (Complex Web Service).Being used to the nextport hardware component NextPort of these services is provided is the predefine evaluation work distribution server node that comprises load balancing software, and this also can get rid of the fault of management group internal hardware assembly.Even if use this predefine evaluation work distribution server node, also need to take doing homework in a week and install.
All these solutions can be based on the management group of existing hardware calculated capacity inner management, balanced operation amount automatically and find out hardware fault; Yet seldom solution can be used extra hardware resource automatically to the management group.If know the demand of additional hardware resources in advance, modal solution is exactly to organize pre-configured hardware resource for management on the basis of maximum prefetch survey demand, make the management group make correct response during demand in the peak phase, and the additional hardware resources that satisfies this peak requirements is underutilized in At All Other Times, therefore, because underusing of hardware resource just increased for the management group provides the cost of host services.
Fig. 1 shows the storage area network schematic diagram, as shown in Figure 1, comprise memory in the storage area network (SAN, Storage Area Network), as disk or be positioned at the disk array (RAID, Redundant Array of Inexpensive Disk) of calculation server outside.These RAID memories are called as optical-fibre channel (FC, Fiber Channel) technical battery by use and are connected to server, and this optical-fibre channel technology is a kind of network technology, and it comprises conveyer, as fiber optic cables (Fiber Optic Cable); With distributed exchanged form, as fibre channel media; And the pci card that connection is provided for server (host bus coupling or HBA).Said system is very expensive, and is mainly used in providing the memory capacity that exceeds the original storage of server rack to server on the industry.
Though the method that certain redundancy is provided for these type systems has been arranged, but because RAID only is single, self-contained equipment, still can't be by freely distributing the quantity with the server that loads balance to be connected with RAID equipment, specifically defective is as follows: each client server must carry out manual configuration and just can be connected on the RAID equipment; Traditional solution requirement client server disk at first internally begins to start, then after finishing configuration, add exterior storage by storage area network again, this just need store the configuration information that requires to be connected to storage area network on the client server into after manual configuration; If RAID equipment complete failure so just recovers this fault without any method at all, there is not any mode can successfully server be switched to another storage device yet; Because what use is manual configuration, therefore reconfigures solution and may realize hardly by telemanagement; The flexibility of above-mentioned this solution is very limited, and cost is very high.
Although the HSP way to manage has had some important raisings, and developed the operation that a lot of programs and instrument are assisted the HSP network, but the basic fundamental that HSP is used to create with the physical resource of maintenance server group changes very little, therefore, be desirable to provide a kind of more efficiently mode and operate HSP, to improve the physical resource management of server farm.
Summary of the invention
In view of this, one object of the present invention is to provide a kind of maintaining unit structure of high extending internet superserver, another object of the present invention is to provide a kind of maintaining method of high extending internet superserver, to improve the physical resource management of server farm.
In order to achieve the above object, the invention provides a kind of maintaining unit structure of high extending internet superserver, comprising:
At least one is connected to the blade server that the Internet exchange is provided with;
The one ADSS server is connected to one or more blade servers by the Internet switch, and an ADSS server comprises,
First database, this database are connected to first first Internet protocol IP address server that is adapted at distributing IP address in the framework,
The one XML interface, this XML interface are connected between a server OS and the ADSS server;
The 2nd ADSS server, this ADSS server is connected to one or more blade servers by the Internet exchange setting, and the 2nd ADSS server comprises,
Second database, when the one ADSS server breaks down, this database is connected to the second Internet protocol IP address server that is adapted at distributing IP address in the framework, and be suitably for the user and provide the 2nd ADSS server of directory service to be connected, wherein second database is connected to first database, and comprise from the redundant information of first database and
The 2nd XML interface, this XML interface are connected between server OS and the 2nd ADSS server;
Server OS is connected with at least one monitor data management devices, and the 2nd ADSS server uses the heartbeat monitor algorithm to detect the fault of an ADSS server, and starts fault backup conversion the one ADSS server capability to the two ADSS servers;
Storage switch, it is connected with the 2nd ADSS server with an ADSS server; And memory cell, this memory cell is connected with storage switch.
The described first Internet protocol address server and the second Internet protocol address server use from comprising dynamic host configuration protocol DHCP and starting the communication protocol of selecting the agreement BOOTP group.
Described first database and second database are used for storage and receive and initiating equipment address, active volume position and Storage Mapping information.
A described ADSS server and the 2nd ADSS server further comprise: supervision timer is used to restart server operation.
Described monitor data management devices comprises: monitoring management cell S MU is connected with one or more Data Management Unit DMU, and each Data Management Unit is connected with the blade server that one or more reconfigure.
Described monitor data management devices comprises:
The Data Management Unit DMU that is connected with blade server that one or more reconfigure, be used to monitor blade server state, control electric power function, respond and between each blade server, switch from the order of input/output device, and monitor each blade server function, by management bus and I/O bus arbitration supervisory communications;
Monitoring management cell S MU is connected with the Data Management Unit of star-shaped configuration on management bus and I/O bus line, and the monitoring management unit is connected with Data Management Unit by the order that is transmitted by Data Management Unit management line.
Described each blade server receive that base plate plays with the signal that discharges Servers-all after break away from from communication bus, selected then blade server breaks away from the back at all blade servers from communication bus and engages with communication bus.
The invention also discloses a kind of maintaining method of high extending internet superserver, this method comprises:
A, the 2nd ADSS server use the heartbeat monitor algorithm to detect the fault of an ADSS server, and start fault backup conversion the one ADSS server capability to the two ADSS servers.
Comprise before the described steps A:
A0, startup client server.
Described steps A 0 is: start from storage area network guiding client server.
Further comprise before the described steps A 0: before client server starts, the relevant configuration data are delivered to the relative users server by starting the ROM expansion.
The scheme that proposes according to the present invention when high expandable internet super server breaks down at the ADSS server, is dynamically redistributed server operation.The first and second ADSS servers are videoed mutually, and comprise possess redundant data, the database of the correspondence of domain host control protocol server, XML interface and supervision timer.The ADSS server is connected with a storage switch with at least one server OS; Storage switch is connected with at least one memory cell.The 2nd ADSS server detects an ADSS server failure by the heartbeat monitor algorithm, starts the fault backup automatically function is transformed into the 2nd ADSS server.This framework comprises that also the monitor data management of being made up of the server that reconfigures that is connected with Data Management Unit star configurations array in a large number is provided with, and provides a kind of more efficiently mode to operate HSP, has improved the physical resource management of server farm.
In addition, system of the present invention also allows to expand existing memory capacity by adding more RAID equipment, and also allowing increases the capacity of ADSS storing virtual bandwidth by adding more ADSS equipment.Created the flexible and reliable storage means of a kind of safety in this way.
Description of drawings
Fig. 1 shows the storage area network schematic diagram;
Fig. 2 shows the structure chart that uses iSCSI of the present invention to start the simple and easy high expandable internet super server of driver replication server;
Fig. 3 shows iSCSI of the present invention and starts the activation of driver and the flow chart of operation;
Fig. 4 shows the structural representation of ADSS distributed memory system;
Fig. 5 shows the structure chart of the server farm of the present invention's description.
Embodiment
For making the purpose, technical solutions and advantages of the present invention clearer, the present invention is described in further detail below in conjunction with accompanying drawing.
Among the present invention, provide a kind of use new method that limitation of the prior art is addressed to storage area network, provide memory capacity or virtual disk by distributed and redundant method for client server, client server can be blade server.The basis that said method is realized mainly is to create a storage area network, and this generally can realize by the optical-fibre channel technology.
Fig. 2 shows the structure chart that uses iSCSI of the present invention to start the simple and easy high expandable internet super server of driver replication server, as shown in Figure 2, the framework 100 of high expandable internet super server is defined by many server master boards, and each such mainboard is set to blade server 110.The details that configuration of high expandable internet super server 100 internal physical and computer server 110 are provided with and one embodiment of the present of invention are by U.S. Patent number 6,452,809 patent provides, be entitled as " high expandable internet super server ", can be for reference at this, submit to the application title of filing to be simultaneously " iSCSI of high expandable internet super server starts driving method and equipment ".The preferential software setting of computer server 110 is described in the application reference of title for " provide dynamically host services manage different accounts and website " in front in detail.
Framework of the present invention is further by the (ADSS of dynamic data storage system, Active Data StorageSystem) hardware 130 definition, ADSS hardware 130 has been created the ADSS server that comprises ADSS module 132, domain host control protocol (DHCPD, Dynamic Host Configuration Protocol) server 134, database 136, XML-interface logic 138 and supervision timer 140.ADSS hardware 130 is duplicated by ADSS hardware 150, comprises ADSS module 152, DHCPD server 154, database 156, XML-interface logic 158 and supervision timer 160.ADSS hardware 130 and ADSS hardware 150 all are connected to blade server 110 by the Internet switch 120.The ADSS hardware 130 and the ADSS hardware 150 of combination are regarded as the virtual management system, and this is optionally to connect the system of virtual capacity to initiating equipment (for example, the file server of data is read or writes in client, host computer system or requirement).
Framework 100 also comprises server OS (Engine OS, Engine Operating System) 162, it via storage switch 166 at ADSS hardware 130,150 and System Management Unit (SMU, System Management Unit) connect between 164, and switch connects between ADSS hardware 130,150 and memory disk 168.The whole process control of framework 100 and control exist system 162 to be responsible for by server, and storage and driving mapping then are responsible for by ADSS module 132,152.
ADSS module 132 and 152 provides directory service to distributed computing environment (DCE) and application, and this service provides the interface of single simplification so that the user uses the catalogue resource from different networks when avoiding difference; This is a centralization and standardized system, and it makes the network management automation of user data, fail safe and distributed resource, and itself and other catalogue is operated mutually.In addition, when the network manager was provided the single-point of the inherent hierarchical view of network and all network objects of management, Active Directory Services (activedirectory service) allowed the user to use single login process to visit and allows accessed resources in the network.
DHCPD server 134 with 154 the IP address of server system internal distribution uniqueness to equipment that framework 100 is connected on, for example after the computer log, DHCPD server 134 and 154 is effectively selected in master list or the address base unique from particular network and system or client are given in untapped IP address assignment, usually these IP addresses can be distributed arbitrarily, the client searches for Dynamic Host Configuration Protocol server by the mode that broadcasting lacks the IP address, and Dynamic Host Configuration Protocol server is then made response by hire out the effective I P address from its master list or address base to client.In the present invention, framework 100 supports that special Dynamic Host Configuration Protocol server is a blade server client distribution particular ip address by IP address and medium access control (MAC) address are combined, because MAC Address is network interface unit (NIC, NetworkInterface Card) physics, address that can not change, constant, thus the IP address of guaranteeing the blade server client is always consistent.The IP address relevant with MAC Address arbitrarily generates when initial configuration ADSS hardware, and remains unchanged after generating.In addition, in the DHCP standard, use specific extension field to come to send extraneous information among the present invention to the blade server client, this extraneous information is used to define the iSCSI parameter that finds ADSS hardware required, and these parameters will be used for request and the required checking of login ADSS hardware to server disk.
Return Fig. 2, database 136 and 156 is connected to corresponding ADSS module 132 and 152 and DHCPD server 134 and 154, as the warehouse of receiving terminal, sending ending equipment addressing, active volume position and original storage map information, also be used as the information source of corresponding DHCPD server simultaneously.All ADSS are replicated the database between the row member, so that main system information redundancy.XML interface background program 138 and 158 serves as the interface between server OS 162 and the ADSS hardware 130,150, and they provide the function of login feature and automatic operation A DSS hardware.When server deadlock state occurred in operating process, supervision timer 140 and 160 was restarted server operation, and for example, timer expiry shows the ADSS fault.Storage switch 166 optical-fibre channel or the Internet types of being called preferably, it is allowed between disk 168 and ADSS hardware 130,150 storage and obtains data.
In framework 100 described embodiment, unless break down, ADSS hardware 130 serves as main Dynamic Host Configuration Protocol server.The heartbeat monitor circuit is used for test failure as line 139 between ADSS hardware 130 and ADSS hardware 150.When server 130 breaks down, server 150 will detect the heartbeat response and serve DHCP information immediately.In specific overall situation, server hardware will guarantee that all storages are available by fibre channel switch, as the storage in the disk 168.When therefore one of them server broke down, another server (only showing two servers at this) can be carried out the function of failed server.The database that the DHCPD server is direct and corresponding connects, because each server of all IP addresses of framework 100 and mac address information has only a database.
In this embodiment, server OS 162 (or simple and easy socket) is sent " activity " by XML interface background program 138 or 158 and (action) is ordered and create, change or delete virtual capacity.XML interface logic 138 is sent activity command equally and is distributed and do not distribute or increase and reduce virtual capacity it can be used transmitting terminal, sends detection, mirror image in addition, duplicates and movement directive.The logic of XML interface background program 138 partly receives " activity " order that comprises to issue orders: detect effective activity command; Be transformed into server command; Carry out server command; Confirm command execution; The failure order is returned; Provide and feed back to server OS 162.Server OS 162 is also sent information consultation by XML interface logic 138, and XML interface logic 138 is verified effectively consulting, and conversion XML seeks advice from advice database, and transition response is returned the XML data to server OS 162 again to XML.In addition, XML interface logic 138 sends alarms server OS 162, and fault warning is sent by logon server or SNMP.
By above description to high expandable internet super server framework 100, the flow chart of describing with reference to figure 3 is done roughly understanding to the flow process that signs in to high expandable internet super server again.By using iSCSI to start the driver login, operation at this makes iSCSI startup driver be divided into two parts: iSCSI virtual bench (ADSS hardware 130 and ADSS hardware 150 are formed virtual bench), see also the right half of flow chart shown in Figure 3, with the iSCSI starting drive, see also the left half of flow chart shown in Figure 3.Begin login by transmit a request to the iSCSI virtual bench from initiating equipment, via starting module 202.The iSCSI virtual bench determines whether virtual capacity has been assigned to the request initiating equipment, via decision-making module 204.If the unallocated initiating equipment of virtual capacity, then the iSCSI virtual bench is waited for new startup request.On the contrary, if virtual capacity has been assigned to initiating equipment, then login is proceeded, and the response from DHCPD server 134 is activated by the MAC Address of initiating equipment thus, via operational module 206.Then, the distribution that ADSS module 132 is apprised of virtual capacity is connected with MAC, via operational module 208, and is connected to the power supply of blade server 110, via the operational module 210 of iSCSI starting drive.
Then, network interface unit generates external component interconnected (PCI, Peripheral ComponentInterconnect) device id mask, therefore sends to start request, via operational module 212.As everyone knows, blade server is by the following characterizing definition of database 136 inside: the MAC Address of (1) predefined network interface unit; (2) the IP address of (distribution) initiating equipment comprises (a) A level subnet [255.0.0.0], (b) the 10.[rack] the .[frame] the .[insert groove]; (3) iSCSI checking territory (distribution) comprising: (a) penetration DHCP, (b) initiating equipment title.Term " penetration DHCP " refers to all iSCSI checking territories and all is pushed to client sending end by DHCP.Specifically, all current iSCSI dispose the authorization information that all requires such as the IP address of the capacity of will serving of user name, password, iSCSI receiving terminal etc. and manually import the client backstage by the operating system utility software.Here it is, and why preferential iSCSI dispose the one of the main reasons that can not start, because this information is when loading of operating system and corresponding iSCSI software driver and read when pre-seting parameter or manually importing this information by the operator and just can use.
By send this information via DHCP, the pre-OS stage that the present invention not only provides in start-up course makes this information to the available method of client sending end (initiating equipment), but also can create central authority ADSS, ADSS can store and dynamically change these settings so that certain operations, these operation as automated back-ups of optional ADSS unit, or do not disturbing the client to use to add under the prerequisite or change is installed in the quantity and the size of the virtual disk on the client computer.
In the application that is entitled as " iSCSI of high expandable internet super server starts driving method and equipment " more detailed description is arranged, iSCSI starts ROM IE process and sends discovery asks Dynamic Host Configuration Protocol server 134, via operational module 214.Dynamic Host Configuration Protocol server 134 responds to finding request based on the MAC and the load balancing rule of initiating equipment, via operational module 216.Specifically, Dynamic Host Configuration Protocol server 134 sends client IP address, mask and gateway, sends the iSCSI log-on message simultaneously: the IP address of (1) server (the IP address of ADSS hardware); (2) agreement (being defaulted as transmission control protocol (TCP)); (3) portal number (acquiescence 3260); (4) initial logic unit number (LUN); (5) receiving terminal title is as the iscsi target title of ADSS server; (6) initiating equipment title.
About the load balancing regulations option of Dynamic Host Configuration Protocol server, when workload is light, at first select some ADSS unit to satisfy client's demand.Load balancing in the ADSS system architecture comprises two ADSS master servers that DHCP, database and management resource are provided, and is configured to fault-tolerant the trooping of critical data library information and DHCP service.What included in addition is the assembly of a large amount of subordinate ADSS, and these assemblies link to each other with the ADSS master server and by its control, these subordinates ADSS only serves virtual capacity in the unit.The models of priority that connects by minimum, wherein when serving new client ADSS always Priority Service is in the client of minimum number, load balancing is realized by the responsibility of distributing virtual capacity services between different ADSS unit.Therefore the rank of service realizes by restriction client's maximum quantity that also any ADSS can both be for the client creates more memory bandwidth, and these clients use the ADSS unit of these upper limit settings but not those unit of operating on standard A DSS storehouse.
Return Fig. 3, iSCSI starts ROM and continues to receive DHCPD server 134 information, via operational module 218, re-uses this information startup and signs in to server, via operational module 220.ADSS module 132 receives logging request, and this request of checking on the MAC that introduces login and initiating equipment title, via operational module 222.Next, the virtual capacity of distribution is talked with and is served in the login of ADSS module creation, via operational module 224.ISCSI starts the DOS disk of ROM simulation band virtual capacity and interrupts signal-arm No. 13, via operational module 226.ISCSI starts ROM storage ADSS log-on message in upper end storage area (UMB, Upper Memory Block), via operational module 228.Start then and proceed, via operational module 230.
Thus, server starts with 16 bit patterns by network from the iSCSI module, via operational module 232.16 bit manipulation system bootstrap routines are written into 32 unified iSCSI drivers, via operational module 234.32 unified iSCSI drivers read the ADSS log-on message from UMB, login again again is via operational module 236.ADSS module 132 receives logging request, is verified again based on MAC, via operational module 238 again.Then, the ADSS module is rebuild the login dialogue, serves the virtual capacity of distribution again, via operational module 240.At last, 32-bit operating system activates the iSCSI module of use fully, just uses as being both local device freely, via operational module 242.
According to above description, realization of the present invention roughly is summarized as follows: the method for describing among the present invention has been described and has not been comprised internal disk in a kind of client server, but storage area network special with low cost from, that flexibility is high starts.This just requires:
One, use a kind of method before client server starts, the relevant configuration data to be delivered to the relative users server, this point realizes by using startup ROM to expand, and this expansion is start-up routine elder generation's reception before data, and uses the DHCP agreement to send related data;
Two, a kind of mode is to start from storage area network guiding client server, same this point is also expanded by startup ROM and is realized, is that client server is installed remote dummy disk (Remote Virtual Disk) and local disk of emulation in this expansion;
Three, use iSCSI as transmission medium, and do not use optical-fibre channel;
Four, the ADSS storing virtual device system (Storage Virtualizer System) of use between original RAID storage device and server;
Five, use a kind of method of group to obtain many ADSS equipment, thereby satisfy the storage demand of client server.
Fig. 4 shows the structural representation of ADSS distributed memory system, as shown in Figure 4, at the consideration of conventional storage area network solution cost, has adopted the novel transfer approach of a kind of iSCSI of being called now.ISCSI is a kind of mode that is used for encapsulating the SCSI standard, and it is to be used for the transmission method that communicates between disk and computer by ICP/IP protocol.Fundamentally, this method has been to use a kind of more cheap and ripe gigabit Ethernet network to substitute expensive external fiber channel network.Yet because iSCSI is a kind of software protocol, it requires at first import operation system of client server, and then the additional external storage, and therefore, iSCSI is faced with the problem same with optical-fibre channel at present.It also requires by manual configuration required information to be connected on the corresponding exterior storage.
Jian Yi method is between client server iSCSI to be used as unique transfer approach in the present invention, and by using ROM BIOS expansion to solve problem.This ROM BIOS expansion is added on the client server, and like this when energized after, it just can Control Server, and then starts also operating system by integral disk.Different is that this ROM BIOS expansion is connected with a gigabit networking adapter (Gigabit Network Adapter), sends request for its configuration data then.This configuration data is used to notify client server to " disk " that where go to seek it.
DHCP (DHCP is used in this request, Dynamic Host ConfigurationProtocol), a DHCP request is just received by an ADSS server therein then, this request responding feeds back to the corresponding information of client server, promptly notifies client server required " disk " that where goes to seek for its use.Because the configuration of client server can be changed rapidly, like this, just allows freely to select to use which ADSS storing virtual device to provide service as client server.
According to definition, use iSCSI that this ADSS memory virtual machine is connected with user's blade server by gigabit Ethernet, yet they or use optical-fibre channel or use SCSI agreement appends on a large amount of raid storage devices.ADSS equipment is created a kind of mode simultaneously the RAID memory is divided into some little virtual disks also with optical-fibre channel or the SCSI protocol translation becomes iSCSI.
In Fig. 4, adopt many RAID equipment a kind of like this mode that links to each other with ADSS equipment, so that all ADSS equipment can both " be seen " whole RAID memories, therefore, client server just can use any ADSS equipment to satisfy its memory requirement.
When ADSS equipment can store these virtual disks on a plurality of RAID equipment, the flexibility of this system was just apparent, therefore also just can provide extra redundancy.Similarly, because ADSS equipment can visit all RAID equipment, so virtual disk just can move freely between RAID equipment, but can not influence the function of client server.
Conversely, owing to can send configuration information to client server, therefore just can at random indicate client server, and needn't this enter from the old place from other its virtual disk of ADSS device access by the DHCP agreement.For example, if an ADSS equipment fault removes to seek another ADSS equipment just can indicate client server to change its path to its virtual disk, can move equally.ADSS equipment is the RAID equipment of addressable leading data also.
At last, this system also allows to expand existing memory capacity by adding more RAID equipment, and also allowing increases the capacity of ADSS storing virtual bandwidth by adding more ADSS equipment.Created the flexible and reliable storage means of a kind of safety in this way.
The importance of this method be can the centralized control memory map to the method for client server.For example, if a user wishes to use Windows2000 to start the several users server, just the ADSS system can be installed to the virtual disk of these Windows2000 on user's the blade server.Like this, program is just simple to the power supply that only needs to connect blade server, and they just can start Windows2000.If the user wishes Windows2000 is changed into Linux, this user only needs power supply is disconnected so, again the Linux virtual disk is videoed, and recloses power supply again and gets final product.
In the described in the present invention high expandable internet super server, exist a master control system to be called Engine OS, it can control client server (with the form of blade) and ADSS system (by means of the XML agreement).Engine OS so just makes calling program and simple, because can control the storage in the power supply that switches on and off user's blade server and the client server of videoing.Like this, management loop has just been finished, and the whole system industry can freely reconfigure, and does not need direct manual intervention client server.
Fig. 5 shows the structure chart of the server farm of the present invention's description, the monitor data management devices (Supervisory Data Management Arrangement) 300 of promptly forming framework 100, as shown in Figure 5, monitor data management devices 300 comprises and a large amount of distributed management unit (DMU, DistributedManagement Unit) 332~338 blade servers that reconfigure in a large number 312 that link to each other, 314,316 and 318, these distributed management unit connect with a monitoring management unit (SMU, Supervisory Management Unit) 360 again at least.SMU360 comprises output 362 and the internet management output 364 of sharing KVM/USB equipment.
In this embodiment, adorn 8 blade servers in each blade server frame 312~318 (totally 4), each DMU module monitors operation conditions and frame fan, voltage and the frame temperature of blade server by communication line 322A, 324A, 326A and 328A.DMU also controls the power supply supply of frame inner blade server, and switches between individual server in frame from input and output device by communication line 322B, 324B, 326B and 328B response.In addition, the function that each DMU module 332,334,336 is different with 338 monitoring servers, and communicate by letter from the SMU360 arbitration management with I/O bus 342B, 344B, 346B, 348B by management bus 342A, 344A, 346A, 348A.In addition, DMU module fixedly KVM/USB output and supervisory signal arrives single DVI type electric wire, and this is wired to SMU360, stores the event loop daily record again.
In this embodiment, each blade of each server comprises an embedded microcontroller.This embedded microcontroller monitoring mainboard, in the circulation daily record, instant report condition sends alarm and accepts the difference in functionality order when going wrong with its state storage, for example starts shooting, shuts down, resets, KVM (keyboard, video and mouse) selects and KVM release.These communication functions are finished by line 322C, 324C, 326C and 328C.
For example, SMU360 links to each other on management bus 342A, 344A, 346A, 348A and I/O bus 342B, 344B, 346B, 348B line with the DMU module of star-shaped configuration, and SMU360 is connected with DMU by the order that transmits via DMU management line.Supervisory communications have the reliable communication bag of the shared bus of the detection and the ability that retransfers to be handled by connection.The SMU module is identical with the DMU external form, and local frame comprises embedded DMU.SMU is connected with four blade server frames (Server Blade Unit Board) on management line 342~348 via the order of being sent to DMU.SMU provides the higher-level user interface by the Internet port for frame.SMU switches and consolidates the KVM/USB bus and send it to and share the KVM/USB output socket.
Keyboard, video, mouse and USB (KVM/USB) switching between server is operated by switching bus mode.Select first blade server will make backplane signal play, thereby discharge the Servers-all of KVM/USB bus.All blade servers will receive backplane signal, and the previous blade server that is connected with bus breaks away from, and selected blade server will engage with communication bus.
As can be seen, framework advantage of the present invention is the distributed nature of ADSS system in each embodiment described above.Although another well-known system provides the fault-tolerant right of the storing virtual device that possesses the fault backup capabilities, but there are not other expansion possibilities, and the present invention preferentially provides the distributed virtual device, for example ADSS can both serve client's blade arbitrarily arbitrarily, because all client's blades can " be seen " in the ADSS unit, can see all RAID memory cell that store virtual capacity.In this way, client server can be mapped to any ADSS unit requests automated back-up or redistribution load capacity, so just can be in office the upgrade mixed-bandwidth of whole system of time interpolation ADSS unit.
Agreement of the present invention is protected by copyright, and the copyright holder only allows fax of the present invention and duplicate to occur in patent and trademark office's file or record, otherwise All rights are reserved without exception.
In a word, the above is preferred embodiment of the present invention only, is not to be used to limit protection scope of the present invention.
Claims (11)
1, a kind of maintaining unit structure of high extending internet superserver is characterized in that, comprising:
At least one is connected to the blade server that the Internet exchange is provided with;
The one ADSS server is connected to one or more blade servers by the Internet switch, and an ADSS server comprises,
First database, this database are connected to first first Internet protocol IP address server that is adapted at distributing IP address in the framework,
The one XML interface, this XML interface are connected between a server OS and the ADSS server;
The 2nd ADSS server, this ADSS server is connected to one or more blade servers by the Internet exchange setting, and the 2nd ADSS server comprises,
Second database, when the one ADSS server breaks down, this database is connected to the second Internet protocol IP address server that is adapted at distributing IP address in the framework, and be suitably for the user and provide the 2nd ADSS module of directory service to be connected, wherein second database is connected to first database, and comprise from the redundant information of first database and
The 2nd XML interface, this XML interface are connected between server OS and the 2nd ADSS server;
Server OS is connected with at least one monitor data management devices, and the 2nd ADSS server uses the heartbeat monitor algorithm to detect the fault of an ADSS server, and starts fault backup conversion the one ADSS server capability to the two ADSS servers;
Storage switch, it is connected with the 2nd ADSS server with an ADSS server; And memory cell, this memory cell is connected with storage switch.
2, maintaining unit structure according to claim 1 is characterized in that: the described first Internet protocol address server and the second Internet protocol address server use from comprising dynamic host configuration protocol DHCP and starting the communication protocol of selecting the agreement BOOTP group.
3, maintaining unit structure according to claim 1 is characterized in that: described first database and second database are used for storage and receive and initiating equipment address, active volume position and Storage Mapping information.
4, maintaining unit structure according to claim 1 is characterized in that, a described ADSS server and the 2nd ADSS server further comprise: supervision timer is used to restart server operation.
5, maintaining unit structure according to claim 1, it is characterized in that, described monitor data management devices comprises: monitoring management cell S MU is connected with one or more Data Management Unit DMU, and each Data Management Unit is connected with the blade server that one or more reconfigure.
6, maintaining unit structure according to claim 5 is characterized in that, described monitor data management devices comprises:
The Data Management Unit DMU that is connected with blade server that one or more reconfigure, be used to monitor blade server state, control electric power function, respond and between each blade server, switch from the order of input/output device, and monitor each blade server function, by management bus and I/O bus arbitration supervisory communications;
Monitoring management cell S MU is connected with the Data Management Unit of star-shaped configuration on management bus and I/O bus line, and the monitoring management unit is connected with Data Management Unit by the order that is transmitted by Data Management Unit management line.
7, maintaining unit structure according to claim 6, it is characterized in that, described each blade server receive that base plate plays with the signal that discharges Servers-all after break away from from communication bus, selected then blade server breaks away from the back at all blade servers from communication bus and engages with communication bus.
8, a kind of maintaining method of high extending internet superserver is characterized in that, the method includes the steps of:
A, the 2nd ADSS server use the heartbeat monitor algorithm to detect the fault of an ADSS server, and start fault backup conversion the one ADSS server capability to the two ADSS servers.
9, method according to claim 8 is characterized in that, comprises before the described steps A:
A0, startup client server.
10, method according to claim 9 is characterized in that, described steps A 0 is: start from storage area network guiding client server.
11, method according to claim 9 is characterized in that, further comprises before the described steps A 0: before client server starts the relevant configuration data are delivered to the relative users server by starting the ROM expansion.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US49849303P | 2003-08-28 | 2003-08-28 | |
US60/498,493 | 2003-08-28 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN1592231A true CN1592231A (en) | 2005-03-09 |
CN100421382C CN100421382C (en) | 2008-09-24 |
Family
ID=34619291
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CNB2004100642940A Expired - Fee Related CN100421382C (en) | 2003-08-28 | 2004-08-30 | Maintaining unit structure of high extending internet superserver and its method |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN100421382C (en) |
Cited By (16)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN100392600C (en) * | 2005-05-12 | 2008-06-04 | 国际商业机器公司 | Internet SCSI communication via UNDI services method and system |
CN101212477B (en) * | 2006-12-30 | 2010-11-10 | 广达电脑股份有限公司 | Management interface between embedded systems of blade server |
CN102006190A (en) * | 2010-11-23 | 2011-04-06 | 浪潮(北京)电子信息产业有限公司 | High-availability cluster backup system and backup method thereof |
CN101551649B (en) * | 2008-03-31 | 2011-06-29 | 上海宝信软件股份有限公司 | Equipment monitoring apparatus supporting single connection and realizing method thereof |
CN101778091B (en) * | 2009-01-08 | 2012-07-18 | 王垒 | Expandable security server alternate system |
CN101326521B (en) * | 2005-12-16 | 2012-08-15 | 艾利森电话股份有限公司 | Method and apparatus for XML document manager server |
CN101553768B (en) * | 2005-06-15 | 2013-05-15 | 思科技术公司 | Methods and devices for networking blade servers |
CN101741607B (en) * | 2008-11-11 | 2013-06-12 | 大唐移动通信设备有限公司 | Telecommunication equipment and internal resource management method thereof |
CN103516918A (en) * | 2012-06-28 | 2014-01-15 | 中兴通讯股份有限公司 | Method and device for recovering resource failures |
CN103618788A (en) * | 2013-11-26 | 2014-03-05 | 曙光信息产业股份有限公司 | System high-availability method supporting B/S structure |
CN106339291A (en) * | 2015-07-06 | 2017-01-18 | 群晖科技股份有限公司 | Method and apparatus for managing a storage system via a hybrid management path |
CN107710802A (en) * | 2015-06-26 | 2018-02-16 | 瑞典爱立信有限公司 | The method and relevant device used in control node and service radio node |
CN108228209A (en) * | 2016-12-21 | 2018-06-29 | 广达电脑股份有限公司 | Automatically update the system, method and medium of the firmware of the element of server system |
CN111752626A (en) * | 2020-06-24 | 2020-10-09 | 深圳忆联信息系统有限公司 | Implementation method and device for solving fingerprint deployment drive deficiency and computer equipment |
CN112667477A (en) * | 2020-12-30 | 2021-04-16 | 湖南博匠信息科技有限公司 | Recording and monitoring method and system for blade type board card |
CN113961397A (en) * | 2021-10-28 | 2022-01-21 | 航天壹进制(南京)数据科技有限公司 | High-availability cluster disaster tolerance method based on backup disaster tolerance system |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5544347A (en) * | 1990-09-24 | 1996-08-06 | Emc Corporation | Data storage system controlled remote data mirroring with respectively maintained data indices |
US5889935A (en) * | 1996-05-28 | 1999-03-30 | Emc Corporation | Disaster control features for remote data mirroring |
CN1198406C (en) * | 2000-09-02 | 2005-04-20 | 中兴通讯股份有限公司 | Stand-by method and device of communication system |
US20030005350A1 (en) * | 2001-06-29 | 2003-01-02 | Maarten Koning | Failover management system |
-
2004
- 2004-08-30 CN CNB2004100642940A patent/CN100421382C/en not_active Expired - Fee Related
Cited By (21)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN100392600C (en) * | 2005-05-12 | 2008-06-04 | 国际商业机器公司 | Internet SCSI communication via UNDI services method and system |
CN101553768B (en) * | 2005-06-15 | 2013-05-15 | 思科技术公司 | Methods and devices for networking blade servers |
CN101326521B (en) * | 2005-12-16 | 2012-08-15 | 艾利森电话股份有限公司 | Method and apparatus for XML document manager server |
CN101212477B (en) * | 2006-12-30 | 2010-11-10 | 广达电脑股份有限公司 | Management interface between embedded systems of blade server |
CN101551649B (en) * | 2008-03-31 | 2011-06-29 | 上海宝信软件股份有限公司 | Equipment monitoring apparatus supporting single connection and realizing method thereof |
CN101741607B (en) * | 2008-11-11 | 2013-06-12 | 大唐移动通信设备有限公司 | Telecommunication equipment and internal resource management method thereof |
CN101778091B (en) * | 2009-01-08 | 2012-07-18 | 王垒 | Expandable security server alternate system |
CN102006190A (en) * | 2010-11-23 | 2011-04-06 | 浪潮(北京)电子信息产业有限公司 | High-availability cluster backup system and backup method thereof |
CN102006190B (en) * | 2010-11-23 | 2012-10-31 | 浪潮(北京)电子信息产业有限公司 | High-availability cluster backup system and backup method thereof |
CN103516918A (en) * | 2012-06-28 | 2014-01-15 | 中兴通讯股份有限公司 | Method and device for recovering resource failures |
CN103618788A (en) * | 2013-11-26 | 2014-03-05 | 曙光信息产业股份有限公司 | System high-availability method supporting B/S structure |
CN107710802A (en) * | 2015-06-26 | 2018-02-16 | 瑞典爱立信有限公司 | The method and relevant device used in control node and service radio node |
CN107710802B (en) * | 2015-06-26 | 2022-02-18 | 瑞典爱立信有限公司 | Method used in control node and serving radio node and related equipment |
CN106339291A (en) * | 2015-07-06 | 2017-01-18 | 群晖科技股份有限公司 | Method and apparatus for managing a storage system via a hybrid management path |
CN106339291B (en) * | 2015-07-06 | 2019-01-11 | 群晖科技股份有限公司 | Method and apparatus for managing a storage system via a hybrid management path |
US10185494B2 (en) | 2015-07-06 | 2019-01-22 | Synology Incorporated | Method and associated apparatus for managing a storage system with aid of hybrid management paths |
CN108228209A (en) * | 2016-12-21 | 2018-06-29 | 广达电脑股份有限公司 | Automatically update the system, method and medium of the firmware of the element of server system |
CN108228209B (en) * | 2016-12-21 | 2021-06-01 | 广达电脑股份有限公司 | System, method, and medium for automatically updating firmware of elements of a server system |
CN111752626A (en) * | 2020-06-24 | 2020-10-09 | 深圳忆联信息系统有限公司 | Implementation method and device for solving fingerprint deployment drive deficiency and computer equipment |
CN112667477A (en) * | 2020-12-30 | 2021-04-16 | 湖南博匠信息科技有限公司 | Recording and monitoring method and system for blade type board card |
CN113961397A (en) * | 2021-10-28 | 2022-01-21 | 航天壹进制(南京)数据科技有限公司 | High-availability cluster disaster tolerance method based on backup disaster tolerance system |
Also Published As
Publication number | Publication date |
---|---|
CN100421382C (en) | 2008-09-24 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20050080891A1 (en) | Maintenance unit architecture for a scalable internet engine | |
US10791181B1 (en) | Method and apparatus for web based storage on-demand distribution | |
CN102110071B (en) | Virtual machine cluster system and implementation method thereof | |
KR100840960B1 (en) | Method and system for providing dynamic hosted service management | |
US8909767B2 (en) | Cloud federation in a cloud computing environment | |
US9612814B2 (en) | Network topology-aware recovery automation | |
CN100421382C (en) | Maintaining unit structure of high extending internet superserver and its method | |
CN107404524B (en) | Distributed cluster node access method and device | |
US7814364B2 (en) | On-demand provisioning of computer resources in physical/virtual cluster environments | |
US9288266B1 (en) | Method and apparatus for web based storage on-demand | |
US8612553B2 (en) | Method and system for dynamically purposing a computing device | |
US8713127B2 (en) | Techniques for distributed storage aggregation | |
US20050108593A1 (en) | Cluster failover from physical node to virtual node | |
US8224941B2 (en) | Method, apparatus, and computer product for managing operation | |
US9602600B1 (en) | Method and apparatus for web based storage on-demand | |
US8387013B2 (en) | Method, apparatus, and computer product for managing operation | |
US8819200B2 (en) | Automated cluster node configuration | |
CN1834912A (en) | ISCSI bootstrap driving system and method for expandable internet engine | |
US20190332293A1 (en) | Methods for managing group objects with different service level objectives for an application and devices thereof | |
Chavis et al. | A Guide to the IBM Clustered Network File System | |
Chen | New Development of Storage Architectures and Network Managed PCs | |
GUIDE | VMware View 5.1 and FlexPod |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20080924 Termination date: 20150830 |
|
EXPY | Termination of patent right or utility model |