CN103139066A

CN103139066A - Source switching method, main control equipment and node equipment

Info

Publication number: CN103139066A
Application number: CN2013100351342A
Authority: CN
Inventors: 杨焕军
Original assignee: Datang Mobile Communications Equipment Co Ltd
Current assignee: Datang Mobile Communications Equipment Co Ltd
Priority date: 2013-01-29
Filing date: 2013-01-29
Publication date: 2013-06-05
Anticipated expiration: 2033-01-29
Also published as: CN103139066B

Abstract

The invention provides a source switching method, main control equipment and node equipment. The source switching method, the main control equipment and the node equipment are used for switching of business units when one business unit breaks down, guaranteeing the normal business service, preventing influence on other business units, and improving utilization rate of equipment. The source switching method comprises the steps of determining a spare business unit corresponding to a broken main business unit when the main control equipment determines that one main business unit breaks down, and determining the spare business unit to be used as a main business unit by the main control equipment. Each business unit is used for providing a certain business service.

Description

Resource switching method, master control equipment and node equipment

Technical Field

The present invention relates to the field of mobile communications, and in particular, to a resource switching method, a master control device, and a node device.

Background

In a communication system, in order to ensure high reliability, backup processing needs to be performed on a service flow, so that the service cannot be recovered after equipment such as a server, an industrial personal computer and a single board fails. In the traditional backup mode, the device is taken as a backup unit, and as long as a service module in the main device fails, switching is triggered, the main device is reset, the standby device is started up, and the service module on the new main device continues to provide service. As long as a certain service module on the main equipment has a fault, the switching is triggered without considering the working states of other service modules on the equipment.

Therefore, the equipment switching mode in the prior art has low equipment utilization rate.

Disclosure of Invention

The embodiment of the invention provides a resource switching method, a master control device and a node device, which are used for switching a service unit when the service unit fails, so that other service units on the device are not influenced, normal service is guaranteed, and the utilization efficiency of the device is improved.

The resource switching method provided by the embodiment of the invention comprises the following steps:

when the master control equipment determines that any main service unit in the network fails, determining a standby service unit corresponding to the failed main service unit;

the main control equipment determines the standby service unit as a main service unit;

each service unit is used for providing a set service.

Another resource switching method provided in the embodiments of the present invention includes:

the node equipment receives indication information sent by main control equipment in a network;

the node equipment determines the standby service unit as a main service unit according to the indication information;

each service unit is used for providing a set service.

The embodiment of the invention provides a main control device, which comprises:

a selecting unit, configured to determine, when it is determined that any one of the master service units of the node devices in the network fails, a standby service unit corresponding to the failed master service unit;

the switching unit is used for determining the standby service unit as a main service unit;

each service unit is used for providing a set service.

The node device provided by the embodiment of the invention comprises:

the receiving unit is used for receiving the indication information sent by the main control equipment in the network;

the switching unit is used for determining the standby service unit as the main service unit according to the indication information;

each service unit is used for providing a set service.

According to the technical scheme, when the master control equipment determines that any main service unit in the network fails, the standby service unit corresponding to the failed main service unit is determined; the main control equipment determines the standby service unit as a main service unit; each service unit is used for providing a set service. The service unit in the process is used as the object for judging and switching the fault, so that the service unit for providing other service in the equipment where the main service unit with the fault is located is not influenced, and the utilization efficiency of the equipment is improved.

Drawings

Fig. 1 is a flowchart illustrating a resource switching method according to an embodiment of the present invention;

fig. 2 is a flowchart illustrating another resource switching method according to an embodiment of the present invention;

fig. 3 is a logic diagram of a service unit relationship in a network according to an embodiment of the present invention;

fig. 4 is a logic diagram of service group division in a network according to an embodiment of the present invention;

fig. 5 is a logic diagram of an N + M backup in a network according to an embodiment of the present invention;

fig. 6 is a schematic flowchart of initializing a service unit working network according to an embodiment of the present invention;

fig. 7 is a schematic diagram of a single board interaction in an initialization process according to an embodiment of the present invention;

fig. 8 is a schematic flowchart of a configuration service unit of a main control board according to an embodiment of the present invention;

fig. 9 is a schematic flowchart of service unit fault processing according to an embodiment of the present invention;

fig. 10 is a schematic structural diagram of a master device according to an embodiment of the present invention;

fig. 11 is a schematic structural diagram of a node device according to an embodiment of the present invention.

Detailed Description

The embodiment of the invention provides a resource switching method, a master control device and a node device, which are used for switching a service unit when the service unit fails, ensuring normal service, not influencing other service units on the device and improving the utilization efficiency of the device.

The embodiment of the invention utilizes a multi-process operating system to expand the function and application range of the equipment. The components are basic units of the equipment program during running, have independent compiling and evolution functions, each component provides an independent service, and a plurality of components provide a set service together. All components for providing a set service are determined as a service unit, and various service units are determined according to the service provided by the equipment. In the device network, a plurality of service units providing the same service are provided, including a main service unit and a standby service unit, and each service unit providing the same service is configured to operate in different processes. The standby service unit is used for providing the service, and when the main service unit corresponding to the standby service unit fails, the standby service unit is determined to be the main service unit and is used for continuously providing the service.

Referring to fig. 1, a resource switching method provided in the embodiment of the present invention includes:

s101, when the master control equipment determines that any main service unit in the network fails, determining a standby service unit corresponding to the failed main service unit;

s102, the main control device determines the standby service unit as a main service unit;

each service unit is used for providing a set service.

Preferably, when the standby service unit is located in the node device, the main control device sends indication information to the node device where the standby service unit is located, so as to indicate the node device where the standby service unit is located to determine the standby service unit as the main service unit; and when the standby service unit is positioned in the main control equipment, the main control equipment determines the standby service unit as the main service unit.

Preferably, when the failed main service unit does not have a corresponding standby service unit which operates normally, or when the failed main service unit does not have a corresponding standby service unit, the failed service unit is not switched.

Preferably, the determining that any active service unit of the node device in the network fails in S101 includes: receiving fault information about any main service unit sent by node equipment in a network; or, determining that a main service unit of the master control device fails; alternatively, it is determined that the node apparatus loses the response. Preferably, when it is determined that the node device loses the response, it is determined that each of the active service unit and the standby service unit in the node device that loses the response fails.

Preferably, the scheme further includes exception handling for each service unit: when the master control device determines that any one of the standby service units fails, the master service unit corresponding to the standby service unit is determined as the master service unit corresponding to the standby service unit which does not fail and is used for providing the same service.

Preferably, the scheme includes the configuration of each service unit during the network startup. Within a preset length of time after the master control device is started, the method further comprises: the main control equipment receives a state query request of the service unit and is used for determining whether the service unit is in a main state or a standby state; the main control equipment determines that the service unit is in a main state or a standby state, and when the service unit is in the standby state, the main control equipment determines a main service unit corresponding to the standby service unit; the main control equipment sends a state indication in a main state or a standby state to the service unit; when the service unit is determined to be in the standby state, the state indication further includes information of the active service unit corresponding to the service unit in the standby state. And within the preset length of time, each service unit can finish normal starting.

Preferably, the determining that the service unit is in the active state or the standby state includes: judging whether the number of main service units for providing the same service in the network reaches a preset number, if so, determining that the service units are in a standby state; otherwise, determining that the service unit is in the active state. The main service unit is configured first, and then the standby service unit is configured, so as to ensure the number of the main service units which normally provide service.

The scheme is used for configuring the main service units corresponding to the standby service units in a balanced manner.

Preferably, when it is determined that the service unit is in the standby state, determining the active service unit corresponding to the standby service unit includes: determining the number M of standby service units and the number N of main service units which are used for providing the same service in a pre-configured network; determining the number N/M of the main service units corresponding to each standby service unit; when N/M is not an integer, taking the minimum integer larger than N/M; when the number of the main service units with undetermined corresponding relations is larger than N/M, determining the N/M main service units with undetermined corresponding relations as the main service units corresponding to the standby service units; and when the number of the main service units with undetermined corresponding relations is less than N/M, determining all the main service units with undetermined corresponding relations as the main service units corresponding to the standby service units. A plurality of N + M service unit combinations are configured in the system, and the service units are respectively backed up and allocated according to the above mode by each combination. The values of N and M may take any positive number, with a 1+1 backup when N =1.M =1.

Preferably, after the master device finishes starting for a preset length of time, the method includes: when the number of the standby service units for providing the same service determined by the main control device is smaller than the number of the standby service units in the network configured in advance, determining the main service unit of which the corresponding relation is not determined as the main service unit corresponding to the determined standby service unit. And taking the preset length of time as waiting time, and determining the main service unit of which the corresponding relation is not determined as the main service unit corresponding to the determined standby service unit after the waiting time is exceeded under the condition that node equipment where the service unit is located in the network is not started or the service unit fails to normally respond after the service unit is started.

Preferably, the determining of the primary service unit with the undetermined correspondence as the primary service unit corresponding to the determined standby service unit is performed when the number of the primary service units corresponding to the standby service unit does not reach the preset threshold. The maximum number of the main service units which can be used by each standby service unit is the preset threshold, and when the number of the main service units corresponding to the standby service units is greater than the preset threshold, the main service units are not allocated any more even if the main service units which do not determine the corresponding relationship exist.

Preferably, after the primary service unit for which the correspondence relationship is not determined is determined as the primary service unit corresponding to the determined standby service unit, the method further includes: when determining that a newly added standby service unit exists and no main service unit with undetermined corresponding relation exists, re-determining part of main service units corresponding to other standby service units for providing the same service as the main service unit corresponding to the newly added standby service unit; when determining that a newly added standby service unit exists and a main service unit with an undetermined corresponding relationship exists, determining the main service unit with the undetermined corresponding relationship as a main service unit corresponding to the newly added standby service unit; if the number of the main service units corresponding to the newly added standby service unit does not reach the preset threshold, re-determining a part of the main service units corresponding to other standby service units for providing the same service as the main service units corresponding to the newly added standby service unit.

That is, when the main control device determines that there is a newly added standby service unit, it preferentially determines the active service unit that does not determine the correspondence relationship as the active service unit corresponding to the newly added standby service unit, and the number of the active service units corresponding to the newly added standby service unit also cannot exceed the preset threshold. If the main service units with the corresponding relationship are not determined to be completely distributed, the corresponding relationship among the service units is transferred according to the number of the main service units corresponding to each standby service unit so as to meet the balanced distribution among the standby service units.

Referring to fig. 2, another resource switching method provided in the embodiment of the present invention includes:

s201, receiving indication information sent by a main control device in a network by a node device;

s202, the node equipment determines the standby service unit as a main service unit according to the indication information;

each service unit is used for providing a set service.

Preferably, the method further comprises: when the node equipment determines that any main service unit or standby service unit has a fault, the process where the main service unit or standby service unit is located is restarted, and fault information of the main service unit or standby service unit is sent to the main control equipment.

Preferably, the method further comprises: when any service unit of the node equipment is started, sending a state query request of the service unit to the main control equipment to determine whether the service unit is in a main state or a standby state; the node equipment receives a state indication fed back by the master control equipment according to the state query request; and the node equipment determines that the service unit is in the active state according to the state indication fed back by the main control equipment, or determines that the service unit is in the standby state and acquires the information of the active service unit corresponding to the service unit in the standby state from the state indication.

Taking a single board as an example, the following provides a specific embodiment of the present invention.

Fig. 3 is a schematic logic diagram of a service unit relationship in a network according to an embodiment of the present invention. The main control board of the system controls the node board, and deploys a global backup management (gbackup) component to manage the service units of the whole system. Each node board is provided with an lbackup component (local backup management) for managing the service unit of the node board. A plurality of components are deployed in the process, the component C1 has no backup relation, the components C2 and C3 are planned to be a service unit bu1, and the components C4 and C5 are planned to be a service unit bu 2. There may also be a service unit configuration on the main control board.

Fig. 4 illustrates service unit planning, where service units bu1 and bu3 belong to service group bg1, and service units bu2 and bu4 belong to service group bg 2. Wherein the c2 component of bu1 and the c2 component of bu3 have backup relationship, the c3 component of bu1 and the c3 component of bu4 have backup relationship, the c4 component of bu2 and the c4 component of bu4 have backup relationship, and the c5 component of bu2 and the c5 component of bu4 have backup relationship.

Fig. 5 shows a general N + M backup manner (N =3, M = 2), where a bg includes 5 service units (bu1, bu2, bu3, bu4, and bu5), 3 service units are active and 2 service units are standby, and the active-standby relationship is determined during system operation. Wherein the components c1, c3, c5, c7 and c9 have backup relations, and the components c2, c4, c6, c8 and c10 have backup relations. The processes with backup relationship provide the same business service.

The number of the business units deployed in one process is not limited. According to the operation situation, a service unit deployed on one board may be a main service unit or a standby service unit, and a service unit included in one service group may be deployed on different boards or on the same board. If the service units are deployed on the same single board, the service units need to be deployed in different processes to ensure that the exception can be recovered in time.

When each single board is started, loading the process on the single board according to the data configuration, and loading and activating the components in the process; the startup among the single boards has no influence, and all the components are processed in parallel. Referring to fig. 6, the initialization process of the single board working network includes:

s601, when the gbackup component on the main control board receives the activation command, the service group information, the service unit information and the information of the components on the service unit of the whole system are obtained from the database and stored.

S602, when the lbackup component on the node board receives the activation command, the information of the service unit configured by the node is inquired for the gbackup component on the main control board, and the request message carries the physical address of the request message.

S603, other components receive the activating command, perform initialization processing on the components and register the components in the lbackup.

S604, the lbackup component receives and records the configuration information, judges whether all components in the configured service unit are registered, and sends a service unit state query request to the gbackup component on the main control board if all components in a certain service unit are registered.

S605, the gbackup component receives the service unit state query request, determines the state of the service unit according to the pre-configured strategy and returns a response to the lbackup.

S606, the lbackup component receives the service unit state response message and sequentially sends a state indication message to the components in the service unit, the components receive the indication message and process the indication message according to the main/standby state and return the response to the lbackup, the lbackup judges that the responses of all the components in the service unit are received, and then sends a service unit state setting completion indication message to the gbackup component, and the gbackup component receives the message and stores the message.

S607, the service unit starts to work after determining its own active/standby state, the components of the active service unit provide service together, and the components of the standby service unit provide backup service.

Fig. 7 is a schematic diagram of single board interaction in the initialization process.

Since the state of each board is unknown at the network initialization stage, a timer needs to be started when the state is set for the standby service unit during the period, and the timer acts to ensure that all the standby service units are normal after the timeout of the timer is up. Taking the N + M backup design as an example, referring to fig. 8, the configuration policy of the main control board for the service unit includes the following steps:

s801, setting the state as primary according to the sequence of the service unit state requests, and then setting the state as standby;

before the states of the N main service units are not set, the M standby service units are not set.

The gbackup component receives the service unit state query request, firstly judges the number of the main service units in the service group, if the number is less than N, sets the service units as main and responds back to the lbackup; and if the number of the main service units reaches N, setting the service unit state as standby.

S802, when the standby service units are set, N/M (rounding up) main service units with undetermined backup relationship are searched in the existing main service units, and the information of the standby service units and the standby main information to be backed up are sent to the corresponding lbackup. And if the number of undetermined backup relations in the main service unit is less than N/M (rounding up), taking all the surplus.

S803, after the timer is overtime, it is determined whether all the M standby service units have been determined, and if there are less than M standby service units (possibly, the board where the service units are located is not started), a load balancing policy needs to be executed, so as to distribute the balance of the main service units, for which the backup relationship is not determined, to all the normal standby service units.

The processes of S801-S803 are executed in parallel by each service group without mutual influence.

After the timer is overtime, the processing of each service unit comprises the following conditions:

1. and a service unit state query request: the service unit is set as a standby service, and the N/M pieces of main information backed up by other standby service units are moved to the service unit for backup. And the backup balance of the standby service unit is ensured.

2. Detecting and processing faults of each service unit, referring to fig. 9, the specific steps include:

s901, carrying out fault detection on each service unit;

the lbackup component carries out health detection on other components, if the heartbeat of the components in the service unit is lost, the service unit is considered to be in failure, the process of the service unit is restarted, and the gbackup component is informed to process.

And the gbackup component performs inter-node heartbeat detection, and if a certain node is abnormal, the gbackup component judges the state of the service unit configured on the node and sequentially processes the service units. If there is main service unit, informing the standby service unit to change the state to main; if the standby service unit exists, the main service unit is informed of the change of the backup object.

The processing of the failure of the active service unit is transferred to S902, and the processing of the failure of the standby service unit is transferred to S903.

S902, processing the failure of the service unit;

and (3) processing the failure of the main service unit: after the gbackup component detects that the main service unit fails, the state of the service unit is set to be unknown, the standby service unit corresponding to the main service unit is searched, the state of the standby service unit is set to be main after the normal corresponding standby service unit is determined, the lbackup on the node where the standby service unit is located is informed to carry out state setting processing, the lbackup sequentially carries out state setting on the components in the service unit, and the gbackup record is informed after the state setting is finished. If the standby service unit also backs up other primary information, the primary backup information needs to be released.

And restarting the process after the main service unit fails, and carrying out a state query request to the gbackup component again.

Processing the standby service unit failure: when the standby service unit fails, the gbackup component detects that the state of the service unit is unknown, and simultaneously informs the standby main service unit to switch the standby relation to another standby service unit (if a normal standby service unit exists).

And restarting the process after the standby service unit fails, and carrying out a state query request to the gbackup component again by the service unit.

Referring to fig. 10, a master device provided in an embodiment of the present invention includes:

a selecting unit 1001, configured to determine, when it is determined that any active service unit of a node device in a network fails, a standby service unit corresponding to the failed active service unit;

a switching unit 1002, configured to determine the standby service unit as a primary service unit;

each service unit is used for providing a set service.

Preferably, the node device is configured to send indication information to the node device where the standby service unit is located when the standby service unit is located in the node device, so as to indicate the node device where the standby service unit is located to determine the standby service unit as the main service unit; and when the standby service unit is positioned in the main control equipment, the main control equipment determines the standby service unit as the main service unit.

Preferably, the determining, by the selecting unit, that any active service unit in the network fails includes: receiving fault information about any main service unit sent by node equipment in a network; or, determining that a main service unit of the master control device fails; alternatively, it is determined that the node apparatus loses the response.

Preferably, when the selection unit determines that the node device loses the response, it is determined that each of the active service unit and the standby service unit in the node device that loses the response fails.

Preferably, the apparatus further comprises: and the standby fault processing unit is used for determining the main service unit corresponding to the standby service unit as the main service unit corresponding to the standby service unit which is used for providing the same service and does not have faults when determining that any standby service unit has faults.

Preferably, the device further includes a start processing unit, configured to receive a status query request of the service unit within a preset length of time after the master control device is started, so as to determine that the service unit is in the active state or the standby state; determining that a service unit is in a main state or a standby state, and determining a main service unit corresponding to a standby service unit when the service unit is in the standby state; sending a state indication in a main state or a standby state to a service unit; when the service unit is determined to be in the standby state, the state indication further includes information of the active service unit corresponding to the service unit in the standby state.

Preferably, the step of determining, by the start processing unit, that the service unit is in the active state or the standby state includes: judging whether the number of main service units for providing the same service in the network reaches a preset number, if so, determining that the service units are in a standby state; otherwise, determining that the service unit is in the active state.

Preferably, when it is determined that the service unit is in the standby state, the determining, by the start processing unit, the active service unit corresponding to the standby service unit includes: determining the number M of standby service units and the number N of main service units which are used for providing the same service in a pre-configured network, and determining the number N/M of the main service units corresponding to each standby service unit; when N/M is not an integer, taking the minimum integer larger than N/M; when the number of the main service units with undetermined corresponding relations is larger than N/M, determining the N/M main service units with undetermined corresponding relations as the main service units corresponding to the standby service units; and when the number of the main service units with undetermined corresponding relations is less than N/M, determining all the main service units with undetermined corresponding relations as the main service units corresponding to the standby service units.

Preferably, the device further includes a balanced allocation unit, configured to determine, after the master control device completes starting for the preset length of time, the active service unit that does not determine the correspondence relationship as the active service unit corresponding to the determined standby service unit when the number of the determined standby service units for providing the same service is smaller than the number of the standby service units in the network configured in advance.

Preferably, the determining of the primary service unit with the undetermined correspondence as the primary service unit corresponding to the determined standby service unit is performed when the number of the primary service units corresponding to the standby service unit does not reach the preset threshold.

Preferably, the balanced distribution unit is further configured to, after determining the primary service unit with the undetermined correspondence as the primary service unit corresponding to the determined standby service unit, when determining that there is a newly added standby service unit and there is no primary service unit with an undetermined correspondence, re-determine a part of primary service units corresponding to other standby service units for providing the same service as the primary service unit corresponding to the newly added standby service unit; when determining that a newly added standby service unit exists and a main service unit with an undetermined corresponding relationship exists, determining the main service unit with the undetermined corresponding relationship as a main service unit corresponding to the newly added standby service unit; if the number of the main service units corresponding to the newly added standby service unit does not reach the preset threshold, re-determining a part of the main service units corresponding to other standby service units for providing the same service as the main service units corresponding to the newly added standby service unit.

Referring to fig. 11, a node device provided in an embodiment of the present invention includes:

a receiving unit 1101, configured to receive indication information sent by a master device in a network;

a switching unit 1102, configured to determine, according to the indication information, a standby service unit as a primary service unit;

each service unit is used for providing a set service.

Preferably, the apparatus further comprises: and the fault processing unit is used for restarting the process of the main service unit or the standby service unit when the node equipment determines that any main service unit or standby service unit has a fault, and sending the fault information of the main service unit or the standby service unit to the main control equipment.

Preferably, the apparatus further comprises a status query unit configured to: when any service unit finishes starting, sending a state query request of the service unit to the main control equipment to determine whether the service unit is in a main state or a standby state; receiving a state indication fed back by the master control equipment according to the state query request; and determining that the service unit is in the active state according to the state indication fed back by the main control device, or determining that the service unit is in the standby state and acquiring the information of the active service unit corresponding to the service unit in the standby state from the state indication.

In summary, embodiments of the present invention provide a resource switching method, a master control device, and a node device, so as to switch a service unit when a service unit fails, ensure normal service, not affect other service units on a device, and improve utilization efficiency of the device.

As will be appreciated by one skilled in the art, embodiments of the present invention may be provided as a method, system, or computer program product. Accordingly, the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. Furthermore, the present invention may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, optical storage, and the like) having computer-usable program code embodied therein.

The present invention is described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the invention. It will be understood that each flow and/or block of the flow diagrams and/or block diagrams, and combinations of flows and/or blocks in the flow diagrams and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.

These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function specified in the flowchart flow or flows and/or block diagram block or blocks.

These computer program instructions may also be loaded onto a computer or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide steps for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.

It will be apparent to those skilled in the art that various changes and modifications may be made in the present invention without departing from the spirit and scope of the invention. Thus, if such modifications and variations of the present invention fall within the scope of the claims of the present invention and their equivalents, the present invention is also intended to include such modifications and variations.

Claims

1. A method for switching resources, the method comprising:

each service unit is used for providing a set service.

2. The method of claim 1, wherein the determining, by the master device, the standby service unit as the active service unit comprises:

when the standby service unit is located in the node device, the main control device sends indication information to the node device where the standby service unit is located, so as to indicate the node device where the standby service unit is located to determine the standby service unit as a main service unit;

and when the standby service unit is positioned in the main control equipment, the main control equipment determines the standby service unit as the main service unit.

3. The method of claim 1, wherein the determining that any active service element in the network is faulty comprises:

receiving fault information about any main service unit sent by node equipment in a network; or,

determining that a main service unit of the main control equipment fails; or,

it is determined that the node device loses response.

4. The method of claim 3, wherein when it is determined that the node device loses response, it is determined that each of the active service unit and the standby service unit in the node device that loses response has failed.

5. The method of claim 1, further comprising:

when the master control device determines that any one of the standby service units fails, the master service unit corresponding to the standby service unit is determined as the master service unit corresponding to the standby service unit which does not fail and is used for providing the same service.

6. The method of claim 1, wherein within a preset length of time after the master device is started, the method further comprises:

the main control equipment receives a state query request of the service unit and is used for determining whether the service unit is in a main state or a standby state;

the main control equipment determines that the service unit is in a main state or a standby state, and when the service unit is in the standby state, the main control equipment determines a main service unit corresponding to the standby service unit;

the main control equipment sends a state indication in a main state or a standby state to the service unit; when the service unit is determined to be in the standby state, the state indication further includes information of the active service unit corresponding to the service unit in the standby state.

7. The method of claim 6, wherein the determining that the service element is in the active state or the standby state comprises:

judging whether the number of main service units for providing the same service in the network reaches a preset number, if so, determining that the service units are in a standby state; otherwise, determining that the service unit is in the active state.

8. The method of claim 7, wherein when it is determined that the service unit is in the standby state, determining the active service unit corresponding to the standby service unit comprises:

determining the number M of standby service units and the number N of main service units which are used for providing the same service in a pre-configured network; determining the number N/M of the main service units corresponding to each standby service unit; when N/M is not an integer, taking the minimum integer larger than N/M;

when the number of the main service units with undetermined corresponding relations is larger than N/M, determining the N/M main service units with undetermined corresponding relations as the main service units corresponding to the standby service units;

and when the number of the main service units with undetermined corresponding relations is less than N/M, determining all the main service units with undetermined corresponding relations as the main service units corresponding to the standby service units.

9. The method of claim 6, 7 or 8, wherein after the master device completes booting for a preset length of time, the method comprises:

when the number of the standby service units for providing the same service determined by the main control device is smaller than the number of the standby service units in the network configured in advance, determining the main service unit of which the corresponding relation is not determined as the main service unit corresponding to the determined standby service unit.

10. The method according to claim 9, wherein the determining of the primary service unit with no determined correspondence relationship as the primary service unit corresponding to the determined standby service unit is performed when the number of the primary service units corresponding to the standby service unit does not reach a preset threshold.

11. The method according to claim 10, wherein after determining the active service unit whose correspondence is not determined as the active service unit corresponding to the determined standby service unit, the method further comprises:

when the main control device determines that a newly-added standby service unit exists and no main service unit with undetermined corresponding relation exists, re-determining a part of main service units corresponding to other standby service units for providing the same service as the main service unit corresponding to the newly-added standby service unit;

when the main control equipment determines that a newly added standby service unit exists and a main service unit with an undetermined corresponding relation exists, determining the main service unit with the undetermined corresponding relation as a main service unit corresponding to the newly added standby service unit; if the number of the main service units corresponding to the newly added standby service unit does not reach the preset threshold, re-determining a part of the main service units corresponding to other standby service units for providing the same service as the main service units corresponding to the newly added standby service unit.

12. A method for switching resources, the method comprising:

each service unit is used for providing a set service.

13. The method of claim 12, further comprising:

when the node equipment determines that any main service unit or standby service unit has a fault, the process where the main service unit or standby service unit is located is restarted, and fault information of the main service unit or standby service unit is sent to the main control equipment.

14. The method of claim 12, further comprising:

when any service unit of the node equipment is started, sending a state query request of the service unit to the main control equipment to determine whether the service unit is in a main state or a standby state;

the node equipment receives a state indication fed back by the master control equipment according to the state query request;

and the node equipment determines that the service unit is in the active state according to the state indication fed back by the main control equipment, or determines that the service unit is in the standby state and acquires the information of the active service unit corresponding to the service unit in the standby state from the state indication.

15. A master device, comprising:

the selection unit is used for determining a standby service unit corresponding to a main service unit which fails when determining that any main service unit in the network fails;

each service unit is used for providing a set service.

16. The device according to claim 15, wherein the switching unit is specifically configured to:

when the standby service unit is located in the node device, sending indication information to the node device where the standby service unit is located, so as to indicate the node device where the standby service unit is located to determine the standby service unit as a main service unit;

17. The apparatus of claim 15, wherein the selecting unit determining that any active service unit in the network has failed comprises:

determining that a main service unit of the main control equipment fails; or,

it is determined that the node device loses response.

18. The apparatus according to claim 17, wherein when the selection unit determines that the node apparatus loses the response, it is determined that each of the active service unit and the standby service unit in the node apparatus that loses the response has failed.

19. The apparatus of claim 15, further comprising:

and the standby fault processing unit is used for determining the main service unit corresponding to the standby service unit as the main service unit corresponding to the standby service unit which is used for providing the same service and does not have faults when determining that any standby service unit has faults.

20. The device of claim 15, further comprising a boot processing unit for, within a preset length of time after the master device boots up,

receiving a state query request of a service unit to determine whether the service unit is in a main state or a standby state;

determining that a service unit is in a main state or a standby state, and determining a main service unit corresponding to a standby service unit when the service unit is in the standby state;

sending a state indication in a main state or a standby state to a service unit; when the service unit is determined to be in the standby state, the state indication further includes information of the active service unit corresponding to the service unit in the standby state.

21. The apparatus of claim 20, wherein the initiating processing unit determining that the service unit is in the active state or the standby state comprises:

22. The apparatus as claimed in claim 21, wherein when determining that the service unit is in the standby state, the starting processing unit determining the active service unit corresponding to the standby service unit includes:

23. The device according to claim 20, 21 or 22, wherein the device further comprises an equalization allocation unit for, after the master device completes booting for a preset length of time,

and when the number of the determined standby service units for providing the same service is less than the number of the standby service units in the pre-configured network, determining the main service unit of which the corresponding relation is not determined as the main service unit corresponding to the determined standby service unit.

24. The apparatus according to claim 23, wherein the determining of the active service unit whose correspondence is not determined as the active service unit corresponding to the determined standby service unit is performed when the number of the active service units corresponding to the standby service unit does not reach a preset threshold.

25. The apparatus of claim 24, wherein the balanced allocation unit is further configured to, after determining the active service unit with no determined correspondence as the active service unit corresponding to the determined standby service unit,

when determining that a newly added standby service unit exists and no main service unit with undetermined corresponding relation exists, re-determining part of main service units corresponding to other standby service units for providing the same service as the main service unit corresponding to the newly added standby service unit;

when determining that a newly added standby service unit exists and a main service unit with an undetermined corresponding relationship exists, determining the main service unit with the undetermined corresponding relationship as a main service unit corresponding to the newly added standby service unit; if the number of the main service units corresponding to the newly added standby service unit does not reach the preset threshold, re-determining a part of the main service units corresponding to other standby service units for providing the same service as the main service units corresponding to the newly added standby service unit.

26. A node apparatus, characterized in that the apparatus comprises:

each service unit is used for providing a set service.

27. The apparatus of claim 26, further comprising:

and the fault processing unit is used for restarting the process of the main service unit or the standby service unit when the node equipment determines that any main service unit or standby service unit has a fault, and sending the fault information of the main service unit or the standby service unit to the main control equipment.

28. The apparatus of claim 26, further comprising a status query unit to:

when any service unit finishes starting, sending a state query request of the service unit to the main control equipment to determine whether the service unit is in a main state or a standby state;

receiving a state indication fed back by the master control equipment according to the state query request;

and determining that the service unit is in the active state according to the state indication fed back by the main control device, or determining that the service unit is in the standby state and acquiring the information of the active service unit corresponding to the service unit in the standby state from the state indication.