CN103139066A - Source switching method, main control equipment and node equipment - Google Patents

Source switching method, main control equipment and node equipment Download PDF

Info

Publication number
CN103139066A
CN103139066A CN2013100351342A CN201310035134A CN103139066A CN 103139066 A CN103139066 A CN 103139066A CN 2013100351342 A CN2013100351342 A CN 2013100351342A CN 201310035134 A CN201310035134 A CN 201310035134A CN 103139066 A CN103139066 A CN 103139066A
Authority
CN
China
Prior art keywords
service unit
standby
service
main
unit
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN2013100351342A
Other languages
Chinese (zh)
Other versions
CN103139066B (en
Inventor
杨焕军
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Datang Mobile Communications Equipment Co Ltd
Original Assignee
Datang Mobile Communications Equipment Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Datang Mobile Communications Equipment Co Ltd filed Critical Datang Mobile Communications Equipment Co Ltd
Priority to CN201310035134.2A priority Critical patent/CN103139066B/en
Publication of CN103139066A publication Critical patent/CN103139066A/en
Application granted granted Critical
Publication of CN103139066B publication Critical patent/CN103139066B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Data Exchanges In Wide-Area Networks (AREA)

Abstract

The invention provides a source switching method, main control equipment and node equipment. The source switching method, the main control equipment and the node equipment are used for switching of business units when one business unit breaks down, guaranteeing the normal business service, preventing influence on other business units, and improving utilization rate of equipment. The source switching method comprises the steps of determining a spare business unit corresponding to a broken main business unit when the main control equipment determines that one main business unit breaks down, and determining the spare business unit to be used as a main business unit by the main control equipment. Each business unit is used for providing a certain business service.

Description

Resource switching method, master control equipment and node equipment
Technical Field
The present invention relates to the field of mobile communications, and in particular, to a resource switching method, a master control device, and a node device.
Background
In a communication system, in order to ensure high reliability, backup processing needs to be performed on a service flow, so that the service cannot be recovered after equipment such as a server, an industrial personal computer and a single board fails. In the traditional backup mode, the device is taken as a backup unit, and as long as a service module in the main device fails, switching is triggered, the main device is reset, the standby device is started up, and the service module on the new main device continues to provide service. As long as a certain service module on the main equipment has a fault, the switching is triggered without considering the working states of other service modules on the equipment.
Therefore, the equipment switching mode in the prior art has low equipment utilization rate.
Disclosure of Invention
The embodiment of the invention provides a resource switching method, a master control device and a node device, which are used for switching a service unit when the service unit fails, so that other service units on the device are not influenced, normal service is guaranteed, and the utilization efficiency of the device is improved.
The resource switching method provided by the embodiment of the invention comprises the following steps:
when the master control equipment determines that any main service unit in the network fails, determining a standby service unit corresponding to the failed main service unit;
the main control equipment determines the standby service unit as a main service unit;
each service unit is used for providing a set service.
Another resource switching method provided in the embodiments of the present invention includes:
the node equipment receives indication information sent by main control equipment in a network;
the node equipment determines the standby service unit as a main service unit according to the indication information;
each service unit is used for providing a set service.
The embodiment of the invention provides a main control device, which comprises:
a selecting unit, configured to determine, when it is determined that any one of the master service units of the node devices in the network fails, a standby service unit corresponding to the failed master service unit;
the switching unit is used for determining the standby service unit as a main service unit;
each service unit is used for providing a set service.
The node device provided by the embodiment of the invention comprises:
the receiving unit is used for receiving the indication information sent by the main control equipment in the network;
the switching unit is used for determining the standby service unit as the main service unit according to the indication information;
each service unit is used for providing a set service.
According to the technical scheme, when the master control equipment determines that any main service unit in the network fails, the standby service unit corresponding to the failed main service unit is determined; the main control equipment determines the standby service unit as a main service unit; each service unit is used for providing a set service. The service unit in the process is used as the object for judging and switching the fault, so that the service unit for providing other service in the equipment where the main service unit with the fault is located is not influenced, and the utilization efficiency of the equipment is improved.
Drawings
Fig. 1 is a flowchart illustrating a resource switching method according to an embodiment of the present invention;
fig. 2 is a flowchart illustrating another resource switching method according to an embodiment of the present invention;
fig. 3 is a logic diagram of a service unit relationship in a network according to an embodiment of the present invention;
fig. 4 is a logic diagram of service group division in a network according to an embodiment of the present invention;
fig. 5 is a logic diagram of an N + M backup in a network according to an embodiment of the present invention;
fig. 6 is a schematic flowchart of initializing a service unit working network according to an embodiment of the present invention;
fig. 7 is a schematic diagram of a single board interaction in an initialization process according to an embodiment of the present invention;
fig. 8 is a schematic flowchart of a configuration service unit of a main control board according to an embodiment of the present invention;
fig. 9 is a schematic flowchart of service unit fault processing according to an embodiment of the present invention;
fig. 10 is a schematic structural diagram of a master device according to an embodiment of the present invention;
fig. 11 is a schematic structural diagram of a node device according to an embodiment of the present invention.
Detailed Description
The embodiment of the invention provides a resource switching method, a master control device and a node device, which are used for switching a service unit when the service unit fails, ensuring normal service, not influencing other service units on the device and improving the utilization efficiency of the device.
The embodiment of the invention utilizes a multi-process operating system to expand the function and application range of the equipment. The components are basic units of the equipment program during running, have independent compiling and evolution functions, each component provides an independent service, and a plurality of components provide a set service together. All components for providing a set service are determined as a service unit, and various service units are determined according to the service provided by the equipment. In the device network, a plurality of service units providing the same service are provided, including a main service unit and a standby service unit, and each service unit providing the same service is configured to operate in different processes. The standby service unit is used for providing the service, and when the main service unit corresponding to the standby service unit fails, the standby service unit is determined to be the main service unit and is used for continuously providing the service.
Referring to fig. 1, a resource switching method provided in the embodiment of the present invention includes:
s101, when the master control equipment determines that any main service unit in the network fails, determining a standby service unit corresponding to the failed main service unit;
s102, the main control device determines the standby service unit as a main service unit;
each service unit is used for providing a set service.
Preferably, when the standby service unit is located in the node device, the main control device sends indication information to the node device where the standby service unit is located, so as to indicate the node device where the standby service unit is located to determine the standby service unit as the main service unit; and when the standby service unit is positioned in the main control equipment, the main control equipment determines the standby service unit as the main service unit.
Preferably, when the failed main service unit does not have a corresponding standby service unit which operates normally, or when the failed main service unit does not have a corresponding standby service unit, the failed service unit is not switched.
Preferably, the determining that any active service unit of the node device in the network fails in S101 includes: receiving fault information about any main service unit sent by node equipment in a network; or, determining that a main service unit of the master control device fails; alternatively, it is determined that the node apparatus loses the response. Preferably, when it is determined that the node device loses the response, it is determined that each of the active service unit and the standby service unit in the node device that loses the response fails.
Preferably, the scheme further includes exception handling for each service unit: when the master control device determines that any one of the standby service units fails, the master service unit corresponding to the standby service unit is determined as the master service unit corresponding to the standby service unit which does not fail and is used for providing the same service.
Preferably, the scheme includes the configuration of each service unit during the network startup. Within a preset length of time after the master control device is started, the method further comprises: the main control equipment receives a state query request of the service unit and is used for determining whether the service unit is in a main state or a standby state; the main control equipment determines that the service unit is in a main state or a standby state, and when the service unit is in the standby state, the main control equipment determines a main service unit corresponding to the standby service unit; the main control equipment sends a state indication in a main state or a standby state to the service unit; when the service unit is determined to be in the standby state, the state indication further includes information of the active service unit corresponding to the service unit in the standby state. And within the preset length of time, each service unit can finish normal starting.
Preferably, the determining that the service unit is in the active state or the standby state includes: judging whether the number of main service units for providing the same service in the network reaches a preset number, if so, determining that the service units are in a standby state; otherwise, determining that the service unit is in the active state. The main service unit is configured first, and then the standby service unit is configured, so as to ensure the number of the main service units which normally provide service.
The scheme is used for configuring the main service units corresponding to the standby service units in a balanced manner.
Preferably, when it is determined that the service unit is in the standby state, determining the active service unit corresponding to the standby service unit includes: determining the number M of standby service units and the number N of main service units which are used for providing the same service in a pre-configured network; determining the number N/M of the main service units corresponding to each standby service unit; when N/M is not an integer, taking the minimum integer larger than N/M; when the number of the main service units with undetermined corresponding relations is larger than N/M, determining the N/M main service units with undetermined corresponding relations as the main service units corresponding to the standby service units; and when the number of the main service units with undetermined corresponding relations is less than N/M, determining all the main service units with undetermined corresponding relations as the main service units corresponding to the standby service units. A plurality of N + M service unit combinations are configured in the system, and the service units are respectively backed up and allocated according to the above mode by each combination. The values of N and M may take any positive number, with a 1+1 backup when N =1.M =1.
Preferably, after the master device finishes starting for a preset length of time, the method includes: when the number of the standby service units for providing the same service determined by the main control device is smaller than the number of the standby service units in the network configured in advance, determining the main service unit of which the corresponding relation is not determined as the main service unit corresponding to the determined standby service unit. And taking the preset length of time as waiting time, and determining the main service unit of which the corresponding relation is not determined as the main service unit corresponding to the determined standby service unit after the waiting time is exceeded under the condition that node equipment where the service unit is located in the network is not started or the service unit fails to normally respond after the service unit is started.
Preferably, the determining of the primary service unit with the undetermined correspondence as the primary service unit corresponding to the determined standby service unit is performed when the number of the primary service units corresponding to the standby service unit does not reach the preset threshold. The maximum number of the main service units which can be used by each standby service unit is the preset threshold, and when the number of the main service units corresponding to the standby service units is greater than the preset threshold, the main service units are not allocated any more even if the main service units which do not determine the corresponding relationship exist.
Preferably, after the primary service unit for which the correspondence relationship is not determined is determined as the primary service unit corresponding to the determined standby service unit, the method further includes: when determining that a newly added standby service unit exists and no main service unit with undetermined corresponding relation exists, re-determining part of main service units corresponding to other standby service units for providing the same service as the main service unit corresponding to the newly added standby service unit; when determining that a newly added standby service unit exists and a main service unit with an undetermined corresponding relationship exists, determining the main service unit with the undetermined corresponding relationship as a main service unit corresponding to the newly added standby service unit; if the number of the main service units corresponding to the newly added standby service unit does not reach the preset threshold, re-determining a part of the main service units corresponding to other standby service units for providing the same service as the main service units corresponding to the newly added standby service unit.
That is, when the main control device determines that there is a newly added standby service unit, it preferentially determines the active service unit that does not determine the correspondence relationship as the active service unit corresponding to the newly added standby service unit, and the number of the active service units corresponding to the newly added standby service unit also cannot exceed the preset threshold. If the main service units with the corresponding relationship are not determined to be completely distributed, the corresponding relationship among the service units is transferred according to the number of the main service units corresponding to each standby service unit so as to meet the balanced distribution among the standby service units.
Referring to fig. 2, another resource switching method provided in the embodiment of the present invention includes:
s201, receiving indication information sent by a main control device in a network by a node device;
s202, the node equipment determines the standby service unit as a main service unit according to the indication information;
each service unit is used for providing a set service.
Preferably, the method further comprises: when the node equipment determines that any main service unit or standby service unit has a fault, the process where the main service unit or standby service unit is located is restarted, and fault information of the main service unit or standby service unit is sent to the main control equipment.
Preferably, the method further comprises: when any service unit of the node equipment is started, sending a state query request of the service unit to the main control equipment to determine whether the service unit is in a main state or a standby state; the node equipment receives a state indication fed back by the master control equipment according to the state query request; and the node equipment determines that the service unit is in the active state according to the state indication fed back by the main control equipment, or determines that the service unit is in the standby state and acquires the information of the active service unit corresponding to the service unit in the standby state from the state indication.
Taking a single board as an example, the following provides a specific embodiment of the present invention.
Fig. 3 is a schematic logic diagram of a service unit relationship in a network according to an embodiment of the present invention. The main control board of the system controls the node board, and deploys a global backup management (gbackup) component to manage the service units of the whole system. Each node board is provided with an lbackup component (local backup management) for managing the service unit of the node board. A plurality of components are deployed in the process, the component C1 has no backup relation, the components C2 and C3 are planned to be a service unit bu1, and the components C4 and C5 are planned to be a service unit bu 2. There may also be a service unit configuration on the main control board.
Fig. 4 illustrates service unit planning, where service units bu1 and bu3 belong to service group bg1, and service units bu2 and bu4 belong to service group bg 2. Wherein the c2 component of bu1 and the c2 component of bu3 have backup relationship, the c3 component of bu1 and the c3 component of bu4 have backup relationship, the c4 component of bu2 and the c4 component of bu4 have backup relationship, and the c5 component of bu2 and the c5 component of bu4 have backup relationship.
Fig. 5 shows a general N + M backup manner (N =3, M = 2), where a bg includes 5 service units (bu1, bu2, bu3, bu4, and bu5), 3 service units are active and 2 service units are standby, and the active-standby relationship is determined during system operation. Wherein the components c1, c3, c5, c7 and c9 have backup relations, and the components c2, c4, c6, c8 and c10 have backup relations. The processes with backup relationship provide the same business service.
The number of the business units deployed in one process is not limited. According to the operation situation, a service unit deployed on one board may be a main service unit or a standby service unit, and a service unit included in one service group may be deployed on different boards or on the same board. If the service units are deployed on the same single board, the service units need to be deployed in different processes to ensure that the exception can be recovered in time.
When each single board is started, loading the process on the single board according to the data configuration, and loading and activating the components in the process; the startup among the single boards has no influence, and all the components are processed in parallel. Referring to fig. 6, the initialization process of the single board working network includes:
s601, when the gbackup component on the main control board receives the activation command, the service group information, the service unit information and the information of the components on the service unit of the whole system are obtained from the database and stored.
S602, when the lbackup component on the node board receives the activation command, the information of the service unit configured by the node is inquired for the gbackup component on the main control board, and the request message carries the physical address of the request message.
S603, other components receive the activating command, perform initialization processing on the components and register the components in the lbackup.
S604, the lbackup component receives and records the configuration information, judges whether all components in the configured service unit are registered, and sends a service unit state query request to the gbackup component on the main control board if all components in a certain service unit are registered.
S605, the gbackup component receives the service unit state query request, determines the state of the service unit according to the pre-configured strategy and returns a response to the lbackup.
S606, the lbackup component receives the service unit state response message and sequentially sends a state indication message to the components in the service unit, the components receive the indication message and process the indication message according to the main/standby state and return the response to the lbackup, the lbackup judges that the responses of all the components in the service unit are received, and then sends a service unit state setting completion indication message to the gbackup component, and the gbackup component receives the message and stores the message.
S607, the service unit starts to work after determining its own active/standby state, the components of the active service unit provide service together, and the components of the standby service unit provide backup service.
Fig. 7 is a schematic diagram of single board interaction in the initialization process.
Since the state of each board is unknown at the network initialization stage, a timer needs to be started when the state is set for the standby service unit during the period, and the timer acts to ensure that all the standby service units are normal after the timeout of the timer is up. Taking the N + M backup design as an example, referring to fig. 8, the configuration policy of the main control board for the service unit includes the following steps:
s801, setting the state as primary according to the sequence of the service unit state requests, and then setting the state as standby;
before the states of the N main service units are not set, the M standby service units are not set.
The gbackup component receives the service unit state query request, firstly judges the number of the main service units in the service group, if the number is less than N, sets the service units as main and responds back to the lbackup; and if the number of the main service units reaches N, setting the service unit state as standby.
S802, when the standby service units are set, N/M (rounding up) main service units with undetermined backup relationship are searched in the existing main service units, and the information of the standby service units and the standby main information to be backed up are sent to the corresponding lbackup. And if the number of undetermined backup relations in the main service unit is less than N/M (rounding up), taking all the surplus.
S803, after the timer is overtime, it is determined whether all the M standby service units have been determined, and if there are less than M standby service units (possibly, the board where the service units are located is not started), a load balancing policy needs to be executed, so as to distribute the balance of the main service units, for which the backup relationship is not determined, to all the normal standby service units.
The processes of S801-S803 are executed in parallel by each service group without mutual influence.
After the timer is overtime, the processing of each service unit comprises the following conditions:
1. and a service unit state query request: the service unit is set as a standby service, and the N/M pieces of main information backed up by other standby service units are moved to the service unit for backup. And the backup balance of the standby service unit is ensured.
2. Detecting and processing faults of each service unit, referring to fig. 9, the specific steps include:
s901, carrying out fault detection on each service unit;
the lbackup component carries out health detection on other components, if the heartbeat of the components in the service unit is lost, the service unit is considered to be in failure, the process of the service unit is restarted, and the gbackup component is informed to process.
And the gbackup component performs inter-node heartbeat detection, and if a certain node is abnormal, the gbackup component judges the state of the service unit configured on the node and sequentially processes the service units. If there is main service unit, informing the standby service unit to change the state to main; if the standby service unit exists, the main service unit is informed of the change of the backup object.
The processing of the failure of the active service unit is transferred to S902, and the processing of the failure of the standby service unit is transferred to S903.
S902, processing the failure of the service unit;
and (3) processing the failure of the main service unit: after the gbackup component detects that the main service unit fails, the state of the service unit is set to be unknown, the standby service unit corresponding to the main service unit is searched, the state of the standby service unit is set to be main after the normal corresponding standby service unit is determined, the lbackup on the node where the standby service unit is located is informed to carry out state setting processing, the lbackup sequentially carries out state setting on the components in the service unit, and the gbackup record is informed after the state setting is finished. If the standby service unit also backs up other primary information, the primary backup information needs to be released.
And restarting the process after the main service unit fails, and carrying out a state query request to the gbackup component again.
Processing the standby service unit failure: when the standby service unit fails, the gbackup component detects that the state of the service unit is unknown, and simultaneously informs the standby main service unit to switch the standby relation to another standby service unit (if a normal standby service unit exists).
And restarting the process after the standby service unit fails, and carrying out a state query request to the gbackup component again by the service unit.
Referring to fig. 10, a master device provided in an embodiment of the present invention includes:
a selecting unit 1001, configured to determine, when it is determined that any active service unit of a node device in a network fails, a standby service unit corresponding to the failed active service unit;
a switching unit 1002, configured to determine the standby service unit as a primary service unit;
each service unit is used for providing a set service.
Preferably, the node device is configured to send indication information to the node device where the standby service unit is located when the standby service unit is located in the node device, so as to indicate the node device where the standby service unit is located to determine the standby service unit as the main service unit; and when the standby service unit is positioned in the main control equipment, the main control equipment determines the standby service unit as the main service unit.
Preferably, the determining, by the selecting unit, that any active service unit in the network fails includes: receiving fault information about any main service unit sent by node equipment in a network; or, determining that a main service unit of the master control device fails; alternatively, it is determined that the node apparatus loses the response.
Preferably, when the selection unit determines that the node device loses the response, it is determined that each of the active service unit and the standby service unit in the node device that loses the response fails.
Preferably, the apparatus further comprises: and the standby fault processing unit is used for determining the main service unit corresponding to the standby service unit as the main service unit corresponding to the standby service unit which is used for providing the same service and does not have faults when determining that any standby service unit has faults.
Preferably, the device further includes a start processing unit, configured to receive a status query request of the service unit within a preset length of time after the master control device is started, so as to determine that the service unit is in the active state or the standby state; determining that a service unit is in a main state or a standby state, and determining a main service unit corresponding to a standby service unit when the service unit is in the standby state; sending a state indication in a main state or a standby state to a service unit; when the service unit is determined to be in the standby state, the state indication further includes information of the active service unit corresponding to the service unit in the standby state.
Preferably, the step of determining, by the start processing unit, that the service unit is in the active state or the standby state includes: judging whether the number of main service units for providing the same service in the network reaches a preset number, if so, determining that the service units are in a standby state; otherwise, determining that the service unit is in the active state.
Preferably, when it is determined that the service unit is in the standby state, the determining, by the start processing unit, the active service unit corresponding to the standby service unit includes: determining the number M of standby service units and the number N of main service units which are used for providing the same service in a pre-configured network, and determining the number N/M of the main service units corresponding to each standby service unit; when N/M is not an integer, taking the minimum integer larger than N/M; when the number of the main service units with undetermined corresponding relations is larger than N/M, determining the N/M main service units with undetermined corresponding relations as the main service units corresponding to the standby service units; and when the number of the main service units with undetermined corresponding relations is less than N/M, determining all the main service units with undetermined corresponding relations as the main service units corresponding to the standby service units.
Preferably, the device further includes a balanced allocation unit, configured to determine, after the master control device completes starting for the preset length of time, the active service unit that does not determine the correspondence relationship as the active service unit corresponding to the determined standby service unit when the number of the determined standby service units for providing the same service is smaller than the number of the standby service units in the network configured in advance.
Preferably, the determining of the primary service unit with the undetermined correspondence as the primary service unit corresponding to the determined standby service unit is performed when the number of the primary service units corresponding to the standby service unit does not reach the preset threshold.
Preferably, the balanced distribution unit is further configured to, after determining the primary service unit with the undetermined correspondence as the primary service unit corresponding to the determined standby service unit, when determining that there is a newly added standby service unit and there is no primary service unit with an undetermined correspondence, re-determine a part of primary service units corresponding to other standby service units for providing the same service as the primary service unit corresponding to the newly added standby service unit; when determining that a newly added standby service unit exists and a main service unit with an undetermined corresponding relationship exists, determining the main service unit with the undetermined corresponding relationship as a main service unit corresponding to the newly added standby service unit; if the number of the main service units corresponding to the newly added standby service unit does not reach the preset threshold, re-determining a part of the main service units corresponding to other standby service units for providing the same service as the main service units corresponding to the newly added standby service unit.
Referring to fig. 11, a node device provided in an embodiment of the present invention includes:
a receiving unit 1101, configured to receive indication information sent by a master device in a network;
a switching unit 1102, configured to determine, according to the indication information, a standby service unit as a primary service unit;
each service unit is used for providing a set service.
Preferably, the apparatus further comprises: and the fault processing unit is used for restarting the process of the main service unit or the standby service unit when the node equipment determines that any main service unit or standby service unit has a fault, and sending the fault information of the main service unit or the standby service unit to the main control equipment.
Preferably, the apparatus further comprises a status query unit configured to: when any service unit finishes starting, sending a state query request of the service unit to the main control equipment to determine whether the service unit is in a main state or a standby state; receiving a state indication fed back by the master control equipment according to the state query request; and determining that the service unit is in the active state according to the state indication fed back by the main control device, or determining that the service unit is in the standby state and acquiring the information of the active service unit corresponding to the service unit in the standby state from the state indication.
In summary, embodiments of the present invention provide a resource switching method, a master control device, and a node device, so as to switch a service unit when a service unit fails, ensure normal service, not affect other service units on a device, and improve utilization efficiency of the device.
As will be appreciated by one skilled in the art, embodiments of the present invention may be provided as a method, system, or computer program product. Accordingly, the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. Furthermore, the present invention may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, optical storage, and the like) having computer-usable program code embodied therein.
The present invention is described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the invention. It will be understood that each flow and/or block of the flow diagrams and/or block diagrams, and combinations of flows and/or blocks in the flow diagrams and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be loaded onto a computer or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide steps for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
It will be apparent to those skilled in the art that various changes and modifications may be made in the present invention without departing from the spirit and scope of the invention. Thus, if such modifications and variations of the present invention fall within the scope of the claims of the present invention and their equivalents, the present invention is also intended to include such modifications and variations.

Claims (28)

1. A method for switching resources, the method comprising:
when the master control equipment determines that any main service unit in the network fails, determining a standby service unit corresponding to the failed main service unit;
the main control equipment determines the standby service unit as a main service unit;
each service unit is used for providing a set service.
2. The method of claim 1, wherein the determining, by the master device, the standby service unit as the active service unit comprises:
when the standby service unit is located in the node device, the main control device sends indication information to the node device where the standby service unit is located, so as to indicate the node device where the standby service unit is located to determine the standby service unit as a main service unit;
and when the standby service unit is positioned in the main control equipment, the main control equipment determines the standby service unit as the main service unit.
3. The method of claim 1, wherein the determining that any active service element in the network is faulty comprises:
receiving fault information about any main service unit sent by node equipment in a network; or,
determining that a main service unit of the main control equipment fails; or,
it is determined that the node device loses response.
4. The method of claim 3, wherein when it is determined that the node device loses response, it is determined that each of the active service unit and the standby service unit in the node device that loses response has failed.
5. The method of claim 1, further comprising:
when the master control device determines that any one of the standby service units fails, the master service unit corresponding to the standby service unit is determined as the master service unit corresponding to the standby service unit which does not fail and is used for providing the same service.
6. The method of claim 1, wherein within a preset length of time after the master device is started, the method further comprises:
the main control equipment receives a state query request of the service unit and is used for determining whether the service unit is in a main state or a standby state;
the main control equipment determines that the service unit is in a main state or a standby state, and when the service unit is in the standby state, the main control equipment determines a main service unit corresponding to the standby service unit;
the main control equipment sends a state indication in a main state or a standby state to the service unit; when the service unit is determined to be in the standby state, the state indication further includes information of the active service unit corresponding to the service unit in the standby state.
7. The method of claim 6, wherein the determining that the service element is in the active state or the standby state comprises:
judging whether the number of main service units for providing the same service in the network reaches a preset number, if so, determining that the service units are in a standby state; otherwise, determining that the service unit is in the active state.
8. The method of claim 7, wherein when it is determined that the service unit is in the standby state, determining the active service unit corresponding to the standby service unit comprises:
determining the number M of standby service units and the number N of main service units which are used for providing the same service in a pre-configured network; determining the number N/M of the main service units corresponding to each standby service unit; when N/M is not an integer, taking the minimum integer larger than N/M;
when the number of the main service units with undetermined corresponding relations is larger than N/M, determining the N/M main service units with undetermined corresponding relations as the main service units corresponding to the standby service units;
and when the number of the main service units with undetermined corresponding relations is less than N/M, determining all the main service units with undetermined corresponding relations as the main service units corresponding to the standby service units.
9. The method of claim 6, 7 or 8, wherein after the master device completes booting for a preset length of time, the method comprises:
when the number of the standby service units for providing the same service determined by the main control device is smaller than the number of the standby service units in the network configured in advance, determining the main service unit of which the corresponding relation is not determined as the main service unit corresponding to the determined standby service unit.
10. The method according to claim 9, wherein the determining of the primary service unit with no determined correspondence relationship as the primary service unit corresponding to the determined standby service unit is performed when the number of the primary service units corresponding to the standby service unit does not reach a preset threshold.
11. The method according to claim 10, wherein after determining the active service unit whose correspondence is not determined as the active service unit corresponding to the determined standby service unit, the method further comprises:
when the main control device determines that a newly-added standby service unit exists and no main service unit with undetermined corresponding relation exists, re-determining a part of main service units corresponding to other standby service units for providing the same service as the main service unit corresponding to the newly-added standby service unit;
when the main control equipment determines that a newly added standby service unit exists and a main service unit with an undetermined corresponding relation exists, determining the main service unit with the undetermined corresponding relation as a main service unit corresponding to the newly added standby service unit; if the number of the main service units corresponding to the newly added standby service unit does not reach the preset threshold, re-determining a part of the main service units corresponding to other standby service units for providing the same service as the main service units corresponding to the newly added standby service unit.
12. A method for switching resources, the method comprising:
the node equipment receives indication information sent by main control equipment in a network;
the node equipment determines the standby service unit as a main service unit according to the indication information;
each service unit is used for providing a set service.
13. The method of claim 12, further comprising:
when the node equipment determines that any main service unit or standby service unit has a fault, the process where the main service unit or standby service unit is located is restarted, and fault information of the main service unit or standby service unit is sent to the main control equipment.
14. The method of claim 12, further comprising:
when any service unit of the node equipment is started, sending a state query request of the service unit to the main control equipment to determine whether the service unit is in a main state or a standby state;
the node equipment receives a state indication fed back by the master control equipment according to the state query request;
and the node equipment determines that the service unit is in the active state according to the state indication fed back by the main control equipment, or determines that the service unit is in the standby state and acquires the information of the active service unit corresponding to the service unit in the standby state from the state indication.
15. A master device, comprising:
the selection unit is used for determining a standby service unit corresponding to a main service unit which fails when determining that any main service unit in the network fails;
the switching unit is used for determining the standby service unit as a main service unit;
each service unit is used for providing a set service.
16. The device according to claim 15, wherein the switching unit is specifically configured to:
when the standby service unit is located in the node device, sending indication information to the node device where the standby service unit is located, so as to indicate the node device where the standby service unit is located to determine the standby service unit as a main service unit;
and when the standby service unit is positioned in the main control equipment, the main control equipment determines the standby service unit as the main service unit.
17. The apparatus of claim 15, wherein the selecting unit determining that any active service unit in the network has failed comprises:
receiving fault information about any main service unit sent by node equipment in a network; or,
determining that a main service unit of the main control equipment fails; or,
it is determined that the node device loses response.
18. The apparatus according to claim 17, wherein when the selection unit determines that the node apparatus loses the response, it is determined that each of the active service unit and the standby service unit in the node apparatus that loses the response has failed.
19. The apparatus of claim 15, further comprising:
and the standby fault processing unit is used for determining the main service unit corresponding to the standby service unit as the main service unit corresponding to the standby service unit which is used for providing the same service and does not have faults when determining that any standby service unit has faults.
20. The device of claim 15, further comprising a boot processing unit for, within a preset length of time after the master device boots up,
receiving a state query request of a service unit to determine whether the service unit is in a main state or a standby state;
determining that a service unit is in a main state or a standby state, and determining a main service unit corresponding to a standby service unit when the service unit is in the standby state;
sending a state indication in a main state or a standby state to a service unit; when the service unit is determined to be in the standby state, the state indication further includes information of the active service unit corresponding to the service unit in the standby state.
21. The apparatus of claim 20, wherein the initiating processing unit determining that the service unit is in the active state or the standby state comprises:
judging whether the number of main service units for providing the same service in the network reaches a preset number, if so, determining that the service units are in a standby state; otherwise, determining that the service unit is in the active state.
22. The apparatus as claimed in claim 21, wherein when determining that the service unit is in the standby state, the starting processing unit determining the active service unit corresponding to the standby service unit includes:
determining the number M of standby service units and the number N of main service units which are used for providing the same service in a pre-configured network; determining the number N/M of the main service units corresponding to each standby service unit; when N/M is not an integer, taking the minimum integer larger than N/M;
when the number of the main service units with undetermined corresponding relations is larger than N/M, determining the N/M main service units with undetermined corresponding relations as the main service units corresponding to the standby service units;
and when the number of the main service units with undetermined corresponding relations is less than N/M, determining all the main service units with undetermined corresponding relations as the main service units corresponding to the standby service units.
23. The device according to claim 20, 21 or 22, wherein the device further comprises an equalization allocation unit for, after the master device completes booting for a preset length of time,
and when the number of the determined standby service units for providing the same service is less than the number of the standby service units in the pre-configured network, determining the main service unit of which the corresponding relation is not determined as the main service unit corresponding to the determined standby service unit.
24. The apparatus according to claim 23, wherein the determining of the active service unit whose correspondence is not determined as the active service unit corresponding to the determined standby service unit is performed when the number of the active service units corresponding to the standby service unit does not reach a preset threshold.
25. The apparatus of claim 24, wherein the balanced allocation unit is further configured to, after determining the active service unit with no determined correspondence as the active service unit corresponding to the determined standby service unit,
when determining that a newly added standby service unit exists and no main service unit with undetermined corresponding relation exists, re-determining part of main service units corresponding to other standby service units for providing the same service as the main service unit corresponding to the newly added standby service unit;
when determining that a newly added standby service unit exists and a main service unit with an undetermined corresponding relationship exists, determining the main service unit with the undetermined corresponding relationship as a main service unit corresponding to the newly added standby service unit; if the number of the main service units corresponding to the newly added standby service unit does not reach the preset threshold, re-determining a part of the main service units corresponding to other standby service units for providing the same service as the main service units corresponding to the newly added standby service unit.
26. A node apparatus, characterized in that the apparatus comprises:
the receiving unit is used for receiving the indication information sent by the main control equipment in the network;
the switching unit is used for determining the standby service unit as the main service unit according to the indication information;
each service unit is used for providing a set service.
27. The apparatus of claim 26, further comprising:
and the fault processing unit is used for restarting the process of the main service unit or the standby service unit when the node equipment determines that any main service unit or standby service unit has a fault, and sending the fault information of the main service unit or the standby service unit to the main control equipment.
28. The apparatus of claim 26, further comprising a status query unit to:
when any service unit finishes starting, sending a state query request of the service unit to the main control equipment to determine whether the service unit is in a main state or a standby state;
receiving a state indication fed back by the master control equipment according to the state query request;
and determining that the service unit is in the active state according to the state indication fed back by the main control device, or determining that the service unit is in the standby state and acquiring the information of the active service unit corresponding to the service unit in the standby state from the state indication.
CN201310035134.2A 2013-01-29 2013-01-29 A kind of changing method of resource, main control device and node device Active CN103139066B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310035134.2A CN103139066B (en) 2013-01-29 2013-01-29 A kind of changing method of resource, main control device and node device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310035134.2A CN103139066B (en) 2013-01-29 2013-01-29 A kind of changing method of resource, main control device and node device

Publications (2)

Publication Number Publication Date
CN103139066A true CN103139066A (en) 2013-06-05
CN103139066B CN103139066B (en) 2017-04-05

Family

ID=48498396

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310035134.2A Active CN103139066B (en) 2013-01-29 2013-01-29 A kind of changing method of resource, main control device and node device

Country Status (1)

Country Link
CN (1) CN103139066B (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104200148A (en) * 2014-08-25 2014-12-10 浪潮集团有限公司 Smart card redundancy switching method based on independently domestic redundancy server
WO2016165465A1 (en) * 2015-09-29 2016-10-20 中兴通讯股份有限公司 Network management method, emergency system and storage medium
CN106656530A (en) * 2015-07-27 2017-05-10 中兴通讯股份有限公司 Method and device for generating Operation, Administration and Maintenance alarm

Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6678369B2 (en) * 2000-06-09 2004-01-13 Nms Communications Corporation Network interface redundancy
CN1518234A (en) * 2003-01-17 2004-08-04 深圳市中兴通讯股份有限公司上海第二 Centralized control and hierarchical implementing switching control method and device
CN1747585A (en) * 2004-09-10 2006-03-15 华为技术有限公司 Method and device for realizing service processing unit backup in intelligent network system
CN1750419A (en) * 2004-09-16 2006-03-22 华为技术有限公司 Method and device for realizing single board backup in communication network device
CN1764080A (en) * 2005-11-15 2006-04-26 中兴通讯股份有限公司 Device and method for realizing ASC
CN1852456A (en) * 2005-11-30 2006-10-25 华为技术有限公司 System for realizing business switch-over and method therefor
CN101110862A (en) * 2006-07-18 2008-01-23 中兴通讯股份有限公司 Speech processing system implementing method
CN101150361A (en) * 2006-09-20 2008-03-26 北京格林威尔科技发展有限公司 A method and system for realizing fully protective switching in passive optical network
CN101212723A (en) * 2006-12-27 2008-07-02 大唐移动通信设备有限公司 Cluster network system and failure attenuation method on the basis of the network system
CN102045259A (en) * 2010-12-30 2011-05-04 上海顶竹通讯技术有限公司 Packet switching equipment and method for managing customer service
CN102130718A (en) * 2010-01-20 2011-07-20 中兴通讯股份有限公司 Network element equipment and method for protection switching of backbone optical path

Patent Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6678369B2 (en) * 2000-06-09 2004-01-13 Nms Communications Corporation Network interface redundancy
CN1518234A (en) * 2003-01-17 2004-08-04 深圳市中兴通讯股份有限公司上海第二 Centralized control and hierarchical implementing switching control method and device
CN1747585A (en) * 2004-09-10 2006-03-15 华为技术有限公司 Method and device for realizing service processing unit backup in intelligent network system
CN1750419A (en) * 2004-09-16 2006-03-22 华为技术有限公司 Method and device for realizing single board backup in communication network device
CN1764080A (en) * 2005-11-15 2006-04-26 中兴通讯股份有限公司 Device and method for realizing ASC
CN1852456A (en) * 2005-11-30 2006-10-25 华为技术有限公司 System for realizing business switch-over and method therefor
CN101110862A (en) * 2006-07-18 2008-01-23 中兴通讯股份有限公司 Speech processing system implementing method
CN101150361A (en) * 2006-09-20 2008-03-26 北京格林威尔科技发展有限公司 A method and system for realizing fully protective switching in passive optical network
CN101212723A (en) * 2006-12-27 2008-07-02 大唐移动通信设备有限公司 Cluster network system and failure attenuation method on the basis of the network system
CN102130718A (en) * 2010-01-20 2011-07-20 中兴通讯股份有限公司 Network element equipment and method for protection switching of backbone optical path
CN102045259A (en) * 2010-12-30 2011-05-04 上海顶竹通讯技术有限公司 Packet switching equipment and method for managing customer service

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104200148A (en) * 2014-08-25 2014-12-10 浪潮集团有限公司 Smart card redundancy switching method based on independently domestic redundancy server
CN104200148B (en) * 2014-08-25 2017-06-09 浪潮集团有限公司 A kind of smart card redundancy switching method based on autonomous domestic redundant server
CN106656530A (en) * 2015-07-27 2017-05-10 中兴通讯股份有限公司 Method and device for generating Operation, Administration and Maintenance alarm
WO2016165465A1 (en) * 2015-09-29 2016-10-20 中兴通讯股份有限公司 Network management method, emergency system and storage medium
CN106559822A (en) * 2015-09-29 2017-04-05 中兴通讯股份有限公司 A kind of network management and emergency system

Also Published As

Publication number Publication date
CN103139066B (en) 2017-04-05

Similar Documents

Publication Publication Date Title
US20180143854A1 (en) Methods, systems and apparatus to perform a workflow in a software defined data center
US10999139B2 (en) Online upgrade method, apparatus, and system
CN102708018B (en) Method and system for exception handling, proxy equipment and control device
CN102355369B (en) Virtual clustered system as well as processing method and processing device thereof
US9558082B2 (en) VM availability during migration and VM network failures in host computing systems
CN111641515B (en) VNF life cycle management method and device
US9703652B2 (en) VM and host management function availability during management network failure in host computing systems in a failover cluster
CN105159798A (en) Dual-machine hot-standby method for virtual machines, dual-machine hot-standby management server and system
CN112865992B (en) Method and device for switching master nodes in distributed master-slave system and computer equipment
CN112865993B (en) Method and device for switching slave nodes in distributed master-slave system
US20210406127A1 (en) Method to orchestrate a container-based application on a terminal device
US7657734B2 (en) Methods and apparatus for automatically multi-booting a computer system
CN111857951A (en) Containerized deployment platform and deployment method
CN104503861A (en) Abnormality handling method and system, agency device and control device
US20200133728A1 (en) Data based scheduling for horizontally scalable clusters
CN103139066B (en) A kind of changing method of resource, main control device and node device
WO2018123456A1 (en) Programmable controller, management device, and control system
CN115373799A (en) Cluster management method and device and electronic equipment
CN109032674B (en) Multi-process management method, system and network equipment
WO2017124829A1 (en) Method and device for restoring virtual machine
JP2017027166A (en) Operation management unit, operation management program, and information processing system
CN115277398A (en) Cluster network configuration method and device
CN115202839A (en) Task scheduling method and system based on container
CN116360865A (en) Cluster management method, device and computing system
CN114157569A (en) Cluster system and construction method and construction device thereof

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant