CN110737521B - Disaster recovery method and device based on task scheduling center - Google Patents

Publication number
CN110737521B
Authority
CN
China
Prior art keywords
task scheduling
management
center
task
control system
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201910976026.2A
Other languages
Chinese (zh)
Other versions
CN110737521A (en)
Inventor
朱亮
吕丽红
运朝青
范丹丹
郑军
王良
尹曙明
李强
高坚
娄珂珂
毛军礼
王兆欣
王志浩
霍永华
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
32039 Unit Of Chinese Pla
CETC 54 Research Institute
Original Assignee
32039 Unit Of Chinese Pla
CETC 54 Research Institute
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 32039 Unit Of Chinese Pla, CETC 54 Research Institute filed Critical 32039 Unit Of Chinese Pla
Priority to CN201910976026.2A priority Critical patent/CN110737521B/en
Publication of CN110737521A publication Critical patent/CN110737521A/en
Application granted granted Critical
Publication of CN110737521B publication Critical patent/CN110737521B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/48Program initiating; Program switching, e.g. by interrupt
    • G06F9/4806Task transfer initiation or dispatching
    • G06F9/4843Task transfer initiation or dispatching by program, e.g. task dispatcher, supervisor, operating system
    • G06F9/4881Scheduling strategies for dispatcher, e.g. round robin, multi-level priority queues
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/14Error detection or correction of the data by redundancy in operation
    • G06F11/1402Saving, restoring, recovering or retrying
    • G06F11/1446Point-in-time backing up or restoration of persistent data
    • G06F11/1448Management of the data involved in backup or backup restore
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5005Allocation of resources, e.g. of the central processing unit [CPU] to service a request
    • G06F9/5011Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resources being hardware resources other than CPUs, Servers and Terminals
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5005Allocation of resources, e.g. of the central processing unit [CPU] to service a request
    • G06F9/5027Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals

Abstract

The invention provides a disaster recovery method and device based on a task scheduling center, relating to the technical field of disaster recovery backup. The method is applied to an arbitration center and comprises: obtaining user application information, which represents the task information to be executed by a task scheduling center; performing state detection on a plurality of task scheduling centers and selecting from them a first target task scheduling center whose running state is normal; and sending the user application information to the first target task scheduling center, so that the first target task scheduling center obtains a task planning scheme according to the user application information and the current resource condition. In this way, when a measurement and control system that places high demands on operation data fails, disaster recovery switching can be performed, measurement and control of the operation data continues normally, and disaster recovery reliability is ensured.

Description

Disaster recovery method and device based on task scheduling center
Technical Field
The invention relates to the technical field of disaster recovery backup, in particular to a disaster recovery method and device based on a task scheduling center.
Background
The traditional disaster recovery backup means of the relay satellite measurement and control management system are outdated and struggle to meet the requirements for survivability of relay satellite measurement and control management and for system-level disaster recovery backup.
In addition, disaster recovery backup includes device level disaster recovery and system level disaster recovery. The current device-level data backup can only solve the backup problem of single device data, while the system-level disaster recovery backup generally realizes data backup through software, and can only perform switching work when a system fails to work to realize disaster recovery, but cannot ensure real-time backup of operation data, and cannot ensure normal operation of measurement and control management work in applications based on huge operation data, such as relay satellite measurement and control management.
Disclosure of Invention
The invention aims to provide a disaster recovery method and a disaster recovery device based on a task scheduling center, which can switch disaster recovery when a measurement and control system with higher requirement on operation data fails, normally measure and control the operation data and ensure the reliability of the disaster recovery.
In a first aspect, an embodiment of the present invention provides a disaster recovery method based on a task scheduling center, which is applied to an arbitration center, and includes:
acquiring user application information, wherein the user application information is used for representing task information executed by a task scheduling center;
selecting a first target task scheduling center with a normal running state from the plurality of task scheduling centers by performing state detection on the plurality of task scheduling centers;
and sending the user application information to the first target task scheduling center so that the first target task scheduling center obtains a task planning scheme according to the user application information and the current resource condition.
With reference to the first aspect, an embodiment of the present invention provides a first possible implementation manner of the first aspect, where the method further includes:
and if the running states of the plurality of task scheduling centers are all normal, selecting from the plurality of task scheduling centers, according to the distribution of first historical tasks, a second target task scheduling center to which fewer first historical tasks have been allocated, and sending the user application information to the second target task scheduling center.
With reference to the first aspect, an embodiment of the present invention provides a second possible implementation manner of the first aspect, where the method further includes:
receiving a link establishment signal;
and selecting from the plurality of task scheduling centers, according to the distribution of second historical tasks, a third target task scheduling center to which fewer second historical tasks have been allocated, so that the third target task scheduling center establishes a link.
With reference to the first aspect, an embodiment of the present invention provides a third possible implementation manner of the first aspect, where the method further includes:
receiving a control information uploading signal;
and selecting, according to the distribution of third historical tasks, a fourth target task scheduling center to which fewer third historical tasks have been allocated, so that the user application information is uploaded to the fourth target task scheduling center.
With reference to the first aspect, an embodiment of the present invention provides a fourth possible implementation manner of the first aspect, where performing state detection on a plurality of task scheduling centers to select a first target task scheduling center with a normal operating state from the plurality of task scheduling centers includes:
sending heartbeat detection signals to a plurality of task scheduling centers in real time;
receiving feedback signals of a plurality of task scheduling centers;
and selecting a first target task scheduling center with a normal operation state from the plurality of task scheduling centers according to the feedback signal.
In a second aspect, an embodiment of the present invention further provides a disaster recovery method based on a task scheduling center, which is applied to a first task scheduling center, and includes:
receiving user application information sent by an arbitration center;
and obtaining a task planning scheme according to the user application information and the current resource condition, and sending the task planning scheme to a management and control system so that the management and control system loads and configures according to the task planning scheme.
With reference to the second aspect, an embodiment of the present invention provides a first possible implementation manner of the second aspect, where the method further includes:
and synchronizing application data and/or management and control information to a second task scheduling center so that the second task scheduling center continues to execute a current task according to the synchronized application data and/or management and control information, wherein the application data is used for completing the current task obtained according to the user application information, and the management and control information comprises current resource occupation conditions and equipment operation information in the management and control system.
With reference to the second aspect, an embodiment of the present invention provides a second possible implementation manner of the second aspect, where obtaining a task planning scheme according to the user application information and the current resource situation, and sending the task planning scheme to a management and control system includes:
acquiring the current available resource condition;
obtaining a resource pre-allocation scheme according to the current available resource condition and the user application information;
and performing task planning on the current resources according to the resource pre-allocation scheme to obtain a task planning scheme, and sending the task planning scheme to a management and control system.
In a third aspect, an embodiment of the present invention provides a disaster recovery device based on a task scheduling center, where the disaster recovery device applied to an arbitration center includes:
the acquisition module is used for acquiring user application information, wherein the user application information is used for representing task information executed by the task scheduling center;
the selection module is used for selecting a first target task scheduling center with a normal running state from the plurality of task scheduling centers by carrying out state detection on the plurality of task scheduling centers;
and the sending module is used for sending the user application information to the first target task scheduling center so that the first target task scheduling center obtains a task planning scheme according to the user application information and the current resource condition.
In a fourth aspect, an embodiment of the present invention provides a disaster recovery device based on a task scheduling center, where the disaster recovery device applied to a first task scheduling center includes:
the receiving module is used for receiving the user application information sent by the arbitration center;
and the planning module is used for obtaining a task planning scheme according to the user application information and the current resource condition, and sending the task planning scheme to a management and control system so that the management and control system loads and configures according to the task planning scheme.
The embodiment of the invention provides a disaster recovery method and a disaster recovery device based on a task scheduling center, wherein an arbitration center acquires task information required to be executed by the task scheduling center through user application information, performs state detection on a plurality of task scheduling centers, can select a first target task scheduling center in a normal operation state when a measurement and control system with higher requirement on operation data has a fault, and transmits the user application information to the first target task scheduling center to perform normal measurement and control on relay satellite operation data so as to ensure disaster recovery reliability.
Additional features and advantages of the invention will be set forth in the description which follows, and in part will be obvious from the description, or may be learned by practice of the invention. The objectives and other advantages of the invention will be realized and attained by the structure particularly pointed out in the written description and drawings.
In order to make the aforementioned and other objects, features and advantages of the present invention comprehensible, preferred embodiments accompanied with figures are described in detail below.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, and it is obvious that the drawings in the following description are some embodiments of the present invention, and other drawings can be obtained by those skilled in the art without creative efforts.
Fig. 1 is a schematic flowchart of a disaster recovery method based on a task scheduling center according to an embodiment of the present invention;
fig. 2 is a schematic flow chart of another disaster recovery method based on a task scheduling center according to an embodiment of the present invention;
fig. 3 is a schematic flowchart of another disaster recovery method based on a task scheduling center according to an embodiment of the present invention;
fig. 4 is a flowchart of a disaster recovery method based on a task scheduling center according to an embodiment of the present invention;
fig. 5 is a schematic functional module diagram of a disaster recovery device based on a task scheduling center according to an embodiment of the present invention;
fig. 6 is a schematic functional module diagram of another disaster recovery device based on a task scheduling center according to an embodiment of the present invention.
Detailed Description
To make the objects, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions of the present invention will be clearly and completely described below with reference to the accompanying drawings, and it is apparent that the described embodiments are some, but not all embodiments of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
At present, in the field of relay satellite measurement and control, simple data-level backup cannot meet disaster recovery requirements, and system-level backup cannot cope with the complexity of the measurement and control data in this field, so disaster recovery reliability cannot be guaranteed.
Based on this, the disaster recovery method and apparatus based on the task scheduling center provided by the embodiments of the present invention can switch disaster recovery when the measurement and control system with higher requirement on the operation data has a fault, normally measure and control the operation data, and ensure the reliability of disaster recovery.
To facilitate understanding of the embodiment, a disaster recovery method based on a task scheduling center disclosed in the embodiment of the present invention is first described in detail.
Fig. 1 is a schematic flowchart of a disaster recovery method based on a task scheduling center according to an embodiment of the present invention.
Referring to fig. 1, a disaster recovery method based on a task scheduling center, which can be implemented in the field of relay satellite measurement and control, is applied to an arbitration center, and includes the following steps:
Step S102, obtaining user application information, where the user application information represents the task information to be executed by the task scheduling center.
Here, the user can view the current resource situation through the space window of the terminal and, based on it, submit user application information, which is sent to the arbitration center.
And step S104, selecting a first target task scheduling center with a normal running state from the plurality of task scheduling centers by performing state detection on the plurality of task scheduling centers.
And step S106, sending the user application information to a first target task scheduling center so that the first target task scheduling center obtains a task planning scheme according to the user application information and the current resource condition.
In a preferred embodiment of practical application, the arbitration center obtains task information required to be executed by the task scheduling centers through user application information, performs state detection on a plurality of task scheduling centers, can select a first target task scheduling center in a normal operation state when a measurement and control system with higher requirement on operation data has a fault, and sends the user application information to the first target task scheduling center to perform normal measurement and control on the operation data of the relay satellite, thereby ensuring disaster tolerance reliability.
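Steps S102 to S106 can be illustrated with a minimal sketch. The class `SchedulingCenter`, its `is_normal` and `receive` methods, and the function `dispatch` are hypothetical names introduced here for illustration only; the patent does not specify an interface, nor what happens when no center is in a normal state.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class SchedulingCenter:
    """Hypothetical stand-in for a task scheduling center."""
    name: str
    healthy: bool = True
    received: List[str] = field(default_factory=list)

    def is_normal(self) -> bool:
        # Stands in for the state detection of step S104.
        return self.healthy

    def receive(self, application: str) -> None:
        # Stands in for receiving the user application information (step S106).
        self.received.append(application)


def dispatch(application: str,
             centers: List[SchedulingCenter]) -> Optional[SchedulingCenter]:
    """Send the application to the first center whose running state is normal."""
    for center in centers:
        if center.is_normal():
            center.receive(application)
            return center
    # Assumption: the source does not describe the case where every center is abnormal.
    return None


centers = [SchedulingCenter("A", healthy=False), SchedulingCenter("B")]
target = dispatch("example user application", centers)
```

In this sketch the failed center "A" is skipped and the application reaches center "B", which is the switching behavior the embodiment describes.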
Because the relay satellite measurement and control system handles a large amount of operation data, and in order to avoid placing excessive pressure on any one task scheduling center and thereby causing slow operation or delay, the embodiment of the present invention further includes the following method, as shown in fig. 2:
step S202, the state of a plurality of task scheduling centers is detected.
Step S204, judging whether the plurality of task scheduling centers have abnormal operating states.
If so, the process goes to step S206.
If not, the process goes to step S208.
Step S206, selecting a first target task scheduling center with a normal operation state from the plurality of task scheduling centers, and sending the user application information to the first target task scheduling center.
Step S208, selecting, according to the distribution of first historical tasks, a second target task scheduling center to which fewer first historical tasks have been allocated, and sending the user application information to the second target task scheduling center.
For example, if the arbitration center has historically allocated 2 tasks to the first task scheduling center and 3 tasks to the second task scheduling center, the current task is allocated to the first task scheduling center, which has been allocated fewer historical tasks.
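The branch in fig. 2 can be sketched as a single selection function. The name `select_center` and the dictionary-based inputs are assumptions made for illustration; when some centers are abnormal, the sketch simply takes the first normal one, since the source does not say how to choose among several normal centers in that branch.

```python
from typing import Dict


def select_center(states: Dict[str, bool], history: Dict[str, int]) -> str:
    """Pick a task scheduling center following the S204/S206/S208 branch.

    states  maps center name -> True if its running state is normal.
    history maps center name -> number of historically allocated tasks.
    """
    normal = [name for name, ok in states.items() if ok]
    if not normal:
        # Assumption: the source does not cover the case of no normal center.
        raise RuntimeError("no task scheduling center is in a normal state")
    if len(normal) < len(states):
        # S206: at least one center is abnormal, so use a normal one.
        return normal[0]
    # S208: all centers are normal, so prefer the one with fewer historical tasks.
    return min(normal, key=lambda name: history.get(name, 0))


# The numbers from the example: 2 historical tasks versus 3.
chosen = select_center(
    {"center_1": True, "center_2": True},
    {"center_1": 2, "center_2": 3},
)
```

With these inputs `chosen` is `"center_1"`, matching the worked example in the text.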
In order to further reduce the pressure on the task scheduling centers, the embodiment of the present invention may also allocate according to each center's pressure (its historical task allocation and execution) when a link is established and/or when management and control information is uploaded.
That is, the embodiment of the present invention may be implemented when the management and control system establishes the link:
receiving a link establishment signal; and selecting a third target task scheduling center with a small second historical task number from the plurality of task scheduling centers according to the distribution condition of the second historical tasks, so that the third target task scheduling center establishes a link.
The management and control system also establishes a satellite-ground link and a ground link to a third target task scheduling center according to the planning scheme, and simultaneously sends configuration loading results according to the planning scheme and link establishment results to the third target task scheduling center.
When the management and control system uploads management and control information, the embodiment of the present invention may also perform the following:
receiving a control information uploading signal; and selecting, according to the distribution of third historical tasks, a fourth target task scheduling center to which fewer third historical tasks have been allocated, so that the user application information is uploaded to the fourth target task scheduling center.
It should be noted that, in the embodiment of the present invention, the task scheduling center that performs the work may be selected according to the distribution of historical tasks both when a link is established and when management and control information is uploaded; alternatively, a task scheduling center under low pressure may be selected only when a link is established, or only when management and control information is uploaded.
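One way to read the first, second and third historical tasks is as separate tallies kept per kind of request. The sketch below follows that reading, which is an assumption; the class `Arbiter`, the kind labels `"application"`, `"link"` and `"upload"`, and the tie-breaking by list order are all hypothetical.

```python
from collections import Counter
from typing import Dict, Iterable, List


class Arbiter:
    """Tracks, per request kind, how many tasks each center has been given."""

    KINDS = ("application", "link", "upload")

    def __init__(self, center_names: Iterable[str]) -> None:
        self.center_names: List[str] = list(center_names)
        # One independent tally per kind of historical task (assumed reading).
        self.history: Dict[str, Counter] = {kind: Counter() for kind in self.KINDS}

    def assign(self, kind: str) -> str:
        """Return the center with the fewest historical tasks of this kind."""
        counts = self.history[kind]
        target = min(self.center_names, key=lambda name: counts[name])
        counts[target] += 1  # the new task becomes part of the history
        return target


arbiter = Arbiter(["A", "B"])
first_link = arbiter.assign("link")     # both tallies are 0, so the first listed wins
second_link = arbiter.assign("link")    # "A" now has 1 link task, so "B" is chosen
first_upload = arbiter.assign("upload") # the upload tally is independent of links
```

Keeping the tallies separate lets the arbitration center balance link establishment and information uploading independently, which matches the note that either selection may be used on its own.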
As an alternative embodiment, in order to more accurately detect the state of the task scheduling center, step S104 in the foregoing embodiment further includes the following steps:
1. Sending heartbeat detection signals to the plurality of task scheduling centers in real time.
2. Receiving feedback signals from the plurality of task scheduling centers.
3. Determining the running states of the plurality of task scheduling centers according to the feedback signals, and then selecting from them a first target task scheduling center with a normal running state.
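The three steps above can be sketched with a timestamp-based monitor. The source does not say how a feedback signal is judged, so treating a center as normal when its latest feedback arrived within a timeout is an assumption; `HeartbeatMonitor`, `record_feedback`, `normal_centers` and the 3-second default are hypothetical.

```python
from typing import Dict, Iterable, List


class HeartbeatMonitor:
    """Judges center state from the time of the most recent feedback signal."""

    def __init__(self, center_names: Iterable[str], timeout_s: float = 3.0) -> None:
        self.timeout_s = timeout_s
        # A center that has never answered is treated as infinitely stale.
        self.last_feedback: Dict[str, float] = {
            name: float("-inf") for name in center_names
        }

    def record_feedback(self, name: str, now: float) -> None:
        """Step 2: note when a center answered a heartbeat detection signal."""
        self.last_feedback[name] = now

    def normal_centers(self, now: float) -> List[str]:
        """Step 3: centers whose latest feedback is within the timeout."""
        return [
            name
            for name, seen in self.last_feedback.items()
            if now - seen <= self.timeout_s
        ]


monitor = HeartbeatMonitor(["A", "B", "C"])
monitor.record_feedback("A", now=10.0)
monitor.record_feedback("B", now=5.0)
alive = monitor.normal_centers(now=11.0)
```

Passing the clock value explicitly keeps the sketch deterministic; a real arbitration center would take it from a monotonic clock and send the heartbeat signals over its own transport.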
Further, as shown in fig. 3, an embodiment of the present invention further provides a disaster recovery method based on a task scheduling center, which is applied to a first task scheduling center, and includes the following steps:
step S302, receiving user application information sent by an arbitration center;
and step S304, obtaining a task planning scheme according to the user application information and the current resource condition, and sending the task planning scheme to the management and control system so that the management and control system loads and configures according to the task planning scheme.
It should be noted that this method assumes that the first task scheduling center is the one selected from the plurality of task scheduling centers as the first target task scheduling center.
Further, to ensure that work can be switched seamlessly to a task scheduling center in good condition when the current task scheduling center becomes abnormal or is under heavy load, and that the large volume of relay satellite measurement and control processes and data can be handled in time, the method further includes:
and synchronizing the application data and the management and control information to a second task scheduling center so that the second task scheduling center continues to execute the current task according to the synchronized application data and/or the management and control information.
Alternatively,
the method further comprises the following steps:
and synchronizing the application data or the management and control information to the second task scheduling center so that the second task scheduling center continues to execute the current task according to the synchronized application data and/or the management and control information.
The application data is used for completing a current task obtained according to the user application information, and the control information comprises a current resource occupation condition and equipment operation information in the control system.
Therefore, the first task scheduling center serving as the target task scheduling center sends the application data to the management and control center and synchronizes the application data to the second task scheduling center. As an optional implementation manner, when the management and control system uploads the management and control information to the target task scheduling center, the target task scheduling center may also synchronize the management and control information to the second task scheduling center, so as to implement seamless handover.
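The synchronization between the first and second task scheduling centers can be sketched as a push of state on every update. All names here (`CenterState`, `TaskSchedulingCenter`, `update`, `apply_sync`, `resume_task`, and the `"current_task"` key) are hypothetical, and the in-process method call stands in for whatever transport an actual system would use.

```python
import copy
from dataclasses import dataclass, field
from typing import Any, Dict, List, Optional


@dataclass
class CenterState:
    application_data: Dict[str, Any] = field(default_factory=dict)
    control_info: Dict[str, Any] = field(default_factory=dict)


class TaskSchedulingCenter:
    def __init__(self, name: str) -> None:
        self.name = name
        self.state = CenterState()
        self.peers: List["TaskSchedulingCenter"] = []

    def update(self,
               application_data: Optional[Dict[str, Any]] = None,
               control_info: Optional[Dict[str, Any]] = None) -> None:
        """Apply new data locally, then synchronize it to every peer."""
        if application_data:
            self.state.application_data.update(application_data)
        if control_info:
            self.state.control_info.update(control_info)
        for peer in self.peers:
            # A deep copy keeps the peer's state independent of ours.
            peer.apply_sync(copy.deepcopy(self.state))

    def apply_sync(self, state: CenterState) -> None:
        self.state = state

    def resume_task(self) -> Optional[str]:
        """What a peer would continue executing after a switch."""
        return self.state.application_data.get("current_task")


primary = TaskSchedulingCenter("first")
standby = TaskSchedulingCenter("second")
primary.peers.append(standby)
primary.update(application_data={"current_task": "T1"},
               control_info={"channel C": "occupied"})
```

After the update, `standby.resume_task()` returns the same current task as the primary, so the second center can continue the work if the first becomes abnormal.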
In the relay satellite measurement and control field, the management and control system may include an operation and control system, a ground station equipment management and control system, and a user center scheduling and control system. The corresponding management and control information is the device operation information and current resource condition of each such system. For example, the ground station equipment management and control system and the user center scheduling management and control system each upload the equipment fault information, equipment operation parameter information and equipment performance information of their own stations, and the operation and control system uploads the actual resource occupation.
In some possible embodiments, the embodiments of the present invention are applied to the relay satellite measurement and control application field, and step S304 includes the following steps:
1. acquiring the current available resource condition;
2. obtaining a resource pre-allocation scheme according to the current available resource condition and the user application information;
3. and performing task planning on the current resources according to the resource pre-allocation scheme to obtain a task planning scheme, and sending the task planning scheme to a management and control system.
Here, the planning and scheduling system in the task scheduling center generates a user requirement, that is, a requirement situation of a user for a resource, according to the user application information, requests a current available resource situation from a resource management system in the task scheduling center, and obtains a resource pre-allocation scheme for the user requirement according to the current available resource situation and the user requirement. And the task scheduling center coordinates resources according to a resource pre-allocation scheme and issues and updates the coordinated current resource condition to the resource management system. And the planning and scheduling system combines the user application and the current resource condition to carry out task planning to obtain a task planning scheme.
The resource pre-allocation scheme can be presented as a table on a system page with columns for task time, type of resource to be used, resource name, and task content; for example, at 9 o'clock, satellite A is used to measure and control data B, and channel C is used for transmission. The task planning scheme can be understood as the allocation of the resources obtained from the resource pre-allocation scheme, such as x satellites and y channels, to z systems or pieces of equipment.
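A resource pre-allocation table of this shape can be sketched as a list of rows. The row fields follow the four columns named in the text; the function `pre_allocate`, the first-fit matching rule, and the sample times and names are assumptions made for illustration and are not the patent's planning algorithm.

```python
from dataclasses import dataclass
from typing import Dict, List, Set, Tuple


@dataclass(frozen=True)
class PreAllocationRow:
    task_time: str
    resource_type: str
    resource_name: str
    task_content: str


def pre_allocate(demands: List[Tuple[str, str, str]],
                 available: Dict[str, List[str]]) -> List[PreAllocationRow]:
    """First-fit matching of user demands against currently available resources.

    demands   is a list of (task_time, resource_type, task_content).
    available maps resource_type -> names of currently available resources.
    """
    taken: Set[Tuple[str, str]] = set()  # (task_time, resource_name) already in use
    rows: List[PreAllocationRow] = []
    for task_time, resource_type, content in demands:
        for name in available.get(resource_type, []):
            if (task_time, name) not in taken:
                taken.add((task_time, name))
                rows.append(PreAllocationRow(task_time, resource_type, name, content))
                break
        else:
            # No free resource of this type in the requested slot.
            raise ValueError(f"no free {resource_type} at {task_time}")
    return rows


table = pre_allocate(
    [("09:00", "satellite", "measure and control data B"),
     ("09:00", "channel", "transmit data B")],
    {"satellite": ["satellite A"], "channel": ["channel C"]},
)
```

Each row of `table` corresponds to one line of the system page table; a task planning step would then assign these rows to the systems or equipment that carry them out.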
In addition, the planning and scheduling system in the target task scheduling center can issue the planning scheme to the corresponding ground station equipment management and control system, operation and control system, user center scheduling management and control system and the rest task scheduling center, so that each management and control system can obtain various corresponding planning parameters. Planning parameters may be understood as a form of a system page table including parameter types such as IP addresses of individual devices that may be modified according to the planned resources.
And the task scheduling center sends a task loading command to the corresponding system in a task period, so that the corresponding control system performs parameter configuration and loading according to the loading command and the planning parameters respectively.
Here, an embodiment of the present invention further provides a disaster recovery method based on a task scheduling center, as shown in fig. 4, including:
step S401, the arbitration center sends heartbeat detection signals to a plurality of task scheduling centers.
Step S402, the task scheduling centers send feedback signals to the arbitration center according to the heartbeat detection signals.
And step S403, the arbitration center selects a first target task scheduling center with a normal state according to the feedback signal.
Step S404, if the states of the plurality of task scheduling centers are normal, the arbitration center selects a second target task scheduling center which is allocated with a smaller number of first historical tasks according to the allocation condition of the first historical tasks.
Step S405, the arbitration center sends the user application to the first target task scheduling center or the second target task scheduling center.
Step S406, the task scheduling center (the first target task scheduling center or the second target task scheduling center) obtains a planning scheme according to the user application and the current resource situation.
In step S407, the task scheduling center (the first target task scheduling center or the second target task scheduling center) sends the planning scheme and the loading command to the management and control system.
The ground station equipment management and control system loads the received planning parameters into corresponding equipment or a software system for configuration according to planning resources in a task period; the operation control system carries out configuration adjustment on the relay satellite resources according to the received planning parameters in the task time period; and the other task scheduling centers load the received (synchronized) planning parameters (application data) into corresponding equipment or a software system for configuration according to the planning scheme in the task time period.
Step S408, the task scheduling center (the first target task scheduling center or the second target task scheduling center) synchronizes the application data to the other task scheduling centers.
Step S409, the management and control system loads and configures the planning parameters in the planning scheme according to the loading command.
Step S410, the management and control system sends a link establishment signal to the arbitration center.
Step S411, the arbitration center selects, according to the allocation of second historical tasks, a third target task scheduling center to which fewer second historical tasks have been allocated.
Step S412, the arbitration center sends a link establishment instruction to the management and control system.
Step S413, the management and control system establishes a link with the task scheduling center (the third target task scheduling center) according to the planning scheme.
Step S414, the management and control system sends a configuration completion signal to the task scheduling center (the third target task scheduling center) when link establishment and configuration are complete.
The remaining task scheduling centers, the ground station equipment management and control system and the operation and control system establish the satellite-ground link and the ground link according to the planning scheme, open the network, and feed the configuration result back to the planning and scheduling system of the task scheduling center.
Step S415, the management and control system sends a management and control information uploading signal to the arbitration center.
Step S416, the arbitration center selects, according to the allocation of third historical tasks, a fourth target task scheduling center to which fewer third historical tasks have been allocated.
Step S417, the arbitration center sends a management and control information uploading instruction to the management and control system.
Step S418, the management and control system uploads the management and control information to the task scheduling center (the fourth target task scheduling center).
Here, the management and control information includes a system operation situation, which is formed when the operation and control system feeds back the actual satellite resource occupation result to the integrated network management system in the task scheduling center. The integrated network management system distributes the system operation situation to the resource management system, which completes the collection of the relay satellite resource information actually used. The ground station equipment management and control system and the operation and control system report management information, such as equipment operation conditions, network operation performance information and user service data guarantee conditions, to the integrated network management system of the task scheduling center through the northbound interface to form the system operation situation.
The ground equipment management and control system can also report operation data to the task scheduling center, and the operation and control system feeds back the satellite resource monitoring condition to the task scheduling center.
The ground station equipment management and control system acquires the working state of the task planning link through network equipment such as optical transmission network equipment and a router, collects the working state of the link such as bandwidth occupancy rate, packet loss rate and transmission delay of the task guarantee link, and completes the unified management and control of local ground station equipment.
During task execution, the ground station equipment management and control system and the user center scheduling management and control system each collect equipment fault information, equipment operation parameter information and equipment performance information for the station where they are located. Alarm thresholds can also be set for some key equipment or working parameters so that early warnings are issued and the service quality of key services is guaranteed. Link thresholds are configured locally in each management and control system; when a link pressure value exceeds its threshold, an alarm is raised, or other operations such as manual link switching can be carried out, so that link stability is guaranteed.
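The local threshold check described above can be illustrated with a minimal Python sketch. The class names, field names and default threshold values below are hypothetical and chosen only for illustration; the source does not specify concrete metrics beyond bandwidth occupancy, packet loss rate and transmission delay, nor any numeric limits.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class LinkStatus:
    """Working state of one task guarantee link (hypothetical structure)."""
    bandwidth_occupancy: float  # fraction of link bandwidth in use, 0.0 to 1.0
    packet_loss_rate: float     # fraction of packets lost, 0.0 to 1.0
    delay_ms: float             # transmission delay in milliseconds


@dataclass
class LinkThresholds:
    """Locally configured alarm thresholds (values are illustrative assumptions)."""
    bandwidth_occupancy: float = 0.8
    packet_loss_rate: float = 0.01
    delay_ms: float = 500.0


def check_link(status: LinkStatus, thresholds: LinkThresholds) -> List[str]:
    """Return the names of all metrics that exceed their threshold."""
    alarms = []
    if status.bandwidth_occupancy > thresholds.bandwidth_occupancy:
        alarms.append("bandwidth_occupancy")
    if status.packet_loss_rate > thresholds.packet_loss_rate:
        alarms.append("packet_loss_rate")
    if status.delay_ms > thresholds.delay_ms:
        alarms.append("delay_ms")
    return alarms


if __name__ == "__main__":
    alarms = check_link(LinkStatus(0.9, 0.0, 100.0), LinkThresholds())
    if alarms:
        # A real system would raise an alarm or offer manual link switching here.
        print("ALARM:", ", ".join(alarms))
```

A non-empty result corresponds to the alarm case; what happens next (alarm only, or manual link switching) is left to the operator, as in the description.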
Step S419, the task scheduling center (fourth target task scheduling center) synchronizes the management and control information to the other task scheduling centers.
The method provided by the embodiment of the invention enables seamless service switching and provides active-active operation at the task scheduling center level as well as disaster recovery backup at the system level.
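The selection logic that the arbitration center applies in steps S401 to S405 (and again in steps S411 and S416) can be sketched in Python as follows. This is a simplified illustration, not the patented implementation: the function name and the dictionary-based inputs are hypothetical, and the sketch generalizes slightly by choosing the center with the fewest historical tasks among whichever centers are healthy, whereas the description applies the historical-task criterion when all centers are normal.

```python
from typing import Dict, Optional


def select_center(feedback: Dict[str, bool],
                  history_counts: Dict[str, int]) -> Optional[str]:
    """Pick a target task scheduling center.

    feedback       maps center id -> True if a normal heartbeat reply arrived.
    history_counts maps center id -> number of historical tasks already assigned.
    Returns the chosen center id, or None if no center is in a normal state.
    """
    # Keep only the centers whose heartbeat feedback indicates a normal state.
    healthy = [center for center, ok in feedback.items() if ok]
    if not healthy:
        return None
    # Among healthy centers, prefer the one with the fewest historical tasks;
    # ties are broken by center id so the result is deterministic.
    return min(healthy, key=lambda c: (history_counts.get(c, 0), c))


if __name__ == "__main__":
    # Center B has failed, so the request goes to A despite its higher load.
    print(select_center({"A": True, "B": False}, {"A": 5, "B": 1}))
    # Both centers are normal, so the less loaded center B is chosen.
    print(select_center({"A": True, "B": True}, {"A": 5, "B": 1}))
```

Because each request is routed through the same check, a failed center is simply skipped for the next request, which is how the sketch models the seamless switching described above.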
Further, as shown in fig. 5, an embodiment of the present invention further provides a disaster recovery device based on a task scheduling center, where the disaster recovery device applied to an arbitration center includes:
the acquisition module is used for acquiring user application information which is used for representing task information executed by the task scheduling center;
the selection module is used for selecting a first target task scheduling center with a normal running state from the plurality of task scheduling centers by carrying out state detection on the plurality of task scheduling centers;
and the sending module is used for sending the user application information to the first target task scheduling center so that the first target task scheduling center obtains a task planning scheme according to the user application information and the current resource condition.
Further, as shown in fig. 6, an embodiment of the present invention further provides a disaster recovery device based on a task scheduling center, where the disaster recovery device applied to a first task scheduling center includes:
the receiving module is used for receiving the user application information sent by the arbitration center;
and the planning module is used for obtaining a task planning scheme according to the user application information and the current resource condition, and sending the task planning scheme to the management and control system so that the management and control system loads and configures according to the task planning scheme.
The disaster recovery device based on the task scheduling center provided by the embodiment of the invention has the same technical characteristics as the disaster recovery method based on the task scheduling center provided by the embodiment, so that the same technical problems can be solved, and the same technical effects can be achieved.
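The receiving and planning modules of the device, together with the synchronization of application data to the other task scheduling centers in step S408, can be sketched in Python as follows. All class, method and field names are hypothetical, and the planning rule (grant the requested amount up to what is available) is a placeholder assumption standing in for the real task planning, which the source does not specify in detail.

```python
import copy
from typing import Dict, List


class SchedulingCenter:
    """Toy model of one task scheduling center (hypothetical structure)."""

    def __init__(self, name: str) -> None:
        self.name = name
        self.peers: List["SchedulingCenter"] = []
        # Application data held locally, keyed by application id.
        self.application_data: Dict[str, dict] = {}

    def plan(self, application: dict, resources: dict) -> dict:
        # Placeholder planning: grant what was requested, capped by availability.
        granted = min(application["requested"], resources["available"])
        return {"granted": granted}

    def synchronize(self, app_id: str) -> None:
        # Copy the application data to every other center so that any of them
        # can continue the current task if this center fails.
        for peer in self.peers:
            peer.application_data[app_id] = copy.deepcopy(
                self.application_data[app_id])

    def receive(self, app_id: str, application: dict, resources: dict) -> dict:
        # Receiving module: accept the user application from the arbitration
        # center, plan it, store the result and synchronize it to the peers.
        plan = self.plan(application, resources)
        self.application_data[app_id] = {"application": application, "plan": plan}
        self.synchronize(app_id)
        return plan


if __name__ == "__main__":
    a, b = SchedulingCenter("A"), SchedulingCenter("B")
    a.peers.append(b)
    a.receive("app-1", {"requested": 3}, {"available": 2})
    # Center B now holds the same plan and could take over the task.
    print(b.application_data["app-1"]["plan"])
```

The deep copy keeps each center's data independent, so a later change or failure at one center does not corrupt the state held by the others.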
The computer program product of the disaster recovery method and apparatus based on the task scheduling center provided in the embodiments of the present invention includes a computer-readable storage medium storing a program code, where instructions included in the program code may be used to execute the method described in the foregoing method embodiments, and specific implementations may refer to the method embodiments and are not described herein again.
It is clear to those skilled in the art that, for convenience and brevity of description, the specific working processes of the system and the apparatus described above may refer to the corresponding processes in the foregoing method embodiments, and are not described herein again.
In addition, in the description of the embodiments of the present invention, unless otherwise explicitly specified or limited, the terms "mounted," "coupled," and "connected" are to be construed broadly, e.g., as meaning a fixed connection, a removable connection, or an integral connection; the connection may be mechanical or electrical; and elements may be connected directly, connected indirectly through intervening media, or communicate internally with each other. The specific meanings of the above terms in the present invention can be understood by those skilled in the art according to the specific circumstances.
The functions, if implemented in the form of software functional units and sold or used as a stand-alone product, may be stored in a computer-readable storage medium. Based on such understanding, the technical solution of the present invention may be embodied in the form of a software product, which is stored in a storage medium and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device) to execute all or part of the steps of the method according to the embodiments of the present invention. The aforementioned storage medium includes a USB flash drive, a removable hard disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk, an optical disk, and various other media capable of storing program code.
In the description of the present invention, it should be noted that the terms "center", "upper", "lower", "left", "right", "vertical", "horizontal", "inner", "outer", etc., indicate orientations or positional relationships based on the orientations or positional relationships shown in the drawings, and are only for convenience of description and simplicity of description, but do not indicate or imply that the device or element being referred to must have a particular orientation, be constructed and operated in a particular orientation, and thus, should not be construed as limiting the present invention. Furthermore, the terms "first," "second," and "third" are used for descriptive purposes only and are not to be construed as indicating or implying relative importance.
The embodiment of the present invention further provides an electronic device, which includes a memory, a processor, and a computer program that is stored in the memory and can be run on the processor, and when the processor executes the computer program, the steps of the disaster recovery method based on the task scheduling center provided in the foregoing embodiment are implemented.
The embodiment of the present invention further provides a computer-readable storage medium, where a computer program is stored on the computer-readable storage medium, and when the computer program is executed by a processor, the steps of the disaster recovery method based on the task scheduling center according to the above embodiment are executed.
Finally, it should be noted that the above embodiments are only specific embodiments of the present invention, used to illustrate its technical solutions rather than to limit them, and the protection scope of the present invention is not limited thereto. Although the present invention has been described in detail with reference to the foregoing embodiments, those skilled in the art should understand that anyone skilled in the art may, within the technical scope disclosed herein, still modify the technical solutions described in the foregoing embodiments, readily conceive of changes to them, or make equivalent substitutions for some of their technical features. Such modifications, changes or substitutions do not cause the essence of the corresponding technical solutions to depart from the spirit and scope of the embodiments of the present invention, and they should be construed as falling within the protection scope of the present invention.

Claims (7)

1. A disaster recovery method based on a task scheduling center is characterized by being applied to the field of disaster recovery backup of a relay satellite measurement and control management system and comprising the following steps:
the arbitration center acquires user application information, wherein the user application information is used for representing task information executed by the task scheduling center; a user acquires the current resource condition through a space window of a terminal, provides the user application information according to the current resource condition and sends the user application information to the arbitration center;
the arbitration center performs state detection on the plurality of task scheduling centers to select a first target task scheduling center with a normal operation state from the plurality of task scheduling centers;
the arbitration center sends the user application information to the first target task scheduling center so that the first target task scheduling center obtains a task planning scheme according to the user application information and the current resource condition;
a first task scheduling center in the task scheduling centers receives user application information sent by an arbitration center;
obtaining a task planning scheme according to the user application information and the current resource condition, and sending the task planning scheme to a management and control system so that the management and control system loads configuration according to the task planning scheme;
synchronizing application data and/or management and control information to a second task scheduling center so that the second task scheduling center continues to execute a current task according to the synchronized application data and/or management and control information, wherein the application data is used for completing the current task obtained according to the user application information, and the management and control information comprises current resource occupation conditions and equipment operation information in a management and control system;
in the relay satellite measurement and control field, a management and control system comprises an operation and control system, a ground station equipment management and control system and a user center scheduling and control system, wherein corresponding management and control information is corresponding equipment operation information and current resource conditions in the management and control system respectively;
the management and control information also comprises a system operation situation formed by the operation and control system feeding back the actual satellite resource occupation result to the comprehensive network management system in the task scheduling center; the comprehensive network management system distributes the system operation situation to a resource management system to complete the collection of actually used relay satellite resource information; the ground station equipment management and control system and the operation and control system report the equipment operation condition, the network operation performance information and the user service data guarantee condition to the comprehensive network management system of the task scheduling center through a northbound interface to form a system operation situation;
the ground equipment management and control system can also report operation data to the task scheduling center, and the operation and control system feeds back the satellite resource monitoring condition to the task scheduling center;
the ground station equipment management and control system acquires the working state of a task planning link through optical transmission network equipment and a router, collects the bandwidth occupancy rate, the packet loss rate and the transmission delay of a task guarantee link, and completes the unified management and control of local ground station equipment;
in the task execution process, the ground station equipment management and control system and the user center scheduling management and control system respectively collect equipment fault information, equipment operation parameter information and equipment performance information of the station where the ground station equipment management and control system and the user center scheduling management and control system are located, and meanwhile, alarm thresholds can be set for part of key equipment or working parameters to perform early warning, so that the service quality of key services is ensured; link threshold values are arranged locally in each management and control system, when the link pressure values exceed the threshold values, alarming is carried out, or other operations such as manual link switching can be carried out, so that the stability of the links is guaranteed.
2. The task scheduling center-based disaster recovery method according to claim 1, wherein the method further comprises:
and if the running states of the plurality of task scheduling centers are normal, the arbitration center selects a second target task scheduling center with less quantity distribution of the first historical tasks from the plurality of task scheduling centers according to the distribution condition of the first historical tasks, and sends the user application information to the second target task scheduling center.
3. The disaster recovery method based on task scheduling center according to claim 1 or 2, wherein the method further comprises:
the arbitration center receives a link establishment signal;
and the arbitration center selects a third target task scheduling center with a small second historical task quantity from the plurality of task scheduling centers according to the distribution condition of the second historical tasks, so that the third target task scheduling center establishes a link.
4. The task scheduling center-based disaster recovery method according to claim 3, wherein the method further comprises:
the arbitration center receives a control information uploading signal;
and the arbitration center selects a fourth target task scheduling center with less quantity distribution of the third history tasks according to the distribution condition of the third history tasks, so that the user application information is uploaded to the fourth target task scheduling center.
5. The disaster recovery method based on task scheduling center according to claim 1, wherein the step of selecting a first target task scheduling center with a normal operation state from the plurality of task scheduling centers by performing state detection on the plurality of task scheduling centers comprises:
the arbitration center sends heartbeat detection signals to the plurality of task scheduling centers in real time;
the arbitration center receives feedback signals of a plurality of task scheduling centers;
and the arbitration center selects a first target task scheduling center with a normal operation state from the plurality of task scheduling centers according to the feedback signal.
6. The disaster recovery method based on the task scheduling center according to claim 1, wherein the task scheduling center obtains a task planning scheme according to the user application information and the current resource condition, and sends the task planning scheme to a management and control system, and the method comprises:
the task scheduling center acquires the current available resource condition;
the task scheduling center obtains a resource pre-allocation scheme according to the current available resource condition and the user application information;
and the task scheduling center performs task planning on the current resources according to the resource pre-allocation scheme to obtain a task planning scheme, and sends the task planning scheme to a control system.
7. A disaster recovery device based on a task scheduling center is applied to the field of disaster recovery backup of a relay satellite measurement and control management system and comprises:
the acquisition module is used for acquiring user application information which is used for representing task information executed by the task scheduling center; a user acquires the current resource condition through a space window of a terminal, provides the user application information according to the current resource condition and sends the user application information to an arbitration center;
the selection module is used for selecting a first target task scheduling center with a normal running state from the plurality of task scheduling centers by carrying out state detection on the plurality of task scheduling centers;
the sending module is used for sending the user application information to the first target task scheduling center so that the first target task scheduling center obtains a task planning scheme according to the user application information and the current resource condition;
the receiving module is used for receiving the user application information sent by the arbitration center;
the planning module is used for obtaining a task planning scheme according to the user application information and the current resource condition, and sending the task planning scheme to a management and control system so that the management and control system loads and configures according to the task planning scheme;
the synchronization module is used for synchronizing application data and/or management and control information to a second task scheduling center so that the second task scheduling center can continuously execute a current task according to the synchronized application data and/or management and control information, wherein the application data is used for completing the current task obtained according to the user application information, and the management and control information comprises current resource occupation conditions and equipment operation information in the management and control system;
in the relay satellite measurement and control field, a management and control system comprises an operation and control system, a ground station equipment management and control system and a user center scheduling and control system, wherein corresponding management and control information is corresponding equipment operation information and current resource conditions in the management and control system respectively;
the management and control information also comprises a system operation situation formed by the operation and control system feeding back the actual satellite resource occupation result to the comprehensive network management system in the task scheduling center; the comprehensive network management system distributes the system operation situation to a resource management system to complete the collection of actually used relay satellite resource information; the ground station equipment management and control system and the operation and control system report the equipment operation condition, the network operation performance information and the user service data guarantee condition to the comprehensive network management system of the task scheduling center through a northbound interface to form a system operation situation;
the ground equipment management and control system can also report operation data to the task scheduling center, and the operation and control system feeds back the satellite resource monitoring condition to the task scheduling center;
the ground station equipment management and control system acquires the working state of a task planning link through optical transmission network equipment and a router, collects the bandwidth occupancy rate, the packet loss rate and the transmission delay of a task guarantee link, and completes the unified management and control of local ground station equipment;
in the task execution process, the ground station equipment management and control system and the user center scheduling management and control system respectively collect equipment fault information, equipment operation parameter information and equipment performance information of the station where the ground station equipment management and control system and the user center scheduling management and control system are located, and meanwhile, alarm thresholds can be set for part of key equipment or working parameters to perform early warning, so that the service quality of key services is ensured; link threshold values are arranged locally in each management and control system, when the link pressure values exceed the threshold values, alarming is carried out, or other operations such as manual link switching can be carried out, so that the stability of the links is guaranteed.
CN201910976026.2A 2019-10-14 2019-10-14 Disaster recovery method and device based on task scheduling center Active CN110737521B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910976026.2A CN110737521B (en) 2019-10-14 2019-10-14 Disaster recovery method and device based on task scheduling center

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910976026.2A CN110737521B (en) 2019-10-14 2019-10-14 Disaster recovery method and device based on task scheduling center

Publications (2)

Publication Number Publication Date
CN110737521A CN110737521A (en) 2020-01-31
CN110737521B true CN110737521B (en) 2021-03-05

Family

ID=69268949

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910976026.2A Active CN110737521B (en) 2019-10-14 2019-10-14 Disaster recovery method and device based on task scheduling center

Country Status (1)

Country Link
CN (1) CN110737521B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111913791B (en) * 2020-07-02 2023-10-13 北京和瑞精湛医学检验实验室有限公司 Task scheduling method, device, equipment and computer readable storage medium

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101599026A (en) * 2009-07-09 2009-12-09 浪潮电子信息产业股份有限公司 A kind of cluster job scheduling system with resilient infrastructure
CN102193853A (en) * 2010-03-12 2011-09-21 三星电子株式会社 Virtual machine monitor and scheduling method thereof
US9378051B1 (en) * 2012-07-13 2016-06-28 Google Inc. Systems and methods for performing scheduling for a cluster
CN105808334A (en) * 2016-03-04 2016-07-27 山东大学 MapReduce short job optimization system and method based on resource reuse
CN106095572A (en) * 2016-06-08 2016-11-09 东方网力科技股份有限公司 The Dispatching System of a kind of big data process and method
CN110225137A (en) * 2019-06-24 2019-09-10 北京达佳互联信息技术有限公司 Service request processing method, system, server and storage medium

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8627322B2 (en) * 2010-10-29 2014-01-07 Google Inc. System and method of active risk management to reduce job de-scheduling probability in computer clusters
CN102073546B (en) * 2010-12-13 2013-07-10 北京航空航天大学 Task-dynamic dispatching method under distributed computation mode in cloud computing environment
US9658893B2 (en) * 2015-05-06 2017-05-23 Runtime Design Automation Multilayered resource scheduling
CN105813218B (en) * 2016-04-20 2019-05-07 江苏中科羿链通信技术有限公司 The business scheduling method of wireless self-organization network
CN108123987A (en) * 2016-11-30 2018-06-05 华为技术有限公司 The method and device of master scheduler is determined from cloud computing system
CN108628674A (en) * 2018-05-11 2018-10-09 深圳市零度智控科技有限公司 Method for scheduling task, cloud platform based on cloud platform and computer storage media
CN109951558A (en) * 2019-03-27 2019-06-28 北京并行科技股份有限公司 A kind of cloud dispatching method of supercomputer resource, cloud control centre and system

Also Published As

Publication number Publication date
CN110737521A (en) 2020-01-31

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant