CN112636990A - Resource management system - Google Patents
Resource management system Download PDFInfo
- Publication number
- CN112636990A CN112636990A CN202011642438.1A CN202011642438A CN112636990A CN 112636990 A CN112636990 A CN 112636990A CN 202011642438 A CN202011642438 A CN 202011642438A CN 112636990 A CN112636990 A CN 112636990A
- Authority
- CN
- China
- Prior art keywords
- resource
- node
- manager
- information
- state information
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 230000010365 information processing Effects 0.000 claims abstract description 10
- 238000007726 management method Methods 0.000 claims description 29
- 238000013468 resource allocation Methods 0.000 claims description 9
- 230000002159 abnormal effect Effects 0.000 claims description 7
- 238000011176 pooling Methods 0.000 abstract description 5
- 238000012544 monitoring process Methods 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 238000009826 distribution Methods 0.000 description 1
- 230000010354 integration Effects 0.000 description 1
- 238000009827 uniform distribution Methods 0.000 description 1
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L41/00—Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
- H04L41/04—Network management architectures or arrangements
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/46—Multiprogramming arrangements
- G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
- G06F9/5005—Allocation of resources, e.g. of the central processing unit [CPU] to service a request
- G06F9/5027—Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L43/00—Arrangements for monitoring or testing data switching networks
- H04L43/10—Active monitoring, e.g. heartbeat, ping or trace-route
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L47/00—Traffic control in data switching networks
- H04L47/10—Flow control; Congestion control
- H04L47/12—Avoiding congestion; Recovering from congestion
- H04L47/125—Avoiding congestion; Recovering from congestion by balancing the load, e.g. traffic engineering
Landscapes
- Engineering & Computer Science (AREA)
- Computer Networks & Wireless Communication (AREA)
- Signal Processing (AREA)
- Software Systems (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Health & Medical Sciences (AREA)
- Cardiology (AREA)
- General Health & Medical Sciences (AREA)
- Hardware Redundancy (AREA)
Abstract
The invention discloses a resource management system, which comprises a plurality of node managers, resource managers and subordinate resource managers; the node manager acquires resource state information of a hardware node where the node manager is located in real time according to a configurable beat and stores the resource state information to form a node resource pool; the node manager sends all resource state information in the node resource pool to the resource manager; the resource manager receives the resource state information reported by all the node managers; and the resource manager performs classified management on the collected resource state information to form resource information, responds to a resource request of an external radar information processing application in real time, and allocates resources required by the radar information processing application based on the current resource information and a scheduling strategy. The invention supports the radar processing platform distributed resource pooling management, unified scheduling and allocation according to needs, supports the time-sharing multiplexing of resources and improves the utilization rate of software and hardware resources.
Description
Technical Field
The invention relates to the field of radar information processing, in particular to a resource management system.
Background
In the face of the combat requirements of multiple targets and complex interference environments and the challenges brought by constantly changing radar combat objects and combat environments, the radar system is required to be expandable in task-oriented functions, easy to upgrade processing system algorithms and constantly improved in system performance so as to meet the requirement of timely responding to new combat requirements.
The functions and combat missions undertaken by radars are becoming more and more diversified, and radars are developing towards the direction of integration of detection, electronic warfare, communication and the like. The requirements lead the radar equipment to develop towards the characteristics of customizable requirements, reconfigurable hardware, reconfigurable software and the like, so as to meet the reconfigurability of the system facing to dynamic environment and task requirements and the expandability and maintainability facing to a longer equipment life cycle under the conditions of multiple functions, multiple tasks and complex and variable environments.
In the existing radar equipment, software and hardware resources are relatively fixed, redundancy backup is more, the resource utilization rate is not high, the resource reuse rate among multiple modes is low, and dynamic scheduling distribution is difficult.
Disclosure of Invention
In order to solve the above problems, the present invention provides a resource management system, which includes a plurality of node managers, a resource manager, and a subordinate resource manager; the node manager acquires resource state information of a hardware node where the node manager is located in real time according to a configurable beat and stores the resource state information to form a node resource pool; the node manager sends all resource state information in the node resource pool to the resource manager; the resource manager receives the resource state information reported by all the node managers; the resource manager carries out classified management on the collected resource state information to form resource information, responds to a resource request of external radar information processing application in real time, and allocates resources required by the radar information processing application based on the current resource information and a scheduling strategy; the resource manager sends heartbeat information to the subordinate resource managers in real time according to the configuration beat; the slave resource manager receives heartbeat information of the resource manager in real time; the slave resource manager judges whether the resource manager is abnormal or not according to the heartbeat information, and when the resource manager works normally, the slave resource manager is in an inactivated state; when the resource manager is abnormal, the subordinate resource manager is actively activated to become a new resource manager.
Furthermore, the node manager runs on all hardware nodes providing computing resources for the system, and comprises a resource acquisition module and an information reporting module, wherein the resource acquisition module acquires resource state information of the hardware nodes in real time in a configurable beat distributed system through a calling interface and uniformly stores the resource state information in a key-value form to form a node resource pool; and the information reporting module uniformly packages and sends the resource state information in the node resource pool to the resource manager in a TCP network communication mode.
Furthermore, the resource manager comprises an information receiving module and a resource allocation module, wherein the information receiving module receives the resource state information reported by all the node managers and constructs a linked list to form a system resource pool in the form of < node IP-node resource information >; the resource allocation module carries out classified management on the resource state information in the system resource pool to form resource information; and the resource allocation module responds to the resource request of the radar information processing application in real time and allocates resources required by the application based on the current resource information and the scheduling strategy.
Further, the scheduling policy comprises a load balancing policy and a performance priority policy, if the load balancing policy is adopted, the application is distributed to the idle node, and if the performance priority policy is adopted, the application is distributed to the node where the application related to the application is located; the current resource information comprises resource types, resource demand, resource surplus, resource loads and application relevance.
Further, the resource manager sends heartbeat information according to a configuration beat in a UDP multicast mode, and when the heartbeat information of the resource manager continuously fails or is lost for three times, the resource manager is abnormal.
Compared with the prior art, the invention has the following beneficial effects:
1. the invention provides a node manager module which can acquire node software and hardware resource information in real time and support node resource pooling management;
2. the invention provides a resource management module, which supports the unified management and the allocation according to needs of distributed system resources, supports the pooling management and the time-sharing multiplexing of the system resources and improves the resource utilization rate;
3. the invention provides a subordinate resource management module, which realizes the high-reliability design of resource management and effectively prevents the problem of single-point failure of software and hardware of a system.
4. The invention supports the real-time acquisition of the bottom layer software and hardware resource state information, forms a system resource pool for uniform distribution and management, and improves the resource utilization rate.
Drawings
FIG. 1 is an overall architecture of a resource management system.
Detailed Description
The present invention is described in further detail below with reference to the attached drawing figures.
In this disclosure, aspects of the present invention are described with reference to the accompanying drawings, in which a number of illustrative embodiments are shown. It should be appreciated that the various concepts and embodiments described above, as well as those described in greater detail below, may be implemented in any of numerous ways, as the disclosed concepts and embodiments are not limited to any one implementation. In addition, some aspects of the present disclosure may be used alone, or in any suitable combination with other aspects of the present disclosure.
Example 1:
to achieve the above object, the present invention provides a resource management system, as shown in fig. 1, running on an operating system, the resource management system including: a node manager, a resource manager, and a subordinate resource manager.
The node manager runs on all hardware nodes providing computing resources for the system, and comprises a resource acquisition module and an information reporting module. The resource acquisition module acquires resource state information (including resource information such as calculation, storage and network) of hardware nodes such as a CPU (central processing unit), a DSP (digital signal processor) and the like from an operating system in real time through a certain acquisition beat by calling an interface, and uniformly stores the resource state information in a key-value form to form a node resource pool, wherein the acquisition beat is configurable and defaults to 500 ms; the information reporting module uniformly packages and sends resource state information in the node resource pool to a resource manager in a TCP network communication mode, and the specific resource state information is shown in table 1;
TABLE 1
The resource manager comprises an information receiving module and a resource allocation module. The information receiving module receives resource state information reported by all node managers, and all the resource state information constructs a map linked list in a mode of < node IP-node resource information > to form a system resource pool; the resource allocation module carries out classified management on resource state information in a system resource pool to form resource information, the resource allocation module responds to a resource request of radar information processing application in real time, resources required by the application are allocated based on current resource information and a scheduling strategy, if a load balancing strategy is adopted, the application is allocated to an idle node, and if a performance priority strategy is adopted, the application is allocated to a node where the application which is in a relation with the application is located, so that transmission delay is reduced. The current resource information comprises resource types, resource demand, resource surplus, resource loads and application relevance.
The resource manager sends heartbeat information (-1: representing fault and 1: representing normal) to the subordinate resource manager in a UDP multicast mode according to a configuration beat, wherein the sending beat is configurable and defaults to 500 ms;
and the subordinate resource manager comprises a monitoring module, and the monitoring module receives the heartbeat information of the resource manager in real time. When the resource manager works normally, the slave resource manager is in an inactivated state; when the heartbeat information of the resource manager continuously fails (the value is-1) or is lost three times, the slave resource manager determines that the resource manager is abnormal, and the slave resource manager is actively activated to become a new resource manager.
In summary, the present invention provides a node manager module, which can obtain the node software and hardware resource information in real time, and support the pooling management of node resources; the invention provides a resource management module, which supports the unified management and the allocation according to needs of resources of a distributed system (hardware nodes), supports the pooling management and the time-sharing multiplexing of the resources of the system and improves the utilization rate of the resources; the invention provides a subordinate resource management module, which realizes the high-reliability design of resource management and effectively prevents the problem of single-point failure of software and hardware of a system.
Claims (5)
1. A resource management system, comprising a plurality of node managers, a resource manager, and a subordinate resource manager; the node manager acquires resource state information of a hardware node where the node manager is located in real time according to a configurable beat and stores the resource state information to form a node resource pool; the node manager sends all resource state information in the node resource pool to the resource manager; the resource manager receives the resource state information reported by all the node managers; the resource manager carries out classified management on the collected resource state information to form resource information, responds to a resource request of external radar information processing application in real time, and allocates resources required by the radar information processing application based on the current resource information and a scheduling strategy; the resource manager sends heartbeat information to the subordinate resource managers in real time according to the configuration beat; the slave resource manager receives heartbeat information of the resource manager in real time; the slave resource manager judges whether the resource manager is abnormal or not according to the heartbeat information, and when the resource manager works normally, the slave resource manager is in an inactivated state; when the resource manager is abnormal, the subordinate resource manager is actively activated to become a new resource manager.
2. The resource management system of claim 1, wherein the node manager runs on all hardware nodes providing computing resources for the system, and the node manager includes a resource acquisition module and an information reporting module, and the resource acquisition module acquires resource state information of the hardware nodes in real time in a configurable beat distributed system through a calling interface and uniformly stores the resource state information in a key-value form to form a node resource pool; and the information reporting module uniformly packages and sends the resource state information in the node resource pool to the resource manager in a TCP network communication mode.
3. The resource management system of claim 1, wherein the resource manager comprises an information receiving module and a resource allocation module, the information receiving module receives resource state information reported by all node managers, and constructs a linked list of all the resource state information in a form of < node IP-node resource information > to form a system resource pool; the resource allocation module carries out classified management on the resource state information in the system resource pool to form resource information; and the resource allocation module responds to the resource request of the radar information processing application in real time and allocates resources required by the application based on the current resource information and the scheduling strategy.
4. The resource management system of claim 3, wherein the scheduling policy comprises a load balancing policy and a performance priority policy, and if the load balancing policy is adopted, the application is allocated to an idle node, and if the performance priority policy is adopted, the application is allocated to a node where the application associated with the application is located; the current resource information comprises resource types, resource demand, resource surplus, resource loads and application relevance.
5. The resource management system according to claim 3 or 4, wherein the resource manager sends the heartbeat information in a configuration beat in a form of UDP multicast, and when the heartbeat information of the resource manager is continuously failed or lost three times, the resource manager is abnormal.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011642438.1A CN112636990A (en) | 2020-12-31 | 2020-12-31 | Resource management system |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011642438.1A CN112636990A (en) | 2020-12-31 | 2020-12-31 | Resource management system |
Publications (1)
Publication Number | Publication Date |
---|---|
CN112636990A true CN112636990A (en) | 2021-04-09 |
Family
ID=75290217
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202011642438.1A Pending CN112636990A (en) | 2020-12-31 | 2020-12-31 | Resource management system |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN112636990A (en) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114389955A (en) * | 2022-03-02 | 2022-04-22 | 中国电子科技集团公司第十四研究所 | Embedded platform heterogeneous resource pooling management method |
CN114598591A (en) * | 2022-03-07 | 2022-06-07 | 中国电子科技集团公司第十四研究所 | Embedded platform node fault recovery system and method |
CN117971499A (en) * | 2024-03-28 | 2024-05-03 | 北京中电普华信息技术有限公司 | Resource allocation method, device, electronic equipment and storage medium |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105893157A (en) * | 2016-04-29 | 2016-08-24 | 国家计算机网络与信息安全管理中心 | Open and distributed system resource management and task scheduling system and method |
CN107450977A (en) * | 2015-12-30 | 2017-12-08 | 北京典赞科技有限公司 | The resource management dispatching method towards GPGPU clusters based on YARN |
CN108984569A (en) * | 2017-06-05 | 2018-12-11 | 中兴通讯股份有限公司 | Database switching method, system and computer readable storage medium |
-
2020
- 2020-12-31 CN CN202011642438.1A patent/CN112636990A/en active Pending
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107450977A (en) * | 2015-12-30 | 2017-12-08 | 北京典赞科技有限公司 | The resource management dispatching method towards GPGPU clusters based on YARN |
CN105893157A (en) * | 2016-04-29 | 2016-08-24 | 国家计算机网络与信息安全管理中心 | Open and distributed system resource management and task scheduling system and method |
CN108984569A (en) * | 2017-06-05 | 2018-12-11 | 中兴通讯股份有限公司 | Database switching method, system and computer readable storage medium |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114389955A (en) * | 2022-03-02 | 2022-04-22 | 中国电子科技集团公司第十四研究所 | Embedded platform heterogeneous resource pooling management method |
CN114389955B (en) * | 2022-03-02 | 2023-08-15 | 中国电子科技集团公司第十四研究所 | Method for managing heterogeneous resource pool of embedded platform |
CN114598591A (en) * | 2022-03-07 | 2022-06-07 | 中国电子科技集团公司第十四研究所 | Embedded platform node fault recovery system and method |
CN114598591B (en) * | 2022-03-07 | 2024-02-02 | 中国电子科技集团公司第十四研究所 | Embedded platform node fault recovery system and method |
CN117971499A (en) * | 2024-03-28 | 2024-05-03 | 北京中电普华信息技术有限公司 | Resource allocation method, device, electronic equipment and storage medium |
CN117971499B (en) * | 2024-03-28 | 2024-06-25 | 北京中电普华信息技术有限公司 | Resource allocation method, device, electronic equipment and storage medium |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN112636990A (en) | Resource management system | |
EP3522013B1 (en) | Method and system for migration of containers in a container orchestration platform between compute nodes | |
US7856572B2 (en) | Information processing device, program thereof, modular type system operation management system, and component selection method | |
CN112000448A (en) | Micro-service architecture-based application management method | |
US20130198755A1 (en) | Apparatus and method for managing resources in cluster computing environment | |
US20100077250A1 (en) | Virtualization based high availability cluster system and method for managing failure in virtualization based high availability cluster system | |
CN109918170A (en) | A kind of cloud data center virtual machine dynamic BTS configuration method and system | |
CN104917805A (en) | Load sharing method and equipment | |
CN101778002A (en) | Large-scale cluster system and building method thereof | |
Lardieri et al. | A multi-layered resource management framework for dynamic resource management in enterprise dre systems | |
CN116360935A (en) | Control system based on multi-priority real-time reconstruction and scheduling and working method thereof | |
CN106961700B (en) | Wireless communication method for dynamic remote fault-tolerant reconstruction of cluster avionics system computing resources | |
CN110532060B (en) | Hybrid network environment data acquisition method and system | |
CN104484228A (en) | Distributed parallel task processing system based on Intelli-DSC (Intelligence-Data Service Center) | |
CN112328402A (en) | High-efficiency self-adaptive space-based computing platform architecture and implementation method thereof | |
CN112631756A (en) | Distributed regulation and control method and device applied to space flight measurement and control software | |
CN114615268B (en) | Service network, monitoring node, container node and equipment based on Kubernetes cluster | |
CN111651278B (en) | Dynamic reconstruction method and platform based on software radar | |
CN113515356B (en) | Lightweight distributed resource management and task scheduler and method | |
CN104486447A (en) | Large platform cluster system based on Big-Cluster | |
CN114924888A (en) | Resource allocation method, data processing method, device, equipment and storage medium | |
Kim et al. | An accelerated edge computing with a container and its orchestration | |
CN106844021B (en) | Computing environment resource management system and management method thereof | |
Mao et al. | A Load Balancing and Overload Controlling Architecture in Clouding Computing | |
CN117938863B (en) | Cluster-based joint simulation implementation method, system, equipment and storage medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20210409 |