CN112636990A - Resource management system - Google Patents

Info

Publication number
CN112636990A
Authority
CN
China
Prior art keywords
resource
node
manager
information
state information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202011642438.1A
Other languages
Chinese (zh)
Inventor
程杭林
韩文俊
丁琳琳
李路野
黎贺
唐强
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
CETC 14 Research Institute
Original Assignee
CETC 14 Research Institute
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by CETC 14 Research Institute
Priority to CN202011642438.1A
Publication of CN112636990A
Legal status: Pending

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00 Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/04 Network management architectures or arrangements
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00 Arrangements for program control, e.g. control units
    • G06F9/06 Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46 Multiprogramming arrangements
    • G06F9/50 Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5005 Allocation of resources, e.g. of the central processing unit [CPU] to service a request
    • G06F9/5027 Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00 Arrangements for monitoring or testing data switching networks
    • H04L43/10 Active monitoring, e.g. heartbeat, ping or trace-route
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L47/00 Traffic control in data switching networks
    • H04L47/10 Flow control; Congestion control
    • H04L47/12 Avoiding congestion; Recovering from congestion
    • H04L47/125 Avoiding congestion; Recovering from congestion by balancing the load, e.g. traffic engineering

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Software Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Cardiology (AREA)
  • General Health & Medical Sciences (AREA)
  • Hardware Redundancy (AREA)

Abstract

The invention discloses a resource management system comprising a plurality of node managers, a resource manager, and a subordinate resource manager. Each node manager acquires, in real time and at a configurable beat, the resource state information of the hardware node on which it runs, and stores that information to form a node resource pool. The node manager sends all resource state information in the node resource pool to the resource manager, which receives the resource state information reported by all node managers. The resource manager classifies and manages the collected resource state information to form resource information, responds in real time to resource requests from external radar information processing applications, and allocates the resources those applications require based on the current resource information and a scheduling policy. The invention supports distributed resource pooling management, unified scheduling, and on-demand allocation for a radar processing platform, supports time-sharing multiplexing of resources, and improves the utilization of software and hardware resources.

Description

Resource management system
Technical Field
The invention relates to the field of radar information processing, in particular to a resource management system.
Background
Faced with the combat requirements of multiple targets and complex interference environments, and with the challenges posed by constantly changing radar targets and operating environments, a radar system must offer task-oriented functional extensibility, easily upgradable processing algorithms, and continuously improvable performance, so that it can respond promptly to new combat requirements.
The functions and combat missions undertaken by radars are becoming increasingly diverse, and radars are evolving toward the integration of detection, electronic warfare, communication, and similar functions. These requirements drive radar equipment toward customizable requirements, reconfigurable hardware, and reconfigurable software, so that under multi-function, multi-task, complex, and changing conditions the system can be reconfigured for dynamic environments and task demands, and remains extensible and maintainable over a longer equipment life cycle.
In existing radar equipment, software and hardware resources are relatively fixed, redundant backups are numerous, resource utilization is low, resources are rarely reused across operating modes, and dynamic scheduling and allocation are difficult.
Disclosure of Invention
To solve the above problems, the present invention provides a resource management system comprising a plurality of node managers, a resource manager, and a subordinate resource manager; each node manager acquires, in real time and at a configurable beat, the resource state information of the hardware node on which it runs, and stores the resource state information to form a node resource pool; the node manager sends all resource state information in the node resource pool to the resource manager; the resource manager receives the resource state information reported by all the node managers; the resource manager classifies and manages the collected resource state information to form resource information, responds in real time to resource requests from external radar information processing applications, and allocates the resources required by those applications based on the current resource information and a scheduling policy; the resource manager sends heartbeat information to the subordinate resource manager in real time at the configured beat; the subordinate resource manager receives the heartbeat information of the resource manager in real time; the subordinate resource manager judges from the heartbeat information whether the resource manager is abnormal; while the resource manager works normally, the subordinate resource manager remains in an inactive state; when the resource manager is abnormal, the subordinate resource manager activates itself and becomes the new resource manager.
Furthermore, the node manager runs on every hardware node that provides computing resources for the system, and comprises a resource acquisition module and an information reporting module. The resource acquisition module acquires the resource state information of the hardware nodes in the distributed system in real time, at a configurable beat, by calling an interface, and stores the resource state information uniformly in key-value form to form a node resource pool; the information reporting module packages the resource state information in the node resource pool and sends it to the resource manager over TCP.
Furthermore, the resource manager comprises an information receiving module and a resource allocation module. The information receiving module receives the resource state information reported by all the node managers and builds a linked list of <node IP - node resource information> entries to form a system resource pool; the resource allocation module classifies and manages the resource state information in the system resource pool to form resource information; and the resource allocation module responds in real time to resource requests from radar information processing applications and allocates the resources required by each application based on the current resource information and the scheduling policy.
Further, the scheduling policy comprises a load balancing policy and a performance priority policy. Under the load balancing policy, the application is allocated to an idle node; under the performance priority policy, the application is allocated to the node that hosts the applications associated with it. The current resource information comprises resource type, resource demand, remaining resources, resource load, and application association.
Further, the resource manager sends the heartbeat information at the configured beat by UDP multicast, and the resource manager is judged abnormal when its heartbeat information reports a fault, or is lost, three consecutive times.
Compared with the prior art, the invention has the following beneficial effects:
1. The invention provides a node manager module that acquires node software and hardware resource information in real time and supports pooled management of node resources.
2. The invention provides a resource management module that supports unified management and on-demand allocation of distributed system resources, supports pooled management and time-sharing multiplexing of system resources, and improves resource utilization.
3. The invention provides a subordinate resource management module that gives resource management a high-reliability design and effectively prevents single points of failure in the system software and hardware.
4. The invention supports real-time acquisition of the state information of the underlying software and hardware resources, forming a system resource pool for unified allocation and management and improving resource utilization.
Drawings
FIG. 1 shows the overall architecture of the resource management system.
Detailed Description
The present invention is described in further detail below with reference to the attached drawing figures.
In this disclosure, aspects of the present invention are described with reference to the accompanying drawings, in which a number of illustrative embodiments are shown. It should be appreciated that the various concepts and embodiments described above, as well as those described in greater detail below, may be implemented in any of numerous ways, as the disclosed concepts and embodiments are not limited to any one implementation. In addition, some aspects of the present disclosure may be used alone, or in any suitable combination with other aspects of the present disclosure.
Example 1:
To achieve the above object, the present invention provides a resource management system that runs on top of an operating system, as shown in FIG. 1. The resource management system includes a node manager, a resource manager, and a subordinate resource manager.
The node manager runs on every hardware node that provides computing resources for the system, and comprises a resource acquisition module and an information reporting module. At a fixed acquisition beat, the resource acquisition module calls an interface to acquire from the operating system, in real time, the resource state information (covering computing, storage, network, and similar resources) of hardware nodes such as CPUs (central processing units) and DSPs (digital signal processors), and stores it uniformly in key-value form to form a node resource pool. The acquisition beat is configurable and defaults to 500 ms. The information reporting module packages the resource state information in the node resource pool and sends it to the resource manager over TCP. The specific resource state information is shown in Table 1.
[Table 1 appears as an image in the original publication; its contents are not reproduced here.]
TABLE 1
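The node manager behaviour described above (key-value storage of collected state, then uniform packaging for TCP reporting) can be sketched roughly as follows. This is only an illustration under stated assumptions: the class `NodeResourcePool`, the probe callables, the resource keys, and the length-prefixed JSON framing are all hypothetical, since the patent specifies neither the interface calls nor the wire format.

```python
import json
import struct
from typing import Callable, Dict

DEFAULT_BEAT_S = 0.5  # 500 ms default acquisition beat stated in the description


class NodeResourcePool:
    """Key-value store holding the latest resource state of one hardware node."""

    def __init__(self, node_ip: str) -> None:
        self.node_ip = node_ip
        self.state: Dict[str, float] = {}

    def collect(self, probes: Dict[str, Callable[[], float]]) -> None:
        # Each probe is a hypothetical stand-in for an operating-system interface call.
        for key, probe in probes.items():
            self.state[key] = probe()

    def package(self) -> bytes:
        # Assumed framing: 4-byte big-endian length, then a JSON body, so that a
        # receiver can split messages arriving on a TCP byte stream.
        body = json.dumps(
            {"node_ip": self.node_ip, "resources": self.state}, sort_keys=True
        ).encode("utf-8")
        return struct.pack("!I", len(body)) + body


def unpack(frame: bytes) -> dict:
    """Decode one frame produced by NodeResourcePool.package()."""
    (length,) = struct.unpack("!I", frame[:4])
    return json.loads(frame[4:4 + length].decode("utf-8"))
```

In a running node manager, `collect` and `package` would be called once per beat (every `DEFAULT_BEAT_S` seconds unless reconfigured) and the resulting frame written to a TCP connection to the resource manager.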
The resource manager comprises an information receiving module and a resource allocation module. The information receiving module receives the resource state information reported by all node managers and organizes it into a map linked list of <node IP - node resource information> entries to form a system resource pool. The resource allocation module classifies and manages the resource state information in the system resource pool to form resource information. It responds in real time to resource requests from radar information processing applications and allocates the resources each application requires based on the current resource information and the scheduling policy: under the load balancing policy, the application is allocated to an idle node; under the performance priority policy, the application is allocated to the node that hosts the applications associated with it, which reduces transmission delay. The current resource information comprises resource type, resource demand, remaining resources, resource load, and application association.
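The two scheduling policies can be illustrated with a small sketch over a system resource pool keyed by node IP. Everything here is assumed for illustration: the `NodeInfo` fields, the policy names, the use of "least loaded node" as a proxy for "idle node", and the fallback to the least loaded node when no node hosts an associated application. The patent does not define these details.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional, Set


@dataclass
class NodeInfo:
    load: float   # resource load, 0.0 (idle) to 1.0 (saturated)
    free: float   # remaining resource amount, arbitrary units
    apps: Set[str] = field(default_factory=set)  # applications already placed here


def allocate(pool: Dict[str, NodeInfo], app: str, demand: float,
             policy: str, related: Optional[Set[str]] = None) -> Optional[str]:
    """Return the IP of the chosen node, or None if no node can meet the demand."""
    # Only nodes with enough remaining resources are candidates.
    candidates = {ip: n for ip, n in pool.items() if n.free >= demand}
    if not candidates:
        return None
    if policy == "performance" and related:
        # Performance priority: prefer nodes hosting associated applications,
        # which keeps data exchange local and reduces transmission delay.
        colocated = {ip: n for ip, n in candidates.items() if n.apps & related}
        if colocated:
            candidates = colocated
    # Load balancing (and tie-breaking for performance priority): least loaded node.
    chosen = min(candidates, key=lambda ip: (candidates[ip].load, ip))
    pool[chosen].free -= demand
    pool[chosen].apps.add(app)
    return chosen
```

The sketch updates only the remaining resources after a placement; in the described system the load figures would be refreshed by the next report from the node managers.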
The resource manager sends heartbeat information (-1 indicates a fault; 1 indicates normal operation) to the subordinate resource manager by UDP multicast at the configured beat. The sending beat is configurable and defaults to 500 ms.
The subordinate resource manager comprises a monitoring module, which receives the heartbeat information of the resource manager in real time. While the resource manager works normally, the subordinate resource manager remains in an inactive state. When the heartbeat information of the resource manager reports a fault (value -1), or is lost, three consecutive times, the subordinate resource manager determines that the resource manager is abnormal, activates itself, and becomes the new resource manager.
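A minimal sketch of the heartbeat and failover logic follows. The values 1 and -1, the 500 ms default beat, and the three-beat threshold come from the description; the single-byte encoding, the multicast group `239.0.0.1`, port `5007`, and the `StandbyMonitor` class are hypothetical. The sketch also assumes that fault beats and lost beats share one consecutive counter, which the patent text leaves open.

```python
import socket
import struct
from typing import Optional

HEARTBEAT_NORMAL = 1   # value given in the description for normal operation
HEARTBEAT_FAULT = -1   # value given in the description for a fault
DEFAULT_BEAT_S = 0.5   # 500 ms default sending beat


def encode_heartbeat(healthy: bool) -> bytes:
    # Assumed wire format: one signed byte carrying 1 or -1.
    return struct.pack("!b", HEARTBEAT_NORMAL if healthy else HEARTBEAT_FAULT)


def send_heartbeat(healthy: bool, group: str = "239.0.0.1", port: int = 5007) -> None:
    # Group address and port are placeholders, not values from the patent.
    with socket.socket(socket.AF_INET, socket.SOCK_DGRAM, socket.IPPROTO_UDP) as sock:
        sock.setsockopt(socket.IPPROTO_IP, socket.IP_MULTICAST_TTL, 1)
        sock.sendto(encode_heartbeat(healthy), (group, port))


class StandbyMonitor:
    """Monitoring module of the subordinate resource manager."""

    def __init__(self, threshold: int = 3) -> None:
        self.threshold = threshold
        self.bad_beats = 0     # consecutive fault or lost beats
        self.active = False    # inactive while the resource manager is healthy

    def on_beat(self, payload: Optional[bytes]) -> bool:
        """Process one beat interval; payload is None when no heartbeat arrived."""
        if self.active:
            return True  # already promoted to resource manager
        value = struct.unpack("!b", payload[:1])[0] if payload else None
        if value == HEARTBEAT_NORMAL:
            self.bad_beats = 0
        else:
            self.bad_beats += 1
            if self.bad_beats >= self.threshold:
                self.active = True
        return self.active
```

A receiving loop would call `on_beat` once per beat interval, passing `None` when the socket read times out, and start the resource manager role as soon as the method returns `True`.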
In summary, the present invention provides a node manager module that acquires node software and hardware resource information in real time and supports pooled management of node resources; a resource management module that supports unified management and on-demand allocation of the resources of a distributed system (hardware nodes), supports pooled management and time-sharing multiplexing of system resources, and improves resource utilization; and a subordinate resource management module that gives resource management a high-reliability design and effectively prevents single points of failure in the system software and hardware.

Claims (5)

1. A resource management system, comprising a plurality of node managers, a resource manager, and a subordinate resource manager; each node manager acquires, in real time and at a configurable beat, the resource state information of the hardware node on which it runs, and stores the resource state information to form a node resource pool; the node manager sends all resource state information in the node resource pool to the resource manager; the resource manager receives the resource state information reported by all the node managers; the resource manager classifies and manages the collected resource state information to form resource information, responds in real time to resource requests from external radar information processing applications, and allocates the resources required by those applications based on the current resource information and a scheduling policy; the resource manager sends heartbeat information to the subordinate resource manager in real time at the configured beat; the subordinate resource manager receives the heartbeat information of the resource manager in real time; the subordinate resource manager judges from the heartbeat information whether the resource manager is abnormal; while the resource manager works normally, the subordinate resource manager remains in an inactive state; when the resource manager is abnormal, the subordinate resource manager activates itself and becomes the new resource manager.
2. The resource management system of claim 1, wherein the node manager runs on every hardware node that provides computing resources for the system, and the node manager includes a resource acquisition module and an information reporting module; the resource acquisition module acquires the resource state information of the hardware nodes in the distributed system in real time, at a configurable beat, by calling an interface, and stores the resource state information uniformly in key-value form to form a node resource pool; and the information reporting module packages the resource state information in the node resource pool and sends it to the resource manager over TCP.
3. The resource management system of claim 1, wherein the resource manager comprises an information receiving module and a resource allocation module; the information receiving module receives the resource state information reported by all node managers and builds a linked list of <node IP - node resource information> entries from all the resource state information to form a system resource pool; the resource allocation module classifies and manages the resource state information in the system resource pool to form resource information; and the resource allocation module responds in real time to resource requests from radar information processing applications and allocates the resources required by each application based on the current resource information and the scheduling policy.
4. The resource management system of claim 3, wherein the scheduling policy comprises a load balancing policy and a performance priority policy; under the load balancing policy, the application is allocated to an idle node, and under the performance priority policy, the application is allocated to the node that hosts the applications associated with it; the current resource information comprises resource type, resource demand, remaining resources, resource load, and application association.
5. The resource management system according to claim 3 or 4, wherein the resource manager sends the heartbeat information at the configured beat by UDP multicast, and the resource manager is judged abnormal when its heartbeat information reports a fault, or is lost, three consecutive times.
CN202011642438.1A 2020-12-31 2020-12-31 Resource management system Pending CN112636990A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011642438.1A CN112636990A (en) 2020-12-31 2020-12-31 Resource management system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011642438.1A CN112636990A (en) 2020-12-31 2020-12-31 Resource management system

Publications (1)

Publication Number Publication Date
CN112636990A true CN112636990A (en) 2021-04-09

Family

ID=75290217

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011642438.1A Pending CN112636990A (en) 2020-12-31 2020-12-31 Resource management system

Country Status (1)

Country Link
CN (1) CN112636990A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114389955A (en) * 2022-03-02 2022-04-22 中国电子科技集团公司第十四研究所 Embedded platform heterogeneous resource pooling management method
CN114598591A (en) * 2022-03-07 2022-06-07 中国电子科技集团公司第十四研究所 Embedded platform node fault recovery system and method
CN117971499A (en) * 2024-03-28 2024-05-03 北京中电普华信息技术有限公司 Resource allocation method, device, electronic equipment and storage medium

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105893157A (en) * 2016-04-29 2016-08-24 国家计算机网络与信息安全管理中心 Open and distributed system resource management and task scheduling system and method
CN107450977A (en) * 2015-12-30 2017-12-08 北京典赞科技有限公司 The resource management dispatching method towards GPGPU clusters based on YARN
CN108984569A (en) * 2017-06-05 2018-12-11 中兴通讯股份有限公司 Database switching method, system and computer readable storage medium

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114389955A (en) * 2022-03-02 2022-04-22 中国电子科技集团公司第十四研究所 Embedded platform heterogeneous resource pooling management method
CN114389955B (en) * 2022-03-02 2023-08-15 中国电子科技集团公司第十四研究所 Method for managing heterogeneous resource pool of embedded platform
CN114598591A (en) * 2022-03-07 2022-06-07 中国电子科技集团公司第十四研究所 Embedded platform node fault recovery system and method
CN114598591B (en) * 2022-03-07 2024-02-02 中国电子科技集团公司第十四研究所 Embedded platform node fault recovery system and method
CN117971499A (en) * 2024-03-28 2024-05-03 北京中电普华信息技术有限公司 Resource allocation method, device, electronic equipment and storage medium
CN117971499B (en) * 2024-03-28 2024-06-25 北京中电普华信息技术有限公司 Resource allocation method, device, electronic equipment and storage medium

Similar Documents

Publication Publication Date Title
CN112636990A (en) Resource management system
EP3522013B1 (en) Method and system for migration of containers in a container orchestration platform between compute nodes
US7856572B2 (en) Information processing device, program thereof, modular type system operation management system, and component selection method
CN112000448A (en) Micro-service architecture-based application management method
US20130198755A1 (en) Apparatus and method for managing resources in cluster computing environment
US20100077250A1 (en) Virtualization based high availability cluster system and method for managing failure in virtualization based high availability cluster system
CN109918170A (en) A kind of cloud data center virtual machine dynamic BTS configuration method and system
CN104917805A (en) Load sharing method and equipment
CN101778002A (en) Large-scale cluster system and building method thereof
Lardieri et al. A multi-layered resource management framework for dynamic resource management in enterprise dre systems
CN116360935A (en) Control system based on multi-priority real-time reconstruction and scheduling and working method thereof
CN106961700B (en) Wireless communication method for dynamic remote fault-tolerant reconstruction of cluster avionics system computing resources
CN110532060B (en) Hybrid network environment data acquisition method and system
CN104484228A (en) Distributed parallel task processing system based on Intelli-DSC (Intelligence-Data Service Center)
CN112328402A (en) High-efficiency self-adaptive space-based computing platform architecture and implementation method thereof
CN112631756A (en) Distributed regulation and control method and device applied to space flight measurement and control software
CN114615268B (en) Service network, monitoring node, container node and equipment based on Kubernetes cluster
CN111651278B (en) Dynamic reconstruction method and platform based on software radar
CN113515356B (en) Lightweight distributed resource management and task scheduler and method
CN104486447A (en) Large platform cluster system based on Big-Cluster
CN114924888A (en) Resource allocation method, data processing method, device, equipment and storage medium
Kim et al. An accelerated edge computing with a container and its orchestration
CN106844021B (en) Computing environment resource management system and management method thereof
Mao et al. A Load Balancing and Overload Controlling Architecture in Clouding Computing
CN117938863B (en) Cluster-based joint simulation implementation method, system, equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20210409