CN106850598B - Uniform resource management system and method for whole-ship computing environment - Google Patents


Info

Publication number
CN106850598B
CN106850598B CN201710035518.2A CN201710035518A CN106850598B CN 106850598 B CN106850598 B CN 106850598B CN 201710035518 A CN201710035518 A CN 201710035518A CN 106850598 B CN106850598 B CN 106850598B
Authority
CN
China
Prior art keywords
node
management
module
virtual machine
deployment
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201710035518.2A
Other languages
Chinese (zh)
Other versions
CN106850598A (en)
Inventor
谢彬
卜炜
李宁波
王敬平
王慧玲
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
No32 Research Institute Of China Electronics Technology Group Corp
Original Assignee
No32 Research Institute Of China Electronics Technology Group Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2016-12-06
Filing date
2017-01-18
Publication date
2020-06-12
Application filed by No32 Research Institute Of China Electronics Technology Group Corp
Publication of CN106850598A
Application granted
Publication of CN106850598B

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00 Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/40 Support for services or applications
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00 Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06 Management of faults, events, alarms or notifications
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00 Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/80 Responding to QoS

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Multimedia (AREA)
  • Stored Programmes (AREA)

Abstract

The invention discloses a uniform resource management system and method for a whole-ship computing environment. The system comprises a uniform resource description module, which realizes the distribution, query, interpretation and execution of configuration information for platform nodes, software and the like through the configuration of XML files; a node host OS deployment or update module, which parses the platform resource description and deploys or updates the node host OS over the network; a virtual machine deployment module, which selects and parses a resource description file from the resource description pool and distributes virtual machines to the platform nodes in batches over the network; and further modules. The invention improves the utilization efficiency and sharing level of hardware resources, and makes the scheduling of deployment and installation tasks more convenient and easier to implement.

Description

Uniform resource management system and method for whole-ship computing environment
Technical Field
The invention relates to a uniform resource management system and a method, in particular to a uniform resource management system and a method for a whole-ship computing environment.
Background
The Total Ship Computing Environment (TSCE) organically integrates, according to mature technologies and standards, the hardware devices and basic service software for computing, storage, transmission, display and control that are shared across the whole ship, forming a single logical large computer for the whole ship. This logical computer serves as a public information infrastructure platform that supports the application tasks of the ship's various businesses. In essence, the TSCE is the unified information infrastructure of the whole ship. It provides basic service support, including information exchange, data storage, high-performance computing and human-computer interaction, for ship business applications such as detection and sensing, command and control, weapon engagement, information assurance, ship navigation control, and security and damage management. The U.S. military considers that the TSCE brings great innovation to the integration mode and organizational operation of naval vessel electronic information system equipment, with advantages in extension and upgrading, survivability, distributed arrangement, unified standards, open integration, flexible reconfiguration and the like.
To address problems such as poor openness, difficult upgrading and high whole-life-cycle cost of information equipment under the conditions of information warfare, the U.S. military proposed the Open Architecture Computing Environment (OACE). Under the design idea of an open system architecture, standard interfaces between the layers shield the upper and lower layers from tight coupling with each other and improve open support for the application layer; software and hardware are separated, so that system capability can be improved through local software upgrades, which effectively introduces competition, eliminates monopoly and reduces cost.
At present, the whole-ship computing environment is at an initial construction stage: applications and infrastructure are vertically and tightly bound together, which creates many resource and information islands. The system cannot yet share hardware and basic supporting software, and most functions, such as digital signal processing, still depend on specific hardware. When that hardware fails or is destroyed, the corresponding function is paralyzed and is difficult to recover in a short time. Heterogeneous hardware is managed independently, and no uniform resource management method is available. As a next step, standard interface specifications need to be defined, the functions of standard hardware modules need to be defined in software, different modules need to serve as disaster-recovery backups for one another, and part of the common functional software needs to be extracted and reused, with support for flexible deployment and rapid upgrading.
The invention is based on the basic computing platform of the whole-ship computing environment. It adopts a unified resource description definition and, through real-time information acquisition, unified view display, unified control management and platform-level integrated resource management technology, realizes functions such as capability definition, system resource configuration and unified software deployment for the computing platform, improving the high availability and capability reconstruction of the whole system.
Disclosure of Invention
The technical problem to be solved by the invention is to provide a uniform resource management system and method for a whole-ship computing environment that improve the utilization efficiency and sharing level of hardware resources and make the scheduling of deployment and installation tasks more convenient and easier to implement.
The invention solves the above technical problem through the following technical scheme: a uniform resource management system for a whole-ship computing environment comprises a uniform resource description module, a node host OS deployment or update module, a virtual machine deployment module, a virtual machine fault migration module, an application deployment and management module and a resource unified management module, wherein:
the uniform resource description module realizes the distribution, query, interpretation and execution of configuration information such as platform nodes and software through the configuration of XML files;
the node host OS deployment or update module parses the platform resource description and deploys or updates the node host OS over the network;
the virtual machine deployment module selects and parses a resource description file from the resource description pool and distributes virtual machines to the platform nodes in batches over the network;
the virtual machine fault migration module, when a computing/storage node service is abnormal, downloads the image to the designated backup computing node according to the virtual machine fault migration processing strategy in the uniform resource description module, and restarts the application server;
the application deployment and management module selects and parses a resource description file from the resource description pool, and distributes application software, management scripts and dependent software to the platform nodes in batches over the network;
the resource unified management module is deployed on the management node; it performs management operations such as starting, shutting down and restarting virtual machines, performs management and control such as installing, uninstalling, starting and stopping application software, and records, stores and manages logs of the overall deployment topology and the deployment process.
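As an illustration of the uniform resource description module, the following minimal sketch parses a small XML description and answers a simple query. The patent does not publish its XML schema, so the element and attribute names (`platform`, `node`, `vm`, `app`, `host-os`) and the helper names `parse_description` and `query_by_role` are assumptions made only for this example.

```python
import xml.etree.ElementTree as ET
from dataclasses import dataclass, field

# Hypothetical resource description; the real schema is not given in the patent.
SAMPLE_DESCRIPTION = """
<platform name="demo-platform">
  <node id="compute-01" role="compute" host-os="hostos-1.0">
    <vm image="vm-base-1.0" count="2"/>
    <app name="track-fusion" version="1.2"/>
  </node>
  <node id="storage-01" role="storage" host-os="hostos-1.0"/>
</platform>
"""


@dataclass
class NodeSpec:
    node_id: str
    role: str
    host_os: str
    vm_images: list = field(default_factory=list)
    apps: list = field(default_factory=list)


def parse_description(xml_text):
    """Interpret the description as a mapping of node id -> NodeSpec."""
    root = ET.fromstring(xml_text)
    nodes = {}
    for elem in root.findall("node"):
        spec = NodeSpec(elem.get("id"), elem.get("role"), elem.get("host-os"))
        for vm in elem.findall("vm"):
            # One list entry per requested virtual machine instance.
            spec.vm_images.extend([vm.get("image")] * int(vm.get("count", "1")))
        for app in elem.findall("app"):
            spec.apps.append((app.get("name"), app.get("version")))
        nodes[spec.node_id] = spec
    return nodes


def query_by_role(nodes, role):
    """Return the sorted ids of all nodes that have the given role."""
    return sorted(n.node_id for n in nodes.values() if n.role == role)
```

Keeping the parsed result in plain data classes lets the other modules (OS deployment, virtual machine deployment, application deployment) read the same description without re-parsing the XML.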
The invention also provides a uniform resource management method for a whole-ship computing environment, which comprises a node host OS deployment or update flow, a virtual machine deployment flow, a virtual machine fault migration flow and an application deployment and management flow.
Preferably, the node host OS deployment or update flow mainly includes the following steps:
step one, configuring the platform resource description;
step two, starting the node and attempting to load a node OS from the network;
step three, starting the node OS and starting the node agent;
step four, sending the host OS;
and step five, solidifying the OS and the file system locally.
Preferably, the virtual machine deployment flow mainly includes the following steps:
step eleven, distributing the software in batches;
step twelve, starting the virtual machine/Docker management tool;
step thirteen, managing the virtual machine/Docker;
and step fourteen, issuing the virtual image.
Preferably, the virtual machine fault migration flow mainly includes the following steps:
step twenty-one, monitoring the load and running states of the computing nodes and the redundant computing nodes;
step twenty-two, when a fault occurs, sending an alarm to the management node;
step twenty-three, the management node extracts the virtual image from the storage node and mounts it on the redundant computing node to supplement the service capability of the original computing node;
step twenty-four, the failed computing node suspends external service;
and step twenty-five, the failed computing node requests the virtual image from the management node again and deploys it on itself, thereby recovering its computing capability.
Preferably, the application deployment and management flow mainly includes the following steps:
step thirty-one, distributing application software, management scripts and dependent software in batches;
step thirty-two, installing or uninstalling the application software;
and step thirty-three, starting or stopping the application software.
The positive progress effects of the invention are as follows: the hardware resource universalization, the calculation and storage resource standardization and the utilization efficiency and the sharing level of the hardware resource are improved; the software service is standardized, the deployment and installation of software and hardware are decoupled through a configuration process, and application and data are separated, so that the deployment and installation task scheduling of the whole ship computing environment basic computing platform is more convenient and easy to implement.
Drawings
Fig. 1 is a schematic framework diagram of the uniform resource management system for a whole-ship computing environment according to the present invention.
Fig. 2 is a flowchart of node host OS deployment/update in the method for uniform resource management for a whole-ship computing environment according to the present invention.
Fig. 3 is a flowchart of deployment of virtual machines in the method for uniform resource management for a whole-ship computing environment according to the present invention.
Fig. 4 is a virtual machine fault migration flowchart in the method for uniform resource management for a whole-ship computing environment according to the present invention.
Fig. 5 is a flowchart of application deployment and management in the method for uniform resource management for a whole-ship computing environment according to the present invention.
Detailed Description
The following provides a detailed description of the preferred embodiments of the present invention with reference to the accompanying drawings.
As shown in fig. 1, the present invention discloses a uniform resource management system for a whole-ship computing environment, which includes a uniform resource description module, a node host OS (operating system) deployment or update module, a virtual machine deployment module, a virtual machine fault migration module, an application deployment and management module, and a resource unified management module, wherein:
the uniform resource description module realizes the distribution, query, interpretation and execution of configuration information such as platform nodes and software through the configuration of XML (extensible markup language) files;
the node host OS deployment or update module parses the platform resource description and deploys or updates the node host OS over the network;
the virtual machine deployment module selects and parses a resource description file from the resource description pool and distributes virtual machines to the platform nodes in batches over the network;
the virtual machine fault migration module, when a computing/storage node service is abnormal, downloads the image to the designated backup computing node according to the virtual machine fault migration processing strategy in the uniform resource description module, and restarts the application server;
the application deployment and management module selects and parses a resource description file from the resource description pool, and distributes application software, management scripts and dependent software to the platform nodes in batches over the network;
the resource unified management module is deployed on the management node; it performs management operations such as starting, shutting down and restarting virtual machines, performs management and control such as installing, uninstalling, starting and stopping application software, and records, stores and manages logs of the overall deployment topology and the deployment process.
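The management operations and log recording performed by the resource unified management module can be sketched as a small state table plus an append-only log. This is only an illustrative model: the class name `ManagementNode`, the state names and the log record fields are assumptions, and no real hypervisor or container API is called.

```python
from dataclasses import dataclass, field
from datetime import datetime, timezone

# Each operation maps to (state required before, state after).
VM_TRANSITIONS = {
    "start": ("stopped", "running"),
    "shutdown": ("running", "stopped"),
    "restart": ("running", "running"),
}


@dataclass
class ManagementNode:
    vm_state: dict = field(default_factory=dict)  # vm name -> "running"/"stopped"
    log: list = field(default_factory=list)       # one record per operation

    def register_vm(self, name):
        """Add a newly deployed virtual machine in the stopped state."""
        self.vm_state[name] = "stopped"
        self._record("register", name, True)

    def operate_vm(self, name, operation):
        """Apply start/shutdown/restart; refuse it if the state does not fit."""
        required, result = VM_TRANSITIONS[operation]  # KeyError if unknown
        ok = self.vm_state.get(name) == required
        if ok:
            self.vm_state[name] = result
        self._record(operation, name, ok)
        return ok

    def _record(self, operation, target, ok):
        """Append a timestamped record, including refused operations."""
        self.log.append({
            "time": datetime.now(timezone.utc).isoformat(),
            "operation": operation,
            "target": target,
            "ok": ok,
        })
```

Logging refused operations as well as successful ones keeps the record of the deployment process complete, which is the purpose the description assigns to the log.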
The invention also relates to a uniform resource management method for a whole-ship computing environment, which comprises a node host OS deployment or update flow, a virtual machine deployment flow, a virtual machine fault migration flow and an application deployment and management flow.
As shown in fig. 2, the node host OS deployment or update flow mainly includes the following steps:
step one, configuring the platform resource description;
step two, starting the node and attempting to load a node OS from the network;
step three, starting the node OS and starting the node agent;
step four, sending the host OS (and the file system);
and step five, solidifying the OS and the file system locally.
As shown in fig. 3, the virtual machine deployment flow mainly includes the following steps:
step eleven, distributing the software in batches;
step twelve, starting the virtual machine/Docker (a container engine) management tool;
step thirteen, managing the virtual machine/Docker;
and step fourteen, issuing the virtual image.
As shown in fig. 4, the virtual machine fault migration flow mainly includes the following steps:
step twenty-one, monitoring the load and running states of the computing nodes and the redundant computing nodes;
step twenty-two, when a fault occurs, sending an alarm to the management node;
step twenty-three, the management node extracts the virtual image from the storage node and mounts it on the redundant computing node to supplement the service capability of the original computing node;
step twenty-four, the failed computing node suspends external service;
and step twenty-five, the failed computing node requests the virtual image from the management node again and deploys it on itself, thereby recovering its computing capability.
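The monitoring and alarm steps of the fault migration flow could be realized in many ways; the patent does not say how a fault is detected. The sketch below assumes a heartbeat scheme in which each node reports a timestamp and a node is treated as failed once it has been silent longer than a timeout. The function names, the timeout value and the alarm record layout are all hypothetical.

```python
def find_failed_nodes(last_heartbeat, now, timeout_s=3.0):
    """Return the nodes whose last heartbeat is older than timeout_s seconds."""
    return sorted(
        node for node, seen in last_heartbeat.items() if now - seen > timeout_s
    )


def build_alarms(last_heartbeat, now, timeout_s=3.0):
    """Create one alarm record per failed node for the management node."""
    return [
        {"node": node, "silent_for_s": now - last_heartbeat[node]}
        for node in find_failed_nodes(last_heartbeat, now, timeout_s)
    ]
```

Passing `now` in explicitly, rather than reading the clock inside the function, keeps the detection logic deterministic and easy to check.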
As shown in fig. 5, the application deployment and management flow mainly includes the following steps:
step thirty-one, distributing application software, management scripts and dependent software in batches;
step thirty-two, installing or uninstalling the application software;
and step thirty-three, starting or stopping the application software.
Example 1:
The management node of the whole-ship computing environment basic computing platform holds a uniform resource description of the software, hardware and communication links of the whole platform. According to actual needs, a resource description list that meets the application requirements is formed through the resource description configuration module. The resource description parsing module in the management node extracts the information relevant to resource configuration from the resource description list. The node OS pool stores the basic operating system required by each node, for use by the network OS deployment module or by the network distribution module built on the locally solidified OS module. At startup, the computing/storage node first attempts to load the node OS from the network; after receiving the request, the management node starts the node OS, starts the node agent, and then sends the host OS (and the file system) to the computing/storage node. After receiving the host OS (and file system) sent by the management node, the computing/storage node solidifies it locally. This completes the node host OS deployment flow.
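The sequence in Example 1 can be modelled as an ordered list of state transitions for one node. The sketch below only simulates the ordering; it performs no network boot or disk writes, and the state names, `DEPLOY_STEPS` and `deploy_host_os` are illustrative assumptions rather than names from the patent.

```python
from enum import Enum


class NodeState(Enum):
    POWERED_OFF = "powered off"
    NODE_OS_LOADED = "node OS loaded from network"
    AGENT_RUNNING = "node agent running"
    HOST_OS_RECEIVED = "host OS received"
    HOST_OS_SOLIDIFIED = "host OS solidified locally"


# Each step is (description, state required before, state after).
DEPLOY_STEPS = [
    ("node starts and loads the node OS over the network",
     NodeState.POWERED_OFF, NodeState.NODE_OS_LOADED),
    ("node OS is started and the node agent is started",
     NodeState.NODE_OS_LOADED, NodeState.AGENT_RUNNING),
    ("management node sends the host OS and file system",
     NodeState.AGENT_RUNNING, NodeState.HOST_OS_RECEIVED),
    ("node solidifies the host OS and file system locally",
     NodeState.HOST_OS_RECEIVED, NodeState.HOST_OS_SOLIDIFIED),
]


def deploy_host_os(node_id, host_os, node_os_pool):
    """Walk one node through the deployment steps; return (state, trace)."""
    if host_os not in node_os_pool:
        raise KeyError(f"host OS {host_os!r} is not in the node OS pool")
    state = NodeState.POWERED_OFF
    trace = []
    for description, required, result in DEPLOY_STEPS:
        if state is not required:
            raise RuntimeError(f"{node_id}: step not allowed in state {state}")
        state = result
        trace.append(f"{node_id}: {description}")
    return state, trace
```

Checking the host OS against the pool before the node is started mirrors step one of the flow, where the platform resource description is configured before any node boots.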
Example 2:
After the node OS installation is completed, virtual machines are deployed on the computing/storage nodes as needed. The resource description parsing module in the management node extracts the information related to virtual machine configuration from the resource description list and sends it to the software distribution module. The software distribution module extracts the virtual machine/Docker management tool and dependent software from the software pool and the virtual image pool of the management node, ready for the computing/storage node to request. After the computing/storage node starts, it requests the required software package from the management node; after receiving the request, the management node starts the virtual machine/Docker management tool, selects a suitable virtual machine image through the virtual machine/Docker management interface, and sends it to the computing/storage node. The computing/storage node receives the virtual machine image sent by the management node and stores it in a file directory on the local node, where it can be invoked by the local software distribution module or the software remote management module. This completes the node virtual machine deployment flow.
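Example 2 says the management node selects a suitable virtual machine image but does not define what makes an image suitable. The sketch below assumes one possible rule: the image must match the requested guest OS and management tool, and the smallest matching image wins. The pool contents, the metadata keys (`guest_os`, `tools`, `size_mb`) and the function names are hypothetical.

```python
def select_image(image_pool, requirement):
    """Pick the smallest image whose guest OS and tool match the requirement."""
    candidates = [
        (meta["size_mb"], name)
        for name, meta in image_pool.items()
        if meta["guest_os"] == requirement["guest_os"]
        and requirement["tool"] in meta["tools"]
    ]
    return min(candidates)[1] if candidates else None


def deliver_image(image_pool, requirement, node_directories, node_id):
    """Select an image and record it in the node's local file directory."""
    name = select_image(image_pool, requirement)
    if name is not None:
        node_directories.setdefault(node_id, []).append(name)
    return name


# Hypothetical virtual image pool held by the management node.
IMAGE_POOL = {
    "vm-full-2.0": {"guest_os": "linux", "tools": {"docker", "kvm"}, "size_mb": 4096},
    "vm-slim-2.0": {"guest_os": "linux", "tools": {"docker"}, "size_mb": 512},
    "vm-rtos-1.0": {"guest_os": "rtos", "tools": set(), "size_mb": 64},
}
```

Here `node_directories` stands in for the file directory on each computing/storage node; a real system would transfer the image bytes instead of recording a name.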
Example 3:
The management node of the whole-ship computing environment basic computing platform monitors the load and running states of the computing nodes and the redundant computing nodes in real time. When a computing node fails, an alarm is sent to the management node, and the management node extracts the virtual image from the storage node and mounts it on the redundant computing node to supplement the service capability of the original computing node. Meanwhile, the failed computing node suspends external service, requests the virtual image from the management node again, and deploys it on itself, thereby recovering its computing capability.
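The migration and recovery actions in Example 3 can be sketched with two in-memory node objects. This is a simplified model under stated assumptions: the `ComputeNode` class, the `fail_over` and `recover` functions and the dictionary standing in for the storage node are all illustrative, and "mounting" an image is reduced to recording its name.

```python
from dataclasses import dataclass, field


@dataclass
class ComputeNode:
    name: str
    serving: bool = False
    images: list = field(default_factory=list)


def fail_over(failed, redundant, storage_images, image_name):
    """Mount the image on the redundant node, then suspend the failed node."""
    if image_name not in storage_images:
        raise KeyError(f"image {image_name!r} is not on the storage node")
    redundant.images.append(image_name)
    redundant.serving = True
    failed.serving = False
    failed.images.clear()
    return [
        f"mounted {image_name} on {redundant.name}",
        f"suspended external service on {failed.name}",
    ]


def recover(failed, storage_images, image_name):
    """Redeploy the image on the previously failed node and resume service."""
    if image_name not in storage_images:
        raise KeyError(f"image {image_name!r} is not on the storage node")
    failed.images.append(image_name)
    failed.serving = True
    return f"redeployed {image_name} on {failed.name}"
```

Splitting takeover and recovery into two functions reflects that the redundant node restores service first, while the failed node may rejoin later or not at all.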
Example 4:
The resource description parsing module in the management node extracts the configuration information of the application list from the resource description list and sends it to the software distribution module. The software distribution module extracts the corresponding software from the software pool and sends it to the network transmission module. The computing/storage node requests the required application software from the management node; after receiving the request, the management node distributes the application software, management scripts and dependent software to the computing/storage nodes in batches through the software distribution module. The management node starts or stops the application software over the network.
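Batch distribution as described in Example 4 amounts to sending the same bundle to many nodes and then controlling the application remotely. The sketch below uses a thread pool for the fan-out; the transport is passed in as a `send` callable because the patent does not specify one, and the names `distribute` and `control` are assumptions for illustration.

```python
from concurrent.futures import ThreadPoolExecutor


def distribute(bundle, nodes, send, max_workers=4):
    """Send the same bundle to every node in parallel; return node -> result."""
    nodes = list(nodes)
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # pool.map preserves input order, so results line up with nodes.
        results = list(pool.map(lambda node: send(node, bundle), nodes))
    return dict(zip(nodes, results))


def control(app_states, node, action):
    """Apply "start" or "stop" to the application on one node."""
    if action not in ("start", "stop"):
        raise ValueError(f"unsupported action: {action!r}")
    app_states[node] = "running" if action == "start" else "stopped"
    return app_states[node]
```

A caller would supply a `send` function that copies the application software, management scripts and dependent software to one node and returns whether the transfer succeeded.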
The invention realizes unified resource description definition, visual configuration and description template management for heterogeneous hardware, together with unified view management and integrated resource management; realizes software distribution and system capability reconstruction based on the uniform resource description, as well as virtual machine management, resource allocation and reclamation, and fault migration; realizes global display of the platform capability view and the software deployment topology; and provides recording, storage and management of management process logs.
The above embodiments further describe in detail the technical problems solved by the present invention, its technical solutions and its advantages. It should be understood that the above embodiments are only examples of the present invention and are not intended to limit it; any modifications, equivalent substitutions, improvements and the like made within the spirit and principle of the present invention shall fall within its protection scope.

Claims (2)

1. A uniform resource management system for a whole-ship computing environment, characterized by comprising a uniform resource description module, a node host OS deployment or update module, a virtual machine deployment module, a virtual machine fault migration module, an application deployment and management module and a resource unified management module, wherein:
the uniform resource description module realizes the distribution, query, interpretation and execution of platform node and software configuration information through the configuration of XML files;
the node host OS deployment or update module parses the platform resource description and deploys or updates the node host OS over the network;
the virtual machine deployment module selects and parses a resource description file from the resource description pool and distributes virtual machines to the platform nodes in batches over the network;
the virtual machine fault migration module, when a computing/storage node service is abnormal, downloads the image to the designated backup computing node according to the virtual machine fault migration processing strategy in the uniform resource description module, and restarts the application server;
the application deployment and management module selects and parses a resource description file from the resource description pool, and distributes application software, management scripts and dependent software to the platform nodes in batches over the network;
and the resource unified management module is deployed on the management node, performs starting, stopping and restarting management operations on virtual machines, performs installation, uninstallation, starting and stopping control on application software, and records, stores and manages logs of the overall deployment topology and the deployment process.
2. A uniform resource management method for a whole-ship computing environment, characterized by comprising a node host OS deployment or update flow, a virtual machine deployment flow, a virtual machine fault migration flow and an application deployment and management flow;
the node host OS deployment or update flow mainly comprises the following steps:
step one, configuring the platform resource description;
step two, starting the node and attempting to load a node OS from the network;
step three, starting the node OS and starting the node agent;
step four, sending the host OS;
step five, solidifying the OS and the file system locally;
the application deployment and management flow mainly comprises the following steps:
step thirty-one, distributing application software, management scripts and dependent software in batches;
step thirty-two, installing or uninstalling the application software;
step thirty-three, starting or stopping the application software;
the virtual machine deployment flow mainly comprises the following steps:
step eleven, distributing the software in batches;
step twelve, starting the virtual machine/Docker management tool;
step thirteen, managing the virtual machine/Docker;
step fourteen, issuing the virtual image;
the virtual machine fault migration flow mainly comprises the following steps:
step twenty-one, monitoring the load and running states of the computing nodes and the redundant computing nodes;
step twenty-two, when a fault occurs, sending an alarm to the management node;
step twenty-three, the management node extracts the virtual image from the storage node and mounts it on the redundant computing node to supplement the service capability of the original computing node;
step twenty-four, the failed computing node suspends external service;
and step twenty-five, the failed computing node requests the virtual image from the management node again and deploys it on itself, thereby recovering its computing capability.
CN201710035518.2A 2016-12-06 2017-01-18 Uniform resource management system and method for whole-ship computing environment Active CN106850598B (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN2016111118453 2016-12-06
CN201611111845 2016-12-06

Publications (2)

Publication Number Publication Date
CN106850598A CN106850598A (en) 2017-06-13
CN106850598B CN106850598B (en) 2020-06-12

Family

ID=59124382

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710035518.2A Active CN106850598B (en) 2016-12-06 2017-01-18 Uniform resource management system and method for whole-ship computing environment

Country Status (1)

Country Link
CN (1) CN106850598B (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112130955A (en) * 2020-09-02 2020-12-25 中国电子科技网络信息安全有限公司 Virtual machine scheduling method based on converged framework cloud platform
CN112491599B (en) * 2020-11-13 2022-05-24 新华三大数据技术有限公司 Method and device for deploying network service object
CN112527325A (en) * 2020-11-23 2021-03-19 山东乾云启创信息科技股份有限公司 Deployment method and system applied to super-fusion architecture

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7792797B2 (en) * 2002-12-24 2010-09-07 International Business Machines Corporation Fail over resource manager access in a content management system
CN102611723A (en) * 2011-01-25 2012-07-25 赵天海 Method for building high-performance computing application service based on virtualization technology
CN102317910A (en) * 2011-08-03 2012-01-11 华为技术有限公司 Methods, devices and system for virtual data backup and reintegration
CN102739770A (en) * 2012-04-18 2012-10-17 上海和辰信息技术有限公司 Method and system for scheduling resource based on cloud computing

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
全舰计算环境系统结构和系统集成框架;董晓明等;《中国舰船研究》;20140228;第8-13页 *

Also Published As

Publication number Publication date
CN106850598A (en) 2017-06-13

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant