CN113626044B - Service management method and device - Google Patents

Service management method and device Download PDF

Info

Publication number
CN113626044B
CN113626044B CN202110930034.0A CN202110930034A CN113626044B CN 113626044 B CN113626044 B CN 113626044B CN 202110930034 A CN202110930034 A CN 202110930034A CN 113626044 B CN113626044 B CN 113626044B
Authority
CN
China
Prior art keywords
service
container
determining
file
package
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202110930034.0A
Other languages
Chinese (zh)
Other versions
CN113626044A (en
Inventor
崔东晓
朱仲毅
李秀婧
翟文倩
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Industrial and Commercial Bank of China Ltd ICBC
Original Assignee
Industrial and Commercial Bank of China Ltd ICBC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Industrial and Commercial Bank of China Ltd ICBC filed Critical Industrial and Commercial Bank of China Ltd ICBC
Priority to CN202110930034.0A priority Critical patent/CN113626044B/en
Publication of CN113626044A publication Critical patent/CN113626044A/en
Application granted granted Critical
Publication of CN113626044B publication Critical patent/CN113626044B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F8/00Arrangements for software engineering
    • G06F8/60Software deployment
    • G06F8/61Installation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/36Preventing errors by testing or debugging software
    • G06F11/3668Software testing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F8/00Arrangements for software engineering
    • G06F8/40Transformation of program code
    • G06F8/41Compilation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/44Arrangements for executing specific programs
    • G06F9/455Emulation; Interpretation; Software simulation, e.g. virtualisation or emulation of application or operating system execution engines
    • G06F9/45533Hypervisors; Virtual machine monitors
    • G06F9/45558Hypervisor-specific management and integration aspects
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q40/00Finance; Insurance; Tax strategies; Processing of corporate or income taxes
    • G06Q40/02Banking, e.g. interest calculation or account maintenance
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q40/00Finance; Insurance; Tax strategies; Processing of corporate or income taxes
    • G06Q40/04Trading; Exchange, e.g. stocks, commodities, derivatives or currency exchange
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/44Arrangements for executing specific programs
    • G06F9/455Emulation; Interpretation; Software simulation, e.g. virtualisation or emulation of application or operating system execution engines
    • G06F9/45533Hypervisors; Virtual machine monitors
    • G06F9/45558Hypervisor-specific management and integration aspects
    • G06F2009/45562Creating, deleting, cloning virtual machine instances

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Software Systems (AREA)
  • Accounting & Taxation (AREA)
  • Finance (AREA)
  • Economics (AREA)
  • Development Economics (AREA)
  • Marketing (AREA)
  • Strategic Management (AREA)
  • Technology Law (AREA)
  • General Business, Economics & Management (AREA)
  • Quality & Reliability (AREA)
  • Computer Hardware Design (AREA)
  • Debugging And Monitoring (AREA)

Abstract

The invention provides a service management method and device, belongs to the technical field of cloud computing, and can be applied to the financial field or other fields. The service management method comprises the following steps: receiving service configuration information, compiling and packaging the service configuration information to generate a program package service file; deploying the package service file into the container so that the container runs the package service file; acquiring service operation data of the container, and acquiring service process operation information according to the service operation data; and determining an exception handling scheme according to the service process running information so as to process the corresponding service. The invention can reduce the burden of operation and maintenance personnel, lower the labor cost and the time cost, and improve the service availability and the recovery time.

Description

Service management method and device
Technical Field
The invention relates to the technical field of cloud computing, in particular to a service management method and device.
Background
Currently, large commercial banks are largely in the phase of architecture transformation from traditional technical architecture into distributed architecture. In the transformation process, the testing work of the distributed version gradually becomes an important point of the testing work. The prior art only can support simple operation and maintenance deployment work, only solves the basic service deployment problem, often needs operation and maintenance personnel to conduct manual analysis and problem investigation processing aiming at various abnormal problems in deployment, has huge labor and time cost, influences system availability and recovery time, and brings great hindrance to test work.
Disclosure of Invention
The embodiment of the invention mainly aims to provide a service management method and device, so as to reduce the load of operation and maintenance personnel, reduce the labor cost and the time cost, and improve the service usability and the recovery time.
In order to achieve the above object, an embodiment of the present invention provides a service management method, including:
Receiving service configuration information, compiling and packaging the service configuration information to generate a program package service file;
Deploying the package service file into the container so that the container runs the package service file;
Acquiring service operation data of the container, and acquiring service process operation information according to the service operation data;
and determining an exception handling scheme according to the service process running information so as to process the corresponding service.
The embodiment of the invention also provides a service management device, which comprises:
the compiling module is used for receiving the service configuration information, compiling and packaging the service configuration information to generate a program package service file;
The deployment module is used for deploying the program package service files into the container so that the container runs the program package service files;
the acquisition module is used for acquiring service operation data of the container and acquiring service process operation information according to the service operation data;
and the scheme determining module is used for determining an exception handling scheme according to the service process operation information so as to process the corresponding service.
The embodiment of the invention also provides computer equipment, which comprises a memory, a processor and a computer program stored on the memory and running on the processor, wherein the processor realizes the steps of the service management method when executing the computer program.
The embodiment of the invention also provides a computer readable storage medium, on which a computer program is stored, which when being executed by a processor, implements the steps of the service management method.
The service management method and the device of the embodiment of the invention firstly compile and package the service configuration information to generate the program package service file so as to deploy the program package service file into the container, then acquire the service operation data of the container to acquire the service process operation information, and finally determine the exception handling scheme according to the service process operation information so as to process the corresponding service, thereby reducing the burden of operation and maintenance personnel, lowering the labor cost and the time cost, and improving the service availability and the recovery time.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings that are needed in the description of the embodiments will be briefly described below, it being obvious that the drawings in the following description are only some embodiments of the present invention, and that other drawings may be obtained according to these drawings without inventive effort for a person skilled in the art.
FIG. 1 is a flow chart of a method of service management in an embodiment of the invention;
FIG. 2 is a flow chart of obtaining service process operation information in an embodiment of the invention;
FIG. 3 is a block diagram showing the construction of a service management apparatus in an embodiment of the present invention;
Fig. 4 is a block diagram of a computer device in an embodiment of the invention.
Detailed Description
The following description of the embodiments of the present invention will be made clearly and completely with reference to the accompanying drawings, in which it is apparent that the embodiments described are only some embodiments of the present invention, but not all embodiments. All other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to be within the scope of the invention.
Those skilled in the art will appreciate that embodiments of the invention may be implemented as a system, apparatus, device, method, or computer program product. Accordingly, the present disclosure may be embodied in the following forms, namely: complete hardware, complete software (including firmware, resident software, micro-code, etc.), or a combination of hardware and software.
In view of the fact that the prior art can only support simple operation and maintenance deployment work, labor and time costs are huge, service availability and recovery time are affected, the embodiment of the invention provides the service management method and device, customized services can be conveniently, flexibly and rapidly deployed between different version installation environments under the condition of complex cloud primary test environments, the problem that service preparation consumes long time is effectively solved, customized service deployment work can be completed through simple configuration of a custom service template, background tasks are automatically packaged and deployed, and service online is completed. Meanwhile, an abnormal scene customizing interface is required to be opened aiming at the special test of the technical test under the test environment, so that the abnormal scene containing the current test content can be rapidly simulated, and the system support optimization can be rapidly completed for the newly added abnormal scene. In addition, the invention can synchronously support high concurrent service requirements, and realize dynamic scheduling and easy deployment through flexible dynamic transverse capacity expansion. The abnormal problems in the deployment process and the operation process can be automatically identified, the abnormal types can be processed according to a preset scheme, the burden of operation and maintenance personnel is reduced, and the availability and recovery time of the system are improved. The present invention will be described in detail with reference to the accompanying drawings.
Fig. 1 is a flowchart of a service management method in an embodiment of the present invention. As shown in fig. 1, the service management method includes:
s101: and receiving the service configuration information, compiling and packaging the service configuration information to generate a program package service file.
In the implementation, the configuration information of the service can be input by using a configuration management module in a file or webpage configuration mode, and the service configuration information comprises a service name, a service version, a service method, a parameter entering type, a parameter returning type, an operation result, concurrency limit, timeout limit, a serialization mode, a registry and the like. After the user submits the configuration information, the system checks the necessary items one by one to confirm that the format of the input content is correct, and meanwhile, judges whether the content of the selected item accords with the selected item logic, and generates a service configuration information description information section through the customized language module. The configuration analysis module obtains the service configuration information description information from the configuration management module in the customized deployment configuration module and converts the service configuration information description information to obtain converted service configuration information, and the converted service configuration information is sent to the software package packaging module for program packaging. The software package packaging module generates a program source code file and a corresponding dependency file according to the converted service configuration information, compiles and packages the program source code file and the corresponding dependency file to generate a program package service file, and distributes and uploads the program source code file and the corresponding dependency file by the container distribution module after packaging.
S102: the package service file is deployed into the container such that the container runs the package service file.
In one embodiment, deploying a package service file into a container includes:
1. When the simulated abnormal scene exists, an exclusive container is established, and the package service file is deployed into the exclusive container. Establishing an exclusive container may avoid simulation anomalies from affecting other services.
How to test the abnormal scene and the high concurrency scene of the service is always a difficult point, and how to quickly and accurately simulate various abnormal scenes is a difficult problem in the process of testing the abnormal scenes; in addition, when high concurrency test is performed on the service, due to the fact that a large number of upstream and downstream related applications are involved, service bottlenecks cannot be accurately identified under the condition that the performance of the upstream and downstream applications cannot be guaranteed, and the test progress and the test quality are greatly affected. Aiming at special requirements of technical test, such as test scenes of coverage work of various abnormal scenes, verification work of partial failure of a system and the like, the prior art needs to modify bottom codes of services by service developers, has higher professional requirements, has long modification period and circulation link, and seriously affects the test period.
In order to solve the problems, the invention inputs configuration information of an abnormal service through an abnormal configuration module to simulate an abnormal scene. The simulated anomaly types mainly include:
a) Service unavailable exception: the service is set to be unavailable periodically, simulating a sudden service unavailability situation. The main parameters include service name, unavailable time, recovery time, etc.
B) Service flow restriction anomaly: triggering service throttling, including TPS (system throughput) throttling, concurrency limiting, etc. The main parameters include service name, current limit type, current limit value, etc.
C) Service timeout exception: triggering a service response time timeout. The main parameters include service name and timeout time, etc.
D) Service restart exception: the service frequently restarts during service invocation. The main parameters include service name and restart interval.
E) Error reporting of service abnormality: the service returns to the communication area to report errors. The main parameters include service name and returned content, etc.
F) Service variable configuration anomalies: the service variables are incorrectly configured. The main parameters include service name, variable value, etc.
The invention simulates an abnormal scene through the abnormal execution module, and the specific execution strategy is as follows:
a) Service unavailable exception: and according to the service name, the service is periodically offline according to the unavailable time and the recovery time.
B) Service flow restriction anomaly: and thermally loading the service name, the current limiting type, the current limiting value and other current limiting parameters into the service.
C) Service timeout exception: and thermally loading timeout parameters such as service names, timeout time and the like into the service.
D) Service restart exception: and periodically restarting the service according to the service name and the restarting interval.
E) Error reporting of service abnormality: and changing the service return value into the content needing to be reported by mistake according to the service name and the returned content.
F) Service variable configuration anomalies: and modifying the service configuration variable according to the service name, the variable name and the variable value, and restarting to take effect.
2. And when the simulated abnormal scene does not exist, determining comprehensive load data of each container, and deploying the package service file into the container corresponding to the minimum value of each comprehensive load data.
In practice, the comprehensive load data of the container can be determined by the following ways:
Monitoring data (CPU data, memory data, thread pool data, connection pool data, disk IO data and the like) per minute are collected, and load conditions of 5-minute granularity, 15-minute granularity, 30-minute granularity, 60-minute granularity, 3-hour granularity, 6-hour granularity, 12-hour granularity, 24-hour granularity, 3-day granularity and 7-day granularity are calculated, wherein the load is a percentage, and the maximum is 100%. The integrated load data may be determined by the following formula:
Wherein Z is comprehensive load data of the container, w 1 is a historical load weight coefficient, w 2 resource overhead weight coefficient, n 1 is a historical weight node number, n 2 is a resource overhead dimension (dimension of current monitoring data), x i is an ith historical weight node weight, y i is an ith historical weight node load value (granularity), a j is a resource overhead dimension weight, and b j is a resource overhead value (current monitoring data). For example, the vessel is run for 12 hours, at which time the system can obtain a load of 5 minutes, 15 minutes, 30 minutes, 60 minutes, 3 hours, 6 hours, and 12 hours of granularity, with a historical weight node number of 7.
The invention can automatically expand or contract the container. Calculating comprehensive load data of each container per minute when the expansion is carried out, triggering automatic expansion when the comprehensive load data of the containers exceeds 70% in 5 minutes continuously, and adding a copy identical to the containers; two copies of the same container are added when the integrated load data of the container exceeds 80% for 5 minutes continuously; four copies of the same container were added when the integrated load data for that container was loaded for more than 90% for 5 consecutive minutes.
When the capacity shrinkage is carried out, the average value of the comprehensive load data in approximately 24 hours can be counted at 6 hours in the morning, and when the average value of the comprehensive load data is less than 30 percent and the time of the comprehensive load data which is more than 50 percent is less than 5 minutes, the number of the container copies is halved; when the average value of the integrated load data is less than 10% and the time of the integrated load data more than 30% is less than 5 minutes, the number of container copies is reduced to one.
S103: service operation data of the container are collected, and service process operation information is obtained according to the service operation data.
FIG. 2 is a flow chart of acquiring service process operation information in an embodiment of the invention. As shown in fig. 2, acquiring service process operation information according to service operation data includes:
S201: service availability and response time volatility are determined from service operational data.
In specific implementation, the service operation data can be monitored through the service monitoring module. The service operation data comprises service available time, service starting time, service success times, service failure times, proportion service time consumption, service average time consumption and service time consumption median.
In one embodiment, determining service availability from service operation data includes: and determining the service availability according to the service availability time, the service starting time, the service success times and the service failure times.
In particular implementations, the service availability may be determined by the following formula:
Wherein p is service availability, t 1 is service availability time, t 2 is service start time, m 1 is service success number, and m 2 is service failure number.
In one embodiment, determining the response time volatility from the service operational data includes: the response time fluctuation rate is determined based on the proportional service time consumption, the service average time consumption, and the service time consumption median.
In practice, the response time fluctuation ratio can be determined by the following formula:
Where q is the response time fluctuation rate, t 3 is the proportional service time consumption, t 4 is the average service time consumption, t 5 is the median of the service time consumption, and the proportional service time consumption can be 90% of the service time consumption.
S202: and acquiring service process operation information according to the service availability and the response time fluctuation rate.
In specific implementation, the judgment can be performed by the problem judgment module. And when the service availability is lower than the service availability threshold or the response time fluctuation rate is higher than the fluctuation rate threshold, acquiring service process operation information through the problem judging module. Wherein the service availability threshold may be 90% and the volatility threshold may be 200%.
S104: and determining an exception handling scheme according to the service process running information so as to process the corresponding service.
In one embodiment, S104 includes:
1. And when the service process running information is that the service does not survive, restarting the corresponding service.
In one embodiment, when the service is restarted, determining an exception handling scheme corresponding to the system resource data in a preset scheme library to process a container where the corresponding service is located.
When the service is restarted for three times, collecting service starting logs, judging whether resource type errors exist according to keywords of system resource data in the service starting logs, wherein the keywords comprise elements such as a CPU, a memory, a storage and a network. If the resource class errors exist, the resource class errors are classified into resource problems (container problems), the resource problems are searched in a scheme library, the containers where the corresponding services are located are processed by using the first five schemes with the highest success rate, and the success rate of the corresponding schemes is increased. If the operation and maintenance personnel still cannot be started, informing the operation and maintenance personnel through mails, and reducing the success rate of each scheme in the scheme library. If the restarting is successful, the processing method is updated and added into the scheme library.
2. And when the service process operation information is service survival, determining an abnormal processing scheme corresponding to the returned communication area data in a preset scheme library so as to process the corresponding service.
In practice, transaction success rates of approximately 10 minutes may be counted. If the success rate of the transaction is lower than 90%, the data of the return communication area is counted, the solutions are matched in the solution library according to the data of the return communication area, and the first five solutions with the highest success rate are used for processing the corresponding service. If the transaction success rate is still lower than 90% within 10 minutes after processing, notifying operation and maintenance personnel through mail; if the success rate of the transaction is recovered, updating the processing method and adding the updated processing method into the scheme library.
The execution subject of the service management method shown in fig. 1 may be a computer. As can be seen from the flow shown in fig. 1, the service management method according to the embodiment of the present invention firstly compiles and packages the service configuration information to generate a package service file, so as to deploy the package service file into the container, then collects service operation data of the container to obtain service process operation information, and finally determines an exception handling scheme according to the service process operation information to process the corresponding service, thereby reducing the burden of operation and maintenance personnel, reducing labor cost and time cost, and improving service availability and recovery time.
The specific flow of the embodiment of the invention is as follows:
1. and receiving the service configuration information, compiling and packaging the service configuration information to generate a program package service file.
2. When the simulated abnormal scene exists, an exclusive container is established, and the package service file is deployed into the exclusive container, so that the container runs the package service file.
3. And when the simulated abnormal scene does not exist, determining comprehensive load data of each container, and deploying the package service file into the container corresponding to the minimum value of each comprehensive load data so that the container runs the package service file.
4. Service operation data of the container are collected, and service availability and response time fluctuation rate are determined according to the service operation data.
5. And acquiring service process operation information according to the service availability and the response time fluctuation rate.
6. And when the service process running information is that the service does not survive, restarting the corresponding service. When the service is restarted, determining an abnormal processing scheme corresponding to the system resource data in a preset scheme library so as to process a container where the corresponding service is located.
7. And when the service process operation information is service survival, determining an abnormal processing scheme corresponding to the returned communication area data in a preset scheme library so as to process the corresponding service.
In summary, the service management method provided by the embodiment of the invention has the following beneficial effects:
(1) Aiming at the current situation that various abnormal scenes of the cloud primary system are difficult to test, an effective, convenient and flexible scheme for rapidly deploying customized services is provided;
(2) The method overcomes the defects of the existing cloud primary system performance testing method. Aiming at special requirements of technical test, such as test scenes of coverage work of various abnormal scenes, verification work of partial failure of a system and the like, the service developer is not required to modify the bottom code of the service, so that the deployment test of the abnormal service can be conveniently carried out, the test cost is reduced, the test period is shortened, the test quality is improved, the test requirements of various bank cloud primary systems are rapidly completed, and the test progress is ensured.
Based on the same inventive concept, the embodiment of the invention also provides a service management device, and because the principle of the device for solving the problem is similar to that of the service management method, the implementation of the device can be referred to the implementation of the method, and the repetition is omitted.
Fig. 3 is a block diagram of a service management apparatus in an embodiment of the present invention. As shown in fig. 3, the service management apparatus includes:
the compiling module is used for receiving the service configuration information, compiling and packaging the service configuration information to generate a program package service file;
The deployment module is used for deploying the program package service files into the container so that the container runs the program package service files;
the acquisition module is used for acquiring service operation data of the container and acquiring service process operation information according to the service operation data;
and the scheme determining module is used for determining an exception handling scheme according to the service process operation information so as to process the corresponding service.
In practical application, the service management device comprises seven modules, namely a container dependence module, a customized deployment configuration module, a background automatic deployment module, a visual service management module, an asset management module, an abnormal scene module, a problem studying and judging module and the like. The container dependence module is mainly responsible for providing a container environment required by service deployment and can automatically stretch and retract according to the requirement; the customized deployment configuration module mainly comprises a set of concise service customized interpretation language, and is used for acquiring service customized configuration information of a user through a configuration interface or a configuration file; the background automatic deployment module is mainly responsible for generating a software package file capable of being deployed directly after the configuration information acquired in the customized deployment configuration module is subjected to machine language conversion, and automatically distributing an uploading container for operation; the visual service management module is mainly responsible for controlling and managing the customized service and performing visual interaction of other modules; the asset management module is mainly responsible for carrying out persistent storage on customized services, and is convenient for multiplexing and asset management. The abnormal scene module mainly comprises an abnormal scene parameter customization rule; the abnormal scene simulation module is used for receiving an abnormal scene which is submitted by a user and is required to be controlled by the control service to generate an abnormal scene. The problem judging module is mainly used for judging and analyzing the reported service operation condition, determining whether a problem exists or not, attempting to process the problem, and sending alarm information to operation and maintenance personnel under the condition that the problem cannot be solved.
In particular embodiments, the container dependency module includes an environmental support module, a mirror management module, and a container monitoring module. After the container dependence module is installed, the environment supporting module can provide software and hardware environments required by running the container, the container on the server is controlled to start and stop through the mirror image management module, and the container monitoring module can acquire container resource conditions.
The environment supporting module is based on the bottom layer server and is used for providing software and hardware environments required by running the container.
The mirror image management module is responsible for managing containers required by running service, and can automatically calculate comprehensive load according to the monitoring data collected by the container monitoring module, and automatically perform operations such as expansion and contraction.
The container monitoring module is used for monitoring the overall operation condition of the container dependent module and providing the result to the mirror image management module and the visual service management module.
The customized deployment configuration module comprises a customized language module and a configuration management module.
The customized language module is used for designing and completing a set of natural language, and can complete various information required by service configuration through simple description information, including basic parameter configuration and additional parameter configuration of the service.
The configuration management module is used for providing an interface type configuration mode according to the structure of the customized language, interacting with a user through the visual service management module, providing a configuration mode based on a configuration file, and supporting the conversion of the two configuration modes.
The background automatic deployment module comprises a configuration analysis module, a software package packaging module and a container allocation module.
The configuration analysis module acquires configuration information from the customized deployment configuration module, analyzes the configuration information and converts the configuration information, wherein the configuration information comprises a series of contents such as a service name, a service version, a service method, a parameter entering type, a parameter returning type, an operation result, concurrency limit, overtime limit, a serialization mode, a registry and the like, and the contents are transmitted to the packing module for program packing.
The software package packaging module comprises a compiling module, a container distribution module and a configuration analysis module, wherein the compiling module is used for receiving a packaging request from the configuration analysis module, generating a program source code file and a corresponding dependency file, compiling and packaging the program source code file and the corresponding dependency file to generate a program package service file, and distributing and uploading the program source code file and the corresponding dependency file after packaging.
The container distribution module comprises a deployment module for receiving the program package file generated by the software package packing module, acquiring the current system running condition through the container monitoring module, firstly judging an abnormal scene to be simulated by the container, and if no simulation abnormality is needed, preferentially selecting the container with lower load for uploading operation. If the container needs to simulate an abnormal scene, a new exclusive container is newly built for deployment, and other services are prevented from being influenced in the abnormal simulation process.
The visual service management module comprises a service management module, a visual interaction module and a service monitoring module.
The service management module is used for managing the deployed service, and comprises operations such as service stopping, starting, restarting, expansion and the like.
The visual interaction module is responsible for the visual operation of the whole system and interacting with users.
The service monitoring module comprises an acquisition module and a persistence processing module, wherein the acquisition module is responsible for monitoring deployed services and mainly comprises service available time, service starting time, service success times, service failure times, proportion service time consumption, average service time consumption, median of service time consumption, service health state and the like, alarms the service with abnormal state, and persistence processing is carried out on data collected by monitoring.
The asset management module includes a data storage module and a storage management module.
The data storage module is used for storing the data reported by other modules.
The storage management module is used for managing stored data, including automatic archiving, automatic cleaning, storage space use optimization and the like.
The abnormal scene module comprises an abnormal configuration module and an abnormal execution module.
The abnormal configuration module is used for providing abnormal type selection for the user and receiving configuration information of abnormal services.
The abnormal execution module is used for simulating abnormal scenes according to different types and abnormal configurations.
The problem judging module comprises a scheme determining module, and in practical application, the problem judging module, the problem processing module and the intelligent learning module are included.
The problem judging module is used for carrying out research, judgment and analysis on the reported service operation condition and determining whether a problem exists or not.
The problem processing module is used for automatically processing the judged problems.
The intelligent learning module is used for analyzing the effective treatment, extracting problem phenomena and solving measures, and adding a problem treatment library (scheme library).
In summary, the service management device of the embodiment of the invention firstly compiles and packages the service configuration information to generate the package service file so as to deploy the package service file into the container, then collects the service operation data of the container to acquire the service process operation information, and finally determines the exception handling scheme according to the service process operation information so as to process the corresponding service, thereby reducing the burden of operation and maintenance personnel, lowering the labor cost and the time cost, and improving the service availability and the recovery time.
The embodiment of the invention also provides a concrete implementation mode of the computer equipment capable of realizing all the steps in the service management method in the embodiment. Fig. 4 is a block diagram of a computer device according to an embodiment of the present invention, and referring to fig. 4, the computer device specifically includes:
A processor (processor) 401 and a memory (memory) 402.
The processor 401 is configured to invoke a computer program in the memory 402, where the processor executes the computer program to implement all the steps in the service management method in the above embodiment, for example, the processor executes the computer program to implement the following steps:
Receiving service configuration information, compiling and packaging the service configuration information to generate a program package service file;
Deploying the package service file into the container so that the container runs the package service file;
Acquiring service operation data of the container, and acquiring service process operation information according to the service operation data;
and determining an exception handling scheme according to the service process running information so as to process the corresponding service.
In summary, the computer device in the embodiment of the invention firstly compiles and packages the service configuration information to generate the package service file so as to deploy the package service file into the container, then collects the service operation data of the container to obtain the service process operation information, and finally determines the exception handling scheme according to the service process operation information so as to process the corresponding service, thereby reducing the burden of operation and maintenance personnel, lowering the labor cost and the time cost, and improving the service availability and the recovery time.
The present invention also provides a computer-readable storage medium capable of implementing all the steps of the service management method in the above embodiment, the computer-readable storage medium storing thereon a computer program which, when executed by a processor, implements all the steps of the service management method in the above embodiment, for example, the processor implements the following steps when executing the computer program:
Receiving service configuration information, compiling and packaging the service configuration information to generate a program package service file;
Deploying the package service file into the container so that the container runs the package service file;
Acquiring service operation data of the container, and acquiring service process operation information according to the service operation data;
and determining an exception handling scheme according to the service process running information so as to process the corresponding service.
In summary, the computer readable storage medium of the embodiment of the invention firstly compiles and packages the service configuration information to generate the package service file so as to deploy the package service file into the container, then collects the service operation data of the container to acquire the service process operation information, and finally determines the exception handling scheme according to the service process operation information to process the corresponding service, thereby reducing the burden of operation and maintenance personnel, lowering the labor cost and the time cost, and improving the service availability and the recovery time.
The foregoing description of the embodiments has been provided for the purpose of illustrating the general principles of the invention, and is not meant to limit the scope of the invention, but to limit the invention to the particular embodiments, and any modifications, equivalents, improvements, etc. that fall within the spirit and principles of the invention are intended to be included within the scope of the invention.
Those of skill in the art will further appreciate that the various illustrative logical blocks (illustrative logical block), units, and steps described in connection with the embodiments of the invention may be implemented by electronic hardware, computer software, or combinations of both. To clearly illustrate this interchangeability of hardware and software (interchangeability), various illustrative components described above (illustrative components), elements, and steps have been described above generally in terms of their functionality. Whether such functionality is implemented as hardware or software depends upon the particular application and design requirements of the overall system. Those skilled in the art may implement the described functionality in varying ways for each particular application, but such implementation is not to be understood as beyond the scope of the embodiments of the present invention.
The various illustrative logical blocks, or units, or devices described in the embodiments of the invention may be implemented or performed with a general purpose processor, a digital signal processor, an Application Specific Integrated Circuit (ASIC), a field programmable gate array or other programmable logic device, discrete gate or transistor logic, discrete hardware components, or any combination thereof designed to perform the functions described. A general purpose processor may be a microprocessor, but in the alternative, the general purpose processor may be any conventional processor, controller, microcontroller, or state machine. A processor may also be implemented as a combination of computing devices, e.g., a digital signal processor and a microprocessor, a plurality of microprocessors, one or more microprocessors in conjunction with a digital signal processor core, or any other similar configuration.
The steps of a method or algorithm described in connection with the embodiments disclosed herein may be embodied directly in hardware, in a software module executed by a processor, or in a combination of the two. A software module may be stored in RAM memory, flash memory, ROM memory, EPROM memory, EEPROM memory, registers, hard disk, a removable disk, a CD-ROM, or any other form of storage medium known in the art. In an example, a storage medium may be coupled to the processor such that the processor can read information from, and write information to, the storage medium. In the alternative, the storage medium may be integral to the processor. The processor and the storage medium may reside in an ASIC, which may reside in a user terminal. In the alternative, the processor and the storage medium may reside as distinct components in a user terminal.
In one or more exemplary designs, the above-described functions of embodiments of the present invention may be implemented in hardware, software, firmware, or any combination of the three. If implemented in software, the functions may be stored on a computer-readable medium or transmitted as one or more instructions or code on the computer-readable medium. Computer readable media includes both computer storage media and communication media that facilitate transfer of computer programs from one place to another. A storage media may be any available media that can be accessed by a general purpose or special purpose computer. For example, such computer-readable media may include, but is not limited to, RAM, ROM, EEPROM, CD-ROM or other optical disk storage, magnetic disk storage or other magnetic storage devices, or any other medium that may be used to carry or store program code in the form of instructions or data structures and other data structures that may be read by a general or special purpose computer, or a general or special purpose processor. Further, any connection is properly termed a computer-readable medium, e.g., if the software is transmitted from a website, server, or other remote source via a coaxial cable, fiber optic cable, twisted pair, digital Subscriber Line (DSL), or wireless such as infrared, radio, and microwave, and is also included in the definition of computer-readable medium. The disks (disks) and disks (disks) include compact disks, laser disks, optical disks, DVDs, floppy disks, and blu-ray discs where disks usually reproduce data magnetically, while disks usually reproduce data optically with lasers. Combinations of the above may also be included within the computer-readable media.

Claims (7)

1. A service management method, comprising:
Receiving service configuration information, compiling and packaging the service configuration information to generate a program package service file;
Deploying the package service file into a container, so that the container runs the package service file;
Acquiring service operation data of the container, and acquiring service process operation information according to the service operation data;
Determining an exception handling scheme according to the service process operation information so as to process the corresponding service;
The step of obtaining service process operation information according to the service operation data comprises the following steps:
Determining service availability and response time fluctuation rate according to the service operation data; the service operation data comprises service available time, service starting time, service success times, service failure times, proportion service time consumption, service average time consumption and service time consumption median;
Acquiring service process operation information according to the service availability and the response time fluctuation rate;
Determining service availability from the service operation data comprises: determining the service availability according to the service availability time, the service starting time, the service success times and the service failure times;
determining a response time volatility from the service operational data comprises: and determining the response time fluctuation rate according to the proportion of service time consumption, the average service time consumption and the median of service time consumption.
2. The service management method according to claim 1, wherein determining an exception handling scheme to handle the corresponding service according to the service process operation information comprises:
when the service process operation information is that the service does not survive, restarting the corresponding service;
And when the service process operation information is service survival, determining an abnormal processing scheme corresponding to the returned communication area data in a preset scheme library so as to process the corresponding service.
3. The service management method according to claim 2, further comprising:
and when the service is restarted and fails, determining a corresponding exception handling scheme of the system resource data in a preset scheme library so as to process a container where the corresponding service is located.
4. The service management method of claim 2, wherein deploying the package service file into a container comprises:
when a simulated abnormal scene exists, establishing an exclusive container, and deploying the package service file into the exclusive container;
And when the simulated abnormal scene does not exist, determining comprehensive load data of each container, and deploying the package service file into the container corresponding to the minimum value of each comprehensive load data.
5. A service management apparatus, comprising:
the compiling module is used for receiving the service configuration information, compiling and packaging the service configuration information to generate a program package service file;
The deployment module is used for deploying the package service file into a container so that the container runs the package service file;
The acquisition module is used for acquiring service operation data of the container and acquiring service process operation information according to the service operation data;
The step of obtaining service process operation information according to the service operation data comprises the following steps:
Determining service availability and response time fluctuation rate according to the service operation data; the service operation data comprises service available time, service starting time, service success times, service failure times, proportion service time consumption, service average time consumption and service time consumption median;
Acquiring service process operation information according to the service availability and the response time fluctuation rate;
Determining service availability from the service operation data comprises: determining the service availability according to the service availability time, the service starting time, the service success times and the service failure times;
Determining a response time volatility from the service operational data comprises: determining the response time fluctuation rate according to the proportion of service time consumption, the average service time consumption and the median of service time consumption;
and the scheme determining module is used for determining an exception handling scheme according to the service process running information so as to process the corresponding service.
6. A computer device comprising a memory, a processor and a computer program stored on the memory and running on the processor, characterized in that the processor implements the steps of the service management method of any of claims 1 to 4 when the computer program is executed.
7. A computer readable storage medium having stored thereon a computer program, characterized in that the computer program, when executed by a processor, implements the steps of the service management method of any of claims 1 to 4.
CN202110930034.0A 2021-08-13 2021-08-13 Service management method and device Active CN113626044B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110930034.0A CN113626044B (en) 2021-08-13 2021-08-13 Service management method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110930034.0A CN113626044B (en) 2021-08-13 2021-08-13 Service management method and device

Publications (2)

Publication Number Publication Date
CN113626044A CN113626044A (en) 2021-11-09
CN113626044B true CN113626044B (en) 2024-08-06

Family

ID=78385320

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110930034.0A Active CN113626044B (en) 2021-08-13 2021-08-13 Service management method and device

Country Status (1)

Country Link
CN (1) CN113626044B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115174644B (en) * 2022-06-28 2023-09-12 武汉烽火技术服务有限公司 Container cluster service start-stop control method, device, equipment and storage medium

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104915285A (en) * 2015-06-30 2015-09-16 北京奇虎科技有限公司 Container process monitoring method, device and system
CN105677356A (en) * 2016-01-11 2016-06-15 上海雷腾软件股份有限公司 Operation and maintenance method and device

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106610836B (en) * 2016-12-23 2019-12-31 国网信息通信产业集团有限公司 Micro-service operation management tool
CN107248986A (en) * 2017-06-08 2017-10-13 医惠科技有限公司 A kind of service tray method
CN109450691B (en) * 2018-11-20 2022-01-11 深圳前海微众银行股份有限公司 Service gateway monitoring method, device and computer readable storage medium
CN112187581B (en) * 2020-09-29 2022-08-02 北京百度网讯科技有限公司 Service information processing method, device, equipment and computer storage medium
CN112559285B (en) * 2020-12-08 2023-05-30 中国联合网络通信集团有限公司 Micro-service monitoring method and related device based on distributed service architecture

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104915285A (en) * 2015-06-30 2015-09-16 北京奇虎科技有限公司 Container process monitoring method, device and system
CN105677356A (en) * 2016-01-11 2016-06-15 上海雷腾软件股份有限公司 Operation and maintenance method and device

Also Published As

Publication number Publication date
CN113626044A (en) 2021-11-09

Similar Documents

Publication Publication Date Title
US11108859B2 (en) Intelligent backup and recovery of cloud computing environment
CN111580954B (en) Extensible distributed data acquisition method and system
JP4809772B2 (en) Management based on computer system and distributed application model
CN111124850A (en) MQTT server performance testing method, system, computer equipment and storage medium
US20060235664A1 (en) Model-based capacity planning
WO2008134143A1 (en) Resource model training
US20090013321A1 (en) Managing virtual computers
WO2012073686A1 (en) Dependability maintenance device, dependability maintenance system, malfunction supporting system, method for controlling dependability maintenance device, control program, computer readable recording medium recording control program
US11550615B2 (en) Kubernetes resource policy enforcement
CN117762644A (en) Resource dynamic scheduling technology of distributed cloud computing system
US20210124492A1 (en) Efficient utilization of storage resources on data recovery sites using machine learning
US10372572B1 (en) Prediction model testing framework
CN114528186A (en) Model management method and device and model management server
CN113626044B (en) Service management method and device
CN113590285A (en) Method, system and equipment for dynamically setting thread pool parameters
CN111897684A (en) Disk fault simulation test method and device and electronic equipment
CN116467204A (en) Low-code application running stability optimization method
US11777810B2 (en) Status sharing in a resilience framework
Ma et al. Finding heterogeneous-unsafe configuration parameters in cloud systems
Cámara et al. Robustness evaluation of the rainbow framework for self-adaptation
CN110008098B (en) Method and device for evaluating operation condition of nodes in business process
Gokhale Analysis of software reliability and performance
CN118227558B (en) Information system snapshot generation method, device, equipment and storage medium
Wei et al. A self-management approach for service developers of PaaS
US20230195452A1 (en) Performing data correlation to optimize continuous integration environments

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant