CN113626044A - Service management method and device - Google Patents

Service management method and device Download PDF

Info

Publication number
CN113626044A
CN113626044A CN202110930034.0A CN202110930034A CN113626044A CN 113626044 A CN113626044 A CN 113626044A CN 202110930034 A CN202110930034 A CN 202110930034A CN 113626044 A CN113626044 A CN 113626044A
Authority
CN
China
Prior art keywords
service
container
file
package
determining
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202110930034.0A
Other languages
Chinese (zh)
Other versions
CN113626044B (en
Inventor
崔东晓
朱仲毅
李秀婧
翟文倩
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Industrial and Commercial Bank of China Ltd ICBC
Original Assignee
Industrial and Commercial Bank of China Ltd ICBC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Industrial and Commercial Bank of China Ltd ICBC filed Critical Industrial and Commercial Bank of China Ltd ICBC
Priority to CN202110930034.0A priority Critical patent/CN113626044B/en
Publication of CN113626044A publication Critical patent/CN113626044A/en
Application granted granted Critical
Publication of CN113626044B publication Critical patent/CN113626044B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F8/00Arrangements for software engineering
    • G06F8/60Software deployment
    • G06F8/61Installation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/36Preventing errors by testing or debugging software
    • G06F11/3668Software testing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F8/00Arrangements for software engineering
    • G06F8/40Transformation of program code
    • G06F8/41Compilation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/44Arrangements for executing specific programs
    • G06F9/455Emulation; Interpretation; Software simulation, e.g. virtualisation or emulation of application or operating system execution engines
    • G06F9/45533Hypervisors; Virtual machine monitors
    • G06F9/45558Hypervisor-specific management and integration aspects
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q40/00Finance; Insurance; Tax strategies; Processing of corporate or income taxes
    • G06Q40/02Banking, e.g. interest calculation or account maintenance
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q40/00Finance; Insurance; Tax strategies; Processing of corporate or income taxes
    • G06Q40/04Trading; Exchange, e.g. stocks, commodities, derivatives or currency exchange
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/44Arrangements for executing specific programs
    • G06F9/455Emulation; Interpretation; Software simulation, e.g. virtualisation or emulation of application or operating system execution engines
    • G06F9/45533Hypervisors; Virtual machine monitors
    • G06F9/45558Hypervisor-specific management and integration aspects
    • G06F2009/45562Creating, deleting, cloning virtual machine instances

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Software Systems (AREA)
  • Accounting & Taxation (AREA)
  • Finance (AREA)
  • Economics (AREA)
  • Development Economics (AREA)
  • Marketing (AREA)
  • Strategic Management (AREA)
  • Technology Law (AREA)
  • General Business, Economics & Management (AREA)
  • Quality & Reliability (AREA)
  • Computer Hardware Design (AREA)
  • Debugging And Monitoring (AREA)

Abstract

The invention provides a service management method and device, belongs to the technical field of cloud computing, and can be applied to the financial field or other fields. The service management method comprises the following steps: receiving service configuration information, compiling and packaging the service configuration information to generate a package service file; deploying the package service file into a container so that the container runs the package service file; acquiring service operation data of the container, and acquiring service process operation information according to the service operation data; and determining an exception handling scheme according to the service process running information so as to handle the corresponding service. The invention can reduce the burden of operation and maintenance personnel, reduce the labor cost and the time cost, and improve the service availability and the recovery time.

Description

Service management method and device
Technical Field
The invention relates to the technical field of cloud computing, in particular to a service management method and device.
Background
At present, a large commercial bank is in a stage of framework transformation, and the traditional technical framework is transformed into a distributed framework. In the transformation process, the test work for the distributed version gradually becomes the key point of the test work. The prior art can only support simple operation and maintenance deployment work, only solves the basic service deployment problem, often needs operation and maintenance personnel to perform manual analysis and problem troubleshooting to solve various abnormal problems in deployment, has huge manpower and time cost, influences the availability and recovery time of a system, and brings great obstruction to test work.
Disclosure of Invention
The embodiment of the invention mainly aims to provide a service management method and a service management device, so as to reduce the burden of operation and maintenance personnel, reduce the labor cost and the time cost, and improve the service usability and the recovery time.
In order to achieve the above object, an embodiment of the present invention provides a service management method, including:
receiving service configuration information, compiling and packaging the service configuration information to generate a package service file;
deploying the package service file into a container so that the container runs the package service file;
acquiring service operation data of the container, and acquiring service process operation information according to the service operation data;
and determining an exception handling scheme according to the service process running information so as to handle the corresponding service.
An embodiment of the present invention further provides a service management apparatus, including:
the compiling module is used for receiving the service configuration information and compiling and packaging the service configuration information to generate a package service file;
the deployment module is used for deploying the package service file into the container so as to enable the container to run the package service file;
the acquisition module is used for acquiring service operation data of the container and acquiring service process operation information according to the service operation data;
and the scheme determining module is used for determining an exception handling scheme according to the service process running information so as to process the corresponding service.
The embodiment of the present invention further provides a computer device, which includes a memory, a processor, and a computer program stored in the memory and running on the processor, and the steps of the service management method are implemented when the processor executes the computer program.
Embodiments of the present invention further provide a computer-readable storage medium, on which a computer program is stored, where the computer program, when executed by a processor, implements the steps of the service management method.
The service management method and the service management device provided by the embodiment of the invention firstly compile and package the service configuration information to generate the package service file so as to deploy the package service file into the container, then acquire the service operation data of the container to obtain the service process operation information, and finally determine the exception handling scheme according to the service process operation information so as to handle the corresponding service, so that the load of operation and maintenance personnel can be reduced, the labor cost and the time cost are reduced, and the service availability and the recovery time are improved.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments will be briefly introduced below, and it is obvious that the drawings in the following description are only some embodiments of the present invention, and it is obvious for those skilled in the art that other drawings can be obtained based on these drawings without creative efforts.
FIG. 1 is a flow chart of a method of service management in an embodiment of the invention;
FIG. 2 is a flow chart of obtaining service process run information in an embodiment of the present invention;
FIG. 3 is a block diagram showing the structure of a service management apparatus according to an embodiment of the present invention;
fig. 4 is a block diagram of a computer device in the embodiment of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
As will be appreciated by one skilled in the art, embodiments of the present invention may be embodied as a system, apparatus, device, method, or computer program product. Accordingly, the present disclosure may be embodied in the form of: entirely hardware, entirely software (including firmware, resident software, micro-code, etc.), or a combination of hardware and software.
In view of the fact that the prior art can only support simple operation and maintenance deployment work, the labor and time cost is huge, and the service availability and recovery time are affected, embodiments of the invention provide a service management method and device, which can perform convenient, flexible and rapid deployment on customized services between different version installation environments under the condition of a complex cloud native test environment, effectively solve the problem of long service preparation time, can complete customized service deployment work through simple configuration of a customized service template, enable background tasks to be automatically packaged, deployed, and enable services to be completed online in one key. Meanwhile, an abnormal scene customization interface is required to be opened for the specific test of the technical test in the test environment, the abnormal scene containing the current test content can be rapidly simulated, and the system support optimization can be rapidly completed for the newly added abnormal scene. In addition, the invention can synchronously support high concurrent service requirements, and realize dynamic scheduling and easy deployment through flexible dynamic transverse capacity expansion. The abnormal problems occurring in the deployment process and the operation process can be automatically identified and processed according to a preset scheme, the burden of operation and maintenance personnel is reduced, and the availability and recovery time of the system are improved. The present invention will be described in detail below with reference to the accompanying drawings.
Fig. 1 is a flowchart of a service management method in an embodiment of the present invention. As shown in fig. 1, the service management method includes:
s101: and receiving service configuration information, and compiling and packaging the service configuration information to generate a package service file.
In specific implementation, the configuration management module can be used for inputting configuration information of the service in a file or webpage configuration mode, wherein the service configuration information comprises a service name, a service version, a service method, an access type, a back-access type, an operation result, concurrency limit, timeout limit, a serialization mode, a registration center and the like. After the user submits the configuration information, the system checks the necessary input items one by one to confirm that the format of the input content is correct, judges whether the content of the selected input item conforms to the logic of the selected input item, and generates a service configuration information description information segment through a customized language module. The configuration analysis module acquires the service configuration information description information segment from the configuration management module in the customized deployment configuration module, converts the service configuration information segment to acquire converted service configuration information, and delivers the converted service configuration information to the software package packaging module for program packaging. The software package packaging module generates a program source code file and a corresponding dependency file according to the converted service configuration information, compiles and packages the program source code file and the corresponding dependency file to generate a program package service file, and delivers the program package service file to the container distribution module for distribution and uploading after the packaging is completed.
S102: and deploying the package service file into the container so that the container runs the package service file.
In one embodiment, deploying the package service file into the container comprises:
1. and when the simulation abnormal scene exists, establishing an exclusive container, and deploying the package service file into the exclusive container. Establishing an exclusive container may avoid simulating exceptions affecting other services.
How to test the abnormal scenes and the high-concurrency scenes of the service is always a difficult point, and how to quickly and accurately simulate various abnormal scenes is generally a difficult problem in the test process of the abnormal scenes; in addition, when a high-concurrency test is performed on the service, due to the fact that a large number of related upstream and downstream applications are involved, service bottlenecks often cannot be accurately identified under the condition that the performance of the upstream and downstream applications cannot be guaranteed, and the test progress and the test quality are greatly influenced. Aiming at special requirements of technical tests, such as test scenes of coverage work of various abnormal scenes, verification work of partial system failure and the like, in the prior art, service developers need to modify bottom codes of services, the professional requirement is high, the modification period and the circulation link are long, and the test period is seriously influenced.
In order to solve the problems, the invention inputs the configuration information of the abnormal service through the abnormal configuration module so as to simulate an abnormal scene. The simulated exception categories mainly include:
a) service unavailable exception: the service is set to be unavailable periodically, and the condition that the service is not available in burst is simulated. The main parameters include service name, unavailable time and recovery time, etc.
b) Service flow restriction exception: triggering service flow limitation, including TPS (system throughput) flow limitation, concurrent number flow limitation, and the like. The main parameters include service name, type of current limit, and current limit value.
c) Service timeout exception: triggering a service response time timeout. The main parameters include the service name and the timeout time, etc.
d) Service restart exception: the service frequently reboots during service invocation. The main parameters include the service name and the restart interval, etc.
e) Service exception error reporting: the service returns an error in the communication area. The main parameters include a service name and return contents, etc.
f) Service variable configuration exception: the service variables are not configured correctly. The main parameters include service name, variable value, etc.
The invention simulates an abnormal scene through an abnormal execution module, and the specific execution strategy is as follows:
a) service unavailable exception: and performing periodical offline operation on the service according to the service name, the unavailable time and the recovery time.
b) Service flow restriction exception: and thermally loading the service with the current limiting parameters such as the service name, the current limiting type, the current limiting value and the like.
c) Service timeout exception: and (4) carrying out hot loading on the timeout parameters such as the service name, the timeout time and the like to the service.
d) Service restart exception: the service is periodically restarted according to the service name and the restart interval.
e) Service exception error reporting: and changing the service return value into the content needing error reporting according to the service name and the return content.
f) Service variable configuration exception: and modifying the service configuration variable according to the service name, the variable name and the variable value and restarting to take effect.
2. And when the simulated abnormal scene does not exist, determining the comprehensive load data of each container, and deploying the package service file to the container corresponding to the minimum value of each comprehensive load data.
In particular, the comprehensive load data of the container can be determined as follows:
monitoring data (CPU data, memory data, thread pool data, connection pool data, disk IO data and the like) per minute are collected, and load conditions of 5-minute granularity, 15-minute granularity, 30-minute granularity, 60-minute granularity, 3-hour granularity, 6-hour granularity, 12-hour granularity, 24-hour granularity, 3-day granularity and 7-day granularity are calculated, wherein the load is percentage and is 100% at most. The integrated load data may be determined by the following equation:
Figure BDA0003210999010000051
wherein Z is the comprehensive load data of the container, w1Is a historical load weight coefficient, w2Resource overhead weight coefficient, n1Is the number of historical weight nodes, n2Is the resource overhead dimension (the dimension of the current monitoring data), xiIs the ith historical weight node weight, yiIs the ith historical weight node load value (granularity), ajThe dimensional weights are the cost of the resources,bjis the resource overhead value (current monitoring data). For example, the vessel is run for 12 hours, at which time the system can take a load of 5 minute, 15 minute, 30 minute, 60 minute, 3 hour, 6 hour and 12 hour particle sizes with a historical weight node number of 7.
The invention can also automatically expand or contract the container. When capacity expansion is carried out, calculating comprehensive load data of each container per minute, triggering automatic capacity expansion when the comprehensive load data of the container continuously exceeds 70% for 5 minutes, and adding a copy which is the same as the container; adding two copies identical to the container when the integrated load data of the container exceeds 80% for 5 minutes; four copies of the same container were added when the container's integrated load data was more than 90% loaded for 5 minutes in a row.
During capacity reduction, the average value of the comprehensive load data in nearly 24 hours can be counted at 6 points in the morning every day, and when the average value of the comprehensive load data is less than 30% and the time of the comprehensive load data being more than 50% is less than 5 minutes, the number of container copies is halved; when the average value of the integrated load data is less than 10% and the time for which the integrated load data is greater than 30% is less than 5 minutes, the number of container copies is reduced to one.
S103: and acquiring service operation data of the container, and acquiring service process operation information according to the service operation data.
Fig. 2 is a flowchart of acquiring service process running information according to an embodiment of the present invention. As shown in fig. 2, acquiring the service process operation information according to the service operation data includes:
s201: and determining the service availability ratio and the response time fluctuation ratio according to the service operation data.
In specific implementation, the service operation data can be monitored through the service monitoring module. The service operation data comprises service available time, service starting time, service success times, service failure times, proportion service time consumption, service average time consumption and service time consumption median.
In one embodiment, determining the service availability based on the service operation data comprises: and determining the service availability according to the service availability time, the service starting time, the service success times and the service failure times.
In specific implementation, the service availability can be determined by the following formula:
Figure BDA0003210999010000052
where p is service availability, t1Time available for service, t2For service start time, m1For the number of successful services, m2The number of service failures.
In one embodiment, determining the response time fluctuation rate from the service operation data comprises: and determining the fluctuation rate of the response time according to the proportion service time consumption, the average service time consumption and the median of the service time consumption.
In specific implementation, the response time fluctuation rate can be determined by the following formula:
Figure BDA0003210999010000061
wherein q is the response time fluctuation rate, t3Time consuming for proportional service, t4For average time spent servicing, t5For the median service time consumption, 90% of the service time consumption can be adopted as the proportional service time consumption.
S202: and acquiring service process running information according to the service availability and the response time fluctuation.
In specific implementation, the judgment can be carried out through the problem judgment module. And when the service availability is lower than the service availability threshold or the response time fluctuation rate is higher than the fluctuation rate threshold, acquiring service process running information through the problem judgment module. The service availability threshold may be 90%, and the fluctuation rate threshold may be 200%.
S104: and determining an exception handling scheme according to the service process running information so as to handle the corresponding service.
In one embodiment, S104 includes:
1. and when the service process running information indicates that the service does not survive, restarting the corresponding service.
In an embodiment, when the service is failed to restart, a corresponding exception handling scheme of the system resource data in a preset scheme library is determined to handle a container where the corresponding service is located.
In specific implementation, when the service is failed to restart for three times, a service starting log is collected, and whether resource error reporting exists is judged according to keywords of system resource data in the service starting log, wherein the keywords comprise elements such as a CPU (central processing unit), a memory, a storage and a network. If the resource type error is reported, the method is classified as a resource problem (container problem), retrieval is carried out in a scheme library according to the resource problem, the first five schemes with the highest success rate are used for processing the container where the corresponding service is located, and the success rate of the corresponding scheme is increased. If the system can not be started, the operation and maintenance personnel are notified through the mails, and the success rate of each scheme in the scheme library is reduced. And if the restart is successful, updating the processing method and adding the processing method into the scheme library.
2. And when the service process running information is service survival, determining an exception handling scheme corresponding to the returned communication area data in a preset scheme library so as to process the corresponding service.
In specific implementation, the transaction success rate of nearly 10 minutes can be counted. And if the transaction success rate is lower than 90%, counting the returned communication area data, matching the solution in the scheme library according to the returned communication area data, and processing the corresponding service by using the first five schemes with the highest success rate. If the transaction success rate is still lower than 90% within 10 minutes after the processing, notifying operation and maintenance personnel through a mail; and if the transaction success rate is recovered, updating the processing method and adding the processing method into the scheme library.
The execution subject of the service management method shown in fig. 1 may be a computer. As can be seen from the flow shown in fig. 1, the service management method according to the embodiment of the present invention first compiles and packages service configuration information to generate a package service file, deploys the package service file into a container, collects service operation data of the container to obtain service process operation information, and determines an exception handling scheme according to the service process operation information to handle a corresponding service, so that the load of operation and maintenance staff can be reduced, the labor cost and the time cost can be reduced, and the service availability and the recovery time can be improved.
The specific process of the embodiment of the invention is as follows:
1. and receiving service configuration information, and compiling and packaging the service configuration information to generate a package service file.
2. And when the simulated abnormal scene exists, establishing an exclusive container, and deploying the package service file into the exclusive container so as to enable the container to run the package service file.
3. And when the simulated abnormal scene does not exist, determining the comprehensive load data of each container, and deploying the package service file to the container corresponding to the minimum value of each comprehensive load data so as to enable the container to run the package service file.
4. And collecting service operation data of the container, and determining the service availability and the response time fluctuation rate according to the service operation data.
5. And acquiring service process running information according to the service availability and the response time fluctuation.
6. And when the service process running information indicates that the service does not survive, restarting the corresponding service. When the service is failed to restart, determining an exception handling scheme corresponding to the system resource data in a preset scheme library so as to handle a container where the corresponding service is located.
7. And when the service process running information is service survival, determining an exception handling scheme corresponding to the returned communication area data in a preset scheme library so as to process the corresponding service.
In summary, the service management method provided by the embodiment of the present invention has the following beneficial effects:
(1) aiming at the current situation that various abnormal scenes of a cloud native system are difficult to test, an effective, convenient and flexible scheme for rapid deployment of customized services is provided;
(2) the method makes up the defects of the existing cloud native system performance test method. Aiming at special requirements of technical tests, such as test scenes of coverage work of various abnormal scenes, verification work of partial system failure and the like, the abnormal services can be conveniently and rapidly deployed and tested without modifying bottom codes of the services by service developers, so that the test cost is reduced, the test period is shortened, the test quality is improved, the test requirements of various bank cloud native systems are rapidly met, and the test progress is guaranteed.
Based on the same inventive concept, the embodiment of the invention also provides a service management device, and as the principle of solving the problems of the device is similar to that of the service management method, the implementation of the device can refer to the implementation of the method, and repeated parts are not described again.
Fig. 3 is a block diagram of a service management apparatus according to an embodiment of the present invention. As shown in fig. 3, the service management apparatus includes:
the compiling module is used for receiving the service configuration information and compiling and packaging the service configuration information to generate a package service file;
the deployment module is used for deploying the package service file into the container so as to enable the container to run the package service file;
the acquisition module is used for acquiring service operation data of the container and acquiring service process operation information according to the service operation data;
and the scheme determining module is used for determining an exception handling scheme according to the service process running information so as to process the corresponding service.
In practical application, the service management device comprises seven modules, namely a container dependence module, a customized deployment configuration module, a background automatic deployment module, a visual service management module, an asset management module, an abnormal scene module, a problem study and judgment module and the like. The container dependence module is mainly responsible for providing a container environment required by service deployment and can automatically stretch and retract according to requirements; the customized deployment configuration module mainly comprises a set of concise service customized interpretation languages and is used for acquiring service customized configuration information of a user through a configuration interface or a configuration file; the background automatic deployment module is mainly responsible for generating a directly deployable software package file after the configuration information acquired from the customized deployment configuration module is subjected to machine language conversion, and automatically distributing an uploading container to run; the visual service management module is mainly responsible for controlling and managing the customized service and carrying out visual interaction of other modules; the asset management module is mainly responsible for persistent storage of customized services, and is convenient for multiplexing and asset management. The abnormal scene module mainly comprises an abnormal scene parameter customization rule; the abnormal scene simulation module is used for receiving abnormal scenes submitted by users and simulating abnormal scenes generated by services needing to be controlled. The problem studying and judging module is mainly used for studying and judging and analyzing the reported service operation condition, determining whether a problem exists, trying to process the problem and sending alarm information to operation and maintenance personnel under the condition that the problem cannot be solved.
In specific implementation, the container dependency module comprises an environment supporting module, a mirror image management module and a container monitoring module. After the container relying module is installed, the environment supporting module can provide software and hardware environments needed by the operation of the container, the container on the server is controlled to be started and stopped through the mirror image management module, and the container monitoring module can acquire the container resource condition.
The environment supporting module is based on a bottom layer server and is used for providing software and hardware environments needed by the operation container.
The mirror image management module is responsible for managing the containers required by the running service, and can automatically calculate the comprehensive load according to the monitoring data collected by the container monitoring module, and automatically perform operations such as expansion and contraction.
The container monitoring module is used for monitoring the integral operation condition of the container dependence module and providing the result to the mirror image management module and the visual service management module.
The customized deployment configuration module comprises a customized language module and a configuration management module.
The customized language module is used for designing and completing a set of natural language, and can complete various information required by service configuration through simple description information, including basic parameter configuration and additional parameter configuration of the service.
The configuration management module is used for providing an interface type configuration mode according to the structure of the customized language, interacting with a user through the visual service management module, providing a configuration mode based on a configuration file and supporting the conversion of the two configuration modes.
The background automatic deployment module comprises a configuration analysis module, a software package packaging module and a container distribution module.
The configuration analysis module acquires configuration information from the customized deployment configuration module, analyzes and converts the configuration information, and delivers the configuration information to the packaging module for program packaging, wherein the configuration information comprises a series of contents such as a service name, a service version, a service method, an access type, a back-access type, an operation result, concurrency limit, timeout limit, a serialization mode, a registration center and the like.
The software package packaging module comprises a compiling module used for receiving the packaging request from the configuration analysis module, generating a program source code file and a corresponding dependency file, compiling and packaging the program source code file and the corresponding dependency file to generate a program package service file, and delivering the program package service file to the container distribution module for distribution and uploading after the packaging is finished.
The container distribution module comprises a deployment module used for receiving the program package file generated by the software package packaging module, acquiring the current system operation condition through the container monitoring module, firstly judging the abnormal scene of the container needing to be simulated, and preferentially selecting the container with lower load for uploading operation if the abnormal scene does not need to be simulated. If the container needs to simulate an abnormal scene, the new exclusive container is built for deployment, and other services are prevented from being influenced in the abnormal simulation process.
The visual service management module comprises a service management module, a visual interaction module and a service monitoring module.
The service management module is used for managing the deployed services, and comprises operations of service stopping, starting, restarting, expansion and the like.
The visualization interaction module is responsible for visualization work of the whole system and interacts with users.
The service monitoring module comprises an acquisition module and is responsible for monitoring deployed services, mainly comprises service available time, service starting time, service success times, service failure times, proportional service time consumption, average service time consumption, median service time consumption, service health state and the like, alarms are given to services with abnormal states, and data collected by monitoring is subjected to persistence processing.
The asset management module comprises a data storage module and a storage management module.
The data storage module is used for storing the data reported by other modules.
The storage management module is used for managing the stored data, including automatic archiving, automatic cleaning, storage space use optimization and the like.
The exception scene module comprises an exception configuration module and an exception execution module.
The abnormal configuration module is used for providing abnormal type selection for a user and receiving configuration information of abnormal service.
The exception execution module is used for simulating exception scenes according to various exception types and exception configurations.
The problem studying and judging module comprises a scheme determining module, and in practical application, the problem studying and judging module comprises a problem judging module, a problem processing module and an intelligent learning module.
And the problem judgment module is used for studying, judging and analyzing the reported service operation condition and determining whether a problem exists.
And the problem processing module is used for automatically processing the judged problems.
The intelligent learning module is used for analyzing the confirmed effective treatment, extracting problem phenomena and solving measures and adding the problem phenomena and the solving measures into a problem treatment library (scheme library).
To sum up, the service management device in the embodiment of the present invention compiles and packages the service configuration information to generate the package service file, deploys the package service file into the container, acquires the service operation data of the container to obtain the service process operation information, and determines the exception handling scheme according to the service process operation information to handle the corresponding service, so that the load of operation and maintenance personnel can be reduced, the labor cost and the time cost can be reduced, and the service availability and the recovery time can be improved.
The embodiment of the present invention further provides a specific implementation manner of a computer device, which can implement all the steps in the service management method in the foregoing embodiment. Fig. 4 is a block diagram of a computer device in an embodiment of the present invention, and referring to fig. 4, the computer device specifically includes the following:
a processor (processor)401 and a memory (memory) 402.
The processor 401 is configured to call the computer program in the memory 402, and the processor implements all the steps of the service management method in the above embodiments when executing the computer program, for example, the processor implements the following steps when executing the computer program:
receiving service configuration information, compiling and packaging the service configuration information to generate a package service file;
deploying the package service file into a container so that the container runs the package service file;
acquiring service operation data of the container, and acquiring service process operation information according to the service operation data;
and determining an exception handling scheme according to the service process running information so as to handle the corresponding service.
To sum up, the computer device of the embodiment of the present invention compiles and packages the service configuration information to generate the package service file, deploys the package service file into the container, collects the service operation data of the container to obtain the service process operation information, and determines the exception handling scheme according to the service process operation information to handle the corresponding service, so as to reduce the burden of operation and maintenance personnel, reduce the labor cost and time cost, and improve the service availability and recovery time.
An embodiment of the present invention further provides a computer-readable storage medium capable of implementing all the steps in the service management method in the foregoing embodiment, where the computer-readable storage medium stores a computer program, and the computer program, when executed by a processor, implements all the steps in the service management method in the foregoing embodiment, for example, when the processor executes the computer program, the processor implements the following steps:
receiving service configuration information, compiling and packaging the service configuration information to generate a package service file;
deploying the package service file into a container so that the container runs the package service file;
acquiring service operation data of the container, and acquiring service process operation information according to the service operation data;
and determining an exception handling scheme according to the service process running information so as to handle the corresponding service.
To sum up, the computer-readable storage medium of the embodiment of the present invention compiles and packages the service configuration information to generate the package service file, deploys the package service file into the container, collects the service operation data of the container to obtain the service process operation information, and determines the exception handling scheme according to the service process operation information to handle the corresponding service, so that the load of the operation and maintenance personnel can be reduced, the labor cost and the time cost can be reduced, and the service availability and the recovery time can be improved.
The above-mentioned embodiments are intended to illustrate the objects, technical solutions and advantages of the present invention in further detail, and it should be understood that the above-mentioned embodiments are only exemplary embodiments of the present invention, and are not intended to limit the scope of the present invention, and any modifications, equivalent substitutions, improvements and the like made within the spirit and principle of the present invention should be included in the scope of the present invention.
Those of skill in the art will further appreciate that the various illustrative logical blocks, units, and steps described in connection with the embodiments disclosed herein may be implemented as electronic hardware, computer software, or combinations of both. To clearly illustrate the interchangeability of hardware and software, various illustrative components, elements, and steps have been described above generally in terms of their functionality. Whether such functionality is implemented as hardware or software depends upon the particular application and design requirements of the overall system. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the present embodiments.
The various illustrative logical blocks, or elements, or devices described in connection with the embodiments disclosed herein may be implemented or performed with a general purpose processor, a digital signal processor, an Application Specific Integrated Circuit (ASIC), a field programmable gate array or other programmable logic device, discrete gate or transistor logic, discrete hardware components, or any combination thereof designed to perform the functions described herein. A general-purpose processor may be a microprocessor, but in the alternative, the processor may be any conventional processor, controller, microcontroller, or state machine. A processor may also be implemented as a combination of computing devices, e.g., a digital signal processor and a microprocessor, a plurality of microprocessors, one or more microprocessors in conjunction with a digital signal processor core, or any other similar configuration.
The steps of a method or algorithm described in connection with the embodiments disclosed herein may be embodied directly in hardware, in a software module executed by a processor, or in a combination of the two. A software module may be stored in RAM memory, flash memory, ROM memory, EPROM memory, EEPROM memory, registers, hard disk, a removable disk, a CD-ROM, or any other form of storage medium known in the art. For example, a storage medium may be coupled to the processor such the processor can read information from, and write information to, the storage medium. In the alternative, the storage medium may be integral to the processor. The processor and the storage medium may reside in an ASIC, which may be located in a user terminal. In the alternative, the processor and the storage medium may reside in different components in a user terminal.
In one or more exemplary designs, the functions described above in connection with the embodiments of the invention may be implemented in hardware, software, firmware, or any combination of the three. If implemented in software, the functions may be stored on or transmitted over as one or more instructions or code on a computer-readable medium. Computer-readable media includes both computer storage media and communication media that facilitate transfer of a computer program from one place to another. Storage media may be any available media that can be accessed by a general purpose or special purpose computer. For example, such computer-readable media can include, but is not limited to, RAM, ROM, EEPROM, CD-ROM or other optical disk storage, magnetic disk storage or other magnetic storage devices, or any other medium which can be used to carry or store program code in the form of instructions or data structures and which can be read by a general-purpose or special-purpose computer, or a general-purpose or special-purpose processor. Additionally, any connection is properly termed a computer-readable medium, and, thus, is included if the software is transmitted from a website, server, or other remote source via a coaxial cable, fiber optic cable, twisted pair, Digital Subscriber Line (DSL), or wirelessly, e.g., infrared, radio, and microwave. Such discs (disk) and disks (disc) include compact disks, laser disks, optical disks, DVDs, floppy disks and blu-ray disks where disks usually reproduce data magnetically, while disks usually reproduce data optically with lasers. Combinations of the above may also be included in the computer-readable medium.

Claims (10)

1. A method for service management, comprising:
receiving service configuration information, and compiling and packaging the service configuration information to generate a package service file;
deploying the package service file into a container so that the container runs the package service file;
acquiring service operation data of the container, and acquiring service process operation information according to the service operation data;
and determining an exception handling scheme according to the service process running information so as to handle the corresponding service.
2. The service management method according to claim 1, wherein obtaining service process operation information according to the service operation data comprises:
determining service availability and response time fluctuation rate according to the service operation data;
and acquiring service process running information according to the service availability and the response time fluctuation rate.
3. The service management method according to claim 2, wherein the service operation data includes service available time, service start time, service success number and service failure number;
determining the service availability according to the service operation data comprises: and determining the service availability according to the service availability time, the service starting time, the service success times and the service failure times.
4. The service management method according to claim 2, wherein the service operation data includes: proportional service time consumption, average service time consumption and median service time consumption;
determining a response time fluctuation rate from the service operation data comprises: and determining the fluctuation rate of the response time according to the proportion service time consumption, the service average time consumption and the service time consumption median.
5. The service management method according to claim 1, wherein determining an exception handling scheme to handle the corresponding service according to the service process running information comprises:
when the service process running information indicates that the service does not survive, restarting the corresponding service;
and when the service process running information is service survival, determining an exception handling scheme corresponding to the returned communication area data in a preset scheme library so as to process the corresponding service.
6. The service management method according to claim 5, further comprising:
and when the service is failed to restart, determining an exception handling scheme corresponding to the system resource data in a preset scheme library so as to process the container where the corresponding service is located.
7. The service management method of claim 5, wherein deploying the package service file into a container comprises:
when a simulated abnormal scene exists, establishing an exclusive container, and deploying the program package service file into the exclusive container;
and when the simulated abnormal scene does not exist, determining the comprehensive load data of each container, and deploying the package service file to the container corresponding to the minimum value of the comprehensive load data.
8. A service management apparatus, comprising:
the compiling module is used for receiving the service configuration information, compiling and packaging the service configuration information to generate a package service file;
the deployment module is used for deploying the package service file into a container so that the container runs the package service file;
the acquisition module is used for acquiring service operation data of the container and acquiring service process operation information according to the service operation data;
and the scheme determining module is used for determining an exception handling scheme according to the service process running information so as to process the corresponding service.
9. A computer device comprising a memory, a processor and a computer program stored on the memory and running on the processor, characterized in that the steps of the service management method according to any of claims 1 to 7 are implemented when the computer program is executed by the processor.
10. A computer-readable storage medium, on which a computer program is stored, which, when being executed by a processor, carries out the steps of the service management method according to any one of claims 1 to 7.
CN202110930034.0A 2021-08-13 2021-08-13 Service management method and device Active CN113626044B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110930034.0A CN113626044B (en) 2021-08-13 2021-08-13 Service management method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110930034.0A CN113626044B (en) 2021-08-13 2021-08-13 Service management method and device

Publications (2)

Publication Number Publication Date
CN113626044A true CN113626044A (en) 2021-11-09
CN113626044B CN113626044B (en) 2024-08-06

Family

ID=78385320

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110930034.0A Active CN113626044B (en) 2021-08-13 2021-08-13 Service management method and device

Country Status (1)

Country Link
CN (1) CN113626044B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115174644A (en) * 2022-06-28 2022-10-11 武汉烽火技术服务有限公司 Container cluster service start-stop control method, device, equipment and storage medium

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104915285A (en) * 2015-06-30 2015-09-16 北京奇虎科技有限公司 Container process monitoring method, device and system
CN105677356A (en) * 2016-01-11 2016-06-15 上海雷腾软件股份有限公司 Operation and maintenance method and device
CN106610836A (en) * 2016-12-23 2017-05-03 国网信息通信产业集团有限公司 Micro-service operation management tool
CN107248986A (en) * 2017-06-08 2017-10-13 医惠科技有限公司 A kind of service tray method
CN109450691A (en) * 2018-11-20 2019-03-08 深圳前海微众银行股份有限公司 Gateway monitoring method, equipment and computer readable storage medium
CN112559285A (en) * 2020-12-08 2021-03-26 中国联合网络通信集团有限公司 Distributed service architecture-based micro-service monitoring method and related device
EP3819764A2 (en) * 2020-09-29 2021-05-12 Beijing Baidu Netcom Science And Technology Co. Ltd. Service information processing method, apparatus, device and computer storage medium

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104915285A (en) * 2015-06-30 2015-09-16 北京奇虎科技有限公司 Container process monitoring method, device and system
CN105677356A (en) * 2016-01-11 2016-06-15 上海雷腾软件股份有限公司 Operation and maintenance method and device
CN106610836A (en) * 2016-12-23 2017-05-03 国网信息通信产业集团有限公司 Micro-service operation management tool
CN107248986A (en) * 2017-06-08 2017-10-13 医惠科技有限公司 A kind of service tray method
CN109450691A (en) * 2018-11-20 2019-03-08 深圳前海微众银行股份有限公司 Gateway monitoring method, equipment and computer readable storage medium
EP3819764A2 (en) * 2020-09-29 2021-05-12 Beijing Baidu Netcom Science And Technology Co. Ltd. Service information processing method, apparatus, device and computer storage medium
CN112559285A (en) * 2020-12-08 2021-03-26 中国联合网络通信集团有限公司 Distributed service architecture-based micro-service monitoring method and related device

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115174644A (en) * 2022-06-28 2022-10-11 武汉烽火技术服务有限公司 Container cluster service start-stop control method, device, equipment and storage medium
CN115174644B (en) * 2022-06-28 2023-09-12 武汉烽火技术服务有限公司 Container cluster service start-stop control method, device, equipment and storage medium

Also Published As

Publication number Publication date
CN113626044B (en) 2024-08-06

Similar Documents

Publication Publication Date Title
CN111047190A (en) Diversified business modeling framework system based on interactive learning technology
US7877250B2 (en) Creation of resource models
US10310968B2 (en) Developing software project plans based on developer sensitivity ratings detected from monitoring developer error patterns
Evans et al. Comprehensive resource use monitoring for HPC systems with TACC stats
US7996204B2 (en) Simulation using resource models
US7974827B2 (en) Resource model training
WO2008134143A1 (en) Resource model training
CN113010393A (en) Fault drilling method and device based on chaotic engineering
CN113946499A (en) Micro-service link tracking and performance analysis method, system, equipment and application
CN107506194A (en) Application version, which retracts, determines method and device
CN113626044B (en) Service management method and device
Ciavotta et al. Architectural design of cloud applications: A performance-aware cost minimization approach
CN117971384A (en) Automatic operation and maintenance method based on container and big data
CN117762664A (en) Method, device, storage medium and equipment for managing computing tasks
CN114503132A (en) Debugging and profiling of machine learning model training
Hryszko et al. Cost effectiveness of software defect prediction in an industrial project
CN116467204A (en) Low-code application running stability optimization method
US12061515B2 (en) Methods and systems that automatically predict distributed-computer-system performance degradation using automatically trained machine-learning components
CN115065597A (en) Container resource allocation method, device, medium and equipment
CN110008098B (en) Method and device for evaluating operation condition of nodes in business process
US11855849B1 (en) Artificial intelligence based self-organizing event-action management system for large-scale networks
Singh et al. Survey on Fault Tolerant Techniques in Scientific Applications
CN112667597B (en) Algorithm model full life cycle management tool system and implementation method thereof
CN117851269B (en) Cloud-based automatic test environment management method and system
CN112084100B (en) Server operation and maintenance method, device and equipment and readable storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant