WO2020133609A1 - 一种服务器异常的处理方法及管理设备 - Google Patents

一种服务器异常的处理方法及管理设备 Download PDF

Info

Publication number
WO2020133609A1
WO2020133609A1 PCT/CN2019/072176 CN2019072176W WO2020133609A1 WO 2020133609 A1 WO2020133609 A1 WO 2020133609A1 CN 2019072176 W CN2019072176 W CN 2019072176W WO 2020133609 A1 WO2020133609 A1 WO 2020133609A1
Authority
WO
WIPO (PCT)
Prior art keywords
server
processing
management device
analysis instruction
target
Prior art date
Application number
PCT/CN2019/072176
Other languages
English (en)
French (fr)
Inventor
孙云云
Original Assignee
网宿科技股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 网宿科技股份有限公司 filed Critical 网宿科技股份有限公司
Priority to EP19906538.4A priority Critical patent/EP3905591A4/en
Priority to US17/293,793 priority patent/US20220006716A1/en
Publication of WO2020133609A1 publication Critical patent/WO2020133609A1/zh

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06Management of faults, events, alarms or notifications
    • H04L41/0631Management of faults, events, alarms or notifications using root cause analysis; using analysis of correlation between notifications, alarms or events based on decision criteria, e.g. hierarchy, tree or time analysis
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00Arrangements for monitoring or testing data switching networks
    • H04L43/08Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters
    • H04L43/0805Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters by checking availability
    • H04L43/0817Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters by checking availability by checking functioning
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/36Preventing errors by testing or debugging software
    • G06F11/3668Software testing
    • G06F11/3672Test management
    • G06F11/3688Test management for test execution, e.g. scheduling of test suites
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06Management of faults, events, alarms or notifications
    • H04L41/0654Management of faults, events, alarms or notifications using network fault recovery
    • H04L41/0659Management of faults, events, alarms or notifications using network fault recovery by isolating or reconfiguring faulty entities
    • H04L41/0661Management of faults, events, alarms or notifications using network fault recovery by isolating or reconfiguring faulty entities by reconfiguring faulty entities
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/14Network analysis or design
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/14Network analysis or design
    • H04L41/142Network analysis or design using statistical or mathematical methods
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06Management of faults, events, alarms or notifications
    • H04L41/0695Management of faults, events, alarms or notifications the faulty arrangement being the maintenance, administration or management system

Definitions

  • the present invention relates to the field of server technology, and in particular, to a server abnormality processing method and management equipment.
  • CDN Code Division Multiple Access
  • Embodiments of the present invention provide a server abnormality processing method and a management device, to solve the technical problems of manual processing of server abnormalities that are cumbersome, time-consuming, and wasteful of manpower in the prior art.
  • An embodiment of the present invention provides a method for processing a server exception.
  • the method includes:
  • the management device obtains the operation information of each server
  • the management device determines the target process based on the operation information of the servers and if it is determined that the first server has an abnormality, according to the operation information of the first server, and the pre-configured correspondence between the abnormal operation information and the processing mode the way;
  • the management device generates a processing instruction according to the target processing manner, and sends the processing instruction to the first server, so that the first server executes the processing instruction and adopts the target processing manner deal with.
  • the management device can determine the target processing mode according to the acquired abnormal information of the server and the correspondence between the pre-configured abnormal information and the processing mode, and then can generate a processing instruction according to the target processing mode and send it to the abnormal The server, so that the abnormal server executes the processing instruction and adopts the target processing method for processing.
  • the entire process does not require manual participation.
  • the management device automatically determines whether the server is abnormal based on the operation information of each server, and can automatically send processing instructions to the abnormal server to enable the abnormal server to automatically process.
  • the management device can automatically complete the server abnormality detection operation according to the operation information of each service, and each server can also be based on the received The processing instruction completes the operation of exception handling; further, the technical solution provided by the embodiment of the present invention can be changed from passive waiting for manual processing to active automated analysis processing, thereby reducing the server exception processing time and improving the stability of service quality .
  • the operation information includes a type of software exception
  • the management device determines the target processing mode according to the operating information of the first server, and the pre-configured correspondence between the abnormal operating information and the processing mode, including:
  • the management device determines at least one target analysis instruction according to the type of software abnormality corresponding to the first server, and the pre-configured correspondence between the type of software abnormality and the analysis instruction; the analysis instruction is used to Analyze the reasons;
  • the management device sends the at least one target analysis instruction to the first server, so that the first server executes the target analysis instruction, and generates receipt information according to the execution result;
  • the management device After receiving at least one return receipt information sent by the first server, the management device determines the cause of the software abnormality according to the return receipt information, and then determines the target processing mode according to the pre-configured correspondence between the abnormal cause and the processing mode .
  • the management device sends the at least one target analysis instruction to the first server, so that the first server executes the target analysis instruction, and generates receipt information according to the execution result ,include:
  • the management device sends the first target analysis instruction to the first server, so that the first server executes the first target analysis instruction, and generates a first according to the execution result Receipt information;
  • the first target analysis instruction is any one of the at least one target analysis instruction, and the first target analysis instruction is used to analyze the cause of the first aspect of the software exception;
  • the target processing mode is determined according to the correspondence between the abnormal cause and the processing mode, including:
  • the target processing method is determined according to the cause of the first aspect and the correspondence between the abnormal reason and the processing method .
  • it also includes:
  • the management device Based on the first receipt information, if the management device determines that the software abnormality is not caused by the cause of the first aspect, it sends a second target analysis instruction to the first server until the second receipt information determines that the cause is The cause of the second aspect causes a software exception; the second target analysis instruction is any one of the at least one target analysis instruction except the first target analysis instruction, and the second target analysis instruction is used to The cause of the second aspect of the software exception is analyzed, and the second receipt information is generated according to the execution result after the first server executes the second target analysis instruction.
  • the method further includes:
  • the management device receives updated operation information sent by the first server
  • the management device records historical processing records corresponding to the first server according to the updated operation information, and if it is determined that the first server has no abnormality, the historical processing records are used to update the pre-configured Correspondence between abnormal operation information and processing methods.
  • An embodiment of the present invention provides a method for processing a server exception.
  • the method includes:
  • the server receives a processing instruction sent by the management device, the processing instruction is generated according to a target processing method, the target processing method is that the management device determines that the server is abnormal, according to the server's operating information, and in advance The corresponding relationship between the configured abnormal operation information and the processing method is determined;
  • the server executes the processing instruction and uses the target processing mode for processing.
  • the operation information includes a type of software exception
  • the server Before the server receives the processing instruction sent by the management device, it also includes:
  • the server receives at least one target analysis instruction sent by the management device;
  • the target analysis instruction is the correspondence between the type of software exception corresponding to the server by the management device and the type of pre-configured software exception and the analysis instruction If the relationship is determined, the analysis instruction is used to analyze the cause of the software exception;
  • the server executes the target analysis instruction and generates receipt information according to the execution result
  • the server sends at least one return receipt information to the management device, so that the management device determines the cause of the software abnormality according to the return receipt information, and then determines the relationship according to the pre-configured correspondence between the abnormal cause and the processing method Target processing.
  • the server executes the target analysis instruction and generates receipt information according to the execution result, including:
  • the server executes the first target analysis instruction and generates first receipt information according to the execution result;
  • the first target analysis instruction is any one of the at least one target analysis instruction ,
  • the first target analysis instruction is used to analyze the cause of the first aspect of the software exception;
  • the server sends at least one return receipt information to the management device, so that the management device determines the cause of the software abnormality according to the return receipt information, and then determines the target processing mode according to the correspondence between the abnormal cause and the processing mode ,include:
  • the server sends the first receipt information to the management device, so that the management device based on the first receipt information, if it is determined that the software is abnormal due to the cause of the first aspect, then according to the first The cause of the problem, and the correspondence between the cause of the abnormality and the processing method, determine the target processing method.
  • it also includes:
  • the server receives a second target analysis instruction sent by the management device;
  • the second target analysis instruction is any one of the at least one target analysis instruction except the first target analysis instruction, the The second target analysis instruction is used to analyze the cause of the second aspect that causes the software exception;
  • the server executes the second target analysis instruction, and generates second receipt information according to the execution result
  • the server sends the second receipt information to the management device.
  • the server after the server executes the processing instruction, the server further includes:
  • the server sends updated operation information to the management device, so that the management device records the historical processing corresponding to the server after determining that the first server has no abnormality according to the updated operation information Record, the historical processing record is used to update the correspondence between the pre-configured abnormal operation information and the processing mode.
  • An embodiment of the present invention provides a management device.
  • the management device includes:
  • the receiving unit is used to obtain the operation information of each server
  • the processing unit is configured to determine the target according to the operating information of the first server and the correspondence between the pre-configured abnormal operating information and the processing method if it is determined that the first server has an abnormality according to the operating information of the servers Processing method; and generating processing instructions according to the target processing method;
  • the sending unit is configured to send the processing instruction to the first server, so that the first server executes the processing instruction and adopts the target processing mode for processing.
  • the operation information includes a type of software exception
  • the management device processing unit is specifically used for:
  • the analysis instruction is used to analyze the cause of the software exception
  • the sending unit is specifically used to:
  • the receiving unit is specifically used for:
  • the processing unit is also specifically used for:
  • the target processing mode is determined according to the pre-configured correspondence between the abnormal cause and the processing mode.
  • the sending unit is specifically configured to:
  • the first target analysis instruction send the first target analysis instruction to the first server, so that the first server executes the first target analysis instruction, and generates first receipt information according to the execution result;
  • the first target analysis instruction is any one of the at least one target analysis instruction, and the first target analysis instruction is used to analyze the cause of the first aspect of the software exception;
  • the processing unit is specifically used for:
  • the target processing mode is determined according to the cause of the first aspect and the correspondence between the abnormal cause and the processing mode.
  • the processing unit is further used to:
  • a second target analysis instruction is sent to the first server until it is determined that the second aspect is based on the second receipt information
  • the cause causes a software exception;
  • the second target analysis instruction is any one of the at least one target analysis instruction except the first target analysis instruction, and the second target analysis instruction is used to Analysis of the cause of the second aspect of the second aspect, the second receipt information is generated according to the execution result after the first server executes the second target analysis instruction.
  • the receiving unit is further used to:
  • the processing unit is also used to:
  • the updated operation information if it is determined that the first server has no abnormality, a history processing record corresponding to the first server is recorded, and the history processing record is used to update the pre-configured abnormal operation information Correspondence with processing methods.
  • An embodiment of the present application further provides an apparatus, which has a function of implementing the server exception processing method described above.
  • This function can be implemented by hardware executing corresponding software.
  • the device includes: a processor, a transceiver, and a memory; the memory is used to store computer execution instructions, and the transceiver is used to implement the device to communicate with other Entities communicate with each other, the processor and the memory are connected through the bus, and when the device is running, the processor executes the computer execution instructions stored in the memory to cause the device to perform the server exception processing method described above.
  • An embodiment of the present invention also provides a computer storage medium that stores a software program, which when read and executed by one or more processors implements the server described in the various possible implementation manners described above Exception handling method.
  • Embodiments of the present invention also provide a computer program product containing instructions that, when run on a computer, cause the computer to execute the server exception processing method described in the various possible implementation manners described above.
  • FIG. 1 is a schematic diagram of a system architecture applicable to an embodiment of the present invention
  • FIG. 2 is a schematic flowchart of a method for processing a server exception according to an embodiment of the present invention
  • FIG. 3 is a schematic structural diagram of a management device according to an embodiment of the present invention.
  • FIG. 4 is a schematic structural diagram of a server according to an embodiment of the present invention.
  • FIG. 1 exemplarily shows a schematic diagram of a system architecture to which the embodiment of the present invention is applied.
  • the system 100 to which the embodiment of the present invention is applied may include a management device 101 and multiple servers, for example, as shown in FIG. Server 1021, server 1022 and server 1023.
  • the management device 101 can communicate with any server through the network.
  • the management device 101 shown in FIG. 1 can obtain information from the server 1021 and send messages to the server 1021 through the network, and can also obtain information from the server 1022 through the network and By sending a message to the server 1022, you can also obtain information from the server 1023 and send a message to the server 1023 via the network.
  • FIG. 2 exemplarily shows a schematic flowchart of a method for processing a server exception provided by an embodiment of the present invention, and specifically includes the following steps:
  • Step 201 The management device obtains the operation information of each server.
  • the management device determines the target processing mode according to the operating information of the first server and the correspondence between the pre-configured abnormal operating information and the processing mode if it is determined that the first server has an abnormality according to the operating information of each server.
  • Step 203 The management device generates a processing instruction according to the target processing mode.
  • Step 204 The management device sends the processing instruction to the first server.
  • Step 205 The first server receives the processing instruction sent by the management device.
  • Step 206 the server executes the processing instruction and adopts the target processing mode for processing.
  • the management device can determine the target processing mode according to the acquired abnormal information of the server and the correspondence between the pre-configured abnormal information and the processing mode, and then can generate a processing instruction according to the target processing mode and send it to the abnormality The server, so that the abnormal server executes the processing instruction and adopts the target processing method for processing.
  • the entire process does not require manual participation.
  • the management device automatically determines whether the server is abnormal based on the operation information of each server, and can automatically send processing instructions to the abnormal server to enable the abnormal server to automatically process.
  • the management device can automatically complete the server abnormality detection operation according to the operation information of each service, and each server can also be based on the received The processing instruction completes the operation of exception handling; further, the technical solution provided by the embodiment of the present invention can be changed from passive waiting for manual processing to active automated analysis processing, thereby reducing the server exception processing time and improving the stability of service quality .
  • the operating information of the server may include the identification of each server and the operating status of each server.
  • the management device can determine whether the server is abnormal through a variety of methods.
  • the management device can obtain the identity of each server and the operating status of each server; in turn, the management device can determine the identity of each server And the running state of each server to determine whether there is a first server whose running state is abnormal, and if it exists, it can be determined that the first server is abnormal.
  • the management device may also determine the abnormal server from the external monitoring device, and the external monitoring device may obtain the identification of each server and the operating status of each server, and then determine the abnormal server.
  • the operation information may also include abnormal information, server identification, and time stamp information.
  • the exception information may include the type of software exception and the type of hardware exception.
  • the type of software exception may refer to an exception caused by a software service, such as a squid service for cache acceleration and a haproxy service for load balancing scheduling.
  • Abnormal; the type of hardware abnormality can refer to the abnormality caused by hardware services, such as server downtime.
  • the server's logo can be used to identify the server, and the server's logo can be expressed in letters, numbers, or symbols.
  • the time stamp information may refer to the time when the management device obtains the abnormal information.
  • the first abnormal information is: the server S squid service caused an exception, the management device obtained the first abnormal information at 8:00:00; the second abnormal information is: server B's haproxy service caused an exception, The time for the management device to obtain the second piece of abnormal information is 9:00:00; the third piece of abnormal information is: the server C is down, and the time for the management device to obtain the third piece of abnormal information is 10:00:00 .
  • the first exception information and the second exception information belong to the type of software exception
  • the third exception information belongs to the type of hardware exception.
  • Table 1 An example of abnormal information of the server
  • the management device Before performing step 202, the management device may first determine whether the exception type information corresponding to the server is the type of software exception, and if so, perform the subsequent steps; otherwise, the process terminates.
  • the reason for this is that, for the abnormality caused by the server downtime, the way of automatically processing the downtime can be adopted.
  • the embodiment of the present invention mainly solves the processing method of the server abnormality caused by the software service.
  • the management device may be pre-configured with the correspondence between abnormal information and processing methods. Through personalized configuration, it can adapt to the change of the processing scheme during the continuous change of the software version, so as to adapt to the diversity of the causes of server abnormalities. Further, due to the rapid change of the current software version, similar abnormal causes may occur in different software versions, but different software versions actually need to adopt different processing methods. For this case, the embodiments of the present invention may Adapting this situation through a pre-configured approach effectively avoids the need to update the automatic analysis process by modifying the code after the software version is changed.
  • the management device may be pre-configured with the correspondence relationship between the type of software exception and the analysis instruction, and through the correspondence relationship between the type of software exception and the analysis instruction, different analysis instructions corresponding to different types of software exceptions may be determined and then managed
  • the device may send the analysis instruction to the first server, so that the first server executes the analysis instruction to perform analysis.
  • the analysis instruction may include an analysis command, receipt information that needs to be obtained, and according to different receipt information, which processing instruction needs to be specified for processing.
  • the analysis instruction can be used to analyze the cause of the software exception.
  • the management device may first determine the type of software exception. Among them, the type of software exception may be squid service, or may also be haproxy service, or may be other types of services.
  • the first server may determine the receipt information according to the execution result, and may return the receipt information to the management device.
  • the management device may further determine whether the first server is abnormal according to the receipt information. If yes, the cause of the software abnormality is determined according to the receipt information, and then the target processing mode is determined according to the pre-configured correspondence between the abnormality cause and the processing mode; if not, the process returns to step 201 above.
  • the management device may generate a processing instruction according to the target processing mode and send the processing instruction to the first server; after receiving the processing instruction, the first server may execute the processing instruction, and the first server Execution of this processing instruction means that the target processing method is used to handle the exception.
  • the first server may execute the first target analysis instruction and generate first receipt information according to the execution result; further, The first server may send the first receipt information to the management device.
  • the management device determines whether the software abnormality is caused by the cause of the first aspect based on the first receipt information, and if so, according to the cause of the first aspect and the cause of the exception and Correspondence of processing methods to determine the target processing method.
  • the management device can send a second target analysis instruction to the first server, and the first server can execute the second target analysis instruction and generate second receipt information according to the execution result; Furthermore, the first server may send the second receipt information to the management device until the management device determines that the software abnormality is caused by the second aspect according to the second receipt information.
  • the first target analysis instruction is any one of at least one target analysis instruction. The first target analysis instruction is used to analyze the cause of the first aspect of the software exception; the second target analysis instruction is at least one target analysis Any target analysis instruction other than the first target analysis instruction in the instruction, and the second target analysis instruction is used to analyze the cause of the second aspect that causes the software exception.
  • the corresponding analysis instruction may be the analysis instruction A1 and the analysis instruction A2, and the analysis instruction A1 may only analyze the first aspect of the software abnormality caused by the Squid service,
  • the analysis instruction A2 may only analyze the second cause of the software exception caused by the Squid service;
  • the corresponding analysis instruction may be analysis instruction B1, analysis instruction B2, and analysis instruction B3 , While analysis command B1 can only analyze the first cause of software exceptions caused by haproxy services, analysis command B2 can only analyze the second cause of software exceptions caused by haproxy services, and analysis command B3 can only analyze haproxy services.
  • the cause of the third aspect of software anomalies is analyzed. It should be noted that the analysis instructions corresponding to the exceptions caused by the Squid service (that is, A1 and A2) and the analysis instructions corresponding to the exceptions caused by the haproxy service (that is, B1, B2, and B3) are independent of each other, and there is no connection.
  • the management device may determine that the target analysis instruction includes analysis instructions A1 and A2. Further, the management device may first Send the analysis instruction A1 to the first server.
  • the first server After executing the analysis instruction A1, the first server generates first receipt information according to the execution result, and returns the first receipt information to the management device; in this way, the management device can judge based on the first receipt information Whether the software abnormality is caused by the cause of the first aspect; further, if the management device determines that the software abnormality is not caused by the cause of the first aspect according to the first receipt information of the analysis instruction A1, it may send the analysis instruction A2 to the first server, After executing the analysis instruction A2, a server generates second receipt information according to the execution result, and returns the second receipt information to the management device; further, the management device can determine whether the software abnormality is caused by the second aspect according to the second receipt information. Similarly, if the type of software exception is an exception caused by haproxy service, the above process can also be used for analysis.
  • the management device may also be pre-configured with the correspondence between the cause of the abnormality and the processing method.
  • the management device can determine the cause of the software abnormality according to the receipt information, and the correspondence between the abnormal cause and the processing method, and determine the different processing methods corresponding to different abnormal causes, which can be determined according to Generates a processing instruction and sends it to the first server, so that the first server executes the processing instruction, that is, the determined processing method is used for processing.
  • Table 3 it is an example of the correspondence between the cause of abnormality and the processing method.
  • the processing method a1 if it is determined that the software abnormality is caused by the first aspect, you can use the processing method a1; if you determine that the software abnormality is caused by the second aspect, you can use the processing method a2; if it is determined to be caused by If the third aspect causes the software exception, you can use the processing method b1; if it is determined that the fourth aspect causes the software exception, you can use the processing method b2; if it is determined that it is caused by the fifth aspect If the software is abnormal, you can use the processing method b3.
  • Table 3 An example of the correspondence between abnormal causes and handling methods
  • the first reason Treatment method a1 The second reason Treatment method a2
  • the third reason Treatment method b1 The fourth reason Treatment method b2 Reason for the fifth aspect Treatment method b3
  • the management device may first determine at least one target analysis instruction according to the type of software abnormality corresponding to the first server, and the pre-configured correspondence between the type of software exception and the analysis instruction.
  • the management device sends the target analysis instruction to After the first server, you can determine the cause of the software abnormality based on the return information returned by the first server.
  • the management device can determine the target processing method according to the correspondence between the abnormal reason and the processing method, and generate a processing instruction. After the management device sends the processing instruction to the first server, the first server may execute the processing instruction to perform exception handling.
  • processing method 1 is: create cache directory command
  • processing method 2 is: service, restart and restart.
  • the management device determines that the Squid service abnormality occurs in the first server according to the operation information of the first server, firstly, according to the correspondence between the pre-configured software abnormality type and the analysis instruction, the target analysis instruction is determined as the analysis instruction A1 (ie, sysCheckForSquid);
  • the management device may send the analysis instruction A1 to the first server.
  • the first server may execute the analysis instruction A1, obtain corresponding receipt information according to the execution result, and return the receipt information corresponding to the analysis instruction A1 to the management device.
  • the management device can determine the cause of the software abnormality based on the receipt information.
  • the management device can generate a processing instruction according to processing method 1 and processing method 2 and send it to the first server.
  • a server After a server receives the processing instruction, it can create a cache directory and restart the Squid service; if the return receipt information is: xxxxx cache_dir xxxxxxxx deadxxx, it means that the cause of the software exception is "service exception", and then, the cause and processing method of the exception can be pre-configured Corresponding relationship is determined, the target processing method is processing method 2, then, the management device can generate a processing instruction according to processing method 2 and send it to the first server. After receiving the processing instruction, the first server can execute restarting the squid service.
  • step 203 it may be further determined whether the abnormality of the first server is resolved.
  • the first server may send the updated operation information to the management device. Similar to the above step 201, the updated operation information includes the updated operation state corresponding to the first server; the management device operates after receiving the update After the information, it can be determined whether the updated operating state corresponding to the first server is the first server in the abnormal state. If it is, it indicates that the abnormality of the first server has not been resolved; if not, it indicates that the abnormality of the first server is resolved.
  • the management device determines that the abnormality of the server is released, it can record the processing information, which can be used to subsequently analyze the frequency of abnormality, which is convenient for the improvement of related software.
  • the direction of optimization further improves the software itself, and can also find better processing methods, optimize the configuration of processing instructions, so as to continuously improve the software and the above-mentioned server exception handling process.
  • the management device determines that the abnormality of the server is not resolved, it makes a judgment on the abnormal time (ie, the time difference between the time when the management device obtains the abnormal information and the current time), and if the abnormal time is greater than the preset threshold, then Directly register the failure of the server and send an email to notify the relevant software administrator. If the abnormal time is less than the preset threshold, you can wait for the preset time and then check the server again.
  • the preset threshold may be determined by those skilled in the art based on experience and actual conditions, for example, it may be determined according to the current manual analysis processing time, which is not specifically limited.
  • FIG. 3 exemplarily shows a schematic structural diagram of a management device provided by an embodiment of the present invention.
  • the management device 300 includes a receiving unit 301, a processing unit 302, and a sending unit 303; among them,
  • the receiving unit 301 is used to obtain the operating information of each server
  • the processing unit 302 is configured to determine, according to the operating information of each server, if the first server is abnormal, according to the operating information of the first server, and the pre-configured correspondence between the abnormal operating information and the processing method, Target processing method; and generating processing instructions according to the target processing method;
  • the sending unit 303 is configured to send the processing instruction to the first server, so that the first server executes the processing instruction and adopts the target processing mode for processing.
  • the operation information includes exception information, and the exception information includes a type of software exception;
  • the management device processing unit 302 is specifically used to:
  • the analysis instruction is used to analyze the cause of the software exception
  • the sending unit 303 is specifically used to:
  • the receiving unit 301 is specifically used to:
  • the processing unit 302 is also specifically used to:
  • the target processing mode is determined according to the pre-configured correspondence between the abnormal cause and the processing mode.
  • the sending unit 303 is specifically configured to:
  • the first target analysis instruction send the first target analysis instruction to the first server, so that the first server executes the first target analysis instruction, and generates first receipt information according to the execution result;
  • the first target analysis instruction is any one of the at least one target analysis instruction, and the first target analysis instruction is used to analyze the cause of the first aspect of the software exception;
  • the processing unit 302 is specifically used to:
  • the target processing mode is determined according to the cause of the first aspect and the correspondence between the abnormal cause and the processing mode.
  • processing unit 302 is further used to:
  • a second target analysis instruction is sent to the first server until it is determined that the second aspect is based on the second receipt information
  • the cause causes a software exception;
  • the second target analysis instruction is any one of the at least one target analysis instruction except the first target analysis instruction, and the second target analysis instruction is used to Analysis of the cause of the second aspect of the second aspect, the second receipt information is generated according to the execution result after the first server executes the second target analysis instruction.
  • the receiving unit 301 is further configured to:
  • the processing unit 302 is also used to:
  • the updated operation information if it is determined that the first server has no abnormality, a history processing record corresponding to the first server is recorded, and the history processing record is used to update the pre-configured abnormal operation information Correspondence with processing methods.
  • FIG. 4 exemplarily shows a schematic structural diagram of a server provided by an embodiment of the present invention.
  • the server 400 includes a receiving unit 401, a processing unit 402, and a sending unit 403; wherein,
  • the receiving unit 401 is configured to receive a processing instruction sent by the management device, and the processing instruction is generated according to a target processing method.
  • the target processing method is that the management device determines that the server is abnormal, and then The operation information and the corresponding relationship between the pre-configured abnormal operation information and the processing method are determined;
  • the processing unit 402 is configured to execute the processing instruction and perform processing using the target processing mode.
  • the operation information includes exception information, and the exception information includes a type of software exception;
  • the receiving unit 401 Before receiving the processing instruction sent by the management device, the receiving unit 401 is also used to:
  • the target analysis instruction is determined by the management device according to the type of software exception corresponding to the server, and the pre-configured correspondence between the type of software exception and the analysis instruction ,
  • the analysis instruction is used to analyze the cause of the software exception;
  • the processing unit 402 is also used to:
  • the sending unit 403 is also used to:
  • processing unit 402 is specifically configured to:
  • the first target analysis instruction execute the first target analysis instruction and generate first receipt information according to the execution result;
  • the first target analysis instruction is any one of the at least one target analysis instruction, the The first target analysis instruction is used to analyze the cause of the first aspect of the software exception;
  • the sending unit 403 is specifically used to:
  • the receiving unit 401 is further used to:
  • the second target analysis instruction is any one of the at least one target analysis instruction except the first target analysis instruction, the second target The analysis instruction is used to analyze the cause of the second aspect that causes the software exception;
  • the processing unit 402 is also used to:
  • the sending unit 403 is also used to:
  • the sending unit 403 is further configured to:
  • the management device sending new operation information to the management device, so that the management device records the historical processing record corresponding to the server after determining that the first server has no abnormality according to the updated operation information.
  • the historical processing record is used to update the correspondence between the pre-configured abnormal operation information and the processing mode.
  • An embodiment of the present application further provides an apparatus, which has a function of implementing the server exception processing method described above.
  • This function can be implemented by hardware executing corresponding software.
  • the device includes: a processor, a transceiver, and a memory; the memory is used to store computer execution instructions, and the transceiver is used to implement the device to communicate with other Entities communicate with each other, the processor and the memory are connected through the bus, and when the device is running, the processor executes the computer execution instructions stored in the memory to cause the device to perform the server exception processing method described above.
  • An embodiment of the present invention also provides a computer storage medium that stores a software program, which when read and executed by one or more processors implements the server described in the various possible implementation manners described above Exception handling method.
  • Embodiments of the present invention also provide a computer program product containing instructions that, when run on a computer, cause the computer to execute the server exception processing method described in the various possible implementation manners described above.
  • the embodiments of the present invention may be provided as methods, systems, or computer program products. Therefore, the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment, or an embodiment combining software and hardware. Moreover, the present invention may take the form of a computer program product implemented on one or more computer usable storage media (including but not limited to disk storage, CD-ROM, optical storage, etc.) containing computer usable program code.
  • computer usable storage media including but not limited to disk storage, CD-ROM, optical storage, etc.
  • each flow and/or block in the flowchart and/or block diagram and a combination of the flow and/or block in the flowchart and/or block diagram may be implemented by computer program instructions.
  • These computer program instructions can be provided to the processor of a general-purpose computer, special-purpose computer, embedded processing machine, or other programmable data processing device to produce a machine that enables the generation of instructions executed by the processor of the computer or other programmable data processing device
  • These computer program instructions may also be stored in a computer readable memory that can guide a computer or other programmable data processing device to work in a specific manner, so that the instructions stored in the computer readable memory produce an article of manufacture including an instruction device, the instructions
  • the device implements the functions specified in one block or multiple blocks of the flowchart one flow or multiple flows and/or block diagrams.
  • These computer program instructions can also be loaded onto a computer or other programmable data processing device, so that a series of operating steps are performed on the computer or other programmable device to produce computer-implemented processing, which is executed on the computer or other programmable device
  • the instructions provide steps for implementing the functions specified in one block or multiple blocks of the flowchart one flow or multiple flows and/or block diagrams.

Abstract

本发明公开了一种服务器异常的处理方法及管理设备。所述方法包括:管理设备获取到各服务器的运行信息后,若确定第一服务器出现异常,则根据获取到第一服务器的运行信息以及预先配置的异常信息和处理方式的对应关系,确定目标处理方式并生成处理指令,进而将处理指令发送给第一服务器,以使第一服务器采用目标处理方式对异常进行处理。本发明实施例中,管理设备可以自动地根据各服务的运行信息完成服务器异常检测的操作,各服务器也可以根据接收到的处理指令完成异常处理的操作;进一步地,可以由被动的等待人工处理变成主动的自动化分析处理,从而减少服务器异常的处理时间,提高服务质量的稳定性。

Description

一种服务器异常的处理方法及管理设备 技术领域
本发明涉及服务器技术领域,尤其涉及一种服务器异常的处理方法及管理设备。
背景技术
随着互联网技术的快速发展,用户对于访问互联网的网速要求也越来越高,越来越多的网络供应商采用CDN进行加速服务。对于CDN系统而言,对网络质量的要求较高,需要及时发现服务器异常并及时进行处理。
现有技术中,一旦服务器出现异常,就需要触发人工操作的一个工作流程。然而,服务器异常有各种各样的原因,对于不同的原因,都需要人工处理,过程比较繁琐,并且容易出现人工处理不够及时,可能造成服务中断时间延长的情况。
基于此,目前亟需一种服务器异常的处理方法,用于解决现有技术中人工处理服务器异常过程繁琐、耗时较长且浪费人力的问题。
发明内容
本发明实施例提供一种服务器异常的处理方法及管理设备,以解决现有技术中人工处理服务器异常过程繁琐、耗时较长且浪费人力的技术问题。
本发明实施例提供一种服务器异常的处理方法,所述方法包括:
管理设备获取各服务器的运行信息;
所述管理设备根据所述各服务器的运行信息,若确定第一服务器出现异常,则根据所述第一服务器的运行信息,以及预先配置的异常的运行信息和处理方式的对应关系,确定目标处理方式;
所述管理设备根据所述目标处理方式,生成处理指令,并将所述处理指令发送给所述第一服务器,以使所述第一服务器执行所述处理指令,并采用所述目标处理方式进行处理。
如此,管理设备在确定服务器出现异常后,可以根据获取到服务器的异常信息以及预先配置的异常信息和处理方式的对应关系,确定目标处理方式,进而可以根据目标处理方式生成处理指令并发送给异常服务器,以使异常服务器执行处理指令,并采用目标处理方式进行处理。整个过程无需人工参与,管理设备自动地根据各服务器的运行信息,判断服务器是否出现异常,并且可以自动向异常服务器发送处理指令,以使异常服务器自动进行处理。可见,相比于现有技术中人工处理服务器异常的方式而言,本发明实施例中,管理设备可以自动地根据各服务的运行信息完成服务器异常检测的操作,各服务器也可以根据接收到的处理指令完成异常处理的操作;进一步地,采用本发明实施例提供的技术方案,可以由被动的等待人工处理变成主动的自动化分析处理,从而减少服务器异常的处理时间,提高服务质量的稳定性。
在一种可能的实现方式中,所述运行信息包括软件异常的类型;
所述管理设备根据所述第一服务器的运行信息,以及预先配置的异常的运行信息和处理方式的对应关系,确定目标处理方式,包括:
所述管理设备根据所述第一服务器对应的软件异常的类型,以及预先配置的软件异常的类型和分析指令的对应关系,确定至少一个目标分析指令;所述分析指令用于对引起软件异常的原因进行分析;
所述管理设备将所述至少一个目标分析指令发送给所述第一服务器,以使所述第一服务器执行所述目标分析指令,并根据执行结果生成回执信息;
所述管理设备接收到所述第一服务器发送的至少一个回执信息后,根据所述回执信息确定出引起软件异常的原因后,根据预先配置的异常原因和处理方式的对应关系,确定目标处理方式。
在一种可能的实现方式中,所述管理设备将所述至少一个目标分析指令发 送给所述第一服务器,以使所述第一服务器执行所述目标分析指令,并根据执行结果生成回执信息,包括:
针对第一目标分析指令,所述管理设备将所述第一目标分析指令发送给所述第一服务器,以使所述第一服务器执行所述第一目标分析指令,并根据执行结果生成第一回执信息;所述第一目标分析指令为所述至少一个目标分析指令中任一目标分析指令,所述第一目标分析指令用于对引起软件异常的第一方面的原因进行分析;
所述管理设备根据所述回执信息确定出引起软件异常的原因后,根据异常原因和处理方式的对应关系,确定目标处理方式,包括:
所述管理设备根据所述第一回执信息,若确定由所述第一方面的原因引起软件异常,则根据所述第一方面的原因,以及异常原因和处理方式的对应关系,确定目标处理方式。
在一种可能的实现方式中,还包括:
所述管理设备根据所述第一回执信息,若确定不由所述第一方面的原因引起软件异常,则向所述第一服务器发送第二目标分析指令,直至根据第二回执信息,确定是由第二方面的原因引起软件异常;所述第二目标分析指令为所述至少一个目标分析指令中除所述第一目标分析指令以外的任一目标分析指令,所述第二目标分析指令用于对引起软件异常的所述第二方面的原因进行分析,所述第二回执信息是所述第一服务器执行所述第二目标分析指令后根据执行结果生成的。
在一种可能的实现方式中,所述管理设备根将所述处理指令发送给所述第一服务器之后,还包括:
所述管理设备接收所述第一服务器发送更新后的运行信息;
所述管理设备根据所述更新后的运行信息,若确定所述第一服务器未出现异常,则记录所述第一服务器对应的历史处理记录,所述历史处理记录用于更新所述预先配置的异常的运行信息和处理方式的对应关系。
本发明实施例提供一种服务器异常的处理方法,所述方法包括:
服务器接收管理设备发送的处理指令,所述处理指令是根据目标处理方式生成的,所述目标处理方式是所述管理设备在确定所述服务器出现异常后,根据所述服务器的运行信息,以及预先配置的异常的运行信息和处理方式的对应关系确定的;
所述服务器执行所述处理指令,并采用所述目标处理方式进行处理。
在一种可能的实现方式中,所述运行信息包括软件异常的类型;
所述服务器接收管理设备发送的处理指令之前,还包括:
所述服务器接收所述管理设备发送的至少一个目标分析指令;所述目标分析指令是所述管理设备根据所述服务器对应的软件异常的类型,以及预先配置的软件异常的类型和分析指令的对应关系确定的,所述分析指令用于对引起软件异常的原因进行分析;
所述服务器执行所述目标分析指令,并根据执行结果生成回执信息;
所述服务器向所述管理设备发送至少一个回执信息,以使所述管理设备根据所述回执信息确定出引起软件异常的原因后,根据预先配置的异常原因和处理方式的对应关系,确定所述目标处理方式。
在一种可能的实现方式中,所述服务器执行所述目标分析指令,并根据执行结果生成回执信息,包括:
针对第一目标分析指令,所述服务器执行所述第一目标分析指令,并根据执行结果生成第一回执信息;所述第一目标分析指令为所述至少一个目标分析指令中任一目标分析指令,所述第一目标分析指令用于对引起软件异常的第一方面的原因进行分析;
所述服务器向所述管理设备发送至少一个回执信息,以使所述管理设备根据所述回执信息确定出引起软件异常的原因后,根据异常原因和处理方式的对应关系,确定所述目标处理方式,包括:
所述服务器向所述管理设备发送所述第一回执信息,以使所述管理设备根 据所述第一回执信息,若确定由所述第一方面的原因引起软件异常,则根据所述第一方面的原因,以及异常原因和处理方式的对应关系,确定目标处理方式。
在一种可能的实现方式中,还包括:
所述服务器接收所述管理设备发送的第二目标分析指令;所述第二目标分析指令为所述至少一个目标分析指令中除所述第一目标分析指令以外的任一目标分析指令,所述第二目标分析指令用于对引起软件异常的所述第二方面的原因进行分析;
所述服务器执行所述第二目标分析指令,并根据执行结果生成第二回执信息;
所述服务器向所述管理设备发送所述第二回执信息。
在一种可能的实现方式中,所述服务器执行所述处理指令之后,还包括:
所述服务器向所述管理设备发送更新后的运行信息,以使所述管理设备根据所述更新后的运行信息,在确定所述第一服务器未出现异常后,记录所述服务器对应的历史处理记录,所述历史处理记录用于更新所述预先配置的异常的运行信息和处理方式的对应关系。
本发明实施例提供一种管理设备,所述管理设备包括:
接收单元,用于获取各服务器的运行信息;
处理单元,用于根据所述各服务器的运行信息,若确定第一服务器出现异常,则根据所述第一服务器的运行信息,以及预先配置的异常的运行信息和处理方式的对应关系,确定目标处理方式;以及根据所述目标处理方式,生成处理指令;
发送单元,用于将所述处理指令发送给所述第一服务器,以使所述第一服务器执行所述处理指令,并采用所述目标处理方式进行处理。
在一种可能的实现方式中,所述运行信息包括软件异常的类型;
所述管理设备处理单元具体用于:
根据所述第一服务器对应的软件异常的类型,以及预先配置的软件异常的 类型和分析指令的对应关系,确定至少一个目标分析指令;所述分析指令用于对引起软件异常的原因进行分析;
所述发送单元具体用于:
将所述至少一个目标分析指令发送给所述第一服务器,以使所述第一服务器执行所述目标分析指令,并根据执行结果生成回执信息;
所述接收单元具体用于:
接收所述第一服务器发送的至少一个回执信息;
所述处理单元还具体用于:
根据所述回执信息确定出引起软件异常的原因后,根据预先配置的异常原因和处理方式的对应关系,确定目标处理方式。
在一种可能的实现方式中,所述发送单元具体用于:
针对第一目标分析指令,将所述第一目标分析指令发送给所述第一服务器,以使所述第一服务器执行所述第一目标分析指令,并根据执行结果生成第一回执信息;所述第一目标分析指令为所述至少一个目标分析指令中任一目标分析指令,所述第一目标分析指令用于对引起软件异常的第一方面的原因进行分析;
所述处理单元具体用于:
根据所述第一回执信息,若确定由所述第一方面的原因引起软件异常,则根据所述第一方面的原因,以及异常原因和处理方式的对应关系,确定目标处理方式。
在一种可能的实现方式中,所述处理单元还用于:
根据所述第一回执信息,若确定不由所述第一方面的原因引起软件异常,则向所述第一服务器发送第二目标分析指令,直至根据第二回执信息,确定是由第二方面的原因引起软件异常;所述第二目标分析指令为所述至少一个目标分析指令中除所述第一目标分析指令以外的任一目标分析指令,所述第二目标分析指令用于对引起软件异常的所述第二方面的原因进行分析,所述第二回执信息是所述第一服务器执行所述第二目标分析指令后根据执行结果生成的。
在一种可能的实现方式中,所述发送单元将所述处理指令发送给所述第一服务器之后,所述接收单元还用于:
接收所述第一服务器发送更新后的运行信息;
所述处理单元还用于:
根据所述更新后的运行信息,若确定所述第一服务器未出现异常,则记录所述第一服务器对应的历史处理记录,所述历史处理记录用于更新所述预先配置的异常的运行信息和处理方式的对应关系。
本申请实施例的还提供一种装置,该装置具有实现上文所描述的服务器异常的处理方法的功能。该功能可以通过硬件执行相应的软件实现,在一种可能的设计中,该装置包括:处理器、收发器、存储器;该存储器用于存储计算机执行指令,该收发器用于实现该装置与其他通信实体进行通信,该处理器与该存储器通过该总线连接,当该装置运行时,该处理器执行该存储器存储的该计算机执行指令,以使该装置执行上文所描述的服务器异常的处理方法。
本发明实施例还提供一种计算机存储介质,所述存储介质中存储软件程序,该软件程序在被一个或多个处理器读取并执行时实现上述各种可能的实现方式中所描述的服务器异常的处理方法。
本发明实施例还提供一种包含指令的计算机程序产品,当其在计算机上运行时,使得计算机执行上述各种可能的实现方式中所描述的服务器异常的处理方法。
附图说明
为了更清楚地说明本发明实施例中的技术方案,下面将对实施例描述中所需要使用的附图作简要介绍。
图1为本发明实施例适用的一种系统架构示意图;
图2为本发明实施例提供的一种服务器异常的处理方法所对应的流程示意图;
图3为本发明实施例提供一种管理设备的结构示意图;
图4为本发明实施例提供的一种服务器的结构示意图。
具体实施方式
下面结合说明书附图对本申请进行具体说明,方法实施例中的具体操作方法也可以应用于装置实施例中。
图1示例性示出了本发明实施例适用的一种系统架构示意图,如图1所示,本发明实施例适用的系统100可以包括管理设备101和多个服务器,例如图1中示出的服务器1021、服务器1022和服务器1023。管理设备101可以通过网络与任一服务器通信,例如,图1中示出的管理设备101可以通过网络从服务器1021中获取信息以及向服务器1021发送消息,也可以通过网络从服务器1022中获取信息以及向服务器1022发送消息,还可以通过网络从服务器1023中获取信息以及向服务器1023发送消息。
基于图1所示的系统架构,图2示例性示出了本发明实施例提供的一种服务器异常的处理方法所对应的流程示意图,具体包括以下步骤:
步骤201,管理设备获取各服务器的运行信息。
步骤202,管理设备根据各服务器的运行信息,若确定第一服务器出现异常,则根据第一服务器的运行信息,以及预先配置的异常的运行信息和处理方式的对应关系,确定目标处理方式。
步骤203,管理设备根据目标处理方式,生成处理指令。
步骤204,管理设备将处理指令发送给第一服务器。
步骤205,第一服务器接收管理设备发送的处理指令。
步骤206,服务器执行处理指令,并采用目标处理方式进行处理。
如此,管理设备在确定服务器出现异常后,可以根据获取到服务器的异常信息以及预先配置的异常信息和处理方式的对应关系,确定目标处理方式,进而可以根据目标处理方式生成处理指令并发送给异常服务器,以使异常服务器 执行处理指令,并采用目标处理方式进行处理。整个过程无需人工参与,管理设备自动地根据各服务器的运行信息,判断服务器是否出现异常,并且可以自动向异常服务器发送处理指令,以使异常服务器自动进行处理。可见,相比于现有技术中人工处理服务器异常的方式而言,本发明实施例中,管理设备可以自动地根据各服务的运行信息完成服务器异常检测的操作,各服务器也可以根据接收到的处理指令完成异常处理的操作;进一步地,采用本发明实施例提供的技术方案,可以由被动的等待人工处理变成主动的自动化分析处理,从而减少服务器异常的处理时间,提高服务质量的稳定性。
具体来说,步骤201和步骤202中,服务器的运行信息可以包括每个服务器的标识和每个服务器的运行状态。
管理设备可以通过多种方式来确定服务器是否出现异常,一种可能的实现方式为,管理设备可以获取每个服务器的标识和每个服务器的运行状态;进而,管理设备可以根据每个服务器的标识和每个服务器的运行状态,判断是否存在运行状态为异常状态的第一服务器,若存在,则可以确定第一服务器出现异常。
在其它可能的实现方式中,管理设备也可以从外部监控设备中确定出现异常的服务器,而外部监控设备可以获取每个服务器的标识和每个服务器的运行状态,进而确定出现异常的服务器。
进一步地,运行信息还可以包括异常信息、服务器的标识以及时间戳信息等内容。异常信息又可以包括软件异常的类型和硬件异常的类型,软件异常的类型可以是指由软件服务引起的异常,比如用于缓存加速的squid服务、进行负载均衡调度的haproxy服务等软件服务器引起的异常;硬件异常的类型可以是指由硬件服务引起的异常,比如服务器宕机。服务器的标识可以用于识别服务器,服务器的标识可以以字母、数字或符号等形式表示。时间戳信息可以是指管理设备获取到异常信息的时间。
如表1所示,为服务器的异常信息的一种示例。其中,第1条异常信息为:服务器A的squid服务引起异常,管理设备获取到第1条异常信息的时间为 8:00:00;第2条异常信息为:服务器B的haproxy服务引起异常,管理设备获取到第2条异常信息的时间为9:00:00;第3条异常信息为:服务器C的出现服务器宕机,管理设备获取到第3条异常信息的时间为10:00:00。从表1中可以看出,第1条异常信息和第2条异常信息都属于软件异常的类型,第3条异常信息属于硬件异常的类型。
表1:服务器的异常信息的一种示例
异常信息的编号 服务器的标识 异常类型信息 时间戳信息
1 服务器A squid服务引起异常 8:00:00
2 服务器B haproxy服务引起异常 9:00:00
3 服务器C 服务器宕机 10:00:00
在执行步骤202之前,管理设备可以先判断服务器对应的异常类型信息是否为软件异常的类型,若是,则执行后续步骤;否则,流程终止。这样做的原因是,对于服务器宕机引起的异常,可以采用宕机自动处理的方式,本发明实施例主要解决由软件服务引起的服务器异常的处理方式。
步骤203至步骤206中,管理设备可以预先配置有异常信息和处理方式的对应关系。通过个性化配置,能够适应软件版本不断变更过程中处理方案的改变,从而适应服务器异常的原因的多样性。进一步地,由于当前软件版本更迭速度较快,在不同的软件版本中可能出现类似的异常原因,但对于不同的软件版本实际上需要采用不同的处理方式,针对这种情况,本发明实施例可以通过预先配置的方式对这种情况进行适配,有效避免当软件版本更迭后,需要通过修改代码的方式来更新自动分析处理的情况。
具体地,管理设备可以预先配置有软件异常的类型和分析指令的对应关系,通过软件异常的类型和分析指令的对应关系,可以确定不同的软件异常的类型所对应的不同的分析指令,进而管理设备可以将分析指令发送给第一服务器,以使第一服务器执行分析指令来进行分析。其中,分析指令可以包括分析的命令、需要得到的回执信息、以及根据不同的回执信息,需要指定哪一个处理指令进行处理。分析指令可以用于对引起软件异常的原因进行分析。管理设备可 以先确定软件异常的类型,其中,软件异常的类型可以为squid服务,或者也可以为haproxy服务,或者还可以为其它类型的服务。一般情况下,需要预先配置的软件异常的类型通常是针对会影响用户使用的重要服务配置,从而减少对于我们加速服务的影响,对于一些重要度较低的其他服务,也可以针对性地进行配置,但是重要度较低的其他服务对应的优先级小于重要服务的优先级。
进一步地,第一服务器执行分析指令后,可以根据执行结果确定回执信息,并且可以将回执信息返回给管理设备。管理设备在接收到回执信息后,可以根据回执信息进一步确定第一服务器是否真的出现异常。如果是,则根据回执信息确定软件异常的原因,进而根据预先配置的异常原因和处理方式的对应关系,确定出目标处理方式;如果不是,则返回上述步骤201。
更进一步地,管理设备确定出目标处理方式后,可以根据目标处理方式生成处理指令,并将处理指令发送给第一服务器;第一服务器接收到处理指令后,可以执行该处理指令,第一服务器执行该处理指令即表示采用目标处理方式对异常进行处理。
具体实施过程中,针对第一目标分析指令,管理设备将第一目标分析指令发送给第一服务器后,第一服务器可以执行第一目标分析指令,并根据执行结果生成第一回执信息;进而,第一服务器可以向管理设备发送第一回执信息,如此,管理设备根据第一回执信息,判断是否由第一方面的原因引起软件异常,如果是,则根据第一方面的原因,以及异常原因和处理方式的对应关系,确定目标处理方式,如果不是,则管理设备可以向第一服务器发送第二目标分析指令,第一服务器可以执行第二目标分析指令,并根据执行结果生成第二回执信息;进而,第一服务器可以向管理设备发送第二回执信息,直至管理设备根据第二回执信息,确定是由第二方面的原因引起软件异常。其中,第一目标分析指令为至少一个目标分析指令中任一目标分析指令,第一目标分析指令用于对引起软件异常的第一方面的原因进行分析;第二目标分析指令为至少一个目标分析指令中除第一目标分析指令以外的任一目标分析指令,第二目标分析指令 用于对引起软件异常的所述第二方面的原因进行分析。
如表2所示,为软件异常的类型和分析指令的对应关系的一种示例。其中,若软件异常的类型为squid服务引起的异常,则对应的分析指令可以为分析指令A1和分析指令A2,而分析指令A1可以只对squid服务引起软件异常的第一方面的原因进行分析,分析指令A2可以只对squid服务引起软件异常的第二方面的原因进行分析;若软件异常的类型为haproxy服务引起的异常,则对应的分析指令可以为分析指令B1、分析指令B2和分析指令B3,而分析指令B1可以只对haproxy服务引起软件异常的第一方面的原因进行分析,分析指令B2可以只对haproxy服务引起软件异常的第二方面的原因进行分析,分析指令B3可以只对haproxy服务引起软件异常的第三方面的原因进行分析。需要说明的是,squid服务引起的异常对应的分析指令(即A1和A2)与haproxy服务引起的异常对应的分析指令(即B1、B2和B3)之间是相互独立的,没有联系。
表2:软件异常的类型和分析指令的对应关系的一种示例
Figure PCTCN2019072176-appb-000001
进一步地,具体的分析过程中,若软件异常的类型为squid服务引起的异常,根据表2示出的内容,管理设备可以确定目标分析指令包括分析指令A1和A2,进一步地,管理设备可以先将分析指令A1发送给第一服务器,第一服务器执行分析指令A1后,根据执行结果生成第一回执信息,并将第一回执信息返回给管理设备;如此,管理设备可以根据第一回执信息判断是否由第一方面的原因引起软件异常;进而,管理设备若根据分析指令A1的第一回执信息确定不是由第一方面的原因引起软件异常,则可以将分析指令A2发送给第一服务器,第一服务器执行分析指令A2后,根据执行结果生成第二回执信息,并将第二回执信息返回给管理设备;进而,管理设备可以根据第二回执信息判断是否由第二方面 的原因引起软件异常。类似地,若软件异常的类型为haproxy服务引起的异常,也可以采用上述流程进行分析。
管理设备中还可以预先配置有异常原因和处理方式的对应关系。如此,管理设备接收到回执信息后,可以根据回执信息确定出引起软件异常的原因,以及异常原因和处理方式的对应关系,确定不同的异常原因所对应的不同的处理方式,进而可以根据确定出的处理方式,生成处理指令并发送给第一服务器,以使第一服务器执行处理指令,即采用确定出的处理方式来进行处理。如表3所示,为异常原因和处理方式的对应关系的一种示例。其中,若确定是由第一方面的原因引起软件异常,则可以采用处理方式a1进行处理;若确定是由第二方面的原因引起软件异常,则可以采用处理方式a2进行处理;若确定是由第三方面的原因引起软件异常,则可以采用处理方式b1进行处理;若确定是由第四方面的原因引起软件异常,则可以采用处理方式b2进行处理;若确定是由第五方面的原因引起软件异常,则可以采用处理方式b3进行处理。
表3:异常原因和处理方式的对应关系的一种示例
异常原因 处理方式
第一方面的原因 处理方式a1
第二方面的原因 处理方式a2
第三方面的原因 处理方式b1
第四方面的原因 处理方式b2
第五方面的原因 处理方式b3
也就是说,管理设备可以先根据第一服务器对应的软件异常的类型,以及预先配置的软件异常的类型和分析指令的对应关系,确定至少一个目标分析指令,管理设备在将目标分析指令发送给第一服务器后,可以根据第一服务器返回的回执信息,确定出引起软件异常的原因后,进而,管理设备可以根据异常原因和处理方式的对应关系,确定目标处理方式,并生成处理指令,在管理设备将处理指令发送给第一服务器后,第一服务器可以执行处理指令来进行异常处理。举个例子,假设squid服务异常对应的分析指令A1为:sysCheckForSquid,执行分析指令A1对应两个回执信息,回执信息1为:cache_dir,回执信息2为: dead;并且,回执信息1对应的处理方式1和处理方式2,回执信息2对应处理方式2,其中,处理方式1为:创建缓存目录命令,处理方式2为:service squid restart。当管理设备根据第一服务器的运行信息确定第一服务器出现squid服务异常的时,首先根据预先配置的软件异常的类型和分析指令的对应关系,确定目标分析指令为分析指令A1(即sysCheckForSquid);管理设备可以将分析指令A1发送给第一服务器,第一服务器可以执行分析指令A1,根据执行结果得到对应的回执信息,并将分析指令A1对应的回执信息返回给管理设备。管理设备在接收到分析指令A1对应的回执信息,可以根据回执信息确定出软件异常的原因,若回执信息为:xxxxx cache_dir xxxx,则说明软件异常的原因为“缓存目录为空”,进而,可以预先配置的异常原因和处理方式的对应关系,确定目标处理方式为处理方式1和处理方式2,那么,管理设备就可以根据处理方式1和处理方式2生成处理指令并发送给第一服务器,第一服务器接收到处理指令后,可以创建缓存目录并重启squid服务;若回执信息为:xxxxx cache_dir xxxxxxxx deadxxx,则说明软件异常的原因为“服务异常”,进而,可以预先配置的异常原因和处理方式的对应关系,确定目标处理方式为处理方式2,那么,管理设备就可以根据处理方式2生成处理指令并发送给第一服务器,第一服务器接收到处理指令后,可以执行重启squid服务。
在执行步骤203之后,可以进一步判断第一服务器的异常是否解除。具体地,第一服务器可以向管理设备发送更新后的运行信息,与上述步骤201中类似地,更新后的运行信息包括第一服务器对应的更新后的运行状态;管理设备在接收到更新后运行信息后,可以判断第一服务器对应的更新后的运行状态是否为异常状态的第一服务器,如果是,则说明第一服务器的异常未解除;如果不是,则说明第一服务器的异常解除。
进一步地,管理设备若确定服务器的异常解除,则可以记录处理信息,处理信息可以用于后续分析异常发生的频率,便于相关软件的改善,通过分析软件本身的漏洞,结合异常的服务器分析软件可优化的方向,进一步对软件本身 进行完善,同样也可以发现更好的处理方式,优化处理指令的配置,从而不断完善软件,完善上述服务器异常的处理过程。
进一步地,管理设备若确定服务器的异常未解除,则对异常时间(即管理设备获取到异常信息的时间与当前时间之间的时间差值)做一个判断,若异常时间大于预设阈值,则直接对服务器进行“登记故障”的操作,并发送邮件通知相关软件管理员,若异常时间小于预设阈值,则可以等待预设时间后再次对服务器进行检测。其中,预设阈值可以是本领域技术人员根据经验和实际情况确定的,比如可以是根据当前人工分析处理时间确定的,具体不做限定。
基于同样的发明构思,图3示例性示出了本发明实施例提供的一种管理设备的结构示意图,如图3所示,该管理设备300包括接收单元301、处理单元302和发送单元303;其中,
接收单元301,用于获取各服务器的运行信息;
处理单元302,用于根据所述各服务器的运行信息,若确定第一服务器出现异常,则根据所述第一服务器的运行信息,以及预先配置的异常的运行信息和处理方式的对应关系,确定目标处理方式;以及根据所述目标处理方式,生成处理指令;
发送单元303,用于将所述处理指令发送给所述第一服务器,以使所述第一服务器执行所述处理指令,并采用所述目标处理方式进行处理。
在一种可能的实现方式中,所述运行信息包括异常信息,所述异常信息包括软件异常的类型;
所述管理设备处理单元302具体用于:
根据所述第一服务器对应的软件异常的类型,以及预先配置的软件异常的类型和分析指令的对应关系,确定至少一个目标分析指令;所述分析指令用于对引起软件异常的原因进行分析;
所述发送单元303具体用于:
将所述至少一个目标分析指令发送给所述第一服务器,以使所述第一服务 器执行所述目标分析指令,并根据执行结果生成回执信息;
所述接收单元301具体用于:
接收所述第一服务器发送的至少一个回执信息;
所述处理单元302还具体用于:
根据所述回执信息确定出引起软件异常的原因后,根据预先配置的异常原因和处理方式的对应关系,确定目标处理方式。
在一种可能的实现方式中,所述发送单元303具体用于:
针对第一目标分析指令,将所述第一目标分析指令发送给所述第一服务器,以使所述第一服务器执行所述第一目标分析指令,并根据执行结果生成第一回执信息;所述第一目标分析指令为所述至少一个目标分析指令中任一目标分析指令,所述第一目标分析指令用于对引起软件异常的第一方面的原因进行分析;
所述处理单元302具体用于:
根据所述第一回执信息,若确定由所述第一方面的原因引起软件异常,则根据所述第一方面的原因,以及异常原因和处理方式的对应关系,确定目标处理方式。
在一种可能的实现方式中,所述处理单元302还用于:
根据所述第一回执信息,若确定不由所述第一方面的原因引起软件异常,则向所述第一服务器发送第二目标分析指令,直至根据第二回执信息,确定是由第二方面的原因引起软件异常;所述第二目标分析指令为所述至少一个目标分析指令中除所述第一目标分析指令以外的任一目标分析指令,所述第二目标分析指令用于对引起软件异常的所述第二方面的原因进行分析,所述第二回执信息是所述第一服务器执行所述第二目标分析指令后根据执行结果生成的。
在一种可能的实现方式中,所述发送单元303将所述处理指令发送给所述第一服务器之后,所述接收单元301还用于:
接收所述第一服务器发送更新后的运行信息;
所述处理单元302还用于:
根据所述更新后的运行信息,若确定所述第一服务器未出现异常,则记录所述第一服务器对应的历史处理记录,所述历史处理记录用于更新所述预先配置的异常的运行信息和处理方式的对应关系。
基于同样的发明构思,图4示例性示出了本发明实施例提供的一种服务器的结构示意图,如图4所示,该服务器400包括接收单元401、处理单元402和发送单元403;其中,
接收单元401,用于接收管理设备发送的处理指令,所述处理指令是根据目标处理方式生成的,所述目标处理方式是所述管理设备在确定所述服务器出现异常后,根据所述服务器的运行信息,以及预先配置的异常的运行信息和处理方式的对应关系确定的;
处理单元402,用于执行所述处理指令,并采用所述目标处理方式进行处理。
在一种可能的实现方式中,所述运行信息包括异常信息,所述异常信息包括软件异常的类型;
所述接收单元401在接收管理设备发送的处理指令之前,还用于:
接收所述管理设备发送的至少一个目标分析指令;所述目标分析指令是所述管理设备根据所述服务器对应的软件异常的类型,以及预先配置的软件异常的类型和分析指令的对应关系确定的,所述分析指令用于对引起软件异常的原因进行分析;
所述处理单元402还用于:
执行所述目标分析指令,并根据执行结果生成回执信息;
发送单元403还用于:
向所述管理设备发送至少一个回执信息,以使所述管理设备根据所述回执信息确定出引起软件异常的原因后,根据预先配置的异常原因和处理方式的对应关系,确定所述目标处理方式。
在一种可能的实现方式中,所述处理单元402具体用于:
针对第一目标分析指令,执行所述第一目标分析指令,并根据执行结果生成第一回执信息;所述第一目标分析指令为所述至少一个目标分析指令中任一目标分析指令,所述第一目标分析指令用于对引起软件异常的第一方面的原因进行分析;
所述发送单元403具体用于:
向所述管理设备发送所述第一回执信息,以使所述管理设备根据所述第一回执信息,若确定由所述第一方面的原因引起软件异常,则根据所述第一方面的原因,以及异常原因和处理方式的对应关系,确定目标处理方式。
在一种可能的实现方式中,所述接收单元401还用于:
接收所述管理设备发送的第二目标分析指令;所述第二目标分析指令为所述至少一个目标分析指令中除所述第一目标分析指令以外的任一目标分析指令,所述第二目标分析指令用于对引起软件异常的所述第二方面的原因进行分析;
所述处理单元402还用于:
执行所述第二目标分析指令,并根据执行结果生成第二回执信息;
所述发送单元403还用于:
向所述管理设备发送所述第二回执信息。
在一种可能的实现方式中,所述处理单元402执行所述处理指令之后,所述发送单元403还用于:
向所述管理设备发送新后的运行信息,以使所述管理设备根据所述更新后的运行信息,在确定所述第一服务器未出现异常后,记录所述服务器对应的历史处理记录,所述历史处理记录用于更新所述预先配置的异常的运行信息和处理方式的对应关系。
本申请实施例的还提供一种装置,该装置具有实现上文所描述的服务器异常的处理方法的功能。该功能可以通过硬件执行相应的软件实现,在一种可能的设计中,该装置包括:处理器、收发器、存储器;该存储器用于存储计算机 执行指令,该收发器用于实现该装置与其他通信实体进行通信,该处理器与该存储器通过该总线连接,当该装置运行时,该处理器执行该存储器存储的该计算机执行指令,以使该装置执行上文所描述的服务器异常的处理方法。
本发明实施例还提供一种计算机存储介质,所述存储介质中存储软件程序,该软件程序在被一个或多个处理器读取并执行时实现上述各种可能的实现方式中所描述的服务器异常的处理方法。
本发明实施例还提供一种包含指令的计算机程序产品,当其在计算机上运行时,使得计算机执行上述各种可能的实现方式中所描述的服务器异常的处理方法。
本领域内的技术人员应明白,本发明的实施例可提供为方法、系统、或计算机程序产品。因此,本发明可采用完全硬件实施例、完全软件实施例、或结合软件和硬件方面的实施例的形式。而且,本发明可采用在一个或多个其中包含有计算机可用程序代码的计算机可用存储介质(包括但不限于磁盘存储器、CD-ROM、光学存储器等)上实施的计算机程序产品的形式。
本发明是参照根据本发明实施例的方法、设备(系统)、和计算机程序产品的流程图和/或方框图来描述的。应理解可由计算机程序指令实现流程图和/或方框图中的每一流程和/或方框、以及流程图和/或方框图中的流程和/或方框的结合。可提供这些计算机程序指令到通用计算机、专用计算机、嵌入式处理机或其他可编程数据处理设备的处理器以产生一个机器,使得通过计算机或其他可编程数据处理设备的处理器执行的指令产生用于实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能的装置。
这些计算机程序指令也可存储在能引导计算机或其他可编程数据处理设备以特定方式工作的计算机可读存储器中,使得存储在该计算机可读存储器中的指令产生包括指令装置的制造品,该指令装置实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能。
这些计算机程序指令也可装载到计算机或其他可编程数据处理设备上,使 得在计算机或其他可编程设备上执行一系列操作步骤以产生计算机实现的处理,从而在计算机或其他可编程设备上执行的指令提供用于实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能的步骤。
尽管已描述了本发明的优选实施例,但本领域内的技术人员一旦得知了基本创造性概念,则可对这些实施例作出另外的变更和修改。所以,所附权利要求意欲解释为包括优选实施例以及落入本发明范围的所有变更和修改。
显然,本领域的技术人员可以对本发明进行各种改动和变型而不脱离本发明的精神和范围。这样,倘若本发明的这些修改和变型属于本发明权利要求及其等同技术的范围之内,则本发明也意图包含这些改动和变型在内。

Claims (17)

  1. 一种服务器异常的处理方法,其特征在于,所述方法包括:
    管理设备获取各服务器的运行信息;
    所述管理设备根据所述各服务器的运行信息,若确定第一服务器出现异常,则根据所述第一服务器的运行信息,以及预先配置的异常的运行信息和处理方式的对应关系,确定目标处理方式;
    所述管理设备根据所述目标处理方式,生成处理指令,并将所述处理指令发送给所述第一服务器,以使所述第一服务器执行所述处理指令,并采用所述目标处理方式进行处理。
  2. 根据权利要求1所述的方法,其特征在于,所述运行信息包括软件异常的类型;
    所述管理设备根据所述第一服务器的运行信息,以及预先配置的异常的运行信息和处理方式的对应关系,确定目标处理方式,包括:
    所述管理设备根据所述第一服务器对应的软件异常的类型,以及预先配置的软件异常的类型和分析指令的对应关系,确定至少一个目标分析指令;所述分析指令用于对引起软件异常的原因进行分析;
    所述管理设备将所述至少一个目标分析指令发送给所述第一服务器,以使所述第一服务器执行所述目标分析指令,并根据执行结果生成回执信息;
    所述管理设备接收到所述第一服务器发送的至少一个回执信息后,根据所述回执信息确定出引起软件异常的原因后,根据预先配置的异常原因和处理方式的对应关系,确定目标处理方式。
  3. 根据权利要求2所述的方法,其特征在于,所述管理设备将所述至少一个目标分析指令发送给所述第一服务器,以使所述第一服务器执行所述目标分析指令,并根据执行结果生成回执信息,包括:
    针对第一目标分析指令,所述管理设备将所述第一目标分析指令发送给所述第一服务器,以使所述第一服务器执行所述第一目标分析指令,并根据执行 结果生成第一回执信息;所述第一目标分析指令为所述至少一个目标分析指令中任一目标分析指令,所述第一目标分析指令用于对引起软件异常的第一方面的原因进行分析;
    所述管理设备根据所述回执信息确定出引起软件异常的原因后,根据异常原因和处理方式的对应关系,确定目标处理方式,包括:
    所述管理设备根据所述第一回执信息,若确定由所述第一方面的原因引起软件异常,则根据所述第一方面的原因,以及异常原因和处理方式的对应关系,确定目标处理方式。
  4. 根据权利要求3所述的方法,其特征在于,还包括:
    所述管理设备根据所述第一回执信息,若确定不由所述第一方面的原因引起软件异常,则向所述第一服务器发送第二目标分析指令,直至根据第二回执信息,确定是由第二方面的原因引起软件异常;所述第二目标分析指令为所述至少一个目标分析指令中除所述第一目标分析指令以外的任一目标分析指令,所述第二目标分析指令用于对引起软件异常的所述第二方面的原因进行分析,所述第二回执信息是所述第一服务器执行所述第二目标分析指令后根据执行结果生成的。
  5. 根据权利要求1至3中任一项所述的方法,其特征在于,所述管理设备根将所述处理指令发送给所述第一服务器之后,还包括:
    所述管理设备获取所述第一服务器更新后的运行信息;
    所述管理设备根据所述更新后的运行信息,若确定所述第一服务器未出现异常,则记录所述第一服务器对应的历史处理记录,所述历史处理记录用于更新所述预先配置的异常的运行信息和处理方式的对应关系。
  6. 一种服务器异常的处理方法,其特征在于,所述方法包括:
    服务器接收管理设备发送的处理指令,所述处理指令是根据目标处理方式生成的,所述目标处理方式是所述管理设备在确定所述服务器出现异常后,根据所述服务器的运行信息,以及预先配置的异常的运行信息和处理方式的对应 关系确定的;
    所述服务器执行所述处理指令,并采用所述目标处理方式进行处理。
  7. 根据权利要求6所述的方法,其特征在于,所述运行信息包括软件异常的类型;
    所述服务器接收管理设备发送的处理指令之前,还包括:
    所述服务器接收所述管理设备发送的至少一个目标分析指令;所述目标分析指令是所述管理设备根据所述服务器对应的软件异常的类型,以及预先配置的软件异常的类型和分析指令的对应关系确定的,所述分析指令用于对引起软件异常的原因进行分析;
    所述服务器执行所述目标分析指令,并根据执行结果生成回执信息;
    所述服务器向所述管理设备发送至少一个回执信息,以使所述管理设备根据所述回执信息确定出引起软件异常的原因后,根据预先配置的异常原因和处理方式的对应关系,确定所述目标处理方式。
  8. 根据权利要求7所述的方法,其特征在于,所述服务器执行所述目标分析指令,并根据执行结果生成回执信息,包括:
    针对第一目标分析指令,所述服务器执行所述第一目标分析指令,并根据执行结果生成第一回执信息;所述第一目标分析指令为所述至少一个目标分析指令中任一目标分析指令,所述第一目标分析指令用于对引起软件异常的第一方面的原因进行分析;
    所述服务器向所述管理设备发送至少一个回执信息,以使所述管理设备根据所述回执信息确定出引起软件异常的原因后,根据异常原因和处理方式的对应关系,确定所述目标处理方式,包括:
    所述服务器向所述管理设备发送所述第一回执信息,以使所述管理设备根据所述第一回执信息,若确定由所述第一方面的原因引起软件异常,则根据所述第一方面的原因,以及异常原因和处理方式的对应关系,确定目标处理方式。
  9. 根据权利要求8所述的方法,其特征在于,还包括:
    所述服务器接收所述管理设备发送的第二目标分析指令;所述第二目标分析指令为所述至少一个目标分析指令中除所述第一目标分析指令以外的任一目标分析指令,所述第二目标分析指令用于对引起软件异常的所述第二方面的原因进行分析;
    所述服务器执行所述第二目标分析指令,并根据执行结果生成第二回执信息;
    所述服务器向所述管理设备发送所述第二回执信息。
  10. 根据权利要求6至9中任一项所述的方法,其特征在于,所述服务器执行所述处理指令之后,还包括:
    所述服务器向所述管理设备发送更新后的运行信息,以使所述管理设备根据所述更新后的运行信息,在确定所述第一服务器未出现异常后,记录所述服务器对应的历史处理记录,所述历史处理记录用于更新所述预先配置的异常的运行信息和处理方式的对应关系。
  11. 一种管理设备,其特征在于,所述管理设备包括:
    接收单元,用于获取各服务器的运行信息;
    处理单元,用于根据所述各服务器的运行信息,若确定第一服务器出现异常,则根据所述第一服务器的运行信息,以及预先配置的异常的运行信息和处理方式的对应关系,确定目标处理方式;以及根据所述目标处理方式,生成处理指令;
    发送单元,用于将所述处理指令发送给所述第一服务器,以使所述第一服务器执行所述处理指令,并采用所述目标处理方式进行处理。
  12. 根据权利要求11所述的管理设备,其特征在于,所述运行信息软件异常的类型;
    所述管理设备处理单元具体用于:
    根据所述第一服务器对应的软件异常的类型,以及预先配置的软件异常的类型和分析指令的对应关系,确定至少一个目标分析指令;所述分析指令用于 对引起软件异常的原因进行分析;
    所述发送单元具体用于:
    将所述至少一个目标分析指令发送给所述第一服务器,以使所述第一服务器执行所述目标分析指令,并根据执行结果生成回执信息;
    所述接收单元具体用于:
    接收所述第一服务器发送的至少一个回执信息;
    所述处理单元还具体用于:
    根据所述回执信息确定出引起软件异常的原因后,根据预先配置的异常原因和处理方式的对应关系,确定目标处理方式。
  13. 根据权利要求12所述的管理设备,其特征在于,所述发送单元具体用于:
    针对第一目标分析指令,将所述第一目标分析指令发送给所述第一服务器,以使所述第一服务器执行所述第一目标分析指令,并根据执行结果生成第一回执信息;所述第一目标分析指令为所述至少一个目标分析指令中任一目标分析指令,所述第一目标分析指令用于对引起软件异常的第一方面的原因进行分析;
    所述处理单元具体用于:
    根据所述第一回执信息,若确定由所述第一方面的原因引起软件异常,则根据所述第一方面的原因,以及异常原因和处理方式的对应关系,确定目标处理方式。
  14. 根据权利要求13所述的管理设备,其特征在于,所述处理单元还用于:
    根据所述第一回执信息,若确定不由所述第一方面的原因引起软件异常,则向所述第一服务器发送第二目标分析指令,直至根据第二回执信息,确定是由第二方面的原因引起软件异常;所述第二目标分析指令为所述至少一个目标分析指令中除所述第一目标分析指令以外的任一目标分析指令,所述第二目标分析指令用于对引起软件异常的所述第二方面的原因进行分析,所述第二回执信息是所述第一服务器执行所述第二目标分析指令后根据执行结果生成的。
  15. 根据权利要求11至14中任一项所述的管理设备,其特征在于,所述发送单元将所述处理指令发送给所述第一服务器之后,所述接收单元还用于:
    接收所述第一服务器发送更新后的运行信息;
    所述处理单元还用于:
    根据所述更新后的运行信息,若确定所述第一服务器未出现异常,则记录所述第一服务器对应的历史处理记录,所述历史处理记录用于更新所述预先配置的异常的运行信息和处理方式的对应关系。
  16. 一种计算机可读存储介质,其特征在于,所述存储介质存储有指令,当所述指令在计算机上运行时,使得计算机实现执行权利要求1至10中任一项所述的方法。
  17. 一种计算机设备,其特征在于,包括:
    存储器,用于存储程序指令;
    处理器,用于调用所述存储器中存储的程序指令,按照获得的程序执行如权利要求1至10中任一权利要求所述的方法。
PCT/CN2019/072176 2018-12-29 2019-01-17 一种服务器异常的处理方法及管理设备 WO2020133609A1 (zh)

Priority Applications (2)

Application Number Priority Date Filing Date Title
EP19906538.4A EP3905591A4 (en) 2018-12-29 2019-01-17 METHOD AND APPARATUS FOR PROCESSING SERVER ANOMALIES
US17/293,793 US20220006716A1 (en) 2018-12-29 2019-01-17 Method and managing apparatus for processing server anomalies

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201811648017.2 2018-12-29
CN201811648017.2A CN109714214B (zh) 2018-12-29 2018-12-29 一种服务器异常的处理方法及管理设备

Publications (1)

Publication Number Publication Date
WO2020133609A1 true WO2020133609A1 (zh) 2020-07-02

Family

ID=66260507

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2019/072176 WO2020133609A1 (zh) 2018-12-29 2019-01-17 一种服务器异常的处理方法及管理设备

Country Status (4)

Country Link
US (1) US20220006716A1 (zh)
EP (1) EP3905591A4 (zh)
CN (1) CN109714214B (zh)
WO (1) WO2020133609A1 (zh)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112561650A (zh) * 2020-12-17 2021-03-26 深圳希施玛数据科技有限公司 一种订单服务请求的处理系统

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110852876A (zh) * 2019-11-18 2020-02-28 中国银行股份有限公司 批量报错恢复方法及装置
CN114448793B (zh) * 2022-01-28 2024-01-26 中国工商银行股份有限公司 一种服务器管理操作方法及装置

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8132043B2 (en) * 2009-12-17 2012-03-06 Symantec Corporation Multistage system recovery framework
CN106130816A (zh) * 2016-06-24 2016-11-16 腾讯科技(深圳)有限公司 一种内容分发网络监控方法、监控服务器及系统
CN103685459B (zh) * 2012-09-24 2017-07-28 日本电气株式会社 分布式系统、服务器计算机、分布式管理服务器和故障防止方法
CN107306278A (zh) * 2016-04-18 2017-10-31 阿里巴巴集团控股有限公司 故障检测方法、全局负载均衡服务器和常驻网关
CN108491305A (zh) * 2018-03-09 2018-09-04 网宿科技股份有限公司 一种服务器故障的检测方法及系统

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6327677B1 (en) * 1998-04-27 2001-12-04 Proactive Networks Method and apparatus for monitoring a network environment
US6738933B2 (en) * 2001-05-09 2004-05-18 Mercury Interactive Corporation Root cause analysis of server system performance degradations
AU2002359925B2 (en) * 2002-12-26 2008-10-23 Fujitsu Limited Operation managing method and operation managing server
US20050005572A1 (en) * 2003-07-11 2005-01-13 Perkins Darcie Woodruff Window box and mounting system
US7760643B2 (en) * 2007-04-09 2010-07-20 Telcordia Technologies, Inc. Automatic policy change management scheme for DiffServ-enabled MPLS networks
CN103780418A (zh) * 2012-10-24 2014-05-07 鸿富锦精密工业(深圳)有限公司 服务器自动管理方法及系统
CN104683154A (zh) * 2015-03-09 2015-06-03 联想(北京)有限公司 异常处理方法、终端设备以及服务器
US11138168B2 (en) * 2017-03-31 2021-10-05 Bank Of America Corporation Data analysis and support engine
CN107800783B (zh) * 2017-10-18 2020-11-24 阿里巴巴(中国)有限公司 远程监控服务器的方法及装置

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8132043B2 (en) * 2009-12-17 2012-03-06 Symantec Corporation Multistage system recovery framework
CN103685459B (zh) * 2012-09-24 2017-07-28 日本电气株式会社 分布式系统、服务器计算机、分布式管理服务器和故障防止方法
CN107306278A (zh) * 2016-04-18 2017-10-31 阿里巴巴集团控股有限公司 故障检测方法、全局负载均衡服务器和常驻网关
CN106130816A (zh) * 2016-06-24 2016-11-16 腾讯科技(深圳)有限公司 一种内容分发网络监控方法、监控服务器及系统
CN108491305A (zh) * 2018-03-09 2018-09-04 网宿科技股份有限公司 一种服务器故障的检测方法及系统

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See also references of EP3905591A4 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112561650A (zh) * 2020-12-17 2021-03-26 深圳希施玛数据科技有限公司 一种订单服务请求的处理系统
CN112561650B (zh) * 2020-12-17 2024-02-02 深圳希施玛数据科技有限公司 一种订单服务请求的处理系统

Also Published As

Publication number Publication date
EP3905591A4 (en) 2022-02-16
CN109714214A (zh) 2019-05-03
CN109714214B (zh) 2021-08-27
EP3905591A1 (en) 2021-11-03
US20220006716A1 (en) 2022-01-06

Similar Documents

Publication Publication Date Title
US10348809B2 (en) Naming of distributed business transactions
US10102097B2 (en) Transaction server performance monitoring using component performance data
WO2020133609A1 (zh) 一种服务器异常的处理方法及管理设备
US11797877B2 (en) Automated self-healing of a computing process
WO2021169064A1 (zh) 一种基于边缘网络的异常处理方法及装置
EP3049968B1 (en) Master schema shared across multiple tenants with dynamic update
US20160042289A1 (en) Systems and methods for adaptive thresholding using maximum concentration intervals
WO2020259003A1 (zh) 一种链路追踪方法及装置
CN107547595B (zh) 云资源调度系统、方法及装置
US20160371122A1 (en) File processing workflow management
US11461679B2 (en) Message management using machine learning techniques
WO2019019394A1 (zh) 客户端异常定位方法、装置、计算机设备和存储介质
WO2018000878A1 (zh) 分布式任务处理方法和装置
US10764122B2 (en) Managing computing infrastructure events having different event notification formats
US20220286373A1 (en) Scalable real time metrics management
US10642585B1 (en) Enhancing API service schemes
CN105516266A (zh) 一种业务监控方法、系统及相关设备
CN111782456B (zh) 异常检测方法、装置、计算机设备和存储介质
CN114090113B (zh) 数据源处理插件动态加载的方法、装置、设备及存储介质
WO2021056739A1 (zh) 性能分析方法、装置、计算机设备及存储介质
US20120110500A1 (en) System and method for message grouping
CN109995571B (zh) 服务器配置与vnf应用匹配的方法及装置
CN108197253B (zh) 一种云监控平台的设备查询方法、装置和设备
CN112148762A (zh) 一种实时数据流的统计方法和装置
CN113420713A (zh) 异常监控方法、装置、电子设备及计算机可读介质

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19906538

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

ENP Entry into the national phase

Ref document number: 2019906538

Country of ref document: EP

Effective date: 20210727

ENP Entry into the national phase

Ref document number: 2019906538

Country of ref document: EP

Effective date: 20210729