CN117130850A - Fault diagnosis method, device, equipment and storage medium - Google Patents
Fault diagnosis method, device, equipment and storage medium Download PDFInfo
- Publication number
- CN117130850A CN117130850A CN202310920190.8A CN202310920190A CN117130850A CN 117130850 A CN117130850 A CN 117130850A CN 202310920190 A CN202310920190 A CN 202310920190A CN 117130850 A CN117130850 A CN 117130850A
- Authority
- CN
- China
- Prior art keywords
- fault
- interface
- target
- information
- diagnosis
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/30—Monitoring
- G06F11/3003—Monitoring arrangements specially adapted to the computing system or computing system component being monitored
- G06F11/302—Monitoring arrangements specially adapted to the computing system or computing system component being monitored where the computing system component is a software system
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/30—Monitoring
- G06F11/3065—Monitoring arrangements determined by the means or processing involved in reporting the monitored data
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/30—Monitoring
- G06F11/32—Monitoring with visual or acoustical indication of the functioning of the machine
- G06F11/321—Display for diagnostics, e.g. diagnostic result display, self-test user interface
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/30—Monitoring
- G06F11/32—Monitoring with visual or acoustical indication of the functioning of the machine
- G06F11/324—Display of status information
- G06F11/327—Alarm or error message display
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/30—Monitoring
- G06F11/32—Monitoring with visual or acoustical indication of the functioning of the machine
- G06F11/324—Display of status information
- G06F11/328—Computer systems status display
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Quality & Reliability (AREA)
- Computing Systems (AREA)
- Data Mining & Analysis (AREA)
- Computer Hardware Design (AREA)
- Human Computer Interaction (AREA)
- Mathematical Physics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Artificial Intelligence (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Evolutionary Biology (AREA)
- Evolutionary Computation (AREA)
- Test And Diagnosis Of Digital Computers (AREA)
Abstract
本发明提供了一种故障诊断方法、装置、设备及存储介质,该方法包括:响应于第一界面监测到系统故障发出的指示信号,切换至第二界面,获取第一界面的监测信息,对监测信息进行解析处理,确定出故障信息,其中,故障信息包括目标故障关键词和目标故障类别,根据故障信息的目标故障关键词和目标故障类别,在预设故障数据库确定出故障对应诊断方案,将诊断方案展示在第二界面。本发明实现自动化分析确定系统测试的故障,并给出解决建议,最终将诊断方案展示在第二界面供开发人员依据诊断方案解决测试故障,实现了故障诊断的智能化,提高了系统故障诊断的效率,大大缩短了新产品的研发测试周期,进一步提高了产品的开发效率。
The invention provides a fault diagnosis method, device, equipment and storage medium. The method includes: in response to the first interface monitoring an indication signal sent by a system fault, switching to the second interface, obtaining the monitoring information of the first interface, and The monitoring information is parsed and processed to determine the fault information. The fault information includes target fault keywords and target fault categories. According to the target fault keywords and target fault categories of the fault information, the corresponding diagnosis plan for the fault is determined in the preset fault database. Display the diagnostic plan on the second interface. The present invention realizes automated analysis to determine system test faults, gives solution suggestions, and finally displays the diagnostic solution on the second interface for developers to solve test faults based on the diagnostic solution, realizes intelligent fault diagnosis, and improves the efficiency of system fault diagnosis. efficiency, greatly shortening the R&D and testing cycle of new products and further improving product development efficiency.
Description
技术领域Technical field
本发明涉及服务器技术领域,特别是涉及一种故障诊断方法、装置、设备及存储介质。The present invention relates to the field of server technology, and in particular to a fault diagnosis method, device, equipment and storage medium.
背景技术Background technique
在服务器产品开发过程中,会对CPU及整机进行系统测试,全面的系统测试会测试出设计中存在的问题,发现故障并进行解决是保证设计质量的关键。对于任意一款新产品,系统测试中势必会遇到大量问题,为保证产品尽快上市,提高产品竞争力及效益,如何通过系统测试实现故障快速定位以解决故障是非常迫切的。During the development process of server products, system testing of the CPU and the entire machine will be conducted. Comprehensive system testing will detect problems in the design. Finding and solving faults is the key to ensuring design quality. For any new product, a large number of problems are bound to be encountered during system testing. In order to ensure that the product is launched as soon as possible and improve product competitiveness and efficiency, it is very urgent to quickly locate faults and solve them through system testing.
目前,现有的服务器架构下的故障诊断方案需单独收集各种故障日志,并逐行查看日志确定出问题出在哪,在查完日志定位故障之后开发人员根据个人经验解决故障,然而,由于开发人员日志查找时间以及故障诊断方式耗时严重,影响系统故障诊断的效率,使新产品的研发测试周期较长,进一步影响产品的开发效率。Currently, the fault diagnosis solution under the existing server architecture requires separately collecting various fault logs, and checking the logs line by line to determine where the problem lies. After checking the logs and locating the fault, developers solve the fault based on personal experience. However, due to Developer log search time and fault diagnosis methods are very time-consuming, affecting the efficiency of system fault diagnosis, making the development and testing cycle of new products longer, further affecting product development efficiency.
发明内容Contents of the invention
有鉴于此,本发明旨在提出一种故障诊断方法、装置、设备及存储介质,解决的问题,实现高效在线故障诊断,保证服务器稳定运行。In view of this, the present invention aims to propose a fault diagnosis method, device, equipment and storage medium to solve the problem, realize efficient online fault diagnosis, and ensure stable operation of the server.
依据本发明的第一方面,提供了一种故障诊断方法,应用于测试端,所述测试端与被测服务器连接,所述测试端包括第一界面和第二界面,所述第一界面和所述第二界面切换显示,所述方法包括:According to the first aspect of the present invention, a fault diagnosis method is provided, which is applied to a test terminal. The test terminal is connected to the server under test. The test terminal includes a first interface and a second interface. The first interface and The second interface switches display, and the method includes:
响应于所述第一界面监测到系统故障发出的指示信号,切换至所述第二界面,获取所述第一界面的监测信息;In response to the first interface detecting an indication signal sent by a system failure, switch to the second interface to obtain monitoring information of the first interface;
对所述监测信息进行解析处理,确定出故障信息,其中,所述故障信息包括目标故障关键词和目标故障类别;Analyze and process the monitoring information to determine fault information, where the fault information includes target fault keywords and target fault categories;
根据所述故障信息的目标故障关键词和目标故障类别,在预设故障数据库确定出故障对应诊断方案,将所述诊断方案展示在所述第二界面。According to the target fault keyword and the target fault category of the fault information, a fault corresponding diagnosis scheme is determined in the preset fault database, and the diagnosis scheme is displayed on the second interface.
可选地,所述对所述监测信息进行解析处理,确定出故障信息,包括:Optionally, the analyzing and processing of the monitoring information to determine the fault information includes:
在所述监测信息中对预设故障关键词进行匹配,确定出目标故障关键词及目标故障关键词在所述监测信息中的位置;Match the preset fault keywords in the monitoring information, and determine the target fault keyword and the position of the target fault keyword in the monitoring information;
对所述目标故障关键词所在行进行语句解析,确定目标故障类别;Perform statement analysis on the line where the target fault keyword is located to determine the target fault category;
将所述目标故障关键词和所述目标故障类别存储为故障信息。The target fault keyword and the target fault category are stored as fault information.
可选地,所述根据所述故障信息的目标故障关键词和目标故障类别,在预设故障数据库确定出故障对应诊断方案,将所述诊断方案展示在所述第二界面,包括:Optionally, determining a fault corresponding diagnosis solution in a preset fault database based on the target fault keyword and target fault category of the fault information, and displaying the diagnosis solution on the second interface, including:
在预设故障数据库中对所述故障信息的目标故障关键词进行匹配,得到匹配结果;Match the target fault keywords of the fault information in the preset fault database to obtain the matching results;
若所述匹配结果为目标故障关键词匹配成功,确定所述目标故障关键词匹配的多个故障类别;If the matching result is a successful match of the target fault keyword, determine multiple fault categories matched by the target fault keyword;
将多个所述故障类别按照与所述目标故障关键词的关联度进行优先级排序,确定至少一个目标故障类别;其中,所述目标故障类别是优先级满足预设条件确定的故障类别;Prioritize a plurality of fault categories according to their relevance to the target fault keyword to determine at least one target fault category; wherein the target fault category is a fault category whose priority satisfies preset conditions;
根据至少一个所述目标故障类别,在所述预设故障数据库中调用所述目标故障类别对应的诊断方案;According to at least one of the target fault categories, call the diagnostic solution corresponding to the target fault category in the preset fault database;
将所述诊断方案展示在所述第二界面的故障诊断显示页面。The diagnosis plan is displayed on the fault diagnosis display page of the second interface.
可选地,所述测试端分别与所述被测服务器和所述内网连接,所述根据至少一个所述目标故障类别,在所述预设故障数据库中调用所述目标故障类别对应的诊断方案,包括:Optionally, the test terminal is connected to the server under test and the intranet respectively, and according to at least one of the target fault categories, the diagnosis corresponding to the target fault category is called in the preset fault database. programs, including:
若确定至少一个所述目标故障类别,通过所述内网调用所述预设故障数据库;If at least one of the target fault categories is determined, calling the preset fault database through the intranet;
在所述预设故障数据库提取所述目标故障类别对应的故障诊断方案。A fault diagnosis solution corresponding to the target fault category is extracted from the preset fault database.
可选地,所述将所述诊断方案展示在所述第二界面的故障诊断显示页面之后,还包括:Optionally, displaying the diagnosis solution after the fault diagnosis display page of the second interface further includes:
将所述目标故障关键词和优先级满足预设条件的至少一个目标故障类别的对应关系存储至所述预设故障数据库;Store the corresponding relationship between the target fault keyword and at least one target fault category whose priority satisfies the preset condition in the preset fault database;
存储所述诊断方案的调用路径,其中,所述调用路径是所述测试端与所述预设故障数据库的连接路径,以使开发人员查阅所述诊断方案。Store the calling path of the diagnostic plan, where the calling path is the connection path between the test terminal and the preset fault database, so that developers can check the diagnostic plan.
可选地,所述响应于所述第一界面监测到系统故障发出的指示信号,切换至所述第二界面之前,还包括:Optionally, before switching to the second interface, before switching to the second interface, the method further includes:
在所述被测服务器进行系统测试的情况下,接收所述被测服务器上报的系统测试状态;When the server under test performs system testing, receive the system test status reported by the server under test;
将所述系统测试状态显示在所述第一界面的信息打印页面;Display the system test status on the information print page of the first interface;
若所述信息打印页面出现被测服务器故障告警,通过所述第一界面发出故障告警指示。If a fault alarm of the tested server appears on the information printing page, a fault alarm indication is issued through the first interface.
依据本发明的第二方面,提供了一种故障诊断装置,应用于测试端,所述测试端与被测服务器连接,所述测试端包括第一界面和第二界面,所述第一界面和所述第二界面切换显示,所述装置包括:According to the second aspect of the present invention, a fault diagnosis device is provided, which is applied to a test terminal. The test terminal is connected to the server under test. The test terminal includes a first interface and a second interface. The first interface and The second interface switches display, and the device includes:
获取监测信息模块,用于响应于所述第一界面监测到系统故障发出的指示信号,切换至所述第二界面,获取所述第一界面的监测信息;A module for obtaining monitoring information, configured to switch to the second interface in response to an indication signal from a system failure detected on the first interface, and obtain monitoring information of the first interface;
确定故障信息模块,用于对所述监测信息进行解析处理,确定出故障信息,其中,所述故障信息包括目标故障关键词和目标故障类别;A fault information determination module, configured to analyze and process the monitoring information and determine fault information, where the fault information includes target fault keywords and target fault categories;
确定诊断方案模块,用于根据所述故障信息的目标故障关键词和目标故障类别,在预设故障数据库确定出故障对应诊断方案,将所述诊断方案展示在所述第二界面。A diagnostic solution determining module is configured to determine a diagnostic solution corresponding to the fault in a preset fault database based on the target fault keyword and target fault category of the fault information, and display the diagnostic solution on the second interface.
根据本发明的第三方面,提供了一种故障诊断系统,所述系统包括测试端、被测服务器和内网,所述测试端分别与所述被测服务器和所述内网串口连接,所述测试端包括第一界面和第二界面,所述第一界面和所述第二界面切换显示;According to the third aspect of the present invention, a fault diagnosis system is provided. The system includes a test terminal, a server under test and an intranet. The test terminal is connected to the serial port of the server under test and the intranet respectively. The test terminal includes a first interface and a second interface, and the first interface and the second interface are switched for display;
所述测试端用于响应于所述第一界面监测到系统故障发出的指示信号,切换至所述第二界面,获取所述第一界面的监测信息,对所述监测信息进行解析处理,确定出故障信息,根据所述故障信息的目标故障关键词和目标故障类别,在预设故障数据库确定出故障对应诊断方案,将所述诊断方案展示在所述第二界面;The test terminal is configured to switch to the second interface in response to an indication signal from a system failure detected by the first interface, obtain the monitoring information of the first interface, analyze the monitoring information, and determine Fault information: determine the fault corresponding diagnosis scheme in the preset fault database according to the target fault keyword and target fault category of the fault information, and display the diagnosis scheme on the second interface;
所述被测服务器用于在进行系统测试的情况下上报系统测试状态至所述测试端的所述第一界面;The server under test is configured to report system test status to the first interface of the test terminal when performing system testing;
所述内网用于建立所述预设故障数据库与所述测试端的连接,用于通过所述内网在所述预设故障数据库中调用所述目标故障类别对应的诊断方案。The intranet is used to establish a connection between the preset fault database and the test terminal, and to call the diagnostic solution corresponding to the target fault category in the preset fault database through the intranet.
根据本发明的又一方面,还提供一种电子设备,包括:According to another aspect of the present invention, an electronic device is also provided, including:
处理器;processor;
用于存储所述处理器可执行指令的存储器;memory for storing instructions executable by the processor;
其中,所述处理器被配置为执行所述指令,以实现如上所述的故障诊断方法。Wherein, the processor is configured to execute the instructions to implement the fault diagnosis method as described above.
根据本发明的又一方面,还提供一种可读存储介质,所述可读存储介质上存储计算机程序,所述计算机程序被处理器执行时实现如上所述的故障诊断方法的步骤。According to yet another aspect of the present invention, a readable storage medium is also provided. A computer program is stored on the readable storage medium. When the computer program is executed by a processor, the steps of the fault diagnosis method as described above are implemented.
本发明实施例提供的一种故障诊断方法,测试端通过响应于第一界面监测到系统故障发出的指示信号,切换至第二界面,获取第一界面的监测信息,对监测信息进行解析处理,确定出故障信息,根据故障信息的目标故障关键词和目标故障类别,在预设故障数据库确定出故障对应诊断方案,将诊断方案展示在第二界面。本发明实施例利用测试端的多个显示界面,实时监测被测服务器,在第一界面监测到系统故障发出的指示信号后切换至第二界面进行故障诊断,通过对监测信息进行解析处理,准确锁定系统测试故障的故障关键词以及故障类别,再用故障关键词和故障类别在预设故障数据库直接确定出故障对应的有效诊断方案,能够对被测服务器系统测试的故障实现自动化的保存、分析并给出解决建议,最终将诊断方案展示在第二界面供开发人员依据诊断方案解决测试故障,实现了故障诊断的智能化,有效节省了开发人员人工查找日志以及故障诊断时间,提高了系统故障诊断的效率,大大缩短了新产品的研发测试周期,进一步提高了产品的开发效率。In a fault diagnosis method provided by an embodiment of the present invention, the test terminal switches to the second interface in response to the indication signal sent by the system fault detected by the first interface, obtains the monitoring information of the first interface, and analyzes the monitoring information. Determine the fault information, determine the corresponding fault diagnosis plan in the preset fault database according to the target fault keyword and target fault category of the fault information, and display the diagnosis plan on the second interface. The embodiment of the present invention uses multiple display interfaces on the test end to monitor the server under test in real time. After the first interface monitors the indication signal issued by the system failure, it switches to the second interface for fault diagnosis. By analyzing and processing the monitoring information, it can accurately lock The fault keywords and fault categories of system test faults are used, and then the fault keywords and fault categories are used to directly determine the effective diagnosis solution corresponding to the fault in the preset fault database, which can automatically save, analyze and analyze the faults tested by the server system under test. Provide solution suggestions, and finally display the diagnostic solution on the second interface for developers to solve test faults based on the diagnostic solution, realizing intelligent fault diagnosis, effectively saving developers manual search logs and fault diagnosis time, and improving system fault diagnosis The efficiency greatly shortens the development and testing cycle of new products and further improves product development efficiency.
上述说明仅是本发明技术方案的概述,为了能够更清楚了解本发明的技术手段,而可依照说明书的内容予以实施,并且为了让本发明的上述和其它目的、特征和优点能够更明显易懂,以下特举本发明的具体实施方式。The above description is only an overview of the technical solution of the present invention. In order to have a clearer understanding of the technical means of the present invention, it can be implemented according to the content of the description, and in order to make the above and other objects, features and advantages of the present invention more obvious and understandable. , the specific embodiments of the present invention are listed below.
附图说明Description of the drawings
通过阅读下文优选实施方式的详细描述,各种其他的优点和益处对于本领域普通技术人员将变得清楚明了。附图仅用于示出优选实施方式的目的,而并不认为是对本发明的限制。而且在整个附图中,用相同的参考符号表示相同的部件。在附图中:Various other advantages and benefits will become apparent to those of ordinary skill in the art upon reading the following detailed description of the preferred embodiments. The drawings are for the purpose of illustrating preferred embodiments only and are not to be construed as limiting the invention. Also throughout the drawings, the same reference characters are used to designate the same components. In the attached picture:
图1是本发明实施例提供的一种故障诊断方法的步骤流程图之一;Figure 1 is one of the step flow charts of a fault diagnosis method provided by an embodiment of the present invention;
图2是图1提供的本发明实施例的一种故障诊断方法的步骤102的流程图;Figure 2 is a flow chart of step 102 of a fault diagnosis method according to the embodiment of the present invention provided in Figure 1;
图3是图1提供的本发明实施例的一种故障诊断方法的步骤103的流程图;Figure 3 is a flow chart of step 103 of a fault diagnosis method according to the embodiment of the present invention provided in Figure 1;
图4是本发明实施例提供的一种故障诊断方法的步骤流程图之二;Figure 4 is the second step flow chart of a fault diagnosis method provided by an embodiment of the present invention;
图5是本发明实施例的一种故障诊断方法的测试端示意图之一;Figure 5 is one of the schematic diagrams of the test end of a fault diagnosis method according to an embodiment of the present invention;
图6是本发明实施例的一种故障诊断方法的测试端示意图之二;Figure 6 is the second schematic diagram of the test end of a fault diagnosis method according to the embodiment of the present invention;
图7是本发明实施例提供的一种故障诊断装置的结构示意图;Figure 7 is a schematic structural diagram of a fault diagnosis device provided by an embodiment of the present invention;
图8是本发明实施例提供的一种故障诊断系统的结构示意图;Figure 8 is a schematic structural diagram of a fault diagnosis system provided by an embodiment of the present invention;
图9是本发明实施例提供的一种电子设备的结构示意图。Figure 9 is a schematic structural diagram of an electronic device provided by an embodiment of the present invention.
具体实施方式Detailed ways
为使本发明实施例的目的、技术方案和优点更加清楚,下面将结合附图对本发明的各实施方式进行详细的阐述。然而,本领域的普通技术人员可以理解,在本发明各实施方式中,为了使读者更好地理解本申请而提出了许多技术细节。但是,即使没有这些技术细节和基于以下各实施方式的种种变化和修改,也可以实现本申请所要求保护的技术方案。以下各个实施例的划分是为了描述方便,不应对本发明的具体实现方式构成任何限定,各个实施例在不矛盾的前提下可以相互结合相互引用。In order to make the objectives, technical solutions and advantages of the embodiments of the present invention clearer, each implementation mode of the present invention will be described in detail below with reference to the accompanying drawings. However, those of ordinary skill in the art will understand that in various embodiments of the present invention, many technical details are provided to enable readers to better understand the present application. However, even without these technical details and various changes and modifications based on the following embodiments, the technical solution claimed in this application can also be implemented. The division of the following embodiments is for convenience of description and should not constitute any limitation on the specific implementation of the present invention. The various embodiments can be combined with each other and quoted from each other on the premise that there is no contradiction.
现有的服务器架构上,由于涉及多个CPU和其他芯片等部件,当运行时出现故障排查困难,开发人员会在服务器产品开发过程中进行系统测试,以及时发现产品设计中存在的问题进行解决,保证服务器新产品的研发质量,然而现有的研发阶段的故障诊断主要依靠开发人员人工进行测试日志查找以及诊断资料查找或凭借自身经验进行故障诊断,需要耗费大量人力且耗费时间较长,基于上述问题,本发明通过SOL(局域网串口)、网口等工具实时收集服务器打印信息,自动检查故障信息,自动查找CPU及主芯片规格书,通过链接公司内网故障数据库,系统自动给出故障类别及解决方法,实现故障诊断的智能化,进而提升开发效率,增加平台的竞争力。On the existing server architecture, since multiple CPUs and other chips and other components are involved, it is difficult to troubleshoot when running. Developers will conduct system testing during the server product development process to promptly discover and solve problems existing in the product design. , to ensure the R&D quality of new server products. However, fault diagnosis in the existing R&D stage mainly relies on developers to manually search test logs and diagnostic data or rely on their own experience for fault diagnosis, which requires a lot of manpower and takes a long time. Based on To solve the above problems, the present invention collects server printing information in real time through tools such as SOL (LAN serial port) and network port, automatically checks fault information, automatically searches for CPU and main chip specifications, and by linking to the company's intranet fault database, the system automatically gives the fault category and solutions to achieve intelligent fault diagnosis, thereby improving development efficiency and increasing the competitiveness of the platform.
参照图1,示出了本发明实施例提供的故障诊断方法的步骤流程图之一,应用于图8所示的故障诊断系统中的测试端,测试端与被测服务器连接,测试端包括第一界面和第二界面,第一界面和第二界面切换显示,所述方法可以包括:Referring to Figure 1, one of the step flow charts of the fault diagnosis method provided by the embodiment of the present invention is shown. It is applied to the test end in the fault diagnosis system shown in Figure 8. The test end is connected to the server under test. The test end includes a third An interface and a second interface, the first interface and the second interface are switched and displayed, and the method may include:
步骤101,响应于第一界面监测到系统故障发出的指示信号,切换至第二界面,获取第一界面的监测信息。Step 101: In response to the first interface detecting the indication signal sent by the system failure, switch to the second interface to obtain the monitoring information of the first interface.
在本发明实施例中,为对被测服务器进行系统测试的故障诊断,在现有的服务器架构上,增加了测试端以及测试环境的搭建,将被测服务器与测试端连接,测试端包括主机和显示屏,显示屏用于自动化存储测试信息及显示故障诊断结果,在本发明实施例中,为了便于开发人员进行故障诊断,测试端包括第一界面和第二界面,第一界面用于显示信息打印页面,即显示命令以及被测服务器主机上传的各种打印信息,第二界面用于显示故障诊断的执行过程,并展示出故障诊断方案,其中,第一界面和第二界面切换显示。In the embodiment of the present invention, in order to conduct system test fault diagnosis on the server under test, a test terminal and a test environment are built on the existing server architecture, and the server under test is connected to the test terminal. The test terminal includes a host computer. and a display screen. The display screen is used to automatically store test information and display fault diagnosis results. In the embodiment of the present invention, in order to facilitate developers to perform fault diagnosis, the test terminal includes a first interface and a second interface. The first interface is used to display The information printing page displays commands and various printing information uploaded by the server host under test. The second interface is used to display the execution process of fault diagnosis and display the fault diagnosis plan. The first interface and the second interface are switched for display.
具体的,测试端初始界面可以为第一界面,响应于第一界面监测到系统故障发出的指示信号,切换至第二界面,获取第一界面的监测信息,参照图5,是本发明实施例的一种故障诊断方法的测试端示意图之一,第一界面在监测到系统测试故障时,会在信息打印页面显示出故障,并采用系统指示灯进行故障告警,当第一界面监测到系统故障发出指示信号后,测试端自动切换第二界面,开始执行故障诊断的步骤流程,第二界面首先获取到第一界面监测过程中生成的监测信息。Specifically, the initial interface of the test terminal can be the first interface. In response to the indication signal sent by the first interface monitoring the system failure, it switches to the second interface to obtain the monitoring information of the first interface. Refer to Figure 5, which is an embodiment of the present invention. One of the schematic diagrams of the test end of a fault diagnosis method. When the first interface detects a system test fault, it will display the fault on the information print page and use the system indicator light to issue a fault alarm. When the first interface detects a system fault After sending out the instruction signal, the test terminal automatically switches to the second interface and starts the step process of fault diagnosis. The second interface first obtains the monitoring information generated during the monitoring process of the first interface.
需要说明的是,本实施方式所述的第一界面和第二界面仅用于将测试端的显示功能进行区分,两者之间没有先后顺序。本实施方式仅以第一界面是显示信息打印页面,第二界面是显示故障诊断的执行过程,并展示出故障诊断方案为例进行说明而已。It should be noted that the first interface and the second interface described in this embodiment are only used to distinguish the display functions of the test terminal, and there is no order between them. In this embodiment, the first interface displays an information print page, and the second interface displays the execution process of fault diagnosis and displays the fault diagnosis solution as an example.
步骤102,对监测信息进行解析处理,确定出故障信息,其中,故障信息包括目标故障关键词和目标故障类别。Step 102: Analyze and process the monitoring information to determine fault information, where the fault information includes target fault keywords and target fault categories.
在本发明实施例中,测试端的第二界面包括故障诊断显示页面,当切换至第二界面后,测试端开始对监测信息进行解析处理,需要说明的是,测试端需要对监测信息进行故障识别、故障存储、故障自动化诊断。In the embodiment of the present invention, the second interface of the test end includes a fault diagnosis display page. After switching to the second interface, the test end begins to parse and process the monitoring information. It should be noted that the test end needs to perform fault identification on the monitoring information. , fault storage, automatic fault diagnosis.
具体的,对监测信息进行解析,从系统测试过程中生成的监测信息进行检索提取,确定出监测信息中出现故障关键词的语句,识别出故障类别,并对故障关键词和故障类别进行存储,存储为故障信息,完成故障识别和故障存储,以便于针对性的对出现的故障进行诊断,提高故障诊断的准确性。Specifically, the monitoring information is parsed, the monitoring information generated during the system testing process is retrieved and extracted, the sentences containing the fault keywords in the monitoring information are determined, the fault categories are identified, and the fault keywords and fault categories are stored. Stored as fault information, complete fault identification and fault storage to facilitate targeted diagnosis of faults and improve the accuracy of fault diagnosis.
需要说明的是,故障信息包括目标故障关键词和目标故障类别,目故障关键词是在监测信息中匹配的故障关键词,表示监测信息中存在故障,目标故障类别是根据故障关键词所在语句确定出的故障类别,表示监测信息中存在的故障类别,其中,由于故障类别数量较多,根据故障关键词确定的故障类别可能存在多个,需要进一步进行分析识别,确定出目标故障类别,在此不作一一赘述。It should be noted that the fault information includes the target fault keyword and the target fault category. The target fault keyword is the fault keyword matched in the monitoring information, indicating that there is a fault in the monitoring information. The target fault category is determined based on the sentence where the fault keyword is located. The fault category that appears indicates the fault category that exists in the monitoring information. Among them, due to the large number of fault categories, there may be multiple fault categories determined based on the fault keywords. Further analysis and identification are required to determine the target fault category. Here I won’t go into details one by one.
步骤103,根据故障信息的目标故障关键词和目标故障类别,在预设故障数据库确定出故障对应诊断方案,将诊断方案展示在第二界面。Step 103: Based on the target fault keyword and target fault category of the fault information, determine the fault corresponding diagnosis plan in the preset fault database, and display the diagnosis plan on the second interface.
在本发明实施例中,测试端在确定出故障信息后,根据故障信息的目标故障关键词和目标故障类别,在预设故障数据库确定出故障对应诊断方案,其中目标故障关键词和目标故障类别能够锁定故障的具体信息,从而在预设故障数据库中调用故障对应的诊断方案。In the embodiment of the present invention, after determining the fault information, the test end determines the corresponding fault diagnosis scheme in the preset fault database according to the target fault keyword and target fault category of the fault information, where the target fault keyword and the target fault category are It can lock the specific information of the fault and call the diagnostic plan corresponding to the fault in the preset fault database.
需要说明的是,预设故障数据库可以通过第二界面的链接按钮进入,测试端能够通过内网和内部的预设故障数据库建立连接,直接访问预设故障数据库,进行故障信息的检索和提取。预设故障数据库中存储有各种服务器历史系统测试出现的故障以及对应诊断方案,需要说明的是,在以上示出的实施方式中,预设故障数据库在线运行期间,还可以利用实时获得的新增故障数据,通过自动机器学习框架,不断地对诊断方案进行更新,从而保持诊断方案具备最佳的服务器故障诊断能力,在此不作具体限定。It should be noted that the preset fault database can be entered through the link button on the second interface. The test terminal can establish a connection with the internal preset fault database through the intranet and directly access the preset fault database to retrieve and extract fault information. The preset fault database stores faults that occurred in various server historical system tests and corresponding diagnostic solutions. It should be noted that in the implementation shown above, during the online operation of the preset fault database, new data obtained in real time can also be used. Increasing fault data, and continuously updating the diagnostic solution through the automatic machine learning framework, so as to maintain the diagnostic solution with the best server fault diagnosis capability, which is not specifically limited here.
具体的,当测试端根据故障信息的目标故障关键词和目标故障类别,在预设故障数据库确定出故障对应诊断方案之后,将诊断方案展示在第二界面,以便于开发人员直接根据诊断方案确定故障,并依据诊断方案给出的解决方式对故障的服务器进行维护。Specifically, when the test end determines the fault corresponding diagnosis scheme in the preset fault database based on the target fault keyword and target fault category of the fault information, the diagnostic scheme is displayed on the second interface so that the developer can directly determine based on the diagnostic scheme. fault, and perform maintenance on the faulty server according to the solutions given in the diagnostic plan.
本发明实施例利用测试端的多个显示界面,实时监测被测服务器,在第一界面监测到系统故障发出的指示信号后切换至第二界面进行故障诊断,通过对监测信息进行解析处理,准确锁定系统测试故障的故障关键词以及故障类别,再用故障关键词和故障类别在预设故障数据库直接确定出故障对应的有效诊断方案,能够对被测服务器系统测试的故障实现自动化的保存、分析并给出解决建议,最终将诊断方案展示在第二界面供开发人员依据诊断方案解决测试故障,实现了故障诊断的智能化,有效节省了开发人员人工查找日志以及故障诊断时间,提高了系统故障诊断的效率,大大缩短了新产品的研发测试周期,进一步提高了产品的开发效率。The embodiment of the present invention uses multiple display interfaces on the test end to monitor the server under test in real time. After the first interface monitors the indication signal issued by the system failure, it switches to the second interface for fault diagnosis. By analyzing and processing the monitoring information, it can accurately lock The fault keywords and fault categories of system test faults are used, and then the fault keywords and fault categories are used to directly determine the effective diagnosis solution corresponding to the fault in the preset fault database, which can automatically save, analyze and analyze the faults tested by the server system under test. Provide solution suggestions, and finally display the diagnostic solution on the second interface for developers to solve test faults based on the diagnostic solution, realizing intelligent fault diagnosis, effectively saving developers manual search logs and fault diagnosis time, and improving system fault diagnosis The efficiency greatly shortens the development and testing cycle of new products and further improves product development efficiency.
进一步的,参照图2,示出了图1提供的故障诊断方法的步骤102的流程图,该方法与本发明的第一实施例提供的故障诊断方法基本相同,步骤102可以包括:Further, referring to Figure 2, a flow chart of step 102 of the fault diagnosis method provided in Figure 1 is shown. This method is basically the same as the fault diagnosis method provided by the first embodiment of the present invention. Step 102 may include:
步骤201,在监测信息中对预设故障关键词进行匹配,确定出目标故障关键词及目标故障关键词在监测信息中的位置。Step 201: Match the preset fault keywords in the monitoring information to determine the target fault keyword and the position of the target fault keyword in the monitoring information.
本发明实施例中,测试端在监测信息中对预设故障关键词进行匹配,从而确定出监测信息中出现的故障关键词以及故障关键词的位置,以便于锁定故障信息。In the embodiment of the present invention, the test terminal matches the preset fault keywords in the monitoring information, thereby determining the fault keywords appearing in the monitoring information and the location of the fault keywords, so as to lock the fault information.
示例的,检索从第一界面的信息打印页面获取的监测信息,通过查找预设故障关键词,如error、Fault、alert等,确定出目标故障关键词及目标故障关键词在监测信息中的位置,例如,在监测信息中匹配到“error”,则标记出其所在位置的行语句。For example, the monitoring information obtained from the information printing page of the first interface is retrieved, and the target fault keyword and the position of the target fault keyword in the monitoring information are determined by searching for preset fault keywords, such as error, fault, alert, etc. For example, if "error" is matched in the monitoring information, the line statement at its location will be marked.
步骤202,对目标故障关键词所在行进行语句解析,确定目标故障类别。Step 202: Perform sentence analysis on the row where the target fault keyword is located to determine the target fault category.
在本发明实施例中,当确定出目标故障关键词所在行的语句后,进行语义分析,具体的,基于故障关键词和故障类别的对应关系,对故障信息所在行的语句进行分析,提取出目标故障关键词对应的故障类别,如PSU、temp等,锁定目标故障类别。In the embodiment of the present invention, after the sentence in the row where the target fault keyword is located is determined, semantic analysis is performed. Specifically, based on the corresponding relationship between the fault keyword and the fault category, the sentence in the row where the fault information is located is analyzed and extracted. The fault category corresponding to the target fault keyword, such as PSU, temp, etc., locks the target fault category.
步骤203,将目标故障关键词和目标故障类别存储为故障信息。Step 203: Store target fault keywords and target fault categories as fault information.
本发明实施例中,通过在监测信息中对预设故障关键词进行匹配,确定出目标故障关键词及目标故障关键词在监测信息中的位置,对目标故障关键词所在行进行语句解析,确定目标故障类别,将目标故障关键词和目标故障类别存储为故障信息,实现对故障的准确识别和故障存储,提高故障的识别准确度,便于对故障进行下一步的故障诊断。In the embodiment of the present invention, by matching the preset fault keywords in the monitoring information, the target fault keyword and the position of the target fault keyword in the monitoring information are determined, and the line where the target fault keyword is located is sentence parsed to determine Target fault category, store target fault keywords and target fault categories as fault information, achieve accurate fault identification and fault storage, improve fault identification accuracy, and facilitate the next step of fault diagnosis.
进一步的,参照图3,示出了图1提供的故障诊断方法的步骤103的流程图,该方法与本发明的第一实施例提供的故障诊断方法基本相同,步骤103可以包括:Further, referring to Figure 3, a flow chart of step 103 of the fault diagnosis method provided in Figure 1 is shown. This method is basically the same as the fault diagnosis method provided by the first embodiment of the present invention. Step 103 may include:
步骤301,在预设故障数据库中对故障信息的目标故障关键词进行匹配,得到匹配结果。Step 301: Match the target fault keywords of the fault information in the preset fault database to obtain the matching results.
在本发明实施例中,测试端识别故障并存储故障信息后,在预设故障数据库中对故障信息的目标故障关键词进行匹配,得到匹配结果,其中,预设故障数据库中可以包括CPU及主芯片的规格书,以使基于规格书对目标故障关键词进行匹配,得到匹配结果,确定出当前出现的故障能够在预设故障数据库中检索到,便于进一步进行故障诊断。In the embodiment of the present invention, after the test terminal identifies the fault and stores the fault information, it matches the target fault keywords of the fault information in the preset fault database to obtain the matching result. The preset fault database may include the CPU and the host computer. Specification book of the chip, so that the target fault keyword can be matched based on the specification book, the matching result can be obtained, and the current fault can be determined to be retrieved in the preset fault database to facilitate further fault diagnosis.
步骤302,若匹配结果为目标故障关键词匹配成功,确定目标故障关键词匹配的多个故障类别。Step 302: If the matching result is that the target fault keyword is successfully matched, multiple fault categories matching the target fault keyword are determined.
需要说明的是,目标故障关键词能够在预设故障数据库中找到多个故障类别,因此,首先确定出目标故障关键词对应的多个故障类别,再从多个故障类别中确定出符合要求的目标故障类别,避免对所有故障类别的诊断方案进行遍历,浪费处理时间的问题,进一步节省数据处理耗费的资源。It should be noted that the target fault keyword can find multiple fault categories in the preset fault database. Therefore, first determine the multiple fault categories corresponding to the target fault keyword, and then determine the required fault categories from the multiple fault categories. The target fault category avoids the problem of traversing the diagnostic solutions of all fault categories and wasting processing time, further saving resources consumed by data processing.
步骤303,将多个故障类别按照与目标故障关键词的关联度进行优先级排序,确定至少一个目标故障类别。Step 303: Prioritize multiple fault categories according to their relevance to the target fault keyword, and determine at least one target fault category.
其中,目标故障类别是优先级满足预设条件确定的故障类别。Among them, the target fault category is a fault category whose priority satisfies the preset conditions.
具体的,采用预设语义关联度算法,计算出多个故障类别与目标故障关键词的关联度,本实施例对生成关联度的具体算法不作限定,可以是任意一种能够得到故障类别与目标故障关键词的关联度的算法,在此不再赘述。Specifically, a preset semantic correlation algorithm is used to calculate the correlation between multiple fault categories and target fault keywords. This embodiment does not limit the specific algorithm for generating correlation, and it can be any algorithm that can obtain the fault category and target. The algorithm for correlation of fault keywords will not be described again here.
需要说明的是,为保证故障诊断的准确度,避免故障类别判断失误耽误故障诊断,测试端将多个故障类别按照与目标故障关键词的关联度进行优先级排序,确定至少一个目标故障类别,即确定出符合条件的多个目标故障类别,提高故障诊断的容错率。It should be noted that in order to ensure the accuracy of fault diagnosis and avoid errors in fault category judgment that delay fault diagnosis, the test end prioritizes multiple fault categories according to their relevance to the target fault keyword, and determines at least one target fault category. That is, multiple target fault categories that meet the conditions are determined to improve the fault tolerance rate of fault diagnosis.
步骤304,根据至少一个所述目标故障类别,在预设故障数据库中调用目标故障类别对应的诊断方案。Step 304: According to at least one target fault category, call the diagnostic solution corresponding to the target fault category in the preset fault database.
步骤305,将诊断方案展示在第二界面的故障诊断显示页面。Step 305: Display the diagnosis plan on the fault diagnosis display page of the second interface.
具体的,测试端分别与被测服务器和内网连接,步骤304根据至少一个目标故障类别,在预设故障数据库中调用目标故障类别对应的诊断方案,包括:Specifically, the test end is connected to the server under test and the intranet respectively. Step 304 calls the diagnostic solution corresponding to the target fault category in the preset fault database according to at least one target fault category, including:
首先,若确定至少一个所述目标故障类别,通过内网调用预设故障数据库;First, if at least one of the target fault categories is determined, call the preset fault database through the intranet;
其次,在预设故障数据库提取目标故障类别对应的故障诊断方案。Secondly, the fault diagnosis scheme corresponding to the target fault category is extracted from the preset fault database.
具体的,步骤305将诊断方案展示在第二界面的故障诊断显示页面之后,还包括:Specifically, step 305 displays the diagnosis solution after the fault diagnosis display page of the second interface, and also includes:
将目标故障关键词和优先级满足预设条件的至少一个目标故障类别的对应关系存储至预设故障数据库;Store the corresponding relationship between the target fault keyword and at least one target fault category whose priority satisfies the preset condition in the preset fault database;
存储诊断方案的调用路径,其中,调用路径是测试端与预设故障数据库的连接路径,以使开发人员查阅所述诊断方案。The calling path of the diagnostic plan is stored, where the calling path is the connection path between the test terminal and the preset fault database, so that developers can check the diagnostic plan.
需要说明的是,参照图6,是本发明实施例的一种故障诊断方法的测试端示意图之二,第二界面包括故障诊断显示页面,第二界面中还包括“资料存放”按钮、“数据库”按钮和“故障诊断”按钮,“资料存放”用来存放服务器主机关键芯片的规格书、CPU的规格书、CPUPCIE相关报错信息对照表、CPU启动error对照表,高速链路损耗表、sensor list及其他开发设计资料等。“数据库”按钮用来通过内网,和内网服务器中的故障数据库建立连接,“故障诊断”按钮用于建立连接后可以访问故障数据库,实现故障数据库中信息的检索和提取。It should be noted that, with reference to Figure 6, which is the second schematic diagram of the test end of a fault diagnosis method according to the embodiment of the present invention, the second interface includes a fault diagnosis display page, and the second interface also includes a "data storage" button, a "database" " button and "Fault Diagnosis" button, "Data Storage" is used to store the specifications of key chips of the server host, CPU specifications, CPU PCIE related error information comparison table, CPU startup error comparison table, high-speed link loss table, sensor list and other development and design materials, etc. The "Database" button is used to establish a connection with the fault database in the Intranet server through the intranet. The "Fault Diagnosis" button is used to access the fault database after establishing the connection to achieve retrieval and extraction of information in the fault database.
本实施例通过对监测信息进行解析处理,准确锁定系统测试故障的故障关键词以及故障类别,再用故障关键词和故障类别在预设故障数据库直接确定出故障对应的有效诊断方案,能够对被测服务器系统测试的故障实现自动化的保存、分析并给出解决建议,最终将诊断方案展示在第二界面供开发人员依据诊断方案解决测试故障,实现了故障诊断的智能化,有效节省了开发人员人工查找日志以及故障诊断时间,提高了系统故障诊断的效率,大大缩短了新产品的研发测试周期,进一步提高了产品的开发效率。This embodiment analyzes and processes the monitoring information to accurately lock the fault keywords and fault categories of the system test fault, and then uses the fault keywords and fault categories to directly determine the effective diagnosis solution corresponding to the fault in the preset fault database, which can diagnose the fault. Faults tested by the test server system are automatically saved, analyzed and solution suggestions are given. Finally, the diagnostic solution is displayed on the second interface for developers to solve test faults based on the diagnostic solution. This realizes intelligent fault diagnosis and effectively saves developers time. Manual search logs and fault diagnosis time improve the efficiency of system fault diagnosis, greatly shorten the development and testing cycle of new products, and further improve product development efficiency.
本发明的第二实施方式涉及一种故障诊断方法,该方法与本发明的第一实施例提供的故障诊断方法基本相同,其区别在于,参照图4,示出了本发明实施例提供的故障诊断方法的步骤流程图之二,所述方法包括:The second embodiment of the present invention relates to a fault diagnosis method, which is basically the same as the fault diagnosis method provided by the first embodiment of the present invention. The difference is that, with reference to Figure 4, a fault diagnosis method provided by the embodiment of the present invention is shown. The second step flow chart of the diagnostic method includes:
步骤104,在被测服务器进行系统测试的情况下,接收被测服务器上报的系统测试状态。Step 104: When the server under test performs system testing, receive the system test status reported by the server under test.
在本发明实施例中,在被测服务器进行系统测试的情况下,测试端的主机能够接收被测服务器上报的系统测试状态,服务器主机是被测试服务器,测试端的主机和服务器主机通过串口和网口连接,测试端主机的另一张网卡和内网连接,测试端的显示器用于显示测试信息及分析结果,服务器主机和测试端的主机之间的串口用于串口通信,服务器主机上报打印信息。In the embodiment of the present invention, when the server under test performs system testing, the host at the test end can receive the system test status reported by the server under test. The server host is the server under test. The host at the test end and the server host communicate through the serial port and network port. Connection, another network card of the test end host is connected to the intranet, the test end monitor is used to display test information and analysis results, the serial port between the server host and the test end host is used for serial communication, and the server host reports printing information.
步骤105,将系统测试状态显示在第一界面的信息打印页面。Step 105: Display the system test status on the information printing page of the first interface.
步骤106,若信息打印页面出现被测服务器故障告警,通过第一界面发出故障告警指示。Step 106: If a fault alarm of the tested server appears on the information printing page, a fault alarm indication is issued through the first interface.
示例的,系统测试进行时,第一界面会实时监测信息打印页面的信息,当系统测试正常时,系统指示灯亮绿灯;当检测到系统测试故障时,系统指示灯亮红灯,即若信息打印页面出现被测服务器故障告警,通过第一界面发出故障告警指示,以使开发人员及时看到系统测试出现故障,进行故障诊断。For example, when the system test is in progress, the first interface will monitor the information on the information print page in real time. When the system test is normal, the system indicator light will turn green; when a system test failure is detected, the system indicator light will turn red. That is, if the information print page When a fault alarm occurs on the server under test, a fault alarm indication is issued through the first interface, so that developers can see the fault in the system test in time and perform fault diagnosis.
需要说明的是,本实施方式不对步骤104至步骤106的具体执行位置进行限定,在本实施方式中,为了便于理解,以步骤104至步骤106在步骤101之前执行为例进行说明。It should be noted that this embodiment does not limit the specific execution positions of steps 104 to 106. In this embodiment, for ease of understanding, an example is provided in which steps 104 to 106 are executed before step 101.
步骤101,响应于第一界面监测到系统故障发出的指示信号,切换至第二界面,获取第一界面的监测信息。Step 101: In response to the first interface detecting the indication signal sent by the system failure, switch to the second interface to obtain the monitoring information of the first interface.
步骤102,对监测信息进行解析处理,确定出故障信息,其中,故障信息包括目标故障关键词和目标故障类别。Step 102: Analyze and process the monitoring information to determine fault information, where the fault information includes target fault keywords and target fault categories.
步骤103,根据故障信息的目标故障关键词和目标故障类别,在预设故障数据库确定出故障对应诊断方案,将诊断方案展示在第二界面。Step 103: Based on the target fault keyword and target fault category of the fault information, determine the fault corresponding diagnosis plan in the preset fault database, and display the diagnosis plan on the second interface.
需要说明的是,上述步骤101-103参照前序论述,在此不再赘述。It should be noted that the above steps 101-103 are discussed with reference to the foregoing and will not be repeated here.
本发明实施方式相对于现有技术而言,通过在被测服务器进行系统测试的情况下,接收被测服务器上报的系统测试状态,将系统测试状态显示在第一界面的信息打印页面,若信息打印页面出现被测服务器故障告警,通过第一界面发出故障告警指示,能够及时通知开发人员出现测试故障,提高了系统故障诊断的效率,进一步提高了产品的开发效率。Compared with the existing technology, the embodiment of the present invention receives the system test status reported by the server under test when the server under test performs system testing, and displays the system test status on the information printing page of the first interface. If the information When a fault alarm of the tested server appears on the printed page, a fault alarm indication is issued through the first interface, which can promptly notify developers of test faults, improve the efficiency of system fault diagnosis, and further improve product development efficiency.
参照图7,示出了本发明实施例提供的一种故障诊断装置的结构示意图,应用于测试端,所述测试端与被测服务器连接,所述测试端包括第一界面和第二界面,所述第一界面和所述第二界面切换显示,该装置可以包括:Referring to Figure 7, there is shown a schematic structural diagram of a fault diagnosis device provided by an embodiment of the present invention, which is applied to a test terminal. The test terminal is connected to the server under test. The test terminal includes a first interface and a second interface. The first interface and the second interface are switched and displayed. The device may include:
获取监测信息模块401,用于响应于所述第一界面监测到系统故障发出的指示信号,切换至所述第二界面,获取所述第一界面的监测信息;Obtain monitoring information module 401, configured to switch to the second interface in response to an indication signal sent by a system failure detected on the first interface, and obtain monitoring information of the first interface;
确定故障信息模块402,用于对所述监测信息进行解析处理,确定出故障信息,其中,所述故障信息包括目标故障关键词和目标故障类别;The fault information determination module 402 is used to analyze and process the monitoring information and determine fault information, where the fault information includes target fault keywords and target fault categories;
确定诊断方案模块403,用于根据所述故障信息的目标故障关键词和目标故障类别,在预设故障数据库确定出故障对应诊断方案,将所述诊断方案展示在所述第二界面。The diagnostic solution determining module 403 is configured to determine a diagnostic solution corresponding to the fault in the preset fault database according to the target fault keyword and the target fault category of the fault information, and display the diagnostic solution on the second interface.
进一步的,所述确定故障信息模块402包括:Further, the fault information determining module 402 includes:
第一确定子模块,用于在所述监测信息中对预设故障关键词进行匹配,确定出目标故障关键词及目标故障关键词在所述监测信息中的位置;The first determination sub-module is used to match the preset fault keywords in the monitoring information, and determine the target fault keyword and the position of the target fault keyword in the monitoring information;
第二确定子模块,用于对所述目标故障关键词所在行进行语句解析,确定目标故障类别;The second determination sub-module is used to perform statement analysis on the row where the target fault keyword is located, and determine the target fault category;
第一存储子模块,用于将所述目标故障关键词和所述目标故障类别存储为故障信息。The first storage submodule is used to store the target fault keyword and the target fault category as fault information.
进一步的,所述确定诊断方案模块403包括:Further, the determining diagnostic solution module 403 includes:
关键词匹配子模块,用于在预设故障数据库中对所述故障信息的目标故障关键词进行匹配,得到匹配结果;The keyword matching submodule is used to match the target fault keywords of the fault information in the preset fault database to obtain the matching results;
第三确定子模块,用于若所述匹配结果为目标故障关键词匹配成功,确定所述目标故障关键词匹配的多个故障类别;The third determination submodule is used to determine multiple fault categories matched by the target fault keyword if the matching result is a successful match of the target fault keyword;
第四确定子模块,用于将多个所述故障类别按照与所述目标故障关键词的关联度进行优先级排序,确定至少一个目标故障类别;其中,所述目标故障类别是优先级满足预设条件确定的故障类别;The fourth determination sub-module is used to prioritize multiple fault categories according to their relevance to the target fault keyword and determine at least one target fault category; wherein the target fault category is one whose priority satisfies the predetermined Fault category determined by assuming conditions;
调用子模块,用于根据至少一个所述目标故障类别,在所述预设故障数据库中调用所述目标故障类别对应的诊断方案;Calling a submodule, configured to call a diagnostic solution corresponding to the target fault category in the preset fault database according to at least one of the target fault categories;
展示子模块,用于将所述诊断方案展示在所述第二界面的故障诊断显示页面。A display submodule is used to display the diagnosis solution on the fault diagnosis display page of the second interface.
进一步的,所述测试端分别与所述被测服务器和所述内网连接,所述调用子模块包括:Further, the test terminal is connected to the server under test and the intranet respectively, and the calling sub-module includes:
调用单元,用于若确定至少一个所述目标故障类别,通过所述内网调用所述预设故障数据库;A calling unit, configured to call the preset fault database through the intranet if at least one of the target fault categories is determined;
提取单元,用于在所述预设故障数据库提取所述目标故障类别对应的故障诊断方案。An extraction unit is configured to extract a fault diagnosis solution corresponding to the target fault category from the preset fault database.
进一步的,所述确定诊断方案模块403还包括:Further, the determining diagnostic solution module 403 also includes:
第三存储子模块,用于将所述目标故障关键词和优先级满足预设条件的至少一个目标故障类别的对应关系存储至所述预设故障数据库;The third storage submodule is used to store the corresponding relationship between the target fault keyword and at least one target fault category whose priority satisfies the preset condition to the preset fault database;
第四存储子模块,用于存储所述诊断方案的调用路径,其中,所述调用路径是所述测试端与所述预设故障数据库的连接路径,以使开发人员查阅所述诊断方案。The fourth storage sub-module is used to store the calling path of the diagnosis plan, where the calling path is the connection path between the test terminal and the preset fault database, so that developers can check the diagnosis plan.
进一步的,所述装置还包括:Further, the device also includes:
接收信息模块,用于在所述被测服务器进行系统测试的情况下,接收所述被测服务器上报的系统测试状态;An information receiving module, configured to receive the system test status reported by the server under test when the server under test performs system testing;
显示信息模块,用于将所述系统测试状态显示在所述第一界面的信息打印页面;A display information module, configured to display the system test status on the information print page of the first interface;
故障告警模块,用于若所述信息打印页面出现被测服务器故障告警,通过所述第一界面发出故障告警指示。A fault alarm module is configured to issue a fault alarm indication through the first interface if a fault alarm of the tested server appears on the information printing page.
本发明实施例提供的一种故障诊断装置,测试端通过响应于第一界面监测到系统故障发出的指示信号,切换至第二界面,获取第一界面的监测信息,对监测信息进行解析处理,确定出故障信息,根据故障信息的目标故障关键词和目标故障类别,在预设故障数据库确定出故障对应诊断方案,将诊断方案展示在第二界面。本发明实施例利用测试端的多个显示界面,实时监测被测服务器,在第一界面监测到系统故障发出的指示信号后切换至第二界面进行故障诊断,通过对监测信息进行解析处理,准确锁定系统测试故障的故障关键词以及故障类别,再用故障关键词和故障类别在预设故障数据库直接确定出故障对应的有效诊断方案,能够对被测服务器系统测试的故障实现自动化的保存、分析并给出解决建议,最终将诊断方案展示在第二界面供开发人员依据诊断方案解决测试故障,实现了故障诊断的智能化,有效节省了开发人员人工查找日志以及故障诊断时间,提高了系统故障诊断的效率,大大缩短了新产品的研发测试周期,进一步提高了产品的开发效率。In a fault diagnosis device provided by an embodiment of the present invention, the test terminal switches to the second interface in response to the indication signal sent by the system fault detected by the first interface, obtains the monitoring information of the first interface, and analyzes the monitoring information. Determine the fault information, determine the corresponding fault diagnosis plan in the preset fault database according to the target fault keyword and target fault category of the fault information, and display the diagnosis plan on the second interface. The embodiment of the present invention uses multiple display interfaces on the test end to monitor the server under test in real time. After the first interface monitors the indication signal issued by the system failure, it switches to the second interface for fault diagnosis. By analyzing and processing the monitoring information, it can accurately lock The fault keywords and fault categories of system test faults are used, and then the fault keywords and fault categories are used to directly determine the effective diagnosis solution corresponding to the fault in the preset fault database, which can automatically save, analyze and analyze the faults tested by the server system under test. Provide solution suggestions, and finally display the diagnostic solution on the second interface for developers to solve test faults based on the diagnostic solution, realizing intelligent fault diagnosis, effectively saving developers manual search logs and fault diagnosis time, and improving system fault diagnosis The efficiency greatly shortens the development and testing cycle of new products and further improves product development efficiency.
参照图8,示出了本发明实施例提供的故障诊断系统的结构示意图,所述系统可以包括测试端、被测服务器和内网,所述测试端分别与所述被测服务器和所述内网串口连接,所述测试端包括第一界面和第二界面,所述第一界面和所述第二界面切换显示;Referring to Figure 8, a schematic structural diagram of a fault diagnosis system provided by an embodiment of the present invention is shown. The system may include a test terminal, a server under test and an intranet. The test terminal is connected to the server under test and the intranet respectively. Network serial port connection, the test terminal includes a first interface and a second interface, the first interface and the second interface are switched for display;
所述测试端用于响应于所述第一界面监测到系统故障发出的指示信号,切换至所述第二界面,获取所述第一界面的监测信息,对所述监测信息进行解析处理,确定出故障信息,根据所述故障信息的目标故障关键词和目标故障类别,在预设故障数据库确定出故障对应诊断方案,将所述诊断方案展示在所述第二界面;The test terminal is configured to switch to the second interface in response to an indication signal from a system failure detected by the first interface, obtain the monitoring information of the first interface, analyze the monitoring information, and determine Fault information: determine the fault corresponding diagnosis scheme in the preset fault database according to the target fault keyword and target fault category of the fault information, and display the diagnosis scheme on the second interface;
所述被测服务器用于在进行系统测试的情况下上报系统测试状态至所述测试端的所述第一界面;The server under test is configured to report system test status to the first interface of the test terminal when performing system testing;
所述内网用于建立所述预设故障数据库与所述测试端的连接,用于通过所述内网在所述预设故障数据库中调用所述目标故障类别对应的诊断方案。The intranet is used to establish a connection between the preset fault database and the test terminal, and to call the diagnostic solution corresponding to the target fault category in the preset fault database through the intranet.
本实施例提供的故障诊断系统的具体实现方法可以上述实施例提供的故障诊断方法所述的内容,此处不再赘述。The specific implementation method of the fault diagnosis system provided by this embodiment can be as described in the fault diagnosis method provided by the above embodiment, and will not be described again here.
本发明实施例还提供了一种电子设备,如图9所示,包括处理器501、通信接口502、存储器503和通信总线504,其中,处理器501,通信接口502,存储器503通过通信总线504完成相互间的通信,An embodiment of the present invention also provides an electronic device, as shown in Figure 9, including a processor 501, a communication interface 502, a memory 503, and a communication bus 504. The processor 501, the communication interface 502, and the memory 503 communicate through the communication bus 504. complete mutual communication,
存储器503,用于存放计算机程序;Memory 503, used to store computer programs;
处理器501,用于执行存储器503上所存放的程序时,实现如下步骤:The processor 501 is used to execute the program stored on the memory 503 to implement the following steps:
响应于所述第一界面监测到系统故障发出的指示信号,切换至所述第二界面,获取所述第一界面的监测信息;In response to the first interface detecting an indication signal sent by a system failure, switch to the second interface to obtain monitoring information of the first interface;
对所述监测信息进行解析处理,确定出故障信息,其中,所述故障信息包括目标故障关键词和目标故障类别;Analyze and process the monitoring information to determine fault information, where the fault information includes target fault keywords and target fault categories;
根据所述故障信息的目标故障关键词和目标故障类别,在预设故障数据库确定出故障对应诊断方案,将所述诊断方案展示在所述第二界面。According to the target fault keyword and the target fault category of the fault information, a fault corresponding diagnosis scheme is determined in the preset fault database, and the diagnosis scheme is displayed on the second interface.
上述终端提到的通信总线可以是外设部件互连标准(Peripheral ComponentInterconnect,简称PCI)总线或扩展工业标准结构(Extended Industry StandardArchitecture,简称EISA)总线等。该通信总线可以分为地址总线、数据总线、控制总线等。为便于表示,图中仅用一条粗线表示,但并不表示仅有一根总线或一种类型的总线。The communication bus mentioned in the above terminal may be a Peripheral Component Interconnect (PCI) bus or an Extended Industry Standard Architecture (EISA) bus. The communication bus can be divided into address bus, data bus, control bus, etc. For ease of presentation, only one thick line is used in the figure, but it does not mean that there is only one bus or one type of bus.
通信接口用于上述终端与其他设备之间的通信。The communication interface is used for communication between the above terminal and other devices.
存储器可以包括随机存取存储器(Random Access Memory,简称RAM),也可以包括非易失性存储器(non-volatile memory),例如至少一个磁盘存储器。可选的,存储器还可以是至少一个位于远离前述处理器的存储装置。The memory may include random access memory (RAM) or non-volatile memory (non-volatile memory), such as at least one disk memory. Optionally, the memory may also be at least one storage device located far away from the aforementioned processor.
上述的处理器可以是通用处理器,包括中央处理器(Central Processing Unit,简称CPU)、网络处理器(Network Processor,简称NP)等;还可以是数字信号处理器(Digital Signal Processing,简称DSP)、专用集成电路(Application SpecificIntegrated Circuit,简称ASIC)、现场可编程门阵列(Field-Programmable Gate Array,简称FPGA)或者其他可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件。The above-mentioned processor can be a general-purpose processor, including a central processing unit (CPU), a network processor (NP), etc.; it can also be a digital signal processor (Digital Signal Processing, DSP). , Application Specific Integrated Circuit (ASIC for short), Field-Programmable Gate Array (FPGA for short) or other programmable logic devices, discrete gate or transistor logic devices, and discrete hardware components.
在本发明提供的又一实施例中,还提供了一种计算机可读存储介质,该计算机可读存储介质中存储有指令,当其在计算机上运行时,使得计算机执行上述实施例中任一所述的故障诊断方法。In yet another embodiment provided by the present invention, a computer-readable storage medium is also provided. The computer-readable storage medium stores instructions that, when run on a computer, cause the computer to execute any one of the above embodiments. Described fault diagnosis method.
在上述实施例中,可以全部或部分地通过软件、硬件、固件或者其任意组合来实现。当使用软件实现时,可以全部或部分地以计算机程序产品的形式实现。所述计算机程序产品包括一个或多个计算机指令。在计算机上加载和执行所述计算机程序指令时,全部或部分地产生按照本发明实施例所述的流程或功能。所述计算机可以是通用计算机、专用计算机、计算机网络、或者其他可编程装置。所述计算机指令可以存储在计算机可读存储介质中,或者从一个计算机可读存储介质向另一个计算机可读存储介质传输,例如,所述计算机指令可以从一个网站站点、计算机、服务器或数据中心通过有线(例如同轴电缆、光纤、数字用户线(DSL))或无线(例如红外、无线、微波等)方式向另一个网站站点、计算机、服务器或数据中心进行传输。所述计算机可读存储介质可以是计算机能够存取的任何可用介质或者是包含一个或多个可用介质集成的服务器、数据中心等数据存储设备。所述可用介质可以是磁性介质,(例如,软盘、硬盘、磁带)、光介质(例如,DVD)、或者半导体介质(例如固态硬盘Solid State Disk(SSD))等。In the above embodiments, it may be implemented in whole or in part by software, hardware, firmware, or any combination thereof. When implemented using software, it may be implemented in whole or in part in the form of a computer program product. The computer program product includes one or more computer instructions. When the computer program instructions are loaded and executed on a computer, the processes or functions described in accordance with the embodiments of the present invention are generated in whole or in part. The computer may be a general-purpose computer, a special-purpose computer, a computer network, or other programmable device. The computer instructions may be stored in or transmitted from one computer-readable storage medium to another, e.g., the computer instructions may be transferred from a website, computer, server, or data center Transmission to another website, computer, server or data center by wired (such as coaxial cable, optical fiber, digital subscriber line (DSL)) or wireless (such as infrared, wireless, microwave, etc.) means. The computer-readable storage medium may be any available medium that can be accessed by a computer or a data storage device such as a server, data center, etc. that contains one or more available media integrated. The available media may be magnetic media (eg, floppy disk, hard disk, magnetic tape), optical media (eg, DVD), or semiconductor media (eg, Solid State Disk (SSD)), etc.
需要说明的是,在本文中,诸如第一和第二等之类的关系术语仅仅用来将一个实体或者操作与另一个实体或操作区分开来,而不一定要求或者暗示这些实体或操作之间存在任何这种实际的关系或者顺序。而且,术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含,从而使得包括一系列要素的过程、方法、物品或者设备不仅包括那些要素,而且还包括没有明确列出的其他要素,或者是还包括为这种过程、方法、物品或者设备所固有的要素。在没有更多限制的情况下,由语句“包括一个……”限定的要素,并不排除在包括所述要素的过程、方法、物品或者设备中还存在另外的相同要素。It should be noted that in this article, relational terms such as first and second are only used to distinguish one entity or operation from another entity or operation, and do not necessarily require or imply that these entities or operations are mutually exclusive. any such actual relationship or sequence exists between them. Furthermore, the terms "comprises," "comprises," or any other variations thereof are intended to cover a non-exclusive inclusion such that a process, method, article, or apparatus that includes a list of elements includes not only those elements, but also those not expressly listed other elements, or elements inherent to the process, method, article or equipment. Without further limitation, an element defined by the statement "comprises a..." does not exclude the presence of additional identical elements in a process, method, article, or apparatus that includes the stated element.
本说明书中的各个实施例均采用相关的方式描述,各个实施例之间相同相似的部分互相参见即可,每个实施例重点说明的都是与其他实施例的不同之处。尤其,对于系统实施例而言,由于其基本相似于方法实施例,所以描述的比较简单,相关之处参见方法实施例的部分说明即可。Each embodiment in this specification is described in a related manner. The same and similar parts between the various embodiments can be referred to each other. Each embodiment focuses on its differences from other embodiments. In particular, for the system embodiment, since it is basically similar to the method embodiment, the description is relatively simple. For relevant details, please refer to the partial description of the method embodiment.
以上所述仅为本发明的较佳实施例而已,并非用于限定本发明的保护范围。凡在本发明的精神和原则之内所作的任何修改、等同替换、改进等,均包含在本发明的保护范围内。The above descriptions are only preferred embodiments of the present invention and are not intended to limit the scope of the present invention. Any modifications, equivalent substitutions, improvements, etc. made within the spirit and principles of the present invention are included in the protection scope of the present invention.
Claims (10)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202310920190.8A CN117130850A (en) | 2023-07-25 | 2023-07-25 | Fault diagnosis method, device, equipment and storage medium |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202310920190.8A CN117130850A (en) | 2023-07-25 | 2023-07-25 | Fault diagnosis method, device, equipment and storage medium |
Publications (1)
Publication Number | Publication Date |
---|---|
CN117130850A true CN117130850A (en) | 2023-11-28 |
Family
ID=88857389
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202310920190.8A Pending CN117130850A (en) | 2023-07-25 | 2023-07-25 | Fault diagnosis method, device, equipment and storage medium |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN117130850A (en) |
-
2023
- 2023-07-25 CN CN202310920190.8A patent/CN117130850A/en active Pending
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN108683562B (en) | Anomaly detection and positioning method, device, computer equipment and storage medium | |
WO2020237877A1 (en) | Log monitoring method and apparatus, terminal, and storage medium | |
CN114461534A (en) | Software performance testing method and system, electronic equipment and readable storage medium | |
WO2018195795A1 (en) | Test method, test server, and system | |
CN110275878B (en) | Service data detection method and device, computer equipment and storage medium | |
CN115878430A (en) | PCIE equipment failure monitoring method, device, communication equipment and storage medium | |
CN112231163A (en) | Multifunctional computer detection equipment and operation method thereof | |
CN113326206B (en) | Test method, apparatus, storage medium and program product for data processing system | |
CN113778875A (en) | System test defect classification method, device, equipment and storage medium | |
CN112131195A (en) | Aging test cluster type management method | |
CN117130850A (en) | Fault diagnosis method, device, equipment and storage medium | |
CN117472767A (en) | Software interface testing method, device, equipment and storage medium | |
CN116909904A (en) | Software function automatic test method based on high-flow high-concurrency condition | |
CN115809335A (en) | A fault diagnosis method and device based on knowledge graph | |
CN113722229A (en) | Software testing method and device, electronic equipment and storage medium | |
CN114676061A (en) | An automatic firmware detection method based on knowledge graph | |
CN119959811B (en) | Battery data processing method, system, device, medium and program product | |
CN115509838A (en) | Method, device, equipment and medium for rapidly counting returned results of pressure test | |
CN118583906B (en) | X-ray spectrum detection system, method, device and storage medium | |
CN119002963A (en) | Detection equipment updating method and device, upper computer and detection system | |
CN113360381B (en) | Case verification system and method in automatic test of host lower platform | |
CN114840370A (en) | Universal serial bus test method, wireless communication module, equipment and storage medium | |
CN119537963A (en) | Log analysis method and computing device | |
CN116166461A (en) | Error diagnosis method, storage medium and device for database operation statement | |
CN120123136A (en) | Method, equipment and storage medium for improving working efficiency of computer fault analysis |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |