CN110618888A - Method and related device for repeatedly identifying system errors - Google Patents

Method and related device for repeatedly identifying system errors Download PDF

Info

Publication number
CN110618888A
CN110618888A CN201910708623.7A CN201910708623A CN110618888A CN 110618888 A CN110618888 A CN 110618888A CN 201910708623 A CN201910708623 A CN 201910708623A CN 110618888 A CN110618888 A CN 110618888A
Authority
CN
China
Prior art keywords
system problems
context information
information corresponding
similarity score
similarity
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910708623.7A
Other languages
Chinese (zh)
Inventor
熊星
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Ping An Technology Shenzhen Co Ltd
Original Assignee
Ping An Technology Shenzhen Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ping An Technology Shenzhen Co Ltd filed Critical Ping An Technology Shenzhen Co Ltd
Priority to CN201910708623.7A priority Critical patent/CN110618888A/en
Priority to PCT/CN2019/117684 priority patent/WO2021017288A1/en
Publication of CN110618888A publication Critical patent/CN110618888A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/0703Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
    • G06F11/0706Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation the processing taking place on a specific hardware platform or in a specific software environment
    • G06F11/0745Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation the processing taking place on a specific hardware platform or in a specific software environment in an input/output transactions management context

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Quality & Reliability (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a method and a related device for repeatedly identifying system errors, belonging to the technical field of data processing, wherein the method comprises the following steps: acquiring context information corresponding to two system problems based on the two system problems for comparison, wherein the context information comprises a text of reasons generated by the system problems, a screenshot of the reasons generated by the system problems, information of a function module generating the system problems and information of equipment generating the system problems; determining the similarity between the context information corresponding to the two system problems; and comparing the similarity between the context information corresponding to the two system problems with a preset threshold value to obtain a comparison result. The method provided by the invention does not rely on the subjective experience of developers to identify whether two system problems are the system generated by one system error, thereby improving the accuracy of identifying the same system problem.

Description

Method and related device for repeatedly identifying system errors
Technical Field
The present invention relates to the field of data processing technologies, and in particular, to a method and an apparatus for repeatedly identifying a system error, an electronic device, and a storage medium.
Background
In the system development and test process, a tester submits a system problem describing relevant information including the system error to a management system according to the found BUG (system error), and a maintenance person of the system solves the system error according to the system problem submitted by the tester. Because different testers may submit similar system problems to the management system for the same system error, and the similar system problems need to be identified repeatedly, the same system error is prevented from being processed repeatedly.
In the prior art, generally, a maintainer repeatedly identifies similar system problems submitted, and classifies the system problems into system problems corresponding to the same system error based on the subjective experience of the maintainer, and the process is totally dependent on the subjective experience of the maintainer, so that the accuracy is not high.
Disclosure of Invention
Based on this, in order to solve the technical problems that repeated identification of submitted similar system problems by relying on subjective experience of maintenance personnel is avoided and the identification accuracy is not high in the prior art, the invention provides a method, a device, computer equipment and a storage medium for repeated identification of system errors.
In a first aspect, a method for repeatedly identifying system errors is provided, including:
acquiring context information corresponding to two system problems based on the two system problems for comparison, wherein the context information comprises a text of reasons generated by the system problems, a screenshot of the reasons generated by the system problems, information of a function module generating the system problems and information of equipment generating the system problems;
determining the similarity between the context information corresponding to the two system problems;
comparing the similarity between the context information corresponding to the two system problems with a preset threshold value to obtain a comparison result;
and determining whether the two system problems are the system problems corresponding to the same system error or not based on the comparison result.
In a second aspect, an apparatus for duplicate identification of system errors is provided, including:
the device comprises an acquisition unit, a comparison unit and a comparison unit, wherein the acquisition unit is used for acquiring context information corresponding to two system problems based on the two system problems for comparison, and the context information comprises a text of reasons generated by the system problems, a screenshot of the reasons generated by the system problems, information of a function module generating the system problems and information of equipment generating the system problems;
the first execution unit is used for determining the similarity between the context information corresponding to the two system problems;
the comparison unit is used for comparing the similarity between the context information corresponding to the two system problems with a preset threshold value to obtain a comparison result;
and the second execution unit is used for determining whether the two system problems are the system problems corresponding to the same system error or not based on the comparison result.
In a third aspect, an electronic device is provided, comprising a memory and a processor, wherein the memory has stored therein computer-readable instructions, which, when executed by the processor, cause the processor to perform the steps of the above method of repeatedly identifying a system error.
In a fourth aspect, a storage medium is provided having stored thereon computer-readable instructions which, when executed by one or more processors, cause the one or more processors to perform the steps of the above-described method of repeatedly identifying system errors.
The technical scheme provided by the embodiment of the disclosure can have the following beneficial effects:
when two system problems are system problems corresponding to the same system error, determining the similarity between the contexts corresponding to the two system problems according to multi-dimensional information such as texts used for reflecting reasons generated by the system problems, screenshots of the reasons generated by the system problems, function module information generating the system problems, equipment information generating the system problems and the like in the context information corresponding to the two system problems, thereby accurately determining whether the two system problems are system problems corresponding to the same system error, further automatically determining whether the two system problems are system problems corresponding to the same system error, and identifying whether the two system problems are systems generated by the same system error without depending on the subjective experience of developers, thereby improving the accuracy of identifying the same system problem.
It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory only and are not restrictive of the disclosure.
Drawings
Fig. 1 is a flowchart illustrating an implementation of a method for repeatedly identifying a system error according to an embodiment of the present invention.
Fig. 2 is a flowchart illustrating a specific implementation of step S120 in a method for repeatedly identifying a system error according to an embodiment of the present invention.
Fig. 3 is a block diagram of an apparatus for duplicate identification of system errors according to an embodiment of the present invention.
Fig. 4 schematically illustrates an example block diagram of an electronic device for implementing the above-described method for duplicate identification of system errors.
Fig. 5 schematically illustrates a computer-readable storage medium for implementing the above-described method for duplicate identification of system errors.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.
The main execution body of the method for repeatedly identifying system errors in the embodiment of the present invention is an electronic device, and the electronic device may specifically be a management system for managing system problems, and the following description will take the management system as an example.
Referring to fig. 1, a method for repeatedly identifying a system error according to an embodiment of the present invention includes the following steps S110 to S140, which are described in detail as follows:
step S110, obtaining context information corresponding to the two system problems based on the two system problems for comparison, wherein the context information comprises text information of reasons generated by the system problems, screenshots of the reasons generated by the system problems, information of function modules generating the system problems and information of equipment generating the system problems.
Step S120, determining a similarity between the context information corresponding to the two system problems.
Step S130, comparing the similarity between the context information corresponding to the two system problems with a predetermined threshold to obtain a comparison result.
Step S140, determining whether the two system problems are system problems corresponding to the same system error based on the comparison result.
In step S110, the system problem refers to a system problem that a tester submits context information describing a system error based on a BUG (system error) found in a business system during a research and development test of a business system on a research and development test terminal. The context information corresponding to the system problem comprises text information of a reason generated by the system problem, a screenshot of the reason generated by the system problem, function module information of the system problem and equipment information of the system problem.
When the management system acquires the text of the cause of the system problem, the text of the cause of the system problem may be acquired from the system log at the time of the system problem.
When the management system obtains the screenshot of the reason caused by the system problem, the screenshot can be realized by carrying out screenshot on the reason prompt displayed by the research and development test terminal under the condition that the research and development test terminal has the system problem.
When the management system acquires the information of the function module which generates the system problem, the information can be acquired from the system log when the system problem is generated.
When the management system acquires the equipment information generating the system problem, the equipment information can be called from a factory configuration module in the memory of the research and development test terminal generating the system problem.
In step S120, the management system determines the similarity between the context information corresponding to the two system problems based on the text of the cause of the system problem, the screenshot of the cause of the system problem, the function module information of the system problem, and the device information of the system problem included in the context information corresponding to the two system problems. It should be noted that the higher the similarity of the context information corresponding to the two system problems, the higher the possibility that the two system problems are the system problems corresponding to the same system error is.
Referring to fig. 2, fig. 2 is a flowchart illustrating a specific implementation of step S120 in a method for repeatedly identifying a system error according to an embodiment of the present invention, where the step S120 for determining a similarity between context information corresponding to two system problems includes:
step S1201, comparing the texts of the reasons of the system problems in the context information corresponding to the two system problems, and determining a first similarity score.
Step S1202, comparing screenshots of causes of the system problems in the context information corresponding to the two system problems, and determining a second similarity score.
Step S1203, comparing the functional module information of the system problem in the context information corresponding to the two system problems, and determining a third similarity score.
Step S1204, comparing the device information of the system problem in the context information corresponding to the two system problems, and determining a fourth similarity score.
Step S1205, determining a similarity between the context information corresponding to the two system problems based on the first similarity score, the second similarity score, the third similarity score, and the fourth similarity score.
In step S1201, the management system compares the texts of the causes of the system problems in the context information corresponding to the two system problems to determine a first similarity score between the texts of the causes of the system problems in the context information.
In one embodiment, the first similarity score is determined based on the following method:
respectively segmenting the title and the content in the text of the reasons of the system problems in the context information corresponding to the two system problems;
determining the occurrence frequency of each separated word in the separated words according to the title and the content in the text of the reason of the system problem in the context information corresponding to each system problem, and taking the words with the occurrence frequency higher than a preset occurrence frequency threshold value as the keywords in the reason of the system problem in the context information corresponding to the system problem;
and obtaining a first similarity score based on the number of keywords in the intersection of the keywords corresponding to the two system problems divided by the number of keywords in the union set of the keywords corresponding to the two system problems.
In this embodiment, when the first similarity score is determined, the management system obtains a text of a cause of the system problem in the context information corresponding to the two system problems, and performs word segmentation on the title and the content in the two texts to obtain a plurality of words, where the word segmentation method may adopt a result word segmentation.
The management system determines the occurrence frequency of each divided word in all the divided words, takes the words with the occurrence frequency higher than a preset occurrence frequency threshold value as key words in the reasons of the system problems in the context information corresponding to the system problems, and further obtains key information reflecting the reasons of the system problems.
The management system determines the number of keywords in the intersection of the keywords corresponding to the two system problems, determines the number of keywords in the union of the keywords corresponding to the two system problems, divides the number of the keywords in the intersection of the keywords corresponding to the two system problems by the number of the keywords in the union of the keywords corresponding to the two system problems to obtain a first similarity score, and then determines whether texts of causes of the system problems in the context information corresponding to the two system problems are consistent or not according to the first similarity score. It should be noted that, when two system problems are the same system problem, the information of the text records of the reasons generated by the system problems will be basically consistent, and thus the probability that the corresponding keywords are the same is higher, the first similarity score is higher; on the contrary, when the two system problems are not the same system problem, the information of the text records of the reasons for the system problems will be inconsistent, and thus the probability that the corresponding keywords are the same is smaller, the lower the first similarity score is.
In step S1202, the management system compares the screenshots of the reasons for the system problems in the context information corresponding to the two system problems to determine a second similarity score between the texts of the reasons for the system problems in the context information.
In one embodiment, the second similarity score is determined based on the following method:
performing optical character recognition on screenshots of reasons of the system problems in the context information corresponding to the two system problems to obtain characters recognized from the screenshots;
and obtaining a second similarity score based on the number of coincident characters in the characters identified for the two system problems divided by the sum of the numbers of characters identified for the two system problems.
In this embodiment, when determining the second similarity score, the management system obtains a screenshot of a cause of the system problem in the context information corresponding to the two system problems.
And for the obtained screenshots of the reasons caused by the system problems, respectively obtaining characters recognized from the screenshots through optical character recognition.
The management system determines the number of characters of character coincidence identified in the screenshots of the reasons of the system problems in the context information corresponding to the two system problems and the sum of the number of characters identified in the screenshots of the reasons of the system problems in the context information corresponding to the two system problems, divides the number of characters of character coincidence identified by the two system problems by the sum of the number of characters identified by the two system problems to obtain a second similarity score, and further determines whether the screenshots of the reasons of the system problems in the context information corresponding to the two system problems are consistent according to the second similarity score. It should be noted that, when two system problems are the same system problem, screenshots of reasons generated by the system problems are basically consistent, and therefore, the probability that corresponding characters are the same is higher, and the second similarity score is higher; conversely, when two system problems are not the same, the screenshots of the causes of the system problems will be inconsistent, whereby the probability that the corresponding characters are the same is smaller, the second similarity score is lower.
In step S1203, comparing the information of the functional modules generating the system problem in the context information corresponding to the two system problems, and determining a third similarity score.
In step S1204, the device information of the system problem in the context information corresponding to the two system problems is compared, and a fourth similarity score is determined.
In one embodiment, the third similarity score is determined based on the following method:
comparing the functional module information generating the system problem in the context information corresponding to the two system problems;
if the information of the functional modules generating the system problems in the context information corresponding to the two system problems is consistent, acquiring a preset first score as a third similarity score;
if the information of the functional modules generating the system problems in the context information corresponding to the two system problems is inconsistent, determining that the third similarity score is zero;
the fourth similarity score is determined based on the following method:
comparing the equipment information generating the system problem in the context information corresponding to the two system problems;
if the device information of the system problem generated in the context information corresponding to the two system problems is consistent, acquiring a preset second score as a fourth similarity score;
and if the device information generating the system problem in the context information corresponding to the two system problems is inconsistent, determining that the fourth similarity score is zero.
In this embodiment, when the third similarity score is determined, the management system compares the information of the functional modules generating the system problems in the context information corresponding to the two system problems, and if the information of the functional modules generating the system problems in the context information corresponding to the two system problems is consistent, obtains the preset first score as the third similarity score. In this embodiment, when the information of the function module generating the system problem is consistent, the two system problems may be generated by the same system error, and when the information of the function module generating the system problem is inconsistent, the two system problems may not be generated by the same system error; therefore, when the information of the functional modules generating the system problem is consistent, the first score can be set to be 1, and the third similarity score is 1; when the information of the functional module generating the system problem in the context information corresponding to the two system problems is inconsistent, the third similarity score may be set to zero.
When the fourth similarity score is determined, the management system compares the device information of the system problem in the context information corresponding to the two system problems, and if the device information of the system problem in the context information corresponding to the two system problems is consistent, the preset second score is obtained as the fourth similarity score. In this embodiment, when the device information that causes the system problem is consistent, the two system problems may be caused by the same system error, and when the device information that causes the system problem is inconsistent, the two system problems may not be caused by the same system error; thus, when the device information generating the system problem is consistent, the second score may be set to 1, and the fourth similarity score is 1; when the device information of the system problem generated in the context information corresponding to the two system problems is inconsistent, the fourth similarity score may be set to zero.
In step S1205, when the management system compares the similarity between the context information corresponding to the two system problems, the text of the cause of the system problem in the context information, the screenshot of the cause of the system problem, the device information generating the system problem, and the four-dimensional information of the device information generating the system problem respectively determine a first similarity score, a second similarity score, a third similarity score, and a fourth similarity score, and the determined first similarity score, second similarity score, third similarity score, and fourth similarity score determine the similarity between the context information corresponding to the two system problems.
In one embodiment, the step S1205 of determining the similarity between the context information corresponding to the two system problems based on the first similarity score, the second similarity score, the third similarity score and the fourth similarity score includes:
the similarity between two system problems is determined based on the following formula:
wherein α and β are normal numbers greater than 1, a is the first similarity score, B is the second similarity score, C is the third similarity score, D is the fourth similarity score, S is a similarity between two system problems, and e is a natural constant.
In this embodiment, in the process of determining whether two system problems are system problems corresponding to the same system error according to the similarity between the context information of the two system problems, when the function module information of the system problem or the device information of the system problem generated in the context information of the two system problems is inconsistent, it is indicated that the possibility that the two system problems may be system problems corresponding to the same system error is low, and therefore, two dimensions of the function module information of the system problem or the device information of the system problem may serve as a premise that the two system problems may be system problems corresponding to the same system error, and therefore, the third similarity score and the fourth similarity score have a large influence on the calculation of the similarity between the context information, and therefore, the corresponding weights are high. It should be noted that, when the function module information or the device information of the system problem that generates the system problem is consistent, it cannot be accurately determined that the two system problems are necessarily the system problems corresponding to the same system error, and therefore, it is determined that the two system problems may be the system problems corresponding to the same system error based on the text of the generated cause and the screenshot of the generated cause as a preferred method, and therefore, the first similarity score and the second similarity score have a smaller influence on the calculation of the similarity between the context information, and therefore, the corresponding weight is smaller.
Referring to fig. 1 again, in step S130, the management system compares the similarity between the context information corresponding to the system problem with a predetermined threshold to obtain a comparison result, where the comparison result is one of the similarity between the context information corresponding to the system problem being smaller than the predetermined threshold, the similarity between the context information corresponding to the system problem being equal to the predetermined threshold, and the similarity between the context information corresponding to the system problem being greater than the predetermined threshold.
In step S140, the management system determines whether the two system problems are corresponding to the same system error based on the comparison result.
Optionally, in an embodiment of the present invention, the step S140 of determining whether two system problems are system problems corresponding to a same system error based on the comparison result includes the following steps:
and if the comparison result indicates that the similarity between the context information corresponding to the two system problems is greater than or equal to a preset threshold, determining that the two system problems are the system problems corresponding to the same system error.
And if the comparison result shows that the similarity between the context information corresponding to the two system problems is smaller than a preset threshold value, determining that the two system problems are not the system problems corresponding to the same system error.
When the management system determines whether the two system problems are system problems corresponding to the same system error based on the comparison result, if the similarity between the context information corresponding to the two system problems is greater than or equal to a predetermined threshold, it indicates that the information included in the context information corresponding to the two system problems is basically consistent, and thus the possibility of the system problems corresponding to the same system error is high, and further, it is determined that the two system problems are system problems corresponding to the same system error; when the similarity between the context information corresponding to the two system problems is smaller than a preset threshold, it is indicated that the information included in the context information corresponding to the two system problems is large in difference, and the possibility that the system problem corresponding to the same system error is small, and it is determined that the two system problems are not the system problem corresponding to the same system error.
It can be seen from the above that, when two system problems are the system problems corresponding to the same system error, determining the similarity between the contexts corresponding to the two system problems according to the text for reflecting the reasons generated by the system problems, the screenshots of the reasons generated by the system problems, the information of the function modules generating the system problems, the information of the equipment generating the system problems and other multi-dimensional information in the context information corresponding to the two system problems, thereby accurately determining whether two system problems are corresponding to the same system error, and then whether the two system problems are the system problems corresponding to the same system error can be automatically judged, whether the two system problems are the systems generated by the system error can be identified without depending on the subjective experience of developers, and therefore the accuracy of identifying the same system problem is improved.
Referring to fig. 3, fig. 3 is a device for repeatedly recognizing a system error according to an embodiment of the present invention, where the device for repeatedly recognizing a system error may be integrated in the electronic device, and specifically may include an obtaining unit 110, a first executing unit 120, a comparing unit 130, and a second executing unit 140.
An obtaining unit 110, configured to obtain context information corresponding to two system problems based on the two system problems for comparison, where the context information includes a text of a reason generated by the system problem, a screenshot of the reason generated by the system problem, information of a function module generating the system problem, and information of a device generating the system problem;
a first executing unit 120, configured to determine a similarity between context information corresponding to two of the system problems.
The comparing unit 130 is configured to compare the similarity between the context information corresponding to the two system problems with a predetermined threshold to obtain a comparison result.
A second executing unit 140, configured to determine whether the two system problems are system problems corresponding to a same system error based on the comparison result.
Optionally, the first execution unit includes: a
The first comparison subunit is used for comparing texts of reasons of the system problems in the context information corresponding to the two system problems to determine a first similarity score;
the second comparison subunit is used for comparing screenshots of reasons of the system problems in the context information corresponding to the two system problems and determining a second similarity score;
the third comparison subunit is used for comparing the functional module information of the system problem generated in the context information corresponding to the two system problems and determining a third similarity score;
the fourth comparison subunit is used for comparing the device information of the system problem generated in the context information corresponding to the two system problems and determining a fourth similarity score;
a first execution subunit, configured to determine, based on the first similarity score, the second similarity score, the third similarity score, and the fourth similarity score, a similarity between context information corresponding to the two system problems.
Optionally, the execution subunit is specifically configured to determine a similarity between two system problems based on the following formula:
wherein α and β are normal numbers greater than 1, a is the first similarity score, B is the second similarity score, C is the third similarity score, D is the fourth similarity score, S is a similarity between context information corresponding to the two system problems, and e is a natural constant.
Optionally, the first similarity score is determined based on the following method:
respectively segmenting the title and the content in the text of the reasons of the system problems in the context information corresponding to the two system problems;
determining the occurrence frequency of each separated word in the separated words according to the title and the content in the text of the reason of the system problem in the context information corresponding to each system problem, and taking the words with the occurrence frequency higher than a preset occurrence frequency threshold value as the keywords in the reason of the system problem in the context information corresponding to the system problem;
and obtaining a first similarity score based on the number of keywords in the intersection of the keywords corresponding to the two system problems divided by the number of keywords in the union set of the keywords corresponding to the two system problems.
The second similarity score is determined based on the following method:
performing optical character recognition on screenshots of reasons of the system problems in the context information corresponding to the two system problems to obtain characters recognized from the screenshots;
and obtaining a second similarity score based on the number of coincident characters in the characters identified for the two system problems divided by the sum of the numbers of characters identified for the two system problems.
Alternatively,
the third similarity score is determined based on the following method:
comparing the functional module information generating the system problem in the context information corresponding to the two system problems;
if the information of the functional modules generating the system problems in the context information corresponding to the two system problems is consistent, acquiring a preset first score as a third similarity score;
if the information of the functional modules generating the system problems in the context information corresponding to the two system problems is inconsistent, determining that the third similarity score is zero;
the fourth similarity score is determined based on the following method:
comparing the equipment information generating the system problem in the context information corresponding to the two system problems;
if the device information of the system problem generated in the context information corresponding to the two system problems is consistent, acquiring a preset second score as a fourth similarity score;
and if the device information generating the system problem in the context information corresponding to the two system problems is inconsistent, determining that the fourth similarity score is zero.
Optionally, the second execution unit includes:
the second execution subunit is configured to determine that the two system problems are system problems corresponding to the same system error if the comparison result indicates that the similarity between the context information corresponding to the two system problems is greater than or equal to a predetermined threshold;
and the third execution subunit is configured to determine that the two system problems are not system problems corresponding to the same system error if the comparison result indicates that the similarity between the context information corresponding to the two system problems is smaller than a preset threshold.
The implementation process of the functions and actions of each module in the device is specifically described in the implementation process of the corresponding step in the method for repeatedly identifying the system error, and is not described herein again.
It should be noted that although in the above detailed description several modules or units of the device for action execution are mentioned, such a division is not mandatory. Indeed, the features and functionality of two or more modules or units described above may be embodied in one module or unit, according to embodiments of the present disclosure. Conversely, the features and functions of one module or unit described above may be further divided into embodiments by a plurality of modules or units.
Moreover, although the steps of the methods of the present disclosure are depicted in the drawings in a particular order, this does not require or imply that the steps must be performed in this particular order, or that all of the depicted steps must be performed, to achieve desirable results. Additionally or alternatively, certain steps may be omitted, multiple steps combined into one step execution, and/or one step broken down into multiple step executions, etc.
Through the above description of the embodiments, those skilled in the art will readily understand that the exemplary embodiments described herein may be implemented by software, or by software in combination with necessary hardware. Therefore, the technical solution according to the embodiments of the present disclosure may be embodied in the form of a software product, which may be stored in a non-volatile storage medium (which may be a CD-ROM, a usb disk, a removable hard disk, etc.) or on a network, and includes several instructions to enable a computing device (which may be a personal computer, a server, a mobile terminal, or a network device, etc.) to execute the method according to the embodiments of the present disclosure.
In an exemplary embodiment of the present disclosure, there is also provided a computer apparatus capable of implementing the above method.
As will be appreciated by one skilled in the art, aspects of the present invention may be embodied as a system, method or program product. Thus, various aspects of the invention may be embodied in the form of: an entirely hardware embodiment, an entirely software embodiment (including firmware, microcode, etc.) or an embodiment combining hardware and software aspects that may all generally be referred to herein as a "circuit," module "or" system.
Referring to fig. 4, fig. 4 is an electronic device 400 according to this embodiment of the present invention. The electronic device 400 shown in fig. 4 is only an example and should not bring any limitation to the function and the scope of use of the embodiments of the present invention.
As shown in fig. 4, electronic device 400 is embodied in the form of a general purpose computing device. The components of electronic device 400 may include, but are not limited to: the at least one processing unit 410, the at least one memory unit 420, and a bus 430 that couples various system components including the memory unit 420 and the processing unit 410.
Wherein the storage unit stores program code that is executable by the processing unit 410 to cause the processing unit 410 to perform steps according to various exemplary embodiments of the present invention as described in the above section "exemplary methods" of the present specification. For example, the processing unit 410 may perform step S110 as shown in fig. 1: acquiring context information corresponding to two system problems based on the two system problems for comparison, wherein the context information comprises a text of reasons generated by the system problems, a screenshot of the reasons generated by the system problems, information of a function module generating the system problems and information of equipment generating the system problems; step S120: determining the similarity between the context information corresponding to the two system problems; step S130: comparing the similarity between the context information corresponding to the two system problems with a preset threshold value to obtain a comparison result; step S140: and determining whether the two system problems are the system problems corresponding to the same system error or not based on the comparison result.
The storage unit 420 may include readable media in the form of volatile storage units, such as a random access memory unit (RAM)4201 and/or a cache memory unit 4202, and may further include a read only memory unit (ROM) 4203.
The storage unit 420 may also include a program/utility 4204 having a set (at least one) of program modules 4205, such program modules 4205 including, but not limited to: an operating system, one or more application programs, other program modules, and program data, each of which, or some combination thereof, may comprise an implementation of a network environment.
Bus 430 may be any bus representing one or more of several types of bus structures, including a memory unit bus or memory unit controller, a peripheral bus, an accelerated graphics port, a processing unit, or a local bus using any of a variety of bus architectures.
The electronic device 400 may also communicate with one or more external devices 600 (e.g., keyboard, pointing device, bluetooth device, etc.), with one or more devices that enable a user to interact with the electronic device 400, and/or with any devices (e.g., router, modem, etc.) that enable the electronic device 400 to communicate with one or more other computing devices. Such communication may occur via input/output (I/O) interfaces 440. Also, the electronic device 400 may communicate with one or more networks (e.g., a Local Area Network (LAN), a Wide Area Network (WAN), and/or a public network, such as the internet) via the network adapter 460. As shown, the network adapter 460 communicates with the other modules of the electronic device 400 over the bus 430. It should be appreciated that although not shown in the figures, other hardware and/or software modules may be used in conjunction with electronic device 400, including but not limited to: microcode, device drivers, redundant processing units, external disk drive arrays, RAID systems, tape drives, and data backup storage systems, among others.
Through the above description of the embodiments, those skilled in the art will readily understand that the exemplary embodiments described herein may be implemented by software, or by software in combination with necessary hardware. Therefore, the technical solution according to the embodiments of the present disclosure may be embodied in the form of a software product, which may be stored in a non-volatile storage medium (which may be a CD-ROM, a usb disk, a removable hard disk, etc.) or on a network, and includes several instructions to enable a computing device (which may be a personal computer, a server, a terminal device, or a network device, etc.) to execute the method according to the embodiments of the present disclosure.
In an exemplary embodiment of the present disclosure, there is also provided a computer-readable storage medium having stored thereon a program product capable of implementing the above-described method of the present specification. In some possible embodiments, aspects of the invention may also be implemented in the form of a program product comprising program code means for causing a terminal device to carry out the steps according to various exemplary embodiments of the invention described in the above section "exemplary methods" of the present description, when said program product is run on the terminal device.
Referring to fig. 5, a program product 500 for implementing the above method according to an embodiment of the present invention is described, which may employ a portable compact disc read only memory (CD-ROM) and include program code, and may be run on an electronic device, such as a personal computer. However, the program product of the present invention is not limited in this regard and, in the present document, a readable storage medium may be any tangible medium that can contain, or store a program for use by or in connection with an instruction execution system, apparatus, or device.
The program product may employ any combination of one or more readable media. The readable medium may be a readable signal medium or a readable storage medium. A readable storage medium may be, for example, but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any combination of the foregoing. More specific examples (a non-exhaustive list) of the readable storage medium include: an electrical connection having one or more wires, a portable disk, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing.
A computer readable signal medium may include a propagated data signal with readable program code embodied therein, for example, in baseband or as part of a carrier wave. Such a propagated data signal may take many forms, including, but not limited to, electro-magnetic, optical, or any suitable combination thereof. A readable signal medium may also be any readable medium that is not a readable storage medium and that can communicate, propagate, or transport a program for use by or in connection with an instruction execution system, apparatus, or device.
Program code embodied on a readable medium may be transmitted using any appropriate medium, including but not limited to wireless, wireline, optical fiber cable, RF, etc., or any suitable combination of the foregoing.
Program code for carrying out operations for aspects of the present invention may be written in any combination of one or more programming languages, including an object oriented programming language such as Java, C + + or the like and conventional procedural programming languages, such as the "C" programming language or similar programming languages. The program code may execute entirely on the user's computing device, partly on the user's device, as a stand-alone software package, partly on the user's computing device and partly on a remote computing device, or entirely on the remote computing device or server. In the case of a remote computing device, the remote computing device may be connected to the user computing device through any kind of network, including a Local Area Network (LAN) or a Wide Area Network (WAN), or may be connected to an external computing device (e.g., through the internet using an internet service provider).
Furthermore, the above-described figures are merely schematic illustrations of processes involved in methods according to exemplary embodiments of the invention, and are not intended to be limiting. It will be readily understood that the processes shown in the above figures are not intended to indicate or limit the chronological order of the processes. In addition, it is also readily understood that these processes may be performed synchronously or asynchronously, e.g., in multiple modules.
Other embodiments of the disclosure will be apparent to those skilled in the art from consideration of the specification and practice of the disclosure disclosed herein. This application is intended to cover any variations, uses, or adaptations of the disclosure following, in general, the principles of the disclosure and including such departures from the present disclosure as come within known or customary practice within the art to which the disclosure pertains. It is intended that the specification and examples be considered as exemplary only, with a true scope and spirit of the disclosure being indicated by the following claims.

Claims (10)

1. A method for duplicate identification of system errors, the method comprising:
acquiring context information corresponding to two system problems based on the two system problems for comparison, wherein the context information comprises a text of reasons generated by the system problems, a screenshot of the reasons generated by the system problems, information of a function module generating the system problems and information of equipment generating the system problems;
determining the similarity between the context information corresponding to the two system problems;
comparing the similarity between the context information corresponding to the two system problems with a preset threshold value to obtain a comparison result;
and determining whether the two system problems are the system problems corresponding to the same system error or not based on the comparison result.
2. The method of claim 1, wherein the step of determining the similarity between the context information corresponding to the two system questions comprises:
comparing texts of reasons of the system problems in the context information corresponding to the two system problems, and determining a first similarity score;
comparing screenshots of reasons of the system problems in the context information corresponding to the two system problems, and determining a second similarity score;
comparing the functional module information of the system problem generated in the context information corresponding to the two system problems, and determining a third similarity score;
comparing the device information of the system problem generated in the context information corresponding to the two system problems, and determining a fourth similarity score;
and determining the similarity between the context information corresponding to the two system problems based on the first similarity score, the second similarity score, the third similarity score and the fourth similarity score.
3. The method of claim 2, wherein determining the similarity between the context information corresponding to the two system questions based on the first similarity score, the second similarity score, the third similarity score, and the fourth similarity score comprises:
the similarity between two system problems is determined based on the following formula:
wherein α and β are normal numbers greater than 1, a is the first similarity score, B is the second similarity score, C is the third similarity score, D is the fourth similarity score, S is a similarity between context information corresponding to the two system problems, and e is a natural constant.
4. The method of claim 2, wherein the first similarity score is determined based on:
respectively segmenting the title and the content in the text of the reasons of the system problems in the context information corresponding to the two system problems;
determining the occurrence frequency of each separated word in the separated words according to the title and the content in the text of the reason of the system problem in the context information corresponding to each system problem, and taking the words with the occurrence frequency higher than a preset occurrence frequency threshold value as the keywords in the reason of the system problem in the context information corresponding to the system problem;
and obtaining a first similarity score based on the number of keywords in the intersection of the keywords corresponding to the two system problems divided by the number of keywords in the union set of the keywords corresponding to the two system problems.
5. The method of claim 2, wherein the second similarity score is determined based on:
performing optical character recognition on screenshots of reasons of the system problems in the context information corresponding to the two system problems to obtain characters recognized from the screenshots;
and obtaining a second similarity score based on the number of coincident characters in the characters identified for the two system problems divided by the sum of the numbers of characters identified for the two system problems.
6. The method of claim 2, wherein the third similarity score is determined based on:
comparing the functional module information generating the system problem in the context information corresponding to the two system problems;
if the information of the functional modules generating the system problems in the context information corresponding to the two system problems is consistent, acquiring a preset first score as a third similarity score;
if the information of the functional modules generating the system problems in the context information corresponding to the two system problems is inconsistent, determining that the third similarity score is zero;
the fourth similarity score is determined based on the following method:
comparing the equipment information generating the system problem in the context information corresponding to the two system problems;
if the device information of the system problem generated in the context information corresponding to the two system problems is consistent, acquiring a preset second score as a fourth similarity score;
and if the device information generating the system problem in the context information corresponding to the two system problems is inconsistent, determining that the fourth similarity score is zero.
7. The method according to claim 1, wherein the step of determining whether the two system problems are corresponding to the same system error based on the comparison result comprises:
if the comparison result indicates that the similarity between the context information corresponding to the two system problems is greater than or equal to a preset threshold value, determining that the two system problems are system problems corresponding to the same system error;
and if the comparison result shows that the similarity between the context information corresponding to the two system problems is smaller than a preset threshold value, determining that the two system problems are not the system problems corresponding to the same system error.
8. An apparatus for duplicate recognition of system errors, the apparatus comprising:
the device comprises an acquisition unit, a comparison unit and a comparison unit, wherein the acquisition unit is used for acquiring context information corresponding to two system problems based on the two system problems for comparison, and the context information comprises a text of reasons generated by the system problems, a screenshot of the reasons generated by the system problems, information of a function module generating the system problems and information of equipment generating the system problems;
the first execution unit is used for determining the similarity between the context information corresponding to the two system problems;
the comparison unit is used for comparing the similarity between the context information corresponding to the two system problems with a preset threshold value to obtain a comparison result;
and the second execution unit is used for determining whether the two system problems are the system problems corresponding to the same system error or not based on the comparison result.
9. An electronic device comprising a memory and a processor, the memory having stored therein computer-readable instructions that, when executed by the processor, cause the processor to perform the method of any of claims 1-7.
10. A storage medium storing computer-readable instructions that, when executed by one or more processors, cause the one or more processors to perform the method of any one of claims 1-7.
CN201910708623.7A 2019-08-01 2019-08-01 Method and related device for repeatedly identifying system errors Pending CN110618888A (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN201910708623.7A CN110618888A (en) 2019-08-01 2019-08-01 Method and related device for repeatedly identifying system errors
PCT/CN2019/117684 WO2021017288A1 (en) 2019-08-01 2019-11-12 Method and apparatus for repeatedly identifying system error, electronic device, and computer-readable storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910708623.7A CN110618888A (en) 2019-08-01 2019-08-01 Method and related device for repeatedly identifying system errors

Publications (1)

Publication Number Publication Date
CN110618888A true CN110618888A (en) 2019-12-27

Family

ID=68921392

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910708623.7A Pending CN110618888A (en) 2019-08-01 2019-08-01 Method and related device for repeatedly identifying system errors

Country Status (2)

Country Link
CN (1) CN110618888A (en)
WO (1) WO2021017288A1 (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106201757A (en) * 2016-07-12 2016-12-07 网易(杭州)网络有限公司 A kind of abnormality eliminating method and device
CN106708704A (en) * 2016-12-23 2017-05-24 北京奇虎科技有限公司 Method and device for classifying crash logs
CN107241212A (en) * 2017-04-20 2017-10-10 努比亚技术有限公司 A kind of log processing method and device, equipment
CN108427714A (en) * 2018-02-02 2018-08-21 北京邮电大学 The source of houses based on machine learning repeats record recognition methods and system
CN108460098A (en) * 2018-02-01 2018-08-28 北京百度网讯科技有限公司 Information recommendation method, device and computer equipment

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9747564B1 (en) * 2016-02-12 2017-08-29 The Boeing Company Aircraft maintenance and inspection with data analytics enhancement
US20170278067A1 (en) * 2016-03-25 2017-09-28 International Business Machines Corporation Monitoring activity to detect potential user actions
CN107167497B (en) * 2017-06-27 2023-08-08 北京必可测科技股份有限公司 Equipment fault detection method and system

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106201757A (en) * 2016-07-12 2016-12-07 网易(杭州)网络有限公司 A kind of abnormality eliminating method and device
CN106708704A (en) * 2016-12-23 2017-05-24 北京奇虎科技有限公司 Method and device for classifying crash logs
CN107241212A (en) * 2017-04-20 2017-10-10 努比亚技术有限公司 A kind of log processing method and device, equipment
CN108460098A (en) * 2018-02-01 2018-08-28 北京百度网讯科技有限公司 Information recommendation method, device and computer equipment
CN108427714A (en) * 2018-02-02 2018-08-21 北京邮电大学 The source of houses based on machine learning repeats record recognition methods and system

Also Published As

Publication number Publication date
WO2021017288A1 (en) 2021-02-04

Similar Documents

Publication Publication Date Title
CN107992596B (en) Text clustering method, text clustering device, server and storage medium
CN109002510B (en) Dialogue processing method, device, equipment and medium
US11282524B2 (en) Text-to-speech modeling
US20180342241A1 (en) Method and Apparatus of Recognizing Field of Semantic Parsing Information, Device and Readable Medium
CN109783490B (en) Data fusion method and device, computer equipment and storage medium
CN109871315B (en) Diagnosis method and device for system upgrade failure based on machine learning
CN109815147B (en) Test case generation method, device, server and medium
CN108776696B (en) Node configuration method and device, storage medium and electronic equipment
CN110245232B (en) Text classification method, device, medium and computing equipment
CN109947924B (en) Dialogue system training data construction method and device, electronic equipment and storage medium
CN111079408B (en) Language identification method, device, equipment and storage medium
CN111090641A (en) Data processing method and device, electronic equipment and storage medium
US20220301547A1 (en) Method for processing audio signal, method for training model, device and medium
CN109872230B (en) Test method and device of financial data analysis system, medium and electronic equipment
CN111598122B (en) Data verification method and device, electronic equipment and storage medium
CN110705235A (en) Information input method and device for business handling, storage medium and electronic equipment
CN110704390A (en) Method, device, electronic equipment and medium for obtaining server maintenance script
CN112363814A (en) Task scheduling method and device, computer equipment and storage medium
CN116881971A (en) Sensitive information leakage detection method, device and storage medium
US20230070966A1 (en) Method for processing question, electronic device and storage medium
EP4246365A1 (en) Webpage identification method and apparatus, electronic device, and medium
CN111063445A (en) Feature extraction method, device, equipment and medium based on medical data
CN110716859A (en) Method for automatically pushing test cases for modified codes and related device
CN110618888A (en) Method and related device for repeatedly identifying system errors
CN112307183B (en) Search data identification method, apparatus, electronic device and computer storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20191227

RJ01 Rejection of invention patent application after publication