Disclosure of Invention
Based on this, it is necessary to provide a method and an apparatus for processing web page error information, a computer device, and a storage medium, for solving the problem that the offline repair processing of new error information of web pages takes a long time.
A method for processing webpage error information, the method comprising:
crawling error information in a webpage;
when prompt information matched with the error information does not exist in the standard library, matching the crawled error information with characters in a high-level filter library to obtain a first matching result;
matching the crawled error information with characters in a basic filter library to obtain a second matching result;
comparing the first matching result with the second matching result;
when the first matching result does not completely contain the second matching result, splicing the first matching result and the second matching result to obtain a target matching result;
acquiring prompt information of the crawled error information according to the target matching result;
and returning the prompt information to the webpage.
In one embodiment, the step of obtaining the prompt information of the crawled error information according to the target matching result includes:
receiving an adjustment instruction aiming at the target matching result;
when the adjusting instruction represents confirmation, acquiring prompt information of the crawled error information according to the target matching result;
and when the adjusting instruction represents adjustment, adjusting the target matching result according to the adjusting instruction, and acquiring prompt information of the crawled error information according to the adjusted target matching result.
In one embodiment, the method further comprises:
storing the prompt information obtained according to the target matching result and the corresponding error information in a supplementary library in a correlation manner;
when the standard library does not have prompt information matched with the error information, matching the crawled error information with characters in a high-level filter library to obtain a first matching result, wherein the step comprises the following steps of:
when prompt information matched with the error information does not exist in the standard library, matching the crawled error information with the stored error information in the supplementary library;
when the crawled error information is successfully matched with the stored error information in the supplementary library, acquiring prompt information corresponding to the successfully matched stored error information;
returning the prompt information to the webpage;
and when the matching of the crawled error information and the stored error information in the supplementary library fails, continuing to match the error information with characters in a high-level filter library to obtain a first matching result.
In one embodiment, the method further comprises:
and when the first matching result completely comprises the second matching result, taking the first matching result as the target matching result.
In one embodiment, the method further comprises:
and when no character exists in the first matching result, taking the second matching result as the target matching result.
In one embodiment, the step of comparing the first matching result with the second matching result comprises:
when characters exist in the first matching result, matching each character in the first matching result through each character in the second matching result;
when each character in the second matching result has a matched character in the first matching result, returning a result that the first matching result completely contains the second matching result;
and when at least one character in the second matching result does not have a matched character in the first matching result, returning a result that the first matching result does not completely contain the second matching result.
A web page error information processing apparatus, the apparatus comprising:
the crawling module is used for crawling error information in the webpage;
the matching module is used for matching the crawled error information with characters in a high-level filter library to obtain a first matching result when prompt information matched with the error information does not exist in the standard library; matching the crawled error information with characters in a basic filter library to obtain a second matching result;
a comparison module for comparing the first matching result with the second matching result; when the first matching result does not completely contain the second matching result, splicing the first matching result and the second matching result to obtain a target matching result;
the acquisition module is used for acquiring prompt information of the crawled error information according to the target matching result;
and the return module is used for returning the prompt information to the webpage.
In one embodiment, the obtaining module includes:
a receiving unit, configured to receive an adjustment instruction for the target matching result;
the processing unit is used for acquiring prompt information of the crawled error information according to the target matching result when the adjusting instruction indicates confirmation; and when the adjusting instruction represents adjustment, adjusting the target matching result according to the adjusting instruction, and acquiring prompt information of the crawled error information according to the adjusted target matching result.
A computer device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, the processor implementing the following steps when executing the program:
crawling error information in a webpage;
when prompt information matched with the error information does not exist in the standard library, matching the crawled error information with characters in a high-level filter library to obtain a first matching result;
matching the crawled error information with characters in a basic filter library to obtain a second matching result;
comparing the first matching result with the second matching result;
when the first matching result does not completely contain the second matching result, splicing the first matching result and the second matching result to obtain a target matching result;
acquiring prompt information of the crawled error information according to the target matching result;
and returning the prompt information to the webpage.
A computer-readable storage medium, on which a computer program is stored which, when being executed by a processor, carries out the steps of:
crawling error information in a webpage;
when prompt information matched with the error information does not exist in the standard library, matching the crawled error information with characters in a high-level filter library to obtain a first matching result;
matching the crawled error information with characters in a basic filter library to obtain a second matching result;
comparing the first matching result with the second matching result;
when the first matching result does not completely contain the second matching result, splicing the first matching result and the second matching result to obtain a target matching result;
acquiring prompt information of the crawled error information according to the target matching result;
and returning the prompt information to the webpage.
When the error information exists in the webpage and the standard library does not have the prompt information corresponding to the error information, namely the error information is new error information, the error information is matched through characters in the high-level filter library and characters in the basic filter library to determine the target matching result of the error information, so that the prompt information aiming at the error information can be obtained, after the prompt information is returned to the webpage to be displayed, a user can correspondingly modify according to the prompt information to continue webpage operation, wherein the target matching result of the error information determined through the high-level filter library can improve the matching accuracy, and the target matching result of the error information determined through the basic filter library can improve the universal applicability of the special crawler analysis system, therefore, when new error information exists in the webpage, the target matching result of the error information can be determined through the high-level filter library and the basic filter library, offline repair is not required, the process of researching and developing versions is tested, and time is saved.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.
Before describing in detail embodiments that are in accordance with the present invention, it should be observed that the embodiments reside primarily in combinations of steps and system components related to web page error information processing methods, apparatus, computer devices, and storage media. Accordingly, the system components and method steps have been represented where appropriate by conventional symbols in the drawings, showing only those details that are pertinent to understanding the embodiments of the present invention so as not to obscure the disclosure with details that will be readily apparent to those of ordinary skill in the art having the benefit of the description herein.
In this document, relational terms such as left and right, top and bottom, front and back, first and second, and the like may be used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions. The terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus.
Referring to fig. 1, fig. 1 is a schematic structural diagram of a web page error information processing system in an embodiment, the web page error information processing system includes a dedicated crawler parsing terminal and a plurality of user terminals, wherein the dedicated crawler parsing terminal and the plurality of user terminals can communicate, the dedicated crawler parsing terminal can crawl error information on the plurality of user terminals, for example, when the user terminal is at a login interface, a user inputs a user name and a password to the user terminal, when the password is input incorrectly, an interface of the user terminal displays a word of initial information "password error", the dedicated crawler parsing terminal crawls the word of initial information "password error" displayed on the interface of the user terminal at the time, then obtains a prompt message "input password does not match correct password" corresponding to "password error", and please re-input ", and finally the dedicated crawler parsing terminal does not match the prompt message" input password with correct password ", please re-input "return to the user terminal for display.
Referring to fig. 2, fig. 2 is a flowchart illustrating a method for processing a web page error message according to an embodiment, which is applied to a dedicated crawler parsing terminal in the web page error message processing system shown in fig. 1. The special crawler analysis terminal runs a webpage error information processing program, and the webpage error information processing method is implemented through the special crawler analysis terminal control program. The method specifically comprises the following steps:
s202: and crawling error information in the webpage.
Specifically, the error information is information for prompting that there is an error in the user operation, for example, "user name input error" is generated when the user logs in a certain website or client, or the data format input by the user is wrong when the user fills in a certain form, for example, the format should be "year-month-day" when the user fills in a date, but the format input by the user is "month-day-year", and the screen of the user terminal displays the error information of "format error". The web page herein refers to a broad web page, including a web page displayed on a browser interface, and a web page displayed on a client app (application).
When error information exists in the webpage, the special crawler analysis terminal can crawl the error information displayed in the webpage so as to perform subsequent analysis work, and therefore unified prompt information can be output on the webpage, a user can accurately know the place needing to be changed, and how to operate the webpage next.
S204: and when the prompt information matched with the error information does not exist in the standard library, matching the crawled error information with the characters in the high-level filter library to obtain a first matching result.
Specifically, the standard library refers to a database storing matching relationships between error information and prompt information, and is generally obtained by sorting the existing error information and prompt information, that is, a basic library of error information and prompt information. The prompt information of the corresponding error information can be obtained through the standard library aiming at the error information in different webpages, so that the prompt information displayed by all the webpages aiming at the same error is consistent no matter what webpage a user logs in, the user can modify the error and continue subsequent operation, and the situation that the user does not know how to perform the next operation due to the fact that different webpages generate various prompt information aiming at the same error is prevented.
The characters in the high-level filter library are mainly complex characters, for example, the characters can be phrases or short sentences, such as "account exception", "password reset", "control cannot be displayed", and the like, and the matching precision can be improved due to the existence of the high-level filter library.
When the prompt information matched with the error information exists in the standard library, namely the error information is already in the standard library and the corresponding prompt information exists, the special crawler analysis terminal sends the prompt information to the user terminal for displaying, so that the user can know how to proceed on the next step, and the user can conveniently continue to perform subsequent operations.
When the standard library does not have prompt information matched with the error information, namely the error information is a new error and does not have corresponding standard prompt information, the special crawler analysis terminal needs to determine the prompt information of the error information, namely the error information is matched with characters in the high-level filter library, and when the characters corresponding to the error information exist in the high-level filter library, the error information is stored in a first matching result.
S206: and matching the crawled error information with the characters in the basic filter library to obtain a second matching result.
Specifically, the characters in the basic filter library are mainly simple characters, and may be, for example, a single word or phrase, such as "online banking," "client," "control," "reset," "exception," "unable to display," "page," and the like, and the existence of the basic filter library may improve the general applicability of the special crawler parsing system.
When the standard library does not have prompt information matched with the error information, namely the error information is a new error and does not have corresponding standard prompt information, the special crawler analysis terminal needs to determine the prompt information of the error information, namely the error information is matched with characters in the basic filter library, and when the characters corresponding to the error information exist in the basic filter library, the special crawler analysis terminal stores the prompt information in a second matching result. Optionally, the error information may be matched with the characters in the high-level filter library first, or may be matched with the characters in the basic filter library first, or may be matched with the characters in the high-level filter library and the basic filter library simultaneously through two threads.
S208: the first matching result is compared with the second matching result.
Specifically, since the first matching result is a result of matching the error information with the characters in the higher-level filter library and the second matching result is a result of matching the error information with the characters in the base filter library, there may be a difference between the first matching result and the second matching result, and thus the first matching result is compared with the second matching result to obtain an accurate target matching result.
S210: and when the first matching result does not completely contain the second matching result, splicing the first matching result and the second matching result to obtain a target matching result.
Specifically, when there is an intersection between the first matching result and the second matching result, but there are different vocabularies in the first matching result and the second matching result, or when there is no intersection between the first matching result and the second matching result, but there are different vocabularies in the first matching result and the second matching result, it indicates that the first matching result does not completely include the second matching result, so that the target matching result can be obtained by splicing the first matching result and the second matching result, that is, the aggregate of the first matching result and the second matching result is obtained as the target matching result, thereby ensuring the integrity of the final target matching result.
For example, when the error message is "account exception, please go to counter reset", for example, the basic filter library is simple characters such as "online banking", "customer", "control", "reset", "exception", "no display", "page". The high-level filter library is mainly complex characters such as 'account abnormity', 'password reset', and 'control cannot be displayed'. The first matching result obtained by the high-level filter library is 'account abnormal', and the second matching result obtained by the basic filter library is 'abnormal' and 'reset'. Therefore, the first matching result does not completely contain the second matching result, and the target matching result is a collection of the first matching result and the second matching result, namely, the account is abnormal and the target matching result is reset.
S212: and acquiring prompt information of the crawled error information according to the target matching result.
Specifically, the prompt information corresponding to the error information may be obtained through the target matching result, for example, a matching code corresponding to the target matching result may be obtained first through the target matching result, and then the corresponding prompt information may be obtained according to the matching code, where the matching code is set to classify the prompt information, so as to facilitate query of the prompt information, for example, the matching code beginning with "1" indicates a password error, the matching code beginning with "2" indicates a verification code error, the matching code beginning with "4" indicates an "account error", and the like, where the matching code beginning with "4" 4217 "indicates the prompt information" requiring setting account information again "corresponding to the error of the" 217 "type under the" account error "type. For example, in the above embodiment, the target matching result is "account exception" or "reset", the matching code corresponding to the queried target matching result should be 4217, and the corresponding prompt message is "account information needs to be reset".
S214: and returning the prompt information to the webpage.
Specifically, after the corresponding prompt information is acquired, the prompt information may be sent to the web page, so that the web page may display the prompt information, so that the user can know how to perform the next operation, for example, when the prompt information displayed on the web page is "account information needs to be reset", the user resets the account information, and then the next operation may be performed.
When the error information exists in the webpage and the standard library has no prompt information corresponding to the error information, namely the error information is new error information, the error information is matched through characters in the high-level filter library and characters in the basic filter library to determine a target matching result of the error information, so that the prompt information aiming at the error information can be obtained, after the prompt information is returned to the webpage to be displayed, a user can correspondingly modify according to the prompt information to continue webpage operation, wherein the target matching result of the error information determined through the high-level filter library can improve the matching accuracy, the target matching result of the error information determined through the basic filter library can improve the universal applicability of the special crawler analysis system, so that when the new error information exists in the webpage, the target matching result of the error information can be determined through the high-level filter library and the basic filter library, offline repair is not required, the process of research and development version testing is carried out, and time is saved.
In one embodiment, please refer to fig. 3, fig. 3 is a flowchart of step S212 of the embodiment shown in fig. 2, and the step S212 of obtaining the prompt information of the crawled error information according to the target matching result may include:
s302: and receiving an adjusting instruction aiming at the target matching result.
Specifically, the adjustment instruction for the target matching result refers to an adjustment instruction input by a worker to the dedicated crawler analysis terminal, and the instruction may indicate confirmation, that is, accepting the target matching result obtained by the dedicated crawler analysis terminal, or may indicate adjustment, that is, adjusting the target matching result obtained by the dedicated crawler analysis terminal, and may also indicate withdrawal, that is, rejecting the target matching result obtained by the dedicated crawler analysis terminal.
S304: and when the adjustment instruction indicates confirmation, acquiring prompt information of the crawled error information according to the target matching result.
Specifically, when the adjustment instruction indicates confirmation, the prompt information corresponding to the error information may be obtained through the target matching result, which may specifically refer to the above definition, and is not described herein again.
S306: and when the adjusting instruction indicates adjustment, adjusting the target matching result according to the adjusting instruction, and acquiring prompt information of the crawled error information according to the adjusted target matching result.
Specifically, when the adjustment instruction indicates adjustment, that is, the target matching result given by the special crawler analysis terminal has an error, the staff adjusts the target matching result. Referring to fig. 4, fig. 4 is a schematic diagram of a target matching result in an embodiment, where the target matching result includes original error information "account is abnormal, please go to counter to reset", a matching code "4217", and target matching result "account is abnormal", "reset", and a "confirm", "adjust", and "withdraw" button, so that a worker may input an adjustment instruction through the confirm "," adjust ", and" withdraw "buttons, for example, when the worker clicks" adjust ", the worker may adjust the target matching result" account is abnormal "," reset ", for example, a" counter "character is added, so that the adjusted target matching result is" account is abnormal "," counter ", and" reset ". The special crawler analysis terminal obtains a corresponding matching code, such as 4218, according to the adjusted target matching result, namely ' account abnormity ', ' counter ' and ' reset ', and then obtains corresponding prompt information ' account information needing to be reset in the counter ' according to the matching code '. And finally, the special crawler analysis terminal returns the prompt message 'account information needs to be reset to the counter' to the webpage, so that the webpage can display the prompt message, and a user can know how to perform the next operation, for example, when the prompt message displayed on the webpage is 'account information needs to be reset to the counter', the user can perform the next operation by resetting the account information to the counter.
In the above embodiment, the adjustment instruction of the worker is quoted, and the target matching result obtained by the special crawler analysis terminal aiming at the new error automatic matching is adjusted, so that the target matching result is more accurate, and the adaptability of the special crawler analysis terminal is stronger.
In one embodiment, the error information processing method may further include the step of storing the prompt information acquired according to the target matching result and the corresponding error information in the supplementary library in an associated manner. The prompt information matched for the new errors is stored in the supplement library in an associated mode through the special crawler analysis terminal, and when the target matching results of the new errors are adjusted, the prompt information corresponding to the adjusted target matching results and the corresponding error information are stored in the supplement library in an associated mode.
Referring to fig. 5, fig. 5 is a flowchart of step S204 in the embodiment shown in fig. 2, in this embodiment, the step S204, when there is no prompt information matching the error information in the standard library, matching the crawled error information with the characters in the high-level filter library to obtain a first matching result, may include:
s502: and when the prompt information matched with the error information does not exist in the standard library, matching the crawled error information with the error information stored in the supplementary library.
Specifically, when the standard library does not have the prompt information matched with the error information, that is, the error information is a new issue error, and there is no corresponding standard prompt information, but because the special crawler analysis terminal has matched the prompt information for some new issue errors, and stores the prompt information and the new issue error in the supplement library, in order to save time, the special crawler analysis terminal firstly queries the supplement library according to the error information, that is, when the error information exists in the supplement library, the special crawler analysis terminal directly obtains the corresponding prompt information according to the supplement library, and does not need to match the error information through the high-level filter library and the basic filter library to obtain a target matching result, and time is saved in a complicated step of obtaining the corresponding prompt information according to the target matching result.
S504: and when the crawled error information is successfully matched with the stored error information in the supplementary library, acquiring prompt information corresponding to the successfully matched stored error information.
Specifically, the new error is already stored in the supplement library, so that a complicated step of obtaining corresponding prompt information according to a target matching result is no longer needed by matching the error information through a high-level filter library and a basic filter library to obtain the target matching result, but prompt information associated with the crawled error information in the supplement library is directly obtained.
For example, the last new error is "account exception, please reset counter", the matched prompt message of the special crawler analysis terminal is "account information needs to be reset", the new error stored in the supplement library after the adjustment of the staff is "account exception, please reset counter", and the prompt message is "account information needs to be reset counter". When newly-sent errors, namely account abnormity and counter reset, are stored, adjustment is not needed through the high-level filter library, the basic filter library and workers, only the prompt message of 'account abnormity and counter reset' in the supplement library needs to be inquired, namely account information needs to be reset at the counter, and time is saved.
S506: and returning the prompt information to the webpage.
Specifically, after the corresponding prompt information is queried in the supplementary library, the prompt information is directly returned to the webpage, so that the webpage can display the prompt information, and the user can know how to perform the next operation, for example, when the prompt information displayed on the webpage is "account information needs to be reset at counter", the user resets the account information, and then the next operation can be performed.
S508: and when the matching of the crawled error information and the stored error information in the supplementary library fails, continuing to match the error information with the characters in the high-level filter library to obtain a first matching result.
Specifically, when the matching between the crawled error information and the stored error information in the supplement library fails, that is, the error information is not stored in the standard library and the supplement library, that is, the error information is a new error, it is necessary to match appropriate prompt information for the error information through the high-level filter library and the basic filter library.
In the embodiment, after the prompt information is matched for the error information through the high-level filter library and the basic filter library, the prompt information and the corresponding error information are stored in the supplement library in an associated manner, and after a worker adjusts the prompt information, the adjusted prompt information and the corresponding error information are stored in the supplement library in an associated manner, so that when a new error occurs in a webpage, the prompt information matched with the new error is firstly obtained according to the supplement library without being adjusted through the high-level filter library, the basic filter library and the worker, and the time is saved.
In one embodiment, when the first matching result completely contains the second matching result, the first matching result is taken as the target matching result. In one embodiment, when no character exists in the first matching result, the second matching result is used as the target matching result.
Specifically, complex characters are stored in the high-level filter library, and the matched result is more accurate. For example, the original information is input as < frame name ═ result "> < h2class ═ age ═ version ═ visibility: none" >, an error prompt </h2> < br class ═ error flag "> user account exception, please reset the password </br > < td > </frame >. And matching the result of 'account' & 'abnormity' & 'password' & 'setting out by using a basic filter library, and matching and outputting the result of' account abnormity '&' resetting '&' setting password by using a high-level filter library. The matching result of the basic filter library is 'account abnormity, and password is set'. Matching results of the high-level filter library: "account exception, reset password". At this time, the matching result of the high-level filter bank includes the matching result of the basic filter bank. The matching results of the high-level filter bank are employed. The final target matching result is 'account abnormal, reset password'. If the matching result of the basic filter library is not completely contained by the matching result of the advanced filter library, the excessive parts of the matching result of the basic filter library and the matching result of the advanced filter library are combined for use.
Although the high-level filter library has high precision, the complex vocabulary group is easy to cause unsuccessful matching due to the variation of individual words. That is, there is no character in the first matching result, for example, when the input information is "account status is abnormal, please go to counter to set new password", the high-level filter library "account abnormal" and "set password" will both fail to match. The basic filter bank is necessary to use at this time. The output "account" & "exception" & "setting" & "password" may be obtained using the base filter bank. The use of a base filter increases the universality of the system.
Words of the high-layer filter library and the basic filter library can be changed, namely, added, replaced and deleted, so that the adaptability of the filter is improved. For example, the number of times of outputting the statistical error information may be acquired, and when the number of times reaches a certain value, the prompt information corresponding to the error information is added to the high-level filter library.
In the above embodiment, the first matching result is an accurate result obtained by matching through the high-level filter library, the second matching result is a general result obtained by matching through the basic filter library, and the final target matching result is determined by judging the relationship between the first matching result and the second matching result, so that the accuracy of the target matching result is improved.
Referring to fig. 6, fig. 6 is a flowchart of step S208 in the embodiment shown in fig. 2, and the step S208 of comparing the first matching result with the second matching result may include:
s602: and when the characters exist in the first matching result, matching each character in the first matching result through each character in the second matching result.
Specifically, when there is a character in the first matching result, i.e., there is a corresponding matching result in the first matching pass, for example, when the error information is 'user account abnormal, please reset password', the first matching result matched by the high-level filter library is 'account abnormal, reset password', the second matching result matched by the basic filter library is 'account abnormity, password setting', each character in the first matching result is demarked by each character in the second matching result, for example, the "account" in the second matching result exists in the first matching result, the "exception" in the second matching result also exists in the first matching result, the "setting" in the second matching result also exists in the first matching result, the "password" in the second matching result also exists in the first matching result, but the "new" in the first matching result does not exist in the second matching result.
S604: and when each character in the second matching result has a matched character in the first matching result, returning a result that the first matching result completely contains the second matching result.
Specifically, in the above embodiment, when the first matching result matched by the high-level filter library is identical to the second matching result, or in the above example, the two only difference points are "re" and the "re" character is included in the first matching result, the first matching result is considered to be completely included in the second matching result.
S606: and when at least one character in the second matching result does not have a matched character in the first matching result, returning a result that the first matching result does not completely contain the second matching result.
Specifically, in the above embodiment, when the first matching result matched by the higher filter library is "reset", and the second matching result matched by the base filter library is "account abnormal, reset", that is, the first matching result and the second matching result are only in the same character "reset", and the character "account abnormal" is only present in the second matching result, so that the first matching result does not completely include the second matching result.
In the above embodiment, the first matching result is an accurate result obtained by matching through the high-level filter library, the second matching result is a general result obtained by matching through the basic filter library, and the final target matching result is determined by judging the relationship between the first matching result and the second matching result, so that the accuracy of the target matching result is improved.
Referring to fig. 7, a schematic structural diagram of a web page error information processing apparatus is provided, where the web page error information processing apparatus includes:
and the crawling module 100 is used for crawling error information in the webpage.
The matching module 200 is configured to match the crawled error information with characters in a high-level filter library to obtain a first matching result when prompt information matched with the error information does not exist in the standard library; and matching the crawled error information with the characters in the basic filter library to obtain a second matching result.
A comparison module 300 for comparing the first matching result with the second matching result; and when the first matching result does not completely contain the second matching result, splicing the first matching result and the second matching result to obtain a target matching result.
And an obtaining module 400, configured to obtain prompt information of the crawled error information according to the target matching result.
And a returning module 500, configured to return the prompt message to the web page.
In one embodiment, the obtaining module may include:
and the receiving unit is used for receiving an adjusting instruction aiming at the target matching result.
The processing unit is used for acquiring prompt information of the crawled error information according to a target matching result when the adjustment instruction represents confirmation; and when the adjusting instruction indicates adjustment, adjusting the target matching result according to the adjusting instruction, and acquiring prompt information of the crawled error information according to the adjusted target matching result.
In one embodiment, the apparatus may further include:
and the supplement module is used for storing the prompt information acquired according to the target matching result and the corresponding error information into a supplement library in a correlation manner.
The matching module is also used for matching the crawled error information with the stored error information in the supplementary library when the prompt information matched with the error information does not exist in the standard library.
The acquisition module is further used for acquiring prompt information corresponding to the stored error information which is successfully matched when the crawled error information is successfully matched with the stored error information in the supplementary library.
The return module is also used for returning the prompt message to the webpage.
And the comparison module is also used for continuously matching the error information with the characters in the high-level filter library to obtain a first matching result when the matching of the crawled error information and the stored error information in the supplement library fails.
In one embodiment, the comparison module may be further configured to take the first matching result as the target matching result when the first matching result completely contains the second matching result.
In one embodiment, the comparison module may be further configured to take the second matching result as the target matching result when no character exists in the first matching result.
In one embodiment, the comparison module may include:
and the matching unit is used for matching each character in the first matching result through each character in the second matching result when the character exists in the first matching result.
The output unit is used for returning the result that the first matching result completely contains the second matching result when each character in the second matching result has a matched character in the first matching result; and when at least one character in the second matching result does not have a matched character in the first matching result, returning a result that the first matching result does not completely contain the second matching result.
For the above limitation on the web page error information processing apparatus, reference may be made to the above specific limitation on the web page error information processing method, which is not described herein again.
Referring to fig. 8, fig. 8 is a schematic structural diagram of a dedicated crawler parsing terminal in an embodiment, where the dedicated crawler parsing terminal may be a conventional server or any other suitable computer device, and includes a memory, a processor, and a computer program stored in the memory and executable on the processor, where the memory may include a nonvolatile storage medium and an internal memory, and the computer program may be stored in the nonvolatile storage medium, and when the processor executes the program, the processor implements the following steps: crawling error information in a webpage; when prompt information matched with the error information does not exist in the standard library, matching the crawled error information with characters in a high-level filter library to obtain a first matching result; matching the crawled error information with characters in a basic filter library to obtain a second matching result; comparing the first matching result with the second matching result; when the first matching result does not completely contain the second matching result, splicing the first matching result and the second matching result to obtain a target matching result; according to the target matching result, prompt information of the crawled error information is obtained; and returning the prompt information to the webpage.
In one embodiment, the processor when executing the program may further implement the following steps: receiving an adjusting instruction aiming at a target matching result; when the adjustment instruction indicates confirmation, acquiring prompt information of the crawled error information according to a target matching result; and when the adjusting instruction indicates adjustment, adjusting the target matching result according to the adjusting instruction, and acquiring prompt information of the crawled error information according to the adjusted target matching result.
In one embodiment, the processor when executing the program may further implement the following steps: storing the prompt information acquired according to the target matching result and the corresponding error information into a supplementary library in a correlation manner; when the prompt information matched with the error information does not exist in the standard library, matching the crawled error information with characters in a high-level filter library to obtain a first matching result, wherein the step comprises the following steps of: when the prompt information matched with the error information does not exist in the standard library, matching the crawled error information with the stored error information in the supplementary library; when the crawled error information is successfully matched with the stored error information in the supplement library, acquiring prompt information corresponding to the successfully matched stored error information; returning the prompt information to the webpage; and when the matching of the crawled error information and the stored error information in the supplementary library fails, continuing to match the error information with the characters in the high-level filter library to obtain a first matching result.
In one embodiment, the processor when executing the program may further implement the following steps: and when the first matching result completely contains the second matching result, taking the first matching result as a target matching result.
In one embodiment, the processor when executing the program may further implement the following steps: and when no character exists in the first matching result, taking the second matching result as a target matching result.
In one embodiment, the processor when executing the program may further implement the following steps: when the first matching result has characters, matching each character in the first matching result through each character in the second matching result; when each character in the second matching result has a matched character in the first matching result, returning a result that the first matching result completely contains the second matching result; and when at least one character in the second matching result does not have a matched character in the first matching result, returning a result that the first matching result does not completely contain the second matching result.
For the above definition of the computer device, reference may be made to the above specific definition of the web page error information processing method, which is not described herein again.
With continuing reference to fig. 8, there is also provided a computer readable storage medium having stored thereon a computer program, such as the non-volatile storage medium shown in fig. 8, wherein the program when executed by a processor implements the steps of: crawling error information in a webpage; when prompt information matched with the error information does not exist in the standard library, matching the crawled error information with characters in a high-level filter library to obtain a first matching result; matching the crawled error information with characters in a basic filter library to obtain a second matching result; comparing the first matching result with the second matching result; when the first matching result does not completely contain the second matching result, splicing the first matching result and the second matching result to obtain a target matching result; according to the target matching result, prompt information of the crawled error information is obtained; and returning the prompt information to the webpage.
In one embodiment, the program when executed by the processor further performs the steps of: receiving an adjusting instruction aiming at a target matching result; when the adjustment instruction indicates confirmation, acquiring prompt information of the crawled error information according to a target matching result; and when the adjusting instruction indicates adjustment, adjusting the target matching result according to the adjusting instruction, and acquiring prompt information of the crawled error information according to the adjusted target matching result.
In one embodiment, the program when executed by the processor further performs the steps of: storing the prompt information acquired according to the target matching result and the corresponding error information into a supplementary library in a correlation manner; when the prompt information matched with the error information does not exist in the standard library, matching the crawled error information with characters in a high-level filter library to obtain a first matching result, wherein the step comprises the following steps of: when the prompt information matched with the error information does not exist in the standard library, matching the crawled error information with the stored error information in the supplementary library; when the crawled error information is successfully matched with the stored error information in the supplement library, acquiring prompt information corresponding to the successfully matched stored error information; returning the prompt information to the webpage; and when the matching of the crawled error information and the stored error information in the supplementary library fails, continuing to match the error information with the characters in the high-level filter library to obtain a first matching result.
In one embodiment, the program when executed by the processor further performs the steps of: and when the first matching result completely contains the second matching result, taking the first matching result as a target matching result.
In one embodiment, the program when executed by the processor further performs the steps of: and when no character exists in the first matching result, taking the second matching result as a target matching result.
In one embodiment, the program when executed by the processor further performs the steps of: when the first matching result has characters, matching each character in the first matching result through each character in the second matching result; when each character in the second matching result has a matched character in the first matching result, returning a result that the first matching result completely contains the second matching result; and when at least one character in the second matching result does not have a matched character in the first matching result, returning a result that the first matching result does not completely contain the second matching result.
For the above definition of the computer-readable storage medium, reference may be made to the above specific definition of the web page error information processing method, which is not described herein again.
It will be understood by those skilled in the art that all or part of the processes of the methods of the embodiments described above can be implemented by hardware instructions of a computer program, which can be stored in a non-volatile computer-readable storage medium, and when executed, can include the processes of the embodiments of the methods described above. The storage medium may be a magnetic disk, an optical disk, a Read-Only Memory (ROM), or the like.
The technical features of the embodiments described above may be arbitrarily combined, and for the sake of brevity, all possible combinations of the technical features in the embodiments described above are not described, but should be considered as being within the scope of the present specification as long as there is no contradiction between the combinations of the technical features.
The above-mentioned embodiments only express several embodiments of the present invention, and the description thereof is more specific and detailed, but not construed as limiting the scope of the invention. It should be noted that, for a person skilled in the art, several variations and modifications can be made without departing from the inventive concept, which falls within the scope of the present invention. Therefore, the protection scope of the present patent shall be subject to the appended claims.