CN107704484B - Webpage error information processing method and device, computer equipment and storage medium - Google Patents

Webpage error information processing method and device, computer equipment and storage medium Download PDF

Info

Publication number
CN107704484B
CN107704484B CN201710614416.6A CN201710614416A CN107704484B CN 107704484 B CN107704484 B CN 107704484B CN 201710614416 A CN201710614416 A CN 201710614416A CN 107704484 B CN107704484 B CN 107704484B
Authority
CN
China
Prior art keywords
matching result
error information
matching
crawled
library
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201710614416.6A
Other languages
Chinese (zh)
Other versions
CN107704484A (en
Inventor
李武奇
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
OneConnect Smart Technology Co Ltd
Original Assignee
OneConnect Financial Technology Co Ltd Shanghai
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by OneConnect Financial Technology Co Ltd Shanghai filed Critical OneConnect Financial Technology Co Ltd Shanghai
Priority to CN201710614416.6A priority Critical patent/CN107704484B/en
Publication of CN107704484A publication Critical patent/CN107704484A/en
Priority to PCT/CN2018/080139 priority patent/WO2019019674A1/en
Application granted granted Critical
Publication of CN107704484B publication Critical patent/CN107704484B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/958Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking

Abstract

The invention relates to a method, a device, computer equipment and a storage medium for processing webpage error information, wherein the method comprises the steps of crawling error information in a webpage; when the prompt information matched with the error information does not exist in the standard library, matching the crawled error information with characters in a high-level filter library to obtain a first matching result, and matching the crawled error information with characters in a basic filter library to obtain a second matching result; when the first matching result does not completely contain the second matching result, splicing the first matching result and the second matching result to obtain a target matching result; according to the target matching result, prompt information of the crawled error information is obtained; and returning the prompt information to the webpage. According to the webpage error information processing method, the device, the computer equipment and the storage medium, when new error information exists in the webpage, the target matching result of the error information is determined through the high-level filter library and the basic filter library, offline repair is not required, the process of testing a research and development version is not required, and time is saved.

Description

Webpage error information processing method and device, computer equipment and storage medium
Technical Field
The present invention relates to the field of computers, and in particular, to a method and an apparatus for processing web page error information, a computer device, and a storage medium.
Background
The special crawler parsing system is a crawler parsing system which is dependent on a page structure to be crawled, and the change of the page structure or the replacement of the page can cause the failure of parsing or crawling. For the special crawler analysis system, when a website is modified or a novel account type appears on line, some new unknown pages and new unknown errors occur, so that the usability of the network crawling system is reduced, and the user experience is influenced. Conventionally, to deal with these new issues, the flow of offline repair and testing of the development version is often adopted, which is time-consuming.
Disclosure of Invention
Based on this, it is necessary to provide a method and an apparatus for processing web page error information, a computer device, and a storage medium, for solving the problem that the offline repair processing of new error information of web pages takes a long time.
A method for processing webpage error information, the method comprising:
crawling error information in a webpage;
when prompt information matched with the error information does not exist in the standard library, matching the crawled error information with characters in a high-level filter library to obtain a first matching result;
matching the crawled error information with characters in a basic filter library to obtain a second matching result;
comparing the first matching result with the second matching result;
when the first matching result does not completely contain the second matching result, splicing the first matching result and the second matching result to obtain a target matching result;
acquiring prompt information of the crawled error information according to the target matching result;
and returning the prompt information to the webpage.
In one embodiment, the step of obtaining the prompt information of the crawled error information according to the target matching result includes:
receiving an adjustment instruction aiming at the target matching result;
when the adjusting instruction represents confirmation, acquiring prompt information of the crawled error information according to the target matching result;
and when the adjusting instruction represents adjustment, adjusting the target matching result according to the adjusting instruction, and acquiring prompt information of the crawled error information according to the adjusted target matching result.
In one embodiment, the method further comprises:
storing the prompt information obtained according to the target matching result and the corresponding error information in a supplementary library in a correlation manner;
when the standard library does not have prompt information matched with the error information, matching the crawled error information with characters in a high-level filter library to obtain a first matching result, wherein the step comprises the following steps of:
when prompt information matched with the error information does not exist in the standard library, matching the crawled error information with the stored error information in the supplementary library;
when the crawled error information is successfully matched with the stored error information in the supplementary library, acquiring prompt information corresponding to the successfully matched stored error information;
returning the prompt information to the webpage;
and when the matching of the crawled error information and the stored error information in the supplementary library fails, continuing to match the error information with characters in a high-level filter library to obtain a first matching result.
In one embodiment, the method further comprises:
and when the first matching result completely comprises the second matching result, taking the first matching result as the target matching result.
In one embodiment, the method further comprises:
and when no character exists in the first matching result, taking the second matching result as the target matching result.
In one embodiment, the step of comparing the first matching result with the second matching result comprises:
when characters exist in the first matching result, matching each character in the first matching result through each character in the second matching result;
when each character in the second matching result has a matched character in the first matching result, returning a result that the first matching result completely contains the second matching result;
and when at least one character in the second matching result does not have a matched character in the first matching result, returning a result that the first matching result does not completely contain the second matching result.
A web page error information processing apparatus, the apparatus comprising:
the crawling module is used for crawling error information in the webpage;
the matching module is used for matching the crawled error information with characters in a high-level filter library to obtain a first matching result when prompt information matched with the error information does not exist in the standard library; matching the crawled error information with characters in a basic filter library to obtain a second matching result;
a comparison module for comparing the first matching result with the second matching result; when the first matching result does not completely contain the second matching result, splicing the first matching result and the second matching result to obtain a target matching result;
the acquisition module is used for acquiring prompt information of the crawled error information according to the target matching result;
and the return module is used for returning the prompt information to the webpage.
In one embodiment, the obtaining module includes:
a receiving unit, configured to receive an adjustment instruction for the target matching result;
the processing unit is used for acquiring prompt information of the crawled error information according to the target matching result when the adjusting instruction indicates confirmation; and when the adjusting instruction represents adjustment, adjusting the target matching result according to the adjusting instruction, and acquiring prompt information of the crawled error information according to the adjusted target matching result.
A computer device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, the processor implementing the following steps when executing the program:
crawling error information in a webpage;
when prompt information matched with the error information does not exist in the standard library, matching the crawled error information with characters in a high-level filter library to obtain a first matching result;
matching the crawled error information with characters in a basic filter library to obtain a second matching result;
comparing the first matching result with the second matching result;
when the first matching result does not completely contain the second matching result, splicing the first matching result and the second matching result to obtain a target matching result;
acquiring prompt information of the crawled error information according to the target matching result;
and returning the prompt information to the webpage.
A computer-readable storage medium, on which a computer program is stored which, when being executed by a processor, carries out the steps of:
crawling error information in a webpage;
when prompt information matched with the error information does not exist in the standard library, matching the crawled error information with characters in a high-level filter library to obtain a first matching result;
matching the crawled error information with characters in a basic filter library to obtain a second matching result;
comparing the first matching result with the second matching result;
when the first matching result does not completely contain the second matching result, splicing the first matching result and the second matching result to obtain a target matching result;
acquiring prompt information of the crawled error information according to the target matching result;
and returning the prompt information to the webpage.
When the error information exists in the webpage and the standard library does not have the prompt information corresponding to the error information, namely the error information is new error information, the error information is matched through characters in the high-level filter library and characters in the basic filter library to determine the target matching result of the error information, so that the prompt information aiming at the error information can be obtained, after the prompt information is returned to the webpage to be displayed, a user can correspondingly modify according to the prompt information to continue webpage operation, wherein the target matching result of the error information determined through the high-level filter library can improve the matching accuracy, and the target matching result of the error information determined through the basic filter library can improve the universal applicability of the special crawler analysis system, therefore, when new error information exists in the webpage, the target matching result of the error information can be determined through the high-level filter library and the basic filter library, offline repair is not required, the process of researching and developing versions is tested, and time is saved.
Drawings
FIG. 1 is a schematic diagram of a system for handling web page error messages in an embodiment;
FIG. 2 is a flowchart illustrating a method for processing error messages in a web page according to an embodiment;
FIG. 3 is a flowchart of step S212 of the embodiment shown in FIG. 2;
FIG. 4 is a diagram illustrating a target matching result in one embodiment;
FIG. 5 is a flowchart of step S204 in the embodiment of FIG. 2;
FIG. 6 is a flowchart of step S208 in the embodiment shown in FIG. 2;
FIG. 7 is a block diagram of an embodiment of a web page fault message handling apparatus;
fig. 8 is a schematic structural diagram of a dedicated crawler parsing terminal in an embodiment.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.
Before describing in detail embodiments that are in accordance with the present invention, it should be observed that the embodiments reside primarily in combinations of steps and system components related to web page error information processing methods, apparatus, computer devices, and storage media. Accordingly, the system components and method steps have been represented where appropriate by conventional symbols in the drawings, showing only those details that are pertinent to understanding the embodiments of the present invention so as not to obscure the disclosure with details that will be readily apparent to those of ordinary skill in the art having the benefit of the description herein.
In this document, relational terms such as left and right, top and bottom, front and back, first and second, and the like may be used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions. The terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus.
Referring to fig. 1, fig. 1 is a schematic structural diagram of a web page error information processing system in an embodiment, the web page error information processing system includes a dedicated crawler parsing terminal and a plurality of user terminals, wherein the dedicated crawler parsing terminal and the plurality of user terminals can communicate, the dedicated crawler parsing terminal can crawl error information on the plurality of user terminals, for example, when the user terminal is at a login interface, a user inputs a user name and a password to the user terminal, when the password is input incorrectly, an interface of the user terminal displays a word of initial information "password error", the dedicated crawler parsing terminal crawls the word of initial information "password error" displayed on the interface of the user terminal at the time, then obtains a prompt message "input password does not match correct password" corresponding to "password error", and please re-input ", and finally the dedicated crawler parsing terminal does not match the prompt message" input password with correct password ", please re-input "return to the user terminal for display.
Referring to fig. 2, fig. 2 is a flowchart illustrating a method for processing a web page error message according to an embodiment, which is applied to a dedicated crawler parsing terminal in the web page error message processing system shown in fig. 1. The special crawler analysis terminal runs a webpage error information processing program, and the webpage error information processing method is implemented through the special crawler analysis terminal control program. The method specifically comprises the following steps:
s202: and crawling error information in the webpage.
Specifically, the error information is information for prompting that there is an error in the user operation, for example, "user name input error" is generated when the user logs in a certain website or client, or the data format input by the user is wrong when the user fills in a certain form, for example, the format should be "year-month-day" when the user fills in a date, but the format input by the user is "month-day-year", and the screen of the user terminal displays the error information of "format error". The web page herein refers to a broad web page, including a web page displayed on a browser interface, and a web page displayed on a client app (application).
When error information exists in the webpage, the special crawler analysis terminal can crawl the error information displayed in the webpage so as to perform subsequent analysis work, and therefore unified prompt information can be output on the webpage, a user can accurately know the place needing to be changed, and how to operate the webpage next.
S204: and when the prompt information matched with the error information does not exist in the standard library, matching the crawled error information with the characters in the high-level filter library to obtain a first matching result.
Specifically, the standard library refers to a database storing matching relationships between error information and prompt information, and is generally obtained by sorting the existing error information and prompt information, that is, a basic library of error information and prompt information. The prompt information of the corresponding error information can be obtained through the standard library aiming at the error information in different webpages, so that the prompt information displayed by all the webpages aiming at the same error is consistent no matter what webpage a user logs in, the user can modify the error and continue subsequent operation, and the situation that the user does not know how to perform the next operation due to the fact that different webpages generate various prompt information aiming at the same error is prevented.
The characters in the high-level filter library are mainly complex characters, for example, the characters can be phrases or short sentences, such as "account exception", "password reset", "control cannot be displayed", and the like, and the matching precision can be improved due to the existence of the high-level filter library.
When the prompt information matched with the error information exists in the standard library, namely the error information is already in the standard library and the corresponding prompt information exists, the special crawler analysis terminal sends the prompt information to the user terminal for displaying, so that the user can know how to proceed on the next step, and the user can conveniently continue to perform subsequent operations.
When the standard library does not have prompt information matched with the error information, namely the error information is a new error and does not have corresponding standard prompt information, the special crawler analysis terminal needs to determine the prompt information of the error information, namely the error information is matched with characters in the high-level filter library, and when the characters corresponding to the error information exist in the high-level filter library, the error information is stored in a first matching result.
S206: and matching the crawled error information with the characters in the basic filter library to obtain a second matching result.
Specifically, the characters in the basic filter library are mainly simple characters, and may be, for example, a single word or phrase, such as "online banking," "client," "control," "reset," "exception," "unable to display," "page," and the like, and the existence of the basic filter library may improve the general applicability of the special crawler parsing system.
When the standard library does not have prompt information matched with the error information, namely the error information is a new error and does not have corresponding standard prompt information, the special crawler analysis terminal needs to determine the prompt information of the error information, namely the error information is matched with characters in the basic filter library, and when the characters corresponding to the error information exist in the basic filter library, the special crawler analysis terminal stores the prompt information in a second matching result. Optionally, the error information may be matched with the characters in the high-level filter library first, or may be matched with the characters in the basic filter library first, or may be matched with the characters in the high-level filter library and the basic filter library simultaneously through two threads.
S208: the first matching result is compared with the second matching result.
Specifically, since the first matching result is a result of matching the error information with the characters in the higher-level filter library and the second matching result is a result of matching the error information with the characters in the base filter library, there may be a difference between the first matching result and the second matching result, and thus the first matching result is compared with the second matching result to obtain an accurate target matching result.
S210: and when the first matching result does not completely contain the second matching result, splicing the first matching result and the second matching result to obtain a target matching result.
Specifically, when there is an intersection between the first matching result and the second matching result, but there are different vocabularies in the first matching result and the second matching result, or when there is no intersection between the first matching result and the second matching result, but there are different vocabularies in the first matching result and the second matching result, it indicates that the first matching result does not completely include the second matching result, so that the target matching result can be obtained by splicing the first matching result and the second matching result, that is, the aggregate of the first matching result and the second matching result is obtained as the target matching result, thereby ensuring the integrity of the final target matching result.
For example, when the error message is "account exception, please go to counter reset", for example, the basic filter library is simple characters such as "online banking", "customer", "control", "reset", "exception", "no display", "page". The high-level filter library is mainly complex characters such as 'account abnormity', 'password reset', and 'control cannot be displayed'. The first matching result obtained by the high-level filter library is 'account abnormal', and the second matching result obtained by the basic filter library is 'abnormal' and 'reset'. Therefore, the first matching result does not completely contain the second matching result, and the target matching result is a collection of the first matching result and the second matching result, namely, the account is abnormal and the target matching result is reset.
S212: and acquiring prompt information of the crawled error information according to the target matching result.
Specifically, the prompt information corresponding to the error information may be obtained through the target matching result, for example, a matching code corresponding to the target matching result may be obtained first through the target matching result, and then the corresponding prompt information may be obtained according to the matching code, where the matching code is set to classify the prompt information, so as to facilitate query of the prompt information, for example, the matching code beginning with "1" indicates a password error, the matching code beginning with "2" indicates a verification code error, the matching code beginning with "4" indicates an "account error", and the like, where the matching code beginning with "4" 4217 "indicates the prompt information" requiring setting account information again "corresponding to the error of the" 217 "type under the" account error "type. For example, in the above embodiment, the target matching result is "account exception" or "reset", the matching code corresponding to the queried target matching result should be 4217, and the corresponding prompt message is "account information needs to be reset".
S214: and returning the prompt information to the webpage.
Specifically, after the corresponding prompt information is acquired, the prompt information may be sent to the web page, so that the web page may display the prompt information, so that the user can know how to perform the next operation, for example, when the prompt information displayed on the web page is "account information needs to be reset", the user resets the account information, and then the next operation may be performed.
When the error information exists in the webpage and the standard library has no prompt information corresponding to the error information, namely the error information is new error information, the error information is matched through characters in the high-level filter library and characters in the basic filter library to determine a target matching result of the error information, so that the prompt information aiming at the error information can be obtained, after the prompt information is returned to the webpage to be displayed, a user can correspondingly modify according to the prompt information to continue webpage operation, wherein the target matching result of the error information determined through the high-level filter library can improve the matching accuracy, the target matching result of the error information determined through the basic filter library can improve the universal applicability of the special crawler analysis system, so that when the new error information exists in the webpage, the target matching result of the error information can be determined through the high-level filter library and the basic filter library, offline repair is not required, the process of research and development version testing is carried out, and time is saved.
In one embodiment, please refer to fig. 3, fig. 3 is a flowchart of step S212 of the embodiment shown in fig. 2, and the step S212 of obtaining the prompt information of the crawled error information according to the target matching result may include:
s302: and receiving an adjusting instruction aiming at the target matching result.
Specifically, the adjustment instruction for the target matching result refers to an adjustment instruction input by a worker to the dedicated crawler analysis terminal, and the instruction may indicate confirmation, that is, accepting the target matching result obtained by the dedicated crawler analysis terminal, or may indicate adjustment, that is, adjusting the target matching result obtained by the dedicated crawler analysis terminal, and may also indicate withdrawal, that is, rejecting the target matching result obtained by the dedicated crawler analysis terminal.
S304: and when the adjustment instruction indicates confirmation, acquiring prompt information of the crawled error information according to the target matching result.
Specifically, when the adjustment instruction indicates confirmation, the prompt information corresponding to the error information may be obtained through the target matching result, which may specifically refer to the above definition, and is not described herein again.
S306: and when the adjusting instruction indicates adjustment, adjusting the target matching result according to the adjusting instruction, and acquiring prompt information of the crawled error information according to the adjusted target matching result.
Specifically, when the adjustment instruction indicates adjustment, that is, the target matching result given by the special crawler analysis terminal has an error, the staff adjusts the target matching result. Referring to fig. 4, fig. 4 is a schematic diagram of a target matching result in an embodiment, where the target matching result includes original error information "account is abnormal, please go to counter to reset", a matching code "4217", and target matching result "account is abnormal", "reset", and a "confirm", "adjust", and "withdraw" button, so that a worker may input an adjustment instruction through the confirm "," adjust ", and" withdraw "buttons, for example, when the worker clicks" adjust ", the worker may adjust the target matching result" account is abnormal "," reset ", for example, a" counter "character is added, so that the adjusted target matching result is" account is abnormal "," counter ", and" reset ". The special crawler analysis terminal obtains a corresponding matching code, such as 4218, according to the adjusted target matching result, namely ' account abnormity ', ' counter ' and ' reset ', and then obtains corresponding prompt information ' account information needing to be reset in the counter ' according to the matching code '. And finally, the special crawler analysis terminal returns the prompt message 'account information needs to be reset to the counter' to the webpage, so that the webpage can display the prompt message, and a user can know how to perform the next operation, for example, when the prompt message displayed on the webpage is 'account information needs to be reset to the counter', the user can perform the next operation by resetting the account information to the counter.
In the above embodiment, the adjustment instruction of the worker is quoted, and the target matching result obtained by the special crawler analysis terminal aiming at the new error automatic matching is adjusted, so that the target matching result is more accurate, and the adaptability of the special crawler analysis terminal is stronger.
In one embodiment, the error information processing method may further include the step of storing the prompt information acquired according to the target matching result and the corresponding error information in the supplementary library in an associated manner. The prompt information matched for the new errors is stored in the supplement library in an associated mode through the special crawler analysis terminal, and when the target matching results of the new errors are adjusted, the prompt information corresponding to the adjusted target matching results and the corresponding error information are stored in the supplement library in an associated mode.
Referring to fig. 5, fig. 5 is a flowchart of step S204 in the embodiment shown in fig. 2, in this embodiment, the step S204, when there is no prompt information matching the error information in the standard library, matching the crawled error information with the characters in the high-level filter library to obtain a first matching result, may include:
s502: and when the prompt information matched with the error information does not exist in the standard library, matching the crawled error information with the error information stored in the supplementary library.
Specifically, when the standard library does not have the prompt information matched with the error information, that is, the error information is a new issue error, and there is no corresponding standard prompt information, but because the special crawler analysis terminal has matched the prompt information for some new issue errors, and stores the prompt information and the new issue error in the supplement library, in order to save time, the special crawler analysis terminal firstly queries the supplement library according to the error information, that is, when the error information exists in the supplement library, the special crawler analysis terminal directly obtains the corresponding prompt information according to the supplement library, and does not need to match the error information through the high-level filter library and the basic filter library to obtain a target matching result, and time is saved in a complicated step of obtaining the corresponding prompt information according to the target matching result.
S504: and when the crawled error information is successfully matched with the stored error information in the supplementary library, acquiring prompt information corresponding to the successfully matched stored error information.
Specifically, the new error is already stored in the supplement library, so that a complicated step of obtaining corresponding prompt information according to a target matching result is no longer needed by matching the error information through a high-level filter library and a basic filter library to obtain the target matching result, but prompt information associated with the crawled error information in the supplement library is directly obtained.
For example, the last new error is "account exception, please reset counter", the matched prompt message of the special crawler analysis terminal is "account information needs to be reset", the new error stored in the supplement library after the adjustment of the staff is "account exception, please reset counter", and the prompt message is "account information needs to be reset counter". When newly-sent errors, namely account abnormity and counter reset, are stored, adjustment is not needed through the high-level filter library, the basic filter library and workers, only the prompt message of 'account abnormity and counter reset' in the supplement library needs to be inquired, namely account information needs to be reset at the counter, and time is saved.
S506: and returning the prompt information to the webpage.
Specifically, after the corresponding prompt information is queried in the supplementary library, the prompt information is directly returned to the webpage, so that the webpage can display the prompt information, and the user can know how to perform the next operation, for example, when the prompt information displayed on the webpage is "account information needs to be reset at counter", the user resets the account information, and then the next operation can be performed.
S508: and when the matching of the crawled error information and the stored error information in the supplementary library fails, continuing to match the error information with the characters in the high-level filter library to obtain a first matching result.
Specifically, when the matching between the crawled error information and the stored error information in the supplement library fails, that is, the error information is not stored in the standard library and the supplement library, that is, the error information is a new error, it is necessary to match appropriate prompt information for the error information through the high-level filter library and the basic filter library.
In the embodiment, after the prompt information is matched for the error information through the high-level filter library and the basic filter library, the prompt information and the corresponding error information are stored in the supplement library in an associated manner, and after a worker adjusts the prompt information, the adjusted prompt information and the corresponding error information are stored in the supplement library in an associated manner, so that when a new error occurs in a webpage, the prompt information matched with the new error is firstly obtained according to the supplement library without being adjusted through the high-level filter library, the basic filter library and the worker, and the time is saved.
In one embodiment, when the first matching result completely contains the second matching result, the first matching result is taken as the target matching result. In one embodiment, when no character exists in the first matching result, the second matching result is used as the target matching result.
Specifically, complex characters are stored in the high-level filter library, and the matched result is more accurate. For example, the original information is input as < frame name ═ result "> < h2class ═ age ═ version ═ visibility: none" >, an error prompt </h2> < br class ═ error flag "> user account exception, please reset the password </br > < td > </frame >. And matching the result of 'account' & 'abnormity' & 'password' & 'setting out by using a basic filter library, and matching and outputting the result of' account abnormity '&' resetting '&' setting password by using a high-level filter library. The matching result of the basic filter library is 'account abnormity, and password is set'. Matching results of the high-level filter library: "account exception, reset password". At this time, the matching result of the high-level filter bank includes the matching result of the basic filter bank. The matching results of the high-level filter bank are employed. The final target matching result is 'account abnormal, reset password'. If the matching result of the basic filter library is not completely contained by the matching result of the advanced filter library, the excessive parts of the matching result of the basic filter library and the matching result of the advanced filter library are combined for use.
Although the high-level filter library has high precision, the complex vocabulary group is easy to cause unsuccessful matching due to the variation of individual words. That is, there is no character in the first matching result, for example, when the input information is "account status is abnormal, please go to counter to set new password", the high-level filter library "account abnormal" and "set password" will both fail to match. The basic filter bank is necessary to use at this time. The output "account" & "exception" & "setting" & "password" may be obtained using the base filter bank. The use of a base filter increases the universality of the system.
Words of the high-layer filter library and the basic filter library can be changed, namely, added, replaced and deleted, so that the adaptability of the filter is improved. For example, the number of times of outputting the statistical error information may be acquired, and when the number of times reaches a certain value, the prompt information corresponding to the error information is added to the high-level filter library.
In the above embodiment, the first matching result is an accurate result obtained by matching through the high-level filter library, the second matching result is a general result obtained by matching through the basic filter library, and the final target matching result is determined by judging the relationship between the first matching result and the second matching result, so that the accuracy of the target matching result is improved.
Referring to fig. 6, fig. 6 is a flowchart of step S208 in the embodiment shown in fig. 2, and the step S208 of comparing the first matching result with the second matching result may include:
s602: and when the characters exist in the first matching result, matching each character in the first matching result through each character in the second matching result.
Specifically, when there is a character in the first matching result, i.e., there is a corresponding matching result in the first matching pass, for example, when the error information is 'user account abnormal, please reset password', the first matching result matched by the high-level filter library is 'account abnormal, reset password', the second matching result matched by the basic filter library is 'account abnormity, password setting', each character in the first matching result is demarked by each character in the second matching result, for example, the "account" in the second matching result exists in the first matching result, the "exception" in the second matching result also exists in the first matching result, the "setting" in the second matching result also exists in the first matching result, the "password" in the second matching result also exists in the first matching result, but the "new" in the first matching result does not exist in the second matching result.
S604: and when each character in the second matching result has a matched character in the first matching result, returning a result that the first matching result completely contains the second matching result.
Specifically, in the above embodiment, when the first matching result matched by the high-level filter library is identical to the second matching result, or in the above example, the two only difference points are "re" and the "re" character is included in the first matching result, the first matching result is considered to be completely included in the second matching result.
S606: and when at least one character in the second matching result does not have a matched character in the first matching result, returning a result that the first matching result does not completely contain the second matching result.
Specifically, in the above embodiment, when the first matching result matched by the higher filter library is "reset", and the second matching result matched by the base filter library is "account abnormal, reset", that is, the first matching result and the second matching result are only in the same character "reset", and the character "account abnormal" is only present in the second matching result, so that the first matching result does not completely include the second matching result.
In the above embodiment, the first matching result is an accurate result obtained by matching through the high-level filter library, the second matching result is a general result obtained by matching through the basic filter library, and the final target matching result is determined by judging the relationship between the first matching result and the second matching result, so that the accuracy of the target matching result is improved.
Referring to fig. 7, a schematic structural diagram of a web page error information processing apparatus is provided, where the web page error information processing apparatus includes:
and the crawling module 100 is used for crawling error information in the webpage.
The matching module 200 is configured to match the crawled error information with characters in a high-level filter library to obtain a first matching result when prompt information matched with the error information does not exist in the standard library; and matching the crawled error information with the characters in the basic filter library to obtain a second matching result.
A comparison module 300 for comparing the first matching result with the second matching result; and when the first matching result does not completely contain the second matching result, splicing the first matching result and the second matching result to obtain a target matching result.
And an obtaining module 400, configured to obtain prompt information of the crawled error information according to the target matching result.
And a returning module 500, configured to return the prompt message to the web page.
In one embodiment, the obtaining module may include:
and the receiving unit is used for receiving an adjusting instruction aiming at the target matching result.
The processing unit is used for acquiring prompt information of the crawled error information according to a target matching result when the adjustment instruction represents confirmation; and when the adjusting instruction indicates adjustment, adjusting the target matching result according to the adjusting instruction, and acquiring prompt information of the crawled error information according to the adjusted target matching result.
In one embodiment, the apparatus may further include:
and the supplement module is used for storing the prompt information acquired according to the target matching result and the corresponding error information into a supplement library in a correlation manner.
The matching module is also used for matching the crawled error information with the stored error information in the supplementary library when the prompt information matched with the error information does not exist in the standard library.
The acquisition module is further used for acquiring prompt information corresponding to the stored error information which is successfully matched when the crawled error information is successfully matched with the stored error information in the supplementary library.
The return module is also used for returning the prompt message to the webpage.
And the comparison module is also used for continuously matching the error information with the characters in the high-level filter library to obtain a first matching result when the matching of the crawled error information and the stored error information in the supplement library fails.
In one embodiment, the comparison module may be further configured to take the first matching result as the target matching result when the first matching result completely contains the second matching result.
In one embodiment, the comparison module may be further configured to take the second matching result as the target matching result when no character exists in the first matching result.
In one embodiment, the comparison module may include:
and the matching unit is used for matching each character in the first matching result through each character in the second matching result when the character exists in the first matching result.
The output unit is used for returning the result that the first matching result completely contains the second matching result when each character in the second matching result has a matched character in the first matching result; and when at least one character in the second matching result does not have a matched character in the first matching result, returning a result that the first matching result does not completely contain the second matching result.
For the above limitation on the web page error information processing apparatus, reference may be made to the above specific limitation on the web page error information processing method, which is not described herein again.
Referring to fig. 8, fig. 8 is a schematic structural diagram of a dedicated crawler parsing terminal in an embodiment, where the dedicated crawler parsing terminal may be a conventional server or any other suitable computer device, and includes a memory, a processor, and a computer program stored in the memory and executable on the processor, where the memory may include a nonvolatile storage medium and an internal memory, and the computer program may be stored in the nonvolatile storage medium, and when the processor executes the program, the processor implements the following steps: crawling error information in a webpage; when prompt information matched with the error information does not exist in the standard library, matching the crawled error information with characters in a high-level filter library to obtain a first matching result; matching the crawled error information with characters in a basic filter library to obtain a second matching result; comparing the first matching result with the second matching result; when the first matching result does not completely contain the second matching result, splicing the first matching result and the second matching result to obtain a target matching result; according to the target matching result, prompt information of the crawled error information is obtained; and returning the prompt information to the webpage.
In one embodiment, the processor when executing the program may further implement the following steps: receiving an adjusting instruction aiming at a target matching result; when the adjustment instruction indicates confirmation, acquiring prompt information of the crawled error information according to a target matching result; and when the adjusting instruction indicates adjustment, adjusting the target matching result according to the adjusting instruction, and acquiring prompt information of the crawled error information according to the adjusted target matching result.
In one embodiment, the processor when executing the program may further implement the following steps: storing the prompt information acquired according to the target matching result and the corresponding error information into a supplementary library in a correlation manner; when the prompt information matched with the error information does not exist in the standard library, matching the crawled error information with characters in a high-level filter library to obtain a first matching result, wherein the step comprises the following steps of: when the prompt information matched with the error information does not exist in the standard library, matching the crawled error information with the stored error information in the supplementary library; when the crawled error information is successfully matched with the stored error information in the supplement library, acquiring prompt information corresponding to the successfully matched stored error information; returning the prompt information to the webpage; and when the matching of the crawled error information and the stored error information in the supplementary library fails, continuing to match the error information with the characters in the high-level filter library to obtain a first matching result.
In one embodiment, the processor when executing the program may further implement the following steps: and when the first matching result completely contains the second matching result, taking the first matching result as a target matching result.
In one embodiment, the processor when executing the program may further implement the following steps: and when no character exists in the first matching result, taking the second matching result as a target matching result.
In one embodiment, the processor when executing the program may further implement the following steps: when the first matching result has characters, matching each character in the first matching result through each character in the second matching result; when each character in the second matching result has a matched character in the first matching result, returning a result that the first matching result completely contains the second matching result; and when at least one character in the second matching result does not have a matched character in the first matching result, returning a result that the first matching result does not completely contain the second matching result.
For the above definition of the computer device, reference may be made to the above specific definition of the web page error information processing method, which is not described herein again.
With continuing reference to fig. 8, there is also provided a computer readable storage medium having stored thereon a computer program, such as the non-volatile storage medium shown in fig. 8, wherein the program when executed by a processor implements the steps of: crawling error information in a webpage; when prompt information matched with the error information does not exist in the standard library, matching the crawled error information with characters in a high-level filter library to obtain a first matching result; matching the crawled error information with characters in a basic filter library to obtain a second matching result; comparing the first matching result with the second matching result; when the first matching result does not completely contain the second matching result, splicing the first matching result and the second matching result to obtain a target matching result; according to the target matching result, prompt information of the crawled error information is obtained; and returning the prompt information to the webpage.
In one embodiment, the program when executed by the processor further performs the steps of: receiving an adjusting instruction aiming at a target matching result; when the adjustment instruction indicates confirmation, acquiring prompt information of the crawled error information according to a target matching result; and when the adjusting instruction indicates adjustment, adjusting the target matching result according to the adjusting instruction, and acquiring prompt information of the crawled error information according to the adjusted target matching result.
In one embodiment, the program when executed by the processor further performs the steps of: storing the prompt information acquired according to the target matching result and the corresponding error information into a supplementary library in a correlation manner; when the prompt information matched with the error information does not exist in the standard library, matching the crawled error information with characters in a high-level filter library to obtain a first matching result, wherein the step comprises the following steps of: when the prompt information matched with the error information does not exist in the standard library, matching the crawled error information with the stored error information in the supplementary library; when the crawled error information is successfully matched with the stored error information in the supplement library, acquiring prompt information corresponding to the successfully matched stored error information; returning the prompt information to the webpage; and when the matching of the crawled error information and the stored error information in the supplementary library fails, continuing to match the error information with the characters in the high-level filter library to obtain a first matching result.
In one embodiment, the program when executed by the processor further performs the steps of: and when the first matching result completely contains the second matching result, taking the first matching result as a target matching result.
In one embodiment, the program when executed by the processor further performs the steps of: and when no character exists in the first matching result, taking the second matching result as a target matching result.
In one embodiment, the program when executed by the processor further performs the steps of: when the first matching result has characters, matching each character in the first matching result through each character in the second matching result; when each character in the second matching result has a matched character in the first matching result, returning a result that the first matching result completely contains the second matching result; and when at least one character in the second matching result does not have a matched character in the first matching result, returning a result that the first matching result does not completely contain the second matching result.
For the above definition of the computer-readable storage medium, reference may be made to the above specific definition of the web page error information processing method, which is not described herein again.
It will be understood by those skilled in the art that all or part of the processes of the methods of the embodiments described above can be implemented by hardware instructions of a computer program, which can be stored in a non-volatile computer-readable storage medium, and when executed, can include the processes of the embodiments of the methods described above. The storage medium may be a magnetic disk, an optical disk, a Read-Only Memory (ROM), or the like.
The technical features of the embodiments described above may be arbitrarily combined, and for the sake of brevity, all possible combinations of the technical features in the embodiments described above are not described, but should be considered as being within the scope of the present specification as long as there is no contradiction between the combinations of the technical features.
The above-mentioned embodiments only express several embodiments of the present invention, and the description thereof is more specific and detailed, but not construed as limiting the scope of the invention. It should be noted that, for a person skilled in the art, several variations and modifications can be made without departing from the inventive concept, which falls within the scope of the present invention. Therefore, the protection scope of the present patent shall be subject to the appended claims.

Claims (10)

1. A method for processing webpage error information is characterized by comprising the following steps:
crawling error information in a webpage;
when prompt information matched with the error information does not exist in the standard library, matching the crawled error information with characters in a high-level filter library to obtain a first matching result;
matching the crawled error information with characters in a basic filter library to obtain a second matching result;
comparing the first matching result with the second matching result;
when the first matching result does not completely contain the second matching result, splicing the first matching result and the second matching result to obtain a target matching result;
acquiring prompt information of the crawled error information according to the target matching result;
and returning the prompt information to the webpage.
2. The method according to claim 1, wherein the step of obtaining the prompt message of the crawled error message according to the target matching result comprises:
receiving an adjustment instruction aiming at the target matching result;
when the adjusting instruction represents confirmation, acquiring prompt information of the crawled error information according to the target matching result;
and when the adjusting instruction represents adjustment, adjusting the target matching result according to the adjusting instruction, and acquiring prompt information of the crawled error information according to the adjusted target matching result.
3. The method of claim 2, further comprising:
storing the prompt information obtained according to the target matching result and the corresponding error information in a supplementary library in a correlation manner;
when the standard library does not have prompt information matched with the error information, matching the crawled error information with characters in a high-level filter library to obtain a first matching result, wherein the step comprises the following steps of:
when prompt information matched with the error information does not exist in the standard library, matching the crawled error information with the stored error information in the supplementary library;
when the crawled error information is successfully matched with the stored error information in the supplementary library, acquiring prompt information corresponding to the successfully matched stored error information;
returning the prompt information to the webpage;
and when the matching of the crawled error information and the stored error information in the supplementary library fails, continuing to match the error information with characters in a high-level filter library to obtain a first matching result.
4. The method according to any one of claims 1 to 3, further comprising:
and when the first matching result completely comprises the second matching result, taking the first matching result as the target matching result.
5. The method of claim 4, further comprising:
and when no character exists in the first matching result, taking the second matching result as the target matching result.
6. The method of claim 4, wherein the step of comparing the first match result with the second match result comprises:
when characters exist in the first matching result, matching each character in the first matching result through each character in the second matching result;
when each character in the second matching result has a matched character in the first matching result, returning a result that the first matching result completely contains the second matching result;
and when at least one character in the second matching result does not have a matched character in the first matching result, returning a result that the first matching result does not completely contain the second matching result.
7. A web page error information processing apparatus, characterized in that the apparatus comprises:
the crawling module is used for crawling error information in the webpage;
the matching module is used for matching the crawled error information with characters in a high-level filter library to obtain a first matching result when prompt information matched with the error information does not exist in the standard library; matching the crawled error information with characters in a basic filter library to obtain a second matching result;
a comparison module for comparing the first matching result with the second matching result; when the first matching result does not completely contain the second matching result, splicing the first matching result and the second matching result to obtain a target matching result;
the acquisition module is used for acquiring prompt information of the crawled error information according to the target matching result;
and the return module is used for returning the prompt information to the webpage.
8. The apparatus of claim 7, wherein the obtaining module comprises:
a receiving unit, configured to receive an adjustment instruction for the target matching result;
the processing unit is used for acquiring prompt information of the crawled error information according to the target matching result when the adjusting instruction indicates confirmation; and when the adjusting instruction represents adjustment, adjusting the target matching result according to the adjusting instruction, and acquiring prompt information of the crawled error information according to the adjusted target matching result.
9. A computer device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, wherein the processor when executing the program performs the steps of:
crawling error information in a webpage;
when prompt information matched with the error information does not exist in the standard library, matching the crawled error information with characters in a high-level filter library to obtain a first matching result;
matching the crawled error information with characters in a basic filter library to obtain a second matching result;
comparing the first matching result with the second matching result;
when the first matching result does not completely contain the second matching result, splicing the first matching result and the second matching result to obtain a target matching result;
acquiring prompt information of the crawled error information according to the target matching result;
and returning the prompt information to the webpage.
10. A computer-readable storage medium, on which a computer program is stored, which program, when executed by a processor, carries out the steps of:
crawling error information in a webpage;
when prompt information matched with the error information does not exist in the standard library, matching the crawled error information with characters in a high-level filter library to obtain a first matching result;
matching the crawled error information with characters in a basic filter library to obtain a second matching result;
comparing the first matching result with the second matching result;
when the first matching result does not completely contain the second matching result, splicing the first matching result and the second matching result to obtain a target matching result;
acquiring prompt information of the crawled error information according to the target matching result;
and returning the prompt information to the webpage.
CN201710614416.6A 2017-07-25 2017-07-25 Webpage error information processing method and device, computer equipment and storage medium Active CN107704484B (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN201710614416.6A CN107704484B (en) 2017-07-25 2017-07-25 Webpage error information processing method and device, computer equipment and storage medium
PCT/CN2018/080139 WO2019019674A1 (en) 2017-07-25 2018-03-23 Website error information processing method and apparatus, computer device and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710614416.6A CN107704484B (en) 2017-07-25 2017-07-25 Webpage error information processing method and device, computer equipment and storage medium

Publications (2)

Publication Number Publication Date
CN107704484A CN107704484A (en) 2018-02-16
CN107704484B true CN107704484B (en) 2020-06-19

Family

ID=61170778

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710614416.6A Active CN107704484B (en) 2017-07-25 2017-07-25 Webpage error information processing method and device, computer equipment and storage medium

Country Status (2)

Country Link
CN (1) CN107704484B (en)
WO (1) WO2019019674A1 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107704484B (en) * 2017-07-25 2020-06-19 深圳壹账通智能科技有限公司 Webpage error information processing method and device, computer equipment and storage medium

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2011025298A3 (en) * 2009-08-28 2011-06-23 (주)이즈포유 Content search management system using an indicator displayed as a result of a keyword search, and method for same
CN103929440A (en) * 2014-05-09 2014-07-16 国家电网公司 Web page tamper prevention device based on web server cache matching and method thereof
CN104750747A (en) * 2013-12-30 2015-07-01 腾讯科技(深圳)有限公司 Method and system for converting error messages into error prompt

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7886000B1 (en) * 2006-06-27 2011-02-08 Confluence Commons, Inc. Aggregation system for social network sites
US20080147652A1 (en) * 2006-12-14 2008-06-19 Bellsouth Intellectual Property Corporation Physical address verification within electronic documents
CN102306177B (en) * 2011-08-25 2013-06-05 清华大学 Multi-strategy combined ontology or instance matching method
CN105550059A (en) * 2015-12-09 2016-05-04 中国建设银行股份有限公司 Error code conversion method and equipment
CN106648944B (en) * 2016-09-26 2020-04-17 东软集团股份有限公司 Method and device for processing exception
CN107704484B (en) * 2017-07-25 2020-06-19 深圳壹账通智能科技有限公司 Webpage error information processing method and device, computer equipment and storage medium

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2011025298A3 (en) * 2009-08-28 2011-06-23 (주)이즈포유 Content search management system using an indicator displayed as a result of a keyword search, and method for same
CN104750747A (en) * 2013-12-30 2015-07-01 腾讯科技(深圳)有限公司 Method and system for converting error messages into error prompt
CN103929440A (en) * 2014-05-09 2014-07-16 国家电网公司 Web page tamper prevention device based on web server cache matching and method thereof

Also Published As

Publication number Publication date
CN107704484A (en) 2018-02-16
WO2019019674A1 (en) 2019-01-31

Similar Documents

Publication Publication Date Title
KR102151326B1 (en) Data processing method and device
US9218568B2 (en) Disambiguating data using contextual and historical information
CN109473093B (en) Speech recognition method, device, computer equipment and storage medium
CN103888254A (en) Network information verification method and apparatus
US20160292143A1 (en) System and method for automated online wizard generation
CN110046155B (en) Method, device and equipment for updating feature database and determining data features
CN109324956B (en) System testing method, apparatus and computer readable storage medium
CN111949607B (en) Method, system and device for monitoring UDT file
US11704302B2 (en) Method, system and apparatus for processing database updates
US20170132462A1 (en) Document checking support apparatus, document checking support system, and non-transitory computer readable medium
CN114048129A (en) Automatic testing method, device, equipment and system for software function change
CN104346035A (en) Indicating automatically corrected words
CN107704484B (en) Webpage error information processing method and device, computer equipment and storage medium
JP2018092361A (en) Test script correction apparatus and test script correction program
US20120310849A1 (en) System and method for validating design of an electronic product
CN113434395A (en) Automatic generation method, device, equipment and medium of test case
WO2021183382A1 (en) Graph-based method for inductive bug localization
CN111309371A (en) Query method and device
US11182375B2 (en) Metadata validation tool
CN111639903A (en) Review processing method for architecture change and related equipment
CN115098362B (en) Page test method, page test device, electronic equipment and storage medium
CN112149402B (en) Document matching method, device, electronic equipment and computer readable storage medium
CN110162461B (en) Data synchronization verification method and device and computer equipment
AU2017202199B1 (en) Software integration testing with unstructured database
JP2018092362A (en) Test script correction apparatus and test script correction program

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right
TA01 Transfer of patent application right

Effective date of registration: 20180605

Address after: 518000 Room 201, building A, No. 1, Qian Wan Road, Qianhai Shenzhen Hong Kong cooperation zone, Shenzhen, Guangdong (Shenzhen Qianhai business secretary Co., Ltd.)

Applicant after: Shenzhen one ledger Intelligent Technology Co., Ltd.

Address before: 200000 Xuhui District, Shanghai Kai Bin Road 166, 9, 10 level.

Applicant before: Shanghai Financial Technologies Ltd

CB02 Change of applicant information
CB02 Change of applicant information

Address after: 518000 Room 201, building A, 1 front Bay Road, Shenzhen Qianhai cooperation zone, Shenzhen, Guangdong

Applicant after: Shenzhen one ledger Intelligent Technology Co., Ltd.

Address before: 518000 Room 201, building A, No. 1, Qian Wan Road, Qianhai Shenzhen Hong Kong cooperation zone, Shenzhen, Guangdong (Shenzhen Qianhai business secretary Co., Ltd.)

Applicant before: Shenzhen one ledger Intelligent Technology Co., Ltd.

REG Reference to a national code

Ref country code: HK

Ref legal event code: DE

Ref document number: 1250812

Country of ref document: HK

GR01 Patent grant
GR01 Patent grant