CN113836459A - Web site page monitoring method, device, equipment and storage medium - Google Patents

Web site page monitoring method, device, equipment and storage medium Download PDF

Info

Publication number
CN113836459A
CN113836459A CN202110927381.8A CN202110927381A CN113836459A CN 113836459 A CN113836459 A CN 113836459A CN 202110927381 A CN202110927381 A CN 202110927381A CN 113836459 A CN113836459 A CN 113836459A
Authority
CN
China
Prior art keywords
page
snapshot
web site
content
path
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202110927381.8A
Other languages
Chinese (zh)
Inventor
陈泰名
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Gaoteng Technology Service Co ltd
Original Assignee
Shenzhen Gaoteng Technology Service Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Gaoteng Technology Service Co ltd filed Critical Shenzhen Gaoteng Technology Service Co ltd
Priority to CN202110927381.8A priority Critical patent/CN113836459A/en
Publication of CN113836459A publication Critical patent/CN113836459A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/958Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/955Retrieval from the web using information identifiers, e.g. uniform resource locators [URL]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F21/00Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
    • G06F21/60Protecting data
    • G06F21/602Providing cryptographic facilities or services

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Health & Medical Sciences (AREA)
  • Bioethics (AREA)
  • General Health & Medical Sciences (AREA)
  • Computer Hardware Design (AREA)
  • Computer Security & Cryptography (AREA)
  • Software Systems (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

The invention relates to the technical field of page monitoring, and discloses a method, a device, equipment and a storage medium for monitoring a web site page. The web site page monitoring method comprises the following steps: reading the page content of the current access web site page and saving the page content as a page snapshot; acquiring a first path directory of the page snapshot and a second path directory of a normal page snapshot; comparing the page snapshot with the normal page snapshot according to the first path catalog and the second path catalog to obtain a comparison result; and if the comparison result is that the directories have differences, triggering a page abnormity alarm. The invention realizes the automatic monitoring of the web site page, saves the labor cost and the time cost and simultaneously improves the accuracy of the automatic monitoring of the page.

Description

Web site page monitoring method, device, equipment and storage medium
Technical Field
The invention relates to the technical field of page monitoring, in particular to a method, a device, equipment and a storage medium for monitoring a web site page.
Background
With the continuous development of internet technology, more and more service providers provide online services for users, and more web pages are generated accordingly. In order to provide stable online service, web site page monitoring is required to deal with problems such as page faults in time.
In the prior art, the processing of the page fault usually depends on feedback after the user finds a problem, and then the worker processes the problem according to the feedback. This approach not only has a long processing cycle, but also results in a poor user experience. The existing web site page monitoring method also has the problem of untimely early warning caused by incomplete monitoring when detecting the page state.
Disclosure of Invention
The invention mainly aims to provide a method, a device, equipment and a storage medium for monitoring a web site page, and aims to solve the technical problem that the existing web site page monitoring method is incomplete in monitoring.
The invention provides a website page monitoring method in a first aspect, which comprises the following steps:
reading the page content of the current access web site page and saving the page content as a page snapshot;
acquiring a first path directory of the page snapshot and a second path directory of a normal page snapshot;
comparing the page snapshot with the normal page snapshot according to the first path catalog and the second path catalog to obtain a comparison result;
and if the comparison result is that the directories have differences, triggering a page abnormity alarm.
Optionally, in a first implementation manner of the first aspect of the present invention, before the reading the page content of the currently visited website page and saving the page content as the page snapshot, the method further includes:
sending an access request to a web server to be accessed, and acquiring a response status code corresponding to the access request;
judging whether the response state code is a preset value or not;
if the preset value is the preset value, determining that the web site page to be accessed is normal, executing the step of reading the page content of the currently accessed web site page and saving the page content as a page snapshot, and otherwise triggering a page abnormity alarm.
Optionally, in a second implementation manner of the first aspect of the present invention, the reading the page content of the currently visited web site page and saving the page content as the page snapshot includes:
reading page content of a current access web site page;
saving the page content as a page snapshot of a corresponding type according to the information type of the page content, and naming by using a page title and a url of the web site page;
and saving the page snapshot corresponding to the web site page to the first path directory, wherein when the homonymy page snapshot exists in the first path directory, the homonymy page snapshot is replaced.
Optionally, in a third implementation manner of the first aspect of the present invention, the saving the page content as the corresponding type of page snapshot according to the information type of the page content includes:
if the information type of the page content is a text type, saving the page content as a text snapshot;
if the information type of the page content is a non-text type, generating an MD5 value of the page content, and storing the MD5 value as a corresponding type of page snapshot.
Optionally, in a fourth implementation manner of the first aspect of the present invention, the page content includes one or more of text, picture, audio, and video; the page snapshot includes one or more of a text snapshot, a picture snapshot, an audio snapshot, and a video snapshot.
Optionally, in a fifth implementation manner of the first aspect of the present invention, the comparing the page snapshot with the normal page snapshot according to the first path directory and the second path directory, and obtaining a comparison result includes:
according to the first path directory and the second path directory, carrying out homonymous page snapshot comparison on the page snapshot and the normal page snapshot to obtain a comparison result;
if the page snapshot is a text snapshot, performing text comparison on the page snapshot and the normal page snapshot; and if the page snapshot is a non-text snapshot, performing MD5 value comparison on the page snapshot and the normal page snapshot.
Optionally, in a sixth implementation manner of the first aspect of the present invention, after comparing the page snapshot with the normal page snapshot according to the first path directory and the second path directory, and obtaining a comparison result, the method further includes:
if the comparison result is that the directories are the same, determining that the currently accessed web site is normal;
and after triggering page abnormity alarm, triggering manual recheck, renaming the second path directory after the manual recheck is passed, copying the first path directory and setting the copied first path directory as the second path directory.
A second aspect of the present invention provides a web site page monitoring apparatus, where the web site page monitoring apparatus includes:
the reading module is used for reading the page content of the current access web site page and saving the page content as a page snapshot;
the acquisition module is used for acquiring a first path directory of the page snapshot and a second path directory of the normal page snapshot;
the comparison module is used for comparing the page snapshot with the normal page snapshot according to the first path catalog and the second path catalog to obtain a comparison result;
and the warning module is used for triggering page abnormity warning if the comparison result is that the directories have differences.
Optionally, in a first implementation manner of the second aspect of the present invention, the website page monitoring apparatus further includes:
the system comprises a preprocessing module, a response state code generation module and a response state code generation module, wherein the preprocessing module is used for sending an access request to a web server to be accessed and acquiring the response state code corresponding to the access request; judging whether the response state code is a preset value or not; if the preset value is the preset value, determining that the web site page to be accessed is normal, executing the step of reading the page content of the currently accessed web site page and saving the page content as a page snapshot, and otherwise triggering a page abnormity alarm.
Optionally, in a second implementation manner of the second aspect of the present invention, the reading module includes:
the page reading unit is used for reading the page content of the current access web site page;
the snapshot storage unit is used for storing the page content into a page snapshot of a corresponding type according to the information type of the page content and naming the page title and url of the web site page; and saving the page snapshot corresponding to the web site page to the first path directory, wherein when the homonymy page snapshot exists in the first path directory, the homonymy page snapshot is replaced.
Optionally, in a third implementation manner of the second aspect of the present invention, the snapshot storage unit is specifically configured to:
if the information type of the page content is a text type, saving the page content as a text snapshot; if the information type of the page content is a non-text type, generating an MD5 value of the page content, and storing the MD5 value as a corresponding type of page snapshot.
Optionally, in a fourth implementation manner of the second aspect of the present invention, the page content includes one or more of text, picture, audio, and video; the page snapshot includes one or more of a text snapshot, a picture snapshot, an audio snapshot, and a video snapshot.
Optionally, in a fifth implementation manner of the second aspect of the present invention, the comparing module is specifically configured to:
according to the first path directory and the second path directory, carrying out homonymous page snapshot comparison on the page snapshot and the normal page snapshot to obtain a comparison result;
if the page snapshot is a text snapshot, performing text comparison on the page snapshot and the normal page snapshot; and if the page snapshot is a non-text snapshot, performing MD5 value comparison on the page snapshot and the normal page snapshot.
Optionally, in a sixth implementation manner of the second aspect of the present invention, the website page monitoring apparatus further includes:
and the updating module triggers manual review after triggering page abnormity alarm, renames the second path directory after the manual review passes, copies the first path directory and sets the copied first path directory as the second path directory.
A third aspect of the present invention provides a computer apparatus comprising: a memory and at least one processor, the memory having instructions stored therein;
the at least one processor invokes the instructions in the memory to cause the computer device to perform the web site page monitoring method of any of the above.
A fourth aspect of the invention provides a computer readable storage medium having stored thereon a computer program which, when executed by a processor, implements a web site page monitoring method as claimed in any one of the preceding claims.
According to the technical scheme provided by the invention, the corresponding page snapshot is obtained by processing the page content of the current access page, and is compared with the normal page snapshot, and whether the current page is displayed abnormally is judged according to the comparison result. The method and the device for automatically monitoring the web site page greatly save labor cost and time cost, can realize more comprehensive monitoring by directly comparing page contents, improve the accuracy of page monitoring, and can effectively identify the problems of DNS hijacking, illegal tampering of the web site page contents and the like.
Drawings
FIG. 1 is a diagram of a first embodiment of a web site page monitoring method according to an embodiment of the present invention;
FIG. 2 is a diagram of a second embodiment of a web site page monitoring method according to an embodiment of the present invention;
FIG. 3 is a diagram of a third embodiment of a web site page monitoring method according to an embodiment of the present invention;
FIG. 4 is a schematic diagram of an embodiment of a website page monitoring apparatus according to the embodiment of the present invention;
FIG. 5 is a schematic diagram of another embodiment of a website page monitoring apparatus according to an embodiment of the present invention;
FIG. 6 is a diagram of an embodiment of a computer device in an embodiment of the invention.
Detailed Description
The embodiment of the invention provides a method, a device, equipment and a storage medium for monitoring web site pages, wherein the method, the device and the equipment are used for recording the correct state of a web site in a snapshot mode and monitoring the pages in an anti-triggering mode, so that the labor cost and the time cost are saved, the alarm accuracy and the inspection efficiency are greatly improved, and the problems of DSN hijacking, illegal tampering of the web site page contents and the like can be effectively identified.
The terms "first," "second," "third," "fourth," and the like in the description and in the claims, as well as in the drawings, if any, are used for distinguishing between similar elements and not necessarily for describing a particular sequential or chronological order. It will be appreciated that the data so used may be interchanged under appropriate circumstances such that the embodiments described herein may be practiced otherwise than as specifically illustrated or described herein. Furthermore, the terms "comprises," "comprising," or "having," and any variations thereof, are intended to cover non-exclusive inclusions, such that a process, method, system, article, or apparatus that comprises a list of steps or elements is not necessarily limited to those steps or elements expressly listed, but may include other steps or elements not expressly listed or inherent to such process, method, article, or apparatus.
For convenience of understanding, a specific flow of the embodiment of the present invention is described below, and referring to fig. 1, a first embodiment of a web site page monitoring method in the embodiment of the present invention includes:
101. reading the page content of the current access web site page and saving the page content as a page snapshot;
it is to be understood that the execution subject of the present invention may be a website page monitoring apparatus, and may also be a terminal or a server, which is not limited herein. The embodiment of the present invention is described by taking a server as an execution subject.
In this embodiment, a web site refers to a collection of related web pages that are created on the internet using tools such as HTML according to a certain rule and are used to display specific content.
In this embodiment, the monitoring may be a web site or a specific certain page.
In this embodiment, the method for obtaining the page content is not limited, and includes but is not limited to: directly sending an http request to a web server by a monitoring server to obtain response data; and sending an http request to the web server by the user browser, and forwarding the http request to the monitoring server after obtaining response data.
In this embodiment, the page content includes one or more of text, picture, audio, and video; the page snapshot includes one or more of a text snapshot, a picture snapshot, an audio snapshot, and a video snapshot.
Optionally, in an embodiment, before the step 101, the method further includes:
sending an access request to a web server to be accessed, and acquiring a response status code corresponding to the access request;
judging whether the response state code is a preset value or not;
if the preset value is the preset value, determining that the web site page to be accessed is normal, executing the step of reading the page content of the currently accessed web site page and saving the page content as a page snapshot, and otherwise triggering a page abnormity alarm.
102. Acquiring a first path directory of the page snapshot and a second path directory of a normal page snapshot;
in this embodiment, since one web site includes a plurality of pages, one web site has a plurality of page snapshots, and all the page snapshots of one web site are stored in the specified path directory, so that the subsequent steps can be conveniently performed.
In this embodiment, the normal page snapshot is a normal page snapshot in the current version.
103. Comparing the page snapshot with the normal page snapshot according to the first path catalog and the second path catalog to obtain a comparison result;
in an embodiment, step 103 specifically includes:
(1) sorting the snapshot files in the first path directory and the second path directory according to the lexicographic order of the file names respectively, and counting the number of the files respectively;
(2) if the number of the first path directory files is different from that of the second path directory files, returning the difference of the directories as a comparison result, and otherwise, continuing to execute the next step;
(3) and comparing the corresponding files with the same name one by one in sequence to obtain and output a comparison result.
104. And if the comparison result is that the directories have differences, triggering a page abnormity alarm.
In this embodiment, the catalog differences include, but are not limited to: the page snapshot files have differences; the content of the files with the same name is different.
In this embodiment, the page abnormality warning information is sent by the monitoring server to the relevant technical staff for processing.
In the embodiment of the invention, the corresponding page snapshot is obtained by processing the page content of the current access page, and is compared with the normal page snapshot, and whether the current page is displayed abnormally is judged according to the comparison result. The automatic monitoring of the web site page greatly saves labor cost and time cost, more comprehensive monitoring can be realized by directly comparing page contents, the accuracy of page monitoring is improved, and the problems of DNS hijacking, illegal tampering of the web site page contents and the like can be effectively identified.
Referring to fig. 2, a second embodiment of a web site page monitoring method according to the embodiment of the present invention includes:
201. reading page content of a current access web site page;
202. saving the page content as a page snapshot of a corresponding type according to the information type of the page content, and naming by using a page title and a url of the web site page;
in this embodiment, the information types of the page content include a text type and a non-text type, where the non-text type includes, but is not limited to, a picture, audio, and video.
In this embodiment, the page title is the content of the title tag in the html document; each information Resource on the network has a Uniform and unique address on the network, which is denoted as a URL (Uniform Resource Locator).
Optionally, in an embodiment, the step 202 includes:
if the information type of the page content is a text type, saving the page content as a text snapshot;
if the information type of the page content is a non-text type, generating an MD5 value of the page content, and storing the MD5 value as a corresponding type of page snapshot.
203. Saving the page snapshot corresponding to the web site page to the first path directory, wherein when the homonymy page snapshot exists in the first path directory, the homonymy page snapshot is replaced;
204. acquiring a first path directory of the page snapshot and a second path directory of a normal page snapshot;
205. comparing the page snapshot with the normal page snapshot according to the first path catalog and the second path catalog to obtain a comparison result;
206. if the comparison result is that the directories have differences, triggering a page abnormity alarm;
in the embodiment of the invention, the information types of the web site page content are divided into the text type and the non-text type, and the corresponding page content snapshots are respectively stored, so that more accurate page monitoring can be realized by directly comparing the page content.
Referring to fig. 3, a third embodiment of a web site page monitoring method according to the embodiment of the present invention includes:
301. reading the page content of the current access web site page and saving the page content as a page snapshot;
302. acquiring a first path directory of the page snapshot and a second path directory of the normal page snapshot
303. Comparing the page snapshot with the normal page snapshot according to the first path catalog and the second path catalog to obtain a comparison result;
in this embodiment, if the page snapshot is a text snapshot, performing text comparison on the page snapshot and the normal page snapshot; if the page snapshot is a non-text snapshot, performing MD5 value comparison on the page snapshot and the normal page snapshot;
304. if the comparison result is that the directories have differences, triggering a page abnormity alarm;
305. if the comparison result is that the directories are the same, determining that the currently accessed web site is normal;
306. and after triggering page abnormity alarm, triggering manual recheck, renaming the second path directory after the manual recheck is passed, copying the first path directory and setting the copied first path directory as the second path directory.
In this embodiment, after manual review, if it is determined that the modification of the currently accessed page is correct, the step of renaming the second path directory is performed, and the steps of copying the first path directory and setting the copied first path directory as the second path directory are performed, otherwise, only the error page is corrected.
In the embodiment of the invention, whether the non-text content in the page is normally displayed is determined by comparing the MD5 values, so that the accuracy of monitoring the web site page is effectively improved. The manual review saves the historical version record of the web site through the directory replication step after the manual review, and meanwhile, the embodiment can also be used for checking the updated page content because the page snapshots before and after the page update are separately stored.
With reference to fig. 4, the method for monitoring a web site page in the embodiment of the present invention is described above, and a web site page monitoring apparatus in the embodiment of the present invention is described below, where an embodiment of the web site page monitoring apparatus in the embodiment of the present invention includes:
the reading module 401 is configured to read page content of a currently visited web site page and store the page content as a page snapshot;
an obtaining module 402, configured to obtain a first path directory of the page snapshot and a second path directory of a normal page snapshot;
a comparing module 403, configured to compare the page snapshot with the normal page snapshot according to the first path directory and the second path directory, and obtain a comparison result;
and an alarm module 404, configured to trigger a page exception alarm if the comparison result indicates that the directories are different.
Optionally, in an embodiment, the reading module 401 is specifically configured to:
reading page content of a current access web site page; saving the page content as a page snapshot of a corresponding type according to the information type of the page content, and naming by using a page title and a url of the web site page; and saving the page snapshot corresponding to the web site page to the first path directory, wherein when the homonymy page snapshot exists in the first path directory, the homonymy page snapshot is replaced.
In the embodiment of the invention, the corresponding page snapshot is obtained by processing the page content of the current access page, and is compared with the normal page snapshot, and whether the current page is displayed abnormally is judged according to the comparison result. The automatic monitoring of the web site page greatly saves labor cost and time cost, more comprehensive monitoring can be realized by directly comparing page contents, the accuracy of page monitoring is improved, and the problems of DNS hijacking, illegal tampering of the web site page contents and the like can be effectively identified.
Referring to fig. 5, another embodiment of a website page monitoring apparatus according to an embodiment of the present invention includes:
the reading module 401 is configured to read page content of a currently visited web site page and store the page content as a page snapshot;
an obtaining module 402, configured to obtain a first path directory of the page snapshot and a second path directory of a normal page snapshot;
a comparing module 403, configured to compare the page snapshot with the normal page snapshot according to the first path directory and the second path directory, and obtain a comparison result;
and an alarm module 404, configured to trigger a page exception alarm if the comparison result indicates that the directories are different.
The preprocessing module 405 is configured to send an access request to a web server to be accessed, acquire a response status code corresponding to the access request, and perform preprocessing.
And the updating module 406 is used for triggering manual review after the page exception alarm is triggered, renaming the second path directory after the manual review passes, copying the first path directory and setting the copied first path directory as the second path directory.
Optionally, in an embodiment, the reading module 401 includes:
the page reading unit is used for reading the page content of the current access web site page;
the snapshot storage unit is used for storing the page content into a page snapshot of a corresponding type according to the information type of the page content and naming the page title and url of the web site page; and saving the page snapshot corresponding to the web site page to the first path directory, wherein when the homonymy page snapshot exists in the first path directory, the homonymy page snapshot is replaced.
Optionally, in an embodiment, the snapshot storage unit may be further specifically configured to:
if the information type of the page content is a text type, saving the page content as a text snapshot;
if the information type of the page content is a non-text type, generating an MD5 value of the page content, and storing the MD5 value as a corresponding type of page snapshot.
Optionally, in an embodiment, the comparing module 403 may be further specifically configured to:
if the page snapshot is a text snapshot, performing text comparison on the page snapshot and the normal page snapshot; and if the page snapshot is a non-text snapshot, performing MD5 value comparison on the page snapshot and the normal page snapshot.
Optionally, in an embodiment, the preprocessing module 405 is specifically configured to:
sending an access request to a web server to be accessed, and acquiring a response status code corresponding to the access request; judging whether the response state code is a preset value or not; if the preset value is the preset value, determining that the web site page to be accessed is normal, executing the step of reading the page content of the currently accessed web site page and saving the page content as a page snapshot, and otherwise triggering a page abnormity alarm.
In the embodiment of the invention, the information types of the web site page content are divided into the text type and the non-text type, corresponding page content snapshots are respectively stored, and whether the non-text type content in the page is normally displayed is determined by comparing MD5 values, so that the monitoring accuracy of the web site page is effectively improved.
Fig. 4 and 5 describe the website page monitoring apparatus in the embodiment of the present invention in detail from the perspective of the modular functional entity, and computer equipment in the embodiment of the present invention is described in detail from the perspective of hardware processing.
Fig. 6 is a schematic structural diagram of a computer device 500 according to an embodiment of the present invention, where the computer device 500 may have a relatively large difference due to different configurations or performances, and may include one or more processors (CPUs) 510 (e.g., one or more processors) and a memory 520, and one or more storage media 530 (e.g., one or more mass storage devices) for storing applications 533 or data 532. Memory 520 and storage media 530 may be, among other things, transient or persistent storage. The program stored on the storage medium 530 may include one or more modules (not shown), each of which may include a sequence of instructions operating on the computer device 500. Further, the processor 510 may be configured to communicate with the storage medium 530 to execute a series of instruction operations in the storage medium 530 on the computer device 500.
The computer device 500 may also include one or more power supplies 540, one or more wired or wireless network interfaces 550, one or more input-output interfaces 560, and/or one or more operating systems 531, such as Windows Server, Mac OS X, Unix, Linux, FreeBSD, and the like. Those skilled in the art will appreciate that the computer device configuration illustrated in FIG. 6 does not constitute a limitation of computer devices, and may include more or fewer components than those illustrated, or some components may be combined, or a different arrangement of components.
The present invention also provides a computer device, which includes a memory and a processor, where the memory stores computer readable instructions, and the computer readable instructions, when executed by the processor, cause the processor to execute the steps of the web site page monitoring method in the foregoing embodiments.
The present invention also provides a computer-readable storage medium, which may be a non-volatile computer-readable storage medium, and which may also be a volatile computer-readable storage medium, having stored therein instructions, which, when run on a computer, cause the computer to perform the steps of the web site page monitoring method.
It is clear to those skilled in the art that, for convenience and brevity of description, the specific working processes of the above-described systems, apparatuses and units may refer to the corresponding processes in the foregoing method embodiments, and are not described herein again.
The integrated unit, if implemented in the form of a software functional unit and sold or used as a stand-alone product, may be stored in a computer readable storage medium. Based on such understanding, the technical solution of the present invention may be embodied in the form of a software product, which is stored in a storage medium and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device) to execute all or part of the steps of the method according to the embodiments of the present invention. And the aforementioned storage medium includes: various media capable of storing program codes, such as a usb disk, a removable hard disk, a read-only memory (ROM), a Random Access Memory (RAM), a magnetic disk, or an optical disk.
The above-mentioned embodiments are only used for illustrating the technical solutions of the present invention, and not for limiting the same; although the present invention has been described in detail with reference to the foregoing embodiments, it will be understood by those of ordinary skill in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some technical features may be equivalently replaced; and such modifications or substitutions do not depart from the spirit and scope of the corresponding technical solutions of the embodiments of the present invention.

Claims (10)

1. A web site page monitoring method is characterized by comprising the following steps:
reading the page content of the current access web site page and saving the page content as a page snapshot;
acquiring a first path directory of the page snapshot and a second path directory of a normal page snapshot;
comparing the page snapshot with the normal page snapshot according to the first path catalog and the second path catalog to obtain a comparison result;
and if the comparison result is that the directories have differences, triggering a page abnormity alarm.
2. The website page monitoring method according to claim 1, wherein before the reading of the page content of the currently visited website page and saving as a page snapshot, the method further comprises:
sending an access request to a web server to be accessed, and acquiring a response status code corresponding to the access request;
judging whether the response state code is a preset value or not;
if the preset value is the preset value, determining that the web site page to be accessed is normal, executing the step of reading the page content of the currently accessed web site page and saving the page content as a page snapshot, and otherwise triggering a page abnormity alarm.
3. The website page monitoring method according to claim 1, wherein the reading page content of a currently visited website page and saving as a page snapshot comprises:
reading page content of a current access web site page;
saving the page content as a page snapshot of a corresponding type according to the information type of the page content, and naming by using a page title and a url of the web site page;
and saving the page snapshot corresponding to the web site page to the first path directory, wherein when the homonymy page snapshot exists in the first path directory, the homonymy page snapshot is replaced.
4. The website page monitoring method according to claim 3, wherein the saving the page content as the corresponding type of page snapshot according to the information type of the page content comprises:
if the information type of the page content is a text type, saving the page content as a text snapshot;
if the information type of the page content is a non-text type, generating an MD5 value of the page content, and storing the MD5 value as a corresponding type of page snapshot.
5. The website page monitoring method according to claim 3 or 4, wherein the page content comprises one or more of text, pictures, audio and video; the page snapshot includes one or more of a text snapshot, a picture snapshot, an audio snapshot, and a video snapshot.
6. The website page monitoring method according to claim 4, wherein the comparing the page snapshot with the normal page snapshot according to the first path directory and the second path directory, and obtaining a comparison result comprises:
according to the first path directory and the second path directory, carrying out homonymous page snapshot comparison on the page snapshot and the normal page snapshot to obtain a comparison result;
if the page snapshot is a text snapshot, performing text comparison on the page snapshot and the normal page snapshot; and if the page snapshot is a non-text snapshot, performing MD5 value comparison on the page snapshot and the normal page snapshot.
7. The website page monitoring method according to claim 4 or 6, wherein after comparing the page snapshot with the normal page snapshot according to the first path directory and the second path directory and obtaining a comparison result, the method further comprises:
if the comparison result is that the directories are the same, determining that the currently accessed web site is normal;
and after triggering page abnormity alarm, triggering manual recheck, renaming the second path directory after the manual recheck is passed, copying the first path directory and setting the copied first path directory as the second path directory.
8. A website page monitoring apparatus, the website page monitoring apparatus comprising:
the reading module is used for reading the page content of the current access web site page and saving the page content as a page snapshot;
the acquisition module is used for acquiring a first path directory of the page snapshot and a second path directory of the normal page snapshot;
the comparison module is used for comparing the page snapshot with the normal page snapshot according to the first path catalog and the second path catalog to obtain a comparison result;
and the warning module is used for triggering page abnormity warning if the comparison result is that the directories have differences.
9. A computer device, characterized in that the computer device comprises: a memory and at least one processor, the memory having instructions stored therein;
the at least one processor invoking the instructions in the memory to cause the computer device to perform the website page monitoring method of any of claims 1-7.
10. A computer-readable storage medium having instructions stored thereon, wherein the instructions, when executed by a processor, implement the web site page monitoring method of any one of claims 1-7.
CN202110927381.8A 2021-08-10 2021-08-10 Web site page monitoring method, device, equipment and storage medium Pending CN113836459A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110927381.8A CN113836459A (en) 2021-08-10 2021-08-10 Web site page monitoring method, device, equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110927381.8A CN113836459A (en) 2021-08-10 2021-08-10 Web site page monitoring method, device, equipment and storage medium

Publications (1)

Publication Number Publication Date
CN113836459A true CN113836459A (en) 2021-12-24

Family

ID=78960490

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110927381.8A Pending CN113836459A (en) 2021-08-10 2021-08-10 Web site page monitoring method, device, equipment and storage medium

Country Status (1)

Country Link
CN (1) CN113836459A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116909849A (en) * 2023-09-12 2023-10-20 北京市大数据中心 Page monitoring method, device and storage medium
CN117176483A (en) * 2023-11-03 2023-12-05 北京艾瑞数智科技有限公司 Abnormal URL identification method and device and related products

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116909849A (en) * 2023-09-12 2023-10-20 北京市大数据中心 Page monitoring method, device and storage medium
CN117176483A (en) * 2023-11-03 2023-12-05 北京艾瑞数智科技有限公司 Abnormal URL identification method and device and related products

Similar Documents

Publication Publication Date Title
CN102663062B (en) Method and device for processing invalid links in search result
CN109902073B (en) Log processing method and device, computer equipment and computer readable storage medium
US5898836A (en) Change-detection tool indicating degree and location of change of internet documents by comparison of cyclic-redundancy-check(CRC) signatures
US10621212B2 (en) Language tag management on international data storage
CN103078945B (en) To the method and system that browser crash data processes
CN113836459A (en) Web site page monitoring method, device, equipment and storage medium
CN111723313A (en) Page jump processing method, device, equipment and storage medium
US20070174324A1 (en) Mechanism to trap obsolete web page references and auto-correct invalid web page references
CN114417197A (en) Access record processing method and device and storage medium
CN107133165B (en) Browser compatibility detection method and device
US20050177597A1 (en) System and method of exercising a browser
CN112417240A (en) Website link detection method and device and computer equipment
CN109344046B (en) Data processing method, device, medium and electronic equipment
WO2019030698A1 (en) Method, system and apparatus for processing database updates
CN106815248B (en) Website analysis method and device
JP2005099967A (en) Preventive maintenance method
CN113934617A (en) Data processing method, device, equipment and storage medium
US20100082690A1 (en) System And Method For Recording Files Of Data
CN117194757A (en) Report generation method, device, equipment and storage medium based on big data
CN109684220A (en) A kind of browser compatibility analysis method based on event replay
CN111209138A (en) Operation and maintenance method and device of data storage system
CN105653625B (en) Abnormal data analysis method and device
CN113590987A (en) Link detection method and device
CN111382079B (en) Method, device and equipment for restoring application program on site and storage medium
WO2016192492A1 (en) Website detection method and device based on virtual host

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination