CN108090080A - Method and system for replacing a parsing template, and crawling method - Google Patents

Method and system for replacing a parsing template, and crawling method

Info

Publication number
CN108090080A
Authority
CN
China
Prior art keywords
parsing template
parsing
template
task
content
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201611036900.7A
Other languages
Chinese (zh)
Inventor
李强
廖耀华
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Jingdong Century Trading Co Ltd
Beijing Jingdong Shangke Information Technology Co Ltd
Original Assignee
Beijing Jingdong Century Trading Co Ltd
Beijing Jingdong Shangke Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Jingdong Century Trading Co Ltd and Beijing Jingdong Shangke Information Technology Co Ltd
Priority to CN201611036900.7A
Publication of CN108090080A
Legal status: Pending

Classifications

    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00: Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90: Details of database functions independent of the retrieved data types
    • G06F16/95: Retrieval from the web
    • G06F16/951: Indexing; Web crawling techniques

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The disclosure provides a method and system for replacing a parsing template, where the parsing template is used by a crawler to parse crawl tasks. The method includes: registering with a task monitor in order to monitor changes to the parsing template, where the task monitor stores the parsing template and monitors changes to it; obtaining a parsing template change message from the task monitor; and obtaining the changed parsing template content. The method and system provided by the disclosure allow parsing template content to be changed promptly without restarting the crawler capturing end.

Description

Method and system for replacing a parsing template, and crawling method
Technical field
The disclosure relates to the field of Internet technology, and in particular to a method and system for replacing a parsing template.
Background
With the explosive growth of information on the Internet, more and more applications depend on acquiring large amounts of information, such as search engines, e-commerce price comparison systems and public opinion monitoring systems. Web crawlers are an indispensable low-level component of these applications. For Internet information at large data volumes, a single-node crawler cannot meet practical demand, which is why distributed web crawlers emerged.
A distributed web crawler system is generally divided into two parts: a crawler dispatching end and crawler capturing ends. The crawler dispatching end is mainly responsible for distributing capture tasks sensibly among multiple crawler capturing ends, and for receiving and saving the capture results that the capturing ends return. A crawler capturing end is mainly responsible for simulating access requests, fetching the raw HTML of web pages, parsing and extracting that raw HTML with the configured parsing template to obtain a capture result, and returning the result to the crawler dispatching end. The content of a parsing template is generally a regular expression that is matched against the page content to obtain the captured data.
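As a rough illustration of what such a parsing template does, the following minimal sketch applies a regular expression to raw HTML. The template string, the sample page and the function name `parse_page` are assumptions made for this example and do not come from the patent.

```python
import re

# Hypothetical parsing template: a regular expression with one named group.
PRICE_TEMPLATE = r'<span class="price">(?P<price>[\d.]+)</span>'


def parse_page(html: str, template: str) -> list:
    """Apply a parsing template to raw HTML and return one dict per match."""
    return [match.groupdict() for match in re.finditer(template, html)]


sample_html = '<div><span class="price">19.90</span><span class="price">5.00</span></div>'
result = parse_page(sample_html, PRICE_TEMPLATE)
```

If the site changed the `class` attribute or the tag, the same template would return an empty list, which is the parsing failure that motivates replacing templates.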
Because the HTML structure differs from website to website, the parsing template configured for a capture task also differs between websites. When a website is redesigned or upgraded, its HTML structure changes, and the parsing templates on all crawler capturing ends must be adapted and updated; otherwise the parsed data will be abnormal.
Therefore, an improved method for replacing a parsing template is needed.
It should be noted that the information disclosed in the Background section above is only intended to enhance understanding of the background of the disclosure, and may therefore include information that does not constitute prior art known to a person of ordinary skill in the art.
Summary of the invention
The purpose of the disclosure is to provide a method and system for replacing a parsing template, so that parsing template content can be changed promptly without restarting the crawler capturing end, achieving automated hot replacement of parsing template content.
According to one aspect of the disclosure, a method for replacing a parsing template is provided, the parsing template being used by a crawler to parse crawl tasks, the method including:
registering with a task monitor in order to monitor changes to the parsing template, where the task monitor stores the parsing template and monitors changes to the parsing template;
obtaining a parsing template change message from the task monitor; and
obtaining the changed parsing template content.
In an exemplary embodiment of the disclosure, the task monitor is implemented by a Zookeeper server.
In an exemplary embodiment of the disclosure, the content of the parsing template includes a regular expression for extracting web page content.
In an exemplary embodiment of the disclosure, the task monitor stores one or more parsing templates in the form of a task tree.
In an exemplary embodiment of the disclosure, monitoring changes to the parsing template includes monitoring changes to one or more parsing templates.
In an exemplary embodiment of the disclosure, the method further includes: storing the changed parsing template content in memory in key-value form.
In an exemplary embodiment of the disclosure, the method further includes: initializing the parsing template.
According to one aspect of the disclosure, a system for replacing a parsing template is provided, the parsing template being used by a crawler to parse crawl tasks, the system including:
a monitoring registration module, configured to register with a task monitor in order to monitor changes to the parsing template, where the task monitor stores the parsing template and monitors changes to the parsing template;
a notification acquisition module, configured to obtain a parsing template change message from the task monitor; and
a parsing template update module, configured to obtain the content of the changed parsing template and store the content of the parsing template in memory.
In an exemplary embodiment of the disclosure, the task monitor is implemented by a Zookeeper server.
According to one aspect of the disclosure, a crawling method is provided, which includes the method for replacing a parsing template according to any one of the above.
The method for replacing a parsing template provided by the disclosure monitors parsing template changes and obtains the updated parsing template when a change notification is received. This reduces the manual intervention needed to update a parsing template, responds promptly to template updates, avoids the processor time wasted by periodically checking for template updates, and makes template updating more efficient. In addition, because the content of the updated parsing template is written directly into memory, the parsing template is updated without restarting the crawler capturing end, achieving automated hot replacement of parsing template content.
It should be understood that the above general description and the following detailed description are only exemplary and explanatory, and do not limit the disclosure.
Description of the drawings
The drawings herein are incorporated into and form part of this specification. They show embodiments consistent with the disclosure and, together with the specification, serve to explain the principles of the disclosure. The drawings described below are evidently only some embodiments of the disclosure; a person of ordinary skill in the art can obtain other drawings from them without creative effort.
Fig. 1 schematically shows a flow chart of a method for replacing a parsing template in an exemplary embodiment of the disclosure.
Fig. 2 schematically shows a schematic diagram of a task tree in an exemplary embodiment of the disclosure.
Fig. 3 schematically shows an interaction diagram of parsing template updating in an exemplary embodiment of the disclosure.
Fig. 4 schematically shows a block diagram of a system for replacing a parsing template in an exemplary embodiment of the disclosure.
Detailed description of the embodiments
Example embodiments will now be described more fully with reference to the accompanying drawings. However, example embodiments can be implemented in many forms and should not be construed as limited to the examples set forth herein; rather, these embodiments are provided so that the disclosure will be thorough and complete, and will fully convey the concept of the example embodiments to those skilled in the art. The described features, structures or characteristics may be combined in any suitable manner in one or more embodiments. In the following description, many specific details are provided to give a thorough understanding of the embodiments of the disclosure. Those skilled in the art will appreciate, however, that the technical solutions of the disclosure may be practiced with one or more of the specific details omitted, or with other methods, components, systems, steps and so on. In other cases, well-known solutions are not shown or described in detail, to avoid obscuring aspects of the disclosure.
In addition, the drawings are only schematic illustrations of the disclosure. Identical reference numerals in the drawings denote identical or similar parts, so repeated descriptions of them are omitted. Some of the block diagrams shown in the drawings are functional entities and do not necessarily correspond to physically or logically independent entities. These functional entities may be implemented in software, in one or more hardware modules or integrated circuits, or in different networks and/or processor systems and/or microcontroller systems.
Example embodiments of the disclosure are described in detail below with reference to the drawings.
Fig. 1 is a flow chart of a method 100 for replacing a parsing template in an example embodiment of the disclosure. The parsing template can be used by a crawler to parse crawl tasks.
Referring to Fig. 1, step S102 registers with a task monitor in order to monitor changes to the parsing template, where the task monitor stores the parsing template and monitors changes to the parsing template.
In this example embodiment, the task monitor can be implemented by a Zookeeper server; for example, changes to the parsing template can be monitored through the Watcher mechanism that Zookeeper provides. Zookeeper is an open-source coordination service for distributed applications, and it can be used by integrating its source code into application software. In Zookeeper, when an object sets a watch on a Zookeeper node and the content of that node is changed, the Zookeeper server automatically notifies every object that has set a watch on the node, informing it that the content of the node it monitors has changed. This is the Watcher mechanism of Zookeeper. In other embodiments of the disclosure, the task monitor can also be implemented with other tools; the disclosure places no particular limit on this.
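The notification pattern described above can be sketched in a few lines. This is a minimal in-memory stand-in, not the Zookeeper client API: the class `TaskMonitor`, its methods `watch`, `get` and `set`, and the node path `/tasks/task-1` are hypothetical names chosen for illustration. It shows only the idea that every object watching a node is told when the content of that node changes.

```python
from collections import defaultdict


class TaskMonitor:
    """Hypothetical in-memory stand-in for the task monitor (not the Zookeeper API)."""

    def __init__(self):
        self._nodes = {}                     # node path -> parsing template content
        self._watchers = defaultdict(list)   # node path -> registered callbacks

    def watch(self, path, callback):
        """Register a callback to be invoked when the node at `path` changes."""
        self._watchers[path].append(callback)

    def get(self, path):
        return self._nodes[path]

    def set(self, path, content):
        """Change node content, then notify every watcher of that node."""
        self._nodes[path] = content
        for callback in list(self._watchers[path]):
            callback(path)   # the message says which node changed, not its content


monitor = TaskMonitor()
monitor.set("/tasks/task-1", r"<h1>(.*?)</h1>")

notified = []
monitor.watch("/tasks/task-1", lambda path: notified.append(("end-A", path)))
monitor.watch("/tasks/task-1", lambda path: notified.append(("end-B", path)))

monitor.set("/tasks/task-1", r"<h2>(.*?)</h2>")   # both watchers are notified
```

In Zookeeper itself, standard watches are one-time triggers, so a real client would normally re-register after each notification; the stand-in keeps callbacks registered for brevity.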
In some embodiments, multiple crawler capturing ends can register with the task monitor to monitor changes to their corresponding parsing templates. Monitoring changes to the parsing template can include monitoring changes to one or more parsing templates. That is, the correspondence between crawler capturing ends and parsing templates can be one-to-many, many-to-one or many-to-many: one crawler capturing end can monitor multiple parsing templates, and one parsing template can be monitored by multiple crawler capturing ends.
It is worth noting that some embodiments of the disclosure may also include initializing the parsing templates of the crawler capturing end. For example, the crawler capturing end may first obtain all of its corresponding parsing templates and, while obtaining them, register with the task monitor to monitor changes to them; that is, while a parsing template has not changed, the crawler capturing end executes the corresponding task with the initial parsing template. Initializing the parsing templates and registering with the task monitor may or may not be carried out at the same time; the disclosure does not limit this.
Step S104 obtains a parsing template change message from the task monitor.
The parsing template change message may be obtained passively or actively. In passive acquisition, the task monitor sends a change notification; in active acquisition, the crawler capturing end itself queries for parsing template change messages. In this example embodiment, where the task monitor is implemented with a Zookeeper server, the change message is obtained passively: when parsing template content changes, the Zookeeper server proactively notifies all crawler capturing ends that monitor that parsing template.
An existing approach replaces parsing templates by periodically checking a version number. It records the version number each time a parsing template is updated, and polls at fixed intervals to compare the locally recorded version number with the version number of the latest parsing template on the server. When the two differ, it downloads the latest parsing template from the server, replaces the local parsing template, and sets the local version number to match the server's. That approach cannot guarantee timeliness and increases CPU load. By contrast, the method of the disclosure uses the Zookeeper watch mechanism to monitor the parsing template: when parsing template content changes, the Zookeeper server proactively notifies all crawler capturing ends that monitor that parsing template. This avoids the added CPU load and notifies the crawler capturing ends promptly when a parsing template changes, so that parsing templates are upgraded in time.
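For comparison, the version-polling approach that this paragraph contrasts with can be sketched as follows. The dictionaries `server` and `local` and the function `poll_once` are hypothetical and only model one polling tick; the point is that every tick costs a comparison even when nothing has changed, and a change is only noticed at the next tick.

```python
def poll_once(local: dict, server: dict) -> bool:
    """One polling tick: compare version numbers and download the template if they differ."""
    if local["version"] == server["version"]:
        return False                        # nothing changed, the check was wasted work
    local["content"] = server["content"]    # replace the local parsing template
    local["version"] = server["version"]    # align the local version with the server
    return True


server = {"version": 1, "content": r"<h1>(.*?)</h1>"}
local = dict(server)

ticks = [poll_once(local, server) for _ in range(3)]   # three ticks with no change
server.update(version=2, content=r"<h2>(.*?)</h2>")    # the template is changed on the server
ticks.append(poll_once(local, server))                 # noticed only at the next tick
```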
When a crawler capturing end monitors changes to one or more parsing templates, obtaining the changed parsing template content may include obtaining the content of one or more parsing templates. When one parsing template changes, the crawler capturing end obtains the change message for that template; when several parsing templates change at the same time, it obtains the change messages for all of them. A change to a parsing template may be a content update or the deletion of the template, and may be made by manually modifying the parsing template content stored in the task monitor.
Step S106 obtains the changed parsing template content.
After the crawler capturing end obtains the changed content of one or more parsing templates, it may store that content in memory in key-value form. Here the key may represent the task name corresponding to a parsing template, and the value may represent the content of that parsing template. When the content of a parsing template changes, the crawler capturing end uses the task name corresponding to the template, that is, the key, to store the corresponding parsing template content in memory as the value.
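A minimal sketch of this key-value storage, assuming a plain dictionary keyed by task name, is shown below. The task names, the templates and the helper `apply_change` are hypothetical; passing `None` is simply this example's way of modelling the deletion of a template.

```python
from typing import Dict, Optional

# Key: name of the crawl task; value: parsing template content (a regular expression).
templates: Dict[str, str] = {"price-task": r'<span class="price">([\d.]+)</span>'}


def apply_change(task_name: str, new_content: Optional[str]) -> None:
    """Store changed content under its task name; None models a deleted template."""
    if new_content is None:
        templates.pop(task_name, None)
    else:
        templates[task_name] = new_content


apply_change("price-task", r'<em class="p">([\d.]+)</em>')   # content update
apply_change("title-task", r"<h1>(.*?)</h1>")                # newly obtained template
apply_change("title-task", None)                              # template deletion
```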
An existing approach that replaces the whole code package of the crawler capturing end generally works as follows: after a parsing template is updated, the crawler capturing end code is packaged manually, the new code package replaces the entire original code package, and the crawler capturing end is restarted. Because it requires manual packaging, this approach makes upgrades inefficient, and because the crawler capturing end must be restarted when the package is replaced, the work efficiency of the crawler capturing end drops. The method of this example embodiment stores parsing template content directly in memory, which avoids the restart that the above approach needs in order to load parsing template content, and the interruption of crawling that the restart causes, thereby achieving hot replacement of the parsing template.
The method for replacing a parsing template provided by the disclosure monitors parsing template changes and obtains the updated parsing template when a change notification is received. This reduces the manual intervention needed to update a parsing template, responds promptly to template updates, avoids the processor time wasted by periodically checking for template updates, and makes template updating more efficient. In addition, because the content of the updated parsing template is written directly into memory, the parsing template is updated without restarting the crawler capturing end, achieving automated hot replacement of parsing template content.
The method of the disclosure for replacing a parsing template is described in detail below with reference to specific embodiments.
In some embodiments, the task monitor storing the parsing template can include storing one or more parsing templates in the form of a task tree. The content of a parsing template includes a regular expression for extracting web page content.
Fig. 2 shows a schematic diagram of a task tree of the disclosure. Referring to Fig. 2, task tree 200 may include general task 201 and subtasks 2011, 2012 and 2013. Although only three subtasks are shown in the drawing, there may be any number of subtasks in an actual implementation. General task 201 serves as the entry point through which the crawler dispatching end and the crawler capturing ends look up parsing templates, and provides a convenient path for managing the tasks corresponding to all parsing templates.
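The task tree can be pictured as a root node with one child per crawl task, each child holding a parsing template. The sketch below models it as a nested dictionary; the path names, the regular expressions and the helpers `subtask_paths` and `lookup` are assumptions made for illustration, loosely mirroring the reference numerals of Fig. 2.

```python
GENERAL_TASK = "/general-task-201"   # hypothetical path of the root (general task) node

task_tree = {
    GENERAL_TASK: {
        "subtask-2011": r"<title>(.*?)</title>",
        "subtask-2012": r'<span class="price">([\d.]+)</span>',
        "subtask-2013": r'<a href="(.*?)">',
    }
}


def subtask_paths(tree: dict) -> list:
    """List the full path of every subtask node under each root."""
    return [f"{root}/{name}" for root, children in tree.items() for name in children]


def lookup(tree: dict, path: str) -> str:
    """Resolve a full subtask path to the parsing template stored in that node."""
    root, name = path.rsplit("/", 1)
    return tree[root][name]


paths = subtask_paths(task_tree)
price_template = lookup(task_tree, "/general-task-201/subtask-2012")
```

Keeping every template under a single root gives both ends one place to enumerate tasks, which is the management convenience the paragraph above describes.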
Fig. 3 shows an interaction diagram of the complete parsing template replacement process in this example embodiment. Referring to Fig. 3, the complete interaction can include interactions among registration center system end 301, Zookeeper server 302 and crawler capturing end 303.
Registration center system end 301 can be deployed separately as an independent system. Its main functions can include parsing template registration and content synchronization to the Zookeeper server. For parsing template registration, when the parsing template content of a crawl task needs to change, an operator enters the registration center system and inputs the number of the task to change and the new parsing template content for that task, completing the configuration of the parsing template content change. After this configuration is complete, the registration center system back end can connect automatically to the Zookeeper server and, following the tree hierarchy shown in Fig. 2, find or create the child node of the corresponding task under the general task root node, and set the content of the child node to the parsing template content just configured. The parsing templates may also be stored and managed in other ways, for example with a queue, stack, graph or other data structure known to those skilled in the art; the disclosure does not limit this.
The main functions of crawler capturing end 303 can include change monitoring and parsing template updating. The change monitoring function monitors changes to parsing template content, and the parsing template update function updates parsing template content without requiring a restart.
When crawler capturing end 303 starts, it can connect to the Zookeeper server, obtain under the general task root node the parsing template content stored in the subtask nodes corresponding to its crawl tasks, and at the same time register to monitor each of those subtask nodes. In some implementations the crawler capturing end also traverses all subtask nodes; the disclosure places no particular limit on this. When the content of a subtask node changes, crawler capturing end 303 receives the parsing template change notification sent by the Zookeeper server and reads the parsing template content again to obtain the latest parsing template content.
When crawler capturing end 303 obtains parsing template content, it can save that content in local memory in key-value form. The key can include the name of the crawl task corresponding to the parsing template, and the value can include the regular expression content of the parsing template. When crawler capturing end 303 obtains the latest parsing template content of a subtask node again, it can directly change the value for that key to the content of the latest parsing template; that is, the parsing template content is replaced without restarting crawler capturing end 303.
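Why replacing the value is enough can be seen in a small sketch: if the parsing code looks the template up on every call, a changed value takes effect on the next call in the same process. The class `TemplateStore`, its methods and the sample pages are hypothetical; the lock is an added assumption for the case where the update arrives on a different thread from the one that parses.

```python
import re
import threading


class TemplateStore:
    """Hypothetical thread-safe key-value store of parsing templates."""

    def __init__(self):
        self._lock = threading.Lock()
        self._templates = {}

    def put(self, task_name, pattern):
        """Insert or replace the template for a task."""
        with self._lock:
            self._templates[task_name] = pattern

    def extract(self, task_name, html):
        """Parse with whatever template is stored at the moment of the call."""
        with self._lock:
            pattern = self._templates[task_name]
        return re.findall(pattern, html)


store = TemplateStore()
store.put("title-task", r"<h1>(.*?)</h1>")

first = store.extract("title-task", "<h1>Sale</h1>")    # original page layout
before = store.extract("title-task", "<h2>Sale</h2>")   # site redesigned, template is stale
store.put("title-task", r"<h2>(.*?)</h2>")              # hot replacement, no restart
after = store.extract("title-task", "<h2>Sale</h2>")
```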
At registration center system end 301, the specific interaction for configuring a parsing template can include:
A1. The user enters the registration center system.
A2. The user inputs the number of a crawl task and the latest content of the parsing template corresponding to that crawl task. The task number can be a globally unique number that identifies one task, and the parsing template content can include a regular expression for extracting the HTML content of a web page.
A3. The user clicks Confirm, and the task number and its corresponding parsing template content are saved to the database of the registration center system.
A4. The registration center system synchronizes the parsing template content to the Zookeeper server.
The back end of registration center system end 301 starts a thread that connects to the Zookeeper server, after which the workflow on the Zookeeper server side begins. The Zookeeper server-side workflow can include:
B1. Find or create the general task node, which can serve as the root node of all crawl tasks.
B2. Using the task number above, check under the general task node whether a subtask node for the task exists; if so, update the content of that node to the latest parsing template content.
B3. If no subtask node for the task is found, create a subtask node for the task under the general task node, set the node name to the task number, and set the node content to the content of the parsing template corresponding to the task.
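Steps B1 to B3 amount to a find-or-create on the root followed by an update-or-create on the child. The sketch below shows that logic on a nested dictionary standing in for the node tree; the function `sync_template`, the root path and the task number are hypothetical, and a real deployment would issue the corresponding calls against the Zookeeper server instead.

```python
def sync_template(tree: dict, root: str, task_number: str, content: str) -> str:
    """Synchronize one parsing template into the tree and report what was done."""
    children = tree.setdefault(root, {})   # B1: find or create the general task node
    if task_number in children:            # B2: subtask node exists, update its content
        action = "updated"
    else:                                  # B3: no subtask node yet, create it
        action = "created"
    children[task_number] = content        # node name = task number, content = template
    return action


tree = {}
first = sync_template(tree, "/general-task", "1001", r"<h1>(.*?)</h1>")
second = sync_template(tree, "/general-task", "1001", r"<h2>(.*?)</h2>")
```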
For crawler capturing end 303, the change monitoring function can be implemented through two workflows.
One workflow applies when the general task node has been created to make managing the crawl task parsing templates easier. Crawler capturing end 303 can first connect to the Zookeeper server and read the subtask nodes under the general task node that correspond to its capture tasks. It then stores each of them in the memory of crawler capturing end 303 in key-value form, with the name of the subtask node as the key and the content of the subtask node as the value, completing the initialization of the crawl task parsing templates.
The other workflow applies when the general task node has been created and crawler capturing end 303 already holds parsing templates. During the initialization of the parsing templates described above, crawler capturing end 303 can first register with the task monitor (Zookeeper in this example embodiment) to monitor changes to the general task root node and/or to the parsing templates of the subtask nodes under the root node; the registration can use the API that Zookeeper provides. When node content changes, Zookeeper automatically broadcasts a notification to the crawler capturing ends 303 that monitor the changed subtask node.
When crawler capturing end 303 receives a parsing template change notification, it can carry out the following workflow to implement the parsing template update function.
After receiving the notification, crawler capturing end 303 can start a thread that connects to the Zookeeper server and reads the subtask node corresponding to the changed parsing template, obtaining the content of the updated, latest parsing template. It can then take the name (or number) of the crawl task corresponding to the parsing template as the key and the content of the parsing template as the value, and use this key-value pair to update the in-memory Map object that maps crawl tasks to parsing template content in crawler capturing end 303. Because crawler capturing end 303 uses the values of this in-memory Map object directly when parsing captured content, the replacement of the parsing template content is complete as soon as the Map is updated, and no restart of the crawler capturing end is needed while the parsing template is replaced.
Corresponding to the method embodiment above, the disclosure also provides a system for replacing a parsing template, the parsing template being used by a crawler to parse crawl tasks. Fig. 4 is a block diagram of a system for replacing a parsing template. Referring to Fig. 4, system 400 can include monitoring registration module 402, notification acquisition module 404 and parsing template update module 406.
Monitoring registration module 402 can be used to register with the task monitor in order to monitor changes to the parsing template, where the task monitor stores the parsing template and monitors changes to the parsing template.
Notification acquisition module 404 can be used to obtain a parsing template change message from the task monitor.
Parsing template update module 406 can be used to obtain the content of the changed parsing template and store the content of the parsing template in memory.
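One way to picture how the three modules cooperate is the sketch below, in which each module is reduced to a single step of one class. Everything here is an assumption for illustration: `InMemoryMonitor` is a stand-in for the task monitor rather than a real client library, and `CrawlerCapturingEnd`, its method names and the path `/tasks/1001` do not come from the patent.

```python
class InMemoryMonitor:
    """Hypothetical stand-in for the task monitor; not a real client library."""

    def __init__(self):
        self.nodes = {}       # node path -> parsing template content
        self.callbacks = {}   # node path -> callbacks registered for that node

    def register(self, path, callback):
        self.callbacks.setdefault(path, []).append(callback)

    def set(self, path, content):
        self.nodes[path] = content
        for callback in self.callbacks.get(path, []):
            callback(path)    # change message: only the path of the changed node


class CrawlerCapturingEnd:
    def __init__(self, monitor, paths):
        self.monitor = monitor
        self.templates = {}                             # in-memory key-value store
        for path in paths:
            self.templates[path] = monitor.nodes[path]  # initialize from current content
            monitor.register(path, self.on_change)      # role of registration module 402

    def on_change(self, path):
        """Role of notification acquisition module 404: receive the change message."""
        self.update_template(path)

    def update_template(self, path):
        """Role of update module 406: re-read the changed content and store it in memory."""
        self.templates[path] = self.monitor.nodes[path]


monitor = InMemoryMonitor()
monitor.set("/tasks/1001", r"<h1>(.*?)</h1>")
end = CrawlerCapturingEnd(monitor, ["/tasks/1001"])
initial = end.templates["/tasks/1001"]
monitor.set("/tasks/1001", r"<h2>(.*?)</h2>")   # triggers on_change, then update_template
```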
In an exemplary embodiment of the disclosure, the task monitor is implemented by a Zookeeper server.
System 400 above can be used to implement method 100 above. Because the function of each module of system 400 has been described in detail in the corresponding method embodiment, it is not repeated here.
The method and system for replacing a parsing template provided by the disclosure monitor changes to parsing template content and replace the content of the parsing template automatically, which avoids the inefficiency of replacing parsing templates manually. Sending a change notification when a parsing template changes avoids the waste of resources caused by periodically checking for parsing template changes, and ensures that the crawler capturing end upgrades its parsing templates in time. In addition, storing parsing template content directly in the memory of the crawler capturing end avoids restarting the crawler capturing end when a parsing template is updated, and keeps the crawler capturing end program running without interruption.
Further, the disclosure also provides a crawling method, which can include the method 100 for replacing a parsing template described above. The crawling method provided by the disclosure can be used for the crawling work of a crawler capturing end.
Other embodiments of the disclosure will readily occur to those skilled in the art after considering the specification and practicing the invention disclosed herein. This application is intended to cover any variations, uses or adaptations of the disclosure that follow its general principles and include common knowledge or customary technical means in the art that are not disclosed herein. The specification and embodiments are to be regarded as exemplary only, and the true scope and spirit of the disclosure are indicated by the appended claims.

Claims (10)

1. a kind of for replacing the method for parsing template, the parsing template crawls task parsing for reptile, and feature exists In, including:
It is registered to task monitor, to monitor the variation of the parsing template, the task monitor stores the parsing template And monitor the variation of the parsing template;
It obtains the parsing template from the task monitor and changes message;
Obtain the parsing template content after changing.
2. according to the method described in claim 1, it is characterized in that, the task monitor is real by Zookeeper servers It is existing.
3. according to the method described in claim 1, it is characterized in that, the content of the parsing template includes extraction web page contents Regular expression.
4. according to the method described in claim 1, it is characterized in that, the task monitor stored in the form of Task Tree it is one or more A parsing template.
5. according to the method described in claim 4, it is characterized in that, the variation of the monitoring parsing template is one or more including monitoring The variation of a parsing template.
6. it according to the method described in claim 1, it is characterized in that, further includes:By the parsing template content after the variation with Key-value forms are stored into memory.
7. it according to the method described in claim 1, it is characterized in that, further includes:To parsing template initialization.
8. A system for replacing a parsing template, the parsing template being used by a crawler to parse crawl tasks, characterized in that the system comprises:
a monitoring registration module, configured to register with a task monitor to monitor changes to the parsing template, wherein the task monitor stores the parsing template and monitors changes to the parsing template;
a notification acquisition module, configured to obtain a parsing template change message from the task monitor; and
a parsing template update module, configured to obtain the content of the changed parsing template, the content of the parsing template being stored in memory.
9. The system according to claim 8, characterized in that the task monitor is implemented by a ZooKeeper server.
10. A crawling method, characterized in that it comprises the method for replacing a parsing template according to any one of claims 1-7.
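The flow recited in claims 1 to 7 (register, receive a change message, fetch the changed content, keep it in memory as key-value pairs) can be illustrated with a minimal Python sketch. This is not the applicant's implementation and does not use any ZooKeeper API: `TaskMonitor` is a hypothetical in-memory stand-in for the task monitor, and the class names, the path `/tasks/shop/price`, and the sample regular expressions are all assumptions made for illustration.

```python
import re
from typing import Callable, Dict, List


class TaskMonitor:
    """Hypothetical in-memory stand-in for the task monitor (not ZooKeeper)."""

    def __init__(self) -> None:
        # Task-tree path -> parsing template content (here, a regex string).
        self._templates: Dict[str, str] = {}
        self._listeners: Dict[str, List[Callable[[str], None]]] = {}

    def register(self, path: str, on_change: Callable[[str], None]) -> None:
        """Register a listener to be told when the template at `path` changes."""
        self._listeners.setdefault(path, []).append(on_change)

    def get(self, path: str) -> str:
        return self._templates[path]

    def set(self, path: str, content: str) -> None:
        """Store a template, then send a change message (the path only)."""
        self._templates[path] = content
        for notify in self._listeners.get(path, []):
            notify(path)


class Crawler:
    """Keeps compiled parsing templates in memory, keyed by task path."""

    def __init__(self, monitor: TaskMonitor, paths: List[str]) -> None:
        self._monitor = monitor
        self._cache: Dict[str, re.Pattern] = {}  # key-value store in memory
        for path in paths:
            monitor.register(path, self._on_change)  # register with the monitor
            self._on_change(path)                    # initialize the template

    def _on_change(self, path: str) -> None:
        # A change message arrived: fetch the changed content and replace
        # the cached entry, without restarting the crawler.
        self._cache[path] = re.compile(self._monitor.get(path))

    def parse(self, path: str, html: str) -> List[str]:
        return self._cache[path].findall(html)


# Usage: the page layout changes, then the template is replaced at runtime.
PATH = "/tasks/shop/price"  # hypothetical node in the task tree
monitor = TaskMonitor()
monitor.set(PATH, r'<span class="price">([\d.]+)</span>')
crawler = Crawler(monitor, [PATH])

old_page = '<span class="price">19.90</span>'
new_page = '<b id="p">19.90</b>'

before = crawler.parse(PATH, old_page)      # old template matches old layout
stale = crawler.parse(PATH, new_page)       # old template misses new layout
monitor.set(PATH, r'<b id="p">([\d.]+)</b>')
after = crawler.parse(PATH, new_page)       # replaced template matches again
print(before, stale, after)
```

In this sketch the change message carries only the path, and the crawler then reads the new content itself; that mirrors the separate "obtain change message" and "obtain changed content" steps of claim 1. A real ZooKeeper-backed version would use that system's own watch registration instead of the callback list shown here.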
CN201611036900.7A 2016-11-22 2016-11-22 Method and system for replacing a parsing template, and crawling method Pending CN108090080A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201611036900.7A CN108090080A (en) 2016-11-22 2016-11-22 For replacing the method and system and crawling method of parsing template

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201611036900.7A CN108090080A (en) 2016-11-22 2016-11-22 For replacing the method and system and crawling method of parsing template

Publications (1)

Publication Number Publication Date
CN108090080A true CN108090080A (en) 2018-05-29

Family

ID=62168646

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201611036900.7A Pending CN108090080A (en) 2016-11-22 2016-11-22 Method and system for replacing a parsing template, and crawling method

Country Status (1)

Country Link
CN (1) CN108090080A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110717131A (en) * 2018-06-27 2020-01-21 北京国双科技有限公司 Page revising monitoring method and related system

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101957816A (en) * 2009-07-13 2011-01-26 上海谐宇网络科技有限公司 Webpage metadata automatic extraction method and system based on multi-page comparison
CN103942335A (en) * 2014-05-07 2014-07-23 武汉大学 Construction method of uninterrupted crawler system oriented to web page structure change

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
QING____: "Zookeeper Watcher mechanism and asynchronous call principles", 《HTTPS://WWW.ITEYE.COM/BLOG/SHIFT-ALT-CTRL-1847320》 *
ZMYCOCO2: "Apache ZooKeeper Watcher mechanism source code explained", 《HTTPS://BLOG.CSDN.NET/MICHAELZHOU224/ARTICLE/DETAILS/51923478》 *
Ming Jing: "ZooKeeper official documentation: overview", 《HTTPS://WWW.CNBLOGS.COM/JSSTUDYJJ/P/5365059.HTML》 *

Similar Documents

Publication Publication Date Title
CN110781007B (en) Task processing method, device, server, client, system and storage medium
US10985997B2 (en) Systems and methods for domain-driven design and execution of metamodels
US9578082B2 (en) Methods for dynamically generating an application interface for a modeled entity and devices thereof
KR101055030B1 (en) Devices, Systems, and Methods for Data Server Managed Web Services Runtime
US9158532B2 (en) Methods for managing applications using semantic modeling and tagging and devices thereof
US7395320B2 (en) Providing automatic policy enforcement in a multi-computer service application
US8631071B2 (en) Recognition of and support for multiple versions of an enterprise canonical message model
JP6164440B2 (en) Application upgrade method and apparatus
US20190052531A1 (en) Systems and methods for service mapping
US20110196957A1 (en) Real-Time Policy Visualization by Configuration Item to Demonstrate Real-Time and Historical Interaction of Policies
US7937716B2 (en) Managing collections of appliances
US20090063650A1 (en) Managing Collections of Appliances
US20100235493A1 (en) Extendable distributed network management system and method
US20140207932A1 (en) Information processing system and information processing method
US20190286500A1 (en) Systems and method for event parsing
US20150081625A1 (en) Service polling method and system, and computer storage medium
US20220182851A1 (en) Communication Method and Apparatus for Plurality of Administrative Domains
CN110457555A (en) Collecting method, device and computer equipment, storage medium based on Docker
CN108090080A (en) Method and system for replacing a parsing template, and crawling method
CN110188258B (en) Method and device for acquiring external data by using crawler
Baird et al. Self-adapting workflow reconfiguration
CN104298750B (en) Renewal processing method and processing device for real-time system communication
CN116016702B (en) Application observable data acquisition processing method, device and medium
Gao On Provisioning and configuring ensembles of IoT, network functions and cloud resources
Wennerström Active Assurance in Kubernetes

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20180529