CN101315629B - Downloading method and system for web page dynamic contents - Google Patents

Downloading method and system for web page dynamic contents Download PDF

Info

Publication number
CN101315629B
CN101315629B CN2007101058955A CN200710105895A CN101315629B CN 101315629 B CN101315629 B CN 101315629B CN 2007101058955 A CN2007101058955 A CN 2007101058955A CN 200710105895 A CN200710105895 A CN 200710105895A CN 101315629 B CN101315629 B CN 101315629B
Authority
CN
China
Prior art keywords
page
retrieval
result
download
data
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN2007101058955A
Other languages
Chinese (zh)
Other versions
CN101315629A (en
Inventor
王全喜
Original Assignee
潘晓梅
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 潘晓梅 filed Critical 潘晓梅
Priority to CN2007101058955A priority Critical patent/CN101315629B/en
Publication of CN101315629A publication Critical patent/CN101315629A/en
Application granted granted Critical
Publication of CN101315629B publication Critical patent/CN101315629B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

The invention discloses a method for the downloading of the dynamic content of a web page, and a system thereof, wherein, the method comprises the steps that: a searching result in accordance with searching conditions is obtained by a searching page, a downloading task list is produced according to the obtained searching result, and each downloading task is corresponding to one searching result record; the page linked to the corresponding searching result record in the downloading task list is linked, and corresponding content data is extracted from positions corresponding to the corresponding page elements in the page according to the corresponding relationship between the preset page elements in the page and the content data to be extracted; the extracted content data is stored in a file or a database. The adoption of the invention can realize the automatic downloading of the dynamic content of the web page and store the content.

Description

The method for down loading of web page dynamic contents and system thereof
Technical field
The present invention relates to the computing machine and the internet communication technology in the communications field, relate in particular to a kind of method for down loading and download system thereof of web page dynamic contents.
Background technology
Internet site has comprised great mass of data, wants to find out useful data from the great mass of data that various websites provide, and just need retrieve.A lot of websites provide the data-searching function, can dynamically generate result for retrieval according to the condition of user's input, result for retrieval normally shows with the form of tabulation, and the user can open the detailed content of checking concrete certain part data logging according to the link of result for retrieval tabulation.
The retrieval mode of checking of this internet data is well suited for the user search inspection information, but there is following limitation in this mode, for example:
This method needs the user to keep local computer to connect the state of internet, and computing machine can't use in off line, only could continue to retrieve after re-entering search key reconnecting to the internet and be connected to searching page.Because search condition can not be reused, therefore each retrieval all will be imported search key, and is more loaded down with trivial details.Do not preserve for the result for retrieval that retrieves, and can only click by hand for the data that need further check detailed content check, the manual preservation.Thereby, be unfavorable for carrying out the finishing analysis of data for the data that retrieves tissue effectively, also be unfavorable for data is applied in other documents.
Therefore, above-mentioned data-searching and method for down loading only are suitable for the situation of a small amount of data of retrieval, are not suitable for searching the situation of great mass of data.
There is the number of site reflect tool can be at present from a collection of URL (Uniform Resource Locator, URL(uniform resource locator)) beginning, picture in the downloading page and the page, sound, animation, then according to each link in the page, repeat above step, the full content of recurrence like this, download site, but these softwares are very effective with respect to static content, just powerless for dynamic content, and do not have selectivity and controllability for downloading which content, promptly mostly is the retrieval and the download of " blindly ".
Also some website download tool can be downloaded the data on the internet and be saved in this locality at present, and set up local document data base, but this system can only download pre-set website data, can not be set by the user the dynamic website that needs retrieval to download data, therefore using has certain limitation.
Summary of the invention
The invention provides a kind of method for down loading of web page dynamic contents, with realization the dynamic content of webpage is downloaded and stored, this method comprises the steps:
Obtain the result for retrieval that meets search condition from searching page, generate download task list, the corresponding result for retrieval record of each downloading task according to the result for retrieval that gets access to;
Be linked to the page that the result for retrieval record in the described download task list is linked, and according to the corresponding relation of the page elements in predefined this page with the content-data that will extract, the content corresponding data are extracted in the pairing position of corresponding page elements from this page;
The described content-data that extracts is saved in file or database.
The present invention also provides a kind of download system of web page dynamic contents, and this system comprises: task management module and data management module, and at least one download management module;
Described task management module is used for obtaining the result for retrieval that meets search condition from searching page, generates download task list according to the result for retrieval that gets access to, the corresponding result for retrieval record of each downloading task;
Described download management module, be used for being linked to the page that the result for retrieval record of the download task list that described task management module generates is linked, and according to the corresponding relation of the page elements in predefined this page with the content-data that will extract, the content corresponding data are extracted in the pairing position of corresponding page elements from this page;
Described data management module is used for the described content-data that described download management module is extracted is saved in file or database.
Beneficial effect of the present invention is as follows:
The present invention carries out the downloading task in the download task list by generate download task list according to result for retrieval, obtains with the result for retrieval content data corresponding and preserves, thereby realized the dynamic content of webpage is downloaded and stored.
Description of drawings
Fig. 1 is the running environment synoptic diagram of the embodiment of the invention;
The structural representation of the web page dynamic contents download system that Fig. 2 provides for the embodiment of the invention.
Embodiment
Below in conjunction with accompanying drawing the embodiment of the invention is described in detail.
The embodiment of the invention provides a kind of web page dynamic contents method for down loading and web page dynamic contents download system, and this method and system operates under as shown in Figure 1 C/S (Client/Server, the client/server) framework.This framework comprises database server, file server, and a plurality of client computer.Client computer can be connected with Internet by equipment for surfing the net, and client computer is connected with file server with database server by internal network.The user can retrieve and download the web page dynamic contents on the Internet from client computer, and the metadata of downloading data can be stored in database server, with the document data saving of downloading data in file server.
Referring to Fig. 2, the structural representation of the web page dynamic contents download system that provides for the embodiment of the invention, this system comprises task management module, download management module and data management module.Wherein, download management module can have a plurality of.
The task management module is used to be provided with downloading task, comprising: the searching page address is set, obtains search condition and preserves, preserve result for retrieval tabulate database or file, generate download task list.This task management module comprises:
Search address is provided with submodule, is used to be provided with the chained address of the searching page that carries out data-searching, and this searching page is the page that can import search condition for the user.This submodule provides search address that window is set, and can obtain the page address that the user imports in this window, or obtains the address that the user chooses from alternative searching page address that this window provides.Search address is provided with submodule after the searching page address is finished in user's setting, is linked to this address, provides the search condition inputting interface to the user.
Search condition is obtained submodule, is used for obtaining the search key that searching page that the user returns in the website imports and preserves;
Result for retrieval obtains submodule, is used for obtaining the search condition that submodule gets access to according to search condition, obtains the result for retrieval tabulation corresponding with the search key of user's input;
Task list generates submodule, is used for that result for retrieval is obtained the result for retrieval tabulation that subelement gets access to and preserves, and can be saved in the database, also can be saved in the file, generates download task list;
This task management module can also comprise the task scheduling submodule, be used for dispatching the downloading task of download task list, setting according to the user, for downloading task is distributed rational download thread, as, specify one or more download management module for downloading task, make when carrying out this downloading task, can download simultaneously by a plurality of download modules of appointment, thereby can improve download efficiency.
Download management module is used for the downloading task according to the setting of task management module, downloads data from specified web.This download management module comprises:
Download implementation sub-module, be used to carry out downloading task, generate the request message that sends to corresponding website, acquisition request detailed content according to the downloading task in the task list;
The contents extraction submodule is used to receive the detailed content that corresponding website is returned, and extracts the content corresponding data from the page source file that returns.
The file maintenance that data management module is used for download management module is downloaded is in database server and file server.This data management module comprises:
The metadata processing sub, the metadata information that is used for relevant these detailed content data that download management module is extracted is saved in the metadatabase of document data base;
The file data processing sub, the detailed content data (as text or image) that are used for download management module is extracted are saved in this locality or file server, and the corresponding relation of the metadata record in foundation and the document data base.
Adopt above-mentioned dynamic web content download system that web page dynamic contents is carried out process of downloading, comprise downloading task generation phase and download execute phase.
The generation of downloading task is mainly finished by the task management module.
The task management module obtains to meet the result for retrieval tabulation of search condition according to the search condition of appointment, generates download task list.The task management module both can be provided with new search condition and retrieve, and also can select to reuse the search condition of having preserved and retrieve.
Search address in the task management module is provided with submodule and for the user provides retrieval the interface is set, and the user can be provided with the layout setting search condition by retrieval, to obtain corresponding result for retrieval tabulation.
Client end interface can comprise a plurality of windows, and for example: what the searching page address was set is provided with window, search condition input window, result for retrieval list display window etc.
If retrieve by new search condition is set, the user is at first in being provided with of searching page address is provided with searching page in the window URL address, can in input frame, import the URL address, also can select URL address or web site name in the address list that sets in advance, wherein web site name is corresponding with the searching page URL address of this website.After the searching page address was set, search address was provided with submodule and preserves this address setting, and can be connected to the searching page of this address correspondence automatically, and this searching page is shown in the search condition input window, imported search key for the user.Then, search condition is obtained submodule acquisition user and import search key and preservation in searching page.After the user confirmed to submit to, result for retrieval obtained submodule and submits retrieval request by Internet to the Website server corresponding with this searching page, carried the search key of the user's input that gets access to.Website server is handled this request, and the result for retrieval tabulation that will meet search condition returns to result for retrieval and obtain submodule, and result for retrieval obtains submodule the result for retrieval tabulation is shown in the result for retrieval list display window.Downloading task generates submodule and result for retrieval is obtained the result for retrieval tabulation that submodule gets access to is saved in file or the database, and generates download task list in view of the above.
In searching page, all search conditions all are placed in the FORM list, and each search condition all comprises at least one variable.After client was connected to searching page, search condition is obtained submodule also will obtain search condition content and variables corresponding name in the FORM list.For example: the search condition in patent retrieval page FORM list comprises retrieval of content such as the applying date, application number, applicant, application documents title, and corresponding variable is bright to be AppDate, AppNum, Applicant, PatentName etc.
If selecting existing search condition retrieves, then result for retrieval obtains the search condition (comprising searching page address and search key) of submodule reading and saving from file or database, this search condition is converted to the discernible connection request of dynamic website, and, obtain the result for retrieval that meets search condition to corresponding website transmission request.
Result for retrieval obtains submodule after receiving the result for retrieval tabulation of returning, and the result for retrieval tabulation is preserved.The result for retrieval tabulation can be saved in file, also can be saved in database, and generate download task list.In the present embodiment, the result for retrieval tabulation is saved in the database, generates download task list.
If there has been task list in advance, can directly be appended to result for retrieval in the task list this moment.If task list is not set in advance, the corresponding relation creation task tabulation of variable name that can acquire before this this moment according to the task management module and corresponding search condition content.That is, with the field name of variable name, with the field value of search condition content as task list as task list.For example, with above-mentioned patent retrieval is example, in patent database, with applying date of correspondence, application number, applicant, application documents title etc. as field name, with the value of variablees such as AppDate, AppNum, Applicator, PatentName field value as task list, the creation task tabulation.Then, each the result for retrieval clauses and subclauses in the result for retrieval tabulation as a record in the task list, are saved in the task list.
For fear of repeating the preservation task, can specify one or more fields to compare, only preserve the different record of field value.For example, in above-mentioned patent retrieval example, because application number is unique, therefore when preserving result for retrieval, whether the application number of existing record is identical in the application number that can relatively need the result for retrieval preserved earlier and the task list, if difference, then preserve this result for retrieval, otherwise do not preserve.
Result for retrieval is being saved in the process of task list, downloading task generates the structure of submodule according to result for retrieval tabulation in the result for retrieval original list, determine html element in the page plain with task list in the corresponding relation of field, with the content stores in the result for retrieval tabulation in the field of the task list of correspondence.With above-mentioned patent retrieval is example, if provided contents such as the applying date, application number, applicant and application documents title in the result for retrieval tabulation, then task generates the structure of submodule according to this result for retrieval tabulation, determine the html element element (in HTML, show as corresponding<TD 〉) of correspondences such as the applying date, application number, and the variate-values such as AppDate, AppNum of this element correspondence are write fields such as applying date in the task list, application number.
If the result for retrieval tabulation is divided into multipage and shows, then result for retrieval obtains submodule identifies down one page from the source code of result for retrieval original list chained address URL, and jump to down one page, downloading task generation submodule records the result for retrieval of this page demonstration in the task list, up to all result for retrieval are recorded in the task list.
All bar result for retrieval in the result for retrieval tabulation all can be kept in the task list, also can only preserve the result for retrieval clauses and subclauses of appointment or the result for retrieval clauses and subclauses of specified quantity.
When downloading task generates submodule and generates task list, an identification field can be set in every record, be used to indicate the detailed content of this record whether to download and finish.Like this, when starting downloading task, the download management module of client can judge that whether corresponding record finish download, is designated uncompleted record thereby only download those by this identification field at every turn, thereby saved download time, reduced taking of Internet resources.
For the download task list that generates, task scheduling submodule in the task management module can be dispatched the downloading task in the tabulation, setting according to the user distributes rational download thread, as, for downloading task is specified one or more download management module, when this task of download, can carry out this downloading task simultaneously like this, to improve download efficiency by a plurality of download modules of appointment.
The execution of downloading task is mainly finished by download management module.
The download management module of client is according to the download task list that is generated by the task management module, is connected to the download address of appointment and downloads the detailed content of corresponding data.
Download implementation sub-module the download start button can be provided, when the user clicked this downloads start button, the download implementation sub-module is the pairing detailed content of record in the download task list one by one.
When the download implementation sub-module is downloaded the pairing detailed content of the record in the task list, at first according to setting, convert the relevant information of single record to Website server discernible solicited message, then, the address URL of checking webpage according to the detailed content of setting, the discernible solicited message in website is sent to the website ask, obtain the request results page that returns.For example, in the example of above-mentioned patent retrieval, the download implementation sub-module can be set application number in the application number field and detailed content are checked that web page address URL is assembled into request message, sends to corresponding website, to obtain corresponding detailed content.
When downloading implementation sub-module execution downloading task, can read the identification field values that whether this task of sign has been finished in this downloading task record, also carry out, then carry out this task, otherwise will not carry out if judge this downloading task.After a complete downloading task, download implementation sub-module this task flagging is finished for downloading.
Downloading implementation sub-module can also occur repeating automatically to connect when wrong downloading in network download, up to downloading successfully, or reaches predefined download time, downloads successfully to guarantee all data.
After download extracting submodule and receiving the detailed content page that Website server returns, according to the setting of checking the page in detail, the detailed content page source file that returns is analyzed, from the page source file that returns, extract data, be submitted to data management module after the different pieces of information of diverse location in the source file is extracted and preserve.Can be according to setting in advance, the data definite which position from the page that returns extracts are as metadata, and the data that extract from which position are as file data.For example, in the example of above-mentioned patent retrieval, because the form of the patent detailed content page of a patent retrieval website is relatively more fixing, such as comprising statutory status and explanatory memorandum, therefore can set the corresponding relation of html element element and corresponding patent detailed content at certain patent retrieval website, according to this corresponding relation, can be from the plain content corresponding of extracting of corresponding html element, as statutory status and explanatory memorandum, and with statutory status as metadata, with explanatory memorandum as file data.
Data management module after receiving the detailed content of download management module download is preserved it.Data management module can set in advance document data base, to preserve the data of downloading, document data base can comprise the metadatabase of preserving metadata and the document data bank of preserving file, and metadatabase can be positioned at database server, and document data bank can be positioned at file server.The metadata processing sub receives the data that can be used for this detailed content of index that download management module is downloaded, and is saved in the respective field in the metadatabase.The file data processing sub receives the detailed content data that download management module is downloaded, and is saved in the document data bank, simultaneously the corresponding relation of respective record in foundation and the metadatabase.Like this, when when metadatabase is inquired about corresponding data, data management module can obtain corresponding detailed content according to this corresponding relation from document data bank.
In sum, the present invention is by the download system of dynamic web content, can be at different dynamic data query webpage, set the different content that needs download, and downloaded contents stored in file or the database, by all data that need of the disposable download of network, and no longer download the data of having preserved, save the time that data is downloaded, reduced taking of Internet resources.Download when getting nowhere, the automatic repeated downloads of system's meeting reaches the number of times that the user sets until all data download successes or download time, has guaranteed the download of data as far as possible.
Obviously, those skilled in the art can carry out various changes and modification to the present invention and not break away from the spirit and scope of the present invention.Like this, if of the present invention these are revised and modification belongs within the scope of claim of the present invention and equivalent technologies thereof, then the present invention also is intended to comprise these changes and modification interior.

Claims (11)

1. a web page dynamic contents method for down loading is characterized in that, comprises the steps:
Obtain the result for retrieval that meets search condition from searching page, generate download task list, the corresponding result for retrieval record of each downloading task according to the result for retrieval that gets access to;
Be linked to the page that the result for retrieval record in the described download task list is linked, and according to the corresponding relation of the page elements in predefined this page with the content-data that will extract, the content corresponding data are extracted in the pairing position of corresponding page elements from this page;
The described content-data that extracts is saved in file or database.
2. the method for claim 1 is characterized in that, obtains the process of the result for retrieval that meets search condition from searching page, comprises step:
According to the searching page address and the search condition information of preserving, send a request message to the Website server of described searching page address correspondence, carry described search condition information, acquisition request meets the result for retrieval of described search condition.
3. the method for claim 1 is characterized in that, the process according to the result for retrieval generation download task list that gets access to comprises step:
To the corresponding relation of the information in page elements in the result for retrieval page source code and the result for retrieval record, obtain each the result for retrieval recorded information in the described result for retrieval page according in advance;
The result for retrieval recorded information that gets access to is saved in respective field in the download task list.
4. method as claimed in claim 3 is characterized in that, further comprises step:
Obtain the address of following one page retrieval result page face that the described result for retrieval page linked;
Be linked to the result for retrieval page of described address correspondence, and with the result for retrieval recorded and stored in this page in described download task list.
5. as claim 3 or 4 described methods, it is characterized in that when the result for retrieval recorded and stored is arrived described download task list, judge whether existed in the described download task list and the identical record of this result for retrieval record, if exist, then do not preserve this result for retrieval record.
6. the method for claim 1, it is characterized in that, before the page that the corresponding retrieval results record is linked in being linked to described download task list, also comprise step: judge whether described result for retrieval writes down pairing downloading task complete, if it is not complete, then carry out this downloading task, otherwise will not carry out;
After being linked to the page that described result for retrieval record linked and extracting content data corresponding, also comprise step: with the corresponding download task identification for finishing.
7. a web page dynamic contents download system is characterized in that, comprising: task management module and data management module, and at least one download management module;
Described task management module is used for obtaining the result for retrieval that meets search condition from searching page, generates download task list according to the result for retrieval that gets access to, the corresponding result for retrieval record of each downloading task;
Described download management module, be used for being linked to the page that the result for retrieval record of the download task list that described task management module generates is linked, and according to the corresponding relation of the page elements in predefined this page with the content-data that will extract, the content corresponding data are extracted in the pairing position of corresponding page elements from this page;
Described data management module is used for the described content-data that described download management module is extracted is saved in file or database.
8. web page dynamic contents download system as claimed in claim 7 is characterized in that, described task management module comprises:
Search condition is obtained submodule, is used to obtain the searching page address of user's appointment and the search key of input, and preserves;
Result for retrieval obtains submodule, is used for obtaining searching page address and the search condition that submodule is preserved according to described search condition, obtains corresponding result for retrieval tabulation;
Task list generates submodule, is used for obtaining the result for retrieval tabulation generation download task list that submodule gets access to according to described result for retrieval.
9. web page dynamic contents download system as claimed in claim 8 is characterized in that, described task management module also comprises:
The task scheduling submodule is used to the downloading task in the described download task list to distribute at least one download management module.
10. web page dynamic contents download system as claimed in claim 7 is characterized in that, described download management module comprises:
Download implementation sub-module, be used for downloading task, generate the request message that sends to corresponding website, the detailed content of acquisition request correspondence according to described task list;
The contents extraction submodule is used to receive the detailed content page that returns corresponding website, and according to the corresponding relation of the page elements in the predefined page source code with the content that will extract, extracts the content corresponding data from the respective page element position.
11. web page dynamic contents download system as claimed in claim 7 is characterized in that, described data management module comprises:
The metadata processing sub, the metadata information that is used for described download management module is downloaded the detailed content data that obtain is saved in the metadatabase;
The file data processing sub, the document data saving that is used for described download management module is downloaded the detailed content data that obtain is a file, and the corresponding relation of foundation and respective meta-data.
CN2007101058955A 2007-06-01 2007-06-01 Downloading method and system for web page dynamic contents Expired - Fee Related CN101315629B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2007101058955A CN101315629B (en) 2007-06-01 2007-06-01 Downloading method and system for web page dynamic contents

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2007101058955A CN101315629B (en) 2007-06-01 2007-06-01 Downloading method and system for web page dynamic contents

Publications (2)

Publication Number Publication Date
CN101315629A CN101315629A (en) 2008-12-03
CN101315629B true CN101315629B (en) 2010-11-17

Family

ID=40106641

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2007101058955A Expired - Fee Related CN101315629B (en) 2007-06-01 2007-06-01 Downloading method and system for web page dynamic contents

Country Status (1)

Country Link
CN (1) CN101315629B (en)

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101437028B (en) * 2008-12-26 2013-05-08 深圳市迅雷网络技术有限公司 Method, system and apparatus for generating multiple addresses
CN101925051B (en) * 2010-08-23 2015-08-12 中兴通讯股份有限公司 A kind of mobile terminal and method for down loading thereof
US9325804B2 (en) 2010-11-08 2016-04-26 Microsoft Technology Licensing, Llc Dynamic image result stitching
CN104834671B (en) * 2015-03-25 2019-12-17 中国科学院地理科学与资源研究所 parallel downloading method and device of document metadata
CN108920683A (en) * 2018-07-12 2018-11-30 郑州云海信息技术有限公司 A kind of method, apparatus and storage medium of cloud computing platform downloading external resource
CN114760521A (en) * 2022-06-16 2022-07-15 北京搜狐新动力信息技术有限公司 Video processing method and system

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1452095A (en) * 2002-04-13 2003-10-29 鸿富锦精密工业(深圳)有限公司 Automatic document down-load system and method
CN1780211A (en) * 2004-11-26 2006-05-31 英业达股份有限公司 Entertainment apparatus and method for downoloading multi-media data from target website

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1452095A (en) * 2002-04-13 2003-10-29 鸿富锦精密工业(深圳)有限公司 Automatic document down-load system and method
CN1780211A (en) * 2004-11-26 2006-05-31 英业达股份有限公司 Entertainment apparatus and method for downoloading multi-media data from target website

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
吴鹏飞等.基于结构与内容的网页主题信息提取研究.山东大学学报(理学版)第41卷 第3期.2006,41(3),论文第131页右栏第10行-第134页左栏第13行,表1.
吴鹏飞等.基于结构与内容的网页主题信息提取研究.山东大学学报(理学版)第41卷 第3期.2006,41(3),论文第131页右栏第10行-第134页左栏第13行,表1. *

Also Published As

Publication number Publication date
CN101315629A (en) 2008-12-03

Similar Documents

Publication Publication Date Title
CN107273409B (en) Network data acquisition, storage and processing method and system
CN101122921B (en) Method forming tree-shaped display structure based on ajax and html
CN101370024B (en) Distributed information collection method and system
CN101315629B (en) Downloading method and system for web page dynamic contents
CN104715064B (en) It is a kind of to realize the method and server that keyword is marked on webpage
US20120323881A1 (en) Interactive web crawler
CN103744853A (en) Method and device for providing web cache information in search engine
CN102768683B (en) A kind of searching method of pictorial information and searcher
CN102663062A (en) Method and device for processing invalid links in search result
CN102662703A (en) Method and device for loading application program plugins
CN102880607A (en) Dynamic network content grabbing method and dynamic network content crawler system
US8799274B2 (en) Topic map for navigation control
CN105138312A (en) Table generation method and apparatus
CN102073726A (en) Search engine system and structured data import method for search engine system
CN102982161A (en) Method and device for acquiring webpage information
CN106687949A (en) Search results for native applications
CN101441629A (en) Automatic acquiring method of non-structured web page information
CN102609412A (en) RSS (Really Simple Syndication)-based multi-thread graphic information synchronization crawling control method and system
CN103248524A (en) Flexible test technology based test data version control method, device and system
CN102982162A (en) System for acquiring webpage information
CN112612943A (en) Asynchronous processing framework-based data crawling method with automatic testing function
CN101751443A (en) Data searching and processing system as well as method
CN104361007B (en) The processing method of browser and its collection
CN105893640B (en) Favorite merging method and device
CN101145936B (en) A method and system for adding tags in Web pages

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20101117

Termination date: 20160601