CN103577482B - A kind of webpage collection method, device and browser - Google Patents

A kind of webpage collection method, device and browser Download PDF

Info

Publication number
CN103577482B
CN103577482B CN201210278582.0A CN201210278582A CN103577482B CN 103577482 B CN103577482 B CN 103577482B CN 201210278582 A CN201210278582 A CN 201210278582A CN 103577482 B CN103577482 B CN 103577482B
Authority
CN
China
Prior art keywords
web page
webpage
page contents
contents
interlinkage
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201210278582.0A
Other languages
Chinese (zh)
Other versions
CN103577482A (en
Inventor
刘刚
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tencent Technology Shenzhen Co Ltd
Original Assignee
Tencent Technology Shenzhen Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tencent Technology Shenzhen Co Ltd filed Critical Tencent Technology Shenzhen Co Ltd
Priority to CN201210278582.0A priority Critical patent/CN103577482B/en
Publication of CN103577482A publication Critical patent/CN103577482A/en
Application granted granted Critical
Publication of CN103577482B publication Critical patent/CN103577482B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/955Retrieval from the web using information identifiers, e.g. uniform resource locators [URL]
    • G06F16/9562Bookmark management

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

The applicable field of computer technology of the present invention, there is provided a kind of webpage collection method, device and browser, methods described comprise the steps:The instruction of collection webpage is received, obtains web page interlinkage corresponding to the webpage;Web page contents corresponding to being called the webpage capture server in cloud server group to capture the web page interlinkage according to the web page interlinkage;The high in the clouds storage server web page contents being saved in cloud server group.The present invention realizes the high in the clouds storage of collection web page contents, ensure that the permanently effective of collection webpage so that collection web page contents are not limited by time, access locations, extend the function of browser collection folder.

Description

A kind of webpage collection method, device and browser
Technical field
The invention belongs to field of computer technology, more particularly to a kind of webpage collection method, device and browser.
Background technology
Collection is a basic application in browser, for the website/web page interlinkage for needing often to access by user Be stored in local computer terminal, directly opened by clickthrough can corresponding to website/page), to access correspondingly Resource, but after the operating system to terminal is reinstalled, the local user data meeting in collection before Lose, Website page can not be opened again.In addition, the preservation of local computer terminal can not be also obtained on other terminals Web site url.
Also can to ensure to be linked on terminal or the other terminals after reinstalling operating system Open, network profile is suggested, so as to which the web site url of collection is saved on network.However, due to network linking There is the life cycle of oneself, after a while, migration or webpage due to corresponding Website server correspondingly link road The adjustment in footpath, corresponding resource can not also be had access to according to the web page interlinkage of collection, even if being able to access that, the corresponding page may Nor web page contents when collection, can not realize long-term collection, the preservation of web page contents.
The content of the invention
The purpose of the embodiment of the present invention is to provide a kind of webpage collection method, it is intended to solves because prior art can not carry For a kind of effective webpage collection method, the problem of leading to not to realize the long-term collection of web page contents, preserve.
The embodiment of the present invention is achieved in that a kind of webpage collection method, and methods described comprises the steps:
The instruction of collection webpage is received, obtains web page interlinkage corresponding to the webpage;
The webpage capture server in cloud server group is called to capture the web page interlinkage pair according to the web page interlinkage The web page contents answered;
The high in the clouds storage server web page contents being saved in cloud server group.
The another object of the embodiment of the present invention is to provide a kind of web page storage device, and described device includes:
Acquiring unit is linked, for receiving the instruction of collection webpage, obtains web page interlinkage corresponding to the webpage;
Capturing webpage contents unit, for calling the webpage capture service in cloud server group according to the web page interlinkage Device captures web page contents corresponding to the web page interlinkage;And
Web page contents storage unit, for the high in the clouds storage service being saved in the web page contents in cloud server group Device.
The another object of the embodiment of the present invention is to provide a kind of browser, and the browser includes above-mentioned web page storage Device.
The embodiment of the present invention collects the instruction of webpage by receiving, and web page interlinkage corresponding to webpage is obtained, according to webpage chain Web page contents corresponding to connecing the webpage capture server crawl web page interlinkage in calling cloud server group, web page contents are preserved To the high in the clouds storage server in cloud server group, the high in the clouds storage of collection web page contents is realized, ensure that collection webpage It is permanently effective so that collection web page contents do not limited by time, access locations, extend the function of browser collection folder.
Brief description of the drawings
Fig. 1 is the implementation process figure for the webpage collection method that the embodiment of the present invention one provides;
Fig. 2 is the implementation process figure for the webpage collection method that the embodiment of the present invention two provides;
Fig. 3 is the structure chart for the web page storage device that the embodiment of the present invention three provides;And
Fig. 4 is the structure chart for the web page storage device that the embodiment of the present invention four provides.
Embodiment
In order to make the purpose , technical scheme and advantage of the present invention be clearer, it is right below in conjunction with drawings and Examples The present invention is further elaborated.It should be appreciated that the specific embodiments described herein are merely illustrative of the present invention, and It is not used in the restriction present invention.
It is described in detail below in conjunction with specific implementation of the specific embodiment to the present invention:
Embodiment one:
Fig. 1 shows the implementation process for the webpage collection method that the embodiment of the present invention one provides, and details are as follows:
In step S101, the instruction of collection webpage is received, obtains web page interlinkage corresponding to webpage.
In embodiments of the present invention, collect and include corresponding web page interlinkage in the instruction of webpage, when receiving collection net During the instruction of page, wherein corresponding web page interlinkage is obtained.
In step s 102, the webpage capture server in cloud server group is called to capture webpage chain according to web page interlinkage Connect corresponding web page contents.
In embodiments of the present invention, in order to improve the grasp speed of web page contents, the webpage in cloud server group is realized The load balancing of server is captured, as illustratively, the webpage capture service in cloud server group is called according to web page interlinkage Corresponding to device crawl web page interlinkage during web page contents, first, the load of the webpage capture server in cloud server group is obtained Information, the webpage capture server for meeting preparatory condition is called according to the load information of webpage capture server and web page interlinkage Capture web page contents corresponding to web page interlinkage.Default loading condition can include the load of CPU, internal memory etc. less than preset value etc. Condition, and then choose in the webpage capture server for meeting default loading condition the webpage capture server of a light load Capture web page contents corresponding to web page interlinkage.Herein not limiting the present invention.
In step s 103, high in the clouds storage server web page contents being saved in cloud server group.
In embodiments of the present invention, after web page interlinkage is got, called according to web page interlinkage in cloud server group Web page contents are saved in the high in the clouds in cloud server group by web page contents corresponding to the crawl web page interlinkage of webpage capture server Storage server, the high in the clouds storage of collection web page contents is realized, ensure that the permanently effective of collection webpage so that collection webpage Content is not limited by time, access locations, extends the function of browser collection folder.
Embodiment two:
Fig. 2 shows the implementation process for the webpage collection method that the embodiment of the present invention two provides, and details are as follows:
In step s 201, the instruction of collection webpage is received, obtains web page interlinkage corresponding to webpage.
In embodiments of the present invention, collect and include corresponding web page interlinkage in the instruction of webpage, when receiving collection net During the instruction of page, wherein corresponding web page interlinkage is obtained.
In step S202, the webpage capture server in cloud server group is called to capture webpage chain according to web page interlinkage Connect corresponding web page contents.
In embodiments of the present invention, in order to improve the grasp speed of web page contents, the webpage in cloud server group is realized The load balancing of server is captured, as illustratively, the webpage capture service in cloud server group is called according to web page interlinkage Corresponding to device crawl web page interlinkage during web page contents, first, the load of the webpage capture server in cloud server group is obtained Information, the webpage capture server for meeting preparatory condition is called according to the load information of webpage capture server and web page interlinkage Capture web page contents corresponding to web page interlinkage.Wherein, preparatory condition can be less than a preset value including the load of CPU, internal memory etc. Etc. condition, herein not limiting the present invention.
In step S203, the web page contents grabbed are polymerized to the web page files of a preset format
In embodiments of the present invention, because web page contents include multiple elements, such as image, text, audio etc., in order to carry The storage of high web page contents, efficiency of transmission, before the high in the clouds storage server being saved in web page contents in cloud server group, The web page contents grabbed can be polymerized to the web page files of a preset format.The preset format can be mht forms or According to Request for Comment(Request For Comments)The file of RFC2557 documents generation.
In step S204, the high in the clouds storage server that web page files is saved in cloud server group.
In step S205, characteristic information and web page interlinkage are saved in default database.
In embodiments of the present invention, in order to improve the quick-searching of collection webpage and classification, it is saved in by web page contents Before high in the clouds storage server in cloud server group, characteristic information corresponding to the web page contents of input, this feature information are received For characteristic value corresponding to default characteristic item, this feature item defines the feature of collection webpage, for example, characteristic item can include webpage Content topic, purposes, field etc..Characteristic information can be extracted by default mode from web page contents, for example, crucial Word extracting method or user-defined characteristic information.
In embodiments of the present invention, characteristic information and corresponding web page interlinkage should also be associated, are saved in default Database, with facilitate web page storage task is managed and improved collection web page contents retrieval rate.So, when connecing When receiving the instruction of user's collection webpage, retrieved in the database, if retrieve the same web page interlinkage, i.e., no longer The webpage of request is collected, prompts user's webpage to collect.
In step S206, the web page interlinkage in default database is detected according to the default time cycle.
In step S207, when webpage has updated corresponding to the web page interlinkage of detection, output prompt message with prompt whether Collect the renewal of web page contents.
In embodiments of the present invention, a time cycle is preset, the web page interlinkage in default database is detected, Whether the webpage corresponding to web page interlinkage for judging wherein to preserve is collected in webpage corresponding to web page interlinkage is updated afterwards, When webpage has updated corresponding to the web page interlinkage of detection, then prompt message is exported to prompt whether to carry out to collect web page contents more Newly, so as to provide collection webpage automatic monitoring.
In embodiments of the present invention, if terminal user request is by by browser by locally downloading computer The web page contents of system are collected, then can send web page contents uploading instructions, carry out the upload of web page contents, so as to by net Page content is saved in the high in the clouds storage server in cloud server group.
In embodiments of the present invention, further, when the high in the clouds storage being saved in web page contents in cloud server group , can be directly from the high in the clouds in cloud server group if terminal user clicks on the web page interlinkage of collection after server Storage server downloads web page contents corresponding to the webpage of collection, so as to ensure that the permanent effective of the web page interlinkage of collection, makes The access for the web page contents that must be collected is not limited by access time, access locations.
Can be with one of ordinary skill in the art will appreciate that realizing that all or part of step in above-described embodiment method is The hardware of correlation is instructed to complete by program, described program can be stored in a computer read/write memory medium, Described storage medium, such as ROM/RAM, disk, CD.
Embodiment three:
Fig. 3 shows the structure for the web page storage device that the embodiment of the present invention three provides, and for convenience of description, illustrate only The part related to the embodiment of the present invention, including:
Acquiring unit 31 is linked, for receiving the instruction of collection webpage, obtains web page interlinkage corresponding to webpage.
Capturing webpage contents unit 32, for calling the webpage capture server in cloud server group according to web page interlinkage Capture web page contents corresponding to web page interlinkage.
Web page contents storage unit 33, for the high in the clouds storage service being saved in web page contents in cloud server group Device.
In embodiments of the present invention, in order to improve the grasp speed of web page contents, the webpage in cloud server group is realized The load balancing of server is captured, as illustratively, the webpage capture service in cloud server group is called according to web page interlinkage Corresponding to device crawl web page interlinkage during web page contents, first, the load of the webpage capture server in cloud server group is obtained Information, the webpage capture server for meeting preparatory condition is called according to the load information of webpage capture server and web page interlinkage Capture web page contents corresponding to web page interlinkage.Wherein, preparatory condition can be less than a preset value including the load of CPU, internal memory etc. Etc. condition, herein not limiting the present invention.Therefore, capturing webpage contents unit 32 can include:
Load information obtains subelement 321, and the load for obtaining the webpage capture server in cloud server group is believed Breath;And
Capturing webpage contents subelement 322, meet preparatory condition for being called according to load information and web page interlinkage Web page contents corresponding to the crawl web page interlinkage of webpage capture server.
In embodiments of the present invention, after web page interlinkage is got, called according to web page interlinkage in cloud server group Web page contents are saved in the high in the clouds in cloud server group by web page contents corresponding to the crawl web page interlinkage of webpage capture server Storage server, the high in the clouds storage of collection web page contents is realized, ensure that the permanently effective of collection webpage so that collection webpage Content is not limited by time, access locations, extends the function of browser collection folder.
Example IV:
Fig. 4 shows the structure for the web page storage device that the embodiment of the present invention four provides, and for convenience of description, illustrate only The part related to the embodiment of the present invention, including:
Acquiring unit 41 is linked, for receiving the instruction of collection webpage, obtains web page interlinkage corresponding to the webpage.
Capturing webpage contents unit 42, for calling the webpage capture server in cloud server group according to web page interlinkage Capture web page contents corresponding to web page interlinkage.
Content-aggregated unit 43, for the web page contents grabbed to be polymerized to the web page files of a preset format.
In embodiments of the present invention, because web page contents include multiple elements, such as image, text, audio etc., for side Just storage, the transmission of web page contents, can be with before the high in the clouds storage server being saved in web page contents in cloud server group The web page contents grabbed are polymerized to the web page files of a preset format.The preset format can be mht forms or according to Request for Comment(Request For Comments)The file of RFC2557 documents generation.
Web page contents storage unit 44, for the high in the clouds storage service being saved in web page contents in cloud server group Device.
In embodiments of the present invention, when the web page contents grabbed are polymerized to a preset format by content-aggregated unit 43 Web page files when, web page contents storage unit 44 can include web page contents and preserve subelement 441, for by web page files The high in the clouds storage server being saved in cloud server group.
Link detection unit 45, for being examined according to the default time cycle to the web page interlinkage in default database Survey.
Renewal prompting output unit 46, has updated for webpage corresponding to the web page interlinkage when detection, has exported prompt message Whether carry out collecting the renewal of web page contents with prompting.
In embodiments of the present invention, a time cycle can also be preset, the web page interlinkage in default database is entered Row detection, judge webpage corresponding to the web page interlinkage that wherein preserves webpage corresponding to web page interlinkage collected afterwards whether by Updated, when webpage has updated corresponding to the web page interlinkage of detection, whether output prompt message is to prompt to carry out in collection webpage The renewal of appearance, so as to provide the automatic monitoring of collection webpage.
Characteristic information receiving unit 47, for receiving characteristic information corresponding to web page contents.
In embodiments of the present invention, in order to improve the quick-searching of collection webpage and classification, it is saved in by web page contents Before high in the clouds storage server in cloud server group, characteristic information corresponding to the web page contents of input, this feature information are received For characteristic value corresponding to default characteristic item, this feature item defines the feature of collection webpage, for example, characteristic item can include net Page content topic, purposes, field etc..Characteristic information can be extracted by default mode from web page contents, for example, closing Key word extracting method or User Defined.
Data saving unit 48, for characteristic information and web page interlinkage to be saved in into default database.
In embodiments of the present invention, characteristic information and corresponding web page interlinkage should also be associated, are saved in default Database, with facilitate web page storage task is managed and improved collection web page contents retrieval rate.So, when connecing When receiving the instruction of user's collection webpage, retrieved in the database, if retrieve the same web page interlinkage, i.e., no longer The webpage of request is collected, prompts user's webpage to collect.
Webpage uploading unit 49, for receiving web page contents uploading instructions, web page contents are saved in cloud server group In high in the clouds storage server.
In embodiments of the present invention, if terminal user request is by by browser by locally downloading computer The web page contents of system are collected, then can send web page contents uploading instructions, carry out the upload of web page contents, so as to by net Page content is saved in the high in the clouds storage server in cloud server group.
In embodiments of the present invention, a kind of browser is additionally provided, the browser includes implementing the net described in three or four Page holding device fo so that terminal user can be collected by browser using the high in the clouds of webpage.
In embodiments of the present invention, after web page interlinkage is got, called according to web page interlinkage in cloud server group Web page contents are saved in the high in the clouds in cloud server group by web page contents corresponding to the crawl web page interlinkage of webpage capture server Storage server, the high in the clouds storage of collection web page contents is realized, ensure that the permanently effective of collection webpage so that collection webpage Content is not limited by time, access locations, extends the function of browser collection folder, meanwhile, if terminal user please Asking will be collected the web page contents of locally downloading computer system by browser, then can be sent on web page contents Teletype command, the upload of web page contents is carried out, so as to which web page contents to be saved in the high in the clouds storage server in cloud server group, To realize that the high in the clouds of local IP access web page contents is collected.
The foregoing is merely illustrative of the preferred embodiments of the present invention, is not intended to limit the invention, all essences in the present invention All any modification, equivalent and improvement made within refreshing and principle etc., should be included in the scope of the protection.

Claims (15)

1. a kind of webpage collection method, it is characterised in that methods described comprises the steps:
The instruction of the collection webpage of user's triggering is received, obtains web page interlinkage corresponding to the webpage;
Corresponding to being called the webpage capture server in cloud server group to capture the web page interlinkage according to the web page interlinkage Web page contents;
The high in the clouds storage server web page contents being saved in cloud server group;Wherein, storage server beyond the clouds The duration for storing the web page contents collects the duration of the webpage corresponding to the web page contents not less than user, to support user Collected webpage is opened based on the web page contents stored in the storage server of high in the clouds.
2. the method as described in claim 1, it is characterised in that be saved in by the web page contents in cloud server group Before the step of high in the clouds storage server, methods described also includes:
Characteristic information corresponding to the web page contents is received, the characteristic information is characteristic value corresponding to default characteristic item.
3. method as claimed in claim 2, it is characterised in that methods described also includes:
The characteristic information and the web page interlinkage are saved in default database.
4. method as claimed in claim 3, it is characterised in that methods described also includes:
The web page interlinkage in the default database is detected according to the default time cycle;
When webpage corresponding to the web page interlinkage of the detection has updated, whether output prompt message is to prompt to carry out in collection webpage The renewal of appearance.
5. the method as described in claim 1, it is characterised in that the net in cloud server group is called according to the web page interlinkage The step of page crawl server captures web page contents corresponding to the web page interlinkage includes:
Obtain the load information of the webpage capture server in cloud server group;
The webpage capture server for meeting default loading condition is called to capture according to the load information and the web page interlinkage Web page contents corresponding to the web page interlinkage.
6. the method as described in claim 1, it is characterised in that the net in cloud server group is called according to the web page interlinkage After the step of page crawl server captures web page contents corresponding to the web page interlinkage, the web page contents are saved in high in the clouds Before the step of high in the clouds storage server in server zone, methods described also includes:
The web page contents grabbed are polymerized to the web page files of preset format;
It is specially by the step of high in the clouds storage server that the web page contents are saved in cloud server group:
The high in the clouds storage server web page files being saved in cloud server group.
7. the method as described in claim 1, it is characterised in that methods described also includes:
Receive web page contents uploading instructions, the high in the clouds storage server web page contents being saved in cloud server group.
8. a kind of web page storage device, it is characterised in that described device includes:
Acquiring unit is linked, the instruction of the collection webpage for receiving user's triggering, obtains web page interlinkage corresponding to the webpage;
Capturing webpage contents unit, for calling the webpage capture server in cloud server group to grab according to the web page interlinkage Take web page contents corresponding to the web page interlinkage;And
Web page contents storage unit, for the high in the clouds storage server being saved in the web page contents in cloud server group; Wherein, storage server stores the duration of the web page contents not less than corresponding to user's collection web page contents beyond the clouds The duration of webpage, to support user to open collected webpage based on the web page contents stored in the storage server of high in the clouds.
9. device as claimed in claim 8, it is characterised in that described device also includes:
Characteristic information receiving unit, for receiving characteristic information corresponding to the web page contents, the characteristic information is default spy Levy characteristic value corresponding to item.
10. device as claimed in claim 9, it is characterised in that described device also includes:
Data saving unit, for the characteristic information and web page interlinkage to be saved in into default database.
11. device as claimed in claim 10, it is characterised in that described device also includes:
Link detection unit, for being examined according to the default time cycle to the web page interlinkage in the default database Survey;And
Renewal prompting output unit, updated for webpage corresponding to the web page interlinkage when the detection, output prompt message with Whether prompting carries out collecting the renewal of web page contents.
12. device as claimed in claim 8, it is characterised in that the capturing webpage contents unit includes:
Load information obtains subelement, for obtaining the load information of the webpage capture server in cloud server group;And
Capturing webpage contents subelement, meet default load bar for being called according to the load information and the web page interlinkage The webpage capture server of part captures web page contents corresponding to the web page interlinkage.
13. device as claimed in claim 8, it is characterised in that described device also includes:
Content-aggregated unit, for the web page contents grabbed to be polymerized to the web page files of preset format;
The web page contents storage unit includes:
Web page contents preserve subelement, for the high in the clouds storage service being saved in the web page files in cloud server group Device.
14. device as claimed in claim 8, it is characterised in that described device also includes:
Webpage uploading unit, for receiving web page contents uploading instructions, the web page contents are saved in cloud server group High in the clouds storage server.
15. a kind of browser, it is characterised in that the browser includes the web page storage as described in claim 8 to 14 is any Device.
CN201210278582.0A 2012-08-07 2012-08-07 A kind of webpage collection method, device and browser Active CN103577482B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201210278582.0A CN103577482B (en) 2012-08-07 2012-08-07 A kind of webpage collection method, device and browser

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201210278582.0A CN103577482B (en) 2012-08-07 2012-08-07 A kind of webpage collection method, device and browser

Publications (2)

Publication Number Publication Date
CN103577482A CN103577482A (en) 2014-02-12
CN103577482B true CN103577482B (en) 2017-12-15

Family

ID=50049280

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201210278582.0A Active CN103577482B (en) 2012-08-07 2012-08-07 A kind of webpage collection method, device and browser

Country Status (1)

Country Link
CN (1) CN103577482B (en)

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104156397A (en) * 2014-07-16 2014-11-19 百度在线网络技术(北京)有限公司 Method and device for collecting pages
CN105468270A (en) * 2014-08-18 2016-04-06 腾讯科技(深圳)有限公司 Terminal application control method and device
CN104536993B (en) * 2014-12-10 2018-03-20 北京奇虎科技有限公司 Collect the processing method of webpage, collect the processing unit and client of webpage
CN105989116B (en) * 2015-02-12 2017-11-24 广东欧珀移动通信有限公司 A kind of collection of data method and device of collection
CN107193976B (en) * 2017-05-25 2024-03-29 北京小米移动软件有限公司 Information resource display method, device and computer readable storage medium
CN107229527B (en) * 2017-05-25 2022-03-01 北京小米移动软件有限公司 Information resource collection method and device and computer readable storage medium
CN107203630B (en) * 2017-05-31 2020-11-24 北京安云世纪科技有限公司 Application page collection method and device and corresponding mobile terminal
CN108959446A (en) * 2018-06-13 2018-12-07 佛山市车品匠汽车用品有限公司 A kind of Web browser method and system of mobile terminal
CN110213360B (en) * 2019-05-24 2021-06-15 维沃移动通信有限公司 Content storage method and terminal equipment thereof

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101674329A (en) * 2009-09-27 2010-03-17 卓望数码技术(深圳)有限公司 Internet access method and Internet access system
CN101887421A (en) * 2009-05-13 2010-11-17 北京博越世纪科技有限公司 Technology for converting unformatted data into formatted data in web website
CN102484653A (en) * 2009-08-31 2012-05-30 思科技术公司 Measuring attributes of client-server applications
CN102624910A (en) * 2012-03-15 2012-08-01 华为技术有限公司 Method, device and system for processing webpage content selected by user

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100114914A1 (en) * 2008-10-30 2010-05-06 International Business Machines Corporation Selective Home Page Manager

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101887421A (en) * 2009-05-13 2010-11-17 北京博越世纪科技有限公司 Technology for converting unformatted data into formatted data in web website
CN102484653A (en) * 2009-08-31 2012-05-30 思科技术公司 Measuring attributes of client-server applications
CN101674329A (en) * 2009-09-27 2010-03-17 卓望数码技术(深圳)有限公司 Internet access method and Internet access system
CN102624910A (en) * 2012-03-15 2012-08-01 华为技术有限公司 Method, device and system for processing webpage content selected by user

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
"基于云平台在线Web挖掘中计算资源动态平衡的研究与实现";安伦;《中国优秀硕士学位论文全文数据库 信息科技辑》;20120315;论文正文第10页第1段-第12页第7段、第13页第1段-第14页第3段、第15页第1段-第22页第4段、第33页第1段、第45页第1段-第51页第1段,附图2.1、2.2、3.1、3.2,表3.1-3.3、4.1 *

Also Published As

Publication number Publication date
CN103577482A (en) 2014-02-12

Similar Documents

Publication Publication Date Title
CN103577482B (en) A kind of webpage collection method, device and browser
US10652265B2 (en) Method and apparatus for network forensics compression and storage
CN103118007B (en) A kind of acquisition methods of user access activity and system
WO2015196907A1 (en) Search pushing method and device which mine user requirements
CN103970788A (en) Webpage-crawling-based crawler technology
CN106874778B (en) Intelligent terminal file acquisition and data recovery system and method based on android system
CN109688097A (en) Website protection method, website protective device, website safeguard and storage medium
CN108632219B (en) Website vulnerability detection method, detection server, system and storage medium
CN106656577B (en) The user behavior statistical method and intelligent router of a kind of APP and browser
CN108667770B (en) Website vulnerability testing method, server and system
CN106897141A (en) The processing method and processing device of information
WO2014000537A1 (en) System and method for finding phishing website
US11513812B2 (en) Targeted data extraction system and method
CN101354721A (en) Data processing device capable of performing data transmission by a predetermined access method
CN106776693A (en) A kind of website data acquisition method and device
CN105721578A (en) User behavior data collection method and system
Pandela et al. Browser forensics on web-based tiktok applications
CN103455597A (en) Distributed information hiding detection method facing mass web images
CN104426863B (en) A kind of page request method, page request device, transfer server and terminal
CN103544288A (en) Browser webpage loading control method and device
CN106326280A (en) Data processing method, apparatus and system
CN107357922A (en) A kind of NFS of distributed file system accesses auditing method and system
CN110020297A (en) A kind of loading method of web page contents, apparatus and system
EP3944111B1 (en) System and method for generating a minimal forensic image of a dataset of interest
CN102215146B (en) Webpage downloading monitoring method and device

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant