CN107015978A - A kind of web page resources processing method and device - Google Patents

A kind of web page resources processing method and device Download PDF

Info

Publication number
CN107015978A
CN107015978A CN201610055758.4A CN201610055758A CN107015978A CN 107015978 A CN107015978 A CN 107015978A CN 201610055758 A CN201610055758 A CN 201610055758A CN 107015978 A CN107015978 A CN 107015978A
Authority
CN
China
Prior art keywords
data cached
web page
text
page resources
data
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201610055758.4A
Other languages
Chinese (zh)
Other versions
CN107015978B (en
Inventor
吴伟勇
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Alibaba China Co Ltd
Original Assignee
Guangzhou Dongjing Computer Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangzhou Dongjing Computer Technology Co Ltd filed Critical Guangzhou Dongjing Computer Technology Co Ltd
Priority to CN201610055758.4A priority Critical patent/CN107015978B/en
Publication of CN107015978A publication Critical patent/CN107015978A/en
Application granted granted Critical
Publication of CN107015978B publication Critical patent/CN107015978B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/957Browsing optimisation, e.g. caching or content distillation
    • G06F16/9574Browsing optimisation, e.g. caching or content distillation of access to content, e.g. by caching
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/958Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

The embodiments of the invention provide a kind of web page resources processing method and device, it is related to computer mobile communication technology field, methods described includes:In response to the web page resources request that user terminal is sent web page resources are obtained from web page server;According to the web page resources, generation head is data cached and text is data cached, the head data cached response header and the first data cached key assignments of the sensing text including the web page resources, the data cached response text including the web page resources of the text, wherein, content identical text is data cached including the first key assignments described in identical;It is respectively that the head is data cached and the text is data cached is written in caching system.This method can realize that the different web pages resource for making the text with identical content data cached has respective head data cached in http cachings, but all point to it is data cached with a text, so as to reduce redundant data in caching system.

Description

A kind of web page resources processing method and device
Technical field
The present invention relates to computer mobile communication technology field, in particular to a kind of net Page method for processing resource and device.
Background technology
CDN full name is Content Delivery Network, i.e. content distributing network.Its Basic ideas are to avoid being possible to influence data transmission bauds and stability on internet as far as possible Bottleneck and link, make content transmission faster, it is more stable.By being placed everywhere in network One on the existing Internet basic layer intelligent virtual network that CDN node is constituted, CDN clusters can in real time according to the connection of network traffics and each node, load state and The request of user is re-directed from user to the integrated information such as the distance of user and response time On nearest CDN service node.The purpose is to make user to obtain required content nearby, The crowded situation of Internet network is solved, the response speed that user accesses website is improved.CDN The web resource can be cached to CDN clusters by node after web resource is received In caching system, the information of caching includes the address of web resource, http response headers, And the body (data volume) of web resource, pair received again in favor of CDN node During the request of the web resource data, it will directly cache in the caching system of CDN clusters The web resource returns to the user terminal for initiating request.
But, the memory capacity of the caching system of CDN clusters is limited.When buffer memory capacity reaches During to the upper limit, caching system can delete part web page resources according to eliminative mechanism set in advance. When the CDN clusters receive the request for the web resource deleted again, Need to carry out multistage nodal cache inquiry or load the web resource from former website, so as to lead Cause the web page resources response time elongated, influence user side takes.
The content of the invention
It is an object of the invention to provide a kind of web page resources processing method and device, to subtract Redundant data in few caching system.
In a first aspect, the embodiments of the invention provide a kind of web page resources processing method, it is described Method includes:
In response to the web page resources request that user terminal is sent webpage money is obtained from web page server Source;
According to the web page resources, generation head is data cached and text is data cached, institute State the data cached response header and sensing including the web page resources in the head text caching First key assignments of data, the data cached response text including the web page resources of the text, Wherein, content identical text is data cached including the first key assignments described in identical;
It is respectively that the head is data cached and the text is data cached is written to caching system In system.
Second aspect, it is described the embodiments of the invention provide a kind of web data processing method Method includes:
When CDN node receives the request of loading web page resources, CDN node obtains institute The resource identifier of the web page resources to be loaded carried in request is stated, with the resource identification Accord with and retrieve valid cache number corresponding with first key assignments in caching system for the first key assignments According to;
When inquiring valid cache data corresponding with the key assignments, CDN node parsing is looked into The valid cache data found, judge whether the form of the valid cache data meets The data cached predetermined cache form in head, wherein, the head data cached predetermined slow Depositing form includes pointing to the first data cached key assignments of the corresponding text, identical Text it is data cached have the key assignments of identical first;
If it is, the first key assignments is obtained in the valid cache data, according to acquired The first key assignments to inquire about corresponding with first key assignments text data cached, when inquiring and When the corresponding text of first key assignments is data cached, based on the valid cache data and The text corresponding with first key assignments is data cached to obtain the webpage money to be loaded Source.
The third aspect, it is described the embodiments of the invention provide a kind of web page resources processing unit Device includes:
Web page resources acquiring unit, for the web page resources request that sends in response to user terminal Web page resources are obtained from web page server;
Data cached generation unit, for generating, head is data cached and text is data cached, The head data cached response header and the sensing text including the web page resources delay First key assignments of deposit data, the data cached response including the web page resources of the text is just Text, wherein, content identical text is data cached including the first key assignments described in identical;
Data cached writing unit, for respectively by the head it is data cached and it is described just The data cached caching of text is written to caching system.
Fourth aspect, the embodiments of the invention provide a kind of web page resources processing unit, is set In CDN node, described device includes:
Resource retrieval unit, for when receiving the request of loading web page resources, obtaining institute The resource identifier of the web page resources to be loaded carried in request is stated, with the resource identification Accord with and retrieve valid cache number corresponding with first key assignments in caching system for the first key assignments According to;
Resource resolution unit, valid cache data corresponding with the key assignments are inquired for working as When, the valid cache data found are parsed, the lattice of the valid cache data are judged Whether formula meets the data cached predetermined cache form in head, wherein, the head caches number According to predetermined cache form include pointing to data cached first of the corresponding text Key assignments, identical text is data cached to have the key assignments of identical first;
Resource acquisition unit, the lattice for valid cache data described in resource resolution unit judges When formula meets the data cached predetermined cache form in head, obtained in the valid cache data The first key assignments is taken, inquires about corresponding just with first key assignments according to the first acquired key assignments Text is data cached, when inquire corresponding with first key assignments text it is data cached when, base It is data cached in the valid cache data and the text corresponding with first key assignments Obtain the web page resources to be loaded.
A kind of web page resources processing method provided in an embodiment of the present invention and device, according to from Web page server obtains web page resources, and generation head is data cached and text is data cached, The data cached response header and sensing including the web page resources in the head text caching number According to the first key assignments, the data cached response text including the web page resources of text, wherein, Content identical text is data cached including the first key assignments described in identical, so as to realize The data cached different web pages resource of the text with identical content is allowed to have each in http cachings From head it is data cached, but all point to it is data cached with a text, so as to reduce caching Redundant data in system, improves the utilization rate of caching system.
Other feature and advantage will illustrate in subsequent specification, also, partly from explanation Become apparent in book, or by implementing understanding of the embodiment of the present invention.The mesh of the present invention And other advantages can pass through the institute in the specification, claims and accompanying drawing write The structure particularly pointed out is realized and obtained.
Brief description of the drawings
In order to illustrate more clearly about the embodiment of the present invention or technical scheme of the prior art, under Face will be briefly described to the required accompanying drawing used in embodiment, it should be apparent that, under Accompanying drawing in the description of face is only some embodiments of the present invention, for ordinary skill For personnel, on the premise of not paying creative work, it can also be obtained according to these accompanying drawings Obtain other accompanying drawings.By shown in accompanying drawing, above and other purpose of the invention, feature and Advantage will become apparent from.Identical reference indicates identical part in whole accompanying drawings. Deliberately accompanying drawing is not drawn by actual size equal proportion scaling, it is preferred that emphasis is show the present invention's Purport.
Fig. 1 is the application ring of web page resources processing method and processing device provided in an embodiment of the present invention Border schematic diagram;
Fig. 2 is the structured flowchart of CDN node provided in an embodiment of the present invention;
The structural frames for the web page resources processing unit that Fig. 3 provides for first embodiment of the invention Figure;
The structural frames for the web page resources processing unit that Fig. 4 provides for second embodiment of the invention Figure;
The structural frames for the web page resources processing unit that Fig. 5 provides for third embodiment of the invention Figure;
The flow chart for the web page resources processing method that Fig. 6 provides for fourth embodiment of the invention;
The flow chart for the web page resources processing method that Fig. 7 provides for fifth embodiment of the invention;
The flow chart for the web page resources processing method that Fig. 8 provides for sixth embodiment of the invention.
Embodiment
The web data processing method and device that the embodiment of the present invention is provided can be applied to as In application environment shown in Fig. 1.As shown in figure 1, user terminal 100, CDN node 200, Web page server 300 is located in wireless network or cable network 400, passes through the wireless network Or cable network 400, user terminal 100 and CDN node network service.
In the embodiment of the present invention, user terminal 100 is preferably mobile terminal device, for example Can include smart mobile phone, tablet personal computer, E-book reader, pocket computer on knee, Vehicle-mounted computer, Wearable mobile terminal etc..
Fig. 2 shows a kind of structure for the CDN node that can be applied in the embodiment of the present invention Block diagram.As shown in Fig. 2 CDN node 200 (only shows one including one or more in figure It is individual) processor 201, storage control 202, it is memory 203, Peripheral Interface 204, logical Believe module 205.These components pass through the phase intercommunication of one or more communication bus/signal wire 206 News.
Memory 203 can be used in storage software program and module, such as embodiment of the present invention The corresponding programmed instruction/module of web page resources processing method and processing device, processor 201 passes through Operation is stored in software program and module in memory 203, so as to perform various functions Using and data processing, such as web page resources processing method provided in an embodiment of the present invention.
Memory 203 may include high speed random access memory, may also include nonvolatile memory, Such as one or more magnetic storage device, flash memory or the storage of other nonvolatile solid states Device.Processor 201 and other access of possible component to memory 203 can be in storages Carried out under the control of controller 202.
Peripheral Interface 206 is by various input/output devices coupled to processor 201 and storage Device 203.In certain embodiments, Peripheral Interface 206, processor 206 and storage control Device 202 can be realized in one single chip.In some other example, they can distinguish Realized by independent chip.
It is appreciated that the structure shown in Fig. 2 is only signal, CDN node 200 may also include More either less components or match somebody with somebody than shown in Fig. 2 with different from shown in Fig. 2 Put.Each component shown in Fig. 2 can be realized using hardware, software or its combination.
There is provided one kind for the web page resources processing method and device that the embodiment of the present invention is proposed New web page resources processing method.The web page resources processing method and device are applicable to CDN node 200.In the embodiment of the present invention, browser is installed in user terminal 100, It is corresponding with CDN node 200, provide the user service.
There is provided a kind of new for the web page resources treating method and apparatus that the embodiment of the present invention is proposed The http buffer memories and inquiry mechanism applied to web page resources.
Inventor is analyzed by the sampled data to different Webpages, is found for some The web addresses of web page resources, such as JavaScript resources or CSS resources are different, but It is that its content there may be unanimously.Such as front end Open Framework Jquery increases income Javascript Storehouse (or Jquery increase income CSS storehouses).If some CDN service node cluster is just first Three pages are post-processed, respectively each self reference Jquery class libraries, then in its http cachings Then have in space three parts it is data cached.But, if the Jquery class libraries that three pages are quoted The data of Javascript resources be just as, just waste two response texts of storage Http spatial caches, it is assumed that the size of response text is 210KB, then just wasted 420KB spatial cache.
Web page resources treating method and apparatus provided in an embodiment of the present invention is according to web page resources Information, the head for the reply header that generation includes the web page resources is data cached and including institute The text for stating the response text of web page resources is data cached, and head is data cached to be included pointing to The first data cached key assignments of the text, and identical text is data cached with identical First key assignments, allows with the data cached different web pages resource of identical text so as to realize There is respective head data cached in http cachings, but all point to a text caching number According to so as to reduce data cached redundancy, allowing http spatial caches to store more caching numbers According to.For example, for above-mentioned scene, nearly 420KB spatial cache can be saved.
It should be noted that the http cache service systems in the embodiment of the present invention are used Key-value databases (or class key-value databases) are realized, are deposited to data Substantially the following key element of key-value databases is followed when storage is with retrieval:Phase between data cached It is mutually independent;Data format is divided into key and value parts, key as data cached index, It is easy to management, and with uniqueness, value is the data of real cache;When the data of caching When having taken spatial cache, if increasing newly data cached, it is necessary to superseded existing slow Deposit data carrys out vacating space and deposits newly data cached.
In the embodiment of the present invention, the data that form is cached will be cached with existing http, It is defined as that entity is data cached, the data cached http answer numbers for including web page resources of entity According to all parts, that is, the statusline including http reply datas, reply header, with And response text.
In the embodiment of the present invention, response text identical different web pages resource definition is with value money Source, the data definition using new caching form caching provided in an embodiment of the present invention is slow with value Deposit data.Since it is desired that realizing that multiple same value resources can be used with portion in http cachings Text is data cached, so introducing two new caching forms:Data cached pre- in head Surely form and the data cached predetermined cache form of text are cached.
In the embodiment of the present invention, the data cached key assignments in head (key) is web page resources URI (Uniform Resource Identifier, Uniform Resource Identifier), URI are used for unique One web page resources of mark.Statuslines of the data cached value in head including serializing, Reply header and the first data cached key assignments of the corresponding text of sensing, first key assignments It is associating web pages resource reply header and the data cached index of text.During head is data cached Do not include response text (body).
In the embodiment of the present invention, text is data cached using the first key assignments as key, and value is The response text (body) of correspondence web page resources.The form of first key assignments includes:Web page resources Typonym and based on the data cached content of the text (namely replying text) Cryptographic Hash, the character string of encoded rear generation calculated.
It is identical because the text with value resource is data cached, it is therefore, slow with value in write-in When the text of deposit data is data cached, if be stored with http spatial caches with this just The data cached identical text of text is data cached, and the text newly write is data cached to be covered The text existed originally in http spatial caches is data cached, that is to say, that for multiple with value Resource, it is data cached to preserve respective head in http spatial caches, but be due to its just Data cached text is only to preserve a identical just in identical, therefore http spatial caches Text is data cached, can thus realize the data cached redundancy of reduction.
Below in conjunction with accompanying drawing in the embodiment of the present invention, to the technical side in the embodiment of the present invention Case is clearly and completely described, it is clear that described embodiment is only the present invention one Section Example, rather than whole embodiments.Generally it is described and illustrated herein in the accompanying drawings The component of the embodiment of the present invention can arrange and design with a variety of configurations.Therefore, The detailed description of the embodiments of the invention to providing in the accompanying drawings, which is not intended to limit, below wants The scope of the present invention of protection is sought, but is merely representative of the selected embodiment of the present invention.It is based on Embodiments of the invention, those skilled in the art are not on the premise of creative work is made The every other embodiment obtained, belongs to the scope of protection of the invention.
It should be noted that:Similar label and letter represents similar terms in following accompanying drawing, because This, once be defined in a certain Xiang Yi accompanying drawing, then in subsequent accompanying drawing need not pair It further define and explain.Meanwhile, in the description of the invention, term " first ", " second " etc. is only used for distinguishing description, and it is not intended that indicating or implying relative importance.
First embodiment
Fig. 3 shows the structured flowchart for the web data processing unit that first embodiment is provided, Referring to Fig. 3, a kind of web page resources processing unit that first embodiment of the invention is provided, this Device in embodiment is preferably operated at CDN node 200, the device that the present embodiment is provided 20 include:
Web page resources acquiring unit 21, for responding the web page resources that user terminal 100 is sent Ask and obtain web page resources from web page server 300;
When the web page resources that web page resources acquiring unit 21 receives the transmission of user terminal 100 please When asking, do not retrieved in caching system after web page resources corresponding with the request, can be with The web page resources are asked to the source server of web page resources.In order to beneficial to receiving the net again During the request of page resource, the more quickly request of response user terminal can be by reception Web storage is into caching system.In the present embodiment, above-mentioned web page resources request can be The browser of user terminal 100 is sent.Also, as a kind of mode, caching system is CDN Http caching systems in cluster.
Data cached generation unit 22, for generating, head is data cached and text caches number According to the head data cached response header and sensing including the web page resources are described just The first data cached key assignments of text, data cached the answering including the web page resources of the text Text is answered, wherein, content identical text is data cached including the first key assignments described in identical.
First key assignments includes:The typonym of the web page resources and based on described Cryptographic Hash that the response text of web page resources is calculated, it is encoded after generation character string.
For example the form of the first key assignments can be:The schema of web page resources://body contents Md5 values carry out the 24 byte character strings generated after base64 codings.
Wherein, the schema of web page resources is the typonym of web page resources, for example can be with Be " Js " and " Css " (case sensitive), naturally it is also possible to extension accommodates more resources-type Type.Using the schema of web page resources, main purpose is to distinguish from key aspect Resource class.Js:Refer to the text it is data cached be Javascript resources response text (body) data.Css:Refer to the text it is data cached be CSS resources response text (body) Data.
Comprising " the md5 values of body contents generated after base64 codings in first key assignments 24 byte character strings ", it is main to be intended to that the data cached storage of text is ensured Uniqueness.Because md5 values are calculated based on data content (rather than size of data) The cryptographic Hash come, ensure that in great data space in algorithm aspect, different The cryptographic Hash that content is produced is certainly different.
Data cached writing unit 23, for respectively that the head is data cached and described The data cached caching of text is written to caching system.
Text is namely cached number by data cached cached in and head data cached to text According to head is data cached is respectively written into caching system, will as a kind of embodiment Text is data cached and head is data cached is written in the http caching systems of CDN clusters. If it is data cached with the text to be stored with the http spatial caches of http caching systems Identical text is data cached, and the text newly write is data cached will to cover http caching skies Between in originally exist text it is data cached, that is to say, that for it is multiple with value resources, can exist The respective head of http spatial caches preservation is data cached, but is due to that its text is data cached It is that only to preserve a identical text in identical, therefore http spatial caches data cached, The data cached redundancy of reduction can thus be realized.
Further, the head data cached to the text and described respectively is data cached After being cached, it can also include:The data cached expired time of the text is set For 0.The data cached expired time of text is set to 0, that is, text is data cached It is set to never expired, can farthest ensures the data cached persistence of the text.
Due to http spatial caches be it is limited, therefore, when the data of caching taken it is slow When depositing space, if increasing newly data cached, it is desired nonetheless to eliminate existing data cached Carry out vacating space to deposit new data cached, specific replacement policy, for example, can use Lru algorithm (Last Recently Used) carries out data and eliminated.Lru algorithm can be certain Ensure that the high and effective data of access frequency unanimously can be retained in before buffer queue in degree Portion, and access frequency is low or failed data can be pushed to queue tail and be easy to eliminate. It is of course also possible to use other replacement policies, specific implementation of the invention is not limited thereto.
Below by by specifically illustrating the web page resources processing method in the present embodiment, Address and http response reports the following is the web page resources of three Jquery Javacript types The reply header of text:
Web page resources one
Address (URI):
http://spuvvn.edu/bitrix/templates/sardar_patel/js/jquery-ui-1.8.16. custom.min.js
Reply header:
HTTP/1.0 200OK
Date:Wed,08Jul 2015 06:43:45GMT
Server:Apache/2.2.27(Unix)mod_ssl/2.2.27 OpenSSL/1.0.1e-fips DAV/2mod_jk/1.2.37mod_bwlimited/1.4 PHP/5.3.28
Last-Modified:Thu,16Feb 2012 17:55:54GMT
ETag:"31000b8-3361f-4b9188a179e80"
Accept-Ranges:bytes
Content-Length:210463
Content-Type:application/javascript
X-Cache:MISS from devy.ucweb.local
X-Cache-Lookup:MISS from devy.ucweb.local:3128
Via:1.0devy.ucweb.local:3128(squid/2.6.STABLE21)
Proxy-Connection:close
Web page resources two
Address (URI):
http://m.sportzwiki.com/assets/js/jquery-ui-1.8.16.custom.min.js
Reply header:
HTTP/1.0 200OK
Date:Wed,08Jul 2015 06:47:55GMT
Content-Type:application/javascript
Content-Length:210463
Set-Cookie:
_ _ cfduid=d3d61c97334953c660457bbb5a0e183a51436338075; Expires=Thu, 07-Jul-16 06:47:55 GMT;Path=/; Domain=.sportzwiki.com;HttpOnly
Last-Modified:Wed,18Mar 2015 13:43:31GMT
ETag:"94e599-3361f-5119044d006c0"
CF-Cache-Status:HIT
Expires:Mon,13Jul 2015 06:47:55GMT
Cache-Control:Public, max-age=432000
Accept-Ranges:bytes
Server:cloudflare-nginx
CF-RAY:2029d72c77d30bab-HKG
X-Cache:MISS from devy.ucweb.local
X-Cache-Lookup:MISS from devy.ucweb.local:3128
Via:1.0devy.ucweb.local:3128(squid/2.6.STABLE21)
Proxy-Connection:close
Web page resources three
Address (URI):
http://www.rcs-rds.ro/resources/jquery_ui/js/jquery-ui-1.8.16.custo m.min.js
Reply header:
HTTP/1.0 200OK
X-Varnish:1089339340
Vary:Accept-Encoding
X-Cache:MISS
Content-Type:application/javascript
Date:Wed,08Jul 2015 06:50:54GMT
Accept-Ranges:bytes
Accept-Ranges:bytes
ETag:"503dde-3361f-4aeb148764ec0"
Last-Modified:Fri,07Oct 2011 08:32:35GMT
Age:0
Content-Length:210463
X-Cache:MISS from devy.ucweb.local
X-Cache-Lookup:MISS from devy.ucweb.local:3128
Via:1.1varnish,1.0devy.ucweb.local:3128 (squid/2.6.STABLE21)
Proxy-Connection:close
As can be seen that the URI of three web page resources different and cross-domain (being in different domain names), But from the point of view of the Content-Length field values of three web page resources, response text (body) Size be all 210463 bytes (about 210KB).
Assuming that the content calculating md5 values to the response text (body) of three web page resources are entered again Row base64 is encoded, and the check value drawn is all " ZcfHB93eoMeGFxTfJQ1UxA==" It can so prove that the content of the response text (body) of three web page resources is just as 's.That is these three web page resources are exactly " with the value resource " described in the embodiment of the present invention.
The reason in the presence of " with value resource ", is that most of websites may employ similar station Point template is built a station, so, because template is same or similar, its front end used Technology may all employ some comparison main flows and powerful Javascript class libraries or CSS storehouses, such as Jquery.Even with different website templates, due to needing to realize certain A little front-end functionality characteristics, the Javascript class libraries of use main flow that also can be simultaneously or CSS storehouses.So for the external connection Javascript resources (or Css resources) of different websites, Should exist a certain proportion of with value resource.Such as above three Jquery resources, its title It is inherently identical.But can also there are title difference but the consistent scene of resource content, be typically Website side is modified to title, but generally resource name can retain the key of library name Word, such as Jquery.
The first data cached key assignments of the texts of three Jquery resources mentioned above can be with It is expressed as:
Js://ZcfHB93eoMeGFxTfJQ1UxA==
From the foregoing it will be appreciated that using the web page resources stored with value cache way, compared to existing Http cache way, it is actual to store that two http are data cached, and one is head caching Data, another is that text is data cached.It is data cached for head, can in its value With by increasing a special field " body-key newly in reply header field:" refer to deposit To the first data cached key assignments of corresponding text, it is preferred that the field is placed on The first row in value.
Illustrate that its head is data cached and text is data cached with a Jquery resource below Form.
The data cached predetermined cache form in head:
The web page resources processing method that the present embodiment is provided can reduce data cached redundancy, Http spatial caches are enable to store more multi-caching data, so as to allow the http of finite capacity Caching can store more data cached, and its http cache hit is improved to a certain extent Rate, reaches benefit bigizationner, and the response for reducing web resource takes.
It should be noted that each unit in the present embodiment can be by software code realization, Now, above-mentioned each unit can be stored in the memory 203 of CDN node 200.With Upper each unit can equally be realized by hardware such as IC chip.It is appreciated that above-mentioned Web page resources processing unit 30 also can run on other and be connected to user terminal 100 and webpage The server for cache web pages resource between server 200.
Second embodiment
Fig. 4 shows the structured flowchart for the web data processing unit that second embodiment is provided, Referring to Fig. 4, a kind of web page resources processing unit 30 that second embodiment of the invention is provided, Device in the present embodiment is preferably operated at CDN node 200, the webpage that the present embodiment is provided Resource processing unit 30 includes:
Web page resources acquiring unit 31, for responding the web page resources that user terminal 100 is sent Ask and obtain web page resources from web page server 300;
Data cached generation unit 32, for generating, head is data cached and text caches number According to the head data cached response header and sensing including the web page resources are described just The first data cached key assignments of text, data cached the answering including the web page resources of the text Text is answered, wherein, content identical text is data cached including the first key assignments described in identical;
Data cached writing unit 33, for respectively that the head is data cached and described The data cached caching of text is written to caching system.
Preferably, the web page resources processing unit 30, in addition to:
First judging unit 34, for judging whether the web page resources can cache, if can Caching, then data cached generation unit 32 is according to the web page resources, generation head caching number According to this and text is data cached, otherwise, without caching.
Second judging unit 35, for judging whether the web page resources conform to a predetermined condition;
If meeting the predetermined condition, the data cached generation unit 32 is according to the net Page resource, generation head is data cached and text is data cached.
Otherwise, the data cached generation unit 32 generates real according to the information of institute's web page resources Body is data cached, and data cached writing unit 33 directly treats the data cached write-in of the entity Caching system, the data cached all http answer numbers including the web page resources of the entity According to.
Because the same value that the embodiment of the present invention is proposed caches and text data cached to head caching Data will carry out write-in caching respectively, and write-once operation is added than existing write-in caching, The read operation of response also increases once, so, can be to using same in order to improve efficiency The web page resources that value caching method is cached are any limitation as, that is, are being carried out with value caching Before, whether progress one judgement, if meeting predetermined if first being conformed to a predetermined condition to web page resources Condition is just handled in the way of with value caching, otherwise, according to existing cache way Handled.
In the present embodiment, the predetermined condition can include following one or more of combination:
The type of the web page resources is preset kind;
The size of the web page resources is more than predetermined threshold value;And
The title of the web page resources is present in predetermined keyword list.
In all kinds of web page resources, cacheable web page resources concentrate on external connection JavaScript (a kind of literal translation formula script), CSS (a kind of form design language) and picture money Source.Wherein, parsing and rendering speed of JavaScript the and CSS resources for the page has one It is fixing to ring.And for web page resources, JavaScript and CSS resources are present with value money The possibility in source is larger, and the page and picture there is a possibility that same value resource is smaller, so The type of web page resources can be any limitation as.In the present embodiment, preset kind is preferred For JavaScript types or CSS types, as long as the type of web page resources is default for both One of type.
By judging whether the size of web page resources can be with more targeted more than predetermined threshold value To being carried out with value resource with value caching process.In the present embodiment, web page resources size it is pre- If threshold value can be for example 50KB, certainly, the size of predetermined threshold value also can be according to actual feelings Condition is adjusted, and is not intended as the restriction to embodiment of the present invention.
The title (filename) of web page resources whether there is in predetermined keyword list, that is, Whether the title of finger web page resources, which can match lists of keywords resource, uses with value caching plan Slightly.Such as " jquery " is exactly one of keyword.
These decision conditions can be configured by CDN node, be entered by backstage issuing mechanism Row control.Can also be by setting buffer control switch to control being turned on and off for the function. Assuming that setting needs to meet three above condition simultaneously, then when some web page resources is simultaneously full During sufficient above three condition, triggering is with value caching.
It should be noted that each unit in the present embodiment can be by software code realization, Now, above-mentioned each unit can be stored in the memory 203 of CDN node 200.With Upper each unit can equally be realized by hardware such as IC chip.
3rd embodiment
Fig. 5 shows the structured flowchart for the web data processing unit that 3rd embodiment is provided, Referring to Fig. 5, a kind of web page resources processing unit 40 that third embodiment of the invention is provided, Device in the present embodiment is preferably operated at CDN node 200, the webpage that the present embodiment is provided Resource processing unit 40 includes:
Resource retrieval unit 41, for when receiving the request of loading web page resources, obtaining The resource identifier of the web page resources to be loaded carried in the request, with the resource mark Know symbol and valid cache corresponding with first key assignments is retrieved in caching system for the first key assignments Data;
Resource resolution unit 42, valid cache number corresponding with the key assignments is inquired for working as According to when, parse the valid cache data that find, judge the valid cache data Whether form meets the data cached predetermined cache form in head, wherein, the head caching The predetermined cache form of data includes pointing to data cached the of the corresponding text One key assignments, identical text is data cached to have the key assignments of identical first;
Resource acquisition unit 43, for valid cache data described in resource resolution unit judges During the data cached predetermined cache form in format character syncephalon portion, in the valid cache data The first key assignments is obtained, inquires about corresponding with first key assignments according to the first acquired key assignments Text is data cached, when inquire corresponding with first key assignments text it is data cached when, Based on the valid cache data and the text caching number corresponding with first key assignments According to the acquisition web page resources to be loaded.
As a kind of embodiment, when CDN node receives request, for example http resources please Ask, it is necessary to load the web page resources of some external connection, be with the resource identifier (URI) of resource Key inquires about whether it has the data cached of preservation to the http cache service systems of CDN clusters (get operations).Http cache service systems are retrieved in the data queue of spatial cache with being somebody's turn to do Key corresponding data cached (first time get operations), if there is no corresponding data cached Or it is data cached failed, CDN node judge the web page resources http caching be not hit by, The web page resources to be loaded are asked to source web page server 300.
If Query Result is has, the valid cache number that CDN node parsing is found According to value values.If it find that being that entity is data cached, that is, it is existing caching number According to form, then directly web page resources to be loaded are obtained according to the valid cache data found Information.If it find that being that head is data cached, then CDN node is based on head and caches form, According to " body-key " field of reply header, the data cached key values of text are parsed, Namely the first key assignments, and inquiry (the is initiated to http caching systems with first key assignments again Secondary get operations).If there is and effectively, then http caching systems by the text cache number According to CDN node is returned to, CDN node is data cached data cached with text based on head, Obtain the information of web page resources to be loaded.If there is no the target text it is data cached or It is data cached but failed to there is the target text in person, then judges webpage money to be loaded The http cachings in source are not hit by, the web page resources to be loaded to Web server request.
Web-browsing data of the inventor based on multiple users within a period of time, statistics JS resource datas related jquery.1565 JS resources are had, wherein there are 743 moneys Source belongs to content and repeats resource (with value resource);And pass through at this 743 with value resource According to after content repetition re-scheduling, (such as A, B, C are, with value resource, only to retain A, and are picked Except B and C), remaining 141 resources, average repetition index is about the 4.27 (weights of rejecting Total number resource after multiple total number resource/re-scheduling).Thus, it is possible to find out what the present embodiment was provided Web page resources processing method can reduce data cached redundancy, enable http spatial caches Store more multi-caching data.
It should be noted that each unit in the present embodiment can be by software code realization, Now, above-mentioned each unit can be stored in the memory 203 of CDN node 200.With Upper each unit can equally be realized by hardware such as IC chip.
Fourth embodiment
Fig. 6 shows a kind of web page resources processing method that fourth embodiment of the invention is provided Flow chart, referring to Fig. 6, the present embodiment describes the handling process of CDN node, institute The method of stating includes:
Step S510, takes in response to the web page resources request that user terminal 100 is sent from webpage Business device 300 obtains web page resources.It is appreciated that the web page resources that first embodiment is provided are obtained Unit 21 is taken to perform step S510.
Step S520, according to the information of the web page resources, generation includes the web page resources Reply header data cached and including the web page resources the response text in head text It is data cached, the head is data cached include pointing to the text it is data cached first Key assignments, identical text is data cached to have the key assignments of identical first.It is appreciated that first The data cached generation unit 22 that embodiment is provided can perform step S520.
Step S530, it is respectively that the head is data cached and the text is data cached writes Enter into caching system.It is appreciated that the data cached writing unit that first embodiment is provided 23 can perform step S520.
Further, as a preferred embodiment, data cached writing unit 23 It is respectively that the head is data cached and the text is data cached is written in caching system Afterwards, it can also include:The data cached expired time of the text is set to 0.Will The data cached expired time of text is set to 0, that is, is set to text is data cached It is never expired, it can farthest ensure the data cached persistence of the text.
The web page resources processing method that the present embodiment is provided can reduce data cached redundancy, Http spatial caches are enable to store more multi-caching data, so as to allow the http of finite capacity Caching can store more data cached, and its http cache hit is improved to a certain extent Rate, reaches benefit bigizationner, and the response for reducing web page resources takes.
5th embodiment
Fig. 7 shows a kind of web page resources processing method that fifth embodiment of the invention is provided Flow chart.Referring to Fig. 7, the present embodiment describes the handling process of CDN node, institute The method of stating includes:
Step S610, takes in response to the web page resources request that user terminal 100 is sent from webpage Business device 300 obtains web page resources.It is appreciated that the web page resources that second embodiment is provided are obtained Unit 31 is taken to perform step S610.
Step S620, judges whether the web page resources can cache.It is appreciated that second is real Step S620 can be performed by applying the first judging unit 34 of example offer.
If it is, performing step S630.Otherwise, without caching, that is, caching is write Enter flow to terminate.
Step S630, judges whether the web page resources conform to a predetermined condition.It is appreciated that The second judging unit 35 that second embodiment is provided can perform step S630.
If meeting the predetermined condition, step S640 is performed, otherwise, step S660 is performed.
In the present embodiment, the predetermined condition can include following one or more of combination:
The type of the web page resources is preset kind;
The size of the web page resources is more than predetermined threshold value;And
The title of the web page resources is present in predetermined keyword list.
Step S640, the head for generating the reply header for including the web page resources is data cached Text with the response text including the web page resources is data cached.It is appreciated that second The data cached generation unit 32 that embodiment is provided can perform step S640.
Step S650, it is respectively that the head is data cached and the text is data cached writes Enter into caching system.It is appreciated that the data cached writing unit that second embodiment is provided 33 can perform step S650.
Step S660, it is data cached according to the information of web page resources generation entity, directly It is data cached to the entity to cache.It is appreciated that the caching that second embodiment is provided Data write unit 33 can perform step S660.
Sixth embodiment
Fig. 8 shows a kind of web page resources processing method that sixth embodiment of the invention is provided Flow chart, referring to Fig. 8, the present embodiment describes what CDN node processing caching was read Flow, methods described includes:
Step S710, when CDN node receives the request of loading web page resources, is obtained The resource identifier of the web page resources to be loaded carried in the request, with the resource mark Know symbol and valid cache corresponding with first key assignments is retrieved in caching system for the first key assignments Data.It is appreciated that the resource retrieval unit 41 that 3rd embodiment is provided can perform the step Rapid S710.
If Query Result is in the absence of execution step S720, if Query Result is to deposit Then performing step S730.
Step S720, asks the web page resources to be loaded to web page server 300, connects Execution step S610, into caching flow, that is, the stream described in the 5th embodiment Journey, is repeated no more here.It is appreciated that the resource retrieval unit 41 that 3rd embodiment is provided Step S720 can be performed.
Step S730, parses the valid cache data found.
Step S740, judges whether the form of the valid cache data meets head caching number According to predetermined cache form.It is appreciated that the resource resolution unit 42 that 3rd embodiment is provided Step S730 and step S740 can be performed.
If the data cached predetermined cache lattice in the format character syncephalon portion of the valid cache data Formula, that is to say, that valid cache data are that head is data cached, performs step S750.Such as Fruit is not then to perform step S770.
Step S750, obtains the first key assignments, according to acquired in the valid cache data The first key assignments to inquire about corresponding target text data cached.
If there is and effectively, then perform step S760, if there is no the target text It is data cached or to there is the target text data cached but failed, then judge that this is treated The http cachings of loading web page resources are not hit by, and perform step S720.
Step S760, it is data cached based on the valid cache data and the target text Obtain the information of the web page resources to be loaded.
Step S770, directly obtains the webpage to be loaded according to the valid cache data and provides The information in source.It is appreciated that the resource acquisition unit 43 that 3rd embodiment is provided can be performed Step S750, step S760 and step S770.
It is apparent to those skilled in the art that, for convenience and simplicity of description, Detailed process in the embodiment of the method for foregoing description, may be referred in aforementioned means embodiment Corresponding process, will not be repeated here.
In summary, web page resources treating method and apparatus provided in an embodiment of the present invention according to The information of the web page resources obtained by web page server, generation includes answering for the web page resources Answer the text caching of data cached and including the web page resources the response text in head on head Data, head is data cached to be included pointing to the first data cached key assignments of the text, and Identical text is data cached to have the key assignments of identical first, is allowed so as to realize with phase There is respective head caching number in http cachings with the data cached different web pages resource of text According to, but all point to data cached with a text, so as to reduce data cached redundancy, make Http spatial caches can store more multi-caching data, and http cachings are improved to a certain extent Hit rate.
It should be noted that each embodiment in this specification is retouched by the way of progressive State, what each embodiment was stressed is the difference with other embodiment, each reality Apply between example identical similar part mutually referring to.
Web page resources processing unit and system that the embodiment of the present invention is provided, its realization principle And the technique effect produced is identical with preceding method embodiment, to briefly describe, device is implemented Example part does not refer to part, refers to corresponding contents in preceding method embodiment.
In addition, the flow chart and block diagram in accompanying drawing show multiple embodiments according to the present invention System, architectural framework in the cards, function and the behaviour of method and computer program product Make.At this point, each square frame in flow chart or block diagram can represent module, a journey A part for sequence section or code a, part for the module, program segment or code includes one Or multiple executable instructions for being used to realize defined logic function.It should also be noted that having In a little realizations as replacement, the function of being marked in square frame can also be with different from accompanying drawing The order marked occurs.For example, two continuous square frames can essentially be substantially in parallel Perform, they can also be performed in the opposite order sometimes, and this is depending on involved function. It is also noted that each square frame and block diagram and/or flow in block diagram and/or flow chart The combination of square frame in figure, can be with function or action as defined in performing it is special based on hard The system of part is realized, or can be realized with the combination of specialized hardware and computer instruction.
The computer program product that the embodiment of the present invention is provided, including store program code Computer-readable recording medium, the instruction that described program code includes can be used for perform before Method described in embodiment of the method, implements and can be found in embodiment of the method, herein no longer Repeat.
It is apparent to those skilled in the art that, for convenience and simplicity of description, The specific work process of the system of foregoing description, device and unit, may be referred to preceding method Corresponding process in embodiment, will not be repeated here.
In several embodiments provided herein, it should be understood that disclosed system, Apparatus and method, can be realized by another way.Device embodiment described above It is only schematical, for example, the division of the unit, only a kind of logic function is drawn Point, there can be other dividing mode when actually realizing, in another example, multiple units or component Another system can be combined or be desirably integrated into, or some features can be ignored, or not Perform.Another, shown or discussed coupling or direct-coupling or communication each other Connection can be by some communication interfaces, the INDIRECT COUPLING or communication connection of device or unit, Can be electrical, machinery or other forms.
The unit illustrated as separating component can be or may not be and physically divide Open, the part shown as unit can be or may not be physical location, you can With positioned at a place, or it can also be distributed on multiple NEs.Can be according to reality Some or all of unit therein is selected to realize the mesh of this embodiment scheme the need for border 's.
In addition, each functional unit in each embodiment of the invention can be integrated at one Reason unit in or unit be individually physically present, can also two or two with Upper unit is integrated in a unit.
If the function is realized using in the form of SFU software functional unit and is used as independent product pin Sell or in use, can be stored in a computer read/write memory medium.Based on so Understanding, the portion that technical scheme substantially contributes to prior art in other words Divide or the part of the technical scheme can be embodied in the form of software product, the calculating Machine software product is stored in a storage medium, including some instructions are to cause a meter Calculate machine equipment (can be personal computer, server, or network equipment etc.) and perform sheet Invent all or part of step of each embodiment methods described.And foregoing storage medium bag Include:USB flash disk, mobile hard disk, read-only storage (ROM, Read-Only Memory), Random access memory (RAM, Random Access Memory), magnetic disc or CD Etc. it is various can be with the medium of store program codes.
It should be noted that herein, such as first and second or the like relational terms It is used merely to make a distinction an entity or operation with another entity or operation, without It is certain require or imply exist between these entities or operation any this actual relation or Person's order.Moreover, term " comprising ", "comprising" or its any other variant are intended to Nonexcludability is included so that process, method, article including a series of key elements or Person's equipment not only includes those key elements, but also other key elements including being not expressly set out, Either also include for this process, method, article or the intrinsic key element of equipment. In the case of there is no more limitations, the key element limited by sentence "including a ...", not Exclude and also exist in addition in the process including the key element, method, article or equipment Identical element.
The preferred embodiments of the present invention are the foregoing is only, are not intended to limit the invention, For those skilled in the art, the present invention can have various modifications and variations.It is all Within the spirit and principles in the present invention, any modification, equivalent substitution and improvements made etc., It should be included in the scope of the protection.It should be noted that:Similar label and letter Similar terms is represented in following accompanying drawing, therefore, once determined in a certain Xiang Yi accompanying drawing Justice, then further need not be defined and be explained to it in subsequent accompanying drawing.

Claims (14)

1. a kind of web page resources processing method, it is characterised in that methods described includes:
In response to the web page resources request that user terminal is sent webpage money is obtained from web page server Source;
According to the web page resources, generation head is data cached and text is data cached, institute State the data cached response header and sensing including the web page resources in the head text caching First key assignments of data, the data cached response text including the web page resources of the text, Wherein, content identical text is data cached including the first key assignments described in identical;
It is respectively that the head is data cached and the text is data cached is written to caching system In system.
2. according to the method described in claim 1, it is characterised in that the first key assignments bag Include:The typonym of the web page resources and the text according to the web page resources The character string generated after the data cached cryptographic Hash re-encoding calculated.
3. according to the method described in claim 1, it is characterised in that described according to the net Page resource, generation head is data cached and text it is data cached before, including:
Judge whether the web page resources can cache, if can cache, perform the basis The web page resources, the generation step that head is data cached and text is data cached, otherwise, Without caching.
4. according to the method described in claim 1, it is characterised in that described according to the net Page resource, generation head is data cached and text it is data cached before, including:
Judge whether the web page resources conform to a predetermined condition;
If meeting the predetermined condition, described according to the web page resources, generation head is delayed Deposit data and the data cached step of text;
Otherwise, it is data cached according to the information of institute's web page resources generation entity, directly to described Entity is data cached to be cached, the data cached institute including the web page resources of the entity There are http reply datas.
5. method according to claim 4, it is characterised in that the predetermined condition bag Include following one or more of combination:
The type of the web page resources is preset kind;
The size of the web page resources is more than predetermined threshold value;And
The title of the web page resources is present in predetermined keyword list.
6. according to the method described in claim 1, it is characterised in that described respectively to described The data cached and described head of text is data cached cached after, in addition to:
The data cached expired time of the text is set to 0.
7. a kind of web data processing method, it is characterised in that methods described includes:
When CDN node receives the request of loading web page resources, CDN node obtains institute The resource identifier of the web page resources to be loaded carried in request is stated, with the resource identification Accord with and retrieve valid cache number corresponding with first key assignments in caching system for the first key assignments According to;
When inquiring valid cache data corresponding with the key assignments, CDN node parsing is looked into The valid cache data found, judge whether the form of the valid cache data meets The data cached predetermined cache form in head, wherein, the head data cached predetermined slow Depositing form includes pointing to the first data cached key assignments of the corresponding text, identical Text it is data cached have the key assignments of identical first;
If it is, the first key assignments is obtained in the valid cache data, according to acquired The first key assignments to inquire about corresponding with first key assignments text data cached, when inquiring and When the corresponding text of first key assignments is data cached, based on the valid cache data and Text corresponding with first key assignments is data cached to obtain the web page resources to be loaded.
8. a kind of web page resources processing unit, it is characterised in that described device includes:
Web page resources acquiring unit, for the web page resources request that sends in response to user terminal Web page resources are obtained from web page server;
Data cached generation unit, for generating, head is data cached and text is data cached, The head data cached response header and the sensing text including the web page resources delay First key assignments of deposit data, the data cached response including the web page resources of the text is just Text, wherein, content identical text is data cached including the first key assignments described in identical;
Data cached writing unit, for respectively by the head it is data cached and it is described just The data cached caching of text is written to caching system.
9. device according to claim 8, it is characterised in that the first key assignments bag Include:The typonym of the web page resources and the text according to the web page resources The character string generated after the data cached cryptographic Hash re-encoding calculated.
10. device according to claim 8, it is characterised in that described device, also Including:
First judging unit, for judging whether the web page resources can cache, if can delay Deposit, then perform described according to the web page resources, generation head is data cached and text is slow The step of deposit data, otherwise, without caching.
11. device according to claim 8, it is characterised in that described device, also Including:
Second judging unit, for judging whether the web page resources conform to a predetermined condition;
If meeting the predetermined condition, the data cached generation unit is according to the webpage Resource, the generation step that head is data cached and text is data cached;
Otherwise, the data cached generation unit generates entity according to the information of institute's web page resources Data cached, data cached write of the entity directly is waited to cache by data cached writing unit System, the data cached all http reply datas including the web page resources of the entity.
12. device according to claim 11, it is characterised in that the predetermined condition Including following one or more of combination:
The type of the web page resources is preset kind;
The size of the web page resources is more than predetermined threshold value;And
The title of the web page resources is present in predetermined keyword list.
13. device according to claim 8, it is characterised in that described data cached to write Enter unit, be additionally operable to respectively that the head is data cached and the text is data cached slow Deposit before being written to caching system, the data cached expired time of the text is set to 0.
14. a kind of web page resources processing unit, it is characterised in that be arranged at CDN node, Described device includes:
Resource retrieval unit, for when receiving the request of loading web page resources, obtaining institute The resource identifier of the web page resources to be loaded carried in request is stated, with the resource identification Accord with and retrieve valid cache number corresponding with first key assignments in caching system for the first key assignments According to;
Resource resolution unit, valid cache data corresponding with the key assignments are inquired for working as When, the valid cache data found are parsed, the lattice of the valid cache data are judged Whether formula meets the data cached predetermined cache form in head, wherein, the head caches number According to predetermined cache form include pointing to data cached first of the corresponding text Key assignments, identical text is data cached to have the key assignments of identical first;
Resource acquisition unit, the lattice for valid cache data described in resource resolution unit judges When formula meets the data cached predetermined cache form in head, obtained in the valid cache data The first key assignments is taken, inquires about corresponding just with first key assignments according to the first acquired key assignments Text is data cached, when inquire corresponding with first key assignments text it is data cached when, base It is data cached in the valid cache data and the text corresponding with first key assignments Obtain the web page resources to be loaded.
CN201610055758.4A 2016-01-27 2016-01-27 Webpage resource processing method and device Active CN107015978B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610055758.4A CN107015978B (en) 2016-01-27 2016-01-27 Webpage resource processing method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610055758.4A CN107015978B (en) 2016-01-27 2016-01-27 Webpage resource processing method and device

Publications (2)

Publication Number Publication Date
CN107015978A true CN107015978A (en) 2017-08-04
CN107015978B CN107015978B (en) 2020-07-07

Family

ID=59438784

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610055758.4A Active CN107015978B (en) 2016-01-27 2016-01-27 Webpage resource processing method and device

Country Status (1)

Country Link
CN (1) CN107015978B (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108234639A (en) * 2017-12-29 2018-06-29 北京奇虎科技有限公司 A kind of data access method and device based on content distributing network CDN
CN108804695A (en) * 2018-06-14 2018-11-13 广州谱道网络科技有限公司 Promotion link generation and identification method and device
CN110147478A (en) * 2017-10-20 2019-08-20 中国电信股份有限公司 Web page subject word acquisition methods and system, server and user terminal
CN111083108A (en) * 2019-11-14 2020-04-28 北京字节跳动网络技术有限公司 Data processing method, device, medium and electronic equipment
CN113590658A (en) * 2021-07-06 2021-11-02 广州汇思信息科技股份有限公司 Cache data processing method and device, computer equipment and storage medium

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101583089A (en) * 2008-05-12 2009-11-18 华为技术有限公司 Message storage method and message sending method and equipment
CN101706825A (en) * 2009-12-10 2010-05-12 华中科技大学 Replicated data deleting method based on file content types
CN101777056A (en) * 2009-12-31 2010-07-14 成都市华为赛门铁克科技有限公司 Data storage method and device
CN102096712A (en) * 2011-01-28 2011-06-15 深圳市五巨科技有限公司 Method and device for cache-control of mobile terminal
CN102111449A (en) * 2011-02-23 2011-06-29 北京蓝汛通信技术有限责任公司 Method, device and system for updating data
US20150081962A1 (en) * 2007-07-13 2015-03-19 Samsung Electronics Co., Ltd. Cache memory device and data processing method of the device
CN105589919A (en) * 2015-09-18 2016-05-18 广州市动景计算机科技有限公司 Method and device for processing webpage resource

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20150081962A1 (en) * 2007-07-13 2015-03-19 Samsung Electronics Co., Ltd. Cache memory device and data processing method of the device
CN101583089A (en) * 2008-05-12 2009-11-18 华为技术有限公司 Message storage method and message sending method and equipment
CN101706825A (en) * 2009-12-10 2010-05-12 华中科技大学 Replicated data deleting method based on file content types
CN101777056A (en) * 2009-12-31 2010-07-14 成都市华为赛门铁克科技有限公司 Data storage method and device
CN102096712A (en) * 2011-01-28 2011-06-15 深圳市五巨科技有限公司 Method and device for cache-control of mobile terminal
CN102111449A (en) * 2011-02-23 2011-06-29 北京蓝汛通信技术有限责任公司 Method, device and system for updating data
CN105589919A (en) * 2015-09-18 2016-05-18 广州市动景计算机科技有限公司 Method and device for processing webpage resource

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110147478A (en) * 2017-10-20 2019-08-20 中国电信股份有限公司 Web page subject word acquisition methods and system, server and user terminal
CN110147478B (en) * 2017-10-20 2021-06-29 中国电信股份有限公司 Webpage subject term obtaining method and system, server and user terminal
CN108234639A (en) * 2017-12-29 2018-06-29 北京奇虎科技有限公司 A kind of data access method and device based on content distributing network CDN
CN108804695A (en) * 2018-06-14 2018-11-13 广州谱道网络科技有限公司 Promotion link generation and identification method and device
CN111083108A (en) * 2019-11-14 2020-04-28 北京字节跳动网络技术有限公司 Data processing method, device, medium and electronic equipment
CN113590658A (en) * 2021-07-06 2021-11-02 广州汇思信息科技股份有限公司 Cache data processing method and device, computer equipment and storage medium

Also Published As

Publication number Publication date
CN107015978B (en) 2020-07-07

Similar Documents

Publication Publication Date Title
Ali et al. Intelligent web proxy caching approaches based on machine learning techniques
CN107015978A (en) A kind of web page resources processing method and device
CN100530186C (en) Method and system for processing buffer
US9497256B1 (en) Static tracker
EP3036662B1 (en) Generating cache query requests
US9253278B2 (en) Using entity tags (ETags) in a hierarchical HTTP proxy cache to reduce network traffic
CN105589919B (en) Web page resources processing method and processing device
US10693858B2 (en) CDN-based access control method and related device
JP2020057438A (en) Sentence extraction method and system
US8438336B2 (en) System and method for managing large filesystem-based caches
Shi et al. Modeling object characteristics of dynamic web content
Doran et al. A comparison of web robot and human requests
CN103916474B (en) The definite method, apparatus and system of cache-time
CN107506154A (en) A kind of read method of metadata, device and computer-readable recording medium
WO2022148306A1 (en) Data elimination method and apparatus, cache node, and cache system
US8694659B1 (en) Systems and methods for enhancing domain-name-server responses
Kaya et al. An admission-control technique for delay reduction in proxy caching
CN108875036A (en) Page data caching method, device, page cache data structure and electronic equipment
CN112416626B (en) Data processing method and device
Mukhopadhyay et al. A dynamic web page prediction model based on access patterns to offer better user latency
CN108073585A (en) Network font loading method, device and system
CN112016017A (en) Method and device for determining characteristic data
Hiranpongsin et al. Integration of recommender system for Web cache management
CN111813711B (en) Method and device for reading training sample data, storage medium and electronic equipment
US11411937B2 (en) Web scraping prevention system

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right

Effective date of registration: 20200526

Address after: 310052 room 508, floor 5, building 4, No. 699, Wangshang Road, Changhe street, Binjiang District, Hangzhou City, Zhejiang Province

Applicant after: Alibaba (China) Co.,Ltd.

Address before: 510627 Guangdong city of Guangzhou province Whampoa Tianhe District Road No. 163 Xiping Yun Lu Yun Ping B radio 14 floor tower square

Applicant before: GUANGZHOU UCWEB COMPUTER TECHNOLOGY Co.,Ltd.

TA01 Transfer of patent application right
GR01 Patent grant
GR01 Patent grant