CN107015978A - Web page resource processing method and device - Google Patents
- Publication number: CN107015978A (application CN201610055758.4A)
- Authority: CN (China)
- Prior art keywords: cached data, web page, body text, page resources, data
- Legal status: Granted (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Classifications
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/957—Browsing optimisation, e.g. caching or content distillation
- G06F16/9574—Browsing optimisation, e.g. caching or content distillation of access to content, e.g. by caching
- G06F16/958—Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking
Abstract
Embodiments of the invention provide a web page resource processing method and device, relating to the field of computer and mobile communication technology. The method includes: obtaining a web page resource from a web server in response to a web page resource request sent by a user terminal; generating header cache data and body cache data from the web page resource, where the header cache data includes the response header of the web page resource and a first key pointing to the body cache data, the body cache data includes the response body of the web page resource, and body cache data with identical content has the same first key; and writing the header cache data and the body cache data to a caching system separately. With this method, different web page resources whose body cache data has identical content each keep their own header cache data in the HTTP cache while all pointing to a single copy of the body cache data, which reduces redundant data in the caching system.
Description
Technical field
The present invention relates to the field of computer and mobile communication technology, and in particular to a web page resource processing method and device.
Background
CDN stands for Content Delivery Network. Its basic idea is to avoid, as far as possible, the bottlenecks and links on the Internet that may affect the speed and stability of data transmission, so that content is delivered faster and more reliably. A CDN places nodes throughout the network to form an intelligent virtual network layered on top of the existing Internet. Based on combined information such as network traffic, the connection and load status of each node, and the distance to the user and the response time, the CDN cluster redirects a user's request in real time to the CDN service node nearest to that user. The aim is to let users obtain the content they need from a nearby node, relieve Internet congestion, and improve the response speed of website access. After receiving a web resource, a CDN node can cache it in the caching system of the CDN cluster. The cached information includes the address of the web resource, the HTTP response header, and the body (data body) of the web resource. When the CDN node later receives another request for that web resource, it returns the copy held in the CDN cluster's caching system directly to the user terminal that made the request.
However, the storage capacity of the CDN cluster's caching system is limited. When the cache reaches its capacity limit, the caching system deletes some web page resources according to a preset eviction mechanism. When the CDN cluster again receives a request for a deleted web resource, it must query the caches of multiple node tiers or load the resource from the origin site, which lengthens the response time for the web page resource and increases the latency seen by the user.
Summary of the invention
The object of the invention is to provide a web page resource processing method and device that reduce redundant data in the caching system.
In a first aspect, an embodiment of the invention provides a web page resource processing method, the method including:
obtaining a web page resource from a web server in response to a web page resource request sent by a user terminal;
generating header cache data and body cache data from the web page resource, where the header cache data includes the response header of the web page resource and a first key pointing to the body cache data, the body cache data includes the response body of the web page resource, and body cache data with identical content has the same first key; and
writing the header cache data and the body cache data to a caching system separately.
In a second aspect, an embodiment of the invention provides a web page resource processing method, the method including:
when a CDN node receives a request to load a web page resource, the CDN node obtains the resource identifier of the web page resource to be loaded that is carried in the request and, using the resource identifier as a key, retrieves the valid cache data corresponding to that key from the caching system;
when valid cache data corresponding to the key is found, the CDN node parses it and determines whether its format matches the predetermined cache format of header cache data, where the predetermined cache format of header cache data includes a first key pointing to the corresponding body cache data, and identical body cache data has the same first key; and
if so, the CDN node obtains the first key from the valid cache data and uses it to query the corresponding body cache data; when the body cache data corresponding to the first key is found, the web page resource to be loaded is obtained from the valid cache data and that body cache data.
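The lookup in the second aspect can be sketched as follows. This is a minimal illustration, not the patent's implementation: the cache is modelled as a plain in-memory dictionary, `load_resource` is a hypothetical name, and the `body-key:` first-line marker is the field the description introduces later for the first embodiment. The passage does not say what happens when the format check fails, so the sketch simply reports a miss in that case.

```python
from typing import Dict, Optional, Tuple

# Assumed marker for header cache data; the description later names a
# "body-key:" field placed on the first line of the value.
MARKER = b"body-key:"


def load_resource(cache: Dict[str, bytes], uri: str) -> Optional[Tuple[bytes, bytes]]:
    """Return (response header block, response body) for uri, or None on a miss."""
    # Step 1: look up the entry stored under the resource identifier.
    entry = cache.get(uri)
    if entry is None:
        return None

    # Step 2: check whether the entry has the header cache data format.
    first_line, _, header_block = entry.partition(b"\n")
    if not first_line.startswith(MARKER):
        return None  # handling of other formats is outside this sketch

    # Step 3: follow the first key to the shared body cache data.
    first_key = first_line[len(MARKER):].strip().decode("ascii")
    body = cache.get(first_key)
    if body is None:
        return None  # the body entry is missing, so the resource cannot be rebuilt

    return header_block, body
```

Two dictionary reads are needed per hit, one for the header entry and one for the body entry, which is the price paid for storing each distinct body only once.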
In a third aspect, an embodiment of the invention provides a web page resource processing device, the device including:
a web page resource acquisition unit, configured to obtain a web page resource from a web server in response to a web page resource request sent by a user terminal;
a cache data generation unit, configured to generate header cache data and body cache data, where the header cache data includes the response header of the web page resource and a first key pointing to the body cache data, the body cache data includes the response body of the web page resource, and body cache data with identical content has the same first key; and
a cache data writing unit, configured to write the header cache data and the body cache data to a caching system separately.
In a fourth aspect, an embodiment of the invention provides a web page resource processing device arranged in a CDN node, the device including:
a resource retrieval unit, configured to obtain, when a request to load a web page resource is received, the resource identifier of the web page resource to be loaded that is carried in the request and, using the resource identifier as a key, to retrieve the valid cache data corresponding to that key from the caching system;
a resource parsing unit, configured to parse the valid cache data when valid cache data corresponding to the key is found and to determine whether its format matches the predetermined cache format of header cache data, where the predetermined cache format of header cache data includes a first key pointing to the corresponding body cache data, and identical body cache data has the same first key; and
a resource obtaining unit, configured to obtain the first key from the valid cache data when the resource parsing unit determines that the format of the valid cache data matches the predetermined cache format of header cache data, to query the body cache data corresponding to the first key, and, when that body cache data is found, to obtain the web page resource to be loaded from the valid cache data and the body cache data corresponding to the first key.
The web page resource processing method and device provided by the embodiments of the invention generate header cache data and body cache data from a web page resource obtained from a web server. The header cache data includes the response header of the web page resource and a first key pointing to the body cache data, the body cache data includes the response body of the web page resource, and body cache data with identical content has the same first key. Different web page resources whose body cache data has identical content therefore each keep their own header cache data in the HTTP cache while all pointing to a single copy of the body cache data, which reduces redundant data in the caching system and improves its utilization.
Other features and advantages will be set forth in the description that follows and will in part be apparent from the description, or may be learned by practicing the embodiments of the invention. The objects and other advantages of the invention can be realized and obtained by the structure particularly pointed out in the written description, the claims, and the accompanying drawings.
Brief description of the drawings
To explain the technical solutions of the embodiments of the invention or of the prior art more clearly, the drawings needed for the embodiments are briefly introduced below. The drawings described below show only some embodiments of the invention; a person of ordinary skill in the art can derive other drawings from them without creative effort. The above and other objects, features, and advantages of the invention will become clearer from the drawings. The same reference numerals denote the same parts throughout the drawings. The drawings are deliberately not drawn to scale; the emphasis is on showing the gist of the invention.
Fig. 1 is a schematic diagram of the application environment of the web page resource processing method and device provided by the embodiments of the invention;
Fig. 2 is a structural block diagram of a CDN node provided by an embodiment of the invention;
Fig. 3 is a structural block diagram of the web page resource processing device provided by the first embodiment of the invention;
Fig. 4 is a structural block diagram of the web page resource processing device provided by the second embodiment of the invention;
Fig. 5 is a structural block diagram of the web page resource processing device provided by the third embodiment of the invention;
Fig. 6 is a flowchart of the web page resource processing method provided by the fourth embodiment of the invention;
Fig. 7 is a flowchart of the web page resource processing method provided by the fifth embodiment of the invention;
Fig. 8 is a flowchart of the web page resource processing method provided by the sixth embodiment of the invention.
Detailed description
The web page resource processing method and device provided by the embodiments of the invention can be applied in the application environment shown in Fig. 1. As shown in Fig. 1, the user terminal 100, the CDN node 200, and the web server 300 are located in a wireless or wired network 400, through which the user terminal 100 communicates with the CDN node 200.
In the embodiments of the invention, the user terminal 100 is preferably a mobile terminal device, for example a smartphone, tablet computer, e-book reader, laptop computer, in-vehicle computer, or wearable mobile terminal.
Fig. 2 shows a structural block diagram of a CDN node that can be used in the embodiments of the invention. As shown in Fig. 2, the CDN node 200 includes one or more processors 201 (only one is shown in the figure), a storage controller 202, a memory 203, a peripheral interface 204, and a communication module 205. These components communicate with one another over one or more communication buses/signal lines 206.
The memory 203 can store software programs and modules, such as the program instructions/modules corresponding to the web page resource processing method and device of the embodiments of the invention. By running the software programs and modules stored in the memory 203, the processor 201 performs various functional applications and data processing, such as the web page resource processing method provided by the embodiments of the invention. The memory 203 may include high-speed random access memory and may also include non-volatile memory, such as one or more magnetic storage devices, flash memory, or other non-volatile solid-state memory. Access to the memory 203 by the processor 201 and other possible components can take place under the control of the storage controller 202.
The peripheral interface 204 couples various input/output devices to the processor 201 and the memory 203. In some embodiments, the peripheral interface 204, the processor 201, and the storage controller 202 can be implemented in a single chip. In other examples, they can each be implemented by a separate chip.
It will be appreciated that the structure shown in Fig. 2 is only illustrative; the CDN node 200 may include more or fewer components than shown in Fig. 2, or have a configuration different from that shown in Fig. 2. Each component shown in Fig. 2 can be implemented in hardware, software, or a combination of the two.
The web page resource processing method and device proposed by the embodiments of the invention provide a new HTTP cache storage and query mechanism for web page resources, and are applicable to the CDN node 200. In the embodiments of the invention, a browser is installed on the user terminal 100; it works with the CDN node 200 to provide services to the user.
By analysing sampled data from different web pages, the inventors found that some web page resources, such as JavaScript or CSS resources, have different web addresses but may have identical content. Examples are the open-source front-end jQuery JavaScript library (or the jQuery CSS library). Suppose a CDN service node cluster processes three pages in succession, each of which references the jQuery library; its HTTP cache space then holds three copies of the cached data. If the JavaScript data of the jQuery library referenced by the three pages is identical, the HTTP cache space used to store two of the response bodies is wasted. Assuming the response body is 210 KB in size, 420 KB of cache space is wasted.
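The arithmetic behind the 420 KB figure is simply the body size multiplied by the number of surplus copies. A small sketch, in which `wasted_bytes` is a hypothetical helper and 210 KB is taken as 210,000 bytes:

```python
def wasted_bytes(copies: int, body_size: int) -> int:
    """Bytes spent on duplicate bodies when every copy is stored separately."""
    # Only one copy is needed; every further copy is redundant.
    return max(copies - 1, 0) * body_size


# Three pages reference the same 210 KB library body.
print(wasted_bytes(3, 210_000))  # 420000 bytes, i.e. 420 KB
```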
Based on the information in a web page resource, the method and device provided by the embodiments of the invention generate header cache data that includes the response header of the web page resource and body cache data that includes its response body. The header cache data includes a first key pointing to the body cache data, and identical body cache data has the same first key. Different web page resources with identical body cache data therefore each keep their own header cache data in the HTTP cache while all pointing to one copy of the body cache data. This reduces redundancy in the cached data and lets the HTTP cache space store more cached data. In the scenario above, for example, nearly 420 KB of cache space can be saved.
It should be noted that the HTTP cache service system in the embodiments of the invention is implemented with a key-value database (or a key-value-like database), and data storage and retrieval follow these basic properties of key-value databases: cached items are independent of one another; each item consists of a key part and a value part, where the key serves as the index of the cached item, which eases management, and is unique, and the value is the data actually cached; and once the cached data has filled the cache space, existing cached data must be evicted to make room before new data can be cached.
In the embodiments of the invention, data cached in the existing HTTP cache format is defined as entity cache data. Entity cache data contains every part of the HTTP response for a web page resource, that is, the status line, the response header, and the response body.
In the embodiments of the invention, different web page resources with identical response bodies are defined as same-value resources, and data cached in the new cache format provided by the embodiments is defined as same-value cache data. Because multiple same-value resources must be able to share one copy of body cache data in the HTTP cache, two new cache formats are introduced: the predetermined cache format of header cache data and the predetermined cache format of body cache data.
In the embodiments of the invention, the key of header cache data is the URI (Uniform Resource Identifier) of the web page resource; a URI uniquely identifies a web page resource. The value of header cache data includes the serialized status line, the response header, and the first key pointing to the corresponding body cache data. The first key is the index that associates the response header of a web page resource with its body cache data. Header cache data does not include the response body.
In the embodiments of the invention, body cache data uses the first key as its key, and its value is the response body of the corresponding web page resource. The first key consists of the type name of the web page resource and a character string generated by encoding a hash value computed from the content of the body cache data (that is, the response body).
Because same-value resources have identical body cache data, when the body cache data of same-value cache data is written and the HTTP cache space already holds identical body cache data, the newly written body cache data overwrites the existing copy. In other words, for multiple same-value resources, the HTTP cache space stores each resource's own header cache data, but because their body cache data is identical, only one copy of it is stored. This reduces redundancy in the cached data.
The technical solutions in the embodiments of the invention are described clearly and completely below with reference to the drawings. The described embodiments are only some, not all, of the embodiments of the invention. The components of the embodiments generally described and illustrated in the drawings can be arranged and designed in a variety of configurations. The following detailed description of the embodiments shown in the drawings is therefore not intended to limit the scope of the claimed invention but merely represents selected embodiments. All other embodiments obtained by those skilled in the art from the embodiments of the invention without creative effort fall within the scope of protection of the invention.
It should be noted that similar reference numerals and letters denote similar items in the following drawings, so once an item is defined in one drawing it need not be defined or explained again in subsequent drawings. In the description of the invention, the terms "first", "second", and so on are used only to distinguish descriptions and are not to be understood as indicating or implying relative importance.
First embodiment
Fig. 3 shows a structural block diagram of the web page resource processing device provided by the first embodiment. Referring to Fig. 3, the device of this embodiment preferably runs on the CDN node 200. The device 20 provided by this embodiment includes:
A web page resource acquisition unit 21, configured to obtain a web page resource from the web server 300 in response to a web page resource request sent by the user terminal 100.
When the web page resource acquisition unit 21 receives a web page resource request from the user terminal 100 and no web page resource corresponding to the request is found in the caching system, it can request the web page resource from the resource's origin server. So that later requests for the same web page resource can be answered more quickly, the received web page resource can be stored in the caching system. In this embodiment, the web page resource request can be sent by the browser of the user terminal 100. In one implementation, the caching system is the HTTP caching system in the CDN cluster.
A cache data generation unit 22, configured to generate header cache data and body cache data, where the header cache data includes the response header of the web page resource and a first key pointing to the body cache data, the body cache data includes the response body of the web page resource, and body cache data with identical content has the same first key.
The first key includes the type name of the web page resource and a character string generated by encoding a hash value computed from the response body of the web page resource. For example, the first key can take the form: the schema of the web page resource, followed by "://", followed by the 24-byte character string produced by base64-encoding the md5 value of the body content.
Here the schema of a web page resource is its type name, for example "Js" or "Css" (case-sensitive); it can of course be extended to accommodate more resource types. The main purpose of the schema is to distinguish resource categories at the key level. Js means the body cache data is the response body data of a JavaScript resource. Css means the body cache data is the response body data of a CSS resource.
The "24-byte character string produced by base64-encoding the md5 value of the body content" in the first key is mainly intended to keep each stored body cache data entry unique. Because the md5 value is a hash computed from the data content (rather than the data size), different content will, for practical purposes, produce different hash values even across a very large data space.
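The key format described above can be sketched with Python's standard `hashlib` and `base64` modules. The function name `first_key` is illustrative only; the 24-character length follows from base64-encoding the 16 raw bytes of an MD5 digest.

```python
import base64
import hashlib


def first_key(schema: str, body: bytes) -> str:
    """Build '<schema>://<base64 of the MD5 digest of body>'."""
    digest = hashlib.md5(body).digest()                  # 16 raw bytes
    encoded = base64.b64encode(digest).decode("ascii")   # 24 characters incl. padding
    return f"{schema}://{encoded}"
```

Identical bodies always map to the same key, so the key can serve as the deduplication index. MD5 collisions are possible in principle, and MD5 is not collision-resistant against deliberately crafted input, so a deployment that caches untrusted content might prefer a stronger hash; that choice is outside what the text specifies.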
A cache data writing unit 23, configured to write the header cache data and the body cache data to the caching system separately.
Caching the body cache data and the header cache data means writing each of them to the caching system; in one implementation, they are written to the HTTP caching system of the CDN cluster. If the HTTP cache space of the HTTP caching system already holds body cache data identical to the body cache data being written, the new copy overwrites the existing one. In other words, for multiple same-value resources, the HTTP cache space stores each resource's own header cache data, but because their body cache data is identical, only one copy of it is stored, which reduces redundancy in the cached data.
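The write path can be sketched as below, under stated assumptions: the cache is a plain dictionary standing in for the key-value store, `store` is a hypothetical helper, and the header entry is laid out with a `body-key:` line first, which is only one possible serialization.

```python
import base64
import hashlib
from typing import Dict


def store(cache: Dict[str, bytes], uri: str, schema: str,
          header_block: bytes, body: bytes) -> str:
    """Write one header entry per URI and one shared body entry per distinct body."""
    digest = base64.b64encode(hashlib.md5(body).digest()).decode("ascii")
    body_key = f"{schema}://{digest}"

    # Header cache data: keyed by URI, carries the pointer to the body entry.
    cache[uri] = b"body-key: " + body_key.encode("ascii") + b"\n" + header_block

    # Body cache data: keyed by the first key. Writing an identical body again
    # overwrites the existing entry, so only one copy is ever kept.
    cache[body_key] = body
    return body_key
```

Storing the same body under three different URIs leaves four entries in the dictionary: three header entries and one body entry.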
Further, after the body cache data and the header cache data have been cached, the method can also include setting the expiry time of the body cache data to 0. An expiry time of 0 means the body cache data never expires, which preserves the body cache data for as long as possible.
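One way a cache might interpret this convention is sketched below. The function name, the use of Unix timestamps, and the `now` parameter are assumptions made for illustration; the text only states that an expiry time of 0 means the entry never expires.

```python
import time
from typing import Optional

NEVER_EXPIRES = 0  # sentinel expiry time used for body cache data


def is_expired(expires_at: float, now: Optional[float] = None) -> bool:
    """Return True if an entry with the given expiry timestamp has expired."""
    if expires_at == NEVER_EXPIRES:
        return False  # an expiry time of 0 is treated as "never expires"
    current = time.time() if now is None else now
    return current >= expires_at
```

An entry that never expires can still be removed when the cache is full and an eviction policy selects it.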
Because the HTTP cache space is limited, once the cached data has filled it, existing cached data must still be evicted to make room for new data. As a specific eviction policy, the LRU (Least Recently Used) algorithm can be used, for example. To a certain extent, LRU keeps frequently accessed, valid data near the front of the cache queue, while rarely accessed or invalid data is pushed toward the tail of the queue, where it is easy to evict. Other eviction policies can of course be used; the specific implementation of the invention is not limited in this respect.
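A minimal LRU cache can be built on `collections.OrderedDict`. This sketch counts capacity in entries rather than bytes, which is a simplification; a real HTTP cache would more likely bound the total size of the stored values. The class name `LRUCache` is illustrative.

```python
from collections import OrderedDict
from typing import Optional


class LRUCache:
    """Key-value cache that evicts the least recently used entry when full."""

    def __init__(self, capacity: int) -> None:
        if capacity < 1:
            raise ValueError("capacity must be at least 1")
        self.capacity = capacity
        self._items: "OrderedDict[str, bytes]" = OrderedDict()

    def get(self, key: str) -> Optional[bytes]:
        if key not in self._items:
            return None
        self._items.move_to_end(key)  # mark as most recently used
        return self._items[key]

    def put(self, key: str, value: bytes) -> None:
        if key in self._items:
            self._items.move_to_end(key)
        self._items[key] = value
        while len(self._items) > self.capacity:
            self._items.popitem(last=False)  # drop the least recently used entry
```

With shared bodies, eviction can remove a body entry while header entries still point to it. A lookup that finds the header entry but not the body entry then has to be handled like a miss.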
The web page resource processing method of this embodiment is illustrated below with a concrete example. The addresses and HTTP response message headers of three jQuery JavaScript web page resources are as follows:
Web page resource one
Address (URI):
http://spuvvn.edu/bitrix/templates/sardar_patel/js/jquery-ui-1.8.16.custom.min.js
Response header:
HTTP/1.0 200 OK
Date: Wed, 08 Jul 2015 06:43:45 GMT
Server: Apache/2.2.27 (Unix) mod_ssl/2.2.27 OpenSSL/1.0.1e-fips DAV/2 mod_jk/1.2.37 mod_bwlimited/1.4 PHP/5.3.28
Last-Modified: Thu, 16 Feb 2012 17:55:54 GMT
ETag: "31000b8-3361f-4b9188a179e80"
Accept-Ranges: bytes
Content-Length: 210463
Content-Type: application/javascript
X-Cache: MISS from devy.ucweb.local
X-Cache-Lookup: MISS from devy.ucweb.local:3128
Via: 1.0 devy.ucweb.local:3128 (squid/2.6.STABLE21)
Proxy-Connection: close
Web page resource two
Address (URI):
http://m.sportzwiki.com/assets/js/jquery-ui-1.8.16.custom.min.js
Response header:
HTTP/1.0 200 OK
Date: Wed, 08 Jul 2015 06:47:55 GMT
Content-Type: application/javascript
Content-Length: 210463
Set-Cookie: __cfduid=d3d61c97334953c660457bbb5a0e183a51436338075; Expires=Thu, 07-Jul-16 06:47:55 GMT; Path=/; Domain=.sportzwiki.com; HttpOnly
Last-Modified: Wed, 18 Mar 2015 13:43:31 GMT
ETag: "94e599-3361f-5119044d006c0"
CF-Cache-Status: HIT
Expires: Mon, 13 Jul 2015 06:47:55 GMT
Cache-Control: public, max-age=432000
Accept-Ranges: bytes
Server: cloudflare-nginx
CF-RAY: 2029d72c77d30bab-HKG
X-Cache: MISS from devy.ucweb.local
X-Cache-Lookup: MISS from devy.ucweb.local:3128
Via: 1.0 devy.ucweb.local:3128 (squid/2.6.STABLE21)
Proxy-Connection: close
Web page resource three
Address (URI):
http://www.rcs-rds.ro/resources/jquery_ui/js/jquery-ui-1.8.16.custom.min.js
Response header:
HTTP/1.0 200 OK
X-Varnish: 1089339340
Vary: Accept-Encoding
X-Cache: MISS
Content-Type: application/javascript
Date: Wed, 08 Jul 2015 06:50:54 GMT
Accept-Ranges: bytes
Accept-Ranges: bytes
ETag: "503dde-3361f-4aeb148764ec0"
Last-Modified: Fri, 07 Oct 2011 08:32:35 GMT
Age: 0
Content-Length: 210463
X-Cache: MISS from devy.ucweb.local
X-Cache-Lookup: MISS from devy.ucweb.local:3128
Via: 1.1 varnish, 1.0 devy.ucweb.local:3128 (squid/2.6.STABLE21)
Proxy-Connection: close
As can be seen that the URI of three web page resources different and cross-domain (being in different domain names),
But from the point of view of the Content-Length field values of three web page resources, response text (body)
Size be all 210463 bytes (about 210KB).
Assuming that the content calculating md5 values to the response text (body) of three web page resources are entered again
Row base64 is encoded, and the check value drawn is all " ZcfHB93eoMeGFxTfJQ1UxA=="
It can so prove that the content of the response text (body) of three web page resources is just as
's.That is these three web page resources are exactly " with the value resource " described in the embodiment of the present invention.
The reason "same-value resources" exist is that most websites may be built from similar site templates. Because the templates are the same or similar, their front-end technology may well use some of the mainstream, powerful JavaScript libraries or CSS libraries, such as jQuery. Even with different site templates, websites that need to implement certain front-end features may still use the same mainstream JavaScript or CSS libraries. So among the external JavaScript resources (or CSS resources) of different websites, a certain proportion should be same-value resources. In the case of the three jQuery resources above, even the names are identical. There are also cases where the names differ but the resource contents are the same; typically the website has changed the name, but the resource name usually retains the keyword of the library name, such as jquery.
The first key value of the body cache data of the three jQuery resources mentioned above can be expressed as:
Js://ZcfHB93eoMeGFxTfJQ1UxA==
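A minimal sketch of how such a key could be assembled, assuming (as the example above and claim 2 suggest) that it is the resource type name, the separator "://", and the base64-encoded md5 of the body. The function name `first_key` is hypothetical.

```python
import base64
import hashlib


def first_key(type_name: str, body: bytes) -> str:
    """Build '<type name>://<base64(md5(body))>' for a response body."""
    digest = hashlib.md5(body).digest()
    return f"{type_name}://{base64.b64encode(digest).decode('ascii')}"


# The key depends only on the type name and the body bytes, never on the URI,
# so identical bodies fetched from different sites map to one key.
key_a = first_key("Js", b"/* same library */")
key_b = first_key("Js", b"/* same library */")
```

Because the URI plays no part in the key, any number of header entries can point at a single stored body.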
As can be seen from the above, a web page resource stored using same-value caching, compared with the existing http caching approach, is actually stored as two pieces of http cache data: one is the header cache data and the other is the body cache data. For the header cache data, a special field "body-key:" can be added to the response header fields in its value to refer to the first key value of the corresponding body cache data; preferably, this field is placed on the first line of the value.
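The placement of the "body-key:" field can be illustrated with a small sketch. The serialization below (CRLF-separated lines, no space after the colon) and both function names are assumptions made for illustration; the text only states that the field is preferably the first line of the value.

```python
from typing import List, Optional

BODY_KEY_FIELD = "body-key:"


def make_header_cache_value(body_key: str, response_headers: List[str]) -> str:
    """Put the body-key field on the first line, followed by the response headers."""
    return "\r\n".join([f"{BODY_KEY_FIELD}{body_key}", *response_headers])


def parse_body_key(cache_value: str) -> Optional[str]:
    """Return the first key value if the value is header cache data, else None."""
    first_line = cache_value.split("\r\n", 1)[0]
    if first_line.startswith(BODY_KEY_FIELD):
        return first_line[len(BODY_KEY_FIELD):].strip()
    return None


value = make_header_cache_value(
    "Js://ZcfHB93eoMeGFxTfJQ1UxA==",
    ["HTTP/1.1 200 OK", "Content-Length:210463", "Age:0"],
)
```

Keeping the field on the first line means a reader can tell header cache data from existing-format cache data by inspecting one line, without parsing the whole value.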
The formats of the header cache data and the body cache data are illustrated below using a jQuery resource.
Predetermined cache format of the header cache data:
The web page resource processing method provided by this embodiment can reduce the redundancy of cache data, so that the http cache space can store more cache data. A capacity-limited http cache can thus hold more cache data, which improves the http cache hit rate to some extent, maximizes the benefit, and reduces the response time of web page resources.
It should be noted that each unit in this embodiment may be implemented in software code, in which case the units may be stored in the memory 203 of the CDN node 200. The units may equally be implemented in hardware, such as an integrated circuit chip. It will be appreciated that the web page resource processing device 30 may also run on another server for caching web page resources that is connected between the user terminal 100 and the web page server 300.
Second embodiment
Fig. 4 shows a structural block diagram of the web page resource processing device provided by the second embodiment. Referring to Fig. 4, the web page resource processing device 30 provided by the second embodiment of the present invention preferably runs on the CDN node 200 and includes:
A web page resource acquiring unit 31, configured to obtain a web page resource from the web page server 300 in response to a web page resource request sent by the user terminal 100;
A cache data generating unit 32, configured to generate header cache data and body cache data, where the header cache data includes the response header of the web page resource and a first key value pointing to the body cache data, and the body cache data includes the response body of the web page resource, wherein body cache data with identical content has the identical first key value;
A cache data writing unit 33, configured to write the header cache data and the body cache data separately into the caching system.
Preferably, the web page resource processing device 30 further includes:
A first judging unit 34, configured to judge whether the web page resource is cacheable; if it is cacheable, the cache data generating unit 32 generates the header cache data and the body cache data according to the web page resource; otherwise, no caching is performed.
A second judging unit 35, configured to judge whether the web page resource meets a predetermined condition;
If the predetermined condition is met, the cache data generating unit 32 generates the header cache data and the body cache data according to the web page resource.
Otherwise, the cache data generating unit 32 generates entity cache data according to the information of the web page resource, and the cache data writing unit 33 writes the entity cache data directly into the caching system; the entity cache data includes all http response data of the web page resource.
Because the same-value caching proposed by the embodiments of the present invention writes the header cache data and the body cache data to the cache separately, it adds one write operation compared with an existing cache write, and the corresponding read also takes one more operation. To improve efficiency, therefore, the web page resources cached by the same-value caching method can be restricted: before same-value caching is performed, the web page resource is first checked against a predetermined condition. If the condition is met, the resource is handled by same-value caching; otherwise, it is handled in the existing caching manner.
In this embodiment, the predetermined condition may include one or a combination of the following:
The type of the web page resource is a preset type;
The size of the web page resource is greater than a predetermined threshold; and
The name of the web page resource exists in a predetermined keyword list.
Among the various kinds of web page resources, cacheable resources are concentrated in external JavaScript (an interpreted scripting language), CSS (a style sheet language) and image resources. JavaScript and CSS resources have a definite influence on the parsing and rendering speed of a page. Moreover, JavaScript and CSS resources are relatively likely to be same-value resources, whereas pages and images are less likely to be, so the type of web page resource can be restricted. In this embodiment, the preset type is preferably the JavaScript type or the CSS type; it suffices for the type of the web page resource to be one of these two preset types.
Judging whether the size of a web page resource exceeds a predetermined threshold allows same-value caching to be applied to same-value resources in a more targeted way. In this embodiment, the predetermined size threshold may be, for example, 50 KB; the threshold can of course be adjusted according to actual conditions and is not a limitation on the embodiments of the present invention.
Whether the name (file name) of a web page resource exists in the predetermined keyword list means whether the name matches a keyword in the list of resources for which the same-value caching strategy is used. For example, "jquery" is one such keyword.
These judgment conditions can be configured on the CDN node and controlled through a back-end delivery mechanism. A cache control switch can also be provided to turn the function on or off. Suppose the setting requires all three conditions to be met at once; then same-value caching is triggered when a web page resource satisfies all three conditions simultaneously.
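The three conditions and the control switch can be sketched as a simple predicate. This is an illustrative sketch under stated assumptions: the class and function names are hypothetical, 50 KB is taken as 50 * 1024 bytes, and name matching is done by case-insensitive substring search, none of which the text specifies.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SameValuePolicy:
    """Hypothetical configuration a CDN node might receive from its back end."""
    enabled: bool = True                                  # cache control switch
    types: frozenset = frozenset({"javascript", "css"})   # preset types
    min_size: int = 50 * 1024                             # threshold (assumed bytes)
    keywords: tuple = ("jquery",)                         # predetermined keyword list


def use_same_value_cache(policy: SameValuePolicy, resource_type: str,
                         size: int, name: str) -> bool:
    """Return True only when the switch is on and all three conditions hold."""
    if not policy.enabled:
        return False
    type_ok = resource_type.lower() in policy.types
    size_ok = size > policy.min_size
    name_ok = any(keyword in name.lower() for keyword in policy.keywords)
    return type_ok and size_ok and name_ok


policy = SameValuePolicy()
```

Requiring all three conditions is only the configuration assumed in the paragraph above; a deployment that accepts any one condition would combine the three flags with `or` instead.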
It should be noted that each unit in this embodiment may be implemented in software code, in which case the units may be stored in the memory 203 of the CDN node 200. The units may equally be implemented in hardware, such as an integrated circuit chip.
Third embodiment
Fig. 5 shows a structural block diagram of the web page resource processing device provided by the third embodiment. Referring to Fig. 5, the web page resource processing device 40 provided by the third embodiment of the present invention preferably runs on the CDN node 200 and includes:
A resource retrieval unit 41, configured to, when a request to load a web page resource is received, obtain the resource identifier of the web page resource to be loaded carried in the request, and, using the resource identifier as a first key value, retrieve valid cache data corresponding to that first key value in the caching system;
A resource parsing unit 42, configured to, when valid cache data corresponding to the key value is found, parse the valid cache data found and judge whether its format conforms to the predetermined cache format of header cache data, wherein the predetermined cache format of the header cache data includes a first key value pointing to the corresponding body cache data, and identical body cache data has the identical first key value;
A resource obtaining unit 43, configured to, when the resource parsing unit judges that the format of the valid cache data conforms to the predetermined cache format of header cache data, obtain the first key value from the valid cache data, query the body cache data corresponding to the first key value according to the obtained first key value, and, when body cache data corresponding to the first key value is found, obtain the web page resource to be loaded based on the valid cache data and the body cache data corresponding to the first key value.
As one implementation, when the CDN node receives a request, for example an http resource request, and needs to load an external web page resource, it uses the resource identifier (URI) of the resource as the key to query the http cache service system of the CDN cluster for saved cache data (a get operation). The http cache service system retrieves the cache data corresponding to this key from the data queue of the cache space (the first get operation). If no corresponding cache data exists, or the cache data has expired, the CDN node determines that the http cache for the web page resource is missed and requests the web page resource to be loaded from the origin web page server 300.
If the query finds a result, the CDN node parses the value of the valid cache data found. If it is found to be entity cache data, that is, cache data in the existing format, the information of the web page resource to be loaded is obtained directly from the valid cache data found. If it is found to be header cache data, the CDN node, based on the header cache format, parses the key of the body cache data, namely the first key value, from the "body-key" field of the response header, and uses this first key value to query the http caching system again (the second get operation). If the body cache data exists and is valid, the http caching system returns it to the CDN node, and the CDN node obtains the information of the web page resource to be loaded based on the header cache data and the body cache data. If the target body cache data does not exist, or exists but has expired, the http cache for the web page resource to be loaded is judged to be missed, and the web page resource to be loaded is requested from the web server.
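The read path with its two get operations can be sketched as below. This is a minimal model under assumptions: the cache is a plain dictionary, "valid" simply means "present" (expiry is not modeled), header cache data is stored as bytes whose first line is the "body-key:" field, and the function name `lookup` is hypothetical.

```python
from typing import Dict, Optional

BODY_KEY_FIELD = b"body-key:"


def lookup(cache: Dict[str, bytes], uri: str) -> Optional[bytes]:
    """Return the full cached http response for `uri`, or None on a cache miss."""
    value = cache.get(uri)                       # first get operation, keyed by URI
    if value is None:
        return None                              # miss: caller fetches from origin
    first_line, _, header_part = value.partition(b"\r\n")
    if not first_line.startswith(BODY_KEY_FIELD):
        return value                             # entity cache data (existing format)
    body_key = first_line[len(BODY_KEY_FIELD):].decode("ascii")
    body = cache.get(body_key)                   # second get operation, keyed by body-key
    if body is None:
        return None                              # body missing: treated as a miss
    return header_part + b"\r\n\r\n" + body      # reassemble header and body


# Hypothetical cache contents: one header entry, its shared body, one entity
# entry, and one header entry whose body is no longer present.
cache = {
    "http://a.example/jquery.js": b"body-key:Js://k1\r\nHTTP/1.1 200 OK\r\nContent-Length:3",
    "Js://k1": b"abc",
    "http://a.example/logo.png": b"HTTP/1.1 200 OK\r\nContent-Length:1\r\n\r\nx",
    "http://a.example/orphan.js": b"body-key:Js://gone\r\nHTTP/1.1 200 OK",
}
```

Only header cache data pays for the second lookup; entity cache data is served after a single get, as in the existing caching manner.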
Based on the web browsing data of multiple users over a period of time, the inventor counted the JS resources related to jquery. There were 1565 JS resources in total, of which 743 were content-duplicated resources (same-value resources). After these 743 same-value resources were deduplicated by content (for example, if A, B and C are same-value resources, only A is kept and B and C are removed), 141 resources remained, so the average repetition index is about 4.27 (total number of duplicate resources removed / total number of resources after deduplication). It can thus be seen that the web page resource processing method provided by this embodiment can reduce the redundancy of cache data, so that the http cache space can store more cache data.
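The reported figure can be checked with a few lines of arithmetic. The variable names are illustrative; the three input counts come from the paragraph above, and the last two lines count stored bodies rather than bytes, since the text gives no size data.

```python
total_js = 1565        # jquery-related JS resources observed
same_value = 743       # resources whose content duplicates another resource
after_dedup = 141      # distinct contents left after deduplication

removed = same_value - after_dedup           # duplicates removed: 602
repetition_index = removed / after_dedup     # 602 / 141, about 4.27

# Bodies that would need storing with and without same-value caching.
bodies_without = total_js
bodies_with = total_js - removed             # 963
```

On these counts, same-value caching would store 963 bodies instead of 1565 for the same set of resources.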
It should be noted that each unit in this embodiment may be implemented in software code, in which case the units may be stored in the memory 203 of the CDN node 200. The units may equally be implemented in hardware, such as an integrated circuit chip.
Fourth embodiment
Fig. 6 shows a flow chart of a web page resource processing method provided by the fourth embodiment of the present invention. Referring to Fig. 6, this embodiment describes the processing flow of the CDN node, and the method includes:
Step S510: obtain a web page resource from the web page server 300 in response to a web page resource request sent by the user terminal 100. It will be appreciated that the web page resource acquiring unit 21 provided by the first embodiment can perform step S510.
Step S520: according to the information of the web page resource, generate header cache data including the response header of the web page resource and body cache data including the response body of the web page resource, where the header cache data includes a first key value pointing to the body cache data, and identical body cache data has the identical first key value. It will be appreciated that the cache data generating unit 22 provided by the first embodiment can perform step S520.
Step S530: write the header cache data and the body cache data separately into the caching system. It will be appreciated that the cache data writing unit 23 provided by the first embodiment can perform step S530.
Further, as a preferred implementation, after the cache data writing unit 23 writes the header cache data and the body cache data separately into the caching system, the method may also include: setting the expiration time of the body cache data to 0. Setting the expiration time of the body cache data to 0 means setting the body cache data to never expire, which ensures the persistence of the body cache data to the greatest extent.
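The convention that an expiration time of 0 means "never expires" can be illustrated with a toy in-memory cache. The class below is a hypothetical stand-in, not the caching system the patent refers to; the injectable clock exists only so that the behavior can be demonstrated without waiting.

```python
import time
from typing import Callable, Dict, Optional, Tuple


class ExpiringCache:
    """Toy cache in which expire_seconds == 0 means the entry never expires."""

    def __init__(self, clock: Callable[[], float] = time.monotonic) -> None:
        self._clock = clock
        self._items: Dict[str, Tuple[bytes, Optional[float]]] = {}

    def set(self, key: str, value: bytes, expire_seconds: float = 0) -> None:
        # A deadline of None marks an entry that is kept indefinitely.
        deadline = None if expire_seconds == 0 else self._clock() + expire_seconds
        self._items[key] = (value, deadline)

    def get(self, key: str) -> Optional[bytes]:
        item = self._items.get(key)
        if item is None:
            return None
        value, deadline = item
        if deadline is not None and self._clock() >= deadline:
            del self._items[key]                 # expired entries are dropped on read
            return None
        return value


now = [0.0]                                      # fake clock for the demonstration
demo = ExpiringCache(clock=lambda: now[0])
demo.set("Js://k1", b"shared body", expire_seconds=0)            # body: never expires
demo.set("http://a.example/jquery.js", b"header", expire_seconds=60)
now[0] = 3600.0                                  # one hour later
```

After the simulated hour, the header entry has lapsed while the shared body is still available to any header entry that points at it.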
The web page resource processing method provided by this embodiment can reduce the redundancy of cache data, so that the http cache space can store more cache data. A capacity-limited http cache can thus hold more cache data, which improves the http cache hit rate to some extent, maximizes the benefit, and reduces the response time of web page resources.
Fifth embodiment
Fig. 7 shows a flow chart of a web page resource processing method provided by the fifth embodiment of the present invention. Referring to Fig. 7, this embodiment describes the processing flow of the CDN node, and the method includes:
Step S610: obtain a web page resource from the web page server 300 in response to a web page resource request sent by the user terminal 100. It will be appreciated that the web page resource acquiring unit 31 provided by the second embodiment can perform step S610.
Step S620: judge whether the web page resource is cacheable. It will be appreciated that the first judging unit 34 provided by the second embodiment can perform step S620.
If so, perform step S630. Otherwise, no caching is performed, that is, the cache write flow ends.
Step S630: judge whether the web page resource meets a predetermined condition. It will be appreciated that the second judging unit 35 provided by the second embodiment can perform step S630.
If the predetermined condition is met, perform step S640; otherwise, perform step S660.
In this embodiment, the predetermined condition may include one or a combination of the following:
The type of the web page resource is a preset type;
The size of the web page resource is greater than a predetermined threshold; and
The name of the web page resource exists in a predetermined keyword list.
Step S640: generate header cache data including the response header of the web page resource and body cache data including the response body of the web page resource. It will be appreciated that the cache data generating unit 32 provided by the second embodiment can perform step S640.
Step S650: write the header cache data and the body cache data separately into the caching system. It will be appreciated that the cache data writing unit 33 provided by the second embodiment can perform step S650.
Step S660: generate entity cache data according to the information of the web page resource, and cache the entity cache data directly. It will be appreciated that the cache data writing unit 33 provided by the second embodiment can perform step S660.
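Steps S620 to S660 can be sketched as a single write routine. This is an illustrative sketch under assumptions: the cache is a plain dictionary, the two judgments are passed in as booleans, the header cache data is serialized with the "body-key:" field on its first line, and the function name `store_response` is hypothetical.

```python
import base64
import hashlib
from typing import Dict


def store_response(cache: Dict[str, bytes], uri: str, type_name: str,
                   headers: bytes, body: bytes,
                   cacheable: bool, meets_condition: bool) -> None:
    """Write one fetched response into the cache following steps S620 to S660."""
    if not cacheable:                            # S620: not cacheable, flow ends
        return
    if not meets_condition:                      # S630 fails, so go to S660
        cache[uri] = headers + b"\r\n\r\n" + body    # entity cache data
        return
    # S640: derive the first key value from the body and build both entries.
    digest = base64.b64encode(hashlib.md5(body).digest()).decode("ascii")
    body_key = f"{type_name}://{digest}"
    header_value = b"body-key:" + body_key.encode("ascii") + b"\r\n" + headers
    # S650: two separate writes; an identical body overwrites itself.
    cache[uri] = header_value
    cache[body_key] = body


store = {}
store_response(store, "http://a.example/jquery.js", "Js",
               b"HTTP/1.1 200 OK\r\nAge:0", b"lib", True, True)
store_response(store, "http://b.example/jquery.js", "Js",
               b"HTTP/1.1 200 OK\r\nAge:5", b"lib", True, True)
store_response(store, "http://a.example/logo.png", "Img",
               b"HTTP/1.1 200 OK", b"x", True, False)
store_response(store, "http://a.example/private.js", "Js",
               b"HTTP/1.1 200 OK", b"secret", False, True)
```

The two jquery.js entries keep their own headers but share one stored body, which is the redundancy reduction the embodiments describe.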
Sixth embodiment
Fig. 8 shows a flow chart of a web page resource processing method provided by the sixth embodiment of the present invention. Referring to Fig. 8, this embodiment describes the flow by which the CDN node handles cache reads, and the method includes:
Step S710: when the CDN node receives a request to load a web page resource, obtain the resource identifier of the web page resource to be loaded carried in the request, and, using the resource identifier as a first key value, retrieve valid cache data corresponding to that first key value in the caching system. It will be appreciated that the resource retrieval unit 41 provided by the third embodiment can perform step S710.
If the query finds nothing, perform step S720; if it finds a result, perform step S730.
Step S720: request the web page resource to be loaded from the web page server 300, then perform step S610 to enter the caching flow, that is, the flow described in the fifth embodiment, which is not repeated here. It will be appreciated that the resource retrieval unit 41 provided by the third embodiment can perform step S720.
Step S730: parse the valid cache data found.
Step S740: judge whether the format of the valid cache data conforms to the predetermined cache format of header cache data. It will be appreciated that the resource parsing unit 42 provided by the third embodiment can perform step S730 and step S740.
If the format of the valid cache data conforms to the predetermined cache format of header cache data, that is, the valid cache data is header cache data, perform step S750. If not, perform step S770.
Step S750: obtain the first key value from the valid cache data, and query the corresponding target body cache data according to the obtained first key value.
If it exists and is valid, perform step S760; if the target body cache data does not exist, or exists but has expired, judge that the http cache for the web page resource to be loaded is missed, and perform step S720.
Step S760: obtain the information of the web page resource to be loaded based on the valid cache data and the target body cache data.
Step S770: obtain the information of the web page resource to be loaded directly from the valid cache data. It will be appreciated that the resource obtaining unit 43 provided by the third embodiment can perform step S750, step S760 and step S770.
It will be clear to those skilled in the art that, for convenience and brevity of description, the specific processes of the method embodiments described above may refer to the corresponding processes in the foregoing device embodiments, and are not repeated here.
In summary, the web page resource processing method and device provided by the embodiments of the present invention generate, according to the information of the web page resource obtained from the web page server, header cache data including the response header of the web page resource and body cache data including the response body of the web page resource. The header cache data includes a first key value pointing to the body cache data, and identical body cache data has the identical first key value. Different web page resources with identical body cache data can thus each have their own header cache data in the http cache while all pointing to the same body cache data, which reduces the redundancy of cache data, lets the http cache space store more cache data, and improves the http cache hit rate to some extent.
It should be noted that the embodiments in this specification are described progressively; each embodiment focuses on its differences from the other embodiments, and the same or similar parts of the embodiments may be referred to one another.
The web page resource processing device and system provided by the embodiments of the present invention have the same implementation principle and technical effect as the foregoing method embodiments. For brevity, where the device embodiments do not mention something, refer to the corresponding content in the foregoing method embodiments.
In addition, the flow charts and block diagrams in the drawings show the architecture, functions and operations of possible implementations of systems, methods and computer program products according to multiple embodiments of the present invention. In this regard, each block in a flow chart or block diagram may represent a module, a program segment or a part of code, which contains one or more executable instructions for implementing the specified logical function. It should also be noted that, in some alternative implementations, the functions marked in the blocks may occur in an order different from that marked in the drawings. For example, two consecutive blocks may in fact be executed substantially in parallel, or sometimes in the reverse order, depending on the functions involved. It should also be noted that each block in the block diagrams and/or flow charts, and combinations of blocks in the block diagrams and/or flow charts, can be implemented by a dedicated hardware-based system that performs the specified functions or actions, or by a combination of dedicated hardware and computer instructions.
The computer program product provided by the embodiments of the present invention includes a computer-readable storage medium storing program code; the instructions included in the program code can be used to perform the methods described in the foregoing method embodiments. For the specific implementation, refer to the method embodiments, which are not repeated here.
It will be clear to those skilled in the art that, for convenience and brevity of description, the specific working processes of the systems, devices and units described above may refer to the corresponding processes in the foregoing method embodiments, and are not repeated here.
In the several embodiments provided in this application, it should be understood that the disclosed systems, devices and methods may be implemented in other ways. The device embodiments described above are merely illustrative. For example, the division into units is only a division by logical function, and other divisions are possible in an actual implementation; as another example, multiple units or components may be combined or integrated into another system, or some features may be omitted or not performed. In addition, the couplings, direct couplings or communication connections shown or discussed may be indirect couplings or communication connections through communication interfaces, devices or units, and may be electrical, mechanical or of other forms.
The units described as separate components may or may not be physically separate, and the components shown as units may or may not be physical units; that is, they may be located in one place or distributed over multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of this embodiment.
In addition, the functional units in the embodiments of the present invention may be integrated into one processing unit, or each unit may exist physically on its own, or two or more units may be integrated into one unit.
If the functions are implemented in the form of software functional units and sold or used as independent products, they may be stored in a computer-readable storage medium. Based on this understanding, the technical solution of the present invention in essence, or the part that contributes to the prior art, or a part of the technical solution, may be embodied in the form of a software product. The computer software product is stored in a storage medium and includes instructions for causing a computer device (which may be a personal computer, a server, a network device, etc.) to perform all or part of the steps of the methods of the embodiments of the present invention. The foregoing storage medium includes various media that can store program code, such as a USB flash drive, a removable hard disk, a read-only memory (ROM, Read-Only Memory), a random access memory (RAM, Random Access Memory), a magnetic disk or an optical disc.
It should be noted that, in this document, relational terms such as first and second are used only to distinguish one entity or operation from another, and do not necessarily require or imply any such actual relationship or order between those entities or operations. Moreover, the terms "include", "comprise" and any variants thereof are intended to cover non-exclusive inclusion, so that a process, method, article or device that includes a series of elements includes not only those elements but also other elements not expressly listed, or elements inherent to such a process, method, article or device. Without further limitation, an element defined by the phrase "including a ..." does not exclude the existence of additional identical elements in the process, method, article or device that includes the element.
The above are only preferred embodiments of the present invention and are not intended to limit it; for those skilled in the art, the present invention may have various modifications and variations. Any modification, equivalent replacement or improvement made within the spirit and principles of the present invention shall fall within its scope of protection. It should be noted that similar reference numerals and letters denote similar items in the following drawings; therefore, once an item is defined in one drawing, it need not be further defined or explained in subsequent drawings.
Claims (14)
1. A web page resource processing method, characterized in that the method includes:
obtaining a web page resource from a web page server in response to a web page resource request sent by a user terminal;
generating header cache data and body cache data according to the web page resource, where the header cache data includes the response header of the web page resource and a first key value pointing to the body cache data, and the body cache data includes the response body of the web page resource, wherein body cache data with identical content has the identical first key value;
writing the header cache data and the body cache data separately into a caching system.
2. The method according to claim 1, characterized in that the first key value includes: the type name of the web page resource and a character string generated by re-encoding a hash value calculated from the body cache data of the web page resource.
3. The method according to claim 1, characterized in that, before generating the header cache data and the body cache data according to the web page resource, the method includes:
judging whether the web page resource is cacheable; if it is cacheable, performing the step of generating the header cache data and the body cache data according to the web page resource; otherwise, performing no caching.
4. The method according to claim 1, characterized in that, before generating the header cache data and the body cache data according to the web page resource, the method includes:
judging whether the web page resource meets a predetermined condition;
if the predetermined condition is met, performing the step of generating the header cache data and the body cache data according to the web page resource;
otherwise, generating entity cache data according to the information of the web page resource and caching the entity cache data directly, the entity cache data including all http response data of the web page resource.
5. The method according to claim 4, characterized in that the predetermined condition includes one or a combination of the following:
the type of the web page resource is a preset type;
the size of the web page resource is greater than a predetermined threshold; and
the name of the web page resource exists in a predetermined keyword list.
6. The method according to claim 1, characterized in that, after the body cache data and the header cache data are separately cached, the method further includes:
setting the expiration time of the body cache data to 0.
7. A web page data processing method, characterized in that the method includes:
when a CDN node receives a request to load a web page resource, the CDN node obtains the resource identifier of the web page resource to be loaded carried in the request, and, using the resource identifier as a first key value, retrieves valid cache data corresponding to that first key value in a caching system;
when valid cache data corresponding to the key value is found, the CDN node parses the valid cache data found and judges whether the format of the valid cache data conforms to the predetermined cache format of header cache data, wherein the predetermined cache format of the header cache data includes a first key value pointing to the corresponding body cache data, and identical body cache data has the identical first key value;
if so, the first key value is obtained from the valid cache data, the body cache data corresponding to the first key value is queried according to the obtained first key value, and, when body cache data corresponding to the first key value is found, the web page resource to be loaded is obtained based on the valid cache data and the body cache data corresponding to the first key value.
8. a kind of web page resources processing unit, it is characterised in that described device includes:
Web page resources acquiring unit, for the web page resources request that sends in response to user terminal
Web page resources are obtained from web page server;
Data cached generation unit, for generating, head is data cached and text is data cached,
The head data cached response header and the sensing text including the web page resources delay
First key assignments of deposit data, the data cached response including the web page resources of the text is just
Text, wherein, content identical text is data cached including the first key assignments described in identical;
Data cached writing unit, for respectively by the head it is data cached and it is described just
The data cached caching of text is written to caching system.
9. device according to claim 8, it is characterised in that the first key assignments bag
Include:The typonym of the web page resources and the text according to the web page resources
The character string generated after the data cached cryptographic Hash re-encoding calculated.
10. The device according to claim 8, characterized in that the device further comprises:
a first judging unit, configured to judge whether the web page resource is cacheable; if it
is cacheable, the step of generating header cache data and body cache data according to the
web page resource is performed; otherwise, no caching is performed.
11. The device according to claim 8, characterized in that the device further comprises:
a second judging unit, configured to judge whether the web page resource meets a
predetermined condition;
if the predetermined condition is met, the cache data generation unit performs the step of
generating header cache data and body cache data according to the web page resource;
otherwise, the cache data generation unit generates entity cache data according to the
information of the web page resource, and the cache data writing unit writes the entity
cache data directly to the cache system, the entity cache data including all HTTP response
data of the web page resource.
12. The device according to claim 11, characterized in that the predetermined condition
includes one or a combination of the following:
the type of the web page resource is a preset type;
the size of the web page resource is greater than a preset threshold; and
the name of the web page resource is present in a predetermined keyword list.
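The decision described in claims 11 and 12 can be sketched as follows. The concrete preset types, threshold, and keyword list are hypothetical values, and treating the conditions as alternatives (any one suffices) is only one of the combinations the claim allows.

```python
from dataclasses import dataclass

# Hypothetical configuration; the claims do not specify concrete values.
PRESET_TYPES = {"js", "css"}
SIZE_THRESHOLD = 10 * 1024  # bytes
KEYWORD_LIST = {"jquery.min.js"}


@dataclass
class Resource:
    name: str
    type_name: str
    size: int


def meets_predetermined_condition(r: Resource) -> bool:
    """Return True if any of the three conditions from claim 12 holds."""
    return (
        r.type_name in PRESET_TYPES
        or r.size > SIZE_THRESHOLD
        or r.name in KEYWORD_LIST
    )


def choose_cache_layout(r: Resource) -> str:
    """Pick the split header/body layout or a single entity entry."""
    return "header+body" if meets_predetermined_condition(r) else "entity"


layout_js = choose_cache_layout(Resource("app.js", "js", 2_000))
layout_small_png = choose_cache_layout(Resource("dot.png", "png", 300))
layout_big_png = choose_cache_layout(Resource("hero.png", "png", 500_000))
```

A deployment that wanted all conditions to hold at once would replace the `or` operators with `and`; the claim wording covers both readings.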
13. The device according to claim 8, characterized in that the cache data writing unit is
further configured to set the expiration time of the body cache data to 0 before writing
the header cache data and the body cache data to the cache system.
14. A web page resource processing device, characterized in that it is arranged at a CDN
node, the device comprising:
a resource retrieval unit, configured to, upon receiving a request to load a web page
resource, obtain the resource identifier of the web page resource to be loaded that is
carried in the request, and, using the resource identifier as the first key, retrieve from
the cache system the valid cache data corresponding to the first key;
a resource parsing unit, configured to, when valid cache data corresponding to the key is
found, parse the found valid cache data and judge whether the format of the valid cache
data conforms to the predetermined cache format of header cache data, wherein the
predetermined cache format of header cache data includes a first key pointing to the
corresponding body cache data, and identical body cache data has the same first key;
a resource acquisition unit, configured to, when the resource parsing unit judges that the
format of the valid cache data conforms to the predetermined cache format of header cache
data, obtain the first key from the valid cache data, query the body cache data
corresponding to the first key according to the obtained first key, and, when the body
cache data corresponding to the first key is found, obtain the web page resource to be
loaded based on the valid cache data and the body cache data corresponding to the first key.
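The lookup flow of claim 14 can be sketched as a two-step read. This is an illustration under stated assumptions: the cache system is a plain `dict`, a header entry is recognised by the presence of a `first_key` field, an entity entry carries `headers` and `body` together, and a missing body is treated as a cache miss (the claim does not say what happens in that case).

```python
def load_resource(cache, resource_id):
    """Return (headers, body) for a cached resource, or None on a miss."""
    entry = cache.get(resource_id)
    if entry is None:
        return None  # nothing cached under this resource identifier

    # Format check: a header entry carries a pointer to the body entry.
    if isinstance(entry, dict) and "first_key" in entry:
        body = cache.get(entry["first_key"])
        if body is None:
            return None  # assumption: missing body is handled as a miss
        return entry["headers"], body

    # Otherwise the entry is entity cache data holding the whole response.
    return entry["headers"], entry["body"]


# Hypothetical cache contents: one split entry and one entity entry.
demo_cache = {
    "http://a.example/app.js": {"headers": {"Content-Type": "application/javascript"},
                                "first_key": "js:abc"},
    "js:abc": b"console.log('hi');",
    "http://a.example/x.html": {"headers": {"Content-Type": "text/html"},
                                "body": b"<p>x</p>"},
}

split_hit = load_resource(demo_cache, "http://a.example/app.js")
entity_hit = load_resource(demo_cache, "http://a.example/x.html")
miss = load_resource(demo_cache, "http://a.example/none.js")
```

In this sketch the caller would fall back to the origin server whenever `load_resource` returns `None`.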
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610055758.4A CN107015978B (en) | 2016-01-27 | 2016-01-27 | Webpage resource processing method and device |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610055758.4A CN107015978B (en) | 2016-01-27 | 2016-01-27 | Webpage resource processing method and device |
Publications (2)
Publication Number | Publication Date |
---|---|
CN107015978A (en) | 2017-08-04 |
CN107015978B (en) | 2020-07-07 |
Family
ID=59438784
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201610055758.4A Active CN107015978B (en) | 2016-01-27 | 2016-01-27 | Webpage resource processing method and device |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN107015978B (en) |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108234639A (en) * | 2017-12-29 | 2018-06-29 | 北京奇虎科技有限公司 | A kind of data access method and device based on content distributing network CDN |
CN108804695A (en) * | 2018-06-14 | 2018-11-13 | 广州谱道网络科技有限公司 | Promotion link generation and identification method and device |
CN110147478A (en) * | 2017-10-20 | 2019-08-20 | 中国电信股份有限公司 | Web page subject word acquisition methods and system, server and user terminal |
CN111083108A (en) * | 2019-11-14 | 2020-04-28 | 北京字节跳动网络技术有限公司 | Data processing method, device, medium and electronic equipment |
CN113590658A (en) * | 2021-07-06 | 2021-11-02 | 广州汇思信息科技股份有限公司 | Cache data processing method and device, computer equipment and storage medium |
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101583089A (en) * | 2008-05-12 | 2009-11-18 | 华为技术有限公司 | Message storage method and message sending method and equipment |
CN101706825A (en) * | 2009-12-10 | 2010-05-12 | 华中科技大学 | Replicated data deleting method based on file content types |
CN101777056A (en) * | 2009-12-31 | 2010-07-14 | 成都市华为赛门铁克科技有限公司 | Data storage method and device |
CN102096712A (en) * | 2011-01-28 | 2011-06-15 | 深圳市五巨科技有限公司 | Method and device for cache-control of mobile terminal |
CN102111449A (en) * | 2011-02-23 | 2011-06-29 | 北京蓝汛通信技术有限责任公司 | Method, device and system for updating data |
US20150081962A1 (en) * | 2007-07-13 | 2015-03-19 | Samsung Electronics Co., Ltd. | Cache memory device and data processing method of the device |
CN105589919A (en) * | 2015-09-18 | 2016-05-18 | 广州市动景计算机科技有限公司 | Method and device for processing webpage resource |
Patent Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20150081962A1 (en) * | 2007-07-13 | 2015-03-19 | Samsung Electronics Co., Ltd. | Cache memory device and data processing method of the device |
CN101583089A (en) * | 2008-05-12 | 2009-11-18 | 华为技术有限公司 | Message storage method and message sending method and equipment |
CN101706825A (en) * | 2009-12-10 | 2010-05-12 | 华中科技大学 | Replicated data deleting method based on file content types |
CN101777056A (en) * | 2009-12-31 | 2010-07-14 | 成都市华为赛门铁克科技有限公司 | Data storage method and device |
CN102096712A (en) * | 2011-01-28 | 2011-06-15 | 深圳市五巨科技有限公司 | Method and device for cache-control of mobile terminal |
CN102111449A (en) * | 2011-02-23 | 2011-06-29 | 北京蓝汛通信技术有限责任公司 | Method, device and system for updating data |
CN105589919A (en) * | 2015-09-18 | 2016-05-18 | 广州市动景计算机科技有限公司 | Method and device for processing webpage resource |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110147478A (en) * | 2017-10-20 | 2019-08-20 | 中国电信股份有限公司 | Web page subject word acquisition methods and system, server and user terminal |
CN110147478B (en) * | 2017-10-20 | 2021-06-29 | 中国电信股份有限公司 | Webpage subject term obtaining method and system, server and user terminal |
CN108234639A (en) * | 2017-12-29 | 2018-06-29 | 北京奇虎科技有限公司 | A kind of data access method and device based on content distributing network CDN |
CN108804695A (en) * | 2018-06-14 | 2018-11-13 | 广州谱道网络科技有限公司 | Promotion link generation and identification method and device |
CN111083108A (en) * | 2019-11-14 | 2020-04-28 | 北京字节跳动网络技术有限公司 | Data processing method, device, medium and electronic equipment |
CN113590658A (en) * | 2021-07-06 | 2021-11-02 | 广州汇思信息科技股份有限公司 | Cache data processing method and device, computer equipment and storage medium |
Also Published As
Publication number | Publication date |
---|---|
CN107015978B (en) | 2020-07-07 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Ali et al. | Intelligent web proxy caching approaches based on machine learning techniques | |
CN107015978A (en) | A kind of web page resources processing method and device | |
CN100530186C (en) | Method and system for processing buffer | |
US9497256B1 (en) | Static tracker | |
EP3036662B1 (en) | Generating cache query requests | |
US9253278B2 (en) | Using entity tags (ETags) in a hierarchical HTTP proxy cache to reduce network traffic | |
CN105589919B (en) | Web page resources processing method and processing device | |
US10693858B2 (en) | CDN-based access control method and related device | |
JP2020057438A (en) | Sentence extraction method and system | |
US8438336B2 (en) | System and method for managing large filesystem-based caches | |
Shi et al. | Modeling object characteristics of dynamic web content | |
Doran et al. | A comparison of web robot and human requests | |
CN103916474B (en) | The definite method, apparatus and system of cache-time | |
CN107506154A (en) | A kind of read method of metadata, device and computer-readable recording medium | |
WO2022148306A1 (en) | Data elimination method and apparatus, cache node, and cache system | |
US8694659B1 (en) | Systems and methods for enhancing domain-name-server responses | |
Kaya et al. | An admission-control technique for delay reduction in proxy caching | |
CN108875036A (en) | Page data caching method, device, page cache data structure and electronic equipment | |
CN112416626B (en) | Data processing method and device | |
Mukhopadhyay et al. | A dynamic web page prediction model based on access patterns to offer better user latency | |
CN108073585A (en) | Network font loading method, device and system | |
CN112016017A (en) | Method and device for determining characteristic data | |
Hiranpongsin et al. | Integration of recommender system for Web cache management | |
CN111813711B (en) | Method and device for reading training sample data, storage medium and electronic equipment | |
US11411937B2 (en) | Web scraping prevention system |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
TA01 | Transfer of patent application right | Effective date of registration: 20200526. Address after: 310052 room 508, floor 5, building 4, No. 699, Wangshang Road, Changhe street, Binjiang District, Hangzhou City, Zhejiang Province. Applicant after: Alibaba (China) Co.,Ltd. Address before: 510627 Guangdong city of Guangzhou province Whampoa Tianhe District Road No. 163 Xiping Yun Lu Yun Ping B radio 14 floor tower square. Applicant before: GUANGZHOU UCWEB COMPUTER TECHNOLOGY Co.,Ltd. |
GR01 | Patent grant | ||