CN104753972A - Network resource collection processing method and server - Google Patents

Publication number
CN104753972A
Authority
CN
China
Prior art keywords
internet resources
file identifier
terminal account
cache device
indexed cache
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201310728512.5A
Other languages
Chinese (zh)
Inventor
林婕
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tencent Technology Shenzhen Co Ltd
Original Assignee
Tencent Technology Shenzhen Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tencent Technology Shenzhen Co Ltd filed Critical Tencent Technology Shenzhen Co Ltd
Priority to CN201310728512.5A priority Critical patent/CN104753972A/en
Publication of CN104753972A publication Critical patent/CN104753972A/en

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a network resource collection processing method and a server. The network resource collection processing method comprises the following steps: receiving a collection request sent by a terminal account, and generating a file identifier according to network resources corresponding to the collection request; looking up whether or not network resources corresponding to the file identifier exist in a content buffer; when the network resources corresponding to the file identifier exist in the content buffer, stopping storing the network resources; and newly building an index entry in an index table corresponding to the terminal account in an index buffer, and storing the file identifier in the newly-built index entry in the index buffer. Indexes and the network resources are stored separately, and the network resources are not stored repeatedly when the network resources are already stored in the content buffer, so that data redundancy caused by repeated storage is avoided; the storage space occupied by the network resources is reduced; and the waste of the storage space is avoided.

Description

Network resource collection processing method and server
Technical field
The present invention relates to the field of network technology, and in particular to a network resource collection processing method and a server.
Background
With the development of network technology, users who access the Internet through a browser, a client, or application software can save the network resources they need by collecting (bookmarking) them in network storage, so that they can view them later. In the collection storage scheme commonly used at present, an index is created for the network resource the user chooses to collect, and the index is stored on the server together with the actual data content of that resource. When the user wants to view their collection list, the server retrieves the stored indexes together with the resource content and returns both to the user terminal. Because the existing scheme stores the index together with the content, both the index and the resource content are associated with a user account. When multiple users collect the same network resource, the server must create an index for each user account and store a separate copy of the resource content for each index. The same network resource is therefore stored repeatedly, which wastes storage space and reduces storage efficiency.
Summary of the invention
Embodiments of the present invention provide a network resource collection processing method and a server that can effectively save storage space and improve storage efficiency.
An embodiment of the present invention proposes a network resource collection processing method, comprising the steps of:
receiving a collection request sent by a terminal account, and generating a file identifier according to the network resource corresponding to the collection request;
looking up whether a network resource corresponding to the file identifier exists in a content buffer;
when the network resource corresponding to the file identifier exists in the content buffer, not storing the network resource again;
creating a new index entry in the index table corresponding to the terminal account in an index buffer, and storing the file identifier in the newly created index entry in the index buffer.
An embodiment of the present invention also proposes a network resource collection processing server, comprising:
a transceiver module, configured to receive a collection request sent by a terminal account;
an identifier generation module, configured to generate a file identifier according to the network resource corresponding to the collection request;
a content buffer module, configured to look up whether a network resource corresponding to the file identifier exists in a content buffer and, when it does, not to store the network resource again;
an index buffer module, configured to create a new index entry in the index table corresponding to the terminal account in an index buffer, and to store the file identifier in the newly created index entry in the index buffer.
Embodiments of the present invention store indexes and network resources separately. When a network resource already exists in the content buffer, the server only creates a new index entry in the index buffer and does not store the network resource again. This avoids the data redundancy caused by repeated storage, reduces the storage space occupied by resources, and avoids wasting storage space.
Brief description of the drawings
Fig. 1 is a flow chart of the first embodiment of the network resource collection processing method of the present invention;
Fig. 2 is a flow chart of the second embodiment of the network resource collection processing method of the present invention;
Fig. 3 is a flow chart of the third embodiment of the network resource collection processing method of the present invention;
Fig. 4 is a flow chart of the fourth embodiment of the network resource collection processing method of the present invention;
Fig. 5 is a flow chart of the fifth embodiment of the network resource collection processing method of the present invention;
Fig. 6 is a flow chart of the sixth embodiment of the network resource collection processing method of the present invention;
Fig. 7 is a flow chart of the seventh embodiment of the network resource collection processing method of the present invention;
Fig. 8 is a structural diagram of the first embodiment of the network resource collection processing server of the present invention;
Fig. 9 is a structural diagram of the second embodiment of the network resource collection processing server of the present invention.
The realization of the objects, functional features, and advantages of the present invention will be further described with reference to the accompanying drawings in conjunction with the embodiments.
Detailed description of the embodiments
It should be understood that the specific embodiments described herein are intended only to explain the present invention and are not intended to limit it.
Fig. 1 is a flow chart of the first embodiment of the network resource collection processing method of the present invention. The method of this embodiment comprises the following steps for storing a network resource:
Step S10: receive a collection request sent by a terminal account, and generate a file identifier according to the network resource corresponding to the collection request.
A user who accesses the network through a browser, a client, or application software on a terminal usually logs in to their own account, or the terminal's specific identification code is used as the account of that terminal, so that the user can be distinguished from other users. When the user wants to collect the content being accessed, the user triggers the collection option, which generates a collection request. The terminal sends the collection request to the server through this account. The request carries either the network resource chosen by the user (that is, the content to be collected) or the network address of the chosen resource. When the request carries the actual content of the network resource, the server converts the resource directly into a file identifier. Different network resources yield different file identifiers, and each file identifier corresponds uniquely to its network resource. This embodiment may use MD5 (Message-Digest Algorithm 5) to convert the network resource to be collected into a 32-character string, which serves as the unique identification code of that resource. When the request instead carries the network address of the chosen resource, the server obtains the resource from that address and then converts the obtained resource into a file identifier. This greatly reduces the amount of data in the collection request sent by the terminal account to the server and improves the transmission efficiency of the request.
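The identifier generation described for step S10 can be illustrated with a minimal sketch. It assumes the resource content is available as bytes and uses Python's standard hashlib module; the function name make_file_identifier is hypothetical and does not come from the patent. MD5 yields a 128-bit digest, written as 32 hexadecimal characters. Note that MD5 collisions are known to exist, so uniqueness holds only in practice, not in the strict sense.

```python
import hashlib

def make_file_identifier(resource: bytes) -> str:
    """Return the MD5 digest of the resource content as 32 hex characters."""
    return hashlib.md5(resource).hexdigest()

# The same content always maps to the same identifier.
fid = make_file_identifier(b"<html>example page</html>")
```

Because the identifier depends only on the content, two users who collect the same resource produce the same identifier, which is what makes the later lookup in the content buffer possible.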
Step S20: look up whether a network resource corresponding to the file identifier exists in the content buffer.
Step S30: when the network resource corresponding to the file identifier exists in the content buffer, do not store the network resource again.
After generating the file identifier, the server looks up whether a network resource corresponding to the file identifier exists in the content buffer. If one exists, this network resource has already been collected by another user and does not need to be stored again; the server only needs to create an index for the current user. This reduces the storage space occupied by resources and avoids wasting storage space.
Step S40: create a new index entry in the index table corresponding to the terminal account in the index buffer, and store the file identifier in the newly created index entry in the index buffer.
When the server adds an index for a user, the index entry and the stored network resource are not placed in the same buffer. Instead, a content buffer and an index buffer are provided, which store network resources and indexes respectively. To store an index, the server adds a new index entry to the index buffer and stores the file identifier in that entry. The index ID of the index entry may serve as the key and the stored file identifier as the value, to facilitate lookup.
This embodiment stores indexes and network resources separately. When a network resource already exists in the content buffer, the server only creates a new index entry in the index buffer and does not store the network resource again. This avoids the data redundancy caused by repeated storage, reduces the storage space occupied by resources, and avoids wasting storage space.
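Steps S10 to S40 of this embodiment can be sketched as follows. This is an illustration under stated assumptions, not the patented implementation: both buffers are modelled as in-memory dictionaries, index IDs come from a simple counter, and the names content_buffer, index_buffer, and collect_existing are hypothetical. The sketch covers only the case this embodiment describes, in which the resource is already present in the content buffer.

```python
import hashlib
import itertools

# Hypothetical in-memory stand-ins for the two buffers.
content_buffer = {}               # file identifier -> resource content
index_buffer = {}                 # terminal account -> {index ID: file identifier}
_index_ids = itertools.count(1)   # illustrative index ID generator

def collect_existing(account, resource):
    """Handle a collection request for a resource that is already stored."""
    fid = hashlib.md5(resource).hexdigest()        # S10: generate file identifier
    if fid not in content_buffer:                  # S20: look up the content buffer
        raise KeyError("resource not stored yet; outside this embodiment")
    # S30: the resource exists, so it is not stored again.
    index_id = next(_index_ids)
    index_buffer.setdefault(account, {})[index_id] = fid   # S40: new index entry
    return index_id

page = b"<html>shared article</html>"
content_buffer[hashlib.md5(page).hexdigest()] = page   # stored by an earlier collection
first = collect_existing("alice", page)
second = collect_existing("bob", page)
```

After both calls the content buffer still holds a single copy of the page, while each account has its own index entry pointing at the same file identifier.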
Fig. 2 is a flow chart of the second embodiment of the network resource collection processing method of the present invention. This embodiment is based on the embodiment shown in Fig. 1 and further comprises, after step S20:
Step S31: when no network resource corresponding to the file identifier exists in the content buffer, store the file identifier and the network resource in the content buffer in association with each other.
In this embodiment, if the server receives a collection request and finds no network resource corresponding to the file identifier in the content buffer, the network resource has not yet been stored for any user. The server stores the resource in the content buffer with the file identifier as the key and the network resource as the value, so that the server can later look up the network resource in the content buffer using the file identifier held in the index buffer.
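The branch added by step S31 can be sketched as a store-if-absent operation on the content buffer. As before, the dictionary content_buffer and the function name store_if_absent are hypothetical, and MD5 via hashlib stands in for the identifier generation.

```python
import hashlib

content_buffer = {}   # hypothetical content buffer: file identifier -> resource content

def store_if_absent(resource):
    """Store the resource under its file identifier unless it is already present.

    Returns (file identifier, True) if a new copy was stored,
    or (file identifier, False) if the resource already existed.
    """
    fid = hashlib.md5(resource).hexdigest()
    if fid in content_buffer:          # S30: already stored, do nothing
        return fid, False
    content_buffer[fid] = resource     # S31: key = file identifier, value = resource
    return fid, True

fid_a, stored_a = store_if_absent(b"article body")
fid_b, stored_b = store_if_absent(b"article body")
```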
Fig. 3 is a flow chart of the third embodiment of the network resource collection processing method of the present invention. This embodiment is based on the embodiment shown in Fig. 2 and further comprises, after step S40:
Step S41: store the network address of the network resource, in association with the file identifier, in the newly created index entry in the index buffer.
In this embodiment the server also stores the network address of the network resource in the index buffer. That is, the index ID of the index entry serves as the key, and the file identifier together with the network address of the network resource serves as the value. If the network resource in the content buffer is deleted by mistake, or was not stored successfully for some other reason, the server can obtain the resource again directly from the network address held in the index buffer. This avoids the situation in which the server cannot find the resource in the content buffer and therefore cannot provide the collected resource to the terminal account, and so ensures the reliability of network resource collection.
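The index entry layout of step S41 can be sketched with a small record type. The class name IndexEntry, its field names, and the example address are assumptions made for illustration; the patent only specifies that the index ID is the key and that the file identifier and network address together form the value.

```python
from dataclasses import dataclass

@dataclass
class IndexEntry:
    file_identifier: str    # key into the content buffer
    network_address: str    # where the resource can be fetched again if needed

# Hypothetical index buffer: terminal account -> {index ID: IndexEntry}
index_buffer = {}

def add_index_entry(account, index_id, file_identifier, network_address):
    """Create a new index entry holding both the identifier and the address."""
    entry = IndexEntry(file_identifier, network_address)
    index_buffer.setdefault(account, {})[index_id] = entry
    return entry

entry = add_index_entry("alice", 1, "fid-123", "http://example.com/article")
```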
Fig. 4 is a flow chart of the fourth embodiment of the network resource collection processing method of the present invention. The method of this embodiment further comprises the following steps for querying the index table:
Step S51: receive an index query request sent by a terminal account.
When a user wants to view the index of the network resources they have collected, the user triggers the index view option, which generates an index query request. The terminal sends the index query request to the server through the account of the user terminal, and the request carries the account information of the user terminal.
Step S52: according to the information of the terminal account, extract the index table corresponding to the terminal account from the index buffer.
Step S53: send the index table corresponding to the terminal account to the terminal account.
When the server creates an index table in the index buffer, it creates the table for a particular terminal account, so each terminal account corresponds to at least one index table. Using the terminal account information, the server looks up the index table corresponding to the terminal account in the index buffer and returns it to that account, and the user can then view the list of their collected items through the account. The index table lists the file identifiers corresponding to the network resources rather than the actual content of the resources. This reduces the amount of data the server transmits to the terminal account, improves the transmission efficiency of the index table, and saves network traffic. It also reduces the amount of data stored on the terminal, which lightens the load on the terminal.
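Steps S51 to S53 amount to a lookup keyed by terminal account. The following sketch assumes one index table per account, modelled as a dictionary from index ID to file identifier; the sample data and the name query_index_table are hypothetical.

```python
# Hypothetical index buffer: terminal account -> {index ID: file identifier}
index_buffer = {
    "alice": {1: "fid-aaa", 2: "fid-bbb"},
    "bob": {3: "fid-aaa"},
}

def query_index_table(account):
    """Return a copy of the index table for the account (S52), to be sent back (S53).

    Only file identifiers are returned, never the resource content itself.
    """
    return dict(index_buffer.get(account, {}))

alice_table = query_index_table("alice")
unknown_table = query_index_table("carol")
```

Returning a copy keeps the caller from modifying the server-side table by accident; an account with no collections simply gets an empty table.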
Fig. 5 is a flow chart of the fifth embodiment of the network resource collection processing method of the present invention. The method of this embodiment further comprises the following steps for querying a network resource:
Step S61: receive a content query request sent by a terminal account.
After viewing the index table of their collected network resources on the terminal, a user who wants to see the actual content of a particular index entry selects that entry on the terminal. The terminal sends a content query request to the server through the account of the user terminal, and the request carries the index entry to be queried, which the terminal account selected in the index table.
Step S62: query the index buffer for the file identifier corresponding to the index entry to be queried.
Step S63: query the content buffer for the network resource corresponding to the file identifier.
Step S64: when the network resource corresponding to the file identifier is found in the content buffer, send the network resource to the terminal account.
Using the index ID of the index entry to be queried as the key, the server looks up the file identifier stored in that entry in the index buffer. It then uses the file identifier as the key to look up the corresponding network resource in the content buffer and sends the resource to the terminal account.
Because this embodiment stores the index separately from the actual content of the network resources, when a user makes a query the server only needs to send the network resource corresponding to the selected index entry, not all network resources. This saves network traffic, shortens the query time, and improves the query efficiency for network resources. It also reduces the amount of data stored on the terminal, which lightens the load on the terminal.
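The two-stage lookup of steps S62 to S64 can be sketched as follows, again with dictionaries standing in for the two buffers. The sample data and the name query_content are hypothetical; returning None for a miss is an illustrative choice, not something the patent specifies.

```python
# Hypothetical buffers with sample data.
content_buffer = {"fid-aaa": b"first article", "fid-bbb": b"second article"}
index_buffer = {"alice": {1: "fid-aaa", 2: "fid-bbb"}}

def query_content(account, index_id):
    """Resolve an index entry to its resource content, or None if not found."""
    fid = index_buffer.get(account, {}).get(index_id)   # S62: index ID -> file identifier
    if fid is None:
        return None
    return content_buffer.get(fid)                      # S63: file identifier -> resource

result = query_content("alice", 2)   # S64: this is what would be sent to the account
missing = query_content("alice", 99)
```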
Fig. 6 is a flow chart of the sixth embodiment of the network resource collection processing method of the present invention. This embodiment is based on the embodiment shown in Fig. 5 and further comprises, after step S63:
Step S65: when the network resource corresponding to the file identifier is not found in the content buffer, obtain from the index buffer the network address of the network resource corresponding to the selected index entry.
Step S66: obtain the corresponding network resource according to the network address.
This embodiment addresses the case in which the server cannot find the network resource in the content buffer because the resource was deleted by mistake or was not stored successfully for some other reason. To avoid being unable to provide the collected resource to the terminal account, the server looks up the network address of the resource to be queried in the index buffer, accesses the server hosting the resource at that address, and obtains the actual content of the resource from that server.
Step S67: store the file identifier and the newly obtained network resource in the content buffer in association with each other, and send the newly obtained network resource to the terminal account.
After obtaining the network resource, the server provides the newly obtained resource to the user, which ensures the reliability of network resource collection. In addition, because the file identifier corresponding to the resource is already present in the index buffer, the resource is one that a user chose to collect. The server therefore stores the resource in the content buffer with the file identifier as the key and the resource as the value, so that when the current user or another user looks up the resource again, the server can retrieve it from the content buffer and send it to the user. This further ensures the reliability of network resource collection.
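The fallback path of steps S65 to S67 can be sketched as a lookup that refetches on a miss and repopulates the content buffer. To keep the example runnable without network access, the fetch operation is passed in as a function; fake_fetch, the sample address, and the tuple layout of the index entry are all assumptions made for illustration.

```python
content_buffer = {}   # hypothetical content buffer, empty to simulate a lost resource
# Hypothetical index buffer: account -> {index ID: (file identifier, network address)}
index_buffer = {"alice": {1: ("fid-123", "http://example.com/article")}}

def query_with_fallback(account, index_id, fetch):
    """Return the resource, refetching it from its network address on a miss."""
    fid, address = index_buffer[account][index_id]
    resource = content_buffer.get(fid)       # S63: try the content buffer first
    if resource is None:
        resource = fetch(address)            # S65, S66: obtain via the stored address
        content_buffer[fid] = resource       # S67: store under the existing identifier
    return resource

calls = []
def fake_fetch(address):
    """Stand-in for an HTTP request; records the address it was asked for."""
    calls.append(address)
    return b"refetched content"

first = query_with_fallback("alice", 1, fake_fetch)
second = query_with_fallback("alice", 1, fake_fetch)
```

The second call is served from the content buffer, so the address is fetched only once. One point the sketch leaves open is that content fetched later may differ from what was originally collected, in which case its digest would no longer match the stored file identifier.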
Fig. 7 is a flow chart of the seventh embodiment of the network resource collection processing method of the present invention. The method of this embodiment further comprises the following steps for deleting an index entry:
Step S71: receive an index deletion request sent by a terminal account, the request carrying the index entry to be deleted, which the terminal account selected in the index table.
Step S72: delete the index entry to be deleted from the index buffer.
Because the server stores indexes and network resources separately, when a user wants to delete a collected item the server deletes only the index entry for that user's terminal account in the index buffer and retains the network resource in the content buffer. As a result, the resource does not need to be stored again when another user collects it, which spares the server the work of storing the resource a second time.
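Steps S71 and S72 can be sketched as a deletion that touches only the index buffer. The sample data and the name delete_index_entry are hypothetical.

```python
# Hypothetical buffers with sample data: two accounts share one stored resource.
content_buffer = {"fid-aaa": b"shared article"}
index_buffer = {"alice": {1: "fid-aaa"}, "bob": {2: "fid-aaa"}}

def delete_index_entry(account, index_id):
    """Delete one index entry (S72); the content buffer is left untouched.

    Returns True if an entry was removed, False if it did not exist.
    """
    table = index_buffer.get(account, {})
    return table.pop(index_id, None) is not None

removed = delete_index_entry("alice", 1)
removed_again = delete_index_entry("alice", 1)
```

After the deletion, the resource is still available to the other account and to any later collection of the same content.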
Fig. 8 is a structural diagram of the first embodiment of the network resource collection processing server of the present invention. The server of this embodiment comprises:
a transceiver module 10, configured to receive a collection request sent by a terminal account;
an identifier generation module 20, configured to generate a file identifier according to the network resource corresponding to the collection request;
a content buffer module 30, configured to look up whether a network resource corresponding to the file identifier exists in the content buffer and, when it does, not to store the network resource again;
an index buffer module 40, configured to create a new index entry in the index table corresponding to the terminal account in the index buffer, and to store the file identifier in the newly created index entry in the index buffer.
A user who accesses the network through a browser, a client, or application software on a terminal usually logs in to their own account, or the terminal's specific identification code is used as the account of that terminal, so that the user can be distinguished from other users. When the user wants to collect the content being accessed, the user triggers the collection option, which generates a collection request. The terminal sends the collection request to the server through this account. The request carries either the network resource chosen by the user (that is, the content to be collected) or the network address of the chosen resource. When the request carries the actual content of the network resource, the server converts the resource directly into a file identifier. Different network resources yield different file identifiers, and each file identifier corresponds uniquely to its network resource. This embodiment may use MD5 (Message-Digest Algorithm 5) to convert the network resource to be collected into a 32-character string, which serves as the unique identification code of that resource. When the request instead carries the network address of the chosen resource, the server obtains the resource from that address and then converts the obtained resource into a file identifier. This greatly reduces the amount of data in the collection request sent by the terminal account to the server and improves the transmission efficiency of the request.
After generating the file identifier, the server looks up whether a network resource corresponding to the file identifier exists in the content buffer. If one exists, this network resource has already been collected by another user and does not need to be stored again; the server only needs to create an index for the current user. This reduces the storage space occupied by resources and avoids wasting storage space. When the server adds an index for a user, the index entry and the stored network resource are not placed in the same buffer. Instead, a content buffer and an index buffer are provided, which store network resources and indexes respectively. To store an index, the server adds a new index entry to the index buffer and stores the file identifier in that entry. The index ID of the index entry may serve as the key and the stored file identifier as the value, to facilitate lookup.
This embodiment stores indexes and network resources separately. When a network resource already exists in the content buffer, the server only creates a new index entry in the index buffer and does not store the network resource again. This avoids the data redundancy caused by repeated storage, reduces the storage space occupied by resources, and avoids wasting storage space.
Further, the content buffer module 30 is also configured to, when no network resource corresponding to the file identifier exists in the content buffer, store the file identifier and the network resource in the content buffer in association with each other.
In this embodiment, if the server receives a collection request and finds no network resource corresponding to the file identifier in the content buffer, the network resource has not yet been stored for any user. The server stores the resource in the content buffer with the file identifier as the key and the network resource as the value, so that the server can later look up the network resource in the content buffer using the file identifier held in the index buffer.
Further, the index buffer module 40 is also configured to store the network address of the network resource, in association with the file identifier, in the newly created index entry in the index buffer.
In this embodiment the server also stores the network address of the network resource in the index buffer. That is, the index ID of the index entry serves as the key, and the file identifier together with the network address of the network resource serves as the value. If the network resource in the content buffer is deleted by mistake, or was not stored successfully for some other reason, the server can obtain the resource again directly from the network address held in the index buffer. This avoids the situation in which the server cannot find the resource in the content buffer and therefore cannot provide the collected resource to the terminal account, and so ensures the reliability of network resource collection.
Further, the following modules are also used to query the index table.
The transceiver module 10 is also configured to receive an index query request sent by a terminal account, the request carrying the information of the terminal account;
the index buffer module 40 is also configured to extract, according to the information of the terminal account, the index table corresponding to the terminal account from the index buffer;
the transceiver module 10 is also configured to send the index table corresponding to the terminal account to the terminal account.
When a user wants to view the index of the network resources they have collected, the user triggers the index view option, which generates an index query request. The terminal sends the index query request to the server through the account of the user terminal, and the request carries the account information of the user terminal. When the server creates an index table in the index buffer, it creates the table for a particular terminal account, so each terminal account corresponds to at least one index table. Using the terminal account information, the server looks up the index table corresponding to the terminal account in the index buffer and returns it to that account, and the user can then view the list of their collected items through the account. The index table lists the file identifiers corresponding to the network resources rather than the actual content of the resources. This reduces the amount of data the server transmits to the terminal account, improves the transmission efficiency of the index table, and saves network traffic. It also reduces the amount of data stored on the terminal, which lightens the load on the terminal.
Further, the following modules are also used to query a network resource.
The transceiver module 10 is also configured to receive a content query request sent by a terminal account, the request carrying the index entry to be queried, which the terminal account selected in the index table;
the index buffer module 40 is also configured to query the index buffer for the file identifier corresponding to the index entry to be queried;
the content buffer module 30 is also configured to query the content buffer for the network resource corresponding to the file identifier;
the transceiver module 10 is also configured to send the network resource to the terminal account when the network resource corresponding to the file identifier is found in the content buffer.
After viewing the index table of their collected network resources on the terminal, a user who wants to see the actual content of a particular index entry selects that entry on the terminal. The terminal sends a content query request to the server through the account of the user terminal, and the request carries the index entry to be queried, which the terminal account selected in the index table. Using the index ID of the index entry to be queried as the key, the server looks up the file identifier stored in that entry in the index buffer. It then uses the file identifier as the key to look up the corresponding network resource in the content buffer and sends the resource to the terminal account.
Because this embodiment stores the index separately from the actual content of the network resources, when a user makes a query the server only needs to send the network resource corresponding to the selected index entry, not all network resources. This saves network traffic, shortens the query time, and improves the query efficiency for network resources. It also reduces the amount of data stored on the terminal, which lightens the load on the terminal.
Fig. 9 is a structural diagram of the second embodiment of the network resource collection processing server of the present invention. This embodiment adds a network resource acquisition module 50 to the embodiment shown in Fig. 8, wherein:
the index buffer module 40 is also configured to obtain from the index buffer, when the network resource corresponding to the file identifier is not found in the content buffer, the network address of the network resource corresponding to the selected index entry;
the network resource acquisition module 50 is configured to obtain the corresponding network resource according to the network address;
the content buffer module 30 is also configured to store the file identifier and the newly obtained network resource in the content buffer in association with each other;
the transceiver module 10 is also configured to send the newly obtained network resource to the terminal account.
This embodiment addresses the case in which the server cannot find the network resource in the content buffer because the resource was deleted by mistake or was not stored successfully for some other reason. To avoid being unable to provide the collected resource to the terminal account, the server looks up the network address of the resource to be queried in the index buffer, accesses the server hosting the resource at that address, and obtains the actual content of the resource from that server. After obtaining the network resource, the server provides the newly obtained resource to the user, which ensures the reliability of network resource collection. In addition, because the file identifier corresponding to the resource is already present in the index buffer, the resource is one that a user chose to collect. The server therefore stores the resource in the content buffer with the file identifier as the key and the resource as the value, so that when the current user or another user looks up the resource again, the server can retrieve it from the content buffer and send it to the user. This further ensures the reliability of network resource collection.
Further, the following modules are also used to delete index entries.
The transceiver module 10 is further configured to receive an index deletion request sent by the terminal account, the index deletion request carrying the index entry to be deleted that the terminal account selected in the index table;
The index buffer module 40 is further configured to delete the index entry to be deleted from the index buffer.
Because the server stores the index and the network resources separately, when a user wants to delete a collected item, the server deletes only the index entry corresponding to that user's terminal account in the index buffer and retains the network resource in the content buffer. As a result, the network resource does not need to be stored again when another user collects it, which saves the server the repeated work of storing the resource.
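A minimal sketch of this deletion behaviour, assuming the index buffer is a dictionary of per-account index tables and the content buffer is a dictionary keyed by file identifier. The function name `delete_index_entry`, the account names, and the entry keys are hypothetical.

```python
from typing import Dict


def delete_index_entry(
    index_buffer: Dict[str, Dict[str, str]], account: str, entry_id: str
) -> bool:
    """Delete one index entry from the account's index table.

    Only the index buffer is modified; the content buffer is never touched.
    Returns True if an entry was removed, False if it did not exist.
    """
    table = index_buffer.get(account, {})
    return table.pop(entry_id, None) is not None


# Two accounts have collected the same resource (same file identifier).
index_buffer = {
    "alice": {"e1": "id-1"},
    "bob": {"e7": "id-1"},
}
content_buffer = {"id-1": b"shared article"}

removed = delete_index_entry(index_buffer, "alice", "e1")
```

After the call, the first account's index table is empty, while the second account's entry and the stored resource remain available.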
It should be noted that, in this document, the terms "comprise", "include", or any other variant thereof are intended to cover a non-exclusive inclusion, so that a process, method, article, or apparatus that comprises a series of elements includes not only those elements but also other elements not expressly listed, or elements inherent to such a process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising a ..." does not exclude the presence of other identical elements in the process, method, article, or apparatus that comprises the element.
The serial numbers of the above embodiments of the present invention are for description only and do not represent the relative merits of the embodiments.
From the above description of the embodiments, those skilled in the art can clearly understand that the methods of the above embodiments can be implemented by software plus a necessary general-purpose hardware platform, or by hardware, although in many cases the former is the better implementation. Based on this understanding, the technical solution of the present invention, in essence or in the part that contributes over the prior art, can be embodied in the form of a software product. The computer software product is stored in a storage medium (such as ROM/RAM, a magnetic disk, or an optical disc) and includes several instructions that cause a terminal device (which may be a mobile phone, a computer, a server, a network device, or the like) to perform the methods described in the embodiments of the present invention.
The foregoing describes only preferred embodiments of the present invention and does not thereby limit the patent scope of the present invention. Any equivalent structure or equivalent process transformation made using the contents of the specification and drawings of the present invention, or any direct or indirect application in other related technical fields, is likewise included within the scope of patent protection of the present invention.

Claims (14)

1. A method for network resource collection processing, characterized by comprising the steps of:
Receiving a collection request sent by a terminal account, and generating a file identifier according to the network resource corresponding to the collection request;
Looking up whether a network resource corresponding to the file identifier exists in a content buffer;
When the network resource corresponding to the file identifier exists in the content buffer, stopping storing the network resource;
Creating a new index entry in the index table corresponding to the terminal account in an index buffer, and storing the file identifier in the newly created index entry in the index buffer.
2. the method for Internet resources according to claim 1 collection process, is characterized in that, described search in context buffer whether have the step of Internet resources corresponding to described file identifier after also comprise:
When the Internet resources not having described file identifier corresponding in described context buffer, by described file identifier and Internet resources corresponding stored to described context buffer.
3. the method for Internet resources according to claim 1 and 2 collection process, is characterized in that, described file identifier is stored to the step of index entry newly-built in described indexed cache device after also comprise:
The network address of described Internet resources is corresponded to described file identifier, is stored to index entry newly-built in described indexed cache device.
4. the method for Internet resources according to claim 3 collection process, is characterized in that, described file identifier is stored to the step of index entry newly-built in described indexed cache device after also comprise:
The search index request that receiving terminal account sends, carries the information of described terminal account in described search index request;
According to the information of described terminal account, from described indexed cache device, extract concordance list corresponding to described terminal account;
Send concordance list corresponding to described terminal account to described terminal account.
5. the method for Internet resources according to claim 4 collection process, is characterized in that, the concordance list of described transmitting terminal account to described terminal account step after also comprise:
The content query requests that receiving terminal account sends, carries the index entry to be checked that described terminal account is chosen in described concordance list in described content query requests;
Described file identifier corresponding to described index entry to be checked is inquired about from described indexed cache device;
Internet resources corresponding to described file identifier are inquired about from described context buffer;
When finding Internet resources corresponding to described file identifier in described context buffer, send described Internet resources to described terminal account.
6. the method for Internet resources according to claim 5 collection process, is characterized in that, described from described context buffer, inquire about the step of Internet resources corresponding to described file identifier after also comprise:
When not finding Internet resources corresponding to described file identifier in described context buffer, from described indexed cache device, obtain the network address of described Internet resources corresponding to the index entry chosen;
Corresponding Internet resources are obtained according to the described network address;
By in described file identifier and the new Internet resources corresponding stored obtained to described context buffer, and the Internet resources of described new acquisition are sent to described terminal account.
7. the method for Internet resources according to claim 1 and 2 collection process, is characterized in that, described file identifier is stored to the step of index entry newly-built in described indexed cache device after also comprise:
The index removal request that receiving terminal account sends, carries the index entry to be deleted that described terminal account is chosen in described concordance list in described index removal request;
Described index entry to be deleted is deleted from described indexed cache device.
8. A server for network resource collection processing, characterized by comprising:
A transceiver module, configured to receive a collection request sent by a terminal account;
An identifier generation module, configured to generate a file identifier according to the network resource corresponding to the collection request;
A content buffer module, configured to look up whether a network resource corresponding to the file identifier exists in a content buffer, and, when the network resource corresponding to the file identifier exists in the content buffer, to stop storing the network resource;
An index buffer module, configured to create a new index entry in the index table corresponding to the terminal account in an index buffer, and to store the file identifier in the newly created index entry in the index buffer.
9. The server for network resource collection processing according to claim 8, characterized in that the content buffer module is further configured to, when no network resource corresponding to the file identifier exists in the content buffer, store the file identifier and the network resource in the content buffer in correspondence with each other.
10. The server for network resource collection processing according to claim 8 or 9, characterized in that the index buffer module is further configured to store the network address of the network resource, in correspondence with the file identifier, in the newly created index entry in the index buffer.
11. The server for network resource collection processing according to claim 10, characterized in that the transceiver module is further configured to receive an index query request sent by the terminal account, the index query request carrying information of the terminal account;
The index buffer module is further configured to extract, according to the information of the terminal account, the index table corresponding to the terminal account from the index buffer;
The transceiver module is further configured to send the index table corresponding to the terminal account to the terminal account.
12. The server for network resource collection processing according to claim 11, characterized in that the transceiver module is further configured to receive a content query request sent by the terminal account, the content query request carrying the index entry to be queried that the terminal account selected in the index table;
The index buffer module is further configured to query, from the index buffer, the file identifier corresponding to the index entry to be queried;
The content buffer module is further configured to query, from the content buffer, the network resource corresponding to the file identifier;
The transceiver module is further configured to, when the network resource corresponding to the file identifier is found in the content buffer, send the network resource to the terminal account.
13. The server for network resource collection processing according to claim 12, characterized by further comprising a network resource acquisition module;
The index buffer module is further configured to, when the network resource corresponding to the file identifier is not found in the content buffer, obtain, from the index buffer, the network address of the network resource corresponding to the selected index entry;
The network resource acquisition module is configured to obtain the corresponding network resource according to the network address;
The content buffer module is further configured to store the file identifier and the newly obtained network resource in the content buffer in correspondence with each other;
The transceiver module is further configured to send the newly obtained network resource to the terminal account.
14. The server for network resource collection processing according to claim 8 or 9, characterized in that the transceiver module is further configured to receive an index deletion request sent by the terminal account, the index deletion request carrying the index entry to be deleted that the terminal account selected in the index table;
The index buffer module is further configured to delete the index entry to be deleted from the index buffer.
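
As an illustration of the collection flow recited in claims 1 to 3, the following sketch stores a resource at most once and records one index entry per collecting account. It rests on several assumptions that the claims do not state: the file identifier is taken to be a SHA-256 digest of the resource content, both buffers are plain dictionaries, and the function name `collect`, the account names, and the URL are hypothetical.

```python
import hashlib
from typing import Dict, List, Tuple


def collect(
    account: str,
    url: str,
    resource: bytes,
    content_buffer: Dict[str, bytes],
    index_buffer: Dict[str, List[Tuple[str, str]]],
) -> str:
    """Handle one collection request and return the file identifier."""
    # Assumption: the identifier is a content hash, so identical resources
    # collected by different accounts map to the same identifier.
    file_id = hashlib.sha256(resource).hexdigest()
    # Store the resource only if the content buffer does not already hold it.
    if file_id not in content_buffer:
        content_buffer[file_id] = resource
    # Always create a new index entry (identifier plus network address)
    # in the index table of the collecting account.
    index_buffer.setdefault(account, []).append((file_id, url))
    return file_id


content_buffer: Dict[str, bytes] = {}
index_buffer: Dict[str, List[Tuple[str, str]]] = {}

id_a = collect("alice", "http://example.com/a", b"article body",
               content_buffer, index_buffer)
id_b = collect("bob", "http://example.com/a", b"article body",
               content_buffer, index_buffer)
```

Both accounts end up with their own index entry, while the content buffer holds a single copy of the resource.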
CN201310728512.5A 2013-12-25 2013-12-25 Network resource collection processing method and server Pending CN104753972A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310728512.5A CN104753972A (en) 2013-12-25 2013-12-25 Network resource collection processing method and server

Publications (1)

Publication Number Publication Date
CN104753972A true CN104753972A (en) 2015-07-01

Family

ID=53593075

Country Status (1)

Country Link
CN (1) CN104753972A (en)

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20150701
