WO2012025052A1 - Internet search system and method based on a peer-to-peer network - Google Patents

Internet search system and method based on a peer-to-peer network

Info

Publication number
WO2012025052A1
WO2012025052A1 PCT/CN2011/078886 CN2011078886W WO2012025052A1 WO 2012025052 A1 WO2012025052 A1 WO 2012025052A1 CN 2011078886 W CN2011078886 W CN 2011078886W WO 2012025052 A1 WO2012025052 A1 WO 2012025052A1
Authority
WO
WIPO (PCT)
Prior art keywords
search
node
library
result
internet
Prior art date
Application number
PCT/CN2011/078886
Other languages
English (en)
French (fr)
Inventor
董方
Original Assignee
Dong Fang
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Dong Fang filed Critical Dong Fang
Publication of WO2012025052A1 publication Critical patent/WO2012025052A1/zh
Priority to US13/776,552 priority Critical patent/US9330177B2/en

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/95 Retrieval from the web
    • G06F16/953 Querying, e.g. by the use of web search engines
    • G06F16/9536 Search customisation based on social or collaborative filtering
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/95 Retrieval from the web
    • G06F16/951 Indexing; Web crawling techniques
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10 File systems; File servers
    • G06F16/18 File system types
    • G06F16/182 Distributed file systems
    • G06F16/1834 Distributed file systems implemented based on peer-to-peer networks, e.g. gnutella
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/95 Retrieval from the web

Definitions

  • The invention relates to an Internet search system and its search method, in particular to a search system and method that implement an Internet search service on a peer-to-peer network, so that the data storage and computation of the search system are distributed across the user nodes, reducing the construction and maintenance costs of the search engine. The invention belongs to the field of Internet search technology.
  • Internet search technology is an important technology in Internet applications. People use Internet search engines to find the data they care about in the vast Internet.
  • Existing Internet search technologies generally rely on a group of search engine servers built by a search service provider.
  • The search engine server crawls the information of web pages on the Internet and, according to a certain data structure, organizes and stores index information for each page's searchable data (such as address, title, content, and anchor text), thereby building a huge index library.
  • When searching, the user generally sends search keywords to the search engine server, which searches the index library it has built according to those keywords, sorts the results according to certain rules, and sends them to the requesting user.
  • The present invention provides a search system and method for implementing an Internet search service based on a peer-to-peer network.
  • The main object of the present invention is to solve the problems existing in the prior art by providing a search system and method for implementing an Internet search service based on a peer-to-peer network, so that the data storage and computation of the search system are distributed across the user nodes and the construction and maintenance costs of the search engine are reduced.
  • The object of the present invention is achieved by the following technical solutions:
  • An Internet search system based on a peer-to-peer network, characterized in that: the search system is composed of a plurality of peer search nodes; the search nodes are connected through the Internet to form a peer-to-peer network; and a search terminal is disposed in each search node;
  • the search terminal includes: a network interface, a search controller, a search storage space, a retrieval module, and an aggregation sorting module;
  • the network interface is connected to the search controller, and the search terminal is connected to the Internet through the network interface;
  • the search storage space includes an index library and a point-to-point link library; the index library stores index information of web pages; the point-to-point link library stores link information between this search node and its neighboring search nodes; the point-to-point link library is directly connected to the search controller; the index library is connected to the search controller through the retrieval module;
  • the retrieval module, under the control of the search controller, searches the index library according to the keywords of the user's search request and sends the retrieval results to the aggregation sorting module;
  • the aggregation sorting module is connected to the search controller and to the retrieval module, and performs aggregation sorting on the retrieval results sent by the retrieval module.
  • The Internet search method based on the peer-to-peer network is implemented on the foregoing Internet search system and is characterized by the following steps:
  • (3) the search controller of the original search node reads the link information in its local point-to-point link library and forwards the search request to the search nodes connected to the original search node;
  • (4) a search node that receives the search request responds to it and determines whether it satisfies the end forwarding condition; if the condition is not satisfied, the node is an intermediate search node and step (5) is performed; if the condition is satisfied, the node is a final search node and step (7) is performed;
  • (5) the search controller of the intermediate search node reads the link information in its point-to-point link library and forwards the search request to the search nodes connected to the intermediate search node;
  • (6) the intermediate search node responds to the search request: its search controller controls the retrieval module to search the index library, the node returns its own retrieval result together with the results returned by its subordinate search nodes to its superior search node, and the process returns to step (4);
  • (7) the final search node responds to the search request: its search controller controls the retrieval module to search the index library and returns the retrieval result to its superior search node;
  • (8) the search controller of the original search node controls the aggregation sorting module to aggregate and sort the results returned by the other search nodes together with the results retrieved from the local index library, and sends the sorted results to the user, completing the search.
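  • The forwarding and return flow in the steps above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the patent's implementation: the class `SearchNode`, its methods, the `max_hops` hop budget used as the end forwarding condition, and the `seen` set that keeps a node from answering twice are all hypothetical, and plain alphabetical sorting stands in for the aggregation sorting module.

```python
from dataclasses import dataclass, field


@dataclass
class SearchNode:
    """One peer: a local index library plus a point-to-point link library."""
    name: str
    index: dict[str, list[str]] = field(default_factory=dict)    # keyword -> URLs
    neighbors: list["SearchNode"] = field(default_factory=list)  # link library

    def retrieve(self, keyword: str) -> list[str]:
        # Retrieval module: look the keyword up in the local index library.
        return list(self.index.get(keyword, []))

    def handle(self, keyword: str, hops_left: int, seen: set[str]) -> list[str]:
        # Search locally, forward unless the (assumed) end forwarding
        # condition holds, and return local plus subordinate results upward.
        seen.add(self.name)
        results = self.retrieve(keyword)
        if hops_left > 0:  # assumption: a hop budget ends the forwarding
            for peer in self.neighbors:
                if peer.name not in seen:  # assumption: skip visited nodes
                    results += peer.handle(keyword, hops_left - 1, seen)
        return results

    def search(self, keyword: str, max_hops: int = 2) -> list[str]:
        # The original node aggregates and sorts everything it collected.
        collected = self.handle(keyword, max_hops, set())
        return sorted(set(collected))


# Usage: a chain a -> b -> c, each peer holding one page for "p2p".
a = SearchNode("a", {"p2p": ["http://a.example/1"]})
b = SearchNode("b", {"p2p": ["http://b.example/1"]})
c = SearchNode("c", {"p2p": ["http://c.example/1"]})
a.neighbors.append(b)
b.neighbors.append(c)
two_levels = a.search("p2p", max_hops=2)
```

  • With `max_hops=1` the request in this sketch stops at node `b`, so node `c` is never asked; raising the budget widens the search at the cost of more messages.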
  • The beneficial effects of the present invention are as follows:
  • The peer-to-peer network-based Internet search system and method set a search terminal at each search node so that the nodes form a peer-to-peer network.
  • A search request is sent to other search nodes level by level using the message publishing and return mechanism of the peer-to-peer network.
  • Each search node responds to the search request and searches the index library it stores locally; the retrieval results of all search nodes are finally collected at the original search node, which aggregates and sorts them.
  • Such a search system structure and search method completely break with the architecture of existing search systems and provide a new search system architecture.
  • The workload problem of search engine servers in the prior art is thereby completely solved.
  • FIG. 1 is a schematic structural view of a first embodiment of an Internet search system based on a peer-to-peer network
  • FIG. 2 is a schematic structural diagram of a second embodiment of an Internet search system based on a peer-to-peer network
  • FIG. 3 is a schematic structural diagram of a third embodiment of an Internet search system based on a peer-to-peer network
  • FIG. 4 is a schematic diagram showing the structure of the fourth, fifth, and sixth embodiments of the Internet search system based on the peer-to-peer network;
  • FIG. 5 is a flow chart of a seventh embodiment of an Internet search method based on a peer-to-peer network
  • FIG. 6 is a flow chart of an eighth embodiment of an Internet search method based on a peer-to-peer network.
  • FIG. 1 is a schematic structural diagram of a first embodiment of an Internet search system based on a peer-to-peer network, and is also a basic structural diagram of a search system of the present invention.
  • The Internet search system is composed of several peer search nodes. The search nodes are connected through the Internet to form a peer-to-peer network.
  • A search terminal is disposed in each of the search nodes.
  • The search terminal includes at least: a network interface, a search controller, a search storage space, a retrieval module, and an aggregation sorting module.
  • The network interface is connected to the search controller, and the search terminal is connected to the Internet through the network interface.
  • The search controller is the core control device of the search terminal and controls the work of each functional module in the terminal.
  • The search storage space is the storage device of the search terminal and includes at least an index library and a point-to-point link library.
  • The index library stores index information of web pages for the search terminal to search.
  • The point-to-point link library stores link information between this search node and its adjacent search nodes.
  • The point-to-point link library is directly connected to the search controller, which reads the link information to establish communication links between nodes.
  • The index library is connected to the search controller through the retrieval module.
  • The retrieval module, under the control of the search controller, searches the index library according to the keywords of the user's search request and sends the retrieval results to the aggregation sorting module.
  • The aggregation sorting module is connected to the search controller and to the retrieval module. It performs aggregation sorting on the retrieval results sent by the retrieval module and sends the sorted results to the search controller.
  • Each search node in the peer-to-peer network-based Internet search system is provided with a search terminal of the above structure.
  • When a search request is initiated at a search node, the search controller of that node on the one hand searches the local index library through the local retrieval module, and on the other hand reads the link information stored in the point-to-point link library and sends the search request to the other search nodes adjacent to it.
  • Each of those search nodes in turn performs a local search by the same process and continues to forward the search request to the next level of nodes.
  • The retrieval results of all search nodes are finally collected at the node that originally initiated the search request, where they are aggregated and sorted and the final result is fed back to the initiating user, completing the entire search process.
  • The peer-to-peer Internet search system designed by the present invention differs essentially from existing search systems.
  • The search system does not need a dedicated search server. Instead, a search terminal is set at each search node so that the nodes form a peer-to-peer network, and reliable search results are obtained through a limited number of search request forwards between nodes.
  • The index information that would otherwise occupy a large amount of storage in a search engine server data center is distributed into the search storage space of each search node.
  • The access traffic and retrieval processing originally borne by the search server are likewise distributed to the search nodes. In this way, the problem of excessive server load in Internet search technology can be fundamentally solved, and the processing capacity of Internet search can keep up with the growth of Internet content. The technical effect of the search system given by the present invention is therefore evident.
  • The second embodiment builds on the first embodiment: a webpage buffer and an index module are further disposed in the search terminal.
  • The webpage buffer is connected to the network interface and stores data related to the web pages the user browses, such as the address, title, content, time, and website of each page.
  • The index module is connected to the webpage buffer and to the index library. Under the control of the search controller, it takes webpage data from the webpage buffer, extracts the index information of the page from that data, and sends the index information to the index library to update it.
  • In this embodiment, two modules, a webpage buffer and an index module, are added to the search terminal.
  • The index information stored in the index library of the search terminal can thus be updated according to the web pages the user usually browses. Not only is the content searchable by the system continuously updated through the user's ordinary use, but, because the search scope of the system is the set of index libraries owned by the users, such an update mechanism also helps users find useful information.
  • The third embodiment builds on the first embodiment: at least one network server is further provided in the peer-to-peer network.
  • The network server includes: a network interface, a server controller, and a search node link library.
  • The search node link library stores link information of several search nodes.
  • The server controller is connected to the search node link library on the one hand and, through the network interface, to the Internet on the other.
  • The network server in this embodiment differs substantially from the server in existing search systems.
  • It is used only when a user initializes the local search terminal: the network server allocates to the search terminal the link information of a batch of adjacent search nodes so that the terminal can join the peer-to-peer network of the search system. The server does not undertake any retrieval function, so its establishment and maintenance costs are very low.
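  • The role of the network server described above, handing a newly initialized search terminal a batch of neighbors and nothing more, might look like the following sketch. The names `NodeAllocationServer`, `register`, and `batch_size` are hypothetical, and choosing neighbors at random is an assumption; the text does not say how the batch is selected.

```python
import random
from typing import Optional


class NodeAllocationServer:
    """Keeps a search node link library and hands out neighbors on request.

    It performs no retrieval; it only helps new terminals join the network.
    """

    def __init__(self, batch_size: int = 3, seed: Optional[int] = None):
        self.link_library: list[str] = []   # addresses of known search nodes
        self.batch_size = batch_size
        self._rng = random.Random(seed)     # seedable for reproducible runs

    def register(self, address: str) -> list[str]:
        # Pick neighbors among already-known nodes (assumed: at random),
        # then record the newcomer so later terminals can be linked to it.
        count = min(self.batch_size, len(self.link_library))
        neighbors = self._rng.sample(self.link_library, count)
        self.link_library.append(address)
        return neighbors


# Usage: the first terminal has no neighbors yet; later ones receive a batch.
server = NodeAllocationServer(batch_size=2, seed=0)
first = server.register("node-1")
second = server.register("node-2")
```

  • The returned addresses would be written into the terminal's point-to-point link library; after that, searches run between peers without involving the server.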
  • The fourth embodiment builds on the first embodiment: a search result library is further provided in the search storage space.
  • The search result library stores the results of searches the search node has carried out, so that results can be given faster and the search process simplified when the same search is performed later.
  • The fifth embodiment builds on the first embodiment: a feedback information base is further disposed in the search storage space.
  • The feedback information base stores the user's feedback on completed search results, which serves as a reference for the aggregation sorting of search results when the same search is performed later.
  • The sixth embodiment builds on the first embodiment: a user usage frequency library is further disposed in the search storage space.
  • The user usage frequency library stores how frequently the user browses each web page, as a reference for the aggregation sorting of search results.
  • The above embodiments present several possible implementations of the peer-to-peer network-based Internet search system designed by the present invention. The key point of the design is to set a search terminal at each search node so that the nodes form a peer-to-peer network, thereby distributing the storage, access, and computation load borne by the server side in existing search systems to the search terminals of the search nodes. This greatly reduces the load on the search server and even makes it possible to do without a dedicated search server altogether.
  • This structure completely breaks with the architecture of existing search systems, provides a new search system architecture, and achieves a notable technical effect.
  • FIG. 5 is a flow chart of an Internet search method based on a peer-to-peer network, and is also a basic flow chart of the search method of the present invention.
  • The search method is implemented on the search system described in the foregoing first embodiment and includes the following steps:
  • (3) the search controller of the original search node reads the link information in its local point-to-point link library and forwards the search request to the search nodes connected to the original search node; here, the execution order of steps (2) and (3) can be exchanged, and the two steps can even be processed in parallel. Whichever execution order is used, it should be considered within the scope of this patent.
  • (4) a search node that receives the search request responds to it and determines whether it satisfies the end forwarding condition; if the condition is not satisfied, the node is an intermediate search node and step (5) is performed; if the condition is satisfied, the node is a final search node and step (7) is performed;
  • The end forwarding condition is set so that the search process terminates and the search request is not passed on indefinitely.
  • This end forwarding condition can be implemented in a variety of ways depending on design needs, so no specific implementation is given here.
  • (5) the search controller of the intermediate search node reads the link information in its point-to-point link library and forwards the search request to the search nodes connected to the intermediate search node;
  • (6) the intermediate search node responds to the search request: its search controller controls the retrieval module to search the index library, the node returns its own retrieval result together with the results returned by its subordinate search nodes to its superior search node, and the process returns to step (4);
  • (7) the final search node responds to the search request: its search controller controls the retrieval module to search the index library and returns the retrieval result to its superior search node;
  • (8) the search controller of the original search node controls the aggregation sorting module to aggregate and sort the results returned by the other search nodes together with the results retrieved from the local index library, and sends the sorted results to the user, completing the search.
  • The aggregation sorting of search results described here is a result-ranking technique that is common in existing Internet search technologies, so the specific aggregation sorting process is not described in detail.
  • In the peer-to-peer Internet search method designed by the present invention, the user initiates a search request from the local search node, and the request is sent to other search nodes level by level through the message publishing and return mechanism of the peer-to-peer network.
  • Each search node responds to the search request and searches the index library it stores locally; the retrieval results of all search nodes are finally collected at the original search node, which aggregates and sorts them.
  • The Internet search system and method designed by the present invention completely change the architecture of existing search engines and complete the Internet search task through a new peer-to-peer search mechanism.
  • This kind of search system needs only a very simple node allocation server (see the third embodiment) and can even do without any dedicated search server, which completely solves the workload problem of search engine servers in the prior art.
  • Because the Internet search method directly uses the index libraries local to the search nodes as its search scope, and these reflect the users' actual usage, the results obtained should be closer to the users' actual needs.
  • This embodiment builds on the seventh embodiment and further provides an update mechanism for the index library of each search node, implemented on the search system of the foregoing second embodiment.
  • The index library of each search node used in steps (2), (6), and (7) is updated by the following steps:
  • the search controller of the search node controls the index module to take webpage data from the webpage buffer and extract the index information of the page from that data;
  • the search controller stores the index information in the index library.
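  • The two update steps above can be illustrated with a short sketch. Everything concrete here is an assumption made for illustration: the page record layout (`address`, `title`, `content`), the lowercase word tokenizer used as "index information", and the function names `extract_index_info` and `update_index`. The text does not specify how index information is derived from a page.

```python
import re


def extract_index_info(page: dict) -> set[str]:
    # Assumed scheme: lowercase alphanumeric tokens from title and content.
    text = f"{page['title']} {page['content']}"
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def update_index(index: dict[str, set[str]], buffer: list[dict]) -> None:
    # First step: take each buffered page and extract its index information.
    # Second step: store that information in the index library.
    # The buffer is emptied as pages are consumed.
    while buffer:
        page = buffer.pop()
        for keyword in extract_index_info(page):
            index.setdefault(keyword, set()).add(page["address"])


# Usage: one browsed page moves from the webpage buffer into the index.
index_library: dict[str, set[str]] = {}
webpage_buffer = [
    {"address": "http://x.example", "title": "P2P Search", "content": "peer nodes"},
]
update_index(index_library, webpage_buffer)
```

  • Because the buffer is filled by ordinary browsing, the index in this sketch grows only with pages the user has actually visited, which is the property the surrounding text emphasizes.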
  • The present invention thus provides an index library update mechanism for the search system.
  • With this mechanism, the web index library of the search system is continuously updated as users access the network, so that it can better keep up with the growth of Internet content.
  • This index library update mechanism is more reasonable than the prior-art approach of obtaining Internet content solely through web crawlers.
  • This embodiment builds on the foregoing seventh embodiment and adds an aggregation sorting step at the intermediate search nodes, which reduces the frequency of information transmission between search nodes and makes result delivery more efficient.
  • The specific plan is as follows:
  • Step (6) includes the following specific steps:
  • (61a) the intermediate search node responds to the search request; its search controller controls the retrieval module to search the index library and saves the retrieval result;
  • (61b) the search controller of the intermediate search node controls the aggregation sorting module to re-aggregate and sort the saved result together with any newly returned result;
  • (61c) the search controller checks whether the re-aggregated result differs from the previously saved result; if it does, the updated result is returned to the superior search node; if not, step (61d) is performed directly;
  • (61d) the intermediate search node waits for its subordinate search nodes to return results and, when one does, returns to step (61b).
  • In this way, results are pre-aggregated and sorted at each intermediate search node, and the intermediate node returns data upward only when a result returned by a subordinate node changes the result it has saved. This reduces unnecessary return transmissions and improves data transfer efficiency.
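  • A minimal sketch of this "return upward only on change" behaviour follows. The scored `(score, url)` result tuples, the top-`limit` cutoff, and the names `merge_top` and `IntermediateNode` are hypothetical; the sketch also assumes the node sends its local result upward once right after saving it, which the text leaves open.

```python
def merge_top(saved: list[tuple[float, str]],
              incoming: list[tuple[float, str]],
              limit: int) -> list[tuple[float, str]]:
    # Stand-in for aggregation sorting: union, highest score first, top `limit`.
    merged = set(saved) | set(incoming)
    return sorted(merged, key=lambda r: (-r[0], r[1]))[:limit]


class IntermediateNode:
    def __init__(self, local_results: list[tuple[float, str]], limit: int = 3):
        self.limit = limit
        # Save the local retrieval result.
        self.saved = merge_top([], local_results, limit)
        # Assumption: the local result is sent upward once after saving.
        self.sent_upstream = [list(self.saved)]

    def on_subordinate_result(self, incoming: list[tuple[float, str]]) -> bool:
        # Re-aggregate the saved result with the newly returned one.
        merged = merge_top(self.saved, incoming, self.limit)
        if merged == self.saved:
            return False  # nothing changed: stay silent and keep waiting
        self.saved = merged
        self.sent_upstream.append(list(merged))  # return the update upward
        return True


# Usage: a weak subordinate result is absorbed, a strong one is passed on.
node = IntermediateNode([(0.9, "u1"), (0.5, "u2")], limit=2)
changed_by_weak = node.on_subordinate_result([(0.1, "u3")])
changed_by_strong = node.on_subordinate_result([(0.95, "u4")])
```

  • The saving comes from the cutoff: a subordinate result that cannot enter the node's top results never triggers a message to the superior node.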
  • Alternatively, step (6) includes the following specific steps:
  • (62a) the intermediate search node responds to the search request; its search controller controls the retrieval module to search the index library and saves the retrieval result;
  • (62b) the intermediate search node receives a result returned by a subordinate search node and determines whether all of its subordinate search nodes have returned results; if so, step (62d) is performed; otherwise, step (62c) is performed;
  • (62c) the intermediate search node waits for its subordinate search nodes to return results and, when one does, returns to step (62b);
  • (62d) the search controller of the intermediate search node controls the aggregation sorting module to aggregate and sort the locally saved result together with the results returned by all subordinate search nodes, and returns the sorted result to the superior search node.
  • In this scheme, each intermediate search node aggregates and sorts only after all of its subordinate search nodes have returned their results, and then passes the aggregated result to its superior search node.
  • An intermediate search node therefore does not return data every time it receives a result from a subordinate node but only once, which reduces unnecessary return transmissions and improves data transfer efficiency.
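  • The "wait for every subordinate, then return once" scheme can be sketched as below. The class `BatchingNode`, the use of node names to track who has answered, and alphabetical sorting as the aggregation step are assumptions made for illustration only.

```python
from typing import Optional


class BatchingNode:
    def __init__(self, local_results: list[str], subordinates: set[str]):
        self.saved = list(local_results)    # locally saved retrieval result
        self.pending = set(subordinates)    # subordinates that have not answered
        self.received: list[str] = []

    def on_subordinate_result(self, sender: str,
                              results: list[str]) -> Optional[list[str]]:
        # Record the result and check whether every subordinate has answered.
        self.received.extend(results)
        self.pending.discard(sender)
        if self.pending:
            return None  # still waiting: nothing is sent upward yet
        # All answers are in: aggregate once and hand one result set upward.
        return sorted(set(self.saved) | set(self.received))


# Usage: the first answer produces no upstream message, the last one does.
node = BatchingNode(["u2"], {"b", "c"})
after_first = node.on_subordinate_result("b", ["u3"])
after_last = node.on_subordinate_result("c", ["u1", "u2"])
```

  • A practical version would presumably also need a timeout for subordinates that never answer; the text does not discuss that case, so it is left out here.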
  • The end forwarding condition in step (4) is set so that the search process terminates and the search request is not delivered indefinitely.
  • This end forwarding condition can be implemented in a variety of ways depending on design needs. This embodiment specifies one such condition.
  • The end forwarding condition in this embodiment is a forwarding level limit: after the original search node issues a search request, forwarding stops once the request has passed through a set number of levels of search nodes, as follows:
  • step (1) is changed to: the user initiates a search request at the original search node and sets the number of forwarding levels for the request;
  • step (4) is changed to: a search node that receives the search request responds to it and determines whether the forwarding level limit has been reached; if not, the node is an intermediate search node and step (5) is performed; if so, the node is a final search node and step (7) is performed.
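  • A forwarding level limit behaves much like a time-to-live counter carried in the request. The sketch below assumes the request carries the number of levels still allowed (`levels_left`) and that each forward decrements it; the names `SearchRequest` and `classify_and_forward` are hypothetical, and the text does not prescribe how the level count is transported.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class SearchRequest:
    keyword: str
    levels_left: int  # set by the original search node when the search starts


def classify_and_forward(
    request: SearchRequest,
) -> tuple[str, Optional[SearchRequest]]:
    # A node that has used up the level budget is a final search node.
    if request.levels_left <= 0:
        return "final", None
    # Otherwise it is an intermediate node and forwards a copy with one
    # level fewer to the nodes in its point-to-point link library.
    forwarded = SearchRequest(request.keyword, request.levels_left - 1)
    return "intermediate", forwarded


# Usage: a request that may pass through two more levels of nodes.
role_1, next_1 = classify_and_forward(SearchRequest("p2p", 2))
role_2, next_2 = classify_and_forward(next_1)
role_3, next_3 = classify_and_forward(next_2)
```

  • In this sketch the third node in the chain sees a budget of zero and becomes the final search node, so the request travels a bounded distance regardless of how large the network is.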
  • The end forwarding condition in this embodiment is a limit on the number of search results per branch, that is, an upper limit on the total number of results retrieved by all search nodes on the path from the original search node to a given search node; once this upper limit is reached, the branch no longer forwards the search request, as follows:
  • step (1) is changed to: the user initiates a search request at the original search node and sets an upper limit on the number of search results per branch;
  • step (4) is changed to: a search node that receives the search request responds to it and determines whether the number of results accumulated along the branch from the original search node to this node has reached the upper limit; if not, the node is an intermediate search node and step (5) is performed; if so, the node is a final search node and step (7) is performed.
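  • The branch result limit can be sketched as a running counter carried along each branch. Two points are assumptions: that the request itself carries the count accumulated so far, and that the receiving node's own hits are included before the comparison. The function name `handle_on_branch` is hypothetical.

```python
def handle_on_branch(local_hits: int, hits_so_far: int,
                     upper_limit: int) -> tuple[str, int]:
    """Return (role, accumulated_hits) for a node receiving a search request.

    hits_so_far counts results already found by the nodes between the
    original search node and this one (assumed to travel with the request).
    """
    accumulated = hits_so_far + local_hits  # assumption: own hits count too
    role = "final" if accumulated >= upper_limit else "intermediate"
    return role, accumulated


# Usage: with a limit of 10, a branch keeps forwarding at 7 hits
# and stops once a node brings the total to 12.
role_a, total_a = handle_on_branch(local_hits=3, hits_so_far=4, upper_limit=10)
role_b, total_b = handle_on_branch(local_hits=5, hits_so_far=total_a, upper_limit=10)
```

  • Unlike a level limit, this condition lets sparse branches run deeper while productive branches stop early.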
  • This embodiment builds on the seventh embodiment and adds to the search method the saving of completed search results, which simplifies the handling of later identical search requests.
  • This embodiment is implemented on the Internet search system of the foregoing fourth embodiment, as follows:
  • the search controller of the search node first checks the search result library for a result corresponding to the search request; if one exists, the index library search is skipped and the result in the search result library is used as this node's retrieval result; if not, the index library search step is performed;
  • After step (8), the following step is further included:
  • the original search node sends the aggregated result obtained in step (8) to each search node that participated in the current search request; each search node stores the aggregated result in its local search result library.
  • Each search node may also store the aggregated results selectively in this step and need not store all of them.
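  • The search result library acts as a cache in front of the index library. The sketch below assumes results are keyed by the exact keyword string and uses hypothetical names (`CachingNode`, `store_aggregated`, `index_lookups`); the text does not specify the key or any expiry policy.

```python
class CachingNode:
    def __init__(self, index: dict[str, list[str]]):
        self.index = index                                # index library
        self.result_library: dict[str, list[str]] = {}    # search result library
        self.index_lookups = 0                            # counts index searches

    def retrieve(self, keyword: str) -> list[str]:
        # If a stored result exists, skip the index library search entirely.
        if keyword in self.result_library:
            return self.result_library[keyword]
        self.index_lookups += 1
        return list(self.index.get(keyword, []))

    def store_aggregated(self, keyword: str, aggregated: list[str]) -> None:
        # Called when the original node distributes the aggregated result.
        self.result_library[keyword] = list(aggregated)


# Usage: the second identical search is answered from the result library.
node = CachingNode({"p2p": ["u1"]})
first = node.retrieve("p2p")
node.store_aggregated("p2p", ["u0", "u1"])
second = node.retrieve("p2p")
```

  • Note that the cached answer in this sketch is the network-wide aggregated result rather than the node's own hits, which follows from storing what the original node sends back.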
  • This embodiment builds on the seventh, ninth, and tenth embodiments and adds to the search method the use of user feedback on completed search results as a reference for the aggregation sorting of later search requests.
  • This embodiment is implemented on the Internet search system of the foregoing fifth embodiment, as follows:
  • the search controller of the search node reads the user feedback information in the feedback information base and passes it to the aggregation sorting module as a parameter, to assist the module in aggregating and sorting the search results;
  • After step (8), the following steps are further included:
  • This embodiment builds on the seventh, ninth, and tenth embodiments and adds to the search method a mechanism for recording how frequently the user visits web pages, used as a reference for the aggregation sorting of later search requests.
  • This embodiment is implemented on the Internet search system of the foregoing sixth embodiment, as follows:
  • the search controller of the search node reads the web page browsing frequency information in the user usage frequency library and passes it to the aggregation sorting module as a parameter, to assist the module in aggregating and sorting the search results;
  • After step (8), the following step is further included:
  • the search node sends the frequency information of the user's web page browsing to each search node participating in the search request; each search node stores this frequency information in its local user usage frequency library.
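  • Using browsing frequency as a sorting parameter might look like the sketch below. Ranking purely by visit count, with the URL as a tie-breaker, is an assumption chosen for simplicity; the text only says the frequency information assists the aggregation sorting module. The function name `aggregate_sort` is hypothetical.

```python
from collections import Counter


def aggregate_sort(results: list[str], usage: Counter) -> list[str]:
    # Deduplicate, then rank by visit count (descending); pages that were
    # never visited count as 0, and the URL breaks ties for a stable order.
    return sorted(set(results), key=lambda url: (-usage[url], url))


# Usage: "u2" was visited most often, "u3" never.
usage_frequency = Counter({"u2": 5, "u1": 1})
ranked = aggregate_sort(["u1", "u3", "u2", "u1"], usage_frequency)
```

  • A fuller ranking function would presumably blend frequency with keyword relevance and user feedback; this sketch isolates the frequency signal only.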
  • The peer-to-peer network-based Internet search system and method designed by the present invention set a search terminal at each search node so that the nodes form a peer-to-peer network.
  • The search system sends the search request to other search nodes level by level through the message publishing and return mechanism of the peer-to-peer network.
  • Each search node responds to the search request and searches the index library it stores locally; the retrieval results of all search nodes are finally collected at the original search node, which aggregates and sorts them.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Computer And Data Communications (AREA)

Abstract

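The present invention provides an Internet search system and method based on a peer-to-peer network. A search terminal is provided at each search node so that the nodes form a peer-to-peer network. A search request is sent to other search nodes level by level using the message publishing and return mechanism of the peer-to-peer network. Each search node responds to the search request by searching the index library stored locally at that node; the search results of all search nodes are finally collected at the original search node, which aggregates and sorts them. This system structure and search method break completely with the architecture of existing search systems and provide an entirely new search system architecture, thoroughly solving the workload problem of search engine servers in the prior art.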

Description

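Internet Search System and Method Based on a Peer-to-Peer Network
Technical Field
The present invention relates to an Internet search system and its search method, and in particular to a search system and method that provide Internet search services over a peer-to-peer network, so that the data storage and computation of the search system are distributed across the user nodes and the cost of building and maintaining a search engine is reduced. It belongs to the technical field of Internet search.
Background
Internet search is an important technology among Internet applications. People use Internet search engines to find the content they care about in the vast amount of data on the Internet.
In existing Internet search technology, a search service provider generally sets up a group of search engine servers. These servers crawl the information of web pages on the Internet and, using a certain data structure, organize and store index information for the searchable data of each page (such as address, title, content, and anchor text), thereby building a huge index library. To search, a user generally sends search keywords to the search engine servers; the servers search the index library for the keywords, sort the results according to certain rules, and send them to the requesting user.
However, this existing Internet search technology has the following problems:
First, the large distributed storage and computing systems built centrally in search engine data centers are very expensive to build, maintain, and operate, and require large amounts of energy and network bandwidth.
Second, the storage capacity and computing power of data centers cannot grow as fast as Internet content. As a result, the proportion of indexed pages among all pages declines, which hinders improvement of the relevance of search results.
Third, with this technology it is difficult to improve and adjust the relevance ranking of search results, and manually intervening in the ranking involves an enormous amount of work.
In view of these problems in the prior art, the present invention provides a search system and method that implement Internet search services over a peer-to-peer network.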
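Summary of the Invention
The main object of the present invention is to solve the problems in the prior art by providing a search system and method that implement Internet search services over a peer-to-peer network, so that the data storage and computation of the search system are distributed across the user nodes and the cost of building and maintaining a search engine is reduced.
The object of the present invention is achieved by the following technical solutions:
An Internet search system based on a peer-to-peer network, characterized in that: the search system consists of a number of peer search nodes; the search nodes are connected through the Internet to form a peer-to-peer network; and a search terminal is provided in each search node;
the search terminal comprises: a network interface, a search controller, a search storage space, a retrieval module, and an aggregation sorting module;
the network interface is connected to the search controller, and the search terminal is connected to the Internet through the network interface;
the search storage space comprises an index library and a peer-to-peer link library; the index library stores index information of web pages; the peer-to-peer link library stores link information between this search node and its neighboring search nodes; the peer-to-peer link library is directly connected to the search controller; the index library is connected to the search controller through the retrieval module;
the retrieval module is controlled by the search controller, searches the index library according to the keywords of a user's search request, and sends the search results to the aggregation sorting module;
the aggregation sorting module is connected to the search controller and to the retrieval module, and aggregates and sorts the search results sent by the retrieval module.
An Internet search method based on a peer-to-peer network, implemented on the foregoing Internet search system, characterized by comprising the following steps:
(1) a user initiates a search request from an original search node;
(2) the original search node responds to the search request, and its search controller directs the retrieval module to search the index library;
(3) the search controller of the original search node retrieves the link information in the local peer-to-peer link library and forwards the search request to the search nodes connected to the original search node;
(4) a search node that receives the search request responds to it and determines whether it satisfies the end-of-forwarding condition; if not, the node is deemed an intermediate search node and step (5) is performed; if so, the node is deemed a final search node and step (7) is performed;
(5) the search controller of the intermediate search node retrieves the link information in the local peer-to-peer link library and forwards the search request to the search nodes connected to the intermediate search node;
(6) the intermediate search node responds to the search request, its search controller directs the retrieval module to search the index library, and the node returns its search results, together with the search results returned by its lower-level search nodes, to its upper-level search node; the method then returns to step (4);
(7) the final search node responds to the search request, its search controller directs the retrieval module to search the index library, and the node returns the search results to its upper-level search node;
(8) the search controller of the original search node directs the aggregation sorting module to aggregate and sort the search results returned by the other search nodes together with the results found in the local index library, and sends the aggregated and sorted results to the user, completing the search.
The beneficial effects of the present invention are as follows. In the Internet search system and method based on a peer-to-peer network, a search terminal is provided at each search node so that the nodes form a peer-to-peer network. A search request is sent to other search nodes level by level using the message publishing and return mechanism of the peer-to-peer network. Each search node responds to the search request by searching the index library stored locally at that node; the search results of all search nodes are finally collected at the original search node, which aggregates and sorts them. This system structure and search method break completely with the architecture of existing search systems and provide an entirely new search system architecture, thoroughly solving the workload problem of search engine servers in the prior art.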
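Brief Description of the Drawings
FIG. 1 is a schematic structural diagram of the first embodiment of the Internet search system based on a peer-to-peer network;
FIG. 2 is a schematic structural diagram of the second embodiment of the Internet search system based on a peer-to-peer network;
FIG. 3 is a schematic structural diagram of the third embodiment of the Internet search system based on a peer-to-peer network;
FIG. 4 is a schematic structural diagram of the fourth, fifth, and sixth embodiments of the Internet search system based on a peer-to-peer network;
FIG. 5 is a flowchart of the seventh embodiment of the Internet search method based on a peer-to-peer network;
FIG. 6 is a flowchart of the eighth embodiment of the Internet search method based on a peer-to-peer network.
Detailed Description
The present invention is further described below with reference to the drawings and embodiments.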
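Embodiment 1:
FIG. 1 is a schematic structural diagram of the first embodiment of the Internet search system based on a peer-to-peer network, and is also the basic structural diagram of the search system of the present invention. The Internet search system consists of a number of peer search nodes. The search nodes are connected through the Internet to form a peer-to-peer network. A search terminal is provided in each search node.
As shown in the figure, the search terminal comprises at least: a network interface, a search controller, a search storage space, a retrieval module, and an aggregation sorting module.
The network interface is connected to the search controller, and the search terminal is connected to the Internet through the network interface.
The search controller is the core control device of the search terminal and controls the operation of the functional modules in the terminal.
The search storage space is the storage device of the search terminal and contains at least an index library and a peer-to-peer link library. The index library stores index information of web pages for the search terminal to search. The peer-to-peer link library stores link information between this search node and its neighboring search nodes. The peer-to-peer link library is directly connected to the search controller, which retrieves the link information to establish communication links between nodes. The index library is connected to the search controller through the retrieval module.
The retrieval module is controlled by the search controller; it searches the index library according to the keywords of a user's search request and sends the search results to the aggregation sorting module.
The aggregation sorting module is connected to the search controller and to the retrieval module; it aggregates and sorts the search results sent by the retrieval module and sends the aggregated and sorted results to the search controller.
As described above, every search node in this peer-to-peer Internet search system is provided with a search terminal of the above structure. When one search node initiates a search request, its search controller searches the local index library through the local retrieval module and, at the same time, uses the link information stored in the peer-to-peer link library to send the search request to the neighboring search nodes. Those nodes respond to the request and repeat the same process, completing a local search while forwarding the request to the next level of nodes. After a finite number of forwarding levels, the search results of all search nodes are collected at the node that initiated the request, aggregated and sorted, and returned to the initiating user as the final search results, completing the search.
It can be seen that the Internet search system based on a peer-to-peer network of the present invention differs fundamentally from existing search systems. This system needs no dedicated search server. Instead, a search terminal is provided at each search node so that the nodes form a peer-to-peer network, and a trustworthy search result is obtained by forwarding the search request across a finite number of levels of nodes. The index information that existing systems must store in a search engine data center, occupying large amounts of storage, is thereby distributed across the search storage spaces of the search nodes. The access traffic and retrieval workload formerly borne by the search servers are likewise distributed across the search nodes. This fundamentally addresses the excessive load on servers in Internet search technology and allows the processing capacity of Internet search to keep pace with the growth of Internet content. The technical effect achieved by the search system of the present invention is therefore evident.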
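The structure of a search terminal in Embodiment 1 can be pictured with a minimal Python sketch. It is only an illustration under stated assumptions: the class name `SearchTerminal`, its fields, the toy inverted index, and the ordering rule in `aggregate_sort` are all hypothetical, since the patent names the modules but does not specify their data structures or ranking rule.

```python
from dataclasses import dataclass, field


@dataclass
class SearchTerminal:
    """One search terminal per search node (names here are illustrative)."""
    node_id: str
    # Index library: keyword -> set of page URLs (a toy inverted index).
    index_library: dict[str, set[str]] = field(default_factory=dict)
    # Peer-to-peer link library: identifiers of neighboring search nodes.
    link_library: list[str] = field(default_factory=list)

    def retrieve(self, keyword: str) -> list[str]:
        """Retrieval module: look the keyword up in the local index library."""
        return sorted(self.index_library.get(keyword, set()))

    @staticmethod
    def aggregate_sort(*result_lists: list[str]) -> list[str]:
        """Aggregation sorting module: merge, de-duplicate, and order results.

        Ordering by how many lists contain a URL is an assumed placeholder rule.
        """
        counts: dict[str, int] = {}
        for results in result_lists:
            for url in results:
                counts[url] = counts.get(url, 0) + 1
        return sorted(counts, key=lambda url: (-counts[url], url))
```

The search controller would correspond to whatever code calls `retrieve`, reads `link_library` to contact neighbors, and passes the collected lists to `aggregate_sort`.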
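Embodiment 2:
As shown in FIG. 2, this embodiment builds on the first embodiment by additionally providing a web page buffer and an indexing module in the search terminal. The web page buffer is connected to the network interface and stores data about the web pages the user browses, such as each page's address, title, content, time, and website. The indexing module is connected to the web page buffer and to the index library. Under the control of the search controller, the indexing module reads web page data from the web page buffer, extracts the page's index information from it, and sends that index information to the index library to update it.
By adding the web page buffer and the indexing module to the search terminal, this embodiment allows the index information stored in the terminal's index library to be updated according to the pages the user normally browses. The content searched by the system is thus continuously updated through users' everyday use. Moreover, because the scope searched by this system is based on the index libraries held by the individual users, this update mechanism makes it more likely that users find information that is actually useful.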
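Embodiment 3:
As shown in FIG. 3, this embodiment builds on the first embodiment by additionally providing at least one network server in the peer-to-peer network. The network server comprises: a network interface, a server controller, and a search node link library. The search node link library stores link information for a number of search nodes. The server controller is connected to the search node link library on one side and to the Internet, through the network interface, on the other.
The network server of this embodiment differs fundamentally from the servers in existing search systems. It is used only when a user initializes the local search terminal: the server assigns the terminal link information for a batch of neighboring search nodes so that the terminal can join the system's peer-to-peer network. It performs no retrieval function. The network server of this embodiment is therefore very inexpensive to set up and maintain.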
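The role of this network server can be sketched as follows. This is a hedged illustration, not the patent's implementation: the class `BootstrapServer`, the `batch_size` parameter, and the random choice of neighbors are assumptions, because the text only says that the server assigns "a batch" of neighboring nodes when a terminal is initialized.

```python
import random


class BootstrapServer:
    """Network server of Embodiment 3: hands out neighbor links, never searches."""

    def __init__(self, batch_size=3, seed=None):
        self.node_links = []            # search node link library
        self.batch_size = batch_size    # assumed size of a "batch" of neighbors
        self._rng = random.Random(seed)

    def register(self, node_address):
        """Return up to batch_size known nodes, then record the newcomer."""
        count = min(self.batch_size, len(self.node_links))
        neighbors = self._rng.sample(self.node_links, count)
        self.node_links.append(node_address)
        return neighbors
```

A new terminal would store the returned addresses in its own peer-to-peer link library; after that, searches proceed without contacting the server.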
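Embodiment 4:
As shown in FIG. 4, this embodiment builds on the first embodiment by additionally providing a search result library in the search storage space. The search result library stores the results of searches carried out through this search node, so that results can be produced faster and the search flow simplified when the same search is performed again.
Embodiment 5:
As shown in FIG. 4, this embodiment builds on the first embodiment by additionally providing a feedback information library in the search storage space. The feedback information library stores the user's feedback on completed search results, as a reference for aggregating and sorting the results when the same search is performed again.
Embodiment 6:
As shown in FIG. 4, this embodiment builds on the first embodiment by additionally providing a user usage frequency library in the search storage space. The user usage frequency library stores information on how frequently the user browses web pages, as a reference for aggregating and sorting search results.
The embodiments above present several feasible embodiments of the Internet search system based on a peer-to-peer network of the present invention. The key point of the design is that a search terminal is provided at each search node so that the nodes form a peer-to-peer network, distributing the storage, access, and computation load borne by servers in existing search systems across the search terminals of the search nodes. This greatly reduces the load on search servers; dedicated search servers can even be dispensed with entirely. This system structure breaks completely with the architecture of existing search systems, provides an entirely new search system architecture, and achieves a significant technical effect.
The search method of an Internet search system with the above structure is described below.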
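Embodiment 7:
FIG. 5 is a flowchart of the Internet search method based on a peer-to-peer network, and is also the basic flowchart of the search method of the present invention. The search method is implemented on the search system of the first embodiment and comprises the following steps:
(1) a user initiates a search request from an original search node;
(2) the original search node responds to the search request, and its search controller directs the retrieval module to search the index library;
(3) the search controller of the original search node retrieves the link information in the local peer-to-peer link library and forwards the search request to the search nodes connected to the original search node;
Steps (2) and (3) may be performed in either order, or in parallel. Whichever order is used falls within the scope of protection of this patent.
(4) a search node that receives the search request responds to it and determines whether it satisfies the end-of-forwarding condition; if not, the node is deemed an intermediate search node and step (5) is performed; if so, the node is deemed a final search node and step (7) is performed;
The end-of-forwarding condition is set in order to terminate the overall search flow and prevent the search request from being passed on indefinitely. It can be implemented in various ways according to design needs, so no specific form is given here.
(5) the search controller of the intermediate search node retrieves the link information in the local peer-to-peer link library and forwards the search request to the search nodes connected to the intermediate search node;
(6) the intermediate search node responds to the search request, its search controller directs the retrieval module to search the index library, and the node returns its search results, together with the search results returned by its lower-level search nodes, to its upper-level search node; the method then returns to step (4);
(7) the final search node responds to the search request, its search controller directs the retrieval module to search the index library, and the node returns the search results to its upper-level search node;
(8) the search controller of the original search node directs the aggregation sorting module to aggregate and sort the search results returned by the other search nodes together with the results found in the local index library, and sends the aggregated and sorted results to the user, completing the search.
The aggregation and sorting of search results described here is a technique, fairly common in existing Internet search technology, for adjusting the ordering of search results. The specific aggregation and sorting procedure is therefore not described in detail here.
It can thus be seen that in the Internet search method based on a peer-to-peer network of the present invention, the user initiates a search request from the local search node, and the request is sent level by level to other search nodes through the message publishing and return mechanism of the peer-to-peer network. Each search node responds to the search request by searching the index library stored locally at that node; the search results of all search nodes are finally collected at the original search node, which aggregates and sorts them. Because the nodes of a peer-to-peer network form a mesh, after the search request message has been forwarded a finite number of times in this way, there is reason to believe that it has reached a set of search nodes large enough to be relied upon. There is therefore also reason to believe that the search results obtained in this way are trustworthy.
It is readily seen that the Internet search system and method of the present invention completely change the architecture of existing search engines and accomplish Internet search through an entirely new peer-to-peer search mechanism. Such a search system needs only a very simple node allocation server (see the third embodiment) and can even dispense with dedicated search servers altogether, thoroughly solving the workload problem of search engine servers in the prior art.
Moreover, because this search method takes the local index libraries of the search nodes directly as the scope of the search, which is closer to what users actually use, the search results obtained should be closer to users' actual needs.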
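Steps (1) to (8) of this embodiment can be simulated in a single process with the Python sketch below. Several things in it are assumptions rather than part of the patent text: the in-memory `Network` dictionary stands in for real network messages, the end-of-forwarding condition is passed in as a hypothetical `should_stop` callback, a `visited` set is added so that a request does not loop in a mesh, and the final aggregation sort is a placeholder that only de-duplicates and orders alphabetically.

```python
from typing import Callable

# Toy in-process network: node id -> (index library, neighbor ids).
Network = dict[str, tuple[dict[str, list[str]], list[str]]]


def local_search(network: Network, node: str, keyword: str) -> list[str]:
    """Steps (2), (6), (7): search the node's own index library."""
    index_library, _ = network[node]
    return list(index_library.get(keyword, []))


def handle_request(network, node, keyword, depth, should_stop, visited):
    """Steps (4)-(7): search locally, forward unless the end condition holds."""
    results = local_search(network, node, keyword)
    if should_stop(node, depth):            # final search node: step (7)
        return results
    _, neighbors = network[node]
    for peer in neighbors:                  # intermediate node: steps (5), (6)
        if peer not in visited:             # assumed loop guard
            visited.add(peer)
            results += handle_request(network, peer, keyword, depth + 1,
                                      should_stop, visited)
    return results


def search(network: Network, origin: str, keyword: str,
           should_stop: Callable[[str, int], bool]) -> list[str]:
    """Steps (1)-(3) and (8) at the original search node."""
    visited = {origin}
    local = local_search(network, origin, keyword)
    returned: list[str] = []
    _, neighbors = network[origin]
    for peer in neighbors:
        if peer not in visited:
            visited.add(peer)
            returned += handle_request(network, peer, keyword, 1,
                                       should_stop, visited)
    # Step (8): placeholder aggregation sort (de-duplicate, alphabetical).
    return sorted(set(local + returned))
```

For example, calling `search(net, "A", "p2p", lambda node, depth: depth >= 2)` forwards the request two levels out from node `A`. The sketch visits neighbors one after another; a real deployment would forward to them concurrently, which the description explicitly permits for steps (2) and (3).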
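Embodiment 8:
This embodiment builds on the seventh embodiment by further providing an update mechanism for the index library of each search node; the method is implemented on the search system of the second embodiment. As shown in FIG. 6, the index library of each search node in steps (2), (6), and (7) is updated through the following steps:
(a1) data about the web pages browsed by the user is stored in the web page buffer;
(a2) the search controller of the search node directs the indexing module to read the web page data from the web page buffer and extract the page's index information from it;
(a3) the search controller stores the index information in the index library.
Through this procedure, the present invention provides an index library update mechanism for the search system. With it, the system's web page index library is continuously updated as users access the network, and can therefore keep up reasonably well with the growth of Internet content today. This update mechanism is clearly far more reasonable than simply acquiring Internet content with web crawlers, as in the prior art.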
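Steps (a1) to (a3) amount to a small buffer-then-index pipeline, sketched below in Python. The function names, the dictionary layout of a buffered page, and the naive word tokenizer are assumptions for illustration; the patent does not say how index information is extracted from a page.

```python
import re
from collections import deque

page_buffer = deque()      # web page buffer
index_library = {}         # index library: term -> set of URLs


def on_page_viewed(url, title, content):
    """Step (a1): store data about a browsed page in the buffer."""
    page_buffer.append({"url": url, "title": title, "content": content})


def update_index():
    """Steps (a2)-(a3): drain the buffer, extract index terms, store them."""
    processed = 0
    while page_buffer:
        page = page_buffer.popleft()
        text = f"{page['title']} {page['content']}".lower()
        for term in set(re.findall(r"\w+", text)):   # naive tokenizer (assumption)
            index_library.setdefault(term, set()).add(page["url"])
        processed += 1
    return processed       # number of pages indexed in this pass
```

A browser hook would call `on_page_viewed` for each visited page, and the search controller would call `update_index` periodically or before answering a search request.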
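Embodiment 9:
Because frequent transfer of search results between search nodes may consume considerable bandwidth and system resources, this embodiment builds on the seventh embodiment by adding an aggregation sorting step at the intermediate search nodes, reducing how often information is passed between search nodes and making the transfer more efficient. The specific scheme is as follows:
Step (6) comprises the following specific steps:
(61a) the intermediate search node responds to the search request, its search controller directs the retrieval module to search the index library, and the search results are saved;
(61b) when the intermediate search node receives search results returned by a lower-level search node, its search controller directs the aggregation sorting module to re-aggregate and re-sort the saved search results together with the newly returned search results;
(61c) the search controller checks whether the re-aggregated and re-sorted search results have changed the previously saved search results; if so, the updated search results are returned to the upper-level search node; if not, step (61d) is performed directly;
(61d) the intermediate search node waits for lower-level search nodes to return search results; when a lower-level search node returns search results, the method returns to step (61b).
With this method, search results are pre-aggregated and pre-sorted at each intermediate search node during the search, and an intermediate search node returns data only when the results returned by a lower-level node have changed the results it has saved. This reduces unnecessary return operations and thereby improves the efficiency of data transfer.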
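One way to read steps (61a) to (61d) is sketched below. It rests on an assumption the patent does not state: that each node keeps only a bounded, scored list of its best results (`top_k`), so that a batch of weak results from a lower-level node can leave the saved list unchanged. The class name and the score-based ranking are likewise hypothetical.

```python
class IntermediateNode:
    """Steps (61a)-(61d): send upstream only when the saved results change."""

    def __init__(self, local_results, top_k=3):
        self.top_k = top_k                       # assumed cap on saved results
        self.saved = self._rank(local_results)   # (61a) save local results
        self.sent_upstream = []                  # record of each return upstream

    def _rank(self, scored):
        """Keep the top_k URLs by descending score (ties broken by URL)."""
        best = sorted(scored.items(), key=lambda kv: (-kv[1], kv[0]))
        return dict(best[: self.top_k])

    def on_child_results(self, child_results):
        """(61b)-(61c): re-aggregate; return True if an update was sent."""
        merged = dict(self.saved)
        for url, score in child_results.items():
            merged[url] = max(score, merged.get(url, score))
        reranked = self._rank(merged)
        if reranked == self.saved:
            return False                         # (61d): keep waiting
        self.saved = reranked
        self.sent_upstream.append(list(reranked))
        return True
```

Under this reading, traffic toward the original node falls because low-scoring results are filtered out close to where they were found.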
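Embodiment 10:
This embodiment provides another solution with the same design purpose as the ninth embodiment. The specific scheme is as follows:
Step (6) comprises the following specific steps:
(62a) the intermediate search node responds to the search request, its search controller directs the retrieval module to search the index library, and the search results are saved;
(62b) the intermediate search node receives search results returned by a lower-level search node and determines whether all of its lower-level search nodes have returned search results; if so, step (62d) is performed; if not, step (62c) is performed;
(62c) the intermediate search node waits for lower-level search nodes to return search results; when a lower-level search node returns search results, the method returns to step (62b);
(62d) the search controller of the intermediate search node directs the aggregation sorting module to aggregate and sort the locally saved search results together with the search results returned by all lower-level search nodes, and returns the aggregated and sorted results to its upper-level search node.
With this method, each intermediate search node aggregates and sorts all search results only after all of its lower-level search nodes have returned theirs, and then returns the aggregated and sorted results to its upper-level search node. Intermediate nodes thus need not return data every time a lower-level node returns results; a single return suffices, which reduces unnecessary return operations and improves the efficiency of data transfer.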
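The wait-for-all behavior of steps (62a) to (62d) can be sketched as a small state holder. The class and method names are hypothetical, and the aggregation sort is again a placeholder that de-duplicates and orders alphabetically.

```python
class WaitForAllNode:
    """Steps (62a)-(62d): return upstream once, after every child has replied."""

    def __init__(self, local_results, children):
        self.local = list(local_results)   # (62a) saved local results
        self.pending = set(children)       # children that have not replied yet
        self.received = []

    def on_child_results(self, child, results):
        """(62b)-(62d): return the aggregated list when complete, else None."""
        self.pending.discard(child)
        self.received.extend(results)
        if self.pending:
            return None                    # (62c): keep waiting
        # (62d): placeholder aggregation sort over local and child results.
        return sorted(set(self.local + self.received))
```

The sketch has no timeout, so one unresponsive child would block the node indefinitely; the patent text does not address that case, and a practical implementation would probably need a deadline.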
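Embodiment 11:
As mentioned above, the end-of-forwarding condition in step (4) is set in order to terminate the overall search flow and prevent the search request from being passed on indefinitely. It can be implemented in various ways according to design needs. This embodiment gives one specific end-of-forwarding condition.
The end-of-forwarding condition of this embodiment is a limit on the number of forwarding levels: after the original search node issues the search request, forwarding stops once the request has passed through a specified number of levels of search nodes. Specifically:
Step (1) is changed to: a user initiates a search request from an original search node and sets the number of forwarding levels for the search request;
Step (4) is changed to: a search node that receives the search request responds to it and determines whether the forwarding level limit has been reached; if not, the node is deemed an intermediate search node and step (5) is performed; if so, the node is deemed a final search node and step (7) is performed.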
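A forwarding level limit behaves much like a hop counter carried in the request. The sketch below assumes the counter is decremented at each receiving node; the patent only states that a level limit is set in step (1) and checked in step (4), so the `SearchRequest` type and the countdown are illustrative choices.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class SearchRequest:
    keyword: str
    levels_left: int   # forwarding levels still allowed (assumed representation)


def new_request(keyword, max_levels):
    """Modified step (1): the original node sets the forwarding level limit."""
    return SearchRequest(keyword, max_levels)


def on_receive(request):
    """Modified step (4): return (role, request to forward or None)."""
    remaining = request.levels_left - 1
    if remaining <= 0:
        return "final", None
    return "intermediate", replace(request, levels_left=remaining)
```

With `new_request("p2p", 2)`, nodes one level from the origin act as intermediate nodes and nodes two levels away act as final nodes.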
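Embodiment 12:
Likewise, this embodiment gives another end-of-forwarding condition.
The end-of-forwarding condition of this embodiment is a limit on the number of search results along a branch: an upper limit is set on the total number of search results found by all search nodes on the branch from the original search node to a given search node, and once that limit is reached the branch stops forwarding the search request. Specifically:
Step (1) is changed to: a user initiates a search request from an original search node and sets the upper limit on the number of branch search results for the search request;
Step (4) is changed to: a search node that receives the search request responds to it and determines whether the cumulative number of search results found along the branch from the original search node to this search node has reached the upper limit; if not, the node is deemed an intermediate search node and step (5) is performed; if so, the node is deemed a final search node and step (7) is performed.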
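The branch result limit can be sketched as a running total that travels with the request. The function below assumes that a node counts its own local results before deciding its role, which is one plausible reading of "cumulative results along the branch up to this node"; the function name and signature are hypothetical.

```python
def classify_node(found_upstream, found_here, limit):
    """Modified step (4): decide the node's role from the branch total.

    found_upstream: results found by earlier nodes on this branch
    found_here:     results found by this node (assumed to be counted too)
    limit:          upper limit set by the original node in step (1)
    Returns (role, cumulative count to carry downstream).
    """
    total = found_upstream + found_here
    role = "final" if total >= limit else "intermediate"
    return role, total
```

Unlike the level limit of Embodiment 11, this condition lets sparse branches extend further than branches that find many results early.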
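Embodiment 13:
This embodiment builds on the seventh embodiment by adding to the Internet search method the saving of completed search results, which is used to simplify the search flow for subsequent identical search requests. It is implemented on the Internet search system of the fourth embodiment, as follows:
Before each search node, in steps (2), (6), and (7), responds to the search request by searching its local index library, the method further includes the following step:
(A) the search controller of the search node checks the search result library for search results corresponding to the search request; if there are any, the index library search is skipped and the results in the search result library are used as this node's search results; if there are none, the index library search is performed;
After step (8), the method further includes the following step:
(9A) the original search node sends the aggregated and sorted search results obtained in step (8) to each search node that took part in the search request; each search node stores the aggregated and sorted search results in its local search result library.
Of course, in this step each search node may store the aggregated and sorted results selectively and need not store all of them.
With this design, search nodes avoid unnecessary repeated searches, and the search flow is simplified.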
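Steps (A) and (9A) describe a per-node result cache, which can be sketched as follows. The class name, the `keep` parameter for selective storage, and the lookup counter are illustrative assumptions.

```python
class CachingNode:
    """Search node with a search result library in front of its index library."""

    def __init__(self, index_library):
        self.index_library = index_library   # keyword -> list of URLs
        self.result_library = {}             # keyword -> cached final results
        self.index_lookups = 0               # counts real index searches

    def retrieve(self, keyword):
        """Step (A): use cached results if present, else search the index."""
        if keyword in self.result_library:
            return list(self.result_library[keyword])
        self.index_lookups += 1
        return list(self.index_library.get(keyword, []))

    def store_final_results(self, keyword, results, keep=None):
        """Step (9A): store the aggregated results, optionally only the first keep."""
        self.result_library[keyword] = list(results[:keep])
```

The sketch never expires cached entries. The patent does not discuss staleness, so how long a cached result stays valid is left open here as well.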
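Embodiment 14:
This embodiment builds on the seventh, ninth, and tenth embodiments by adding to the Internet search method the saving of user feedback on completed search results, which is used as a reference to influence the aggregation and sorting of subsequent search requests. It is implemented on the Internet search system of the fifth embodiment, as follows:
When each search node aggregates and sorts search results in steps (8), (61b), and (62d), the method further includes the following step:
(B) the search controller of the search node retrieves the user feedback information from the feedback information library and feeds it to the aggregation sorting module as a parameter, to assist the module in aggregating and sorting the search results;
After step (8), the method further includes the following step:
(9B) the user gives feedback on the aggregated and sorted search results, and the original search node sends the feedback to each search node that took part in the search request; each search node stores the feedback in its local feedback information library.
With this design, the aggregation and sorting results of the aggregation sorting module can be optimized.
Embodiment 15: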
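This embodiment builds on the seventh, ninth, and tenth embodiments by adding to the Internet search method a mechanism for recording how frequently the user uses web pages, which is used as a reference to influence the aggregation and sorting of subsequent search requests. It is implemented on the Internet search system of the sixth embodiment, as follows:
When each search node aggregates and sorts search results in steps (8), (61b), and (62d), the method further includes the following step:
(C) the search controller of the search node retrieves the user's page browsing frequency information from the user usage frequency library and feeds it to the aggregation sorting module as a parameter, to assist the module in aggregating and sorting the search results;
After step (8), the method further includes the following step:
(9C) the search node sends the user's page browsing frequency information to each search node that took part in the search request; each search node stores that information in its local user usage frequency library.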
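Steps (B) and (C) both feed stored per-user signals into the aggregation sorting module as extra parameters. The sketch below shows one hypothetical way to combine them: a base relevance score per URL plus weighted feedback and visit-frequency terms. The linear formula and the two weights are assumptions; the patent leaves the sorting rule unspecified.

```python
def aggregate_sort(base_scores, feedback=None, visit_frequency=None,
                   feedback_weight=1.0, frequency_weight=0.1):
    """Order URLs by base score adjusted with feedback and visit frequency.

    base_scores:     URL -> relevance score from retrieval
    feedback:        URL -> feedback value from the feedback information library
    visit_frequency: URL -> visit count from the user usage frequency library
    """
    feedback = feedback or {}
    visit_frequency = visit_frequency or {}

    def final_score(url):
        return (base_scores[url]
                + feedback_weight * feedback.get(url, 0)
                + frequency_weight * visit_frequency.get(url, 0))

    return sorted(base_scores, key=lambda url: (-final_score(url), url))
```

Called without the optional arguments, the function reduces to sorting by base score, which corresponds to the plain aggregation sort of the seventh embodiment.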
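With this design, the aggregation and sorting results of the aggregation sorting module can better match users' expectations.
In summary, the Internet search system and method based on a peer-to-peer network of the present invention provide a search terminal at each search node so that the nodes form a peer-to-peer network. The search system sends the search request to other search nodes level by level through the message publishing and return mechanism of the peer-to-peer network. Each search node responds to the search request by searching the index library stored locally at that node; the search results of all search nodes are finally collected at the original search node, which aggregates and sorts them. This system structure and search method break completely with the architecture of existing search systems and provide an entirely new search system architecture, thoroughly solving the workload problem of search engine servers in the prior art. Any non-inventive modification made by a person of ordinary skill in the art under this design concept shall be deemed to fall within the scope of protection of the present invention.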

Claims

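1. An Internet search system based on a peer-to-peer network, characterized in that: the search system consists of a number of peer search nodes; the search nodes are connected through the Internet to form a peer-to-peer network; a search terminal is provided in each search node;
the search terminal comprises: a network interface, a search controller, a search storage space, a retrieval module, and an aggregation sorting module;
the network interface is connected to the search controller, and the search terminal is connected to the Internet through the network interface;
the search storage space comprises an index library and a peer-to-peer link library; the index library stores index information of web pages; the peer-to-peer link library stores link information between this search node and its neighboring search nodes; the peer-to-peer link library is directly connected to the search controller; the index library is connected to the search controller through the retrieval module;
the search storage space is further provided with a search result library; the search result library stores the results of searches carried out through this search node;
the retrieval module is controlled by the search controller, searches the index library according to the keywords of a user's search request, and sends the search results to the aggregation sorting module;
the aggregation sorting module is connected to the search controller and to the retrieval module, and aggregates and sorts the search results sent by the retrieval module.
2. The Internet search system according to claim 1, characterized in that: a web page buffer and an indexing module are further provided in the search terminal;
the web page buffer is connected to the network interface and stores data about the web pages browsed by the user;
the indexing module is connected to the web page buffer and to the index library; the indexing module is controlled by the search controller, extracts index information of web pages from the web page buffer, and sends the index information to the index library.
3. The Internet search system according to claim 1, characterized in that: at least one network server is further provided in the peer-to-peer network; the network server comprises: a network interface, a server controller, and a search node link library; the search node link library stores link information for a number of search nodes; the server controller is connected to the search node link library on one side and to the Internet, through the network interface, on the other.
4. The Internet search system according to claim 1, characterized in that: a feedback information library is further provided in the search storage space; the feedback information library stores the user's feedback on completed search results.
5. The Internet search system according to claim 1, characterized in that: a user usage frequency library is further provided in the search storage space; the user usage frequency library stores information on how frequently the user browses web pages.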
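6. An Internet search method based on a peer-to-peer network, implemented on the Internet search system according to claim 1, characterized by comprising the following steps:
(1) a user initiates a search request from an original search node;
(2) the original search node responds to the search request, and its search controller directs the retrieval module to search the index library;
(3) the search controller of the original search node retrieves the link information in the local peer-to-peer link library and forwards the search request to the search nodes connected to the original search node;
(4) a search node that receives the search request responds to it and determines whether it satisfies the end-of-forwarding condition; if not, the node is deemed an intermediate search node and step (5) is performed; if so, the node is deemed a final search node and step (7) is performed;
(5) the search controller of the intermediate search node retrieves the link information in the local peer-to-peer link library and forwards the search request to the search nodes connected to the intermediate search node;
(6) the intermediate search node responds to the search request, its search controller directs the retrieval module to search the index library, and the node returns its search results, together with the search results returned by its lower-level search nodes, to its upper-level search node; the method then returns to step (4);
(7) the final search node responds to the search request, its search controller directs the retrieval module to search the index library, and the node returns the search results to its upper-level search node;
(8) the search controller of the original search node directs the aggregation sorting module to aggregate and sort the search results returned by the other search nodes together with the results found in the local index library, and sends the aggregated and sorted results to the user, completing the search.
7. The Internet search method according to claim 6, implemented on the Internet search system according to claim 2, characterized in that: the index library of each search node in steps (2), (6), and (7) is updated through the following steps:
(a1) data about the web pages browsed by the user is stored in the web page buffer;
(a2) the search controller of the search node directs the indexing module to read the web page data from the web page buffer and extract the page's index information from it;
(a3) the search controller stores the index information in the index library.
8. The Internet search method according to claim 6, characterized in that step (6) comprises the following specific steps:
(61a) the intermediate search node responds to the search request, its search controller directs the retrieval module to search the index library, and the search results are saved;
(61b) when the intermediate search node receives search results returned by a lower-level search node, its search controller directs the aggregation sorting module to re-aggregate and re-sort the saved search results together with the newly returned search results;
(61c) the search controller checks whether the re-aggregated and re-sorted search results have changed the previously saved search results; if so, the updated search results are returned to the upper-level search node; if not, step (61d) is performed directly;
(61d) the intermediate search node waits for lower-level search nodes to return search results; when a lower-level search node returns search results, the method returns to step (61b).
9. The Internet search method according to claim 6, characterized in that step (6) comprises the following specific steps:
(62a) the intermediate search node responds to the search request, its search controller directs the retrieval module to search the index library, and the search results are saved;
(62b) the intermediate search node receives search results returned by a lower-level search node and determines whether all of its lower-level search nodes have returned search results; if so, step (62d) is performed; if not, step (62c) is performed;
(62c) the intermediate search node waits for lower-level search nodes to return search results; when a lower-level search node returns search results, the method returns to step (62b);
(62d) the search controller of the intermediate search node directs the aggregation sorting module to aggregate and sort the locally saved search results together with the search results returned by all lower-level search nodes, and returns the aggregated and sorted results to its upper-level search node.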
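10. The Internet search method according to claim 6, characterized in that: the end-of-forwarding condition in step (4) is a limit on the number of forwarding levels;
step (1) is changed to: a user initiates a search request from an original search node and sets the number of forwarding levels for the search request;
step (4) is changed to: a search node that receives the search request responds to it and determines whether the forwarding level limit has been reached; if not, the node is deemed an intermediate search node and step (5) is performed; if so, the node is deemed a final search node and step (7) is performed.
11. The Internet search method according to claim 6, characterized in that: the end-of-forwarding condition in step (4) is a limit on the number of search results along a branch;
step (1) is changed to: a user initiates a search request from an original search node and sets the upper limit on the number of branch search results for the search request;
step (4) is changed to: a search node that receives the search request responds to it and determines whether the cumulative number of search results found along the branch from the original search node to this search node has reached the upper limit; if not, the node is deemed an intermediate search node and step (5) is performed; if so, the node is deemed a final search node and step (7) is performed.
12. The Internet search method according to claim 6, implemented on the Internet search system according to claim 1, characterized in that:
before each search node, in steps (2), (6), and (7), responds to the search request by searching its local index library, the method further includes the following step:
(A) the search controller of the search node checks the search result library for search results corresponding to the search request; if there are any, the index library search is skipped and the results in the search result library are used as this node's search results; if there are none, the index library search is performed;
after step (8), the method further includes the following step:
(9A) the original search node sends the aggregated and sorted search results obtained in step (8) to each search node that took part in the search request; each search node stores the aggregated and sorted search results in its local search result library.
13. The Internet search method according to claim 6, 8, or 9, implemented on the Internet search system according to claim 4, characterized in that: when each search node aggregates and sorts search results, the method further includes the following step:
(B) the search controller of the search node retrieves the user feedback information from the feedback information library and feeds it to the aggregation sorting module as a parameter, to assist the module in aggregating and sorting the search results;
after step (8), the method further includes the following step:
(9B) the user gives feedback on the aggregated and sorted search results, and the original search node sends the feedback to each search node that took part in the search request; each search node stores the feedback in its local feedback information library.
14. The Internet search method according to claim 6, 8, or 9, implemented on the Internet search system according to claim 5, characterized in that: when each search node aggregates and sorts search results in steps (8), (61b), and (62d), the method further includes the following step:
(C) the search controller of the search node retrieves the user's page browsing frequency information from the user usage frequency library and feeds it to the aggregation sorting module as a parameter, to assist the module in aggregating and sorting the search results;
after step (8), the method further includes the following step:
(9C) the search node sends the user's page browsing frequency information to each search node that took part in the search request; each search node stores that information in its local user usage frequency library.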
PCT/CN2011/078886 2010-08-27 2011-08-25 基于点对点网络的互联网搜索系统和方法 WO2012025052A1 (zh)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US13/776,552 US9330177B2 (en) 2010-08-27 2013-02-25 System, method and device for internet search based on peer-to-peer network

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201010264908.5 2010-08-27
CN201010264908.5A CN101916295B (zh) 2010-08-27 2010-08-27 基于点对点网络的互联网搜索系统和方法

Related Child Applications (1)

Application Number Title Priority Date Filing Date
US13/776,552 Continuation-In-Part US9330177B2 (en) 2010-08-27 2013-02-25 System, method and device for internet search based on peer-to-peer network

Publications (1)

Publication Number Publication Date
WO2012025052A1 (zh) 2012-03-01

Family

ID=43323807

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2011/078886 WO2012025052A1 (zh) 2010-08-27 2011-08-25 基于点对点网络的互联网搜索系统和方法

Country Status (3)

Country Link
US (1) US9330177B2 (zh)
CN (1) CN101916295B (zh)
WO (1) WO2012025052A1 (zh)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9330177B2 (en) 2010-08-27 2016-05-03 Dong Software Studio Beijing System, method and device for internet search based on peer-to-peer network

Families Citing this family (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102567376A (zh) * 2010-12-16 2012-07-11 中国移动通信集团浙江有限公司 一种个性化搜索结果推荐方法和装置
CN102156733A (zh) * 2011-03-25 2011-08-17 清华大学 一种基于面向服务架构的搜索引擎及搜索方法
US9143580B2 (en) * 2012-07-13 2015-09-22 International Business Machines Corporation Brokering and provisioning in high-speed networks
US10387546B1 (en) * 2013-06-07 2019-08-20 United Services Automobile Association Web browsing
US20150213010A1 (en) * 2014-01-30 2015-07-30 Sage Microelectronics Corp. Storage system with distributed data searching
CN110602223B (zh) * 2015-06-15 2021-10-15 展讯通信(上海)有限公司 点对点设备及其搜索匹配方法
CN106951434B (zh) * 2017-02-06 2020-03-10 广东神马搜索科技有限公司 一种用于搜索引擎的搜索方法、装置及可编程设备
CN106951181A (zh) * 2017-02-21 2017-07-14 深圳大普微电子科技有限公司 一种数据存储系统的控制装置
AU2017414413B2 (en) * 2017-05-18 2021-01-28 Honor Device Co., Ltd. Search method and apparatus
CN112307289A (zh) * 2019-07-30 2021-02-02 王中华 搜索方法、节点及存储介质
CN113590884A (zh) * 2020-04-30 2021-11-02 华为技术有限公司 一种分布式数据搜索方法和索引文件的共享方法
CN112416929A (zh) * 2020-11-17 2021-02-26 四川长虹电器股份有限公司 基于mysql和java的检索库管理及数据检索方法

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101184016A (zh) * 2007-12-10 2008-05-21 华中科技大学 混杂式对等网络环境下的通用资源管理方法
CN101375578A (zh) * 2006-01-27 2009-02-25 西门子公司 用于传输消息的方法、网络节点和网络
CN101741844A (zh) * 2009-12-11 2010-06-16 北京工业大学 一种基于中心索引的p2p文件共享网络的对等节点发现方法

Family Cites Families (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7359904B2 (en) * 2002-06-14 2008-04-15 Integrated Knowledge Solutions, Inc. Method to efficiently process and present possible arrangements of a set of contiguous peer-to-peer links
US7596625B2 (en) * 2003-01-27 2009-09-29 Microsoft Corporation Peer-to-peer grouping interfaces and methods
US8095500B2 (en) * 2003-06-13 2012-01-10 Brilliant Digital Entertainment, Inc. Methods and systems for searching content in distributed computing networks
WO2006096838A1 (en) * 2005-03-09 2006-09-14 Medio Systems, Inc. Method and system for content search with mobile computing devices
JP4756953B2 (ja) * 2005-08-26 2011-08-24 富士通株式会社 情報検索装置および情報検索方法
CN1858733B (zh) * 2005-11-01 2012-04-04 华为技术有限公司 信息检索系统和检索方法
US20070112758A1 (en) * 2005-11-14 2007-05-17 Aol Llc Displaying User Feedback for Search Results From People Related to a User
EP2210188A1 (en) * 2007-11-05 2010-07-28 Limelight Networks, Inc. End to end data transfer
CN101667179B (zh) * 2008-09-03 2012-08-15 华为技术有限公司 移动搜索方法及其系统、搜索服务器同步元索引的方法
US8924460B2 (en) * 2008-12-19 2014-12-30 International Business Machines Corporation Method and system of administrating a peer-to-peer file sharing network
CN101639381B (zh) * 2009-08-14 2011-05-11 东南大学 红外焦平面读出电路分阶段背景抑制方法
CN101916295B (zh) 2010-08-27 2011-12-14 董方 基于点对点网络的互联网搜索系统和方法
US8402375B1 (en) * 2011-09-19 2013-03-19 Google Inc. System and method for managing bookmark buttons on a browser toolbar

Also Published As

Publication number Publication date
CN101916295B (zh) 2011-12-14
CN101916295A (zh) 2010-12-15
US20140244609A1 (en) 2014-08-28
US9330177B2 (en) 2016-05-03

Similar Documents

Publication Publication Date Title
WO2012025052A1 (zh) 基于点对点网络的互联网搜索系统和方法
Banavar et al. An efficient multicast protocol for content-based publish-subscribe systems
US20170310596A1 (en) Load distribution in data networks
CN102075409A (zh) 请求消息处理方法、系统及负载均衡器设备
CN110417916A (zh) 能够回馈收益的内容分发方法、中心节点及边缘节点
CN102970242B (zh) 一种实现负载均衡的方法
EP2454656A2 (en) Methods and apparatus for efficiently processing multiple keyword queries on a distributed network
Yao et al. Artificial intelligence for information-centric networks
WO2010127618A1 (zh) 一种实现流媒体内容服务的系统和方法
CN101764807A (zh) 基于元搜索引擎的多源下载互联网资源的装置及方法
Von der Weth et al. Multiterm keyword search in NoSQL systems
Liu et al. An efficient selection algorithm for building a super-peer overlay
CN112860799A (zh) 一种分布式数据库数据同步的管理方法
CN102769668B (zh) 基于近似匹配的发布/订阅负载均衡方法
JP5818263B2 (ja) データの分散管理システム及び装置及び方法及びプログラム
CN102377826B (zh) 一种对等网络中冷门资源索引的优化放置方法
CN114513449A (zh) 一种域内路由选择优化方法及系统
Yang et al. A reinforcement learning based data storage and traffic management in information-centric data center networks
CN101895550B (zh) 一种应用于互联网网站的兼容动静态内容的缓冲加速方法
CN101494664B (zh) 基于jxta的p2p推式网络存储方法
Hao et al. Efficient resource search mechanism in selfish mobile peer-to-peer network
Liu et al. Design and optimization for distributed indexing scheme in switch-centric cloud storage system
CN113810313B (zh) 分布式会话报文的处理方法及处理装置
CN112506955B (zh) 一种查询处理方法、计算机设备及存储介质
KR101040184B1 (ko) 피어 투 피어 네트워크의 경로 검색 방법

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 11819427

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 11819427

Country of ref document: EP

Kind code of ref document: A1