CN102354313B - Conceptive method and system for organizing and expressing information - Google Patents

Conceptive method and system for organizing and expressing information Download PDF

Info

Publication number
CN102354313B
CN102354313B CN 201110282837 CN201110282837A CN102354313B CN 102354313 B CN102354313 B CN 102354313B CN 201110282837 CN201110282837 CN 201110282837 CN 201110282837 A CN201110282837 A CN 201110282837A CN 102354313 B CN102354313 B CN 102354313B
Authority
CN
Grant status
Grant
Patent type
Prior art keywords
conceptive
method
system
organizing
expressing
Prior art date
Application number
CN 201110282837
Other languages
Chinese (zh)
Other versions
CN102354313A (en )
Inventor
A·柯蒂斯
A·莱文
A·杰拉索利斯
Original Assignee
Iac搜索和媒体公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Grant date

Links

Abstract

提供了一种对查询提供响应的方法和系统。 A method and system for providing a response to the query. 相同搜索会话期间发出的多个选择被联系。 Multiple choices issued during the same search session is linked. 从用户接收查询并且对应于所述查询提供搜索结果。 Receiving a query from a user and corresponding to provide search results to the query. 响应于用户发出的选择,提供一个或者多个联系的选择。 In response to a user selection issued, or a plurality of selected contact. 在本发明一个实施例中,搜索结果URL与一个或者多个查询相联系,其中所述URL的选择和所述查询包含在搜索会话中。 In one embodiment of the present invention, the search result URL associated with one or more queries, and wherein said selection of the URL included in the search query session. 响应于查询,提供包含一个或者多个URL和与各个URL联系的任何查询的搜索结果。 In response to inquiries, it contains one or more query URL and any URL associated with each search result.

Description

概念上组织和表述信息的方法和系统 The method of organization and presentation of information on the concepts and systems

[0001] 本申请是申请号为200480035838.9、申请日为2004年12月7日、名称为“概念上组织和表述信息的方法和系统”的中国发明专利申请的分案申请。 [0001] This application is Application No. 200480035838.9, filed on December 7, 2004, a divisional application entitled "organization and presentation of information on the concept of methods and systems," the Chinese invention patent applications.

[0002] 优先权要求 [0002] PRIORITY CLAIM

[0003] 本申请涉及并且要求2003年12月8日提交的临时申请号60/528,139的优先权,其内容作为引用结合于此。 [0003] This application relates to and claims priority to provisional application No. 60 / 528,139 in 2003, filed December 8, the contents of which are incorporated herein by reference.

[0004] 相关申请 [0004] RELATED APPLICATIONS

[0005] 本申请涉及2004年5月24号提交的名称为“METHODS AND SYSTEMS FORCONCEPTUALLY ORGANIZING AND PRESENTING INFORMATION”的美国专利申请,其内容作为弓I [0005] The present application is related to May 24, 2004, filed as "METHODS AND SYSTEMS FORCONCEPTUALLY ORGANIZING AND PRESENTING INFORMATION" US patent application, the contents of which I bow

用结合于此。 With incorporated herein.

技术领域 FIELD

[0006] 本发明的实施例一般的涉及概念上的组织信息的领域,并且尤其涉及概念相关信息的使用分析以有效组织信息。 [0006] Example organizational information on the general concept of the embodiment of the present invention relates to the art, and in particular it relates to the use of the concept of analytical information to organize information effectively.

背景技术 Background technique

[0007] 随着信息的迅速增长,组织信息的能力也在增长。 [0007] With the rapid growth of information, the ability to organize information is also growing. 在互联网相关的网络(例如万维网)或者其他互联网源上可以找到大量信息源。 In the Internet-related networks (eg World Wide Web) can find plenty of information sources on the Internet or other sources. 互联网是计算机网络的扩展网络,信息通过本领域技术人员公知的方法(例如TCP和IP协议的使用等等)而在互联网上交换。 The Internet is a network of computer networks expand, the skilled person information known methods (e.g., TCP and IP protocols, etc.) exchanged on the Internet. 互联网允许用户在连接到该网络的计算机之间发送和接收数据。 Internet allows users to send and receive data between computers connected to the network. 这些数据可以包括网站、主页、数据库、文本集合、音频、视频或者通过连接到互联网的计算机服务器在互联网上可用的任何其他类型的信息。 These data may include web site home page, a database, a collection of text, any other type of information of audio, video or computer connected to the Internet through servers that are available on the Internet. 这些信息可以被称为文件或者文档,并且可以包括网页、网页上的数据、网页附件或者存储设备(例如数据库)中包含的其他数据。 The information may be referred to as a file or document, and may include a web page, the data on the web or attachment or storage device (e.g. a database) contained in the other data.

[0008] 理解如此大量文档集合的意义并且在这种环境中搜索信息,在没有专门的辅助手段时是很困难的。 [0008] understand the significance of such a large number of documents and search for information collection in this environment, in the absence of specific aid is very difficult. 一种辅助定位信息的方法是使用关键词。 A method for positioning auxiliary information is to use keywords. 也就是说,文档可以包括表示包含在文档中的信息的选定部分的关键词。 In other words, the document may include keyword represents a selected portion of the information contained in the document. 这些关键词在互联网上对其他计算机是可用的并且允许其他计算机定位该文档。 These keywords on the Internet to other computers are available and allow other computers to locate the document.

[0009] 为了定位互联网上的文档,远程计算机的用户使用被称为搜索引擎的搜索程序而搜索关键词。 [0009] In order to locate the document on the Internet, users of the remote computer using the search program is called search engine and keyword search. 搜索引擎是允许远程用户键入一个或多个搜索词的程序。 Search engine that allows remote users to type one or more search terms in the program. 然后搜索引擎将搜索查询与文档中的关键词相比较并且至少检索文档中具有与搜索查询匹配的关键词的部分。 Then the search engine will search query is compared with the document keywords and retrieve documents having at least match the search query keywords section. 然后搜索引擎对用户显示部分文档,例如标题。 Then search engine users portions of the document, such as the title. 用户可以在检索到的局部文档中滚动浏览并且选择所需文档。 Users can scroll through retrieved partial document and select the desired document.

[0010] 早期的关键词搜索引擎显示出严重缺陷。 [0010] The early keyword search engine shows serious flaws. 例如,为了增加特定文档的出现率,文档提供者可以使用与文档相关的尽可能多的搜索词。 For example, to increase the appearance of a particular document, document providers can use as many search terms related to the document. 实际上,某些文档或者搜索引擎使用文档中的每个词作为关键词。 Indeed, some documents or search engines use each word in the document as a keyword. 因此,搜索引擎会检索到大量与用户需要通过搜索词组合寻找的主题无关或者仅边缘相关的文档。 Therefore, the search engine will retrieve a large number of user needs through a combination of search terms to find a topic unrelated or only marginally relevant documents. 并且,这些搜索引擎的很多用户并不熟悉形成关键词搜索查询的技巧并且产生过于宽泛的搜索从而经常检索到数千个文档。 Moreover, these search engine techniques many users are not familiar with keyword search query form and produce overly broad search thousands of documents retrieved so often. 那么用户必须检查关于各个文档的摘要信息以定位所需信息。 Then the user must check the summary information about each document to locate the desired information.

[0011] 这种缺陷通过搜索引擎的演变得以解决,即包括基于一个或多个用户的搜索活动的信息组织。 [0011] This defect by the evolution of search engines can be solved, including information organization that is based on one or more of the user's search activity. 这些方法基于用户偏好度多数意见而不是面向文档的参数(例如文本)而对结果排序。 These methods and sort the results based on user preference for the opinion of the majority rather than the parameters of the document (such as text). 其中一种这样的方法根据基于所使用的关键词的演化得分而对文档排序。 One such method according to sort documents based on keywords used Evolution score. 也就是说,文档接收与搜索查询的关键词相关的相关度分数。 In other words, the received document relevance scores related to the keyword search query. 随着用户输入搜索查询并且在查询产生的文档列表中选择文档时,文档的相关度分数被调整。 As the user enters a search query and select the document, the document relevance scores are adjusted in the list of documents queries generated. 这些分数被用于组织文档的结果列表以进行后续搜索。 The results of these scores are used to organize the document list for subsequent searches. 这些方法典型的(至少在部分上)基于文档接收到的“点击”次数(即文档被选择的次数)而确定相关度。 These methods typically (at least in part) on the received document to "click" times (i.e., the number of times the selected document) and determine the degree of correlation. 这些方法通常称为“人气排序方法”或者“点击人气方法”,提供了将最吸引并且满足最大多数先前用户的文档排列在最前面的搜索结果列表。 These methods are commonly referred to as "popular sorting method" or "click method popular" provides the most attractive and most satisfying search results most users' documents previously arranged on top of. 而且,点击人气方法产生反映搜索上下文的结果。 Moreover, the method produces results reflect the popular click search context. 例如,先前的搜索方法会返回包含所有查询词的文档,但是不会自动排除不是查询部分的词。 For example, previous search method will return documents that contain all query terms, but does not automatically exclude part of the query words. 因此,对于“Mexico”的文本匹配搜索最有可能返回关于“New Mexico”的结果。 Thus, for "Mexico" text-matching search is most likely to return results for "New Mexico" is. 点击人气方法可以减少这种错误结果,因为搜索“Mexico”的用户通常不会点击关于“New Mexico”的页面并且会倾向于点击他们认为与“Mexico”最相关的页面,从而增加了所需文档的相关度。 Click the popularity of this method can reduce the error results, because users searching for "Mexico" is not usually the page, click on "New Mexico" and will tend to click on the page they think are most relevant and "Mexico", thereby increasing the required documents relevance.

[0012] 由于与查询词相关的信息随着时间可能改变,基于点击次数确定相关度可能导致错误结果。 [0012] Since the information relevant to the query terms may change over time, based on the number of clicks to determine the degree of correlation may lead to erroneous results. 例如,对于“民主先驱”的特定查询,涉及早期先驱Howard Dean的文档可能在2003年12月被选择了很多次,但是2004年3月输入该查询的用户可能期望在此时作为领袖的John Kerry的结果。 For example, for "democracy pioneer" of a specific query involving an early pioneer Howard Dean document he may be selected many times in December 2003, but entered the query in March 2004 as a user might expect at this time leader John Kerry the result of. 并且,排在最前面的结果通常会受到不成比例的更多的使用,这样导致了越来越歪曲的搜索结果,其中排在最前面的结果永远不会被替代。 And, at the top of the results usually will be more use of disproportionate, this has led to the search results more and more distorted, which at the top of the results will never be replaced.

[0013] 通过使用包括基于时间和基于使用的因素在内的大量因素对响应于查询提供的信息进行组织的搜索引擎已经解决了这些缺陷中的一部分。 [0013] including through the use of the information provided in response to a query based on time and organized a number of factors based on factors including the use of search engines have solved part of these deficiencies. 例如,这种方法可能使用先前用户响应于特定查询的活动而调整查询响应文档的相关度。 For example, this method may use the previous user in response to a specific query query response activities related to the adjustment of the document. 这样的用户活动可以包括点击次数以及先前用户对特定信息的选择或者使用的时间。 Such activities may include the user clicks and the time previously selected by the user for specific information or use. 这些方法还可以将特定文档在先前用户对其选择时排列在先前结果列表中何处、文档的实际和期望使用频率的对比以及选定的文档如何被使用等考虑在内。 These methods may also be arranged in a particular document the results of previous comparison and the actual frequency where the wish list, and the selected document a document of how it is used and the like into consideration when the previous user selection thereof.

[0014] 然而,在当前方法中仍然存在大量缺陷。 [0014] However, there is still a lot of flaws in the current method. 例如,当前方法并没有解决非常稀少的查询的问题,其中并没有汇集足够的用户活动数据。 For example, the current method does not solve the problem very rare queries, which did not have enough collection of user activity data. 在此情况下,结果可能很少或者不存在。 In this case, the result may have little or no presence. 并且,点击结果依赖于数据源的质量和完整性。 And, dependent on the result of clicking the quality and integrity of the data source. 当前方法不能解决数据源质量的巨大差异。 The current method does not resolve the huge difference in the quality of the data source. 而且,当前方法受到欺骗影响,可能影响搜索结果的完整性。 Moreover, the current method being deceived influence that may affect the integrity of the search results. 根据现有技术的一种示例方法试图通过在用户活动的基础上更新搜索引擎结果而解决某些问题。 According to one exemplary method of the prior art attempts to update the search engine results based on user activity on and solve some problems. 这种方法在名称为“Search Engine”的美国专利N0.6,421,675中进行了描述,其内容作为引用而结合于此,从而提供了对现有技术的全面描述并且明确区分本发明各个实施例的特征。 This approach is in U.S. Patent No. N0.6,421,675 entitled "Search Engine" in the description, the contents of which are incorporated herein by reference, to provide a complete description of the prior art and the present invention specifically distinguish features of embodiment.

发明内容 SUMMARY

[0015] 本发明提供了一种方法,该方法包括:接收并记录查询,所述查询接收自多个用户;确定对应于所述查询的搜索结果;接收并记录所述多个用户的一个或多个搜索结果选择;将所述选择与所记录的查询相关联;从用户处接收包含所述查询的信号;以及响应于所述查询而将包含与所述查询关联的选择的一个或多个搜索结果提供给所述用户。 [0015] The present invention provides a method comprising: receiving and recording a query, the query received from a plurality of users; determining search results corresponding to the query; receive and record a plurality of the user or selecting a plurality of search results; the query associated with the selected recorded; receiving a signal containing the query from the user; and a response to the query comprising the query associated with the selected one or more of search result to the user.

[0016] 本发明还提供了一种系统,该系统包括:服务器数字处理系统(DPS),其中该DPS能够从多个用户接收查询、响应于所述查询而将一个或多个搜索结果提供给所述用户,其中每个用户能够选择所提供的搜索结果中的至少一个,所述DPS还能够接收并记录包含所述用户的一个或多个搜索结果选择的信号、将所记录的搜索结果选择与所记录的查询相关联、接收所述查询、以及响应于所述查询而提供一个或多个包含与所述查询关联的搜索结果的搜索结果;以及能够与所述服务器DPS通信的一个或多个客户端DPS,其中所述通信包括将查询以及搜索结果选择中的至少一者提供给所述服务器DPS以及接收一个或多个搜索结果。 [0016] The present invention also provides a system comprising: a server digital processing system (DPS), DPS which is capable of receiving the query from a plurality of users in response to the query and one or more search results to the user, wherein at least one of the DPS is also capable of receiving and recording the signal containing said user selecting one or more search results for each user can select the provided search results, the search results for the selected recorded associated with the query the recorded receiving the query, the query and to provide one or more of the search results contains search results associated with the query response; and the ability to communicate with a server or the DPS client DPS, wherein said communication comprises a query and a search result providing at least one selection to the server DPS and receiving one or more search results.

[0017] 本发明还提供了一种系统,该系统包括:服务器数字处理系统(DPS),其中该DPS能够从多个用户接收第一查询、确定对应于所述第一查询的多个搜索结果,其中每个用户能够选择所提供的搜索结果中的至少一个,所述DPS还能够将所述多个搜索结果提供给所述用户、将所记录的搜索结果选择与所记录的特定选择相关联、接收查询、以及响应于所述查询而提供多个搜索结果,其中所述多个搜索结果包含了所述特定选择和关联的搜索结果选择;以及能够与所述服务器DPS通信的一个或多个客户端DPS,其中所述通信包括将查询以及特定选择和搜索结果选择中的至少一者提供给所述服务器DPS以及接收一个或多个搜索结果。 [0017] The present invention also provides a system comprising: a server digital processing system (DPS), wherein the DPS is capable of receiving a first query from a plurality of users, corresponding to the first determining a plurality of search query results wherein at least one of the plurality of the DPS is also possible to provide a search result to the user for each user can select the provided search results, the search results will be recorded and the recorded selection associated with the particular choice receiving a query, and in response to the query and provides a plurality of search results, wherein the plurality of search results contains the specific selection and the associated search result selection; DPS and the ability to communicate one or more server client DPS, wherein said communication comprises a query and the particular selection and at least one selected search result is supplied to the server DPS and receiving one or more search results.

附图说明 BRIEF DESCRIPTION

[0018] 通过参考下面的描述可以更好的理解本发明,并且利用附图表示本发明的实施例。 [0018] The present invention may be better understood by reference to the following description and drawings represent embodiments of using the present invention. 在附图中: In the drawings:

[0019] 图1为显示根据本发明一个实施例的基于来自多个用户的响应的关联而修改概念相关信息集合的组织和表述的程序的流程图; [0019] FIG. 1 is a flowchart showing the organization and presentation modified set of conceptually related information based on the association based on the response from a plurality of users according to one embodiment of the present invention, a display program;

[0020] 图2为显示根据本发明一个实施例的通过基于在其他用户的类似信息搜索会话期间获取的用户活动和/或用户信息的关联而影响用户查询与存储内容的联系的程序的流程图; [0020] FIG 2 is a flowchart of a query by a user to affect the stored content based on the associated other user acquired during a search session information of a similar user activities and / or link user information program of the present invention, one embodiment of a display ;

[0021] 图3为显示根据本发明一个实施例的用于组织并且表述概念相关信息以及用于联系特定存储内容和各个用户查询的系统的结构图; [0021] FIG. 3 is a conceptual representation and block diagram of a related information content and storing each specific contact system user query according to an embodiment of a tissue of the present invention;

[0022] 图4显示了根据本发明一个实施例的三个独立用户的搜索日志; [0022] Figure 4 shows three separate searches the user logs in accordance with one embodiment of the present invention;

[0023] 图5为显示根据本发明一个实施例的提供更多相关搜索结果的程序的流程图; [0023] FIG. 5 is a flowchart of a program more relevant search results according to a embodiment of the present invention display;

[0024] 图6为显示根据本发明一个实施例的提供相关查询建议的程序的流程图; [0024] FIG 6 is a flowchart of a embodiment of the present invention is related query suggestions display program;

[0025] 图6A为显示根据本发明一个实施例的提供查询拼写校正建议的程序的流程图; [0025] FIG 6A is a flowchart providing query spelling display device according to an embodiment of the present invention proposed a correction procedure;

[0026] 图7为显示根据本发明一个实施例的提供建议查询的程序的流程图; [0026] FIG. 7 is a flowchart illustrating a routine according to a query embodiment of the present invention proposes;

[0027] 图8为显示根据本发明一个实施例的提供类似结果的程序的流程图; [0027] FIG 8 is a flowchart of an embodiment to provide a similar result embodiment of the present invention, a display program;

[0028] 图9为显示根据本发明一个实施例的对信息进行相关以提供更相关的搜索结果的程序的流程图; [0028] FIG. 9 is a flowchart of a program according to the embodiment of the present invention to provide more relevant information related to a display of search results;

[0029] 图10为显示根据本发明一个实施例的根据与选择相关联的位置而基于用户位置提供更相关的搜索结果的程序的流程图; [0029] FIG. 10 is a flow chart providing more relevant search results in accordance with the selection according to a location associated with one embodiment of the present invention is based on the user's display position of the program;

[0030] 图11为显示根据本发明一个实施例的基于用户位置对用户提供更相关的搜索结果的程序的流程图;以及 [0030] FIG. 11 is a flowchart based on the user location to provide more relevant search results to the user program of the present invention, one embodiment of the display; and

[0031] 图12为显示根据本发明一个实施例的数字处理系统的结构图。 [0031] FIG. 12 is a block diagram of a digital processing system, according to one embodiment of the present invention. 具体实施方式 detailed description

[0032] [0032]

[0033] 本发明的实施例提供了用于概念上组织和表述信息的方法和系统,其中使用用户对信息组织和表述的响应的关联以确定信息的最优组织和表述。 Example [0033] The present invention provides a method and a system for the organization and presentation of information on the concept, wherein the information associated with the user of the organization and presentation of the response to determine an optimal organization and presentation of information. 在本发明一个实施例中,在搜索引擎结果优化的上下文中,搜索会话期间多个用户的用户活动和/或用户信息与查询进行关联,以影响查询与文档的组织和表述之间的演化联系。 In one embodiment of the present invention, in the context of search engine optimization results, a plurality of users during a search session user activity and / or user information associated with the query in order to influence the evolution of the link between the query and the organization and presentation of the document . 根据这些实施例的系统存储整个搜索会话过程中的用户活动,从而可以使大量不同类型的用户活动和用户信息可以关联。 The storage system user activity of these embodiments the entire search during the session, which can cause a number of different types of user activities and user information may be associated. 使用关联的用户输入允许这些系统提供相关的搜索结果而不会产生现有技术中基于关键词的系统带来的限制。 Associated with the user input allows the use of these systems to provide relevant search results without causing limitations of the prior art system based on keywords brought.

[0034] 在下面的描述中将给出大量特定细节。 [0034] numerous specific details are given in the following description. 然而,应当理解,本发明的实施例可以实施为不具有这些特定细节。 However, it should be understood that the embodiments of the present invention may be practiced without these specific details. 在其他实施例中,公知的电路、结构和技术不再详细描述以避免模糊对本说明书的理解。 In other embodiments, well-known circuits, structures and techniques have not been described in detail to avoid obscuring understanding of this description.

[0035] 整个说明书中提到的“一个实施例”或“实施例”表示结合该实施例描述的特定特征、结构或者特点包含在本发明的至少一个实施例中。 [0035] "one embodiment" or throughout the specification to "an embodiment" means that a particular feature of the described embodiment, structure, or characteristic included in at least one embodiment of the present invention. 因此,在整个说明书中多处出现的短语“在一个实施例中”或“在实施例中”并不一定全部是指同一实施例。 Thus, in various places throughout this specification appearances of the phrase "in one embodiment" or "in an embodiment" are not necessarily all referring to the same embodiment. 而且,可以在一个或多个实施例中以任何适当方式将特定特征、结构或者特点结合在一起。 Furthermore, one or more embodiments, a particular feature, structure, or characteristic in together in any suitable manner.

[0036] 而且,所发明的方面包含在少于单个公开的实施例的所有特征中。 [0036] Moreover, aspects of the invention comprise less than all features of a single disclosed embodiment. 因此,说明书后附的权利要求书被明确包含在本具体实施方式中,每项权利要求书自身作为本发明的一个单独实施例。 Accordingly, the appended claims appended specification are expressly included in the present embodiment, each claim as a separate embodiment of the present invention requires the book itself.

[0037] 程庄 [0037] Cheng Zhuang

[0038] 图1显示了根据本发明一个实施例的基于来自多个用户的响应的关联而修改概念相关信息集合的组织和表述的程序。 [0038] Figure 1 shows the organization and presentation of a modified set of conceptually related information based on the association based on the response from a plurality of users according to one embodiment of the present invention program.

[0039]图1中所示的程序100开始于操作105,其中概念相关信息集合通过信息提供者被组织并且被表述给多个用户。 The routine shown in [0039] FIG 1100 begins at operation 105, wherein the set of conceptually related information is organized by information providers and a plurality of users to be expressed. 在一个实施例中,概念相关信息集合是包含有关于特定用户组感兴趣的一般概念的信息的集合页面。 In one embodiment, a set of conceptually related information includes information about the general concept of interest to a particular group of users set page. 这种集合页面可以包括大量任何种类的相关子概念,包括与文件、目录、数据库、电子数据表、新闻条目、音频、视频、图像、应用程序、广告、产品描述以及参考信息的链接,与列表、表格、树、或者上述项目的任何目录的链接,以及与其他集合页面的链接,所有这些内容可以从任意数量的来源收集。 This page may include a collection of related sub-concepts number of any kind, including links to files, directories, databases, spreadsheets, news items, audio, video, images, applications, advertising, product descriptions, and reference information, and a list of link directory link any table, tree, or the above-mentioned projects, as well as with other sets of pages, all of which can be collected from any number of sources. 在一个实施例中,单独形成的多个集合页面创建为改变信息集合的组织和表述,包括相关子概念的数量、类型、安排和显著度。 In one embodiment, the set of pages to create a plurality of separately formed to change the organization and presentation of information set, including the number of relevant sub-concepts, types, arrangements, and saliency. 这样,信息提供者尝试预期组织和表述信息的最优方式。 Thus, the information providers try to the best mode contemplated for the organization and presentation of information.

[0040] 在操作110,信息提供者从用户接收关于信息组织和表述的响应。 [0040] At operation 110, in response to receiving the information provider organization and presentation of information from a user. 用户响应可以为选择(或者不选择)某部分信息的形式。 You may be selected in response to user (or not select) in the form of a portion of the information. 例如,某些用户可以选择给定集合页面的特定子概念的信息,而不选择其他的。 For example, some users may select specific information to the sub-pages of a given set of concepts, without selecting other.

[0041] 在操作115,从多个用户接收到的响应被关联。 [0041] In operation 115, receives the response from a plurality of users are associated. 也就是说,确定单独用户进行的相同统计上有效的联系的程度。 In other words, the same statistics to determine the extent of individual users effective linkages. 在本发明的可替换实施例中,响应的关联可以采取任何形式。 In an alternative embodiment of the present invention, the association response may take any form. 各种用户响应的示例关联在下文中将更加详细的描述。 Examples of various user responses is associated will be described in more detail.

[0042] 在操作120,基于关联后的响应而修改概念相关信息集合的组织和表述。 [0042] At operation 120, based on the response associated with the modified set of conceptually related information organization and presentation. 例如,可以重新形成集合页面以更加充分地满足单独用户、用户组或某类用户、或者所有用户的需要。 For example, a set of pages can be re-formed to more adequately meet the individual users, groups of users or classes of users, or all users needs. 除了修改原始表述的信息的表述,所述重新形成可以包括添加或者删除信息。 In addition to modifying the original presentation of presentation information, the re-forming may include adding or deleting information. 例如,可以添加一个或多个子概念到集合页面或者从其中删除。 For example, you can add one or more sub-concept to the collection page or deleted from it.

[0043] 尽管上面一般性的描述了使用用户响应以优化信息集合(例如集合页面)的组织和表述,本发明的实施例可以用于影响用户搜索引擎查询和存储内容(例如一个或多个文档)的联系。 [0043] Although the above generally describes the use of information set to optimize user response (e.g., a set of pages) of the organization and presentation, embodiments of the present invention may be used to influence a user search query and the stored content (e.g. one or more documents ) links. 也就是说,本发明的实施例可以用于响应于特定查询确定更加相关的搜索结果(即一般性的更加相关或者对特定用户更加相关)。 That is, embodiments of the present invention may be used in response to determining more relevant search results (i.e., more general more relevant or related to a particular user) to a particular query.

[0044] 图2显示了根据本发明一个实施例的通过基于在其他用户的类似信息搜索会话期间获取的用户搜索引擎(USE)活动和/或用户信息的关联而影响用户查询与存储内容的联系的程序。 [0044] FIG. 2 shows the basis of the acquired during other users of similar information search session search engine users associated (USE) activity and / or user information to affect the user query and the stored content of Contact through an embodiment of the present invention program of.

[0045] 图2所示的程序200开始于操作205,其中对若干独立用户记录搜索会话期间的USE活动信息和/或用户信息。 The routine shown in [0045] FIG 2200 begins at operation 205, in which several independent users recorded USE activity information during a search session, and / or user information. 搜索会话包括给定用户的任何搜索引擎动作(可通过搜索引擎记录的活动)的序列。 Search session including a search engine for any given user's action (movable recorded by the search engine) sequences. USE活动可以包括发出查询、点击搜索页面上的导向内部或者外部数据的链接、点击后续内部页面上的导向内部或者外部数据的链接以及在点击内部或者外部链接之后返回搜索页面或者任何内部页面。 USE activity may include issuing a query, click on the links on the search page based internal or external data, the link leads to internal or external data click on the subsequent internal pages and return to the search page after clicking on an internal or external links or any internal page. USE活动可以为连续的或者在实际持续周期中发生。 USE activity may be continuous or occur in the actual duration period. 也就是说,可以指定表示搜索会话终止的时间周期。 In other words, you can specify the time period represented a search session termination. 例如,如果所记录的用户活动中的中断超过了指定时间,在实际中可以假定搜索会话已经结束。 For example, if the interrupt user activity recorded exceeds the specified time, the search can be assumed that in practice session has ended. 后续用户活动可以认为是新的搜索会话。 Subsequent user activity can be considered a new search session.

[0046] 在操作210,所记录的若干独立用户的USE活动信息和/或用户信息被关联。 [0046] operation 210, a number of independent users of the recorded USE activity information and / or user information are associated in. 所述信息反映了各个用户的整个搜索会话,可以根据本发明的可替换实施例以各种方式进行关联。 The information reflects the overall search session of each user, it may be associated in accordance with an alternative embodiment of the present invention in various ways. 各种USE活动和/或用户信息的示例关联在下文中将更加详细描述。 Examples of various USE activity and / or information associated with the user will hereinafter be described in more detail.

[0047] 在操作215,通过操作210获得的所记录的若干独立用户的USE活动信息和/或用户信息的关联结果被用于影响特定存储内容和对应用户查询之间的联系。 [0047] At operation 215, the operation by the user 210 obtains several independent of the recorded USE activity information and / or associated information is result of the user for affecting a specific link between the query and the corresponding user stored content. 这种联系可以提供一般性的或者对于一个或多个特定用户更加相关的给定查询的搜索结果。 This contact can provide general or more relevant to the search results for one or more specific users for a given query.

[0048] 盖统 [0048] The cover system

[0049] 本发明的实施例可以在网络环境中实施。 [0049] Embodiments of the invention may be implemented in a network environment. 图3显示了根据本发明一个实施例的用于组织并且表述概念相关信息以及用于联系特定存储内容和各个用户查询的系统。 And Figure 3 shows the conceptual expression systems for the relevant information and store content and contact a particular individual user query according to an embodiment of a tissue of the present invention. 如图3所示的系统300显示了数字处理系统(DPS)300的网络,包括显示为服务器DPS 320的一个或多个服务器DPS,以及显示为客户端DPS 305-308的多个客户端DPS。 The system 300 shown in FIG. 3 shows a digital processing system (DPS) network 300, including a display of a server or plurality of servers DPS 320 DPS, and displaying a plurality of client DPS 305-308 client DPS. 系统300的DPS互相连接并且配置为交换多个各种类型的包含文档的存储内容,例如网络页面、网络页面上存储的内容,包括文本、图片以及音频和视频内容。 DPS system 300 is connected to each other and configured to store a plurality of types of content exchange comprises a document, such as web pages, stored on the web page content, including text, images and audio and video content. 例如,所存储的内容可以为音频/视频文件,例如具有移动图像和音频的程序。 For example, the stored content can be audio / video files, for example, a moving image and audio programs. 信息可以通过任何类型的通信网络在DPS之间进行通信,多个不同设备可以通过所述通信网络进行通信,例如(但不限于)互联网、未显示的广域网(WAN)、局域网(LAN)、内联网等等。 Information may be performed by any type of communication network in the communication between the DPS, a plurality of different devices may communicate through the communications network, such as (but not limited to) the Internet, a wide area network (not shown) (WAN), a local area network (LAN), the networking and so on. 例如,如图3所示,DPS通过互联网310而互相连接,互联网310是包含具有如上所述的数据通信方法的多个网络的其中一种,并且对于本领域技术人员是公知的。 For example, as shown in FIG. 3, the DPS 310 interconnected by the Internet, the Internet 310 is a network comprising a plurality of data communications with the method described above is one, and the skilled person is well known. 连接服务器DPS和客户端DPS的通信链接并不一定为直接链接,而是可以为间接链接,包括但不限于广播无线信号、网络通信等等。 Connecting to the server and the client DPS DPS communication link is not necessarily a direct link, but may be indirectly linked, including but not limited to broadcast wireless signals, network communications, etc. 尽管图3中显示了示例的DPS,可以理解,可以互相连接大量这样的DPS。 Although FIG. 3 shows an example of the DPS, be understood that a number of such interconnected DPS.

[0050] 根据本发明一个实施例的可以用于服务器DPS 320或者客户端DPS 305-308的数字处理系统的实施例将在下文中参考图12进行描述。 [0050] DPS 320. server or a client data processing system DPS 305-308 embodiment will be described with reference to FIG. 12 below, according to one embodiment of the present invention may be used. [0051] 根据本发明一个实施例,概念相关信息的集合,例如集合页面,通过客户端DPS305-308表述给若干用户。 [0051] The set of one case, the concept of the embodiment of the present invention, information, such as a set of pages, a number of expression to the user via the client DPS305-308. 所述概念相关信息还可以为响应于从一个或多个客户端DPS305-308发送的用户查询的搜索结果。 The concept may also be related information in response to the search results from one or more client users DPS305-308 transmitted queries. 所述信息可以采用多种形式,例如可以为通过网络页面开发者提供的网页URL地址列表。 The information may take many forms, for example, a URL address list of pages provided by the web page developer. 一旦表述在客户端DPS上,用户对所述表述做出一定响应。 Once expressed on the client DPS, the user must make a response to the presentation. 例如,用户可以执行如上所述的USE活动。 For example, the user can perform USE activity as described above. 与对用户的信息表述相关的用户响应和其他用户信息被记录并且被发送到用户响应/信息关联应用程序321。 Representation of information relating to a user in response to the user information and is recorded and transmitted to other users in response to user / application related information 321. 该应用程序321对若干用户的用户响应和信息进行关联,并且基于关联的结果修改概念相关信息的组织和表述。 The application 321 and associated information, and modifies the organization and presentation concepts related information based on a result of a number of users associated user response.

[0052] 数据分析 [0052] Data Analysis

[0053] 本发明的实施例获取USE活动信息和/或用户信息并且对这些信息进行关联,以通过使用多个用户的多数选择而辅助定义相关度。 [0053] Embodiments of the present invention acquires USE activity information and / or user information and the information for associating to a plurality of users by using a plurality of selection of the auxiliary defined correlation. 所述关联分析包括评价共同动作或者多个用户信息的程序,以识别统计上有效的联系。 Evaluation of the association analysis program comprising a plurality of co-operation or user information to identify a statistically significant link. 对这些实施例使用的术语“联系(association) ” 和“统计上有效的联系(statistically significant association) ” 定义如下。 These embodiments use the term "contact (Association)" and "statistically valid link (statistically significant association)" is defined as follows. “联系”为搜索会话期间用户明确或者间接、有意识或者无意识确定的查询、术语、概念、文档或者其他网络数据及其组合的任何配对。 "Contact" for the user during a search session explicitly or indirectly, consciously or any matching query terms, concepts, documents or other network data to determine the unconscious, and combinations thereof. 联系可以通过发出查询和/或选择导向查询、术语、概念、文档或者其他网络数据的链接(例如超链接)而表示。 Information can be queried and / or guide selection query terms, concept, data link or other network document (e.g., hyperlinks) represented by the issuing. 统计上有效的联系为概率上不能归属于随机事件的联系。 Contact statistically valid random events can not be attributed to the probability of a link. 当通过两个或者更多表面上独立的用户进行统计上有效的联系时记录所述关联。 When recording the association statistically valid contact by two or more separate upper surface of the user.

[0054] 本发明的实施例通过记录更加大量的信息和更加特定的信息(包括USE活动信息和/或用户信息)而提供了比现有技术方法远远更加相关的搜索引擎结果,并且实现了信息的更加深入的分析。 [0054] Embodiments of the present invention, by a more amount of information and more specific information (including USE activity information and / or user information) recorded and provides much more relevant search results than prior art methods, and achieves more in-depth analysis of the information.

[0055] 本发明的一个实施例提供了一种创建并且操作如表1所示维护所有USE活动信息和用户信息的数据文件的系统。 An embodiment [0055] The present invention provides a way to create and maintain the operating system as a data file of all the USE activity information and user information shown in Table 1. (用于描述本发明各个实施例的表格仅是示例性的并且不一定表示本发明实施例的实际数据结构。)` (Table used to describe various embodiments of the present invention is merely exemplary and does not necessarily represent the actual data structure of an embodiment of the present invention.) `

Figure CN102354313BD00091
Figure CN102354313BD00101

[0057] 表1 (* =没有联系选择的查询) [0057] Table 1 (* = no inquiries contact selected)

[0058] 表1显示了包含大量数据元素的数据文件,这些数据元素记录了各个时间点大量用户的查询和在各个用户的各个查询之后选择(点击)的URL (选择)。 [0058] Table 1 shows the data file contains a large number of data elements, data elements recorded a large number of users at various time points after each query and select each user query (click) the URL (selection). 这种数据文件可以包括表示USE活动信息和/或用户信息的大量其他数据元素。 This data file may include data representing USE activity information and a lot of other data elements / or user information. 这些数据元素,例如可以表示选择结果的显示排序、会话期间用户点击的结果的顺序、用户IP地址、IP地址的地理位 These data elements, for example, may represent a geographic location selection result display sort order of the results of the user clicks during a session, user IP address, IP address

置等等 Home and so on

[0059] 这与各种现有技术方法是不同的,在现有技术中,在周期性处理并且加载新的数据之后,简化的查询-结果选择(Q2RP)关联被录入数据库表格,例如表1A。 After [0059] This prior art various methods are different, in the prior art, the periodic process and load new data, simplified query - Select (Q2RP) is entered into the database tables associated with, for example, Table 1A . 根据本发明一个实施例,这些信息并不录入,而是以日志形式(log form)维持,数据元素表示所有的USE活动信息和用户信息。 According to one embodiment of the present invention, such information is not entered, but in the form of logs (log form) is maintained, all of the data elements represents the USE activity information and user information.

[0060] [0060]

Figure CN102354313BD00102

[0061]表 IA [0061] TABLE IA

[0062] 表IA为现有技术数据结构的简化示例,可以包括现有技术中所知的得分调整域。 [0062] Table IA is a simplified example of a prior art data structure, the score may include adjusting the prior art known to the domain. 例如,得分可以简单的为选择次数的总和,也可以为更加复杂的调整算法的结果。 For example, the score is simply the sum of the number selected, the result may be adjusted to more complex algorithms. 得分和调整也可以被存储。 Score and adjustments can also be stored.

[0063] 表IA中的现有技术数据结构足以产生排序的搜索结果,但是它表示了大量原始信息的损失。 [0063] Table IA prior art data structure sufficient to produce the ranked search results, but it represents a substantial loss of the original information. 如表IA所示,现有技术方法并没有记录或者分析搜索会话期间大量可用的USE活动信息和用户信息。 As shown in Table IA, the prior art methods do not record or analyze a number of available during a search session USE activity information and user information. 这是由于各种原因造成的,包括存储限制、缺乏对这些信息的实际使用以及没有意识到这些信息可以应用的前景。 This is due to various reasons, including storage limits, lack of practical use of such information as well as the prospects do not realize that this information can be applied.

[0064] 根据本发明一个实施例,通过记录和存储信息的延伸可以省略记录这些得分调整域。 [0064] According to an embodiment of the present invention, these scores can be recorded by extending the regulatory domain information is recorded and stored will be omitted. 也就是说,对这些实施例,没有存储得分调整信息,因为所需的任何得分可以基于所存储的信息而参数化计算。 That is, in these embodiments, the adjustment information is not stored in the score, the score for any of the desired parameters may be calculated based on the stored information. 而且,计算得分的参数和算法可以根据需要改变而并不影响所存储的数据。 Furthermore, the score calculation parameters and algorithms may not affect the stored data according to the change required.

[0065] 根据本发明各个实施例,对信息的深入分析依赖于识别和记录独立用户之间的关联数据(包括USE活动信息和用户信息)的能力。 [0065] According to various embodiments of the present invention, in-depth analysis of the information depends on the ability to identify and recording correlation data between independent users (including USE activity information and user information). 也就是说,通过记录和分析更加大量的信息,几乎包括用户会话期间的所有信息,可以获得具有远远更高的相关度的搜索结果。 That is to say, more by recording and analyzing large amounts of information, including almost all of the information during a user session, the search results can be obtained with much higher degree of correlation.

[0066] 例如,考虑用户发出一系列查询和间插其中的选择的用户会话。 [0066] For example, consider a user issues a series of queries and intervening select a user session. 通常的,在发出查询A之前选择的URL与查询A是不相关的,因为用户经常改变主题。 Usually, prior to issuing a query A select query A URL is irrelevant, because users often change the subject. 类似的,在后续的查询B之后发生的大多数选择与查询A是不相关的。 Similarly, most of the options A query occurs after the subsequent queries B is irrelevant.

[0067] 而且,考虑所有记录了包含查询A的搜索会话的大量独立用户。 [0067] Moreover, considering all the records contain a large number of unique users query A search session. 可以预期这些用户在查询A之前和之后选择了各种无关的主题,因此,无关的选择会广泛散布于大量URL上,每个URL会获得很低的得分,反映了它们与查询A缺乏联系。 These users can be expected before and after the query A select variety of unrelated topics, therefore, nothing to do select will be widely spread over a large number of URL, each URL will get a very low score, reflecting their lack of contact with the query A. 典型的,仅有非常少量的、被那些确实保持在与查询A相关的那些主题上的用户所选择的相关URL会积累起足够高的点击人气分数以影响与查询A关联的搜索结果的重新排序。 Typically, only a very small amount, are those who do remain on those topics to queries A selection of the relevant URL will accumulate click popularity score high enough to affect the search results associated with the query A reordering .

[0068] 例如,假设1000用户搜索查询A。 [0068] For example, suppose a user search query 1000 A. 接着,他们中的900人选择无关的查询BI至B900。 Then, 900 of them independent of the choice of BI queries to B900. 剩下的100人继续搜索原始主题的各种变异并且选择相关的查询A1-A9。 The remaining 100 people continue to search for the original subject of all kinds of mutation and selection-related queries A1-A9. 通过B查询产生的结果中的每一个会接收到一次或者两次与查询A有联系的选择,但是通过相关查询A1-A9产生的结果平均会累积十倍的选择。 The results produced by B in each of the query will receive once or twice with a query associated with the selected A, but the results produced by the relevant query A1-A9 cumulative average of ten times of selection.

[0069] 对于本发明一个实施例,可以强加这样的要求,即查询之后的URL必须被选择至少两次以与原始查询相关。 [0069] For one embodiment of the present invention, it may impose such a requirement, i.e., after the URL query must be selected at least twice with the original relevant. 这样的要求可以消除与查询A错误联系的大量B查询选择。 Such requirements can eliminate a large number of B query selects error associated with query A.

[0070] 根据本发明一个实施例,提供依赖于大量统计样本的关联分析,以识别多个相关的联系。 [0070] According to a related embodiment of the present invention, there is provided depends on a large statistical sample of the analysis, to identify a plurality of related links. 对于这些实施例,减少了现有技术中对所分析的联系的限制和随意的数据划分以增加相关度。 For these embodiments, the prior art reduces the limitations and random contact of the analyzed data is divided to increase the degree of correlation. 也就是说,记录和分析更加大量的用户会话信息还允许分析更多相关类型的联系。 That is, the recording and analysis of a large number of user sessions more information also allows the analysis of more types of contact.

[0071] 示例的USE活动关联 [0071] USE activity associated with an example of

[0072] 根据本发明一个实施例,提供了一种使用一个或者更多基本关联的小集合及其组合的系统。 [0072] The system according to one embodiment of the present invention, there is provided a method of using one or more small set of basic correlations, and combinations thereof. FIG. 通常的,根据本发明的各个实施例可以确定任意数量的关联并且用于实现搜索结果相关度增加或者其他目标。 Typically, the association may be determined according to any number of various embodiments of the present invention for achieving the search results and increased affinity or other objectives. 下面详细描述某些示例的关联。 The following detailed description of certain exemplary associated.

[0073]杳询-诜择(QUERY-T0-PICK) [0073] disappeared consultation - Shen Optional (QUERY-T0-PICK)

[0074] 查询-选择(Q2P)关联将查询与选择相联系。 [0074] query - select (Q2P) associated with the query associated with the selection. 当多个独立用户进行相同的联系时,该相同的联系即为关联候选。 When a plurality of users independently the same contact, the contact that is associated with the same candidate. 当搜索引擎响应于查询返回结果并且用户选择该结果时,这是这种关联的特定情况(Q2RP)。 When the search engine returns results responsive to the query and the user selects the result, which is associated with this particular case (Q2RP). 在实际中,搜索引擎算法替代第二独立用户。 In practice, the search engine algorithm instead of the second individual user. 根据本发明一个实施例,Q2P关联将查询与用户会话中所有的选择相联系。 According to one embodiment of the present invention, Q2P associated with the query associated with the user session all the options. 这与现有技术的方案是不同的,在现有技术中,一旦发出后续查询即终止给定查询与选择之间的联系。 This prior art solution is different, in the prior art, i.e., once issued subsequent queries to terminate the link between the query and a given choice.

[0075] 通过Q2P,用户会话期间记录的所有选择与该用户会话期间发出的给定查询相联系。 [0075] By Q2P, recorded during a given user session query associated with all selected during the user session sent. 在一个实施例中,基于各种因素为每个联系分配得分,这些因素包括查询和选择之间的时间、间插的查询和/或选择的数量以及相对于选择的查询次序。 In one embodiment, the score is based on various factors assigned to each contact, these factors include the time between query and selection, intervening queries and / or quantity of the selected query sequence, and with respect to selection.

[0076] 而且,可以基于公知的因素而调整每个联系的得分,这些因素包括联系时结果列表中选择的排序、选择的延续时间(下次已知用户动作之前的间隔)、联系的期限或者次序(相对于更旧或者更新的联系)以及联系的第一已知示例的期限。 [0076] Further, the score of each link can be adjusted based on well-known factors including the result list selected when contacting sorting, the selected duration (interval before the next user action is known), or the duration of contact order (relative to older or newer associations) as well as the period of the first known example of contact.

[0077] 每个用户会话可以具有无限的持续时间。 [0077] Each user session can have unlimited duration. 在实际应用中,可以强加合理的时间限制或者插入动作的限制,超出此限制之外则在选择和查询之间不指定关系。 In practical applications, it can impose a reasonable time limit or restrict movement of the insertion beyond this limit than the not specify the relationship between the selection and queries. 可替换的或者附加的,足够持续时间的中断可以表示会话的中断。 Alternatively or additionally, sufficient duration of interruption may represent a session interrupted. 根据本发明一个实施例的搜索日志摘要显示在下面的表2中。 The search log excerpt, an embodiment of the present invention is shown in the following Table 2. 在各种可替换实施例中,可以在该搜索日志中捕捉任何其他项目,但是为了清晰起见在此省略。 In various alternative embodiments, any other item may be captured in the search logs, but omitted here for clarity.

[0078] [0078]

Figure CN102354313BD00121

[0079] 表2 (* =没有联系选择的查询) [0079] Table 2 (* = no inquiries contact selected)

[0080] 图4显示了根据本发明一个实施例的三个独立用户的搜索日志。 [0080] FIG. 4 shows three independent users in accordance with one embodiment of the present invention a search log. 图4所示的搜索日志摘要400包括分别描述三位独立用户Ul、U2和U3的搜索信息的搜索日志410、420和430,如同以上参考表2所述。 Search log excerpt 400 shown in FIG. 4 respectively include search information described in the search log three independent users Ul, U2 and U3, 410, 420 and 430, as above with reference to Table 2. 每个搜索日志中的虚线框表示Q2P搜索信息中的Q2RP部分。 Each search logs dashed box Q2RP part Q2P search information. 例如,搜索日志410包括Q2RP部分411,其中查询Ql产生了选择P5。 For example, search log 410 includes Q2RP portion 411, which generates a selection query Ql P5. 搜索日志410还包括Q2RP部分412,其中查询Q2产生了选择P1、P2和P3。 Search log 410 further includes Q2RP portion 412, which generates a selection query Q2 P1, P2 and P3.

[0081] 搜索日志420包括持续时间为48小时的中断421。 [0081] search log 420 including the duration of 48 hours, interrupted 421. 在本发明一个实施例中,如此长时间的中断可以表示两个单独的会话,二者之间不会指定任何选择和查询之间的关系。 In one embodiment of the present invention, so long interruptions may represent two separate sessions, does not specify any relationship between picks and queries therebetween. 相反的,搜索日志430包括持续时间为2小时的中断431。 Instead, the search log 430 includes a duration of 2 hours interrupt 431. 在本发明一个实施例中,这种中断可以不表示两个单独的用户会话。 In one embodiment of the present invention, such an interrupt may not be represented by two separate user session. 也就是说,搜索日志430的所有搜索活动可以认为是单次用户会话以及相应关联的信息。 In other words, the search log of all search activity 430 can be regarded as a single user session information and corresponding association.

[0082] 表2A显不了根据本发明一个实施例的表2中包含的点击信息的表格。 [0082] Table 2 Table 2A not significant in the embodiment of the click information contained in the table in accordance with an embodiment of the present invention. 为了比较,表2B显不了根据米用Q2RP关联的典型现有技术方法的表2中包含的点击信息的表格。 For comparison, the table included in Table 2B not significant according to the table of a typical prior art method 2 meters associated with Q2RP click information.

[0083] [0083]

Figure CN102354313BD00131

[0084]表 2A (Q2P 结果) [0084] Table 2A (Q2P result)

[0085] [0085]

Figure CN102354313BD00132

[0086] 表2B (现有技术的Q2RP结果)[0087] 除了在下面的损失情况中我们假定一次选择表示得分增加O之外,由于大量因素可能改变得分或者使得分损失,我们假定I次选择=得分+1。 [0086] Table 2B (Q2RP result of the prior art) [0087] In addition to losses in the following we assume a selected increase in the score represents O addition, since a number of factors may be changed such that the points scored or loss, we assume that the I-th selection score = +1. 假定一时间阈值,行103中的点击在两个表格中均被损失,因为用户在该URL上仅花费很短的时间。 Suppose that a time threshold, line 103 clicks were lost in the two tables, because users spend only a short time on the URL. 假定数据库每天进行批处理更新,行203中的点击在表2B中的现有技术表格中通常会作为点击201的重复而被损失。 Assuming daily batch updates the database, click the row 203 in the prior art form in Table 2B are repeated as often lost 201 clicks. 根据本发明一个实施例,行203和402中的点击作为点击201的重复而在表格中被损失。 According to one embodiment of the present invention, rows 203 and 402 is lost as a repeating click click 201 in the table.

[0088] 根据本发明一个实施例,对于查询Q1,在Ql之后从未立即点击的URLPl在表格中得到了高分,因为多个用户在发出查询Ql之前或者之后(尽管不是立即之后)选择了它。 [0088] According to a previous embodiment of the invention, the query Q1, immediately after the click URLPl Ql never got a score in the table, since a plurality of users issue queries Ql or after (though not immediately after) the selected it. 根据本发明一个实施例,表格的整个得分矩阵更加充实,因为记录了更多的联系。 According to an embodiment of the present invention, the entire scoring matrix form more substantial, since more contact records. 某些分数比较低,例如Q2P4的分数,这是由于保存的会话数据表示所有的点击来自单个用户,从而允许识别更多的重复。 Some fraction is relatively low, e.g. Q2P4 score, which is stored in the session because all of the data representing the user clicks from a single, allowing to identify more repetitions.

[0089] 在Q2P的实际应用中,我们可以保持特定联系为Q2RP还是非Q2RP的区别。 [0089] In practice Q2P, we can keep the difference between the specific contact or non-Q2RP for Q2RP. 单次的、不关联的非Q2RP点击(例如表格中的Q3P1)可以不产生足够的提供结果给用户的置信度,然而对于单次、不关联的Q2RP点击,通过搜索引擎提供原始搜索的结果的事实而加强了联系。 A single, non-Q2RP click is not associated (e.g. Q3P1 table) may not produce enough to provide results to the confidence of the user, but for a single, Q2RP not associated clicks, provide the results of the original search by the search engine the fact strengthened the link.

[0090]诜择-杳询(PICK-T0-QUERY) [0090] Shen choose - disappeared consultation (PICK-T0-QUERY)

[0091] 选择-查询(P2Q)关联是将用户会话期间记录的所有查询相联系,这些查询与该用户会话期间发出的给定选择相关联。 [0091] select - Query (P2Q) is associated with all queries recorded during a user session associated, given selector associated with those queries issued during that user session. 表2中的搜索日志摘要显示了P2Q相关的输出。 Search Log Summary Table 2 shows the P2Q related output. 也就是说,对Q2P产生的相同数据可以对P2Q重新编制索引。 In other words, the same data can be generated for Q2P re-index P2Q.

[0092]杳询-杳询(QUERY-T0-QUERY) [0092] disappeared consultation - disappeared consultation (QUERY-T0-QUERY)

[0093] 查询-查询(Q2Q)关联是将用户会话期间发出的所有查询与该会话期间发出的所有其他查询相联系。 [0093] Query - Query (Q2Q) is related to all other inquiries all queries issued during a user session issued during the session linked. 在一个实`施例中,可以基于各种因素为每个联系指定得分,这些因素包括查询之间的时间、间插的查询和/或选择的数量、联系的期限或者次序(相对于更旧或者更新的联系)、查询结果是否产生了选择以及联系的查询的成对次序。 'In one embodiment, can be specified based on various factors score for each contact, these factors include the time between query, intervening queries and / or quantity of the selected term or order of (with respect to older or contact update), the query results whether an order selection and query pair of contact.

[0094] 确定查询结果是否产生了选择以及联系的查询的成对次序可以提供特别多的信息,因为它们可以表示一次查询是否为另一次查询的“关联”。 [0094] the query results to determine whether or not a select query and order a pair of contact can provide special much information as they can represent a query whether another query "association." 对于任何实际应用,知道两个联系的查询中的哪一个正确哪一个错误是很有用的。 For any practical application, two linked queries know which one correct a mistake which is very useful.

[0095] 根据本发明一个实施例的搜索日志摘要显示在如下的表3中。 [0095] The search log a summary of the embodiment of the present invention display the following Table 3. 仅需要搜索日志的查询部分以创建Q2Q表格。 Only part of the search query logs need to create Q2Q form.

Figure CN102354313BD00141
Figure CN102354313BD00151

[0098] 表3 [0098] TABLE 3

[0099] 表3A显不了根据本发明一个实施例的表3中包含的点击信息的表格(假定忽略发出查询的顺序)。 [0099] Table 3A can not form significant click information contained in Table 3 of the embodiment according to the present invention, an embodiment (assuming ignore the order of issuing the query).

Figure CN102354313BD00152

[0101]表 3A (Q2Q 结果) [0101] Table 3A (Q2Q result)

[0102] 表3A的下三角区域可以用于保存成对的查询次序信息,避免如同行301-303—样的双登记(double-booking)情况。 Lower triangular region order information query [0102] Table 3A may be used in pairs to save, to avoid such kind of peer-bis registration 301-303- (double-booking) situation.

[0103] 如上所述,可以采用利用各种因素改变得分或者使得分损失的计分方法。 [0103] As described above, with various factors that modify the score scoring or partial loss that may be employed. 例如,可以对重复(比如行101和102中的联系和行401和402中的联系)进行惩罚。 For example, repeated (such as rows 101 and 102 and link lines 401 and contact 402) punish. 或者,不关联的Q2Q联系,例如Q2Q3,不会产生足够的提供结果给用户的置信度。 Alternatively, the Q2Q contact uncorrelated e.g. Q2Q3, will not provide sufficient confidence to the result of the user.

[0104]诜择-诜择(P ICK-TO-PICK) [0104] Optional Shen - Shen Optional (P ICK-TO-PICK)

[0105] 选择-选择(P2P)关联是将用户会话期间发出的所有选择与该会话期间发出的所有其他选择相联系,这样,P2P关联与上述的Q2Q关联类似。 [0105] Select - Select All Select all other session emitted during the selection (P2P) issued during the association is linked to a user session, so that, P2P Q2Q correlation associated with the above-described similar. 同样的,根据各个实施例,可以基于各种因素对每个联系指定得分,这些因素包括选择之间的时间、间插的查询和/或选择的数量、联系的期限或者次序(相对于更旧或者更新的联系)以及联系的选择的成对次序。 Also, according to various embodiments, various factors can be specified based on the score for each contact, these factors include the time between the selection, intervening queries and / or quantity of the selected term or order of (with respect to older or updated contact) and a pair of contact order of selection.

[0106] 根据本发明一个实施例的搜索日志摘要显示在如下的表4中。 [0106] The search log excerpt, an embodiment of the present invention is shown below in Table 4. 仅需要搜索日志的选择部分以创建P2P表格。 Need only select portions of the log to create a P2P search form.

Figure CN102354313BD00161

P4[0108]表 4 P4 [0108] TABLE 4

[0109] 表4A显不了根据本发明一个实施例的表4中包含的点击信息的表格(假定忽略发出选择的顺序)。 [0109] Table 4A according to the table not a significant click information contained in Table 4, in the example of embodiment of the present invention (assuming the order issued by ignoring selected).

[0110] [0110]

Figure CN102354313BD00162

[0111]表 4A (P2P 结果) [0111] Table 4A (P2P result)

[0112] 同样的,重复(比如涉及行201的联系和涉及行203的联系)可能受到损失,花费很短时间在URL上的用户也会受到损失。 [0112] Also, repeating (such as contacts 201 relates to line 203 and is directed to the line contact) may be lost, it takes a very short time on the URL the user will suffer.

[0113] 表4A的下三角区域可以用于保存成对的选择次序信息,避免如同行201-203—样的双登记情况。 Lower triangular region [0113] Table 4A may be used to select a pair of information storage order to avoid double registration of such peer 201-203- like.

[0114] 示例的USE活动关联的组合 Composition [0114] of the associated exemplary USE activity

[0115] 根据本发明各种可替换实施例,可以将两个或者更多关联(例如以上描述的基本关联)连接在一起以提供更加相关的搜索结果。 [0115] According to various alternative embodiments of the present invention, it may be associated with two or more connections (e.g. basic correlations described above) together to provide more relevant search results. 例如,可以连接两个或者更多的基本关联以模仿基本关联从而增强其结果,特别是在稀少数据或者产生需要广泛匹配的附加结果的情况下。 For example, when two or more may be connected substantially to mimic the associated basic correlations to enhance a result, especially in the sparse data or generate additional results require extensive matching.

[0116] 连接后的关联使用选择或者查询而不是用户,以形成其他选择和查询之间的链接。 [0116] After the association or connection using select query instead of the user, to form links between queries and other options. 通常的,连接的关联越多,结果偏离初始选择或者查询越远。 Typically, the more associated connections, the query results from the initial selection or farther. 因此,在很多情况下,连接最少的关联以产生所需结果是最优的途径。 Thus, in many cases, a minimum of connections associated to produce a desired result is optimal way.

[0117] 表5显示了上述的USE活动基本关联的两种关联的可能组合。 [0117] Table 5 shows the possible combinations of two of the USE activity associated substantially associated.

[0118] [01]

Figure CN102354313BD00171

[0119]表 5 [0119] TABLE 5

[0120] 连接关联可能引入错误的关系,因此,在本发明一个实施例中,对关联进行关联。 Associated connection [0120] relation may introduce errors, therefore, in one embodiment of the present invention, the association of the association. 例如,如果Q2Q关联需要两个独立用户,QQQ关联字符串应当需要链接原始和最终查询的两个查询。 For example, if Q2Q association requires two separate users, QQQ associated with the string should be required link to the original query and two queries final. 在QQQ中,互相联系的查询(Q3)将一个查询(Ql)与另一查询(Q2)相联系。 In the QQQ, the query (Q3) of interconnected a query (Ql) and another query (Q2) linked. 如果两个或者更多独立的、互相联系的查询进行相同的联系,则这是一种关联。 If two or more separate, interrelated query the same connection, then this is an association.

[0121] 如上参考基本关联所述,原始选择或者查询和输出选择或者查询之间的联系至少通过两个个体形成(或者通过一个搜索引擎附加一个个体)。 Information [0121] described above with reference to the basic correlations, and outputs the original pick or query or select a query between the at least two individuals are formed by (a subject or additionally by a search engine). 通过连接的关联,可以没有单独用户(或者搜索引擎)将原始选择或者查询与任何输出选择或者查询相联系。 Through the associated connection, there may be no individual user (or search engine) the original pick or query or queries select any output linked. 间接关联也最少需要两个独立用户。 Indirect association is also a minimum of two separate users.

[0122] 在效果上,连接的关联倾向于预测在更加大量的数据被收集到的未来某个时间点时基本关联可能会是什么样子。 [0122] In effect, the associated connection tends to predict future point in time when a lot more data is collected basic correlations might look like. 本质上,它们识别尚未被观察到的可能的关联。 Essentially, they have not been observed to identify a possible association. [0123] 关联的组合需要多个中间选择或者查询之间的关联。 [0123] The compositions require a plurality of associated intermediate selected or association between the query. 链接原始和相关的选择或者查询的不同的选择和/或查询的数量,比中间选择和/或查询与原始和相关的选择和/或查询链接得多接近更重要。 And related links to the original selection or a different selection query and / or the number of queries, than the middle selection and / or queries related to the original and the selection and / or query much closer link is more important. 对于一个关联,必须有至少两个不同的链接路径,而不管有多少用户建立了这些链接。 For an association must have at least two different link path, regardless of how many users have established these links. 也就是说,通过一个中间节点Q3联系Ql和Q2,即使多个用户已经建立了这种联系也并不会组成Ql和Q2之间的关联。 That is, through an intermediate node contact Ql and Q3 Q2, even if multiple users have established such a link also does not make up the association between Ql and Q2.

[0124] 大量因素影响了关联的强度,包括链接路径的数量、各个直接关联分量链接的强度以及各个中间节点的独特性。 [0124] a number of factors affect the strength of association, including the unique number of links of the path, the intensity of each component of the direct link and the associated respective intermediate node. 例如,链接通过公共和一般性查询(例如“汽车”)关联的两个选择可能产生比通过更加独特的中间查询(比如“1965福特野马敞蓬车”)链接两个选择远远更弱的关联。 For example, by linking public and general inquiries (eg, "car") associated with the two options may produce more unique than that by the middle of queries (such as "1965 Ford Mustang convertible") link two choices is much weaker association .

[0125] 连接关联的优点通过下面的连接后的关联“查询-选择-查询”(QPQ)的示例可以 [0125] The advantages associated with the connection associated with the connection by the following "Query - Select - Query" (QPQ) examples may

更好的理解。 Better understood.

[0126] 表6A和表6B分别显不了表2A和2B的交叉QP得分的相乘结果,并且对于查询将这些结果相加以确定组合后的联系得分。 [0126] Table 6A and Table 6B were not significantly cross QP 2B Tables 2A and multiplication result of the score, and the query result to these contact points to determine the combination. (这并不一定是最优算法,而是用于示例目的)。 (This is not necessarily optimal algorithm, but for illustrative purposes).

[0127] [0127]

Figure CN102354313BD00181

[0131] 表6C和表6D分别显示了表6Α和表6Β的等效Q2Q关联表格。 [0131] Table 6C and 6D show the tables and tables Table 6Α 6Β equivalent Q2Q correlation table.

[0132] [0132]

Figure CN102354313BD00182
Figure CN102354313BD00191

[0135]表 6D [0135] Table 6D

[0136] 如上所述,根据本发明一个实施例,QPQ关联结果比本发明的更加特定的可替换实施例的较窄QRPQ关联产生了远远更多的关联,并且比根据本发明又一个实施例的Q2Q关联产生远远更多的关联。 [0136] As described above, produces a far more narrow QRPQ association according to a related embodiment of the present invention, QPQ correlation result a more specific embodiment of the present invention may alternatively ratio, and the ratio in accordance with a further embodiment of the present invention Q2Q associated cases of producing much more relevance. 而且,使用QPQ关联允许对不恰当形成而不会产生搜索结果的查询提供建议。 Moreover, the use QPQ association allows queries to improper formation without producing search results provide advice. 这在现有技术方法中是不可能的。 This prior art method is not possible.

[0137] 用户-用户 [0137] User - User

[0138] 如同查询和/或选择可以通过用户关联一样,用户可以通过查询和/或选择而关联。 [0138] As the query and / or selection by a user associated with the same, the user can query and / or the associated selection. 作为与QPQ关联类似的间接关联的一般性的程序被称为用户对用户(U2U)。 As QPQ indirectly associated with the associated program similar to a general user it is called the user (U2U). 基于结果选择的U2U关联(即两个用户输入了相同的查询和选择)显示在下面的表7中。 U2U correlation result based on the selected (i.e. two users enter the same query and selection) are shown in Table 7 below. 这种关联应当为用户-结果选择-用户(URPU),尽管存在根据本发明可替换实施例的其他U2U关联。 This should be associated with the user - Select - user (URPU), despite the presence of other related embodiments U2U alternative embodiment of the present invention. 同样的,根据各种实施例,可以基于各种因素对各个联系指定得分。 Also, according to various embodiments, each contact may be assigned a score based on various factors. 例如,假定时间阈值,行103中的点击被损失,因为用户仅在URL上花费很少时间。 For example, assume that the time threshold, click on the line 103 is lost, because users only spend very little time on the URL.

[0139] [0139]

Figure CN102354313BD00201

[0140]表 7 [0140] TABLE 7

[0141] 表7Α显不了根据本发明一个实施例的表7中包含的点击信息的表格。 [0141] Table 7Α not significant Example 7 of Tables Table click information contained in an embodiment of the present invention.

[0142] [0142]

Figure CN102354313BD00202
Figure CN102354313BD00211

[0143]表 7A [0143] Table 7A

[0144] 根据本发明一个实施例,一位给定用户(例如Ul)对另一用户(例如U2)的类同度可以定义为该用户与另一用户共享的查询/选择的数量,除以给定用户的查询/选择总数(即类同度:^ =(共享的QPuihi2)/(QPui的总数))。 [0144] According to an embodiment of the present invention, a given user (e.g. Ul) the affinity of another user (e.g., U2) can be defined for the number of shared user and another user query / selection, divided by given user query / select the total number (i.e., the affinity of: ^ = (shared QPuihi2) / (total number of QPui)). 在本发明可替换实施例中,可以使用更加复杂的类同度算法。 In alternative embodiments of the present invention may be used similar to more sophisticated algorithm. 例如,根据本发明一个实施例,类同度算法可以把搜索频率的差异考虑在内并且对查询、选择和查询-选择施以不同的权重。 For example, according to an embodiment of the present invention, similar algorithm can account for differences in the frequency of the inner and search query, and the query selection - selecting different weights applied.

[0145] 表7B显示了对表7A的点击信息计算的类同度信息。 [0145] Table 7B shows similar information of the click information table 7A calculated.

Figure CN102354313BD00212

[0147]表 7Β [0147] Table 7Β

[0148] 使用这种类同度信息,当对给定用户将来发出的查询产生结果时,根据所述类同度信息调整对应用户的选择得分。 [0148] Using this similar degree information, when a result of the query issued by a given user in the future, the affinity of the score based on the adjustment information corresponding to a user's selection. 例如,用户Ul的未来查询结果将以0.67调整用户U2产生的选择得分和0.33调整用户U3产生的选择得分。 For example, the user Ul future results of the adjustment will be 0.67 Select Select score generated by the user U2 and U3 user generated adjusted 0.33 score. 没有类同度的用户产生的选择将被指定一定的缺省值。 The user does not select the affinity of a certain generation will be assigned default values.

[0149] 示例应用 [0149] Application Example

[0150] 上述的参考本发明各个可替换实施例的组织和表述数据的方法和系统可以用于各种实际应用,这对本领域技术人员是显而易见的。 [0150] The present invention is described above with reference to various alternative method and system for the organization and presentation of data may be used in various embodiments of practical use, it will be apparent to those skilled in the art. 下面更加全面的讨论这些应用。 Discussed more fully below these applications. 特定USE活动信息和/或用户信息的使用可以比其他信息更加适合于特定应用。 USE activity information and specific use / or user information may be more suitable for a particular application than other information. 例如,对于特定应用,对特定USE活动进行关联将会更加实用、更加有效或者更加准确。 For example, for a particular application, to associate a particular USE activity will be more practical, more efficient or more accurate. 下面的示例应用将针对特别适合于特定应用的实际实施的USE活动信息和/或用户信息而描述。 The following example will be described for application to a practical embodiment is particularly suitable for the particular application of the USE activity information and / or user information.

[0151] 示例的Q2P应用 [0151] Example applications of Q2P

[0152] 本发明的对Q2P和/或Q2P等价组合USE活动信息进行关联的实施例允许用户以各种方式获取更加相关的搜索结果。 [0152] Example of Q2P and / or Q2P equivalent combination USE activity information related to the present invention allows the user to obtain more relevant search results in a variety of ways. 例如,用户可以精确化搜索并且将修订后的结果的某些部分与原始搜索相联系。 For example, a user may search and the accuracy of some of the results of the revised portions of the original search linked. 也就是说,选择的文档并不一定在文字上与原始搜索关联,而仅是概念上的关系。 In other words, the selected document is not necessarily related to the original search in the text, but only the relationship between the concepts. 概念关系可以提供对原始搜索更好的响应。 The concept relationships may provide a better response to the original search. 基于Q2P USE活动信息的关联的搜索结果避开了现有技术方法中基于文本的检索的限制。 Search results based on relevance of Q2P USE activity information to avoid the prior art methods to retrieve text-based restrictions. 本发明的实施例能够保存并且利用用户再搜索过程。 Embodiments of the present invention can be stored and re-search process with the user. 这种能力可以用于实现比现有技术方法具有大量明显优点的系统。 This capability can be used to implement the system with a large number of distinct advantages over the prior art methods.

[0153] 根据本发明各种实施例的大量独立用户的Q2P USE活动信息的关联不仅利用了先前用户的相关度判断,而且利用了其研究努力。 [0153] The large number of independent users associated with various embodiments of the present invention Q2P USE activity information using not only the determination of the previously associated user, and the use of their research efforts. 后续用户不需要重复先前用户的错误,而是可以从先前用户的尝试-错误的教训中受益。 Subsequent users do not need to repeat the previous user error, but from the user's previous attempts - to benefit the wrong lessons. [0154] 图5显示了根据本发明一个实施例的提供更多相关搜索结果的程序。 [0154] FIG 5 illustrates a process for providing more relevant search results in accordance with an embodiment of the present invention. 图5所示的程序500开始于操作505,其中从用户接收查询。 Routine shown in FIG. 5 500 begins at operation 505, where the query is received from a user. 所述查询可以具有一个或多个特定特性,这些特性一旦被识别出来,则可以作为根据本发明各个可替换实施例提供更加相关的搜索结果的基础。 The query may have one or more specific properties that once identified, can be used as basis for providing more relevant search results in accordance with the present invention various alternative embodiments.

[0155] 在操作510,大量用户的Q2P USE活动信息对于所接收到的查询进行关联。 [0155] At operation 510, a large number of Q2P USE activity information for the user in association with the received query. 每个查询可以具有各种特定特性,这些特性可以通过Q2P USE活动信息的关联而确定。 Each query can have a variety of specific characteristics that can be determined by correlating Q2P USE activity information. 这些特定特性例如可以包括:查询可以对不同用户具有不同意义,查询可以误拼写,查询可以具有等价的措辞,查询可以具有较为相关的部分和不太相关的部分,查询可以与特定结果产出或者结果产出组合相联系,以及查询可以具有更宽或者更窄的搜索结果。 These may include specific characteristics such as: Query can have different meanings for different users, queries can be misspelled queries may have equivalent wording, the query may have less relevant section and more relevant part of the query can produce a particular result or a combination result output linked, and the query may have a wider or narrower search results. 这些特性中的每一个特性,不管是单独的还是结合在一起,对于在不同条件下提供更加相关的搜索结果可能是有用的。 These characteristics of each feature, either alone or together, to provide more relevant search results under different conditions may be useful.

[0156] 在操作515,基于关联后的Q2P USE活动信息的搜索结果响应于查询而被提供给用户。 [0156] At operation 515, based on the search results Q2P USE activity information associated with the response to a query provided to the user. 所提供的搜索结果可以基于查询的一个或者多个特性。 Search results can be provided based on one or more characteristics of the queries. 例如,当确定所述查询具有不同意义时,可以响应于查询而提供具有基于更加流行的意义的结果。 For example, when it is determined that the query has different meanings, in response to a query-based and more popular having a significant results.

[0157] 通过程序500获取的若干示例结果与通过典型的现有技术获得的结果进行比较,相对于上述查询的特定特征而显示如下。 [0157] Several examples of the results obtained by program 500 compare the results obtained by the typical prior art, described above with respect to particular features of the query is shown below.

[0158] 表8显示了具有多于一种意义的若干查询的示例搜索结果。 [0158] Table 8 shows an example having a plurality of query search results is more than one meaning. 如表所示,与现有技术方法相比,本发明的实施例允许搜索结果相关到特定查询的更加流行的意义。 As shown in the table, as compared with prior art methods, embodiments of the present invention allows a more popular search results relevant to the query specific meaning.

[0159] [0159]

Figure CN102354313BD00221

[0160]表 8 [0160] TABLE 8

[0161] 表9显示了误拼写的示例搜索结果(例如“encycopidea”)。 [0161] Table 9 shows an example of a misspelled search result (e.g., "encycopidea"). 如表所示,本发明的实施例允许搜索结果相关到可能正确拼写的查询。 As shown, embodiments of the present invention allows the relevance of search results to the query may be correctly spelled. 通过这种方式,本发明的实施例可以确定误拼写查询的正确拼写。 In this manner, embodiments of the present invention can determine the correct spelling of a misspelled query. 现有技术方法对这种误拼写通常不会产生搜索结果,或者很差的搜索结果。 Prior art methods do not typically produce search results, or because of poor results for this search misspelled.

[0162] [0162]

Figure CN102354313BD00231

[0163]表 9 [0163] Table 9

[0164] 在本发明一个实施例中,这种拼写校正是一种“软”校正。 [0164] In one embodiment of the present invention, such a spelling correction is a "soft" correction. 也就是说,根据本发明一个实施例,响应于查询而提供的结果为输入精确查询的大多数用户所偏好的结果。 That is, according to one embodiment of the present invention, in response to the query results for providing accurate input query results preferred by most users. 如果多数用户认为该查询为误拼写,则大量结果将包含校正后的查询。 If most users consider the query is misspelled, the results will contain a large number of queries corrected. 如果多数用户认为该查询是有意图的,则大量结果将包含未改动的查询。 If most users consider the query is intentional, then the results will contain a large number of unmodified query. 如果两种解释都是合法的,则结果为二者结合。 If both interpretations are valid, the result is a combination of both. 对于这种实施例,由于所有的校正都是概念相关而不仅仅是文字上相似的,因此不太可能提供错误的拼写校正。 For this embodiment, since all calibration and similar concepts are not only the text, it is unlikely to provide an error correcting spelling.

[0165] 相反的,现有技术的拼写校正通常为“硬”校正。 [0165] In contrast, the prior art spelling correction is generally "hard" correction. 也就是说,这些方法识别误拼写查询,尝试进行校正然后基于校正搜索结果。 In other words, these methods identify misspelled query, and then try to correct the correction based on search results. 当合法的查询被误诊断为误拼写,或者查询确实误拼写但是通过算法的校正仍不是所需查询时,这些方法会提供不相关的结果。 When a legitimate query was mistakenly diagnosed as misspelled, or indeed misspelled query but the query is still required by the correction algorithm, these methods provide irrelevant results. “主动”校正会要求用户点击链接以对建议的查询再次搜索,这样也可能再次为错误校正。 "Active" correction would require the user to click on a link to check for suggested search again, so it may again error correction.

[0166] 表10显示了具有两种或者更多解释或者等价措辞(例如“Burma和Myanmar”)的查询的示例搜索结果。 [0166] Table 10 shows an example of the search results for the query, or with two or more equivalents interpreted language (e.g. "Burma and Myanmar") of the. 如表所示,本发明的实施例对具有高度共同性的各个等价措辞提供搜索结果。 As shown, the embodiment of the present invention each having a height equivalent to the wording of commonality provide search results. 对于所提供的示例,对于各个等价措辞的九个顶级搜索结果URL中有五个是相同的。 For the example provided, the wording for each equivalent of nine top-five search results the URL is the same. 这与现有技术方法相比具有远远更高的相似度。 This is compared with the prior art methods have much higher similarity.

[0167] [0167]

Figure CN102354313BD00232

[0168] 表10 [0168] TABLE 10

[0169] 用户常常在查询中包括不能提供相关信息的关键词,但是对搜索引擎带来了不必要的文本匹配要求,在现有技术方法中导致了更低相关度的搜索结果。 [0169] Users often included in the query keyword can not provide relevant information, but the search engine brings unnecessary text-matching requirements, resulting in lower relevancy of search results in prior art methods. 本发明的实施例克服了这种缺陷。 Embodiments of the present invention overcome this drawback.

[0170] 表11显示了包含多余部分的查询的示例搜索结果。 [0170] Table 11 shows examples of query results include unnecessary portion. 如表所示,本发明的实施例允许忽略查询中的多余、不重要以及不相关部分,这样提供了更加相关更加简洁的查询。 As shown, embodiments of the present invention allows a query to ignore excess unimportant and irrelevant parts, thus providing more relevant queries more concise.

[0171] [0171]

Figure CN102354313BD00241

[0172]表 11 [0172] Table 11

[0173] 根据本发明一个实施例,平等对待对各种结果产物(例如图片、音频/视频、文本、图像、新闻条目等等)的搜索结果选择。 [0173] According to an embodiment of the present invention, a variety of search results equal treatment resulting product (e.g., images, audio / video, text, images, news items, etc.) selection. 也就是说,用户不需要指定他们正在寻找哪种类型的结果。 In other words, users do not need to specify what type of results they are looking for. 提供的结果可以反映该用户先前表现出的偏好或者先前的独立用户的偏好。 The results can be provided to reflect the user's previous show preference or previous individual user's preferences. 例如,如果查询非常频繁地产生图像搜索从而特定图像为最高得分的选择,则该图像可能根据其得分而被提供为搜索结果。 For example, if the image search query produces very frequently so that the specific image is the highest score is selected, the image may be provided according to their scores as a search result. 在一个实施例中,不同结果产物的各个最高得分结果不需要按照得分次序交错排列,而是可以表述为按照产物归组。 In one embodiment, each of the different results highest score resulting product need not be arranged in staggered order of score, but may be expressed as the product according to the grouping. 对于该实施例,用户搜索的结果产物不需要在试图确认用户目的时以语言工具对查询进行解读而识别。 For this embodiment, the resulting product does not need to search for the user in attempting to confirm that the query language interpretation tool user identification purposes.

[0174] 表12显示了由搜索结果产物组合(例如包括图片结果)产生的示例查询搜索结 [0174] Table 12 shows an example of the search results generated by the product composition (e.g., including image result) query search result

果。 fruit. 如表所示,本发明的实施例允许搜索结果包含产物组合。 As shown, the embodiment of the present invention allow the search results comprising product composition.

[0175] [0175]

Figure CN102354313BD00242

[0176]表 12 [0176] Table 12

[0177] 本发明的实施例能够通过调节Q2P对Q2RP选择得分的权重而改变搜索结果的范围。 [0177] Embodiments of the present invention can be selected to change the weight and the score of the search results by adjusting the right to Q2RP Q2P. 表13显示了对查询“Stanford”的较窄和较宽的搜索结果。 Table 13 shows the narrow and wide search results for the query "Stanford" of.

[0178] [0178]

Figure CN102354313BD00251

[0179]表 13 [0179] TABLE 13

[0180] 表13的第一列包含涉及对查询的选择的较窄结果。 [0180] The first column contains the table relates to the selection of a narrower query result 13. 表13的第二列显示了宽广结果(例如仅显示了没有响应于原始查询的结果做出的选择)。 The second column of Table 13 shows the results of wide (e.g., not only shows a selection result in response to the original query made). 第二列的结果表示查询主题的较宽范围,而第一列的结果帮助用户探究查询主题的深度。 The second column represents the results of a wide range of topics of inquiry, and the results of the first column to help users explore the depth of the query subject. 实际上,较窄和较宽结果的组合可以提供最相关的搜索结果。 In fact, the combination of narrow and wide results can provide the most relevant search results. 在本发明一个实施例中,创建了具有不同范围的大量组合,允许用户选择“拓宽结果”或者“聚焦结果”链接,或者改变控制以调整组合。 In one embodiment of the present invention, creating a large number of combinations of different ranges, allowing the user to select "broaden result" or "focused" links, to adjust or change the control composition.

[0181] 示例的Q2Q应用 [0181] Example applications of Q2Q

[0182] 本发明的一个实施例对Q2Q和/或Q2Q等价组合USE活动信息进行关联,允许用户获得与其搜索相关的建议的查询。 [0182] An embodiment of the present invention Q2Q and / or Q2Q-equivalent combination USE activity information associated with, allowing the user to get recommendations relevant to their search query. 图6显示了根据本发明一个实施例的提供相关查询建议的程序。 Figure 6 shows a procedure according to the present invention is related to an embodiment of the query suggestions. 图6所示的程序600开始于操作605,其中从用户接收查询。 Routine shown in FIG. 6 600 begins at operation 605, where the query is received from a user.

[0183] 在操作610,Q2Q (和/或Q2Q等价组合)USE活动信息被关联。 [0183] At operation 610, Q2Q (and / or Q2Q-equivalent combination) USE activity information is correlated. 关联Q2Q USE活动信息直接倾向于产生查询的显著精确化的结果。 Q2Q USE activity information directly related tend to have a significant refinement of the results of the query. 关联Q2Q等价组合USE活动信息倾向于产生更加多样的结果,尽管通常会有高度的重复。 Associated Q2Q equivalent combination USE activity information tends to produce a more diverse results, although there is usually a high degree of repetition. 在本发明一个实施例中,对相对模糊的原始查询关联Q2Q等价组合USE活动信息,因为这种关联通常产生远远更多的查询建议。 In one embodiment of the present invention, relative to the original query associated Q2Q-equivalent combination USE activity blur information, because such typically produces much more associated query suggestions.

[0184] 表14显示了根据本发明一个实施例的响应于原始查询“electroniceavesdropping devices”而提供的查询建议。 [0184] Table 14 shows the query suggestions in accordance with one embodiment of the present invention in response to the original query "electroniceavesdropping devices" provided. 表14的第一列包含直接基于Q2QUSE活动信息的关联的查询建议,而表14的第二列包含基于Q2Q等价组合(即QPQ) USE活动信息的关联的查询建议。 The first column of table 14 contains query suggestions based Q2QUSE activity directly associated information, and the second column of Table 14 contains query suggestions associated Q2Q USE activity information based on the equivalent composition (i.e. QPQ).

[0185] [0185]

Figure CN102354313BD00252

[0186] 表14 [0186] TABLE 14

[0187] 在操作615,提供基于关联后的Q2Q (和/或Q2Q等价组合)USE活动信息的一个或多个查询建议给用户。 [0187] At operation 615, there is provided based upon the correlated Q2Q (and / or Q2Q-equivalent combination) USE activity information of the one or more query suggestions to the user. 在本发明一个实施例中,查询建议可以表述在结果页面上。 In one embodiment of the present invention, query suggestions can be expressed on the results page. 可替换或者附加的,可以在查询建议的页面上提供链接。 Alternatively or additionally, may provide links on the page query suggestions. 当原始查询产生大量高度相关(例如高得分)的查询建议时,这种实施例是很实用的。 When a large amount of query suggestions highly relevant original query (e.g., a high score), which embodiment is very practical. 在本发明一个实施例中,查询建议在表述之前可以被分类为精确的(包含所有原始搜索词)和相关的搜索。 In one embodiment of the present invention, query suggestions can be classified before the precise expression (including all the original search terms) and associated search.

[0188] 根据本发明一个实施例,Q2Q (和/或Q2Q等价组合)USE活动信息的关联被用于产生主动的查询拼写校正方法。 [0188] According to an embodiment of the present invention, associated Q2Q (and / or Q2Q-equivalent combination) USE activity information is used to generate the active query spelling correction method. 在主动的查询拼写校正方法中,用户选择建议的查询校正以获得基于查询校正的搜索结果。 In active query spelling correction method, the user select a suggested query correction to the correction of search results based on queries.

[0189] 图6A显示了根据本发明一个实施例的提供查询拼写校正建议的程序。 [0189] FIG. 6A illustrates a process for providing query spelling according to an embodiment of the present invention proposed correction. 图6A所示的程序600A开始于操作605A,其中从用户接收查询。 Shown in FIG. 6A program 600A begins at operation 605A, where the query is received from a user. 该查询可能为用户所需查询的错误拼与。 The error may query the user for the required query to fight with.

[0190] 在操作610A,Q2Q (和/或Q2Q等价组合)USE活动信息如上所述参考程序600的操作610而被关联。 [0190] At operation is associated 610A, Q2Q (and / or Q2Q-equivalent combination) USE activity information 600 with reference to the program operation 610 as described above.

[0191] 在操作611A,基于关联后的Q2Q(和/或Q2Q等价组合)USE活动信息确定一个或者多个查询建议。 [0191] At operation 611A, based upon the correlated Q2Q (and / or Q2Q-equivalent combination) USE activity information to determine the one or more query suggestions.

[0192] 在操作612A,一个或者多个查询建议被确定为原始查询的拼写校正(即在操作605A接收到的查询)。 [0192] At operation 612A, one or more query suggestions are determined to be the original query spelling correction (i.e., at operation 605A received query). 根据本发明的可替换实施例,查询建议的确定是根据所接收到的查询而以各种不同方式被影响的。 Determining an alternative embodiment of the present invention, query suggestions is based on is affected in various ways according to the received query. 例如,对于先前观测到的查询,在存在Q2Q信息时,概念相关并且文本相似的查询可以使用Q2Q关联信息和编辑距离计算算法而被识别。 For example, for queries previously observed, in the presence of Q2Q information, text and concepts related to the query can be similar Q2Q-related information and edit distance calculation algorithm is identified. 在此情况下,可以使用各种标准以实现作为原始(即所接收到的)查询的拼写校正的查询建议的确定。 In this case, various criteria may be used to determine as to achieve the original (i.e., received) query spelling correction query suggestions. 例如,当所确定的查询建议被关联到所接收到的查询时,与所接收到的查询文本类似并且比所接收到的查询更加一般化的所确定的查询建议可以被确定为所接收到的查询的拼写校正。 For example, when the query suggestions are determined to be associated to the received query, similar to the received the query text and the ratio of the received query more general query suggestions determined may be determined that the received query spelling correction. 在可替换实施例中,可以考虑更多标准以提高作为所接收到的查询的拼写校正的查询建议的确定的可信度。 In an alternative embodiment, additional criteria may be considered to improve the reliability of the determined received as spelling correction query query suggestions. 例如,当相比于所接收到的查询之前,查询建议倾向于更频繁的在所接收到的查询之后发出时,或者查询建议倾向于比所接收到的查询产生更多的用户选择时,这些标准可以提高在确定作为所接收到的查询的拼写校正的查询建议方面的可信度。 For example, when compared to the previously received query, query suggestions more frequently issued after the received query tends to or more than the query suggestion tends to produce more queries received user selection, these standards can increase confidence in the determination as to the received query spelling correction inquiry recommendations.

[0193] 当在操作605A接收到的查询为先前没有观测到的查询,则不存在Q2Q信息。 [0193] When received at operation 605A query to query not previously observed, no Q2Q information exists. 在此情况下,根据本发明一个实施例,如果怀疑查询中的一个词存在错误,则根据先前是否观测到所述怀疑的词而以两种方式中的一种进行评估。 In this case, according to one embodiment of the present invention, if an error is suspected in a query word, and evaluated according to the previously observed whether the suspect word in one of two ways.

[0194] 当先前观测到所述怀疑的词时,则识别出出现该词的其他查询。 [0194] When the previously observed suspect word is identified other query word appears. 基于与接收到的查询相同的关键词而对这些查询进行加权。 Based on the received keyword query same weighting these queries. 最后,对先前考虑的可疑词的校正进行检查并且用于基于先前建议的校正的频率和该可疑词出现的查询中的关键词权重而提供建议校正。 Finally, correction of previously considered suspicious words and to check for the right keyword-based queries previously recommended to correct the frequency and the questionable word appears in the heavy and provide recommendations to correct.

[0195] 当先前没有观测到所述怀疑的词时,出现所接收到的查询中的所有其他关键词的其他查询可以被识别。 [0195] When the previously observed no suspect word has appeared in all other keywords other queries received query may be identified. 在本发明一个实施例中,当没有查询满足该标准时,可以识别与所接收到的查询具有最与众不同(the most distinctive)(低频率)的公共词的查询。 In one embodiment of the present invention, when a query is not satisfied this criterion, may identify the received query to a query with the most distinctive (the most distinctive) (low frequency) is a common word. 对于各个实施例,所识别的查询可以基于与所接收的查询的文本相似度而进行加权并且识别与所接收到的查询具有很高文本相似度的最普遍的查询。 For queries various embodiments, the identified text may be weighted based on the degree of similarity with the received query and identifies the received query most general text query high similarity.

[0196] 在本发明一个实施例中,当条件不允许使用上述的拼写校正建议程序的方法时,关联后的Q2Q USE活动信息与传统的n-gram-type模型结合在一起使用。 [0196] In one embodiment of the present invention, when using the above process conditions does not allow spelling correction program recommendation, Q2Q USE activity information associated in combination with conventional n-gram-type model is used together. 在此实施例中,从查询频率数据提取出词联系频率以对已知关键词或者短语确定共同伴随词。 In this embodiment, the frequency of queries to extract data from a word frequency of contact with the common word associated with known keywords or phrases OK. 这些数据与编辑距离结合在一起被用于对多词查询中的未知词确定作为可能的拼写校正建议的查询。 These data combined together with the edit distance is used to determine the query as possible spelling correction proposed for multi-word queries unknown word. [0197] 在操作615A,确定为所接收到的查询的拼写校正的一个或者多个查询建议被提供给用户作为所接收到的查询的可能的拼写校正。 [0197] At operation 615A, determined that the received query spelling correction of the one or more query suggestions are provided to the user as the received query may spelling correction. 在本发明一个实施例中,当没有查询建议被确定为所接收到的查询的可能的拼写校正时,则参考图6的程序600的操作615如上所述的提供查询建议。 In one embodiment of the present invention, when no query suggestions are determined to be the received query may spelling correction, the reference 6 the program operation 615 as described above to provide query suggestions 600 of FIG.

[0198] 示例的P2Q应用 [0198] Application example P2Q

[0199] 根据本发明一个实施例,关于特定结果页面的建议查询可以使用P2Q或者P2Q等价组合而提供。 [0199] According to an embodiment of the present invention, a specific proposal for query results page P2Q may be used in combination to provide equivalent or P2Q.

[0200] 图7显示了根据本发明一个实施例的提供建议查询的程序。 [0200] FIG. 7 illustrates a process for providing recommendations according to an embodiment of the present invention, the query. 图7所示的程序700开始于操作705,其中接收到查询和对应的提供大量结果URL的搜索结果。 Routine shown in FIG 7700 begins at operation 705, where a query is received and a corresponding URL substantial results of the search results.

[0201] 在操作710,对各个结果URL关联P2Q (或者P2Q等价组合)USE活动信息。 [0201] At operation 710, URL associated P2Q (P2Q or equivalent combination) USE activity information for each result.

[0202] 在操作715,基于关联后的P2Q USE活动信息对各个结果URL提供建议的查询。 [0202] In operation 715, based on P2Q USE activity information associated URL provides recommendations for each query results. 也就是说,提供与搜索结果中任何页面紧密相关的建议查询列表。 In other words, it is recommended to provide closely related with the search results page any query list. 这些建议在模糊查询的情况下可能为用户提供通过URL中一者而不是其他的来实现的聚焦意义。 These recommendations may provide a URL, rather than the other sense of focus achieved by the user in the case of vague queries.

[0203] 表15显示了根据本发明一个实施例的查询“rangers”的示例搜索结果页面和示 [0203] Table 15 shows the search result page according to an example and illustrate a query "rangers" embodiment of the present invention

例的对应建议查询。 An example of correspondence suggested queries.

Figure CN102354313BD00271

[0205]表 15 [0205] TABLE 15

[0206] 示例的P2P应用 [0206] Example P2P application

[0207] 根据本发明一个实施例,响应于所接收到的查询而提供的与特定结果相似的一个或者更多结果使用P2P或者P2P等价组合而提供。 [0207] one or more results based on a query embodiment of the present invention, in response to the received provided similar results with the particular combination of P2P using P2P or equivalent is provided. 例如,接收到查询并且评估对应于该查询的结果。 For example, the query and receiving the evaluation result corresponding to the query. 基于所述评估,同样提供类似结果。 Based on the evaluation, also provide similar results. 也就是说,例如,可以随着结果指定若干相关页面和/或链接可以导向具有附加类似结果的新的结果页面。 That is, for example, you can specify a number of relevant pages and / or link with results-oriented with additional similar results of a new results page. 通常的,图像的类似结果大多会产生其他图像,网页的类似结果大多会产生其他网页,依次类推。 Usually, the resulting image is similar to most other images will produce similar results, most pages will have other pages, and so on.

[0208] 图8显示根据本发明一个实施例的提供类似结果的程序。 [0208] Figure 8 shows a similar result according to the embodiment of the present invention program. 图8所示的程序800开始于操作805,其中接收到查询并且确定对应的搜索结果。 Routine shown in FIG 8800 begins at operation 805, where a query is received and determine a corresponding search result.

[0209] 在操作810,对对应的搜索结果关联P2P (或者P2P等价组合)USE活动信息。 [0209] In operation 810 information, corresponding to the search results associated with P2P (or P2P-equivalent combination) USE activity.

[0210] 在操作815,基于关联后的P2P USE活动信息而提供一个或者多个类似结果(即类似于所接收到的结果)。 [0210] At operation 815, based on the P2P USE activity information associated with the one or more provided similar results (i.e., similar to that received result). 也就是说,提供与所接收到的结果紧密相关的结果列表。 In other words, the results provide a list of closely related to the results received. 类似结果可以组成搜索选择、图像、新闻条目等等。 Similar results can be composed of selected search, images, news items and so on. LUZMJ 衣丄O姬不J竹d沾半仅叨—Ί头施WtfJ明/^丁ST日J pnoenix IllJ促7^tfJ不W失臥结 LUZMJ clothing Shang O Ji J not only half hundred bamboo stick d -Ί administered WtfJ bright head / day ST ^ butoxy J pnoenix IllJ pro 7 ^ tfJ loss W does not lie junction

果O If O

Figure CN102354313BD00281

[0213]表 16 [0213] Table 16

[0214] 示例的用户信息应用 [0214] an example of the user information application

[0215] 个件化捭索 [0215] a member of weed cable

[0216] 个性化搜索的概念是基于如下前提的,即当知道关于用户的某些信息时可以提供更加相关的搜索结果。 [0216] The concept of personalized search is based on the premise that when you know certain information about the user can provide more relevant search results. 过去曾经分享过至少某些用户的兴趣和意见的用户的推荐被认为比没有分享其兴趣和品味的用户的推荐具有更大的价值。 In the past we have ever shared at least some of the user's interests and opinions of the user's recommendation is considered to have a greater value than that recommended users not to share their interests and tastes.

[0217] 现有技术中的个性化搜索方法通常识别用户的人口统计,然而按照该人口统计团体中的其他成员的偏好而定制结果。 Personalized Search [0217] prior art methods typically identify the user's demographics, however, in accordance with the preferences of other members of that demographic group in the customized result. 这种方法具有严重的缺陷,即在一个人口统计团体中偏好变化差异很大。 This method has a serious drawback, namely changing preferences in a very different demographic community. 每个用户通常属于很多个并且通常很难调和的人口统计团体,并且用户通常并不提供可靠的人口统计信息。 Each user typically belong to and are often difficult to reconcile a number of demographic groups, and users typically do not provide reliable demographic information.

[0218] 根据本发明一个实施例,每个用户为认为是一个人组成的团体,具有相对于其他用户的类同度。 [0218] According to an embodiment of the present invention, each user is considered to be a group of people, with respect to the affinity of other users.

[0219] 图9显示了根据本发明一个实施例的对信息进行关联以提供更相关的搜索结果的程序。 [0219] Figure 9 shows an embodiment of the present invention according to the information related to provide more relevant search results procedure. 图9所示的程序900开始于操作905,其中对每个表现至少最少量的搜索活动的用户计算类同度矩阵。 Program 900 shown in FIG. 9 begins at operation 905, where the performance of each user for at least a minimum amount of search activity similar matrix calculations. 在本发明一个实施例中,计算所述类同度矩阵可以如下进行。 In one embodiment of the present invention, the affinity of the matrix may be calculated as follows. 首先,提取所有给定用户Ul的查询和选择。 First, extract all given user query and select Ul. 然后,与Ul的查询和/或选择中的至少N个重复的用户U2被识别,并且识别U2剩余的查询和选择。 Then, the query Ul and / or N at least duplicated in the selected user U2 is identified, and the identification and selection query U2 remaining. 然后对每个查询和选择指定权重并且计算类同度分数。 Then for each query and select the specified weight and similar score calculation. 所述权重可以与该查询和/或选择在所有用户中的共同程度成反比,并且通过将加权后共有的选择/查询与总的加权选择/查询进行比较而计算类同度分数。 The weights may query the and / or select the degree inversely proportional to the common among all users, and shared by the weighted selection / query to the total weighted selection / query similar comparison score is calculated.

[0220] 在操作910,对Ul的新的查询产生的搜索结果进行分析,并且如果任何先前选择属于具有非零类同度分数的用户则增加各个结果的分数。 [0220] At operation 910, the search results generated new query Ul analyzed, and if any of the previously selected belongs to have non-zero score similar increase the score of the individual users will result.

[0221] 在操作915,基于操作910的分数调整而对结果重新排序并且将重排序后的结果表述给用户。 [0221] In operation 915, the reordering operation based on the score adjustment 910 results and reordering the result presentation to the user.

[0222] 根据本发明一个可替换实施例,与程序900类似的程序可以用于增加搜索建议的相关度。 [0222] According to an alternative embodiment of the invention, with the procedure analogous procedure 900 may be used to increase the correlation search suggestions.

[0223] 本地化榑素 [0223] Localized Bo Su

[0224] 用户信息的一种重要类型是位置。 [0224] An important type of information is the location of the user. 位置比其他人口统计信息具有优势,因为它不需要用户自己提供并且不管是谁在使用计算机都是保持真实的。 Position have an advantage over other demographic information, because it does not require users to provide their own and no matter who is using a computer is keeping true. [0225] 在本发明一个实施例中,用户信息包括用户的地理位置信息,并且通过扩展包括其查询和选择。 [0225] In one embodiment of the present invention, the user information including the user's location information, and by extension, including its query and selection. 所述地理位置信息可以包括经度和纬度以及城市、州名和国名。 The location information may include longitude and latitude as well as city, state and country names. 根据一个实施例,这些用户信息被用于基于用户的地理位置提供搜索结果。 According to one embodiment, the user information is used to provide search results based on the user's geographic location. 例如,输入查询“OSU”的美国用户可能表示“俄亥俄州立大学(Ohio State University) ”、“俄克拉荷马州立大学(Oklahoma State University),,或者“俄勒网州立大学(Oregon State University)”。在一个实施例中,所提供的搜索结果还与具有类似用户信息(例如类似位置)的用户关联。 For example, enter the query "OSU" the user may indicate that the United States "Ohio State University (Ohio State University)", "Oklahoma State University (Oklahoma State University) ,, or" Network Oregon State University (Oregon State University) " in one embodiment embodiment, the search results provided further with a similar user having associated user information (e.g. similar position).

[0226] 图10显示了根据本发明一个实施例的根据与选择相联系的位置而基于用户位置提供更相关的搜索结果的程序。 [0226] FIG. 10 shows a position associated with the selected location based on the user to provide more relevant search results with one embodiment of the procedure according to the present invention. 图10所示的程序1000开始于操作1005,其中已获取多于指定数量的选择的URL被分配有纬度-经度的“中心”和“影响范围”。 Routine shown in FIG. 10 1000 begins at operation 1005, which has acquired more than a specified number of the selected URL is assigned a latitude - "center" and "sphere of influence" longitude. 中心是到达各个记录的统计有效的用户选择的距离总和的某个函数最小化的位置。 Center is reaching a statistically significant function of the sum selected by the user from the respective recording positions is minimized. 例如,在一个实施例中,中心可以表示URL(具有本地倾向的URL)在现实世界中的具体(brick-and-mortar)位置以实现高度精确化。 For example, in one embodiment, it may represent the center URL (having a tendency to local URL) specific (brick-and-mortar) position in the real world to achieve a highly precise. 当不存在本地倾向时,中心位置具有很少或者根本没有意义。 When the local bias does not exist, the center position has little or no meaning. 例如,不具有现实世界具体位置的URL不会具有全国范围的本地倾向。 For example, the URL does not have the specific location of the real world does not have a local bias nationwide.

[0227] 影响范围是期望URL对用户高度具有吸引力的地理半径,在此之外期望不具有吸引力。 [0227] sphere of influence is expected geographic radius URL the user a highly attractive, beyond this expectation is not attractive. 半径越小,吸引力下降越陡或者吸引力越“本地化”。 The smaller the radius, the steeper less attractive or more attractive "Localization." 影响范围的半径与本地倾向的某种度量成反比。 The scope of the radius of the local bias is inversely proportional to some measure. 例如,高度本地化的URL例如www.canariesbaseball.com形成中心在Sioux Falls, SD(即Sioux Falls Canaries的家乡)的非常紧凑的影响范围。 For example, highly localized URL such as www.canariesbaseball.com formed in Sioux Falls, SD (ie, Sioux Falls Canaries home) very compact sphere of influence centers.

[0228] 在操作1010,影响范围根据统计数据而做出调整。 [0228] In operation 1010, the scope of the adjustments made according to the statistics. 例如,某些位置生成比其它位置更多的点击。 For example, some location other than the location to generate more clicks. 调整原始的点击数据以反映这种差异。 Click to adjust the raw data to reflect this difference.

[0229] 在操作1015,将用户的地理位置与预先计算的位置数据集合相比较以响应特定查询,并且基于用户的地理位置调整搜索结果得分。 [0229] In operation 1015, the user's geographic location is compared with the previously calculated position data set in response to a specific query, based on the user's location and the search results to adjust the score. 那些附近并且具有高度本地化的URL的分数被增加,而远距离的URL的分数被减少(不具有很大的本地化倾向的URL的分数保持不变)。 Those near and scores have highly localized URL is increased, while long-distance URL score is reduced (that does not have a great tendency to localize the URL score remained unchanged).

[0230] 在操作1020,对用户提供具有至少部分地基于用户的地理位置的搜索结果。 [0230] In operation 1020, provides the user with search results based at least in part of the user's geographic location. 这样允许用户接收初始的若干结果中的主观相关结果。 This allows several users to receive the result of the initial subjective correlation results. 例如,与现有技术相比,输入“州税务表格(state tax forms) ”的怀俄明州的用户更容易被提供怀俄明州的州税务表格。 For example, compared with the prior art, the input "state tax forms (state tax forms)" Wyoming users are more likely to be provided Wyoming state tax forms.

[0231] 用户本地化 [0231] User localized

[0232] 如上所述,本地化可以基于选择,或者可以根据可替换实施例而基于用户。 [0232] As described above, the localization can be based on the selection, or may be based on a user according to an alternative embodiment. 在本发明一个实施例中,基于用户的本地化允许流行的查询的结果通过管辖区域而缓存起来,然后基于位置提供给用户而不需要任何实时计算。 In one embodiment of the present invention, based on the result of the user to allow the localization of the query cache popular up by jurisdiction, and then provided to the user based on location without the need for any real-time calculations.

[0233] 图11显示了根据本发明一个实施例的基于用户位置对用户提供更相关的搜索结果的程序。 [0233] Figure 11 shows the position of the user based on the program to provide more relevant search results to a user in accordance with an embodiment of the present invention. 图11所示的程序1100开始于操作1105,其中确定对于给定查询的点击的指定部分的管辖区域,开始于最高级别的管辖区域。 Program 1100 shown in FIG. 11 begins at operation 1105, where jurisdiction is determined for a specified portion of clicks for a given query, starting with the highest level of jurisdictions. 例如,查询“car insurance”产生美国和英国站点的混合,位于英国的用户更多的点击英国站点,而位于美国的用户更多的点击美国站点。 For example, the query "car insurance" produce a mixed American and British sites, users in the UK more hits UK sites, and users in the United States of America more hits the site.

[0234] 在操作1110,对于点击的指定阈值部分的管辖区域的点击分数被增加,并且创建标记为该管辖区域的单独的结果列表。 [0234] In operation 1110, the score for click jurisdictions specified threshold portion of the clicks is increased, and creates a list of result flags for individual jurisdictions.

[0235] 在操作1115,在所述管辖区域之外的用户的点击分数被减小,产生了对于点击的指定阈值部分的管辖区域之外的用户的一般性列表。 [0235] In operation 1115, the user clicks outside the score jurisdiction is reduced, resulting in a general list for jurisdictions other than a specified threshold portion of the clicks of the user. 在可替换实施例中,基于用户与特定管辖区域的邻近度而创建实时混合列表。 In an alternative embodiment, mixing is created in real time based on the proximity of the user with a list of specific jurisdictions.

[0236] 在操作1116,如果没有针对点击的特定阈值部分的管辖区域,则在操作1120中基于用户管辖区域而提供结果。 [0236] In operation 1116, if not for a certain jurisdictions threshold portion of the clicks, then the jurisdiction based on the user operation in 1120 to provide a result.

[0237] 在操作1116,如果存在针对点击的特定阈值部分的管辖区域,则在操作1125中程序通过进行到更低级别的管辖区域而重新循环,并且由此到操作1105。 [0237] In operation 1116, if jurisdiction exists for a particular threshold portion of the clicks, then in operation 1125 the program proceeds to jurisdiction by a lower level and re-circulated, and thereby to operation 1105.

[0238] 在可替换实施例中,将位置与URL关联而不考虑查询。 [0238] In an alternative embodiment, the location associated with the URL without regard to the query. 这样具有有效的统计意义上的优点,因为可以聚集更多数据。 This has the advantage of effective statistical significance, because they can gather more data. 例如,考虑所有对任何查询选择特定州税务站点的用户的来源地,然后针对最主要的州内的用户对特定站点给予额外权重而不管涉及的查询。 For example, consider all options for the site-specific state tax inquiries source user, and then for the user in the most important states of a particular site and give extra weight regardless of the query involved. 潜在的缺陷在于,如果怀俄明州的用户输入“佛罗里达州税务表格”,如果怀俄明州税务站点在结果之中,则其得分会得到有效的而无根据的增加。 Potential drawback is that if the user enters Wyoming "Florida tax form" to increase if the tax sites in Wyoming results, its score will be valid without unfounded.

[0239] 一般件问是页 [0239] Generally members asked Page

[0240] 本发明的实施例提供了概念上组合和表述信息的方法和系统,其中使用用户响应和信息组合与表述的关联而确定信息的最优组织和表述。 Example [0240] The present invention provides a method and system combines the concepts and present information, in which user-related information and expressed in combination with the response to determine the optimal organization and presentation of information. 尽管以上针对若干示例实施例而描述,但是本发明的可替换实施例具有很多附加应用。 Although the above described embodiments for a number of exemplary embodiments, alternative embodiments of the present invention have many additional applications.

[0241] 本发明包括各种操作。 [0241] The present invention includes various operations. 很多方法以其最基本形式进行描述,但是可以对任何方法添加或者删除操作而不背离本发明的基本范围。 Many methods are described in their most basic form, but operations can be added or deleted without departing from the basic scope of the present invention to any process. 本发明的操作可以通过硬件执行或者可以如上所述的机器可执行的指令实现。 Instruction operation of the present invention may be performed by hardware or may be implemented as described above, machine-executable. 可替换的,这些步骤可以通过硬件和软件的结合而执行。 Alternatively, these steps may be performed by a combination of hardware and software. 本发明可以提供为计算机程序产品,可以包括存储了指令的机器可读介质,这些指令可以用于对计算机(或者其他电子设备)进行编程以执行根据本发明的如上所述的程序。 The present invention may be provided as a computer program product may include instructions stored in a machine-readable medium, these instructions may be used on a computer (or other electronic devices) to perform a procedure according to the invention as described above.

[0242] 图12为显示根据本发明一个实施例的可以用于参考图3所述的服务器DPS 320或者客户端DPS 305-308的数字处理系统的实施例。 [0242] FIG. 12 is a display device according to an embodiment of the present invention may be used in reference to the embodiment according to FIG DPS server 3320 or the client data processing system DPS of 305-308. 在本发明的可替换实施例中,处理系统1201可以为计算机或者机顶盒,包括与总线1207连接的处理器1203。 In an alternative embodiment of the present invention, the processing system 1201 may be a computer or a set-top box, includes a processor 1203 coupled to bus 1207.. 在一个实施例中,内存1205、存储单元1211、显示控制器1209、通信接口1213以及输入/输出控制器1215也连接到总线1207。 In one embodiment, memory 1205, storage 1211, display controller 1209, communications interface 1213, and an input / output controller 1215 is also connected to the bus 1207.

[0243] 处理系统1201通过通信接口1213连接到外部系统。 [0243] The processing system 1201 is connected to the external system 1213 through the communication interface. 通信接口1213可以包括模拟调制解调器、集成服务数字网络(ISDN)调制解调器、线缆调制解调器、数字用户专线(DSL)调制解调器、T-1线路接口、T-3线路接口、光载波接口(例如0C-3)、令牌环网接口、卫星发送接口、无线接口或者用于将设备连接到其他设备的其他接口。 Communication interface 1213 may include an analog modem, an integrated services digital network (ISDN) modem, a cable modem, digital subscriber line (DSL) modems, T-1 line interface, T-3 line interface, an optical carrier interface (e.g. 0C-3) , token ring interface, a satellite transmission interface, or a wireless interface for connecting the device to other devices of other interfaces. 通信接口1213还可以包括无线收发机或者无线电话信号等等。 Communication interface 1213 may further include a wireless transceiver or wireless telephone signals and the like.

[0244] 在本发明一个实施例中,在通信接口1213和云形符号1230之间接收/发送通信信号1225。 [0244] In one embodiment of the present invention, the communication interface between symbols 1213 and 1230 cloud shaped receiving / transmitting communications signal 1225. 在本发明一个实施例中,通信信号1225可以用于将处理系统1201连接到另一计算机系统、网络集线器、路由器等等。 In one embodiment of the present invention, the communication 1225 may be used to signal processing system 1201 is connected to another computer system, a network hub, router, and the like. 在本发明一个实施例中,通信信号1225为机器可读的介质,可以通过线路、线缆、光纤或者大气等等传输。 In one embodiment of the present invention, communication signal 1225 is a machine-readable medium may be transmitted through the line, cable, optical fiber, or the like atmosphere.

[0245] 在本发明一个实施例中,处理器1203可以为传统的微处理器,例如但不限于Intel奔腾系列处理器、Motorola系统微处理器等等。 [0245] In one embodiment of the present invention, the processor 1203 may be a conventional microprocessor, such as, but not limited to Intel Pentium processor family, the Motorola system, microprocessor or the like. 内存1205可以为机器可读介质,例如动态随机访问存储器(DRAM)并且可以包括静态随机访问存储器(SRAM)。 Memory 1205 may be a machine-readable medium, such as dynamic random access memory (DRAM) and may include static random access memory (SRAM). 显示控制器1209按照传统方式控制显示器1219,在本发明一个实施例中,显示器1219可以为阴极射线管(CRT)显示器、液晶显示器(LCD)、有源矩阵显示器、电视监视器等等。 The display controller 1209 controls the display 1219 in a conventional manner, in one embodiment of the present invention, the display 1219 may be a cathode ray tube (CRT) display, a liquid crystal display (LCD), active matrix display, a television monitor and the like. 输入/输出设备1217连接到输入/输出控制器1215,可以为键盘、磁盘驱动器、打印机、扫描仪以及其他输入和输出设备,包括鼠标、滚动球、触摸板等等。 Input / output apparatus 1217 connected to the input / output controller 1215 may be a keyboard, disk drive, printer, scanner and other input and output devices, including a mouse, a roller ball, a touchpad and the like.

[0246] 存储单元1211可以包括机器可读介质,例如但不限于硬盘、软盘、光盘、智能卡或者其他形式的数据存储单元。 [0246] The storage unit 1211 may include a machine-readable medium such as but not limited to a hard disk, floppy disk, smart card, or other form of data storage unit. 在本发明一个实施例中,存储单元1211可以包括可擦除介质、只读介质、可读/写介质等等。 In one embodiment of the present invention, the storage unit 1211 may include an erasable media, read-only media, readable / writable media and the like. 某些数据可以在计算机系统1201的软件执行过程中通过直接存储器访问程序而写入内存1205。 Some data may be written to the memory 1205 through a direct memory access 1201 software program during execution of the computer system. 应当理解,软件可以驻留在存储单元1211、内存1205中,或者可以通过调制解调器或者通信接口1213而发送或者接收。 It should be appreciated that software may reside in storage 1211, memory 1205 or may be transmitted or received via modem or communications interface 1213. 为了说明意图,术语“机器可读介质”应当认为是包括能够存储数据、信息或者对指令序列进行编码以通过处理器1203执行从而导致处理器1203执行本发明的方法的任何介质。 Is intended for purposes of illustration, the term "machine-readable medium" should be taken to include capable of storing data, information or encoding a sequence of instructions for causing the processor 1203 to perform the method of the present invention, any medium that is executed by processor 1203. 术语“机器可读介质”应当包括但不限于固态存储器、光盘和磁盘、载波信号等等。 The term "machine-readable medium" shall include but are not limited to, solid-state memories, optical and magnetic disks, carrier wave signals, etc.

[0247] 尽管参考若干实施例而描述了本发明,本领域技术人员可以理解,本发明并不限于所描述的实施例,而不是可以通过在所附权利要求书实质和范围之内的修改和变化而实施。 [0247] Although described with reference to several embodiments of the present invention, those skilled in the art will appreciate, the present invention is not limited to the embodiments described, but instead can be modified within the scope by the spirit and scope of the appended claims and changes implemented. 因此本说明书应被认为是示例性的而非限制性的。 The description is thus to be regarded as illustrative rather than restrictive.

Claims (9)

  1. 1.一种概念上组织和表述信息的方法,该方法包括: 由服务器数字处理系统(DPS)接收并记录搜索引擎查询,该搜索引擎查询经由客户端DPS从多个独立用户接收; 响应于所述搜索引擎查询,由所述服务器DPS经由所述客户端DPS向所述独立用户提供一个或多个搜索结果,其中每个独立用户能够选择所提供的搜索结果中的至少一个搜索结果; 由所述服务器DPS接收并记录所述独立用户的多个搜索结果选择; 由所述服务器DPS确定在由独立用户选择搜索结果的搜索会话期间执行的互联网活动级别; 当在所述搜索会话期间执行的互联网活动级别超过活动级别阈值时,由所述服务器DPS将权重分派给所选择的搜索结果; 由所述服务器DPS对所加权的搜索结果选择与所记录的搜索引擎查询进行关联; 由所述服务器DPS经由所述客户端DPS接收来自用户的所述搜索引擎查询; 由所 1. A method for the organization and presentation of conceptual information, the method comprising: receiving by a server a digital processing system (DPS) and records search-engine queries, the search engine queries received from a plurality of separate user via the DPS client; in response to the said search-engine queries, provided by the server via the client DPS DPS user to separate one or more of the search results, where each individual user to select at least one search result provided by the search results; manufactured by the DPS said server receives and records the user independently selected plurality of search results; the level of Internet activity performed during a search session search result selected by the user is determined by the server independent DPS; when performed during the Internet search session when the activity level exceeds a threshold activity level, by the server DPS weights assigned to the selected search result; for weighted search select search engine query associated recorded by the server DPS; by the server DPS the DPS client received via the search engine query from a user; manufactured by the 服务器DPS确定所述加权的搜索结果与所述搜索引擎查询的相关度,其中确定所述加权的搜索结果的相关度包括: 由所述服务器DPS确定选择所加权的搜索结果的独立用户数量以及所述独立用户查看所加权的搜索结果的持续时间中的至少一者; 由所述服务器DPS比较选择所加权的搜索结果的独立用户数量以及所加权的搜索结果被查看的持续时间中的至少一者与所述独立用户选择的所述多个搜索结果中的剩余搜索结果的相关度;` 由所述服务器DPS基于所加权的搜索结果与所述搜索引擎查询的相关度确定提供哪个所加权的搜索结果给独立用户;以及响应于所述搜索引擎查询,由所述服务器DPS经由所述客户端DPS 向所述独立用户提供与所述搜索引擎查询关联的所加权的搜索结果。 Correlation server DPS determines the weighting of the search results with a search engine query, wherein determining the correlation weighting of search results comprising: independently determining the number of users selected by the server DPS weighted search results as well as the duration of said individual user to view the search results weighted by at least one; is independently the number of users selected by the server DPS weighted comparison search results and the duration of the search results are weighted in view of at least one of the remaining affinity of the plurality of search results to the search result selected by the individual user; `weighted searches which provided by the server is determined based on the correlation of the weighted DPS search results of the search engine query results for individual users; in response to the search query, search results provide weighting associated with the search-engine queries to the individual user by the server via the client DPS DPS.
  2. 2.根据权利要求1所述的方法,其中由所述独立用户选择多个搜索结果,该方法还包括: 由所述服务器DPS确定所选择的搜索结果与所述搜索引擎查询的相关度;以及由所述服务器DPS基于所选择的搜索结果与所述搜索引擎查询的相关度而确定提供哪个所选择的搜索结果给所述独立用户。 2. The method according to claim 1, wherein a plurality of said independent user selection search results, the method further comprising: determining correlation by the server DPS the selected search result and the search engine queries; and by the DPS server based on correlation with the selected search results the search engine queries and provide search results to determine which choice to the individual user.
  3. 3.根据权利要求2所述的方法,其中确定所选择的搜索结果与所述搜索引擎查询的相关度包括: 由所述服务器DPS确定选择所述搜索结果的独立用户数量以及所述独立用户查看所选择的搜索结果的持续时间中的至少一者;以及由所述服务器DPS将独立用户选择数量以及用于所述搜索结果的持续时间中的至少一者与由所述独立用户选择的多个搜索结果中的剩余搜索结果的相关度进行比较。 3. The method according to claim 2, wherein determining the selected search result relevance of the search engine query comprises: determining by the server DPS selects the search result the number of users and independent of the individual user to view the duration of the search results in the selected at least one; and a plurality of said at least one server by the DPS and the number of independent user selection for the duration of the search results selected by the user independent the remaining search results correlation search results are compared.
  4. 4.根据权利要求1所述的方法,该方法还包括: 由所述服务器DPS确定所述搜索引擎查询的特性;以及由所述服务器DPS将所加权的搜索结果选择与所确定的所述搜索引擎查询的特性相关联。 And the search by the server DPS The choice weighted with the determined search result; determining a characteristic of the query by the search engine server DPS: 4. The method according to claim 1, further comprising characteristics associated engine queries.
  5. 5.根据权利要求4所述的方法,该方法还包括: 由所述服务器DPS将与所确定的所述搜索查询的特性关联的一个或多个所加权的搜索引擎结果提供给所述独立用户。 The method according to claim 4, further comprising: one or more characteristics associated with the determined by the server with the search query DPS weighted search engine result to said user independent .
  6. 6.根据权利要求4所述的方法,其中所述搜索引擎查询包括一个或多个词并且所述搜索引擎查询的特性包括搜索引擎查询中的词的拼写或常见误拼写、搜索引擎查询中的词的同义以及搜索引擎查询的词的等价措辞中的至少一者。 6. The method as claimed in claim 4, wherein the search query comprises one or more words and the characteristics of the search engine query comprises a search engine query spelling of words or common misspelled search engine query equivalent wording synonyms and word search engine query words at least one.
  7. 7.根据权利要求4所述的方法,其中所述搜索引擎查询中的词具有不同的意义,该方法还包括: 由所述服务器DPS将基于所述词的更加流行的意义的所加权的搜索结果提供给所述独立用户。 7. The method according to claim 4, wherein the search engine query words have different meanings, the method further comprises: by the server DPS based on more popular sense of the word weighted search the results are provided to the individual user.
  8. 8.根据权利要求1所述的方法,其中所述所加权的搜索结果是URL。 8. The method according to claim 1, wherein the search results are weighted URL.
  9. 9.一种概念上组织和表述信息的装置,该装置包括: 第一装置,用于由服务器数字处理系统(DPS)接收并记录搜索引擎查询,该搜索引擎查询经由客户端DPS从多个独立用户接收; 第二装置,用于响应于所述搜索引擎查询,由所述服务器DPS经由所述客户端DPS向所述独立用户提供一个或多个搜索结果,其中每个独立用户能够选择所提供的搜索结果中的至少一个搜索结果; 第三装置,用于由所述服务器DPS接收并记录所述独立用户的多个搜索结果选择; 第四装置,用于由所述服务器DPS确定在由独立用户选择搜索结果的搜索会话期间执行的互联网活动级别; 第五装置,用于当在所述搜索会话期间执行的互联网活动级别超过活动级别阈值时,由所述服务器DPS将权重分派给所选择的搜索结果; 第六装置,用于由所述服务器DPS对所加权的搜索结果选择与所记录的搜 9. The apparatus of information on a concept organization and presentation, the apparatus comprising: a first means for receiving a server digital processing system (DPS) and records search-engine queries, the search engine queries from clients via a plurality of independent DPS receiving a user; a second means, in response to the search query, providing one or more search results to the individual user by the server via the client DPS DPS, wherein each individual user can select the provided the at least one search result in the search results; a third means for receiving said plurality of search server DPS and recording results of said independent user selection; a fourth means for determining by the server DPS by independent Internet activities performed during a search session level user selects a search result; fifth means for, when the Internet activities performed during the search session activity level exceeds a threshold level, by the server DPS weights assigned to the selected search result; sixth means for searching for the search by the server DPS selects the weighting of the recorded results 引擎查询进行关联; 第七装置,用于由所述服务器DPS经由所述客户端DPS接收来自用户的所述搜索引擎查询; 第八装置,用于由所述服务器DPS确定所述加权的搜索结果与所述搜索引擎查询的相关度,其中确定所述加权的搜索结果的相关度包括: 由所述服务器DPS确定选择所加权的搜索结果的独立用户数量以及所述独立用户查看所加权的搜索结果的持续时间中的至少一者; 由所述服务器DPS比较选择所加权的搜索结果的独立用户数量以及所加权的搜索结果被查看的持续时间中的至少一者与所述独立用户选择的所述多个搜索结果中的剩余搜索结果的相关度; 由所述服务器DPS基于所加权的搜索结果与所述搜索引擎查询的相关度确定提供哪个所加权的搜索结果给独立用户;以及响应于所述搜索引擎查询,由所述服务器DPS经由所述客户端DPS向所述独立用户提供与 Associated query engine; a seventh means for receiving the search engine query from the user by the server via the client DPS DPS; eighth means for determining said weighted by the server DPS search results correlation with the search engine query, wherein determining the weighting of the correlation search results comprising: a number of users and independent of the individual user to view the search results is determined weighted selected by the server DPS weighted search results duration of at least one; is independently the number of users selected by the server DPS weighted comparison search results and the duration of the search results are weighted in view of at least one of said independent user selection of the the remaining affinity of the plurality of search results in search results; weighted determining which search result to the user by the server DPS independently based on the weighted relevance of search results for the search engine queries; and in response to the search engine query, to provide the user with independent by the server via the client DPS DPS 所述搜索引擎查询关联的所加权的搜索结果。 The search engine query search results associated with the weights.
CN 201110282837 2003-12-08 2004-12-07 Conceptive method and system for organizing and expressing information CN102354313B (en)

Priority Applications (11)

Application Number Priority Date Filing Date Title
US52813903 true 2003-12-08 2003-12-08
US60/528,139 2003-12-08
US10853552 US7181447B2 (en) 2003-12-08 2004-05-24 Methods and systems for conceptually organizing and presenting information
US10/853,552 2004-05-24
US10/853,860 2004-05-25
US10853860 US7451131B2 (en) 2003-12-08 2004-05-25 Methods and systems for providing a response to a query
US10/917,721 2004-08-12
US10917721 US7739274B2 (en) 2003-12-08 2004-08-12 Methods and systems for providing a response to a query
US10/944,251 2004-09-16
US10944251 US7152061B2 (en) 2003-12-08 2004-09-16 Methods and systems for providing a response to a query
CN200480035838.92004.12.07 2004-12-07

Publications (2)

Publication Number Publication Date
CN102354313A true CN102354313A (en) 2012-02-15
CN102354313B true CN102354313B (en) 2014-06-18

Family

ID=45577878

Family Applications (1)

Application Number Title Priority Date Filing Date
CN 201110282837 CN102354313B (en) 2003-12-08 2004-12-07 Conceptive method and system for organizing and expressing information

Country Status (1)

Country Link
CN (1) CN102354313B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103729499B (en) * 2013-12-12 2017-01-11 深圳先进技术研究院 System and method for index calculation based on popular gathering area of ​​public transit data

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2001018685A2 (en) 1999-09-03 2001-03-15 Lewis, Robert Improved method, system, and architecture for information display and organization
US6446035B1 (en) 1999-05-05 2002-09-03 Xerox Corporation Finding groups of people based on linguistically analyzable content of resources accessed
CN1389811A (en) 2002-02-06 2003-01-08 北京造极人工智能技术有限公司 Intelligent search method of search engine
US6546388B1 (en) 2000-01-14 2003-04-08 International Business Machines Corporation Metadata search results ranking system

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6446035B1 (en) 1999-05-05 2002-09-03 Xerox Corporation Finding groups of people based on linguistically analyzable content of resources accessed
WO2001018685A2 (en) 1999-09-03 2001-03-15 Lewis, Robert Improved method, system, and architecture for information display and organization
US6546388B1 (en) 2000-01-14 2003-04-08 International Business Machines Corporation Metadata search results ranking system
CN1389811A (en) 2002-02-06 2003-01-08 北京造极人工智能技术有限公司 Intelligent search method of search engine

Also Published As

Publication number Publication date Type
CN102354313A (en) 2012-02-15 application

Similar Documents

Publication Publication Date Title
US6353822B1 (en) Program-listing appendix
US7089236B1 (en) Search engine interface
Sugiura et al. Query routing for web search engines: Architecture and experiments
US7257589B1 (en) Techniques for targeting information to users
US7020679B2 (en) Two-level internet search service system
US7523096B2 (en) Methods and systems for personalized network searching
US7680775B2 (en) Methods and systems for generating query and result-based relevance indexes
US20060271531A1 (en) Scoring local search results based on location prominence
US20050165753A1 (en) Building and using subwebs for focused search
US20020099731A1 (en) Grouping multimedia and streaming media search results
US20050091209A1 (en) Relevance ranking of spatially coded documents
US7809716B2 (en) Method and apparatus for establishing relationship between documents
US20070250501A1 (en) Search result delivery engine
US6763362B2 (en) Method and system for updating a search engine
US20090164929A1 (en) Customizing Search Results
US7571157B2 (en) Filtering search results
US6353813B1 (en) Method and apparatus, using attribute set harmonization and default attribute values, for matching entities and predicting an attribute of an entity
US6581065B1 (en) Dynamic insertion and updating of hypertext links for internet servers
US20120016875A1 (en) Personalized data search utilizing social activities
US8145703B2 (en) User interface and method in a local search system with related search results
US20030220913A1 (en) Techniques for personalized and adaptive search services
US20080091670A1 (en) Search phrase refinement by search term replacement
US6748385B1 (en) Dynamic insertion and updating of hypertext links for internet servers
US7647306B2 (en) Using community annotations as anchortext
US20070203891A1 (en) Providing and using search index enabling searching based on a targeted content of documents

Legal Events

Date Code Title Description
C06 Publication
C10 Request of examination as to substance
C14 Granted