TWI476611B - Search results generation method and information search system - Google Patents

Publication number
TWI476611B
Authority
TW
Taiwan
Prior art keywords
matching information
user feedback
category
information
matching
Application number
TW099100274A
Other languages
Chinese (zh)
Other versions
TW201124861A (en
Original Assignee
Alibaba Group Holding Ltd
Application filed by Alibaba Group Holding Ltd
Priority to TW099100274A
Publication of TW201124861A
Application granted
Publication of TWI476611B

Description

Search result generation method and information search system

The present application relates to the field of computer applications, and in particular to a search result generation method and an information search system.

An information search system provides users with information retrieval services. The search engines commonly used on the Internet are one example: as search systems applied to the Internet, they have become one of the indispensable tools for users going online. From the user's point of view, a search engine generally provides a page containing a search box. The user enters keywords or other search conditions in the search box and submits them to the search engine through a browser, and the search engine returns information that matches the keywords the user entered.

For a single user search request (for example, the search keywords the user enters), a search engine can often retrieve many pieces of matching information, anywhere from dozens to tens of thousands. Users, however, tend to pay attention only to the information ranked near the top of the search results. How this information is sorted when the search engine presents its results is therefore particularly important, and whether the ordering is reasonable directly affects the user experience.

When sorting information, a search engine weighs a variety of factors, which may include the information source, information credibility, user feedback, and so on. User feedback is an important factor affecting the ranking of search results. For example, if 80% of users click the official homepage of China Central Television when the search keyword is “China Central Television”, then, from the perspective of user feedback alone, the search engine has reason to rank that official homepage first in the search results for the keyword “China Central Television”.

To achieve this effect, a prior-art search engine counts the amount of user feedback for each piece of matching information corresponding to the search keywords, and generates the search results presented to the user in descending order of user feedback amount. By studying the prior art, the inventors found the following problems with the existing search result generation method. For newly published information, the initial feedback amount is 0 (or very low), so it ranks low; because it ranks low, users rarely notice it, and so its ranking can never improve. Conversely, individual users can quickly change the feedback amount through cheating (for example, fraudulent clicks), so that the information they publish ranks high in the search results, harming the legitimate interests of others. From the user's point of view, then, the ordering of search results generated by the prior art is unreasonable in places, which degrades the user experience.

To solve the above technical problems, the present application provides a search result generation method and an information search system that can present a more reasonable ordering of matching information to users and improve the user experience. The technical solution is as follows. The present application provides a search result generation method, comprising: an information search system receiving a search request and retrieving the pieces of matching information that match the search request; querying the user feedback amount of each piece of matching information, and further calculating the total user feedback amount of the category to which each piece of matching information belongs; and sorting the pieces of matching information according to the total user feedback amounts of the categories to which they belong, to generate search results.

The present application further provides an information search system, comprising: an information retrieval unit, configured to receive a search request and retrieve the pieces of matching information that match the search request; a user feedback amount calculation unit, configured to query the user feedback amount of each piece of matching information and further calculate the total user feedback amount of the matching information in each category; and a result generation unit, configured to sort the pieces of matching information according to the total user feedback amounts of the categories to which they belong, to generate search results.

Compared with the prior art, the technical solution provided by the embodiments of the present application sorts not by the user feedback amount of an individual piece of information but by the total user feedback amount of the category to which each piece of information belongs. Thus, even if a newly published piece of information has very little user feedback, it still has a chance of ranking relatively high if its category receives a lot of user attention. Conversely, an increase in the user feedback amount of a single piece of information does not directly raise the ranking of that piece of information; it raises the ranking of the category to which it belongs. This effectively reduces the influence of cheating, such as fraudulent clicks, on the ordering of search results.

First, a search result generation method according to an embodiment of the present application is described, comprising: an information search system receiving a search request and retrieving the pieces of matching information that match the search request; querying the user feedback amount of each piece of matching information, and further calculating the total user feedback amount of the category to which each piece of matching information belongs; and sorting the pieces of matching information according to the total user feedback amounts of the categories to which they belong, to generate search results.

To help those skilled in the art better understand the technical solutions of the present application, the technical solutions in the embodiments are described clearly and completely below with reference to the drawings in the embodiments. Evidently, the described embodiments are only some, not all, of the embodiments of the present application. All other embodiments obtained by a person of ordinary skill in the art based on the embodiments of the present application without creative effort shall fall within the scope of protection of the present application.

The technical solution provided by the present application is described in detail below, taking a web search application as an example. FIG. 1 is a flowchart of a search result generation method according to an embodiment of the present application, which includes the following steps:

S101. The search engine receives a search request and retrieves the pieces of matching information that match the search request. When users need to search for information on the network, they enter one or more search conditions. The most commonly used search condition is a search keyword; depending on the specific search scenario, some search engines also support other types of search conditions, such as information publication time and information attributes. In the embodiments of the present application, the various search conditions are collectively referred to as a search request. After receiving the search request, the search engine retrieves the information that matches it. The type of information retrieved varies with the search scenario: in web search, the retrieved information is web pages; in e-commerce search, products; in literature search, journals or papers; and so on. Retrieving information that matches a search request is done in the same way as in the prior art and is not described in detail here.

S102. Query the user feedback amount of each piece of matching information, and further calculate the total user feedback amount of the category to which each piece of matching information belongs. For one search request, a search engine can often retrieve many pieces of matching information, and it needs to filter and sort them according to certain principles to make them easier for users to read.

User feedback is an important factor affecting the ranking of search results. The basic principle is to rank the information users care about most at the top of the search results. In the embodiments of the present application, the user feedback amount serves as the parameter reflecting how much attention users pay to a piece of information. For example, the number of times a web page link is clicked or bookmarked directly reflects users' attention to that page, so for web pages, figures such as link click count and link bookmark count can serve as the user feedback amount. In e-commerce, the user feedback amount of a product may include the product's transaction volume, transaction amount, number of inquiries, number of times the product information is bookmarked, and so on. Those skilled in the art will understand that a single kind of figure may be chosen to represent the user feedback amount, or several kinds may be combined, for example:

User feedback amount = product transaction volume × 0.3 + number of times bookmarked × 2,

User feedback amount = product transaction amount × number of times the product information is bookmarked + log(product transaction volume),

and so on.
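
The two example formulas above can be written out as a minimal sketch. The function and parameter names are hypothetical, and the base of the logarithm is not stated in the text; the natural logarithm is assumed here.

```python
import math


def feedback_linear(volume: float, bookmarks: float) -> float:
    """First example: transaction volume x 0.3 + bookmark count x 2."""
    return volume * 0.3 + bookmarks * 2


def feedback_mixed(amount: float, bookmarks: float, volume: float) -> float:
    """Second example: transaction amount x bookmark count + log(volume).

    Assumption: natural logarithm. The volume must be positive,
    otherwise the logarithm is undefined.
    """
    return amount * bookmarks + math.log(volume)
```

Either function yields a single number per item, which is all the later steps need as the user feedback amount.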

User feedback amounts are generally recorded in a user feedback log, and the search engine obtains the user feedback amount of each piece of matching information by reading that log. The search engine may choose to read only the log entries for a certain period (for example, the last week or the last month) in order to adapt to users' changing interests.
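
Reading the log for a limited period might look like the following sketch. The log format is an assumption made for illustration: a list of (item_id, timestamp) pairs with one entry per feedback event. The text does not specify how the log is stored.

```python
from collections import Counter
from datetime import datetime, timedelta


def feedback_counts(log, now, window_days=7):
    """Count feedback events per item within the last `window_days` days.

    `log` is a hypothetical list of (item_id, timestamp) pairs.
    """
    cutoff = now - timedelta(days=window_days)
    counts = Counter()
    for item_id, timestamp in log:
        # Keep only events that fall inside the time window.
        if cutoff <= timestamp <= now:
            counts[item_id] += 1
    return counts
```

Shortening or lengthening `window_days` trades responsiveness to new interests against stability of the counts.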

Suppose that, for a certain search request, the search engine retrieves four pieces of matching information and reads the user feedback log; the user feedback amount of each piece is shown in Table 1:

As Table 1 shows, the user feedback amounts of the four pieces of matching information are ordered as follows: matching information 1 > matching information 3 > matching information 2 > matching information 4. Under the prior-art scheme, the search results would be generated in this order and presented to the user. In the technical solution of the present application, the user feedback amounts of the pieces of matching information are processed further.

Much of the information on the Internet is published under particular categories. On a portal site, for example, page types may include news, sports, entertainment, finance, and so on; on an e-commerce site, product categories include home furnishings, electrical appliances, clothing, food, and so on. Each piece of matching information retrieved by the search engine therefore corresponds to a category to which it belongs. In the embodiments of the present application, after the user feedback amount of each piece of matching information is obtained, the category to which each piece belongs is looked up first. For a web page, the category can be determined from the URL path: for example, a page whose URL path contains the field “news” is a news page, a page whose URL path contains the field “sports” is a sports page, and so on. For a product, the category can be obtained by querying its product information directly.
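
Determining a page's category from its URL path can be sketched as below. The segment-to-category mapping, the fallback value, and the example URLs are all hypothetical; a real site would supply its own mapping.

```python
from urllib.parse import urlparse

# Hypothetical mapping from URL path segments to page categories.
SEGMENT_TO_CATEGORY = {"news": "news", "sports": "sports"}


def category_from_url(url, default="uncategorized"):
    """Return the category of the first recognized path segment."""
    segments = [s for s in urlparse(url).path.split("/") if s]
    for segment in segments:
        category = SEGMENT_TO_CATEGORY.get(segment.lower())
        if category is not None:
            return category
    return default
```

For instance, `category_from_url("http://example.com/news/2010/item.html")` returns `"news"` under this mapping.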

The total user feedback amount of the matching information in each category is then calculated. In the example of Table 1, matching information 1 and matching information 4 belong to “Type A”, and matching information 2 and matching information 3 belong to “Type B”, so the total user feedback amount of “Type A” is 100 + 5 = 105 and that of “Type B” is 30 + 40 = 70, as shown in Table 2:
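
The per-category totals can be reproduced with a short sketch. The individual values (100, 30, 40, 5) are taken from the sums and the ordering quoted in the text; the dictionary layout and names are illustrative assumptions.

```python
from collections import defaultdict

# Individual feedback amounts and categories, as implied by the text.
feedback = {1: 100, 2: 30, 3: 40, 4: 5}
category = {1: "A", 2: "B", 3: "B", 4: "A"}


def category_totals(feedback, category):
    """Sum the individual feedback amounts within each category."""
    totals = defaultdict(int)
    for item, amount in feedback.items():
        totals[category[item]] += amount
    return dict(totals)


totals = category_totals(feedback, category)
```

Here `totals` comes out as 105 for Type A and 70 for Type B, matching the arithmetic above.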

S103. Sort the pieces of matching information according to the total user feedback amounts of the categories to which they belong, and generate search results.

As Table 2 shows, “Type A” receives more user attention than “Type B”. Therefore, considering only the total user feedback amount, information belonging to “Type A” should be ranked ahead of information belonging to “Type B”.

Matching information 4 has a very small user feedback amount, so under the prior-art scheme it would normally have difficulty ranking high. The technical solution of the present application, however, does not sort by the user feedback amount of an individual piece of information. Although the user feedback amount of matching information 4 is small, it belongs to a type that receives more attention, so in the search results generated by the present technical solution, matching information 4 is ranked ahead of matching information 2 and matching information 3 (or, put another way, has a better chance of being ranked ahead of them). In this way, even newly published information has a better chance of ranking high in the search results, which better meets users' actual needs.

Building on Table 1, suppose someone newly publishes information 5, which matches the search request (assume information 5 belongs to Type C), and uses fraudulent clicks or similar means to push its user feedback amount to 50 within a short time. Under the prior-art scheme, matching information 5 would go straight to second place in the search results, harming the legitimate interests of other publishers. Under the technical solution of the present application, however, the total user feedback amount of Type C, to which it belongs, is lower than those of Type A and Type B, so even with cheating, matching information 5 cannot achieve a high ranking. The above example is only illustrative. In practice there are more categories and far more pieces of matching information retrieved; although an individual user can cheat to raise the feedback amount of one or a few pieces of information they have published, they cannot greatly affect the total user feedback amount of the category, so the influence of cheating on the ordering of search results is effectively reduced.
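
The contrast between the two orderings in this scenario can be sketched as follows. The data come from the example in the text (four original items plus information 5 with a feedback amount of 50 in Type C); the function names are hypothetical, and the tie-breaking rule is an assumption, since the text does not say how items with equal category totals are ordered.

```python
def rank_by_individual(feedback):
    """Prior-art ordering: descending individual feedback amount."""
    return sorted(feedback, key=lambda item: -feedback[item])


def rank_by_category(feedback, category):
    """Ordering by descending category total.

    Assumption: items with the same category total keep their input
    order (Python's sort is stable).
    """
    totals = {}
    for item, amount in feedback.items():
        totals[category[item]] = totals.get(category[item], 0) + amount
    return sorted(feedback, key=lambda item: -totals[category[item]])


feedback = {1: 100, 2: 30, 3: 40, 4: 5, 5: 50}
category = {1: "A", 2: "B", 3: "B", 4: "A", 5: "C"}
prior_art = rank_by_individual(feedback)        # item 5 lands in second place
proposed = rank_by_category(feedback, category)  # item 5 lands last
```

In the category-based ordering, item 4 also moves ahead of items 2 and 3, which is the effect described for newly published information.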

It should be noted that the embodiments above rank matching information from the perspective of user feedback amount alone. In practice, a search engine may weigh multiple factors when generating search results. Typically, each factor is treated as a weighting parameter and assigned a weighting coefficient according to its importance; a weighted average of the weighting parameters yields a ranking score, and the search engine finally orders the pieces of matching information in the search results according to their ranking scores.
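
A weighted-average ranking score of this kind can be sketched as below. The factor names and coefficient values used in the example call are hypothetical; the text names the factors only in general terms.

```python
def ranking_score(factors, coefficients):
    """Weighted average of factor values.

    `factors` maps a factor name to its value for one item, and
    `coefficients` maps the same names to weighting coefficients.
    """
    total_weight = sum(coefficients.values())
    if total_weight <= 0:
        raise ValueError("coefficients must sum to a positive number")
    weighted_sum = sum(factors[name] * coefficients[name] for name in coefficients)
    return weighted_sum / total_weight


# Hypothetical example: credibility counts once, user feedback three times.
score = ranking_score({"credibility": 0.5, "feedback": 1.0},
                      {"credibility": 1, "feedback": 3})
```

Items would then be ordered by descending `score`.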

Under the prior-art scheme, the larger the user feedback amount of a single piece of matching information, the larger the weighted value it obtains. Under the technical solution of the present application, the larger the total user feedback amount of the category to which a piece of matching information belongs, the larger the weighted value it obtains. According to the results in Table 2, the weighted values of matching information 1 and matching information 4 for the user feedback parameter are larger than those of matching information 2 and matching information 3. Compared with the prior art, matching information 1 will have a greater chance of ranking high.

Specifically, the weighted value of the matching information belonging to each category can be calculated from the ratio of the categories' total user feedback amounts. Taking Table 2 as an example, the total for Type A is 105 and the total for Type B is 70, a ratio of 3:2. The ratio can be normalized further. For example, dividing each category's total by the sum of all categories' totals gives 0.6:0.4, so 0.6 and 0.4 are the weighted values that matching information in Type A and Type B, respectively, obtains for the user feedback parameter. Alternatively, dividing each category's total by the largest single-category total gives 1:0.67, so 1 and 0.67 are the respective weighted values.
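
Both normalizations can be checked with a short sketch using the Table 2 totals quoted above. The function names are hypothetical.

```python
def weights_by_share(totals):
    """Divide each category total by the sum of all category totals."""
    grand_total = sum(totals.values())
    return {name: total / grand_total for name, total in totals.items()}


def weights_by_max(totals):
    """Divide each category total by the largest category total."""
    largest = max(totals.values())
    return {name: total / largest for name, total in totals.items()}


totals = {"A": 105, "B": 70}
share = weights_by_share(totals)   # A: 105/175, B: 70/175
scaled = weights_by_max(totals)    # A: 105/105, B: 70/105
```

The first variant makes the weights sum to 1; the second pins the top category at 1 and expresses the others relative to it.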

The search engine may also sort the categories by their total user feedback amounts and derive the weighted value of the matching information in each category from the sorted order, as shown in Table 3:

As can be seen, the weighted value finally obtained by each category depends only on that category's position in the ordering of total user feedback amounts, not on the specific total. That is, information belonging to Type E obtains a larger weighted value, and thus a higher ranking, only when the total user feedback amount of Type E exceeds 500. This further reduces the influence of cheating on the ordering of search results.
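
A rank-based weighting can be sketched as follows. The mapping from position to weight, (n - position) / n, is an assumption chosen for illustration, and the category totals in the example are hypothetical values, not the contents of Table 3.

```python
def weights_by_rank(totals):
    """Assign weights from the position in the descending order of totals.

    Assumption: the category at position p (0-based) among n categories
    gets the weight (n - p) / n. Only the order matters, not the totals.
    """
    ordered = sorted(totals, key=totals.get, reverse=True)
    n = len(ordered)
    return {name: (n - position) / n for position, name in enumerate(ordered)}


# Hypothetical totals: E sits just below a category whose total is 500.
before = weights_by_rank({"D": 500, "E": 200, "F": 100})
inflated = weights_by_rank({"D": 500, "E": 499, "F": 100})  # order unchanged
overtaken = weights_by_rank({"D": 500, "E": 501, "F": 100})  # E now first
```

Inflating E's total from 200 to 499 leaves every weight unchanged; only passing 500 changes E's weight.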

Two specific examples of calculating the weighted value have been described above. Following the principle that “the larger the total user feedback amount of the category to which a piece of matching information belongs, the larger the weighted value it obtains”, those skilled in the art may adopt other technical means to calculate the weighted value according to specific needs; these also fall within the scope of protection of the present application.

In practice, multiple factors can be combined not only by weighting but also in a tiered manner: after the matching information is sorted a first time according to one or more factors, the result of the first sort is sorted a second time according to one or more other factors.

For the technical solution proposed in the present application, those skilled in the art will readily see that, after the pieces of matching information are sorted according to the total user feedback amounts of their categories, the matching information within each category can be further sorted according to the user feedback amount of each piece.

Taking the data in Table 1 as an example, applying the technical solution of the present application gives “information belonging to Type A should be ranked ahead of information belonging to Type B”, that is, matching information 1 and 4 should be ranked ahead of matching information 2 and 3. Then, sorting the matching information within each category a second time by individual user feedback amount gives: matching information 1 ahead of matching information 4, and matching information 3 ahead of matching information 2. The final ordering is:

matching information 1, matching information 4, matching information 3, matching information 2.

This scheme thus ensures that the types receiving the most attention are ranked first, while, within the same type, the matching information is further sorted by individual user feedback amount. Of course, those skilled in the art will understand that, after the pieces of matching information are sorted according to the total user feedback amounts of their categories, the matching information within each category may instead be sorted according to other factors (one or more). If necessary, a third sort, a fourth sort, and so on may be performed according to still other factors; these are not enumerated here.
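
The two-level ordering (category total first, individual feedback amount second) can be sketched with a single composite sort key. The data follow the Table 1 example as quoted in the text; the function name is hypothetical, and using the category name to separate categories with equal totals is an assumption.

```python
def two_level_sort(feedback, category):
    """Sort by descending category total, then by descending item feedback."""
    totals = {}
    for item, amount in feedback.items():
        totals[category[item]] = totals.get(category[item], 0) + amount
    return sorted(
        feedback,
        key=lambda item: (
            -totals[category[item]],  # first level: category total
            category[item],           # keeps equal-total categories apart
            -feedback[item],          # second level: individual feedback
        ),
    )


feedback = {1: 100, 2: 30, 3: 40, 4: 5}
category = {1: "A", 2: "B", 3: "B", 4: "A"}
order = two_level_sort(feedback, category)
```

Further sorting levels could be added by appending more components to the key tuple.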

The search result generation method of the present application is described below with reference to several specific application examples.

For example, in a web search application, a search for the keyword “Red Cliff” (赤壁) retrieves many matching web pages belonging to different categories. Reading the user feedback log for a certain period and calculating the total user feedback amount of each category by page category gives the results shown in Table 4:

“Red Cliff” is the site of a famous ancient battle, so naturally many matching pages belong to the “military” and “history” categories. “Red Cliff” also often appears in films, television, and game plots, so many matching pages belong to the “entertainment” and “games” categories. In addition, “Red Cliff” is a tourist attraction, so some matching pages belong to the “travel” category.

Because the user feedback amount is obtained by reading the user feedback log for a certain period, it reflects what users were focused on during that period. For example, the film Red Cliff attracted a great deal of attention, so for some time around its release, matching pages in the “entertainment” category outnumber those in other categories and receive a very high number of user clicks, as shown in Table 4. Under the technical solution of the present application, if a user searches for the keyword “Red Cliff”, the matching pages in the “entertainment” category obtain a higher weighted value regardless of how many clicks each individual page has, and therefore appear nearer the top of the search results, where users can easily click through to them.

The technical solution provided by the present application also applies to e-commerce search. For example, when a user searches for products with the keyword “notebook”, the products retrieved may include notebook computers, notebook batteries, notebook coolers, and even paper notebooks for writing. Under the e-commerce site's product categories, notebook computers may belong to the “notebook computers” category, notebook batteries and notebook coolers to the “notebook accessories” category, and paper notebooks to the “stationery” or “office supplies” category. Statistics on user feedback may show that, at present, most users who search for “notebook” are really interested in notebook computers. Under the technical solution of the present application, products in the “notebook computers” category therefore all obtain a higher weighted value and appear nearer the top of the search results, where users can easily click through to them; newly published notebook computer listings likewise have a chance of ranking high. For paper notebooks in the “stationery” or “office supplies” category, by contrast, even cheating (for example, the publisher inflating the number of inquiries or repeatedly bookmarking the product information) cannot raise their ranking for the keyword “notebook”, because paper notebooks are simply not what most users searching for “notebook” are interested in. (Users who really want paper notebooks will search further within “stationery” or “office supplies”; this is outside the technical solution of the present application and is not described in detail here.) The search results generated by the present technical solution are thus ordered in a way that better matches the needs of most users, effectively improving the user experience.

The two examples above are only illustrative; real Internet information may have a more elaborate classification hierarchy. In the example above, "notebook battery" and "notebook cooler" refer to individual product listings under the "notebook accessories" category. In practice, they may instead be two subcategories of "notebook accessories". With the technical solution of this application, each of these subcategories then has its own total user feedback amount, and if the user searches within the scope of "notebook accessories", the products of the two subcategories receive different weighting values accordingly. It follows that if the user searches within a smallest category, the search results are ordered by the user feedback amount of each individual product listing in that category.
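The hierarchical case can be sketched in Python as follows. This is only an illustration under stated assumptions: the tuple layout `(title, category_path, feedback)`, the function name `rank_within_scope`, and all feedback numbers are hypothetical and do not come from the application.

```python
from collections import defaultdict


def rank_within_scope(items, scope):
    """Rank the items under `scope` by the feedback total of their subcategory.

    `items` are (title, category_path, feedback) tuples and `scope` is a
    category path prefix (both are assumptions made for this sketch).
    """
    depth = len(scope)
    in_scope = [item for item in items if item[1][:depth] == scope]

    def group(item):
        # The level just below the scope, or None when the scope is
        # already the smallest category the item belongs to.
        path = item[1]
        return path[depth] if len(path) > depth else None

    totals = defaultdict(int)
    for item in in_scope:
        totals[group(item)] += item[2]

    # Larger subcategory total first; inside one subcategory, larger
    # individual feedback first. The name only breaks ties between groups.
    return sorted(
        in_scope,
        key=lambda item: (-totals[group(item)], str(group(item)), -item[2]),
    )


# Hypothetical listings and feedback amounts.
items = [
    ("battery Y", ("notebook accessories", "batteries"), 30),
    ("battery X", ("notebook accessories", "batteries"), 120),
    ("cooler Z", ("notebook accessories", "coolers"), 100),
    ("laptop A", ("complete notebook computers",), 900),
]

by_subcategory = rank_within_scope(items, ("notebook accessories",))
smallest_scope = rank_within_scope(items, ("notebook accessories", "batteries"))
```

With these made-up numbers, "battery Y" ranks above "cooler Z" in the wider scope even though it has less feedback of its own, because its subcategory total is larger; in the smallest scope, only the individual feedback amounts decide the order.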

The above describes how the technical solution of this application applies to web search and e-commerce search. These are only two preferred embodiments; the technical solution can in fact serve all kinds of search needs, such as searching book databases or literature databases. Nor is its scope limited to the Internet: searches on a single machine or within a local area network can also use the technical solution provided by this application.

Corresponding to the method embodiment above, this application also provides an information search system which, as shown in FIG. 2, includes: an information retrieval unit 210, configured to receive a search request and obtain, by retrieval, the pieces of matching information that match the search request; a user feedback amount calculation unit 220, configured to query the user feedback amount of each piece of matching information and then calculate the total user feedback amount of the matching information in each category, where the total user feedback amount of a category is the sum of the user feedback amounts of the matching information belonging to that category; and a result generating unit 230, configured to sort the pieces of matching information according to the total user feedback amount of the category to which each belongs, and to generate the search results.
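The division of work among the three units can be sketched as follows. This is a minimal illustration rather than the implementation described in the application: the `Match` record, the function names, and the feedback numbers are assumptions made for the example, and retrieval (unit 210) is replaced by a fixed list.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Match:
    """One piece of matching information (hypothetical record layout)."""
    title: str
    category: str
    feedback: int  # user feedback amount, for example clicks


def category_totals(matches):
    """Role of unit 220: sum the feedback amounts of the matches per category."""
    totals = defaultdict(int)
    for match in matches:
        totals[match.category] += match.feedback
    return dict(totals)


def generate_results(matches):
    """Role of unit 230: order matches by the feedback total of their category."""
    totals = category_totals(matches)
    # sorted() is stable, so matches in the same category keep retrieval order.
    return sorted(matches, key=lambda m: totals[m.category], reverse=True)


# Stand-in for what unit 210 would retrieve for one query (made-up numbers).
matches = [
    Match("history article", "history", 40),
    Match("film review", "entertainment", 300),
    Match("new trailer page", "entertainment", 0),
    Match("travel guide", "travel", 90),
]
results = generate_results(matches)
```

In this sketch the page with no feedback of its own still ranks second, because it inherits the standing of its category.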

The user feedback amount calculation unit 220 may query the user feedback amount of each piece of matching information by reading the user feedback log for a specific time period.
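Reading a log for a specific time period might look like the sketch below. The log layout (a list of `(timestamp, item_id)` pairs), the half-open time window, and the function name `feedback_amounts` are assumptions for illustration; the application does not specify a log format.

```python
from collections import Counter
from datetime import datetime


def feedback_amounts(log_entries, start, end):
    """Count feedback events per item for entries with start <= time < end."""
    counts = Counter()
    for timestamp, item_id in log_entries:
        if start <= timestamp < end:
            counts[item_id] += 1
    return counts


# Hypothetical click log: (time of the click, id of the clicked result).
log = [
    (datetime(2010, 1, 1, 9, 0), "page-a"),
    (datetime(2010, 1, 2, 10, 30), "page-a"),
    (datetime(2010, 1, 2, 11, 0), "page-b"),
    (datetime(2010, 2, 1, 8, 0), "page-b"),  # falls outside the window below
]
window = feedback_amounts(log, datetime(2010, 1, 1), datetime(2010, 1, 8))
```

Because `Counter` returns 0 for unknown keys, an item with no logged feedback in the window simply has a feedback amount of zero.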

As shown in FIG. 3, the result generating unit 230 may include: a ranking score calculation subunit 231, configured to calculate the ranking score of each piece of matching information, using the total user feedback amount of the category to which it belongs as a weighting parameter, where, if the total user feedback amount of the category of a first piece of matching information is greater than that of the category of a second piece of matching information, the weighting value of the first is greater than the weighting value of the second; and a result generating subunit 232, configured to generate the search results according to the ranking scores of the pieces of matching information.

As shown in FIG. 4, the ranking score calculation subunit 231 may include a first weighting value calculation module 2311, configured to obtain the weighting value of the matching information belonging to each category according to the total user feedback amount of that category. Those skilled in the art will understand that the ranking score calculation subunit 231 may further include a second weighting value calculation module 2312, a third weighting value calculation module 2313, and so on, for calculating the weighting values corresponding to other weighting parameters.

The ranking score calculation subunit 231 may also include a weighted average module 2310, configured to compute a weighted average of the weighting values, including the result of the first weighting value calculation module 2311, to obtain the ranking score of each piece of matching information.
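A weighted average of several weighting values can be sketched as follows. The coefficients, the example weighting values, and the function name `ranking_score` are hypothetical; the application does not state how the individual weighting values are scaled or which coefficients are used.

```python
def ranking_score(weight_values, coefficients):
    """Weighted average of the weighting values computed for one item."""
    if len(weight_values) != len(coefficients):
        raise ValueError("one coefficient per weighting value is required")
    total = sum(coefficients)
    if total <= 0:
        raise ValueError("coefficients must sum to a positive number")
    return sum(w * c for w, c in zip(weight_values, coefficients)) / total


# Hypothetical weighting values for one item, each scaled to [0, 1]:
# category feedback total (module 2311), then two other weighting parameters.
score = ranking_score([0.8, 0.5, 0.2], [2, 1, 1])
```

Giving the category-based value a larger coefficient, as in this made-up call, lets the category total dominate the score while other factors still contribute.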

The first weighting value calculation module 2311 may calculate the ratio of the total user feedback amounts of the categories of matching information and derive the weighting value of the matching information belonging to each category from that ratio. Alternatively, it may sort the total user feedback amounts of the categories and derive the weighting value of the matching information belonging to each category from the sorted order.
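The two options can be sketched as below. The exact formulas are assumptions: the text only requires that a category with a larger total receive a larger weighting value, so the sketch uses each category's share of the summed totals for the ratio variant and its position in ascending order for the rank variant. All names and numbers are hypothetical.

```python
def weights_by_ratio(totals):
    """Weighting value of a category = its share of the summed feedback totals."""
    grand_total = sum(totals.values())
    if grand_total == 0:
        return {category: 0.0 for category in totals}
    return {category: t / grand_total for category, t in totals.items()}


def weights_by_rank(totals):
    """Weighting value of a category depends only on its position by total."""
    distinct = sorted(set(totals.values()))  # ascending, equal totals merged
    position = {t: i + 1 for i, t in enumerate(distinct)}
    return {c: position[t] / len(distinct) for c, t in totals.items()}


# Hypothetical feedback totals for the categories matched by one query.
totals = {"entertainment": 300, "travel": 90, "history": 10}
ratio = weights_by_ratio(totals)
rank = weights_by_rank(totals)
```

The ratio variant preserves how far apart the categories are, while the rank variant keeps only their order, which may be preferable when one category's total is extremely large.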

As shown in FIG. 5, the result generating unit 230 may alternatively include: a first sorting subunit 233, configured to sort the pieces of matching information according to the total user feedback amount of the category to which each belongs; and a second sorting subunit 234, configured to sort the matching information within each category according to the user feedback amount of each piece of matching information.
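The two sorting subunits amount to a two-level sort, sketched here with a single composite sort key. The tuple layout, the function name `two_level_sort`, and the feedback numbers are assumptions for illustration only.

```python
from collections import defaultdict


def two_level_sort(matches):
    """Sort (title, category, feedback) tuples by category total, then feedback."""
    totals = defaultdict(int)
    for _, category, feedback in matches:
        totals[category] += feedback
    return sorted(
        matches,
        # The category name breaks ties between categories with equal totals,
        # so the items of one category always stay together.
        key=lambda m: (-totals[m[1]], m[1], -m[2]),
    )


# Hypothetical results for the keyword "notebook" (made-up feedback amounts).
matches = [
    ("paper notebook", "office supplies", 500),  # individually inflated feedback
    ("laptop A", "complete notebook computers", 400),
    ("laptop B (new listing)", "complete notebook computers", 0),
    ("laptop C", "complete notebook computers", 900),
    ("notebook battery", "notebook accessories", 200),
]
ordered = two_level_sort(matches)
```

In this sketch the paper notebook has more feedback than "laptop A", yet it ranks below every listing in the category with the largest total, including the new listing with no feedback at all.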

The information search system described above may be a search engine for Internet search, or an information search system for searching a single machine or a local area network.

Of course, the user feedback amount need not be the only factor in ordering search results. Other factors, such as how well the keyword entered by the user matches the information shown on a web page, or the page's PageRank value, can be used together with the user feedback amount to order the search results.

For convenience, the above apparatus is described in terms of units divided by function. Of course, when this application is implemented, the functions of the units may be implemented in one or more pieces of software and/or hardware.

From the description of the embodiments above, those skilled in the art can clearly understand that this application can be implemented by software together with a necessary general-purpose hardware platform. On this understanding, the essence of the technical solution of this application, or the part that contributes over the prior art, can be embodied as a software product. The computer software product can be stored in a storage medium such as ROM/RAM, a magnetic disk, or an optical disc, and includes instructions that cause a computer device (which may be a personal computer, a server, a network device, or the like) to perform the methods described in the embodiments of this application or in certain parts of those embodiments.

The embodiments in this specification are described progressively: for parts that are the same or similar across embodiments, refer from one to another, and each embodiment focuses on how it differs from the others. In particular, because the system embodiment is essentially similar to the method embodiment, it is described briefly; for the relevant details, see the corresponding parts of the method embodiment. The system embodiment described above is merely illustrative. The units described as separate components may or may not be physically separate, and the components shown as units may or may not be physical units; that is, they may be located in one place or distributed across multiple network units. Some or all of the modules may be selected as actually needed to achieve the purpose of the embodiment. A person of ordinary skill in the art can understand and implement this without creative effort.

This application can be used in many general-purpose or special-purpose computing system environments or configurations, for example: personal computers, server computers, handheld or portable devices, tablet devices, multiprocessor systems, microprocessor-based systems, set-top boxes, programmable consumer electronics, network PCs, minicomputers, mainframe computers, and distributed computing environments that include any of the above systems or devices.

This application can be described in the general context of computer-executable instructions executed by a computer, such as program modules. Generally, program modules include routines, programs, objects, components, data structures, and the like that perform particular tasks or implement particular abstract data types. This application can also be practiced in distributed computing environments, in which tasks are performed by remote processing devices connected through a communication network. In a distributed computing environment, program modules may be located in both local and remote computer storage media, including storage devices.

The above are only specific embodiments of this application. It should be noted that a person of ordinary skill in the art can make improvements and refinements without departing from the principles of this application, and such improvements and refinements shall also be regarded as falling within the scope of protection of this application.

210 ... Information retrieval unit

220 ... User feedback amount calculation unit

230 ... Result generating unit

231 ... Ranking score calculation subunit

232 ... Result generating subunit

233 ... First sorting subunit

234 ... Second sorting subunit

2310 ... Weighted average module

2311 ... First weighting value calculation module

2312 ... Second weighting value calculation module

2313 ... Third weighting value calculation module

To explain the embodiments of this application or the technical solutions of the prior art more clearly, the drawings used in describing the embodiments or the prior art are briefly introduced below. The drawings described below are merely some of the embodiments recorded in this application; a person of ordinary skill in the art can obtain other drawings from them without creative effort.

FIG. 1 is a flowchart of a method for generating search results according to an embodiment of this application;

FIG. 2 is a schematic structural diagram of an information search system according to an embodiment of this application;

FIG. 3 is a schematic structural diagram of the result generating unit according to an embodiment of this application;

FIG. 4 is a schematic structural diagram of the ranking score calculation subunit according to an embodiment of this application;

FIG. 5 is another schematic structural diagram of the result generating unit according to an embodiment of this application.

Claims (20)

1. A computer-implemented method for generating search results, comprising: receiving, by an information search system, a search request, and obtaining by retrieval the pieces of matching information that match the search request; querying the user feedback amount of each piece of matching information; calculating the total user feedback amount of the category to which each piece of matching information belongs; calculating a first score of a first piece of matching information among the pieces of matching information, using first user feedback corresponding to a first category to which the first piece of matching information belongs; calculating a second score of a second piece of matching information among the pieces of matching information, using second user feedback corresponding to a second category to which the second piece of matching information belongs, wherein calculating the total user feedback amount of the category to which each piece of matching information belongs includes querying a first total amount of user feedback for the first category and a second total amount of user feedback for the second category within a predetermined time period; and sorting the first piece of matching information and the second piece of matching information, using the first score, the second score, the first total amount of user feedback for the first category, and the second total amount of user feedback for the second category, to generate search results.

2. The computer-implemented method of claim 1, wherein the total user feedback amount is the sum of the user feedback amounts of the matching information belonging to the category.

3. The computer-implemented method of claim 1, wherein querying the user feedback amount of each piece of matching information comprises: querying the user feedback amount of each piece of matching information by reading, for a predetermined time period, a user feedback log that records the user feedback corresponding to each search result.

4. The computer-implemented method of claim 1, further comprising sorting the pieces of matching information, wherein sorting the pieces of matching information comprises: calculating a ranking score of each piece of matching information, using the total user feedback amount of the category to which it belongs as a weighting parameter; wherein, if the total user feedback amount of a third category to which a third piece of matching information belongs is greater than the total user feedback amount of a fourth category to which a fourth piece of matching information belongs, the weighting value of the third piece of matching information is greater than the weighting value of the fourth piece of matching information.

5. The computer-implemented method of claim 4, wherein using the total user feedback amount of the category to which each piece of matching information belongs as a weighting parameter comprises: calculating the ratio of the total user feedback amounts of the categories of matching information; and calculating, according to the ratio, the weighting value of the matching information belonging to each category.

6. The computer-implemented method of claim 4, wherein using the total user feedback amount of the category to which each piece of matching information belongs as a weighting parameter comprises: sorting the total user feedback amounts of the categories of matching information; and calculating, according to the sorted order, the weighting value of the matching information belonging to each category.

7. The computer-implemented method of claim 1, further comprising: sorting one or more pieces of matching information within each category, based at least in part on the corresponding one or more user feedback amounts of the one or more pieces of matching information.

8. The computer-implemented method of claim 1, wherein the search request comprises a web page search request and/or an e-commerce search request.

9. The computer-implemented method of claim 1, wherein, when the search request is a web page search request, the user feedback amount comprises: the number of clicks on a web page link and/or the number of times a web page link has been bookmarked.

10. The computer-implemented method of claim 1, wherein the search request comprises an e-commerce search request, and the user feedback amount comprises: product transaction volume, product transaction value, number of product inquiries, and/or number of times the product information has been added to favorites.

11. An information search system, comprising: one or more processors; a memory; an information retrieval unit, stored in the memory and executable by the one or more processors, configured to receive a search request and obtain by retrieval the pieces of matching information that match the search request; a user feedback amount calculation unit, stored in the memory and executable by the one or more processors, configured to query the user feedback amounts corresponding to the pieces of matching information and to calculate the total user feedback amount of the category to which each piece of matching information belongs, the total user feedback amount of a category comprising the sum of the user feedback corresponding to one or more of the pieces of matching information that belong to that category; and a result generating unit, stored in the memory and executable by the one or more processors, configured to: calculate a ranking score of each piece of matching information, using the total user feedback amount of the category to which it belongs as a weighting parameter, wherein, if the total user feedback amount of a first category to which a first piece of matching information belongs is greater than the total user feedback amount of a second category to which a second piece of matching information belongs, the weighting value of the first piece of matching information is greater than the weighting value of the second piece of matching information; sort the pieces of matching information based at least in part on the ranking scores; and generate search results according to the ranking scores of the pieces of matching information.

12. The information search system of claim 11, wherein the user feedback amount calculation unit calculates the user feedback amount of one of the pieces of matching information by reading, for a specific time period, a user feedback log that records user feedback.

13. The information search system of claim 11, further comprising: a first weighting value calculation module, configured to obtain, according to the total user feedback amounts of the categories to which the pieces of matching information belong, the weighting value of the respective category to which one of the pieces of matching information belongs; and a weighted average module, configured to compute a weighted average of weighting values to obtain the ranking score of each piece of matching information.

14. The information search system of claim 13, wherein the first weighting value calculation module is configured to calculate the ratio of the total user feedback amounts of the categories of matching information and to obtain, according to the ratio, the weighting value of the matching information belonging to each category.

15. The information search system of claim 13, wherein the first weighting value calculation module is configured to sort the total user feedback amounts of the categories of matching information and to obtain, according to the sorted order, the weighting value of the matching information belonging to each category.

16. The information search system of claim 11, wherein the result generating unit comprises: a first sorting subunit, configured to sort the pieces of matching information according to the total user feedback amount of the category to which each respectively belongs; and a second sorting subunit, configured to sort one or more of the pieces of matching information within a category according to the corresponding one or more user feedback amounts of those pieces of matching information.

17. A non-transitory storage medium comprising computer-executable instructions which, when executed by a computer device, perform actions comprising: receiving, by an information search system, a search request, and obtaining by retrieval the pieces of matching information that match the search request; querying the user feedback amount of each piece of matching information; calculating the total user feedback amount of the category to which each piece of matching information belongs; calculating a first score of a first piece of matching information among the pieces of matching information, using first user feedback corresponding to a first category to which the first piece of matching information belongs; calculating a second score of a second piece of matching information among the pieces of matching information, using second user feedback corresponding to a second category to which the second piece of matching information belongs, wherein calculating the total user feedback amount of the category to which each piece of matching information belongs includes querying a first total amount of user feedback for the first category and a second total amount of user feedback for the second category within a predetermined time period; and sorting the first piece of matching information and the second piece of matching information, using the first score, the second score, the first total amount of user feedback for the first category, and the second total amount of user feedback for the second category, to generate search results.

18. The non-transitory storage medium of claim 17, wherein querying the user feedback amount of each piece of matching information comprises: querying the user feedback amount of each piece of matching information by reading, for a predetermined time period, a user feedback log that records the user feedback corresponding to each search result.

19. The non-transitory storage medium of claim 17, wherein the actions further comprise sorting the pieces of matching information, and sorting the pieces of matching information comprises: calculating a ranking score of each piece of matching information, using the total user feedback amount of the category to which it belongs as a weighting parameter; wherein, if the total user feedback amount of a third category to which a third piece of matching information belongs is greater than the total user feedback amount of a fourth category to which a fourth piece of matching information belongs, the weighting value of the third piece of matching information is greater than the weighting value of the fourth piece of matching information.

20. The non-transitory storage medium of claim 17, wherein the actions further comprise: sorting one or more pieces of matching information within each category, based at least in part on the corresponding one or more user feedback amounts of the one or more pieces of matching information.
TW099100274A 2010-01-07 2010-01-07 Search results generation method and information search system TWI476611B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
TW099100274A TWI476611B (en) 2010-01-07 2010-01-07 Search results generation method and information search system

Publications (2)

Publication Number Publication Date
TW201124861A TW201124861A (en) 2011-07-16
TWI476611B true TWI476611B (en) 2015-03-11

Family

ID=45047249

Country Status (1)

Country Link
TW (1) TWI476611B (en)

Legal Events

Date Code Title Description
MM4A Annulment or lapse of patent due to non-payment of fees