TWI547888B - A method of recording user information and a search method and a server - Google Patents

A method of recording user information and a search method and a server Download PDF

Info

Publication number
TWI547888B
TWI547888B TW099128884A TW99128884A TWI547888B TW I547888 B TWI547888 B TW I547888B TW 099128884 A TW099128884 A TW 099128884A TW 99128884 A TW99128884 A TW 99128884A TW I547888 B TWI547888 B TW I547888B
Authority
TW
Taiwan
Prior art keywords
information
product
attribute
model
user
Prior art date
Application number
TW099128884A
Other languages
Chinese (zh)
Other versions
TW201209744A (en
Inventor
Wei Yuan
Original Assignee
Alibaba Group Holding Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Alibaba Group Holding Ltd filed Critical Alibaba Group Holding Ltd
Priority to TW099128884A priority Critical patent/TWI547888B/en
Publication of TW201209744A publication Critical patent/TW201209744A/en
Application granted granted Critical
Publication of TWI547888B publication Critical patent/TWI547888B/en

Links

Description

記錄用戶訪問商品資訊的方法及搜尋方法和伺服器Method for recording user access to product information, search method and server

本申請案涉及電腦技術領域,尤其涉及一種記錄用戶訪問商品資訊的方法和伺服器,以及一種利用記錄的用戶訪問商品資訊的結果對商品資訊進行搜尋的方法和伺服器。The present application relates to the field of computer technology, and in particular, to a method and a server for recording user access to product information, and a method and server for searching for product information by using the recorded user to access product information.

用戶在企業對個人用戶(Business To Customer,B2C)網站或個人用戶對個人用戶(Customer To Customer,C2C)網站上進行商品搜索時,會通過網站提供的介面輸入待查詢的商品名稱,網站根據用戶輸入的關鍵字進行搜索後向用戶返回的的結果主要有兩種,一種是與用戶待查詢的商品相關的導航資訊,另一種是用戶待查詢的商品的相關資訊。When a user searches for a product on a Business To Customer (B2C) website or a Customer To Customer (C2C) website, the user enters the name of the item to be queried through the interface provided by the website. There are two main results returned to the user after the input keyword is searched. One is the navigation information related to the product to be queried by the user, and the other is the related information of the product to be queried by the user.

目前,大量的商品資訊按照商品類目名稱以樹的結構組織在一起,商品類目樹保存在資料庫相對應的資料表中,由人工對商品類目樹中各節點的資料進行輸入與維護,B2C網站或是C2C網站中的每個商品資訊的展示都屬於商品類目樹中某一個或多個節點。At present, a large amount of commodity information is organized by tree structure according to the product category name, and the commodity category tree is stored in the corresponding data table of the database, and the data of each node in the commodity category tree is manually input and maintained. The display of each product information in the B2C website or the C2C website belongs to one or more nodes in the product category tree.

用戶向B2C網站或是C2C網站進行商品資訊查詢時,如果網站向用戶返回的結果是與用戶待查詢的商品相關的導航資訊,則用戶可以根據接收到的導航資訊,沿商品類目樹的路徑自上而下定位至希望查詢的搜尋結果。如果網站向用戶返回的結果是用戶待查詢的商品的相關資訊,則網站將商品類目樹中與用戶待查詢的商品相關的所有節點的資訊返回給用戶。When a user queries a B2C website or a C2C website for product information, if the result returned by the website to the user is navigation information related to the item to be queried by the user, the user can follow the path of the product category tree according to the received navigation information. Target from top to bottom to the search results you wish to query. If the result returned by the website to the user is related information of the item to be queried by the user, the website returns information of all the nodes in the item category tree related to the item to be queried by the user to the user.

由於目前的電子商務網站的商品數量非常龐大,有些網站的商品數量能夠達到上億規模,根據用戶輸入的待查詢的商品名稱搜尋出的商品資訊數量可能非常多,一方面,網站伺服器向用戶推送數量巨大的商品資訊會佔用伺服器的大量系統資源以及網路帶寬,另一方面,用戶獲得這些資訊後,很難從網站返回的商品中準確、快速地定位出用戶實際希望查詢的商品。Since the number of products of the current e-commerce website is very large, the number of products of some websites can reach hundreds of millions of scales, and the number of product information searched according to the name of the product to be inquired by the user may be very large. On the one hand, the website server is to the user. Pushing a large amount of product information will occupy a large amount of system resources and network bandwidth of the server. On the other hand, after the user obtains the information, it is difficult to accurately and quickly locate the product that the user actually wants to query from the products returned by the website.

為了解決上述問題,目前的常規做法是限定向用戶返回的商品類目數,通過減少向用戶返回的商品資訊,以減少向用戶推送商品資訊時對系統伺服器的資源佔用和網路的資源佔用,並同時減少用戶的查詢時間。通過減少向用戶返回的商品類目數的做法在一定程度上減少了資源的佔用和用戶的查詢時間,但同時也可能將與用戶查詢相關度非常高的商品排除在外,導致向用戶返回的查詢結果不準確。In order to solve the above problem, the current conventional practice is to limit the number of product categories returned to the user, by reducing the product information returned to the user, to reduce the resource occupation of the system server and the resource occupation of the network when pushing the product information to the user. And at the same time reduce the user's query time. By reducing the number of product categories returned to the user, the resource occupancy and the user's query time are reduced to a certain extent, but it is also possible to exclude goods that are highly correlated with the user query, resulting in a query returned to the user. The result is not accurate.

綜上所述,目前針對用戶請求查詢商品資訊的搜尋技術中,存在的對用戶查詢意圖不明確,向用戶返回的搜尋結果中的資訊與用戶查詢的相關度較低,導致用戶的搜尋結果較差的問題。In summary, in the search technology for requesting the user to query the product information, the intention of the user query is not clear, and the correlation between the information in the search result returned to the user and the user query is low, resulting in poor search results of the user. The problem.

本申請案的目的在於,提供一種記錄用戶訪問商品資訊的方法和伺服器,用以解決現有技術中存在的對用戶查詢意圖不明確的問題。The purpose of the present application is to provide a method and a server for recording user access to product information, which are used to solve the problem that the user's query intention is not clear in the prior art.

一種記錄用戶訪問商品資訊的方法,該方法包括:在用戶每次訪問包含商品資訊的頁面時產生日誌檔,該日誌檔中包含訪問頁面中包含的商品資訊的至少一個屬性資訊;根據各個日誌檔包含的屬性資訊,分別確定同一商品類目對應的各個不同屬性資訊、及每種屬性資訊出現的次數資訊;以及將確定出的同一商品類目對應的各個不同屬性資訊、及每種屬性資訊出現的次數資訊作為一模型資訊組記錄;其中:記錄的該模型資訊組中任一屬性資訊及其對應出現的次數資訊作為該模型資訊組中一個屬性模型資訊存在。A method for recording user information of a product, the method comprising: generating a log file each time a user accesses a page including product information, where the log file includes at least one attribute information of the product information included in the access page; Included attribute information, respectively, determining different attribute information corresponding to the same product category, and information on the number of occurrences of each attribute information; and information about each attribute corresponding to the same product category to be determined, and information about each attribute appearing The number information is recorded as a model information group; wherein: any attribute information recorded in the model information group and the corresponding number of occurrence information are stored as an attribute model information in the model information group.

一種記錄用戶訪問商品資訊的伺服器,該伺服器包括:日誌產生模組,用於收到用戶對包含商品資訊的頁面的訪問資訊時,產生日誌檔,該日誌檔中包含訪問頁面中包含的商品資訊的至少一個屬性資訊;資訊確定模組,用於根據產生的多個該日誌檔中包含的屬性資訊,分別確定同一商品類目對應的各個不同屬性資訊及每種屬性資訊出現的次數資訊;記錄模組,用於將確定出的同一商品類目對應的各個不同屬性資訊以及每種屬性資訊出現的次數資訊作為一模型資訊組記錄;其中:該模型資訊組包括屬性模型資訊,每一該屬性模型資訊包括一屬性資訊及其對應出現的次數資訊。A server for recording user information of a product, the server includes: a log generation module, configured to generate a log file when the user accesses the information of the page containing the product information, where the log file includes the access page At least one attribute information of the product information; the information determining module is configured to respectively determine different attribute information corresponding to the same product category and the number of times each attribute information appears according to the attribute information included in the generated plurality of log files a recording module, configured to record the different attribute information corresponding to the same product category and the number of times each attribute information appears as a model information group record; wherein: the model information group includes attribute model information, each The attribute model information includes an attribute information and information corresponding to the number of occurrences.

本申請案記錄用戶訪問包含商品資訊的頁面時產生日誌檔,並對各個日誌檔中的屬性資訊進行分析,將確定出的同一商品類目對應的各個不同屬性資訊、及每種屬性資訊出現的次數資訊作為一模型資訊組記錄,構建用戶對商品訪問意圖的模型,明確了設定時間內用戶對商品的訪問意圖。The application records a log file generated by the user when accessing the page containing the product information, and analyzes the attribute information in each log file, and determines the different attribute information corresponding to the same product category and the information of each attribute. The number information is recorded as a model information group, and the user's model of the product access intention is constructed, and the user's intention to access the product within the set time is clarified.

本申請案的另一目的在於,提供一種利用記錄的用戶訪問商品資訊對商品資訊進行搜尋的方法和伺服器,用以解決現有技術中存在的向用戶返回的搜尋結果與用戶查詢的相關度較低,導致用戶的搜尋結果較差的問題。Another object of the present application is to provide a method and a server for searching for product information by using a recorded user to access product information, so as to solve the correlation between the search result returned to the user and the user query existing in the prior art. Low, causing poor user search results.

一種對商品資訊進行搜尋的方法,該方法包括:根據用戶輸入的查詢關鍵字,確定用戶待查詢商品資訊所屬的商品類目;在記錄的各個模型資訊組中,查找到確定出的商品類目對應的模型資訊組;從查找到的模型資訊組的屬性模型資訊下的屬性資訊中,確定與該查詢關鍵字匹配的屬性資訊;從確定出的屬性資訊中,提取出出現次數達到設定門限值的屬性資訊;將包含提取出的屬性資訊的商品節點列表返回給用戶,其中商品節點列表中包含了與提取出的屬性資訊內容相同的至少一個包含商品資訊的頁面資訊。A method for searching for product information, the method comprising: determining, according to a query keyword input by a user, a product category to which the product information to be inquired by the user belongs; and finding a determined product category in each recorded information group of the record Corresponding model information group; determining attribute information matching the query keyword from the attribute information under the attribute model information of the found model information group; extracting the occurrence number from the determined attribute information to a set threshold The attribute information is returned to the user, and the item node list includes at least one page information including the item information that is the same as the extracted attribute information content.

一種對商品資訊進行搜尋的伺服器,該進行搜尋的伺服器包括:類目確定模組,用於根據查詢關鍵字,確定待查詢商品資訊所屬的商品類目;查找模組,用於在記錄的各個模型資訊組中,查找確定出的商品類目對應的模型資訊組;屬性資訊確定模組,用於從查找到的模型資訊組的屬性模型資訊下的屬性資訊中,確定與該查詢關鍵字匹配的屬性資訊;提取模組,用於從確定出的屬性資訊中,提取出現次數達到設定門限值的屬性資訊;返回模組,用於將包含提取出的屬性資訊的商品節點列表作為搜尋結果返回。A server for searching for product information, the server for searching includes: a category determining module, configured to determine a product category to which the item information to be inquired belongs according to the query keyword; and a search module for recording In each model information group, the model information group corresponding to the determined product category is searched; the attribute information determining module is configured to determine the key to the query from the attribute information under the attribute model information of the found model information group. The attribute information of the word matching; the extraction module is configured to extract, from the determined attribute information, the attribute information that the number of occurrences reaches the set threshold; the return module is used to search the list of the commodity nodes including the extracted attribute information as a search The result is returned.

本申請案利用已記錄的用戶對商品資訊的訪問意圖對用戶的搜尋請求進行分類,查詢出與用戶的查詢意圖相關性較高的屬性資訊,並將包含查詢出的屬性資訊的商品節點列表返回給用戶,使用戶得到高相關度的搜尋結果,能夠快速、準確的定位出希望獲得的商品資訊。在減少用戶的查詢時間和準確的定位出希望獲得的商品資訊的同時,有效減少了向用戶推送商品資訊時對系統伺服器的資源佔用和網路的資源佔用。The application classifies the user's search request by using the recorded user's access intention to the product information, and queries the attribute information with high relevance to the user's query intention, and returns the list of the commodity nodes including the queried attribute information. To the user, the user can obtain high-correlation search results, and can quickly and accurately locate the desired product information. While reducing the user's query time and accurately locating the desired product information, the resource occupation of the system server and the resource occupation of the network when pushing the product information to the user are effectively reduced.

本申請案提出對一段時間內用戶對商品資訊的訪問進行記錄,根據記錄的匯總資料確定用戶對商品資訊的查詢意圖分佈情況,得到用戶對商品資訊訪問意圖,進而利用得到的用戶對商品資訊的訪問意圖對用戶的搜尋請求進行分類,將與用戶搜尋的商品資訊相關性較高的包含商品資訊頁面鏈結的商品資訊節點列表返回給用戶,讓搜尋結果更加接近用戶的真實意圖,使得搜尋結果更準確。The application proposes to record the user's access to the product information for a period of time, determine the user's intention distribution of the product information according to the recorded summary data, obtain the user's intention to access the product information, and then use the obtained user to the product information. The access intent classifies the user's search request, and returns a list of the product information nodes including the product information page link, which is highly correlated with the product information searched by the user, to the user, so that the search result is closer to the user's true intention, so that the search result is obtained. more acurrate.

下面結合說明書附圖對本申請案的方案進行詳細說明。The solution of the present application will be described in detail below with reference to the accompanying drawings.

實施例一Embodiment 1

本申請案實施例一是記錄用戶訪問商品資訊的方法,如圖1所示,包括以下步驟:步驟101:伺服器在用戶每次訪問包含商品資訊的頁面時產生日誌檔。The first embodiment of the present application is a method for recording user information for accessing goods. As shown in FIG. 1 , the method includes the following steps: Step 101: The server generates a log file each time the user accesses a page containing product information.

本實施例一中涉及的伺服器是指能夠在用戶每次訪問頁面時,為本次訪問事件產生日誌檔的設備。該伺服器可以是與提供商品資訊頁面的伺服器集成在一起,也可以是獨立於提供商品資訊頁面的伺服器。The server involved in the first embodiment refers to a device capable of generating a log file for this access event each time the user accesses the page. The server may be integrated with a server that provides a product information page, or may be a server independent of the product information page.

本步驟的具體執行方式如下:預先在提供商品資訊的頁面中添加一條可以連接到產生日誌檔的伺服器(簡稱“日誌伺服器”)的鏈結,當用戶通過搜尋結果訪問該頁面或是通過其他方式(如通過商品類目樹)訪問該頁面後,用戶對該頁面的每一次點擊都由伺服器產生一條日誌檔,並將產生的該日誌檔通過在頁面中添加的鏈結保存到日誌伺服器中。本步驟中,涉及的日誌伺服器可以是獨立的資料存儲設備,也可以是存儲商品類目樹的資料庫中專門用於存儲日誌檔的存儲設備。The specific implementation manner of this step is as follows: a link that can be connected to a server that generates a log file (referred to as a “log server”) is added in advance to the page for providing product information, and the user accesses the page through the search result or passes the After other methods (such as through the product category tree) access the page, the user generates a log file for each click of the page, and saves the generated log file to the log through the link added in the page. In the server. In this step, the log server involved may be an independent data storage device, or may be a storage device dedicated to storing log files in a database storing a commodity category tree.

本申請案中涉及的日誌檔中包含頁面顯示的商品資訊的至少一個屬性資訊,該屬性資訊包括商品品牌資訊、商品型號資訊、商品顏色資訊或商品所屬類目ID資訊等。例如,用戶訪問一個提供的商品為手機的Web頁面時,伺服器針對該用戶的這次訪問產生的日誌檔可以包含以下屬性資訊:手機的品牌信息為“ABC”,手機的型號資訊為“123”,手機的顏色資訊為“紅色”,所屬類目ID的資訊為“手機”。The log file involved in the application includes at least one attribute information of the product information displayed on the page, and the attribute information includes product brand information, product model information, product color information, or category ID information of the product. For example, when a user accesses a provided product as a web page of a mobile phone, the log file generated by the server for the user's visit may include the following attribute information: the brand information of the mobile phone is “ABC”, and the model information of the mobile phone is “123”. The color information of the mobile phone is "red", and the information of the category ID is "mobile phone".

如果用戶是通過搜尋引擎提供的導航資訊沿商品類目樹的路徑自上而下到達的訪問頁面,則日誌檔中還包含用戶向搜尋引擎提供的搜尋關鍵字資訊。例如,用戶搜尋的關鍵字為“ABC紅色”,根據搜尋引擎伺服器返回的導航資訊沿商品類目樹訪問提供品牌為“ABC”、型號為“123”、且顏色為“紅色”的手機的Web頁面時,則產生的日誌檔可以包含以下屬性資訊:手機的品牌信息為“ABC”,手機的型號資訊為“123”,手機的顏色資訊為“紅色”,搜尋關鍵字資訊為“ABC紅色”等。由於日誌檔是在用戶點擊商品類目頁面或是商品資訊頁面時由伺服器自動產生的文字檔案,因此,日誌伺服器中存儲的各個日誌檔中包含的用戶輸入的搜尋關鍵字資訊的格式可能會不統一,這種情況下,可以對存儲的各個日誌檔中的用戶輸入的關鍵字進行歸一化處理,歸一化處理的方式包括但不限於:去除不必要的詞語、去除多餘的空格、大小寫字母的轉換、全形半形的轉換、繁體簡體的轉換、標點的轉換和中文數位的轉換等。If the user is an access page that is accessed from the top down by the navigation information provided by the search engine along the path of the product category tree, the log file also includes the search keyword information provided by the user to the search engine. For example, the keyword searched by the user is “ABC Red”, and according to the navigation information returned by the search engine server, the mobile phone of the brand “ABC”, model “123”, and color “red” is provided along the product category tree. When the web page is generated, the generated log file may include the following attribute information: the brand information of the mobile phone is “ABC”, the model information of the mobile phone is “123”, the color information of the mobile phone is “red”, and the search keyword information is “ABC red”. "Wait. Since the log file is a text file automatically generated by the server when the user clicks on the product category page or the product information page, the format of the search keyword information input by the user included in each log file stored in the log server may be It may not be unified. In this case, the keywords input by the user in each stored log file may be normalized. The methods of normalization include, but are not limited to, removing unnecessary words and removing extra spaces. , conversion of uppercase and lowercase letters, conversion of full-shaped half-shaped, conversion of traditional Chinese characters, conversion of punctuation and conversion of Chinese digits.

步驟102:根據設定時間長度內產生的各個日誌檔包含的屬性資訊,分別確定同一商品類目對應的各個不同屬性資訊、及每種屬性資訊出現的次數資訊。Step 102: Determine, according to the attribute information included in each log file generated within the set time length, information about different attributes corresponding to the same product category, and information on the number of occurrences of each attribute information.

在本步驟中,可以對設定時間長度(如24小時)內產生的日誌檔進行統計分析,確定用戶在該設定時間長度內對商品資訊的查詢意圖。在設定時間長度內產生的日誌檔可以是多個用戶訪問提供商品資訊的Web頁面時產生的日誌檔。In this step, the log file generated within the set time length (such as 24 hours) can be statistically analyzed to determine the user's intent to query the product information within the set time length. The log file generated within the set time length may be a log file generated when a plurality of users access a web page providing product information.

在本步驟中,由於日誌檔中包含的資訊繁多,因此,可以通過支持向量機(support vector machine)對屬性資訊進行分類,產生分類資料。In this step, since the log file contains a large amount of information, the attribute information can be classified by the support vector machine to generate classified data.

步驟103:將確定出的同一商品類目對應的各個不同屬性資訊以及每種屬性資訊出現的次數資訊作為一模型資訊組記錄。Step 103: Record the different attribute information corresponding to the same product category and the number information of each attribute information occurrence as a model information group record.

其中:記錄的該模型資訊組中任一屬性資訊及其對應出現的次數資訊作為該模型資訊組中一個屬性模型資訊存在。屬性模型資訊包括:商品品牌模型資訊、商品型號模型資訊和商品顏色模型資訊等。例如,如果一條日誌檔中包含的屬性資訊為商品品牌資訊、商品型號資訊、商品顏色資訊和商品類目資訊,則該商品類目資訊對應的模型資訊組中,商品品牌模型資訊包括商品品牌資訊及其出現次數,商品型號模型資訊包括商品型號資訊及其出現次數,商品顏色模型資訊包括商品顏色資訊及其出現次數。Wherein: any attribute information recorded in the model information group and the corresponding number of occurrence information are present as information of an attribute model in the model information group. The attribute model information includes: product brand model information, product model model information, and product color model information. For example, if the attribute information included in a log file is product brand information, product model information, product color information, and product category information, the product brand model information includes product brand information in the model information group corresponding to the product category information. And the number of occurrences, the product model model information includes the product model information and the number of occurrences thereof, and the product color model information includes the product color information and the number of occurrences thereof.

由於每一條日誌檔是在用戶訪問提供某一商品資訊的Web頁面時產生的,因此,每一條日誌檔對應一件商品資訊。但是,某些商品資訊中的一個或多個屬性資訊的內容可能是相同的,但這些商品資訊表示的商品所屬類目ID不同(如相同品牌的手機和電腦,其所屬商品類目不同,但作為屬性資訊的商品品牌資訊相同),因此,可以根據商品資訊表示的商品所屬類目來確定日誌檔對應的類目。Since each log file is generated when the user accesses a web page providing information of a certain product, each log file corresponds to one piece of product information. However, the content of one or more attribute information in some product information may be the same, but the item ID of the item indicated by the item information is different (such as the same brand of mobile phone and computer, the product category is different, but The product brand information as the attribute information is the same), and therefore, the category corresponding to the log file can be determined based on the category of the product indicated by the product information.

由於每一商品類目對應一個模型資訊組,因此,將每個類目對應的模型資訊組集合在一起,成為表示用戶在設定時間長度內對商品資訊訪問意圖的模型。Since each product category corresponds to one model information group, the model information groups corresponding to each category are grouped together to form a model indicating the user's intention to access the product information within the set time length.

下面以設定時間長度內產生N條日誌檔為例,說明實施例一的具體實現方式:對產生的N條日誌檔(1,2......,n0,n1,n2,n3......N)依次進行分析,確定每一條日誌檔包含的屬性資訊,不斷訓練各商品類目對應的模型資訊組,假設通過對第1條~第n0條日誌檔的訓練,得到的模型資訊組如表1所示:The following takes the N log files in the set time length as an example to illustrate the specific implementation manner of the first embodiment: the generated N log files (1, 2, ..., n0, n1, n2, n3.. ....N) Analyze the data in turn, determine the attribute information contained in each log file, and continuously train the model information group corresponding to each product category, assuming the model obtained by training the first to nth log files. The information group is shown in Table 1:

假設:日誌檔n1是訪問提供某一款手機資訊的Web頁面時產生的日誌檔,包含的屬性資訊為:“商品品牌資訊:ABC”,“商品型號資訊:123”,“商品顏色資訊:紅色”。Assume that the log file n1 is a log file generated when accessing a web page providing a certain mobile phone information, and the attribute information included is: "product brand information: ABC", "product model information: 123", "product color information: red ".

日誌檔n2是訪問提供另一款手機資訊的Web頁面時產生的日誌檔,包含的屬性資訊為:“商品品牌資訊:DEF”,“商品型號資訊:456”,“商品顏色資訊:紅色”。The log file n2 is a log file generated when accessing a web page providing another mobile phone information, and the attribute information included is: "product brand information: DEF", "product model information: 456", "product color information: red".

日誌檔n3是訪問提供一款女裙資訊的Web頁面時產生的日誌檔,包含的屬性資訊為:“商品品牌資訊:abc”,“商品型號資訊:S”,“商品顏色資訊:白色”。The log file n3 is a log file generated when accessing a web page providing a skirt information, and the attribute information included is: "product brand information: abc", "product model information: S", "product color information: white".

對上述第n1~第n3條日誌檔分析後,在表1的基礎上進一步得到表2所示的模型資訊組:After analyzing the above n1~n3 log files, the model information group shown in Table 2 is further obtained on the basis of Table 1:

類似地,在第n3條日誌檔之後,可以繼續利用第n4~第N條日誌消息不斷更新表2。表2所示的多個模型資訊組的集合可以表示在設定時間長度內用戶對多種類目商品的訪問意圖的模型。Similarly, after the n3th log file, Table 2 can be continuously updated using the n4th to Nth log messages. The set of multiple model information groups shown in Table 2 may represent a model of the user's intent to access a plurality of categories of goods over a set length of time.

在對表2所示的用戶對商品資訊訪問意圖的模型進行存儲時,不僅需要存儲每一個模型資訊組中的內容,還需要存儲每一個模型資訊組與類目的對應關係。When storing the model of the user's intention to access the product information shown in Table 2, it is necessary to store not only the content in each model information group, but also the correspondence between each model information group and the category.

在實施例一的方案中,對設定時間長度內保存在日誌伺服器中的日誌資訊,可以按照產生的時間先後順序進行分析,訓練得到模型資訊組;也可以不分產生的先後順序,對全部日誌資訊中的屬性資訊進行統一分析,訓練得到模型資訊組。In the solution of the first embodiment, the log information stored in the log server within the set time length may be analyzed according to the generated chronological order, and the model information group may be trained; or the sequence may be generated regardless of the sequence of generation. The attribute information in the log information is analyzed in a unified manner, and the model information group is trained.

通過實施例一的方案,對設定時間長度內用戶對商品資訊的訪問,以日誌檔的形式進行記錄,並記錄根據匯總資料確定的用戶對商品資訊的查詢意圖,以構建設定時間長度內用戶對商品資訊的訪問意圖的模型,從而確定設定時間長度內的用戶訪問意圖。Through the solution of the first embodiment, the user's access to the product information in the set time length is recorded in the form of a log file, and the user's query intention of the product information determined according to the summary data is recorded to construct the user pair within the set time length. A model of the access intent of the product information to determine the user's access intent over the set length of time.

實施例二Embodiment 2

本申請案實施例二利用實施例一記錄的用戶訪問商品資訊對商品資訊進行搜尋的方法,如圖2所示,包括以下步驟:步驟201:伺服器根據接收的用戶輸入的查詢關鍵字,確定用戶待查詢商品資訊所屬的商品類目。In the second embodiment of the present application, the method for searching for product information by using the user to access the product information recorded in the first embodiment, as shown in FIG. 2, includes the following steps: Step 201: The server determines according to the received query keyword input by the user. The product category to which the user information to be queried belongs.

本實施例二中涉及的伺服器是能夠根據用戶輸入的關鍵字進行商品信息搜尋的伺服器,可以與實施例一中涉及的伺服器集成在一起,也可以分別獨立設置。The server according to the second embodiment is a server capable of searching for product information based on a keyword input by a user, and may be integrated with the server involved in the first embodiment, or may be independently set.

步驟202:在記錄的各個模型資訊組中,查找到確定出的商品類目對應的模型資訊組。Step 202: Find, in each of the recorded model information groups, a model information group corresponding to the determined product category.

由於在實施例一的方案中,表2所示的用戶對商品資訊訪問意圖的模型中,每一個模型資訊組與對應的商品類目保存在一起,因此,伺服器在接收到用戶輸入的關鍵字時,可以根據該關鍵字確定待查詢的商品資訊所屬的商品類目,進而確定該商品類目ID對應的模型資訊組。In the solution of the first embodiment, in the model of the user's intention to access the product information shown in Table 2, each model information group is saved with the corresponding product category, and therefore, the server receives the key of the user input. When the word is used, the product category to which the product information to be inquired belongs may be determined according to the keyword, and then the model information group corresponding to the product category ID is determined.

例如:用戶向伺服器輸入的關鍵字為“手機、DEF”,則確定用戶待查詢商品資訊的商品類目是手機,且商品品牌資訊是DEF,對應表2模型中的模型資訊組1。在實施例二中,用戶輸入的查詢關鍵字中也不限於包括待查詢商品類目,還可以包括待查詢商品的屬性資訊。For example, if the keyword input by the user to the server is “mobile phone, DEF”, it is determined that the product category of the product information to be inquired by the user is a mobile phone, and the product brand information is DEF, corresponding to the model information group 1 in the model of Table 2. In the second embodiment, the query keyword input by the user is not limited to include the item category to be queried, and may also include attribute information of the item to be queried.

步驟203:從查找到的模型資訊組的屬性模型資訊下的屬性資訊中,確定與該查詢關鍵字匹配的屬性資訊。Step 203: Determine attribute information that matches the query keyword from the attribute information under the attribute model information of the found model information group.

假設用戶輸入的查詢關鍵字是“手機、DEF”,則查找出的模型資訊組為模型資訊組1,模型資訊組1的屬性模型資訊包括商品品牌模型資訊、商品型號模型資訊和商品顏色模型資訊,每一個屬性模型資訊又進一步包括屬性資訊。與查詢關鍵字“DEF”匹配的屬性資訊包括:商品品牌資訊“DEF”、商品型號資訊“123、456”,商品顏色資訊:“紅色、黑色”。Assuming that the query keyword input by the user is “mobile phone, DEF”, the model information group found is model information group 1, and the attribute model information of model information group 1 includes product brand model information, product model model information, and product color model information. Each attribute model information further includes attribute information. The attribute information matching the query keyword "DEF" includes: product brand information "DEF", product model information "123, 456", product color information: "red, black".

如果在本步驟中能夠查詢出與該查詢關鍵字匹配的屬性資訊,則表示用戶輸入的查詢關鍵字是高頻詞,可以直接利用實施例一得到的模型資訊組進行查詢。If the attribute information matching the query keyword can be queried in this step, it indicates that the query keyword input by the user is a high frequency word, and the model information group obtained in the first embodiment can be directly used for querying.

如果在本步驟中未能夠查詢出與該查詢關鍵字匹配的屬性資訊,表示該用戶輸入的查詢關鍵字是低頻詞,則可以減少查詢關鍵字的內容後重新確定的屬性資訊中查詢與減少內容後的查詢關鍵字匹配的屬性資訊。例如:用戶輸入的查詢關鍵字是“手機、ABD”,由於在實施例一中得到的模型資訊組1中沒有“ABD”品牌的手機,因此,減少查詢關鍵字中的“ABD”,直接將“手機”作為查詢關鍵字重新查詢,以確定與更新後的查詢關鍵字匹配的屬性資訊。If the attribute information matching the query keyword is not queried in this step, indicating that the query keyword input by the user is a low frequency word, the content of the query keyword may be reduced, and the attribute information in the newly determined attribute information may be queried and reduced. After the query keyword matches the attribute information. For example, the query keyword input by the user is “mobile phone, ABD”. Since there is no “ABD” brand mobile phone in the model information group 1 obtained in the first embodiment, the “ABD” in the query keyword is reduced, and the The "phone" re-queries as a query keyword to determine attribute information that matches the updated query keyword.

再重新查詢匹配的屬性資訊之後,確定“ABD”為低頻詞,則可以進一步利用該低頻詞進行即時對用戶的查詢意圖分析,利用低頻詞更新表2中模型資訊組1的內容,得到表3所示的用戶對商品資訊訪問意圖的模型:After re-query the matching attribute information, and determine that "ABD" is a low frequency word, the low frequency word can be further utilized to perform on-the-spot query intent analysis, and the content of the model information group 1 in Table 2 is updated by using the low frequency word to obtain Table 3. The model of the user's access to product information is shown:

步驟204:伺服器從確定出的屬性資訊中,提取出出現次數達到設定門限值的屬性資訊。Step 204: The server extracts, from the determined attribute information, attribute information that the number of occurrences reaches a set threshold.

在本步驟中,為了向用戶返回與查詢相關度高的搜尋結果,可以從步驟203中查詢出的屬性資訊中進一步提取出在之前的設定時間長度內用戶的查詢意圖較高的商品資訊的屬性資訊,例如,在步驟203中查詢出的屬性資訊包括:商品品牌資訊“DEF”、商品型號資訊“123、456”,商品顏色資訊“紅色、黑色”。通過表2可以確定商品品牌資訊為“DEF”,因此,不論“DEF”的數量多少,都要將“DEF”作為提取出的屬性資訊;商品型號資訊“123”的數量為7,商品型號資訊“456”的數量為21,因此,可以將“456”作為提取出的屬性資訊;商品顏色資訊為“紅色”的數量為12,商品顏色資訊為“黑色”的數量為60,因此,可以將“黑色”作為提取出的屬性資訊。此時,最終得到提取出的屬性資訊包括:商品品牌資訊“DEF”、商品型號資訊“456”和商品顏色資訊“黑色”。In this step, in order to return the search result with high relevance to the query to the user, the attribute information of the product information with higher query intention of the user within the previous set time length may be further extracted from the attribute information queried in step 203. The information, for example, the attribute information queried in step 203 includes: product brand information "DEF", product model information "123, 456", and product color information "red, black". According to Table 2, the brand information of the product can be determined as “DEF”. Therefore, regardless of the quantity of “DEF”, “DEF” should be used as the extracted attribute information; the number of product model information “123” is 7, product model information The number of "456" is 21, so "456" can be used as the extracted attribute information; the quantity of the product color information is "red" is 12, and the quantity of product color information is "black" is 60, therefore, it can be "Black" is used as the extracted attribute information. At this time, the finally obtained attribute information includes: product brand information "DEF", product model information "456", and product color information "black".

步驟205:伺服器將包含提取出的屬性資訊的商品節點列表返回給用戶。Step 205: The server returns a list of commodity nodes including the extracted attribute information to the user.

商品節點列表中包含了與提取出的屬性資訊內容相同的至少一個包含商品資訊的頁面資訊,如鏈結位址資訊。The product node list contains at least one page information including the product information, such as the link address information, which is the same as the extracted attribute information content.

由於商品資訊按照商品資訊類目樹的形式保存在資料庫中,因此,可以將包含商品品牌“DEF”、商品型號“456”和商品顏色“黑色”這些屬性資訊的部分或包含這些屬性資訊的商品節點列表按照XML的格式返回給用戶。Since the product information is stored in the database in the form of a product information category tree, it is possible to include a part of the attribute information including the product brand "DEF", the product model number "456", and the product color "black" or the information of the attribute information. The list of commodity nodes is returned to the user in XML format.

在用戶通過商品節點列表中包括的商品頁面的鏈結資訊訪問某一商品頁面時,在該商品頁面中還可進一步包含顯示的商品的賣方資訊、價格趨勢資訊和買方回饋資訊中的一種或多種。When the user accesses a certain product page through the link information of the product page included in the product node list, the product page may further include one or more of the seller information, the price trend information, and the buyer feedback information of the displayed product. .

在本實施例二的方案中,如果用戶在步驟201中輸入的查詢關鍵字中不包含待查詢商品所屬的商品類目,則從表2的模型資訊組中查詢是否存在與查詢關鍵字內容匹配的屬性模型資訊,如果存在,則可以根據查詢出的屬性模型資訊確定該屬性模型資訊所屬的模型資訊組,進而確定出待查詢商品所屬的商品類目;否則,可以按照該查詢關鍵字查詢出與用戶待查詢的商品的相關資訊,並將查詢出的相關資訊返回給用戶,並在用戶訪問該相關資訊指示的頁面時產生日誌檔,並利用產生的日誌檔更新表2的模型資訊組,其中:該查詢關鍵字的內容將作為更新後的模型資訊組內的一部分內容。In the solution of the second embodiment, if the query keyword input by the user in step 201 does not include the product category to which the item to be queried belongs, the model information group of Table 2 is queried whether the content matches the content of the query keyword. The attribute model information, if present, may determine the model information group to which the attribute model information belongs according to the attribute model information that is queried, and then determine the item category to which the item to be queried belongs; otherwise, the query keyword may be queried according to the query keyword. Corresponding information related to the product to be inquired by the user, and returning the related information to the user, and generating a log file when the user accesses the page indicated by the related information, and updating the model information group of Table 2 by using the generated log file. Where: the content of the query keyword will be part of the updated model information group.

例如:如果用戶在步驟201中輸入的查詢關鍵字為“ABC”,則從表2中模型資訊組的內容可以確定待查詢商品所屬的商品類目為手機。For example, if the query keyword input by the user in step 201 is “ABC”, it can be determined from the content of the model information group in Table 2 that the product category to which the item to be inquired belongs is a mobile phone.

如果用戶在步驟201中輸入的查詢關鍵字為“ABD”,則從表2中無法確定待查詢商品所屬的商品類目,因此,伺服器從保存在資料庫中的商品類目樹中查詢出與“ABD”匹配的所有相關資訊,並包含該相關資訊的所有節點的資訊返回給用戶。用戶獲得伺服器返回的資訊後,在每次訪問返回的資訊指示的頁面時,按照實施例一的方案產生日誌檔。例如,用戶訪問一個提供的商品為ABD品牌的手機Web頁面時,伺服器針對該用戶的這次訪問產生的日誌檔至少可以包含以下屬性資訊:手機的品牌資訊為“ABD”,則可以根據當前產生的日誌檔更新表2,得到如表3所示的模型資訊組:If the query keyword input by the user in step 201 is "ABD", the item category to which the item to be inquired belongs cannot be determined from Table 2, and therefore, the server queries the item category tree stored in the database. All relevant information matching "ABD" and information about all nodes containing the related information is returned to the user. After the user obtains the information returned by the server, each time the page indicated by the returned information is accessed, the log file is generated according to the scheme of the first embodiment. For example, when a user accesses a mobile phone webpage of an ABD brand, the log file generated by the server for the user's visit may include at least the following attribute information: the brand information of the mobile phone is “ABD”, and may be generated according to the current The log file is updated in Table 2 to obtain the model information group as shown in Table 3:

在上述實例中,用戶輸入的查詢關鍵字“ABD”可能是商品類目樹中真實存在的商品屬性資訊的內容,也可能是用戶在輸入查詢關鍵字時的誤輸入,如用戶實際希望輸入的查詢關鍵字是“ABC”,但在輸入時出現錯誤導致輸入“ABD”,在按照上述實例中的方式向用戶返回商品類目樹中與用戶待查詢的商品相關的所有節點的資訊後,用戶訪問頁面時產生日誌檔中應當包含該頁面實際的屬性資訊以及用戶輸入的查詢關鍵字。In the above example, the query keyword "ABD" input by the user may be the content of the commodity attribute information actually existing in the commodity category tree, or may be a mistake input by the user when inputting the query keyword, such as the user actually wants to input. The query keyword is "ABC", but an error occurs when inputting, resulting in the input of "ABD". After returning the information of all the nodes in the product category tree related to the product to be queried by the user in the manner of the above example, the user When the page is accessed, the log file should contain the actual attribute information of the page and the query keyword entered by the user.

例如:用戶訪問的是提供的商品為ABC品牌的手機Web頁面,此時,伺服器針對該用戶的這次訪問產生的日誌檔至少可以包含以下屬性資訊:手機的品牌資訊為“ABC”和“ABD”,則可以根據當前產生的日誌檔更新表2,得到如表4所示的模型資訊組:For example, the user accesses the mobile phone web page of the ABC brand. At this time, the log file generated by the server for the user's visit may include at least the following attribute information: the brand information of the mobile phone is “ABC” and “ABD”. ", you can update Table 2 according to the currently generated log file to get the model information group as shown in Table 4:

在表4所示的模型資訊組中,如果ABD是用戶的誤輸入,則在利用表4執行本發明實施例二的方案時,由於誤輸入“ABD”對應的數量較少,達不到設定門限值,因此,在用戶正確輸入查詢關鍵字時,誤輸入不會影響查詢結果的準確性;如果ABD不是用戶的誤輸入,而是一種新式的手機品牌,則後續當有用戶請求查詢“ABD”時,可以按照表4為用戶提供準確的查詢結果。In the model information group shown in Table 4, if the ABD is a user's erroneous input, when the scheme of the second embodiment of the present invention is executed by using Table 4, the number of erroneously inputting "ABD" is small, and the setting cannot be achieved. Threshold, therefore, when the user correctly enters the query keyword, the incorrect input will not affect the accuracy of the query result; if the ABD is not the user's wrong input, but a new type of mobile phone brand, then the user requests to query "ABD" ", you can provide accurate query results for users according to Table 4.

通過本實施例二的方案,利用已記錄的用戶對商品資訊的訪問意圖對用戶的搜尋請求進行分類,查詢出與用戶的查詢意圖相關性較高的屬性資訊,使用戶得到高相關度的搜尋結果,從而能夠快速、準確的定位出希望獲得的商品資訊。Through the solution of the second embodiment, the user's search request is classified by using the recorded user's intention to access the product information, and the attribute information with high relevance to the user's query intention is queried, so that the user obtains a high correlation search. As a result, it is possible to quickly and accurately locate the desired product information.

實施例三Embodiment 3

本申請案實施例三還提供一種記錄用戶訪問商品資訊的伺服器,如圖3所示,該伺服器包括:日誌產生模組11、資訊確定模組12和記錄模組13,其中:日誌產生模組11用於收到用戶對包含商品資訊的頁面的訪問資訊時,產生日誌檔,該日誌檔中包含訪問頁面中包含的商品資訊的至少一個屬性資訊;資訊確定模組12用於根據產生的多個該日誌檔中包含的屬性資訊,分別確定同一商品類目對應的各個不同屬性資訊及每種屬性資訊出現的次數資訊;記錄模組13用於將確定出的同一商品類目對應的各個不同屬性資訊以及每種屬性資訊出現的次數資訊作為一模型資訊組記錄;其中:該模型資訊組包括屬性模型資訊,每一該屬性模型資訊包括一屬性資訊及其對應出現的次數資訊。The third embodiment of the present application further provides a server for recording user information of the product. As shown in FIG. 3, the server includes: a log generation module 11, an information determination module 12, and a record module 13, wherein: the log is generated. The module 11 is configured to generate a log file, where the log file includes at least one attribute information of the product information included in the access page, and the information determining module 12 is configured to generate the log information. The attribute information included in the plurality of log files respectively determines different attribute information corresponding to the same product category and the number of times information of each attribute information appears; the recording module 13 is configured to correspond to the determined same product category Each of the different attribute information and the number of times information of each attribute information is recorded as a model information group; wherein: the model information group includes attribute model information, and each of the attribute model information includes an attribute information and information corresponding to the number of occurrences.

本實施例中涉及的商品資訊的屬性資訊和屬性模型資訊與實施例一中定義相同。The attribute information and attribute model information of the product information involved in this embodiment are the same as defined in the first embodiment.

實施例四Embodiment 4

本申請案實施例四在利用實施例三的伺服器記錄的用戶訪問商品資訊基礎上,提出一種對商品資訊進行搜尋的伺服器,如圖4所示,進行搜尋的伺服器包括:類目確定模組21、查找模組22、屬性資訊確定模組23、提取模組24和返回模組25,其中:類目確定模組21用於根據查詢關鍵字,確定待查詢商品資訊所屬的商品類目;查找模組22用於在記錄的各個模型資訊組中,查找確定出的商品類目對應的模型資訊組;屬性資訊確定模組23用於從查找到的模型資訊組的屬性模型資訊下的屬性資訊中,確定與該查詢關鍵字匹配的屬性資訊;提取模組24用於從確定出的屬性資訊中,提取出現次數達到設定門限值的屬性資訊;返回模組25用於將包含提取出的屬性資訊的商品節點列表作為搜尋結果返回。In the fourth embodiment of the present application, based on the user accessing the product information recorded by the server of the third embodiment, a server for searching for product information is proposed. As shown in FIG. 4, the server for searching includes: category determination. The module 21, the search module 22, the attribute information determining module 23, the extracting module 24, and the returning module 25, wherein the category determining module 21 is configured to determine the product category to which the product information to be inquired belongs according to the query keyword. The search module 22 is configured to search for the model information group corresponding to the determined product category in each of the recorded model information groups; the attribute information determining module 23 is configured to use the attribute model information of the found model information group. In the attribute information, the attribute information matching the query keyword is determined; the extraction module 24 is configured to extract, from the determined attribute information, attribute information whose number of occurrences reaches a set threshold; the return module 25 is configured to include the extraction The list of product nodes of the attribute information is returned as a search result.

該查找模組22具體用於根據查詢關鍵字中的屬性資訊所屬的屬性模型資訊,從多個模型資訊組中確定該屬性模型資訊所屬的模型資訊組。The searching module 22 is specifically configured to determine, according to the attribute model information to which the attribute information belongs in the query keyword, the model information group to which the attribute model information belongs from the plurality of model information groups.

屬性資訊確定模組23還用於在模型資訊組中未查詢出與該查詢關鍵字匹配的屬性資訊所對應的屬性模型資訊時,減少查詢關鍵字的內容後,重新在查找到的模型資訊組中查詢與減少內容後的查詢關鍵字匹配的屬性資訊所對應的屬性模型資訊。The attribute information determining module 23 is further configured to: when the attribute model information corresponding to the attribute information matching the query keyword is not queried in the model information group, reduce the content of the query keyword, and then re-find the found model information group. The attribute model information corresponding to the attribute information matching the query keyword after the content is reduced.

該進行搜尋的伺服器還包括更新模組26,用於根據減少內容後的查詢關鍵字更新查找到的模型資訊組。The server for searching further includes an update module 26 for updating the found model information group according to the query keyword after the content is reduced.

返回模組25還用於在模型資訊組的屬性模型資訊中不存在與查詢關鍵字匹配的屬性資訊時,從商品類目樹中查詢出與查詢關鍵字匹配的所有資訊,並返回包含該資訊的所有節點的資訊。The returning module 25 is further configured to: when the attribute information matching the query keyword does not exist in the attribute model information of the model information group, query all information matching the query keyword from the product category tree, and return the information including the information. Information about all nodes.

該進行搜尋的伺服器還包括日誌產生模組27,用於收到對該返回包含該資訊的所有節點的資訊中的商品資訊的頁面的訪問資訊時,產生日誌檔;更新模組26還用於根據該日誌檔中的商品資訊的屬性資訊和對應的該查詢關鍵字,更新模型資訊組。The server for searching further includes a log generating module 27, configured to generate a log file when receiving access information of a page for returning product information in information of all nodes that include the information; the update module 26 further uses Updating the model information group according to the attribute information of the product information in the log file and the corresponding query keyword.

實施例三中的記錄用戶訪問商品資訊的伺服器和實施例四中的對商品資訊進行搜尋的伺服器可以是獨立的網路設備,也可以是集成在一起的網路設備。The server for recording the user's access to the product information in the third embodiment and the server for searching the product information in the fourth embodiment may be independent network devices or integrated network devices.

本領域內的技術人員應明白,本申請案的實施例可提供為方法、系統、或電腦程式產品。因此,本申請案可採用完全硬體實施例、完全軟體實施例、或結合軟體和硬體方面的實施例的形式。而且,本申請案可採用在一個或多個其中包含有電腦可用程式碼的電腦可用存儲介質(包括但不限於磁盤記憶體、CD-ROM、光學記憶體等)上實施的電腦程式產品的形式。Those skilled in the art will appreciate that embodiments of the present application can be provided as a method, system, or computer program product. Thus, the present application can take the form of a complete hardware embodiment, a fully software embodiment, or an embodiment combining soft and hardware aspects. Moreover, the present application can take the form of a computer program product implemented on one or more computer usable storage media (including but not limited to disk memory, CD-ROM, optical memory, etc.) containing computer usable code. .

本申請案是參照根據本申請案實施例的方法、設備(系統)、和電腦程式產品的流程圖和/或方框圖來描述的。應理解可由電腦程式指令實現流程圖和/或方框圖中的每一流程和/或方框、以及流程圖和/或方框圖中的流程和/或方框的結合。可提供這些電腦程式指令到通用電腦、專用電腦、嵌入式處理機或其他可編程資料處理設備的處理器以產生一個機器,使得通過電腦或其他可編程資料處理設備的處理器執行的指令產生用於實現在流程圖一個流程或多個流程和/或方框圖一個方框或多個方框中指定的功能的裝置。The present application is described with reference to flowchart illustrations and/or block diagrams of methods, devices (systems), and computer program products according to embodiments of the present application. It will be understood that each flow and/or block of the flowchart illustrations and/or FIG. These computer program instructions can be provided to a processor of a general purpose computer, a special purpose computer, an embedded processor or other programmable data processing device to produce a machine for generating instructions for execution by a processor of a computer or other programmable data processing device. Means for implementing the functions specified in one or more of the flow or in a block or blocks of the flow chart.

這些電腦程式指令也可存儲在能引導電腦或其他可編程資料處理設備以特定方式工作的電腦可讀記憶體中,使得存儲在該電腦可讀記憶體中的指令產生包括指令裝置的製造品,該指令裝置實現在流程圖一個流程或多個流程和/或方框圖一個方框或多個方框中指定的功能。The computer program instructions can also be stored in a computer readable memory that can boot a computer or other programmable data processing device to operate in a particular manner, such that instructions stored in the computer readable memory produce an article of manufacture including the instruction device. The instruction means implements the functions specified in one or more blocks of the flow or in a flow or block diagram of the flowchart.

這些電腦程式指令也可裝載到電腦或其他可編程資料處理設備上,使得在電腦或其他可編程設備上執行一系列操作步驟以產生電腦實現的處理,從而在電腦或其他可編程設備上執行的指令提供用於實現在流程圖一個流程或多個流程和/或方框圖一個方框或多個方框中指定的功能的步驟。These computer program instructions can also be loaded onto a computer or other programmable data processing device to perform a series of operational steps on a computer or other programmable device to produce computer-implemented processing for execution on a computer or other programmable device. The instructions provide steps for implementing the functions specified in one or more of the flow or in a block or blocks of a flow diagram.

儘管已描述了本申請案的優選實施例,但本領域內的技術人員一旦得知了基本創造性概念,則可對這些實施例做出另外的變更和修改。所以,所附申請專利範圍意欲解釋為包括優選實施例以及落入本申請案範圍的所有變更和修改。While the preferred embodiment of the present application has been described, those skilled in the art can make additional changes and modifications to these embodiments once they are aware of the basic inventive concept. Therefore, the scope of the appended claims is intended to be construed as a

顯然,本領域的技術人員可以對本申請案進行各種改動和變型而不脫離本申請案的精神和範圍。這樣,倘若本申請案的這些修改和變型屬於本申請案申請專利範圍及其等同技術的範圍之內,則本申請案也意圖包含這些改變和變型在內。It will be apparent that those skilled in the art can make various modifications and variations to the present invention without departing from the spirit and scope of the application. Thus, it is intended that the present invention cover the modifications and variations of the invention,

11...日誌產生模組11. . . Log generation module

12...資訊確定模組12. . . Information determination module

13...記錄模組13. . . Recording module

21...類目確定模組twenty one. . . Category determination module

22...查找模組twenty two. . . Search module

23...屬性資訊確定模組twenty three. . . Attribute information determination module

24...提取模組twenty four. . . Extraction module

25...返回模組25. . . Return module

26...更新模組26. . . Update module

27...日誌產生模組27. . . Log generation module

圖1為本申請案實施例一確定商品的屬性資訊數量的方法示意圖;1 is a schematic diagram of a method for determining the quantity of attribute information of a commodity according to Embodiment 1 of the present application;

圖2為本申請案實施例二進行商品搜尋的方法示意圖;2 is a schematic diagram of a method for performing product search according to Embodiment 2 of the present application;

圖3為本申請案實施例三記錄用戶訪問商品資訊的伺服器結構示意圖;3 is a schematic structural diagram of a server for recording user access to product information according to Embodiment 3 of the present application;

圖4為本申請案實施例三對商品資訊進行搜尋的伺服器結構示意圖。FIG. 4 is a schematic structural diagram of a server for searching product information according to Embodiment 3 of the present application.

Claims (10)

一種記錄用戶訪問商品資訊的方法,其特徵在於,該方法包括:伺服器收到用戶對包含商品資訊的頁面的訪問資訊時,產生日誌檔,該日誌檔中包含訪問頁面中包含的商品資訊的至少一個屬性資訊;根據產生的多個該日誌檔中包含的屬性資訊,分別確定同一商品類目對應的各個不同屬性資訊及每種屬性資訊出現的次數資訊;將確定出的同一商品類目對應的各個不同屬性資訊以及每種屬性資訊出現的次數資訊作為一模型資訊組記錄,其中,該模型資訊組包括屬性模型資訊,每一該屬性模型資訊包括一屬性資訊及其對應出現的次數資訊,以及伺服器根據接收的用戶輸入的查詢關鍵字,確定用戶待查詢商品資訊所屬的商品類目,其中,如果查詢關鍵字中存在待查詢商品所屬的商品類目,則:從模型資訊組的屬性資訊中提取與商品類目對應的屬性資訊,且如果查詢關鍵字中不存在待查詢商品所屬的商品類目,則:使用查詢關鍵字查詢商品類目樹中待查詢的商品的相關資訊,並將該相關資訊的所有節點的資訊返回給用戶。 A method for recording user information of a product, wherein the method includes: when the server receives the access information of the user that includes the product information, the log file is generated, and the log file includes the product information included in the access page. At least one attribute information; determining, according to the generated attribute information of the plurality of log files, different attribute information corresponding to the same product category and the number of times each attribute information appears; determining the same product category corresponding to the same product category Each of the different attribute information and the number of times each attribute information appears as a model information group record, wherein the model information group includes attribute model information, and each of the attribute model information includes an attribute information and information corresponding to the number of occurrences, And the server determines, according to the received query keyword input by the user, the product category to which the product information to be inquired belongs belongs, wherein if there is a product category to which the item to be inquired belongs in the query keyword, then: the attribute of the model information group from the model Extracting attribute information corresponding to the product category in the information, and Merchandise categories of goods to be queried belongs does not exist if the query keywords, then: Use the information to a query keyword query commodity product category tree to be queried, and information related to the information of all the nodes back to the user. 如申請專利範圍第1項所述的方法,其中,該日誌檔中包含的商品資訊的屬性資訊為:商品品牌資訊、商品 型號資訊、商品顏色資訊和商品類目資訊;該模型資訊組中包含以下屬性模型資訊:由商品品牌資訊及其出現次數構成的商品品牌模型資訊;由商品型號資訊及其出現次數構成的商品型號模型資訊;以及由商品顏色資訊及其出現次數構成的商品顏色模型資訊。 The method of claim 1, wherein the attribute information of the product information included in the log file is: product brand information, product Model information, product color information, and product category information; the model information group includes the following attribute model information: product brand model information composed of product brand information and its occurrence times; product model number composed of product model information and its appearance times Model information; and product color model information consisting of product color information and its occurrences. 一種對商品資訊進行搜尋的方法,其特徵在於,利用申請專利範圍第1項記錄的用戶訪問商品資訊,該方法包括:伺服器根據查詢關鍵字,確定待查詢商品資訊所屬的商品類目;如果查詢關鍵字中存在待查詢商品所屬的商品類目,則:在記錄的各個模型資訊組中,查找確定出的商品類目對應的模型資訊組;從查找到的模型資訊組的屬性模型資訊下的屬性資訊中,確定與該查詢關鍵字匹配的屬性資訊;從確定出的屬性資訊中,提取出現次數達到設定門限值的屬性資訊;以及如果查詢關鍵字中不存在待查詢商品所屬的商品類目,則:使用查詢關鍵字查詢商品類目樹中待查詢的商品的相關資訊,並將包含提取出的屬性資訊的商品節點列表作 為搜尋結果返回。 A method for searching for product information, characterized in that the user accessing the product information by using the record of the first item of the patent application scope includes: the server determines, according to the query keyword, the product category to which the product information to be inquired belongs; If there is a product category to which the item to be inquired belongs in the query keyword, the model information group corresponding to the determined product category is searched in each model information group recorded; from the attribute model information of the found model information group In the attribute information, the attribute information matching the query keyword is determined; from the determined attribute information, the attribute information whose number of occurrences reaches the set threshold is extracted; and if the product category to which the item to be inquired does not exist in the query keyword For example, the query keyword is used to query related information of the item to be inquired in the product category tree, and the list of product nodes including the extracted attribute information is made. Return for search results. 如申請專利範圍第3項所述的方法,其中,在提取出對應屬性資訊出現次數達到設定門限值的屬性資訊之前,該方法還包括:如果在模型資訊組中未查詢出與該查詢關鍵字匹配的屬性資訊所對應的屬性模型資訊,則減少查詢關鍵字的內容後,重新在查找到的模型資訊組中查詢與減少內容後的查詢關鍵字匹配的屬性資訊所對應的屬性模型資訊。 The method of claim 3, wherein before extracting the attribute information that the number of occurrences of the corresponding attribute information reaches the set threshold, the method further includes: if the query keyword is not found in the model information group After matching the attribute model information corresponding to the attribute information, after reducing the content of the query keyword, the attribute model information corresponding to the attribute information matching the query keyword after the content reduction is re-inquired in the found model information group. 如申請專利範圍第4項所述的方法,其中,重新查詢與減少內容後的查詢關鍵字匹配的屬性資訊所對應的屬性模型資訊之後,該方法還包括:根據減少內容後的查詢關鍵字更新查找到的模型資訊組。 The method of claim 4, wherein, after re-query the attribute model information corresponding to the attribute information matching the query keyword after the content is reduced, the method further comprises: updating the query keyword according to the reduced content The model information group found. 如申請專利範圍第3項所述的方法,其中,伺服器根據查詢關鍵字確定的商品類目查找對應的模型資訊組,具體包括:伺服器根據查詢關鍵字中的屬性資訊所屬的屬性模型資訊,從多個模型資訊組中確定該屬性模型資訊所屬的模型資訊組。 The method of claim 3, wherein the server searches for a corresponding model information group according to the product category determined by the query keyword, and specifically includes: the attribute model information that the server belongs to according to the attribute information in the query keyword. And determining, from a plurality of model information groups, a model information group to which the attribute model information belongs. 如申請專利範圍第3項所述的方法,其中,在模型資訊組的屬性模型資訊中不存在與查詢關鍵字匹配的屬性資訊時,該方法還包括:伺服器從商品類目樹中查詢出與查詢關鍵字匹配的所有資訊,並返回包含該資訊的所有節點的資訊。 The method of claim 3, wherein, when there is no attribute information matching the query keyword in the attribute model information of the model information group, the method further comprises: the server querying from the commodity category tree All information that matches the query keyword and returns information about all nodes that contain that information. 如申請專利範圍第7項所述的方法,其中,該方法還包括:伺服器接收到對該返回包含該資訊的所有節點的資訊中的商品資訊的頁面的訪問資訊時,產生日誌檔,並根據該日誌檔中的商品資訊的屬性資訊和對應的該查詢關鍵字,更新模型資訊組。 The method of claim 7, wherein the method further comprises: when the server receives the access information of the page of the product information in the information of all the nodes that return the information, generating a log file, and The model information group is updated according to the attribute information of the product information in the log file and the corresponding query keyword. 一種記錄用戶訪問商品資訊的伺服器,其特徵在於,該伺服器包括:日誌產生模組,用於接收到用戶對包含商品資訊的頁面的訪問資訊時,產生日誌檔,該日誌檔中包含訪問頁面中包含的商品資訊的至少一個屬性資訊;資訊確定模組,用於根據產生的多個該日誌檔中包含的屬性資訊,分別確定同一商品類目對應的各個不同屬性資訊及每種屬性資訊出現的次數資訊;以及記錄模組,用於將確定出的同一商品類目對應的各個不同屬性資訊以及每種屬性資訊出現的次數資訊作為一模型資訊組記錄,其中,該模型資訊組包括屬性模型資訊,每一該屬性模型資訊包括一屬性資訊及其對應出現的次數資訊,其中伺服器根據接收的用戶輸入的查詢關鍵字,確定用戶待查詢商品資訊所屬的商品類目,其中,如果查詢關鍵字中存在待查詢商品所屬的商品類目,則:從模型資訊組的屬性資訊中提取與商品類目對應的屬性資訊,且 如果查詢關鍵字中不存在待查詢商品所屬的商品類目,則:使用查詢關鍵字查詢商品類目樹中待查詢的商品的相關資訊,並將該相關資訊的所有節點的資訊返回給用戶。 A server for recording user information of a product, wherein the server includes: a log generating module, configured to generate a log file when the user accesses the access information of the page containing the product information, and the log file includes the access At least one attribute information of the product information included in the page; the information determining module is configured to respectively determine different attribute information and each attribute information corresponding to the same product category according to the generated attribute information included in the plurality of log files The number of times of occurrence information; and a recording module, configured to record the different attribute information corresponding to the same product category and the number of times each attribute information appears as a model information group, wherein the model information group includes attributes The model information, each of the attribute model information includes an attribute information and a corresponding number of times of occurrence information, wherein the server determines, according to the received query keyword input by the user, the product category to which the user information to be inquired belongs, wherein, if the query If there is a product category to which the item to be inquired belongs in the keyword, then: Attribution information group type information is extracted and the product category corresponding to the attribute information, and If there is no product category to which the item to be queried belongs in the query keyword, the query keyword is used to query related information of the item to be queried in the item category tree, and the information of all nodes of the related information is returned to the user. 一種進行搜尋的伺服器,利用申請專利範圍第9項記錄的用戶訪問商品資訊對商品資訊進行搜索尋,其特徵在於,該進行搜尋的伺服器包括:類目確定模組,用於根據查詢關鍵字,確定待查詢商品資訊所屬的商品類目;查找模組,用於如果查詢關鍵字中存在待查詢商品所屬的商品類目,則在記錄的各個模型資訊組中,查找確定出的商品類目對應的模型資訊組;屬性資訊確定模組,用於從查找到的模型資訊組的屬性模型資訊下的屬性資訊中,確定與該查詢關鍵字匹配的屬性資訊;提取模組,用於從確定出的屬性資訊中,提取出現次數達到設定門限值的屬性資訊;以及返回模組,用於如果查詢關鍵字中不存在待查詢商品所屬的商品類目,則:使用查詢關鍵字查詢商品類目樹中待查詢的商品的相關資訊,並將包含提取出的屬性資訊的商品節點列表作為搜尋結果返回。 A server for searching uses a user who accesses product information in the ninth record of the patent application to search for product information, wherein the server for searching includes: a category determining module, which is used according to the query key a word to determine a product category to which the item information to be inquired belongs; a search module for searching for the determined item category in each of the recorded model information groups if there is a product category to which the item to be inquired belongs in the query keyword a model information group corresponding to the object; an attribute information determining module, configured to determine attribute information matching the query keyword from the attribute information under the attribute model information of the found model information group; and extracting a module for In the determined attribute information, the attribute information whose number of occurrences reaches the set threshold is extracted; and the return module is used to query the product category using the query keyword if there is no product category to which the item to be inquired belongs in the query keyword Information about the item to be queried in the target tree, and a list of product nodes containing the extracted attribute information as a search The result is returned.
TW099128884A 2010-08-27 2010-08-27 A method of recording user information and a search method and a server TWI547888B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
TW099128884A TWI547888B (en) 2010-08-27 2010-08-27 A method of recording user information and a search method and a server

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
TW099128884A TWI547888B (en) 2010-08-27 2010-08-27 A method of recording user information and a search method and a server

Publications (2)

Publication Number Publication Date
TW201209744A TW201209744A (en) 2012-03-01
TWI547888B true TWI547888B (en) 2016-09-01

Family

ID=46763738

Family Applications (1)

Application Number Title Priority Date Filing Date
TW099128884A TWI547888B (en) 2010-08-27 2010-08-27 A method of recording user information and a search method and a server

Country Status (1)

Country Link
TW (1) TWI547888B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI734733B (en) * 2017-01-23 2021-08-01 香港商阿里巴巴集團服務有限公司 Method and device for obtaining product objects

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103310343A (en) * 2012-03-15 2013-09-18 阿里巴巴集团控股有限公司 Commodity information issuing method and device
WO2016103383A1 (en) * 2014-12-25 2016-06-30 楽天株式会社 Information processing device, information processing method, program, and storage medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060224406A1 (en) * 2005-03-30 2006-10-05 Jean-Michel Leon Methods and systems to browse data items
CN1858733A (en) * 2005-11-01 2006-11-08 华为技术有限公司 Information searching system and searching method
WO2007106403A2 (en) * 2006-03-10 2007-09-20 Ebay Inc. Methods and systems to generate rules to identify data items
US20090106108A1 (en) * 2007-10-22 2009-04-23 Young Bae Ku Website management method and on-line system
TW200945074A (en) * 2008-04-22 2009-11-01 Ein Si & S Co Ltd Method and system for providing content (3)

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060224406A1 (en) * 2005-03-30 2006-10-05 Jean-Michel Leon Methods and systems to browse data items
CN1858733A (en) * 2005-11-01 2006-11-08 华为技术有限公司 Information searching system and searching method
WO2007106403A2 (en) * 2006-03-10 2007-09-20 Ebay Inc. Methods and systems to generate rules to identify data items
US20090106108A1 (en) * 2007-10-22 2009-04-23 Young Bae Ku Website management method and on-line system
TW200945074A (en) * 2008-04-22 2009-11-01 Ein Si & S Co Ltd Method and system for providing content (3)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI734733B (en) * 2017-01-23 2021-08-01 香港商阿里巴巴集團服務有限公司 Method and device for obtaining product objects

Also Published As

Publication number Publication date
TW201209744A (en) 2012-03-01

Similar Documents

Publication Publication Date Title
JP5721818B2 (en) Use of model information group in search
US9928537B2 (en) Management and storage of distributed bookmarks
US9569506B2 (en) Uniform search, navigation and combination of heterogeneous data
US9418128B2 (en) Linking documents with entities, actions and applications
RU2701110C2 (en) Studying and using contextual rules of extracting content to eliminate ambiguity of requests
US9171088B2 (en) Mining for product classification structures for internet-based product searching
US10585927B1 (en) Determining a set of steps responsive to a how-to query
US10592571B1 (en) Query modification based on non-textual resource context
AU2014228754C1 (en) Non-deterministic disambiguation and matching of business locale data
US11874882B2 (en) Extracting key phrase candidates from documents and producing topical authority ranking
JP7254925B2 (en) Transliteration of data records for improved data matching
US11886444B2 (en) Ranking search results using hierarchically organized coefficients for determining relevance
TWI547888B (en) A method of recording user information and a search method and a server
US11328005B2 (en) Machine learning (ML) based expansion of a data set
JPWO2018070026A1 (en) Product information display system, product information display method, and program
US11423098B2 (en) Method and apparatus to generate a simplified query when searching for catalog items
CN114020867A (en) Method, device, equipment and medium for expanding search terms
CN111222918B (en) Keyword mining method and device, electronic equipment and storage medium
WO2019218151A1 (en) Data searching method
CN107423298B (en) Searching method and device
TW201741911A (en) Processing and interaction method for use in data recommendation, device, and system
US9858291B1 (en) Detection of related local entities
CN116910229A (en) Intelligent query method and device for index