KR100884889B1

KR100884889B1 - Method and system for adding automatic indexing word to search database

Info

Publication number: KR100884889B1
Application number: KR1020070029454A
Authority: KR
Inventors: 양주영
Original assignee: 엔에이치엔(주)
Priority date: 2007-03-26
Filing date: 2007-03-26
Publication date: 2009-02-23
Also published as: JP2008243202A; JP4703676B2; KR20080087356A

Abstract

The present invention relates to a method and system for automatically adding index words to a search database. The method according to the present invention comprises: receiving a first search query from a user; providing, from a search database, at least one first result document indexed by the first search query; receiving a second search query from the user following the input of the first search query; providing, from the search database, at least one second result document indexed by the second search query; receiving, from the user, a selection request for at least one of the second result documents; and adding the first search query to the search database as an index word of the selected second result document or documents.

Search database, search query, index word, user log tracking

Description

METHOD AND SYSTEM FOR ADDING AUTOMATIC INDEXING WORD TO SEARCH DATABASE

FIG. 1 is a diagram illustrating the structure of a search database and the result documents provided to a user when a search query is entered, according to the present invention.

FIG. 2 is a diagram illustrating a process of automatically adding an index word to a search database based on user log tracking, according to an embodiment of the present invention.

FIG. 3(a) is a diagram illustrating, according to an embodiment of the present invention, a case in which the first search query is added as an index word of the second result document selected by the user when a correlation exists between the first search query and the second search query.

FIG. 3(b) is a diagram illustrating, according to an embodiment of the present invention, a case in which the first search query is temporarily stored as a potential index word and is added as an index word later, once it satisfies an inspection criterion.

FIG. 3(c) is a diagram illustrating, according to an embodiment of the present invention, a case in which the first search query is immediately and automatically added as an index word of the second result document, but is automatically removed if the added index word receives no clicks for a certain period.

FIG. 4 is a diagram illustrating a process of adding the first search query as an index word of the second result document selected by the user, according to an embodiment of the present invention.

FIG. 5 is a block diagram illustrating a system for automatically adding index words to a search database, according to an embodiment of the present invention.

<Description of reference numerals for the main parts of the drawings>

502: search database

503: search query input unit

504: result document providing unit

505: selection request receiving unit

506: index word adding unit

The present invention relates to a method and system for automatically adding index words to a search database. More specifically, it relates to a method and system in which, when a user searches by entering a search query, the user's log is tracked and the search query entered by the user is added as a new index word to a result document stored in the search database.

When a user enters a search query into a search box, appropriate search results can be shown if the result documents for that query in the search database carry the entered query as an index word. Conventionally, however, in so-called internally built databases, where index words are created by the editors who manage the system, assigning an appropriate index word to a result document was sometimes inadvertently overlooked or done incorrectly, so the system could not adequately meet users' demand for accurate and fast search.

For example, suppose a user who wants to find a "badminton racket" enters "racket" as the search query. If the results consist only of documents for "tennis racket", "squash racket", and "table tennis racket", the user has to go to the trouble of searching again with "badminton" and "racket" combined. In particular, when search queries are related to some degree, as "racket" and "badminton racket" are in a broader/narrower relationship in this example, treating only one of the queries as an index word of a result document narrows the search results and makes it difficult to surface more detailed results, so the user's search needs are not met.

Even when a large number of result documents are stored in the search database, improperly assigned or missing index words force users to search repeatedly. On the search server side, meaningless search results put a strain on traffic; on the user side, additional time and effort are required.

To solve the above problems of the prior art, the present invention proposes a new technique concerning a method of automatically adding index words to a search database using user log tracking, and a system that performs the method.

An object of the present invention is to add a search query as an index word to a result document when the user selects that document from the results shown for the entered query, so that more accurate and broader search results can be provided when other users search later.

Another object of the present invention is to improve user satisfaction with search as classification becomes more fine-grained, by successively adding index words to other result documents.

To achieve the above objects and solve the problems of the prior art, a method of automatically adding index words to a search database according to an embodiment of the present invention may comprise: receiving a first search query from a user; providing, from a search database, at least one first result document indexed by the first search query; receiving a second search query from the user following the input of the first search query; providing, from the search database, at least one second result document indexed by the second search query; receiving, from the user, a selection request for at least one of the second result documents; and adding the first search query to the search database as an index word of the selected second result document or documents.

According to one aspect of the present invention, the step of adding the first search query to the search database as an index word of the selected second result document or documents may comprise: determining whether a correlation holds between the first search query and the second search query; and, if the correlation holds, adding the first search query to the search database as an index word of the selected second result document or documents.

A system for automatically adding index words to a search database according to an embodiment of the present invention may comprise: a search database that stores and maintains index words and result documents; a search query input unit that receives and stores a first search query and a second search query from a user; a result document providing unit that provides, from the search database, at least one first result document indexed by the first search query and at least one second result document indexed by the second search query; a selection request receiving unit that receives, from the user, a selection request for at least one of the first and second result documents; and an index word adding unit that adds the first search query to the search database as an index word of the selected second result document or documents.

Hereinafter, various embodiments of the present invention will be described in detail with reference to the accompanying drawings.

As shown in FIG. 1, the search database 101 is made up of result document databases 102, each of which stores result documents 103 as unit databases. In other words, the search database 101 can be regarded as one large database formed from a collection of unit databases.

Each result document 103 carries index words (tags), which are the words under which the document can appear in search results. When a user enters a search query, a search robot searches the result documents 103 collected in the search database 101 and provides appropriate search results to the user. For example, when the user enters query2 105 as a search query in the search box of the search interface 104, query2 105 is recognized as index word 2, and the search interface 104 displays D5, D6, D7, and D8, the result documents 106 that carry index word 2 among the result documents 103 stored in the search database 101. If the displayed result documents 106 are what the user wanted, the user simply clicks to select one and there is no problem. If the desired result is not displayed, however, the user has to enter another search query and search again. The present invention aims to solve this problem and improve user search satisfaction by adding search queries to result documents as index words.
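The lookup described above can be sketched as follows. This is a minimal illustration only: the `SearchDatabase` class, its method names, and the dictionary layout are assumptions made for the example and are not specified in the patent.

```python
from collections import defaultdict


class SearchDatabase:
    """Toy stand-in for the search database 101 (hypothetical layout)."""

    def __init__(self):
        # Maps a result document id to the set of index words (tags) it carries.
        self.index_words = defaultdict(set)

    def add_document(self, doc_id, index_words):
        self.index_words[doc_id].update(index_words)

    def search(self, query):
        # A document is returned only if the query matches one of its index words.
        return sorted(d for d, words in self.index_words.items() if query in words)


db = SearchDatabase()
for doc_id in ("D1", "D2", "D3", "D4"):
    db.add_document(doc_id, {"query1"})
for doc_id in ("D5", "D6", "D7", "D8"):
    db.add_document(doc_id, {"query2"})

results = db.search("query2")
```

Because matching is exact on index words, a document that lacks the right tag is invisible to the query, which is the gap the invention targets.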

The present invention can be seen as exploiting the fact that, when a user enters a second search query in succession within a short time after entering a first search query, the two search queries are related to some degree.
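One way to extract such consecutive query pairs from a user log is sketched below. The `QueryEvent` record, the `consecutive_pairs` function, and the 60-second window are all hypothetical; the patent says only that the second query follows the first within a short time.

```python
from dataclasses import dataclass


@dataclass
class QueryEvent:
    user_id: str
    query: str
    timestamp: float  # seconds; the unit is an assumption for this sketch


def consecutive_pairs(events, max_gap_seconds=60.0):
    """Return (first_query, second_query) pairs entered back to back by one user."""
    last_event = {}
    pairs = []
    for ev in sorted(events, key=lambda e: e.timestamp):
        prev = last_event.get(ev.user_id)
        if (
            prev is not None
            and ev.query != prev.query
            and ev.timestamp - prev.timestamp <= max_gap_seconds
        ):
            pairs.append((prev.query, ev.query))
        last_event[ev.user_id] = ev
    return pairs


log = [
    QueryEvent("u1", "racket", 0.0),
    QueryEvent("u2", "camera lens", 5.0),
    QueryEvent("u1", "badminton racket", 20.0),
    QueryEvent("u1", "tennis shoes", 500.0),  # too late to count as a follow-up
]
pairs = consecutive_pairs(log)
```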

In step S201, a first search query is initially received from the user, and in step S202, the first result documents indexed by the first search query are provided to the user from the search database 101. The number of first result documents depends on the result document database 102 indexed by the first search query in the search database 101, and may be one or more.

In step S203, a second search query is received from the user immediately following the input of the first search query, and in step S204, the second result documents indexed by the second search query are provided to the user from the search database 101. In one embodiment, the user may select a first result document and then go on to enter the second search query. In view of the object of the present invention, however, the more common case is that the user enters the second search query without clicking any of the first result documents indexed by the first search query. The number of second result documents likewise depends on the result document database 102 indexed by the second search query in the search database 101.

In step S205, a selection request for at least one of the second result documents is received from the user. In other words, the user selects the desired result document from those provided, and may select one or more result documents.

In step S206, the first search query may be added to the search database as an index word of the selected second result document. Even when a user selects no result document for the first search query and then selects a second result document shown for a second search query entered right afterward, the two search queries cannot always be assumed to be related. It is therefore open to question whether the first search query should be added directly as an index word of the second result document, and an appropriate criterion needs to be set when deciding whether to add it. How this criterion is set, and how the first search query is added as an index word of the second result document selected by the user, are described in more detail with reference to FIGS. 3(a), 3(b), and 3(c).
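Steps S201 through S206 can be sketched end to end as below. The function names, the dictionary-based index, and the pluggable `is_related` hook (which stands in for whatever criterion is applied in step S206) are assumptions for illustration, not the patent's implementation.

```python
def search(index, query):
    """Return ids of documents that carry the query as an index word."""
    return sorted(doc for doc, words in index.items() if query in words)


def handle_session(index, first_query, second_query, selected_docs,
                   is_related=lambda q1, q2: True):
    first_results = search(index, first_query)      # S201-S202
    second_results = search(index, second_query)    # S203-S204
    updated = []
    for doc in selected_docs:                       # S205: user's selections
        # S206: tag only documents that were actually shown for the second query,
        # and only if the (hypothetical) criterion accepts the query pair.
        if doc in second_results and is_related(first_query, second_query):
            index[doc].add(first_query)
            updated.append(doc)
    return first_results, second_results, updated


index = {
    "tennis_doc": {"racket", "tennis racket"},
    "badminton_doc": {"badminton racket"},
}
first, second, updated = handle_session(
    index, "racket", "badminton racket", ["badminton_doc"]
)
later_results = search(index, "racket")
```

After the session, a later search for "racket" also reaches the badminton document, which is the effect the method is meant to produce.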

FIG. 3(a) illustrates, according to an embodiment of the present invention, a case in which the first search query is added as an index word of the second result document selected by the user when a correlation exists between the first and second search queries. Looking at step S206 in more detail, it can be divided into step S301, which determines whether a correlation holds between the first search query and the second search query, and step S302, which adds the first search query to the search database as an index word of the selected second result document if the correlation holds.

In one embodiment, there are four ways to determine whether a correlation exists. First, a correlation can be said to hold when the first search query and the second search query satisfy an associated-query condition. For this condition, search query statistics are generated by measuring the frequencies of various features among the search queries associated with a result document set, the search queries themselves, search query categories, or result document categories. These statistics are formalized and analyzed using vectors, random variables, joint probability distributions, and the like, so that the relationship between search queries is expressed as a numeric search query association score. If the association score between the first and second search queries is at or above a certain threshold, the associated-query condition is satisfied, and a correlation is determined to hold between the two search queries.
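As one hedged illustration of an association score, the sketch below represents each query by a frequency vector over result document categories and compares the vectors by cosine similarity. The choice of cosine similarity, the category features, and the 0.5 threshold are assumptions; the patent names vectors, random variables, and joint probability distributions only in general terms.

```python
import math
from collections import Counter


def cosine_similarity(a, b):
    """Cosine similarity between two sparse frequency vectors (Counters)."""
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def satisfies_association(stats_q1, stats_q2, threshold=0.5):
    # The threshold value is hypothetical; the patent speaks only of "a certain threshold".
    return cosine_similarity(stats_q1, stats_q2) >= threshold


# Hypothetical per-query counts of result document categories.
racket = Counter({"sports": 8, "shopping": 2})
badminton_racket = Counter({"sports": 6, "shopping": 4})
recipe = Counter({"cooking": 10})
```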

Second, a correlation can be said to hold when the first search query includes the second search query, or the second search query includes the first. This is the case where the two search queries are in an inclusion relationship, in the sense that one is a broader term covering the other. For example, when the first search query is "Latin dance" and the second search query is "Latin dance club", the first search query includes the second, so a correlation is determined to hold between them. Likewise, when the first search query is "camera lens" and the second search query is "camera peripherals", the second search query includes the first, so a correlation is again determined to hold.

Third, a correlation can be said to hold when the first search query and the second search query contain the same word. For example, when the first search query is "ballad music" and the second search query is "ballad song", both contain the word "ballad", so a correlation is determined to hold.
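The second and third checks might look like the sketch below. Since the "camera lens" / "camera peripherals" example shows that inclusion is conceptual rather than a substring match, the inclusion check here consults a hypothetical `BROADER_TERM` table; that table, the function names, and the whitespace tokenization are all assumptions for illustration.

```python
# Hypothetical taxonomy: narrower query -> broader query that includes it.
BROADER_TERM = {
    "Latin dance club": "Latin dance",
    "camera lens": "camera peripherals",
}


def has_inclusion(q1, q2):
    """True if either query is recorded as the broader term of the other."""
    return BROADER_TERM.get(q1) == q2 or BROADER_TERM.get(q2) == q1


def shares_word(q1, q2):
    """True if the two queries have at least one whitespace-separated word in common."""
    return bool(set(q1.lower().split()) & set(q2.lower().split()))
```

Whitespace splitting is adequate for the English renderings used here; Korean queries would likely need a morphological analyzer instead.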

Fourth, a correlation can be said to hold when the search pattern from the first search query to the second search query occurs with a frequency at or above a predetermined threshold. The threshold is a numeric value derived from the pattern of users entering the first search query and then the second, broken down by hour, day, month, and year. The search pattern must be a meaningful one, which presupposes that the two search queries are related to some degree.
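A frequency check of this kind could be sketched as follows, counting how often each first-to-second transition occurs within a period. The tuple layout, the per-day bucketing, and the `min_count` value are assumptions; the patent does not say how the threshold is chosen.

```python
from collections import Counter


def frequent_transitions(transitions, min_count):
    """transitions: iterable of (period, first_query, second_query) tuples.

    Returns the (first_query, second_query) pairs seen at least min_count
    times within a single period.
    """
    counts = Counter(transitions)
    return {(q1, q2) for (period, q1, q2), n in counts.items() if n >= min_count}


log = (
    [("2007-03-26", "racket", "badminton racket")] * 3
    + [("2007-03-26", "racket", "pizza")]
)
related = frequent_transitions(log, min_count=3)
```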

FIG. 3(b) illustrates, according to an embodiment of the present invention, a case in which the first search query is not added as an index word right away but is temporarily stored as a potential index word and added as an index word later, once it satisfies an inspection criterion. Looking at step S206 in more detail, it can be divided into step S303, which stores the first search query in the search database as a potential index word of the selected second result document, and step S304, which adds a potential index word that satisfies a certain inspection criterion as an index word of the selected second result document. In other words, the first search query is not automatically added as an index word; it is stored in a potential index word field so that it can be added later, and an inspection unit inside the database, running regularly or irregularly, adds the stored potential index words that satisfy the inspection criterion as index words. The inspection criterion may be, for example, the frequency with which the potential index word is actually used as an index word of the second result document.
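The store-then-promote behavior of steps S303 and S304 can be sketched as below. The `PotentialIndexStore` class and the `min_hits` criterion (how many times the same query and document pair must be observed) are hypothetical; the patent leaves the exact inspection criterion open.

```python
class PotentialIndexStore:
    """Holds potential index words until an inspection pass promotes them."""

    def __init__(self, min_hits=2):
        self.min_hits = min_hits   # hypothetical inspection criterion
        self.hits = {}             # (doc_id, word) -> times observed
        self.index = {}            # doc_id -> set of confirmed index words

    def record(self, doc_id, word):
        # S303: store the first query as a potential index word of the document.
        key = (doc_id, word)
        self.hits[key] = self.hits.get(key, 0) + 1

    def inspect(self):
        # S304: promote potential index words that meet the criterion.
        promoted = []
        for (doc_id, word), n in list(self.hits.items()):
            if n >= self.min_hits:
                self.index.setdefault(doc_id, set()).add(word)
                del self.hits[(doc_id, word)]
                promoted.append((doc_id, word))
        return promoted


store = PotentialIndexStore(min_hits=2)
store.record("D6", "query1")
store.record("D6", "query1")
store.record("D8", "query1")
promoted = store.inspect()
```

Here `inspect` would be called by a scheduled or ad hoc job, matching the "regularly or irregularly" wording.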

FIG. 3(c) illustrates, according to an embodiment of the present invention, a case in which the first search query is immediately and automatically added as an index word of the second result document, but is automatically removed if the added index word receives no clicks for a certain period. This can be viewed as adding to step S206 a further step S305, which deletes the index word from the search database if the first search query is not selected during a specified period. It is a post-hoc verification approach: the query is first added as an index word, and it is later deleted automatically if the number of times users enter the first search query to find the second result document over a certain period is at or below a threshold set in the database.
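A pruning pass for step S305 might be sketched like this. The `prune_unused` function, the way clicks are tallied per review period, and the `min_clicks` default are assumptions for illustration.

```python
def prune_unused(index, added, clicks_in_period, min_clicks=1):
    """Remove auto-added index words that were used fewer than min_clicks times.

    index: doc_id -> set of index words
    added: set of (doc_id, word) pairs that were added automatically
    clicks_in_period: (doc_id, word) -> selections during the review period
    """
    removed = []
    for doc_id, word in sorted(added):
        if clicks_in_period.get((doc_id, word), 0) < min_clicks:
            index[doc_id].discard(word)   # S305: delete the unused index word
            removed.append((doc_id, word))
    return removed


index = {"D6": {"query1", "query2"}, "D8": {"query1", "query2"}}
added = {("D6", "query1"), ("D8", "query1")}
clicks = {("D6", "query1"): 4}
removed = prune_unused(index, added, clicks)
```

Only automatically added words are candidates for removal, so editor-assigned index words such as "query2" are left untouched.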

FIG. 4 illustrates a process of adding the first search query as an index word of the second result document selected by the user, according to an embodiment of the present invention.

First, when the user enters the first search query, query1, in the search box, the search results 401 for the first result documents are displayed. Inside the search database 101 described with reference to FIG. 1, the result documents classified under index word 1 are stored as D1, D2, D3, and D4, so these documents, which carry index word 1 corresponding to query1, appear as the search results 401 for the first result documents. It is assumed here that the user does not respond to the first result documents, for example by clicking on them.

Then, when the user goes on to enter the second search query, query2, in the search box, the search results 402 for the second result documents are displayed. D5, D6, D7, and D8, the result documents that carry index word 2 in the search database 101, appear as the search results 402 for the second result documents. It is assumed here that the user clicks on D6 and D8 to select them.

As a result, in the result document database 403 for index word 2 in the search database 101, the first search query is added as an index word to D6 and D8, the documents selected by the user. The specific way in which the first search query is added as an index word of the selected second result documents may differ according to the conditions described with reference to FIGS. 3(a), 3(b), and 3(c).
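The FIG. 4 walk-through can be replayed in a few lines. The dictionary layout is an assumption, and the query pair is accepted unconditionally here, ignoring the criteria of FIGS. 3(a) to 3(c).

```python
# D1-D4 carry index word 1 (query1); D5-D8 carry index word 2 (query2).
index = {f"D{i}": {"query1"} for i in range(1, 5)}
index.update({f"D{i}": {"query2"} for i in range(5, 9)})

first_query, second_query = "query1", "query2"
clicked = ["D6", "D8"]  # the user's selections among the second results

for doc_id in clicked:
    if second_query in index[doc_id]:
        index[doc_id].add(first_query)

# A later search for query1 now also reaches the clicked documents.
results_for_query1 = sorted(d for d, words in index.items() if first_query in words)
```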

The system 501 for automatically adding index words to a search database comprises a search database 502, a search query input unit 503, a result document providing unit 504, a selection request receiving unit 505, an index word adding unit 506, a potential index word unit 507, and an inspection unit 508.

The search database 502 may store and maintain result documents to which index words have been assigned. Referring to FIG. 1, the search database 502 is made up of result document databases 102, each of which stores result documents 103 as unit databases. In other words, the search database 502 can be regarded as one large database formed from a collection of unit databases. It may also include the potential index word unit 507, a space in which a search query is temporarily stored as a potential index word instead of being added directly as an index word of a result document, and the inspection unit 508, which determines whether a potential index word satisfies a certain inspection criterion.

The search query input unit 503 may provide a search interface to the user 509 within the system 501, and receive and store search queries. When operation is based on user log tracking, search queries may be received from the user 509 in succession. The result document providing unit 504 may provide, from the search database 502, the result documents that carry the index word corresponding to the search query received from the user.

In other words, it serves to provide the user with at least one result document indexed by the search query from the search database 502. When operation is based on user log tracking, all result documents for each of the successively entered search queries may be provided. Depending on what is stored in the search database, one or more result documents may be provided.

The selection request receiving unit 505 serves to receive a selection request for the document that the user 509 selects from the provided result documents. When operation is based on user log tracking, it may receive the selection requests of the user 509 for the result documents of the first and second search queries; when classification or related search terms are used, it may receive the selection request of the user 509 for the result documents of the search query the user entered. In general, a selection request is a click by the user 509 on a result document, and there may be one or more selection requests among the provided result documents.

The index word adding unit 506 serves to add a search query to the search database 502 as an index word of the result document selected by the user.

When index words are added based on user log tracking, the first search query may be added as an index word of at least one selected second result document. The ways in which the index word adding unit 506 adds an index word fall broadly into three types, described with reference to FIGS. 3(a), 3(b), and 3(c). Determining whether a correlation holds between the first search query and the second search query may be handled by a correlation determination unit inside the system.

The potential index word unit 507 temporarily stores a search query as a potential index word, instead of having the index word adding unit 506 store it directly in the search database 502 as an index word of a result document. In other words, it is a space in which a search query entered by the user is held provisionally, before being added to the search database 502 as an index word, in order to check whether it satisfies a certain inspection criterion.

The inspection unit 508 determines, periodically or at irregular intervals, whether a potential index word stored in the potential index word unit 507 satisfies the inspection criterion defined in the system and, when the criterion is met, adds the search query stored as the potential index word as an index word of the result document.
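
The interplay of the potential index word unit 507 and the inspection unit 508 can be sketched as below. The text does not say what the inspection criterion is, so this sketch assumes, purely for illustration, a minimum number of recorded selections (`min_selections`). `PotentialIndexStore` and `run_inspection` are hypothetical names, and the search database is represented by a plain dictionary.

```python
from collections import Counter
from typing import Dict, List, Set, Tuple


class PotentialIndexStore:
    """Holding area for (query, document) pairs awaiting inspection."""

    def __init__(self) -> None:
        self._counts: Counter = Counter()

    def record(self, query: str, doc_id: str) -> None:
        """Note one more selection that suggests query as an index word."""
        self._counts[(query, doc_id)] += 1

    def pending(self) -> Dict[Tuple[str, str], int]:
        return dict(self._counts)

    def inspect(self, min_selections: int) -> List[Tuple[str, str]]:
        """Remove and return the pairs that meet the assumed criterion."""
        passed = [p for p, n in self._counts.items() if n >= min_selections]
        for pair in passed:
            del self._counts[pair]
        return passed


def run_inspection(
    store: PotentialIndexStore,
    index: Dict[str, Set[str]],
    min_selections: int = 3,
) -> int:
    """Promote qualifying potential index words into the index.

    Returns the number of (query, document) pairs promoted.
    """
    promoted = store.inspect(min_selections)
    for query, doc_id in promoted:
        index.setdefault(query, set()).add(doc_id)
    return len(promoted)
```

`run_inspection` could be invoked on a schedule or on demand, matching the periodic or irregular inspection described above; pairs that do not yet meet the threshold stay in the store for a later pass.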

The method for automatically adding index words to a search database according to the present invention may be implemented as program instructions executable by various computer means and recorded on a computer-readable medium. The computer-readable medium may include program instructions, data files, data structures, and the like, alone or in combination. The program instructions recorded on the medium may be specially designed and constructed for the present invention, or may be known and available to those skilled in computer software. Examples of computer-readable recording media include magnetic media such as hard disks, floppy disks, and magnetic tape; optical media such as CD-ROMs and DVDs; magneto-optical media such as floptical disks; and hardware devices specially configured to store and execute program instructions, such as ROM, RAM, and flash memory. Examples of program instructions include not only machine code such as that produced by a compiler, but also high-level language code that a computer can execute using an interpreter or the like. The hardware devices described above may be configured to operate as one or more software modules to perform the operations of the present invention, and vice versa.

As described above, although the present invention has been described with reference to limited embodiments and drawings, it is not limited to those embodiments, and a person of ordinary skill in the art to which the invention pertains can make various modifications and variations from this description.

Therefore, the scope of the present invention should not be limited to the described embodiments, but should be determined by the claims below and by their equivalents.

According to the present invention, a method and system can be provided for automatically adding index words to a search database using user log tracking.

According to the present invention, when the user selects a result document presented for an entered search query, the search query is added to that result document as an index word, so that later searches by other users can be given more accurate and broader search results.

According to the present invention, search queries are added to the search database in a chained manner as index words of result documents, with the aim of improving users' satisfaction with search as the classification becomes more finely divided.

Claims

A method for automatically adding an index word to a search database, the method comprising:

receiving a first search query from a user;

providing, from a search database, at least one first result document indexed by the first search query;

receiving a second search query from the user following the input of the first search query;

providing, from the search database, at least one second result document indexed by the second search query;

receiving, from the user, a selection request for at least one of the at least one second result document; and

adding the first search query to the search database as an index word of the selected at least one second result document.

The method of claim 1,

wherein receiving the second search query following the input of the first search query comprises:

receiving the second search query from the user without a selection request from the user for the at least one first result document indexed by the first search query.

The method of claim 1,

wherein adding the first search query to the search database as an index word of the selected at least one second result document comprises:

determining that a correlation is established between the first search query and the second search query when the first search query includes the second search query or the second search query includes the first search query; and

adding the first search query to the search database as an index word of the selected at least one second result document when the correlation is established.

The method of claim 3, further comprising:

determining that a correlation is established when the first search query and the second search query satisfy a related-query condition of the search database,

wherein the related-query condition is a condition for determining whether the degree of association between the first search query and the second search query, according to statistical information on search query frequency, is greater than a predetermined reference value.

(Deleted)

The method of claim 3, further comprising:

determining that a correlation is established when the first search query and the second search query include the same word.

The method of claim 3, further comprising:

determining that a correlation is established when a search pattern from the first search query to the second search query occurs at a frequency equal to or greater than a predetermined reference value.

The method of claim 1, further comprising:

storing the first search query in the search database as a potential index word of the selected at least one second result document; and

adding the potential index word that satisfies a certain inspection criterion as an index word of the selected at least one second result document.

The method of claim 1, further comprising:

deleting the index word from the search database when the first search query is not selected for a predetermined period of time.

A computer-readable recording medium having recorded thereon a program for executing the method of any one of claims 1 to 4 or 6 to 9.

A system for automatically adding an index word to a search database, the system comprising:

a search query input unit that provides a search interface and receives and stores a first search query and a second search query from a user;

a result document providing unit that provides, from a search database that stores and maintains indexed result documents, at least one first result document indexed by the first search query and at least one second result document indexed by the second search query;

a selection request receiving unit that receives, from the user, a selection request for at least one of the at least one first result document and the at least one second result document; and

an index word adding unit that adds the first search query to the search database as an index word of the selected at least one second result document.

The system of claim 11,

wherein the search query input unit receives the second search query from the user in succession, without a selection request from the user for the at least one first result document indexed by the first search query.

The system of claim 11, further comprising:

a correlation determination unit that determines that a correlation is established between the first search query and the second search query when the first search query includes the second search query or the second search query includes the first search query,

wherein the index word adding unit adds the first search query to the search database as an index word of the selected at least one second result document when the correlation is established between the first search query and the second search query.

The system of claim 13,

wherein the correlation determination unit determines that a correlation is established when the first search query and the second search query satisfy a related-query condition of the search database, and

wherein the related-query condition is a condition for determining whether the degree of association between the first search query and the second search query, according to statistical information on search query frequency, is greater than a predetermined reference value.

(Deleted)

The system of claim 13,

wherein the correlation determination unit determines that a correlation is established when the first search query and the second search query include the same word.

The system of claim 13,

wherein the correlation determination unit determines that a correlation is established when a search pattern from the first search query to the second search query occurs at a frequency equal to or greater than a predetermined reference value.

The system of claim 11, further comprising:

a potential index word unit that stores and maintains a search query as a potential index word of the result document; and

an inspection unit that determines whether the potential index word satisfies a certain inspection criterion,

wherein the index word adding unit adds the potential index word that the inspection unit has determined to satisfy the inspection criterion as an index word of the selected at least one second result document.

The system of claim 11,

wherein the index word adding unit deletes the index word from the search database when the first search query is not selected for a predetermined period of time.