KR101348877B1

KR101348877B1 - Apparatus and method for set－based searching

Info

Publication number: KR101348877B1
Application number: KR1020120044757A
Authority: KR
Inventors: 이재원; 이종우
Original assignee: 숙명여자대학교산학협력단
Priority date: 2012-04-27
Filing date: 2012-04-27
Publication date: 2014-01-08
Also published as: KR20130121503A

Abstract

집합 기반의 검색 장치 및 방법이 개시된다. 집합 기반의 검색 방법은, 복수 개의 단어를 저장하는 POI 데이터베이스를 유지하는 단계와, 상기 복수 개의 단어를 구성하는 음절 각각에 대응하여, 상기 각각의 음절이 포함된 상기 POI 데이터베이스의 단어에 관한 제1 정보를 저장하는 역배열 데이터베이스를 유지하는 단계, 질의어의 입력에 응답하여, 상기 역배열 데이터베이스를 참조하여, 상기 질의어를 구성하는 음절에 대응하는 제1 정보를 식별하는 단계 및 상기 식별된 제1 정보 중에서 공통되는 제1 정보의 단어를 검색단어로 출력하는 단계를 포함한다.Set-based retrieval apparatus and method are disclosed. A set-based retrieval method may include maintaining a POI database storing a plurality of words, and corresponding to each syllable constituting the plurality of words, a first of the words of the POI database including the respective syllables. Maintaining an inverse array database for storing information, in response to input of a query word, identifying first information corresponding to a syllable constituting the query word with reference to the inverse array database, and common among the identified first information And outputting a word of the first information to be a search word.

Description

Set-based search device and method {APPARATUS AND METHOD FOR SET-BASED SEARCHING}

본 발명의 실시예들은 질의어를 구성하는 음절을 이용하여, 질의어와 연관된 검색단어를 추출하여 제공하는 기술에 관한 것이다.Embodiments of the present invention relate to a technique for extracting and providing a search word associated with a query by using syllables constituting a query.

쿼리 검색 기술은 다양한 분야(예컨대, 웹 검색, 네비게이션 등)에서 적용되고 있다. 종래의 쿼리 검색 기술은 입력된 질의어를 일부로서 포함하는 검색단어를 데이터베이스로부터 추출하여 제공할 수 있다.Query search technology is applied in various fields (eg, web search, navigation, etc.). Conventional query retrieval techniques can extract and provide a search word including a part of an input query word from a database.

이때, 종래의 쿼리 검색 기술(예컨대, 앨리어스(alias) 기술)은 질의어를 일부로서 포함하는 모든 검색단어를 검색하여 제공 함으로써, 검색 선호도가 상대적으로 낮은 검색단어를 제공하게 될 뿐만 아니라, 검색단어를 추출하기 위한 검색 시간 또는 검색 량이 많게 된다.In this case, the conventional query search technique (eg, an alias technique) searches and provides all search words including a query as a part, thereby providing a search word having a relatively low search preference, and providing a search word. The search time or the search amount to extract is large.

따라서, 검색 선호도가 상대적으로 높은 검색단어를 보다 효율적으로 추출하여 제공하는 기술이 필요하다.Accordingly, there is a need for a technique for more efficiently extracting and providing a search word having a relatively high search preference.

본 발명은 질의어를 구성하는 음절 각각에 대응하는 단어에 관한 정보를 데이터베이스에서 확인하고, 확인된 단어에 관한 정보 중에서 공통되는 정보의 단어를, 질의어와 연관된 검색단어로서 제공 함으로써, 질의어를 일부로서 포함하는 모든 검색단어를 검색하여 제공하는 기존 검색 기술에 비해, 검색 시간 또는 검색 량이 적으면서도, 질의어의 음절과 최적하게 일치하면서 검색 선호도가 상대적으로 높은 검색단어를 보다 효율적으로 추출하여 제공하는 것을 목적으로 한다.The present invention includes a query word as a part by checking information on a word corresponding to each syllable constituting a query word in a database, and providing a word of information common among information about the checked word as a search word associated with the query word. Compared to the existing search technology that searches and provides all search words that are searched for, the purpose of the present invention is to efficiently search for and provide search words with a relatively high search preference while matching the syllables of the query word with less searching time or search volume. do.

본 발명은 검색 결과, 검색단어가 복수일 경우, 검색단어를 구성하는 제1 음절 쌍 내 음절 간 순서와, 상기 질의어를 구성하는 제2 음절 쌍 내 음절 간 순서의 일치 정도가 클수록 우선하여 출력 함으로써, 질의어의 음절 배열과 가장 근접하게 음절이 배열된 검색단어를 보다 용이하게 인지할 수 있게 하는 것을 목적으로 한다.According to the present invention, when there are a plurality of search words, the higher the degree of correspondence between the order between the syllables in the first syllable pair constituting the search word and the order between the syllables in the second syllable pair constituting the query word, the higher priority is output. In addition, the purpose of the present invention is to make it easier to recognize a search word having a syllable arranged closest to a syllable array of a query word.

상기의 목적을 이루기 위한 집합 기반의 검색 방법은, 복수 개의 단어를 저장하는 POI 데이터베이스를 유지하는 단계와, 상기 복수 개의 단어를 구성하는 음절 각각에 대응하여, 상기 각각의 음절이 포함된 상기 POI 데이터베이스의 단어에 관한 제1 정보를 저장하는 역배열 데이터베이스를 유지하는 단계와, 질의어의 입력에 응답하여, 상기 역배열 데이터베이스를 참조하여, 상기 질의어를 구성하는 음절에 대응하는 제1 정보를 식별하는 단계, 및 상기 식별된 제1 정보 중에서 공통되는 제1 정보의 단어를 검색단어로 출력하는 단계를 포함할 수 있다.In order to achieve the above object, a set-based retrieval method includes maintaining a POI database storing a plurality of words, and corresponding to each syllable constituting the plurality of words, wherein the POI database includes each syllable. Maintaining an inverse array database that stores first information about a word of, identifying a first information corresponding to a syllable constituting the query by referring to the inverse array database, in response to input of a query; The method may include outputting a word of first information that is common among the identified first information as a search word.

다른 실시예로서, 집합 기반의 검색 방법은, 복수 개의 단어를 저장하는 POI 데이터베이스를 유지하는 단계와, 상기 복수 개의 단어를 구성하는 모음 튜플 각각에 대응하여, 상기 각각의 모음 튜플이 포함된 상기 POI 데이터베이스의 단어를 저장하는 모음 튜플 역배열 데이터베이스를 유지하는 단계와, In another embodiment, the set-based search method may include maintaining a POI database that stores a plurality of words, and corresponding to each vowel tuple constituting the plurality of words, wherein the POI includes the respective vowel tuples. Maintaining a vowel tuple inverse array that stores words in the database;

상기 복수 개의 단어를 구성하는 모음 트리플 각각에 대응하여, 상기 각각의 모음 트리플이 포함된 상기 POI 데이터베이스의 단어를 저장하는 모음 트리플 역배열 데이터베이스를 유지하는 단계, 및 질의어의 입력에 응답하여, 상기 모음 튜플 역배열 데이터베이스 및 모음 트리플 역배열 데이터베이스를 참조하여, 상기 질의어에 대응하는 단어를 검색단어로 출력하는 단계를 포함할 수 있다.Maintaining a vowel triple inverse array database for storing words of the POI database including the vowel triples corresponding to each vowel triple constituting the plurality of words, and in response to input of a query, the vowel tuple The method may include outputting a word corresponding to the query word as a search word by referring to an inverse array database and a vowel triple inverse array database.

상기 방법을 구현하는 집합 기반의 검색 장치는, 복수 개의 단어를 저장하는 POI 데이터베이스와, 상기 복수 개의 단어를 구성하는 음절 각각에 대응하여, 상기 각각의 음절이 포함된 상기 POI 데이터베이스의 단어에 관한 제1 정보를 저장하는 역배열 데이터베이스와, 질의어의 입력에 응답하여, 상기 역배열 데이터베이스를 참조하여, 상기 질의어를 구성하는 음절에 대응하는 제1 정보를 식별하는 프로세서 및 상기 식별된 제1 정보 중에서 공통되는 제1 정보의 단어를 검색단어로 출력하는 출력부를 포함하여 구성할 수 있다.The set-based retrieval apparatus implementing the method may include a POI database storing a plurality of words and a word relating to a word of the POI database including the syllables corresponding to each syllable constituting the plurality of words. A processor that identifies first information corresponding to the syllables constituting the query word by referring to the inverse array database storing the first information, and in response to input of the query word, a first common word among the identified first information; 1 may include an output unit for outputting a word of information as a search word.

또한, 다른 실시예로서, 집합 기반의 검색 장치는, 복수 개의 단어를 저장하는 POI 데이터베이스와, 상기 복수 개의 단어를 구성하는 모음 튜플 각각에 대응하여, 상기 각각의 모음 튜플이 포함된 상기 POI 데이터베이스의 단어를 저장하는 모음 튜플 역배열 데이터베이스와, 상기 복수 개의 단어를 구성하는 모음 트리플 각각에 대응하여, 상기 각각의 모음 트리플이 포함된 상기 POI 데이터베이스의 단어를 저장하는 모음 트리플 역배열 데이터베이스, 및 질의어의 입력에 응답하여, 상기 모음 튜플 역배열 데이터베이스 및 모음 트리플 역배열 데이터베이스를 참조하여, 상기 질의어에 대응하는 단어를 검색단어로 출력하는 출력부를 포함하여 구성할 수 있다.In another embodiment, the set-based retrieval apparatus may further include a POI database that stores a plurality of words, and a POI database including the respective vowel tuples corresponding to each of the vowel tuples constituting the plurality of words. A vowel tuple inverse array database for storing words, a vowel triple inverse array database for storing words of the POI database including the vowel triples corresponding to each of the vowel triples constituting the plurality of words, and an input of a query word. In response, the vowel tuple inverse array database and the vowel triple inverse array database may be referred to and include an output unit for outputting a word corresponding to the query word as a search word.

본 발명의 실시예에 따르면, 질의어를 구성하는 음절 각각에 대응하는 단어에 관한 정보를 데이터베이스에서 확인하고, 확인된 단어에 관한 정보 중에서 공통되는 정보의 단어를, 질의어와 연관된 검색단어로서 제공 함으로써, 질의어를 일부로서 포함하는 모든 검색단어를 검색하여 제공하는 기존 검색 기술에 비해, 검색 시간 또는 검색 량이 적으면서도, 질의어의 음절과 최적하게 일치하면서 검색 선호도가 상대적으로 높은 검색단어를 보다 효율적으로 추출하여 제공할 수 있다.According to an embodiment of the present invention, by confirming information about a word corresponding to each syllable constituting a query word in a database, and providing a word of information common among the information about the checked word as a search word associated with the query word, Compared to the existing search technology that searches and provides all search words including the query as a part, the search word or search volume is optimized, and the search words with high search preference are relatively efficiently matched with the optimal syllable of the query syllable. Can provide.

본 발명의 실시예에 따르면, 검색 결과, 검색단어가 복수일 경우, 검색단어를 구성하는 제1 음절 쌍 내 음절 간 순서와, 상기 질의어를 구성하는 제2 음절 쌍 내 음절 간 순서의 일치 정도가 클수록 우선하여 출력 함으로써, 질의어의 음절 배열과 가장 근접하게 음절이 배열된 검색단어를 보다 용이하게 인지할 수 있게 한다.According to an embodiment of the present invention, when there are a plurality of search words, the degree of agreement between the syllables in the first syllable pair constituting the search word and the order between the syllables in the second syllable pair constituting the query word The larger the first value, the easier it is to recognize the search words arranged syllables closest to the syllable array of the query.

도 1은 본 발명의 일실시예에 따른 집합 기반의 검색 장치의 구성을 나타내는 도면이다.
도 2는 본 발명의 일실시예에 따른 집합 기반의 검색 장치에서의 단어 별 단어식별자 및 음절식별자 할당에 대한 일례를 도시한 도면이다.
도 3은 본 발명의 일실시예에 따른 집합 기반의 검색 장치에서의 음절식별자에 대응하는 단어식별자 구성에 대한 일례를 도시한 도면이다.
도 4는 본 발명의 일실시예에 따른 집합 기반의 검색 장치에서의 질의어 입력에 따른 검색에 대한 일례를 도시한 도면이다.
도 5는 본 발명의 일실시예에 따른 집합 기반의 검색 장치에서의 단어 별 단어식별자 및 모음식별자 할당에 대한 일례를 도시한 도면이다.
도 6은 본 발명의 일실시예에 따른 집합 기반의 검색 장치에서의 모음식별자에 대응하는 단어식별자 구성에 대한 일례를 도시한 도면이다.
도 7은 본 발명의 일실시예에 따른 집합 기반의 검색 장치에서의 질의어 입력에 따른 검색에 대한 다른 일례를 도시한 도면이다.
도 8은 본 발명의 일실시예에 따른 집합 기반의 검색 장치에서의 검색단어 출력 순서를 설명하기 위한 도시한 도면이다.
도 9는 본 발명의 일실시예에 따른 집합 기반의 검색 방법을 나타내는 흐름도이다.1 is a diagram illustrating a configuration of a set based search apparatus according to an exemplary embodiment of the present invention.
2 is a diagram illustrating an example of word word identifier and syllable identifier assignment for each word in a set based search apparatus according to an exemplary embodiment of the present invention.
3 is a diagram illustrating an example of a word identifier configuration corresponding to a syllable identifier in a set based search apparatus according to an embodiment of the present invention.
4 is a diagram illustrating an example of a search based on a query input in a set-based search apparatus according to an embodiment of the present invention.
FIG. 5 is a diagram illustrating an example of word word and vowel identifier assignment for each word in a set based search apparatus according to an exemplary embodiment of the present invention.
6 is a diagram illustrating an example of a word identifier configuration corresponding to a vowel identifier in a set based search apparatus according to an exemplary embodiment of the present invention.
FIG. 7 illustrates another example of searching based on a query input in a set-based searching apparatus according to an embodiment of the present invention.
8 is a diagram illustrating a search word output order in a set-based search apparatus according to an embodiment of the present invention.
9 is a flowchart illustrating a set-based search method according to an embodiment of the present invention.

이하, 첨부된 도면들을 참조하여 본 발명의 일실시예에 따른 집합 기반의 검색 장치 및 방법에 대해 상세히 설명한다.Hereinafter, a set-based searching apparatus and method according to an embodiment of the present invention will be described in detail with reference to the accompanying drawings.

도 1은 본 발명의 일실시예에 따른 집합 기반의 검색 장치의 구성을 나타내는 도면이다.1 is a diagram illustrating a configuration of a set based search apparatus according to an exemplary embodiment of the present invention.

도 1을 참조하면, 본 발명의 일실시예에 따른 집합 기반의 검색 장치(100)는 할당부(101), 프로세서(103), 출력부(105) 및 데이터베이스(107)를 포함한다.Referring to FIG. 1, the set-based retrieval apparatus 100 according to an embodiment of the present invention includes an allocation unit 101, a processor 103, an output unit 105, and a database 107.

우선, 데이터베이스(107)는 복수 개의 단어를 저장하는 POI 데이터베이스(109)와, 상기 복수 개의 단어를 구성하는 음절 각각에 대응하여, 상기 각각의 음절이 포함된 상기 POI 데이터베이스(109)의 단어에 관한 제1 정보를 저장하는 역배열 데이터베이스(111)를 포함할 수 있다.First, the database 107 relates to a POI database 109 for storing a plurality of words, and a word for the word in the POI database 109 including the respective syllables, corresponding to each syllable constituting the plurality of words. It may include a reverse array database 111 for storing the first information.

POI 데이터베이스(109)는 관심지점(Point of interest)의 명칭(이름)을 단어로 기록하는 저장 수단으로, 예컨대 상호, 지명 등의 단어를 저장할 수 있다. POI 데이터베이스(109)의 단어는, 후술하는 할당부(101)에 의해 고유의 단어식별자가 할당되어, 신속한 검색을 지원하게 된다.The POI database 109 is a storage means for recording a name (name) of a point of interest as a word. For example, the POI database 109 may store words such as a trade name and a place name. Words of the POI database 109 are assigned unique word identifiers by the assignment unit 101 to be described later to support rapid retrieval.

역배열 데이터베이스(111)는 POI 데이터베이스(109)에 저장된 단어를 구성하는 음절을 기준으로 단어에 관한 제1 정보를 정렬시켜 기록하는 저장 수단이다. 여기서, 제1 정보는 상기 복수 개의 단어 각각에 할당된 단어식별자일 수 있다. 역배열 데이터베이스(111)의 음절은, 할당부(101)에 의해 고유한 음절식별자가 할당될 수 있다. 따라서, 일실시예에서, 역배열 데이터베이스(111)는 음절식별자에 대응하여 단어식별자가 기록되는 형태, 즉 음절을 포함하는 단어를 식별할 수 있는 정보를 저장하게 된다.The inverse array database 111 is storage means for sorting and recording first information about a word based on syllables constituting a word stored in the POI database 109. Here, the first information may be a word identifier assigned to each of the plurality of words. The syllable of the inverse array database 111 may be assigned a unique syllable identifier by the allocator 101. Therefore, in one embodiment, the inverse array database 111 stores information for identifying a word including a syllable in a form in which the word identifier is recorded corresponding to the syllable identifier.

할당부(101)는 POI 데이터베이스(109)에 저장되는 단어 각각에 대해 단어식별자를 할당하고, 상기 단어를 구성하는 음절을 배열하여 저장하는 역배열 데이터베이스(111) 내 음절 각각에 대해 음절식별자를 할당한다.The allocator 101 assigns a word identifier to each word stored in the POI database 109 and assigns a syllable identifier to each syllable in the inverse array database 111 that arranges and stores the syllables constituting the word. .

프로세서(103)는 질의어가 입력되면, 질의어 입력에 연동하여, 상기 입력된 질의어를 구성하는 음절 각각에 대한 음절식별자와, 상기 음절식별자에 대응하는 단어식별자를 데이터베이스(107)에서 확인한다. 즉, 프로세서(103)는 질의어의 입력에 응답하여, 역배열 데이터베이스(111)를 참조하여, 상기 질의어를 구성하는 음절에 대응하는 제1 정보를 식별한다. 이를 통해 프로세서(103)는 이후에, 상기 식별된 제1 정보(즉, 음절식별자를 이용하여 확인된 단어식별자) 중에서 공통되는 제1 정보의 단어를 POI 데이터베이스(109)로부터 추출하여 검색단어로서 출력되도록 출력부(105)를 제어하는 환경을 마련할 수 있다.When a query is input, the processor 103 checks the syllable identifier for each syllable constituting the input query word and a word identifier corresponding to the syllable identifier in the database 107 in association with the query input. That is, the processor 103 may identify the first information corresponding to the syllables constituting the query by referring to the inverse array database 111 in response to the input of the query. Through this, the processor 103 extracts a word of the first information that is common among the identified first information (ie, a word identifier identified using a syllable identifier) from the POI database 109 and outputs it as a search word. It is possible to provide an environment in which the output unit 105 is controlled.

프로세서(103)는 음절과 연관된 단어식별자를 이용하여 검색단어를 도출할 수 있는 환경을 제공 함으로써, 검색하고자 하는 상호명의 전체 명칭이 아닌, 축약된 형태로서의 질의어가 입력될 경우에도, 검색하고자 하는 상호명에 연관된 검색단어를 제공 함에 따라, 질의어 입력을 이용하게 한다.The processor 103 provides an environment in which a search word can be derived by using a word identifier associated with a syllable, so that even when a query word in abbreviated form is input, rather than the full name of a business name to be searched, the processor name to be searched is input. By providing a search word associated with, the query input is made available.

예컨대, 프로세서(103)는 '김정열클래식기타스튜디오'가 축약된 형태로서, '김클기'의 질의어가 입력될 경우, '김정열클래식기타스튜디오'를 검색단어로서 제공할 수 있다.For example, the processor 103 may be a form in which "Kim Jung-yeol Classical Guitar Studio" is abbreviated, and "Kim Jung-yeol Classical Guitar Studio" may be provided as a search word when a query of "Kim Kyeol-gi Classical Studio" is input.

상기 제1 정보의 식별에 있어서, 프로세서(103)는 상기 질의어를 구성하는 음절 "모두"에 대응하는 제1 정보 만을 식별할 수 있다. 즉, 프로세서(103)는 질의어의 모든 음절에 대응하는 제1 정보를 역배열 데이터베이스(111)로부터 검색해 낼 수 있다.In the identification of the first information, the processor 103 may identify only the first information corresponding to the syllable “all” constituting the query word. That is, the processor 103 may retrieve the first information corresponding to all syllables of the query word from the inverse array database 111.

상기 제1 정보의 다른 식별 예로서, 프로세서(103)는 상기 질의어를 구성하는 음절 중 적어도 일부에 대응하는 제1 정보를 식별할 수 있다. 즉, 프로세서(103)는 예컨대 약어로 등록된 단어의 제1 정보를 역배열 데이터베이스(111)로부터 검색해 낼 수 있게 할 수 있다.As another example of identifying the first information, the processor 103 may identify first information corresponding to at least some of the syllables constituting the query word. In other words, the processor 103 can retrieve, for example, the first information of the word registered as the abbreviation from the inverse array database 111.

출력부(105)는 프로세서(103)에 의해, 음절식별자를 이용하여 확인된 단어식별자 중에서 공통되는 단어식별자의 단어를 검색단어로서 출력한다. 즉, 출력부(105)는 상기 식별된 제1 정보 중에서 공통되는 제1 정보의 단어를 검색단어로 출력하는 기능을 한다.The output unit 105 outputs, as a search word, a word of a common word identifier among the word identifiers identified by the syllable identifier by the processor 103. That is, the output unit 105 outputs a word of the first information that is common among the identified first information as a search word.

역배열 데이터베이스(111)에 상기 제1 정보로서, 상기 복수 개의 단어 각각에 대한 단어식별자가 저장되는 조건하에서, 프로세서(103)는, 상기 질의어를 구성하는 음절에 각각 대응하여 저장되는 단어식별자들을 역배열 데이터베이스(111)에서 식별하고, 출력부(105)는 상기 식별된 단어식별자들 중에서 공통되는 단어식별자에 의해 지정되는 단어를, POI 데이터베이스(109)로부터 검색단어로서 출력시킬 수 있다.Under the condition that the word identifiers for each of the plurality of words are stored as the first information in the inverse array database 111, the processor 103 may select word identifiers stored in correspondence with syllables constituting the query word. Identifying at 111, the output unit 105 may output, from the POI database 109, a word designated by a word identifier common among the identified word identifiers as a search word.

본 발명의 다른 실시예로서, 출력부(105)는 상기 출력되는 검색단어가 복수일 경우, 상기 검색단어를 구성하는 제1 음절과, 상기 질의어를 구성하는 제2 음절과의 일치되는 개수에 따라 가중치를 부여하고, 가중치가 상대적으로 높게 부여된 검색단어를 우선하여 출력할 수 있다. 즉, 출력부(105)는 상기 검색단어를 구성하는 제1 음절과, 상기 질의어를 구성하는 제2 음절과의 일치하는 수가 많을수록, 사용자의 시선이 많이 가는 화면 상으로 출력되도록 할 수 있다.According to another embodiment of the present invention, the output unit 105 according to the output number of the plurality of search words, according to the number of matches between the first syllable constituting the search word and the second syllable constituting the query word The search word given a weight and a relatively high weight may be given priority. That is, the output unit 105 may output the user's gaze on the screen with a greater number of matches between the first syllable constituting the search word and the second syllable constituting the query word.

또 달리, 출력부(105)는 상기 제1 음절의 조합과, 상기 제2 음절의 조합과의 일치되는 개수를 고려하여 가중치를 부여하고, 가중치가 상대적으로 높게 부여된 검색단어를 우선하여 출력할 수 있다. 즉, 출력부(105)는 상기 검색단어를 구성하는 제1 음절 쌍 내 음절 간 순서와, 상기 질의어를 구성하는 제2 음절 쌍 내 음절 간 순서의 일치 정도가 클수록, 사용자의 시선이 많이 가는 화면 상으로 출력되도록 할 수 있다.Alternatively, the output unit 105 assigns weights in consideration of the matching number of the combination of the first syllable and the combination of the second syllable, and preferentially outputs the search word given the relatively high weight. Can be. That is, the output unit 105 is a screen where the gaze of the user becomes greater as the degree of agreement between the syllables in the first syllable pair constituting the search word and the order between the syllables in the second syllable pair constituting the query word increases. Can be output to the

한편, 앞서 설명된 실시예로 구현하더라도 검색단어의 출력이 불가할 경우(즉, 공통되는 제1 정보가 없는 경우), 본 발명의 집합 기반의 검색 장치(100)는 질의어의 모음조합을 이용하여 검색단어를 출력할 수 있다.On the other hand, even when implemented in the embodiment described above, if the output of the search word is not possible (that is, there is no common first information), the set-based search apparatus 100 of the present invention by using a combination of query words You can output the search word.

이를 위해, 집합 기반의 검색 장치(100)는 데이터베이스(107)를 재구성할 수 있다. 즉, 검색 장치(100)는, 2개의 모음조합에 관한 모음 튜플 역배열 데이터베이스(113)과, 3개의 모음조합에 관한 모음 트리플 역배열 데이터베이스(115)를 포함하여, 데이터베이스(107)를 재구성한다.To this end, the set-based search apparatus 100 may reconstruct the database 107. That is, the retrieval apparatus 100 reconstructs the database 107 including the vowel tuple inverse array database 113 for two vowel combinations and the vowel triple inverse array database 115 for three vowel combinations.

모음 튜플 역배열 데이터베이스(113)은 복수 개의 단어를 구성하는 모음 튜플 각각에 대응하여, 상기 각각의 모음 튜플이 포함된 POI 데이터베이스(109)의 단어에 관한 제2 정보를 저장한다. 여기서, 모음 튜플은 단어 내 연속하는 2개의 음절의 모음(중성) 조합을 지칭하며, 예컨대 단어 '중계스타클래스'에 대해, 'ㅜ ㅖ', 'ㅖ ㅡ' 등일 수 있다.The vowel tuple inverse array database 113 stores second information about words in the POI database 109 including the respective vowel tuples, corresponding to each vowel tuple constituting a plurality of words. Here, the vowel tuple refers to a vowel (neutral) combination of two consecutive syllables in a word. For example, the vowel tuple may be 'TT ㅖ', 'ㅖ ㅡ' and the like for the word 'relay star class'.

모음 트리플 역배열 데이터베이스(115)은 복수 개의 단어를 구성하는 모음 트리플 각각에 대응하여, 상기 각각의 모음 트리플이 포함된 POI 데이터베이스(109)의 단어에 관한 제2 정보를 저장한다. 여기서, 모음 트리플은 단어 내 연속하는 3개의 음절의 모음(중성) 조합을 지칭하며, 예컨대 단어 '중계스타클래스'에 대해, 'ㅜ ㅖ ㅡ', 'ㅖ ㅡ ㅏ' 등일 수 있다.The vowel triple inverse array database 115 stores second information about words in the POI database 109 including the vowel triples corresponding to each vowel triple constituting a plurality of words. Here, the vowel triple refers to a vowel (neutral) combination of three consecutive syllables in a word. For example, the vowel triple may be 'TT ㅖ ㅡ' or 'ㅖ ㅡ ㅏ' for the word 'relay star class'.

상기 데이터베이스(107)가 재구성되면, 할당부(101)는 모음 튜플과 모음 트리플 각각에 대해 모음식별자를 할당하여 모음 튜플 역배열 데이터베이스(113)와 모음 트리플 역배열 데이터베이스(115)에 저장한다.When the database 107 is reconfigured, the allocator 101 allocates a vowel identifier to each of the vowel tuples and the vowel triples and stores them in the vowel tuple inverse array database 113 and the vowel triple inverse array database 115.

이때, 할당부(101)는 모음조합(모음 튜플과 모음 트리플)에 포함되는 모음과 동음의 다른 모음이 존재할 경우, 상기 다른 모음을 갖는 모음조합으로 변경하여 모음식별자를 할당할 수 있다. 즉, 할당부(101)는 모음 튜플 역배열 데이터베이스(113), 모음 트리플 역배열 데이터베이스(111), 및 상기 질의어에서, 미리 정해진 복수 개의 모음을, 하나의 대표 모음으로 할 수 있다. 예컨대, 할당부(101)는 모음 튜플 'ㅏ ㅐ'와 'ㅏ ㅔ'를 동일하게 취급하여 단일의 모음식별자를 할당할 수 있다.At this time, if there is another vowel in the vowel combination (vowel tuple and vowel triple) included in the vowel combination (vowel tuple and vowel triple), the assigning unit 101 may change the vowel identifier with the vowel combination. That is, the allocator 101 may set a plurality of predetermined vowels as one representative vowel in the vowel tuple inverse array database 113, the vowel triple inverse array database 111, and the query word. For example, the allocator 101 may assign a single vowel identifier by treating vowel tuples 'ㅏ' and 'ㅏ ㅏ' equally.

프로세서(103)는 상기 데이터베이스(107)를 재구성 후, 상기 질의어를 구성하는 음절의 모음조합 각각에 대한 모음식별자와, 상기 모음식별자에 대응하는 단어식별자를 데이터베이스(107)에서 확인할 수 있다. 즉, 프로세서(103)는, 모음 튜플 역배열 데이터베이스(113) 및 모음 트리플 역배열 데이터베이스(115)를 참조하여, 상기 질의어를 구성하는 음절에 대응하는 제2 정보를 식별한다. 여기서, 제2 정보는 앞서 설명된 제1 정보와 동일하게 상기 복수 개의 단어 각각에 할당된 단어식별자일 수 있다.After reconstructing the database 107, the processor 103 may identify a vowel identifier for each vowel combination of syllables constituting the query word and a word identifier corresponding to the vowel identifier in the database 107. That is, the processor 103 refers to the vowel tuple inverse array database 113 and the vowel triple inverse array database 115 to identify second information corresponding to the syllables constituting the query word. Here, the second information may be a word identifier assigned to each of the plurality of words in the same manner as the first information described above.

상기 제2 정보의 식별에 있어서, 프로세서(103)는 상기 질의어의 시작음절을 기준으로 m(상기 m은 자연수)번까지의 음절과 연관되어 조합되는 좌측모음조합 각각에 일치하는 모음 튜플 또는 모음 트리플에 대응하는 제2 정보를, 상기 모음 튜플 역배열 데이터베이스(113) 또는 상기 모음 트리플 역배열 데이터베이스(115)로부터 식별하고, 이에 대응하여, 상기 질의어의 종료음절을 기준으로 n(상기 n은 자연수)번 전까지의 음절과 연관되어 조합되는 우측모음조합 각각에 일치하는 모음 트리플 또는 모음 튜플에 대응하는 제2 정보를, 상기 모음 튜플 역배열 데이터베이스(113) 또는 상기 모음 트리플 역배열 데이터베이스(115)로부터 식별할 수 있다.In the identification of the second information, the processor 103 matches a vowel tuple or vowel triple corresponding to each of the left vowel combinations associated with up to m syllables (the m is a natural number) based on the starting syllable of the query word. Identifying the second information corresponding to the vowel tuple inverse array database 113 or the vowel triple inverse array database 115, and correspondingly, n times (n is a natural number) based on an end syllable of the query word. Second information corresponding to the vowel triple or the vowel tuple corresponding to each of the right vowel combinations associated with the syllables of the second syllable may be identified from the vowel tuple inverse array database 113 or the vowel triple inverse array database 115.

출력부(105)는 상기 식별된 제2 정보 중에서 공통되는 제2 정보의 단어를 검색단어로 출력하는 집합 기반의 검색 장치.The output unit 105 is a set-based search apparatus for outputting a word of the second information common in the identified second information as a search word.

보다 구체적으로, 프로세서(103)는 모음 튜플 역배열 데이터베이스(113)를 참조하여, 상기 질의어의 시작음절을 기준으로 두 번째(m=2)까지의 음절과 연관되어 조합되는 좌측모음조합과 일치하는 모음 튜플에 대응하는 제1 후보어를 식별하고, 모음 트리플 역배열 데이터베이스(115)를 참조하여 상기 질의어의 끝음절을 기준으로 세 번째(n=3)까지의 음절과 연관되어 조합되는 우측모음조합과 일치하는 모음 트리플에 대응하는 제2 후보어를 식별하며, 상기 제1 후보어 및 상기 제2 후보어에 공통되는 단어를 제3 후보어로 식별한다.More specifically, the processor 103 refers to the vowel tuple inverse array database 113 and matches a vowel matching left vowel combination associated with the second syllable (m = 2) based on the starting syllable of the query word. Identify the first candidate word corresponding to the tuple and match the right vowel combination that is associated with the third (n = 3) syllable based on the end syllable of the query by referring to the vowel triple inverse array database 115. A second candidate word corresponding to the vowel triple is identified, and a word common to the first candidate word and the second candidate word is identified as a third candidate word.

유사하게, 프로세서(103)은 모음 트리플 역배열 데이터베이스(115)를 참조하여 상기 질의어의 시작음절을 기준으로 세 번째(m=3)까지의 음절과 연관되어 조합되는 좌측모음조합과 일치하는 모음 트리플에 대응하는 제4 후보어를 식별하고, 모음 튜플 역배열 데이터베이스(113)를 참조하여 상기 질의어의 끝음절을 기준으로 두 번째(n=2)까지의 음절과 연관되어 조합되는 우측모음조합과 일치하는 모음 튜플에 대응하는 제5 후보어를 식별하며, 상기 제4 후보어 및 상기 제5 후보어에 공통되는 단어를 제6 후보어로 식별한다. 이때, 상기 제3 후보어 및 상기 제6 후보어가 검색단어로서, 출력부(105)를 통해 출력된다.Similarly, the processor 103 refers to the vowel triple inverse array database 115 to a vowel triple that matches the left vowel combination that is associated with the third syllable (m = 3) based on the beginning syllable of the query. A vowel that identifies a corresponding fourth candidate and matches the right vowel combination that is associated with the second (n = 2) syllable based on the end syllable of the query by referring to the vowel tuple inverse array database 113 A fifth candidate word corresponding to the tuple is identified, and a word common to the fourth candidate word and the fifth candidate word is identified as the sixth candidate word. In this case, the third candidate word and the sixth candidate word are output through the output unit 105 as a search word.

모음 튜플 역배열 데이터베이스(113) 및 모음 트리플 역배열 데이터베이스(115)에 상기 제2 정보로서, 상기 복수 개의 단어 각각에 대한 단어식별자를 저장되는 조건하에서, 프로세서(103)는 상기 질의어를 구성하는 음절에 각각 대응하여 저장되는 단어식별자들을 모음 튜플 역배열 데이터베이스(113) 및 모음 트리플 역배열 데이터베이스(115)에서 식별하고, 출력부(105)는 상기 식별된 단어식별자들 중에서 공통되는 단어식별자에 의해 지정되는 단어를, POI 데이터베이스(109)로부터 검색단어로서 출력시킬 수 있다.Under the condition that the word identifiers for each of the plurality of words are stored as the second information in the vowel tuple inverse array database 113 and the vowel triple inverse array database 115, the processor 103 is each in syllables constituting the query word. Correspondingly stored word identifiers are identified in the vowel tuple inverse array database 113 and the vowel triple inverse array database 115, and the output unit 105 identifies a word designated by a common word identifier among the identified word identifiers. It can be output as a search word from the POI database 109.

한편, 출력부(105)는 상기 출력되는 검색단어가 복수일 경우, 상기 검색단어를 구성하는 제1 음절과, 상기 질의어를 구성하는 제2 음절과의 일치하는 수에 따라 매칭점수를 L씩 증가시키는 제1 단계와, 상기 제1 단계에서 매칭점수의 증가와 관련한 음절을 제외한, 상기 제1 음절과 상기 제2 음절 중, 초성 및 중성이 일치하거나, 또는 중성 및 종성이 일치하는 음절의 수에 따라 상기 매칭점수를 M씩 증가시키는 제2 단계와, 상기 제1, 2 단계에서 매칭점수의 증가와 관련한 음절을 제외하고 남은, 상기 제1 음절의 수에 따라 상기 매칭점수를 N씩 감소시키는 제3 단계를 수행하고, 합산된 매칭점수가 상대적으로 높은 검색단어를 우선하여 출력한다. 이때, 상기 L은 M 보다 크고, 상기 M은 N 보다 크게 설정한다.On the other hand, if there is a plurality of search words outputted, the output unit 105 increases the matching score by L according to the number of matches between the first syllable constituting the search word and the second syllable constituting the query word. And the number of syllables that match the initial and neutral, or the neutral and final, of the first syllable and the second syllable, except for the syllable associated with the increase of the matching score in the first stage. Accordingly, a second step of increasing the matching score by M, and a step of decreasing the matching score by N according to the number of the first syllables remaining except for the syllable associated with the increase of the matching score in the first and second stages. Step 3 is performed, and the search words with a relatively high sum of matching scores are given priority. At this time, L is larger than M, and M is set larger than N.

다시 말해, 출력부(105)는 모음식별자를 이용하여 확인된 단어식별자 중에서 공통되는 단어식별자의 단어(검색단어)가 복수일 경우, 상기 질의어를 구성하는 제2 음절과 일치되는, 상기 검색단어를 구성하는 제1 음절에 대해 매칭점수를 증가(+30점)시켜 부여한 후, 상기 제2 음절과 중성이 같고, 또한 초성 또는 종성이 같은 제1 음절에 대해 상기 매칭점수를 증가(+15점)시켜 부여할 수 있다. 이때, 출력부(105)는 상기 제2 음절과 중성이 상이하거나, 또는 상기 제2 음절과 중성이 같지만 초성 또는 종성이 상이한 제1 음절에 대해 상기 매칭점수를 차감(-3점)시켜 부여하고, 상기 매칭점수가 상대적으로 높은 검색단어를 우선하여 출력할 수 있다.In other words, when there are a plurality of words (search word) of the common word identifier among the word identifiers identified using the vowel identifier, the output unit 105 matches the second word that constitutes the query word. After the matching score is increased (+30 points) for the first syllable, the matching score is increased (+15 points) for the first syllable having the same neutrality as that of the second syllable and having the same initial or finality. Can be given. In this case, the output unit 105 subtracts (-3 points) the matching score from the first syllable that is different from the second syllable or the neutral, or the same syllable as the second syllable but the first or the last is different. In addition, the search word having a relatively high matching score may be prioritized and output.

이하에서는, 데이터베이스(107) 내 각 DB에 저장되는 단어, 음절, 또는 모음조합 각각에 고유의 식별자(단어식별자, 음절식별자, 모음식별자)가 할당되어 발명이 구현되는 것을 예시하고 있지만, 이는 설명의 편의를 위한 것일 뿐 실시예를 한정시키기 위한 것이 아님은 자명한 사실이다. 예컨대 별도의 식별자 할당 없이, 단어, 음절, 또는 모음조합 자체가 각 DB에서 직접 추출되도록, 발명을 구현할 수도 있고, 이는 본 발명의 기술 사상 범위 내의 것이다.In the following, a unique identifier (word identifier, syllable identifier, vowel identifier) is assigned to each word, syllable, or vowel combination stored in each DB in the database 107 to illustrate that the invention is implemented. It is obvious that the embodiments are only for convenience and not for limiting the embodiments. For example, the invention may be implemented such that a word, syllable, or vowel combination itself is extracted directly from each DB without additional identifier assignment, which is within the scope of the present invention.

도 2는 본 발명의 일실시예에 따른 집합 기반의 검색 장치에서의 단어 별 단어식별자 및 음절식별자 할당에 대한 일례를 도시한 도면이다.2 is a diagram illustrating an example of word word identifier and syllable identifier assignment for each word in a set based search apparatus according to an exemplary embodiment of the present invention.

도 2를 참조하면, 집합 기반의 검색 장치(100)는 POI 데이터베이스(109)에 저장되는 단어 각각에 대해 단어식별자를 할당한다. 예컨대, 집합 기반의 검색 장치(100)는 POI 데이터베이스(109)에 저장되는 단어가 '중계스타클래스, 김정열클래식기타스튜디오, 라이프타임휘트니스클럽, …, 금강산갈비'가 저장될 경우, '중계스타클래스'에 'A'의 단어식별자, '김정열클래식기타스튜디오'에 'B'의 단어식별자, '라이프타임휘트니스클럽'에 'C'의 단어식별자, '금강산갈비'에 'K'의 단어식별자를 할당할 수 있다.Referring to FIG. 2, the set-based search apparatus 100 assigns a word identifier to each word stored in the POI database 109. For example, the set-based retrieval apparatus 100 may have a word stored in the POI database 109 such as "relay star class, Kim Jung-yeol classical guitar studio, life time fitness club,... , 'Gumgangsan Ribs' is stored, 'I' is a word identifier of 'A' in 'Relay Star Class', 'B' is identified in 'Kim Jung-yeol Classical Guitar Studio', 'C' is identified in 'Lifetime Fitness Club', The word identifier of 'K' can be assigned to 'Kumgangsan Ribs'.

이후, 집합 기반의 검색 장치(100)는 단어를 구성하는 음절 각각에 대해 음절식별자를 할당하여, 역배열 데이터베이스(111)에 저장한다. 예컨대, 집합 기반의 검색 장치(100)는 '중계스타클래스'를 구성하는 음절 각각 즉, '중', '계', '스', '타', '클', '래', '스'에 대해, 음절식별자를 할당할 수 있다.Subsequently, the set-based retrieval apparatus 100 allocates a syllable identifier to each syllable constituting a word and stores the syllable identifier in the inverse array database 111. For example, the set-based retrieval apparatus 100 may each of the syllables constituting the 'relay star class', that is, 'middle', 'system', 's', 'ta', 'cle', 'ra', 's' For, we can assign syllable identifiers.

음절식별자의 할당에 있어서, 집합 기반의 검색 장치(100)는 각 단어를 구성하는 음절 각각을 설정된 순서(예컨대, 모음 및 자음 순서)에 기초하여 정렬하고, 정렬된 순서에 따라, 구별된 음절식별자를 할당할 수 있다. 예컨대, 집합 기반의 검색 장치(100)는 POI 데이터베이스(109)에 저장된 각 단어 즉, '중계스타클래스', '김정열클래식기타스튜디오', '라이프타임휘트니스클럽', …, '금강산갈비'를 구성하는 음절 각각을 모음 및 자음 순서에 기초하여 정렬하고, 첫번째로 정렬된 '갈'에 '0'의 음절식별자, 두번째로 정렬된 '강'에 '1'의 음절식별자, 세번째로 정렬된 '건'에 '2'의 음절식별자를 할당할 수 있다.In the assignment of syllable identifiers, the set-based search apparatus 100 sorts each syllable constituting each word based on a set order (for example, vowel and consonant order), and distinguishes syllable identifiers according to the sorted order. Can be assigned. For example, the set-based retrieval apparatus 100 may include each word stored in the POI database 109, namely, 'relay star class', 'Kim Jung-yeol classical guitar studio', 'life time fitness club',... , Each syllable constituting 'Kumgangsan Ribs' is sorted based on vowel and consonant order, the syllable identifier of '0' in the first sorted 'gal', and the syllable identifier of '1' in the 'gang' arranged second You can assign a syllable identifier of '2' to the third sorted 'gun'.

도 3은 본 발명의 일실시예에 따른 집합 기반의 검색 장치에서의 음절식별자에 대응하는 단어식별자 구성에 대한 일례를 도시한 도면이다.3 is a diagram illustrating an example of a word identifier configuration corresponding to a syllable identifier in a set based search apparatus according to an embodiment of the present invention.

도 3을 참조하면, 집합 기반의 검색 장치(100)는 역배열 데이터베이스(111)에, 음절에 연관된 음절식별자와 단어에 연관된 단어식별자를 유지 함으로써, 추후 입력되는 질의어 내 음절에 관한 음절식별자에 근거하여, 단어식별자를 식별할 수 있게 한다.Referring to FIG. 3, the set-based retrieval apparatus 100 maintains a syllable identifier associated with a syllable and a syllable identifier associated with a word in the inverse array database 111 based on the syllable identifier of a syllable in a query word that is input later. This allows the identification of word identifiers.

구체적으로, 집합 기반의 검색 장치(100)는 음절식별자에 대해, 음절식별자에 매칭되는 음절을 포함하는 단어에 대한 단어식별자가 대응되도록 저장할 수 있다. 예컨대, 집합 기반의 검색 장치(100)는 'O'의 음절식별자에 대해, 'O'에 매칭되는 음절 즉, '갈'을 포함하는 '금강산갈비'의 단어에 대한 단어식별자 즉, 'K'가 대응되어 저장되도록 역배열 데이터베이스(111)를 구성할 수 있다. 또한, 집합 기반의 검색 장치(100)는 '1'의 음절식별자에 대해, '1'에 매칭되는 음절 즉, '강'을 포함하는 '비타민건강클럽', '금강산갈비'의 단어에 대한 각 단어식별자 즉, 'E' 및 'K'가 대응되어 저장되도록 역배열 데이터베이스(111)를 구성할 수 있다.In detail, the set-based retrieval apparatus 100 may store the word identifier for the word including the syllable matching the syllable identifier with respect to the syllable identifier. For example, the set-based retrieval apparatus 100 may include a syllable identifier of 'O', a word identifier of a word of 'Keumgangsan rib' including a syllable matching 'O', that is, 'gal', that is, 'K'. The reverse array database 111 can be configured to store correspondingly. In addition, the set-based retrieval apparatus 100 includes a syllable identifier of '1', a syllable matching '1', that is, a word of 'vitamin health club' and 'geumgang ribs' including 'river'. The reverse identifier database 111 may be configured such that the word identifiers 'E' and 'K' are stored correspondingly.

도 4는 본 발명의 일실시예에 따른 집합 기반의 검색 장치에서의 질의어 입력에 따른 검색에 대한 일례를 도시한 도면이다.4 is a diagram illustrating an example of a search based on a query input in a set-based search apparatus according to an embodiment of the present invention.

도 4를 참조하면, 집합 기반의 검색 장치(100)는 질의어 입력에 연동하여, 입력된 질의어를 구성하는 음절 각각에 대한 음절식별자와, 상기 음절식별자에 대응하는 단어식별자를 역배열 데이터베이스(111)에서 확인하고, 확인된 단어식별자 중에서 공통되는 단어식별자의 단어를, POI 데이터베이스(109)로부터 검색단어로서 출력할 수 있다.Referring to FIG. 4, the set-based retrieval apparatus 100 interlocks a syllable identifier for each syllable constituting the input query word and a word identifier corresponding to the syllable identifier in the inverse array database 111 in association with a query input. After confirming, the words of the word identifiers common among the identified word identifiers can be output from the POI database 109 as search words.

예컨대, 집합 기반의 검색 장치(100)는 '스타클래스'의 질의어가 입력되면, '스'에 대한 음절식별자 즉, '23'에 대응하는 'A, B, C, D, I'의 단어식별자('집합1')를 역배열 데이터베이스(111)에서 확인하고, '타'에 대한 음절식별자 즉, '44'에 대응하는 'A, B, C, D, E, F, H, I'의 단어식별자('집합2')를 역배열 데이터베이스(111)에서 확인할 수 있다.For example, the set-based search apparatus 100 may include a syllable identifier for 's', that is, a word identifier of 'A, B, C, D, I' corresponding to '23' when a query of 'star class' is input. ('Set 1') is checked in the inverse array database 111, and a syllable identifier for 'other', that is, a word of 'A, B, C, D, E, F, H, I' corresponding to '44' The identifier ('set 2') can be found in the inverse array database 111.

집합 기반의 검색 장치(100)는 '클', '래', '스' 각각에 연관된 단어식별자를 확인하고, 확인된 단어식별자 중에서 공통되는 단어식별자 즉, 'A, B'의 단어 'A: 중계스타클래스, B: 김정열클래식기타스튜디오'를 검색단어로서, POI 데이터베이스(109)로부터 식별하여 출력할 수 있다.The set-based retrieval apparatus 100 checks word identifiers associated with each of 'cle', 'rae', and 's', and identifies the common word identifiers of the identified word identifiers, that is, words 'A and B'. The relay star class, B: Kim Jung-yeol Classical Guitar Studio 'can be identified and output from the POI database 109 as a search word.

도 5는 본 발명의 일실시예에 따른 집합 기반의 검색 장치에서의 단어 별 단어식별자 및 모음식별자 할당에 대한 일례를 도시한 도면이다.FIG. 5 is a diagram illustrating an example of word word and vowel identifier assignment for each word in a set based search apparatus according to an exemplary embodiment of the present invention.

도 5를 참조하면, 집합 기반의 검색 장치(100)는 POI 데이터베이스(109)에 저장되는 단어 각각에 대해 단어식별자를 할당한다. 예컨대, 집합 기반의 검색 장치는 POI 데이터베이스(109)에 저장되는 단어가 '중계스타클래스, 클루타임스퀘어점, 스파클카센타, 비타민건강클럽, 금강산갈비'가 저장될 경우, '중계스타클래스'에 'A'의 단어식별자, '클루타임스퀘어점'에 'B'의 단어식별자, '스파클카센타'에 'C'의 단어식별자, '비타민건강클럽'에 'D'의 단어식별자, '금강산갈비'에 'E'의 단어식별자를 할당할 수 있다.Referring to FIG. 5, the set-based search apparatus 100 assigns a word identifier to each word stored in the POI database 109. For example, the set-based retrieval apparatus may store the word 'relay star class, cluetime square, sparkle center, vitamin health club, and Geumgang ribs' stored in the POI database 109 in the 'relay star class'. Word identifier of 'A', word identifier of 'B' at 'Clue Time Square', word identifier of 'C' at 'Sparkle Car Center', word identifier of 'D' at 'Vitamin Health Club', 'Kumgang Ribs' You can assign a word identifier of 'E'.

이후, 집합 기반의 검색 장치(100)는 단어를 구성하는 음절의 모음조합 각각에 대해 모음식별자를 할당할 수 있다. 이때, 집합 기반의 검색 장치(100)는 인접한 복수의 음절의 모음조합 각각에 대해 모음식별자를 할당할 수 있다.Thereafter, the set-based retrieval apparatus 100 may assign a vowel identifier to each vowel combination of syllables constituting a word. In this case, the set-based retrieval apparatus 100 may allocate a vowel identifier for each vowel combination of a plurality of adjacent syllables.

예컨대, 집합 기반의 검색 장치(100)는 '중계스타클래스'를 구성하는 2개의 음절의 모음조합 각각에 대해 모음식별자를 할당하여 모음 튜플 역배열 데이터베이스(113)에 저장할 수 있다. 즉, 집합 기반의 검색 장치(100)는, '중계'의 모음조합 즉, '우예'에 대해 음절식별자를 할당하고, '계스'의 모음조합' 즉, '예으', '스타'의 모음조합' 즉, '으아', '타클'의 모음조합' 즉, '아으', '클래'의 모음조합' 즉, '으애', '래스'의 모음조합' 즉, '애으' 각각에 대해 모음식별자를 할당할 수 있다.For example, the set-based retrieval apparatus 100 may allocate a vowel identifier for each of the two syllable vowel combinations constituting the relay star class and store the vowel identifier in the vowel tuple inverse array database 113. That is, the set-based retrieval apparatus 100 assigns a syllable identifier to a vowel combination of 'relay', that is, 'grace', and assigns a vowel combination of 'ges', that is, 'yes' and 'star'. Ie the vowel combination of 'a' and 'tackle', that is, the vowel combination of 'a' and 'class', ie the vowel combination of 'a' and 'class', i.e. Can be assigned.

또한, 집합 기반의 검색 장치(100)는 '중계스타클래스'를 구성하는 3개의 음절의 모음조합 각각에 대해 모음식별자를 할당하여 모음 트리플 역배열 데이터베이스(115)에 저장할 수 있다. 즉, 집합 기반의 검색 장치(100)는, '중계스'의 모음조합 즉, '우예으'에 대해 음절식별자를 할당하고, '계스타'의 모음조합' 즉, '예으아', '스타클'의 모음조합' 즉, '으아으', '타클래'의 모음조합' 즉, '아으애', '클래스'의 모음조합' 즉, '으애으' 각각에 대해 모음식별자를 할당할 수 있다.In addition, the set-based retrieval apparatus 100 may allocate a vowel identifier to each of the three syllable vowel combinations constituting the relay star class and store the vowel identifier in the vowel triple inverse array database 115. That is, the set-based retrieval apparatus 100 allocates a syllable identifier to a vowel combination of 'relays', that is, 'yeye', and assigns a vowel combination of 'gaestar', that is, 'yes', 'star'. You can assign a vowel identifier for each of the vowel combinations of the 'cle', ie 'a', 'the vowels' of the tackle' have.

집합 기반의 검색 장치(100)는 모음식별자 할당시, 모음조합에 포함되는 모음과 동음의 다른 모음이 존재할 경우, 상기 다른 모음을 갖는 모음조합으로 변경하여 모음식별자를 할당할 수 있다. 즉, 집합 기반의 검색 장치(100)는 정해진 복수 개의 모음을, 하나의 대표 모음으로 처리할 수 있고, 예를 들어 동일한 발음을 갖는 모음을 하나의 대표 모음으로 하여, 입력과 검색의 편의를 도모한다.When assigning a vowel identifier, the set-based search apparatus 100 may assign a vowel identifier by changing to a vowel combination having the other vowel when there is another vowel in the vowel combination included in the vowel combination. That is, the set-based retrieval apparatus 100 may process a plurality of predetermined vowels as one representative vowel, and, for example, make a vowel having the same pronunciation as one representative vowel to facilitate input and search. do.

예컨대, 집합 기반의 검색 장치(100)는 '아에'의 모음조합에 포함되는 '에'와 동음의 다른 모음 즉 '애'가 존재할 경우, '아에'의 모음조합을 '아애'의 모음조합으로 변경하고, 변경된 '아애'의 모음조합에 모음식별자를 할당할 수 있다.For example, the set-based retrieval apparatus 100 sets the vowel combination of `` ae '' to vowels of `` ae '' when there are other vowels of `` e '' and hom included in the vowel combination of `` ae ''. You can change to a combination and assign a vowel identifier to the changed vowel combination of 'Ae'.

이후, 집합 기반의 검색 장치(100)는 각 단어를 구성하는 음절의 모음조합 각각을 설정된 순서(예컨대, 모음 순서)에 기초하여 정렬하고, 정렬된 순서에 따라, 구별된 모음식별자를 할당할 수 있다.Subsequently, the set-based retrieval apparatus 100 may sort each vowel combination of syllables constituting each word based on a set order (for example, vowel order), and allocate a distinct vowel identifier according to the sorted order. have.

집합 기반의 검색 장치(100)는 예컨대, POI 데이터베이스(109)에 저장된 각 단어 즉, '중계스타클래스', '클루타임스퀘어점', '스파클카센타', '비타민건강클럽', '금강산갈비'를 구성하는 2개의 음절의 모음조합 각각을 예컨대, 모음 순서에 기초하여 정렬하고, 첫번째로 정렬된 '아아'에 '0'의 모음식별자, 두번째로 정렬된 '아애'에 '1'의 모음식별자를 할당하여, 모음 튜플 역배열 데이터베이스(113)를 구성할 수 있다.The set-based retrieval apparatus 100 is, for example, each word stored in the POI database 109, that is, 'relay star class', 'clue time square', 'sparkle car center', 'vitamin health club', 'geumgangsan rib' For example, each of the two syllable vowel combinations is arranged based on the vowel order, and the vowel identifier of '0' in the first sorted 'ah' and the vowel identifier of '1' in the second sorted '아아' By assigning the vowel tuple inverse array database 113 can be configured.

또한, 집합 기반의 검색 장치(100)는 상기 각 단어에 대한 3개의 음절의 모음조합 각각에 대해서도 2개의 음절의 모음조합과 마찬가지로, 모음조합을 정렬하고, 첫번째 정렬된 '아아아'에 '0'의 모음식별자를 할당하고, 두번째로 정렬된 '아아이'에 '1'의 모음식별자를 할당하여 모음 트리플 역배열 데이터베이스(115)를 구성할 수 있다.In addition, the set-based retrieval apparatus 100 sorts the vowel combination for each of the three syllable vowel combinations for each word, similarly to the vowel combination of the two syllables, and sets '0' to the first sorted 'aa'. A collection triple inverse array database 115 may be configured by assigning a collection identifier of and assigning a collection identifier of '1' to a second sorted 'eye'.

도 6은 본 발명의 일실시예에 따른 집합 기반의 검색 장치에서의 모음식별자에 대응하는 단어식별자 구성에 대한 일례를 도시한 도면이다.6 is a diagram illustrating an example of a word identifier configuration corresponding to a vowel identifier in a set based search apparatus according to an exemplary embodiment of the present invention.

도 6을 참조하면, 집합 기반의 검색 장치(100)는 POI 데이터베이스(109)의 단어에 연관된 단어식별자와, 모음조합의 모음식별자를 서로 대응시켜 저장하는, 모음 튜플 역배열 데이터베이스(113)와 모음 트리플 역배열 데이터베이스(115)를 포함하도록 데이터베이스(107)을 구성할 수 있다.Referring to FIG. 6, the set-based retrieval apparatus 100 correlates and stores a vowel identifier associated with a word of the POI database 109 and a vowel identifier of a vowel combination in association with each other. The database 107 can be configured to include an inverse array database 115.

구체적으로, 집합 기반의 검색 장치(100)는 모음식별자에 대해, 모음식별자에 매칭되는 모음조합을 포함하는 단어에 대한 단어식별자가 대응되도록 저장할 수 있다.In detail, the set-based search apparatus 100 may store a word identifier for a word including a vowel combination matching the vowel identifier with respect to the vowel identifier.

예컨대, 집합 기반의 검색 장치(100)는 2개의 음절의 모음조합에 대응하는 모음식별자에 있어서, 'O'의 모음식별자에 대해, 'O'에 매칭되는 모음조합 즉, '아아'를 포함하는 '금강산갈비'의 단어에 대한 단어식별자 즉, 'E'가 대응되어 저장되도록 모음 튜플 역배열 데이터베이스(113)를 구성할 수 있다. 또한, 집합 기반의 검색 장치(100)는 '1'의 모음식별자에 대해, '1'에 매칭되는 모음조합 즉, '아애'(또는 '아에')를 포함하는 '스파클카센타'의 단어에 대한 단어식별자 즉, 'C'가 대응되어 저장되도록 모음 튜플 역배열 데이터베이스(115)를 구성할 수 있다.For example, the set-based retrieval apparatus 100 includes a vowel combination matching 'O', that is, 'A', for a vowel identifier of 'O' in a vowel identifier corresponding to two syllable vowel combinations. The vowel tuple inverse array database 113 may be configured such that a word identifier for a word of 'Kumgangsan ribs', that is, 'E' is correspondingly stored. In addition, the set-based search apparatus 100 includes a vowel combination matching '1', that is, a word of 'sparkle car center' including 'a' (or 'a') for a vowel identifier of '1'. The vowel tuple inverse array database 115 may be configured such that a word identifier for the word 'C' is correspondingly stored.

유사하게, 집합 기반의 검색 장치(100)는 3개의 음절의 모음조합에 대응하는 모음식별자에 있어서, 'O'의 모음식별자에 대해, 'O'에 매칭되는 모음조합 즉, '아아아'를 포함하는 '금강산갈비'의 단어에 대한 단어식별자 즉, 'E'가 대응되어 저장되도록 모음 트리플 역배열 데이터베이스(115)를 구성할 수 있다. 또한, 집합 기반의 검색 장치는 '1'의 모음식별자에 대해, '1'에 매칭되는 모음조합 즉, '아아이'를 포함하는 '금강산갈비'의 단어에 대한 단어식별자 즉, 'E'가 대응되어 저장되도록 모음 트리플 역배열 데이터베이스(115)를 구성할 수 있다.Similarly, the set-based search apparatus 100 includes a vowel combination matching 'O', that is, 'A', for the vowel identifier of 'O' in a vowel identifier corresponding to three syllable vowel combinations. The vowel triple inverse array database 115 may be configured such that a word identifier, ie, 'E', of the word of 'Kumgangsan Ribs' is correspondingly stored. In addition, the set-based search apparatus corresponds to a vowel identifier of '1', a word identifier of 'void combination of' 1 ', that is, a word identifier of a word of' Kumgangsan rib 'including' Ai '. The collection triple inverse array database 115 may be configured to be stored and stored.

도 7은 본 발명의 일실시예에 따른 집합 기반의 검색 장치에서의 질의어 입력에 따른 검색에 대한 다른 일례를 도시한 도면이다.FIG. 7 illustrates another example of searching based on a query input in a set-based searching apparatus according to an embodiment of the present invention.

도 7을 참조하면, 집합 기반의 검색 장치(100)는 질의어 입력에 연동하여, 입력된 질의어를 구성하는 음절 각각에 대한 음절식별자와, 상기 음절식별자에 대응하는 단어식별자를 모음 튜플 역배열 데이터베이스(113) 및 모음 트리플 역배열 데이터베이스(115)에서 확인하고, 확인된 단어식별자 중에서 공통되는 단어식별자가 없는 경우, 질의어를 구성하는 음절의 모음조합 각각에 대한 모음식별자를 이용하여, 검색단어를 출력할 수 있다.Referring to FIG. 7, the set-based retrieval apparatus 100, in association with a query input, includes a syllable identifier for each syllable constituting the input query word and a word identifier corresponding to the syllable identifier. And the vowel triple inverse array database 115, and if there is no common word identifier among the identified word identifiers, the search word may be output using the vowel identifier for each vowel combination of syllables constituting the query word. .

이때, 집합 기반의 검색 장치(100)는 질의어의 시작음절을 기준으로 m(상기 m은 자연수)번까지의 음절과 연관되어 조합되는 좌측모음조합 각각에 대한 모음식별자에 대응하는 단어식별자를 모음 튜플 역배열 데이터베이스(113) 또는 모음 트리플 역배열 데이터베이스(115)에서 확인하거나, 또는 질의어의 종료음절을 기준으로 n(상기 n은 자연수)번 전까지의 음절과 연관되어 조합되는 우측모음조합 각각에 대한 모음식별자에 대응하는 단어식별자를 모음 튜플 역배열 데이터베이스(113) 또는 모음 트리플 역배열 데이터베이스(115)에서 확인할 수 있다.In this case, the set-based retrieval apparatus 100 is a vowel tuple of word identifiers corresponding to vowel identifiers for each of the left vowel combinations that are associated with the syllable up to m (the m is a natural number) based on the starting syllable of the query. The vowel identifier for each of the right vowel combinations identified in the inverse array database 113 or the vowel triple inverse array database 115 or associated with the syllable up to n (where n is a natural number) based on the ending syllable of the query word. The corresponding word identifier may be checked in the vowel tuple inverse array database 113 or the vowel triple inverse array database 115.

도 6, 7를 함께 참고하여 설명하면, 예컨대, 집합 기반의 검색 장치(100)는 '카쎈타스파크'의 질의어가 입력되면, 시작음절을 기준으로 2번째까지의 음절 즉, '카쎈' 및 종료음절을 기준으로 3번째 전까지의 음절 즉, '스파크' 각각에 대한 단어식별자를, 모음 튜플 역배열 데이터베이스(113) 및 모음 트리플 역배열 데이터베이스(115) 각각에서 확인할 수 있다.Referring to FIGS. 6 and 7, for example, the set-based search apparatus 100 may input a syllable up to the second syllable, that is, “ka-kup” and end based on the start syllable, when a query word of “ka-kappa spak” is input. Based on the syllables, the word identifiers for the third syllable, that is, each of the 'sparks' may be checked in each of the vowel tuple inverse array database 113 and the vowel triple inverse array database 115.

즉, 집합 기반의 검색 장치(100)는 '카쎈타스파크'의 좌측모음조합 즉, '카쎈'의 모음조합 '아에'을, 동음의 모음으로 갖는 모음조합 '아애'로 변경하고, 변경된 '아애'에 대한 모음식별자 즉, '1'에 대응하는 'C'의 단어식별자('집합1')를 모음 튜플 역배열 데이터베이스(113)에서 확인할 수 있다.In other words, the set-based retrieval apparatus 100 changes the left vowel combination of the 'Catetta Spark', that is, the vowel combination 'Ae' of the 'Cat' to a vowel combination 'Ae' with the vowels of the homophone, and changes the ' The vowel identifier for Aeah, that is, the word identifier ('set 1') of 'C' corresponding to '1' may be checked in the vowel tuple inverse array database 113.

계속해서, 집합 기반의 검색 장치(100)는 '카센트스파크'의 우측모음조합 즉, '으아으'에 대한 모음식별자 즉, '15'에 대응하는 'A, C'의 단어식별자('집합2')를 모음 트리플 역배열 데이터베이스(115)에서 확인할 수 있다.Subsequently, the set-based retrieval apparatus 100 is a right side vowel combination of 'Cacent's Park', that is, a vowel identifier for 'Ea', that is, a word identifier ('set) of' A, C 'corresponding to' 15 '. 2 ′) can be found in the vowel triple inverse array database 115.

집합 기반의 검색 장치(100)는 집합 1과 집합 2의 공통되는 단어식별자 'C'를 확인하고, 상기 단어식별자 'C'에 대응하는 단어 '스파클카센타'를, POI 데이터베이스(107)로서 식별하여, 검색단어로서 출력할 수 있다.The set-based retrieval apparatus 100 identifies the common word identifier 'C' of sets 1 and 2 and identifies the word 'sparkle center' corresponding to the word identifier 'C' as the POI database 107. Can be output as a search word.

또한, 집합 기반의 검색 장치(100)는 상기 입력된 '카쎈타스파크'의 질의어에 대해, 시작음절을 기준으로 3번째까지의 음절 즉, '카쎈타' 및 종료음절을 기준으로 2번째 전까지의 음절 즉, '타크' 각각에 대한 단어식별자를, 모음 튜플 역배열 데이터베이스(113) 및 모음 트리플 역배열 데이터베이스(115)로부터 각각 더 확인할 수 있다.Also, the set-based retrieval apparatus 100 may have a third syllable based on the start syllable, that is, the second syllable based on the 'cañata' and the ending syllable, with respect to the input query word of 'Chapteraspark'. The word identifiers for the syllables, ie, 'tags', may be further identified from the vowel tuple inverse array database 113 and the vowel triple inverse array database 115, respectively.

즉, 집합 기반의 검색 장치(100)는 '카쎈타스파크'의 좌측모음조합 즉, '카센타'의 모음조합 '아에아'를 동음의 모음으로 갖는 모음조합 '아애아'로 변경하고, 변경된 '아애아'에 대한 모음식별자 즉, '2'에 대응하는 'C'의 단어식별자('집합3')를 모음 트리플 역배열 데이터베이스(115)에서 확인할 수 있다.That is, the set-based retrieval apparatus 100 changes the left vowel combination of the 'capitaspark', that is, the vowel combination 'aea' of the 'cacenta' into a vowel combination 'aae' with the same vowel, A vowel identifier for 'Ah Aya', that is, a word identifier of 'C' corresponding to '2' ('set 3') may be checked in the vowel triple inverse array database 115.

계속해서, 집합 기반의 검색 장치(100)는 '카센타스타크'의 우측모음조합 즉, '아으'에 대한 모음식별자 즉, '2'에 대응하는 'A, C, D'의 단어식별자('집합4')를 모음 튜플 역배열 데이터베이스(113)에서 확인할 수 있다.Subsequently, the set-based retrieval apparatus 100 is a right-sided combination of 'Casenta Stark', that is, a vowel identifier for 'A', that is, a word identifier ('set) of' A, C, D 'corresponding to' 2 '. 4 ′) can be found in the vowel tuple inverse array database 113.

집합 기반의 검색 장치(100)는 집합 3과 집합 4의 공통되는 단어식별자 'C'를 확인하고, 상기 단어식별자 'C'에 대응하는 단어 '스파클카센타'를, POI 데이터베이스(107)로서 식별하여, 검색단어로서 출력할 수 있다.The set-based search apparatus 100 checks the common word identifier 'C' of sets 3 and 4, identifies the word 'sparkle center' corresponding to the word identifier 'C' as the POI database 107, and Can be output as a search word.

상술의 예시에서는 '집합1'과 '집합2' 중에서 공통되는 단어식별자와, '집합3'과 '집합4' 중에서 공통되는 단어식별자가 동일하여 하나의 검색단어 '스파클카센타'가 출력되는 것을 예시하고 있으나, 공통되는 단어 식별자가 서로 상이할 경우, 복수의 검색단어를 식별하여 출력할 수도 있다.In the above example, the word identifier common among 'set 1' and 'set 2' and the common word identifier among 'set 3' and 'set 4' are the same, so that one search word 'sparkle center' is outputted. However, when common word identifiers are different from each other, a plurality of search words may be identified and output.

도 8은 본 발명의 일실시예에 따른 집합 기반의 검색 장치에서의 검색단어 출력 순서를 설명하기 위한 도시한 도면이다.8 is a diagram illustrating a search word output order in a set-based search apparatus according to an embodiment of the present invention.

도 8을 참조하면, 집합 기반의 검색 장치(100)는 질의어 입력에 연동하여, 출력되는 검색단어가 복수일 경우, ⅰ) 음절과 연관된 가중치 또는 ⅱ) 음절 내 초성/중성/종성과 연관된 매칭점수를 부여하고, 가중치 또는 매칭점수가 상대적으로 높게 부여된 검색단어를 우선하여 출력 함(즉, 내림차순 출력)으로써, 검색 선호도가 높은 검색단어를 보다 용이하게 인지할 수 있게 한다.Referring to FIG. 8, when the set-based search apparatus 100 is linked to a query input and outputs a plurality of search words, i) weights associated with syllables or ii) matching scores associated with initial / neutral / final in syllables And outputting a search word given a relatively high weight or matching score (ie, in descending order), it is easier to recognize a search word having a high search preference.

ⅰ) 음절과 연관된 가중치를 기준으로 출력하는 경우,Iii) In case of outputting based on the weight associated with syllable,

집합 기반의 검색 장치(100)는 검색단어를 구성하는 제1 음절과, 질의어를 구성하는 제2 음절과의 일치되는 개수에 따라 가중치를 부여하고, 가중치가 상대적으로 높게 부여된 검색단어를 우선하여 출력할 수 있다. 이때, 집합 기반의 검색 장치(100)는 상기 제1 음절의 조합과, 상기 제2 음절의 조합과의 일치되는 개수를 더 고려하여 상기 가중치를 부여할 수 있다.The set-based search apparatus 100 assigns weights according to the number of matches between the first syllable constituting the search word and the second syllable constituting the query word, and prioritizes the search word given a relatively high weight. You can print In this case, the set-based retrieval apparatus 100 may give the weight in consideration of the number of matches between the combination of the first syllable and the combination of the second syllable.

예컨대, 집합 기반의 검색 장치(100)는 질의어가 '기타'이고, 출력되는 검색단어가 '김정열클래식기타' 및 '타악기강의'일 경우, 각 검색단어를 구성하는 음절과 질의어를 구성하는 음절 중 2개의 음절 즉, '기, 타'로 동일 함에 따라, 각 검색단어에 가중치를 동일하게 부여할 수 있다. 이때, 집합 기반의 검색 장치(100)는 질의어의 음절이 조합된 '기타'가 '김정열클래식기타'에 포함되는 반면, '타악기강의'에는 포함되지 않음에 따라, '김정열클래식기타'의 검색단어에 가중치를 추가 부가하여, '타악기강의'의 검색단어 보다 '김정열클래식기타'의 검색단어에 상대적으로 더 높은 가중치를 부여할 수 있다.For example, the set-based search apparatus 100 includes a syllable constituting each search word and syllables constituting each search word when the query word is “other”, and the output search words are “Kim Jung-yeol classical guitar” and “percussion instrument lecture”. As two syllables, i.e., the other and the same, are equal, weights may be equally assigned to each search word. In this case, the set-based retrieval apparatus 100 includes a 'guitar' in which a syllable of syllables is combined is included in the 'kim jeongyeol classical guitar', while not included in the 'percussion instrument lecture,' By adding a weight to the search word of 'percussion lecture', the search word of 'Kim Jungyeol classical guitar' can be given a relatively higher weight.

이에 따라, 집합 기반의 검색 장치(100)는 가중치를 기준으로, '김정열클래식기타', '타악기강의' 순서로 출력할 수 있다.Accordingly, the set-based retrieval apparatus 100 may output the Kim Jong-yeol classical guitar and the percussion lecture order based on the weight.

ⅱ) 음절 내 초성/중성/종성과 연관된 매칭점수를 기준으로 출력하는 경우,Ii) In case of outputting based on the matching score associated with the initial / neutral / finality in the syllable,

집합 기반의 검색 장치(100)는 질의어 입력에 연동하여, 출력되는 검색단어가 복수일 경우, 질의어를 구성하는 제2 음절과 일치되는, 상기 검색단어를 구성하는 상기 제1 음절에 대해 매칭점수를 증가시켜 부여한 후, 상기 제2 음절과 중성이 같고, 또한 초성 또는 종성이 같은 제1 음절에 대해 상기 매칭점수를 증가시켜 부여할 수 있다. 이후, 집합 기반의 검색 장치(100)는 상기 제2 음절과 중성이 상이하거나, 또는 상기 제2 음절과 중성이 같지만 초성 또는 종성이 상이한 제1 음절에 대해 매칭점수를 차감시켜 부여할 수 있다.The set-based retrieval apparatus 100 may match the second syllable constituting the search word with a matching score for the first syllable that matches the second syllable constituting the query word in response to the input of the query word. After increasing, the matching score may be increased by increasing the matching score with respect to the first syllable having the same neutrality as that of the second syllable and having the same initial or finality. Subsequently, the set-based retrieval apparatus 100 may subtract a matching score from a first syllable different from the second syllable or the first syllable having the same neutrality as the second syllable but different from the initial or the finality.

예컨대, 집합 기반의 검색 장치(100)는 질의어가 '카쎈타스파크'이고, 복수의 검색단어 중 '스파클카센타'에 대한 매칭점수를 부여할 경우, '스파클카센타'를 구성하는 제1 음절과 '카쎈타스파크'를 구성하는 제2 음절 중 4개의 음절 즉, '스, 파, 카, 타' 음절이 일치하므로, 1개의 음절이 일치할 때 '30'의 매칭점수를 부여함에 따라, '스파클카센타'에 대한 매칭점수로서, '120'을 부여할 수 있다.For example, the set-based search apparatus 100 may have a first syllable and a 'sparkle casenta' when the query is 'cachetta park' and gives a matching score for 'sparkle car center' among a plurality of search words. Since four syllables of the second syllables constituting the Cajuntas Spark, that is, the syllables of 'S, Fah, Ka, Ta', are matched, when one syllable is matched, a score of '30' is given. '120' may be given as a matching score for the 'centre'.

이후, 집합 기반의 검색 장치(100)는 '카쎈타스파크'의 질의어를 구성하는 음절 및 '스파클카센타'의 검색단어를 구성하는 음절 각각에서 상기 4개의 음절을 제외한 음절을 이용하여, 매칭점수를 증가시키거나 차감할 수 있다. 즉, 집합 기반의 검색 장치(100)는 검색단어를 구성하는 '센' 음절이 질의어를 구성하는 '쎈' 음절을 기준으로, 중성('에')이 같고, 종성('ㄴ')이 같을 경우, '15'를 증가시켜, '135'의 매칭점수를 부여할 수 있다. 또한, 집합 기반의 검색 장치(100)는 검색단어를 구성하는 '클' 음절이 질의어를 구성하는 '크' 음절을 기준으로, 중성('으')이 같고, 초성('ㅋ')이 같을 경우, '15'를 증가시켜, '150'의 매칭점수를 부여할 수 있다.Subsequently, the set-based retrieval apparatus 100 uses matching syllables except for the four syllables in each syllable constituting the query word of 'Capteta Spark' and syllables constituting the search word of 'Sparkle Car Center'. You can increase or decrease it. That is, the set-based search apparatus 100 may have the same neutrality ('e') and the same finality ('b') based on the 'sy' syllable constituting the search word based on the 'sy' syllable constituting the query word. In this case, a matching score of '135' may be given by increasing '15'. In addition, the set-based search apparatus 100 may have the same neutral (')' and the same initial ('ㅋ') based on the 'large' syllable that constitutes the query word. In this case, '15' may be increased to give a matching score of '150'.

또한, 집합 기반의 검색 장치(100)는 질의어가 '카쎈타스파크'이고, 복수의 검색단어 중 '스파게티'에 대한 매칭점수를 부여할 경우, '스파클카센타'를 구성하는 제1 음절과 '카쎈타스파크'를 구성하는 제2 음절 중 2개의 음절 즉, '스, 파' 음절이 일치하므로, 1개의 음절이 일치할 때 '30'의 매칭점수를 부여함에 따라, '스파클카센타'에 대한 매칭점수로서, '60'을 부여할 수 있다.In addition, the set-based search apparatus 100 has a first syllable and a 'car' that constitute a 'sparkle car center' when the query word is 'capita spark' and a matching score for 'spaghetti' is among the plurality of search words. Since two syllables of the second syllable constituting the pentaspark, that is, the syllable 'spa' syllables match, a matching score of '30' is given when one syllable matches, thus matching the 'sparkle car center' As a score, '60' can be given.

이후, 집합 기반의 검색 장치(100)는 검색단어를 구성하는 '게' 음절이 질의어를 구성하는 '쎈' 음절을 기준으로, 중성('에')이 같지만, 초성 및 종성이 상이하므로, '3'을 차감시켜, '57'의 매칭점수를 부여할 수 있다.Subsequently, the set-based search apparatus 100 has the same neutral ('e'), but different initial and / or final, based on '하는' syllables constituting the search word. By subtracting 3 ', a matching score of' 57 'can be given.

또한, 집합 기반의 검색 장치(100)는 검색단어를 구성하는 '티' 음절이 질의어를 구성하는 '카, 타, 크' 음절을 기준으로, 중성('이')이 상이하므로, '3'을 차감시켜, '54'의 매칭점수를 부여할 수 있다.In addition, the set-based search apparatus 100 has a '3' because the 'T' syllable constituting the search word is different from the neutral ('yi') based on the 'ka, ta, k' syllable constituting the query word. By subtracting, a matching score of '54' can be given.

집합 기반의 검색 장치(100)는 복수의 검색단어 중 가중치 또는 매칭점수가 상대적으로 높게 부여된 검색단어를 우선하여 출력 함(즉, 내림차순 출력)으로써, 검색 선호도가 높은 검색단어를 보다 용이하게 인지할 수 있게 한다.The set-based search apparatus 100 first outputs a search word given a relatively high weight or matching score among a plurality of search words (ie, a descending order), so that the search word having a high search preference is easily recognized. To do it.

이에 따라, 집합 기반의 검색 장치(100)는 매칭점수를 기준으로, '스파클카센타', '스파게티' 순서로 출력할 수 있다.Accordingly, the set-based search apparatus 100 may output the sparkle center and the spaghetti based on the matching score.

도 9는 본 발명의 일실시예에 따른 집합 기반의 검색 방법을 나타내는 흐름도이다.9 is a flowchart illustrating a set-based search method according to an embodiment of the present invention.

도 9를 참조하면, 단계 901에서, 집합 기반의 검색 장치(100)는 POI 데이터베이스(109)에 저장되는 단어 각각에 대해 단어식별자를 할당하고, 단어를 구성하는 음절 각각에 대해 음절식별자를 할당한다.Referring to FIG. 9, in step 901, the set-based search apparatus 100 assigns a word identifier to each word stored in the POI database 109, and assigns a syllable identifier to each syllable constituting the word. .

또한, 집합 기반의 검색 장치(100)는 단어를 구성하는 음절의 모음조합 각각에 대해서도 모음식별자를 할당할 수 있다. 이때, 집합 기반의 검색 장치는 인접한 복수의 음절의 모음조합 각각에 대해 모음식별자를 할당할 수 있고, 모음조합에 포함되는 모음과 동음의 다른 모음이 존재할 경우, 상기 다른 모음을 갖는 모음조합으로 변경하여 모음식별자를 할당할 수 있다.In addition, the set-based retrieval apparatus 100 may assign a vowel identifier to each vowel combination of syllables constituting a word. In this case, the set-based retrieval apparatus may allocate a vowel identifier to each of the adjacent vowel combinations of a plurality of syllables, and if there are different vowels with the same vowel included in the vowel combination, change the vowel combination with the different vowels. You can assign a vowel identifier.

단계 903에서, 집합 기반의 검색 장치는 음절식별자에 근거하여, 단어식별자가 대응되어 저장되도록 데이터베이스(107)를 구성한다. 집합 기반의 검색 장치(100)는 음절식별자에 대해, 음절식별자에 매칭되는 음절을 포함하는 단어에 대한 단어식별자가 대응되도록 역배열 데이터베이스(111)을 구성할 수 있다.In step 903, the set-based search apparatus configures the database 107 so that the word identifiers are stored in correspondence with the syllable identifiers. The set-based retrieval apparatus 100 may configure the inverse array database 111 so that a word identifier for a word including a syllable matching the syllable identifier corresponds to the syllable identifier.

다른 실시예에서 집합 기반의 검색 장치(100)는 단어를 구성하는 음절의 모음조합 각각에 대해서도 모음식별자를 할당하여, 모음 튜플 역배열 데이터베이스(113) 및 모음 트리플 역배열 데이터베이스(115)를 구성할 수 있다.In another embodiment, the set-based retrieval apparatus 100 may configure the vowel tuple inverse array database 113 and the vowel triple inverse array database 115 by assigning a vowel identifier to each vowel combination of syllables constituting a word. .

단계 905에서, 집합 기반의 검색 장치(100)는 질의어 입력에 연동하여, 입력된 질의어를 구성하는 음절 각각에 대한 음절식별자와, 음절식별자에 대응하는 단어식별자를 역배열 데이터베이스(111)에서 확인한다.In operation 905, the set-based retrieval apparatus 100 checks the syllable identifier and the word identifier corresponding to the syllable identifier in the inverse array database 111 in association with the query input.

단계 907에서, 집합 기반의 검색 장치(100)는 확인된 단어식별자 중에서 공통되는 단어식별자의 단어를, POI 데이터베이스(109)에서 검색하여 검색단어로서 출력한다.In operation 907, the set-based search apparatus 100 searches for a word of a word identifier that is common among the identified word identifiers in the POI database 109 and outputs the search word.

집합 기반의 검색 장치(100)는 상기 출력되는 검색단어가 복수일 경우, 검색단어를 구성하는 제1 음절과, 질의어를 구성하는 제2 음절과의 일치되는 개수에 따라 가중치를 부여하고, 가중치가 상대적으로 높게 부여된 검색단어를 우선하여 출력할 수 있다. 이때, 집합 기반의 검색 장치(100)는 상기 제1 음절의 조합과, 상기 제2 음절의 조합과의 일치되는 개수를 더 고려하여 상기 가중치를 부여할 수 있다.The set-based search apparatus 100 assigns weights according to the number of matches between the first syllable constituting the search word and the second syllable constituting the query word when the output search word is plural, and the weight A search word given a relatively high value may be given priority. In this case, the set-based retrieval apparatus 100 may give the weight in consideration of the number of matches between the combination of the first syllable and the combination of the second syllable.

상기 단계 907에서, 집합 기반의 검색 장치(100)는 상기 공통되는 단어식별자가 없는 경우, 질의어를 구성하는 음절의 모음조합 각각에 대한 모음식별자와, 상기 모음식별자에 대응하는 단어식별자를 모음 튜플 역배열 데이터베이스(113) 및 모음 트리플 역배열 데이터베이스(115) 에서 확인하고, 상기 확인된 단어식별자 중에서 공통되는 단어식별자의 단어를 검색단어로서 출력할 수 있다.In step 907, when there is no common word identifier, the set-based search apparatus 100 performs a vowel identifier for each vowel combination of syllables constituting a query word, and a word identifier corresponding to the vowel identifier. In the database 113 and the vowel triple inverse array database 115, a word of a common word identifier among the identified word identifiers may be output as a search word.

이때, 집합 기반의 검색 장치(100)는 상기 질의어의 시작음절을 기준으로 m(상기 m은 자연수)번까지의 음절과 연관되어 조합되는 좌측모음조합 각각에 대한 모음식별자에 대응하는 단어식별자를 상기 데이터베이스에서 확인하거나, 또는 상기 질의어의 종료음절을 기준으로 n(상기 n은 자연수)번 전까지의 음절과 연관되어 조합되는 우측모음조합 각각에 대한 모음식별자에 대응하는 단어식별자를 상기 데이터베이스에서 확인할 수 있다.In this case, the set-based retrieval apparatus 100 is a word identifier corresponding to a vowel identifier for each of the left vowel combinations that are associated with the syllable up to m (the m is a natural number) based on the starting syllable of the query word. In the database, a word identifier corresponding to a vowel identifier for each right vowel combination that is associated with a syllable up to n times (n is a natural number) based on the ending syllable of the query word may be checked in the database. .

또한, 집합 기반의 검색 장치(100)는 상기 모음조합에 포함되는 모음과, 동음의 모음을 갖는 모음조합에 대해 할당된 모음식별자를 확인할 수 있다.In addition, the set-based retrieval apparatus 100 may identify a vowel included in the vowel combination and a vowel identifier assigned to the vowel combination having the same vowel.

집합 기반의 검색 장치(100)는 모음식별자를 이용하여 확인된 단어식별자 중에서 공통되는 단어식별자의 단어 즉, 검색단어가 복수일 경우, 상기 질의어를 구성하는 제2 음절과 일치되는, 상기 검색단어를 구성하는 제1 음절에 대해 매칭점수를 증가시켜 부여한 후, 상기 제2 음절과 중성이 같고, 또한 초성 또는 종성이 같은 제1 음절에 대해 상기 매칭점수를 증가시켜 부여할 수 있다. 이때, 집합 기반의 검색 장치는 상기 제2 음절과 중성이 상이하거나, 또는 상기 제2 음절과 중성이 같지만 초성 또는 종성이 상이한 제1 음절에 대해 상기 매칭점수를 차감시켜 부여하고, 상기 매칭점수가 상대적으로 높은 검색단어를 우선하여 출력할 수 있다.The set-based search apparatus 100 matches the search word that matches a second syllable constituting the query word when there are a plurality of words of a word identifier that is common among the word identifiers identified using the vowel identifier, that is, the search words. The matching score may be increased for the first syllable, and then the matching score may be increased for the first syllable having the same neutrality as that of the second syllable and having the same initial or finality. In this case, the set-based retrieval apparatus subtracts the matching score from the first syllable different from the second syllable or the first syllable having the same neutrality as that of the second syllable but differs in the initial or finality, and provides the matching score. Relatively high search words can be given priority.

본 발명의 실시예들은 다양한 컴퓨터 수단을 통하여 수행될 수 있는 프로그램 명령 형태로 구현되어 컴퓨터 판독 가능 매체에 기록될 수 있다. 상기 컴퓨터 판독 가능 매체는 프로그램 명령, 데이터 파일, 데이터 구조 등을 단독으로 또는 조합하여 포함할 수 있다. 상기 매체에 기록되는 프로그램 명령은 본 발명을 위하여 특별히 설계되고 구성된 것들이거나 컴퓨터 소프트웨어 당업자에게 공지되어 사용 가능한 것일 수도 있다. 컴퓨터 판독 가능 기록 매체의 예에는 하드 디스크, 플로피 디스크 및 자기 테이프와 같은 자기 매체(magnetic media), CD-ROM, DVD와 같은 광기록 매체(optical media), 플롭티컬 디스크(floptical disk)와 같은 자기-광 매체(magneto-optical media), 및 롬(ROM), 램(RAM), 플래시 메모리 등과 같은 프로그램 명령을 저장하고 수행하도록 특별히 구성된 하드웨어 장치가 포함된다. 프로그램 명령의 예에는 컴파일러에 의해 만들어지는 것과 같은 기계어 코드뿐만 아니라 인터프리터 등을 사용해서 컴퓨터에 의해서 실행될 수 있는 고급 언어 코드를 포함한다. 상기된 하드웨어 장치는 본 발명의 동작을 수행하기 위해 하나 이상의 소프트웨어 모듈로서 작동하도록 구성될 수 있으며, 그 역도 마찬가지이다.Embodiments of the present invention may be implemented in the form of program instructions that can be executed on various computer means and recorded on a computer readable medium. The computer-readable medium may include program instructions, data files, data structures, and the like, alone or in combination. The program instructions recorded on the medium may be those specially designed and constructed for the present invention or may be available to those skilled in the art of computer software. Examples of computer-readable media include magnetic media such as hard disks, floppy disks and magnetic tape; optical media such as CD-ROMs and DVDs; magnetic media such as floppy disks; Magneto-optical media, and hardware devices specifically configured to store and execute program instructions such as ROM, RAM, flash memory, and the like. Examples of program instructions include machine language code such as those produced by a compiler, as well as high-level language code that can be executed by a computer using an interpreter or the like. The hardware devices described above may be configured to operate as one or more software modules to perform the operations of the present invention, and vice versa.

이상과 같이 본 발명은 비록 한정된 실시예와 도면에 의해 설명되었으나, 본 발명은 상기의 실시예에 한정되는 것은 아니며, 본 발명이 속하는 분야에서 통상의 지식을 가진 자라면 이러한 기재로부터 다양한 수정 및 변형이 가능하다.As described above, the present invention has been described by way of limited embodiments and drawings, but the present invention is not limited to the above embodiments, and those skilled in the art to which the present invention pertains various modifications and variations from such descriptions. This is possible.

그러므로, 본 발명의 범위는 설명된 실시예에 국한되어 정해져서는 아니 되며, 후술하는 특허청구범위뿐 아니라 이 특허청구범위와 균등한 것들에 의해 정해져야 한다.Therefore, the scope of the present invention should not be limited to the described embodiments, but should be determined by the equivalents of the claims, as well as the claims.

100: 집합 기반의 검색 장치
101: 할당부
103: 프로세서
105: 출력부
107: 데이터베이스
109 : POI 데이터베이스
111 : 역배열 데이터베이스
113 : 모음 튜플 역배열 데이터베이스
115 : 모음 트리플 역배열 데이터베이스100: set-based search device
101: allocation unit
103: processor
105: output unit
107: database
109: POI Database
111: inverse array database
113: vowel tuple inverse array database
115: vowel triple inverse array database

Claims

Maintaining a POI database storing a plurality of words;
Maintaining an inverse array database storing first information about a word of the POI database including each syllable, corresponding to each syllable constituting the plurality of words;
In response to input of a query word, referring to the inverse array database, identifying first information corresponding to a syllable constituting the query word; And
Outputting a word of the first information including the syllables constituting the query word among the identified first information as a search word;
Set-based search method comprising a.

The method of claim 1,
Identifying the first information,
In response to the input of the query, identifying only the first information corresponding to the syllable “all” constituting the query by referring to the inverse array database.
Set-based search method comprising a.

The method of claim 1,
Identifying the first information,
In response to the input of the query, identifying first information corresponding to at least some of the syllables constituting the query by referring to the inverse array database.
Set-based search method comprising a.

The method of claim 1,
When the output search word is plural,
Outputting the word of the first information as a search word,
Outputting the first syllable constituting the search word and the second syllable constituting the query word as the number of matches is higher
Set-based search method comprising a.

The method of claim 1,
When the output search word is plural,
Outputting the word of the first information as a search word,
Outputting a larger order of matching between the syllables within two adjacent syllables constituting the search word and the order between the syllables within two adjacent syllables constituting the query word;
Set-based search method comprising a.

The method of claim 1,
The inverse array database stores word identifiers for each of the plurality of words as the first information.
Identifying the first information,
Identifying word identifiers stored in correspondence with syllables forming the query word in the inverse array database;
Lt; / RTI >
Outputting the word of the first information as a search word,
Outputting, from among the identified word identifiers, a word designated by a word identifier common to the word identifier of each syllable constituting the query, from the POI database as a search word;
Set-based search method comprising a.

The method of claim 1,
In case there is no output search word,
Maintaining a vowel tuple inverse array database that stores second information about a word in the POI database including the vowel tuples constituting the plurality of words;
Maintaining a vowel triple inverse array database that stores second information about words in the POI database including the vowel triples corresponding to each vowel triple constituting the plurality of words;
Identifying second information corresponding to a syllable constituting the query by referring to the vowel tuple inverse array database and the vowel triple inverse array database; And
Outputting a word of second information in common among the identified second information as a search word;
Set-based search method further comprising.

The method of claim 7, wherein
Identifying the second information,
A second information corresponding to a vowel tuple or a vowel triple corresponding to each of the left vowel combinations associated with syllables up to m (the m is a natural number) based on the starting syllable of the query word; Identifying from the vowel triple inverse array database; And
The second information corresponding to the vowel triple or the vowel tuple corresponding to each of the right vowel combinations associated with the syllable up to n times (n is a natural number) based on the ending syllable of the query word, the vowel triple inverse array database or Identifying from the vowel tuple inverse array database
Set-based search method comprising a.

The method of claim 7, wherein
The vowel tuple inverse array database and the vowel triple inverse array database store word identifiers for each of the plurality of words as the second information.
Identifying the second information,
Identifying word identifiers stored in correspondence with syllables constituting the query in the vowel tuple inverse array database and the vowel triple inverse array database
Lt; / RTI >
Outputting the word of the second information as a search word,
Outputting a word designated by a common word identifier among the identified word identifiers as a search word from the POI database;
Set-based search method comprising a.

The method of claim 7, wherein
When the output search word is plural,
Outputting the word of the second information as a search word,
A first step of increasing the matching score by L according to the number of matches between the first syllable constituting the search word and the second syllable constituting the query word;
The matching score is M according to the number of syllables in which the initial and the neutral syllables or the neutral and the final syllables are matched among the first syllable and the second syllable, except for the syllable associated with the increase of the matching score in the first stage. A second step of incrementing;
A third step of decreasing the matching score by N according to the number of the first syllables remaining, except for the syllables associated with the increase of the matching score in the first and second steps; And
Outputting a search word having a relatively high matching score in priority order
Set-based search method comprising a.

The method of claim 10,
Wherein L is greater than M and M is greater than N. 2.

Maintaining a POI database storing a plurality of words;
Maintaining a vowel tuple inverse array database for storing the words of the POI database including the vowel tuples corresponding to the vowel tuples constituting the plurality of words;
Maintaining a vowel triple inverse array database for storing the words of the POI database including the vowel triples corresponding to the vowel triples constituting the plurality of words; And
In response to an input of a query word, referring to the vowel tuple inverse array database and the vowel triple inverse array database, outputting a word corresponding to the query word as a search word;
Set-based search method comprising a.

The method of claim 12,
In the vowel tuple inverse array database, the vowel triple inverse array database, and the query word, a plurality of vowels having the same predetermined Hangul pronunciation are treated as one representative vowel of the plurality of vowels.

The method of claim 12,
The step of outputting a word corresponding to the query word as a search word,
Identifying a first candidate word corresponding to a vowel tuple that matches a left vowel combination associated with a second syllable based on a start syllable of the query by referring to the vowel tuple inverse array database;
Identifying a second candidate word corresponding to a vowel triple matching a right vowel combination that is associated with a third syllable based on the end syllable of the query by referring to the vowel triple inverse array database;
Identifying a word that is common to the first candidate word and the second candidate word as a third candidate word;
Identifying a fourth candidate word corresponding to a vowel triple matching a left vowel combination that is associated with a third syllable based on a starting syllable of the query by referring to the vowel triple inverse array database;
Identifying a fifth candidate word corresponding to a vowel tuple that matches the right vowel combination associated with a second syllable based on the end syllable of the query by referring to the vowel tuple inverse array database;
Identifying a word common to the fourth candidate word and the fifth candidate word as a sixth candidate word; And
Outputting the third candidate word and the sixth candidate word as the search word;
Set-based search method comprising a.

The method of claim 12,
When the output search word is plural,
The step of outputting a word corresponding to the query word as a search word,
Outputting the first syllable constituting the search word and the second syllable constituting the query word in a higher order
Set-based search method comprising a.

The method of claim 12,
When the output search word is plural,
The step of outputting a word corresponding to the query word as a search word,
Outputting the first syllable constituting the search word and the second syllable constituting the query word as the number of syllables that match the initial and neutral or the neutral and the final match increases
Set-based search method comprising a.

The method of claim 12,
When the output search word is plural,
The step of outputting a word corresponding to the query word as a search word,
A first step of increasing the matching score by L according to the number of matches between the first syllable constituting the search word and the second syllable constituting the query word;
The matching score is M according to the number of syllables in which the initial and the neutral syllables or the neutral and the final syllables are matched among the first syllable and the second syllable, except for the syllable associated with the increase of the matching score in the first stage. A second step of incrementing;
A third step of decreasing the matching score by N according to the number of the first syllables remaining, except for the syllables associated with the increase of the matching score in the first and second steps; And
Outputting a search word having a relatively high matching score in priority order
Set-based search method comprising a.

18. The method of claim 17,
Wherein L is greater than M and M is greater than N. 2.

A POI database for storing a plurality of words;
An inverse array database that stores first information about a word of the POI database including each syllable, corresponding to each syllable constituting the plurality of words;
A processor that, in response to an input of a query word, refers to the inverse array database and identifies first information corresponding to a syllable constituting the query word; And
An output unit for outputting a word of common first information including the syllables constituting the query word among the identified first information as a search word;
Set-based search device comprising a.

20. The method of claim 19,
The processor comprising:
And a set-based retrieval device for identifying only first information corresponding to the syllable "all" constituting the query.

20. The method of claim 19,
When the output search word is plural,
The output unit includes:
And a set-based retrieval device that outputs a higher order of matching between the syllables within two adjacent syllables constituting the search word and the order between the syllables within two adjacent syllables.

20. The method of claim 19,
The inverse array database stores word identifiers for each of the plurality of words as the first information.
The processor comprising:
Identify the word identifiers stored in the syllables constituting the query word in the inverse array database;
The output unit includes:
And a word set by the word identifier common to the word identifier of each syllable constituting the query word from the identified word identifiers, from the POI database as a search word.

20. The method of claim 19,
A vowel tuple inverse array database that stores second information about words in the POI database including the vowel tuples constituting the plurality of words, wherein the vowel tuples include the vowel tuples; And
A vowel triple inverse array database for storing second information about words in the POI database including the vowel triples corresponding to the vowel triples constituting the plurality of words.
Further comprising:
If there is no common word above,
The processor comprising:
Identifying second information corresponding to a syllable constituting the query by referring to the vowel tuple inverse array database and the vowel triple inverse array database,
The output unit includes:
And a set-based retrieval device for outputting a word of second information that is common among the identified second information as a search word.

24. The method of claim 23,
The processor comprising:
A second information corresponding to a vowel tuple or a vowel triple corresponding to each of the left vowel combinations associated with syllables up to m (the m is a natural number) based on the starting syllable of the query word; Identify from the vowel triple inverse array database,
The second information corresponding to the vowel triple or the vowel tuple corresponding to each of the right vowel combinations associated with the syllable up to n times (n is a natural number) based on the ending syllable of the query word, the vowel triple inverse array database or And a set based retrieval device for identifying from the vowel tuple inverse array database.

24. The method of claim 23,
The vowel tuple inverse array database and the vowel triple inverse array database store word identifiers for each of the plurality of words as the second information.
The processor comprising:
Word identifiers stored corresponding to the syllables constituting the query word are identified in the vowel tuple inverse array database and the vowel triple inverse array database,
The output unit includes:
And a word set specified by a common word identifier among the identified word identifiers, from the POI database as a search word.

24. The method of claim 23,
When the output search word is plural,
The output unit includes:
A first step of increasing the matching score by L according to the number of matches between the first syllable constituting the search word and the second syllable constituting the query word, and the syllable associated with the increase of the matching score in the first step. A second step of increasing the matching score by M according to the number of syllables in which the first and the second syllables match the initial and the neutral or the neutral and the finality of the first syllable and the second syllable, except for the first syllable; In step 2, a third step of decreasing the matching score by N according to the number of the first syllables remaining except for the syllables related to the increase of the matching score is performed, and prioritizing the search word having a relatively high matching score. Set-based search device to output.

The method of claim 26,
And L is greater than M and M is greater than N. 2.

A POI database for storing a plurality of words;
A vowel tuple inverse array database for storing words of the POI database including the vowel tuples constituting the plurality of words;
A vowel triple inverse array database that stores words of the POI database including the vowel triples corresponding to the vowel triples constituting the plurality of words; And
An output unit for outputting a word corresponding to the query word as a search word by referring to the vowel tuple inverse array database and the vowel triple inverse array database in response to the input of the query word
Set-based search device comprising a.

29. The method of claim 28,
In the vowel tuple inverse array database, the vowel triple inverse array database, and the query word, a plurality of vowels having the same predetermined Hangul pronunciation are processed as one representative vowel of the plurality of vowels.

29. The method of claim 28,
The output unit includes:
Identifying a first candidate word corresponding to a vowel tuple that matches a left vowel combination that is associated with a second syllable based on a beginning syllable of the query by referring to the vowel tuple inverse array database;
Identifying a second candidate word corresponding to a vowel triple matching a right vowel combination that is associated with a third syllable based on the end syllable of the query by referring to the vowel triple inverse array database;
Identifying a word that is common to the first candidate word and the second candidate word as a third candidate word,
Identifying a fourth candidate word corresponding to a vowel triple matching a left vowel combination that is associated with a third syllable based on a beginning syllable of the query by referring to the vowel triple inverse array database;
Identifying a fifth candidate word corresponding to a vowel tuple that matches the right vowel combination associated with a second syllable based on the end syllable of the query by referring to the vowel tuple inverse array database;
Identifying a word common to the fourth candidate word and the fifth candidate word as a sixth candidate word,
And a set-based search apparatus outputting the third candidate word and the sixth candidate word as the search word.

29. The method of claim 28,
When the output search word is plural,
The output unit includes:
And a set-based retrieval device for outputting the first syllable constituting the search word and the second syllable constituting the query word in a higher order.

29. The method of claim 28,
When the output search word is plural,
The output unit includes:
And a set-based retrieval device for the first syllable constituting the search word and the second syllable constituting the query word to be given priority as the number of syllables of which the initial and the neutral match or the neutral and the final match are greater.

29. The method of claim 28,
When the output search word is plural,
The output unit includes:
A first step of increasing the matching score by L according to the number of matches between the first syllable constituting the search word and the second syllable constituting the query word, and the syllable associated with the increase of the matching score in the first step. A second step of increasing the matching score by M according to the number of syllables in which the first and the second syllables match the initial and the neutral or the neutral and the finality of the first syllable and the second syllable, except for the first syllable; In step 2, a third step of decreasing the matching score by N according to the number of the first syllables remaining except for the syllables related to the increase of the matching score is performed, and prioritizing the search word having a relatively high matching score. Set-based search device to output.

34. The method of claim 33,
And L is greater than M and M is greater than N. 2.