KR102266072B1

KR102266072B1 - Electronic terminal device that performs speech recognition of a user based on location-based speech recognition learning model data and operating method thereof

Info

Publication number: KR102266072B1
Application number: KR1020190166435A
Authority: KR
Inventors: 최보람
Original assignee: 주식회사 한글과컴퓨터
Priority date: 2019-12-13
Filing date: 2019-12-13
Publication date: 2021-06-17

Abstract

Disclosed are an electronic terminal device which performs a speech recognition of a user based on location-based speech recognition learning model data, and an operating method thereof. The electronic terminal device which performs the speech recognition of a user based on location-based speech recognition learning model data and the operating method thereof according to the present invention construct the speech recognition learning model data for different locations according to the location characteristics of a pronunciation, and recognize the user's speech based on the speech recognition learning model data corresponding to user location information. Thus, the present invention is capable of providing a high speech recognition rate in performing a speech-based information search and the like. The electronic terminal device comprises: a data storage part; a location collection part; and a speech recognition part.

Description

ELECTRONIC TERMINAL DEVICE THAT PERFORMS SPEECH RECOGNITION OF A USER BASED ON LOCATION-BASED SPEECH RECOGNITION LEARNING MODEL DATA AND OPERATING METHOD THEREOF }

본 발명은 위치 기반의 음성 인식 학습모델 데이터를 기초로 사용자의 음성 인식을 수행하는 전자 단말 장치 및 그 동작 방법에 대한 것이다.The present invention relates to an electronic terminal device for performing voice recognition of a user based on location-based voice recognition learning model data and an operating method thereof.

최근, 인공지능 기능이 탑재된 전자 장비들이 출시됨에 따라 사용자가 전자 장비에 내장된 마이크로폰을 통해 음성을 입력하면, 사용자의 음성을 인식하여 해당 음성에 따른 기능의 실행을 제어하는 기술이 주목받고 있다.Recently, as electronic devices equipped with artificial intelligence functions have been released, when a user inputs a voice through a microphone built into the electronic device, a technology for recognizing the user's voice and controlling the execution of a function according to the voice is attracting attention. .

음성 인식 기술은 사용자로부터 음성이 입력되면, 음성의 특징을 분석하여 특징 벡터를 추출하고, 특징 벡터의 패턴을 인식한 후 패턴 인식 결과에 기반하여 사용자의 음성에 매칭되는 텍스트를 결과로 출력하는 기술을 의미한다.When a voice is input from a user, the voice recognition technology extracts a feature vector by analyzing the features of the voice, recognizes a pattern of the feature vector, and outputs a text matching the user's voice as a result based on the pattern recognition result means

이러한 음성 인식 기술은 다양한 음성 데이터를 수집하여 기계학습을 통해 소정의 음성 인식 학습모델 데이터로 구축해 둠으로써, 사용자에 의해 입력되는 음성에 매칭되는 텍스트를 생성하도록 구성된다.This voice recognition technology is configured to generate text matching the voice input by the user by collecting various voice data and building it as predetermined voice recognition learning model data through machine learning.

이때, 음성 인식에 사용되는 음성 인식 학습모델 데이터는 음성 인식이 활용되는 위치나 장소에 따라 다르게 구성될 수 있다. In this case, the voice recognition learning model data used for voice recognition may be configured differently according to a location or place where voice recognition is utilized.

예컨대, '부산'에서 음성 인식에 사용되는 음성 인식 학습모델 데이터는 '부산' 사투리에 따른 억양 및 언어 데이터들을 기반으로 기계학습됨으로써, '부산'에서 활용될 때, 일반적인 음성 인식 학습모델 데이터를 기초로 음성을 인식하는 경우보다 음성에 대한 인식률이 더 높게 나오도록 구축될 수 있다. For example, the voice recognition learning model data used for voice recognition in 'Busan' is machine-learned based on intonation and language data according to the 'Busan' dialect, so that when used in 'Busan', it is based on general voice recognition learning model data. It can be constructed so that the recognition rate for the voice is higher than that in the case of recognizing the voice with

이렇게, 위치나 장소에 따라 서로 다른 음성 인식 학습모델 데이터를 기초로 사용자의 음성을 인식할 수 있다면, 억양이 다르게 입력되거나, 특정 지역에서만 사용되는 단어가 입력됨에 따라 사용자가 의도한 정확한 단어나 문장이 결과물로 도출되지 못하는 문제를 예방할 수 있다. In this way, if the user's voice can be recognized based on the different voice recognition learning model data according to the location or place, the exact word or sentence intended by the user is input as the intonation is different or a word used only in a specific area is input. Problems that cannot be derived from this result can be prevented.

따라서, 위치 기반의 음성 인식 학습모델 데이터를 기초로 사용자의 음성 인식을 수행하는 기법에 대한 연구가 필요하다.Therefore, it is necessary to study a technique for performing the user's voice recognition based on the location-based voice recognition learning model data.

본 발명에 따른 위치 기반의 음성 인식 학습모델 데이터를 기초로 사용자의 음성 인식을 수행하는 전자 단말 장치 및 그 동작 방법은 발음의 위치적 특징에 따라 서로 다른 위치별 음성 인식 학습모델 데이터들을 구축하여, 사용자의 위치 정보에 대응되는 음성 인식 학습모델 데이터를 기초로 상기 사용자의 음성을 인식함으로써, 음성 기반의 정보 검색 등을 수행하는 데에 있어 높은 음성 인식률을 제공하고자 한다.According to the present invention, an electronic terminal device for performing voice recognition of a user based on location-based speech recognition learning model data and an operation method therefor are constructed by constructing different location-specific speech recognition learning model data according to the locational characteristics of pronunciation, By recognizing the user's voice based on the voice recognition learning model data corresponding to the user's location information, it is intended to provide a high voice recognition rate in performing voice-based information retrieval.

본 발명의 일실시예에 따른 위치 기반의 음성 인식 학습모델 데이터를 기초로 사용자의 음성 인식을 수행하는 전자 단말 장치는 음성 인식을 위해 미리 구축된 기본형 음성 인식 학습모델 데이터와 미리 정해진 복수의 위치 정보들 각각에 대응되는 위치별 음성 인식 학습모델 데이터가 저장되어 있는 데이터 저장부, 상기 사용자로부터 상기 전자 단말 장치에 내장된 마이크로폰을 통해 음성이 입력되면, GPS를 통해 상기 전자 단말 장치가 위치하는 지점에 대한 제1 위치 정보를 수집하는 위치 수집부 및 상기 복수의 위치 정보들 중 상기 제1 위치 정보에 따른 제1 지점과의 거리가 기설정된(predetermined) 거리 이내인 지점을 포함하는 적어도 하나의 제1 위치 정보가 존재하면, 상기 적어도 하나의 제1 위치 정보 중 상기 제1 지점과의 거리가 가장 가까운 지점을 포함하는 제2 위치 정보를 선택한 후, 상기 제2 위치 정보에 대응되는 제1 위치별 음성 인식 학습모델 데이터를 기초로 상기 사용자의 음성을 인식하고, 상기 복수의 위치 정보들 중 상기 제1 지점과의 거리가 상기 기설정된 거리 이내인 지점을 포함하는 상기 적어도 하나의 제1 위치 정보가 존재하지 않으면, 상기 기본형 음성 인식 학습모델 데이터를 기초로 상기 사용자의 음성을 인식하는 음성 인식부를 포함한다.An electronic terminal device for performing voice recognition of a user based on location-based voice recognition learning model data according to an embodiment of the present invention includes basic voice recognition learning model data built in advance for voice recognition and a plurality of predetermined location information A data storage unit in which voice recognition learning model data for each location corresponding to each of the positions is stored, and when a voice is input from the user through a microphone built into the electronic terminal device, the electronic terminal device is located at a location through GPS. At least one first including a location collecting unit that collects first location information about the location information and a point having a distance from a first point according to the first location information among the plurality of location information is within a predetermined distance If the location information exists, after selecting the second location information including a point closest to the first point from among the at least one piece of first location information, the first location-specific voice corresponding to the second location information The user's voice is recognized based on the recognition learning model data, and the at least one piece of first location information including a point having a distance from the first point among the plurality of location information is within the preset distance exists. Otherwise, a voice recognition unit for recognizing the user's voice based on the basic voice recognition learning model data is included.

또한, 본 발명의 일실시예에 따른 위치 기반의 음성 인식 학습모델 데이터를 기초로 사용자의 음성 인식을 수행하는 전자 단말 장치의 동작 방법은 음성 인식을 위해 미리 구축된 기본형 음성 인식 학습모델 데이터와 미리 정해진 복수의 위치 정보들 각각에 대응되는 위치별 음성 인식 학습모델 데이터가 저장되어 있는 데이터 저장부를 유지하는 단계, 상기 사용자로부터 상기 전자 단말 장치에 내장된 마이크로폰을 통해 음성이 입력되면, GPS를 통해 상기 전자 단말 장치가 위치하는 지점에 대한 제1 위치 정보를 수집하는 단계 및 상기 복수의 위치 정보들 중 상기 제1 위치 정보에 따른 제1 지점과의 거리가 기설정된 거리 이내인 지점을 포함하는 적어도 하나의 제1 위치 정보가 존재하면, 상기 적어도 하나의 제1 위치 정보 중 상기 제1 지점과의 거리가 가장 가까운 지점을 포함하는 제2 위치 정보를 선택한 후, 상기 제2 위치 정보에 대응되는 제1 위치별 음성 인식 학습모델 데이터를 기초로 상기 사용자의 음성을 인식하고, 상기 복수의 위치 정보들 중 상기 제1 지점과의 거리가 상기 기설정된 거리 이내인 지점을 포함하는 상기 적어도 하나의 제1 위치 정보가 존재하지 않으면, 상기 기본형 음성 인식 학습모델 데이터를 기초로 상기 사용자의 음성을 인식하는 단계를 포함한다.In addition, the operation method of the electronic terminal device for performing the user's voice recognition based on the location-based voice recognition learning model data according to an embodiment of the present invention is a basic voice recognition learning model data built in advance for voice recognition and maintaining a data storage unit storing voice recognition learning model data for each location corresponding to each of a plurality of predetermined location information; when a voice is input from the user through a microphone built into the electronic terminal device, the Collecting first location information on a point where the electronic terminal device is located, and at least one point including a point having a distance from a first point according to the first location information among the plurality of location information within a preset distance When there is first location information of , after selecting second location information including a point having the closest distance to the first point from among the at least one piece of first location information, the first location information corresponding to the second location information is selected. The at least one first location that recognizes the user's voice based on the location-specific speech recognition learning model data, and includes a point in which a distance from the first point among the plurality of location information is within the preset distance and if the information does not exist, recognizing the user's voice based on the basic voice recognition learning model data.

본 발명에 따른 위치 기반의 음성 인식 학습모델 데이터를 기초로 사용자의 음성 인식을 수행하는 전자 단말 장치 및 그 동작 방법은 발음의 위치적 특징에 따라 서로 다른 위치별 음성 인식 학습모델 데이터들을 구축하여, 사용자의 위치 정보에 대응되는 음성 인식 학습모델 데이터를 기초로 상기 사용자의 음성을 인식함으로써, 음성 기반의 정보 검색 등을 수행하는 데에 있어 높은 음성 인식률을 제공할 수 있다.According to the present invention, an electronic terminal device for performing voice recognition of a user based on location-based speech recognition learning model data and an operation method therefor are constructed by constructing different location-specific speech recognition learning model data according to the locational characteristics of pronunciation, By recognizing the user's voice based on the voice recognition learning model data corresponding to the user's location information, it is possible to provide a high voice recognition rate in performing voice-based information retrieval.

도 1은 본 발명의 일실시예에 따른 위치 기반의 음성 인식 학습모델 데이터를 기초로 사용자의 음성 인식을 수행하는 전자 단말 장치의 구조를 도시한 도면이다.
도 2는 본 발명의 일실시예에 따른 위치 기반의 음성 인식 학습모델 데이터를 기초로 사용자의 음성 인식을 수행하는 전자 단말 장치를 설명하기 위한 도면이다.
도 3은 본 발명의 일실시예에 따른 위치 기반의 음성 인식 학습모델 데이터를 기초로 사용자의 음성 인식을 수행하는 전자 단말 장치의 동작 방법을 도시한 순서도이다. 1 is a diagram illustrating a structure of an electronic terminal device for performing voice recognition of a user based on location-based voice recognition learning model data according to an embodiment of the present invention.
FIG. 2 is a diagram for explaining an electronic terminal device that performs voice recognition of a user based on location-based voice recognition learning model data according to an embodiment of the present invention.
3 is a flowchart illustrating a method of operating an electronic terminal device for performing voice recognition of a user based on location-based voice recognition learning model data according to an embodiment of the present invention.

이하에서는 본 발명에 따른 실시예들을 첨부된 도면을 참조하여 상세하게 설명하기로 한다. 이러한 설명은 본 발명을 특정한 실시 형태에 대해 한정하려는 것이 아니며, 본 발명의 사상 및 기술 범위에 포함되는 모든 변경, 균등물 내지 대체물을 포함하는 것으로 이해되어야 한다. 각 도면을 설명하면서 유사한 참조부호를 유사한 구성요소에 대해 사용하였으며, 다르게 정의되지 않는 한, 기술적이거나 과학적인 용어를 포함해서 본 명세서 상에서 사용되는 모든 용어들은 본 발명이 속하는 기술분야에서 통상의 지식을 가진 사람에 의해 일반적으로 이해되는 것과 동일한 의미를 가지고 있다.Hereinafter, embodiments according to the present invention will be described in detail with reference to the accompanying drawings. These descriptions are not intended to limit the present invention to specific embodiments, and should be understood to include all modifications, equivalents, and substitutes included in the spirit and scope of the present invention. While describing each drawing, like reference numerals are used for similar components, and unless otherwise defined, all terms used in this specification, including technical or scientific terms, refer to those of ordinary skill in the art to which the present invention belongs. It has the same meaning as is commonly understood by those who have it.

본 문서에서, 어떤 부분이 어떤 구성요소를 "포함"한다고 할 때, 이는 특별히 반대되는 기재가 없는 한 다른 구성요소를 제외하는 것이 아니라 다른 구성요소를 더 포함할 수 있다는 것을 의미한다. 또한, 본 발명의 다양한 실시예들에 있어서, 각 구성요소들, 기능 블록들 또는 수단들은 하나 또는 그 이상의 하부 구성요소로 구성될 수 있고, 각 구성요소들이 수행하는 전기, 전자, 기계적 기능들은 전자회로, 집적회로, ASIC(Application Specific Integrated Circuit) 등 공지된 다양한 소자들 또는 기계적 요소들로 구현될 수 있으며, 각각 별개로 구현되거나 2 이상이 하나로 통합되어 구현될 수도 있다. In this document, when a part "includes" a certain component, it means that other components may be further included, rather than excluding other components, unless otherwise stated. In addition, in various embodiments of the present invention, each of the components, functional blocks or means may be composed of one or more sub-components, and the electrical, electronic, and mechanical functions performed by each component are electronic. A circuit, an integrated circuit, an ASIC (Application Specific Integrated Circuit), etc. may be implemented with various well-known devices or mechanical elements, and may be implemented separately or two or more may be integrated into one.

한편, 첨부된 블록도의 블록들이나 흐름도의 단계들은 범용 컴퓨터, 특수용 컴퓨터, 휴대용 노트북 컴퓨터, 네트워크 컴퓨터 등 데이터 프로세싱이 가능한 장비의 프로세서나 메모리에 탑재되어 지정된 기능들을 수행하는 컴퓨터 프로그램 명령들(instructions)을 의미하는 것으로 해석될 수 있다. 이들 컴퓨터 프로그램 명령들은 컴퓨터 장치에 구비된 메모리 또는 컴퓨터에서 판독 가능한 메모리에 저장될 수 있기 때문에, 블록도의 블록들 또는 흐름도의 단계들에서 설명된 기능들은 이를 수행하는 명령 수단을 내포하는 제조물로 생산될 수도 있다. 아울러, 각 블록 또는 각 단계는 특정된 논리적 기능(들)을 실행하기 위한 하나 이상의 실행 가능한 명령들을 포함하는 모듈, 세그먼트 또는 코드의 일부를 나타낼 수 있다. 또, 몇 가지 대체 가능한 실시예들에서는 블록들 또는 단계들에서 언급된 기능들이 정해진 순서와 달리 실행되는 것도 가능함을 주목해야 한다. 예컨대, 잇달아 도시되어 있는 두 개의 블록들 또는 단계들은 실질적으로 동시에 수행되거나, 역순으로 수행될 수 있으며, 경우에 따라 일부 블록들 또는 단계들이 생략된 채로 수행될 수도 있다.On the other hand, the blocks in the accompanying block diagram or steps in the flowchart are computer program instructions that are loaded in the processor or memory of equipment capable of data processing, such as general-purpose computers, special-purpose computers, portable notebook computers, and network computers, and perform specified functions can be interpreted as meaning Since these computer program instructions may be stored in a memory provided in a computer device or in a memory readable by a computer, the functions described in the blocks of the block diagrams or the steps of the flowcharts are produced as articles of manufacture containing instruction means for performing the same. could be In addition, each block or each step may represent a module, segment, or portion of code comprising one or more executable instructions for executing the specified logical function(s). It should also be noted that, in some alternative embodiments, it is also possible for the functions recited in blocks or steps to be executed out of the prescribed order. For example, two blocks or steps shown in succession may be performed substantially simultaneously or in the reverse order, and in some cases, some blocks or steps may be omitted.

도 1은 본 발명의 일실시예에 따른 위치 기반의 음성 인식 학습모델 데이터를 기초로 사용자의 음성 인식을 수행하는 전자 단말 장치의 구조를 도시한 도면이다. 1 is a diagram illustrating a structure of an electronic terminal device for performing voice recognition of a user based on location-based voice recognition learning model data according to an embodiment of the present invention.

도 1을 참조하면, 본 발명의 일실시예에 따른 위치 기반의 음성 인식 학습모델 데이터를 기초로 사용자의 음성 인식을 수행하는 전자 단말 장치(110)는 데이터 저장부(111), 위치 수집부(112) 및 음성 인식부(113)를 포함한다. Referring to FIG. 1 , the electronic terminal device 110 for performing voice recognition of a user based on location-based voice recognition learning model data according to an embodiment of the present invention includes a data storage unit 111 and a location collection unit ( 112) and a voice recognition unit 113.

데이터 저장부(111)에는 음성 인식을 위해 미리 구축된 기본형 음성 인식 학습모델 데이터와 미리 정해진 복수의 위치 정보들 각각에 대응되는 위치별 음성 인식 학습모델 데이터가 저장되어 있다.The data storage unit 111 stores basic voice recognition learning model data built in advance for voice recognition and voice recognition learning model data for each location corresponding to each of a plurality of predetermined location information.

여기서, 음성 인식 학습모델 데이터란, 사용자로부터 입력된 음성을 인식하기 위해, 다수의 단어들을 기반으로 기계학습을 통해 구축된 확률적 모델 데이터를 의미하고, 상기 위치별 음성 인식 학습모델 데이터란, 상기 복수의 위치 정보들 각각에 대해, 해당 위치의 지역 방언이나 자주 쓰이는 단어들을 기반으로 기계학습됨으로써, 해당 위치에서 더 높은 음성 인식률을 가질 수 있도록 구축된 확률적 모델 데이터를 의미한다. Here, the voice recognition learning model data means probabilistic model data constructed through machine learning based on a plurality of words in order to recognize the voice input from the user, and the location-specific voice recognition learning model data is, For each of the plurality of location information, it means probabilistic model data constructed to have a higher speech recognition rate at the location by machine learning based on the local dialect or frequently used words of the location.

예컨대, 상기 복수의 위치 정보들에 따른 위치들을 '부산', '울산', '광주', '제주'라고 가정하는 경우, 데이터 저장부(111)에는 기본형 음성 인식 학습모델 데이터와 '부산', '울산', '광주', '제주' 각각에 대응되는 위치별 음성 인식 학습모델 데이터가 저장되어 있을 수 있다.For example, if it is assumed that the locations according to the plurality of location information are 'Busan', 'Ulsan', 'Gwangju', and 'Jeju', the data storage unit 111 contains basic voice recognition learning model data and 'Busan', Voice recognition learning model data for each location corresponding to each of 'Ulsan', 'Gwangju', and 'Jeju' may be stored.

위치 수집부(112)는 상기 사용자로부터 전자 단말 장치(110)에 내장된 마이크로폰을 통해 음성이 입력되면, GPS를 통해 전자 단말 장치(110)가 위치하는 지점에 대한 제1 위치 정보를 수집한다.When a voice is input from the user through a microphone built into the electronic terminal apparatus 110 , the location collecting unit 112 collects first location information on a point where the electronic terminal apparatus 110 is located through GPS.

예컨대, 전자 단말 장치(110)의 위치 정보(위도, 경도)가 '(35.31, 129.18)'이라고 가정하면, 위치 수집부(112)는 상기 사용자로부터 전자 단말 장치(110)에 내장된 마이크로폰을 통해 음성이 입력되는 경우, GPS를 통해 전자 단말 장치(110)가 위치하는 지점에 대한 제1 위치 정보인 '(35.31, 129.18)'을 수집할 수 있다.For example, assuming that the location information (latitude, longitude) of the electronic terminal device 110 is '(35.31, 129.18)', the location collecting unit 112 receives the user from the user through the microphone built into the electronic terminal device 110 . When a voice is input, '(35.31, 129.18)', which is first location information on a point where the electronic terminal device 110 is located, may be collected through GPS.

음성 인식부(113)는 상기 복수의 위치 정보들 중 상기 제1 위치 정보에 따른 제1 지점과의 거리가 기설정된(predetermined) 거리 이내인 지점을 포함하는 적어도 하나의 제1 위치 정보가 존재하면, 상기 적어도 하나의 제1 위치 정보 중 상기 제1 지점과의 거리가 가장 가까운 지점을 포함하는 제2 위치 정보를 선택한 후, 상기 제2 위치 정보에 대응되는 제1 위치별 음성 인식 학습모델 데이터를 기초로 상기 사용자의 음성을 인식하고, 상기 복수의 위치 정보들 중 상기 제1 지점과의 거리가 상기 기설정된 거리 이내인 지점을 포함하는 상기 적어도 하나의 제1 위치 정보가 존재하지 않으면, 상기 기본형 음성 인식 학습모델 데이터를 기초로 상기 사용자의 음성을 인식한다.If there is at least one piece of first location information including a point in which the distance from the first point according to the first location information is within a predetermined distance among the plurality of location information, the voice recognition unit 113 is , After selecting second location information including a point closest to the first point from among the at least one piece of first location information, the first location-specific voice recognition learning model data corresponding to the second location information If the user's voice is recognized based on the at least one piece of first location information including a point having a distance from the first point among the plurality of location information within the preset distance, the basic type The user's voice is recognized based on the voice recognition learning model data.

관련하여, 도 2에서 도면부호 211은 앞선 예에서의 전자 단말 장치(110)가 위치하는 지점(211)에 대한 상기 제1 위치 정보인 '(35.31, 129.18)'을 나타내고, 도면부호 212로 표시된 파란 원의 내부 영역은 '울산'에 대한 위치 정보에 따른 영역(212), 도면부호 213으로 표시된 파란 원의 내부 영역은 '부산'에 대한 위치 정보에 따른 영역(213), 도면부호 214로 표시된 파란 원의 내부 영역은 '광주'에 대한 위치 정보에 따른 영역(214), 도면부호 215로 표시된 파란 원의 내부 영역은 '제주'에 대한 위치 정보에 따른 영역(215), 도면부호 216으로 표시된 거리는 기설정된 거리(216)를 나타낸다고 가정하자.In relation to this, reference numeral 211 in FIG. 2 denotes '(35.31, 129.18)', which is the first location information on the point 211 where the electronic terminal device 110 is located in the previous example, and is denoted by reference numeral 212. The inner area of the blue circle is the area 212 according to the location information for 'Ulsan', and the inner area of the blue circle indicated by the reference number 213 is the area 213 according to the location information for 'Busan' and the area indicated by the reference number 214. The inner area of the blue circle is the area 214 according to the location information for 'Gwangju', and the inner area of the blue circle indicated by the reference number 215 is the area 215 according to the location information for 'Jeju' and the area indicated by the reference number 216. Assume that the distance represents a preset distance 216 .

도 2를 참조하여, 앞선 예에 따라 음성 인식부(113)의 동작을 설명하면, 음성 인식부(113)는 상기 제1 위치 정보인 '(35.31, 129.18)'에 따른 지점(211)과의 거리가 기설정된 거리(216) 이내인 지점(영역)을 의미하는 빨간 원의 내부 영역에 '울산'에 대한 위치 정보에 따른 영역(212)과 '부산'에 대한 위치 정보에 따른 영역(213)이 겹쳐지므로, '부산', '울산', '광주', '제주'에 대한 위치 정보들 중 '(35.31, 129.18)'에 따른 지점(211)과의 거리가 기설정된 거리(216) 이내인 지점을 포함하는 적어도 하나의 제1 위치 정보로 '울산'에 대한 위치 정보와 '부산'에 대한 위치 정보가 존재하는 것을 확인할 수 있다.Referring to FIG. 2 , when the operation of the voice recognition unit 113 is described according to the previous example, the voice recognition unit 113 is configured to connect with the point 211 according to the first location information '(35.31, 129.18)'. An area 212 according to the location information for 'Ulsan' and an area 213 according to the location information for 'Busan' in the inner area of the red circle indicating a point (area) within the predetermined distance 216 . overlap, the distance to the point 211 according to '(35.31, 129.18)' among the location information for 'Busan', 'Ulsan', 'Gwangju', and 'Jeju' is within the preset distance 216 It may be confirmed that the location information on 'Ulsan' and the location information on 'Busan' exist as at least one piece of first location information including the point.

그리고, 음성 인식부(113)는 '울산'에 대한 위치 정보에 따른 영역(212)과 '부산'에 대한 위치 정보에 따른 영역(213) 중 '(35.31, 129.18)'에 따른 지점(211)과의 거리가 더 가까운 지점을 포함하는 제2 위치 정보로 '울산'에 대한 위치 정보를 선택한 후, '울산'에 대한 위치 정보에 대응되는 '위치별 음성 인식 학습모델 데이터 1'을 기초로 상기 사용자의 음성을 인식할 수 있다. And, the voice recognition unit 113 is a point 211 according to '(35.31, 129.18)' among the area 212 according to the location information for 'Ulsan' and the area 213 according to the location information for 'Busan'. After selecting the location information for 'Ulsan' as the second location information including a point that is closer to the location, based on 'Voice recognition learning model data 1 for each location' corresponding to the location information for 'Ulsan' It can recognize the user's voice.

만약, '부산', '울산', '광주', '제주'에 대한 위치 정보들 중 상기 제1 위치 정보인 '(35.31, 129.18)'에 따른 지점(211)과의 거리가 기설정된 거리(216) 이내인 지점을 포함하는 적어도 하나의 제1 위치 정보가 존재하지 않으면, 음성 인식부(113)는 상기 기본형 음성 인식 학습모델 데이터를 기초로 상기 사용자의 음성을 인식할 수 있다.If the distance to the point 211 according to '(35.31, 129.18)', which is the first location information, among the location information for 'Busan', 'Ulsan', 'Gwangju', and 'Jeju' is a preset distance ( 216), the voice recognition unit 113 may recognize the user's voice based on the basic voice recognition learning model data if at least one piece of first location information including a point within the range does not exist.

이때, 본 발명의 일실시예에 따르면, 데이터 저장부(111)에는 상기 복수의 위치 정보들 각각에 대응되는 위치별 소음 제거 학습모델 데이터가 추가로 저장되어 있을 수 있고, 음성 인식부(113)는 상기 복수의 위치 정보들 중 상기 제1 지점과의 거리가 상기 기설정된 거리 이내인 지점을 포함하는 상기 적어도 하나의 제1 위치 정보가 존재하면, 상기 적어도 하나의 제1 위치 정보 중 상기 제1 지점과의 거리가 가장 가까운 지점을 포함하는 상기 제2 위치 정보를 선택한 후, 상기 제2 위치 정보에 대응되는 제1 소음 제거 학습모델 데이터를 기초로, 상기 사용자의 음성에서 소음을 제거하고, 상기 제2 위치 정보에 대응되는 상기 제1 위치별 음성 인식 학습모델 데이터를 기초로 상기 사용자의 음성을 인식할 수 있다.At this time, according to an embodiment of the present invention, the data storage unit 111 may additionally store noise reduction learning model data for each location corresponding to each of the plurality of location information, and the voice recognition unit 113 . If the at least one piece of first location information including a point where the distance from the first point is within the preset distance among the plurality of pieces of location information exists, the first piece of the at least one piece of first location information After selecting the second location information including the point having the closest distance to the point, based on the first noise reduction learning model data corresponding to the second location information, the noise is removed from the user's voice, and the The user's voice may be recognized based on the first position-specific voice recognition learning model data corresponding to the second position information.

여기서, 상기 위치별 소음 제거 학습모델 데이터란, 사용자로부터 입력된 음성 내에 포함된 소음을 제거하기 위한 확률적 모델 데이터로, 상기 복수의 위치 정보들 각각에 대해, 해당 위치에서 자주 발생하는 소음을 추출하여 제거할 수 있도록 기계학습된 확률적 모델 데이터를 의미한다. Here, the location-specific noise removal learning model data is probabilistic model data for removing noise included in the voice input from the user, and for each of the plurality of location information, noise frequently occurring at the location is extracted. It means probabilistic model data that has been machine-learned so that it can be removed.

예컨대, 데이터 저장부(111)에는 '울산'에 대한 위치 정보에 대응되는 '위치별 소음 제거 학습모델 데이터 1'이 추가로 저장되어 있을 수 있고, 상기 '위치별 소음 제거 학습모델 데이터 1'은 상기 사용자로부터 입력된 음성 내에서 '울산'에서 발생할 수 있는 소음인 파도 및 선박 소음을 제거할 수 있도록 구축될 수 있다. For example, 'noise removal learning model data 1 for each location' corresponding to location information on 'Ulsan' may be additionally stored in the data storage unit 111, and the 'noise removal learning model data 1 for each location' may be stored in the data storage unit 111. It can be constructed so as to remove waves and ship noise, which are noises that may occur in 'Ulsan' in the voice input from the user.

그러면, 음성 인식부(113)는 앞선 예와 같이, 상기 제2 위치 정보로 '울산'에 대한 위치 정보가 선택된 경우, '울산'에 대한 위치 정보에 대응되는 '위치별 소음 제거 학습모델 데이터 1'을 기초로, 상기 사용자의 음성에서 '울산'에서 발생할 수 있는 파도 및 선박 소음을 제거한 후, '울산'에 대한 위치 정보에 대응되는 '위치별 음성 인식 학습모델 데이터 1'을 기초로 상기 사용자의 음성을 인식할 수 있다.Then, as in the previous example, when the location information for 'Ulsan' is selected as the second location information, as in the previous example, the voice recognition unit 113 performs 'noise removal learning model data 1 for each location' corresponding to the location information for 'Ulsan'. After removing waves and ship noise that may occur in 'Ulsan' from the user's voice based on ', based on 'Voice recognition learning model data 1 for each location' corresponding to the location information for 'Ulsan', the user can recognize the voice of

또한, 본 발명의 일실시예에 따르면, 전자 단말 장치(110)는 비콘 정보 데이터베이스(114), 검색어 데이터베이스(115), 특성 벡터 생성부(116), 벡터 유사도 연산부(117), 검색어 세트 추출부(118), 검색어 출력부(119) 및 음성 검색부(120)를 더 포함할 수 있다.In addition, according to an embodiment of the present invention, the electronic terminal device 110 includes a beacon information database 114 , a search word database 115 , a feature vector generator 116 , a vector similarity calculator 117 , and a search word set extractor 118 , a search word output unit 119 , and a voice search unit 120 may be further included.

비콘 정보 데이터베이스(114)에는 미리 정해진 복수의 오프라인 상점들에 사전 비치되어 있는 복수의 비콘(Beacon)들 각각에 대응하는 비콘 식별자와, 상기 복수의 비콘들 각각이 비치된 상점의 업종 카테고리에 따라 각 비콘에 대응하는 미리 설정된 고유 번호가 서로 대응되어 저장되어 있다.The beacon information database 114 includes a beacon identifier corresponding to each of a plurality of beacons that are provided in advance in a plurality of offline stores, and each of the plurality of beacons according to the industry category of the store in which each of the beacons is provided. Pre-set unique numbers corresponding to beacons are stored in correspondence with each other.

여기서, 상기 복수의 오프라인 상점들이란 다양한 상품을 판매하거나 서비스를 제공하는 오프라인 상에 존재하는 상점 등을 의미한다.Here, the plurality of offline stores refers to stores existing in offline stores that sell various products or provide services.

그리고, 비콘이란 근거리에 있는 모바일 기기를 인식하여 데이터를 전송할 수 있는 무선 통신 장치를 의미한다. 근거리 무선 통신인 NFC(Near Field Communication)가 10cm 이내의 거리에서만 사용할 수 있다면, 상기 비콘은 최대 70m 거리에서까지 동작이 가능할 수 있다.In addition, the beacon refers to a wireless communication device capable of transmitting data by recognizing a mobile device in a short distance. If NFC (Near Field Communication), which is short-range wireless communication, can be used only at a distance of less than 10 cm, the beacon may operate at a distance of up to 70 m.

또한, 고유 번호란, 상기 복수의 비콘들 각각에 대응되어 미리 할당된 번호로, 상기 복수의 오프라인 상점들에 사전 비치되어 있는 각각의 비콘을 해당 상점의 업종 카테고리에 따라 구분하기 위하여 개발자에 의해 사전 부여된 번호를 의미한다. 예를 들어, 업종 카테고리가 '영화관'이라면 '1'에서 '10' 사이의 고유 번호가 부여될 수 있고, 업종 카테고리가 '스포츠 의류 매장'이라면 '11'에서 '20' 사이의 고유 번호가 부여될 수 있다.In addition, the unique number is a number assigned in advance to correspond to each of the plurality of beacons, and is pre-arranged by a developer in order to classify each beacon previously provided in the plurality of offline stores according to the business category of the store. the assigned number. For example, if the industry category is 'cinema', a unique number between '1' and '10' may be assigned, and if the industry category is 'sports clothing store', a unique number between '11' and '20' is assigned. can be

따라서, 비콘 정보 데이터베이스(114)에는 하기의 표 1와 같이 정보가 저장되어 있을 수 있다.Accordingly, information may be stored in the beacon information database 114 as shown in Table 1 below.

비콘 식별자 beacon identifier 고유 번호unique number 비콘 식별자 1Beacon Identifier 1 55 비콘 식별자 2Beacon Identifier 2 77 비콘 식별자 3Beacon Identifier 3 88 비콘 식별자 4Beacon Identifier 4 1111 비콘 식별자 5Beacon Identifier 5 1515 ...... ......

검색어 데이터베이스(115)에는 복수의 검색어 세트들과 상기 복수의 검색어 세트들 각각에 대해 미리 설정된 n(n은 2이상의 자연수)차원의 기준 벡터가 서로 대응되어 저장되어 있다.In the search word database 115, a plurality of search word sets and an n (n is a natural number equal to or greater than 2) dimensional reference vector preset for each of the plurality of search word sets are stored in correspondence with each other.

여기서, 상기 복수의 검색어 세트들 각각은 각 검색어 세트에 대응하는 미리 설정된 주제에 연관된 서로 다른 복수의 검색어들로 구성된 집합을 의미한다.Here, each of the plurality of search word sets means a set composed of a plurality of different search words related to a preset topic corresponding to each search word set.

예컨대, n이 '5'라고 가정하면, 검색어 데이터베이스(115)에는 하기의 표 2와 같이 정보가 저장되어 있을 수 있다.For example, assuming that n is '5', information may be stored in the search word database 115 as shown in Table 2 below.

복수의 검색어 세트들(주제)Multiple sets of search terms (topic) 복수의 검색어들multiple search terms 기준 벡터reference vector 검색어 세트 1
(영화관)Search term set 1
(Movie Theaters) 근처 영화관nearby cinema [3 5 8 9 10][3 5 8 9 10] 영화 추천Movie Recommendations 영화 예매movie ticket 현재 상영 영화 순위Current movie rankings 개봉 예정 영화upcoming movies 검색어 세트 2
(스포츠 의류 매장)search term set 2
(Sports Clothing Store) 겨울 스포츠 의류winter sports clothing [13 17 11 19 20][13 17 11 19 20] 스포츠 의류 광고sports apparel advertising 기능성웨어functional wear 요가복yoga clothes 스포츠 의류 매장 순위 Sports Clothing Store Ranking

특성 벡터 생성부(116)는 상기 사용자에 의해 음성 기반의 정보 검색 기능의 실행 명령이 인가되면, 상기 복수의 비콘들 중 전자 단말 장치(110)에서 수신이 확인되는 비콘 신호를 갖는 n개 이상의 비콘들이 존재하는지 확인하고, 상기 n개 이상의 비콘들이 존재하는 경우, 상기 n개 이상의 비콘들 중 최대 강도의 비콘 신호를 갖는 n개의 비콘들을 순차적으로 선택하고, 비콘 정보 데이터베이스(114)로부터 상기 선택된 n개의 비콘들 각각에 대한 비콘 식별자에 대응되어 저장되어 있는 n개의 고유 번호들을 추출하여, 상기 n개의 고유 번호들을 성분으로 포함하는 n차원의 특성 벡터를 생성한다.The feature vector generator 116 is configured to include n or more beacons having beacon signals whose reception is confirmed by the electronic terminal device 110 among the plurality of beacons when a command for executing a voice-based information search function is applied by the user. is checked, and when there are more than n beacons, n beacons having the highest intensity beacon signal are sequentially selected from among the n or more beacons, and the selected n beacons are selected from the beacon information database 114 . By extracting n unique numbers stored corresponding to beacon identifiers for each beacon, an n-dimensional characteristic vector including the n unique numbers as components is generated.

예컨대, 도 1에서 도면부호 131 내지 도면부호 136으로 도시된 비콘들은 복수의 오프라인 상점들(141, 142, 143, 144, 145, 146)에 비치되어 있는 상기 복수의 비콘들(131, 132, 133, 134, 135, 136)이며, 앞선 예에 따라 n은 '5'라고 가정하자.For example, the beacons indicated by reference numerals 131 to 136 in FIG. 1 are the plurality of beacons 131 , 132 and 133 provided in the plurality of offline stores 141 , 142 , 143 , 144 , 145 and 146 . , 134, 135, 136), and suppose that n is '5' according to the previous example.

이때, 복수의 비콘들(131, 132, 133, 134, 135, 136) 중 전자 단말 장치(110)에서 수신이 확인되는 비콘 신호를 갖는 비콘들로 '비콘 1'(131), '비콘 2'(132), '비콘 3'(133), '비콘 4'(134), '비콘 5(135)'가 존재한다고 하면, 특성 벡터 생성부(116)는 상기 사용자에 의해 음성 기반의 정보 검색 기능의 실행 명령이 인가되는 경우, 복수의 비콘들(131, 132, 133, 134, 135, 136) 중 전자 단말 장치(110)에서 수신이 확인되는 비콘 신호를 갖는 '5'개 이상의 비콘들로 '비콘 1'(131), '비콘 2'(132), '비콘 3'(133)', '비콘 4'(134), '비콘 5'(135)가 존재하는 것을 확인할 수 있고, '비콘 1'(131), '비콘 2'(132), '비콘 3'(133), '비콘 4'(134), '비콘 5'(135)를 최대 강도의 비콘 신호를 갖는 순서에 따라 순차적으로 선택할 수 있다. At this time, among the plurality of beacons 131 , 132 , 133 , 134 , 135 , and 136 , the beacons having a beacon signal for which reception is confirmed by the electronic terminal device 110 are 'beacon 1' 131 and 'beacon 2'. If 132, 'beacon 3' 133, 'beacon 4' 134, and 'beacon 5 (135)' exist, the feature vector generator 116 performs a voice-based information search function by the user. When the execution command of ' is applied, '5' or more beacons having a beacon signal whose reception is confirmed by the electronic terminal device 110 among the plurality of beacons 131 , 132 , 133 , 134 , 135 , 136 are ' It can be confirmed that beacon 1 '131, 'beacon 2' 132, 'beacon 3' 133', 'beacon 4' 134, and 'beacon 5' 135 exist, and 'beacon 1' '131, 'Beacon 2' (132), 'Beacon 3' (133), 'Beacon 4' (134), and 'Beacon 5' (135) are sequentially selected in the order with the highest strength beacon signal. can

이때, '비콘 1'(131)의 비콘 식별자를 '비콘 식별자 1', '비콘 2'(132)의 비콘 식별자를 '비콘 식별자 2', '비콘 3'(133)의 비콘 식별자를 '비콘 식별자 3', '비콘 4'(134)의 비콘 식별자를 '비콘 식별자 4', '비콘 5'(135)의 비콘 식별자를 '비콘 식별자 5'라고 하는 경우, 특성 벡터 생성부(116)는 상기 표 1과 같은 비콘 정보 데이터베이스(114)로부터 '비콘 식별자 1'에 대응되어 저장되어 있는 고유 번호 '5', '비콘 식별자 2'에 대응되어 저장되어 있는 고유 번호 '7', '비콘 식별자 3'에 대응되어 저장되어 있는 고유 번호 '8', '비콘 식별자 4'에 대응되어 저장되어 있는 고유 번호 '11', '비콘 식별자 5'에 대응되어 저장되어 있는 고유 번호 '15'를 추출할 수 있다. At this time, the beacon identifier of 'beacon 1' 131 is 'beacon identifier 1', the beacon identifier of 'beacon 2' 132 is 'beacon identifier 2', and the beacon identifier of 'beacon 3' 133 is 'beacon identifier' When the beacon identifier of 3' and 'beacon 4' 134 is referred to as 'beacon identifier 4' and the beacon identifier of 'beacon 5' 135 is referred to as 'beacon identifier 5', the characteristic vector generator 116 is From the beacon information database 114 such as 1, unique numbers '5' stored in correspondence to 'beacon identifier 1', unique numbers '7' stored in correspondence to 'beacon identifier 2', and 'beacon identifier 3' It is possible to extract the unique number '8' stored in correspondence with the unique number '8', the unique number '11' stored corresponding to the 'beacon identifier 4', and the unique number '15' stored corresponding to the 'beacon identifier 5'.

이렇게, '5'개의 고유 번호들이 추출되면, 특성 벡터 생성부(116)는 상기 '5'개의 고유 번호들을 성분으로 포함하는 '5'차원의 특성 벡터 '[5 7 8 11 15]'를 생성할 수 있다.In this way, when '5' unique numbers are extracted, the feature vector generator 116 generates a '5'-dimensional feature vector '[5 7 8 11 15]' including the '5' unique numbers as components. can do.

벡터 유사도 연산부(117)는 상기 n차원의 특성 벡터가 생성되면, 검색어 데이터베이스(115)에 저장되어 있는 상기 복수의 검색어 세트들 각각에 대한 n차원의 기준 벡터와 상기 n차원의 특성 벡터 사이의 벡터 유사도를 연산한다.When the n-dimensional feature vector is generated, the vector similarity calculator 117 generates a vector between the n-dimensional reference vector for each of the plurality of search word sets stored in the search word database 115 and the n-dimensional feature vector. Calculate the similarity.

이때, 본 발명의 일실시예에 따르면, 상기 복수의 검색어 세트들 각각에 대한 n차원의 기준 벡터와 상기 n차원의 특성 벡터 사이의 상기 벡터 유사도의 연산은 하기의 수학식 1에 따라 수행될 수 있다.In this case, according to an embodiment of the present invention, the calculation of the vector similarity between the n-dimensional reference vector and the n-dimensional feature vector for each of the plurality of search word sets may be performed according to Equation 1 below. have.

여기서, M은 두 벡터 사이의 벡터 유사도로, S는 두 벡터 사이의 코사인 유사도, D는 두 벡터 사이의 유클리드 거리(Euclidean Distance)를 의미하고, 상기 두 벡터 사이의 코사인 유사도 S와 상기 두 벡터 사이의 유클리드 거리 D는 하기의 수학식 2와 하기의 수학식 3에 따라 연산될 수 있다.Here, M is the vector similarity between two vectors, S is the cosine similarity between the two vectors, D is the Euclidean distance between the two vectors, and the cosine similarity between the two vectors S and the two vectors The Euclidean distance D of can be calculated according to Equation 2 and Equation 3 below.

여기서, S는 벡터 A와 B 사이의 코사인 유사도로 -1에서 1사이의 값을 가지며, 그 값이 클수록 유사한 벡터임을 의미하고, A_i는 벡터 A의 i번째 성분, B_i는 벡터 B의 i번째 성분을 의미한다.Here, S is the cosine similarity between vectors A and B, and has a value between -1 and 1, and a larger value means a similar vector, A _i is the i-th component of the vector A, and B _i is the i of the vector B means the second component.

상기 수학식 3에서 D는 유클리드 거리, A_i와 B_i는 두 벡터에 포함되어 있는 i번째 성분들을 의미한다. 보통, 두 벡터 간의 유클리드 거리가 작을수록 두 벡터는 유사한 벡터라고 볼 수 있고, 두 벡터 간의 유클리드 거리가 클수록 두 벡터는 비유사한 벡터라고 볼 수 있다.In Equation 3, D denotes a Euclidean distance, and A _i and B _i denote i-th components included in the two vectors. In general, as the Euclidean distance between two vectors is small, the two vectors can be regarded as similar vectors, and as the Euclidean distance between two vectors increases, the two vectors can be regarded as dissimilar vectors.

예컨대, 벡터 유사도 연산부(117)가 상기의 표 2를 참고하여 검색어 데이터베이스(115) 상의 '검색어 세트 1(영화관)'에 대응되어 저장되어 있는 '5'차원의 기준 벡터 '[3 5 8 9 10]'와 상기 '5'차원의 특성 벡터 '[5 7 8 11 15]' 사이의 상기 벡터 유사도를 연산한다고 하는 경우, 두 벡터 사이의 코사인 유사도 S와 유클리드 거리 D의 연산은 하기의 수학식 4와 하기의 수학식 5와 같이 연산될 수 있다.For example, the vector similarity calculator 117 refers to Table 2 above and stores the '5'-dimensional reference vector '[3 5 8 9 10] in correspondence with 'search word set 1 (movie theater)' on the search word database 115 . ]' and the '5'-dimensional feature vector '[5 7 8 11 15]', the calculation of the cosine similarity S and the Euclidean distance D between the two vectors is calculated by the following Equation 4 and can be calculated as in Equation 5 below.

벡터 유사도 연산부(117)가 상기의 수학식 4와 상기의 수학식 5와 같이, '검색어 세트 1(영화관)'에 대한 '5'차원의 기준 벡터 '[3 5 8 9 10]'와 상기 '5'차원의 특성 벡터 '[5 7 8 11 15]' 사이의 코사인 유사도 S와 유클리드 거리 D를 도출하였다면, 벡터 유사도 연산부(117)는 하기의 수학식 6과 같이 벡터 유사도 M을 연산할 수 있다. As shown in Equation 4 and Equation 5 above, the vector similarity calculating unit 117 calculates the '5'-dimensional reference vector '[3 5 8 9 10]' for 'search word set 1 (movie theater)' and the ' If the cosine similarity S and the Euclidean distance D between the 5'-dimensional feature vector '[5 7 8 11 15]' are derived, the vector similarity calculating unit 117 may calculate the vector similarity M as shown in Equation 6 below. .

검색어 세트 추출부(118)는 상기 복수의 검색어 세트들 각각에 대한 n차원의 기준 벡터 중 상기 n차원의 특성 벡터와의 상기 벡터 유사도가 최대로 연산되는 제1 기준 벡터를 선택하고, 검색어 데이터베이스(115)로부터 상기 제1 기준 벡터에 대응되어 저장되어 있는 제1 검색어 세트를 추출한다.The search word set extracting unit 118 selects a first reference vector in which the vector similarity with the n-dimensional feature vector is calculated at a maximum from among the n-dimensional reference vectors for each of the plurality of search word sets, and a search word database ( 115), a first set of search words stored in correspondence with the first reference vector is extracted.

이렇게, 상기 제1 검색어 세트가 추출되면, 검색어 출력부(119)는 상기 제1 검색어 세트를 구성하는 복수의 제1 검색어들을 추천 검색어로 화면 상에 출력한다.In this way, when the first search word set is extracted, the search word output unit 119 outputs a plurality of first search words constituting the first search word set as recommended search words on the screen.

예컨대, 벡터 유사도 연산부(117)가 상기 복수의 검색어 세트들 각각에 대한 '5'차원의 기준 벡터와 상기 '5'차원의 특성 벡터 '[5 7 8 11 15]' 사이의 벡터 유사도를 연산한 결과, 상기 복수의 검색어 세트들 각각에 대한 '5'차원의 기준 벡터 중 상기 '5'차원의 특성 벡터 '[5 7 8 11 15]'와의 상기 벡터 유사도가 최대로 연산되는 제1 기준 벡터를 '[3 5 8 9 10]' 라고 가정하자. For example, the vector similarity calculator 117 calculates the vector similarity between the '5'-dimensional reference vector and the '5'-dimensional feature vector '[5 7 8 11 15]' for each of the plurality of search word sets. As a result, a first reference vector in which the vector similarity with the '5'-dimensional feature vector '[5 7 8 11 15]' among the '5'-dimensional reference vectors for each of the plurality of search word sets is maximized is selected. Assume '[3 5 8 9 10]'.

이때, 검색어 세트 추출부(118)는 상기 벡터 유사도가 최대로 연산되는 제1 기준 벡터로서 '[3 5 8 9 10]'을 선택할 수 있고, 검색어 데이터베이스(115)로부터 기준 벡터 '[3 5 8 9 10]'에 대응되는 상기 제1 검색어 세트로서 '검색어 세트 1(영화관)'을 추출할 수 있다.In this case, the search word set extraction unit 118 may select '[3 5 8 9 10]' as the first reference vector in which the vector similarity is calculated to the maximum, and the reference vector '[3 5 8] from the search word database 115 9 10], 'search word set 1 (movie theater)' may be extracted as the first search word set corresponding to '.

'검색어 세트 1(영화관)'이 추출되면, 검색어 출력부(119)는 '검색어 세트 1(영화관)'을 구성하는 상기 복수의 제1 검색어들인 '근처 영화관', '영화 추천', '영화 예매', '현재 상영 영화 순위', '개봉 예정 영화'를 추천 검색어로 화면 상에 출력할 수 있다.When 'search word set 1 (movie theater)' is extracted, the search word output unit 119 performs the plurality of first search terms constituting 'search word set 1 (movie theater)', 'nearby cinema', 'movie recommendation', 'movie reservation'. ', 'Currently screened movie ranking', and 'Scheduled movie' can be output on the screen as recommended search terms.

이때, 본 발명의 일실시예에 따르면, 검색어 출력부(119)는 화면 상에 출력되는 상기 복수의 제1 검색어들의 나열 순서를 정하기 위해 중요도 점수 테이블 유지부(121), 중요도 점수 확인부(122) 및 검색어 출력 처리부(123)를 포함할 수 있다. At this time, according to an embodiment of the present invention, the search word output unit 119 includes the importance score table maintenance unit 121 and the importance score check unit 122 to determine the listing order of the plurality of first search words output on the screen. ) and a search word output processing unit 123 .

중요도 점수 테이블 유지부(121)는 상기 복수의 검색어 세트들 각각에 포함된 상기 복수의 검색어들 각각에 대해 미리 지정된 서로 다른 중요도 점수가 대응되어 기록되어 있는 중요도 점수 테이블을 저장하여 유지한다.The importance score table maintaining unit 121 stores and maintains an importance score table in which different importance scores previously designated for each of the plurality of search words included in each of the plurality of search word sets are recorded in correspondence with each other.

여기서 상기 중요도 점수는 개발자에 의해 미리 설정될 수 있는 값으로, 사용자들에게 자주 입력되는 검색어일수록 높은 점수가 부여되도록 설정될 수 있다. Here, the importance score is a value that may be preset by a developer, and may be set so that a higher score is given to a search word frequently input by users.

관련해서, 상기 중요도 점수 테이블에는 하기의 표 3과 같이 정보가 기록되어 있을 수 있다.In relation to this, information may be recorded in the importance score table as shown in Table 3 below.

복수의 검색어 세트들Multiple search term sets 복수의 검색어들multiple search terms 중요도 점수importance score 검색어 세트 1Search term set 1
(영화관)(Movie Theaters) 근처 영화관nearby cinema 22 영화 추천Movie Recommendations 44 영화 예매movie ticket 1One 현재 상영 영화 순위Current movie rankings 55 개봉 예정 영화upcoming movies 33 검색어 세트 2search term set 2
(스포츠 의류 매장)(Sports Clothing Store) 겨울 스포츠 의류winter sports clothing 1One 스포츠 의류 광고sports apparel advertising 33 기능성웨어functional wear 55 요가복yoga clothes 22 스포츠 의류 매장 순위 Sports Clothing Store Ranking 44

이때, 중요도 점수 확인부(122)는 상기 제1 검색어 세트가 추출되면, 상기 중요도 점수 테이블을 참조하여 상기 제1 검색어 세트를 구성하는 상기 복수의 제1 검색어들 각각에 대한 중요도 점수를 확인한다.In this case, when the first search word set is extracted, the importance score check unit 122 checks the importance score for each of the plurality of first search words constituting the first search word set with reference to the importance score table.

중요도 점수 확인부(122)가 상기 복수의 제1 검색어들 각각에 대해 상기 중요도 점수를 확인하면, 검색어 출력 처리부(123)는 상기 복수의 제1 검색어들을 추천 검색어로 화면 상에 출력하되, 상기 복수의 제1 검색어들 각각의 중요도 점수의 크기가 큰 순서에 따라 상기 복수의 제1 검색어들을 일렬로 나열하여 화면 상에 출력할 수 있다.When the importance score check unit 122 checks the importance score for each of the plurality of first search words, the search word output processing unit 123 outputs the plurality of first search words as recommended search words on the screen, but the plurality of The plurality of first search words may be arranged in a line according to the order of increasing importance score of each of the first search words and outputted on the screen.

예컨대, 위 실시예와 같이, 상기 제1 검색어 세트가 '검색어 세트 1(영화관)'이라고 할 때, 중요도 점수 확인부(122)는 상기의 표 3과 같은 상기 중요도 점수 테이블을 참조하여, '검색어 세트 1(영화관)'을 구성하는 상기 복수의 제1 검색어들 각각에 대해 중요도 점수를 확인할 수 있다. 즉, 중요도 점수 확인부(122)는 상기 복수의 제1 검색어들 중 어느 하나인 '근처 영화관'에 대해서 중요도 점수로서 '2'를 확인할 수 있다. For example, as in the above embodiment, when the first search word set is 'search word set 1 (movie theater)', the importance score check unit 122 refers to the importance score table as shown in Table 3 above, and 'search word An importance score may be checked for each of the plurality of first search terms constituting 'set 1 (movie theater)'. That is, the importance score check unit 122 may identify '2' as the importance score for 'near movie theater', which is any one of the plurality of first search terms.

중요도 점수 확인부(122)가 상기 복수의 제1 검색어들 각각에 대해 중요도 점수를 확인하면, 검색어 출력 처리부(121)는 상기 복수의 제1 검색어들을 화면 상에 출력하되, 상기 복수의 제1 검색어들 각각의 중요도 점수의 크기가 큰 순서인 '현재 상영 영화 순위', '영화 추천', '개봉 예정 영화', '근처 영화관', '영화 예매'의 순서로 상기 복수의 제1 검색어들을 일렬로 나열하여 화면 상에 출력할 수 있다. When the importance score check unit 122 checks the importance score for each of the plurality of first search words, the search word output processing unit 121 outputs the plurality of first search words on the screen, but the plurality of first search words The plurality of first search terms are arranged in the order of 'currently showing movie ranking', 'movie recommendation', 'coming soon', 'nearby movie theater', 'movie reservation' in the order of increasing the importance score of each You can list them and print them on the screen.

즉, 위치 기반의 음성 인식 학습모델 데이터를 기초로 사용자의 음성 인식을 수행하는 전자 단말 장치(110)는 상기 제1 검색어 세트를 구성하는 상기 복수의 제1 검색어들 각각에도 중요도 점수를 부여하여 자주 쓰이는 검색어일수록 화면 상단에 표시함으로써, 사용자의 빠른 데이터 검색을 지원할 수 있다. That is, the electronic terminal device 110 that performs the user's voice recognition based on the location-based voice recognition learning model data assigns importance scores to each of the plurality of first search words constituting the first search word set and frequently By displaying more used search terms at the top of the screen, it is possible to support users' quick data search.

음성 검색부(120)는 상기 복수의 제1 검색어들 중 상기 사용자에 의해 제2 검색어가 음성으로 입력됨에 따라 음성 인식부(113)를 통해 상기 제2 검색어에 대한 음성 인식이 완료되면, 인터넷을 통해 상기 제2 검색어에 따른 데이터 검색을 수행하여 검색 결과를 화면 상에 표시한다.The voice search unit 120 performs the voice recognition for the second search word through the voice recognition unit 113 as a second search word among the plurality of first search words is input by voice by the user. data search according to the second search word is performed through the search result, and the search result is displayed on the screen.

예컨대, 전술한 예의 상기 복수의 제1 검색어들인 '현재 상영 영화 순위', '영화 추천', '개봉 예정 영화', '근처 영화관', '영화 예매' 중 어느 하나인 '영화 예매'의 검색어가 상기 사용자에 의해 음성으로 입력됨에 따라 음성 인식부(113)가 '영화 예매'에 대한 음성 인식을 수행하면, 음성 검색부(120)는 인터넷을 통해 '영화 예매'에 따른 데이터 검색을 수행하여 검색 결과를 화면 상에 표시할 수 있다. For example, the search term of 'movie reservation', which is one of the plurality of first search terms in the above example, 'current screening movie ranking', 'movie recommendation', 'scheduled movie', 'nearby cinema', and 'movie reservation' When the voice recognition unit 113 performs voice recognition for 'movie reservation' as the voice is input by the user, the voice search unit 120 searches for data according to 'movie reservation' through the Internet. Results can be displayed on the screen.

도 3은 본 발명의 일실시예에 따른 위치 기반의 음성 인식 학습모델 데이터를 기초로 사용자의 음성 인식을 수행하는 전자 단말 장치의 동작 방법을 도시한 순서도이다.3 is a flowchart illustrating a method of operating an electronic terminal device for performing voice recognition of a user based on location-based voice recognition learning model data according to an embodiment of the present invention.

단계(S310)에서는 음성 인식을 위해 미리 구축된 기본형 음성 인식 학습모델 데이터와 미리 정해진 복수의 위치 정보들 각각에 대응되는 위치별 음성 인식 학습모델 데이터가 저장되어 있는 데이터 저장부를 유지한다.In step S310, a data storage unit in which basic voice recognition learning model data built in advance for voice recognition and voice recognition learning model data for each location corresponding to each of a plurality of predetermined location information is stored is maintained.

단계(S320)에서는 상기 사용자로부터 상기 전자 단말 장치에 내장된 마이크로폰을 통해 음성이 입력되면, GPS를 통해 상기 전자 단말 장치가 위치하는 지점에 대한 제1 위치 정보를 수집한다.In step S320, when a voice is input from the user through a microphone built into the electronic terminal device, first location information on a point where the electronic terminal device is located is collected through GPS.

단계(S330)에서는 상기 복수의 위치 정보들 중 상기 제1 위치 정보에 따른 제1 지점과의 거리가 기설정된 거리 이내인 지점을 포함하는 적어도 하나의 제1 위치 정보가 존재하면, 상기 적어도 하나의 제1 위치 정보 중 상기 제1 지점과의 거리가 가장 가까운 지점을 포함하는 제2 위치 정보를 선택한 후, 상기 제2 위치 정보에 대응되는 제1 위치별 음성 인식 학습모델 데이터를 기초로 상기 사용자의 음성을 인식하고, 상기 복수의 위치 정보들 중 상기 제1 지점과의 거리가 상기 기설정된 거리 이내인 지점을 포함하는 상기 적어도 하나의 제1 위치 정보가 존재하지 않으면, 상기 기본형 음성 인식 학습모델 데이터를 기초로 상기 사용자의 음성을 인식한다.In step S330, if there is at least one piece of first location information including a point within a predetermined distance from the first point according to the first location information among the plurality of location information, the at least one After selecting the second location information including the point closest to the first point from among the first location information, the user's voice recognition learning model data for each first location corresponding to the second location information is selected. When a voice is recognized and the at least one first location information including a point in which the distance from the first point is within the preset distance among the plurality of location information does not exist, the basic speech recognition learning model data based on the user's voice is recognized.

이때, 본 발명의 일실시예에 따르면, 단계(S310)에서는 상기 복수의 위치 정보들 각각에 대응되는 위치별 소음 제거 학습모델 데이터가 추가로 저장되어 있을 수 있고, 단계(S330)에서는 상기 복수의 위치 정보들 중 상기 제1 지점과의 거리가 상기 기설정된 거리 이내인 지점을 포함하는 상기 적어도 하나의 제1 위치 정보가 존재하면, 상기 적어도 하나의 제1 위치 정보 중 상기 제1 지점과의 거리가 가장 가까운 지점을 포함하는 상기 제2 위치 정보를 선택한 후, 상기 제2 위치 정보에 대응되는 제1 소음 제거 학습모델 데이터를 기초로, 상기 사용자의 음성에서 소음을 제거하고, 상기 제2 위치 정보에 대응되는 상기 제1 위치별 음성 인식 학습모델 데이터를 기초로 상기 사용자의 음성을 인식할 수 있다.At this time, according to an embodiment of the present invention, in step S310, noise reduction learning model data for each location corresponding to each of the plurality of location information may be additionally stored, and in step S330, the plurality of If the at least one piece of first location information including a point within the preset distance among location information exists, the distance from the first point among the at least one piece of first location information After selecting the second location information including the nearest point, based on the first noise reduction learning model data corresponding to the second location information, the noise is removed from the user's voice, and the second location information The user's voice may be recognized based on the first location-specific voice recognition learning model data corresponding to .

또한, 본 발명의 일실시예에 따르면, 상기 전자 단말 장치의 동작 방법은 미리 정해진 복수의 오프라인 상점들에 사전 비치되어 있는 복수의 비콘들 각각에 대응하는 비콘 식별자와, 상기 복수의 비콘들 각각이 비치된 상점의 업종 카테고리에 따라 각 비콘에 대응하는 미리 설정된 고유 번호가 서로 대응되어 저장되어 있는 비콘 정보 데이터베이스를 유지하는 단계, 복수의 검색어 세트들(상기 복수의 검색어 세트들 각각은 각 검색어 세트에 대응하는 미리 설정된 주제에 연관된 서로 다른 복수의 검색어들로 구성된 집합을 의미함)과 상기 복수의 검색어 세트들 각각에 대해 미리 설정된 n차원의 기준 벡터가 서로 대응되어 저장되어 있는 검색어 데이터베이스를 유지하는 단계, 상기 사용자에 의해 음성 기반의 정보 검색 기능의 실행 명령이 인가되면, 상기 복수의 비콘들 중 상기 전자 단말 장치에서 수신이 확인되는 비콘 신호를 갖는 n개 이상의 비콘들이 존재하는지 확인하고, 상기 n개 이상의 비콘들이 존재하는 경우, 상기 n개 이상의 비콘들 중 최대 강도의 비콘 신호를 갖는 n개의 비콘들을 순차적으로 선택하고, 상기 비콘 정보 데이터베이스로부터 상기 선택된 n개의 비콘들 각각에 대한 비콘 식별자에 대응되어 저장되어 있는 n개의 고유 번호들을 추출하여, 상기 n개의 고유 번호들을 성분으로 포함하는 n차원의 특성 벡터를 생성하는 단계, 상기 n차원의 특성 벡터가 생성되면, 상기 검색어 데이터베이스에 저장되어 있는 상기 복수의 검색어 세트들 각각에 대한 n차원의 기준 벡터와 상기 n차원의 특성 벡터 사이의 벡터 유사도를 연산하는 단계, 상기 복수의 검색어 세트들 각각에 대한 n차원의 기준 벡터 중 상기 n차원의 특성 벡터와의 상기 벡터 유사도가 최대로 연산되는 제1 기준 벡터를 선택하고, 상기 검색어 데이터베이스로부터 상기 제1 기준 벡터에 대응되어 저장되어 있는 제1 검색어 세트를 추출하는 단계, 상기 제1 검색어 세트가 추출되면, 상기 제1 검색어 세트를 구성하는 복수의 제1 검색어들을 추천 검색어로 화면 상에 출력하는 단계 및 상기 복수의 제1 검색어들 중 상기 사용자에 의해 제2 검색어가 음성으로 입력됨에 따라 단계(S330)를 통해 상기 제2 검색어에 대한 음성 인식이 완료되면, 인터넷을 통해 상기 제2 검색어에 따른 데이터 검색을 수행하여 검색 결과를 화면 상에 표시하는 단계를 더 포함할 수 있다.In addition, according to an embodiment of the present invention, in the method of operating the electronic terminal device, a beacon identifier corresponding to each of a plurality of beacons provided in advance in a plurality of predetermined offline stores, and each of the plurality of beacons are Maintaining a beacon information database in which a preset unique number corresponding to each beacon is stored in correspondence with each other according to the industry category of the store provided, a plurality of search word sets (each of the plurality of search word sets is in each search word set) (meaning a set consisting of a plurality of different search words related to a corresponding preset topic) and maintaining a search word database in which a preset n-dimensional reference vector for each of the plurality of search word sets is stored in correspondence with each other , when an execution command of a voice-based information search function is applied by the user, it is checked whether there are n or more beacons having a beacon signal for which reception is confirmed in the electronic terminal device among the plurality of beacons, and the n When there are more than one beacon, n beacons having the highest intensity beacon signal are sequentially selected from among the n or more beacons, and stored in correspondence with a beacon identifier for each of the selected n beacons from the beacon information database. extracting the n unique numbers and generating an n-dimensional feature vector including the n unique numbers as components. When the n-dimensional feature vector is generated, the plurality of calculating a vector similarity between an n-dimensional reference vector for each of the search word sets and the n-dimensional feature vector; selecting a first reference vector in which the vector similarity is calculated to a maximum, and extracting a first search word set stored in correspondence with the first reference vector from the search word database; Recommend a plurality of first search terms constituting the first search word set When the voice recognition for the second search word is completed through outputting the search word on the screen and as the second search word among the plurality of first search words is inputted by voice by the user, the Internet is accessed The method may further include performing a data search according to the second search word and displaying a search result on a screen through the method.

이때, 본 발명의 일실시예에 따르면, 상기 복수의 검색어 세트들 각각에 대한 n차원의 기준 벡터와 상기 n차원의 특성 벡터 사이의 상기 벡터 유사도의 연산은 상기의 수학식 1에 따라 수행될 수 있다.In this case, according to an embodiment of the present invention, the calculation of the vector similarity between the n-dimensional reference vector and the n-dimensional feature vector for each of the plurality of search word sets may be performed according to Equation 1 above. have.

또한, 본 발명의 일실시예에 따르면, 상기 복수의 제1 검색어들을 추천 검색어로 화면 상에 출력하는 단계는 상기 복수의 검색어 세트들 각각에 포함된 상기 복수의 검색어들 각각에 대해 미리 지정된 서로 다른 중요도 점수가 대응되어 기록되어 있는 중요도 점수 테이블을 저장하여 유지하는 단계, 상기 제1 검색어 세트가 추출되면, 상기 중요도 점수 테이블을 참조하여 상기 제1 검색어 세트를 구성하는 상기 복수의 제1 검색어들 각각에 대한 중요도 점수를 확인하는 단계 및 상기 복수의 제1 검색어들을 추천 검색어로 화면 상에 출력하되, 상기 복수의 제1 검색어들 각각의 중요도 점수의 크기가 큰 순서에 따라 상기 복수의 제1 검색어들을 일렬로 나열하여 화면 상에 출력하는 단계를 포함할 수 있다.In addition, according to an embodiment of the present invention, the outputting of the plurality of first search words as recommended search words on the screen may include different preset search terms for each of the plurality of search words included in each of the plurality of search word sets. storing and maintaining an importance score table in which the importance score is recorded correspondingly; when the first search word set is extracted, each of the plurality of first search words constituting the first search word set with reference to the importance score table checking an importance score for , and outputting the plurality of first search words as recommended search words on the screen, and selecting the plurality of first search words according to the order of increasing importance scores of the plurality of first search words. It may include the step of arranging in a line and outputting it on the screen.

이상, 도 2를 참조하여 본 발명의 일실시예에 따른 위치 기반의 음성 인식 학습모델 데이터를 기초로 사용자의 음성 인식을 수행하는 전자 단말 장치의 동작 방법에 대해 설명하였다. 여기서, 본 발명의 일실시예에 따른 위치 기반의 음성 인식 학습모델 데이터를 기초로 사용자의 음성 인식을 수행하는 전자 단말 장치의 동작 방법은 도 1을 이용하여 설명한 위치 기반의 음성 인식 학습모델 데이터를 기초로 사용자의 음성 인식을 수행하는 전자 단말 장치(110)의 동작에 대한 구성과 대응될 수 있으므로, 이에 대한 보다 상세한 설명은 생략하기로 한다.In the above, the operation method of the electronic terminal device for performing the user's voice recognition based on the location-based voice recognition learning model data according to an embodiment of the present invention has been described with reference to FIG. 2 . Here, the operation method of the electronic terminal device for performing the user's voice recognition based on the location-based voice recognition learning model data according to an embodiment of the present invention is the location-based voice recognition learning model data described with reference to FIG. Since it may correspond to the configuration of the operation of the electronic terminal device 110 that performs the user's voice recognition based on the user's voice recognition, a more detailed description thereof will be omitted.

본 발명의 일실시예에 따른 위치 기반의 음성 인식 학습모델 데이터를 기초로 사용자의 음성 인식을 수행하는 전자 단말 장치의 동작 방법은 컴퓨터와의 결합을 통해 실행시키기 위한 저장매체에 저장된 컴퓨터 프로그램으로 구현될 수 있다.The method of operating an electronic terminal device for performing voice recognition of a user based on location-based voice recognition learning model data according to an embodiment of the present invention is implemented as a computer program stored in a storage medium for execution through combination with a computer. can be

또한, 본 발명의 일실시예에 따른 위치 기반의 음성 인식 학습모델 데이터를 기초로 사용자의 음성 인식을 수행하는 전자 단말 장치의 동작 방법은 컴퓨터와의 결합을 통해 실행시키기 위한 컴퓨터 프로그램 명령 형태로 구현되어 컴퓨터 판독 가능 매체에 기록될 수 있다. 상기 컴퓨터 판독 가능 매체는 프로그램 명령, 데이터 파일, 데이터 구조 등을 단독으로 또는 조합하여 포함할 수 있다. 상기 매체에 기록되는 프로그램 명령은 본 발명을 위하여 특별히 설계되고 구성된 것들이거나 컴퓨터 소프트웨어 당업자에게 공지되어 사용 가능한 것일 수도 있다. 컴퓨터 판독 가능 기록 매체의 예에는 하드 디스크, 플로피 디스크 및 자기 테이프와 같은 자기 매체(magnetic media), CD-ROM, DVD와 같은 광기록 매체(optical media), 플롭티컬 디스크(floptical disk)와 같은 자기-광 매체(magneto-optical media), 및 롬(ROM), 램(RAM), 플래시 메모리 등과 같은 프로그램 명령을 저장하고 수행하도록 특별히 구성된 하드웨어 장치가 포함된다. 프로그램 명령의 예에는 컴파일러에 의해 만들어지는 것과 같은 기계어 코드뿐만 아니라 인터프리터 등을 사용해서 컴퓨터에 의해서 실행될 수 있는 고급 언어 코드를 포함한다.In addition, the operating method of the electronic terminal device for performing the user's voice recognition based on the location-based voice recognition learning model data according to an embodiment of the present invention is implemented in the form of a computer program command for execution through combination with a computer. and may be recorded on a computer-readable medium. The computer-readable medium may include program instructions, data files, data structures, etc. alone or in combination. The program instructions recorded on the medium may be specially designed and configured for the present invention, or may be known and available to those skilled in the art of computer software. Examples of the computer-readable recording medium include magnetic media such as hard disks, floppy disks and magnetic tapes, optical media such as CD-ROMs and DVDs, and magnetic such as floppy disks. - includes magneto-optical media, and hardware devices specially configured to store and execute program instructions, such as ROM, RAM, flash memory, and the like. Examples of program instructions include not only machine language codes such as those generated by a compiler, but also high-level language codes that can be executed by a computer using an interpreter or the like.

이상과 같이 본 발명에서는 구체적인 구성 요소 등과 같은 특정 사항들과 한정된 실시예 및 도면에 의해 설명되었으나 이는 본 발명의 보다 전반적인 이해를 돕기 위해서 제공된 것일 뿐, 본 발명은 상기의 실시예에 한정되는 것은 아니며, 본 발명이 속하는 분야에서 통상적인 지식을 가진 자라면 이러한 기재로부터 다양한 수정 및 변형이 가능하다.As described above, the present invention has been described with specific matters such as specific components and limited embodiments and drawings, but these are provided to help a more general understanding of the present invention, and the present invention is not limited to the above embodiments. , various modifications and variations are possible from these descriptions by those of ordinary skill in the art to which the present invention pertains.

따라서, 본 발명의 사상은 설명된 실시예에 국한되어 정해져서는 아니되며, 후술하는 특허청구범위뿐 아니라 이 특허청구범위와 균등하거나 등가적 변형이 있는 모든 것들은 본 발명 사상의 범주에 속한다고 할 것이다.Therefore, the spirit of the present invention should not be limited to the described embodiments, and not only the claims to be described later, but also all those with equivalent or equivalent modifications to the claims will be said to belong to the scope of the spirit of the present invention. .

110: 위치 기반의 음성 인식 학습모델 데이터를 기초로 사용자의 음성 인식을 수행하는 전자 단말 장치
111: 데이터 저장부 112: 위치 수집부
113: 음성 인식부 114: 비콘 정보 데이터베이스
115: 검색어 데이터베이스 116: 특성 벡터 생성부
117: 벡터 유사도 연산부 118: 검색어 세트 추출부
119: 검색어 출력부 120: 음성 검색부
121: 중요도 점수 테이블 유지부 122: 중요도 점수 확인부
123: 검색어 출력 처리부
131, 132, 133, 134, 135, 136: 복수의 비콘들
141, 142, 143, 144, 145, 146: 복수의 오프라인 상점들110: Electronic terminal device for performing voice recognition of a user based on location-based voice recognition learning model data
111: data storage unit 112: location collection unit
113: speech recognition unit 114: beacon information database
115: search word database 116: feature vector generator
117: vector similarity calculating unit 118: search word set extraction unit
119: search word output unit 120: voice search unit
121: importance score table maintenance unit 122: importance score check unit
123: search word output processing unit
131, 132, 133, 134, 135, 136: a plurality of beacons
141, 142, 143, 144, 145, 146: Multiple offline stores

Claims

An electronic terminal device for performing voice recognition of a user based on location-based voice recognition learning model data,
a data storage unit storing basic voice recognition learning model data built in advance for voice recognition and voice recognition learning model data for each location corresponding to each of a plurality of predetermined location information;
a location collecting unit that collects first location information on a point where the electronic terminal device is located through GPS when a voice is input from the user through a microphone built into the electronic terminal device; and
If there is at least one piece of first location information including a point in which a distance from a first point according to the first location information is within a predetermined distance among the plurality of pieces of location information, the at least one first location information After selecting second location information including a point closest to the first point from among location information, the user's voice is generated based on the first location-specific speech recognition learning model data corresponding to the second location information. If the at least one first location information including a point in which the distance to the first point is within the preset distance among the plurality of location information does not exist, based on the basic speech recognition learning model data a voice recognition unit for recognizing the user's voice with
An electronic terminal device for performing voice recognition of a user based on location-based voice recognition learning model data comprising

According to claim 1,
the data storage unit
Noise removal learning model data for each location corresponding to each of the plurality of location information is additionally stored,
The voice recognition unit
If the at least one piece of first location information including a point in the plurality of pieces of location information having a distance from the first point within the preset distance exists, the first point of the at least one piece of first location information After selecting the second location information including the point closest to the distance to the user, based on the first noise reduction learning model data corresponding to the second location information, the noise is removed from the user's voice, and the second location information is removed. 2 An electronic terminal device that performs voice recognition of a user based on location-based voice recognition learning model data for recognizing the user's voice based on the first position-specific voice recognition learning model data corresponding to location information.

According to claim 1,
A beacon identifier corresponding to each of a plurality of beacons provided in advance in a plurality of pre-determined offline stores, and a preset unique corresponding to each beacon according to the industry category of the store in which each of the plurality of beacons is provided a beacon information database in which numbers correspond to each other and are stored;
a plurality of search word sets, each of which means a set consisting of a plurality of different search terms related to a preset topic corresponding to each set of search terms, and a preset for each of the plurality of search word sets a search word database in which n (n is a natural number of 2 or more) dimensional reference vectors are stored in correspondence with each other;
When a command for executing a voice-based information search function is applied by the user, it is checked whether there are n or more beacons having a beacon signal for which reception is confirmed in the electronic terminal device among the plurality of beacons, and the n or more beacons are present. If there are beacons, n beacons having the maximum intensity beacon signal are sequentially selected from among the n or more beacons, and the beacon information corresponds to the beacon identifier for each of the selected n beacons from the beacon information database. a feature vector generator that extracts n unique numbers and generates an n-dimensional feature vector including the n unique numbers as components;
a vector similarity calculating unit that, when the n-dimensional feature vector is generated, calculates a vector similarity between the n-dimensional reference vector and the n-dimensional feature vector for each of the plurality of search word sets stored in the search word database;
Selecting a first reference vector in which the vector similarity with the n-dimensional feature vector is calculated at a maximum from among the n-dimensional reference vectors for each of the plurality of search word sets, and corresponding to the first reference vector from the search word database a search word set extraction unit for extracting the stored first set of search words;
a search word output unit for outputting a plurality of first search words constituting the first search word set as recommended search words on a screen when the first search word set is extracted; and
When the voice recognition for the second search word is completed through the voice recognition unit as a second search word is inputted by the user from among the plurality of first search words, data search according to the second search word is performed through the Internet. Voice search unit that displays the search results on the screen
An electronic terminal device for performing the user's voice recognition based on the location-based voice recognition learning model data further comprising

4. The method of claim 3,
The calculation of the vector similarity between the n-dimensional reference vector and the n-dimensional feature vector for each of the plurality of search word sets is based on the location-based speech recognition learning model data performed according to Equation 1 below. An electronic terminal device that performs voice recognition of
[Equation 1]

Here, M is the vector similarity between two vectors, S is the cosine similarity between the two vectors, and D is the Euclidean distance between the two vectors.

4. The method of claim 3,
The search term output unit
an importance score table maintaining unit for storing and maintaining an importance score table in which different importance scores previously specified for each of the plurality of search words included in each of the plurality of search word sets are recorded;
an importance score check unit that, when the first search word set is extracted, checks the importance score for each of the plurality of first search words constituting the first search word set with reference to the importance score table; and
outputting the plurality of first search words as recommended search words on the screen, and arranging the plurality of first search words in a line according to the order of increasing importance scores of each of the plurality of first search words and outputting them on the screen Search word output processing unit
An electronic terminal device for performing the user's voice recognition based on the location-based voice recognition learning model data comprising a.

A method of operating an electronic terminal device for performing voice recognition of a user based on location-based voice recognition learning model data, the method comprising:
maintaining a data storage unit storing basic voice recognition learning model data built in advance for voice recognition and voice recognition learning model data for each location corresponding to each of a plurality of predetermined location information;
when a voice is input from the user through a microphone built into the electronic terminal device, collecting first location information on a point where the electronic terminal device is located through GPS; and
If there is at least one piece of first location information including a point in which a distance from a first point according to the first location information is within a predetermined distance among the plurality of pieces of location information, the at least one first location information After selecting second location information including a point closest to the first point from among location information, the user's voice is generated based on the first location-specific speech recognition learning model data corresponding to the second location information. If the at least one first location information including a point in which the distance to the first point is within the preset distance among the plurality of location information does not exist, based on the basic speech recognition learning model data Recognizing the user's voice with
An operating method of an electronic terminal device for performing voice recognition of a user based on location-based voice recognition learning model data comprising a.

7. The method of claim 6,
The step of maintaining the data storage unit
Noise removal learning model data for each location corresponding to each of the plurality of location information is additionally stored,
The step of recognizing the user's voice is
If the at least one piece of first location information including a point in the plurality of pieces of location information having a distance from the first point within the preset distance exists, the first point of the at least one piece of first location information After selecting the second location information including the point closest to the distance to the first location information, based on the first noise reduction learning model data corresponding to the second location information, the noise is removed from the user's voice, 2 Operating method of an electronic terminal device for performing voice recognition of a user based on location-based voice recognition learning model data for recognizing the user's voice based on the first position-specific voice recognition learning model data corresponding to location information .

7. The method of claim 6,
A beacon identifier corresponding to each of a plurality of beacons provided in advance in a plurality of predetermined offline stores, and a preset unique corresponding to each beacon according to the industry category of the store in which each of the plurality of beacons is provided maintaining a beacon information database in which numbers correspond to each other and are stored;
a plurality of search word sets, each of which means a set consisting of a plurality of different search terms related to a preset topic corresponding to each set of search terms, and a preset for each of the plurality of search word sets maintaining a search word database in which n (n is a natural number greater than or equal to 2) dimensional reference vectors are stored in correspondence with each other;
When a command to execute a voice-based information search function is applied by the user, it is checked whether there are n or more beacons having a beacon signal whose reception is confirmed by the electronic terminal device among the plurality of beacons, and the n or more beacons are present. If there are beacons, n beacons having the maximum intensity beacon signal are sequentially selected from among the n or more beacons, and the beacon information corresponds to the beacon identifier for each of the selected n beacons from the beacon information database. extracting the n unique numbers to generate an n-dimensional feature vector including the n unique numbers as components;
when the n-dimensional feature vector is generated, calculating a vector similarity between the n-dimensional reference vector and the n-dimensional feature vector for each of the plurality of search word sets stored in the search word database;
Selecting a first reference vector in which the vector similarity with the n-dimensional feature vector is calculated at a maximum from among the n-dimensional reference vectors for each of the plurality of search word sets, and corresponding to the first reference vector from the search word database extracting the stored first set of search terms;
outputting a plurality of first search words constituting the first search word set as recommended search terms on a screen when the first search word set is extracted; and
As a second search word among the plurality of first search words is input by the user by voice, when voice recognition for the second search word is completed through the step of recognizing the user's voice, the second search word through the Internet Displaying the search results on the screen by performing data search according to the search term
An operating method of an electronic terminal device for performing voice recognition of a user based on the location-based voice recognition learning model data further comprising a.

9. The method of claim 8,
The calculation of the vector similarity between the n-dimensional reference vector and the n-dimensional feature vector for each of the plurality of search word sets is based on the location-based speech recognition learning model data performed according to Equation 1 below. A method of operating an electronic terminal device for performing voice recognition.
[Equation 1]

9. The method of claim 8,
The step of outputting the plurality of first search words as recommended search words on the screen includes:
storing and maintaining an importance score table in which different importance scores previously designated for each of the plurality of search words included in each of the plurality of search word sets are recorded in correspondence with each other;
when the first search word set is extracted, checking the importance score for each of the plurality of first search words constituting the first search word set by referring to the importance score table; and
Outputting the plurality of first search words as recommended search words on the screen, and arranging the plurality of first search words in a line according to the order of increasing importance score of each of the plurality of first search words and outputting them on the screen step
A method of operating an electronic terminal device for performing the user's voice recognition based on the location-based voice recognition learning model data comprising a.

A computer-readable recording medium recording a computer program for executing the method of any one of claims 6 to 10 through combination with a computer.

A computer program stored in a storage medium for executing the method of any one of claims 6 to 10 through combination with a computer.