KR100976999B1

KR100976999B1 - Address Recognition Method and the system by Grammar Moduling

Info

Publication number: KR100976999B1
Application number: KR1020080060940A
Authority: KR
Inventors: 조주형
Original assignee: 주식회사 예스피치
Priority date: 2008-06-26
Filing date: 2008-06-26
Publication date: 2010-08-19
Also published as: KR20100001140A

Abstract

본 발명은 그래마 모듈화를 이용한 주소인식 방법 및 그 장치에 관한 것이다. 보다 상세하게는, ARS 시스템과 같이 음성을 주요 매개로 정보를 전달하는 방법이나 장치에 있어서, 사용자로부터 음성으로 녹음, 입력받은 정보, 특히 주소 정보를 분석하여 인식하기 위한 것으로서, 주소의 단계화 및 모듈화를 통해 그래마 내용을 손쉽게 구성할 수 있도록 하고, 사용자에 의해 음성으로 제공되는 주소를 인식이 용이하도록 하여 인식율을 향상시키는 방법 및 장치에 관한 것이다.The present invention relates to an address recognition method and apparatus therefor using grammar modularization. More specifically, in a method or apparatus for transmitting information through voice as a main system, such as an ARS system, it is used to analyze and recognize information recorded and input by voice from a user, in particular, address information. The present invention relates to a method and apparatus for improving the recognition rate by facilitating the configuration of grammar contents through the modularization and facilitating the recognition of the address provided by the user.

본 발명은 현재 주소의 구조에서 지역단위에 따른 발성을 단계화 및 모듈화를 통해 조합하여 구성하고, 이를 음성 인식 결과에 대비하여, 그 인식율을 높이고 오차율을 감소시키는 효과를 갖는다.The present invention is configured by combining the voice according to the local unit in the structure of the current address through the stepping and modularization, in preparation for the speech recognition result, has the effect of increasing the recognition rate and reducing the error rate.

우편번호, 주소, 그래마 ,모듈화, 음성인식, 자동기록 Postal code, address, grammar, modularization, voice recognition, automatic recording

Description

Address recognition method and device using grammar modularization {Address Recognition Method and the system by Grammar Moduling}

본 발명은 그래마 모듈화를 이용한 주소인식 방법 및 그 장치에 관한 것이다.The present invention relates to an address recognition method and apparatus therefor using grammar modularization.

보다 상세하게는, ARS 시스템과 같이 음성을 주요 매개로 정보를 전달하는 방법이나 장치에 있어서, 사용자로부터 음성으로 녹음, 입력받은 정보, 특히 주소 정보를 분석하여 인식하기 위한 것으로서, 주소의 단계화 및 모듈화를 통해 그래마 내용을 손쉽게 구성할 수 있도록 하고, 사용자에 의해 음성으로 제공되는 주소를 인식이 용이하도록 하여 인식율을 향상시키는 방법 및 장치에 관한 것이다.More specifically, in a method or apparatus for transmitting information through voice as a main system, such as an ARS system, it is used to analyze and recognize information recorded and input by voice from a user, in particular, address information. The present invention relates to a method and apparatus for improving the recognition rate by facilitating the configuration of grammar contents through the modularization and facilitating the recognition of the address provided by the user.

홈쇼핑, 통신회사, 대기업, 은행 등의 콜센터에서, 주문 정보나 기타 정보를 입력하기 위해, 자동 기록 시스템인 ARS를 많이 사용하고 있다. 이러한 자동 기록 시스템에서는 통화자의 주소를 입력받아야 할 일이 많은데, 이러한 단순한 작업을 위해 매번 콜센터 직원을 연결해서 직접 통화하여야 한다면, 이는 사업자는 물론 통화자의 입장에서도 매우 번거로운 일이며, 직원 확충에 따른 인건비나, 직원 연결을 위한 대기시간 증가 등의 문제점이 있었다.In call centers such as home shopping, telecommunications companies, large corporations, and banks, ARS, which is an automatic recording system, is frequently used to input order information or other information. In such an automatic recording system, there is a lot of work to receive the caller's address. If this is a simple task, the call center staff must be connected to each call directly, which is very cumbersome for both the operator and the caller. There was a problem such as an increase in waiting time for connecting employees.

예컨대, 업체에 사용자의 주소를 제공하거나, 기존 DB에 등록되어 있는 주소를 변경하고자 하는 경우, 사용자는 해당 DB를 이용할 수 있는 서비스 제공자와 전화가 연결된 상태에서 주소를 말해주고, 서비스 제공자가 상기 주소를 받아 적은 후 DB에 접속하여 DB에 기록되어 있는 주소 정보를 변경시키거나, 말해주는 주소를 바로 DB에 접속한 상태에서 DB에 기록되어 있는 주소 정보를 변경하는 방식을 사용한다.For example, if a user wants to provide a user's address to a company or change an address registered in an existing DB, the user tells the address while a telephone is connected with a service provider who can use the DB, and the service provider provides the address. After receiving and writing down, access the DB and change the address information recorded in the DB, or change the address information recorded in the DB while the telling address is directly connected to the DB.

이러한 방식은 실시간으로 받아쓰는 인력이 필요하고, 일일이 받아 적어야 하기 때문에 주소를 등록하는데 많이 시간이 소요되거나 입력하는 사람이 잘못 들을 경우 주소가 잘못 기록될 수 있다는 문제점이 있다.Since this method requires a real-time dictator and must write down one by one, there is a problem that a long time is required to register an address or an address may be recorded incorrectly if a person who inputs the address is wrong.

이를 극복하기 위해, 사용자의 음성에 따른 주소를 우선 녹음하고, 서비스 제공자가 녹음된 파일을 일괄적으로 받아 적은 후에 이를 분석하여, DB에 수동 입력하는 방식을 사용하고 있으나, 이 역시 인력이 하나하나 단순 반복 업무를 해야 한다는 점에서, 위의 문제를 원천적으로 해결해주고 있지는 못하다.In order to overcome this problem, the user first records the address according to the user's voice, and the service provider receives the recorded files collectively, analyzes them, and then manually inputs them to the DB. It does not solve the above problem at the source in that it requires simple repetitive tasks.

따라서 녹음되는 주소를 자동적으로 인식하고 DB화 할 수 있는 음성인식 시스템이 요구되고 있으며, 사용자가 시스템에 불러 준 주소 정보를 인식하여 DB에서 자동으로 주소를 찾아내고 매칭시키는 방식이 도입될 필요성이 있다. 이 중 후자의 자동 매칭 방식을 사용하는 경우, 사람이 일일이 받아 적어야 하는 전사량을 줄여 비용을 절감시킬 수 있고, 받아 적는 인력을 줄일 수 있으며, 받아 적는 시간을 단 축할 수 있는 장점이 있다.Therefore, there is a need for a voice recognition system that can automatically recognize the recorded address and make it into a DB, and it is necessary to introduce a method of automatically finding and matching the address from the DB by recognizing the address information that the user has called into the system. . Of these, the latter automatic matching method can reduce costs by reducing the amount of transcription required for a person to write down, save manpower, and reduce the time to write down.

이러한 장점이 있는 자동 매칭 방식을 사용하기 위해서는, 우리나라의 주소 체계에 대한 발성을 제대로 인식하여야 하며, 특유의 주소 체계를 이용하여 그 인식율을 높일 방법이 요구되고 있다.In order to use the automatic matching method having such an advantage, it is necessary to properly recognize the voice of the Korean address system, and a method of increasing the recognition rate by using a unique address system is required.

본 발명은 상기와 같은 종래 기술의 문제점을 음성인식 방법을 통해 해소시키기 위해 안출된 것으로, 음성 주소를 자동으로 인식하는 주소 음성인식 방법, 시스템에 있어서, 우리나라 주소 특유의 단계별 어휘 구성, 단계별 그래마군 조합, 모듈별 그래마군 조합을 통해, 음성인식의 오류를 줄이고 인식율을 향상시킬 수 있는 그래마 모듈화를 이용한 주소인식 방법과 장치를 제공하는 것을 목적으로 한다.The present invention has been made in order to solve the problems of the prior art through the speech recognition method, address speech recognition method, system for automatically recognizing the voice address, step-by-step vocabulary configuration unique to the address of Korea, step by step grammar group It is an object of the present invention to provide an address recognition method and apparatus using a grammar modularization that can reduce the error of speech recognition and improve the recognition rate through a combination and a combination of the grammar groups for each module.

상기와 같은 목적을 달성하기 위하여, 주소인식 방법에 있어서, (1) 우편주소내용을 5 단계 발성 위치별로 분리하는 단계; (2) 상기 발성 위치별 분리 단계에서 발생한 지역단위를 단계별 그래마군을 생성하는 단계; (3) 생성된 상기 단계별 그래마군에서 모듈별 그래마군을 생성하는 단계; 및 (4) 상기 생성된 단계별, 모듈별 그래마군 정보를 주소 인식용 정보로 저장하는 단계;를 포함하여 이루어진 것을 특징으로 하는 그래마 모듈화를 이용한 주소인식 방법을 제공한다.In order to achieve the above object, an address recognition method comprising the steps of: (1) separating the mailing address contents by five steps of speech location; (2) generating a grammar group for each regional unit generated in the separation step for each vocal position; (3) generating a module-specific grammar group from the generated gradual group of grammars; And (4) storing the generated step-by-step, module-specific grammar group information as address recognition information.

또한, 상기 (2) 단계에서, 지역단위를 단계별 그래마군으로 생성할 때 탑다운방식(top-down) 조합내용의 그래마군으로 이루어지도록 하는 것을 특징으로 한다.In addition, in the step (2), it is characterized in that when the regional unit is generated in a step-by-step grammar group consisting of a top-down combination of grammar group.

상기 (2) 단계에서, 상기 단계별 그래마군을 생성할 때, 중간 단어의 생략형 을 포함하는 것을 특징으로 한다.In the step (2), when generating the step-by-step grammar group, it characterized in that it comprises an abbreviation of the middle word.

상기 (2) 단계에서, 상기 단계별 그래마군을 생성할 때, 해당 단계의 끝단 어휘가 포함되도록 하는 것을 특징으로 한다.In the step (2), when generating the step-by-step grammar group, it characterized in that the end vocabulary of the step is included.

상기 (3) 단계에서, 상기 모듈별 그래마군을 생성할 때, 상기 단계별 그래마군에서 정의된 내용을 앞단에 포함되도록 하는 것을 특징으로 한다.In the step (3), when generating the module-specific grammar group, it characterized in that the content defined in the step-by-step grammar group to be included in the front end.

또한, 주소인식 그래마 모듈화 장치에 있어서, 우편주소내용을 5 단계 발성 위치별로 분리하는 분리모듈; 상기 분리모듈에 의해 발성 위치별로 분리된 우편주소내용에서 지역단위를 단계별 그래마군 및 모듈별 그래마군으로 생성하는 조합모듈; 및 상기 조합모듈에 의해 생성된 단계별, 모듈별 그래마군 정보를 주소 인식용 정보로 저장하는 저장모듈;을 포함하여 이루어진 것을 특징으로 하는 그래마 모듈화를 이용한 주소인식 장치를 제공한다.In addition, the address recognition grammar modularization apparatus, comprising: a separation module for separating the postal address contents for each of five levels of vocalization positions; A combination module for generating a regional unit into a grammar group for each step and a grammar group for each module in the mail address contents separated by the utterance location by the separation module; And a storage module for storing the step-by-step, module-specific grammar group information generated by the combination module as information for address recognition.

또한 상기 조합모듈은, 지역단위를 단계별 그래마군으로 생성할 때 탑다운방식(top-down) 조합내용의 그래마군으로 이루어지도록 하는 것을 특징으로 한다.In addition, the combination module, characterized in that to be made in the grammar group of the top-down combination (top-down) when generating the regional unit in the step by step grammar group.

상기 조합모듈은, 상기 단계별 그래마군을 생성할 때, 중간 단어의 생략형을 포함하는 것을 특징으로 한다.The combination module may include abbreviations of intermediate words when generating the step-by-step grammar group.

상기 조합모듈은, 상기 단계별 그래마군을 생성할 때, 해당 단계의 끝단 어휘가 포함되도록 하는 것을 특징으로 한다.The combination module is characterized in that the end vocabulary of the step is included when generating the step-by-step grammar group.

상기 조합모듈은, 상기 단계별 그래마군을 생성할 때, 상기 단계별 그래마군에서 정의된 내용을 앞단에 포함되도록 하는 것을 특징으로 한다.The combination module, when generating the step by step grammar group, characterized in that the content defined in the step by step grammar group to be included in the front end.

발성녹취 정보를 각 단계(depth)별, 모듈별로 인식하여, 그 결과를 제공하는 인식모듈을 더 포함하는 것을 특징으로 한다.The apparatus may further include a recognition module that recognizes the voice recording information for each step and each module and provides a result.

이하, 본 발명이 속하는 기술 분야에서 통상의 지식을 가진 자가 본 발명을 용이하게 이해할 수 있을 정도로 상세히 설명하며, 이 발명의 바람직한 실시 예를 첨부된 도면을 참조로 설명하기로 한다.Hereinafter, the present invention will be described in detail so that those skilled in the art can easily understand the present invention, a preferred embodiment of the present invention will be described with reference to the accompanying drawings.

통상 음성 입출력 기반의 음성인식 기술을 통해 음성서비스를 제공하기 위해 GRXML(GRammar eXtensible Markup Language), GSL(Grammar Specification Language), NGO(Nuance Grammar Object)와 같은 인식 기술을 사용할 수 있는데, 이러한 스펙의 그래마 태그는 음성인식 그래마를 설정한다. 음성인식 그래마는 사용자가 말할 수 있는 후보 어휘를 의미하며, 음성인식은 사용자가 발화한 음성과 가장 비슷한 후보 어휘를 그래마 중에서 찾는 것을 의미한다. 이와 같은 방법은 사용자가 말할 수 있는 어휘에 제한을 둘 수 있을 때 주로 사용되며, 사용자가 어떤 어휘를 발화할지 예상할 수 없는 상황에서는 사용이 어렵다.Generally, to provide a voice service through voice input / output based voice recognition technology, recognition technologies such as GRXML (Grammar eXtensible Markup Language), GSL (Grammar Specification Language) and NGO (Nuance Grammar Object) can be used. The tag sets the voice recognition grammar. Speech recognition grammar refers to a candidate vocabulary that a user can speak, and speech recognition refers to finding a candidate vocabulary most similar to the speech uttered by the user. This method is mainly used when the user can limit the vocabulary that can be spoken, and it is difficult to use in situations where the user cannot predict which vocabulary to speak.

주소 음성 인식의 경우, 어휘의 종류가 한정되어 있기 때문에, 이러한 그래마 태그의 사용이 가능하다. 현재 우리나라의 우편주소 체계 현황을 살펴보면, 15개의 시/도로 구분되어 있으며, 6 단계(Depth) 정도의 지역단위 발성 조합으로 구성된다. 예를 들면 "도-군-읍-리-가-개별주소"로 구성된다. 정확하고 효율적인 우리나라 주소발성 어휘의 음성인식을 위해서는 다음과 같은 그래마 구성요소가 필요하다.In the case of address speech recognition, since the type of vocabulary is limited, such a grammar tag can be used. Looking at the current state of the postal address system in Korea, it is divided into 15 cities / provinces, and it is composed of local level vocal combinations. For example, it consists of "do-gun-eup-ri-ga-individual address". The following grammar components are necessary for accurate and efficient speech recognition of address speech in Korea.

첫째로는, 지역별 그래마 구분이 필요하다. 즉, 시, 도 단위로 그래마를 분리할 필요가 있다. 왜냐하면 현 우편주소 체계에서는 "경기 과천시 갈현동", "서울 은평구 갈현동", "인천 계양구 갈현동" 등과 같이 다른 지역 내 같은 이름의 지역단위가 존재하기 때문에, "경기도 과천시 갈현동"은 "경기도" 주소 그래마 내에서 인식될 필요가 있다.First, we need to distinguish regional grammar. In other words, it is necessary to separate the grammar by the unit of time. Because in the current postal address system, there are local units of the same name in other regions such as "Galhyeon-dong, Gwacheon-si, Gyeonggi-do," and "Galhyeon-dong, Gyeyang-gu, Incheon." It needs to be recognized within.

둘째로는, 인식 그래마의 범위를 축소할 수 있어야 한다. 즉 5단계 발성 단계별로 그래마를 분리할 필요가 있다. 예컨대, "서울시 성북구 동선동4가 13번지..." 의 발성의 경우, "서울시", "성북구", "동선동4가"의 3단계 지역단위 어휘 조합만을 포함하는 그래마로 인식되도록 한다. Secondly, it should be possible to reduce the scope of perception grammar. That is, it is necessary to separate the grammar in five stages of vocalization. For example, in the case of "13, Dongseon-dong 4-ga, Seongbuk-gu, Seoul", the speech is recognized as a grammar that includes only three levels of regional unit vocabulary combinations of "Seoul," "Seongbuk-gu", and "Dongseon-dong 4-ga."

셋째로는, 원 슬롯(one slot) 구성으로 한다. 5단계 발성 그래마 구성시, 각 단계를 멀티 슬롯으로 구성함에 비해, 원 슬롯으로 구성시 트라이그램(trigram)룰이 적용되어 인식율이 배가된다. 예컨대, 원슬롯 구성으로 인해 "용산구 갈월동"이 "용산구 갈현동"으로 오인식되는 것을 방지하게 되는데, 이는 멀티슬롯으로 구현시 "용산구 갈현동"과 같은 어휘 조합이 나타날 수 있으나, 용산구에는 갈현동이 없기 때문이다.Third, one slot configuration is used. In the 5-stage voice grammar configuration, each stage is configured as a multi-slot, while in the one-slot configuration, a trigram rule is applied to double the recognition rate. For example, due to the one-slot configuration, "Yongsan-gu Galwol-dong" is prevented from being mistaken as "Yongsan-gu Galhyeon-dong". .

이러한 필수 요소를 기반으로, 본 발명의 구성을 설명하도록 한다. 본 발명은 이러한 주소 정보를 그래마 모듈화 방법을 통해 인식하는 방법 및 장치에 관한 것이다.Based on these essential elements, the configuration of the present invention will be described. The present invention relates to a method and apparatus for recognizing such address information through a grammar modularization method.

본 발명이 적용된 그래마 모듈화를 위한 방법은 도 1에 도시된 바와 같이, 우편주소 내용을 분리하여, 단계(depth) 및 모듈별 그래마군으로 조합시키는 과정을 거친다.As shown in FIG. 1, the method for applying the grammar module to which the present invention is applied is separated from the postal address content and combined with the depth and module-specific grammar groups.

우편주소 내용은 6단계 정도로 구성되는데, 이 중 상위 5단계(depth)를 각 단계별로 구분한다. 상위 5단계의 주소는 각 단계별 그래마군으로 조합되고(s110), 그 이후에는 각 모듈별 그래마군으로 조합된다(s120). 단계 및 모듈별 그래마군으로 조합된 후에는 주소인식용 정보로 저장된다(s130). 즉, 본 발명은 그래마군 조합에 있어서, 첫째로 단계별, 둘째로 모듈별로 조합하는 방법을 사용한다.The postal address consists of six levels, of which the top five levels are separated for each level. The addresses of the upper five levels are combined into the grammar groups for each step (s110), and then combined into the grammar groups for each module (s120). After being combined into the step and module-specific grammar group is stored as address recognition information (s130). That is, in the grammar group combination, the present invention uses a method of combining first step by step and second module.

본 발명이 적용된 주소인식 그래마 모듈화 생성 장치는 도 2에 도시된 바와 같이 우편주소내용을 5 단계 발송 위치별로 분리하는 분리모듈(100)과, 상기 분리모듈(100)에 의해 발송 위치별로 분리된 우편주소내용에서 지역단위를 단계별, 모듈별 그래마군으로 조합하는 조합모듈(200)과, 상기 조합모듈(200)에 의해 조합된 단계별 그래마군 정보와 모듈별 그래마군 정보를 주소 인식용 정보로 저장하는 저장모듈(300)로 구성된다.The address recognition grammar modularization apparatus to which the present invention is applied is divided into mailing locations by the separation module 100 and the mailing location separated by the separation module 100 as shown in FIG. 2. Combination module 200 for combining the regional unit in the postal address content step by step, module-specific Grammar group, and stores the step-by-step Grammar group information and module-specific Grammar group information combined by the combination module 200 as address recognition information. It consists of a storage module 300.

조합모듈(200)은 지역단위를 단계별 그래마군으로 조합 또는 생성할 때 탑다운방식(top-down) 방식 조합내용의 그래마군에 대해서 이루어지도록 한다. 또한 조합모듈(200)은 중간 단어의 생략형을 포함할 수도 있으며, 해당 단계의 끝단 어휘가 반드시 포함되어 조합되도록 한다. 또한 단계별 그래마군을 조합, 생성할 때, 이전 단계에서 인식된 그래마군이 앞단에 포함되도록 한다.Combination module 200 is to be made for the grammar group of the top-down method combination when generating or combining the regional unit in the step-by-step grammar group. In addition, the combination module 200 may include an abbreviation of an intermediate word, so that the end vocabulary of the corresponding step is necessarily included and combined. In addition, when combining and generating the step-by-step grammar group, the grammar group recognized in the previous step is included in the front end.

분리모듈(110)은 주소를 발성 단계별로 분리하는 기능을 한다. 우리나라 주소는 상위 5단계 + 번지수 또는 세부 주소로 이루어 지는데, 분리모듈은 이 중 상위 5단계에 대하여 각각 분리하는 역할을 하고, 분리된 주소에 대하여 각 단계를 설정하게 된다.Separation module 110 functions to separate the address by the speech step. The address of Korea consists of the top 5 levels + address or detailed address, and the separation module plays a role of separating the top 5 levels, and sets each level for the separated addresses.

분리모듈(110)은 주소어휘단계(depth)를 구분하게 되는데, 발성 단계별 주소 데이터의 단계(depth)를 구분한다. 아울러 숫자와 동 데이터를 분리한다.“상수1동을 "상수", "동", "1"로 구분하는 것과 같이 숫자, 동 데이터를 분리하여 발성 단계별로 주소 어휘의 단계(depth)를 구분한다. The separation module 110 distinguishes the address vocabulary level, and distinguishes the depth of the address data for each speech step. In addition, it separates the number and the same data. The number and the same data are separated by “constant”, “dong”, and “1” to distinguish address depth of address vocabulary. .

조합모듈(120)은 분리된 주소에 대하여, 단계별로 조합하고, 또한 모듈별로 조합하여 그래마 군으로 묶는 기능을 한다. 이 때 단계(depth)와 관련하여 도 3을 참조하여 설명하면, 우리나라의 주소 구조에서 지역단위는 마지막 단계를 제외하고, 최대 상위 5단계(depth)의 발성을 정형화하여 사용한다. 그 내용을 5단계 발성위치별로 분리한 뒤, 각 단계에서 발생 가능한 한 지역단위를 탑-다운방식 조합으로 그래마군을 단계(depth)별 그래마군으로 조합, 생성한다.The combination module 120 combines the separated addresses step by step, and also combines the modules by grammar group. In this case, referring to FIG. 3 with respect to the depth, the regional unit in the address structure of Korea is used by formalizing the vocalization of a maximum of five (depth) except the last stage. After the contents are separated by five stages of vocalization, the grammar group is combined and created as the depth group by the top-down combination as possible.

첫 번째 단계(1 depth)로, 경기도, 서울시와 같이 가장 큰 분류를 지정하고, 두 번째 단계(2 depth)로, 첫 째 단계 부속 주소, 예컨대 경기-수원, 경기-성남을 지정하고, 세 번째로 한 단계 더 작은 부속 주소를 설정하는 식이다.In the first level (1 depth), specify the largest classification, such as Gyeonggi-do and Seoul, and in the second level (2 depth), specify the first-level subaddress, such as Gyeonggi-Suwon, Gyeonggi-Seongnam, and the third. This is a way to set a subaddress that is one step smaller.

이렇게 결정된 그래마 구성은 그래마 범위를 축소할 수 있게 해준다. 이때 탑다운 방식에 따른 단계별 그래마 구성은, 중간 단어의 생략형도 포함하도록 한다. 예컨대 "경북 포항시 북구 청하면" 그래마는 "경북 북구 청하면" 그래마도 포함하도록 한다. 그리고 각 단계별 그래마는 해당 단계의 끝 어휘를 반드시 포함한다. 예를 들어 4단계 그래마군을 지정하는 경우 "경북 포항시 북구 청하면"에 대해서 "경북 포항시 북구"는 포함되지 않는다. 왜냐하면 "경북 포항시 북구"는 3단계(depth) 어휘군이기 때문이다.The grammar configuration thus determined allows the grammar range to be reduced. At this time, the step-by-step grammar configuration according to the top-down method to include the abbreviation of the middle word. For example, the "Gyeongbuk Pohang-si, Buk-gu, Cheonghae" grammar should include the "Gyeongbuk, Buk-gu, Cheonghae" grammar. And each step grammar must include the end vocabulary of that step. For example, if the fourth-level Gramma-gun is designated, "Buk-gu, Pohang-si, Gyeongsangbuk-do" is not included for "Buk-gu, Pohang-si, Pohang-si, Gyeongbuk." Because "Buk-gu, Pohang-si, Gyeongbuk" is a three-degree lexical group.

이렇게 구성된 단계(depth)별 어휘의 구성 예제는 다음과 같다. "경북 포항시 북구 청하면 서정리"와 같이 발성 단계별로 주소 어휘를 단계별로 구분한다.An example of the composition of each vocabulary by depth is as follows. The address vocabulary is categorized by vocalization step like "Chunghae Seojeong-ri, Buk-gu, Pohang-si, Gyeongbuk."

구분division 내용Contents 예제example 1 Depth1 Depth 레벨(01) 내용만을 포함하는 어휘군Vocabulary group containing only level (01) content 경북Gyeongbuk 2 Depth2 Depth 레벨(01+02) 내용을 포함Include level (01 + 02) content 경북 포항시Pohang, Gyeongbuk 3 Depth3 Depth 레벨(01+02+03) 내용을 포함Include level (01 + 02 + 03) content 경북 포항시 북구Buk-gu, Pohang-si, Gyeongbuk 4 Depth4 Depth 레벨(01+02+03+04) 내용을 포함Include level (01 + 02 + 03 + 04) content 경북 포항시 북구 청하면Cheonghae, Buk-gu, Pohang-si, Gyeongbuk 5 Depth5 Depth 레벨(01+02+03+04+05) 내용을 포함Include level (01 + 02 + 03 + 04 + 05) content 경북 포항시 북구 청하면 서정리Seojeong-ri, Cheongmyeon, Buk-gu, Pohang-si, Gyeongbuk

즉, ~~아파트, ~~빌딩과 같은, 세부 주소를 제외하고, 상위 5개 단계로 주소 발성을 구분하여 이것을 위주로 사용하게 된다. 조합모듈(120)은 주소 어휘의 단계(depth)를 상기와 같이 구분한 후, 구분된 주소 어휘에 포함된 숫자 또는 특수문 자 데이터 등에 대해 전처리가 이루어지도록 하고 무규칙 데이터는 삭제시킨다. 즉, "1동", "2동", "1.3동" 등과 같은 데이터는 인식 가능한 형태로 처리하고, "**토지 내", "*블록*구획" 등과 같은 무규칙 데이터는 삭제시킨다.That is, except for detailed addresses such as ~~ apartments, ~~ building, the address vocalization is divided into the top five levels and used mainly for this. The combination module 120 classifies the depth of the address vocabulary as described above, preprocesses the numeric or special character data included in the divided address vocabulary, and deletes the irregular data. That is, data such as "1 building", "2 buildings", "1.3 buildings", and the like are processed in a recognizable form, and irregular data such as "in ** land" and "* block * block" is deleted.

그리고 조합모듈(120)은 표준 지역 주소에 대해, 지역적 또는 역사적인 이유로 발성되어 통계적으로, 실제로 자주 사용하는 지역 주소를 추가시킨다. 예를 들어 표준 지역 주소는 "광주광역시"지만, "광주직할시"와 같은 유사어휘 데이터를 저장모듈(130)의 해당 DB에 추가시킨다.And the combination module 120 is added to the standard local address, the local address is spoken for regional or historical reasons, statistically, often used in practice. For example, the standard local address is "Gwangju Metropolitan City", but adds similar vocabulary data such as "Gwangju Metropolitan City" to the corresponding DB of the storage module 130.

즉, 조합모듈(120)은 분리된 우편번호, 주소록 정보에 대해 표준 지역 주소와 유사 어휘로 이루어진 지역주소를 추가시킨 후, 발성 단계별로 주소 데이터의 단계(depth)를 구분하고, 구분된 주소 데이터에 포함된 숫자 또는 특수문자를 처리하고, 각 단계(depth) 내에 포함된 어휘만을 추출하여 어휘군 리스트를 생성한 후 각 단계(depth)별 어휘군을 이용하여 하위군 최상위 그래마(top-grammar)를 구성하여 그래마 파일로 형성하는 기능을 한다.That is, the combination module 120 adds a local address consisting of a standard local address and a similar vocabulary to the separated postal code and address book information, and then divides the depth of the address data in each utterance step, and separates the address data. Process the number or special characters included in the list, extract the vocabulary included in each depth, and generate the vocabulary group list, and then use the vocabulary group for each level to determine the top-grammar of the subgroup. ) To form a grammar file.

조합모듈(120)은 주소어휘의 단계(depth)를 구분할 때 숫자, 동 데이터를 분리시킬 수 있으며, 특수문자를 처리할 때, 무규칙 데이터를 추출하여 삭제시킬 수 있다. 또한, 조합모듈(120)은 각 단계(depth) 내에 포함된 어휘만을 추출하고 중복 제거한 어휘군 리스트를 생성하는 한편, 단계(depth)별 어휘군을 구성할 때 그래마 명, 그래마 내용 및 인식 결과인 슬롯값을 자동으로 생성한다.Combination module 120 may separate the number, the same data when separating the depth (depth) of the address vocabulary, and when processing the special character, it may extract and delete random data. In addition, the combination module 120 extracts only the vocabulary included in each depth and generates a list of lexical groups which are duplicated, while forming a lexical group for each depth, and includes a grammar name, a grammar content, and a recognition. Automatically generate the result slot value.

그리고 조합모듈(120)은 생성된 각 단계(depth)별 어휘군을 이용하여 하위군 최상위 그래마(top-grammar)로 구성되는 모듈별 어휘군을 구성하고, 모듈별 어휘군 을 구성한 후 그래마 파일로 형성한다. 모듈별 그래마는 단계(depth)별 그래마로 인식한 후 그 밑단을 탐색하는데 사용된다.The combination module 120 constructs a vocabulary group for each module composed of sub-group top-grammars using the generated vocabulary groups for each step, and forms a vocabulary group for each module. Form into a file. The module-specific grammar is used to detect the grammar by depth and then search for the hem.

상기 단계에서 모듈별 어휘군을 구성할 때에도 단계(depth)별 어휘군을 구성할 때와 마찬가지로 그래마명, 그래마 내용 및 슬롯값(return value)을 자동으로 생성시킨다.When constructing the vocabulary group for each module in the above step, as in the case of constructing the vocabulary group for each depth, the grammar name, the grammar contents, and the slot value are automatically generated.

모듈별 어휘군 구성은, 단계별 그래마군의 해당 단계의 어휘로 시작하여, 그 뒷단계 어휘를 미들다운(middle-down) 방식으로 조합한 내용의 그래마 군이다. 도 4를 참조하면, 모듈3은 3단계(depth) 인 경북-포항-북구로 시작하여, 그 하부의 흥해읍, 청하면, 남송리, 마산리 등을 조합한 것을 하나의 모듈로 한다. 모듈4는 4단계인 경부-포항-북구-흥해읍으로 시작하여, 그 하부의 남송리, 마산리 등을 조합한 것이다. 본 발명에서는 3단계부터 모듈을 구성하게 되며, 1, 2단계에서는 모듈을 형성하지 않는다. The vocabulary group for each module starts with the vocabulary of the corresponding step of the step-by-step grammar group, and is a grammar group that combines the later step vocabulary in a middle-down manner. Referring to FIG. 4, module 3 starts with Gyeongbuk-Pohang-Buk-gu, which is a third step, and combines Heunghae-eup, Cheongmyeon, Namsong-ri, Masan-ri, and the like as one module. Module 4 starts with Gyeongbu-Pohang-Buk-gu-Heunghae-eup, which is in four stages, and combines Namsong-ri and Masan-ri below. In the present invention, the module is configured from step 3, and the module is not formed in steps 1 and 2.

즉, 모듈별 그래마 구성은 상기 단계별 그래마군에서 정의된 내용을 앞단에 포함하는 또 하나의 그래마군을 생성하며, 이를 모듈별 그래마라 한다. 예컨대 3단계(depth)를 이용, "부산-금정구-부곡일동"을 인식 한후에, 모듈3그래마는, 부산-금정구-부곡일동 삼팔팔 다시 이십구번지와 같이 3단계 내용을 앞단에 포함하고 4단계에 적용될 어휘군을 형성한다. 모듈별로 그래마를 구성하므로 그래마 범위의 축소를 가져온다. 모듈별 그래마 구성과 그에 따른 선택을 모듈 순으로 나타내면 아래의 표와 같으며, L01, L02로 표시된 것은 각 주소별 단계를 의미한다.That is, the module-specific grammar configuration generates another grammar group including the contents defined in the step-by-step grammar group at the front end, and this is called the module-specific grammar. For example, after recognizing “Busan-Geumjeong-gu-Bugok-il” using 3 steps, the module 3 graph includes the contents of step 3 at the front end, such as Busan, Geumjeong-gu, Bugokil-dong, Sampalpal, 20, and 4 steps. Form a group of vocabulary to be applied. Grammar is organized by module, resulting in a reduction in the range of the grammar. The module configuration and the selection according to the module are shown in the order of the module as shown in the table below, and the items indicated by L01 and L02 indicate the steps for each address.

구분division 내용Contents 예제example 모듈2Module 2 L01_L02+(ex) 포함With L01_L02 + (ex) 경북 포항시 북구 청하면 서정리...
남구 동해면 발산리...Seojeong-ri, Cheongmyeon, Buk-gu, Pohang, ...
Namsan Donghae-myeon Balsan-ri ... 모듈3Module 3 L01_L02_L03+(ex) 포함With L01_L02_L03 + (ex) 경북 포항시 북구 청하면 서정리...
흥해읍 남송리...Seojeong-ri, Cheongmyeon, Buk-gu, Pohang, ...
Namsong-ri, Heunghae-eup ... 모듈4Module 4 L01_L02_L03_L04+(ex) 포함With L01_L02_L03_L04 + (ex) 경북 포항시 북구 청하면 서정리...
월포리...Seojeong-ri, Cheongmyeon, Buk-gu, Pohang, ...
Wolpori ... 모듈5Module 5 L01_L02_L03_L04_L05+(ex) 포함With L01_L02_L03_L04_L05 + (ex) 경북 포항시 북구 청하면 서정리 A빌딩
B아파트A Building, Seojeong-ri, Cheong-myeon, Buk-gu, Pohang-si, Gyeongbuk
B apartment

본 발명은 3단계부터 모듈을 구성하여 사용하므로, 상기 모듈2는 사용하지 아니하며, 모듈3, 4, 5만을 사용한다. 모듈 3을 보면, "북구 청하면~, 북구 흥해읍~" 등의 모듈 어휘군을 갖게 된다. 결국 제공된 주소 정보는 상기와 같은 과정과 형태를 따라서, 각 단계 및 모듈 형태로 조합되고, 그래마 파일로 저장된다. Since the present invention configures and uses the module from step 3, the module 2 is not used, and only modules 3, 4, and 5 are used. Looking at Module 3, you will have a module vocabulary group such as "Buk-gu Cheonghae ~, Buk-gu, Heunghae-eup ~". Eventually, the provided address information is combined in each step and module form according to the above process and form, and stored as a grammar file.

그리고 상기 단계에서 생성된 그래마 파일(*.grammer)은 단계(depth)별, 모든 어휘 포함별, 지역별, 최상위 그래마(top grammar) 내용을 포함하고 있다.The grammar file (* .grammer) generated in the above step includes contents by depth, all vocabulary inclusions, regions, and top grammars.

조합모듈(120)에 의해 생성된 그래마 군은 하기 표와 같다.The grammar group generated by the combination module 120 is shown in the table below.

그래마군Yeah. 종류 Kinds 발생 가능한 최대 Maximum possible 그래마군Yeah. 개수 O(n) Count O (n)
설명Explanation
파일군Pile 내용 Contents
단계(depth)별
그래마군
By depth
Yeah. 55 모든 지역의 내용을 포함하는 각 단계(depth)별 그래마군Grammar groups for each depth containing all local content Depth01.h
Depth02.h, ....Depth01.h
Depth02.h, .... 5*155 * 15 각 15개 지역별로 구분된 각 단계(depth)별 그래마군Grammar groups for each level divided by 15 regions Depth01_강원.h...
Depth01_Gangwon.h ...
1One 모든 단계(depth)별 내용을 포함하는 그래마군A group of grammars containing content for every depth A.grammarA.grammar 1*151 * 15 모든 단계(depth)별 내용을 포함하고, 지역별로 구분된 그래마군Grammar groups, broken down by region, with content for every depth A.grammarA.grammar
모듈별
그래마군
Per module
Yeah. 3*n3 * n 모든 지역의 내용을 포함하는 각 모듈별 그래마군Grama-group for each module, containing all local content A.grammarA.grammar 3*n*153 * n * 15 각 지역별로 구분된 각 모듈별 그래마군Grama group for each module divided by region Module03.h
Module04.hModule03.h
Module04.h ※ n= 발생가능하고 중복 제거된 어휘의 조합 개수N = number of combinations of vocabulary that can be generated and deduplicated

저장모듈(130)은 조합모듈(120)의 제어에 응하여 상기 지역 주소의 유사어휘 및 그래마 파일을 DB 형태로 저장한다.The storage module 130 stores the similar vocabulary and the grammar file of the local address in a DB form under the control of the combination module 120.

인식모듈(140)은 발성 녹취된 정보를 인식하고, 이를 저장모듈(130)의 그래마 파일 DB와 비교하고, 이 중 어느 특정 데이터로 선정하여 인식결과를 적용하는 역할을 한다.The recognition module 140 recognizes the voiced recorded information, compares it with the grammar file DB of the storage module 130, and selects any specific data to apply the recognition result.

그래마 단계화 및 모듈화 이후에 이를 실제적으로 인식하여 적용하는 방법에 대해 도 5를 참조하여 설명하도록 한다. 모든 주소 음성 파일은 총 4번의 음성인식을 수행하게 된다. A method of practically recognizing and applying this after grammar stepping and modularization will be described with reference to FIG. 5. All address voice files will perform a total of four voice recognitions.

즉 발성녹취 정보를 읽어들여서 3단계(depth)별 그래마를 인식(S200)하고, 인식결과를 체크하여 인식이 가능한지를 판단(S210)한다. 판단 결과 인식이 가능하면 다음 단계의 인식을 시도하고, 인식이 불가능하면 과정을 종료한다.That is, by reading the voice recording information, it recognizes the grammar for each step (depth) (S200), and checks the recognition result to determine whether the recognition is possible (S210). If it is possible to recognize the result of the determination, the recognition of the next step is attempted. If the recognition is not possible, the process is terminated.

전 단계의 인식이 가능한 경우에는 4단계 그래마군으로 인식을 시도하게 된다(S220). 마찬가지로 인식 가능 여부를 판단하고(S230), 다음 단계로의 진행을 결정한다.If it is possible to recognize the previous step is to try to recognize the four-step grammar group (S220). Similarly, it is determined whether the recognition is possible (S230), and proceeds to the next step.

4단계 그래마군으로 인식이 가능하면, 이번에는 5단계 그래마군으로 인식을 시도한다(S240). 마찬가지로 인식 가능 여부를 판단하고(S250), 다음 단계로의 진행을 결정한다.If it is possible to recognize the fourth stage grammar group, this time, it tries to recognize the fifth stage grammar group (S240). Similarly, it is determined whether or not it can be recognized (S250), and the process proceeds to the next step.

5단계 그래마군으로 인식이 가능한 경우 3단계별, 4단계별, 5단계별 그래마군의 내용을 모두 포함한 그래마 군으로 인식을 시도하며(S260), 이 이후에 최종적으로 이전의 인식 결과 중 인식 정확도가 가장 높은 결과를 선택하게 된다(S270).If it is possible to recognize the 5th stage grammar group, it tries to recognize it as the grammar group including all contents of the 3rd stage, 4th stage, and 5th stage grammar group (S260), after which the recognition accuracy is the best among the previous recognition results. A high result is selected (S270).

이후 위 4번의 인식과정에서 얻어진 단계 그래마 내용과 매칭되는 모듈 그래마에서만 인식을 수행하여 매칭되는 모듈 그래마를 인식하도록 하고(S280), 최종 인식결과를 적용하도록 한다. 한편, 위 각 단계별 과정에서 인식이 불가능한 경우에는 절차를 종료하게 된다.Thereafter, recognition is performed only on the module grammar matching the content of the step grammar obtained in the above four recognition processes to recognize the matching module grammar (S280), and the final recognition result is applied. On the other hand, if the recognition is not possible in each of the above steps, the procedure is terminated.

도 6은 본 발명에 의해 자동 생성된 파일의 내용 구조를 나타내는 도면이다. 도 6을 참조하면, "#include 상위단계 그래마 파일"은 파일간 참조시 참조파일이 명시되는 부분이고, Depth/Module 그래마명 이하의 부분은, 해당 그래마 군의 조합 내용 및 인식결과 내용, 슬롯값을 포함한다. 조합 내용은 상세 그래마명으로 구성되며, 상세 그래마명의 정의 및 어휘 내용은 해당 그래마 파일의 하단에 기술된다. 인식결과 내용은 인식 주소내용 및 어휘 검색 그래마명의 조합으로 구성된다. 인식결과 내용에 다음 하위단계 그래마명이 포함됨으로써, 인식 후 다른 검색절차 없이 바로 해당 하위단계 모듈로 검색이 가능하다.6 is a diagram showing the content structure of a file automatically generated by the present invention. Referring to FIG. 6, the "#include higher level grammar file" is a part in which a reference file is specified when referencing between files, and the part below the Depth / Module grammar name is a combination content of the corresponding grammar group, a content of a recognition result, Contains the slot value. Combination contents are composed of detailed grammar names, and definition and lexical contents of detailed grammar names are described at the bottom of the corresponding grammar file. The recognition result contents are composed of a combination of the recognition address contents and the lexical search grammar name. Recognition result includes the next lower level grammar name, so that it can be searched by the lower level module without any other search procedure after recognition.

"상세그래마명" 이하의 부분은, 그래마군의 조합내용을 구성하는 상세그래마명 및 그 내용이 정의된다. 여기서 정의된 상세그래마명 및 그 내용은 해당단계의 그래마 군에서 뿐 아니라 하위단계의 또 다른 그래마군에서도 역시 이 내용을 참조하게 된다.In the part below "Detailed Grammar Name", the detailed Grammar Name and its contents constituting the combination content of the Grammar Group are defined. The detailed grammar name and its contents defined here refer to this content not only in the grammar group of the stage but also in another grammar group of the lower stage.

이와 같이 그래마를 자동생성하는 방법 및 프로세스에 따라, 보다 손쉽게 그래마를 생성할 수 있게 되는데, 각 지역별로 그래마를 구분하는 방식에 따르며, 기존의 인식율에 비해 높은 수치를 나타내게 된다. 이는 그래마로 단계 지워진 지역 이하에 존재하는 주소의 어휘만을 갖도록 범위를 제한하여, 실제 존재하지 않는 어휘로의 오인식이 줄어들기 때문이다.According to the method and process of automatically generating grammar, it is possible to generate grammar more easily. According to the method of classifying grammar for each region, it shows a higher value than the existing recognition rate. This is because the range is limited to only the vocabulary of addresses that exist below the gradated region, thereby reducing the misperception of the vocabulary.

이상, 본 발명의 바람직한 실시 예에 대하여 상세히 기술하였지만, 본 발명이 속하는 기술 분야에 있어서 통상의 지식을 가진 사람이라면, 첨부된 청구 범위에 정의된 본 발명의 정신 및 범위를 벗어나지 않으면서 본 발명을 여러 가지로 변형 또는, 변경하여 실시할 수 있음을 알 수 있을 것이다.As mentioned above, although preferred embodiments of the present invention have been described in detail, those skilled in the art to which the present invention pertains may make the present invention without departing from the spirit and scope of the present invention as defined in the appended claims. It will be appreciated that various modifications or changes can be made.

도 1은 본 발명에 따른 주소인식 그래마 모듈화 방법을 설명하기 위한 도면이다.1 is a view for explaining the address recognition grammar modularization method according to the present invention.

도 2는 본 발명에 따른 주소인식 그래마 모듈화 장치를 설명하기 위한 도면이다.2 is a view for explaining the address recognition grammar modularization apparatus according to the present invention.

도 3은 본 발명에 적용된 단계(depth)별 그래마의 구성을 설명하기 위한 도면이다.3 is a view for explaining the configuration of the grammar for each step (depth) applied to the present invention.

도 4는 본 발명에 적용된 모듈별 그래마 구성을 설명하기 위한 도면이다.4 is a diagram for explaining a module-specific grammar configuration applied to the present invention.

도 5는 본 발명에 따라 생성된 그래마를 인식하는 방법을 설명하기 위한 도면이다.5 is a view for explaining a method for recognizing a grammar generated according to the present invention.

<도면의 주요 부분에 대한 부호 설명><Description of the symbols for the main parts of the drawings>

100 : 분리모듈100: separation module

200 : 조합모듈200: combination module

300 : 저장모듈300: storage module

400 : 인식모듈400: recognition module

Claims

In the address recognition method,

(1) dividing the mailing address contents into five speech positions;

(2) generating a grammar group for each regional unit generated in the separation step for each vocal position;

(3) generating a module-specific grammar group from the generated gradual group of grammars; And

(4) storing the generated grammar group information for each step as address recognition information; and a method for address recognition using grammar modularization, comprising: a.

The method of claim 1,

In the step (2), when generating the area unit in the step by step grammar group address recognition method using a grammar modularization, characterized in that the top-down combination consisting of the grammar group.

The method according to claim 1 or 2,

In the step (2), when generating the step-by-step grammar group, address recognition method using a grammar modularization, characterized in that it comprises an abbreviation of the intermediate word.

The method according to claim 1 or 2,

In the step (2), when generating the step-by-step grammar group, address recognition method using a grammar modularization, characterized in that the end vocabulary of the step is included.

The method according to claim 1 or 2,

In the step (3), when generating the module-specific grammar group, address recognition method using a grammar modularization, characterized in that the content defined in the step-by-step grammar group to be included in the front end.

In the address recognition grammar modular device,

Separation module for separating the postal address content by five levels of utterance position;

A combination module for generating a regional unit into a grammar group for each step and a grammar group for each module in the mail address contents separated by the utterance location by the separation module; And

Address recognition device using a grammar modularization comprising a; a storage module for storing the step-by-step, module-specific grammar group information generated by the combination module as information for address recognition.

The method of claim 6,

The combination module is an address recognition device using a grammar modularization, characterized in that when generating a regional unit in a step-by-step grammar group consisting of a grammar group of top-down combination content.

The method of claim 6,

The combination module, when generating the step-by-step grammar group, address recognition device using a grammar modularization, characterized in that it comprises an abbreviation of the intermediate word.

The method of claim 6,

The combination module, when generating the step-by-step grammar group, address recognition device using a grammar modularization characterized in that the end vocabulary of the step is included.

The method of claim 6,

The combination module, when generating the step by step grammar group, address recognition device using a grammar modularization, characterized in that the content defined in the step by step grammar group to be included in the front end.

The method of claim 6,

Address recognition device using a grammar modularization characterized in that it further comprises a recognition module for recognizing the voice recording information for each step (depth), each module, and provides the result.