KR20050046121A

KR20050046121A - Extraction method of sentence type using the end of a word and pitch

Info

Publication number: KR20050046121A
Application number: KR1020030080118A
Authority: KR
Inventors: 박경현; 박준
Original assignee: 한국전자통신연구원
Priority date: 2003-11-13
Filing date: 2003-11-13
Publication date: 2005-05-18
Also published as: KR100550328B1

Abstract

본 발명은 종결어휘와 피치를 이용한 문형정보 추출방법에 관한 것으로, 자동통역 장치의 음성인식 결과에 따른 종결어휘를 이용하여 1차적으로 문형정보를 추출하고, 2차적으로 음성으로부터 피치를 추출한 후, 종결어휘의 문형별 출현 빈도율과 조합하여 문형정보를 추출함으로서 보다 높은 정확률을 얻을 수 있다.The present invention relates to a sentence information extraction method using a termination vocabulary and a pitch. The sentence information is extracted first using a termination vocabulary according to a speech recognition result of an automatic interpreter, and secondly, a pitch is extracted from speech. Higher accuracy rates can be obtained by extracting sentence information in combination with the frequency of appearance of each type of ending vocabulary.

본 발명은 문형별 종결어휘 사전을 이용하여 음성인식 장치로부터 전달된 음성인식 결과에 따른 문형정보를 추출하는 1차 문형판단 단계와, 상기 1차 문형판단 단계에서 문형정보가 추출되었는가 판단하여, 상기 문형정보가 추출되었을 경우, 언어번역 장치로 상기 문형정보와 음성인식 결과를 전달하는 단계와, 상기 1차 문형판단 단계에서 문형정보가 추출되지 않았을 경우, 입력된 음성으로부터 피치를 검출하고, 문형별 출현 빈도율을 계산하는 단계 및 상기 문형별 출현 빈도율과 피치의 선형조합 방법에 의해 파라메터를 결정하여 문형정보를 추출하는 2차 문형판단 단계로 이루어진다. According to the present invention, a sentence sentence determination step of extracting sentence information based on a speech recognition result transmitted from a speech recognition device using a sentence termination vocabulary for each sentence type and judging whether sentence information has been extracted in the primary sentence determination step is performed. When sentence information is extracted, delivering the sentence information and speech recognition result to a language translation apparatus; and when sentence information is not extracted in the first sentence determination step, the pitch is detected from the input voice and sentence by sentence type Computing the appearance frequency rate and the second sentence determination step of extracting the sentence pattern information by determining the parameters by the linear combination method of the appearance frequency rate and pitch for each sentence type.

Description

Extraction of sentence pattern using final vocabulary and pitch {EXTRACTION METHOD OF SENTENCE TYPE USING THE END OF A WORD AND PITCH}

본 발명은 종결어휘와 피치를 이용하여 문장의 번역에 필요한 문형정보를 추출하는 것으로, 더욱 상세하게는 자동통역 장치에서 문형별 종결어휘 사전만으로 문형정보의 추출이 어려울 경우, 종결어휘의 문형별 출현 빈도율과 피치를 선형 조합하여 문형정보를 추출하도록 하는 종결어휘와 피치를 이용한 문형정보 방법에 관한 것이다.The present invention is to extract the sentence information necessary for the translation of sentences using the ending vocabulary and the pitch, more specifically, when it is difficult to extract the sentence information only by the ending vocabulary dictionary by sentence type in the automatic interpretation device, the appearance of the ending vocabulary by sentence type The present invention relates to a sentence information method using a termination vocabulary and a pitch for extracting sentence information by linearly combining a frequency rate and a pitch.

전자통역 장치는 서로 다른 언어를 사용하는 사람들이 의사 소통을 할 수 있도록 하기 위한 장치이다.Electronic interpreter is a device that enables people who speak different languages to communicate.

특히, 대화용 전자번역 장치는 한 언어로 된 음성을 입력하여 다른 언어로 된 음성으로 번역하여 출력한다.In particular, the conversational electronic translation device inputs a voice in one language, translates it into a voice in another language, and outputs the voice.

이러한, 자동통역 장치는 음성인식 결과를 제 2의 언어로 번역하는데 이때 문형정보는 음성인식 결과의 종결어휘를 이용하여 추출이 가능하다.The automatic interpreter translates the speech recognition result into a second language, and the sentence information can be extracted by using the ending vocabulary of the speech recognition result.

그러나, 종결어휘가 하나의 문형에만 속하지 않는 경우가 많이 때문에 종결어휘만을 이용하여 문형을 판단할 경우, 문형의 정확률이 낮다는 문제점이 있다.However, since the ending vocabulary does not belong to only one sentence type in many cases, when the sentence type is determined using only the ending vocabulary, there is a problem that the accuracy of the sentence type is low.

따라서, 본 발명은 상술한 종래의 문제점을 해결하기 위한 것으로, 본 발명의 목적은 자동통역 장치로 입력된 음성으로부터 문형정보를 추출하기 위하여 1차적으로 문형별 종결어휘 사전을 이용하고, 상기 1차 문형판단 수행결과로 문형을 판단할 수 없는 경우, 상기 입력된 음성으로부터 피치를 추출한 후, 종결어휘의 문형별 출현 빈도율과 조합하여 2차 문형판단을 수행함으로서 보다 높은 정확률의 문형정보를 얻을 수 있도록 하는 종결어휘와 피치를 이용한 문형정보 추출방법을 제공함에 있다. Accordingly, the present invention is to solve the above-mentioned conventional problems, an object of the present invention is to first use the sentence type ending vocabulary dictionary to extract sentence information from the speech input to the automatic interpretation device, the primary If the sentence pattern cannot be determined based on the sentence determination result, the pitch sentence is extracted from the input voice, and the sentence sentence information with higher accuracy can be obtained by performing the secondary sentence determination by combining the appearance frequency rate of each sentence of the ending vocabulary. The present invention provides a method of extracting sentence information using a termination vocabulary and a pitch.

상기와 같은 본 발명의 목적을 달성하기 위한 종결어휘와 피치를 이용한 문형정보 추출방법은, 문형별 종결어휘 사전을 이용하여 음성인식 장치로부터 전달된 음성인식 결과에 따른 문형정보를 추출하는 1차 문형판단 단계와, 상기 1차 문형판단 단계에서 문형정보가 추출되었는가 판단하여, 상기 문형정보가 추출되었을 경우, 언어번역 장치로 상기 문형정보와 음성인식 결과를 전달하는 단계와, 상기 1차 문형판단 단계에서 문형정보가 추출되지 않았을 경우, 입력된 음성으로부터 피치를 검출하고, 문형별 출현 빈도율을 계산하는 단계 및 상기 문형별 출현 빈도율과 피치의 선형조합 방법에 의해 파라메터를 결정하여 문형정보를 추출하는 2차 문형판단 단계로 이루어진다. In order to achieve the object of the present invention as described above, the sentence information extraction method using the ending vocabulary and the pitch, the first sentence sentence type to extract sentence sentence information according to the speech recognition result transmitted from the speech recognition device using the termination vocabulary for each sentence type And determining whether sentence information has been extracted in the first sentence sentence determination step, and when the sentence sentence information is extracted, transmitting the sentence sentence information and a speech recognition result to a language translation device, and the primary sentence sentence determination step. If the sentence information is not extracted in the step, detecting the pitch from the input voice, calculating the appearance frequency rate for each sentence type, and determines the parameters by the linear combination method of the appearance frequency rate and pitch for each sentence type extract sentence information The second sentence sentence determination step is made.

이하, 본 발명에 따른 실시예를 첨부한 도면을 참조하여 상세히 설명하기로 한다.Hereinafter, exemplary embodiments of the present invention will be described in detail with reference to the accompanying drawings.

도 1은 본 발명에 따른 자동통역 장치의 구성도이다.1 is a block diagram of an automatic interpretation device according to the present invention.

도 1에 도시된 바와 같이, 문형판단 장치(20)는 음성인식 장치(10)와 언어번역 장치(30) 사이에 위치하여 상기 음성인식 장치(10)로부터 음성인식 결과와 필요한 경우 음성을 입력받아 문형정보를 추출한 후, 상기 언어번역 장치(30)에 음성인식 결과와 문형정보를 전달한다.As shown in FIG. 1, the sentence determination device 20 is located between the speech recognition device 10 and the language translation device 30 to receive a speech recognition result and, if necessary, speech from the speech recognition device 10. After the sentence information is extracted, the speech recognition result and sentence information are transmitted to the language translation apparatus 30.

이때, 상기 문형판단 장치(10)는 대용량 텍스트 데이터를 바탕으로 미리 생성해 놓은 문형별 종결어휘 사전(21)과, 상기 음성인식 장치(10)로부터 입력받은 음성에 의해 구해진 피치와 문형별 출현 빈도율(22)을 이용하여 문형정보를 추출하게 된다.In this case, the sentence determination apparatus 10 is a sentence type termination vocabulary dictionary 21 previously generated based on a large amount of text data, and the pitch and the appearance frequency for each sentence type obtained by the voice received from the speech recognition apparatus 10. The sentence rate information is extracted using the rate 22.

도 2는 본 발명에 따른 문형정보 추출방법의 흐름도이다.2 is a flowchart of a sentence information extraction method according to the present invention.

도 2를 참조하여 본 발명의 작동순서를 보다 상세하게 설명하면 다음과 같다.Referring to Figure 2 in more detail the operational sequence of the present invention.

우선, 상기 음성인식 장치(10)로부터 나온 음성인식 결과는 대용량 텍스트 데이터(S100)를 바탕으로 미리 생성해 놓은 문형별 종결어휘 사전(21)을 이용하여 1차적으로 문형정보를 추출하게 된다(S101).First, the speech recognition results from the speech recognition apparatus 10 are first extracted from the speech recognition information using the sentence type ending lexicon 21 previously generated based on the large-capacity text data S 100 (S101). ).

상기 1차 문형판단 단계에서 문형정보가 추출되었는가 판단하여(S102), 상기 음성인식 결과에 의해 문형정보가 추출되었을 경우, 상기 언어번역 장치(30)로 상기 문형정보와 음성인식 결과를 전달한다.It is determined whether sentence information is extracted in the first sentence determination step (S102), and when sentence information is extracted by the speech recognition result, the sentence information and the speech recognition result are transmitted to the language translation apparatus 30.

반면, 상기 1차 문형판단 단계에서 문형정보가 추출되지 않았을 경우, 상기 음성인식 장치(10)로부터 입력된 음성에서 피치를 검출하고(S103), 문형별 출현 빈도율(22)을 계산하여 선형조합의 파라메터인 a, b를 구한다(S104).On the other hand, when sentence information is not extracted in the first sentence determination step, the pitch is detected from the voice input from the speech recognition apparatus 10 (S103), and the linear frequency is calculated by calculating the frequency of appearance of each sentence (22). The parameters a and b are obtained (S104).

예컨대, 평서문을 기준으로 했을 때의 상기 a, b를 구하는 식은 다음과 같다.For example, the above formulas for calculating a and b based on the plain text are as follows.

a = 평서문에서의 출현 빈도율 - 기타문형에서의 출현 빈도율a = frequency of appearance in plain text-frequency of appearance in other sentences

b = 1(평서문), -1(기타문형)b = 1 (comparative sentence), -1 (other sentence)

상기 a, b를 구한 후에는 2차 문형판단을 수행하여 문형정보를 추출해 낸다.After obtaining a and b, second sentence sentence determination is performed to extract sentence information.

상기 2차 문형판단은 종결어휘 출현 빈도율(a)과 피치(b)의 선형조합 방법에 의해 수행되며, 상기 선형조합 방법의 식은 다음과 같다.The second sentence sentence determination is performed by a linear combination method of a frequency of occurrence of a termination word (a) and a pitch (b), and the equation of the linear combination method is as follows.

λa +(1-λ)b ≥θλa + (1-λ) b ≥θ

상기 a와 b는 각각 문형별 종결어휘 출현 빈도율과 피치를 나타내고 λ와 θ는 가중치와 임계치를 나타낸다.A and b represent the frequency and pitch of ending vocabulary by sentence type, respectively, and λ and θ represent weights and thresholds, respectively.

상기 가중치와 임계치는 종결어휘 사전과 마찬가지로 대용량 텍스트 데이터를 바탕으로 미리 추정된다.The weights and thresholds are estimated in advance based on a large amount of text data as in the termination vocabulary dictionary.

이와 같이, 상기 문형별 출현 빈도율과 피치의 선형조합 방법에 의해 파라메터를 결정하여 문형정보를 2차적으로 추출한다(S105).In this way, the parameters are determined by the linear combination method of the appearance frequency rate and pitch for each sentence type and secondly, sentence information is extracted (S105).

도 3은 본 발명에 따른 문형별 종결어휘 사전을 생성하기 위한 순서도이다.3 is a flow chart for generating a sentence-terminus termination vocabulary dictionary according to the present invention.

상기 문형정보를 추출하기 위하여 이용되는 문형별 종결어휘 사전(21)생성 방법은 우선, 대용량의 텍스트를 품사 태깅하여(S200) 각 문형별로 분류한다(S201).In the method of generating a sentence-terminated dictionary 21 for each sentence type used to extract the sentence pattern information, first, a large amount of text is part-tagged (S200) and classified for each sentence type (S201).

상기 문형별로 분류된 문장에서 종결어휘를 추출하고, 출현 빈도율(a)을 구한다(S202).The ending vocabulary is extracted from the sentences classified by sentence type, and a frequency of appearance (a) is obtained (S202).

예컨대, 평서문을 기준으로 했을 때 a= 1인 경우는 해당 종결어휘가 항상 평서문에만 쓰였다는 것을 의미하고 a= 0일 경우에는 평서문에는 전혀 쓰이지 않았음을 의미한다.For example, a = 1 means that the final word is always used only in the plain text, and a = 0 means that it is not used in the plain text.

또한, 상기 출현 빈도율(0.5 ≤a≤1)에 따라서 문형별 종결어휘 사전의 후보들을 생성한다(S203).In addition, candidates of the sentence termination ending dictionary for each sentence type are generated according to the appearance frequency ratio (0.5 ≦ a ≦ 1) (S203).

다음으로, 상기 생성된 종결어휘 사전의 후보들에서 상기 문형정보를 추출하기 위해 사용되는 최종 종결어휘 사전을 선택하기 위해 상기 구해진 모든 출현 빈도율에 대해서 상기 생성된 후보 종결어휘 사전을 이용하여 문형을 판단한 후, 정확률을 측정한다.Next, the sentence type is determined using the generated candidate ending vocabulary dictionary for all the appearance frequency ratios obtained to select the final ending vocabulary dictionary used to extract the sentence sentence information from the candidates of the generated ending vocabulary dictionary. After that, the accuracy rate is measured.

그런 다음, 가장 높은 정확률을 가지는 출현 빈도율에 의해 생성된 종결어휘 사전을 최종 종결어휘 사전으로 선택한다(S204).Then, the ending lexicon generated by the appearance frequency rate having the highest accuracy rate is selected as the final ending lexicon (S204).

도 4는 본 발명에 따른 선형조합 파라메터(λ, θ)를 결정하기 위한 순서도이다.4 is a flow chart for determining the linear combination parameters (λ, θ) according to the present invention.

도 4에 도시된 바와 같이, 상기 2차 문형판단 단계에서 문형정보를 추출하기 위한 선형조합 파라미터 결정은, 대용량의 텍스트로부터 종결어휘 기반 문형정보를 추출한다(S300).As shown in FIG. 4, in the second sentence determination step, the linear combination parameter determination for extracting the sentence shape information extracts the termination vocabulary-based sentence shape information from a large amount of text (S300).

즉, 평서문을 기준으로 했을 때 'a = 평서문 출현 빈도율 - 평서문 이외(기타문형)의 출현 빈도율'이 된다.In other words, when the plain text is a reference, 'a = frequency of appearance of the plain text-the frequency of appearance other than the plain text (other sentence type)'.

다음으로, 상기 음성인식 장치에 의해 입력된 해당 음성으로부터 피치를 추출한 후, 상기 피치 기반 문형정보(b)를 추출한다(S301).Next, after the pitch is extracted from the speech input by the speech recognition apparatus, the pitch-based sentence pattern information (b) is extracted (S301).

상기 추출된 값을 바탕으로 가중치(0 ≤a ≤1)와 임계치(-1 ≤a ≤1)에 따라 문형을 판단한 후(S302), 정확률을 측정하여(S303) 가장 높은 정확률을 갖는 가중치(λ)와 임계치(θ)를 선형조합 파라메터로 결정한다(S304).Based on the extracted value, the sentence shape is determined according to the weight (0 ≤ a ≤ 1) and the threshold value (-1 ≤ a ≤ 1) (S302), and then the accuracy is measured (S303). ) And the threshold value (θ) are determined as a linear combination parameter (S304).

결론적으로, 상기 대용량의 텍스트 데이터를 이용하여 종결어휘 사전을 생성하고 선형조합 방법에 의해 파라메터를 결정한 후, 입력된 음성에 대해서 1차와 2차에 걸쳐 보다 정확한 문형정보를 추출하여 상기 언어번역 장치로 전달하게 된다.In conclusion, after generating a lexical dictionary using the large amount of text data and determining a parameter by a linear combination method, more precise sentence pattern information is extracted for the input speech from the first and second languages. Will be sent to.

상기 본 발명에 따른 종결어휘와 피치를 이용한 문형정보 추출방법은 컴퓨터 프로그램으로 제작되어서 하드디스크, 플로피디스크, 광자기디스크, 시디 롬, 롬, 램 등의 기록매체에 저장될 수 있다. The sentence information extraction method using the termination vocabulary and pitch according to the present invention may be produced by a computer program and stored in a recording medium such as a hard disk, a floppy disk, a magneto-optical disk, a CD-ROM, a ROM, or a RAM.

이상에서는 본 발명의 바람직한 실시예에 대하여 도시하고 또한 설명하였으나, 본 발명은 상기한 실시예에 한정되지 아니하며, 청구범위에서 청구하는 본 발명의 요지를 벗어남이 없이 당해 본 발명이 속하는 분야에서 통상의 지식을 가진 자라면 누구든지 다양한 변형 실시가 가능한 것을 물론이고, 그와 같은 변경은 기재된 청구범위 내에 있게 된다.Although the preferred embodiments of the present invention have been illustrated and described above, the present invention is not limited to the above-described embodiments, and the present invention is not limited to the above-described embodiments without departing from the spirit of the present invention as claimed in the claims. Of course, any person skilled in the art can make various modifications, and such changes are within the scope of the claims.

이상에 설명한 바와 같이 본 발명에 의하면, 상기 자동통역 장치로 입력된 음성언어에서 문형별 종결어휘와 함께 피치정보를 이용하여 문형정보를 추출함으로써 문형판단의 정확률을 보다 향상시킬 수 있는 효과가 있다.As described above, according to the present invention, it is possible to further improve the accuracy rate of sentence sentence determination by extracting sentence sentence information using pitch information together with the termination vocabulary for each sentence sentence in the speech language inputted to the automatic interpreter.

도 1은 본 발명에 따른 자동통역 장치의 구성도,1 is a block diagram of an automatic interpretation device according to the present invention,

도 2는 본 발명에 따른 문형정보 추출방법의 흐름도,2 is a flowchart of a sentence information extraction method according to the present invention;

도 3은 본 발명에 따른 문형별 종결어휘 사전을 생성하기 위한 순서도,3 is a flowchart for generating a sentence-terminated dictionary for sentence type according to the present invention;

도 4는 본 발명에 따른 선형조합 파라메터를 결정하기 위한 순서도이다.4 is a flowchart for determining a linear combination parameter according to the present invention.

<도면의 주요 부분에 대한 부호의 설명><Explanation of symbols for the main parts of the drawings>

10 : 음성인식 장치 20 : 문형판단 장치10: speech recognition device 20: sentence judgment device

21 : 문형별 종결어휘 사전 22 : 문형별 출현 빈도율21: Dictionary of ending sentences by sentence type 22: Frequency of appearance by sentence type

30 : 언어번역 장치30: language translation device

Claims

(a) a first sentence sentence determination step of extracting sentence sentence information according to a speech recognition result transmitted from a speech recognition device using a sentence termination vocabulary for each sentence type;

(b) determining whether sentence information is extracted in the first sentence determination step, and when the sentence information is extracted, transferring the sentence information and a speech recognition result to a language translation device;

(c) detecting the pitch from the input voice and calculating the appearance frequency rate of each sentence when sentence information is not extracted in the first sentence determination step; And

(D) sentence pattern information extraction method using the final vocabulary and pitch comprising the step of determining the parameters by the linear combination method of appearance frequency rate and pitch for each sentence pattern to extract sentence information.

The method of claim 1, wherein the determination of the ending type dictionary for each sentence type used in step (a) is performed.

Part-of-speech tagging of a large amount of text to classify each sentence type;

Extracting a ending vocabulary from sentences classified by sentence type, and obtaining an appearance frequency rate;

Generating candidates of a final lexicon according to the appearance frequency rate; And

Determining a sentence type using the candidate ending vocabulary dictionary with respect to the obtained appearance frequency rate, and then determining the ending vocabulary dictionary generated by an appearance frequency rate having a high accuracy rate by measuring an accuracy rate. Sentence information extraction method using termination vocabulary and pitch.

The method of claim 1, wherein the parameter determination process of step (d)

Extracting a termination vocabulary-based sentence pattern information from a large amount of text;

Extracting the pitch-based sentence pattern information after extracting a pitch from the input voice;

And determining the sentence type according to the weighted value and the threshold value of the extracted value, and then measuring the accuracy rate and selecting the weight value and the threshold value having a high accuracy rate.