KR960007129B1

KR960007129B1 - Pause time location determining method of voice synthesis system

Info

Publication number: KR960007129B1
Application number: KR1019930007853A
Authority: KR
Inventors: 이윤근
Original assignee: 엘지전자주식회사; 구자홍
Priority date: 1993-05-07
Filing date: 1993-05-07
Publication date: 1996-05-27

Abstract

extracting morpheme from a syllable of input text; obtaining a probability of pause period in front and rear of the syllable; comparing the probability with a critical value; if the probability is over the critical one, storing the location as an candidate of the pause period; obtaining a probability for the number of syllables located between the candidates; comparing a value obtained in multiplying the probability of pause period and the probability of the number of syllables with the critical value; and if the multiplied value is over the critical one, determining the location as the pause period.

Description

Resting Positioning Method of Speech Synthesis System

제 1 도는 종래의 휴지기 위치 결정 방법을 보인 제어 흐름도.1 is a control flowchart showing a conventional resting position determination method.

제 2 도는 본 발명이 적용되는 음성 합성 시스템의 하드웨어 구성도.2 is a hardware configuration diagram of a speech synthesis system to which the present invention is applied.

제 3 도는 본 발명의 휴지기 위치 결정 방법을 보인 제어 흐름도.3 is a control flowchart showing a pause position determining method of the present invention.

제 4 도는 제 3 도의 상세 흐름도.4 is a detailed flowchart of FIG.

* 도면의 주요부분에 대한 부호의 설명* Explanation of symbols for main parts of the drawings

1 : 중앙 처리 장치 3 : 램1: central processing unit 3: ram

4 : 입력 인터페이스부 5 : 음성 합성부4 input interface 5 speech synthesis unit

본 발명은 음성 합성 시스템에 관한 것으로, 특히 무제한 문장 합성의 경우 합성음의 자연도 향상을 위해 인간의 발음 특성과 유사하게 적절한 위치이에 휴지기를 삽입할때 휴지기의 복합 확률 분포를 이용하여 휴지기의 위치를 자동으로 생성해 주도록 한 음성 합성 시스템의 휴지기 위치 결정 방법에 관한 것이다.The present invention relates to a speech synthesis system. In particular, in the case of unlimited sentence synthesis, the position of the resting period is determined by using the complex probability distribution of the resting period when inserting the resting period at an appropriate position similar to human pronunciation characteristics to improve the naturalness of the synthesized sound. The present invention relates to a resting position determination method of a speech synthesis system that is automatically generated.

일반적으로 음성 합성 시스템의 휴지기 위치 결정 방법은 제 1 도의 흐름도에서와 같다.In general, the resting position determination method of the speech synthesis system is the same as in the flowchart of FIG.

먼저, 입력 문장이 들어오면 어형을 분석하여 어절로 분리하다.First, when an input sentence comes in, the word form is analyzed and separated into words.

다음에, 삽입될 휴지기의 갯수를 계산하는데, 휴지기의 갯수는 한 문장의 길이에 따라 정해지고, 휴지기와 휴지기 사이에 존재하는 어절의 갯수의 평균값(N_word)을 이용하여 계산하며, 이때 평균값(N_word)은 미리 많은 문장을 분석하여 얻은 결과이다.Next, the number of resting periods to be inserted is calculated. The number of resting periods is determined according to the length of one sentence, and is calculated using the average value (N _word ) of the number of words existing between the resting period and the resting period. N _word ) is the result of analyzing many sentences in advance.

예를들어, 한 문장의 어절 갯수가 10개이고 휴지시 사이의 어절 갯수의 평균값(N_word)이 4이면 삽입될 휴지기의 갯수는 2개이다.For example, if the number of words in a sentence is 10 and the average number of words between pauses (N _word ) is 4, the number of pauses to be inserted is two.

휴지기의 갯수가 정해지면 각 어절의 앞과 뒤에 휴지기가 존재할 확률을 계산하고, 이전 단계에서 결정한 휴지기의 갯수만큼 확률이 큰 순서대로 선택하여 휴지기의 위치를 결정한다.When the number of resting periods is determined, the probability of resting periods before and after each word is calculated, and the positions of resting periods are determined by selecting the probabilities in order of the number of resting periods determined in the previous step.

여기서, 각 어절의 앞과 뒤에 휴지기가 존재할 확률은 미리 많은 문장을 분석하여 얻은 결과이다.Here, the probability that there is a pause before and after each word is a result obtained by analyzing many sentences in advance.

그러나 이러한 방식은 휴지기와 휴지기 사이에 존재하는 어절 갯수의 평균값에 의해 문장내에 존재하는 휴지기의 갯수가 정해지므로 휴지기의 존재 확률이 무척 낮을 경우에도 갯수를 맞추기 위해 휴지기가 삽입 되거나 또는 휴지기의 존재 확률이 무척 높을 경우에도 휴지기가 생략되는 현상이 나타나게 되는 문제점이 있었다.However, in this method, the number of pauses in a sentence is determined by the average value of the number of words between the pauses and the pauses, so even when the probability of existence of the pauses is very low, a pause is inserted or a probability of existence of the pauses. There was a problem in that even when a very high pause is appeared.

또, 휴지기외 휴지기 사이의 길이가 너무 길거나 짧게 되는 경우도 발생하게 된다.In addition, a case where the length between the resting period and the resting period becomes too long or short also occurs.

본 발명은 상기와 같은 종래의 문제점을 해결하기 위한 것으로, 본 발명의 목적은 형태소에 따른 휴지기 존재 확률 뿐만 아니라 휴지기 사이의 어절 갯수 확률 분포를 이용해 휴지기 위치를 결정함으로써 휴지기 사이의 길이가 너무 길거나 짧아지는 것을 보완하여 자연스러운 휴지기 정보를 추출할 수 있도록 한 음성합성 시스템의 휴지기 위치 결정 방법을 제공하는데 있다.The present invention is to solve the above conventional problems, the object of the present invention is to determine the position of the resting period using the probability distribution of word number between the resting period as well as the rest period prolonged by the morpheme is too long or short between the resting periods. The present invention provides a method for locating the resting period of a speech synthesis system that can compensate for the loss and extract natural resting information.

이하, 첨부된 제 2 도 내지 제 4 도를 참조하여 본 발명의 실시예를 상세히 설명하면 다음과 같다.Hereinafter, exemplary embodiments of the present invention will be described in detail with reference to FIGS. 2 through 4.

제 2 도는 본 발명이 적용되는 음성 합성 시스템의 하드웨어 구성도이다.2 is a hardware configuration diagram of a speech synthesis system to which the present invention is applied.

이에 도시된 바와같이, 본 발명은 음성 합성에 관한 전체적인 알고리즘을 수행하고 제어를 담당하는 중앙처리 장치(1)와, 중앙처리 장치(1)에서 수행할 알로리즘이 프로그램되어 있는 롬(ROM; 2)과, 중앙처리 장치(1)의 어드레스 지정에 따라 처리중인 데이타 및 알고리즘 수행 결과를 저장하는 램(RAM; 3)과, 중앙처리 장치(1)의 입출력 선택 신호에 따라 입력 무장을 받아들여 중앙처리 장치(1)에 전달하는 입력 인터페이스부(4)와, 중앙처리 장치(1)의 제어를 받아 입력 문장을 음성으로 합성하는 음성 합성부(5)로 구성되어 있다.As shown therein, the present invention relates to a central processing unit (1) which is responsible for performing and controlling the overall algorithm for speech synthesis, and a ROM (ROM) in which an algorithm to be executed in the central processing unit 1 is programmed. And RAM for storing the data being processed and the result of the algorithm execution according to the addressing of the central processing unit 1, and input arming according to the input / output selection signal of the central processing unit 1 An input interface unit 4 to be transmitted to the processing unit 1 and a speech synthesis unit 5 for synthesizing the input sentences into voices under the control of the central processing unit 1 are comprised.

상기와 같이 구성된 음성 합성 시스템에 문장 입력이 들어오면, 이 입력 데이타는 입력 인터페이스부(4)를 통해 중앙처리 장치(1)로 전달되고, 중앙처리 장치(1)는 롬(2)에 저장된 본 발명의 휴지기 생성 알고리즘을 수행하여 입력 문장의 휴지기 위치를 결정한 후 결정된 위치를 램(3)에 저장한다.When a sentence input is input to the speech synthesis system configured as described above, the input data is transmitted to the central processing unit 1 through the input interface unit 4, and the central processing unit 1 stores the bones stored in the ROM 2. After determining the pause position of the input sentence by performing the pause generation algorithm of the present invention, the determined position is stored in the RAM 3.

그러면 음성 합성부(5)는 데이타 버스를 통해 램(3)에 저장된 휴지기 생성 정보를 이용할 수 있다.The speech synthesizer 5 may then use the pause generation information stored in the RAM 3 via the data bus.

제 3 도는 롬에 프로그램되어 있는 본 발명의 휴지기 위치 결정 방법을 보인 제어 흐름도이고, 제 4 도는 그 상세 흐름도이다.3 is a control flowchart showing the resting position positioning method of the present invention programmed in the ROM, and FIG. 4 is a detailed flowchart thereof.

먼저, 입력 인터페이스부(4)를 통해 문장이 입력되면, 중앙처리 장치(1)는 초기화 과정을 수행하여 인덱스(index; i,j)를 0으로 클리어시키고, 형태소 분석부에서 입력된 문자의 각 어절에서의 형태소 정보를 추출한다.First, when a sentence is input through the input interface unit 4, the central processing unit 1 performs an initialization process to clear the index (i, j) to 0, and each of the characters input from the morphological analyzer Extract stemming information from words.

형태소에 따라 해당 어절의 앞 또는 뒤에 휴지기가 존재할 확률(P_p,P_f)이 달라지는데, 상기 휴지기 존재 확률은 미리 테이블에 저장되어 있으므로 이 테이블에 의해 i번째 어절뒤에 휴지기가 존재할 확률(P_f1)과 1₊₁번째 어절 앞에 휴지기가 존재할 확률(P_pi₊₁₎을 구하여 더함으로써 i번째 어절과 i₊₁번째 어절 사이에 휴지기가 존재할 확률(P₁)을 계산한다.According to the morphemes, the probability of having a rest period before or after a word varies (P _p , P _f ). Since the probability of having a rest period is stored in a table in advance, the probability of having a rest period after an ith word by this table (P _f1 ) Calculate the probability (P ₁ ) between the i th word and the i ₊₁ th word by calculating the probability (P _p i ₊₁₎ before and the 1 ₊₁ th word.

그리고 계산된 휴지기 존재 확률(P₁)과 일정 임계치(P_THH)를 비교하여 계산된 확률(P₁)이 임계치(P_THH)를 넘으면 그 어절 인덱스(i)를 램(3)에 저장하고, 어절중 P₁〉P_THH인 어절의 인덱스(j)를 하나 증가시킨 후 다시 어절 인덱스(i)를 하나 증가시킨다. 계산된 휴지기 존재 확률(P₁)이 임계치(P_THH)를 넘지 않으면 바로 어절 인덱스(i)를 하나 증가시키고, 어절 인덱스(i)와 한 문장내의 어절 갯수(M)를 비교하여 모든 어절에 대해 상기 과정들을 반복수행한 후, P₁〉P_THH인 어절 인덱스 갯수(N)를 P₁〉P_THH인 어절 인덱스(j)와 같게 하고, 인덱스(j)를 0으로 클리어시킨다.And a compares the calculated resting presence probability (P ₁₎ and a predetermined threshold value (P _THH) calculates the probability (P ₁₎ is longer than a threshold value (P _THH) stores the Eojeol index (i) in the RAM 3, and The word index j of the word P ₁ > P _THH is increased by one, and then the word index i is increased by one. If the calculated pause existence probability (P ₁ ) does not exceed the threshold (P _THH ), the word index (i) is immediately increased by one, and the word index (i) is compared with the number of words in a sentence (M) for all words. after repeatedly performing the above process, P _1> then a is Eojeol index number (N) equal to the _THH P P _1> P _THH Eojeol the index (j), and clearing the index (j) to zero.

다음에, 중앙처리 장치(1)는 램(3)에 저장된 인덱스(휴지기가 존재할 수 있는 위치의 후보들) 사이의 어절 갯수에 의해 휴지기의 위치를 최종 결정하는 과정을 수행한다.Next, the central processing unit 1 performs a process of finally determining the position of the resting period by the number of words between the indices stored in the RAM 3 (the candidates of the positions where the resting state may exist).

즉, 램(3)에 저장된 인덱스(_Idxj)에 대해_Idxj번째 어절과_Idxj-1번째 어절 사이의 어절 갯수를 계산하여 앞의 휴지기와 현재 휴지기 사이의 어절 갯수(Dpj)를 구하고,_Idxj+1번째 어절과_Idxj번째 어절 사이의 어절 갯수를 계산하여 현재 휴지기와 뒤에 따라오는 휴지기 사이의 어절 갯수(Dfj)를 구한다.That is, the number of words between the _Idxj th word and the _Idxj -1 th word is calculated for the index ( _Idxj ) stored in the RAM 3, and the number of words (Dpj) between the previous pause and the current pause is obtained, and the _{Idxj + 1st} The number of words between the word and the _Idxj word is calculated to find the number of words (Dfj) between the current and subsequent rest.

이때 휴지기와 휴지기 사이의 어절갯수는 일정한 확률 분포를 가지며, 이것은 미리 테이블에 저장되어 있으므로 상기 테이블에 의해 어절 갯수 확률(P_Dj=P_Dfj+P_PDj)을 계산한다.In this case, the word count between the resting period and the resting period has a certain probability distribution, and since it is stored in a table in advance, the word count probability (P _Dj = P _Dfj + P _PDj ) is calculated by the table.

이와 같이 계산된 어절 갯수 확률(P_Dj)에 앞에서 구한 현재 위치에서의 휴지기 존재 확률(P_Idxj)를 곱하고, 곱한 값과 일정 임계치(P_THH)를 비교하여 곱할 값(P_DJ×P_Idxj)이 임계치(P_THH)를 넘으면 이 인덱스 위치에 휴지기가 존재하므로 그 어절 인덱스(_Idxj)를 램(3)에 저장한다.The number of word probabilities (P _Dj ) calculated as above is multiplied by the idle period probabilities (P _Idxj ) obtained at the current position, and the value to be multiplied by comparing the multiplied value with a predetermined threshold (P _THH ) is P _DJ × P _Idxj . When the threshold P _THH is exceeded, a pause exists at this index position, and the word index _Idxj is stored in the RAM 3.

어절 갯수 확률과 휴지기 존재 확률을 곱한 값과 임계값을 비교하는 과정이 끝나면, 어절중 P₁〉P_THH인 어절의 인덱스(j)를 하나 증가시키고, P₁〉P_THH인 어절의 인덱스(j)와 P₁〉P_THH인 어절 인덱스 갯수(N)를 비교하여 모든 휴지기 후보들(j=1~N)에 대해 상기 휴지기 위치결정 과정을 반복 수행한다.After the process of comparing the Eojeol number of probability and resting there multiplied by the probability value with a threshold value, of the P _1> P _THH is increased by one index (j) of Eojeol and, P _1> P _THH index of Eojeol (j Eojeol ) And the resting period positioning process is repeated for all the resting candidates (j = 1 to N) by comparing the number of word indexes (N) where P ₁ > P _THH .

이상에서와 같이 종래에는 휴지기 생성시 형태소에 따른 휴지기 존재 확률만을 이용해 휴지기 위치를 결정한 반면, 본 발명은 그 이외에도 휴지기와 휴지기 사이의 어절 갯수의 확률 분포를 이용해 휴지기 위치를 결정하므로 휴지기 사이의 길이가 너무 길거나 짧은 현상이 발생되는 것을 보완하여 보다 자연스러운 휴지기 정보를 추출할 수 있는 효과가 있다.As described above, while the rest position is determined using only the presence probability of the rest period according to the morphemes during the generation of the rest period, the present invention determines the rest period using the probability distribution of the number of words between the rest period and the rest period. To compensate for the occurrence of too long or short phenomenon, there is an effect that can extract more natural rest information.

Claims

Extracting the morphological information from the input text and each word, obtaining a probability that there is a resting period before and after each word, and storing the position as a resting period candidate position when a threshold is exceeded by comparing the remaining existence probability with a threshold value. And calculating the number of word probabilities between the dormant candidates, and comparing the product of the dormant existence probability and the number of word probabilities for the dormant candidate position with a threshold to determine the position as the dormant presence position when the threshold is exceeded. Resting position positioning method of the speech synthesis system characterized in that made.

The method according to claim 1, wherein the resting period probabilities are calculated by adding the probability that the resting period exists after the current word obtained by using the resting period probability distribution according to the pre-table morpheme and the probability that the resting period exists before the following word. A resting position determination method of a speech synthesis system.

The word count probability according to claim 1, wherein the number of word probabilities calculates the number of words between the previous resting period and the current resting period and the number of words between the resting period following the current resting period, respectively, for the stored resting period candidate positions. A pause position determining method of a speech synthesis system, characterized in that the sum of the probabilities of the two words obtained using a probability distribution is obtained.