JP2573565B2

JP2573565B2 - Conversion device for word spelling to phonetic symbols

Info

Publication number: JP2573565B2
Application number: JP60271261A
Authority: JP
Inventors: 達郎松本
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 1985-12-02
Filing date: 1985-12-02
Publication date: 1997-01-22
Anticipated expiration: 2012-01-22
Also published as: JPS62131295A

Description

【発明の詳細な説明】〔概要〕入力される単語内の綴り字列毎にアクセントルール格
納部から読み出されたアクセントルールとのマッチング
を取ってアクセントを有する綴り字にアクセント識別情
報を付加し、アクセント識別情報が付加された未変換の
綴り字列毎に発音ルール格納部から読み出された発音ル
ールに基づいて発音記号に変換する。かくすることによ
り、アクセントを有する単語の正しい発音記号への変換
を行うことができる。DETAILED DESCRIPTION OF THE INVENTION [Summary] For each spelling in an input word, matching is performed with an accent rule read from an accent rule storage unit to add accent identification information to a spelling having an accent. For each unconverted spelling string to which accent identification information has been added, the spelling character string is converted into a phonetic symbol based on the pronunciation rule read from the pronunciation rule storage unit. By doing so, it is possible to convert an accented word into a correct phonetic symbol.

[Industrial applications]

本発明は単語綴りの発音記号への変換方式に関し、更
に詳しく言えば単語の発音記号への変換に際してアクセ
ント識別情報をその仲立ちに参与せしめた単語綴りの発
音記号への変換方式に関する。The present invention relates to a method for converting word spelling to phonetic symbols, and more particularly, to a method for converting word spelling to phonetic symbols in which accent identification information is involved in mediation when converting words to phonetic symbols.

音声合成においては、与えられる単語の発音記号を用
いる技法がある。このような発音記号は入力される単語
の綴り文字列がどのようになっているかをみてそこから
誘導されるが、そのような技法がどういう手順を踏むに
しろ、その導かれた発音記号が正しくなっていることが
望まれることは勿論である。In speech synthesis, there is a technique using a phonetic symbol of a given word. These phonetic symbols are derived from the spelling of the input word by looking at the spelling, but no matter what procedure such a technique takes, the phonetic symbols that are derived are correct. It is needless to say that it is desirable.

[Conventional technology]

上述のような単語綴りの発音記号への変換に用いられ
る一般的な方法として考えられるのは最長一致法であ
る。この方法は綴り字のみに着目し、単語の左から順に
適用可能な最も長い発音規則によって綴り字を発音記号
に変換する方法である。The longest matching method is considered as a general method used for converting word spelling into phonetic symbols as described above. This method focuses only on the spelling, and converts the spelling into phonetic symbols according to the longest pronunciation rule applicable from the left of the word.

[Problems to be solved by the invention]

この最長一致法を独語のstehenという単語に適用した
例を第12図（Ａ）に示す。この図に示す例から分かるよ
うに、最長一致法は独語等には有効である。FIG. 12A shows an example in which the longest match method is applied to the German word stehen. As can be seen from the example shown in this figure, the longest match method is effective for German and the like.

ところが、英語のようにアクセントの有無で発音が変
わって来る言語の発音記号を最長一致法の下で決定する
のは不適当である。例えば、単語のauthor〔〕とauthority〔〕という２つの単語に最長一致法を適用した場合を第12
図（Ｂ）に示すが、authorは第１音節に、authorityは
第２音節に、第１アクセントがあり、これら単語の第１
音節、第２音節の発音は異なっている。従って、最長一
致法では、アクセントの情報は全く与えられないので、
正しい発音記号を得ることができない。このような場合
にも、最長一致法で正しい発音記号を得ようとするなら
ば、単語自体にルールを持つしかなく、ルール辞書容量
の膨大化は避けられない。However, it is inappropriate to determine the phonetic symbols of languages whose pronunciation changes depending on the presence or absence of accents, such as English, using the longest matching method. For example, the word author [ ] And authority [ ] When the longest match method is applied to the two words
As shown in Figure (B), author has the first syllable, authority has the second syllable, and the first accent.
The pronunciation of the syllable and the second syllable are different. Therefore, in the longest match method, no accent information is given.
I can't get the correct phonetic symbols. Even in such a case, if a correct phonetic symbol is to be obtained by the longest match method, there is no other way than to have rules for the words themselves, and an enormous increase in rule dictionary capacity is inevitable.

本発明は、斯かる問題点に鑑みて創作されたもので、
単語中のアクセントを有する綴り字にアクセント識別情
報を与えて単語の正しい発音記号を得ることのできる単
語綴りの発音記号への変換装置を提供することを目的と
する。The present invention has been created in view of such problems,
It is an object of the present invention to provide an apparatus for converting a word spelling into a phonetic symbol, which can give a correct phonetic symbol of a word by giving accent identification information to a spelling having an accent in the word.

[Means for solving the problem]

第１図は、本発明の原理ブロック図を示す。この図に
おいて、１は所要数の綴り字から成る単語を入力する単
語入力手段である。４はアクセントルール格納部であ
る。２はアクセント識別情報付加手段で、これは入力さ
れた単語内の各単語綴り字以降の綴り字列毎にアクセン
トルール格納部４からアクセントルールを読み出し、当
該綴り字列と読み出されたアクセントルールとのマッチ
ングをとり、アクセントを有する綴り字にアクセント識
別情報を付加するものである。FIG. 1 shows a principle block diagram of the present invention. In this figure, 1 is a word input means for inputting a word composed of a required number of spellings. 4 is an accent rule storage unit. Numeral 2 is accent identification information adding means, which reads an accent rule from the accent rule storage unit 4 for each spelling character string after each word spelling character in the input word, and reads the spelling character string and the read out accent rule. And adds accent identification information to the spelling having an accent.

５は発音ルール格納部である。３は綴り・発音記号変
換部で、アクセント識別情報が付加された単語綴り字の
先頭から未変換のアクセント識別情報が付加された単語
綴り字列毎に発音ルール格納部（５）から発音ルールを
読み出し、未変換のアクセント識別情報が付加された単
語綴り字列を読み出された発音ルールに基づいて発音記
号に変換するものである。Reference numeral 5 denotes a pronunciation rule storage unit. Numeral 3 is a spelling / pronunciation symbol conversion unit which stores pronunciation rules from the pronunciation rule storage unit (5) for each word spelling string to which unconverted accent identification information has been added from the beginning of the word spelling to which accent identification information has been added. The word spelling string to which the read and unconverted accent identification information is added is converted into phonetic symbols based on the read pronunciation rules.

[Action]

単語入力手段１から入力された単語は、各単語綴り字
以降の単語綴り字列毎に当該単語綴り字列とアクセント
ルール格納部４から読み出されたアクセントルールとの
マッチングがアクセント識別情報付加手段２で取られ、
アクセントを有する単語綴り字にアクセント識別情報が
付加される。そのアクセント識別情報を付加された単語
は綴り・発音記号変換手段３において発音ルールとのマ
ッチングが取られる。The word input from the word input means 1 is obtained by matching the word spelling character string with the accent rule read from the accent rule storage unit 4 for each word spelling character string after the word spelling character. Taken at 2,
Accent identification information is added to an accented word spelling. The word to which the accent identification information is added is matched with the pronunciation rule in the spelling / diamond symbol conversion means 3.

綴り・発音記号変換手段３におけるマッチングは、ア
クセント識別情報が付加された単語綴り字の先頭から未
変換のアクセント識別情報が付加された単語綴り字列毎
に、当該アクセント識別情報を付加された単語綴り字列
と、発音ルール格納部５から読み出された発音ルールと
のマッチングが取られ、未変換のアクセント識別情報を
付加された単語綴り字列は、発音記号へ変換される。Matching in the spelling / pronunciation conversion means 3 is performed for each word spelling string to which unconverted accent identification information has been added from the beginning of the word spelling to which accent identification information has been added. Matching between the spelling character string and the pronunciation rule read from the pronunciation rule storage unit 5 is performed, and the word spelling character string to which unconverted accent identification information is added is converted into a phonetic symbol.

このように、単語の発音記号への変換に先立って、単
語の内のアクセントのある綴り字にその旨を表すアクセ
ント識別情報が付加された後その単語が綴り・発音記号
変換手段３に渡されるから、単語の発音記号への変換に
際して単語のアクセントが考慮された発音記号を自動的
に得ることができる。In this manner, prior to conversion of a word into a phonetic symbol, accent identification information indicating that fact is added to the accented spelling of the word, and the word is passed to the spelling / phonetic symbol conversion means 3. Thus, it is possible to automatically obtain a phonetic symbol in which the accent of the word is taken into account when converting the word into a phonetic symbol.

従って、本発明によって正しい発音記号を得ることが
できる。Therefore, correct phonetic symbols can be obtained by the present invention.

〔Example〕

第２図は本発明の一実施例を示す。この図において、
10は発音記号を得ようとする単語の綴り字を受けて後述
するようなアクセント識別文字を付加された文字列を出
力するアクセント位置決定部である。11はアクセントル
ール格納部で、アクセント位置決定部10でのアクセント
識別文字の付加制御に用いられるアクセントルールをア
クセント位置決定部10に与えるものである。12はアクセ
ント位置決定部10からの出力文字列を発音記号に変換す
る綴り・発音記号変換部であり、その変換の際に用いら
れる発音ルールを綴り・発音記号変換部12へ供給するの
が発音ルール格納部13である。アクセントルール格納部
11及び発音ルール格納部13に格納されるアクセントルー
ル及び発音ルールは第３図に示されるようなフォーマッ
ト（書式）に構成されている。この図ので示すフォー
マットはXAYを含む文字列があればＡをＢで置き換える
ことを表し、はXYを含む文字列があればＸとＹとの間
にＡを挿入することを表し、はXAYを含む文字列があ
ればＡを削除することを表し、はXXXAY,XXAY又はXAY
を含む文字列があればＡをＢに置換することを表してい
る。の“（",“）”は任意性を表す。FIG. 2 shows an embodiment of the present invention. In this figure,
Reference numeral 10 denotes an accent position determining unit that receives a spelling of a word for which a phonetic symbol is to be obtained and outputs a character string to which an accent identification character is added as described later. Reference numeral 11 denotes an accent rule storage unit, which gives the accent position determining unit 10 an accent rule used for controlling the addition of accent identification characters in the accent position determining unit 10. Reference numeral 12 denotes a spelling and phonetic symbol conversion unit that converts a character string output from the accent position determining unit 10 into phonetic symbols, and supplies pronunciation rules used in the conversion to the spelling and phonetic symbol conversion unit 12. This is the rule storage unit 13. Accent rule storage
The accent rules and the pronunciation rules stored in the pronunciation rule 11 and the pronunciation rule storage unit 13 are configured in a format as shown in FIG. The format shown in this figure indicates that if there is a character string containing XAY, A is replaced with B, if there is a character string containing XY, A is inserted between X and Y, and XAY is replaced with If there is a character string containing it, it means that A is deleted, and XXXAY, XXAY or XAY
Indicates that if there is a character string containing A, A is replaced with B. "(", ")" In the expression represents arbitraryness.

以下、順を追って本発明による処理過程を説明する。 Hereinafter, the processing steps according to the present invention will be described step by step.

発音記号を得ようとする単語を受け取るアクセント位
置決定部10における処理フローが第４図に示されてい
る。そのステップS1において、綴りポインタ（spell）
を入力単語の先頭綴り字にセットしてステップS2に進
み、アクセントルール格納部（辞書）11から１つのアク
セントルールを読み込む。そして、ステップS3において
辞書のアクセントルールをすべて読み出したか否かの判
定が行なわれる。すべて読み出していれば、その単語に
ついての処理は終了する。FIG. 4 shows a processing flow in the accent position determination unit 10 for receiving a word for obtaining a phonetic symbol. In step S1, a spelling pointer (spell)
Is set as the first spelling of the input word, and the routine proceeds to step S2, where one accent rule is read from the accent rule storage unit (dictionary) 11. Then, in step S3, it is determined whether or not all the accent rules of the dictionary have been read. If all the words have been read, the processing for the word ends.

ステップS3での判定が否定であるならば、読み出した
アクセントルールの先頭にルールポインタ（rule）をセ
ットして（ステップS4）、ステップS5におけるマッチン
グ処理（その詳細は第５図乃至第８図に示す。）が行な
われ、そのマッチングに成功すれば（ステップS6の
Ｔ）、アクセントルールに基づいて文字変換処理がステ
ップS7において行なわれてステップS2に戻る。If the determination in step S3 is negative, a rule pointer (rule) is set at the head of the read accent rule (step S4), and the matching process in step S5 (the details are shown in FIGS. 5 to 8). Is performed, and if the matching is successful (T in step S6), a character conversion process is performed in step S7 based on the accent rule, and the process returns to step S2.

ステップS5におけるマッチングに失敗するならば（ス
テップS6のＦ）、ステップS2に戻って同様の処理を行な
う。これら処理例が第９図及び第10図に示されている。
この両図ともマッチングがとれるルールのみを示してい
る。これら以外のルールはいずれも、その途中におい
て、又は最終的にマッチングがとれないものである。第
９図及び第10図の処理例から明らかなように、アクセン
トルールに基づいて文字列内に挿入されるアクセント識
別情報としては、例えば図示の如く数字が用いられる。
１は第１アクセントの識別のために、２は第２アクセン
トの識別のために、そして３は弱音の識別のために用い
られる。If the matching in step S5 fails (F in step S6), the process returns to step S2 to perform the same processing. Examples of these processes are shown in FIG. 9 and FIG.
Both figures show only rules that can be matched. All other rules are those that cannot be matched in the middle or finally. As is clear from the processing examples shown in FIGS. 9 and 10, as the accent identification information inserted into the character string based on the accent rules, for example, numbers are used as shown in the figure.
1 is used for identifying the first accent, 2 is used for identifying the second accent, and 3 is used for identifying the weak sound.

次に、ステップS5の処理を第５図乃至第８図を用いて
説明する。Next, the processing in step S5 will be described with reference to FIGS.

第５図において、マッチングは綴りを１文字ずつ右へ
ずらしながら（S105）、match（S103）を呼び出して綴
りとアクセントルールとのマッチングを試行し、そのマ
ッチングに成功すれば成功を呼び出したmatchに返す（S
107）。綴りの残り文字数がアクセントルールの文字数
より少なくなったとき失敗を返す（S106）。なお、第５
図において、spellは綴り字を指すポインタ、ruleはア
クセントルール文字（記号）を指すポインタ、spell.le
nは綴りの文字数、rule.lenはアクセントルールの文字
数である。In FIG. 5, the matching is performed by shifting the spelling one character to the right (S105), calling match (S103) to try to match the spelling with the accent rule, and if the matching is successful, the success is called a match. Return (S
107). If the number of remaining characters in the spelling is less than the number of characters in the accent rule, a failure is returned (S106). The fifth
In the figure, spell is a pointer to spelling, rule is a pointer to accent rule character (symbol), spell.le
n is the number of characters in spelling, and rule.len is the number of characters in accent rules.

第５図のステップS5は第６図に示すような処理ステッ
プから成っている。第６図において、spell及びruleは
第５図と同様にそれぞれポインタを意味し、＊spellはs
pellの指している文字を、＊ruleはruleの指している文
字を示す。又、flagのONは括弧内を、そのOFFは括弧外
を表す。Step S5 in FIG. 5 comprises processing steps as shown in FIG. In FIG. 6, spell and rule mean pointers, respectively, as in FIG.
* rule indicates the character pointed to by the pell, and * rule indicates the character pointed to by the rule. Further, ON of the flag indicates the inside of the parentheses, and OFF indicates the outside of the parentheses.

第６図のステップS200においてマッチング処理に入
り、ステップS201において丸括弧の処理、そしてステッ
プS202において一文字毎のマッチング処理（sub.matc
h）を行なう。これら両ステップの詳細は第７図及び第
８図を用いて説明する。なお、ステップS201の後に、ス
テップS250においてルールポインタがそのルールの終り
を指している否かの判定が行なわれ、その判定が肯定な
らば成功を返すが（ステップS251）、そうでなければ、
以下に述べるステップS202以降の処理に進む。In step S200 of FIG. 6, the matching process is started. In step S201, parenthesis processing is performed, and in step S202, matching processing for each character (sub.matc) is performed.
Perform h). Details of these two steps will be described with reference to FIGS. 7 and 8. After step S201, it is determined in step S250 whether or not the rule pointer points to the end of the rule. If the determination is affirmative, success is returned (step S251).
The process proceeds to step S202 and subsequent steps described below.

ステップS202におけるマッチング処理において一文字
毎のマッチングに成功が生ずるならば（ステップS203の
Ｔ）、matchは自分自身を呼び出し、次の文字を渡す
（ステップS204）。matchが成功を返して来るならば
（ステップS205のＴ）、呼び出した方のmatchも成功を
返す（ステップS206）。ステップS204のマッチングにお
いて失敗が返され（ステップS205のＦ）、且つそのとき
のポインタが丸括弧外ならば（ステップS207のＴ）、失
敗を返す（ステップS208）。逆に、丸括弧内ならば、丸
括弧外にでて（ステップS209）、再びmatchを呼び出す
（ステップS210）。呼び出されたmatchでのマッチング
処理（ステップS210）が成功ならば（ステップS211の
Ｔ）、成功を返す（ステップS212）が、逆に失敗ならば
（ステップS211のＦ）、失敗を返す（ステップS212）。If the matching process for each character is successful in the matching process in step S202 (T in step S203), match calls itself and passes the next character (step S204). If the match returns success (T in step S205), the calling match also returns success (step S206). If a failure is returned in the matching in step S204 (F in step S205), and the pointer at that time is outside parentheses (T in step S207), a failure is returned (step S208). Conversely, if it is inside the parentheses, it goes out of the parentheses (step S209), and the match is called again (step S210). If the matching process (step S210) in the called match is successful (T in step S211), the process returns success (step S212), but if it fails (F in step S211), it returns failure (step S212). ).

又、ステップS202におけるマッチング処理において一
文字毎のマッチングに失敗が生じ（ステップS203の
Ｆ）、且つそのときのポインタが丸括弧外ならば（ステ
ップS214のＴ）、失敗を返す（ステップS215）。逆に丸
括弧内ならば（ステップS214のＦ）、丸括弧外に出て
（ステップS216）、再びmatchを呼び出す（ステップS21
7）。この呼び出されたmatchでのマッチング処理（ステ
ップS217）が成功ならば（ステップS218のＴ）、成功を
返す（S219）が、失敗ならば（ステップS218のＦ）失敗
を返す。In addition, in the matching process in step S202, a failure occurs in matching for each character (F in step S203), and if the pointer at that time is outside parentheses (T in step S214), the failure is returned (step S215). Conversely, if it is inside the parentheses (F in step S214), it goes out of the parentheses (step S216) and calls match again (step S21).
7). If the matching process (step S217) in the called match is successful (T in step S218), success is returned (S219), but if it is unsuccessful (F in step S218), failure is returned.

上述のようなマッチング処理における丸括弧の処理は
次の通りである（第７図参照）。第７図におけるrule,
＊rule及びflagの意味は第６図と同じである。そのステ
ップS300において丸括弧の処理が始まってルールポイン
タが“（”を指しているならば（ステップS301のＴ）、
flagをONにして（ステップS302）ルールポインタを一文
字右へ進め（ステップS303）、再び丸括弧の処理を呼び
出し（ステップS304）、ルールポインタの値を返す（ス
テップS305）。The processing of parentheses in the above-described matching processing is as follows (see FIG. 7). Rule, in FIG.
* The meanings of rule and flag are the same as in FIG. If the processing of the parentheses starts in step S300 and the rule pointer points to "(" (T in step S301),
The flag is turned on (step S302), the rule pointer is moved right by one character (step S303), the processing of parentheses is called again (step S304), and the value of the rule pointer is returned (step S305).

ルールポインタが右へ進められて“（”を指すに至れ
ば（ステップS306のＴ）、flagをOFFにして（ステップS
307）、ルールポインタを一文字右へ進めるステップS30
3以降の処理に入る。When the rule pointer is advanced rightward and points to “(” (T in step S306), the flag is turned OFF (step S306).
307), Step S30 to advance the rule pointer one character to the right
Start the process after 3.

又、ルールポインタが“（”も又“）”も指していな
いつまりステップS301及びS306の判定がＦであるなら
ば、ルールポインタの値を返す（ステップS308）。If the rule pointer does not point to "(" or ")", that is, if the determination in steps S301 and S306 is F, the value of the rule pointer is returned (step S308).

そして、このような第６図のステップS201における丸
括弧の処理に続いて行なわれる綴り及びルールから一文
字ずつを受け取ってのマッチング処理が第８図に示され
ている。第８図において、C1は綴りの一文字、C2はアク
セントルールの一文字である。FIG. 8 shows a matching process of receiving one character at a time from the spelling and the rule, which is performed subsequent to the processing of the parentheses in step S201 in FIG. In FIG. 8, C1 is one character of spelling, and C2 is one character of an accent rule.

このような一文字のマッチング処理に入ると（ステッ
プS400）、先ず、ステップS401において、“C2"が
“＋”即ち母音字（a,e,i,o,u,y）にマッチングする文
字であるか否かの判定が為される。その判定が真（Ｔ）
で、“C1"が母音字であるならば（ステップS402の
Ｔ）、成功を返す（ステップS403）が、“C1"が母音字
でないならば（ステップS402のＦ）、失敗を返す（ステ
ップS404）。When such a one-character matching process starts (step S400), first, in step S401, "C2" is a character that matches "+", that is, a vowel (a, e, i, o, u, y). Is determined. The judgment is true (T)
If "C1" is a vowel character (T in step S402), success is returned (step S403), but if "C1" is not a vowel character (F in step S402), failure is returned (step S404). ).

ステップS401における判定が偽（Ｆ）であるならば、
ステップS405において“C2"が“−”即ち子音字（母音
字以外のアルファベット）にマッチングする文字である
か否かの判定が為される。その判定が真（Ｔ）で、“C
1"が子音字であるならば（ステップS406のＴ）、成功を
返す（ステップS407）が、“C1"が子音字でないならば
（ステップS406のＦ）、失敗を返す（ステップS408）。If the determination in step S401 is false (F),
In step S405, it is determined whether or not "C2" is a character that matches "-", that is, a consonant (an alphabet other than a vowel). If the judgment is true (T), "C
If "1" is a consonant (T in step S406), success is returned (step S407), but if "C1" is not a consonant (F in step S406), failure is returned (step S408).

ステップS405における判定が偽（Ｆ）であるならば、
ステップS409において“C2"が“＄”即ち数字（０〜
９）にマッチングする文字であるか否かの判定が為され
る。その判定が真（Ｔ）で、“C1"が数字であるならば
（ステップS401のＴ）、成功を返す（ステップS411）
が、“C1"が数字でないならば（ステップS410のＦ）、
失敗を返す（ステップS412）。If the determination in step S405 is false (F),
In step S409, “C2” is changed to “＄”, that is, a number (0 to 0).
A determination is made as to whether or not the character matches 9). If the determination is true (T) and “C1” is a number (T in step S401), success is returned (step S411).
However, if “C1” is not a number (F in step S410),
Return failure (step S412).

ステップS409における判定が偽（Ｆ）であるならば、
ステップS413において“C2"が“C1"であるか否かの判定
が為される。もしその判定が真（Ｔ）であるならば成功
を返す（ステップS415）が、偽（Ｆ）であるならば失敗
を返す（ステップS416）。If the determination in step S409 is false (F),
In step S413, it is determined whether or not “C2” is “C1”. If the determination is true (T), return success (step S415), and if false (F), return failure (step S416).

このような単語綴りとアクセントルールとのマッチン
グ処理を行なった単語の例が第９図及び第10図に示され
ている。これら図の上部がそれを示しており、そのアク
セントルールはそれらに示される単語にマッチングする
アクセントルールだけを示してある。これら単語とアク
セントルールとのマッチング処理は第５図乃至第８図に
示したところに従って進められるものであり、そのマッ
チング処理過程の詳細は省略する。Examples of words that have undergone such matching processing between word spelling and accent rules are shown in FIG. 9 and FIG. The upper part of these figures shows this, and the accent rules show only those accent rules that match the words shown in them. The matching process between these words and the accent rules proceeds in accordance with those shown in FIGS. 5 to 8, and the details of the matching process are omitted.

上述のようにしてアクセント識別文字を付加された単
語綴り文字列がアクセント位置決定部10から綴り・発音
記号変換部12へ送り込まれ、次のような変換処理を受け
ることによりアクセント位置決定部10へ入力された単語
に対する発音記号が綴り・発音記号変換部12から出力さ
れる。The word spelling character string to which the accent identification character has been added as described above is sent from the accent position determining unit 10 to the spelling / phonetic symbol converting unit 12, and is subjected to the following conversion processing to the accent position determining unit 10. The phonetic symbols for the input word are output from the spelling / phonetic symbol conversion unit 12.

第11図はその変換処理フローを示す。そのステップS5
00において、綴りポインタを入力綴り字の先頭にセット
する。綴りポインタが入力綴り字の最終文字を指してい
る（Ｔ）場合には終了となるが、通常はそうではない
（Ｆ）ので、発音ルール格納部（辞書）13から発音ルー
ルを読み込む（ステップS502）。その発音ルールと綴り
ポインタが指している文字から右の文字列とマッチング
する最長の発音ルールを捜す（ステップS503,S504）。
それら両者間のマッチングが得られるならば（ステップ
S504のＦ）、綴り字を発音記号に変換するが、綴り字そ
のものは変換せず、変換された発音記号を綴り字とは別
に綴り・発音記号変換部12内の格納部に格納する（ステ
ップS505）。FIG. 11 shows the conversion processing flow. Step S5
At 00, the spelling pointer is set at the beginning of the input spelling. If the spelling pointer points to the last character of the input spelling (T), the process ends. However, this is not the case (F), so the pronunciation rule is read from the pronunciation rule storage unit (dictionary) 13 (step S502). ). The longest pronunciation rule matching the character string to the right of the character pointed to by the pronunciation rule and the spelling pointer is searched for (steps S503, S504).
If a match between them can be obtained (step
(F in S504), the spelling is converted into a phonetic symbol, but the spelling itself is not converted, and the converted phonetic symbol is stored separately from the spelling in a storage unit in the spelling / phonetic symbol conversion unit 12 (step S504). S505).

次いで、変換された文字数だけ綴りポインタを右に進
めて（ステップS506）ステップS501に戻り、再びステッ
プS502〜S504のマッチング探索ループに入って上述進め
られた綴りポインタが指している文字から右にある文字
列についての上述と同様にマッチング処理を行なう。Next, the spelling pointer is advanced rightward by the number of converted characters (step S506), and the process returns to step S501. Then, the processing enters the matching search loop of steps S502 to S504 again, and the spelling pointer is shifted to the right from the character pointed to by the advanced spelling pointer. The matching process is performed on the character string in the same manner as described above.

このようにして綴りポインタが綴りの最終文字に至っ
たとき、入力されたアクセント識別文字付綴り文字列の
発音記号への変換処理は終了し、次の同様の変換処理に
入る。When the spelling pointer reaches the last character of the spelling in this way, the process of converting the input spelling character string with accent identification characters into phonetic symbols is completed, and the following similar conversion process is started.

このような変換処理例が第９図及び第10図に示されて
いる。これら図の下部がそれを示しており、その発音ル
ールは例示のアクセント識別文字付綴り文字列にマッチ
ングする発音ルールのみを示してある。Examples of such a conversion process are shown in FIGS. 9 and 10. The lower part of these figures shows this, and the pronunciation rules only show the pronunciation rules that match the spelling string with the accented identification characters in the example.

〔The invention's effect〕

以上説明したように本発明によれば、単語の発音記号
への変換にアクセント識別情報を有効に利用して単語の
発音記号への変換を自動的に行うことができる。単語の
発音記号への自動変換にアクセント識別情報を利用する
手段が用いられているから、アクセントの有無によって
発音が変わって来るような言語で用いられる単語の発音
記号への自動変換を正しく行うことができる。As described above, according to the present invention, it is possible to automatically convert a word into a phonetic symbol by effectively using accent identification information for converting the word into a phonetic symbol. Since the means that uses accent identification information is used for automatic conversion of words to phonetic symbols, correct conversion of words to phonetic symbols used in languages whose pronunciation changes depending on the presence or absence of accents Can be.

[Brief description of the drawings]

第１図は本発明の原理ブロック図、第２図は本発明の一実施例を示す図、第３図はアクセントルール及び発音ルールのフォーマッ
トを説明する図、第４図はアクセント位置決定部の処理フローを示す図、第５図乃至第８図は第５図処理フロー内のマッチングス
テップの詳細図、第９図及び第10図は処理例を示す図、第11図は綴り・発音記号変換部の処理フローを示す図、第12図（Ａ）及び（Ｂ）は従来の最長一致法の欠点を説
明する図である。第１図において、１は単語入力手段、２はアクセント識別情報付加手段、
３は綴り・発音記号変換手段である。第２図において、 10はアクセント位置決定部、11はアクセントルール格納
部、12は綴り・発音記号変換部、13は発音ルール格納部
である。1 is a block diagram showing the principle of the present invention, FIG. 2 is a diagram showing an embodiment of the present invention, FIG. 3 is a diagram for explaining the format of accent rules and pronunciation rules, and FIG. FIG. 5 is a diagram showing a processing flow, FIG. 5 to FIG. 8 are detailed diagrams of a matching step in the processing flow of FIG. 5, FIG. 9 and FIG. 10 are diagrams showing processing examples, and FIG. FIG. 12A and FIG. 12B are views for explaining the drawbacks of the conventional longest matching method. In FIG. 1, 1 is word input means, 2 is accent identification information adding means,
Reference numeral 3 denotes a spelling / phonetic symbol conversion unit. In FIG. 2, reference numeral 10 denotes an accent position determination unit, 11 denotes an accent rule storage unit, 12 denotes a spelling / phonetic symbol conversion unit, and 13 denotes a pronunciation rule storage unit.

Claims

(57) [Claims]

1. A word input means (1) for inputting a word composed of a required number of spellings, an accent rule storage unit (4), and a spelling character string after each word spelling character in the input word. An accent identification information adding means (2) for reading an accent rule from the accent rule storage section (4), matching the spelled character string with the read accent rule, and adding accent identification information to the spelled character having an accent; A pronunciation rule storage unit (5), and a pronunciation rule from the pronunciation rule storage unit (5) for each word spelling string to which unconverted accent identification information has been added from the beginning of the word spelling added with accent identification information. And a spelling / pronunciation conversion unit that converts a word spelling string to which unconverted accent identification information has been added into phonetic symbols based on the read pronunciation rules. (3) An apparatus for converting word spelling into phonetic symbols, characterized by comprising (3).