JPH0264867A

JPH0264867A - Correspondence relation deciding system for character description result

Info

Publication number: JPH0264867A
Application number: JP63215194A
Authority: JP
Inventors: Etsuko Obuka; 大深　悦子
Original assignee: NIPPON I B M KK; IBM Japan Ltd
Current assignee: NIPPON I B M KK; IBM Japan Ltd
Priority date: 1988-08-31
Filing date: 1988-08-31
Publication date: 1990-03-05
Anticipated expiration: 2010-12-06
Also published as: JPH07113925B2

Abstract

PURPOSE: To eliminate the labor of preparing a dictionary by converting character notation results into phoneme strings and deciding the correspondence between the notation results from the degree of difference calculated at the phoneme level. CONSTITUTION: The system comprises means for converting the character notation results to be compared into phoneme strings, means for calculating the degree of difference at the phoneme level, and means for deciding the correspondence between the notation results according to that degree of difference. For English notation, means is provided for converting an English notation result into a phoneme string consisting of phonemes used in English. When the correspondence between a katakana notation result and an English notation result is to be decided, table means is provided that generates one or more Japanese phonemes for each phoneme used in English. Consequently, no foreign-word notation dictionary has to be prepared in advance, and continuous updating, such as registering newly coined words, derivatives, and proper nouns, is also unnecessary.

Description

DETAILED DESCRIPTION OF THE INVENTION

A. Field of Industrial Application

The present invention relates to a system for determining the correspondence between notation results written in the same or different character notations. More specifically, it relates to a system that converts character notation results into phoneme strings and determines their correspondence by calculating the degree of difference at the phoneme level. The correspondence may be determined in a binary manner (corresponds or does not correspond), or more finely according to the degree of difference, for example as strong, weak, or no correspondence.

B. Prior Art and Its Problems

When foreign words are written with phonetic characters such as Japanese kana, there are often several ways to write them. Taking the original notation "interview" as an example, its kana notations include インタビュ, インタヴユ, インタビュー, and インタービュー. These notation results all represent the same word "interview", whereas, for example, "インタビーン" does not.

Such a determination of whether notation results represent the same word is required in, for example, proofreading systems for Japanese word processors and information retrieval systems.

As pointed out in Tanaka et al., "Analysis of Katakana Strings in Abstracts of Scientific and Technical Literature" (Keiryo Kokugogaku, Vol. 14, No. 1, 1983), conventional proofreading systems have used a dictionary method in which pairs of <original notation result, kana notation result> are registered. For conventional information retrieval as well, a dictionary method is disclosed in Japanese Unexamined Patent Publication (Kokai) No. 62-11932.

However, the dictionary method has the following problems.

・All newly coined words, derived words, and proper nouns must be registered in the dictionary, so creating the dictionary is laborious and it must be updated continuously.

・When the kana notation is not fixed, all kana notations must be covered.

Meanwhile, for determining the correspondence between kana notation results, besides the method suggested in the above publication of putting all corresponding kana notation results into a single dictionary entry, there is also a method that uses a unified notation. A method for obtaining the unified notation is disclosed in Goto et al., "Diversity of Notation in Technical Terms Written in Katakana" (Mita Society for Library and Information Science conference, 1985).

In this method, rewriting rules (for example, deletion of the long-vowel mark and the geminate mark, writing the small kana of contracted sounds at full size, and rewriting the f sound to h and the v sound to b) are applied to the kana string being processed, sequentially from its leading characters, to obtain a unified notation result. Whether two kana strings correspond is then decided by whether their unified notation results match.

This method has the following problem.

To avoid judging notations that represent the same word as different words, as many conversion rules as possible must be set, but this raises the probability that different words are regarded as the same.

For example, 「オータナチブ」 and 「オルタナチブ」 both represent "alternative". If a conversion rule (ル → long-vowel mark) is added to unify them, it combines with the rule that deletes the long-vowel mark to give (ル → long-vowel mark → deleted). Applying this to 「バックル」 (buckle) yields 「バック」 (back), so the two can no longer be distinguished.
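The over-unification problem of the rule-based approach can be reproduced with a few lines of Python. This is only an illustrative sketch: the rule list, its ordering, and the function name `unify` are assumptions, and the rules are applied globally rather than character by character from the head of the string as in the cited method.

```python
# Hypothetical rewrite rules in the spirit of the prior-art method.
# The rule set and its ordering are assumptions made for illustration.
RULES = [
    ("ル", "ー"),  # added rule: ル -> long-vowel mark
    ("ー", ""),    # existing rule: delete the long-vowel mark
    ("ッ", ""),    # existing rule: delete the geminate mark
]


def unify(kana: str) -> str:
    """Apply every rewrite rule in order and return the unified notation."""
    for old, new in RULES:
        kana = kana.replace(old, new)
    return kana


# The two spellings of "alternative" are unified as intended ...
same_word = unify("オータナチブ") == unify("オルタナチブ")
# ... but "buckle" and "back" collapse to the same string as well.
different_words = unify("バックル") == unify("バック")
```

With these assumed rules both comparisons evaluate to true, which is exactly the false match the text describes.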

C. Means for Solving the Problems

The present invention has been made in view of the above problems and is characterized by comprising means for converting the character notation results to be compared into phoneme strings, means for calculating the degree of difference at the phoneme level, and means for determining the correspondence between the character notation results based on that degree of difference.

For katakana notation, for example, means is provided for converting a katakana notation result into a phoneme string consisting of phonemes used in Japanese. In this specification, "phonemes used in Japanese" means readings (yomi) extended with the concepts of the geminate consonant (sokuon), the long vowel (chōon), and the contracted sound (yōon).

For English notation, means is provided for converting an English notation result into a phoneme string consisting of phonemes used in English. When the correspondence between a katakana notation result and an English notation result is to be determined, table means is provided that generates one or more corresponding Japanese phonemes for each phoneme used in English.

D. Embodiment

D1. System for determining correspondence between English notation and kana notation

The present invention is described below with reference to a system for determining the correspondence between English notation and kana notation. FIG. 1 shows the flow of the series of processes performed by the system. Sections D2 to D8 describe the details of each means shown in FIG. 1.

D2. Japanese phoneme string generation means

① A katakana notation result is input in the form of character codes, for example from a keyboard.

② Table 1 is consulted for each character to obtain the corresponding phoneme string (example: 「プレイヤー」 → phoneme string: pureiya).

The geminate mark (ッ), the long-vowel mark (ー), and the small kana of contracted sounds (ャ, ュ, ョ) are never used alone, so they are not regarded as phonemes. Taking the immediately preceding phoneme as X, the geminate and the long vowel are handled as the length of X, and the contracted sound is handled as a contracted-sound element of X (see D4). In the notation used here, X accompanied by ー or ッ is written as X carrying a length marker, and X followed by ャ, ュ, or ョ is written as X carrying the element ja, ju, or jo respectively.

[Table 1: Japanese kana to phoneme string table]

③ The conversion rules described below are applied to the phoneme string obtained in ②, from left to right, to obtain a unified phoneme string.

The main conversion rules are as follows.

1. Rule for deleting a long vowel between vowels

2. Conversion rules for semivowels (y, w)
・w other than at the beginning of a word → u (example: sandowici → sandouici)
・y other than at the beginning of a word → i (example: pureiya → pureiia)

3. Conversion rule for diphthongs
Let the preceding vowel be V1 and the following vowel be V2. The vowel string V1V2 is converted according to Table 2.

[Table 2: Diphthong (V1V2) conversion table. Legend: blank = no conversion; a marked v = v + long-vowel mark; ( ) = can be omitted.]

4. Rules concerning the moraic nasal (ン)
・Deletion of the moraic nasal before the ナ row and the マ row (disclosed in the paper by Goto et al. cited above) (example: チャンネル → チャネル)
・ム (mu) before the パ row and the バ row → ン (example: ラムプ → ランプ)

5. Rule concerning vowels that are easily devoiced
・kis → kus (example: テキスト (tekisuto) → テクスト (tekusuto))

④ The unified phoneme string thus obtained is temporarily stored in a work area in memory as the Japanese phoneme string of the given katakana notation result.
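The table lookup and three of the unification rules above (non-initial w → u, non-initial y → i, and kis → kus) can be sketched in Python as follows. The dictionary `KANA_TABLE` is a hypothetical excerpt standing in for Table 1, which is not reproduced in the text, and the function names are assumptions. For brevity the sketch simply skips the geminate, long-vowel, and small-kana marks instead of recording them as attributes of the preceding phoneme.

```python
# Hypothetical excerpt of Table 1 (kana -> phoneme string); the real table
# is not available, so only the kana needed for the examples are listed.
KANA_TABLE = {
    "プ": "pu", "レ": "re", "イ": "i", "ヤ": "ya",
    "テ": "te", "キ": "ki", "ス": "su", "ト": "to",
}

# Marks that modify the preceding phoneme and are not phonemes themselves.
# This sketch drops them; the described system keeps them as attributes.
MODIFIERS = {"ー", "ッ", "ャ", "ュ", "ョ"}


def kana_to_phonemes(kana: str) -> str:
    """Look up each kana character and concatenate the phoneme strings."""
    return "".join(KANA_TABLE[ch] for ch in kana if ch not in MODIFIERS)


def unify(phonemes: str) -> str:
    """Apply a subset of the unification rules to a phoneme string."""
    head, tail = phonemes[:1], phonemes[1:]
    # Semivowel rules: only w and y that are not word-initial are rewritten.
    tail = tail.replace("w", "u").replace("y", "i")
    unified = head + tail
    # Devoicing rule: kis -> kus.
    return unified.replace("kis", "kus")
```

For instance, `kana_to_phonemes("プレイヤー")` gives `pureiya`, and `unify` turns it into `pureiia`, matching the example in rule 2.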

D3. English phoneme string generation means

① An English notation result is input in the form of character codes, for example from a keyboard.

② A phoneme string is obtained from the English spelling. Algorithms for this already exist and are disclosed in, for example, Elovitz et al., "Letter-to-Sound Rules for Automatic Translation of English Text to Phonetics" (IEEE Trans. ASSP-24, No. 6, 1976); refer to such literature for details. In this specification, the English phonemes shown in Table 3 are used.

③ The phoneme string obtained in ② is temporarily stored in a work area in memory as the English phoneme string of the given English notation.

[Table 3: English phoneme symbol table]

D4. English phoneme / Japanese phoneme correspondence table means

① Representation format of phonemes

To compare English phonemes with Japanese phonemes, the representation format of a phoneme is defined as follows.

consonant phoneme (c) ≡ Rc + F1 + F2
vowel phoneme (v) ≡ Rv + F1 + F3 + F4

Rc: the Japanese reading (yomi) corresponding to a phoneme that belongs to the consonants
Rv: the Japanese reading (yomi) corresponding to a phoneme that belongs to the vowels
(The geminate, the long vowel, and the j sound of a contracted sound in Japanese are not included in the reading. For example, for 「キャ」 the Japanese phoneme string is kja, so Rc = k and Rv = a.)
F1: priority when there are several Japanese readings corresponding to one phoneme
F2: presence or absence of a contracted sound (j sound)
F3: whether or not the phoneme is geminated
F4: whether or not the phoneme is long

Taking 「キャ」 as an example:
consonant phoneme = Rc (yomi: k) + F1 (priority) + F2 (contracted sound: j)
vowel phoneme = Rv (yomi: a) + F1 (priority) + F3 (geminate = 0) + F4 (long vowel = 0)

② The English phoneme string obtained in D3 is taken as input.

③ The English phoneme string is converted into the representation format described in ① by means of Table 4.

The items in Table 4 are as follows.
@: type of phoneme (c: consonant, v: vowel)
+v: the vowel added in katakana notation when no vowel follows the consonant. For example, /t/ in "cat" is written ト = to in Japanese, so +v = 'o'.
F2 of a vowel phoneme: indicates that a contracted-sound element is added to the preceding consonant. It is used in addition to the F2 of the preceding consonant when English and Japanese phonemes are compared.

[Table 4: English phoneme / Japanese phoneme correspondence table. Consonant and semivowel rows have the columns @ (phoneme type), phoneme, F1, R (yomi), F2 (+j), and +v; vowel rows add F3 and F4. The rows are grouped by manner of articulation (plosives, nasals, fricatives, affricates, and so on), followed by special symbols corresponding to plurals (TS, DS), semivowels, and vowels. Each English phoneme lists one or more Japanese reading candidates in priority order; for example, KX (/k/) has the candidates k, g, and c, and TX (/t/) has the candidates t and c.]

④ Each phoneme of the English phoneme string is paired with the conversion result obtained in ③ and temporarily stored in a work area in memory.

Example: "cup"

The English phoneme string is KX UH PX, and the conversion result of each of these phonemes is expressed in the format described above, for example PX: [Rc (p) + F1 + F2 (contracted sound = 0)].

D5. Means for adjusting the conversion results of the English phoneme string

① The English phoneme string conversion results obtained in D4 are adjusted according to the phonological environment and the spelling. The two adjustment items are described below.

(1) Adjustment of the Japanese reading corresponding to a phoneme

The priority of a reading (F1) and the possibility of a contracted sound (F2) are changed by adjustment rules that take into account the phonological environment before and after the phoneme and the corresponding spelling. The main adjustment rules are given below. Each rule is expressed in the form "phoneme: condition → action to be taken when the condition is satisfied".

MX: the following phoneme is PX, BX, or MX → F1 (yomi: n', the moraic nasal ン) = 0
Example: the kana notation is 「ランプ」.

NX: the following phoneme is TX, DX, or NX → F1 (yomi: n') = 0
Example: the kana notation is 「テント」.

EE: the corresponding spelling is 'e' → F1 (yomi: e) = 0
Example: the kana notation of "meter" is 「メーター」.

UH: the corresponding spelling is 'o' → F1 (yomi: o) = 0
Example: the kana notation of "action" is 「アクション」, in which UH is read as 'o'.

UH: the corresponding spelling is 'a' → F1 (yomi: a) = 0
Example: the kana notation of "China" is 「チャイナ」, in which UH is read as 'a'.

AE: the preceding phoneme is KX or GX and the following phoneme is PX, BX, TX, DX, KX, GX, or RX → F2 = 2
AE: the preceding phoneme is KX or GX and the following phoneme is a consonant other than PX, BX, TX, DX, KX, GX, and RX → F2 = 1

AA: the following phoneme is the lengthening sound 'ー' → F1 (yomi: a) = 0
Example: the kana notation of "part" (PX AA ー TX) is 「パート」, in which AA is read as 'a'.

(2) Adjustment of the length of a vowel phoneme located immediately before a consonant or at the end of a word

In Japanese, vowel length is an important element for distinguishing words. Here, a vowel-length adjustment value (denoted A1) is calculated by the adjustment rules described below. This value is used for the comparison of F3 (presence or absence of a geminate) and F4 (presence or absence of a long vowel) in the phoneme-level difference calculation of D6.

For explanation, the phoneme string of the word in question is written ... v c v2 ... (v: the vowel phoneme to be adjusted, c: the following consonant phoneme, v2: the vowel phoneme following c).

・When the following consonant phoneme (c) is PX, TX, or KX: [...] or more → A1 = 0
・When the following consonant phoneme (c) is other than PX, TX, and KX: [...]

② Each English phoneme is paired with the conversion result adjusted in ① and the vowel-length adjustment value (A1), and stored in a work area in memory.
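As an illustration of the "phoneme: condition → action" form, the two nasal rules (MX and NX) can be sketched in Python. The function name, the return format, and the label `n'` for the moraic-nasal reading are assumptions made for this sketch; the phoneme strings used in the usage lines are also assumed transcriptions, not values taken from the text.

```python
# Following phonemes that trigger the moraic-nasal reading, per the
# MX and NX rules described above.
NASAL_TRIGGERS = {
    "MX": {"PX", "BX", "MX"},
    "NX": {"TX", "DX", "NX"},
}


def nasal_reading_overrides(phonemes):
    """Return {index: (reading, priority)} for every MX/NX phoneme whose
    following phoneme satisfies the rule. Priority 0 is the top priority."""
    overrides = {}
    for i, phoneme in enumerate(phonemes[:-1]):
        triggers = NASAL_TRIGGERS.get(phoneme)
        if triggers and phonemes[i + 1] in triggers:
            overrides[i] = ("n'", 0)
    return overrides


# Assumed transcriptions of "lamp" and "tent" for demonstration only.
lamp = nasal_reading_overrides(["LX", "AE", "MX", "PX"])
tent = nasal_reading_overrides(["TX", "EH", "NX", "TX"])
```

Both calls return an override at index 2, which corresponds to the ン in 「ランプ」 and 「テント」. The spelling-based rules (EE, UH) would additionally need the letter aligned with each phoneme, which this sketch leaves out.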

D6. Phoneme-level difference calculation means

① The Japanese phoneme string obtained in D2 (hereinafter, the J phoneme string) and the adjusted conversion result of the English phoneme string obtained in D5 (hereinafter, the E phoneme string) are taken as input.

② The degree of difference between the J phoneme string and the E phoneme string is calculated by the procedure described below. The concept of a chunk is explained first.

Definition of a chunk

A chunk, as used in this specification, is a group formed by dividing a phoneme string at the head of each consonant.

Example 1: "alternative"
The English phoneme string is AW LX TX ER NX UH TX IX VX. The consonants are, from the left, LX, TX, NX, TX, and VX, so the string is divided into six chunks: AW | LX | TX ER | NX UH | TX IX | VX.

Example 2: 「オータナティブ」
The Japanese phoneme string is otanatibu. The consonants are, from the left, t, n, t, and b, so the string is divided into five chunks: o | ta | na | ti | bu.

Example 3: A word with a single consonant (k), such as 「キー」, forms one chunk.
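The chunking rule (start a new chunk at every consonant) can be sketched in Python as below. The vowel sets and the function name `chunk` are assumptions for illustration; `ENGLISH_VOWELS` lists only some of the vowel symbols that appear in this specification.

```python
JAPANESE_VOWELS = set("aiueo")
# Assumed subset of the English vowel phoneme symbols used in the text.
ENGLISH_VOWELS = {"IX", "EE", "EI", "EH", "AE", "UH", "ER",
                  "AA", "AW", "OU", "UX", "UU"}


def chunk(phonemes, is_consonant):
    """Split a phoneme sequence into chunks, opening a new chunk at each
    consonant. A leading vowel run forms a chunk of its own."""
    chunks = []
    for phoneme in phonemes:
        if is_consonant(phoneme) or not chunks:
            chunks.append([phoneme])
        else:
            chunks[-1].append(phoneme)
    return chunks


j_chunks = chunk(list("otanatibu"), lambda p: p not in JAPANESE_VOWELS)
e_chunks = chunk("AW LX TX ER NX UH TX IX VX".split(),
                 lambda p: p not in ENGLISH_VOWELS)
```

Here `j_chunks` has five chunks and `e_chunks` has six, in line with Examples 1 and 2.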

Phoneme-level difference calculation

Chunk-count matching is performed on the J phoneme string and the E phoneme string. For pairs whose chunk counts are regarded as matching, the first matching (readings of the consonant parts, Rc) and the second matching are performed in order, and the sum of the penalties given to the applicable items is taken as the degree of difference.

First, chunk-count matching is performed.

(I) When the chunk counts of the two phoneme strings differ by 2 or more

Notation results with such phoneme strings are regarded as not matching (degree of difference = 100 × the difference in chunk count between the two phoneme strings).

(II) When the chunk counts of the two phoneme strings differ by 1

Let the phoneme string with one more chunk be the X phoneme string and the other the X′ phoneme string. Which chunk of X does not correspond to X′ is decided as follows. Chunk sequences are formed by removing one chunk at a time from X, starting with the first chunk, and the first matching is performed between each such sequence and the chunk sequence of X′. If the consonant parts are regarded as matching when the i-th chunk has been removed from X (the penalties of the first matching are described later), the i-th chunk is regarded as the "non-corresponding chunk".

For example, for "Keys" (first chunk: KX EE, second chunk: ZX) and 「キー」 (first chunk: ki), the second chunk of "Keys" (ZX) is the "non-corresponding chunk".
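The search for the "non-corresponding chunk" in case (II), together with the outright rejection of case (I), can be sketched in Python. The names `READINGS`, `consonants_match`, `find_non_corresponding`, and `count_mismatch_difference` are hypothetical; `READINGS` holds only the candidates needed for the "Keys" example, the ZX candidates are an assumption, and penalties are ignored. The sketch also assumes every chunk starts with a consonant and that the longer sequence is the English one.

```python
# Assumed reading candidates per English consonant phoneme (tiny excerpt).
READINGS = {"KX": {"k", "g", "c"}, "ZX": {"z", "s"}}


def consonants_match(e_chunk, j_chunk):
    """True if the Japanese consonant is a candidate reading of the English one."""
    return j_chunk[0] in READINGS.get(e_chunk[0], set())


def count_mismatch_difference(x_chunks, y_chunks):
    """Case (I): return 100 x the chunk-count gap when it is 2 or more,
    otherwise None (the pair goes on to further matching)."""
    gap = abs(len(x_chunks) - len(y_chunks))
    return 100 * gap if gap >= 2 else None


def find_non_corresponding(longer, shorter, match):
    """Case (II): return the index of the first chunk of `longer` whose
    removal makes all consonant parts match `shorter`, else None."""
    for i in range(len(longer)):
        rest = longer[:i] + longer[i + 1:]
        if all(match(a, b) for a, b in zip(rest, shorter)):
            return i
    return None


keys = [["KX", "EE"], ["ZX"]]
ki = [["k", "i"]]
index = find_non_corresponding(keys, ki, consonants_match)
```

For the "Keys" / 「キー」 pair the returned index is 1, that is, the second chunk (ZX).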

If there is no "non-corresponding chunk", the two notations are regarded as not matching (degree of difference = 100).

If there is a "non-corresponding chunk" and its consonant part is listed in Table 5, the second matching is performed between the X phoneme string with the "non-corresponding chunk" removed and the X′ phoneme string. The degree of difference is the sum of the penalties in the first matching + the applicable penalty in Table 5 + the sum of the penalties in the second matching.

If there is a "non-corresponding chunk" but its consonant part is not listed in Table 5, the two notations are regarded as not matching (degree of difference = 100).

[Table 5: consonant phonemes that need not match, and their penalties]

(III) When the chunk counts match

The first matching is performed. The second matching is performed only for pairs in which the consonants of every chunk are regarded as matching. The degree of difference is the sum of the penalties in the first matching + the sum of the penalties in the second matching.

The first matching and the second matching are described below.

First matching

The reading of the consonant part of the J phoneme string (written JRc) is compared with the reading candidates of the consonant part of the E phoneme string (ERci), in order from the first chunk. ERci is given by R (yomi) in Table 4.

Example: 「キャット」 and "cat"

The J phoneme string of 「キャット」 is expressed as consonant + vowel combinations, and its consonant parts are JRc = k and t. The E phoneme string of "cat" is KX AE TX. Therefore the reading candidates for the consonant part of the first chunk (KX) are ERc1 = k, ERc2 = g, ERc3 = c, and those for the consonant part of the second chunk (TX) are ERc1 = t, ERc2 = c.

According to the National Language Council report "Notation of Foreign Words", ティ (ti) may be written チ (ci) and ディ (di) may be written ジ (zi). Therefore JRc is treated as t or c for ティ, and as d or z for ディ, when comparing.

（ｉ）　　ＪＲｃとＥＲｃｌが単数−複数（ｔ−ｃ、ｄ
　−ｚ　）の関係のとき、ペナルティ−（＋２）で一致
するとみなす。(i) JRc and ERcl are singular-plural (t-c, d
-z), it is assumed that they match with a penalty of -(+2).

例「キャット」と“ｃ　ａ　ｔ　ｓ　”の第２チヤンク
の子音部はＪ　Ｒｃ　＝　ｔ　−Ｅ　Ｒｃ　Ｌ＝　ｃと
なり。For example, the consonant part of the second yank of "cat" and "cats" is J Rc = t - E Rc L = c.

ｔとＣはペナルティ−２で一致する。t and C match with a penalty of -2.

（ｉｉ）　　ＪＲｃとＥＲｃｌが有声−無声例「レディ
ース」と”　１ａｄｉｃｓ　”の第３３チヤンクの子音
部は、　Ｊ　Ｒｃ　＝　ｓ　、　Ｅ　Ｒｃ　、　：＝　
Ｚで」１記の関係である。(ii) JRc and ERcl are voiced-unvoiced examples. The consonant part of the 33rd yank of "ladies" and "1adics" is J Rc = s, E Rc, :=
This is the relationship described in 1.

（ａ　）　Ｊ　Ｒｃ　、　Ｅ　Ｒｃ　１の少なくとも一
方において、この子音音素が有声音ならば直前または直後子音が無声
音、この子音音素が無声音ならば直前または直後子音が有声
音のとき、ペナルティ−→＋１とする。(a) In at least one of J Rc and E Rc 1, if this consonant phoneme is a voiced sound, the immediately preceding or immediately following consonant is an unvoiced sound, and if this consonant phoneme is an unvoiced sound and the immediately preceding or immediately following consonant is a voiced sound, the penalty -→+1 shall be.

これは直前／直後子音の有声、無声によって該当子音の
有声無声が変わることがあるからである。This is because the voicedness or unvoicedness of the corresponding consonant may change depending on whether the consonant immediately before or after the consonant is voiced or unvoiced.

Example: in redisu, the J phoneme string of 「レディース」, d is voiced and s is unvoiced, so this s matches ERc1 = z of the third chunk of "ladies" with a penalty of +1.

(b) In all cases other than (a), the penalty is +3.

(iii) Pairs that fall under neither (i) nor (ii) are regarded as different consonants (dissimilarity = 100).

2. When JRc is among the ERci

Let JRc = ERcm.
(i) When the priority (F1) of ERcm is 0 or 1, the penalty is 0.
(ii) When the priority of ERcm is 2.
(iii) When the priority of ERcm is 3 or higher.

When the consonants of all chunks are regarded as matching in ■-2, the J phoneme string and the E phoneme string are matched on the following three items.

(a) Reading of the vowel part (Rv)
(b) Presence or absence of a contracted sound (yōon) (F2)
(c) Length of the last vowel phoneme of each chunk (F3, F4)

(a) Matching of the reading of the vowel part (Rv)

Starting from the first chunk, the reading of the vowel part of the J phoneme string (JRv) is compared with the reading candidates (ERvjk) of each phoneme j constituting the vowel part of the E phoneme string.

For the explanation, let
JRv = v1 v2 ... vn (each vi is a Japanese phoneme)
ERv = ERv1 ERv2 ... ERvm (each ERvj is an English phoneme)
where n is the number of phonemes constituting JRv and m is the number of phonemes constituting ERv.

One candidate is chosen arbitrarily from the reading candidates of each ERvj to form an ev sequence, and the phonemes of JRv are compared with this ev sequence in order, starting from v1. An omissible reading (a phoneme of JRv enclosed in parentheses, or an evjk equal to 0) is treated as absent if no corresponding reading is found in the other phoneme string. When a chunk of the E phoneme string ends with a consonant, the +v vowel of Table 4 is supplied for the comparison.

For each ev sequence, suppose that matching readings are found up to the dx-th phoneme of JRv; the largest dx is taken as d.

Example: in comparing 「キーウィ」 (kiui) with "key" (KX EE), JRv = iui (v1 = i, v2 = u, v3 = i, n = 3) and ERv = EE. Comparing JRv with ev1, v1 = ev11 gives d1 = 1; comparing JRv with ev2, v1 ≠ ev12 gives d2 = 0; and likewise dn = 0. Hence d = the largest dx = d1 = 1.

Proceeding in this way, each time a reading candidate of ERvj (call it evjk) that matches a reading of JRv is found, a penalty is obtained from the priority of evjk and Table 6 and added to the dissimilarity.

Table 6. Priority of the reading of the vowel part of the E phoneme string, and penalty

However, if, as a result of the adjustment in D5-■-(1), a candidate with priority 0 exists as another candidate for the phoneme concerned, the penalty is (the value in Table 6 + 1).

When there are several ev sequences for which d = n, the smallest penalty among them is taken as the penalty here.

At the end of matching, if either of the following conditions is satisfied, the corresponding penalty is added.

(i) When d < n
- if d = 0, a penalty of (n × 3) is added;
- if d > 0, a penalty of ((n - d) × 2) is added.

(ii) When d = n and some E phonemes were not used in the matching, a penalty of (number of unused E phonemes × 2) is added.

In the example above, 「キーウィ」 (kiui) and "key" (KX EE), v1 = ev11 = i and the priority of ev11 is 1, so the penalty for the reading is 0. However, since d = 1 and n = 3, condition (i) with d < n and d > 0 is satisfied, and a penalty of (3 - 1) × 2 = 4 is added. The total penalty is therefore 4.
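The end-of-matching rule can be illustrated with a minimal Python sketch. The function name `end_of_matching_penalty` and its parameter names are chosen here for illustration only and do not appear in the specification; the arithmetic follows conditions (i) and (ii) as stated.

```python
def end_of_matching_penalty(d: int, n: int, unused_e_phonemes: int = 0) -> int:
    """Penalty added when vowel-reading matching ends.

    d: largest number of JRv phonemes matched by any ev sequence
    n: number of phonemes constituting JRv
    unused_e_phonemes: E phonemes left unused when d == n
    """
    if not 0 <= d <= n:
        raise ValueError("d must satisfy 0 <= d <= n")
    if d < n:
        # Condition (i): part of JRv was never matched.
        return n * 3 if d == 0 else (n - d) * 2
    # Condition (ii): JRv fully matched, but E phonemes may remain.
    return unused_e_phonemes * 2


# The kiui / "key" example: d = 1, n = 3 gives (3 - 1) * 2.
print(end_of_matching_penalty(1, 3))
```

With d = 1 and n = 3 the sketch returns 4, the value derived in the example.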

(b) Matching of the presence or absence of a contracted sound (F2)

For corresponding chunks of the J phoneme string and the E phoneme string, a penalty is determined as shown in Table 7 according to the presence or absence of a contracted sound (F2).

Table 7. Penalty according to F2 (presence or absence of a contracted sound) of the J phoneme string and the E phoneme string

Definition of the length of a vowel phoneme of the J phoneme string (denoted K):
K ≡ F4 - F3

(2) Definition of the length of a vowel phoneme of the E phoneme string (denoted A):
A ≡ A1 + A2
A1: adjustment value according to phonological environment and spelling (D5-■-(2))
A2: value given by Table 8 from F3 and F4 of Table 4

Table 8. Length of a vowel phoneme of the E phoneme string (A2)

(3) Penalty calculation
The penalty is given by Table 9.

Table 9. Penalty in matching the length of vowel phonemes

However, since in Japanese kana notation ティ is sometimes written テ and ディ is sometimes written デ, no penalty is given to the combinations ti and te, and di and de.

■ The dissimilarity obtained in ■ is stored in the work area of the memory.

D7. Means for determining correspondence at the character notation level

■ The dissimilarity obtained in D6 ■ is taken as input.

■ An appropriate threshold is set according to the application. For example, with a threshold of 3, the correspondence is determined as follows:

If dissimilarity < 3, the two notation results are regarded as corresponding.
If dissimilarity = 3, the two notation results are regarded as possibly corresponding.
If dissimilarity > 3, the two notation results are regarded as not corresponding.
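The three-way decision of D7 can be written as a small Python sketch. The function name `judge_correspondence` and the returned labels are illustrative assumptions; the default threshold of 3 is the example value given in the text, and the specification leaves the threshold to the application.

```python
def judge_correspondence(dissimilarity: int, threshold: int = 3) -> str:
    """Map a phoneme-level dissimilarity to a notation-level judgment."""
    if dissimilarity < threshold:
        return "corresponds"
    if dissimilarity == threshold:
        return "possibly corresponds"
    return "does not correspond"


for score in (0, 2, 3, 100):
    print(score, judge_correspondence(score))
```

A dissimilarity of 100, the value assigned when consonants are regarded as different, always falls in the last class for any small threshold.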

■ The output is given, for example, as a display to the user.

As a concrete example, a proofreading system might display the message "Multiple notation results exist for the same word." See D17 and D18 for application examples.

D8. Concrete examples of determination (comparison of an English notation result and a kana notation result)

Example 1: the correspondence between the kana notation result 「ファジー」 and the English notation result "fuzzy" is determined.

1. 「ファジー」 is converted into the J phoneme string hazi by the Japanese phoneme string generation means of D2.

2. "hazi" is converted into the representation format of D4-■.

Phoneme   Reading (Rc/Rv)   F1   F2 (contracted)   F3 (geminate)   F4 (long)
h         Rc = h            ■    0
a         Rv = a            ■                      0               0
z         Rc = z            ■    0
i         Rv = i            ■                      0               6

3. "fuzzy" is converted into the E phoneme string FX UH ZX EE by the English phoneme string generation means.

4. Using Table 4, the English phoneme/Japanese phoneme correspondence table means of D4 converts the result into the representation format of D4-■.

Phoneme   Reading (Rc/Rv)   F1   F2 (contracted)   F3 (geminate)   F4 (long)
FX        Rc = h            ■    0
          Rc = b            ■    0
UH        Rv = a            ■                      2               1
          Rv = o            ■                      0               1
          Rv = u            ■                      1               0
          Rv = e            ■                      ?               ?
          Rv = ia           ■                      ?               7
ZX        Rc = z            ■    0
          Rc = s            ■    0
EE        Rv = i            ■                      1               3
          Rv = ia           ■                      0               7
          Rv = ie           ■                      0               2
          Rv = e            ■                      2               1

5. The conversion result obtained in 4 is modified as follows by the adjustment means of D5.

D5-■-(1): no applicable item.
D5-■-(2): the applicable vowels are UH and EE.
UH: A1 = 0, from D5-■-(2)-■(1)(ii).
EE: A1 = 0, from D5-■-(2)-■(1)(ii).

6. Using the representation of 「ファジー」 obtained in 2 and the representation of "fuzzy" obtained in 4 and 5, the dissimilarity is calculated as follows by the phoneme-level dissimilarity calculation means of D6.

(i) Division into chunks
「ファジー」 → (ha)(zi)
"fuzzy" → (FX UH)(ZX EE)
The numbers of chunks agree at 2, so the first matching is performed.

(ii) First matching
- Chunk 1: h and FX. Since JRc = ERc1 and the priority of ERc1 is 1, the penalty is 0.
- Chunk 2: z and ZX. Likewise, the penalty is 0.

Dissimilarity = 0 + 0 = 0, so the consonant parts are found to match.

Next, the second matching is performed.

(iii) Second matching
- Chunk 1: a and UH
(a) Matching of the reading (Rv): JRv = a. Since JRv = ERv11 and the priority of ERv11 is 1, the penalty is 0.
(b) Matching of the presence or absence of a contracted sound (F2): F2(ha) = 0 and F2(FX UH) = 0, so the penalty is 0.
(c) Matching of the length of the vowel phoneme (F3, F4):
K(a) = 0
A(UH: a) = A1 + A2 = 0 + (1 - 2) = -1
This satisfies the condition K = 0 and |A| ≤ 1 of Table 9, so the penalty is 0.

- Chunk 2: i and EE
(a) Matching of the reading (Rv): JRv = i. Since JRv = ERv11 and the priority of ERv11 is 1, the penalty is 0.

(b) Matching of the presence or absence of a contracted sound (F2): F2(zi) = 0 and F2(ZX EE) = 0, so the penalty is 0.
(c) Matching of the length of the vowel phoneme (F3, F4):
K(i) = 6
A(EE: i) = A1 + A2 = 0 + (3 - 1) = 2
This satisfies the condition K ≠ 0 and |K - A| ≤ 4 of Table 9, so the penalty is 0.

From the above, the dissimilarity = 0.

7. The means of D7 for determining correspondence at the character notation level determines that 「ファジー」 and "fuzzy" correspond.

Example 2: for the English notation result "cup", the determination is made when three kana notation results are input: 1. 「カップ」, 2. 「コップ」, and 3. 「カープ」.

1. The E phoneme string of "cup" is (KX UH)(PX) (parentheses denote chunks). From Table 4, the following conversion result is obtained.

Phoneme   Reading (Rc/Rv)   F1   F2 (contracted)   F3 (geminate)   F4 (long)
KX        Rc = k            ■    0
          Rc = g            ■    0
          Rc = c            ■    0
UH        Rv = a            ■                      2               1
          Rv = o            ■                      0               1
          Rv = u            ■    1                 1               0
          Rv = e            ■                      ?               ?
          Rv = ia           ■                      ?               ?
PX        Rc = p            ■    0

2. The conversion result of 1 is modified using the adjustment rules.

D5-■-(1): no applicable item.
D5-■-(2): the vowel phoneme concerned is UH; A1 = -1, from ■(1)(iii).

3. The dissimilarity between kana notation result 1, 「カップ」, and "cup" is obtained.

The J phoneme string is (ka)(pu) (parentheses denote chunks).

(i) The numbers of chunks are both 2, so the first matching is performed.

(ii) First matching
- Chunk 1: matching k and KX gives a penalty of 0.
- Chunk 2: matching p and PX gives a penalty of 0.

(iii) Second matching
- Chunk 1: a and UH
(a) Matching of the reading (Rv): JRv = ERv11 (UH: a), so the penalty is 0.
(b) Matching of the presence or absence of a contracted sound (F2): F2(ka) = F2(KX UH) = 0, so the penalty is 0.
(c) Matching of the length of the vowel phoneme (F3, F4):
K(a) = -6
A(UH: a) = A1 + A2 = -1 + (1 - 2) = -2
This satisfies the condition K ≠ 0 and |K - A| ≤ 4 of Table 9, so the penalty is 0.

- Chunk 2: the second chunk (PX) of the E phoneme string ends with a consonant, so the +v vowel from D4 ■ is supplied for the comparison.

The J vowel phoneme u is therefore compared with the supplied E vowel phoneme (+v) = u. Items (a), (b), and (c) all match, and the penalty is 0.

From the above, the dissimilarity between "cup" and 「カップ」 is 0.

4. The dissimilarity between kana notation result 2, 「コップ」, and "cup" is obtained.

The J phoneme string is (ko)(pu) (parentheses denote chunks).

(i) The numbers of chunks are both 2 and agree.

(ii) First matching: as in 3, the penalty is 0.

(iii) Second matching: compared with 3, only the vowel part (o) of chunk 1 differs, so only the comparison of "o" with "UH" is described.

(a) Matching of the reading (Rv): JRv = ERv12 (UH: o), and the priority of ERv12 is 2, so the penalty is +1.

(b) Matching of the presence or absence of a contracted sound (F2): F2(ko) = F2(KX UH) = 0, so the penalty is 0.

(c) Matching of the length of the vowel phoneme (F3, F4):
K(o) = -6
A(UH: o) = A1 + A2 = -1 + (1 - 0) = 0
This satisfies the condition K ≠ 0 and |K - A| = 6 of Table 9, so the penalty is +1.

From the above, the dissimilarity between "cup" and 「コップ」 is 1 + 1 = 2.

5. The dissimilarity between kana notation result 3, 「カープ」, and "cup" is obtained.

The J phoneme string is (ka)(pu) (parentheses denote chunks).

(i) The numbers of chunks are both 2 and agree.

(ii) First matching: as in 3, the penalty is 0.

(iii) Second matching: compared with 3, only the vowel part (a) of chunk 1 differs, so only the comparison of "a" with "UH" is described.

(a) Matching of the reading (Rv): JRv = ERv11 (UH: a), so the penalty is 0.
(b) Matching of the presence or absence of a contracted sound (F2): F2(ka) = F2(KX UH) = 0, so the penalty is 0.
(c) Matching of the length of the vowel phoneme (F3, F4):
K(a) = +6
A(UH: a) = A1 + A2 = -1 + (1 - 2) = -2
This falls under K ≠ 0 and |K - A| > 7 (…)(a) of Table 9, so the penalty is +3.

From the above, the dissimilarity between "cup" and 「カープ」 is 3.

6. When the correspondence between "cup" and the three kana notations above is determined using the threshold of D7, 「カップ」 and 「コップ」 correspond to "cup", and 「カープ」 possibly corresponds to "cup" (is similar to it).
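The vowel-length arithmetic used in Examples 1 and 2 can be checked with a short Python sketch. Two points are assumptions rather than statements of the specification: the body of Table 9 is not reproduced in this text, so `length_penalty` encodes only the four conditions quoted in the worked examples and returns `None` for anything else; and A2 is taken as F4 - F3 because that is the form of the terms (1 - 2), (3 - 1), and (1 - 0) in the examples, whereas the specification only says that A2 is given by Table 8. All function names are illustrative.

```python
from typing import Optional


def j_vowel_length(f3: int, f4: int) -> int:
    """K = F4 - F3, as defined for the J phoneme string."""
    return f4 - f3


def e_vowel_length(a1: int, f3: int, f4: int) -> int:
    """A = A1 + A2, with A2 assumed to be F4 - F3 (inferred from the examples)."""
    return a1 + (f4 - f3)


def length_penalty(k: int, a: int) -> Optional[int]:
    """Penalty for the Table 9 conditions quoted in the examples only."""
    gap = abs(k - a)
    if k == 0 and abs(a) <= 1:
        return 0
    if k != 0 and gap <= 4:
        return 0
    if k != 0 and gap == 6:
        return 1
    if k != 0 and gap > 7:
        return 3
    return None  # condition not attested in the worked examples


# Example 2, chunk 1, with the K values stated in the text.
for kana, k, e_f3, e_f4 in (("カップ", -6, 2, 1), ("コップ", -6, 0, 1), ("カープ", 6, 2, 1)):
    a = e_vowel_length(-1, e_f3, e_f4)
    print(kana, k, a, length_penalty(k, a))
```

The three rows reproduce the length penalties 0, +1, and +3 reported for 「カップ」, 「コップ」, and 「カープ」.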

D9. Combined use with a dictionary

For loanwords whose established notation yields a reading far removed from the original sound, for example "salad" and 「サラダ」, or "dollar" and 「ドル」, accuracy can be improved by also using a dictionary.

Such loanwords are strongly felt to be Japanese words already, and their number is limited, so preparing such a dictionary is easy.

D10. System for determining correspondence between kana notations

This section describes the determination of correspondence between katakana notation and hiragana notation, and between katakana notation and katakana notation.

Since hiragana and katakana correspond one to one, only the determination of correspondence between katakana notations is explained, using FIGS. 2 and 3.

FIG. 2 and FIG. 3 differ in the presence or absence of the related-reading correspondence means. This means is used when the two phoneme strings being compared differ in the presence or absence of a contracted sound or in the reading of a vowel phoneme (differences in geminate consonants and long vowels are ignored).

D11 to D15 below describe in detail each of the means shown in FIG. 2.

D11. Japanese phoneme string generation means

This is performed in the same way as D2.

D12. Related-reading correspondence means

■ Either one of the Japanese phoneme strings obtained in D11 is taken as input.

■ When a phoneme in the Japanese phoneme string of ■ matches X1 in Table 10 and satisfies the 〈condition〉, the related reading (X2) is associated with it.

Example: in 「レポート」 ((re)(pō)(to)), "e" falls under e → i (F1 = 2) 〈the word consists of three or more chunks, and the consonant following e is k, t, p, s, or h〉 of Table 10, so "i" is regarded as a related reading, and the conversion result is as follows.

■ The result obtained in ■ is stored in the work area of the memory.

Table 10. Related readings in Japanese phoneme strings

If x denotes a Japanese phoneme (string) other than j, its relation to R and Fi of the representation format described in D4-■ is as follows.

x (plain) → reading (R) = x, F2 (contracted) = 0, F3 (geminate) = 0, F4 (long) = 0
x (with the long-vowel mark) → reading (R) = x, F2 (contracted) = 0, F3 (geminate) = 0, F4 (long) = 6
x (with the geminate mark) → reading (R) = x, F2 (contracted) = 0, F3 (geminate) = 6, F4 (long) = 0

If X denotes any of these three forms of x:
X j → reading (R) = x, F2 (contracted) = 3, F3 = F3(X), F4 = F4(X)

D13. Phoneme-level dissimilarity calculation means

■ The Japanese phoneme string obtained in D11 (called the J phoneme string) and the Japanese phoneme string obtained in D12 ■ (called the J′ phoneme string) are taken as input.

■ The dissimilarity between the J phoneme string and the J′ phoneme string is calculated by the following procedure.

First, chunk-count matching is performed. The processing for (i) the case where the numbers of chunks differ by two or more and (ii) the case where they differ by one is the same as (i) and (ii) of D6-■-1.

(iii) When the numbers of chunks agree:

(1) First matching

This is performed in the same way as D6-■-2.

(2) Second matching

When the consonant parts of all chunks are regarded as matching in the first matching of (1), the J phoneme string and the J′ phoneme string are matched on the following three items. The dissimilarity is the sum of the penalties of the applicable items.

(a) Matching of the reading of the vowel part (Rv)

The matching method is the same as D6-■-3-(a), with J′Rv taking the place of ERv. For the penalty, Table 11 is used instead of Table 6.

Table 11. Reading of the vowel part of the J′ phoneme string

(b) Matching of the presence or absence of a contracted sound (F2)

Same as D6-■-3-(b).

(c) Matching of the length of the last vowel phoneme of each chunk (F3, F4)

Let K be the length of the last vowel phoneme of each chunk of the J phoneme string and K′ the length of the last vowel phoneme of each chunk of the J′ phoneme string (K and K′ are defined as (F4 - F3), as in D6-■-3-(c)). The penalty for each combination of K and K′ is determined as shown in Table 12.

Table 12. Matching of the length of vowel phonemes

■ The dissimilarity obtained in ■ is stored in the work area of the memory.

D14. Means for determining correspondence at the character notation level

This is performed in the same way as D7.

D15. Concrete example of correspondence determination (comparison between kana notation results)

The correspondence between the kana notation results 「レポート」 and 「リポート」 is determined.

1. The kana notation results above are converted into phoneme strings by the Japanese phoneme string generation means of D11.

J phoneme string of 「レポート」 = repōto
J′ phoneme string of 「リポート」 = ripōto

2. The J phoneme string (repōto) is converted into the representation format of D4-■.

Phoneme   Reading (Rc/Rv)   F1   F2 (contracted)   F3 (geminate)   F4 (long)
r         Rc = r            ■    0
e         Rv = e            ■                      0               0
p         Rc = p            ■    0
ō         Rv = o            ■                      0               6
t         Rc = t            ■    0
o         Rv = o            ■                      0               0

3. In the same way as 2, the J′ phoneme string (ripōto) is converted into the representation format of D4-■.

4. When the related-reading correspondence means of D12 is applied to the J′ phoneme string, the only applicable phoneme is "i". The following result is therefore obtained.

Phoneme   Reading (Rc/Rv)   F1   F2 (contracted)   F3 (geminate)   F4 (long)
i         i                 ■                      0               0
          e                 ■                      0               0

The other phonemes are the same as in the J phoneme string.

5. For the results obtained in 2 and 4, the dissimilarity is calculated by the phoneme-level dissimilarity calculation means of D13.

The chunks of the J phoneme string are (re)(pō)(to) and those of the J′ phoneme string are (ri)(pō)(to). Only the vowel phonemes of the first chunk, "e" and "i", differ, so only the matching of these two is described.

The J phoneme e matches the related reading e (priority = 2) of the J′ phoneme i, so from Table 11 the penalty is +2.

From the above, the dissimilarity between 「レポート」 and 「リポート」 is 2.

6. The means of D14 for determining correspondence at the character notation level determines that 「レポート」 and 「リポート」 correspond.

D16. Extension to other character notations

The invention has been described above for systems that determine correspondence between kana notations and between English notation and kana notation, but it can also be applied to similar systems for other notations, for example between French notation and kana notation. In that case, a French phoneme string generation means must be prepared in place of the English phoneme string generation means described above. Since algorithms for generating a French phonetic symbol string from French spelling are known, this preparation is easy.

Like kana, Hangul is a script based on pronunciation. By applying the idea of the invention as it is, a system for determining correspondence between English notation and Hangul notation can also be created.

D17. Application example ■: a function of a proofreading system in a Japanese word processor

An example in which the invention is applied to the proofreading system of a Japanese word processor to detect variation in loanword notation is described using FIG. 4.

Reference numerals in FIG. 4: description

1: The user enters Japanese text into the computer using the keyboard.

2: The system extracts katakana strings (katakana notation results) and alphabet strings (treated here as English notation results) from the entered text.

3: For all pairs formed from any two of the notation results obtained in 2, except alphabet string-alphabet string pairs, the dissimilarity is calculated according to 4 to 6.

4: Chunk-count matching is performed. The sum of the penalties is taken as the dissimilarity.

5: The first matching is performed only for pairs regarded as matching in 4. The penalties are added to the dissimilarity.

6: The second matching is performed only for pairs regarded as matching in 5. The penalties are added to the dissimilarity.

7: If the dissimilarity is less than or equal to a predetermined threshold, these notation results are determined to be notation variants of the same word.

8: For each pair of notation results determined in 7 to be notation variants, the user is warned through the display device, for example by changing the color of the displayed characters.

9: Following the warning, the user unifies the notation results if necessary.

10: The system writes the corrected text to, for example, disk D1 and saves it.
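Steps 2, 3, and 7 of the flow above can be sketched in Python as follows. This is only an illustration under stated assumptions: the regular expressions, the function names `candidate_pairs` and `notation_variants`, and the default threshold are not part of the specification, and the `dissimilarity` argument is a hypothetical stand-in for the chunk-count, first, and second matching of steps 4 to 6. Only half-width Latin letters are treated as alphabet strings here.

```python
import re
from itertools import combinations
from typing import Callable

# Katakana letters U+30A1..U+30FA plus the long-vowel mark U+30FC.
KATAKANA = re.compile(r"[\u30A1-\u30FA\u30FC]+")
ALPHABET = re.compile(r"[A-Za-z]+")


def candidate_pairs(text: str) -> list[tuple[str, str]]:
    """Step 2 and step 3: extract strings and pair them,
    skipping alphabet string-alphabet string pairs."""
    kana = sorted(set(KATAKANA.findall(text)))
    alpha = sorted(set(ALPHABET.findall(text)))
    items = [(w, "kana") for w in kana] + [(w, "alpha") for w in alpha]
    return [
        (a, b)
        for (a, kind_a), (b, kind_b) in combinations(items, 2)
        if not (kind_a == "alpha" and kind_b == "alpha")
    ]


def notation_variants(
    text: str,
    dissimilarity: Callable[[str, str], int],  # hypothetical steps 4 to 6
    threshold: int = 3,
) -> list[tuple[str, str]]:
    """Step 7: keep the pairs whose dissimilarity is within the threshold."""
    return [p for p in candidate_pairs(text) if dissimilarity(*p) <= threshold]


sample = "ファジー理論とfuzzy logic、ファジィ"
print(candidate_pairs(sample))
```

The pairs returned by `notation_variants` are the ones a proofreading system would highlight for the user in step 8.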

D18. Application example ■: information retrieval system

An example in which the invention is applied to a literature retrieval system is described using FIG. 5. The effect of applying the invention is obtained only when the input keyword is written in katakana or in English, so in what follows the keyword is assumed to be written in one of these notations.

Reference numerals in FIG. 5: description

11: The user enters the keyword of the literature to be retrieved into the computer from the keyboard (this is called I-KWD).

12: The system reads the keywords of each document from the literature database D2 (these are called P-KWDij; i: document number, j: keyword number).

13: The dissimilarity between P-KWDij and I-KWD is calculated.

14: If the dissimilarity is less than or equal to a predetermined threshold, the pair is regarded as corresponding.

15: The information on each document (i) that has a keyword (P-KWDij) regarded as corresponding is read from the literature database.

16: The information obtained in 15 is displayed on the display terminal.
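Steps 11 to 15 can be illustrated with the following Python sketch. The in-memory dictionary standing in for literature database D2, the function name `search_documents`, and the default threshold are assumptions made for the example; `dissimilarity` is again a hypothetical callable representing the calculation of step 13.

```python
from typing import Callable


def search_documents(
    input_keyword: str,                        # I-KWD
    database: dict[int, list[str]],            # document number i -> keywords P-KWDij
    dissimilarity: Callable[[str, str], int],  # hypothetical step 13
    threshold: int = 3,
) -> list[int]:
    """Return the numbers of documents having at least one keyword whose
    dissimilarity to the input keyword is within the threshold (step 14)."""
    hits = []
    for doc_id, keywords in database.items():
        if any(dissimilarity(k, input_keyword) <= threshold for k in keywords):
            hits.append(doc_id)
    return hits


# Toy data: only the fuzzy/ファジー pair is treated as similar.
def toy_dissimilarity(a: str, b: str) -> int:
    if a == b or {a, b} == {"fuzzy", "ファジー"}:
        return 0
    return 100


db = {1: ["fuzzy"], 2: ["cup"], 3: ["ファジー", "logic"]}
print(search_documents("ファジー", db, toy_dissimilarity))
```

Unlike an exact-match lookup, the sketch also returns document 1, whose only keyword is the English spelling.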

A conventional system accepted only a P-KWDij that exactly matched I-KWD. When keywords had notation variants, the only options were therefore to include all of the variants among the document keywords or to ask the user to enter a unified notation.

With this system, however, the dissimilarity can be calculated for variation between kana and English notations, so with the input keyword 「ファジ」, for example, the user can obtain documents having document keywords such as 「ファジー」 and "fuzzy".

Ｅ、効果本発明によれば、辞書を用いた従来の文字表記結果対応
関係判定システムと比較して。E. Effects According to the present invention, compared to a conventional character notation result correspondence determination system using a dictionary.

・あらかじめ外来語表記辞書を作る必要がな・新造語１
派生語、固有名詞の辞書への登録といった継続的更新の
必要がないという長所がある。・No need to create a foreign word notation dictionary in advance ・Newly coined word 1
It has the advantage that there is no need for continuous updates such as registration of derived words and proper nouns in the dictionary.

Furthermore, even when the present invention is applied only to determining the correspondence between kana notation results, it has the following advantages over a conventional system that determines only whether unified notations match or do not match:

・By setting an appropriate threshold, the range of what is judged to be a notation variation can be changed.
・Results can be displayed in descending order of strength of correspondence.
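The two advantages above, an adjustable threshold and ranked output, can be illustrated with a short sketch. It assumes Levenshtein distance as the degree of difference, which may differ from the measure used in the embodiments, and the function name `rank_matches` and the romanized phoneme strings are hypothetical.

```python
def edit_distance(a, b):
    """Levenshtein distance between two sequences (assumed difference measure)."""
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, 1):
        cur = [i]
        for j, y in enumerate(b, 1):
            cur.append(min(prev[j] + 1, cur[j - 1] + 1, prev[j - 1] + (x != y)))
        prev = cur
    return prev[-1]


def rank_matches(query, candidates, threshold):
    """Keep candidates whose difference from `query` is at most `threshold`
    and return (difference, candidate) pairs, closest first."""
    scored = [(edit_distance(query, c), c) for c in candidates]
    return sorted(pair for pair in scored if pair[0] <= threshold)


# Hypothetical phoneme strings for several kana spellings.
CANDIDATES = ["repooto", "ripooto", "repoto", "sapooto"]

exact_only = rank_matches("repooto", CANDIDATES, 0)  # conventional behavior
variants = rank_matches("repooto", CANDIDATES, 1)    # notation variants too
```

A threshold of 2 would also admit "sapooto", a different word, which shows why the threshold has to be chosen to suit the application.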

[BRIEF DESCRIPTION OF THE DRAWINGS]
Fig. 1 is a diagram showing an embodiment of a system, to which the present invention is applied, for determining the correspondence between English notation and kana notation.
Figs. 2 and 3 are diagrams showing embodiments of a system, to which the present invention is applied, for determining the correspondence between kana notations.
Fig. 4 is a diagram for explaining an example of applying the present invention to a proofreading system in a Japanese word processor.
Fig. 5 is a diagram for explaining an example of applying the present invention to an information retrieval system.

Applicant: IBM Japan, Ltd.
Agent: Patent Attorney Takashi Tonmiya (and one other)

[Fig. 1: kana notation result; English notation result]
[Fig. 2: katakana notation result; hiragana/katakana notation results; Japanese phoneme strings; degree of difference]
[Fig. 3: katakana notation result; hiragana/katakana notation results; Japanese phoneme strings; degree of difference: 0]

Claims

[Claims]

(1) A system for determining the correspondence between a notation result in a first character notation method and a notation result in a second character notation method, the system comprising:
(a) means for converting the notation result in the first character notation method into a phoneme string consisting of phonemes selected from a first phoneme group;
(b) means for converting the notation result in the second character notation method into a phoneme string consisting of phonemes selected from a second phoneme group;
(c) table means for generating, for each phoneme of the second phoneme group, one or more corresponding phonemes of the first phoneme group;
(d) means for comparing the phoneme string obtained by converting the notation result in the first character notation method using the means of (a) with one or more phoneme strings obtained by converting the notation result in the second character notation method using the means of (b) and (c), and calculating the degree of difference at the level of the first phoneme group; and
(e) means for determining, based on the degree of difference at the level of the first phoneme group, the correspondence between the notation result in the first character notation method and the notation result in the second character notation method.
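Elements (c) through (e) of claim 1 can be sketched as follows, assuming that elements (a) and (b) have already produced phoneme lists. The table `EN_TO_JA`, its entries, the function names, and the choice of Levenshtein distance and of a threshold of 1 are all hypothetical choices made for illustration; the claim itself does not fix any of them.

```python
from itertools import product

# Hypothetical table means (c): each phoneme of the second (English) group
# maps to one or more candidate phonemes of the first (Japanese) group.
EN_TO_JA = {
    "r": ["r"],
    "i": ["i", "e"],
    "p": ["p"],
    "o:": ["o:"],
    "t": ["t"],
}


def edit_distance(a, b):
    """Levenshtein distance between two phoneme sequences (assumed measure)."""
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, 1):
        cur = [i]
        for j, y in enumerate(b, 1):
            cur.append(min(prev[j] + 1, cur[j - 1] + 1, prev[j - 1] + (x != y)))
        prev = cur
    return prev[-1]


def candidate_strings(english_phonemes):
    """Expand an English phoneme list into every Japanese phoneme list
    that the table allows."""
    options = [EN_TO_JA[p] for p in english_phonemes]
    return [list(choice) for choice in product(*options)]


def degree_of_difference(japanese_phonemes, english_phonemes):
    """Element (d): smallest distance over all candidate strings."""
    return min(edit_distance(japanese_phonemes, c)
               for c in candidate_strings(english_phonemes))


def corresponds(japanese_phonemes, english_phonemes, threshold=1):
    """Element (e): binary decision against a hypothetical threshold."""
    return degree_of_difference(japanese_phonemes, english_phonemes) <= threshold
```

Taking the minimum over the candidates means that an English phoneme with several plausible Japanese renderings does not penalize a kana spelling that uses any one of them.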

(2) A system for determining the correspondence between a notation result in a first character notation method and a notation result in a second character notation method, the system comprising:
(a) means for converting the notation result in the first character notation method into a phoneme string consisting of phonemes selected from a group of phonemes;
(b) means for converting the notation result in the second character notation method into a phoneme string consisting of phonemes selected from the group of phonemes;
(c) table means for generating, for each phoneme of a subgroup that occupies at least part of the group, one or more phonemes of the group that are related to that phoneme;
(d) means for comparing the phoneme string obtained by converting the notation result in the first character notation method using the means of (a) with one or more phoneme strings obtained by converting the notation result in the second character notation method using the means of (b) and (c), and calculating the degree of difference at the phoneme level of the group; and
(e) means for determining, based on the degree of difference at the phoneme level of the group, the correspondence between the notation result in the first character notation method and the notation result in the second character notation method.

(3) A system for determining the correspondence between a first notation result and a second notation result in the same character notation method, the system comprising:
(a) means for converting a notation result in the character notation method into a phoneme string consisting of phonemes selected from a group of phonemes;
(b) table means for generating, for each phoneme of a subgroup that occupies at least part of the group, one or more phonemes of the group that are related to that phoneme;
(c) means for comparing the phoneme string obtained by converting the first notation result using the means of (a) with one or more phoneme strings obtained by converting the second notation result using the means of (a) and (b), and calculating the degree of difference at the phoneme level of the group; and
(d) means for determining, based on the degree of difference at the phoneme level of the group, the correspondence between the first notation result and the second notation result.

(4) A system for determining the correspondence between a notation result in a first character notation method and a notation result in a second character notation method, the system comprising:
(a) means for converting the notation result in the first character notation method into a phoneme string consisting of phonemes selected from a group of phonemes;
(b) means for converting the notation result in the second character notation method into a phoneme string consisting of phonemes selected from the group of phonemes;
(c) means for comparing the phoneme string obtained by converting the notation result in the first character notation method using the means of (a) with the phoneme string obtained by converting the notation result in the second character notation method using the means of (b), and calculating the degree of difference at the phoneme level of the group; and
(d) means for determining, based on the degree of difference at the phoneme level of the group, the correspondence between the notation result in the first character notation method and the notation result in the second character notation method.

(5) A system for determining the correspondence between notation results in the same character notation method, the system comprising:
(a) means for converting a notation result in the character notation method into a phoneme string consisting of phonemes selected from a group of phonemes;
(b) means for comparing the phoneme strings obtained by converting the notation results using the means of (a), and calculating the degree of difference at the phoneme level of the group; and
(c) means for determining, based on the degree of difference at the phoneme level of the group, the correspondence between the notation results in the character notation method.