JPH01229368A

JPH01229368A - Character processor

Info

Publication number: JPH01229368A
Application number: JP63056546A
Authority: JP
Inventors: Yuji Kobayashi; 雄二小林; Kazuyo Ikeda; 和世池田
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 1988-03-09
Filing date: 1988-03-09
Publication date: 1989-09-13
Anticipated expiration: 2013-05-25
Also published as: JP2756252B2

Abstract

PURPOSE:To prevent the variance of application for the use of the word pair of the same meaning, and to determine a first candidate by both the means of individual word example and meaning attribute example by giving a meaning attribute code classified in a meaning to an individual word. CONSTITUTION:Example information in a word dictionary DIC stores the word ID of the word of the companion of an example if the example type of it is an individual example, and is constituted of the example, etc. to store a meaning attribute to be the companion of the example if it is a meaning example. Then, the meaning attribute code classified in a meaning is given to the individual word, and as for the example applicable to all the words of the same meaning attribute, the example is made to be capable of being registered by the pair of the word and the meaning attribute or the pair of the meaning attribute and the meaning attribute. Thus, the variance of the application can be suppressed for the use of the word pair of the same meaning. Besides, for a peculiar word or an idiomatic phrase incapable of being classified in a meaning, the first candidate can be determined by both the means of the individual word example and the meaning attribute example by applying the individual word pair example.

Description

【発明の詳細な説明】〔ｌ卒業上の利用分野］本発明は、文字列の変換を行う文字処理装置における同
音語多義判定方式に関するものである。DETAILED DESCRIPTION OF THE INVENTION [Fields of Application in Graduation] The present invention relates to a homophone polysemy determination method in a character processing device that converts character strings.

（従来の技術）従来、文字列の変換を行なう文字処理装置として、例え
は日本語ワードプロセッサにおけるカナ漢字変換装近か
広く普及している。カナ漢字変換の結果、多数表われる
同音語候補に対して、個々の単語の品詞の持つ文法的特
性などから同音語の多義解消が行なわれてぎたが、単語
と単語が意味的に共起する同音語対の判定に対しては考
慮がなされていないため、誤変換となる場合が多かった
。そこで単語と単語との組、すなわち用例を格納した単
語対辞書にある組み合わせを第一候補決定に利用するこ
となどが行なわれてきた。(Prior Art) Conventionally, character processing devices for converting character strings have been widely used, such as kana-kanji conversion devices in Japanese word processors. As a result of Kana-Kanji conversion, many homophone candidates appear, and the ambiguity of the homophones has been resolved based on the grammatical characteristics of the parts of speech of individual words. Since no consideration was given to determining homophone pairs, erroneous conversions often occurred. Therefore, attempts have been made to use word-word pairs, that is, combinations in a word-pair dictionary that stores usage examples, to determine the first candidate.

（発明が解決しようとしている問題点〕しかしながら、
すべての個々の単語について、単語どうしの組み合わせ
を網羅することはその数が膨大であることから効率的で
はなく、現実的とは言えない。(The problem that the invention is trying to solve) However,
It is not efficient or realistic to cover all combinations of individual words because the number of combinations is enormous.

たまたま登録されている単語対に対しては、適正な変換
第１候補が得られても、登録外の単語対に対しては、当
然通用することができないため、同じ効果を得ることは
できない。Even if a proper conversion first candidate is obtained for a word pair that happens to be registered, it cannot be used for a word pair that is not registered, and the same effect cannot be obtained.

（問題点を解決するための手段（及び作用）〕本発明の
目的は、上記従来例の欠点を除去することにある。(Means for Solving the Problems (and Effects)) An object of the present invention is to eliminate the drawbacks of the above-mentioned conventional example.

すなわち、個々の単語に意味的に分類した意味属性コー
ドを与え、単語対の用例のうち、同じ意味属性を持つ単
語すべてについて当てはまる用例については、個々の単
語対ではなく、単語と意味属性、または意味属性と意味
属性の対で用例の登録を可能とすることにより、同じ意
味の単語対の使用に対しては、適用のばらつきを押える
ことができ、また、意味的に分類し切れない固有の単語
対や慣用句に対しては、個別の単語対用例を適用するこ
とにより、個別単語用例と意味属性用例の両方の手段に
よって、第一候補を決定することのできる文字処理装置
を提供することである。In other words, a semantic attribute code that is semantically classified is given to each word, and for examples of word pair usage that apply to all words with the same semantic attribute, the word and semantic attribute code is assigned instead of the individual word pair. By making it possible to register usage examples using pairs of semantic attributes and semantic attributes, it is possible to suppress variations in application when using pairs of words with the same meaning, and also to prevent unique usage that cannot be categorized semantically. To provide a character processing device capable of determining a first candidate for word pairs and idiomatic phrases by applying individual word pair usage examples and by means of both individual word usage examples and semantic attribute usage examples. It is.

その為に本発明は以下の手段を備える。To this end, the present invention includes the following means.

すなわち、変換対象の単語の読みを入力する入力手段、
個々の単語の意味分類をコート化した意味属性を、読み
・表記と関連つけて格納した単語辞書、個別血語どうし
の共起対と、個別単語と意味属性との共起対とを格納し
た用例辞書と、読みを文節に解析する文節認定手段と、
解析された文節対について成立する用例を検索する用例
検索手段、成立した用例を優先させて同音語多義におけ
る第一候補を決定することのできる変換手段を備えるこ
とである。In other words, an input means for inputting the pronunciation of the word to be converted;
A word dictionary that stores semantic attributes that encode the semantic classification of individual words in association with their pronunciations and spellings, as well as co-occurrence pairs between individual words and co-occurrence pairs between individual words and semantic attributes. A usage dictionary, a phrase recognition means for analyzing readings into phrases,
The object of the present invention is to include an example search means for searching for examples that hold true for the analyzed phrase pair, and a conversion means that can give priority to the examples that hold true and determine the first candidate in homophone polysemy.

これによって、例えばｒ“動物“を放す」という意味属
性用例が登録されていて、°゛動物の意味属性が与えら
れているすべての単語について上記意味属性用例を適用
した変換をすることができる。As a result, for example, the semantic attribute example ``release an animal'' is registered, and all words to which the semantic attribute ``animal'' is given can be converted by applying the above semantic attribute example.

（実施例）以下、図面を参照して本発明の詳細な説明する。(Example) Hereinafter, the present invention will be described in detail with reference to the drawings.

第１図は本発明の全体構成の一例である。FIG. 1 is an example of the overall configuration of the present invention.

図示の構成において、ＣＰＵは、マイクロプロセッサで
あり、文字処理のための演算、論理判断等を行ない、ア
ドレスバスＡＢ、コントロールバスＣＢ、データバスＤ
Ｂを介して、それらのバスに接続された各構成要素を制
御する。In the illustrated configuration, the CPU is a microprocessor that performs calculations, logical judgments, etc. for character processing, and uses an address bus AB, a control bus CB, and a data bus D.
B to control each component connected to those buses.

アドレスバスＡＢはマイクロプロセッサＣＰＵの制御の
対象とする構成要素を指示するアドレス信号を転送する
。コントロールバスＣＢはマイクロプロセッサＣＰＵの
制御の対象とする各構成要素のコントロール信号を転送
して印加する。データバスＤＢは各構成機器相互間のデ
ータの転送を行なう。Address bus AB transfers address signals indicating the components to be controlled by the microprocessor CPU. The control bus CB transfers and applies control signals for each component to be controlled by the microprocessor CPU. The data bus DB transfers data between each component device.

次にＲＯＭは、読出し専用の固定メモリであリアである
。Next, ROM is a read-only fixed memory.

また、ＲＡＭは、１ワード１６ビツトの構成の書込み可
能のランダムアクセスメモリであって、各構成要素から
の各種データの一時記憶に用いる。Further, the RAM is a writable random access memory having a configuration of 1 word and 16 bits, and is used for temporary storage of various data from each component.

ＴＢＬＩＦは文書バッファであり、キーボードＫＢより
入力された文書情報を蓄えるためのメモリである。TBLIF is a document buffer, which is a memory for storing document information input from the keyboard KB.

ＹＢＵＦはキーボードＫＢより入力された読みを格納す
る入力読みバッファ・メモリである。YBUF is an input reading buffer memory that stores readings input from the keyboard KB.

ＤＩＣはカナ漢字変換を行なうための単語辞書である。DIC is a word dictionary for performing kana-kanji conversion.

ＤＢＰ０ＯＬはＹＢＵＦの読みを文節に解析・変換した
情報を格納する同音語候補格納メモリである。DBP0OL is a homophone candidate storage memory that stores information obtained by analyzing and converting the pronunciation of YBUF into phrases.

ＹＪＰＯＯＬはＤＢＰ０ＯＬと連動して、同音語の候補
に適用されている用例の情報を格納するメモリである。YJPOOL is a memory that stores information on examples applied to homophone candidates in conjunction with DBP0OL.

ＬＲＮＤＡＴは個々の単語および用例の学習状態を格納
した学習データ格納メモリである。LRNDAT is a learning data storage memory that stores the learning status of individual words and examples.

ＦＺＴＢＬは付属１ｉｌｊをＤＩＣに格納されている結
合情報に対応させるための付属語列変換テーブルである
。FZTBL is an adjunct word string conversion table for making adjunct 1ilj correspond to combination information stored in DIC.

ＫＢはキーボードであって、アルファベットキー、ひら
がなキー、カタカナキー等の文字記号入カキ−１及び、
変換を指示する変換キーなどの各種のファンクションキ
ーを備えている。KB is a keyboard, which includes alphabet keys, hiragana keys, katakana keys, and other character/symbol keys-1;
It is equipped with various function keys such as a conversion key for instructing conversion.

図において、ＹＯＭ　Ｔは読みを入力するためのキー、
ＣＯＮは入力した読みを変換するための変換指示キー、
ＮＸ、Ｔは変換候補を変更して次候゛補に変換するため
の次候補変換指示キー、ＳＥＬは現在の同音語表示候補
に確定し同時にその候補表記を学習することを指示する
ための選択キーである。In the figure, YOM T is the key for inputting the reading;
CON is the conversion instruction key to convert the input reading,
NX and T are the next candidate conversion instruction keys to change the conversion candidate and convert it to the next candidate, and SEL is the selection key to confirm the current homophone display candidate and at the same time instruct to learn the candidate notation. That's the key.

ＤＩＳＫは定型文書を記憶するためのメモリで作成され
た文書の保管を行ない、保管された文書はキーボードの
指示により、必要な時呼び出される。The DISK stores documents created in a memory for storing standard documents, and the stored documents can be called up when necessary by instructions from the keyboard.

ＣＲはカーソルレジスタである。ＣＰＵにより、カーソ
ルレジスタの内容を読み書きできる。後述するＣＲＴコ
ントローラＣＲＴＣは、ここに蓄えられたアドレスに対
する表示装置ＣＲＴ上の位置にカーソルを表示する。CR is a cursor register. The CPU can read and write the contents of the cursor register. A CRT controller CRTC, which will be described later, displays a cursor at a position on the display device CRT corresponding to the address stored here.

ＤＢＵＦは表示用バッファメモリで、ＴＢＵＦに蓄えら
れた文書情報等のパターンを蓄える。DBUF is a display buffer memory that stores patterns such as document information stored in TBUF.

ＣＲＴＣはカーソルレジスタＣＲ及びバッファ［）ＢＬ
ＩＦに菩えられた内容を表示器ＣＲＴに表示する役割を
担う。CRTC is cursor register CR and buffer [)BL
It plays the role of displaying the contents read by the IF on the display device CRT.

また、ＣＲＴは陰極線管等を用いた表示装置であり、そ
の表示器ｆｆ１ｃＲＴにおけるドツト構成のパターンお
よびカーソルの表示をＣＲＴコントローラで制御する。Further, a CRT is a display device using a cathode ray tube or the like, and the display of a dot configuration pattern and a cursor on the display ff1cRT is controlled by a CRT controller.

さらに、ＣＧはキャラクタジェネレータであって、表示
装置ＣＲＴに表示する文字、記号のパターンを記憶する
ものである。Furthermore, CG is a character generator that stores patterns of characters and symbols to be displayed on the display device CRT.

かかる各構成要素からなる本発明文字処理装置において
は、キーボードＫＢからの各種の入力に応じて作動する
ものであって、キーボードＫＢからの入力が供給される
と、まず、インタラブド信号かマイクロプロセッサＣＰ
Ｕに送られ、そのマイクロプロセッサＣＰＵがＲＯＭ内
に記憶しである各種の制御信号を読出し、それらの制御
信号に従って、各種の制御が行なわれ第２図は、カナ漢
字変換に用いる単語辞書ＤＩＣの構成を示した図である
。The character processing device of the present invention, which is composed of each of these components, operates in response to various inputs from the keyboard KB, and when input from the keyboard KB is supplied, first, an interleaved signal or a microprocessor CP is output.
The microprocessor CPU reads various control signals stored in the ROM, and various controls are performed according to these control signals. It is a diagram showing the configuration.

辞書は表記変換に用いる単語情報と表記変換の際の第１
候補を決定するための用例情報とで構成されている。単
語情報はその見出し単語固有の単語ＩＤと、単語の読み
を格納する読み、単語の表記を格納する表記、単語の品
詞を格納する品詞、単語の意味属性を格納する悪法から
構成されている。単語ＩＤは先頭の単語を１とする連続
番号である。用例情報はその見出し単語の゛持つ用例情
報の個数、用例固有の用例ＩＤ、その用例が個別用例で
あるか意味用例であるかの用例タイプを記述したタイプ
、用例タイプが個別用例であれば用例の相手の単語の単
語ＩＤを格納し意味用例であれば用例の相手となる意味
属性を格納する用例、用例の相手の単語に付く付属語の
うち用例が成立する条件を満な・す代表的な付属語を格
納した結合情報、用例を逆転させても使用可能であるか
どうかの情報を格納する逆転から構成される。用例を一
つも持たない単語は、用例情報の個数に０が格納されて
いる。用例ＩＤは昨語ＩＤと同じく先頭の用例を１とす
る連続番号である。Dictionaries are used to store word information used in orthographic conversion and the first information used in orthographic conversion.
It consists of example information for determining candidates. The word information consists of a word ID unique to the header word, a pronunciation that stores the pronunciation of the word, a notation that stores the notation of the word, a part of speech that stores the part of speech of the word, and a code that stores the semantic attribute of the word. The word ID is a consecutive number starting with 1 for the first word. The example information includes the number of example information that the heading word has, the example ID unique to the example, the example type that indicates whether the example is an individual example or a semantic example, and the example type if the example type is an individual example. If it is a semantic example, it stores the semantic attribute of the other word of the example, and a representative word that satisfies the conditions for the example to be true among the attached words attached to the other word of the example. It consists of combination information that stores adjuncts, and reversal information that stores information about whether the example can be used even if it is reversed. For words that do not have any examples, 0 is stored in the number of example information. The example ID is a consecutive number with the first example being 1, just like the previous word ID.

図において例えは、「買う」という単語は３００という
連続番号で区別され、読みがｒかう」、表記がｒ買う」
、品詞がｒ動詞Ｊであるという単語情報を持つ。そして
ｒ酒を買う」と「喧嘩を買う」という２つの個別単語用
例を持つ。また、’ｕａという単語は５００という連続
番号で区別され、読みが「さけＪ、表記が「鮭」、品詞
がｒ名詞」であり、意味属性がｒ動物ｊである単語であ
る。用例個数に０が格納されているのでｒ鮭」を見出し
語とする用例は登録されていない。In the figure, the word ``buy'' is distinguished by consecutive numbers 300, and the reading is ``r-ka'' and the notation is ``r-buy''.
, has word information that the part of speech is r-verb J. It has two individual word examples: ``buy alcohol'' and ``buy a fight.'' Further, the word 'ua' is distinguished by a consecutive number of 500, has the pronunciation "sake J," the spelling "salmon," and the part of speech "r noun," and has the semantic attribute r animal j. Since 0 is stored in the number of examples, no example with "r salmon" as a headword is registered.

第３図は、同音語プールＤＢＰ０ＯＬおよび用例情報プ
ールＹＪＰＯＯＬの構成を示した図である。（ａ）は同
音語プールの全体構成である。同音語プールはカナ漢字
変換処理によって作成された各同音語情報から構成され
る。（ｂ）は同音語情報の構成を示した図である。同音
語情報は、その同音語を他の同音語と区別するための連
続番号である同音語ＩＤ、その同音語情報の使用メモリ
・サイズを表わすデータ長、格納されている同音語の候
補の個数を格納する候補個数、同音語の候補の情報を格
納する候補情報、および、同音語の中に格納されている
候補情報の中で、後述する優先候補の個数を格納する優
先個数から構成されている。このような同音語の構成に
おいて、候補情報の列の先頭の候補情報が変換結果とし
て出力すべぎ第１候補になるものとし、列の先頭から優
先個数によって示される個数の候補情報が、後述する優
先候補になるものとする。（Ｃ）は候補情報の構成を示
した図である。候補情報は、候補の表記、その単語ＩＤ
、文節末の活用形を格納する文節末活用、および、後述
の用例情報へのポインタを格納する用例情報ポインタか
ら構成される。FIG. 3 is a diagram showing the configuration of the homophone pool DBP0OL and the example information pool YJPOOL. (a) shows the overall configuration of the homophone pool. The homophone pool is composed of each homophone information created by the kana-kanji conversion process. (b) is a diagram showing the structure of homophone information. The homophone information includes the homophone ID, which is a sequential number to distinguish the homophone from other homophones, the data length representing the memory size used for the homophone information, and the number of stored homophone candidates. The number of candidates for storing homophone candidates, candidate information for storing information about homophone candidates, and the priority number for storing the number of priority candidates (described later) among the candidate information stored in the homophones. There is. In such a homonym configuration, the candidate information at the beginning of the candidate information string should be output as the conversion result and become the first candidate, and the number of candidate information indicated by the priority number from the beginning of the string is as described below. This will be the preferred candidate. (C) is a diagram showing the structure of candidate information. Candidate information includes the notation of the candidate and its word ID.
, a clause-final conjugation that stores the conjugation at the end of a clause, and an example information pointer that stores a pointer to example information to be described later.

候補情報に用例が適用されずに用例情報が存在しない場
合には、用例情報ポインタに無効値−１が格納される。If the example is not applied to the candidate information and no example information exists, an invalid value -1 is stored in the example information pointer.

（ｄ）は用例情報プールＹＪＰＯＯＬの全体構成である
。用例情報プールは、同音語の各候補に通用される用例
の情報を格納する用例情報から構成される。（ｅ）は用
例情報の構成を示した図である。用例情報は、用例の相
手となる同音語のＩＤを格納するペア同音語ＩＤ、適用
されている用例のＩＤを格納する用例ＩＤ、適用されて
いる用例のタイプ（個別用例または意味用例）を格納す
る用例タイプ、および、同じ候補情報に通用されている
別の用例情報へのポインタが格納さねている用例情報ポ
インタから構成される。このように、用例情報はリスト
構造となっている。次の用例情報が存在しない場合には
、用例情報ポインタには無効値−１が格納される。(d) shows the overall configuration of the example information pool YJPOOL. The example information pool is composed of example information that stores information on examples that are commonly used for each homophone candidate. (e) is a diagram showing the structure of example information. The example information stores the pair homophone ID that stores the ID of the homophone that is the partner of the example, the example ID that stores the ID of the applied example, and the type of applied example (individual example or semantic example). It consists of an example type, and an example information pointer that stores a pointer to another example information that is used for the same candidate information. In this way, the example information has a list structure. If the next example information does not exist, an invalid value -1 is stored in the example information pointer.

以上説明したような同音語プールおよび用例情報プール
の構成において、ある同音語のある候補に用例が適用さ
れている場合、その候補情報に用例情報を格納するとと
もに、用例の相手となる同音語の候補に対する候補情報
にも、同じ用例の情報を格納した用例情報を格納するも
のとする。In the configuration of the homophone pool and example information pool as explained above, when an example is applied to a certain candidate for a certain homophone, the example information is stored in the candidate information, and the example information is stored in the candidate information for the homophone. It is assumed that example information containing information on the same example is also stored as candidate information for the candidate.

第４図は、付属語列変換テーブルＦＺＴＢＬの構成を示
した図である。付属語列変換テーブルは文節を構成する
付属語列を単語辞書ＤＩＣが持つ用例情報中の結合情報
に対応させるための対応変換表である。付属語読みはそ
の付属語列の読みを表わし、対応結合情報はその付属語
列が晋換可能な辞書の用例情報中の結合情報を表わす。FIG. 4 is a diagram showing the structure of the adjunct word string conversion table FZTBL. The adjunct word string conversion table is a correspondence conversion table for making adjunct word strings constituting a clause correspond to combination information in the example information held by the word dictionary DIC. The adjunct word reading represents the pronunciation of the adjunct word string, and the corresponding combination information represents the combination information in the example information of the dictionary in which the adjunct word string can be converted.

例えは、付属語「こそ」は結合情報「が１またはｒをＪ
に、付属語列「さえもが−は結合情報ｒが」に、それぞ
れ置換して結合判定することができる。For example, the adjunct ``koso'' uses the conjunction information ``ga1 or r to J
The combination can be determined by replacing the adjunct word string ``saemonoga-'' with the combination information rga.

第５図は、学習データＬＲＮＤＡＴの構成を示した図で
ある。（ａ）は学習データの全体構成である。学習デー
タは単語学習データと用例学習データの２つから構成さ
れる。単語学習へ・のポインタは学習データ内の単語学
習データへのインデックスを格納し、用例学習へのポイ
ンタは学習データ内の用例学習データ列先頭へのインデ
ックスを格納する。（ｂ・）は単語学習データ列の構成
を示した図である。単語学習データ列は使用サイズと各
単語学習データから構成される。単語学習データ使用サ
イズは単語学習データ列の使用サイズを格納する。単語
学習データは各単語のＩＤに対応しており、学習されて
いる単語には１、学習されていない４語に対してはＯが
格納される。（Ｃ）は用例学習データ列の構成を示した
図である。用例学習データ列は使用サイズと各用例学習
データから構成される。用例学習データ使用サイズは用
例学習データ列の使用サイズを格納する。用例学習デー
タは各用例ＩＤに対応しており、抑制されている使用禁
止用例には１、抑制されていない使用か脳用例に対して
は０が格納される。FIG. 5 is a diagram showing the structure of learning data LRNDAT. (a) shows the overall structure of the learning data. The learning data consists of two parts: word learning data and example learning data. The pointer to word learning stores the index to the word learning data within the learning data, and the pointer to example learning stores the index to the beginning of the example learning data string within the learning data. (b) is a diagram showing the structure of a word learning data string. The word learning data string is composed of the usage size and each word learning data. The word learning data usage size stores the usage size of the word learning data string. The word learning data corresponds to the ID of each word, and 1 is stored for words that have been learned, and O is stored for 4 words that have not been learned. (C) is a diagram showing the structure of an example learning data string. The example learning data string is composed of the usage size and each example learning data. The example learning data usage size stores the usage size of the example learning data string. The example learning data corresponds to each example ID, and 1 is stored for suppressed prohibited usage examples, and 0 is stored for unsuppressed use or brain usage examples.

上述の実施例の作動をフローに従って説明する。The operation of the above embodiment will be explained according to the flow.

第６°図は本発明文字処理装置の動作を水等フローチャ
ートである。FIG. 6 is a flowchart illustrating the operation of the character processing apparatus of the present invention.

５６−１においてキーボードよりキーが押下され、割り
込みが発生するのを待つ。キーが入力されると３６−２
においてキー判別し、キーの種類に応じて５６−３．５
６−４．５６−５．５６−６．５６−７のいずれかのス
テップに分岐する。At step 56-1, a key is pressed on the keyboard and a wait is made for an interrupt to occur. 36-2 when the key is input
56-3.5 depending on the type of key.
Branch to any of steps 6-4.56-5.56-6.56-7.

５６−３は読み人カキ−ＹＯＭ　Ｉが押下されたときの
処理であり、押下された読みのコードを入力読みバッフ
ァ・メモリＹＢＵＦに蓄える。56-3 is a process when the reader key YOM I is pressed, and the code of the pressed reading is stored in the input reading buffer memory YBUF.

５６−４は変換キーＣＯＮが押下されたときの処理であ
り、５６−３で入力されてＹＢＵＦに蓄えられている、
カナ漢字変換の対象となる文字列を漢字に変換し、出力
バッファに出力する。漢字に変換する際に、単語辞書Ｄ
ＩＧ中に存在する用例情報を用いて、同音語の第１候補
を決定する。56-4 is the process when the conversion key CON is pressed, and is input in 56-3 and stored in YBUF.
Converts the character string subject to Kana-Kanji conversion into Kanji and outputs it to the output buffer. When converting to kanji, use the word dictionary D
The first candidate for the homophone is determined using the usage example information present in the IG.

５６−５は次候補変換キーＮＸＴが押下され。At 56-5, the next candidate conversion key NXT is pressed.

たときの処理であり、５６−４によって出力された出力
バッファ中の同音語の別の候補を表示する。56-4, another homophone candidate in the output buffer output by 56-4 is displayed.

５６−６は選択キーＳＥＬが押下されたときの処理であ
り、画面に表示されている出力バッファ中の同音語を確
定し、確定された文字列を文書中に出力する。さらに、
選択された単語を学習する処理を行なう。56-6 is a process performed when the selection key SEL is pressed, in which the homophones in the output buffer displayed on the screen are determined, and the determined character strings are output into the document. moreover,
Performs processing to learn the selected words.

５６−７は、ＹＯＭＩ、ＣＯＮ、ＮＸＴ％ＳＥＬ以外の
キー（例えば、カーソル移動キーなどの文書編集で用い
るキーなど）が押下された場合の処理であり、同種の文
字処理装置において一般に行なわれている処理であり、
公知であるので特に記述しない。56-7 is a process performed when a key other than YOMI, CON, or NXT%SEL (for example, a key used for document editing such as a cursor movement key) is pressed, and is generally performed in character processing devices of the same type. It is a process that
Since it is publicly known, it will not be specifically described.

５６−８は上記の処理の結果、変更された部分を表示す
る表示処理である。文書中のデータ１文字を読んではパ
ターンに展開し、表示バッファに出力するという通常広
く行なわれている処理である。56-8 is a display process for displaying the changed portion as a result of the above process. This is a commonly used process in which a single character of data in a document is read, developed into a pattern, and output to a display buffer.

第７図は、５６−４の処理を詳細化したフローチャート
である。FIG. 7 is a detailed flowchart of the process 56-4.

５７−１“は、文節単位に分ち書きされて入力されたカ
ナ漢字変換の対象となる文字列を解析し、カナ漢字変換
の出力の候補を同音語プールに出力する処理である。分
ち書きされた単位に文字列を順々に取り出し、単語辞書
を検索して解析を行ない、文節として認定される候補の
みを同音語プールに出力する処理であって、同種の文字
処理装置において一般に行なわれている処理であり、公
知であるので特に記述しない。57-1" is a process that analyzes a character string to be converted into kana-kanji that is input in segments and outputs candidates for output of kana-kanji conversion to the homophone pool. A process that sequentially extracts character strings in written units, searches a word dictionary for analysis, and outputs only candidates that are recognized as phrases to a homophone pool, and is generally performed in character processing devices of the same type. Since this is a well-known process, it will not be specifically described.

５７−２は５７−１において同音語プールに出力された
解析結果に対して、単語辞書中に格納されている用例の
パターンが存在するかどうかをチエツクし、用例のパタ
ーンが存在すれば、その用例の対象となる同音語の候補
を優先候補としてピックアップする。57-2 checks whether there is an example pattern stored in the word dictionary based on the analysis result output to the homophone pool in 57-1, and if the example pattern exists, it is Homophone candidates to be used as examples are picked up as priority candidates.

５７−３は５７−２でピックアップされた優先候補や、
単語学習されている候補の中から、カナ漢字変換の第１
候補を決定する。57-3 is the priority candidate picked up by 57-2,
From the candidates that have been learned, select the first one for kana-kanji conversion.
Decide on candidates.

５７−４は、出力バッファに格納されたカナ漢字変換の
出力を表示する処理であり、同種の文字処理装置におい
て一般に行なわれている処理であり、公知であるので特
に記述しない。57-4 is a process for displaying the output of the kana-kanji conversion stored in the output buffer, and is a process commonly performed in character processing devices of the same type and is well known, so it will not be described in particular.

第８図は５７−２のｌＡ理を詳細化したフローチャート
である。FIG. 8 is a detailed flowchart of the IA process of 57-2.

優先候補抽出の対象となる同音語を先頭から順次、取り
出してきて、対象同音語がなくなるまですべての同音語
に対して、以下の処理を行なう。Homophones to be extracted as priority candidates are sequentially extracted from the beginning, and the following process is performed on all homophones until there are no more target homophones.

５８−１で同音語中の候補情報を取り出してくる。At step 58-1, candidate information in the homophone is extracted.

５８−２ですべての候補情報について処理したかどうか
を判定し、すべて処理済であるならば優先候補抽出を終
了する。次の候補情報か取り出せたならば、５８−３で
候補情報に用例情報が存在するかどうかを判定する。用
例情報が存在するかどうかは、候補情報中の用例情報ポ
インタによって判定できる。すなわち、用例情報ポイン
タの値が“−１”であるならば用例情報は存在しない。At 58-2, it is determined whether all candidate information has been processed, and if all candidate information has been processed, priority candidate extraction is ended. If the next candidate information is retrieved, it is determined in 58-3 whether or not the example information exists in the candidate information. Whether or not example information exists can be determined by the example information pointer in the candidate information. That is, if the value of the example information pointer is "-1", no example information exists.

そうでなければ用例情報ポインタの示す領域に用例情報
が存在する。用例情報が存在しないならば優先候補抽出
処理を終了する。Otherwise, example information exists in the area indicated by the example information pointer. If the example information does not exist, the priority candidate extraction process ends.

用例情報が存在する場合、５８−４で逆方向用例情報の
抽出を行ない、５８−５で正方向用例情報の抽出を行な
う。If example information exists, backward example information is extracted at 58-4, and forward example information is extracted at 58-5.

５８−５の処理を終了したならば、次の候補情報を処理
するため５８−１ヘルーブする。When the processing in 58-5 is completed, the processing in 58-1 is performed to process the next candidate information.

第９図は、５８−４の処理を詳細化したフローチャート
である。FIG. 9 is a detailed flowchart of the process 58-4.

５９−１において、候補情報に格納されている単語ＩＤ
によって、その単語に存在する用例の情報を、一つずつ
辞書の中から取ってくる。In 59-1, the word ID stored in the candidate information
information on the examples that exist for the word is retrieved one by one from the dictionary.

５９−２において、５９−１で候補情報の単語に対して
辞書中に用例が存在するかどうか、辞書中の用例をすべ
て取り終えたかどうかを判定する。用例が存在していれ
ば５９−３へ進み、用例が存在していないか、辞書中の
用例をすべて取り終えていれば逆方向用例抽出処理を終
えてリターンする。In 59-2, it is determined in 59-1 whether an example exists in the dictionary for the word of the candidate information and whether all examples in the dictionary have been taken. If the example exists, the process advances to 59-3; if the example does not exist or all the examples in the dictionary have been taken, the backward example extraction process is completed and the process returns.

５９−３において、取り出されてきた用例を逆転させて
適用させることが可能かどうかをチエツクする。その情
報は辞書中に格納されている。逆転可能であれは５９−
４へ進み、逆転できなければ５９−１へ進む。At step 59-3, it is checked whether the retrieved example can be reversed and applied. That information is stored in a dictionary. If it can be reversed then it is 59-
Go to 4, and if you can't turn it around, go to 59-1.

５９−４において、取り出された用例を逆転して適用し
た場合に、入力された読みに対して成立可能かどうかの
判定を行なう。成立可能であれば５９−５へ進み、成立
可能でなければ５９−１へ進む。In step 59-4, it is determined whether or not the input reading can be established when the retrieved example is reversed and applied. If it can be established, the process proceeds to 59-5, and if it cannot be established, the process proceeds to 59-1.

５９−５において、成立可能な用例の情報を用例情報に
格納する。すなわち、用例情報プールに用例の情報を用
例情報として格納し、対象となっている同音語の候補情
報に用例情報が存在しなければ、用例情報ポインタに新
たに作成された用例情報へのポインタを格納する。候補
情報に用例情報が存在すれば、用例情報のリストの末尾
に新たに作成された用例情報を付は加える。また、対象
となっている同音語の候補情報たけでなく、用例の相手
となる同音語の候補情報にも同じようにして用例情報を
格納する。In step 59-5, information on valid examples is stored in example information. In other words, example information is stored as example information in the example information pool, and if no example information exists in the target homophone candidate information, a pointer to the newly created example information is set in the example information pointer. Store. If example information exists in the candidate information, the newly created example information is added to the end of the example information list. In addition, example information is stored not only for the target homophone, but also for the homophone that is the counterpart of the example.

第１０図は、５９−４の処理を詳細化したフローチャー
トである。FIG. 10 is a detailed flowchart of the process 59-4.

５ＩＯ−１において、対象となっている文節に後続する
文節が存在するかどうかチエツクする。後続する文節が
存在すればＳ　１０−２へ進み、存在しなければ５ＩＯ
−７へ進む。5IO-1, it is checked whether there is a clause following the target clause. If the following clause exists, proceed to S10-2, otherwise proceed to 5IO
-Go to 7.

Ｓ　Ｉ　Ｏ−２において、対象となっている文節の末尾
の活用形が連体形であるかどうかをチエツクする。連体
形であれは５ＩＯ−３へ進み、連体形てなければ５ＩＯ−７へ進む。In SIO-2, it is checked whether the conjugated form at the end of the target clause is an adnominal form. If it is an adjoint form, proceed to 5IO-3; if not, proceed to 5IO-7.

Ｓ　１０−３において、後続する文節に対する同音語か
ら候？ｉｉ情報を一つずつ取ってくる。In S 10-3, from the homophone to the following clause? ii. Get the information one by one.

５ＩＯ−４において、Ｓ　１０−３で候補情報が取って
これたかどうかを判定する。候補情報か取ってこねれば
５１０−５へ進み、すへての候補情報を取り終えて、候
補情報が取ってこれなけれは５ＩＯ−７へ進む。In 5IO-4, it is determined in S10-3 whether candidate information has been retrieved. If candidate information has been retrieved, the process advances to 510-5, and if all candidate information has been retrieved and no candidate information has been retrieved, the process advances to 5IO-7.

５ＩＯ−５において、３１０−３で得た候補情報の単語
が用例の相手となりつるかどうかを判定する。すなわち
、用例のタイプが個別用例であれば、辞書中の用例情報
の用例の項目に格納されているＪｎ語ＩＤと３１０−３
で得た候補情報の単３ｉ　１　Ｄが一致していれば、用
例の相手となるものとし、一致しなければ用例の相手と
ならないものとする。用例のタイプが意味用例であれば
、辞書中の用例情報の用例の項目に格納されＣいる意味
属性と、Ｓ　１０−３で得た候補情報の単語ＩＤから得
られる意味属性とが一致していれば、用例の相手となる
ものとし、−致しなけれは用例の相手とならないものと
する。用例の相手になると判定されればＳ　１０−６へ
進み、用例の相手にならないと判定されれば５１０−３
へ進む。At 5IO-5, it is determined whether the word of the candidate information obtained at 310-3 can be used as an example. That is, if the type of example is an individual example, the Jn word ID stored in the example item of the example information in the dictionary and 310-3
If the AA 3i 1 D of the candidate information obtained in step 2 matches, it will be used as the partner of the example, and if it does not match, it will not be the partner of the example. If the type of example is a semantic example, the semantic attribute stored in the example item of the example information in the dictionary and the semantic attribute obtained from the word ID of the candidate information obtained in S10-3 match. If it does, it will be the target of the example, and if it does not, it will not be the target of the example. If it is determined that it will be the partner of the example, it will proceed to S10-6, and if it is determined that it will not be the partner of the example, it will proceed to 510-3.
Proceed to.

Ｓ　Ｉ　Ｏ−６において、用例か成立するというリター
ン・コードを設定する。In SIO-6, a return code indicating that the example is true is set.

Ｓ　Ｉ　Ｏ−７において、用例が成立しないというリタ
ーン・コートを設定する。In SIO-7, set a return code that the example does not hold.

第１１図は、５８−５の処理を詳細化したフローチャー
トである。FIG. 11 is a detailed flowchart of the process 58-5.

５ｌｌ−１において、候補情報に格納されている単語Ｉ
Ｄによって、その単語に存在する用例の情報を、一つず
つ辞書の中から取ってくる。In 5ll-1, the word I stored in the candidate information
Using D, information on the examples of the word is retrieved from the dictionary one by one.

５ｌｌ−２において、ｓｔ　ｔ−ｔで候補情報のＡ１語
に対して辞書中に用例が存在するかどうか、辞書中の用
例をすべて取り終えたかどうかを判定する。用例が存在
していれは５ｌｌ−３へ進み、用例か存在していないか
、辞書中の用例をすべて取り終えていれば正方向用例抽
出処理を終えてリターンする。In step 5ll-2, it is determined in st t-t whether an example exists in the dictionary for word A1 in the candidate information and whether all examples in the dictionary have been taken. If the example exists, the process advances to 5ll-3, and if it is determined whether the example exists or not, and if all the examples in the dictionary have been taken, the forward example extraction process is completed and the process returns.

５１１−３において、取り出された用例を正方向に通用
した場合に、入力された読みに対して成立可能かどうか
の判定を行なう。成立可能であれば５ｌｌ−４へ進み、
成立可能でなければＳｌ　１−１へ進む。In step 511-3, it is determined whether or not the input reading can be established if the extracted example is passed in the forward direction. If it can be established, proceed to 5ll-4,
If it cannot be established, proceed to Sl 1-1.

５ｌｌ−４において、成立可能な用例の情報を用例情報
に格納する。すなわち、用例情報プールに用例の情報を
用例情報として格納し、対象となっている同音語の候補
情報に用例情報が存在しなければ、用例情報ポインタに
新たに作成された用例情報へのポインタを格納する。候
補情報に用例情報が存在すれば、用例情報のリストの末
尾に新たに作成された用例情報を付は加える。また、対
象となっている同音語の候補情報だけでなく、用例の相
手となる同音語の候補情報にも同しようにして用例情報
を格納する。In 5ll-4, information on valid examples is stored in example information. In other words, example information is stored as example information in the example information pool, and if no example information exists in the target homophone candidate information, a pointer to the newly created example information is set in the example information pointer. Store. If example information exists in the candidate information, the newly created example information is added to the end of the example information list. In addition, example information is stored not only for the target homophone, but also for the target homophone.

第１２図は、５ｌｌ−３の処理を詳細化したフローチャ
ートである。FIG. 12 is a detailed flowchart of the processing of 5ll-3.

Ｓｌ２−１において、対象となっている文節の前の文節
が存在するかどうかチエツクする。前の文節が存在すれは５１２−２へ進み
、存在しなければＳ　１２−８へ進む。At Sl2-1, it is checked whether there is a clause before the target clause. If the previous clause exists, the process advances to 512-2; otherwise, the process advances to S12-8.

Ｓ　＋　２−２において、前の文節に対する同音語から
候補情報を一つずつ取ってくる。In S+2-2, candidate information is obtained one by one from the homophones for the previous phrase.

Ｓ　Ｉ　２−３において、５１２−２で候補情報が取っ
てこれたかどうかを判定する。候補情報か取ってこねれ
ば５１２−４へ進み、すべての候補情報を取り終えて、
候補情報が取ってこれなければＳ　１２−８へ進む。In SI 2-3, it is determined in 512-2 whether candidate information has been retrieved. Once the candidate information has been retrieved, proceed to 512-4, and after all candidate information has been retrieved,
If candidate information is not retrieved, the process advances to S12-8.

５１２−４において、５１２−２で得た候補情報の単語
が用例の相手となりつるかどうかを判定する。すなわち
、用例のタイプが個別用例であれは、辞書中の用例情報
の用例の項目に格納されている単語ＩＤと３１２−２で
得た候補情報の単語ＩＤが一致していれは、用例の相手
となるものとし、一致しなければ用例の相手とならない
ものとする。用例のタイプが意味用例であれば、辞書中
の用例情報の用例の項目に格納されている意味属性と、
Ｓ　１２−２で得た候補情報の単語ＩＤから得られる意
味属性とが一致していれは、用例の相手となるものとし
、−致しなければ用例の相手とならないものとする。用
例の相手になると判定されれは５１２−５へ進み、用例
の相手にならないと判定されれば５１２−２へ進む。At 512-4, it is determined whether the word of the candidate information obtained at 512-2 can be used as an example. In other words, if the type of example is an individual example, if the word ID stored in the example item of the example information in the dictionary matches the word ID of the candidate information obtained in step 312-2, then If they do not match, they will not be used in the example. If the example type is a semantic example, the semantic attribute stored in the example item of the example information in the dictionary,
If the semantic attributes obtained from the word ID of the candidate information obtained in S12-2 match, the word will be used as an example, and if they do not match, it will not be used as an example. If it is determined that the user will be the partner of the example, the process advances to 512-5, and if it is determined that the user is not the partner of the example, the process advances to 512-2.

５１２−５において、５１２−２で得た候補情報の自立
部具外の残り読み（付属語列）を付属語列変換テーブル
によって、結合情報に変換する。In 512-5, the remaining pronunciations (adjunct word strings) other than independent parts of the candidate information obtained in 512-2 are converted into combined information using the adjunct word string conversion table.

５１２−６において、前の文節の同音語の候補か、用例
の結合条件を（１：４たしているかどうかをｔ’ｌＪ定
する。すなわら、Ｓ　Ｉ　２−５において、付属語列変
換テーブルによって結合情報に変換できなけれは、結合
条件を満たしていないものとする。また、５１２−５で
得た結合情報と、辞書中の用例情報の結合情報との間に
、一致する情報があれは結合条件を満たしているものと
し、一致する情報かなければ結合条件を満たしていない
ものとする。結合条件を満たしていれはＳ　Ｉ　２−７
へ進み、結合条件を満たしていなりれは５１２−２へ進
む。In 512-6, determine whether the combination condition of the homophone candidate of the previous clause or the example is (1:4).In other words, in S I 2-5, the adjunct word string If it cannot be converted into combined information using the conversion table, it is assumed that the combined condition is not satisfied.Also, if there is matching information between the combined information obtained in 512-5 and the combined information of the example information in the dictionary, It is assumed that it satisfies the binding condition, and if there is no matching information, it is assumed that the binding condition is not satisfied.If the binding condition is satisfied, then S I 2-7
If the connection condition is satisfied, the process proceeds to 512-2.

Ｓ　Ｉ　Ｏ−７において、用例が成立するというリター
ン・コートを設定する。In SIO-7, a return code is set that the example is true.

５１２−８において、用例が成立しないというリターン
・コートを設定する。At 512-8, a return code is set that the example does not hold.

第１３図は５７−３の処理を詳細化したフローチャート
である。FIG. 13 is a detailed flowchart of the process 57-3.

第１候補決定を行なわなければならない同音語を先頭か
ら順次、取り出してきて、対象同音語がなくなるまです
へての同音語に対して、以下の処理を行なう。The homophones for which the first candidate determination must be made are sequentially extracted from the beginning, and the following process is performed on the remaining homophones until there are no target homophones left.

５１３−１で同音語内の候補情報のうちから優先候補を
ピックアップする。優先候補のピックアップは同音語が
持つ候補情報のうち、用例情報ポインタか“−１”でな
いものの個数を優先個数にセットし、優先個数にカウン
トされた候補情報を順次先頭に並び換えていく。すべて
の候補情報について、優先候補のピックアップが終了し
たならば、５１３−２に進み、抑制された用例を優先候
補から除外する処理を行なう。候補情報の先頭から優先
個数分の候補情報について、以下の処理を行なうことに
より、抑制用例の優先除外を行なう。まず候補情報内の
用例情報ポインタをたどり、用例情報内の用例ＩＤを取
り出す、取り出した用例ＩＤの示す用例学習データか抑
制状、聾にあるかどうかをチエツクする。用例情報ポイ
ンタが終端に達するまで、すなわちポインタ値か”−ｔ
　”を示すまでの用１列ＩＤについて１中制されている
かどうかを調へた結果、候補情報の用例情報ポインタか
示ずすへての用例＋Ｒ報か抑制されていた場合のみ、そ
の候補情報を優先候補から除外する。すなわち、優先個
数を１減らし、候補情報を優先候補末尾の直後に並ひ換
える。In step 513-1, priority candidates are picked up from candidate information within the homophone. To pick up the priority candidates, among the candidate information of the homophone, the number of example information pointers or those that are not "-1" is set as the priority number, and the candidate information counted in the priority number is sequentially rearranged to the top. When the priority candidates have been picked up for all candidate information, the process proceeds to 513-2, where a process is performed to exclude the suppressed usage examples from the priority candidates. By performing the following process on the priority number of candidate information starting from the beginning of the candidate information, suppression examples are preferentially excluded. First, the example information pointer in the candidate information is followed, the example ID in the example information is extracted, and it is checked whether the example learning data indicated by the extracted example ID is inhibited or deaf. Until the example information pointer reaches the end, that is, the pointer value "-t
As a result of checking whether the 1-column ID for the 1-column ID until `` is shown is suppressed, only if the example information pointer of the candidate information or the example + R report of the 1-column ID of the candidate information is suppressed, the candidate information is excluded from the priority candidates.In other words, the number of priority items is decreased by 1, and the candidate information is rearranged immediately after the end of the priority candidates.

Ｓ　１３−３で３１３−２の結果、優先候補か残ってい
るかどうかを、優先個数がＯか否かによって判定し、優
先候補が一つもない場合は５１３−５へ、優先候補か１
個以上残っている場合はＳ　１３−４へそれぞれ分岐す
る。In S13-3, as a result of 313-2, it is determined whether or not there are any priority candidates remaining, based on whether or not the number of priority candidates is O. If there is no priority candidate, the process goes to 513-5.
If more than one remain, the process branches to S13-4.

Ｓｌ、３−４で優先候補として残された候補情報のうち
、個別単語用例を持つ候補が存在するかどうかをチエツ
クする。個別単語用例の存在チエツクは、候補情報の先
頭から優先個数分の候補情報について、以下の処理を行
なうことにより行なわれる。先ず、候補情報内の用例情
報ポインタをたどり、用イタ１トｔｉ’ｉ　Ｐａポイン
タの示すずへ−Ｃの用例情報について用例のタイプを調
へ、一つでも個別単語用例があれは“個別単語用例あり
゛と判定される。用例情報ポインタが終端に違するまで
、すなわちポインタ値が“−１゛。It is checked whether or not there is a candidate having an example of individual word usage among the candidate information left as priority candidates in Sl, 3-4. The existence check for individual word examples is performed by performing the following process on the priority number of candidate information starting from the beginning of the candidate information. First, follow the example information pointer in the candidate information, and set the type of example for the example information indicated by the pointer 1 to ti'i Pa. If there is at least one individual word example, it is It is determined that there is an example.Until the example information pointer reaches the end, that is, the pointer value is "-1".

を示ずまで用例のタイプを調べてすべて層、味属性用例
であれば°゛個個別単用用例し”と判定される。If the type of the example is examined and all of the examples are layered or taste attribute examples, it is determined that it is a single individual example.

判定の結果、個別単語用例ありならは５１３−６へ、個
別単語用例なしならば３１３−７へ分岐する。As a result of the determination, if there is an example of an individual word, the process branches to 513-6, and if there is no example of an individual word, the process branches to 313-7.

３１３−６で３１３−５の個別単語用例検索の結果、す
べて意味属性用例である候補情報を優先候補から除外す
る。すなわち、優先個数を１減らし、候補情報を優先候
補末尾の直後に並び換える。At 313-6, as a result of the individual word example search in 313-5, candidate information that is all semantic attribute examples is excluded from the priority candidates. That is, the number of priority items is decreased by 1, and the candidate information is rearranged immediately after the end of the priority candidates.

上記処理の結果、優先候補が一つに絞られたかどうかを
３１３−７で判定する。すなわち優先個数が１であるか
どうかを判定する。優先個数が１であるならば３１３−
９へ進み、優先候補を第１候補として決定する。（豆先
個数か１でない場合は、まだ一つに絞られていないため
、５１３−８で単語学習による候補の絞り込みを行なう
。As a result of the above processing, it is determined in step 313-7 whether or not the priority candidates have been narrowed down to one. That is, it is determined whether the priority number is 1 or not. If the priority number is 1, then 313-
Proceed to step 9 and determine the priority candidate as the first candidate. (If the number of bean tips is not 1, the candidates have not yet been narrowed down to one, so the candidates are narrowed down by word learning in step 513-8.

Ｓ　ｌ　３−８で優先候補の単語ＩＤを取り出し、単語
ＩＤの示す単語学習データを調べ、単語学習されている
候補が存在したならば、５１３−１１で単語学習されて
いる候補を第１候補として決定する。！Ｘｉ−語学習語
学下いる候補が複数個あったならば、先に格納されてい
る候補情報を第１候補とする。優先候補中のすべての候
補に単語学習されているものがなかった場合は、３１３
−１０で優先候補の先頭に格納されている候補を第１候
補として決定する。S l 3-8 extracts the word ID of the priority candidate, checks the word learning data indicated by the word ID, and if there is a word-learned candidate, the word-learned candidate is selected as the first candidate in 513-11. Determine as. ! If there are multiple candidates under the Xi-language learning language, the previously stored candidate information is set as the first candidate. If none of the priority candidates have learned words, 313
-10, the candidate stored at the beginning of the priority candidates is determined as the first candidate.

５１３−３で優先候補が一つも存在しないと判定された
場合は、３１３−５で卑語学習による第１候補決定の判
定を行なう、候補情報の単語ＩＤを取り出し、単語ＩＤ
の示す単語学習データを調べ、単語学習されている候補
が存在したならば、３１３−１１で単語学習されている
候補を第１候補として決定する。単語学習されている候
補が複数個あったならば、先に格納されている候補情報
を第１候補とする。すべての候？ｉＴｉ情報にＲ−ＨＱ
学習されているものがなかった場合は、５Ｉ３−１２で
先頭の候補情報を第１候補として決定する。If it is determined in 513-3 that there is no priority candidate, the word ID of the candidate information is extracted and the word ID is determined in 313-5 to determine the first candidate by learning the vulgar word.
The word learning data indicated by is examined, and if a word-learning candidate exists, the word-learning candidate is determined as the first candidate in 313-11. If there are a plurality of word-learning candidates, the previously stored candidate information is set as the first candidate. All cases? R-HQ for iTi information
If there is no learned information, the first candidate information is determined as the first candidate in 5I3-12.

第１４図は５６−６の処理を詳細化したフローチャート
である。FIG. 14 is a detailed flowchart of the process 56-6.

５１４−１において選択キーＳＥＬによって選択・確定
を指示された候補が格納されている同音語内に用例が存
在するかどうかを同音語中のすへての候補情報について
、用例情報ポインタを参照することにより判定する。す
なわち、用例情報ポインタの値が“−１”であるならば
、用例なしと判定される。すべての候補情報について用
例が一つも存在しなければ、５１４−３・＼、一つでも
用例が存在すれは５１４−２へそれぞれ分岐する。In step 514-1, the example information pointer is referred to for all candidate information in the homophone to determine whether an example exists in the homophone in which the candidate whose selection/confirmation is instructed by the selection key SEL is stored. Judgment is made by That is, if the value of the example information pointer is "-1", it is determined that there is no example. If there is no example for all candidate information, the process branches to 514-3 and \, and if there is even one example, the process branches to 514-2.

５１４−２で選択された。４Ｌ語によって成立する用例
が用例情報中に存在するかどうかを以下のように判定す
る。514-2 was selected. It is determined as follows whether an example established by a 4L word exists in the example information.

まず、選択された４１語の候？ｉｉ情報中における用例
情報ポインタをたどり、ベア同音語ＩＤおよび用例１０
を取り出す。取り出したベア同音語ＩＤの示す同音語内
の先頭候補情報の用例情報ポインタをたどり、ベア同音
語ＩＤが、選択された単語の存在する同音語ＩＤである
ものがあるかどうかを検索する。そのような用例情報が
検索されなけれは、用例は成立しないと判定される。ベ
ア同音Ｈ６１Ｄか、選択された単語の同音語ＩＤと一致
する用例情報が検索されたならば、用例ＩＤを取り出し
、選択された単語の用例情報中の用例ＩＤと一致するか
どうかをチエツクする。用例ＩＤが一致すれば、成立用
例が存在すると判定される。一致しなければ用例は成立
しないと判定される。用例が成立すると判定されたなら
ば、Ｓ　１４−６へ進み、用例が成立しないと判定され
たならば、ベア同音語の次の用例情報について同様の判
定を行なう。ベア同音語の用例情報についてすべて用例
が成立しなければ、選択された単語の次の用例情報につ
いて、同様の判定処理を行なう。選択された単語の用例
情報がないならば、成立する用例なしと判定され、Ｓ　
Ｉ　４−５へ分岐する。First, what are the 41 selected words? Follow the example information pointer in the ii information and find the bare homophone ID and example 10.
Take out. The usage example information pointer of the head candidate information in the homophone indicated by the extracted bare homophone ID is followed, and a search is made to see if there is any homophone ID whose bare homophone ID is the same as the homophone ID in which the selected word exists. Unless such example information is retrieved, it is determined that the example does not hold true. If the bare homophone H61D or example information matching the homophone ID of the selected word is retrieved, the example ID is taken out and checked to see if it matches the example ID in the example information of the selected word. If the example IDs match, it is determined that a valid example exists. If they do not match, it is determined that the example does not hold. If it is determined that the example is valid, the process advances to S14-6, and if it is determined that the example is not valid, a similar determination is made for the next example information of the bare homophone. If all examples are not established for the example information of the bare homophone, the same determination process is performed for the next example information of the selected word. If there is no example information for the selected word, it is determined that there is no valid example, and S
Branch to I 4-5.

５１４−４は用例抑制解除の処理である。５１４−２で
成立用例と判定された用例の用例ＩＤの示す用例学習デ
ータを使用可能状態にする。514-4 is a process for canceling usage suppression. In step 514-2, the example learning data indicated by the example ID of the example determined to be a valid example is made available for use.

５１４−３は５Ｉ４−１にて選択された単語に用例か存
在しなかった場合に、選択されなかった他の同音語の用
例を抑制するための用例存在判定を行なう。In 514-3, when there is no example of the word selected in 5I4-1, the existence of examples is determined in order to suppress examples of other homophones that were not selected.

そのために、選択された単語の存在する同音語中の選択
された候補情報以外のすべての用例情報に用例情報が存
在しているかどうかを判定し、用例が存在するならば、
５１４−５で用例ＩＤの示す用例学習データを使用禁止
状態にする。To do this, it is determined whether or not example information exists in all example information other than the selected candidate information in the homophones in which the selected word exists, and if examples exist,
In step 514-5, the example learning data indicated by the example ID is disabled.

５１４−６は単語学習処理を行なう。選択された表記の
単語ＩＤの示す卑語学習データを学習状！μにし、同音
語中の選択されなかった候補情報の単語ＩＤの示す単語
学習データの学習状態を解除する処理を行なう。514-6 performs word learning processing. A learning letter of the base language learning data indicated by the word ID of the selected notation! μ, and performs a process of canceling the learning state of the word learning data indicated by the word ID of the unselected candidate information among the homophones.

５１４−７で選択された表記の文書への確定処理を行な
う。確定処理は確定文字コートを文書バッファＴＢＵＦ
へ格納するという同種の文字処理装置で通常行なわれて
いる処理であり、公知であるので詳述しない。At step 514-7, the selected notation is confirmed in the document. Confirmation processing transfers the confirmed character code to the document buffer TBUF.
This is a process that is normally performed in character processing devices of the same type and is well known, so it will not be described in detail.

[Other Examples]

以上の説明において用いた辞書は単語情報と用例情報と
を同一見出し語の下に記述した単一の辞書構造であるが
、単語情報と用例情報とを分離した各辞書テーブルとし
ても同、様に処理することができる。さらに用例のタイ
プ別に辞書を構成し、各辞書に辞書個別のＩＤを持たせ
、用例の存在する辞書ＩＤによって用例の優先順位を持
たせて処理を行なってもよい。The dictionary used in the above explanation has a single dictionary structure in which word information and usage example information are described under the same headword, but it can also be used similarly as each dictionary table with word information and usage example information separated. can be processed. Furthermore, dictionaries may be configured for each type of example, each dictionary having its own ID, and processing may be performed by giving priority to the examples according to the dictionary ID in which the example exists.

また、以上の説明では、隣接した文節間にのみ用例を適
用していたが、隣接していない文節間に用例を適用する
ことも、容易に実現できる。その際、文節間の構文解析
処理（係り受は解析処理）を施すことによって、用例の
誤った適用を避けることができる。Furthermore, in the above description, examples are applied only between adjacent bunsetsu, but examples can also be easily applied between non-adjacent bunsetsu. In this case, by performing a syntactic analysis process between clauses (an analysis process for dependencies), incorrect application of the example can be avoided.

以上の説明においては、分ち書き入力された文節間に用
例を適用する場合を述べたが、べた書きに入力された読
み列に対しても、自動的に文節に分割する処理を付は加
えることによって、同じように用例を適用できる。また
、用例の存在を加味しなから文節に分割することも可能
である。In the above explanation, we have described the case where usage examples are applied between clauses input in separate writing, but processing for automatically dividing into clauses can also be added to reading sequences input in solid writing. By doing so, you can apply the example in the same way. It is also possible to divide sentences into clauses without considering the existence of examples.

（効果の説明）以上の説明から明らかなように、本発明によれば意味的
な同一性のある用例をまとめて登録することができるた
め、個別単語の用例を網羅的に登録する必要かなく、用
例辞書コストを低下させることができ、かつ、適用単語
対のばらつきを最小限にすることが容易である。また、
個別単語用例との併用により、意味属性用例では記述し
切れない固有の単語対に対しても同音語の第１候補を一
意に決定することが可能となるため、次候補に変換する
ことが不要となり、効率の良い入力作業を行なうことか
できる。(Explanation of Effects) As is clear from the above explanation, according to the present invention, examples with semantic sameness can be registered all at once, so there is no need to exhaustively register examples of individual words. , it is possible to reduce the cost of the example dictionary, and it is easy to minimize the variation in applied word pairs. Also,
When used in conjunction with individual word examples, it is possible to uniquely determine the first homophone candidate for unique word pairs that cannot be fully described using semantic attribute examples, so there is no need to convert to the next candidate. This allows for efficient input work.

[Brief explanation of the drawing]

第１図は本発明の全体構成のブロック図、第２図は本発
明の辞書構成の例を示す図、第３図は本発明の同音語プ
ールおよび用例情報プールの構成を示す図、第４図は本発明の付属語変換テーブルの構成を示す図、第５図は本発明の学習データの構成を示す図、第６図〜第１４図は本発明文字処理装面の動作を示すフ
ローチャートである。特許出願人　　キャノン株式会社第７図第８図第９図第１０図第１１図Ｅ甲乙第１２図FIG. 1 is a block diagram of the overall configuration of the present invention, FIG. 2 is a diagram showing an example of the dictionary configuration of the present invention, FIG. 3 is a diagram showing the configuration of the homophone pool and the example information pool of the present invention, and FIG. FIG. 5 is a diagram showing the structure of the adjunct word conversion table of the present invention, FIG. 5 is a diagram showing the structure of learning data of the present invention, and FIGS. 6 to 14 are flowcharts showing the operation of the character processing system of the present invention. be. Patent applicant: Canon Co., Ltd. Figure 7 Figure 8 Figure 9 Figure 10 Figure 11 Figure E A Figure 12

Claims

[Scope of Claims] An input means for inputting the pronunciation of a word, a word dictionary that stores the semantic attributes of the word while making correspondence between the pronunciation and the notation of the word, and a pair of words having a co-occurrence relationship. an example dictionary that stores a plurality of types of examples of individual words and semantic attribute examples that are pairs of words and semantic attributes; a phrase recognition means for recognizing a phrase as a phrase by the phrase recognition means; a usage example search means for searching the example dictionary for a pair of phrases recognized by the phrase recognition means and extracting valid individual word examples or semantic attribute examples; A character processing device comprising: a conversion means for determining a first candidate notation by giving priority to a word pair constituting an individual word example or a semantic attribute example searched by a search means.