JPH0640328B2

JPH0640328B2 - Character processor

Info

Publication number: JPH0640328B2
Application number: JP1311115A
Authority: JP
Inventors: 英一朗戸島; 雄二小林; 和世池田
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 1989-11-29
Filing date: 1989-11-29
Publication date: 1994-05-25
Anticipated expiration: 2009-05-25
Also published as: JPH03168864A

Description

【発明の詳細な説明】［産業上の利用分野］本発明は仮名漢字変換により漢字仮名混り文を入力する
文字処理装置に関する。DETAILED DESCRIPTION OF THE INVENTION [Industrial application] The present invention relates to a character processing device for inputting a kanji / kana mixed sentence by kana / kanji conversion.

［従来の技術］現在、日本ワードプロセッサなど文字処理装置は漢字仮
名混り文の入力を仮名漢字変換を使って行なうことが一
般的である。[Prior Art] Currently, a character processing device such as a Japanese word processor generally inputs kanji-kana mixed sentences using kana-kanji conversion.

仮名漢字変換は辞書を参照することにより、入力された
読み列を漢字に変換するものである。辞書においては各
単語に対して名詞、サ変名詞、副詞、形容詞、形容動詞
等の品詞情報が記述されており、仮名漢字変換は読み列
を解析して可能な文節候補を作成し、それらを組み合わ
せて変換候補を決定し、尤もらしい順に提示する。提示
された変換候補の中からオペレータが望む候補を選択す
る。Kana-Kanji conversion is to convert an input reading string into Kanji by referring to a dictionary. Part-of-speech information such as nouns, sahen nouns, adverbs, adjectives, and adjective verbs is described for each word in the dictionary, and Kana-Kanji conversion analyzes possible reading strings to create possible phrase candidates and combines them. The conversion candidates are determined and presented in order of likelihood. The operator selects a desired candidate from the presented conversion candidates.

例えば、読み列「ぎゅうにゅうをぼくじょうでかう」に
対しては「牛乳を」「牛」「入を」「牧場で」「僕」
「上で」「情で」「上」「情」「出」「飼う」「買う」
「課」「蚊」「卯」などの文節候補が作成され、これら
を組み合わせた「牛乳を牧場で飼う」「牛乳を牧場で買
う」が変換候補として出力表示される。「買う」「飼
う」の中では、どちらの頻度が高いとは言えないので
「牛乳を牧場で買う」が第１候補としては変換されると
は限らない。For example, for the reading line "Gyu ni ou ni ou kyou de ka", "milk""cow""in""at the ranch""I"
"On", "emotion", "up", "emotion", "out", "keep", "buy"
Phrase candidates such as “section”, “mosquito”, and “u” are created, and a combination of these, “Keep milk at the ranch” and “Buy milk at the ranch” is output and displayed as conversion candidates. It cannot be said that either “buy” or “keep” has a higher frequency, so that “buying milk at the ranch” is not always converted as the first candidate.

そこで、変換率を高めるために用例変換という手法が提
案されている。用例変換は、共起する単語のペア（用
例）を予め用例辞書中に登録しておき、変換時に用例辞
書を参照し、第１候補を用例辞書中の用例に従って変更
する手法である。例えば、「牛乳を／買う」というパタ
ーンを用例辞書に記憶しておき、「ぎゅうにゅうをかっ
た」という入力に対しては「牛乳を買った」が第１候補
として変換される様にする。Therefore, in order to increase the conversion rate, a method called example conversion has been proposed. The example conversion is a method in which a pair of co-occurring words (examples) is registered in the example dictionary in advance, the example dictionary is referred to during conversion, and the first candidate is changed according to the example in the example dictionary. For example, the pattern “buy milk / buy” is stored in the example dictionary, and “buy milk” is converted as the first candidate in response to an input “was milky”.

上記の例は単語と単語の関係を記述した個別用例の例で
あるが、意味分類との関係を記述した意味用例というも
のも提案されている。例えば、「〜にいった」という文
脈においては「〜」が場所を表わすときには「行った」
という表記が尤もらしく、「〜」が人間を表わすときに
は「言った」という表記が尤もらしい。この変換を正し
く行なうために「《人間》に／言う」「《場所》に／行
く」という意味用例を用例辞書にあらかじめ登録してお
くのである。なお、《人間》《場所》などは意味分類で
あり、辞書中の全ての単語には《人間》であるか《場
所》であるかそうでないかが記述されている。この仕組
みにより「市長に言った」「市役所に行った」などを正
しく変換することができる。Although the above example is an example of an individual example in which the relationship between words is described, a semantic example in which the relationship with a semantic classification is described is also proposed. For example, in the context of “to ni,” when “to” represents a place,
The notation is plausible, and when "~" represents a human, the notation "said" seems plausible. In order to perform this conversion correctly, the meaning examples of "to / to say" to "human" / to / to "place" are registered in advance in the example dictionary. Note that “human”, “place”, etc. are semantic classifications, and all words in the dictionary describe “human”, “place”, or not. With this mechanism, it is possible to correctly convert "I told the mayor,""I went to the city hall," etc.

［発明が解決しようとしている課題］ところが、従来の用例変換の技術は、複数の用例が適用
可能なとき、すなわち、用例が競合するとき、どの用例
を優先すれば良いかは特に指定していなかった。わずか
に個別用例と意味用例が競合したときは個別用例を優先
するという指針が示されているぐらいであり、その他の
状況においてはどの用例を優先して良いかが分からなか
った。そのため、適当な用例を採用して、適当に第１候
補が決めていたため、微妙な状況での変換率が芳しくな
かった。[Problems to be Solved by the Invention] However, the conventional example conversion technique does not particularly specify which example should be prioritized when a plurality of examples are applicable, that is, when examples conflict. It was Even if there was a slight guideline to give priority to individual examples when there was a conflict between individual examples and meaning examples, it was not clear which example should be given priority in other situations. Therefore, since the first candidate was appropriately decided by adopting an appropriate example, the conversion rate in a delicate situation was not good.

［課題を解決するための手段（及び作用）］本発明の文字処理装置は、仮名文字列を入力するための
入力手段と、単語の読みと、表記及び意味分類とを対応
づけて記憶した単語辞書手段と、単語の表記と、当該単
語と併用される単語の表記もしくは意味分類とを対応づ
けて用例として、各用例に用例の優先度を付して記憶す
る用例辞書手段と、前記単語辞書手段を参照して、前記
入力手段より入力された仮名文字列を読みとする表記の
候補を作成する候補作成手段と、前記候補作成手段によ
り作成された同じ読みに対する表記候補に関して、該表
記候補と併用される単語の意味分類が該当する用例があ
る場合に、該用例の優先度と、該用例に該当する単語の
それぞれを含む文節間の文節数とを用いた演算により、
該用例の尤度を求め、該尤度に基づいて、前記表記候補
の表示優先度を決定する表示優先度決定手段と、該表示
優先度決定手段によって決定された表示優先度に基づい
て、前記候補作成手段により作成された表記候補を表示
する候補表示手段とを具えることにより、複数の用例が
競合したときには、用例自体の優先度と、その用例に該
当する単語のそれぞれを含む文節間の文節数とを用いた
演算により、表記候補の優先度を決定するようにしたも
のである。[Means (and Action) for Solving the Problem] A character processing device of the present invention is a word in which an input means for inputting a kana character string, a word reading, a notation and a semantic classification are associated and stored. The dictionary means, the word notation, the example notifying means for storing the notation or the meaning classification of the word used in combination with the word in association with each example with the priority of the example, and the word dictionary. With reference to the means, a candidate creating means for creating a candidate for a notation in which the kana character string input from the input means is read, and a notation candidate for the same reading created by the candidate creating means, When there is an example to which the semantic classification of words to be used together corresponds, by an operation using the priority of the example and the number of phrases between the phrases including each of the words corresponding to the example,
Based on the display priority determined by determining the likelihood of the example and determining the display priority of the notation candidate based on the likelihood, and the display priority determined by the display priority determination means, By providing the candidate display means for displaying the notation candidates created by the candidate creating means, when a plurality of examples compete with each other, the priority of the example itself and the phrase containing each of the words corresponding to the example The priority of the notation candidate is determined by an operation using the phrase count.

［実施例］以下図面を参照しながら本発明を詳細に説明する。EXAMPLES The present invention will be described in detail below with reference to the drawings.

第１図は本発明の全体構成の一例である。FIG. 1 is an example of the overall configuration of the present invention.

図示の構成において、ＣＰＵは、マイクロプロセッサで
あり、文字処理のための演算、論理判断等を行ない、ア
ドレスバスＡＢ、コントロールバスＣＢ、データバスＤ
Ｂを介して、それらのバスに接続された各構成要素を制
御する。In the illustrated configuration, the CPU is a microprocessor, performs arithmetic operations for character processing, logical judgments, etc., and has an address bus AB, a control bus CB, and a data bus D.
Via B, each component connected to those buses is controlled.

アドレスバスＡＢはマイクロプロセッサＣＰＵの制御の
対象とする構成要素を指示するアドレス信号を転送す
る。コントロールバスＣＢはマイクロプロセッサＣＰＵ
の制御の対象とする各構成要素のコントロール信号を転
送して印加する。データバスＤＢは各構成機器相互間の
データの転送を行なう。The address bus AB transfers an address signal indicating a component to be controlled by the microprocessor CPU. Control bus CB is a microprocessor CPU
The control signals of the respective components to be controlled by are transferred and applied. The data bus DB transfers data between the constituent devices.

つぎにＲＯＭは、読出し専用の固定メモリであり、第１
２図〜第１９図につき後述するマイクロプロセッサＣＰ
Ｕによる制御の手順を記憶させておく。Next, the ROM is a fixed read-only memory.
Microprocessor CP described later with reference to FIGS. 2 to 19
The control procedure by U is stored.

また、ＲＡＭは、１ワード１６ビットの構成の書込み可
能のランダムアクセスメモリであって、各構成要素から
の各種データの一時記憶に用いる。ＩＢＵＦはキー入力
されたキーデータを記憶する入力バッファであり、ＯＢ
ＵＦは仮名漢字変換された結果を一時的に記憶する出力
バッファである。ＤＩＣは仮名漢字変換を行なうための
辞書である。ＹＤＩＣは単語ある違背未分類間の共起関
係を記述した用例辞書である。ＢＣＴＢＬは変換途中の
文節候補を記憶する文節候補テーブルである。ＴＢＵＦ
はテキストバッファであり、入力編集中のテキストデー
タが記憶される。ＤＯＢＵＦは同音語バッファであり、
テキストバッファＴＢＵＦに記憶される文字に同音語候
補が存在するときはその候補が記憶される。The RAM is a writable random access memory having a structure of 1 word 16 bits, and is used for temporary storage of various data from each constituent element. IBUF is an input buffer that stores key data input by a key input.
UF is an output buffer for temporarily storing the result of Kana-Kanji conversion. DIC is a dictionary for Kana-Kanji conversion. YDIC is an example dictionary that describes a co-occurrence relationship between words that have not been classified. BCTBL is a phrase candidate table that stores phrase candidates that are in the process of conversion. TBUF
Is a text buffer in which text data being input and edited is stored. DOBUF is a homophone buffer,
When a homophone candidate exists for the character stored in the text buffer TBUF, the candidate is stored.

ＫＢはキーボードであって、アルファベットキー、ひら
かなキー、カタカナキー等の文字記号入力キー、及び、
変換キー、次候補キー、選択キー等の本文字処理装置に
対する各種機能を指示するための各種のファンクション
キーを備えている。KB is a keyboard, which is a character / symbol input key such as an alphabet key, a hiragana key, or a katakana key, and
Various function keys such as a conversion key, a next candidate key, and a selection key for instructing various functions for the character processing apparatus are provided.

ＤＩＳＫは文書データを記憶するための外部記憶部であ
り、テキストバッファＴＢＵＦ上に作成された文書の保
管を行ない、保管された文書はキーボードの指示によ
り、必要な時呼び出される。DISK is an external storage unit for storing document data, stores a document created in the text buffer TBUF, and the stored document is called when necessary by a keyboard instruction.

ＣＲはカーソルレジスタである。ＣＰＵにより、カーソ
ルレジスタの内容を読み書きできる。後述するＣＲＴコ
ントローラＣＲＴＣは、ここに蓄えられたアドレスに対
応する表示装置ＣＲＴ上の位置にカーソルを表示する。CR is a cursor register. The CPU can read and write the contents of the cursor register. The CRT controller CRTC described later displays a cursor at a position on the display device CRT corresponding to the address stored here.

ＤＢＵＦは表示用バッファメモリで、表示すべきデータ
のパターンを蓄える。テキストの内容を表示するときは
テキストバッファＴＢＵＦの内容に応じて表示パターン
がＤＢＵＦに作成されることにより、表示されることに
なる。DBUF is a display buffer memory that stores a pattern of data to be displayed. When displaying the content of the text, a display pattern is created in the DBUF in accordance with the content of the text buffer TBUF, so that it is displayed.

ＣＲＴＣはカーソルレジスタＣＲ及びバッファＤＢＵＦ
に蓄えられた内容を表示器ＣＲＴに表示する役割を担
う。CRTC is cursor register CR and buffer DBUF
It plays the role of displaying the contents stored in the display CRT.

またＣＲＴは陰極線管等を用いた表示装置であり、その
表示装置ＣＲＴにおけるドット構成の表示パターンおよ
びカーソルの表示をＣＲＴコントローラで制御する。The CRT is a display device using a cathode ray tube or the like, and the display pattern of the dot configuration and the display of the cursor on the display device CRT are controlled by the CRT controller.

さらに、ＣＧはキャラクタジェネレータであって、表示
装置ＣＲＴに表示する文字、記号のパターンを記憶する
ものである。Further, CG is a character generator, which stores patterns of characters and symbols to be displayed on the display device CRT.

かかる各構成要素からなる本発明文字処理装置において
は、キーボードＫＢからの各種の入力に応じて作動する
ものであって、キーボードＫＢからの入力が供給される
と、まず、インタラプト信号がマイクロプロセッサＣＰ
Ｕに送られ、そのマイクロプロセッサＣＰＵがＲＯＭ内
に記憶してある各種の制御信号を読出し、それらの制御
信号に従って各種の制御が行なわれる。The character processing device of the present invention comprising the above-described components operates in response to various inputs from the keyboard KB. When an input from the keyboard KB is supplied, first, an interrupt signal is sent to the microprocessor CP.
The control signal is sent to U, the microprocessor CPU reads out various control signals stored in the ROM, and various controls are performed in accordance with these control signals.

第２図は本発明装置による変換例を示した図である。図
中「」はそこで変換キーを打鍵するという意味であ
る。FIG. 2 is a diagram showing a conversion example by the device of the present invention. In the figure, "" means that the conversion key is pressed there.

「ぼくじょうでかう」と入力した場合は、第１候補とし
て「牧場で飼う」と変換される。これは用例辞書に「牧
場で／飼う」という用例が登録されているからである。
この様な用例変換の仕組みがないと、「かう」について
は「飼う」「買う」「交う」などの同音語があるので、
正しく変換されるとは限らなくなってしまう。When "Bokujo deka" is entered, it is converted to "Keep at the farm" as the first candidate. This is because the example "Branch / keep" is registered in the example dictionary.
If there is no such example conversion mechanism, there are homophones such as "keep", "buy", "cross" for "Kau".
It will not always be converted correctly.

次の例は「ぎゅうにゅうをかう」と入力した場合であ
る。第１の例と同様に、用例辞書に「牛乳を／買う」と
いう用例が登録されているので、正しく「牛乳を買う」
と変換される。The following example is for the case of entering "Gyu ni Uwa". As in the first example, since the example of “buying milk / buying” is registered in the example dictionary, “buying milk” should be correct.
Is converted to.

第３の例は「ぎゅうにゅうをぼくじょうでかう」と入力
した場合の変換である。この場合「牧場で飼う」「牛乳
を買う」の２つの用例が適用可能であり、「かう」の部
分についてそれぞれが矛盾する変換を要求している。す
なわち、どちらの用例を信じるかによって「かう」の変
換結果が変わってしまう。本発明はこの様な場合のため
に、各用例に優先度情報を持っており、優先度の高い用
例に基づいて変換している。この場合、「牧場で／飼
う」という用例より「牛乳を／買う」という用例の優先
度が高く設定されており、より優先度の高い「牛乳を／
買う」用例が適用され、「牛乳を牧場で買う」と変換さ
れている。The third example is the conversion in the case of inputting "Gyu ni ou boku jo ka ka". In this case, the two examples of "keeping on the ranch" and "buying milk" are applicable, and each requires a contradictory conversion for the "kau" part. That is, the conversion result of "kau" changes depending on which example is believed. For such a case, the present invention has priority information in each example and performs conversion based on an example with a high priority. In this case, the example of "buying milk / buying" is set to have a higher priority than the example of "being on the farm / raising", and the example of "buying milk /
The "buy" example has been applied and converted to "buy milk on the ranch."

第３図は本発明装置による用例の学習の例を示した図で
ある。３−１はまず初期画面を示したものである。３−
２は読み列「うしをぼくじょうでかう」を入力した時の
画面を示している。カーソルは入力読み列の次に表示さ
れている。ここで変換キーを打鍵すると３−３の画面に
なる。３−３では読み列「うしをぼくじょうでかう」が
「牛を牧場で飼う」と変換されている。この状況では
「牛を／飼う」「牧場で／飼う」「牛を／買う」という
用例が存在するが、最も優先度の高い「牛を／飼う」が
適用されて変換されている。オペレータは「牛を／買
う」という変換を望んだとして、次候補キーを打鍵する
と３−４の画面になる。ここで「かう」の変換候補が表
示される。第１候補は「飼う」であり、第２候補は「買
う」である。現在の候補は第２候補の「買う」であり、
２が反転表示している。ここで選択キーを打鍵すると、
３−５の画面になる。第２候補の「買う」が確定してテ
キストデータの中に格納され、同時に用例「牛を／買
う」が学習され、優先度が向上する。もう一度「うしを
ぼくじょうでかう」と入力すると３−６の画面になる。
更に変換キーを打鍵すると３−７の画面になり、今度は
用例「牛を／買う」が学習されているので、優先的に適
用され、第１候補として「牛を牧場で買う」が表示され
ている。FIG. 3 is a diagram showing an example of learning an example by the device of the present invention. 3-1 shows the initial screen. 3-
Reference numeral 2 shows a screen when the reading column "Ushio-dokujo-deka" is entered. The cursor is displayed next to the input reading row. When the conversion key is pressed here, the screen of 3-3 appears. In 3-3, the reading column "Ushio bokujo de ka" is converted to "keep cows on the farm." In this situation, there are examples of “cattle / keeping”, “on the farm / keeping”, and “cattle / buying”, but the highest priority “cattle / keeping” is applied and converted. If the operator desires the conversion of "buy / buy", the next candidate key is pressed, and the screen of 3-4 is displayed. Here, the conversion candidates of "KAU" are displayed. The first candidate is “keep” and the second candidate is “buy”. The current candidate is the second candidate, "buy",
2 is highlighted. If you hit the select key here,
The screen becomes 3-5. The second candidate “buy” is confirmed and stored in the text data, and at the same time, the example “cow / buy” is learned and the priority is improved. If you enter "Ushijo Bokujou" again, the screen 3-6 appears.
If you press the conversion key further, the screen of 3-7 appears, and this time, the example "Buy cow / buy" has been learned, so it is applied preferentially, and "Buy cow at the farm" is displayed as the first candidate. ing.

第４図は入力バッファＩＢＵＦ、出力バッファＯＢＵＦ
の構成を示した図である。FIG. 4 shows an input buffer IBUF and an output buffer OBUF.
It is a figure showing the composition of.

ＩＢＵＦ、ＯＢＵＦともに同じ構成である。最初の２バ
イトは各バッファのサイズ情報であり、バッファに格納
されている文字数から１を減じたものを２倍した数値が
入る。入力バッファの末尾にある「」はそこで変換キ
ーが打鍵されたことを意味する。各文字は１文字２バイ
トで構成され、JIS X 0208コード等で格納される。Both IBUF and OBUF have the same configuration. The first 2 bytes are the size information of each buffer, and the numerical value obtained by doubling the value obtained by subtracting 1 from the number of characters stored in the buffer is entered. The "" at the end of the input buffer means that the conversion key was pressed there. Each character consists of 2 bytes per character and is stored in JIS X 0208 code or the like.

第５図は辞書ＤＩＣの構成を示した図である。FIG. 5 is a diagram showing the structure of the dictionary DIC.

「読み」「表記」「品詞」「単語尤度」「意味分類」の
フィールドから構成される。It is composed of fields of "reading", "notation", "part of speech", "word likelihood", and "semantic classification".

「読み」には単語の読み、「表記」には単語の表記、
「品詞」には単語の品詞が格納される。"Reading" means reading a word, "writing" means writing a word,
The "part of speech" stores the part of speech of the word.

「単語尤度」は頻度情報等のその単語自体の尤もらしさ
を示す情報が１〜５の値で格納される。尤度値５は最も
尤もらしいという意味であり、値が小さくなるにつれ疑
わしいと解釈される。尤度値０は全く考えられないとい
うことを意味するので単語尤度の値としては存在しな
い。In the "word likelihood", information indicating the likelihood of the word itself such as frequency information is stored as a value of 1 to 5. The likelihood value 5 means that it is most likely, and the smaller the value, the more suspicious. A likelihood value of 0 does not exist as a word likelihood value because it means that it cannot be considered at all.

「意味分類」にはその単語の意味分類が、《場所》《動
物》《食物》などと、一般には複数個記述される。ただ
し、意味分類はその単語が名詞の場合のみ記述される。In the “semantic classification”, a plurality of semantic classifications of the word are generally described as “place”, “animal”, “food”, etc. However, semantic classification is described only when the word is a noun.

第６図は用例辞書ＹＤＩＣの構成を示した図である。
「第１単語」「第２単語」「助詞」「優先度」のフィー
ルドから構成される。FIG. 6 is a diagram showing the configuration of the example dictionary YDIC.
It is composed of fields of "first word", "second word", "particle" and "priority".

「第１単語」「第２単語」には共起する単語のペアを記
述する。もし、個別用例ではなくて意味用例であるとき
は単語の代わりに意味分類が記述される。A pair of co-occurring words is described in “first word” and “second word”. If it is not a specific example but a semantic example, a semantic classification is described instead of a word.

「助詞」には単語のペアを結び付ける助詞を記述する。The "particle" describes a particle that connects a pair of words.

「優先度」にはその用例の尤もらしさを示す情報が１〜
５の値で格納される。尤度値５は最も尤もらしいという
意味であり、値が小さくなるにつれ疑わしいと解釈され
る。尤度値０は全く考えられないということを意味する
ので優先度の値としては存在しない。Information indicating the likelihood of the example is 1 to "priority".
It is stored as a value of 5. The likelihood value 5 means that it is most likely, and the smaller the value, the more suspicious. Likelihood value 0 means that it cannot be considered at all, so it does not exist as a priority value.

すなわち、図中に示される用例は「牧場で／飼う」（優
先度４）「牛乳を／買う」（優先度５）「牛を／飼う」
（優先度５）「牛を／買う」（優先度４）「《動物》を
／飼う」（優先度２）となる。That is, the example shown in the figure is "at the ranch / keep" (priority 4) "buy / buy milk" (priority 5) "beef / keep"
(Priority 5) "buy / buy cow" (priority 4) "// keep animal" (priority 2).

なお、３番目の用例は意味用例であるので優先度が低く
押えてある。こうすることにより、「個別用例は意味用
例に優先する」という原則が自然と成り立つことにな
る。Since the third example is a meaning example, the priority is kept low. By doing so, the principle that "individual examples take precedence over semantic examples" naturally holds.

第７図は文節候補テーブルＢＣＴＢＬの概念を示した図
である。文節候補テーブルは入力読みの解析の結果、考
えられる文節候補をバイナリートリーで表現したもので
ある。図中、横線は子ポインタを意味し、縦線は弟ポイ
ンタを意味する。弟ポインタはある読み位置から始まる
他の文節候補（通常はより短い候補）をリンクし、子ポ
インタはその文節に引き続く文節候補をリンクする。FIG. 7 is a diagram showing the concept of the phrase candidate table BCTBL. The phrase candidate table is a binary phrase representing possible phrase candidates as a result of analysis of input reading. In the figure, a horizontal line means a child pointer, and a vertical line means a younger brother pointer. The younger brother pointer links other bunsetsu candidates (usually shorter ones) starting from a certain reading position, and the child pointer links bunsetsu candidates that follow the bunsetsu.

入力読み列「ぎゅうにゅうをぼくじょうでかう」の先頭
部分は「牛乳を」「牛」などの解釈が存在し、これらは
順に弟ポインタ（縦線）でリンクされている。At the beginning of the input reading string "Gyu ni wo wo ku jo desu ka", there are interpretations such as "milk" and "cow", which are sequentially linked by a younger brother pointer (vertical line).

「牛乳を」に引き続く文節は「牧場で」が考えられ、子
ポインタでリンクされている。The phrase following "milk" is considered "at the ranch" and is linked by the child pointer.

この様な文節候補テーブルができ上がると、「牛乳を牧
場で買う」「牛乳を牧場で飼う」「牛乳を僕情で買う」
「牛入を牧場で飼う」「牛入を僕情出買う」などと文節
候補列を作成することは容易である。When such a phrase candidate table is completed, "buy milk on the ranch", "keep milk on the ranch", "buy milk on my own terms"
It is easy to create a phrase candidate sequence such as "Keep Uriiri at the ranch" or "Buy Uyuiri for me".

第８図は文節候補テーブルＢＣＴＢＬの具体的構成を示
した図である。FIG. 8 is a diagram showing a specific configuration of the phrase candidate table BCTBL.

「自立語」は文節候補の自立語が存在する辞書上の先頭
位置へポインタを格納する。The "independent word" stores a pointer at the head position in the dictionary where the independent word of the bunsetsu candidate exists.

「付属語列」は２バイトで構成され、文節候補の自立語
に引き続く付属語列を特定する領域である。先頭１バイ
トが入力バッファ上の付属語列先頭文字をインデックス
し、次の１バイトが入力バッファ上の付属語列の末尾文
字をインデックスする。例えば、第４図に示す入力バッ
ファのとき、「を」を表現するには第１バイトに１２、
第２バイトに１２を設定する。付属語列が存在しないと
きは図中で「φ」で示している。The "adjunct word string" is composed of 2 bytes, and is an area for specifying the adjunct word string following the independent word of the bunsetsu candidate. The first 1 byte indexes the first character of the auxiliary word string on the input buffer, and the next 1 byte indexes the last character of the auxiliary word string on the input buffer. For example, in the case of the input buffer shown in FIG.
Set 12 in the second byte. When there is no adjunct word string, it is indicated by "φ" in the figure.

「弟リンク」はその文節候補と同じ読み位置から始まる
別の文節候補をリンクする。The "younger brother link" links another phrase candidate starting from the same reading position as the phrase candidate.

「子リンク」はその文節候補に引き続く文節候補をリン
クする。The "child link" links a bunsetsu candidate following the bunsetsu candidate.

なお、リンクは値０のときに終端することを意味する。Note that the link means that it terminates when the value is 0.

例えば、文節候補０（牛乳を）の弟リンクは５０であ
り、文節候補５０（牛）をリンクする。文節候補０（牛
乳を）の子リンクは１００であり、文節候補１００（牧
場で）をリンクする。文節候補１００の子リンクは４０
０であり、文節候補４００（買う）をリンクする。文節
候補４００の子リンクは０であり、そこで入力読みが終
端していることが分かる。For example, the younger brother link of the phrase candidate 0 (milk) is 50, and the phrase candidate 50 (cow) is linked. The child link of the phrase candidate 0 (milk) is 100, and the phrase candidate 100 (at the ranch) is linked. The child link of the phrase candidate 100 is 40
It is 0, and the phrase candidate 400 (buy) is linked. The child link of the phrase candidate 400 is 0, and it can be seen that the input reading ends there.

第９図はテキストバッファＴＢＵＦの構成を示した図で
ある。FIG. 9 is a diagram showing the structure of the text buffer TBUF.

テキストバッファは文字の羅列から構成され、各文字は
２バイトで構成される。各文字のＭＳＢは同音語フラグ
であり、０は通常文字、１が同音語を意味する。残りの
１５ビットは通常文字のときは文字コードを表現し、同
音語のときは同音語番号を表現する。文字コードは例え
ば、JIS X 0208コードを使用する。同音語番号は第１０
図に示す同音語バッファＤＯＢＵＦ上のどの同音語であ
るかを示す番号である。The text buffer is composed of a list of characters, and each character is composed of 2 bytes. The MSB of each character is a homophone word flag, 0 means a normal character, and 1 means a homophone word. The remaining 15 bits represent a character code in the case of a normal character and represent a homophone number in the case of a homophone. For the character code, for example, JIS X 0208 code is used. Homophone number is tenth
It is a number indicating which homonym is in the homophone word buffer DOBUF shown in the figure.

第１０図は同音語バッファＤＯＢＵＦの構成を示した図
である。各同音語には同音語番号が付いており、それに
よって識別される。FIG. 10 is a diagram showing the structure of the homophone word buffer DOBUF. Each homophone has a homophone number, which identifies it.

各同音語は「読み」「候補総数」「現候補番号」「第ｉ
候補情報」からなる。Each homonym is "yomi""total number of candidates""current candidate number""i-th
Candidate information ”.

「読み」はその同音語の読みを格納する。“Yomi” stores the reading of the homonym.

「候補総数」はその同音に含まれる候補の総数を格納す
る。The “total number of candidates” stores the total number of candidates included in the same sound.

「現候補番号」はその同音語の現在表示されている候補
番号が格納される。変換直後初値に第１候補が表示され
るから「１」が格納される。The "current candidate number" stores the currently displayed candidate number of the homonym. Since the first candidate is displayed in the initial value immediately after conversion, "1" is stored.

「第ｉ候補情報」は各候補の表記、及び、その候補に対
して適用された用例の番号が格納される。通常、その候
補に適用される用例は複数個存在するので、適用用例番
号には複数個の番号が入る。The "i-th candidate information" stores the notation of each candidate and the number of the example applied to the candidate. Usually, there are a plurality of examples applied to the candidate, so a plurality of application example numbers are entered.

第１１図は文尤度の計算例を示した図である。文尤度は
文節候補列である文の尤もらしさを表現するものであ
り、値が大きいほど尤もらしいことを意味する。FIG. 11 is a diagram showing an example of sentence likelihood calculation. The sentence likelihood expresses the likelihood of a sentence that is a phrase candidate sequence, and the larger the value, the more likely it is.

文尤度は、文節尤度の和、文節間尤度の和、用例尤度の
和を全て加算することにより計算される。The sentence likelihood is calculated by adding all the sums of the phrase likelihoods, the sums of the inter-phrase likelihoods, and the sums of the example likelihoods.

文節尤度は各文節候補の尤もらしさを表現するものであ
り、値が大きいほど尤もらしいことを意味する。文節尤
度はその文節の自立語の辞書ＤＩＣに記載されている単
語尤度が使用される。The phrase likelihood expresses the likelihood of each phrase candidate, and the larger the value, the more likely it is. As the phrase likelihood, the word likelihood described in the dictionary DIC of the independent word of the phrase is used.

文節間尤度は隣接する文節候補間の接続の尤もらしさを
表現するものであり、本実施例では値−２０で固定とす
る。その文がｎ文節で構成されていたとすると、文節間
は（ｎ−１）あるので、文節間尤度の和は必ず−２０
（ｎ−１）となる。The inter-phrase likelihood expresses the likelihood of connection between adjacent phrase candidates, and is fixed at a value of -20 in this embodiment. If the sentence is composed of n bunsetsu, there is (n-1) between the bunsetsus, so the sum of the inter-bunsetsu likelihoods is always -20.
(N-1).

用例尤度は各文節間に適用されている、用例の一つ一つ
について文尤度に加算される。用例尤度は各用例に記述
されている優先度の４倍から、その用例を適用する際に
飛び越す文節の数の２倍を減じたものとなる。The example likelihood is applied to the sentence likelihood for each example, which is applied between clauses. The example likelihood is four times the priority described in each example, minus two times the number of clauses skipped when applying the example.

第１１図によると、「牛乳を／牧場で／飼う」の文尤度
は−１８であり、「牛乳を／牧場で／買う」の文尤度は
−１６であるので、文尤度の値の大きい例２の「牛乳を
／牧場で／買う」が第１位に変換されることが分かる。According to FIG. 11, the sentence likelihood of “milk / raising / raising” is −18, and the sentence likelihood of “milk / raising / buying” is −16. It can be seen that the example 2 of “large milk / at the farm / buy” is converted to the first place.

この様に正しく変換される理由は、例１に適用される用
例「牧場で／飼う」の優先度は４であるのに対し、例２
に適用される用例「牛乳を／買う」の優先度は５であ
り、優先度の高い用例が適用された文ほど有利に変換さ
れるからからである。The reason for such correct conversion is that while the example "Appare on the farm / Keep" has a priority of 4 applied to Example 1, Example 2
This is because the priority of the example "buying milk / buying" applied to is 5 and the sentence to which the higher priority example is applied is more favorably converted.

上述の実施例の動作をフローに従って説明する。The operation of the above embodiment will be described according to the flow.

第１２図はキー入力を取り込み、処理を行なう部分のフ
ローチャートである。FIG. 12 is a flow chart of a part for receiving a key input and performing a process.

ステップ１２−１はキーボードからのデータを取り込む
処理である。ステップ１２−２で取り込まれたキーの種
別を判定し、各キーの処理ルーチンに分岐する。Step 12-1 is a process for fetching data from the keyboard. The type of the key fetched in step 12-2 is determined, and the process branches to the processing routine for each key.

変換キーであったときはステップ１２−３に分岐し、ス
テップ１２−３において第１３図に詳述するように仮名
漢字変換の変換処理が行なわれる。次候補キーであった
ときはステップ１２−４において第１８図に詳述する次
候補処理を行なう。選択キーであったときはステップ１
２−５において第１９図に詳述する選択処理を行なう。
その他のキーのときはステップ１２−６に分岐し、挿
入、削除等の通常の文字処理装置において行なわれるそ
の他の処理が行なわれる。その後ステップ１２−１に分
岐する。If it is a conversion key, the process branches to step 12-3, and the conversion process of kana-kanji conversion is performed in step 12-3 as described in detail in FIG. If it is the next candidate key, the next candidate process detailed in FIG. 18 is performed in step 12-4. If it is the select key, step 1
In 2-5, the selection process detailed in FIG. 19 is performed.
If the key is any other key, the process branches to step 12-6 to perform other processes such as insertion and deletion which are carried out in a normal character processing device. After that, the process branches to step 12-1.

第１３図はステップ１２−３の「変換処理」を詳細化し
たフローチャートである。FIG. 13 is a detailed flowchart of the “conversion process” in step 12-3.

ステップ１３−１において、第１４図に詳述する文節候
補作成処理を行ない、文節候補テーブルＢＣＴＢＬを作
成する。In step 13-1, the phrase candidate table BCTBL is created by performing the phrase candidate creation process described in detail in FIG.

ステップ１３−２において、第１５図に詳述する第１候
補決定処理を行なう。In step 13-2, the first candidate determination process described in detail in FIG. 15 is performed.

ステップ１３−３において、決定された第１候補に基づ
いて変換結果を作成し、出力する。In step 13-3, a conversion result is created and output based on the determined first candidate.

第１４図はステップ１３−１の「文節候補作成処理」を
詳細化したフローチャートである。FIG. 14 is a detailed flowchart of the “segment candidate creating process” in step 13-1.

ステップ１４−１において、入力バッファインデックス
ｉ、文節候補テーブルインデックスｊを０に初期設定す
る。In step 14-1, the input buffer index i and the phrase candidate table index j are initialized to 0.

ステップ１４−２においてｉの示す入力バッファ中の読
みに基づき、辞書をサーチし、単語候補を求める。In step 14-2, the dictionary is searched for word candidates based on the reading in the input buffer indicated by i.

ステップ１４−３において、見つかった単語候補に対し
て接続する付属語列を解析する形態素解析処理を行な
う。この結果、文節候補が得られる。In step 14-3, a morphological analysis process is performed to analyze the attached word string connected to the found word candidate. As a result, bunsetsu candidates are obtained.

ステップ１４−４において得られた文節候補を文節候補
テーブルに格納する。格納する時はｊ＋１番目のエント
リーに格納する。また、必要な情報を設定する。例え
ば、この文節候補を子、あるいは弟とする文節候補につ
いては、子リンク、弟リンクを設定する。格納後ｊの値
をカウントアップする。The phrase candidates obtained in step 14-4 are stored in the phrase candidate table. When storing, it is stored in the (j + 1) th entry. Also, set the necessary information. For example, a child link and a younger brother link are set for a phrase candidate in which this phrase candidate is a child or a younger brother. After storing, the value of j is incremented.

ステップ１４−５において文節候補テーブルからターミ
ネートしていない文節候補、すなわち、子リンクがまだ
決まっていない文節候補を見つけその次の読み位置をｉ
に代入する。In step 14-5, a bunsetsu candidate that has not been terminated, that is, a bunsetsu candidate for which a child link has not yet been determined is found from the bunsetsu candidate table, and the next reading position is set to i.
To.

ステップ１４−６において、全ての文節候補の子リンク
が決定しているかどうかを判定し、決定していないもの
があればステップ１４−２に分岐する。そうでなければ
リターンする。In step 14-6, it is determined whether or not the child links of all bunsetsu candidates have been decided, and if there is any that has not been decided, the process branches to step 14-2. Otherwise it returns.

第１５図はステップ１３−２の「第１候補決定処理」を
詳細化したフローチャートである。FIG. 15 is a detailed flowchart of the “first candidate determination process” in step 13-2.

ステップ１５−１において、最尤文尤度を処理上許され
る最小値、例えば、−３２７６７に初期設定する。In step 15-1, the maximum likelihood sentence likelihood is initialized to the minimum value allowed in processing, for example, -32767.

ステップ１５−２において文節候補テーブルより文節候
補列を１つ取り出す。In step 15-2, one phrase candidate string is extracted from the phrase candidate table.

ステップ１５−３において、第１６図に詳述するように
その取り出された文節候補列に適用可能な用例を検索
し、用例尤度の和である「用例尤度和」を算出する。In step 15-3, as will be described in detail with reference to FIG. 16, an example applicable to the extracted phrase candidate sequence is searched, and the “example likelihood sum” that is the sum of example likelihoods is calculated.

ステップ１５−４において、文節候補列の文尤度を第１
１図に示す様に算出する。In step 15-4, the sentence likelihood of the phrase candidate sequence is set to the first value.
It is calculated as shown in FIG.

ステップ１５−５において、算出された文尤度が最尤文
尤度より尤もらしいか、具体的には大きいかを判定し、
大きいときはステップ１５−６において最尤文尤度を算
出された文尤度に更新する。また、ステップ１５−７に
おいて現在の文節候補列を最尤文尤度に対応した文節候
補列として記憶する。In step 15-5, it is determined whether the calculated sentence likelihood is more likely than the maximum likelihood sentence likelihood, specifically, larger,
When it is larger, the maximum likelihood sentence likelihood is updated to the calculated sentence likelihood in step 15-6. In step 15-7, the current phrase candidate sequence is stored as the phrase candidate sequence corresponding to the maximum likelihood sentence likelihood.

ステップ１５−８において文節候補テーブルから別の文
節候補列が取り出せるかどうかを判定し、取り出せると
きはステップ１５−２に分岐する。そうでないときはリ
ターンする。In step 15-8, it is determined whether another phrase candidate string can be extracted from the phrase candidate table, and if it can be extracted, the process branches to step 15-2. If not, return.

これにより、最尤文尤度を算出するもとになった文節候
補列が第１候補として決定される。As a result, the phrase candidate sequence from which the maximum likelihood sentence likelihood is calculated is determined as the first candidate.

第１６図はステップ１５−３の「用例尤度和の算出」を
詳細化したフローチャートである。FIG. 16 is a detailed flowchart of “calculation of example likelihood sum” in step 15-3.

ステップ１６−１において、用例尤度和をまず０に初期
設定する。In step 16-1, the example likelihood sum is initially initialized to 0.

ステップ１６−２において、文節候補列から１文節を着
目文節として取り出す。以下の処理ではこの着目文節に
関係する用例の尤度を求めることになる。In step 16-2, one phrase is extracted as a focused phrase from the phrase candidate sequence. In the following process, the likelihood of the example related to this focused phrase will be obtained.

ステップ１６−３において最尤用例尤度を０に初期設定
する。In step 16-3, the maximum likelihood example likelihood is initialized to 0.

ステップ１６−４において、着目文節とペアになる文節
を文節候補列から取り出す。In step 16-4, a phrase paired with the focused phrase is extracted from the phrase candidate string.

ステップ１６−５において、着目文節のペア文節を取り
出すことができなかったときは、それ以上の着目文節の
処理を諦め、ステップ１６−８に分岐する。In step 16-5, when the pair phrase of the target phrase cannot be extracted, the process of the further target phrase is abandoned, and the process branches to step 16-8.

ステップ１６−６において、第１７図に詳述するよう
に、着目文節とペア文節の間の用例が適用されているか
どうかを調べ、その適用状況に応じて用例尤度を設定す
る。In step 16-6, as detailed in FIG. 17, it is checked whether or not the example between the target phrase and the pair phrase is applied, and the example likelihood is set according to the application situation.

ステップ１６−７において、求まった用例尤度が最尤用
例尤度より尤もらしい（すなわち大きいか）かどうかを
調べ、尤もらしければ（すなわち大きければ）、用例尤
度の値を最尤用例尤度に代入する。その後ステップ１６
−４に分岐し、別のペアについて用例尤度を求める。In step 16-7, it is checked whether the obtained example likelihood is more likely (that is, larger) than the maximum likelihood example likelihood. To. Then step 16
-4 and obtain the example likelihood for another pair.

ステップ１６−８において、着目文節の処理が終ったの
で、次の着目文節の処理に移るが、未処理の文節が残っ
ているかどうか判定する。残っていないときはそのまま
リターンするが、残っているときはステップ１６−９に
分岐し、前回求まった最尤用例尤度を用例尤度和に加算
する。その後、ステップ１６−２に分岐し、次の着目文
節を取り出す。In step 16-8, since the processing of the target clause has been completed, the processing moves to the next target clause, but it is determined whether or not there is an unprocessed clause. If not, the process returns as it is, but if it remains, the process branches to step 16-9 to add the maximum likelihood example likelihood obtained last time to the example likelihood sum. After that, the process branches to step 16-2, and the next focused phrase is extracted.

第１７図はステップ１６−６の「用例尤度の値設定」を
詳細化したフローチャートである。FIG. 17 is a detailed flow chart of "setting the value of the example likelihood" in step 16-6.

ステップ１７−１において、着目文節とペア文節の２文
節間に適用される用例があるかどうかを判断する。In step 17-1, it is determined whether or not there is an example applied between the two clauses of the target clause and the pair clause.

適用用例がないときはステップ１７−２に分岐し、用例
尤度に０を代入し、リターンする。If there is no application example, the process branches to step 17-2, 0 is assigned to the example likelihood, and the process returns.

適用用例が存在するときはステップ１７−３に分岐し、
用例尤度としてその用例の優先度に４を乗じたものを代
入する。If an application example exists, branch to step 17-3,
As the example likelihood, a value obtained by multiplying the priority of the example by 4 is substituted.

ステップ１７−４において２文節間の距離、すなわち、
飛び越す文節の数を求め、その値に２を乗じて用例尤度
から減算する。もし、隣接する２文節であれば、飛び越
す文節は０であるので、用例尤度から０を減じることに
なる。In step 17-4, the distance between two clauses, that is,
The number of bunsetsus to be skipped is obtained, and the value is multiplied by 2 and subtracted from the example likelihood. If there are two adjacent clauses, the jumping clause is 0, so 0 is subtracted from the example likelihood.

ステップ１７−５において上記計算された用例尤度の値
が負になっていないかどうか判定し、負のときはステッ
プ１７−６において０になるように補正し、リターンす
る。負でないときは値をいじらずにそのままリターンす
る。In step 17-5, it is judged whether or not the value of the calculated example likelihood is negative, and if negative, the value is corrected to 0 in step 17-6, and the process returns. When it is not negative, it returns without changing the value.

第１８図はステップ１２−４の「次候補処理」を詳細化
したフローチャートである。FIG. 18 is a detailed flowchart of the "next candidate process" in step 12-4.

ステップ１８−１において、次候補を見るべき同音語の
同音語番号をテキストバッファＴＢＵＦより求める。In step 18-1, the homophone number of the homophone to see the next candidate is obtained from the text buffer TBUF.

ステップ１８−２において、同音語番号から同音語バッ
ファの位置を求めて、現候補番号をカウントアップす
る。At step 18-2, the position of the homophone word buffer is obtained from the homophone word number and the current candidate number is counted up.

ステップ１８−３において、候補一覧を表示する。In step 18-3, the candidate list is displayed.

第１９図はステップ１２−５の「選択処理」を詳細化し
たフローチャートである。FIG. 19 is a detailed flowchart of the "selection process" in step 12-5.

ステップ１９−１において、選択すべき同音語の同音語
番号をテキストバッファＴＢＵＦより求める。In step 19-1, the homophone number of the homophone to be selected is obtained from the text buffer TBUF.

ステップ１９−２において、同音語番号から同音語バッ
ファの位置を求めて、現候補番号の示す候補情報から表
記を取り出し、テキストバッファＴＢＵＦに確定文字と
して設定する。In step 19-2, the position of the homophone word buffer is obtained from the homophone word number, the notation is extracted from the candidate information indicated by the current candidate number, and is set as the fixed character in the text buffer TBUF.

ステップ１９−３において、同様に、現候補番号の示す
候補情報から適用用例番号を求めその用例の優先度をカ
ウントアップする。Similarly, in step 19-3, the application example number is obtained from the candidate information indicated by the current candidate number, and the priority of the example is counted up.

ステップ１９−４において、同様に、第１候補の示す候
補情報から適用用例番号を求めその用例の優先度をカウ
ントダウンする。In step 19-4, similarly, the application example number is obtained from the candidate information indicated by the first candidate, and the priority of the application example is counted down.

［他の実施例］以上の説明において、用例辞書は共起する２単語（ある
いは分類）のペアであるとして説明したが、３つ組ある
いは更に一般的にｎ個組であっても同様に処理すること
ができる。[Other Embodiments] In the above description, the example dictionary is described as a pair of co-occurring two words (or classifications), but the same processing is performed even if it is a set of three or more generally n sets. can do.

また、尤度の計算は単に例を示しただけであり、用例に
付与された優先度を尤度計算の過程で考慮する様に構成
すれば、別の計算方法であっても特に本発明の趣旨を損
なうものではない。Further, the calculation of the likelihood is merely an example, and if the priority given to the example is configured to be considered in the process of the likelihood calculation, even if another calculation method is used, the calculation of It does not spoil the purpose.

また、用例の学習の仕組みは各用例に記述された優先度
を直接操作するように構成している。しかし、この方式
以外に、各用例に対応した学習ビットを持ち、学習ビッ
トを操作するように処理しても、特に本発明の趣旨を損
なうものではない。Moreover, the learning mechanism of the examples is configured to directly operate the priority described in each example. However, other than this method, even if the learning bit corresponding to each example is provided and the learning bit is processed, the gist of the present invention is not particularly impaired.

［発明の効果］以上説明したように、本発明によれば、表記候補の決定
において、複数の意味用例が適用可能な場合に、意味用
例自体の優先度と、用例間に存在する文節数とを用いた
演算により用例の尤度を求め、この尤度に基づいて、表
記候補の優先度を決定するようにしたので、より適切な
候補へ優先的に変換を行うことができるという効果があ
る。[Effects of the Invention] As described above, according to the present invention, when a plurality of meaning examples are applicable in the determination of a notation candidate, the priority of the meaning example itself and the number of clauses existing between the examples. Since the likelihood of the example is obtained by the calculation using, and the priority of the notation candidate is determined based on this likelihood, it is possible to preferentially perform conversion to a more appropriate candidate. .

また、意味分類の優先度情報を学習により変更すること
ができるので、単語を特殊な意味に使用するオペレータ
にとっても、自分の好みの変換に改良することができ
る。In addition, since the priority information of the meaning classification can be changed by learning, it is possible for an operator who uses a word for a special meaning to improve the conversion to his or her preference.

これにより変換率の高い、快適な文字処理装置を実現す
ることができる。This makes it possible to realize a comfortable character processing device with a high conversion rate.

[Brief description of drawings]

第１図は本発明の全体構成のブロック図、第２図は本発明における仮名漢字変換の変換例を示した
図、第３図は本発明における用例の優先度の学習効果を示し
た図、第４図は本発明における入力バッファＩＢＵＦと出力バ
ッファＯＢＵＦの構成を示した図、第５図は本発明における仮名漢字変換用辞書ＤＩＣの構
成を示した図、第６図は本発明における用例辞書ＹＤＩＣの構成を示し
た図、第７図は本発明における文節候補テーブルＢＣＴＢＬの
記憶内容を概念的に示した図、第８図は本発明における文節候補テーブルＢＣＴＢＬの
構成を示した図、第９図は本発明におけるテキストバッファＴＢＵＦの構
成を示した図、第１０図は本発明における同音語バッファＤＯＢＵＦの
構成を示した図、第１１図は本発明における尤度計算の方式を示した図、第１２図〜第１９図は本発明文字処理装置の動作を示す
フローチャート。ＤＩＳＫ……外部記憶部ＣＰＵ……マイクロプロセッサＲＯＭ……読出し専用メモリＲＡＭ……ランダムアクセスメモリＩＢＵＦ……入力バッファＯＢＵＦ……出力バッファＤＩＣ……仮名漢字変換用辞書ＹＤＩＣ……用例辞書ＢＣＴＢＬ……文節候補テーブルＴＢＵＦ……テキストバッファＤＯＢＵＦ……同音語バッファFIG. 1 is a block diagram of the overall configuration of the present invention, FIG. 2 is a diagram showing a conversion example of kana-kanji conversion in the present invention, and FIG. 3 is a diagram showing a priority learning effect of an example in the present invention, FIG. 4 is a diagram showing a configuration of an input buffer IBUF and an output buffer OBUF in the present invention, FIG. 5 is a diagram showing a configuration of a kana-kanji conversion dictionary DIC in the present invention, and FIG. 6 is an example dictionary in the present invention. The figure which showed the structure of YDIC, FIG. 7 is the figure which showed notionally the memory content of the phrase candidate table BCTBL in this invention, FIG. 8 is the figure which showed the structure of the phrase candidate table BCTBL in this invention, 9th. The figure shows the structure of the text buffer TBUF in the present invention, FIG. 10 shows the structure of the homophone word buffer DOBUF in the present invention, and FIG. 11 shows the likelihood calculation method in the present invention. The figure which showed the formula, FIGS. 12-19 is a flowchart which shows operation | movement of the character processing apparatus of this invention. DISK ... External storage unit CPU ... Microprocessor ROM ... Read-only memory RAM ... Random access memory IBUF ... Input buffer OBUF ... Output buffer DIC ... Kana-kanji conversion dictionary YDIC ... Example dictionary BCTBL ... Candidate table TBUF ... Text buffer DOBUF ... Homophone buffer

Claims

[Claims]

1. An input means for inputting a kana character string, a word dictionary means for storing a reading of a word, a notation and a semantic classification in association with each other, a word notation, and a word used in combination with the word. As an example in association with the notation or the semantic classification, the example dictionary means for storing each example with the priority of the example, and the kana character string input by the input means with reference to the word dictionary means When there is an example in which the meaning classification of a word used in combination with a candidate creation unit that creates a candidate for a notation that reads, and a notation candidate for the same reading created by the candidate creation unit is applicable, The likelihood of the example is obtained by an operation using the priority of the example and the number of phrases between the phrases including each of the words corresponding to the example, and based on the likelihood, display priority of the notation candidate is given. Display to determine the degree A character processing device comprising: priority determination means; and candidate display means for displaying the notation candidates created by the candidate creation means based on the display priority determined by the display priority determination means. .

2. A selecting means for selecting one of the candidates displayed by the candidate displaying means, and a selecting means for selecting one of the candidates other than the candidate displayed with the highest priority by the selecting means. Regarding the example related to the selected candidate or the candidate displayed with the highest priority,
2. The character processing device according to claim 1, further comprising a priority updating unit that updates the priority of the example stored in the example dictionary unit.