JPH0728806A

JPH0728806A - Character processor and method therefor

Info

Publication number: JPH0728806A
Application number: JP5170472A
Authority: JP
Inventors: Eiichiro Toshima; 英一朗戸島; Hiroki Suzuki; 大記鈴木; Shuichi Morisawa; 秀一森澤; Atsushi Sakakibara; 厚志榊原; Kazuyo Ikeda; 和世池田; Yuji Kobayashi; 雄二小林; Hitoshi Yururi; 仁志緩利; Chika Ogusu; 千佳小楠; Mayumi Morita; 麻友美森田; Mariko Kishino; 麻里子岸野
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 1993-07-09
Filing date: 1993-07-09
Publication date: 1995-01-31

Abstract

PURPOSE:To save the handling time of the selection of a homonym candidate and to improve operability by estimating the field information of a document at the time of calling the document and changing the sequence of the conversion candidate of a KANA(Japanese syllabary)/KANJI(Chinese character) according to the estimation. CONSTITUTION:A fixed memory ROM has an inter-field degree of association table WTBL and a keyword dictionary KDI, and a random access memory RAM is provided with an input buffer IBUF storing the reading column for which a key input is performed or the conversion result of a KANA/KANJI conversion word, a homonym buffer DOBUF storing the homonym candidate of the conversion result and a document buffer TBUF storing a field information table BTBL and document data. By the input from a keyboard KB, a CPU reads the control signal of the ROM and various kinds of controls are performed. Namely, the input is converted into a notation by referring to a dictionary KDIC, etc., and the document edited by the notation is stored. It is called, the field information corresponding to a routine document is called, the field information to which the document belongs is estimated, and the information is set to a storage means.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、仮名文字を入力し、仮
名漢字変換により漢字仮名混じり文を編集する文字処理
装置及びその方法に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a character processing apparatus and method for inputting kana characters and editing kana-kana mixed sentences by kana-kanji conversion.

【０００２】[0002]

【従来の技術】現在、日本ワードプロセッサなどの文字
処理装置では、漢字仮名混り文の入力を仮名漢字変換を
使って行なうことが一般的である。仮名漢字変換方式の
最大の問題点は同音語である。例えば、読み「きこう」
に対しては、「気候」「機構」「奇行」「貴校」などと
いくつもの変換候補が存在するが、このうちオペレータ
がどれを望んでいるかは、装置には知り得ないことであ
る。そのため、考えられる全ての候補を一般的な日本語
としてありそうな順序で装置が表示し、オペレータが表
示された候補から１つを選択するという方式が一般的な
構成となっている。2. Description of the Related Art At present, in a character processing device such as a Japanese word processor, it is common to input kanji / kana mixed sentences using kana / kanji conversion. The biggest problem of the Kana-Kanji conversion method is the homophone. For example, read "Kiko"
There are many conversion candidates such as "climate", "mechanism", "eccentricity", "your school", etc., but it is impossible for the device to know which one the operator wants. Therefore, a general configuration is a system in which all possible candidates are displayed in a general Japanese language in a likely order, and the operator selects one from the displayed candidates.

【０００３】[0003]

【発明が解決しようとしている課題】ところが、変換候
補の表示順序は一般的な観点に基づいて定められるた
め、必ずしもオペレータの望む候補が上位に表示される
とは限らない。例えば、先の「きこう」についていう
と、地理や地学を話題とする文書を入力する人にとって
は「気候」が最も自然な変換であるが、特許などの工学
的な文書を作成している人にとっては「機構」が最も自
然な変換である。このように、作成文書が話題とする分
野によって、変換候補の順序は異なるのが普通である。
もちろん、工学的文書でも「気候」という単語が全く出
現し得ないわけではないが、可能性は低く、工学分野の
文書では「機構」が最も自然な第１候補である。However, since the display order of the conversion candidates is determined based on a general viewpoint, the candidates desired by the operator are not always displayed at the top. For example, with regard to "Kikou" mentioned above, "climate" is the most natural conversion for people who enter documents that discuss geography and geology, but people who create engineering documents such as patents. For me, "mechanism" is the most natural transformation. In this way, the order of conversion candidates is usually different depending on the field in which the created document is a topic.
Of course, the word "climate" can never appear in an engineering document, but it is unlikely, and "mechanism" is the most natural first candidate in engineering documents.

【０００４】しかしながら、従来装置では変換候補の順
序は固定的であり、地理分野の文書を入力する時も、工
学分野の文書を入力する時も、同じ順序で変換候補は表
示されていた。もし作成文書の分野情報がわかっていれ
ば、より尤もらしい変換候補を上位に変換することがで
きる。However, in the conventional apparatus, the order of conversion candidates is fixed, and the conversion candidates are displayed in the same order both when inputting a document in the geographical field and when inputting a document in the engineering field. If the field information of the created document is known, more likely conversion candidates can be converted to higher ranks.

【０００５】このため、入力中の仮名漢字変換の過程か
ら分野情報を抽出し、それを次の仮名漢字変換に利用す
ることで変換率を向上させる試みも提案されている。そ
の方式では入力過程で徐々に分野情報が抽出され、仮名
漢字変換時にはその分野情報を参照して分野の単語を優
先して変換するというものである。しかし、その方式で
は入力によって分野情報が蓄積されるので、例えば、定
型文書を読み込んだ場合などは分野情報が蓄積されない
ので、変換率の向上につながらないという欠点があっ
た。For this reason, there has been proposed an attempt to improve the conversion rate by extracting field information from the process of Kana-Kanji conversion during input and using it for the next Kana-Kanji conversion. According to this method, field information is gradually extracted in the input process, and when converting to Kana / Kanji, the field information is referred to and words in the field are preferentially converted. However, in that method, since the field information is stored by input, the field information is not stored when, for example, a standard document is read, and thus there is a drawback that the conversion rate cannot be improved.

【０００６】[0006]

【発明が解決しようとしている課題】本発明は上述の欠
点に鑑みてなされたものであり、その目的とするところ
は、文書呼出時にその文書の分野情報を推定し、その推
定した分野情報に応じて仮名漢字変換の変換候補の順序
を変化させることにより、同音語候補選択の手間の少な
い操作性の高い仮名漢字変換入力を実現する文字処理装
置及びその方法を提供することにある。SUMMARY OF THE INVENTION The present invention has been made in view of the above-mentioned drawbacks, and an object of the present invention is to estimate the field information of a document at the time of calling the document, and to determine the field information according to the estimated field information. It is an object of the present invention to provide a character processing device and a method for changing the order of conversion candidates for kana-kanji conversion to realize kana-kanji conversion input with high operability and with less trouble of selecting homophone words.

【０００７】[0007]

【課題を解決するための手段】上述した課題を解決し、
上記目的を達成するため、本発明によれば、読みを仮名
文字列により入力して、漢字仮名混じり表記を含む文書
を編集する文字処理装置において、読みを入力する入力
手段と、単語の読みと、表記及び分野情報を含む単語情
報とを対応づけて記憶した辞書手段と、編集中の文書の
分野情報を記憶する分野情報記憶手段と、前記入力手段
より入力された読みを、前記分野情報記憶手段及び前記
辞書手段を参照して、表記に変換する変換手段と、前記
変換手段によって得られた表記により編集された文書を
記憶する文書記憶手段と、補助記憶部に保存された文書
を前記文書記憶手段に呼び出す呼び出し手段と、前記呼
び出し手段により呼び出された文書から該文書の属する
分野情報を推定する推定手段と、該推定手段により推定
された分野情報を、前記分野情報記憶手段に設定する設
定手段とを備えることを特徴とする。[Means for Solving the Problems]
In order to achieve the above object, according to the present invention, in a character processing device for inputting a reading by a kana character string and editing a document including a kanji kana mixed notation, an input means for inputting the reading and a word reading , Dictionary information in which word information including notation and field information is stored in association with each other, field information storage means for storing field information of a document being edited, and reading input by the input means is stored in the field information storage. Unit and the dictionary unit, a conversion unit for converting into a notation, a document storage unit for storing a document edited by the notation obtained by the conversion unit, and a document saved in an auxiliary storage unit for the document A calling means for calling the storage means; an estimating means for estimating the field information to which the document belongs from the document called by the calling means; and a field information estimated by the estimating means. Characterized in that it comprises a setting means for setting the field information storage unit.

【０００８】また、本発明の他の態様によれば、読みを
仮名文字列により入力して、漢字仮名混じり表記の文書
を編集する文字処理装置において、補助記憶部に保存さ
れた文書を文書メモリに呼び出し、呼び出された前記文
書から該文書の属する分野情報を推定し、該推定された
分野情報を、分野情報メモリに設定し、読みを入力し、
入力された前記読みを、単語の読みと、表記及び分野情
報を含む単語情報とを対応づけて記憶した辞書及び前記
分野情報メモリを参照して、表記に変換し、前記変換に
よって得られた表記により文書メモリに記憶された前記
定型文書を編集する文字処理方法を特徴とする。According to another aspect of the present invention, in a character processing device for inputting a reading by a kana character string and editing a document mixed with kanji and kana, the document stored in the auxiliary storage unit is stored in the document memory. To estimate the field information to which the document belongs from the called document, set the estimated field information in the field information memory, and input reading.
The input reading is converted into the notation by referring to the dictionary and the field information memory in which the word reading and the word information including the notation and the field information are stored in association with each other, and the notation obtained by the conversion And a character processing method for editing the standard document stored in the document memory.

【０００９】[0009]

【作用】かかる構成により、入力手段より入力された読
みを、変換手段が分野情報記憶手段及び辞書手段を参照
して、表記に変換し、得られた表記により編集された文
書を文書記憶手段に記憶する。呼び出し手段が、補助記
憶部から文書を前記文書記憶手段に呼び出すと、該定型
文書に対応する分野情報を前記分野情報記憶手段に呼び
出し、推定手段は呼び出された文書から該文書の属する
分野情報を推定し、設定手段が推定された分野情報を分
野情報記憶手段に設定する。With this structure, the conversion unit converts the reading input from the input unit into a notation by referring to the field information storage unit and the dictionary unit, and the document edited by the obtained notation is stored in the document storage unit. Remember. When the calling unit calls the document from the auxiliary storage unit to the document storage unit, the field information corresponding to the standard document is called to the field information storage unit, and the estimation unit acquires the field information to which the document belongs from the called document. The field information which is estimated and set by the setting means is set in the field information storage means.

【００１０】[0010]

【実施例】以下に添付図面を参照して、本発明に係る好
適な一実施例を詳細に説明する。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS A preferred embodiment of the present invention will be described in detail below with reference to the accompanying drawings.

【００１１】〈第１の実施例〉図１は本発明に係る文字
処理装置の第１の実施例を示すブロック図である。同図
において、ＣＰＵは、マイクロプロセッサであり、文字
処理のための演算、論理判断等を行ない、アドレスバス
ＡＢ、コントロールバスＣＢ、データバスＤＢを介し
て、それらのバスに接続された各構成要素を制御する。<First Embodiment> FIG. 1 is a block diagram showing a first embodiment of a character processing apparatus according to the present invention. In the figure, a CPU is a microprocessor, which performs operations for character processing, logical judgments, etc., and each component connected to these buses via an address bus AB, a control bus CB, and a data bus DB. To control.

【００１２】アドレスバスＡＢはマイクロプロセッサＣ
ＰＵの制御の対象とする構成要素を指示するアドレス信
号を転送する。コントロールバスＣＢはマイクロプロセ
ッサＣＰＵの制御の対象とする各構成要素のコントロー
ル信号を転送して印加する。データバスＤＢは各構成機
器相互間のデータの転送を行なう。The address bus AB is a microprocessor C
An address signal for instructing a component to be controlled by the PU is transferred. The control bus CB transfers and applies a control signal of each constituent element to be controlled by the microprocessor CPU. The data bus DB transfers data between the constituent devices.

【００１３】ＤＩＣは辞書であり、仮名漢字変換を行な
う際に参照する。DIC is a dictionary, which is referred to when performing Kana-Kanji conversion.

【００１４】つぎにＲＯＭは、読出し専用の固定メモリ
であり、図１２〜図１８につき後述するマイクロプロセ
ッサＣＰＵによる制御の手順、及び、固定的なデータを
記憶する。ＷＴＢＬは分野間関連度テーブルである。Ｋ
ＤＩＣはキーワード辞書である。ＫＡＮＪＩＤは漢字辞
書である。Next, the ROM is a read-only fixed memory that stores control procedures by the microprocessor CPU, which will be described later with reference to FIGS. 12 to 18, and fixed data. WTBL is an inter-field relevance table. K
DIC is a keyword dictionary. KANJID is a Kanji dictionary.

【００１５】また、ＲＡＭは、１ワード１６ビットの構
成の書込み可能のランダムアクセスメモリであって、各
構成要素からの各種データの一時記憶に用いる。ＩＢＵ
Ｆはキー入力された読み列あるいは仮名漢字変換語の変
換結果を記憶する入力バッファである。ＤＯＢＵＦは仮
名漢字変換された変換結果の同音語候補を格納する同音
語バッファである。ＢＴＢＬは分野情報テーブルであ
る。ＴＢＵＦは文書データを記憶する文書バッファであ
る。The RAM is a writable random access memory having a structure of 1 word 16 bits, and is used for temporary storage of various data from each constituent element. IBU
F is an input buffer for storing the conversion result of the phonetic string or the kana-kanji conversion word keyed in. DOBUF is a homophone word buffer that stores the homophone word candidates of the conversion result obtained by the kana-kanji conversion. BTBL is a field information table. TBUF is a document buffer that stores document data.

【００１６】ＫＢはキーボードであって、アルファベッ
トキー、ひらかなキー、カタカナキー等の文字記号入力
キー、及び、変換キー、実行キー、文書呼出キー、文書
保存キー、などの本文字処理装置に対する各種機能を指
示するための各種のファンクションキーを備えている。Reference numeral KB denotes a keyboard, which is a character symbol input key such as an alphabet key, a hiragana key, or a katakana key, and various types of character processing devices such as a conversion key, an execution key, a document call key, and a document save key. It is equipped with various function keys for instructing functions.

【００１７】ＤＩＳＫは文書データを記憶するための外
部記憶であり、文書バッファ上に作成された文書の保管
を行ない、保管された文書はキーボードの指示により、
必要な時呼び出される。DISK is an external storage for storing document data, and stores a document created in the document buffer. The stored document is instructed by the keyboard.
Called when needed.

【００１８】ＣＲはカーソルレジスタである。ＣＰＵに
より、カーソルレジスタの内容を読み書きできる。後述
するＣＲＴコントローラＣＲＴＣは、ここに蓄えられた
アドレスに対応する表示装置ＣＲＴ上の位置にカーソル
を表示する。CR is a cursor register. The CPU can read and write the contents of the cursor register. The CRT controller CRTC described later displays a cursor at a position on the display device CRT corresponding to the address stored here.

【００１９】ＤＢＵＦは表示用バッファメモリで、表示
すべきデータを蓄える。DBUF is a display buffer memory for storing data to be displayed.

【００２０】ＣＲＴＣはカーソルレジスタＣＲ及びバッ
ファＤＢＵＦに蓄えられた内容を表示器ＣＲＴに表示す
る役割を担う。The CRTC plays a role of displaying the contents stored in the cursor register CR and the buffer DBUF on the display CRT.

【００２１】またＣＲＴは陰極線管等を用いた表示装置
であり、その表示装置ＣＲＴにおけるドット構成の表示
パターンおよびカーソルの表示をＣＲＴコントローラで
制御する。The CRT is a display device using a cathode ray tube or the like, and the display pattern of the dot configuration and the display of the cursor on the display device CRT are controlled by the CRT controller.

【００２２】さらに、ＣＧはキャラクタジェネレータで
あって、表示装置ＣＲＴに表示する文字、記号のパター
ンを記憶するものである。Further, CG is a character generator, which stores a pattern of characters and symbols to be displayed on the display device CRT.

【００２３】かかる各構成要素からなる本発明文字処理
装置においては、キーボードＫＢからの各種の入力に応
じて作動するものであって、キーボードＫＢからの入力
が供給されると、まず、インタラプト信号がマイクロプ
ロセッサＣＰＵに送られ、そのマイクロプロセッサＣＰ
ＵがＲＯＭ内に記憶してある各種の制御信号を読出し、
それらの制御信号に従って各種の制御が行なわれる。The character processing apparatus of the present invention comprising the above-mentioned components operates in response to various inputs from the keyboard KB, and when an input from the keyboard KB is supplied, an interrupt signal is first sent. Sent to the microprocessor CPU and its microprocessor CP
U reads out various control signals stored in the ROM,
Various controls are performed in accordance with those control signals.

【００２４】図２は、第１の実施例による分野処理の例
を示した図である。FIG. 2 is a diagram showing an example of field processing according to the first embodiment.

【００２５】図中アンダラインは入力中の読み列を示
し、黒い矩形はカーソルを意味する。（ａ）は地理分野
を話題にしているので、仮名漢字変換の過程で文書の分
野情報が地理分野の方に傾き、「きこうの」を変換する
と「気候の」に変換される。（ｂ）は工学分野を話題に
しているので、仮名漢字変換の過程で文書の分野情報が
工学分野の方に傾き、「きこうの」を変換すると「機構
の」に変換される。In the figure, an underline indicates a reading line being input, and a black rectangle means a cursor. Since (a) is talking about the geographic field, the field information of the document is inclined toward the geographic field in the process of Kana-Kanji conversion, and when "Kikono" is converted, it is converted to "climatic". Since (b) is talking about the engineering field, the field information of the document is inclined toward the engineering field in the process of Kana-Kanji conversion, and when "Kikono" is converted, it is converted to "Mechanical".

【００２６】図３は辞書ＤＩＣの構造を示した図であ
る。FIG. 3 is a diagram showing the structure of the dictionary DIC.

【００２７】仮名漢字変換のための単語情報を記憶す
る。「読み」には単語の読みを記憶する。「表記」には
単語の表記を記憶する。「文法情報」には、品詞情報な
どの単語の文法情報を記憶する。「頻度」には単語の頻
度を記憶する。頻度は１から５の範囲内で記述し、５が
最も尤もらしい単語であることを意味する。「分野情
報」にはその単語の属する分野ｉｄとその分野の確信度
を記憶する。１つの単語に複数の分野ｉｄが記述でき
る。分野ｉｄは、分野１〜分野ｎまでのどれかである。
図中は分かりやすいように「地理」「地学」「工学」
「事務」などと記述している。確信度は０〜１の範囲の
値を取る。１は１００％確実にその分野の単語であるこ
と意味する。０はその分野とは縁もゆかりもない単語で
あることを意味する。分野が記述されていない場合は確
信度０と解釈し、確信度０の分野は辞書には記述しな
い。全くどの分野にも属さない単語（例えば、「この」
など）はこの「分野情報」の記述はないことになる。Word information for kana-kanji conversion is stored. The reading of the word is stored in the "reading". The “notation” stores the notation of the word. The "grammar information" stores word grammatical information such as part-of-speech information. The "frequency" stores the frequency of the word. The frequency is described within the range of 1 to 5, meaning that 5 is the most likely word. The "field information" stores the field id to which the word belongs and the certainty factor of the field. Multiple field ids can be described in one word. The field id is any of field 1 to field n.
“Geography”, “Geography”, and “Engineering” for clarity
It is described as "office work". The certainty factor takes a value in the range of 0 to 1. 1 means 100% surely a word in the field. 0 means that the word has no relation to the field. When the field is not described, it is interpreted as the confidence level 0, and the field with the confidence level 0 is not described in the dictionary. Words that do not belong to any category at all (for example, "this")
Etc.) means that there is no description of this “field information”.

【００２８】ある単語が選択された時、その単語の属す
る分野とその確信度に応じて、文書の分野情報が更新さ
れる。When a word is selected, the field information of the document is updated according to the field to which the word belongs and its certainty factor.

【００２９】図４はキーワード辞書ＫＤＩＣの構造を示
した図である。FIG. 4 is a diagram showing the structure of the keyword dictionary KDIC.

【００３０】キーワード辞書は、分野特有の読み列を記
憶した辞書であり、そこに記述された読みが入力される
だけで、その文書の分野が更新される。The keyword dictionary is a dictionary in which reading strings specific to a field are stored, and the field of the document is updated only by inputting the reading written therein.

【００３１】「読み」にはキーワードの読み列が記憶さ
れる。「分野情報」には辞書ＤＩＣに示したのと同等な
情報（分野ｉｄと確信度）が格納される。分野情報には
複数の分野を記述することもできる。In “reading”, a reading string of keywords is stored. In the “field information”, information equivalent to that shown in the dictionary DIC (field ID and certainty factor) is stored. Multiple fields can be described in the field information.

【００３２】図５は入力バッファＩＢＵＦの構成を示し
た図である。FIG. 5 is a diagram showing the structure of the input buffer IBUF.

【００３３】入力バッファＩＢＵＦは打鍵された読み
列、その読み列を仮名漢字変換した変換結果、次候補表
示を指示した場合の次候補の表記を格納する。入力バッ
ファＩＢＵＦに格納された文字列は画面上にアンダライ
ン付きで表示される。The input buffer IBUF stores a keystroked reading string, a conversion result obtained by converting the reading string to kana-kanji characters, and a notation of the next candidate when the next candidate display is instructed. The character string stored in the input buffer IBUF is displayed with an underline on the screen.

【００３４】（ａ）は読み列「きこうの」が入力された
状態を示した図である。ここで変換キーが打鍵されると
仮名漢字変換が行なわれ、（ｂ）のように変換結果「機
構の」が格納される。更に変換キーが打鍵されると
（ｃ）のように次候補「気候の」が格納される。(A) is a diagram showing a state in which the reading string "Kikono" is input. When the conversion key is pressed here, the kana-kanji conversion is performed, and the conversion result "Mechanism" is stored as shown in (b). When the conversion key is further pressed, the next candidate “climate” is stored as shown in (c).

【００３５】図６は同音語バッファＤＯＢＵＦの構成を
示した図である。FIG. 6 is a diagram showing the structure of the homophone word buffer DOBUF.

【００３６】読み列が仮名漢字変換されると、その同音
語情報が同音語バッファＤＯＢＵＦに格納される。次候
補表示などが指示されると、同音語バッファＤＯＢＵＦ
中で候補Ｎｏに示された候補の表記が、入力バッファＩ
ＢＵＦに格納される。When the reading string is converted into kana-kanji, the same-sound word information is stored in the same-sound word buffer DOBUF. When the next candidate display is instructed, the homophone word buffer DOBUF
In the input buffer I, the notation of the candidate indicated by the candidate No.
It is stored in BUF.

【００３７】「候補Ｎｏ」は、現在表示中の候補を示し
た変数である。「第ｉ候補」は仮名漢字変換結果のｉ番
目の順位に表示される候補に関する情報を記憶する。
「表記」はその変換候補の表記を記憶する。「頻度」は
その候補の自立語の頻度（辞書ＤＩＣ中に記載されてい
た値）をコピーしたものである。「分野情報ｊ」はその
自立語の持つ分野ｉｄと確信度を辞書ＤＩＣからコピー
したものである。"Candidate No." is a variable indicating the currently displayed candidate. The “i-th candidate” stores information about the candidate displayed in the i-th rank of the kana-kanji conversion result.
The “notation” stores the notation of the conversion candidate. “Frequency” is a copy of the frequency of the independent word of the candidate (the value described in the dictionary DIC). "Field information j" is a copy of the field id and the certainty factor of the independent word from the dictionary DIC.

【００３８】図７は分野情報テーブルＢＴＢＬの構成を
示した図である。FIG. 7 is a diagram showing the structure of the field information table BTBL.

【００３９】分野情報テーブルＢＴＢＬは、その文書の
仮名漢字変換入力に応じて蓄積された分野情報を記憶す
るものである。分野１〜分野ｎの一つ一つに対してその
分野の尤もらしさ（尤度）を記憶する。図中、分野１は
「工学分野」、分野２は「地理分野」分野３は「医学分
野」として示している。分野ｉの尤度が高いということ
は、その文書が分野ｉに属する可能性が高いということ
を意味する。The field information table BTBL stores field information accumulated in response to kana-kanji conversion input of the document. For each of the fields 1 to n, the likelihood (likelihood) of the field is stored. In the figure, the field 1 is shown as "engineering field", the field 2 as "geographical field", and the field 3 as "medical field". The high likelihood of the field i means that the document is likely to belong to the field i.

【００４０】図８は分野間関連度テーブルＷＴＢＬの構
成を示した図である。FIG. 8 is a diagram showing the structure of the inter-field relation degree table WTBL.

【００４１】分野間関連度テーブルは、分野間の関連度
を示したテーブルである。行ｉ列ｊの交点にある値Ｗ_ij
は、分野ｊが分野ｉに与える影響の大きさを示し、影響
が大きいほど大きな値となる。分野ｊが分野ｉに与える
影響度とは、言い換えると、分野ｊの単語が選択された
時、入力中の文書の分野ｉの尤度を上げるべきなのか
（関連度が大）、分野ｉの尤度を下げるべきなのか（関
連度がマイナス）、分野ｉの尤度を変更しない方が良い
のか（関連度が０）を示している。行ｋ列ｋの交点Ｗ_kk
は、分野ｋが分野ｋ自身に与える影響であり、当然正の
値を持つ。The inter-field relevance table is a table showing relevance between fields. The value W _{ij at} the intersection of row i column j
Indicates the magnitude of the influence of the field j on the field i, and the larger the effect, the larger the value. In other words, the degree of influence of the field j on the field i means that when the word of the field j is selected, the likelihood of the field i of the document being input should be increased (the degree of association is large). It indicates whether the likelihood should be lowered (relevance is negative), or it is better not to change the likelihood of field i (relevance is 0). Intersection W _kk of row k column k
Is an effect of the field k on the field k itself and naturally has a positive value.

【００４２】図９は各漢字コードがどの分野に属してい
るかを示しているかを記述する漢字辞書ＫＡＮＪＩＤの
構成を示した図である。FIG. 9 is a diagram showing the structure of a Kanji dictionary KANJID which describes to which field each Kanji code belongs.

【００４３】分野情報はその単語の属する分野ｉｄとそ
の分野の確信度を記憶する。１つの漢字に複数の分野ｉ
ｄが記述できる。分野ｉｄは、分野１〜分野ｎまでのど
れかである。図中は分かりやすいように「地理」「地
学」「工学」「事務」などと記述している。確信度は０
〜１の範囲の値を取る。１は１００％確実にその分野の
単語であること意味する。０はその分野とは縁もゆかり
もない単語であることを意味する。分野が記述されてい
ない場合は確信度０と解釈し、確信度０の分野は辞書に
は記述しない。全くどの分野にも属さない漢字（例え
ば、「此」など）は分野情報の記述はない。The field information stores the field id to which the word belongs and the certainty factor of the field. Multiple fields i in one kanji
d can be described. The field id is any of field 1 to field n. In the figure, "geography", "geography", "engineering", "office work", etc. are described for easy understanding. Confidence is 0
Take a value in the range of ~ 1. 1 means 100% surely a word in the field. 0 means that the word has no relation to the field. When the field is not described, it is interpreted as the confidence level 0, and the field with the confidence level 0 is not described in the dictionary. Kanji that do not belong to any field (for example, "Kono") does not have field information.

【００４４】例えば、漢字「鉱」は地理分野に０．６、
地学分野に０．６属していることがこのテーブルから読
み取れる。For example, the Chinese character "min" is 0.6 in the geographic field,
It can be read from this table that it belongs to the geological field 0.6.

【００４５】図１０は、仮名漢字変換における第１候補
決定のための評価式を示している。文節尤度は、変換単
語の単語頻度と、変換単語の「分野尤度」との和で求め
られる。ここで、変換単語の「分野尤度」は、変換単語
の分野ｋに対する確信度（Ｃ_k とする）と、その時の分
野情報テーブル上の分野ｋの尤度（Ｂ_k とする）との積
Ｂ_k ×Ｃ_k の最大値で求められる。FIG. 10 shows an evaluation formula for determining the first candidate in kana-kanji conversion. The phrase likelihood is calculated as the sum of the word frequency of the converted word and the “field likelihood” of the converted word. Here, the “category likelihood” of the converted word is the product of the certainty factor of the converted word with respect to the field k ( _denoted by C _k ) and the likelihood of the field k on the field information table at that time ( _denoted by B _k ). It is determined by the maximum value of B _k × C _k .

【００４６】以上のようにして求められた文節尤度の最
も大きい変換単語が、第１候補として変換され、第２候
補以下は文節尤度の大きい順に表示される。The converted word having the largest phrase likelihood obtained as described above is converted as the first candidate, and the second and subsequent candidates are displayed in descending order of the phrase likelihood.

【００４７】図１１は、単語が選択された時の分野情報
テーブルの更新方法を示したものである。FIG. 11 shows a method of updating the field information table when a word is selected.

【００４８】ある単語が選択された時、その単語の属す
る分野に従って、以下のように分野情報テーブルの各分
野の尤度を更新する。When a word is selected, the likelihood of each field in the field information table is updated as follows according to the field to which the word belongs.

【００４９】まず選択単語の分野ｋの確信度（Ｃ_k とす
る）を辞書ＤＩＣから求め、次に分野ｋが分野ｉに与え
る影響度Ｗ_kiを分野間関連度テーブルから求め、確信度
Ｃ_kに乗じる。なお、選択単語の分野情報に分野ｋの記
述がなかった時は、分野ｋの確信度は０とする。この乗
じた値を分野１から分野ｎまでの和を取り、分野情報テ
ーブルの分野ｉの尤度（Ｂ_i とする）に加える。この分
野ｉの尤度Ｂ_i の計算を分野１から分野ｎまでの尤度に
ついて行なう。First, the certainty factor ( _denoted as C _k ) of the field k of the selected word is obtained from the dictionary DIC, then the influence degree W _ki that the field k has on the field i is obtained from the inter-field relevance table, and the certainty factor C _{k is obtained.} Get on. If there is no description of the field k in the field information of the selected word, the certainty factor of the field k is 0. The value obtained by this multiplication is summed from the fields 1 to n and added to the likelihood (denoted as B _i ) of the field i in the field information table. The calculation of the likelihood B _i of the field i is performed on the likelihoods of the fields 1 to n.

【００５０】また、第１候補で変換した単語が選択され
なかった時（第１候補以外の単語が選択された時）は、
以下のように分野情報テーブルの分野の尤度を更新す
る。Further, when the word converted by the first candidate is not selected (when a word other than the first candidate is selected),
The likelihood of the field in the field information table is updated as follows.

【００５１】まず、分野ｋの単語の非選択が分野ｉに与
える非選択影響度（図中、Ｘで記述）を定義する。ここ
では、分野ｉが分野ｉに与える非選択影響度は、分野ｉ
が分野ｉに与える影響度Ｗ_iiのマイナスであり、ｉ≠ｊ
のときの分野ｊが分野ｉに与える非選択影響度は０であ
ると定義する。First, the degree of non-selection influence (indicated by X in the figure) on non-selection of words in the field k on the field i is defined. Here, the degree of non-selection influence of the field i on the field i is the field i.
Is a negative value of the influence W _{ii on} the field i, i ≠ j
In this case, the degree of non-selection influence given to the field i by the field j is 0.

【００５２】次に、第１候補で変換されたが選択されな
かった単語（非選択単語）の分野ｋの確信度（ｄ_k とす
る）を求め、それに分野ｋが分野ｉに与える非選択影響
度を乗じる。この乗じた値を分野１から分野ｎまでの和
を取り、分野情報テーブルの分野ｉの尤度Ｂ_i に加え
る。この分野ｉの尤度の計算を分野１から分野ｎまでの
尤度について行なう。これにより、第１候補で変換した
単語の属する分野が選択された時とは逆方向に分野情報
テーブルに影響を与えることになる。Next, the certainty factor ( _denoted by d _k ) of the field k of the word which has been converted by the first candidate but not selected (non-selected word) is obtained, and the non-selection influence of the field k on the field i is obtained. Multiply by degrees. The sum of the fields 1 to n is added to the multiplied value and added to the likelihood B _i of the field i in the field information table. The likelihood calculation for the field i is performed for the likelihoods from the field 1 to the field n. As a result, the field information table is affected in the opposite direction from when the field to which the word converted by the first candidate belongs is selected.

【００５３】図１２は、定型文書が呼び出され分野を推
定する際の、分野情報テーブルの更新方法を示したもの
である。FIG. 12 shows a method of updating the field information table when the standard document is called and the field is estimated.

【００５４】呼び出される定型文書に含まれる１漢字１
漢字について、この式に従って分野情報テーブルが更新
される。1 Kanji included in the called standard document 1
For kanji, the field information table is updated according to this formula.

【００５５】まず漢字の分野ｋにおける確信度Ｃ_k を求
め、それに分野ｋが分野ｉに与える影響度Ｗ_k を分野間
関連度テーブルから求め、乗じ、更に、ある一定値αを
乗じる。なお、漢字の分野情報に分野ｋの記述がなかっ
た時は分野ｋの確信度は０とする。αは単語選択が分野
尤度に与える影響と、漢字が分野尤度に与える影響の整
合性を取るための補正値である。例えば、平均的に１単
語が２漢字からなる時は、０．５の値を取る。この乗じ
た値を分野１から分野ｎまでの和を取り、分野情報テー
ブルの分野ｉの尤度Ｂ_i に加える。この分野ｉの尤度の
計算を分野１から分野ｎまでの尤度について行なう。Firstly seeking confidence C _k in the field k kanji, we obtain the degree of influence W _k that field k has on the field i from the field relevancy table, multiplied further multiplied by a certain value alpha. If there is no description of the field k in the kanji field information, the certainty factor of the field k is 0. α is a correction value for matching the effect of word selection on the field likelihood and the effect of Chinese characters on the field likelihood. For example, when one word consists of two Kanji characters on average, the value is 0.5. The sum of the fields 1 to n is added to the multiplied value and added to the likelihood B _i of the field i in the field information table. The likelihood calculation for the field i is performed for the likelihoods from the field 1 to the field n.

【００５６】次に上述の動作をフローに従って説明す
る。Next, the above operation will be described according to the flow.

【００５７】図１３は第１の実施例によるキー入力を取
り込み、処理を行なう動作を説明するフローチャートで
ある。FIG. 13 is a flow chart for explaining the operation of fetching the key input and processing according to the first embodiment.

【００５８】ステップ１３−１において、キーボードか
ら打鍵されるキーデータを取り込む。In step 13-1, the key data to be keyed is input from the keyboard.

【００５９】ステップ１３−２において、取り込まれた
キーの種別を判定し、判定結果を出力する。その判定結
果に応じて、各キーの処理ルーチンに分岐する。At step 13-2, the type of the fetched key is determined and the determination result is output. Depending on the determination result, the process branches to each key processing routine.

【００６０】変換キーであったときはステップ１３−３
に分岐し、ステップ１３−３において、図１５に詳述す
るように変換処理を行なう。変換処理終了後、ステップ
１３−８において、実行キーの打鍵により変換処理を終
了したかどうかを判定し、実行キー以外による時はその
キーを処理するためにステップ１３−２にループする。
実行キー以外の時はそのキーを読み飛ばすためにステッ
プ１３−１にループする。If it is a conversion key, step 13-3
And the conversion process is performed in step 13-3 as detailed in FIG. After the conversion process is completed, in step 13-8, it is determined whether or not the conversion process is completed by hitting the execution key. If the conversion key is other than the execution key, the process loops to step 13-2 to process the key.
When it is not the execute key, the process loops to step 13-1 to skip the key.

【００６１】ひらがな等の読みキーのときは、ステップ
１３−４に分岐し、図１４に詳述する読み蓄積処理を行
なう。If the key is a reading key such as hiragana, the process branches to step 13-4 and the reading accumulation process described in detail in FIG. 14 is performed.

【００６２】文書保存キーのときは、ステップ１３−５
に分岐し、図１８に詳述する文書保存処理を行なう。If the key is a document save key, step 13-5
And the document saving process described in detail in FIG. 18 is performed.

【００６３】文書呼出キーのときは、ステップ１３−６
に分岐し、図１９に詳述する文書呼出処理を行なう。If the key is a document call key, step 13-6
And the document calling process described in detail in FIG. 19 is performed.

【００６４】その他のキーのときはステップ１３−７に
分岐し、挿入、削除、記号入力等の通常の文字処理装置
において行なわれるその他の処理を行なう。For other keys, the process branches to step 13-7 to perform other processes such as insertion, deletion, symbol input, etc. which are carried out in a normal character processing device.

【００６５】各処理が終了するとステップ１３−１に分
岐する。When each process is completed, the process branches to step 13-1.

【００６６】図１４は第１の実施例によるステップ１３
−４の「読み蓄積処理」を詳細に説明するフローチャー
トである。FIG. 14 shows step 13 according to the first embodiment.
4 is a flowchart illustrating in detail a "reading and accumulating process" of No. 4;

【００６７】ステップ１４−１において、入力された読
みを入力バッファＩＢＵＦに蓄積していく。また、読み
列の状態を表示する。In step 14-1, the input readings are accumulated in the input buffer IBUF. In addition, the state of the reading column is displayed.

【００６８】ステップ１４−２において、ＩＢＵＦに格
納された読み列がキーワード辞書ＫＤＩＣに登録されて
いないかどうか検索する。In step 14-2, it is searched whether or not the reading string stored in IBUF is registered in the keyword dictionary KDIC.

【００６９】ステップ１４−３において、登録されてい
るかどうかを判定し、登録されていない時はそのままリ
ターンする。At step 14-3, it is judged whether or not it is registered, and if it is not registered, the process directly returns.

【００７０】登録されていた時はステップ１４−４に進
み、ループ変数ｉを初期設定する。If it has been registered, the process proceeds to step 14-4 to initialize the loop variable i.

【００７１】ステップ１４−５においてｉが分野数ｎを
超えたかどうか判定し、超えていない時はステップ１４
−６に進み、ループ変数ｊを初期設定する。In step 14-5, it is determined whether or not i exceeds the number of fields n, and if not, step 14
Proceed to -6 to initialize the loop variable j.

【００７２】ステップ１４−７においてｊが分野数ｎを
超えたかどうか判定し、超えていない時は、ステップ１
４−８において、図１１に示した分野尤度の更新処理を
行なう。キーワード辞書より得られたキーワードの分野
ｊの確信度Ｃ_j に、分野間関連度テーブルより得られた
分野ｊが分野ｉに与える影響度Ｗ_ijを乗じたものを加
え、分野情報テーブルのｉ番目の分野尤度Ｂ_i を更新す
る。その後、ステップ１４−９に進み、ｊの値を歩進
し、ステップ１４−７にループする。In step 14-7, it is judged whether or not j exceeds the number of fields n, and if not, step 1
In 4-8, the field likelihood updating process shown in FIG. 11 is performed. The confidence factor C _j of the field j of the keyword obtained from the keyword dictionary is multiplied by the degree of influence W _ij of the field j obtained from the inter-field relevance table on the field i, and the i-th item of the field information table is added. Update the field likelihood B _i of Then, the process proceeds to step 14-9, increments the value of j, and loops to step 14-7.

【００７３】ステップ１４−７においてｊがｎを超えて
いる時はステップ１４−１０においてｉの値を歩進し、
ステップ１４−５にループする。When j exceeds n in step 14-7, the value of i is incremented in step 14-10.
Loop to step 14-5.

【００７４】ステップ１４−５においてｉが分野数ｎを
超えた時はリターンする。When i exceeds the number n of fields in step 14-5, the process returns.

【００７５】図１５は第１の実施例によるステップ１３
−３の「変換処理」を詳細に説明するフローチャートで
ある。FIG. 15 shows step 13 according to the first embodiment.
3 is a flowchart illustrating in detail a "conversion process" of No. 3;

【００７６】ステップ１５−１において、図１６に詳述
するように、読み列を仮名漢字変換し、同音語バッファ
を作成する。At step 15-1, as detailed in FIG. 16, the reading string is converted into kana-kanji characters to create a homophone word buffer.

【００７７】ステップ１５−２において作成された同音
語バッファの第１候補の表記を入力バッファＩＢＵＦに
設定し、表示する。The notation of the first candidate of the homophone word buffer created in step 15-2 is set in the input buffer IBUF and displayed.

【００７８】ステップ１５−３において、キーボードか
ら打鍵されるキーデータを取り込む。At step 15-3, the key data to be entered from the keyboard is fetched.

【００７９】ステップ１５−４において取得されたキー
データが変換キーであるかどうかを判定する。変換キー
でない時（例えば、実行キー打鍵や、次の読み入力）は
次候補を見ずにその候補を確定することになるので、ス
テップ１５−７に分岐する。変換キーの時はステップ１
５−５に進み次候補処理を行ない、同音語バッファ中の
候補Ｎｏの値を更新する。It is determined whether the key data acquired in step 15-4 is a conversion key. If it is not the conversion key (for example, the execution key is pressed or the next reading input is made), the candidate is confirmed without looking at the next candidate, so the process branches to step 15-7. Step 1 for conversion key
In step 5-5, the next candidate process is performed to update the value of the candidate No. in the homophone word buffer.

【００８０】ステップ１５−６において、同音語バッフ
ァ中の新しい変換候補の表記を入力バッファに転送し、
表示する。その後、ステップ１５−３にループし、再び
キー入力を取り込む。In step 15-6, the notation of the new conversion candidate in the homophone word buffer is transferred to the input buffer,
indicate. Then, it loops to step 15-3 and takes in the key input again.

【００８１】ステップ１５−７において、図１７に詳述
するように分野更新処理を行なう。In step 15-7, field updating processing is performed as described in detail in FIG.

【００８２】ステップ１５−８において、表示中の候補
を確定し、文書データなどに埋め込む処理を行ない、リ
ターンする。At step 15-8, the candidate being displayed is confirmed, the process of embedding it in the document data or the like is performed, and the process returns.

【００８３】図１６は第１の実施例によるステップ１５
−１の「仮名漢字変換処理」を詳細に説明するフローチ
ャートである。FIG. 16 shows step 15 according to the first embodiment.
It is a flowchart explaining in detail the "kana-kanji conversion process" of -1.

【００８４】ステップ１６−１において、考えられる仮
名漢字変換候補を作成する。In step 16-1, possible kana-kanji conversion candidates are created.

【００８５】ステップ１６−２において各変換候補に対
応する文節尤度を、図１０に示したように算出する。In step 16-2, the phrase likelihood corresponding to each conversion candidate is calculated as shown in FIG.

【００８６】ステップ１６−３において、文節尤度の順
番に変換候補をソートし、第１候補を決定する。In step 16-3, the conversion candidates are sorted in the order of the phrase likelihood, and the first candidate is determined.

【００８７】ステップ１６−４において、文節尤度の順
序に従って変換候補の表記及び必要な情報を同音語バッ
ファに登録しリターンする。In step 16-4, the representation of the conversion candidates and the necessary information are registered in the homophone buffer according to the phrase likelihood order, and the process returns.

【００８８】図１７は第１の実施例によるステップ１５
−７の「分野更新処理」を詳細に説明するフローチャー
トである。FIG. 17 shows step 15 according to the first embodiment.
It is a flow chart explaining "field update processing" of -7 in detail.

【００８９】分野更新処理は図１１に示す、分野尤度の
更新処理を行なう部分である。The field updating process is a part for performing the field likelihood updating process shown in FIG.

【００９０】ステップ１７−１において、ループ変数ｉ
を初期設定する。In step 17-1, the loop variable i
Is initialized.

【００９１】ステップ１７−２においてｉが分野数ｎを
超えたかどうか判定し、超えていない時はステップ１７
−３に進み、ループ変数ｊを初期設定する。In step 17-2, it is judged whether or not i exceeds the number of fields n, and if not, step 17
-3, the loop variable j is initialized.

【００９２】ステップ１７−４においてｊが分野数ｎを
超えたかどうか判定し、超えていない時は、ステップ１
７−５において、分野尤度の更新処理を行なう。選択単
語の分野ｊの確信度Ｃ_jに、分野間関連度テーブルより
得られた分野ｊが分野ｉに与える影響度Ｗ_jiを乗じたも
のを加え、分野情報テーブルのｉ番目の分野尤度Ｂ_iを
更新する。その後、ステップ１７−６に進み、ｊの値を
歩進し、ステップ１７−４にループする。In step 17-4, it is judged whether or not j exceeds the number of fields n, and if not, step 1
In 7-5, the field likelihood updating process is performed. The certainty factor C _j of the field j of the selected word is multiplied by the influence degree W _ji of the field j obtained from the inter-field relevance table on the field i, and the i-th field likelihood B of the field information table is added. Update _i . Then, it progresses to step 17-6, the value of j is incremented, and it loops to step 17-4.

【００９３】ステップ１７−４においてｊがｎを超えて
いる時はステップ１７−７においてｉの値を歩進し、ス
テップ１７−２にループする。When j exceeds n in step 17-4, the value of i is incremented in step 17-7 and the process loops to step 17-2.

【００９４】ステップ１７−２においてｉが分野数ｎを
超えた時はステップ１７−８に進む。When i exceeds the number n of fields in step 17-2, the process proceeds to step 17-8.

【００９５】ステップ１７−８において、非選択単語ポ
インタを初期設定し、最初の非選択単語を示すようにす
る。At step 17-8, the non-selected word pointer is initialized to point to the first non-selected word.

【００９６】ステップ１７−９において非選択単語全て
について処理が終了したかどうか判定し、終了している
時はリターンする。In step 17-9, it is judged whether or not the processing has been completed for all the non-selected words, and if completed, the routine returns.

【００９７】ステップ１７−１０において、ループ変数
ｉを初期設定する。In step 17-10, the loop variable i is initialized.

【００９８】ステップ１７−１１においてｉが分野数ｎ
を超えたかどうか判定し、超えていない時はステップ１
７−１２に進み、ループ変数ｊを初期設定する。In step 17-11, i is the number of fields n
It is judged whether or not it exceeds, and when it does not exceed, step 1
Proceeding to 7-12, the loop variable j is initialized.

【００９９】ステップ１７−１３においてｊが分野数ｎ
を超えたかどうか判定し、超えていない時は、ステップ
１７−１４において、分野尤度の更新処理を行なう。非
選択単語ポインタの示す非選択単語の分野ｊの確信度ｄ
_j に、分野ｊが分野ｉに与える非選択影響度Ｘ_ijを乗じ
たものを加え、分野情報テーブルのｉ番目の分野尤度Ｂ
_i を更新する。その後、ステップ１７−１５に進み、ｊ
の値を歩進し、ステップ１７−１３にループする。In step 17-13, j is the number of fields n.
It is determined whether or not the field is exceeded, and when it is not exceeded, the field likelihood is updated in step 17-14. The certainty factor d of the field j of the non-selected word indicated by the non-selected word pointer
_j is multiplied by the degree of non-selection influence X _ij given to the field i by the field _j , and the i-th field likelihood B of the field information table is added.
Update _i . Then, it progresses to step 17-15 and j
Is incremented by one and loops to step 17-13.

【０１００】ステップ１７−１３においてｊがｎを超え
ている時はステップ１７−１６においてｉの値を歩進
し、ステップ１７−１１にループする。When j exceeds n in step 17-13, the value of i is incremented in step 17-16 and the process loops to step 17-11.

【０１０１】ステップ１７−１１においてｉが分野数ｎ
を超えた時はステップ１７−１７に進み、非選択単語ポ
インタを歩進し、次の非選択単語を指すようにする。そ
の後、ステップ１７−９にループする。In step 17-11, i is the number of fields n
When it exceeds, the process proceeds to step 17-17 to advance the non-selected word pointer to point to the next non-selected word. Then, it loops to step 17-9.

【０１０２】図１８は第１の実施例によるステップ１３
−５の「文書保存処理」を詳細に説明するフローチャー
トである。FIG. 18 shows step 13 according to the first embodiment.
It is a flowchart explaining in detail the "document storage process" of -5.

【０１０３】ステップ１８−１において、オペレータに
保存する文書の文書名を入力させ、それを受けつける処
理をする。In step 18-1, the operator is made to input the document name of the document to be saved, and the process for accepting it is performed.

【０１０４】ステップ１８−２において文書データをデ
ィスクに格納する保存形式に変形する。In step 18-2, the document data is transformed into a storage format for storing it on the disc.

【０１０５】ステップ１８−３において、文書データの
保存形式をオペレータの指定した文書名に“．ｄｏｃ”
を付加したファイル名でディスクに保存する。In step 18-3, the document data saving format is set to ".doc" as the document name specified by the operator.
Save to disk with the file name added.

【０１０６】図１９は第１の実施例によるステップ１３
−６の「文書呼出処理」を詳細に説明するフローチャー
トである。FIG. 19 shows step 13 according to the first embodiment.
It is a flow chart which explains the "document call processing" of -6 in detail.

【０１０７】ステップ１９−１において、オペレータに
呼び出す文書の文書名を入力させ、それを受けつける処
理をする。At step 19-1, the operator is made to input the document name of the document to be called and the processing for accepting it is performed.

【０１０８】ステップ１９−２において、指定された文
書名の文書データをディスクから呼び出す処理を行な
う。呼び出された文書データが文書バッファ上にロード
されるので、文書バッファＴＢＵＦ上にある現在作成中
の文書は消滅し、新しい文書データに置き換わることに
なる。At step 19-2, a process of calling the document data having the designated document name from the disc is performed. Since the called document data is loaded on the document buffer, the document currently being created on the document buffer TBUF disappears and is replaced with new document data.

【０１０９】ステップ１９−３において、呼び出された
文書データの漢字を示すポインタ“漢字ポインタ”を文
書先頭にセットする。In step 19-3, the pointer "Kanji character pointer" indicating the Kanji of the retrieved document data is set at the beginning of the document.

【０１１０】ステップ１９−４において漢字ポインタの
示す漢字を１つ取ってくる。At step 19-4, one Kanji character indicated by the Kanji pointer is fetched.

【０１１１】ステップ１９−５において漢字ポインタが
文書末尾を示しているかどうか判定する。文書末尾を示
している時は取得された漢字コードは無効なので、処理
を終了してリターンする。文書末尾を示していない時は
取得された漢字は有効なので、ステップ１９−６に進
む。In step 19-5, it is determined whether the Kanji pointer indicates the end of the document. When the end of the document is indicated, the acquired Kanji code is invalid, so the process ends and returns. If the end of the document is not indicated, the acquired Kanji is valid, so the process proceeds to step 19-6.

【０１１２】ステップ１９−６において、ループ変数ｉ
を初期設定する。In step 19-6, the loop variable i
Is initialized.

【０１１３】ステップ１９−７においてｉが分野数ｎを
超えたかどうか判定し、超えていない時はステップ１９
−８に進み、ループ変数ｊを初期設定する。In step 19-7, it is judged whether or not i exceeds the number n of fields, and if not, step 19
Proceed to -8 to initialize the loop variable j.

【０１１４】ステップ１９−９においてｊが分野数ｎを
超えたかどうか判定し、超えていない時は、ステップ１
９−１０において、図１２に示した式に基づく分野尤度
の推定処理を行なう。取得された漢字の分野ｊの確信度
Ｃ_jに、分野間関連度テーブルより得られた分野ｊが分
野ｉに与える影響度Ｗ_ijを乗じたもののα倍を加え、分
野情報テーブルのｉ番目の分野尤度Ｂ_iを更新する。そ
の後、ステップ１９−１１に進み、ｊの値を歩進し、ス
テップ１９−９にループする。In step 19-9, it is judged whether or not j exceeds the number n of fields, and if not, step 1
In 9-10, field likelihood estimation processing is performed based on the equation shown in FIG. The certainty factor C _j of the field k of the acquired kanji is multiplied by α times that obtained by multiplying the degree of influence W _ij of the field j obtained from the inter-field relevance table on the field i. The field likelihood B _i is updated. Then, the process proceeds to step 19-11, increments the value of j, and loops to step 19-9.

【０１１５】ステップ１９−９においてｊがｎを超えて
いる時はステップ１９−１２においてｉの値を歩進し、
ステップ１９−７にループする。When j exceeds n in step 19-9, the value of i is incremented in step 19-12,
Loop to step 19-7.

【０１１６】ステップ１９−７においてｉが分野数ｎを
超えた時はステップ１９−１３に進み、漢字ポインタを
歩進してステップ１９−５にループする。When i exceeds the field number n in step 19-7, the process proceeds to step 19-13, the kanji pointer is incremented, and the process loops to step 19-5.

【０１１７】以上説明したように、第１の実施例によれ
ば、定型文書の文書データに含まれる漢字情報に基づい
て分野情報を推定し、定型文書の呼出と同時に分野情報
が推定され利用されるので、定型文書を呼び出して編集
する時の仮名漢字変換において、候補表示の順序を的確
に行なうことができるので、操作性の高い、快適な文字
処理装置を実現することができる。As described above, according to the first embodiment, the field information is estimated based on the Kanji information included in the document data of the standard document, and the field information is estimated and used at the same time when the standard document is called. Therefore, in the kana-kanji conversion when calling and editing a fixed form document, the order of candidate display can be accurately performed, so that a comfortable character processing device with high operability can be realized.

【０１１８】〈第２の実施例〉以上の第１の実施例にお
いて、定型文書の文書データに含まれる漢字情報に基づ
いて分野情報を推定するというものであった。が、本発
明はこれに限定されるものではなく、文書データを形態
素解析し、それぞれの文節に含まれる単語が１つずつ選
択されたものとして分野情報を推定することもできる。
大部分の図面は第１の実施例と同じであるが、その時の
フローは図１９の代わりに図２０となる。<Second Embodiment> In the above first embodiment, the field information is estimated based on the Kanji information included in the document data of the standard document. However, the present invention is not limited to this, and it is also possible to perform morphological analysis on document data and estimate the field information assuming that one word included in each clause is selected.
Most of the drawings are the same as those of the first embodiment, but the flow at that time is shown in FIG. 20 instead of FIG.

【０１１９】これにともない、図１のＲＯＭに格納され
る制御手順は、図１３〜図１８、図２０に記述したもの
となる。Along with this, the control procedure stored in the ROM of FIG. 1 is as described in FIGS. 13 to 18 and 20.

【０１２０】次に上述の動作をフローに従って説明す
る。Next, the above operation will be described according to the flow.

【０１２１】第１の実施例と異なるフローは図２０のみ
なので、その部分のみ説明する。Since the flow different from the first embodiment is only in FIG. 20, only that part will be described.

【０１２２】図２０は第１の実施例の図１９に代わるも
のであり、ステップ１３−６の「文書呼出処理」を詳細
に説明するフローチャートである。FIG. 20, which replaces FIG. 19 of the first embodiment, is a flow chart for explaining in detail the "document calling process" of step 13-6.

【０１２３】ステップ２０−１において、オペレータに
呼び出す文書の文書名を入力させ、それを受けつける処
理をする。In step 20-1, the operator is made to input the document name of the document to be called, and processing for accepting it is performed.

【０１２４】ステップ２０−２において、指定された文
書名の文書データをディスクから呼び出す処理を行な
う。呼び出された文書データが文書バッファ上にロード
されるので、文書バッファＴＢＵＦ上にある現在作成中
の文書は消滅し、新しい文書データに置き換わることに
なる。In step 20-2, a process of calling the document data having the designated document name from the disc is performed. Since the called document data is loaded on the document buffer, the document currently being created on the document buffer TBUF disappears and is replaced with new document data.

【０１２５】ステップ２０−３において、呼び出された
文書データを形態素解析する。At step 20-3, the retrieved document data is morphologically analyzed.

【０１２６】ステップ２０−４において、形態素解析さ
れた文書データの文節を示すポインタ“文節ポインタ”
を文書先頭の文節を示すようセットする。In step 20-4, a pointer "bunsetsu pointer" indicating a phrase in the document data subjected to morphological analysis.
Is set to indicate the clause at the beginning of the document.

【０１２７】ステップ２０−５において文節ポインタの
示す文節を１つ取ってくる。At step 20-5, one clause indicated by the clause pointer is fetched.

【０１２８】ステップ２０−６において文節ポインタが
文書末尾を示しているかどうか判定する。文書末尾を示
している時は文節が取得できないので、処理を終了して
リターンする。文書末尾を示していない時は取得された
文節は有効なので、ステップ２０−７に進む。In step 20-6, it is determined whether the phrase pointer indicates the end of the document. When the end of the document is indicated, the clause cannot be acquired, so the processing ends and returns. If the end of the document is not indicated, the acquired clause is valid, so the process proceeds to step 20-7.

【０１２９】ステップ２０−７において、ループ変数ｉ
を初期設定する。In step 20-7, the loop variable i
Is initialized.

【０１３０】ステップ２０−８においてｉが分野数ｎを
超えたかどうか判定し、超えていない時はステップ２０
−９に進み、ループ変数ｊを初期設定する。In step 20-8, it is judged whether or not i exceeds the number n of fields, and if not, step 20
Proceed to -9 to initialize the loop variable j.

【０１３１】ステップ２０−１０においてｊが分野数ｎ
を超えたかどうか判定し、超えていない時はステップ２
０−１１に進む。ステップ２０−１１において、図１１
に示した式に基づき、文節を構成する単語が選択された
ものとみなして分野尤度の推定する。文節ポインタの示
す文節を構成する単語の分野ｊの確信度Ｃ_jに、分野間
関連度テーブルより得られた分野ｊが分野ｉに与える影
響度Ｗ_ijを乗じたものを加え、分野情報テーブルのｉ番
目の分野尤度Ｂ_iを更新する。その後、ステップ２０−
１２に進み、ｊの値を歩進し、ステップ２０−１０にル
ープする。In step 20-10, j is the number of fields n.
It is judged whether or not it exceeds, and when it does not exceed, step 2
Go to 0-11. In Step 20-11, as shown in FIG.
Based on the formula shown in, the field likelihood is estimated assuming that the words constituting the bunsetsu are selected. The certainty factor C _j of the field j of the word constituting the phrase pointed to by the phrase pointer is multiplied by the influence degree W _ij of the field j obtained from the inter-field relevance table on the field i, and the result is added to the field information table. Update the i-th field likelihood B _i . Then, step 20-
Go to 12, increment the value of j and loop to step 20-10.

【０１３２】ステップ２０−１０においてｊがｎを超え
ている時はステップ２０−１３においてｉの値を歩進
し、ステップ２０−８にループする。When j exceeds n in step 20-10, the value of i is incremented in step 20-13 and the process loops to step 20-8.

【０１３３】ステップ２０−８においてｉが分野数ｎを
超えた時はステップ２０−１４に進み、文節ポインタを
歩進してステップ２０−６にループする。When i exceeds the field number n in step 20-8, the process proceeds to step 20-14, the phrase pointer is incremented, and the process loops to step 20-6.

【０１３４】以上説明したように、第２の実施例によれ
ば、定型文書の文書データを形態素解析し、それぞれの
文節に含まれる単語が１つずつ選択されたものとして分
野情報を推定し、定型文書の呼出と同時に分野情報が推
定され利用されるので、定型文書を呼び出して編集する
時の仮名漢字変換において、候補表示の順序を的確に行
なうことができるので、操作性の高い、快適な文字処理
装置を実現することができる。As described above, according to the second embodiment, the morphological analysis is performed on the document data of the fixed form document, and the field information is estimated as one word included in each clause is selected, Since the field information is estimated and used at the same time when the standard document is called, the order of candidate display can be accurately performed in the Kana-Kanji conversion when the standard document is called and edited, resulting in high operability and comfort. A character processing device can be realized.

【０１３５】[0135]

【発明の効果】以上説明したように、本発明によれば、
文書の呼出時に、呼び出した文書から分野情報を推定す
るので、文書を呼び出して編集する時の仮名漢字変換に
おいて、候補表示の順序を的確に行なうことができるの
で、操作性の高い、快適な文字処理装置を実現すること
ができる。As described above, according to the present invention,
When the document is called, the field information is estimated from the called document, so the order in which candidates are displayed can be accurately set during kana-kanji conversion when the document is called and edited. A processing device can be realized.

[Brief description of drawings]

【図１】本発明に係る文字処理装置の第１の実施例を示
すブロック図である。FIG. 1 is a block diagram showing a first embodiment of a character processing device according to the present invention.

【図２】第１の実施例による分野処理の例を示した図で
ある。FIG. 2 is a diagram showing an example of field processing according to the first embodiment.

【図３】第１の実施例による辞書ＤＩＣの構成例を示し
た図である。FIG. 3 is a diagram showing a configuration example of a dictionary DIC according to the first embodiment.

【図４】第１の実施例によるキーワード辞書ＫＤＩＣの
構成例を示した図である。FIG. 4 is a diagram showing a configuration example of a keyword dictionary KDIC according to the first embodiment.

【図５】第１の実施例による入力バッファＩＢＵＦの構
成例を示した図である。FIG. 5 is a diagram showing a configuration example of an input buffer IBUF according to the first embodiment.

【図６】第１の実施例による同音語バッファＤＯＢＵＦ
の構成例を示した図である。FIG. 6 is a homophone word buffer DOBUF according to the first embodiment.
It is a figure showing an example of composition.

【図７】第１の実施例による分野情報テーブルＢＴＢＬ
の構成例を示した図である。FIG. 7 is a field information table BTBL according to the first embodiment.
It is a figure showing an example of composition.

【図８】第１の実施例による分野間関連度テーブルＷＴ
ＢＬの構成例を示した図である。FIG. 8 is an inter-field relevance table WT according to the first embodiment.
It is a figure showing the example of composition of BL.

【図９】第１の実施例による漢字辞書ＫＡＮＪＩＤの構
成例を示した図である。FIG. 9 is a diagram showing a configuration example of a Kanji dictionary KANJID according to the first embodiment.

【図１０】第１の実施例による仮名漢字変換における第
１候補決定のための評価式を示した図である。FIG. 10 is a diagram showing an evaluation formula for determining a first candidate in kana-kanji conversion according to the first embodiment.

【図１１】第１の実施例による単語選択時における分野
情報テーブルの更新式を示した図である。FIG. 11 is a diagram showing an update formula of a field information table at the time of word selection according to the first embodiment.

【図１２】第１の実施例による単語選択時における分野
情報テーブルの更新式を示した図である。FIG. 12 is a diagram showing an update formula of a field information table at the time of selecting a word according to the first embodiment.

【図１３】第１の実施例によるキー入力を取り込み処理
を行なう動作を説明するフローチャートである。FIG. 13 is a flowchart illustrating an operation of capturing a key input and performing a process according to the first embodiment.

【図１４】第１の実施例によるステップ１３−４の「読
み蓄積処理」を詳細に説明するフローチャートである。FIG. 14 is a flowchart illustrating in detail the “reading and accumulating process” in step 13-4 according to the first embodiment.

【図１５】第１の実施例によるステップ１３−３の「変
換処理」を詳細に説明するフローチャートである。FIG. 15 is a flowchart illustrating in detail “conversion processing” in step 13-3 according to the first embodiment.

【図１６】第１の実施例によるステップ１５−１の「仮
名漢字変換処理」を詳細に説明するフローチャートであ
る。FIG. 16 is a flowchart illustrating in detail a “kana-kanji conversion process” of step 15-1 according to the first embodiment.

【図１７】第１の実施例によるステップ１５−７の「分
野更新処理」を詳細に説明するフローチャートである。FIG. 17 is a flowchart illustrating in detail the "field updating process" of step 15-7 according to the first embodiment.

【図１８】第１の実施例によるステップ１３−５の「文
書保存処理」を詳細に説明するフローチャートである。FIG. 18 is a flowchart illustrating in detail the “document saving process” of step 13-5 according to the first embodiment.

【図１９】第１の実施例によるステップ１３−６の「文
書呼出処理」を詳細に説明するフローチャートである。FIG. 19 is a flowchart illustrating in detail the “document calling process” of step 13-6 according to the first embodiment.

【図２０】第２の実施例によるステップ１３−６の「文
書呼出処理」を詳細に説明するフローチャートである。FIG. 20 is a flowchart illustrating in detail the “document calling process” of step 13-6 according to the second embodiment.

[Explanation of symbols]

ＣＰＵマイクロプロセッサＤＩＣ仮名漢字変換用辞書ＲＯＭ読出し専用メモリＲＡＭランダムアクセスメモリＤＩＳＫ外部記憶ＫＢキーボードＣＲカーソルレジスタＤＢＵＦ表示用バッファメモリＣＲＴＣＣＲＴコントローラＣＲＴ表示装置ＣＧキャラクタジェネレータＷＴＢＬ分野間関連度テーブルＫＤＩＣキーワード辞書ＩＢＵＦ入力バッファＤＯＢＵＦ同音語バッファＢＴＢＬ分野情報テーブルＴＢＵＦ文書バッファＫＡＮＪＩＤ漢字辞書 CPU Microprocessor DIC Kana-Kanji conversion dictionary ROM Read-only memory RAM Random access memory DISK External storage KB Keyboard CR Cursor register DBUF Display buffer memory CRTC CRT controller CRT display CG character generator WTBL Inter-field relation table KDIC keyword dictionary IBUF input Buffer DOBUF Homophone buffer BTBL Field information table TBUF Document buffer KANJID Kanji dictionary

───────────────────────────────────────────────────── フロントページの続き (72)発明者榊原厚志東京都大田区下丸子３丁目30番２号キヤノン株式会社内 (72)発明者池田和世東京都大田区下丸子３丁目30番２号キヤノン株式会社内 (72)発明者小林雄二東京都大田区下丸子３丁目30番２号キヤノン株式会社内 (72)発明者緩利仁志東京都大田区下丸子３丁目30番２号キヤノン株式会社内 (72)発明者小楠千佳東京都大田区下丸子３丁目30番２号キヤノン株式会社内 (72)発明者森田麻友美東京都大田区下丸子３丁目30番２号キヤノン株式会社内 (72)発明者岸野麻里子東京都大田区下丸子３丁目30番２号キヤノン株式会社内 (72)発明者安達里砂東京都大田区下丸子３丁目30番２号キヤノン株式会社内 (72)発明者鈴木秀恭東京都大田区下丸子３丁目30番２号キヤノン株式会社内 ─────────────────────────────────────────────────── ─── Continued Front Page (72) Inventor Atsushi Sakakibara 3-30-2 Shimomaruko, Ota-ku, Tokyo Canon Inc. (72) Inventor Kazuyo Ikeda 3-30-2 Shimomaruko, Ota-ku, Tokyo Canono (72) Inventor Yuji Kobayashi 3-30-2 Shimomaruko, Ota-ku, Tokyo Canon Inc. (72) Inventor Hitoshi Haruri, 3-30-2 Shimomaruko, Ota-ku, Tokyo Canon Co., Ltd. (72) Inventor Chika Kogusu 3-30-2 Shimomaruko, Ota-ku, Tokyo Canon Inc. (72) Inventor Mayumi Morita 3-30-2 Shimomaruko, Ota-ku, Tokyo Canon Inc. (72) Inventor Mariko Kishino 3-30-2 Shimomaruko, Ota-ku, Tokyo Canon Inc. (72) Inventor Risuna Adachi 3-30-2 Shimomaruko, Ota-ku, Tokyo Canon within Co., Ltd. (72) inventor Suzuki ShigeruKyo Ota-ku, Tokyo Shimomaruko 3-chome No. 30 No. 2 Canon within Co., Ltd.

Claims

[Claims]

1. A character processing device for inputting a phonetic reading by a kana character string to edit a document containing kanji and kana mixed notation, inputting means for reading the reading, word reading, and word information including notation and field information. And a dictionary means for storing the field information of the document being edited, and a reading input by the input means by referring to the field information storage means and the dictionary means. A converting means for converting into a notation, a document storing means for storing the document edited by the notation obtained by the converting means, a calling means for calling the document stored in the auxiliary storage section to the document storing means, The estimating means for estimating the field information to which the document belongs from the document called by the calling means, and the field information estimated by the estimating means are stored in the field information storage means. Character processing apparatus characterized by having setting means for.

2. The character processing apparatus according to claim 1, wherein the estimating unit estimates the field information of the document based on the Chinese character data in the called document.

3. The estimating unit estimates the field information of the document based on the field information of the word obtained as a result of the morphological analysis in the called document. The character processing device according to 1.

4. In a character processing device for inputting a reading with a kana character string and editing a document mixed with kanji and kana, a document stored in an auxiliary storage unit is called into a document memory, and the document is read from the called document. Estimating the field information to which the document belongs, setting the estimated field information in the field information memory, inputting the reading, and inputting the reading as word reading and word information including notation and field information A character processing method, which refers to the dictionary and the field information memory that are stored in association with each other, converts into a notation, and edits the standard document stored in the document memory according to the notation obtained by the conversion. .