JPS60110074A

JPS60110074A - Character processor

Info

Publication number: JPS60110074A
Application number: JP58218123A
Authority: JP
Inventors: Eiichiro Toshima; 戸島　英一郎
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 1983-11-19
Filing date: 1983-11-19
Publication date: 1985-06-15

Abstract

PURPOSE:To limit the learning processing frequencies and to prevent the reduction of an input speed by performing the learning process to the homonyms immediately after conversion of KANJI (Chinese character). CONSTITUTION:A read string supplied from an input means 1 is converted into KANJI and phrases by a converting means 2, and the candidate KANJI for conversion (candidate words) are delivered to an output means 3. If the delivered KANJI (the 1st candidate word) is not equal to the desired KANJI and phrase, the next candidate word is converted by a next candidate indicating means 4 and the means 2 and delivered to an output means. When the delivered candidate word is coincident with the desired KANJI, etc., the selection is indicated by a homonym selecting means 5. Then the candidate words delivered presently to the means 3 are selected. In this case, it is decided whether the selected KANJI is equal to the one obtained immediately after the KANJI(Japanese syllabary)/KANJI conversion. If the selected KANJI is equal to the one obtained immediately after said conversion, the using frequency information or the using order information on a dictionary 7 is replaced by a learning means 6. Thus the learning process is performed to the homonyms immediately after the KANA/KANJI conversion. Therefore the input speed is never limited and the input efficiency is improved with the learning processing.

Description

【発明の詳細な説明】［技術分野］本発明は、漢字仮名混じりの日本語文書を編集処理する
文字処理装置に関するものである。DETAILED DESCRIPTION OF THE INVENTION [Technical Field] The present invention relates to a character processing device that edits and processes a Japanese document containing kanji and kana.

［従来技術］漢字仮名混じりの日本語文書を入力する装置においては
、オペレータにより漢字の読みが入力されると、辞書検
索が行われ、その読みに対応する漢字の候補群が表示さ
れ、その候補群のうち所望のものがオペレータにより選
択されると、その選択漢字への変換がなされる。かかる
かな漢字変換入力方式が従来から広く採用されていた。[Prior Art] In a device for inputting Japanese documents containing kanji and kana, when an operator inputs the reading of a kanji, a dictionary search is performed, a group of kanji candidates corresponding to the reading are displayed, and the candidates are selected. When a desired one from the group is selected by the operator, it is converted into the selected Kanji. Such a kana-kanji conversion input method has been widely adopted.

このようなかな゛漢字変換入力方式においては、望む漢
字が漢字辞書内に列挙されている候補語のうちの第１候
補語であれば単に選択キーを押すだけで望む漢字（第１
候補語）への漢字変換が行われる。しかるに、第２候補
語以下の漢字が望む漢字の場合には、次候補キーを使用
して望む漢字を選択しなければならない。その結果、オ
ペレータの望む漢字が第１候補語である確立が高いほど
かな漢字変換入力の効率が高くなる。そこで、オペレー
タにより１度でも使用された漢字は使用頻度が高いもの
として辞書中の使用頻度情報または使用順序情報を更新
し、次回のかな漢字変換入力時にその漢字が第１候補語
として変換される可能性を高めるように処理するいわゆ
る学習処理が従来より行われている。In this Kana-Kanji conversion input method, if the desired kanji is the first candidate word among the candidate words listed in the kanji dictionary, the desired kanji (first
Kanji conversion to candidate words) is performed. However, if the desired Kanji is a Kanji below the second candidate word, the desired Kanji must be selected using the next candidate key. As a result, the efficiency of kana-kanji conversion input increases as the probability that the operator's desired kanji is the first candidate word increases. Therefore, if a kanji is used even once by an operator, the usage frequency information or usage order information in the dictionary is updated as a frequently used kanji, and that kanji can be converted as the first candidate word when inputting the next kana-kanji conversion. So-called learning processing has been conventionally performed to improve the performance of the computer.

しかしながら、かかる学習処理は上述のように辞書中の
データの操作が必要なので、辞書がフロッピーディスク
等のようなアクセスに時間を要する外部記憶媒体に展開
されている場合には、学習処理を実行由の期間は入力が
禁止されることになり、入力速度が制限されてしまうと
いう欠点があった。However, such learning processing requires manipulation of data in the dictionary as described above, so if the dictionary is stored on an external storage medium such as a floppy disk that takes time to access, the learning processing may not be executed. This has the disadvantage that input is prohibited during this period, which limits input speed.

かかる点に鑑みて、学習処理を全く行わないでおくこと
も考えられるが、このような装置においては、オペレー
タの望む漢字がいつまでたっても第１候補語として変換
されないので候補語の選択操作のための回数が増えてし
まい、やはり入力速度が遅いという欠点がある。In view of this, it is conceivable to not perform any learning processing at all, but in such a device, the kanji desired by the operator will not be converted as the first candidate word, so it is necessary to perform the candidate word selection operation. The disadvantage is that the number of times ``is increased'' and the input speed is slow.

［目　的］本発明の目的は、上述した従来の欠点を解消することに
あり、学習処理の利点を生かしつつも、その処理回数を
制限して入力速度の低下を防止し得るようにした文字処
理装置を提供することにある。[Purpose] The purpose of the present invention is to eliminate the above-mentioned conventional drawbacks, and to provide a character that takes advantage of the advantages of learning processing while limiting the number of processing times to prevent a decrease in input speed. The purpose of this invention is to provide a processing device.

［実　施　例］以下に、図面を参照して本発明の詳細な説明する。[Example] The present invention will be described in detail below with reference to the drawings.

第１図は本発明の構成を示す。FIG. 1 shows the configuration of the present invention.

図において、入力手段ｌより入力された読み列は、変換
手段２により漢字および熟語に変換され、変換候補の漢
字（候補語）が出力手段３に出力される。出力された漢
字（第１候補語）が希望する漢字および熟語でなかった
場合には、次候補指示手段４により次の候補語が変換手
段２により変換されて出力手段に出力される。出力され
た候補語が望む漢字等であった場合には、同音語選択手
段５により選択を指示すると、現在出力手段３に出力さ
れている候補語が選択される。このとき選択された漢字
がかな漢字変換直後のものであるかどうかが判定され、
変換直後のものであれば学習手段６により辞書７上の使
用頻度情報または使用順序情報が更新される。In the figure, a reading string inputted from an input means 1 is converted into kanji and compound words by a conversion means 2, and conversion candidate kanji (candidate words) are outputted to an output means 3. If the output kanji (first candidate word) is not the desired kanji or compound word, the next candidate instructing means 4 converts the next candidate word by the converting means 2 and outputs it to the output means. If the output candidate word is a desired kanji or the like, the homophone selection means 5 instructs selection, and the candidate word currently output to the output means 3 is selected. At this time, it is determined whether the selected kanji has just been converted to kana-kanji,
If the information has just been converted, the learning means 6 updates the usage frequency information or usage order information in the dictionary 7.

第２図は本発明文字処理装置の一実施例の構成を示すぜ
。FIG. 2 shows the structure of an embodiment of the character processing device of the present invention.

図示の構成において、ＣＰＵはマイクロプロセッサであ
り、文字処理のための演算、論理判断等を行い、アドレ
スバスＡＢ、コントロールバスＣＢ。In the illustrated configuration, the CPU is a microprocessor that performs calculations, logical judgments, etc. for character processing, and connects an address bus AB and a control bus CB.

データバスＤＢを介して、それらのバスに接続された各
構成要素を制御する。アドレスバスＡＢはマイクロプロ
セッサＣＰＵの制御の対象とする構成要素を指示するア
ドレス信号を転送する。コントロールバスＣＢはマイク
ロプロセッサＣＰｕの制御の対象とする各構成要素のコ
ントロール信号を転送する。データバスＤＢは各構成機
器相互間のデータの転送を行う。Each component connected to these buses is controlled via the data bus DB. Address bus AB transfers address signals indicating the components to be controlled by the microprocessor CPU. The control bus CB transfers control signals for each component controlled by the microprocessor CPU. The data bus DB transfers data between each component device.

つぎに、ＲＯＭは読出し専用の固定メモリであり、後述
するマイクロプロセッサＣＰＵによる制御の手順（第９
図転作）等を記憶させておく。また、ＲＡＭは１ワード
１８ビツトの構成の書込み可能のランダムアクセスメモ
リであって、各構成要素からの各種データの一時記憶に
用いる。このメモリＲＡＭにおいて、ＫＢＢＵＦはキー
ボードバッファであり、キーボードＫＢかものキーデー
タを記憶するバッファである。Ｃ０ＢＵＦはコマンドバ
ッファであり、キーボードバッファＫＢＢＵＦのキーデ
ータを一文字ずつとり出して一時記憶する。同音語バッ
ファＢＵＦは同音語中にいくつの候補語が存在するのか
を示す。また、ＭＲＣＤは最も新しく作成された同音語
バッファＢＵＦのポインタを記憶するバッファである。Next, the ROM is a read-only fixed memory, and the control procedure (9th
(Illustration transcription) etc. Further, the RAM is a writable random access memory having a configuration of 1 word and 18 bits, and is used for temporary storage of various data from each component. In this memory RAM, KBBUF is a keyboard buffer, which is a buffer for storing key data of the keyboard KB. C0BUF is a command buffer, which extracts key data from the keyboard buffer KBBUF one character at a time and temporarily stores it. The homophone buffer BUF indicates how many candidate words exist in the homophone. Further, MRCD is a buffer that stores a pointer to the most recently created homophone buffer BUF.

ＫＢはキーボードであって、アルファベットキー、平仮
名キー、片仮名キー等の文字記号入カキ−１および、漢
字部始め／終り指示キー、変換キー、次候補キー、選択
キー等の本文字処理装置における各種機能を指示するた
めの各種のファンクションキーを備えている。ＤＩＳＫ
は外部メモリであり、゛定形文書を記憶するためのメモ
リで作成された文書の保管を行い、保管された文書はキ
ーボードの指示により必要な時呼びだされる。また、こ
のメモリＤＩＳＫ内には漢字辞書が展開されている。Ｃ
Ｒはカーソルレジスタであり、ＣＰＵによりカーソルレ
ジスタの内容を読み書き出来る。後述するＣＲＴコント
ローラＣＲＴＣは、ここに蓄えられたアドレ、スに対応
する表示装置ＣＲＴ上の位置にカーソルを表示する。Ｄ
ＢＵＦはバ・ンファメモリであり、キーボードＫＢから
入力された文書情報等を蓄える。ＣＲＴＣはＣＲＴコン
トローラであり、カーソルレジスタＣＲおよびバッファ
［１ＢＵＦに蓄えられた内容を表示器ＣＲＴに表示する
役割を担う。また、ＣＲＴは陰極線管等を用いた表示装
置であり、その表示装置ＣＲＴにおけるドツト構成の表
示パターンおよびカーソルの表示をＣＲＴコントローラ
で制御する。KB is a keyboard, which includes character symbol input keys such as alphabet keys, hiragana keys, and katakana keys, and various keys in this character processing device such as kanji part start/end indicator keys, conversion keys, next candidate keys, selection keys, etc. It is equipped with various function keys for instructing functions. DISK
is an external memory that stores documents created in the memory for storing fixed-form documents, and the stored documents can be recalled when necessary by instructions from the keyboard. A kanji dictionary is also developed in this memory DISK. C
R is a cursor register, and the contents of the cursor register can be read and written by the CPU. A CRT controller CRTC, which will be described later, displays a cursor at a position on the display device CRT corresponding to the address stored here. D
BUF is a buffer memory that stores document information etc. input from the keyboard KB. CRTC is a CRT controller and is responsible for displaying the contents stored in the cursor register CR and buffer [1BUF on the display CRT. Further, a CRT is a display device using a cathode ray tube or the like, and a dot-configured display pattern and a cursor display on the display device CRT are controlled by a CRT controller.

さらに、ＣＧはキャラクタジェネレータであって、表示
装置ＣＲＴに表示する文字、記号、カーソル等のキャラ
クタ信号を発生させるものである。Furthermore, CG is a character generator, which generates character signals such as characters, symbols, and cursors to be displayed on the display device CRT.

かかる各構成要素からなる本実施例は、キーボードＫＢ
からの各種の入力に応じて作動するものであって、キー
ボードＫ１３からの入力が供給されるすＣＰＵに送られ
、そのマイクロプロセッサＣＰＵがＲＯＭ内に記憶しで
ある各種の制御手順を読出し、それらの制御手順に従っ
て各種の制御が行われる。This embodiment consisting of each of these components is a keyboard KB.
The microprocessor CPU operates in response to various inputs from the keyboard K13, and the input from the keyboard K13 is sent to the CPU, which reads various control procedures stored in the ROM and executes them. Various controls are performed according to the control procedures.

第３図は漢字辞書の構造を示す。図に示すように、辞書
は１単語当り１８バイトで構成される。Figure 3 shows the structure of a kanji dictionary. As shown in the figure, the dictionary consists of 18 bytes per word.

最初の８バイトは読みであり、各単語の読みが１文字１
バイトで格納される。コードは例えばＪＩＳＣニー１３
２２８コードの下位バイトを用いる。次の６バイトは漢
字部であり、漢字が１文字２バイトで格納される。コー
ドは例えばＪＩＳＣ−６２２ｅコードを用いる一最後の
２バイトは単語の使用頻度を表わすのに用いられる。使
用順序情報は辞書中の順番で示され、読み「きかい」の
第１番目の単語が使用順序１．２番目の単語が使用順序
２を表わすことになる。The first 8 bytes are the reading, and each word has a reading of 1 character.
Stored in bytes. For example, the code is JISC knee 13
The lower byte of the H.228 code is used. The next 6 bytes are the kanji part, where each kanji character is stored in 2 bytes. The code is, for example, a JISC-622e code.The last two bytes are used to represent the frequency of use of the word. The usage order information is shown in the order in the dictionary, with the first word in the reading "Kikai" representing the usage order 1, and the second word representing the usage order 2.

ここで、逆字変換の際に、同音語のうちのどれを第１候
補語とするかはこれらの使用即度と使用順序を加味して
判断される。本実施例では、Ｃ使用頻度−（使用１順序
×２））により算出される得点によって候補順位を決定
し、得９点の′最高の単語を第１候補語とする。Here, at the time of reverse character conversion, which of the homophones is to be selected as the first candidate word is determined by taking into account the degree of use and order of use of these words. In this embodiment, the candidate ranking is determined by the score calculated by C frequency of use - (1 order of use x 2)), and the word with the highest score of 9 is determined as the first candidate word.

第４図は学習処理により辞書中のデータがどのように変
化するかを示すものである。FIG. 4 shows how the data in the dictionary changes due to the learning process.

第４図（Ａ）は初期状態を示し、この状態で算出された
得点が最も高いのは「機械」の９点であり、読み「きか
い」を入力すると、「機械」が第１候補語として表われ
る。Figure 4 (A) shows the initial state, and the highest score calculated in this state is 9 points for "machine", and when the reading "kikai" is input, "machine" is the first candidate word. It appears as.

第４図（Ｂ）はかかる状態から「Ｒ会」を選択した後の
状態を示す。図に示すように、「貴会」の使用頻度がカ
ウントアツプされ、更にその使用順序が第１番目となる
。この結果、得点が最高となり、以後は第１候補語とし
て変換可能となる。FIG. 4(B) shows the state after selecting "R-meeting" from this state. As shown in the figure, the frequency of use of "Your Association" is counted up, and the order of use becomes the first. As a result, the score becomes the highest, and henceforth it can be converted as the first candidate word.

第４図（Ｃ）は更に「機械」を選択した後の状態を示す
。図示のように、「機械ｊの使用頻度がカウントアツプ
され、使用頻度が第１番目となる。FIG. 4(C) further shows the state after "machine" is selected. As shown in the figure, the frequency of use of machine j is counted up, and the frequency of use becomes the first.

その結果、「機械」の得点が再び最高となって、以後は
第１候補語として変換可能となる。As a result, "machine" again has the highest score, and henceforth can be converted as the first candidate word.

第５図はバッファメモリＤＢＵＦ内の文書の構造を示す
。図示のように、文書はいくつかの行に分かれ、更に各
行はいくつかの文字に分かれている。FIG. 5 shows the structure of the document in the buffer memory DBUF. As shown, the document is divided into several lines, and each line is further divided into several characters.

第６図は文字の構造を更に詳細に示す。図示のように、
各文字は２バイトで構成され、先頭の１ビツトが０゛か
′ｌ°′かに従って２つのタイプに区別される。すなわ
ち、第６図（Ａ）に示すように゛′０パの場合は、通常
の文字であることを示し、下位の各ビットにおいては、
例えばＪＩＳＣ−８２２８コードを用いて、平仮名１卉
仮名、漢字、記号等の文字データが格納される。しかる
に、第６図（Ｂ）に示すように先頭ビットが゛１パの場
合は、同音語であることを示し、下位の１５ビツトにお
いてはどのような同音語であるかを示す同音語バッファ
へのポインタが格納される。FIG. 6 shows the structure of the character in more detail. As shown,
Each character consists of 2 bytes and is classified into two types depending on whether the first bit is 0' or '1'. In other words, as shown in FIG. 6(A), ``0'' indicates a normal character, and in each lower bit,
For example, character data such as hiragana, kanji, symbols, etc. is stored using the JISC-8228 code. However, as shown in Figure 6 (B), if the first bit is ``1pa'', it indicates that it is a homophone, and the lower 15 bits are sent to the homophone buffer, which indicates what kind of homophone it is. A pointer is stored.

第７図１±同音語バックァＢＵＦの構造を示す。図に示
すように、同音語バッファＢＵＦはどの読みを変換して
同音語になったかを示すための読み部と、現在どの候補
語が表示されているかを示す表示番号部と、読み部の読
みに対応した同音語の候補語リストからなる。候補語リ
ス、トは第１候補語の単語から候補順に並んでいる。１
単語は６バイトで構成され、読み部は８バイトで構成さ
れ、表示番号部は２バイトで構成され、各候補語は６バ
イトで、構成されている。FIG. 7 shows the structure of 1±homophone backer BUF. As shown in the figure, the homophone buffer BUF has a reading part that shows which pronunciation has been converted to become a homophone, a display number part that shows which candidate word is currently displayed, and the reading part of the reading part. It consists of a list of homophone candidate words corresponding to . The candidate word list is arranged in the order of candidates starting from the first candidate word. 1
A word consists of 6 bytes, a reading part consists of 8 bytes, a display number part consists of 2 bytes, and each candidate word consists of 6 bytes.

第８図は上述のように構成した本実施例における漢字変
換動作を示すものである。FIG. 8 shows the kanji conversion operation in this embodiment configured as described above.

第８図（Ａ）は何も入力されていない状態での表示装置
ＣＲＴの画面を示したものである。ここでＳは表示領域
を示し、ＣＭはカーソルを示す。キーボードＫＢより文
字入力を行うと、カーソルＣＭにより示される位置に文
字が入力される。FIG. 8(A) shows the screen of the display device CRT in a state where nothing is input. Here, S indicates a display area, and CM indicates a cursor. When characters are input using the keyboard KB, the characters are input at the position indicated by the cursor CM.

第８図（Ｂ）はかかる状態から゛（′”、パき“。FIG. 8(B) shows the state from such a state.

°°か′°、゛い’、”）’“と入力し、かな漢字変換
を行った後の状態を示している。ここで、”（”は漢字
始めキーを表し、”）”は漢字終りキーを表わす。かな
を漢字に変換するには、１１１１と＋＋）＋＋とで漢字
の読みを囲い、更に変換キーを入力すれば良い。この場
合、読み「きかい」に相当する単語（漢字）は、辞書中
に複数個あるので、同音語としてその第１候補語である
「機械」が○印で囲まれた状態で表示される。This shows the state after inputting °° or ′°, ゛ii', ")'" and performing kana-kanji conversion. Here, "(" represents the kanji start key, and ")" represents the kanji end key. To convert kana to kanji, just surround the kanji reading with 1111 and ++)++ and then input the conversion key. In this case, there are multiple words (kanji) that correspond to the reading ``kikai'' in the dictionary, so the first candidate word ``machine'' is displayed as a homophone, circled. .

第８図（Ｃ）はかかる状態からカーソルＣＭを画面下方
に移動し、（ないかく）と入力した後変換キーを入力し
て漢字変換を行った状態を示している。「ないかく」の
読みをもつ単語はやはり辞書中に複数個あるので同音語
としてその第１候補語「内角」が表示される。　、第８図（Ｄ）はかかる状態で次候補キーを押し、更に選
択キーを押した後の状態を示したものである。「ないか
〈」の読みの第２候補語は「内閣」であるので、漢字「
内閣」が確定する。このとき、「ないかく」は漢字変換
の直後であるので、学習処理が実行される。FIG. 8(C) shows a state in which the cursor CM is moved to the bottom of the screen from this state, and after inputting (naikaku), the conversion key is input to perform kanji conversion. Since there are multiple words with the pronunciation of ``Naikaku'' in the dictionary, the first candidate word ``Naikaku'' is displayed as a homophone. , FIG. 8(D) shows the state after pressing the next candidate key and then pressing the selection key in this state. The second candidate word for the reading of “Naika〈” is “cabinet”, so the kanji “
Cabinet" is confirmed. At this time, since "naikaku" has just been converted into kanji, the learning process is executed.

第８図（Ｅ）は更に次候補キー、選択キーを打鍵した後
の状態を示すものである。「きかい」の第２候補語は「
器械」であるので、漢字「器械」が確定する。このとき
、読み「きかい」の同音語の選択は漢字変換直後に行わ
れたものではない。すなわち、オペレータにとっては変
換直後に選択するほどの重要な単語ではないと考求られ
るので学習処理は行われない。FIG. 8(E) shows the state after the next candidate key and selection key are pressed. The second candidate word for “Kikai” is “
Since the word is ``instrument'', the kanji ``instrument'' is confirmed. At this time, the selection of homophones for the reading ``kikai'' was not done immediately after the kanji conversion. In other words, it is considered that the word is not important enough for the operator to select it immediately after conversion, so no learning process is performed.

次に、第９図は本実施例に係る文字処理装置の動作を示
す。Next, FIG. 9 shows the operation of the character processing device according to this embodiment.

まずＩ、ステップ９−１においてキーボードバッファＫ
ＢＢＵＦにキーデータが蓄えられているかどうか検出し
、もし蓄えられていればその、キーデータをコマンドバ
ッファＣ０ＢＵＦに取出し、以下の処理を行う。蓄えら
れていなければ、バダファＫＢＢＵＦに有効データが格
納されるまで待機する。バッファＫＢＢＵＦに格納され
るデータは、キーボードＫＢ」二のキーを打鍵したとき
に生じる割込み処理によって作成される。First, I, in step 9-1, the keyboard buffer K
It is detected whether key data is stored in BBUF, and if so, the key data is taken out to command buffer C0BUF and the following processing is performed. If the data is not stored, the process waits until valid data is stored in the buffer KBBUF. The data stored in the buffer KBBUF is created by an interrupt process that occurs when the second key of the keyboard KB is pressed.

ステップ９−２においてはバッファＣ０ＢＵＦのキーデ
ータを解析し、キーの種類に従って以後の処理へと進む
。すなわち、変換キーであればステップ８−３に進み、
選択キーであればステップ８−５に進み、それ以外のキ
ーであればステップ９−８に進む。ステップ９−８にお
いては、モード変更、挿入処理等の一般の文字処理を行
うにあたり必要とされる各種の処理を実行し、その後ス
テップｌ１ｌ−１へ戻る。In step 9-2, the key data in the buffer C0BUF is analyzed, and the process proceeds to subsequent steps according to the type of key. In other words, if it is a conversion key, proceed to step 8-3;
If it is a selection key, the process proceeds to step 8-5, and if it is any other key, the process proceeds to step 9-8. In step 9-8, various processes required for general character processing such as mode change and insertion processing are executed, and then the process returns to step l1l-1.

ステップ８−３においては、かな漢字変換処理を行う。In step 8-3, kana-kanji conversion processing is performed.

漢字始めコード”（”と漢字終りコード”）”で囲まれ
た部分の読み列を、辞書（第３図参照）を検索して漢字
に変換する。このとき、辞書内の候補語が１個であれば
、その候補語が変換結果として文書データを格納したバ
ッファＤＢＵＦへ転送する。しかるに、候補語が複数個
ある場合にはその候補語をすべてリストアツブし、第７
図に示すような同音語バッファＢＵＦを作成し、その同
音語バッファＢＵＦを指示するだめのポインタを／シソ
ファＤＢＵＦ内の文書データに向けて転送する。Search the dictionary (see Figure 3) for the pronunciation sequence of the part surrounded by the kanji start code "(" and the kanji end code")" and convert it into kanji. At this time, if there is one candidate word in the dictionary, that candidate word is transferred as a conversion result to the buffer DBUF in which document data is stored. However, if there are multiple candidate words, all the candidate words are restored and the seventh
A homophone buffer BUF as shown in the figure is created, and a pointer pointing to the homophone buffer BUF is transferred to the document data in the /sisopha DBUF.

この後、ステップ８−４において、候補語が１個ならば
バッファＭＲＣＤに０“′を代入し、同音語であれば、
すなわち候補語が複数個あればバッファＭＲＣＤにその
Ｍ音語の同音語バッファＢＵＦを指示するためのポイン
タを代入する。このようにして、バッファＭＲＣＤは最
も新しく作成された同音語バッファＢＵＦを指示するた
めのポインタを有することになる。この後、ステップ９
−１へ戻る。After this, in step 8-4, if there is one candidate word, 0"' is assigned to the buffer MRCD, and if it is a homophone,
That is, if there are multiple candidate words, a pointer for indicating the homophone buffer BUF of the M phonetic word is assigned to the buffer MRCD. In this way, buffer MRCD will have a pointer to the most recently created homophone buffer BUF. After this, step 9
Return to -1.

ステップ８−５においては、選択キーの押された時のカ
ーソルの位置をめ、そのカーソル位置から文書の先頭方
向に向がって同音語が存在するがどうか、を検索する。In step 8-5, the position of the cursor when the selection key was pressed is determined, and a search is made from the cursor position toward the beginning of the document to see if a homophone exists.

この検索では、文書データ上の最上位ビットが°′１パ
となっている文字が検出される。同音語検索が終了する
と、ステップ８−６において同音語のコードから同音語
バッファＢＵＦへのポインタをめ、そのポインタにより
指示される同音語バッファＢＵＦ中の表示番号部に示さ
れる候補語の漢字コードを文書中に転送する。次に、ス
テップ９−７において請求めた同音語バッファへのポイ
ンタとバッファＭＲＣＤ内の値とを比較し、両者が一致
しなければ、変換直後ではないので、ステップ８−１へ
戻る。しかるに、一致すれば、ステップ９−８において
学習処理を行い、この後ステップ８−１へ戻る。In this search, a character whose most significant bit on the document data is °'1 pa is detected. When the homophone search is completed, a pointer to the homophone buffer BUF is obtained from the code of the homophone in step 8-6, and the kanji code of the candidate word indicated by the display number section in the homophone buffer BUF is pointed to by the pointer. into the document. Next, the pointer to the homophone buffer requested in step 9-7 is compared with the value in the buffer MRCD, and if they do not match, it is not immediately after conversion, and the process returns to step 8-1. However, if they match, a learning process is performed in step 9-8, and then the process returns to step 8-1.

なお、上述の木実施例においでは、かな漢字変換を漢字
指定方式により行うようにしているが。In the tree embodiment described above, the kana-kanji conversion is performed using the kanji specification method.

他の方式、例えば文節指定方式による入力であっても本
発明を適用できることは勿論である。Of course, the present invention can also be applied to input using other methods, such as the clause specification method.

［効　果］以上説明したように、本発明によれば、学習処理を漢字
変換直後の同音語に対して行うようにしたので、辞書が
フロッピーディスク等のアクセス時間のかかる外部記憶
媒体上に存在する場合にあっても、従来のように入力速
度が制限されることはない。また、使用頻度か高いため
に第１候補語として変換できなければ入力効率が極端に
低下してしまう用語に対しては、漢字変換直後にその候
補語の選択を行うことにより、学習処理が実行されてそ
の候補語が第１候補語となるので、学習処理による入力
効率を高め得るという利点をも充分に活用することがで
きる。[Effects] As explained above, according to the present invention, since the learning process is performed on homophones immediately after kanji conversion, the dictionary does not exist on an external storage medium such as a floppy disk that takes time to access. Even in such cases, the input speed is not limited as in the past. In addition, for words that are used so frequently that input efficiency would be extremely low if they could not be converted as the first candidate word, the learning process is performed by selecting the candidate word immediately after converting the kanji. Since the candidate word becomes the first candidate word, it is possible to fully utilize the advantage of increasing input efficiency through learning processing.

[Brief explanation of drawings]

第１図は本発明の全体構成を示す図、第２図は本発明の一実施例の構成を示すブロック図。第３図は辞書の構造を示す図、第４図（Ａ）〜（Ｃ）は学習処理による辞書内のデータ
の変更を示す図、第５図は文書の構造を示す図、第６図（Ａ）および（Ｂ）は文字の構造を示す図、第７
図は同音語バッファの構造を示す図、第８図（Ａ）〜（
Ｅ）は第２図の装置の動作を説明する図、第９図は第２図の装置の動作を示すフローチャートであ
る。１・・・入力手段、２・・・変換手段、３・・・出力手段、４・・・次候補指示手段、５・・・同音語選択手段、６・・・学習手段、７・・・辞書、ＫＢＢＵＦ、Ｃ０ＢＵＦ、ＭＲＣＤ、ＢＵＦ、ＤＢＵＦ
・・・バッファ、特許出願人　キャノン株式会社第３図 θハ゛イト　６ハ′イト　２）（イト第４図第６図第７図FIG. 1 is a diagram showing the overall configuration of the present invention, and FIG. 2 is a block diagram showing the configuration of an embodiment of the present invention. Figure 3 is a diagram showing the structure of a dictionary; Figures 4 (A) to (C) are diagrams showing changes in data in the dictionary due to learning processing; Figure 5 is a diagram showing the structure of a document; Figure 6 ( A) and (B) are diagrams showing the structure of characters, 7th
The figure shows the structure of the homophone buffer.
E) is a diagram explaining the operation of the apparatus shown in FIG. 2, and FIG. 9 is a flowchart showing the operation of the apparatus shown in FIG. DESCRIPTION OF SYMBOLS 1... Input means, 2... Conversion means, 3... Output means, 4... Next candidate instruction means, 5... Homophone selection means, 6... Learning means, 7... Dictionary, KBBUF, C0BUF, MRCD, BUF, DBUF
...Buffer, Patent Applicant Canon Co., Ltd. Fig. 3 θ height 6 height 2) (Item Fig. 4 Fig. 6 Fig. 7

Claims

[Claims] It has a dictionary in which kanji are recorded together with their pronunciations, and multiple kanji having the same pronunciation are stored in a searchable manner in a predetermined order. Kanji conversion is performed by searching within the dictionary, and when there are multiple homophones for the input pronunciation in the dictionary, a desired homophone is selected from the homophone group, and the selection is performed using the kanji. A character processing device characterized in that a learning process is executed only when the learning process is performed immediately after conversion to change the searchable order in the dictionary of the homophone group to which the selected kanji belongs. . (Hereafter, margin)