JPH0421160A

JPH0421160A - Document preparing device provided with learning function and its learning method

Info

Publication number: JPH0421160A
Application number: JP2125705A
Authority: JP
Inventors: Akiyoshi Kuromori; 黒森　明美
Original assignee: Toshiba Corp
Current assignee: Toshiba Corp
Priority date: 1990-05-16
Filing date: 1990-05-16
Publication date: 1992-01-24

Abstract

PURPOSE:To prevent the generation of misconversion at the time of executing the 'KANA' (Japanese syllabary)/'KANJI' (Chinese character) conversion of a composite word by providing the document preparing device with a composite word deciding means, deciding a composite word or not a composite word and updating the frequency information of the corresponding word or composite word. CONSTITUTION:When a certain reading is inputted through a keyboard 11 and a key 11a is depressed, the reading is processed by a 'KANA'/'KANJI' conversion processing part 16 in a 'KANA'/'KANJI' control part 15. The processing part 16 decides whether the inputted reading information is a composite word or not based upon a prescribed rule. In the case of a composite word, the processing part 16 refers a composite long learning dictionary 20 and checks whether the composite word has been learned for a long time or not. When long learning has been executed, the output order of homonym candidates is determined based upon the dictionary 20. When long learning has not been executed or the word is not a composite word, the processing part 16 refers a reference long learning dictionary 19, and when the word has been learned for a long period, the output order of homonyms is determined based upon the frequency information of the dictionary 19. When the word has not been learned for a long period, the output order is determined in accordance with the frequency information of the dictionary 18. Thus, proper output order can be determined and the generation of misconversion in the 'KANA'/'KANJI' conversion can be prevented.

Description

【発明の詳細な説明】［発明の目的］（産業上の利用分野）本発明は単語および複合語の学習機能、特に長期学習機
能を備えた文書作成装置およびその学習方法に関する。DETAILED DESCRIPTION OF THE INVENTION [Object of the Invention] (Industrial Application Field) The present invention relates to a document creation device having a word and compound word learning function, particularly a long-term learning function, and a learning method thereof.

（従来の技術）従来、文書作成装置では、同音語変換に際し、同し読み
に対する複数の単語のうち、使用頻度の高い単語から順
に参照できるようになっている。(Prior Art) Conventionally, in a document creation device, when performing homophone conversion, it is possible to refer to a plurality of words with the same pronunciation in order of frequency of use.

これは、同し読みに対する複数の単語とその各単語の頻
度情報を記憶した辞書を使用することにより、この辞書
から入力読み情報に対応する単語を頻度情報に従って順
次読出すことで実現される。This is achieved by using a dictionary that stores a plurality of words with the same reading and frequency information for each word, and sequentially reading words corresponding to the input reading information from this dictionary according to the frequency information.

ところで、このような辞書を用いる文書作成装置におい
て、単語の頻度情報が固定化されている場合には、ユー
ザが特に使用する単語であっても、その単語の頻度情報
か低ければ、直ぐには出てこないといった不具合が生じ
る。そこで、一般には、ユーザか同じ読みに対する複数
の単語のうちの１つを選択する毎に、この選択された単
語の頻度情報を更新する学習機能が用いられている。こ
れにより、ユーザに応じた単語の出現頻度構成を辞書に
持たせることかできる。By the way, in a document creation device that uses such a dictionary, if the frequency information of words is fixed, even if the word is particularly used by the user, if the frequency information of that word is low, it may not appear immediately. This may cause problems such as not being able to move. Therefore, a learning function is generally used that updates the frequency information of the selected word each time the user selects one of a plurality of words with the same pronunciation. This allows the dictionary to have a word appearance frequency structure tailored to the user.

この学習機能には、長期学習と短期学習がある。This learning function includes long-term learning and short-term learning.

長期学習ては、特定の指示がない限り、単語の出現頻度
に応じて頻度情報が更新される。一方、短期学習では、
例えば電源投入毎あるいは文書毎に頻度情報か初期化さ
れる。ここでは、長期学習機能を備えた文書作成装置に
ついて説明する。In long-term learning, unless there is a specific instruction, frequency information is updated according to the frequency of word appearance. On the other hand, in short-term learning,
For example, the frequency information is initialized every time the power is turned on or every document. Here, a document creation device equipped with a long-term learning function will be described.

従来、長期学習機能を備えた文書作成装置において、長
期学習は単語単位で行っていた。すなわち、例えば「き
ゆうはつしん」と入力して、「急発進」と選択、確定し
た場合には、「急」および「発進」でそれぞれ長期学習
する。このため、この単語を多く使用した後に、例えば
「きゆうしゃかい」と人力すると、「急」の頻度が高く
なっているため、「急社会」と変換されることになる。Conventionally, in document creation devices equipped with a long-term learning function, long-term learning has been performed on a word-by-word basis. That is, for example, if "sudden start" is inputted and "sudden start" is selected and confirmed, long-term learning is performed for "sudden" and "sudden start" respectively. For this reason, if you use this word a lot and then manually write, for example, ``kiyushakai,'' it will be converted to ``kyūshakai,'' since the frequency of ``kyū'' has increased.

逆に、「旧社会」を多く使用していると、「旧」の頻度
が高くなって、今度は「きゆうはつしん」が「旧発進」
となる。On the other hand, if you use "old society" a lot, "old" becomes more frequent, and "Kiyuuhatsushin" becomes "old departure".
becomes.

（発明が解決しようとする課題）上記したように、従来、単語レベルで長期学習していた
ので、複合語で意味をなす各単語が変換された際に、単
語単独で学習した頻度が効き、誤変換が生じやすい問題
があった。なお、複合語とは、２つ以上の単語が結合し
て新しい意味の一語を形づくるものである。(Problem to be solved by the invention) As mentioned above, conventionally, long-term learning was carried out at the word level, so when each word that has a meaning in a compound word is converted, the frequency of learning the word alone becomes effective, There was a problem that erroneous conversions were likely to occur. Note that a compound word is a word that combines two or more words to form a word with a new meaning.

本発明は上記のような点に鑑みなされたもので、複合語
のかな漢字変換に際し、誤変換を防ぐことのできる学習
機能を備えた文書作成装置およびその学習方法を提供す
ることを目的とする。The present invention has been made in view of the above-mentioned points, and an object of the present invention is to provide a document creation device and a learning method thereof that have a learning function that can prevent erroneous conversion when converting compound words into kana-kanji.

［発明の構成］（課題を解決するための手段）すなわち、本発明は、同じ読みに対する複数の単語およ
びこの各単語の頻度情報を記憶した第１の記憶手段と、
同じ読みに対する複数の複合語およびこの各複合語の頻
度情報を記憶した第２の記憶手段とを備え、入力読み情
報が複合語でない場合に上記入力読み情報に対応する単
語の同音語候補を上記第１の記憶手段から抽出し、複合
語である場合に上記入力読み情報に対応する複合語の同
音語候補を上記第２の記憶手段から抽出し、この抽出さ
れた同音語候補の中から所望の同音語か選択確定された
際に、その確定された同音語が複金詰でない場合に単語
の頻度情報を更新し、複合語である場合に複合語の頻度
情報を更新するようにしたものである。[Structure of the Invention] (Means for Solving the Problems) That is, the present invention provides a first storage means that stores a plurality of words for the same reading and frequency information of each word;
a second storage means storing a plurality of compound words corresponding to the same reading and frequency information of each compound word, and when the input reading information is not a compound word, the homophone candidate of the word corresponding to the input reading information is stored as the above-mentioned homophone candidate. If the compound word is a compound word, a homophone candidate of the compound word corresponding to the input reading information is extracted from the second storage means, and a desired homophone candidate is extracted from the extracted homophone candidates. When a homophone is selected and confirmed, the word frequency information is updated if the confirmed homophone is not a compound word, and the compound word frequency information is updated if it is a compound word. It is.

また、上記複合語の頻度情報の更新に際し、複合語の頻
度情報を更新すると同時に、その複合語を構成する各単
語の頻度情報も更新することを第２の特徴とする。A second feature is that when updating the frequency information of the compound word, the frequency information of the compound word is updated, and at the same time, the frequency information of each word constituting the compound word is also updated.

（作用）上記の構成によれば、複合語も単語と同様に長期学習さ
れることになり、複合語のかな漢字変換に際し、誤変換
を防ぐことができる。(Operation) According to the above configuration, compound words are learned over a long period of time in the same way as words, and erroneous conversion can be prevented when converting compound words into kana-kanji.

（実施例）以下、図面を参照して本発明の一実施例に係る文書作成
装置を説明する。(Embodiment) Hereinafter, a document creation device according to an embodiment of the present invention will be described with reference to the drawings.

第１図はその構成を示すブロック図である。第１図にお
いて、キーボード１１は、文書作成等に必要な入力情報
を入出力制御部１２に出力する。このキーボード１１に
は、「文字」キー　「数値」キー「カーソル」キー等に
加えて、ここでは、かな漢字変換および同音語選択を指
示するための［変換／次候補」キー１１２％単語の確定
を指示するための「選択／実行」キー１１ｂが設けられ
ている。入出力制御部１２は、キーボード１１および表
示部１３等の入出力制御を司るものであり、ここでは、
読みの入力、かな漢字変換結果の圧力等を制御する。FIG. 1 is a block diagram showing its configuration. In FIG. 1, a keyboard 11 outputs input information necessary for document creation etc. to an input/output control section 12. This keyboard 11 includes, in addition to "letter" keys, "numeric" keys, "cursor" keys, etc., a "conversion/next candidate" key for instructing kana-kanji conversion and homophone selection; A "selection/execution" key 11b for giving instructions is provided. The input/output control section 12 controls input/output of the keyboard 11, display section 13, etc., and here,
Controls reading input, pressure of kana-kanji conversion results, etc.

表示部１３は、例えば液晶デイスプレィからなり、入力
した読みや、その読みに対するかな漢字変換結果等を表
示する。The display unit 13 is composed of, for example, a liquid crystal display, and displays the input reading and the kana-kanji conversion results for the reading.

ＣＰ　Ｕ　１４は、この装置全体の制御を司る。かな漢
字変換制御部１５は、かな漢字変換指示に従い、かな漢
字変換処理部１６および長期学習処理部１７を制御する
。かな漢字変換処理部１６は、辞書１８、標準長期学習
辞書１９および複合語長期学習辞書２０を参照して、か
な漢字変換を行なう。長期学習処理部１７は、単語の出
現頻度に応じて標準長期学習辞書１９の頻度情報を更新
すると共に、複合語の出現頻度に応じて複合語長期学習
辞書２０の頻度情報を更新する。The CPU 14 is in charge of controlling the entire device. The kana-kanji conversion control section 15 controls the kana-kanji conversion processing section 16 and the long-term learning processing section 17 according to the kana-kanji conversion instruction. The kana-kanji conversion processing unit 16 refers to the dictionary 18, the standard long-term learning dictionary 19, and the compound word long-term learning dictionary 20 to perform kana-kanji conversion. The long-term learning processing unit 17 updates the frequency information of the standard long-term learning dictionary 19 according to the appearance frequency of words, and updates the frequency information of the compound word long-term learning dictionary 20 according to the appearance frequency of compound words.

ここで、各辞書１８．１９．２０について説明する。Here, each dictionary 18, 19, and 20 will be explained.

辞書１８は、予め同じ読みに対する複数の単語および上
記各単語の頻度情報を記憶している。この辞書１８に登
録されている単語の頻度情報は、例えば装置出荷時にお
いて予め設定されている。The dictionary 18 stores in advance a plurality of words with the same reading and frequency information of each word. The frequency information of words registered in this dictionary 18 is set in advance, for example, at the time of shipment of the device.

一方、標準長期学習辞書１９は、長期学習モード時に使
用される単語の辞書であり、上記辞書１８と１対１に対
応して設けられ、同じ読みに対する複数の単語および上
記各単語の頻度情報が記憶されている。たたし、この標
準長期学習辞書１９に登録されている単語の頻度情報は
、長期学習処理部１７によって更新される。第２図に標
準長期学習辞書１９の構成を示す。標準長期学習辞書１
９に登録されている各単語の頻度情報（長期情報）は、
当該単語の使用頻度に応じて、初期値から例えば「＋１
」ずつ更新される。On the other hand, the standard long-term learning dictionary 19 is a dictionary of words used in the long-term learning mode, and is provided in one-to-one correspondence with the dictionary 18, and contains a plurality of words for the same reading and frequency information of each word. remembered. However, the frequency information of words registered in this standard long-term learning dictionary 19 is updated by the long-term learning processing section 17. FIG. 2 shows the structure of the standard long-term learning dictionary 19. Standard long-term learning dictionary 1
Frequency information (long-term information) of each word registered in 9.
Depending on the usage frequency of the word, for example, "+1" is added to the initial value.
” will be updated.

また、複合語長期学習辞書２０は、長期学習モト時に使
用される複合語の辞書であり、同じ読みに対する複数の
複合語および上記各複合語の頻度情報が記憶されている
。この複合語長期学習辞書２０に登録されている複合語
の頻度情報も、標準長期学習辞書１９と同様に、長期学
習処理部１７によって更新される。第３図に複合語長期
学習辞書２ｏの構成を示す。複合語長期学習辞書２０に
登録されている各複合語頻度情報（長期情報）は、当該
複合語の使用頻度に応じて、初期値から例えば「＋１ｊ
ずつ更新される。Further, the compound word long-term learning dictionary 20 is a compound word dictionary used during long-term learning, and stores a plurality of compound words with the same reading and frequency information of each compound word. Frequency information of compound words registered in this compound word long-term learning dictionary 20 is also updated by the long-term learning processing section 17 similarly to the standard long-term learning dictionary 19. FIG. 3 shows the structure of the compound word long-term learning dictionary 2o. Each compound word frequency information (long-term information) registered in the compound word long-term learning dictionary 20 is changed from the initial value to, for example, "+1j" according to the usage frequency of the compound word.
It will be updated one by one.

なお、複合語とは、２つ以上の単語が結合して新しい意
味の一語を形づくるものである。Note that a compound word is a word that combines two or more words to form a word with a new meaning.

次に、第４図及び第５図に示すフローチャートを参照し
て同実施例の動作を説明する。Next, the operation of this embodiment will be explained with reference to the flowcharts shown in FIGS. 4 and 5.

まず、第４図に示すように、キーボード１１を通して読
みを入力し、同キーボードｔｉの「変換／次候補」キー
１１ａを操作すると（ステップＡＩ）、かな漢字変換制
御部１５内のかな漢字変換処理部１６を通じて、かな漢
字変換が行われる（ステップＡ２）。かな漢字変換処理
部１６は、まず、入力読み情報が複合語であるか否かを
所定のルールに基づいて判断する（ステップＡ３）。詳
しくは、辞書１８および標準長期学習辞書１９をサーチ
し、１回の変換操作では所望の一連の語に変換できない
場合に、複合語と判断する。First, as shown in FIG. 4, when the reading is input through the keyboard 11 and the "conversion/next candidate" key 11a of the keyboard ti is operated (step AI), the kana-kanji conversion processing section 16 in the kana-kanji conversion control section 15 Through this process, kana-kanji conversion is performed (step A2). The kana-kanji conversion processing unit 16 first determines whether the input reading information is a compound word based on a predetermined rule (step A3). Specifically, the dictionary 18 and the standard long-term learning dictionary 19 are searched, and if the word cannot be converted into a desired series of words in one conversion operation, it is determined that the word is a compound word.

この結果、複合語の場合には、かな漢字変換処理部１６
は複合語長期学習辞書２０を参照して、その読みに対す
る複合語が既に長期学習（登録）されているか否か調べ
、長期学習されていれば、同辞書２０の頻度情報に基づ
いて同音語候補の出力順序を決定し、「変換／次候補」
キー１１８の操作に応じて、同音語候補を表示部１３に
出力する（ステップＡ４〜Ａ６）。As a result, in the case of a compound word, the kana-kanji conversion processing unit 16
refers to the compound word long-term learning dictionary 20 to check whether the compound word corresponding to the pronunciation has already been long-term learned (registered), and if it has been long-term learned, a homophone candidate is determined based on the frequency information in the same dictionary 20. Determine the output order of "Convert/Next candidate"
In response to the operation of the key 118, homophone candidates are output to the display unit 13 (steps A4 to A6).

一方、ステップＳ３において、複合語でないと判断され
た場合、あるいはステップＳ４において、複合語であっ
ても長期学習はされていないと判断された場合、かな漢
字変換処理部１６は標準長期学習辞書１９を参照して、
当該単語あるいは複合語（まだ長期学習されてない複合
語）を構成する各単語が既に長期学習されているか否か
調べ、長期学習されていれば、同辞書１９の頻度情報に
基づいて同音語候補の出力順序を決定し、「変換／次候
補」キー１１ａの操作に応じて、同音語候補を表示部１
３に出力する（ステップＡ７、Ａ８、Ａ６）。On the other hand, if it is determined in step S3 that the word is not a compound word, or if it is determined in step S4 that long-term learning has not been performed even if it is a compound word, the kana-kanji conversion processing unit 16 converts the standard long-term learning dictionary 19 into a compound word. In reference to,
Check whether each word constituting the word or compound word (compound word that has not been learned for a long time) has already been learned for a long time. If it has been learned for a long time, homophone candidates are determined based on the frequency information in the dictionary 19. The output order of the homophone candidates is determined, and the homophone candidates are displayed on the display unit 1 in response to the operation of the "conversion/next candidate" key 11a.
3 (steps A7, A8, A6).

また、ステップＳ７において、単語長期学習されていな
いと判断された場合、かな漢字変換処理部１６は辞書１
８の頻度情報（予め設定されている固定の情報）に基づ
いて同音語候補の出力順序を決定し、「変換／次候補」
キーＬｌａの操作に応して、同音語候補を表示部１３に
出力する（ステップＡ９、Ａ６）。Further, in step S7, if it is determined that the word has not been learned for a long time, the kana-kanji conversion processing unit 16
The output order of homophone candidates is determined based on the frequency information (fixed information set in advance) in step 8, and the "conversion/next candidate"
In response to the operation of the key Lla, homophone candidates are output to the display unit 13 (steps A9, A6).

このようにして、表示部１３にかな漢字変換の結果が表
示される。この際に、読みに対する単語か複数ある場合
、つまり複数の同音語候補かある場合、かな漢字変換処
理部１６は上記決定した出力順序に基き、まず、頻度の
高い単語を第１候補として表示部１３に出力する。表示
部１３は、この単語に例えばオーバライン等の未確定状
態を示す記号を付して表示する。In this way, the result of the kana-kanji conversion is displayed on the display section 13. At this time, if there are multiple words for the reading, that is, if there are multiple homophone candidates, the kana-kanji conversion processing unit 16 first selects the most frequent word as the first candidate and displays it on the display unit 13 based on the output order determined above. Output to. The display unit 13 displays this word with a symbol indicating an undefined state, such as overline, for example.

ここで、表示された単語が望む単語でない場合には、「
変換／次候補」キー１１ａを操作する。この「変換／次
候補」キー１１ａの操作により、かな漢字変換処理部１
６は次候補の単語を表示部１３に出力する。これにより
、次候補の単語が上記同様にオーバラインと共に表示さ
れる。一方、表示された単語が望む単語である場合には
、「選択／実行」キー１１ｂを操作する。この「選択／
実行」キー１１ｂの操作により、単語が確定され、その
単語に付されていたオーバラインが消える。なお、複合
語についても同様である。Here, if the displayed word is not the word you want, click
2. Operate the "Convert/Next Candidate" key 11a. By operating this "conversion/next candidate" key 11a, the kana-kanji conversion processing unit 1
6 outputs the next candidate word to the display unit 13. As a result, the next candidate word is displayed with an overline as described above. On the other hand, if the displayed word is the desired word, the "select/execute" key 11b is operated. This “selection/
By operating the "Execute" key 11b, the word is confirmed and the overline attached to the word disappears. Note that the same applies to compound words.

次に、上記のようにして単語あるいは複合語を選択、確
定した後の動作を説明する。Next, the operation after selecting and confirming a word or compound word as described above will be explained.

第５図に示すように、［選択／実行ｊキー１１．ｂの操
作により、単語あるいは複合語の同音語が確定されると
（ステップＢ１）、漢字変換制御部１５内の長期学習処
理部１７はその確定された同音語が複合語であるか否か
を判断する（ステップＢ２）。As shown in FIG. 5, select/execute j key 11. When the homophone of a word or a compound word is determined by the operation b (step B1), the long-term learning processing section 17 in the kanji conversion control section 15 determines whether the determined homophone is a compound word or not. Judgment is made (step B2).

複合語の場合、長期学習処理部１７はその複合語を複合
語長期学習辞書２０に登録する（ステップ８３〜Ｂ５）
。複合語長期学習辞書２０に登録するとは、使用した複
合語の回数を長期頻度としてカウント（更新）していく
ことである。すなわち、第３図に示すように、例えば「
急」と「発進」からなる「急発進」という複合語を同音
語変換の際に確定した場合において、この「急発進」と
いう複合語が既に４回使用されていれば、その頻度情報
（長期情報）を「５」にすることである。In the case of a compound word, the long-term learning processing unit 17 registers the compound word in the compound word long-term learning dictionary 20 (steps 83 to B5).
. Registering a compound word in the long-term learning dictionary 20 means counting (updating) the number of times a compound word has been used as a long-term frequency. That is, as shown in FIG.
When the compound word "sudden start" consisting of "sudden start" and "start" is determined during homophone conversion, if this compound word "sudden start" has already been used four times, its frequency information (long-term information) to "5".

この場合、複合語長期学習辞書２Ｄの容量の関係から、
長期学習処理部１７は複合語の登録に際し、同辞書２０
の登録語数か所定数を越える否か調へ（ステップＢ３）
、所定数を越える場合には、頻度の低い複合語を同辞書
２０から削除する（ステップＢ４）。これにより、複合
語長期学習辞書２０のオーバフローを防くことかできる
。なお、上記所定数は、予め長期学習処理部１７に任意
に設定されている。In this case, due to the capacity of the compound word long-term learning dictionary 2D,
When registering a compound word, the long-term learning processing unit 17 uses the same dictionary 20.
Check whether the number of registered words exceeds a predetermined number (Step B3)
, if the predetermined number is exceeded, the infrequent compound words are deleted from the dictionary 20 (step B4). This makes it possible to prevent the compound word long-term learning dictionary 20 from overflowing. Note that the predetermined number is arbitrarily set in the long-term learning processing section 17 in advance.

一方、ステップＢ２において、複合語でないと判断され
た場合、つまり単語であると判断された場合、長期学習
処理部１７はその単語を標準長期学習辞書１９に登録す
る。標準長期学習辞書１９に登録するとは、複合語長期
学習辞書２０のときと同様、使用した単語の回数を長期
頻度としてカウント（更新）していくことである。On the other hand, in step B2, if it is determined that the word is not a compound word, that is, if it is determined that it is a word, the long-term learning processing section 17 registers the word in the standard long-term learning dictionary 19. Registering in the standard long-term learning dictionary 19 means, as in the case of the compound word long-term learning dictionary 20, counting (updating) the number of times a word has been used as a long-term frequency.

このようにして、単語と複合語が個別に長期学習にされ
る。以後、単語のかな漢字変換の際には、標準長期学習
辞書１９に登録された単語の頻度情報に基づいて、読み
に対する単語（同音語）が使用頻度順に表示されること
になる。同様に、複合語のかな漢字変換の際には、複合
語長期学習辞書２０に登録された複合語の頻度情報に基
づいて、読みに対する複合語（同音語）が使用頻度順に
表示されることになる。In this way, words and compounds are individually studied over a long period of time. Thereafter, when converting a word into kana-kanji, the words (homonyms) corresponding to the reading will be displayed in order of frequency of use based on the frequency information of the words registered in the standard long-term learning dictionary 19. Similarly, when converting compound words into kana-kanji, compound words (homonyms) for readings will be displayed in order of frequency of use based on the frequency information of compound words registered in the compound word long-term learning dictionary 20. .

具体的に説明する。例えば「きゅうはっしん」と入力す
ると、まず、その入力読み情報が複合語か否か判断され
る。この場合、上述したように、辞書１８および標準長
期学習辞書１９をサーチすることによって複合語か否か
が判断される。複合語と判断された場合には、既に長期
学習されているか否か判断される。初期時には、まだ、
複合語として長期学習されていない。したがって、この
ような場合には、「きゅうはりしん」に対する同音語候
補は単語レベルで出力されることになる。すなわち、「
きゅう」と「はっしん」でかな漢字変換され、例えば「
急発信」、「急発進」、「急発疹」・・・という順に出
力される。I will explain in detail. For example, when you input "Kyu Hasshin", it is first determined whether the input reading information is a compound word or not. In this case, as described above, it is determined whether the word is a compound word by searching the dictionary 18 and the standard long-term learning dictionary 19. If the word is determined to be a compound word, it is determined whether the word has already been studied for a long period of time. At the beginning, still
It has not been learned as a compound word over a long period of time. Therefore, in such a case, homophone candidates for "Kyuharishin" will be output at the word level. In other words, “
"Kyu" and "Hasshin" are converted into kana-kanji, for example, "
"Sudden call", "Sudden start", "Sudden rash", etc. are output in this order.

この場合、複合語を構成する各単語が長期学習されてい
るか否かによって、その同音語候補の出力順序は異なる
。すなわち、各単語が長期学習されていれば標準長期学
習辞書１９の頻度情報（長期情報）に基づいて同音語候
補の出力順序が決定され、その順に同音語候補が出力さ
れる。一方、各単語が長期学習されていなければ辞書１
８の頻度情報（固定情報）に基づいて同音語候補の出力
順序が決定され、その順に同音語候補が出力される。In this case, the output order of homophone candidates differs depending on whether each word constituting the compound word has been studied over a long period of time. That is, if each word has been studied over a long period of time, the output order of homophone candidates is determined based on the frequency information (long-term information) of the standard long-term learning dictionary 19, and the homophone candidates are output in that order. On the other hand, if each word has not been studied for a long time, Dictionary 1
The output order of homophone candidates is determined based on the frequency information (fixed information) of No. 8, and the homophone candidates are output in that order.

次に、上記同音語候補の中から「急発進」を選択、確定
したとする。この「急発進」は複合語なので、複合語長
期学習辞書２０に例えば第３図のように登録される。以
後、例えば「きゆうしゃかいコと入力すると、ます、複
合語として長期学習されているか否かが調べられる。こ
の場合、長期学習されていないので、次に「きゅう」と
「しやかい」という単語レベルで長期学習されているか
否か調べられる。Next, it is assumed that "sudden start" is selected and confirmed from among the homophone candidates. Since "sudden start" is a compound word, it is registered in the compound word long-term learning dictionary 20 as shown in FIG. 3, for example. After that, for example, if you enter ``Kyuushakaiko'', it will be checked whether it has been learned as a compound word over a long period of time. You can check whether or not there is long-term learning at the word level.

ここで、「きゅう」は既に上述した「急発進」で使用さ
れているが、単独で使用されているわけてはないので、
第２図に示すように長期学習なしとなる。また、１社会
」についても長期学習なしである。したがって、規定の
頻度順（辞書１８の固定頻度順）で「きゅう」と「しや
かい」に対する同音語候補が出力される。これにより、
例えば「旧」と「社会」の各単語の同音語候補が高けれ
ば、「旧社会」が第１候補として出力されることになり
、「急社会」という誤変換を防ぐことができる。Here, "kyu" is already used in "sudden start" mentioned above, but it is not used alone, so
As shown in Figure 2, there is no long-term learning. Also, there is no long-term study for "1 Society". Therefore, homophone candidates for "kyu" and "shiyakai" are output in a prescribed frequency order (fixed frequency order in the dictionary 18). This results in
For example, if the homophone candidates for each of the words "old society" and "society" are high, "old society" will be output as the first candidate, and it is possible to prevent erroneous conversion of "society".

なお、上記実施例では、複合語を登録する場合において
、複合語のみを複合語長期学習辞書２０に登録するよう
にしたが、例えば複合語を同辞書２０に登録すると同時
に、その複合語を構成する各単語を通常よりも比率を低
めにして標準長期学習辞書１９に登録するようにしても
良い。このように構成した場合の動作を第６図乃至第８
図を参照して説明する。In the above embodiment, when registering a compound word, only the compound word is registered in the compound word long-term learning dictionary 20, but for example, at the same time as registering the compound word in the same dictionary 20, the compound word is Each word may be registered in the standard long-term learning dictionary 19 at a lower ratio than usual. The operation when configured in this way is shown in Figures 6 to 8.
This will be explained with reference to the figures.

すなわち、第６図に示すように、「選択／実行」キー１
．１　ｂの操作により、単語あるいは複合語の同音語が
確定されると（ステップＣ１）、漢字変換制御部１５内
の長期学習処理部１７はその確定された同音語が複合語
であるか否かを判断する（ステップＣ２）。複合語の場
合、長期学習処理部１７はその複合語を複合語長期学習
辞書２０に登録すると同時に、同複合語を構成する各単
語を標準長期学習辞書１９に登録する（ステップ０３〜
Ｃ６）。That is, as shown in FIG.
．． When a homophone of a word or a compound word is determined by the operation in step 1b (step C1), the long-term learning processing section 17 in the kanji conversion control section 15 determines whether the determined homophone is a compound word or not. (Step C2). In the case of a compound word, the long-term learning processing unit 17 registers the compound word in the compound word long-term learning dictionary 20, and at the same time registers each word constituting the compound word in the standard long-term learning dictionary 19 (steps 03-
C6).

この場合、上記実施例（第５図）と同様、複合語長期学
習辞書２０の容量の関係から、長期学習処理部１７は複
合語の登録に際し、同辞書２０の登録語数か所定数を越
える否か調べ（ステップＣ３）所定数を越える場合には
、頻度の低い複合語を同辞書２０から削除する（ステッ
プＣ４）。In this case, as in the above embodiment (FIG. 5), due to the capacity of the compound word long-term learning dictionary 20, the long-term learning processing unit 17 determines whether the number of registered words in the compound word 20 exceeds a predetermined number when registering the compound word. (Step C3) If the predetermined number is exceeded, the infrequent compound word is deleted from the dictionary 20 (Step C4).

このようにして、例えば第７図及び第８図に示すように
、「急発進」という複合語か複合語長期学習辞書２０に
登録された場合には、「急」と「発進」という各単語も
標準長期学習辞書１９に登録されることになる。なお、
この場合の単語頻度の更新は、単語単独で長期学習する
場合と異なるので、通常よりも低めに設定する必要があ
る。これにより、複合語とその複合語を構成する各単語
を同時に長期学習することができ、以後、例えば「きゆ
うてんぼ」と入力した場合に、学習している「急」を優
先して、「急テンポ」を第１候補として出力、表示する
ことができるものである。In this way, for example, as shown in FIGS. 7 and 8, when the compound word "sudden start" is registered in the compound word long-term learning dictionary 20, the words "sudden" and "start" are registered. will also be registered in the standard long-term learning dictionary 19. In addition,
Updating the word frequency in this case is different from the case of long-term learning of words alone, so it needs to be set lower than usual. As a result, you can simultaneously study a compound word and each word that makes up the compound word over a long period of time, and from now on, when you input ``kiyutenbo,'' for example, the ``kyu'' you are learning will be given priority. It is possible to output and display "rapid tempo" as the first candidate.

さらに、上記実施例では、長期学習辞書を標準用と複合
語用に分けているが、本発明はこれに限るものではなく
、例えば第９図に示すように、つの長期学習辞書として
構築することも可能である。この場合、各単語に複合語
フラグＡを設け、複合語として使用された際に、その単
語の複合語フラグＡをセットする構成とすれば良い。Further, in the above embodiment, the long-term learning dictionary is divided into standard and compound words, but the present invention is not limited to this. For example, as shown in FIG. 9, the long-term learning dictionary may be constructed as two long-term learning dictionaries. is also possible. In this case, a compound word flag A may be provided for each word, and when the word is used as a compound word, the compound word flag A may be set for that word.

［発明の効果］以上のように本発明によれば、複合語も単語と同様に長
期学習できる。したかって、複合語のかな漢字変換に際
し、迅速に所望の語を得ることができると共に、誤変換
を防ぐことができる。これにより、文書作成効率が大幅
に向上するものである。[Effects of the Invention] As described above, according to the present invention, compound words can be studied over a long period of time in the same way as words. Therefore, when converting a compound word into kana-kanji, a desired word can be quickly obtained and erroneous conversion can be prevented. This greatly improves document creation efficiency.

[Brief explanation of the drawing]

第１図は本発明の一実施例に係る文書作成装置の構成を
示すブロック図、第２図は同実施例の標準長期学習枠、
書の構成を示す図、第３図は同実施例の複合語長期学習
辞書の構成を示す図、第４図は同実施例のかな漢字変換
処理の動作を説明するためのフローチャート、第５図は
同実施例の長期学習処理の動作を説明するためのフロー
チャート、第６図は本発明の他の実施例に係る長期学習
処理の動作を説明するためのフローチャート、第る複合
語長期学習辞書の構成を示す図、第９図は本発明の他の
実施例に係る標準及び複合語用の長期学習辞書の構成を
示す図である。１１・・・キーボード、ｌｌａ・・・「変換／次候補」
キー１１ｂ・・・「選択／実行」キー　Ｉ２・・・入出
力制御部、１３・・・表示部、１４・・・ＣＰＵ５１５
・・・かな漢字変換制御、１６・・・かな漢字変換処理
部、１７・・・長期学習処理部、１８・・・辞書、１９
・・・標準長期学習辞書、２０・・・複合語長期学習辞
書。出願人代理人　弁理士　鈴江武彦第図図第図図FIG. 1 is a block diagram showing the configuration of a document creation device according to an embodiment of the present invention, and FIG. 2 shows a standard long-term study frame of the same embodiment.
Figure 3 is a diagram showing the configuration of the compound word long-term learning dictionary of the same embodiment, Figure 4 is a flowchart for explaining the operation of the kana-kanji conversion process of the same embodiment, and Figure 5 is a diagram showing the structure of the compound word long-term learning dictionary of the same embodiment. FIG. 6 is a flowchart for explaining the operation of the long-term learning process according to another embodiment of the present invention, and the structure of the first compound word long-term learning dictionary. FIG. 9 is a diagram showing the structure of a long-term learning dictionary for standard and compound words according to another embodiment of the present invention. 11...Keyboard, lla..."Conversion/Next candidate"
Key 11b..."selection/execution" key I2...input/output control section, 13...display section, 14...CPU 515
...Kana-Kanji conversion control, 16...Kana-Kanji conversion processing unit, 17...Long-term learning processing unit, 18...Dictionary, 19
...Standard long-term learning dictionary, 20...Compound word long-term learning dictionary. Applicant's agent Patent attorney Takehiko Suzue

Claims

[Claims]

(1) A first storage means that stores a plurality of words for the same reading and frequency information of each word; and a second storage means that stores a plurality of compound words for the same reading and frequency information of each compound word. , a first compound word determining means for determining whether or not the input reading information is a compound word; and a first compound word determining means for determining whether or not the input reading information is a compound word; Candidate extraction for extracting homophone candidates from the first storage means, and extracting homophone candidates of the compound word corresponding to the input reading information from the second storage means when it is determined that the compound word is the compound word. a homophone selection confirmation means for selecting a desired homophone from among the homophone candidates extracted by the candidate extraction means and confirming the selected homophone; a second compound word determining means for determining whether a homophone is a compound word; and a second compound word determining means for updating word frequency information when the second compound word determining means determines that the word is not a compound word; What is claimed is: 1. A document creation device having a learning function, comprising: an updating means for updating frequency information of a compound word when it is determined that there is a compound word.

(2) The updating means updates the frequency information of the compound word when it is determined to be a compound word by the second compound word determining means, and at the same time updates the frequency information of each word constituting the compound word. A document creation device with a learning function according to claim 1.

(3) In a document creation device equipped with a learning function, a plurality of words corresponding to the same pronunciation and frequency information of each word are stored in a first storage area of the storage means, and a plurality of compound words corresponding to the same pronunciation and frequency information of each of these words are stored in a first storage area of the storage means. Frequency information of compound words is stored in a second storage area of the same storage means, and it is determined whether the input reading information is a compound word or not.As a result, if it is determined that the input reading information is not a compound word, the above input reading information is stored. A homophone candidate for the word corresponding to the input reading information is extracted from the first storage area, and when it is determined that the word is a compound word, the homophone candidate for the compound word corresponding to the input reading information is extracted from the second storage area. When a desired homophone is selected and confirmed from the extracted homophone candidates, it is determined whether or not this homophone is a compound word. A learning method characterized in that frequency information of a word is updated when it is determined that the word is a compound word, and frequency information of the compound word is updated when it is determined that the word is a compound word.