JP3768577B2

JP3768577B2 - Character processing method

Info

Publication number: JP3768577B2
Application number: JP34413895A
Authority: JP
Inventors: 浩司前川
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 1995-12-28
Filing date: 1995-12-28
Publication date: 2006-04-19
Anticipated expiration: 2015-12-28
Also published as: JPH09185612A

Description

【０００１】
【発明の属する技術分野】
本発明は、文字処理方法、特に文書処理装置に入力されたかな文字列からかな漢字変換により、かな漢字混じりの文字列を作成する文字処理方法に関するものである。
【０００２】
【従来の技術】
従来の文書処理装置のかな漢字変換の処理方法を図１６のフローチャートにより説明する。
【０００３】
まず、文書処理装置では、かな文字列（よみ）が入力され（ステップｓ１）、このかな文字列に対してかな漢字変換を行ない変換候補を表示出力する。より具体的にはかな漢字辞書を参照し、よみに対応する表記を変換候補として抽出する（ステップｓ２）。変換候補がユーザにとって希望する結果ではなかった場合、（ステップｓ３）、ユーザはキーボードを操作して、入力の文字列の区切りを変更して再変換を指示するか、または、同音異義語の他の候補を指示するかによって変換候補を修正する（ステップｓ４）。ｓ４の修正処理により作成された単語（よみと表記）が辞書に存在しなかった場合（ステップｓ５）、この単語を自動登録処理によって学習データ記憶領域に格納する（ステップ、ｓ６）。また、同時にその単語が次回の変換処理に反映されるように、使用頻度について単語学習処理を行う（ステップｓ７）。
【０００４】
具体的なかな漢字変換の処理例を図１７から図１９を用いて説明する。
【０００５】
図１７において、入力文字列［だいじんしょう］を変換し、変換候補として［大臣章］が出力される。しかし、変換候補は希望する結果［大臣賞］と違うので、候補の修正，選択により、［章］を［賞］に変更し、［賞］を単語学習する。
【０００６】
以上の操作を実行すると、次回に次に［だいじんしょう］と入力すると［大臣賞］の変換候補を得ることができる。
【０００７】
次に図１８に示すように入力文字列［でーたしょう］を与えると、［賞］の使用頻度が高いので変換候補として［データ賞］を出力する。しかし、変換候補は希望する結果［データショウ］と違うので、ユーザは候補選択により、［賞］を［ショウ］に変換しなければならない。変更した［ショウ］は辞書に存在しない単語であるため、自動的に学習データに自動登録単語として登録される。さらに自動登録した単語を単語学習する。
【０００８】
次に図１９に示すように再び［だいじんしょう］を入力し変換した場合、［ショウ］が自動登録単語として存在し、かつ前回の変換時に単語学習されているために、変換候補は［大臣ショウ］となってしまう。
【０００９】
【発明が解決しようとする課題】
このように、従来の文字処理方法ではもともと辞書になかった単語が辞書に登録されると、通常の辞書の単語よりも優先されるので、その結果、所望の変換候補がすぐに得られず、変換率が落ちてしまうことがある。
【００１０】
そこで、本発明の目的は、上述の点に鑑みて、自動登録をした単語についても好適な変換処理を行なうことができる文字処理方法を提供することにある。
【００１１】
【課題を解決するための手段】
このような目的を解決するために、請求項１の発明は、単語のよみと表記と頻度とを対応付けて記憶するための辞書記憶手段と、単語のよみを入力するための入力手段とを有し、該辞書記憶手段を参照して、前記入力手段より入力された単語のよみに対して頻度の高い表記を優先して変換候補とするとともに、当該辞書記憶手段に未登録の単語の表記がよみの変換結果として作成されたときに、当該変換結果における単語のよみと表記を前記辞書記憶手段に自動登録することの可能な文書処理装置のための文字処理方法において、前記文書処理装置は、第１判定手段、第２判定手段、学習手段および調節手段をさらに有し、前記入力手段より入力された単語のよみから表記への変換に際し、変換対象の単語が前記辞書記憶手段に自動登録された単語か予め登録されている単語かを前記第１判定手段により判定する第１判定ステップと、前記第１判定ステップにより自動登録された単語と判定された場合に、前記辞書記憶手段に記憶された当該単語の頻度が所定値に到達しているか否かを前記第２判定手段により判定する第２判定ステップと、前記第１判定ステップにより予め前記辞書に登録されていると判定されたとき、または前記第２判定ステップにより前記所定値に到達していると判定されたときに、当該変換結果を次回の変換に反映させるために変換履歴を履歴記憶手段に記憶する単語学習を前記学習手段により行う学習ステップと、前記第２判定ステップにより前記所定値に到達していないと判定されたときに当該自動登録された単語について前記辞書記憶手段に記憶された頻度を１だけ前記調節手段により増加させ、前記学習ステップの終了後においては単語学習した単語と同じよみを持つ自動登録された単語について前記辞書記憶手段に記憶された頻度を１だけ前記調節手段により減少させる調節ステップとを具えたことを特徴とする。
【００１２】
請求項２の発明は、前記辞書記憶手段は自動登録の単語を記憶する領域と漢字変換通常辞書領域とを有することを特徴とする。
【００１３】
請求項３の発明は、単語のよみと表記と頻度とを対応付けて記憶するための辞書記憶手段と、単語のよみを入力するための入力手段とを有し、該辞書記憶手段を参照して、前記入力手段より入力された単語のよみに対して頻度の高い表記を優先して変換候補とするとともに、当該辞書記憶手段に未登録の単語の表記がよみの変換結果として作成されたときに、当該変換結果における単語のよみと表記を前記辞書記憶手段に自動登録することの可能な文書処理装置のための文字処理方法において、前記文書処理装置は判定手段、第１調節手段および第２調節手段をさらに有し、前記入力手段より入力された単語のよみから表記への変換に際し、変換対象の単語が前記辞書記憶手段に自動登録された単語か予め登録されている単語かを前記判定手段により判定する判定ステップと、前記判定ステップにより自動登録された単語と判定された場合は当該変換対象の単語について前記辞書記憶手段に記憶された頻度を1だけ前記第１調節手段により増加させる第１調節ステップと、予め辞書に登録されている単語と判定された場合は、当該変換結果を次回の変換に反映させるために変換履歴を履歴記憶手段に記憶する単語学習をして、当該単語と同一のよみを有する自動登録された単語について前記辞書記憶手段に記憶された頻度を１だけ前記第２調節手段により減少させる第２調節ステップとを具えたことを特徴とする。
【００１４】
請求項４の発明は、前記辞書記憶手段は自動登録の単語を記憶する領域と漢字変換通常辞書領域とを有することを特徴とする。
【００１５】
【発明の実施の形態】
以下、図面を参照して本発明の１実施形態について説明する。
【００１６】
図１は本発明を適用した文書処理装置のシステム構成を示す。図１において入力装置（キーボードなど）１から入力されたかな文字列は、中央演算装置（ＣＰＵなど）２によって処理され、出力装置（ＣＲＴなど）３に出力される。また、その変換の履歴（単語学習など）は、記憶装置（ハードディスクなど）４に記憶され次の解析の時に、変更の履歴を変換結果に反映させる。記憶装置４には辞書を構成するかな漢字変換用通常辞書４Ａおよび自動登録単語記憶領域４Ｂが設けられている。
【００１７】
図２はＣＰＵ２の文字処理に関わる処理手順を示す。
【００１８】
ＣＰＵ２は入力装置１から入力された文字列を、かな漢字変換処理によって処理して変換候補を出力する（ステップｓ１０１→ｓ１０２）。ＣＰＵ２は変換候補が希望するものかどうかを変換候補の確定の指示の有無により判断する（ステップｓ１０３）。希望する変換結果だった場合、変換結果の単語が自動登録かどうか判断する（ステップｓ１０７）。希望する同音語ではなかった場合、ユーザの指示により候補修正処理を実行していない変換候補を修正し、最終的に確定させる。修正された後の変換候補が辞書に存在するかどうか判断する（ステップｓ１０５）。存在しない単語だった場合、新たに自動登録処理（ステップｓ１０６）を行う。
【００１９】
確定した候補が辞書に登録されている同音語だった場合、変換結果の単語が自動登録されているかどうか判断する（ステップｓ１０７）。自動登録ではなかった場合、単語学習処理（ステップｓ１０９）を行う。変換単語が自動登録だった場合は、自動登録単語の頻度をみて、頻度が十分に高ければ、すなわち、所定の使用回数を超えていれば単語学習処理（ステップｓ１０９）を行う。さらに自動登録単語の場合には、自動登録単語頻度調節処理を行う（ステップｓ１０）。
【００２０】
図２の処理手順を実行したときの具体的な処理内容を図３から図４を用いて説明する。
【００２１】
図３は、ユーザが［でーたしょう］を入力すると、文書処理装置では辞書４Ａに基づき、かな漢字変換を行い［データ証］という変換候補を得る。この変換候補がユーザの希望する変換結果ではないとき、ユーザはキーボードを操作して、希望する変換候補［データショウ］に変換する。
【００２２】
図４のように、候補選択した［ショウ］が、辞書Ａ４に存在しない単語のとき文書処理装置で自動登録処理を行う。
【００２３】
後にユーザが再び［でーたしょう］を入力すると、文書処理装置では、辞書４Ａに基づきかな漢字変換を行い［データ証］という変換候補を得る。ユーザにとってやはり希望する変換結果ではないので、ユーザは希望する変換候補［データショウ］に変換する（図５参照）。
【００２４】
候補選択によって選択された［ショウ］は、自動登録単語であるので、文書処理装置は、自動登録単語の頻度をみて、頻度が十分に高ければ（本実施形態ではその判断値を４以上とする）単語学習処理を行う。今の場合、頻度が０であるため、単語学習処理は行われず、自動登録頻度調節処理によって、自動登録単語の頻度を０から１に高める（図６参照）。
【００２５】
ユーザがこの単語を選択／決定していく度に［ショウ］の頻度が高くなって、４以上になると、図７に示すように一回目の変換結果として［ショウ］が出力されるようになる。
【００２６】
図８のようにユーザが［だいじんしょう］を入力し、かな漢字変換を行い［大臣ショウ］という変換候補を得た場合に、この変換候補がユーザにとって希望する変換結果ではないとき、ユーザは希望する変換結果［大臣賞］へ候補選択により、変更する。また、［賞］を以下のように単語学習する。すなわち、ユーザが第一候補になった自動登録の［ショウ］を他の候補に変更して確定したので、文書処理装置は、自動登録頻度調節により［ショウ］の使用頻度を１低下させる（図９参照）。
【００２７】
ユーザが図１０のように［でーたしょう］を入力し、かな漢字変換を行って［データ賞］と言う変換候補を得たとする。この変換候補は希望する変換候補ではないため、ユーザは［賞］を［ショウ］に修正する。自動登録である［ショウ］が選ばれたため、文書処理装置はこの単語について頻度が十分高いかどうか判断する。この単語は頻度が３であり、判断値４より小さいため単語学習処理は行わず、自動登録頻度調節により、頻度を高くする（頻度４，図１１参照））。
【００２８】
ここで、図１２のようにユーザが再び［だいじんしょう］を入力したときには、［ショウ］は学習されておらず、［賞］の単語学習が効くために変換結果として［大臣賞］と変換される。ユーザが図１３のように再び［でーたしょう］を入力したときには、やはり単語学習が適用されるために、［データ賞］と変換される。しかしこの候補はユーザによって希望する候補ではないため、ユーザは候補選択処理により［賞］を［ショウ］に変換する。このとき選択した［ショウ］が自動登録単語であるので、文書処理装置は頻度が十分に高いかどうかをチェックする。図１４に示すように、頻度が４で判断値４に到達したので、文書処理装置はこの自動登録単語を普通の辞書登録語と同様に扱い、単語学習処理を行い、さらに頻度を高くする。
【００２９】
（他の実施例）
上述の実施例では、ユーザが確定した変換候補が自動登録単語の場合には頻度に応じて単語学習するか否かを決定している。しかしながら、辞書４Ａに記載されている単語については単語学習し、自動登録単語については単語学習を行なわないようにすることができる。このための文字処理手順を図１５に示す。
【００３０】
図１５において、図２と同様の処理については同一の符号を付している。図１５のステップｓ２００〜ｓ２１０が自動登録単語を検出し、単語学習しない処理のための手順を示す。
【００３１】
この処理手順に従うと、ユーザが［でーたしょう］を入力し、かな漢字変換を行うと、［データ証］という変換候補を得る（図１５のステップｓ１０１→ｓ１０２）。ここでユーザにとって希望する変換候補ではないので、キーボード装置を操作して変換候補「証」から「ショウ」に修正する。所望の変換候補「データショウ」を取得し、変換候補の確定を指示する（図１５のステップｓ１０４、図３参照）。
【００３２】
文書処理装置は、確定の変換候補「ショウ」が辞書４Ａまたは自動登録領域４Ｂに存在するかを判定する（図１５のステップｓ１０５）。
【００３３】
この「ショウ」が辞書４Ａおよび自動登録領域４Ｂに存在しない場合はよみ「しょう」および表記「ショウ」が自動登録単語に登録され、頻度が０に設定される（図１５のステップｓ１０６）。
【００３４】
新たにユーザが「でーたしょう」と入力すると文書処理装置はかな漢字変換を実行する。
【００３５】
上述の「ショウ」は単語学習されていないので、前回と同様、かな漢字変換候補として「データ証」が得られる（図１５のステップｓ１０１→ｓ１０２）。
【００３６】
ユーザは変換候補の中の「証」を「ショウ」に修正し、「データショウ」を確定する（図１５のステップｓ１０４）。文書処理装置においてステップｓ１０５の判定、すなわち、変換候補「ショウ」が辞書に存在するかの判定が実行される。この時点で「ショウ」は自動登録されているので肯定判定が得られる。続いて図１５のステップｓ２００の判定でも肯定判定が得られるので、ステップＳ２０１の単語学習は行われず、「ショウ」に関する頻度が「０」から「１」に調節される（図１５のステップｓ２０２）。
【００３７】
このようにして、自動登録単語についてはユーザがその単語を使用する毎に頻度が高くなるので、他の同音の単語よりも表示順が早くなる。また、自動登録単語については単語学習していないので、単語学習した、通常のかな漢字変換辞書４Ａに記載された単語の方が表示順が早くなる。
【００３８】
このようにして、自動登録単語を使用した後、「ショウ」と同音のかな漢字変換辞書４Ａに記載の単語がユーザによって候補選択された場合、図１５の実行手順はステップｓ１０１〜ｓ１０５→ｓ２００→ｓ２０１経路を進み、その単語の単語学習の後、上記「ショウ」の単語については頻度が−１される（図１５のステップｓ２０２）。
【００３９】
【発明の効果】
以上、説明したように、本発明では、自動登録単語と、辞書に予め記載した単語の区別をつけ、余り使われていない自動登録単語は変換候補としての順位を低くする。これにより、自動登録単語を登録したことによる弊害をなくした。また変換率が向上し、文書処理装置の操作性の改善に寄与することができる。
【図面の簡単な説明】
【図１】本発明を適用した文書処理装置のシステム構成を示すブロック図である。
【図２】文書処理装置の処理手順を示すフローチャートである。
【図３】文書処理装置の処理内容を示す説明図である。
【図４】文書処理装置の処理内容を示す説明図である。
【図５】文書処理装置の処理内容を示す説明図である。
【図６】文書処理装置の処理内容を示す説明図である。
【図７】文書処理装置の処理内容を示す説明図である。
【図８】文書処理装置の処理内容を示す説明図である。
【図９】文書処理装置の処理内容を示す説明図である。
【図１０】文書処理装置の処理内容を示す説明図である。
【図１１】文書処理装置の処理内容を示す説明図である。
【図１２】文書処理装置の処理内容を示す説明図である。
【図１３】文書処理装置の処理内容を示す説明図である。
【図１４】文書処理装置の処理内容を示す説明図である。
【図１５】他の実施例の処理手順を示すフローチャートである。
【図１６】従来の処理手順を示すフローチャートである。
【図１７】従来の処理内容を示す説明図である。
【図１８】従来の処理内容を示す説明図である。
【図１９】従来の処理内容を示す説明図である。
【符号の説明】
１入力装置
２ＣＰＵ
３出力装置
４記憶装置
４Ａかな漢字変換辞書
４Ｂ自動登録単語領域[0001]
BACKGROUND OF THE INVENTION
The present invention relates to a character processing method, and more particularly to a character processing method for generating a kana-kanji mixed character string by kana-kanji conversion from a kana character string input to a document processing apparatus.
[0002]
[Prior art]
A kana-kanji conversion processing method of the conventional document processing apparatus will be described with reference to the flowchart of FIG.
[0003]
First, in the document processing apparatus, a kana character string (reading) is input (step s1), kana-kanji conversion is performed on the kana character string, and conversion candidates are displayed and output. More specifically, a kana-kanji dictionary is referred to, and a notation corresponding to the text is extracted as a conversion candidate (step s2). If the conversion candidate is not the result desired by the user (step s3), the user operates the keyboard to change the input character string delimiter and instruct reconversion, or other homophones The conversion candidate is corrected depending on whether the candidate is designated (step s4). When the word (reading and notation) created by the correction process of s4 does not exist in the dictionary (step s5), this word is stored in the learning data storage area by the automatic registration process (step, s6). At the same time, a word learning process is performed for the usage frequency so that the word is reflected in the next conversion process (step s7).
[0004]
A specific example of Kana-Kanji conversion processing will be described with reference to FIGS.
[0005]
In FIG. 17, the input character string [Daijinsho] is converted, and [Ministerial Chapter] is output as a conversion candidate. However, since the conversion candidate is different from the desired result [Minister Award], by changing and selecting the candidate, [Chapter] is changed to [Award] and [Prize] is learned by words.
[0006]
When the above operations are executed, the next time [Daijinjo] is entered, conversion candidates for [Minister Award] can be obtained.
[0007]
Next, as shown in FIG. 18, when an input character string [deta-sho] is given, [data award] is output as a conversion candidate because the use frequency of [award] is high. However, since the conversion candidate is different from the desired result [data show], the user must convert [prize] to [show] by selecting the candidate. Since the changed [Show] is a word that does not exist in the dictionary, it is automatically registered as an automatically registered word in the learning data. In addition, learn words automatically registered words.
[0008]
Next, when [Daijinjo] is input again and converted as shown in FIG. 19, [Show] exists as an automatically registered word, and the word is learned at the time of the previous conversion. Show].
[0009]
[Problems to be solved by the invention]
Thus, when a word that was not originally in the dictionary in the conventional character processing method is registered in the dictionary, it takes precedence over the word in the normal dictionary, and as a result, a desired conversion candidate cannot be obtained immediately, Conversion rate may drop.
[0010]
In view of the above, an object of the present invention is to provide a character processing method capable of performing a suitable conversion process for automatically registered words.
[0011]
[Means for Solving the Problems]
In order to solve such an object, the invention of claim 1 includes a dictionary storage means for storing word readiness , notation, and frequency in association with each other, and an input means for inputting the word readiness. The dictionary storage means is referred to, and a high-frequency notation is preferentially used as a conversion candidate for the word read from the input means, and a word not yet registered in the dictionary storage means when it is created as a conversion result of the Gayomi, the character processing method for the possible document processing device to automatically register denoted word reading in the conversion result to the dictionary storage means, the document processing device , A first determination means, a second determination means, a learning means, and an adjustment means, and when the word input from the input means is converted into a notation, the conversion target word is automatically registered in the dictionary storage means Is A first determination step of determining by said first determining means for determining words that are words or pre-registered, if it is determined that the word is automatically registered by the first determination step, is stored in the dictionary storage unit A second determination step in which the second determination means determines whether or not the frequency of the word has reached a predetermined value, and when it is determined in the first determination step that it is registered in the dictionary in advance, or When it is determined by the second determination step that the predetermined value has been reached, the learning unit performs word learning for storing the conversion history in the history storage unit in order to reflect the conversion result in the next conversion. and learning step, for the word, which is the automatic registration stored in the dictionary storage means when it is determined not to have reached the predetermined value by the second determination step The frequency is increased by only 1 wherein adjusting means, by said learning step of 1 only the adjusting means for the word that has been automatically registered frequencies stored in the dictionary storage unit having the same reading as the words and word learning after completion And an adjusting step for decreasing.
[0012]
The invention according to claim 2 is characterized in that the dictionary storage means has an area for storing automatically registered words and a kanji conversion normal dictionary area.
[0013]
A third aspect of the present invention includes a dictionary storage means for storing in association with each title and frequency and word reading, and input means for inputting a word of readings, with reference to the dictionary document storage means When the word input from the input unit is preferentially used as a conversion candidate with high frequency notation, and a notation of a word not registered in the dictionary storage unit is created as a conversion result In addition, in the character processing method for a document processing apparatus capable of automatically registering the word reading and notation in the conversion result in the dictionary storage means , the document processing apparatus includes a determination means, a first adjustment means, and a second adjustment means. The method further comprises adjusting means, and when the word input from the input means is converted into notation, the determination is made as to whether the word to be converted is a word automatically registered in the dictionary storage means or a word registered in advance. hand A first step of increasing the frequency stored in the dictionary storage means by 1 by the first adjusting means for the word to be converted when it is determined that the word is automatically registered by the determining step. When it is determined that the word is registered in the dictionary in advance in the adjustment step, the same word as the word is learned by storing the conversion history in the history storage means in order to reflect the conversion result in the next conversion. And a second adjustment step for reducing the frequency stored in the dictionary storage means by 1 by the second adjustment means for the automatically registered words having the following meaning.
[0014]
The invention of claim 4 is characterized in that the dictionary storage means has an area for storing automatically registered words and a kanji conversion normal dictionary area.
[0015]
DETAILED DESCRIPTION OF THE INVENTION
Hereinafter, an embodiment of the present invention will be described with reference to the drawings.
[0016]
FIG. 1 shows a system configuration of a document processing apparatus to which the present invention is applied. In FIG. 1, a character string input from an input device (such as a keyboard) 1 is processed by a central processing unit (such as a CPU) 2 and output to an output device (such as a CRT) 3. The conversion history (word learning or the like) is stored in the storage device (hard disk or the like) 4 and the change history is reflected in the conversion result at the time of the next analysis. The storage device 4 is provided with a kana-kanji conversion normal dictionary 4A and an automatically registered word storage area 4B constituting a dictionary.
[0017]
FIG. 2 shows a processing procedure related to the character processing of the CPU 2.
[0018]
The CPU 2 processes the character string input from the input device 1 by Kana-Kanji conversion processing and outputs conversion candidates (steps s101 → s102). The CPU 2 determines whether or not the conversion candidate is desired based on the presence / absence of an instruction to determine the conversion candidate (step s103). If it is the desired conversion result, it is determined whether or not the word of the conversion result is automatically registered (step s107). If it is not the desired homophone, the conversion candidate that has not been subjected to the candidate correction process is corrected by the user's instruction and finally confirmed. It is determined whether or not the conversion candidate after correction exists in the dictionary (step s105). If the word does not exist, a new automatic registration process (step s106) is performed.
[0019]
If the confirmed candidate is a homophone registered in the dictionary, it is determined whether or not the conversion result word is automatically registered (step s107). If it is not automatic registration, word learning processing (step s109) is performed. When the converted word is automatically registered, the frequency of the automatically registered word is checked. If the frequency is sufficiently high, that is, if the predetermined number of times of use is exceeded, word learning processing (step s109) is performed. Further, in the case of an automatically registered word, an automatic registered word frequency adjustment process is performed (step s10).
[0020]
Specific processing contents when the processing procedure of FIG. 2 is executed will be described with reference to FIGS.
[0021]
In FIG. 3, when the user inputs “deta-sho”, the document processing apparatus performs kana-kanji conversion based on the dictionary 4A to obtain a conversion candidate “data certificate”. When this conversion candidate is not the conversion result desired by the user, the user operates the keyboard to convert it to the desired conversion candidate [data show].
[0022]
As shown in FIG. 4, when the selected [Show] is a word that does not exist in the dictionary A4, the document processing apparatus performs automatic registration processing.
[0023]
When the user later inputs [deta sho] again, the document processing apparatus performs kana-kanji conversion based on the dictionary 4A to obtain a conversion candidate [data certificate]. Since it is not the desired conversion result for the user, the user converts it into a desired conversion candidate [data show] (see FIG. 5).
[0024]
Since [Show] selected by candidate selection is an automatically registered word, the document processing apparatus looks at the frequency of the automatically registered word, and if the frequency is sufficiently high (in the present embodiment, the determination value is set to 4 or more). ) Perform word learning processing. In this case, since the frequency is 0, the word learning process is not performed, and the frequency of the automatically registered word is increased from 0 to 1 by the automatic registration frequency adjustment process (see FIG. 6).
[0025]
Every time the user selects / determines this word, the frequency of [Show] increases, and when it becomes 4 or more, [Show] is output as the first conversion result as shown in FIG. .
[0026]
As shown in FIG. 8, when the user inputs [Daijinsho], performs Kana-Kanji conversion and obtains a conversion candidate called [Minister Show], the user desires when the conversion candidate is not the conversion result desired by the user. Change the conversion result [Minister Award] by selecting a candidate. In addition, [word] is learned as follows. That is, since the user changed and confirmed the automatic registration [show] that became the first candidate to another candidate, the document processing apparatus reduces the use frequency of [show] by adjusting the automatic registration frequency (see FIG. 9).
[0027]
Suppose that the user inputs [deta-sho] as shown in FIG. 10, performs kana-kanji conversion, and obtains a conversion candidate called [data award]. Since this conversion candidate is not the desired conversion candidate, the user corrects [prize] to [show]. Since [Show], which is automatic registration, has been selected, the document processing apparatus determines whether the frequency of this word is sufficiently high. Since this word has a frequency of 3 and is smaller than the judgment value 4, the word learning process is not performed, and the frequency is increased by automatic registration frequency adjustment (frequency 4, see FIG. 11).
[0028]
Here, as shown in FIG. 12, when the user inputs [Daijinjo] again, [Show] is not learned, and the word learning of [Prize] is effective, so the conversion result is converted to [Minister Award]. Is done. When the user again inputs [deta-sho] as shown in FIG. 13, the word learning is also applied, so that it is converted to [data award]. However, since this candidate is not a candidate desired by the user, the user converts [prize] to [show] by candidate selection processing. Since the [Show] selected at this time is an automatically registered word, the document processing apparatus checks whether the frequency is sufficiently high. As shown in FIG. 14, since the frequency is 4 and the determination value 4 is reached, the document processing apparatus treats this automatically registered word in the same way as an ordinary dictionary registered word, performs word learning processing, and further increases the frequency.
[0029]
(Other examples)
In the above-described embodiment, when the conversion candidate determined by the user is an automatically registered word, it is determined whether or not to learn the word according to the frequency. However, it is possible to perform word learning for words described in the dictionary 4A and not perform word learning for automatically registered words. The character processing procedure for this is shown in FIG.
[0030]
In FIG. 15, the same processes as those in FIG. Steps s200 to s210 in FIG. 15 show a procedure for processing for detecting automatically registered words and not learning words.
[0031]
According to this processing procedure, when the user inputs [deta-sho] and performs kana-kanji conversion, a conversion candidate [data certificate] is obtained (steps s101 → s102 in FIG. 15). Here, since it is not the conversion candidate desired by the user, the conversion candidate “proof” is corrected to “show” by operating the keyboard device. A desired conversion candidate “data show” is acquired, and the conversion candidate is instructed (see step s104 in FIG. 15 and FIG. 3).
[0032]
The document processing apparatus determines whether the confirmed conversion candidate “show” exists in the dictionary 4A or the automatic registration area 4B (step s105 in FIG. 15).
[0033]
When this “show” does not exist in the dictionary 4A and the automatic registration area 4B, the read “sho” and the notation “show” are registered as automatic registration words, and the frequency is set to 0 (step s106 in FIG. 15).
[0034]
When the user newly inputs “detasho”, the document processing apparatus executes kana-kanji conversion.
[0035]
Since the above-mentioned “show” has not been word-learned, “data certificate” is obtained as a kana-kanji conversion candidate as in the previous case (steps s101 → s102 in FIG. 15).
[0036]
The user corrects “certification” in the conversion candidates to “show” and determines “data show” (step s104 in FIG. 15). In the document processing apparatus, the determination in step s105, that is, the determination whether the conversion candidate “show” exists in the dictionary is executed. Since “Show” is automatically registered at this time, a positive determination is obtained. Subsequently, since an affirmative determination is also obtained in the determination in step s200 in FIG. 15, the word learning in step S201 is not performed, and the frequency related to “show” is adjusted from “0” to “1” (step s202 in FIG. 15). .
[0037]
In this way, since the frequency of the automatically registered words increases every time the user uses the words, the display order is earlier than other words of the same sound. Moreover, since the word learning is not performed about the automatically registered word, the display order of the word learned in the normal kana-kanji conversion dictionary 4A is earlier.
[0038]
In this way, after using the automatically registered word, when the word described in the Kana-Kanji conversion dictionary 4A having the same sound as “Show” is selected by the user, the execution procedure of FIG. The route is followed, and after the word learning of the word, the frequency of the word “show” is decremented by 1 (step s202 in FIG. 15).
[0039]
【The invention's effect】
As described above, in the present invention, the automatic registration word is distinguished from the word previously described in the dictionary, and the automatic registration word that is not used much lowers the rank as a conversion candidate. This eliminates the negative effects of registering auto-registered words. Further, the conversion rate is improved, which can contribute to the improvement of the operability of the document processing apparatus.
[Brief description of the drawings]
FIG. 1 is a block diagram showing a system configuration of a document processing apparatus to which the present invention is applied.
FIG. 2 is a flowchart showing a processing procedure of the document processing apparatus.
FIG. 3 is an explanatory diagram showing processing contents of the document processing apparatus.
FIG. 4 is an explanatory diagram showing processing contents of the document processing apparatus.
FIG. 5 is an explanatory diagram showing processing contents of the document processing apparatus.
FIG. 6 is an explanatory diagram showing processing contents of the document processing apparatus.
FIG. 7 is an explanatory diagram showing processing contents of the document processing apparatus.
FIG. 8 is an explanatory diagram showing processing contents of the document processing apparatus.
FIG. 9 is an explanatory diagram showing processing contents of the document processing apparatus.
FIG. 10 is an explanatory diagram illustrating processing contents of the document processing apparatus.
FIG. 11 is an explanatory diagram illustrating processing contents of the document processing apparatus.
FIG. 12 is an explanatory diagram illustrating processing contents of the document processing apparatus.
FIG. 13 is an explanatory diagram showing processing contents of the document processing apparatus.
FIG. 14 is an explanatory diagram showing processing contents of the document processing apparatus.
FIG. 15 is a flowchart illustrating a processing procedure according to another embodiment.
FIG. 16 is a flowchart showing a conventional processing procedure.
FIG. 17 is an explanatory diagram showing conventional processing contents;
FIG. 18 is an explanatory diagram showing conventional processing content.
FIG. 19 is an explanatory diagram showing conventional processing content.
[Explanation of symbols]
1 Input device 2 CPU
3 Output device 4 Storage device 4A Kana-Kanji conversion dictionary 4B Automatic registration word area

Claims

Includes a dictionary storage means for storing in association with reading and notation and word frequency, and input means for inputting a word reading,
With reference to the dictionary document storage means, as well as with priority frequent representation against reading of the input word conversion candidates from said input means, notation unregistered word to the dictionary storage means for reading In a character processing method for a document processing apparatus capable of automatically registering the word reading and notation in the conversion result in the dictionary storage means when created as a conversion result,
The document processing apparatus further includes first determination means, second determination means, learning means, and adjustment means,
When the word input from the input unit is converted from reading to notation, the first determination unit determines whether the conversion target word is a word automatically registered in the dictionary storage unit or a word registered in advance . 1 determination step;
The second determination means determines whether the frequency of the word stored in the dictionary storage means has reached a predetermined value when it is determined that the word is automatically registered in the first determination step . 2 determination steps;
When it is determined in the first determination step that it is registered in the dictionary in advance, or when it is determined that the predetermined value is reached in the second determination step, the conversion result is used for the next conversion. A learning step in which the learning means performs word learning for storing the conversion history in the history storage means for reflection ;
When the second determination step determines that the predetermined value has not been reached, the adjustment unit increases the frequency stored in the dictionary storage unit by 1 for the automatically registered word , and the learning step A character processing comprising: an adjusting step for reducing the frequency stored in the dictionary storage means by 1 for the automatically registered words having the same reading as the word learned after completion by the adjusting means. Method.

2. The character processing method according to claim 1, wherein the dictionary storage means includes an area for storing automatically registered words and a kanji conversion normal dictionary area.

Includes a dictionary storage means for storing in association with reading and notation and word frequency, and input means for inputting a word reading,
With reference to the dictionary document storage means, as well as with priority frequent representation against reading of the input word conversion candidates from said input means, notation unregistered word to the dictionary storage means for reading In a character processing method for a document processing apparatus capable of automatically registering the word reading and notation in the conversion result in the dictionary storage means when created as a conversion result,
The document processing apparatus further includes a determination unit, a first adjustment unit, and a second adjustment unit,
A determination step of determining, by the determination means, whether the word to be converted is a word automatically registered in the dictionary storage means or a word registered in advance when the word input from the input means is converted into a notation; ,
A first adjustment step for increasing the frequency stored in the dictionary storage means by 1 by the first adjustment means when the word is automatically registered by the determination step ;
If it is determined that the word is registered in the dictionary in advance, the word learning for storing the conversion history in the history storage means is performed to reflect the conversion result in the next conversion, and the word has the same reading as the word. character processing method characterized in that comprises a second adjusting step of decreasing the above for words that are automatically registered by one the frequency stored in said dictionary storage means a second adjusting means.

4. The character processing method according to claim 3, wherein the dictionary storage means has an area for storing automatically registered words and a kanji conversion normal dictionary area.