JP3566873B2

JP3566873B2 - Kana-kanji conversion apparatus and method, and storage medium storing kana-kanji conversion program

Info

Publication number: JP3566873B2
Application number: JP03593999A
Authority: JP
Inventors: 広勝秋山
Original assignee: Sharp Corp
Current assignee: Sharp Corp
Priority date: 1999-02-15
Filing date: 1999-02-15
Publication date: 2004-09-15
Anticipated expiration: 2019-02-15
Also published as: JP2000235570A

Description

【０００１】
【発明の属する技術分野】
本発明は、日本語ワードプロセッサ、日本語文書処理機能を有するパーソナルコンピュータ等に適用され、複数文節からなる読み文字列を一度に漢字かな交じり列に変換する仮名漢字変換装置及び方法並びに仮名漢字変換プログラムを記憶した媒体に関する。
【０００２】
【従来の技術】
従来、日本語ワードプロセッサ等に用いられる仮名文字列を漢字かな交じり列に変換する仮名漢字変換装置において、特に複数文節からなる仮名文字列を一度に漢字かな交じり列に変換する連文節仮名漢字変換処理では、文節の区切り位置を誤って変換した場合、毎回、同じ文節の区切り位置を修正して変換する必要があった。
【０００３】
この問題を改善する従来技術として、特開平２−３５５６４号公報の記載によれば、文節区切りを修正する必要のある文字列を仮名漢字変換する場合、一度修正すれば、それ以降は修正を不要とし、変換結果が直前に修正した通りになるように文節区切りを学習する日本語入力装置が提案されている。
この従来技術による日本語入力装置は、複数文節からなる読み文字列の変換において、その文節区切りが誤って変換した際に、文節区切り位置の移動操作を行い、文節区切り位置を修正して変換した漢字仮名交じり文字列を暫定辞書に登録して文節区切り位置を学習するよう構成されている。
【０００４】
【発明が解決しようとする課題】
しかしながら、上述の従来技術による日本語入力装置は、文節単位で仮名漢字変換を行った場合には、文節区切りは学習ができないため、読み文字列を文節単位で分けて入力し変換した場合には正しく文節区切りができるが、その直後に、複数文節の読み文字列を一度に入力して仮名漢字変換した場合は、正しく文節区切りができないという問題がある。また、変換候補の正しい漢字仮名交じり文字列のみを残し、誤って変換された部分を削除して、消した部分の読み文字列を再度入力して変換するような操作を行う場合は、文節区切りの学習ができず、複数文節の読み文字列を一度に入力して変換した場合は、正しく文節区切りができないという問題がある。
【０００５】
本発明は以上の事情を考慮してなされたものであり、複数文節からなる読み文字列を文節単位に分けて入力し正しく変換すれば、正しい文節区切り位置を学習して次回から入力される複数文節の読み文字列を正しく仮名漢字変換する仮名漢字変換装置及び方法並びに仮名漢字変換プログラムを記憶した記憶媒体を提供する。
【０００６】
【課題を解決するための手段】
本発明は、連文節変換結果から文節区切り情報を格納する文節区切り学習辞書と、既に仮名漢字変換された読み文字列を格納する前文節読み文字列格納部と、前文節読み文字列格納部に格納された読み文字列に続けて入力される読み文字列を格納する後文節読み文字列格納部と、後文節読み文字列格納部の読み文字列を単文節変換または文節区切りのために前記文節区切り学習辞書を参照して連文節変換して仮名漢字変換結果としての漢字かな交じり列を得、さらに前文節読み文字列格納部と後文節読み文字列格納部に格納された読み文字列を合成してなる合成読み文字列を連文節変換する変換部と、前記合成読み文字列の変換結果が２文節であり、かつ各文節の読み文字列が前文節読み文字列格納部と後文節読み文字列格納部とに格納された読み文字列とそれぞれ同じか否かを判断する判断部と、判断部が同じでないと判断した場合、前文節読み文字列格納部と後文節読み文字列格納部とに格納された読み文字列を単位とした区切り情報を前記合成読み文字列の文節区切り情報として前記文節区切り学習辞書に格納する文節区切り学習制御手段とを備えることを特徴とする仮名漢字変換装置である。
【０００７】
本発明において、入力部は、例えば、キーボード、ペン・タブレット、マウスからなる入力装置で構成してもよい。
前文節読み文字列格納部、後文節読み文字列格納部、前文節漢字かな交じり列格納部、後文節漢字かな交じり列格納部は、例えば、ＲＡＭ、ＥＥＰＲＯＭ、フロッピーディスクなどからなる記憶装置で構成してもよい。
変換部、格納制御部、判断部、学習部は、例えば、コンピュータのＣＰＵと制御プログラムで構成してもよい。
【０００８】
本発明によれば、複数文節からなる読み文字列を文節単位に分けて入力し正しく変換すれば、正しい文節区切り位置を学習して次回から入力される複数文節の読み文字列を正しく仮名漢字変換することができる。
【０００９】
【発明の実施の形態】
本実施例では、例えば、２文節の読み文字列「じしゅはいぎょう」の前文節である読み文字列「じしゅ」を入力し、変換して変換候補「自主」を得て、続けて後文節である読み文字列「はいぎょう」を入力し、変換して変換候補「廃業」を得る。２文節の読み文字列を文節単位で入力して変換しその区切り位置に対応する読み文字列及び変換候補を内部記憶しておく。
よって、予め記憶された２文節の読み文字列「じしゅはいぎょう」が入力された場合、誤った区切り位置「じしゅは／いぎょう」による変換候補「自主は／偉業」を得ることなく、正しい区切り位置「じしゅ／はいぎょう」を学習して、正しい変換候補「自主／廃業」を得ることができる。便宜上、「／」は文節の区切り位置を表わすものとする。
【００１０】
本発明の別の観点によれば、単文節または複数文節ごとに入力される読み文字列を単文節変換または連文節変換を行って仮名漢字変換する方法であって、既に仮名漢字変換された読み文字列を前文節読み文字列格納部に格納し、前文節読み文字列格納部に格納された読み文字列に続けて入力される読み文字列を後文節読み文字列格納部に格納し、後文節読み文字列格納部の読み文字列を単文節変換または文節区切りのために前記文節区切り学習辞書を参照して連文節変換して仮名漢字変換結果としての漢字かな交じり列を得、前文節読み文字列格納部と後文節読み文字列格納部に格納された読み文字列とを合成してなる合成読み文字列を連文節変換し、前記合成読み文字列の変換結果が２文節であり、かつ各文節の読み文字列が前文節読み文字列格納部と後文節読み文字列格納部とに格納された読み文字列とそれぞれ同じか否かを判断し、同じでないと判断した場合、前文節読み文字列格納部と後文節読み文字列格納部とに格納された読み文字列を単位とした区切り情報を前記合成読み文字列の文節区切り情報として前記文節区切り学習辞書に格納する仮名漢字変換方法が提供される。
【００１１】
別の実施例では、２文節の読み文字列「じしゅはいぎょう」を一度に変換し、変換候補「自主は／偉業」を得た場合、漢字かな交じり列「は／偉業」の部分を削除して、改めて読み文字列「はいぎょう」を入力して変換して変換候補「廃業」を得る。その際、初めの変換候補「自主は／偉業」の各文字に対して読み文字列「じしゅはいぎょう」のどれに対応しているかを記憶することで、消去せずに残っている文字が何であるかを判断する。
【００１２】
この場合は、読み文字列「じしゅ」、漢字かな交じり列「自主」である。その残った文節を１文節目とし、それに続いて入力された読み文字列を２文節目と考える。読み文字列「はいぎょう」、漢字かな交じり列「廃業」である。１文節目と２文節目の読み文字列を接続した区切り位置に対応する読み文字列及び変換候補を内部記憶しておく。
よって、予め記憶された２文節の読み文字列「じしゅはいぎょう」が入力された場合、誤った区切り位置「じしゅは／いぎょう」による変換候補「自主は／偉業」を得ることなく、正しい区切り位置「じしゅ／はいぎょう」を学習して、正しい変換候補「自主／廃業」を得ることができる。
【００１３】
変換部により変換された漢字かな交じり列を部分削除する編集部をさらに備え、前記格納制御部は、編集部による部分削除後に残った漢字かな交じり列を前文節漢字かな交じり列格納部に格納し、それに対応する読み文字列を前文節読み文字列格納部に格納し、削除に続いて入力された読み文字列を後文節読み文字列格納部に格納し、前記読み文字列から変換した漢字かな交じり列を後文節漢字かな交じり列格納部に格納し、前記判断部は、同じ複数文節の読み文字列が入力され漢字かな交じり列に変換される際の区切り位置が、前記各格納部に格納された読み文字列の区切り位置と同じか否かを判断し、判断部から各区切り位置が異なる判断結果を受けた際、前記学習部は、前記読み文字列の区切り位置を学習するよう構成してもよい。
この構成によれば、一旦入力変換した文字列の一部分を削除した場合も、削除されずに残っている文字列の文節の区切り位置を学習して、正しい仮名漢字変換をすることができる。
【００１４】
読み文字列の各文字と漢字かな交じり列の各文字との対応関係を格納する読み漢字かな交じり列対応テーブルをさらに備え、前記判断部は、変換部により前文節読み文字列と後文節読み文字列を接続した読み文字列を変換した漢字かな交じり列と、前文節漢字かな交じり列と後文節漢字かな交じり列を接続した漢字かな交じり列とが同じか否かを読み漢字かな交じり列対応テーブルから判断し、前記学習部は、判断部から異なる判断結果を受けた際、前文節読み文字列と後文節読み文字列を接続した読み文字列と、前文節漢字かな交じり列と後文節漢字かな交じり列を接続した漢字かな交じり列との対応関係を学習するよう構成してもよい。
この構成によれば、前文節漢字かな交じり列と後文節漢字かな交じり列を接続した漢字かな交じり列との対応関係から文節単位で入力された区切り位置を学習することができる。
【００１５】
前記入力部の指示によりカーソルを移動して文字列を確定する確定部をさらに備え、前記確定部により前文節の読み文字列を確定した後にカーソルを移動して別の文節の文字列に移動した際、前記格納制御部は、前文節読み文字列格納部と前文節漢字かな交じり列格納部とにそれぞれ格納した全文字列を削除する構成にしてもよい。
この構成によれば、利用者の仮名漢字変換におけるカーソル移動によって生じる、連続しない文節間の区切り学習を行わずにすみ、誤った学習を防ぐことができる。
【００１６】
前文節読み文字列格納部と前文節漢字かな交じり列格納部に文字列が格納されていないときは、前記変換部は、カーソルの直前にある確定済の漢字かな交じり列を読み文字列に変換し、前記格納制御部は、前文節読み文字列格納部に格納し、漢字かな交じり列を前文節漢字かな交じり列格納部に格納するよう構成してもよい。
この構成によれば、利用者の変換処理操作の負担を軽減することができる。
【００１７】
以下、図に示す実施例に基づいて本発明を詳述する。なお、本発明はこれによって限定されることはない。
【００１８】
図１は本実施例の仮名漢字変換装置の構成を示すブロック図である。図１において、１はバス２を介して３〜１８の各部を制御プログラムに基づいて制御する制御部を示し、制御部１は、例えば、ＣＰＵ、ＲＯＭ、ＲＡＭ、Ｉ／Ｏポートからなるコンピュータで構成される。また、制御部１は、格納制御部、判断部、学習部、編集部、確定部としても機能する。
３は文字コード、変換、削除指示などの編集コードを入力する入力部を示し、入力部３は、例えば、キーボード、ペン／タブレット、マウスからなる入力装置で構成される。
４は入力部３で入力された読み仮名を仮名漢字変換する仮名漢字変換部を示し、例えば、コンピュータのＣＰＵと仮名漢字変換プログラムを記憶したＲＯＭ、ＥＥＰＲＯＭで構成される。
【００１９】
５は仮名漢字変換辞書を示し、仮名漢字変換辞書５は、例えば、仮名漢字変換をするための仮名漢字変換データを記憶したＲＯＭ、ＥＥＰＲＯＭ、フロッピーディスク（ＦＤ）、ハードディスク（ＨＤ）からなる記憶装置で構成される。
６は文節区切り学習辞書を示し、文節区切り学習辞書６は、例えば、複数文節の区切り位置を学習する学習データを記憶するＲＡＭ、ＥＥＰＲＯＭで構成される。
【００２０】
７は入力された読み文字列や変換された文字列などを表示データとして格納する表示バッファ部を示し、例えば、ＲＡＭ、ＥＥＰＲＯＭで構成される。
８は入力された読み文字列や変換された文字列などを表示する表示部を示し、表示部８は、例えば、ＣＲＴディスプレイ、液晶ディスプレイ（ＬＣＤ）、プラズマディスプレイ（ＰＤ）などかなる表示装置で構成される。
９はカーソルポインタにより文字の挿入や削除などの位置を指示する指示部を示し、指示部９は、例えば、キーボード、ペン／タブレット、マウスからなる入力装置で構成される。
１０は表示部８に画面に表示された文字列を印刷する印刷部を示し、印刷部１０は、例えば、サーマルプリンタ、インクジェットプリンタなどで構成される。
【００２１】
１１は確定された複数文節のうち、前文節の読み文字列を格納する前文節読み文字列格納部を示し、例えば、ＲＡＭ、ＥＥＰＲＯＭで構成される。
１２は現在入力している読み文字列を格納する後文節読み文字列格納部を示し、例えば、ＲＡＭ、ＥＥＰＲＯＭで構成される。
１３は変換された前文節の漢字かな交じり列を格納する前文節漢字かな交じり列格納部を示し、例えば、ＲＡＭ、ＥＥＰＲＯＭで構成される。
１４は変換された後文節の漢字かな交じり列を格納する後文節漢字仮名交じり列格納部を示し、例えば、ＲＡＭ、ＥＥＰＲＯＭで構成される。
【００２２】
１５は直前に入力された前文節の読み文字列と現在入力している読み文字列を合成した読み文字列を格納する合成読み文字列格納部を示し、例えば、ＲＡＭ、ＥＥＰＲＯＭで構成される。
１６は変換された前文節の漢字かな交じり列と現在変換している後文節の漢字かな交じり列を合成した漢字かな交じり列を格納する合成漢字かな交じり列格納部を示し、例えば、ＲＡＭ，ＥＥＰＲＯＭで構成される。
１７は入力されている読み文字列と変換結果の漢字かな交じり列とを位置情報に対応して格納する読み漢字かな交じり列対応テーブルを示し、例えば、ＲＡＭ、ＥＥＰＲＯＭで構成される。
【００２３】
１８は本発明の仮名漢字変換プログラムを記憶した記憶媒体を示し、記憶媒体１８は、例えば、コンピュータにより読み出し可能なＦＤ、ＨＤ、ＣＤ−ＲＯＭなどで構成される。
１９は記憶媒体１８から仮名漢字変換プログラムを読み出し、ＲＡＭ、ＥＥＰＲＯＭにインストールする記憶媒体読取部を示す。
【００２４】
図２は本実施例の記憶媒体の構成を示すブロック図である。図２の（Ａ）に示す領域には、仮名漢字変換辞書を参照して入力文字列を漢字仮名交じり文字列に変換するための仮名漢字変換プログラム２０１、文字の削除、修正、追加などを行う編集プログラム２０２、文節の区切り位置が同じかどうかを判断する文節区切り判断プログラム２０３、文節の区切り位置を学習する文節区切り学習プログラム２０４、文字列を比較して同じかどうかを判断する文字列比較プログラム２０５、文字列が同じでないときにその文字列を学習する文字列学習プログラム２０６の各コードがそれぞれ記憶されている。
【００２５】
図２の（Ｂ）に示す領域には、前文節読み文字列格納制御プログラム２０７、後文節読み文字列格納制御プログラム２０８、前文節漢字仮名交じり列格納制御プログラム２０９、後文節漢字仮名交じり列格納制御プログラム２１０、合成読み文字列格納制御プログラム２１１、合成漢字仮名交じり列格納制御プログラム２１２、文節区切り学習データ格納制御プログラム２１３、文字列学習データ格納制御プログラム２１４がそれぞれ記憶されている。
【００２６】
本発明の別の観点によれば、コンピュータ１に、入力部３により文節単位で入力された複数文節の読み文字列を漢字かな交じり列に順次変換させ、２文節のうち前文節の読み文字列を前文節読み文字列格納部１１に格納させ、前文節に続く後文節の読み文字列を後文節読み文字列格納部１２に格納させ、変換された前文節の漢字かな交じり列を前文節漢字かな交じり列格納部１３に格納させ、続いて変換された後文節の漢字かな交じり列を後文節漢字かな交じり列格納部１４に格納させ、同じ複数文節の読み文字列が入力され漢字かな交じり列に変換される際の区切り位置が、前記各格納部１１〜１３に格納された読み文字列の区切り位置と同じか否かを判断させ、判断結果、各区切り位置が異なる場合、前記読み文字列の区切り位置を学習させることを特徴とする仮名漢字変換プログラムを記憶した記録媒体１８を提供することができる。
【００２７】
従って、記録媒体１８から本発明の仮名漢字変換プログラム２０１〜２１４を読み出してＲＡＭ、ＥＥＰＲＯＭにインンストールすることにより、複数文節からなる読み文字列を文節単位に分けて入力し正しく変換すれば、正しい文節区切り位置を学習して次回から入力される複数文節の読み文字列を正しく仮名漢字変換する仮名漢字変換装置を実現することができる。
【００２８】
図３は本実施例の仮名漢字変換処理の手順（１）を示すフローチャートである。図４は本実施例の仮名漢字変換処理の手順（２）を示すフローチャートである。図５は本実施例の仮名漢字変換処理による各格納部の格納状態を示す図である。
本実施例として、例えば、２文節の読み文字列「じしゅはいぎょう」が、漢字かな交じり文字列「自主は偉業」に変換される場合、読み文字列「じしゅ」と「はいぎょう」を２文節に分けて入力変換し、漢字かな交じり文字列「自主」と「廃業」得て、次回からは、読み文字列「じしゅはいぎょう」の入力に対して正しい漢字かな交じり列「自主廃業」を得る。
【００２９】
図３のフローチャートにおいて、
ステップ３０１：入力部３からキーコードが入力される。
ステップ３０２：入力されたキーコードが文字コードかどうかを確認し、文字コードの場合はステップ３０３に進み、文字コードでないときは、ステップ３０６に進む。
ステップ３０３：後文節読み文字列格納部１２に格納する。
図５のａは、後文節読み文字列格納部１２に「じしゅ」（５−１）が入力されたことを示している。本実施例では、利用者が変換しようとする文節を便宜上、後文節として扱っている。
【００３０】
ステップ３０４：表示バッファ７に格納する。
ステップ３０５：表示部８に変換結果を表示する。
ステップ３０６：キーコードが変換キーかチェックする、変換キーでないときは、ステップ３０７に進み、変換のときは、ステップ３１４のかな漢字変換処理に進む。
ステップ３０７：キーコードが確定指示かどうかを調べる。確定指示でない場合はステップ３０１に戻る。確定指示のときは、ステップ３０８に進む。
【００３１】
図４において、
ステップ３１５：後文節読み文字列（５−１）を仮名漢字変換する。仮名漢字変換部４は、仮名漢字変換辞書５を参照しながら変換を行う。図５のｂは、読み文字列「じしゅ」を仮名漢字変換し、その結果が、後文節漢字かな交じり列格納部１４に「自主」（５−２）として格納されていることを示している。
ステップ３１６：変換結果が複数文節かどうかを調べ、ＹＥＳならステップ３１７に進み、ＮＯならステップ３１９に進む。
ステップ３１７：変換結果が文節区切り学習辞書６に登録されているか否かを調べ、ＹＥＳならステップ３１８に進み、ＮＯならステップ３１９に進む。
【００３２】
ステップ３１８：学習候補を表示バッファ７に格納する。
ステップ３１９：変換結果を表示バッファ７に格納する。次にステップ３０５に進み、表示部８に変換結果「自主」を表示する。
利用者が「自主」で確定指示を行った場合、引き続きステップ３０１に戻りステップ３０８まで進む。
【００３３】
ステップ３０８：前文節読み文字列格納部１１に前文節読み文字列があるか確認する。この場合格納されていないので、ステップ３１３に進む。
ステップ３１３：図５のｃに示すように、後文節読み文字列を前文節読み文字列（５−３）として前文節読み文字列格納部１１に複写し、後文節漢字かな交じり列を前文節漢字かな交じり列（５−４）として前文節漢字かな交じり列格納部１３に複写し、ステップ３０４、ステップ３０５に戻る。
【００３４】
続いて、図５のｅに示すように、読み文字列「はいぎょう」の入力と、仮名漢字変換を行い、漢字かな交じり列「廃業」を得て、確定する。
ステップ３０７：確定指示かどうかを調べる。
ステップ３０８：前文節読み文字列格納部１１に前文節読み文字列があるか確認する。この場合格納されているので、ステップ３０９に進む。
ステップ３０９：前文節読み文字列と後文節読み文字を合成し、図５のｆに示すように、合成読み文字列「じしゅはいぎょう」が合成読み文字列格納部１５に格納される。
ステップ３１０：合成読み文字列を仮名漢字変換し、学習しない状態では、図５のｇのように、変換結果「自主は／偉業」が合成漢字かな交じり列格納部１６に格納される。
【００３５】
ステップ３１１：変換結果が２文節で、区切り位置が前文節読み文字列と後文節読み文字列の間であるかを調べる。この場合、前文節読み文字列と後文節読み文字列の間は「じしゅは／いぎょう」の区切り位置となり、「じしゅ／はいぎょう」の区切り位置と同じでないので、ステップ３１２に進み、区切り位置を学習する必要がある。同じ場合は、文節区切りの学習を行わないのでステップ３１３に進む。
ステップ３１２：文節区切りの学習登録処理を行い、文節区切り学習辞書６に学習情報を格納し、ステップ３１３に進む。図５のｈは、ステップ３１３に進んだときの状態を示している。
【００３６】
図６は本実施例の文節区切り学習辞書の記憶内容を示す図である。図６の（ａ）に示すように、文節区切り学習登録処理は、読み文字列「じしゅはいぎょう」に対して漢字かな交じり列「自主廃業」が記憶されるように構成してもよい。この場合、読み文字列「じしゅはいぎょう」に対して漢字かな交じり列「自主廃業」を学習したことになる。
また、図６の（ｂ）に示すように、読み文字列「じしゅ」と「はいぎょう」のみを登録し漢字かな交じり列は登録しない構成でもよい。
【００３７】
図３及び図４のフローチャートにおいて、読み文字列「じしゅはいぎょう」で、漢字かな交じり列「自主廃業」の文節区切り位置を学習した後、利用者が一度に「じしゅはいぎょう」と入力変換するときの実施例を説明する。
ステップ３１５：読み文字列「じしゅはいぎょう」に対して変換結果候補として「自主は／偉業」や「自主／廃業」などが得られる。
ステップ３１６：変換結果が複数文節かどうかを調べ、ＹＥＳならステップ３１７に進み、ＮＯならステップ３１９に進む。
【００３８】
ステップ３１７：変換結果が文節区切り学習辞書に登録されているか否かを調べ、ＹＥＳならステップ３１８に進み、ＮＯならステップ３１９に進む。
この場合、ステップ３１２で、図６の（ａ）に示すように、文節区切り学習を行っており、「自主廃業」が学習候補となる。
ステップ３１８：学習候補「自主廃業」を表示バッファ７に格納する。次にステップ３０５に進み、表示部８に変換結果「自主廃業」を表示する。
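Steps 315 to 319 above can be sketched as follows. This is a minimal illustration, not the patented implementation: `choose_candidate`, the candidate list format, and keying the learning dictionary by the reading string (as in FIG. 6(a)) are all assumptions made for the example.

```python
from typing import Dict, List

def choose_candidate(reading: str,
                     candidates: List[List[str]],
                     learned: Dict[str, str]) -> str:
    """Pick the string to place in the display buffer.

    candidates: conversion results, best first, each a list of phrase surfaces.
    learned:    hypothetical learning dictionary, reading -> learned surface.
    """
    best = candidates[0]
    # Step 316: is the conversion result made of several phrases?
    # Step 317: is there a learned entry for it? (keyed by reading here, an assumption)
    if len(best) > 1 and reading in learned:
        return learned[reading]          # step 318: show the learned candidate
    return "".join(best)                 # step 319: show the plain conversion result

learned = {"じしゅはいぎょう": "自主廃業"}
candidates = [["自主は", "偉業"], ["自主", "廃業"]]
shown = choose_candidate("じしゅはいぎょう", candidates, learned)
```

With an empty learning dictionary the same call falls through to step 319 and returns the first candidate joined together.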
【００３９】
図７は本実施例の読み漢字かな交じり文字列対応テーブルの記憶内容（１）を示す図である。図８は本実施例の読み漢字かな交じり文字列対応テーブルの記憶内容（２）を示す図である。図９は本実施例の読み漢字かな交じり文字列対応テーブルの記憶内容（３）を示す図である。特に、図７〜図８は、１字の漢字に対して読みが何であるかを調べるための情報を格納した説明図である。
【００４０】
図７の（ａ）は、読み文字列「じしゅはいぎょう」と入力されたときの状態を示し、図中の数字は先頭から何文字目かを示している。
図７の（ｂ）は、仮名漢字変換結果の漢字かな交じり列「自主は偉業」を示す。
図７の（ｃ）は、漢字かな交じり列の１文字ずつが読み文字列の何文字目に対応しているかを示し、読み漢字かな交じり列対応テーブル１７に記憶される。
【００４１】
図１０は本実施例の仮名漢字変換辞書の記憶内容を示す図である。図１０に示す仮名漢字変換辞書の中で、漢字のコードは、ＪＩＳＸ０２０８のコードの２バイトを利用して格納されていて、１漢字に対する読み文字数は、ＪＩＳＸ０２０８コードで使用していない、２バイトで表される漢字のコードの１バイト目及び２バイト目のＭＳＢの１ビットを利用して格納している。
【００４２】
図１０の（ａ）に示すように、１漢字に対する読み文字数が１文字のときは、漢字のコードの１バイト目及び２バイト目のＭＳＢを０とする。１漢字に対する読み文字数が２文字のときは、漢字のコードの１バイト目のＭＳＢは０で２バイト目のＭＳＢが１である。以下、１漢字に対する読み文字数が５文字以上のときは、漢字のコードの１バイト目の及び２バイト目のＭＳＢが１となり、次の１バイトに５以上の数字を入れ、その数字が読み文字数とする。
【００４３】
図１０の（ｂ）は、仮名漢字変換辞書に登録された単語に関して、読み文字と漢字の表記を表しており、さらに、単語の各漢字の表示１文字に対する読みの文字数を表す説明図である。
【００４４】
図１０で、読み文字列「いぎょう」と漢字「偉業」の対応例を説明する。
漢字「偉」の漢字コードを調べると、１バイト目及び２バイト目のＭＳＢが０であるので、「偉」の読み文字は１文字であることが分かり、読み文字は「い」であることが分かる。続いて、漢字「業」の漢字コードを調べると１バイト目のＭＳＢが１で、２バイト目のＭＳＢが０であるので、読み文字数が３であることが分かる。従って、漢字「業」の読み文字は「ぎょう」であることが分かる。
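The flag scheme in paragraphs 【００４２】 and 【００４４】 can be sketched as a small decoder. It assumes only what the text states: MSB pair 0/0 means one reading character, 0/1 means two, 1/0 means three, and 1/1 means the count is held in the following byte. The text does not say how a four-character reading is encoded, so the sketch simply trusts the extension byte. The function name and the byte values in the usage lines are placeholders, not real JIS X 0208 codes.

```python
from typing import Tuple

def decode_reading_length(data: bytes, pos: int = 0) -> Tuple[bytes, int, int]:
    """Return (kanji code with MSBs cleared, reading length, next position)."""
    b1, b2 = data[pos], data[pos + 1]
    # Combine the two MSBs into a 2-bit flag: (MSB of byte 1, MSB of byte 2).
    flags = ((b1 >> 7) << 1) | (b2 >> 7)
    code = bytes([b1 & 0x7F, b2 & 0x7F])   # the 7-bit code bytes themselves
    if flags == 0b00:
        return code, 1, pos + 2
    if flags == 0b01:
        return code, 2, pos + 2
    if flags == 0b10:
        return code, 3, pos + 2
    # flags == 0b11: the next byte carries the reading length.
    return code, data[pos + 2], pos + 3

# Placeholder byte pairs (0x30, 0x4E), not actual kanji codes.
one = decode_reading_length(bytes([0x30, 0x4E]))
three = decode_reading_length(bytes([0xB0, 0x4E]))
long_reading = decode_reading_length(bytes([0xB0, 0xCE, 5]))
```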
【００４５】
以上のような仮名漢字変換辞書を用いることで、読み文字列「じしゅはいぎょう」を仮名漢字変換した結果「自主は偉業」となったとき、仮名漢字変換で仮名漢字変換辞書５を検索した際、各漢字に対して、読み文字が何であるかが分かる。
この例では、「自」が「じ」、「主」が「しゅ」、「は」が「は」、「偉」が「い」、「業」が「ぎょう」が対応し、図７の（ｃ）に示す読み漢字かな交じり列対応テーブルに格納される。
【００４６】
図７の（ｃ）に示す読み漢字かな交じり列対応テーブルは、漢字かな交じり列の１文字１文字に対して、読み文字列の何文字目を指しているかを示すテーブルである。
よって、漢字かな交じり列の「自」は漢字かな交じり列位置が０で、読み文字列が０で始まり、「主」は漢字かな交じり列位置が１で、読み文字列が１で始まり、「は」は漢字かな交じり列位置が２で、読み文字列が３で始まり、「偉」は漢字かな交じり列位置が３で、読み文字列が４で始まり、同じようにして「業」は漢字かな交じり列位置が４で、読み文字列が５で始まる。
読み漢字かな交じり列対応テーブルの最後は、漢字かな交じり列の最後の文字が終わる位置＋１を示すので、漢字かな交じり列は５となり、読み文字列は８となる。
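The table of FIG. 7(c) described above can be reproduced with a short sketch. It assumes the per-character readings are already known (in the text they come from the conversion dictionary); `build_table` and the pair format are hypothetical names chosen for the example.

```python
from typing import List, Tuple

def build_table(pairs: List[Tuple[str, str]]) -> List[Tuple[int, int]]:
    """Map each surface character position to the start of its reading.

    pairs: (surface character, its reading) in order.
    The final entry is the end position + 1 for both strings.
    """
    table = []
    reading_pos = 0
    for surface_pos, (_, reading) in enumerate(pairs):
        table.append((surface_pos, reading_pos))
        reading_pos += len(reading)
    table.append((len(pairs), reading_pos))   # terminating entry
    return table

pairs = [("自", "じ"), ("主", "しゅ"), ("は", "は"), ("偉", "い"), ("業", "ぎょう")]
table = build_table(pairs)
```

For 「自主は偉業」 this yields reading start positions 0, 1, 3, 4, 5 and the terminating entry (5, 8), matching the values given in paragraph 【００４６】.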
【００４７】
図１１は本実施例の仮名漢字変換処理の手順（３）を示すフローチャートである。図１１において、ステップ１１０１からステップ１１１４までの処理は、図３に示すステップ３０１からステップ３１４までの処理と同じであるので説明を省略する。
図１２は本実施例の仮名漢字変換処理の手順（４）を示すフローチャートである。なお、ステップ１１３０からステップ１１３７までの処理は、変換結果の部分削除処理により前文節読み文字列と後文節読み文字列間の区切り位置を決めるときのフローチャートを示す。
【００４８】
ステップ１１３０：入力されたキーコードが削除キーであるか否かを調べる。ＹＥＳならステップ１１３１の文字削除処理に進み、ＮＯならステップ１１０６に進む。
ステップ１１３２：削除指示の対象が、前文節の漢字かな交じり列の部分であるか否かを調べる。ＹＥＳならステップ１１３３に進み、ＮＯならステップ１１３６に進む。
【００４９】
ステップ１１３３：前文節を削除する場合は、前文節漢字かな交じり列の削除対象の文字列を削除する。削除する前は、図７の（ｂ）に示すように、「自主は偉業」と入力され、図８の（ｂ）に示すように、「業」を１文字削除した状態となる。
ステップ１１３４：図７の（ｃ）に示すように、漢字仮名交じり列で位置４始まり、位置５で終わる文字を削除し、それに対する読み文字列も位置５で始まり、
位置８で終わる読み文字列であることが分かり、その読み文字列「ぎょう」を削除し、図９の（ａ）の状態となる。
【００５０】
ここで、終わる文字の位置というのは、実際は、次に続く文字の開始位置を示しているので、実際の終わりの文字位置より１大きい数字となっている。
ステップ１１３５：読み漢字かな交じり列対応テーブルを更新し、図９の（ｃ）のような状態とし、ステップ１１３７に進み文字削除処理を終了する。
ステップ１１３６：前文節以外の文字列を削除してステップ１１３７の終了処理に進む。
【００５１】
以上の文字削除処理を行って、利用者が、読み文字列「じしゅはいぎょう」を変換して漢字かな文字列「自主廃業」を得ることを所望していたときに、仮名漢字変換結果が「自主は偉業」になり、利用者が「は偉業」を制御部１で削除したとき、漢字かな交じり列の３文字文を削除したので、読み漢字かな交じり列対応テーブルも、図７の（ｃ）に示す後文節の３文字分が削除されて、図９の（ｃ）のようになる。
【００５２】
よって、図９の（ｂ）に示すように、削除されずに残った漢字かな交じり列「自主」の読み文字列は、図９の（ａ）に示す「じしゅ」であることがわかる。
この「じしゅ」を前文節読み文字列とし、「自主」を前文節漢字かな交じり列とし、次に入力される読み文字列を後文節読み文字列として入力変換する。
【００５３】
従って、複数文節の読み文字列を一度に入力変換した場合、変換結果の正しい漢字かな交じり列のみを残し、誤っている変換部分を消して、その部分に対応する読み文字列を再度入力して変換しても、文節の区切り位置が学習ができる。
この実施例によれば、一旦入力変換した文字列の一部分を削除した場合も、削除されずに残っている文字列の文節の区切り位置を学習して、正しい仮名漢字変換をすることができる。
【００５４】
図１３は本実施例の仮名漢字変換処理の手順（５）を示すフローチャートである。図１３において、ステップ１３０１からステップ１３１４までの処理は、図３に示すステップ３０１からステップ３１４までの処理と同じであるので説明を省略する。
ステップ１３２０：利用者が入力したキーコードがカーソル移動指示か否かを調べる。ＹＥＳならステップ１３２１に進み、ＮＯならステップ１３０６に進む。
ステップ１３２１：前文節読み文字列格納部及び前文節漢字かな交じり列格納部に前文節読み文字列及び前文節漢字かな列が格納されているとき、これらをクリアする。
【００５５】
この実施例によれば、カーソルを移動して文字列を入力した際には、それ以前に入力していた文字列は棄却するので、関連のない文節同士の学習を行う誤りをせずに、正しい文節区切り学習が可能となり、利用者の変換処理操作の利便性を向上することができる。つまり、カーソル移動によって生じる、連続しない文節間の区切り学習を行わずにすみ、誤った学習を防ぐことができる。
【００５６】
図１４は本実施例の仮名漢字変換処理の手順（６）を示すフローチャートである。図１４において、ステップ１４０１からステップ１４１４までの処理は、図３に示すステップ３０１からステップ３１４までの処理と同じであるので説明を省略する。
ステップ１４２０：利用者が入力したキーコードがカーソル移動指示か否かを調べる。ＹＥＳならステップ１４２１に進み、ＮＯならステップ１４０６に進む。
ステップ１４２１：前文節読み文字列格納部及び前文節漢字かな交じり列格納部に前文節読み文字列及び前文節漢字かな列が格納されているとき、これらをクリアする。
【００５７】
ステップ１４２２：前文節読み文字列格納部に読み文字列が格納されていないときに、現在のカーソルの前に確定した文字列があるか否かを確認する。ＹＥＳならステップ１４２３に進み、ＮＯならステップ１４１３に進む。
ステップ１４２３：カーソルの前にある確定した漢字かな交じり列を読み文字列に戻す。読み文字列を前文節読み文字列格納部に格納し、仮名に戻された漢字仮名交じり列を前文節漢字交じり列格納部に格納し、ステップ１４０８に戻る。なお、読み文字列に戻す方法は、利用者が入力したときの読み文字列を記憶させておく方法でもよいし、公知の方法で仮名文字列に変換する方法を利用してもよい。
【００５８】
この実施例によれば、カーソルを移動して文字列を入力した際には、その移動した先のカーソルより前の文節を漢字かな交じり列を読み文字列として取り込むことができるので、利用者の変換処理操作の利便性を向上することができる。
また、カーソル移動を行っても前後の関係を正しく判断でき、正しく文節区切り学習が行える。
【００５９】
【発明の効果】
本発明によれば、複数文節からなる読み文字列を文節単位に分けて入力し正しく変換すれば、正しい文節区切り位置を学習して次回から入力される複数文節の読み文字列を正しく仮名漢字変換することができる。
【図面の簡単な説明】
【図１】本実施例の仮名漢字変換装置の構成を示すブロック図である。
【図２】本実施例の記憶媒体の構成を示すブロック図である。
【図３】本実施例の仮名漢字変換処理の手順（１）を示すフローチャートである。
【図４】本実施例の仮名漢字変換処理の手順（２）を示すフローチャートである。
【図５】本実施例の仮名漢字変換処理による各格納部の格納状態を示す図である。
【図６】本実施例の文節区切り学習辞書の記憶内容を示す図である。
【図７】本実施例の読み漢字かな交じり文字列対応テーブルの記憶内容（１）を示す図である。
【図８】本実施例の読み漢字かな交じり文字列対応テーブルの記憶内容（２）を示す図である。
【図９】本実施例の読み漢字かな交じり文字列対応テーブルの記憶内容（３）を示す図である。
【図１０】本実施例の仮名漢字変換辞書の記憶内容を示す図である。
【図１１】本実施例の仮名漢字変換処理の手順（３）を示すフローチャートである。
【図１２】本実施例の仮名漢字変換処理の手順（４）を示すフローチャートである。
【図１３】本実施例の仮名漢字変換処理の手順（５）を示すフローチャートである。
【図１４】本実施例の仮名漢字変換処理の手順（６）を示すフローチャートである。
【符号の説明】
１制御部
２バス
３入力部
４仮名漢字変換部
５仮名漢字変換辞書
６文節区切り辞書
７表示バッファ
８表示部
９指示部
１０印刷部
１１前文節読み文字列格納部
１２後文節読み文字列格納部
１３前文節漢字かな交じり列格納部
１４後文節漢字かな交じり列格納部
１５合成読み文字列格納部
１６合成漢字かな交じり列格納部
１７読み漢字かな交じり列対応テーブル
１８記憶媒体
１９記憶媒体読取部
[0001]
TECHNICAL FIELD OF THE INVENTION
The present invention relates to a kana-kanji conversion apparatus and method that are applied to Japanese word processors, personal computers with Japanese document-processing functions, and the like, and that convert a reading string consisting of multiple phrases (bunsetsu) into a mixed kanji-kana string in a single operation, and to a medium storing a kana-kanji conversion program.
[0002]
[Prior art]
Conventionally, in kana-kanji conversion apparatuses used in Japanese word processors and the like to convert a kana string into a mixed kanji-kana string, and in particular in multi-phrase conversion, which converts a kana string consisting of multiple phrases into a mixed kanji-kana string in a single operation, a string converted with a wrong phrase-break position had to be corrected at the same break position every time it was converted.
[0003]
As prior art addressing this problem, Japanese Unexamined Patent Publication No. H2-35564 proposes a Japanese input device that learns phrase breaks: when a string whose phrase break needs correcting is converted, a single correction suffices, and later conversions reproduce the most recently corrected result without further correction.
This prior-art Japanese input device is configured so that, when a multi-phrase reading string is converted with a wrong phrase break, the user moves the phrase-break position, and the mixed kanji-kana string converted with the corrected break is registered in a provisional dictionary, whereby the break position is learned.
[0004]
[Problems to be solved by the invention]
However, the prior-art Japanese input device described above cannot learn phrase breaks when kana-kanji conversion is performed one phrase at a time. If the user enters and converts the reading string phrase by phrase, the phrases are delimited correctly, but if the multi-phrase reading string is entered and converted in one operation immediately afterwards, the phrases are not delimited correctly. Likewise, if the user keeps only the correct part of a conversion candidate, deletes the incorrectly converted part, and re-enters and converts the reading of the deleted part, the phrase break is not learned, so a later one-shot conversion of the multi-phrase reading string again fails to delimit the phrases correctly.
[0005]
The present invention has been made in view of the above circumstances. It provides a kana-kanji conversion apparatus and method, and a storage medium storing a kana-kanji conversion program, that learn the correct phrase-break position once a multi-phrase reading string has been entered phrase by phrase and converted correctly, so that the multi-phrase reading string is converted correctly when it is entered from the next time on.
[0006]
[Means for Solving the Problems]
The present invention is a kana-kanji conversion apparatus comprising: a phrase-break learning dictionary that stores phrase-break information derived from multi-phrase conversion results; a preceding-phrase reading string storage unit that stores a reading string that has already been converted; a following-phrase reading string storage unit that stores a reading string entered after the reading string stored in the preceding-phrase reading string storage unit; a conversion unit that converts the reading string in the following-phrase reading string storage unit, either by single-phrase conversion or by multi-phrase conversion that refers to the phrase-break learning dictionary to determine phrase breaks, to obtain a mixed kanji-kana string as the conversion result, and that further performs multi-phrase conversion on a combined reading string formed by concatenating the reading strings stored in the preceding-phrase and following-phrase reading string storage units; a determination unit that determines whether the conversion result of the combined reading string consists of two phrases whose reading strings are identical, respectively, to the reading strings stored in the preceding-phrase and following-phrase reading string storage units; and phrase-break learning control means that, when the determination unit determines that they are not identical, stores in the phrase-break learning dictionary, as the phrase-break information for the combined reading string, break information whose units are the reading strings stored in the preceding-phrase and following-phrase reading string storage units.
[0007]
In the present invention, the input unit may be an input device such as a keyboard, a pen tablet, or a mouse.
The preceding-phrase reading string storage unit, the following-phrase reading string storage unit, the preceding-phrase mixed kanji-kana string storage unit, and the following-phrase mixed kanji-kana string storage unit may each be a storage device such as a RAM, an EEPROM, or a floppy disk.
The conversion unit, the storage control unit, the determination unit, and the learning unit may be configured by, for example, a computer CPU and a control program.
[0008]
According to the present invention, once a multi-phrase reading string has been entered phrase by phrase and converted correctly, the correct phrase-break position is learned, and the multi-phrase reading string is converted correctly when it is entered from the next time on.
[0009]
BEST MODE FOR CARRYING OUT THE INVENTION
In this embodiment, for example, the user enters the reading string 「じしゅ」 (jishu), the preceding phrase of the two-phrase reading string 「じしゅはいぎょう」 (jishu haigyō), and converts it to obtain the conversion candidate 「自主」 ("voluntary"). The user then enters the reading string 「はいぎょう」 (haigyō), the following phrase, and converts it to obtain the conversion candidate 「廃業」 ("business closure"). The two-phrase reading string is thus entered and converted phrase by phrase, and the reading strings and conversion candidates corresponding to the break position are stored internally.
Consequently, when the stored two-phrase reading string 「じしゅはいぎょう」 is entered, the apparatus does not produce the candidate 「自主は／偉業」 ("voluntary [topic particle] / great feat"), which results from the wrong break position 「じしゅは／いぎょう」. Having learned the correct break position 「じしゅ／はいぎょう」, it produces the correct candidate 「自主／廃業」. For convenience, 「／」 denotes a phrase-break position.
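The learning step in this example can be sketched as below. This is a minimal illustration under stated assumptions: `naive_convert` is a toy stand-in for the conversion unit that reproduces the wrong split from the text, and `learn_break` and the dictionary layout are hypothetical names, not the patent's own.

```python
from typing import Callable, Dict, List, Tuple

Phrases = List[Tuple[str, str]]   # (reading, surface) per phrase

def naive_convert(reading: str) -> Phrases:
    """Toy converter: fixed answers, including the wrong split for the full string."""
    table = {
        "じしゅ": [("じしゅ", "自主")],
        "はいぎょう": [("はいぎょう", "廃業")],
        "じしゅはいぎょう": [("じしゅは", "自主は"), ("いぎょう", "偉業")],
    }
    return table[reading]

def learn_break(prev_reading: str, cur_reading: str,
                convert: Callable[[str], Phrases],
                learned: Dict[str, Tuple[str, str]]) -> bool:
    """Convert the combined reading and store the user's split if it differs."""
    combined = prev_reading + cur_reading
    readings = [r for r, _ in convert(combined)]
    # Learn unless the result is exactly two phrases matching the entered ones.
    if readings != [prev_reading, cur_reading]:
        learned[combined] = (prev_reading, cur_reading)
        return True
    return False

learned: Dict[str, Tuple[str, str]] = {}
changed = learn_break("じしゅ", "はいぎょう", naive_convert, learned)
```

After the call, `learned` maps 「じしゅはいぎょう」 to the split (「じしゅ」, 「はいぎょう」), which a later one-shot conversion could consult.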
[0010]
According to another aspect of the present invention, there is provided a kana-kanji conversion method that converts reading strings entered one phrase or several phrases at a time by single-phrase conversion or multi-phrase conversion, the method comprising: storing a reading string that has already been converted in a preceding-phrase reading string storage unit; storing a reading string entered after the reading string stored in the preceding-phrase reading string storage unit in a following-phrase reading string storage unit; converting the reading string in the following-phrase reading string storage unit, either by single-phrase conversion or by multi-phrase conversion that refers to the phrase-break learning dictionary to determine phrase breaks, to obtain a mixed kanji-kana string as the conversion result; performing multi-phrase conversion on a combined reading string formed by concatenating the reading strings stored in the preceding-phrase and following-phrase reading string storage units; determining whether the conversion result of the combined reading string consists of two phrases whose reading strings are identical, respectively, to the reading strings stored in the preceding-phrase and following-phrase reading string storage units; and, when they are determined not to be identical, storing in the phrase-break learning dictionary, as the phrase-break information for the combined reading string, break information whose units are the reading strings stored in the preceding-phrase and following-phrase reading string storage units.
[0011]
In another embodiment, when the two-phrase reading string 「じしゅはいぎょう」 is converted in one operation and the conversion candidate 「自主は／偉業」 is obtained, the user deletes the 「は／偉業」 part of the mixed kanji-kana string, then enters the reading string 「はいぎょう」 again and converts it to obtain the conversion candidate 「廃業」. At that point, the apparatus determines which characters remain undeleted by having recorded, for each character of the first conversion candidate 「自主は／偉業」, which part of the reading string 「じしゅはいぎょう」 it corresponds to.
[0012]
In this case, what remains is the reading string 「じしゅ」 with the mixed kanji-kana string 「自主」. This remaining phrase is treated as the first phrase, and the reading string entered after it is treated as the second phrase, namely the reading string 「はいぎょう」 with the mixed kanji-kana string 「廃業」. The reading strings and conversion candidates corresponding to the break position at which the first-phrase and second-phrase reading strings are joined are stored internally.
Consequently, when the stored two-phrase reading string 「じしゅはいぎょう」 is entered, the apparatus does not produce the candidate 「自主は／偉業」, which results from the wrong break position 「じしゅは／いぎょう」. Having learned the correct break position 「じしゅ／はいぎょう」, it produces the correct candidate 「自主／廃業」.
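Recovering the reading of the text that survives a partial deletion can be sketched as follows. The sketch assumes the per-character correspondence is kept as a list of reading start offsets with one terminating entry; `delete_span` and that list layout are hypothetical choices for illustration.

```python
from typing import List, Tuple

def delete_span(surface: str, reading: str, starts: List[int],
                lo: int, hi: int) -> Tuple[str, str, List[int]]:
    """Delete surface[lo:hi] and the reading characters that belong to it.

    starts[i] is where the reading of surface[i] begins; starts[-1] is the
    end position + 1 of the whole reading.
    """
    r_lo, r_hi = starts[lo], starts[hi]
    new_surface = surface[:lo] + surface[hi:]
    new_reading = reading[:r_lo] + reading[r_hi:]
    removed = r_hi - r_lo
    # Entries after the deleted span shift left by the removed reading length.
    new_starts = starts[:lo] + [s - removed for s in starts[hi:]]
    return new_surface, new_reading, new_starts

surface, reading = "自主は偉業", "じしゅはいぎょう"
starts = [0, 1, 3, 4, 5, 8]
# Delete 「は偉業」 (surface positions 2 to 4).
kept = delete_span(surface, reading, starts, 2, 5)
```

The surviving pair (「自主」, 「じしゅ」) then serves as the first phrase, and the re-entered 「はいぎょう」 as the second.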
[0013]
The apparatus may further comprise an editing unit that partially deletes the mixed kanji-kana string produced by the conversion unit, and may be configured as follows. The storage control unit stores the mixed kanji-kana string remaining after the partial deletion in the preceding-phrase mixed kanji-kana string storage unit, stores the corresponding reading string in the preceding-phrase reading string storage unit, stores the reading string entered after the deletion in the following-phrase reading string storage unit, and stores the mixed kanji-kana string converted from that reading string in the following-phrase mixed kanji-kana string storage unit. The determination unit determines whether the break position obtained when the same multi-phrase reading string is entered and converted into a mixed kanji-kana string is the same as the break position of the reading strings stored in the storage units. When the determination unit reports that the break positions differ, the learning unit learns the break position of the reading string.
With this configuration, even when part of a string that has been entered and converted is deleted, the phrase-break position of the string that remains is learned, and correct kana-kanji conversion can be performed.
[0014]
The apparatus may further comprise a reading/kanji-kana correspondence table that stores the correspondence between each character of the reading string and each character of the mixed kanji-kana string, and may be configured as follows. The determination unit uses the correspondence table to determine whether the mixed kanji-kana string obtained when the conversion unit converts the concatenation of the preceding-phrase and following-phrase reading strings is identical to the concatenation of the preceding-phrase and following-phrase mixed kanji-kana strings. On receiving a result that they differ, the learning unit learns the correspondence between the concatenated reading string and the concatenated mixed kanji-kana string.
With this configuration, the break position entered phrase by phrase can be learned from the correspondence with the concatenation of the preceding-phrase and following-phrase mixed kanji-kana strings.
[0015]
The apparatus may further comprise a confirmation unit that moves the cursor and confirms a string in response to instructions from the input unit, and may be configured so that, when the cursor is moved to the string of another phrase after the confirmation unit has confirmed the reading string of the preceding phrase, the storage control unit deletes all strings stored in the preceding-phrase reading string storage unit and the preceding-phrase mixed kanji-kana string storage unit.
With this configuration, break learning is not performed between non-adjacent phrases, which arise when the user moves the cursor during kana-kanji conversion, so erroneous learning is prevented.
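A minimal sketch of this clearing behaviour is shown below. The class and method names are hypothetical, and emptying the current-phrase fields on confirmation is an assumption made to keep the example small; the text only states that the preceding-phrase stores are cleared on cursor movement.

```python
from dataclasses import dataclass

@dataclass
class PhraseBuffers:
    prev_reading: str = ""
    prev_surface: str = ""
    cur_reading: str = ""
    cur_surface: str = ""

    def confirm(self) -> None:
        """The confirmed phrase becomes the preceding phrase."""
        self.prev_reading, self.prev_surface = self.cur_reading, self.cur_surface
        self.cur_reading = self.cur_surface = ""   # assumption: start a fresh phrase

    def on_cursor_move(self) -> None:
        """Forget the preceding phrase so unrelated phrases are never paired."""
        self.prev_reading = self.prev_surface = ""

buffers = PhraseBuffers(cur_reading="じしゅ", cur_surface="自主")
buffers.confirm()
had_previous = buffers.prev_reading == "じしゅ"
buffers.on_cursor_move()
```

Once `on_cursor_move` has run, the next confirmed phrase has no preceding phrase to be combined with, so no break is learned across the jump.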
[0016]
When no string is stored in the preceding-phrase reading string storage unit and the preceding-phrase mixed kanji-kana string storage unit, the apparatus may be configured so that the conversion unit converts the confirmed mixed kanji-kana string immediately before the cursor back into a reading string, and the storage control unit stores that reading string in the preceding-phrase reading string storage unit and the mixed kanji-kana string in the preceding-phrase mixed kanji-kana string storage unit.
With this configuration, the burden of conversion operations on the user can be reduced.
[0017]
Hereinafter, the present invention will be described in detail based on an embodiment shown in the drawings. The present invention is not limited by this.
[0018]
FIG. 1 is a block diagram showing the configuration of the kana-kanji conversion apparatus of this embodiment. In FIG. 1, reference numeral 1 denotes a control unit that controls units 3 to 18 via a bus 2 in accordance with a control program. The control unit 1 is, for example, a computer comprising a CPU, a ROM, a RAM, and I/O ports. The control unit 1 also functions as the storage control unit, the determination unit, the learning unit, the editing unit, and the confirmation unit.
Reference numeral 3 denotes an input unit for entering character codes and editing codes such as conversion and deletion instructions. The input unit 3 is, for example, an input device such as a keyboard, a pen tablet, or a mouse.
Reference numeral 4 denotes a kana-kanji conversion unit that converts the reading kana entered through the input unit 3. It comprises, for example, the CPU of a computer and a ROM or EEPROM storing a kana-kanji conversion program.
[0019]
Reference numeral 5 denotes a kana-kanji conversion dictionary. The kana-kanji conversion dictionary 5 is, for example, a storage device such as a ROM, an EEPROM, a floppy disk (FD), or a hard disk (HD) storing the data used for kana-kanji conversion.
Reference numeral 6 denotes a phrase-break learning dictionary. The phrase-break learning dictionary 6 is, for example, a RAM or an EEPROM that stores learning data on the break positions of multiple phrases.
[0020]
Reference numeral 7 denotes a display buffer unit that stores entered reading strings, converted strings, and the like as display data. It is, for example, a RAM or an EEPROM.
Reference numeral 8 denotes a display unit that displays entered reading strings, converted strings, and the like. The display unit 8 is, for example, a display device such as a CRT display, a liquid crystal display (LCD), or a plasma display (PD).
Reference numeral 9 denotes a pointing unit that uses a cursor pointer to indicate the position at which characters are inserted, deleted, and so on. The pointing unit 9 is, for example, an input device such as a keyboard, a pen tablet, or a mouse.
Reference numeral 10 denotes a printing unit that prints the strings displayed on the screen of the display unit 8. The printing unit 10 is, for example, a thermal printer or an inkjet printer.
[0021]
Reference numeral 11 denotes a pre-phrase reading character string storage unit that stores the reading character string of the preceding phrase among the confirmed phrases, and is composed of, for example, a RAM or an EEPROM.
Reference numeral 12 denotes a post-phrase reading character string storage unit that stores the reading character string currently being input, and is composed of, for example, a RAM or an EEPROM.
Reference numeral 13 denotes a pre-phrase kanji-kana mixed string storage unit that stores the converted kanji-kana mixed string of the preceding phrase, and is composed of, for example, a RAM or an EEPROM.
Reference numeral 14 denotes a post-phrase kanji-kana mixed string storage unit that stores the converted kanji-kana mixed string of the following phrase, and is composed of, for example, a RAM or an EEPROM.
[0022]
Reference numeral 15 denotes a combined reading character string storage unit that stores the reading character string obtained by combining the reading character string of the preceding phrase, input immediately before, with the reading character string currently being input, and is composed of, for example, a RAM or an EEPROM.
Reference numeral 16 denotes a combined kanji-kana mixed string storage unit that stores the kanji-kana mixed string obtained by combining the converted kanji-kana mixed string of the preceding phrase with the currently converted kanji-kana mixed string of the following phrase, and is composed of, for example, a RAM or an EEPROM.
Reference numeral 17 denotes a reading/kanji-kana mixed string correspondence table that stores position information associating the input reading character string with the kanji-kana mixed string of the conversion result, and is composed of, for example, a RAM or an EEPROM.
[0023]
Reference numeral 18 denotes a storage medium storing the kana-kanji conversion program of the present invention, and the storage medium 18 is constituted by, for example, a computer-readable FD, HD, CD-ROM, or the like.
Reference numeral 19 denotes a storage medium reading unit that reads a kana-kanji conversion program from the storage medium 18 and installs the program in a RAM or an EEPROM.
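The storage units listed above can be pictured as fields of a single record. The following is a minimal sketch under stated assumptions: the class name `ConversionState`, its field names, and the `combine` helper are hypothetical, and the Japanese strings are reconstructed from the translated example rather than quoted from the figures.

```python
from dataclasses import dataclass, field

@dataclass
class ConversionState:
    """Hypothetical in-memory stand-in for units 6 and 11 to 17."""
    pre_reading: str = ""        # unit 11: reading of the preceding phrase
    post_reading: str = ""       # unit 12: reading currently being input
    pre_mixed: str = ""          # unit 13: converted preceding phrase
    post_mixed: str = ""         # unit 14: converted following phrase
    combined_reading: str = ""   # unit 15: unit 11 + unit 12
    combined_mixed: str = ""     # unit 16: unit 13 + unit 14
    correspondence: list = field(default_factory=list)  # table 17
    learned: dict = field(default_factory=dict)         # dictionary 6

    def combine(self) -> str:
        """Fill units 15 and 16 from units 11 to 14 and return the combined reading."""
        self.combined_reading = self.pre_reading + self.post_reading
        self.combined_mixed = self.pre_mixed + self.post_mixed
        return self.combined_reading

state = ConversionState(pre_reading="じしゅ", post_reading="はいぎょう",
                        pre_mixed="自主", post_mixed="廃業")
state.combine()
```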
[0024]
FIG. 2 is a block diagram illustrating the configuration of the storage medium of the present embodiment. The area shown in FIG. 2A stores the codes of the following programs: a kana-kanji conversion program 201 that converts an input character string into a kanji-kana mixed string with reference to the kana-kanji conversion dictionary; an editing program 202 that deletes, corrects, and adds characters; a phrase break determination program 203 that determines whether the break positions of phrases are the same; a phrase break learning program 204 that learns the break positions of phrases; a character string comparison program 205 that compares character strings to determine whether they are the same; and a character string learning program 206 that learns a character string when the character strings are not the same.
[0025]
The area shown in FIG. 2B stores a pre-phrase reading character string storage control program 207, a post-phrase reading character string storage control program 208, a pre-phrase kanji-kana mixed string storage control program 209, a post-phrase kanji-kana mixed string storage control program 210, a combined reading character string storage control program 211, a combined kanji-kana mixed string storage control program 212, a phrase break learning data storage control program 213, and a character string learning data storage control program 214.
[0026]
According to another aspect of the present invention, it is possible to provide a recording medium 18 storing a kana-kanji conversion program that causes the computer 1 to: sequentially convert, phrase by phrase, the reading character strings of a plurality of phrases input from the input unit 3 into kanji-kana mixed strings; store the reading character string of the preceding phrase of two phrases in the pre-phrase reading character string storage unit 11, the reading character string of the phrase that follows it in the post-phrase reading character string storage unit 12, the converted kanji-kana mixed string of the preceding phrase in the pre-phrase kanji-kana mixed string storage unit 13, and the converted kanji-kana mixed string of the following phrase in the post-phrase kanji-kana mixed string storage unit 14; determine whether the break position obtained on conversion is the same as the break position of the reading character strings stored in the storage units 11 to 13; and learn the break position.
[0027]
Therefore, by reading the kana-kanji conversion programs 201 to 214 of the present invention from the recording medium 18 and installing them in the RAM or the EEPROM, it is possible to realize a kana-kanji conversion device that, once a reading character string consisting of a plurality of phrases has been input phrase by phrase and converted correctly, learns the phrase break position and correctly converts a multi-phrase reading character string input the next time.
[0028]
FIG. 3 is a flowchart showing the procedure (1) of the kana-kanji conversion process of the present embodiment. FIG. 4 is a flowchart showing the procedure (2) of the kana-kanji conversion process of the present embodiment. FIG. 5 is a diagram illustrating a storage state of each storage unit according to the kana-kanji conversion process of the present embodiment.
In the present embodiment, for example, when the two-phrase reading character string "jishuhaigyo" is converted into the kanji-kana mixed string "independent is a great feat", the reading is divided into the two phrases "jishu" and "haigyo" and each is converted, giving the kanji-kana mixed strings "independent" and "closure" (together, "voluntary closure"). From the next time on, the correct kanji-kana mixed string "voluntary closure" is obtained when the reading character string "jishuhaigyo" is input.
[0029]
In the flowchart of FIG.
Step 301: A key code is input from the input unit 3.
Step 302: Check whether the input key code is a character code. If the key code is a character code, proceed to step 303; otherwise, proceed to step 306.
Step 303: Store the character code in the post-phrase reading character string storage unit 12.
FIG. 5A shows that "jishu" (5-1) has been input to the post-phrase reading character string storage unit 12. In the present embodiment, the phrase that the user is about to convert is treated as the following phrase for convenience.
[0030]
Step 304: Store the character in the display buffer 7.
Step 305: Display the conversion result on the display unit 8.
Step 306: Check whether the key code is a conversion key. If the key code is not a conversion key, proceed to step 307; otherwise, proceed to a kana-kanji conversion process of step 314.
Step 307: Check whether the key code is an instruction for confirmation. If it is not a confirmation instruction, the process returns to step 301. In the case of the confirmation instruction, the process proceeds to step 308.
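The branching in steps 302, 306, and 307 amounts to a small dispatch on the key code. The sketch below is illustrative only: the function name `next_step`, the key names `"CONVERT"` and `"CONFIRM"`, and the rule that any single character counts as a character code are assumptions, not part of the specification.

```python
def next_step(key: str) -> int:
    """Return the step that handles a key code (steps 302, 306, and 307)."""
    if key == "CONVERT":   # step 306: conversion key
        return 314
    if key == "CONFIRM":   # step 307: confirmation instruction
        return 308
    if len(key) == 1:      # step 302: assumed test for a character code
        return 303
    return 301             # anything else: wait for the next key code
```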
[0031]
In FIG.
Step 315: Perform kana-kanji conversion of the post-phrase reading character string (5-1). The kana-kanji conversion unit 4 performs the conversion with reference to the kana-kanji conversion dictionary 5. FIG. 5B shows that the reading character string "jishu" has been converted and the result "independent" (5-2) has been stored in the post-phrase kanji-kana mixed string storage unit 14.
Step 316: Check whether the conversion result is a plurality of clauses. If YES, proceed to Step 317; otherwise, proceed to Step 319.
Step 317: Check whether the conversion result is registered in the phrase segmentation learning dictionary 6, and if YES, proceed to Step 318, otherwise proceed to Step 319.
[0032]
Step 318: Store the learning candidate in the display buffer 7.
Step 319: Store the conversion result in the display buffer 7. The process then proceeds to step 305, where the conversion result "independent" is displayed on the display unit 8.
When the user confirms "independent", the process returns to step 301 and then proceeds to step 308.
[0033]
Step 308: Check whether there is a preceding phrase reading character string in the preceding phrase reading character string storage unit 11. In this case, since it is not stored, the process proceeds to step 313.
Step 313: As shown in FIG. 5C, the post-phrase reading character string is copied to the pre-phrase reading character string storage unit 11 as the pre-phrase reading character string (5-3), and the post-phrase kanji-kana mixed string is copied to the pre-phrase kanji-kana mixed string storage unit 13 as the pre-phrase kanji-kana mixed string (5-4). The process then returns to steps 304 and 305.
[0034]
Subsequently, as shown in FIG. 5E, the reading character string "haigyo" is input and converted, and the resulting kanji-kana mixed string "closure" is confirmed.
Step 307: Check whether it is a confirmation instruction.
Step 308: Check whether there is a preceding phrase reading character string in the preceding phrase reading character string storage unit 11. In this case, since it is stored, the process proceeds to step 309.
Step 309: The pre-phrase reading character string and the post-phrase reading character string are combined, and the combined reading character string "jishuhaigyo" is stored in the combined reading character string storage unit 15, as shown in FIG.
Step 310: The combined reading character string is converted. In the state where no learning has been performed, the conversion result "independent is / a great feat" is stored in the combined kanji-kana mixed string storage unit 16, as shown in FIG. 5G.
[0035]
Step 311: Check whether the conversion result consists of two phrases and whether its break position lies between the pre-phrase reading character string and the post-phrase reading character string. In this case, the break position of the conversion result, "jishuha / igyo", is not the same as the break position "jishu / haigyo" between the pre-phrase and post-phrase reading character strings, so the process proceeds to step 312 to learn the break position. If the break positions are the same, no phrase break learning is performed and the process proceeds to step 313.
Step 312: The phrase break learning registration process is performed, the learning information is stored in the phrase segmentation learning dictionary 6, and the process proceeds to step 313. FIG. 5H shows the state when the process has proceeded to step 313.
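Steps 309 to 312 can be sketched as a single comparison between the user's two confirmed phrases and the result of converting them jointly. This is a minimal sketch, assuming a hypothetical converter that returns a list of `(reading, mixed)` phrase pairs; `toy_convert`, `learn_on_confirm`, and the Japanese strings (reconstructed from the translated example) are illustrative and not taken from the specification.

```python
def toy_convert(reading):
    """Hypothetical multi-phrase converter: returns [(reading, mixed), ...]."""
    table = {"じしゅはいぎょう": [("じしゅは", "自主は"), ("いぎょう", "偉業")]}
    return table.get(reading, [(reading, reading)])

def learn_on_confirm(pre_reading, post_reading, convert, learned):
    """Register the user's break position when joint conversion disagrees with it."""
    combined = pre_reading + post_reading             # step 309
    phrases = convert(combined)                       # step 310
    readings = [r for r, _ in phrases]
    if readings != [pre_reading, post_reading]:       # step 311
        learned[combined] = (pre_reading, post_reading)  # step 312
        return True
    return False                                      # same break: nothing to learn

learned = {}
learn_on_confirm("じしゅ", "はいぎょう", toy_convert, learned)
```

Comparing the list of phrase readings with the two stored readings covers both conditions of step 311 at once: the result must have exactly two phrases, and the break must fall at the same place.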
[0036]
FIG. 6 is a diagram showing the stored contents of the phrase segmentation learning dictionary of the present embodiment. As shown in FIG. 6A, the phrase break learning registration process may be configured to store the kanji-kana mixed string "voluntary closure" for the reading character string "jishuhaigyo". In this case, the kanji-kana mixed string "voluntary closure" has been learned for the reading character string "jishuhaigyo".
Alternatively, as shown in FIG. 6B, a configuration may be adopted in which only the reading character strings "jishu" and "haigyo" are registered and the kanji-kana mixed string is not registered.
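The two storage forms of FIG. 6 differ in what a later lookup has to do. The sketch below is illustrative: the dictionary layouts, the function names, and the per-phrase dictionary `PHRASES` are assumptions, and the Japanese strings are reconstructed from the translated example.

```python
# Form (A): reading -> learned kanji-kana mixed string.
learned_a = {"じしゅはいぎょう": "自主廃業"}
# Form (B): reading -> phrase readings only; each phrase is converted again on lookup.
learned_b = {"じしゅはいぎょう": ("じしゅ", "はいぎょう")}

PHRASES = {"じしゅ": "自主", "はいぎょう": "廃業"}  # hypothetical single-phrase dictionary

def lookup_a(learned, reading):
    """Form (A): the stored string is returned directly."""
    return learned.get(reading)

def lookup_b(learned, reading, phrase_dict):
    """Form (B): split at the learned break, then convert phrase by phrase."""
    parts = learned.get(reading)
    if parts is None:
        return None
    return "".join(phrase_dict[p] for p in parts)
```

Form (B) stores less per entry but depends on the single-phrase conversion of each part still yielding the intended candidate.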
[0037]
With reference to the flowcharts of FIG. 3 and FIG. 4, an example will now be described in which, after the phrase break position of the kanji-kana mixed string "voluntary closure" has been learned for the reading character string "jishuhaigyo", the user inputs "jishuhaigyo" at one time and converts it.
Step 315: For the reading character string "jishuhaigyo", conversion candidates such as "independent is / a great feat" and "voluntary / closure" are obtained.
Step 316: Check whether the conversion result is a plurality of clauses. If YES, proceed to Step 317; otherwise, proceed to Step 319.
[0038]
Step 317: Check whether the conversion result is registered in the phrase segmentation learning dictionary. If YES, proceed to Step 318; if NO, proceed to Step 319.
In this case, the phrase break was learned in step 312 as shown in FIG. 6A, so "voluntary closure" is the learning candidate.
Step 318: The learning candidate “voluntary closure” is stored in the display buffer 7. Next, the process proceeds to step 305, where the conversion result "voluntary closure" is displayed on the display unit 8.
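The choice between the learning candidate and the plain conversion result in steps 315 to 319 can be sketched as follows. The function names, the converter interface, and keying the learning dictionary by the reading are assumptions; the Japanese strings are reconstructed from the translated example.

```python
def toy_convert(reading):
    """Hypothetical multi-phrase converter: returns [(reading, mixed), ...]."""
    table = {"じしゅはいぎょう": [("じしゅは", "自主は"), ("いぎょう", "偉業")]}
    return table.get(reading, [(reading, reading)])

def display_string(reading, convert, learned):
    """Return the string that is placed in the display buffer."""
    phrases = convert(reading)                      # step 315
    if len(phrases) > 1 and reading in learned:     # steps 316 and 317
        return learned[reading]                     # step 318: learning candidate
    return "".join(mixed for _, mixed in phrases)   # step 319: plain result
```

Before learning, the sketch returns the unlearned result; once the entry is registered, the same input yields the learned string.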
[0039]
FIG. 7 is a diagram showing the storage contents (1) of the reading kanji kana mixed character string correspondence table of the present embodiment. FIG. 8 is a diagram showing the storage contents (2) of the reading kanji kana mixed character string correspondence table of the present embodiment. FIG. 9 is a diagram showing the storage contents (3) of the reading kanji kana mixed character string correspondence table of the present embodiment. In particular, FIGS. 7 and 8 are explanatory diagrams in which information for examining the reading of one kanji is stored.
[0040]
FIG. 7A shows a state in which the reading character string "jishuhaigyo" has been input; the numbers in the figure indicate the position of each character counted from the beginning.
FIG. 7B shows the kanji-kana mixed string "independent is a great feat" obtained as the result of the kana-kanji conversion.
FIG. 7C shows, for each character of the kanji-kana mixed string, the position of the corresponding character in the reading character string; this information is stored in the reading/kanji-kana mixed string correspondence table 17.
[0041]
FIG. 10 is a diagram showing the storage contents of the kana-kanji conversion dictionary of the present embodiment. In the kana-kanji conversion dictionary shown in FIG. 10, each kanji code is stored in two bytes using the JIS X 0208 code, and the number of reading characters for one kanji is stored in the MSBs of the first and second bytes of that two-byte kanji code, which the JIS X 0208 code does not use.
[0042]
As shown in FIG. 10A, when the number of reading characters for one kanji is one, the MSBs of the first and second bytes of the kanji code are both set to 0. When the number of reading characters for one kanji is two, the MSB of the first byte of the kanji code is 0 and the MSB of the second byte is 1. Continuing in the same manner, when the number of reading characters for one kanji is five or more, the MSBs of both the first and second bytes of the kanji code are 1, and the next byte contains a number of five or more, which is the number of reading characters.
[0043]
FIG. 10B is an explanatory diagram showing the reading characters and the kanji notation of words registered in the kana-kanji conversion dictionary, together with the number of reading characters for each kanji in the notation of each word.
[0044]
FIG. 10 illustrates an example of the correspondence between the reading character string "igyo" and the kanji notation "great feat".
Examining the kanji code of the kanji "great", the MSBs of the first and second bytes are both 0, so its reading is one character long and is therefore "i". Next, examining the kanji code of the kanji "work", the MSB of the first byte is 1 and the MSB of the second byte is 0, so its reading is three characters long. The reading of the kanji "work" is therefore "gyo", which is three kana characters.
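The MSB encoding described for FIG. 10 can be decoded as sketched below. This is a reading of the translated text, not a definitive implementation: the function names are hypothetical, the byte values in the usage line are placeholders rather than real JIS X 0208 code points, and the text does not spell out how a reading of exactly four characters is encoded, so the sketch simply takes the count from the following byte whenever both MSBs are 1.

```python
def reading_length(code: bytes):
    """Return (number of reading characters, bytes consumed) for one kanji entry."""
    flags = ((code[0] >> 7) << 1) | (code[1] >> 7)  # MSB of byte 1, MSB of byte 2
    if flags == 0b00:
        return 1, 2
    if flags == 0b01:
        return 2, 2
    if flags == 0b10:
        return 3, 2      # matches the three-character "gyo" example
    return code[2], 3    # both MSBs set: count is held in the following byte

def kanji_code(code: bytes) -> bytes:
    """Strip the two flag bits to recover the 7-bit, two-byte kanji code."""
    return bytes([code[0] & 0x7F, code[1] & 0x7F])

length, used = reading_length(bytes([0xB0, 0x4E]))  # placeholder bytes, flags 1 and 0
```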
[0045]
By using the kana-kanji conversion dictionary described above, when the reading character string "jishuhaigyo" is converted and the result is "independent is a great feat", searching the kana-kanji conversion dictionary 5 during the conversion makes it possible to know the reading characters of each kanji.
In this example, "self" corresponds to "ji", "main" to "shu", "ha" to "ha", "great" to "i", and "work" to "gyo", and this correspondence is stored in the reading/kanji-kana mixed string correspondence table as shown in (C).
[0046]
The reading/kanji-kana mixed string correspondence table shown in FIG. 7C indicates, for each character of the kanji-kana mixed string, the position in the reading character string at which its reading starts.
Accordingly, "self" is at position 0 in the kanji-kana mixed string and its reading starts at position 0; "main" is at position 1 and its reading starts at position 1; "ha" is at position 2 and its reading starts at position 3; "great" is at position 3 and its reading starts at position 4; and likewise "work" is at position 4 and its reading starts at position 5.
The last entry of the table indicates the position one past the end of the last character, so it is 5 for the kanji-kana mixed string and 8 for the reading character string.
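The table of FIG. 7C can be held as a list of position pairs, with the end marker as its last row. The sketch below uses the positions given in the text; the names `TABLE` and `reading_for` are hypothetical, and the Japanese strings are reconstructed from the translated example and are an assumption.

```python
MIXED = "自主は偉業"          # kanji-kana mixed string, positions 0 to 4
READING = "じしゅはいぎょう"  # reading character string, positions 0 to 7

# Rows are (position in the mixed string, start position of its reading).
# The last row is the end marker: one past the final character in both strings.
TABLE = [(0, 0), (1, 1), (2, 3), (3, 4), (4, 5), (5, 8)]

def reading_for(table, reading, start, end):
    """Return the reading of the mixed-string characters in [start, end)."""
    starts = dict(table)
    return reading[starts[start]:starts[end]]
```

Because every row stores only a start position, the end of one character's reading is read from the next row, which is why the end marker row is needed.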
[0047]
FIG. 11 is a flowchart showing the procedure (3) of the kana-kanji conversion process of the present embodiment. In FIG. 11, the processing from step 1101 to step 1114 is the same as the processing from step 301 to step 314 shown in FIG.
FIG. 12 is a flowchart showing the procedure (4) of the kana-kanji conversion process of the present embodiment. Note that the processing from step 1130 to step 1137 shows a flowchart for determining the break position between the pre-phrase reading character string and the post-phrase reading character string by the conversion result partial deletion processing.
[0048]
Step 1130: Check whether the input key code is a delete key. If YES, the process proceeds to the character deletion process of step 1131. If NO, the process proceeds to step 1106.
Step 1132: It is checked whether or not the target of the deletion instruction is the part of the kanji kana mixed string of the previous phrase. If YES, the process proceeds to step 1133, and if NO, the process proceeds to step 1136.
[0049]
Step 1133: Because the deletion targets the preceding phrase, delete the target character string from the pre-phrase kanji-kana mixed string. Before the deletion, "independent is a great feat" has been input as shown in FIG. 7B; as shown in FIG. 8B, the single character "work" is deleted.
Step 1134: As shown in FIG. 7C, the deleted character starts at position 4 and ends at position 5 in the kanji-kana mixed string, so the corresponding reading is found to start at position 5 and end at position 8. The reading "gyo" is therefore deleted, giving the state shown in FIG. 9A.
[0050]
Here, the end position actually indicates the start position of the next character, so it is one greater than the position of the actual last character.
Step 1135: Update the reading/kanji-kana mixed string correspondence table to the state shown in FIG. 9C, proceed to step 1137, and end the character deletion processing.
Step 1136: Delete the target character string, which lies outside the preceding phrase, and proceed to the end processing of step 1137.
[0051]
Suppose the user wishes to obtain the kanji-kana mixed string "voluntary closure" by converting the reading character string "jishuhaigyo", but the conversion result is "independent is a great feat". When the user deletes "is a great feat" using the character deletion processing described above, the control unit 1 deletes the three trailing characters of the kanji-kana mixed string. The three trailing characters shown in (C) are deleted, and the result is as shown in FIG. 9C.
[0052]
Therefore, the reading character string of the kanji-kana mixed string "independent" that remains undeleted, shown in FIG. 9B, is found to be "jishu", shown in FIG. 9A.
This "jishu" is used as the pre-phrase reading character string and "independent" as the pre-phrase kanji-kana mixed string, and the reading character string input next is converted as the post-phrase reading character string.
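The partial deletion of steps 1133 to 1135 can be sketched as one function that removes a span from the kanji-kana mixed string, removes the matching span of the reading, and rebuilds the table. The function name and data layout are assumptions; the positions follow the table described for FIG. 7C, and the Japanese strings are reconstructed from the translated example.

```python
TABLE = [(0, 0), (1, 1), (2, 3), (3, 4), (4, 5), (5, 8)]  # last row is the end marker

def delete_range(mixed, reading, table, start, end):
    """Delete mixed[start:end] together with its reading and update the table."""
    starts = dict(table)
    r_from, r_to = starts[start], starts[end]
    new_mixed = mixed[:start] + mixed[end:]                  # step 1133
    new_reading = reading[:r_from] + reading[r_to:]          # step 1134
    shift_m, shift_r = end - start, r_to - r_from
    new_table = [(m, r) for m, r in table if m < start]      # step 1135
    new_table += [(m - shift_m, r - shift_r) for m, r in table if m >= end]
    return new_mixed, new_reading, new_table

mixed, reading, table = delete_range("自主は偉業", "じしゅはいぎょう", TABLE, 2, 5)
```

After the call, the remaining strings can serve as the pre-phrase kanji-kana mixed string and pre-phrase reading, so that the re-entered phrase is learned like a phrase typed separately.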
[0053]
Therefore, even when a multi-phrase reading character string is input and converted at once, only the correctly converted part of the kanji-kana mixed string is kept, the incorrectly converted part is deleted, and the reading corresponding to the deleted part is input again and converted, the phrase break position can still be learned.
According to this embodiment, even when part of a character string that was input and converted at once is deleted, the phrase break position relative to the character string that remains undeleted is learned, so that correct kana-kanji conversion can be performed.
[0054]
FIG. 13 is a flowchart showing the procedure (5) of the kana-kanji conversion process of the present embodiment. In FIG. 13, the processing from step 1301 to step 1314 is the same as the processing from step 301 to step 314 shown in FIG.
Step 1320: Check whether the key code input by the user is a cursor movement instruction. If YES, the process proceeds to step 1321, and if NO, the process proceeds to step 1306.
Step 1321: If a pre-phrase reading character string and a pre-phrase kanji-kana mixed string are stored in the pre-phrase reading character string storage unit and the pre-phrase kanji-kana mixed string storage unit, clear them.
[0055]
According to this embodiment, when the cursor is moved and a character string is then input, the character string input before the move is discarded. This avoids erroneous learning between unrelated phrases, allows phrase break learning to be performed correctly, and improves the convenience of the user's conversion operation. In other words, phrase break learning is not performed between phrases made discontinuous by a cursor movement, which prevents erroneous learning.
[0056]
FIG. 14 is a flowchart showing the procedure (6) of the kana-kanji conversion process of the present embodiment. In FIG. 14, the processing from step 1401 to step 1414 is the same as the processing from step 301 to step 314 shown in FIG.
Step 1420: Check whether the key code input by the user is a cursor movement instruction. If YES, proceed to Step 1421; if NO, proceed to Step 1406.
Step 1421: If a pre-phrase reading character string and a pre-phrase kanji-kana mixed string are stored in the pre-phrase reading character string storage unit and the pre-phrase kanji-kana mixed string storage unit, clear them.
[0057]
Step 1422: When no reading character string is stored in the pre-phrase reading character string storage unit, check whether there is a confirmed character string in front of the current cursor position. If YES, the process proceeds to step 1423, and if NO, the process proceeds to step 1413.
Step 1423: Restore the confirmed kanji-kana mixed string in front of the cursor to its reading character string. The reading character string is stored in the pre-phrase reading character string storage unit, the kanji-kana mixed string that was restored to kana is stored in the pre-phrase kanji-kana mixed string storage unit, and the process returns to step 1408. The reading character string may be recovered either by keeping the reading character string as it was when the user input it, or by converting the character string back into a kana character string by a known method.
[0058]
According to this embodiment, when the cursor is moved and a character string is then input, the phrase in front of the new cursor position is restored from its kanji-kana mixed string to a reading character string, which improves the convenience of the conversion operation.
Further, even if the cursor is moved, the relationship between the preceding and following phrases can be correctly determined, and phrase break learning can be performed correctly.
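Steps 1421 to 1423 can be sketched as follows, taking the first of the two recovery methods mentioned above (keeping the reading as the user input it). The data layout is an assumption: `confirmed` is a hypothetical list of `(mixed, reading)` phrases in document order, `cursor` is the index of the phrase slot the cursor moved to, and the Japanese strings are reconstructed from the translated example.

```python
def on_cursor_move(state, confirmed, cursor):
    """Reset the pre-phrase stores, then reload them from the phrase before the cursor."""
    state["pre_reading"] = ""                    # step 1421: clear unit 11
    state["pre_mixed"] = ""                      #            clear unit 13
    if cursor > 0:                               # step 1422: confirmed text before cursor?
        mixed, reading = confirmed[cursor - 1]   # step 1423: restore its stored reading
        state["pre_reading"] = reading
        state["pre_mixed"] = mixed
    return state

confirmed = [("自主", "じしゅ"), ("廃業", "はいぎょう")]
state = on_cursor_move({"pre_reading": "はいぎょう", "pre_mixed": "廃業"}, confirmed, 1)
```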
[0059]
[Effects of the invention]
According to the present invention, if a reading character string consisting of a plurality of phrases is divided and input phrase by phrase and is converted correctly, the correct phrase break position is learned, and a multi-phrase reading character string input the next time can be correctly converted into kanji and kana.
[Brief description of the drawings]
FIG. 1 is a block diagram showing the configuration of the kana-kanji conversion device of the present embodiment.
FIG. 2 is a block diagram showing the configuration of the storage medium of the present embodiment.
FIG. 3 is a flowchart showing procedure (1) of the kana-kanji conversion process of the present embodiment.
FIG. 4 is a flowchart showing procedure (2) of the kana-kanji conversion process of the present embodiment.
FIG. 5 is a diagram showing the storage state of each storage unit during the kana-kanji conversion process of the present embodiment.
FIG. 6 is a diagram showing the stored contents of the phrase segmentation learning dictionary of the present embodiment.
FIG. 7 is a diagram showing storage contents (1) of the reading/kanji-kana mixed string correspondence table of the present embodiment.
FIG. 8 is a diagram showing storage contents (2) of the reading/kanji-kana mixed string correspondence table of the present embodiment.
FIG. 9 is a diagram showing storage contents (3) of the reading/kanji-kana mixed string correspondence table of the present embodiment.
FIG. 10 is a diagram showing the storage contents of the kana-kanji conversion dictionary of the present embodiment.
FIG. 11 is a flowchart showing procedure (3) of the kana-kanji conversion process of the present embodiment.
FIG. 12 is a flowchart showing procedure (4) of the kana-kanji conversion process of the present embodiment.
FIG. 13 is a flowchart showing procedure (5) of the kana-kanji conversion process of the present embodiment.
FIG. 14 is a flowchart showing procedure (6) of the kana-kanji conversion process of the present embodiment.
[Explanation of symbols]
1 Control unit
2 Bus
3 Input unit
4 Kana-kanji conversion unit
5 Kana-kanji conversion dictionary
6 Phrase segmentation learning dictionary
7 Display buffer unit
8 Display unit
9 Instruction unit
10 Printing unit
11 Pre-phrase reading character string storage unit
12 Post-phrase reading character string storage unit
13 Pre-phrase kanji-kana mixed string storage unit
14 Post-phrase kanji-kana mixed string storage unit
15 Combined reading character string storage unit
16 Combined kanji-kana mixed string storage unit
17 Reading/kanji-kana mixed string correspondence table
18 Storage medium
19 Storage medium reading unit

Claims

A kana-kanji conversion device comprising:
a phrase segmentation learning dictionary that stores phrase break information derived from multi-phrase conversion results;
a pre-phrase reading character string storage unit that stores a reading character string that has already been converted to kanji and kana;
a post-phrase reading character string storage unit that stores a reading character string input following the reading character string stored in the pre-phrase reading character string storage unit;
a conversion unit that converts the reading character string in the post-phrase reading character string storage unit, either by single-phrase conversion or by multi-phrase conversion that refers to the phrase segmentation learning dictionary for phrase breaks, to obtain a kanji-kana mixed string as the kana-kanji conversion result, and that further performs multi-phrase conversion on a combined reading character string obtained by combining the reading character strings stored in the pre-phrase reading character string storage unit and the post-phrase reading character string storage unit;
a determination unit that determines whether the conversion result of the combined reading character string consists of two phrases and whether the reading character string of each phrase is the same as the reading character string stored in the pre-phrase reading character string storage unit and the post-phrase reading character string storage unit, respectively; and
phrase break learning control means that, when the determination unit determines that they are not the same, stores break information whose units are the reading character strings stored in the pre-phrase reading character string storage unit and the post-phrase reading character string storage unit in the phrase segmentation learning dictionary as the phrase break information of the combined reading character string.

The kana-kanji conversion device according to claim 1, further comprising:
a post-phrase kanji-kana mixed string storage unit that stores a kanji-kana mixed string, containing at least one of kanji and kana, obtained by kana-kanji conversion of the reading character string in the post-phrase reading character string storage unit;
a pre-phrase kanji-kana mixed string storage unit that stores a kanji-kana mixed string that has already been input and converted;
a reading/kanji-kana mixed string correspondence table that stores the correspondence between each character of the reading character strings in the pre-phrase reading character string storage unit and the post-phrase reading character string storage unit and each character of the kanji-kana mixed string in the pre-phrase kanji-kana mixed string storage unit; and
an editing unit that partially deletes the kanji-kana mixed string converted by the conversion unit,
wherein, when the character string to be partially deleted by the editing unit corresponds to the contents of the pre-phrase kanji-kana mixed string storage unit and the pre-phrase reading character string storage unit, the target character string in the pre-phrase kanji-kana mixed string storage unit is deleted and that unit is updated, the corresponding character string in the pre-phrase reading character string storage unit is deleted and that unit is updated based on the contents of the reading/kanji-kana mixed string correspondence table, the reading character string input next is stored in the post-phrase reading character string storage unit, and the kanji-kana mixed string converted from that reading character string is stored in the post-phrase kanji-kana mixed string storage unit,
the determination unit determines whether the conversion result obtained by multi-phrase conversion of the combined reading character string, obtained by combining the reading character strings stored in the pre-phrase reading character string storage unit and the post-phrase reading character string storage unit, consists of two phrases and whether the reading character string of each phrase is the same as the reading character string stored in the pre-phrase reading character string storage unit and the post-phrase reading character string storage unit, respectively, and
when the determination unit determines that they are not the same, the phrase break learning control means stores break information whose units are the reading character strings stored in the pre-phrase reading character string storage unit and the post-phrase reading character string storage unit as the phrase break information of the combined reading character string.

A kana-kanji conversion method for performing kana-kanji conversion by applying single-phrase conversion or continuous-phrase conversion to a reading character string input one phrase or a plurality of phrases at a time, the method comprising:
Storing a reading character string that has already undergone kana-kanji conversion in a preceding phrase reading character string storage unit,
Storing, in a subsequent phrase reading character string storage unit, a reading character string input following the reading character string stored in the preceding phrase reading character string storage unit,
Converting the reading character string in the subsequent phrase reading character string storage unit by single-phrase conversion, or by continuous-phrase conversion that refers to a phrase segmentation learning dictionary to determine phrase boundaries, to obtain a mixed kanji-kana string as the kana-kanji conversion result,
Performing continuous-phrase conversion on a combined reading character string formed by combining the reading character string stored in the preceding phrase reading character string storage unit and the reading character string stored in the subsequent phrase reading character string storage unit,
Determining whether the conversion result of the combined reading character string consists of two phrases, and whether the reading character string of each phrase is the same as the reading character strings stored in the preceding phrase reading character string storage unit and the subsequent phrase reading character string storage unit, respectively, and
When it is determined that they are not the same, storing, in the phrase segmentation learning dictionary, segmentation information in units of the reading character strings stored in the preceding phrase reading character string storage unit and the subsequent phrase reading character string storage unit as the phrase segmentation information of the combined reading character string.
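
The determination and learning steps of this method can be sketched in Python as follows. This is only an illustration under stated assumptions, not the patented implementation: the functions `learn_segmentation` and `make_segmenter`, the dictionary layout (combined reading mapped to a pair of readings), and the `default_splits` table that stands in for an ordinary continuous-phrase converter are all hypothetical.

```python
from typing import Callable, Dict, List, Tuple

# Assumed interface: a continuous-phrase converter reduced to the readings
# of the phrases it chooses for a given input reading.
Segmenter = Callable[[str], List[str]]
LearningDict = Dict[str, Tuple[str, str]]


def make_segmenter(learning_dict: LearningDict,
                   default_splits: Dict[str, List[str]]) -> Segmenter:
    """Build a toy segmenter that consults the learning dictionary first."""
    def segment(reading: str) -> List[str]:
        if reading in learning_dict:
            return list(learning_dict[reading])
        # Fall back to a fixed table that mimics the converter's default guess.
        return list(default_splits.get(reading, [reading]))
    return segment


def learn_segmentation(prev_reading: str, next_reading: str,
                       segment: Segmenter,
                       learning_dict: LearningDict) -> bool:
    """Store the user's phrase boundary if continuous conversion disagrees.

    Returns True when a new entry was stored.
    """
    combined = prev_reading + next_reading
    phrases = segment(combined)
    # "Same" means: exactly two phrases, each matching the stored reading.
    same = phrases == [prev_reading, next_reading]
    if not same:
        learning_dict[combined] = (prev_reading, next_reading)
    return not same


learning: LearningDict = {}
defaults = {"ここではきものを": ["ここでは", "きものを"]}
segment = make_segmenter(learning, defaults)

before = segment("ここではきものを")
stored = learn_segmentation("ここで", "はきものを", segment, learning)
after = segment("ここではきものを")
print(before, stored, after)
```

In this toy run the combined reading is first split as ここでは / きものを; once the user has entered ここで and はきものを as separate phrases, the boundary is stored and the same combined reading is split as ここで / はきものを.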

A storage medium storing a kana-kanji conversion program for converting a reading character string input one phrase or a plurality of phrases at a time by single-phrase conversion or continuous-phrase conversion, the program causing a computer to execute:
Storing a reading character string that has already undergone kana-kanji conversion in a preceding phrase reading character string storage unit,
Storing, in a subsequent phrase reading character string storage unit, a reading character string input following the reading character string stored in the preceding phrase reading character string storage unit,
Converting the reading character string in the subsequent phrase reading character string storage unit by single-phrase conversion, or by continuous-phrase conversion that refers to a phrase segmentation learning dictionary to determine phrase boundaries, to obtain a mixed kanji-kana string as the kana-kanji conversion result,
Performing continuous-phrase conversion on a combined reading character string formed by combining the reading character string stored in the preceding phrase reading character string storage unit and the reading character string stored in the subsequent phrase reading character string storage unit,
Determining whether the conversion result of the combined reading character string consists of two phrases, and whether the reading character string of each phrase is the same as the reading character strings stored in the preceding phrase reading character string storage unit and the subsequent phrase reading character string storage unit, respectively, and
When it is determined that they are not the same, storing, in the phrase segmentation learning dictionary, segmentation information in units of the reading character strings stored in the preceding phrase reading character string storage unit and the subsequent phrase reading character string storage unit as the phrase segmentation information of the combined reading character string.