JP3796651B2

JP3796651B2 - Recording medium for Korean language input program

Info

Publication number: JP3796651B2
Application number: JP2000273796A
Authority: JP
Inventors: 博文福本
Original assignee: Omron Corp
Current assignee: Omron Corp
Priority date: 2000-09-08
Filing date: 2000-09-08
Publication date: 2006-07-12
Anticipated expiration: 2020-09-08
Also published as: JP2002082940A

Description

【０００１】
【発明の属する技術分野】
この発明は、コンピュータにおいて、ワープロ、表計算など稼働中のアプリケーションに、韓国語による文字入力を行う技術に関するもので、特に韓国語による文字入力処理を支援する方法、およびその方法が適用された韓国語入力用の文字入力システム（"Input Method Editor"／以下「ＩＭＥ」と略す。）に関連する。
【０００２】
【従来の技術】
韓国語に用いられるハングル文字（以下単に「ハングル」という。）は、「字母」と呼ばれる発音記号を、音節毎に組み合わせて表される。
この組合せには、「子音＋母音」、「子音＋母音＋子音」の２通りの組合せがある。最初の子音は「初声子音」、最後の子音は「終声子音」、母音は「中声母音」と呼ばれており、それぞれ図８に示すような複数種の字母が割り当てられている。
【０００３】
ハングル文字列の入力機能を具備するコンピュータでは、上記した各字母はそれぞれキーボードの所定のキーに割り付けられる。コンピュータは、ハングル文字列による文字入力時には、「初声子音」、「中声母音」の順に字母の入力を受け付け、さらに必要に応じて「終声子音」の入力を受け付けて、ハングルを一文字ずつ組み立てる。そして所定のタイミングで確定操作が行われると、それまでに組み立てられた各ハングルによる文字列を確定文字列としてアプリケーション側に出力する。また地名や人名のような強調したい単語を表すハングル文字列に対し変換処理が行われると、コンピュータは前記ハングル文字列をハンチャ（漢字）の文字列に変換した後に、同様の確定操作に応じてアプリケーション側に出力する。
【０００４】
【発明が解決しようとする課題】
上記の方法では、ハングル文字列の各構成文字毎に２〜３の字母を入力しなければならない。また標準仕様によるキーボード（１０６キーボードなど）では、各アルファベットキーに対しそれぞれ複数個の字母が割り当てられるが、これによって１つの字母を入力するのに、複数回、同じキーを打鍵する操作やコントロールキーのような他のキーと組み合わせて打鍵する操作が必要となり、ハングルの入力に要する操作はますます複雑化する。また操作に不慣れな初心者にとって、字母の割付キーを認識するのはきわめて困難であり、入力ミスが多発し、文字入力の効率が著しく悪くなる、という問題も生じる。
【０００５】
この発明は上記問題点に着目してなされたもので、上記のような煩雑な操作を行うことなく、韓国語文字列の入力処理を簡単かつ高速化できるようにすることを目的とする。
さらにこの発明は、ハンチャにより表現した単語を再度入力する場合に、先頭位置のハングルを組み立てる字母の入力によって、ハンチャを含む文字列およびハングルのみの文字列のいずれでも入力できるようにすることにより、韓国語文字列の入力操作をより簡単に行えるようにすることを目的とする。
【０００６】
【課題を解決するための手段】
この発明にかかる韓国語入力用のプログラム記憶媒体は、字母の入力を受け付ける入力受付手段、前記入力受付手段が受け付けた字母の入力順序に基づきハングル文字列を組み立てる組立手段、前記組立手段により組み立てられた所定のハングル文字列に対して変換操作が行われたとき、その操作対象のハングル文字列をハンチャの文字列に変換する変換手段、前記組立手段により組み立てられたハングル文字列または前記変換手段により変換されたハンチャの文字列を含む文字列に対し確定操作が行われたとき、その操作対象の文字列を所定のアプリケーションに出力する第１の出力手段、前記確定操作に応じてその操作対象の文字列を当該文字列の組み立てに必要な字母の入力順序に対応づけてメモリ内に登録するとともに、この登録対象の文字列にハンチャが含まれるとき、前記登録対象の文字列のハンチャをハングルに置き換えた文字列を、前記登録対象の文字列と同一の字母の入力順序に対応づけてメモリ内に登録する登録手段、前記入力受付手段が字母の入力を受け付ける都度、字母の入力順序を蓄積するとともに、蓄積された字母の入力順序に基づき、入力候補となる文字列を前記メモリから抽出する候補抽出手段、前記候補抽出手段により抽出された入力候補の文字列をディスプレイ画面上に表示する表示手段、前記ディスプレイ画面上に表示された入力候補に対する選択操作が行われたとき、選択された入力候補の文字列を前記アプリケーションに出力する第２の出力手段、の各手段としてコンピュータを機能させるためのプログラムが記憶されている。
なおこの発明でいうところの「韓国語」とは、朝鮮語も含むハングル文字により表記される言語のことである。
【０００７】
上記の記録媒体に記録されたプログラムによれば、入力すべき文字列の各構成文字（この場合、ハングル）について、ユーザーが、前記した順序で字母を入力して確定操作を行うと、確定された文字列は、第１の出力手段により稼働中のアプリケーションに出力されるとともに、その文字列の組み立てに必要な字母の入力順序に対応づけられてメモリ内に登録される。また組み立てられたハングル文字列のうちの所定の文字列に対して変換操作が行われた場合には、変換手段により変換操作の対象文字列がハンチャの文字列に変換される。さらに、この変換後の文字列に対する確定操作が行われると、第１の出力手段により変換後の文字列がアプリケーションに出力されるとともに、登録手段により前記変換後の文字列がその文字列の組み立てに必要な字母の入力順序（前記ハングル文字列に対応するものと同様である。）に対応づけられてメモリに登録される。
なお、字母の入力は、主としてキーボードの操作により行われるが、これに限らず、たとえば、図８に示したような字母の一覧をディスプレイ画面上に表示し、この一覧上でマウスなどにより選択された字母を入力するようにしてもよい。
【０００８】
前記候補抽出手段がメモリから入力候補を抽出する処理は、具体的には、字母の入力過程において得られた字母の入力順序と、メモリ内の各文字列に対応づけられた字母の入力順序との間での「前方一致検索」によって行われる。したがってユーザーが、以前に入力した文字列を再度入力するために字母の入力を開始すると、先頭位置のハングルを組み立てる字母を入力した時点で、前回の入力によりメモリに登録された文字列が抽出されてディスプレイ画面上に表示される。さらに表示された文字列を選択することによって文字列の入力を完了することができ、文字列を組み立てる字母をすべて入力しなくとも、所望の文字列を入力することが可能となる。
さらに、この発明では、前記登録手段がハンチャを含む文字列を前記メモリに登録するとき、この登録対象の文字列に加えて、この文字列のハンチャをハングルに置き換えた文字列を、前記登録対象の文字列と同一の字母の入力順序に対応づけて前記メモリに登録するようにしている。したがって１度目はハンチャにより表現した単語を再度入力する場合には、先頭位置のハングルを組み立てる字母の入力により、ハンチャを含む文字列とハングルのみの文字列との両方が抽出されて表示されるので、ユーザーは、所望する方の文字列を選択してアプリケーションに出力させることが可能となる。
【０００９】
好ましい一態様においては、前記登録手段には、前記確定操作の対象文字列に複数の文節が含まれるとき、各文節単位の文字列および隣り合う複数の文節の組合せによる文字列を、前記メモリへの登録対象に設定する手段が含まれる。
この態様によれば、確定された文字列全体が登録されるとともに、その文字列を文節の切れ目で切り分けることにより得られる複数種の文字列が登録されることになる。したがってユーザーが以前に確定した文字列の一部に相当する文字列を入力したり、以前に確定した文字列を先頭の文節をはずした状態で入力しようとしている場合も、その文字列の先頭位置を組み立てる字母を入力することによって、目的とする文字列が抽出され、アプリケーションへの出力を行うことが可能となる。
【００１０】
また他の好ましい一態様では、前記登録手段には、前記確定操作に応じてその操作対象の文字列に対する形態素解析を実行して、前記文字列から自立語を抽出する手段と、前記抽出された自立語に対応する文字列を前記メモリへの登録対象に設定する手段とが含まれる。なお登録対象の自立語を、名詞のような特定の品詞に限定してもよい（ただし複数種の品詞を対象としてもよい。）。また動詞や形容詞などの活用形が抽出された場合は、その語を終止形に置き換えて登録してもよい。
この態様によれば、所定の単語を繰り返し入力するような場合に、助詞などの付属語が別の語に代わっても、その変化にとらわれることなく、単語の部分のみの文字列を高速で呼び出して入力することが可能となる。
【００１２】
さらに他の好ましい一態様にかかる記録媒体には、前記メモリから入力候補として抽出された文字列が選択されてアプリケーションに出力される都度、その選択された文字列の使用履歴を前記メモリに書き込む使用履歴保存手段として前記コンピュータを機能させるためのプログラムが記録される。さらに前記表示手段は、前記候補抽出手段により複数の入力候補が抽出されているとき、各入力候補をそれぞれの使用履歴に基づいて順序づけた後、その順序に応じて各入力候補を一覧表示する。なお、文字列の使用履歴とは、各候補の文字列を選択した頻度を表す情報や、同様の候補の抽出結果に対する前回の選択処理の結果を表す情報などの形態で表される。
上記の態様によれば、複数の入力候補が抽出された場合、各入力候補を選択頻度順に並べて表示したり、前回の処理において選択された入力候補を第１候補として表示するような処理が行われるので、ユーザーは選択すべき入力候補を容易に見つけることができ、候補の選択操作を速やかに行うことができる。
【００１３】
上記の記録媒体に記録されたプログラムをコンピュータに組み込むことにより、コンピュータの制御部に前記各手段の機能が設定されることにより、簡単な操作で韓国語の文字列を入力することが可能になる。
【００１４】
【発明の実施の形態】
図１は、この発明にかかる韓国語入力システム１（以下「韓国語ＩＭＥ１」という。）の構成を示す。
この韓国語ＩＭＥ１は、ウィンドウズ（マイクロソフト株式会社の登録商標）のようなオペレーションシステム（以下「ＯＳ」という。）と、ワープロソフトのようなアプリケーション（以下、「上位アプリケーション」という。）用のソフトウェアが組み込まれたコンピュータにおいて、上位アプリケーションに韓国語による文字列を入力するためのものである。
【００１５】
前記コンピュータは、図２に示すように、ＣＰＵ２１，ＲＯＭ２２，ＲＡＭ２３による制御回路２０を具備するほか、補助用のメモリ装置であるハードディスク装置２４，ＣＤ−ＲＯＭドライブ２５，入出力インターフェイス２６などが組み込まれた構成をとる。このコンピュータのＣＤ−ＲＯＭドライブ２５に、韓国語ＩＭＥ１用のプログラムやデータファイルが記録されたＣＤ−ＲＯＭをセットして、その記録内容をハードディスク装置２４内にインストールすると、以後、ＣＰＵ２１は、各プログラムを実行することにより、図１の各処理部として動作するようになる。
【００１６】
なお入出力インターフェイス２６には、入力用の機器として、各種データを入力するためのキーボードや選択操作のためのマウスなどが接続される。また出力用の機器としては、前記上位アプリケーションの表示画面などを表示するためのディスプレイ装置や、前記ディスプレイ装置上に表示されたデータを印字出力するためのプリンタなどが接続される。
【００１７】
図１に戻って、この実施例の韓国語ＩＭＥ１は、システム全体を統括管理するための制御部２に加え、後記する入力予測機能にかかる処理を統括するための予測処理制御部３を具備する。
また辞書として、ハングル文字列をハンチャの文字列に変換する際に用いられるハングル・ハンチャ変換辞書９や、前記入力予測処理のためのデータを蓄積する予測辞書１６が組み込まれている。なおこれら辞書のうち、ハングル・ハンチャ変換辞書９は、前記ＣＤ−ＲＯＭによるインストール直後からデータを保有する状態で設定されるが、予測辞書１６は、インストール直後は空の状態であり、文字の入力処理によって徐々にデータが蓄積される。
【００１８】
前記制御部２は、入力部４，出力部５，ハングル組立部６，ハングル・ハンチャ変換部７の各処理部を統括管理する。入力部４は、字母の入力操作や、確定，選択，および変換処理の操作を受け付ける機能を有し、出力部５は、制御部２から渡された処理結果を、ＯＳを介して上位アプリケーションや表示制御用のシステム（いずれも図示せず。）に出力するように設定される。
【００１９】
制御部２は、入力部４が字母の入力を受け付ける都度、これをハングル組立部６に渡す。ハングル組立部６は、制御部２より供給された字母を前記した字母の配列規則に従って順に配列することによりハングル文字列を組み立てる。
また組み立てられた所定のハングル文字列に対し変換操作が行われると、制御部２は、変換が指定されたハングル文字列を取り込んで、これをハングル・ハンチャ変換部７に渡す。ハングル・ハンチャ変換部７は、下位のハングル・ハンチャ変換辞書検索部８を動かして、前記ハングル・ハンチャ変換辞書９から前記供給されたハングル文字列に対応するハンチャの文字列を抽出させることにより、前記指定されたハングル文字列をハンチャ文字列に変換する。
【００２０】
制御部２は、ハングル組立部６，ハングル・ハンチャ変換部７による処理が行われる都度、その処理結果を取り込んで出力部５に出力する。また制御部２は、ハングル組立部６やハングル・ハンチャ変換部７による最新の処理結果を保持しており、入力部４が選択操作や確定操作を受け付けると、その選択または確定された文字列（以下、「確定文字列」と総称する。）を出力部５に出力する。
よってディスプレイ画面上には字母の入力に応じて、組立て過程にあるハングル文字列や、変換操作により呼び出されたハンチャ文字列などが未確定表示される。さらに選択操作や確定操作に応じて、前記未確定表示の文字列が確定表示に切り換えられるとともに、上位アプリケーションに出力される。
【００２１】
図３は、上位の処理が行われている際のディスプレイ画面における具体的な表示例を示す。
図３（１）は、一連の字母が入力された直後の状態であって、各入力字母により組み立てられたハングル文字列が、アプリケーションの表示ウィンドウ３０内の文字入力行３１に未確定表示されている。
【００２２】
図３（２）は、このハングル文字列中の一部の文字列を選択して変換操作を行った直後の状態であって、画面上の適所に選択された文字列に対する候補選択ウィンドウ３２が設定されて、その内部にハンチャ，ハングルの各候補が表示されている。また前記文字入力行３１においては、選択されたハングル文字列が第１候補のハンチャ文字列に置き換えられて表示される。
図３（３）は、前記図３（２）の状態に対して確定操作が行われた状態を示すもので、候補選択ウィンドウ３２が消失し、ハンチャを含むハングル文字列が確定表示されている。
【００２３】
図１に戻って、制御部２は、前記入力字母を、前記ハングル組立部６に与えるのと同じタイミングで予測処理制御部３に与えて、入力文字列の予測処理を行わせる。また制御部２は、前記選択操作または確定操作に応じて確定文字列を予測処理制御部３に与え、予測辞書１６への登録処理を行わせる。
【００２４】
予測処理制御部３の制御下には、文節切分け部１０，文字列抽出部１１，ハンチャ・ハングル変換部１２，予測データ検索部１３，予測データ登録部１４，履歴データ更新部１５などの処理部が設定される。
予測処理制御部３は、前記制御部２から渡された入力字母を順次蓄積するとともに、制御部２から確定文字列が与えられたとき、文節切分け部１０，文字列抽出部１１，およびその下位のハンチャ・ハングル変換部１２を用いて、前記確定文字列および蓄積された字母の入力順序から所定数の予測データを作成する。
【００２５】
文節切分け部１０は、半角スペースや読点，コンマなどの区切り記号に基づき、前記確定文字列を文節毎に切り分けるためのもので、文字列抽出部１１は、文節の切分け処理の結果に基づき、個々の文節単位での文字列、および前記確定文字列中で隣合う文節の組合せによる文字列をすべて抽出する。ハンチャ・ハングル変換部１２は、前記文字列抽出部１１により抽出された文字列のうち、ハンチャを含む文字列について、それぞれ前記ハンチャをハングルに置き換えた文字列を作成するためのものである。文字列抽出部１１は、ハンチャ・ハングル変換部１２により作成された文字列を取り込んで、自身が抽出した文字列とともに予測処理制御部３に出力する。
【００２６】
予測処理制御部３は、前記文節切分け部１０，文字列抽出部１１，およびハンチャ・ハングル変換部１２の処理によって抽出された各文字列を、それぞれその文字列を組み立てた各字母を入力順序に沿って並べたデータ（以下「入力構成要素データ」という。）に対応づけたデータを作成し、これを前記予測データとして予測データ登録部１４に出力する。予測データ登録部１４は、与えられた予測データに、文字列の使用頻度を示すデータ（以下「頻度データ」という。）や、文字列が前回選択されたか否かを示すフラグ（以下「前回使用フラグ」という。）などを加味し、予測辞書１６に保存する。
【００２７】
さらに予測処理制御部３は、前記制御部２から入力字母を受け取って蓄積する都度、その時点までに蓄積された字母の入力順序を前記予測データ検索部１３に与える。予測データ検索部１３は、与えられた字母の入力順序により予測辞書１６の各入力構成要素データに対する前方一致検索を実行する。
予測処理制御部３は、上記の検索により抽出された予測データを取り込んで制御部２に出力する。制御部２は、各予測データに含まれる文字列を、入力候補として出力部５に渡すことにより、ディスプレイ画面上に各入力候補を一覧表示させる。
【００２８】
この一覧表示に対する選択操作が行われると、制御部２は、その選択された文字列を確定文字列と認定して出力部５に出力する。これにより字母の入力の途中で文字列の入力処理が完了することになる。
【００２９】
さらに制御部２は、前記入力候補の選択結果を予測処理制御部３に与える。予測処理制御部３は、前記履歴データ更新部１５を動かして、前記選択された入力候補に対応する予測データの頻度データを書き換えるとともに、この予測データの前回使用フラグをオン設定する。
【００３０】
図４は、前記図３に示した文字列の入力処理に伴って登録される予測データの例を示す。図示例では、前記文節切分け部１０，文字列抽出部１１，ハンチャ・ハングル変換部１２により確定文字列から１０種類の文字列が抽出され、それぞれの文字列に前記入力構成要素データや頻度データなどを対応づけた予測データが設定されている。
なお、この図４は予測辞書１６に登録された直後の予測データの構成を示すもので、各予測データの頻度データはいずれも「１」に設定されている。（ただし、確定文字列そのものによる第１番目の予測データのみ頻度データを「１」とし、残りの予測データの頻度データは「０」としてもよい。）
【００３１】
さらに図４における各予測データには、前記した前回使用フラグのほか、予測データの種類を示すフラグデータとして、組立て予測フラグやハングル文字予測フラグが設定されている。組立て予測フラグは、文節の切分け処理や文節の組合せ処理によって作成された予測データであることを示し、ハングル文字予測フラグは前記ハンチャを含む文字列から派生した予測データであることを示す。なお、いずれのフラグも、図中、○が付いているものがオン状態を示す。
前回使用フラグや頻度データは、前記予測辞書１６に対する検索処理においてこれら予測データが抽出されたとき、各データを一覧表示する順序を決めるために用いられる。また組立て予測フラグやハングル文字予測フラグは、ユーザの設定によって予測辞書１６からの抽出対象が限定された場合に、抽出対象となるデータを判別するためなどに用いられる。
【００３２】
図５は、上記韓国語ＩＭＥ１による文字列入力処理の一連の手順を示す。
前記したように、この韓国語ＩＭＥ１がコンピュータ内に組み込まれた直後は、前記予測辞書１６は「空」の状態である。この状態下で最初の字母入力が行われた場合、ＳＴ１からＳＴ２に進み、前記入力字母を第１番目の字母として、ＲＡＭ２３などの一時保存用メモリに格納した後、ＳＴ３のハングル組立て処理を実行する。
【００３３】
ＳＴ３では、前記第１番目の入力字母を、第１番目のハングルの初声子音として設定する。ついでＳＴ４では、予測辞書１６の各予測データに対し、前記第１の入力字母による入力構成要素データへの前方一致検索を実行するが、前記したように初期状態の予測辞書１６は空の状態であるから、候補はなんら抽出されずに、ＳＴ５の判定が「ＮＯ」となる。
以下、所定数の字母が入力されるまで、ＳＴ６，８，１６の各判定がいずれも「ＮＯ」となってＳＴ１に戻り、字母の入力に応じて一時保存用メモリに字母の入力順序を蓄積するＳＴ２の処理と、ＳＴ３のハングル組立て処理とが繰り返される。なおＳＴ３の詳細な手順については、従来と同様であるので、ここでは説明を省略する。
【００３４】
所定数のハングルが組み立てられた時点で、その一部またはすべてのハングル文字列に対する変換操作が行われると、ＳＴ６が「ＹＥＳ」となってＳＴ７に進み、選択されたハングル文字列をハンチャ文字列に変換する処理を実行する。
また所定の時点で入力文字列に対する確定操作が行われると、ＳＴ８が「ＹＥＳ」となり、ＳＴ９で、前記確定文字列を上位アプリケーションに出力した後、ＳＴ１０の登録処理へと進む。
なおＳＴ９の処理は、入力文字列の一部に対する確定操作を受けて行ってもよいが、ＳＴ１０については、つぎに述べる詳細な手順から考えると、文字列全体が確定した時点で実行するのが望ましい。
【００３５】
ここで図６を用いて前記ＳＴ１０の登録処理の詳細を説明する。
まず最初のＳＴ１０−１では、前記一時保存用メモリに蓄積された字母の入力順序を、つぎのＳＴ１０−２では、前記確定操作により確定された文字列を、それぞれ取り込む。続くＳＴ１０−３では、前記確定文字列全体による予測データ（前記図４の第１番目の予測データに相当する。）を作成し、前記予測辞書１６に登録する。
なおここで登録される予測データについては、前回使用フラグがオン設定される。
【００３６】
つぎにＳＴ１０−４では、前記確定文字列を文節毎に切り分ける処理を実行し、ＳＴ１０−５では、前記確定文字列より文節単位および隣り合う文節の組合せによる各文字列を抽出する。さらにＳＴ１０−６では、前記抽出された各文字列につき、それぞれ予測データを作成して予測辞書１６に登録する。
なお文節単位および隣り合う文節の組合せによる予測データについては、組立て予測フラグがオン設定される。
【００３７】
つぎにＳＴ１０−７では、これまでに作成された予測データの中にハンチャを含むデータがあるか否かをチェックする。この判定が「ＹＥＳ」のときは、ＳＴ１０−８に進んで、該当する予測データにつき、ハンチャをハングルに置き換えた文字列による予測データを作成し、元のデータとは別のデータとして予測辞書１６に登録する。
なおここで作成される予測データについては、ハングル文字予測フラグがオン設定される。
【００３８】
図５に戻って、上記したように、字母の入力および文字列の確定処理に応じて、順次、予測データが作成されて予測辞書１６に登録され、所定の時点で以前に入力したのと同様の文字列を入力するために最初の字母の入力が行われると、この入力に対する予測辞書１６の検索処理によりＳＴ５の判定が「ＹＥＳ」となって、ＳＴ１１に進む。
【００３９】
ＳＴ１１では、抽出された各予測データ内のハングル・ハンチャ文字列が入力候補として一覧表示される。なおこの一覧表示においては、前記前回使用フラグがオン設定されたものを筆頭とし、以下、頻度データの高い順に表示する。この一覧表示に対して所定の候補を選択する操作が行われると、ＳＴ１２からＳＴ１３に進み、前記選択された候補の文字列を確定文字列として上位アプリケーションに出力する。
【００４０】
さらにつぎのＳＴ１４では、前記選択された候補に対応する予測データの頻度データを更新する。またここでは図示しないが、この予測データの前回使用フラグがオンになっていない場合は、これをオン設定するとともに、他の選択されなかった予測データにおいてオン設定されていた前回使用フラグをオフにする。
【００４１】
このように字母入力により組み立てられた文字列に対する確定操作、または予測辞書１６から抽出された入力候補を選択する操作のいずれかにより文字列が確定され、上位アプリケーションに出力されると、ＳＴ１５に進んで、前記メモリに蓄積されていた字母の入力順序をクリアする。
以下、ＳＴ１６で韓国語ＩＭＥ１の終了が指示されるまで上記の手順が繰り返される。
【００４２】
なおＳＴ１１で表示された候補が選択されずに、引き続き字母の入力が行われた場合は、ＳＴ１２，６，８，１６の順に「ＮＯ」判定が行われた後にＳＴ１が「ＹＥＳ」となってＳＴ２に戻る。これにより、新たなハングル組立て処理が行われるとともに、再度の検索処理が行われて、入力候補が絞り込まれて表示される。ただし入力すべき文字列が未登録である場合は、この手順で字母の入力が進められることにより、入力候補の表示が消失する。
【００４３】
図７は、前記図３と同様の文字列入力処理を再度実行する場合の表示例を示す。図７（１）は、第１番目の字母を入力した直後の状態であって、前記図３の処理により登録された予測データのうち、入力構成要素データの１番目の字母が前記入力字母と一致する５つのデータが抽出され、所定位置に設定された候補表示ウィンドウ３３内に一覧表示される。
なおこの図示例では、前記図４のデータ構成に対応させて、前回使用フラグがオン設定された前回の確定文字列を先頭に位置させている。
【００４４】
図７（２）は、前記候補表示ウィンドウ３３に対する選択操作が行われた状態を、図７（３）は、選択終了後の表示状態を、それぞれ示す。図示例では、前回使用フラグのセットされた１番目の候補に代えて、この文字列のハンチャをハングルに置き換えた２番目の候補が選択されており、この結果、アプリケーションには、ハングルのみの文字列が入力される。
【００４５】
このように上記の韓国語ＩＭＥ１によれば、通常の文字列の入力処理において、１つの確定文字列から数種類の予測データが作成されて予測辞書１６に登録される上、ハンチャを含む文字列については、ハンチャをハングルに置き換えた予測データも登録されるので、前回入力した文字列の一部を入力しようとする場合や、前回ハンチャにより入力した文字列をハングルで入力しようとする場合にも、ユーザーの目的とする文字列を入力候補として表示することができ、文字列を簡単かつ正確に入力することができる。また確定文字列を文節毎に区切って、文節単位の文字列による予測データや、隣合う文節の組合せの文字列による予測データを登録するので、過去に入力されていない文字列を入力する場合にも、目的とする文字列またはその文字列に構成の似た文字列を提示して、ユーザの文字入力操作を簡単化することができる。
【００４６】
ただしユーザーによっては、上記のような設定を嫌い、ハングルのみの入力候補を抽出させたり、前回入力した文字列のみを抽出するように求めるケースもある。この実施例では、前記したように、組立て予測フラグやハングル文字予測フラグに基づいて検索対象を限定できるので、ユーザーの要望に対応する検索条件を設定でき、また適宜、検索条件を変更することが可能である。
【００４７】
なお上記実施例では、確定文字列を文節毎に切り分けて予測データを作成しているが、これに代えて、確定文字列について形態素解析を行い、その解析結果から名詞，形容詞，動詞など所定の品詞にかかる文字列を切り出して予測データを作成してもよい。
たとえば名詞に相当する単語を切り出して予測データとすれば、この名詞に続く助詞などの付属語が他の語に代わった文字列が入力されても、名詞の部分の文字列を速やかに予測辞書１６から呼び出して入力することが可能となるなど、よく使用する単語をその付属語にとらわれずに迅速に入力することが可能となる。なお、この形態素解析を用いて予測データを作成する場合においても、確定文字列全体については予測データとして登録するとよい。
【００４８】
さらに上記実施例では、インストール直後の予測辞書１６を空の状態に設定したが、著名人の名前，主要な地名など、入力される可能性の高い単語を表す文字列について、あらかじめ予測データを作成し、初期状態の予測辞書１６に格納するようにしてもよい。
【００４９】
【発明の効果】
上記したようにこの発明では、稼働中のアプリケーションに韓国語による文字列を入力する場合に、確定されて前記アプリケーションに出力された文字列を、その文字列の組み立てに必要な字母の入力順序に対応づけて登録してゆき、その登録データによって字母の入力過程における入力候補を予測して、選択可能に表示するようにしたから、韓国語の入力に不慣れなユーザーでも簡単な操作によって、迅速かつ正確に文字列を入力することができる。加えてこの発明では、１度目にハンチャにより表現した単語を再度入力する場合には、先頭位置のハングルを組み立てる字母を入力することによって、ハンチャを含む文字列のみならず、ハングルのみの文字列を入力することもできるので、ハングル文字列の入力のために、再度、同様の字母の入力操作を行う必要がなくなり、韓国語文字列の入力操作をより簡単にすることが可能になる。
またこの発明では、韓国語の入力システム用の記録媒体に、上記の入力予測機能をコンピュータに与えるためのプログラムを組み込むことにより、操作が簡易化され、かつ迅速な文字入力が可能な韓国語入力システム用のパッケージソフトを提供することができる。
【図面の簡単な説明】
【図１】この発明にかかる韓国語ＩＭＥの構成を示す機能ブロック図である。
【図２】図１の韓国語ＩＭＥが組み込まれるコンピュータのハードブロック図である。
【図３】文字入力処理の一例を示す説明図である。
【図４】図３の文字入力処理により作成される予測データの構成を示す説明図である。
【図５】韓国語ＩＭＥによる文字入力処理の手順を示すフローチャートである。
【図６】登録処理の詳細な手順を示すフローチャートである。
【図７】入力予測機能を用いて文字入力を行う例を示す説明図である。
【図８】字母の種類を表す一覧表である。
【符号の説明】
１韓国語ＩＭＥ
２制御部
３予測処理制御部
４入力部
５出力部
６ハングル組立部
７ハングル・ハンチャ変換部
１０文節切分け部
１１文字列抽出部
１２ハンチャ・ハングル変換部
１３予測データ検索部
１４予測データ登録部
１５履歴データ更新部
１６予測辞書
２１ＣＰＵ
２４ハードディスク装置
２５ＣＤ−ＲＯＭドライブ[0001]
BACKGROUND OF THE INVENTION
The present invention relates to a technology for inputting characters in Korean in a computer, such as a word processor or spreadsheet, and particularly, a method for supporting character input processing in Korean, and Korea to which the method is applied. It relates to a character input system for word input ("Input Method Editor" / hereinafter referred to as "IME").
[0002]
[Prior art]
Korean characters used in Korean (hereinafter simply referred to as “Hangul”) are expressed by combining phonetic symbols called “moji” for each syllable.
This combination includes two combinations of “consonant + vowel” and “consonant + vowel + consonant”. The first consonant is called “initial consonant”, the last consonant is called “final consonant”, and the vowel is called “middle vowel”, and a plurality of types of letters are assigned as shown in FIG.
[0003]
In a computer having an input function of a Korean character string, each of the above letters is assigned to a predetermined key on the keyboard. When entering characters using a Hangul string, the computer accepts the input of the first “consonant consonant” and then the “middle vowel” in order, and further accepts the input of the “final consonant” as needed, and the Korean one by one. assemble. When a confirmation operation is performed at a predetermined timing, a character string based on each Hangul assembled so far is output to the application side as a confirmed character string. In addition, when conversion processing is performed on a Korean character string representing a word to be emphasized such as a place name or a person name, the computer converts the Korean character string into a character string of Hancha (kanji) and then performs the same confirmation operation. Output to the application side.
[0004]
[Problems to be solved by the invention]
In the above method, 2-3 characters must be input for each constituent character of the Korean character string. In addition, in a standard keyboard (such as a 106 keyboard), a plurality of alphabets are assigned to each alphabet key. By this, to input one alphabet, an operation of pressing the same key several times or a control key The operation of typing keys in combination with other keys is required, and the operation required for inputting Korean characters becomes more complicated. In addition, it is extremely difficult for beginners unfamiliar with the operation to recognize the assigned key of the character, and there are problems that input errors frequently occur and character input efficiency is remarkably deteriorated.
[0005]
  The present invention has been made paying attention to the above problems, and an object of the present invention is to enable easy and high-speed input processing of Korean character strings without performing the above-described complicated operations.
  Furthermore, when inputting the word expressed by the huncher again, the present invention enables the input of either the character string including the hancher or the character string of only the hangul by inputting the character that assembles the hangul at the head position. The purpose is to make Korean character string input operations easier.
[0006]
[Means for Solving the Problems]
  The program storage medium for Korean input according to the present invention is an input receiving means for receiving input of characters,Assemble the Korean character string based on the input order of the characters received by the input receiving meansAssembly means,In a predetermined Korean character string assembled by the assembling meansConversionWhen an operation is performed,That operationConvert target Hangul string to Hancha stringConversion means,A character string including the Korean character string assembled by the assembling means or the character string of the hunter converted by the converting means.Confirmed againstWhen an operation is performed,That operationThe target stringPredeterminedOutput to applicationFirst output means, the confirmationDepending on the operation, the character string to be operated is changed to the input order of the characters necessary to assemble the character string.Corresponding memoryRegister withinIn addition, when this registration target character string includes a hunter, the character string obtained by replacing the hunter of the registration target character string with Hangul is associated with the input order of the same letter as the registration target character string. Registration means to register within,The input receiving means receives a character inputIn each caseThe input order is stored, and a character string as an input candidate is extracted from the memory based on the input order of the stored characters.Candidate extraction means, candidate extraction meansThe input candidate character string extracted byOn the display screenindicateDisplay means, said displayInput candidates displayed on the screenWhen a selection operation is performed on the selected input candidateOutput a string to the applicationA program for causing the computer to function as each means of the second output means is stored.
  In the present invention, “Korean” is a language written in Korean characters including Korean.
[0007]
  aboveProgram recorded on a recording mediumAccording to the above, for each constituent character of the character string to be input (in this case, Hangul), when the user performs a confirming operation by inputting the characters in the order described above, the confirmed character string isBy the first output meansIn addition to being output to a running application, it is registered in the memory in association with the input order of characters required for assembling the character string. In addition, when a conversion operation is performed on a predetermined character string among the assembled Korean character strings,By conversion meansThe target character string for the conversion operation is converted into a character string for the huncher. Furthermore, when a confirmation operation is performed on the converted character string,By the first output meansThe converted string is output to the application,The converted character string is registered by the registration means.The characters are registered in the memory in association with the input order of the characters necessary for assembling the character string (the same as that corresponding to the Korean character string).
  Note that the input of characters is mainly performed by operating the keyboard. However, the present invention is not limited to this. For example, a list of characters as shown in FIG. 8 is displayed on the display screen, and is selected by a mouse or the like on the list. You may make it input a letter.
[0008]
  The candidate extracting meansSpecifically, the process of extracting input candidates from the memory is performed between the input order of the character obtained in the input process of the character and the input order of the character corresponding to each character string in the memory. This is performed by “front matching search”. Therefore, when the user starts to input a character to input again the previously input character string, the character string registered in the memory is extracted by the previous input when the character that assembles the Hangul at the head position is input. Displayed on the display screen. Furthermore, by selecting the displayed character string, it is possible to complete the input of the character string, and it is possible to input a desired character string without inputting all the characters that compose the character string.
  Furthermore, in this invention,The registration meansWhen a character string including a huncher is registered in the memory, in addition to the character string to be registered, a character string obtained by replacing the character string hancher with Hangul is input in the same alphabet as the character string to be registered. Are registered in the memory in association with each other. Therefore, when the word expressed by the huncher is input again for the first time, both the character string including the hancher and the character string containing only the hangul are extracted and displayed by inputting the character for assembling the hangul at the head position. The user can select a desired character string and output it to the application.
[0009]
  In a preferred embodiment, the aboveThe registration means includes the confirmation operationThe target string of contains multiple clausesWhenSet a character string for each phrase and a combination of a plurality of adjacent phrases to be registered in the memory.Includes means to.
  thisAspectAccording to the above, the entire confirmed character string is registered, and a plurality of types of character strings obtained by dividing the character string at the breaks of clauses are registered. Therefore, even if the user enters a character string that corresponds to a part of a previously confirmed character string, or if the user tries to enter a character string that has been confirmed previously without the first clause, the first position of the character string By inputting the character to assemble the target character string, the target character string can be extracted and output to the application.
[0010]
  In another preferred embodiment, the above-mentionedThe registration means includes the confirmation operationAccording to thatOperation targetPerform morphological analysis on the character string, and the character stringMeans for extracting independent words from the extractedSet a character string corresponding to an independent word to be registered in the memoryMeans to. Even if the independent words to be registered are limited to specific parts of speech such as nounsGood(However, multiple types of parts of speech may be targeted.).In addition, when a usage form such as a verb or adjective is extracted, the word may be replaced with a final form and registered.
  thisAspectAccording to the above, when a predetermined word is repeatedly input, even if an auxiliary word such as a particle is replaced with another word, the character string of only the word part is called and input at high speed without being caught by the change. It becomes possible to do.
[0012]
  In yet another preferred embodimentSuch recording media includeA character string extracted as an input candidate from the memory is selected and output to the applicationEach time, the choiceWrite the usage history of the specified string to the memoryA program for causing the computer to function as usage history storage means is recorded. Further, the display means has a plurality of input candidates extracted by the candidate extraction means.When each input candidateAre ordered based on their respective usage histories, and a list of input candidates is displayed according to the order. The character string use history is expressed in the form of information indicating the frequency of selecting each candidate character string, or information indicating the result of the previous selection process for the extraction result of similar candidates.
  aboveAspectAccording to the above, when a plurality of input candidates are extracted, the respective input candidates are arranged and displayed in the order of selection frequency, or the input candidates selected in the previous process are displayed as the first candidates. The user can easily find input candidates to be selected, and can quickly select candidates.
[0013]
  Recorded on the above recording mediumBy incorporating the program into the computer, the function of each means is set in the control unit of the computer.This makes it possible to input a Korean character string with a simple operation.
[0014]
DETAILED DESCRIPTION OF THE INVENTION
FIG. 1 shows a configuration of a Korean input system 1 (hereinafter referred to as “Korean IME 1”) according to the present invention.
The Korean IME 1 includes software for an operation system (hereinafter referred to as “OS”) such as Windows (registered trademark of Microsoft Corporation) and an application such as word processing software (hereinafter referred to as “upper application”). In the built-in computer, a character string in Korean is input to the upper application.
[0015]
As shown in FIG. 2, the computer includes a control circuit 20 including a CPU 21, a ROM 22, and a RAM 23, as well as a hard disk device 24, a CD-ROM drive 25, an input / output interface 26, and the like as auxiliary memory devices. Take the configuration. When a CD-ROM in which a program or data file for Korean IME 1 is recorded in the CD-ROM drive 25 of this computer and the recorded contents are installed in the hard disk device 24, the CPU 21 thereafter executes each program. As a result of the operation, each processing unit in FIG. 1 operates.
[0016]
The input / output interface 26 is connected to a keyboard for inputting various data, a mouse for selection operation, and the like as input devices. The output device is connected to a display device for displaying the display screen of the upper application, a printer for printing out data displayed on the display device, and the like.
[0017]
Returning to FIG. 1, the Korean IME 1 of this embodiment includes a prediction processing control unit 3 for controlling processing related to an input prediction function described later, in addition to a control unit 2 for managing the entire system. .
In addition, as a dictionary, a Hangul / hunter conversion dictionary 9 used when converting a Korean character string into a Hancher character string and a prediction dictionary 16 for storing data for the input prediction processing are incorporated. Of these dictionaries, the Hangul / Hancha conversion dictionary 9 is set in a state in which data is retained immediately after installation by the CD-ROM, while the prediction dictionary 16 is in an empty state immediately after installation and is used for character input. Data is gradually accumulated by the processing.
[0018]
The control unit 2 comprehensively manages the processing units of the input unit 4, the output unit 5, the Hangul assembly unit 6, and the Hangul / hunter conversion unit 7. The input unit 4 has a function of accepting a character input operation and a confirmation, selection, and conversion processing operation, and the output unit 5 receives the processing result passed from the control unit 2 via the OS as a higher-level application or It is set to output to a display control system (both not shown).
[0019]
Whenever the input unit 4 receives the input of the character, the control unit 2 passes it to the Hangul assembly unit 6. The Hangul assembling unit 6 assembles the Hangul character string by sequentially arranging the characters supplied from the control unit 2 in accordance with the above-described rules for arranging the characters.
Further, when a conversion operation is performed on the assembled predetermined Hangul character string, the control unit 2 takes in the Hangul character string for which conversion is designated and passes it to the Hangul / hunter conversion unit 7. The Hangul / Hancher conversion unit 7 moves the lower Hangul / Hancher conversion dictionary search unit 8 to extract a Hancher character string corresponding to the supplied Hangul character string from the Hangul / Hancher conversion dictionary 9. The designated Korean character string is converted into a huncher character string.
[0020]
The control unit 2 takes in the processing result and outputs it to the output unit 5 every time the processing by the Hangul assembly unit 6 and the Hangul / hunter conversion unit 7 is performed. The control unit 2 holds the latest processing result by the Hangul assembly unit 6 and the Hangul / hunter conversion unit 7. When the input unit 4 accepts a selection operation or a confirmation operation, the selected or confirmed character string ( Hereinafter, it is collectively referred to as “determined character string”).
Therefore, the Hangeul character string that is in the process of assembly or the hunter character string that is called by the conversion operation is displayed indefinitely on the display screen according to the input of the character. Further, in accordance with a selection operation or a confirmation operation, the character string of the undetermined display is switched to the confirmed display and is output to the upper application.
[0021]
FIG. 3 shows a specific display example on the display screen when the upper processing is being performed.
FIG. 3 (1) shows a state immediately after a series of characters is input, and the Hangul character string assembled by each input character is displayed indefinitely on the character input line 31 in the display window 30 of the application. Yes.
[0022]
FIG. 3 (2) shows a state immediately after a part of the Korean character string is selected and a conversion operation is performed, and a candidate selection window 32 for the character string selected at an appropriate position on the screen is displayed. It is set, and each candidate of Hancha and Hangul is displayed inside. In the character input line 31, the selected Korean character string is replaced with the first candidate hunter character string and displayed.
FIG. 3 (3) shows a state in which the confirmation operation has been performed on the state of FIG. 3 (2). The candidate selection window 32 disappears and the Hangul character string including the huncher is confirmed and displayed. .
[0023]
Returning to FIG. 1, the control unit 2 gives the input character string to the prediction processing control unit 3 at the same timing as that given to the Hangul assembly unit 6 to cause the input character string to be predicted. In addition, the control unit 2 gives a confirmed character string to the prediction processing control unit 3 in accordance with the selection operation or the confirmation operation, and causes the prediction processing control unit 3 to perform registration processing.
[0024]
Under the control of the prediction processing control unit 3, processing such as a phrase segmentation unit 10, a character string extraction unit 11, a huncher / hangul conversion unit 12, a prediction data search unit 13, a prediction data registration unit 14, and a history data update unit 15. Set.
The prediction processing control unit 3 sequentially accumulates input characters passed from the control unit 2 and, when a confirmed character string is given from the control unit 2, a phrase segmentation unit 10, a character string extraction unit 11, and its A predetermined number of pieces of prediction data are created from the input order of the fixed character string and the accumulated characters using the lower-order Hancha-Hangul conversion unit 12.
[0025]
The phrase segmentation unit 10 is for segmenting the determined character string for each phrase based on delimiters such as half-width spaces, punctuation marks, and commas. The character string extraction unit 11 is based on the result of the phrase segmentation process. Then, all the character strings in the unit of individual clauses and the character strings based on the combination of adjacent clauses in the fixed character string are extracted. The huncher-Hangul conversion unit 12 is for creating a character string in which the hancher is replaced with Hangul for the character string including the hunter among the character strings extracted by the character string extraction unit 11. The character string extraction unit 11 takes in the character string created by the Hancha-Hangul conversion unit 12 and outputs it to the prediction processing control unit 3 together with the character string extracted by itself.
[0026]
The prediction processing control unit 3 inputs each character string extracted by the processes of the phrase segmentation unit 10, the character string extraction unit 11, and the Hancher-Hangul conversion unit 12, and inputs the characters that assemble the character strings. Data corresponding to the data arranged along the line (hereinafter referred to as “input component data”) is generated and output to the prediction data registration unit 14 as the prediction data. The prediction data registration unit 14 includes, in the given prediction data, data indicating the frequency of use of the character string (hereinafter referred to as “frequency data”), and a flag indicating whether the character string was previously selected (hereinafter referred to as “previous use”). The flag is called “flag”) and stored in the prediction dictionary 16.
[0027]
Further, each time the prediction processing control unit 3 receives and accumulates input characters from the control unit 2, the prediction processing control unit 3 gives the input order of the characters accumulated up to that point to the prediction data search unit 13. The prediction data search unit 13 performs a forward matching search for each input component data in the prediction dictionary 16 according to the input order of the given letters.
The prediction process control unit 3 takes in the prediction data extracted by the above search and outputs it to the control unit 2. The control unit 2 displays each input candidate in a list on the display screen by passing the character string included in each prediction data to the output unit 5 as an input candidate.
[0028]
When the selection operation for the list display is performed, the control unit 2 recognizes the selected character string as a confirmed character string and outputs it to the output unit 5. Thus, the character string input process is completed in the middle of the character input.
[0029]
Furthermore, the control unit 2 gives the selection result of the input candidate to the prediction processing control unit 3. The prediction process control unit 3 moves the history data update unit 15 to rewrite the frequency data of the prediction data corresponding to the selected input candidate, and turns on the previous use flag of the prediction data.
[0030]
FIG. 4 shows an example of prediction data registered along with the character string input process shown in FIG. In the illustrated example, ten types of character strings are extracted from the confirmed character string by the phrase segmentation unit 10, the character string extraction unit 11, and the Hancher-Hangul conversion unit 12, and the input component data and frequency data are respectively extracted from the character strings. Prediction data that associates these is set.
FIG. 4 shows the structure of the prediction data immediately after being registered in the prediction dictionary 16, and the frequency data of each prediction data is set to “1”. (However, the frequency data of only the first prediction data based on the confirmed character string itself may be “1”, and the frequency data of the remaining prediction data may be “0”.)
[0031]
  Further, in each prediction data in FIG. 4, an assembly prediction flag and a Hangul character prediction flag are set as flag data indicating the type of prediction data in addition to the previous use flag. The assembly prediction flag indicates that the data is prediction data created by clause segmentation processing or clause combination processing, and the Hangul character prediction flag is prediction data derived from a character string including the huncher.soIndicates that there is. In addition, in any of the flags, those with a circle in the figure indicate an on state.
  The previous use flag and frequency data are used to determine the order in which each data is displayed in a list when the prediction data is extracted in the search process for the prediction dictionary 16. The assembly prediction flag and the Hangul character prediction flag are used to determine data to be extracted when the extraction target from the prediction dictionary 16 is limited by user settings.
[0032]
FIG. 5 shows a series of procedures for character string input processing by the Korean IME1.
As described above, immediately after the Korean IME 1 is incorporated in the computer, the prediction dictionary 16 is in an “empty” state. When the first character input is performed in this state, the process proceeds from ST1 to ST2, and the input character is used as the first character and stored in the temporary storage memory such as the RAM 23, and then the Hangeul assembly process of ST3 is executed. To do.
[0033]
In ST3, the first input alphabet is set as the first consonant of the first Hangul. Next, in ST4, a forward matching search is performed for each piece of prediction data in the prediction dictionary 16 with respect to the input component data by the first input alphabet. As described above, the prediction dictionary 16 in the initial state is empty. Therefore, no candidate is extracted and the determination in ST5 is “NO”.
Thereafter, until the predetermined number of characters are input, the determinations of ST6, 8, and 16 are all “NO” and the process returns to ST1, and the input order of the characters is stored in the temporary storage memory according to the input of the characters. The process of ST2 to be performed and the Hangul assembly process of ST3 are repeated. Note that the detailed procedure of ST3 is the same as that of the prior art, and thus the description thereof is omitted here.
[0034]
When a conversion operation is performed on a part or all of the Korean character string when a predetermined number of Korean characters are assembled, ST6 becomes “YES” and the process proceeds to ST7, and the selected Korean character string is converted to the Hancher character string. Execute the process of converting to.
Further, when a confirmation operation is performed on the input character string at a predetermined time, ST8 becomes “YES”, and in ST9, the confirmed character string is output to the upper application, and then the process proceeds to the registration process of ST10.
The process of ST9 may be performed in response to a confirmation operation for a part of the input character string. However, ST10 is executed when the entire character string is confirmed in view of the detailed procedure described below. desirable.
[0035]
Details of the registration process of ST10 will be described with reference to FIG.
First, in ST10-1, the input order of characters stored in the temporary storage memory is fetched, and in ST10-2, the character string confirmed by the confirming operation is fetched. In subsequent ST 10-3, prediction data (corresponding to the first prediction data in FIG. 4) based on the entire confirmed character string is created and registered in the prediction dictionary 16.
For the prediction data registered here, the previous use flag is set on.
[0036]
Next, in ST10-4, a process of segmenting the determined character string for each phrase is executed. In ST10-5, each character string is extracted from the determined character string by a combination of phrase units and adjacent phrases. Further, in ST10-6, prediction data is created and registered in the prediction dictionary 16 for each extracted character string.
Note that the assembly prediction flag is set on for prediction data based on a combination of phrase units and adjacent phrases.
[0037]
Next, in ST10-7, it is checked whether or not there is data including a hunter in the prediction data created so far. When this determination is “YES”, the process proceeds to ST10-8, and for the corresponding prediction data, prediction data based on a character string in which the hancher is replaced with Hangul is created, and the prediction dictionary 16 is used as data different from the original data. Register with.
For the prediction data created here, the Hangul character prediction flag is set on.
[0038]
Returning to FIG. 5, as described above, prediction data is sequentially created and registered in the prediction dictionary 16 in accordance with the character input and character string confirmation processing, and is the same as previously input at a predetermined time. When the first letter is input to input the character string, ST5 is determined as “YES” by the search process of the prediction dictionary 16 for this input, and the process proceeds to ST11.
[0039]
  In ST11, each extracted prediction dataHangul Hancha string inAre listed as input candidates. In this list display, the previous use flag is set to the top, and the list is displayed in order of frequency data. When an operation for selecting a predetermined candidate is performed on the list display, the process proceeds from ST12 to ST13, and the selected candidate character string is output as a confirmed character string to the upper application.
[0040]
In the next ST14, the frequency data of the prediction data corresponding to the selected candidate is updated. Although not shown here, if the previous use flag of this prediction data is not turned on, it is turned on, and the previous use flag that was turned on in other unselected prediction data is turned off. To do.
[0041]
When the character string is confirmed by either the confirmation operation for the character string assembled by the character input or the operation for selecting the input candidate extracted from the prediction dictionary 16 and output to the upper application, the process proceeds to ST15. Then, the input order of the characters stored in the memory is cleared.
Thereafter, the above procedure is repeated until the end of Korean IME1 is instructed in ST16.
[0042]
If the candidate displayed in ST11 is not selected and characters are continuously input, “NO” determination is made in the order of ST12, 6, 8, 16 and then ST1 becomes “YES”. Return to ST2. As a result, a new Hangul assembly process is performed, a search process is performed again, and input candidates are narrowed down and displayed. However, when the character string to be input is not registered, the input candidate display disappears by proceeding with the input of the character string in this procedure.
[0043]
FIG. 7 shows a display example when the character string input process similar to FIG. 3 is executed again. FIG. 7 (1) shows a state immediately after inputting the first character, and among the prediction data registered by the processing of FIG. 3, the first character of the input component data is the input character. Five matching data are extracted and displayed in a list in the candidate display window 33 set at a predetermined position.
In the illustrated example, the previous determined character string in which the previous use flag is set to ON is positioned at the head in correspondence with the data configuration of FIG.
[0044]
FIG. 7 (2) shows a state where a selection operation is performed on the candidate display window 33, and FIG. 7 (3) shows a display state after the selection is completed. In the illustrated example, instead of the first candidate for which the previous use flag has been set, the second candidate in which the character string hunter is replaced with Hangul is selected. As a result, the application uses only Korean characters. A column is entered.
[0045]
As described above, according to the above-mentioned Korean IME1, in a normal character string input process, several types of prediction data are created from one fixed character string and registered in the prediction dictionary 16, and a character string including a huncher is also obtained. Will also register prediction data that replaces the hancher with Hangul, so if you are trying to enter part of the last string you entered or if you are trying to enter the last string you entered with hangha, The user's target character string can be displayed as an input candidate, and the character string can be input easily and accurately. In addition, when you enter a character string that has not been input in the past, you can separate the fixed character string into phrases and register prediction data based on character strings in phrase units or prediction data based on character strings that are combinations of adjacent phrases. However, it is possible to simplify the user's character input operation by presenting a target character string or a character string having a similar structure to the character string.
[0046]
However, some users dislike the above-described setting and may require that only Korean input candidates be extracted or only the previously input character string be extracted. In this embodiment, as described above, since the search target can be limited based on the assembly prediction flag and the Hangul character prediction flag, the search condition corresponding to the user's request can be set, and the search condition can be changed as appropriate. Is possible.
[0047]
In the above embodiment, the prediction data is created by dividing the confirmed character string into phrases, but instead, a morphological analysis is performed on the confirmed character string, and a predetermined noun, adjective, verb, etc. The prediction data may be created by cutting out a character string related to the part of speech.
For example, if a word corresponding to a noun is cut out and used as prediction data, even if a character string in which an auxiliary word such as a particle following this noun is replaced with another word is input, the character string of the noun part is quickly predicted. For example, it is possible to call up from 16 and input frequently used words quickly without being bound by the attached words. Even when predictive data is created using this morphological analysis, the entire confirmed character string may be registered as predictive data.
[0048]
Furthermore, in the above embodiment, the prediction dictionary 16 immediately after installation is set to an empty state, but prediction data is created in advance for character strings representing words that are likely to be input, such as names of celebrities and main place names. Alternatively, it may be stored in the prediction dictionary 16 in the initial state.
[0049]
【The invention's effect】
  As described above, in the present invention,In operationWhen entering a Korean string into the application,The character string that is confirmed and output to the application is converted to the character string required for assembling the character string.The input candidates are registered in association with the input order of the characters, and input candidates in the character input process are predicted based on the registered data and displayed so that they can be selected. Can quickly and accurately input a character string.In addition, in the present invention, when the word expressed by the hancher is input again for the first time, not only the character string including the hancher but also the character string including only the hancher is input by inputting the character for assembling the hangul at the head position. Since it is also possible to input, it is not necessary to perform the same character input operation again to input the Korean character string, and the Korean character string input operation can be simplified.
  In addition, according to the present invention, a Korean input system that can simplify operations and enables rapid character input by incorporating a program for giving the above input prediction function to a computer in a recording medium for a Korean input system. Package software for the system can be provided.
[Brief description of the drawings]
FIG. 1 is a functional block diagram showing a configuration of a Korean IME according to the present invention.
FIG. 2 is a hardware block diagram of a computer in which the Korean IME of FIG. 1 is incorporated.
FIG. 3 is an explanatory diagram illustrating an example of a character input process.
4 is an explanatory diagram showing a configuration of prediction data created by the character input process of FIG. 3. FIG.
FIG. 5 is a flowchart showing a procedure of character input processing by Korean IME.
FIG. 6 is a flowchart showing a detailed procedure of registration processing.
FIG. 7 is an explanatory diagram illustrating an example in which character input is performed using the input prediction function.
FIG. 8 is a list showing the types of characters.
[Explanation of symbols]
1 Korean IME
2 Control unit
3 Prediction processing control unit
4 Input section
5 Output section
6 Hangul Assembly Department
7 Hangul / Hancha Converter
10 Phrase section
11 Character string extraction unit
12 Hancha-Hangul converter
13 Prediction data search part
14 Prediction data registration part
15 History data update unit
16 prediction dictionary
21 CPU
24 Hard disk device
25 CD-ROM drive

Claims

Input accepting means for accepting input of characters,
Assembly means for assembling a Korean character string based on the input order of the letters received by the input receiving means;
A conversion means for converting the Hangul character string to be operated into a character string of a huncher when a conversion operation is performed on the predetermined Hangul character string assembled by the assembling means;
When a confirmation operation is performed on a Korean character string assembled by the assembling means or a character string including a huncher character string converted by the converting means, the character string to be operated is output to a predetermined application. 1 output means,
In response to the confirmation operation, the operation target character string is registered in the memory in association with the input order of the characters necessary for assembling the character string, and when the registration target character string includes a hunter, A registration means for registering a character string obtained by replacing the huncher of a character string to be registered with Hangul in a memory in association with an input order of the same letter as the character string to be registered;
Each time the input accepting unit accepts input of characters, the candidate extraction unit that accumulates the input order of characters and extracts character strings that are input candidates from the memory based on the input order of the stored characters.
Display means for displaying a character string of input candidates extracted by the candidate extraction means on a display screen;
A program for causing a computer to function as each means of second output means for outputting a character string of a selected input candidate to the application when a selection operation is performed on the input candidate displayed on the display screen A recording medium for Korean language input programs.

When the target character string of the confirmation operation includes a plurality of clauses, the registration unit sets a character string of each clause unit and a character string that is a combination of a plurality of adjacent clauses to be registered in the memory. The recording medium for a Korean input program according to claim 1 , comprising means.

The registration means includes a means for executing a morphological analysis on the character string to be operated in accordance with the confirmation operation and extracting a self-supporting word from the character string; and a character string corresponding to the extracted self-supporting word. The recording medium for a Korean input program according to claim 1 , further comprising means for setting a registration target in the memory.

The recording medium according to any one of claims 1 to 3 ,
Each time a character string extracted as an input candidate from the memory is selected and output to an application, a program for causing the computer to function as a use history storage unit that writes a use history of the selected character string to the memory Is recorded,
The display means, when a plurality of input candidates are extracted by the candidate extraction means, after ordering each input candidate based on each usage history, and displaying a list of each input candidate according to the order A set of recording media for Korean input programs.