JPH08235171A

JPH08235171A - Japanese syllabary and chinese character converting device

Info

Publication number: JPH08235171A
Application number: JP7038699A
Authority: JP
Inventors: Yoshizo Saito; 佳三斎藤
Original assignee: Sharp Corp
Current assignee: Sharp Corp
Priority date: 1995-02-27
Filing date: 1995-02-27
Publication date: 1996-09-13

Abstract

PURPOSE: To perform conversion again according to correct paragraph sectioning and prevent misconversion due to wrong paragraph sectioning by putting a paragraph right before an input position back into HIRAGANA(cursive form of Japanese syllabary), merging it currently inputted HIRAGANA(square form of Japanese syllabary), and performing conversion even if paragraph sectioning is done by conversion indication at a position irrelative to original paragraph sectioning. CONSTITUTION: When an undetermined character string is present in a conversion character buffer 62, whether or not an input character string has the same reading with a suffix is retrieved from a suffix table 46. When the same reading is found, the KANJI(Chinese character) and KANA(Japanese character) mixed paragraph right before being inputted to the conversion character buffer 62 is converted into HIRAGANA, which is merged with the current input character string; and whether or not the character string is present in a suffix example dictionary 53 is retrieved. When the string is retrieved, it is stored as a 1st candidate in the conversion buffer 62. When not, on the other hand, a KANA-KANJI mixed paragraph positioned right before the input of the conversion buffer 62 is converted into HIRAGANA, and when the same reading is not found in the suffix table 45, retrieval from an independent word/proper noun dictionary is performed.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は日本語ワードプロセッサ
等、ひらがなで入力された文字を仮名漢字交じりに変換
する仮名漢字変換処理に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a kana-kanji conversion process for converting a character input in hiragana into a kana-kanji mixture such as a Japanese word processor.

【０００２】[0002]

【従来の技術】従来の技術として、特公昭61-3268には
単語単位だけでなく、単語の組み合わせを学習して仮名
漢字変換を行う技術が開示されている。つまり、「まし
た」という読みに対して「真下」という変換が学習され
ている場合に、「いただきました」という入力を行い、
仮名漢字変換を行った場合に、「いただき／ました」で
変換を行うために、「頂／真下」という意味のない結果
を出力してしまうことになる。2. Description of the Related Art As a conventional technique, Japanese Patent Publication No. 61-3268 discloses a technique for performing kana-kanji conversion by learning not only word units but also word combinations. In other words, if you have learned the conversion "directly below" for the reading "masuta", enter "you gotta",
When kana-kanji conversion is performed, the meaningless result of "top / underneath" is output because the conversion is done with "meta / ta".

【０００３】そこで、複数の単語の組み合わせで構成さ
れる複合語の学習結果が登録される学習バッファを備
え、単語単位での学習結果である「真下」より、複合語
の学習である「頂き」＋「ました」を優先させることに
よって、「頂きました」の変換結果を出力するものであ
る。Therefore, a learning buffer for registering the learning result of a compound word composed of a combination of a plurality of words is provided, and the learning result of each word "just below" is learned from the compound word "Ore". By prioritizing + “masuta”, the conversion result of “got” is output.

【０００４】特開平4-65751には、自立語に接尾語を接
続する場合に関しての接続条件を記憶しておく技術が開
示されている。この技術は、仮名漢字変換時に、接尾語
がついた場合の無意味語への変換を防止するものであ
る。自立語に意味分類コード（例えば「情報」、「動
物」など）を付与しその分類に対して、どの接尾語が接
続可能かを記憶したテーブルと照合して接続可能な場合
のみ、その接尾語の変換を行うことにより、無意味な接
尾語変換を防止することが可能となる。また、特開昭61
-239357には、接頭語または接尾語を入力する手段を設
けたものである。例えば「再使用の」と入力したい場合
に「再」＋「しようの」＋変換を行うものである。つま
り接頭語、接尾語は変換なしで入力するのである。これ
によれば「妻子用の」といった誤変換は防止できるが、
接頭語、接尾語すべてに入力キーを持たせることはあま
りにも非効率である。特開昭63-266575においては、接
尾語を一種の自立語として取り扱い、既に決定された前
文節との関係から自立語として接尾語を検索して語長も
加味し、変換を行うものである。この発明においても不
必要に接尾語が付された無意味語が候補として選択され
ることを防止できる。しかしながら、変換した結果の単
語にさかのぼって改めて接辞語の付加の可能性を考慮さ
れてはいない。Japanese Unexamined Patent Publication (Kokai) No. 4-65751 discloses a technique of storing connection conditions for connecting a suffix to an independent word. This technique prevents conversion to a meaningless word when a suffix is added when converting Kana-Kanji. Semantic classification code (for example, "information", "animal", etc.) is added to an independent word, and the suffix is added only when it can be connected by collating with a table that stores which suffixes can be connected. It becomes possible to prevent meaningless suffix conversion by performing the conversion. In addition, JP-A-61
-239357 is provided with a means for inputting a prefix or suffix. For example, when inputting "reuse", "reuse" + "resume" + conversion is performed. That is, the prefix and suffix are input without conversion. According to this, erroneous conversion such as "for wife and child" can be prevented,
It is too inefficient to have input keys for all prefixes and suffixes. In Japanese Patent Laid-Open No. 63-266575, the suffix is treated as a kind of independent word, the suffix is searched as an independent word from the relationship with the previously determined clause, and the word length is also added for conversion. . Also in this invention, it is possible to prevent a meaningless word with a suffix added unnecessarily from being selected as a candidate. However, the possibility of adding an affix word to the word resulting from the conversion is not considered.

【０００５】このように、従来の技術として文節間の関
係より正しい変換候補を導く技術があった。As described above, as a conventional technique, there has been a technique for deriving a correct conversion candidate from the relation between phrases.

【０００６】[0006]

【発明が解決しようとする課題】しかしながら、従来の
仮名漢字変換装置においては、入力された文字列を仮名
漢字交じりの文章に変換する場合において、結果が複文
節の場合、最初の文節と後続の文節を解析し、以下のよ
うな情報を用いて、変換候補を導いていた。However, in the conventional kana-kanji conversion device, when the input character string is converted into a kana-kanji mixed sentence, if the result is a compound phrase, the first phrase and the subsequent phrase are converted. The bunsetsu was analyzed and the conversion candidates were derived using the following information.

【０００７】（１）文節の自立部部内の接頭語、自立
語、固有名詞部、接尾語の意味、共起関係（２）文節内の自立語部内と付属語との格関係（３）文節内の自立部内とその前文節の自立部／固有名
詞部との意味、共起関係（４）連続する候補の自立部／固有名詞部および付属部
の格関係（５）連続する候補の自立部／固有名詞部の意味／共起
関係（６）単語の頻度を含む情報このような情報を利用し、例えば「たいようがのぼる」
という入力を変換すると「太陽が」＋「上る」「昇る」
「登る」から「昇る」が選択されていた。(1) Prefix, independence word, proper noun part, meaning of suffix, co-occurrence relation in the independence part of bunsetsu (2) Case relation between independence part in bunsetsu and adjunct (3) bunsetsu And co-occurrence relationship between the self-reliance part in the above and the preceding sentence's self-reliance part / proper noun part (4) Case independence of consecutive candidates / proper noun part and adjunct part (5) Consecutive candidates' self-reliance part / Meaning of proper noun part / Co-occurrence relation (6) Information including word frequency Using such information, for example,
When the input is converted, "Sun" + "Climb""Climb"
"Climb" to "Climb" was selected.

【０００８】また、変換されたデータがまだ未確定状態
にある場合にある場合でも、前の文節情報を記憶してい
て、［１］「太陽が」＋変換 → 太陽が［２］「昇る」＋変換 → 太陽が昇ると、変換出力していた。Even when the converted data is still in an undetermined state, the previous clause information is stored, and [1] "the sun" + conversion → the sun [2] "rises" + Conversion → When the sun rose, it converted and output.

【０００９】しかしながら、従来の技術において、「そ
うごう」＋変換＋「てき」＋変換と入力した場合は、
「総合／敵」と変換されてしまう。これは、「そうご
う」＝「総合」という文節情報は覚えていても、一度変
換してしまった場合、たとえ未確定状態であっても、１
つの文節として認識してしまい、上記の（１）文節の自
立部部内の接頭語、自立語／固有名詞部、接尾語の意
味、共起関係を改めて見直す処理が欠けているためであ
る。そのため「総合」と「的」の自立部内の接頭語、自
立語／固有名詞語、接尾語の意味、共起関係に結び付か
ず、単に「てき」と入力し、変換したときの最有力候補
の「敵」を先に出力する結果になってしまうからであ
る。However, in the conventional technique, when "Sogo" + conversion + "kita" + conversion is input,
It will be converted to "general / enemy". This means that even if you remember the phrase information "Sogou" = "general", once it is converted, even if it is in the unconfirmed state, 1
This is because it is recognized as one bunsetsu, and there is a lack of a process of revising the above-mentioned (1) the prefix in the independent part of the bunsetsu, the independent word / proper noun part, the meaning of the suffix, and the cooccurrence relationship. Therefore, without being linked to the prefixes, independent words / proper noun words, meanings of suffixes, and co-occurrence relations between the prefixes of “general” and “target” in the independence part, simply entering “ki” and converting it into the most promising candidate. This is because the "enemy" of will be output first.

【００１０】この例の「てき」の場合、候補は「敵」
「的」と２候補であるが、「りょう」の場合（診断料）
は「量」「寮」「漁」「猟」「良」「領」…などがあ
り、２〜３回の変換では所望の漢字が呼び出せず、無意
味語の候補が羅列する結果となり、ユーザは何度も変換
キーを操作する必要があり、また多くの候補群から所望
の漢字を選択するのも大変である。In the case of "Kiki" in this example, the candidate is "enemy".
There are 2 candidates for "target", but in case of "Ryo" (diagnosis fee)
Are "quantity", "dormitory", "fishing", "hunting", "good", "territory", etc., and the desired kanji cannot be called in a few conversions, resulting in a list of meaningless words. Requires the conversion key to be operated many times, and it is difficult to select a desired kanji from many candidate groups.

【００１１】同様に「てんてんはんそく（輾転反側）」
の変換を行うとき、「てんてん」＋変換＋「はんそく」
変換と入力した場合に、変換結果は「転々／反則」とな
ることが多い。[0011] Similarly, "tentensosaku (abduction side)"
"Tenten" + conversion + "Hansoku" when converting
When conversion is input, the conversion result is often "turning / foul".

【００１２】これは、「てんてんはんそく」＝「輾転反
側」という単語は辞書に登録されていても、ユーザは無
意識のうちに「輾転」と「反側」を２文節に分けて変換
してしまうと、たとえ未確定状態であっても、変換され
た文節は１個の文節として認識してしまい、上記の
（１）文節の自立部部内の接頭語、自立語／固有名詞
部、接尾語の意味、共起関係を改めて見直す処理が欠け
ているためである。そのために、「てんてん」と「はん
そく」の自立部内の接頭語、自立語／固有名詞語、接頭
語の意味、共起関係に結び付かず、単に「はんそく」と
入力し、変換を行った場合の最有力変換候補の「反則」
が出力されてしまう。同様に「一騎当千」を「いっき」
と「とうせん」に分けて入力した場合は、「一気／当
選」となるし、「閑話休題（かんわきゅうだい）」は
「緩和／及第」となってしまう。このような例では「休
題」を変換するために、結局は「休（きゅう）」と「題
（だい）」を分けて変換することになり、候補は非常に
多くなり、入力が大変という問題が発生していた。[0012] This means that even if the word "tenten-hansoku" = "inversion side" is registered in the dictionary, the user unconsciously converts "inversion" and "inversion" into two phrases. And even if it is in an undecided state, the converted bunsetsu will be recognized as one bunsetsu, and the (1) bunsetsu's prefix, independent word / proper noun part, and suffix This is because the process of reexamining the meaning and co-occurrence relationship is missing. For this reason, if you enter the word "Hansoku" without being linked to the prefix, independent word / proper noun word, meaning of the prefix, and co-occurrence in the "tenten" and "hansoku" independence parts, and perform conversion "Foul" of the most probable conversion candidates of
Is output. Similarly, "Ikki Tousen" is changed to "Ikki"
If you enter the words and "Tosen" separately, it will be "Ichiki / Win" and "Kankyukyudai" will be "Mitigation / Fifth". In such an example, in order to convert "rest", in the end, "kyu" and "dai" will be converted separately, and the number of candidates will be so large that input will be difficult. There was a problem.

【００１３】ユーザは、システムが決めた文節区切り位
置と関係ない文節区切りの認識を持っているため、しば
しばシステムが本来１文節と仮定する位置でない箇所で
変換キーを押す可能性がある。そのため、ユーザの所望
する漢字がなかなか出力されず、無意味語の候補を多く
出力し、候補選択に労力を要し、操作性が損なわれてい
た。Since the user has a recognition of a bunsetsu break which is not related to the bunsetsu break position determined by the system, there is a possibility that the user often presses the conversion key at a position that is not supposed to be one bunsetsu by the system. Therefore, the kanji desired by the user is not easily output, many meaningless word candidates are output, labor is required for selecting candidates, and operability is impaired.

【００１４】よって、本発明は、ユーザにシステムの文
節区切りの概念を強制するのではなく、ユーザは文節区
切りを位置を間違えることがあることを前提に、ユーザ
が間違った文節で変換指示をして文節を区切ったときに
も、本来の文節区切りとして文節区切りができるかどう
かをシステムが判断して、ユーザが所望する候補を速や
かに出力する。Therefore, the present invention does not force the user to use the concept of phrase breaks in the system, but assumes that the user may misplace the phrase breaks, and the user gives a conversion instruction with the wrong phrase. Even when the bunsetsu is delimited, the system determines whether or not the bunsetsu delimiter can be used as the original bunsetsu delimiter, and promptly outputs the candidate desired by the user.

【００１５】[0015]

【課題を解決するための手段】本発明では、上記課題を
解決するために、かな文字を入力する入力手段、前記入
力手段より入力されたかな文字を格納する入力バッフ
ァ、前記入力バッファに格納されたかな文字を、漢字仮
名交じりに変換する仮名漢字変換手段、前記仮名漢字変
換手段によって変換された変換候補を格納する変換文字
バッファを備えた仮名漢字変換装置において、前記入力
手段からの仮名漢字変換指示に呼応して、前記変換文字
バッファに格納されている変換候補をかな文字に戻す戻
し手段、前記戻されたかな文字と前記入力バッファに格
納されたかな文字とを結合する結合手段、前記結合手段
によって結合されたかな文字を変換対象文字列として、
辞書を参照する参照手段、前記参照手段によって、前記
変換バッファに格納されている変換候補との部分一致を
とる漢字の表記を変換候補とする。According to the present invention, in order to solve the above problems, input means for inputting kana characters, an input buffer for storing kana characters input by the input means, and an input buffer for storing kana characters are stored in the input buffer. A kana-kanji conversion device for converting a kana-kana into kanji kana-kanji conversion means, and a kana-kanji conversion device including a conversion character buffer storing the conversion candidates converted by the kana-kanji conversion means. In response to the instruction, a returning unit that returns the conversion candidate stored in the conversion character buffer to a kana character, a combining unit that combines the returned kana character and the kana character stored in the input buffer, and the combining Kana characters combined by means as the conversion target character string,
The reference means for referencing the dictionary, and the reference means, the notation of the kanji that partially matches the conversion candidate stored in the conversion buffer is used as the conversion candidate.

【００１６】[0016]

【作用】本発明において、仮名漢字変換指示があった場
合に、入力したかな文字の直前に未確定の漢字仮名交じ
り文がある場合、前の１文節の漢字仮名交じり文節を読
みに直し、今回入力した読みと結合して辞書と照合し、
辞書に「同じ読み」、「先行漢字列が未確定状態の漢字
と同じ」条件の場合に、その単語の表記を変換の第１候
補として出力し、条件に適合しない場合は、通常の仮名
漢字変換処理を行う。In the present invention, if there is an undetermined Kanji / Kana mixed sentence immediately before the input Kana character when there is a Kana / Kanji conversion instruction, the previous Kanji / Kana mixed sentence is read again and this time. Combined with the input Yomi and collated with the dictionary,
In the case of "same reading" or "preceding kanji string is the same as undetermined kanji" condition in the dictionary, the notation of that word is output as the first candidate for conversion, and if it does not match the condition, normal kana kanji Perform conversion processing.

【００１７】[0017]

【実施例】本発明における実施例を図を用いて詳細に説
明する。図１は本発明の実施例を示すブロック構成図で
ある。図１において、１は制御部であり、マイクロプロ
セッサ、プログラムやテーブル、外部記憶装置などの以
下に明記する以外のものを含んでおり、以下に明記する
ものを制御するものである。２はＣＲＴやＬＣＤ等から
なる表示部である。３はキーボード等からなる入力部で
ある。Embodiments of the present invention will be described in detail with reference to the drawings. FIG. 1 is a block diagram showing an embodiment of the present invention. In FIG. 1, reference numeral 1 denotes a control unit, which includes a microprocessor, a program, a table, an external storage device and the like other than those specified below, and controls what is specified below. Reference numeral 2 is a display unit including a CRT, an LCD and the like. An input unit 3 includes a keyboard and the like.

【００１８】４は意味コードを含んだ自立語／固有名詞
／接辞語／音訓辞書／ユーザ辞書等の辞書部およびテー
ブル部である。５は単語の共起関係を示した複合用例／
接頭語用例／接尾語用例辞書などの辞書部である。Reference numeral 4 denotes a dictionary unit such as an independent word / proper noun / affix word / phonetic dictionary / user dictionary including a meaning code and a table unit. 5 is a compound example showing the co-occurrence relationship of words /
A dictionary unit such as a prefix example / suffix example dictionary.

【００１９】６はバッファメモリーである。このバッフ
ァメモリはさらに、６１の入力文字バッファ、６２の変
換文字バッファ、６３の学習バッファ、６４の同音語バ
ッファ、６５の文章バッファ、６６の画面バッファ、６
７の分かち位置テーブルバッファから構成されている。Reference numeral 6 is a buffer memory. The buffer memory further includes 61 input character buffers, 62 converted character buffers, 63 learning buffers, 64 homophone buffers, 65 sentence buffers, 66 screen buffers, 6
It is composed of seven division position table buffers.

【００２０】６１の入力バッファは３の入力手段から入
力された文字を格納しておくバッファである。６２の変
換文字バッファは６１の入力文字バッファのデータを変
換したときの第１候補のデータを格納しておくバッファ
である。６３の学習バッファは確定した同音語がある場
合、優先順位を上げておくためのバッファである。The input buffer 61 is a buffer for storing the characters input from the input means 3. The conversion character buffer 62 is a buffer for storing the first candidate data when the data of the input character buffer 61 is converted. The learning buffer 63 is a buffer for increasing the priority when there is a confirmed homophone.

【００２１】６４の同音語バッファは６２の変換文字バ
ッファの第１候補以外の同音語と変換処理のときに、内
部で作成された別の単語データが格納される。６５の文
章バッファは確定した文章データが格納される。６６の
画面バッファは６１の入力文字バッファ（変換前）また
は６２の変換文字バッファ（変換後）と６５の文章バッ
ファが合わさった形となり、６１、６２のバッファ上表
も６６の画面バッファの中に有している。６７の分かち
位置テーブルバッファは、文節候補どのように並べたほ
うが正解候補に近いか文節の区切り情報が可能な限り
（複数種類）格納されている。In the 64 homophone word buffer, a homophone word other than the first candidate of the converted character buffer 62 is stored, and another word data internally created during the conversion process. The sentence buffer 65 stores fixed sentence data. The screen buffer of 66 has a form in which the input character buffer of 61 (before conversion) or the conversion character buffer of 62 (after conversion) and the sentence buffer of 65 are combined, and the buffer tables of 61 and 62 are also in the screen buffer of 66. Have In the division position table buffer 67, bunsetsu candidates are stored as much as possible (a plurality of types) as to how the bunsetsu candidates are arranged closer to the correct answer candidate.

【００２２】入力部３より入力された仮名列は、まず６
１入力バッファに格納される。次に変換のタイミングに
なったとき、制御部１は辞書部４、用例辞書５などを参
照して仮名漢字変換処理を行い、６２の変換文字バッフ
ァに第１優先候補を蓄える。また第１優先候補以外の単
語データと違う別の単語データは６４の同音語バッファ
んい、違う分かち位置情報は６７の分かち位置テーブル
バッファに蓄えられ、入力部３び操作によって、表示部
２に表示されて選択することができる。入力部３の操作
によって、同音語の選択が行われると、その結果が６３
の学習バッファに情報が蓄えられる。確定された文章デ
ータは６２の変換文字バッファより６５の文章バッファ
に複写される。The kana sequence input from the input unit 3 is first 6
1 is stored in the input buffer. At the next conversion timing, the control unit 1 refers to the dictionary unit 4, the example dictionary 5, and the like to carry out Kana-Kanji conversion processing, and stores the first priority candidate in the conversion character buffer 62. Further, different word data different from the word data other than the first priority candidate are stored in the homophone word buffer of 64, and the different division position information is stored in the division position table buffer of 67, and are displayed on the display unit 2 by the input unit 3 and the operation. It is displayed and can be selected. When a homonym is selected by operating the input unit 3, the result is 63
Information is stored in the learning buffer of. The confirmed text data is copied from the conversion character buffer 62 to the text buffer 65.

【００２３】実際の例として「先生や親が／準備会」と
いう文章データの真ん中に（／の位置）に「そうごう」
と文字列を入力して変換して直後の各データバッファの
内容は図１３のようになり、その直後に「てき」と入力
して変換した直後の各データバッファの内容は図１４の
ようになっている。As an actual example, “sougo” is placed in the middle (/ position) of the text data “teacher or parent / preparatory meeting”.
The contents of each data buffer immediately after inputting and converting a character string are as shown in FIG. 13, and the contents of each data buffer immediately after inputting “Kai” and being converted are as shown in FIG. Has become.

【００２４】６２の変換文字バッファには変換文字列１
字づつに読み情報をもっている。６４の同音語バッファ
には、入力文字列１字ごとに、そこから文節が始まる場
合の単語情報が、６４の同音語バッファに格納されてい
る。The conversion character string 1 is stored in the conversion character buffer 62.
It has reading information for each character. In the 64 homophone buffers, the word information when a phrase starts from each character string is stored in the 64 homophone buffers.

【００２５】６６の画面バッファであるが、未確定文字
列がない場合には６５の文章バッファの内容がそのまま
画面バッファに出力される。未確定文字列がある場合で
変換前は６１の入力文字列データを６５文章バッファの
カーソル位置に挿入して、画面バッファに出力する。そ
の場合、画面バッファのどこに未確定文字列があるかの
情報を保存しておく。変換後に未確定文字列がある場合
には、６２変換文字列データを６５文章バッファのカー
ソル位置に挿入して、画面バッファに出力する。その場
合、画面バッファのどこに未確定文字列があるか情報を
保存しておく。６７の分かち位置バッファには、同音語
バッファにある文節候補をどのように集めて一つの文章
を構成するか、複数種類の分かち位置の情報を保存して
おく。Although there are 66 screen buffers, if there is no undetermined character string, the contents of the 65 text buffer are output to the screen buffer as they are. When there is an undetermined character string and before conversion, 61 input character string data is inserted into the cursor position of the 65 sentence buffer and output to the screen buffer. In that case, information on where in the screen buffer the undetermined character string is stored is stored. If there is an undetermined character string after conversion, 62 converted character string data is inserted at the cursor position of the 65 sentence buffer and output to the screen buffer. In that case, information on where in the screen buffer the undetermined character string is stored is stored. In the separation position buffer 67, information on how to collect the phrase candidates in the homophone word buffer to form one sentence or a plurality of kinds of separation positions is stored.

【００２６】本出願の具体例をフローチャートに沿って
説明する。図９にフローチャートを示す。ステップＳ１
でキー入力待ちを行う。そこで変換キーが入力される
と、ステップＳ２からステップＳ３に進む。取り消しキ
ーが入力された場合は、ステップＳ６に進む。それ以外
のキーが入力された場合はＢに進む。A specific example of the present application will be described with reference to a flowchart. FIG. 9 shows a flowchart. Step S1
Wait for key input with. When the conversion key is input there, the process proceeds from step S2 to step S3. When the cancel key is input, the process proceeds to step S6. If any other key is input, the procedure advances to B.

【００２７】Ｂでの処理はステップＳ９でカーソル移動
キーかどうかを判断し、カーソル移動キーである場合は
ステップＳ１０へ進んでカーソル移動処理を行う。入力
がカーソル移動キーでなかった場合には、ステップＳ１
１で文字入力かどうか判断する。文字入力でない場合の
処理は本発明には直接関係ないので省略する。文字入力
である場合にはステップＳ１２で入力文字バッファに文
字を格納する。ステップＳ２で変換キーが入力された場
合、ステップＳ３に進む。In the process of B, it is determined in step S9 whether or not the key is a cursor movement key. If it is the cursor movement key, the process proceeds to step S10 to perform the cursor movement process. If the input is not the cursor movement key, step S1
It judges whether it is a character input at 1. Since the processing when it is not a character input is not directly related to the present invention, it is omitted. If the input is a character, the character is stored in the input character buffer in step S12. When the conversion key is input in step S2, the process proceeds to step S3.

【００２８】ステップＳ３では６２の変換文字バッファ
に未確定文字列があるか確認する。ここで未確定文字列
がある場合には、入力文字列が接尾語と同じ読みである
かどうか、４６の接尾語テーブルを検索する。ここで、
４６の接尾語テーブルに同じ読みがある場合は、ステッ
プＳ５に進んで６２の変換文字バッファの、入力された
直前の位置する漢字仮名交じり文節をひらがなに変え、
今回の入力文字列と結合させ、その文字列が５３の接尾
語用例辞書にあるかどうか検索する。同じ読みが検索さ
れた場合は６２の変換バッファに第１候補として格納す
る。検索されなかった場合は、６２の変換文字バッファ
の入力された直前に位置する仮名漢字交じり文節を平仮
名に変え、４５の接頭語テーブルを検索する。接頭語テ
ーブルに同じ読みがなければ、同様に４１の自立語／固
有名詞辞書を検索する。In step S3, it is confirmed whether or not there is an unfixed character string in the converted character buffer 62. If there is an undetermined character string, the suffix table of 46 is searched for whether the input character string has the same reading as the suffix. here,
If the suffix table of 46 has the same reading, the process proceeds to step S5, and the Kanji-kana mixed bunsetsu located immediately before the input in the conversion character buffer of 62 is changed to hiragana,
It is combined with the input character string this time, and the character string is searched for in the suffix example dictionary of 53. If the same reading is retrieved, it is stored in the conversion buffer 62 as the first candidate. If it is not searched, the kana-kanji mixed phrase located immediately before the input in the conversion character buffer 62 is changed to hiragana and the 45 prefix table is searched. If there is no same reading in the prefix table, the independent word / proper noun dictionary of 41 is similarly searched.

【００２９】ここでも同じ読みがテーブルになければ、
ステップＳ４で図３の文節の構成要素をもとに、図５の
活用語尾テーブル、図６の付属語テーブル、図７の接辞
語テーブルのデータを図４の接続行列から結び付きを判
断して文節を生成する。そして、生成された文節を組み
合わせて、文章を形成する。一番良いと思われる組み合
わせから順番に６７の分かち位置テーブルバッファにど
の位置に文節の切れ目をいれるかの情報を格納する。Here again, if the same reading is not in the table,
In step S4, based on the constituent elements of the clause in FIG. 3, the data in the inflection ending table in FIG. 5, the auxiliary word table in FIG. 6, and the affix table in FIG. 7 is judged from the connection matrix in FIG. To generate. Then, the generated clauses are combined to form a sentence. Information on which position to insert the break of the clause is stored in the position table buffer in the order of 67 from the combination considered to be the best.

【００３０】４５接頭語テーブル、４１自立語／固有名
詞辞書に同じ読みがあった場合は、今回の入力文字列と
結合させた文字列が、接頭語用例辞書、自立語／固有名
詞辞書の表記にあるかどうか確認する。ある場合は、ス
テップＳ５で第１候補として６２の変換バッファに格納
する。If the 45 prefix table and 41 independent words / proper noun dictionaries have the same reading, the character string combined with the input character string this time is represented in the prefix example dictionary and independent word / proper noun dictionary. To see if If there is, it is stored in the conversion buffer 62 as the first candidate in step S5.

【００３１】キー入力が取り消しキーであった場合は、
ステップＳ７で未確定の出力候補があるかどうかを判断
し、ある場合はステップＳ８でカーソル位置にある文節
から前に１文節分入力文字に戻す処理を行う。When the key input is the cancel key,
In step S7, it is determined whether or not there is an undetermined output candidate. If there is an output candidate, in step S8, processing for returning the phrase at the cursor position to the input character for one phrase before is performed.

【００３２】次に図１０、図１３、図１４を用いて図９
のステップＳ３〜Ｓ５の実際の例に即した説明を行う。Next, referring to FIGS. 10, 13 and 14, FIG.
A description will be given in accordance with an actual example of steps S3 to S5.

【００３３】「総合的」と変換するために「そうごう」
＋変換＋「てき」＋変換と入力する場合の説明である。
まず、入力文字列として「そうごう」と入力する。入力
文字は６１入力文字バッファに格納される。次に変換キ
ーの入力により、６２の変換文字バッファに未確定文字
列が存在するかどうかを確認する。この例ではまだ未確
定文字列が存在しないので、６１入力文字列バッファに
格納されている文字列を元に、可能な限りの文節を生成
する（つまり通常の変換）。"Sogo" to convert to "comprehensive"
This is an explanation in the case of inputting + conversion + “ki” + conversion.
First, enter "Sougo" as the input character string. The input characters are stored in the 61 input character buffer. Next, by inputting the conversion key, it is confirmed whether or not the unconfirmed character string exists in the conversion character buffer 62. In this example, there is no undetermined character string yet, so as many clauses as possible are generated based on the character string stored in the 61-input character string buffer (that is, normal conversion).

【００３４】そして「てき」を入力する。入力文字は６
１入力文字列バッファに格納される。ここで変換キーの
操作により、６２の変換文字バッファに未確定文字があ
るかどうかを確認する。ここでは未確定文字列が存在す
るので、入力文字列［てき］が接尾語と同じよみかどう
か、４６接尾語テーブルを検索する。４６接尾語テーブ
ルに同じ読みがあるので、６２変換文字バッファの入力
された直前に位置する漢字仮名交じり文節［総合］をひ
らがな［そうごう］に変え、今回の入力文字列［てき］
と結合された文字列［そうごうてき］が５３接尾語用例
辞書にあるか検索する。Then, "ki" is input. Input character is 6
1 Stored in the input character string buffer. Here, by operating the conversion key, it is confirmed whether or not there are unconfirmed characters in the conversion character buffer 62. Since there is an undetermined character string here, the 46 suffix table is searched to see if the input character string [kita] has the same reading as the suffix. Since the 46 suffix table has the same reading, the kanji kana mixed phrase [comprehensive] located immediately before the input in the 62 conversion character buffer is changed to hiragana [sougo], and the input character string this time [come]
It is searched whether the character string [Sougouki] combined with is in the 53 suffix example dictionary.

【００３５】５３接尾語用例辞書から表記文字［総合
的］を選び出し、６２変換バッファで読みに変更した漢
字［総合］と同じ表記かどうか確認する。この例では同
じ表記であるので、［総合的］を変換第１候補として６
２の変換バッファに格納する。そして６５文章バッファ
と６２変換文字バッファを合成して６６画面バッファを
作成して表示する。The notation character [general] is selected from the 53 suffix example dictionary, and it is confirmed in the 62 conversion buffer whether or not the character is the same as the kanji [general] changed to reading. Since the same notation is used in this example, [comprehensive] is 6 as the first conversion candidate.
Store in the conversion buffer of No. 2. Then, the 65 sentence buffer and the 62 converted character buffer are combined to create a 66 screen buffer for display.

【００３６】「一騎当千」を「いっき」＋変換＋「とう
せん」＋変換と入力した場合の例を図１１、図１５、図
１６をもとに説明する。上記例同様に、入力文字列「と
うせん」が接尾語と同じ読みかどうか、４６接尾語テー
ブルを検索する。ここでその読みがないので、６２変換
文字バッファの入力された直前に位置する漢字仮名交じ
り文節をひらがな「いっき」に変え、４５接頭語テーブ
ルを検索する。ここでも同じ読みがないので、６２の変
換文字バッファの入力された直前に位置する漢字仮名交
じり文節をひらがなに変え、今回の入力文字列を結合さ
せた文字列「いっきとうせん」で４１自立語辞書／固有
名詞辞書を検索する。An example in which "Ikki Tousen" is input as "Ikki" + conversion + "Tosen" + conversion will be described with reference to FIGS. 11, 15 and 16. FIG. Similarly to the above example, the 46 suffix table is searched for whether the input character string "tousen" has the same reading as the suffix. Since there is no reading here, the kanji-kana mixed phrase located immediately before the input in the 62 conversion character buffer is changed to hiragana "Ikki" and the 45 prefix table is searched. Since there is no same reading here, the kanji-kana mixed bunsetsu located just before the input of the 62 conversion character buffer is changed to hiragana, and the character string "Ikkitousen" that combines the input character strings this time is 41 independent word dictionary. / Search the proper noun dictionary.

【００３７】４１自立語辞書／固有名詞辞書から表記文
字［一騎当千］を選び出し、６２変換バッファで読みに
変換した漢字［一騎］と同じ表記かどうか判断する。同
じ表記であるので、第１候補として［一騎当千］を６２
の変換バッファに格納する。同様に「副参事」を「ふ
く」＋変換＋「参事」＋変換と入力した場合の例を図１
２、図１７、図１８をもとに説明する。上記例同様に、
入力文字列「さんじ」が接尾語と同じ読みかどうか、４
６接尾語テーブルを検索し、同じ読みがないので、６２
変換文字バッファの入力された直前に位置する漢字仮名
交じり文節をひらがな「ふく」に変え、４５接頭語テー
ブルを検索する。41. A notation character [Ikkitousen] is selected from the 41 independent word dictionary / proper noun dictionary, and it is judged whether the notation is the same as the Kanji [Ikki] converted into reading in the 62 conversion buffer. Since the same notation is used, [Ikkitousen] is 62 as the first candidate.
Store in the conversion buffer of. Similarly, an example of inputting "Vice counselor" as "Fuku" + conversion + "Counselor" + conversion Fig. 1
2, and FIG. 17 and FIG. 18 will be described. Similar to the above example,
Whether the input string "sanji" has the same pronunciation as the suffix, 4
6 The suffix table is searched, and since there is no same reading, 62
The Kanji-kana mixed phrase located immediately before the input in the conversion character buffer is changed to hiragana “fuku” and the 45-prefix table is searched.

【００３８】ここでは４５接頭語テーブルに同じ読みが
あるので、今回の入力文字列と結合させた文字列「ふく
さんじ」が５３接尾語用例辞書にあるかどうか検索す
る。５２接尾語用例辞書から表記文字「副参事」を選び
出し、６２変換バッファで読みに変更した漢字「副」と
同じ表記であることを確認する。同じ表記であるので、
「副参事」を第１候補として６２変換バッファに格納す
る。Here, since the 45 prefix table has the same reading, it is searched whether or not the character string "Fukusanji" combined with the input character string this time is in the 53 suffix example dictionary. 52. Select the notation character “Deputy Counselor” from the 52 suffix example dictionary, and confirm that it has the same notation as the Kanji character “Sub” changed to the reading in the conversion buffer 62. Since it is the same notation,
The "counselor" is stored in the 62 conversion buffer as the first candidate.

【００３９】[0039]

【発明の効果】ユーザが本来の文節区切りとは関係のな
い位置で変換指示によって文節区切りをしてしまった場
合であっても、入力位置の直前の文節を一度ひらがなに
戻して、現在入力しているひらがなと結合して変換を行
うことによって、改めて正しい文節区切りに従って変換
を行うことができ、誤った文節区切りによる誤変換を防
止する。[Effects of the Invention] Even when the user breaks a phrase by a conversion instruction at a position unrelated to the original phrase break, the phrase immediately before the input position is returned to Hiragana once and the current phrase is input. By combining with Hiragana and performing the conversion, the conversion can be performed again according to the correct segment break, and the incorrect conversion due to the incorrect segment break is prevented.

[Brief description of drawings]

【図１】本発明におけるブロック構成図である。FIG. 1 is a block diagram of the present invention.

【図２】本発明における文章入力画面の例を示す図であ
る。FIG. 2 is a diagram showing an example of a text input screen according to the present invention.

【図３】文節の構成要素の例を示す図である。FIG. 3 is a diagram showing an example of constituent elements of a clause.

【図４】単語の接続に関する接続行列を示す図である。FIG. 4 is a diagram showing a connection matrix relating to connection of words.

【図５】用言の活用語尾のテーブルを示す図である。FIG. 5 is a diagram showing a table of inflection endings of a word.

【図６】付属語を格納したテーブルを示す図である。FIG. 6 is a diagram showing a table storing attached words.

【図７】接辞語を格納したテーブルを示す図である。FIG. 7 is a diagram showing a table storing affix words.

【図８】接尾語用例辞書の一例を示す図である。FIG. 8 is a diagram showing an example of a suffix example dictionary.

【図９】本発明の処理の流れの概要を示す図である。FIG. 9 is a diagram showing an outline of a processing flow of the present invention.

【図１０】本発明における一実施例の流れを示す図であ
る。FIG. 10 is a diagram showing the flow of an embodiment of the present invention.

【図１１】本発明における一実施例の流れを示す図であ
る。FIG. 11 is a diagram showing a flow of one embodiment in the present invention.

【図１２】本発明における一実施例の流れを示す図であ
る。FIG. 12 is a diagram showing a flow of one embodiment in the present invention.

【図１３】本発明における一実施例の実行中のバッファ
の内容を示す図である。FIG. 13 is a diagram showing the contents of a buffer during execution of one embodiment of the present invention.

【図１４】本発明における一実施例の実行中のバッファ
の内容を示す図である。FIG. 14 is a diagram showing the contents of a buffer during execution of one embodiment of the present invention.

【図１５】本発明における一実施例の実行中のバッファ
の内容を示す図である。FIG. 15 is a diagram showing the contents of a buffer during execution of one embodiment of the present invention.

【図１６】本発明における一実施例の実行中のバッファ
の内容を示す図である。FIG. 16 is a diagram showing the contents of a buffer during execution of one embodiment of the present invention.

【図１７】本発明における一実施例の実行中のバッファ
の内容を示す図である。FIG. 17 is a diagram showing the contents of a buffer during execution of one embodiment of the present invention.

【図１８】本発明における一実施例の実行中のバッファ
の内容を示す図である。FIG. 18 is a diagram showing the contents of a buffer during execution of one embodiment of the present invention.

[Explanation of symbols]

１制御部２表示部３入力部４辞書部４１自立語／固有名詞辞書４２接続行列テーブル４３活用語尾テーブル４４付属語テーブル４５接頭語テーブル４６接尾語テーブル５共起関係辞書部５１複合用例辞書５２接頭語用例辞書５３接尾語用例辞書５４意味コード用例辞書６バッファ６１入力文字バッファ６２変換文字バッファ６３学習バッファ６４同音語バッファ６５文章バッファ６６画面バッファ６７分かち位置テーブルバッファ 1 control unit 2 display unit 3 input unit 4 dictionary unit 41 independent word / proper noun dictionary 42 connection matrix table 43 inflectional end table 44 adjunct word table 45 prefix table 46 suffix table 5 co-occurrence relation dictionary unit 51 compound example dictionary 52 Prefix example dictionary 53 Suffix example dictionary 54 Semantic code example dictionary 6 Buffer 61 Input character buffer 62 Converted character buffer 63 Learning buffer 64 Homophone buffer 65 Text buffer 66 Screen buffer 67 Minute position table buffer

Claims

[Claims]

1. Input means for inputting kana characters, an input buffer for storing kana characters input by the input means, kana-kanji conversion means for converting kana characters stored in the input buffer into kanji-kana kanji mixing, In a kana-kanji conversion device having a conversion character buffer for storing conversion candidates converted by the kana-kanji conversion means, conversions stored in the conversion character buffer in response to a kana-kanji conversion instruction from the input means. A dictionary for converting the candidates into kana characters, a combining means for combining the returned kana characters with the kana characters stored in the input buffer, and a kana character combined by the combining means as a conversion target character string; The reference means for referring, and the reference means performs forward matching with the conversion candidates stored in the conversion buffer, and if they match, the A kana-kanji conversion device characterized by using kanji notations as conversion candidates.

2. The kana-kanji conversion device according to claim 1, wherein the reference means refers to a suffix example dictionary.

3. The kana-kanji conversion device according to claim 1, wherein the reference means refers to an independent word / proper noun dictionary.

4. The kana-kanji conversion device according to claim 1, wherein the reference means refers to a prefix example dictionary.