JPH05174011A

JPH05174011A - Kana/kanji conversion processor

Info

Publication number: JPH05174011A
Application number: JP3343597A
Authority: JP
Inventors: Yoshizo Saito; 佳三斎藤
Original assignee: Sharp Corp
Current assignee: Sharp Corp
Priority date: 1991-12-25
Filing date: 1991-12-25
Publication date: 1993-07-13
Anticipated expiration: 2015-09-04
Also published as: JP3083610B2

Abstract

PURPOSE:To improve the KANA (Japanese syllabary)/KANJI (Chinese character) conversion accuracy for the close batch conversion processing by adding and registering the andividable flags of paragraphs to the words which are registered in a dictionary like an ordinary dictionary, a user dictionary, etc. CONSTITUTION:A key input means 101 is provided together with a display device 1.02 which has a display screen, an input character storage means 103 which stores the inputted KANA character strings, a paragraph dividing means 104 which divides the KANA character string into the KANA character paragraphs, a KANA/KANJI conversion dictionary 105 which stores the KANJI to be converted into KANA characters, a converter means 106 which converts the KANA characters into KANJI or the KANA-KANJI sentences, a specific paragraph conversion dictionary 107 which registers a specific KANA character paragraph and its corresponding KANJI or KANA-KANJI sentence, a paragraph control means 108 which controls the means 104 so as not to divide the specific KANA character paragraph if registered in the dictionary 107 among those KANA character strings end also converts the specific paragraph into KANJI or a KANA-KANJI sentence, and an output means 109 which outputs the character strings stored in the means 104 to a display screen.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】この発明は仮名漢字変換処理が可
能な日本語ワードプロセッサや仮名漢字変換処理装置に
関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a Japanese word processor and a kana-kanji conversion processing device capable of kana-kanji conversion processing.

【０００２】[0002]

【従来の技術】従来、この種の仮名漢字変換処理装置に
おいて、キーボードから入力された仮名文字を漢字に変
換する仮名漢字変換辞書（以下通常辞書と言う）が設け
られている。しかし、通常辞書に登録されている単語数
には容量等の限界があるため、通常辞書だけでは充分に
変換が行えない。2. Description of the Related Art Conventionally, in this type of kana-kanji conversion processing device, a kana-kanji conversion dictionary (hereinafter referred to as a normal dictionary) for converting kana characters input from a keyboard into kanji is provided. However, since the number of words registered in the ordinary dictionary has a limit such as capacity, the ordinary dictionary cannot sufficiently perform the conversion.

【０００３】このため、ユーザーが新規に単語を登録で
きる特定文節変換辞書（以下ユーザー辞書と言う）を設
け、仮名漢字変換の操作性を向上させている。すなわ
ち、ユーザー辞書に登録されている単語は、一般にユー
ザーが使用するということで、通常辞書に登録している
単語より変換処理出力の優先順位を高められているか頻
度情報を加えられている。このように変換処理出力をコ
ントロールすることにより変換効率をあげる方法とし
て、例えば、特開昭５５ー１４３６４８号公報に開示さ
れている方法がある。Therefore, a specific phrase conversion dictionary (hereinafter referred to as a user dictionary) in which a user can newly register a word is provided to improve the operability of kana-kanji conversion. That is, the words registered in the user dictionary are generally used by the user, and thus the priority of the conversion processing output is higher than that of the words registered in the normal dictionary, and frequency information is added. As a method of increasing the conversion efficiency by controlling the conversion processing output in this way, there is a method disclosed in Japanese Patent Laid-Open No. 55-143648.

【０００４】[0004]

【発明が解決しようとする課題】しかしながら、従来の
仮名漢字変換処理装置では、入力された仮名文字列が長
文の場合、どこの文節で区切れば一番適切かがプログラ
ムで処理されるが、従来の単語情報のままでは、ユーザ
ーが必要としている意味の文節の区切りでなく、意味の
違う区切りで分けられることがある。このため、ユーザ
ーが必要とする単語をユーザー辞書に登録しても、仮名
漢字変換処理に反映されない場合があり当初の効果を発
揮しないこともある。However, in the conventional kana-kanji conversion processing device, when the input kana character string is a long sentence, the program processes which segment is most appropriate. With the conventional word information, it may be divided by a segment having a different meaning rather than a segment of a phrase having a meaning required by the user. Therefore, even if a word required by the user is registered in the user dictionary, it may not be reflected in the Kana-Kanji conversion process, and the initial effect may not be exhibited.

【０００５】例えば、「ほうりゅうじをたてたしょうと
くたいしにいった」を入力した場合、所望の仮名漢字変
換は「法隆寺を建てた聖徳太子に言った」であるが、
「法隆寺を立てた称徳大使に言った」、「放流時を立て
た小と苦大使に行った」というように色々意味が違って
変換される場合がある。これは、入力された文字列に対
して、文節の区切り位置がどう処理されるかによって変
換候補（単語）が異なるためである。[0005] For example, when "I went to the sword to make a story" was entered, the desired kana-kanji conversion was "I told the Prince Shotoku who built Horyuji."
There are cases where the meanings are changed differently, such as "I said to the Ambassador of Houdoji who established Horyuji" or "I went to the small and painful ambassadors who set the release time". This is because the conversion candidates (words) for the input character string differ depending on how the segment delimiter positions are processed.

【０００６】例１；「しょうとくたいし」を１文節とし
た場合は、「聖徳太子」と変換される。例２；「しょうとく／たいし」を２文節とした場合は、
「頌徳（自立語）、称徳（名前）、承徳（市）、正徳
（年号）、聖徳（大学名）」＋「多石（山名）、太子
（町名）、大使（自立語）、大志（自立詞）、タイ
（国）＋し、他意＋し、泰＋し、田井＋し、対し、鯛＋
し、隊士、体＋し」等の変換候補がある。例３；「しょうと／くたい／し」を３文節とした場合
は、「商都（自立語）、翔と（名）、賞と、称と、章
と、証と、小と、商と、将と、性と、彰と、昭と、晶
と、正と、省と、尚と、ＳＨＯＷと」＋「区体」＋
「し、市、士、詩、師、史、四、氏、資、死、詞、視、
子」等の変換候補がある。Example 1: When "Shoutokutai" is used as one phrase, it is converted to "Shutoku Taishi". Example 2; When "Shoutoku / Taishi" has two clauses,
"Otoku (independent word), Shode (name), Chengde (city), Masanori (year name), Shotoku (university name)" + "Taishi (Yamana), Prince (Town name), Ambassador (Independent word), Taishi (independence word), Thailand (country) +, other meaning +, Tai +, Tai +, against, sea bream +
There are conversion candidates, such as "shi, soldier, body + shi". Example 3; If you use "Shouto / Kutai / Shi" as three clauses, you can read "Merchandise (independent language), Sho and (name), award, title, chapter, testimony, small, and quotient. , Sho, Sex, Akira, Akira, Akira, Akira, Positive, Ministry, Still, SHOW and "+" Ward "+
"Shi, Shi, Shi, Shi, Shi, Shi, Shi, Shi, Shi, Shi, Shi, Shi, Shi,
There are conversion candidates such as "child".

【０００７】例４；「しょう／とくたい／し」を３文節
とした場合は、「商、翔、賞、称、章、証、小、商、
将、性、彰、昭、晶、正、省、尚、ＳＨＯＷ」＋「特
待」＋「し、市、士、詩、師、史、四、氏、資、死、
詞、視、子」等の変換候補がある。例５；「しょ／うとく／たいし」を３文節とした場合
は、「署、書、緒、初」＋「有徳」＋「多石（山名）、
太子（町名）、大使（自立語）、大志（自立詞）、タイ
（国）＋し、他意＋し、泰＋し、田井＋し、対し、鯛＋
し、隊士、体＋し」等の変換候補がある。Example 4; When "sho / tokutai / shi" has three clauses, "quotient, sho, award, title, chapter, testimony, small, quotient,
"General, sex, Akira, Akira, Akira, Tadashi, Ministry, Sho," + "special treatment" + "shi, city, samurai, poetry, teacher, history, four, Mr, Shi, death,
There are conversion candidates such as "word, visual, child". Example 5: If "Sho / Utoku / Taishi" is used as three bunsetsu, then "sign, calligraphy, cord, first" + "virtues" + "Taishi (Yamana),
Prince (town name), ambassador (independent word), ambition (independent word), Thailand (country) +, other meaning +, Tai +, Tai +, opposite, sea bream +
There are conversion candidates, such as "shi, soldier, body + shi".

【０００８】従って、このように文節の区切り方により
多数の変換候補が推測される。このためプログラムが
「しょうとくたいし」を１文節として処理できなかった
場合、ユーザー辞書に「聖徳太子」の単語が登録されて
いても、別の意味の変換候補で変換処理される。Therefore, a large number of conversion candidates are inferred according to the way of segmenting the clauses. Therefore, if the program cannot process "Shoutokutaishi" as one phrase, even if the word "Shotoku Taishi" is registered in the user dictionary, it is converted by a conversion candidate having a different meaning.

【０００９】ここで、別の方法として、共起（用例）関
係を利用して「聖徳（大学名）」＋「太子（町名）」を
共起辞書（複合用例辞書）に登録する方法がある。しか
し、「聖徳」＋「太子」を登録することができるが、先
行文節の単語の意味から人名とはならないため、後続文
節の変換処理で不都合が生じることがある。例えば、
「聖徳太子にいった」の「いった」が、「人名＋に＋言
った」か「場所＋に＋行った」か区別がつかない。しか
し、「聖徳太子」に人名という意味コードがあれば、
「しょうとくたいしにいった」から「聖徳太子に言っ
た」に変換処理することは可能になる。Here, as another method, there is a method of registering "Shotoku (university name)" + "Taishi (town name)" in the co-occurrence dictionary (composite example dictionary) using the co-occurrence (example) relationship. .. However, although "Shotoku" + "Taishi" can be registered, since it does not become a person's name from the meaning of the word in the preceding clause, inconvenience may occur in the conversion processing of the succeeding clause. For example,
It is impossible to distinguish whether "I went to Prince Shotoku" from "I said to the person +" or "I went to the place +". However, if "Shotoku Taishi" has a meaning code of a personal name,
It is possible to convert from "I went to Shotokutai" to "I told Prince Shotoku".

【００１０】すなわち、ユーザー辞書に登録する際、文
節のわかち（区切り）位置を正確に区切るために、この
ような仮名文字列があれば強制的に文節のわかちを行わ
ないことを指示するフラッグをつける。このため、ユー
ザー辞書に登録した単語は、仮名漢字変換時に意味が違
う文節のわかちになることを少なくすることができる。
勿論、このような分割不可のフラッグはユーザー辞書に
登録される単語以外の通常辞書に記憶されている単語に
もこのフラッグがつけられている。That is, when registering in the user dictionary, in order to accurately demarcate the position of demarcation (delimitation) of the phrase, a flag for instructing not to perform the demarcation of the phrase if such a kana character string is present. Put on. For this reason, words registered in the user dictionary can be less likely to be confused with clauses having different meanings when converting into kana-kanji.
Of course, such non-dividable flags are also attached to words stored in the normal dictionary other than the words registered in the user dictionary.

【００１１】この発明は以上の事情を考慮してなされた
もので、辞書（通常辞書、ユーザー辞書）に登録される
単語に対して、上記の仮名文字列があれば、わかちを行
わないことを指示するフラッグをつけて登録する。この
フラッグにより、ベた（文節のわかちをする前の仮名文
字列）一括変換処理中の仮名漢字変換の精度を高めるこ
とを可能にした仮名漢字変換処理装置を提供する。The present invention has been made in consideration of the above circumstances, and if a word registered in a dictionary (ordinary dictionary or user dictionary) has the above kana character string, it will not be shared. Register with a flag to instruct. By this flag, a kana-kanji conversion processing device capable of improving the accuracy of kana-kanji conversion during a solid (kana character string before segmentation) batch conversion process is provided.

【００１２】[0012]

【課題を解決するための手段】図１はこの発明の基本構
成を示すブロック図である。図１に示すようにこの発明
の連文節変換機能を有する仮名漢字変換処理装置は、キ
ー入力手段１０１と、表示画面を有する表示手段１０２
と、前記キー入力手段１０１から入力された仮名文字列
を記憶する入力文字記憶手段１０３と、この仮名文字列
を仮名文字文節にわかつための文節わかち手段１０４
と、この仮名文字文節を漢字もしくは漢字交じり文に変
換するための漢字が記憶されている仮名漢字変換辞書１
０５と、前記キー入力手段１０１の指示により前記入力
文字記憶手段１０３に記憶された仮名文字を前記仮名漢
字変換辞書１０５に基づいて漢字もしくは漢字交じり文
に変換する変換手段１０６と、前記キー入力手段１０１
の操作により特定の仮名文字文節とこれに対応する漢字
もしくは漢字交じり文を、前記仮名漢字変換辞書１０５
と変換手段１０６によって登録する特定文節変換辞書１
０７と、前記入力文字記憶手段１０３に記憶された仮名
文字列の中に前記特定文節変換辞書１０７に登録された
特定の仮名文字文節があれば、この特定の仮名文字文節
をわかたないように前記文節わかち手段１０４を制御す
るとともに、この特定の仮名文字文節に対応する漢字も
しくは漢字交じり文に変換する文節制御手段１０８と、
前記入力文字記憶手段に記憶された文字列を前記表示手
段１０２の表示画面に出力する出力手段１０９とを備え
たことを特徴とする仮名漢字変換処理装置である。FIG. 1 is a block diagram showing the basic configuration of the present invention. As shown in FIG. 1, a kana-kanji conversion processing device having a continuous phrase conversion function of the present invention has a key input means 101 and a display means 102 having a display screen.
And an input character storage means 103 for storing the kana character string input from the key input means 101, and a phrase segmentation means 104 for recognizing this kana character string into a kana character phrase.
And a kana-kanji conversion dictionary 1 that stores the kanji for converting this kana character clause into kanji or kanji-mixed sentences.
05, conversion means 106 for converting kana characters stored in the input character storage means 103 into kanji or kanji mixed sentences based on the kana-kanji conversion dictionary 105 according to an instruction from the key input means 101, and the key input means. 101
The Kana-Kanji conversion dictionary 105 is used to specify a particular Kana character clause and the corresponding Kanji or Kanji-mixed sentence.
And specific phrase conversion dictionary 1 registered by the conversion means 106
07, and if there is a specific kana character phrase registered in the specific phrase conversion dictionary 107 in the kana character string stored in the input character storage means 103, do not know this specific kana character phrase. A phrase control unit 108 for controlling the phrase segmentation means 104 and for converting into a kanji character or a kanji mixture sentence corresponding to the specific kana character segment.
The kana-kanji conversion processing device further comprises an output unit 109 for outputting the character string stored in the input character storage unit to the display screen of the display unit 102.

【００１３】なお、この発明においてキー入力手段１０
１としては、キーボード装置やタブレット装置、あるい
はポインティングデバイス等の入力装置が用いられる。
表示手段１０２としては、ＣＲＴディスプレイ装置、Ｌ
ＣＤ（液晶表示装置）、あるいはＥＬディスプレイ装置
が用いられる。入力文字記憶手段１０３、文節わかち手
段１０４、仮名漢字変換辞書１０５、変換手段１０６、
特定文節変換辞書１０７、文節制御手段１０８、出力手
段１０９としては、ＣＰＵ、ＲＯＭ、ＲＡＭ、Ｉ／Ｏポ
ートからなるマイクロコンピュータシステムを用いるの
が便利である。特に、入力文字記憶手段１０３として
は、通常ＲＡＭが用いられ、仮名漢字変換辞書１０５、
特定文節変換辞書１０７としては、外部記憶装置が用い
られる。In the present invention, the key input means 10
As 1, a keyboard device, a tablet device, or an input device such as a pointing device is used.
As the display means 102, a CRT display device, L
A CD (liquid crystal display device) or an EL display device is used. Input character storage means 103, phrase segmentation means 104, kana-kanji conversion dictionary 105, conversion means 106,
As the specific phrase conversion dictionary 107, the phrase control unit 108, and the output unit 109, it is convenient to use a microcomputer system including a CPU, a ROM, a RAM, and an I / O port. In particular, a RAM is usually used as the input character storage means 103, and the kana-kanji conversion dictionary 105,
An external storage device is used as the specific phrase conversion dictionary 107.

【００１４】[0014]

【作用】この発明に従えば、図１に示すように、連文節
変換機能を有する仮名漢字変換処理装置おいて、キー入
力手段１０１は、文書作成のための仮名文字列を入力す
るとともに特定文節変換辞書１０７に特定の仮名文字文
節とこれに対応する漢字もしくは漢字交じり文を、仮名
漢字変換辞書１０５と変換手段１０６によって登録す
る。表示手段１０２は、入力文字記憶手段１０３に記憶
された文字列を表示画面に表示する。入力文字記憶手段
１０３は、キー入力手段１０１から入力された仮名文字
列を記憶する。According to the present invention, as shown in FIG. 1, in a kana-kanji conversion processing device having a continuous phrase conversion function, the key input means 101 inputs a kana character string for document creation and performs specific phrase conversion. The kana-kanji conversion dictionary 105 and the conversion means 106 register a specific kana character clause and a kanji or kanji mixture sentence corresponding thereto in the dictionary 107. The display means 102 displays the character string stored in the input character storage means 103 on the display screen. The input character storage means 103 stores the kana character string input from the key input means 101.

【００１５】文節わかち手段１０４は、入力文字記憶手
段１０３に記憶された仮名文字列を仮名文字文節にわか
つ。仮名漢字変換辞書１０５には、入力文字記憶手段１
０３に記憶された仮名文字列の各仮名文字文節を漢字も
しくは漢字交じり文に変換するための漢字が記憶されて
いる。変換手段１０６は、キー入力手段１０１の指示に
より入力文字記憶手段１０３に記憶された仮名文字を仮
名漢字変換辞書１０５に基づいて漢字もしくは漢字交じ
り文に変換する。The bunsetsu wording means 104 passes the kana character string stored in the input character storage means 103 to a kana character bunsetsu. The kana-kanji conversion dictionary 105 has an input character storage unit 1
Kanji for converting each kana character clause of the kana character string stored in 03 into a kanji or kanji mixture sentence is stored. The conversion means 106 converts the kana characters stored in the input character storage means 103 into kanji or kanji mixed sentences based on the kana-kanji conversion dictionary 105 according to an instruction from the key input means 101.

【００１６】特定文節変換辞書１０７には、キー入力手
段１０１の操作により特定の仮名文字文節とこれに対応
する漢字もしくは漢字交じり文を、仮名漢字変換辞書１
０５と変換手段１０６によっては登録する。文節制御手
段１０８は、入力文字記憶手段１０３に記憶された仮名
文字列の中に特定文節変換辞書１０７に登録された特定
の仮名文字文節があれば、この特定の仮名文字文節をわ
かたにように文節わかち手段１０４を制御するととも
に、この特定の仮名文字文節に対応する漢字もしくは漢
字交じり文に変換する。出力手段１０９は、入力文字記
憶手段１０３に記憶された文字列を表示手段１０２の表
示画面に出力する。In the specific phrase conversion dictionary 107, a specific kana character phrase and the corresponding kanji or kanji mixed sentence are operated by operating the key input means 101, and the kana-kanji conversion dictionary 1
05 and the conversion means 106 to register. If there is a specific kana character clause registered in the specific phrase conversion dictionary 107 in the kana character string stored in the input character storage means 103, the phrase control means 108 will know the specific kana character clause. In addition to controlling the bunsetsu wakachi means 104, it converts into a kanji or kanji mixed sentence corresponding to this particular kana character bunsetsu. The output unit 109 outputs the character string stored in the input character storage unit 103 to the display screen of the display unit 102.

【００１７】[0017]

【実施例】以下図に示す実施例に基づいてこの発明を詳
述する。なお、これによってこの発明は限定されるもの
ではない。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS The present invention will be described in detail based on the embodiments shown in the drawings. The present invention is not limited to this.

【００１８】図２はこの発明の一実施例の構成を示すブ
ロック図である。この図において、１は仮名文字キーや
数値キー、変換キー、ファンクションキーを備えたキー
ボードであり、このキーボード１から仮名文字列や仮名
漢字変換処理のための入力操作がされる。２は制御部で
あり、制御プログラムの実行を行うマイクロプロセッサ
と、この発明の仮名漢字変換処理を行う制御プログラム
やテーブル等のＲＯＭ（リード・オンリ・メモリ）及び
外部記憶装置（図示しない）から構成されている。FIG. 2 is a block diagram showing the configuration of an embodiment of the present invention. In the figure, reference numeral 1 is a keyboard provided with kana character keys, numerical keys, conversion keys, and function keys, and an input operation for kana character string and kana-kanji conversion processing is performed from this keyboard 1. Reference numeral 2 denotes a control unit, which comprises a microprocessor for executing the control program, a control program for executing the kana-kanji conversion processing of the present invention, a ROM (read only memory) such as a table, and an external storage device (not shown). Has been done.

【００１９】３は入力された仮名文字列や仮名漢字交じ
り文字列に変換されたデータを表示する表示部であり、
ＣＲＴディスプレイ装置やＬＣＤ（液晶ディスプレイ装
置）が用いられる。４は意味コードを含んだ自立語、固
有名詞、接辞語等の通常辞書である。５は文節間の共起
（用例）関係を蓄えた共起辞書（複合用例辞書）であ
る。６はユーザーが登録した単語の読みと漢字表記と品
詞・分類および分割不可のフラッびが登録されているユ
ーザー辞書である。４、５、６の各辞書はＲＯＭ／ＲＡ
Ｍおよび外部記憶装置（図示しない）から構成されてい
る。Reference numeral 3 denotes a display unit for displaying the data converted into the inputted kana character string or kana-kanji mixed character string,
A CRT display device or LCD (liquid crystal display device) is used. Reference numeral 4 is a normal dictionary including independent words, proper nouns, affixes, etc., including a meaning code. Reference numeral 5 is a co-occurrence dictionary (composite example dictionary) that stores the co-occurrence (example) relationship between clauses. Reference numeral 6 is a user dictionary in which the reading of the words registered by the user, the kanji notation, the part of speech / classification, and the indivisible flag are registered. ROMs / RAs for 4, 5 and 6 dictionaries
M and an external storage device (not shown).

【００２０】７はバッファであり、文字バッファ７１と
学習バッファ７２と同音語バッファ７３から構成されて
いる。文字バッファ７１はキーボード１から入力された
文字列を蓄えておくバッファである。学習バッファ７２
は単語の学習を行うバッファである。同音語バッファ７
３は仮名漢字変換処理の際の同音語を蓄えておくバッフ
ァである。Reference numeral 7 is a buffer, which is composed of a character buffer 71, a learning buffer 72, and a homophone word buffer 73. The character buffer 71 is a buffer for storing a character string input from the keyboard 1. Learning buffer 72
Is a buffer for learning words. Homophone buffer 7
Reference numeral 3 is a buffer for storing homophones in the Kana-Kanji conversion process.

【００２１】キーボード１から入力された仮名文字列は
文字バッファ７１に蓄えられる。次にユーザーが変換の
指示をするか、あるいは変換のタイミングになったと
き、制御部２は通常辞書４及び共起辞書５及びユーザー
辞書６及びその他のテーブル等を参照して仮名漢字変換
を行い、同音語バッファ７３に変換結果を蓄えられる。
第一候補以外の候補もキーボード１の操作により表示部
３に表示させて選択することができる。キーボード１の
操作によって同音語の選択を行うと、その結果が学習バ
ッファ７２に蓄えられる。The kana character string input from the keyboard 1 is stored in the character buffer 71. Next, when the user gives a conversion instruction or when it comes to the conversion timing, the control unit 2 refers to the normal dictionary 4, the co-occurrence dictionary 5, the user dictionary 6, and other tables to perform the kana-kanji conversion. The conversion result can be stored in the homophone word buffer 73.
Candidates other than the first candidate can be displayed on the display unit 3 and selected by operating the keyboard 1. When a homophone is selected by operating the keyboard 1, the result is stored in the learning buffer 72.

【００２２】図３はユーザー辞書の辞書作成画面を示す
説明図である。図３に示すような辞書作成画面におい
て、見出し＝「しょうとくたいし」、登録後＝「聖徳太
子」、品詞＝名詞、分類＝１（人物）…というようにユ
ーザー辞書に登録する例をあげている。ここで、「しょ
うとくたいし」＝「聖徳太子」（人物）（分割不可）と
したい場合には、名詞（固有）＋分類（人物）の後に分
割不可のＸ（フラッグ）を指定する。FIG. 3 is an explanatory view showing a dictionary creation screen of the user dictionary. In the dictionary creation screen as shown in FIG. 3, an example of registering in the user dictionary, such as headline = “Shoutokutaishi”, post-registration = “Shotoku Taishi”, part of speech = noun, classification = 1 (person) ... ing. Here, in order to set “Shoutokutai” = “Shotoku Taishi” (person) (undividable), an indivisible X (flag) is specified after the noun (unique) + classification (person).

【００２３】図４はユーザー辞書の格納領域を示す説明
図である。図４に示すように、図３で登録された単語の
読み、表記、品詞、分類、分割について、ユーザー辞書
の格納領域において、それぞれ格納される。FIG. 4 is an explanatory view showing the storage area of the user dictionary. As shown in FIG. 4, the reading, notation, part of speech, classification, and division of the words registered in FIG. 3 are stored in the storage area of the user dictionary.

【００２４】図５はこの実施例の仮名漢字変換処理のフ
ローチャートである。図５に示すように、以下入力され
た仮名文字列（べた）、読み「しょうとくたいしにいっ
た」を例にその処理の流れに沿って説明する。FIG. 5 is a flowchart of the kana-kanji conversion processing of this embodiment. As shown in FIG. 5, the input kana character string (solid) and the reading “Shoutokutaishitai” will be described as an example along the flow of the processing.

【００２５】「しょうとくたいしにいった」の仮名文字
列を文字バッファ７１に取り込む（ステップ５０１）。
次に「しょうとくたいしにいった」の仮名文字列に対し
て先行文節の抽出行う（ステップ５０２）。ステップ５
０２では、「しょうとくたいしに」、「しょうとくたい
し」、「しょうとく」、「しょうと」、「しょう」、
「しょ」がそれぞれ先行文節として抽出される。ステッ
プ５０２で抽出された先行文節に続いて、次の文節（後
続文節）の抽出を行う（ステップ５０３）。The kana character string "Shoutokutai ni ita" was fetched into the character buffer 71 (step 501).
Next, the preceding clause is extracted from the kana character string of "Shoutokutai ni ita" (step 502). Step 5
In 02, "Shoutokutai", "Shoutokutai", "Shoutoku", "Shoto", "Sho",
“Sho” is extracted as the preceding clause. Following the preceding clause extracted in step 502, the next clause (successor clause) is extracted (step 503).

【００２６】先行文節が「しょうとくたいしに」なら、
後続文節は「いった」となる。先行文節が「しょうとく
たいし」なら、後続文節は「にいった」となる。先行文
節が「しょうとく」なら、後読文節は「たいし」又は
「たい」となる。先行文節が「しょうと」なら、後続文
節は「くたい」となる。先行文節が「しょう」なら、後
続文節は「とくたい」となる。先行文節が「しょ」な
ら、後続文節は「うとく」となる。このように、先行文
節に従って後続文節が抽出されるとステップ５０４に進
む。If the preceding phrase is "Shoutokutaishi ni",
Subsequent clauses are "ta". If the preceding phrase is "Shoutokutaishi", the subsequent phrase will be "Nitani". If the preceding clause is "shoutoku", the look-ahead clause is "taishi" or "tai". If the preceding phrase is "Shouto", the subsequent phrase is "Kutai". If the preceding phrase is "sho", the subsequent phrase will be "Tokutai". If the preceding clause is "sho", the following clause is "utoku". In this way, when the subsequent clause is extracted according to the preceding clause, the process proceeds to step 504.

【００２７】次に、先行文節と後続文節との関係を調べ
先行文節と後続文節との区切り位置で最適の位置を決め
る（ステップ５０４）。漢字２字同士の結びつきが強調
されたので「しょうとく」と「たいし」の文節で切れる
可能性も多々あった。この発明においての「しょうとく
たいし］から始まる文字列には、強制的に文節のわかち
を行わないようにフラッグがつけられているので、「た
いし」の「し」まで分割不可になつている。Next, the relationship between the preceding clause and the succeeding clause is examined, and the optimum position is determined by the delimiter position between the preceding clause and the succeeding clause (step 504). Since the connection between two Kanji characters was emphasized, there was a possibility that the words "Shoutoku" and "Taishi" would be broken. In the present invention, the character string starting with "Shoutokutai" has a flag attached to it so that the phrase is not compulsorily cut. There is.

【００２８】よって、文節は「しょうとくたいしに」＋
「いった」か、「しょうとくたいし」＋「にいった」か
の２通りのわかち方になるが、「にいった」の候補は辞
書に見当たらないので、「しょうとくたいしに」＋「い
った」が選択される。ステップ５０５では、入力された
仮名文字列の文節わかち処理が終了したかどうか判断し
て，終了いなければステップ５０３に戻り、終了してい
ればステップ５０６に進む。Therefore, the phrase is "Shoutoku taishi ni" +
There are two ways of sharing the word, "I went" or "I went to" + "I went to", but I can't find a candidate for "I went" in the dictionary, so "I want to go to" + "I" is selected. In step 505, it is judged whether or not the phrase segmentation processing of the input kana character string is completed. If not completed, the process returns to step 503, and if completed, the process proceeds to step 506.

【００２９】ＬＯＯＰ１の処理を繰り返すことで、入力
された仮名文字列の文節わかち処理が終了する。文節に
分けられた候補は、各文節毎にその結果を同音語バッフ
ァ７３に蓄える。同音語バッファ７３は各文節毎の個別
のバッファを有する。この場合後続文節「いった」は、
「言った」、「要った」、「行った」、「入った」、
「煎った」、「炒った」、「逝った」、「云った」等の
候補が格納される。By repeating the process of LOOP1, the phrase segmentation process of the input kana character string is completed. The candidates divided into phrases are stored in the homophone buffer 73 as a result for each phrase. The homophone buffer 73 has a separate buffer for each phrase. In this case, the subsequent clause "I"
"I said,""Ineeded,""Iwent,""Entered,"
Candidates such as "roasted", "roasted", "passed", and "said" are stored.

【００３０】ステップ５０６の同音語選択処理は、各文
節毎に分けられた同音語候補の絞り込みを行う。この処
理では「いった」は複数の候補があるが共起関係（人名
＋に＋言った）より「言った」が選択される。ＬＯＯＰ
２の処理を繰り返すことで、入力された仮名文字列の仮
名漢字変換処理が終了する（ステップ５０７）。このよ
うにして、「しょうとくたいしにいった」は「聖徳太子
に言った」に変換される。The homophone selection process of step 506 narrows down homophone candidates divided for each phrase. In this process, although there are multiple candidates for "I", "said" is selected based on the co-occurrence relationship (person + + said). LOOP
By repeating the processing of 2, the kana-kanji conversion processing of the input kana character string is completed (step 507). In this way, "I went to Shotokutai" is converted to "I told Prince Shotoku."

【００３１】[0031]

【発明の効果】この発明によれば、辞書（通常辞書、ユ
ーザー辞書）に登録される単語に対し、文節区切りの分
割不可のフラッグをつけて登録することにより、ベた一
括変換処理中の仮名漢字変換精度を上げることができ
る。According to the present invention, a word that is registered in a dictionary (normal dictionary or user dictionary) is attached with a non-dividable flag that is a segment break, and is registered. The kanji conversion accuracy can be improved.

[Brief description of drawings]

【図１】この発明の基本構成を示すブロック図。FIG. 1 is a block diagram showing the basic configuration of the present invention.

【図２】この発明の一実施例を示すブロック図。FIG. 2 is a block diagram showing an embodiment of the present invention.

【図３】ユーザー辞書の辞書作成画面を示す説明図。FIG. 3 is an explanatory diagram showing a dictionary creation screen of a user dictionary.

【図４】ユーザー辞書の格納領域を示す説明図。FIG. 4 is an explanatory diagram showing a storage area of a user dictionary.

【図５】この実施例の仮名漢字変換処理のフローチャー
ト。FIG. 5 is a flowchart of kana-kanji conversion processing according to this embodiment.

[Explanation of symbols]

１キーボード２制御部３表示部４通常辞書５共起辞書６ユーザー辞書７バッファ 1 keyboard 2 control unit 3 display unit 4 normal dictionary 5 co-occurrence dictionary 6 user dictionary 7 buffer

Claims

[Claims]

1. A kana-kanji conversion processing device having a continuous phrase conversion function, key input means, display means having a display screen, and input character storage means for storing the kana character string input from the key input means. Kana-kanji conversion dictionary that stores this kana character string into kana-character bunsetsu and kana to convert this kana-character bunsetsu into kanji or kanji-mixed sentence, and instructions of the key input means. A conversion means for converting the kana character stored in the input character storage means into a kanji or kanji mixed sentence based on the kana-kanji conversion dictionary, and a specific kana character phrase corresponding to this by operating the key input means. A kanji or a mixture of kanji characters is stored in the kana-kanji conversion dictionary and a specific phrase conversion dictionary registered by the conversion means, and in the input character storage means. If there is a specific kana character phrase registered in the specific phrase conversion dictionary in the specified kana character string, the phrase segmentation means is controlled so as not to know this specific kana character segment, and A clause control means for converting into kanji or a mixture of kanji characters corresponding to a kana character clause, and an output means for outputting the character string stored in the input character storage means to the display screen of the display means are provided. Kana-Kanji conversion processing device.