JPH0581238A

JPH0581238A - 'kana'/'kanji' converter

Info

Publication number: JPH0581238A
Application number: JP3241496A
Authority: JP
Inventors: Masako Morishita; 昌子森下
Original assignee: Brother Industries Ltd
Current assignee: Brother Industries Ltd
Priority date: 1991-09-20
Filing date: 1991-09-20
Publication date: 1993-04-02

Abstract

PURPOSE:To more effectively execute rule conversion processing without increasing the memory capacity of a rule dictionary by applying the same information to plural words belonging to the same category and describing the words in a rule item. CONSTITUTION:A 'KANA' (Japanese syllabary) reading character string is inputted from an input means 1 and 'KANA'/'KANJI' (Chinese character) converting means 2 executes the 'KANA'/'KANJI' conversion of the inputted character string based upon a reference dictionary 3 storing notation corresponding to the reading of each word and information expressing the category of the 'KANA'/'KANJI' conversion result is stored in a conversion result storing means 5 and a rule stored in a rule dictionary 6 is compared with the contents stored in the means 5 by a rule retrieving means 7 to retrieve the existence of a role having a word belonging to the same category and coincidence of a pattern for words other than the word. At the time of retrieving a coincident rule, the contents of the means 5 are rewritten by a rule rewriting means 8 in accordance with a retrieved rule rewriting command and a 'KANA'/'KANJI' character string obtained by rewriting the contents of the means 5 is outputted.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、日本語ワードプロセッ
サ等におけるかな漢字変換処理装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a kana-kanji conversion processing device in a Japanese word processor or the like.

【０００２】[0002]

【従来の技術】従来、この種のかな漢字変換装置におい
ては、特開平３−１４２６５８号公報で開示されている
ように、単語の読みに対する漢字等の表記を記憶した基
本辞書を参照してかな漢字変換を行い、変換結果記憶部
にかな漢字変換結果が記憶される。そして、複数の単語
列のパターンとその書換え情報とを一組の規則として格
納したものを規則辞書とし、その規則辞書中の規則と一
致した規則が検索されたとき、前記かな漢字変換結果の
内容を書き換え、表示装置により表示するようにしてい
た。また、このような一連の作用を規則変換と称してい
た。2. Description of the Related Art Conventionally, in this type of kana-kanji conversion device, kana-kanji conversion is performed by referring to a basic dictionary that stores notations such as kanji for reading a word, as disclosed in Japanese Patent Laid-Open No. 3-142658. And the kana-kanji conversion result is stored in the conversion result storage section. Then, a pattern in which a plurality of word string patterns and their rewriting information are stored as a set of rules is used as a rule dictionary, and when a rule that matches a rule in the rule dictionary is searched, the content of the Kana-Kanji conversion result is displayed. It was rewritten and displayed on the display device. Moreover, such a series of actions was called rule conversion.

【０００３】例えば、「てがみをかく」と入力したとき
に、基本辞書を参照してかな漢字変換を行うとき、基本
辞書中に「各」、「核」、「書く」の順に記憶されてい
た場合は、第一候補の「各」が出力され、「手紙を各」
という誤変換がされる。そして、その「手紙を各」とい
う変換結果が変換結果記憶部に記憶される。[0003] For example, when inputting "tegami kagaku" and performing kana-kanji conversion by referring to the basic dictionary, when "each", "nucleus", and "writing" are stored in this order in the basic dictionary Will output the first candidate "each" and "letter each"
Is erroneously converted. Then, the conversion result “each letter” is stored in the conversion result storage unit.

【０００４】次に規則辞書を参照して規則変換を行うと
き、規則辞書中に＜Ｕ−手紙：Ｎ＞＜Ｕ−を：Ｎ＞＜Ｄ
−かく：書，く＞というパターンが記憶されておれば、
Ｕ因子は読み及び表記との完全一致を意味し、Ｄ因子は
読みとの一致を意味するので、変換結果記憶部に記憶さ
れている「手紙を各」の「各」の部分を「書く」に書き
換えて、「手紙を書く」という正しい変換結果にするも
のであった。ここで、規則項目中の書換え情報の位置に
あるＮマーカーは、書換えを行わないことを表すもので
ある。Next, when the rule conversion is performed by referring to the rule dictionary, <U-letter: N><U- is: N><D in the rule dictionary.
-Writing: If the pattern of writing, Ku> is stored,
The U factor means perfect matching with reading and notation, and the D factor means matching with reading. Therefore, "write" the "each" part of "each letter" stored in the conversion result storage unit. It was rewritten into the correct conversion result of "writing a letter". Here, the N marker at the position of the rewriting information in the rule item represents that the rewriting is not performed.

【０００５】[0005]

【発明が解決しようとする課題】しかしながら、前記規
則変換を行うときに、＜Ｕ−手紙：Ｎ＞＜Ｕ−を：Ｎ＞
＜Ｄ−かく：書，く＞という規則は、「手紙を各」、
「手紙を核」等の誤変換にしか適用されず、「書類を
各」、「書類を核」という誤変換に対しては別に＜Ｕ−
書類：Ｎ＞＜Ｕ−を：Ｎ＞＜Ｄ−かく：書，く＞という
規則を用意する必要があった。従って、規則辞書のメモ
リに占める容量が大きくなり、かつ規則辞書作成の労力
も大であった。However, when performing the rule conversion, <U-letter: N><U-:N>.
The rule <D-Writing: Writing, Ku> is “each letter”,
It is applied only to the incorrect conversion of "letter to core" etc., and separately to the incorrect conversion of "each document" and "document to core".
It was necessary to prepare a rule of document: N><U-:N><D-write: write, ku>. Therefore, the capacity of the rule dictionary in the memory is large, and the effort for creating the rule dictionary is large.

【０００６】本発明は、上述した問題点を解決するため
になされたものであり、同じカテゴリーに属する単語に
同じ情報を付して、規則項目に記述することにより、一
つの規則が一致する単語列の範囲を広げることによっ
て、規則辞書のメモリ容量を大きくすることなく、より
効果的に規則変換処理を行うことができるかな漢字変換
装置を提供することができることを目的とする。The present invention has been made in order to solve the above-mentioned problems. Words that belong to the same category are given the same information and described in rule items, so that words that match one rule An object of the present invention is to provide a kana-kanji conversion device that can perform rule conversion processing more effectively without increasing the memory capacity of the rule dictionary by expanding the range of columns.

【０００７】[0007]

【課題を解決するための手段】この問題を解決するため
に本発明のかな漢字変換装置は、図１に示すように、か
な読み文字列を入力するための入力手段と、単語の読み
に対する表記とその単語のカテゴリーを表す情報とを記
憶した基本辞書と、その基本辞書に基づいて、かな漢字
変換するかな漢字変換手段と、そのかな漢字変換手段に
よる変換結果を記憶する変換結果記憶手段と、前記基本
辞書の単語の読み、表記、カテゴリーなどの情報からな
る単語列のパターンと、その書換え情報とを有する規則
を記憶した規則辞書と、前記変換結果記憶手段の内容に
ついて、前記規則辞書の同じカテゴリーの単語を有し、
かつその単語以外のパターンが一致するものを検索する
規則検索手段と、その規則検索手段により一致したパタ
ーンが検索されたとき、そのパターンの規則の書換え情
報に基づいて該当の変換結果記憶手段の内容を書き換え
る規則書換手段と、前記変換結果記憶手段の内容をかな
漢字文字列として出力する出力手段とを備えている。In order to solve this problem, a kana-kanji conversion device of the present invention, as shown in FIG. 1, has an input means for inputting a kana-reading character string and a notation for reading a word. A basic dictionary that stores information indicating the category of the word, a kana-kanji conversion means that performs kana-kanji conversion based on the basic dictionary, a conversion result storage means that stores the conversion result by the kana-kanji conversion means, and the basic dictionary. With respect to the contents of the conversion result storage means and the rule dictionary that stores a rule having a word string pattern consisting of information such as word reading, notation, and category, and rewriting information, the words of the same category of the rule dictionary are stored. Have,
And the content of the corresponding conversion result storage means based on the rewriting information of the rule of the pattern when the rule search means that searches for a pattern other than the word that matches and the matching pattern is searched by the rule search means And rule output means for outputting the contents of the conversion result storage means as a kana-kanji character string.

【０００８】[0008]

【作用】上記の構成を有する本発明のかな漢字変換装置
では、かな読み文字列は、入力手段から入力され、単語
の読みに対応した表記とその単語のカテゴリーを表す情
報とを記憶した基本辞書に基づいて、かな漢字変換手段
によりかな漢字変換される。その後、かな漢字変換され
た結果が変換結果記憶手段に記憶され、規則検索手段に
より規則辞書中に格納されている規則から変換結果記憶
手段に記憶されている項目と一致する規則が検索され
る。このとき、規則検索手段は、前記規則辞書中に記憶
されている規則と変換結果記憶手段に記憶されている内
容とを比較して、同じカテゴリーの単語を有し、かつそ
の単語以外のパターンが一致するものがあるか否かを検
索する。In the kana-kanji conversion device of the present invention having the above-mentioned structure, the kana-reading character string is input from the input means and stored in the basic dictionary storing the notation corresponding to the reading of the word and the information representing the category of the word. Based on this, kana-kanji conversion is performed by the kana-kanji conversion means. After that, the result of the kana-kanji conversion is stored in the conversion result storage means, and the rule search means searches the rules stored in the rule dictionary for a rule that matches the item stored in the conversion result storage means. At this time, the rule search means compares the rules stored in the rule dictionary with the contents stored in the conversion result storage means, has words in the same category, and patterns other than the words Search for a match.

【０００９】そして、前記規則検索手段にて変換結果記
憶手段の内容と一致した規則が検索されたときに、規則
書換え手段により検索された規則の書換え指示に従っ
て、変換結果記憶手段中の書換えが行われ、変換結果記
憶手段の内容をかな漢字文字列にしたものが出力手段に
より出力される。Then, when the rule retrieving means retrieves a rule that matches the contents of the conversion result storing means, the rewriting in the conversion result storing means is performed according to the rule rewriting instruction retrieved by the rule rewriting means. The output means outputs the kana-kanji character string of the contents of the conversion result storage means.

【００１０】[0010]

【実施例】以下、本発明を具体化した一実施例を図面を
参照して説明する。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS An embodiment of the present invention will be described below with reference to the drawings.

【００１１】かな漢字変換装置全体の構成を図２を参照
して説明する。The overall structure of the kana-kanji conversion device will be described with reference to FIG.

【００１２】かな漢字変換をする文字列を入力するため
のキーボード等の入力装置１０は、装置全体を制御する
ための中央処理装置（ＣＰＵ）１２に接続されている。
記憶手段としてのＲＡＭ２０はＣＰＵ１２に接続され、
かな漢字変換された結果を記憶するための変換結果記憶
領域２２と、入力されたかな読み文字列を記憶するため
の読み入力バッファ領域２４と、変換結果記憶領域２２
の内容をかな漢字文字列として記憶するための出力バッ
ファ領域２６と、ポインタ情報を記録するワークエリア
２８とを格納している。An input device 10 such as a keyboard for inputting a character string for Kana-Kanji conversion is connected to a central processing unit (CPU) 12 for controlling the entire device.
The RAM 20 as a storage means is connected to the CPU 12,
A conversion result storage area 22 for storing the result of kana-kanji conversion, a reading input buffer area 24 for storing the input kana reading character string, and a conversion result storage area 22.
An output buffer area 26 for storing the contents of the above as a kana-kanji character string and a work area 28 for recording pointer information are stored.

【００１３】変換結果記憶領域２２は、図３に示すよう
に、変換結果全体が単語単位で情報を付して記憶されて
おり、後述する基本辞書４２中に記憶されているそれぞ
れの単語を識別するための固有の数値である単語ＩＤ２
２ａと、単語の読み長２２ｂと、単語の表記２２ｃと、
同音語先頭ＩＤ２２ｄと、同音語末尾ＩＤ２２ｅと、意
味コード２２ｆがデータとして各単語ごとに記憶されて
いる。As shown in FIG. 3, the conversion result storage area 22 stores the entire conversion result with information added word by word, and identifies each word stored in a basic dictionary 42 described later. Word ID2 that is a unique numerical value for
2a, word reading length 22b, word notation 22c,
The same-sound word start ID 22d, the same-sound word end ID 22e, and the meaning code 22f are stored as data for each word.

【００１４】尚、同音語先頭ＩＤ２２ｄは、入力装置１
０から入力された文字列（単語）と同音であり、基本辞
書４２に記憶された単語ＩＤ４２ａの数値が最も小さい
ものを示すものである。また、同音語末尾ＩＤ２２ｅ
は、入力装置１０から入力された文字列（単語）と同音
であり、基本辞書４２に記憶された単語ＩＤ４２ａの数
値が最も大きいものを示すものである。The same-sound word head ID 22d is the input device 1
It shows the same sound as the character string (word) input from 0, and the smallest numerical value of the word ID 42a stored in the basic dictionary 42. Also, the end of homophone ID22e
Indicates a character having the same sound as the character string (word) input from the input device 10 and having the largest numerical value of the word ID 42a stored in the basic dictionary 42.

【００１５】意味コード２２ｆは、基本辞書４２中の単
語ごとに意味分類によって付されたコードであり、同一
の意味コードを持つものは同一の意味概念に属する。例
えば、病気という意味概念に［１４５］という意味コー
ドが振られていたとしたら、「頭痛」「風邪」「腹痛」
などの単語はすべて、［１４５］の意味コードを意味情
報として持っている。The meaning code 22f is a code assigned to each word in the basic dictionary 42 according to the meaning classification. Those having the same meaning code belong to the same meaning concept. For example, if a meaning code of "145" is assigned to the meaning concept of illness, "headache""cold""abdominalpain"
All words such as have the meaning code of [145] as meaning information.

【００１６】プログラムや辞書等を格納するＲＯＭ３０
はＣＰＵ１２と接続され、プログラム部３２と辞書部４
０とからなる。プログラム部３２は、かな漢字変換プロ
グラム３４と、規則検索プログラム３６と、規則書換え
プログラム３８とを格納している。ROM 30 for storing programs, dictionaries, etc.
Is connected to the CPU 12, and the program section 32 and the dictionary section 4
It consists of 0 and. The program section 32 stores a kana-kanji conversion program 34, a rule search program 36, and a rule rewriting program 38.

【００１７】上記規則検索プログラム３６と規則書換え
プログラム３８とを合わせ規則変換処理と呼ぶ。The rule retrieval program 36 and the rule rewriting program 38 are collectively referred to as rule conversion processing.

【００１８】辞書部４０は基本辞書４２と、接続テーブ
ル４４と、規則辞書４６とから構成されている。基本辞
書４２は、図４に示すように、その単語の固有の識別番
号たる単語ＩＤ４２ａと、その単語の読み４２ｂと、そ
の単語の漢字表記４２ｃと、その単語の品詞情報４２ｄ
と、意味コード４２ｆとが記憶されている。The dictionary unit 40 comprises a basic dictionary 42, a connection table 44, and a rule dictionary 46. As shown in FIG. 4, the basic dictionary 42 includes a word ID 42a which is a unique identification number of the word, a reading 42b of the word, a kanji notation 42c of the word, and a part-of-speech information 42d of the word.
And a meaning code 42f are stored.

【００１９】接続テーブル４４は、単語同士の接続関係
を品詞情報により規定するデータとして記憶されてい
る。The connection table 44 is stored as data that defines the connection relationship between words by the part-of-speech information.

【００２０】規則辞書４６は、図５に示すように、一般
に規則の項目は、＜検索因子−単語パターン：書換え情
報＞という形で書かれており、この項目の組合せで一つ
の規則がつくられる仕組みになっている。検索因子と
は、規則変換の書き換え方法の種類を示すもので、Ｕ因
子、Ｄ因子、Ｙ因子、ＵＭ因子、ＤＭ因子があり、Ｕ因
子は変換結果記憶領域２２に記憶されている単語につい
ての読み及び表記と完全一致を意味し、Ｄ因子は変換結
果記憶領域２２に記憶されている単語についての読みつ
まりその単語と同音で一致することを意味する。また、
Ｙ因子は変換結果記憶領域２２の中に記憶されている単
語列（一単語でもよい）についての読みと一致すること
を意味する。ＵＭ因子は変換結果記憶領域２２に記憶さ
れている単語についての意味コードと一致することを意
味する。ＤＭ因子は変換結果記憶領域２２に記憶されて
いる単語の読みの同音語の全ての中で、同じ意味コード
をもつ単語に一致することを意味する。単語パターン４
６ａ〜４６ｃとは、その規則の各々の項目の該当する単
語のことを示し、ここには単語の読みまたは、表記また
は、意味コードが入る。In the rule dictionary 46, as shown in FIG. 5, rule items are generally written in the form of <search factor-word pattern: rewriting information>, and one rule is created by combining these items. It works. The search factor indicates the type of rule conversion rewriting method, and includes a U factor, a D factor, a Y factor, a UM factor, and a DM factor, and the U factor represents a word stored in the conversion result storage area 22. It means a perfect match with the reading and the notation, and the D factor means the reading of a word stored in the conversion result storage area 22, that is, the word has the same sound. Also,
The Y factor means that it matches the reading of the word string (may be one word) stored in the conversion result storage area 22. The UM factor means that it matches the meaning code of the word stored in the conversion result storage area 22. The DM factor means to match a word having the same meaning code among all the homophones of the reading of the word stored in the conversion result storage area 22. Word pattern 4
6a to 46c indicate the corresponding words of each item of the rule, and the reading or notation of the words or the meaning code is entered here.

【００２１】書換え情報は、無変化マーカーＮ、削除マ
ーカーＤ、単語列、または意味コードのいずれかであ
り、無変化マーカーＮが格納されているときは規則辞書
４６の項目に一致した変換結果記憶領域２２中の単語を
そのままにし、書き換えないことを表す。また、削除マ
ーカーＤが格納されているときはその項目を削除するこ
とを表し、単語列のときは規則辞書４６の項目と一致し
た変換結果記憶領域２２中の単語をその単語列に書き換
えることを表し、意味コードのときは基本辞書４２に記
憶されていれる単語中で同音語で同じ意味コードを持つ
単語に書き換えることを表す。ここでいう単語列とは、
複数の単語（一つの単語だけでもよい）が規則辞書４６
に単語ＩＤで記憶されたものであり、この単語ＩＤに基
づいて基本辞書４２を参照することにより、変換結果記
憶領域２２の単語読み長２２ｂ、同音語先頭ＩＤ２２
ｃ、同音語末尾ＩＤ２２ｄ等の設定が可能である。尚、
図５においては、理解しやすいように単語ＩＤの部分を
数値ではなく、読み、または表記で表している。The rewriting information is one of the unchanged marker N, the deleted marker D, the word string, and the meaning code. When the unchanged marker N is stored, the conversion result memory that matches the item of the rule dictionary 46 is stored. The word in the area 22 is left as it is and is not rewritten. When the deletion marker D is stored, it means to delete the item, and when it is a word string, the word in the conversion result storage area 22 that matches the item of the rule dictionary 46 is rewritten to the word string. In the case of the meaning code, it means rewriting to a word having the same meaning code as a homophone among the words stored in the basic dictionary 42. The word string here means
A plurality of words (may be only one word) is the rule dictionary 46.
Is stored in the conversion result storage area 22 by referring to the basic dictionary 42 based on this word ID.
It is possible to set c, the same-sound ending ID 22d, and the like. still,
In FIG. 5, the word ID portion is not read as a numerical value but is read or written for easy understanding.

【００２２】次に、このように構成されたかな漢字変換
装置の動作を図６のフローチャートを参照して説明す
る。Next, the operation of the kana-kanji conversion device configured as described above will be described with reference to the flowchart of FIG.

【００２３】例えば、読み文字列「みかんのはな」が入
力装置１０より入力されると、そのかな文字コードがＲ
ＡＭ２０の読み入力バッファ領域２４に記憶される（Ｓ
１０）。その後、ＲＯＭ３０のかな漢字変換プログラム
３４により基本辞書４２と、接続テーブル４４を参照し
て、読み入力バッファ領域２４に記憶されているかな文
字コードが漢字かな混じり文に変換され、漢字かな混じ
り文「未完の鼻」と変換さる。そして、その単語ＩＤと
ともに意味コード２２ｆを変換結果記憶領域２２に記憶
する（Ｓ１２）。そして、ＲＯＭ３０の規則検索プログ
ラム３６及び規則書換えプログラム３８により、変換結
果記憶領域２２の内容と規則辞書４６を参照して規則変
換処理が行われる（Ｓ１４）。For example, when the reading character string "Mikan no Hana" is input from the input device 10, the kana character code is R.
It is stored in the read input buffer area 24 of the AM 20 (S
10). Thereafter, the kana-kanji conversion program 34 of the ROM 30 refers to the basic dictionary 42 and the connection table 44 to convert the kana-character code stored in the reading input buffer area 24 into a kanji-kana mixed sentence, and the kanji-kana mixed sentence "unfinished Nose ”. Then, the meaning code 22f is stored in the conversion result storage area 22 together with the word ID (S12). Then, the rule retrieval program 36 and the rule rewriting program 38 of the ROM 30 perform the rule conversion process with reference to the content of the conversion result storage area 22 and the rule dictionary 46 (S14).

【００２４】次に、規則変換処理の具体的な処理につい
て図７、図８、図９のフローチャートを参照して説明す
る。Next, the specific processing of the rule conversion processing will be described with reference to the flowcharts of FIGS. 7, 8 and 9.

【００２５】まず、前記変換結果記憶領域２２に記憶し
た「未完の鼻」の先頭の単語「未完」にポインタＰ１を
設定し、ポインタＰ１のポインタ情報がＲＡＭ２０のワ
ークエリア２８に記憶される（Ｓ３０）。First, the pointer P1 is set to the first word "unfinished" of the "unfinished nose" stored in the conversion result storage area 22, and the pointer information of the pointer P1 is stored in the work area 28 of the RAM 20 (S30). ).

【００２６】次に、ポインタＰ１の指す単語と一致する
規則が規則辞書４６中にあるか否かを検索する（Ｓ３
２）。図９に示す規則検索サブルーチンにおいては、ま
ず、ポインタＰ５を規則辞書４６中の先頭の規則５−１
に設定し、ポインタ情報をワークエリア２８に記録する
（Ｓ３２０）。次に、ポインタＰ６をポインタＰ５指す
規則５−１の先頭の項目４６ａに設定し、ポインタ情報
がワークエリア２８に記憶される（Ｓ３２２）。次い
で、ポインタＰ７をポインタＰ１の指す変換結果記憶領
域２２中のかな漢字変換結果の先頭の単語「未完」に設
定する（Ｓ３２４）。Next, it is searched whether or not the rule matching the word pointed by the pointer P1 exists in the rule dictionary 46 (S3).
2). In the rule search subroutine shown in FIG. 9, first, the pointer P5 is set to the first rule 5-1 in the rule dictionary 46.
And the pointer information is recorded in the work area 28 (S320). Next, the pointer P6 is set to the head item 46a of the rule 5-1 which points the pointer P5, and the pointer information is stored in the work area 28 (S322). Next, the pointer P7 is set to the first word "unfinished" of the kana-kanji conversion result in the conversion result storage area 22 pointed to by the pointer P1 (S324).

【００２７】ここでは、ポインタＰ６の指す規則＜ＵＭ
−１３５：１３５＞とポインタＰ７の指す変換結果記憶
領域２２中の単語「未完」を比較する（３２６）。この
場合、規則と単語は一致せず（Ｓ３２６・ＮＯ）、ポイ
ンタＰ５の示す規則５−１が規則辞書４６中で最後の規
則ではないので（Ｓ３２８・ＮＯ）、ポインタＰ５を次
の規則５−２に移動しポインタ情報がワークエリア２８
に記憶され（Ｓ３３０）、Ｓ３２２に戻る。次に、ポイ
ンタＰ６をポインタＰ５指す規則５−２の先頭の項目４
７ａに設定し、ポインタ情報がワークエリア２８に記憶
される（Ｓ３２２）。次いで、ポインタＰ７をポインタ
Ｐ１の指す変換結果記憶領域２２中のかな漢字変換結果
の先頭の単語「未完」に設定する（Ｓ３２４）。Here, the rule indicated by the pointer P6 <UM
-135: 135> is compared with the word "incomplete" in the conversion result storage area 22 pointed to by the pointer P7 (326). In this case, the rule and the word do not match (S326 / NO), and the rule 5-1 indicated by the pointer P5 is not the last rule in the rule dictionary 46 (S328 / NO), so the pointer P5 is set to the next rule 5-. 2 and move pointer information to work area 28
(S330), the process returns to S322. Next, the first item 4 of the rule 5-2 that points the pointer P6 to the pointer P5
7a, and pointer information is stored in the work area 28 (S322). Next, the pointer P7 is set to the first word "unfinished" of the kana-kanji conversion result in the conversion result storage area 22 pointed to by the pointer P1 (S324).

【００２８】ここでは、ポインタＰ６の指す規則＜ＤＭ
−１４４：１４４＞とポインタＰ７の指す変換結果記憶
領域２２中の単語「未完」を比較する（３２６）。この
場合、ＤＭ因子が付されているので、基本辞書４２中の
単語で、「未完」の読み「みかん」の同音語の単語であ
って意味コード１４４（植物を表す）を持つ単語「蜜
柑」が意味コードで一致するので、規則と単語は一致し
（Ｓ３２６・ＹＥＳ）、ポインタＰ６が末尾項目ではな
いので（Ｓ３３２・ＮＯ）、ポインタＰ７を変換結果記
憶領域２２中に記憶されている「未完」の次の単語
「の」に移動し、ポインタＰ６も規則５−２（４７ｂ）
中の次の項目＜Ｕ−の：Ｎ＞（４６ｂ）に移動し、その
ポインタ情報をワークエリア２８に記憶し（Ｓ３３
４）、前記３２６に戻る。Here, the rule indicated by the pointer P6 <DM
-144: 144> is compared with the word "incomplete" in the conversion result storage area 22 pointed to by the pointer P7 (326). In this case, since the DM factor is added, the word “tangerine” in the basic dictionary 42, which is a homonym word of the reading “Mikan” of “incomplete” and having the meaning code 144 (representing a plant). Are matched in the meaning code, the rule and the word are matched (S326, YES), and the pointer P6 is not the last item (S332, NO). Therefore, the pointer P7 is stored in the conversion result storage area 22 as "unfinished". Move to the word "no" next to "," and the pointer P6 also moves to rule 5-2 (47b).
It moves to the next item <U-: N> (46b) and stores the pointer information in the work area 28 (S33).
4) Return to 326.

【００２９】ここで、ポインタＰ６の示す項目＜Ｕ−
の：Ｎ＞（４７ｂ）は、Ｕ因子が付されており、読み及
び表記で一致を意味する。従って、ポインタＰ７の示す
単語「の」と一致し（Ｓ３２６・ＹＥＳ）、ポインタＰ
６の示す項目４７ｃは末尾項目ではないので（Ｓ３３２
・ＮＯ）、ポインタＰ６を規則５−２中の次の項目＜Ｄ
−はな：花＞に、ポインタＰ７を変換結果記憶領域２２
中の次の単語「鼻」に進め、ポインタ情報をワークエリ
ア２８に記憶する（Ｓ３３４）。Here, the item indicated by the pointer P6 <U-
No: N> (47b) is attached with a U factor and means agreement in reading and notation. Therefore, the word "no" indicated by the pointer P7 matches (S326, YES), and the pointer P
The item 47c indicated by 6 is not the last item (S332).
-NO), the pointer P6 is set to the next item <D in rule 5-2.
-Hana: Hana>, and move the pointer P7 to the conversion result storage area 22.
The process advances to the next word "nose" in the middle and stores the pointer information in the work area 28 (S334).

【００３０】ポインタＰ６の示す項目＜Ｄ−はな：花＞
は、読みで一致を示すＤ因子が付されているので、ポイ
ンタＰ７の示す単語「鼻」と一致し（Ｓ３２６・ＹＥ
Ｓ）、Ｓ３３２に進む（Ｓ３２７・ＹＥＳ）。ここで、
ポインタ情報Ｐ６は末尾項目なので（Ｓ３３２・ＹＥ
Ｓ）、ワークエリア２８に規則フラグＯＮを記憶し（Ｓ
３３６）、図７に示すＳ３２に戻る。Item <D-hana: flower> indicated by pointer P6
Has a D factor indicating a match in reading, and therefore matches the word “nose” indicated by the pointer P7 (S326.YE
S) and S332 (YES in S327). here,
Since the pointer information P6 is the last item (S332.YE
S), the rule flag ON is stored in the work area 28 (S
336), and the process returns to S32 shown in FIG.

【００３１】尚、一致する規則が検索されないとき（Ｓ
３２６・ＮＯ）は、ポインタＰ５の指す規則が規則辞書
中で最後の規則になるまで（Ｓ３２８・ＮＯ）、ポイン
タＰ５を順次、次の規則へ移動し（Ｓ３３０）、一致す
る規則を検索する（Ｓ３２２〜Ｓ３３０）。ポインタＰ
５が示す規則が、規則辞書で最後の規則となったときは
（Ｓ３２８・ＹＥＳ）、一致する規則がなかったことを
示す規則フラグＯＦＦをワークエリア２８に記憶する
（Ｓ３３８）。When a matching rule is not searched (S
(326 / NO) moves the pointer P5 to the next rule in sequence (S330) until the rule pointed to by the pointer P5 is the last rule in the rule dictionary (S328 / NO) and searches for a matching rule (S330). S322-S330). Pointer P
When the rule indicated by 5 is the last rule in the rule dictionary (S328, YES), the rule flag OFF indicating that there is no matching rule is stored in the work area 28 (S338).

【００３２】ここでは、Ｓ３２においてワークエリア２
８の規則フラグがＯＮとなっているので（Ｓ３４・ＹＥ
Ｓ）、規則書換えプログラム３８による規則書換え処理
に入る。まず、ポインタＰ２をポインタＰ１の指す変換
結果記憶領域２２に記憶した単語列「未完の鼻」の先頭
の単語「未完」に設定し、ポインタＰ２のポインタ情報
をワークエリア２８に記憶する（Ｓ３６）。次いで、ポ
インタＰ３をポインタＰ７が指す変換結果記憶領域２２
に記憶した単語列「未完の鼻」の末尾の「鼻」に設定
し、ポインタＰ３の情報をワークエリア２８に記憶する
（Ｓ３８）。続いて、ポインタＰ４を規則辞書４６中の
マッチした規則５−２の先頭項目に設定し、ポインタＰ
４のポインタ情報をワークエリア２８に記憶する（Ｓ４
０）。規則辞書４６中の規則５−２においてポインタＰ
４の指す先頭項目＜ＤＭ−１４４：１４４＞（４７ａ）
の書換え情報に意味コード１４４があるので（Ｓ４４・
ＮＯ、Ｓ４６・ＮＯ、Ｓ４８・ＮＯ、Ｓ５０・ＹＥ
Ｓ）、基本辞書４２中に記憶されている単語で「未完」
と同音語で意味コード１４４を持つ単語「蜜柑」に書き
換えて（Ｓ５１）、Ｓ５４に進む。Here, in S32, the work area 2
Since the rule flag of No. 8 is ON (S34 ・ YE
S) Then, the rule rewriting process by the rule rewriting program 38 starts. First, the pointer P2 is set to the first word "unfinished" of the word string "unfinished nose" stored in the conversion result storage area 22 pointed to by the pointer P1, and the pointer information of the pointer P2 is stored in the work area 28 (S36). .. Next, the conversion result storage area 22 in which the pointer P3 points to the pointer P3
The word string "unfinished nose" stored in the above is set to "nose" at the end, and the information of the pointer P3 is stored in the work area 28 (S38). Then, the pointer P4 is set to the head item of the matched rule 5-2 in the rule dictionary 46, and the pointer P4 is set.
The pointer information of No. 4 is stored in the work area 28 (S4
0). Pointer P in rule 5-2 in rule dictionary 46
4 <DM-144: 144> (47a)
Since the rewriting information of has the meaning code 144 (S44.
NO, S46 / NO, S48 / NO, S50 / YE
S), the word stored in the basic dictionary 42 is "unfinished"
Is rewritten as the word “tangerine” having the meaning code 144 in the same phoneme (S51), and the process proceeds to S54.

【００３３】Ｓ５４においては、ポインタＰ２の指す単
語とポインタＰ３の指す単語が同じか比較する。つま
り、ポインタＰ２指す単語とポインタＰ３の指す単語が
等しいときは、ポインタＰ２が最後の単語まで進んだこ
とを意味する。この場合は、ポインタＰ２の指す「蜜
柑」は変換結果記憶領域２２に記憶されている変換結果
の規則に一致した最後の単語ではないので（Ｓ５４・Ｎ
Ｏ）、単語ポインタＰ２を次の単語「の」に移動し、ポ
インタＰ４を規則５−２中の次の項目＜Ｕ−の：Ｎ＞
（４７ｂ）に移動すると共に（Ｓ５６）、前記Ｓ４４に
戻る。In S54, the word pointed by the pointer P2 and the word pointed by the pointer P3 are compared to see if they are the same. That is, when the word pointed by the pointer P2 and the word pointed by the pointer P3 are equal, it means that the pointer P2 has advanced to the last word. In this case, the "tangerine" pointed to by the pointer P2 is not the last word that matches the conversion result rule stored in the conversion result storage area 22 (S54.N).
O), the word pointer P2 is moved to the next word “NO”, and the pointer P4 is moved to the next item <U-: N> in the rule 5-2.
While moving to (47b) (S56), the process returns to S44.

【００３４】ここでは、ポインタＰ４の指す項目＜Ｕ−
の：Ｎ＞に無変化マーカーＮがあるので（Ｓ４４・ＹＥ
Ｓ）、書き換えずそのままにし、Ｓ５４に進む。Here, the item <U-pointed by the pointer P4
No: Since there is no change marker N in N> (S44 ・ YE
S), without rewriting, it is left as it is, and the process proceeds to S54.

【００３５】もし、ポインタＰ４の指す項目４７ｂ中の
書換え情報の場所に削除マーカーＤが立っているときは
（Ｓ４４・ＮＯ、Ｓ４６・ＹＥＳ）、変換結果記憶領域
２２のポインタＰ２の指す単語「の」を削除し（Ｓ５
０）、Ｓ５４に進む。さらにポインタＰ４の指す項目４
７ａ中の書換え情報の場所に単語列があるときは（Ｓ４
４・ＮＯ、Ｓ４６・ＮＯ、Ｓ４８・ＹＥＳ）、変換結果
記憶領域２２のポインタＰ２の指す単語「の」をその単
語列と書き換え（Ｓ５２）、Ｓ５４に進む。If the deletion marker D is set at the location of the rewriting information in the item 47b pointed to by the pointer P4 (S44 / NO, S46 / YES), the word "of the pointer P2 in the conversion result storage area 22" Is deleted (S5
0), and proceeds to S54. Item 4 pointed to by pointer P4
When there is a word string at the location of the rewriting information in 7a (S4
4 · NO, S46 · NO, S48 · YES), the word “no” pointed to by the pointer P2 in the conversion result storage area 22 is rewritten with the word string (S52), and the process proceeds to S54.

【００３６】Ｓ５４においては、ポインタＰ２の指す単
語とポインタＰ３の指す単語が同じか比較する。つま
り、ポインタＰ２指す単語とポインタＰ３の指す単語が
等しいときは、ポインタＰ２が最後の単語まで進んだこ
とを意味する。この場合は、ポインタＰ２の指す「の」
は変換結果記憶領域２２に記憶されている変換結果の最
後の単語ではないので（Ｓ５４・ＮＯ）、単語ポインタ
Ｐ２を次の単語「鼻」に移動し、ポインタＰ４を規則５
−２中の次の項目＜Ｄ−はな：花＞（４７ｃ）に移動す
ると共に（Ｓ５６）、前記Ｓ４４に戻る。In S54, the word pointed to by the pointer P2 and the word pointed by the pointer P3 are compared to see if they are the same. That is, when the word pointed by the pointer P2 and the word pointed by the pointer P3 are equal, it means that the pointer P2 has advanced to the last word. In this case, "no" pointed by the pointer P2
Is not the last word of the conversion result stored in the conversion result storage area 22 (S54, NO), the word pointer P2 is moved to the next word "nose" and the pointer P4 is set to rule 5.
The next item <D-hana: flower> in -2 is moved to (47c) (S56), and the process returns to S44.

【００３７】ここでは、ポインタＰ４の指す項目＜Ｄ−
はな：花＞の書換え情報に単語「花」があるので（Ｓ４
４・ＮＯ、Ｓ４６・ＮＯ、Ｓ４８・ＹＥＳ）、ポインタ
Ｐ２の示す単語「鼻」をポインタＰ４の示す書換え情報
の単語「花」と書き換え（Ｓ５２）、Ｓ５４に進む。Here, the item <D-pointed by the pointer P4
Hana: Because there is the word “flower” in the rewriting information for flower> (S4
4 · NO, S46 · NO, S48 · YES), the word “nose” indicated by the pointer P2 is rewritten with the word “flower” in the rewriting information indicated by the pointer P4 (S52), and the process proceeds to S54.

【００３８】ここで、ポインタＰ２が指す単語が、ポイ
ンタＰ３の示す変換結果記憶領域２２に記憶されている
の末尾の単語になったので（Ｓ５４・ＹＥＳ）、ポイン
タＰ１が変換結果記憶領域２２に記憶されている単語列
の末尾の単語になければ（Ｓ６０・ＮＯ）、ポインタＰ
１を次の単語へ一つ移動させ（Ｓ６２）、再び同様の規
則検索処理（Ｓ３２）を規則検索プログラム３６により
行う。この場合は、ポインタＰ１を変換結果記憶領域２
２に記憶された文字列「蜜柑の花」の二番目の単語
「の」に移動する（Ｓ６０・ＮＯ、Ｓ６２）。そして、
Ｓ３２〜Ｓ６２の処理を繰り返す。最後にポインタＰ１
が変換結果記憶領域２２中の末尾の単語のとき（Ｓ６０
・ＹＥＳ）、規則変換処理を終了する。Since the word pointed to by the pointer P2 has become the last word stored in the conversion result storage area 22 indicated by the pointer P3 (YES in S54), the pointer P1 is stored in the conversion result storage area 22. If it is not at the end word of the stored word string (S60, NO), pointer P
One is moved to the next word (S62), and the same rule search process (S32) is performed again by the rule search program 36. In this case, the pointer P1 is set to the conversion result storage area 2
It moves to the second word "no" of the character string "tangerine flower" stored in 2 (S60, NO, S62). And
The processing of S32 to S62 is repeated. Finally the pointer P1
Is the last word in the conversion result storage area 22 (S60
-YES), and the rule conversion process ends.

【００３９】尚、ワークエリア２８に記憶されている規
則フラグがＯＦＦであれば（Ｓ３４・ＮＯ)、Ｓ６０に
進む。ここで、ポインタＰ１が変換結果記憶領域２２中
の記憶されている単語列の末尾の単語になければ（Ｓ６
０・ＮＯ）、ポインタＰ１を次の単語へ一つずつ移動さ
せ（Ｓ６２）、再び同様の規則検索処理（Ｓ３２）を規
則検索プログラム３６により行う。この場合は、ポイン
タＰ１を変換結果記憶領域２２に記憶された文字列「未
完の鼻」の二番目の単語「の」に移動する（Ｓ６０・Ｎ
Ｏ、Ｓ６２）。そして、Ｓ３２〜Ｓ６２の処理を繰り返
す。最後にポインタＰ１が変換結果記憶領域２２中の末
尾の単語のとき（Ｓ６０・ＹＥＳ）、規則変換処理を終
了する。If the rule flag stored in the work area 28 is OFF (NO in S34), the process proceeds to S60. Here, if the pointer P1 is not at the last word of the word string stored in the conversion result storage area 22 (S6).
0 * NO), the pointer P1 is moved to the next word one by one (S62), and the same rule search process (S32) is performed again by the rule search program 36. In this case, the pointer P1 is moved to the second word "no" of the character string "unfinished nose" stored in the conversion result storage area 22 (S60N).
O, S62). Then, the processing of S32 to S62 is repeated. Finally, when the pointer P1 is the last word in the conversion result storage area 22 (S60, YES), the rule conversion process ends.

【００４０】この結果が再度変換結果記憶領域２２に格
納され、そして、前記変換結果記憶領域２２の内容が出
力バッファ領域２６に格納され、出力装置５０に表示さ
れる（図６・Ｓ１６）。その後、ユーザーから確定キー
が入力されれば（Ｓ１８・ＹＥＳ）、変換結果記憶領域
２２に記憶されている単語が、確定され終了する。確定
キー以外のものが入力されたときは（Ｓ１８・ＮＯ）、
候補変更処理に進む（Ｓ２０）。その結果は再度、変換
結果記憶領域２２に格納される。そして前記変換結果２
２の内容が出力バッファ領域２６に格納され、出力装置
５０に表示される（Ｓ１６）。その後、ユーザーから確
定キーが入力されれば（Ｓ１８・ＹＥＳ）、変換結果記
憶領域２２に記憶されている単語が、確定され終了す
る。This result is stored again in the conversion result storage area 22, and the contents of the conversion result storage area 22 are stored in the output buffer area 26 and displayed on the output device 50 (S16 in FIG. 6). After that, if the confirmation key is input by the user (S18, YES), the word stored in the conversion result storage area 22 is confirmed and the process ends. If anything other than the Enter key is entered (S18, NO),
Proceed to the candidate change process (S20). The result is stored again in the conversion result storage area 22. And the conversion result 2
The contents of No. 2 are stored in the output buffer area 26 and displayed on the output device 50 (S16). After that, if the confirmation key is input by the user (S18, YES), the word stored in the conversion result storage area 22 is confirmed and the process ends.

【００４１】以上説明したように、規則辞書４６中の規
則項目に意味コードを記述することにより、例えば、一
つの規則＜ＤＭ−１４４：１４４＞＜Ｕ−の：Ｎ＞＜Ｄ
−はな：花＞が、「（植物の意味コード１４４を持つ単
語の同音語）＋の鼻」という変換結果に対して適用でき
るので、一つの規則が幅広く適用できる。As described above, by describing the meaning code in the rule item in the rule dictionary 46, for example, one rule <DM-144: 144><U-:N><D
-Hana: Hana> can be applied to the conversion result of "(the homonym of the word having the plant meaning code 144) + nose", so that one rule can be widely applied.

【００４２】また、規則書換えプログラム３８により規
則書換え処理を行うときは、変換結果記憶領域２２に記
憶された単語列と一致した規則項目に意味コードがある
時は、基本辞書４２中の同じ意味コードを持つ同音語と
書換え処理を行うので、「未完の鼻」という誤変換も
「蜜柑の花」と正しく規則変換処理を行うことができ
る。本発明は、日本語ワードプロセッサのかな漢字変
換装置に限ったものではなく、日本語入力を伴う各種の
機器に応用可能なものである。When the rule rewriting program 38 performs the rule rewriting process, when the rule item matching the word string stored in the conversion result storage area 22 has a meaning code, the same meaning code in the basic dictionary 42 is used. Since the rewriting process is performed with the homonym having, the rule conversion process can be correctly performed with respect to the erroneous conversion "unfinished nose" as "tangerine flower". INDUSTRIAL APPLICABILITY The present invention is not limited to the kana-kanji conversion device of a Japanese word processor, but can be applied to various devices that require Japanese input.

【００４３】[0043]

【発明の効果】以上説明したことから明かなように、本
発明のかな漢字変換装置は規則変換処理において意味情
報などのカテゴリーを用いることにより、規則変換の適
用される単語を同一カテゴリー情報を持つ単語だけでな
く、同一カテゴリー情報をもつ他の同音語まで広げる。
従って、より効率的に規則変換処理を行うことができ、
規則辞書のデータ容量も縮小できる。As is apparent from the above description, the kana-kanji conversion device of the present invention uses the categories such as semantic information in the rule conversion process so that the words to which the rule conversion is applied are the words having the same category information. Not only is it extended to other homophones with the same category information.
Therefore, the rule conversion processing can be performed more efficiently,
The data capacity of the rule dictionary can also be reduced.

[Brief description of drawings]

【図１】本発明の構成図である。FIG. 1 is a configuration diagram of the present invention.

【図２】本実施例によるかな漢字変換装置の制御部のブ
ロック図である。FIG. 2 is a block diagram of a control unit of the kana-kanji conversion device according to the present embodiment.

【図３】本実施例の変換結果記憶領域の内容の概念図で
ある。FIG. 3 is a conceptual diagram of contents of a conversion result storage area of the present embodiment.

【図４】本実施例の基本辞書の内容の概念図である。FIG. 4 is a conceptual diagram of contents of a basic dictionary of this embodiment.

【図５】本実施例の規則辞書の内容の概念図である。FIG. 5 is a conceptual diagram of contents of a rule dictionary of this embodiment.

【図６】本実施例のかな漢字変換装置のフローチャート
である。FIG. 6 is a flowchart of the kana-kanji conversion device of the present embodiment.

【図７】本実施例の規則変換のフローチャートである。FIG. 7 is a flowchart of rule conversion according to the present exemplary embodiment.

【図８】本実施例の規則変換のフローチャートである。FIG. 8 is a flowchart of rule conversion of this embodiment.

【図９】本実施例の規則検索サブルーチンのフローチャ
ートである。FIG. 9 is a flowchart of a rule search subroutine of this embodiment.

[Explanation of symbols]

１０入力装置１２中央処理装置２０ＲＡＭ２２変換結果記憶領域２４読み入力バッファ領域２６出力バッファ領域２８ワークエリア３０ＲＯＭエリア３２プログラム部３４かな漢字変換プログラム３６規則検索プログラム３８規則書換えプログラム４０辞書部４２基本辞書４４接続テーブル４６規則辞書５０出力装置 10 Input Device 12 Central Processing Unit 20 RAM 22 Conversion Result Storage Area 24 Reading Input Buffer Area 26 Output Buffer Area 28 Work Area 30 ROM Area 32 Program Part 34 Kana-Kanji Conversion Program 36 Rule Search Program 38 Rule Rewriting Program 40 Dictionary Part 42 Basic Dictionary 44 connection table 46 rule dictionary 50 output device

Claims

[Claims]

1. A kana-kanji conversion is performed based on an input means for inputting a kana-yomi character string, a notation for reading a word, and a basic dictionary storing information indicating the category of the word, and the basic dictionary. A rule having kana-kanji conversion means, conversion result storage means for storing a conversion result by the kana-kanji conversion means, a pattern of a word string consisting of information such as reading, notation, and categories of words in the basic dictionary, and rewriting information thereof. And a rule search means for searching the contents of the conversion result storage means for words having the same category of the rule dictionary and having a pattern matching other than the word, and the rule search means. When a matching pattern is searched by, the contents of the corresponding conversion result storage means are written based on the rewriting information of the rule of the pattern. A kana-kanji conversion device, comprising: a rule rewriting means for changing and a means for outputting the contents of the conversion result storage means as a kana-kanji character string.