JPS63116269A

JPS63116269A - Kana/kanji converter for japanese processing

Info

Publication number: JPS63116269A
Application number: JP61262022A
Authority: JP
Inventors: Mari Igami; 伊神　真里
Original assignee: Brother Industries Ltd
Current assignee: Brother Industries Ltd
Priority date: 1986-11-05
Filing date: 1986-11-05
Publication date: 1988-05-20

Abstract

PURPOSE:To improve the conversion efficiency of homonym by checking the connecting relation of words based upon meaning information added to respective words in addition to the check of the connecting relation of words based upon the grammatical character in parts of speech. CONSTITUTION:A dictionary retrieving part 2 collates and retrieves an input character code string with a character string in a conversion dictionary 3 to retrieve the existence of 'KANJI' (Chinese character)/'KANA' (Japanese syllabary) mixing character string corresponding to the input character string. When one corresponding character string exists, an output entry word stored in an output entry word storing part 3b and a meaning code stored in a meaning code storing part 3d are stored in a register 6. When there are plural corresponding character strings, an output entry word stored in the storage part 3b corresponding to a character string most close to the meaning code already stored in the register 6 and the meaning code stored in the storage part 3d are stored in the register 6. The output entry word from the register 6 is applied to an output control part 4 and a 'KANJI'/'KANA' mixing character string is outputted.

Description

【発明の詳細な説明】〔発明の技術分野〕本発明は、日本語処理の仮名漢字変換装置に関するもの
である。DETAILED DESCRIPTION OF THE INVENTION [Technical Field of the Invention] The present invention relates to a kana-kanji conversion device for Japanese processing.

[Background of the invention]

日本語の文書作成装置、ワードプロセッサ等においては
、仮名文字から漢字に変換する処理が不可欠である。こ
のような仮名−漢字変換は当初の逐字変換から逐語変換
へ、また、文節変換から一括変換へと進歩している。In Japanese document creation devices, word processors, etc., processing for converting kana characters into kanji characters is essential. Such kana-to-kanji conversion has progressed from the initial word-for-word conversion to word-for-word conversion, and from phrase conversion to batch conversion.

[Prior art]

従来のこの種の仮名漢字変換装置においては、品詞等の
文法的性格によって単゛ｉ吾間の接続関係のみを全てチ
ェックして判断処理していた。しかし、文法的に同じ性
格を有する同音異義語が複数ある場合には、所望の単語
が的確に得られないことが多く、繰り返し変換のために
時間を要する欠点があった。In conventional kana-kanji conversion devices of this type, judgments are made by checking only the connections between the words ``i'' and ``go'' based on grammatical characteristics such as parts of speech. However, when there are a plurality of homophones having the same grammatical characteristics, the desired word is often not accurately obtained, and there is a drawback that it takes time for repeated conversion.

[Purpose of the invention]

本発明は、従来の仮名漢字変換装置における品詞等の文
法的性格による単語間の接続関係のチェックに加えて、
各単語に付加した意味情報による単語間の接続関係のチ
ェックを行い、同音異義語の変換効率の向上を図ること
を目的とする。In addition to checking the connection relationships between words based on grammatical characteristics such as parts of speech in conventional kana-kanji conversion devices, the present invention
The purpose of this method is to check the connection relationships between words based on the semantic information added to each word, and to improve the conversion efficiency of homophones.

〔発明の構成］本発明の目的は、特許請求の範囲に記載の構成を有する
仮名漢字変換装置、すなわち、入力された読みを仮名コ
ードに変換する入力装置と、入力見出し語記憶部、出力
見出し語記憶部、品詞情報記憶部及び意味情報記憶部を
備えた変換辞書と、前記入力装置から与えられた仮名コ
ードに対応する漢字仮名混じりの文字列コードを、前記
変換辞書の入力見出し語記憶部において検索すると共に
、その時、該変換辞書の意味情報記憶部に記憶された情
報によって同音異義語の優先度を決定して対応する出力
見出し語記憶部に記憶された情報を出力する辞書検索部
と、前記辞書検索部の出力に対応する文字列を表示装置
に出力する出力制御部と、を具備する仮名漢字変換装置
によって達成される。[Structure of the Invention] An object of the present invention is to provide a kana-kanji conversion device having the structure described in the claims, that is, an input device that converts input pronunciation into a kana code, an input headword storage unit, and an output header. A conversion dictionary comprising a word storage section, a part-of-speech information storage section, and a meaning information storage section; and an input headword storage section of the conversion dictionary that stores a character string code containing kanji and kana corresponding to the kana code given from the input device. a dictionary retrieval unit that performs a search in the conversion dictionary, determines the priority of homophones based on the information stored in the semantic information storage unit of the conversion dictionary, and outputs the information stored in the corresponding output headword storage unit; This is achieved by a kana-kanji conversion device comprising: , an output control section that outputs a character string corresponding to the output of the dictionary search section to a display device.

〔発明の効果］本発明にかかる仮名漢字変換装置によれば、入力装置よ
り入力された、読みを表す文字列、例えば片仮名文字列
を変換辞書に登録されている文字列と照合し、漢字仮名
混じり表記を得るが、この場合、意味コードを勘案して
変換が行われる。この意味コードは、原則として直前に
変換された文字列と最も意味の近いものを出力すると同
時に当該漢字仮名混じり表記の文字列と意味コードをレ
ジスタに格納して次の入力に備える。したがって、文全
体の意味のつながりを考えた仮名漢字変換を実施するこ
とが可能となる。[Effects of the Invention] According to the kana-kanji conversion device according to the present invention, a character string representing a reading input from an input device, such as a katakana character string, is compared with a character string registered in a conversion dictionary, and the character string is converted into a kanji-kana character string. A mixed notation is obtained, but in this case, the conversion is performed taking into account the semantic code. As a general rule, this semantic code is outputted with the closest meaning to the previously converted character string, and at the same time, the character string written in Kanji/kana and the meaning code is stored in a register in preparation for the next input. Therefore, it is possible to perform kana-kanji conversion that takes into account the meaning of the entire sentence.

〔実施例］以下、実施例を示す添付図を参照しつつ本発明を開示す
る。[Examples] Hereinafter, the present invention will be disclosed with reference to the accompanying drawings showing examples.

第１図は、本発明にかかる仮名漢字変換システムを実施
する装置の基本構成を示すブロック図で、入力語Ｗ１、
辞書検索部２、変換辞書３、出力制御部４、表示装置５
、レジスタ６によって構成される。FIG. 1 is a block diagram showing the basic configuration of a device implementing the kana-kanji conversion system according to the present invention.
Dictionary search unit 2, conversion dictionary 3, output control unit 4, display device 5
, register 6.

キーボード、音声認識装置又は仮名文字読み取り装置等
とすることができる入力装置１は、入力される読みを表
す第１の文字列を、例えば、仮名文字コードに変換して
辞書検索部２に伝えるものである。読みを表す第１の文
字列は、平仮名、片仮名、ローマ字等として示されるも
のである。The input device 1, which can be a keyboard, a voice recognition device, a kana character reading device, etc., converts a first character string representing an input pronunciation into a kana character code, and transmits the converted character string to the dictionary search unit 2. It is. The first character string representing the reading is shown as hiragana, katakana, romaji, etc.

辞書検索部２は、入力された文字コード列を変換辞書３
に予め登録されている文字列との間で照合検索を行い、
入力された第１の文字列に該当する漢字仮名混じりの表
示文字からなる第２の文字列が存在するか否かを検索す
るものである。The dictionary search unit 2 converts the input character code string into a conversion dictionary 3.
Perform a matching search with the character strings registered in advance,
This is to search for the existence of a second character string consisting of display characters mixed with kanji and kana that corresponds to the input first character string.

変換辞書３は、そのメモリ構成が、例えば第２図のよう
に構成されていて、入力見出し語記憶部３ａ、出力見出
し語記憶部３ｂ、品詞コード記憶部３Ｃ２意味コード記
憶部３ｄを備えている。この入力見出し語記憶部３ａに
格納された読みに対応する漢字仮名混じり文字列を、出
力見出し語記憶部３ｂに格納している。そして、品詞コ
ード記憶部３Ｃには、出力見出し語記憶部３ｂの漢字仮
名混じり文字列に対応する品詞の情報を格納している。The conversion dictionary 3 has a memory configuration as shown in FIG. 2, for example, and includes an input entry word storage section 3a, an output entry word storage section 3b, a part of speech code storage section 3C, and a meaning code storage section 3d. . A character string containing kanji and kana corresponding to the reading stored in the input word storage section 3a is stored in the output word entry storage section 3b. The part-of-speech code storage unit 3C stores information on the part of speech corresponding to the kanji-kana-mixed character string in the output headword storage unit 3b.

さらに、意味コード記憶部３ｄには、該当語の意味に沿
ってなされた分類がコードによって格納されている。こ
の意味コードは、第２図の右欄のように、例えば願望、
物体、機械、人名・・・等のように分類されていて、意
味が類似している近い分類はど近い数値が割り当てられ
る。Further, the meaning code storage unit 3d stores classifications made according to the meaning of the corresponding word in the form of codes. This meaning code is, for example, desire, as shown in the right column of Figure 2.
They are classified into objects, machines, people's names, etc., and similar classifications with similar meanings are assigned similar numerical values.

辞書検索部２が、前述のような第１の文字列に該当する
第２の文字列が存在せず、検索できなかった場合には、
他の処理に移行する。該当する第２の文字列が１つだけ
存在した場合、その対応する出力見出し語記憶部３ｂに
格納されている出力見出し語と意味コード記憶部３ｄに
格納されている意味コードとをレジスタ６に格納する。If the dictionary search unit 2 is unable to search because the second character string that corresponds to the first character string as described above does not exist,
Move on to other processing. If only one corresponding second character string exists, the corresponding output entry word stored in the output entry word storage section 3b and the meaning code stored in the meaning code storage section 3d are stored in the register 6. Store.

該当する第２の文字列が複数ある場合は、既にレジスタ
６に格納されている意味コードの数値と最も近い数値の
意味コードをもつ第　　　　　′２の文字列の対応する
出力見出し語記憶部３ｂに格納されている出力見出し語
と意味コード記憶部３ｄに格納されている意味コードと
をレジスタ６に格納する。If there are multiple corresponding second character strings, the corresponding output headword storage unit 3b of the '2nd character string whose meaning code is closest to the value of the meaning code already stored in the register 6 is stored. The stored output headword and the meaning code stored in the meaning code storage section 3d are stored in the register 6.

このように、レジスタに格納した出力見出し語を出力制
御部４に与える。出力制御部４は、その出力見出し語を
第１の文字列に対応する第２の文字列として出力する。In this way, the output headword stored in the register is given to the output control section 4. The output control unit 4 outputs the output headword as a second character string corresponding to the first character string.

さらに文字列が入力されれば、前述の動作を繰り返す。If more character strings are input, the above operation is repeated.

このような動作は、逐語変換で可能であることはもとよ
り、品詞コードをも考慮することにより一括変換および
複数の文をもつ文章の一括変換にも適用することができ
る。Such an operation is not only possible with word-for-word conversion, but also can be applied to batch conversion and batch conversion of sentences with multiple sentences by taking part-of-speech codes into consideration.

第３図は、本発明にかかる仮名漢字変換システムの動作
を示すフロー図である。FIG. 3 is a flow diagram showing the operation of the kana-kanji conversion system according to the present invention.

ステップＳ１においてキースキャンを行い、キー人力が
あれば、ステップＳ２において仮名漢字変換か否かが判
断される。この判断がＮＯの場合は、他の処理Ｓ５が行
われる。A key scan is performed in step S1, and if there is manual key power, it is determined in step S2 whether or not kana-kanji conversion is to be performed. If this determination is NO, another process S5 is performed.

この判断がＹＥＳの場合は、ステップＳ３のように入力
文字コード列の読み込みが行われる。If this determination is YES, the input character code string is read as in step S3.

次いでステップＳ４のように入力見出し語の検索が行わ
れ、この結果についてステップＳ５のように、候補が有
るか否かが判断される。この判断がＮＯの場合は、他の
処理が行われる。この判断がＹＥＳの場合は、次のステ
ップＳ６において候補は一つか否かが判断される。Next, a search for the input entry word is performed as in step S4, and it is determined whether or not there are any candidates based on the search results as in step S5. If this determination is NO, other processing is performed. If this determination is YES, it is determined in the next step S6 whether there is only one candidate.

ステップＳ６の判断がＹＥＳの場合は、次のステップＳ
７において候補の読み出しが行われ、次いでステップＳ
８において意味コードの有無が判断される。If the determination in step S6 is YES, the next step S
7, candidates are read out, and then in step S
At step 8, the presence or absence of a semantic code is determined.

一方、ステップＳ６の判断がＮＯの場合は、ステップＳ
９において候補の読み出しを行い、次のステップＳ１０
において意味コードがあるか否かが判断される。ここで
の判断がＹＥＳの場合は、次のステップ３１１において
レジスタに数値が記憶されているか否かの判断を行う。On the other hand, if the determination in step S6 is NO, step S
9, the candidates are read out, and the next step S10
It is determined whether or not there is a semantic code. If the determination here is YES, it is determined in the next step 311 whether or not a numerical value is stored in the register.

この判断がＹＥＳの場合は、次のステップＳ１２におい
て意味コードの数値がレジスタの数値と候補中で最も近
いものか否かが判断される。If this determination is YES, it is determined in the next step S12 whether or not the numerical value of the meaning code is closest to the numerical value of the register among the candidates.

この判断がＹＥＳの場合は、ステップＳ１３のようにレ
ジスタに入出力見出し語と意味コードとを格納する。ま
た出力見出し語をステップＳ１４のように表示し、その
後ステップＳ１５のように入力文字列がまだ残っている
か否かが判断される。ステップＳ１５の判断がＮｏの場
合は、他の処理が行われる。また、この判断が、ＹＥＳ
の場合は、ステップＳ４の前に戻って前述のフローを繰
り返す。If this determination is YES, the input/output headword and meaning code are stored in a register as in step S13. Further, the output headword is displayed as in step S14, and then it is determined as in step S15 whether or not the input character string still remains. If the determination in step S15 is No, other processing is performed. Also, this judgment is YES
In this case, the process returns to before step S4 and the above-described flow is repeated.

ステップ３１６の判断がＹＥＳの場合は、ステップＳ９
の前に戻る。また、この判断がＮＯの場合は、ステップ
３１３の後までスキップする。If the determination in step 316 is YES, step S9
Go back to before. Further, if this determination is NO, the process skips to after step 313.

ステップ３１１の判断がＮＯの場合は、ステップＳ１２
をスキップしてステップＳ１３を行う。If the determination in step 311 is NO, step S12
is skipped and step S13 is performed.

ステップＳ１２の判断がＮｏの場合は、ステップＳ９の
前に戻る。If the determination in step S12 is No, the process returns to before step S9.

ステップＳ８の判断がＹＥＳの場合は、ステップ８１３
の前に、そしてＮｏの場合にはステップＳ１４の前にそ
れぞれスキップする。If the determination in step S8 is YES, step 813
, and in the case of No, before step S14.

ここで、第２図に示した変換辞書３を用い、人力文字列
「くうきはきたいた」から目的文字列「空気は気体だ」
を得ようとする具体例を、前述のフロー図に合わせて説
明する。Here, using the conversion dictionary 3 shown in Figure 2, convert the human character string ``Kuukiha Kitita'' to the target character string ``Air is a gas.''
A specific example of trying to obtain the following will be explained in conjunction with the above-mentioned flow diagram.

３３　　人力文字列　　「くうきはきたいた」Ｓ４　　
検索　　　　　候補　　１件のみＳ７　　候補読み出し
　「くうき１０８１　／空気」＊　意味コードはあるが
　レジスタは空。33 Human-powered character string “Kuukihakitita” S4
Search Candidate Only 1 item S7 Read candidate "Kuuki 1081 / Air" * There is a meaning code, but the register is empty.

Ｓ１３　レジスタに「（うき１０８１　／空気」格納Ｓ
１４出力〔表示〕　　「空気」＊　まだ入力文字は残っている。S13 Store “(Uki1081/Air)” in the register S
14 Output [Display] “Air” * There are still input characters.

「はきたいだ」Ｓ４　　検索　　　　　候補　　１件のみＳ７　　候補
読み出し　「は／□／は」＊　助詞なので意味コードは
ない。"Hakitai" S4 Search Candidate Only 1 S7 Candidate reading "Ha/□/Ha"* Since it is a particle, there is no meaning code.

Ｓ１４出力〔表示］　　「空気は」＊　まだ入力文字は残っている。S14 output [display] “Air is” * There are still characters left to input.

「きたいた」Ｓ４　　検索　　　　　候補　　４件Ｓ９　　候補読み出し＊　　レジスタの意味コード（０８１）に最も近いのは
、「きたい１０８１　／気体」Ｓ１３レジスタに　「き
たい１０８１　／気体」を格納する。"Kitai" S4 Search 4 candidates S9 Candidate reading * The closest one to the meaning code (081) of the register is "Kitai 1081/Gas". "Kitai 1081/Gas" is stored in the S13 register.

Ｓ１４出力〔表示〕　　「空気は気体」＊　まだ入力文
字は残っている。　　「だ」Ｓ４　　検索　　　　　候
補　　１件のみＳ７　　候補読み出し　「だ／□／だ」
＊　助動詞なので意味コードなし。S14 output [display] “Air is a gas” * There are still input characters. "Da" S4 Search Candidate Only 1 S7 Read Candidate "Da/□/Da"
* Since it is an auxiliary verb, there is no semantic code.

５１４出力〔表示〕　　「空気は気体だ」＊　入力文字
はもう残っていない。514 Output [Display] “Air is a gas” * There are no more input characters left.

Ｓ、他の処理へ。S. Go to other processing.

なお、上記のステップで、＊は判断結果を示すものであ
る。Note that in the above steps, * indicates the determination result.

[Brief explanation of the drawing]

第１図は、本発明にかかる仮名漢字変換装置の基本構成
のブロック図である。第２図は、本発明にかかる仮名漢字変換装置に使用され
る変換辞書のメモリ構成図の例である。第３図は、動作を示すフロー図である。図中の参照符号の対応は以下の通りである。１：入力装置　２：辞書検索部３：変換辞書　４：出力制御部５：表示装置　６：レジスタFIG. 1 is a block diagram of the basic configuration of a kana-kanji conversion device according to the present invention. FIG. 2 is an example of a memory configuration diagram of a conversion dictionary used in the kana-kanji conversion device according to the present invention. FIG. 3 is a flow diagram showing the operation. The correspondence of reference numerals in the figure is as follows. 1: Input device 2: Dictionary search unit 3: Conversion dictionary 4: Output control unit 5: Display device 6: Register

Claims

[Claims]

an input device that converts the input pronunciation into a kana code; a conversion dictionary that includes an input headword storage unit, an output headword storage unit, a part-of-speech information storage unit, and a meaning information storage unit; and a kana code given from the input device. A character string code containing kanji and kana corresponding to the code is searched in the input headword storage unit of the conversion dictionary, and at the same time, the priority of the homophone is determined based on the information stored in the semantic information storage unit of the conversion dictionary. A dictionary search unit that determines and outputs information stored in a corresponding output headword storage unit; and an output control unit that outputs a character string corresponding to the output of the dictionary search unit to a display device. Features of Kana-Kanji conversion device.