JPH0498449A

JPH0498449A - Single kanji converter

Info

Publication number: JPH0498449A
Application number: JP2213304A
Authority: JP
Inventors: Tsuneaki Kato; 加藤　恒昭; Yoshiji Oyama; 芳史大山; Muneyuki Horii; 統之堀井
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 1990-08-10
Filing date: 1990-08-10
Publication date: 1992-03-31
Anticipated expiration: 2010-05-24
Also published as: JPH0748212B2

Abstract

PURPOSE:To improve the way of using by providing an input character string analysis part, a shape designation processing part, a phrase designation processing part, and a word designation processing part. CONSTITUTION:This device is comprised of the input character string analysis part 1, the shape designation processing part 2, the phrase designation processing part 3, the word designation processing part 4, a KANA(Japanese syllabary)/ KANJI(Chinese character) conversion part 5, a radical dictionary 6, a shape dictionary 7, a KANJI information dictionary 8, and a KANA/KANJI conversion dictionary 9. The input character string analysis part 1 decides either designation format, designation by a KANJI name or radical name comprising the KANJI of an inputted character string and position relation, designation by the reading of a phrase in which the KANJI is included and the reading of the KANJI in the phrase, designation by declinable word termination including the Japanese reading of the KANJI, or designation by the reading of a word comprised of one KANJI, and either shape designation processing, phrase designation processing, or word designation processing is called, and the input character string is delivered. Thereby, a single KANJI conversion function simple and easy to use can be obtained.

Description

【発明の詳細な説明】［産業上の利用分野Ｊこの発明は、入力装置から入力される平仮名（ローマ字
）文字列を、カナ漢字変換して出力する文書作成装置！
！（ワードプロセッサ等）や、上記入力された平板名文
字列をカナ漢字変換して処理装置に送る日本語フロント
エンド等に用いられ、入力文字列を単漢字に変換する単
漢字変換装置に関する。[Detailed Description of the Invention] [Industrial Field of Application J This invention is a document creation device that converts a hiragana (Roman alphabet) character string inputted from an input device into kana-kanji characters and outputs it!
! The present invention relates to a single kanji conversion device that converts an input character string into a single kanji, which is used in a Japanese front end (such as a word processor) or a Japanese front end that converts the input plain name character string into kana-kanji and sends it to a processing device.

「従来の技術」カナ漢字変換装置において、姓名・地名等の固有名詞を
入力する場合、単語辞書にその固を名詞が登録されてい
ないことが多く、この場合、利用者は漢字を一文字ずつ
カナ漢字変換して入力することが必要となる０例えば、
ｒ恒昭（つねあき）コと入力したい場合、「つねあきＪ
をカナ漢字変換しても所望の変換結果は得られないこと
が多く、「恒Ｊ、「昭」を一文字ずつ変換する必要があ
る。``Prior art'' When inputting a proper noun such as a family name or place name in a kana-kanji conversion device, the noun is often not registered in the word dictionary. For example, 0, which requires kanji conversion and input,
If you want to enter rTsuneakiko, ``TsuneakiJ
Even if you convert ``Kana-Kanji'' into kana-kanji, the desired conversion result is often not obtained, and it is necessary to convert ``Ko J'' and ``Sho'' one character at a time.

固有名詞の数は膨大で、その全てを辞書に登録しておく
ことは不可能であるため、このような単漢字の変換は必
ず生じる事態である。従来のカナ漢字変換装置において
は、この様な単漢字入力の手段として以下のものが主に
提供されている。Since the number of proper nouns is enormous and it is impossible to register them all in a dictionary, conversion of single kanji like this always occurs. In conventional kana-kanji conversion devices, the following are mainly provided as means for inputting single kanji.

コード入カニ入力したい漢字のコード（ＪＩＳコードや
句点コード）を入力して変換する２ストローク法：漠字
毎に特殊なコードを決め、それを入力する読み人カニ入力したい漢字の音読み（Ｆ昭」であれば、
「しよう」）を入力して変換する「発明が解決しようと
する課題」上記コード入力は、入力したい漢字のコードをコードブ
ック等で調べる必要があり、手間のかかる手法である。Two-stroke method: enter the code (JIS code or period code) of the kanji you want to input and convert it: Decide on a special code for each vague character and input it. "If,
``Problem to be Solved by the Invention'' Entering and converting ``Let's'') The above code input is a time-consuming method as it is necessary to look up the code of the kanji you want to input in a code book or the like.

上記２ストローク法は、特殊なコードを暗記する必要が
あり、軌線を要する手法である。上記読み入力は、−４
には同音異語（「シょう」については、章、冨、小、竿
、等）が多く、所望の漢字を得るまでに、多数回、次候
補キイを押下するなどの操作が必要であり、使い勝手の
良いものではなかった。つまり、単漢字を変換するため
の簡便な手段が、従来、提供されていなかった。The two-stroke method described above requires memorizing special codes and requires a trajectory. The reading input above is -4
has many homonyms (for ``sho'', there are chapters, 冨, 小, 竿, etc.), and it is necessary to press the next candidate key many times to get the desired kanji. Yes, it wasn't very easy to use. In other words, a simple means for converting single kanji has not been provided in the past.

「課題を解決するための手段Ｊこの発明によれば、漢字の形状を記述した形状辞書と、
部首の読み方を記述した部首辞書と、漢字の読み方と形
状を記述した漢字辞書と、与えられた文字列を文節とみ
なして仮名漢字変換を行う仮名漢字変換部と、その仮名
漢字変化部が利用する仮名漢字変換辞書とを備え、入力文字例解析部で、入力された文字列について、以下
の１〜４のどの指定形式であるかを判定し、その結果に
応して、形状指定処理、熟語指定処理、単語指定処理の
いずれかを呼び出し、入力文字列を渡す。``Means for Solving the Problems J'' According to this invention, a shape dictionary that describes the shapes of kanji,
A radical dictionary that describes how to read radicals, a kanji dictionary that describes how to read and shape kanji, a kana-kanji converter that treats a given string as a clause and converts it from kana to kanji, and a kana-kanji transformation part. It is equipped with a kana-kanji conversion dictionary used by Call either the process, phrase specification process, or word specification process and pass the input string.

１　当該漢字を構成する漢字もしくは部首名とそれらの
位置関係による指定２　当該漢字が含まれている熟語の読みと、その熟語に
おける当該漢字の読みによる指定３　当該漢字の訓読を
含む用言終止形による指定４　当該漢字が一文字で単語となる場合にその単語の読
みによる指定形状指定処理部は指定の形式が１の時に呼び出され、与
えられた文字列から部分となる漢字もしくは部首名を表
現している文字列である部分文字列と、下、右などの位
置関係の情報を抽出する。1. Designation based on the kanji or radical names that compose the kanji and their positional relationship. 2. Designation based on the reading of the compound word that contains the kanji and the reading of the kanji in the compound word. 3. A predicate ending that includes the kun reading of the kanji. Designation by shape 4 When the Kanji in question is a single character that becomes a word, it is designated by the pronunciation of the word.The shape designation processing unit is called when the specified format is 1, and calculates the Kanji or radical name that is part of the given character string. Extracts information about the substring that is the string being expressed and the positional relationship such as bottom, right, etc.

これらの部分文字列が部首名であるかを部首辞書によっ
て確認する０部首名でない文字列については、それをこ
の単漢字変換装置の入力として処理し、漢字候補の集合
を得る。これらによって得られた部首名もしくは漢字候
補と先の関係情報を用いて上記形状辞書と上記漢字情報
辞書を検索し、入力文字列の漢字候補を得て、出力とす
る。Whether these partial character strings are radical names is checked using a radical dictionary.0 For character strings that are not radical names, they are processed as input to this single kanji conversion device to obtain a set of kanji candidates. Using the radical names or kanji candidates obtained by these methods and the above relationship information, the shape dictionary and the kanji information dictionary are searched to obtain kanji candidates for the input character string and output.

熟語指定処理部は指定の形式が２の時に呼び出され、与
えられた文字列から、熟語の読みである文字列と、当該
漢字の読みがその熟語の読みのどの部分に相当するかと
いう位置情報を抽出する。The idiom specification processing unit is called when the specified format is 2, and from the given character string, it extracts the character string that is the pronunciation of the idiom and the positional information that indicates which part of the idiom's pronunciation corresponds to the pronunciation of the kanji. Extract.

次に、仮名漢字変換部を利用して、熟語の読みを仮名漢
字変換し、漢字熟語文字列を得る。これと先の位置情報
とから、入力文字列の漢字候補を得て、それら漢字候補
のうち、当該漢字の読みとして指定された文字列を読み
として持つものを漢字情報辞書を検索して選び出し、入
力文字列の漢字候補とする。Next, using a kana-kanji conversion unit, the pronunciation of the compound word is converted into kana-kanji characters to obtain a kanji compound word string. From this and the previous position information, obtain kanji candidates for the input character string, search the kanji information dictionary and select those that have the character string specified as the reading of the kanji from among these kanji candidates, Use as kanji candidates for the input string.

単語指定処理部は指定の形式が３．４の時に呼び出され
、仮名漢字変換部を利用して与えられた文字列を仮名漢
字変換し、単語を表現する漢字一文字もしくは漢字一文
字を含む用言文節を得る。The word specification processing unit is called when the specified format is 3.4, converts the given character string to kana-kanji using the kana-kanji conversion unit, and converts the given character string to kana-kanji and converts it into a single kanji character representing the word or a pragmatic clause containing one kanji character. get.

それらから、そこに含まれる一文字の漢字のうち、入力
文字列を読みとして持つものを漢字情報辞書を検索して
選び出し、入力７字列の漢字候補として、出力する。Among the single kanji characters contained therein, the kanji information dictionary is searched to select a kanji that has the input character string as a reading, and outputs it as a kanji candidate for the input seven character string.

このように構成されているから入力したい単漢字を次の
４種類の何れかで指定できる。With this structure, you can specify the single kanji you want to input as one of the following four types.

１　形状による指定変換したい漢字を、当該漢字を構成する漢字もしくは部
首名とそれらの位置関係によって指定する例　くさかんむりにほうこうのほう→芳ちからのみぎに
くち　　　　　→加２　熟語による指定変換したい漢字を、当該漢字が含まれている熟語の読み
と、その熟語における当該漢字の読みによって指定する例　れきしのし　　　→史しょうわのしょう→昭３　訓読による指定変換したい漢字を、当該漢字の訓読を含む用言終止形に
よって指定する。1 Example of specifying the kanji you want to convert by specifying the kanji by the kanji or radical name that composes the kanji and their positional relationship. , an example of specifying the kanji by the reading of the compound word containing the kanji and the pronunciation of the kanji in the idiom. It is specified by the disjunctive final form.

例　おおきい　→大くわえる　→加４　単語による指定当該漢字が一文字で単語となる場合には、変換したい漢
字を、その単語の読みによって指定する例　やま　→山単漢字指定のこれらの方式は、従来の方式に比べて熟練
を必要としない上、コードブック等も不要である。また
、これらの形式で指定された場合、候補となる漢字の数
は単なる読みで指定された場合に比べて極めて少ないた
め、次候補キイ押下等の候補選択の手間も削減すること
ができる。特にこれらの形式は人間が言葉を用いて漢字
を説明するときに用いる説明に類似しており、ユーザは
普段用いている説明と同しような入力をするだけで必要
な漢字を入力することができるため、その知的負荷を軽
減することにつながる。Example: Ōkii → Add a large number → Add 4. Specification by word When the kanji in question is a word with one character, the kanji you want to convert is specified by the reading of that word. Compared to the conventional method, this method does not require much skill and does not require a codebook or the like. Furthermore, when designated in these formats, the number of candidate kanji characters is much smaller than when designated simply by reading, so the effort of selecting candidates, such as pressing the next candidate key, can be reduced. In particular, these formats are similar to the explanations that humans use to explain kanji using words, and users can input the necessary kanji simply by inputting the same explanations that they normally use. This leads to a reduction in the intellectual load.

「実施例」第１図は、この発明の実施例の構成を示す、この装置は
入力文字列解析部１、形状指定処理部２、熟語指定処理
部３、単語指定処理部４、仮名漢字変換部５、部首辞書
６、形状辞書７、漢字情報辞書８、仮名漢字変換辞書９
よりなる。仮名漢字変換部５と仮名漢字変換辞書９はワ
ープロ等に利用されている一般の仮名漢字変換装置で、
与えられた文字列を文節と見做して仮名漢字変換を行な
い、変換結果を返すという機能を持つものである。この
発明においではこの仮名漢字変換部５と仮名漢字変換辞
書９の内部構成は規定しない。Embodiment FIG. 1 shows the configuration of an embodiment of the present invention. Part 5, radical dictionary 6, shape dictionary 7, kanji information dictionary 8, kana-kanji conversion dictionary 9
It becomes more. The kana-kanji conversion unit 5 and the kana-kanji conversion dictionary 9 are general kana-kanji conversion devices used in word processors, etc.
It has the function of converting a given string of characters into kana to kanji, treating it as a phrase, and returning the conversion result. In this invention, the internal structure of the kana-kanji conversion section 5 and the kana-kanji conversion dictionary 9 is not specified.

入力される文字列は以下のいずれかの形式を持っている
。The input string has one of the following formats.

形式１　文字列１＋（（の１区切り記号）＋（下１右１
中１間））＋（に１区切り記号）十文字列２０は省略可能、（））は選択を示す。Format 1 String 1 + ((1 delimiter) + (bottom 1 right 1
Middle 1 space)) + (1 delimiter) cross string 2 0 can be omitted, ()) indicates selection.

ここで、文字列１、文字列２は部首名もしくは以下に示
すこの単漢字変換の入力であり、この入力は、文字列１
、文字列２で、部首名もしくは漢字を指定し、それらの
部分が下、右等の位置関係で組み合わされて構成される
漢字を指定している。Here, character string 1 and character string 2 are radical names or inputs for this single kanji conversion shown below, and this input is character string 1
, character string 2 specifies a radical name or kanji, and specifies a kanji composed of these parts combined in a positional relationship such as bottom, right, etc.

形式２　文字列１＋（の１区切り記号）十文字列ｌの部
分ここで、文字列１は熟語の読みであり、この入力は、文
字列−１で指定された熟語を構成する漢字の内、文字列
１の部分の読みを持つ漢字を指定している。Format 2 Character string 1 + (1 delimiter) part of the cross string l Here, character string 1 is the pronunciation of the idiom, and this input is the character string 1 among the kanji that make up the idiom specified by character string -1. Specifies the kanji with the reading in column 1.

形式３　上記１，２以外の文字列文字列は、用言終止形−文節、もしくは、漢字一文字か
らなる単語の読みであり、この入力は、そこに現われる
漢字を指定している。Format 3 Character strings other than 1 and 2 above are the pronunciation of a final phrase - a phrase, or the reading of a word consisting of a single Kanji character, and this input specifies the Kanji that appears there.

入力された文字列は、入力文字列解析部１によって解析
される。入力文字列解析部１の処理の流れは第２図に示
す通りで、入力文字列が上に示したどの形式であるかを
判定し、必要な処理を呼び出す、入力文字列の形式決定
は、形式１の場合、入力文字列が（（の１区切り記号）
＋（下Ｉ在中１間））＋（に１区切り記号）と言う文字
列を含むかによって、形式２の場合、入力文字列が（の
１区切り記号）を含み、かつ、その後の文字が前の文字
列の部分となっているかによって行なわれる。形式１．
２に合わない文字列は形式３であると判定される。The input character string is analyzed by the input character string analysis section 1. The processing flow of the input character string analysis unit 1 is as shown in FIG. 2, and the format determination of the input character string, which determines which format the input character string is in, and calls the necessary processing, is as follows: For format 1, the input string is ((1 delimiter)
Format 2, if the input string contains (1 separator) and the following characters are This is done depending on whether it is part of the previous string. Format 1.
Character strings that do not match format 2 are determined to be format 3.

入力文字列が形式１であると判定されると、形状指定処
理部２が呼び出される。形状指定処理部２の処理の流れ
を第３図に示す、ここでは、まず、入力文字列から文字
列（（の１区切り記号）＋（下１右１中１間））＋１に
１区切り記号）の前後の文字列を取りだし、文字列ｌ、
文字列２とする。また、関係情報として、下、右、中、
間、任意のいずれかを抽出する８文字列１．２が、部首
辞書６に含まれているかを検索することによって、それ
ぞれの文字列が部首名であるかが判定される。When it is determined that the input character string is in format 1, the shape designation processing section 2 is called. The flow of processing of the shape specification processing unit 2 is shown in FIG. 3. Here, first, from the input character string, the character string ((1 delimiter) + (1 delimiter at bottom 1 right 1 middle 1)) + 1 delimiter at 1 ), extract the strings before and after the string l,
Let it be character string 2. Also, as related information, bottom, right, middle,
Meanwhile, it is determined whether each character string is a radical name by searching whether any of the eight character strings 1.2 from which to extract any one is included in the radical dictionary 6.

ここで、部首辞書６は第４図に示すように部首の名称を
列挙したものである０部首名でない文字列については、
これを再びこの単漢字変換装置の入力として再帰的な処
理が行なわれる。これらの処理によって得られた部首、
もしくは、漢字をキイとして、形状辞書７を検索する。Here, the radical dictionary 6 is a list of radical names as shown in FIG. 4. For character strings that are not 0 radical names,
Recursive processing is performed using this as input to the single kanji conversion device again. Radicals obtained by these processes,
Alternatively, the shape dictionary 7 is searched using kanji as a key.

形状辞書７は第５図に示すような構成となっており、部
首名もしくは漢字がキイとなっており、その内容として
、そのキイを部分として含む漢字が記述されている。The shape dictionary 7 has a structure as shown in FIG. 5, in which radical names or kanji are keys, and the contents include kanji that include the key as a part.

この形状辞書７を検索し、検索結果に共通した文字をと
ることによって、文字列１が表現する漢字もしくは部首
名と、文字列２が表現する漢字もしくは部首名との両方
を含む漢字が得られる。これらの漢字をキイに漢字情報
辞書８を検索する。漢字情報辞書８は、各漢字について
、その音読み、訓読み、単紐読み、形状が、音項目、訓
項目、単項目、形項目に整理されて記述されている第６
図に示すような辞書である。得られた漢字候補について
、この漢字情報辞書８を検索し、その形項目を得て、関
係情報が指定されたものと一致するかを！！認し、一致
するものを候補として出力する。By searching this shape dictionary 7 and taking common characters from the search results, you can find a kanji that includes both the kanji or radical name expressed by string 1 and the kanji or radical name expressed by string 2. can get. The kanji information dictionary 8 is searched using these kanji as keys. The Kanji Information Dictionary 8 is a sixth dictionary in which the on reading, kun reading, single string reading, and shape of each kanji are organized into sound items, kun items, single items, and shape items.
This is a dictionary as shown in the figure. Search this kanji information dictionary 8 for the obtained kanji candidate, obtain its shape item, and check whether the related information matches the specified one! ! and outputs the matching ones as candidates.

入力文字列が形式２であると判定されると、熟語指定処
理部３が呼び出される。ここでは、（の区切り記号）の
前後の文字列をそれぞれ文字列ｌ、及び文字列１０部分
として取りだす、後者の文字列１の部分が前者の文字列
１の先頭部分と一致する部分文字列であるとき、位置情
報を「左」、末尾部分と一致する部分文字列であるとき
、位置情報を「右」、それ以外の部分文字列であるとき
位置情報を「中」とする、この後、文字例１を仮名漢字
変換部５を利用して仮名漢字変換し、文字列１を読みと
する熟語の漢字表記を得る。この熟語がｎ文字の漢字か
らなる熟語であるとする０位置情報が「左」の時、この
熟語の第一文字目を漢字候補とする０位置情報が「右ｊ
の時、この熟語の第ｎ文字目を漢字候補とする０位置情
報が「中」の時、この熟語の第２文字目からｎ−１文字
目までを漢字候補とする。これらの漢字候補のうち、文
字列１の部分で示された読みを持つものを、漢字情報辞
書８の音項目、訓項目を用いて選択し、これを漢字候補
として出力する。When it is determined that the input character string is in format 2, the idiom designation processing unit 3 is called. Here, the character strings before and after (delimiter) are extracted as character string l and character string 10, respectively, and the latter character string 1 part is a partial character string that matches the first part of the former character string 1. In some cases, the position information is set to "left", when the substring matches the last part, the position information is set to "right", and when it is any other substring, the position information is set to "middle". After this, Character example 1 is converted from kana to kanji using the kana-kanji conversion unit 5 to obtain the kanji representation of the compound word whose reading is character string 1. When the 0 position information that this compound word is a compound word consisting of n characters of kanji is "left", the 0 position information that the first character of this compound word is a kanji candidate is "right j
In this case, when the 0 position information that makes the nth character of this compound word a kanji candidate is "medium", the second character to the n-1th character of this compound word are made to be kanji candidates. Among these kanji candidates, the one having the pronunciation shown in the character string 1 is selected using the sound and kun items of the kanji information dictionary 8, and is output as a kanji candidate.

入力文字列が形式３であると判定されると、単結指定処
理部４が呼び出される。ここでは、入力文字列を仮名漢
字変換部５を用いて仮名漢字変換し、得られた結果に含
まれる漢字を漢字候補とする。これらの漢字候補のうち
、入力文字列で示された読みを持つものを、漢字情報辞
書８の訓項目、単項目を用いて選択し、これを漢字候補
として出力する。When it is determined that the input character string is in format 3, the single connection designation processing unit 4 is called. Here, the input character string is converted into kana-kanji characters using the kana-kanji conversion unit 5, and the kanji characters included in the obtained result are used as kanji candidates. Among these kanji candidates, the one having the pronunciation indicated by the input character string is selected using the kun item and single item of the kanji information dictionary 8, and is output as a kanji candidate.

次にｒ＜、さかんむりのしたにほうこうのほう」という
文字列が入力された場合を例に装置の動作を具体的に岐
明する。この文字列が形式１を満たすことが入力文字列
解析部１によって判定され、形状指定処理部２が呼び出
される。形状指定処理部２は、この入力を解析し、文字
列１＝「くさかんむり」、文字例２＝「はうこうのほう
」、関係情報＝「下Ｊを得る０部首辞書６を検索するこ
とで、文字列１は部首名であることがわかる０文字列２
は部首辞書６の中に存在しないので、これを新たに入力
としてこの単漢字変換が再帰的に呼び出される。Next, the operation of the device will be explained in detail by taking as an example the case where the character string ``r<, the bottom of the triangle'' is input. The input character string analysis section 1 determines that this character string satisfies Format 1, and the shape specification processing section 2 is called. The shape specification processing unit 2 analyzes this input and searches the 0-radical dictionary 6 to obtain character string 1 = "kusakanmuri", character example 2 = "haukou no ho", and related information = "lower J". So, string 1 is 0 string 2, which shows that it is a radical name.
does not exist in the radical dictionary 6, so this single kanji conversion is recursively called using this as a new input.

従ってこの単漢字変換装置に「はうこうのほう」という
文字列が入力され、この文字列が形式１であるかが調べ
られるが、この文字列は形式１でないことがわかり、次
に形式２であるかが調べられる。この文字列は形式２で
あると判定され、熟語指定処理部３が呼び出される。熟
語指定処理部３は文字列を解析し、文字列１＝１はうこ
う」、位置情報＝「左」を得る。その文字列Ｉを仮名漢
字変換することにより、その結果として、ｒ方向Ｊ「奉
公」　「彷徨」　「放校」等が得られる。これと位置情
報＝「左」から漢字候補として、「方」「奉」　「彷」
　「放」が得られる。これらの漢字について漢字情報辞
書８を検索−し、それぞれの漢字の読みとして「はう」
が存在することを確認し、これを出力する。Therefore, the character string ``haukou no ho'' is input to this single kanji converter, and it is checked whether this character string is in form 1, but it is found that this character string is not in form 1, and then it is found that this character string is not in form 1. It can be checked whether This character string is determined to be in format 2, and the phrase designation processing section 3 is called. The idiom designation processing unit 3 analyzes the character string and obtains character string 1 = 1 haukou'' and position information = ``left''. By converting the character string I from kana to kanji, the result is J in the r direction, ``hoko,''``wandering,''``housing,'' and the like. This and location information = Kanji candidates from "left" are "ho", "ho", and "彷"
You can get ``release''. Search the Kanji Information Dictionary 8 for these kanji and find ``hau'' as the reading of each kanji.
Check that it exists and print it.

文字列２の漢字変換結果として、「方Ｊ　「奉」「彷」
　「放」を受は取った形状指定処理部２は、まず、「く
さかんむり」をキイとして、形状辞書７を検索し、文字
列「・・・草・・・芳・・・」を得る０次に、「方」を
キイとして形状辞書７を検索し、文字列「・・・芳・・
・旗・・・ｊを得る６次に、この２つの文字列に共通す
る文字を抽出することで、漢字候補「芳」を得る。この
操作を「奉」　「彷」　「放」についても行ない、全て
の漢字候補を得る。この例では、漢字候補として「芳」
だけが得られたものとする。As a result of the kanji conversion of character string 2, "方J""彷"
The shape specification processing unit 2, which accepts "ho", first searches the shape dictionary 7 using "kusakanmuri" as a key, and obtains the character string "...grass...yoshi...". Then, the shape dictionary 7 is searched using ``ho'' as the key, and the character string ``... 方...
- Obtain flag... j 6 Next, by extracting characters common to these two character strings, obtain the kanji candidate ``yang''. Repeat this operation for ``ho'', ``彷'', and ``hou'' to obtain all kanji candidates. In this example, the kanji candidate is "fang".
It is assumed that only that is obtained.

最後に、この得られた候補「芳」について、漢字情報辞
書８を検索し、その形項目「草冠−下一方」から、関係
情報が「下」であることを確認し、漢字候補として出力
する。Finally, the obtained candidate "Yoshi" is searched in the Kanji information dictionary 8, and the related information is confirmed to be "Down" from the shape item "Grass crown - Lower one hand", and it is output as a Kanji candidate. .

これにより、形状指定処理部２の出力として、漢字候補
［芳づが得られ、単漢字変換の処理が終了する０以上の
処理により、文字列「くさかんむりのしたにほうこうの
ほう）の単漢字変換結果として、「芳」が得られる。As a result, as the output of the shape specification processing unit 2, a kanji candidate [hazu is obtained, and the single kanji conversion process is completed. As a result, "yoshi" is obtained.

もう一つの例として、「たいようのよう」という文字列
が入力された場合の装置の動作を説明する。「たいよう
のよう」という文字列が入力されると、まず、この文字
列が形式ｌであるかが調べられるが、この文字列は形式
１でないことがわかり、次に形式２であるかが調べられ
る。この文字列は形式２であると判定され、熟語指定処
理部３が呼び出される。熟語指定処理部３は文字列を解
析し、文字列１＝ｒたいようＪ、位置情報−「右」を得
る０文字列１を仮名漢字変換することにより、その結果
として、「太陽Ｊ　「大洋Ｊ　「大要Ｊが得られる。こ
れと位置情報＝「右」から、漢字候補として、「陽」　
「洋」　「要」が得られる。これらの漢字について漢字
情報辞書８を検索し、それぞれの漢字の読みとして「よ
う」が存在することを確認し、これら「陽Ｊ　「洋Ｊ　
「要」を出力とする。As another example, the operation of the device when the character string "Taiyou no Yo" is input will be described. When the character string "Taiyou no Yo" is input, it is first checked to see if this character string is in format l, but it is found that this character string is not in format 1, and then it is checked to see if it is in format 2. It will be done. This character string is determined to be in format 2, and the phrase designation processing section 3 is called. The idiom specification processing unit 3 analyzes the character string and converts the 0 character string 1 into kana-kanji to obtain the character string 1 = r taiyo J, position information - "right", and as a result, ``sun J'' ``ocean J''"Compendium J is obtained. From this and the position information = "right", as a kanji candidate, "yang"
You can get "Western" and "Kaname". We searched Kanji Information Dictionary 8 for these kanji, confirmed that ``yo'' exists as a reading for each kanji, and found that these ``YoJ''``YoJ''
"Key" is output.

なお、ここに示した例では、形状辞書７の構成を部首名
もしくは漢字一文字をキイとし、これを含む漢字からな
る文字列を内容とするものとしたが、形状辞書７の構成
として、部首名もしくは漢字と、位置情報の全てをキイ
として、それらによって構成される漢字を内容とするこ
とも可能である。In the example shown here, the configuration of the shape dictionary 7 is such that the key is a radical name or a single kanji character, and the content is a character string consisting of kanji including this. It is also possible to use the name or kanji and location information as keys, and the kanji composed of these as the content.

Ｆ発明の効果」以上で説明したように、この発明によれば、従来に較べ
て、簡便で使い勝手の良い単漢字変換機能が実現できる
。つまり、単漢字指定が、従来の方式に比べて熟練を必
要としない上、コードブック等も不要である。また、こ
れらの形式で指定された場合、候補となる漢字の数は単
なる読みで指定された場合りこ比べて極めて少ないため
、次候補キイ押下等の候補選択の手間も削減することが
できる。特にこれらの形式は人間が言葉を用いて漢字を
説明するときに用いる説明に類似しており、ユーザは普
段用いている説明と同じような入力をするだけで必要な
漢字を入力することができるため、その知的負荷を軽減
することにつながる。F. Effects of the Invention As explained above, according to the present invention, it is possible to realize a single kanji conversion function that is simpler and more user-friendly than before. In other words, specifying a single kanji requires less skill than the conventional method, and also does not require a codebook or the like. In addition, when specified in these formats, the number of candidate kanji is much smaller than when specified simply by reading, so the effort required to select candidates, such as pressing the next candidate key, can be reduced. In particular, these formats are similar to the explanations that humans use to explain kanji using words, and users can input the required kanji simply by inputting the same explanations they normally use. This leads to a reduction in the intellectual load.

[Brief explanation of the drawing]

第１図は、この発明の実施例の構成を示すブロック図、
第２図は、第１図中の入力文字列解析部１の処理の流れ
を示す図、第３図は第１図中の形状指定処理部２の処理
の流れを示す図、第４図は部首辞書６の内容の例を示す
図、第５図は形状辞書７の内容の例を示す図、第６図は
漢字情報辞書Ｂの内容の例を示す図である。特許出願人　　日本電信電話株式会社FIG. 1 is a block diagram showing the configuration of an embodiment of the present invention;
FIG. 2 is a diagram showing the processing flow of the input character string analysis section 1 in FIG. 1, FIG. 3 is a diagram showing the processing flow of the shape specification processing section 2 in FIG. 1, and FIG. FIG. 5 is a diagram showing an example of the contents of the radical dictionary 6, FIG. 5 is a diagram showing an example of the contents of the shape dictionary 7, and FIG. 6 is a diagram showing an example of the contents of the kanji information dictionary B. Patent applicant Nippon Telegraph and Telephone Corporation

Claims

[Claims]

(1) A shape dictionary that describes the shape of kanji, a radical dictionary that describes how to read radicals, a kanji information dictionary that describes how to read and shape kanji, and kana-kanji by considering a given character string as a clause. The kana-kanji conversion unit that performs the conversion, the kana-kanji conversion dictionary used by the above-mentioned kana-kanji conversion unit, and the character string input by specifying a single kanji character, 1, the kanji or radical name that makes up the kanji, and their 2. The reading of the idiom containing the kanji,
3. Designation by the pronunciation of the kanji in the compound word; 3. Designation by the final form of the kanji, including kunyoku; 4. If the kanji is a single character, it can be designated by the reading of the word. an input string analysis section that makes a judgment and calls any one of the shape specification processing section, phrase specification processing section, and word specification processing section according to the judgment result; and when the input string is specified according to 1 above. It is called and extracts substrings, which are character strings expressing kanji or radical names, and positional relationship information such as bottom, right, etc. from the given string, Check whether it is a radical name using the above radical dictionary, and if not, use these substrings as input to this single kanji conversion device to obtain a set of kanji candidates, and use the radical name or kanji obtained from these as input. the shape specification processing unit that searches the shape dictionary and the kanji information dictionary using the candidate and the previous relationship information to obtain a kanji candidate for the input character string; From the called and given character string, extracts the character string that is the pronunciation of the idiom and the location information indicating which part of the idiom's pronunciation corresponds to the pronunciation of the kanji, and uses the above-mentioned kana-kanji converter. Obtain kanji candidates for the input string from the kanji idiom string, which is the result of the kana-kanji conversion of the idiom reading, and the above position information.
Among these kanji candidates, the kanji information dictionary is searched to select those having the character string designated as the reading of the kanji, and the idiom designation processing unit selects the kanji candidates as kanji candidates for the input character string; Called when a character string is specified as specified in 3 and 4 above, the given character string is converted using the above kana-kanji converter, one kanji included in it is used as a kanji candidate, and among those kanji candidates A single kanji conversion device comprising: , the word specification processing unit that searches the kanji information dictionary to select a word having the input character string as a reading, and uses the word specification processing unit as a kanji candidate for the input character string.