JPH08161325A

JPH08161325A - Kana (japanese syllabary)/kanji (chinese character) conversion device and method therefor

Info

Publication number: JPH08161325A
Application number: JP6323556A
Authority: JP
Inventors: Yasuo Kasai; 庸雄河西; Takashi Yamamura; 隆志山村
Original assignee: EE I SOFUTO KK
Current assignee: EE I SOFUTO KK
Priority date: 1994-11-30
Filing date: 1994-11-30
Publication date: 1996-06-21
Anticipated expiration: 2022-01-24
Also published as: JP3873293B2

Abstract

PURPOSE: To check modification between clauses which are written with a space and to learn on modification when a next candidate is selected. CONSTITUTION: When the flag of modification learning is turned on (step S600), a modified word exists in a modifying word clause (steps S610 and 620). When the next candidate is selected in the modifying word clause or a modified word clause (steps S630 and 540), modification information is generated and is registered in a dictionary (steps S650 and 660). Only when the part of speech on the next candidate is the same as the part of speech on the candidate which is judged that modification is satisfied, information is registered. Consequently, the judgement of modification can widely be executed without making more a modification dictionary 98 uselessly large in volume.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、仮名漢字変換装置およ
び仮名漢字変換方法に関し、詳しくは単語間の係り受け
の情報を用いて、文節を構成する単語の漢字候補の選択
に利用する仮名漢字変換装置および仮名漢字変換方法に
関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a kana-kanji conversion device and a kana-kanji conversion method, and more specifically, it uses kana-kanji characters used to select kanji candidates for words constituting a bunsetsu, using information on the dependency between words. The present invention relates to a conversion device and a kana-kanji conversion method.

【０００２】[0002]

【従来の技術】従来、キーボードなどから入力された仮
名文字列を、所望の仮名漢字混じり文に変換する仮名漢
字変換装置が、日本語文の入力装置として、あるいは日
本語文の編集装置として、種々提案されている。こうし
た仮名漢字変換装置は、使用者が単語や文節の区切り位
置をいちいち指定する必要がなく、しかも変換後の文字
列は使用者が望んだ表記となるものが望まれている。日
本語には、同音異議語や同訓異議語が多数存在すること
から、誤りなく所望の仮名漢字混じり文を得るために
は、おそらく最終的には文の意味を解析しなければなら
ないが、意味を解析するためには、少なくとも有機的に
関連づけられた数万に上る言葉の知識ベースが必要とな
り、実現は極めて困難である。2. Description of the Related Art Conventionally, various kana-kanji conversion devices for converting a kana character string input from a keyboard into a desired kana-kanji mixed sentence have been proposed as an input device for Japanese sentences or an editing device for Japanese sentences. Has been done. In such a kana-kanji conversion device, it is not necessary for the user to specify the delimiter positions of words and phrases one by one, and it is desired that the converted character string has the notation desired by the user. Since there are many homophones and homonomys in Japanese, in order to get the desired mixed kana-kanji sentence without error, it is probably necessary to finally analyze the meaning of the sentence. In order to analyze the meaning, a knowledge base of at least tens of thousands of words that are organically related is necessary, which is extremely difficult to realize.

【０００３】そこで、従来の仮名漢字変換装置では、文
節分かち書きの処理や、同音異議語の選択における学習
処理を工夫し、意味の解析なしで、使用者が望む結果を
得るよう試みている。文節分かち書きの処理としては、
２文節を基本単位とし成り立ち得る文節の中で最長の文
節が得られる２文節を第１候補とする２文節最長一致法
や、文節を構成する単語の候補となり得る単語および単
語同士の組合わせにコストを付け、この点数が所定の条
件を満たす文節を第１候補とする最小コスト法などがあ
る。また、学習処理には、同音異議語の中から直前に使
用者が選択した単語を最優先で次回の候補とする同音異
議語の学習や、ある単語を含んだ文節の長さとして直前
に使用者が指定した長さを最優先とする文節長の学習な
どが知られている。Therefore, in the conventional kana-to-kanji conversion device, devising the process of segmenting and writing bunsetsu, and the learning process in selecting homonyms and oppositions, an attempt is made to obtain the desired result without the analysis of meaning. As for the processing of phrase segmentation,
For the 2-bunsetsu longest matching method in which the 2nd bunsetsu that can form the longest bunsetsu is obtained as the first candidate, and the words and combinations of words that can be candidates for the words that make up the bunsetsu. There is a minimum cost method in which a cost is added and a clause whose score satisfies a predetermined condition is set as a first candidate. In the learning process, the most recently selected homophonic word from the homonym is used as the next candidate for learning the homonym object, and the length of the phrase containing a word is used immediately before. It is known to learn bunsetsu length, which gives top priority to the length specified by the person.

【０００４】更に、最近では、単語同士の特定の関係
（例えば、「熱いお茶」の「熱い」と「お茶」、あるい
は「暑い夏」の「暑い」と「夏」）に着目し、この関係
を記憶した辞書を用意することで、一方の単語（例えば
「お茶」）が特定されたとき、この単語に関係のある言
葉（例えば「あつい」の候補のうちの「熱い」）を第１
候補として選択するものも提案されている（例えば特開
平３−１０５６６４号の「かな漢字変換装置」や特開平
４−２７７８６１号公報の「かな漢字変換装置」な
ど）。こうした単語間の特定の関係は、「係り受け」あ
るいは「共起」と呼ばれる。Furthermore, recently, attention has been paid to a specific relationship between words (for example, "hot" and "tea" in "hot tea" or "hot" and "summer" in "hot summer", and this relationship By preparing a dictionary that stores, when one word (for example, “tea”) is specified, the word related to this word (for example, “hot” among candidates for “hot”) is set as the first word.
Those which are selected as candidates have also been proposed (for example, "Kana-Kanji conversion device" in JP-A-3-105664, "Kana-Kanji conversion device" in JP-A-4-278661). The particular relationship between these words is called "dependency" or "co-occurrence."

【０００５】[0005]

【発明が解決しようとする課題】しかしながら、現実に
用いられている言語の構文規則は極めて複雑であり、係
り受けの関係をもれなく辞書に用意することは、極めて
困難であった。また、特定の個人が作成する日本語文に
総ての係り受けが必要となる訳ではなく、またあらゆる
係り受けの存在を想定した辞書を作成すると、どんな文
節間にも係り受けが成立してしまいかねず、仮名漢字変
換の精度を高めることにならない場合も考えられた。However, the syntax rules of the language actually used are extremely complicated, and it is extremely difficult to prepare all the dependency relations in the dictionary. In addition, it is not necessary for all the Japanese sentences created by a specific individual to be modified, and if you create a dictionary that assumes the existence of all modified relationships, the modification will be established between any clauses. In some cases, it may not be possible to improve the accuracy of Kana-Kanji conversion.

【０００６】更に、日本語には、様々な文型があり、例
えば「赤い」＋「花」と「花が」＋「赤い」といった関
係、あるいは「野菜を」＋「食べる」と「野菜を」＋
「食べさせる」と「野菜が」＋「食べられる」など使役
や受動の関係があり、これらの関係も含めて係り受け関
係が成立するか否かを示す辞書を作ろうとすると、辞書
が膨大なものになってしまうという問題があった。Furthermore, there are various sentence patterns in Japanese. For example, there are relationships such as "red" + "flower" and "flower" + "red", or "vegetable" + "eat" and "vegetable". +
There is a causative or passive relationship such as "eat" and "vegetable" + "eatable", and if you try to create a dictionary that shows whether a dependency relationship is established including these relationships, the dictionary will be huge. There was a problem that it became a thing.

【０００７】本発明の仮名漢字変換装置および仮名漢字
変換方法は、こうした問題を明らかにし、係り受けの関
係を利用して所望の仮名漢字混じり文を得ることを目的
としてなされ、次の構成を採った。The kana-kanji conversion device and the kana-kanji conversion method of the present invention are intended to clarify such a problem and to obtain a desired kana-kanji mixed sentence by utilizing the dependency relation. It was

【０００８】[0008]

【課題を解決するための手段および作用】請求項１の仮
名漢字変換装置は、仮名文字列を入力し、文法辞書を参
照して、仮名漢字混じり文字列候補を生成する仮名漢字
変換装置であって、前記入力された仮名文字列を、前記
文法辞書を参照して文節分かち書きする分かち書き手段
と、該分かち書きされた各文節において、該文節を構成
する単語を変換済みの形態で示す候補単語表示手段と、
所定の文節同士の係り受けの情報を記憶した係り受け情
報辞書と、前記分かち書きされた一つの文節を起点と
し、該係り受け情報辞書を参照して他の文節との係り受
けの存在を、検定する係り受け検定手段と、該係り受け
の検定により係り受け関係の成立が検定された文節の少
なくとも一方の文節において、前記候補単語表示手段に
より表示された単語とは異なる単語が選択されたとき、
該選択された単語を、係り受け情報として学習する係り
受け学習手段とを備えたことを要旨とする。The kana-kanji conversion device according to claim 1 is a kana-kanji conversion device for inputting a kana character string and referring to a grammar dictionary to generate a kana-kanji mixed character string candidate. And a word segmenting unit for segmenting the input kana character string into segment segments by referring to the grammar dictionary, and a candidate word displaying unit that indicates, in each segment segment, the words constituting the segment in a converted form. When,
A dependency information dictionary that stores information on the dependency between predetermined phrases, and a single phrase segmented as the starting point is used as a starting point, and the dependency information dictionary is referenced to verify the existence of a dependency with another phrase. When a word different from the word displayed by the candidate word display means is selected in at least one of the clauses whose dependency relationship is verified by the dependency test means,
The gist of the present invention is to include a dependency learning unit that learns the selected word as dependency information.

【０００９】この仮名漢字変換装置では、分かち書き手
段が、文法辞書を参照して、入力された仮名文字列を文
節分かち書きし、分かち書きされた各文節において、候
補単語表示手段が、文節を構成する単語を変換済みの形
態で示す。この状態で、係り受け検定手段が、分かち書
きされた一つの文節を起点とし、所定の文節同士の係り
受けの情報を記憶した係り受け情報辞書を参照して、他
の文節との係り受けの存在を検定する。この係り受けの
検定により係り受け関係の成立が検定された文節の少な
くとも一方の文節において、候補単語表示手段により表
示された単語とは異なる単語が選択されると、係り受け
学習手段が、この選択された単語を、係り受け情報とし
て学習する。従って、新たな係り受けの関係を学習して
いくことができる。In this kana-kanji conversion device, the segmentation writing unit refers to the grammar dictionary to segment the input Kana character string into segments, and in each segmented segment, the candidate word display unit includes the words constituting the segment. In converted form. In this state, the dependency checking means refers to the dependency information dictionary that stores the information of the dependency between the predetermined phrases, starting from one punctuation segment, and the presence of the dependency with other phrases. To test. When a word different from the word displayed by the candidate word display means is selected in at least one of the clauses for which the establishment of the dependency relationship is verified by this dependency test, the dependency learning means makes this selection. The learned word is learned as dependency information. Therefore, it is possible to learn new dependency relationships.

【００１０】請求項２記載の仮名漢字変換装置は、請求
項１の係り受け学習手段が、候補単語表示手段により表
示された単語とは異なる単語が選択されたとき、該単語
が係り受け関係の成立が検定されたときの単語と文法上
の性質が同一の単語か否かを判断する判断手段と、該判
断手段により文法上の性質が同じであると判断された時
のみ、前記学習を行なう学習実行手段とからなることを
要旨とする。In the kana-kanji conversion device according to claim 2, when the dependency learning means of claim 1 selects a word different from the word displayed by the candidate word display means, the word has a dependency relationship. The learning is carried out only when the judgment means judges whether or not the grammatical property is the same as the word when the establishment is tested, and the grammatical property is judged by the judging means. The gist is that it consists of learning execution means.

【００１１】従って、この仮名漢字変換装置では、新た
な係り受けの学習が、候補単語表示手段により表示され
た単語と、この単語に代えて選択された単語との文法上
の性質が同じであるときのみに行なわれ、本来検定され
た係り受けとは関係のない単語による文節の関係を係り
受けとして学習することがない。Therefore, in this kana-kanji conversion device, the learning of the new dependency has the same grammatical property between the word displayed by the candidate word display means and the word selected in place of this word. This is done only when, and there is no learning of bunsetsu relationships by words that have nothing to do with the originally tested dependencies.

【００１２】請求項３記載の仮名漢字変換装置は、請求
項２の判断手段が、係り受け関係を有する文節を構成し
得る単語について、該単語のカテゴリを記憶した辞書
と、前記両単語のカテゴリが同一の時、文法上の性質が
同一と判断する手段とを備えたことを要旨とする。従っ
て、この仮名漢字変換装置では、単語に付与されたカテ
ゴリにより文法上の性質が同一かどうかの判断を容易に
行なうことができる。尚、このカテゴリの情報は文法辞
書内に持たせることも可能である。In the kana-kanji conversion device according to claim 3, the determination means according to claim 2 stores, for a word that can form a clause having a dependency relation, a dictionary storing the category of the word and the category of both words. It is a gist to have means for judging that the grammatical properties are the same when the two are the same. Therefore, in this kana-kanji conversion device, it is possible to easily determine whether or not the grammatical properties are the same depending on the category assigned to the word. Information of this category can be held in the grammar dictionary.

【００１３】請求項４の仮名漢字変換装置は、請求項１
の係り受け検定手段が、係り受けの関係を、体言＋用
言、用言＋用言、体言＋体言、連体形＋体言、連体詞＋
体言の少なくとも一つについて検定する手段であること
を要旨とする。従って、これらの係り受けについて検定
することができる。The kana-kanji conversion device according to claim 4 is the device according to claim 1.
Dependency verification means of the dependency relation, the synonym + noun, the noun + noun, the noun + word, the adnominal form + noun, the noun +
The gist is that it is a means for testing at least one of the wording. Therefore, it is possible to test these dependencies.

【００１４】請求項５記載の仮名漢字変換装置は、請求
項４の係り受け検定手段が、体言＋用言の係り受けパタ
ーンの解析時に、使役または受動の関係を用いた係り受
けの検定を行なう第１手段を備え、前記学習手段が、該
第１手段により係り受けの成立が検定されたとき、使役
または受動の関係を原形に戻して係り受け関係を学習す
る第２手段を備えたことを要旨とする。従って、使役ま
たは受動の関係について別途係り受け情報を記憶した
り、係り受けの学習をする必要がない。In the kana-kanji conversion device according to claim 5, the dependency verification means according to claim 4 performs the dependency verification using a causative or passive relationship when analyzing the dependency pattern of the noun + phrase. A second means for learning the dependency relationship by returning the causative or passive relationship to the original form when the dependency means is verified by the first means. Use as a summary. Therefore, it is not necessary to separately store the dependency information or learn the dependency regarding the causative or passive relationship.

【００１５】請求項６記載の仮名漢字変換方法は、仮名
文字列を入力し、文法辞書を参照して、仮名漢字混じり
文字列候補を生成する仮名漢字変換方法であって、前記
入力された仮名文字列を、前記文法辞書を参照して文節
分かち書きし、該分かち書きされた各文節において、該
文節を構成する単語を変換済みの形態で示し、所定の文
節同士の係り受けの情報を記憶した係り受け情報辞書を
参照し、前記分かち書きされた一つの文節を起点とし
て、他の文節との係り受けの存在を、検定し、該係り受
けの検定により係り受け関係の成立が検定された文節の
少なくとも一方の文節において、前記表示された単語と
は異なる単語が選択されたとき、該選択された単語を、
係り受け情報として学習することを要旨とする。The kana-kanji conversion method according to claim 6 is a kana-kanji conversion method for inputting a kana character string and referring to a grammar dictionary to generate a kana-kanji mixed character string candidate. A string in which a character string is punctuated by referring to the grammar dictionary, the words constituting the bunsetsu are shown in a converted form in each of the punctuated bunsetsu, and information on the relationship between predetermined bunsetsus is stored. By referring to the dependency information dictionary, the existence of a dependency with another phrase is verified from one of the separated phrases as a starting point, and at least one of the phrases for which the establishment of the dependency relationship is verified by the dependency test. In one clause, when a word different from the displayed word is selected, the selected word is
The point is to learn as dependency information.

【００１６】[0016]

【実施例】以上説明した本発明の構成・作用を一層明ら
かにするために、以下本発明の好適な実施例について説
明する。図１は、仮名漢字変換の制御ロジックを示すブ
ロック図、図２は、この仮名漢字変換制御ロジックが実
際に動作するハードウェアを示すブロック図である。図
２に示すように、この装置は、周知のＣＰＵ２１を中心
にバス３１により相互に接続された次の各部を備える。
ＣＰＵ２１とバス３１により相互に接続された各部につ
いて、簡単に説明する。Preferred embodiments of the present invention will be described below in order to further clarify the structure and operation of the present invention described above. FIG. 1 is a block diagram showing a control logic for kana-kanji conversion, and FIG. 2 is a block diagram showing hardware on which the kana-kanji conversion control logic actually operates. As shown in FIG. 2, this device is provided with the following parts centered around a well-known CPU 21 and mutually connected by a bus 31.
Each part mutually connected by the CPU 21 and the bus 31 will be briefly described.

【００１７】ＲＯＭ２２：仮名漢字変換プログラム等を
記憶するマスクメモリ、ＲＡＭ２３：主記憶を構成する読み出しおよび書き込み
が可能なメモリ、キーボードインタフェース２５：キー
ボード２４からのキー入力を司るインタフェース、ＣＲＴＣ２７：カラーで表示可能なＣＲＴ２６への信号
出力を制御するＣＲＴコントローラ、プリンタインタフェース２９：プリンタ２８へのデータ
の出力を制御するインタフェース、ハードディスクコントローラ（ＨＤＣ）３０；ハードデ
ィスク３２を制御するインタフェース、である。ハード
ディスク３２には、ＲＡＭ２３にロードされて実行され
る各種プログラムやデバイスドライバの形式で提供され
る仮名漢字変換処理プログラム、あるいはその仮名漢字
変換処理プログラムが参照する各種変換辞書などが記憶
されている。ROM 22: Mask memory for storing kana-kanji conversion program, RAM 23: Readable and writable memory constituting main memory, keyboard interface 25: interface for controlling key input from keyboard 24, CRTC 27: display in color A CRT controller for controlling signal output to a possible CRT 26, a printer interface 29: an interface for controlling data output to the printer 28, a hard disk controller (HDC) 30; an interface for controlling the hard disk 32. The hard disk 32 stores various programs loaded and executed in the RAM 23, a kana-kanji conversion processing program provided in the form of a device driver, various conversion dictionaries referred to by the kana-kanji conversion processing program, and the like.

【００１８】こうして構成されたハードウエアにより、
文章が入力，仮名漢字変換，編集，表示，印刷などがな
される。すなわち、キーボード２４から入力された文字
列は、ＣＰＵ２１により所定の処理がなされ、ＲＡＭ２
３の所定領域に格納され、ＣＲＴＣ２７を介してＣＲＴ
２６の画面上に表示される。With the hardware thus configured,
Text is input, Kana-Kanji conversion, editing, display, printing, etc. That is, the character string input from the keyboard 24 is subjected to predetermined processing by the CPU 21, and the RAM 2
3 is stored in a predetermined area, and a CRT is sent via the CRTC 27.
It is displayed on the screen of 26.

【００１９】次に、こうして構成されたハードウエアに
より実行される機能を図１を用いて説明する。図１に示
した各部の構成と働きについて概説するが、ここで行な
われる処理は、キーボード２４より入力されたデータに
基づき、中央処理装置（ＣＰＵ２１）が実行するもので
ある。このＣＰＵ２１により、総ての処理がおこなわれ
る。仮名漢字変換については、キーボード２４が操作さ
れたとき、所定の割込処理が起動し、入力したキーイメ
ージを対応する仮名文字列に変換し、更にこれを仮名漢
字混じり文字列に変換するデバイスドライバが起動す
る。もとより、並列処理可能なコンピュータであれば、
仮名漢字変換を一つのアプリケーション（インプットメ
ソッド）が行なうものとし、変換結果を、必要とするア
プリケーションに引き渡す構成としても差し支えない。
この場合には、キーボード２４からの入力をインプット
メソッドが一括して引き受けることになる。Next, the function executed by the hardware thus constructed will be described with reference to FIG. The configuration and operation of each unit shown in FIG. 1 will be briefly described. The processing performed here is executed by the central processing unit (CPU 21) based on the data input from the keyboard 24. The CPU 21 performs all processing. Regarding kana-kanji conversion, when the keyboard 24 is operated, a predetermined interrupt process is activated, the input key image is converted into a corresponding kana-character string, and further this is converted into a kana-kanji mixed character string. Will start. Of course, if the computer can process in parallel,
Kana-Kanji conversion may be performed by one application (input method), and the conversion result may be delivered to the required application.
In this case, the input method collectively accepts the input from the keyboard 24.

【００２０】キーボード２４からのキーイメージは、文
字入力部４０により受け付けられ、ここで、対応する仮
名文字列に変換される。ローマ字入力の場合には所定の
変換テーブルを参照して、仮名文字列に変換する。一つ
の仮名文字が得られる度に文字入力部４０は、その仮名
文字を変換制御部４２に送出する。この変換制御部４２
は、仮名漢字変換の中心的な役割を果たす所であり、後
述する種々の仮名漢字変換を制御して、結果を変換後文
字列出力部４４に送出する。変換後文字列出力部４４
は、現実には、ＣＲＴＣ２７に信号を送り、ＣＲＴ２６
に変換後文字列を表示する。The key image from the keyboard 24 is accepted by the character input unit 40, and converted into a corresponding kana character string here. When inputting romaji, it refers to a predetermined conversion table and converts into a kana character string. Every time one kana character is obtained, the character input unit 40 sends the kana character to the conversion control unit 42. This conversion control unit 42
Is a central part of kana-kanji conversion, controls various kana-kanji conversions described later, and sends the result to the converted character string output unit 44. Converted character string output unit 44
Actually sends a signal to the CRTC 27,
Display the converted string.

【００２１】変換制御部４２は、受け取った仮名文字を
文字列入力部５０に引き渡す。文字列入力部５０は、文
字格納部５２に仮名文字列を格納する。この文字列に基
づいて、自立語候補作成部５４と付属語候補作成部６４
とが、単語データの候補を作成する。自立語候補作成部
５４は、ハードディスク３２に予め記憶された自立語辞
書５８を用い、自立語解析位置管理部５６の管理の下
で、得られた仮名文字列から自立語候補を抽出する処理
を行なう。一方、付属語候補作成部６４は、同じく付属
語辞書６８を用い、付属語解析位置管理部６６の管理の
下で、得られた仮名文字列から付属語候補を抽出する処
理を行なう。解析位置を移動しつつ、自立語候補と付属
語候補を抽出する処理については、後述する。The conversion control unit 42 delivers the received kana characters to the character string input unit 50. The character string input unit 50 stores the kana character string in the character storage unit 52. Based on this character string, the independent word candidate creation unit 54 and the adjunct word candidate creation unit 64
And create word data candidates. The independent word candidate creation unit 54 uses the independent word dictionary 58 stored in advance in the hard disk 32, and under the management of the independent word analysis position management unit 56, performs processing for extracting independent word candidates from the obtained kana character string. To do. On the other hand, the adjunct word candidate creating unit 64 also performs the process of extracting an adjunct word candidate from the obtained kana character string under the control of the adjunct word analysis position managing unit 66 using the adjunct word dictionary 68. The process of extracting independent word candidates and adjunct word candidates while moving the analysis position will be described later.

【００２２】ここで、自立語辞書５８は、学習により、
同音異義語や接辞などの優先順位を変更する。この学習
処理を行なうのが、係り受け学習部７０，自立語学習部
７２，補助語学習部７４，接辞学習部７６，文字変換学
習部７８である。係り受け学習部７０は、係り受けが成
立する条件で、使用者が係り受けに該当する単語以外の
語を選択した場合、同じ単語の組合わせでは、使用者が
選択した組合わせを優先するよう係り受けの関係を学習
するものである。自立語学習部７２は、同音異義語の存
在する自立語群において、最後に選択された単語を最優
先の候補とするよう学習するものである。補助語学習部
７４は、例えば「ください」などの補助語を「くださ
い」「下さい」など、いずれの語形で変換するかを学習
するものである。更に、接辞学習部７６は、接頭語，接
尾語などの変換形式（例えば、「御」「ご」など）を学
習するものである。文字変換学習部７８は、入力した文
字列をそのままひらがなやカタカナとして確定させた場
合に、その文字列を学習し、次回以降の変換処理では確
定させたひらがなまたはカタカナを候補として出力する
ものである。Here, the independent word dictionary 58 is learned by learning.
Change the priority of homonyms and affixes. The learning process is performed by the dependency learning unit 70, the independent word learning unit 72, the auxiliary word learning unit 74, the affix learning unit 76, and the character conversion learning unit 78. When the user selects a word other than the word corresponding to the dependency under the condition that the dependency is satisfied, the dependency learning unit 70 gives priority to the combination selected by the user in the combination of the same words. This is to learn the relationship of dependency. The independent word learning unit 72 learns the word selected last in the independent word group in which homonyms exist as the highest priority candidate. The auxiliary word learning unit 74 learns in which word form an auxiliary word such as "please" is converted to "please" or "please". Further, the affix learning unit 76 is for learning conversion formats such as prefixes and suffixes (for example, "Go" and "Go"). The character conversion learning unit 78 learns the input character string as it is as hiragana or katakana and outputs the confirmed hiragana or katakana as a candidate in the subsequent conversion process. .

【００２３】自立語候補作成部５４，付属語候補作成部
６４により、作成された語候補を得て、単語データ作成
部８０が、各語候補についてのデータを作成する。即
ち、得られた自立語と付属語、自立語と自立語、更には
「自立語＋付属語」からなる文節間の接続を接続検定テ
ーブル８４を参照して接続検定部８２が行なった結果、
および全体のコスト計算をコスト計算部８６が行なった
結果を得て、単語毎のデータとして出力するのである。
この単語データは、一旦単語データ格納部１００に格納
され、係り受け候補調整部９０からの調整出力を受け
て、文節分かち書きの処理に用いられる。The independent word candidate creating unit 54 and the adjunct word candidate creating unit 64 obtain the created word candidates, and the word data creating unit 80 creates data for each word candidate. That is, as a result of the connection verification unit 82 referring to the connection verification table 84, the connection between the obtained independent words and adjunct words, independent words and independent words, and further the connection between the clauses consisting of “independent words + adjunct words”,
And the cost calculation unit 86 obtains the result of the cost calculation performed by the cost calculation unit 86 and outputs the result as data for each word.
This word data is once stored in the word data storage unit 100, receives the adjustment output from the dependency candidate adjustment unit 90, and is used for the phrase segmentation writing process.

【００２４】係り受け候補調整部９０は、自立語候補作
成部５４，付属語候補作成部６４からの語候補を単語デ
ータ作成部８０，単語データ格納部１００，文節分かち
書き部１０２を介して受けて、係り受けの検定を行なう
ものである。係り受けの検定は、ハードディスク３２に
予め用意された係り受け辞書９８を参照することによっ
て行なわれる。尚、係り受け辞書は、容量を小さくする
ために係り受け関係が逆となるものについても一つの係
り受け情報のみを記憶しているに過ぎないので、文法的
な解析を伴い転置情報調整部９９により、係り受け辞書
９８の情報を拡張して、係り受け候補の調整を行なって
いる。例えば、係り語「花が」＋受け語「美しい」とい
う係り受け情報のみ係り受け辞書９８に記憶しておき、
係り語「美しい」＋受け語「花」という係り受けの検定
も行なおうとするのである。The dependency candidate adjusting unit 90 receives word candidates from the independent word candidate creating unit 54 and the adjunct word candidate creating unit 64 via the word data creating unit 80, the word data storage unit 100, and the phrase segmentation writing unit 102. , To test the dependency. The modification check is performed by referring to the modification dictionary 98 prepared in advance on the hard disk 32. Note that the dependency dictionary stores only one dependency information even if the dependency relationship is reversed to reduce the capacity, and therefore the transposition information adjusting unit 99 is accompanied by grammatical analysis. Thus, the information in the dependency dictionary 98 is expanded to adjust the dependency candidates. For example, only the modification information of the modification word "Hana ga" + the modification word "beautiful" is stored in the modification dictionary 98,
He also tries to test the dependency word "beautiful" + the dependent word "flower."

【００２５】係り受けの検定を行なう範囲は、係り受け
範囲管理部９６により管理される。また、係り受けの関
係の検定には、いくつかの許容条件があり、これが使役
・受動解析部９２，助詞許容解析部９４等により判定さ
れる。以上の係り受けの検定により調整された文節候補
から文節分かち書きの第１候補が決定され、これが文節
データ格納部１０６に格納される。格納された候補は、
変換文字列出力部１０８により変換制御部４２に出力さ
れる。変換制御部４２は、この文字列を候補文字列とし
て表示すると共に、非所望の文字列が候補となる場合も
ありえるから、使用者による指示を受けて、次候補の表
示や選択などの処理を行なう。これらの指示や選択の結
果などは、文節データ格納部１０６や既述した各学習部
７０ないし７８に入力され、文節の一部確定や学習によ
る優先順位の書換などに用いられる。なお、図示してい
ないが、使用者により文字列の確定処理がなされると、
各部に一時的に保存されたデータは総て消去され、次の
変換に備える。The range in which the dependency test is performed is managed by the dependency range management unit 96. In addition, there are some admissible conditions in the verification of the dependency relationship, and these are determined by the causative / passive analysis unit 92, the particle admission analysis unit 94, and the like. The first bunsetsu segmentation candidate is determined from the bunsetsu candidates adjusted by the above dependency test, and stored in the bunsetsu data storage unit 106. The stored candidates are
The converted character string output unit 108 outputs the converted character string to the conversion control unit 42. The conversion control unit 42 displays this character string as a candidate character string, and since an undesired character string may be a candidate, the conversion control unit 42 receives instructions from the user and performs processing such as display and selection of the next candidate. To do. The results of these instructions and selections are input to the clause data storage unit 106 and the learning units 70 to 78 described above, and are used for partially fixing the clauses and rewriting priorities by learning. Although not shown, when the character string is confirmed by the user,
All the data temporarily stored in each unit are erased and prepared for the next conversion.

【００２６】以上、仮名文字の入力から変換語文字列の
出力までを概説したが、次に各処理の詳細について説明
する。まず最初に一般的な文節分かち書きの処理につい
て説明し、次に本発明の要部である係り受けの処理につ
いて説明する。図３は、最小コスト法による文節分かち
書きの処理の概要を示すフローチャートである。図示す
るように、まず、一時的に保存されたデータの消去や解
析位置を１桁目に初期化するなどの初期化の処理（ステ
ップＳ２００）を行なった後、解析位置を求める処理を
行なう（ステップＳ２１０）。解析位置とは、それまで
に入力された仮名文字列の先頭から順に一つずつ進めら
れていく位置である。例えば、図４に示す例文「くるま
ではこをはこぶ」という仮名文字列が入力されていると
すれば、最初の解析位置は１桁目の「く」の位置であ
る。この解析位置で、ハードディスク３２に記憶された
自立語辞書５８および付属語辞書６８を検索する処理を
行なう（ステップＳ２２０）。The process from the input of kana characters to the output of the conversion word character string has been outlined above, and the details of each process will be described below. First, a general phrase segmentation process will be described, and then a dependency process, which is an essential part of the present invention, will be described. FIG. 3 is a flow chart showing an outline of the processing of phrase segmentation by the minimum cost method. As shown in the figure, first, initialization processing such as erasing of temporarily stored data and initialization of the analysis position in the first digit (step S200) is performed, and then processing for obtaining the analysis position is performed ( Step S210). The analysis position is a position where the kana character string input so far is advanced one by one from the beginning. For example, if a kana character string “Kuru ma wa ko wo ko bu ko” is input as in the example sentence shown in FIG. 4, the first analysis position is the position of “KU” in the first digit. At this analysis position, a process of searching the independent word dictionary 58 and the auxiliary word dictionary 68 stored in the hard disk 32 is performed (step S220).

【００２７】辞書の検索を行なった後、得られた単語に
ついてそれ以前の単語との結合をチェックする処理を行
ない（ステップＳ２３０）、単語間の結合がありえない
語しか得られていない場合には、更に辞書を検索する。
例えば、図４に示した例では、「こをはこぶ」の「は」
について付属語辞書６８から検索された係助詞の「は」
は、そのなど直前の格助詞「を」との結合がありえない
と判断されるから、単語データ作成部８０，接続検定部
８２による接続の検定により、無効なデータとして扱わ
れる。図４では、こうした結合チェックにより無効と判
断された語に符号「×」を付けた。なお、単語間の結合
は、接続検定テーブル８４に予め記憶されているが、こ
の接続検定テーブル８４は、単語の品詞同士の結合の可
能性についての情報を与えるテーブルであり、実施例で
は、４００×４００程度のマトリックスとして与えられ
ている。一つの解析位置での辞書検索と結合チェックが
終われば、解析位置を順に進めて更に処理を繰り返す。After the dictionary is searched, the word obtained is subjected to a process of checking the combination with the previous word (step S230), and when only the words which cannot be combined with each other are obtained, Further search the dictionary.
For example, in the example shown in FIG. 4, "ha" of "koohakobu"
The particle "ha" retrieved from the adjunct dictionary 68 for
Is determined to be impossible to combine with the immediately preceding case particle “o”, and is treated as invalid data by the connection test by the word data creation unit 80 and the connection verification unit 82. In FIG. 4, a word "x" is added to a word judged to be invalid by such a combination check. The connection between words is stored in advance in the connection verification table 84. This connection verification table 84 is a table that gives information about the possibility of combining the parts of speech of words. In the embodiment, 400 is used. It is given as a matrix of about × 400. When the dictionary search and the combination check at one analysis position are completed, the analysis positions are sequentially advanced and the processing is repeated.

【００２８】結合の可能性のある単語については、次に
コスト計算を行ない、その語の最小総コストを求める処
理を行なう（ステップＳ２４０）。この処理は、コスト
計算部８６が行なうもので、図４（Ａ）に示す例では、
「くるま」は、例えば「く」＋「る」＋「ま」、「く
る」＋「ま」「くるま」と分けることができ、これらに
単語を当てはめてゆくとき、自立語＝２、付属語＝０の
コストを持つものとし、「苦」（自立語）＋「流」（自
立語）ならば、「流」の総コストは４、と求めるもので
ある。この時、「間」のコストが４となるのは、最小の
総コストを求めるからであり、「苦」＋「流」＋「間」
のコスト６ではなく、「来る」＋「間」の場合のコスト
４を採用するからである。「で」「は」は付属語なの
で、それ以前の単語のうち最小のコストの単語「車」＝
２のコストがそれ自身のコストとなる。図４には、各語
のコストを右下に示した。For words that are likely to be combined, the cost is calculated next, and the minimum total cost of the words is calculated (step S240). This processing is performed by the cost calculation unit 86, and in the example shown in FIG.
"Car" can be divided into, for example, "ku" + "ru" + "ma", "car" + "ma" and "car". When applying words to these, independence word = 2, attached word It is assumed that the cost is = 0, and if "bitterness" (independent word) + "stream" (independent language), the total cost of "stream" is 4. At this time, the cost of "pause" is 4 because the minimum total cost is obtained, and "pain" + "flow" + "pause"
This is because the cost 4 in the case of “come” + “between” is adopted instead of the cost 6 of. Since "de" and "ha" are subordinate words, the word "car" with the lowest cost among the words before it =
The cost of 2 is its own cost. In FIG. 4, the cost of each word is shown in the lower right.

【００２９】以上のコスト計算の後で、各単語のコスト
をチェックし、不適切なコストのものを無効とする処理
を行なう（ステップＳ２５０）。不適切なコストとは、
他の語の組合わせと比べてコストが大きくなってしまう
語の組合わせである。即ち、「区」＋「留」といった語
の組合わせを選択することは、その位置までで得られる
他の語「来る」や「繰る」のコストより高くなってしま
うので、不適切なコストと判断して、これを文節候補か
ら除外するのである。この最小コストの考え方から採用
されない語を、図４では、語の右上に「●」として示し
た。なお、図４において、「○」は、その語が、上述し
た結合チェックとコストチェックの結果、文節候補を形
成する可能性のある語として残ったものであることを示
している。After the above cost calculation, the cost of each word is checked, and the process of invalidating the cost is performed (step S250). What is an inappropriate cost?
It is a combination of words that costs more than other combinations of words. That is, selecting a combination of words such as "ku" + "dome" is more expensive than other words "come" and "repeat" that can be obtained up to that position. Judgment is made and this is excluded from the phrase candidates. Words that are not adopted due to the idea of this minimum cost are shown as “●” in the upper right of the words in FIG. Note that, in FIG. 4, “◯” indicates that the word remains as a word that may form a phrase candidate as a result of the above-described combination check and cost check.

【００３０】次に、こうしてコストが与えられた単語候
補をリンクする処理を行なう（ステップＳ２６０）。即
ち、結合が有効とされた語について、その結合関係をポ
インタを設定することで関係づけるのである。図４の例
では、「来る」「繰る」「車」「まで」「で」「は」
「では」などが無効でない語として最小総コストの計算
がなされたから、「来る」「繰る」については「まで」
にリンクし、「車」については「で」「では」にリンク
するというように関係づけるのである。こうした結合チ
ェックやコスト計算、そしてリンクづけの処理を、一つ
の解析位置で総ての単語の検索が完了する間で繰り返
す。また、その解析位置での辞書の検索が完了すると、
更に解析位置を一つ進めて、新たな単語の成立を検討
し、同様に結合チェックやコスト計算などを繰り返す。Next, a process of linking the word candidates thus given the cost is performed (step S260). That is, with respect to the words for which the combination is valid, the connection relation is related by setting a pointer. In the example of FIG. 4, "come", "roll", "car", "up", "de", "ha"
Since the minimum total cost was calculated as a word such as "in" which is not invalid, "up" is "" for "come" and "iteru".
The relationship is such that "car" is linked to "de" and "de" is linked to "car". The processes of the combination check, the cost calculation, and the linking are repeated while all the words are searched at one analysis position. Also, when the dictionary search at the analysis position is completed,
Further advance the analysis position, examine the establishment of a new word, and repeat the join check and cost calculation in the same way.

【００３１】解析位置が、既に入力された最後の仮名文
字の位置に至り、全語について解析が完了した場合には
（ステップＳ２６５）、以上の処理を前提として、最小
コストのパスを検索する処理を行なう（ステップＳ２７
０）。この処理は、文節分かち書き部１０２が行なうも
ので、有効とされた語の組合わせのなかで、語に付与さ
れたコストの総和が最小になるものを検索する処理であ
る。「くるまではこをはこぶ」の例では、図４（Ｂ）に
実線Ｊのパスとして示すように、「車で」＋「箱を」＋
「運ぶ」という分かち書きが総コスト１８となるので、
最小コストとして選択される。なお、最小コストではな
いが、他の文節分かち書きの候補も検索される。例え
ば、図４（Ｂ）に破線Ｂのパスとして示すように、「車
では」＋「子を」＋「運ぶ」という分かち書き（コスト
＝２０）である。こうして分かち書きの候補を作成した
後（ステップＳ２８０）、今度は各文節の内部での候補
を作成する処理を行なう（ステップＳ２９０）。即ち、
ひとつの文節分かち書きの内部で、例えば「はこを」に
対して「箱を」や「函を」といった候補を用意するので
ある。これらの文節の候補や単語の候補は、使用者によ
り文節の分け方をかえるよう指示されたり、次候補を表
示するよう指示された場合に使用される。When the analysis position reaches the position of the last kana character that has already been input and the analysis has been completed for all words (step S265), the process of searching the path with the minimum cost is premised on the above process. (Step S27
0). This processing is performed by the phrase segmentation writing unit 102, and is a processing of searching for a combination of valid words that minimizes the total cost given to the words. In the case of "Walking until you come", as shown by the path of the solid line J in Fig. 4 (B), "By car" + "Box" +
The total cost is 18 for the "carry" segment.
Selected as the lowest cost. It should be noted that although the cost is not the minimum, other phrase segmentation candidates are also searched. For example, as shown by a path indicated by a broken line B in FIG. 4B, the segmentation is “by car” + “child” + “carry” (cost = 20). After creating the segmentation candidate in this way (step S280), this time, the process of creating the candidate inside each phrase is performed (step S290). That is,
Within a bunsetsu syllabary, candidates such as "box" and "box" for "hako" are prepared. These bunsetsu candidates and word candidates are used when the user instructs to change the way of bunsetsu division or to display the next candidate.

【００３２】次に、こうして文節分かち書き処理がなさ
れ、各文節について漢字候補が作成された後で実行され
る係り受け検定処理について説明する。図５は仮名漢字
変換を行なおうとする例文を示す説明図であり、図６は
係り受け検定処理ルーチンを示すフローチャートであ
る。ここでは例文として、図５（Ａ）に示すように、
『わたしはくるまをかえた。』という文を取り上げる。
最小コスト法を用いた分かち書きの処理により図５
（Ｂ）に示す文節が取り出され、各文節内で図５（Ｃ）
に示す単語が第１候補として表示されたと仮定する。な
お、各単語候補はそれまでに学習された第１候補であ
る。この場合の係り受けの検定処理ルーチンについて、
図６に従って説明する。Next, the dependency verification process executed after the phrase segmentation process is performed and the kanji candidates are created for each phrase will be described. FIG. 5 is an explanatory view showing an example sentence in which kana-kanji conversion is performed, and FIG. 6 is a flowchart showing a dependency verification processing routine. Here, as an example sentence, as shown in FIG.
"I changed my car. ] Is taken up.
As a result of the segmentation process using the minimum cost method, FIG.
The phrase shown in (B) is extracted, and each phrase is shown in FIG. 5 (C).
It is assumed that the word shown in is displayed as the first candidate. Note that each word candidate is the first candidate learned so far. Regarding the dependency verification processing routine in this case,
It will be described with reference to FIG.

【００３３】図６の係り受け検定処理ルーチンが起動さ
れると、まず検定範囲（通常は、分かち書きされた全文
節）の終端の一つ前の文節を係り語として設定する処理
を行なう（ステップＳ４００）。係り受けは、実施例で
は、係り語を優先として対応する受け語を検索するの
で、少なくとも一つの受け語を持つよう、検定範囲の終
端一つ前の文節を係り語の文節として設定するのであ
る。また、この時処理の各変数を初期化する処理も併せ
行なう（例えばｎを値１に設定する）。次に係り語から
ｎ個後方の文節を受け語として設定する処理を行ない
（ステップＳ４１０）、係り受け辞書９８を検索する処
理を行なう（ステップＳ４２０）。尚、係り受け辞書
は、「受け語語根」＋「係り語」という形式で記録され
ている。When the dependency verification process routine of FIG. 6 is started, first, a process of setting the phrase immediately before the end of the verification range (usually, all the separated phrases) as a dependent word is performed (step S400). ). In the embodiment, the dependency is searched for the corresponding dependent word by giving priority to the dependent word, so that the clause immediately before the end of the test range is set as the clause of the dependent word so as to have at least one dependent word. . Further, at this time, a process of initializing each variable of the process is also performed (for example, n is set to a value 1). Next, a process of setting a phrase n items behind the dependent word as a word is performed (step S410), and a process of searching the dependent dictionary 98 is performed (step S420). The dependent dictionary is recorded in the form of “received word root” + “dependent word”.

【００３４】辞書の一例を図７に示す。図７（Ａ）は、
「機転＋利く」という係り受けが存在する場合の辞書の
内容を模式的に示したものである。この辞書は、見出し
語である「ききてん」という読みと「利く」と「機転」
という語が登録されており、「車を＋代える」という係
り受けであれば、図７（Ｂ）に示すように、見出し語で
ある「かえくるま」という読みと「代える」と「車」と
いう語が登録されている。尚、末尾には、後述する付属
語許容解析用の付属語情報が付属している。実際の辞書
は、この他検索用のインデックスや語の長さを示す情報
などが付帯している。An example of the dictionary is shown in FIG. FIG. 7 (A) shows
It is a diagram schematically showing the contents of the dictionary in the case where there is a dependency of "quick and fast". This dictionary consists of the headwords "Kikiten" and the words "difficult" and "quick".
If the word is registered and the dependency is “to change the car +”, as shown in FIG. 7B, the reading of the entry word “kaeduma” and the words “replace” and “car” are used. The word is registered. In addition, attached word information for attached word allowance analysis described later is attached to the end. The actual dictionary is additionally provided with an index for searching, information indicating the word length, and the like.

【００３５】係り語として設定された語と受け語として
設定され語とから係り受け辞書を検索した結果、係り語
と受け語から作成された見出し（例えば「かえくる
ま」）が辞書に存在するか否かを判断し（ステップＳ４
３０）、係り受けが辞書に見いだされなければ、検定範
囲の終端まで検定したか否かを判断し（ステップＳ４４
０）、終端まで検定していなければ、変数ｎを値１だけ
インクリメントし（ステップＳ４５０）、ステップＳ４
１０から処理を繰り返す。係り受けが係り受け辞書９８
に見いだされた場合には（ステップＳ４３０）、受け語
に既に係り受けが成立しているか否かの判断を行なう
（ステップＳ４６０）。これは、図８に示すように、一
度係り受けＱ１＋Ｒ２が成立した場合、次にその前の文
節Ｐについて係り受けを判定するとき、既に見い出され
たＱ１＋Ｒ２という係り受けを優先するよう処理を異な
らせるためである。As a result of searching the dependency dictionary from the word set as the dependent word and the word set as the dependent word, is there a headline (eg, "kaerumuma") created from the dependent word and the dependent word in the dictionary? It is determined whether or not (step S4
30) If the dependency is not found in the dictionary, then it is determined whether the end of the verification range has been verified (step S44).
0), if the end is not verified, the variable n is incremented by the value 1 (step S450), and step S4
The process is repeated from 10. Dependency is dependent dictionary 98
If it is found (step S430), it is determined whether the received word is already dependent (step S460). As shown in FIG. 8, when the dependency Q1 + R2 is once established, when the dependency is determined next for the preceding clause P, the processing that is different from the dependency Q1 + R2 already found is given priority. This is because.

【００３６】受け語側（即ち語Ｐと語Ｑとの係り受け判
定における語Ｑ）について既に係り受けが成立している
のでなければ、処理はステップＳ４７０以下に移行し、
まず係り語優先で係り受けの成立する語を検索る。係り
語優先で係り受けの成立する語を検索するのは、ステッ
プＳ４３０の判断により、少なくともひとつ係り受けが
有り得る語が存在することが分かっているから、どの単
語について係り受けが成立するかを、係り語の側の順位
を先にして判断するのである。この検索の様子を図９に
示す。係り語の側に、その読みに対して複数の単語が見
い出されている場合、その第１候補Ｘ１を固定して、受
け語Ｙについて、既に自立語辞書５８に配列された学習
の順位に従って、順位の高い側から順に、Ｙ１→Ｙ２→
Ｙ３→Ｙ４・・・という順に係り受けが成立するかを見
て行くのである（図９符号Ａ１の検索）。この検索によ
って、係り受けを満足する単語が見い出されなかった場
合には、次の係り語Ｘ２を選択して同様に検定を行なう
（符号Ａ２の検索）。If the dependency has not already been established for the received word side (that is, the word Q in the dependency determination of the word P and the word Q), the process proceeds to step S470 and thereafter.
First, the words that have a dependency are found with priority on the dependency. In order to search for a word that is dependent on priority depending on the dependent word, it is known from the determination in step S430 that there is at least one possible dependent word. The order of the related words is decided first. The state of this search is shown in FIG. When a plurality of words are found for the reading on the side of the dependent word, the first candidate X1 is fixed, and for the received word Y, according to the order of learning already arranged in the independent word dictionary 58, Y1 → Y2 →
It is checked whether the dependency is established in the order of Y3 → Y4 ... (Search by reference numeral A1 in FIG. 9). If a word satisfying the dependency is not found by this search, the next dependency word X2 is selected and the test is performed in the same manner (search for symbol A2).

【００３７】こうした検索により、係り受け辞書９８か
ら読み出された係り受けを満足する係り語と受け語の組
みが見い出されたとき、次に付属語の許容解析を行なう
（ステップＳ４８０）。この処理について説明する。When a combination of a dependent word and a dependent word that satisfies the modification read out from the modification dictionary 98 is found by such a search, the adjunct word is allowed to be analyzed (step S480). This processing will be described.

【００３８】助詞の許容解析は、係り受けのタイプによ
り定義された許容関係を満たしているかを判断するもの
であり、係り受けのタイプ毎に次の類型を持つ。［Ｉ］連用修飾型名詞＋助詞＋用言の場合の助詞格助詞「が」「から」「で」「と」「に」「へ」「よ
り」「を」「の」係助詞「は」用言連用形＋用言の場合名詞＋用言（助詞省略型）の場合の省略可能な助詞「が」「は」係助詞，副助詞［ＩＩ］連体修飾型名詞＋助詞＋名詞の場合の助詞「の」体言＋体言（並列）の場合の助詞「や」「と」用言連体形＋名詞の場合連体詞＋名詞の場合The admissibility analysis of particles is to determine whether or not the admissibility relation defined by the dependency type is satisfied, and has the following types for each dependency type. [I] Consecutive modifier noun + particle + particle in case of adjective Case particle “ga” “kara” “de” “to” “ni” “e” “yori” “o” “no” particle “ha” In the case of the noun + noun + noun + noun + noun (particle abbreviation type), the optional particles "ga", "ha", and adverbs [II] adnominal modifier noun + particle + noun particle In the case of "no" body language + body language (parallel) "ya""to" in the case of conjunctions + nouns In the case of conjunctions + nouns

【００３９】即ち、係り受け関係にあると判断された２
つの語の関係が上記のないしのいずれかに属すると
して、係り受け関係にある両語の間に存在する付属語
（大部分は助詞もしくは助詞的表現）が上記のいずれか
に該当する場合は、係り受け辞書９８には係り受け関係
を有する語について許容する助詞の設定がなされている
から、これを検定するのである。例えば、「機転」と
「利く」との間の係り受けが助詞の許容設定（の・が）
を伴っている場合、上記のケース（名詞＋助詞＋用
言）に属するから、「の」「が」は両語間に存在可能で
あるけれども（機転が利いた、機転の利いた→○）、他
の格助詞「から」「で」などは許容できない（機転から
利いた、機転で利いた→×）ということになる。That is, it is determined that the relationship is 2
If the relation between two words belongs to any of the above or any of the above, and if the adjunct word (mostly a particle or particle-like expression) that exists between the two words in the dependency relationship falls under any of the above, The dependency dictionary 98 is set with a particle that allows a word having a dependency relationship, and this is verified. For example, the dependency between "tact" and "taku" is the setting of the particle allowance (no ・ ga)
If it is accompanied by, it belongs to the above case (noun + particle + noun), so "no" and "ga" can exist between both words (quickly, witty → ○) , Other case particles such as "kara" and "de" are unacceptable.

【００４０】ないしの各関係について、そこに挙げ
られたもの以外については、許容されると判断する。こ
の許容されると判断する例を以下に列挙するが、これら
は、係り受けとしては実際の表現としては成り立たない
場合を含む可能性がある。しかし、係り受けは、実際の
人間の言語活動としては、広い概念であり、あまりに厳
格な係り受けの取り決めはむしろ現実にそぐわないこと
が多い。また、余りに厳密な係り受けの取り決めは係り
受け辞書９８のいたずらな増大を招くだけであり、係り
受け検定の速度も低下させる。そこで、本実施例では、
付属語の許容について、係り受けの生じる関係をから
に分け、その中で許容・非許容の明確なものについて
は、係り受け辞書に許容するものとして係り受け関係の
成り立つ語と共に記憶し、それ以外については、許容す
るものとしたのである。With respect to each of the relationships (1) to (4), it is judged that the relationships other than those listed therein are allowed. The examples of judging that this is permissible are listed below, but these may include cases in which the dependency does not hold as an actual expression. However, dependency is a broad concept as an actual human language activity, and too strict dependency arrangements are often not practical. Further, too strict dependency arrangement only causes an unnecessarily large increase in the dependency dictionary 98, and also reduces the speed of dependency verification. Therefore, in this embodiment,
Regarding the admissibility of ancillary words, the relationship in which the dependency is generated is separated from that, and if there is a clear allowance / non-allowance, it is stored in the dependency dictionary along with the words that have the dependency relationship, and otherwise. With regard to

【００４１】［ＩＩＩ］許容される表現−連用修飾形の
場合・名詞＋格助詞的表現＋用言における格助詞的表現「ずつ」「として」「のため」「において」「によっ
て」など、・名詞＋係助詞＋用言における係助詞「こそ」「さえ」「しか」「でも」「も」など、・名詞＋副助詞＋用言における副助詞「きり」「くらい」「ずつ」「だけ」など、・名詞＋副助詞的表現＋用言における副助詞的表現「なので」「なら」など、・用言＋助詞＋用言における助詞「のは」など・接続助詞「ので」「から」「から」「て」など、・接続助詞的表現「からには」「ためには」「ほど」
「うえ」など、・用言＋用言を並列させる表現「か」「し」「たり」
「と同時に」など、[III] Permissible expressions-in the case of continuous modified forms: noun + case particle expression + case particle expression in a noun "each""as""for""in""by", etc. Noun + particle + particle particle in adjectives “koso,” “sae,” “shika,” “but,” “mo,” etc. ・ Noun + auxiliary particle + auxiliary particle in adjectives “cut”, “about”, “gatsu”, “only” Etc. ・ Noun + Adverbial expression + Adverbial expression in adjectives "because", "Nara", etc. ・ Adjective + Particles + Particles in adjective "Noha" etc. ・ Connective particles "Node", "Kara""Kara","te", etc. ・ Connective particle-like expressions "kara ni""wa" and "ho"
"Ue", etc. ・ Expression in which verbs and verbs are arranged in parallel "ka""shi""tari"
"At the same time", etc.

【００４２】［ＩＶ］許容される表現−連体修飾形・名詞＋助詞的表現＋名詞における助詞的表現「における」「に関する」「に基づいて」など、・用言＋助詞的表現＋名詞における助詞的表現「ための」「といった」「に伴う」「などの」「ごと
き」など、・体言＋体言を並列させる表現「か」。[IV] Permissible expression-adjective modification form-Noun + particle expression + particle-like expression in nouns "in", "relating", "based on", etc. ・ Phrase + particle expression + particle in noun Expressions such as “for”, “to”, “to accompany”, “to”, “toki”, etc.

【００４３】以上の規則に従って、係り受けの関係が見
い出された２つの語の間の付属語の許容について判断す
る。例として挙げた「車を」「代えた」の場合には、係
り受けの成立が認められる。そこで、これを判定し（ス
テップＳ４８０）、係り受けが成立した場合には、成立
が認められた語を、その文節を構成する自立語における
係り語と受け語との第１候補とする処理を行なう（ステ
ップＳ４９０）。即ち、自立語辞書５８の同音異語の学
習による登録順序を入れ換えるのである。更にこうして
見い出された受け語から係り語までの間を係り受け成立
済み範囲として登録し、これを管理する処理を行ない
（ステップＳ５００）、全範囲について係り受けの検索
を行なったか否かの判断（ステップＳ５１０）に進む。
なお、両語間に存在する付属語により係り受けの成立が
否定された場合には、第１候補の変更を行なわず、更に
その係り語と受け語について係り受け情報が他にあれば
同様の検定を行ない（図示省略）、なければ全範囲につ
いて終了したかの判断に移行する。In accordance with the above rules, the admissibility of the adjunct word between the two words for which the dependency relationship is found is determined. In the case of "changing a car" and "changing", which are given as examples, the dependency is recognized. Therefore, this is determined (step S480), and when the dependency is established, the word for which the establishment is recognized is set as the first candidate of the dependent word and the dependent word in the independent word forming the clause. Perform (step S490). That is, the registration order for learning homonyms in the independent word dictionary 58 is changed. Further, the range from the dependent word to the dependent word thus found is registered as the dependent-established range, and processing for managing this is performed (step S500), and it is determined whether or not the dependent relationship is searched for in the entire range ( It proceeds to step S510).
If the dependency is denied due to an adjunct existing between the two words, the first candidate is not changed, and if there is other dependency information for the related word and the received word, the same change is performed. The test is performed (not shown), and if not, the process shifts to the judgment as to whether or not the whole range has been completed.

【００４４】係り受け検定の全範囲、即ち、図６に示し
た処理により確定された係り受けの解析範囲の全部につ
いて係り受けの検定が終了していなければ、変数ｎを値
１に初期化し（ステップＳ５２０）、更に係り語を一つ
前の文節に移動し（ステップＳ５３０）、全範囲につい
て係り受けの検定が完了するまで、上記の処理（ステッ
プＳ４１０ないし５００）を繰り返す。If the dependency test has not been completed for the entire range of the dependency test, that is, the entire dependency analysis range determined by the processing shown in FIG. 6, the variable n is initialized to the value 1 ( In step S520, the related word is further moved to the preceding clause (step S530), and the above-described processing (steps S410 to S500) is repeated until the modification verification of the entire range is completed.

【００４５】ステップＳ４６０において、受け語に既に
係り受けが成立していると判断された場合には、既に係
り受けが成立しているとされた受け語を用いた係り受け
が成立するかを判断する（ステップＳ５４０）。即ち、
図９（Ａ）に示した例では、語Ｑ１と語Ｒ２との間に係
り受けが成立していた場合、語Ｐと語Ｑとの間の係り受
けの判定の際には、受け語となる語Ｑ１を固定して、係
り受けの成立を判断するのである。従って、この場合に
は、係り受けＰ１＋Ｑ２が存在しても、これを採用する
ことはない。但し、受け語をＱ１とする係り受けＰ２＋
Ｑ１が見い出されれば、これは係り受けの成立とする。
従って、図９（Ｂ）に示したように、Ｐ２＋Ｑ１＋Ｒ２
という係り受けは成立するのである。図５に示した例文
では、係り受けの検定範囲とされた「わたしはくるまを
かえた」について、「車を」＋「代えた」という係り受
けが見い出されて、「車」「代えた」が第１候補として
学習されたあとでは、「私は」＋「代えた」という係り
受けが存在したとして、この係り受けが採用されること
はない。他方、仮に「私は」＋「車を」という係り受け
が存在すれば、これは採用される。If it is determined in step S460 that the dependent word is already dependent, it is determined whether the dependent word using the dependent word that is already dependent is established. Yes (step S540). That is,
In the example shown in FIG. 9A, when the dependency is established between the word Q1 and the word R2, when the dependency between the word P and the word Q is determined, The word Q1 is fixed and the dependency is determined. Therefore, in this case, even if the dependency P1 + Q2 exists, it is not adopted. However, the dependency P2 + with the received word as Q1
If Q1 is found, it is a dependency.
Therefore, as shown in FIG. 9B, P2 + Q1 + R2
The dependency is established. In the example sentence shown in FIG. 5, for “I changed the car”, which was defined as the verification range of the dependency, a dependency of “car” + “replaced” was found, and “car” “replaced” After learning as the first candidate, this dependency is not adopted because there is a dependency of "I am" + "replaced". On the other hand, if there is a dependency of "I" + "car", this is adopted.

【００４６】そこで、この場合には、係り語の第１候補
（例では語Ｐ２）を確定する（ステップＳ５５０）。そ
の後、同様に係り受けの範囲の管理（ステップＳ５０
０）と、全範囲についての係り受け検定の終了かの判断
（ステップＳ５１０）とを行なう。係り受けの検定範囲
として確定された全範囲について係り受けの検定が終了
した場合には、「ＥＮＤ」に抜けて本ルーチンを終了す
る。Therefore, in this case, the first candidate of the related word (word P2 in the example) is determined (step S550). After that, similarly, the range of dependency is managed (step S50).
0) and whether the dependency check for the entire range is completed (step S510). When the dependency verification is completed for the entire range determined as the dependency verification range, the process goes to "END" and the present routine is terminated.

【００４７】次に、係り受けの学習ルーチンについて、
図１０に従って説明する。図５に示した例文について、
最初の３文節では、「私は」「車を」「代えた。」とい
う単語がそれぞれ第１候補として表示されて、係り受け
として「車を」＋「代えた」が成立と判定されていると
する。この状態で係り受け学習処理ルーチンが起動され
ると、まず係り受け学習フラグがオンとなっているか否
かを判定する（ステップＳ６００）。係り受けの学習を
するか否かは設定できるようになっているので、これを
フラグにより判定するのである。係り受け学習がオフに
なっていれば、何も行なわず「ＥＮＤ」に抜けて、本ル
ーチンを終了する。なお、仮名漢字変換における学習に
は様々なものがあり、実施例では、係り受けの学習の前
に、自立語学習（自立語の複数の候補のうち選択された
ものを次の仮名漢字変換時の第１候補とする学習）、付
属語の学習（接頭語「ご」「御」や助数詞「回」「会」
などのいずれを優先するかの学習）、文節学習（文節の
長さなど文節としていずれを優先するかの学習）などの
学習がなされている。これらの学習は、すべて学習を行
なうか否かの設定が可能となっており、フラグのオン／
オフの形で記憶されている。Next, regarding the modification learning routine,
This will be described with reference to FIG. Regarding the example sentence shown in FIG.
In the first three phrases, the words "I am", "car", and "replaced" are displayed as first candidates, and it is determined that "car" and "replaced" are established as dependencies. And When the dependency learning processing routine is started in this state, it is first determined whether or not the dependency learning flag is turned on (step S600). Whether or not the dependency learning is performed can be set, and this is determined by the flag. If the dependency learning is off, nothing is done and the process goes to "END" to end this routine. There are various kinds of learning in kana-kanji conversion. In the embodiment, independent word learning (a selected one of a plurality of independent word candidates is changed to the next kana-kanji conversion time) before the modification learning. Learning as the first candidate of the), learning of adjuncts (prefix "go""go" and classifier "times""kai"
Learning such as which one is prioritized) and bunsetsu learning (learning which one of the bunsetsu such as the length of the bunsetsu is prioritized) is performed. For these learnings, it is possible to set whether or not all learning is performed, and the flag is set to ON / OFF.
It is stored in the off state.

【００４８】係り受け学習フラグがオンに設定されてい
れば、次に選択状態となっている文節、すなわち仮名漢
字変換中で、現在係り受け学習のために注目している文
節が係り語文節であるか否かの判定を行なう（ステップ
Ｓ６１０）。係り語文節か否かは、図６に示した処理ル
ーチンにより判定され、記憶されているので、この情報
に基づいて判定する。係り語文節でなければ何も行なわ
ずに「ＥＮＤ」に抜けて終了し、他方係り語文節である
と判断されれば、受け語の文節を探す処理を行なう（ス
テップＳ６２０）。係り受けの情報から係り語に対応す
る受け語を見いだすことは容易である。変換中のカーソ
ルが置かれた語が「車を」であれば、この語が係り語と
判断され、「代えた」が受け語として見出される。If the dependency learning flag is set to ON, the phrase in the next selected state, that is, the phrase currently being focused on for dependency learning during kana-kanji conversion is the dependency phrase. It is determined whether or not there is (step S610). Since it is determined and stored by the processing routine shown in FIG. 6 whether or not it is a dependent phrase, it is determined based on this information. If it is not a related word bunsetsu, nothing is performed and the process ends in "END", and if it is determined to be a related word bunsetsu, a process of searching for a phrase of the received word is performed (step S620). It is easy to find the dependent word corresponding to the dependent word from the dependent information. If the word on which the cursor under conversion is placed is "car", this word is determined to be a related word, and "substituted" is found as a received word.

【００４９】次に、係り語文節で次候補の選択がなされ
たか否かの判断を行ない（ステップＳ６３０）、係り語
文節で次候補の選択がなされたのでなければ、受け語の
文節で次候補の選択がなされた否かの判断を行なう（ス
テップＳ６４０）。これらの処理は、係り語や受け語の
文節において、例えば係り受けが成立するとして第１候
補として表示された「車を」に代えて「クルマを」が選
択されたり（係り語における次候補の選択）、「代え
た」に代えて「替えた」が選択された場合（受け語にお
ける次候補の選択）に相当する。Next, it is judged whether or not the next candidate is selected in the related word phrase (step S630). If the next candidate is not selected in the related word phrase, the next candidate is selected in the received word phrase. It is determined whether or not is selected (step S640). In these processes, in the phrase of the dependent word or the dependent word, for example, “car” is selected instead of “car” displayed as the first candidate because the dependent is established (the next candidate in the dependent word is selected). Selection) corresponds to the case where “changed” is selected instead of “changed” (selection of the next candidate in the received word).

【００５０】これらの場合には、次に係り受け情報を作
成する処理を行なう（ステップＳ６５０）。係り受け情
報の作成については、図１１を用いて後述する。係り受
け情報を作成した後、この情報を用いて係り受け辞書９
８への登録を行ない（ステップＳ６６０）、その後「Ｅ
ＮＤ」に抜けて本ルーチンを終了する。なお、係り語や
受け語の文節で次候補の選択が行なわれなかった場合
は、係り受けの学習をする必要は特にないから、何も行
なわず「ＥＮＤ」に抜けて、本ルーチンを終了する。In these cases, processing for creating dependency information is then performed (step S650). The creation of dependency information will be described later with reference to FIG. After creating the dependency information, the dependency dictionary 9 is used by using this information.
8 is registered (step S660), and then "E"
ND "is reached and this routine ends. If the next candidate is not selected in the phrase of the dependent word or the dependent word, there is no particular need to learn the dependent word, so nothing is done and the process ends and the routine ends. .

【００５１】以上説明した図１０の係り受け学習処理ル
ーチンが実行されると、係り受け学習フラグがオンであ
り、係り受けの成立が認められた係り語文節と受け語文
節のいずれかで、次候補の選択がなされると、係り受け
情報が作成され、係り受け辞書９８への登録が行なわれ
ることになる。When the dependency learning processing routine of FIG. 10 explained above is executed, the dependency learning flag is ON, and the following dependency is detected in any of the dependency phrase and the dependency phrase for which the dependency is recognized. When a candidate is selected, dependency information is created and registered in the dependency dictionary 98.

【００５２】次に、係り受け情報の作成処理ルーチン
（図１０ステップＳ６５０）の詳細について、図１１の
フローチャートを用いて説明する。係り受け情報の作成
処理では、まず次候補が選択された係り語、受け語の品
詞種類のチェックを行ない（ステップＳ７００）、係り
受けを行なう品詞か否かの判断を行なう（ステップＳ７
０５）。第１候補が表示された状態では、係り受けは成
立とされていたわけであるが、次候補の選択によっては
係り受け関係からはずれてしまうことがあるからであ
る。例えば、「川の」＋「流れ」という係り受けが辞書
に登録されていて、その成立が検定された場合で、「川
の」（名詞＋助詞「の」）に代えて「河野」（固有名
詞、姓）が選択されると、品詞が異なってしまい、通常
は対応する語との間で係り受けを成立させない品詞とな
ってしまう。係り受けを行なう品詞でなくなったと判断
された場合には、何も行なわず「ＥＮＤ」に抜けて、本
処理ルーチンを終了する。Next, details of the modification information creation processing routine (step S650 in FIG. 10) will be described using the flowchart in FIG. In the modification information creation process, first, the modification word for which the next candidate is selected and the part-of-speech type of the modification word are checked (step S700), and it is determined whether or not the modification part is part-of-speech (step S7).
05). This is because the dependency was established when the first candidate was displayed, but it may be out of the dependency depending on the selection of the next candidate. For example, if the dependency "Kawano" + "Flow" is registered in the dictionary and its success is verified, "Kawano" (proprietary) instead of "Kawano" (noun + particle "no") When a noun, a family name) is selected, the part of speech is different, and usually the part of speech is not dependent on the corresponding word. If it is determined that the part of speech to be subjected to the dependency is no longer present, nothing is done and the process goes to "END" to end this processing routine.

【００５３】選択された次候補が係り受けを行なう品詞
であると判断された場合には、次に係り受けが成立する
とされた係り語と受け語の文節の関係が、係り受け関係
があり得るとして用意した次の５つの類型のいずれに属
するかの判断を行なう。すなわち、「体言」＋「用言」
であるか否か判断（ステップＳ７１５）、「用言」＋
「用言」であるか否かの判断（ステップＳ７２０）、
「体言」＋「体言」であるか否かの判断（ステップＳ７
２５）、「連体形」＋「体言」であるか否かの判断（ス
テップＳ７３０）、「連体詞」＋「体言」であるか否か
の判断（ステップＳ７３５）である。これらの判断がす
べて「ＮＯ」であれば、係り受け学習を行なう組み合わ
せではないとして、「ＥＮＤ」に抜けて本ルーチンを終
了する。When it is determined that the selected next candidate is the part of speech for which the modification is performed, the relationship between the modification word for which the modification is established next and the clause of the modification word may have a modification relationship. It is determined which of the following five types prepared as above belongs to. That is, "hymn" + "definition"
(Step S715), the "verb" +
Judgment as to whether it is a "defective" (step S720),
Judgment as to whether or not it is "hymn" + "hymn" (step S7)
25), "unified form" + "syllable" determination (step S730), and "union" + "dynamic" determination (step S735). If all of these determinations are "NO", it is determined that the combination is not a combination for performing dependency learning, the process ends in "END", and the present routine ends.

【００５４】他方、係り語と受け語の文節とが「体言」
＋「用言」の関係にあると判断された場合には、（ステ
ップＳ７１５）、次にこの「体言」＋「用言」の係り受
けのパターンの詳細を解析する処理を行なう（ステップ
Ｓ７４０）。この解析処理の詳細を示したのが、図１２
のフローチャートである。「体言」＋「用言」の係り受
けには、様々なパターンがあるので、詳しくこれを解析
するのである。この解析処理について、図１２を参照し
て説明する。On the other hand, the related word and the phrase of the receiving word are "syntax"
If it is determined that the relation is ++ "defective" (step S715), then processing for analyzing the details of the dependency pattern of "predictive" + "defective" is performed (step S740). . The details of this analysis processing are shown in FIG.
It is a flowchart of FIG. There are various patterns in the dependency of "hymn word" + "defective word", so this should be analyzed in detail. This analysis processing will be described with reference to FIG.

【００５５】「体言」＋「用言」のパターン解析処理ル
ーチンが開始されると、まず「体言」＋「用言」の間に
助詞が存在するか否かの判断を行ない（ステップＳ８０
０）、存在しないと判断された場合には、省略可能な助
詞（例えば格助詞「が」）が省略されているケースであ
ると判断し、これは係り受け辞書９８に係り受けとして
登録可能と判断する（ステップＳ８０５）。他方、「体
言」＋「用言」の間に助詞が存在すると判断された場合
には（ステップＳ８００）、その助詞が「の」や「や」
であるか否かの判断を行なう（ステップＳ８１０）。通
常、係り受けを構成する「体言」＋「用言」の間に助詞
「の」や「や」が存在することはあり得ないから、係り
受け成立と判断して各文節の単語の第１候補を表示した
けれども、係り受けの成立との判断が誤っており、次候
補の選択がなされたと考えられる。従ってこの場合に
は、何も行なわず「ＥＮＤ」に抜けて本ルーチンを終了
する。When the pattern analysis processing routine of "syntax" + "synonym" is started, it is first judged whether or not there is a particle between "synomony" + "synonym" (step S80).
0) If it is determined that it does not exist, it is determined that an optional particle (for example, the case particle “ga”) is omitted, and this can be registered as a dependency in the dependency dictionary 98. It is determined (step S805). On the other hand, when it is determined that a particle is present between the “syntax” and the “defective” (step S800), the particle is “no” or “ya”.
It is determined whether or not (step S810). Normally, there is no possibility that there is a particle "no" or "ya" between the "composition" and the "verb" that make up the dependency, so it is judged that the dependency is established and the first word of each phrase Although the candidate was displayed, it is considered that the next candidate was selected because the judgment that the dependency was established was incorrect. Therefore, in this case, nothing is done and the process goes to "END" to end this routine.

【００５６】他方、「体言」＋「用言」の間に「の」や
「や」以外の助詞が存在すると判断された場合には（ス
テップＳ８１０）、「体言」＋「用言」が使役の関係に
あるか否かの判断を行なう（ステップＳ８１５）。使役
とは、体言の文節である「人（もしくはもの）が」、
「なになにを」あるいは「人（もしくはもの）に」と、
用言の文節である「○○させる」とからなるいう性質を
持つから、用言の語尾型からこれ判断するのである。例
えば、「親鳥が餌を雛に食べさせる」といった文があり
得、この場合、「体言」＋「用言」の形で使役の係り受
けが見出されるのは、「餌を」＋「食べさせる」や「雛
に」＋「食べさせる」である。使役であると判断された
場合には（ステップＳ８１５）、更に「体言」＋「用
言」の間に位置する助詞が「を」もしくは「に」である
か否かの判断を行なう（ステップＳ８２０）。「体言」
＋「用言」の間に位置する助詞が「を」もしくは「に」
であれば、使役の係り受けであると判断し、「体言」＋
「助詞が」＋「用言」の形にし（ステップＳ８２５）、
係り受けとして新たに登録可能としてステップＳ８０５
に移行する。On the other hand, when it is determined that there is a particle other than "no" or "ya" between "hymn" and "hymn" (step S810), "hymn" + "hymn" is used. It is determined whether or not there is the relationship (step S815). Causative and is a clause of the nominal "person (or thing) is",
And "What do" or "to the person (or thing)",
Since it has the property that it is composed of the phrase of the idiom, "make it XX," it is judged from the ending type of the idiom. For example, there can be a sentence such as “a parent bird feeds its chicks with food”. In this case, it is found that the causative dependency in the form of “hymn” + “defective” is “feed” + “feed”. "And" Hina "+" Feed ". When it is determined that the character is a causative character (step S815), it is further determined whether or not the particle located between the "syntax" and the "verb" is "wo" or "ni" (step S820). ). "Word of mouth"
＋ The particle located between "defectives" is "wo" or "ni"
If so, it is judged to be the role of the causative agent, and “hymn” +
In the form of "particles" + "defectives" (step S825),
It is possible to newly register as a dependency, step S805.
Move to

【００５７】ステップＳ８１５で使役でないと判断され
た場合には、次に受け身（受動）でないか否かの判断を
行なう（ステップＳ８３０）。受け身とは、「人（ある
いはもの）が」あるいは「人（あるいはもの）に」＋
「動詞受動型（○○れる、られる）」からなる構文であ
り、例えば「彼に」＋「断わられる」（能動型は「彼
が」＋「断わる」）とか、「扉が」＋「開かれる」（能
動型は「扉を」＋「開く」）といった構文である。係り
受け情報を作成する文節同士が「体言」＋「用言」の関
係にあり、受け身であると判断されると、次に「体言」
＋「用言」の間に存在する助詞が「が」であるか否かの
判断を行なう（ステップＳ８３５）。この判断が「ＹＥ
Ｓ」であれば、「体言」＋「用言」の間の助詞を「を」
に変更して（ステップＳ８４０）、ステップＳ８０５に
移行して、係り受けの登録を可能とする。If it is determined in step S815 that the player is not a player, then it is determined whether or not the player is passive (passive) (step S830). The passive, "a person (or thing) is" or "to the person (or thing)" +
It is a syntax consisting of "verb passive type (○○ reru, uru)", such as "to him" + "rejected" (active type is "he" + "reject") or "door" + "open". It is a syntax such as "is" (active type is "open door" + "open"). If the clauses that create dependency information are in the “hymn” + “definitive” relationship and are judged to be passive, then “hymn”
It is determined whether or not the particle which exists between + and the "verb" is "ga" (step S835). This judgment is "YE
If it is "S", the particle between "hymn" and "definitive" is "wo"
(Step S840), the process proceeds to step S805, and the dependency can be registered.

【００５８】受け身の場合であって「体言」＋「用言」
の間の助詞が「が」でない場合には（ステップＳ８３
５）、次に「体言」＋「用言」の間の助詞が「に」であ
るか否かの判断を行なう（ステップＳ８４５）。受け身
で間の助詞が「に」であると判断されると、「体言」＋
「用言」の間の助詞を「が」に変更して（ステップＳ８
５０）、ステップＳ８０５に移行して、係り受けの登録
を可能とする。"Passive" + "defective" even in the passive case
If the particle between is not "ga" (step S83
5) Next, it is determined whether or not the particle between "syntax" and "defective" is "ni" (step S845). If the passive particle is determined to be "ni", then "hymn" +
Change the particle between "defectives" to "ga" (step S8
50), the process proceeds to step S805, and the dependency can be registered.

【００５９】受け身でもなく、受け身の場合で間の助詞
が「が」でも「に」でもなければ、通常の「体言」＋
「用言」の係り受けであるか、または使役・受動で特定
の助詞を採らない使役・受動（例えば「場所」＋「で」
＋「動詞使役または受動型」など）であると判断し、ス
テップＳ８０５に移行し、係り受けの登録を可能とした
後、本ルーチンを終了する。以上の処理により、「体
言」＋「用言」の係り受けのパターンが解析され、特定
の条件が満たされた場合には、登録が可能とされる。If it is neither passive nor passive, and the interpositional particle is neither "ga" nor "ni", then it is a normal "body word" +
It is a dependency of a "verb" or a causative / passive that does not take a specific particle as a causative / passive (for example, "place" + "de").
+ "Verb causative or passive type"), the process proceeds to step S805, and the dependency can be registered, and then this routine ends. Through the above processing, the dependency pattern of “syntax” + “defective” is analyzed, and if a specific condition is satisfied, registration is possible.

【００６０】図１１に戻って、係り受け情報作成処理ル
ーチンについての説明を続ける。ステップＳ７４０で
「体言」＋「用言」の係り受けのパターン解析を行なっ
た後、解析の対象となった文節の組み合わせが登録可能
であるか否かの判断を行なう（ステップＳ７４５）。図
１２に示したパターン解析により新たな係り受けとして
登録可能であるとされた場合には、登録可能としてステ
ップＳ７８０に進み、係り受け情報の作成を行なう。即
ち、例えば受け身として登録可能とされた場合には、
「餌を」＋「食べさせる」の例では、読み「たべえ
さ」，係り語「餌」，受け語「食べる」，付属語「が」
という情報を作成する。また、例えば受け身として登録
可能とされた場合には、「扉が」＋「開かれる」であれ
ば、能動型である「扉を」＋「開ける」として登録用の
情報を作るのである。この例では、読み「あけとび
ら」，受け語「開ける」，係り語「扉」，付属語「を」
という情報が作られる。更に、「彼に」＋「断わられ
る」であれば、能動型である「彼が」＋「断わる」とし
て登録用の情報を作るのである。この例では、読み「こ
とわかれ」，受け語「断わる」，係り語「彼」，付属語
「が」という情報が作られる。Returning to FIG. 11, the description of the dependency information creation processing routine will be continued. In step S740, after the pattern analysis of the dependency of “syntax” + “defective” is performed, it is determined whether or not the combination of clauses to be analyzed can be registered (step S745). When it is determined by the pattern analysis shown in FIG. 12 that a new dependency can be registered, it is determined that registration is possible and the process proceeds to step S780 to generate dependency information. That is, for example, when it is possible to register as passive,
In the example of "feeding" + "feeding", the reading is "eating", the related word "bait", the receiving word "eat", the adjunct word "ga".
Create the information. In addition, for example, when it is possible to register as passive, if the "door" + "open", the information for registration is created as "active door" + "open". In this example, the reading "Aketobira", the adjective "Open", the related word "Door", the adjunct "Wo"
Information is created. Further, if "to him" + "will be refused", active information "he will" + "will be refused" will be used to create information for registration. In this example, information such as the reading “kotoware”, the received word “decline”, the related word “he”, and the adjunct word “ga” is created.

【００６１】次に係り受けが成立するとされた係り語と
受け語の文節の関係が、係り受け関係があり得るとして
用意した５つの類型のうち、「体言」＋「用言」以外の
４つのケースのいずれかに属すると判断された場合の処
理について説明する。「用言」＋「用言」であると判断
された場合には（ステップＳ７２０）、係り語が連用形
であるか否か判断し（ステップＳ７５０）、連用形であ
れば格を連用修飾として（ステップＳ７５５）、ステッ
プＳ７８０に移行し、係り受け情報を作成する。「用
言」＋「用言」であると判断されても、係り語が連用形
でないと判断された場合には、何も行なわず「ＥＮＤ」
に抜けて、本ルーチンを終了する。Next, the relationship between the dependent word and the clause of the dependent word, which is determined to have the dependent relationship, can be selected from among the five types prepared as the possible dependent relationship. A process performed when it is determined that the case belongs to any one of the cases will be described. When it is determined that “synonym” + “synonym” (step S720), it is determined whether or not the related word is a related form (step S750). (S755), the process proceeds to step S780, and dependency information is created. Even if it is determined that it is “synonym” + “synonym”, if it is determined that the related word is not the plural form, “END” is not performed.
To exit this routine.

【００６２】また、「体言」＋「体言」であると判断さ
れた場合は（ステップＳ７２５）、「体言」＋「体言」
の間に助詞「の」や「と」や「や」が存在するか否かの
判断を行なう（ステップＳ７６０）。「体言」と「体
言」との間にこれらの助詞が入る場合には、並列関係
（ＡとＢ、ＡやＢ）や、限定関係（ＡのＢ）を示すもの
とみなすことができ、これらの関係での係り受けの情報
を、同様に作成する（ステップＳ７８０）。他方、「体
言」と「体言」の間にこれらの助詞が入っていない場合
には、何も行なわず「ＥＮＤ」に抜けて、本ルーチンを
終了する。If it is determined that "syntax" + "syntax" (step S725), "syntax" + "syntax".
It is determined whether or not there is a particle "no", "to", or "ya" in between (step S760). When these particles are put between "syntax" and "syntax", it can be regarded as showing a parallel relationship (A and B, A or B) or a limited relationship (A and B). Similarly, dependency information regarding the relationship is created (step S780). On the other hand, when these particles are not included between the "syntax" and the "syntax", nothing is done and the process goes to "END" to end this routine.

【００６３】同様に、「連体形」＋「体言」であると判
断された場合や（ステップＳ７３０）、「連体詞」＋
「体言」であると判断された場合には（ステップＳ７３
５）、係り受けの関係成立として、連体修飾型として係
り受けの情報を作成する（ステップＳ７８０）。「連体
形」＋「体言」は、係り語が用言連体形であり、受け語
として体言が見出されている場合であって、例えば、
「連体形」＋「体言」の係り受け「美しい」＋「花」に
おいて次候補「うつくしい」が選択された場合の「うつ
くしい」＋「花」や、係り受け「走る」＋「車両」にお
いて次候補「車輌」が選択された場合の「走る」＋「車
輌」などである。また、「連体詞」＋「体言」とは、例
えば、「ある」＋「女」とか、「あの」＋「人」などで
あり、次候補「或る」が選択された場合の「或る」＋
「女」などである。これらの場合には、連体修飾型とし
て、係り受け情報が作成される。即ち、それぞれ係り受
け情報として、読み「うくしはな」，受け語「うつくし
い」，係り語「花」や、読み「はししゃりょう」，受け
語「走る」，係り語「車輌」が作られるのである。この
結果、「連体形」＋「体言」の場合には、「車輌」＋
「走る」＋助詞「が」などの形式で、つまり「体言」＋
「用言」の場合の係り受け情報と同じ形式で係り受け情
報が作られ、記憶される。係り受け辞書９８は、体言と
用言の係り受けとしては、係り語として体言を、受け語
として用言をとるよう統一されているのである。Similarly, when it is judged that the expression is "union form" + "noun" (step S730), "union word" +
When it is determined that the word is a "word of speech" (step S73)
5) As the relationship of dependency is established, dependency information is created as a relation modification type (step S780). “Union form” + “composition” is a case where the related word is a noun conjunction form and a compound word is found as a word, for example,
Dependency of "union form" + "hymn""Beautiful" + "Flower" when the next candidate "Dark" is selected "Dark" + "Flower" and dependency "Running" + "Vehicle" For example, “run” + “vehicle” when the candidate “vehicle” is selected. In addition, “adjunct” + “give” is, for example, “aru” + “woman” or “that” + “person”, and “aru” when the next candidate “aru” is selected. +
For example, "woman". In these cases, dependency information is created as the adnominal modification type. In other words, as the dependency information, the reading "Ukuhana", the receiving word "beautiful", the reading word "flower", the reading "Hashishayo", the receiving word "run", and the moving word "vehicle" are created. Be done. As a result, in the case of "union form" + "noun", "vehicle" +
In the form of "run" + particle "ga", that is, "body language" +
Dependency information is created and stored in the same format as the modification information in the case of a "verb". The dependency dictionary 98 is unified so that the dependency of the synonym and the idiom is the idiom and the idiom.

【００６４】入力された仮名文字列が文節分かち書きさ
れた後で、以上説明した図１０ないし図１２に示した処
理がなされると、係り受けが成立すると判断されて第１
候補の単語が選択・表示された状態でその単語の第２候
補以下の候補が選択されるた場合、選択された次候補に
ついて、新たに係り受けの情報が作成され（ステップＳ
７８０）、これが係り受け辞書９８に登録される。従っ
て、係り受け辞書を９８を徒に大きくする必要がなく、
かつこの仮名漢字変換装置を使用する人の癖や用法に応
じた係り受けのパターンを記憶して、仮名漢字変換によ
り所望の文字列を得られる可能性を高くすることができ
る。また、使役や受け身なども原形（この例では能動
形）に帰って学習しておくので、いずれの構文にも適用
することができる。When the processing shown in FIGS. 10 to 12 described above is performed after the input kana character string has been segmented and written, it is determined that the dependency is established, and the first
When the second and subsequent candidates of the word are selected while the candidate word is selected and displayed, new dependency information is created for the selected next candidate (step S
780), which is registered in the dependency dictionary 98. Therefore, it is not necessary to enlarge the dependency dictionary to 98,
In addition, it is possible to increase the possibility that a desired character string can be obtained by Kana-Kanji conversion by storing a dependency pattern according to the habit and usage of a person who uses this Kana-Kanji conversion device. In addition, causatives and passives can be applied to any syntax because they are learned by returning to the original form (active form in this example).

【００６５】また、本実施例では、係り受けの検定を、
係り受けの検定範囲とされた範囲内で文末に近い位置か
ら開始し、かつ係り語を優先として行なっている。この
構成は、係り受けによる単語候補の確定をより所望のも
のとする点で極めて有効であった。これは、日本語では
文末の述部側が文の意味を担っていることが多いこと、
および行為（一般に文末側の述部が記述）が同じで主体
（一般により文頭側の記述）が代わることよりも、主体
の行為が変化する場合が多いことからではないかと考え
られる。In the present embodiment, the dependency test is
It starts from a position close to the end of the sentence within the range of the dependency verification range and gives priority to the dependency word. This configuration was extremely effective in making the determination of word candidates by modification more desirable. This is because in Japanese, the predicate side at the end of a sentence often carries the meaning of the sentence.
It is considered that the subject's actions often change rather than the same subject (generally described by the predicate on the sentence end side) and the same subject (generally the sentence description on the sentence side) changes.

【００６６】また、係り受けが一旦成立したと判断され
た場合には、その受け語から係り語までの範囲を係り受
け成立範囲として管理するので、係り受けの範囲が交差
することがない。また、２以上の受け語が一つの係り語
を受けるという判断をすることもない。また、係り受け
の成立を隣接する文節を越えて判断するので（ｎ≧２の
場合）、副詞などによる修飾が係り受け関係の間に入っ
ても係り受けの検定を正しく行なうことができる。従っ
て、複数の係り受けが成立する場合には、図１３（Ａ）
に示すように、独立した係り受けが別個に成立する組合
わせか、図１３（Ｂ）に示すように、一つの受け語が２
以上の係り語を受ける組合わせか、図１３（Ｃ）に示す
ように、一つの係り受けを跨ぐようにもう一つの係り受
けが成立する組合わせが許されることになる。When it is determined that the dependency is established, the range from the received word to the dependency is managed as the dependency establishment range, so that the dependency range does not intersect. Moreover, it is not judged that two or more dialects receive one related word. Moreover, since the establishment of the dependency is judged across the adjacent clauses (when n ≧ 2), the dependency test can be correctly performed even when the modifier such as an adverb enters between the dependencies. Therefore, when a plurality of dependencies are established, FIG. 13 (A)
As shown in FIG. 13, a combination in which independent dependencies are separately established, or as shown in FIG.
A combination that receives the above-mentioned dependency words or a combination that allows another dependency to be established so as to straddle one dependency, as shown in FIG. 13C, is permitted.

【００６７】次に本発明第２実施例について説明する。
第２実施例では、まず第１実施例における図６の処理、
即ち係り受けの検定処理ルーチンにおいて、係り受け辞
書９８を検索して（ステップＳ４２０）係り受けがある
か否かを判定する際（ステップＳ４３０）、具体的な係
り受けの関係を係り受け辞書９８に記憶しておくのでは
なく、図１４に示すように、通常の自立語辞書５８また
は係り受け辞書９８に意味分類品詞の情報を記憶してお
き、これを用いて係り受けの判定を行なうのである。例
えば、第１実施例では、「課長に」＋「言う」や「部長
に」＋「言う」や「社長に」＋「言う」などを、個別の
係り受け情報として係り受け辞書９８に記憶しておき、
個別に係り受けの成立を検定していた。これに対して、
第２実施例では、「課長」「部長」「社長」などの自立
語辞書に、意味分類品詞という新たな品詞を通常の品詞
情報の他に付け加えており、「課長」などには、「地位
／役職」を示す意味品詞「地位」が付与されている。ま
た、係り受け辞書９８には、個別の係り受け情報ではな
く、「体言（地位）」＋「言う」＋助詞「に」といった
情報が記憶されている。この例では、読みは「いＥ
４」，受け語は「言う」，係り語は「地位」である。Next, a second embodiment of the present invention will be described.
In the second embodiment, first, the processing of FIG. 6 in the first embodiment,
That is, in the dependency verification processing routine, when the dependency dictionary 98 is searched (step S420) to determine whether or not there is a dependency (step S430), a specific dependency relationship is set in the dependency dictionary 98. Instead of storing it, as shown in FIG. 14, the information of the meaning classification part-of-speech is stored in the normal independent word dictionary 58 or the dependency dictionary 98, and the dependency is determined using this. . For example, in the first embodiment, “to section manager” + “say”, “to manager” + “say”, “to president” + “say”, etc. are stored in the dependency dictionary 98 as individual dependency information. Aside
The success of the dependency was verified individually. On the contrary,
In the second example, a new part-of-speech called the meaning class-of-speech is added to the independent word dictionary such as “section manager”, “department manager”, “president” in addition to the normal part-of-speech information. The meaning part of speech "position" indicating "/ position" is added. In addition, the dependency dictionary 98 stores information such as “composition (position)” + “say” + particle “ni” instead of individual dependency information. In this example, the reading is
4 ”, the receiving word is“ say ”, and the related word is“ status ”.

【００６８】文節分かち書きされた後で、各文節の文法
情報を調べ、文節を構成する自立語の意味品詞を読み取
っておく。次に係り受けの検定（図６）を行なうとき、
この意味品詞を用いて係り受け辞書を検索する（ステッ
プＳ４３０）。係り受け辞書９８に、「地位」＋「言
う」に該当する係り受けが見出されるから、この場合係
り受け成立とし、これを文節を構成する単語の第１候補
として表示する。また、この状態で単語の次候補が選択
された場合には、その次候補が第１候補と同じ意味分類
を付与できる単語であるかを、係り受け情報の作成処理
（図１１ステップＳ７００）において判断する。例え
ば、第２候補の通常の品詞が同一であれば、第１候補と
同じ意味分類を付与できると判断する。第１候補が「課
長」である場合に、次候補の中から「科長」や「家長」
が選ばれると、これらの候補の品詞は「普通名詞」で同
一なので、新たに「科長」や「家長」の意味品詞を「地
位」とするのである。After segmenting the bunsetsu, the grammatical information of each bunsetsu is examined to read the meaning part-of-speech of the independent word forming the bunsetsu. Next, when performing the dependency test (Fig. 6),
The dependency dictionary is searched using this meaning part-of-speech (step S430). In the dependency dictionary 98, a dependency corresponding to “status” + “say” is found, so in this case, the dependency is established, and this is displayed as the first candidate of the word forming the phrase. In addition, when the next candidate of the word is selected in this state, whether the next candidate is a word to which the same semantic classification as the first candidate can be given is determined in the dependency information creation process (step S700 in FIG. 11). to decide. For example, if the second candidate has the same normal part of speech, it is determined that the same semantic classification as that of the first candidate can be given. If the first candidate is the "manager", then the "chief" or "head" from the next candidates
When is selected, the candidate parts of speech are the same as "common nouns", and the meaning parts of speech of "general chief" and "family head" are newly set as "position".

【００６９】この実施例によれば、係り受けの関係を個
別に記憶しておく必要がなく、係り受け辞書を極めて小
さくすることとができる。また、新たに係り受けの関係
を追加する場合には、その単語の意味品詞を追加するだ
けですむので、係り受け辞書を改変する必要がない。従
って、一旦表示された文節の単語候補の次候補が選択さ
れた場合の係り受けの学習も、自立語辞書５８の意味品
詞を書き直すだけで済む。なお、係り受けを検定する
際、意味品詞を用いた係り受けのみを対象としてもよい
が、第１実施例で説明した個別の係り受けと、意味品詞
を用いた係り受けの両方について検定するものとしても
差し支えない。According to this embodiment, it is not necessary to individually store the dependency relation, and the dependency dictionary can be made extremely small. In addition, when a new dependency relation is added, it is only necessary to add the meaning part of speech of the word, so there is no need to modify the dependency dictionary. Therefore, the learning of dependency when the next candidate of the word candidates of the phrase once displayed is selected only by rewriting the meaning part of speech in the independent word dictionary 58. It should be noted that when the dependency is tested, only the dependency using the semantic part-of-speech may be the target, but both the individual dependency described in the first embodiment and the dependency using the semantic part-of-speech are tested. It doesn't matter.

【００７０】以上本発明の実施例について説明したが、
本発明はこうした実施例に何等限定されるものではな
く、例えば最小コスト法に代えて２文節最長一致法など
の他の文節分かち書きの手法を用いた構成、係り受け検
定における係り語優先か受け語優先かを切り替えられる
構成、係り受けの関係を「体言」＋「用言」など一つな
いし複数個に限定した構成など、本発明の要旨を逸脱し
ない範囲内において、種々なる態様で実施し得ることは
勿論である。The embodiments of the present invention have been described above.
The present invention is not limited to such an embodiment. For example, instead of the minimum cost method, a configuration using another bunsetsu segmentation method such as the two-bunsetsu longest matching method, and the dependent word priority or the dependent word in the modification check can be used. The present invention can be implemented in various modes within the scope not departing from the gist of the present invention, such as a configuration in which priority can be switched, a configuration in which the relationship of dependency is limited to one or more such as “hymn” + “verb”. Of course.

【００７１】[0071]

【発明の効果】以上説明したように本発明の請求項１の
仮名漢字変換装置および請求項６の仮名漢字変換方法で
は、文法辞書を参照して、入力された仮名文字列を文節
分かち書きし、分かち書きされた各文節において、文節
を構成する単語を変換済みの形態で示す。この状態で、
分かち書きされた一つの文節を起点とし、所定の文節同
士の係り受けの情報を記憶した係り受け情報辞書を参照
して、他の文節との係り受けの存在を検定する。この係
り受けの検定により係り受け関係の成立が検定された文
節の少なくとも一方の文節において、候補単語表示手段
により表示された単語とは異なる単語が選択されると、
この選択された単語を、係り受け情報として学習する。
従って、新たな係り受けの関係を学習していくことがで
きるという優れた効果を奏する。As described above, in the kana-kanji conversion device according to claim 1 and the kana-kanji conversion method according to claim 6 of the present invention, the input kana character string is segmented into phrases by referring to the grammar dictionary, In each punctuation phrase, the words that make up the punctuation phrase are shown in a converted form. In this state,
The existence of a dependency with another bunsetsu is verified by referring to a dependency information dictionary in which the information of the dependency of a predetermined bunsetsu is stored, with one punctuation written as a starting point. When a word different from the word displayed by the candidate word display means is selected in at least one of the clauses for which the establishment of the dependency relationship has been verified by this dependency test,
The selected word is learned as dependency information.
Therefore, there is an excellent effect that it is possible to learn a new dependency relationship.

【００７２】請求項２記載の仮名漢字変換装置は、新た
な係り受けの学習が、候補単語表示手段により表示され
た単語と、この単語に代えて選択された単語との文法上
の性質が同じであるときのみに行なわれるので、検定さ
れた係り受けとは関係のない単語による文節の関係を係
り受けとして学習することがないという効果を奏する。In the kana-kanji conversion device according to the second aspect, the learning of the new dependency has the same grammatical property as the word displayed by the candidate word display means and the word selected in place of this word. Since it is carried out only when, the effect of not learning the relation of the bunsetsu, which is not related to the tested dependency, as the dependency.

【００７３】請求項３記載の仮名漢字変換装置は、係り
受け関係を有する文節を構成し得る単語について、該単
語のカテゴリを記憶しておき、単語のカテゴリが同一の
時、文法上の性質が同一と判断する手段を備えるから、
単語に付与されたカテゴリにより文法上の性質が同一か
どうかの判断を容易に行なうことができる。尚、このカ
テゴリの情報は文法辞書内に持たせることも可能であ
る。The kana-kanji conversion device according to claim 3 stores the category of a word that can form a phrase having a dependency relation, and when the category of the word is the same, the grammatical property is the same. Since it has a means to judge that they are the same,
It is possible to easily judge whether or not the grammatical properties are the same depending on the category given to the word. Information of this category can be held in the grammar dictionary.

【００７４】請求項４の仮名漢字変換装置は、係り受け
の関係を、体言＋用言、用言＋用言、体言＋体言、連体
形＋体言、連体詞＋体言の少なくとも一つについて検定
するので、基本的な係り受けの関係について、確実に検
定することができる。Since the kana-kanji conversion device according to claim 4 verifies the dependency relation for at least one of the synonym + synonym, the synonym + synonym, the synonym + synamism, the adnominal form + synomonic, and the adjunct + synomical. , It is possible to reliably test the basic dependency relationship.

【００７５】請求項５記載の仮名漢字変換装置は、体言
＋用言の係り受けパターンの解析時に、使役または受動
の関係を用いた係り受けの検定を行ない、係り受けの成
立が検定されたとき、使役または受動の関係を原形に戻
して係り受け関係を学習するので、使役または受動の関
係について別途係り受け情報を記憶したり、係り受けの
学習をする必要がないという優れた効果を奏する。In the kana-kanji conversion device according to the fifth aspect, when the dependency pattern of the phrase + noun is analyzed, the dependency check using the causative or passive relation is performed, and the success of the dependency is verified. Since the causative or passive relationship is returned to the original form and the dependency relationship is learned, there is an excellent effect that it is not necessary to separately store the dependency information about the causative or passive relationship or to learn the dependency.

[Brief description of drawings]

【図１】本発明の一実施例である仮名漢字変換装置にお
ける仮名漢字変換機能の実現形態を示す機能ブロック図
である。FIG. 1 is a functional block diagram showing an implementation form of a kana-kanji conversion function in a kana-kanji conversion device according to an embodiment of the present invention.

【図２】実施例としての仮名漢字変換装置が実現される
ハードウェアを示すブロック図である。FIG. 2 is a block diagram showing hardware for realizing a kana-kanji conversion device as an example.

【図３】文節分かち書き部１０２において実行される文
節分かち書き処理を示すフローチャートである。FIG. 3 is a flowchart showing a phrase segmentation writing process executed in a segmentation segment writing unit 102.

【図４】実施例における最小コスト法による文節分かち
書きの一例を示す説明図である。FIG. 4 is an explanatory diagram showing an example of phrase segmentation according to the minimum cost method in the embodiment.

【図５】実施例における係り受け検定の対象となる例文
を示す説明図である。FIG. 5 is an explanatory diagram showing an example sentence that is a target of the dependency check in the embodiment.

【図６】実施例における係り受け検定処理ルーチンを示
すフローチャートである。FIG. 6 is a flowchart showing a dependency verification processing routine in the embodiment.

【図７】実施例における係り受け辞書の一例を示す説明
図である。FIG. 7 is an explanatory diagram showing an example of a dependency dictionary according to the embodiment.

【図８】複数文節における係り受けの成立の様子を示す
説明図である。FIG. 8 is an explanatory diagram showing how dependency is established in a plurality of clauses.

【図９】係り受けの検定における優先順位を示す説明図
である。FIG. 9 is an explanatory diagram showing a priority order in a modification check.

【図１０】係り受け学習処理ルーチンを示すフローチャ
ートである。FIG. 10 is a flowchart showing a dependency learning processing routine.

【図１１】係り受け情報作成処理ルーチンを示すフロー
チャートである。FIG. 11 is a flowchart showing a modification information creation processing routine.

【図１２】「体言」＋「用言」の係り受けパターン解析
処理ルーチンを示すフローチャートである。FIG. 12 is a flowchart showing a dependency pattern analysis processing routine of “syntax” + “defective”.

【図１３】係り受けが成立する文節の関係を示した説明
図である。FIG. 13 is an explanatory diagram showing a relation of clauses for which dependency is established.

【図１４】第２実施例にかける意味品詞を備えた辞書の
一例を示す説明図である。FIG. 14 is an explanatory diagram showing an example of a dictionary including meaning parts of speech according to the second embodiment.

[Explanation of symbols]

２１…ＣＰＵ２２…ＲＯＭ２３…ＲＡＭ２４…キーボード２５…キーボードインタフェース２６…ＣＲＴ２７…ＣＲＴＣ２８…プリンタ２９…プリンタインタフェース３０…ハードディスクコントローラ（ＨＤＣ）３１…バス３２…ハードディスク４０…文字入力部４２…変換制御部４４…変換後文字列出力部５０…文字列入力部５２…文字格納部５４…自立語候補作成部５６…自立語解析位置管理部５８…自立語辞書６４…付属語候補作成部６６…付属語解析位置管理部６８…付属語辞書７０…係り受け学習部７０…学習部７２…自立語学習部７４…補助語学習部７６…接辞学習部７８…文字変換学習部８０…単語データ作成部８２…接続検定部８４…接続検定テーブル８６…コスト計算部９０…係り受け候補調整部９２…受動解析部９４…助詞許容解析部９６…係り受け範囲管理部９８…係り受け辞書９９…係り受け転置情報調整部１００…単語データ格納部１０２…文節分かち書き部１０４…係り受け転置情報調整部１０６…文節データ格納部１０８…変換文字列出力部 21 ... CPU 22 ... ROM 23 ... RAM 24 ... Keyboard 25 ... Keyboard interface 26 ... CRT 27 ... CRTC 28 ... Printer 29 ... Printer interface 30 ... Hard disk controller (HDC) 31 ... Bus 32 ... Hard disk 40 ... Character input section 42 ... Conversion Control unit 44 ... Converted character string output unit 50 ... Character string input unit 52 ... Character storage unit 54 ... Independent word candidate creation unit 56 ... Independent word analysis position management unit 58 ... Independent word dictionary 64 ... Adjunct word creation unit 66 ... Adjunct analysis position management unit 68 ... Adjunct dictionary 70 ... Dependency learning unit 70 ... Learning unit 72 ... Independent word learning unit 74 ... Auxiliary word learning unit 76 ... Affix learning unit 78 ... Character conversion learning unit 80 ... Word data creation unit 82 ... Connection verification unit 84 ... Connection verification table 86 ... Cost calculation unit 90 ... Dependency candidate adjustment 92 ... Passive analysis unit 94 ... Particle admissible analysis unit 96 ... Dependency range management unit 98 ... Dependency dictionary 99 ... Dependency transposition information adjustment unit 100 ... Word data storage unit 102 ... Phrase segmentation writing unit 104 ... Dependency transposition information adjustment unit 106 ... clause data storage section 108 ... converted character string output section

Claims

[Claims]

1. A kana-kanji conversion device for inputting a kana character string and referring to a grammar dictionary to generate a kana-kanji mixed character string candidate, wherein the inputted kana character string is referred to the grammar dictionary. And a punctuation means for puncturing each bunsetsu, a candidate word display means for showing the words constituting each bunsetsu in a converted form in each punctuated bunsetsu, and a dependency storing information on the dependency of a predetermined bunsetsu Starting from the information dictionary and one of the separated phrases, referring to the dependency information dictionary, the existence of a dependency with another phrase is detected.
When a word different from the word displayed by the candidate word display means is selected in at least one of the dependency verification means for testing and the clause for which the establishment of the modification relationship is verified by the modification test A kana-kanji conversion device comprising a dependency learning means for learning the selected word as dependency information.

2. The kana-kanji conversion device according to claim 1, wherein the dependency learning means selects a word different from the word displayed by the candidate word display means, and the dependency relationship of the word is changed. The judgment means for judging whether or not the grammatical property is the same as the word when the establishment of is verified, and the learning is performed only when the deciding means judges that the grammatical property is the same. A kana-kanji conversion device consisting of learning and performing means.

3. The kana-kanji conversion device according to claim 2, wherein the determination unit stores a category of words that can form a clause having a dependency relationship, and the both words. Kana-to-Kanji conversion device provided with means for judging that the grammatical properties are the same when the categories are the same.

4. The modification check means is a means for testing the relationship of modification for at least one of a synonym + synonym, a synonym + synonym, a synonym + hymn, an adnominal form + hymn, and a conjunction + hymn. The kana-kanji conversion device according to claim 1.

5. The kana-kanji conversion device according to claim 4, wherein the dependency check means performs a dependency check using a causative or passive relationship at the time of analyzing a dependency pattern of a noun + a noun. The kana including the first means for performing, and the learning means, when the first means determines that the dependency is established, restores the causative or passive relationship to the original form and learns the dependency. Kanji conversion device.

6. A kana-kanji conversion method for inputting a kana character string and referring to a grammar dictionary to generate a kana-kanji mixed character string candidate, wherein the inputted kana character string is referred to the grammar dictionary. Then, the words constituting the bunsetsu are shown in a converted form in each of the punctured bunsetsus, and the syllabus is referred to by referring to the modification information dictionary that stores the modification information of the predetermined syllabaries. The presence of a dependency with another bunsetsu is verified from one of the bunsetsus that have been set as the starting point, and at least one of the bunsetsus for which the establishment of the dependency relationship is verified by the verification of the bunsetsu, is displayed as described above. A kana-kanji conversion method in which when a word different from a word is selected, the selected word is learned as dependency information.