JPH0969104A

JPH0969104A - Device and method for kana-kanji conversion

Info

Publication number: JPH0969104A
Application number: JP7248664A
Authority: JP
Inventors: Yoshiharu Suzuki; 芳春鈴木; Kazunori Shimoda; 和紀下田
Original assignee: EE I SOFT KK
Current assignee: EE I SOFT KK
Priority date: 1995-08-31
Filing date: 1995-08-31
Publication date: 1997-03-11

Abstract

PROBLEM TO BE SOLVED: To simplify conversion processing step in a KANA (Japanese syllabary)/KANJI (Chinese character) converter and to reduce the size of dictionary. SOLUTION: In the case of executing KANA/KANJI conversion processing by referring to a dictionary, word retrieval is executed by using a previously determined representative notation for words having various derived notations shown in the table. Information related to the representative notation is prepared as grammatical information and grammatical check whether the representative notation is formed as a clause or not is executed. When a character string candidate is displayed as a result of clause separated writing processing, the representative notation is rewritten to a display notation to be one of derived notations and displayed. Consequently, processing such as the grammatical check of a derived notation other than the representative notation in the clause sepatated writing processing can be removed and it is unnecessary to prepare the grammatical information of derived notations other than the representative notation.

Description

Detailed Description of the Invention

【０００１】[0001]

【発明の属する技術分野】本発明は、仮名漢字変換装置
および仮名漢字変換方法に関し、詳しくは入力した文字
列から辞書を参照して仮名漢字混じり文字列候補を生成
する仮名漢字変換装置および仮名漢字変換方法に関す
る。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a kana-kanji conversion device and kana-kanji conversion method, and more specifically, a kana-kanji conversion device and kana-kanji conversion device for generating a kana-kanji mixed character string candidate by referring to a dictionary from an input character string. Regarding the conversion method.

【０００２】[0002]

【従来の技術】従来、入力された仮名文字列を、所望の
仮名漢字混じり文字列に変換するための仮名漢字変換装
置が種々提案されている。特に最近では、仮名漢字変換
を行なう際に、使用者が仮名の入力にあわせて単語や文
節などの区切りをいちいち指定することなく、仮名べた
書きの文字列を入力すれば所望の仮名漢字混じり文字列
が得られるよう、様々な工夫がなされている。2. Description of the Related Art Hitherto, various kana-kanji conversion devices have been proposed for converting an inputted kana-character string into a desired kana-kanji mixed character string. In particular, recently, when performing Kana-Kanji conversion, if the user does not specify the delimiter of words or phrases in accordance with the Kana input, and if the Kana-Kana character string is entered, the desired Kana-Kanji mixed characters can be entered. Various efforts have been made to obtain rows.

【０００３】仮名べた書きの文字列を正しく文節分かち
書きするための処理としては、２文節を基本単位として
解析し、成り立ち得る文節の中で最長の文節が得られる
２文節を第１候補とする２文節最長一致法や、文節を構
成する単語の候補となり得る単語および単語同士の組合
わせにコストを付け、その点数の合計が最小となる文節
を第１候補とする最小コスト法などがある。また、日本
語には数多くの同音異義語が存在するが、これら同音異
義語の中から正しい単語を選択するために、所定の単語
同士の係り受けの情報を記憶した係り受け情報辞書を備
え、係り受け情報に該当する単語を備えた文節を検索す
る方法が提案されている。例えば、「本を著わす」の
「本」と「著わす」の関係を係り受けの関係として登録
しておき、「本を表わす」や「本を現わす」などの組合
わせに優先して、「本を著わす」を候補とすることによ
り、不適切な変換を回避しようとするのである。As a process for correctly writing a kana solid character string into bunsetsus, two bunsetsus are analyzed as a basic unit, and two bunsetsus that can obtain the longest bunsetsu are set as first candidates. There is a longest phrase matching method, a minimum cost method in which a word that can be a candidate of a word constituting a phrase and a combination of words are given a cost, and the phrase having the smallest total score is the first candidate. In addition, there are many homonyms in Japanese, but in order to select the correct word from these homonyms, it has a dependency information dictionary that stores information on the dependency of certain words, There has been proposed a method of searching for a phrase including a word corresponding to dependency information. For example, the relationship between "book" and "book" of "book author" is registered as a dependency relationship, and priority is given to combinations such as "represent book" and "represent book". , "Authoring a book" is a candidate to avoid inappropriate conversion.

【０００４】さらに、所望の仮名漢字混じり文字列を得
やすくする工夫として、同音異義語の中から直前に使用
者が選択した単語を最優先で次回の候補とする同音異義
語の学習や、使用者が指定した文節区切りを記憶して、
これを優先的に使用する文節区切り学習などの学習処理
が知られている。Further, as a device for making it easier to obtain a desired kana-kanji mixed character string, learning or use of a homonym word in which a word selected by the user immediately before the homonym word is given the highest priority as the next candidate. Memorize the phrase break that the person specified,
There is known a learning process such as phrase segmentation learning that preferentially uses this.

【０００５】[0005]

【発明が解決しようとする課題】しかしながら、日本語
には当て字や送りがなで代表されるように複数の表記が
許容された単語が数多く存在する。例えば、「取り扱
い」「取扱い」「取扱」のように送りがなの送り方によ
る表記の違いを始め、「バイオリン」と「ヴァイオリ
ン」、「バッファ」と「バッフア」と「バッファー」、
「１万円」と「壱萬圓」などである。このように複数の
表記が許容されることを「表記の揺れ」あるいは派生表
記と呼ぶ。このような表記の揺れがある場合、所望の表
記を選択するには、文字列の変換の際に多数の変換候補
の中から所望の表記を選択するか、あるいは送りがなに
よる表記の揺れの場合には送りがなの送り方を予めプロ
グラムにおいて設定しておく必要がある。However, there are many words in Japanese, for which a plurality of notations are allowed, as represented by Japanese characters and syllabary. For example, beginning with the difference in the notation depending on the way the feed is sent, such as "Handling", "Handling", "Handling", "Violin" and "Violin", "Buffer" and "Buffer" and "Buffer",
Examples are "10,000 yen" and "Ichiman". The fact that plural notations are allowed in this way is referred to as "notation fluctuation" or derivative notation. When there is such fluctuation of the notation, to select the desired notation, select the desired notation from a large number of conversion candidates when converting the character string, or It is necessary to set in advance the method of sending the feed.

【０００６】さらにこのような表記の揺れは、文節分か
ち書き処理にとって大きな負担となっている。例えば、
「とりあつかい」という文字列の場合、「取り扱い」
「取扱い」「取扱」はそれぞれ許容された表記であるた
め、文節分かち書き処理を行なうときには、これらすべ
てを対象にして文法的な妥当性などを調べなければなら
ず、これら許容された表記のそれぞれについての情報を
辞書に用意しておかなくてはならない。Further, such fluctuations in the notation impose a heavy burden on the phrase segmentation writing process. For example,
In the case of the character string "Toriokai", "Handling"
Since "Handling" and "Handling" are allowed notations, when performing phrase segmentation processing, it is necessary to check the grammatical validity etc. for all of these, and for each of these allowed notations You must prepare the information in the dictionary.

【０００７】本発明の仮名漢字変換装置および仮名漢字
変換方法は、こうした問題を解決し、辞書を縮小すると
共に文節分かち書き処理のステップを軽減して、派生表
記を有する語を正しく文節分かち書きすることを目的と
してなされ、次の構成を採った。The kana-kanji conversion device and the kana-kanji conversion method of the present invention solve these problems, reduce the size of the dictionary, reduce the steps of the segmentation and segmentation processing, and correctly segment the segmented words. It was made for the purpose and adopted the following composition.

【０００８】[0008]

【課題を解決するための手段およびその作用・効果】本
発明の第１の仮名漢字変換装置は、仮名文字列を入力
し、辞書を参照して、該入力された仮名文字列を文節分
かち書きし、仮名漢字混じり文字列候補を生成する仮名
漢字変換装置であって、単語の読みと一つの代表表記と
を構成要素として格納した単語辞書と、複数の表記が許
容された単語について、該許容された表記を前記代表表
記に関連づけて記憶した派生表記辞書と、前記入力され
た仮名文字列に対し、前記単語辞書を参照し、該代表表
記を用いて文節分かち書き処理を行なう文節分かち書き
手段と、該文節分かち書き処理により得られた文節を構
成する単語について、前記許容された他の表記が存在す
る場合、所定の表記を表示用表記として決定する単語表
記選択手段と、該決定された表示用表記で表わされた単
語を、前記仮名漢字混じり文字列候補を構成する単語の
第１候補として表示する表示手段とを備えたことを要旨
とする。[Means for Solving the Problem and Its Action / Effect] The first kana-kanji conversion device of the present invention inputs a kana character string, refers to a dictionary, and writes the input kana character string into phrase segments. A kana-kanji conversion device that generates a kana-kanji mixed character string candidate, wherein a word dictionary that stores word readings and one representative notation as constituent elements, and a word that allows a plurality of notations A derived notation dictionary that stores the notation associated with the representative notation, and a phrase segmentation and writing means that performs segmentation and segmentation processing using the representative notation with respect to the input kana character string by referring to the word dictionary. A word notation selecting means for deciding a predetermined notation as a display notation when the other notations allowed for the words constituting the phrase obtained by the phrase segmentation processing exist, and the decision Word represented in the display representation that is, the gist that a display means for displaying the first candidate words constituting the kana and kanji character string candidate.

【０００９】この仮名漢字変換装置に対応する仮名漢字
変換方法は、仮名文字列を入力し、辞書を参照して、該
入力された仮名文字列を文節分かち書きし、仮名漢字混
じり文字列候補を生成する仮名漢字変換方法であって、
前記入力された仮名文字列に対して文節分かち書き処理
を行なうとき、単語の読みと一つの代表表記とを構成要
素として格納した単語辞書を参照して、該代表表記を用
いて文節分かち書き処理を行ない、該文節分かち書き処
理により得られた文節を構成する単語が複数の表記が許
容された単語である場合、所定の表記を表示用表記とし
て決定し、該表示用表記で表わされた単語を前記仮名漢
字混じり文字列候補を構成する単語の第１候補として表
示することを要旨とする。A kana-kanji conversion method corresponding to this kana-kanji conversion device inputs a kana character string, refers to a dictionary, and scribes the kana character string thus input, and generates kana-kanji mixed character string candidates. Kana-Kanji conversion method
When performing phrase segmentation processing on the input kana character string, refer to a word dictionary that stores word reading and one representative notation as constituent elements, and perform segment segmentation processing using the representative notation. When the words constituting the phrase obtained by the phrase segmentation processing are words for which a plurality of notations are allowed, a predetermined notation is determined as the display notation, and the word represented by the display notation is The gist is to display it as the first candidate of the words that form the character string candidate mixed with kana and kanji.

【００１０】以上のように構成された本発明の第１の仮
名漢字変換装置および仮名漢字変換方法によれば、単語
の読みと一つの代表表記とを構成要素として単語辞書に
格納しておき、複数の表記が許容された単語について
は、この派生表記を代表表記に関連づけて派生表記辞書
に記憶しておく。入力された仮名文字列から仮名漢字混
じり文字列候補を生成するときには、この単語辞書を参
照して代表表記を用いて文節分かち書き処理を行なう。
この結果得られた文節を構成する単語が派生表記を有す
る場合には、所定の表記を表示用表記として決定し、生
成された仮名漢字混じり文字列候補を構成する単語の第
１候補として、この表示用表記で表わされた単語を表示
する。従って、文節分かち書き処理を行なうときに文法
上文節として成り立つかどうかの判断に用いる辞書に
は、代表表記で表わした単語に関する文法情報だけを記
憶しておけば良く、代表表記で表わした単語だけを用い
て文節分かち書き処理を行なえば良い。According to the first kana-kanji conversion device and kana-kanji conversion method of the present invention configured as described above, word reading and one representative notation are stored in the word dictionary as constituent elements, For a word for which a plurality of notations are allowed, this derivative notation is stored in the derivative notation dictionary in association with the representative notation. When a kana-kanji mixed character string candidate is generated from the input kana character string, the phrase segmentation processing is performed using the representative notation with reference to this word dictionary.
When the words constituting the bunsetsu obtained as a result have derivative notations, a predetermined notation is determined as the notation for display, and as the first candidate of the words constituting the generated kana-kanji mixed character string candidate, this Display the word represented in display notation. Therefore, in the dictionary used to determine whether or not a grammatical phrase is valid when performing phrase segmentation processing, it is sufficient to store only the grammatical information about the word represented by the representative notation, and only the word represented by the representative notation is stored. The phrase segmentation processing may be performed by using this.

【００１１】ここで、前記単語表記選択手段は、前記派
生表記辞書を参照して、前記表示用表記を決定する手段
を備えた構成としても良い。このとき、代表表記を用い
て文節分かち書き処理を行なった結果得られた単語が派
生表記を有する場合には、派生表記辞書を参照して所定
の表記を表示用表記として決定する。従って、表示用表
記に関する情報は、派生表記辞書に格納しておけば良
く、単語辞書には代表表記に関する情報だけを記憶して
おけば良い。Here, the word notation selecting means may be configured to include means for determining the display notation by referring to the derived notation dictionary. At this time, if the word obtained as a result of performing the phrase segmentation processing using the representative notation has a derivative notation, the predetermined notation is determined as the display notation by referring to the derivative notation dictionary. Therefore, the information about the display notation may be stored in the derivative notation dictionary, and only the information about the representative notation may be stored in the word dictionary.

【００１２】また、前記単語表記選択手段は、過去の派
生表記の使用に基づいて、前記派生表記辞書から対応す
る表記を前記単語辞書に記憶する手段と、該単語辞書に
記憶された情報を参照して、前記表示用表記を決定する
手段を備えた構成としても良い。このような構成とすれ
ば、過去の派生表記の使用に基づいて、派生表記辞書か
ら対応する表記が単語辞書に記憶される。従って、代表
表記を用いた文節分かち書き処理の結果得られた単語が
派生表記を有する場合には、この単語辞書に記憶された
情報を参照して、所定の表記を表示用表記として決定す
る。このとき、表示用表記に関する情報は単語辞書に格
納されるため、文節分かち書き処理を行なうにあたって
単語辞書を参照する際、代表表記と同時に表示用表記に
関する情報も得ることができる。従って、仮名漢字混じ
り文字列候補の表示を行なうときには、再び辞書を参照
することなく、表示用表記で表わされた単語を第１候補
として表示することが可能となる。Further, the word notation selecting means refers to means for storing a corresponding notation in the word dictionary from the derivative notation dictionary based on the use of past derivative notations, and information stored in the word dictionary. Then, it may be configured to include means for determining the display notation. With such a configuration, the corresponding notation is stored in the word dictionary from the derivative notation dictionary based on the past use of the notation. Therefore, when the word obtained as a result of the phrase segmentation processing using the representative notation has the derivative notation, the predetermined notation is determined as the display notation by referring to the information stored in this word dictionary. At this time, since the information about the display notation is stored in the word dictionary, it is possible to obtain the information about the display notation at the same time as the representative notation when referring to the word dictionary in performing the phrase segmentation processing. Therefore, when displaying a kana-kanji mixed character string candidate, it is possible to display the word represented by the display notation as the first candidate without referring to the dictionary again.

【００１３】さらに、前記単語辞書と前記派生表記辞書
とが一体である構成であっても良い。このとき、この一
体である辞書を参照して代表表記を用いて文節分かち書
きを行なう際、表示用表記も同時に読み込む。そのた
め、文節分かち書き処理の後、再び辞書を参照する必要
がなく、すでに読み込んである表示用表記で表わされた
単語を第１候補として仮名漢字文字列候補を表示する。
また、単語辞書と派生表記辞書とを一体にすることで、
辞書ファイルが扱い易くなるという効果を奏する。Further, the word dictionary and the derived notation dictionary may be integrated. At this time, when the phrase segmentation is performed using the representative notation with reference to the integrated dictionary, the notation for display is also read at the same time. Therefore, after the phrase segmentation processing, there is no need to refer to the dictionary again, and the kana-kanji character string candidates are displayed with the already-read word represented by the display notation as the first candidate.
Also, by integrating the word dictionary and the derivative notation dictionary,
The dictionary file is easy to handle.

【００１４】また、仮名漢字文字列候補を表示する上記
表示手段は、第１候補として表示する単語の同音異義語
と、少なくともこの第１候補の単語について許容される
表記とを区別可能な形態で表示する分割表示手段を備え
る構成とすることもできる。このような構成とすれば、
変換候補の選択時に、種々の同音異義語やその派生表記
が文法上あるいは意味上何のつながりもない順序で混在
する中から所望の表記を捜す必要がなく、容易に所望の
表記を選択可能となる。Further, the display means for displaying the kana / kanji character string candidates is capable of distinguishing the homonyms of the word displayed as the first candidate from the notations allowed for at least the first candidate word. It is also possible to adopt a configuration including a split display means for displaying. With such a configuration,
When selecting a conversion candidate, it is not necessary to search for a desired notation from among various homophones and their notations mixed in an order that has no grammatical or semantic connection, and the desired notation can be easily selected. Become.

【００１５】さらに上記分割表示手段は、この第１候補
の単語と各同音異義語の派生表記とを異なる２方向に展
開し、第１候補の単語表記に対する他の単語表記を二次
元的に表示する構成としても良い。このような構成とす
れば、所望の表記の選択がさらに容易になる。Further, the division display means expands the first candidate word and the derivative notation of each homonym in two different directions, and two-dimensionally displays the other word notation with respect to the first candidate word notation. It may be configured to. With such a configuration, it becomes easier to select a desired notation.

【００１６】ここで、上記分割表示手段により第２候補
以下の変換候補を表示するときには、第１候補として表
示された単語候補が所望の単語表記でないときに、所定
の変換キーなどにより変換指示を入力すると、上記のよ
うに二次元的に残りの候補を表示する構成としても良い
し、変換中の文節に含まれた単語の表記候補を常に上記
のように二次元的に展開する構成としても良い。Here, when the conversion candidates below the second candidate are displayed by the division display means, when the word candidate displayed as the first candidate is not the desired word notation, a conversion instruction is given by a predetermined conversion key or the like. When input, the remaining candidates may be displayed two-dimensionally as described above, or the notation candidates of the word included in the phrase being converted may be two-dimensionally expanded as described above. good.

【００１７】また、上記分割表示手段において、変換し
得るすべての単語表記候補を一度に二次元的に表記する
のではなく、現在第１候補として表示されている単語に
ついて、この第１候補の単語の派生表記と、この第１候
補の単語表記と同じ様式の表記で表わした同音異義語を
それぞれ所定の方向に展開して表示することも可能であ
る。このとき、表示された候補の中に所望の単語表記が
見つからない場合は、表示された同音異義語の中で、所
望の単語表記の派生表記を、一旦第１候補として選択し
直し、新たに表示された単語表記候補の中から所望の単
語表記を選択することになる。Further, in the divided display means, not all the word notation candidates that can be converted are not two-dimensionally expressed at once, but the word currently displayed as the first candidate is the word of the first candidate. It is also possible to develop and display the homologous synonyms expressed in the same style as the first candidate word notation and the derivative notation. At this time, if the desired word notation is not found in the displayed candidates, the derivative notation of the desired word notation among the displayed homonyms is once again selected as the first candidate, and a new one is newly selected. A desired word notation is selected from the displayed word notation candidates.

【００１８】一方、上記分割表示手段を、第１候補とし
て表示する単語と共にその同音異義語を表示し、その
後、所定の指示を受けて、この第１候補の単語および／
またはその同音異義語の少なくとも一つについての許容
される表記を表示する手段を備えるものとすることもで
きる。この場合には、所望の表記を選択するにあたって
は、まず、同音異義語の中から所望の語を選択し、その
後、その選択した語に関して、複数の許容された表記の
中から所望の表記を選択することになる。従って、順を
追って容易に所望の表記に辿り着くことができる。On the other hand, the divided display means displays the homonyms with the word to be displayed as the first candidate, and then receives a predetermined instruction to display the word and / or the first candidate.
Alternatively, it may be provided with means for displaying an allowable notation for at least one of the homonyms. In this case, when selecting the desired notation, first select the desired word from the homonyms and then select the desired notation from the multiple allowed notations for the selected word. Will be selected. Therefore, it is possible to easily reach the desired notation step by step.

【００１９】また、前記派生表記辞書には、異体文字を
用いた単語表記も同じ代表表記に関連づけて記憶するこ
ととし、以下の構成を採ることも好ましい。すなわち、
前記表示手段により第１候補の単語を表示するとき、こ
の第１候補の単語表記に異体文字が存在するならば異体
文字存在表示手段により異体文字が存在することを表示
し、その際に異体文字への変換を望むならば、異体文字
選択手段により、第１候補の表記様式に従い所望の異体
文字を用いた表記に変換するのである。この場合には、
多数の候補表記の中から所望の異体文字を用いた表記を
探し出す必要がなく、使用者が好んで選択する表記様式
に従って所望の異体文字を用いた単語表記を容易に得る
ことができる。It is also preferable that the derived notation dictionary also stores word notations using different characters in association with the same representative notation, and has the following configuration. That is,
When the first candidate word is displayed by the display means, if there is a variant character in the word representation of the first candidate, the variant character presence display means displays that there is a variant character, and at that time, the variant character If desired, the variant character selecting means converts into the notation using the desired variant character according to the notation style of the first candidate. In this case,
It is not necessary to search for a notation using a desired variant from a large number of candidate notations, and a word notation using a desired variant can be easily obtained according to the notation style that the user prefers.

【００２０】さらに、前記単語表記選択手段は、派生表
記の中から使用者が選択した選択状況を学習し、この学
習結果に基づいて、以降の表示用表記を決定する構成と
することも好適である。使用者が過去に好んで選択した
単語表記が第１候補となり、所望の仮名漢字混じり文が
容易に得られるからである。Further, it is also preferable that the word notation selecting means is configured to learn the selection situation selected by the user from the derivative notations and to determine subsequent display notations based on the learning result. is there. This is because the word notation that the user has preferred and selected in the past becomes the first candidate, and the desired kana-kanji mixed sentence can be easily obtained.

【００２１】ここで、前記学習の結果に基づいて、派生
表記辞書に記憶された複数の許容された表記の順序を並
び替えて記憶し、この複数の許容された表記を、この並
び替えた順序を変換候補順位として表示可能に準備する
派生表記辞書学習手段を備えた構成とすることもでき
る。この場合には、第１候補の単語表記だけでなく、第
２候補以下の単語表記についても、使用者が過去に選択
した選択状況が反映され、所望の単語表記を選択し易く
なる。Here, based on the result of the learning, the order of a plurality of allowed notations stored in the derivative notation dictionary is rearranged and stored, and the plurality of allowed notations is rearranged in this rearranged order. It is also possible to adopt a configuration provided with a derived notation dictionary learning means for preparing so that can be displayed as the conversion candidate rank. In this case, not only the word notation of the first candidate but also the word notation of the second candidate and below reflects the selection status selected by the user in the past, and it becomes easy to select the desired word notation.

【００２２】さらに、前記学習の結果に基づいて決定さ
れた表示用表記を、前記代表表記と関連づけて記憶し、
仮名漢字混じり文字列候補を構成する単語の第１候補と
して表示可能に準備する表示用表記学習手段を備えた構
成としても良い。このような構成とすれば、使用者が同
じ単語表記を使用し続ける限り、第１候補の単語表記を
表示する度に表示用表記を決定するための動作を行なう
必要がない。Further, the display notation determined based on the result of the learning is stored in association with the representative notation,
It may be configured to include a display notation learning unit that prepares to be displayed as a first candidate of a word that constitutes a kana-kanji mixed character string candidate. With such a configuration, as long as the user continues to use the same word notation, it is not necessary to perform the operation for determining the display notation each time the first candidate word notation is displayed.

【００２３】また、前記許容された複数の表記の中から
使用者により一つの単語表記が選択されたとき、該選択
された単語表記に対応する代表表記について、前記単語
辞書における同音異義語内で、前記仮名漢字混じり文字
列候補を構成する単語の候補として使用する優先順位を
変更する単語辞書学習手段を備えたものとしても良い。
文節分かち書き処理においては、読みと品詞が同じであ
れば、その他の文法情報が用意されていない限り、仮名
漢字混じり文字列候補を構成する単語の候補として残っ
た単語について、どの単語がよりふさわしいかをそれ以
上判断することはできない。従って、単語辞書における
同音異義語の優先順位に、使用者の単語選択状況を反映
させることにより、所望の仮名漢字混じり文字列が得易
くなる。When one word notation is selected by the user from the plurality of allowed notations, the representative notation corresponding to the selected word notation is included in the homonyms in the word dictionary. The word dictionary learning means for changing the priority order to be used as the candidates of the words forming the mixed kana / kanji character string candidates may be provided.
In the phrase segmentation process, if the pronunciation and the part-of-speech are the same, which word is more suitable as the candidate word of the words that form the character string candidates mixed with Kana and Kanji unless other grammatical information is prepared. Can't judge any more. Therefore, by reflecting the word selection status of the user in the priority order of homonyms in the word dictionary, it becomes easy to obtain a desired kana-kanji mixed character string.

【００２４】さらに、前記単語表記選択手段は、使用者
が前記許容された複数の表記の中から異体文字を用いた
表記を選択したとき、単語表記の選択状況の学習を禁止
する構成とすることも好適である。異体文字の使用は特
殊な場合であり、その特殊な選択状況を学習し、特殊な
表記である異体文字を用いた表記を第１候補として得易
くすることは、使用者にとって得策ではないと考えられ
るからである。Further, the word notation selecting means is configured to prohibit learning of the selection status of the word notation when the user selects a notation using different characters from the plurality of allowed notations. Is also suitable. The use of variant characters is a special case, and it is not a good idea for the user to learn the special selection situation and make it easier to obtain a notation using variant characters that is a special notation as the first candidate. Because it will be done.

【００２５】また、前記仮名漢字混じり文字列候補を生
成する過程において、２以上の単語が所定の係り受け関
係を持って変換されたことを習得する習得手段と、該２
以上の単語の少なくとも一つにおいて、使用頻度の低い
所定の派生表記が選択され、仮名漢字混じり文字列にお
いて確定されたときには、前記係り受け関係の習得を禁
止する習得禁止手段とを備えた構成も好ましい。係り受
け関係の習得は、複数の同音異義語の中から所望の単語
を候補単語として表示させるために有効な手段である。
しかしながら、特殊な表記に関しては、その使用される
状況も特殊である可能性が高い。従って、後に利用され
ないと考えられる特殊な表記についての係り受け関係を
習得しない手段をさらに設けておくことは有用である。Further, in the process of generating the kana-kanji mixed character string candidates, learning means for learning that two or more words have been converted with a predetermined dependency relationship,
In at least one of the above words, when a predetermined derived expression with low frequency of use is selected and confirmed in a kana-kanji mixed character string, there is also a configuration including a learning prohibition means for prohibiting learning of the dependency relationship. preferable. Learning the dependency relationship is an effective means for displaying a desired word as a candidate word from a plurality of homonyms.
However, with regard to special notation, the situation in which it is used is likely to be special. Therefore, it is useful to further provide a means for not learning the dependency relation for the special notation that is not used later.

【００２６】以上説明したように、本発明の第１の仮名
漢字変換装置および仮名漢字変換方法では、各単語の代
表表記だけを読み込んで文節分かち書き処理を行なうた
め、文節分かち書き処理に必要な辞書は、代表表記に関
する情報だけを備えておけば良く、辞書サイズを縮小す
ることができる。また、文節分かち書き処理は代表表記
だけを対象に行なわれるので、文節分かち書きにおける
動作が大きく軽減され処理速度がより高速になるという
優れた効果を奏する。As described above, in the first kana-kanji conversion device and kana-kanji conversion method of the present invention, only the representative notation of each word is read and the segmentation and segmentation processing is performed. Therefore, the dictionary required for segmentation and segmentation processing is The dictionary size can be reduced by providing only the information about the representative notation. In addition, since the phrase segmentation writing process is performed only for the representative notation, there is an excellent effect that the operation in the segment segmentation segmentation is greatly reduced and the processing speed becomes higher.

【００２７】一方、本発明の第２の仮名漢字変換装置
は、仮名文字列を入力し、辞書を参照して、該入力され
た仮名文字列を文節分かち書きし、仮名漢字混じり文字
列候補を生成する仮名漢字変換装置であって、時刻、年
月日などの所定の時点を示す日時情報と、該日時情報を
表わす特定の見出し語とを対応して記憶する日時用語変
換辞書と、現時点における前記日時情報を発生する現日
時情報発生手段と、所定の文法情報に従って文節分かち
書き処理を行なうとき、前記日時情報を表わす特定の見
出し語の有無を調べる日時用語検索手段と、該日時用語
検索手段により該日時情報を表わす特定の見出し語が発
見されたときに、前記日時用語変換辞書および前記現日
時情報発生手段を参照して、該日時情報を表わす特定の
見出し語を対応する前記日時情報に変換する日時用語変
換手段と、複数の日時表示用フォーマットを記憶する日
時表示用フォーマット記憶手段と、前記変換結果を表示
するとき、該複数の日時表示用フォーマットから決定し
た所定のフォーマットにより表示する日時表示手段と、
該使用されたフォーマットが非所望であるとき、前記複
数の日時表示用フォーマットの残りから選択した所望の
フォーマットにより前記変換結果を表示する日時表示用
フォーマット選択手段と、前記記憶された複数のフォー
マットのひとつが使用者により選択されたことを学習
し、該学習の結果に基づいて、前記日時情報を表示する
際、使用する日時表示用フォーマットを変更する日時表
示用フォーマット変更手段とを備えたことを要旨とす
る。On the other hand, the second kana-kanji conversion device of the present invention inputs a kana character string, refers to a dictionary, and writes out the input kana character string for each phrase to generate a kana-kanji mixed character string candidate. A kana-kanji conversion device that stores time and date information indicating a predetermined time such as time and date, and a date and time term conversion dictionary that stores a specific headword representing the time and date information in association with each other. A current date / time information generating means for generating date / time information, a date / time term searching means for checking the presence / absence of a specific entry word representing the date / time information when performing the phrase segmentation processing according to predetermined grammatical information, and the date / time term searching means When a specific entry word representing the date and time information is found, the date and time term conversion dictionary and the current date and time information generating means are referenced to correspond to the specific entry word representing the date and time information. Date / time term conversion means for converting into date / time information, date / time display format storage means for storing a plurality of date / time display formats, and a predetermined format determined from the plurality of date / time display formats when displaying the conversion result Date and time display means to display by
When the used format is undesired, a date / time display format selecting means for displaying the conversion result in a desired format selected from the rest of the plurality of date / time display formats, and the stored plurality of formats. It is provided with a date / time display format changing means for changing the date / time display format to be used when displaying one of the date / time information on the basis of the result of the learning that one has been selected by the user. Use as a summary.

【００２８】この仮名漢字変換装置に対応する仮名漢字
変換方法は、仮名文字列を入力し、辞書を参照して、該
入力された仮名文字列を文節分かち書きし、仮名漢字混
じり文字列候補を生成する仮名漢字変換方法であって、
前記入力された仮名文字列に対して文節分かち書き処理
を行なうとき、時刻、年月日などの所定の時点を示す日
時情報を表わす特定の見出し語を検索し、該日時情報を
表わす特定の見出し語が発見されたときに、該日時情報
と該日時情報を表わす特定の見出し語とを対応して記憶
する日時用語変換辞書と、現時点における該日時情報と
を参照して、該日時情報を表わす特定の見出し語を対応
する該日時情報に変換し、記憶されている複数の日時表
示用フォーマットから決定した所定のフォーマットによ
り前記変換結果を表示し、該表示されたフォーマットが
非所望であるとき、前記複数の日時表示用フォーマット
の残りから選択した所望のフォーマットにより前記変換
結果を表示し、前記記憶された複数のフォーマットのひ
とつが使用者により選択されたことを学習し、該学習の
結果に基づいて、前記日時情報を表示する際に使用する
日時表示用フォーマットを変更することを要旨とする。A kana-kanji conversion method corresponding to this kana-kanji conversion device inputs a kana character string, refers to a dictionary, and scribes the input kana character string into phrases, and generates kana-kanji mixed character string candidates. Kana-Kanji conversion method
When the phrase segmentation processing is performed on the input kana character string, a specific entry word representing date and time information indicating a predetermined time point such as time and date is searched for, and a specific entry word representing the date and time information is searched. When is found, the date / time term conversion dictionary that stores the date / time information and the specific entry word representing the date / time information in association with each other, and the date / time information at the present time, are referenced to identify the date / time information. Is converted to the corresponding date and time information, the conversion result is displayed in a predetermined format determined from a plurality of stored date and time display formats, and when the displayed format is undesired, The conversion result is displayed in a desired format selected from the rest of the plurality of date and time display formats, and one of the plurality of stored formats is displayed by the user. Learns that it has been selected, based on the result of the learning, is summarized in that to change the date and time display format to be used when displaying the date and time information.

【００２９】以上のように構成された本発明の第２の仮
名漢字変換装置および仮名漢字変換方法によれば、時
刻、年月日などの所定の時点を示す日時情報と、この日
時情報を表わす特定の見出し語とを対応させて日時用語
変換辞書に記憶しておく。所定の文法情報に従って文節
分かち書き処理を行なうときには、日時用語検索手段が
この日時情報を表わす特定の見出し語の有無を調べる。
日時情報を表わす特定の見出し語が発見されると、日時
用語変換手段が、前記日時用語変換辞書および現時点に
おける日時情報を発生する現日時情報発生手段とを参照
して、日時情報を表わす特定の見出し語を対応する日時
情報に変換する。この変換結果は、記憶してある複数の
日時表示用フォーマットの中から決定した所定のフォー
マットを用いて表示する。この時使用したフォーマット
が非所望であるときには、記憶してある複数の日時表示
用フォーマットの残りの中から選択された所望のフォー
マットにより、前記変換結果を表示する。このフォーマ
ットの選択状況は学習され、この学習の結果に基づい
て、日時情報を表示する際に使用する日時表示用フォー
マットを変更する。According to the second kana-kanji conversion device and kana-kanji conversion method of the present invention configured as described above, date and time information indicating a predetermined time point such as time and date, and this date and time information are represented. It is stored in the date / time term conversion dictionary in association with a specific headword. When the phrase segmentation processing is performed in accordance with predetermined grammatical information, the date / time term search means checks whether or not there is a specific headword representing this date / time information.
When a specific entry word representing the date and time information is found, the date and time term conversion means refers to the date and time term conversion dictionary and the current date and time information generation means for generating the date and time information at the present time to identify the specific date and time information. Convert entry words to corresponding date and time information. The conversion result is displayed using a predetermined format determined from the stored plurality of date and time display formats. When the format used at this time is undesired, the conversion result is displayed in the desired format selected from the rest of the plurality of stored date and time display formats. The selection status of this format is learned, and the date and time display format used when displaying date and time information is changed based on the result of this learning.

【００３０】従って、日時情報を表わす特定の見出し語
を入力すれば、所定のフォーマットに従った表示で、こ
の日時情報を表わす特定の見出し語に対応する日時情報
を変換結果として得ることができる。さらに、日時表示
用フォーマットは複数登録されており、この日時表示用
フォーマットの選択状況を学習し、その学習結果に基づ
いて第１候補を表示する際に使用する日時表示用フォー
マットを変更するため、日時情報を表わす特定の見出し
語を日時情報に変換する際、使用者が好んで選択する日
時表示用フォーマットにより変換結果を表示することが
容易になる。Therefore, by inputting a specific entry word representing date / time information, it is possible to obtain date / time information corresponding to the specific entry word representing this date / time information as a conversion result in a display according to a predetermined format. Further, since a plurality of date and time display formats are registered, the selection status of the date and time display format is learned, and the date and time display format used when displaying the first candidate is changed based on the learning result. When converting a specific entry word representing date and time information into date and time information, it becomes easy to display the conversion result in a date and time display format that the user prefers to select.

【００３１】また、日時情報を表わす特定の見出し語を
対応した日時情報に変換した結果を表示するのか、日時
情報を表わす特定の見出し語としてではなく、通常の単
語として変換した結果を表示するのかを選択する日時変
換切替手段を備えた構成としても良い。この場合には、
日時変換を行なう必要がないときに、日時変換された不
必要な変換候補が表示されることがない。Whether the result obtained by converting a specific headword representing date and time information into corresponding date and time information is displayed, or whether the result converted as a normal word is displayed instead of the specific headword representing date and time information. It may be configured to include a date / time conversion switching means for selecting. In this case,
When it is not necessary to perform date / time conversion, unnecessary date / time-converted conversion candidates are not displayed.

【００３２】このように、本発明の第２の仮名漢字変換
装置および仮名漢字変換方法では、日時情報を表わす特
定の見出し語を入力すれば日時変換が行なわれ、これに
対応した日時情報が所定の日時表示用フォーマットによ
り表示され、使用した日時表示用フォーマットは学習さ
れるため、所望のフォーマットを用いて記載した日時情
報を含んだ文書を極めて容易に作成可能となる。As described above, according to the second kana-kanji conversion device and kana-kanji conversion method of the present invention, the date and time is converted by inputting a specific entry word representing the date and time information, and the date and time information corresponding thereto is predetermined. Since the displayed date and time display format is used and the used date and time display format is learned, a document including date and time information described using a desired format can be created very easily.

【００３３】また、前記日時を表わす特定の見出し語を
日時情報に変換可能な仮名漢字変換装置としては、本発
明の第１の仮名漢字変換装置の構成に加えて、時刻、年
月日などの所定の時点を示す日時情報と、該日時情報を
表わす特定の見出し語とを対応して記憶する日時用語変
換辞書と、現時点における前記日時情報を発生する現日
時情報発生手段と、所定の文法情報に従って文節分かち
書き処理を行なうとき、前記日時情報を表わす特定の見
出し語の有無を調べる日時用語検索手段と、該日時用語
検索手段により該日時情報を表わす特定の見出し語が発
見されたときに、前記日時用語変換辞書および前記現日
時情報発生手段を参照して、該日時情報を表わす特定の
見出し語を対応する前記日時情報に変換する日時用語変
換手段と、該日時情報を表わす特定の見出し語の変換候
補を表示する際、該日時情報を表わす特定の見出し語の
同音異義語の一つとして、あるいは該日時情報を表わす
特定の見出し語の許容された表記の一つとして、前記日
時情報への変換結果を所定のフォーマットにより表示す
る日時候補混合表示手段とを備える構成も好適である。As a kana-kanji conversion device capable of converting a specific entry word representing the date and time into date and time information, in addition to the configuration of the first kana-kanji conversion device of the present invention, time, date, year, etc. A date / time term conversion dictionary that stores date / time information indicating a predetermined time point and a specific entry word representing the date / time information in association with each other, a current date / time information generating unit that generates the date / time information at the current time point, and predetermined grammatical information When performing the phrase segmentation processing according to, the date and time term searching means for checking the presence or absence of a particular headword representing the date and time information, and when the particular headword representing the date and time information is found by the date and time term searching means, A date and time term conversion unit that refers to the date and time term conversion dictionary and the current date and time information generation unit and converts a specific entry word representing the date and time information into the corresponding date and time information, and the date and time When displaying a conversion candidate of a specific headword representing information, as one of the homonyms of the specific headword representing the date / time information, or one of the allowed notations of the specific headword representing the date / time information. As one of the configurations, a configuration including a date / time candidate mixed display means for displaying the conversion result into the date / time information in a predetermined format is also preferable.

【００３４】以上のような構成によれば、日時情報を表
わす特定の見出し語を対応する日時情報に変換した結果
は、この特定の見出し語の同音異義語のひとつとして、
あるいはこの特定の見出し語の許容された表記の一つと
して所定のフォーマットにより表示されるので、所望の
表記を極めて容易に選択することができる。また、日時
情報を表わす特定の見出し語の変換の際、対応する日時
情報への変換を行なうのか、通常の単語として変換した
結果を表示するかの指示を入力する必要もない。According to the above configuration, the result of converting the specific entry word representing the date and time information to the corresponding date and time information is as one of the homonyms of this specific entry word.
Alternatively, since it is displayed in a predetermined format as one of the allowed notations of this particular headword, the desired notation can be selected extremely easily. In addition, when converting a specific entry word representing date and time information, it is not necessary to input an instruction as to whether to convert to corresponding date and time information or to display the result of conversion as a normal word.

【００３５】また、本発明の第３の仮名漢字変換装置
は、日本語において許容された複数の表記が存在する語
について、許容された複数の表記を一つの代表表記と該
代表表記を除いて許容された表記である派生表記とに整
理し、仮名漢字変換における文節分かち書きの処理にお
いては前記代表表記を用い、該文節分かち書きされた後
の表記においては前記代表表記および前記派生表記の内
の一つを用いることを要旨とする。Further, the third kana-kanji conversion device of the present invention includes, for a word in which a plurality of notations allowed in Japanese exist, one of the allowed notations and one representative notation. Derived notation, which is an allowable notation, is arranged, and the representative notation is used in the processing of segmentation and segmentation in Kana-Kanji conversion, and one of the representative representation and the derivative notation is used in the segmentation after segmentation and segmentation. The point is to use one.

【００３６】このような本発明の第３の仮名漢字変換装
置では、本発明の第１の仮名漢字変換装置と同様の効果
を奏する。The third kana-kanji conversion device of the present invention as described above has the same effects as the first kana-kanji conversion device of the present invention.

【００３７】[0037]

【発明の実施の形態】以上説明した本発明の構成・作用
を一層明らかにするために、以下本発明の実施の形態を
実施例に基づき説明する。図１は本発明の好適な一実施
例である仮名漢字変換装置において、仮名漢字変換制御
ロジックが実際に動作するハードウェアの構成を示すブ
ロック図である。図１に示すように、この装置は、予め
設定されたプログラムに従って仮名漢字変換に関わる動
作を制御するための各種演算処理を実行するＣＰＵ２１
を中心に、バス３１により相互に接続された次の各部を
備える。ＲＯＭ２２は、ＣＰＵ２１で各種演算処理を実
行するのに必要な仮名漢字変換に関わるプログラムやデ
ータなどを予め格納しており、ＲＡＭ２３は、同じくＣ
ＰＵ２１で各種演算処理を実行するのに必要な仮名漢字
変換に関わる各種データが一時的に読み書きされるメモ
リである。キーボードインターフェース２５は、キーボ
ード２４からのキー入力を司り、ＣＲＴＣ２７は、カラ
ーで表示可能なＣＲＴ２６への信号出力を制御し、プリ
ンタインターフェース２９は、プリンタ２８へのデータ
の出力を制御する。ハードディスク３２には、ＲＡＭ２
３にロードされて実行される各種プログラムやデバイス
ドライバの形式で提供される仮名漢字変換処理プログラ
ム、あるいは各種変換辞書などが記憶されており、この
ハードディスク３２をハードディスクコントローラ（Ｈ
ＤＣ）３０が制御する。タイマ３３は、現時点における
時刻、年月日などの所定の時点を示す日時情報を発生し
ている。BEST MODE FOR CARRYING OUT THE INVENTION In order to further clarify the structure and operation of the present invention described above, the embodiments of the present invention will be described below based on Examples. FIG. 1 is a block diagram showing a hardware configuration in which a kana-kanji conversion control logic actually operates in a kana-kanji conversion device according to a preferred embodiment of the present invention. As shown in FIG. 1, this device is a CPU 21 that executes various arithmetic processes for controlling operations related to Kana-Kanji conversion according to a preset program.
And the following parts connected to each other by a bus 31. The ROM 22 stores in advance programs and data relating to kana-kanji conversion necessary for the CPU 21 to execute various arithmetic processes, and the RAM 23 also stores C
The PU 21 is a memory in which various data related to Kana-Kanji conversion necessary for executing various arithmetic processes are temporarily read and written. The keyboard interface 25 controls key inputs from the keyboard 24, the CRTC 27 controls signal output to the CRT 26 that can be displayed in color, and the printer interface 29 controls data output to the printer 28. The hard disk 32 has a RAM 2
The Kana-Kanji conversion processing program provided in the form of a device driver and various programs to be loaded and executed in the memory 3 or various conversion dictionaries are stored.
DC) 30 controls. The timer 33 generates date and time information indicating a predetermined time, such as a current time, a date, and the like.

【００３８】このように構成されたハードウェアによ
り、文字列の入力、仮名漢字変換、編集、表示、印刷な
どがなされる。すなわち、キーボード２４から入力され
た文字列は、ハードディスク３２に記憶した各種データ
を参照しながらＣＰＵ２１により所定の処理がなされ、
ＲＡＭ２３の所定領域に格納され、ＣＲＴＣ２７を介し
てＣＲＴ２６の画面上に表示される。With the hardware thus configured, input of a character string, conversion of kana to kanji, editing, display, printing, etc. are performed. That is, the character string input from the keyboard 24 is subjected to predetermined processing by the CPU 21 while referring to various data stored in the hard disk 32,
It is stored in a predetermined area of the RAM 23 and is displayed on the screen of the CRT 26 via the CRTC 27.

【００３９】次に、こうして構成されたハードウェアに
より実行される仮名漢字変換処理の詳細について説明す
る。まず始めに図２を用いて仮名漢字変換処理に関わる
各部の構成と働きについて概説するが、ここで行なわれ
る処理は、キーボード２４より入力されたデータに基づ
き、中央処理装置（ＣＰＵ２１）が実行するものであ
る。このＣＰＵ２１により、すべての処理が行なわれ
る。仮名漢字変換については、キーボード２４が操作さ
れたとき、所定の割り込み処理が起動し、入力したキー
イメージを対応する仮名文字列に変換し、更にこれを仮
名漢字混じり文字列に変換するデバイスドライバが起動
する。もとより、並行処理可能なコンピュータであれ
ば、仮名漢字変換を一つのアプリケーション（インプッ
トメソッド）が行なうものとし、変換結果を、必要とす
るアプリケーションに引き渡す構成としても差し支えな
い。この場合には、キーボード２４からの入力をインプ
ットメソッドが一括して引き受けることになる。Next, the details of the kana-kanji conversion processing executed by the hardware thus configured will be described. First, the configuration and operation of each part involved in the kana-kanji conversion process will be outlined with reference to FIG. It is a thing. The CPU 21 performs all processing. Regarding kana-kanji conversion, when the keyboard 24 is operated, a predetermined interrupt process is activated, and a device driver for converting the input key image into a corresponding kana-character string and further converting this into a kana-kanji mixed character string. to start. Of course, if the computer is capable of parallel processing, then kana-kanji conversion may be performed by one application (input method), and the conversion result may be passed to the required application. In this case, the input method collectively accepts the input from the keyboard 24.

【００４０】キーボード２４からのキーイメージは、文
字入力部４０により受け付けられ、ここで、対応する文
字列に変換される。ローマ字入力の場合には所定の変換
テーブルを参照して、仮名文字列に変換する。一つの仮
名文字が得られる度に文字入力部４０は、その仮名文字
を変換制御部４２に送出する。この変換制御部４２は、
仮名漢字変換の中心的な役割を果たすところであり、後
述する種々の仮名漢字変換を制御して、結果を変換後文
字列出力部４４に送出する。変換後文字列出力部４４
は、現実には、ＣＲＴＣ２７に信号を送り、ＣＲＴ２６
に変換後文字列を表示する。The key image from the keyboard 24 is accepted by the character input unit 40, and converted into a corresponding character string here. When inputting romaji, it refers to a predetermined conversion table and converts into a kana character string. Every time one kana character is obtained, the character input unit 40 sends the kana character to the conversion control unit 42. This conversion control unit 42
It plays a central role in Kana-Kanji conversion, controls various Kana-Kanji conversions described later, and sends the result to the converted character string output unit 44. Converted character string output unit 44
Actually sends a signal to the CRTC 27,
Display the converted string.

【００４１】変換制御部４２は、受け取った仮名文字を
文字列入力部５０に引き渡す。文字列入力部５０は、文
字格納部５２に仮名文字列を格納する。この文字列に基
づいて、自立語候補作成部５４と付属語候補作成部６４
とが、単語データの候補を作成する。自立語候補作成部
５４は、ハードディスク３２に予め記憶された自立語辞
書５８を用い、自立語解析位置管理部５６の管理の下
で、得られた仮名文字列から自立語候補を抽出する処理
を行なう。一方、付属語候補作成部６４は、同じく付属
語辞書６８を用い、付属語解析位置管理部６６の管理の
下で、得られた仮名文字列から付属語候補を抽出する処
理を行なう。解析位置を移動しつつ、自立語候補と付属
語候補を抽出する処理については、後述する。The conversion control unit 42 delivers the received kana characters to the character string input unit 50. The character string input unit 50 stores the kana character string in the character storage unit 52. Based on this character string, the independent word candidate creation unit 54 and the adjunct word candidate creation unit 64
And create word data candidates. The independent word candidate creation unit 54 uses the independent word dictionary 58 stored in advance in the hard disk 32, and under the management of the independent word analysis position management unit 56, performs processing for extracting independent word candidates from the obtained kana character string. To do. On the other hand, the adjunct word candidate creating unit 64 also performs the process of extracting an adjunct word candidate from the obtained kana character string under the control of the adjunct word analysis position managing unit 66 using the adjunct word dictionary 68. The process of extracting independent word candidates and adjunct word candidates while moving the analysis position will be described later.

【００４２】ここで、自立語辞書５８は、学習により、
同音異義語や接辞などの優先順位を変更する。この学習
処理を行なうのが、自立語学習部７２、補助語学習部７
４、接辞学習部７６、文字変換学習部７８である。自立
語学習部７２は、同音異義語の存在する自立語群におい
て、最後に選択された単語を最優先の候補とするよう学
習するものである。補助語学習部７４は、例えば「くだ
さい」などの補助語を「ください」「下さい」など、い
ずれの語形で変換するかを学習するものである。更に、
接辞学習部７６は、接頭語、接尾語などの変換形式（例
えば、「御」「ご」など）を学習するものである。文字
変換学習部７８は、入力した文字列をそのままひらがな
やカタカナとして確定させた場合に、その文字列を学習
し、次回以降の変換処理では確定させたひらがなまたは
カタカナを候補として出力するものである。Here, the independent word dictionary 58 is learned by learning.
Change the priority of homonyms and affixes. This learning process is performed by the independent word learning unit 72 and the auxiliary word learning unit 7
4, an affix learning unit 76 and a character conversion learning unit 78. The independent word learning unit 72 learns the word selected last in the independent word group in which homonyms exist as the highest priority candidate. The auxiliary word learning unit 74 learns in which word form an auxiliary word such as "please" is converted to "please" or "please". Furthermore,
The affix learning unit 76 is for learning conversion formats such as prefixes and suffixes (for example, "Go" and "Go"). The character conversion learning unit 78 learns the input character string as it is as hiragana or katakana and outputs the confirmed hiragana or katakana as a candidate in the subsequent conversion process. .

【００４３】自立語候補作成部５４、付属語候補作成部
６４により、作成された語候補を得て、単語データ作成
部８０が、各語候補についてのデータを作成する。即
ち、得られた自立語と付属語、自立語と自立語、更には
「自立語＋付属語」からなる文節間の接続を接続検定テ
ーブル８４を参照して接続検定部８２が行なった結果、
および全体のコスト計算をコスト計算部８６が行なった
結果を得て、単語毎のデータとして出力するのである。
この単語データは、一旦単語データ格納部１００に格納
され、文節分かち書きの処理に供される。文節分かち書
き部１０２は、得られたデータから文節分かち書きの第
１候補を決定する。The independent word candidate creating unit 54 and the auxiliary word candidate creating unit 64 obtain the created word candidates, and the word data creating unit 80 creates data for each word candidate. That is, as a result of the connection verification unit 82 referring to the connection verification table 84, the connection between the obtained independent words and adjunct words, independent words and independent words, and further the connection between the clauses consisting of “independent words + adjunct words”,
And the cost calculation unit 86 obtains the result of the cost calculation performed by the cost calculation unit 86 and outputs the result as data for each word.
This word data is once stored in the word data storage unit 100 and is used for the processing of segmentation and segmentation. The segment segmentation writing unit 102 determines a first segment segmentation segmentation candidate from the obtained data.

【００４４】以上の処理により文節分かち書きの第１候
補と、その文節毎の仮名漢字変換の第１候補が決定され
る。文節分かち書き部１０２は、その候補を文節データ
格納部１０６に格納し、格納された候補は、変換文字列
出力部１０８により変換制御部４２に出力される。変換
制御部４２は、この文字列を候補文字列として表示する
と共に、非所望の文字列が候補となる場合もありえるか
ら、使用者による指示を受けて、次候補の表示や選択な
どの処理を行なう。これらの指示や選択の結果などは、
文節データ格納部１０６や既述した各学習部７２ないし
７８に入力され、文節の一部確定や学習による優先順位
の書き換えなどに用いられる。なお、図示していない
が、使用者により文字列の確定処理がなされると、各部
に一時的に保存されたデータはすべて消去され、次の変
換に備える。By the above processing, the first candidate for segmentation and segmentation and the first candidate for kana-kanji conversion for each segment are determined. The phrase segmentation writing unit 102 stores the candidates in the phrase data storage unit 106, and the stored candidates are output to the conversion control unit 42 by the conversion character string output unit 108. The conversion control unit 42 displays this character string as a candidate character string, and since an undesired character string may be a candidate, the conversion control unit 42 receives instructions from the user and performs processing such as display and selection of the next candidate. To do. The results of these instructions and selections are
It is input to the phrase data storage unit 106 and each of the learning units 72 to 78 described above, and is used for partially fixing the phrase and rewriting the priority order by learning. Although not shown, when the character string is confirmed by the user, all the data temporarily stored in the respective units are erased to prepare for the next conversion.

【００４５】以上、仮名文字列の入力から変換後文字列
の出力までを概説したが、次に各処理の詳細について説
明する。まず始めに仮名漢字変換処理について大まかに
説明し、次に本発明の要部である単語辞書について説明
を行なう。図３は、本実施例における仮名漢字変換処理
ルーチンを示すフローチャートである。この処理ルーチ
ンは、キーボード２４から一ないし複数の仮名文字が入
力された後、変換キー（例えば「スペースキー」）が押
されたとき、開始される処理である。なお、変換キーが
操作されなくても、所定数の仮名文字が入力されたと
き、あるいは「。」や「、」「．」などの区切り記号が
入力されたときに、図３の仮名漢字変換処理が開始され
るものとしても差し支えない。この処理が開始される
と、まず単語検索処理（ステップＳ１００）と分かち書
き処理（ステップＳ１１０）とが行なわれる。これらの
処理について、図４のフローチャートを用いて説明す
る。図４は一般的な文節分かち書き処理の一つである最
小コスト法による文節分かち書き処理の概要を示すフロ
ーチャートであり、図３におけるステップＳ１００およ
びステップＳ１１０に相当する。また、図５は、最小コ
スト法による文節分かち書きの様子を示す説明図であ
る。以下、図４における文節分かち書き処理の流れを、
図５での具体例に即して説明する。The outline from the input of the kana character string to the output of the converted character string has been described above. Next, the details of each process will be described. First, the kana-kanji conversion processing will be roughly described, and then the word dictionary which is the main part of the present invention will be described. FIG. 3 is a flowchart showing a kana-kanji conversion processing routine in this embodiment. This processing routine is a processing which is started when a conversion key (for example, "space key") is pressed after one or a plurality of kana characters are input from the keyboard 24. Even if the conversion key is not operated, when a predetermined number of kana characters are input or when a delimiter such as “.”, “,” “.” Is input, the kana-kanji conversion of FIG. It does not matter if the process is started. When this process is started, first, a word search process (step S100) and a segmentation process (step S110) are performed. These processes will be described with reference to the flowchart of FIG. FIG. 4 is a flowchart showing an outline of the phrase segmentation / drawing processing by the minimum cost method, which is one of the general segmentation / segmentation processing, and corresponds to step S100 and step S110 in FIG. In addition, FIG. 5 is an explanatory diagram showing a manner of segmentation and segmentation by the minimum cost method. Hereinafter, the flow of the phrase segmentation processing in FIG.
A description will be given according to the specific example in FIG.

【００４６】キーボード２４から文字列が入力されて文
節分かち書き処理が開始されると、まず、一時的に保存
されたデータの消去や解析位置を一桁目に初期化するな
どの初期化の処理（ステップＳ２００）を行なった後、
解析位置を求める処理を行なう（ステップＳ２１０）。
いま、キーボード２４から「とりあつかいがかわる」と
いう仮名文字列を入力したならば、最初の解析位置は一
桁目の「と」の位置となる。この解析位置は、入力され
た仮名文字列の解析が進むに従って、入力された仮名文
字列の先頭から順に一つずつ進められていく。すなわ
ち、「と」の位置を解析位置として、後述するような検
索処理を行なった後は、次の解析位置は「り」となり、
その次は「あ」というように順次進んで行くのである。When a character string is input from the keyboard 24 and the phrase segmentation writing process is started, first, an initialization process such as erasing the temporarily stored data and initializing the analysis position in the first digit ( After performing step S200),
Processing for obtaining an analysis position is performed (step S210).
Now, if the kana character string "Take care changes" is input from the keyboard 24, the first analysis position is the "to" position of the first digit. As the analysis of the input kana character string proceeds, the analysis position is advanced one by one from the beginning of the input kana character string. That is, after the search processing described below is performed with the position of "to" as the analysis position, the next analysis position becomes "ri",
The next step is "a".

【００４７】ステップＳ２１０で解析位置が決まると、
各解析位置において、ハードディスク３２に記憶された
自立語辞書５８および付属語辞書６８を検索する処理を
行なう（ステップＳ２２０）。例えば、図５の「と」を
解析位置とすると、「戸」「登」「徒」「鳥」「取り」
「取り扱い」などの単語が検索結果として得られる。こ
れら検索の結果得られた単語は、図５に示すようにその
単語の終わる桁位置にリンクしておく。「取り扱い」ま
で検索を行なうと、「とりあつかいが」以降の読みを持
つ単語は辞書を検索しても見つからないため、「と」を
解析位置とした辞書検索を終了する。辞書検索が終了す
ると、得られた単語についてそれ以前の単語との接続の
可能性をチェックする処理を行なう（ステップＳ２３
０）。上記「と」を解析位置とした例では、「と」は入
力された文頭の文字であるためそれ以前の単語との接続
の可能性をチェックすることはないが、例えば、係助詞
の直前に助詞が存在する場合など、その接続がありえな
いと判断される時には、無効なデータとして扱われる
（ステップＳ２３５）。When the analysis position is determined in step S210,
At each analysis position, a process of searching the independent word dictionary 58 and the auxiliary word dictionary 68 stored in the hard disk 32 is performed (step S220). For example, if “to” in FIG. 5 is the analysis position, “door”, “climbing”, “tuple”, “bird”, “topping”
A word such as "handling" is obtained as a search result. The word obtained as a result of these searches is linked to the ending digit position of the word as shown in FIG. When the word "handling" is searched, no word having a reading after "Torii Tsukaiga" can be found in the dictionary, so the dictionary search with "to" as the analysis position ends. When the dictionary search is completed, a process of checking the possibility of connecting the obtained word with the previous word is performed (step S23).
0). In the above example where "to" is the analysis position, since "to" is the character at the beginning of the sentence, the possibility of connection with the previous words is not checked, but for example, just before the particle. When it is determined that the connection is impossible, such as when a particle exists, it is treated as invalid data (step S235).

【００４８】接続の可能性のある単語については、次に
コスト計算を行ない、その単語の最小総コストを求める
処理を行なう（ステップＳ２４０）。これは、自立語＝
２、付属語＝０のコストを持つものとし、入力された文
字列の先頭から各単語までの最小総コストを求める処理
である。例えば、「戸」（自立語）の総コストは２、
「理」の総コストは「戸」（自立語）＋「理」（自立
語）とするならば４と求めるものである。このとき、最
小の総コストを求めるため、「亜」の総コストは、
「戸」＋「理」＋「亜」のコスト６ではなく、「鳥」＋
「亜」の場合のコスト４を採用する。「が」は付属語な
ので、その直前にリンクされた単語のうち最小の総コス
トを有する単語「取り扱い」＝２のコストがそれ自身の
コストになる。図５では、各単語のコストをそれぞれの
右下に記した。なお、更に細かく解析するために付属語
内を２つに分けて、助詞＝０、助動詞＝１としても良
い。For words that may be connected, the cost is calculated next, and the minimum total cost of the words is calculated (step S240). This is an independent word =
2. This is a process for obtaining the minimum total cost from the beginning of the input character string to each word, assuming that the cost is 2, adjunct = 0. For example, the total cost of "door" (independent word) is 2,
The total cost of "ri" is calculated as 4 if "door" (independent word) + "ri" (independent word). At this time, in order to find the minimum total cost, the total cost of "A" is
"Bird" + not "door" + "Ri" + "A" cost 6
Adopt cost 4 for "A". Since "ga" is an adjunct, the cost of the word "handling" = 2, which has the lowest total cost of the words linked immediately before it, becomes its own cost. In FIG. 5, the cost of each word is shown at the bottom right of each. It should be noted that, for further detailed analysis, the adjunct word may be divided into two, and the particle = 0 and the auxiliary verb = 1 may be set.

【００４９】以上のコスト計算が終わると、次に各単語
のコストチェックを行なう（ステップＳ２５０）。これ
は、文字列を単語の組合わせに分けたとき、他の組合わ
せに較べてコストが大きくなる組合わせを除く処理であ
る。例えば、「戸」＋「理」という組合わせは「鳥」や
「取り」のコストよりも高くなるので、「戸」＋「理」
の組合わせは不適切であると判断して、文節候補から除
外する（ステップＳ２５５）。図５では、このようなコ
ストチェックの結果文節候補として採用しないと判断し
た単語の右上に「●」を、文節候補を形成する可能性の
ある語として残った単語の右上に「○」を付した。After the above cost calculation is completed, the cost of each word is checked (step S250). This is a process that excludes combinations in which a character string is divided into word combinations and costs are higher than other combinations. For example, the combination of "door" + "ri" is higher than the cost of "bird" and "take", so "door" + "ri"
The combination is determined to be inappropriate and excluded from the phrase candidates (step S255). In FIG. 5, “●” is added to the upper right of a word that is determined not to be adopted as a bunsetsu candidate as a result of such a cost check, and “○” is added to the upper right of a word that remains as a word that may form a bunsetsu candidate. did.

【００５０】次に、こうして有効な語として残った単語
同士をリンクする処理を行なう（ステップＳ２６０）。
すなわち、前記の接続チェックの結果接続が有効とされ
た単語の組合わせについて、ポインタを設定することで
単語同士を結び付けるのである。図５では、「取り扱
い」については「が」にリンクし、この「が」について
は「代わる」「変わる」「替わる」にリンクする。以上
のような接続チェック、コスト計算、コストチェック、
単語候補リンクなどの処理を一つの解析位置で検索され
るすべての単語について終了すると、ステップＳ２１０
に戻って解析位置を一桁進めて再び辞書検索を行ない、
接続チェックやコスト計算など同様の処理を繰り返す。Next, a process of linking the remaining words as valid words is performed (step S260).
That is, with respect to a combination of words for which the connection is valid as a result of the connection check, the pointers are set to connect the words. In FIG. 5, “handling” is linked to “ga”, and “ga” is linked to “alternate”, “change”, and “alternate”. Connection check, cost calculation, cost check,
When processing such as word candidate link is completed for all the words searched at one analysis position, step S210
Return to and advance the analysis position by one digit, and search the dictionary again,
Repeat similar processing such as connection check and cost calculation.

【００５１】解析位置が入力された仮名文字列の最後の
桁に達し、入力した仮名文字列の全語について解析が終
了すると（ステップＳ２６５）、次に、以上の解析結果
を踏まえて最小コストパスの検索が行なわれる（ステッ
プＳ２７０）。このステップでは、これまでの処理の中
で有効な単語として残り互いにリンクされた組合わせの
中で、各単語に与えられたコストの合計が最小になる組
合わせを検索する。図５の文節分かち書き候補の中で
は、例えば「とりあつかいが」＋「かわる」と文節分か
ち書きした「取り扱いが」＋「代わる」という候補を検
索すると総コスト８となり、最小コストを持つ文節分か
ち書き候補として選択される。このように最小コストを
持つ文節分かち書き候補が作成されると（ステップＳ２
８０）、次に文節分かち書き候補の各文節内での候補が
作成される（ステップＳ２９０）。ここでは「かわる」
という単語に対して「代わる」「変わる」「替わる」な
どの同音異義語が候補として用意される。これら各文節
内での候補は、直前に使用した語を最優先で選択するな
ど所定の方法で優先順位が決められている。ここで、
「かわる」という語の中で「代わる」の優先順位が最も
高いならば、図５に例示した「とりあつかいがかわる」
という文字列を文節分かち書きすると、図中に点線Ｂで
示した「取り扱いが」＋「代わる」が第１候補として得
られることになる。When the analysis position reaches the last digit of the input kana character string and the analysis is completed for all the words of the input kana character string (step S265), next, based on the above analysis result, the minimum cost path is determined. Is searched (step S270). In this step, among the combinations that remain valid words in the processing so far and are linked to each other, the combination that minimizes the total cost given to each word is searched. Among the phrase segmentation candidates in FIG. 5, for example, when searching for a candidate “Tsuriagegai” + “Kawaru” that is “Handling” + “Alternative”, the total cost is 8 and is selected as the phrase segmentation candidate with the minimum cost. To be done. When a phrase segmentation candidate having the minimum cost is created in this way (step S2
80) Next, candidates for phrase segmentation writing candidates in each phrase are created (step S290). "Change" here
The homonyms such as “alternate”, “change”, and “alternate” are prepared as candidates for the word The priorities of the candidates in each of these clauses are determined by a predetermined method such as selecting the word used immediately before with the highest priority. here,
If the priority of “alternative” in the word “alter” is the highest, “alternate” is illustrated in FIG.
If the character string is written into the phrase, "Handling is" + "Alternative" shown by the dotted line B in the figure is obtained as the first candidate.

【００５２】以上の説明では、説明を簡略にするために
コスト計算は各単語自身のコストだけを対象としたが、
文法上接続し易いと判断される文節間のコストを下げる
ための文節間コストを設けたり、特定の品詞間の接続し
易さに着目した単語間コストを設けるなど、所望の変換
結果を得やすくするための様々な工夫を加えることが可
能である。In the above explanation, for simplification of the explanation, the cost calculation is intended only for the cost of each word itself.
It is easy to obtain the desired conversion result, for example, by providing an inter-clause cost to reduce the cost between phrases that are determined to be easy to connect grammatically, or by providing an inter-word cost that focuses on the ease of connecting between specific parts of speech. It is possible to add various ideas for doing so.

【００５３】上記の処理により作成された文節分かち書
き候補に対して、次に、後述する単語差し替え処理（ス
テップＳ１２０）を行ない、文節分かち書き処理を施し
た変換結果をＣＲＴディスプレイ２６に出力する表示処
理（ステップＳ１３０）を行なって、この仮名漢字変換
処理ルーチンを終了する。Next, a word replacement process (step S120), which will be described later, is performed on the phrase segment writing candidate created by the above process, and the conversion result after the segment segment writing process is output to the CRT display 26 (display process ( Step S130) is performed and the Kana-Kanji conversion processing routine is ended.

【００５４】次に、本実施例の要部である単語辞書の構
造を説明する。上述したように、図５に例示した「とり
あつかいがかわる」という仮名文字列を文節分かち書き
処理する場合、「取り扱いが代わる」が第１候補として
得られる。ここで、「とりあつかい」という単語は、
「取り扱い」「取扱い」「取扱」と三通りの表記が許容
されており、「かわる」に関しては例えば３つの同音異
義語「変わる」「代わる」「替わる」が、それぞれ「変
わる」「変る」、「代わる」「代る」、「替わる」「替
る」のように二通りの表記が許容されている。従って、
文節分かち書き処理を行なうときにはこれらすべての許
容された表記に関して接続チェックやコストチェックな
ど同様の処理を行なわねばならず、これらすべての表記
に関わる情報を自立語辞書５８や前述の係り受け辞書に
記憶しておく必要があることになる。これに対して本実
施例では、複数の表記が許容された語については、その
中の代表的な一つの表記だけを用いて文節分かち書き処
理が行なわれる。従って、図５に例を示した文節分かち
書き処理では、（）内に示した他の許容される表記のす
べてが辞書検索の際に単語辞書から読み込まれるわけで
はなく、後述する表示用表記への差し替えが必要な語に
ついてだけ、この表示用表記に関する情報が代表的な一
つの表記と共に読み込まれ、その後はこの代表的な一つ
の表記だけを対象として文節分かち書き処理が行なわれ
るのである。なお、この段階では、複数の表記が許容さ
れた語については、その中の代表的な一つの表記だけを
単語辞書から読み込むこととし、後述する表示用表記へ
の差し替えが必要な語については、差し替えを行なうと
きにその表示用表記に関する情報を所定の辞書から読み
込むこととしても良い。Next, the structure of the word dictionary which is the main part of this embodiment will be described. As described above, in the case where the kana character string "Takeshikaiwakaiwa" illustrated in FIG. Here, the word "toriokai" means
Three types of notation, "handling", "handling", and "handling" are allowed. For "changing", for example, the three homonyms "change", "alternate", and "alternate" are "changing", "changing", Two types of notation are allowed, such as “alternate”, “alternate”, “alternate”, and “alternate”. Therefore,
When performing the phrase segmentation processing, similar processing such as connection check and cost check must be performed for all these notations, and information related to all these notations is stored in the independent word dictionary 58 or the above-mentioned dependency dictionary. Will need to be kept. On the other hand, in the present embodiment, for words for which a plurality of notations are allowed, the phrase segmentation processing is performed using only one of the representative notations. Therefore, in the phrase segmentation processing shown in the example in FIG. 5, not all of the other permissible notations shown in () are read from the word dictionary at the time of dictionary retrieval, and Information about this display notation is read together with one representative notation only for words that need to be replaced, and then the phrase segmentation processing is performed only for this one representative notation. At this stage, for words for which multiple notations are allowed, only one of the representative notations will be read from the word dictionary, and for words that need to be replaced with the notations for display described below, When the replacement is performed, the information about the display notation may be read from a predetermined dictionary.

【００５５】以下、自立語辞書５８の構成について説明
する。単語辞書５８は、図６に示すように、ヘッダとイ
ンデックスと辞書本体からなる。ヘッダは、辞書自体を
管理するための情報である。インデックスおよび辞書本
体は、代表表記領域と派生表記領域と意味用例領域とに
分けて管理されている。代表表記とは、一つの単語が複
数の表記を許容されている場合、例えば「取り扱い」に
対して「取扱い」や「取扱」などが表記として許されて
いる場合、「取り扱い」をこれらの表記を代表するもの
とする、というように予め定めた単語表記である。派生
表記とは、「取扱い」「取扱」の様に、一つの単語に対
して許容されたその他の表記のことである。文節分かち
書き処理を行なう際には、このように派生表記を有する
語については、派生表記を代表する代表表記を用いて文
節分かち書き処理が行なわれる。意味用例についての領
域は、後述する係り受けに関する情報が管理されている
領域である。The structure of the independent word dictionary 58 will be described below. As shown in FIG. 6, the word dictionary 58 includes a header, an index, and a dictionary body. The header is information for managing the dictionary itself. The index and the dictionary body are separately managed in a representative writing area, a derived writing area, and a meaning example area. The representative notation means that if one word is allowed to be expressed in multiple ways, for example, "handling" or "handling" is allowed as notation for "handling", then "handling" is used as the notation. Is a representative word notation. Derived notations are other notations that are allowed for one word, such as “handle” and “handle”. When the phrase segmentation process is performed, the phrase segmentation process is performed for a word having such a derivative notation using a representative notation that is representative of the derivative notation. The area for the meaning example is an area in which the information on the dependency described later is managed.

【００５６】これらの代表表記、派生表記、意味用例
は、辞書本体においては、Ｂ−Ｔｒｅｅ構造により管理
されている。Ｂ−Ｔｒｅｅ構造は、多数のデータを外部
記憶に置く場合に採用される周知の管理構造であり、本
実施例のＢ−Ｔｒｅｅ構造では、分木の節にはデータを
置かず、データはすべて分木の終端に置く構造を取って
いる。多分木の構造を取っているので、２分木と比べて
木構造の深さが浅くなり、外部記憶を用いる場合には、
アクセス時間が短いという特徴を有するものとして知ら
れている（Ｂ＋Ｔｒｅｅと呼ぶこともある）。辞書本体
におけるＢ−Ｔｒｅｅ構造の一例を図７に示した。単語
の読み（仮名文字列）に基づいてＢ−Ｔｒｅｅ制御ブロ
ックを辿って単語ブロックに至ると、ここに実際の単語
データがおかれている。なお、このタイプのＢ−Ｔｒｅ
ｅ構造では、当初、キーは、その節に置くことができる
キーの最大数より少ない数だけ置かれ、データの追加な
どに柔軟に対応できるものとしている。The representative notation, derived notation, and meaning examples are managed by the B-Tree structure in the dictionary body. The B-Tree structure is a well-known management structure adopted when a large amount of data is stored in the external storage. In the B-Tree structure of the present embodiment, no data is stored in the branch tree node and all the data is stored. The structure is placed at the end of the branch tree. Since the tree structure is probably a tree, the depth of the tree structure is smaller than that of a binary tree, and when using external memory,
It is known to have a characteristic of short access time (sometimes called B + Tree). An example of the B-Tree structure in the dictionary body is shown in FIG. When the B-Tree control block is followed to reach the word block based on the reading of the word (kana character string), the actual word data is placed here. In addition, this type of B-Tre
In the e structure, initially, the number of keys is set to be smaller than the maximum number of keys that can be placed in the section, and it is possible to flexibly deal with addition of data.

【００５７】以下に、単語の読みに基づいてＢ−Ｔｒｅ
ｅ制御ブロックを辿る様子を図７に即して説明する。例
えば、「かわき」という仮名文字列が入力され、自立語
辞書５８における代表表記領域で検索が開始されるとき
を示す。Ｂ−Ｔｒｅｅの根元にあたる１段目には、それ
ぞれ１つの文字を持つキーが５０音順に並んでいる。
「かわき」から次段のどの分木に検索を進めるかは、
「かわき」という文字列を、１段目の節Ｓ１の各キーと
比較することにより知ることができる。この例では、
「かわき」は少なくともキー「か」とキー「さ」との間
に存在するから、２段目の節Ｓ２が探索の対象となる。
次にこの節Ｓ２において、同様の判断を行ない、「かわ
き」という読みは５０音順でキー「かる」より後、キー
「くり」より前であると判断し、キー「かる」以降でキ
ー「くり」間での範囲に対応する３段目の節Ｓ３が探索
される。この３段目の４個のキーの中で「かわき」を検
索すると、５０音順でキー「かわ」より後、キー「き
じ」より前であるので、この範囲に対応する単語ブロッ
クが探索される。この範囲に対応する単語ブロックの中
に「かわき」という読みを持つ語に関する実際の単語デ
ータが格納されているのである。Below, based on the reading of words, B-Tre
The way in which the e control block is traced will be described with reference to FIG. For example, it shows a case where a kana character string “Kawaki” is input and the search is started in the representative writing area in the independent word dictionary 58. In the first row, which is the root of B-Tree, keys each having one character are arranged in the order of Japanese syllabary.
Which branch tree to proceed from "Kawaki" to
It can be known by comparing the character string "kawaki" with each key of the section S1 in the first row. In this example,
Since "kawaki" exists at least between the key "ka" and the key "sa", the node S2 in the second row is a search target.
Next, in this section S2, the same judgment is made, and it is judged that the reading of "Kawaki" is in the order of the Japanese syllabary after the key "Karu" and before the key "Kuri", and after the key "Karu" The node S3 in the third row corresponding to the range between "" is searched. When searching for "Kawaki" in the four keys in the third row, the word block corresponding to this range is searched because it is after the key "Kawa" and before the key "Kiji" in the order of the Japanese syllabary. It In the word block corresponding to this range, the actual word data related to the word having the pronunciation "kawaki" is stored.

【００５８】代表表記領域などの単語データは、大まか
には、図８に示すデータ構造を有している。即ち、前記
単語ブロックを構成するレコードのレコード長Ｘを示す
データが先頭に存在し、その後、Ｘバイトの実データが
続いている。実データの先頭には、見出し語の長さＹが
記録されており、続いてＹバイトの見出し語が記録され
ている。実際の単語データは、その後に続いている。各
単語データは、その先頭に単語データ長Ｗが記録されて
おり、その直後に漢字データの有無などを示す１バイト
のフラグが記録されている。フラグの後には、その単語
データに漢字データが存在する場合には、漢字データが
記録されている。この漢字データは、漢字データ長と実
際の漢字文字列を示す漢字コードから構成される。その
後には、単語情報および品詞情報（場合によっては複数
の品詞情報）が記録されている。単語情報は、単語情報
の長さを示すデータと、実際の単語情報とからなる。The word data such as the representative writing area generally has a data structure shown in FIG. That is, the data indicating the record length X of the record that constitutes the word block is present at the head, and then X bytes of actual data follow. At the beginning of the actual data, the length Y of the headword is recorded, and subsequently, the headword of Y bytes is recorded. The actual word data follows. The word data length W is recorded at the beginning of each word data, and immediately after that, a 1-byte flag indicating the presence or absence of kanji data is recorded. After the flag, if there is Kanji data in the word data, the Kanji data is recorded. This Kanji data is composed of a Kanji data length and a Kanji code indicating an actual Kanji character string. After that, word information and part-of-speech information (in some cases, a plurality of pieces of part-of-speech information) are recorded. The word information consists of data indicating the length of the word information and the actual word information.

【００５９】このように、代表表記でも派生表記でも、
Ｂ−Ｔｒｅｅ構造を用いて、単語の見出し文字列に基づ
いて、所望の単語に関する情報を取り出すことができ
る。これらの単語情報は、更に図９に示すように、セパ
レータとこれに続くデータとから構成されている。セパ
レータとしては、それ以後に続くデータが表示される漢
字のデータであることを示す表示漢字セパレータや、デ
ータが読み情報であることを示す読み情報セパレータ、
派生表記であることを示す派生表記セパレータなどがあ
る。表示漢字とは、一つの単語に代表表記と派生表記と
がある場合に、デフォルトで漢字を表示するために、代
表表記に対応する漢字での表記を記録しているものであ
る。派生表記には、図１０に示すような種々の形態が存
在するが、各単語についてはそれぞれ許容された表記自
体が派生表記情報として記憶されている。派生表記セパ
レータは、派生表記領域にのみ存在する情報であり、そ
の下位３ビットは、各派生表記セパレータの後に記憶さ
れている派生表記が、図１０に示す派生表記の形態のう
ちの何種類分に対応するのかを示している。すなわち、
図１０に示す「売り上げ」という派生表記の場合は、派
生表記情報の１である「本則」と２である「送る」にあ
たるため、その派生表記セパレータの下位３ビットで
は、派生表記形態の２種類に対応することが示されてい
る。また、読み情報は、単語の読みを与えるものであ
り、見出し語が漢字である場合などにその読みを与える
ものである。この情報は、漢字からその読みを得て、読
みを同じくする単語を検索する場合などに用いられる。
例えば、キーボードからの文字入力の他に手書きなどで
入力し、仮名と共に漢字を含む文字列が入力されたと
き、これを正しく変換する際に用いられる（例「ひ行
き」→「飛行機」）。なお、一つの代表表記とこの代表
表記に対応する派生表記とは、別々の領域で管理されて
いるが、この実施例では、単語の読み（見出し）と単語
の品詞情報とが一致するものについて、対応関係がある
とみなしている。In this way, whether in the representative notation or the derivative notation,
The B-Tree structure can be used to retrieve information about a desired word based on the word heading string. As shown in FIG. 9, these pieces of word information are each composed of a separator and data following the separator. As the separator, a display Kanji separator that indicates that the data that follows is Kanji data, or a reading information separator that indicates that the data is reading information,
There is a derivative notation separator that indicates that it is a derivative notation. The display kanji is a record of the kanji corresponding to the representative notation in order to display the kanji by default when one word has a representative notation and a derivative notation. Although there are various forms of derivative notation as shown in FIG. 10, the notation itself permitted for each word is stored as derivative notation information. The derivative notation separator is information that exists only in the derivative notation area, and the lower 3 bits of the derivative notation stored after each derivative notation separator are the number of types of the derivative notation shown in FIG. It corresponds to. That is,
In the case of the derivative notation “sales” shown in FIG. 10, since it is “main rule” which is 1 of the derivative notation information and “send” which is 2, the lower 3 bits of the derivative notation separator have two types of derivative notation form. Has been shown to correspond. Further, the reading information gives the reading of a word, and gives the reading when the headword is a kanji. This information is used, for example, when the reading is obtained from the kanji and a word having the same reading is searched.
For example, in addition to inputting characters from a keyboard, when inputting by handwriting or the like, and when a character string including kanji is input together with a kana, it is used for correctly converting this (eg, “Higaku” → “airplane”). It should be noted that one representative notation and the derivative notation corresponding to this representative notation are managed in different areas. However, in this embodiment, the one in which the reading (heading) of a word and the word part of speech information match , I think there is a correspondence.

【００６０】本実施例では、前述の図３における単語検
索処理（ステップＳ１００）で、上記説明した自立語辞
書５８および付属語辞書６８を検索するが、以下にこの
時の動作について詳述する。図１１は、この単語検索処
理（図３のステップＳ１００）の詳細を示すフローチャ
ートである。この単語検索処理ルーチンが起動される
と、まず、単語検索の開始位置Ｍを値１、即ち入力した
仮名文字列の先頭とする処理を行なう（ステップＳ３０
２）。次に、単語検索における読みの長さを示す変数Ｌ
を値１に初期化する処理を行ない（ステップＳ３０
４）、単語開始位置から読み長Ｌの語を自立語辞書５
８、付属語辞書６８から検索する処理を行なう（ステッ
プＳ３０６）。すなわち、入力した文字列の最初の１文
字を読みに持つ単語の検索を行なうのである。その後、
開始位置Ｍから長さＬの読みの仮名文字列に合致する単
語が見つかったか否かを判定する（ステップＳ３０
８）。該当する自立語が見つかった場合には、次に、そ
の代表表記に付属するデータのうち派生表記に関するも
のを読み込む処理を行なう（ステップＳ３１０）。In this embodiment, the word search process (step S100) in FIG. 3 described above searches the independent word dictionary 58 and the adjunct word dictionary 68 described above. The operation at this time will be described in detail below. FIG. 11 is a flowchart showing details of this word search process (step S100 in FIG. 3). When this word search processing routine is activated, first, processing is performed in which the word search start position M is set to the value 1, that is, the beginning of the input kana character string (step S30).
2). Next, a variable L indicating the reading length in the word search
Is initialized to the value 1 (step S30
4), the word with the reading length L from the word start position is the independent word dictionary 5
8. A process of searching the auxiliary word dictionary 68 is performed (step S306). That is, the word having the first one character of the input character string as a reading is searched. afterwards,
It is determined whether or not a word matching the reading kana character string of length L from the start position M is found (step S30).
8). When the corresponding independent word is found, next, a process of reading the data related to the derived notation out of the data attached to the representative notation is performed (step S310).

【００６１】代表表記に対して種々の派生表記が存在
し、図６に示した派生表記領域に派生表記が記憶されて
いる場合には、前述した代表表記領域の単語情報の中に
派生表記への差し替えの必要・不要を示すフラグが保存
されている。そこで、かかるフラグを参照し、代表表記
を派生表記に差し替えるよう指示がなされている単語で
あるか否かを判断する（ステップＳ３１２）。このフラ
グに、派生表記への差し替えの必要を示す値が設定され
ている場合には、先に検索した代表表記を展開バッファ
に展開すると共に、その単語に第１マークＭＡを付与す
る（ステップＳ３１４）。展開バッファとは、入力した
仮名文字列に対して、この仮名文字列を構成し得るすべ
ての代表表記および付属語を展開するための記憶領域で
あり、ＲＡＭ２３上に確保されたメモリ領域である。ま
た、このとき、代表表記を派生表記に差し替えるよう指
示がなされている場合には、代表表記領域の単語情報の
中に、差し替えるべき派生表記も記憶されており、この
差し替えるべき派生表記も代表表記と共に展開バッファ
に展開される。When there are various derivative notations with respect to the representative notation and the derivative notations are stored in the derivative notation area shown in FIG. 6, the derivative notation is included in the word information of the representative notation area described above. A flag indicating whether or not replacement of is required is stored. Therefore, with reference to this flag, it is determined whether or not the word is instructed to replace the representative notation with the derivative notation (step S312). If a value indicating the necessity of replacement with the derivative notation is set in this flag, the representative notation retrieved previously is expanded in the expansion buffer, and the first mark MA is added to the word (step S314). ). The expansion buffer is a storage area for expanding all the representative notations and adjuncts that can form this kana character string with respect to the input kana character string, and is a memory area secured on the RAM 23. Further, at this time, if an instruction is given to replace the representative notation with the derivative notation, the derivative notation to be replaced is also stored in the word information in the representative notation area, and the derivative notation to be replaced is also the representative notation. Is expanded in the expansion buffer together with.

【００６２】派生表記への差し替えの指示がなされてい
ない場合、もしくは派生表記への差し替えが指示されて
いて代表表記にマークを付与した後、処理は、ステップ
Ｓ３０６に戻って、読み長Ｌの単語を更に検索する処理
から繰り返す。読み長Ｌの語がもはや自立語辞書５８お
よび付属語辞書６８に存在しないと判断された場合には
（ステップＳ３０８）、検索単語の長さを伸ばすことが
できるか否かを判断する（ステップＳ３１６）。入力さ
れた仮名文字列の全長Ａに対してＭ＋Ｌ＜Ａならば、読
みの長さＬを大きくすることができると判断し、読みを
一文字分長くする処理（即ち、Ｌを値１だけインクリメ
ントする処理）を行なう（ステップＳ３２０）。読みの
長さＬを値１だけ増加した後、ステップＳ３０６から上
述した処理を繰り返す。なお、辞書検索のときに、長さ
がＬ以上の読みを持つ単語が記憶されているかどうかを
判断可能な構成とし、長さがＬ以上の読みを持つ単語が
存在しない場合には、単語の読みを伸ばして辞書を検索
する以降の処理を中止することとしても良い。When the instruction to replace the derivative notation has not been given, or after the instruction to replace the derivative notation has been given and a mark has been added to the representative notation, the process returns to step S306 and the word of the reading length L is returned. Is repeated from the process of further searching. When it is determined that the word having the reading length L no longer exists in the independent word dictionary 58 and the auxiliary word dictionary 68 (step S308), it is determined whether or not the length of the search word can be extended (step S316). ). If M + L <A with respect to the total length A of the input kana character string, it is determined that the reading length L can be increased, and the reading is lengthened by one character (that is, L is incremented by 1). Processing) is performed (step S320). After increasing the reading length L by the value 1, the above-described processing is repeated from step S306. In the dictionary search, it is possible to determine whether or not a word having a reading with a length of L or more is stored. If there is no word with a reading having a length of L or more, the word It is also possible to cancel the processing after extending the reading and searching the dictionary.

【００６３】この結果、展開バッファには、開始位置Ｍ
における長さ１から最大長さまでの読みの全単語が展開
される。読みの長さを順次長くしていって、単語長が伸
ばせなくなると（ステップＳ３１６）、その開始位置Ｍ
を先頭とする単語の検索はすべて終わったと判断し、次
に単語検索の開始位置Ｍを、入力された仮名文字列の末
尾に向かって移動可能か否かを判断する（ステップＳ３
１８）。移動可能であれば、開始位置Ｍを値１だけイン
クリメントする処理を行なった後（ステップＳ３２
２）、読みの長さＬを値１に戻して、上述した処理を繰
り返す。従って、これらの処理が行なわれると、展開バ
ッファには、入力された仮名文字列を構成する可能性の
あるすべての代表表記および付属語が展開されており、
派生表記への差し替えが必要な代表表記については、こ
れに第１マークＭＡが付与され、差し替えるべき派生表
記も共に展開された状態となる。As a result, the start position M is stored in the expansion buffer.
All reading words from length 1 to the maximum length in are expanded. When the reading length is gradually increased and the word length cannot be extended (step S316), the start position M
It is determined that the search for the word beginning with is completed, and then it is determined whether the start position M of the word search can be moved toward the end of the input kana character string (step S3).
18). If the position is movable, the process of incrementing the start position M by 1 is performed (step S32).
2) Return the reading length L to the value 1, and repeat the above-mentioned processing. Therefore, when these processes are performed, the expansion buffer has expanded all the representative notations and attached words that may form the input kana character string,
For the representative notation that needs to be replaced with the derivative notation, the first mark MA is added to the representative notation, and the derivative notation to be replaced is also expanded.

【００６４】以上の処理により単語検索処理（図３ステ
ップＳ１００）が完了する。引き続き、前述した分かち
書き処理（ステップＳ１１０）が行なわれるが、これら
の処理は、展開バッファに展開した上記代表表記を用い
て行なわれるものであり、実際には図４および図５を用
いて説明したように、単語検索処理（ステップＳ１０
０）と分かち書き処理（ステップＳ１１０）の一部は並
行して行なわれている。ただし、ここでは、単語検索処
理（ステップＳ１００）の結果、図５で（）内に示した
派生表記のすべてが展開バッファに展開されているわけ
ではなく、派生表記への差し替えが必要な代表表記に関
しては、第１マークＭＡが付与され、差し替えるべき派
生表記に関する情報が代表表記と共に展開されている。With the above processing, the word search processing (step S100 in FIG. 3) is completed. Subsequently, the above-mentioned segmentation processing (step S110) is performed, but these processings are performed using the above-described representative notation expanded in the expansion buffer, and actually explained using FIG. 4 and FIG. As described above, the word search process (step S10
0) and part of the segmentation process (step S110) are performed in parallel. However, here, as a result of the word search process (step S100), not all of the derived notations shown in () in FIG. 5 are expanded in the expansion buffer, and the representative notation that needs to be replaced with the derived notation. With respect to, the first mark MA is added, and the information on the derivative notation to be replaced is developed together with the representative notation.

【００６５】分かち書き処理（ステップＳ１１０）が終
了すると、次に第１候補とされた単語について単語差し
替え処理（ステップＳ１３０）を行なう。この単語の差
し替え処理は、第１候補とされた単語について派生表記
への差し替えが必要か否かを判断し、必要な場合には、
派生表記の中で、差し替えるべき表記として用意され、
代表表記と共に展開バッファに展開されている表記に差
し替えるものである。ここでは、後述するように、直前
に使用された表記が差し替えるべき派生表記として学習
されている場合に行なわれる処理について説明する。例
えば、図５の「取り扱いが」＋「代わる」が第１候補と
なった例で、直前に使用された語がそれぞれ「取り扱
い」と「代る」であれば、「取り扱い」はそのまま用い
られるが、「代わる」に関しては、単語検索、分かち書
き処理で一貫して用いてきた代表表記に代えて、派生表
記である「代る」に差し替えるのである。When the word segmentation process (step S110) is completed, the word replacement process (step S130) is performed on the first candidate word. In this word replacement process, it is judged whether or not the word selected as the first candidate needs to be replaced with the derivative notation.
It is prepared as a notation that should be replaced in the derivative notation,
It replaces the representative notation with the notation expanded in the expansion buffer. Here, as will be described later, the processing performed when the notation used immediately before is learned as a derivative notation to be replaced will be described. For example, “Handling” + “Alternate” in FIG. 5 is the first candidate, and if the words used immediately before are “Handling” and “Alternate”, respectively, “Handling” is used as it is. However, as for “alternative”, the representative notation that has been consistently used in the word search and the segmentation processing is replaced with the derivative notation “alternative”.

【００６６】単語差し替え処理を行なう処理ルーチンを
図１２に示す。ここで実行される処理について、上記
「取り扱いが」＋「代わる」の例に即して説明する。こ
の処理ルーチンでは、まず、文節分かち書き処理により
分かち書きされた最初の文節の単語を対象単語として単
語差し替え処理を開始する（ステップＳ４００）。「取
り扱いが」＋「代わる」と文節分かち書きされていると
き、先頭単語「取り扱い」が対象単語となる。次にこの
対象単語に第１マークＭＡがあるかどうかを判断する
（ステップＳ４１０）。第１マークＭＡが付いていれ
ば、派生表記が存在しかつ派生表記への差し替えが指示
されていると判断できるから、第１候補を表示用表記で
ある派生表記に差し替える処理を行なう（ステップＳ４
２０）。ステップＳ４１０で対象単語に第１マークＭＡ
が付いていなければ、表示用表記の差し替えは行なわな
い。すなわち、表示用表記とは、代表表記が以前に使用
されていれば代表表記そのものであり、以前に派生表記
が使用され学習されていればその派生表記となるのであ
る。対象単語「取り扱い」において、直前に使用された
表記が「取り扱い」であるので、代表表記の付属データ
に保存されているフラグは派生表記への差し替えを指示
しておらず、図１１に示した単語検索処理ルーチンのス
テップＳ３１２で差し替えが必要な語でないと判断され
る。従って代表表記「取り扱い」には第１マークＭＡは
付いておらず、代表表記はそのまま表示用表記となる。FIG. 12 shows a processing routine for performing word replacement processing. The processing executed here will be described with reference to the above example of “handling” + “alternate”. In this processing routine, first, the word replacement process is started with the word of the first phrase segmented by the segment segmentation process as the target word (step S400). When the phrase is “handling” + “alternate”, the first word “handling” is the target word. Next, it is determined whether the target word has the first mark MA (step S410). If the first mark MA is attached, it can be determined that there is a derivative notation and replacement to the derivative notation is instructed, so processing for replacing the first candidate with the derivative notation that is the display notation is performed (step S4).
20). In step S410, the first mark MA is added to the target word.
If is not attached, the display notation is not replaced. That is, the display notation is the representative notation itself if the representative notation has been used before, and the derivative notation if the derivative notation has been used and learned before. In the target word “handling”, since the notation used immediately before is “handling”, the flag stored in the ancillary data of the representative notation does not instruct replacement to the derived notation and is shown in FIG. In step S312 of the word search processing routine, it is determined that the word does not need to be replaced. Therefore, the representative mark “handling” does not have the first mark MA, and the representative mark is a display mark as it is.

【００６７】その後、未処理の単語があるかどうかを調
べ（ステップＳ４３０）、未処理単語があれば対象単語
を一つ後ろにずらす（ステップＳ４４０）。「取り扱い
が」＋「代わる」の例では次の対象単語は「が」である
が、これは助詞であり、付属語辞書６８に登録された派
生表記を持たない語であるため差し替えは行なわれな
い。再びステップＳ４３０に戻り対象単語を１つ後ろに
ずらすと（ステップＳ４４０）、次の対象単語は「代わ
る」である。「代わる」については、直前に「代る」が
選択されているため代表表記を格納した単語情報に含ま
れるデータに保存されたフラグは派生表記への差し替え
を指示しており、展開バッファに展開された代表表記
「代わる」には第１マークＭＡが付与されており（ステ
ップＳ４１０）、差し替えるべき派生表記も共に展開さ
れている。そこで、代表表記「代わる」が派生表記「代
る」に差し替えられる（ステップＳ４２０）。以上の処
理を終えるともはや未処理単語は残っていないため（ス
テップＳ４３０）単語差し替え処理を終了する。その
後、前述のように図３ステップＳ１３０の表示処理にお
いて、ＣＲＴディスプレイ２６に変換結果「取り扱いが
代る」を表示して、仮名漢字変換処理は終了となる。Then, it is checked whether or not there is an unprocessed word (step S430), and if there is an unprocessed word, the target word is shifted backward by one (step S440). In the example of "handling" + "alternate", the next target word is "ga", but this is a particle and is a word that does not have a derivative notation registered in the adjunct dictionary 68, so replacement is performed. Absent. When the process returns to step S430 and the target word is moved back by one (step S440), the next target word is "alternate". As for “Alternate”, since “Alternate” is selected immediately before, the flag saved in the data included in the word information that stores the representative notation indicates replacement with the derivative notation and is expanded in the expansion buffer. The first mark MA is given to the representative representation "replace" thus generated (step S410), and the derivative representation to be replaced is also developed. Therefore, the representative notation "alternate" is replaced with the derivative notation "alternate" (step S420). When the above processing is completed, there are no unprocessed words left (step S430), and the word replacement processing is ended. After that, as described above, in the display process of step S130 in FIG. 3, the conversion result "handling is changed" is displayed on the CRT display 26, and the kana-kanji conversion process is ended.

【００６８】以上説明した実施例によれば、文節分かち
書き処理を行なうとき、代表表記だけを対象として処理
するため、文節分かち書き処理に用いる辞書は代表表記
に関する情報だけを記憶しておけば良く、辞書のサイズ
を縮小できるという利点を有する。さらに、代表表記だ
けを用いて文節分かち書き処理を行なうことにより、文
節分かち書き処理を簡素化し、文節分かち書き処理の高
速化を図ることができる。「とりあつかいがかわる」の
例で、本実施例の仮名漢字変換装置を用いなければ、図
５で（）内に示した派生表記もすべて単語辞書から読み
込んで、文法上の妥当性を調べる接続チェックやコスト
計算などの処理を行なわなければならないが、本実施例
の仮名漢字変換装置では、代表表記以外の（）内の派生
表記は文節節分かち書き処理の対象とはならない。従っ
て、代表表記以外の派生表記の数だけ文節分かち書き処
理の対象となる単語の数が減り、処理全体が簡素化さ
れ、結果的により高速で文節分かち書き処理を行なうこ
とが可能となる。さらに、文節分かち書き処理の対象と
なる単語の数が減ることから、展開バッファとして利用
するメモリが小さくて済むという効果をも奏するのであ
る。According to the embodiment described above, when performing the segment segmentation writing process, only the representative notation is processed. Therefore, the dictionary used for the segment segment segmentation processing only needs to store information on the representative segmentation. Has the advantage that the size of can be reduced. Further, by performing the phrase segmentation writing process using only the representative notation, the segment segmentation segmentation process can be simplified and the segment segment segmentation writing process can be speeded up. In the example of "Take care change", if the kana-kanji conversion device of this embodiment is not used, all the derived notations shown in () in FIG. However, in the kana-kanji conversion device according to the present embodiment, the derivative notations in parentheses other than the representative notations are not the target of the phrase segmentation processing. Therefore, the number of words to be subjected to the phrase segmentation processing is reduced by the number of derived representations other than the representative notation, the entire process is simplified, and as a result, the segment segmentation processing can be performed at higher speed. Further, since the number of words to be subjected to the phrase segmentation processing is reduced, the memory used as the expansion buffer can be made small.

【００６９】また、上記の例では、代表表記「取り扱
い」と「代わる」に対する表示用表記は、直前に使用さ
れた表記が学習されて、それぞれ「取り扱い」「代る」
が予め設定されていたが、派生表記が存在する場合に表
示用表記としてどの表記を用いるかの学習は、ある表記
を選択した時点で、代表表記領域の単語情報内にその表
記を書き込み、更に、派生表記領域における各派生表記
の単語データの並び順を、選択した表記を先頭に移し、
残りの派生表記の順序を順次後ろにずらすことで実現さ
れる。次回より変換結果を表示するときには、代表表記
領域の単語情報内に記憶された表記を第１候補とし、残
りの候補の表示は、派生表記領域に記憶された単語デー
タの並び順に従って表示される。このとき、表示用表記
として代表表記を使用する場合には、代表表記領域の単
語情報内に、代表表記自体（もしくは代表表記に相当す
る派生表記情報）を記憶しておいても良いし、本実施例
のように、代表表記を使用することを示すフラグを記憶
して、文節分かち書き処理の後に表記の差し替えを行な
わないこととしても良い。In the above example, the representative notations "handling" and "alternative" are represented by "learning" and "alternating" after learning the notation used immediately before.
Was set in advance, but learning which notation to use as a display notation when there is a derivative notation is to write that notation in the word information of the representative notation area when a certain notation is selected, and , Move the order of the word data of each derivative notation in the derivative notation area to the selected notation at the top,
It is realized by shifting the order of the remaining derivative notations one after the other. When the conversion result is displayed next time, the notation stored in the word information of the representative notation area is set as the first candidate, and the remaining candidates are displayed according to the arrangement order of the word data stored in the derived notation area. . At this time, if the representative notation is used as the display notation, the representative notation itself (or derivative notation information corresponding to the representative notation) may be stored in the word information of the representative notation area. As in the embodiment, the flag indicating that the representative notation is used may be stored so that the notation is not replaced after the phrase segmentation processing.

【００７０】上記実施例では、表示用表記は、予め代表
表記領域の単語情報内に記憶しておくこととし、単語検
索処理（ステップＳ１００）のときには、代表表記と共
にこの表示用表記もまた展開バッファに展開し、単語差
し替え処理（ステップＳ１２０）において、代表表記に
付与された差し替えを指示するマークに従って、代表表
記をこの表示用表記に差し替えることとした。従って、
単語差し替え処理において、改めて辞書を参照する必要
がないという効果を奏する。しかしながら、単語検索処
理（ステップＳ１００）のときには展開バッファには表
示用表記を展開せず、代表表記に差し替えを指示するマ
ークを付与するだけとして、単語差し替え処理（ステッ
プＳ１２０）において、付与されたマークの指示に従っ
て、派生表記領域を参照して表示用表記を読み込み、代
表表記との差し替えを行なうこととしても良い。この場
合には、単語差し替え処理（ステップＳ１２０）で改め
て派生表記領域を参照する必要があるが、展開バッファ
に表示用表記を展開したり、代表表記領域に表示用表記
を記憶しておく必要がない。この時表示用表記の学習
は、使用した派生表記の単語データを派生表記の単語デ
ータの先頭に移し、先頭の派生表記を表示用表記として
差し替えることとしても良いし、使用した派生表記を単
語差し替え処理（ステップＳ１２０）時に参照可能であ
るように、派生表記領域内の単語情報に記憶することと
しても良い。In the above embodiment, the display notation is stored in advance in the word information of the representative notation area, and during the word search process (step S100), this display notation is also stored in the expansion buffer. Then, in the word replacement process (step S120), the representative notation is replaced with this display notation according to the mark given to the representative notation indicating the replacement. Therefore,
In the word replacement process, there is an effect that it is not necessary to refer to the dictionary again. However, in the word search process (step S100), the display notation is not expanded in the expansion buffer, and only the mark indicating the replacement is added to the representative notation, and the mark added in the word replacement process (step S120) In accordance with the instructions, the display notation may be read by referring to the derivative notation area and replaced with the representative notation. In this case, it is necessary to refer to the derived notation area again in the word replacement process (step S120), but it is necessary to expand the display notation in the expansion buffer or store the display notation in the representative notation area. Absent. At this time, the learning of the notation for display may be performed by moving the word data of the derived notation used to the beginning of the word data of the derived notation and replacing the derived notation at the top as the notation for display. It may be stored in the word information in the derived writing area so that it can be referred to during the processing (step S120).

【００７１】このような表示用表記の学習機能は、上記
のように直前に選んだ表記に書き換わることとしても良
いし、過去に選択した表記の中から最も頻度の高いもの
に書き換わることとしても良く、使用者が好んで選択す
る表記に変更可能であれば良い。その結果、変換後の分
かち書き文字列では、所望の表記がより得易くなるので
ある。Such a learning function of the display notation may be performed by rewriting the notation selected immediately before as described above, or by rewriting the notation most frequently selected from the notations selected in the past. It is also possible to change the notation that the user prefers and select. As a result, it becomes easier to obtain the desired notation in the separated character string after conversion.

【００７２】以上、ある表記を選択したときの表示用表
記の学習機能について説明したが、ある表記を選択した
ときに行なわれる学習は、同じ代表表記に対応する派生
表記内で行なわれるものだけではない。すなわち、ある
表記を選択したときには、この表記に対応する代表表記
の同音異義語の中での優先順位が上がり、自立語辞書５
８における代表表記領域内でも学習が行なわれるのであ
る。これにより、同音異義語の中でも、所望の単語が得
易くなる。The learning function of the display notation when a certain notation is selected has been described above, but the learning performed when a certain notation is selected is not limited to that performed in the derived notation corresponding to the same representative notation. Absent. That is, when a certain notation is selected, the priority of the representative notation corresponding to this notation in the homonyms increases and the independent word dictionary 5
Learning is also performed in the representative notation area of 8. This makes it easier to obtain a desired word among the homonyms.

【００７３】なお、文節分かち書きの処理において、用
例辞書を参照した係り受けの判定を行ない、係り受けが
成立する単語を優先しても良い。同音異義語に関して
は、直前に使用した単語を最優先で使用するなど所定の
方法により優先順位が付けられており、上記の説明で
「とりあつかいがかわる」を文節分かち書きしたときに
は、「かわる」という読みに対応した同音異義語の中で
「代わる」の優先順位が最も高かったため「取り扱い
が」＋「代わる」と分かち書きされた。このとき、文節
分かち書き処理の中で係り受けの判定も行ない、用例辞
書の中に「取り扱い」と「変わる」の係り受け関係が予
め記憶されているならば、同音異義語の優先順位に関わ
らず、図５の実線Ｇで示した「取り扱いが」＋「変わ
る」を第１候補として得ることが可能となるのである。
ここで用例辞書は、図６に示したように、自立語辞書５
８と一体として、その意味用例の領域に記憶しておくこ
ともできる。In the phrase segmentation writing process, the dependency judgment may be performed by referring to the example dictionary, and the words for which the dependency is established may be prioritized. Regarding homophones, they are prioritized according to a predetermined method, such as using the most recently used word as the highest priority. Among the homonyms corresponding to, the priority of "alternate" was the highest, so it was divided into "handling" + "alternate". At this time, the dependency judgment is also performed in the phrase segmentation process, and if the dependency relations of “handling” and “change” are stored in advance in the example dictionary, regardless of the priority order of the homonyms. , "Handling" + "Change" shown by the solid line G in FIG. 5 can be obtained as the first candidate.
Here, the example dictionary is, as shown in FIG.
It is also possible to store it in the area of the meaning example together with 8.

【００７４】さらに、このような係り受け判定を行なう
ならば、表示用表記としてどの表記を選択するかの学習
の効果は同じ代表表記に関する他の係り受け情報につい
ても及ぶことになる。すなわち、代表表記「変わる」の
表示用表記が「変る」であれば、次に「きそくがかわ
る」を文節分かち書き処理すると、「規則」と代表表記
「変わる」の係り受け情報を参照して文節分かち書きさ
れ、第１候補は「規則が変わる」ではなく「規則が変
る」が得られるのである。Further, if such dependency judgment is performed, the effect of learning which notation is selected as the display notation extends to other dependency information related to the same representative notation. That is, if the display notation of the representative notation “change” is “changed”, then the “separate change” is processed into phrase segmentation, and the dependency information of the “rule” and the representative notation “change” is referred to. The sentences are segmented and the first candidate is "rules change" instead of "rules change".

【００７５】また、単語間に新たな係り受け関係が認め
られる場合にはこの係り受け関係を習得し、意味用例領
域の品詞データ内に記憶することとしても良い。意味用
例もまた、前述の派生表記と同様に、代表表記と関連づ
けて記憶されており、代表表記を用いた文節分かち書き
処理のときにはこの習得された係り受け関係が参照さ
れ、所望の仮名漢字混じり文字列に変換され易くなる。
ただしこの場合には、特殊な派生表記が選択されて仮名
漢字文字列において確定されたときには、上記係り受け
関係の習得を行なわない構成とすることも好ましい。例
えば、図１０において、「うりあげ」という文字列に対
して、派生表記の形態番号１５にあたる記号「￥」を選
択して、売上金に関する文書を作成したとする。このと
き、文字「￥」と、その近傍に記された「円」という文
字との間に係り受け関係が存在するとしてその関係が習
得されたとしても、その後の文節分かち書き処理におい
て、文字「￥」と文字「円」との間で意味上のつながり
が認識される必要性は少ない。従って、特殊な派生表記
が選択された場合には、係り受けの習得を行なわないも
のとするのである。その手段としては、例えば、図１０
に示した派生表記の種類の内、形態番号１の「本則」か
ら８の「拗音大」までは係り受け関係の習得を行なうと
予め指定しておいても良いし、また、特殊な表記の選択
の際に使用者が、オプションスイッチで係り受け関係の
習得を行なわないことを指定することとしても良い。When a new dependency relation is recognized between words, this dependency relation may be learned and stored in the part-of-speech data of the meaning example area. Similar to the above-mentioned derivative notation, the meaning example is also stored in association with the representative notation, and the learned dependency relation is referred to in the phrase segmentation processing using the representative notation, and the desired kana-kanji mixed characters are referred to. Easier to convert to columns.
However, in this case, it is also preferable that when the special derived notation is selected and confirmed in the kana-kanji character string, the modification relation is not learned. For example, in FIG. 10, it is assumed that a document relating to sales proceeds is created by selecting the symbol “¥” corresponding to the form number 15 of the derivative notation for the character string “listening”. At this time, even if the relationship is learned between the character "\" and the character "yen" written in the vicinity of the character "\", even if the relationship is learned, the character "\" It is not necessary to recognize the semantic connection between "" and the character "yen". Therefore, if a special derivative notation is selected, the dependency learning is not performed. As the means, for example, FIG.
Of the types of derivative notation shown in, it is possible to specify in advance that the dependency relations will be learned in advance from “main rule” of form number 1 to “in Japanese grammar” of 8 and it is also possible to specify the special notation At the time of selection, the user may specify with an option switch that learning of the dependency relationship is not performed.

【００７６】本実施例では、図６に示すように、各単語
の読みと、一つの代表表記と、派生表記が存在するとき
には派生表記とを一体の自立語辞書５８に格納する構成
としたが、単語の読みと一つの代表表記とを構成要素と
する単語辞書と、許容する派生表記を代表表記と関連づ
けて格納した派生表記辞書とを分ける構成としても良
い。このとき、文節分かち書き処理に際しては単語辞書
を参照して代表表記を読み込み、文節分かち書き処理後
は、派生表記を有する単語に関しては派生表記辞書を参
照して、表示用表記に書き換えた後に表示を行なうので
ある。このような構成とするならば、単語辞書は登録す
るすべての単語について、その読みと代表表記を記憶し
ておく必要があるが、派生表記辞書は、派生表記を有す
る単語に関して、許容する派生表記を代表表記と関連づ
けて記憶しておけば良い。In the present embodiment, as shown in FIG. 6, the reading of each word, one representative notation, and the derivative notation, if any, are stored in the integrated independent word dictionary 58. The word dictionary having the word reading and one representative notation as constituent elements may be separated from the derivative notation dictionary in which the permitted notation is stored in association with the representative notation. At this time, in the phrase segmentation process, the representative notation is read by referring to the word dictionary, and after the segment segmentation process, the word having the derivative notation is referred to the derivative notation dictionary and is rewritten to the display notation and then displayed. Of. With such a configuration, the word dictionary needs to store the reading and representative notation of all the words to be registered, but the derivative notation dictionary allows the derivative notations that are allowed for words having derivative notations. It suffices to store in association with the representative notation.

【００７７】以上説明した第１実施例の仮名漢字変換装
置により、文節分かち書き処理を行なった結果表示され
た変換候補が所望の表記でなかった場合、キーボード２
４からの変換キーの入力などにより他の候補を表示さ
せ、この中から所望の表記を選択する必要があるが、こ
のような第２候補以下の選択方法を、第２実施例として
次に説明する。図１３（Ａ）は、従来の仮名漢字変換装
置において変換候補の表示を指示したときに用いられる
表示方法である。「とりあつかいがかわる」の「かわ
る」に対して変換候補の指示を入力すると、「かわる」
という読みを持つ単語とその派生表記が、所定の学習機
能などにより並び替えられた順序に従って、同音異義語
と派生表記が混在した状態で１次元的に表示される。こ
れに対して本実施例の仮名漢字変換装置では、図１３
（Ｂ）に示すように、第１候補として表示する単語の同
音異義語と、各語の派生表記とを異なる２方向に展開
し、二次元的に表示される。「かわる」の第１候補が
「代る」であるとき、横方向に「代る」の派生表記であ
る「代わる」を表示し、縦方向には「代る」の同音異義
語である「変る」「替る」を表示する。そして、それぞ
れの同音異義語についても、横方向に１行目の第１候補
の場合と同様に派生表記を展開表示するのである。な
お、図１３では、横方向に派生表記を展開して、縦方向
に同音異義語を展開したが、縦横それぞれに展開する語
の種類は逆であっても差し支えない。If the conversion candidate displayed as a result of performing the phrase segmentation processing by the kana-kanji conversion device of the first embodiment described above is not the desired notation, the keyboard 2
It is necessary to display other candidates by inputting a conversion key from 4 and to select a desired notation from these, and a selection method for such a second candidate and below will be described below as a second embodiment. To do. FIG. 13A shows a display method used when an instruction to display conversion candidates is given in the conventional kana-kanji conversion device. If you enter the conversion candidate instruction for "change" in "change", "change"
The word with the reading and its derivative notation are displayed one-dimensionally in a state in which homonyms and derivative notation are mixed according to the order sorted by a predetermined learning function or the like. On the other hand, in the kana-kanji conversion device of the present embodiment, as shown in FIG.
As shown in (B), the homonyms of the word displayed as the first candidate and the derivative notation of each word are expanded in two different directions and displayed two-dimensionally. When the first candidate of "alter" is "alternate", "alternate" which is a derivative notation of "alternate" is displayed in the horizontal direction, and the homonym of "alternate" is displayed in the vertical direction. "Change" and "Change" are displayed. Then, for each homonym, the derivative notation is expanded and displayed in the same manner as in the case of the first candidate on the first line in the horizontal direction. In FIG. 13, the derivative notation is expanded in the horizontal direction and the homonyms are expanded in the vertical direction, but the types of words expanded in the vertical and horizontal directions may be reversed.

【００７８】以下に、第２実施例で実行される変換候補
表示処理ルーチンを図１４のフローチャートを用いて説
明する。本ルーチンは、変換キーの入力により、変換候
補の表示が指示されることにより開始される。変換キー
が入力されると、まず、展開バッファ中に展開された単
語の中から、第１候補の単語とその同音異義語の代表表
記が読み込まれる（ステップＳ５１０）。上記の例で
は、「代る」に関して「変わる」と「替わる」が読み込
まれる。この時読み込みに使用されるメモリーは、ＲＡ
Ｍ２３上に確保された所定の領域であり、変換候補とな
るデータが一時的に記憶されるものである。次に、第１
候補を含むすべての同音異義語に派生表記があるかどう
かが判断される（ステップＳ５２０）。ここでは、表示
用表示への差し替えの必要・不要を示すフラグに関わり
なく、自立語辞書５８の派生表記領域に派生表記が存在
するかどうかが判断される。第１候補の単語とその同音
異義語に派生表記が存在する場合には、自立語辞書５８
からの読み込みが行なわれる（ステップＳ５３０）。The conversion candidate display processing routine executed in the second embodiment will be described below with reference to the flowchart of FIG. This routine is started when the display of conversion candidates is instructed by inputting the conversion key. When the conversion key is input, first, the representative notation of the first candidate word and its homonyms is read from the words expanded in the expansion buffer (step S510). In the above example, "change" and "alternate" are read for "alternate". The memory used for reading at this time is RA
This is a predetermined area secured on M23, and the data that is a conversion candidate is temporarily stored. Then the first
It is determined whether or not all homonyms including candidates have derivative notations (step S520). Here, it is determined whether or not a derivative notation exists in the derivative notation area of the independent word dictionary 58, regardless of the flag indicating the necessity / unnecessity of replacement with the display for display. If the first candidate word and its homonyms have derivative notations, the independent word dictionary 58
Is read (step S530).

【００７９】派生表記を読み込むと、次に、これらすべ
ての表記を図１３（Ｂ）に示した２次元表示のフォーマ
ット従ってに並び替える（ステップＳ５４０）。１行目
には、第１候補の単語とその派生表記を、学習機能など
により並び替えられた所定の順位に従って展開し、１列
目には、第１候補の単語とその同音異義語を、同じく所
定の順位に従って展開する。図１３の例では第１候補
「代る」と第２候補「代わる」が１行目に、第１候補
「代る」と第３候補「変る」と第５候補「替る」が１列
目に並ぶ。２行２列目以降は、各行の１列目の表記の派
生表記が、各派生表記の変容の形態を示す番号が列によ
って同じになるように配置される。「かわる」において
は、派生表記が図１０に示す１の「本則」および２の
「送る」にあたる送りがな「わる」を送るものと、４の
「許容」にあたる送りがな「る」を送るものの２種類で
あるため、図１３（Ｂ）のように展開されるのである。
このように並び替えた結果を出力し、ＣＲＴディスプレ
イ２６に表示して、本ルーチンを終了する。When the derived notation is read, all of these notations are rearranged according to the two-dimensional display format shown in FIG. 13B (step S540). In the first line, the first candidate word and its derivative notation are expanded according to a predetermined order rearranged by a learning function or the like, and in the first column, the first candidate word and its homonyms are Similarly, develop according to a predetermined order. In the example of FIG. 13, the first candidate “replace” and the second candidate “replace” are in the first row, and the first candidate “replace”, the third candidate “change” and the fifth candidate “replace” are in the first row. Line up. In the second row and the second column and thereafter, the derivative notation of the notation in the first column of each row is arranged such that the number indicating the transformation form of each derivative notation is the same for each column. In "kawaru", there are two types of derivative notation, that is, the "main rule" of 1 and the "warm" of 2 which sends the "waru" which is the feed, and the "4" which sends "ru" which does not feed the "allowance" shown in FIG. Therefore, it is developed as shown in FIG. 13 (B).
The result of rearranging in this way is output and displayed on the CRT display 26, and this routine is ended.

【００８０】以上説明した実施例では、同音異義語と派
生表記とを異なる２方向に展開し、二次元的に表示する
構成としたが、図１５に「とりかえる」という文字列を
例に示す構成も好適である。図１５（Ａ）では、各同音
異義語ごとに、それぞれの派生表記を別個のグループと
して表示する。ここでは、まず、所望の表記に対応する
語を同音異義語の中から選択し、その後、派生表記の中
から所望の表記を選択することになる。これらの動作に
おいて、仮名漢字変換装置が実行する変換候補表示処理
は、第２実施例と共通であり、ステップＳ５４０および
ステップＳ５５０で、同音異義語と派生表記を並び替え
て表示する様式が異なるだけである。また、以上の例の
ように、同音異義語と派生表記とを同時に表示するので
はなく、図１５（Ｂ）に示すように、まず第１候補とし
て表示する単語とその同音異義語を表示し、その後所定
の指示を受けて表示用表記を選択可能な状態に表示する
構成としても良い。このような構成では、まず、所望の
表記に対応する語を同音異義語の中から選択すると、そ
の語の持つ派生表記が新たに表示される。図１５（Ｂ）
では第１候補の「取り替える」に替えて第２候補の「取
り換える」を選択したところ、「取り換える」の派生表
記が表示されることを示す。ここで、所望の表記を選択
するのである。In the embodiment described above, the homonyms and the derivative notations are expanded in two different directions and displayed two-dimensionally, but FIG. 15 shows an example of a character string "replace". The configuration is also suitable. In FIG. 15 (A), each derivative notation is displayed as a separate group for each homonym. Here, first, the word corresponding to the desired notation is selected from the homonyms, and then the desired notation is selected from the derivative notations. In these operations, the conversion candidate display processing executed by the kana-kanji conversion device is the same as that in the second embodiment, and in steps S540 and S550, the homonyms and the derivative notations are rearranged and displayed only in different formats. Is. Further, as in the above example, the homonyms and the derivative notation are not displayed at the same time, but as shown in FIG. 15B, first, the word to be displayed as the first candidate and its homonyms are displayed. After that, the display notation may be displayed in a selectable state upon receiving a predetermined instruction. With such a configuration, first, when a word corresponding to a desired notation is selected from homonyms, the derivative notation of the word is newly displayed. FIG. 15 (B)
Then, it is indicated that when the second candidate “replace” is selected instead of the first candidate “replace”, a derivative notation of “replace” is displayed. Here, the desired notation is selected.

【００８１】このような表示方法によれば、第１候補が
所望の表記でなかった場合に所望の表記を選択すること
が非常に容易となる。いろいろな表記が混在する図１３
（Ａ）の様な場合と異なり、送りがなの送り方を統一し
たいときにも、特定の語については他の語と異なる送り
方を選択したいときにも、複数の候補の中から所望の表
記を容易に見つけだすことができるのである。ここで、
第２実施例において、上記の表示方法による変換候補の
表示は、表示された第１候補が所望の表記でなかったと
きに、変換キー入力などにより変換指示を入力したとき
に表示されることとしても良いし、また、一括で自動変
換を行なうときなどに、現在変換中の単語に関して順次
自動的に表示されることとしても良い。According to such a display method, it becomes very easy to select the desired notation when the first candidate is not the desired notation. Figure 13 with various notations mixed
Unlike in the case of (A), when you want to unify the way of sending, or when you want to select a way of sending a specific word different from other words, you can select the desired notation from multiple candidates. You can easily find it. here,
In the second embodiment, the display of the conversion candidates by the above display method is displayed when the conversion instruction is input by the conversion key input or the like when the displayed first candidate is not the desired notation. Alternatively, when performing automatic conversion collectively, the words currently being converted may be automatically displayed sequentially.

【００８２】派生表記を格納した単語辞書において、異
体文字を用いた表記を共に記憶しておくならば、変換キ
ー入力などにより他の変換候補の表示を指示したときに
異体文字を用いた表記も同時に表示されて選択可能とな
る。以下に第３実施例として、異体文字を用いた表記を
容易に選択可能な構成を有する仮名漢字変換装置につい
て説明する。この第３実施例では、仮名漢字混じり文字
列候補を表示するとき、異体文字を用いた表記を有する
語では異体文字表記が存在することが明示され、この時
異体文字表記を望むならば簡単な入力操作により異体文
字表記が選択可能となっている。In the word dictionary storing the derivative notation, if the notation using the different characters is also stored, the notation using the different characters will be used when the display of other conversion candidates is instructed by the conversion key input. It is displayed at the same time and can be selected. As a third embodiment, a kana-kanji conversion device having a configuration capable of easily selecting a notation using different characters will be described below. In the third embodiment, when displaying a kana-kanji mixed character string candidate, it is clarified that a variant character notation exists in a word having a notation using variant characters, and if a variant character notation is desired at this time, it is simple. Different characters can be selected by input operation.

【００８３】本実施例では文節分かち書き処理における
単語検索処理として、図１１に示した単語検索処理ルー
チンに替えて図１６に示した単語検索処理ルーチンが実
行される。この図１６の処理では、入力した文字列の先
頭から検索する単語の読み長を一つずつ延ばし、さらに
検索する単語の開始位置を後ろに一つずつ伸ばすという
処理の流れは図１１と共通しているため、対応するステ
ップの番号の下２桁を同一にして、詳しい説明は省略す
る。以下に、図１１の処理と異なる部分の動作について
説明する。この第３実施例では、ステップＳ６１２で派
生表記への差し替えが必要であると判断されて代表表記
に第１マークＭＡを付与して表示用表記と共に展開バッ
ファに展開した（ステップＳ６１３）後には、あるいは
ステップＳ６１２で派生表記への差し替えが必要ないと
判断された後には、さらにステップＳ６１４でこの派生
表記の中に異体文字を用いた表記が含まれるかどうかを
判断する。前述した代表表記領域の単語情報の中に、派
生表記への差し替えの必要性を示すフラグと共に、異体
文字を用いた表記が存在することを示すフラグも保存す
ることにより、この判断は容易に実行することができ
る。ここで、第１マークＭＡは、後述する単語差し替え
処理時に認識され、代表表記から表示用表記への書き換
えが行なわれる。In this embodiment, the word search processing routine shown in FIG. 16 is executed in place of the word search processing routine shown in FIG. 11 as the word search processing in the phrase segmentation processing. In the process of FIG. 16, the process flow of extending the reading length of the word to be searched one by one from the beginning of the input character string and further extending the start position of the word to be searched one by one is the same as in FIG. 11. Therefore, the last two digits of the corresponding step numbers are made the same, and detailed description is omitted. The operation of the part different from the process of FIG. 11 will be described below. In the third embodiment, it is determined in step S612 that replacement with the derived notation is necessary, the first mark MA is added to the representative notation, and the representative notation is expanded in the expansion buffer together with the notation for display (step S613). Alternatively, after it is determined in step S612 that the replacement with the derivative notation is not necessary, it is further determined in step S614 whether the derivative notation includes a notation using variant characters. This judgment can be easily performed by saving the flag indicating that there is a need to change to a derivative notation in the word information in the representative notation area described above as well as the flag indicating that a notation using variant characters exists. can do. Here, the first mark MA is recognized during the word replacement process described below, and the representative notation is rewritten to the display notation.

【００８４】ステップＳ６１４で異体文字による表記が
あると判断されたときには、第２マークＭＢを代表表記
に付与して展開バッファに展開する（ステップＳ６１
５）。ステップＳ６１４で異体文字がないと判断された
場合、あるいは異体文字が存在してステップＳ６１５で
第２マークＭＢを付与した場合には、再びステップＳ６
０６に戻る。入力した文字列全体に亘って単語の検索を
行なった後は、これら代表表記を用いて所定の文法チェ
ックやコストチェックが行なわれる。文節分かち書き処
理の後、表示された第１候補の表記が所望の表記でない
ときに変換キー入力を行なえば、図１４の変換候補表示
処理ルーチンが実行され、図１３（Ｂ）に示す様式で同
音異義語や各語の派生表記など他の変換候補を表示させ
ることができ、この中から所望の異体文字による表記を
選択することも可能である。しかしながら、本実施例で
は特に異体文字選択手段を設けることで、第１候補の表
記を即座に異体文字を用いた表記に変換可能となってい
る。If it is determined in step S614 that there is a notation using different characters, the second mark MB is added to the representative notation and the representative notation is developed (step S61).
5). If it is determined in step S614 that there is no variant character, or if there is a variant character and the second mark MB is added in step S615, step S6 is performed again.
Return to 06. After searching for a word in the entire input character string, a predetermined grammar check and cost check are performed using these representative notations. After the phrase segmentation process, if the conversion key is input when the displayed first candidate is not the desired notation, the conversion candidate display processing routine of FIG. 14 is executed, and the same sound is output in the format shown in FIG. It is possible to display other conversion candidates such as synonyms and derivative notations of each word, and it is also possible to select a notation with a desired variant character from among these. However, in the present embodiment, especially by providing the different character selection means, the notation of the first candidate can be immediately converted into the notation using the different characters.

【００８５】以下に、第３実施例で実行される単語差し
替え処理ルーチンを図１７に示すフローチャートを用い
て説明する。ここでは、図１２に示した単語差し替え処
理ルーチンと共通するステップの下２桁の番号を同一に
してあり、詳しい説明は省略する。図１７では、対象単
語に第１マークＭＡがあるかどうかの判断を行なって
（ステップＳ７１０）、第１マークＭＡがある場合に第
１候補を表示用表記に差し替えた後（ステップＳ７２
０）、更に、対象単語に第２マークＭＢがあるかどうか
の判断を行なう（ステップＳ７２３）。第２マークＭＢ
があれば、表示用表記に第３マークＭＣを付与して（ス
テップＳ７２６）、その後は図１２と同様に、未処理単
語があるかどうかを調べ（ステップＳ７３０）、未処理
単語がある限り上記の処理を繰り返す。ステップＳ７２
３で、対象単語に第２マークＭＢがなかった時は、ステ
ップＳ７３０に移行して未処理単語の有無を調べること
になる。The word replacement processing routine executed in the third embodiment will be described below with reference to the flowchart shown in FIG. Here, the last two digits of the steps common to the word replacement processing routine shown in FIG. 12 are the same, and detailed description thereof will be omitted. In FIG. 17, it is determined whether or not the target word has the first mark MA (step S710), and if the first mark MA is present, the first candidate is replaced with the display notation (step S72).
0) Further, it is determined whether or not the target word has the second mark MB (step S723). Second mark MB
If there is, a third mark MC is added to the display notation (step S726), and thereafter, as in FIG. 12, it is checked whether there is an unprocessed word (step S730). The process of is repeated. Step S72
If the target word does not have the second mark MB in step 3, the process proceeds to step S730 to check whether there is an unprocessed word.

【００８６】図１７に示した単語差し替え処理ルーチン
が終了し、第１候補の表示用表記が表示されるとき、図
１８に示す異体文字表示処理ルーチンが実行される。ま
ず、ステップＳ８００において表示用表記に第３マーク
ＭＣがあるかどうかを判断する。第３マークＭＣがあれ
ば、第１候補の表示時に画面の所定の位置に「＊」印を
表示して（ステップＳ８１０）、異体文字による表記が
存在することを明示する。次に、「シフトキー」＋「変
換キー」の入力の有無を判断し（ステップＳ８２０）、
入力された場合には異体文字による表示が選択されたと
判断できるので、表示用表記を異体文字を用いた表記に
変換して表示を行なう（ステップＳ８３０）。ステップ
Ｓ８００で表示用表記に第３マークＭＣが無い場合、お
よび、ステップＳ８２０で「シフトキー」＋「変換キ
ー」の入力が行なわれなかった場合には、異体文字によ
る表示は行なわれず、通常の変換・確定の処理が行なわ
れる。When the word replacement processing routine shown in FIG. 17 is finished and the first candidate display notation is displayed, the foreign character display processing routine shown in FIG. 18 is executed. First, in step S800, it is determined whether or not the third mark MC is included in the display notation. If there is the third mark MC, an "*" mark is displayed at a predetermined position on the screen when the first candidate is displayed (step S810) to clearly indicate that there is an inscription in different characters. Next, it is determined whether or not the "shift key" + "conversion key" is input (step S820),
When the input is made, it can be determined that the display using the different characters is selected, so the display notation is converted into the notation using the different characters for display (step S830). If there is no third mark MC in the display notation in step S800, and if the "shift key" + "conversion key" is not input in step S820, the display in different characters is not performed and normal conversion is performed.・ The confirmation process is performed.

【００８７】例えば、「いちまんえん」という文字列を
入力した場合、第１候補「一万円」が表示されるとき、
同時に画面の所定の位置に「＊」も表示され、異体文字
の存在が明示される。ここで「変換キー」入力を行なえ
ば、「位置」「１」「市」「壱」等「一」の同音異義語
が表示されるが、「シフトキー」＋「変換キー」の入力
により直接「壱」に異体文字変換を行なうことができる
のである。「万」「円」についても同様に「＊」印の表
示により異体文字が存在するとわかるので、「シフトキ
ー」＋「変換キー」の入力により、容易に異体文字
「萬」「圓」を選択できる。また、「壱」だけを異体文
字にしたい場合には、「壱」と異体文字変換した時点で
確定処理を行なえば、文字列「壱万円」を得ることがで
きる。さらに、「壱萬圓」という異体文字列を単語辞書
に記憶しておくことにより、「シフトキー」＋「変換キ
ー」の入力で、「一万円」から「壱萬圓」へ一度に変換
可能な構成としても良い。For example, when the character string "Ichimanen" is input, when the first candidate "10,000 yen" is displayed,
At the same time, "*" is also displayed at a predetermined position on the screen to clearly indicate the presence of the variant character. If you enter the "conversion key" here, homonyms such as "Position", "1", "City", "Ichi", etc. will be displayed, but you can directly enter by pressing the "Shift key" + "Conversion key". It is possible to convert variant characters into "Ichi". Similarly, for "10,000" and "yen", it is possible to know that there are different characters by the display of "*", so you can easily select the different characters "Man" and "En" by entering "Shift key" + "Conversion key". . Further, when only "Ichi" is to be changed into a different character, the character string "Ichiman Yen" can be obtained by performing the confirmation process at the time when the different characters are converted into "I". In addition, by storing a variant character string "Ichiman-en" in the word dictionary, you can convert from "10,000 yen" to "Ichiman-en" at once by entering "Shift key" + "Conversion key". It may be configured as

【００８８】上記のように、本実施例では、画面の所定
の位置に「＊」印を表示することで異体文字による表示
が存在することを明示し、「シフトキー」＋「変換キ
ー」入力により異体文字による表示を選択することで、
第１候補の表記を即座に異体文字を用いた表記に変換可
能となっている。As described above, in the present embodiment, by displaying the "*" mark at a predetermined position on the screen, it is made clear that there is a display with different characters, and by inputting "shift key" + "conversion key". By selecting the display with different characters,
It is possible to immediately convert the notation of the first candidate into the notation using different characters.

【００８９】このとき、上記異体文字選択手段により異
体文字選択を指示したときに、単語辞書内の表示用表記
を異体文字表記に書き換える構成とするならば、次回よ
り同じ単語の変換結果は、異体文字による表記を第１候
補として表示されることとなる。従って、異体文字によ
る表記を用いて文書を作成したいときには、容易に所望
の表記が変換候補として得られるようになる。これに対
して、上記の異体文字選択手段により異体文字表記を選
択したときには、単語辞書内の表示用表記の書き換えは
行なわない構成とすることもできる。異体文字表記は特
殊な表記であり、表示用表記の書き換えを行なうことが
必ずしも得策とはいえない場合もあるからである。At this time, if the display notation in the word dictionary is rewritten to the variant character notation when the variant character selecting means is instructed to select the variant character, the conversion result of the same word will be changed to the variant character from the next time. The character notation will be displayed as the first candidate. Therefore, when a user wants to create a document using the notation of different characters, the desired notation can be easily obtained as a conversion candidate. On the other hand, when the different character notation is selected by the different character selecting means, the display notation in the word dictionary may not be rewritten. This is because the variant character notation is a special notation and it may not always be a good idea to rewrite the notation for display.

【００９０】次に本発明の第４の実施例について説明す
る。第４実施例の仮名漢字変換装置は、仮名漢字変換の
際に参照する通常の単語辞書の他に、時刻、年月日など
の所定の時点を示す日時情報と、この日時情報を表わす
特定の見出し語とを対応して記憶する日時用語変換辞書
を備えている。この日時用語変換辞書の構造の一例を図
１９に示す。日時用語変換辞書は、日時情報を表わす特
定の見出し語の読みと、この特定の見出し語に対応した
日時情報の種類を単位として構成されており、本実施例
では、さらに日時情報表示用のフォーマットも一体とし
て格納されている。例えば、図１９では、日時情報を表
わす見出し語の一つの読みである「いま」に対して、こ
れに対応した日時情報の種類として「現在の時刻」とい
う情報が記憶されており、日時表示用フォーマットとし
て「％Ｈ：％Ｍ」が用意されている。従って、この日時
用語変換辞書を参照して仮名文字列の変換を行なうと
き、「いま」という文字列を入力して変換処理を行なう
と、現在の時刻を「１０：４５」という表示で得ること
が可能となるのである。Next, a fourth embodiment of the present invention will be described. The kana-kanji conversion device according to the fourth embodiment includes, in addition to an ordinary word dictionary referred to when converting kana-kanji, date and time information indicating a predetermined time point such as time and date, and a specific word and date information indicating this date and time information. A date and time term conversion dictionary that stores the headwords and the corresponding words is provided. FIG. 19 shows an example of the structure of this date / time term conversion dictionary. The date-and-time term conversion dictionary is configured with a unit of the reading of a specific headword representing date-and-time information and the type of date-and-time information corresponding to this specific headword. Is also stored as a unit. For example, in FIG. 19, “current time” is stored as the type of the date and time information corresponding to this, which is one of the readings of the entry word representing the date and time information. "% H:% M" is prepared as the format. Therefore, when converting a kana character string by referring to this date and time term conversion dictionary, if the character string "now" is input and conversion processing is performed, the current time is displayed as "10:45". Is possible.

【００９１】このような日時情報を表わす特定の見出し
語を対応する日時情報に変換する処理を、以下、日時変
換と呼ぶことにする。入力された仮名文字列に対して文
節分かち書き処理が開始され、通常の辞書検索が行なわ
れると、これと平行して図２０に示す日時変換処理が実
行される。この日時変換処理が開始されると、通常の単
語読み込み処理と同様に、入力した文字列の先頭を検索
開始位置とする（ステップＳ９００）。まず、単語検索
における読みの長さを示す変数Ｌを値１に初期化し（ス
テップＳ９１０）、次に、検索位置から始まって読み長
がＬであるような特定の見出し語があるかどうかを調べ
る（ステップＳ９２０）。「いま」「きょう」などの日
時情報を表わす特定の見出し語の読みに相当する文字列
が見つかったならば、日時用語変換辞書と図１に示した
タイマ３３を参照して、それぞれの特定の見出し語に応
じた日時変換を行なう（ステップＳ９３０）。タイマ３
３は現時点における日時情報を発生しているため、日時
用語変換辞書を参照することで変換すべき日時情報の種
類が決定すると、タイマ３３が発生する現時点での日時
情報を基に日時変換の実行が可能となる。例えば、「き
ょう」という特定の見出し語が見つかったならば、日時
用語変換辞書から今日の日付に変換する語であることが
わかり、タイマ３３を参照することで具体的な今日の日
付（「１月１日」であるといった情報）に変換されるの
である。The process of converting a specific entry word representing such date / time information into corresponding date / time information will be hereinafter referred to as date / time conversion. When the phrase segmentation processing is started for the input kana character string and a normal dictionary search is performed, the date / time conversion processing shown in FIG. 20 is executed in parallel with this. When this date / time conversion process is started, the top of the input character string is set as the search start position, as in the normal word reading process (step S900). First, the variable L indicating the reading length in the word search is initialized to the value 1 (step S910), and then it is checked whether or not there is a specific headword whose reading length is L starting from the search position. (Step S920). If a character string corresponding to the reading of a specific entry word representing date and time information such as "now" or "kyo" is found, the date and time term conversion dictionary and the timer 33 shown in FIG. Date conversion is performed according to the entry word (step S930). Timer 3
Since 3 has generated date and time information at the present time, when the type of date and time information to be converted is determined by referring to the date and time term conversion dictionary, execution of date and time conversion is performed based on the current date and time information generated by the timer 33. Is possible. For example, if a specific headword "Kyo" is found, it is known from the date / time term conversion dictionary that the word is converted to today's date, and by referring to the timer 33, the specific today's date ("1 Information such as "1st of the month".

【００９２】この変換結果は一旦記憶され（ステップＳ
９４０）、日時情報を表わす特定の見出し語には日時用
語マークが付され（ステップＳ９５０）、引き続き文節
分かち書き処理に供される。その後、検索する単語長を
伸ばせるかどうかの判断を行ない（ステップＳ９６
０）、単語長を伸ばすことができる場合にはＬをＬ＋１
として読みを１文字分長くする処理を行なって（ステッ
プＳ９７０）、その後再びステップＳ９２０に戻り、特
定の見出し語の検索を続けるのである。ステップＳ９６
０において単語長を伸ばすことができないと判断されれ
ば、次に、検索開始位置を後ろにずらすことができるか
どうかを調べる（ステップＳ９８０）。検索開始位置が
入力した文字列の最後の桁であり、これ以上後ろにずら
すことができない場合には時刻変換処理を終了する。ス
テップＳ９８０で、検索開始位置を後ろにずらすことが
可能であると判断された場合には、検索開始位置を一つ
だけ後ろにずらし、ステップＳ９１０に戻って再びＬを
値１に初期化し、上記の処理を繰り返す。This conversion result is temporarily stored (step S
940), a date / time term mark is attached to the specific entry word representing the date / time information (step S950), and the phrase segmentation processing is continued. Then, it is judged whether or not the word length to be searched can be extended (step S96).
0), L is L + 1 if the word length can be extended
As a result, the process of lengthening the reading by one character is performed (step S970), and then the process returns to step S920 again to continue the search for the specific headword. Step S96
If it is determined that the word length cannot be extended at 0, then it is checked whether the search start position can be moved backward (step S980). When the search start position is the last digit of the input character string and it cannot be moved any further backward, the time conversion process ends. When it is determined in step S980 that the search start position can be moved backward, the search start position is moved backward by one, and the process returns to step S910 to initialize L to the value 1 again, and The process of is repeated.

【００９３】また、ステップＳ９２０において、読み長
Ｌの特定の見出し語が見つからなかった場合には、ステ
ップＳ９６０に移行して単語長を伸ばせるかどうかの判
断を行なう。その後は、前述のように、単語長を伸ばせ
る場合にはステップＳ９７０で読み長Ｌを値１だけ伸ば
して再び特定の見出し語の検索を行ない、単語長が伸ば
せない場合には、ステップＳ９８０で検索開始位置を後
ろにずらすことができるかどうかを判断するのである。In step S920, if a specific headword having the reading length L is not found, the process proceeds to step S960 and it is determined whether the word length can be extended. After that, as described above, if the word length can be extended, the reading length L is extended by a value of 1 in step S970, and a specific headword is searched again. If the word length cannot be extended, the search is performed in step S980. It determines if the starting position can be moved backwards.

【００９４】次に、上記日時変換処理によって日時変換
された特定の見出し語について、文節分かち書き後に変
換候補を表示する際行なわれる日時書き換え処理のフロ
ーチャートを図２１に示す。この日時書き換え処理は、
入力した仮名文字列について文節分かち書き処理を行な
って変換候補として残った単語の中で、先頭の単語を対
象単語として開始される（ステップＳ１０００）。始め
に対象単語に日時用語マークがあるかどうかを調べる
（ステップＳ１０１０）。日時用語マークが付いていれ
ば、その単語は日時情報を表わす特定の見出し語である
と判断されるので、この日時用語マークの付いた単語
を、日時変換した結果に書き換える（ステップＳ１０２
０）。これは、前述した日時変換処理において日時変換
後いったん記憶しておいた変換結果を、図１８の日時用
語変換辞書に記憶された日時表示用フォーマットに従っ
て表わし、日時用語マークの付いた単語と書き換えるも
のである。例えば、日時用語マークを有する「きょう」
という単語があったならば、この単語は日時変換処理に
よって、今日の日付は１月１日であるといった情報に変
換の後記憶されているので、この変換結果を日時表示用
フォーマット「％Ｍ月％Ｄ日」に従って表わす。このよ
うに日時表示用フォーマットに従い単語を書き替えた結
果は、再び一旦記憶される。Next, FIG. 21 shows a flowchart of the date and time rewriting process performed when displaying the conversion candidates after segmentation and segmentation of a specific headword whose date and time has been converted by the date and time conversion process. This date and time rewriting process is
Of the words remaining as conversion candidates after performing the phrase segmentation processing on the input kana character string, the first word is started as the target word (step S1000). First, it is checked whether or not the target word has a date / time term mark (step S1010). If the date / time term mark is attached, it is determined that the word is a specific entry word representing date / time information, and thus the word with the date / time term mark is rewritten to the result of date / time conversion (step S102).
0). This is to display the conversion result once stored after the date and time conversion in the date and time conversion processing described above according to the date and time display format stored in the date and time term conversion dictionary of FIG. 18, and rewrite it with the word with the date and time term mark. Is. For example, "Today" with the date and time term mark
If this word is found, this word is stored after being converted into information that today's date is January 1 by the date / time conversion process, so the conversion result is stored in the date / time display format “% M month”. % D days ". The result of rewriting the word according to the date / time display format is once again stored.

【００９５】その後、未処理単語があるかどうかを調べ
（ステップＳ１０３０）、変換候補として残った単語の
中に未処理単語がなければこの日時書き換え処理を終了
する。このステップＳ１０３０において未処理単語があ
ると判断されれば、対象単語を一つ後ろにずらして（ス
テップＳ１０４０）、再びステップＳ１０１０に戻って
日時用語マークの付いた単語の検索を行ない、上述の処
理を繰り返すのである。また、ステップＳ１０１０にお
いて、対象単語に日時用語マークがなければ、ステップ
Ｓ１０３０に移行して文節分かち書きされた候補単語の
中に未処理単語があるかどうかを調べる。未処理単語が
あればステップＳ１０４０で対象単語を一つ後ろにずら
し、最終的にステップＳ１０３０で未処理単語なしと判
断されるまで上記の処理を繰り返す。After that, it is checked whether or not there is an unprocessed word (step S1030), and if there is no unprocessed word among the words remaining as conversion candidates, this date / time rewriting process is terminated. If it is determined in this step S1030 that there is an unprocessed word, the target word is moved back by one (step S1040), and the process returns to step S1010 to search for a word with a date / time term mark, and the above-mentioned processing is performed. Is repeated. If the target word does not have the date / time term mark in step S1010, the process moves to step S1030, and it is checked whether or not there is an unprocessed word among the candidate words segmented into segments. If there is an unprocessed word, the target word is moved backward by one in step S1040, and the above processing is repeated until it is finally determined in step S1030 that there is no unprocessed word.

【００９６】すべての処理が終了し、第１候補の仮名漢
字混じり文字列が表示されるときには、上記の日時情報
を表わす特定の見出し語に関しては、まず、通常の単語
として文節分かち書き処理された結果が表示される。
「きょう」という文字列の仮名漢字変換結果としては、
例えば、第１候補として「今日」が表示されるのであ
る。ここで、日時変換結果を望む場合には、通常の変換
指示を入力すれば良い。変換指示の入力により、他の候
補である「きょう」や「京」等と共に、上記日時変換結
果である「１月１日」を表示させることができ、選択可
能となる。When all the processes are completed and the first candidate kana-kanji mixed character string is displayed, the specific headword representing the above date and time information is first processed as a regular word by segmentation. Is displayed.
The conversion result of the Kana-Kanji character string "Kyou" is:
For example, "today" is displayed as the first candidate. Here, if a date / time conversion result is desired, a normal conversion instruction may be input. By inputting the conversion instruction, "January 1st" which is the date and time conversion result can be displayed together with other candidates such as "Kyoto" and "Kyo", and can be selected.

【００９７】上記第４実施例では、日時用語変換辞書に
おいて、各特定の見出し語に対して１種類の日時表示用
フォーマットしか用意されていなかったが、複数の日時
表示用フォーマットを登録しておく構成としても良い。
このような構成を持つ仮名漢字変換装置を第５実施例と
して以下に説明する。図２２に、第５実施例の仮名漢字
変換装置に備えられた日時用語変換辞書の構成の一例を
示す。ここでは、例えば「きょう」という日時を表わす
特定の見出し語に対して４種類の日時表示用フォーマッ
トが用意されており、これらは、変換候補を表示する際
の使用優先順に従って辞書内に記憶されている。いま、
使用優先順位が最も高いフォーマットは「％Ｍ月％Ｄ
日」であるので、「きょう」という文字列を入力して時
刻変換を行なうと、このフォーマットに従って「１月１
日」のように本日の日付が変換結果の第１候補として表
示される。このとき、「１月１日」が所望の変換結果で
なければ、変換キーを入力することにより、「１月１日
（火）」という第２候補以下の変換結果を表示させるこ
とができる。In the fourth embodiment, only one type of date and time display format is prepared for each specific entry word in the date and time term conversion dictionary, but a plurality of date and time display formats are registered. It may be configured.
A kana-kanji conversion device having such a configuration will be described below as a fifth embodiment. FIG. 22 shows an example of the structure of the date and term conversion dictionary provided in the kana-kanji conversion device of the fifth embodiment. Here, for example, four types of date and time display formats are prepared for a specific entry word that represents the date and time, and these are stored in the dictionary according to the order of use priority when displaying conversion candidates. ing. Now
The format with the highest priority of use is "% M Month% D
Since it is "day", if you enter the character string "today" and perform time conversion, "January 1
Today's date such as “day” is displayed as the first candidate for the conversion result. At this time, if "January 1st" is not the desired conversion result, by inputting the conversion key, it is possible to display the conversion results of "January 1st (Tuesday)" for the second candidate and below.

【００９８】この第２候補以下の変換結果の中から「１
月１日火曜日」という候補を選択すれば、選択した候補
が使用した「％Ｍ月％Ｄ日％Ｗ曜日」というフォーマッ
トが、「％Ｍ月％Ｄ日」というフォーマットに代わって
使用優先順位が最も高いフォーマットとなり、日時用語
変換辞書内の順位が書き換えられる。従って、これ以後
「きょう」という特定の見出し語の時刻変換を行なう
と、上記「％Ｍ月％Ｄ日％Ｗ曜日」というフォーマット
に従った表記を第１候補として日時変換した結果が表示
されることになる。以上説明した第５実施例の仮名漢字
変換装置によれば、使用者が選択した日時表示用フォー
マットが使用優先順位が最も高いフォーマットとして学
習されるため、使用者が好んで選択するフォーマットに
従って時刻変換の結果を表示させることが極めて容易に
なる。From the conversion results below the second candidate, "1" is selected.
If you select the candidate "Tuesday, month 1", the format "% M month% D day% W day" used by the selected candidate will be used in place of the "% M month% D day" format. It becomes the highest format, and the ranking in the date and time term conversion dictionary is rewritten. Therefore, if the time conversion of the specific headword "Kyo" is performed thereafter, the result of date and time conversion using the notation according to the format "% M month% D day% W day of the week" as the first candidate is displayed. It will be. According to the kana-kanji conversion device of the fifth embodiment described above, the format for displaying the date and time selected by the user is learned as the format with the highest priority of use, so time conversion is performed according to the format that the user prefers. It becomes extremely easy to display the result of.

【００９９】このとき、日時変換を行なった結果は、日
時を表わす特定の見出し語の派生表記として表示するこ
ととしても良いし、同じく特定の見出し語の同音異義語
として表示することとしても良い。派生表記として表示
するならば、「１月１日」等の日時を表わす表記を、
「京」や「鏡」ではなく、日時を表わす「今日」に関連
する表記として扱うことになり好ましい。一方、同音異
義語として表示するならば、その表示の際には、第１候
補のフォーマットが同音異義語として、用意されたその
他のフォーマットが同音異義語の派生表記として扱われ
る。従って、上記時刻変換の結果を、第２実施例に示し
た表示方法に組み込んで、他の同音異義語や派生表記と
共に表示することができる。At this time, the result of the date / time conversion may be displayed as a derivative notation of a specific headword representing the date and time, or may be displayed as a homonym of the specific headword. If displayed as a derivative notation, the notation indicating the date and time such as "January 1"
This is preferable because it is treated as a notation related to "today" indicating the date and time, not "Kyo" or "mirror". On the other hand, if it is displayed as a homonym, at the time of the display, the format of the first candidate is treated as a homonym and the other prepared formats are treated as derivative notations of the homonym. Therefore, the result of the time conversion can be incorporated into the display method shown in the second embodiment and displayed together with other homonyms and derivative notations.

【０１００】また、上記第４および第５実施例では、日
時表示用フォーマットを日時用語変換辞書内に記憶する
構成としたが、日時表示用フォーマットを日時用語変換
辞書と別体で記憶する構成としても差し支えない。日時
表示用フォーマットを日時用語変換辞書と別体で記憶す
るならば、日時表示用フォーマットは、時刻を表示する
ためのもの、日付を表示するためのもの、暦年を表示す
るためのものと種類別に記憶し、各特定の見出し語ごと
にそれぞれに適した種類のフォーマットを参照する構成
とすることができる。In the fourth and fifth embodiments, the date / time display format is stored in the date / time term conversion dictionary. However, the date / time display format is stored separately from the date / time term conversion dictionary. It doesn't matter. If the date and time display format is stored separately from the date and time term conversion dictionary, the date and time display format is for displaying the time, for displaying the date, for displaying the calendar year, and for each type. It is possible to store and refer to a format of a type suitable for each specific headword.

【０１０１】上記の構成においては、入力した数字の時
刻フォーマッティングが可能となる。すなわち、「１１
４７」といった数字を入力して「１１時４７分」という
変換結果を表示させるのである。このような時刻フォー
マッティングが実行されるときの動作を以下に概説す
る。入力された文字列に対して文節分かち書き処理が実
行されると、この中の辞書検索と平行して、図２３に示
した時刻フォーマッティング処理を行なう。まず、入力
された文字列の先頭を検索位置として（ステップＳ１１
００）、４桁の数字があるかどうかを判断する（ステッ
プＳ１１１０）。４桁の数字が見つかれば、これに時刻
マークを付け、（ステップＳ１１２０）。時刻単語とし
てその後の文節分かち書き処理に供する。その後、検索
位置を後ろにずらせるかどうかを調べ（ステップＳ１１
３０）、可能であれば検索位置を一桁後ろにずらしてス
テップＳ１１１０に戻る。ステップＳ１１１０において
４桁の数字が見つからないときには、ステップＳ１１３
０に移行し、入力された文字列の末尾に至るまで４桁の
数字の検索を行なうのである。With the above arrangement, it is possible to time-format the input numbers. That is, "11
By inputting a number such as "47", the conversion result of "11:47" is displayed. The operation when such time formatting is executed will be outlined below. When the phrase segmentation processing is executed for the input character string, the time formatting processing shown in FIG. 23 is executed in parallel with the dictionary search in the text. First, the head of the input character string is set as the search position (step S11).
00), it is determined whether there is a 4-digit number (step S1110). If a four-digit number is found, a time mark is attached to this number (step S1120). It is used as a time word for the subsequent phrase segmentation processing. After that, it is checked whether or not the search position is moved backward (step S11).
30), if possible, shift the search position backward by one digit and return to step S1110. If a four-digit number is not found in step S1110, step S113
It shifts to 0, and a 4-digit number is searched until the end of the input character string.

【０１０２】入力された文字列を文節分かち書きした結
果を変換候補として表示するときには、以上の処理を踏
まえ、時刻マークの付いた文字列については、前２桁の
数字は「％Ｈ］、後ろ二桁の数字は「％Ｍ」として処理
される。従って、予め辞書内に用意された時刻を表示す
る日時表示用フォーマットの中で、「％Ｈ時％Ｍ分」の
使用優先順位が最も高いならば、上記「１１４７」の文
節分かち書き処理後の変換結果は「１１時４７分」と表
示されるのである。When displaying the result of writing the input character string segmented into phrases as conversion candidates, in consideration of the above processing, for the character string with the time mark, the first two digits are "% H" and the second Digit numbers are treated as "% M". Therefore, if the usage priority of "% H hour% M minutes" is the highest in the date and time display format for displaying the time prepared in advance in the dictionary, the conversion after the phrase segmentation processing of "1147" is performed. The result is displayed as "11:47".

【０１０３】また、上記第４および第５実施例では、日
時変換を行なった結果は、通常の仮名漢字変換処理を行
なった結果を表示した後に、所定の変換指示を入力する
ことにより表示することとしたが、直接日時変換の結果
を表示する構成とすることも好適である。例えば、「き
ょう」という仮名文字列を入力したときに、これを通常
の単語として変換した結果を第１候補として表示し、変
換キーを入力すれば日時変換結果を除いた他の派生表記
や同音異義語が表示され、「変換キー＋シフトキー」の
入力により日時変換した結果が用意された複数のフォー
マットにより表示されるのである。この時には、日時表
示用フォーマットと派生表記が混在する中から所望の表
記を選択する必要がなく、「変換キー＋シフトキー」入
力時にだけ日時表示用フォーマットを参照すれば良いの
である。上記「きょう」という単語の例では、第１候補
としては「今日」が得られ、変換キー入力を行なえば第
２候補以下の「きょう」「京」「「鏡」などを表示す
る。このとき「変換キー＋シフトキー」入力により日時
変換した結果が表示され選択可能となるのである。もと
より、用意された複数の日時表示用フォーマットでは、
使用者の選択状況が学習され、直前に選択したフォーマ
ットが第１候補として表示される。このような構成とす
れば、日時変換を望まないときに日時変換結果が変換候
補として表われることがなく、日時変換を望むときにも
容易に所望の表記を選択することが可能となる。In the fourth and fifth embodiments, the date and time conversion result is displayed by inputting a predetermined conversion instruction after displaying the result of the ordinary kana-kanji conversion processing. However, it is also preferable to directly display the result of date and time conversion. For example, when a kana character string "Kyou" is entered, the result of converting it as a normal word is displayed as the first candidate, and if the conversion key is entered, other derivative notations and homophones except the date and time conversion result are displayed. The synonyms are displayed, and the date and time conversion result by inputting the "conversion key + shift key" is displayed in a plurality of prepared formats. At this time, it is not necessary to select a desired notation from the mixture of the date and time display format and the derivative notation, and the date and time display format may be referred to only when the "conversion key + shift key" is input. In the above example of the word "kyo", "today" is obtained as the first candidate, and if the conversion key is input, "kyo", "kyo", "mirror", etc. below the second candidate are displayed. At this time, by inputting the "conversion key + shift key", the date and time conversion result is displayed and can be selected. Of course, in the prepared multiple date and time display formats,
The user's selection situation is learned, and the format selected immediately before is displayed as the first candidate. With such a configuration, the date / time conversion result does not appear as a conversion candidate when the date / time conversion is not desired, and the desired notation can be easily selected even when the date / time conversion is desired.

【０１０４】以上説明した実施例では、キーボードから
入力した仮名文字列を文節分かち書きする場合について
述べたが、文節分かち書き処理を行なう対象は単に仮名
文字列だけに限られるものではなく、例えば、仮名漢字
変換装置を手書き入力可能とし、「ヒ行キ」と手書き入
力した文字列を認識して「飛行機」と変換可能な構成も
好適である。In the above-described embodiment, the case where the kana character string input from the keyboard is written into the phrase is described. It is also preferable that the conversion device is capable of handwriting input, and that the character string “handwritten” is recognized and converted into “airplane”.

【０１０５】以上本発明の実施例について説明したが、
本発明はこうした実施例に何等限定されるものではな
く、例えば最小コスト法に替えて２文節最長一致法など
の他の文節分かち書きの手法を用いた構成など、本発明
の要旨を逸脱しない範囲内において種々なる様態で実施
し得ることは勿論である。The embodiments of the present invention have been described above.
The present invention is not limited to these embodiments, and for example, a configuration using another phrase segmentation method such as the two-segment longest matching method instead of the minimum cost method is within the scope not departing from the gist of the present invention. It goes without saying that the above can be implemented in various modes.

[Brief description of drawings]

【図１】本発明の一実施例である仮名漢字変換装置が実
現されるハードウェアを示すブロック図である。FIG. 1 is a block diagram showing hardware for realizing a kana-kanji conversion device according to an embodiment of the present invention.

【図２】実施例としての仮名漢字変換装置における仮名
漢字変換機能の実現形態を示す機能ブロック図である。FIG. 2 is a functional block diagram showing an implementation form of a kana-kanji conversion function in a kana-kanji conversion device as an example.

【図３】本発明の実施例で実行される仮名漢字変換処理
ルーチンを説明するフローチャートである。FIG. 3 is a flowchart illustrating a kana-kanji conversion processing routine executed in an embodiment of the present invention.

【図４】文節分かち書き部１０２において実行される文
節分かち書き処理を示すフローチャートである。FIG. 4 is a flowchart showing a phrase segmentation writing process executed in a segmentation segment writing unit 102.

【図５】最小コスト法による文節分かち書き処理の様子
を示す説明図である。FIG. 5 is an explanatory diagram showing a state of phrase segmentation writing processing by the minimum cost method.

【図６】自立語辞書５８の内部構成を示す説明図であ
る。FIG. 6 is an explanatory diagram showing an internal configuration of an independent word dictionary 58.

【図７】代表表記領域の管理の様子を示す説明図であ
る。FIG. 7 is an explanatory diagram showing how a representative notation area is managed.

【図８】単語データの構成を示す説明図である。FIG. 8 is an explanatory diagram showing a structure of word data.

【図９】単語情報の詳細を各セパレータとともに示す説
明図である。FIG. 9 is an explanatory diagram showing details of word information together with each separator.

【図１０】派生表記の例を示す説明図である。FIG. 10 is an explanatory diagram showing an example of derivative notation.

【図１１】第１実施例における単語検索処理を示すフロ
ーチャートである。FIG. 11 is a flowchart showing a word search process in the first embodiment.

【図１２】第１実施例における単語差し替え処理を示す
フローチャートである。FIG. 12 is a flowchart showing a word replacement process in the first embodiment.

【図１３】第２実施例における変換候補の表示方法を示
す説明図である。FIG. 13 is an explanatory diagram showing a method of displaying conversion candidates in the second embodiment.

【図１４】その変換候補表示処理を示すフローチャート
である。FIG. 14 is a flowchart showing the conversion candidate display processing.

【図１５】変換候補の表示方法のその他の例を示す説明
図である。FIG. 15 is an explanatory diagram showing another example of the conversion candidate display method.

【図１６】第３実施例における単語検索処理を示すフロ
ーチャートである。FIG. 16 is a flowchart showing a word search process in the third embodiment.

【図１７】第３実施例における単語差し替え処理を示す
フローチャートである。FIG. 17 is a flowchart showing word replacement processing in the third embodiment.

【図１８】第３実施例における異体文字表示処理を示す
フローチャートである。FIG. 18 is a flowchart showing a different character display process in the third embodiment.

【図１９】第４実施例における日時用語変換辞書の一例
を示す説明図である。FIG. 19 is an explanatory diagram showing an example of a date and time term conversion dictionary in the fourth embodiment.

【図２０】第４実施例における日時変換処理を示すフロ
ーチャートである。FIG. 20 is a flowchart showing date / time conversion processing in the fourth embodiment.

【図２１】第４実施例における日時書き換え処理を示す
フローチャートである。FIG. 21 is a flowchart showing date / time rewriting processing in the fourth embodiment.

【図２２】第５実施例における日時用語変換辞書の一例
を示す説明図である。FIG. 22 is an explanatory diagram showing an example of a date and time term conversion dictionary in the fifth embodiment.

【図２３】時刻フォーマッティング処理を示すフローチ
ャートである。FIG. 23 is a flowchart showing a time formatting process.

[Explanation of symbols]

２１…ＣＰＵ２２…ＲＯＭ２３…ＲＡＭ２４…キーボード２５…キーボードインターフェース２６…ＣＲＴディスプレイ２７…ＣＲＴＣ２８…プリンタ２９…プリンタインターフェース３１…バス３２…ハードディスク３３…タイマ４０…文字入力部４２…変換制御部４４…変換後文字列出力部５０…文字列入力部５２…文字格納部５４…自立語候補作成部５６…自立語解析位置管理部５８…自立語辞書６４…付属語候補作成部６６…付属語解析位置管理部６８…付属語辞書７２…自立語学習部７４…補助語学習部７６…接辞学習部７８…文字変換学習部８０…単語データ作成部８２…接続検定部８４…接続検定テーブル８６…コスト計算部１００…単語データ格納部１０２…文節分かち書き部１０６…文節データ格納部１０８…変換文字列出力部 21 ... CPU 22 ... ROM 23 ... RAM 24 ... Keyboard 25 ... Keyboard interface 26 ... CRT display 27 ... CRTC 28 ... Printer 29 ... Printer interface 31 ... Bus 32 ... Hard disk 33 ... Timer 40 ... Character input section 42 ... Conversion control section 44 ... Converted character string output unit 50 ... Character string input unit 52 ... Character storage unit 54 ... Independent word candidate creation unit 56 ... Independent word analysis position management unit 58 ... Independent word dictionary 64 ... Adjunct word creation unit 66 ... Adjunct word analysis Position management unit 68 ... Adjunct dictionary 72 ... Independent word learning unit 74 ... Auxiliary word learning unit 76 ... Affix learning unit 78 ... Character conversion learning unit 80 ... Word data creation unit 82 ... Connection verification unit 84 ... Connection verification table 86 ... Cost Calculation unit 100 ... Word data storage unit 102 ... Phrase segmentation writing unit 106 ... Phrase data case Part 108 ... conversion character string output unit

Claims

[Claims]

1. A kana character string is input, a dictionary is referred to,
A kana-kanji conversion device that writes out the input kana character string by segmenting and generates kana-kanji mixed character string candidates, including a word dictionary storing word reading and one representative notation as constituent elements, and a plurality of word dictionaries. For a word for which notation is allowed, a derivative notation dictionary in which the allowed notation is stored in association with the representative notation, and for the input kana character string, the word dictionary is referred to, and the representative notation is used. If there is any other notation allowed for the words constituting the bunsetsu obtained by the bunsetsu segmenting process, and the bunsetsu segmenting unit for performing the segment segmenting process,
A word notation selecting unit that determines a predetermined notation as a display notation, and a display that displays the word represented by the determined display notation as the first candidate of the words that constitute the kana-kanji mixed character string candidate A kana-kanji conversion device equipped with means.

2. The kana-kanji conversion device according to claim 1, wherein the word notation selecting means includes means for determining the display notation by referring to the derived notation dictionary.

3. The kana-kanji conversion device according to claim 1, wherein the word notation selecting means stores a corresponding notation from the derivative notation dictionary in the word dictionary based on the use of past derivative notations. A kana-kanji conversion device comprising means and means for determining the display notation by referring to information stored in the word dictionary.

4. The kana-kanji conversion device according to claim 1, wherein the word dictionary and the derivative notation dictionary are integrally configured.

5. The kana-kanji conversion device according to claim 1, wherein the display means allows homonyms of the word displayed as the first candidate and at least the word of the first candidate. A kana-kanji conversion device having division display means for displaying in a form distinguishable from notation.

6. The kana-kanji conversion device according to claim 5, wherein the split display unit has a homonym of a word displayed as the first candidate, a word of the first candidate, and a homonym thereof. A kana-kanji conversion device that is a means for expanding the notation that is allowed for two different directions and displaying other word notations for the first candidate word notation two-dimensionally.

7. The kana-kanji conversion device according to claim 5, wherein the split display unit displays the homonyms together with the word displayed as the first candidate, and then receives a predetermined instruction, A kana-kanji conversion device comprising means for displaying an acceptable notation for at least one of the first candidate word and / or the homonym.

8. The kana-kanji conversion device according to any one of claims 1 to 4, wherein, among the allowed notations including the first candidate, there are notations using different characters, the display means displays the first When displaying one candidate word, a variant character existence display means for displaying that a variant character exists for the word of the first candidate, and converting the word notation of the first candidate into a notation including a desired variant character A kana-kanji conversion device having means for selecting different characters.

9. The kana-kanji conversion device according to claim 1, wherein the word notation selecting means learns a situation in which a user selects a word notation from the plurality of allowed notations, A kana-kanji conversion device that determines subsequent display notations based on the result of the learning.

10. The kana-kanji conversion device according to claim 9, wherein, based on the result of the learning, the order of a plurality of allowed notations stored in the derived notation dictionary is rearranged and stored,
A kana-kanji conversion device equipped with a derivative notation dictionary learning means for preparing the plurality of allowed notations so that the rearranged order can be displayed as a conversion candidate rank.

11. The kana-kanji conversion device according to claim 9, wherein the display notation determined based on the result of the learning is stored in association with the representative notation to form a kana-kanji mixed character string candidate. A kana-kanji conversion device equipped with a display notation learning means that is prepared to be displayed as the first candidate of a word to be reproduced.

12. The kana-kanji conversion device according to claim 9, wherein when one word notation is selected by the user from the plurality of allowed notations, it corresponds to the selected word notation. Regarding a representative notation, a kana-kanji conversion device comprising word dictionary learning means for changing the priority order to be used as a candidate of a word that constitutes the kana-kanji mixed character string candidate within a homonym in the word dictionary.

13. The kana-kanji conversion device according to claim 9, wherein the word notation selecting means selects a notation using variant characters from the plurality of allowed notations,
Kana-to-Kanji conversion device that prohibits learning the selection status of word notation.

14. The kana-kanji conversion device according to claim 1, wherein in the process of generating the kana-kanji mixed character string candidates, it is learned that two or more words have been converted with a predetermined dependency relationship. And a learning method for prohibiting the learning of the dependency relationship when a predetermined derivative with low frequency of use is selected in at least one of the two or more words and is determined in a kana-kanji mixed character string. A kana-kanji conversion device equipped with means.

15. A kana-kanji conversion device for inputting a kana-character string, referring to a dictionary, and writing out the input kana-character string for each phrase, and generating a kana-kanji mixed character string candidate. Date and time information indicating a predetermined time such as month and day, and a date and time term conversion dictionary that stores a specific entry word representing the date and time information in association with each other, and current date and time information generating means for generating the date and time information at the present time, When the phrase segmentation processing is performed in accordance with predetermined grammatical information, a date / time term searching means for checking the presence / absence of a particular headword representing the date / time information, and a particular headword representing the date / time information are found by the date / time term searching means. At this time, referring to the date / time term conversion dictionary and the current date / time information generating means, date / time term conversion for converting a specific entry word representing the date / time information into the corresponding date / time information And a date / time display format storage unit for storing a plurality of date / time display formats, and a date / time display unit for displaying the conversion result in a predetermined format determined from the plurality of date / time display formats, When the used format is undesired, a date and time display format selection means for displaying the conversion result in the desired format selected from the rest of the plurality of date and time display formats, and one of the stored plurality of formats. Kana-kanji conversion provided with a date and time display format changing means for changing the date and time display format to be used when the user learns that the user has selected and the time and date information is displayed based on the learning result. apparatus.

16. The kana-kanji conversion device according to claim 15, wherein a result of converting a specific entry word representing the date and time information into corresponding date and time information is displayed, or a result of conversion as a normal word is displayed. A kana-kanji conversion device equipped with date / time conversion switching means for selecting whether or not to perform.

17. The kana-kanji conversion device according to claim 1, wherein date and time information indicating a predetermined time point such as time and date is stored in association with a specific entry word representing the date and time information. A date and time term conversion dictionary, a current date and time information generating means for generating the date and time information at the present time, and a date and time term searching means for checking the presence or absence of a specific headword representing the date and time information when performing the phrase segmentation processing according to predetermined grammatical information When a specific entry word representing the date / time information is found by the date / time term search means, the specific entry word representing the date / time information is referred to by referring to the date / time term conversion dictionary and the current date / time information generation means. To a corresponding date / time information and a conversion candidate of a specific entry word representing the date / time information is displayed. Date / time candidate mixed display means for displaying the conversion result to the date / time information in a predetermined format as one of the phonetic synonyms or as one of the allowed notations of a specific headword representing the date / time information. Kana-Kanji conversion device.

18. A kana-kanji conversion method for inputting a kana character string, referring to a dictionary, and writing out the input kana character string for each phrase, to generate a kana-kanji mixed character string candidate, wherein When performing segmentation and segmentation processing on a kana character string, the segmentation and segmentation processing is performed using the representative representation by referring to a word dictionary that stores word reading and one representative representation as constituent elements. If the words that compose the phrase obtained by the segmentation processing are words for which multiple notations are allowed, the predetermined notation is determined as the display notation, and the word represented by the display notation is mixed with the Kana-Kanji. A kana-kanji conversion method for displaying as a first candidate of a word constituting a character string candidate.

19. A kana-kanji conversion method for inputting a kana character string, referring to a dictionary, writing out the input kana character string for each phrase, and generating a kana-kanji mixed character string candidate. When performing the phrase segmentation processing for a kana character string, a specific headword representing time and date information indicating a predetermined time such as time and date is searched, and a specific headword representing the date and time information is found. When referring to the date and time term conversion dictionary that stores the date and time information and a specific entry word representing the date and time information in association with each other, the specific entry word representing the date and time information is referred to. To the corresponding date / time information, and the conversion result is displayed in a predetermined format determined from a plurality of stored date / time display formats, and the displayed format is undesired. At this time, the conversion result is displayed in a desired format selected from the rest of the plurality of date and time display formats, and it is learned that one of the plurality of stored formats is selected by the user. A kana-kanji conversion method for changing the date and time display format used when displaying the date and time information based on the above.

20. For a word in which a plurality of notations allowed in Japanese exist, the plurality of allowed notations are organized into one representative notation and a derivative notation which is the notation permitted except for the representative notation. A kana-kanji conversion device that uses the representative notation in the processing of segmentation and segmentation in kana-kanji conversion, and uses one of the representative representation and the derivative representation in the representation after segmentation and segmentation.