JPS63308665A

JPS63308665A - Kana/kanji converting device

Info

Publication number: JPS63308665A
Application number: JP62144862A
Authority: JP
Inventors: Yasuo Koyama; 小山　泰男
Original assignee: Seiko Epson Corp
Current assignee: Seiko Epson Corp
Priority date: 1987-06-10
Filing date: 1987-06-10
Publication date: 1988-12-16

Abstract

PURPOSE:To execute the clause delimitting learning of a KANA (Japanese syllabary)/ KANJI (Chinese character) conversion more securely by connecting information for two clauses and registering them to a clause delimitting learning dictionary. CONSTITUTION:When the clause of an analyzing position is a clause delimitting change clause, a clause to which a clause length change is executed and the next clause are succeed and in a condition a clause delimitting mark is placed, they are registered to a learning dictionary 12. However, when the same index exists in the dictionary 12, after this is deleted, the said clause delimiting data are registered. Next, when the independent word part of the established clause is not smaller than the independent word part in the other clause candidacy, to the homonym of the said independent word, a word dictionary 11 is rewritten so that the said independent word can be a forefront. When the independent word part of a selected and shown clause is smaller than the independent word part of the other clause candidacy, with a unit not smaller than the length of the longest independent word, in a condition an attached word is combined to the independent word, it is newly registered to the word dictionary 11. Thus, the clause delimitting learning of a KANA/KANJI conversion can be executed more securely.

Description

【発明の詳細な説明】［産業上の利用分野Ｊ本発明は、日本語文章を入力する為に入力された仮名文
字列を、仮名漢字混じり文章に変換する仮名漢字変換装
置に関する。DETAILED DESCRIPTION OF THE INVENTION [Industrial Application Field J] The present invention relates to a kana-kanji conversion device that converts a kana character string input for inputting a Japanese sentence into a sentence containing kana and kanji.

〔従来の技術］従来の仮名漢字変換装置においては、文節区切りを学習
する為、文節長を変更した文節と文節長を変更しない文
節の両者を混在させて、文節そのものを１つの単語とし
て、単語辞書とは別に学習する文節情報を学習文節辞書
に登録していた。[Prior art] In order to learn bunsetsu divisions, conventional kana-kanji conversion devices mix both bunsetsu whose bunsetsu lengths have been changed and bunsetsu whose bunsetsu lengths have not changed, treat the bunsetsu themselves as one word, and convert them into words. Separately from the dictionary, the phrase information to be learned was registered in the learning phrase dictionary.

［発明が解決しようとする問題点］従来技術によれば、文節そのものを１つの単語として登
録しである為、例えば「きょうははながさいた。」の入力に対し「今日は／花
が／最多。」と変換され、ここで「最多」という文節に
対する文節候補（「最多）　「咲いた」「裂いた」「射
出」等）がら「咲いた」を選択すると「咲いた」を記憶
する事により、次の解析では「きのうもはながさいた」の入力に対し、「昨日も／花
が／咲いた。」と変換する事ができるが、「ここではきものをぬぐ。」の入力に対し、「ここで／
履物を／脱ぐ。」と変換されたものを「ここでは７着物
を／脱ぐ。」と「ここで」という３文字の文節の長さを
「ここでは」という４文字の文節の長さに伸ばして変換
した場合、「ここでは」が学習され、文節分ち書き処理
において、この学習された文節区切りに優先度を与えて
処理すれば、次に同一の入力を行った場合は「ここでは
きものをぬぐ。」の入力に対し「ここでは７着物を／脱
ぐ。」と変換できるものの、１文節のみの学習の為「ここではものをとぐ」の入力に対して、本来ならば「ここで／刃物を／研ぐ。」と変換されたいのにも関わ
らずｒここでは／物を／研ぐ。」と「ここでは」が優先され
て変換される。[Problems to be Solved by the Invention] According to the prior art, the phrase itself is registered as one word, so for example, in response to the input of "Today is a flower," it is possible to input "Today is / Flowers / the most.".'', and here, if you select ``bloomed'' from the phrase candidates (``most often'', ``bloomed'', ``torn'', ``ejected'', etc.) for the phrase ``most,'' then ``bloomed'' will be memorized. In the following analysis, the input ``Yesterday the flowers bloomed'' can be converted to ``The flowers bloomed yesterday too.'' However, the input ``Here I wipe my kimono.'' here/
take off/take off one's footwear. '' is converted to ``Here I take off my 7 kimonos.'' If we extend the length of the three-character phrase ``here'' to the length of the four-character phrase ``here'', we get "Here," is learned, and if this learned phrase break is given priority in the bunsetsu division process, the next time the same input is made, "Here, wipe the kimono." Although the input can be converted to ``Here I take off my seven kimonos.'', since we are learning only one phrase, the input ``Here I sharpen things'' should normally be converted to ``Here I sharpen my cutlery.'' Even though it is wanted to be translated as ``.'', here we are sharpening /things/. ” and “here” are converted with priority.

すなわち、「ここでは７着物を」の学習結果が「ここで
／刃物を」と変換したいところを「ここでは／物を」と
変換し、本来２文節間の関係により文節区切りを行うべ
き文節分ち書き処理に不要な情報を与えて、よけいな解
析を行ってしまう。In other words, the learning result of ``Here, 7 kimono'' is converted to ``Here / cutlery'', but it is converted to ``Here / thing'', and the phrase should be separated based on the relationship between two phrases. This will give unnecessary information to the postscript process, leading to incorrect analysis.

さらに、「しよどうかをたずねる。」の入力に対し、「書道かを
／訪ねる。」と変換されたものを「書道／家を／訪ねる
。」と文節長を変更した場合、「書道」が学習されるの
みであって、次に再度同一の入力を行っても、「書道かを／訪ねる。」としか変換されないという欠点
があった。Furthermore, in response to the input "Ask whether you want to do something.", if you change the phrase length from "Calligraphy or visiting/visiting." to "Calligraphy/home/visiting.", "Calligraphy" becomes The problem was that it was only learned, and even if the same input was made again next time, it would only be converted to ``Calligraphy or visiting.''

本発明は上記欠点を解決するもので、２文節分の情報を
連接して文節区切り学習辞書に登録する事により、より
正確な仮名漢字変換の文節区切り学習を行う仮名漢字変
換装置を提供する事を目的とする。The present invention solves the above-mentioned drawbacks, and provides a kana-kanji conversion device that performs more accurate phrase separation learning for kana-kanji conversion by concatenating information for two phrases and registering the same in a phrase separation learning dictionary. With the goal.

［問題点を解決する為の手段］これらの問題を解決する為、本発明は、べた書き仮名文
字列を文節分ち書きした仮名漢字混じり文に変換する装
置において、かな見出しとそれに対応する漢字文字列と
その文法情報である品詞とを持つ単語辞書と、これを用
いて文節分ち書きを行う文節分ち書き手段と、この文節
分ち書きされた仮名漢字混じり文を文節区切りを明確に
した状態で表示する分ち書き表示手段と、分ち書きした
文節単位に文節候補を次々と表示せしめる文節候補表示
手段と、文節長を変更しない文節に対して、選択され表
示されている文節に対し文節候補を単語辞書に学習する
文節候補学習手段と、予め行なわれた文節分ち書きに対
して対象となる文節長を伸び縮みさせて、対象文節以降
の文章を再解析し直す文節長変更手段と、文節長変更を
行った文節と次の文節を連接させ文節区切りマークを持
ち、文節区切りを記憶する文節区切り学習手段と、この
文節区切り情報を持つ文節区切り学習辞書とを持ち、文
節分ち書き処理において、文節区切り学習辞書と単語辞
書の両者を解析対象として解析し、結果的に文節区切り
学習辞書の内容が文節対象となった場合、文節区切りマ
ークにより、これを２文節とせしめ文節を分ち書きする
事を特徴とする特［作用］べた書き仮名文字列を文節分ち書きした仮名漢字混じり
文に変換する場合、入力手段により入力された仮名文字
列を、文節分ち書き手段により単語辞書と文節区切り学
習辞書を同一方法で検索し両者とも１つの自立語として
文法解析を行い文節分ち書きを行い、文節区切り学習辞
書に登録された文章が１文節の候補となった場合は、そ
の中の区切りマークに基づき２文節に分解し、文節区切
りを明確にした状態で仮名漢字混じり文を分ち書き表示
手段により表示し、文節単位に同音異語の文節候補を文
節候補表示手段により表示せしめると共に、文節区切り
が正しくない場合は、文節長変更手段により文節長を伸
び縮みさせて、対象文節以降の文章を再解析し直す事に
より正しい文節分ち書き状態の仮名漢字変換結果を得る
。[Means for Solving the Problems] In order to solve these problems, the present invention provides an apparatus for converting a solid kana character string into a sentence with kana and kanji mixed into segments. A word dictionary that has character strings and parts of speech that are their grammatical information, a phrase dividing means that uses this to divide phrases into phrases, and clearly divide phrases into sentences mixed with kana and kanji that have been divided into phrases. a phrase candidate display means that displays phrase candidates one after another for each divided phrase, and a phrase candidate display means that displays phrase candidates one after another for each phrase that has been divided, and a phrase candidate display means that displays phrase candidates one after another for each phrase that has been divided, and for phrases whose phrase length is not changed, On the other hand, there is a phrase candidate learning means that learns phrase candidates into a word dictionary, and a phrase length change that expands or contracts the target phrase length based on the phrase segmentation that has been done in advance, and reanalyzes the sentence after the target phrase. a bunsetsu break learning means that concatenates the bunsetsu whose bunsetsu length has been changed and the next bunsetsu, has a bunsetsu break mark, and stores the bunsetsu break, and a bunsetsu break learning dictionary that has this bunsetsu break information; In the text writing process, if both the bunsetsu break learning dictionary and the word dictionary are analyzed, and as a result, the contents of the bunsetsu break learning dictionary become bunsetsu targets, the bunsetsu break marks are used to separate them into two bunsetsu. [Action] When converting a solid kana character string into a sentence with kana and kanji mixed in, the kana character string inputted by the input means is converted into a sentence divided into segments. When the word dictionary and the bunsetsu break learning dictionary are searched in the same way, both are grammatically analyzed as one independent word, and the bunsetsu break is performed, and the sentence registered in the bunsetsu break learning dictionary becomes a candidate for one bunsetsu. divides it into two clauses based on the delimiter mark therein, displays the sentence containing kana and kanji using a separate display means with clear clause divisions, and displays clause candidates for homophones for each clause. If the bunsetsu separation is incorrect, the bunsetsu length is lengthened or shortened using the bunsetsu length changing means, and the text after the target clause is reanalyzed to produce the correct kana-kanji conversion result. get.

ここで変換結果を学習する為、まず文節長が変更されて
いない文節に対して、文節候補学習手段により選択され
表示されている文節の自立語部分が他の文節候補中の自
立語部分より小さくない場合は、当該自立語の同音異語
集団に対して当該自立語が先頭となる様に書き換え、選
択され表示されている文節の自立語部分が他の文節候補
中の自立語部分より小さい場合は、最長の自立６６の長
さより小さくない単位で自立語に付属語を結合した状態
で単語辞書に新たに登録する。In order to learn the conversion results here, first, for a clause whose clause length has not been changed, the independent word part of the clause selected and displayed by the clause candidate learning means is smaller than the independent word part of other clause candidates. If not, rewrite the homophone group of the independent word so that the independent word comes first, and if the independent word part of the selected and displayed clause is smaller than the independent word part of other clause candidates. is newly registered in the word dictionary in a state in which an independent word is combined with an adjunct word in units not smaller than the length of the longest independent word 66.

また文節長が変更された文節に対しては、文節区切り学
習手段により、文節長変更を行った文節と次の文節を連
接させ文節区切りマークを付けた状態で、文節区切り学
習辞書に登録する。ただし、文節区切り学習辞書に、同
一の見出しが存在した場合は、これを削除した後当該文
節区切りデータを登録する。Furthermore, for the phrase whose phrase length has been changed, the phrase whose phrase length has been changed is concatenated with the next phrase by the phrase break learning means, and the phrase is registered in the phrase break learning dictionary with a phrase break mark attached. However, if the same heading exists in the phrase break learning dictionary, the phrase break data is registered after deleting it.

［実施例］　゛ここで、以下に本発明の詳細を示した実施例に基づいて
、説明する。[Example] Here, the present invention will be explained based on an example showing details of the present invention.

第１図は、本発明の仮名漢字変換装置を示す図である。FIG. 1 is a diagram showing a kana-kanji conversion device of the present invention.

図中符号１は入力部２より入力されたデータに基づきシ
ステム全体の制御を行う中央処理装置（ＣＰＵ）である
。このＣＰＵにより総ての装置の制御がおこなわれる。Reference numeral 1 in the figure is a central processing unit (CPU) that controls the entire system based on data input from the input section 2. This CPU controls all devices.

入力部２で入力された仮名文字列は、仮名漢字変換装置
８に送られ仮名漢字変換が行われ、文節候補を文節候補
表示部３により表示しならがら、文節候補選択部５によ
り文節単位に文節候補を表示選択し、文節候補確定部６
により所望の文節候補を確定し、仮名漢字文字列は文章
として、文章格納部７に格納される。The kana character string inputted in the input section 2 is sent to the kana-kanji conversion device 8 where kana-kanji conversion is performed, and while the phrase candidates are displayed on the phrase candidate display section 3, the phrase candidate selection section 5 converts them into phrases in units of phrases. Display and select phrase candidates, and phrase candidate confirmation section 6
A desired clause candidate is determined by the above steps, and the kana-kanji character string is stored in the text storage section 7 as a text.

仮名漢字変換装置Ｎｌ２内では、仮名文字獲得部９によ
り、仮名文字が受は取られ、文節分かち書き部１０にお
いて単語辞書１１および文節区切り学習辞書１２を検索
し、文法処理を行いながら文節分かち書き処理を行う。In the kana-kanji conversion device Nl2, the kana characters are picked up by the kana character acquisition unit 9, and the phrase separation unit 10 searches the word dictionary 11 and the phrase separation learning dictionary 12, and performs the phrase separation process while performing grammar processing. conduct.

ここで分ち書きの結果、文節区切り学習辞書に登録され
た文章が１文節の候補となった場合は、その中の区切り
マークに基づき２文節に分解し、当該２文節の同音異語
文節を単語辞書１１を検索し作成し、文節候補格納部１
６に格納する。If the sentence registered in the phrase separation learning dictionary becomes a candidate for one phrase as a result of this separation, it is divided into two phrases based on the break mark therein, and the homophone and allophone phrases of the two phrases are separated. The word dictionary 11 is searched and created, and the phrase candidate storage unit 1 is created.
6.

ここで分ち書き表示部１４は、文節候補格納部１６より
各々の文節の第１候補を読み出し文節候補表示部３に表
示する。その後文節候補選択部５により文節候補格納部
１６より文節候補出力部１５より文節単位に次々と候補
を読み出し、文節候補表示部３に表示しながら、所望の
仮名漢字文字列を文節単位に選択する。Here, the division display section 14 reads out the first candidate for each clause from the clause candidate storage section 16 and displays it on the clause candidate display section 3. Thereafter, the phrase candidate selection section 5 reads candidates one after another from the phrase candidate storage section 16 and the phrase candidate output section 15 in phrase units, and while displaying them on the phrase candidate display section 3, selects desired kana-kanji character strings in phrase units. .

そして文節分ち書きに誤りがある場合は、文節長変更部
４により文節を伸び縮みさせ文節再解析部１３に当該文
節以降を再解析するように指示する。文節再解析部１３
は、さらに限定された部分の解析を文節分ち書き部１０
に指示し、解析結果は文節候補格納部１６に文節長変更
があった旨を文節長変更文節データに付加し格納する。If there is an error in the bunsetsu division, the bunsetsu length changing unit 4 extends or shortens the clause and instructs the clause re-analysis unit 13 to re-analyze the clause after that clause. Clause reanalysis part 13
In this example, the analysis of a more limited part is performed using the bunsetsu segmentation section 10.
The analysis result is stored in the phrase candidate storage unit 16 by adding information indicating that the phrase length has been changed to the phrase length changed phrase data.

そして分ち書き表示部は文節長変更された文節以降の仮
名漢字文字列を書き換える。Then, the division display section rewrites the kana-kanji character string after the clause whose clause length has been changed.

以上の操作により所望の仮名漢字文字列が得られたなら
ば、文節候補確定部６により確定され結果は文章格納部
７に格納されると共に、確定文節情報受取部１７に送ら
れる。Once the desired kana-kanji character string is obtained through the above operations, it is determined by the phrase candidate determining section 6, and the result is stored in the text storage section 7 and sent to the determined phrase information receiving section 17.

確定文節情報受取部１７に送られた情報に基づき文節候
補学習部１８は、文節長が変更されていない文節に対し
ては、選択され表示されている文節の自立語部分が他の
文節候補中の自立語部分より小さくない場合は、当該自
立語の同音異語集団に対して当該自立語が先頭となる様
に単語辞書１１を書き換え、選択され表示されている文
節の自立語部分が他の文節候補中の自立語部分より小さ
い場合は、最長の自立語の長さより小さくない単位で自
立語に付属語を結合した状態で単語辞１１１１に新たに
登録する。また文節長が変更された文節に対しては、文
節長変更を行った文節と次の文節を連接させ文節区切り
マークを付けた状態で、文節区切り学習辞書１２に登録
する。ただし、文節区切り学習辞書に、同一の見出しが
存在した場合は、これを削除した後当該文節区切りデー
タを登録する。Based on the information sent to the confirmed clause information receiving unit 17, the clause candidate learning unit 18 determines whether the independent word part of the selected and displayed clause is among other clause candidates for clauses whose clause length has not been changed. If it is not smaller than the independent word part of the selected and displayed clause, the word dictionary 11 is rewritten so that the independent word comes first in the homophone group of the independent word, and the independent word part of the selected and displayed clause is If it is smaller than the independent word part in the clause candidate, the independent word is newly registered in the dictionary 1111 in a state in which the attached word is combined with the independent word in units not smaller than the length of the longest independent word. Furthermore, for the phrase whose phrase length has been changed, the phrase whose phrase length has been changed is concatenated with the next phrase and is registered in the phrase break learning dictionary 12 with a phrase break mark attached. However, if the same heading exists in the phrase break learning dictionary, the phrase break data is registered after deleting it.

第２図は、本発明の仮名漢字変換装置の文節候補作成に
関する動作を説明するフローチャートである。FIG. 2 is a flowchart illustrating the operation of the kana-kanji conversion device of the present invention regarding the creation of clause candidates.

前記した通り、本発明の仮名漢字変換装置はく２０１〉
で仮名文字列を受は取り、仮名解析位置の初期設定を行
い、＜２０２＞により所定の仮名解析位置より単語辞書
１１と文節区切り学習辞書１２を検索し、単語候補を取
り出す。これを基に＜２０３＞により文節分かち書きが
行われる。As mentioned above, the kana-kanji conversion device 201 of the present invention
The kana character string is received, the kana analysis position is initialized, and the word dictionary 11 and the clause break learning dictionary 12 are searched from the predetermined kana analysis position in <202> to extract word candidates. Based on this, clause separation is performed by <203>.

そして＜２０４＞により分ち書きされた文節内に学習辞
書文節が存在した場合は、＜２０５＞により、これらの
文節を区切りマークに従い２文節に分解すると共に、１
文筒車位に単語辞書を検索し各文節の同量異語候補を作
成し、全文節候補が出揃った時点で＜２０６＞により文
節区切りが確定しく２０７＞により分ち書き表′示がな
される。If there is a learning dictionary clause in the clause separated by <204>, then by <205>, these clauses are separated into two clauses according to the delimiter marks, and one
The word dictionary is searched at the same level as the Bunzuru wheel, and the same number of different word candidates for each clause is created. When all the clause candidates are obtained, the clause break is determined by <206>, and the division table is displayed by 207>. .

第３図は、本発明の仮名漢字変換装置の文節候補学習に
関する動作を説明するフローチャートである。FIG. 3 is a flowchart illustrating the operation of the kana-kanji conversion device of the present invention regarding phrase candidate learning.

前記した通り、本発明の仮名漢字変換装置の文節候補学
習は＜３０１＞で確定文節情報を受は取り、解析位置を
まず先頭文節とする。As described above, the phrase candidate learning of the kana-kanji conversion device of the present invention receives the confirmed phrase information in <301>, and first sets the analysis position to the first phrase.

そして＜３０２＞により解析位置の文節が、文節区切り
変更文節であるならば＜３０３＞により文節長変更を行
った文節と次の文節を連接させ文節区切りマークを付け
た状態で、文節区切り学習辞書１２に登録する。ただし
、文節区切り学習辞書に、同一の見出しが存在した場合
は、これを削除した後当該文節区切りデータを登録する
。（図示せず）さらに＜３０４＞により確定された文節の自立語部分が
他の文節候補中の自立語部分より小さくない場合は、＜
３０５＞により当該自立語の同音異語集団に対して当該
自立語が先頭となる様に単語辞書１１を書き換える。ま
た選択され表示されている文節の自立語部分が他の文節
候補中の自立語部分より小さい場合は、＜３０６＞によ
り最長の自立語の長さより小さくない単位で自立語番ご
付属語を結合した状態で単語辞書１１に新たに登録する
。Then, if the phrase at the analysis position is a phrase whose length has been changed by <302>, the phrase whose length has been changed by <303> is concatenated with the next phrase, and with a phrase delimiter mark attached, the phrase delimiter learning dictionary Register on 12th. However, if the same heading exists in the phrase break learning dictionary, the phrase break data is registered after deleting it. (Not shown) Furthermore, if the independent word part of the clause determined by <304> is not smaller than the independent word part in other clause candidates, <
305>, the word dictionary 11 is rewritten so that the independent word comes first in the homophone group of the independent word. In addition, if the independent word part of the selected and displayed bunsetsu is smaller than the independent word parts of other bunsetsu candidates, <306> combines the independent word number and attached word in units that are not smaller than the length of the longest independent word. In this state, the word is newly registered in the word dictionary 11.

これらの処理が終了すると＜３０７＞により、処理中の
文節が最終文節であるか確認し、最終文節でなければ、
処理文節を法文節に移しく図示せず）＜３０２＞よりの
処理を繰り返して行う。なお当該文節が最終文節であれ
ば、これで処理を終了する。When these processes are completed, it is checked in <307> whether the clause being processed is the final clause, and if it is not the final clause,
The process from <302> (not shown) is repeated to transfer the processed clause to the legal clause. Note that if the phrase is the final phrase, the process ends here.

［発明の効果］本発明は、次のような効果がある。[Effect of the invention] The present invention has the following effects.

（１）文節区切り学習装置！１２に、２文節が区切りマ
ーク付で記憶され、これが文節分ち書き部１０で解析に
使用される事により、２文節間の関係に限定された学習
結果が反映でき、よけいな解析を行ってしまう危険がな
い。(1) Clause break learning device! In step 12, the two clauses are stored with delimiter marks, and this is used for analysis by the clause divider 10, so that the learning results limited to the relationship between the two clauses can be reflected, allowing for detailed analysis. There is no danger of it being lost.

（２）文節区切り学習装置１２に、２文節が区切りマー
ク付で記憶され、これが文節分ち書き部１０で解析に使
用される事により、１文節となりうる文節を文節長変更
部４により２文節にした場合、正確に学習できる。(2) Two clauses are stored in the clause break learning device 12 with a break mark, and the clauses are used for analysis in the clause divider 10, so that the clause length changing unit 4 converts the clauses into two clauses. If you do this, you can learn accurately.

（３）文節区切り学習辞書１２を単語辞書１１と独立さ
せる事により単語辞書を圧迫する事がなくなる。(3) By making the bunsetsu division learning dictionary 12 independent from the word dictionary 11, the word dictionary is not overwhelmed.

[Brief explanation of drawings]

第１図は、本発明の仮名漢字変換装置を示すブロック図
である。第２図は、本発明の仮名漢字変換装置の文節分ち書き動
作を示すフローチャートである。第３図は、本発明の仮名漢字変換装置の文節学習の動作
を示すフローチャートである。１０・・・・・文節分ち書き部１２・・・・・文節区切り学習辞書１８・・・・・文節候補学習部以　　　上、′〆、− 代理人　　弁理士　最上　務　他１名　　・第２図FIG. 1 is a block diagram showing a kana-kanji conversion device of the present invention. FIG. 2 is a flowchart showing the bunsetsu division writing operation of the kana-kanji conversion device of the present invention. FIG. 3 is a flowchart showing the phrase learning operation of the kana-kanji conversion device of the present invention. 10...Bunsetsu division learning section 12...Bunsetsu separation learning dictionary 18...Bunsetsu candidate learning section Above, '〆, - Agent Patent attorney Tsutomu Mogami and 1 other person ・Second figure

Claims

[Claims]

In a device that converts a solid kana character string into a kana-kanji mixed sentence written in segments, it uses a word dictionary that has kana headings, corresponding kanji character strings, and parts of speech that are grammatical information, and uses this to convert text into segments. A bunsetsu segment writing means for performing bunsetsu writing, a split writing display means for displaying the bunsetsu segmented and mixed kana-kanji sentences with clear bunsetsu divisions, and a bunsetsu candidate for each segmented bunsetsu unit. a phrase candidate display means for displaying phrase candidates one after another; a phrase candidate learning means for learning phrase candidates in a word dictionary for phrases selected and displayed for phrases whose phrase lengths are not changed; It has a clause length changing means that expands or shortens the target clause length for writing and reanalyzes the sentence after the target clause, and a clause break mark that connects the clause whose clause length has been changed with the next clause. , has a bunsetsu break learning means that stores bunsetsu breaks and a bunsetsu break learning dictionary that has this bunsetsu break information, and in the bunsetsu break writing process, analyzes both the bunsetsu break learning dictionary and the word dictionary as analysis targets, and calculates the results. This kana-kanji conversion device is characterized in that when the contents of a phrase separation learning dictionary become a phrase, it is divided into two phrases using a phrase delimiter mark, and the phrases are written separately.