JPH06223055A

JPH06223055A - Document input device

Info

Publication number: JPH06223055A
Application number: JP5008735A
Authority: JP
Inventors: Hiroshi Yamada; 洋志山田
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 1993-01-22
Filing date: 1993-01-22
Publication date: 1994-08-12

Abstract

PURPOSE:To convert an input which contains KANJI (Chinese character) partially from KANJI to KANA (Japanese syllabary) at the time of input by KANA-KANJI conversion and to prevent word dictionary capacity from increasing so much. CONSTITUTION:The document input device consists of an input device 1, a converting device 2 which converts an inputted character string, an output device 3 such as a display where a character string as a conversion result is displayed, and a controller 4 which controls the whole operation and the converting device 2 is equipped with a word dictionary 21 in which the reading, description, and part of speech of the inputted character string are registered, a word retrieval part 22 which retrieves words registered in a word dictionary 21 according to the inputted character string, a word storage part 23 which stores the words retrieved by the word retrieval part 22, a word confirmation part 24 which confirms whether or not the retrieved words match the input character string, and a word selection part 25 which selects proper words out of the retrieved word group.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は文章入力装置に関し、特
に漢字の混在する文字列を入力としてかな漢字変換を行
う文章入力装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a text input device, and more particularly to a text input device for converting a kana-kanji character by inputting a character string containing a mixture of kanji characters.

【０００２】[0002]

【従来の技術】現在普及している文章入力装置として、
かなあるいはローマ字による入力を漢字かな混じりの文
字列に変換するかな漢字変換がある。これらの装置の入
力文字列としては、かなだけではなく漢字が直接入力さ
れる場合もある。例えば、ペンとタブレットを用いた手
書き文字認識による入力の場合は、漢字やかなを混ぜて
入力することができる。また、漢字コードによる入力方
式では、漢字コードの不明な漢字を入力するための補助
機能としてかな漢字変換機能を持つものがある。いずれ
の装置にも入力の一部あるいは全部を漢字で入力できる
かな漢字変換装置が利用されている。2. Description of the Related Art As a popular text input device,
There is Kana-Kanji conversion that converts Kana or Romaji input into a character string containing Kanji and Kana. As input character strings for these devices, not only kana but also kanji may be directly input. For example, in the case of input by handwriting character recognition using a pen and a tablet, it is possible to mix and input kanji and kana. In addition, some kanji-kanji input methods have a kana-kanji conversion function as an auxiliary function for inputting kanji whose kanji code is unknown. A kana-kanji conversion device that can input part or all of the input in kanji is used in both devices.

【０００３】従来の漢字の混在した文章を入力としてか
な漢字変換を行う文章入力装置の例が、『２ストロ−ク
入力のための仮名漢字変換』（塩見、喜多、河合、大
岩：情報処理学会論文誌、Ｖｏｌ．３３，Ｎｏ．７，ｐ
ｐ．９２０−９２７，１９９２）、あるいは、『表記変
換つきの形態素解析プログラムとその応用』（金子、鳥
原、建石：情報処理学会第４５回全国大会、４Ｃ−４，
１９９２）に記載されている。An example of a conventional text input device for performing kana-kanji conversion by inputting a sentence containing mixed kanji is "Kana-kanji conversion for 2-stroke input" (Shiomi, Kita, Kawai, Oiwa: Information Processing Society of Japan Magazine, Vol.33, No.7, p
p. 920-927, 1992), or "A morphological analysis program with notation conversion and its application" (Kaneko, Torihara, Kenseki: IPSJ 45th National Convention, 4C-4,
1992).

【０００４】いずれの論文に記載された文章入力装置で
も、漢字の混在した文章のかな漢字変換を行うために、
各単語の読みの一部あるいは全部を漢字にした単語を単
語辞書に登録している。例えば、読みが“どようび”で
表記が“土曜日”の単語について、後者の論文の方式で
は図２（ａ）に示す８通りを登録する。In any of the text input devices described in any of the papers, in order to perform kana-kanji conversion of a text in which kanji are mixed,
Words in which some or all of the reading of each word is in Kanji are registered in the word dictionary. For example, with respect to the word whose reading is “Doyoubi” and whose notation is “Saturday”, eight ways shown in FIG. 2 (a) are registered in the method of the latter paper.

【０００５】前記の従来の文章入力装置のかな漢字変換
過程は、入力文字列や単語辞書の読みに漢字を許すよう
にする以外はかなのみを入力文字列とする一般のかな漢
字変換装置と同様である。The kana-kanji conversion process of the above-mentioned conventional text input device is the same as that of a general kana-kanji conversion device that uses only kana as an input character string except that kanji is allowed for reading an input character string or a word dictionary. .

【０００６】以下では、従来の単語辞書の構成法の例を
挙げる。An example of a conventional word dictionary construction method will be given below.

【０００７】単純な方法としては、図２のように直接単
語情報を記述し、検索時のキーでソ−トしておいて二分
検索法などで検索する方法がある。As a simple method, there is a method in which word information is directly described as shown in FIG. 2, sorted with a key at the time of search, and then searched by a binary search method or the like.

【０００８】単語検索のためのキーの先頭の共通部分を
共有する方式（図７）では、共有した部分について容量
が削減できる。また、日本語処理でよく使われる、先頭
部分の文字が共通である単語の検索作業が容易である。In the method of sharing the common part at the beginning of the key for word search (FIG. 7), the capacity of the shared part can be reduced. In addition, it is easy to search for a word that has a common first character, which is often used in Japanese processing.

【０００９】さらに容量を圧縮するために、『二つのト
ライを用いた自然言語辞書検索技法』（森本、青江：情
報処理学会第４５回全国大会、５Ｆ−４，１９９２）で
は単語末尾の共通部分を共有する方法が提案されている
（図８）。In order to further reduce the capacity, in the "natural language dictionary search technique using two tries" (Morimoto, Aoe: 45th National Convention of Information Processing Society of Japan, 5F-4, 1992), the common part at the end of words Has been proposed (Fig. 8).

【００１０】その他、各種の辞書構成法や検索法につい
ては、“ＴｈｅＡｒｔｏｆＣｏｍｐｕｔｅｒＰ
ｒｏｇｒａｍｍｉｎｇ”（Ｋｎｕｔｈ，１９７３）、
『連載講座：キー検索技法』（青江：情報処理，Ｖｏ
ｌ．３３，Ｎｏ．１１，ｐｐ．１３５９−１３６６，１
９９２）などで参照できる。For other various dictionary construction methods and search methods, see "The Art of Computer P".
programming ”(Knuth, 1973),
"Serial Series: Key Search Technique" (Aoe: Information Processing, Vo
l. 33, No. 11, pp. 1359-1366, 1
992) and the like.

【００１１】[0011]

【発明が解決しようとする課題】前述した従来の文章入
力装置では、可能な混ぜ書き方法をすべて単語登録して
いる。しかし、この方法で単語をどのように混ぜ書きし
ても対応できるようにするためには、ｎ文字の漢字を含
む単語では一般に２ⁿ語を登録する必要があり、単語辞
書に登録する語数が非常に大きくなる。これらの混ぜ書
き単語は、あらゆる文字位置で共通な部分を持っている
ため、単語の先頭部・末尾部だけを共有した単語辞書を
備える従来の文章入力装置では、十分に辞書容量を圧縮
できないという課題がある。In the above-mentioned conventional text input device, all possible mixed writing methods are registered as words. However, in order to be able to handle no matter how mixed words are written by this method, it is generally necessary to register 2 ⁿ words for words containing n kanji characters, and the number of words to be registered in the word dictionary is Grows very large. Since these mixed words have a common part at every character position, the conventional text input device equipped with a word dictionary sharing only the beginning and end of the word cannot sufficiently reduce the dictionary capacity. There are challenges.

【００１２】本発明の目的は、上述の問題点を解決し、
読みに漢字が混在している単語を単語辞書に登録する際
に、対応するかな書き語と共通の部分を共有することに
より辞書容量の増加を少なくする文章入力装置を提供す
ることである。The object of the present invention is to solve the above-mentioned problems,
It is an object of the present invention to provide a sentence input device that reduces the increase in the dictionary capacity by sharing a common part with a corresponding kana written word when registering a word in which kanji is mixed in reading into a word dictionary.

【００１３】[0013]

【課題を解決するための手段】上述した問題点を解決す
るため、本発明の文章入力装置は、入力文字列を入力す
る入力装置と、各単語について置き換え前の第一の文字
列、置き換え後の第二の文字列及び品詞を登録する単語
辞書と、第一の文字列が前記入力文字列の一部分と一致
する単語を前記単語辞書から検索する単語検索部と、前
記単語検索部が検索した各単語について第一の文字列と
第二の文字列を比較して前記単語辞書に登録されていな
い単語を削除する単語確認と、前記単語確認による単語
の削除を行ったあとに残った単語から出力文字列を作成
する単語選択部と、該出力文字列を表示する出力装置と
を備え、前記単語辞書は、前記第一の文字列に漢字を含
む単語を登録する場合に、該単語と同一の前記第二の文
字列及び同一の品詞を持ち、前記第一の文字列に漢字を
含まない単語との間で前記第一の文字列の共通部分を共
有する。In order to solve the above-mentioned problems, a sentence input device of the present invention comprises an input device for inputting an input character string, a first character string before replacement for each word, and a replacement after replacement. Of the second character string and part of speech, a word search unit for searching the word dictionary for a word in which the first character string matches a part of the input character string, and the word search unit. For each word, compare the first character string and the second character string to delete the word not registered in the word dictionary, and from the words remaining after deleting the word by the word confirmation. The word dictionary includes a word selection unit that creates an output character string, and an output device that displays the output character string, and the word dictionary is the same as the word when the word including Chinese characters is registered in the first character string. Said second character string and the same item The have to share a common portion of the first string between the words without the Chinese character in the first string.

【００１４】[0014]

【実施例】次に、本発明について図面を参照して説明す
る。DESCRIPTION OF THE PREFERRED EMBODIMENTS Next, the present invention will be described with reference to the drawings.

【００１５】図１は、本発明の文章入力装置の一実施例
を示すブロック図である。FIG. 1 is a block diagram showing an embodiment of a text input device of the present invention.

【００１６】図１において、本発明の実施例は、入力用
タブレットやキーボードなどの入力装置１と、入力した
文字列を変換する変換装置２と、変換結果の文字列を表
示するディスプレイなどの出力装置３と、全体の動作を
制御する制御装置４とから構成される。In FIG. 1, the embodiment of the present invention is such that an input device 1 such as an input tablet or a keyboard, a conversion device 2 for converting an input character string, and an output such as a display for displaying the conversion result character string. It is composed of a device 3 and a control device 4 for controlling the overall operation.

【００１７】変換装置２は、単語の読み・表記・品詞を
登録している単語辞書２１と、入力した文字列から単語
辞書２１に登録されている単語を検索する単語検索部２
２と、単語検索部２２が検索した単語を格納しておく単
語格納部２３と、検索された単語が入力文字列と一致す
るかどうかを確認する単語確認部２４と、検索された単
語群から適切な単語を選択する単語選択部２５とを備え
る。The conversion device 2 includes a word dictionary 21 in which readings, notations, and parts of speech of words are registered, and a word search unit 2 for searching a word registered in the word dictionary 21 from an input character string.
2, a word storage unit 23 for storing the words retrieved by the word retrieval unit 22, a word confirmation unit 24 for confirming whether or not the retrieved word matches the input character string, and a retrieved word group. The word selection part 25 which selects a suitable word is provided.

【００１８】単語辞書２１は、単語の読みをもとに構成
される索引部と表記・品詞などのその他の情報からなっ
ている。なお、本発明の文章入力装置においては、読み
の一部または全部が漢字に置き替わっている場合も含め
て“読み”と呼んでいる。単語辞書２１の索引部は、登
録する単語の読みのｎ文字目に現れる文字をｎ−１文字
目が同一である文字ごとに集めたテーブル（ｎ文字目テ
ーブル）からなる木構造になっている。各ｎ文字目テー
ブルは、ｎ文字目の文字と、ｎ文字目までで構成される
単語の情報へのポインタと、その文字に引き続いて現れ
るｎ＋１文字目テーブルへのポインタの組み合わせから
なる。The word dictionary 21 is composed of an index part formed based on the reading of words and other information such as notations and parts of speech. In the text input device of the present invention, the term "yomi" is used, including the case where part or all of the pronunciation is replaced with kanji. The index unit of the word dictionary 21 has a tree structure including a table (n-th character table) in which characters appearing at the n-th character in reading the registered word are collected for each character having the same n-1 character. . Each n-th character table is composed of a combination of an n-th character, a pointer to information on a word composed up to the n-th character, and a pointer to an (n + 1) -th character table that appears after the character.

【００１９】例として、「上下（じょうげ）」、「上下
線（じょうげせん）」、「城下町（じょうかまち）」と
いう３単語を登録している単語辞書の索引部を図４に示
す。図４で４１は、単語の読みの１文字目に現れる文字
（“じ”）を登録したテーブルである。１文字目に現れ
る文字から、２文字目に現れる文字（“ょ”）を登録し
た２文字目テーブル４２へポインタ（図４では実線の矢
印で示した）が張られている。図４の例では４文字目に
現れる文字は“か”と“け”の２通りなので、４文字目
テーブル４４は２つの文字を含んでいる。さらに、４文
字目テーブルの各文字に続く５文字目テーブルが４５、
４６である。As an example, FIG. 4 shows an index portion of a word dictionary in which three words "upper and lower lines", "up and down lines" and "castle town" are registered. In FIG. 4, reference numeral 41 is a table in which the character (“ji”) appearing as the first character in reading the word is registered. A pointer (indicated by a solid arrow in FIG. 4) is provided from the character appearing as the first character to the second character table 42 in which the character appearing as the second character (“yo”) is registered. In the example of FIG. 4, the character that appears in the fourth character is "ka" or "ke", so the fourth character table 44 includes two characters. Furthermore, the fifth character table following each character of the fourth character table is 45,
46.

【００２０】各単語の末尾からは、その単語の表記、品
詞などの情報あるいは情報の格納場所（図４では二重丸
で示した）をさすポインタ（図４では破線の矢印で示し
た）が登録されている。From the end of each word, a pointer (indicated by a dashed arrow in FIG. 4) that points to information such as the notation and part of speech of the word or a storage location of the information (indicated by a double circle in FIG. 4) is displayed. It is registered.

【００２１】単語辞書２１は、混ぜ書き後が登録されて
いる場合は対応するかな書き語と共通部分を共有する。
例えば、“上下線”と“城下町”の混ぜ書き語（図２
（ｂ））を登録する場合、従来の文章入力装置の単語辞
書では図７に示す構成の索引部が必要になる。一方、本
発明による文章入力装置の単語辞書２１の索引部は図３
に示す構成になる。The word dictionary 21 shares a common part with the corresponding kana writing word when the post-mix writing is registered.
For example, mixed words of "upper and lower lines" and "castle town" (Fig. 2
When registering (b)), the word dictionary of the conventional text input device requires the index unit having the configuration shown in FIG. On the other hand, the index part of the word dictionary 21 of the text input device according to the present invention is shown in FIG.
The configuration is shown in.

【００２２】図３で、読みが“上げせん”の単語を登録
する場合、対応するかな書き語“じょうげせん”との共
通部分“げせん”を共有するため、１文字目テーブル３
１の“上”は、３文字目テーブル３３の“う”と同じ４
文字目テーブル３４へのポインタを持つ。また、読みが
“じょう下せん”の単語を登録する場合、共通部分“せ
ん”を共有するため、４文字目テーブル３４の“下”
は、“げ”と同じ５文字目テーブル３５へのポインタを
持つ。４文字目テーブル３４の“下”は、“じょ下ま
ち”の検索にも使用されるので、“じょうかまち”の
“か”と同じ５文字目テーブル３６へのポインタも持っ
ている。In FIG. 3, when a word whose reading is "Asensen" is registered, since the common portion "Gensen" with the corresponding Kana writing word "Jogensen" is shared, the first character table 3
"Up" of 1 is the same as "u" of the third character table 33 4
It has a pointer to the character table 34. In addition, when registering a word whose reading is "josen", the common part "sen" is shared, so "bottom" of the fourth character table 34 is registered.
Has a pointer to the fifth character table 35, which is the same as "ge". Since “below” in the fourth character table 34 is also used to search for “joshitamachi”, it also has a pointer to the same fifth character table 36 as “ka” in “jokamachi”.

【００２３】単語検索部２２は、入力文字列の各文字位
置から単語辞書２１の検索を行う。単語検索部２２の動
作を図５を参照して説明する。以下の説明および図５で
は、入力文字列をｓ、入力文字列のｊ文字目をｓ［ｊ］
と表現する。The word search unit 22 searches the word dictionary 21 from each character position of the input character string. The operation of the word search unit 22 will be described with reference to FIG. In the following description and FIG. 5, the input character string is s, and the jth character of the input character string is s [j].
Express.

【００２４】（ａ）ｉ←１（ｓの１文字目以降と一致す
る単語から検索する）。（ステップ５１）（ｂ）ｓの文字すべてについて単語検索したなら終了。
（ステップ５２）（ｃ）ｊ←１（単語辞書２１の１文字目テーブルから検
索を始める）。（ステップ５３）（ｄ）ｉ文字目テーブルにｓ［ｉ］があるかどうかを調
べる。あれば、（ｅ）に進み、なければ（ｈ）に進む。
（ステップ５４、ステップ５５）（ｅ）ｓ［ｉ］，ｓ［ｉ＋１］，…ｓ［ｉ＋ｊ−１］と
いう読みに対応する単語があるかを調べ、あれば単語の
情報を単語格納部２３に書き込む。（ステップ５６）（ｆ）ｊ＋１文字目テーブルへのポインタを調べる。複
数のポインタがある場合は一つを残し、他を、ｉ，ｊの
値と共にスタックに保持する。（ステップ５７）
（ｇ）ｊを１増やして（ｄ）へ進む。（ステップ５８）（ｈ）スタックに未処理のポインタがあるかを調べ、あ
れば（ｉ）へ進み、なければ（ｊ）へ進む。（ステップ
５９）（ｉ）スタックから未処理のポインタを取りだし（ｄ）
へ進む。（ステップ５１０）（ｊ）ｉを１増やして（ｂ）へ進む。（ステップ５１
１）単語確認部２４は単語格納部２３に書き込まれている単
語のうち読みに漢字を含む単語について、検索に用いた
読みと検索された単語の表記が対応付けできるかどうか
を確認する。対応付けができなかった場合は、単語格納
部２３から該単語を削除する。単語検索部２２では、実
際に登録されていない単語についても検索が成功する場
合がある。例えば、“上下まち”という読みで図３の単
語辞書２１を検索すると、“城下町”という単語の情報
（図３の３７）が取り出される。しかし、“上下まち”
は、図３の単語辞書２１に登録されている単語（図２
（ｂ））ではない。そのため、単語格納部２３の単語が
実際に単語辞書２１に登録されているかどうかを単語確
認部２４によって確かめる。単語確認部２４は、検索に
用いた読み“上下まち”と、検索された単語の表記“城
下町”に含まれる漢字の種類と順序を比較し、一致しな
ければ単語格納部２３から削除する。(A) i ← 1 (search from the word that matches the first and subsequent characters of s). (Step 51) (b) When the word search is performed for all the characters s, the process ends.
(Step 52) (c) j ← 1 (search is started from the first character table of the word dictionary 21). (Step 53) (d) It is checked whether or not s [i] is in the i-th character table. If there is, proceed to (e), and if not, proceed to (h).
(Step 54, Step 55) (e) It is checked whether there is a word corresponding to the reading of s [i], s [i + 1], ..., S [i + j-1], and if there is, the word information is stored in the word storage unit 23. Write. (Step 56) (f) Check the pointer to the j + 1th character table. If there are multiple pointers, one is left and the other is held on the stack together with the values of i and j. (Step 57)
(G) Increase j by 1 and proceed to (d). (Step 58) (h) It is checked whether or not there is an unprocessed pointer on the stack. If there is an unprocessed pointer, the process proceeds to (i), and if not, the process proceeds to (j). (Step 59) (i) Take out an unprocessed pointer from the stack (d)
Go to. (Step 510) (j) Increase i by 1 and proceed to (b). (Step 51
1) The word confirming unit 24 confirms whether or not the reading used in the search and the notation of the searched word can be associated with each other among the words written in the word storage unit 23 that include Chinese characters in the reading. If the word cannot be associated, the word is deleted from the word storage unit 23. The word search unit 22 may successfully search for a word that is not actually registered. For example, if the word dictionary 21 of FIG. 3 is searched with the reading “upper and lower towns”, information (37 in FIG. 3) of the word “castle town” is retrieved. However, "upper and lower towns"
Is a word registered in the word dictionary 21 of FIG.
Not (b)). Therefore, the word checking unit 24 checks whether the word in the word storage unit 23 is actually registered in the word dictionary 21. The word confirmation unit 24 compares the reading “upper and lower towns” used in the search with the type and order of the kanji included in the searched word notation “castle town”, and if they do not match, deletes them from the word storage unit 23.

【００２５】単語確認部２４の動作例を図６を参照して
説明する。以下の説明および図５では、読みのｉ文字目
をｙ［ｉ］、表記のｊ文字目をｈ［ｊ］、読みの長さを
Ｌ_yと表記する。An operation example of the word confirmation unit 24 will be described with reference to FIG. In the following description and FIG. 5, the i-th reading character is expressed as y [i], the j-th character writing is expressed as h [j], and the reading length is expressed as L _y .

【００２６】（ａ）ｉ←１、ｊ←１。（ステップ６１）（ｂ）ｉ＞Ｌ_yならば対応付けに成功して終了。（ステ
ップ６２）（ｃ）ｙ［ｉ］以降で漢字を探す。見つからなければ対
応付けに成功して終了。見つかれば、ｐ←見つかった文
字位置とする。（ステップ６３，６４）（ｄ）ｈ［ｊ］以降でｙ［ｐ］を探す。見つからなけれ
ば対応付けに失敗して終了。見つかれば、ｑ←見つかっ
た文字位置とする。（ステップ６５，６６）（ｅ）ｉ←ｐ＋１、ｊ←ｑ＋１、として（ｃ）へ進む。
（ステップ６７）単語選択部２５は、従来の文章入力装置で用いられてい
るものと同一である。例えば『べた書き文の仮名漢字変
換システムとその同音語処理』（牧野、木澤：情報処理
学会論文誌、Ｖｏｌ．２２，Ｎｏ．１，ｐｐ．５９−６
７（１９８１））に記載された方法で実現できる。(A) i ← 1, j ← 1. (Step 61) (b) If i> L _y , the association is successful and the process ends. (Step 62) (c) Search for kanji after y [i]. If not found, matching is successful and the process ends. If found, p ← let found character position. (Steps 63 and 64) (d) Search y [p] after h [j]. If not found, matching fails and ends. If found, set q ← found character position. (Steps 65 and 66) (e) As i ← p + 1 and j ← q + 1, proceed to (c).
(Step 67) The word selection unit 25 is the same as that used in the conventional text input device. For example, "Kana-to-Kana conversion system for solid writing and its homophone processing" (Makino, Kizawa: IPSJ Journal, Vol. 22, No. 1, pp. 59-6.
7 (1981)).

【００２７】[0027]

【発明の効果】以上説明したように、本発明による文章
入力装置は、読みに漢字が混在している単語を単語辞書
に登録する際に、対応するかな書き語と共通の部分を共
有するため、辞書容量の増加が少ないという効果があ
る。As described above, the text input device according to the present invention shares a common part with a corresponding kana written word when registering a word in which kanji is mixed in reading into a word dictionary. The effect is that the dictionary capacity does not increase much.

[Brief description of drawings]

【図１】本発明の文章入力装置の一実施例を示すブロッ
ク図。FIG. 1 is a block diagram showing an embodiment of a text input device of the present invention.

【図２】単語の混ぜ書きの例を示すブロック図。FIG. 2 is a block diagram showing an example of mixed writing of words.

【図３】本発明の文章入力装置の単語辞書の概念図。FIG. 3 is a conceptual diagram of a word dictionary of the text input device of the present invention.

【図４】かな書き語だけを登録した単語辞書の概念図。FIG. 4 is a conceptual diagram of a word dictionary in which only kana written words are registered.

【図５】単語検索部の動作を示す流れ図。FIG. 5 is a flowchart showing the operation of the word search unit.

【図６】単語確認部の動作を示す流れ図。FIG. 6 is a flowchart showing the operation of the word confirmation unit.

【図７】従来の文章入力装置の単語辞書の概念図。FIG. 7 is a conceptual diagram of a word dictionary of a conventional text input device.

【図８】単語の先頭部と末尾部を共有した単語辞書の概
念図。FIG. 8 is a conceptual diagram of a word dictionary in which the beginning and end of a word are shared.

[Explanation of symbols]

１入力装置２変換装置３出力装置４制御装置２１単語辞書２２単語検索部２３単語格納部２４単語確認部２５単語選択部 1 Input Device 2 Conversion Device 3 Output Device 4 Control Device 21 Word Dictionary 22 Word Search Section 23 Word Storage Section 24 Word Confirmation Section 25 Word Selection Section

Claims

[Claims]

1. A text input device for replacing an input character string with another character string to form an output character string, an input device for inputting the input character string, and a first character string / replacement before replacement for each word. A word dictionary for registering a subsequent second character string and a part of speech, a word search unit for searching the word dictionary for a word in which the first character string matches a part of the input character string, and the word search unit. For each word searched for, after comparing the first character string and the second character string and deleting the word not registered in the word dictionary, after deleting the word by the word confirmation unit A word selection unit that creates an output character string from the remaining words, and an output device that displays the output character string, and the word dictionary, when registering a word containing Kanji in the first character string, The second letter identical to the word A sentence input device, wherein a common part of the first character string is shared with a word having a column and the same part of speech, and the first character string does not include kanji.