JPH0540750A

JPH0540750A - Automatic word punctuation inserting circuit

Info

Publication number: JPH0540750A
Application number: JP3196520A
Authority: JP
Inventors: Matsutaka Ito; 松孝伊東
Original assignee: Sharp Corp
Current assignee: Sharp Corp
Priority date: 1991-08-06
Filing date: 1991-08-06
Publication date: 1993-02-19

Abstract

PURPOSE:To automatically detect the punctuation of words inputted and to automatically insert punctuation code according to kinds of words. CONSTITUTION:A punctuation detecting part 3 decides whether the converted word string is KATAKANA(Japanese syllabary) or not, and by referring to the words stored in a KATAKANA dictionary 4, detects delimiting positions of the KATAKANA words. in response to this detecting result, an intermediate point generating part 7 generates the signal for generating an intermediate point, and an inserting/synthesizing part 9 inserts the intermediate point between words. When words are of foreign language, a space generating part 8 generates the signal for generating a space and the inserting/synthesizing part 9 inserts a space between words.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、文書処理装置に適用し
て好適な、入力された単語の間の区切りを自動的に検出
して、単語を自動的に区切ることができる自動単語区切
り挿入回路に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention is suitable for application to a document processing apparatus and automatically detects a break between input words to automatically break the words. It is about circuits.

【０００２】[0002]

【従来の技術】従来、文書処理装置においては、文書処
理装置に入力された単語（平仮名・カタカナ・漢字・ロ
ーマ字を問わない）をそのまま登録処理して、そのま
ま、漢字等の指示された文字の種類に変換して、表示し
ていた。このため、長い単語の場合には、即座に単語の
意味を理解しずらい点があった。例えば、以下の例にて
説明する。2. Description of the Related Art Conventionally, in a document processing device, a word (whether hiragana, katakana, kanji, or romaji) input to the document processing device is registered as it is, and a kanji or other designated character is directly registered. It was converted to the type and displayed. Therefore, in the case of a long word, it is difficult to immediately understand the meaning of the word. For example, the following example will be described.

【０００３】例１：平仮名の入力単語が「じょうほうし
すてむ」であって、指示された変換後の単語が、「情報
システム」。Example 1: The input word of hiragana is "johoshisutemu", and the designated word after conversion is "information system".

【０００４】例２：平仮名の入力単語が「えこのみっく
あにまる」であって、指示された変換後の単語が、「エ
コノミックアニマル」。Example 2: The input word of Hiragana is "Economic Mikunimaru", and the designated converted word is "economic animal".

【０００５】例３：平仮名の入力単語が「わーるどかっ
ぷさっかー」であって、指示された変換後の単語が、
「ＷＯＲＬＤＣＵＰＳＯＣＣＥＲ」。Example 3: The input word of the hiragana is "World Cup Sukka", and the designated converted word is
"WORLDUP SOCCER".

【０００６】尚、上記の例では、「情報」、「システ
ム」、「エコノミック」、「アニマル」、「ＷＯＲＬ
Ｄ」、「ＣＵＰ」、「ＳＯＣＣＥＲ」はいずれも、文書
処理装置内の辞書に含まれた単語であると仮定する。In the above example, "information", "system", "economic", "animal", "WORL"
It is assumed that all of "D", "CUP", and "SOCCER" are words included in the dictionary in the document processing device.

【０００７】上記の例では、例１の変換結果は、容易に
判読可能である。しかし、例２の場合はやや判読が困難
であり、「エコノミック・アニマル」と、単語の間に
「・」を挿入して表記すらのが望ましい。更に、例３の
場合には、「ＷＯＲＬＤＣＵＰＳＯＣＣＥＲ」と単
語の間に、スペースを挿入して表記すらのが望ましい。
ところが、こうした単語間の区切りを自動的に検出する
ような文書処理装置は従来提案されてはいなかった。In the above example, the conversion result of Example 1 is easily readable. However, in the case of Example 2, it is somewhat difficult to read, and it is desirable to insert "." Between the words "economic animal" and the word. Further, in the case of Example 3, it is desirable to insert a space between the word “WORLD CUP SOCCER” and the word.
However, no document processing device that automatically detects such a break between words has been proposed.

【０００８】[0008]

【発明が解決しようとする課題】以上のように、従来、
入力された単語の間の、単語の区切りを自動的に検出
し、検出された区切りの箇所に、その区切りを表す文字
や記号を自動的に挿入する回路は存在しなかったので、
そのような回路を持たない文書処理装置を利用している
場合においては、単語が特に長い場合には、その文書処
理装置が出力した表示の判読が困難であるという欠点が
あった。As described above, as described above,
Since there is no circuit that automatically detects word breaks between input words and automatically inserts characters or symbols that represent the breaks at the detected breaks,
In the case of using a document processing device without such a circuit, there is a drawback that it is difficult to read the display output by the document processing device when the word is particularly long.

【０００９】[0009]

【課題を解決するための手段】本発明は、上記課題の解
決を目的としてなされたもので、請求項１記載の発明で
は、単語を入力する入力手段と、前期入力手段に応答
し、入力された単語を所定の文字形式に変換する変換手
段と、所定の文字形式にて単語を記憶している辞書手段
と、前期変換手段に応答し、前期辞書手段の単語を参照
して、前期変換された単語の間の区切りを検出する検出
手段と、前期区切り検出手段に応答し、区切り記号を発
生する区切り発生手段と、前期変換手段、前期区切り検
出手段及び前期区切り発生手段に応答して、前期変換さ
れた単語に前期検出された区切りを挿入する挿入手段と
から構成される自動単語区切り挿入回路である。SUMMARY OF THE INVENTION The present invention has been made for the purpose of solving the above problems. According to the invention of claim 1, a word is input in response to the input means and the previous term input means. Conversion means for converting the word into a predetermined character format, dictionary means for storing the word in a predetermined character format, and previous conversion means in response to the previous conversion means, referring to the word in the previous conversion dictionary, Detecting means for detecting a delimiter between the words, the delimiter generating means for generating a delimiter in response to the preceding term delimiter detecting means, and the preceding term conversion means, the preceding term delimiter detecting means and the preceding term delimiter generating means for responding to the term It is an automatic word break insertion circuit composed of inserting means for inserting a break detected in the previous period into a converted word.

【００１０】請求項２記載の本発明では、単語を入力す
る入力手段と、前期入力手段に応答し、入力された単語
をカタカナ文字及び外国語に変換する変換手段と、カタ
カナ文字の文字形式にて単語を記憶しているカタカナ辞
書手段と、外国語文字の文字形式にて単語を記憶してい
る外国語辞書手段と、前期変換手段に応答し、前期カタ
カナ辞書手段に記憶されているカタカナ文字を参照し
て、前期変換されたカタカナ文字の間の区切りを検出す
る第１の検出手段と、前期変換手段に応答し、前期外国
語辞書手段に記憶されている外国語文字を参照して、前
期変換された外国語文字の間の区切りを検出する第２の
検出手段と、前期カタカナ文字の第１の区切り検出手段
に応答し、第１の区切り記号を発生する第１の区切り発
生手段と、前期外国語文字の第２の区切り検出手段に応
答し、第２の区切り記号を発生する第２の区切り発生手
段と、前期変換手段、前期第１の区切り検出手段、及び
前期第１の区切り発生手段に応答して、前期変換された
カタカナ文字に前期検出された第１の区切りを挿入する
第１の挿入手段と、前期変換手段、前期第２の区切り検
出手段、及び前期第２の区切り発生手段に応答して、前
期変換された外国語文字に前期検出された第２の区切り
を挿入する第２の挿入手段とから構成される自動単語区
切り挿入回路である。According to the second aspect of the present invention, an input means for inputting a word, a converting means for responding to the input means for the previous term to convert the input word into katakana characters and a foreign language, and a character format of katakana characters are provided. Katakana character dictionary stored in the first term Katakana dictionary means in response to the first term conversion means and the foreign language dictionary means that stores words in the character format of the foreign language character Referring to the foreign language characters stored in the previous term foreign language dictionary means in response to the first detecting means for detecting a delimiter between the katakana characters converted in the previous term and the previous term conversion means, Second detecting means for detecting a delimiter between the foreign characters converted in the previous period, and first delimiter generating means for generating the first delimiter in response to the first delimiter detecting means for the katakana character in the previous period. , First half foreign countries Responsive to the second delimiter generating means for generating a second delimiter in response to the second character delimiter detecting means, the first term converting means, the first term delimiter detecting means, and the first term delimiter generating means. And then responds to the first inserting means for inserting the first delimiter detected in the previous term into the katakana character converted in the previous term, the first converting means, the second delimiter detecting means, and the second delimiter generating means. Then, the automatic word segment insertion circuit is composed of the second segment inserting unit for inserting the second segment detected in the previous period into the foreign language character converted in the previous period.

【００１１】[0011]

【作用】上記の構成により、本発明の自動単語区切り挿
入回路では、入力手段にて入力された全ての単語の文字
列の区切りを自動的に検出し、単語の種類（カタカナ・
外国語）に応じて、適当な区切り記号を自動的に挿入す
る。With the above construction, the automatic word break insertion circuit of the present invention automatically detects the breaks in the character strings of all the words input by the input means, and detects the word type (katakana
The appropriate delimiter is automatically inserted according to the foreign language).

【００１２】本発明の文書処理装置においては、入力手
段によって入力され、所定のカタカナや外国語への変換
手段によって変換された、単語の文字列の種類が順次検
査される。In the document processing apparatus of the present invention, the type of the character string of the word input by the input means and converted by the predetermined katakana or the conversion means into a foreign language is sequentially inspected.

【００１３】区切り検出部は、変換された単語の文字列
がカタカナであるかどうかを判断する。変換された単語
の文字列がカタカナであれば、後続の単語もカタカナで
あるかどうかを判断する。The delimiter detection unit determines whether the character string of the converted word is katakana. If the character string of the converted word is katakana, it is determined whether the following word is also katakana.

【００１４】後続の単語もカタカナであれば、区切り検
出部は、カタカナ辞書に記憶されている単語を参照する
ことで、カタカナの単語の区切りを検出する。その検出
に応じて、中点発生部は中点「・」を発生する信号を発
生する。If the succeeding word is also katakana, the delimiter detection unit detects the delimiter of the katakana word by referring to the word stored in the katakana dictionary. In response to the detection, the midpoint generating section generates a signal for generating the midpoint ".".

【００１５】中点発生部からの信号に応じて、合成部
は、２単語の文字列の間に中点「・」を挿入することに
なる。In response to the signal from the midpoint generator, the synthesizer inserts the midpoint "." Between the character strings of two words.

【００１６】区切り検出部は、変換された単語の文字列
が外国語であるかどうかを判断する。変換された単語の
文字列が外国語であれば、後続の単語も外国語であるか
どうか判断する。The delimiter detection unit determines whether the character string of the converted word is a foreign language. If the character string of the converted word is a foreign language, it is determined whether the following word is also a foreign language.

【００１７】後続の単語が外国語であれば、区切り検出
部は、外国語辞書に記憶されている単語を参照すること
で、外国語の単語の区切りを検出する。その検出に応じ
て、スペース発生部はスペースを発生する信号を発生す
る。If the subsequent word is a foreign language, the delimiter detection unit detects the delimiter of the foreign language word by referring to the word stored in the foreign language dictionary. In response to the detection, the space generation unit generates a signal for generating a space.

【００１８】スペース発生部からの信号に応じて、合成
部は、２単語の文字列の間にスペースを挿入することに
なる。In response to the signal from the space generator, the synthesizer inserts a space between character strings of two words.

【００１９】この動作は単語の数が終了するまで続行さ
れ、次の単語があれば、上記の動作を繰り返して、最終
的に、全ての単語の区切りを自動的に検出し、必要な区
切り記号を自動的に挿入する。This operation is continued until the number of words is completed, and if there is a next word, the above operation is repeated, and finally, all the word delimiters are automatically detected and the necessary delimiter symbols are generated. Is automatically inserted.

【００２０】[0020]

【実施例】以下、図面に示した本発明の実施例に基ず
き、本発明における、自動的にカタカナや外国語（例え
ば、英語）の単語の区切りを検出して、その区切りの箇
所に自動的に区切りの記号を挿入する文書作成装置を詳
細に説明する。尚、これによって、これらの実施例に本
発明は限定されるものではないことは勿論である。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS Based on the embodiments of the present invention shown in the drawings, word breaks of katakana or foreign language (for example, English) in the present invention are automatically detected, and the positions of the breaks are detected. A document creating apparatus for automatically inserting a delimiter will be described in detail. Needless to say, the present invention is not limited to these embodiments.

【００２１】図１は、本発明の実施例における文書作成
装置の回路構成をしめすブロック図である。前期回路
は、入力手段１、変換手段２、区切り検出部３、カタカ
ナ外来語辞書４、外国語辞書５、漢字辞書６、中点
「・」発生部７、スペース発生部８、挿入／合成部９、
及び表示手段１０から構成される。FIG. 1 is a block diagram showing a circuit configuration of a document creating apparatus according to an embodiment of the present invention. The first half circuit includes an input unit 1, a conversion unit 2, a delimiter detection unit 3, a Katakana foreign word dictionary 4, a foreign language dictionary 5, a Kanji dictionary 6, a midpoint "." Generation unit 7, a space generation unit 8, and an insertion / synthesis unit. 9,
And display means 10.

【００２２】入力手段１は、文書処理装置に通常は平仮
名の文字列の形で、文字の入力を行うための、多数の入
力スイッチを含む。入力された文字列はその後、変換を
指示することにより、変換手段２にて、カタカナ・漢字
・ローマ字等の適当な文字列の形で、カタカナ辞書４、
漢字辞書６、外国語辞書５から抽出、出力される。平仮
名の場合は変換は行われない。平仮名・カタカナ・漢字
・ローマ字等に、適当に変換された文字列は、そのまま
挿入／合成部９に入力される。The input means 1 includes a large number of input switches for inputting characters, usually in the form of a hiragana character string, to the document processing apparatus. The input character string is then instructed to be converted, so that the converting means 2 outputs an appropriate character string such as katakana, kanji, or roman characters in the katakana dictionary 4,
It is extracted and output from the Kanji dictionary 6 and the foreign language dictionary 5. No conversion is performed for Hiragana. A character string appropriately converted into hiragana, katakana, kanji, romaji, etc. is directly input to the inserting / synthesizing unit 9.

【００２３】区切り検出部３は、変換手段２にて変換さ
れた文字列の単語の間にある区切りを、辞書を参照しな
がら検出する回路である。すなわち、カタカナ文字列に
変換された場合には、カタカナ辞書４に記憶させてある
多数のカタカナ（外来語）と比較することにより、カタ
カナの文字列の単語の間での区切りを検出する。The delimiter detection unit 3 is a circuit for detecting a delimiter between the words of the character string converted by the conversion means 2 with reference to a dictionary. That is, when converted into a katakana character string, the delimiter between the words of the katakana character string is detected by comparing with a large number of katakana (foreign words) stored in the katakana dictionary 4.

【００２４】そのためには、変換されたカタカナ文字列
の単語の先頭部分からの一致を、変換された文字列とカ
タカナ辞書４との間で検出し、最終的に一致したカタカ
ナの単語の文字列までの所で、最初の単語が区切られる
と仮定する。第２の単語として、今検出された単語の文
字列は除外して、次の文字列の先頭の部分から文字列で
の比較を、カタカナ辞書４との間で行う。検出された単
語までの文字列があれば、その単語の終了部分で、第２
の単語は区切られると判定する。For that purpose, a match from the beginning of the word of the converted katakana character string is detected between the converted character string and the katakana dictionary 4, and the character string of the finally matched katakana word is detected. So far, suppose the first word is separated. As the second word, the character string of the word just detected is excluded, and the comparison with the katakana dictionary 4 is performed with the character string from the beginning of the next character string. If there is a character string up to the detected word, at the end of that word, the second
The words are judged to be separated.

【００２５】更に、第３の単語として、今検出された第
１と第２の単語の文字列は除外して、第３の文字列の先
頭の部分から文字列での比較を、カタカナ辞書４との間
で行う。検出された単語までで、第３の単語の区切りで
あると判定する。こうした区切り検出の動作を、文字列
が残っている以上は繰り返し、入力手段１にて入力さ
れ、変換手段２にて変換されたカタカナの文字列の区切
りの箇所を全て検出する。Furthermore, as the third word, the character strings of the first and second words that have just been detected are excluded, and the katakana dictionary 4 is used to compare the character strings from the beginning of the third character string. To and from. Up to the detected word, it is determined to be the delimiter of the third word. The operation of the delimiter detection is repeated as long as the character string remains, and all the delimiter positions of the katakana character string input by the input unit 1 and converted by the conversion unit 2 are detected.

【００２６】変換手段２にて変換された文字列が外国語
の場合には、外国語辞書５に記憶させてある多数の外国
語と比較することにより、外国語の文字列の単語の間で
の区切りを、区切り検出部３は全て検出する。When the character string converted by the converting means 2 is a foreign language, it is compared with a large number of foreign languages stored in the foreign language dictionary 5 to find a difference between the words of the foreign language character string. The delimiter detection unit 3 detects all the delimiters.

【００２７】そのためには、変換された外国語の文字列
の単語の先頭部分からの一致を、変換された文字列と外
国語辞書５との間で検出し、最終的に一致した外国語の
単語の文字列までの所で、最初の単語が区切られると仮
定する。第２の単語として、今検出された単語の文字列
は除外して、第２の文字列の先頭の部分から文字列での
比較を、外国語辞書５との間で行う。検出された単語ま
での文字列があれば、その単語の終了部分までで、第２
の単語は区切られると判定する。To this end, a match from the beginning of a word in the converted foreign language character string is detected between the converted character string and the foreign language dictionary 5, and the finally matched foreign language character string is detected. Suppose the first word is delimited up to the word string. As the second word, the character string of the word just detected is excluded, and the comparison with the character string from the beginning of the second character string is performed with the foreign language dictionary 5. If there is a character string up to the detected word, add up to the end of that word
The words are judged to be separated.

【００２８】更に、第３の単語として、今検出された第
１と第２の単語の文字列は除外して、第３の文字列の先
頭の部分から文字列での比較を、外国語辞書５との間で
行う。検出された単語までで、第３の単語の区切りであ
ると判定する。こうした区切り検出の動作を、文字列が
残っている以上は繰り返し、入力手段１にて入力され、
変換手段２にて変換された外国語の文字列の区切りの箇
所を、区切り検出部３は全て検出する。変換手段２にて
変換された文字列が漢字の場合には、漢字辞書６に記憶
されている漢字が取り出され、挿入／合成部９に出力さ
れる。Further, as the third word, the character strings of the first and second words that have just been detected are excluded, and the comparison with the character string from the beginning of the third character string is performed in the foreign language dictionary. Between 5 and. Up to the detected word, it is determined to be the delimiter of the third word. The operation of detecting the delimiter is repeated as long as the character string remains, and is input by the input means 1,
The delimiter detection unit 3 detects all the delimiters of the foreign language character string converted by the conversion unit 2. When the character string converted by the converting means 2 is a Chinese character, the Chinese character stored in the Chinese character dictionary 6 is extracted and output to the inserting / synthesizing unit 9.

【００２９】区切り検出部３にて検出された、カタカナ
の文字列での区切り情報は、中点発生部７に出力され
る。中点発生部７は、それに応じて中点「・」を、検出
された区切りの箇所に挿入する信号を、挿入／合成部９
に出力する。The delimiter information in the character string of katakana detected by the delimiter detector 3 is output to the midpoint generator 7. The midpoint generating unit 7 accordingly inserts a signal for inserting the midpoint “·” into the detected delimiter, and inserts / synthesizes the signal.
Output to.

【００３０】区切り検出部３にて検出された、外国語の
文字列での区切り情報は、スペース発生部８に出力され
る。スペース発生部８は、それに応じてスペースを、検
出された区切りの箇所に挿入する信号を、挿入／合成部
９に出力する。The delimiter information in the foreign language character string detected by the delimiter detection unit 3 is output to the space generation unit 8. The space generation unit 8 outputs to the insertion / synthesis unit 9 a signal for inserting a space at the detected delimiter in accordance therewith.

【００３１】挿入／合成部９では、中点発生部７から出
力された信号に応答して、変換手段２にて変換されたカ
タカナに、区切り検出部３にて検出された区切り箇所に
て、中点を挿入して合成する。In the inserting / combining section 9, in response to the signal output from the midpoint generating section 7, the katakana converted by the converting means 2 is added to the division position detected by the division detecting section 3. Insert the midpoint and synthesize.

【００３２】同様に、挿入／合成部９では、スペース発
生部８から出力された信号に応答して、変換手段２にて
変換された外国語に、区切り検出部３にて検出された区
切り箇所にて、スペースを挿入して合成する。Similarly, in the inserting / combining unit 9, in response to the signal output from the space generating unit 8, the foreign language converted by the converting unit 2 is added to the delimiter position detected by the delimiter detecting unit 3. At, insert a space and synthesize.

【００３３】合成され区切られた、カタカナあるいは外
国語は、挿入／合成部９から表示手段１０に出力され、
表示される。The katakana or the foreign language which is synthesized and separated is output from the inserting / synthesizing unit 9 to the display means 10.
Is displayed.

【００３４】変換手段２にて変換されたのが、漢字であ
れば、漢字出力はそのまま、挿入／合成部９に入力さ
れ、そのまま表示手段１０にて表示される。If the conversion means 2 converts the kanji, the kanji output is input to the insertion / synthesis section 9 as it is and displayed on the display means 10 as it is.

【００３５】例えば、以下の例のように、カタカナや外
国語は区切られ、区切りを表す中点「・」やスペースが
挿入／合成された形で表示される。すなわち、変換され
た単語がカタカナの「エコノミックアニマル」の場合
は、「エコノミック・アニマル」として表示される。変
換された単語が外国語の「ＷＯＲＬＤＣＵＰＳＯＣＣＥ
Ｒ」の場合は、「ＷＯＲＬＤＣＵＰＳＯＣＣＥＲ」
として、表示される。この場合、カタカナ辞書４が、
「エコノミック」、「アニマル」という単語を記憶して
いるものとする。又、外国語辞書５が、「ＷＯＲＬ
Ｄ」、「ＣＵＰ」、「ＳＯＣＣＥＲ」という単語を記憶
していると仮定する。For example, as in the following example, katakana and foreign languages are separated and displayed in a form in which a middle point "." Representing a separation and a space are inserted / combined. That is, when the converted word is katakana “economic animal”, it is displayed as “economic animal”. The translated word is the foreign language "WORLD CUP SOCCE
In the case of "R", "WORLD CUP SOCCER"
Is displayed as. In this case, the Katakana dictionary 4
It is assumed that the words "economic" and "animal" are remembered. In addition, the foreign language dictionary 5 is "WORRL
Suppose we have stored the words "D", "CUP", "SOCCER".

【００３６】図２は、本発明の区切り検出の動作ステッ
プを表すフローチャートである。図２のフローチャート
はステップＳ１ないしＳ７から構成される。FIG. 2 is a flow chart showing the operation steps of the delimiter detection of the present invention. The flowchart of FIG. 2 includes steps S1 to S7.

【００３７】ステップＳ１では、上述したように、変換
手段２にて変換された単語の文字列の種類を順次検査す
るステップで、変換手段２や区切り検出部３の動作がそ
れに相当する。In step S1, as described above, the type of the character string of the word converted by the conversion means 2 is sequentially inspected, and the operations of the conversion means 2 and the delimiter detection section 3 correspond to this.

【００３８】ステップＳ２では、変換された単語の文字
列がカタカナであるかどうかが判断される。変換された
単語の文字列がカタカナであれば、ＹＥＳが選択されて
ステップＳ５が選択される。もし、否であれば、次のス
テップＳ３が選択される。In step S2, it is determined whether the character string of the converted word is katakana. If the converted character string is katakana, YES is selected and step S5 is selected. If not, the next step S3 is selected.

【００３９】ステップＳ５では、後続の単語がカタカナ
であるかどうかが判断される。この判断がＹＥＳであれ
ば、ステップＳ７が選択される。ステップＳ７では、区
切り検出部３は、カタカナ辞書４と上述の関連した動作
を行い、２単語の文字列の間に中点を挿入することにな
る。In step S5, it is determined whether the succeeding word is katakana. If this determination is YES, step S7 is selected. In step S7, the delimiter detection unit 3 performs the above-described related operation with the katakana dictionary 4 and inserts a midpoint between the character strings of two words.

【００４０】ステップＳ３では、変換された単語の文字
列が外国語であるかどうかが判断される。変換された単
語の文字列が外国語であれば、ＹＥＳが選択されてステ
ップＳ４が選択される。もし、否であれば、図示しない
次のステップが選択されて、他の動作が開始される。In step S3, it is determined whether the character string of the converted word is a foreign language. If the character string of the converted word is a foreign language, YES is selected and step S4 is selected. If not, the next step (not shown) is selected and another operation is started.

【００４１】ステップＳ４では、後続の単語が外国語で
あるかどうかが判断される。この判断がＹＥＳであれ
ば、ステップＳ６が選択される。ステップＳ６では、区
切り検出部３は、外国語辞書５と上述の関連した動作を
行い、２単語の文字列の間にスペースを挿入することに
なる。In step S4, it is determined whether the following word is a foreign language. If this determination is YES, step S6 is selected. In step S6, the delimiter detection unit 3 performs the above-mentioned related operation with the foreign language dictionary 5 and inserts a space between character strings of two words.

【００４２】図２のフローチャートは単語の数が終了す
るまで続行され、次の単語があれば、上記のステップＳ
１からＳ７を繰り返して、最終的に、全ての単語の区切
りを検出し、必要な区切り記号を挿入する。The flow chart of FIG. 2 continues until the number of words is exhausted, and if there is a next word, then step S above
By repeating steps 1 to S7, all word delimiters are finally detected, and necessary delimiters are inserted.

【００４３】上記の本実施例では、単語の種類を、外来
語を表すカタカナと外国語（特に英語）として、単語の
文字列の区切りを自動的に検出し、区切り記号を自動的
に挿入して合成して、表示する例をしめしたが、単語の
種類はこれ以外でも、適用可能であり、又、そのように
実施出来ることは、上記の記載から明らかである。In the above-described embodiment, the word types are katakana representing a foreign word and a foreign language (especially English), and the delimiters of the character strings of the words are automatically detected and the delimiters are automatically inserted. Although an example in which they are combined and displayed is shown, it is apparent from the above description that other kinds of words can be applied and can be implemented as such.

【００４４】その他、本発明は上記しかつ図面に示した
実施例のみに限定されるものではなく、要旨を逸脱しな
い範囲内で適宜変形して実施できることは勿論である。Besides, the present invention is not limited to the embodiments described above and shown in the drawings, and it is needless to say that the present invention can be appropriately modified and implemented without departing from the scope of the invention.

【００４５】[0045]

【効果】以上のように本発明の自動単語区切り挿入回路
では、入力手段にて入力された全ての単語の文字列の区
切りを自動的に検出し、単語の種類（カタカナ・外国語
等）に応じて、適当な区切り記号を自動的に挿入される
ため、文書処理装置への文字の入力操作が容易になり、
操作者が入力しながら区切りを考慮することが不要とな
る。表示された情報も、そのままですぐ理解し易いもの
となる。[Effect] As described above, in the automatic word break insertion circuit of the present invention, the breaks of the character strings of all the words input by the input means are automatically detected and the word type (katakana, foreign language, etc.) is detected. Accordingly, the appropriate delimiter is automatically inserted, making it easy to input characters to the document processing device.
It is not necessary for the operator to consider the division while inputting. The displayed information is easy to understand as it is.

[Brief description of drawings]

【図１】本発明の実施例における自動単語区切り挿入回
路のブロック図である。FIG. 1 is a block diagram of an automatic word break insertion circuit according to an embodiment of the present invention.

【図２】本発明の実施例における自動単語区切り挿入回
路の動作ステップのフローチャートである。FIG. 2 is a flow chart of operation steps of an automatic word segment insertion circuit in the embodiment of the present invention.

[Explanation of symbols]

１入力手段２変換手段３区切り検出部４カタカナ辞書５外国語辞書６漢字辞書７中点発生部８スペース発生部９挿入／合成部１０表示手段 DESCRIPTION OF SYMBOLS 1 Input means 2 Conversion means 3 Separation detection section 4 Katakana dictionary 5 Foreign language dictionary 6 Kanji dictionary 7 Midpoint generation section 8 Space generation section 9 Insertion / synthesis section 10 Display means

Claims

[Claims]

1. An input means for inputting a word, a converting means for responding to the previous input means, for converting the input word into a predetermined character format, and a dictionary means for storing the word in the predetermined character format. And a detection means for responding to the previous term conversion means, referring to a word in the previous term dictionary means, for detecting a delimiter between the previously converted words, and a delimiter generation for generating a delimiter symbol in response to the previous term delimitation detection means An automatic word, characterized in that it comprises: means for inserting the term detected in the previous term into the word converted in the earlier term in response to the means for converting the term earlier, the means for detecting the term earlier, and the means for generating the term earlier. Break insertion circuit.

2. An input means for inputting a word, a conversion means for responding to the input means for the previous term, converting the input word into katakana characters and a foreign language, and storing the word in a character format of katakana characters. Katakana dictionary means, a foreign language dictionary means that stores words in the character format of foreign language characters, and the previous term conversion means in response to the katakana character stored in the katakana dictionary term means The first detecting means for detecting a delimiter between the katakana characters that have been converted and the first conversion means, and referring to the foreign language character stored in the first foreign language dictionary means, the first converted foreign language character Second detecting means for detecting a delimiter between the first and second delimiters, and a first delimiter generating means for generating a first delimiter in response to the first delimiter detecting means for the first half katakana character; 2 division inspection Second, in response to the means
Second delimiter generating means for generating the delimiter symbol, the first term converting means, the first term delimiter detecting means, and the first term delimiter generating means, and the first term detected in the previously converted katakana character. In response to the first insertion means for inserting the first delimiter, the first term conversion means, the second term delimiter detection means, and the second term delimiter generation means, the first term is detected as a foreign language character that has been previously transformed. A second insertion means for inserting a second delimiter, and an automatic word delimiter insertion circuit comprising: