JPS59220827A

JPS59220827A - Information processor with erroneous input correcting function

Info

Publication number: JPS59220827A
Application number: JP58094796A
Authority: JP
Inventors: Teruaki Aizawa; 相沢　輝昭; Taishirou Kurita; 泰市郎栗田
Original assignee: Nippon Hoso Kyokai NHK; Japan Broadcasting Corp
Current assignee: Japan Broadcasting Corp
Priority date: 1983-05-31
Filing date: 1983-05-31
Publication date: 1984-12-12

Abstract

PURPOSE:To perform correction processing with high efficiency without increasing its storage capacity by sectioning an input character string into (n)-character blocks when the input character string is not stored in a word dictionary, and referring to frequency information and substituting a character with a high evaluation value as a correct character. CONSTITUTION:A character string signal A inputted from a KANA (Japanese syllabary) keyboard 1 is passed through a character string converter 18 which converts special character concatenation into one special character to supply a signal B to a CPU12. A character string restoring device 19 restores a signal C to its original special KANA character string when the signal has a special character, and passes other signals as they are to a controller 4 as a signal D, which is sent as a signal E to the word dictionary to be looked up. A signal F when the word is stored in the dictionary 4 is sent to an output storage circuit as a signal G through a controller, and a signal N when not is outputted. The CPU12 when receiving the signal N sections the temporarily stored signal B into (n)-character blocks and sends them as a signal J to a frequency table 17 to obtain a frequency signal K; when a special character is contained, a special character freuqncy table 20 is looked up. The CPU12 calculates an evaluation value and substitutes a character with a maximum value as a correct character.

Description

【発明の詳細な説明】本発明は情報処理装置に関し、特に、電子計算機、ワー
ドプロセッサ等の情報処理機器に対して、例えばｌ単語
を１入力中位として文章を人力する際に生ずる入力単位
内の誤りを訂正する機能を有する情報処理装置に関する
ものである。DETAILED DESCRIPTION OF THE INVENTION The present invention relates to an information processing device, and in particular, to an information processing device such as a computer or a word processor, the present invention relates to an information processing device such as an electronic computer or a word processor. The present invention relates to an information processing device having a function of correcting errors.

文章、例えば日本語文章を、単語を１単位として入力お
よび処理する情報処理装置として、従来、第１図に示す
装置が広く用いられている。ここで、ｌは操作者が文章
入力を行う入力装置、例えばかな鍵盤である。２は入力
された文章を処理し、各部を制御する中央処理装置（Ｃ
：ＰＵ）　、　３は外部記憶装置の記憶領域に展開され
た単語辞書、４はＣＰＵ２の指令の下に外部記憶装置を
制御する制御器、５は処理された文章の出力、記憶を行
う出力装置、６は操作者等に誤入力を警告する警報表示
装置である。2. Description of the Related Art Conventionally, the apparatus shown in FIG. 1 has been widely used as an information processing apparatus for inputting and processing sentences, for example, Japanese sentences, using words as units. Here, l is an input device, such as a kana keyboard, through which the operator inputs text. 2 is a central processing unit (C) that processes the input text and controls each part.
:PU), 3 is a word dictionary developed in the storage area of the external storage device, 4 is a controller that controls the external storage device under instructions from the CPU 2, and 5 is an output device that outputs and stores processed sentences. , 6 is an alarm display device that warns the operator of erroneous input.

かな鍵盤ｌから入力された入力中位としての単語は、Ｃ
ＰＵ２により、制御器４を介して単語辞書３に記憶され
ているか否か判定される。入力された単語が単語辞書３
に記憶されていれば、ＣＰＵ２は正　・しい入力がなさ
れたものとしてその単語を出力記憶装置５に送出する。The middle input word input from the Kana keyboard L is C.
The PU 2 determines via the controller 4 whether the word is stored in the word dictionary 3 or not. The input word is word dictionary 3
If the word is stored in the word, the CPU 2 determines that the word has been input correctly and sends the word to the output storage device 5.

逆に、入力された単語が単語辞書に記憶されていない場
合には、ＣＰＵ２は入力に誤りがあったものとして、必
要に応じて警報表示装置６に警報信号を送出するととも
に、入力された単語が単語辞書３に記憶されていない旨
を識別するための警報標識符号をその単語に付加して出
力記憶装置５に送出する。On the other hand, if the input word is not stored in the word dictionary, the CPU 2 assumes that there is an error in the input, and sends an alarm signal to the alarm display device 6 as necessary. A warning indicator code for identifying that the word is not stored in the word dictionary 3 is added to the word and sent to the output storage device 5.

順次入力される単語に対して上述の処理を繰返し、かつ
句読点等はかな鍵盤ｌからＣＰＵ２を介して出力記憶装
置５に送出することによって、所望の文章全体が処理さ
れる。The entire desired sentence is processed by repeating the above-mentioned processing for the words that are sequentially input, and sending punctuation marks and the like from the ephemeral keyboard 1 to the output storage device 5 via the CPU 2.

しかしながら、このような従来装置においては、誤入力
に対して弔に警報が表示され、誤入力された中詰に対し
てその旨を識別する符号が付されるのみであるので、文
章処理の効率が低下する欠点がある。However, in such conventional devices, a warning is displayed in response to an incorrect input, and a code is only attached to the incorrect input to identify it, which reduces the efficiency of text processing. It has the disadvantage that it decreases.

これに対して、誤入力された単語を訂正して出力する機
能（誤入力訂正機能）を有する情報処理装置がある。第
２図は誤入力訂正機能を有する従来の情報処理装置を示
し、ここで、７は装置内の記憶部１例えばＲＯＭに展開
された頻度表であり、単語を構成する文字群のうち、ｎ
を自然数として、連続したｎ文字の組合せ（ｎ文字の連
接）が単語辞書３に記憶されている全単語中に何回現わ
れるかという頻度情報を、すべてのｎ文字の連接につい
て記憶する。On the other hand, there is an information processing apparatus that has a function (erroneous input correction function) of correcting and outputting an incorrectly input word. FIG. 2 shows a conventional information processing device having an error input correction function, where 7 is a frequency table developed in a storage unit 1, for example, a ROM in the device, and n
Frequency information indicating how many times a combination of consecutive n characters (concatenation of n characters) appears in all words stored in the word dictionary 3 is stored for all concatenations of n characters, where is a natural number.

第２図示の装置は、かな鍵盤１により入力された単語が
単語辞書３に記憶されておらず、ＣＰＵ２が誤入力と判
定した場合に、その単語につき以下の訂正処理を行う。The device shown in the second figure performs the following correction process for the word when the word inputted using the kana keyboard 1 is not stored in the word dictionary 3 and the CPU 2 determines that the input is erroneous.

まず、ＣＰＵ２は誤りと判定した単語について、その第
１文字から第ｎ文字目まで、第２文字目から第ｎ＋１文
字目まで、のように順次ｎ文字毎に区切り、その区切ら
れた文字の連接を順次頻度表７に送出する。頻度表７は
それぞれの文字の連接について、対応した頻度情報をＣ
ＰＵ２に供給する。ＣＰＵ２はその頻度情報に基づいて
単語内の各文字に対して評価を行い、例えば評価値が最
も低い文字を誤入力された文字と判定する。そこで、Ｃ
ＰＵ２はその誤入力された文字を仮に他の文字と置き換
え、頻度表７を参照してその文字につき評価値を求める
。ＣＰＵ２は誤入力された文字を他のすべての文字と置
き換えてこの評価値を求める処理を行い、最も評価値が
高い文字を正字であると判断して、その正字を誤入力さ
れた文字と置き換える。First, the CPU 2 sequentially divides the word determined to be incorrect into every n characters, from the first character to the nth character, from the second character to the n+1st character, and then concatenates the separated characters. are sequentially sent to the frequency table 7. Frequency table 7 shows the corresponding frequency information for each letter concatenation.
Supply to PU2. The CPU 2 evaluates each character in the word based on the frequency information, and determines, for example, the character with the lowest evaluation value as the character that was input incorrectly. Therefore, C
The PU 2 temporarily replaces the incorrectly input character with another character and refers to the frequency table 7 to obtain an evaluation value for that character. The CPU 2 performs a process of replacing the incorrectly inputted character with all other characters to obtain this evaluation value, determines the character with the highest evaluation value to be the correct character, and replaces the corrected character with the incorrectly inputted character. .

仝次いで、ＣＰＵ２はその置き換民結果作成された単語が
単語辞書３に記憶されているか否かを判定し、弔３／ｊ
辞書３がその単語を記憶していれば、その単Ｊｈを訂正
結果として出方記憶装Ｒ５に向けて送出する。その単語
が単語辞書３に記憶されていない場合には、ＣＰＩＪ２
は前述した評価値が次に高い文字から順に、誤入力され
た文字と置換えて単語辞書３を参照してゆく。なお、総
ての文字について単語辞書３に記憶されていない場合に
は、警報表示装置６に警報信号を送出するとともに、入
力された単語に警報標識符号を付して出方記憶装置５に
送出する。Next, the CPU 2 determines whether or not the word created as a result of the replacement is stored in the word dictionary 3.
If the dictionary 3 stores the word, it sends the word Jh as a correction result to the appearance storage device R5. If the word is not stored in word dictionary 3, CPIJ2
refers to the word dictionary 3, replacing the erroneously input characters in order of the character with the next highest evaluation value. If not all characters are stored in the word dictionary 3, a warning signal is sent to the warning display device 6, and the input word is sent to the word storage device 5 with a warning sign code attached. do.

このように、第２図示の情報処理装置は、ｎ文字の連接
について頻度表を参照しつつ、入力された文字の誤りを
検出および訂正を行うものであり、文字の連接数を示す
自然数ｎを増すことにより情報処理装置の訂正能力を向
」ニさせることができる。In this way, the information processing device shown in the second diagram detects and corrects errors in input characters while referring to the frequency table for concatenations of n characters, and calculates a natural number n indicating the number of concatenations of characters. By increasing the correction capacity of the information processing device, it is possible to improve the correction ability of the information processing device.

しかしながら、文字の組み合せ可能な数は１ｍを文字の
種類数を示す自然数とすれば、　ｍ＋１となり、頻度表
７に要求される記憶容量はＣに比例して増加する。従っ
て、ｎを大きくすると必然的に装置を大型化せざるを得
ない問題点が生ずる。However, the number of possible combinations of characters is m+1, where 1m is a natural number indicating the number of types of characters, and the storage capacity required for the frequency table 7 increases in proportion to C. Therefore, if n is increased, the problem arises that the device must be made larger.

そこで、頻度表７を単語辞書３と同様の外部記憶装置と
すると、外部アクセスの時間がかがり、処理時間が増大
する問題点が生ずる。Therefore, if the frequency table 7 is stored in an external storage device similar to the word dictionary 3, a problem arises in that external access takes time and processing time increases.

一般に、このような誤り訂正機能を有する第２図示の情
報処理装置は英語等の文章を人力することを念頭に開発
されたものが多い。例えば、英語の場合アルファベット
は２６文字であるので、ｍ＝２６となり、ｎ＝３とじて
３文字の連接に対する頻度表を用いても、必要な記憶容
量は２６３　＝１７．５７Ｅｌに比例したものでよい。Generally, many of the information processing apparatuses shown in FIG. 2 having such an error correction function have been developed with the intention of manually writing sentences in English or the like. For example, in the case of English, the alphabet has 26 letters, so m = 26, and even if we set n = 3 and use a frequency table for concatenation of 3 letters, the required memory capacity is proportional to 263 = 17.57El. good.

これに対して、日本語の場合は、濁点、半濁点等を含め
てがな文字は５８種類存在し、ｍ＝５８．ｎ＝３に対す
る頻度表の記憶容量は５８３−１！９５，１１２に比例
したものが要求されることになるので、格段に大きな容
量の外部記憶装置が必要となる。前述のように、処理時
間を短軸するためには、外部記憶装置によらず、頻度表
７をＣＰＵ２の主記憶装置として備えることが好適であ
るが、要求される記憶容量の点すら、それは極めて困難
であり、特に小型計算機等の情報処理装置の場合には不
可能である。In contrast, in the case of Japanese, there are 58 types of gana characters, including voiced and handakuten, m = 58. Since the storage capacity of the frequency table for n=3 is required to be proportional to 583-1!95,112, an external storage device with a significantly larger capacity is required. As mentioned above, in order to shorten the processing time, it is preferable to provide the frequency table 7 as the main memory of the CPU 2 instead of using an external storage device, but even in terms of the required storage capacity, it is This is extremely difficult, especially impossible in the case of information processing devices such as small computers.

また、日本８１４をあっかう情報処理装置においては、
従来、例えばンＢ点、半濁点を独立した１文字として入
力および処理を行うので、濁音や半濁音はそれぞれ２文
字の連接となる。従って、頻度表によるｎ文字の連接に
対する訂正処理は、ｎ文字の中に濁点や半濁点が１つ含
まれる場合、実質的に（ｎ−１）文字の連接に対する訂
正処理能力しがなくなることになる。そこで、この場合
ｎ文字の連接と同じ訂正処理能力を情報処理装置が備え
るためには、頻度表の記憶容量ｍ″＋１に比例したもの
としなければならず、要求される記憶容量はさらに大幅
に増大する。さらに、濁音が２文字連続した単語、例え
ばゞガガグ（雅楽）のようなφ語を処理する場合、これ
を正しく判断するためには少なくとも゛力？＋１１ｆｌ
が〉どの４文字連接に対する頻度表が必要となる。すな
わち、この場合、記憶容量は５８４＝１１．３１６，４
９８に比例して要求され、飛躍的に大容都の記憶装置が
必要となるので、例え頻度表を外部記憶装置としても、
装置が極めて大型化してしまう問題点があっ−た。In addition, in the information processing equipment that serves Japan 814,
Conventionally, for example, the N-B point and the hand-dakuten are input and processed as one independent character, so each of the voiced and half-voiced sounds is a concatenation of two characters. Therefore, the correction processing for a concatenation of n characters using a frequency table will essentially lose its ability to correct a concatenation of (n-1) characters if one voiced or handakuten is included in the n characters. Become. In this case, in order for the information processing device to have the same correction processing capacity as the concatenation of n characters, the storage capacity of the frequency table must be proportional to m''+1, and the required storage capacity is even greater. Furthermore, when processing a word with two consecutive voiced sounds, such as a φ word such as ゛gagagu (gagaku), it takes at least ゛? + 11 fl to correctly judge this.
> We need a frequency table for which four-letter concatenation. That is, in this case, the storage capacity is 584=11.316,4
98, and an exponentially larger storage device is required, so even if the frequency table is stored in an external storage device,
There was a problem in that the device became extremely large.

本発明はかかる従来の問題点に鑑みてなされたもので、
その目的は、本来１文字とみなし得る濁音や半濁音など
の文字連接や、日本文に頻繁に出現する特定の文字組か
らなる文字連接（以下、これらを特殊文字連接と言う）
を′ＦｉＪ音等以外のｌ＾音と同様に１文字として処理
可能な特殊文字とし、これら特殊文字を含むｎ文字の特
殊文字連接表を備えることにより、記憶容量の増大を必
要とすることなく、誤入力訂正処理を高効率に行うこと
ができる誤入力訂正機能を有する情報処理装置を提供す
ることにあるやかかる目的を達成するために、本発明では、入力された
単語中に特定の文字の順列か存在する場合には、入力さ
れた単語に対し、特定の文字の順列を１文字の特定の符
号に置き換える変換を行って、入力された申、フロに対
応した文字列を発生する文字列発生手段と、複数の単語
を記憶する単語記憶手段と、特定の符号を含まない所定
個数の文字から成る文字の連接について彫語記憶手段内
における出現頻度を格納するｉｌの頻度情報格納手段と
、特定の符号を少なくとも１つ含む所定個数の文字から
成る文字の連接について弔語記七〇手段内における出現
頻度を格納する第２の頻度情報格納手段と、文字列を取
込み、取込んだ文字列について１１語記憶手段を参照す
る参照手段と、参照の結果、取込んだ文字列が単晶記憶
手段に存在しない場合には、取込んだ文字列の先頭から
順に所定個数の文字から成る文字の連接に分解する分解
手段と、各別の文字の連接について、第１または第２の
頻度情報格納手段を参照して、各別の頻度情報を得る頻
度情報判定手段と、各別の文字の連接に対する頻度情報
から、取込んだ文字列を構成する文字のそれぞれに対す
る評価値を求め、評価値に基づいて誤入力された文字を
抽出し、誤入力された文字を正しい文字に置換１−で取
込んだ文字列を訂正する訂正手段とを具えたことを特徴
とする。The present invention was made in view of such conventional problems, and
Its purpose is to create character combinations such as voiced and semi-voiced sounds that can be considered as one character, as well as character combinations consisting of specific character sets that frequently appear in Japanese sentences (hereinafter referred to as special character combinations).
is a special character that can be processed as a single character in the same way as the l^ sound other than the 'FiJ sound, etc., and by providing a special character concatenation table of n characters including these special characters, there is no need to increase the storage capacity. In order to achieve the above object, the present invention provides an information processing device having an error input correction function that can perform error input correction processing with high efficiency. If a permutation exists, the input word is converted to replace a specific permutation of characters with a specific code of one character, and a character string corresponding to the input mon, fro is generated. a string generation means, a word storage means for storing a plurality of words, and an il frequency information storage means for storing the appearance frequency in the carved word storage means for concatenation of characters consisting of a predetermined number of characters not including a specific code. , a second frequency information storage means for storing the frequency of appearance in the 70 words of condolence for a concatenation of characters consisting of a predetermined number of characters including at least one specific code; a reference means for referring to the 11-word storage means for the column; and, if the retrieved character string does not exist in the single crystal storage means as a result of the reference, a character consisting of a predetermined number of characters in order from the beginning of the retrieved character string; a decomposition means for decomposing into concatenations of each different character, a frequency information determining means for obtaining each different frequency information by referring to the first or second frequency information storage means for each concatenation of different characters; From the frequency information for the concatenation, calculate the evaluation value for each character that makes up the imported character string, extract the incorrectly entered characters based on the evaluation value, and replace the incorrectly entered characters with the correct character. The present invention is characterized by comprising a correction means for correcting the imported character string.

以下、図面を参照して本発明の詳細な説明する。Hereinafter, the present invention will be described in detail with reference to the drawings.

第３図は本発明誤入力訂」Ｌ機能を有する情報処理装置
の構成の一例を示し、ここで、従来と同様に構成できる
ものについては、対応箇所で同一符号を付してその説明
は省略する。第３図示の装置においては、かな鍵盤１と
ＣＰＵ１２との間に特殊文字連接を１文字の特殊文字に
変換する文字列変換器１８、ＣＰＵ１２と制御器３との
間に１文字の特殊文字を特殊文字連接に還元する文字列
還元器１８、および前述の頻度表７と同様のｎ文字の清
−音に対する頻度表１７と並列に、特殊文字に対する頻
度表２０を配設する。ＣＰＵ１２は各部を制御する他、
第４図示の処理手順を記憶する。FIG. 3 shows an example of the configuration of an information processing device having the erroneous input correction function of the present invention. Here, corresponding parts are given the same reference numerals and explanations are omitted for those that can be configured in the same way as before. do. In the device shown in FIG. 3, a character string converter 18 is provided between the kana keyboard 1 and the CPU 12 to convert special character concatenations into one special character, and a character string converter 18 is used to convert a special character concatenation into one special character between the CPU 12 and the controller 3. A frequency table 20 for special characters is arranged in parallel with a character string reducer 18 for reducing special character concatenations and a frequency table 17 for the clear sounds of n characters similar to the frequency table 7 described above. In addition to controlling each part, the CPU 12
The processing procedure shown in FIG. 4 is stored.

かな鍵盤１が出力する入力文字列信号Ａは文字　。The input character string signal A output by the kana keyboard 1 is a character.

列変換器１８へ導かれる。文字列変換器１８は入力文字
クリ信号Ａ中にあらかじめ訣められた特定のかな文字列
、すなわち特殊文字連接かあれは、その文字列をまとめ
て、かな文字以外の未使用の特殊文字に変換する。例え
ば、拗音１シヤ９が文字列信号Ａの中に存在していたと
き、その゛シャ′をＩＳｌに変換するように定めておけ
ば、′シャコアという文字列信号Ａに対して文字列変換
器１８が出力する文字列信号Ｂは９Ｓコ′となる。また
、例えば、文字列信号Ａ中に１ガ′か存在した場合には
、（Ｇ′に変換するように定めておく。文字列信号Ｂは
ＣＰＵ１２に導かれ、一時記憶されるつＣＰＵ１２は人
力された文字列信号Ｂを各中詰ごとに文字列信号Ｃとし
て文字列還元器１８に供給する。Column converter 18. The character string converter 18 converts a specific kana character string pre-contained in the input character signal A, that is, a concatenation of special characters, into unused special characters other than kana characters. do. For example, when the sulone 1 sha 9 exists in the character string signal A, if it is specified that the ゛sha' is to be converted to ISl, the character string converter The character string signal B outputted by 18 becomes 9S ko'. Further, for example, if 1 G' exists in the character string signal A, it is determined that it is converted to (G').The character string signal B is led to the CPU 12 and is temporarily stored. The resulting character string signal B is supplied to the character string reducer 18 as a character string signal C for each middle fill.

文字列還元器１９は文字列信号Ｃの中に特殊文字があれ
ばちとの特定のかな文字列に還元し、それ以外の文字は
そのまま通過させ、文字列４６号りとして制御器４に向
けて出力する。すなわち、文字列還元器１９は、前述し
た例では′Ｓコ′を゛シャコアに還元する。なお、ここ
で申請辞書３が特殊文字を含む形態であれば、特定のか
な文字列と辞書の内容とは直接比較できるので、その場
合は文字列還元器１９を備えなくともよい。If there is a special character in the character string signal C, the character string reducer 19 reduces it to a specific kana character string, passes other characters as is, and sends the character string No. 46 to the controller 4. Output. That is, in the example described above, the character string reducer 19 reduces 'Sko' to 'Shakoa'. Note that if the application dictionary 3 includes special characters, the specific kana character string and the contents of the dictionary can be directly compared, so in that case, the character string reducer 19 may not be provided.

文字列信号りは、ｉｌｕ＋御器４（こより、ｒｌｊ　Ｊ
ｌｉ辞−冒の記憶を参照てきる文字列３１１号Ｅに変換
される。The character string signal is ilu + Goki 4 (Koyori, rlj J
It is converted into the character string No. 311 E that refers to the memory of li diction - profanity.

ｉＪｉ　、Ｒｊ辞書３は人力された文字列信号Ｅと回し
文字列がΦ語として記憶されていれば、文字列信号Ｅを
そのまま、辞書出力信号Ｆとして出力し、記憶されてい
なければ人力された文字列信号Ｅは辞書に登録されてい
ないことを示す旨の信号Ｎを出力する。辞書出力信号Ｆ
は、制御器３によりＣＰｔ１１２に適合する辞書出力信
号Ｇに変換され、ＣＰＵ１２は辞書出力信号Ｇを受信す
ると、それを出力文字列信号Ｈとして出力記憶装置に供
給する。ＣＰＵ１２（ま信号Ｈを受信した場合には、必
要に応じて警報信号Ｉを警報表示装置６に供給して警報
を表示させるとともに、以下の誤入力訂正処理を行う。iJi, Rj Dictionary 3 outputs the character string signal E as it is as a dictionary output signal F if the manually inputted character string signal E and the rotated character string are stored as a Φ word, and if not stored, outputs the manually inputted character string signal E as a dictionary output signal F. A signal N indicating that the character string signal E is not registered in the dictionary is output. Dictionary output signal F
is converted by the controller 3 into a dictionary output signal G that conforms to CPt112, and when the CPU 12 receives the dictionary output signal G, it supplies it as an output character string signal H to the output storage device. When the CPU 12 receives the signal H, the CPU 12 supplies the alarm signal I to the alarm display device 6 as necessary to display an alarm, and performs the following error input correction process.

第４図は誤入力訂正処理の手順の一例を示す。FIG. 4 shows an example of the procedure for correcting erroneous input.

まず、ステップＳｔにて、　ＣＰｔ１１２が文字列信号
Ｂを受信すると、その信号日を一時記憶するとともに、
ステップＳ２にて、文字列還元器１８および制御器４を
介して単語辞書３を参照し、入力された単語を判定する
。単語辞書３から制御器４を介して辞書出力信号Ｇが送
出された場合には、ステ・ンプＳ３にて肯定判定がなさ
れ、出力記憶装置５に対し単語出力を行い、処理を終了
する。First, in step St, when the CPt 112 receives the character string signal B, it temporarily stores the date of the signal, and
In step S2, the word dictionary 3 is referred to via the character string reducer 18 and the controller 4 to determine the input word. When the dictionary output signal G is sent from the word dictionary 3 via the controller 4, an affirmative determination is made in step S3, the word is output to the output storage device 5, and the process ends.

ステップＳ３にて否定判定がなされた場合、すなわち、
信号Ｎを受信した場合には、ＣＰＵ１２は一時記憶して
あった誤入力文字を含む文字列信号Ｂをｎ文字毎に区切
り、順次ｎ文字信号Ｊとして出力する。例えば、ｎ＝２
として、文字列信号Ｂが！ケイケン′のうちフイ′、を
誤入力した１ケテケノであった場合、１ケチ′、′テゲ
どケノがｎ文字信号Ｊとして出力される。頻度表１７は
ｎ文字信号Ｊを受イ８すると、ｎ文字信号Ｊに対応した
文字列が単語辞書３に記憶されている全単語の中に何回
現われるかを示す、頻度情報を、頻度信号にとして順次
出力する。ｎ文字信号Ｊの中に前述の特殊文字が含まれ
ていた場合には、ＣＰＵ１２は特殊文字頻度表２０を参
照し、この場合ｎ文字信号Ｊに対する頻度信号には特殊
文字頻度表２０からＣＰＩＪ１２　＄こ供給される。If a negative determination is made in step S3, that is,
When the CPU 12 receives the signal N, the CPU 12 divides the temporarily stored character string signal B including the erroneously input characters into every n characters and sequentially outputs them as an n character signal J. For example, n=2
As, the string signal B is! If it is a 1-ketekeno in which ``of'' of ``keiken'' is incorrectly input, 1-kechi'' and ``tegedokeno'' are output as an n-character signal J. When the frequency table 17 receives the n-character signal J, it displays frequency information indicating how many times the character string corresponding to the n-character signal J appears in all the words stored in the word dictionary 3. Output sequentially as . If the above-mentioned special character is included in the n-character signal J, the CPU 12 refers to the special character frequency table 20, and in this case, the frequency signal for the n-character signal J includes CPIJ12 $ from the special character frequency table 20. This is supplied.

ＧＰＵＩ２はＭ度信号Ｋを順次受信し、ステ・ンプＳ４
にてそれらの値から単語内の各文字に対する正しい可能
性の評価値を算出し、各文字のうち、最も小さい評価（
（ｉをもつ文字を誤入力された文字と判断する。そこで
、ステップＳ５にてその文字をそれ以外の１字に１６き
換え、置き換えた文字に対する評価値を算出する。ＣＰ
Ｕ１２はこの評価イ１１４を誤入力された文字以外のす
べてのかな文字に対して求め、ステップＳ８にて、最も
大きい評価値をもつ文字を仮に正字であると判断する。The GPUI2 sequentially receives the M degree signal K, and sends the step S4.
calculates the evaluation value of the correct possibility for each character in the word from those values, and calculates the evaluation value of the correct possibility for each character (
(The character with i is determined to be an incorrectly input character. Therefore, in step S5, that character is replaced with one other character, and an evaluation value for the replaced character is calculated.CP
U12 obtains this evaluation value 114 for all kana characters other than the erroneously input character, and in step S8 tentatively determines that the character with the largest evaluation value is a correct character.

次にステ・ンプＳ７にて、ＣＰＵ１２は誤入力された文
字を仮に正字と判定された文字に置き換えて修正し、そ
の修正された単語を新たに文字列信号Ｃとして出力し、
ステップＳ８にて、その単語が単語辞書３に記憶されて
いるか否かをステップＳ３と同様の方法で判定する。こ
こで、肯定判定がなされれば、その単語に対応した文字
列信号Ｈを出力記憶装置５に送出し、訂正処理を終了す
る。一方、単語辞書３に記憶されていなければ、ステッ
プ５１０にて、仮に正字であるとされた文字を除外して
、・ステップＳ６に復帰し、次に大きい評価値をもつ文
字を仮に正字として、同様の手順を繰返す。以上の処理
を繰り返すことにより出力記憶装置５には入力時の誤り
が訂正された文章が記憶される。Next, in step S7, the CPU 12 corrects the incorrectly input characters by temporarily replacing them with characters determined to be correct, and outputs the corrected word as a new character string signal C.
In step S8, it is determined whether the word is stored in the word dictionary 3 using the same method as in step S3. If an affirmative determination is made here, the character string signal H corresponding to the word is sent to the output storage device 5, and the correction process is completed. On the other hand, if it is not stored in the word dictionary 3, in step 510, the characters that are tentatively determined to be regular characters are excluded, and the process returns to step S6, where the character with the next highest evaluation value is temporarily set as a regular character. Repeat the same steps. By repeating the above process, the output storage device 5 stores a sentence in which input errors have been corrected.

孜に、類１隻表１７および２０に必要とされる記憶容量
について具体的に説明する。本発明装置に要求される記
憶容には、頻度表１７と特殊文字順ＩＷ表２０との記憶
容量の合計であり、特殊文字かに種類あるとすれば、　
（ｍ十ｋ）　Ｉ＋に比例する。The storage capacity required for Class 1 Tables 17 and 20 will be explained in detail. The storage capacity required for the device of the present invention is the total storage capacity of the frequency table 17 and the special character order IW table 20, and if there are different types of special characters, then
(m0k) Proportional to I+.

例えば、ＩＱ音２字から成る文字列の処理を行う場合を
考える。かな鍵盤りはゞガ′なとの高音を☆文字＋薊点
、すなわち１カ’＋　”ｚＸゝの２文字として出力する
か、本発明装ｆａｔによれば、文字＋濁点を１つの特殊
文字として処理できる。日本語において、濁音の種類は
２０種であるので、この場合必要とされる記憶容量は（
５７＋２０　）２＝５９２９である。これに対して、第
２図示の従来の情報処理装置Ｇでは濁音２文字を文字＋
濁点十文字＋濁点の４字として扱うので、４文字に対応
した頻度表が必要となり、この場合、記憶容量は５８４
＝１１．．３１８，４９８を必要とする。このように、
／を発明によれば、従来装置に比して格段に少ない記憶
容量で誤入力釘止処理を行うことができる。For example, consider processing a character string consisting of two IQ sounds. The Kana keyboard outputs the high-pitched sound of ゞga'na as a ☆ character + a dot, that is, 1 ka' + ``zXゝ, or according to the present invention, a character + a voiced mark is output as one special character. In Japanese, there are 20 types of voiced sounds, so the storage capacity required in this case is (
57+20)2=5929. On the other hand, in the conventional information processing device G shown in FIG.
Since they are treated as 4 characters (Dakuten Jumonji + Dakuten), a frequency table corresponding to the 4 characters is required, and in this case, the storage capacity is 584.
=11. ．． 318,498 is required. in this way,
According to the invention, erroneous input nailing processing can be performed with a significantly smaller storage capacity than conventional devices.

以上説明してきたように、本発明によれは、回数の文字
列の誤入力に対する訂正処理を、従来装置極より格段に
少ない容量の記者、セ装置Ｉ′’ｉにより実現てきるの
で、装置を小型に４，１１成できるのみならず、訂正処
理を高速度に行うことかできる。As explained above, according to the present invention, the correction process for incorrectly input character strings can be realized using the reporter/server device I''i, which has a much smaller capacity than conventional devices. Not only can the 4,11 structure be made small, but also correction processing can be performed at high speed.

なお、上述の実施例においては入出力機器を１それぞれ
、かな鍵盤および出力記憶装置として説明したが、本発
明は入出力機器の形態に依存しない染。すなわち、入出
力機器を変更して種々の用途に適用できる。例えば、入
力機器として？キ声認識装置を用いれば音声誤入力訂正
機能を有する情報処理装置することができ、また、出力
機器として情報伝送機器を用いれば、誤人力訂正伝送機
能を有する情報処理装置とすることもできる。また、本
発明は日本語以外の自然π話にも容易に適用できること
勿論である。さらに、入力された単語中に１字以上の文
字誤りが存在した場合にも容易に対応できること勿論で
ある。In the above embodiment, the input/output devices are described as a kana keyboard and an output storage device, respectively, but the present invention does not depend on the form of the input/output devices. That is, it can be applied to various uses by changing the input/output equipment. For example, as an input device? If a key voice recognition device is used, the information processing device can be made to have a voice input error correction function, and if an information transmission device is used as an output device, it can also be made to be an information processing device having a human input error correction transmission function. Furthermore, it goes without saying that the present invention can be easily applied to natural pi-speak other than Japanese. Furthermore, it is of course possible to easily deal with the case where there is one or more character errors in the input word.

[Brief explanation of drawings]

第１図は従来の情報処理装置の構成の一例を示すブロッ
ク図、第２図は従来の誤入力訂正機能を有する情報処理
装置の構成の一例を示すブロック図、第３図は本発明に
係る誤入力訂正機能を有する情報処理装置の構成の一例
を示すブロック図、第４図はその誤入力訂正処理手順の
一例を示すフローチャートである。ｌ・・・かな鍵盤、２・・・中央処理装置（ｃｐｕ）、３・・・単語辞書、４・・・制御器、５・・・出力記憶装置、６・・・警報表示装置、７・・・頻度表、１２・・・中央処理装置（ＣＰｔｌ）、１７・・・頻度
表、１８・・・文字列変換器−１１３・・・文字列還元器。２０・・・特殊文字頻度表、Ａ−Ｆ、Ｎ・・・信号。特許出願人　日本放送協会FIG. 1 is a block diagram showing an example of the configuration of a conventional information processing device, FIG. 2 is a block diagram showing an example of the configuration of a conventional information processing device having an error input correction function, and FIG. 3 is a block diagram showing an example of the configuration of a conventional information processing device having an error input correction function. FIG. 4 is a block diagram showing an example of the configuration of an information processing apparatus having an erroneous input correction function, and a flowchart showing an example of the erroneous input correction processing procedure. l... Kana keyboard, 2... Central processing unit (CPU), 3... Word dictionary, 4... Controller, 5... Output storage device, 6... Alarm display device, 7. ... Frequency table, 12... Central processing unit (CPtl), 17... Frequency table, 18... Character string converter-1 13... Character string reducer. 20... Special character frequency table, A-F, N... Signal. Patent applicant: Japan Broadcasting Corporation

Claims

[Scope of Claims] 1) When a specific permutation of characters exists in the input word, conversion of the human-generated word to replace the specific permutation of characters with a specific code of one character. go and
a character string generation means for generating a character string corresponding to the input word; a word storage means for storing a plurality of words; and the word storage for concatenation of characters consisting of a predetermined number of characters not including the specific code. a first frequency information storage means for storing an appearance frequency in the word storage means; and a first frequency information storage means for storing an appearance frequency in the word storage means for a concatenation of characters consisting of the predetermined number of characters including at least one specific gradient sign. 2, the frequency information storage means and the character string are taken in;
a reference means for referring to the word storage means for the imported character string; and if as a result of the reference, the imported character string does not exist in the word storage means, the reference means refers to the word storage means from the beginning of the imported character string; a decomposition means that sequentially decomposes the predetermined number of characters into character concatenations, and a frequency for obtaining each separate frequency information by referring to the first or second frequency information storage means for each different concatenation of characters; an information determining means, and an evaluation value for each of the characters constituting the captured character string is determined from the frequency information regarding the concatenation of each of the different characters, and based on the evaluation value, an incorrectly input character is extracted; An information processing apparatus having a manual error correction function, comprising: a correction means for correcting the input character string by replacing the incorrectly input characters with correct characters. 2. In the information processing device having the function of correcting human input errors as set forth in claim 1, the correction means includes a determination means for determining a character input by error based on the evaluation value, and a determination unit for determining a character input by error based on the evaluation value. an evaluation value calculation means that applies all other characters to the determined character and calculates an evaluation value for each of the other characters based on the kudzu frequency information obtained by the frequency information determination means; a replacement means for replacing the erroneously input characters in the order of the character with the highest evaluation value obtained by the evaluation value calculation means; An information processing apparatus having an error input correction function, comprising a character correction means for determining a replaced character as the correct character and correcting the input character string. 3) In the information processing device having an error input correction function as set forth in claim 1 or 2, the word is a Japanese word, and the specific character permutation is a voiced sound and a hanshu sound. An information processing device having an error input correction function, characterized in that the information processing device includes a specific character set that frequently appears in the Japanese text.