JPH0644409A

JPH0644409A - Character recognizing device

Info

Publication number: JPH0644409A
Application number: JP4199732A
Authority: JP
Inventors: Nobuhiko Kenmochi; 伸彦釼持; Mitsuhiro Inazumi; 満広稲積
Original assignee: Seiko Epson Corp
Current assignee: Seiko Epson Corp
Priority date: 1992-07-27
Filing date: 1992-07-27
Publication date: 1994-02-18

Abstract

PURPOSE:To reduce the device scale, to execute the recognition at a high speed, and improve the recognition rate by adopting a recurrent type neural network for the character recognizing device using a neural network. CONSTITUTION:With respect to the character recognizing device consisting of a character input means 101, a neural network 102, and a recognition result output means 103, a recurrent type neural network in which each node 104 for constituting a neural network 102 has a feedback in the inside is adopted.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明はコンピュータに文字等を
入力する際に使用される文字認識装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a character recognition device used for inputting characters and the like into a computer.

【０００２】[0002]

【従来の技術】従来の文字認識装置は、オンライン手書
き文字認識装置とオフライン文字認識装置の２種に分類
される。2. Description of the Related Art Conventional character recognition devices are classified into two types: online handwritten character recognition devices and offline character recognition devices.

【０００３】オンライン手書き文字認識では、信学技報
PRU82-37（題名”オンライン手書き漢字・ひらがな認
識”、以後、従来例１ａと呼ぶ。）に記載されている様
な基本ストローク法によって文字認識を行っているもの
が多い。In online handwriting recognition,
In many cases, character recognition is performed by the basic stroke method as described in PRU82-37 (title "Online handwritten Chinese character / Hiragana recognition", hereinafter referred to as Conventional Example 1a).

【０００４】基本ストローク法とは、時系列の座標デー
タとして入力した文字のストロークを、予め定められた
基本ストロークに分類し（以後、これをストローク認識
と呼ぶ。）、認識候補の各文字が基本ストロークによっ
て記述されている辞書と前記分類後のストロークを比較
して、最も類似性が高い文字を認識結果とする手法であ
る。ストローク認識では、入力時にペンが入力面上に降
りてから上がるまでを１本のストロークとして扱い、折
れ線近似、方向コード化の後、方向コードの異なる累積
長やＤＰマッチング等の手法で分類を行っていた。In the basic stroke method, the strokes of characters input as time-series coordinate data are classified into predetermined basic strokes (hereinafter referred to as stroke recognition), and each character of the recognition candidate is the basic. This is a method in which the dictionary described by strokes is compared with the strokes after the classification, and the character having the highest similarity is used as the recognition result. In stroke recognition, the stroke from the time the pen descends on the input surface to the time it rises is treated as one stroke, and after performing line approximation and direction coding, classification is performed by methods such as cumulative length of different direction codes and DP matching. Was there.

【０００５】この方法は、比較的少ない計算量、小さな
辞書容量かつ高認識率という特長を持っているので、現
在市場に供給されているオンライン手書き文字認識装置
の多くが採用している。Since this method has the features of a relatively small amount of calculation, a small dictionary capacity, and a high recognition rate, it is adopted by many of the online handwritten character recognition devices currently on the market.

【０００６】また、オンライン手書き文字認識装置にニ
ューラルネットワークを採用したものには、電学論Ｄ−
１１１巻１２号 pp１０４７−１０５６（題名”識別順
序を考慮した改良型パーセプトロンによるオンライン手
書き漢字認識”、以後、従来例１ｂと呼ぶ。）に記載さ
れるようなものが報告されていた。これは、３層構造と
誤差逆伝播学習法によるニューラルネットワークの学習
が難しいという問題を単純パーセプトロンと新学習法に
よって解決するものである。[0006] Further, there is an electronic theory D-
111, No. 12, pp 1047-1056 (Title "Online handwritten Chinese character recognition by improved perceptron considering identification order", hereinafter referred to as Conventional Example 1b) has been reported. This solves the problem that it is difficult to learn a neural network by the three-layer structure and the error back-propagation learning method by a simple perceptron and a new learning method.

【０００７】次に、オフライン文字認識装置としては種
々のものが提案されているが、特にニューラルネットワ
ークを用いると、文字列からの１文字切り出しを文字認
識と同時に行う事ができるという利点を有し、その様な
先願には、特開平３−１６３６８４（以下、従来例２ａ
と呼ぶ。）や信学論Ｖｏｌ．Ｊ７４−Ｄ−２Ｎｏ．１
１ｐｐ．１５５６−１５６４（以下、従来例２ｂと呼
ぶ。）に記載されるようなものが代表的である。Next, various types of off-line character recognition devices have been proposed. Particularly, if a neural network is used, it has an advantage that one character can be cut out from a character string at the same time as character recognition. In such an earlier application, Japanese Patent Application Laid-Open No. 3-163684 (hereinafter referred to as Conventional Example 2a
Call. ) And the theory of theory Vol. J74-D-2 No. 1
1 pp. The ones described in 1556-1564 (hereinafter referred to as Conventional Example 2b) are typical.

【０００８】従来例２ａに使用されているニューラルネ
ットワークは、ホップフィールド型である。ニューラル
ネットワークの入力視野は固定であり、結合荷重は前も
ってエネルギー関数を基に計算され決定されている。The neural network used in Conventional Example 2a is of Hopfield type. The input field of view of the neural network is fixed, and the coupling weight is calculated and determined based on the energy function in advance.

【０００９】従来例２ｂに使用されているニューラルネ
ットワークは、階層構造のフィードフォワード型であ
る。時系列データを扱うために、入力層に特別なノード
を設けて、出力層から入力層へのフィードバックを行っ
ている。The neural network used in the conventional example 2b is a feedforward type having a hierarchical structure. In order to handle time series data, a special node is provided in the input layer and feedback is performed from the output layer to the input layer.

【００１０】[0010]

【発明が解決しようとする課題】しかしながら、従来の
文字認識装置には以下に述べるような課題が有った。However, the conventional character recognition device has the following problems.

【００１１】まず、オンライン手書き文字認識装置にお
ける課題は以下の様である。First, the problems in the online handwritten character recognition device are as follows.

【００１２】最近のコンピュータによる情報処理の普及
にともない、最も馴染みやすいコンピュータ入力手段と
してオンライン手書き文字認識装置が用いられることが
多くなっている。しかし、従来のオンライン手書き文字
認識装置は楷書で画数・筆順を正確に書かねば認識でき
ないという制限が有り、より人間の負担を少なくするた
めには、ある程度の続け字までを認識できることが不可
欠であるとされている。また、続け書きの英語筆記体文
字列を認識することも英語圏においては不可欠である。With the recent spread of information processing by a computer, an online handwritten character recognition device is often used as the most familiar computer input means. However, the conventional online handwritten character recognition device has the limitation that it cannot be recognized unless the stroke number and stroke order are accurately written in a regular pattern, and in order to reduce the burden on humans, it is essential to be able to recognize a certain number of consecutive characters. It is said that. It is also essential in the English-speaking world to recognize continuous cursive English cursive character strings.

【００１３】従来例１ａにおいて、基本ストロークは、
楷書の１画分に対応するように定められている。従って
本来ペンのアップが間に入って分離して書かれるべき複
数の基本ストロークが続けて書き下された場合（以下、
連続ストロークと呼ぶ。）、容易にストロークの誤認識
が発生する。つまり、原理的に基本ストローク法では連
続ストロークのある文字（以下、続け字と呼ぶ。）の認
識はできない。In the conventional example 1a, the basic stroke is
It is specified to correspond to one section of a regular print. Therefore, when a plurality of basic strokes, which should be written separately with the pen-up inserted in between, are written down in succession (hereinafter,
Call it a continuous stroke. ), Stroke misrecognition easily occurs. That is, in principle, the basic stroke method cannot recognize characters with continuous strokes (hereinafter, referred to as continuous characters).

【００１４】そこで、上に述べたような欠点を克服する
ために、いくつかの手法が考えられる。Therefore, several methods are conceivable in order to overcome the above-mentioned drawbacks.

【００１５】一つは、連続ストロークを新たな基本スト
ロークとして登録し、辞書にはそれまでの基本ストロー
クと新たな基本ストロークを並列に記述するという手法
である。しかし、用意できる新たな基本ストロークには
限度が有り、続け字の許容度と辞書記述量や続け字の許
容度とストローク認識時間との間にはトレードオフが存
在する。One is a method in which a continuous stroke is registered as a new basic stroke, and the basic stroke up to that point and the new basic stroke are described in parallel in the dictionary. However, there is a limit to the number of new basic strokes that can be prepared, and there is a trade-off between the allowance of consecutive characters and the dictionary description amount, and the allowance of consecutive characters and stroke recognition time.

【００１６】２つめは、特開平４−３２４３１（以後、
従来例１ｃと呼ぶ。）に記載されるように、辞書中には
１文字の全てのストロークをつなげたパターン（以後、
一筆書きパターンと呼ぶ。）を登録し、入力筆跡からも
補完によって１筆書きパターンを生成し、これらをＤＰ
マッチングにより比較する手法である。The second is Japanese Patent Laid-Open No. 4-32431 (hereinafter,
This is called Conventional Example 1c. ), A pattern in which all strokes of one character are connected in the dictionary (hereinafter,
It is called a one-stroke pattern. ) Is registered, one writing pattern is generated from the input handwriting by complementing, and these are written as DP.
It is a method of comparing by matching.

【００１７】しかし、この手法では１文字分のＤＰマッ
チング用の基本パターンを予想される筆順の数だけ用意
しなくてはならず、辞書容量と計算量が莫大なものに成
り易い。However, with this method, it is necessary to prepare as many basic patterns for DP matching as one character in the expected stroke order, and the dictionary capacity and calculation amount tend to be enormous.

【００１８】３つめは辞書中に通常の基本ストローク記
述とは別に、ストロークの抽出指示を含ませて、これに
より連続ストローク中から基本ストロークを抽出するこ
とで認識する手法である。この様な内容の先願には、特
開平２−５６６８９がある。この手法は先の２つめの手
法が持つ計算量が莫大になるという問題をある程度解決
できる。A third method is to recognize by extracting a basic stroke from continuous strokes by including a stroke extraction instruction in the dictionary in addition to the normal basic stroke description. Japanese Patent Application Laid-Open No. 2-56689 is a prior application having such contents. This method can solve the problem that the second method has a huge amount of calculation.

【００１９】ただし、ストロークの抽出方法にＤＰマッ
チングを用いる場合は、マッチングの開始点と終了点を
連続ストロークの中のどこに定めるかという問題があ
る。もし、連続ストローク中の全ての点をマッチングの
開始点及び終了点の対象にする場合は、処理時間が極端
に長くなってしまう。However, when DP matching is used as the stroke extraction method, there is a problem of where to set the matching start point and end point in the continuous stroke. If all the points in the continuous stroke are targeted for the matching start and end points, the processing time becomes extremely long.

【００２０】また、ＤＰマッチングを用いない場合は、
手書きの変形を吸収しながら曲部のあるストロークを抽
出することは難しい。前記先願では、直線ストロークの
みを抽出するように限定しているが、漢字以外の文字の
認識には不十分である。When DP matching is not used,
It is difficult to extract strokes with curved parts while absorbing handwriting deformation. In the previous application, only linear strokes are extracted, but this is insufficient for recognition of characters other than Chinese characters.

【００２１】従来例１ｂにおいても、使用している特徴
による制限から、筆順は自由であるが、画数変動さらに
は続け書きは不可であることが推察できる。一筆書きパ
ターンを基本パターンとして学習させ、入力も一筆書き
パターン化することで、続け字に対応できるようになる
と考えられるが、対応するべき続け書きのパターンが筆
順の数だけ存在するという問題は先に述べた２つめの場
合と同じである。In the prior art example 1b as well, it can be inferred that the stroke order is free, but fluctuations in the number of strokes and continuous writing are not possible due to the limitation of the characteristics used. By learning the one-stroke writing pattern as a basic pattern and converting the input into the one-stroke writing pattern, it may be possible to deal with continuous characters. However, the problem that there are as many continuous writing patterns as there are stroke orders to deal with This is the same as the second case described above.

【００２２】また、続け書きの英語筆記体文字列の認識
には、上に述べたいずれの手法も適していない。Further, none of the above-mentioned methods is suitable for recognizing a continuous writing English cursive character string.

【００２３】従来のオンライン手書き文字認識装置には
以上に述べたような問題が有り、計算量、辞書容量をあ
まり増加させず、かつ続け字や連続筆記の英語筆記体文
字列を認識できるオンライン手書き文字認識装置を提案
するという課題があった。The conventional online handwritten character recognition device has the above-mentioned problems, and online handwriting that does not increase the calculation amount and the dictionary capacity and can recognize continuous cursive and continuous cursive English cursive character strings. There was a problem of proposing a character recognition device.

【００２４】次に、オフライン文字認識装置における課
題は以下の様である。Next, the problems in the off-line character recognition device are as follows.

【００２５】従来例２ａでは、学習による、ニューラル
ネットワークの最適化や認識性能の向上が不可能であ
る。また、ホップフィールド型ニューラルネットワーク
に対して一般に指摘されているようなローカルミニマム
問題も無視できない。In the conventional example 2a, it is impossible to optimize the neural network and improve the recognition performance by learning. Also, the local minimum problem, which is generally pointed out for Hopfield neural networks, cannot be ignored.

【００２６】従来例２ｂには、学習の収束が極端に難し
いという問題がある。これは、出力から入力へのフィー
ドバックを持つ階層構造のフィードフォワード型ニュー
ラルネットワークに対して、一般に指摘されている問題
である。さらに、出力層に複数のノードを設けてどれか
１つが興奮することで認識結果を示す形式であるため
に、もし、新たな認識対象文字をニューラルネットワー
クに付加える必要が生じた場合は、ニューラルネットワ
ーク全体を再学習しなおさなければならないので、時間
がかかる。また、階層構造であるために、連想能力は小
さく、入力文字中にノイズや欠落がある場合には認識性
能が大幅に低下してしまうという問題もある。従来の
オフライン文字認識装置には以上に述べたような問題が
有り、ニューラルネットワークを簡単に生成でき、連想
能力による耐ノイズ性を持ち合せたオフライン文字認識
装置を提案するという課題があった。The conventional example 2b has a problem that the convergence of learning is extremely difficult. This is a problem generally pointed out for a hierarchical feed-forward neural network having feedback from output to input. Further, since the recognition result is obtained by providing a plurality of nodes in the output layer and one of the nodes is excited, if a new recognition target character needs to be added to the neural network, It takes time because the entire network has to be relearned. Further, because of the hierarchical structure, the associative ability is small, and there is a problem that the recognition performance is significantly reduced when there is noise or omission in the input character. The conventional off-line character recognition device has the above-mentioned problems, and has a problem of proposing an off-line character recognition device that can easily generate a neural network and has noise resistance due to the associative ability.

【００２７】[0027]

【課題を解決するための手段】本発明は、以上述べたよ
うな課題を解決するために、以下に述べる事を特徴とす
る。The present invention is characterized by the following points in order to solve the above-mentioned problems.

【００２８】１）ニューラルネットワークを用いた文字
認識装置において、ａ）認識対象文字の時系列データを生成する文字入力手
段を備え、ｂ）前記ニューラルネットワークは、内部状態値を記憶
する記憶手段、記憶されている内部状態値と外部入力値
により内部状態値を更新する更新手段、及び内部状態値
から外部出力値を生成する生成手段を持つ神経細胞様素
子から構成され、ｃ）前記時系列データを前記ニューラルネットワークの
入力として用いる事。２）１）の文字認識装置において、ａ）時系列データから予め定められた一つ又は複数個の
基本ストロークを抽出するストローク抽出手段と、ｂ）認識候補の各文字が基本ストロークによって記述さ
れている辞書と、ｃ）前記抽出後のストロークと辞書を比較して認識結果
を出力する比較手段とを備え、ｄ）文字入力手段は認識対象文字の筆跡を発生順に時系
列データ化し、ｅ）前記ストローク抽出手段は、ニューラルネットワー
クを含み、ｆ）前記ニューラルネットワークは、複数の連続したス
トロークから、１つ又は複数の基本ストロークを検出す
るように学習させてある事。1) A character recognition device using a neural network: a) character input means for generating time-series data of a character to be recognized; b) the neural network, a storage means for storing internal state values, and a memory. The internal state value and the external input value are used to update the internal state value, and a neuron-like element having a generating means for generating the external output value from the internal state value. C) The time series data Use as input to the neural network. 2) In the character recognition device of 1), a) stroke extracting means for extracting one or more predetermined basic strokes from time series data, and b) each character of the recognition candidate is described by the basic strokes. A dictionary that includes: c) a comparison unit that outputs the recognition result by comparing the extracted stroke with the dictionary; d) the character input unit converts the handwriting of the recognition target character into time series data in the order of occurrence; The stroke extraction means includes a neural network, and f) the neural network is trained to detect one or more basic strokes from a plurality of continuous strokes.

【００２９】３）２）の文字認識装置において、文字入
力手段は、認識対象文字の筆跡を折れ線近似した後、各
々の折れ線の方向を所定の分割数で量子化して一連の方
向コード列を生成する事４）２）の文字認識装置において、文字入力手段は、認
識対象文字の筆跡を折れ線近似した後、折れ線の節点座
標を生成する事。3) In the character recognition device of 2), the character input means approximates the handwriting of the recognition target character with a polygonal line, and then quantizes the direction of each polygonal line with a predetermined division number to generate a series of directional code strings. 4) In the character recognition device of 2), the character input means should approximate the handwriting of the recognition target character with a polygonal line and then generate the nodal coordinates of the polygonal line.

【００３０】５）１）の文字認識装置において、文字入
力手段は、認識対象文字の筆跡上で所定の大きさのウィ
ンドウを走査しながら、ウィンドウ内に現れた部分筆跡
を時系列データとして入力する事。5) In the character recognition device of 1), the character input means inputs a partial handwriting appearing in the window as time series data while scanning a window of a predetermined size on the handwriting of the character to be recognized. Thing.

【００３１】[0031]

【実施例】以下に、本発明の実施例を図面に基づいて説
明する。Embodiments of the present invention will be described below with reference to the drawings.

【００３２】（実施例１）まず、図１は本発明の文字認
識装置の基本的な構成図である。(Embodiment 1) First, FIG. 1 is a basic block diagram of a character recognition apparatus of the present invention.

【００３３】文字入力手段１０１により入力された時系
列の文字データは、ニューラルネットワーク１０２へ入
力され、その出力が認識結果出力手段１０３によって解
析され、認識結果文字コードとなる。１０４はニューラ
ルネットワークを構成するノードの一つである。The time-series character data input by the character input means 101 is input to the neural network 102, and the output thereof is analyzed by the recognition result output means 103 and becomes a recognition result character code. Reference numeral 104 is one of the nodes forming the neural network.

【００３４】文字入力手段にはタブレットを用いるのが
一般的であり、ここからは文字筆跡の時系列のＸＹ座標
データが出力される。一般にそのままでは認識用データ
として冗長過ぎるため、折れ線近似を行う。その後、各
折れ線の始点、終点の座標を新たな時系列データとして
ストローク抽出手段に送っても良いし、各折れ線の方向
を所定の分割数で量子化して一連の方向コード列に変換
してから送っても良い。方向コード化によれば文字が書
かれた位置によらず認識可能となる。A tablet is generally used as the character input means, and from this, time-series XY coordinate data of the character handwriting is output. Generally, since it is too redundant as the recognition data as it is, the polygonal line approximation is performed. After that, the coordinates of the start point and the end point of each polygonal line may be sent to the stroke extraction means as new time series data, or the direction of each polygonal line may be quantized by a predetermined number of divisions and then converted into a series of direction code strings. You can send it. The direction coding enables recognition regardless of the position where the character is written.

【００３５】ニューラルネットワークの構成は、入力層
のノード数が入力文字データの次元数に等しく、隠れ層
ノード数は８〜３２、出力層ノード数は２（肯定出力と
否定出力）というものが一例に挙げられる。ニューラル
ネットワーク１つで１つの認識対象文字に対応してお
り、もしも入力された文字データが、ニューラルネット
ワークの対応している文字に一致すれば出力層ノードが
肯定出力を立ち上げ否定出力を立ち下げ、一致しなけれ
ば肯定出力を立ち下げ否定出力を立ち上げるように学習
されている。従って、文字認識時には、認識候補文字毎
にニューラルネットワークの結合荷重をセットし直して
入力文字データを入れ、出力を見る事を繰り返し、肯定
出力が立ち否定出力が落ちた文字を認識結果とする。装
置規模が許せば、各認識候補文字毎にニューラルネット
ワークを持ち、入力文字データを全てのニューラルネッ
トワークに同時に入力して並列的に処理を行うこともで
きる。An example of the configuration of the neural network is that the number of nodes in the input layer is equal to the number of dimensions of the input character data, the number of hidden layer nodes is 8 to 32, and the number of output layer nodes is 2 (positive output and negative output). Are listed in. One neural network corresponds to one recognition target character, and if the input character data matches the corresponding character of the neural network, the output layer node raises the positive output and lowers the negative output. , If they do not match, it is learned to lower the positive output and raise the negative output. Therefore, at the time of character recognition, the connection weight of the neural network is reset for each recognition candidate character, the input character data is input, and the output is repeated, and the character for which the positive output is raised and the negative output is dropped is set as the recognition result. If the device scale permits, a neural network can be provided for each recognition candidate character, and input character data can be input to all neural networks at the same time and processed in parallel.

【００３６】図２は本発明のニューラルネットワークの
一つのノードの説明図である。FIG. 2 is an explanatory diagram of one node of the neural network of the present invention.

【００３７】ノード１１４は内部に内部状態値更新手段
１１１と内部状態値記憶手段１１２、及び出力値生成手
段１１３からなり、内部状態値記憶手段１１２から内部
状態値更新手段１１１へのフィードバック結合を持って
いる。この様な構成により、ノードへの他のｉ個のノー
ドからの入力を各々Ｉ（ｉ）、バイアス入力をＤ、入力
Ｉ（ｉ）に対する結合荷重を各々Ｗ（ｉ）、現在の内部
状態値をＸ、定数をｋ、このノードの出力値をＹ、出力
関数をｆ（ｘ）、時間をｔとした場合、このノードの動
作は数１、数２で記述される。The node 114 comprises an internal state value updating means 111, an internal state value storing means 112, and an output value generating means 113, and has a feedback connection from the internal state value storing means 112 to the internal state value updating means 111. ing. With this configuration, the inputs from the other i nodes to the node are I (i), the bias input is D, the coupling weight for the input I (i) is W (i), and the current internal state value is Is X, a constant is k, an output value of this node is Y, an output function is f (x), and time is t, the operation of this node is described by Equations 1 and 2.

【００３８】[0038]

【数１】 [Equation 1]

【００３９】[0039]

【数２】 [Equation 2]

【００４０】また、ｆ（ｘ）としては、数３のようなシ
グモイド関数が適している。Further, as f (x), a sigmoid function as expressed by the equation 3 is suitable.

【００４１】[0041]

【数３】 [Equation 3]

【００４２】本発明中のニューラルネットワークは、ノ
ード内部にフィードバック結合を持つことが、従来例１
ｂ、２ａ、２ｂと大きく異なる点であり、このような構
成によって、ノード単体に時系列データを処理する能力
を持たせることができるので、従来の装置よりも時系列
データに対する処理能力が高くなるのである。（前記従
来例のノード自体には時系列データを処理する能力はな
い。）図３は、本発明の文字認識装置を、連続筆記された英語
筆記体から”ｂ”の文字を切り出すと同時に認識を行う
オンライン手書き文字認識に応用する場合の、ニューラ
ルネットワークへの学習データの与え方を説明する図で
ある。学習データは肯定出力を出すべき文字を○、否定
出力を出すべき文字を×で表すと、××、○×、×○が
全認識対象文字をカバーするように作る。本実施例で
は、○を”ｂ”１２３、×を”ａ”１２４とした場合を
示している。The neural network according to the present invention has a feedback connection inside the node, which is a conventional example 1.
b, 2a, and 2b, which is a significant difference, and with such a configuration, a single node can have the ability to process time-series data, so that the processing capability for time-series data is higher than that of a conventional device. Of. (The node of the conventional example itself does not have the ability to process time-series data.) FIG. 3 shows the character recognition device of the present invention at the same time as cutting out a character "b" from a continuously written English cursive. It is a figure explaining how to give the learning data to a neural network, when applying it to the online handwritten character recognition which performs. The learning data is created so that a character that should output a positive output is represented by O and a character that should output a negative output by X, so that XX, XX, and XX cover all recognition target characters. In the present embodiment, the case where the circle is “b” 123 and the circle is “a” 124 is shown.

【００４３】図４は、本発明の文字認識装置において、
連続筆記された英語筆記体”ｃｂａ”をニューラルネッ
トワークに与えた場合の出力例を示す図である。入
力、”文字列中の”ｂ”のほぼ中央で肯定出力１３２が
立ち上がり、否定出力１３１が立ち下がっているの
で、”ｂ”がこの位置に存在することを検出している。FIG. 4 shows the character recognition device of the present invention.
It is a figure which shows the output example at the time of giving the continuously written English cursive "cba" to a neural network. Since the positive output 132 rises and the negative output 131 falls almost at the center of the input "b" in the character string, it is detected that "b" exists at this position.

【００４４】この様に、本発明によれば従来の発明では
不可能であった続け書きの英語筆記体文字列を認識する
ことが可能となる。As described above, according to the present invention, it is possible to recognize a continuous writing English cursive character string, which was impossible in the conventional invention.

【００４５】漢字の続け字を認識するには、従来例１ｃ
と同様にして、学習時の文字データに一筆書きパターン
を用い、認識対象文字の筆跡も１筆書きパターン化して
入力することが必要になる。この場合はニューラルネッ
トワークを予想される筆順の数だけ用意しなくてはなら
ない。In order to recognize the continuous characters of the kanji, the conventional example 1c is used.
Similarly, it is necessary to use a one-stroke writing pattern for the character data at the time of learning and input the handwriting of the recognition target character in one-stroke writing pattern. In this case, it is necessary to prepare the neural networks in the expected stroke order.

【００４６】（実施例２）図５は、本発明の文字認識装
置を基本ストローク法を用いたオンライン手書き文字認
識に応用した場合の構成図である。(Embodiment 2) FIG. 5 is a block diagram showing a case in which the character recognition device of the present invention is applied to online handwritten character recognition using the basic stroke method.

【００４７】文字入力手段２０１によって続け字が入力
され、そのデータはストローク認識手段２０２に送られ
る。前記ストローク認識手段ではニューラルネットワー
ク２０４を用いて続け字中の連続ストロークより基本ス
トロークが抽出され、比較手段２０３に送られる。前記
比較手段では辞書２０５中を走査しながら、基本ストロ
ーク記述と抽出された基本ストロークとを比較して、最
も良く一致する字を認識結果として出力する。Continuous characters are input by the character input means 201, and the data is sent to the stroke recognition means 202. The stroke recognizing means uses the neural network 204 to extract basic strokes from continuous strokes in consecutive characters, and sends the basic strokes to the comparing means 203. The comparing means compares the basic stroke description with the extracted basic stroke while scanning the dictionary 205, and outputs the best matching character as a recognition result.

【００４８】連続ストロークからの基本ストローク抽出
について、図５及び図６を用いて、詳しく説明する。Extraction of basic strokes from continuous strokes will be described in detail with reference to FIGS. 5 and 6.

【００４９】ニューラルネットワークは”は”の第３ス
トローク２１１を抽出するように学習させる。学習方法
については、実施例１、図３と同様である。そのような
ニューラルネットワークに続け字の”は”２１２が入力
されると、図３のような肯定出力２２１、否定出力２２
２が出る。ストローク認識手段は前記出力から”は”の
第３ストロークがあったことが判り、同時に出力が立っ
ている位置から、入力ストローク中の”は”の第３スト
ロークの位置を逆に推定し、比較手段へ”は”の第３ス
トロークがあったことを示すフラグと前記位置を送る。The neural network is trained to extract the third stroke 211 of "ha". The learning method is the same as that in the first embodiment and FIG. When the character "212" is continuously input to such a neural network, the positive output 221 and the negative output 22 as shown in FIG.
2 comes out. The stroke recognizing means recognizes from the output that there is a third stroke of "ha", and at the same time, reversely estimates the position of the third stroke of "ha" in the input stroke from the position where the output is standing and compares. Sends to the means a flag indicating that there was a third stroke of "wa" and said position.

【００５０】（実施例３）図８は、本発明の文字認識装
置をオフライン文字認識装置に応用した場合の文字入力
手段とニューラルネットワークを示す図である。また、
装置全体の構成は図１と同様になっている。(Embodiment 3) FIG. 8 is a diagram showing a character input means and a neural network when the character recognition device of the present invention is applied to an off-line character recognition device. Also,
The overall configuration of the device is similar to that of FIG.

【００５１】図８において、文字列３０２を文字入力手
段に設けられたウィンドウ３０１が左側から走査する。
前記ウィンドウはさらに小区画に分かれており、各々の
小区画は自分の下に見えている文字の部分を数値データ
に変換する。この変換には、例えば小区画中に含まれる
黒画素の数を数える方法があるが、一般的な文字認識装
置における種々の文字特徴を抽出するものであっても差
し支えない。文字入力手段にイメージスキャナを用いる
場合は、ウィンドウ及び小区画はイメージセンサーの幾
つかの固りである。In FIG. 8, the window 301 provided in the character input means scans the character string 302 from the left side.
The window is further divided into subsections, each subsection converting the portion of the character visible underneath it into numerical data. For this conversion, for example, there is a method of counting the number of black pixels included in a small section, but various character features in a general character recognition device may be extracted. If an image scanner is used as the character input means, the windows and subsections are some solids of the image sensor.

【００５２】前記小区画はニューラルネットワーク３０
３の入力層の各ノードと１対１に対応しており、ウィン
ドウの走査により時系列化された文字列のデータが各ノ
ードへ入力される。前記ニューラルネットワークは実施
例１と同様にして、全認識候補文字のうち１文字を検出
するよう学習されている。The small section is a neural network 30.
There is a one-to-one correspondence with each node in the input layer of No. 3, and the data of the character string which is time-series by scanning the window is input to each node. The neural network is learned to detect one character of all recognition candidate characters in the same manner as in the first embodiment.

【００５３】１例として、ニューラルネットワークが”
Ｃ”を検出するように学習されており、そこに”ＡＢＣ
Ｄ”の文字列が入力された場合のニューラルネットワー
クの出力は図９の様になる。ウィンドウが”Ｃ”にかか
った時に肯定出力３１１が立ち上がり否定出力３１２が
立ち下がるので、”Ｃ”を検出できる。この様な出力を
認識結果出力手段が解析して、認識結果文字コードと認
識位置に変換する。As an example, the neural network is "
C has been learned to detect "A"
The output of the neural network when the character string "D" is input is as shown in Fig. 9. When the window hits "C", the positive output 311 rises and the negative output 312 falls, so "C" is detected. A recognition result output means analyzes such an output and converts it into a recognition result character code and a recognition position.

【００５４】前記装置でニューラルネットワークを全結
合型で構成すれば、ニューラルネットワークの連想能力
により、ノイズや欠落のある文字筆跡に対しては認識率
が極端に落ちるといった従来の認識装置の欠点を大幅に
改善できる。If the neural network is fully connected in the above apparatus, the associative ability of the neural network greatly reduces the recognition rate of the conventional recognition apparatus such that the recognition rate is extremely low for the handwriting with noise or missing characters. Can be improved.

【００５５】[0055]

【発明の効果】本発明の効果を以下に列挙する。The effects of the present invention are listed below.

【００５６】オンライン手書き文字認識装置に関して
は、１）入力文字をニューラルネットワークに１度与えるだ
けで、文字認識もしくはストロークの抽出が可能なの
で、ＤＰマッチングに比して計算量が少ない。Regarding the on-line handwritten character recognition device: 1) Character recognition or stroke extraction can be performed by giving an input character to the neural network only once, so the amount of calculation is smaller than that of DP matching.

【００５７】２）ニューラルネットワークにストローク
の抽出を行わせる方法は、従来の基本ストローク法を拡
張するだけで済み、基本ストローク法の高速、辞書容量
が少ないといった特長はそのままで、続け字の認識を可
能にする。特に曲部のあるストロークも高速に抽出でき
るのが特長である。2) The method of causing the neural network to extract strokes only needs to extend the conventional basic stroke method. The basic stroke method has the advantages of high speed and small dictionary capacity, and continuous character recognition is possible. to enable. In particular, the feature is that strokes with curved parts can be extracted at high speed.

【００５８】オフライン文字認識装置に関しては、１）本発明のニューラルネットワークは、出力層から入
力層へのフィードバックによって時系列データを扱える
ように構成した階層構造のフィードフォワード型ネット
よりも、学習の収束が速く、かつ収束させやすい。Regarding the off-line character recognition device: 1) The neural network of the present invention converges learning more than a hierarchical feed-forward type net configured to handle time series data by feedback from the output layer to the input layer. Is fast and easy to converge.

【００５９】２）認識対象の１文字毎に別々のニューラ
ルネットワークを用いれば、認識対象文字が増えた時で
も新しいニューラルネットワークを付加えるだけで済
み、学習済みのニューラルネットワーク全体の再学習の
必要はない。2) If a different neural network is used for each character to be recognized, it is only necessary to add a new neural network even when the number of characters to be recognized increases, and it is not necessary to re-learn the entire learned neural network. Absent.

【００６０】３）全結合型の構成を取ることができるの
で、連想能力においてもホップフィールド型に劣ること
が無く、入力文字の欠損、ノイズの重畳に対しても認識
率が落ちにくい。3) Since a fully-combined structure can be adopted, the associative ability is not inferior to that of the Hopfield type, and the recognition rate is less likely to drop even when the input character is lost or noise is superimposed.

[Brief description of drawings]

【図１】本発明の文字認識装置の基本的な構成図であ
る。FIG. 1 is a basic configuration diagram of a character recognition device of the present invention.

【図２】本発明のニューラルネットワークの一つのノー
ドの説明図である。FIG. 2 is an explanatory diagram of one node of the neural network of the present invention.

【図３】本発明のニューラルネットワークへの学習デー
タの与え方を説明する図である。FIG. 3 is a diagram for explaining how to provide learning data to the neural network of the present invention.

【図４】本発明のニューラルネットワークの出力例を示
す図である。FIG. 4 is a diagram showing an output example of the neural network of the present invention.

【図５】本発明の文字認識装置を基本ストローク法を用
いたオンライン手書き文字認識に応用した場合の構成図
である。FIG. 5 is a configuration diagram when the character recognition device of the present invention is applied to online handwritten character recognition using the basic stroke method.

【図６】学習用の基本ストロークと入力用の連続ストロ
ークについて説明する図である。FIG. 6 is a diagram illustrating a basic stroke for learning and a continuous stroke for input.

【図７】本発明の連続ストロークからの基本ストローク
の抽出法について説明する図である。FIG. 7 is a diagram illustrating a method of extracting a basic stroke from a continuous stroke according to the present invention.

【図８】本発明の文字認識装置をオフライン文字認識装
置に応用した場合の文字入力手段とニューラルネットワ
ークを示す図である。FIG. 8 is a diagram showing a character input unit and a neural network when the character recognition device of the present invention is applied to an off-line character recognition device.

【図９】本発明のニューラルネットワークの出力例を示
す図である。FIG. 9 is a diagram showing an output example of the neural network of the present invention.

[Explanation of symbols]

101 文字入力手段 102 ニューラルネットワーク 103 認識結果出力手段 104 ノード 111 内部状態値更新手段 112 内部状態値記憶手段 113 出力値生成手段 114 ノード 121 肯定出力 122 否定出力 123 文字”ｂ”（筆記体） 124 文字”ａ”（筆記体） 131 否定出力 132 肯定出力 201 文字入力手段 202 ストローク認識手段 203 比較手段 204 ニューラルネットワーク 205 辞書 211 学習用”は”（楷書） 212 入力用”は”（続け字） 221 肯定出力 222 否定出力 301 ウィンドウ 302 入力文字列 303 ニューラルネットワーク 311 肯定出力 312 否定出力 101 character input means 102 neural network 103 recognition result output means 104 node 111 internal state value update means 112 internal state value storage means 113 output value generation means 114 node 121 positive output 122 negative output 123 characters "b" (cursive) 124 characters "A" (cursive) 131 Negative output 132 Positive output 201 Character input means 202 Stroke recognition means 203 Comparison means 204 Neural network 205 Dictionary 211 For learning "(regular) 212 For input" (continuation) 221 Positive Output 222 Negative output 301 Window 302 Input string 303 Neural network 311 Positive output 312 Negative output

Claims

[Claims]

1. A character recognition device using a neural network, comprising: a) character input means for generating time series data of a character to be recognized, and b) the neural network, storage means for storing internal state values, and storage. The internal state value and the external input value are used to update the internal state value, and a neuron-like element having a generating means for generating the external output value from the internal state value. C) The time series data A character recognition device which is used as an input to the neural network.

2. A) stroke extracting means for extracting one or more predetermined basic strokes from time-series data, b) a dictionary in which each character of a recognition candidate is described by the basic strokes, and c. ) Comparing the stroke after the extraction with a dictionary and outputting a recognition result, d) The character inputting means converts the handwriting of the recognition target character into time series data in the order of occurrence, and e) The stroke extracting means The character recognition device according to claim 1, further comprising: a neural network; and f) the neural network is trained to detect one or a plurality of basic strokes from a plurality of continuous strokes.

3. The character input means, after approximating the handwriting of the recognition target character with a polygonal line, quantizes the direction of each polygonal line with a predetermined division number to generate a series of directional code strings. 1. The character recognition device according to 1.

4. The character recognition device according to claim 1, wherein the character input means approximates the handwriting of the recognition target character with a broken line and then generates the nodal coordinates of the broken line.

5. The character input means inputs a partial handwriting appearing in the window as time series data while scanning a window of a predetermined size on the handwriting of the recognition target character. Character recognition device described in.