WO2008020557A1 - Hand-written character recognizing method, hand-written character recognizing system, hand-written character recognizing program, and storage medium - Google Patents

Hand-written character recognizing method, hand-written character recognizing system, hand-written character recognizing program, and storage medium Download PDF

Info

Publication number
WO2008020557A1
WO2008020557A1 PCT/JP2007/065458 JP2007065458W WO2008020557A1 WO 2008020557 A1 WO2008020557 A1 WO 2008020557A1 JP 2007065458 W JP2007065458 W JP 2007065458W WO 2008020557 A1 WO2008020557 A1 WO 2008020557A1
Authority
WO
WIPO (PCT)
Prior art keywords
point
angle
node
character
series
Prior art date
Application number
PCT/JP2007/065458
Other languages
French (fr)
Japanese (ja)
Inventor
Shunji Mori
Tomohisa Matsushita
Original Assignee
Kite Image Technologies Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Kite Image Technologies Inc. filed Critical Kite Image Technologies Inc.
Publication of WO2008020557A1 publication Critical patent/WO2008020557A1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/32Digital ink
    • G06V30/333Preprocessing; Feature extraction
    • G06V30/347Sampling; Contour coding; Stroke extraction

Definitions

  • Handwriting character recognition method handwriting character recognition system, handwriting character recognition program and storage medium
  • the present invention relates to a handwritten character recognition method and handwritten character recognition system for performing online handwritten character recognition, a handwritten character recognition program for realizing the recognition method, and a storage medium storing the program.
  • the pattern matching method is roughly divided into two types. As described in the journal of the Institute of Electronics, Information and Communication Engineers, J63-D, 2, pp. 153-160, "On-line recognition of handwritten characters by point approximation of strokes", approximating strokes with a few points
  • the motion direction of the brush at the end point is estimated with the feature point as the feature point, and they are also made into special focus to construct a feature vector.
  • the dictionary is decomposed into strokes, and they have feature vectors as well, and the input vector is correlated with the feature vector prepared for each category, the distance is calculated for the corresponding dictionary, and the minimum distance is calculated.
  • the given dictionary name is the recognized character name, and it is basically free for stroke order and stroke count.
  • an image seen by a human being is two-dimensional force, which is strictly on the time axis, and is one-dimensional. That is, it can be expressed as a simple one-dimensional linear graph. This point of view dramatically simplifies the problem. Moreover, due to the winding angle, natural clipping candidate points are arranged in a negative manner on the time axis.
  • the winding angle and the linear (one-dimensional) graph are the core of the present invention.
  • the present invention was made in view of force, and basically belongs to the structural analysis method described above, but overcomes the problems so far and is flexible structural analysis It provides the basis for the method, and therefore aims to avoid symbolization problems, represent structures analogically, and perform flexible and simple matching with standards.
  • input handwritten character strings are captured for each stroke by parameter expression, and each line is subjected to polygonal line approximation, and each polygonal line is approximated by polygonal line.
  • the angle between the reference axis and each polygonal line is determined as a polygonal line series as the vector extending from the start point to the end point, and the external angle series of each vertex of the obtained polygonal line is determined.
  • the sum of the external angles of the same code where the codes are continuous is taken as a winding angle series, and based on the feature extraction by each series obtained, the point which is a reference point is no
  • the graph representation is given by the attribute as the point of the node and the attribute as the edge between the nodes, and the start point and the end point are not particularly defined! /
  • the present invention when recognizing on-line handwritten characters, it is flexible and noise at the end, even for characters that maintain normal upper / lower and left / right relationships, and also when rotation invariance is required. In the same way as the recognition method of the character written in isolation, it makes strong recognition in the continuous character string and makes strong recognition to the character and transformation. it can.
  • FIG. 1 is a block diagram showing an example of a system according to an embodiment of the present invention.
  • FIG. 2 is a flow chart showing an example of processing of the entire character recognition according to an embodiment of the present invention.
  • FIG. 3 is a flowchart showing a detailed example of character recognition processing according to the embodiment of the present invention.
  • FIG. 4 is an explanatory drawing showing an example of polygonal line approximation according to an embodiment of the present invention.
  • FIG. 5 is an explanatory diagram showing an example of a node according to an embodiment of the present invention.
  • FIG. 6 is an explanatory drawing showing an example of polygonal line approximation according to an embodiment of the present invention.
  • FIG. 7 is an explanatory drawing showing an example of polygonal line approximation according to an embodiment of the present invention.
  • FIG. 8 is an explanatory drawing showing an example of polygonal line approximation according to an embodiment of the present invention.
  • FIG. 9 is an explanatory drawing showing an example of the relationship between nodes and sides according to an embodiment of the present invention.
  • FIG. 10 is an explanatory view showing a determination example according to an embodiment of the present invention.
  • FIG. 1 shows an example of a configuration in which each processing unit has a hardware configuration.
  • the handwritten character recognition of this example is programmed in a general-purpose arithmetic processing unit such as a personal computer device or a general-purpose arithmetic processing unit in which each processing unit is executed by a common arithmetic processing unit.
  • a general-purpose arithmetic processing unit such as a personal computer device or a general-purpose arithmetic processing unit in which each processing unit is executed by a common arithmetic processing unit.
  • writing on the paper 1 with the pen 2 detects the pen stroke (la) on the paper 1 on the pen 2 side.
  • the pen is detected by, for example, a pen 4/1
  • the camera built in 2 It does with the camera built in 2.
  • the movement of the force pen 2 itself such as an acceleration sensor may be detected.
  • the side of the paper 1 that is not detected by the pen is configured with any force panel, it is possible to detect the handwriting electrically.
  • it since it is online handwriting character recognition, it is configured to be able to judge the deterioration of the handwriting over time.
  • the handwriting data detected by these processes are sent to the input processing unit 3 to output character information : a second input is performed.
  • the input data is sent to the polygonal line approximation unit 4, the eyebrow extraction unit 5, the identification unit 6, and the identification result output unit 7, and corresponding processing is performed in each of the processing units.
  • the identification result output unit 7 performs output processing such as display of the identified characters and 'output of the character code of the identified I. Marking or printing of the identification character may be performed based on the identified character code.
  • the flow chart of FIG. 2 shows an example of the entire processing of the character recognition of this embodiment.
  • the character / graphic pattern input from the input processing unit 3 is subjected to polygonal line approximation by the polygonal line approximation unit 4 (step S12). From this approximation, the input pattern is expressed as a vector having the length, the direction angle, and the difference in the direction angle of the adjacent line as an element when viewing each line as a vector (step S13). Also, from the vector expression of the difference of the direction angle, the sum of the difference of the same sign is obtained, and as one element including the code, the vector expression named as the winding angle is obtained here.
  • step S14 features are extracted according to the situation from the polygonal line approximation representation in the feature extraction unit 5 (step S14), and a one-dimensional linear graph representation based on the extraction results of the features is given (step S15).
  • Character recognition is performed by matching the expression with a template based on an open mask configuration that does not particularly define the start point and the end point (step S16), and the character recognition result is output (step S17).
  • step S 16 it is checked that the arrangement of each node shape of the input graph expression matches (step S21). Then, the matching of the attribute as the node point is checked (step S22), and then the matching of the attribute of the edge between the nodes is checked (step S23). Finally, the matching of other attributes such as the presence or absence of the intersection point and the distance relationship between nodes is checked (step S24) and identified. If all the results of these checks match, the recognition result is OK (step S25), and if there is not even one match, it is excluded (step S26).
  • the recognition method in the present embodiment basically belongs to the structural analysis method described above, but it overcomes the problems so far and provides a basis for a flexible structural analysis method. It is a thing. Therefore, it avoids the problem of symbolization, expresses the structure in an analog manner, and performs flexible and simple matching with the standard. In addition, structural analysis is performed, so that the subject can inevitably be described properly, and the correspondence between cause and effect is clear from the human vision. Therefore, it is possible to evaluate the shape of objects such as letters, set the correct rejection range, and provide a recognition system with more human-like ability.
  • this method is very convenient for efficient character recognition in continuous character strings (eg cursive letters etc.).
  • the winding angle makes it possible to Focusing on the fact that the extraction candidate points are positively aligned in the negative, the input character string is one-dimensionally linear using the attributes of the candidate points (nodes) and the edge between the nodes (edges)
  • a graph representation is given, segmentation is performed simultaneously with recognition ("segmentation recognition"), and segmentation can be performed in the same way as the recognition method for isolated characters written before preprocessing. is there.
  • the image in Fig. 4 is a force written from the left.
  • the basic representation of these is the length of each polygonal line to be connected, the polygonal line angle, the outer corner composed of the adjacent polygonal line corners, and the same sign.
  • the length series, the angle series, the outer angle series, and the winding angle series that have the winding angle as an element, which is the sum of the outside angles of the issue.
  • a start point an end point, a point giving a winding angle of an integral multiple of 90 degrees ( ⁇ n ⁇ 90 degrees long point), a point where the sign of the winding angle changes is a “node”, and that point is used as a node attribute.
  • [s] and [e] are start and end symbols, respectively.
  • [-] is the first winding angle-90 degree point in the outer angle ( ⁇ ) series. Since the winding angle is discrete linear interpolation, it can generally be obtained in analog form as an ⁇ degree long point. 1 ⁇ [-] "one" is a left angle mark Express the issue.
  • Condition 2 -200 ⁇ (*-) ⁇ 100 & 200 ⁇ (* +) ⁇ 360 (Note: “ ⁇ (*-)” is the first "" winding angle, “ ⁇ (* +)” is the succeeding "+” winding angle)
  • Condition 3 -100 mm ⁇ (* _) K -20 & 10 ⁇ (* +) 100
  • condition 4 the values of the winding angle, at both ends of the boundary line segment.
  • condition 4 the values of the winding angle, at both ends of the boundary line segment.
  • intersection point information is used as an important edge property. That is Condition 4 and Condition 8.
  • me (* + l +) to (* + l ⁇ ) require that an intersection be present between the boundary lines at the next winding angle change point.
  • Cross the intersection with Cross nxm expressing.
  • the n and m are the numbers of intersecting polygonal lines, and ne (*-) to (* +) indicate that the intersecting polygonal lines coincide with the winding angle boundary line segments.
  • the intersection is a rotation invariant feature.
  • the mask mentioned above does not include the characteristic of full distance or length. Therefore, regardless of the length of the connection, the shape was fitted everywhere, and the cut-out recognition was made. However, on the other hand, for example, as in the case of Fig. 7, the information of two "8" sizes is lost, and there is a doubt that the difference in size can not be understood as seen by humans. However, this can actually be easily determined.
  • This input representation has position coordinates of each node. Therefore, for example, in the case of "8", the node returned 180 degrees from the "*" node which is the mask key, and in FIG. 7, this is almost an ⁇ s> node. Find the Euclidean distance between the positions of
  • This mask is made as follows.
  • Condition 12 (* + 3. * 4); 0 ⁇ 0 ⁇ Length 0.35 & 0 ⁇
  • Sign " ⁇ ” condition 1 & condition 2 & condition 3 & condition 4 & condition 5 & condition 6 & condition Condition 7 & Condition 8 & Condition 9 & Condition 10 & Condition 11 & Condition 12
  • This mask is sufficiently configured to be resistant to noise! /, And both ends are open //.
  • each length is normalized relative to the length as described above, for example, with the distance between the 180 ° long point and the 360 ° long point, this can be an arbitrary number of concatenated character sequences. Is also applicable.
  • the upper and lower irregularities are indicated by u and ⁇ , and the irregularities viewed from the left and right are indicated by ⁇ and c.
  • the upper and lower asperities and the left and right asperities simultaneously exist.
  • n + co where n and one overlap is indicated.
  • the attributes of the nodes are as described above. As the attributes of the force side, the average direction angle of the polygonal line group between the nodes is added. This is a very important feature in concavo-convex node expression. Others There is a directional variance of the polyline group indicating the degree of bending. Next important is the distance ratio between nodes. For example, referring to Figure 9, the start point must be somewhat above the end point. These can be quite large when calculated mechanically, but in practice they can not be good numbers if only the point is kept down. Here, for example, the start point ⁇ s> node Y axis value is subtracted from the Y axis value of node 5>, and the value obtained by dividing by the ratio of the length of the whole character is obtained.
  • Condition 23 ((* + 1, * + 2) Cross Y value-(*) Y value) / Height 0 ⁇ 20 ⁇ ⁇ 0 ⁇ 70 ⁇ The intersection is located approximately at the center in height.
  • Letter '8' condition 1 & condition 2 & condition 3 & condition 4 & condition 5 & condition 6 & condition 7 & condition 8 & condition 9 & condition 10 & condition 11 & condition 12 & condition 13 & condition 14 & condition 14 & condition 15 & Condition 16 & Condition 1 7 & Condition 18 & Condition 19 & Condition 20 & Condition 21 & Condition 22 & Condition 23
  • the inter-node distance ratio is actually much less than the number produced in combination. That is because it is restricted within silence by the restrictions of wrap angle, outside angle, and line angle. Also, the length restriction in the upper mask only limits the length of the polyline, which goes from left to right. In fact, because of this, very, strong noise on the edge, it becomes a mask!
  • the cut-out recognition method described above necessarily has the possibility to give multiple answers.
  • FIG. Figure 10 A specific example of this is shown in FIG. Figure 10 is written with the intention of “8”. However, “6” is hidden in this figure. If you change the word, the character form "6" is "cut out and recognized” from the original figure. Therefore, this is an inevitable result.
  • the handwritten character recognition of the present invention is substantially the same handwritten character recognition that is not limited to the processing configuration shown in FIG. To perform recognition processing with various device and system configurations. It is possible.
  • the handwritten character recognition of the present invention is converted into a program (software) to make a general-purpose personal computer.
  • the handwritten character recognition program can be stored in various storage media and distributed.
  • the character recognition may be performed on the off-line characters by the force S for the on-line characters, appropriate thinning, or contour tracing.
  • handwritten character recognition of the present invention can be basically applied to characters and symbols of any language.

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Character Discrimination (AREA)

Abstract

The problem of symbolization is avoided, the structure is expressed in an analog way, and matching with a reference can be performed flexibly and simply. Each stroke of a hand-written character string is grasped in a parameter expression. The each stroke is approximated to a broken line. The broken line drawn by broken-line approximation is treated as a vector from the starting point to the end point. The angles between a reference axis and the broken lines are determined as a broken line angle sequence. The exterior angle sequence of the vertices of the obtained broken lines is determined. The sum of the exterior angles of the continuous same sign + or - of the exterior angle sequence is defined as a winding angle sequence. On the basis of the feature extraction by the determined sequences, nodes to be used as reference points are determined. A graph expression is given from the attributes of the positions of the nodes and the attributes of the sides between the nodes. Matching with a template having an open mask structure in which the starting point and the end point are not especially defined is so performed that character recognition is flexible and robust over the noise at the ends and deformation even if the character is one that holds normal up-down and left-right relations and even if rotation invariance is required.

Description

明 細 書  Specification
手書き文字認識方法、手書き文字認識システム、手書き文字認識プログ ラム及び記憶媒体  Handwriting character recognition method, handwriting character recognition system, handwriting character recognition program and storage medium
技術分野  Technical field
[0001] 本発明は、オンライン手書き文字認識を行う手書き文字認識方法及び手書き文字 認識システム、並びにその認識方法を実現する手書き文字認識プログラム、さらにそ のプログラムを格納した記憶媒体に関する。  The present invention relates to a handwritten character recognition method and handwritten character recognition system for performing online handwritten character recognition, a handwritten character recognition program for realizing the recognition method, and a storage medium storing the program.
背景技術  Background art
[0002] 今までに多数の文字認識システムが提案され、実用化さて!/、る力 その基本原理 には二つの立場があり、一つは構造解析の立場、もう一つはパターンマッチングの立 場であり、前者は一般にはその認識システムは軽ぐしたがって、入力制限の強い場 合、即ち、画数、筆順一定、または、どちらかを一定にした対象に適応され、他方後 者は両者、画数、筆順を自由にした場合かそれに近い場合に適用されてきた。  [0002] A large number of character recognition systems have been proposed and put to practical use! /, The basic principle There are two positions, one is the position of structural analysis, and the other is the position of pattern matching. In the former, the recognition system is generally light, so the recognition system is generally light, that is, it is applied to an object with a fixed number of strokes, stroke order or constant, while the latter is both strokes. It has been applied to the case where the stroke order is free or close to it.
[0003] 構造解析の立場としては、電子通信学会論文誌, 56— D, 5, pp. 312— 319, "手書 き数字 ·片仮名文字のオンライン実時間認識"や日本国特許庁発行の特開昭 59— 1 31972号公報にあるように、基本ストローク方式と呼ばれているものがあり、ストローク を単純ストローク (4種)、複合ストローク(7種)に分類し、識別オートマトンにより認識 するものであり、簡単ではあるが、辞書の作成、続け字や、略字に対処するのに問題 があり、その発展性に問題があるとされてきた。  [0003] As a position of structural analysis, the Journal of the Institute of Electronics and Communication Engineers of Japan, 56-D, 5, pp. 312-319, "Online real-time recognition of handwritten numbers and Katakana characters" As described in JP-A-59- 1 31972, there is one called a basic stroke method, in which strokes are classified into simple strokes (four types) and combined strokes (seven types) and recognized by an identification automaton. Although it is simple, there have been problems with dictionary creation, cursing and abbreviations, and problems with their development.
[0004] ノ ターンマッチング法は大きく分けて 2種類の方法がある。一つは、電子情報通信 学会論文誌, J63 -D, 2, pp. 153 - 160, "ストロークの点近似による手書き文字のォ ンライン認識"にあるように、ストロークを少数の点で近似しそれらを特徴点として、ま た端点での筆の運動方向を推定し、それらも特焦点とし、特徴ベクトルを構成する。 辞書はストロークに分解され、それらが同様に特徴ベクトルをもち、入力ベクトルと各 カテゴリ毎に用意された特徴ベクトルとの対応をとり、対応がとれた辞書について距 離が計算され、最小の距離を与える辞書名が認識された文字名であり、基本的に筆 順,画数に対し自由である。 [0005] もう一つのパターンマッチング法があり、特徴点の対応方式の原論文として、電子 通信学会研究会論文 PRL74— 20)に記載された" Rubber String Matching法による手 書き文字認識"があり、また日本国特許庁発行の特開昭 57— 45679号公報及び特 開平 8— 24942号公報に在る如ぐ入力文字と辞書の特徴点ベクトルを DP (Dyn讓 i c Programming)方式により対応をとるものであり、これが手書き文字のオンライン認識 の主流をなしている。 [0004] The pattern matching method is roughly divided into two types. As described in the journal of the Institute of Electronics, Information and Communication Engineers, J63-D, 2, pp. 153-160, "On-line recognition of handwritten characters by point approximation of strokes", approximating strokes with a few points The motion direction of the brush at the end point is estimated with the feature point as the feature point, and they are also made into special focus to construct a feature vector. The dictionary is decomposed into strokes, and they have feature vectors as well, and the input vector is correlated with the feature vector prepared for each category, the distance is calculated for the corresponding dictionary, and the minimum distance is calculated. The given dictionary name is the recognized character name, and it is basically free for stroke order and stroke count. [0005] There is another pattern matching method, and the original article of the correspondence method of feature points is "Rubber String Matching method for handwriting recognition" described in the paper of PRL 74-20), In addition, JP-A-57-45679 and JP-A-8-24942 issued by the Japan Patent Office correspond to input character and feature point vectors of a dictionary according to DP (Dy n ic programming) method. This is the mainstream of online recognition of handwritten characters.
[0006] しかし、これらの方式は、いずれも複雑で、性能という面では実用的な水準を十分 に満たして!/、るとは言!/、がたレ、のが実情である。  [0006] However, these methods are all complicated and satisfy the practical level in terms of performance sufficiently! /, It is the reality!
柔軟な構造的マッチングとしては、文字認識よりは、むしろ情景認識の分野で、広く 、研究がなされた。しかし、それらは 2次元上の特徴の配置と、それらの関係を一般 的に表現する 2次元グラフ上でのマッチングであった。これに関しては、実に膨大な 研究がある。  As flexible structural matching, research has been widely conducted in the field of scene recognition rather than character recognition. However, they are the arrangement of features in two dimensions and matching on a two-dimensional graph that generally expresses their relationship. There is a great deal of research on this.
[0007] 一方、オンライン文字認識では、人間が見てのイメージは 2次元である力 厳密に 時間軸上に乗っていて、 1次元である。すなわち、単なる 1次元の線形グラフで表現 できるのである。この観点力 問題を劇的に単純化するのである。しかも、巻き角によ り、時間軸上に、自然な切り出し候補点が陰に陽に整列している。この巻き角と線形( 1次元)グラフとが、本件発明の核心となる。  On the other hand, in online character recognition, an image seen by a human being is two-dimensional force, which is strictly on the time axis, and is one-dimensional. That is, it can be expressed as a simple one-dimensional linear graph. This point of view dramatically simplifies the problem. Moreover, due to the winding angle, natural clipping candidate points are arranged in a negative manner on the time axis. The winding angle and the linear (one-dimensional) graph are the core of the present invention.
[0008] 本発明は力、かる点に鑑みてなされたものであり、基本的には上に述べた構造解析 的手法に属し、しかし、今までの問題点を克服し、柔軟な構造解析的手法の基礎を 与えるものであり、それ故、シンボル化の問題を回避し、アナログ的に構造を表現し、 標準との柔軟かつ単純なマッチングを行うことを目的とする。  [0008] The present invention was made in view of force, and basically belongs to the structural analysis method described above, but overcomes the problems so far and is flexible structural analysis It provides the basis for the method, and therefore aims to avoid symbolization problems, represent structures analogically, and perform flexible and simple matching with standards.
発明の開示  Disclosure of the invention
[0009] 本発明は、オンライン手書き文字を認識する場合において、入力された手書き文字 列を、各画毎にパラメータ表現でとらえ、各画毎に折れ線近似を行い、その折れ線近 似された各折れ線を、始点から終点にいたるベクトルとして、基準となる軸と各折れ線 とのなす角度を折れ線角系列として求め、得られた折れ線の各頂点の外角系列を求 め、外角系列のプラス又はマイナスの同じ符号が連続する同符号の外角の和を、巻 き角系列とし、求められた各系列による特徴抽出を基にして、基準となる点であるノー ドを求め、そのノードの点としての属性とノード間の辺としての属性により、グラフ表現 を与え、始点と終点を特に規定しな!/、開いたマスク構成によるテンプレートとのマッチ ングにより、文字認識を行うものである。 In the present invention, in recognizing online handwritten characters, input handwritten character strings are captured for each stroke by parameter expression, and each line is subjected to polygonal line approximation, and each polygonal line is approximated by polygonal line. The angle between the reference axis and each polygonal line is determined as a polygonal line series as the vector extending from the start point to the end point, and the external angle series of each vertex of the obtained polygonal line is determined. The sum of the external angles of the same code where the codes are continuous is taken as a winding angle series, and based on the feature extraction by each series obtained, the point which is a reference point is no The graph representation is given by the attribute as the point of the node and the attribute as the edge between the nodes, and the start point and the end point are not particularly defined! / The character by the matching with the template by the open mask configuration It is to recognize.
[0010] 本発明によると、オンライン手書き文字を認識する場合に、普通の上下、左右の関 係を保持した文字に対しても、また回転不変を求められる場合においても、柔軟で、 端のノイズや変形に強い頑健な認識を行い、また連続した文字列においても、それら を前処理として切り出すことなぐ孤立して書かれている文字の認識法と同じ方法で 言忍識を fiうこと力 Sできる。 According to the present invention, when recognizing on-line handwritten characters, it is flexible and noise at the end, even for characters that maintain normal upper / lower and left / right relationships, and also when rotation invariance is required. In the same way as the recognition method of the character written in isolation, it makes strong recognition in the continuous character string and makes strong recognition to the character and transformation. it can.
図面の簡単な説明  Brief description of the drawings
[0011] [図 1]本発明の一実施の形態によるシステム例を示す構成図である。  FIG. 1 is a block diagram showing an example of a system according to an embodiment of the present invention.
[図 2]本発明の一実施の形態による文字認識全体の処理例を示すフローチャートで ある。  FIG. 2 is a flow chart showing an example of processing of the entire character recognition according to an embodiment of the present invention.
[図 3]本発明の一実施の形態による文字認識処理の詳細例を示すフローチャートで ある。  FIG. 3 is a flowchart showing a detailed example of character recognition processing according to the embodiment of the present invention.
[図 4]本発明の一実施の形態による折れ線近似の例を示す説明図である。  FIG. 4 is an explanatory drawing showing an example of polygonal line approximation according to an embodiment of the present invention.
[図 5]本発明の一実施の形態によるノードの例を示す説明図である。  FIG. 5 is an explanatory diagram showing an example of a node according to an embodiment of the present invention.
[図 6]本発明の一実施の形態による折れ線近似の例を示す説明図である。  FIG. 6 is an explanatory drawing showing an example of polygonal line approximation according to an embodiment of the present invention.
[図 7]本発明の一実施の形態による折れ線近似の例を示す説明図である。  FIG. 7 is an explanatory drawing showing an example of polygonal line approximation according to an embodiment of the present invention.
[図 8]本発明の一実施の形態による折れ線近似の例を示す説明図である。  FIG. 8 is an explanatory drawing showing an example of polygonal line approximation according to an embodiment of the present invention.
[図 9]本発明の一実施の形態によるノードと辺の関係の例を示す説明図である。  FIG. 9 is an explanatory drawing showing an example of the relationship between nodes and sides according to an embodiment of the present invention.
[図 10]本発明の一実施の形態による判定例を示す説明図である。  FIG. 10 is an explanatory view showing a determination example according to an embodiment of the present invention.
発明を実施するための最良の形態  BEST MODE FOR CARRYING OUT THE INVENTION
[0012] 以下、本発明の一実施の形態を、添付図面を参照して説明する。 Hereinafter, an embodiment of the present invention will be described with reference to the attached drawings.
本実施の形態の例においては、オンライン手書き文字認識を行うシステムに適用し てあり、図 1には、各処理部をハードウェア構成とした場合の構成例を示してある。な お、図 1に示すように各処理部を共通の演算処理部で実行する構成としてもよぐ或 いはパーソナルコンピュータ装置などの汎用の演算処理装置に、本例の手書き文字 認識をプログラム化したものを実装させて、同様の手書き文字認識が行われるように してもよレ、。 The example of the present embodiment is applied to a system that performs online handwritten character recognition, and FIG. 1 shows an example of a configuration in which each processing unit has a hardware configuration. As shown in FIG. 1, the handwritten character recognition of this example is programmed in a general-purpose arithmetic processing unit such as a personal computer device or a general-purpose arithmetic processing unit in which each processing unit is executed by a common arithmetic processing unit. To implement the same handwriting recognition. Well, yes.
[0013] 以下の説明においては、本例での手書き文字認織に必要な概念を、次の表 1に示 す用語で定義している。  In the following description, the concepts required for handwriting recognition in this example are defined in terms as shown in Table 1 below.
[0014] [表 1] 用語の定義 . [0014] [Table 1] Definition of terms.
Figure imgf000006_0001
図 1に示した構成について説明すると、紙 1の上で、ペン 2で文字を書くことで、その 紙 1の上の運筆(筆跡) laをペン 2側で検出する。その運筆 laの検出は、例えばペン 4/1
Figure imgf000006_0001
To describe the configuration shown in FIG. 1, writing on the paper 1 with the pen 2 detects the pen stroke (la) on the paper 1 on the pen 2 side. The pen is detected by, for example, a pen 4/1
2に内蔵されたカメラにより行う。或いは、加速度センサなど力 ペン 2自体の動きを 検出するようにしてもよい。さらに、ペン側で検出するのではなぐ紙 1の側を何ら力 パネルで構成して、電気的に筆跡を検出できる構成としてもよレヽ。いずれにしても、 本例の場合にはオンライン手書き文字認識であるので、時間の経過による筆跡の衮 化を判断できる構成としてある。 It does with the camera built in 2. Alternatively, the movement of the force pen 2 itself such as an acceleration sensor may be detected. Furthermore, even if the side of the paper 1 that is not detected by the pen is configured with any force panel, it is possible to detect the handwriting electrically. In any case, in the case of this example, since it is online handwriting character recognition, it is configured to be able to judge the deterioration of the handwriting over time.
[0016] これらの処理で検出された筆跡のデータは、入力'処理部 3に送られ、文字情報を 出する'だ :めの入力 fellが行おれる。入力された' ータは、以下、折れ線近似部 4、 徵抽出部 5、識別部 6、識別豬果出力部 7に送られて、それぞれの処理部で対応 処理が行われて、最終的に、識別結果出力部 7で、識別された文字の表示や、識另 I された文字コードの'出力などの出力処理が行われる。識別された文字コード.に基づ 'いた、識別文字の衾示或いは印刷を行うようにしてもよい。 [0016] The handwriting data detected by these processes are sent to the input processing unit 3 to output character information : a second input is performed. The input data is sent to the polygonal line approximation unit 4, the eyebrow extraction unit 5, the identification unit 6, and the identification result output unit 7, and corresponding processing is performed in each of the processing units. The identification result output unit 7 performs output processing such as display of the identified characters and 'output of the character code of the identified I. Marking or printing of the identification character may be performed based on the identified character code.
[0017] 図 2のフローチャートは、本例の文字認識の全体の処理例を示したものである。以  The flow chart of FIG. 2 shows an example of the entire processing of the character recognition of this embodiment. Below
差替え用 紙 (規則 26) 下、図 2に従って説明すると、入力処理部 3から入力された文字/図形パターンは( ステップ S 11)、折れ線近似部 4で折れ線近似される(ステップ S12)。この近似から、 入力パターンは、各折れ線をベクトルと見た時の、長さ、方向角、隣接する折れ線の 方向角の差を要素とするベクトルとして表現される(ステップ S13)。また、方向角の差 のベクトル表現から、同符号の角の差の和を求め、符号を含め一つの要素として、こ こで巻き角と名づけたベクトル表現が求められる。次に、特徴抽出部 5で折れ線近似 表現から、状況に応じて、特徴が抽出され (ステップ S14)、その特徴の抽出結果に 基づく 1次元の線形グラフ表現を与え、(ステップ S15)、そのグラフ表現と始点、終点 を特に規定しない開いたマスク構成によるテンプレートとのマッチングにより、文字認 識が行われ (ステップ S 16)、文字認識結果が出力される(ステップ S 17)。 Replacement paper (Rule 26) As described below with reference to FIG. 2, the character / graphic pattern input from the input processing unit 3 (step S11) is subjected to polygonal line approximation by the polygonal line approximation unit 4 (step S12). From this approximation, the input pattern is expressed as a vector having the length, the direction angle, and the difference in the direction angle of the adjacent line as an element when viewing each line as a vector (step S13). Also, from the vector expression of the difference of the direction angle, the sum of the difference of the same sign is obtained, and as one element including the code, the vector expression named as the winding angle is obtained here. Next, features are extracted according to the situation from the polygonal line approximation representation in the feature extraction unit 5 (step S14), and a one-dimensional linear graph representation based on the extraction results of the features is given (step S15). Character recognition is performed by matching the expression with a template based on an open mask configuration that does not particularly define the start point and the end point (step S16), and the character recognition result is output (step S17).
[0018] ここで、ステップ S 16での文字認識処理の詳細の例を、図 3のフローチャートを参照 して説明する。まず、入力されたグラフ表現の各ノード形状の並びが一致していること がチェックされる(ステップ S21)。そして、ノードの点としての属性の一致がチェックさ れ、(ステップ S22)、続いて、ノード間の辺(エッジ)の属性の一致がチェックされる( ステップ S23)。最後に、交点の有無、ノード間距離関係などのその他の属性の一致 がチェックされて (ステップ S24)、識別される。これらのチェックの結果で全てが一致 している場合に、認識結果が OKとなり(ステップ S25)、 1つでも一致しない場合に排 除される(ステップ S26)。  Here, an example of the details of the character recognition process in step S 16 will be described with reference to the flowchart of FIG. 3. First, it is checked that the arrangement of each node shape of the input graph expression matches (step S21). Then, the matching of the attribute as the node point is checked (step S22), and then the matching of the attribute of the edge between the nodes is checked (step S23). Finally, the matching of other attributes such as the presence or absence of the intersection point and the distance relationship between nodes is checked (step S24) and identified. If all the results of these checks match, the recognition result is OK (step S25), and if there is not even one match, it is excluded (step S26).
[0019] 本実施の形態における認識手法においては、基本的には上に述べた構造解析的 手法に属し、しかし、今までの問題点を克服し、柔軟な構造解析的手法の基礎を与 えるものである。それ故、シンボル化の問題を回避し、アナログ的に構造を表現し、標 準との柔軟かつ単純なマッチングを行うものである。また構造解析を行うので、必然 的に対象を適格に記述することが出来て、原因結果の対応が人間の視覚からみて 明確である。従って、文字などの対象の形の評価が出来て、正しい拒否範囲を設定 でき、より人間に近い能力をもつ認識システムを提供することができる。  The recognition method in the present embodiment basically belongs to the structural analysis method described above, but it overcomes the problems so far and provides a basis for a flexible structural analysis method. It is a thing. Therefore, it avoids the problem of symbolization, expresses the structure in an analog manner, and performs flexible and simple matching with the standard. In addition, structural analysis is performed, so that the subject can inevitably be described properly, and the correspondence between cause and effect is clear from the human vision. Therefore, it is possible to evaluate the shape of objects such as letters, set the correct rejection range, and provide a recognition system with more human-like ability.
[0020] ここまでは、本願の発明者が先に提案した構造解析的手法と基本的に同じである。  Up to this point is basically the same as the structural analysis method previously proposed by the inventor of the present application.
加えて、連続した文字列(例えば筆記体など)にお!/、て効率的な文字認識を行う上で も、本方式は非常に便利にできている。本発明では、巻き角により、時間軸上に、自 然な切り出し候補点が陰に陽に整列していることに着目し、入力された文字列に対し 、その候補点(ノード)とノード間の辺(エッジ)の属性を用いた 1次元の線形グラフ表 現を与え、認識と同時に切り出し(「切り出し認識」)を行い、前処理としての切り出し を行うことなぐ孤立して書かれている文字の認識法と同じ方法で認識を可能とするも のである。 In addition, this method is very convenient for efficient character recognition in continuous character strings (eg cursive letters etc.). In the present invention, the winding angle makes it possible to Focusing on the fact that the extraction candidate points are positively aligned in the negative, the input character string is one-dimensionally linear using the attributes of the candidate points (nodes) and the edge between the nodes (edges) A graph representation is given, segmentation is performed simultaneously with recognition ("segmentation recognition"), and segmentation can be performed in the same way as the recognition method for isolated characters written before preprocessing. is there.
[0021] 本発明の効果を説明する上で、連続した文字列の認識から入ることがわ力、りやすい ので、図 4にある具体的な数字列の認識法の説明から入る。これは「888」である。こ れらの各文字は繋がっている。したがって、従来の文字認識では、これらの文字を、 切り出して、それぞれ認識システムに送る必要があった。この処理は「切り出し前処理 」と呼ばれる。しかし、本方式では、この「切り出し前処理」が不要であり、認識と切り出 しを同時に行う。これを以下に説明する。  [0021] In explaining the effects of the present invention, it is easy to enter from recognition of continuous character strings, so we will start with the explanation of the specific digit string recognition method shown in FIG. This is "888". These letters are connected. Therefore, in conventional character recognition, these characters had to be cut out and sent to the recognition system. This process is called "pre-cut process". However, in this method, this “pre-cutout process” is unnecessary, and recognition and cut-out are performed simultaneously. This is explained below.
[0022] 図 4のイメージは、左から書かれたものである力 これらの基本の表現は、連結する 各折れ線の長さ、折れ線角、隣り合う折れ線の角から構成される外角、そして同一符 号の外角を足し合わせた、巻き角を要素とする、長さ系列、角系列、外角系列、巻き 角系列である。  [0022] The image in Fig. 4 is a force written from the left. The basic representation of these is the length of each polygonal line to be connected, the polygonal line angle, the outer corner composed of the adjacent polygonal line corners, and the same sign. The length series, the angle series, the outer angle series, and the winding angle series that have the winding angle as an element, which is the sum of the outside angles of the issue.
[0023] そこで、これらを素材として、「切り出し認識」に適した一次元のグラフ表現に変換す る。ここでの、グラフは最も簡単な線形グラフである。図 4の最初の「8」の部分を敢ぇ て切り出した図 5を実例として、以下に記す。ここでは、始点、終点、 90度の整数倍の 巻き角を与える点(±n X 90度長点)、巻き角の符号が変化する点を「ノード」とし、ノ ードの属性としてその点の外角(△)と、そのノードが属する巻き角領域の巻き角値( Θ )、またノード間の辺(エッジ)の属性として、ノード間にある折れ線群における外角 の絶対値の最大値 (最大 | Δ |)を用い、グラフ表現を与える。  Therefore, these are used as raw materials and converted into a one-dimensional graph representation suitable for “cutout recognition”. Here, the graph is the simplest linear graph. The following is an example of Fig.5, which is the first "8" part of Fig.4. Here, a start point, an end point, a point giving a winding angle of an integral multiple of 90 degrees (± n × 90 degrees long point), a point where the sign of the winding angle changes is a “node”, and that point is used as a node attribute. Maximum value of the absolute value of the outer angle in the group of broken lines between the nodes as an attribute of the outer angle (△) of the area, the winding angle value (Θ) of the winding angle area to which the node belongs, and the side (edge) between nodes Use | Δ |) to give a graph representation.
[0024] ここで、く 0〉 [ s ]、く1〉 [ 一 ]、く2〉 [一 + ]、く3〉 [ + ]、く4〉 [ + ]、く5〉 [ +  [0024] Here, 0 0 [s] 1 1 [1] [2] [1 +] 3 [[]] [4] [+], 5 [[5] [+]
]、 · · ·〈28〉 [ e ] はこのグラフのノード番号と、それらの、特性を表現したシンボル である。く 0〉 [ s ] から始まり、く 28〉 [ e ]で終わる。 [ s ]、 [ e ]はそれぞれ、ス タート、エンドのシンボルである。く 1〉 [ - ]は、外角(Δ )系列で最初の巻き角— 90 度点である。巻き角は離散的である力 線形補間しているので、一般に α度長点とし て、アナログ的に求めることが出来る。く 1〉 [ ― ]の「一」は左周りという、巻き角の符 号を表現している。 ], ················································································································································································ · It starts from く 0 [[s] and ends at く 28 [[e]. [s] and [e] are start and end symbols, respectively. 1) [-] is the first winding angle-90 degree point in the outer angle (Δ) series. Since the winding angle is discrete linear interpolation, it can generally be obtained in analog form as an α degree long point. 1 一 [-] "one" is a left angle mark Express the issue.
<図 5のグラフ表現〉 Graph representation of Figure 5
く 0〉 [ s ] 0 ( 326, 121 ), Θ = -172.83 0] [s] 0 (326, 121), Θ = -172.83
I  I
I 折れ線数 (0〜4) = 4,長さ = 0.07,最大 I Δ I = |-24.52|  I number of polylines (0 to 4) = 4, length = 0.07, maximum I Δ I = | -24.52 |
I  I
く 1〉 [ 一 ] 4 ( 156, 121 ), Θ = -172.83, Δ = -33.26 1 [1 4 4 (156, 121), Θ = -172.83, Δ = -33.26
I  I
I 折れ線数 (4〜6) = 2,長さ = 0.09,最大 I Δ I = |-72.64|  I number of polylines (4 to 6) = 2, length = 0.09, maximum I Δ I = | -72.64 |
I I
く 2〉 [一 + ] 6 ( 347 , 279 ), Θ (-) = -172.83, Θ (+) = 304.42, Δ (-) = -72.64 Δ (+) = 21.27 2) [1 +] 6 (347, 279), Θ (-) = -172.83, Θ (+) = 304.42, Δ (-) = -72.64 Δ (+) = 21.27
I  I
I 折れ線数 (6〜8) = 2,長さ = 0.02,最大 | Δ | = |45.06|  I number of polylines (6 to 8) = 2, length = 0.02, maximum | Δ | = | 45.06 |
I I
く 3〉 [ + ] 8 ( 365, 346 ), Θ = 304.42, Δ = 49.31 3> [+] 8 (365, 346), Θ = 304.42, Δ = 49.31
I  I
I 折れ線数 (8〜12) = 4,長さ = 0.07,最大 I Δ | = |30.07|  I number of polylines (8 to 12) = 4, length = 0.07, maximum I Δ | = | 30.07 |
I  I
く 4〉 [ + ] 12 ( 195, 329 ), Θ = 304.42, Δ = 20.39 4> [+] 12 (195, 329), Θ = 304.42, Δ = 20.39
I  I
I 折れ線数 (12〜15) = 3,長さ = 0.03,最大 I Δ | = |35.79|  I number of polylines (12 to 15) = 3, length = 0.03, maximum I Δ | = | 35.79 |
I  I
く 5〉 [ + ] 15 ( 194, 258 ), Θ = 304.42, Δ = 38.45 5) [+] 15 (194, 258), Θ = 304.42, Δ = 38.45
I  I
I 折れ線数 (15〜16) = 1,長さ = 0.05  I number of polylines (15 to 16) = 1, length = 0.05
I  I
く 6〉 [ +— ] 16 ( 330, 205 ), Θ (―) = 304.42, Θ (+) =—56.33, Δ (-) = 38.45, Δ (+) -56.33 6) [+-] 16 (330, 205), Θ (−) = 304.42, Θ (+) = − 56.33, Δ (−) = 38.45, Δ (+) -56.33
<28>[ e ] <28> [e]
[0026] く 1〉 [ - ]、すなわち「一90度点」は、 5番目の頂点を少し越えたところにある。そ こで、このノードの点の属性として、これが属する巻き角系列(Θ = -172.83)とこれに 最も近い時間的に前の頂点(この場合 5番目の頂点)の外角(Δ)値、 Δ =-33.26を 与える。また、巻き角の境界を示すく 2〉 [ -+ ]が重要である。これはすなわち、巻き 角(一)から巻き角(+ )へ変化した点である。巻き角の境界は点ではなぐ 1個の折れ 線を共有する。その共有する折れ線の、時間軸のプラス方向にある先端をノードとし ている。この点の属性は、 Θ (-) = -172.83, Θ(+) = 304.42, Δ (-) = -72.64, Δ (+) = 2 1.27である。すなわち、時間的に変化する前の巻き角 Θ (-) = -172.83と、時間的に変 化後の巻き角 Θ(+) = 304.42、それと共通の折れ線の両端の外角(Δ)値、 Δ (-) = _7 2.64, Δ (+) = 21.27である。  [0026] [1] [-], that is, the "one-90-degree point" is slightly beyond the fifth vertex. Therefore, as an attribute of the point of this node, the outer angle (Δ) value of the winding angle series (Θ = -172.83) to which it belongs and the closest previous vertex (in this case, the fifth vertex), Δ Give = -33.26. In addition, 2> [-+] is important to indicate the boundaries of winding angles. This is the point where the winding angle (one) changes to the winding angle (+). Boundaries of winding angles share one broken line at points. The tip of the shared polygonal line in the positive direction of the time axis is taken as a node. The attributes of this point are Θ (−) = − 172.83, Θ (+) = 304.42, Δ (−) = − 72.64, Δ (+) = 2.27. That is, the winding angle Θ (−) = − 172.83 before temporal change, the winding angle Θ (+) = 304.42 after temporal change, the external angle (Δ) value at both ends of the common polygonal line, Δ (−) = _7 2.64, Δ (+) = 21.27.
[0027] く 2〉 [ー + ]以降、折れ線系列は巻き角(+ )の領域に入る。この最初がく 3〉 [ + ] で、これは 90度長点である。属性は Θ = 304.42, Δ = 49.31である。く 4〉 [ + ]は 1 80度長点である。く 5〉 [ + ]は 270度長点である。すなわち、このノードく 5〉で、 巻き角は 270を越えて巻いたことになる。次はく 6〉 [ +- ]で、これは巻き角が +から 一へ変化する点である。  [0027] From 2 [[-+] onwards, the polygonal line series enters the area of winding angle (+). This is a 90 ° long point at the beginning 3> [+]. The attributes are Θ = 304.42, Δ = 49.31.く 4 [[+] is a long point of 180 degrees. 5) [+] is a 270 degree long point. That is, at this node 5>, the winding angle is over 270 degrees. Next is 6> [+-], which is the point where the winding angle changes from + to one.
[0028] 次にこれらのノードを結ぶ辺の属性として、折れ線数 (0〜4) = 4,長さ = 0.07,最大 |  Next, as an attribute of an edge connecting these nodes, the number of broken lines (0 to 4) = 4, length = 0.07, maximum |
Δ| = |-24·52|の如ぐそれらのノード間の、折れ線数、長さ、 |Δ|の最大値を取る。これ らの特性は、回転不変である。  Take the maximum value of the number of broken lines, length, | Δ | between these nodes according to Δ | = | −24 · 52 |. These properties are rotation invariant.
[0029] 以上が入力のグラフ表現であり、これに対し、マスク構成によるテンプレートとのマツ チングを行う。 「8」の場合のマスク構成の一例を以下に示す。  The above is the graphical representation of the input, and on the other hand, matching with the template by the mask configuration is performed. An example of the mask configuration in the case of "8" is shown below.
[0030] <「8」のマスクの一例〉  [0030] <Example of "8" Mask>
条件 1: * = 一 + (最初の鍵となるノード)  Condition 1: * = one + (first key node)
条件 2: -200 < Θ(*—)< 100 & 200< Θ(*+) <360 (注:「 Θ ( * -)」は最初の「 」巻き角、「 Θ ( * +)」は後継する「 +」巻き角 ) 条件 3: -100く Δ(*_)く -20 & 10く Δ(*+)く 100 Condition 2: -200 <Θ (*-) <100 & 200 <Θ (* +) <360 (Note: "Θ (*-)" is the first "" winding angle, "Θ (* +)" is the succeeding "+" winding angle) Condition 3: -100 mm Δ (* _) K -20 & 10 Δ (* +) 100
(注:「 Δ (*-)」は「一」巻き角の境界の外角値、「 Δ (*+)」は後継する「 +」巻き角の境 界の外角値。すなわち、境界線分の両端にある 側、 +側の外角値。 ) 条件 4: (*-, *+), Cross: nxm:ne^-)~(*+)  (Note: “Δ (* −)” is the outer angle value of the “one” winding angle boundary, “Δ (* +)” is the outer angle value of the succeeding “+” winding angle boundary, ie, the boundary line segment Side on the both sides, + outside angle value) Condition 4: (*-, * +), Cross: nxm: ne ^-) ~ (* +)
(注:境界線分の両端間に交差点 Crossが存在)  (Note: There is an intersection Cross between the two ends of the boundary line)
条件5: *+1 = +— (2番目の鍵となるノード)  Condition 5: * + 1 = +-(second key node)
条件 6: 200く Θ (*+1+)く 360 & -200く Θ )く- 10  Condition 6: 200 Θ (* + 1 +) 360 360 &-200 Θ)-10
(注:「/Θ(*+1+)」は +巻き角、「Θ(*+1-)」は次の一巻き角  (Note: "/ Θ (* + 1 +)" is + winding angle, "Θ (* + 1-)" is next winding angle
条件 7: 10く Δ(*+1 + )く 100 & -100く Δ(*+1—)く -20  Condition 7: 10 times Δ (* + 1 +) 100 +-100 Δ (* + 1-)-20
(注:「 Δ(*+1 + )」は +巻き角の境界の外角値、「Δ(*+1—)」は次の一巻き角の境界 の外角値。 )  (Note: “Δ (* + 1 +)” is the external angle value of the + wrap angle boundary, and “Δ (* + 1−)” is the external angle value of the next wrap angle boundary.)
条件 8: (*+1_, *+1+), Cross: nxm:mE(*+l + )〜(*+l— )  Condition 8: (* + 1_, * + 1 +), Cross: nxm: mE (* + l +) to (* + l—)
(注:境界線分の両端間に Crossが存在) 文字 [8] =条件 1&条件 2 &条件 3 &条件 4 &条件 5 &条件 6 &条件 7 &条件 8であ  (Note: Cross exists between both ends of boundary line) Character [8] = Condition 1 & Condition 2 & Condition 3 & Condition 4 & Condition 5 & Condition 6 & Condition 7 & Condition 8
[0031] このマスクは非常に簡単で、ノードは 2個(「* +」と「*+1 = +—)で、それぞ れ、—から +への境界、 +から—への境界だけである。「±nX90度長点」は使用し ていない。ここで、「* +」における「*」の意味は、このノードがこのマスクの鍵 になることを示し、巻き角力 S「マイナス」から「プラス」に変化するノードが必ず存在する こと示している。 「*+1 = +—」は次に必要なノードとして、今度は反対に巻き角 が「プラス」から「マイナス」に変化するノードをあげて!/、る。 [0031] This mask is very simple, and there are two nodes ("* +" and "* + 1 = +-"), respectively, and the boundaries from-to +, + to-only. Yes, “± n x 90 degree long point” is not used. Here, the meaning of "*" in "* +" indicates that this node is the key of this mask, and indicates that there is always a node that changes from winding force S "minus" to "plus". . "* + 1 = +-" is the next required node, and this time, raise the node whose winding angle changes from "plus" to "minus" this time!
[0032] 加えて、条件 2、条件 3、で最初の鍵となるノードの、また条件 6、 条件 7で 2番目の 鍵となるノードの属性として、巻き角の値、境界線分の両端での外角 Δに詳しい条件 を与え、構造を締めている。また重要な、辺(エッジ)特性として、交点情報を使って いる。それが、条件 4と条件 8である。 me(*+l + )〜(*+l— )は、次の巻き角変化点に おける境界線分間に交差点が存在することを要求している。交差点を Cross: nxmと 表現している。これら n、 mは交差折れ線の番号であり、 n e (* -)〜 (*+)はこの交差折 れ線が、巻き角境界線分と一致することを示している。勿論交差点は回転不変特徴 である。 In addition, as the attributes of the first key node in condition 2, condition 3, and as the attributes of the second key node in condition 6, condition 7, the value of the winding angle, at both ends of the boundary line segment Detailed conditions are given to the outside angle Δ of, and the structure is tightened. Also, intersection point information is used as an important edge property. That is Condition 4 and Condition 8. me (* + l +) to (* + l −) require that an intersection be present between the boundary lines at the next winding angle change point. Cross the intersection with Cross: nxm expressing. The n and m are the numbers of intersecting polygonal lines, and ne (*-) to (* +) indicate that the intersecting polygonal lines coincide with the winding angle boundary line segments. Of course, the intersection is a rotation invariant feature.
[0033] 以上、非常に簡単なマスクである。これを入力グラフ表現につき合わせて、この型が はまるノード範囲を探し、存在すれば、そこに「8」があるとするのである。注意しなけ ればならないのは、上のマスクでは [ s ]、 [ e ]ノードを使用していない。したがって 、両端は開放された形になっている。このため、ともかぐ時間の流れの中で、このマ スクに当てはまるところ(範囲)があれば、そこに、「8」があるということになり、まったく 、場所によらず、「切り出し前処理」が不要である。  [0033] The above is a very simple mask. This is matched with the input graph representation, and the node range in which this type fits is searched, and if it exists, it is assumed that "8" exists there. It should be noted that the above mask does not use the [s] and [e] nodes. Thus, both ends are open. For this reason, in the flow of time, if there is a place (range) that applies to this mask, it means that there is "8", and "pre-cutout processing" regardless of the place at all. Is unnecessary.
[0034] 実際、このマスクで、図 4では 3個の「8」が認識される。また、図 6では、全く傾きが 異なる、連結した「8」が、それぞれ認識されている。また、図 7では、大きさが著しく異 なる、 2個の「8」が連結している力 これらも正しくそれぞれ、「8」と認識される。当然 のことながら、連結した文字によらず、単独の文字についても、同様のマスクと入力グ ラフ表現をつきあわせて、認識することが可能である。  In fact, in this mask, three “8” s are recognized in this mask. Also, in FIG. 6, connected "8" s, which have completely different inclinations, are respectively recognized. Also, in Fig. 7, the force at which two "8" s are connected, which are significantly different in size, is also correctly recognized as "8". As a matter of course, it is possible to recognize a single character as well by combining similar masks and input graphic expressions, regardless of connected characters.
[0035] なお、ここで、若干の注意点を述べる。それは、上に述べたマスクは全ぐ距離また は長さと言う特性を含んでいない。それ故、連結の長さに全ぐ無関係に、どこでも、 形がはまり、切り出し認識が出来たのである。しかし、反面、例えば、図 7の場合のよう に 2個の「8」の大きさの情報は失われ、人間が見るような、大きさの差が分からないの ではと言う疑問がのこる。しかし、これは実は、簡単に求めることができるのである。  Here, some points to note will be described. That is, the mask mentioned above does not include the characteristic of full distance or length. Therefore, regardless of the length of the connection, the shape was fitted everywhere, and the cut-out recognition was made. However, on the other hand, for example, as in the case of Fig. 7, the information of two "8" sizes is lost, and there is a doubt that the difference in size can not be understood as seen by humans. However, this can actually be easily determined.
[0036] この入力表現は各ノードの位置座標を持つ。したがって、例えば「8」では、マスクの 鍵である「*」ノードから 180度戻ったノード、図 7では、これは殆ど < s〉ノードになる 1S それと、 * + 1ノードから 180度進んだノードの位置間のユークリッド距離を求め  This input representation has position coordinates of each node. Therefore, for example, in the case of "8", the node returned 180 degrees from the "*" node which is the mask key, and in FIG. 7, this is almost an <s> node. Find the Euclidean distance between the positions of
[0037] 具体的には、それは大きな方の「8」では(286-38) 2 + (562-449) 2の平方根 = 273 、小さな「8」では(174-55) 2 + (:625-611) 2の平方根 = 120で両者の比は約 0.43とな り、直感と合った結果が出る。若干複雑ではあるが図 2の場合も両者の「8」の長軸と 短軸のベクトルを求めることが出来て、傾き情報も知ることが出来る。このように、ノー ドの座標値は、長さ、角、方向などの、幾何的計量を求めるのに、有効に利用可能で ある。 [0037] Specifically, it is the square root of (286-38) 2 + (562-449) 2 for large "8" = 273 for small "8" (174-55) 2 + (: 625- 611) The square root of 2 = 120, and the ratio of the two is about 0.43. Although it is somewhat complicated, in the case of FIG. 2 also, the vectors of the major axis and minor axis of both “8” can be determined, and inclination information can also be known. Thus, the coordinate values of the node can be effectively used to obtain geometric metrics such as length, angle, direction, etc. is there.
[0038] <±nX90度ノードの利用〉  <Use of ± nX 90 ° Node>
上述の「8」のマスク構成の例では、本発明の本質を説明するために、 ±nX90度ノ ードを条件に加えない、簡単な構成の場合について説明した力 ここでは ±nX90 度ノードの有効性について、例をあげて説明する。これが特に有効なのは丸い形をも つもので、その典型は「円」である。  In the example of the mask configuration of “8” described above, the force described in the case of a simple configuration in which the ± n × 90 degree node is not added to the condition is used here to explain the essence of the present invention. The effectiveness will be described using an example. This is particularly effective when it is round, the typical being a circle.
[0039] これは実際にも「〇」として、良く使用される記号である。この円の形を、その本質は 保持し、かっかなりの変形に耐える、マスクを作るのが課題である。 [0039] This is actually a symbol that is often used as "o". The challenge is to make a mask that holds the shape of this circle and that is resistant to considerable deformation.
このマスクは以下のように作られる。  This mask is made as follows.
<「〇」マスクの一例〉  <Example of "O" mask>
条件 1: * = +90  Condition 1: * = +90
条件 2: 350く Θ(*)く 600 & 0く Δ(*)く 95  Condition 2: 350 Θ (*) 600 600 & 0 Δ Δ (*) 95 95
条件 3:(*.*+1); 0.1く長さぐ 0.35 & 0≤ |Δ|く 95  Condition 3: (*. * + 1); 0.1 length 0.35 & 0≤ | Δ |
条件 4: *+1 = +180  Condition 4: * + 1 = +180
条件 5: Θ(*+1)= Θ(*)&0く Δ(*)く 95  Condition 5: Θ (* + 1) = Θ (*) & 0 く Δ (*) 95 95
条件 6: (*+1·*+2); 0·1<長さぐ 0.35 & 0≤ |Δ|く 95  Condition 6: (* + 1 · * + 2); 0 · 1 <length 0.35 & 0≤ | Δ |
条件 7: *+2 = +270  Condition 7: * +2 = +270
条件 8: Θ(*+2)= Θ(*) & 0く Δ(*)く 95  Condition 8: Θ (* + 2) = Θ (*) & 0 く Δ (*) く 95
条件 9: (*+2·*+3); 0·1<長さぐ 0.35 & 0≤ |Δ| < 95  Condition 9: (* + 2 · * + 3); 0 · 1 <Length 0.35 & 0 ≤ | Δ | <95
条件 10: *+2 = +360  Condition 10: * +2 = +360
条件 11: Θ(*+3)= Θ(*)&0く Δ(*)く 95  Condition 11: Θ (* + 3) = Θ (*) & 0 Δ Δ (*) 95 95
条件 12:(*+3.*+4); 0·0<長さぐ 0.35 & 0≤ | Δ |く 95 記号「〇」=条件 1&条件 2 &条件 3 &条件 4 &条件 5 &条件 6 &条件 7 &条件 8 & 条件 9 &条件 10&条件 11 &条件 12  Condition 12: (* + 3. * 4); 0 · 0 <Length 0.35 & 0 ≤ | Δ | 95 95 Sign "〇" = condition 1 & condition 2 & condition 3 & condition 4 & condition 5 & condition 6 & condition Condition 7 & Condition 8 & Condition 9 & Condition 10 & Condition 11 & Condition 12
このマスクのノードは 4個(「* = + 90」、「*+1 =+180」、「*+2 = + 270」、「*+3= + 3 60」)で、 ±ηΧ90度ノードの存在(本件の場合は + ηΧ90度ノード)力 S、この形をお おまかに規定している。すなわち、この場合、 nは 1〜4で巻き角が 90度単位で 360 度まで巻くことを要請している。一方、ノードの属性としては、 +n X 90度長点近傍で は、急激な角度の増大が 0く Δ (*)く 95で抑えられている。また辺の属性として、ノード 間の長さが、ある決められた範囲にあることを要請している。これが、「0.1く長さく 0.3 5」である。また、その間に角の急激な増大がないようにも規定している。これが、「0く | Δ |く 95」である。 There are 4 nodes ("* = + 90", "* + 1 = + 180", "* + 2 = + 270", "* + 3 = + 3 60") of this mask, ± Χ Χ 90 degree node Existence (in this case + Χ 度 90 ° node) Force S, roughly defines this form. That is, in this case, n is 1 to 4 and the winding angle is 360 in units of 90 degrees. It is requested to wind up to a degree. On the other hand, as for the attribute of the node, the sharp increase of the angle is suppressed by 0 and Δ (*) and 95 in the vicinity of + n X 90 degree long point. Also, as an attribute of the edge, it is required that the length between nodes be within a certain range. This is "0.1 long and 0.35". It also stipulates that there will be no sharp increase in the angle between them. This is "0 rd | Δ | s 95".
[0041] このマスクは、十分、対ノイズに対し頑健な構成になって!/、て、両端は開いて!/、る。  [0041] This mask is sufficiently configured to be resistant to noise! /, And both ends are open //.
従って、図 8に見られるように、外部にかなりのノイズがある場合にも、「〇」が認識さ れる。先にも述べた如ぐ長さを相対的に、例えば 180度長点と 360度長点間のユー タリッド距離で、各長さを正規化すれば、これは、任意個の連結文字系列にも適用可 能である。  Therefore, as shown in FIG. 8, “〇” is recognized even when there is considerable external noise. If each length is normalized relative to the length as described above, for example, with the distance between the 180 ° long point and the 360 ° long point, this can be an arbitrary number of concatenated character sequences. Is also applicable.
[0042] <凹凸ノード表現〉  <Expression of unevenness node>
ここでは、通常の回転不変を必要としな!/、文字の認識の場合のグラフ表現につ!/、 て、述べる。このような場合には、 ±η Χ 90度ノードを使うよりは、水平、垂直の直交 座標系に基づぐ凹凸をノードとした方が直感的で分かり易い。イメージとしての図 9 の説明図をご覧頂きたい。ここで、〇がノードであり、矢印のついた→が辺である。  Here, we do not need the usual rotation invariance! /, And the graph representation for character recognition! In such a case, it is more intuitive and easier to use as a node the unevenness based on the horizontal and vertical orthogonal coordinate system rather than using the ± Χ Χ 90 degree node. Please see the illustration of Figure 9 as an image. Here, 〇 is a node, and an arrow → is a side.
[0043] ここで、上下の凹凸を u、 ΓΊと示し、左右から見ての凹凸をコ、 cと示す。このほか に実は、上下の凹凸と左右の凹凸が同時に存在する。例えば右上にとんがつている ような場合には、 nとつが重なった n +コと示す。  Here, the upper and lower irregularities are indicated by u and ΓΊ, and the irregularities viewed from the left and right are indicated by 、 and c. Besides, in fact, the upper and lower asperities and the left and right asperities simultaneously exist. For example, in the case where it is in the upper right, it is indicated as n + co where n and one overlap.
[0044] ノードの属性は先に述べた如くである力 辺の属性として、ノード間の折れ線群の 平均方向角が加わる。これは、凹凸ノード表現では非常に重要な特徴となる。その他 、曲がりの程度を示す折れ線群の方向分散がある。つぎに、重要なのは、ノード間の 距離比である。例えば図 9で言えば、始点は、終点よりある程度上になければならな い。これらは、機械的に計算すると、かなりの数になるが、実際には要点だけ抑えれ ば良ぐたいした数にはならない。ここで、例えばノードく 5〉の Y軸値から始点 < s〉 ノード Y軸値を引き、文字全体の長さの比で割った値を求める、といった具合である。  [0044] The attributes of the nodes are as described above. As the attributes of the force side, the average direction angle of the polygonal line group between the nodes is added. This is a very important feature in concavo-convex node expression. Others There is a directional variance of the polyline group indicating the degree of bending. Next important is the distance ratio between nodes. For example, referring to Figure 9, the start point must be somewhat above the end point. These can be quite large when calculated mechanically, but in practice they can not be good numbers if only the point is kept down. Here, for example, the start point <s> node Y axis value is subtracted from the Y axis value of node 5>, and the value obtained by dividing by the ratio of the length of the whole character is obtained.
[0045] 以下、この凹凸グラフ表現による「8」のマスクの一例を示す。  Hereinafter, an example of the mask of “8” by this concavo-convex graph expression will be shown.
<「8」のマスクの一 ί列〉  <One row of masks of "8">
条件 1: *-1 = s/any 条件 2: (*, *-l), 110く加重平均角く 170 Condition 1: * -1 = s / any Condition 2: (*, * -l), 110 weighted average angle 170
条件 3: * = n Condition 3: * = n
条件 4: -420く Θ(*)く -160 Condition 4: -420 degrees Θ (*)--160
条件 5: -110く Δ(*)く -10 Condition 5: -110 square Δ (*) square -10
条件 6: *+1 =匚 Condition 6: * + 1 = 匚
条件 7: Θ(*+1) = Θ(*) Condition 7: Θ (* + 1) = Θ (*)
条件 8: -110く Δ(*+1)く -10 Condition 8: -110 ° Δ (* + 1) ° -10
条件 9: (*+1, *+2), -80く加重平均角く- 30 Condition 9: (* + 1, * + 2), -80 weighted average angle-30
条件 10: (*+1, *+2),長さ > 0.15 Condition 10: (* + 1, * + 2), length> 0.15
条件 11: *+2 =コ Condition 11: * + 2 =
条件 12: 250く Θ(*+2)く 500 Condition 12: 250 Θ (* + 2) 500 500
条件 13: 20く Δ(*+2)く 100 Condition 13: 20 times Δ (* + 2) 100
条件 14: *+3 = U Condition 14: * + 3 = U
条件 15: Θ(*+3) = Θ(*+2) Condition 15: Θ (* + 3) = Θ (* + 2)
条件 16: 10く Δ(*+3)く 120 Condition 16: 10 times Δ (* + 3) 120
条件 17: *+4 = c Condition 17: * + 4 = c
条件 18: Θ(*+4) = Θ(*+2) Condition 18: Θ (* + 4) = Θ (* + 2)
条件 19: 10く Δ(*+4)く 100 Condition 19: 10 times Δ (* + 4) 100
条件 20: (*+4, *+5), 20く加重平均角く 75 Condition 20: (* + 4, * + 5), 20 weighted average angle 75
^{ψ21: (*+4, *+5), Cross: nxm:ne(*+l)~(*+2), m≡(*+4)—(*+5)  ^ {ψ 21: (* + 4, * + 5), Cross: nxm: ne (* + 1) ~ (* + 2), m ≡ (* + 4)-(* + 5)
条件 22: *+5 = e/any Condition 22: * + 5 = e / any
条件 23: ((*+1, *+2)CrossY値- (*)Y値)/高さ 0·20<Υ<0·70〃交差点は高さでみて ほぼ中心に位置する。符号に注意 文字「 8」 =条件 1 &条件 2 &条件 3 &条件 4 &条件 5 &条件 6 &条件 7 &条件 8 &条 件 9 &条件 10&条件 11&条件 12 &条件 13 &条件 14 &条件 15 &条件 16 &条件 1 7 &条件 18 &条件 19 &条件 20&条件 21&条件 22 &条件 23 Condition 23: ((* + 1, * + 2) Cross Y value-(*) Y value) / Height 0 · 20 <Υ <0 · 70〃 The intersection is located approximately at the center in height. Note the characters Letter '8' = condition 1 & condition 2 & condition 3 & condition 4 & condition 5 & condition 6 & condition 7 & condition 8 & condition 9 & condition 10 & condition 11 & condition 12 & condition 13 & condition 14 & condition 14 & condition 15 & Condition 16 & Condition 1 7 & Condition 18 & Condition 19 & Condition 20 & Condition 21 & Condition 22 & Condition 23
この場合は、特にノード間距離比は必要ない。その代わり、交差点に関する辺の属 性である条件 23が、その役割を演じている。このように、ノード間距離比は実際には、 組み合わせで生じる数よりはるかに少ない。それは、巻き角、外角、折れ線角の制限 で、喑黙の内に制限されているからである。また、上のマスクにおける長さの制限は 左から右に下がる、折れ線の長さを制限するだけである。実はこのために、非常に、 端のノイズに強レ、マスクとなって!/、る。 In this case, no particular inter-node distance ratio is required. Instead, the genus of the side regarding the intersection Condition 23 which is sex plays the role. Thus, the inter-node distance ratio is actually much less than the number produced in combination. That is because it is restricted within silence by the restrictions of wrap angle, outside angle, and line angle. Also, the length restriction in the upper mask only limits the length of the polyline, which goes from left to right. In fact, because of this, very, strong noise on the edge, it becomes a mask!
[0047] なお、これは、連続した文字系列用の切り出し認識マスクではない。それは、この長 さの制限を入れているからである。この点、ノード * = ΓΊとノード *+3 = U間の Y軸値で 、この長さを正規化してやれば、連続した文字系列用の切り出し認識マスクとなる。  Note that this is not a cutout recognition mask for continuous character series. That is because we put a limit on this length. In this respect, if this length is normalized by the Y-axis value between the node * = ΓΊ and the node * + 3 = U, it becomes a cutout recognition mask for a continuous character series.
[0048] この手法により、端に極端なノイズが発生した場合や、極端な変形が生じた場合に おいても、核となる「8」の形を持っていれば、その周りの状況と無関係に、正しい認 識が fiわれる。  [0048] With this method, even when extreme noise occurs at the end or extreme deformation, as long as it has the core "8" shape, it has nothing to do with the surrounding situation. The correct recognition is
[0049] <文字の形の包含関係〉  <Inclusion of Character Forms>
上に述べてきた、切り出しの認識法は、必然的に、複数の答えを出す可能性を持 つ。  The cut-out recognition method described above necessarily has the possibility to give multiple answers.
この、具体的例が図 10に示されている。図 10は「8」を意図して書かれたものである 力 しかし、この図形には「6」が隠されている。言葉を変えれば、元の図形から、「6」 なる文字形が「切り出し認識」されているわけである。したがって、これは必然的な結 果である。  A specific example of this is shown in FIG. Figure 10 is written with the intention of “8”. However, “6” is hidden in this figure. If you change the word, the character form "6" is "cut out and recognized" from the original figure. Therefore, this is an inevitable result.
[0050] そこで、意図された、「8」が正しく出力されるメカニズムが必要である。これは直感的 に言えば、複雑な形を優先すると言うことである力 定量的に正確に述べると、「6」の マスクにマッチしている折れ線の長さと、「8」のマスクにマッチしている折れ線の長さ を比較すれば、当然、後者がより長くなる。  [0050] Therefore, the intended mechanism for correctly outputting "8" is required. Intuitively speaking, this means that the complex shape is to be prioritized. Forces To be stated quantitatively, the length of the line matching the "6" mask and the mask of the "8" match Naturally, the latter becomes longer if the length of the broken line is compared.
[0051] 以下の例から、「8」は「6」より 0. 27の長さ分、長い。したがって、「6」は定量的にも 「8」のイメージに埋没し、「8」は「6」より 0. 27長い整合部分を持ち、「8」と判定される 。このほか、整合ノード数、折れ線数などの整合の測度が考えられる。  From the following example, “8” is longer than “6” by the length of 0.27. Therefore, "6" is quantitatively buried in the image of "8", and "8" has a matching portion longer than "6" by 0.27 and is determined as "8". In addition to this, measures of matching such as the number of matching nodes and the number of broken lines can be considered.
[0052] なお、本発明の手書き文字認識は、実施の形態の説明の最初でも説明したように、 図 1に示した処理構成に限定されるものではなぐ実質的に同様の手書き文字認識 が行われる構成であれば、種々の装置やシステムの構成で、認識処理を行うことが 可能である。例えば、本発明の手書き文字認識をプログラム(ソフトウェア)化して、汎 用のパーソナルコ Note that, as described at the beginning of the description of the embodiment, the handwritten character recognition of the present invention is substantially the same handwritten character recognition that is not limited to the processing configuration shown in FIG. To perform recognition processing with various device and system configurations. It is possible. For example, the handwritten character recognition of the present invention is converted into a program (software) to make a general-purpose personal computer.
ンピュータ装置に実装させるようにしてもよい。手書き文字認識プログラムは、各種記 憶媒体に記憶させて、配布することが可能である。  It may be implemented on a computer device. The handwritten character recognition program can be stored in various storage media and distributed.
[0053] ここではオンラインの文字を対象とした力 S、適当な細線化か、輪郭追跡などで、オフ ラインの文字に対しても、文字認識を行うようにしてもよい。 In this case, the character recognition may be performed on the off-line characters by the force S for the on-line characters, appropriate thinning, or contour tracing.
さらに、上述した実施の形態では、主として数字の認識を行う場合を例としたが、本 発明の手書き文字認識は、基本的にどのような言語の文字や記号にも適用可能であ 引用符号の説明  Furthermore, in the above-described embodiment, although the case of mainly recognizing numbers is taken as an example, handwritten character recognition of the present invention can be basically applied to characters and symbols of any language. Description
[0054] 1···紙、 la…運筆、 2···ペン、 3···入力処理部、 4···折れ線近似部、 5···前処理部、 6 …特徴抽出部、 7···識別部、 8···識別結果出力部  [0054] 1 ... paper, la ... stroke, 2 ... pen, 3 ... input section, 4 ... polygonal line approximating unit, 5 ... preprocessing unit, 6 ... characteristic extraction unit, 7 · · · · · · · Identification unit, 8 · · · output unit

Claims

請求の範囲 The scope of the claims
[1] オンライン手書き文字を認識する手書き文字認識方法にお!/、て、  [1] Online handwriting recognition method /!
入力された手書き文字列を、各画毎にパラメータ表現でとらえ、各画毎に折れ線近 似を行い、  Capture the input handwritten character string for each stroke by parameter expression, and perform line approximation for each stroke,
その折れ線近似された各折れ線を、始点から終点にいたるベクトルとして、基準とな る軸と各折れ線とのなす角度を折れ線角系列として求め、  The angle between the reference axis and each polygonal line is determined as a polygonal angle series, with each polygonal line approximated by the polygonal line as a vector extending from the start point to the end point.
得られた折れ線の各頂点の外角系列を求め、  Find the outer corner series of each vertex of the obtained polygonal line,
前記外角系列のプラス又はマイナスの同じ符号が連続する同符号の外角の和を、 巻き角系列とし、  Let the sum of the external angles of the same sign where the same sign of plus or minus of the external angle series continues is the winding angle series,
前記求められた各系列による特徴抽出を基にして、基準となる点であるノードを求 め、そのノードの点としての属性とノード間の辺としての属性により、グラフ表現を与え 始点と終点を特定しない開いたマスク構成によるテンプレートとのマッチングにより、 普通の上下、左右の関係を保持した文字に対しても、また回転不変を求められる場 合においても、柔軟で、端のノイズや変形に強い頑健な認識を行うことを特徴とする 文字認識方法。  Based on the feature extraction based on each of the obtained series, a node as a reference point is determined, and a graph representation is given by the attribute as the point of the node and the attribute as an edge between the nodes. By matching with the template by the open mask configuration which is not specified, it is flexible and strong against the noise and the deformation of the edge, even for the character that holds the normal upper and lower, left and right relationship, and when the rotation invariance is required. A character recognition method characterized by robust recognition.
[2] 請求の範囲第 1項記載の文字認識方法において、 [2] In the character recognition method according to claim 1,
始点、終点、 90度の整数倍の巻き角を与える点、巻き角が変化する点をノードとし 、ノードの点としての属性とノード間の辺の属性によりグラフ表現を与え、始点、終点 を特定しない開いたマスク構成によるテンプレートとのマッチングにより、回転不変な 認識手段を特徴とする  Start point, end point, point giving winding angle of integer multiple of 90 degrees, point where winding angle changes as node, give graph representation by attribute of node as point and attribute of edge between nodes, specify start point, end point It features rotation-invariant recognition means by matching the template with a non-open mask configuration.
文字認識方法。  Character recognition method.
[3] 請求の範囲第 1項記載の文字認識方法において、 [3] In the character recognition method according to claim 1,
回転不変が要求されない、通常の文字形の認識のために、上下の凹凸、左右の凹 凸をノードとし、そのノードの点としての属性とノード間の辺としての属性により、グラフ 表現を与え、始点と終点を特に規定しな!/、開いたマスク構成によるテンプレートとの マッチングにより、文字認識を行うことを特徴とする  In order to recognize ordinary character shapes that do not require rotation invariance, let the upper and lower asperities and left and right concave and convex nodes be nodes, and give a graph representation by the attribute as the point of the node and the attribute as the edge between the nodes. It does not specify the start point and the end point in particular! / Character recognition is performed by matching with the template by the open mask configuration
文字認識方法。 Character recognition method.
[4] 請求の範囲第 1項記載の文字認識方法において、 [4] In the character recognition method according to claim 1,
複数の文字又は記号が包含関係にある場合には、定量的に整合部分が長い文字 又は記号を優先することを特徴とする  In the case where a plurality of characters or symbols are in an inclusion relation, it is characterized in that the matching portion preferentially gives priority to the long characters or symbols.
文字認識方法。  Character recognition method.
[5] 請求の範囲第 1項記載の文字認識方法において、 [5] In the character recognition method according to claim 1,
連続した文字列文字を認識と同時に切り出して認識を行うことを特徴とする 文字認識方法。  A character recognition method characterized in that continuous character string characters are recognized and cut out simultaneously with recognition.
[6] オンラインの手書き文字を認識する手書き文字認識システムにお!/、て、  [6] A handwriting recognition system that recognizes online handwriting!
手書き文字がオンラインで入力される入力手段と、  Input means for inputting handwritten characters online;
入力された手書き文字列を、各画毎にパラメータ表現でとらえ、各画毎に折れ線近 似を行う折線近似手段と、  A broken line approximation means for capturing input handwritten character strings as parameter expressions for each stroke and performing polygonal line approximation for each stroke;
前記折線近似手段で折れ線近似された各折れ線を、始点から終点にレヽたるべタト ルとして、基準となる軸と各折れ線とのなす角度を折れ線角系列として求め、得られ た折れ線の各頂点の外角系列を求め、前記外角系列のプラス又はマイナスの同じ符 号が連続する同符号の外角の和を、巻き角系列とする処理手段と、  Each broken line approximated by the broken line approximation means is a vector that is calculated from the start point to the end point, and the angle formed by the reference axis and each broken line is obtained as a broken line angle series, and each vertex of the obtained broken line A processing means for obtaining an outer angle sequence, and taking the sum of the outer angles of the same sign in which plus or minus same symbols of the outer angle sequence are continuous as a winding angle sequence;
前記処理手段で求められた各系列による特徴抽出を基にして、基準となる点である ノードを求め、そのノードの点としての属性とノード間の辺としての属性により、グラフ 表現  Based on feature extraction by each series determined by the processing means, a node which is a reference point is determined, and a graph is represented by an attribute as the node point and an attribute as an edge between the nodes.
を与え、始点と終点を特に規定しな!/、開いたマスク構成によるテンプレートとのマッチ ングにより、普通の上下、左右の関係を保持した文字に対しても、また回転不変を求 められる場合においても、柔軟で、端のノイズや変形に強い頑健な認識を行うステツ プとを備えることを特徴とする  If you can find rotation invariance even for characters that maintain normal upper / lower and left / right relationships by matching with a template with an open mask configuration. Also have steps to perform robust recognition that is flexible and resistant to noise and deformation at the edges.
文字認識システム。  Character recognition system.
[7] オンライン手書き文字を認識する手書き文字認識プログラムにおレヽて、  [7] Write a handwriting recognition program that recognizes handwriting online,
入力された手書き文字列を、各画毎にパラメータ表現でとらえ、各画毎に折れ線近 似を行うステップと、  Capturing handwritten input character strings for each stroke by parameter expression, and performing polygonal line approximation for each stroke;
その折れ線近似された各折れ線を、始点から終点にいたるベクトルとして、基準とな る軸と各折れ線とのなす角度を折れ線角系列として求めるステップと、 得られた折れ線の各頂点の外角系列を求めるステップと、 Obtaining the angle between the reference axis and each broken line as a broken line angle series, using the broken line-approximated broken lines as vectors extending from the start point to the end point; Determining an outer angle sequence of each vertex of the obtained polygonal line;
前記外角系列のプラス又はマイナスの同じ符号が連続する同符号の外角の和を、 巻き角系列とするステップと、  Setting the sum of the external angles of the same sign where positive or negative same signs of the external angle series are continuous as a winding angle series;
前記求められた各系列による特徴抽出を基にして、基準となる点であるノードを求 め、そのノードの点としての属性とノード間の辺としての属性により、グラフ表現を与え 始点と終点を規定しない開いたマスク構成によるテンプレートとのマッチングにより、 普通の上下、左右の関係を保持した文字に対しても、また回転不変を求められる場 合においても、柔軟で、端のノイズや変形にも強い頑健な認識を行うステップとを備 えたことを特徴とする  Based on the feature extraction based on each of the obtained series, a node as a reference point is determined, and a graph representation is given by the attribute as the point of the node and the attribute as an edge between the nodes. By matching with a template with an open mask configuration that is not specified, it is flexible even for characters that maintain normal upper and lower, left and right relationships, and when rotation invariance is required. It is characterized in that it has a step of making strong and robust recognition.
手書き文字認識プログラム。  Handwritten character recognition program.
[8] 記憶されたプログラムを所定の演算処理装置に実装させることで、オンラインの手 書き文字認識が可能な記憶媒体において、 [8] A storage medium capable of online handwriting character recognition by mounting a stored program on a predetermined arithmetic processing unit,
記憶媒体に記憶されたプログラムとして、  As a program stored in a storage medium,
入力された手書き文字列を、各画毎にパラメータ表現でとらえ、各画毎に折れ線近 似を行うステップと、  Capturing handwritten input character strings for each stroke by parameter expression, and performing polygonal line approximation for each stroke;
その折れ線近似された各折れ線を、始点から終点にいたるベクトルとして、基準とな る軸と各折れ線とのなす角度を折れ線角系列として求めるステップと、  Obtaining the angle between the reference axis and each broken line as a broken line angle series, using the broken line-approximated broken lines as vectors extending from the start point to the end point;
得られた折れ線の各頂点の外角系列を求めるステップと、  Determining an outer angle sequence of each vertex of the obtained polygonal line;
前記外角系列のプラス又はマイナスの同じ符号が連続する同符号の外角の和を、 巻き角系列とするステップと、  Setting the sum of the external angles of the same sign where positive or negative same signs of the external angle series are continuous as a winding angle series;
前記求められた各系列による特徴抽出を基にして、基準となる点であるノードを求 め、そのノードの点としての属性とノード間の辺としての属性により、グラフ表現を与え 始点と終点を規定しない開いたマスク構成によるテンプレートとのマッチングにより、 普通の上下、左右の関係を保持した文字に対しても、また回転不変を求められる場 合においても、柔軟で、端のノイズや変形に強い頑健な認識を行うステップとを備え たことを特徴とする 記憶媒体。 Based on the feature extraction based on each of the obtained series, a node as a reference point is determined, and a graph representation is given by the attribute as the point of the node and the attribute as an edge between the nodes. It is flexible and resistant to edge noise and deformation, even for characters that maintain normal upper and lower, left and right relationships, and when rotation invariance is required, by matching with a template with an open mask configuration that is not specified. And a step of performing robust recognition. Storage medium.
PCT/JP2007/065458 2006-08-14 2007-08-07 Hand-written character recognizing method, hand-written character recognizing system, hand-written character recognizing program, and storage medium WO2008020557A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2006-221253 2006-08-14
JP2006221253A JP5352757B2 (en) 2006-08-14 2006-08-14 Handwritten character recognition method, handwritten character recognition system, handwritten character recognition program, and storage medium

Publications (1)

Publication Number Publication Date
WO2008020557A1 true WO2008020557A1 (en) 2008-02-21

Family

ID=39082082

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2007/065458 WO2008020557A1 (en) 2006-08-14 2007-08-07 Hand-written character recognizing method, hand-written character recognizing system, hand-written character recognizing program, and storage medium

Country Status (2)

Country Link
JP (1) JP5352757B2 (en)
WO (1) WO2008020557A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107067046A (en) * 2016-11-29 2017-08-18 南京工程学院 Handwritten Digit Recognition method based on hybrid feature extraction
CN113392772A (en) * 2021-06-17 2021-09-14 南开大学 Character recognition-oriented character image shrinkage deformation enhancement method

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS6395591A (en) * 1986-10-13 1988-04-26 Wacom Co Ltd Method for recognizing hand-written character
JPS642187A (en) * 1987-06-24 1989-01-06 Nec Corp On line successive character recognition device
JPH06274698A (en) * 1993-03-18 1994-09-30 Pfu Ltd On-line hand-written character recognizing system
JPH08101889A (en) * 1994-09-30 1996-04-16 Sharp Corp On-line handwritten character recognition method
JPH1049631A (en) * 1996-05-22 1998-02-20 Seiko Epson Corp Method and device for on-line handwritted character recognition
WO2006088222A1 (en) * 2005-02-15 2006-08-24 Kite Image Technologies Inc. Handwritten character recognizing method, handwritten character recognizing system, handwritten character recognizing program, and recording media

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS6395591A (en) * 1986-10-13 1988-04-26 Wacom Co Ltd Method for recognizing hand-written character
JPS642187A (en) * 1987-06-24 1989-01-06 Nec Corp On line successive character recognition device
JPH06274698A (en) * 1993-03-18 1994-09-30 Pfu Ltd On-line hand-written character recognizing system
JPH08101889A (en) * 1994-09-30 1996-04-16 Sharp Corp On-line handwritten character recognition method
JPH1049631A (en) * 1996-05-22 1998-02-20 Seiko Epson Corp Method and device for on-line handwritted character recognition
WO2006088222A1 (en) * 2005-02-15 2006-08-24 Kite Image Technologies Inc. Handwritten character recognizing method, handwritten character recognizing system, handwritten character recognizing program, and recording media

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107067046A (en) * 2016-11-29 2017-08-18 南京工程学院 Handwritten Digit Recognition method based on hybrid feature extraction
CN113392772A (en) * 2021-06-17 2021-09-14 南开大学 Character recognition-oriented character image shrinkage deformation enhancement method
CN113392772B (en) * 2021-06-17 2022-04-19 南开大学 Character recognition-oriented character image shrinkage deformation enhancement method

Also Published As

Publication number Publication date
JP5352757B2 (en) 2013-11-27
JP2008046825A (en) 2008-02-28

Similar Documents

Publication Publication Date Title
JP4092371B2 (en) Handwritten character recognition method, handwritten character recognition system, handwritten character recognition program, and recording medium
US7596272B2 (en) Handling of diacritic points
US5742705A (en) Method and apparatus for character recognition of handwritten input
Zhu et al. Vision based hand gesture recognition using 3D shape context
US7369702B2 (en) Template-based cursive handwriting recognition
US20080002884A1 (en) Method and device for recognition of a handwritten pattern
CN106845384B (en) gesture recognition method based on recursive model
US20060239560A1 (en) Recognition graph
JP4738469B2 (en) Image processing apparatus, image processing program, and image processing method
WO2021196013A1 (en) Word recognition method and device, and storage medium
TWI303783B (en) Method and device for recognition of a handwritten pattern, and computer program product
JP6915786B2 (en) Learning device, recognition device, learning method and computer program
Schenk et al. Selecting features in on-line handwritten whiteboard note recognition: SFS or SFFS?
WO2008020557A1 (en) Hand-written character recognizing method, hand-written character recognizing system, hand-written character recognizing program, and storage medium
Kang et al. Utilization of hierarchical, stochastic relationship modeling for Hangul character recognition
Valveny et al. Application of deformable template matching to symbol recognition in handwritten architectural drawings
JP4648084B2 (en) Symbol recognition method and apparatus
JP5224156B2 (en) How to make up for the lack of learning character samples in character recognition systems
Shreya et al. GAN-enable latent fingerprint enhancement model for human identification system
Inuganti et al. Recognition of online handwritten Telugu stroke by detected dominant points using curvature estimation
Yang Static Gesture Recognition Algorithm Based on Upper Triangular Image Texture and Recursive Graph.
Abdo et al. C30. An Interpolation Based Technique for Sign Language Recognition
Tokuno et al. Pen-coordinate information modeling by SCPR-based HMM for on-line Japanese handwriting recognition
Yun et al. Shape representation robust to the sketching order using distance map and direction histogram
Babu et al. Handwritten Symbol Recognition Using Hierarchical Shape Representation Model Based on Shape Signature

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 07792127

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

NENP Non-entry into the national phase

Ref country code: RU

122 Ep: pct application non-entry in european phase

Ref document number: 07792127

Country of ref document: EP

Kind code of ref document: A1