JPS60159987A - Character recognizing device - Google Patents

Character recognizing device

Info

Publication number
JPS60159987A
Authority
JP
Japan
Prior art keywords
character
branch point
quadrilateral
data
character pattern
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP59016467A
Other languages
Japanese (ja)
Inventor
Minoru Nagao
永尾 実
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Omron Corp
Original Assignee
Tateisi Electronics Co
Omron Tateisi Electronics Co
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tateisi Electronics Co, Omron Tateisi Electronics Co
Priority to JP59016467A
Publication of JPS60159987A

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/40 Extraction of image or video features
    • G06V10/44 Local feature extraction by analysis of parts of the pattern, e.g. by detecting edges, contours, loops, corners, strokes or intersections; Connectivity analysis, e.g. of connected components
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00 Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10 Character recognition

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Character Discrimination (AREA)

Abstract

PURPOSE: To enable character recognition by a simple method by extracting the quadrilateral circumscribing a character pattern, dividing it into three regions vertically or horizontally, and determining in which divided region a feature such as a character branch point exists. CONSTITUTION: A binarized character pattern is stored in an image memory 1. A circumscribed quadrilateral extraction circuit 2 extracts the quadrilateral circumscribing the character pattern in the image memory 1. A branch point position extraction circuit 3 extracts the positions of the branch points that characterize the character. The extracted data are stored in a RAM 4. A CPU 5 interprets the program in a program memory 6, reads and writes data in the RAM 4 and the image memory 1, and controls the operation of the extraction circuits 2 and 3. In this way, "P" can be clearly distinguished from "9", and "5" from "2", by adding only a simple processing step.

Description

[Detailed Description of the Invention]
<Technical Field of the Invention>
The present invention relates to a character recognition device that optically reads an unknown character, binarizes it into black and white to obtain a character pattern, extracts features of the unknown character from the character pattern, and identifies the unknown character by matching these features against standard patterns stored in advance in a dictionary.

<Background of the Invention>
In a conventional character recognition device, dictionary matching narrows down the candidate characters step by step using the features of the unknown character, and a detailed dictionary matching operation is executed only at the end.

For example, if we focus on the presence or absence of intersections and loops among the character features, the numerals "1" to "9" are classified into four groups according to these features, as shown in FIG. 1. Therefore, if an unknown character has the features "intersection present" and "no loop", it is judged to be one of the candidate characters "4", "5", and "7" in the third group. Applying the same process with other character features, such as the number of loops, the number of endpoints, or the presence or absence of branch points, narrows the candidates further. However, if only intersections, branch points, loops, concavities, and the like are used as character features in this kind of narrowing operation, it is impossible to distinguish, for example, the letter "P" from the numeral "9" shown in FIG. 2, or the numeral "5" from the numeral "2" shown in FIG. 3. Recognizing these characters has therefore required measures such as adopting a more complicated processing method.
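The step-by-step narrowing described in the background can be sketched in Python as a filter over a feature dictionary. This is only an illustration under stated assumptions: the function name `narrow` and the table layout are hypothetical, and only the entries for "4", "5", and "7" follow the text. The entries "x" and "y" are placeholders standing in for characters of other groups, since FIG. 1 is not reproduced here.

```python
from typing import Dict, Set

# Feature dictionary: character -> {feature name: value}.
# "4", "5", "7" follow the text (intersection present, no loop).
# "x" and "y" are placeholder entries for characters of other groups;
# their feature values are invented for this sketch.
DICTIONARY: Dict[str, Dict[str, bool]] = {
    "4": {"intersection": True, "loop": False},
    "5": {"intersection": True, "loop": False},
    "7": {"intersection": True, "loop": False},
    "x": {"intersection": True, "loop": True},
    "y": {"intersection": False, "loop": False},
}


def narrow(candidates: Set[str], feature: str, observed: bool) -> Set[str]:
    """Keep only the candidates whose dictionary entry matches the observed feature."""
    return {c for c in candidates if DICTIONARY[c][feature] == observed}


# Narrow in stages, one feature at a time.
candidates = set(DICTIONARY)
candidates = narrow(candidates, "intersection", True)
candidates = narrow(candidates, "loop", False)
print(sorted(candidates))
```

With these entries the filter leaves "4", "5", and "7", which is the point at which a feature set without position information can go no further.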

<Object of the Invention>
The object of the present invention is to provide a character recognition device that eliminates the above drawback by focusing on a specific character feature and determining where in the character pattern that feature is located.

<Configuration and Effects of the Invention>
To achieve the above object, the present invention is configured to extract the quadrilateral circumscribing the character pattern from the binarized data of the unknown character, divide the area of this quadrilateral into three, for example with vertical dividing lines as shown in FIGS. 4 and 5 or with horizontal dividing lines as shown in FIGS. 6 and 7, and determine in which divided region a feature such as a character branch point is located.

According to the present invention, in the case of the letter "P" in FIG. 4, the branch points T1 and T2 of the character lie in the leftmost region XA, whereas in the case of the numeral "9" in FIG. 5, the branch points T1, T2, and T3 lie in the rightmost region XC and none lies in the leftmost region XA, so the two characters can be clearly distinguished. Likewise, in the case of the numeral "5" in FIG. 6, the branch points T1 and T2 lie in the top region YA, whereas in the case of the numeral "2" in FIG. 7, the branch points T1 and T2 lie in the bottom region YC and none lies in the top region YA, so these two can also be clearly distinguished. The present invention thus makes it possible to distinguish between specific characters that could not previously be told apart, and contributes to improved character recognition accuracy.
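The two distinctions described here can be written as a small rule table in Python. This is a sketch, not the device's logic: the names `RULES` and `disambiguate` are hypothetical, and the input is assumed to be the set of region names (XA to XC, YA to YC) in which at least one branch point was found.

```python
from typing import Dict, Optional, Set, Tuple

# For each confusable pair, the region whose occupancy by a branch point
# decides in favour of each character (taken from the text).
RULES: Dict[Tuple[str, str], Dict[str, str]] = {
    ("P", "9"): {"XA": "P", "XC": "9"},  # left region vs right region
    ("5", "2"): {"YA": "5", "YC": "2"},  # top region vs bottom region
}


def disambiguate(pair: Tuple[str, str], occupied: Set[str]) -> Optional[str]:
    """Return the character of the pair indicated by the occupied regions.

    Returns None when the regions point to neither or to both characters,
    so the caller can fall back to another method.
    """
    rule = RULES[pair]
    hits = {char for region, char in rule.items() if region in occupied}
    return hits.pop() if len(hits) == 1 else None


print(disambiguate(("P", "9"), {"XA"}))
print(disambiguate(("5", "2"), {"YC"}))
```

Returning None for ambiguous input is a design choice of this sketch; the text does not say how the device handles a pattern whose branch points fall in both deciding regions.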

<Description of Embodiments>
FIG. 8 shows an example of the circuit configuration of a device according to the present invention. The image memory 1 in the figure stores a character pattern binarized into black and white. The circumscribed quadrilateral extraction circuit 2 extracts the quadrilateral circumscribing the character pattern in the image memory 1, and the branch point position extraction circuit 3 extracts the positions of the branch points that characterize the character. The extracted data are stored in the RAM (Random Access Memory) 4. The CPU (Central Processing Unit) 5 decodes the program in the program memory 6, reads and writes data in the RAM 4 and the image memory 1, and controls the operation of the extraction circuits 2 and 3.

FIG. 9 shows the control operation of the CPU 5. Suppose that the character pattern 7 of the letter "P" is stored on the XY coordinates of the image memory 1 as shown in FIG. 10. First, in step 10 of FIG. 9, the CPU 5 extracts the quadrilateral 8 circumscribing the character pattern 7. The quadrilateral 8 is defined by the maximum and minimum X coordinates XM and Xm and the maximum and minimum Y coordinates YM and Ym of the character pattern 7. These coordinate data (in the illustrated example, Xm = 2, XM = 9, Ym = 2, YM = 10) are stored in order in a predetermined area of the RAM 4 shown in FIG. 11.
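Step 10 amounts to finding the minimum and maximum coordinates of the black pixels. A minimal Python sketch follows, assuming the image memory is a list of rows indexed as `pattern[y][x]` with 1 for black; the function name and the toy pixels are hypothetical and are not the "P" pattern of FIG. 10.

```python
from typing import List, Optional, Tuple


def circumscribed_quadrilateral(
    pattern: List[List[int]],
) -> Optional[Tuple[int, int, int, int]]:
    """Return (Xm, XM, Ym, YM) of the black pixels, or None for an empty pattern."""
    xs = [x for row in pattern for x, pixel in enumerate(row) if pixel]
    ys = [y for y, row in enumerate(pattern) if any(row)]
    if not xs:
        return None
    return min(xs), max(xs), min(ys), max(ys)


# Toy 12x12 image memory with three black pixels chosen so that the
# extremes match the example values in the text.
memory = [[0] * 12 for _ in range(12)]
for x, y in [(2, 2), (9, 4), (5, 10)]:
    memory[y][x] = 1

print(circumscribed_quadrilateral(memory))  # (2, 9, 2, 10)
```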

Next, in step 11, the CPU 5 extracts from the character pattern 7 the coordinates (X, Y) at which the branch points T1 and T2 are located, and the coordinate data (in the illustrated example, T1 is (4, 5) and T2 is (4, 6)) are stored in a predetermined area of the RAM 4 shown in FIG. 12. In FIG. 12, the table stopper is a code indicating the end of the branch point information.
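The branch point table with its table stopper can be modelled as a flat list terminated by a sentinel. The sketch below is an assumption about the layout, since FIG. 12 is not reproduced here: the X, Y interleaving and the stopper value `0xFF` are hypothetical, and the sketch only works if no coordinate equals the stopper value.

```python
from typing import Iterator, List, Tuple

TABLE_STOPPER = 0xFF  # hypothetical end-of-table code; must not be a valid coordinate


def store_branch_points(points: List[Tuple[int, int]]) -> List[int]:
    """Lay out the points as X, Y, X, Y, ... followed by the table stopper."""
    table: List[int] = []
    for x, y in points:
        table.extend((x, y))
    table.append(TABLE_STOPPER)
    return table


def read_branch_points(table: List[int]) -> Iterator[Tuple[int, int]]:
    """Yield (X, Y) pairs until the table stopper is read."""
    i = 0
    while table[i] != TABLE_STOPPER:
        yield table[i], table[i + 1]
        i += 2


table = store_branch_points([(4, 5), (4, 6)])
print(table)
print(list(read_branch_points(table)))
```

A sentinel lets the reader process a variable number of branch points without storing a separate count, which matches the role the text gives to the table stopper.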

Then, in step 12, the CPU 5 divides the quadrilateral 8 into three both vertically and horizontally, and in the following step 13 it determines in which of the divided regions the branch points T1 and T2 are located.

FIG. 13 shows the contents of steps 12 and 13 in more detail. In the figure, steps 20 to 25 calculate the position data X1 and X2 of the lines that divide the quadrilateral 8 into three vertically, and steps 26 to 31 calculate the position data Y1 and Y2 of the lines that divide it into three horizontally. In the illustrated method, the horizontal and vertical sides of the quadrilateral 8 are divided by 3 to obtain the division constants Dx and Dy and their remainders (steps 20 and 26); after determining whether each remainder is 0, 1, or 2 (steps 21 to 22 and steps 27 to 28), the position data X1, X2 and Y1, Y2 are calculated according to the remainder (steps 23 to 25 and steps 29 to 31). These position data (in the illustrated example, X1 = 5, X2 = 7, Y1 = 5, Y2 = 8) are stored in a predetermined area of the RAM 4 shown in FIG. 14, so that in terms of data the quadrilateral 8 is divided into three regions in each direction, XA to XC and YA to YC, as shown in FIGS. 15 and 16.
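The calculation of the dividing lines can be sketched in Python as follows. The exact remainder handling is defined in FIG. 13, which is not reproduced in the text, so the rule below is an assumption: the side length is taken as maximum minus minimum, a remainder of 1 widens the first region by one, and a remainder of 2 widens the first and second regions by one each. It was chosen because it reproduces the example values X1 = 5, X2 = 7, Y1 = 5, Y2 = 8 from Xm = 2, XM = 9, Ym = 2, YM = 10. The function name is hypothetical.

```python
from typing import Tuple


def division_lines(lo: int, hi: int) -> Tuple[int, int]:
    """Return the two dividing-line positions for the side running from lo to hi.

    Assumed rule (not confirmed by the text): side = hi - lo, division
    constant d = side // 3, and the remainder is spread over the first
    and then the second region.
    """
    d, remainder = divmod(hi - lo, 3)
    first = lo + d + (1 if remainder >= 1 else 0)
    second = first + d + (1 if remainder == 2 else 0)
    return first, second


print(division_lines(2, 9))   # X1, X2 for Xm = 2, XM = 9  -> (5, 7)
print(division_lines(2, 10))  # Y1, Y2 for Ym = 2, YM = 10 -> (5, 8)
```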

Next, the CPU 5 determines in steps 32 to 37 in which of the vertically divided regions XA to XC the branch points T1 and T2 are located, and in steps 38 to 43 in which of the horizontally divided regions YA to YC they are located. The determination is made by comparing the X coordinates of the branch points T1 and T2 with the position data X1 and X2 (steps 32 and 33) and by comparing their Y coordinates with the position data Y1 and Y2 (steps 38 and 39). Based on the result of each step, data indicating where the branch points T1 and T2 are located are set in a predetermined area of the RAM 4 shown in FIG. 17 (steps 34 to 36 and steps 40 to 41). The coordinates of the branch points T1 and T2 are read from the RAM area shown in FIG. 12; when the data read is the table stopper, the determination in step 37 or step 43 becomes "YES" and the corresponding determination process ends. Thus, in FIG. 17, the data "1" is set in the RAM areas corresponding to regions XA, YA, and YB, which indicates that branch points are located in these divided regions.
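The region determination can be sketched in Python as two comparisons per axis. The direction of the comparisons is an assumption, since the flowchart of FIG. 13 is not reproduced here: a coordinate less than or equal to the first line is assigned to region A, one less than or equal to the second line to region B, and anything larger to region C. With the example values from the text (branch points (4, 5) and (4, 6), X1 = 5, X2 = 7, Y1 = 5, Y2 = 8) this sets XA, YA, and YB, which agrees with the result described for FIG. 17. The function name and the dictionary of flags are hypothetical.

```python
from typing import Dict, Iterable, Tuple


def region_flags(
    points: Iterable[Tuple[int, int]], x1: int, x2: int, y1: int, y2: int
) -> Dict[str, int]:
    """Set a flag to 1 for every divided region that contains a branch point."""
    flags = {name: 0 for name in ("XA", "XB", "XC", "YA", "YB", "YC")}
    for x, y in points:
        # Assumed comparison rule: <= first line -> A, <= second line -> B, else C.
        flags["XA" if x <= x1 else "XB" if x <= x2 else "XC"] = 1
        flags["YA" if y <= y1 else "YB" if y <= y2 else "YC"] = 1
    return flags


flags = region_flags([(4, 5), (4, 6)], x1=5, x2=7, y1=5, y2=8)
print(flags)
```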

FIG. 18 shows the character pattern of the numeral "9"; the branch point location data shown in FIG. 19 are obtained for it by the same method as above.

Comparing the data arrangement in FIG. 19 with that in FIG. 17, the two clearly do not match. Therefore, the letter "P" can be distinguished from the numeral "9" by referring, for example, to the data corresponding to region XA.

[Brief Description of the Drawings]

FIG. 1 is an explanatory diagram showing an example of the classification of candidate characters; FIGS. 2 and 3 are explanatory diagrams showing examples of character patterns that a conventional device cannot classify; FIGS. 4 to 7 are explanatory diagrams showing character patterns used to explain the method of the present invention; FIG. 8 is a circuit block diagram showing an example of the device of the present invention; FIG. 9 is a flowchart showing the control operation of the CPU; FIG. 10 is an explanatory diagram showing a character pattern in the image memory; FIGS. 11 and 12 are explanatory diagrams showing how data are stored in the RAM; FIG. 13 is a flowchart showing the control operation of the CPU; FIG. 14 is an explanatory diagram showing how data are stored in the RAM; FIGS. 15 and 16 are explanatory diagrams showing the divided regions of the quadrilateral; FIG. 17 is an explanatory diagram showing the RAM data contents representing the determination result; FIG. 18 is an explanatory diagram showing a character pattern in the image memory; and FIG. 19 is an explanatory diagram showing the RAM data contents representing the determination result.

2: circumscribed quadrilateral extraction circuit
3: branch point position extraction circuit
5: CPU

Patent applicant: Tateisi Electronics Co. (立石電機株式会社)

Claims (1)

[Claims]
1. A character recognition device that reads an unknown character, binarizes it into black and white to obtain a character pattern, extracts features of the unknown character, and matches them against standard patterns, the device comprising: means for extracting, from the binarized data, the quadrilateral circumscribing the character pattern; means for dividing the area of the extracted quadrilateral into a plurality of regions; and means for determining, for a specific character feature, whether it is present in each divided region.
2. The character recognition device according to claim 1, wherein the quadrilateral is divided into three in each of the vertical and horizontal directions.
3. The character recognition device according to claim 1, wherein the specific character feature is a branch point of a character.
JP59016467A 1984-01-30 1984-01-30 Character recognizing device Pending JPS60159987A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP59016467A JPS60159987A (en) 1984-01-30 1984-01-30 Character recognizing device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP59016467A JPS60159987A (en) 1984-01-30 1984-01-30 Character recognizing device

Publications (1)

Publication Number Publication Date
JPS60159987A true JPS60159987A (en) 1985-08-21

Family

ID=11917058

Family Applications (1)

Application Number Title Priority Date Filing Date
JP59016467A Pending JPS60159987A (en) 1984-01-30 1984-01-30 Character recognizing device

Country Status (1)

Country Link
JP (1) JPS60159987A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS62281090A (en) * 1986-05-30 1987-12-05 Canon Inc Recognition method for pattern information

Similar Documents

Publication Publication Date Title
JP3155577B2 (en) Character recognition method and device
CN110503682B (en) Rectangular control identification method and device, terminal and storage medium
JPS60159987A (en) Character recognizing device
JPH0520794B2 (en)
JPH07129713A (en) Character recognition device
JP2001236467A (en) Method and device for pattern recognition, and recording medium with pattern recognition program recorded thereon
KR100332752B1 (en) Method for recognizing character
JPS61163472A (en) Character recognizing device
JP2851865B2 (en) Character recognition device
JP2972443B2 (en) Character recognition device
JPS62281082A (en) Character recognizing device
JPH04260980A (en) Device for recognizing graphic
JPH0877293A (en) Character recognition device and generating method for dictionary for character recognition
JP3411949B2 (en) How to register stroke order and continuation characters
JPS60168283A (en) Character recognition device
JPS61163477A (en) Character recognition device
JPH07141468A (en) Handwritten character recognition processing method in handwriting input character device
JPS5943486A (en) Processing system for extracting circle
JPS6019285A (en) Stroke extracting method
JPH02235189A (en) Character recognizing device
JPH0458383A (en) Multi font character recognizing device
JPH04316176A (en) Name card recognizing method and name card managing machine
JPH0578067B2 (en)
JPS6057110B2 (en) Optical character recognition method and device
JPH08147406A (en) Character recognition device