JPH01321589A - Feature extraction system - Google Patents
Feature extraction systemInfo
- Publication number
- JPH01321589A JPH01321589A JP63155608A JP15560888A JPH01321589A JP H01321589 A JPH01321589 A JP H01321589A JP 63155608 A JP63155608 A JP 63155608A JP 15560888 A JP15560888 A JP 15560888A JP H01321589 A JPH01321589 A JP H01321589A
- Authority
- JP
- Japan
- Prior art keywords
- pattern
- character
- character pattern
- feature
- stores
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000605 extraction Methods 0.000 title claims description 23
- 238000012015 optical character recognition Methods 0.000 claims description 2
- 230000010354 integration Effects 0.000 abstract description 6
- 238000000034 method Methods 0.000 description 6
- 238000005516 engineering process Methods 0.000 description 5
- 238000010586 diagram Methods 0.000 description 4
- 238000012545 processing Methods 0.000 description 3
- FBOUIAKEJMZPQG-AWNIVKPZSA-N (1E)-1-(2,4-dichlorophenyl)-4,4-dimethyl-2-(1,2,4-triazol-1-yl)pent-1-en-3-ol Chemical compound C1=NC=NN1/C(C(O)C(C)(C)C)=C/C1=CC=C(Cl)C=C1Cl FBOUIAKEJMZPQG-AWNIVKPZSA-N 0.000 description 2
- 238000004891 communication Methods 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 230000010365 information processing Effects 0.000 description 1
- 239000004615 ingredient Substances 0.000 description 1
- 238000012805 post-processing Methods 0.000 description 1
- 238000007781 pre-processing Methods 0.000 description 1
Landscapes
- Character Discrimination (AREA)
Abstract
Description
【発明の詳細な説明】
(産業上の利用分野)
本発明は文字認識、特に手書文字認識における特徴抽出
方式に関する。DETAILED DESCRIPTION OF THE INVENTION (Field of Industrial Application) The present invention relates to character recognition, particularly to a feature extraction method in handwritten character recognition.
(従来技術とその課題)
情報処理システムの多様化に伴ない様々なデータ入力方
法が要求されており、文字認識技術も有力なデータ入力
方法として実用化が進められている。しかし現在の文字
認識技術は、文字の読取り性能の点で入間に比べてはる
かに劣っており、より高い文字認識性能を有する文字読
取り装置が望まれている。(Prior Art and its Issues) With the diversification of information processing systems, various data input methods are required, and character recognition technology is also being put into practical use as a powerful data input method. However, current character recognition technology is far inferior to Iruma in terms of character reading performance, and a character reading device with higher character recognition performance is desired.
文字認識性能を高めるために、文字認識を構成する、前
処理、特徴抽出処理2分類・識別処理。In order to improve character recognition performance, preprocessing, feature extraction processing, and classification/identification processing that constitute character recognition are performed.
後処理のそれぞれにおいて改良が努められている。Efforts are being made to improve each post-processing process.
この中で、特徴抽出処理については、方向性の特徴の有
効性が例えば文献電子情報通信学会論文誌(D) 、V
ol、J65−D 、隘5.p1)、550〜557゜
斉藤、山田、山本「手書き漢字の方向パターン・マツチ
ング法による解析」等で示されている。従来から知られ
ている方向性の特徴の一般的な抽出方法では、まず文字
パタン平面上で方向成分の分布を求め、次に文字パタン
を複数の部分領域に分割し、各部分領域内で方向成分を
統合する。文字° パタンをこのように処理することに
よりて、方向成分の分布が圧縮さnた形式の方向特徴パ
タンを抽出することができる。しかし、この処理の中で
領域分割の定め方が難しく、分割が細かいと、データ量
が多く、かつ文字パタンの変形等を含む雑音に対して過
敏になるという問題点が生じ、分割が粗いと、文字を分
類・識別するための情報が失なわれて、類似した文字の
識別が困難になるという問題点が生じる。Among these, regarding the feature extraction process, the effectiveness of directional features is shown in, for example, the literature Transactions of the Institute of Electronics, Information and Communication Engineers (D), V
ol, J65-D, 隘5. p1), 550-557° Saito, Yamada, Yamamoto "Analysis of handwritten kanji using directional pattern matching method" etc. The conventional method for extracting directional features is to first find the distribution of directional components on the character pattern plane, then divide the character pattern into multiple subregions, and calculate the directionality within each subregion. Integrate ingredients. By processing the character pattern in this way, it is possible to extract a direction feature pattern in which the distribution of direction components is compressed. However, it is difficult to define area divisions during this process, and if the divisions are fine, the amount of data will be large, and the problem will be that the area will be sensitive to noise, including deformation of character patterns, etc.; , the problem arises that information for classifying and identifying characters is lost, making it difficult to identify similar characters.
(課題を解決するための手段)
本発明によると、M×Nの格子状の配列とじて与えられ
る文字パタンを自動的に読取る光学的文字認識において
、該文字パタンの特徴を抽出する方式であって、M×N
の文字パタンを格納する文字パタン記憶部と、前記文字
パタン記憶部から文字パタンの小領域に含まれる画素を
順次読取り、小領域を代表する画素に方向成分を割当て
ることにより、文字パタン上の方向成分の分布を示す方
向パタンを作成する方向抽出部と、前記方向抽出部の作
成した方向パタンを格納する方向パタン記″憶部と、前
記方向パタン記憶部から方向パタンを読み込み、垂直方
向成分は、縦長の領域内で統合を行ない、水平方向成分
は横長の領域内で統合を行ない、斜め方向の成分は縦横
同一の広さの領域内で統合を行なうことによって文字認
識のための方向特徴パタンを求める方向特徴抽出部と、
前記方向特徴抽出部から出力される方向特徴パタンを格
納する方向特徴記憶部とを具備する特徴抽出方式を実現
し、文字認識性能を高めることができる。(Means for Solving the Problems) According to the present invention, in optical character recognition that automatically reads a character pattern given as an M×N grid array, there is provided a method for extracting features of the character pattern. T, M×N
A character pattern storage unit stores a character pattern, and pixels included in a small area of the character pattern are sequentially read from the character pattern storage unit, and the direction on the character pattern is determined by assigning a direction component to a pixel representing the small area. a direction extraction section that creates a direction pattern indicating the distribution of components; a direction pattern storage section that stores the direction pattern created by the direction extraction section; a direction pattern storage section that reads the direction pattern from the direction pattern storage section; , the directional feature pattern for character recognition is performed by integrating within a vertically long area, integrating horizontal components within a horizontally long area, and integrating diagonal components within an area of the same size vertically and horizontally. a directional feature extraction unit that calculates
Character recognition performance can be improved by realizing a feature extraction method including a direction feature storage section that stores the direction feature pattern output from the direction feature extraction section.
(作用)
以下に本発明の原理について説明する。第2図(a)は
漢字の「墨」を表わす文字パタンと文字パタン領域、第
2図(b)と(C)はそれぞれ、文字パタン領域内の「
墨」の水平方向成分と垂直方向成分の分布パタンである
。従来は、この分布パタンから方向特徴パタンを求める
ために、文字パタン領域をm x nの部分領域に分割
するが、各部分領域は、それぞれ正方形でありだ。しか
し「墨」のように水平成分が多い場合には、パタンの情
報を保存するために垂直方向の解像度が高いことが望ま
しい。(Operation) The principle of the present invention will be explained below. Figure 2 (a) shows the character pattern and character pattern area representing the kanji "sumi", and Figures 2 (b) and (C) each show the character pattern and character pattern area representing the character "sumi".
This is the distribution pattern of the horizontal component and vertical component of "black ink". Conventionally, in order to obtain a directional feature pattern from this distribution pattern, a character pattern area is divided into m x n partial areas, and each partial area is square. However, when there are many horizontal components, such as "black ink", it is desirable to have a high resolution in the vertical direction in order to preserve pattern information.
一方、垂直成分の多い文字の場合には、パタンの情報を
保存するために水平方向の解像度が高いことが望ましい
。従って、水平方向成分の統合を行なうときには、第3
図(a)のように、各部分領域が横長の長方形になるよ
うに分割を行なって、各部分領域内で統合を行ない、垂
直方向成分の統合を行なうときには、第3図(b)のよ
うに、各部分領域が縦長の長方形になるように分割を行
なって定まる各部分領域内で統合を行なうことによって
、パタンの情報を保存した方向特徴パタンか得られる。On the other hand, in the case of characters with many vertical components, it is desirable to have high resolution in the horizontal direction in order to preserve pattern information. Therefore, when integrating horizontal components, the third
As shown in Figure 3(a), each partial area is divided into horizontally long rectangles, and when integration is performed within each partial area and the vertical components are integrated, as shown in Figure 3(b). Next, by dividing each partial area into a vertically long rectangle and performing integration within each partial area, a directional feature pattern that preserves pattern information can be obtained.
この場合、斜め方向の成分の統合には従来通り第3図(
C)のように、各部分領域が正方形になるように分割を
行なって、各部分領域内で統合を行なえばよい。例えば
文字パタンか64X64の画素から成るとき、垂直方向
成分の方向特徴パタンの抽出には、縦4分割、横16分
割で部分領域を定め、水平方向成分の方向特徴パタンの
抽出には、縦16分割、横4分割で部分領域を定め、斜
め方向成分の方向特徴パタンの抽出には、縦8分割、横
8分割で部分領域を定め、それぞれの部分領域内で統合
を行なうことによって、文字認識のためにパタンの情報
が保存された方向特徴パタンを求めることができ、認識
性能の向上に役立つ。In this case, the integration of diagonal components is done as usual in Figure 3 (
As shown in C), division is performed so that each partial area becomes a square, and integration is performed within each partial area. For example, when a character pattern consists of 64 x 64 pixels, to extract the vertical component's directional feature pattern, a partial area is divided into 4 vertically and 16 horizontally. Character recognition It is possible to obtain a directional feature pattern in which pattern information is saved, which is useful for improving recognition performance.
(実施例)
第1図は本発明の構成の一実施例を示すブロック図であ
る。文字パタン記憶部1は、入力文字パタンを格納する
もので、通常の記憶手段でよい。(Embodiment) FIG. 1 is a block diagram showing an embodiment of the configuration of the present invention. The character pattern storage section 1 stores input character patterns, and may be a normal storage means.
方向抽出部2は、入力文字パタンを信号11として読込
み、各方向ごとの方向成分の分布パタンを抽出し信号1
2として各方向成分を出力するもので、前出の文献等に
抽出アルゴリズムが示されており、通常の論理素子と記
憶手段等を用いる従来技術で容易に実現できる。方向パ
タン記憶部3は信号12として送られる各方向成分ごと
の分布パタンを格納するもので、通常の記憶手段でよい
。The direction extraction unit 2 reads the input character pattern as a signal 11, extracts the distribution pattern of directional components for each direction, and generates a signal 1.
2, the extraction algorithm is shown in the above-mentioned literature, etc., and can be easily realized by conventional technology using ordinary logic elements, storage means, etc. The direction pattern storage section 3 stores the distribution pattern for each direction component sent as the signal 12, and may be a normal storage means.
方向特徴抽出部4は各方向成分ごとの分布パタンを信号
13として入力し、作用の項で説明した通り、水平方向
成分については横長の長方形の各部分領域内で水平方向
分布パタンの画素の値の総和を求めることによって統合
を行ない、垂直方向成分については縦長の長方形の各部
分領域内で垂直方向分布パタンの画素の値の総和を求め
ることによって統合を行なり′、斜め方向成分について
は正方形の各部分領域内で各斜め方向の分布パタンの画
素の値の総和を求めることによって統合を行ない、方向
特徴パタンを抽出し、信号14として出力するもので、
通常の論理素子と記憶手段等を用いる従来技術で容易に
実現できる。方向特徴記憶部5は前記方向特徴抽出部4
から信号15として出力された方向特徴パタンを格納す
るもので通常の記憶手段でよい。The directional feature extraction unit 4 inputs the distribution pattern for each direction component as a signal 13, and as explained in the section of the operation, for the horizontal component, it calculates the pixel values of the horizontal distribution pattern in each partial region of a horizontally long rectangle. The vertical component is integrated by finding the sum of the pixel values of the vertical distribution pattern within each subregion of a vertically long rectangle, and the diagonal component is integrated by finding the sum of the pixel values of the vertical distribution pattern in each subregion of a vertically long rectangle. Integration is performed by calculating the sum of the pixel values of the distribution patterns in each diagonal direction within each partial region, and the directional feature pattern is extracted and output as a signal 14.
This can be easily realized using conventional technology using ordinary logic elements, storage means, and the like. The direction feature storage section 5 includes the direction feature extraction section 4.
A normal storage means may be used to store the directional characteristic pattern output as the signal 15 from.
本実施例では、各部分領域内での分布パタンの画素の値
の総和を求めることで、方向特徴パタンの抽出を説明し
たが、方向特徴パタンの各成分の値の総和等で、各成分
の値を正規化して方向特徴パタンとすることもできる。In this example, the extraction of the directional feature pattern was explained by calculating the sum of the pixel values of the distribution pattern within each partial region. It is also possible to normalize the value and use it as a directional feature pattern.
この場合も、加算器。Again, an adder.
除算器等の追加が必要であるが、従来技術で容易に実現
できる。Although it is necessary to add a divider, etc., it can be easily realized using conventional technology.
(発明の効果)
以上に詳しく説明したように本発明によれば、方向成分
ごとに部分領域を定めることで、文字パタンの情報を保
存して、雑音に過敏とならない方向特徴抽出が実現でき
る。そこで、本発明の特徴抽出方式を文字認識装置に適
用することにより、文字認識の性能を大幅に向上できる
。(Effects of the Invention) As described in detail above, according to the present invention, by defining a partial region for each directional component, character pattern information can be preserved and directional feature extraction that is not sensitive to noise can be realized. Therefore, by applying the feature extraction method of the present invention to a character recognition device, character recognition performance can be significantly improved.
第1図は本発明の一実施例の構成を示すブロック図であ
る。第2図は文字パタンの水平方向成分の分布パタンと
垂直方向成分の分布パタンを例示する図、第3図は本発
明において方向特徴抽出のために行う領域分割の例を示
す図である。
図中、1は文字パタン記憶部、2は方向抽出部、3は方
向パタン記憶部、4は方向特徴抽出部、5は方向特徴記
憶部である。
代理人 弁理士 本 庄 伸 介
第1図
(a) (b) (c)
第2図
(a) (b) (c)
第3図FIG. 1 is a block diagram showing the configuration of an embodiment of the present invention. FIG. 2 is a diagram illustrating a horizontal component distribution pattern and a vertical component distribution pattern of a character pattern, and FIG. 3 is a diagram illustrating an example of region division performed for extracting directional features in the present invention. In the figure, 1 is a character pattern storage section, 2 is a direction extraction section, 3 is a direction pattern storage section, 4 is a direction feature extraction section, and 5 is a direction feature storage section. Agent Patent Attorney Shinsuke Honjo Figure 1 (a) (b) (c) Figure 2 (a) (b) (c) Figure 3
Claims (1)
動的に読取る光学的文字認識において、該文字パタンの
特徴を抽出する方式であって、M×Nの文字パタンを格
納する文字パタン記憶部と、前記文字パタン記憶部から
文字パタンの小領域に含まれる画素を順次に読取り、小
領域を代表する画素に方向成分を割当てることにより、
文字パタン上の方向成分の分布を示す方向パタンを作成
する方向抽出部と、前記方向抽出部の作成した方向パタ
ンを格納する方向パタン記憶部と、前記方向パタン記憶
部から方向パタンを読み込み、垂直方向成分は、縦長の
領域内で統合を行ない、水平方向成分は横長の領域内で
統合を行ない、斜め方向の成分は縦横同一の広さの領域
内で統合を行なうことによって文字認識のための方向特
徴パタンを求める方向特徴抽出部と、前記方向特徴抽出
部から出力される方向特徴パタンを格納する方向特徴記
憶部とを具備する特徴抽出方式。In optical character recognition that automatically reads a character pattern given as an M×N grid array, a character pattern storage unit stores the M×N character pattern, the system extracting the characteristics of the character pattern. Then, by sequentially reading pixels included in a small area of the character pattern from the character pattern storage unit and assigning a directional component to a pixel representing the small area,
a direction extraction unit that creates a direction pattern indicating the distribution of directional components on the character pattern; a direction pattern storage unit that stores the direction pattern created by the direction extraction unit; Directional components are integrated within a vertically elongated area, horizontal components are integrated within a horizontally elongated area, and diagonal components are integrated within an area of the same size both vertically and horizontally. A feature extraction method comprising: a direction feature extraction section that obtains a direction feature pattern; and a direction feature storage section that stores the direction feature pattern output from the direction feature extraction section.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP63155608A JP2605807B2 (en) | 1988-06-23 | 1988-06-23 | Feature extraction method |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP63155608A JP2605807B2 (en) | 1988-06-23 | 1988-06-23 | Feature extraction method |
Publications (2)
Publication Number | Publication Date |
---|---|
JPH01321589A true JPH01321589A (en) | 1989-12-27 |
JP2605807B2 JP2605807B2 (en) | 1997-04-30 |
Family
ID=15609744
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
JP63155608A Expired - Lifetime JP2605807B2 (en) | 1988-06-23 | 1988-06-23 | Feature extraction method |
Country Status (1)
Country | Link |
---|---|
JP (1) | JP2605807B2 (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2006073081A1 (en) * | 2005-01-05 | 2006-07-13 | Nec Corporation | Discriminative data learning system, learning device, discriminating device, and learning method |
US9036903B2 (en) | 2010-01-06 | 2015-05-19 | Nec Corporation | Learning device, identification device, learning identification system and learning identification device |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPS6327991A (en) * | 1986-07-22 | 1988-02-05 | Ricoh Co Ltd | Formation of histogram for input information recognizing device |
-
1988
- 1988-06-23 JP JP63155608A patent/JP2605807B2/en not_active Expired - Lifetime
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPS6327991A (en) * | 1986-07-22 | 1988-02-05 | Ricoh Co Ltd | Formation of histogram for input information recognizing device |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2006073081A1 (en) * | 2005-01-05 | 2006-07-13 | Nec Corporation | Discriminative data learning system, learning device, discriminating device, and learning method |
JPWO2006073081A1 (en) * | 2005-01-05 | 2008-06-12 | 日本電気株式会社 | Identification data learning system, learning device, identification device, and learning method |
US7783581B2 (en) | 2005-01-05 | 2010-08-24 | Nec Corporation | Data learning system for identifying, learning apparatus, identifying apparatus and learning method |
JP4697670B2 (en) * | 2005-01-05 | 2011-06-08 | 日本電気株式会社 | Identification data learning system, learning device, identification device, and learning method |
US9036903B2 (en) | 2010-01-06 | 2015-05-19 | Nec Corporation | Learning device, identification device, learning identification system and learning identification device |
Also Published As
Publication number | Publication date |
---|---|
JP2605807B2 (en) | 1997-04-30 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP2940936B2 (en) | Tablespace identification method | |
KR20010050887A (en) | Image processing apparatus and method for processing an image | |
CN108229232B (en) | Method and device for scanning two-dimensional codes in batch | |
JPH05225378A (en) | Area dividing system for document image | |
CN111161281A (en) | Face region identification method and device and storage medium | |
CN113378837A (en) | License plate shielding identification method and device, electronic equipment and storage medium | |
JP2003067738A (en) | Method and system for removing net point | |
JPH01321589A (en) | Feature extraction system | |
JP2005149395A (en) | Character recognition device and license plate recognition system | |
JP2008152611A (en) | Image recognition device, electronic equipment, image recognition method, and image recognition program | |
JP2917396B2 (en) | Character recognition method | |
JP3009237B2 (en) | Feature extraction method | |
JP2663550B2 (en) | Feature extraction method | |
Dalla Mura et al. | Classification of hyperspectral images by using morphological attribute filters and independent component analysis | |
JP2792063B2 (en) | Character recognition dictionary creation method | |
JP3127413B2 (en) | Character recognition device | |
Rakshitha et al. | Latent Fingerprint Enhancement for Investigation | |
JP4011859B2 (en) | Word image normalization device, word image normalization program recording medium, and word image normalization program | |
CN114821648A (en) | Weighted graph-based finger vein identification method and device | |
JP2613211B2 (en) | Image input device | |
JPH03126188A (en) | Character recognizing device | |
JPH04299483A (en) | Feature extracting device | |
JPH0581416A (en) | Binarization processing method for gradation image | |
Yamato et al. | Interpolation technique of fingerprint features for personal verification | |
JP2002251613A (en) | Method and device for recognizing graphic |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
FPAY | Renewal fee payment (event date is renewal date of database) |
Free format text: PAYMENT UNTIL: 20080213 Year of fee payment: 11 |
|
FPAY | Renewal fee payment (event date is renewal date of database) |
Free format text: PAYMENT UNTIL: 20090213 Year of fee payment: 12 |
|
EXPY | Cancellation because of completion of term | ||
FPAY | Renewal fee payment (event date is renewal date of database) |
Free format text: PAYMENT UNTIL: 20090213 Year of fee payment: 12 |