JPH05282494A - Binary coding device for image data - Google Patents

Binary coding device for image data

Info

Publication number
JPH05282494A
JPH05282494A JP4075054A JP7505492A JPH05282494A JP H05282494 A JPH05282494 A JP H05282494A JP 4075054 A JP4075054 A JP 4075054A JP 7505492 A JP7505492 A JP 7505492A JP H05282494 A JPH05282494 A JP H05282494A
Authority
JP
Japan
Prior art keywords
line width
image data
threshold value
binarization
binary
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP4075054A
Other languages
Japanese (ja)
Inventor
Akio Iwata
明生 岩田
Masaomi Nakajima
正臣 中嶋
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
N T T DATA TSUSHIN KK
Original Assignee
N T T DATA TSUSHIN KK
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by N T T DATA TSUSHIN KK filed Critical N T T DATA TSUSHIN KK
Priority to JP4075054A priority Critical patent/JPH05282494A/en
Publication of JPH05282494A publication Critical patent/JPH05282494A/en
Pending legal-status Critical Current

Links

Landscapes

  • Facsimile Image Signal Circuits (AREA)
  • Character Input (AREA)
  • Image Processing (AREA)

Abstract

PURPOSE:To provide the binary coding device for image data which obtains a binary- coding result having mean line width closest to an optimum line width set value for character recognition by performing a binary coding process only once. CONSTITUTION:The binary coding device is provided with a line width calculation part 18 finding mean line widths for plural threshold values at the same time when multilevel image data are coded into binary data on the basis of the threshold values, a threshold value determination part 13 which selects one line width closest to the optimum line width set value among the line width calculation results based upon the threshold values and outputs the binary coding threshold value corresponding to the line width, a multilevel image delay buffer 19 wherein multilevel image data within the line width detection range are temporarily stored until the threshold value is determined, and a binary coding part 14 which codes the multilevel image data in the multilevel image delay buffer into binary data sequentially by using the threshold value. Consequently, the binary coding threshold value which provides the mean line width closest to the optimum line width set value for character recognition is determined by single-time scanning and then the multilevel image data in the multilevel image delay buffer 19 are coded into binary data.

Description

【発明の詳細な説明】Detailed Description of the Invention

【0001】[0001]

【産業上の利用分野】本発明は、文字認識装置等におい
て、多値画像データ入力手段から得られた画像データを
二値化後の平均線幅が文字認識のための最適な線幅設定
値に最も近くなるように二値化閾値を決定し、二値化を
行なう装置に関し、特にダイナミックに二値化を行なう
画像データの二値化装置に関する。
BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a character recognition device or the like in which an average line width after binarizing image data obtained from multi-valued image data input means is an optimum line width setting value for character recognition. The present invention relates to a binarization apparatus that determines a binarization threshold value so that the binarization threshold is closest to, and particularly to an image data binarization apparatus that dynamically binarizes.

【0002】[0002]

【従来の技術】従来の平均線幅を用いた二値化方式は、
図2に示すような装置で行なわれている。図2におい
て、多値画像データ入力手段21から得られた多値画像
データは、多値画像走査用バッファ22に格納される。
そして、第1回目の二値化処理時は、線幅検出設定範囲
の多値画像データが多値画像走査用バッファ22から順
次取り出され、閾値決定部23から出力される閾値初期
値に基づき、二値化部24により二値化される。この二
値化結果は、二値画像メモリ25に格納されると同時
に、3×3画素抽出部27により3×3画素の形で走査
順に順次取り出され、3×3画素データを利用した線幅
算出部28により平均線幅情報を得る。一方、閾値決定
部23においては、こうして得られた平均線幅情報と文
字認識のための最適な線幅設定値とを比較し、両者の値
の差が小さくなる方向に現在の二値化閾値を増減して出
力する。この二値化閾値を用いて第2回目以降の二値化
処理を行ない、平均線幅が線幅設定値に最も近くなる閾
値にて二値化した画像データが二値画像メモリ25に得
られるまで、繰返し二値化処理を行なう。なお、この種
の方法に関するものとしては、例えば、「文字読取制御
方式;山田、多田、飯田、電気通信研究所研究発表会論
文集 第36号、1981年2月、日本電信電話公社
電気通信研究所、pp.55−56」に記載されている
閾値制御方式がある。
2. Description of the Related Art A conventional binarization method using an average line width is
It is performed by an apparatus as shown in FIG. In FIG. 2, the multi-valued image data obtained from the multi-valued image data input means 21 is stored in the multi-valued image scanning buffer 22.
Then, during the first binarization process, multi-valued image data in the line width detection setting range is sequentially taken out from the multi-valued image scanning buffer 22, and based on the threshold initial value output from the threshold determination unit 23, It is binarized by the binarization unit 24. The binarization result is stored in the binary image memory 25, and at the same time, sequentially extracted in the scanning order in the form of 3 × 3 pixels by the 3 × 3 pixel extraction unit 27, and the line width using the 3 × 3 pixel data. The calculator 28 obtains the average line width information. On the other hand, in the threshold value determining unit 23, the average line width information thus obtained is compared with the optimum line width setting value for character recognition, and the current binarization threshold value is reduced in the direction in which the difference between the two values becomes smaller. Is increased or decreased and output. The second and subsequent binarization processes are performed using this binarization threshold value, and the binarized image data is obtained in the binary image memory 25 by the threshold value at which the average line width is closest to the line width setting value. Repeat the binarization process up to. Examples of this kind of method include, for example, "Character reading control method; Yamada, Tada, Iida, The Institute of Electrical Communication Research, Proceedings No. 36, February 1981, Nippon Telegraph and Telephone Public Corporation.
Institute of Electrical Communication, pp. 55-56 ”.

【0003】[0003]

【発明が解決しようとする課題】上記従来技術では、最
適な二値化結果を得るため、平均線幅情報と線幅設定値
とを比較し、両者の差が最小になるまで最大、閾値の数
の回数だけ二値化閾値の増減を繰り返す必要があり、そ
の繰返しのために処理時間がかかるという問題がある。
また、二値化に要する処理時間が一定せず、パイプライ
ンを利用した効率的なダイナミック二値化処理は難し
い。本発明の目的は、このような問題点を改善し、一回
の二値化処理で、文字認識のための最適な線幅設定値に
最も近い平均線幅となる二値化結果を得ることができ、
かつ、メモリ量を削減することが可能な画像データの二
値化装置を提供することにある。
In the above prior art, in order to obtain the optimum binarization result, the average line width information is compared with the line width setting value, and the maximum and threshold values are set until the difference between them is minimized. There is a problem that it is necessary to repeatedly increase and decrease the binarization threshold value a number of times, and it takes processing time for the repetition.
Further, the processing time required for binarization is not constant, and it is difficult to perform efficient dynamic binarization processing using a pipeline. An object of the present invention is to improve such a problem and to obtain a binarization result having an average line width closest to an optimum line width setting value for character recognition in one binarization process. Can
Another object of the present invention is to provide an image data binarization device capable of reducing the amount of memory.

【0004】[0004]

【課題を解決するための手段】上記目的を達成するた
め、本発明の画像データの二値化装置は、多値画像デー
タ入力手段と、多値画像データを複数の閾値にて二値化
した際の各閾値に対する平均線幅を同時に求める手段
(黒点計数部、輪郭点計数部、黒点・輪郭点数テーブ
ル、線幅計算部、および線幅情報テーブルから構成され
た線幅算出部)と、複数の閾値による線幅算出結果か
ら、最適な線幅設定値に最も近い線幅を一つ選択し、そ
の線幅に対応する二値化閾値を出力する手段(閾値決定
部)と、その閾値が決定されるまで線幅検出設定範囲分
の多値画像データを一時格納するメモリ(多値画像遅延
バッファ)と、その閾値を用いて、多値画像遅延バッフ
ァ内の多値画像データを順次二値化する手段(二値化
部)とを備え、二値化閾値決定後に二値化を行なうこと
に特徴がある。
In order to achieve the above object, an image data binarizing device of the present invention binarizes multivalued image data inputting means and multivalued image data with a plurality of threshold values. A means for simultaneously obtaining the average line width for each threshold value (a black point counting section, a contour point counting section, a black dot / contour point number table, a line width calculating section, and a line width calculating section), and a plurality of From the line width calculation result based on the threshold of, the line width closest to the optimum line width setting value is selected, and a threshold value determination unit that outputs a binarization threshold value corresponding to the line width and the threshold value The memory (multi-valued image delay buffer) that temporarily stores the multi-valued image data for the line width detection setting range and the threshold value are used to sequentially convert the multi-valued image data in the multi-valued image delay buffer into binary values until it is determined. And a binarization unit for converting the binarization threshold value. It is characterized by performing binarization after.

【0005】[0005]

【作用】本発明においては、線幅算出部および閾値決定
部により、一回の画像走査によって文字認識のための最
適な線幅設定値に最も近い平均線幅となる二値化閾値を
決定した後、多値画像遅延バッファ内に一時格納されて
いた線幅検出設定範囲分の多値画像データに対し、二値
化部が二値化を行なう。これにより、一回の二値化処理
によって最適な二値化閾値による二値画像データが得ら
れるため、処理時間を短縮することが可能である。ま
た、二値化に要する処理時間が一定であるため、パイプ
ラインを利用して効率的なダイナミック二値化処理を行
なうことができる。さらに、一回の走査で二値化処理を
完了するため、原画像データを保持するための多値画像
走査用バッファは不要であり、その分、メモリ量を削減
できる。
In the present invention, the line width calculation unit and the threshold value determination unit determine the binarization threshold value which is the average line width closest to the optimum line width setting value for character recognition by one image scanning. After that, the binarization unit binarizes the multi-valued image data for the line width detection setting range temporarily stored in the multi-valued image delay buffer. As a result, the binary image data with the optimum binarization threshold value can be obtained by performing the binarization process once, so that the processing time can be shortened. Further, since the processing time required for binarization is constant, efficient dynamic binarization processing can be performed using a pipeline. Furthermore, since the binarization process is completed by one scan, a multi-value image scanning buffer for holding the original image data is unnecessary, and the amount of memory can be reduced accordingly.

【0006】[0006]

【実施例】以下、本発明の一実施例を図面により説明す
る。図1は、本発明の一実施例における二値化装置の構
成図である。図1において、11は、処理対象となる画
像データを多値データとして得るための多値画像データ
入力手段、17は、多値画像データ入力手段11から出
力される多値画像データから、3×3画素の形で走査順
に順次多値画像データを取り出す3×3画素抽出部、1
8は、3×3画素抽出部17から出力される3×3画素
多値画像データを用いて、複数の閾値に対するそれぞれ
の平均線幅を同時に計算する線幅算出部、13は、線幅
算出部18より得られた複数の平均線幅情報から、設定
線幅に最も近いものを一つ選択し、その線幅に対応する
二値化閾値を出力する閾値決定部、19は、閾値決定部
13により二値化閾値が決定されるまで、線幅検出設定
範囲分の多値画像データを保存しておくための多値画像
遅延バッファ、14は、閾値決定部13により得られた
二値化閾値を用いて、多値画像遅延バッファ19に格納
されている多値画像データを順次二値化する二値化部、
15は、二値化部14により二値化された画像データを
格納する二値画像メモリ、16は、二値画像メモリ15
内の二値化結果を入力として文字認識を行なう文字認識
部である。
DESCRIPTION OF THE PREFERRED EMBODIMENTS An embodiment of the present invention will be described below with reference to the drawings. FIG. 1 is a block diagram of a binarization device according to an embodiment of the present invention. In FIG. 1, 11 is multi-valued image data input means for obtaining image data to be processed as multi-valued data, and 17 is 3 × from multi-valued image data output from the multi-valued image data input means 11. A 3 × 3 pixel extraction unit that sequentially extracts multi-valued image data in the scanning order in the form of 3 pixels, 1
Reference numeral 8 denotes a line width calculation unit that simultaneously calculates respective average line widths for a plurality of thresholds by using the 3 × 3 pixel multi-valued image data output from the 3 × 3 pixel extraction unit 17, and 13 a line width calculation. A threshold determination unit that selects one closest to the set line width from the plurality of average line width information obtained from the unit 18 and outputs a binarization threshold corresponding to the line width, and 19 is a threshold determination unit. The multi-valued image delay buffer for storing multi-valued image data for the line width detection setting range until the binarization threshold is determined by 13, and 14 is the binarization obtained by the threshold value determination unit 13. A binarization unit that sequentially binarizes the multivalued image data stored in the multivalued image delay buffer 19 using a threshold value,
Reference numeral 15 is a binary image memory for storing the image data binarized by the binarization unit 14, and 16 is a binary image memory 15
It is a character recognition unit that performs character recognition by inputting the binarization result in the above.

【0007】次に、線幅算出部18について詳細に述べ
る。図3は、本発明の一実施例における線幅算出部の構
成図である。図1の線幅算出部18は、複数の閾値に対
して、3×3要素の中心多値画像データG0が黒点であ
るかを判断し、計数する黒点計数部181と、G0が輪
郭点であるかをG0の8近傍多値画像データG1〜G8
用いて判断し、計数する輪郭点計数部182と、複数の
閾値に対する黒点数および輪郭点数を同時に格納する黒
点・輪郭点数テーブル183と、各閾値に対する黒点・
輪郭点数とからそれぞれの平均線幅を計算する線幅計算
部184と、各閾値に対する平均線幅計算結果を格納す
る線幅情報テーブル185とから構成され、この線幅情
報テーブル内の各線幅情報データを線幅算出部の出力と
する。ここで、平均線幅Wは、黒点数をB、輪郭点数を
Eと記した時、 W=2×B/BE で計算される。なお、輪郭点計数部182における輪郭
点判定方法については、例えば、特公平3−7180号
公報に詳述されている。このような構成により、複数の
二値化閾値にて多値画像データを二値化した際の平均線
幅を同時に求めることができるので、二値化部14にお
ける一回の二値化処理により平均線幅が線幅設定値に最
も近くなる二値画像データを得て、後段の文字認識部1
6に送ることができる。
Next, the line width calculator 18 will be described in detail. FIG. 3 is a configuration diagram of the line width calculation unit in the embodiment of the present invention. The line width calculation unit 18 of FIG. 1 determines whether the central multi-valued image data G 0 of 3 × 3 elements is a black point for a plurality of threshold values and counts the black point counting unit 181 and G 0. A contour point counting unit 182 that determines whether a point is a point by using the 8-neighbor multi-valued image data G 1 to G 8 of G 0 , and a black point / contour that simultaneously stores the number of black points and the number of contour points for a plurality of thresholds The score table 183 and black dots for each threshold
The line width calculation unit 184 calculates the average line width from the number of contour points, and the line width information table 185 that stores the average line width calculation result for each threshold. Each line width information in this line width information table The data is the output of the line width calculator. Here, the average line width W is calculated by W = 2 × B / B E, where B is the number of black dots and B E is the number of contour points. The contour point determination method in the contour point counting unit 182 is described in detail in Japanese Patent Publication No. 3-7180, for example. With such a configuration, the average line width when binarizing the multi-valued image data with a plurality of binarizing thresholds can be obtained at the same time, so that the binarizing unit 14 performs the binarizing process once. After obtaining the binary image data in which the average line width is closest to the line width setting value, the character recognition unit 1 in the latter stage
Can be sent to 6.

【0008】[0008]

【発明の効果】本発明によれば、平均線幅が文字認識の
ために最適な線幅設定値に最も近くなるような二値化閾
値を決定する際、従来のような繰返し処理をする必要が
なく、一定の線幅検出設定範囲走査に要する時間の遅れ
の後に必ず二値化結果が得られるため、二値化処理時間
は一定となり、パイプラインを利用した効率的なダイナ
ミック二値化処理を行なうことができる。また、従来の
ように複数回走査を行なう必要はないので、多値画像デ
ータ保持用バッファ(多値画像走査用バッファ)は不要
であり、メモリ量の削減に役立つ。
According to the present invention, it is necessary to perform the conventional iterative processing when determining the binarization threshold value such that the average line width is closest to the optimum line width setting value for character recognition. Since there is no error, the binarization result is always obtained after the delay of the time required for constant line width detection setting range scanning, so the binarization processing time becomes constant, and efficient dynamic binarization processing using a pipeline is performed. Can be done. Further, since it is not necessary to perform scanning a plurality of times as in the conventional case, a multivalued image data holding buffer (multivalued image scanning buffer) is unnecessary, which is useful for reducing the memory amount.

【0009】[0009]

【図面の簡単な説明】[Brief description of drawings]

【図1】本発明の一実施例における二値化装置の構成図
である。
FIG. 1 is a configuration diagram of a binarizing device according to an embodiment of the present invention.

【図2】従来の二値化装置の構成例図である。FIG. 2 is a diagram showing a configuration example of a conventional binarization device.

【図3】本発明の一実施例における線幅算出部の構成図
である。
FIG. 3 is a configuration diagram of a line width calculation unit according to an embodiment of the present invention.

【符号の説明】[Explanation of symbols]

11 多値画像データ入力手段 13 閾値決定部 14 二値化部 15 二値画像メモリ 16 文字認識部 17 3×3画素抽出部 18 線幅算出部 19 多値画像遅延バッファ 21 多値画像データ入力手段 22 多値画像走査用バッファ 23 閾値決定部 24 二値化部 25 二値画像メモリ 26 文字認識部 27 3×3画素抽出部 28 線幅算出部 181 黒点計数部 182 輪郭点計数部 183 黒点・輪郭点数テーブル 184 線幅計算部 185 線幅情報テーブル 11 multi-valued image data input means 13 threshold value determination section 14 binarization section 15 binary image memory 16 character recognition section 17 3 × 3 pixel extraction section 18 line width calculation section 19 multi-valued image delay buffer 21 multi-valued image data input section 22 Multi-value image scanning buffer 23 Threshold value determination unit 24 Binarization unit 25 Binary image memory 26 Character recognition unit 27 3x3 pixel extraction unit 28 Line width calculation unit 181 Black dot counting unit 182 Contour point counting unit 183 Black dot / contour Point table 184 Line width calculator 185 Line width information table

Claims (1)

【特許請求の範囲】[Claims] 【請求項1】 多値画像データを入力する手段と、入力
された多値画像データの二値化結果を格納するメモリと
を備えた画像データの二値化装置において、多値画像デ
ータを複数の閾値にて二値化した際の該閾値に対する平
均線幅を同時に求める手段と、複数の閾値による線幅算
出結果から、所定の線幅設定値に最も近い線幅を一つ選
択し、選択した線幅に対応する二値化閾値を出力する手
段と、該閾値が決定されるまで線幅検出設定範囲分の多
値画像データを一時格納するメモリと、該閾値を用い、
該メモリ内の多値画像データを順次二値化する手段とを
備えたことを特徴とする画像データの二値化装置。
1. An image data binarization apparatus comprising: means for inputting multivalued image data; and a memory for storing a binarization result of the input multivalued image data. Means for simultaneously obtaining the average line width for the threshold value when binarizing with the threshold value and the line width calculation result by the plurality of threshold values, one line width closest to the predetermined line width setting value is selected and selected. Means for outputting a binarization threshold value corresponding to the line width, a memory for temporarily storing multi-valued image data for a line width detection setting range until the threshold value is determined, and the threshold value,
A binarizing device for image data, comprising means for sequentially binarizing multi-valued image data in the memory.
JP4075054A 1992-03-31 1992-03-31 Binary coding device for image data Pending JPH05282494A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP4075054A JPH05282494A (en) 1992-03-31 1992-03-31 Binary coding device for image data

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP4075054A JPH05282494A (en) 1992-03-31 1992-03-31 Binary coding device for image data

Publications (1)

Publication Number Publication Date
JPH05282494A true JPH05282494A (en) 1993-10-29

Family

ID=13565109

Family Applications (1)

Application Number Title Priority Date Filing Date
JP4075054A Pending JPH05282494A (en) 1992-03-31 1992-03-31 Binary coding device for image data

Country Status (1)

Country Link
JP (1) JPH05282494A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2016533504A (en) * 2013-09-18 2016-10-27 ティッセンクルップ スチール ヨーロッパ アーゲーThyssenkrupp Steel Europe Ag Method and apparatus for measuring wear characteristics of galvanyl flat steel products

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2016533504A (en) * 2013-09-18 2016-10-27 ティッセンクルップ スチール ヨーロッパ アーゲーThyssenkrupp Steel Europe Ag Method and apparatus for measuring wear characteristics of galvanyl flat steel products

Similar Documents

Publication Publication Date Title
JPH06333035A (en) Image processor
JPH09270917A (en) Image processor
JPH10271326A (en) Image processor
JPH05282494A (en) Binary coding device for image data
JP2874592B2 (en) Image processing device for distinguishing text / photo / dot area
JPS62120585A (en) Bar code recognizing device by image scanner
JPH06284281A (en) Picture processor
JP2853140B2 (en) Image area identification device
JP3791129B2 (en) Image identification device
JP2000011088A (en) Method for exracting feature information of read image, image processor and mail address reader
JP4469511B2 (en) Image processing method
JP3584478B2 (en) Binarization processing apparatus and binarization processing method
JP2702593B2 (en) Isolated point removal device
JPH08221515A (en) Image processor
JPH0465969A (en) Image processor
JPH06152947A (en) Image processor
JP3344115B2 (en) Image processing device
JP2659377B2 (en) Character recognition method
JPH08307675A (en) Image processing unit
JPH08172522A (en) Resolution switching device
JPH08340446A (en) Device and method for image processing
JPH06131498A (en) Total decision system for recognition result of optical type character recognizing device
JPH02254574A (en) Noise removing system
JPH10143608A (en) Character reading device
JPH0722329B2 (en) Image processing method and image processing circuit