JPS6037952B2

JPS6037952B2 - Optimal binarization method

Info

Publication number: JPS6037952B2
Application number: JP53105269A
Authority: JP
Inventors: 孝弥藤田; 清徳宮田; 雅司丹羽; 哲次森下; 秀明菅原
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 1978-08-29
Filing date: 1978-08-29
Publication date: 1985-08-29
Also published as: JPS5532170A

Description

【発明の詳細な説明】本発明は、図形認識率が最高となるように多値化ビデオ
情報を２値化ビデオ情報に変換する最適二値化方式に関
するものである。DETAILED DESCRIPTION OF THE INVENTION The present invention relates to an optimal binarization method for converting multi-value video information into binary video information so as to maximize the figure recognition rate.

ＯＣＲ文字認識装置において、用紙上に手書あるいは印
刷された文字は光学的に走査され、多値の濃淡レベルの
ビデオ信号として変換され、認識のための文字パターン
を得ている。In an OCR character recognition device, characters handwritten or printed on a sheet of paper are optically scanned and converted into a multivalued gray level video signal to obtain a character pattern for recognition.

しかし乍ら、認識回路の内部では取扱いの容易さ、装置
の簡単化のために、白黒二値しベルの文字パターンで扱
っている場合が殆んどである。その場合に、二値化する
ための閥値（スライスレベル）をどのように選ぶかによ
って文字のつぶれ、かすれ等の線幅変動を惹起すること
になる。このように、二値化する場合の関値の取り方は
、認識によっては重要なパラメータとなる。本発明は、
上記の考察にもとづくものであって、多値のビデオ情報
を最適な二値ビデオ情報を変化できる構造の簡単な最適
二値化方式を提供することを目的とするものである。However, inside the recognition circuit, for ease of handling and simplification of the device, in most cases a black and white binary character pattern is used. In this case, line width variations such as blurring and blurring of characters may occur depending on how the threshold value (slice level) for binarization is selected. In this way, how to take the function value in the case of binarization becomes an important parameter depending on the recognition. The present invention
This invention is based on the above considerations, and the object is to provide an optimal binarization method with a simple structure that can change multi-value video information into optimal binary video information.

そしてそのため、本発明の最適二値化方式は、多値のデ
ータ信号が格納されるビデオ・バッファと、上記ビデオ
・バッファから謙出されたビデオ信号を二値化する可変
スライスレベルのスライス回路と、上記多値ビデオ情報
を異なるスライス・レベルでスライスして二値化ビデオ
信号に変換する手段と、上記の異なるスライスレベルで
スライスして作成された複数の二値化ビデオ信号のそれ
ぞれについて黒Ｖ点数を周囲数で除した線幅指示率を求
める手段と、当該複数の線幅指示率と基準の線幅指示率
とに基づいて上記可変スライスレベルのスライス回路の
スライスレベルを設定する設定信号を送出する送出手段
とを有することを特徴とするものである。以下本発明を
図面を参照しつつ説明する。第１図は光電変換された文
字ビデオ信号の一走査線分を示すものであり、第２図は
スライスしべルを変えた場合の「島」という印刷文字の
二値化パターンである。第２図イはスライスレベル■で
二億化した場合の文字パターンでいり、第２図口はスラ
イスレベル■で二値化した場合の文字パターンであり、
第２図ハはスライスレベル■で二値化した場合の文字パ
ターンである。第２図からスライスレベルを変化される
ことによって、文字パターンがつぶされたり、かすれた
りする様子がわかる。ところで、このような二値化を行
う場合、スライスレベルによって文字パターンが変化す
るために、どのスライスで二値化したらよいかというこ
とが問題となるが、これは認識辞書がどのようにして作
られたかが問題となる。Therefore, the optimal binarization method of the present invention includes a video buffer in which a multilevel data signal is stored, and a slice circuit with a variable slice level that binarizes the video signal extracted from the video buffer. , a means for slicing the multilevel video information at different slice levels and converting it into a binary video signal, and a black V for each of the plurality of binary video signals created by slicing the multilevel video information at different slice levels. means for determining a line width instruction rate by dividing the number of points by the surrounding number; and a setting signal for setting the slice level of the variable slice level slice circuit based on the plurality of line width instruction rates and the reference line width instruction rate. The device is characterized in that it has a sending means for sending out the information. The present invention will be explained below with reference to the drawings. FIG. 1 shows one scanning line segment of a photoelectrically converted character video signal, and FIG. 2 shows a binarized pattern of a printed character "Island" when the slice label is changed. Figure 2 A is the character pattern when it is converted into 200 million slices at the slice level ■, and the opening in Figure 2 is the character pattern when it is binarized at the slice level ■.
FIG. 2C shows a character pattern when binarized at slice level ■. From FIG. 2, it can be seen that the character pattern becomes crushed or blurred by changing the slice level. By the way, when performing such binarization, the character pattern changes depending on the slice level, so the problem is which slice should be used for binarization, but this depends on how the recognition dictionary is created. The question is whether it was done.

認識回路としては、つぶれた文字でも、かすれた文字で
も認識できなければならないが、第２図口のように二値
化パターンで辞書を作成しておいた方が、第２図イ，口
のような文字パターンに対しても認識率が高いが第２図
イのようなパターンで辞書を作成しておいた場合には、
第２図イのようなかすれた文字パターンについては認識
率は高いが、第２図ハのようなつぶれた文字パターンに
ついては認識率が低くなるようなことは当然である。そ
こで、辞書パターンが第２図口のような糠幅のもので作
られていた場合には、謙取るべき入力パターンは第２図
口のような線幅によるように二値化されることが望まし
く、このような二値化が最適二値化方式となる。そこで
、このような最適二値化ビデオ情報を得るためのスライ
スレベルを決定するために、本発明では次のような値Ｗ
を基準としている。The recognition circuit must be able to recognize characters that are blurred or blurred, but it is better to create a dictionary with a binary pattern as shown in Figure 2 (A). Although the recognition rate is high even for character patterns such as
It is natural that the recognition rate is high for a blurred character pattern as shown in FIG. 2A, but the recognition rate is low for a blurred character pattern as shown in FIG. 2C. Therefore, if the dictionary pattern is made with a line width like the one shown in Figure 2, the input pattern to be taken can be binarized as in the line width shown in Figure 2. Desirably, such binarization is an optimal binarization method. Therefore, in order to determine the slice level for obtaining such optimal binarized video information, the present invention uses the following value W.
is based on.

Ｗ＝薄墨鰯こ）で、黒点数というのは二値ビデオの文字部分の黒メ
ッシュの数であり、周囲数とは黒（文字部分）と白（紙
面）との境界の数である。The number of black dots is the number of black meshes in the text part of the binary video, and the number of surroundings is the number of boundaries between black (text part) and white (paper surface).

この周囲数は、二値化のスライスレベルを変えて線幅に
変化をもためても、完全に文字部分が欠けて失われるよ
うなかすれや隣りの文字同志がくっつくようなつぶれが
生じない限り、値の変化は少なく、黒点数の変化に比べ
ると、１′１の星度である。このように、値Ｗの分母の
周囲数はスライスレベルによって、ほとんど変化しない
ため、値Ｗは線幅を略ぼ示しているものと考えても良い
。Ｗを以下、線幅指示率という。第３図はスライスレベ
ルによって値Ｗかどのように変化するかを示すものであ
る。Even if you change the slice level of the binarization and try to change the line width, this number is as long as it does not cause blurring, where the character part is completely lost, or collapse, where adjacent characters stick together. , the change in value is small, and compared to the change in the number of sunspots, it is 1'1 star degree. In this way, since the number around the denominator of the value W hardly changes depending on the slice level, the value W can be considered to approximately represent the line width. Hereinafter, W will be referred to as the line width instruction rate. FIG. 3 shows how the value W changes depending on the slice level.

第３図は、印刷された文字を数行分スライスレベル■，
■，■，■で二値化した時のＷの値を各スライスレベル
毎にプロツトしたものがあって、スライスレベルと線幅
指示率Ｗがかなりはっきりした相関をもっていることが
判り、また、文字によっても行によってもあまり変化が
ないことが判る。即ち、スライスレベルによって線幅が
はっきり変化していることが判る。そこで、このような
複数のスライスレベルの内どのレベルで二値化したらよ
いかは、辞書パターンがとのような線幅のパターンを中
心として作られているかに依存するものと考えられる。Figure 3 shows the slice level of several lines of printed characters.
There is a plot of the value of W when binarized with ■, ■, ■ for each slice level, and it is found that there is a very clear correlation between the slice level and the line width indication rate W, and also the character It can be seen that there is not much change depending on the row or row. That is, it can be seen that the line width clearly changes depending on the slice level. Therefore, it is considered that which of the plurality of slice levels should be binarized depends on whether the dictionary pattern is created centering on a pattern with a line width such as .

この様子を示すのが第４図である。第４図では辞書パタ
−ンの線幅指示率Ｗの値は、１．５としている。この第
４図から明らかなようにスライスレベルによって認識率
がかわっており、こ）ではスライスレベル■が最も認識
率が高くなっている。第５図は本発明の１実施例のブロ
ック図であり、１はビデオ・バツフア、２一０なし、し
２一３はスライス回路、３−０なし、し３−３は黒点計
数回路、４一０なし、し４一３は周囲点計数回路、５一
０ないし５一３は除算回路、６は比較回路、７はスライ
ス回路をそれぞれ示している。FIG. 4 shows this situation. In FIG. 4, the value of the line width instruction rate W of the dictionary pattern is set to 1.5. As is clear from FIG. 4, the recognition rate changes depending on the slice level, and in this case, slice level (3) has the highest recognition rate. FIG. 5 is a block diagram of one embodiment of the present invention, in which 1 is a video buffer, 2-3 is a slice circuit, 3-0 is not present, 3-3 is a sunspot counting circuit, 4 10 is blank, 4-3 is a peripheral point counting circuit, 5-10 to 5-3 are division circuits, 6 is a comparison circuit, and 7 is a slice circuit, respectively.

スライス回路２−０なし、し２−３は、それぞれ異なる
スライスレベルを有しているものであり、また、スライ
ス回路７のスライスレベルは変更することが出来るもの
である。基準線幅指示率とは、例えば辞書パターンの黒
Ｖ点数／周囲数である。観測部（図示せず）から送られ
て釆る多値のビデオ信号は、ビデオ・バッファ１に格納
されると共に、スライス回路２−０ないし２−３へ送ら
れる。スライス回路２−０に送られた多値化ビデオ信号
は、所定のレベルで二値化され、この二値化ビデオ信号
の黒Ｖ点数は黒Ｖ点計数回路３一０で計数され、周囲点
数は周囲点計数回路４−０だ計数される。除算回路５一
川ま、入力された黒点数を周囲点数で除し、線幅指示率
ＷＩを得る。同様にして、スライス回路２−１なし、し
２一３によって二値化された各ビデオ情報について線幅
指示率Ｗ２，Ｗ３，Ｗ４がそれぞれ求められる。比較回
路６は基準の線幅指示率Ｗと綾幅指示率Ｗ１，Ｗ２，Ｗ
３，Ｗ４を比較し、Ｗ１，Ｗ２，Ｗ３，Ｗ４の中からＷ
に最も近いＷｍを選択し、その値Ｗｍに対応するスライ
ス・レベル値をスライス回路７へ送る。スライス回路７
のスライスレベルは、比較回路６から送られて来たスラ
イスレベルに設定され、このスライスレベルによってビ
デオ・バッファーから謙出された多値化ビデオ信号が二
値化される。第５図において、ビデオ・バッファは一文
字分でも１行分でもよく、１行単位でＷの値を求めても
、第３図に示すように行毎のＷ値の部分が略ぼ１ケ所と
かたまっているので、辞書の線幅指示率Ｗと完全に同一
とならないが、ほぼ同じ値の線幅指示率のＷになるよう
な最適二値化ビデオが得られることが判る。また、基準
線幅指示率Ｗに対応するスライス・レベル値を補間法に
よって求めることも出来る。The slice circuits 2-0 and 2-3 have different slice levels, and the slice level of the slice circuit 7 can be changed. The reference line width instruction rate is, for example, the number of black V points/number of surroundings of a dictionary pattern. A multi-level video signal sent from an observation section (not shown) is stored in a video buffer 1 and is also sent to slice circuits 2-0 to 2-3. The multilevel video signal sent to the slice circuit 2-0 is binarized at a predetermined level, and the number of black V points of this binarized video signal is counted by the black V point counting circuit 3-0, and the number of surrounding points is counted. are counted by the surrounding point counting circuit 4-0. The division circuit 5 divides the input number of black dots by the number of surrounding points to obtain a line width indication rate WI. Similarly, the line width instruction rates W2, W3, and W4 are obtained for each video information binarized by the slice circuit 2-1 and the slice circuit 2-3. The comparison circuit 6 compares the reference line width instruction rate W and the twill width instruction rate W1, W2, W.
3. Compare W4 and select W from W1, W2, W3, W4.
Wm closest to Wm is selected, and the slice level value corresponding to that value Wm is sent to the slice circuit 7. Slice circuit 7
The slice level is set to the slice level sent from the comparator circuit 6, and the multilevel video signal extracted from the video buffer is binarized using this slice level. In Figure 5, the video buffer may be for one character or one line, and even if the value of W is determined for each line, the W value for each line is approximately at one location, as shown in Figure 3. It can be seen that an optimal binarized video can be obtained in which the line width indication rate W is not completely the same as the line width indication rate W of the dictionary because of the clustering, but the line width indication rate W is approximately the same value. Furthermore, the slice level value corresponding to the reference line width instruction rate W can also be determined by interpolation.

[Brief explanation of the drawing]

第１図は光電変換された文字ビデオ信号の１走査分の波
形を示す図、第２図イ，口，．ハはそれぞれ異なるスラ
イスレベルで二値化したときの文字パターンを示す図、
第３図はスライスレベルによって黒Ｖ点数／周囲数がど
のように変化するかを示す図、第４図は認識率とスライ
スレベルと黒点数／周囲数との関係を示す図、第５図は
本発明の１実施例のブロック図である。１……ビデオ・バツフア、２一０なし、し２一３……ス
ライス回路、３−０なし、し３一３・・・・・・黒点数
回路、４一０ないし４−３……周囲点計数回路、５一０
ないし５−３・・・・・・除算回路、７・・…・スライ
ス回路。才１図マ２図ブ３脚ガム凶ナＳ脚1 is a diagram showing the waveform of one scan of a photoelectrically converted character video signal, and FIG. C is a diagram showing character patterns when binarized at different slice levels,
Figure 3 is a diagram showing how the number of black V points/number of surroundings changes depending on the slice level, Figure 4 is a diagram showing the relationship between recognition rate, slice level, and number of black points/number of surroundings, and Figure 5 is a diagram showing how the number of black V points/number of surroundings changes depending on the slice level. 1 is a block diagram of one embodiment of the present invention. FIG. 1...Video buffer, 2-0 absent, 2-3...slice circuit, 3-0 absent, 3-3...sunspot number circuit, 4-10 or 4-3...surroundings Point counting circuit, 5-0
Or 5-3... Division circuit, 7... Slice circuit. 1 figure, 2 figures, 3 legs, 1 figure, 2 figures, 3 legs, S legs

Claims

[Claims]

1 a video buffer in which a multilevel video signal is stored;
a variable slice level slicing circuit for binarizing the video signal read from the video buffer; a means for slicing the multi-level video information at different slice levels and converting it into a binarized video signal; Means for determining a line width indication rate by dividing the number of black points by the number of surroundings for each of a plurality of binary video signals created by slicing at the slice level, and the plurality of line width indication rates and a reference line width indication rate. and means for transmitting a setting signal for setting the slice level of the slice circuit of the variable slice level based on the above.