JPH0322094A - Character recognizing system - Google Patents

Character recognizing system

Info

Publication number
JPH0322094A
JPH0322094A JP1156440A JP15644089A JPH0322094A JP H0322094 A JPH0322094 A JP H0322094A JP 1156440 A JP1156440 A JP 1156440A JP 15644089 A JP15644089 A JP 15644089A JP H0322094 A JPH0322094 A JP H0322094A
Authority
JP
Japan
Prior art keywords
character
feature
recognition
dictionary
characters
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP1156440A
Other languages
Japanese (ja)
Inventor
Takafumi Enami
隆文 枝並
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fujitsu Ltd
Original Assignee
Fujitsu Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fujitsu Ltd filed Critical Fujitsu Ltd
Priority to JP1156440A priority Critical patent/JPH0322094A/en
Publication of JPH0322094A publication Critical patent/JPH0322094A/en
Pending legal-status Critical Current

Links

Landscapes

  • Character Discrimination (AREA)

Abstract

PURPOSE:To obtain a stable character recognition rate by performing the feature quantity adapting processing where the size of each divided area of a recognition object character is changed for each of prescribed comparison object characters to minimize the difference among feature quantities and using the difference among feature quantities for the discrimination processing. CONSTITUTION:By the control of a feature quantity adapting means 17, the size of each divided area of the recognition object character is changed for prescribed comparison object characters to minimize differences among feature quantities. A dictionary retrieving means 15 compares minimized differences among feature quantities of respective prescribed comparison object characters to output the comparison object character, which corresponds to the minimum difference between feature quantities, as the recognition result. The size of each divided area of the recognition object character is changed to approximate this character to the comparison object character stored in a dictionary 13 in this manner. Thus, deviation and deformation of the recognition object character are easily absorbed when being minute for comparison object characters, and the stable character recognition rate is obtained.

Description

【発明の詳細な説明】 〔概 要〕 認識対象文字の特徴量と辞書に格納されている比較対象
文字の特徴量とを比較して文字の認識を行う文字認識方
式に関し、 認識対象文字の位置ずれや変形が微小である場合にそれ
を吸収し、安定した文字認識率を得ることを目的とし、 認識対象文字のイメージ情報を複数の領域に分割し、各
分割領域ごとに抽出された特徴量から認識対象文字の特
徴量を得る特徴量抽出手段と、比較対象文字の特徴量が
格納される辞書と、認識対象文字の特徴量と比較対象文
字の特徴量とを比較し、特徴量間の相違が最小となる比
較対象文字を認識結果として出力する辞書検索手段とを
備えた文字認識方式において、所定の比較対象文字ごと
に、認識対象文字の各分割領域の大きさを変更して特r
ll量間の相違を最小とする特徴量適応化処理を行い、
そのときの各特ttl.m間の相違を識別処理に供する
特徴量適応化手段を備え構戒する二〔産業上の利用分野
〕 本発明は、イメージスキャナその他の光学的読み取り装
置で読み取った認識対象文字のイメージ情報から得られ
る特徴量と、辞書に格納されている比較対象文字の特徴
量とを比較して文字の!!識を行う文字認識方式に関す
る。
[Detailed Description of the Invention] [Summary] Regarding a character recognition method that recognizes a character by comparing the feature amount of the recognition target character with the feature amount of a comparison target character stored in a dictionary, the position of the recognition target character is In order to absorb minute deviations and deformations and obtain a stable character recognition rate, the image information of the character to be recognized is divided into multiple regions, and the feature values extracted for each divided region are A feature extraction means that obtains the features of the character to be recognized from a dictionary that stores the features of the character to be compared; and a dictionary that stores the features of the character to be compared; In a character recognition method, the size of each divided area of the recognition target character is changed for each predetermined comparison target character, and the size of each divided area of the recognition target character is changed for each predetermined comparison target character.
Perform feature amount adaptation processing to minimize the difference between ll amounts,
Each special ttl. [Industrial Application Field] The present invention provides a feature adaptation means for subjecting the differences between characters m to identification processing. [Industrial Application Field] Compare the features of the characters stored in the dictionary with the features of the comparison target characters stored in the dictionary. ! This paper relates to a character recognition method for character recognition.

〔従来の技術〕[Conventional technology]

イメージ情報として入力される文字についての従来の文
字認識方式では、まず文書全体のイメージ情報から所定
の切り出し処理により個々の文字に対応するイメージ情
報(文字イメージ)を得る(第5図(l))。
In the conventional character recognition method for characters input as image information, image information (character images) corresponding to individual characters is first obtained from the image information of the entire document through a predetermined cutting process (Fig. 5 (l)). .

続いて、個々の文字イメージの濃淡、輪郭線などから数
値列として表される特f!litを抽出するが、従来方
式では文字認識率を高めるために、まず外郭を含む矩形
領域を8×8の小領域に分割し、各分割領域ごとの特f
tklを抽出する(第5図(2))。
Next, the special f! is expressed as a numerical string from the shading and outline of each character image. However, in the conventional method, in order to increase the character recognition rate, the rectangular area including the outline is first divided into 8 x 8 small areas, and the characteristic f of each divided area is extracted.
tkl is extracted (Figure 5 (2)).

この特徴量は、例えば文字イメージの輪郭線上の各輪郭
点の接続状態に応じて4方向の方向コード(特徴素!)
を与え(第5図(3))、各方向コードについてその個
数(頻度)を積算したものであり、各分割領域ごとにそ
の特徴量は(A o,A + , A !+ A 3)
として抽出される。なお、第5図(2)に示す小領域内
の特徴量は、(1,4,O,O)である。
This feature amount is, for example, a direction code (feature element!) in four directions depending on the connection state of each contour point on the contour line of a character image.
(Figure 5 (3)), and the number (frequency) of each direction code is integrated, and the feature amount for each divided area is (A o, A + , A ! + A 3)
is extracted as Note that the feature amount in the small area shown in FIG. 5(2) is (1, 4, O, O).

同様にして、64の各分割領域からそれぞれ特徴量を抽
出し、文字イメージの特徴量(4X8X8−256次元
ベクトル)が得られる。
Similarly, the feature amounts are extracted from each of the 64 divided regions, and the feature amounts (4×8×8-256 dimensional vector) of the character image are obtained.

このようにして抽出された文字イメージの特徴量は、辞
書検索部において、辞書に格納されている比較対象文字
の特徴量との比較が行われ、特徴量間の相違(距離)が
最小のものを認識文字とし、その文字コードを出力して
文字認識処理を終了する. 〔発明が解決しようとする課題〕 このような従来方式は、文字イメージを固定の分割領域
からそれぞれの特徴量を抽出し、各特徴素量ごとに辞書
の対応する特徴量と比較してその相違を積算し、特徴量
間の相違を求める構成である。したがって、認識対象文
字のフォントが各分割領域単位で比較対象文字のフォン
トとわずかに異なっても、それが積算されて大きな値に
なることがあった。
The feature quantities of the character images extracted in this way are compared with the feature quantities of the comparison target characters stored in the dictionary in the dictionary search section, and the one with the smallest difference (distance) between the feature quantities is is the recognized character, outputs its character code, and terminates the character recognition process. [Problems to be Solved by the Invention] Such a conventional method extracts each feature from a fixed divided region of a character image, and compares each feature with the corresponding feature in a dictionary to determine the difference. This is a configuration that calculates the difference between feature quantities by integrating the values. Therefore, even if the font of the recognition target character differs slightly from the font of the comparison target character in each divided area, the difference may be accumulated to a large value.

すなわち、認識対象文字のフォント(第5図(4).(
5))が辞書に格納されている比較対象文字のフォント
(第5図(1). (2))と異なる場合や、文字位置
に微小なずれ(分割領域のずれ)があったり、その読み
取り条件(1度、かすれ、その他)の変化で文字に変形
が生じている場合には、そのフォント相違に起因する特
徴量の微変動を吸収することができず、認識率の低下を
引き起こしていた。
In other words, the font of the character to be recognized (Figure 5 (4).
5)) is different from the font of the comparison target characters stored in the dictionary (Fig. 5 (1). (2)), or there is a slight shift in the character position (shift in the divided area), or the reading When characters are deformed due to changes in conditions (one degree, blurred, etc.), it is not possible to absorb slight fluctuations in the feature amount due to the difference in font, causing a decrease in recognition rate. .

本発明は、文字認識率がフォント変動に大きく依存して
いることに鑑み、認識対象文字の位置ずれや変形が微小
である場合にはそれを吸収し、安定した文字認識率を得
ることができる文字認識方式を提供することを目的とす
る。
In view of the fact that the character recognition rate is highly dependent on font fluctuations, the present invention can absorb minute positional deviations and deformations of characters to be recognized and obtain a stable character recognition rate. The purpose is to provide a character recognition method.

〔課題を解決するための手段〕[Means to solve the problem]

第1図は、本発明の原理ブロック図である。 FIG. 1 is a block diagram of the principle of the present invention.

図において、特徴量抽出手段11は、認識対象文字のイ
メージ情報を複数の領域に分割し、各分割領域ごとに抽
出された特徴量から認識対象文字の特徴量を得る。
In the figure, a feature amount extracting means 11 divides image information of a character to be recognized into a plurality of regions, and obtains a feature amount of the character to be recognized from the feature amount extracted for each divided region.

辞書l3は、比較対象文字の特徴量が格納される。The dictionary l3 stores feature amounts of characters to be compared.

辞書検索手段15は、認識対象文字の特徴量と比較対象
文字の特徴量とを比較し、特徴量間の相違が最小となる
比較対象文字を認識結果として出力する。
The dictionary search means 15 compares the feature amount of the recognition target character with the feature amount of the comparison target character, and outputs the comparison target character with the smallest difference between the feature values as a recognition result.

特徴量適応化手段17は、所定の比較対象文字ごとに、
認識対象文字の各分割領域の大きさを変更して特徴量間
の相違を最小とする特徴量適応化処理を行い、そのとき
の各特徴量間の相違を識別処理に供する制御を行う。
The feature amount adaptation means 17 performs the following for each predetermined comparison target character:
A feature amount adaptation process is performed to minimize the difference between the feature amounts by changing the size of each divided area of the character to be recognized, and control is performed to subject the difference between the feature amounts to the identification process.

〔作 用〕[For production]

本発明は、特徴量適応化千段17の制御により、所定の
比較対象文字に対して、それぞれ特徴量間の相違が最小
になるように認識対象文字の各分割領域の大きさを変更
する。辞書検索手段15では、所定の比較対象文字ごと
にそれぞれ最小となった特徴量間の相違を比較し、その
中で最小の特徴量間の相違に対応する比較対象文字を認
識結果として出力する。
According to the present invention, the size of each divided region of a character to be recognized is changed by controlling the feature value adaptation stage 17 so that the difference between the respective feature amounts is minimized for a predetermined character to be compared. The dictionary search means 15 compares the minimum difference between the feature amounts for each predetermined comparison target character, and outputs the comparison target character corresponding to the minimum difference between the feature amounts as a recognition result.

すなわち、本発明方式は、当初の分割領域においてある
比較対象文字との特徴量間の相違が最小になっても、特
@.量適応化処理により他の比較対象文字に対する特徴
量間の相違を最小にすることができた場合には、その比
較対象文字を認識結果とする方式である。
That is, in the method of the present invention, even if the difference between the feature amounts with a certain comparison target character in the initial divided area is minimized, the special @. In this method, if the difference between the feature amounts for other comparison target characters can be minimized through the amount adaptation process, that comparison target character is used as the recognition result.

このように、認識対象文字の各分割領域の大きさを変更
し、辞書l3に格納されている比較対象文字に近似させ
ることができるので、認識対象文字のずれや変形が比較
対象文字に対して微小なものであればその吸収は容易で
あり、分割領域が固定であるときに比べて安定した文字
認識率を得ることが可能になる。
In this way, the size of each divided area of the recognition target character can be changed to approximate the comparison target character stored in the dictionary l3, so that the displacement or deformation of the recognition target character with respect to the comparison target character can be avoided. If it is minute, it is easily absorbed, and it becomes possible to obtain a more stable character recognition rate than when the divided regions are fixed.

〔実施例〕〔Example〕

以下、図面に基づいて本発明の実施例について詳細に説
明する。
Hereinafter, embodiments of the present invention will be described in detail based on the drawings.

第2図は、本発明方式を実現する文字認識装置の実施例
構或を示すブロック図である。
FIG. 2 is a block diagram showing an embodiment of a character recognition device implementing the method of the present invention.

なお、認識対象文字の特tUtの抽出処理、および辞書
検索結果により特徴量間の相違から文字認識を行い、対
応する文字コードを出力する処理については従来方弐と
同様とする。
Note that the process of extracting the characteristic tUt of the character to be recognized, the process of character recognition based on the difference between feature amounts based on the dictionary search results, and the process of outputting the corresponding character code are the same as in the conventional method 2.

すなわち、文字切り出し部21では、光学的読み取り装
置から人力される認識対象文字のイメージ情報に対して
、所定の切り出し処理を行い個々の文字イメージに変換
する。特徴量抽出部23では、個々の文字イメージを小
領域に分δリし、各分割領域ごとにその特徴量を抽出し
、一つの文字イメージから例えば256次元ベクトルの
特徴量を取り出す。辞書25には、比較対象文字の特徴
量およびその文字コードが格納される. 辞書検索部27では、辞書25に格納されている比較対
象文字の特徴量を検索し、文字イメージの特1vilと
各特徴素量ごとに比較して特徴量間の相違が最小のもの
を割り出し、対応する文字コードを認識結果として出力
する。出力部29では、辞書検索部27から出力される
文字コードを所定の出力媒体に対応するコードに変換し
て出力する。
That is, the character cutout section 21 performs a predetermined cutout process on image information of characters to be recognized manually inputted from an optical reading device, and converts the image information into individual character images. The feature amount extraction unit 23 divides each character image into small regions δ, extracts the feature amount for each divided region, and extracts, for example, a 256-dimensional vector feature amount from one character image. The dictionary 25 stores feature amounts of characters to be compared and their character codes. The dictionary search unit 27 searches the feature quantities of the characters to be compared stored in the dictionary 25, compares each feature quantity with the characteristic of the character image, and determines the one with the smallest difference between the feature quantities. Output the corresponding character code as the recognition result. The output unit 29 converts the character code output from the dictionary search unit 27 into a code corresponding to a predetermined output medium and outputs the code.

本発明は、認識対象文字の特徴量と比較対象文字の特徴
量との比較において、各分割領域の大きさを可変にする
ことにより、文字の読み込み時に発生する位置ずれやフ
ォントの違いに起因する特徴量の変動を吸収し、文字認
識率の向上を図るものである。
The present invention makes the size of each divided area variable when comparing the feature values of the character to be recognized with the feature values of the character to be compared. This is intended to absorb variations in feature amounts and improve character recognition rates.

第3図は、特徴量抽出部および辞書検索部における本発
明の実施例処理手順を示すフローチャートである。
FIG. 3 is a flowchart showing the processing procedure of the embodiment of the present invention in the feature quantity extraction section and the dictionary search section.

認識対象文字の文字イメージが人力される特徴量抽出部
23では、各分割領域の特徴量を抽出しする。辞書検索
部25では、この特徴量を用いて大分類処理を行う。こ
の大分類処理では、比較対象文字の特atが格納されて
いる辞書を検索するときに、ある程度粗い特1filを
用いて計算量を軽減し、10〜20文字程度の候補文字
を選別する。この大分類処理は、次の詳細識別処理にお
いて文字候補数を少なくし、文字認識のための処理量を
軽減させることを目的としている。
A feature amount extraction unit 23, into which a character image of a recognition target character is manually input, extracts a feature amount of each divided region. The dictionary search unit 25 performs major classification processing using this feature amount. In this major classification process, when searching a dictionary in which the special at of the character to be compared is stored, a somewhat rough special 1fil is used to reduce the amount of calculation, and about 10 to 20 candidate characters are selected. The purpose of this major classification process is to reduce the number of character candidates in the next detailed identification process and reduce the amount of processing required for character recognition.

特徴量抽出部23および辞書検索部25では、左右方向
特徴量適応化処理、上下方向特徴量適応化処理および再
識別処理による詳細識別処理を行う.すなわち、分割領
域を仕切る分割線を左右方向および上下方向に移動させ
、その都度認識対象文字の特@.量と候補文字(比較対
象文字)の特徴量との比較を行い、最も近似できる分割
線の位置を決定し、候補文字ごとにそれぞれ最小となる
特f!I[1間の相違(距離)を求める。
The feature extraction unit 23 and the dictionary search unit 25 perform detailed identification processing using horizontal feature adaptation processing, vertical feature adaptation processing, and re-identification processing. That is, by moving the dividing line that separates the divided areas horizontally and vertically, the special @. The quantity is compared with the feature quantity of the candidate character (comparison target character), the position of the dividing line that can be most approximated is determined, and the minimum characteristic f! is determined for each candidate character. Find the difference (distance) between I[1.

ここで、第4図を用いて、左右方向特徴量適応化処理の
具体的手法について説明する。
Here, a specific method of the horizontal direction feature amount adaptation processing will be explained using FIG. 4.

文字イメージの左右方向(X方向)の長さをM(ドット
)、上下方向(y方向)の長さをN(ドット)、各方向
の分割数をともに8とすると、特徴素量XO(κ.y)
、特徴量X(t+j+k) 、辞書の特徴量X D (
i,j,k)はそれぞれXO(x,y)  (x=0.
1,−,M−1、y=0.1,・・・,N−1)X (
i,j,k)  (i=o,l,・・・,7、j=0.
1,・・・.7、k=0.1,2.3 } X D (i,Lk) (i=o,l,・・・.7、j
=0.1,・・・,7、k=0.1,2.3 ) となる。ここで、k (=0.1,2.3)は、方向コ
ードに対応する特徴素IXo(x,y)の取り得る値で
ある。
Assuming that the length of the character image in the horizontal direction (X direction) is M (dots), the length in the vertical direction (y direction) is N (dots), and the number of divisions in each direction is 8, the feature quantity XO (κ .y)
, feature amount X(t+j+k), dictionary feature amount X D (
i, j, k) are respectively XO(x, y) (x=0.
1,-,M-1,y=0.1,...,N-1)X (
i, j, k) (i=o, l,..., 7, j=0.
1,... 7, k=0.1,2.3 } X D (i, Lk) (i=o, l,...7, j
=0.1,...,7, k=0.1,2.3). Here, k (=0.1, 2.3) is a possible value of the feature element IXo(x,y) corresponding to the direction code.

特1iS!量X (i, j,k)は、各分割領域(i
 . j )ごとにkについての特徴素量XO(×,y
)を積算したものであり、 と表すことができる。ここで、Z (X O (x,y
),k)は、特徴素量XO(x.y)がkであるときに
ISkでないときに0とする関数である。
Special 1iS! The quantity X (i, j, k) is
.. j), the feature quantity XO(×, y
), and can be expressed as . Here, Z (X O (x, y
), k) is a function that is set to 0 when the feature quantity XO (x.y) is k and is not ISk.

また、sepx(4)はX方向i番目の分割線に対応す
るXの値であり、Sepy (j )はy方向j番目の
分割線に対応するyの値である。たとえば、M=72、
N=56の場合には、8×8に分割される各分割領域は
9×7(ドット)であり、sepx(0) =0+ s
epx(1)=9, sepx(2)=18. ・・・
, sepx(8)=72、sepy (0)=O. 
sepy(1)=7, sepy(2)=14, ・−
, sepy(8)一56となる。したがって、sep
x (0)およびsepx (8)−1は文字イメージ
の左端0および右端71であり、sepン(0)および
sepx (8)  1は文字イメージの下端Oおよび
上端55である。
Further, sepx (4) is the value of X corresponding to the i-th dividing line in the X direction, and Sepy (j) is the value of y corresponding to the j-th dividing line in the y direction. For example, M=72,
In the case of N=56, each divided area divided into 8×8 is 9×7 (dots), and sepx(0) =0+s
epx(1)=9, sepx(2)=18. ...
, sepx(8)=72, sepy(0)=O.
sepy(1)=7, sepy(2)=14, ・−
, sepy(8)-56. Therefore sep
x(0) and sepx(8)-1 are the left end 0 and right end 71 of the character image, and sepn(0) and sepx(8)1 are the lower end O and upper end 55 of the character image.

文字の認識に用いる特徴量間の相違(評価関数)は、辞
書の特徴ffiXD(i,Lk)と抽出された特徴量X
(i,j,k)のユークリッド距離あるいは市街地距離
により定義される。たとえば、市街地距離を用いた評価
関数Dは、 D= ΣΣ:E.  IXD(t,j.k) 一X(i
,3.k)1▲”6  J−6  k−6 と表すことができる. さて、第4図(1)において、X方向のm番目(m−1
.2.・・・,7)の分割線をd .(=0. 1.2
)だけ右方向に移動させたときに、その分割線をはさむ
分割領域の特徴量X(m−1,j+k)およびX(+*
,j,k)の変化量ΔX(m,j,k)は、 となり、各特1vI.量はそれぞれ X (m−1.Lk)− X (m−1.Lk)十ΔX
(+e,j.k)X(m,j,k)  ←X(m,j,
k)一ΔX (+m. J l k)に変換される。
The difference (evaluation function) between the features used for character recognition is the dictionary feature ffiXD(i,Lk) and the extracted feature X
It is defined by the Euclidean distance or urban area distance of (i, j, k). For example, the evaluation function D using city distance is D=ΣΣ:E. IXD(t,j.k) 1X(i
,3. k)1▲”6 J-6 k-6 Now, in Fig. 4 (1), the m-th (m-1
.. 2. ..., 7) with the dividing line d. (=0. 1.2
) to the right, the feature values X(m-1,j+k) and X(+*
, j, k) is as follows, and each characteristic 1vI. The amount is each X (m-1.Lk) - X (m-1.Lk) + ΔX
(+e, j.k)X(m,j,k) ←X(m,j,
k) - ΔX (+m. J l k).

また、X方向のm番目の分割線をd.だけ左方向に移動
させたときの変化量ΔX’ (m+ J l k)は、
となり、各特徴量はそれぞれ X (s+−1,j.k)←X(a+−Lj,k)一Δ
X’(m.j+k)X (m.j.k)  4−X (
a+,j,k)十Δx’ (m+ j+ k)に変換さ
れる。
Also, the m-th dividing line in the X direction is d. The amount of change ΔX' (m+J l k) when moving to the left by
and each feature amount is X (s+-1,j.k)←X(a+-Lj,k)-Δ
X'(m.j+k)X (m.j.k) 4-X (
a+, j, k) is converted into Δx' (m+ j+ k).

なお、d.だけ移動する分割線をはさむ分割領域以外の
特徴量については不変であり、分割線の移動によりその
特f!l+1が変化する分割領域部分における評価関数
Ddは、 と表すことができるので、X方向のm番目の分割線につ
いて、このD4が最小になる(辞書の特徴量に最も近似
できる)移動Id.を決定する。
In addition, d. The feature values other than the divided areas sandwiching the dividing line that moves by the same amount remain unchanged, and as the dividing line moves, the characteristic f! The evaluation function Dd in the divided region portion where l+1 changes can be expressed as follows. Therefore, for the m-th dividing line in the X direction, the movement Id. Determine.

左右方向特徴量適応化処理では、以上の処理をX方向の
すべての分割線について行い、それぞれの評価関数Dd
が最小となる分割領域に基づいて適応化された特徴量を
算出する。
In the horizontal direction feature adaptation processing, the above processing is performed for all dividing lines in the X direction, and each evaluation function Dd is
The adapted feature amount is calculated based on the divided region where the minimum value is obtained.

また、上下方向特徴量適応化処理では、y方向のすべて
の分割線について同様の処理を行い適応化された特徴量
を算出する.なお、左右方向および上下方向の各特徴量
適応化処理は互いに関与しているので、総合的な特徴量
適応化処理が望ましい(第4図(2)). 再識別処理では、各候補文字ごとに適応化された特徴量
に基づいて、それぞれ最小となるように変更された分割
領域に応じた評価関数(特ltI[量間の相違)を求め
る。
In addition, in the vertical feature adaptation process, similar processing is performed for all dividing lines in the y direction to calculate adapted feature quantities. Note that since the horizontal and vertical feature adaptation processes are related to each other, comprehensive feature adaptation processing is desirable (Fig. 4 (2)). In the re-identification process, an evaluation function (special ltI [difference between quantities)] is calculated based on the feature quantity adapted for each candidate character, according to the divided area that has been changed to the minimum.

以上の処理をすべての候補文字について行った後に、認
識文字の決定処理では、各候補文字に対応する評価関数
(特1!Ir量間の相違)についてソートを行い、その
中で最小の評価関数に対応する候補文字をL’2 1k
文字として決定する。
After performing the above processing for all candidate characters, in the recognition character determination process, the evaluation functions (Special 1! Differences between Ir amounts) corresponding to each candidate character are sorted, and the minimum evaluation function among them is sorted. The candidate character corresponding to L'2 1k
Determine as a character.

このように、認識対象文字の特徴量は、辞書の対応する
文字の特徴量に自動的に近似されるので、辞書の比較対
象文字に対するvQ識対象文字の微小なずれや変形を吸
収することが容易である。
In this way, the features of the character to be recognized are automatically approximated to the features of the corresponding character in the dictionary, so it is possible to absorb minute deviations and deformations of the target character for vQ recognition with respect to the character to be compared in the dictionary. It's easy.

〔発明の効果〕〔Effect of the invention〕

上述したように、本発明によれば、文字の読み込み条件
の変動やフォント変化に応じた微小なずれや変形による
誤認識が減少し、安定した文字認識率を得ることができ
る。
As described above, according to the present invention, erroneous recognition due to minute shifts or deformations due to variations in character reading conditions or font changes is reduced, and a stable character recognition rate can be obtained.

なお、認識対象文字としてプリンタ出力文字を使用し、
岩田細明朝体の文字から作成した辞書を使用し、約14
00文字について行った文字認識処理の結果は、従来方
式では80〜85%であったものが、本発明方式では9
5%程度に向上させることができ、実用的には極めて有
用である。
Note that printer output characters are used as recognition target characters,
Using a dictionary created from Iwata Hoso Mincho typeface characters, approximately 14
The result of character recognition processing performed on the 00 character was 80-85% in the conventional method, but 9% in the method of the present invention.
This can be improved to about 5%, which is extremely useful in practice.

【図面の簡単な説明】[Brief explanation of the drawing]

第1図は本発明の原理ブロック図、 第2同は本発明方式を実現する文字認識装置の実施例構
戒を示すブロック図、 第3図は実施例処理手順を示すフローチャート、第4図
は特1!l[量適応化処理の具体的手法について説明す
る図、 第5図は文字イメージの特徴量抽出処理を説明する図で
ある。 図において、 1は特徴量抽出手段、 3は辞書、 5は辞書検索手段、 7は特徴量適応化手段、 lは文字切り出し部、 3は特徴量抽出部、 5は辞書、 27は辞書検索部、 29は出力部である。 本発明原理ブロック図 第1図 文字認識装置のlIe.例を示すブロノク図第2図 m−1 m=1 (1) (2) 特徴量適応化処理の具体的手法について説明する図実施
例処理手順を示すフローチャート 第 4 図 第 3 図 (1) (2) 2 (3) (4) (5) 文字イメージの特徴量抽出処理を説明する図第5図
Fig. 1 is a block diagram of the principle of the present invention, Fig. 2 is a block diagram showing the structure of an embodiment of a character recognition device that implements the method of the present invention, Fig. 3 is a flowchart showing the processing procedure of the embodiment, and Fig. 4 is Special 1! FIG. 5 is a diagram illustrating a feature amount extraction process of a character image. In the figure, 1 is a feature extraction means, 3 is a dictionary, 5 is a dictionary search means, 7 is a feature adaptation means, l is a character extraction section, 3 is a feature extraction section, 5 is a dictionary, 27 is a dictionary search section , 29 is an output section. Block diagram of the principle of the present invention Figure 1 Character recognition device IIe. Bronok diagram showing an example Fig. 2 m-1 m=1 (1) (2) Fig. 4 explaining a specific method of feature adaptation processing Flowchart showing an example processing procedure Fig. 4 Fig. 3 Fig. (1) ( 2) 2 (3) (4) (5) Figure 5 is a diagram illustrating feature extraction processing of character images.

Claims (1)

【特許請求の範囲】[Claims] (1)認識対象文字のイメージ情報を複数の領域に分割
し、各分割領域ごとに抽出された特徴量から認識対象文
字の特徴量を得る特徴量抽出手段(11)と、 比較対象文字の特徴量が格納される辞書(13)と、 前記認識対象文字の特徴量と前記比較対象文字の特徴量
とを比較し、特徴量間の相違が最小となる比較対象文字
を認識結果として出力する辞書検索手段(15)と を備えた文字認識方式において、 所定の比較対象文字ごとに、前記認識対象文字の各分割
領域の大きさを変更して前記特徴量間の相違を最小とす
る特徴量適応化処理を行い、そのときの各特徴量間の相
違を識別処理に供する特徴量適応化手段(17)を備え
た ことを特徴とする文字認識方式。
(1) A feature amount extraction means (11) that divides image information of a character to be recognized into a plurality of regions and obtains a feature amount of the character to be recognized from the feature amount extracted for each divided region, and a feature of a character to be compared. a dictionary (13) in which quantities are stored; and a dictionary that compares the feature quantities of the recognition target character with the feature quantities of the comparison target character and outputs the comparison target character with the smallest difference between the feature quantities as a recognition result. A character recognition method comprising a search means (15), characterized in that, for each predetermined comparison target character, the size of each divided region of the recognition target character is changed to minimize the difference between the feature values. 1. A character recognition method characterized by comprising a feature amount adaptation means (17) that performs a conversion process and subjects differences between respective feature amounts at that time to an identification process.
JP1156440A 1989-06-19 1989-06-19 Character recognizing system Pending JPH0322094A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP1156440A JPH0322094A (en) 1989-06-19 1989-06-19 Character recognizing system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP1156440A JPH0322094A (en) 1989-06-19 1989-06-19 Character recognizing system

Publications (1)

Publication Number Publication Date
JPH0322094A true JPH0322094A (en) 1991-01-30

Family

ID=15627800

Family Applications (1)

Application Number Title Priority Date Filing Date
JP1156440A Pending JPH0322094A (en) 1989-06-19 1989-06-19 Character recognizing system

Country Status (1)

Country Link
JP (1) JPH0322094A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10916369B2 (en) 2016-11-08 2021-02-09 Koninklijke Philips N.V. Inductor for high frequency and high power applications

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10916369B2 (en) 2016-11-08 2021-02-09 Koninklijke Philips N.V. Inductor for high frequency and high power applications

Similar Documents

Publication Publication Date Title
KR100249055B1 (en) Character recognition apparatus
EP0355748B1 (en) A pattern recognition apparatus and method for doing the same
US5579408A (en) Character recognition method and apparatus
JPH0139154B2 (en)
EP0576020B1 (en) Character recognizing method and apparatus
US5621818A (en) Document recognition apparatus
KR100220213B1 (en) Apparatus and method of character recognition based on 0-1 pattern histogram
JP4678712B2 (en) Language identification device, program, and recording medium
JPH0322094A (en) Character recognizing system
US5596657A (en) Method of sorting out candidate characters in character recognition system
JPH09319828A (en) On-line character recognition device
JP3376808B2 (en) Word input device and word input method
JPH05314320A (en) Recognition result evaluating system using difference of recognition distance and candidate order
JP3466903B2 (en) How to create a multi-template dictionary for character recognition
JPH03150690A (en) Character recognizing device
JP2851865B2 (en) Character recognition device
JP2728117B2 (en) Character recognition device
KR960013818B1 (en) Hangeul combinational recognition apparatus using recognizing multiple stroke
JP2677271B2 (en) Character recognition device
KR940001048B1 (en) On-line hand written character recognition method
JP2923295B2 (en) Pattern identification processing method
JPS61194582A (en) Hand-written character recognition device
JPH056464A (en) Method and device for character string recognition
JP3281530B2 (en) Character recognition device
JP2639314B2 (en) Character recognition method