JP2000231602A

JP2000231602A - Feature extracting device

Info

Publication number: JP2000231602A
Application number: JP11033764A
Authority: JP
Inventors: Masaru Sugioka; 賢杉岡; Koji Ito; 晃治伊東
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 1999-02-12
Filing date: 1999-02-12
Publication date: 2000-08-22

Abstract

PROBLEM TO BE SOLVED: To securely extract feature points representing features of a character from a character image. SOLUTION: An image which is segmented into characters by a character extraction part 12 is stored in an image storage part 21 in the feature extraction device 20. An outline point extraction part 22 extracts an array of outline points forming the outline of a character line. A feature point extraction part 23 scans the outline point array to detect the outline point which is at a distance of dots more than a 1st threshold from a point of interest in the outline point array and at the shortest coordinate distance to the point of interest. Further, the feature point extraction part 23 shifts the point of interest and detects the outline point which is at a distance of dots more than the 1st threshold from the shifted point of interest in the outline point array and at the shortest coordinate distance to the point of interest, and then decides the shifted point of interest is a feature point when the distance between the two outline points in the outline point array is larger than a specific threshold.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、文字や記号等の認
識等を行うために、該記号を含めた文字等の特徴を抽出
する特徴抽出装置に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a feature extracting device for extracting features of characters and the like including the symbols in order to recognize the characters and symbols.

【０００２】[0002]

【従来の技術】文字認識を行う手法には、大きく分けて
パタンマッチングと構造解析的な手法の２通りがある。
パターンマッチングは、変形の少ない印刷活字等を認識
するのに適しているが、類似文字等の識別が困難であっ
た。そのため、特に高い認識精度を必要とする場合に
は、構造解析的な手法が用いられている。構造解析的な
手法では、基準となる文字の特徴点の位置関係の特徴を
予め辞書に登録しておき、入力された文字パタンから特
徴的な部分を特徴点として抽出し、該抽出した特徴点の
位置関係に対して辞書を用い、文字を認識している。2. Description of the Related Art Character recognition methods are roughly classified into two types, that is, pattern matching and structure analysis.
The pattern matching is suitable for recognizing a print character or the like with little deformation, but it is difficult to identify a similar character or the like. Therefore, when particularly high recognition accuracy is required, a structural analysis method is used. In the structural analysis method, a feature of a positional relationship of a feature point of a reference character is registered in a dictionary in advance, and a feature part is extracted as a feature point from an input character pattern, and the extracted feature point is extracted. Characters are recognized using a dictionary for the positional relationship of.

【０００３】従来、このような特徴点の抽出に関する技
術としては、例えば次の文献に記載されたものがある。文献；特開平８−７７２９３号公報上記文献に記載された文字認識装置では、文字の輪郭点
の方向コード列を作成し、この方向コード列の差分を求
め、該差分がある閾値以上のところを文字の特徴点とし
て抽出している。つまり、輪郭点の変化量が大きいとこ
ろを特徴点としている。手書き文字の特徴点としては、
端点、分岐点、交差点、急な弧における頂点等が抽出で
きればよい。Conventionally, as a technique relating to such feature point extraction, there is a technique described in the following document, for example. Reference: JP-A-8-77293 In the character recognition device described in the above reference, a direction code sequence of character outline points is created, and a difference between the direction code sequences is determined. It is extracted as character feature points. In other words, the feature point is where the amount of change in the contour point is large. As a feature point of handwritten characters,
It suffices if an end point, a branch point, an intersection, a vertex in a steep arc, etc. can be extracted.

【０００４】[0004]

【発明が解決しようとする課題】しかしながら、前記文
献に記載された文字認識装置等が行う特徴抽出では、次
のような課題があった。図２は、従来の特徴抽出におけ
る課題を示す図である。前記文献の文字認識装置では、
文字を構成するセグメントの端点１，２，３，４や交差
点５，６のように、輪郭点の方向が急激に変化する部分
では安定して特徴点を抽出できるが、交差点７，８のよ
うに、輪郭点の方向変化が少ない部分に対しては特徴点
として抽出しないことがある。交差点７，８を特徴点と
して抽出するためには、閾値を下げて輪郭点の方向変化
量がある程度小さいところまで、特徴点として抽出でき
るようにしなければならない。ところが、閾値を下げる
と、例えば、なだらかな弧を描く頂点等の特徴点として
ふさわしくない部分も特徴点としてしまうことになる。
また、この最適な閾値の設定も困難であるという課題が
あった。さらに、すべての輪郭点について方向コード列
を作成する必要があるため、解像度が十分に高く輪郭点
が多くなった場合に、処理時間が膨大になるという問題
もあった。However, the feature extraction performed by the character recognition device and the like described in the above document has the following problems. FIG. 2 is a diagram showing a problem in the conventional feature extraction. In the character recognition device of the document,
A feature point can be stably extracted in a portion where the direction of a contour point changes abruptly, such as end points 1, 2, 3, 4 and intersections 5, 6 of a segment constituting a character. In some cases, a portion where the direction change of a contour point is small is not extracted as a feature point. In order to extract the intersections 7 and 8 as feature points, it is necessary to lower the threshold value so that the direction change amount of the contour point can be extracted as a feature point to some extent. However, if the threshold value is lowered, for example, a portion that is not suitable as a feature point such as a vertex that draws a gentle arc will also be a feature point.
There is also a problem that it is difficult to set the optimum threshold. Furthermore, since it is necessary to create a direction code string for all contour points, there is a problem that the processing time becomes enormous when the resolution is sufficiently high and the number of contour points increases.

【０００５】[0005]

【課題を解決するための手段】前記課題を解決するため
に、本発明のうちの第１の発明は、文字線で描画された
文字の画像を入力し、該画像における文字の特徴を表す
特徴点を抽出する特徴抽出装置において、次のような画
像格納部、輪郭点抽出部及び特徴点抽出部を設けてい
る。前記画像格納部は、１文字単位に切出された前記画
像を入力して格納するものである。輪郭点抽出部は、前
記格納された画像において前記文字線の輪郭を形成する
輪郭点列を抽出するものである。特徴点抽出部は、前記
輪郭点抽出部で抽出された輪郭点列を走査し、輪郭点の
１である注目点Ｃ_nから該輪郭点列上のドット数で第１
の閾値ＴＨ１以上離れかつ該注目点Ｃ_nから座標距離で
最も近い距離にある輪郭点ＮＲ_nを検出し、さらに、注
目点を該注目点Ｃ_nから該輪郭点列上で１ドットずれた
注目点Ｃ_n+1に移したうえで該輪郭点抽出部で抽出され
た輪郭点列を走査し、該注目点Ｃ_n+1から該輪郭点列上
のドット数で該第１の閾値ＴＨ１以上離れ、かつ該注目
点Ｃ_n+1から座標距離で最も近い距離にある輪郭点ＮＲ
_n+1を検出し、該輪郭点ＮＲ_nと輪郭点ＮＲ_n+1との間
の輪郭点列上のドット数が第２の閾値ＴＨ２よりも大き
いときに該注目点Ｃ_n+1を前記特徴点とするものであ
る。[MEANS FOR SOLVING THE PROBLEMS]
According to a first aspect of the present invention, a character line is drawn.
Input an image of a character and represent the characteristics of the character in the image
In the feature extraction device that extracts feature points, the following image
Image storage unit, contour point extraction unit and feature point extraction unit
You. The image storage unit stores the image extracted in units of one character.
The image is input and stored. The contour point extraction unit
Forming an outline of the character line in the stored image
This is for extracting a contour point sequence. The feature point extracting unit is
The outline point sequence extracted by the outline point extraction unit is scanned, and the outline point
Attention point C which is 1_nFrom the number of dots on the contour point sequence
Point of interest C_nAt coordinate distance from
The closest contour point NR_nIs detected, and
Attention point C_nFrom the outline point sequence by one dot
Attention point C_{n + 1}And then extracted by the contour point extraction unit.
Scans the contour point sequence_{n + 1}From the contour point sequence
Are separated by more than the first threshold value TH1 with the number of dots of
Point C_{n + 1}Contour point NR at the nearest coordinate distance from
_{n + 1}And the contour point NR_nAnd contour point NR_{n + 1}Between
Is larger than the second threshold value TH2
The point of interest C_{n + 1}Is the feature point.
You.

【０００６】このような構成を採用したことにより、第
１の発明では、文字線の輪郭を形成する輪郭点列が輪郭
点抽出部により、抽出される。輪郭点列において隣接す
る各注目点Ｃ_n，Ｃ_n+1から第１の閾値だけそれぞれ離
れ、かつ、注目点Ｃ_n，Ｃ_n+ ₁に最も近い位置の輪郭点
ＮＲ_n，ＮＲ_n+1がそれぞれ求められ、これらの輪郭点
の輪郭点上の距離が第２の閾値以上であれば、注目点Ｃ
_n+1が特徴点として抽出される。つまり、輪郭点列の方
向変化ではなく、距離から特徴点が求められることにな
る。According to the first aspect of the present invention, the outline point sequence forming the outline of the character line is extracted by the outline point extracting unit. Contour points NR _n , NR _{n + 1 at} positions closest to the points of interest C _n , C _{n +} ₁ apart from the points of interest C _n , C _{n + 1} adjacent to each other in the line of contour points by the first threshold value, respectively. If the distance between these contour points on the contour point is equal to or greater than the second threshold, the point of interest C
_{n + 1} is extracted as a feature point. That is, the feature point is obtained from the distance, not the change in the direction of the outline point sequence.

【０００７】第２の発明では、特徴抽出装置において、
第１の発明と同様の画像格納部、輪郭点抽出部を備える
と共に、次のような線幅検出部及び特徴点抽出部を設け
ている。前記線幅検出部は、前記格納された画像の前記
文字線の線幅を検出するものである。特徴点抽出部は、
前記線幅に対応する第１の閾値ＴＨ１及び第２の閾値Ｔ
Ｈ２を決定し、前記輪郭点抽出部で抽出された輪郭点列
を走査し、注目点Ｃ_nから該輪郭点列上のドット数で該
第１の閾値以上離れかつ該注目点Ｃ_nから座標距離で最
も近い距離にある輪郭点ＮＲ_nを検出し、さらに、注目
点を注目点Ｃ_nから該輪郭点列上で１ドットずれた注目
点Ｃ_n+1に移したうえで該輪郭点抽出部で抽出された輪
郭点列を走査し、該注目点Ｃ_n+1から該輪郭点列上のド
ット数で該第１の閾値ＴＨ１以上離れかつ該注目点Ｃ
_n+1から座標距離で最も近い距離にある輪郭点ＮＲ_n+1
を検出し、該輪郭点ＮＲ_nと輪郭点ＮＲ_n+1との間の輪
郭点列上のドット数が該第２の閾値ＴＨ２よりも大きい
ときに該注目点Ｃ_n+1を前記特徴点とするものである。According to a second aspect, in a feature extraction device,
In addition to the same image storage unit and contour point extraction unit as in the first invention, the following line width detection unit and feature point extraction unit are provided. The line width detection unit detects a line width of the character line of the stored image. The feature point extraction unit is
A first threshold TH1 and a second threshold T corresponding to the line width
Determine the H2, scanning the contour point string extracted by the contour point extracting section, the first threshold value or more from the target point C _n by the number of dots on the contour point row away and coordinates from the noted point C _n The contour point NR _n at the closest distance is detected, and the point of interest is further shifted from the point of interest C _n to the point of interest C _{n + 1} shifted by one dot on the line of contour points. The contour point sequence extracted by the section is scanned, and is separated from the point of interest C _{n + 1 by the} number of dots on the contour point row by the first threshold value TH1 or more and the point of interest C
_The contour point NR _{n + 1} which is the closest coordinate distance from _{n + 1}
And when the number of dots on the contour point sequence between the contour point NR _n and the contour point NR _{n + 1} is larger than the second threshold value TH2, the target point C _{n + 1 is set} to the feature point. It is assumed that.

【０００８】このような構成を採用したことにより、線
幅検出部によって線幅が検出される。特徴点抽出部によ
り、線幅に対応する第１及び第２の閾値ＴＨ１，ＴＨ２
が決定され、該第１及び第２の閾値ＴＨ１，ＴＨ２が用
いられて特徴点が抽出される。By adopting such a configuration, the line width is detected by the line width detecting section. The first and second thresholds TH1 and TH2 corresponding to the line width are extracted by the feature point extracting unit.
Is determined, and a feature point is extracted using the first and second threshold values TH1 and TH2.

【０００９】第３の発明では、特徴抽出装置において、
第１の発明と同様の画像格納部、輪郭点抽出部を備える
と共に、次のような間引き走査閾値検出部及び特徴点抽
出部を設けている。前記間引き走査閾値検出部は、前記
輪郭点抽出部で抽出した輪郭点の量に応じた第１の間引
き走査閾値ＭＴＨ１及第２の間引き走査閾値ＭＴＨ２を
決定するものである。特徴点抽出部は、前記輪郭点抽出
部で抽出された輪郭点列を前記第２の間引き走査閾値Ｍ
ＴＨ２毎に走査し、注目点Ｃ_nから該輪郭点列上のドッ
ト数で第１の閾値ＴＨ１以上離れかつ該注目点Ｃ_nから
座標距離で最も近い距離にある輪郭点ＮＲ_nを検出し、
さらに、注目点を該注目点Ｃ_nから該輪郭点列上で前記
第１の間引き走査閾値ＭＴＨ１分ずれた注目点Ｃ_n+MTH1
に移したうえで該輪郭点抽出部で抽出された輪郭点列を
該第２の間引き走査閾値ＭＴＨ２毎に走査し、該注目点
Ｃ_n+MTH1から該輪郭点列上のドット数で該第１の閾値Ｔ
Ｈ１以上離れかつ該注目点Ｃ_n+MTH1から座標距離で最も
近い距離にある輪郭点ＮＲ_n+MTH1を検出し、該輪郭点Ｎ
Ｒ_nと輪郭点ＮＲ_n+MTH1との間の輪郭点列上のドット数
が第２の閾値ＴＨ２よりも大きいときに該注目点Ｃ
_n+MTH1を前記特徴点とするものである。According to a third aspect, in the feature extracting device,
In addition to the same image storage unit and contour point extraction unit as in the first invention, the following thinning scanning threshold value detection unit and feature point extraction unit are provided. The thinning-out scanning threshold detecting unit determines a first thinning-out scanning threshold MTH1 and a second thinning-out scanning threshold MTH2 according to the amount of contour points extracted by the contour point extracting unit. The feature point extracting unit converts the contour point sequence extracted by the contour point extracting unit into the second thinning scan threshold M.
Scans every TH2, to detect contour points NR _n in away from the target point C _n by the number of dots on the contour point row first threshold TH1 or higher and from the noted point C _n to the closest distance at coordinates distance,
Furthermore, attention point shifted the first thinning scanning threshold MTH1 minutes point of interest from the noted point C _n on the contour point sequence C _{n + MTH1}
And then scans the outline point sequence extracted by the outline point extraction unit for each of the second thinning-out scanning thresholds MTH2, and _{calculates the} number of dots on the outline point sequence from the point of _{interest Cn + MTH1} . Threshold T of 1
A contour point NR _{n + MTH1} which is more than H1 away from the point of _interest C _{n + MTH1} and is closest to the coordinate distance is detected, and the contour point N
When the number of dots on the contour point sequence between R _n and the contour point NR _{n + MTH1} is greater than the second threshold value TH2, the point of _interest C
_{n + MTH1} is the feature point.

【００１０】このような構成を採用したことにより、間
引き走査閾値検出部により、輪郭点の量に応じた第１及
び第２の間引き走査閾値ＭＴＨ１，ＭＴＨ２が決定され
る。特徴点抽出部は、これら第１及び第２の間引き走査
閾値ＭＴＨ１，ＭＴＨ２を用いた間引き走査により、特
徴点を検出することになる。By adopting such a configuration, the first and second thinning-out scanning thresholds MTH1 and MTH2 according to the amount of the contour points are determined by the thinning-out scanning threshold detecting unit. The feature point extracting unit detects feature points by thinning scanning using the first and second thinning scanning thresholds MTH1 and MTH2.

【００１１】第４の発明では、特徴抽出装置において、
第２の発明と同様の画像格納部、輪郭点抽出部及び線幅
検出部を備えると共に、次のような間引き走査閾値検出
部及び特徴点抽出部を設けている。前記間引き走査閾値
検出部は、前記線幅に対応する第１の間引き走査閾値Ｍ
ＴＨ１及び第２の閾値ＭＴＨ２を決定するものである。
特徴点抽出部は、前記線幅に対応する第１の閾値ＴＨ１
及び第２の閾値ＴＨ２を決定し、前記輪郭点抽出部で抽
出された輪郭点列を前記第２の間引き走査閾値ＭＴＨ２
毎に走査し、輪郭点の１つである注目点Ｃ_nから該輪郭
点列上のドット数で該第１の閾値ＴＨ１以上離れかつ該
注目点Ｃ_nから座標距離で最も近い距離にある輪郭点Ｎ
Ｒ_nを検出し、さらに、注目点を注目点Ｃ_nから該輪郭
点列上で前記第１の間引き走査閾値ＭＴＨ１分ずれた注
目点Ｃ_n+MTH1に移したうえで該輪郭点抽出部で抽出され
た輪郭点列を該第２の間引き走査閾値ＭＴＨ２毎に走査
し、該注目点Ｃ_n+MTH1から該輪郭点列上のドット数で該
第１の閾値ＴＨ１以上離れかつ該注目点Ｃ_n+MTH1から座
標距離で最も近い距離にある輪郭点ＮＲ_n+MTH1を検出
し、該輪郭点ＮＲ_nと輪郭点ＮＲ_n+MTH1との間の輪郭点
列上のドット数が該第２の閾値ＴＨ２よりも大きいとき
に該注目点Ｃ_n+MTH1を前記特徴点とするものである。According to a fourth aspect, in the feature extracting device,
In addition to the same image storage unit, contour point extraction unit and line width detection unit as in the second invention, the following thinning-scanning threshold detection unit and feature point extraction unit are provided. The thinning-scanning-threshold detector detects a first thinning-scanning threshold M corresponding to the line width.
TH1 and a second threshold value MTH2 are determined.
The feature point extracting unit may determine a first threshold value TH1 corresponding to the line width.
And a second threshold value TH2, and the contour point sequence extracted by the contour point extraction unit is subjected to the second thinning scanning threshold value MTH2.
Scans each, contour from which is one from the target point C _n apart first threshold TH1 or more in the number of dots on the contour point sequence and the noted point C _n of the contour points in the closest coordinate distance Point N
Detecting a R _n, further, in the contour point extracting unit after having transferred target point from target point C _n to the target point C _{n + MTH1} shifted first thinning scanning threshold MTH1 minutes on the contour point sequence The extracted contour point sequence is scanned for each of the second thinning-out scanning thresholds MTH2, and is _separated from the _target point C _{n + MTH1 by the} number of dots on the contour point sequence by the first threshold value TH1 or more and the target point C _The contour point NR _{n + MTH1} which is closest to the coordinate distance from _{n + MTH1} is detected, and the number of dots on the contour point sequence between the contour point NR _n and the contour point NR _{n + MTH1} is determined by the second When the value is larger than the threshold value TH2, the point of _{interest Cn + MTH1} is _set as the feature point.

【００１２】このような構成を採用したことにより、線
幅検出部により、画像中の文字の線幅が検出され、該線
幅に対応する第１及び第２の間引き走査閾値ＭＴＨ１，
ＭＴＨ２が、間引き走査閾値検出部により、決定され
る。特徴点抽出部は、これら線幅に対応する第１及び第
２の間引き走査閾値ＭＴＨ１，ＭＴＨ２を用いた間引き
走査と、線幅に対応した第１及び第２の閾値ＴＨ１，Ｔ
Ｈ２を用いた輪郭点の評価を行い、特徴点を検出するこ
とになる。By adopting such a configuration, the line width of the character in the image is detected by the line width detecting unit, and the first and second thinning-out scanning thresholds MTH1, MTH1 corresponding to the line width are detected.
MTH2 is determined by the thinning-out scanning threshold value detection unit. The feature point extracting unit performs thinning-out scanning using the first and second thinning-out scanning thresholds MTH1 and MTH2 corresponding to these line widths, and the first and second thresholds TH1 and T corresponding to the line widths.
The evaluation of the contour points using H2 is performed, and the feature points are detected.

【００１３】第５の発明では、特徴抽出装置において、
文字線で描画された文字の画像を入力し、該画像におけ
る文字の特徴を表す特徴点を抽出する特徴抽出装置にお
いて、第３の発明と同様の画像格納部、輪郭点抽出部、
線幅検出部、及び間引き走査閾値検出部を備えると共
に、第４の発明と同様の特徴点抽出部を備えている。こ
のような構成を採用したことにより、輪郭点の量に応じ
た第１及び第２の間引き走査閾値ＭＴＨ１，ＭＴＨ２が
間引き走査閾値検出部により、決定される。特徴点抽出
部は、これら輪郭点の量に応じた第１及び第２の間引き
走査閾値ＭＴＨ１，ＭＴＨ２を用いた間引き走査と、線
幅に応じた第１及び第２の閾値ＴＨ１，ＴＨ２を用いた
輪郭点の評価を行い、特徴点を検出することになる。According to a fifth aspect, in the feature extracting device,
In a feature extraction device for inputting an image of a character drawn by a character line and extracting a feature point representing a feature of the character in the image, an image storage unit, a contour point extraction unit,
A line width detection unit and a thinning-out scanning threshold value detection unit are provided, and a feature point extraction unit similar to that of the fourth aspect is provided. By employing such a configuration, the first and second thinning scanning thresholds MTH1 and MTH2 according to the amount of the contour points are determined by the thinning scanning threshold detecting unit. The feature point extraction unit uses the first and second thinning-out scanning thresholds MTH1 and MTH2 according to the amount of these contour points, and uses the first and second thresholds TH1 and TH2 according to the line width. The contour points are evaluated and feature points are detected.

【００１４】第６の発明では、第１または第３の発明の
特徴抽出装置において、前記特徴点に相当する輪郭点Ｒ
_nから、予め設定された第３の閾値ＴＨ３だけ前記輪郭
点列を順方向に進んだ輪郭点Ｒ_n+TH3と該第３の閾値Ｔ
Ｈ３だけ逆方向に戻った輪郭点Ｒ_n-TH3とを求め、該輪
郭点Ｒ_n、輪郭点Ｒ_n+TH3及び輪郭点Ｒ_n-TH3の位置関
係から、前記文字における該輪郭点Ｒ_nの位置が凸にな
っているか凹になっているかを検出する凹凸検出部を設
けている。このような構成を採用したことにより、特徴
点ばかりでなく、該特徴点が文字において凹になってい
るか凸になっているかの情報も検出することになる。According to a sixth aspect, in the feature extraction device according to the first or third aspect, a contour point R corresponding to the feature point is provided.
_n , a contour point R _{n + TH3} that has advanced the contour point sequence in the forward direction by a preset third threshold value _TH3 and the third threshold value T
The contour point R _n-TH3 returned in the opposite direction by H3 is obtained, and from the positional relationship between the contour point R _n , the contour point R _{n + TH3} and the contour point R _n-TH3 , the contour point R _n of the character is obtained. An unevenness detecting unit for detecting whether the position is convex or concave is provided. By employing such a configuration, it is possible to detect not only the feature points but also information as to whether the feature points are concave or convex in the character.

【００１５】第７の発明では、第２、第４または第５の
発明の特徴抽出装置において、前記線幅検出部で検出さ
れた前記線幅から第３の閾値ＴＨ３を決定し、前記特徴
点に８当する輪郭点Ｒ_nから該第３の閾値ＴＨ３だけ前
記輪郭点列を順方向に進んだ輪郭点Ｒ_n+TH3と該第３の
閾値ＴＨ３だけ逆方向に戻った輪郭点Ｒ_n-TH3とを求
め、該輪郭点Ｒ_n、輪郭点Ｒ_n+TH3及び輪郭点Ｒ_n-TH3
の位置関係から、前記文字における該輪郭点Ｒ_nの位置
が凸になっているか凹になっているかを検出する凹凸検
出部を設けている。このような構成を採用したことによ
り、特徴点ばかりでなく、該特徴点が文字において凹に
なっているか凸になっているかの情報も検出することに
なる。According to a seventh aspect, in the feature extracting apparatus according to the second, fourth or fifth aspect, a third threshold value TH3 is determined from the line width detected by the line width detecting section, and the characteristic point is determined. contour point 8 from this contour point R _n by the threshold TH3 of the third of the contour point sequence returned by the opposite side threshold TH3 contour point R _{n + TH3} and the third advanced in the forward direction R _{n- TH3} and the contour points R _n , R _{n + TH3} and R _n-TH3
From the positional relationship between the position of the contour point R _n in the character it is provided the concave-convex detecting part for detecting whether become concave or has a convex. By employing such a configuration, it is possible to detect not only the feature points but also information as to whether the feature points are concave or convex in the character.

【００１６】[0016]

【発明の実施の形態】第１の実施形態図１は、本発明の第１の実施形態を示す特徴抽出装置を
備えた文字認識装置の構成図である。この文字認識装置
は、例えば画像記憶媒体１１から２値画像を受取り、こ
れを１文字単位に切出す文字切出部１２と、文字認識を
行う文字認識部１３とを備え、該文字切出部１２及び文
字認識部１３の間に特徴抽出装置２０が接続されてい
る。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS First Embodiment FIG. 1 is a configuration diagram of a character recognition device provided with a feature extraction device according to a first embodiment of the present invention. This character recognition device includes, for example, a character cutout unit 12 that receives a binary image from an image storage medium 11 and cuts it out in units of one character, and a character recognition unit 13 that performs character recognition. A feature extracting device 20 is connected between the character recognition unit 12 and the character recognition unit 13.

【００１７】特徴抽出装置２０は、文字切出部１２から
与えられた２値画像を格納する画像画像格納部２１を備
えている。画像格納部２１の出力側に、文字線の輪郭を
表す輪郭点列を抽出する輪郭点抽出部２２と、該抽出さ
れた輪郭点列を走査して文字の特徴的な輪郭点を特徴点
として抽出する特徴点抽出部２３と、特徴点抽出部２３
で抽出された特徴点が凸特徴点か凹特徴点かを検出する
凹凸検出部２４とが、順に接続されている。凹凸検出部
２４の出力側が、文字認識部１３の入力側に接続されて
いる。The feature extracting device 20 includes an image storage 21 for storing the binary image provided from the character extracting unit 12. On the output side of the image storage unit 21, a contour point extraction unit 22 for extracting a contour point sequence representing a contour of a character line, and scans the extracted contour point sequence to set a characteristic contour point of a character as a characteristic point. Feature point extracting unit 23 to be extracted, and feature point extracting unit 23
And a concavo-convex detecting unit 24 for detecting whether the feature point extracted in step (1) is a convex feature point or a concave feature point. The output side of the unevenness detection unit 24 is connected to the input side of the character recognition unit 13.

【００１８】次に、この文字認識装置の動作を説明す
る。文字切出部１２は、文字が記載された文書或いは帳
票等の２値画像を格納した画像記録媒体１１から該２値
画像を受け取り、１文字単位に切出し、特徴抽出装置２
０に与える。特徴抽出装置２０中の画像格納部２１は、
文字切出部１２からのｌ文字単位の２値画像を受け取
り、これを格納する。輪郭点抽出部２２は、画像格納部
２１から１文字単位の２値画像を受け取り、該２値画像
の輪郭点を抽出する。Next, the operation of the character recognition device will be described. The character extracting unit 12 receives the binary image from the image recording medium 11 storing the binary image such as a document or a form in which the character is described, and cuts out the binary image on a character-by-character basis.
Give to 0. The image storage unit 21 in the feature extraction device 20
The binary image is received from the character extracting unit 12 in units of 1 character and stored. The contour point extraction unit 22 receives a binary image in units of one character from the image storage unit 21 and extracts a contour point of the binary image.

【００１９】図３（ａ），（ｂ）は、図１中の輪郭点抽
出部２２が行う輪郭点抽出の概要を示す図である。輪郭
点は周知の方法である輸郭追跡法によって抽出すること
ができる。輸郭追跡法では、図３（ａ）のように、２値
画像の左上を走査開始点とし、画像の全面を順次走査す
る。このとき白画素に接する黒画素を検出した場合、図
３（ｂ）に示すように、黒画素を反時計方向に追跡する
ようにして輪郭を抽出する。白画素に接する黒画素と
は、黒画素の上下左右のいずれかに白画素が存在するも
のをいう。具体的には、輪郭点抽出部２２は、走査過程
で最初に検出した白画素に接する黒画素ｂ０を輪郭追跡
の開始点とし、次に画素ｂ０を中心として該画素ｂ０の
左下の画素を始点とした反時計方向に黒画素を検出す
る。これにより輪郭点の黒画素ｂ１が検出される。画素
ｂ１が検出されると、該画素ｂ１を中心とし、画素ｂ０
を始点とした反時計方向に黒画素を検出する。これによ
り輪郭点の黒画素ｂ２が検出される。このように輪郭を
追跡する処理を開始点ｂ０に戻ってくるまで繰り返し、
輪郭点列を求める。FIGS. 3A and 3B are diagrams showing an outline of outline point extraction performed by the outline point extraction unit 22 in FIG. The contour points can be extracted by a well-known translocation tracking method. In the translocation tracking method, as shown in FIG. 3A, the upper left of a binary image is set as a scanning start point, and the entire image is sequentially scanned. At this time, when a black pixel in contact with a white pixel is detected, as shown in FIG. 3B, the contour is extracted by tracing the black pixel in a counterclockwise direction. A black pixel that is in contact with a white pixel is one in which a white pixel exists at any of the upper, lower, left, and right sides of the black pixel. Specifically, the contour point extracting unit 22 sets the black pixel b0 that is in contact with the white pixel detected first in the scanning process as the start point of the contour tracking, and then sets the lower left pixel of the pixel b0 around the pixel b0 as the start point. Then, a black pixel is detected in the counterclockwise direction. Thereby, the black pixel b1 at the contour point is detected. When the pixel b1 is detected, the pixel b1 is centered on the pixel b1 and the pixel b0 is detected.
A black pixel is detected in the counterclockwise direction starting from. Thereby, the black pixel b2 at the contour point is detected. The process of following the contour is repeated until the process returns to the start point b0,
Find the outline point sequence.

【００２０】図４は、輪郭テーブルの構成例を示す図で
ある。輪郭点を追跡する輪郭点抽出部２２は、追跡した
輪郭点を輪郭データとし、これらを、図４のような輪郭
点番号と、該輪郭点の座標と、該輪郭点が特徴点か否か
を示す欄（特徴点）とからなる輪郭テーブルに書込む。
ただし、特徴点か否かを示す欄は、この時点では特徴点
がまだ抽出されていないので、すべての輪郭点に対して
特徴点ではないことを示す“０”を入れておく。FIG. 4 is a diagram showing a configuration example of the contour table. The contour point extraction unit 22 that tracks the contour points uses the tracked contour points as contour data, and uses them as contour point numbers as shown in FIG. 4, coordinates of the contour points, and whether the contour points are feature points. Is written in a contour table including columns (feature points) indicating
However, since the feature point has not been extracted yet at this time, “0” indicating that the contour point is not a feature point is entered in the column indicating whether or not the feature point.

【００２１】図５（ａ），（ｂ）は、文字構成例を示す
図である。文字によっては、図５（ａ）のような分離文
字や、同図（ｂ）のようなループを含む文字がある。こ
れらには、独立したセグメントＳｇ１，Ｓｇ２毎に、輪
郭テーブルを作成して各輪郭点の情報を書き込んでお
く。輪郭点抽出部２２は、作成した輪郭テーブルを特徴
点抽出部２３に与える。特徴点抽出部２３は、輪郭点抽
出部２２で作成した輪郭テーブルを受け取り、以下のよ
うな処理で特徴点を抽出する。FIGS. 5A and 5B are diagrams showing an example of a character configuration. Depending on the character, there are a separated character as shown in FIG. 5A and a character including a loop as shown in FIG. In these, a contour table is created for each of the independent segments Sg1 and Sg2, and information on each contour point is written. The contour point extracting unit 22 gives the created contour table to the feature point extracting unit 23. The feature point extracting unit 23 receives the contour table created by the contour point extracting unit 22, and extracts a feature point by the following processing.

【００２２】図６（ａ），（ｂ）は、特徴点抽出部２３
が行う特徴点抽出の手順を示す図であり、同図（ａ）は
文字の端点での特徴点抽出、及び同図（ｂ）は交差点で
の特徴抽出の手順をそれぞれ示している。図７（ａ），
（ｂ）は、特徴点抽出の概要を示す図である。特徴点抽
出部２３は、輪郭テーブルの最初の輪郭点Ｒ₁を選択し
て注目点Ｃ₁とし、該注目点Ｃ₁のＸ座標Ｘ₁、及びＹ
座標Ｙ₁からある条件を満たす輪郭点ＮＲ₁を検出す
る。次に、注目点を１つずらして輪郭点Ｒ₂を選択して
注目点Ｃ ₂とし、同様に輪郭点ＮＲ₂を検出する。この
とき輪郭点ＮＲ₁とＮＲ₂とがある条件を満たしたとき
その注目点Ｃ₂を特徴点と決定する。さらに、注目点を
１つ進めて輪郭点Ｒ₃を注目点Ｃ₃として以下同様の処
理を行い。注目点が最後の輪郭点Ｒ_eになるまで繰り返
す。輪郭点Ｒ_nを注目点Ｃ_nとした場合を、図６及び図
７を用いて詳細に説明する。FIGS. 6A and 6B show the characteristic point extracting unit 23.
Is a diagram showing a procedure of feature point extraction performed by
The feature point extraction at the end point of the character, and FIG.
Are respectively shown. FIG. 7 (a),
(B) is a diagram showing an outline of feature point extraction. Feature point extraction
The output unit 23 outputs the first contour point R in the contour table.₁Select
Attention point C₁And the point of interest C₁X coordinate of₁, And Y
Coordinate Y₁Contour point NR satisfying a certain condition₁Detect
You. Next, the attention point is shifted by one, and the contour point R is shifted._TwoSelect
Attention point C _TwoSimilarly, the contour point NR_TwoIs detected. this
When contour point NR₁And NR_TwoWhen certain conditions are met
Attention point C_TwoIs determined as a feature point. In addition,
Advance one to contour point R_ThreeThe attention point C_ThreeAnd the same process
Do The point of interest is the last contour point R_eRepeat until
You. Contour point R_nThe attention point C_n6 and FIG.
7 will be described in detail.

【００２３】輪郭テーブルを最初から最後まで走査し、
該注目点Ｃ_nのＸ座標Ｘ_n及びＹ座標Ｙ_nから、次の条
件を満たす輪郭点ＮＲ_nを検出する。条件１；当該輪郭点と注目点Ｃ_nとの間の輪郭点列上の
ドット距離が第１の閾値ＴＨ１より離れている。条件２；（条件１）を満たした輪郭点のうちの注目点Ｃ
_nの座標（Ｘ_n，Ｙ_n）に最も座標距離が近い輪郭点The contour table is scanned from beginning to end,
From X-coordinate X _n and the Y coordinate Y _n of the noted point C _n, detects the following conditions contour point NR _n. Condition 1: dot distance on the contour point sequence between the contour point and the target point C _n are separated from the first threshold value TH1. Condition 2; attention point C among contour points satisfying (condition 1)
_The contour point whose coordinate distance is closest to the coordinates ( _Xn , Yn) of _n

【００２４】仮に閾値ＴＨ１を６と予め定めた場合に
は、図７（ａ）のように、注目点Ｃ_nから、輪郭点列に
沿ったドット距離で閾値ＴＨ１よりも離れた輪郭点は、
輪郭点Ｒ_n-7，Ｒ_n-8，…及び輪郭点Ｒ_n+7，Ｒ_n+8，
…である。これら輪郭点Ｒ_n-7，Ｒ_n-8，…及び輪郭点
Ｒ_n+7，Ｒ_n+8，…のうちで、注目点Ｃ_nからの実際の
座標距離が一番近いものを輪郭点ＮＲ_nとして検出す
る。例えば輪郭点Ｒ_n-7は、注目点Ｃ_nに対して縦方向
に２ドット横方向に７ドットずれているので、実際の座
標距離は、約７．２８（＝√（２²＋７²））となる。
これに対する輪郭点Ｒ_n+7は、注目点Ｃ_nに対して縦方
向に２ドット横方向に６ドットずれているので、実際の
座標距離は、約６．３２（＝√（２²＋６²））とな
る。よって、輪郭点Ｒ_n+7が輪郭点ＮＲ_nとなる。[0024] Assuming the case where the threshold value TH1 determined in advance and 6, as shown in FIG. 7 (a), the the target point C _n, distant contour point than the threshold value TH1 dot distance along the contour point row,
Contour points R _n-7 , R _n-8 ,... And contour points R _{n + 7} , R _{n + 8} ,
... These outline points _{_{R n-7, R n-}} 8, ... and contour point _{_{R n + 7, R n +}} 8, ... among the contour points those actual coordinates distance from the target point C _n is closest It is detected as NR _n . For example, since the contour point R _n-7 is shifted two dots in the vertical direction and seven dots in the horizontal direction with respect to the point of interest C _n , the actual coordinate distance is about 7.28 (= √ (2 ² +7 ² )). ).
On the other hand, the contour point R _{n + 7} is shifted two dots in the vertical direction and six dots in the horizontal direction with respect to the point of interest C _n , so that the actual coordinate distance is about 6.32 (= √ (2 ² +6 ^2). )). Thus, the contour point R _{n + 7} is a contour point NR _n.

【００２５】輪郭点ＮＲ_nを検出した後、特徴点抽出部
２３は、注目点を１つ進めて輪郭点Ｒ_n+1を注目点Ｃ
_n+1とし、同様に上記条件を用いて輪郭点ＮＲ_n+1を求
める。この場合には、図７（ｂ）のように、注目点Ｃ
_n+1から、輪郭点列に沿ったドット距離で閾値ＴＨ１よ
りも離れた輪郭点は、輪郭点Ｒ_n-6，Ｒ_n-7，…及び輪
郭点Ｒ_n+8，Ｒ_n+9，…である。これら条件１を満たす
輪郭点のうちで、輪郭点Ｒ _n-6の注目点Ｃ_n+1からの座
標距離は、約６．７１（＝√（３²＋６²））となる。
これに対する輪郭点Ｒ_n+8の注目点Ｃ_n+1からの座標距
離は、約７．０７（＝√（１²＋７²））となる。よっ
て、輪郭点Ｒ_n-6が輪郭点ＮＲ_n+1となる。輪郭点ＮＲ
_n及びＮＲ_n+1を検出した特徴点抽出部２３は、輪郭テ
ーブルを参照して該輪郭点ＮＲ_n及びＮＲ_n+1間の輪郭
点列上の輪郭点数（ドット数）を求め、図６（ａ），
（ｂ）のように、その輪郭点数が予め定めた第２の閾値
ＴＨ２（例えば７）より大きい場合、輪郭点Ｃ_n+1（Ｒ
_n+1）を特徴点と決定する。Contour point NR_nFeature point extraction unit
23 indicates that the contour point R_{n + 1}The attention point C
_{n + 1}Similarly, using the above conditions, the contour point NR_{n + 1}Seeking
Confuse. In this case, as shown in FIG.
_{n + 1}From the threshold TH1 with the dot distance along the contour point sequence
The contour point farther away is the contour point R_n-6, R_n-7, ... and rings
Guo point R_{n + 8}, R_{n + 9}, ... These conditions 1 are satisfied
Among the contour points, the contour point R _n-6Attention point C_{n + 1}Seat from
The target distance is about 6.71 (= √ (3^Two+6^Two)).
Contour point R for this_{n + 8}Attention point C_{n + 1}Coordinate distance from
The separation is about 7.07 (= √ (1^Two+7^Two)). Yo
And contour point R_n-6Is the contour point NR_{n + 1}Becomes Contour point NR
_nAnd NR_{n + 1}The feature point extraction unit 23 that has detected the
Table, the contour point NR_nAnd NR_{n + 1}Contour between
The number of contour points (the number of dots) on the point sequence is determined, and FIG.
As shown in (b), the number of contour points is a predetermined second threshold value.
If it is larger than TH2 (for example, 7), the contour point C_{n + 1}(R
_{n + 1}) Is determined as a feature point.

【００２６】ここで、輪郭点ＮＲ_n及びＮＲ_n+1の間の
輪郭点列上の輪郭点数には、該輪郭点ＮＲ_nを始点とし
て輪郭点列を順方向に辿って輪郭点ＮＲ_n+1に至る経路
の輪郭点数と、輪郭点ＮＲ_n+1を始点として輪郭点列を
順方向に辿って輪郭点ＮＲ_nに至る経路の輪郭点数の２
種類が存在するが、２種類のうち輪郭点数が小さい方を
採用する。特徴点抽出部２３は、さらに、注目点を１つ
進めてＲ_n+2を注目点Ｃ_n+2とし、以下同様の処理を行
い、注目点が輪郭テーブルの最後の輪郭点Ｒ_eになるま
で繰り返す。なお、輪郭点ＮＲ₁と輪郭点ＮＲ_eの間の
輪郭上の輪郭点数が予め定めた閾値ＴＨ２より大きい場
合には、Ｃ₁も特徴点とする。[0026] Here, the number of contour points on the contour point sequence between contour points NR _n and NR n _{+ 1} is the contour point follows the contour point sequence in the forward direction as a starting point the contour points NR _n NR n ₊ the contour points of the route to _1, 2 contour points of a path leading to the contour point NR _n follows the contour point sequence in the forward direction starting from the contour points NR n _{+ 1}
Although there are types, the one with the smaller number of contour points is used among the two types. Feature point extraction unit 23, further, the R _{n + 2} as the target point C _{n + 2} point of interest one advanced, performs the same processing hereinafter target point is the last contour point R _e of the contour table Repeat until Incidentally, in the case greater than the threshold value TH2 contour points on the contour are determined in advance between the contour points NR ₁ and contour point NR _e is, C ₁ is also a feature point.

【００２７】セグメントが複数ある文字（輪郭テーブル
が複数ある文字）については、それぞれのセグメントご
との輪郭テーブルごとに上記処理を行う。さらに、特徴
点抽出部２３は、特徴点として抽出された輪郭点に対し
て、輪郭テーブルの特徴点か否かを示す欄の“０”を、
特徴点であることを示すように、“１”に置き換え、そ
の輪郭テーブルを凹凸検出部２４に渡す。For a character having a plurality of segments (a character having a plurality of contour tables), the above processing is performed for each contour table of each segment. Further, the feature point extracting unit 23 sets “0” in a column indicating whether or not the feature point of the contour table is a feature point with respect to the contour point extracted as the feature point,
It is replaced with “1” to indicate that it is a feature point, and the contour table is passed to the unevenness detecting unit 24.

【００２８】図８は、凹凸検出部２４による輪郭テーブ
ルの走査を示す図である。凹凸検出部２４では、抽出し
た特徴点の状態を示すために、以下のように、凸特徴点
か凹特徴点かを判別する。凸特徴点とは、文字の出つ張
りの部分に存在するもので、端点や急な弧を描く外側の
頂点に相当する。また、凹特徴点とは文字のへこんだ部
分に存在するもので、分岐点や交差点や急な弧を描く内
側の頂点に相当する。凹凸検出部２４は、与えられた輪
郭テーブルを走査し、特徴点となる輪郭点が見つかった
ら、図８のように、この特徴点の輪郭点から予め定めた
閾値ＴＨ３だけ輪郭点列を順方向に進んだ番号の輪郭点
と、該閾値ＴＨ３だけ輪郭点列の逆方向に戻った番号の
輪郭点を検出する。FIG. 8 is a diagram showing scanning of the contour table by the unevenness detecting section 24. In order to indicate the state of the extracted feature point, the unevenness detection unit 24 determines whether the feature point is a convex feature point or a concave feature point as follows. The convex feature point is present at the protruding portion of the character, and corresponds to an end point or an outer vertex in which a sharp arc is drawn. In addition, the concave feature point exists in the concave portion of the character, and corresponds to a branch point, an intersection, or an inner vertex in which a sharp arc is drawn. The concavity and convexity detecting unit 24 scans the given contour table, and when a contour point serving as a feature point is found, as shown in FIG. 8, the contour point sequence is forwardly shifted by a predetermined threshold TH3 from the contour point of the feature point. And the contour point with the number returned in the opposite direction of the contour point sequence by the threshold value TH3 are detected.

【００２９】例えば、特徴点であると判断された輪郭点
をＲ_nとすると、凹凸検出部２４は、該輪郭点Ｒ_nの座
標（Ｘ_n，Ｙ_n）と、この輪郭点Ｒ_nから閾値ＴＨ３だ
け順方向に進んだ輪郭点Ｒ_n+TH3の座標（Ｘ_n+TH3，Ｙ
_n+TH3）と、閾値ＴＨ３だけ逆方向に進んだ輪郭点Ｒ
_n-TH3の座標（Ｘ_n-TH3，Ｙ_n-TH3）とを、輪郭テーブ
ルから検出する。なお、輪郭点Ｒ_nから閾値ＴＨ３だけ
順方向に進んだ輪郭点を検出するとき、輪郭テーブルの
最後を超えてしまうような場合は、当該輪郭テーブルの
最初に戻って検出を続ける。また、輪郭点Ｒ_nから閾値
ＴＨ３だけ輪郭点の逆方向に戻った輪郭点を検出すると
き、輪郭テーブルの最初を超えてしまうような場合は、
当該輪郭テーブルの最後から検出を続ける。[0029] For example, when a contour point is determined to be the feature point and R _n, the concave-convex detecting part 24, the contour point R _n of coordinate (X _n, Y _n) and the threshold from the contour point R _n The coordinates ( _{Xn + TH3} , Y) of the contour point Rn _{+ TH3} , which has advanced in the forward direction by _TH3
_{n + TH3} ) and the contour point R advanced in the opposite direction by the threshold value TH3.
_n-TH3 of coordinates and _{(X n-TH3, Y n} -TH3), detects from the contour table. Incidentally, when detecting the contour points contour point advanced from R _n just forward threshold TH3, when that would exceed the last contour table continues to detect back first of the contour table. Further, when detecting the contour points back in the opposite direction only contour points threshold TH3 from the contour point R _n, if that would exceed the first contour table,
The detection is continued from the end of the contour table.

【００３０】３点の座標が決定したら、該座標に基づき
次の式（１）〜（５）を計算する。ｘ１＝Ｘ_n+TH3−Ｘ_n-TH3 ・・・（１）ｙ１＝Ｙ_n+TH3−Ｙ_n-TH3 ・・・（２）ｘ２＝Ｘ_n−Ｘ_n-TH3 ・・・（３）ｙ２＝Ｙ_n−Ｙ_n-TH3 ・・・（４）ｘ１×ｙ２−ｙ１×ｘ２＜０・・・（５）（５）式を満たせば特徴点Ｒ_nを凹特徴点、満たさなけ
れば凸特徴点と判定する。判定した結果が凸特徴点であ
れば、これを示すために、輪郭点Ｒ_nの特徴点の欄に例
えば“１”を書込み、凹特徴点であれば“２”を書き込
む。When the coordinates of the three points are determined, the following equations (1) to (5) are calculated based on the coordinates. _{_{x1 = X n + TH3 -X n}} -TH3 ··· (1) y1 = Y n + TH3 -Y n-TH3 ··· (2) x2 = X n -X n-TH3 ··· (3) y2 = Y _n -Y _n-TH3 (4) x1 × y2-y1 × x2 <0 (5) If the equation (5) is satisfied, the feature point R _n is a concave feature point; Judge as a point. If the judgment result is convex feature point, to indicate this, writing "1", for example in the field of the feature points of the contour point R _n, if the concave feature point writing "2".

【００３１】図９は、凹特徴点と凸特徴点の検出結果を
示す図である。凹凸検出部２４は、以上の（１）〜
（５）式を用いた処理を、特徴点として抽出された輪郭
点すべてにおいて行う。なお、セグメントが複数ある文
字（輪郭テーブルが複数ある文字）については、それぞ
れの輪郭テーブルごとに同様の処理を行う。これらの処
理を例えば図２の画像に対して行うと、図９のように、
凹特徴点と凸特徴点の位置が求められる。凹凸検出部２
４は、凹特徴点及び凸特徴点の情報を付加した輪郭テー
ブルを文字認識部１３に与える。文字認識部１３は輪郭
テーブルの輪郭点や特徴点を参考にしながら文字認識を
実施する。FIG. 9 is a diagram showing detection results of concave feature points and convex feature points. The unevenness detection unit 24 performs the above (1) to
The processing using the equation (5) is performed on all the contour points extracted as the feature points. For a character having a plurality of segments (a character having a plurality of contour tables), the same processing is performed for each contour table. When these processes are performed on the image of FIG. 2, for example, as shown in FIG.
The positions of the concave feature points and the convex feature points are obtained. Irregularity detector 2
4 gives the character recognition unit 13 a contour table to which information of the concave feature points and the convex feature points is added. The character recognition unit 13 performs character recognition while referring to contour points and feature points in the contour table.

【００３２】以上のように、この第１の実施形態では、
画像格納部２１及び輪郭点抽出部２２を備えた特徴抽出
装置２０に特徴点抽出部２３を設け、任意の注目点Ｃ_n
との間の輪郭点列上のドット距離が閾値ＴＨ１より離
れ、かつ、該注目点Ｃ_nからの座標距離が最も近い輪郭
点ＮＲ_nを検出し、さらに、その注目点Ｃ_nから１つず
らした注目点Ｃ_n+1との間の輪郭点列上のドット距離が
該閾値ＴＨ１より離れ、かつ、該注目点Ｃ_n+1からの座
標距離が最も近い輪郭点ＮＲ_n+1を検出し、検出した輪
郭点ＮＲ_n，ＮＲ_n+1の間の輪郭点列上のドット数が予
め定めた閾値ＴＨ２よりも大きいときに、その注目点Ｃ
_n+1を特徴点として検出する構成にしている。そのた
め、この特徴抽出装置２０は、端点や角度のきつい分岐
及び交差点を特徴点として抽出するばかりでなく、緩や
かな弧となる部分を特徴点として抽出せずに、図９のよ
うに、輪郭の変化の少ない分岐点や交差点からも特徴点
を確実に抽出することができる。さらに、この特徴抽出
装置２０は、凹凸検出部２４を設け、特徴点が凸特徴点
か凹特徴点かを検出するようにしたので、後段の文字認
識部１３が行う文字認識で適格な処理が行えるようにな
る。As described above, in the first embodiment,
A feature point extraction unit 23 is provided in a feature extraction device 20 including an image storage unit 21 and a contour point extraction unit 22, and an arbitrary point of interest C _n
Dot distance on the contour point sequence between the leaves than the threshold TH1, and detects the coordinate distance is closest contour point NR _n from the noted point C _n, further one of the target point C _n shift dot distance on the contour point sequence between the target point C _{n + 1} leaves than the threshold value TH1, and to detect the contour points NR n _{+ 1} coordinate distance closest from the noted point C _{n + 1} When the number of dots on the contour point sequence between the detected contour points NR _n and NR _{n + 1} is larger than a predetermined threshold TH2, the target point C
It is configured to detect _{n + 1} as a feature point. For this reason, the feature extracting device 20 not only extracts the end points and the steep branches and intersections as the feature points, but also does not extract the gentle arc portion as the feature point. A feature point can be reliably extracted from a branch point or an intersection where there is little change. Further, the feature extracting device 20 is provided with the unevenness detecting unit 24 and detects whether the feature point is a convex feature point or a concave feature point, so that the process suitable for the character recognition performed by the character recognition unit 13 at the subsequent stage can be performed. Will be able to do it.

【００３３】第２の実施形態図１０は、本発明の第２の実施形態を示す特徴抽出装置
を備えた文字認識装置の構成図である。この文字認識装
置は、第１の実施形態と同様の画像記憶媒体１１と文字
切出部１２と文字認識１３とを備えると共に、該文字切
出部１２及び文字認識部１３の間に接続された第１の実
施形態とは異なる特徴抽出装置３０を備えている。特徴
抽出装置３０は、第１の実施形態と同様に、文字切出部
１２から与えられた２値画像を格納する画像格納部３１
と、該画像格納部３１に接続された輪郭点抽出部３２と
を備えている。画像格納部３１の出力側には、さらに、
該画像格納部３１に格納された２値画像の線幅Ｗを検出
する線幅検出部３３が接続されている。 Second Embodiment FIG. 10 is a block diagram of a character recognition device provided with a feature extraction device according to a second embodiment of the present invention. This character recognition device includes an image storage medium 11, a character cutout unit 12, and a character recognition unit 13 similar to those of the first embodiment, and is connected between the character cutout unit 12 and the character recognition unit 13. A feature extraction device 30 different from that of the first embodiment is provided. As in the first embodiment, the feature extracting device 30 includes an image storage unit 31 that stores the binary image given from the character extracting unit 12.
And an outline point extraction unit 32 connected to the image storage unit 31. The output side of the image storage unit 31 further includes:
A line width detection unit 33 that detects a line width W of the binary image stored in the image storage unit 31 is connected.

【００３４】輪郭点抽出部３２及び線幅検出部３３の出
力側に、特徴点抽出部３４が接続され、該特徴点抽出部
３４及び線幅検出部３３の出力側に、凹凸検出部３５が
接続されている。特徴点抽出部３４は、輪郭点を走査し
て文字の特徴的なところを特徴点として抽出するもので
あり、凹凸検出部３５は、特徴点抽出部３４で抽出され
た特徴点が凸特徴点か凹特徴点かを検出するものであ
る。凹凸検出部３５の出力側が、文字認識部１３の入力
側に接続されている。A feature point extracting section 34 is connected to the output sides of the contour point extracting section 32 and the line width detecting section 33, and an unevenness detecting section 35 is connected to the output sides of the feature point extracting section 34 and the line width detecting section 33. It is connected. The feature point extracting unit 34 scans the contour points and extracts characteristic portions of the character as feature points, and the unevenness detecting unit 35 determines whether the feature points extracted by the feature point extracting unit 34 are convex feature points. Or a concave feature point. The output side of the unevenness detection unit 35 is connected to the input side of the character recognition unit 13.

【００３５】次に、図１０の文字認識装置の動作を説明
する。文字切出部１２では、文字が記載された文書や帳
票等の２値画像が格納された画像記録媒体１１から２値
画像を受け取り、これを１文字単位に切り出す。画像格
納部３１は文字切出部１２から１文字単位の２値画像を
受け取って格納する。輪郭点抽出部３２は、画像格納部
３１から１文字単位の２値画像を受け取り、第１の実施
形態と同様の処理により、該２値画像の輪郭点を抽出す
る。一方、線幅検出部３３は、画像格納部３１から１文
字単位の２値画像を参照して、該２値画像における線幅
Ｗを検出する。線幅Ｗを検出する方法はいくつかある
が、例えば以下の方法を用いる。Next, the operation of the character recognition device shown in FIG. 10 will be described. The character extracting unit 12 receives a binary image from the image recording medium 11 in which a binary image such as a document or a form in which characters are described is stored, and cuts out the binary image in units of one character. The image storage unit 31 receives and stores a binary image in units of one character from the character extraction unit 12. The contour point extracting unit 32 receives the binary image for each character from the image storage unit 31 and extracts the contour points of the binary image by the same processing as in the first embodiment. On the other hand, the line width detection unit 33 detects the line width W in the binary image by referring to the binary image of one character unit from the image storage unit 31. There are several methods for detecting the line width W. For example, the following method is used.

【００３６】図１１は、線幅検出用の観測窓を示す図で
ある。線幅検出部３３は、図１１に示すようなａ０〜ａ
３からなる２行２列の観測窓を用い、画像格納部３１に
格納された画像データを構成する全画素を観測窓ａ０で
覗くようにしながら２行２列の観測窓でラスタ走査す
る。そして、２行２列の観測窓から見える画像格納部３
１のメモリ上の画素がすべて文字を構成する画素（黒画
素）のときにカウントを１つ増やすように順次４黒点数
Ｑを計数する。また、観測窓ａ０から覗いた画像格納部
３１のメモリ上の画素が文字を構成する画素のときカウ
ントを１つ増やすように順次総黒点数Ａを計数する。た
だし、４黒点数Ｑおよび総黒点数Ａの初期値は０とす
る。FIG. 11 is a view showing an observation window for detecting a line width. The line width detection unit 33 includes a0 to a as shown in FIG.
Using an observation window of 2 rows and 2 columns, a raster scan is performed using an observation window of 2 rows and 2 columns while observing all pixels constituting the image data stored in the image storage unit 31 with the observation window a0. Then, the image storage unit 3 seen from the observation window of 2 rows and 2 columns
When all pixels on one memory are pixels (black pixels) constituting a character, the number of four black points Q is sequentially counted so as to increase the count by one. When the pixel on the memory of the image storage unit 31 viewed from the observation window a0 is a pixel constituting a character, the total number of black spots A is sequentially counted so as to increase the count by one. However, the initial values of the number of four black points Q and the total number of black points A are 0.

【００３７】次に、次の（６）式によって２値画像の線
幅Ｗを算出する。Ｗ＝Ａ／（Ａ―Ｑ）・・・（６）ちなみに、図２に示す文字の線幅Ｗは上記方法によりＷ
＝２．５になる。Next, the line width W of the binary image is calculated by the following equation (6). W = A / (AQ) (6) By the way, the line width W of the character shown in FIG.
= 2.5.

【００３８】次に、特徴点抽出部３４では、輪郭点抽出
部３２で作成された輪郭テーブルと、線幅検出部３３で
検出された線幅Ｗとを受け取り、以下のようにして特徴
点を抽出する。特徴点抽出部３４は、まず、線幅Ｗを利
用して閾値ＴＨ１と閾値ＴＨ２を、次の（７），（８）
式で算出する。ＴＨ１＝Ｗ×Ｋ１・・・（７）ＴＨ２＝Ｗ×Ｋ２・・・（８）但し、Ｋ１，Ｋ２；定数例えば、Ｋ１を２．５及びＫ２を３．０とすると、閾値
ＴＨ１は６、閾値ＴＨ２は７となる（小数点以下切り捨
て）。閾値ＴＨ１，ＴＨ２が算出されれば後の処理は第
１の実施形態の特徴点抽出部２３と同様なので、詳細な
説明は省略する。特徴点抽出部３４で特徴点が抽出され
ると、その特徴点が書込まれた輪郭テーブルが凹凸検出
部３５に送られる。Next, the feature point extracting section 34 receives the contour table created by the contour point extracting section 32 and the line width W detected by the line width detecting section 33, and determines the characteristic points as follows. Extract. First, the feature point extracting unit 34 uses the line width W to determine the threshold values TH1 and TH2 in the following (7) and (8).
It is calculated by the formula. TH1 = W × K1 (7) TH2 = W × K2 (8) where K1 and K2 are constants. For example, if K1 is 2.5 and K2 is 3.0, the threshold value TH1 is 6. The threshold value TH2 is 7 (rounded down after the decimal point). If the threshold values TH1 and TH2 are calculated, the subsequent processing is the same as that of the feature point extracting unit 23 of the first embodiment, and a detailed description will be omitted. When the feature points are extracted by the feature point extraction unit 34, the contour table in which the feature points are written is sent to the unevenness detection unit 35.

【００３９】凹凸検出部３５では、特徴点抽出部３４か
ら輪郭テーブルを受け取り、線幅検出部３３から線幅Ｗ
を受け取り、次の（９）により、線幅Ｗから閾値ＴＨ３
を算出する。ＴＨ３＝Ｗ×Ｋ３・・・（９）但し、Ｋ３；定数Ｋ３を２．０とすると、図２に示す文字での閾値ＴＨ３
は５（小数点以下切り捨て）となる。閾値ＴＨ３が算出
されれば後の処理は第１の実施形態の凹凸検出部２４と
同様の処理を行う。凹凸検出部３５は輪郭テーブルを文
字認識部１３に与え、文字認識部１３は輪郭テーブルの
輪郭点や特徴点を参考にしながら文字を認識する。The unevenness detecting section 35 receives the contour table from the feature point extracting section 34, and receives the line width W from the line width detecting section 33.
From the line width W to the threshold value TH3 according to the following (9).
Is calculated. TH3 = W × K3 (9) where K3 is a constant. If K3 is 2.0, the threshold value TH3 in the character shown in FIG.
Is 5 (rounded down after the decimal point). If the threshold value TH3 is calculated, the subsequent processing performs the same processing as that performed by the unevenness detection unit 24 of the first embodiment. The unevenness detecting section 35 gives the contour table to the character recognizing section 13, and the character recognizing section 13 recognizes the character while referring to the contour points and characteristic points of the contour table.

【００４０】以上のように、この第２の実施形態では、
線幅検出部３３を設け、線幅Ｗに応じて特徴点抽出部３
４で使用する閾値ＴＨ１，ＴＨ２を算出し、特徴点を抽
出している。よって、２値画像の文字の線幅に影響され
ないようになり、入力パタンの分解能が変化しても常に
安定した特徴点を抽出することが可能である。さらに、
線幅Ｗに応じて凹凸検出部３５で使用する閾値ＴＨ３も
算出し、特徴点の凹凸を検出するので、同様に、常に安
定した特徴点の凹凸情報を検出できる。As described above, in the second embodiment,
A line width detection unit 33 is provided, and the feature point extraction unit 3 is provided in accordance with the line width W.
The threshold values TH1 and TH2 used in step No. 4 are calculated, and feature points are extracted. Therefore, it is not affected by the line width of the character of the binary image, and it is possible to always extract a stable feature point even if the resolution of the input pattern changes. further,
Since the threshold value TH3 used by the unevenness detection unit 35 is also calculated according to the line width W, and the unevenness of the feature point is detected, similarly, the stable unevenness information of the feature point can be detected.

【００４１】第３の実施形態図１２は、本発明の第３の実施形態を示す特徴抽出装置
を備えた文字認識装置の構成図である。この文字認識装
置は、第１の実施形態と同様の画像記憶媒体１１と文字
切出部１２と文字認識１３とを備えると共に、該文字切
出部１２及び文字認識部１３の間に接続された第１の実
施形態とは異なる特徴抽出装置４０を備えている。特徴
抽出装置４０は、第１の実施形態と同様に、文字切出部
１２から与えられた２値画像を格納する画像格納部４１
と、該画像格納部４１に接続された輪郭点抽出部４２と
を備えている。 Third Embodiment FIG. 12 is a block diagram of a character recognition device provided with a feature extraction device according to a third embodiment of the present invention. This character recognition device includes an image storage medium 11, a character cutout unit 12, and a character recognition unit 13 similar to those of the first embodiment, and is connected between the character cutout unit 12 and the character recognition unit 13. A feature extraction device 40 different from that of the first embodiment is provided. As in the first embodiment, the feature extraction device 40 includes an image storage unit 41 that stores the binary image given from the character cutout unit 12.
And an outline point extraction unit 42 connected to the image storage unit 41.

【００４２】輪郭点抽出部４２には、間引き走査閾値検
出部４３が接続され、該輪郭点抽出部４２及び間引き走
査閾値検出部４３の出力側に、特徴点抽出部４４が接続
されている。間引き走査閾値検出部４３は、輪郭点抽出
部４２で抽出された輪郭点の量に応じて間引き走査閾値
を決定するものである。特徴点抽出部４４は、間引き走
査閾値検出部４３で検出した閾値を用い、輪郭点抽出部
４２で抽出された輪郭点を走査して文字の特徴的なとこ
ろを特徴点として抽出するものである。特徴点抽出部４
４の出力側に、該特徴点抽出部４４で抽出された特徴点
が凸特徴点か凹特徴点かを検出する凹凸検出部４５が接
続され、該凹凸検出部４５の出力側が文字認識部１３の
入力側に接続されている。The contour point extracting section 42 is connected to a thinning-out scanning threshold value detecting section 43, and a feature point extracting section 44 is connected to the output side of the contour point extracting section 42 and the thinning-out scanning threshold value detecting section 43. The thinning-scanning-threshold detecting unit 43 determines a thinning-scanning threshold according to the amount of the contour points extracted by the contour-point extracting unit 42. The feature point extracting unit 44 scans the contour points extracted by the contour point extracting unit 42 using the threshold value detected by the thinning scanning threshold value detecting unit 43, and extracts a characteristic portion of the character as a feature point. . Feature point extraction unit 4
4 is connected to an unevenness detecting unit 45 for detecting whether the feature point extracted by the feature point extracting unit 44 is a convex feature point or a concave feature point, and the output side of the unevenness detecting unit 45 is connected to the character recognition unit 13. Is connected to the input side.

【００４３】次に、図１２の文字認識装置の動作を説明
する。文字切出部１２では、文字が記載された文書や帳
票等の２値画像が格納された画像記録媒体１１から２値
画像を受け取り、これを１文字単位に切り出す。画像格
納部４１は、文字切出部１２から切り出された２値画像
を受け取って格納する。輪郭点抽出部４２は、画像格納
部４１から１文字単位の２値画像を受け取り、第１の実
施形態と同様の処理により、該２値画像の輪郭点を抽出
する。間引き走査閾値検出部４３は輪郭点抽出部４２で
作成された輪郭テーブルを受け取り、全輪郭点数を計数
する。なお、セグメントが複数ある文字（輪郭テーブル
が複数ある文字）については、すべての輪郭テーブルの
輪郭点数を計数する。計数値をＲｓｕｍとする。Next, the operation of the character recognition device shown in FIG. 12 will be described. The character extracting unit 12 receives a binary image from the image recording medium 11 in which a binary image such as a document or a form in which characters are described is stored, and cuts out the binary image in units of one character. The image storage unit 41 receives and stores the binary image clipped from the character clipping unit 12. The contour point extracting unit 42 receives the binary image of one character unit from the image storage unit 41, and extracts the contour points of the binary image by the same processing as in the first embodiment. The thinning scanning threshold detecting unit 43 receives the contour table created by the contour point extracting unit 42, and counts all the contour points. For a character having a plurality of segments (a character having a plurality of contour tables), the number of contour points of all the contour tables is counted. Let the count value be Rsum.

【００４４】計数値Ｒｓｕｍを求めた間引き走査閾値検
出部４３は、次の（１０），（１１）式により、第１及
び第２の間引き走査閾値ＭＴＨ１及びＭＴＨ２を算出す
る。ＭＴＨ１＝Ｒｓｕｍ／Ｎ１（小数点以下切り捨て）・・・（１０）ＭＴＨ２＝Ｒｓｕｍ／Ｎ２（小数点以下切り捨て）・・・（１１）但し、Ｎ１，Ｎ２；定数（例えば１００）ＭＴＨ１＝０のときはＭＴＨ１＝１ＭＴＨ２＝０のときはＭＴＨ２＝１The thinning-scanning-threshold detecting unit 43 that has obtained the count value Rsum calculates the first and second thinning-scanning thresholds MTH1 and MTH2 according to the following equations (10) and (11). MTH1 = Rsum / N1 (Truncation below decimal point) (10) MTH2 = Rsum / N2 (Truncation below decimal point) (11) where N1, N2; constant (for example, 100) When MTH1 = 0, MTH1 = 1 When MTH2 = 0, MTH2 = 1

【００４５】特徴点抽出部４４は、輪郭点抽出部４２か
ら輪郭テーブルを入力し、間引き走査閾値検出部４３か
ら間引き走査閾値ＭＴＨ１とＭＴＨ２とを受け取り、以
下のような処理で特徴点を抽出する。まず、輪郭テーブ
ルの最初の輪郭点Ｒ₁を選択して注目点Ｃ₁とし、輪郭
点テーブルを最初から間引き走査閾値ＭＴＨ２ごとに間
引きして走査し、該注目点Ｃ ₁のＸ座標Ｘ₁及びＹ座標
Ｙ₁から、次の条件を満たす輪郭点ＮＲ₁を検出する。条件１；当該輪郭点と注目点Ｃ₁との間の輪郭点列上の
ドット距離が第１の閾値ＴＨ１より離れている。条件２；（条件１）を満たした輪郭点のうちの注目点Ｃ
₁の座標（Ｘ₁，Ｙ₁）に最も座標距離が近い輪郭点閾値ＴＨ１は例えば６である。The feature point extractor 44 is provided with the contour point extractor 42
The contour table is input from the
Receiving the thinning-out scanning threshold values MTH1 and MTH2 from
Feature points are extracted by the following processing. First, the contour tape
First contour point R₁Select the point of interest C₁And the contour
The point table is set from the beginning for every thinning-out scanning threshold MTH2.
Draw and scan the point of interest C ₁X coordinate of₁And Y coordinate
Y₁From the contour point NR satisfying the following condition₁Is detected. Condition 1: the contour point and the attention point C₁On the sequence of contour points between
The dot distance is greater than the first threshold TH1. Condition 2; attention point C among contour points satisfying (condition 1)
₁Coordinates (X₁, Y₁) Is, for example, six.

【００４６】次に、注目点を間引き閾値ＭＴＨ１進めて
輪郭点Ｒ_1+MTH1を注目点Ｃ_1+MTH1とし、同様に上記条件
を用いて輪郭点ＮＲ_1+MTH1を求める。そして、輪郭テー
ブルを参照してＮＲ₁とＮＲ_1+MTH1の間の輪郭点上の輪
郭点数（ドット数）を求め、その輪郭点数が例えば７に
設定した閾値ＴＨ２より大きい場合、Ｃ_1+MTH1を特徴点
と決定する。なお、輪郭点ＮＲ₁とＮＲ_1+MTH1との間の
輪郭点列上の輪郭点数には、ＮＲ₁を始点としてＮＲ
_1+MTH1までの輪郭点数とＮＲ_1+MTH1を始点としてＮＲ₁
に至る輪郭点数の２種類が存在するが、２種類のうち輪
郭点数が小さい方を採用する。Next, the contour point R _{1 + MTH1} is _set to the attention point C _{1 + MTH1} by advancing the point of interest to the thinning threshold MTH1, and the contour point NR _{1 + MTH1} is similarly obtained using the above conditions. Then, a NR ₁ and NR 1 _{+ MTH1} contour points on the contour points between the (number of dots) with reference to the contour table, if the number of contour points is larger than the threshold TH2 is set to, for example 7, C 1 _{+ MTH1} Is determined as a feature point. Note that the number of contour points on the contour point sequence between the contour points NR ₁ and NR _{1 + MTH1} includes NR ₁ as a starting point and NR ₁
NR ₁ contour points and NR 1 _{+ MTH1} to _{1 + MTH1} as the starting point
, There are two types of contour points, and one of the two types having the smaller number of contour points is adopted.

【００４７】特徴点Ｃ_1+MTH1を求めた後、さらに、注目
点をＭＴＨ１進めてＲ_1+MTH1x2を注目点Ｃ_1+MTH1x2と
し、以下同様に処理を行い、注目点が輪郭テーブルの最
後に達するまで繰り返す。但し、セグメントが複数ある
文字（輪郭テーブルが複数ある文字）については、それ
ぞれの輪郭テーブルごとに同様の処理を行う。特徴点抽
出部４４では、特徴点として抽出された輪郭点に対し
て、輪郭テーブルの特徴点の欄を特徴点であることを示
すように“１”に置き換え、その輪郭テーブルを凹凸検
出部４５に渡す。凹凸検出部４５は、第１の実施形態と
同様の処理を行い、輪郭テーブルを文宇認識部１３に与
える。文字認識部１３は輪郭テーブルの輪郭点や特徴点
を参考にしながら文字を認識する。After _{finding the} feature point C _{1 + MTH1} , the point of interest is further advanced by MTH1 to make R _{1 + MTH1x2 a} point of interest C1 _{+ MTH1x2,} and the same process is performed. Repeat until reaching. However, for a character having a plurality of segments (a character having a plurality of contour tables), the same processing is performed for each contour table. The feature point extracting unit 44 replaces the contour points extracted as feature points with “1” in the feature point column of the contour table to indicate that the feature points are feature points, and replaces the contour table with the unevenness detecting unit 45. Pass to. The unevenness detection unit 45 performs the same processing as in the first embodiment, and gives the contour table to the text recognition unit 13. The character recognizing unit 13 recognizes characters while referring to contour points and feature points in the contour table.

【００４８】以上のように、この第３の実施形態では、
間引き走査閾値検出部４３を設け、特徴点を抽出する際
に輪郭点の量に応じて間引き走査を行うようにしたの
で、解像度が高い場合等で、輪郭点が多くなる場合にお
いても、処理時間を抑えて特徴点を抽出できる。なお、
単に、間引き走査を行うと特徴点が若干ずれる場合があ
る。解像度が高い場合は多少のずれは気にならない程度
だが解像度が低い場合は多少のずれによって特徴的なと
ころから大きくずれてしまう。しかし、この第３の実施
形態では、輪郭点の量に応じて間引き量を決定するの
で、その問題に対処できる。As described above, in the third embodiment,
Since the thinning-scanning threshold detection unit 43 is provided to perform thinning-scanning in accordance with the amount of contour points when extracting feature points, the processing time can be reduced even when the number of contour points increases when resolution is high. And feature points can be extracted. In addition,
Simply performing the thinning-out scanning may slightly shift the feature points. When the resolution is high, a slight shift is not noticeable, but when the resolution is low, a slight shift greatly deviates from a characteristic portion. However, in the third embodiment, since the thinning amount is determined according to the amount of the contour points, the problem can be dealt with.

【００４９】第４の実施形態図１３は、本発明の第４の実施形態を示す特徴抽出装置
を備えた文字認識装置の構成図である。この文字認識装
置は、第１の実施形態と同様の画像記憶媒体１１と文字
切出部１２と文字認識１３とを備えると共に、該文字切
出部１２及び文字認識部１３の間に接続された第１の実
施形態とは異なる特徴抽出装置５０を備えている。特徴
抽出装置５０は、第１の実施形態と同様に、文字切出部
１２から与えられた２値画像を格納する画像格納部５１
と、該画像格納部５１に接続された輪郭点抽出部５２と
を備えている。画像格納部５１の出力側には、さらに、
該画像格納部５１に格納された２値画像の線幅Ｗを検出
する線幅検出部５３が接続されている。線幅検出部５３
には、線幅Ｗの値に応じて間引き走査閾値を検出する間
引き走査閾値検出部５４が接続されている。 Fourth Embodiment FIG. 13 is a block diagram of a character recognition device provided with a feature extracting device according to a fourth embodiment of the present invention. This character recognition device includes an image storage medium 11, a character cutout unit 12, and a character recognition unit 13 similar to those of the first embodiment, and is connected between the character cutout unit 12 and the character recognition unit 13. A feature extraction device 50 different from that of the first embodiment is provided. As in the first embodiment, the feature extraction device 50 includes an image storage unit 51 that stores the binary image given from the character extraction unit 12.
And an outline point extraction unit 52 connected to the image storage unit 51. The output side of the image storage unit 51 further includes:
A line width detection unit 53 that detects a line width W of the binary image stored in the image storage unit 51 is connected. Line width detector 53
Is connected to a thinning-scanning-threshold detecting unit 54 that detects a thinning-scanning threshold in accordance with the value of the line width W.

【００５０】輪郭点抽出部５２、線幅検出部５３及び間
引き走査閾値検出部５４の出力側に、特徴点抽出部５５
が接続され、該線幅検出部５３及び特徴点抽出部５５の
出力側に、凹凸検出部５６が接続されている。特徴点抽
出部５５は、間引き走査閾値検出部５４から与えられた
間引き閾値を用い、輪郭点を走査して文字の特徴的なと
ころを特徴点として抽出するものであり、凹凸検出部５
６は、特徴点抽出部５５で抽出された特徴点が凸特徴点
か凹特徴点かを検出するものである。凹凸検出部５６の
出力側が、文字認識部１３の入力側に接続されている。On the output side of the contour point extraction unit 52, the line width detection unit 53, and the thinning scanning threshold value detection unit 54, a feature point extraction unit 55
Is connected to an output side of the line width detection unit 53 and the feature point extraction unit 55. The feature point extraction unit 55 uses the thinning threshold value provided from the thinning scanning threshold value detection unit 54 to scan contour points and extract characteristic portions of characters as feature points.
Numeral 6 detects whether the feature point extracted by the feature point extraction unit 55 is a convex feature point or a concave feature point. The output side of the unevenness detection unit 56 is connected to the input side of the character recognition unit 13.

【００５１】次に、図１３の文字認識装置の動作を説明
する。文字切出部１２では、文字が記載された文書や帳
票等の２値画像が格納された画像記録媒体１１から２値
画像を受け取り、これを１文字単位に切り出す。画像格
納部５１は文字切出部１２から１文字単位の２値画像を
受け取って格納する。輪郭点抽出部５２は、画像格納部
５１から１文字単位の２値画像を受け取り、第１の実施
形態と同様の処理により、当該２値画像の輪郭点を抽出
する。一方、線幅検出部５３は、画像格納部５１に格納
された１文字単位の２値画像を参照し、該２値画像の線
幅Ｗを検出する。なお、線幅Ｗの検出方法は第２の実施
形態と同様とする。Next, the operation of the character recognition device shown in FIG. 13 will be described. The character extracting unit 12 receives a binary image from the image recording medium 11 in which a binary image such as a document or a form in which characters are described is stored, and cuts out the binary image in units of one character. The image storage unit 51 receives a binary image in units of one character from the character extracting unit 12 and stores the binary image. The contour point extraction unit 52 receives the binary image of one character unit from the image storage unit 51, and extracts the contour points of the binary image by the same processing as in the first embodiment. On the other hand, the line width detection unit 53 detects the line width W of the binary image by referring to the one-character unit binary image stored in the image storage unit 51. Note that the method of detecting the line width W is the same as in the second embodiment.

【００５２】図１４は、線幅と端点の関係を示す図であ
る。一般的な太さの文字線の場合、解像度が高いほど線
幅は太くなるため、線幅の大きさで解像度の見当がつ
く。つまり、線幅Ｗが太いほど解像度が高く、それゆ
え、間引き走査閾値も大きくとることができる。しか
し、図１４に示すように、解像度が高くても、もともと
線幅Ｗが極端に細い場合、間引き走査閾値を用いた特徴
点抽出での若干のずれが大きく影響してしまうことがあ
る。このような細い線の場合、解像度が高くても小さい
間引き走査閾値を用いることが望ましい。逆に、線幅Ｗ
が十分に太い場合は若干のずれはあまり影響しない。こ
のような太い線の場合、大きい間引き走査閾値を用いて
処理時間を大幅に短縮することが望ましい。よって、線
幅Ｗの大きさに応じて間引き走査閾値を決定すれば良い
と考えられる。FIG. 14 is a diagram showing the relationship between the line width and the end points. In the case of a character line having a general thickness, the higher the resolution, the thicker the line width. Therefore, the size of the line width gives an indication of the resolution. In other words, the larger the line width W is, the higher the resolution is, and therefore, the thinning-out scanning threshold can be increased. However, as shown in FIG. 14, even if the resolution is high, when the line width W is originally extremely small, a slight shift in feature point extraction using the thinning-out scanning threshold may greatly affect. In the case of such a thin line, it is desirable to use a thinning-out scanning threshold even if the resolution is high. Conversely, line width W
If is large enough, a slight shift has little effect. For such thick lines, it is desirable to use a large thinning scan threshold to significantly reduce processing time. Therefore, it is considered that the thinning-out scanning threshold should be determined according to the size of the line width W.

【００５３】間引き走査閾値検出部５４は、線幅検出部
５３で検出された線幅Ｗを受け取り、次の（１２），
（１３）式で第１及び第２の間引き走査閾値ＭＴＨ１及
びＭＴＨ２を算出する。ＭＴＨ１＝Ｗ／Ｎ３（小数点以下切り捨て）・・・（１２）ＭＴＨ２＝Ｗ／Ｎ４（小数点以下切り捨て）・・・（１３）但し、Ｎ３，Ｎ４；定数（例えば２）ＭＴＨ１＝０のときはＭＴＨ１＝１ＭＴＨ２＝０のときはＭＴＨ２＝１とする。The thinning-out scanning threshold detector 54 receives the line width W detected by the line width detector 53, and receives the following (12),
The first and second thinning-out scanning thresholds MTH1 and MTH2 are calculated by the equation (13). MTH1 = W / N3 (Truncation below decimal point) (12) MTH2 = W / N4 (Truncation below decimal point) (13) where N3, N4; constant (for example, 2) MTH1 when MTH1 = 0 = 1 When MTH2 = 0, MTH2 = 1.

【００５４】特徴点抽出部５５では、輪郭点抽出部５２
から輪郭テーブルを受け取り、線幅検出部５３から線幅
Ｗを受け取り、間引き走査閾値検出部５４から間引き走
査閾値ＭＴＨ１，ＭＴＨ２を受け取り、以下のような処
理で特徴点を抽出する。特徴点抽出部５５は、まず、第
２の実施形態と同様に線幅Ｗを利用し、閾値ＴＨ１と閾
値ＴＨ２を算出する。また、その後の処理は、閾値ＴＨ
１，ＴＨ２と間引き走査閾値ＭＴＨ１，ＭＴＨ２を用い
た第３の実施形態と同様の間引き走査による特徴点抽出
を行う。特徴点抽出部５５で特徴点が抽出されると、そ
の特徴点が書込まれた輪郭テーブルが凹凸検出部５６に
送られる。凹凸検出部５６では、特徴点抽出部５５から
輪郭テーブルを受け取り、線幅検出部５３から線幅Ｗを
受け取り、第２の実施形態と同様に、該線幅Ｗから閾値
ＴＨ３を求め、特徴点が凸特徴点か凹特徴点かを検出す
る。凹凸検出部５６は、特徴点が凸特徴点か凹特徴点か
を示した輪郭テーブルを文字認識部１３に与え、文字認
識部１３は、該輪郭テーブルの輪郭点や特徴点を参考に
しながら文字を認識する。In the feature point extracting section 55, the contour point extracting section 52
, The line width W from the line width detecting unit 53, the thinning scanning threshold values MTH1 and MTH2 from the thinning scanning threshold value detecting unit 54, and feature points are extracted by the following processing. First, the feature point extraction unit 55 calculates the threshold values TH1 and TH2 using the line width W as in the second embodiment. Further, the subsequent processing is performed with the threshold value TH.
A feature point is extracted by thinning-out scanning similar to the third embodiment using the thinning-out scanning thresholds MTH1 and MTH2. When the feature points are extracted by the feature point extraction unit 55, the contour table in which the feature points are written is sent to the unevenness detection unit 56. The unevenness detecting unit 56 receives the contour table from the feature point extracting unit 55, receives the line width W from the line width detecting unit 53, and calculates the threshold value TH3 from the line width W in the same manner as in the second embodiment. Is a convex feature point or a concave feature point. The unevenness detecting unit 56 gives a contour table indicating whether the feature point is a convex feature point or a concave feature point to the character recognizing unit 13. Recognize.

【００５５】以上のように、この第４の実施形態では、
線幅検出部５３と間引き走査閾値５４とを設け、文字の
線幅Ｗに応じて特徴抽出部５５で使用する閾値ＴＨ１，
ＴＨ２を算出し、特徴点を抽出するようにしている。よ
って、文字の線幅Ｗに影響されないため、常に安定した
特徴点を抽出することが可能であると共に、解像度が高
い場合でも、処理時間が抑制できる。さらに、凹凸検出
部５６においても文字の線幅Ｗに応じて閾値を算出して
特徴点の凹凸を検出するので、文字の線幅Ｗに影響され
ず、常に安定して特徴点の凹凸の情報を検出できる。そ
の上、間引き走査閾値ＭＴＨ１，ＭＴＨ２が線幅Ｗに応
じて変化するので、線幅Ｗが細くて特徴点のずれが大き
く影響するような場合は、小さい間引き走査閾値が用ら
れて特徴点をなるべく正確に抽出し、線幅Ｗが太くて特
徴点のずれがあまり影響しないような場合は、大きい間
引き走査閾値が用いられるため、処理時間を大幅に短縮
することができる。そのため、例えば複数の筆記具で記
入されたために、線幅Ｗが一定でない文字が混在するよ
うな帳票や、複数のフォントで印刷されたために線幅が
一定でない文字が混在するような文書を対象とした場合
に、文字線幅に応じて間引き走査が変化するので、とく
に、処理効果が大きくなる。As described above, in the fourth embodiment,
A line width detection unit 53 and a thinning scanning threshold value 54 are provided, and the threshold values TH1 and TH1 used by the feature extraction unit 55 in accordance with the character line width W.
TH2 is calculated, and the feature points are extracted. Therefore, since it is not affected by the line width W of the character, a stable feature point can be always extracted, and the processing time can be suppressed even when the resolution is high. Further, the unevenness detecting section 56 also calculates the threshold value in accordance with the line width W of the character and detects the unevenness of the characteristic point, so that the information of the unevenness of the characteristic point is always stable without being affected by the line width W of the character. Can be detected. In addition, since the thinning-out scanning thresholds MTH1 and MTH2 change in accordance with the line width W, when the line width W is small and the deviation of the feature point has a large effect, a small thinning-out scanning threshold is used to determine the feature point. In the case where extraction is performed as accurately as possible and the line width W is large and the shift of the feature point is not so affected, a large thinning-out scanning threshold value is used, so that the processing time can be greatly reduced. Therefore, for example, a form in which characters with non-uniform line widths W are mixed because of writing with a plurality of writing tools, or a document in which characters with non-uniform line widths are mixed due to being printed with a plurality of fonts, is targeted. In this case, since the thinning-out scanning changes according to the character line width, the processing effect is particularly large.

【００５６】第５の実施形態図１５は、本発明の第５の実施形態を示す特徴抽出装置
を備えた文字認識装置の構成図である。この文字認識装
置は、第１の実施形態と同様の画像記憶媒体１１と文字
切出部１２と文字認識１３とを備えると共に、該文字切
出部１２及び文字認識部１３の間に接続された第１の実
施形態とは異なる特徴抽出装置６０を備えている。特徴
抽出装置６０は、第１の実施形態と同様に、文字切出部
１２から与えられた２値画像を格納する画像格納部６１
と、該画像格納部６１に接続された輪郭点抽出部６２と
を備えている。画像格納部６１の出力側には、さらに、
該画像格納部６１に格納された２値画像の線幅Ｗを検出
する線幅検出部６３が接続されている。輪郭点抽出部６
２には、解像度に応じた間引き走査閾値を検出する間引
き走査閾値検出部６４が接続されている Fifth Embodiment FIG. 15 is a block diagram of a character recognition device provided with a feature extracting device according to a fifth embodiment of the present invention. This character recognition device includes an image storage medium 11, a character cutout unit 12, and a character recognition unit 13 similar to those of the first embodiment, and is connected between the character cutout unit 12 and the character recognition unit 13. A feature extraction device 60 different from that of the first embodiment is provided. As in the first embodiment, the feature extraction device 60 includes an image storage unit 61 that stores the binary image given from the character extraction unit 12.
And an outline point extraction unit 62 connected to the image storage unit 61. On the output side of the image storage unit 61,
A line width detection unit 63 for detecting the line width W of the binary image stored in the image storage unit 61 is connected. Contour point extraction unit 6
2, a thinning-scanning-threshold detecting unit 64 that detects a thinning-scanning threshold corresponding to the resolution is connected.

【００５７】輪郭点抽出部６２、線幅検出部６３及び間
引き走査閾値検出部６４の出力側に、特徴点抽出部６５
が接続され、該特徴点抽出部６５及び線幅検出部６３の
出力側に、凹凸検出部６６が接続されている。特徴点抽
出部６５は、輪郭点抽出部６２で抽出された輪郭点を走
査して、文字の特徴的なところを特徴点として抽出する
ものであり、凹凸検出部６６は、特徴点抽出部６５で抽
出された特徴点が凸特徴点か凹特徴点かを検出するもの
である。凹凸検出部６６の出力側が、文字認識部１３の
入力側に接続されている。On the output side of the contour point extracting section 62, the line width detecting section 63 and the thinning scanning threshold detecting section 64, a feature point extracting section 65 is provided.
Is connected to the output side of the feature point extraction unit 65 and the line width detection unit 63. The feature point extracting unit 65 scans the contour points extracted by the contour point extracting unit 62 and extracts characteristic portions of the character as feature points. Is to detect whether the extracted feature point is a convex feature point or a concave feature point. The output side of the unevenness detection unit 66 is connected to the input side of the character recognition unit 13.

【００５８】次に、図１５の文字認識装置の動作を説明
する。文字切出部１２では、文字が記載された文書や帳
票等の２値画像が格納された画像記録媒体１１から２値
画像を受け取り、これを１文字単位に切り出す。画像格
納部６１は文字切出部１２から１文字単位の２値画像を
受け取って格納する。輪郭点抽出部６２は、画像格納部
６１から１文字単位の２値画像を受け取り、第３の実施
形態と同様の処理により、当該２値画像の輪郭点を抽出
する。間引き走査閾値検出部６４は、輪郭点抽出部６２
で作成された輪郭テーブルを受け取り、第３の実施形態
と同様にして第１及び第２の間引き走査閾値ＭＴＨ１及
びＭＴＨ２を検出する。一方、線幅検出部６３は、画像
格納部６１で格納する１文字単位の２値画像を参照し、
該２値画像の線幅Ｗを検出する。線幅Ｗを検出する方法
は第２の実施例と同様とする。Next, the operation of the character recognition device shown in FIG. 15 will be described. The character extracting unit 12 receives a binary image from the image recording medium 11 in which a binary image such as a document or a form in which characters are described is stored, and cuts out the binary image in units of one character. The image storage unit 61 receives and stores a binary image in units of one character from the character extraction unit 12. The contour point extracting unit 62 receives the binary image of one character unit from the image storage unit 61, and extracts the contour points of the binary image by the same processing as in the third embodiment. The thinning-out scanning threshold value detection unit 64 includes the contour point extraction unit 62
And detects the first and second thinning-out scanning thresholds MTH1 and MTH2 in the same manner as in the third embodiment. On the other hand, the line width detection unit 63 refers to the one-character unit binary image stored in the image storage unit 61,
The line width W of the binary image is detected. The method of detecting the line width W is the same as in the second embodiment.

【００５９】特徴点抽出部６５では、輪郭点抽出部６２
から輪郭テーブルを受け取り、線幅検出部６３から線幅
Ｗを受け取り、間引き走査閾値検出部６４から間引き走
査閾値ＭＴＨ１，ＭＴＨ２を受け取り、第４の実施形態
と同様の処理により、特徴点を抽出し、該特徴点を輪郭
テーブルに書込む。凹凸検出部６６は、特徴点抽出部６
５から輪郭テーブルを受け取り、線幅検出部６３から線
幅Ｗを受け取り、第２の実施形態と同様にして、特徴点
が凹特徴点か凸特徴点かを求め、輪郭テーブルに書込
む。この輪郭テーブルが文字認識部１３に与えられ、該
文字認識部１３が、輪郭テーブルの輪郭点や特徴点の情
報を参照しながら文字認識を行う。In the feature point extracting section 65, the contour point extracting section 62
, The line width W from the line width detecting unit 63, the thinning scanning threshold values MTH1 and MTH2 from the thinning scanning threshold value detecting unit 64, and feature points are extracted by the same processing as in the fourth embodiment. Then, the feature points are written in the contour table. The unevenness detecting unit 66 includes the feature point extracting unit 6
5, the line width W is received from the line width detection unit 63, and in the same manner as in the second embodiment, whether the characteristic point is a concave characteristic point or a convex characteristic point is obtained, and written in the outline table. The outline table is provided to the character recognition unit 13, and the character recognition unit 13 performs character recognition while referring to information on outline points and feature points in the outline table.

【００６０】以上のように、この第５の実施形態では、
線幅検出部６３を設け、文字の線幅Ｗに応じて特徴抽出
部６５で使用する閾値ＴＨ１，ＴＨ２を算出し、特徴点
を抽出するようにしている。よって、文字の線幅Ｗに影
響されず、常に安定した特徴点を抽出することが可能で
あると共に、凹凸検出部６６においても文字の線幅Ｗに
応じて閾値を算出して特徴点の凹凸を検出するので、文
字の線幅Ｗに影響されず、常に安定して特徴点の凹凸の
情報を検出できる。As described above, in the fifth embodiment,
A line width detection unit 63 is provided, and thresholds TH1 and TH2 used by the feature extraction unit 65 are calculated according to the line width W of the character to extract feature points. Therefore, it is possible to always extract a stable feature point without being affected by the line width W of the character. In addition, the unevenness detecting unit 66 calculates a threshold value in accordance with the line width W of the character and calculates the unevenness of the characteristic point. Is detected, the information of the unevenness of the feature point can always be detected stably without being affected by the line width W of the character.

【００６１】その上、間引き走査閾値検出部６４を設
け、輪郭点数に応じた間引き走査閾値ＭＴＨ１，ＭＴＨ
２を求め、該間引き走査閾値ＭＴＨ１，ＭＴＨ２で間引
きした走査を行って特徴点を抽出するようにしたので、
解像度が高くて輪郭点数が多い場合の処理時間を抑える
ことが可能になっている。なお、単に間引き走査を行う
と、特徴点が若干ずれることがある。解像度が高い場合
には多少のずれは気にならない程度だが、解像度が低い
場合には多少のずれでも、特徴的なところから大きくず
れることになる。しかし、本実施形態では、輪郭点の量
Ｒｓｕｍに応じて間引き量を決定するので、その問題に
十分対処できる。そのため、例えば複数の大きさの記入
枠に記入されたために、サイズが一定でない文字が混在
するような帳票や、複数のフォントサイズで印刷された
ためにサイズが一定でない文字が混在するような文章を
対象とした場合でも、適切な間引き量で特徴点が抽出で
きる。Further, a thinning-scanning threshold detecting section 64 is provided, and thinning-scanning thresholds MTH1 and MTH corresponding to the number of contour points are provided.
2 is obtained and the feature points are extracted by performing the scan thinned by the thinning scan threshold values MTH1 and MTH2.
It is possible to reduce the processing time when the resolution is high and the number of contour points is large. Note that the characteristic points may be slightly shifted when the thinning scanning is simply performed. When the resolution is high, a slight shift is not noticeable, but when the resolution is low, even a slight shift greatly shifts from a characteristic point. However, in the present embodiment, since the thinning amount is determined according to the contour point amount Rsum, the problem can be sufficiently dealt with. Therefore, for example, a form in which characters with irregular sizes are mixed due to being entered in a plurality of entry frames, or a sentence in which characters with irregular sizes due to being printed in multiple font sizes are mixed. Even in the case of a target, a feature point can be extracted with an appropriate thinning amount.

【００６２】なお、本発明は、上記実施形態に限定され
ず種々の変形が可能である。その変形例としては、例え
ば次のようなものがある。（１）輪郭点抽出部２２，３２，４２，５２，６２で
は、図３のような輪郭追跡法を行って輪郭点を求めるよ
うにしたが、これに限定されるものではなく、いかなる
輪郭抽出法を用いてもよい。（２）線幅Ｗの検出では、図１１の２行２列の観測窓
を用いたが、線幅Ｗが検出できれば他の方法でもよい。（３）線幅Ｗから閾値ＴＨ１，ＴＨ２，ＴＨ３を求め
る（７）〜（９）式は、他の式を用いてもよい。（４）輪郭点の計数値Ｒｓｕｍから間引き走査閾値Ｍ
ＴＨ１，ＭＴＨ２を求める（１０），（１１）式は、他
の式を用いてもよい。また、線幅Ｗから間引き走査閾値
ＭＴＨ１，ＭＴＨ２を求める（１２），（１３）式につ
いても、同様に他の式を用いてもよい。The present invention is not limited to the above embodiment, but can be variously modified. For example, there are the following modifications. (1) In the contour point extraction units 22, 32, 42, 52, and 62, contour points are obtained by performing a contour tracking method as shown in FIG. 3, but the invention is not limited to this. Method may be used. (2) In the detection of the line width W, the observation window of two rows and two columns in FIG. 11 is used, but another method may be used as long as the line width W can be detected. (3) The thresholds TH1, TH2, and TH3 are determined from the line width W. Other equations may be used as the equations (7) to (9). (4) Thinning-out scanning threshold M from the contour point count value Rsum
Expressions (10) and (11) for obtaining TH1 and MTH2 may use other expressions. Similarly, other expressions may be used for the expressions (12) and (13) for obtaining the thinning-out scanning thresholds MTH1 and MTH2 from the line width W.

【００６３】（５）第１〜第５の実施形態では、輪郭
テーブルに特徴点抽出結果及び凹凸検出結果を書込ん
で、文字認識部１３に渡す構成にしたが、特徴点抽出結
果及び凹凸検出結果をそれぞれ輪郭テーブルとは別に渡
すようにしてもよい。（６）第１〜第５の実施形態では、セグメントｓｇ
１，ｓｇ２が複数ある場合に、セグメントｓｇ１，ｓｇ
２ごとに、（条件１）及び（条件２）を満たす輪郭点Ｎ
Ｒを抽出していたが、該輪郭点ＮＲを抽出する際に、す
べてのセグメントの輪郭テーブルを対象としてもよい。
つまり、注目点Ｃと条件を満たす輪郭点ＮＲとが別のセ
グメントｓｇ１，ｓｇ２になるようにしてもよい。（７）文字切出部１２は、画像記録媒体１１から２値
画像を入力しているがこれに限定されず、スキャナ或い
はＦＡＸ等から画像を入力するようにしてもよい。(5) In the first to fifth embodiments, the feature point extraction result and the irregularity detection result are written in the contour table and passed to the character recognition unit 13. However, the feature point extraction result and the irregularity detection are used. Each result may be passed separately from the contour table. (6) In the first to fifth embodiments, the segment sg
When there are a plurality of segments sg1 and sg
2, the contour points N satisfying (condition 1) and (condition 2)
Although R has been extracted, when extracting the contour point NR, the contour table of all segments may be targeted.
That is, the point of interest C and the contour point NR satisfying the condition may be different segments sg1 and sg2. (7) The character extracting section 12 inputs a binary image from the image recording medium 11, but is not limited to this, and may input an image from a scanner or a facsimile.

【００６４】[0064]

【発明の効果】以上詳細に説明したように、第１の発明
によれば、画像格納部、輪郭点抽出部及び特徴点抽出部
を設け、注目点Ｃ_n，Ｃ_n+1から輪郭点列上で第１の閾
値ＴＨ１だけ離れ、かつ、注目点Ｃ_n，Ｃ_n+1に最も近
い位置の輪郭点ＮＲ_n，ＮＲ_n+ ₁をそれぞれ求め、これ
らの輪郭点ＮＲ_n，ＮＲ_n+1の輪郭点上の距離が第２の
閾値ＴＨ２以上であれば、注目点Ｃ_n+1を特徴点として
抽出するようにしたので、輪郭点列の方向変化にかかわ
りなく、距離から特徴点を求めることになり、端点や角
度のきつい分岐及び交差点を特徴点として抽出するばか
りでなく、特徴点としてふさわしくない、緩やかな弧と
なる部分を特徴点として抽出せずに、輪郭の変化の少な
い分岐点や交差点からも特徴点を確実に抽出できる。As described above in detail, according to the first aspect, the image storage unit, the contour point extracting unit and the feature point extracting unit are provided, and the contour point sequence is formed from the points of interest C _n and C _{n + 1.} The contour points NR _n and NR _{n +} ₁ at the positions separated by the first threshold value TH1 and closest to the points of interest C _n and C _{n + 1} are obtained, and the contour points NR _n and NR _{n + 1} are determined. If the distance on the contour point is greater than or equal to the second threshold value TH2, the attention point C _{n + 1} is extracted as a feature point. Therefore, regardless of the direction change of the contour point sequence, the feature point is obtained from the distance. In addition to extracting end points and steep branches and intersections as feature points as well as extracting gradual arcs that are not suitable as feature points as feature points, Feature points can be reliably extracted from intersections.

【００６５】第２の発明によれば、特徴抽出装置におい
て、第１の発明と同様の画像格納部、輪郭点抽出部を備
えると共に、線幅検出部及び特徴点抽出部を設け、線幅
に対応する第１及び第２の閾値ＴＨ１，ＴＨ２を決定
し、該第１及び第２の閾値ＴＨ１，ＴＨ２を用いて特徴
点を抽出するようにしたので、文字の線幅が変化して
も、その影響を受けずに、特徴点を確実に抽出できる。
第３の発明によれば、画像格納部及び輪郭点抽出部を備
えると共に、間引き走査閾値検出部及び特徴点抽出部を
設け、輪郭点の量に応じた第１及び第２の間引き走査閾
値ＭＴＨ１，ＭＴＨ２を決定し、第１及び第２の間引き
走査閾値ＴＨ１，ＴＨ２を用いて間引き走査を行って特
徴点を検出するようにしたので、解像度が高くなって
も、特徴点を抽出する際の処理量の増加が押えられる。According to the second aspect, the feature extracting apparatus includes the same image storage unit and contour point extracting unit as in the first aspect, and further includes a line width detecting unit and a characteristic point extracting unit to reduce the line width. Since the corresponding first and second thresholds TH1 and TH2 are determined and the feature points are extracted using the first and second thresholds TH1 and TH2, even if the line width of the character changes, The feature points can be reliably extracted without being affected by the influence.
According to the third aspect, the image storage unit and the contour point extracting unit are provided, and the thinning scanning threshold value detecting unit and the feature point extracting unit are provided, and the first and second thinning scanning threshold values MTH1 according to the amount of the contour points are provided. , MTH2, and thinning-out scanning are performed using the first and second thinning-out scanning thresholds TH1 and TH2 to detect a feature point. An increase in the throughput is suppressed.

【００６６】第４の発明によれば、画像格納部、輪郭点
抽出部及び線幅検出部を備えると共に、間引き走査閾値
検出部及び特徴点抽出部を設け、線幅に対応する第１及
び第２の間引き走査閾値ＭＴＨ１，ＭＴＨ２を決定し、
線幅に対応する第１及び第２の閾値ＴＨ１，ＴＨ２を決
定し、該第１及び第２の間引き走査閾値ＭＴＨ１，ＭＴ
Ｈ２で間引き走査を行いつつ、線幅に応じた該第１及び
第２の閾値ＴＨ１，ＴＨ２で特徴点を抽出するようにし
たので、線幅が変化しても特徴点のみを確実に抽出でき
ると共に、その処理量を抑制できる。その上、間引き走
査を行っても、それが線幅に応じた間引きになるので、
特徴点のずれが無視できる値に維持できる。According to the fourth aspect, an image storage unit, an outline point extraction unit, and a line width detection unit are provided, and a thinning scanning threshold value detection unit and a feature point extraction unit are provided. 2 thinning scanning thresholds MTH1 and MTH2 are determined,
First and second thresholds TH1 and TH2 corresponding to the line width are determined, and the first and second thinning scanning thresholds MTH1 and MT are determined.
Since the feature points are extracted at the first and second threshold values TH1 and TH2 according to the line width while performing the thinning-out scanning at H2, only the feature points can be reliably extracted even if the line width changes. At the same time, the processing amount can be suppressed. In addition, even if thinning scan is performed, it becomes thinning according to the line width,
The feature point deviation can be maintained at a value that can be ignored.

【００６７】第５の発明によれば、画像格納部、輪郭点
抽出部、線幅検出部、及び間引き走査閾値検出部を備え
ると共に第４の発明と同様の特徴点抽出部を設け、輪郭
点の量に応じた第１及び第２の間引き走査閾値ＭＴＨ
１，ＭＴＨ２を決定し、線幅に対応する第１及び第２の
閾値ＴＨ１，ＴＨ２を決定し、該第１及び第２間引き走
査閾値ＭＴＨ１，ＭＴＨ２で間引き走査を行いつつ、線
幅に応じた該第１及び第２の閾値ＴＨ１，ＴＨ２で特徴
点を抽出するようにしたので、線幅が変化しても特徴点
のみを確実に抽出できると共に、その処理量を抑制でき
る。その上、間引き走査を行っても、それが輪郭点の量
に応じた間引きになるので、特徴点のずれが無視できる
値に維持できる。第６及び第７の発明によれば、凹凸検
出部を設けたので、文字認識等で適確な処理が行えるよ
うになる。According to the fifth aspect, an image storage unit, an outline point extraction unit, a line width detection unit, and a thinning scanning threshold value detection unit are provided, and a feature point extraction unit similar to that of the fourth aspect is provided. And second thinning-out scanning threshold MTH according to the amount of
1, MTH2, and first and second thresholds TH1 and TH2 corresponding to the line width, and performing the thinning-out scanning at the first and second thinning-out scanning thresholds MTH1 and MTH2. Since the feature points are extracted using the first and second threshold values TH1 and TH2, only the feature points can be reliably extracted even if the line width changes, and the processing amount can be suppressed. In addition, even if the thinning-out scanning is performed, the thinning-out is performed according to the amount of the contour points, so that the deviation of the feature points can be maintained at a negligible value. According to the sixth and seventh aspects of the present invention, since the unevenness detecting section is provided, it is possible to perform an appropriate process such as character recognition.

[Brief description of the drawings]

【図１】本発明の第１の実施形態を示す特徴抽出装置を
備えた文字認識装置の構成図である。FIG. 1 is a configuration diagram of a character recognition device including a feature extraction device according to a first embodiment of the present invention.

【図２】従来の特徴抽出における課題を示す図である。FIG. 2 is a diagram showing a problem in conventional feature extraction.

【図３】図１中の輪郭点抽出部２２が行う輪郭点抽出の
概要を示す図である。FIG. 3 is a diagram showing an outline of contour point extraction performed by a contour point extracting unit 22 in FIG. 1;

【図４】輪郭テーブルの構成例を示す図である。FIG. 4 is a diagram showing a configuration example of a contour table.

【図５】文字構成例を示す図である。FIG. 5 is a diagram showing a character configuration example.

【図６】特徴点抽出部２３が行う特徴点抽出の手順を示
す図である。FIG. 6 is a diagram showing a procedure of feature point extraction performed by a feature point extraction unit 23.

【図７】特徴点抽出の概要を示す図である。FIG. 7 is a diagram showing an outline of feature point extraction.

【図８】凹凸検出部２４による輪郭テーブルの走査を示
す図である。FIG. 8 is a diagram illustrating scanning of a contour table by the unevenness detection unit 24;

【図９】凹特徴点と凸特徴点の検出結果を示す図であ
る。FIG. 9 is a diagram showing detection results of concave feature points and convex feature points.

【図１０】本発明の第２の実施形態を示す特徴抽出装置
を備えた文字認識装置の構成図である。FIG. 10 is a configuration diagram of a character recognition device including a feature extraction device according to a second embodiment of the present invention.

【図１１】線幅検出用の観測窓を示す図である。FIG. 11 is a diagram showing an observation window for line width detection.

【図１２】本発明の第３の実施形態を示す特徴抽出装置
を備えた文字認識装置の構成図である。FIG. 12 is a configuration diagram of a character recognition device including a feature extraction device according to a third embodiment of the present invention.

【図１３】本発明の第４の実施形態を示す特徴抽出装置
を備えた文字認識装置の構成図である。FIG. 13 is a configuration diagram of a character recognition device including a feature extraction device according to a fourth embodiment of the present invention.

【図１４】線幅と端点の関係を示す図である。FIG. 14 is a diagram showing a relationship between a line width and an end point.

【図１５】本発明の第５の実施形態を示す特徴抽出装置
を備えた文字認識装置の構成図である。FIG. 15 is a configuration diagram of a character recognition device including a feature extraction device according to a fifth embodiment of the present invention.

[Explanation of symbols]

１１画像記録媒体１２文字切出部１３文字認識部２０，３０，４０，５０，６０特徴抽出装置２１，３１，４１，５１，６１画像格納部２２，３２，４２，５２，６２輪郭点検出部３３，５３，６３線幅検出部４３，５４，６４間引き走査閾値検出部２３，３４，４４，５５，６５特徴点抽出部２４，３５，４５，５６，６６凹凸検出部 DESCRIPTION OF SYMBOLS 11 Image recording medium 12 Character extraction part 13 Character recognition part 20,30,40,50,60 Feature extraction device 21,31,41,51,61 Image storage part 22,32,42,52,62 Contour point detection part 33, 53, 63 Line width detectors 43, 54, 64 Thinning scan threshold detectors 23, 34, 44, 55, 65 Feature point extractors 24, 35, 45, 56, 66 Unevenness detectors

Claims

[Claims]

1. A feature extraction device for inputting an image of a character drawn by a character line and extracting a feature point representing a characteristic of the character in the image, wherein the image extracted in units of one character is input. An image storage unit for storing, a contour point extraction unit for extracting a contour point sequence forming the contour of the character line in the stored image, and scanning the contour point sequence extracted by the contour point extraction unit. Detecting a contour point NR _n separated from the notice point C _n, which is one of the points, by the number of dots on the contour point sequence by the first threshold value TH1 or more and the closest coordinate distance from the notice point C _n ; Further, after shifting the point of interest from the point of interest C _n to a point of interest C _{n + 1} shifted by one dot on the line of contour points, the contour point sequence extracted by the contour point extraction unit is scanned, and the point of interest is scanned. apart C n _{+ 1} from the first threshold value TH1 or more in the number of dots on the contour point sequence and the noticed Contour points N with the C n _{+ 1} to the closest distance coordinate distance
R _{n + 1} is detected, and when the number of dots on the contour point sequence between the contour point NR _n and the contour point NR _{n + 1} is larger than a second threshold TH2, the target point C _{n + 1} is determined. And a feature point extracting unit that sets the feature points.

2. A feature extraction device for inputting an image of a character drawn by a character line and extracting a feature point representing a feature of the character in the image, wherein the image extracted in units of one character is input. An image storage unit for storing, an outline point extraction unit for extracting an outline point sequence forming the outline of the character line in the stored image, and a line width for detecting a line width of the character line of the stored image A first threshold value TH1 and a second threshold value corresponding to the detection unit and the line width;
Determining the threshold value TH2, the scanning the contour point sequence extracted by the contour point extracting unit, from the target point C _n is one of the outline point of the first in dots on the contour point sequence threshold TH1 more distant and detects the contour points NR _n in from the noted point C _n to the closest distance coordinate distance further target point C _n of the point of interest from the noted point C _n shifted 1 dot on the contour point sequence _After moving to ₊₁ , scan the outline point sequence extracted by the outline point extraction unit,
From the noted point C _{n + 1} apart first threshold TH1 or more in the number of dots on the contour point row and detecting contour points NR n _{+ 1} in the the noted point C _{n + 1} to the closest distance coordinate distance and, the noted point when the number of dots on the contour point sequence is greater than the threshold value TH2 of the second between the contour points NR _n and contour point NR n _{+ 1} C
a feature point extraction unit having _{n + 1} as the feature point.

3. A feature extraction device for inputting an image of a character drawn by a character line and extracting a feature point representing a characteristic of the character in the image, wherein the image extracted in units of one character is input. An image storage unit for storing, an outline point extraction unit for extracting an outline point sequence forming the outline of the character line in the stored image, and a first area corresponding to an amount of the outline points extracted by the outline point extraction unit Thinning scanning threshold MTH1 and second thinning scanning threshold MTH
And thinning the scanning threshold detector for determining a 2, a contour point string extracted by the contour point extracting unit scans for each of the second thinning scanning threshold MTH2, said the target point C _n is one of the contour points The first threshold T is determined by the number of dots on the contour point sequence.
H1 or away and detects the contour points NR _n in from the noted point C _n to the closest distance coordinate distance, further, the point of interest from the noted point C _n on the contour point row first decimating scanning threshold After moving to the point of interest Cn _{+ MTH1} shifted by _MTH1 , the contour point sequence extracted by the contour point extraction unit is scanned for each of the second thinning-out scanning thresholds MTH2, and the contour point sequence is extracted from the point of _{interest Cn + MTH1.} detecting contour points NR n _{+ MTH1} that apart first threshold TH1 or more in dots on the contour point sequence and from the noted point C _{n + MTH1} the closest coordinate distance, the contour point NR _n and the contour Point NR
_The number of dots on the outline point sequence between _{n + MTH1} is equal to the second threshold value TH.
A feature point extraction unit that uses the point of _interest C _{n + MTH1} as the feature point when the value is larger than two.

4. An image of a character drawn by a character line is input.
And extract feature points representing the features of the characters in the image.
In the feature extracting device, an image for inputting and storing the image cut out in character units
Forming an outline of the character line in the stored image;
A contour point extracting unit for extracting a contour point sequence to be extracted, and a line width for detecting a line width of the character line of the stored image.
A detection unit; a first thinning scanning threshold MTH1 corresponding to the line width;
Thinning-scanning-threshold detector for determining the second threshold MTH2
And a first threshold value TH1 and a second threshold value T corresponding to the line width.
H2 is determined, and a contour point sequence extracted by the contour point extracting unit
Is scanned every second thinning-out scanning threshold MTH2,
Attention point C, one of the points_nFrom the number of dots on the contour point sequence
At least the first threshold value TH1 and the point of interest C_nKara
Contour point NR closest to target distance _nIs detected
In addition, the point of interest is the point of interest C_nOn the contour point sequence from the first
Point of interest C shifted by thinning scanning threshold MTH1_{n + MTH1}Moved to
Then, the contour point sequence extracted by the contour point extracting unit is
2 for each thinning scanning threshold MTH2,
_{n + MT} _H1From the number of dots on the outline point sequence to the first threshold TH
At least one point away and the point of interest C_{n +} _MTH1Nearest coordinate distance from
Contour point NR at a long distance_{n + MTH1}And the contour point NR
_nAnd contour point NR_{n + MTH1}The number of dots on the contour point sequence between
When the target point C is larger than the second threshold value TH2,
_{n + MTH1}And a feature point extraction unit that sets the feature point as the feature point.

5. An image of a character drawn by a character line is input.
And extract feature points representing the features of the characters in the image.
In the feature extracting device, an image for inputting and storing the image cut out in character units
Forming an outline of the character line in the stored image;
A contour point extracting unit for extracting a contour point sequence to be extracted, and a line width for detecting a line width of the character line of the stored image.
A detecting unit; a first unit corresponding to the amount of the contour points extracted by the contour point extracting unit;
Thinning scanning threshold MTH1 and second thinning scanning threshold MTH
A first threshold value TH1 and a second threshold value T corresponding to the line width.
H2 is determined, and a contour point sequence extracted by the contour point extracting unit
Is scanned every second thinning-out scanning threshold MTH2,
Attention point C, one of the points_nFrom the number of dots on the contour point sequence
At least the first threshold value TH1 and the point of interest C_nKara
Contour point NR closest to target distance _nIs detected
In addition, the point of interest is the point of interest C_nOn the contour point sequence from the first
Point of interest C shifted by thinning scanning threshold MTH1_{n + MTH1}Moved to
Then, the contour point sequence extracted by the contour point extracting unit is
2 for each thinning scanning threshold MTH2,
_{n + MT} _H1From the number of dots on the outline point sequence to the first threshold TH
At least one point away and the point of interest C_{n +} _MTH1Nearest coordinate distance from
Contour point NR at a long distance_{n + MTH1}And the contour point NR
_nAnd contour point NR_{n + MTH1}The number of dots on the contour point sequence between
When the target point C is larger than the second threshold value TH2,
_{n + MTH1}And a feature point extracting unit that uses the feature point as the feature point.

6. A contour point _{R.sub.n + TH.sub.3} which has advanced the contour point sequence in the forward direction by a preset third threshold value TH.sub.3 from a contour point _R.sub.n corresponding to the feature point and only the third threshold value TH.sub.3. The contour point R _n-TH3 returned in the opposite direction is obtained, and the contour point R _n ,
The positional relationship of the contour point R _{n + TH} ₃ and contour point R _n-TH3, the position of the contour point R _n is provided with a concave-convex detecting part for detecting whether become concave or has a convex in the character The feature extraction device according to claim 1 or 3, wherein:

7. Determine the third threshold TH3 from said detected line width by the line width detecting section, the contour point sequence by the threshold TH3 of the third from the contour point R _n corresponding to the feature point A contour point R _{n + TH3} which has advanced in the forward direction and a contour point R _n-TH3 which has returned in the reverse direction by the third threshold value TH3 are obtained, and the contour point R _n , the contour point R _{n + TH3} and the contour point R are obtained. the positional relationship of the _n-TH3, claim 2, 4 or 5, characterized in that the position of the contour point R _n is provided with a concave-convex detecting part for detecting whether become concave or has a convex in the character A feature extraction device as described.