JP3236732B2 - Character recognition device

Info

Publication number: JP3236732B2
Application number: JP05736494A
Authority: JP
Inventors: 穂高倉; 磨理子竹之内; 一郎中尾; 里志江村
Original assignee: Panasonic Corp; Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Corp; Panasonic Holdings Corp
Priority date: 1994-03-28
Filing date: 1994-03-28
Publication date: 2001-12-10
Anticipated expiration: 2016-12-10
Also published as: JPH07271911A

Description

DETAILED DESCRIPTION OF THE INVENTION

[0001]

[Field of Industrial Application] The present invention relates to a character recognition device, and in particular to a character recognition device that recognizes the characters in an input document image and converts them into character codes.

[0002]

[Prior Art] In recent years, systems have been researched and developed, and some have been put into practical use, in which a printed document is read by optical/electrical conversion or the like and converted into image data holding bit information for each pixel, and the characters in that image data are then recognized in order to save labor in data entry, to translate the text into a foreign language, or to read it aloud for blind or visually impaired persons, children, and schoolchildren.

[0003] The present invention relates to a character recognition device employed in such systems. In the conventional character recognition devices employed in such systems, the black pixels of an extracted character string image are projected in the direction perpendicular to the character string, and each region in which the projection is continuous is cut out as a character image (see, for example, Akiyama et al., "Character Segmentation in Article Regions of Printed Matter," PRL80-70).

Leaving aside simple recognition targets such as the alphabet, languages such as Japanese and Chinese have a large variety and number of characters to be recognized, so the shape and size of each character to be recognized must be known accurately before recognition or, so to speak, in concert with recognition. Even for printed documents in Western languages, scientific and engineering papers use many kinds of symbols, so accurately determining the size of the characters to be recognized is important.

The segmentation technique described above, however, cuts out character images by treating each region of continuous projection as a character. It therefore cannot segment accurately when characters touch each other (hereinafter, "touching characters") or when separated characters are present. The latter term covers both characters composed of several character elements separated along the character string direction, such as 「北」 and 「川」 in horizontally written documents or 「二」 and 「三」 in vertically written documents, and characters that originally consist of a single character element but have been broken into several elements by faint printing (hereinafter, both are called "separated characters").

Several methods have therefore been proposed for correctly cutting out characters from the character string images of documents that contain touching or separated characters. For example, in Japanese Patent Laid-Open No. 5-128308, "Character Recognition Device," a character element narrower than a predetermined character size is treated as part of a separated character and is connected to the preceding and following (left and right, or upper and lower) character elements, as long as the width does not exceed the character size, and cut out as one character. A character element wider than the predetermined character size is treated as touching characters and divided at the character size, and each piece is cut out as one character. When a character element narrower than the predetermined character size is adjacent to one wider than it, the division positions obtained by dividing at every character size from the leading position of the narrow element, and those obtained by dividing at every character size from the leading position of the wide element, are all taken as segmentation candidate positions. Every image lying between two different candidate positions is provisionally recognized as a character, and the provisionally recognized characters with high evaluation values are selected as the true characters. This allows for the possibility that the latter element of a separated character touches the next character.

Whereas No. 5-128308 generates candidate positions at every fixed character size, Japanese Patent Laid-Open No. 5-128307, "Character Recognition Device," takes as candidates all the division positions obtained both when the half-width character width is applied as the character size and when the full-width character width is applied.
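As an illustration of the conventional projection method described above, the following minimal Python sketch projects the black pixels of a binary text-line image onto the character string axis and reports each run of non-empty columns as one element. The function names and the toy image are assumptions made for this illustration; they do not come from the cited reference.

```python
def project_columns(rows):
    """Count black pixels (1s) in each column of a binary text-line image."""
    width = len(rows[0])
    return [sum(row[x] for row in rows) for x in range(width)]


def character_elements(rows):
    """Return (start, end) column spans where the projection is non-zero.

    `end` is exclusive. Each span is one region of continuous projection.
    """
    spans, start = [], None
    for x, count in enumerate(project_columns(rows)):
        if count and start is None:
            start = x                      # a run of inked columns begins
        elif not count and start is not None:
            spans.append((start, x))       # the run ended at a blank column
            start = None
    if start is not None:                  # run reaches the right edge
        spans.append((start, len(rows[0])))
    return spans


# Toy horizontal line: one solid glyph, then a glyph made of two separate strokes.
line = [
    [1, 1, 1, 0, 1, 0, 1, 0],
    [1, 0, 1, 0, 1, 0, 1, 0],
    [1, 1, 1, 0, 1, 0, 1, 0],
]
print(character_elements(line))
```

In the toy line the second glyph consists of two separate strokes, so the projection yields three elements rather than two characters, which is the separated-character failure discussed above.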

[0004] As for detecting blank spaces within a character string, the gap between the cut-out characters is compared with the full-width or the half-width character width, and a gap wider than that width is taken to be a blank space.

[0005]

[Problems to Be Solved by the Invention] However, the method of Japanese Patent Laid-Open No. 5-128308 presupposes a constant character size. It therefore copes poorly with documents printed with a mixture of full-width and half-width characters and with documents of variable pitch, and documents of these kinds are very common among technical papers and the like. The method of Japanese Patent Laid-Open No. 5-128307, on the other hand, produces a very large number of segmentation candidate positions, and because every image lying between two different candidate positions is provisionally recognized as a character, the amount of computation becomes very large, which in turn slows the processing of the system as a whole.

[0006] In blank-space detection, when the full-width character width is used as the character width, narrow half-width spaces cannot be detected. Conversely, when the half-width character width is used, narrow characters that follow one another in a document printed at full-width pitch, such as 「１１」, leave a gap between them that is wider than the half-width character width, so spurious spaces are detected. Consequently, it is difficult to handle documents printed with a mixture of full-width and half-width characters or documents of variable pitch.

[0007] In view of the above problems, an object of the present invention is to provide a character recognition device that can segment characters and detect blank spaces quickly and accurately even in documents printed with a mixture of full-width and half-width characters and in documents of variable pitch.

[0008]

[Means for Solving the Problems] To achieve the above object, the invention of claim 1 is a character recognition device that cuts out character string images from an input document image that has been converted into, for example, bit information for each pixel, then cuts out individual character images from each character string image, and recognizes the individual character images and converts them into the corresponding character codes, the device comprising:

a character element extraction unit that extracts from the character string image character elements, each being a cluster of pixels constituting a character (consisting of line segments or curves at the same position, or phase, along the character string direction, or a combination of these);

a character width calculation unit that calculates, by predetermined means (including procedures and methods), the width of a full-width character and the width of a half-width character from the positions or sizes of the character elements extracted by the character element extraction unit;

a character element division unit that detects, among the character elements extracted by the character element extraction unit, any character element narrower than the half-width character width calculated by the character width calculation unit and takes it as a sub-character element as it is, and detects any character element wider than the half-width character width and divides it into a plurality of sub-character elements each no wider than the half-width character width;

a character candidate generation unit that, regarding each sub-character element generated by the character element division unit as forming a single character together with the preceding and following character elements or sub-character elements in the string, cuts out as a half-width character candidate image the character element image obtained by connecting them so that the connected width is as large as possible without exceeding the half-width character width ("width" here includes the vertical length when the recognition direction is vertical), and cuts out as a full-width character candidate image the character element image obtained by connecting them so that the connected width is as large as possible without exceeding the full-width character width;

a recognition unit that provisionally recognizes each half-width character candidate image and full-width character candidate image generated by the character candidate generation unit as a single character and calculates the corresponding character code and an evaluation value indicating its accuracy; and

a recognition result evaluation unit that substitutes the evaluation values obtained by the recognition unit for the half-width and full-width character candidate images into a predetermined function, compares them, and judges the character candidate image with the higher resulting evaluation to be the correct character.

[0009] In the invention of claim 2, the character width calculation unit has character printing pitch calculation means that calculates the character printing pitch from the positions or sizes (or both) of the character elements and calculates the width of a full-width character and the width of a half-width character from the result.

In the invention of claim 3, the character width calculation unit has: provisional character width calculation means that calculates a provisional character width from the height of the character string image (the size or length of the character string image in the direction perpendicular to the character string; in a vertically written document this is what would ordinarily be called the width); continuous part detection means that detects, among the character elements extracted by the character element extraction unit, a part in which character elements whose width differs from the provisional character width by less than a predetermined value occur consecutively; and calculation means that calculates the character printing pitch from the distance between the midpoints, in the character string direction, of the detected consecutive character elements and calculates the full-width and half-width character widths from the result.

[0010] In the invention of claim 4, the character width calculation unit has: provisional character width calculation means that calculates a provisional character width from the height of the character string image; continuous part detection means that, where character elements narrower than the provisional character width occur consecutively, provisionally connects them as long as the width does not exceed the provisional character width, obtains the difference between the provisional character width and the width of each provisionally connected character element and each original character element, and detects a part in which character elements for which this difference is smaller than a predetermined value occur consecutively; and calculation means that calculates the character printing pitch from the distance between the midpoints, in the character string direction, of the character elements in the part detected by the continuous part detection means and calculates the full-width and half-width character widths from the result.

[0011] In the invention of claim 5, the recognition unit has a half-width character recognition dictionary used to recognize the half-width character candidate images generated by the character candidate generation unit and a full-width character recognition dictionary used to recognize the full-width character candidate images generated by the character candidate generation unit.

In the invention of claim 6, the recognition unit has: half-character judgment means that judges whether the recognition result for a full-width character candidate image is a character one half of which, when the single character is divided along the character string direction, has the same shape as a half-width character; and priority evaluation means that, when the half-character judgment means judges the result to be such a character, gives priority to the evaluation value of the full-width character candidate image in the evaluation by the recognition result evaluation unit.

[0012]

[0013] The invention of claim 7 has: a blank space detection unit that, for the character images judged to be single characters by the recognition result evaluation unit, detects blank spaces between characters from whether each character is a full-width or a half-width character and from the interval between the midpoints, in the character string direction, of two consecutive characters; and a blank space addition unit that inserts a space code at the corresponding position in the output character codes on the basis of the detection result of the blank space detection unit.

[0014]

[Operation] With the above configuration, in the invention of claim 1, the following operations are performed in a character recognition device that cuts out character string images from an input document image, then cuts out individual character images from each character string image, and recognizes the individual character images and converts them into the corresponding character codes.

[0015] The character element extraction unit extracts from the character string image the character elements, each a cluster of pixels constituting a character. The character width calculation unit calculates, by predetermined means, the full-width and half-width character widths from the positions or sizes of the character elements extracted by the character element extraction unit.

The character element division unit detects, for example from the presence or absence of discontinuities in the projection of the pixels onto the character string direction, any extracted character element narrower than the half-width character width calculated by the character width calculation unit and takes it as a sub-character element as it is. It also detects any character element wider than the half-width character width and divides it into a plurality of sub-character elements each no wider than the half-width character width.

For the sub-character elements generated by the character element division unit, the character candidate generation unit cuts out as a half-width character candidate image the image covered by the connected character element formed by connecting as many sub-character elements as possible without the connected width exceeding the half-width character width, and cuts out as a full-width character candidate image the image covered by the connected character element formed by connecting as many sub-character elements as possible without the connected width exceeding the full-width character width.

The recognition unit provisionally recognizes each half-width and full-width character candidate image generated by the character candidate generation unit as a single character and then calculates, by pattern matching or the like, the corresponding character code and an evaluation value indicating its accuracy. The recognition result evaluation unit substitutes the evaluation values obtained by the recognition unit for the half-width and full-width character candidate images into a predetermined function, compares them, and judges the character candidate image with the higher resulting evaluation value to be the correct character.
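The division and candidate-generation steps above can be sketched as follows. This is a minimal Python illustration under stated assumptions, not the patented implementation: elements are assumed to be (start, end) pixel spans along the character string direction with an exclusive end, widths are assumed to be integers, wide elements are assumed to be cut into equal parts (the text only requires each part to be no wider than the half-width character width), and all function names are hypothetical.

```python
import math


def split_elements(spans, half_width):
    """Turn character elements into sub-character elements.

    Narrow elements pass through unchanged; wider ones are cut into
    the fewest equal parts that are each no wider than half_width.
    """
    subs = []
    for start, end in spans:
        width = end - start
        if width <= half_width:
            subs.append((start, end))
            continue
        parts = math.ceil(width / half_width)
        for i in range(parts):
            a = start + (width * i) // parts
            b = start + (width * (i + 1)) // parts
            subs.append((a, b))
    return subs


def widest_candidate(subs, first, limit):
    """Connect sub-elements from index `first` while the overall width
    stays within `limit`. Returns (start, end, index_of_next_sub)."""
    start = subs[first][0]
    last = first
    while last + 1 < len(subs) and subs[last + 1][1] - start <= limit:
        last += 1
    return start, subs[last][1], last + 1


def candidates(subs, first, half_width, full_width):
    """Half-width and full-width candidate spans starting at `first`."""
    return {
        "half": widest_candidate(subs, first, half_width),
        "full": widest_candidate(subs, first, full_width),
    }


# Two narrow strokes followed by one element wider than a half-width character.
subs = split_elements([(0, 4), (5, 9), (12, 21)], half_width=5)
print(subs)
print(candidates(subs, 0, half_width=5, full_width=10))
```

Each starting position yields only two candidates, one per width class, so the recognizer is called far less often than when every pair of candidate positions is tried.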

[0016] In the invention of claim 2, the character printing pitch calculation means in the character width calculation unit calculates the character printing pitch from, for example, the positions of the character elements relative to one another, their size in the character string direction (width) and in the direction orthogonal to it (height), or their size relative to the line spacing, and calculates the full-width and half-width character widths from the result.

In the invention of claim 3, the provisional character width calculation means in the character width calculation unit calculates the provisional character width from the height of the character string image. The continuous part detection means detects, among the character elements extracted by the character element extraction unit, a part in which character elements whose width differs from the provisional character width by less than a predetermined value occur consecutively. The calculation means calculates the character printing pitch from the distance between the midpoints, in the character string direction, of the detected consecutive character elements and calculates the full-width and half-width character widths from the result.
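The pitch estimation of claim 3 can be illustrated with a short Python sketch. Several details are assumptions made here and are not stated in this passage: the provisional character width is taken to equal the line height (full-width glyphs being roughly square), the pitch is the mean midpoint distance over qualifying adjacent pairs, the full-width character width is set equal to the pitch, and the half-width character width to half of it. The function name is hypothetical.

```python
def estimate_widths(spans, line_height, tolerance):
    """Estimate (full_width, half_width) from character element spans.

    spans: (start, end) spans along the character string direction.
    Returns None when no two adjacent elements are both close to the
    provisional width.
    """
    provisional = line_height  # assumption: full-width glyphs are about square
    distances = []
    for (s0, e0), (s1, e1) in zip(spans, spans[1:]):
        close0 = abs((e0 - s0) - provisional) < tolerance
        close1 = abs((e1 - s1) - provisional) < tolerance
        if close0 and close1:
            # distance between the midpoints of two consecutive elements
            distances.append((s1 + e1) / 2 - (s0 + e0) / 2)
    if not distances:
        return None
    pitch = sum(distances) / len(distances)
    return pitch, pitch / 2  # assumption: full = pitch, half = pitch / 2


# Three full-width-like elements followed by two narrow ones.
spans = [(0, 19), (24, 44), (48, 67), (72, 78), (80, 86)]
print(estimate_widths(spans, line_height=20, tolerance=3))
```

Using midpoint distances rather than element widths makes the estimate include the inter-character gap, which is why it reflects the printing pitch instead of the glyph size.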

[0017] In the invention of claim 4, the provisional character width calculation means in the character width calculation unit calculates the provisional character width from the height of the character string image. Where character elements narrower than the calculated provisional character width occur consecutively, the continuous part detection means provisionally connects them as long as the width does not exceed the provisional character width, obtains the difference between the provisional character width and the width of each provisionally connected character element and each original character element, and detects a part in which character elements for which this difference is smaller than a predetermined value occur consecutively. The calculation means calculates the character printing pitch from the distance between the midpoints, in the character string direction, of the character elements in the part detected by the continuous part detection means and calculates the full-width and half-width character widths from the result.
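The provisional connection step of claim 4 might look like the following Python sketch. It assumes (start, end) spans with an exclusive end and a greedy left-to-right merge; the passage does not specify the merge order, so that choice and the function name are assumptions for illustration only.

```python
def provisionally_connect(spans, provisional_width):
    """Merge runs of narrow elements while the merged width stays within
    provisional_width. Elements already at least that wide are kept as is."""
    merged = []
    i = 0
    while i < len(spans):
        start, end = spans[i]
        if end - start < provisional_width:
            # try to absorb the following narrow elements
            while i + 1 < len(spans):
                next_start, next_end = spans[i + 1]
                is_narrow = next_end - next_start < provisional_width
                if is_narrow and next_end - start <= provisional_width:
                    end = next_end
                    i += 1
                else:
                    break
        merged.append((start, end))
        i += 1
    return merged


# Two separated characters (two strokes each) around one intact character.
elements = [(0, 8), (11, 19), (24, 44), (48, 56), (59, 68)]
print(provisionally_connect(elements, provisional_width=20))
```

After this step, the strokes of a separated character behave like one element of roughly full width, so they can contribute to the run of consecutive elements from which the pitch is measured.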

[0018] In the invention of claim 5, the half-width character recognition dictionary in the recognition unit is used to recognize, by pattern recognition or the like, the half-width character candidate images generated by the character candidate generation unit, and the full-width character recognition dictionary is used to recognize the full-width character candidate images generated by the character candidate generation unit.

In the invention of claim 6, the half-character judgment means in the recognition unit judges whether the recognition result for a full-width character candidate image is a character one half of which, when the single character is divided along the character string direction, has the same shape as a half-width character. When the half-character judgment means judges the result to be such a character, the priority evaluation means gives priority to the evaluation value of the full-width character candidate image in the evaluation by the recognition result evaluation unit.

[0019]

[0020] In the invention of claim 7, the blank space detection unit detects, for the character images judged to be single characters by the recognition result evaluation unit, the blank spaces between characters from whether each character is a full-width or a half-width character and from the interval between the midpoints, in the character string direction, of two consecutive characters. The blank space addition unit inserts a space code at the corresponding position in the output character codes on the basis of the detection result of the blank space detection unit.
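One way to picture blank (margin) detection from midpoint intervals is the Python sketch below. The rule used here is an assumption, since this passage does not give the exact criterion: two adjacent characters printed without a blank are expected to have midpoints separated by half the sum of their nominal widths, and any excess is counted in half-width units. The function name and data layout are hypothetical.

```python
def count_blanks(chars, full_width, half_width):
    """Estimate the number of half-width blanks between consecutive characters.

    chars: list of (midpoint, is_full_width) in reading order.
    """
    blanks = []
    for (mid0, full0), (mid1, full1) in zip(chars, chars[1:]):
        width0 = full_width if full0 else half_width
        width1 = full_width if full1 else half_width
        expected = (width0 + width1) / 2      # midpoint distance with no blank
        excess = (mid1 - mid0) - expected
        blanks.append(max(0, round(excess / half_width)))
    return blanks


# Full-width pitch 24, half-width pitch 12.
chars = [
    (12, True), (36, True),                   # adjacent full-width characters
    (84, True),                               # one full-width blank before this
    (102, False), (114, False),               # adjacent half-width characters
    (138, False),                             # one half-width blank before this
]
print(count_blanks(chars, full_width=24, half_width=12))
```

Because the expected distance depends on each character's width class rather than on its inked width, narrow glyphs printed at full-width pitch, such as 「１１」, do not produce spurious blanks under this rule.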

[0021]

[Embodiments] The character recognition device according to the present invention is described below on the basis of embodiments. The devices of the following embodiments naturally also include: a normalization unit that corrects to standard characters such characters as tall or wide characters and characters that are full-width in shape but large, like those in headlines; components of the overall system such as an output unit that outputs the recognized characters and a speech unit that translates into a foreign language or reads aloud on the basis of the output; and an input operation unit with which the user performs various operations and enters conditions known in advance. Since these are not directly related to the gist of the present invention, they are neither illustrated nor described.

(First Embodiment) FIG. 1 is a block diagram of the first embodiment of the present invention. In the figure:

1 is an image input unit to which an image is input.
2 is a character string extraction unit that detects the positions of character strings in the document image read by the image input unit 1 and cuts out character string images.
3 is a character element extraction unit that extracts character elements, each a cluster of pixels constituting a character, from the character string image cut out by the character string extraction unit 2.
4 is a character width calculation unit that calculates the full-width and half-width character widths from the positions or sizes of the character elements extracted by the character element extraction unit 3.
5 is a character element division unit that detects, among the character elements extracted by the character element extraction unit 3, any element narrower than the half-width character width calculated by the character width calculation unit 4 and takes it as a sub-character element as it is, and detects any element wider than the half-width character width and divides it into a plurality of sub-character elements each no wider than the half-width character width.
6 is a character candidate generation unit that, for the sub-character elements generated by the character element division unit 5, cuts out as a half-width character candidate image the image covered by the connected character element formed by connecting as many sub-character elements as possible without the connected width exceeding the half-width character width, and cuts out as a full-width character candidate image the image covered by the connected character element formed by connecting as many sub-character elements as possible without the connected width exceeding the full-width character width.
7 is a recognition unit that provisionally recognizes the half-width and full-width character candidate images generated by the character candidate generation unit 6 as candidate characters and calculates the corresponding character code and an evaluation value representing the probability that the recognition is correct.
8 is a recognition result evaluation unit that compares, on the basis of a predetermined function, the evaluation values obtained by the recognition unit for the half-width and full-width character candidate images and judges the character candidate image with the higher result to be the correct character.
9 is a blank space addition unit that, for the character images judged by the recognition result evaluation unit 8 to be correct single characters, detects the blank spaces between characters from whether each character is a full-width or a half-width character and from the interval between the midpoints, in the character string direction, of two consecutive characters, and inserts a space code at the corresponding position in the output character codes.
10 is a recognition result output unit that outputs the recognition result.

【００２２】[0022]

The operation of the character recognition device configured as above is described below, taking the horizontally written input image shown in FIG. 2 as an example. In FIG. 2, the circles (○) in the lower two lines stand for arbitrary characters that are not used directly in explaining the operation of this embodiment. The image input from the image input unit 1 is stored in an image memory (not shown) as binary data in which pixels forming characters are 1 and background pixels are 0.

The character string extraction unit 2 first obtains, from the document image stored in the image memory, the orthogonal projections of the character pixels in the vertical and horizontal directions, compares the widths of the projections and the intervals between them in the two directions, and determines whether the input document image is written vertically or horizontally (this technique is disclosed, for example, in Japanese Patent Application No. Sho 60-77633). It then divides the document image into blocks parallel to the character string direction, obtains for each block the projection of the character pixels in the character string direction, and takes the rectangular region bounded by the block boundary coordinates and the start and end coordinates of the projection as a character string within that block. By checking whether the character strings of the individual blocks overlap in the direction perpendicular to the character string, it connects the character strings of the blocks, obtains the coordinates of each character string, and cuts out the character string image. This is shown in FIG. 3(a) (this technique is disclosed, for example, in Japanese Patent Application No. Sho 60-106404, "Character Recognition Apparatus").

The character element extraction unit 3 extracts character elements from the character string image of FIG. 3(a) cut out by the character string extraction unit 2. As shown in FIG. 3(b), the character string image is projected in the direction perpendicular to the character string, each continuous run of the orthogonal projection of character pixels is taken as a character element, and its rectangle coordinates are obtained. The results are shown as s1, s2, ..., s10 in FIG. 3(c).

The character width calculation unit 4 calculates the full-width character width and the half-width character width from the coordinates of the character elements extracted by the character element extraction unit 3. Several methods of calculating the character width are conceivable; the three methods adopted in this embodiment are described below.
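The character element extraction step described above can be sketched as follows. This is a minimal illustration only, assuming a horizontally written line stored as rows of 0/1 pixels; the function name `extract_character_elements` and the list-of-lists image format are assumptions made for the example, not part of the specification.

```python
def extract_character_elements(image):
    """Return (x_start, x_end) for each run of columns containing character pixels.

    image: list of rows, each a list of 0 (background) or 1 (character pixel).
    The column sums form the projection perpendicular to the character string.
    """
    if not image:
        return []
    width = len(image[0])
    # Projection: number of character pixels in each column.
    profile = [sum(row[x] for row in image) for x in range(width)]

    elements = []
    start = None
    for x, count in enumerate(profile):
        if count > 0 and start is None:
            start = x                       # a run of character pixels begins
        elif count == 0 and start is not None:
            elements.append((start, x - 1))  # the run ended at the previous column
            start = None
    if start is not None:                    # run reaching the right edge
        elements.append((start, width - 1))
    return elements
```

Each returned pair plays the role of one character element s1, s2, ... in FIG. 3(c); the vertical extent of the rectangle could be obtained in the same way from the row sums.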

【００２３】[0023]

The first method is described with reference to FIG. 4(a). The height H of the character string is multiplied by a predetermined constant α to obtain a provisional character width W*. In printed Japanese documents the aspect ratio of most characters is generally close to 1, so α takes a value close to 1, although it will of course vary somewhat with corrections made for the printed typeface and with the details of the normalization processing. Next, because some characters are very short along the character string direction, such as "一" (the kanji numeral one) in vertical writing and "1" (the Arabic numeral one) in horizontal writing, the width Wmax of the widest character element (s3 in the figure) among the character elements whose width is approximately equal to the provisional character width W* (s3, s5, s9, s10 in the figure) is multiplied by a predetermined constant β1 to give the full-width character width Wz. That is, with Wi (i = 1, 2, ...) denoting the width of each character element, the full-width character width Wz is given by the following equations.

【００２４】[0024]

Wmax = max{ Wi | Wi / W* ≈ 1 }
Wz = β1 · Wmax (where β1 ≈ 1)

Here max{U} denotes the largest value in the set U, and A ≈ B indicates that A and B are approximately equal. The half-width character width Wh is calculated by multiplying the full-width character width Wz by a predetermined constant γ.
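The first method, together with the relation Wh = γ · Wz, can be sketched as follows. This is an illustrative sketch under stated assumptions: the specification only says that Wi is "approximately equal" to W*, so the `tolerance` parameter and all default constant values here are hypothetical choices for the example.

```python
def widths_method1(element_widths, line_height,
                   alpha=1.0, beta1=1.0, gamma=0.5, tolerance=0.15):
    """Estimate (Wz, Wh) from character element widths and the string height H.

    tolerance is an assumed reading of "Wi / W* approximately 1";
    the specification does not give a numeric criterion.
    """
    provisional = alpha * line_height                  # W* = alpha * H
    near_full = [w for w in element_widths
                 if abs(w / provisional - 1.0) <= tolerance]
    if not near_full:
        return None                                    # no element close to W*
    w_max = max(near_full)                             # Wmax
    wz = beta1 * w_max                                 # full-width character width
    wh = gamma * wz                                    # half-width character width
    return wz, wh
```

For element widths 12, 18, 40, 20, 38, 10 on a line of height 40, only the widths 40 and 38 are treated as close to W*, so Wmax = 40.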

【００２５】[0025]

Wh = γ · Wz (where γ ≈ 0.5)

The second method is described with reference to FIG. 4(b). The provisional character width W* is calculated as in the first method. Among the character elements, for those whose width is approximately equal to W* and of which several lie adjacent to one another (s9, s10 in the figure), the average or the maximum of the distances between their midpoints in the character string direction (d1 in the figure) is taken as the character printing pitch P, and the full-width character width Wz and half-width character width Wh are calculated by the following equations. The aim is to evaluate characters that are as long and as numerous as possible while responding flexibly to changes in the character printing pitch.

【００２６】[0026]

Wz = β2 · P (where β2 ≈ 1 and β2 < 1)
Wh = γ · Wz (where γ ≈ 0.5)

The third method is described with reference to FIG. 4(c). The provisional character width W* is calculated as in the first method. Where character elements narrower than W* occur consecutively (s6, s7 in the figure), they are provisionally connected, as long as the combined width does not exceed W* (s' in the figure), and treated as a single character. For character elements, or provisionally connected character elements, whose width is approximately equal to W* and of which several lie adjacent to one another (s5, s' and s9, s10 in the figure), the average or the maximum of the distances between their midpoints in the character string direction (d1, d2 in the figure) is taken as the character printing pitch P, and the full-width character width Wz and half-width character width Wh are calculated by the following equations. This too is intended to evaluate characters that are as long and as numerous as possible.

【００２７】[0027]

Wz = β2 · P (where β2 ≈ 1 and β2 < 1)
Wh = γ · Wz (where γ ≈ 0.5)

Which of the three methods is adopted naturally depends on the nature of the original document being processed: whether the characters are set relatively tightly, as in a newspaper; whether the character spacing is relatively wide, as in a patent specification; whether full-width kanji are mixed with half-width digits and alphabetic characters, as in an academic paper; whether the spacing is uneven; and so on. The values of the constants γ, β2, etc. are likewise chosen to suit the document to be recognized.
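The pitch-based second and third methods can be sketched as follows. This is only an illustration: "adjacent" is assumed to mean consecutive in the element list, the `tolerance` criterion for "approximately equal to W*" and the default values of β2 and γ are hypothetical, and the greedy left-to-right joining of narrow elements is one possible reading of the provisional connection step.

```python
def estimate_pitch(elements, provisional, tolerance=0.15,
                   use_max=False, merge_narrow=False):
    """Estimate the character printing pitch P from (x_start, x_end) elements.

    merge_narrow=False corresponds to the second method, True to the third.
    """
    def width(e):
        return e[1] - e[0] + 1

    def near(e):  # assumed reading of "width approximately equal to W*"
        return abs(width(e) / provisional - 1.0) <= tolerance

    if merge_narrow:
        joined = []
        for e in elements:
            # Provisionally connect consecutive narrow elements while the
            # joined width does not exceed the provisional width W*.
            if (joined and not near(joined[-1]) and not near(e)
                    and e[1] - joined[-1][0] + 1 <= provisional):
                joined[-1] = (joined[-1][0], e[1])
            else:
                joined.append(e)
        elements = joined

    # Midpoint distances between neighbouring elements that both look full-width.
    distances = [(b[0] + b[1]) / 2 - (a[0] + a[1]) / 2
                 for a, b in zip(elements, elements[1:])
                 if near(a) and near(b)]
    if not distances:
        return None
    return max(distances) if use_max else sum(distances) / len(distances)


def widths_from_pitch(pitch, beta2=0.95, gamma=0.5):
    """Wz = beta2 * P and Wh = gamma * Wz (constant values are assumptions)."""
    wz = beta2 * pitch
    return wz, gamma * wz
```

With elements (0, 39), (44, 83), (90, 107), (110, 127), (132, 171) and W* = 40, the second method finds a single qualifying pair and a pitch of 44, while the third method first joins the two narrow elements into (90, 127) and averages three midpoint distances.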

【００２８】[0028]

The operation that the character element division unit 5 performs on each character element si extracted by the character element extraction unit 3 is described with reference to FIG. 5. Here Wh denotes the half-width character width calculated by the character width calculation unit 4, and Wi denotes the width of the character element si. First, the number of divisions N is determined:

N = [Wi / Wh + 0.5]

where [X] is the Gauss bracket, denoting the largest integer not exceeding X.

【００２９】[0029]

Next, when the number of divisions N > 1, as for s4, s5, s7, and s10 in FIG. 5(a), the character element si under consideration is divided into N parts of equal width. When N = 1, as for s1, s2, s3, s6, s8, and s9 in FIG. 5(b), nothing is done.
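The division rule can be sketched as follows. This is a minimal sketch assuming character elements are given as inclusive pixel ranges (x_start, x_end); the function name is hypothetical, and treating N = 0 (a very narrow element) in the same way as N = 1 is an assumption, since the specification only discusses N = 1 and N > 1.

```python
import math


def split_character_element(element, half_width):
    """Split one character element into N equal-width sub-character elements.

    N = floor(Wi / Wh + 0.5), as given by the Gauss bracket in the text.
    """
    start, end = element
    width = end - start + 1                       # Wi
    n = math.floor(width / half_width + 0.5)      # number of divisions N
    if n <= 1:
        return [element]                          # used as a sub-element as it is
    # Boundaries of N parts of (nearly) equal width, rounded to whole pixels.
    bounds = [start + round(k * width / n) for k in range(n + 1)]
    return [(bounds[k], bounds[k + 1] - 1) for k in range(n)]
```

For example, with Wh = 20 an element spanning pixels 0 to 39 (Wi = 40) gives N = 2 and is split into (0, 19) and (20, 39), while an element of width 18 gives N = 1 and is left unchanged.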

【００３０】[0030]

Next, with reference to FIG. 6, the processing is described in which the character candidate generation unit 6, the recognition unit 7, and the recognition result evaluation unit 8 process the sub-character elements generated by the character element division unit 5 in order, starting from the sub-character element at the head of the character string, and fix the recognition results one by one. First, the character candidate generation unit 6 takes the leading sub-character element s1 and generates a half-width character candidate image c11, cut out by a connected character element formed by connecting as many sub-character elements as possible without the connected width exceeding the half-width character width Wh, and a full-width character candidate image c21, cut out by a connected character element formed by connecting as many sub-character elements as possible without the connected width exceeding the full-width character width Wz. The recognition unit 7 then recognizes each of the two candidate images as a single character image and calculates a character code and an evaluation value (this evaluation technique is disclosed, for example, in Japanese Patent Application No. Sho 63-312288, "Character Recognition Method"). To improve recognition speed and accuracy, half-width character candidate images are recognized using a half-width character recognition dictionary, and full-width character candidate images are recognized using a full-width character recognition dictionary. In the figure, the half-width character candidate image c11 yields the character code "8" (the Arabic numeral) with an evaluation value of 37, and the full-width character candidate image c21 yields the character code "昭" with an evaluation value of 61. The recognition result evaluation unit 8 compares these results and judges the candidate image with the higher evaluation to be the character.
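The generation of the two candidates from a given starting sub-character element can be sketched as follows. This is an illustration only: sub-character elements are assumed to be inclusive pixel ranges sorted along the character string, and the function names are hypothetical.

```python
def connect_within(sub_elements, first, max_width):
    """Connect sub_elements[first], sub_elements[first + 1], ... for as long as
    the connected width does not exceed max_width.

    Returns the connected range (x_start, x_end) and the index of the first
    sub-character element that was not used.
    """
    start, end = sub_elements[first]
    next_index = first + 1
    for k in range(first + 1, len(sub_elements)):
        if sub_elements[k][1] - start + 1 > max_width:
            break                                  # adding this one would be too wide
        end = sub_elements[k][1]
        next_index = k + 1
    return (start, end), next_index


def generate_candidates(sub_elements, first, half_width, full_width):
    """Return the half-width and full-width candidates starting at `first`."""
    half = connect_within(sub_elements, first, half_width)
    full = connect_within(sub_elements, first, full_width)
    return half, full
```

With sub-character elements (0, 12), (15, 38), (44, 60), (62, 80), Wh = 20 and Wz = 40, the half-width candidate starting at the first element is (0, 12) alone, and the full-width candidate is (0, 38), formed from the first two elements. The returned index indicates where processing would resume once one of the candidates has been accepted.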

【００３１】[0031]

This comparison could simply be made on the magnitude of the evaluation values of the recognition results (recognition evaluation values). In this embodiment, however, to obtain more accurate results, the function operations described below are applied to the evaluation values depending on conditions, and the final comparison is made on the magnitude of the resulting values (final evaluation values). The recognition evaluation value of the half-width character candidate image is denoted v1 and that of the full-width character candidate image v2; the final evaluation value of the half-width character candidate image is denoted V1 and that of the full-width character candidate image V2.

【００３２】[0032]

First, when the recognition result of the full-width character candidate image is a character whose front half has the same shape as a single half-width character, that is, the left half in a left-to-right horizontally written document or the upper half in a vertically written document (for example, in horizontal writing the left halves of "化" and "八" have the same shapes as "イ" and "ノ" respectively), the value obtained by multiplying the recognition evaluation value v2 of the full-width character candidate image by a constant δ1 (where δ1 > 1.0) is taken as the final evaluation value V2 of the full-width character candidate image.

【００３３】[0033]

That is, V2 = δ1 × v2. In all other cases, the value obtained by multiplying the recognition evaluation value v2 of the full-width character candidate image by a constant δ2 (where 1.0 < δ2 < δ1) is taken as the final evaluation value V2. That is, V2 = δ2 × v2.

【００３４】[0034]

For the half-width character candidate image, the recognition evaluation value v1 is used unchanged as the final evaluation value V1. That is, V1 = v1. If V1 < V2, the full-width character candidate image is judged to be the character; if V1 ≥ V2, the half-width character candidate image is judged to be the character. This weighting is applied so that the longest possible span of pixels becomes the object of evaluation and recognition.
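The final comparison can be sketched as follows. This is only a sketch: the specification requires δ1 > δ2 > 1.0 but gives no numeric values, so the defaults below are assumptions (δ2 = 1.1 roughly matches the step from a recognition evaluation value of 61 to a final evaluation value of 67 in the FIG. 6 example), and the boolean flag stands in for whatever test decides that the front half of the recognized full-width character looks like a half-width character.

```python
def choose_candidate(v1, v2, front_half_matches_half_width,
                     delta1=1.2, delta2=1.1):
    """Apply the weighting to the full-width evaluation value and compare.

    Returns ("full" or "half", V1, V2).
    """
    final_half = v1                                     # V1 = v1
    weight = delta1 if front_half_matches_half_width else delta2
    final_full = weight * v2                            # V2 = delta1*v2 or delta2*v2
    # V1 < V2 selects the full-width candidate; V1 >= V2 the half-width one.
    kind = "full" if final_half < final_full else "half"
    return kind, final_half, final_full
```

With v1 = 37 and v2 = 61 the full-width candidate is selected, as in the example of FIG. 6.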

【００３５】[0035]

In the case of FIG. 6, the final evaluation value V1 of the half-width character candidate image c11 is 37 and the final evaluation value V2 of the full-width character candidate image c21 is 67. Since V1 < V2, the full-width character candidate image c21 is judged to be the character, and the character code "昭" becomes the recognition result. Once the recognition result for one character has been fixed in this way, the same processing is repeated in turn for the adjacent sub-character elements, and the recognition result "昭和３５年", shown in bold frames in the character code column of the figure, is obtained. This is illustrated in FIG. 6. When the half-width character candidate image c11 and the full-width character candidate image c21 generated by the character candidate generation unit 6 coincide, the image is treated as a full-width character without any further processing for recognition.

【００３６】[0036]

Once the recognition results have been fixed for all characters in the character string, the margin adding unit 9 determines whether margins are present and, where a margin is detected, inserts a margin at the corresponding position in the output character codes. This processing is described with reference to FIG. 7. In the figure, (a) corresponds to FIG. 3 and FIG. 5(a), and (b) has been added for the purposes of explanation. The character candidate images judged to be characters by the recognition result evaluation unit 8 are denoted ci (i = 1, 2, ...), and full-width characters are marked with an asterisk (*). For each character candidate image ci judged to be a character, the margin adding unit 9 successively calculates the distance di between its midpoint and that of the next character candidate image ci+1 in the character string direction, and determines from this midpoint distance whether a margin is present, as follows. Here P is the character printing pitch, and ε1, ε2, and ε3 are constants of about 1.0, 0.75, and 0.5 respectively; the specific values are chosen according to the printed content.
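The case-by-case thresholds listed in the paragraphs that follow can be sketched in code as shown here. This is a minimal illustration: the tuple layout of each recognized character, the use of a plain space as the margin code, and the default ε values (taken from the approximate figures given above) are assumptions made for the example.

```python
def insert_margins(characters, pitch, eps1=1.0, eps2=0.75, eps3=0.5, margin=" "):
    """Return the recognized text with a margin code inserted where detected.

    characters: list of (code, is_full_width, x_start, x_end) in string order.
    pitch: the character printing pitch P.
    """
    out = []
    for i, (code, is_full, start, end) in enumerate(characters):
        out.append(code)
        if i + 1 == len(characters):
            break                                     # no following character
        _, next_full, next_start, next_end = characters[i + 1]
        # Midpoint distance di in the character string direction.
        d = (next_start + next_end) / 2 - (start + end) / 2
        if is_full and next_full:
            threshold = eps1 * pitch                  # both full-width
        elif is_full or next_full:
            threshold = eps2 * pitch                  # one full-width, one half-width
        else:
            threshold = eps3 * pitch                  # both half-width
        if d > threshold:
            out.append(margin)                        # otherwise: no margin
    return "".join(out)
```

For instance, with P = 44, a full-width character spanning pixels 0 to 39 followed by a half-width character spanning 60 to 75 gives a midpoint distance of 48, which exceeds ε2 · P = 33, so a margin is inserted between them.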

【００３７】[0037]

1. When characters ci and ci+1 are both full-width characters, there is a margin between ci and ci+1 if di > ε1 · P.
2. When one of characters ci and ci+1 is a full-width character and the other is a half-width character, there is a margin between ci and ci+1 if di > ε2 · P.
3. When characters ci and ci+1 are both half-width characters, there is a margin between ci and ci+1 if di > ε3 · P.

【００３８】[0038]

4. In all other cases, there is no margin between ci and ci+1.

In FIG. 7(a), the midpoint distance d5 between the full-width character c5* and the half-width character c6 satisfies d5 > ε2 × P, so a margin code is inserted between the fifth and sixth characters of the recognition result. In FIG. 7(b), on the other hand, the gap between characters c1* and c2* is wide, but both are full-width characters and the midpoint distance satisfies d1 < ε1 × P, so it is judged not to be a margin.

(Second Embodiment) The basic configuration of the second embodiment of the present invention is the same as that of the first embodiment. The configuration and operation of each unit are therefore not described again on the basis of an overall configuration diagram; only the principle, purpose, configuration, operation, and effect of the parts specific to this embodiment are described.

【００３９】[0039]

Principle. In ordinary printed documents, the characters are usually either all full-width or all half-width. Even when both types are used together, one of them predominates and the other is used only exceptionally in most cases. Moreover, when the other type is used exceptionally, the exceptional characters almost always appear consecutively. Therefore, when judging whether a given character is full-width or half-width, any characters before or after it that have already been judged can be used in the judgment.

【００４０】[0040]

Purpose. The aim is to improve character recognition speed and to make more effective use of the half-width and full-width character recognition dictionaries, and in addition to improve the performance of the system as a whole by treating portions printed in a different type of character as objects of translation or the like separately from, or independently of, the other portions.

【００４１】[0041]

Configuration. At the outset, or at the beginning of a sentence, the character type is judged using the components (requirements) of the first embodiment according to the inventions of claims 1 to 6. Once the types of several characters have been fixed, each subsequent character is provisionally assumed to be of the same type as the characters already judged, and character recognition is performed using only one of the full-width character recognition dictionary and the half-width character recognition dictionary. If a character that cannot be recognized appears, character recognition is performed using the other dictionary. If the character still cannot be recognized, it is judged using the same components (requirements) as in the first embodiment, and the user is alerted to this.
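The fallback order of the second embodiment can be sketched as follows. This is an illustration under assumptions: the specification does not say how a character is found to be "unrecognizable", so an evaluation value below a caller-supplied threshold is used here as a stand-in, and the recognizer and full-procedure callables are hypothetical placeholders for the dictionary-based recognition and for the first embodiment's procedure.

```python
def recognize_with_fallback(image, assumed_type, recognize_half, recognize_full,
                            first_embodiment_procedure, threshold):
    """Try the dictionary of the assumed type, then the other, then fall back.

    recognize_half / recognize_full: callables returning (code, evaluation_value).
    first_embodiment_procedure: callable returning (code, "half" or "full").
    Returns (code, character_type, alert_user).
    """
    recognizers = {"half": recognize_half, "full": recognize_full}
    other = "half" if assumed_type == "full" else "full"
    for kind in (assumed_type, other):
        code, value = recognizers[kind](image)
        if value >= threshold:        # assumed criterion for "recognizable"
            return code, kind, False
    # Neither dictionary succeeded: use the first embodiment's procedure
    # and flag the character so that the user can be alerted.
    code, kind = first_embodiment_procedure(image)
    return code, kind, True
```

In this sketch the third element of the result indicates whether the user should be alerted, mirroring the last sentence of the paragraph above.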

【００４２】[0042]

Effect. The effect is the converse of the purpose: not only the judgment of character type and the character recognition itself but also the performance of the system as a whole is improved. The present invention has been described above on the basis of embodiments, but it is of course not limited to those embodiments. That is, the following are also included in the present invention.

【００４３】[0043]

(1) For manufacturing or other reasons, a single component (requirement, step) recited in the claims is implemented as a plurality of components; conversely, a plurality of components are implemented as one; or these are combined as appropriate.

(2) The image cut-out means incorporates a storage unit, such as a removable and mountable floppy disk, and is separate from the other means.

【００４４】[0044]

The other, separate means are able to accept this storage unit once it has been removed. Likewise, the output unit outputs to a floppy disk or the like. This improves portability, allows effective use of an expensive character recognition main unit and printing unit, and permits use in combination with other equipment.

(3) The characters to be recognized are not limited to kanji and Arabic numerals, but include kana, Hangul, alphabetic characters, combinations of these with kanji, and symbols such as "=". Various other symbols are of course also included.

【００４５】[0045]

(4) The writing direction is not limited to any one of left-to-right horizontal writing, right-to-left horizontal writing, and vertical writing, and the device also has a function that allows the user to input the direction in advance. In the case of vertical writing there are two variants, with lines running from left to right or from right to left, and the same applies to these.

【００４６】[0046]

(5) A function is added that allows a user who has looked at the characters and content of the document to be processed to input the constants β1, β2, ζ1, ζ2, γ, etc. in advance, on the basis of a rough estimate from the type of document (newspaper, collection of papers, patent specification, etc.) or from its visual impression, and to change or correct them after seeing a provisional recognition result.

(6) Likewise, a function is added that allows the positions of half-width characters, special symbols, and the like to be input to the device by marking the relevant portions of the characters on the paper, before it is read, with underlines, lines above and below, or the like using a predetermined marker. This would be effective in newspaper read-aloud systems for visually impaired persons and school children, and in translation systems for foreign-language papers in fields where many special symbols are used.

【００４７】[0047]

(7) A function for correcting and learning the recognition processing is added: when the system as a whole, such as a translation system, detects that translation is not succeeding, or when the user notices that the character recognition results contain many errors, the character recognition device changes the various constants automatically or on instruction.

(8) The evaluation method or recognition means used in character recognition is not limited to pattern matching against a dictionary; other methods such as the decision tree method (Japanese Patent Application No. Hei 5-68586, "Decision Tree Type Character Recognition Device") are adopted.

【００４８】[0048]

(9) A function is added that judges a character to be a full-width character when one of the divided character elements within a full-width character is determined not to constitute a half-width character, either as a result of matching it as a half-width character or, before that, from the length of its orthogonal projection (for example, "昭" (S1, S2) in FIG. 3(c)). Conversely, a function is added that judges a character to be a full-width character when, within what would be a half-width character, it is determined not to be a half-width character, either as a result of matching it as a half-width character or, before that, from the length of its orthogonal projection (for example, "日" (S10) in FIG. 3(c)).

【００４９】[0049]

(10) In Western documents, spaces are placed between words and then adjusted so that characters fall at both ends of each line. A function for handling such spaces between words, as opposed to spaces between characters, may therefore be added.

【００５０】[0050]

[Effects of the Invention] As described above, the present invention provides a character element division unit that first divides all extracted character elements into narrow sub-character elements no wider than the half-width character width; a character candidate generation unit that connects the divided character elements and cuts out a half-width character candidate image and a full-width character candidate image; and a recognition result evaluation unit that recognizes each of the half-width and full-width character candidate images and selects the one with the higher evaluation value as the correct character. Consequently, by performing at most two recognitions per character, one of the half-width character candidate image and one of the full-width character candidate image, characters can be cut out accurately even from documents printed with a mixture of full-width and half-width characters and from documents with an irregular pitch. In addition, because it can be determined whether each character is a full-width character or a half-width character, margins can be detected accurately.

[Brief description of the drawings]

FIG. 1 is a configuration diagram of a character recognition device according to an embodiment of the present invention.

FIG. 2 is an example of an input document image.

FIG. 3 is a diagram for explaining the character element extraction processing.

FIG. 4 is a diagram for explaining the character width calculation processing.

FIG. 5 is a diagram for explaining the character element division processing.

FIG. 6 is a diagram for explaining the candidate character image generation processing, the recognition processing, and the recognition result evaluation processing.

FIG. 7 is a diagram for explaining the margin addition processing.

[Explanation of symbols]

1 Image input unit
2 Character string extraction unit
3 Character element extraction unit
4 Character width calculation unit
5 Character element division unit
6 Candidate character image generation unit
7 Recognition unit
8 Recognition result evaluation unit
9 Margin adding unit
10 Recognition result output unit

Continuation of the front page

(72) Inventor: Satoshi Emura, c/o Matsushita Electric Industrial Co., Ltd., 1006 Oaza Kadoma, Kadoma City, Osaka Prefecture
(56) References: JP-A-4-282789 (JP, A); JP-A-5-166009 (JP, A); JP-A-63-83887 (JP, A)
(58) Fields searched (Int. Cl.⁷, DB name): G06K 9/62 620; G06K 9/20 340; G06K 9/34; JICST file (JOIS)

Claims

(57) [Claims]

1. A character recognition device that cuts out a character string image from an input document image, further cuts out individual character images from each character string image, and recognizes each character image to convert it into a corresponding character code, the device comprising: a character element extraction unit that extracts, from the character string image, character elements each of which is a cluster of pixels constituting a character; a character width calculation unit that calculates, by predetermined means, the width of a full-width character and the width of a half-width character from the positions or sizes of the character elements extracted by the character element extraction unit; a character element dividing unit that, among the character elements extracted by the character element extraction unit, detects any character element narrower than the half-width character width calculated by the character width calculation unit and treats it as a sub character element as it is, detects any character element wider than the half-width character width and divides it into a plurality of sub character elements each narrower than the half-width character width, and thereby turns all character elements into sub character elements narrower than the half-width character width; a character candidate generation unit that treats every sub character element generated by the character element dividing unit as a recognition target on the assumption that it may constitute a single character together with the preceding and following character elements or sub character elements in the character string, cuts out, as a half-width character candidate image, a character element image connected so as to be as wide as possible without the connected width exceeding the half-width character width, and cuts out, as a full-width character candidate image, a character element image connected so as to be as wide as possible without the connected width exceeding the full-width character width; a recognition unit that provisionally recognizes each of the half-width character candidate image and the full-width character candidate image generated by the character candidate generation unit as a single character and calculates a corresponding character code and an evaluation value indicating its certainty; and a recognition result evaluation unit that compares the evaluation values obtained for the half-width character candidate image and the full-width character candidate image by substituting them into a predetermined function, and determines the character candidate image with the higher resulting evaluation value to be the correct character.

2. The character recognition device according to claim 1, wherein the character width calculation unit includes character printing pitch calculation means for calculating a character printing pitch from the positions or sizes of the character elements and calculating the width of a full-width character and the width of a half-width character from the calculation result.

3. The character recognition device according to claim 1, wherein the character width calculation unit includes: provisional character width calculation means for calculating a provisional character width from the height of the character string image; continuous portion detecting means for detecting, among the character elements extracted by the character element extraction unit, a portion in which character elements whose width differs from the provisional character width by less than a predetermined value occur consecutively; and calculation means for calculating the character printing pitch from the distances between the midpoints, in the character string direction, of the character elements in the detected continuous portion, and calculating the width of a full-width character and the width of a half-width character from the calculation result.

4. The character recognition device according to claim 1, wherein the character width calculation unit includes: provisional character width calculation means for calculating a provisional character width from the height of the character string image; means for provisionally connecting character elements narrower than the calculated provisional character width within a range in which the connected width does not exceed the provisional character width; continuous portion detecting means for detecting a portion in which character elements, among the provisionally connected character elements and the original character elements, whose width differs from the provisional character width by less than a predetermined value occur consecutively; and calculation means for calculating the character printing pitch from the distances between the midpoints, in the character string direction, of the character elements in the continuous portion detected by the continuous portion detecting means, and calculating the width of a full-width character and the width of a half-width character from the calculation result.

5. The character recognition device according to claim 1, wherein the recognition unit includes: a half-width character recognition dictionary used for recognizing the half-width character candidate image generated by the character candidate generation unit; and a full-width character recognition dictionary used for recognizing the full-width character candidate image generated by the character candidate generation unit.

6. The character recognition device according to claim 1, 2, 3, 4, or 5, further comprising: half-character determination means for determining whether the recognition result of a full-width character candidate image is a character one half of which, when the character is divided in two in the character string direction, has the same shape as a single half-width character; and priority evaluation means for giving priority to the evaluation value of the full-width character candidate image in the evaluation by the recognition result evaluation unit when the half-character determination means determines that the character is such a character.

7. The character recognition device according to claim 1, further comprising: a margin detection unit that, for the character images each determined to be a single character by the recognition result evaluation unit, detects a margin between characters from whether each of two consecutive characters is a full-width character or a half-width character and from the interval between their midpoints in the character string direction; and a margin addition unit that inserts a margin code at the corresponding position in the output character codes based on the detection result of the margin detection unit.
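By way of illustration only, the margin detection recited in claim 7 might be sketched as follows. The claims do not fix a concrete rule, so the sketch assumes one: the expected midpoint interval of two adjacent characters with no margin between them is half the sum of their widths, and any excess is rounded to a whole number of half-width margins. The function name `insert_margins`, the tuple layout, and the use of a space as the margin code are all hypothetical.

```python
from typing import List, Tuple

# (character code, midpoint x in the character string direction, is full-width)
Char = Tuple[str, float, bool]


def insert_margins(
    chars: List[Char], half_w: float, full_w: float, margin_code: str = " "
) -> str:
    """Insert margin codes between recognized characters.

    Assumed rule (not taken from the claims): excess midpoint distance over
    the expected distance is rounded to a count of half-width margins.
    """
    out: List[str] = []
    for idx, (code, mid, is_full) in enumerate(chars):
        if idx > 0:
            _, prev_mid, prev_full = chars[idx - 1]
            prev_w = full_w if prev_full else half_w
            cur_w = full_w if is_full else half_w
            expected = (prev_w + cur_w) / 2  # midpoint interval with no margin
            extra = (mid - prev_mid) - expected
            count = int(extra / half_w + 0.5)  # round to nearest margin count
            if count > 0:
                out.append(margin_code * count)
        out.append(code)
    return "".join(out)
```

Because the expected interval depends on whether each neighbor is full-width or half-width, the same midpoint distance can indicate a margin between two half-width characters and no margin between two full-width characters, which is why the full-width/half-width decision is needed before margins are detected.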