JP5105440B2

JP5105440B2 - Information processing apparatus, character recognition method, character recognition program, and computer-readable recording medium on which character recognition program is recorded

Info

Publication number: JP5105440B2
Application number: JP2008293554A
Authority: JP
Inventors: 千絵秀石
Original assignee: Sharp Corp
Current assignee: Sharp Corp
Priority date: 2008-11-17
Filing date: 2008-11-17
Publication date: 2012-12-26
Anticipated expiration: 2028-11-17
Also published as: JP2010122749A

Description

本発明は、情報処理装置、文字認識方法、文字認識プログラム、および、文字認識プログラムを記録した、コンピュータ読取り可能な記録媒体に関し、特に、外枠に囲まれた文字を認識するための情報処理装置、文字認識方法、文字認識プログラム、および、文字認識プログラムを記録した、コンピュータ読取り可能な記録媒体に関する。 The present invention relates to an information processing apparatus, a character recognition method, a character recognition program, and a computer-readable recording medium on which a character recognition program is recorded, and in particular, an information processing apparatus for recognizing a character surrounded by an outer frame. The present invention relates to a character recognition method, a character recognition program, and a computer-readable recording medium on which the character recognition program is recorded.

近年、光学式文字読取装置(以下、「ＯＣＲ」という)などの文字読取装置は、名刺や雑誌といった多様な読取対象への対応が望まれており、認識できなければならない文字や記号が増加している。ＯＣＲにおいては、認識できる文字とはつまり、保持している辞書内にあらかじめ登録している文字パターンを指す。辞書内に登録されている文字パターン以外の文字は認識することができない。 In recent years, character readers such as optical character readers (hereinafter referred to as “OCR”) have been desired to support various reading objects such as business cards and magazines, and the number of characters and symbols that must be recognized has increased. ing. In OCR, a recognizable character means a character pattern registered in advance in a stored dictionary. Characters other than the character patterns registered in the dictionary cannot be recognized.

特に、四角付きの文字（四角枠で囲まれた文字）、丸付きの文字（丸枠で囲まれた文字）は、丸(○)や四角(□)の記号と漢字などを自由に組み合わせることで多数の文字パターンを作成することができる。例えば、四角付きの「交」や丸付きの「時」などが挙げられる。これらの文字は雑誌やビジネス文章に多く見られ、たとえば、「交」は交通手段、「時」は営業時間といったような店舗や会社の項目情報を表すのに使用されている。 In particular, characters with squares (characters surrounded by a square frame) and characters with circles (characters surrounded by a round frame) can be freely combined with symbols such as circle (○) or square (□) and kanji. A large number of character patterns can be created. For example, “intersection” with a square, “hour” with a circle, and the like. These characters are often found in magazines and business texts. For example, “community” is used to represent item information of a store or company such as transportation means and “time” is business hours.

特許文献１には、丸の中にある文字パターンを切り出し、切り出した文字を認識する方法が開示されている。特許文献１でも述べてられている通り、画像中の丸を検出する手段としては、既知の技術である、ハフ変換（非特許文献１）が主に用いられている。特許文献１では丸中文字の文字パターンを切り出す方法として、ハフ変換で円の抽出を行ない、円パターンの内側よりも若干半径の小さい仮想的な円の外側の黒画素を打ち抜いて、丸枠を除去する方法が述べられている。
特開平６−１１１０７０号公報石立喬著、“Hough変換による画像からの直線や円の検出”［online］、２００５年８月３０日、ＣｏｄｅＺｉｎｅ、［２００８年１０月１５日検索］、インターネット＜ＵＲＬ：http://codezine.jp/a/article.aspx?aid=153，＞ Patent Document 1 discloses a method of cutting out a character pattern in a circle and recognizing the cut out character. As described in Patent Document 1, Hough transform (Non-Patent Document 1), which is a known technique, is mainly used as means for detecting a circle in an image. In Patent Document 1, as a method of cutting out a character pattern of a circle character, a circle is extracted by Hough transformation, a black pixel outside a virtual circle having a slightly smaller radius than the inside of the circle pattern is punched, and a round frame is formed. A method of removal is described.
Japanese Patent Laid-Open No. 6-1111070 Satoshi Ishidate, “Detection of straight lines and circles from images by Hough transform” [online], August 30, 2005, CodeZine, [October 15, 2008 search], Internet <URL: http: // codezine. jp / a / article.aspx? aid = 153,>

丸や四角などの外枠に囲まれた文字をＯＣＲで認識するためには、外枠付きの文字（合成文字）の文字パターンを予め辞書に登録する必要がある。しかし、外枠と文字とは自由に組み合わせて作成することができる。そのため、このような合成文字の全てを網羅して登録することは手間がかかる上、登録したとしても、入力文字パターンと辞書文字パターンとを比較する処理量が増えてしまう。 In order to recognize a character surrounded by an outer frame such as a circle or square by OCR, it is necessary to register a character pattern of a character (synthetic character) with an outer frame in the dictionary in advance. However, the outer frame and the characters can be created in any combination. For this reason, it is troublesome to register all such composite characters in a comprehensive manner, and even if they are registered, the amount of processing for comparing the input character pattern and the dictionary character pattern increases.

また、特許文献１の方法によると、複数の円がハフ変換により見つかった場合は、包含関係にある円のうち最も外側の円の中心を共有しない円を無視することが記載されている。したがって、もし、内側が歪んだ円（例えば右側と左側で黒画素の分布が均一ではないような円）の場合は、外側の円は抽出できても内側の円は抽出できず、うまく円を除去できないことになる。 Further, according to the method of Patent Document 1, it is described that when a plurality of circles are found by the Hough transform, a circle that does not share the center of the outermost circle among the circles in an inclusive relationship is described. Therefore, if the inner circle is distorted (for example, a circle whose black pixel distribution is not uniform on the right and left sides), the outer circle can be extracted, but the inner circle cannot be extracted. It cannot be removed.

本発明は、上記のような問題を解決するためになされたものであって、その目的は、丸や四角といった外枠付きの合成文字の文字パターンを辞書に登録しておかなくても、精度良く外枠に囲まれた文字を認識することのできる情報処理装置、文字認識方法、文字認識プログラム、および、文字認識プログラムを記録した、コンピュータ読取り可能な記録媒体を提供することである。 The present invention has been made in order to solve the above-described problems, and its purpose is to achieve accuracy without registering a character pattern of a composite character with an outer frame such as a circle or square in a dictionary. It is an object to provide an information processing apparatus, a character recognition method, a character recognition program, and a computer-readable recording medium on which a character recognition program is recorded.

この発明のある局面に従う情報処理装置は、文字列画像より、外枠に囲まれた文字を認識するための情報処理装置であって、文字列画像から、１文字分に相当する矩形画像を切り出すための文字切り出し手段と、切り出された矩形画像内の白黒画素の分布に基づいて、矩形画像内の文字形状が、所定の種別の外枠形状と一致するか否かを判定するための形状判定手段と、形状判定手段により一致すると判定された場合に、一致すると判定された外枠形状に応じた範囲内の黒画素の分布に基づいて除去する外枠の幅を決定し、決定された幅に基づいて、矩形画像内の文字の外枠を除去するための枠除去手段と、枠除去手段による処理後の画像の文字パターンについて文字認識を行なうための認識手段とを備える。 An information processing apparatus according to an aspect of the present invention is an information processing apparatus for recognizing a character surrounded by an outer frame from a character string image, and cuts out a rectangular image corresponding to one character from the character string image. Shape determination for determining whether or not the character shape in the rectangular image matches a predetermined type of outer frame shape based on the character cutout means and the distribution of black and white pixels in the cutout rectangular image The width of the outer frame to be removed is determined based on the distribution of black pixels within the range corresponding to the outer frame shape determined to match , And a recognition unit for recognizing the character pattern of the image processed by the frame removal unit. The frame removal unit removes the outer frame of the character in the rectangular image.

好ましくは、枠除去手段は、矩形画像の１つの辺に平行な第１の方向の複数の走査によって、矩形画像の端部側より連続する黒画素数を走査ごとに算出し、一致すると判定された外枠形状に応じて、複数の走査の全部または一部において算出された黒画素数の平均値を算出し、算出された平均値の黒画素数分だけ黒画素を白画素に反転することによって、外枠を除去する。 Preferably, the frame removing means, a plurality of scanning in a first direction parallel to one side of the rectangular image, the number of consecutive black pixels from the end portion side of the rectangular image is calculated for each scan, it is determined matching the Depending on the outer frame shape, the average value of the number of black pixels calculated in all or part of the plurality of scans is calculated, and the black pixels are inverted to white pixels by the calculated average number of black pixels. To remove the outer frame.

好ましくは、所定の種別の外枠形状は、丸形状を含み、枠除去手段は、形状判定手段により矩形画像内の文字形状が丸形状であると判断された場合、矩形画像の第１の方向または第１の方向と直交する第２の方向の走査によって、対向する２辺の端から連続する黒画素数を算出するための手段と、対向する２辺の端から、算出された平均値の黒画素数分だけ黒画素を白画素に反転することで、外枠を除去するための手段とを含む。 Preferably, the outer frame shape of the predetermined type includes a round shape, and the frame removing unit determines the first direction of the rectangular image when the shape determining unit determines that the character shape in the rectangular image is a round shape. Alternatively, by scanning in the second direction orthogonal to the first direction, the means for calculating the number of continuous black pixels from the ends of the two opposing sides and the average value calculated from the ends of the two opposing sides Means for removing the outer frame by inverting black pixels to white pixels by the number of black pixels.

好ましくは、所定の種別の外枠形状は、四角形状を含み、外側が四角枠である文字の内側の文字パターンを特定文字パターンとして予め記憶するための特定文字記憶手段と、形状判定手段により矩形画像内の文字形状が四角形状であると判断された場合に、枠除去手段による処理後の画像の文字パターンと特定文字パターンとを照合するためのマッチング手段とを備え、認識手段は、マッチング手段において特定文字パターンと一致すると判断された場合には、文字切り出し手段により切り出された、外枠の残っている元の文字パターンの文字認識を行なう。 Preferably, the outer frame shape of the predetermined type includes a quadrangle shape, and a specific character storage unit for storing in advance a character pattern inside a character whose outer side is a square frame as a specific character pattern, and a rectangular shape by the shape determination unit And a matching unit for matching the character pattern of the image after the processing by the frame removing unit with the specific character pattern when the character shape in the image is determined to be a square shape, If it is determined that the character pattern matches the specific character pattern, the character recognition of the original character pattern cut out by the character cutout means and having the outer frame remaining is performed.

好ましくは、認識手段は、マッチング手段において特定文字パターンと一致しないと判断された場合には、枠除去手段による処理後の文字パターンの文字認識を行なう。 Preferably, the recognizing unit performs character recognition of the character pattern after the processing by the frame removing unit when the matching unit determines that it does not match the specific character pattern.

好ましくは、記憶手段をさらに備え、記憶手段は、形状判定手段により一致すると判定された場合には、枠除去手段による処理後の画像と、外枠を除去する前の矩形画像との両方を保持する。 Preferably, the storage unit further includes a storage unit, and the storage unit holds both the image after the processing by the frame removal unit and the rectangular image before the outer frame is removed when it is determined that the shape determination unit matches. To do.

好ましくは、所定の種別の外枠形状は、四角形状を含み、外側が四角枠である文字パターンを特定文字パターンとして予め記憶するための特定文字記憶手段と、形状判定手段により矩形画像内の文字形状が四角形状であると判断された場合に、矩形画像の文字パターンと特定文字パターンとを照合するためのマッチング手段とを備え、枠除去手段は、マッチング手段において特定文字パターンと一致しないと判断された場合にのみ、矩形画像内の文字の外枠を除去する。 Preferably, the outer frame shape of the predetermined type includes a rectangular shape, and a character pattern in the rectangular image by the shape determination unit and a specific character storage unit for storing in advance a character pattern that is a rectangular frame as a specific character pattern. When it is determined that the shape is a quadrangular shape, a matching unit is provided for matching the character pattern of the rectangular image with the specific character pattern, and the frame removing unit determines that the matching unit does not match the specific character pattern. Only when it is done, the outer frame of the character in the rectangular image is removed.

好ましくは、枠除去手段は、形状判定手段により矩形画像内の文字形状が四角形状であると判断された場合、矩形画像の第１の方向および第１の方向と直交する第２の方向の走査によって、矩形画像の４辺の端から連続する黒画素数をそれぞれ算出するための手段と、４辺の端それぞれから、算出された平均値の黒画素数分だけ黒画素を白画素に反転することで、外枠を除去するための手段とを含む。 Preferably, the frame removing unit scans the first direction of the rectangular image and the second direction orthogonal to the first direction when the shape determining unit determines that the character shape in the rectangular image is a square shape. By means of the above, means for calculating the number of continuous black pixels from the ends of the four sides of the rectangular image, and from each of the ends of the four sides, the black pixels are inverted into white pixels by the calculated average number of black pixels. And means for removing the outer frame.

好ましくは、外枠に囲まれた文字は、枠内に１文字の漢字、英字、ひらがな、数字、カタカナのうちのいずれかを含む。 Preferably, the character surrounded by the outer frame includes any one of kanji, alphabetic characters, hiragana, numbers, and katakana within the frame.

この発明の他の局面に従う文字認識方法は、文字列画像より、外枠に囲まれた文字を認識するための方法であって、文字列画像から、１文字分に相当する矩形画像を切り出すステップと、切り出された矩形画像内の白黒画素の分布に基づいて、矩形画像内の文字形状が、所定の種別の外枠形状と一致するか否かを判定するステップと、矩形画像内の文字形状が所定の種別の外枠形状と一致すると判定された場合に、一致すると判定された外枠形状に応じた範囲内の黒画素の分布に基づいて除去する外枠の幅を決定し、決定された幅に基づいて、矩形画像内の文字の外枠を除去するステップと、外枠を除去後の画像の文字パターンについて文字認識を行なうステップとを備える。 A character recognition method according to another aspect of the present invention is a method for recognizing a character surrounded by an outer frame from a character string image, and a step of cutting out a rectangular image corresponding to one character from the character string image Determining whether the character shape in the rectangular image matches a predetermined type of outer frame shape based on the distribution of black and white pixels in the cut out rectangular image; and the character shape in the rectangular image Is determined to match the outer frame shape of the predetermined type, the width of the outer frame to be removed is determined based on the distribution of black pixels within the range corresponding to the outer frame shape determined to match, A step of removing the outer frame of the character in the rectangular image based on the width, and a step of performing character recognition on the character pattern of the image after the outer frame is removed.

この発明のさらに他の局面に従う文字認識プログラムは、文字列画像より、外枠に囲まれた文字を認識するためのプログラムであって、文字列画像から、１文字分に相当する矩形画像を切り出すステップと、切り出された矩形画像内の白黒画素の分布に基づいて、矩形画像内の文字形状が、所定の種別の外枠形状と一致するか否かを判定するステップと、矩形画像内の文字形状が所定の種別の外枠形状と一致すると判定された場合に、一致すると判定された外枠形状に応じた範囲内の黒画素の分布に基づいて除去する外枠の幅を決定し、決定された幅に基づいて、矩形画像内の文字の外枠を除去するステップと、外枠を除去後の画像の文字パターンについて文字認識を行なうステップとをコンピュータに実行させる。 A character recognition program according to still another aspect of the present invention is a program for recognizing a character surrounded by an outer frame from a character string image, and cuts out a rectangular image corresponding to one character from the character string image. Determining whether or not the character shape in the rectangular image matches a predetermined type of outer frame shape based on the distribution of black and white pixels in the cut out rectangular image; and the character in the rectangular image When it is determined that the shape matches the outer frame shape of a predetermined type, the width of the outer frame to be removed is determined based on the distribution of black pixels within the range corresponding to the outer frame shape determined to match , and determined. Based on the determined width, the computer is caused to execute a step of removing the outer frame of the character in the rectangular image and a step of performing character recognition on the character pattern of the image after the outer frame is removed.

この発明のさらに他の局面に従うコンピュータ読取り可能な記録媒体は、上記記載の文字認識プログラムを記録する。 A computer-readable recording medium according to still another aspect of the present invention records the above-described character recognition program.

本発明によると、外枠付きの合成文字の文字パターンを辞書に登録しておかなくても、精度良く外枠に囲まれた文字を認識することができる。 According to the present invention, it is possible to accurately recognize a character surrounded by an outer frame without registering a character pattern of a composite character with an outer frame in a dictionary.

本発明の実施の形態について図面を参照しながら詳細に説明する。なお、図中同一または相当部分には同一符号を付してその説明は繰返さない。 Embodiments of the present invention will be described in detail with reference to the drawings. In the drawings, the same or corresponding parts are denoted by the same reference numerals and description thereof will not be repeated.

［実施の形態１］
＜構成について＞
はじめに、本実施の形態における情報処理装置の構成について説明する。 [Embodiment 1]
<About configuration>
First, the configuration of the information processing apparatus in the present embodiment will be described.

図１は、本発明の実施の形態１における情報処理装置１の機能構成を表わす機能ブロック図である。 FIG. 1 is a functional block diagram showing a functional configuration of information processing apparatus 1 according to Embodiment 1 of the present invention.

図１を参照して、情報処理装置１は、入力装置２と出力装置１３とに接続されている。入力装置２は、雑誌などの原稿から画像を読み取り、情報処理装置１へ画像データを送る。出力装置１３は、情報処理装置１での画像の文字認識結果（文字列）を出力表示する。 With reference to FIG. 1, the information processing apparatus 1 is connected to an input device 2 and an output device 13. The input device 2 reads an image from a document such as a magazine and sends image data to the information processing device 1. The output device 13 outputs and displays the character recognition result (character string) of the image in the information processing device 1.

情報処理装置１は、入力装置２および出力装置１３との間でデータの入出力制御処理を実行するための入出力制御部１４と、情報処理装置１に含まれる各機能ブロック全体を制御するための制御部３と、入力装置２から送られてくる画像データを２値画像に変換するための２値化処理部５と、２値化された画像から行単位の画像（以下「文字列行画像」という）を切り出すための文字列行切り出し部６と、切り出された文字列行画像から１文字分に相当する矩形画像を切り出すための文字切り出し部７と、切り出された矩形画像内の文字形状が所定の種別の外枠形状と一致するか否かを判定するための文字形状判定部８と、一致すると判定された場合に、黒画素の分布に基づいて矩形画像内の文字の外枠を除去するための枠除去部９と、外枠が除去された文字を予め１文字辞書１１に登録された特定文字パターンと照合するためのマッチング部１０と、文字認識辞書４を用いて文字認識を行うための文字認識処理部１２とを有する。また、情報処理装置１は、情報を保持するためにメモリ１５を備えている。 The information processing device 1 controls the entire input / output control unit 14 for executing data input / output control processing between the input device 2 and the output device 13 and each functional block included in the information processing device 1. The control unit 3, the binarization processing unit 5 for converting the image data sent from the input device 2 into a binary image, and the binarized image from the line-by-line image (hereinafter “character string line”). A character string row cutout unit 6 for cutting out an image), a character cutout unit 7 for cutting out a rectangular image corresponding to one character from the cut out character string row image, and characters in the cut out rectangular image A character shape determination unit 8 for determining whether or not the shape matches the outer frame shape of a predetermined type, and an outer frame of the character in the rectangular image based on the distribution of black pixels when determined to match Frame removing unit 9 for removing the outer frame and the outer frame It includes a matching unit 10 for matching the removed character in advance 1 character dictionary 11 specific character pattern registered in the, and a character recognition processing section 12 for performing character recognition using the character recognition dictionary 4. In addition, the information processing apparatus 1 includes a memory 15 for holding information.

文字認識辞書４は、従来より存在する辞書データベースであり、複数の文字パターンを記憶する。本実施の形態において、文字認識辞書４に記憶される文字は、漢字、英字、ひらがな、数字、カタカナのうちの少なくともいずれか１つを含む。なお、認識対象とする文字はこのような文字種に限定されず、たとえばハングル文字などであってもよい。 The character recognition dictionary 4 is a dictionary database that exists conventionally, and stores a plurality of character patterns. In the present embodiment, the characters stored in the character recognition dictionary 4 include at least one of kanji, English characters, hiragana, numbers, and katakana. The character to be recognized is not limited to such a character type, and may be, for example, a Hangul character.

１文字辞書１１は、外側が四角枠である文字の内側の文字パターンを、特定文字パターンとして予め記憶する。「外側が四角枠である文字」とは、たとえば文字種が漢字であれば、たとえば「くにがまえ」を部首とする漢字すなわち、「国」、「固」などを表わす。このような例の場合、特定文字パターンは、四角枠（くにがまえ）の内側の文字「玉」、「古」のパターンを表わす。なお、ここでは、四角枠の内側の「玉」、「古」も文字として成り立つ例を示したが、四角枠の内側が意味をなさない記号（線や点の組合わせ）であってもよいものとする。 The one-character dictionary 11 stores in advance a character pattern inside a character whose outer side is a square frame as a specific character pattern. For example, if the character type is a Kanji character, the “character having a square frame on the outside” represents, for example, a Kanji character with “Kunigae Mae” as a radical, that is, “Country”, “Fix”, and the like. In such an example, the specific character pattern represents a pattern of characters “ball” and “old” inside a square frame (Kunigae). In this example, “ball” and “old” inside the square frame are also shown as characters, but the inside of the square frame may be a symbol (a combination of lines and points) that does not make sense. Shall.

枠除去部９は、矩形画像の端部側より連続する黒画素数を算出し、算出された黒画素数分だけ黒画素を白画素に反転することによって、外枠を除去する。外枠が四角形状である場合には、矩形画像の垂直方向および水平方向の走査によって、矩形画像の４辺の端から連続する黒画素数をそれぞれ算出する。そして、４辺の端それぞれから、算出された黒画素数分だけ黒画素を白画素に反転することで、外枠を除去する。四角枠の除去についての具体例は後述する。 The frame removal unit 9 calculates the number of continuous black pixels from the edge side of the rectangular image, and removes the outer frame by inverting the black pixels to white pixels by the calculated number of black pixels. When the outer frame is rectangular, the number of black pixels continuous from the ends of the four sides of the rectangular image is calculated by scanning the rectangular image in the vertical and horizontal directions. Then, the outer frame is removed by inverting black pixels to white pixels by the calculated number of black pixels from each end of the four sides. A specific example of the removal of the square frame will be described later.

図２は、本発明の実施の形態１における情報処理装置１のハードウェアブロック図である。 FIG. 2 is a hardware block diagram of the information processing apparatus 1 according to Embodiment 1 of the present invention.

図２を参照して、情報処理装置１としては、代表的にＰＣ（Personal Computer）が採用されてよい。情報処理装置１は、各種演算処理を実行するためのＣＰＵ（Central Processing Unit）２１０と、ＣＰＵ２１０で実行されるプログラムや各種の情報を不揮発的に記憶するためのハードディスク部（ＨＤＤ：Hard Disk Drive）２１２と、各種データを一時的に記憶するためのメモリ部２１４と、ユーザからの指示の入力を受付けるための操作部２１１と、光学媒体たとえばＣＤ−ＲＯＭ（Compact Disk-Read Only Memory）２１５ａに記憶されたデータやプログラムを読み出すためのＣＤ−ＲＯＭドライブ２１５と、外部機器との間でデータの授受を行なうための通信インターフェイス部２１７と、画像読取装置２２１および表示装置２２２との間でデータの送受信を行なうための入出力Ｉ／Ｆ（インターフェイス）２１６とを含む。これらの各部は、内部バス２１８を介して相互にデータを授受する。 Referring to FIG. 2, as information processing apparatus 1, a PC (Personal Computer) may be typically employed. The information processing apparatus 1 includes a CPU (Central Processing Unit) 210 for executing various arithmetic processes, and a hard disk unit (HDD: Hard Disk Drive) for storing programs executed by the CPU 210 and various information in a nonvolatile manner. 212, a memory unit 214 for temporarily storing various data, an operation unit 211 for receiving input of instructions from the user, and an optical medium such as a CD-ROM (Compact Disk-Read Only Memory) 215a Data transmission / reception between the image reading device 221 and the display device 222, a CD-ROM drive 215 for reading out the read data and programs, a communication interface unit 217 for exchanging data with external devices, Input / output I / F (interface) 216. These units exchange data with each other via the internal bus 218.

画像読取装置２２１は、図１に示した入力装置２に対応し、たとえばスキャナ、あるいは、カメラで構成されてよい。表示装置２２２は、図１に示した出力装置１３に対応し、たとえば液晶等で構成されてよい。 The image reading device 221 corresponds to the input device 2 shown in FIG. 1 and may be configured by a scanner or a camera, for example. The display device 222 corresponds to the output device 13 shown in FIG. 1 and may be composed of, for example, a liquid crystal.

図１に示した入出力制御部１４は、入出力Ｉ／Ｆ２１６に対応する。
図１に示した制御部３、２値化処理部５、文字列行切り出し部６、文字切り出し部７、文字形状判定部８、枠除去部９、マッチング部１０、文字認識処理部１２は、ＣＰＵ２１０がソフトウェアを実行することで実現される機能である。なお、これらのうちの少なくとも１つについては、ハードウェアで実現されてもよい。 The input / output control unit 14 illustrated in FIG. 1 corresponds to the input / output I / F 216.
The control unit 3, the binarization processing unit 5, the character string row cutout unit 6, the character cutout unit 7, the character shape determination unit 8, the frame removal unit 9, the matching unit 10, and the character recognition processing unit 12 illustrated in FIG. This is a function realized by the CPU 210 executing software. Note that at least one of these may be realized by hardware.

図１に示したメモリ１５は、たとえば、ＨＤＤ２１２およびメモリ部２１４に対応する。図１に示した文字認識辞書４および１文字辞書１１は、ＨＤＤ２１２に対応する。なお、メモリ１５、文字認識辞書４および１文字辞書１１は、外付けの取り外し可能なメモリ（たとえばＣＤ−ＲＯＭ２１５ａ）に対応していてもよい。 The memory 15 illustrated in FIG. 1 corresponds to the HDD 212 and the memory unit 214, for example. The character recognition dictionary 4 and the one-character dictionary 11 shown in FIG. Note that the memory 15, the character recognition dictionary 4, and the one-character dictionary 11 may correspond to an external removable memory (for example, CD-ROM 215a).

＜動作について＞
次に、本発明の実施の形態１に係る情報処理装置１の動作について具体的に説明する。本実施の形態においては、情報処理装置１は、四角付き文字（四角枠で囲まれた文字）の認識処理のみを行なうこととして説明する。 <About operation>
Next, the operation of the information processing apparatus 1 according to Embodiment 1 of the present invention will be specifically described. In the present embodiment, the information processing apparatus 1 will be described as performing only recognition processing of a character with a square (a character surrounded by a square frame).

図３は、本発明の実施の形態１に係る情報処理装置１が実行する文字認識処理を示すフローチャートである。図３のフローチャートに示す処理は、予めプログラムとしてＨＤＤ２１２に格納されており、ＣＰＵ２１０がこのプログラムを読み出して実行することにより、文字認識処理の機能が実現される。 FIG. 3 is a flowchart showing character recognition processing executed by the information processing apparatus 1 according to Embodiment 1 of the present invention. The processing shown in the flowchart of FIG. 3 is stored in advance in the HDD 212 as a program, and the function of character recognition processing is realized by the CPU 210 reading and executing this program.

図３を参照して、２値化処理部５は、文字列が記載された原稿の画像（以下「文字列画像」という）を２値化処理する（ステップＳ２０１）。具体的には、入力装置２で読み取られた画像のデータを、０，１（白，黒）の２値で表す２値画像に変換する。 Referring to FIG. 3, binarization processing unit 5 binarizes an image of a document on which a character string is described (hereinafter referred to as “character string image”) (step S201). Specifically, the image data read by the input device 2 is converted into a binary image represented by binary values of 0 and 1 (white and black).

２値化処理は既知の方法が多数存在する。本実施の形態においては、画像データの各画素を０，１に割り付けるための閾値としては、画像の全画素値の平均値が用いられる。なお、２値化を行う方法は、これ以外のどのような方法であっても構わない。 There are many known methods for binarization. In the present embodiment, an average value of all pixel values of an image is used as a threshold value for assigning each pixel of image data to 0 and 1. Note that any other method may be used as the binarization method.

図４のような文字列画像（入力原稿）３０１が入力装置２より入力されたとする。その場合、当該文字列画像３０１が２値化されると、図５に示すような画像３０２が得られる。 Assume that a character string image (input original) 301 as shown in FIG. In that case, when the character string image 301 is binarized, an image 302 as shown in FIG. 5 is obtained.

なお、本実施の形態では、文字列画像が、入力装置２が読取った画像であるものとして説明するが、限定的ではなく、外部から送信されてきた画像であってもよい。 In the present embodiment, the character string image is described as an image read by the input device 2. However, the character string image is not limited and may be an image transmitted from the outside.

次に、文字列行切り出し部６は、２値化された画像３０２を各行ごとに切り出す（ステップＳ２０２）。つまり、文字列行画像を切り出す。文字列行を切り出す処理も周知の方法が多数ある。本実施の形態では、たとえば、画像３０２内の空白部分（白画素）及び文字部分（黒画素）の分布から、文字列行を切り出す方法を用いる。なお、これ以外のどのような方法を用いても構わない。 Next, the character string line cutout unit 6 cuts out the binarized image 302 for each line (step S202). That is, the character string line image is cut out. There are many known methods for cutting out character string lines. In the present embodiment, for example, a method of cutting out character string rows from the distribution of blank portions (white pixels) and character portions (black pixels) in the image 302 is used. Any other method may be used.

図５に示した２値化後の画像３０２より文字列行が切り出された例を図６に示す。図６を参照して、２値化後の画像３０２より、４つの文字列行（画像）Ｐ７０１〜Ｐ７０４が切出されている。 FIG. 6 shows an example in which a character string row is cut out from the binarized image 302 shown in FIG. Referring to FIG. 6, four character string rows (images) P <b> 701 to P <b> 704 are cut out from the binarized image 302.

次に、文字切り出し部７は、切り出された文字列行のうちの１つを選択し、当該文字列行から１文字ずつ矩形に切り出す（ステップＳ２０３）。つまり、文字列行画像から、１文字分に相当する矩形画像を「文字矩形画像」として切り出す。文字矩形画像の切り出し方法についても、公知の技術で実現されてよい。本実施の形態では、文字矩形画像を切り出す場合、各文字画像の黒画素の上下左右の一番外側の黒画素を最大座標とするように切り出すことを条件とする。 Next, the character cutout unit 7 selects one of the cut out character string rows, and cuts out the characters one by one from the character string row (step S203). That is, a rectangular image corresponding to one character is cut out from the character string row image as a “character rectangular image”. A method for cutting out a character rectangular image may also be realized by a known technique. In the present embodiment, when a character rectangular image is cut out, it is a condition that the outermost black pixel on the upper, lower, left, and right sides of the black pixel of each character image is cut out at the maximum coordinate.

なお、四角枠で囲まれた合成文字は、１文字として認識されるものとする。そのため、２文字が四角枠で囲まれた合成文字であっても、１文字分に相当する矩形画像として切り出される。 It should be noted that a composite character surrounded by a square frame is recognized as one character. Therefore, even if it is a composite character in which two characters are surrounded by a square frame, it is cut out as a rectangular image corresponding to one character.

上記例においては、たとえば、はじめは、文字列行Ｐ７０１〜Ｐ７０４のうち１行目の文字列行Ｐ７０１を選択し、当該文字列行Ｐ７０１から文字矩形画像を切り出す処理を行なう。 In the above example, for example, first, the character string row P701 is selected from the character string rows P701 to P704, and a process of cutting out a character rectangular image from the character string row P701 is performed.

以降の処理は、図６に示した文字列行Ｐ７０３を例にとって説明する。
図７は、図６に示した文字列行Ｐ７０３から切り出された文字矩形画像を示す図である。 The subsequent processing will be described using the character string row P703 shown in FIG. 6 as an example.
FIG. 7 is a diagram showing a character rectangular image cut out from the character string row P703 shown in FIG.

次に、文字形状判定部８は、切り出された１つの文字矩形画像を選択し、当該画像の形状が、四角であるかどうかを判定する（ステップＳ２０４）。すなわち、文字矩形画像内の白黒画素の分布から、文字形状が四角であるかどうかを判定する（以下、このような判定を「四角判定」と略す）。四角判定についても既知の方法で行えばよいが、本実施の形態においては、たとえば次のような方法を用いる。 Next, the character shape determination unit 8 selects one extracted character rectangular image, and determines whether or not the shape of the image is a square (step S204). That is, it is determined whether or not the character shape is a square from the distribution of black and white pixels in the character rectangular image (hereinafter, such determination is abbreviated as “square determination”). Although the square determination may be performed by a known method, in the present embodiment, for example, the following method is used.

四角判定については、図７に示した文字列行Ｐ７０３のうちの文字矩形画像Ｐ８０１を例に説明する。文字矩形画像Ｐ８０１は、「場」という漢字に四角枠付きの文字（つまり、「場」と四角枠との合成文字）の画像を表わしている。 The square determination will be described using the character rectangular image P801 in the character string row P703 shown in FIG. 7 as an example. The character rectangular image P801 represents an image of a character with a square frame (that is, a combined character of “field” and the square frame) in the Chinese character “place”.

図８は、四角判定処理の流れを示すフローチャートである。図９は、四角判定方法を説明するための図である。 FIG. 8 is a flowchart showing the flow of the square determination process. FIG. 9 is a diagram for explaining a square determination method.

図９に示されるように、本実施の形態では、文字矩形画像Ｐ８０１の左上隅の角を原点と考え、横方向（水平方向）をＸ方向、縦方向（垂直方向）をＹ方向とする。 As shown in FIG. 9, in the present embodiment, the upper left corner of the character rectangular image P801 is considered as the origin, the horizontal direction (horizontal direction) is the X direction, and the vertical direction (vertical direction) is the Y direction.

図８を参照して、はじめに、文字矩形画像Ｐ８０１の原点からＸ方向に走査を行い、１ラインごとに黒画素の数をカウントする（ステップＳ３０１）。図９（１）に示すように、Ｘ方向への走査で得られる黒画素のカウント数は、Ｙ１からＹｂ（ｂは任意の整数）に格納されるようにする。つまり、ラインＹ１からラインＹｂまでの各ラインについて、Ｘ方向への走査で得られた黒画素数をメモリ１５に一時記憶する。 Referring to FIG. 8, first, scanning is performed in the X direction from the origin of character rectangular image P801, and the number of black pixels is counted for each line (step S301). As shown in FIG. 9 (1), the count number of black pixels obtained by scanning in the X direction is stored from Y1 to Yb (b is an arbitrary integer). That is, the number of black pixels obtained by scanning in the X direction is temporarily stored in the memory 15 for each line from the line Y1 to the line Yb.

さらに、Ｙ方向にも同様の処理を行なう（ステップＳ３０２）。つまり、文字矩形画像Ｐ８０１の原点からＹ方向に走査を行い、１ラインごとに黒画素の数をカウントする。図９（１）に示すように、Ｙ方向への走査で得られる黒画素のカウント数は、Ｘ１からＸｂ（ｂは任意の整数）に格納されるようにする。この場合、ラインＸ１からラインＸｂまでの各ラインについて、Ｙ方向への走査で得られた黒画素数をメモリ１５に一時記憶する。 Further, similar processing is performed in the Y direction (step S302). That is, scanning is performed in the Y direction from the origin of the character rectangular image P801, and the number of black pixels is counted for each line. As shown in FIG. 9 (1), the count number of black pixels obtained by scanning in the Y direction is stored from X1 to Xb (b is an arbitrary integer). In this case, the number of black pixels obtained by scanning in the Y direction is temporarily stored in the memory 15 for each line from the line X1 to the line Xb.

ここで、図９（１）ではＸ方向、Ｙ方向への走査は画像の端から端まで行っているが、四角付き文字の場合、外接矩形は黒画素で埋まるため、ある程度の幅だけ走査すれば、四角の形状かどうかは判断できる。つまり、図９（２）に示すように、Ｘ方向への走査は、文字矩形画像Ｐ８０１の原点よりＹ方向にたとえば８分の１までだけ行うこととしてもよい。つまり、図９（２）のラインＹ１〜ラインＹｂまでのみＸ方向の走査を行なってもよい。この場合、文字矩形画像Ｐ８０１の左下端からも同様に、Ｙ方向にたとえば８分の１まで行ってもよい（ラインＹ’１〜ラインＹ’ｂまでのみＸ方向の走査を行なう）。 Here, in FIG. 9 (1), scanning in the X and Y directions is performed from the end to the end of the image. However, in the case of a square character, the circumscribed rectangle is filled with black pixels, so that scanning is performed by a certain width. For example, it can be determined whether the shape is a square. That is, as shown in FIG. 9B, scanning in the X direction may be performed only up to, for example, 1/8 in the Y direction from the origin of the character rectangular image P801. That is, scanning in the X direction may be performed only for the lines Y1 to Yb in FIG. In this case, similarly, from the lower left end of the character rectangular image P801, for example, up to 1/8 may be performed in the Y direction (scanning in the X direction is performed only from the line Y′1 to the line Y′b).

また、Ｙ方向の走査も同様に行なう。つまり、文字矩形画像Ｐ８０１の原点よりＸ方向のたとえば８分の１までだけ行う（ラインＸ１〜ラインＸｂまでのみＹ方向の走査を行なう）。文字矩形画像Ｐ８０１の右上端からも同様に、Ｘ方向にたとえば８分の１までだけ行う（ラインＸ’１〜ラインＸ’ｂまでのみＹ方向の走査を行なう）。なお、ここでは、走査対象とすべく予め定められた範囲を８分の１としたが、限定的ではなく、分母の数は任意で最適な数を使えば良い。 Further, the scanning in the Y direction is performed in the same manner. That is, it is performed only up to, for example, 1/8 in the X direction from the origin of the character rectangular image P801 (only the lines X1 to Xb are scanned in the Y direction). Similarly, from the upper right end of the character rectangular image P801, for example, only 1/8 is performed in the X direction (only the lines X'1 to X'b are scanned in the Y direction). Here, the predetermined range to be scanned is set to 1/8. However, the range is not limited, and the number of denominators may be arbitrarily selected and optimal.

以下の説明においては、図９（２）のように一部の範囲のみ走査したと仮定する。
文字形状判定部８は、このような走査処理によって得られる黒画素の分布から、文字矩形画像Ｐ８０１が四角形状であるかを判定する（ステップＳ２０５）。四角形状であると判断された場合（ステップＳ２０５においてＹＥＳ）、ステップＳ２０６に進む。一方、四角形状でないと判断された場合には（ステップＳ２０５においてＮＯ）、元の文字パターンを文字認識処理にかけて（ステップＳ２１０）、ステップＳ２１１に進む。文字認識処理は、既知の方法で実現可能である。文字認識処理は、ほぼ確立された技術であり、様々な文字認識方法が提案されている。ここでは、ＯＣＲ（Optical Character Recognition）を用いるが、これ以外のどのような方法を用いても構わない。 In the following description, it is assumed that only a part of the range is scanned as shown in FIG.
The character shape determination unit 8 determines whether or not the character rectangular image P801 has a quadrangular shape from the black pixel distribution obtained by such scanning processing (step S205). If it is determined that the shape is a rectangle (YES in step S205), the process proceeds to step S206. On the other hand, if it is determined that the shape is not a quadrangular shape (NO in step S205), the original character pattern is subjected to character recognition processing (step S210), and the process proceeds to step S211. The character recognition process can be realized by a known method. The character recognition process is an almost established technique, and various character recognition methods have been proposed. Here, OCR (Optical Character Recognition) is used, but any other method may be used.

文字矩形画像Ｐ８０１が四角形状であれば、図９（２）におけるＹ１からＹｂとＹ’１からＹ’ｂ中の黒画素において、文字矩形画像Ｐ８０１の横幅画素数に近い画素数（カウント数）が格納されているものが存在するはずである。また、図９（２）におけるＸ１からＸｂとＸ’１からＸ’ｂの中の黒画素は文字矩形画像の縦幅画素数に近い画素数（カウント数）が格納されているものが存在するはずである。 If the character rectangular image P801 is rectangular, the number of pixels (count number) close to the number of horizontal pixels of the character rectangular image P801 in black pixels in Y1 to Yb and Y′1 to Y′b in FIG. There should be something stored. Further, there are black pixels in X1 to Xb and X′1 to X′b in FIG. 9B in which the number of pixels (count number) close to the number of vertical pixels of the character rectangular image is stored. It should be.

仮に「Ｙ１、Ｙ’１、Ｘ１、Ｘ’１」に、それぞれの幅画素数に近い画素数が格納されていたとする。これらの４つの黒画素カウント数が一定値を満たせば、この文字矩形画像Ｐ８０１内の文字パターンは四角の形状であると判定することができる。この一定値としては、Ｘ方向の値であれば、例えば文字矩形画像Ｐ８０１の横画素数（Ｘ方向の全画素数）のたとえば８０％、Ｙ方向の値であれば、文字矩形画像Ｐ８０１の縦画素数（Ｙ方向の全画素数）のたとえば８０％などにすればよい。この一定値は、情報処理装置１の構成によって可変の値としてもよい。 Assume that “Y1, Y′1, X1, X′1” stores the number of pixels close to the number of pixels of each width. If these four black pixel count numbers satisfy a certain value, it can be determined that the character pattern in the character rectangular image P801 has a square shape. As this fixed value, if the value is in the X direction, for example, 80% of the number of horizontal pixels (total number of pixels in the X direction) of the character rectangular image P801, and if the value is in the Y direction, the vertical value of the character rectangular image P801. For example, 80% of the number of pixels (the total number of pixels in the Y direction) may be used. This constant value may be a variable value depending on the configuration of the information processing apparatus 1.

ただし、図９（２）に示すＹ１とＹｂの中間あたりのラインの黒画素が、横幅画素数に近かった場合は、四角と判定しないようにすることが好ましい。Ｙ’１とＹ’ｂ、Ｘ１とＸｂ、Ｘ’１とＸ’ｂについても同様で、４組のうちどれかひとつでも上記の条件に当てはまれば、四角とは判定しないこととする。 However, when the black pixels in the middle line between Y1 and Yb shown in FIG. 9 (2) are close to the number of horizontal pixels, it is preferable not to determine that they are squares. The same applies to Y′1 and Y′b, X1 and Xb, and X′1 and X′b. If any one of the four sets satisfies the above condition, the square is not determined.

ここでは、文字矩形画像Ｐ８０１が四角形状であると判定されたと仮定して、ステップＳ２０６以降の処理の説明を行う。 Here, assuming that it is determined that the character rectangular image P801 has a quadrangular shape, the processing after step S206 will be described.

ステップＳ２０６において、枠除去部９は、文字矩形画像Ｐ８０１内の四角枠を除去する処理（以下「四角枠除去処理」ともいう）を実行する。 In step S <b> 206, the frame removal unit 9 executes a process of removing a square frame in the character rectangular image P <b> 801 (hereinafter also referred to as “square frame removal process”).

図１０は、四角枠除去処理を示すフローチャートである。図１１は、四角枠除去処理を説明するための図である。図１１（１）に示すように、文字矩形画像Ｐ８０１の左上端を原点とし、横方向をＸ方向、縦方向をＹ方向とする。 FIG. 10 is a flowchart showing the square frame removal process. FIG. 11 is a diagram for explaining the square frame removal process. As shown in FIG. 11A, the upper left corner of the character rectangular image P801 is the origin, the horizontal direction is the X direction, and the vertical direction is the Y direction.

まず、枠除去部９は、原点からＸ方向に走査を開始して、初めに出現する黒画素から連続して出現する黒画素の数をカウントする（ステップＳ４０１）。白画素が出現すれば、そのラインの走査は終了する。このとき得られた画素数は、Ｙ１に格納する。同様に、文字矩形画像Ｐ８０１の左下端まで１ラインごとに走査を行う。今、任意のライン数ｂまで画像Ｐ８０１の縦画素数が存在すると仮定すると、Ｙ１からＹｂまでの黒画素カウント数の格納場所が必要となる。 First, the frame removal unit 9 starts scanning in the X direction from the origin, and counts the number of black pixels that appear continuously from the black pixels that appear first (step S401). If a white pixel appears, the scanning of that line ends. The number of pixels obtained at this time is stored in Y1. Similarly, scanning is performed line by line up to the lower left corner of the character rectangular image P801. Assuming that the number of vertical pixels of the image P801 exists up to an arbitrary number b of lines, a storage location for black pixel count numbers from Y1 to Yb is required.

左辺からの走査において、白画素が出現した時点で黒画素数のカウントが終了するため、右辺からの走査（−Ｘ方向）も必要になる。したがって、Ｙ’１からＹ’ｂの黒画素カウント数の格納場所も必要である。つまり、ステップＳ４０１において、さらに、文字矩形画像Ｐ８０１の右上端からＸ方向（−Ｘ方向）に向けて走査を行い、初めに出現する黒画素から連続して出現する黒画素数をカウントする。白画素が出現すれば、そのラインの走査は終了する。この時得られた画素数は、Ｙ’１に格納する。同様に、文字矩形画像の右下端まで１ラインごとに走査を行う。 In scanning from the left side, counting of the number of black pixels ends when a white pixel appears, and thus scanning from the right side (−X direction) is also necessary. Therefore, a storage location for the black pixel count number from Y′1 to Y′b is also required. That is, in step S401, scanning is further performed from the upper right end of the character rectangular image P801 in the X direction (−X direction), and the number of black pixels that appear continuously from the first appearing black pixels is counted. If a white pixel appears, the scanning of that line ends. The number of pixels obtained at this time is stored in Y′1. Similarly, scanning is performed line by line up to the lower right corner of the character rectangular image.

続いて、原点からＹ方向への走査（上辺からの走査）を行い、はじめに出現する黒画素から連続して出現する黒画素の数をカウントする（ステップＳ４０２）。白画素が出現すれば、そのラインの走査は終了する。このとき得られた画素数は、Ｘ１に格納する。同様に、文字矩形画像Ｐ８０１の右上端まで１ラインごとに走査を行う。 Subsequently, scanning in the Y direction from the origin (scanning from the upper side) is performed, and the number of black pixels appearing continuously from the first appearing black pixels is counted (step S402). If a white pixel appears, the scanning of that line ends. The number of pixels obtained at this time is stored in X1. Similarly, scanning is performed for each line up to the upper right end of the character rectangular image P801.

さらに、ステップＳ４０２において、文字矩形画像Ｐ８０１の左下端からＹ方向（−Ｙ方向）に向けて走査を行い、初めに出現する黒画素から連続して出現する黒画素数をカウントする。白画素が出現すれば、そのラインの走査は終了する。左下端からの走査は、逆端の右下端からの走査でもかまわない。 Further, in step S402, scanning is performed in the Y direction (−Y direction) from the lower left end of the character rectangular image P801, and the number of black pixels continuously appearing from the first appearing black pixels is counted. If a white pixel appears, the scanning of that line ends. The scanning from the lower left corner may be a scanning from the lower right corner at the opposite end.

外枠除去の方法についての説明を容易化するために、図１１（２）に示した１２×１２マスの画像を例に説明する。図１１（２）の１マスは１画素を表しており、ハッチングで示したマスは、黒画素であることを意味する。また、任意のライン数ｂはこの例では１２となる。図１１（２）には、四角枠を構成する黒画素の分布が示されている。 In order to facilitate the description of the outer frame removal method, the 12 × 12 square image shown in FIG. 11B will be described as an example. One square in FIG. 11B represents one pixel, and the hatched square means a black pixel. In addition, the arbitrary number b of lines is 12 in this example. FIG. 11 (2) shows the distribution of black pixels constituting the square frame.

上記で説明した通りに、Ｘ１からＸｂ、Ｘ’１からＸ’ｂ、Ｙ１からＹｂ、Ｙ’１からＹ’ｂの黒画素カウント数をそれぞれ求めると、図１２（１）〜（４）に示した表になる。ここで、図１２における“ｔ”は、Ｘ及びＹのインデックスであり、Ｘ軸Ｙ軸の何番目の走査ラインかを表す番号である。 As described above, when the black pixel count numbers from X1 to Xb, X′1 to X′b, Y1 to Yb, and Y′1 to Y′b are obtained, the results shown in FIGS. 12 (1) to 12 (4) are obtained. It becomes the table shown. Here, “t” in FIG. 12 is an index of X and Y, and is a number representing the number of scanning lines on the X axis and the Y axis.

次に、枠除去部９は、ステップＳ４０１およびＳ４０２での黒画素カウント数から、外枠を除去するための平均画素数を算出する（ステップＳ４０３）。本実施の形態において、平均画素数は、各辺（４辺）からの走査結果において変化点を検出することで算出される。 Next, the frame removal unit 9 calculates the average number of pixels for removing the outer frame from the black pixel count numbers in steps S401 and S402 (step S403). In the present embodiment, the average number of pixels is calculated by detecting a change point in the scanning result from each side (four sides).

図１２（１）〜（４）を見てわかるように、文字矩形画像が四角の形状をしていると、画像の最も端（ｔ＝１及び１２）から中央（ｔ＝６及び７）に向かっていくにつれて、連続する黒画素のカウント数は減少していく。また、ｔ＝１とｔ＝２及びｔ＝１２とｔ＝１１、もしくは、ｔ＝２とｔ＝３及びｔ＝１１とｔ＝１０の黒画素のカウント数の差が大きいことがわかる。このような黒画素カウント数の差の大きいライン間は、四角の外枠（黒画素）から背景領域（白画素）への変化点と考えることができる。 As can be seen from FIGS. 12 (1) to 12 (4), when the character rectangular image has a square shape, from the end (t = 1 and 12) to the center (t = 6 and 7) of the image. As it goes, the count number of continuous black pixels decreases. In addition, it can be seen that the difference in the black pixel counts of t = 1 and t = 2 and t = 12 and t = 11, or t = 2 and t = 3, and t = 11 and t = 10 is large. Such a line having a large difference in the black pixel count number can be considered as a change point from the square outer frame (black pixel) to the background area (white pixel).

平均画素数の算出について、図１２（１）の例で説明する。ｔ＝１から黒画素のカウント数を順に見ていく。ｔ＝２の時の黒画素のカウント数とｔ＝１の時の黒画素のカウント数との差は１１である。ｔ＝２とｔ＝３の黒画素のカウント数の差は０である。黒画素のカウント数の差が大きいｔ＝１とｔ＝２の間が、四角の外枠（黒画素）から背景領域（白画素）への変化点であると推定される。また、ｔ＝１２からも黒画素のカウント数の差を計算していく。すると、ｔ＝１２とｔ＝１１の差が１０であり、次のｔ＝１１とｔ＝１０の差が０であるので、ｔ＝１２とｔ＝１１の差が大きい。したがって、ｔ＝１２とｔ＝１１の間が、四角の外枠から背景領域への変化点であると推定される。 The calculation of the average number of pixels will be described with reference to FIG. The number of black pixel counts will be examined in order from t = 1. The difference between the black pixel count when t = 2 and the black pixel count when t = 1 is 11. The difference between the count numbers of black pixels at t = 2 and t = 3 is zero. Between t = 1 and t = 2 where the difference in the number of counts of black pixels is large, it is estimated that the change point from the outer frame of the square (black pixels) to the background area (white pixels). Also, the difference in the count number of black pixels is calculated from t = 12. Then, since the difference between t = 12 and t = 11 is 10, and the next difference between t = 11 and t = 10 is 0, the difference between t = 12 and t = 11 is large. Therefore, it is estimated that the point between t = 12 and t = 11 is a change point from the rectangular outer frame to the background region.

図１２（１）で得られた黒画素のカウント数のうち、変化点であると推定（判定）された部分にはさまれた黒画素カウント数の平均値を算出する。そして、この値が、外枠の除去画素数とする。ここでは、ｔ＝２からｔ＝１１の黒画素カウント数から算出することになる。 Of the black pixel counts obtained in FIG. 12A, the average value of the black pixel counts sandwiched between the portions estimated (determined) as the change points is calculated. This value is the number of pixels removed from the outer frame. Here, it is calculated from the black pixel count from t = 2 to t = 11.

図１３（１）〜（４）に、図１２（１）〜（４）それぞれの黒画素カウント数の平均値の計算結果を示している。図１３（１）〜（４）に示されるように、４辺からの走査のいずれにおいても、黒画素カウント数の平均値は２となる。 FIGS. 13 (1) to (4) show the calculation results of the average values of the black pixel count numbers of FIGS. 12 (1) to (4). As shown in FIGS. 13 (1) to 13 (4), the average value of the black pixel count number is 2 in any of the scans from the four sides.

枠除去部９は、黒画素数の平均値が算出されると、平均値画素数分の黒画素を端から順に白画素に変換する（ステップＳ４０４）。つまり、４辺の端それぞれから、算出された黒画素数分だけ黒画素を白画素に反転することで、四角枠を除去する。 When the average value of the number of black pixels is calculated, the frame removing unit 9 converts the black pixels for the average number of pixels into white pixels in order from the end (step S404). That is, the square frame is removed by inverting black pixels to white pixels by the calculated number of black pixels from each end of the four sides.

図１４には、図１３の計算結果をもとに、四角枠を除去する（四角枠を構成する黒画素を白画素に変換する）様子を示している。図１４（１）には、図１１（２）に示した四角枠の拡大図が示されている。 FIG. 14 shows a state where the square frame is removed (the black pixels constituting the square frame are converted into white pixels) based on the calculation result of FIG. FIG. 14 (1) shows an enlarged view of the square frame shown in FIG. 11 (2).

図１４（２）を参照して、図１３（１）の計算結果から、左上端からＸ方向に走査しながら、Ｙ方向に向かって２画素分の黒画素を白画素に反転する。この時、黒画素が１画素と白画素が１画素という構成であった場合は、１画素のみを白画素に反転する。同様の処理を、図１３（２）〜（４）の計算結果をもとに、順に行なった結果が、それぞれ図１４（３）〜（５）に示される。このような処理を行うと、図１４（５）に示されるように、四角枠を構成していた黒画素が全て除去される。ただし、白画素への変換の順序は、図１４に示したような順に限らない。 Referring to FIG. 14 (2), from the calculation result of FIG. 13 (1), the black pixels for two pixels are inverted into white pixels in the Y direction while scanning in the X direction from the upper left corner. At this time, if the black pixel has one pixel and the white pixel has one pixel, only one pixel is inverted to a white pixel. The results of performing the same processing in order based on the calculation results of FIGS. 13 (2) to 13 (4) are shown in FIGS. 14 (3) to 14 (5), respectively. When such a process is performed, as shown in FIG. 14 (5), all the black pixels constituting the square frame are removed. However, the order of conversion into white pixels is not limited to the order shown in FIG.

なお、文字によっては四角枠に接触する場合があり、この時は接触部分の黒画素カウント数は増加することが予想される。この場合は、他の走査ラインでの黒画素カウント数と比べて極端に黒画素カウント数が多いものは、平均値計算に使用しないようにすることが好ましい。 Note that some characters may touch the square frame, and at this time, the black pixel count of the contact portion is expected to increase. In this case, it is preferable not to use a pixel having an extremely large number of black pixel counts compared to the number of black pixel counts in other scanning lines for the average value calculation.

本実施の形態によると、上記の手順によって、文字矩形画像Ｐ８０１の四角枠を除去することができる。なお、図７に示した文字矩形画像Ｐ８０１の四角枠除去の詳細はここでは説明しないが、上記の方法で除去することが可能である。 According to the present embodiment, the rectangular frame of the character rectangular image P801 can be removed by the above procedure. Details of the rectangular frame removal of the character rectangular image P801 shown in FIG. 7 are not described here, but can be removed by the above method.

なお、四角枠除去のための処理を行なった場合でも、元の文字矩形画像Ｐ８０１をメモリ１５に記憶しておくものとする。 Even when the process for removing the square frame is performed, the original character rectangular image P801 is stored in the memory 15.

再び図３を参照して、四角枠が除去されると、マッチング部１０は、残りの文字パターンすなわち、枠が除去された文字パターンを、１文字辞書１１と照合する（ステップＳ２０７）。このような処理を行なう理由は次のとおりである。 Referring to FIG. 3 again, when the square frame is removed, matching unit 10 collates the remaining character pattern, that is, the character pattern from which the frame has been removed, with one-character dictionary 11 (step S207). The reason for performing such processing is as follows.

文字が漢字であれば、「国、図、固、・・・」などのように、文字形状が四角として判定されてしまうであろう漢字が存在する。そのため、本実施の形態では、四角として判定されてしまうであろう漢字の四角（くにがまえなど）の中の文字、例えば「玉、古」等をパターンとして登録しておくための１文字辞書１１を予め保持しておく。 If the character is a Chinese character, there is a Chinese character whose character shape will be determined as a square, such as “country, figure, stick,...”. For this reason, in the present embodiment, a one-character dictionary for registering characters in Kanji squares (such as Kuniga-mae) that will be determined as squares, such as “ball, old”, etc., as patterns. 11 is held in advance.

文字矩形画像Ｐ８０１の四角枠が除去できたとすると、残りの文字パターンは「場」である。実際には、文字画像であるため、マッチング部１０は、「場」という形をした文字パターンであると認識しているので、この文字パターンを１文字辞書１１と照合する。１文字辞書１１には、四角（くにがまえ）の特徴を持つ漢字の内側の文字パターンがあらかじめ登録されている。本実施の形態では、例えば、図１５に示すような複数の文字パターン（特定文字パターン）が１文字辞書１１に登録されている。 If the rectangular frame of the character rectangular image P801 can be removed, the remaining character pattern is “place”. Actually, since it is a character image, the matching unit 10 recognizes that it is a character pattern in the form of “field”, and therefore collates this character pattern with the one-character dictionary 11. In the one-character dictionary 11, character patterns inside kanji having a square feature are registered in advance. In the present embodiment, for example, a plurality of character patterns (specific character patterns) as shown in FIG.

枠除去後の文字パターンが、１文字辞書１１に存在しない文字であれば（ステップＳ２０８においてＹＥＳ）、文字認識処理部１２は、当該文字パターン（枠除去後の文字パターン）を文字認識処理にかける（ステップＳ２０９）。この処理により得られたテキストデータをメモリ１５に一時記憶する。この処理が終わると、ステップＳ２１１に進む。 If the character pattern after the frame removal is a character that does not exist in the one-character dictionary 11 (YES in step S208), the character recognition processing unit 12 applies the character pattern (the character pattern after the frame removal) to the character recognition processing. (Step S209). The text data obtained by this processing is temporarily stored in the memory 15. When this process ends, the process proceeds to step S211.

文字矩形画像Ｐ８０１の例であれば、マッチング部１０は、１文字辞書１１の文字パターンと枠除去後の文字パターン（「場」の文字パターン）Ｐ８０１＃とをひとつずつ照合し、一致するかどうかを判定する。図１５に示した１文字辞書１１には「場」の文字パターンＰ８０１＃と一致する文字パターンが存在しないので、文字パターンＰ８０１＃は１文字辞書１１に存在しない文字であったとして、ステップＳ２０９の処理が行なわれる。 In the example of the character rectangular image P801, the matching unit 10 collates the character pattern of the one-character dictionary 11 and the character pattern (“field” character pattern) P801 # after the frame removal one by one, and determines whether or not they match. Determine. Since the one-character dictionary 11 shown in FIG. 15 does not have a character pattern that matches the “place” character pattern P801 #, it is assumed that the character pattern P801 # is a character that does not exist in the one-character dictionary 11, and the process proceeds to step S209. Processing is performed.

ステップＳ２０９では、四角枠を除去した文字パターンＰ８０１＃が文字認識されるため、最終的に「場」というテキストデータに変換されて、メモリ１５に一時記憶される。 In step S209, since the character pattern P801 # from which the square frame has been removed is recognized, it is finally converted into text data “place” and temporarily stored in the memory 15.

上記ステップＳ２０８において、枠除去後の文字パターンが、１文字辞書１１内の文字パターンと一致した場合には（ステップＳ２０８においてＮＯ）、上述のステップＳ２１０に進み、文字認識処理部１２は、枠を削除していない元の文字を文字認識処理にかける。認識された文字のテキストデータがメモリ１５に一時記憶される。 In step S208, if the character pattern after the frame removal matches the character pattern in the one-character dictionary 11 (NO in step S208), the process proceeds to step S210 described above, and the character recognition processing unit 12 sets the frame. The original character that has not been deleted is subjected to character recognition processing. The text data of the recognized character is temporarily stored in the memory 15.

ステップＳ２１１において、行内の全ての文字矩形画像の文字認識が終了したか否かを判断する。終了していなければ（ステップＳ２１１においてＮＯ）、次の文字矩形画像を選択して、ステップＳ２０４に戻る。終了していれば（ステップＳ２１１においてＹＥＳ）、ステップＳ２１２に進む。 In step S211, it is determined whether or not the character recognition of all the character rectangular images in the line has been completed. If not completed (NO in step S211), the next character rectangular image is selected, and the process returns to step S204. If completed (YES in step S211), the process proceeds to step S212.

ステップＳ２１２において、入力画像（文字列画像）内の全ての行の処理が終了したか否かを判断する（ステップＳ２１２）。終了していなければ（ステップＳ２１２においてＮＯ）、次の文字列行を選択して、ステップＳ２０３に戻る。終了していれば（ステップＳ２１２においてＮＯ）、ステップＳ２１３に進む。 In step S212, it is determined whether or not processing of all lines in the input image (character string image) has been completed (step S212). If not completed (NO in step S212), the next character string line is selected, and the process returns to step S203. If completed (NO in step S212), the process proceeds to step S213.

ステップＳ２１３において、メモリ１５内に一時記憶されていた文字認識後のテキストデータが出力装置１３に出力される。これにより、読取った文字列画像のテキストデータが出力装置１３に表示される。上述の文字矩形画像Ｐ８０１の例の場合、文字矩形画像Ｐ８０１は、「場」という文字と認識されて出力装置１３に出力される。 In step S 213, text data after character recognition temporarily stored in the memory 15 is output to the output device 13. Thereby, the text data of the read character string image is displayed on the output device 13. In the case of the example of the character rectangular image P801 described above, the character rectangular image P801 is recognized as the character “place” and is output to the output device 13.

なお、「場」は「場所」の意味であると対応付けて予め記憶しておき、認識結果として「場所」と出力する構成でも良い。また、近年の携帯電話等では、名刺を読み取って会社名や氏名を登録できる機能があるが、この方法を応用して、携帯電話の電話帳などの「場所」という登録項目に、「四角付きの場」の後に続いて認識される文字列を店等の住所、もしくは場所であると判定し登録する仕組みも考えられる。この仕組みは、四角枠内の漢字が「場」以外の場合でも同様である。また、本実施の形態では、四角付きの「場」について認識を行ったが、他の漢字や記号等の四角付き文字についても同様の処理で認識することが可能である。 Note that “place” may be stored in advance in association with the meaning of “place”, and “place” may be output as a recognition result. In addition, in recent mobile phones, etc., there is a function that can read a business card and register the company name and name, but by applying this method, in the registration item of “location” such as the phone book of the mobile phone, A mechanism for determining and registering a character string recognized after “no place” as an address or place of a store or the like is also conceivable. This mechanism is the same even when the kanji in the square frame is other than “ba”. In this embodiment, the “place” with a square is recognized. However, other characters with a square such as kanji and symbols can be recognized by the same process.

以上の処理によって、出力装置１３には文字認識された文字列が表示される。
したがって、予め文字認識辞書４に多種の四角付き文字（任意の文字と四角枠との合成文字）を登録しておかなくても、四角枠で囲まれた文字を適切に認識することができる。 Through the above processing, the character string whose character has been recognized is displayed on the output device 13.
Therefore, even if various kinds of characters with squares (combined characters of arbitrary characters and square frames) are not registered in the character recognition dictionary 4 in advance, the characters surrounded by the square frames can be properly recognized.

なお、本実施の形態では、四角枠の除去後の文字が、１文字辞書１１に存在する文字であれば、四角枠は削除しないこととしたが、実際には、四角枠で囲まれた「寸」や「口」の場合もあると考えられる。そのため、より精度良く文字認識するためには、以下のような処理をさらに実行することが好ましい。 In the present embodiment, if the character after the removal of the square frame is a character that exists in the one-character dictionary 11, the square frame is not deleted, but actually, “ It is thought that there may be a case of “size” or “mouth”. Therefore, in order to recognize characters with higher accuracy, it is preferable to further execute the following processing.

つまり、図３のＳ２０８において、１文字辞書１１に存在する文字であると判断された場合（ステップＳ２０８においてＮＯ）、当該文字の情報として、１文字辞書１１に存在する文字を含む四角枠付き文字であることを識別するための情報を一時記憶しておく。そして、一旦、頁全体の文字認識を行う（この場合も、当該文字について、枠を削除していない元の文字認識を行なう）。その後、後処理という形で、当該文字を含む文章の「正確さ」を確認する処理を行なう（ステップＳ２１２とＳ２１３との間に）。 That is, when it is determined in S208 of FIG. 3 that the character is present in the one-character dictionary 11 (NO in step S208), the character with a square frame including the character existing in the one-character dictionary 11 is used as the character information. Information for identifying this is temporarily stored. Then, the character recognition of the entire page is performed once (in this case, the original character recognition without deleting the frame is performed for the character). Thereafter, a process of confirming the “accuracy” of the sentence including the character is performed in the form of post-processing (between steps S212 and S213).

たとえば、「１文の先頭文字である」、「１文の途中の文字であるがその文字の前に空白がある」といった場合には、四角枠との合成文字である可能性が高い。したがって、当該文字がこのような条件と一致した場合には、再度、枠除去の処理を行えばよい。 For example, in the case of “it is the first character of one sentence” or “it is a character in the middle of one sentence but there is a space before that character”, there is a high possibility that it is a composite character with a square frame. Therefore, if the character matches such a condition, the frame removal process may be performed again.

また／または、別途単語辞書を保持しておき、その単語辞書と一致する文字列であればそのまま枠を除去せず、一致しない文字列であれば枠を除去するようにしてもよい。具体的には、たとえば、「国王が・・・」という文章であったとすると、枠を除去しなければ単語「国王」が成り立つが、枠を除去すると「玉王」となる。後者のような単語は意味が通じない（単語辞書には存在しない）ため、元の文字のままで正しいと判断することができる。 Alternatively, a separate word dictionary may be held, and if the character string matches the word dictionary, the frame is not removed as it is, and if the character string does not match, the frame may be removed. Specifically, for example, if the sentence is “King is ...”, the word “King” is established if the frame is not removed, but “Tamao” is obtained if the frame is removed. Since the word like the latter does not make sense (it does not exist in the word dictionary), it can be determined that the original character is correct.

また／または、雑誌等において四角枠付きの文字や丸枠つきの文字を使用している場合、２個以上そのような文字を使用していることが多いと考えられるため、周辺文章に当該文字しか枠付き文字が存在しなければ、枠を削除しないようにしてもよい。 In addition, when using characters with a square frame or a circle with a magazine, etc., it is considered that two or more such characters are often used. If there is no framed character, the frame may not be deleted.

なお、上記のような処理を追加する場合、枠を除去後の文字をまず文字認識し、その後に文章の正確さを判断するようにしてもよい。 In addition, when the above process is added, the character after removing the frame may be first recognized and then the accuracy of the sentence may be determined.

あるいは、枠を除去していない元の文字と枠除去後の文字との両方の文字認識結果をユーザに提示（たとえば表示）して、ユーザにどちらが正しいかを選択させてもよい。 Alternatively, the character recognition result of both the original character from which the frame has not been removed and the character after the frame has been removed may be presented (for example, displayed) to allow the user to select which is correct.

なお、本実施の形態においては、１文字辞書１１に、四角形状と判定されてしまうであろう漢字の内側の文字（記号）のパターンを登録しておくこととしたが、四角形状と判定されてしまうであろう漢字（「国」など）そのもののパターンを予め登録しておいてもよい。この場合、ステップＳ２０６の処理（四角枠の除去）は、ステップＳ２０８において、１文字辞書１１に存在する文字と判断された場合に（ステップＳ２０８においてＮＯ）、行なわれるものとする。 In the present embodiment, a pattern of characters (symbols) inside a kanji that would be determined to be a quadrangle is registered in the one-character dictionary 11, but it is determined to be a quadrangle. The pattern of the kanji (such as “country”) itself that will be stored may be registered in advance. In this case, the process of step S206 (removal of the square frame) is performed when it is determined in step S208 that the character exists in the one-character dictionary 11 (NO in step S208).

もしくは逆に、四角付きの文字（四角枠と文字との合成文字）として判定したい漢字を１文字辞書に登録しておく方法でも良い。しかしながら、日本語の漢字であればＪＩＳ第一水準及び第二水準の文字が約６４００文字あり、その中で四角付きの文字を作成できない漢字はほぼないため、四角形状として判定されてしまう漢字を登録しておくほうが実用的であると考えられる。 Or conversely, a method may be used in which a kanji character to be determined as a character with a square (composite character of a square frame and a character) is registered in a one-character dictionary. However, there are about 6400 JIS level 1 and level 2 characters for Japanese Kanji characters, and there are almost no Kanji characters that cannot be used to create square characters. It is considered more practical to register.

［実施の形態２］
次に、丸枠で囲まれた文字の認識方法について説明する。 [Embodiment 2]
Next, a method for recognizing characters surrounded by a round frame will be described.

本実施の形態における情報処理装置の構成および基本的な動作は、上記実施の形態１と同様である。以下に、実施の形態１と異なる部分のみ説明する。 The configuration and basic operation of the information processing apparatus in the present embodiment are the same as those in the first embodiment. Only the parts different from the first embodiment will be described below.

本実施の形態では、枠除去部９は、矩形画像の垂直方向または水平方向の走査によって、対向する２辺の端から連続する黒画素数を算出する。そして、対向する２辺の端から、算出された黒画素数分だけ黒画素を白画素に反転することで、外枠を除去する。 In the present embodiment, the frame removing unit 9 calculates the number of black pixels continuous from the ends of two opposing sides by scanning the rectangular image in the vertical direction or horizontal direction. Then, the outer frame is removed by inverting black pixels to white pixels by the calculated number of black pixels from the ends of the two opposite sides.

図１６は、本発明の実施の形態２に係る情報処理装置１が実行する文字認識処理を示すフローチャートである。図１６のフローチャートに示す処理もまた、予めプログラムとしてＨＤＤ２１２に格納されており、ＣＰＵ２１０がこのプログラムを読み出して実行することにより、文字認識処理の機能が実現される。なお、図３に示した処理と同様の処理については同じステップ番号を付してある。したがって、それらについての説明は繰返さない。 FIG. 16 is a flowchart showing character recognition processing executed by the information processing apparatus 1 according to Embodiment 2 of the present invention. The process shown in the flowchart of FIG. 16 is also stored in advance in the HDD 212 as a program, and the function of the character recognition process is realized by the CPU 210 reading and executing this program. In addition, the same step number is attached | subjected about the process similar to the process shown in FIG. Therefore, description thereof will not be repeated.

実施の形態１と比較すると、本実施の形態では、ステップＳ２０４，Ｓ２０５，Ｓ２０６に代えて、それぞれ、ステップＳ１２０４，Ｓ１２０５，Ｓ１２０６の処理が実行される。また、本実施の形態においては、１文字辞書１１に関する処理であるステップＳ２０７，Ｓ２０８の処理が削除されている。したがって、本実施の形態では、マッチング部１０および１文字辞書１１は必須ではない。 Compared with Embodiment 1, in this embodiment, instead of Steps S204, S205, and S206, the processes of Steps S1204, S1205, and S1206 are executed. In the present embodiment, the processes in steps S207 and S208, which are processes related to the one-character dictionary 11, are deleted. Accordingly, in the present embodiment, the matching unit 10 and the one-character dictionary 11 are not essential.

ステップＳ１２０４において、文字形状判定部８は、切り出された１つの文字矩形画像を選択し、当該画像の形状が、丸であるかどうかを判定する。すなわち、文字矩形画像内の白黒画素の分布から、文字形状が丸であるかどうかを判定する。ここでの判定についても、ハフ変換などの既知の方法で行なえばよい。 In step S1204, the character shape determination unit 8 selects one extracted character rectangular image, and determines whether the shape of the image is a circle. That is, it is determined whether or not the character shape is a circle from the distribution of black and white pixels in the character rectangular image. The determination here may also be performed by a known method such as Hough transform.

なお、本実施の形態における文字の切り出し処理（ステップＳ２０３）についても公知の方法で行なわれればよいが、各文字画像の黒画素の上下左右の一番外側の黒画素を最大座標とするように切り出すことを条件とする。 Note that the character segmentation process (step S203) in the present embodiment may be performed by a known method, but the outermost black pixels on the upper, lower, left, and right sides of the black pixels of each character image are set to the maximum coordinates. Conditioned to cut out.

ステップＳ１２０５において、文字形状が丸と判定されたか否かが判断される。丸と判断されれば（ステップＳ１２０５においてＹＥＳ）、ステップＳ１２０６に進み、丸でないと判断されれば（ステップＳ１２０５においてＮＯ）、上述のステップＳ２１０に進む。 In step S1205, it is determined whether or not the character shape is determined to be a circle. If it is determined to be a circle (YES in step S1205), the process proceeds to step S1206. If it is determined not to be a circle (NO in step S1205), the process proceeds to step S210 described above.

ここで、図１７に示されるように、文字列行Ｐ７０４（図６）内の「丸（○）付きの時」を示す文字矩形画像Ｐ１６０１が選択され、当該画像Ｐ１６０１が丸形状であると判定されたものとして、以下説明する。 Here, as shown in FIG. 17, a character rectangular image P1601 indicating “when a circle (◯) is attached” in the character string row P704 (FIG. 6) is selected, and it is determined that the image P1601 has a round shape. This will be described below.

ステップＳ１２０６において、枠除去部９は、丸の枠を除去するための処理（以下「丸枠除去処理」ともいう）を実行する。 In step S1206, the frame removal unit 9 executes a process for removing a round frame (hereinafter also referred to as “round frame removal process”).

図１８は、丸枠除去処理を示すフローチャートである。図１９は、丸枠除去処理を説明するための図である。図１９（１），（２）に、文字矩形画像Ｐ１６０１を拡大した図を示している。図１９（１），（２）の点線は、切り出された矩形領域を表している。また、実施の形態１と同様に、文字矩形画像Ｐ１６０１の左上端を原点とし、横方向をＸ方向、縦方向をＹ方向とする。 FIG. 18 is a flowchart showing the round frame removal process. FIG. 19 is a diagram for explaining the round frame removal process. FIGS. 19A and 19B are enlarged views of the character rectangular image P1601. The dotted lines in FIGS. 19 (1) and (2) represent the cut out rectangular area. Similarly to the first embodiment, the upper left corner of the character rectangular image P1601 is the origin, the horizontal direction is the X direction, and the vertical direction is the Y direction.

はじめに概略について説明すると、まずは、文字矩形画像Ｐ１６０１の原点からＸ方向に走査し、黒画素数のカウントを行う。走査開始から、黒画素がはじめて出現したところから白画素が出現するまでの黒画素の連続数をカウントする。四角の場合は、初めて黒画素が出現したところからカウントを行ったが、丸の場合は、原点から丸の外枠に達するまでに、背景画像（白画素）が多いため、ある程度連続する画素数があるものを採用する。 First, the outline will be described. First, scanning is performed in the X direction from the origin of the character rectangular image P1601, and the number of black pixels is counted. From the start of scanning, the number of continuous black pixels from the first appearance of black pixels to the appearance of white pixels is counted. In the case of a square, counting was performed from the first appearance of a black pixel, but in the case of a circle, there are many background images (white pixels) from the origin to the outer frame of the circle. Adopt one that has

ここでも、丸枠除去の方法の説明を容易化するために、図２０に示した１２×１２マスの画像を用いて説明する。図２０の１マスは１画素を表しており、ハッチングで示したマスは、黒画素であることを意味する。図２０には、丸枠を構成する黒画素の分布が示されている。 Here, in order to facilitate the description of the method of removing the round frame, description will be made using the 12 × 12 square image shown in FIG. One square in FIG. 20 represents one pixel, and a hatched square means a black pixel. FIG. 20 shows the distribution of black pixels constituting the round frame.

図１８および図２０を参照して、まず、原点からＸ方向に１ラインずつ走査して、連続して出現する黒画素数をカウントする（ステップＳ１５１１）。ここで、図１９（２）に示しているが、原点からＸ方向への走査は“Ｋ”まで行う。“Ｋ”はＸ方向軸（Ｘ軸）の画素数の半分の値である。この際、右上端からも同様に“Ｋ”の位置に向かってＸ方向（実際には−Ｘ方向）に走査する。また、連続して出現する黒画素のカウント数は、「Ｙ１からＹｂ」及び「Ｙ’１からＹ’ｂ」にそれぞれ格納することとする。“ｂ”は任意の整数とし、ここでは走査するライン数を表している。 Referring to FIGS. 18 and 20, first, scanning is performed line by line in the X direction from the origin, and the number of black pixels that appear continuously is counted (step S1511). Here, as shown in FIG. 19B, scanning from the origin to the X direction is performed up to “K”. “K” is a value half the number of pixels on the X-direction axis (X-axis). At this time, scanning is similarly performed from the upper right end toward the position “K” in the X direction (actually −X direction). In addition, the count numbers of the black pixels that appear continuously are stored in “Y1 to Yb” and “Y′1 to Y′b”, respectively. “B” is an arbitrary integer, and here represents the number of lines to be scanned.

図２０の矩形画像について、走査を行って黒画素数をカウントした結果が、図２１（１），（２）の表に示されている。図２１（１）のカウント結果は、図２０において、Ｘ方向の中央位置（Ｋ）を基準とする左半分の領域Ｒ１に対応しており、図２１（２）のカウント結果は、図２０において、Ｘ方向の中央位置（Ｋ）を基準とする右半分の領域Ｒ２に対応している。 The results of scanning the rectangular image of FIG. 20 and counting the number of black pixels are shown in the tables of FIGS. 21 (1) and (2). The count result of FIG. 21 (1) corresponds to the left half region R1 with reference to the center position (K) in the X direction in FIG. 20, and the count result of FIG. , Corresponding to the right half region R2 with reference to the center position (K) in the X direction.

ここで、“ｂ”は走査するライン数なので１２となり、“ｔ”はＹのインデックスである。Ｙ１はｔ＝１の時の黒画素カウント数となる。図２１（１）に示すｔ＝１の場合は、原点からの走査になるが、図２０に示した第一画素ＦＰＸは１画素のみのかたまり（次の画素が白画素）である。したがって、第一画素ＦＰＸはカウントには入れず、次に黒画素が出現するまでさらに走査する。図２１（１），（２）に示されるように、ラインごとに、カウントを開始した位置（左端または右端からの何番目か）を黒画素数と対応付けて記憶する。 Here, “b” is 12 because it is the number of lines to be scanned, and “t” is an index of Y. Y1 is the black pixel count when t = 1. In the case of t = 1 shown in FIG. 21A, scanning is performed from the origin, but the first pixel FPX shown in FIG. 20 is a cluster of only one pixel (the next pixel is a white pixel). Therefore, the first pixel FPX is not counted, and further scanning is performed until the next black pixel appears. As shown in FIGS. 21 (1) and (2), for each line, the position where counting is started (the number from the left end or the right end) is stored in association with the number of black pixels.

次に、枠除去部９は、図２１（１），（２）の黒画素カウント数から平均値を求める（ステップＳ１５１２）。図２１（１），（２）の各々について、黒画素カウント数の総数をｔの数（ここでは１２）で除算すると、“２”が平均値として算出される。 Next, the frame removal unit 9 obtains an average value from the black pixel count numbers in FIGS. 21 (1) and 21 (2) (step S1512). For each of FIGS. 21A and 21B, when the total number of black pixel counts is divided by the number of t (here, 12), “2” is calculated as an average value.

平均値が算出されると、画像の左上端および右上端からＹ方向に１ラインずつ走査し、Ｘ方向の２画素分の画素を白画素に反転する（ステップＳ１５１３）。この処理について、図２２を参照して具体的に説明する。 When the average value is calculated, scanning is performed line by line in the Y direction from the upper left end and the upper right end of the image, and the pixels for two pixels in the X direction are inverted to white pixels (step S1513). This process will be specifically described with reference to FIG.

図２２（１）には、図２０に示した丸枠の拡大図が示されている。
図２２（２）を参照して、まず、左上端（原点）から走査していく場合、１ライン目は第一画素が黒画素であるが、図２１（１）に示されるように、カウント開始位置が４番目の画素なので、４番目の黒画素から２画素分を白画素に反転する。つまり、第一画素はステップＳ１５１１における黒画素カウントの対象外としたので、これを無視し（とばし）、次に黒画素の出現する画素から２画素分を白画素に反転する。 FIG. 22 (1) shows an enlarged view of the round frame shown in FIG.
Referring to FIG. 22 (2), first, when scanning from the upper left end (origin), the first pixel is a black pixel in the first line, but as shown in FIG. Since the start position is the fourth pixel, two pixels from the fourth black pixel are inverted to white pixels. That is, since the first pixel is excluded from the black pixel count in step S1511, it is ignored (skip), and two pixels from the next black pixel appearing are inverted to white pixels.

ここで、仮に黒画素除去分の画素数進むと、“Ｋ”（ここでは６）よりも大きくなる場合は“Ｋ”までしか処理を行わないようにする。 Here, if the number of pixels for the black pixel removal is advanced, if it is larger than “K” (here, 6), only the processing up to “K” is performed.

以降、同様に左下端まで走査を行い、２画素分を白画素に反転する。この結果、図２２（２）のような画像が得られる。 Thereafter, scanning is similarly performed to the lower left corner, and two pixels are inverted to white pixels. As a result, an image as shown in FIG. 22 (2) is obtained.

次に図２２（２）の画像の右上端からＸ方向（実際にはマイナスのＸ方向）に走査を行い、カウント開始位置より２画素分を白画素に反転していく。この結果、図２２（３）のような画像を得ることができる。 Next, scanning is performed in the X direction (actually in the negative X direction) from the upper right end of the image in FIG. 22B, and two pixels are inverted to white pixels from the count start position. As a result, an image as shown in FIG. 22 (3) can be obtained.

図２２（３）に示されるように、平均黒画素数だけ反転させる処理を行なうと、黒画素が何点か残ってしまう場合がある。しかしながら、画像処理の孤立点を除去する公知の方法を用いれば、残りの点を消すことができる。 As shown in FIG. 22 (3), when the process of inverting the number of average black pixels is performed, some black pixels may remain. However, if a known method for removing isolated points in image processing is used, the remaining points can be erased.

以上のような方法により、「丸付きの時」の外枠（丸枠）を除去することができる。
再び図１６を参照して、丸枠が除去されると、上述のステップＳ２０９に進み、丸枠除去後の文字矩形画像（「時」という文字パターン）が文字認識処理にかけられる。これにより、「時」というテキストデータを得ることができる。 By the method as described above, the outer frame (round frame) at the time of “rounded” can be removed.
Referring to FIG. 16 again, when the round frame is removed, the process proceeds to step S209 described above, and the character rectangular image (character pattern “hour”) after the round frame is removed is subjected to the character recognition process. Thereby, text data “time” can be obtained.

以上のように、丸付きの文字（任意の文字と丸枠との合成文字）であっても、予め文字認識辞書４に多種の丸付き文字そのものを登録しておかなくても、丸枠で囲まれた文字を適切に認識することができる。 As described above, even if it is a circled character (a composite character of an arbitrary character and a circle frame), even if various types of circled characters themselves are not registered in the character recognition dictionary 4 in advance, The enclosed characters can be recognized properly.

なお、上記実施の形態１と同様に、「時」が認識されれば、その文字の次に認識される同一行の数字を時間として認識するようにしてもよい。 As in the first embodiment, if “hour” is recognized, the numbers on the same line recognized next to the characters may be recognized as time.

また、実施の形態１と実施の形態２の処理を組合わせてもよい。つまり、１つの情報処理装置において、四角枠の除去と丸枠の除去との両方をできてもよい。 Moreover, you may combine the process of Embodiment 1 and Embodiment 2. FIG. That is, in one information processing apparatus, both the removal of the square frame and the removal of the round frame may be performed.

また、上記実施の形態１および２では、外枠が四角枠または丸枠であるものとして説明したが、これらに限定されるものではなく、外枠が、楕円、多角形、あるいは長方形などであっても、同様の処理によって外枠内の文字を認識することが可能である。 In the first and second embodiments, the outer frame is described as a square frame or a round frame. However, the outer frame is not limited to these, and the outer frame is an ellipse, a polygon, a rectangle, or the like. However, it is possible to recognize characters in the outer frame by the same processing.

また、上記実施の形態１では、単一の文字が外枠で包含されている場合にのみ説明したが、１文字辞書に複数文字用のパターンも用意することで、外枠で囲まれた複数文字も認識することができる。 In the first embodiment, the description has been made only in the case where a single character is included in the outer frame. However, by preparing a pattern for a plurality of characters in a one-character dictionary, a plurality of characters surrounded by the outer frame are provided. Characters can also be recognized.

また、本発明の情報処理装置が行なう、文字認識方法を、プログラムとして提供することもできる。このようなプログラムは、ＣＤ−ＲＯＭ（Compact Disc-ROM）などの光学媒体や、メモリカードなどのコンピュータ読取り可能な記録媒体にて記録させて、プログラム製品として提供することもできる。また、ネットワークを介したダウンロードによって、プログラムを提供することもできる。 The character recognition method performed by the information processing apparatus of the present invention can also be provided as a program. Such a program can be recorded on an optical medium such as a CD-ROM (Compact Disc-ROM) or a computer-readable recording medium such as a memory card and provided as a program product. A program can also be provided by downloading via a network.

なお、本発明にかかるプログラムは、コンピュータのオペレーティングシステム（ＯＳ）の一部として提供されるプログラムモジュールのうち、必要なモジュールを所定の配列で所定のタイミングで呼出して処理を実行させるものであってもよい。その場合、プログラム自体には上記モジュールが含まれずＯＳと協働して処理が実行される。このようなモジュールを含まないプログラムも、本発明にかかるプログラムに含まれ得る。 The program according to the present invention is a program module that is provided as a part of a computer operating system (OS) and calls necessary modules in a predetermined arrangement at a predetermined timing to execute processing. Also good. In that case, the program itself does not include the module, and the process is executed in cooperation with the OS. A program that does not include such a module can also be included in the program according to the present invention.

また、本発明にかかるプログラムは他のプログラムの一部に組込まれて提供されるものであってもよい。その場合にも、プログラム自体には上記他のプログラムに含まれるモジュールが含まれず、他のプログラムと協働して処理が実行される。このような他のプログラムに組込まれたプログラムも、本発明にかかるプログラムに含まれ得る。 The program according to the present invention may be provided by being incorporated in a part of another program. Even in this case, the program itself does not include the module included in the other program, and the process is executed in cooperation with the other program. Such a program incorporated in another program can also be included in the program according to the present invention.

提供されるプログラム製品は、ハードディスクなどのプログラム格納部にインストールされて実行される。なお、プログラム製品は、プログラム自体と、プログラムが記憶された記憶媒体とを含む。 The provided program product is installed in a program storage unit such as a hard disk and executed. Note that the program product includes the program itself and a storage medium in which the program is stored.

今回開示された実施の形態はすべての点で例示であって制限的なものではないと考えられるべきである。本発明の範囲は上記した説明ではなくて特許請求の範囲によって示され、特許請求の範囲と均等の意味および範囲内でのすべての変更が含まれることが意図される。 The embodiment disclosed this time should be considered as illustrative in all points and not restrictive. The scope of the present invention is defined by the terms of the claims, rather than the description above, and is intended to include any modifications within the scope and meaning equivalent to the terms of the claims.

本発明の各実施の形態における情報処理装置の機能構成を表わす機能ブロック図である。It is a functional block diagram showing the function structure of the information processing apparatus in each embodiment of this invention. 本発明の各実施の形態における情報処理装置のハードウェアブロック図である。It is a hardware block diagram of the information processing apparatus in each embodiment of the present invention. 本発明の実施の形態１に係る情報処理装置が実行する文字認識処理を示すフローチャートである。It is a flowchart which shows the character recognition process which the information processing apparatus which concerns on Embodiment 1 of this invention performs. 文字列画像（入力画像）の一例を示す図である。It is a figure which shows an example of a character string image (input image). ２値化後の文字列画像の一例を示す図である。It is a figure which shows an example of the character string image after binarization. ２値化後の画像より文字列行が切り出された例を示す図である。It is a figure which shows the example from which the character string line was cut out from the image after binarization. 文字列行から切り出された文字矩形画像の一例を示す図である。It is a figure which shows an example of the character rectangular image cut out from the character string line. 四角判定処理の流れを示すフローチャートである。It is a flowchart which shows the flow of a square determination process. 四角判定方法を説明するための図である。It is a figure for demonstrating the square determination method. 四角枠除去処理を示すフローチャートである。It is a flowchart which shows a square frame removal process. 四角枠除去処理を説明するための図である。It is a figure for demonstrating a square frame removal process. （１）〜（４）は、それぞれ、４辺からの走査による黒画素カウント数を示す表である。(1) to (4) are tables showing black pixel count numbers obtained by scanning from four sides. 図１２（１）〜（４）それぞれの黒画素カウント数の平均値の計算結果を示す図である。12A and 12B are diagrams showing the calculation results of the average value of the black pixel count numbers. 図１３の計算結果をもとに、四角枠が除去される様子を示す図である。It is a figure which shows a mode that a square frame is removed based on the calculation result of FIG. １文字辞書に登録された特定文字パターンと枠除去後の文字パターンとのマッチング処理の概念を示す図である。It is a figure which shows the concept of the matching process of the specific character pattern registered into the 1 character dictionary, and the character pattern after a frame removal. 本発明の実施の形態２に係る情報処理装置が実行する文字認識処理を示すフローチャートである。It is a flowchart which shows the character recognition process which the information processing apparatus which concerns on Embodiment 2 of this invention performs. 丸形状の文字矩形画像の一例を示す図である。It is a figure which shows an example of a round-shaped character rectangular image. 丸枠除去処理を示すフローチャートである。It is a flowchart which shows a round frame removal process. 丸枠除去処理を説明するための図である。It is a figure for demonstrating a round frame removal process. 丸枠を構成する黒画素の分布例を示す図である。It is a figure which shows the example of distribution of the black pixel which comprises a round frame. （１），（２）は、それぞれ、対向する辺からの走査による黒画素カウント数，黒画素数平均値を示す表である。(1) and (2) are tables showing black pixel count numbers and black pixel number average values obtained by scanning from opposite sides, respectively. 図２１の計算結果をもとに、丸枠が除去される様子を示す図である。It is a figure which shows a mode that a round frame is removed based on the calculation result of FIG.

Explanation of symbols

１情報処理装置、２入力装置、３制御部、４文字認識辞書、５２値化処理部、６文字列行切り出し部、７文字切り出し部、８文字形状判定部、９枠除去部、１０マッチング部、１１１文字辞書、１２文字認識処理部、１３出力装置、１４入出力制御部、１５メモリ、２１０ＣＰＵ、２１１操作部、２１２ＨＤＤ、２１４メモリ部、２１５ＣＤ−ＲＯＭドライブ、２１６入出力Ｉ／Ｆ、２１７通信インターフェイス部、２１８内部バス、２２１画像読取装置、２２２表示装置。 1 information processing device, 2 input device, 3 control unit, 4 character recognition dictionary, 5 binarization processing unit, 6 character string row cutout unit, 7 character cutout unit, 8 character shape determination unit, 9 frame removal unit, 10 matching 11, 1 character dictionary, 12 character recognition processing unit, 13 output device, 14 input / output control unit, 15 memory, 210 CPU, 211 operation unit, 212 HDD, 214 memory unit, 215 CD-ROM drive, 216 input / output I / F, 217 Communication interface unit, 218 Internal bus, 221 Image reading device, 222 Display device.

Claims

An information processing apparatus for recognizing characters surrounded by an outer frame from a character string image,
Character cutout means for cutting out a rectangular image corresponding to one character from the character string image;
Shape determining means for determining whether or not the character shape in the rectangular image matches a predetermined type of outer frame shape based on the distribution of monochrome pixels in the cut out rectangular image;
When it is determined by the shape determining means that the images match , the width of the outer frame to be removed is determined based on the distribution of black pixels within the range corresponding to the outer frame shape determined to match, and the determined width based on the frame removing means for removing an outer frame of the character in said rectangular image,
An information processing apparatus comprising: recognition means for performing character recognition on the character pattern of the image after processing by the frame removal means.

Determines that the frame removing means, said a plurality of scanning in a first direction parallel to one side of the rectangular image, the number of consecutive black pixels from the end portion side of the rectangular image is calculated for each scan, the match In accordance with the outer frame shape calculated, an average value of the number of black pixels calculated in all or part of the plurality of scans is calculated, and black pixels are inverted into white pixels by the number of black pixels of the calculated average value. The information processing apparatus according to claim 1, wherein the outer frame is removed.

The outer frame shape of the predetermined type includes a round shape,
The frame removing means includes
When the shape determining means determines that the character shape in the rectangular image is the round shape, the scanning of the rectangular image in the first direction or in the second direction orthogonal to the first direction, Means for calculating the number of continuous black pixels from the ends of two opposing sides;
And means for removing an outer frame by inverting black pixels to white pixels by the number of black pixels of the average value calculated from the ends of the two opposing sides. Information processing device.

The outer frame shape of the predetermined type includes a square shape,
Specific character storage means for storing in advance a character pattern inside a character whose outside is a square frame as a specific character pattern;
Matching means for collating the character pattern of the image processed by the frame removing means with the specific character pattern when the shape determining means determines that the character shape in the rectangular image is the quadrangular shape And
The recognition means performs character recognition of the original character pattern with the outer frame remaining cut out by the character cutout means when the matching means determines that it matches the specific character pattern. The information processing apparatus according to any one of 2 and 3.

The information processing apparatus according to claim 4, wherein the recognizing unit performs character recognition of the character pattern processed by the frame removing unit when the matching unit determines that the specific character pattern does not match.

A storage means,
The storage unit holds both the image after the processing by the frame removing unit and the rectangular image before the outer frame is removed when it is determined by the shape determining unit that they match. Or the information processing apparatus according to 5.

The outer frame shape of the predetermined type includes a square shape,
Specific character storage means for preliminarily storing a character pattern whose outside is a square frame as a specific character pattern;
A matching means for matching the character pattern of the rectangular image with the specific character pattern when the shape determining means determines that the character shape in the rectangular image is the square shape;
4. The information according to claim 2, wherein the frame removing unit removes an outer frame of a character in the rectangular image only when the matching unit determines that the character pattern does not match the specific character pattern. 5. Processing equipment.

The frame removing means includes
When the shape determining means determines that the character shape in the rectangular image is the quadrangular shape, the scanning of the rectangular image in the first direction and the second direction orthogonal to the first direction, Means for calculating the number of continuous black pixels from the ends of the four sides of the rectangular image;
A means for removing an outer frame by inverting black pixels to white pixels by the calculated number of black pixels corresponding to the average value from each end of four sides. The information processing apparatus described in 1.

The information processing apparatus according to any one of claims 1 to 8, wherein the character surrounded by the outer frame includes any one of a kanji character, an English character, a hiragana character, a number, and a katakana character in the frame.

A method for recognizing characters enclosed in an outer frame from a character string image,
Cutting out a rectangular image corresponding to one character from the character string image;
Determining whether a character shape in the rectangular image matches a predetermined type of outer frame shape based on a distribution of black and white pixels in the cut out rectangular image;
When it is determined that the character shape in the rectangular image matches the outer frame shape of the predetermined type, the outer shape to be removed based on the distribution of black pixels within the range corresponding to the outer frame shape determined to match Determining a width of the frame, and removing an outer frame of characters in the rectangular image based on the determined width ;
Performing character recognition on the character pattern of the image after removing the outer frame.

A program for recognizing characters enclosed in an outer frame from a character string image,
Cutting out a rectangular image corresponding to one character from the character string image;
Determining whether a character shape in the rectangular image matches a predetermined type of outer frame shape based on a distribution of black and white pixels in the cut out rectangular image;
When it is determined that the character shape in the rectangular image matches the outer frame shape of the predetermined type, the outer shape to be removed based on the distribution of black pixels within the range corresponding to the outer frame shape determined to match Determining a width of the frame, and removing an outer frame of characters in the rectangular image based on the determined width ;
A character recognition program for causing a computer to execute character recognition for a character pattern of an image after removing an outer frame.

The computer-readable recording medium which recorded the character recognition program of Claim 11.