JP2007122500A

JP2007122500A - Character recognition device, character recognition method and character data

Info

Publication number: JP2007122500A
Application number: JP2005315074A
Authority: JP
Inventors: Atsushi Koinuma; 敦鯉沼
Original assignee: Ricoh Co Ltd; Ricoh Technosystems Co Ltd
Current assignee: Ricoh Co Ltd; Ricoh Technosystems Co Ltd
Priority date: 2005-10-28
Filing date: 2005-10-28
Publication date: 2007-05-17
Anticipated expiration: 2025-10-28
Also published as: TWI338865B; TW200717338A; CN1955981A; JP4881605B2; CN100568265C

Abstract

<P>PROBLEM TO BE SOLVED: To provide a character recognition device and a character recognition method, in which a character can be accurately recognized at high speed, and character data for the character recognition. <P>SOLUTION: The character recognition device for recognizing a character from optically scanned image data of a document comprises a font determination means 21 determining the font of the character, a character size determination means 22 determining the size of the character, character data 5 for recognizing characters, which is stored in conformation to fonts of characters and character sizes, and a character identification means 24 determining, based on the font determined by the font determination means and the character size determined by the character size determination means, a character code of the character in reference to the character data. <P>COPYRIGHT: (C)2007,JPO&INPIT

Description

本発明は、光学的に走査された原稿の画像データから文字を認識する文字認識装置、文字認識方法又は文字認識のための文字データに関する。 The present invention relates to a character recognition device, a character recognition method, or character data for character recognition for recognizing characters from image data of an optically scanned document.

ワープロ等で作成された文字が印刷された原稿を、コンピュータ等の情報処理装置で扱えるようにデジタル化する文字認識の技術が提案されている。文字認識技術では、原稿をイメージスキャナ等で読み込んで文字を認識し、文字を英数字、ひらがな又は漢字などの文字コードに変換して保存する。 There has been proposed a character recognition technique for digitizing a manuscript on which characters created by a word processor or the like are printed so that the information processing apparatus such as a computer can handle the document. In the character recognition technology, an original is read by an image scanner or the like to recognize characters, and the characters are converted into character codes such as alphanumeric characters, hiragana or kanji and stored.

従来の文字認識装置においては、複数の代表的なフォント（ゴシック系、明朝系、セリフ系、サンセリフ系、モノスペース系）の特徴量を平均化して文字の識別に用いる文字パターンの識別辞書を作成していた。しかしながら、特徴量が平均化された文字パターンでは識別力に限界がある。 In a conventional character recognition device, a character pattern identification dictionary used for character recognition by averaging feature values of a plurality of representative fonts (Gothic, Mincho, serif, sans serif, and monospace) is used. I was making it. However, there is a limit to discriminating power in a character pattern in which feature amounts are averaged.

このため、フォント毎に識別辞書を用意する文字認識の技術が提案されている（例えば、特許文献１参照。）。かかる技術では、パソコン等に組み込まれたフォントを検出し、フォント毎の各文字について標準パターンを作成し登録する。文字認識する場合は、登録された標準パターンを用いてスキャナ等で読み込まれた画像データから文字を認識する。 For this reason, a character recognition technique for preparing an identification dictionary for each font has been proposed (see, for example, Patent Document 1). In such a technique, a font incorporated in a personal computer or the like is detected, and a standard pattern is created and registered for each character for each font. In the case of character recognition, characters are recognized from image data read by a scanner or the like using a registered standard pattern.

また、スキャナ装置等で読み込まれた画像データにおける文字の形状に関する特徴量を抽出し、あらかじめ用意したフォント種類毎の特徴量との類似度を計算し、類似度に基づいてフォントの種類を識別する技術が提案されている（例えば、特許文献２参照。）。かかる技術では、フォントの種類に応じてあらかじめ文字の輪郭線情報が用意されているため、フォントが判明すれば、文字の輪郭線情報に基づいて画像データの文字形状の修正を行い格納あるいは表示することができる。したがって、文字認識を行わずにフォントの種類のみを識別することで、文字の誤認識を避けることができる。
特開２００２−２７９３５号公報特開平８−１２３９０４号公報 In addition, a feature amount related to the shape of a character in image data read by a scanner device or the like is extracted, a similarity with a feature amount for each font type prepared in advance is calculated, and a font type is identified based on the similarity. A technique has been proposed (see, for example, Patent Document 2). In such a technique, the outline information of the character is prepared in advance according to the type of the font. Therefore, if the font is identified, the character shape of the image data is corrected based on the outline information of the character and stored or displayed. be able to. Therefore, by recognizing only the font type without performing character recognition, it is possible to avoid erroneous recognition of characters.
JP 2002-27935 A JP-A-8-123904

しかしながら、例えば、特許文献１の文字認識方法では、フォント毎に標準パターンを登録しても文字を識別するための特徴量が不確定であるため識別精度が十分でない。特許文献１では特徴量として、文字線の傾き、ループの数、線幅、文字面積等が挙げられているが、いずれも文字を特定するには十分でない。また、認識率を向上させるため特徴量を増やせば認識速度が低下してしまう。 However, for example, in the character recognition method of Patent Document 1, even if a standard pattern is registered for each font, the feature amount for identifying the character is uncertain, so that the identification accuracy is not sufficient. In Patent Document 1, character line inclination, the number of loops, line width, character area, and the like are listed as feature amounts, but none of them is sufficient to specify a character. Further, if the feature amount is increased in order to improve the recognition rate, the recognition speed is lowered.

また、引用文献２では輪郭情報として文字を保存するので紙原稿と同じ形状のまま文字を取得できるが、取得した文字情報は文字コードでないためワープロソフトウェア等で再加工する場合の扱いが困難である。 In Cited Document 2, since characters are stored as contour information, characters can be acquired in the same shape as a paper document. However, since the acquired character information is not a character code, it is difficult to handle when reprocessing with word processing software or the like. .

本発明は、上記の問題に鑑み、高精度かつ高速に文字を認識できる文字認識装置、文字認識方法及び文字認識のための文字データを提供することを目的とする。 An object of this invention is to provide the character recognition apparatus, the character recognition method, and the character data for character recognition which can recognize a character with high precision and high speed in view of said problem.

上記問題に鑑み、本発明は、光学的に走査された原稿の画像データから文字を認識する文字認識装置において、文字のフォントを判別するフォント判別手段と、文字の大きさを判別する文字サイズ判別手段と、文字のフォント及び文字の大きさ対応づけて格納された文字を認識するための文字データと、フォント判別手段により判別されたフォントと文字サイズ判別手段により判別された文字の大きさに基づき文字データを参照して文字の文字コードを決定する文字識別手段と、を有することを特徴とする。 In view of the above problems, the present invention provides a character recognition device for recognizing characters from image data of an optically scanned document, a font determination unit for determining a font of the character, and a character size determination for determining the size of the character. Means, character data for recognizing characters stored in correspondence with the font and character size of the character, the font determined by the font determining means and the character size determined by the character size determining means And character identification means for determining the character code of the character with reference to the character data.

本発明によれば、文字を認識するための文字データを文字の大きさとフォント毎に用意するため、高精度に文字を認識することができる。 According to the present invention, since character data for recognizing characters is prepared for each character size and font, characters can be recognized with high accuracy.

また、本発明の一形態において、文字データは、文字のビットマップデータを所定数の画素に区切った場合に、当該文字の左方向、右方向、天方向又は地方向から黒画素の現れる数を当該方向からの画素列毎に記録したものであることを特徴とする。 Further, in one embodiment of the present invention, when character bitmap data is divided into a predetermined number of pixels, the character data indicates the number of black pixels that appear from the left, right, top, or ground direction of the character. It is recorded for each pixel column from the direction.

本発明によれば、文字の各方向から黒画素の現れる数を記録した文字データを用いることで、文字全体を読み込まなくても文字の認識を開始でき、高速に文字を認識できる。 According to the present invention, by using character data in which the number of black pixels appearing from each direction of a character is used, character recognition can be started without reading the entire character, and the character can be recognized at high speed.

また、本発明の一形態において、文字データは、文字のビットマップデータを所定数の画素に区切った場合に、各画素毎に白画素又は黒画素の情報を有する画素文字データである、ことを特徴とする。 In one embodiment of the present invention, the character data is pixel character data having white pixel information or black pixel information for each pixel when the bitmap data of the character is divided into a predetermined number of pixels. Features.

本発明によれば、画素毎に白画素又は黒画素かを判定して文字を認識するので、高精度に文字を認識できる。また、文字全体を読み込まなくても文字の認識を開始でき、高速に文字を認識できる。 According to the present invention, since a character is recognized by determining whether it is a white pixel or a black pixel for each pixel, the character can be recognized with high accuracy. In addition, character recognition can be started without reading the entire character, and the character can be recognized at high speed.

また、本発明の一形態において、文字データは、文字のビットマップデータを所定数の画素に区切った場合に複数の２画素間の寸法を有する、ことを特徴とする。 In one embodiment of the present invention, the character data has a dimension between a plurality of two pixels when character bitmap data is divided into a predetermined number of pixels.

本発明によれば、文字の２画素間の寸法に基づき文字を認識するので、高精度に文字を認識できる。 According to the present invention, since the character is recognized based on the dimension between the two pixels of the character, the character can be recognized with high accuracy.

また、本発明の一形態において、原稿の傾き角を判別する傾き角判別手段を有し、文字識別手段は、傾き判別手段により判別された傾き角に応じて画素文字データを傾かせて文字を認識することを特徴とする。 Further, according to one aspect of the present invention, there is provided an inclination angle determination unit that determines an inclination angle of the document, and the character identification unit inclines the pixel character data according to the inclination angle determined by the inclination determination unit, and It is characterized by recognition.

本発明によれば、原稿の傾き角を判別し画素文字データも傾かせるので、文字全体を読み込まなくても文字の認識を開始でき、高速に文字を認識できる。 According to the present invention, since the inclination angle of the original is determined and the pixel character data is also inclined, the character recognition can be started without reading the entire character, and the character can be recognized at high speed.

また、本発明の一形態において、フォントデータから生成された文字のビットマップデータに基づき前記文字データを作成する文字データ作成手段を有し、文字データ作成手段は、フォント判別手段により判別されたフォントであって文字サイズ判別手段により判別された大きさの文字のビットマップデータに基づき前記文字データを作成し、文字認識手段は、文字データ作成手段により作成された文字データに基づき前記文字を認識する、とを特徴とする。 In one embodiment of the present invention, there is provided character data creating means for creating the character data based on bitmap data of characters generated from the font data. The character data creating means includes a font discriminated by the font discriminating means. The character data is generated based on the bitmap data of the character determined by the character size determining means, and the character recognizing means recognizes the character based on the character data generated by the character data generating means. , And.

本発明によれば、使用される頻度の少ないフォントで印刷された原稿であっても、そのフォントを認識して文字データを作成できるので、多様なフォントに対応してきわめて精度よく文字を認識できる。 According to the present invention, even a manuscript printed with a font that is used infrequently can recognize character of the font and create character data, so that the character can be recognized with high accuracy corresponding to various fonts. .

高精度かつ高速に文字を認識できる文字認識装置、文字認識方法及び文字認識のための文字データを提供することができる。 It is possible to provide a character recognition device, a character recognition method, and character data for character recognition that can recognize characters with high accuracy and high speed.

以下、本発明の実施するための最良の形態について、実施例を挙げて図面を参照しながら説明する。なお、本発明の文字認識方法は、本発明の文字認識装置の実施形態に用いられているので、本発明の文字認識方法の実施形態は文字認識装置の実施形態の中で併せて説明する。 Hereinafter, the best mode for carrying out the present invention will be described with reference to the accompanying drawings. In addition, since the character recognition method of this invention is used for embodiment of the character recognition apparatus of this invention, embodiment of the character recognition method of this invention is described together in embodiment of a character recognition apparatus.

図１は、本発明の実施形態における文字認識装置を含む文字認識システムの全体構成図を示す。文字認識システムは、文字認識装置１、スキャナ装置２及びプリンタ３とがネットワーク４を介して相互に通信可能に接続されている。文字認識装置１は後述する文字データ５を有する。なお、本実施の形態の文字認識装置はスキャナ装置２と一体であってもよいし、スキャナ装置２及びプリンタ３と一体であってもよい。また、スキャナ装置２はファクシミリ機能を有していてもよい。 FIG. 1 is an overall configuration diagram of a character recognition system including a character recognition device according to an embodiment of the present invention. In the character recognition system, a character recognition device 1, a scanner device 2, and a printer 3 are connected via a network 4 so that they can communicate with each other. The character recognition device 1 has character data 5 to be described later. The character recognition device according to the present embodiment may be integrated with the scanner device 2 or may be integrated with the scanner device 2 and the printer 3. Further, the scanner device 2 may have a facsimile function.

文字認識装置の文字認識方法の概略を説明する。文字認識装置１は予めＭＳ明朝、ＭＳゴシック、ＯＳＡＫＡ等のフォント毎に各文字のビットマップデータを数値化した文字データ５を有する。また、文字データ５は文字の大きさ（ポイント）毎に格納されている。 An outline of a character recognition method of the character recognition device will be described. The character recognition device 1 has character data 5 in which bitmap data of each character is digitized for each font such as MS Mincho, MS Gothic, OSAKA. The character data 5 is stored for each character size (point).

ビットマップデータは画素ごとに黒又は白の値を取るが、フォント及び文字の大きさが特定されれば当該文字のビットマップデータはフォント及び文字の大きさに固有の２値（白黒）画像となる。したがって、このような文字データ５を利用して文字認識することで、きわめて高精度に文字を認識できる。 The bitmap data takes a black or white value for each pixel, but if the font and character size are specified, the bitmap data of the character is a binary (monochrome) image specific to the font and character size. Become. Therefore, by recognizing characters using such character data 5, characters can be recognized with extremely high accuracy.

スキャナ装置２がこれらのフォントで作成された文字が印刷された原稿を読み取り、文字認識装置１へ送信すると、文字認識装置１は印刷された文字のフォントの種別及び文字の大きさを判別し、判別したフォントと文字の大きさに応じて文字データ５を参照し文字認識を行う。 When the scanner device 2 reads a document on which characters created with these fonts are printed and transmits the document to the character recognition device 1, the character recognition device 1 determines the font type and character size of the printed characters, Character recognition is performed by referring to the character data 5 according to the determined font and character size.

なお、スキャナ装置２は周知の構成であり、例えばコンタクトガラスにセットされた原稿を光学的に走査し、原稿に光を当てその反射光をＣＣＤ等の１次元の撮像素子に入力し電気信号に変換する。スキャナ装置２は光源の移動速度や紙送りを制御しながら、変換された電気信号をＡ／Ｄ変換処理してデジタルデータに変換し、シェーディング処理、変倍処理、エッジ処理、γ補正、二値化処理等、周知の画像処理を行い画像データを取得する。スキャナ装置２は、フラットベッド型であってもよいし固定された光源に原稿を送る原稿送り型であってもよい。 The scanner device 2 has a well-known configuration. For example, an original set on a contact glass is optically scanned, light is applied to the original, and the reflected light is input to a one-dimensional image pickup device such as a CCD to generate an electrical signal. Convert. The scanner device 2 converts the converted electrical signal into digital data by controlling the moving speed of the light source and the paper feed, and converts it into digital data. Shading processing, scaling processing, edge processing, γ correction, binary processing Image data is acquired by performing well-known image processing such as conversion processing. The scanner device 2 may be a flat bed type or a document feed type that sends a document to a fixed light source.

また、プリンタ装置３は周知の構成であり、文字認識装置１やスキャナ装置２から送信された印刷データを印刷する。プリンタ３の画像形成方法は、レーザ方式、ＬＥＤ方式、液晶シャッタ方式、インクジェット方式等、どのような方式であってもよい。 The printer device 3 has a known configuration, and prints print data transmitted from the character recognition device 1 or the scanner device 2. The image forming method of the printer 3 may be any method such as a laser method, an LED method, a liquid crystal shutter method, and an ink jet method.

図２は、文字認識装置のハードウェア構成図の一例を示す。文字認識装置１は、例えばコンピュータとして構成される。文字認識装置１は、バスＢで相互に接続されたＣＰＵ１１、入出力装置１２、表示装置１３、ドライブ装置１４、主記録装置１５、補助記憶装置１６及び通信装置１７、を有する。 FIG. 2 shows an example of a hardware configuration diagram of the character recognition device. The character recognition device 1 is configured as a computer, for example. The character recognition device 1 includes a CPU 11, an input / output device 12, a display device 13, a drive device 14, a main recording device 15, an auxiliary storage device 16, and a communication device 17 that are connected to each other via a bus B.

入出力装置１２はユーザが操作するキーボード及びマウスなどから入力された各種操作信号を処理し、また、スキャナ装置２から送信される画像データの入出力やプリンタ３へ送信する印刷データの入出力を制御する。表示装置１３は、文字認識装置１を操作するのに必要な各種ウィンドウやデータ等のＧＵＩ（ＧｒａｐｈｉｃａｌＵｓｅｒＩｎｔｅｒｆａｃｅ）をディスプレイに表示する。通信装置１７は、文字認識装置１をネットワークに接続する為のインタフェースであり、例えばＮＩＣ（Network Interface Card）やモデム等で構成される。 The input / output device 12 processes various operation signals input from the keyboard and mouse operated by the user, and inputs / outputs image data transmitted from the scanner device 2 and print data input / output to the printer 3. Control. The display device 13 displays a GUI (Graphical User Interface) such as various windows and data necessary for operating the character recognition device 1 on the display. The communication device 17 is an interface for connecting the character recognition device 1 to a network, and includes, for example, a NIC (Network Interface Card) or a modem.

文字認識装置１を動作させるための文字認識プログラムは、メモリカード、ＣＤ−ＲＯＭ等の記録媒体１８によって提供されるか、ネットワーク４を通じてダウンロードされる。また、記録媒体１８はドライブ装置１４にセットされ、データやプログラムが記録媒体１８からドライブ装置１４を介して補助記憶装置１６にインストールされる。 A character recognition program for operating the character recognition device 1 is provided by a recording medium 18 such as a memory card or a CD-ROM, or downloaded through the network 4. The recording medium 18 is set in the drive device 14, and data and programs are installed from the recording medium 18 to the auxiliary storage device 16 via the drive device 14.

補助記憶装置１６はハードディスク装置や記憶素子により構成され、ＯＳ（ＯｐｅｒａｔｉｎｇＳｙｓｔｅｍ）や文字データ、プログラムを格納すると共に、必要なファイル等を格納する。ＣＰＵ１１は補助記憶装置１６から文字認識プログラムをロードし主記憶装置１５にプログラムを展開して該プログラム実行する。 The auxiliary storage device 16 is configured by a hard disk device or a storage element, and stores an OS (Operating System), character data, a program, and necessary files and the like. The CPU 11 loads a character recognition program from the auxiliary storage device 16, develops the program in the main storage device 15, and executes the program.

図３は文字認識装置１の機能ブロック図の一例を示す。文字認識装置１はフォント判別手段２１、文字サイズ判別手段２２、傾き角判別手段２３及び文字識別手段２４とを有する。フォント判別手段２１はスキャナ装置２により得られた画像データの文字のフォントを判別する。文字サイズ判別手段２２は画像データの文字の大きさを判別する。傾き角判別手段２３は原稿又は画像データの主走査方向に対する傾き角を判別する。文字識別手段２４は、フォントと文字の大きさに基づき文字データ５を参照して文字の文字コードを決定する。 FIG. 3 shows an example of a functional block diagram of the character recognition device 1. The character recognition device 1 includes a font determination unit 21, a character size determination unit 22, an inclination angle determination unit 23, and a character identification unit 24. The font discriminating means 21 discriminates the character font of the image data obtained by the scanner device 2. The character size determining means 22 determines the character size of the image data. The inclination angle determination means 23 determines the inclination angle of the document or image data with respect to the main scanning direction. The character identification unit 24 determines the character code of the character by referring to the character data 5 based on the font and the character size.

また、文字認識プログラムは、ＣＰＵ１１を、フォント判別手段２１、文字サイズ判別手段２２、傾き角判別手段２３及び文字識別手段２４として機能させるプログラムである。 The character recognition program is a program that causes the CPU 11 to function as the font determination unit 21, the character size determination unit 22, the inclination angle determination unit 23, and the character identification unit 24.

文字データ５について説明する。図４は「漢」という文字のビットマップデータを示す。図４では一例として、ＭＳ明朝体のフォントを用い文字の大きさを１０．５ポイントとした。ビットマップデータであるので、画素毎に白又は黒の画素を配置すれば文字の形状が表示できる。文字をこのようにビットマップで表示すると、スキャンされた原稿の文字を同様の画素に区切り、画素毎に黒画素か白画素かを比較することで文字を識別できる。 The character data 5 will be described. FIG. 4 shows bit map data of characters “KAN”. In FIG. 4, as an example, an MS Mincho font is used and the character size is 10.5 points. Since it is bitmap data, the shape of a character can be displayed if white or black pixels are arranged for each pixel. When characters are displayed in this manner as a bitmap, characters can be identified by dividing the scanned original character into similar pixels and comparing each pixel for black or white pixels.

図４では画素数を２５６×２５６としたが、英文で記載された原稿を文字認識するのであれば画素数を粗くしてもよいし、１０２４×１０２４のように高解像で表示してもよい。説明のため図４では左下の頂点を原点に右方向をＸ方向、上方向をＹ方向、逆方向をそれぞれ−Ｘ方向、−Ｙ方向とした。 In FIG. 4, the number of pixels is 256 × 256. However, the number of pixels may be coarsened if a document written in English is recognized, or displayed at a high resolution such as 1024 × 1024. Good. For the sake of explanation, in FIG. 4, the lower left vertex is the origin, the right direction is the X direction, the upper direction is the Y direction, and the reverse direction is the -X direction and -Y direction, respectively.

本実施例の文字データはこのようなビットマップを文字の左方向、右方向、天方向又は地方向から黒画素の現れる数を画素列毎に数値化したものである。例えば「漢」という文字を左からＸ方向に黒画素の現れる数をカウントすると、４番目の画素列に３つの黒画素Ｘ４１、Ｘ４２及びＸ４３が現れる。また、５番目の画素列にはＸ４１、Ｘ４２及びＸ４３に加え、Ｘ５１が現れる。黒画素が現れる位置は文字毎に異なっているため、Ｘ方向の画素列ごとに黒画素が現れる個数を文字データとすれば、当該文字に固有の識別情報となる。すなわち、本実施の形態の文字データは画素列ごとに現れる黒画素の数（０，０，０，３，１，１，…）である。したがって、本実施例の文字データは２５６個（実際には後述のように若干少ない）の黒画素の数となる。 The character data of this embodiment is such a bit map in which the number of black pixels appearing from the left direction, right direction, top direction, or ground direction of a character is digitized for each pixel column. For example, when the number “black” appears from the left in the X direction, the number of black pixels X41, X42, and X43 appears in the fourth pixel row. In addition, X51 appears in the fifth pixel column in addition to X41, X42, and X43. Since the positions at which black pixels appear are different for each character, if the number of black pixels that appear for each pixel column in the X direction is used as character data, it becomes identification information unique to the character. That is, the character data of the present embodiment is the number of black pixels (0, 0, 0, 3, 1, 1,...) Appearing for each pixel column. Therefore, the number of character data in this embodiment is 256 (in practice, slightly smaller as will be described later).

黒画素が現れる数はＹ方向で数えてもよいし、また、Ｘ方向の逆方向、Ｙ方向の逆方向で数えることができる。−Ｙ方向であれば、「漢」の文字では４番目の画素に３つの黒画素−Ｙ４１、−Ｙ４２、−Ｙ４３及び−Ｙ４４が現れる。また、５番目の画素には−Ｙ５１〜−Ｙ５７が現れる。このように、１つの文字に対しＸ方向、Ｙ方向、−Ｘ方向及び−Ｙ方向の４つの文字データを抽出することができる。 The number of black pixels that appear may be counted in the Y direction, or in the reverse direction of the X direction and the reverse direction of the Y direction. In the −Y direction, three black pixels −Y41, −Y42, −Y43, and −Y44 appear in the fourth pixel in the character “KAN”. Further, -Y51 to -Y57 appear in the fifth pixel. Thus, four character data in the X direction, Y direction, -X direction, and -Y direction can be extracted for one character.

このような文字データであれば、どちらの方向から文字がスキャンされても文字全体を読み込む前に文字の認識を開始することができる。また、文字全体を読み取った後は、４方向全ての文字データを使用して文字認識できる。 With such character data, character recognition can be started before reading the entire character, regardless of which direction the character is scanned. In addition, after reading the entire character, the character can be recognized using character data in all four directions.

ところで実際に文字認識する場合、黒画素が現れるまではどこから文字が始まるか分からないので（言い換えれば２５６個の画素列において最初の画素がどこか分からない）、文字データは黒画素が存在する画素から画素列毎に現れる黒画素の数を数値にする。すなわち、「漢」であれば（０，０，０，３，１，１，…）うち（０、０，０）を省略する。 By the way, when actually recognizing a character, it is not known where the character starts until a black pixel appears (in other words, where the first pixel is not found in 256 pixel columns), so the character data is a pixel in which a black pixel exists. To the number of black pixels appearing for each pixel column. That is, if it is “Chinese” (0, 0, 0, 3, 1, 1,...), (0, 0, 0) is omitted.

また、文字のビットマップデータは、アウトラインフォントであっても文字の大きさが異なると若干に異なる形状となる。例えば、１２ポイントの文字と１０．５ポイントの文字は全くの相似形ではない。例えば、１０．５ポイントでは４画素目に１つの黒画素が現れたが、１２ポイントでは４画素目に２つの黒画素が現れるなど、黒画素が現れる位置は文字の大きさによって異なる。そこで、本実施の形態の文字データは文字の大きさ毎に文字データを格納する。格納しておく文字の大きさの分解能は、一般的に使用される大きさであればよく、例えば、８、９、１０、１０、１０.５、１１、１２、１４、１６、１８、２０ポイント程度である。 Further, even if the character bitmap data is an outline font, the character bitmap data has a slightly different shape when the character size is different. For example, a 12-point character and a 10.5-point character are not quite similar. For example, one black pixel appears at the fourth pixel at 10.5 points, but two black pixels appear at the fourth pixel at 12 points. The positions at which the black pixels appear vary depending on the size of the character. Therefore, the character data of this embodiment stores character data for each character size. The resolution of the size of characters to be stored may be any size that is generally used. For example, 8, 9, 10, 10, 10.5, 11, 12, 14, 16, 18, 20 It is about a point.

図５は文字データの一例を示す。図５ではフォントの種類及び大きさに対応づけて４つの方向毎に黒画素の現れる数が記されている。文字データにはその他の大きさの文字データも含まれている。 FIG. 5 shows an example of character data. FIG. 5 shows the number of black pixels that appear in each of the four directions in association with the font type and size. The character data includes character data of other sizes.

なお、図５では白から黒に反転する画素の数をカウントしたが、黒から白に反転する画素をカウントして文字データとしてもよい。 In FIG. 5, the number of pixels that are inverted from white to black is counted, but the pixels that are inverted from black to white may be counted as character data.

また、本実施の形態の文字認識装置は文字の大きさ毎に文字認識を行うため文字の特徴的な部分の寸法を用いて文字データを構成できる。図６は文字の寸法の取り方の一例を示す図である。例えば天地の大きさ左右の幅は文字の寸法の一例であり、これにより文字全体の大きさが定まる。 In addition, since the character recognition apparatus according to the present embodiment performs character recognition for each character size, character data can be constructed using the dimensions of characteristic portions of the characters. FIG. 6 is a diagram showing an example of how to measure the size of characters. For example, the size of the top and bottom of the left and right is an example of the size of the character, and this determines the size of the entire character.

図６に示すように、文字のビットマップデータは黒画素が連続した部分により区切ることができる。「漢」の場合、三水（さんずい）の各画、旁の上部及び旁の下部がそれぞれ連続部である。それぞれの連続部で最も距離が大きくなる画素を２つ抽出し２つの画素間の寸法を文字データとする。図６では、三水の各画の長さＬｅｎｇｔｈ１〜３が最も距離が大きくなる画素間の寸法を示す。同様に、旁の上部ではＬｅｎｇｔｈ４が、旁の下部ではＬｅｎｇｔｈ５が最も距離が大きくなる画素間の寸法を示す。 As shown in FIG. 6, the bitmap data of characters can be divided by a portion where black pixels are continuous. In the case of “Kan”, each picture of Sansui, the upper part of the bowl and the lower part of the bowl are continuous parts. Two pixels having the largest distance in each continuous portion are extracted, and the dimension between the two pixels is used as character data. In FIG. 6, the lengths 1 to 3 of the three water images indicate the dimension between the pixels having the largest distance. Similarly, Length 4 is the dimension between the pixels where the distance is the longest at the upper part of the ridge and Length 5 is the lower part of the cocoon.

また、連続部の間隔を文字データとすることができる。例えば、各連続部の端部の画素同士の間隔である。図６では、三水の１画目と２画目の端部間の寸法をｄｉｓ１、ｄｉｓ３で、２画目と３画目の端部間の寸法をｄｉｓ２で、１画目と３画目の端部間の寸法をｄｉｓ４で、２画目と３画目の端部間の寸法をｄｉｓ５でそれぞれ表した。なお、図６では各連続部の右側同士又は左側同士の端部間の間隔を抽出したが、右側の端部と左側の端部の間隔を抽出してもよい。同様に、三水の各画と旁の上部、下部との間隔を抽出できる。また、各連続部の外接矩形を算出し、外接矩形の対角線を文字データとしてもよい。 Further, the interval between the continuous portions can be character data. For example, the distance between the pixels at the end of each continuous portion. In FIG. 6, the dimensions between the first and second strokes of the three waters are dis1 and dis3, and the dimension between the second and third strokes is dis2 and the first and third strokes. The dimension between the end portions of the second and third strokes was expressed as dis4 and dis5, respectively. In FIG. 6, the interval between the right ends or the left ends of each continuous portion is extracted, but the interval between the right end portion and the left end portion may be extracted. Similarly, it is possible to extract the interval between each drawing of the three waters and the upper and lower parts of the basket. Alternatively, the circumscribed rectangle of each continuous portion may be calculated, and the diagonal line of the circumscribed rectangle may be used as character data.

また、天地方向、左右方向において２つの画素を抽出し２つの画素間の寸法を文字データとすることができる。「漢」のビットマップデータでは最左の画素列に３つの黒画素があり、最右の画素列に２つの黒画素があるが、それぞれから２つの画素を抽出し画素間の寸法を抽出する。図６では最左と最右の最も上の画素同士を抽出しその寸法をＬＲ１とし、また、最左と最右の最も下の画素同士を抽出しその寸法をＬＲ２とした。 Also, two pixels can be extracted in the vertical direction and the horizontal direction, and the dimension between the two pixels can be used as character data. In the “Kan” bitmap data, there are three black pixels in the leftmost pixel column and two black pixels in the rightmost pixel column, and two pixels are extracted from each to extract the dimension between the pixels. . In FIG. 6, the uppermost pixels of the leftmost and rightmost are extracted and the dimension is LR1, and the lowermost pixels of the leftmost and rightmost are extracted and the dimension is LR2.

また、「漢」のビットマップデータでは最上の画素列に４つの黒画素があり、最下の画素列に６つの黒画素があるが、それぞれから１つずつ画素を抽出し画素間の寸法を抽出する。図６では最上と最下の画素列の最も左の画素同士を抽出しその距離をＴＢ１とし、最上と最下の画素列の最も右の画素同士を抽出しその距離をＴＢ２とした。 Also, in the “Kan” bitmap data, there are four black pixels in the uppermost pixel column and six black pixels in the lowermost pixel column. Extract. In FIG. 6, the leftmost pixels of the top and bottom pixel columns are extracted and the distance is TB1, and the rightmost pixels of the top and bottom pixel columns are extracted and the distance is TB2.

図７は文字の特徴的な部分の寸法を用いた文字データの一例を示す。フォントの種類及び大きさについては図５と同様である。そして、天地及び左右、連続部１〜ｎ（「漢」の場合にはｎ＝５）、天地方向の画素間及び左右方向の画素間のそれぞれの寸法が格納されている。 FIG. 7 shows an example of character data using the size of the characteristic part of the character. The font type and size are the same as in FIG. In addition, the dimensions of the top and bottom, the left and right, the continuous portions 1 to n (n = 5 in the case of “Han”), the dimensions between the pixels in the top and bottom directions and between the pixels in the left and right directions are stored.

本実施の形態では寸法の単位は画素数でなく絶対的な距離を表す単位（例えば、ｍｍ、ｃｍ等）を用いることができる。本実施の形態の文字データは文字の大きさ毎に格納されているので、文字の特徴的な部位の寸法を絶対値で評価することで精度のよい文字認識を可能とする。 In the present embodiment, the unit of dimension can be a unit representing an absolute distance (for example, mm, cm, etc.) instead of the number of pixels. Since the character data of the present embodiment is stored for each character size, accurate character recognition is enabled by evaluating the dimension of the characteristic part of the character with an absolute value.

また、図６又は７のような寸法に加え、文字の特徴を表すパラメータとして角度を用いてもよい。図６では寸法を抽出する際に２画素間を結ぶ直線が得られるがその直線の角度を求め、それぞれの直線がなす角度を用いる。例えば、ｄｉｓ１とｄｉｓ３、ｄｉｓ１とＬｅｎｇｔｈ５のなす角である。このように特徴的な部分の寸法に加えそのなす角を用いることで更に精度のよい文字認識が可能となる。 Further, in addition to the dimensions as shown in FIG. 6 or 7, an angle may be used as a parameter representing character characteristics. In FIG. 6, a straight line connecting two pixels is obtained when extracting the dimensions. The angle of the straight line is obtained, and the angle formed by each straight line is used. For example, the angle formed by dis1 and dis3 and dis1 and Length5. In this way, character recognition can be performed with higher accuracy by using the corners formed in addition to the dimensions of the characteristic portions.

続いて、スキャンされた原稿に印刷されていた文字のフォントの判別について説明する。なお、フォントを判別する際には既に文字の大きさが判別しているものとする。 Next, determination of the font of characters printed on a scanned document will be described. It is assumed that the character size has already been determined when determining the font.

図８は「合」という文字のフォント毎のビットマップデータを示す。図８では一例として、ＭＳ明朝、ＭＳゴシック、ＨＧ楷書のフォント示した。図８に示すように、ＭＳ明朝、ＭＳゴシック、ＨＧ楷書のフォントでは、線の太さ、黒画素の割合、形状が大きく異なる。したがって、フォント判別手段２１は線の太さ等に基づきフォントを判別できる。 FIG. 8 shows bitmap data for each font of the characters “go”. In FIG. 8, as an example, fonts of MS Mincho, MS Gothic, and HG font are shown. As shown in FIG. 8, the line thickness, the ratio of black pixels, and the shape of the MS Mincho, MS Gothic, and HG fonts differ greatly. Therefore, the font discrimination means 21 can discriminate the font based on the thickness of the line.

線の太さ及び黒画素の割合の場合、ＭＳ明朝＜ＨＧ楷書＜ＭＳゴシックとなる。図８では、線の太さとしてそれぞれのフォントで４カ所の太さを示しているが、スキャンした文字の線の太さをいくつかの部位で検出すれば、その平均に基づきフォント種別を判別できる。また、いくつかの文字について線の太さを算出しその平均に基づき判別してもよい。 In the case of the thickness of the line and the ratio of the black pixels, MS Mincho <HG book <MS Gothic. In FIG. 8, the thicknesses of the four lines are shown for each font as the thickness of the line. If the thickness of the line of the scanned character is detected in several parts, the font type is determined based on the average. it can. Alternatively, the line thickness may be calculated for some characters and determined based on the average.

また、黒画素の割合を用いる場合、スキャンした文字の外接矩形を検出し、外接矩形の面積に対しする黒画素の割合に基づきフォントを判別できる。黒画素の割合は文字によって異なるので、フォント判別手段２１は例えば１行分の文字又は１ページ分の文字について黒画素の割合を求めその平均に応じてフォントを判別する。 In addition, when the ratio of black pixels is used, the circumscribed rectangle of the scanned character is detected, and the font can be determined based on the ratio of the black pixel with respect to the area of the circumscribed rectangle. Since the proportion of black pixels varies depending on the character, the font discrimination means 21 obtains the proportion of black pixels for, for example, one line of characters or one page of characters and discriminates the font according to the average.

また、フォント判別手段２１は、例えば１画の線の太さの変化に基づきフォントを判別する。ＭＳゴシックでは１画の線の太さはほとんど変化せず、ＨＧ楷書では大きく変化する。したがって、例えば、ある１画の始点付近（例えばＡ１、Ｂ１、Ｃ１）と終点付近（Ａ２、Ｂ２、Ｃ２）の線の太さの変化率を算出することで、フォントを判別できる。 Further, the font discrimination means 21 discriminates the font based on, for example, a change in the thickness of one stroke line. In MS Gothic, the thickness of one stroke line hardly changes, and in HG font, it changes greatly. Therefore, for example, the font can be determined by calculating the change rate of the thickness of the line near the start point (for example, A1, B1, C1) and the end point (A2, B2, C2) of a certain stroke.

文字データ５は、文字の大きさ及びフォントに対応づけて文字の太さ、黒画素の割合、線の太さの変化率を示す情報を有しているので、文字の大きさが判別すれば容易にフォントを判別できる。 The character data 5 has information indicating the thickness of the character, the ratio of black pixels, and the rate of change in the thickness of the line in association with the size of the character and the font. Easily distinguish fonts.

以上の構成に基づき文字認識装置１が文字を認識する処理について図９のフローチャート図に基づき説明する。スキャナ装置２に原稿がセットされ順にスキャンされ、画像データが逐次、文字認識装置１に送信される。 A process in which the character recognition device 1 recognizes a character based on the above configuration will be described with reference to the flowchart of FIG. A document is set on the scanner device 2 and scanned in order, and image data is sequentially transmitted to the character recognition device 1.

まず、文字の大きさやフォントを判別するため原稿の１行目がスキャンされる（Ｓ１１）。文字のない行間が検知され１行目のスキャンが終了したことが検出されたら、まず、文字サイズ判別手段２２は文字の大きさを判別する（Ｓ１２）。文字の大きさはどのように判別してもよいが、例えば、各文字の外接矩形を求め外接矩形の大きさに基づき判断する。同じ大きさの文字であっても文字毎に外接矩形の大きさが異なるので、いくつかの文字の平均を用いて判別する。原稿サイズは既知であるので原稿サイズに対する外接矩形の大きさにより判別してもよいし、１つの外接矩形を撮像素子が検知する画素数で判別してもよい。 First, the first line of the document is scanned to determine the character size and font (S11). When it is detected that the line between which there is no character is detected and the scanning of the first line is completed, first, the character size determining means 22 determines the size of the character (S12). The character size may be determined in any way. For example, a circumscribed rectangle of each character is obtained and determined based on the size of the circumscribed rectangle. Even if the characters are the same size, the size of the circumscribed rectangle is different for each character, so the determination is made using the average of several characters. Since the document size is known, it may be determined by the size of the circumscribed rectangle relative to the document size, or one circumscribed rectangle may be determined by the number of pixels detected by the image sensor.

ついで、フォント判別手段２１はフォントの種類を判別する（Ｓ１３）。上述したように、各文字の大きさが分かれば、文字データ５が有する線の太さ等の情報に基づきフォントを判別できる。 Next, the font discrimination means 21 discriminates the font type (S13). As described above, if the size of each character is known, the font can be determined based on information such as the thickness of the line of the character data 5.

ついで、文字認識装置１は原稿の天地、すなわち文字の向きを判別する（Ｓ１４）。スキャナ装置２にセットされた原稿のどの向きに文字が印刷されているか分からないため、文字認識装置１は文字の形状に基づき文字の天地を判別する。例えば、各文字の外接矩形の縦横比、直線部の方向、払いの方向等に基づき文字の天地を判別する。なお、例えば、標準パターンを用いた周知のパターンマッチングにより文字認識を行い、文字の認識が可能な方向を検出し天地を判別してもよい。 Next, the character recognition device 1 determines the top of the document, that is, the direction of the characters (S14). Since it is not known in which direction the character is printed on the document set on the scanner device 2, the character recognition device 1 determines the top of the character based on the shape of the character. For example, the top and bottom of the character is determined based on the aspect ratio of the circumscribed rectangle of each character, the direction of the straight line portion, the direction of payment, and the like. Note that, for example, character recognition may be performed by well-known pattern matching using a standard pattern, and the direction in which the character can be recognized may be detected to determine the top and bottom.

ついで、文字識別手段２４は文字データ５を用いて文字認識を行う（Ｓ１５）。文字識別手段２４は判別した文字の大きさ及びフォントに基づき文字データを参照し文字を認識する。既に１行目は全体がスキャンされているので、文字認識手段２４は図５の４つの方向のいずれの文字データを用いてもよい。 Next, the character identification means 24 performs character recognition using the character data 5 (S15). The character identification means 24 recognizes the character by referring to the character data based on the determined character size and font. Since the entire first line has already been scanned, the character recognition means 24 may use any of the character data in the four directions in FIG.

ついで、文字認識装置１は全ての行の文字認識が終了したか否か判定し（Ｓ１６）、終了していなければ（Ｓ１６のＮｏ）次の行をスキャンし（Ｓ１７）、文字認識を行う（Ｓ１８）。 Next, the character recognition device 1 determines whether or not the character recognition of all lines has been completed (S16), and if not completed (No in S16), scans the next line (S17) and performs character recognition (S17). S18).

既に１行目の文字認識において文字の大きさ及びフォントを判別しているので、２行目以降の行では行全体を読み込む前に文字認識を開始できる。すなわち文字の天地方向は既知であるので、天地方向に応じて図５のＸ方向〜−Ｙ方向のいずれかの文字データを抽出し、主走査方向の１ライン毎に文字認識の候補を絞ることができる。例えば、黒画素が最初に現れる１ラインに３つの黒画素がスキャンにより検出された場合、文字データの最初に３つの黒画素を有する文字の候補を抽出し、ついで５つの黒画素がスキャンにより検出された場合、文字データの２番目の画素列に５つの黒画素を有する文字に候補を絞っていく。したがって、本実施例の文字認識は１つの文字全体を読み込まなくても文字の認識が可能であり、文字認識処理を高速化できる。全ての行の文字認識が終了したら図９のフローチャート図の処理は終了する。 Since the character size and font have already been determined in character recognition on the first line, character recognition can be started before reading the entire line on the second and subsequent lines. That is, since the vertical direction of the character is known, character data in any of the X direction to -Y direction in FIG. 5 is extracted according to the vertical direction, and the character recognition candidates are narrowed down for each line in the main scanning direction. Can do. For example, when three black pixels are detected by scanning in one line where black pixels appear first, character candidates having three black pixels are extracted at the beginning of character data, and then five black pixels are detected by scanning. In such a case, candidates are narrowed down to characters having five black pixels in the second pixel column of the character data. Therefore, the character recognition according to the present embodiment can recognize a character without reading the entire character, and can speed up the character recognition process. When the character recognition for all the lines is completed, the processing in the flowchart of FIG. 9 is completed.

なお、各行の全体がスキャンされてから文字認識を行う場合、図５のＸ方向〜−Ｙ方向のいずれの方向の文字データを用いて文字認識してもよく、また、これらの複数を組み合わせてもよい。 Note that when character recognition is performed after the entire line is scanned, character recognition may be performed using character data in any direction from the X direction to the -Y direction in FIG. Also good.

また、図９では図５の文字データを用いて文字認識したが、図７のような文字の特徴的な部分の寸法を用いて文字認識してもよい。この場合、各行の全体をスキャンした後に寸法を適用して文字認識することが好適であるが、スキャンされた範囲で抽出可能な寸法に基づき徐々に文字の候補を絞っていくことも可能である。 In FIG. 9, character recognition is performed using the character data of FIG. 5, but character recognition may be performed using the size of the characteristic part of the character as illustrated in FIG. 7. In this case, it is preferable to recognize the characters by applying the dimensions after scanning each whole line, but it is also possible to gradually narrow down the character candidates based on the dimensions that can be extracted in the scanned range. .

本実施例によれば、文字データをフォント及び大きさに応じて予め格納しておき、十分な分解能で画素毎にスキャンされた原稿の文字と比較するため、きわめて高精度に文字を認識できる。予め格納する文字データは使用頻度の高いフォント（例えば、ＭＳ明朝、ＭＳゴシック、ＨＧ楷書等）及び使用頻度の高い大きさ（１０.５ポイント、１２ポイント等）であればよいので文字データの容量が大きくなりすぎることもない。また、１行目の文字認識が終了した後は、文字全体を読み込まなくても文字認識することが可能であるため、認識速度を向上できる。 According to the present embodiment, character data is stored in advance according to the font and size, and compared with characters of a document scanned for each pixel with sufficient resolution, so that characters can be recognized with extremely high accuracy. The character data stored in advance may be a font that is frequently used (for example, MS Mincho, MS Gothic, HG font, etc.) and a frequently used size (10.5 points, 12 points, etc.). The capacity does not become too large. In addition, after the character recognition on the first line is completed, the character recognition can be performed without reading the entire character, so that the recognition speed can be improved.

本実施例では傾いている場合にも実施例１と同様に文字認識が可能な文字認識装置について説明する。本実施例では文字データの構成が実施例１と異なる。なお、システム構成図や画像認識装置１の機能ブロックについて実施例１と同様である。 In the present embodiment, a character recognition device capable of recognizing characters in the same manner as in the first embodiment even when tilted will be described. In this embodiment, the structure of character data is different from that of the first embodiment. The system configuration diagram and the functional blocks of the image recognition apparatus 1 are the same as those in the first embodiment.

図４に示したように文字のビットマップデータは、２５６×２５６の各画素に黒又は白が配置されたものである（以下、画素文字データという）。したがって、画素文字データをそのまま文字認識のための文字データとすることで更に認識率を向上できる。具体的には、ビットマップデータの原点を適当に定め２５６×２５６の各画素に１（黒画素）又は０（白画素）のビットを格納する。本実施例の画像認識装置１は、フォントの種類かつ文字の大きさ毎に画素文字データを有する。なお、文字認識に使用する画素文字データは解凍して使用することし、それ以外では画素文字データは圧縮しておけばファイル容量を低減できる。 As shown in FIG. 4, the bitmap data of characters is data in which black or white is arranged in each 256 × 256 pixel (hereinafter referred to as pixel character data). Therefore, the recognition rate can be further improved by using the pixel character data as it is for character recognition. Specifically, the origin of the bitmap data is appropriately determined, and 1 (black pixel) or 0 (white pixel) bit is stored in each 256 × 256 pixel. The image recognition apparatus 1 of this embodiment has pixel character data for each font type and character size. Note that if the pixel character data used for character recognition is decompressed and used, and the pixel character data is compressed otherwise, the file capacity can be reduced.

図１０（ａ）に示すように原稿が傾いてスキャンされた場合、１次元撮像素子の主走査方向に対し文字が傾いて読み込まれる。図１０（ｂ）は原稿が傾いて読み込まれた場合の画像データの一例を示す。図１０（ｂ）のように文字が傾いてしまうと、黒画素が現れる画素の位置がずれてしまう。そこで、本実施例では、原稿の傾き角を検出し、傾き角に応じて文字データの画素文字データを傾け、文字を認識する。 As shown in FIG. 10A, when the document is scanned with an inclination, characters are read with an inclination with respect to the main scanning direction of the one-dimensional image sensor. FIG. 10B shows an example of image data when the original is read with an inclination. If the character is tilted as shown in FIG. 10B, the position of the pixel where the black pixel appears is shifted. Therefore, in this embodiment, the inclination angle of the original is detected, and the pixel character data of the character data is inclined according to the inclination angle to recognize the character.

図１１は文字認識装置１が傾いた文字を認識する処理を示すフローチャート図である。なお、図１１において図９と同一ステップには同一の符号を付した。 FIG. 11 is a flowchart showing a process in which the character recognition device 1 recognizes a tilted character. In FIG. 11, the same steps as those in FIG. 9 are denoted by the same reference numerals.

まず、画像認識装置１は１行目をスキャンして（Ｓ１１）、傾き角判別手段２３が１行分の画像データに基づき傾き角を検出する（Ｓ２０）。傾き角の検出はどのように行ってもよいが、例えば、１行の文字の地（最低部）を接続するベースラインを想定し、該ベースラインと主走査方向のなす角θを算出する。なお、原稿の挿入角度が検出されていれば、挿入角度と主走査方向のなす角により傾き角を求めることが可能である。 First, the image recognition apparatus 1 scans the first line (S11), and the inclination angle determination means 23 detects the inclination angle based on the image data for one line (S20). The inclination angle may be detected in any way. For example, assuming a base line connecting the ground (minimum part) of one line of characters, an angle θ between the base line and the main scanning direction is calculated. If the document insertion angle is detected, the tilt angle can be obtained from the angle formed by the insertion angle and the main scanning direction.

ついで、文字認識装置１は原稿をスキャンして得られた画像データを傾き角θ補正して、実施例１と同様に文字の大きさ、フォント及び天地を判別し（Ｓ１２〜１４）、図５又は図７の文字データに基づき文字認識を行う。なお、１行目の文字を画素文字データに基づき画素毎に黒画素か白画素か判定し文字認識してもよい。 Next, the character recognition device 1 corrects the inclination angle θ of the image data obtained by scanning the document, and determines the character size, font, and top and bottom as in the first embodiment (S12 to 14), and FIG. Alternatively, character recognition is performed based on the character data of FIG. The character in the first line may be recognized by determining whether it is a black pixel or a white pixel for each pixel based on the pixel character data.

ついで、文字識別手段認２４は画素文字データを傾き角θ傾かせたものに補正する（Ｓ２１）。例えば、図１２に示した実線のような画素文字データであれば、角θ傾かせて点線のような仮想の枠を設定する。画素文字データにこのような処理を行うことで、原稿が傾いていても仮想の枠のいずれかが主走査方向と一致するため、画素文字データを用いて文字認識できる。 Next, the character identification means recognition 24 corrects the pixel character data to have the inclination angle θ inclined (S21). For example, in the case of pixel character data such as a solid line shown in FIG. 12, a virtual frame such as a dotted line is set by inclining the angle θ. By performing such processing on the pixel character data, even if the document is tilted, one of the virtual frames coincides with the main scanning direction, so that character recognition can be performed using the pixel character data.

例えば、仮想の枠のうち辺Ａが主走査方向に一致すれば、辺Ａから１列ごとに黒画素が現れる数をカウントすることで原稿が傾いていても実施例１と同様に適用可能な文字データとすることができる。 For example, if side A of the virtual frame coincides with the main scanning direction, the number of black pixels appearing for each column from side A can be counted to apply the same as in the first embodiment even if the document is inclined. It can be character data.

以降は、図９のフローチャート図と同様である。すなわち、２行目以降の行では行全体を読み込む前に文字認識を開始できる。文字の天地方向、傾き角θは既知であるので、傾き角θ補正された画素文字データに基づき、主走査方向の１ライン毎に文字認識の候補を絞ることができる。 The subsequent steps are the same as those in the flowchart in FIG. That is, in the second and subsequent lines, character recognition can be started before reading the entire line. Since the vertical direction of the character and the inclination angle θ are known, the character recognition candidates can be narrowed down for each line in the main scanning direction based on the pixel character data corrected with the inclination angle θ.

したがって、本実施例の文字認識は原稿が傾いていても１つの文字全体を読み込まずに文字の認識が可能であり、文字認識処理を高速化できる。全ての行の文字認識が終了したら図１１のフローチャート図の処理は終了する。 Therefore, the character recognition according to the present embodiment can recognize a character without reading the entire character even if the document is tilted, and can speed up the character recognition process. When the character recognition of all the lines is completed, the process of the flowchart in FIG.

本実施例によれば、実施例１の効果に加え、原稿が傾いていても傾き角に応じて文字データを用意し、高速な文字認識が可能である。なお、行全体を読み込んでから文字認識するのであれば当該行の画像データの傾きを補正できるので、実施例１の文字データを用いても、また、傾き補正しない画素文字データを用いても文字認識することができる。 According to the present embodiment, in addition to the effects of the first embodiment, even when the document is inclined, character data is prepared according to the inclination angle, and high-speed character recognition is possible. Note that if character recognition is performed after the entire line is read, the inclination of the image data of the line can be corrected. Therefore, even if the character data of the first embodiment is used or the pixel character data without inclination correction is used. Can be recognized.

画素文字データを用いた場合、黒画素の現れる数ではなく、画素位置毎に黒又は白を判定することができるのでより精度よく文字認識できる。 When pixel character data is used, it is possible to determine black or white for each pixel position rather than the number of black pixels appearing, so that character recognition can be performed more accurately.

実施例１及び２では文字認識装置が予め文字データ又は画素文字データ（以下、単に文字データという）を有していることとしたが、認識するフォント及び文字の大きさに応じて文字データを生成してもよい。
図１３は文字データを作成する場合のシステム構成図を示す。なお、図１３のシステム構成図はコンピュータにより構成される文字認識装置１により実現される。 In the first and second embodiments, it is assumed that the character recognition device has character data or pixel character data (hereinafter simply referred to as character data), but generates character data according to the recognized font and character size. May be.
FIG. 13 shows a system configuration diagram for creating character data. The system configuration diagram of FIG. 13 is realized by a character recognition device 1 configured by a computer.

通常、文字認識装置１のパーソナルコンピュータにはＯＳが提供する複数のフォントデータ３１が格納されている。フォントデータ３１には、文字コードに対応つけてアウトラインフォントデータやビットマップデータが格納されている。 Usually, the personal computer of the character recognition apparatus 1 stores a plurality of font data 31 provided by the OS. The font data 31 stores outline font data and bitmap data in association with character codes.

ラスタライザ３２は、文字コードに対応する文字を色付きの小さな点の集まりとして表現するものである。ラスタライザ３２は、所定の文字コードの文字のフォント及び文字の大きさがアプリケーションソフトウェアから指定されると、文字コード、フォント及び大きさに応じたビットマップデータ３４を生成する。 The rasterizer 32 expresses a character corresponding to a character code as a collection of small colored points. The rasterizer 32 generates bitmap data 34 corresponding to the character code, font, and size when the font and character size of the character of the predetermined character code are designated by the application software.

ビットマップデータ３４を図４に示すように所定の画素（例えば２５６×２５６）に区切ると各画素に黒又は白の画素が配置されたものとなる。すなわち文字データ作成手段３６はビットマップデータ３４を所定の画素に区切り、各画素位置毎に黒か白かを判別することで、画素文字データを生成できる。また、文字データ作成手段３６は画素に区切られたビットマップデータ３４又は画素文字データに基づき黒画素が現れる数を示す文字データを生成できる。 When the bitmap data 34 is divided into predetermined pixels (for example, 256 × 256) as shown in FIG. 4, black or white pixels are arranged in each pixel. That is, the character data creation means 36 can generate pixel character data by dividing the bitmap data 34 into predetermined pixels and determining whether each pixel position is black or white. Further, the character data creating means 36 can generate character data indicating the number of black pixels appearing based on the bitmap data 34 divided into pixels or the pixel character data.

図１４は文字データ作成手段により作成される文字データにより文字認識する場合のフローチャート図を示す。なお、図１４において図９と同一ステップには同一の符号を付した。 FIG. 14 is a flowchart for character recognition based on character data created by the character data creation means. In FIG. 14, the same steps as those in FIG. 9 are denoted by the same reference numerals.

まず、スキャンされた１行分の画像データに基づき文字の大きさ天地を判別し（Ｓ１１、１２、Ｓ１４）、文字認識を行う（Ｓ３０）。ステップＳ３０の文字認識は、標準パターンを用いた周知のパターンマッチングによるものである。これにより１行目について所定の精度の認識率で文字が認識される。 First, the character size is determined based on the scanned image data of one line (S11, 12, S14), and character recognition is performed (S30). The character recognition in step S30 is based on known pattern matching using a standard pattern. As a result, characters are recognized with a recognition rate with a predetermined accuracy for the first line.

ついで、フォント判別手段２１は１行目の文字のフォントを判別する（Ｓ３１）。すでに１行目の文字が文字認識されているので、認識された文字の文字コードに基づきラスタライザ３２が複数のフォントで文字のビットマップデータを生成する。そして、各フォントのビットマップデータにより再度１行目の文字に対しパターンマッチングを行う。フォント判別手段２１は複数のフォントのうちマッチングのよいフォントが原稿に用いられているフォントと判別する。 Next, the font discrimination means 21 discriminates the font of the character on the first line (S31). Since the character in the first line has already been recognized, the rasterizer 32 generates character bitmap data in a plurality of fonts based on the character code of the recognized character. Then, pattern matching is performed again on the characters in the first line by using the bitmap data of each font. The font discriminating means 21 discriminates a font having a good matching among a plurality of fonts as a font used in the document.

フォントが判別したので、文字データ作成手段は、当該フォント及び大きさの文字データ及び画素文字データを作成する（Ｓ３２）。これにより、実施例１又は２と同様に文字データが得られたこととなる。 Since the font is determined, the character data creating means creates character data and pixel character data of the font and size (S32). As a result, character data is obtained in the same manner as in the first or second embodiment.

以降は、実施例１と同様であり、全ての行が終了するまで（Ｓ１６）、作成された文字データを用いて文字を認識する（Ｓ１７、Ｓ１８）。文字データ又は画素文字データは、画素毎に黒画素か白画素かを示すものであるため、きわめて精度よく文字を認識できる。 The subsequent steps are the same as in the first embodiment, and characters are recognized using the created character data until all lines are completed (S16) (S17, S18). Since the character data or pixel character data indicates whether each pixel is a black pixel or a white pixel, the character can be recognized with extremely high accuracy.

本実施例によれば、使用される頻度の少ないフォントで印刷された原稿であっても、そのフォントを認識して文字データを作成できるので、多様なフォントに対応してきわめて精度よく文字を認識できる。 According to the present embodiment, even a manuscript printed with a font that is used infrequently can be used to create character data by recognizing the font, so that characters can be recognized with high accuracy corresponding to various fonts. it can.

文字認識装置を含む文字認識システムの全体構成図である。1 is an overall configuration diagram of a character recognition system including a character recognition device. 文字認識装置のハードウェア構成図の一例である。It is an example of the hardware block diagram of a character recognition apparatus. 文字認識装置の機能ブロック図の一例Example of functional block diagram of character recognition device 「漢」という文字のビットマップデータである。This is bitmap data of the characters “Kan”. 文字データの一例である。It is an example of character data. 文字の寸法の取り方の一例を示す図である。It is a figure which shows an example of how to take the dimension of a character. 文字の特徴的な部分の寸法を用いた文字データの一例である。It is an example of the character data using the dimension of the characteristic part of a character. 「合」という文字のフォント毎のビットマップデータである。This is bitmap data for each font of the character “go”. 文字認識装置が文字を認識する処理のフローチャート図である。It is a flowchart figure of the process which a character recognition apparatus recognizes a character. 傾いてスキャンされた原稿の一例を示す図である。FIG. 6 is a diagram illustrating an example of a document scanned at an angle. 傾いた文字を認識する処理を示すフローチャート図である。It is a flowchart figure which shows the process which recognizes the inclined character. 傾き角θ傾かされた画素文字データの一例である。It is an example of pixel character data inclined at an inclination angle θ. 文字データ作成システムのシステム構成図である。It is a system configuration figure of a character data creation system. 文字データ作成手段により作成される文字データにより文字認識する場合のフローチャート図である。It is a flowchart figure in the case of character recognition by the character data created by the character data creation means.

Explanation of symbols

１文字認識装置
２スキャナ装置
３プリンタ
４ネットワーク
５文字データ 1 Character recognition device 2 Scanner device 3 Printer 4 Network 5 Character data

Claims

In a character recognition device that recognizes characters from image data of an optically scanned document,
Font discrimination means for discriminating the font of the character;
Character size determining means for determining the size of the character;
Character data for recognizing the character stored in association with the font of the character and the size of the character;
Character identifying means for determining a character code of the character with reference to the character data based on the font determined by the font determining means and the size of the character determined by the character size determining means;
A character recognition device comprising:

The character data is obtained by dividing character bitmap data into a predetermined number of pixels.
The number of black pixels appearing from the left direction, right direction, top direction or ground direction of the character is recorded for each pixel column from the direction,
The character recognition device according to claim 1.

The character data is obtained by dividing character bitmap data into a predetermined number of pixels.
It is pixel character data having information on white pixels or black pixels for each pixel.
The character recognition apparatus according to claim 1.

The character data is obtained by dividing character bitmap data into a predetermined number of pixels.
Having a dimension between a plurality of two pixels,
The character recognition device according to claim 1, wherein the character recognition device is a character recognition device.

An inclination angle determining means for determining an inclination angle of the original;
The character identification means recognizes the character by inclining the pixel character data according to the inclination angle determined by the inclination determination means;
The character recognition device according to claim 3.

Character data creation means for creating the character data based on bitmap data of characters generated from font data,
The character data creating means creates the character data based on the bitmap data of the character determined by the font determining means and having the size determined by the character size determining means,
The character recognition means recognizes the character based on the character data created by the character data creation means;
The character recognition device according to claim 1, wherein the character recognition device is a character recognition device.

In character data for recognizing characters from image data of an optically scanned document,
In association with the font of the character and the size of the character,
When the bitmap data of a character is divided into a predetermined number of pixels, the number of black pixels appearing from the left direction, right direction, top direction, or ground direction of the character is recorded for each pixel column from the direction. ,
Character data characterized by that.

In character data for recognizing characters from optically scanned original image data,
In association with the font of the character and the size of the character,
When character bitmap data is divided into a predetermined number of pixels,
It is pixel character data having information on white pixels or black pixels for each pixel.
Character data characterized by that.

In character data for recognizing characters from image data of an optically scanned document,
In association with the font of the character and the size of the character,
When character bitmap data is divided into a predetermined number of pixels,
Having a dimension between a plurality of two pixels,
Character data characterized by that.

In a character recognition method for recognizing characters from image data of an optically scanned document,
A font determining step for determining a font of the character;
A character size determining step for determining the size of the character;
Based on the font determined by the font determining step and the character size determined by the character size determining step, refer to the character data stored in association with the font of the character and the character size. A character identification step for determining a character code of the character;
A character recognition method characterized by comprising:

The character data is obtained by dividing character bitmap data into a predetermined number of pixels.
Recording the number of black pixels appearing from the left direction, right direction, top direction or ground direction for each pixel column from the direction,
Pixel character data having white or black pixel information for each pixel;
Or a dimension between a plurality of two pixels,
The character recognition method according to claim 10, wherein: